Query lcl|Aclame:protein:vir:105563|NCBI_annot:hypothetical protein|genbank:acc:YP_164316;genbank:gi:56692963;genbank:GeneID:3197174 Match_columns 396 No_of_seqs 76 out of 88 Neff 6.0 Searched_HMMs 1612 Date Sun Dec 1 02:31:42 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_3 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_3_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:105563 Length: 396 100.0 6E-144 4E-147 805.9 31.0 396 1-396 1-396 (396) 2 protein:vir:5120 Length: 615 # 100.0 2.3E-96 1.4E-99 544.9 27.0 358 1-396 30-404 (615) 3 protein:vir:2792 Length: 567 # 100.0 2.7E-94 1.7E-97 533.5 28.7 363 1-396 27-424 (567) 4 protein:vir:10145 Length: 567 100.0 2.7E-94 1.7E-97 533.5 28.7 363 1-396 27-424 (567) 5 protein:vir:9979 Length: 567 # 100.0 2.7E-94 1.7E-97 533.5 28.7 363 1-396 27-424 (567) 6 protein:vir:3306 Length: 567 # 100.0 2.7E-94 1.7E-97 533.5 28.7 363 1-396 27-424 (567) 7 protein:vir:827 Length: 567 # 100.0 4.9E-94 3E-97 532.2 28.5 363 1-396 27-424 (567) 8 protein:vir:104388 Length: 566 100.0 7.7E-94 4.8E-97 531.1 28.7 363 1-396 26-423 (566) 9 protein:vir:93631 Length: 580 100.0 4.4E-92 2.7E-95 521.4 29.1 360 1-396 1-377 (580) 10 protein:vir:107423 Length: 681 98.8 3E-08 1.9E-11 61.8 24.1 371 1-396 1-442 (681) 11 protein:vir:98487 Length: 681 98.8 3E-08 1.9E-11 61.8 24.1 371 1-396 1-442 (681) 12 protein:vir:107802 Length: 681 98.8 3E-08 1.9E-11 61.8 24.1 371 1-396 1-442 (681) 13 protein:vir:99677 Length: 794 98.7 3.7E-08 2.3E-11 61.3 23.1 376 1-396 1-526 (794) 14 protein:vir:8837 Length: 513 # 98.7 2.9E-08 1.8E-11 61.9 20.1 270 1-396 3-317 (513) 15 protein:vir:80253 Length: 777 98.6 2.8E-07 1.7E-10 56.5 24.6 375 1-396 1-522 (777) 16 protein:vir:2203 Length: 794 # 98.6 2.9E-07 1.8E-10 56.4 23.2 375 1-396 1-527 (794) 17 protein:vir:103790 Length: 768 98.5 3.3E-07 2E-10 56.2 23.8 370 1-396 1-516 (768) 18 protein:vir:10452 Length: 794 98.5 3.8E-07 2.4E-10 55.8 26.0 375 1-396 1-527 (794) 19 protein:vir:94713 Length: 785 98.5 3.9E-07 2.4E-10 55.7 22.6 376 1-396 1-523 (785) 20 protein:vir:94583 Length: 792 98.3 1.3E-06 8.1E-10 52.9 23.4 375 1-396 1-524 (792) 21 protein:vir:97014 Length: 800 98.3 1.7E-06 1E-09 52.2 22.0 366 1-396 1-535 (800) 22 protein:vir:105647 Length: 800 98.2 3.1E-06 1.9E-09 50.8 24.2 375 1-396 1-535 (800) 23 protein:vir:1543 Length: 801 # 98.1 4E-06 2.5E-09 50.2 23.0 375 1-396 1-534 (801) 24 protein:vir:3366 Length: 801 # 98.1 4.2E-06 2.6E-09 50.1 22.8 375 1-396 1-534 (801) 25 protein:vir:6326 Length: 826 # 98.1 5.6E-06 3.5E-09 49.4 25.4 373 1-396 1-569 (826) 26 protein:vir:7021 Length: 803 # 98.0 7.5E-06 4.7E-09 48.7 23.2 374 1-396 1-527 (803) 27 protein:vir:63741 Length: 468 97.9 1.7E-07 1E-10 57.7 9.4 254 1-291 150-468 (468) 28 protein:vir:80491 Length: 467 97.9 1.8E-07 1.1E-10 57.6 9.4 254 1-291 149-467 (467) 29 protein:vir:78957 Length: 826 97.9 1E-05 6.3E-09 48.0 24.0 377 1-396 1-569 (826) 30 protein:vir:8887 Length: 808 # 97.8 1.5E-05 9.6E-09 47.0 25.9 376 1-396 1-543 (808) 31 protein:vir:96666 Length: 462 97.7 4.6E-07 2.8E-10 55.3 8.7 247 1-289 150-462 (462) 32 protein:vir:95324 Length: 823 97.7 2.9E-05 1.8E-08 45.5 24.0 368 1-396 1-489 (823) 33 protein:vir:7329 Length: 825 # 97.5 4.7E-05 2.9E-08 44.3 23.0 365 1-396 1-491 (825) 34 protein:vir:103341 Length: 806 97.4 7.6E-05 4.7E-08 43.2 20.6 374 1-396 1-537 (806) 35 protein:vir:108312 Length: 458 96.9 0.00029 1.8E-07 40.0 20.8 270 1-396 1-291 (458) 36 protein:vir:2625 Length: 715 # 96.2 0.00083 5.1E-07 37.5 20.4 354 1-396 1-470 (715) 37 protein:vir:95603 Length: 463 96.1 5.6E-05 3.5E-08 43.9 7.1 255 1-291 150-463 (463) 38 protein:vir:99311 Length: 463 96.1 5.6E-05 3.5E-08 43.9 7.1 255 1-291 150-463 (463) 39 protein:vir:102644 Length: 594 96.1 0.00098 6.1E-07 37.1 20.6 303 1-396 1-345 (594) 40 protein:vir:80835 Length: 464 95.3 0.00019 1.2E-07 40.9 7.0 264 1-293 146-464 (464) 41 protein:vir:105525 Length: 472 94.8 0.0036 2.2E-06 34.0 21.1 279 1-396 1-309 (472) 42 protein:vir:3133 Length: 911 # 92.3 0.012 7.3E-06 31.2 20.0 364 1-396 1-539 (911) 43 protein:vir:1778 Length: 680 # 92.2 0.012 7.7E-06 31.0 22.3 361 1-396 166-657 (680) 44 protein:vir:2109 Length: 472 # 90.9 0.018 1.1E-05 30.1 19.2 281 1-396 1-309 (472) 45 protein:vir:105428 Length: 472 90.3 0.021 1.3E-05 29.7 18.5 252 133-396 1-309 (472) 46 protein:vir:9268 Length: 472 # 90.0 0.023 1.4E-05 29.5 19.2 281 1-396 1-309 (472) 47 protein:vir:100851 Length: 514 89.9 0.0039 2.4E-06 33.8 6.0 262 1-291 169-514 (514) 48 protein:vir:177 Length: 472 # 84.5 0.06 3.7E-05 27.3 18.6 252 133-396 1-309 (472) 49 protein:vir:95475 Length: 771 83.5 0.068 4.2E-05 27.0 20.7 374 1-396 1-514 (771) 50 protein:vir:100960 Length: 472 83.3 0.069 4.3E-05 26.9 19.4 281 1-396 1-309 (472) 51 protein:vir:3529 Length: 477 # 81.2 0.088 5.5E-05 26.4 20.3 280 1-396 7-314 (477) 52 protein:vir:102823 Length: 470 79.8 0.018 1.1E-05 30.2 4.4 264 1-291 103-470 (470) 53 protein:vir:100022 Length: 976 79.6 0.1 6.4E-05 26.0 18.9 314 1-396 325-693 (976) 54 protein:vir:78703 Length: 905 41.3 0.93 0.00058 20.8 21.0 359 1-396 181-649 (905) No 1 >protein:vir:105563 Length: 396 # NCBI annotation: hypothetical protein # Family: family:all:27455 # MgeID: mge:1540 # MgeName: F116 # Cross-refs: genbank:acc:YP_164316;genbank:gi:56692963;genbank:GeneID:3197174 Probab=100.00 E-value=5.8e-144 Score=805.86 Aligned_cols=396 Identities=100% Similarity=1.494 Sum_probs=391.6 Q ss_pred CCcccccceeccCCcCChhheeeCCCchhhheeeeeeeeeCCCccEEECCcceeecCCccccccCCcccccEEEEECCeE Q lcl|Aclame:pro 1 MATTSLVPLAGINNVAEDAALQRGGESPRLYVRDAVNIDLSPAGKAQLRASVRQVTDQPFRQLWQSPLHGDAFGALGDQW 80 (396) Q Consensus 1 m~~~~~~p~~G~nn~~~~~~L~~~~~~~~~~lrdAvNVD~~~~G~l~~R~G~~~~~~~~~~~lw~s~~~~~~~~~~dg~L 80 (396) |+++.|.||+|||||++|++|++++|++.+++|||+||||+++|+.+||.|+||++++++|+|||+|+|+|+|+++++.| T Consensus 1 ~~~~~~~~~~ginnv~~e~~l~~~~~~~~~~~r~a~nvdi~~~G~~~~r~~~tr~~~g~l~~~~~~~~~~~~~~~~~~tl 80 (396) T protein:vir:10 1 MATTSLVPLAGINNVAEDAALQRGGESPRLYVRDAVNIDLSPAGKAQLRASVRQVTDQPFRQLWQSPLHGDAFGALGDQW 80 (396) T ss_pred CcceeeeeeecccccccccccccCCCcccceeeeeeeecccCCCchhhhccCcccCCceecccccCccccceeeeCCceE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEecCCCceeecccccCcceehhhcCCeEEEEcCCcceeecCceeeeccccCCccceeecCCCCcccceEEEEEEEEcC Q lcl|Aclame:pro 81 GKVDPHSWTFEPLAQIGEGDLSHEVLNNRVCVAGTAGIFTYDGAQAERLTLDTPAPPLLVAGAGSLSQGTYGAAVAWLRG 160 (396) Q Consensus 81 ~~i~~~~w~~~vl~~ig~gpV~~~v~n~rvy~t~~~~~~~~~g~~~~~l~ip~Pa~p~~~~~~Gsl~~g~y~ya~T~V~~ 160 (396) +|+|.++|+..++++++++||||+++||||||||+..+|+++++..|+|++|+|++++.+...|++++++|+|++|||+. T Consensus 81 ~~~~~~~w~~~~~v~v~~~pva~d~~~~Rvy~t~~~~p~~~~~~~~y~L~vp~P~~a~~~a~~Gsl~~~~~~Y~~t~V~~ 160 (396) T protein:vir:10 81 GKVDPHSWTFEPLAQIGEGDLSHEVLNNRVCVAGTAGIFTYDGAQAERLTLDTPAPPLLVAGAGSLSQGTYGAAVAWLRG 160 (396) T ss_pred EEEeCCeEEEEeeeeeccCchhccccCCeEEEEcCCCceeeeCCcceecCcCCCcccccccccCccCCceEEEEEEEEec Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCcccccccceeEecCCCccEEEeecCCCCCcceEEEEEEecCCCeEEEEEeecceeEEEEcCCchhhcccccchhcCCc Q lcl|Aclame:pro 161 PQESAPSLIAFAEVTDAGALEVTFPLCLDASVTGARLYLTRANGGELLLAGDYPLGAATVILPTLPELGRPAQFRHLSPM 240 (396) Q Consensus 161 ~gEeg~~~~~S~~vt~~~~~~v~lp~~~~~~i~~~RIYrs~~~g~~~~lv~e~~~~~~~~~d~~~~~lg~~l~t~~~~pp 240 (396) +||||+|+++|++|+++++++|+++.+++++|+++|||||.++|++|||++|+++++++|.+++.++++++++++++.|| T Consensus 161 ~gEEs~p~~~S~~v~~~gg~~vtl~~~~~~~i~~~RiYrS~~~G~~~~l~aE~~a~~~s~vlPs~~w~gpP~~~~gL~pm 240 (396) T protein:vir:10 161 PQESAPSLIAFAEVTDAGALEVTFPLCLDASVTGARLYLTRANGGELLLAGDYPLGAATVILPTLPELGRPAQFRHLSPM 240 (396) T ss_pred CCCcCcccccccccCCCCCcEEEEEcccCCCcceEEEEEeCCChhhhhheehhccceeeeeeecCCCCCCCccccccccC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCCceeeccCCEEEEEECCEEEEccCCCCcccccccccEecCcceEEEEEcCCcEEEEEcCcEEEEEcCchhheeeeeec Q lcl|Aclame:pro 241 PTGKHLAYWRGRLLIARANVLRFSEALAYHLHDERYGFVQMPQRITFVQPVDGGIWVGQVDHVAFLDGADPASLSVSRRA 320 (396) Q Consensus 241 P~g~~~~~~nGrl~~a~Gn~l~fSEp~~p~aw~~~y~~~~~~~~I~~i~~v~~gl~V~T~~~~y~l~G~~p~~m~~~~~~ 320 (396) |.|++++.|+||+|+|+||+|||||||+||+|++.++|++|++||++|+|+++||||+|+++|||++|++|++|+++|++ T Consensus 241 P~G~~~A~faGRi~~A~Gn~V~FSEp~~Ph~~~~~~~~~~~~~~Iv~lapv~~gL~Vgt~~~~y~~~G~dP~sms~~~l~ 320 (396) T protein:vir:10 241 PTGKHLAYWRGRLLIARANVLRFSEALAYHLHDERYGFVQMPQRITFVQPVDGGIWVGQVDHVAFLDGADPASLSVSRRA 320 (396) T ss_pred chhHhhhhhcceEEEEeCCEEEEecCCCCceecchhccCCCCCceEEEEEecCeEEEEEcCcEEEEEcCChhHcceeecc Confidence 99999999999999999999999999999999999899999999999999999999999999999999999999999999 Q ss_pred cCCCcccceeecchhhhccccccCcccEEEEecCCCEEEEcCCCcEEEEecceeeccccccceEEEeCcEEEEEeC Q lcl|Aclame:pro 321 SRAPVPGSAVLVPAEVVGTNASPDGSPVAVWLAENGYVMGTSSGAIAEVHAGVLAGITGRAGTSVVFDRRLLTAVS 396 (396) Q Consensus 321 ~~~p~~~s~~~~~~~~~~~~~~~~~~~~~lw~s~~Glv~g~~~G~~~~lt~~~~~~~~a~~~~~~~~~rr~v~~~~ 396 (396) +.+|++|||+.+|++|++++++++++.+++|+|+||||+|+++|++.++|++++++....+++++..+|||++.+| T Consensus 321 ~~~pvp~S~v~~p~~~~s~rs~~~~~~~~lwas~dGl~~g~~~G~v~~l~~~~i~p~~~~A~~~~~~drRy~~~~~ 396 (396) T protein:vir:10 321 SRAPVPGSAVLVPAEVVGTNASPDGSPVAVWLAENGYVMGTSSGAIAEVHAGVLAGITGRAGTSVVFDRRLLTAVS 396 (396) T ss_pred cCCCcccchhcccchhhhcccccccCcEEEEccCCcEEEEcCCceeeeecccccCCCcccceEEEeecCeEEEEeC Confidence 9999999999999999999999999999999999999999999999999999999999999999999999999999 No 2 >protein:vir:5120 Length: 615 # NCBI annotation: unknown # Family: family:all:1544 # MgeID: mge:114 # MgeName: PBC5 # Cross-refs: genbank:acc:NP_542277;genbank:gi:18071220;genbank:GeneID:929342 Probab=100.00 E-value=2.3e-96 Score=544.87 Aligned_cols=358 Identities=17% Similarity=0.177 Sum_probs=304.0 Q ss_pred CCcccccceeccCCcCChhheeeCCCchhhheeeeeeeeeCCCccEEECCcceeecCCccccccCCcccccEEEEECCeE Q lcl|Aclame:pro 1 MATTSLVPLAGINNVAEDAALQRGGESPRLYVRDAVNIDLSPAGKAQLRASVRQVTDQPFRQLWQSPLHGDAFGALGDQW 80 (396) Q Consensus 1 m~~~~~~p~~G~nn~~~~~~L~~~~~~~~~~lrdAvNVD~~~~G~l~~R~G~~~~~~~~~~~lw~s~~~~~~~~~~dg~L 80 (396) |+.|+|.-|.|.-++..++-| |-.+..-|.|+-+ ++|.|....--..+. .+-+.. ...+|...++ T Consensus 30 M~~I~i~~f~Ge~Prl~P~lL------P~~~A~~A~N~~~-~~G~ltP~~~~~~~~-----~~~~~~-~~Tif~~~~~-- 94 (615) T protein:vir:51 30 MVAIKISAFAGEQPMLLPRLL------PETGATAAMNVRL-NDGGLTPINKPIEVA-----TIATAS-QKTIYRHQGS-- 94 (615) T ss_pred eEEEeecccccccccchhhhc------cCcccceEEeeee-cCCeeeeecCccccc-----cccccc-ceeeeeecCc-- Confidence 999999999999999999988 4556677889999 668887655433322 233221 1344544433 Q ss_pred EEEecCCCceeecccccCcceehhhcCCeEEEEcCCcceeecCceeeeccccCCcccee--ecCCCCcccceEEEEEEEE Q lcl|Aclame:pro 81 GKVDPHSWTFEPLAQIGEGDLSHEVLNNRVCVAGTAGIFTYDGAQAERLTLDTPAPPLL--VAGAGSLSQGTYGAAVAWL 158 (396) Q Consensus 81 ~~i~~~~w~~~vl~~ig~gpV~~~v~n~rvy~t~~~~~~~~~g~~~~~l~ip~Pa~p~~--~~~~Gsl~~g~y~ya~T~V 158 (396) |..|+..| ++.++||+. ||||||+|+.+|+..++..|+|+||+|+.++. .++.|+++.++|.|+|||| T Consensus 95 ----W~~w~~~V--~av~sPvA~----DRvy~tgdg~Pkv~~~~~sY~LgVpaPs~ap~~~~~g~g~~d~etr~Yv~TfV 164 (615) T protein:vir:51 95 ----WLSWPNVV--NAVPGPVAQ----DRLYFTGDGAPKVKIGGVDYALKVPRPTGALTAALSGTGSGDIQSRTYVYTWV 164 (615) T ss_pred ----eeccCCce--eEccCCccc----ceeEEcCCCcceEeecccCccccccCCCccceEEecCCCCccccceEEEEEEE Confidence 55788877 567899985 49999999999999999999999999976443 4678889999999999999 Q ss_pred cCCCcccccccceeEecCCCccEEEee----cCCCCCcceEEEEEEecC--CCeEEEEEeecceeEEEEcCC-chhhccc Q lcl|Aclame:pro 159 RGPQESAPSLIAFAEVTDAGALEVTFP----LCLDASVTGARLYLTRAN--GGELLLAGDYPLGAATVILPT-LPELGRP 231 (396) Q Consensus 159 ~~~gEeg~~~~~S~~vt~~~~~~v~lp----~~~~~~i~~~RIYrs~~~--g~~~~lv~e~~~~~~~~~d~~-~~~lg~~ 231 (396) +.+||||+|||+|..++++.+..|+|. ...+.+|+++|||||.++ |++|+|++|+++++.+|+|+. .++|+++ T Consensus 165 t~~GeES~PSp~S~~v~v~~g~tVtLs~~pa~~~~~~i~~rRIYRS~tg~~gtdy~lVAel~as~~sf~D~~~~~~Lg~~ 244 (615) T protein:vir:51 165 TSFGEESAPCPASIIVDWKPGQTVTLSGFAATPGGRSITTQRIYRSQTGKTGTGLYLIAERAASAGNFTDNIAVDQFQEP 244 (615) T ss_pred cCCCCcCCCCccceeeEecCCCeEEEeeccCCcCCCceeeEEEEEeccCCCceeeEEEeeecccceeeeeccchhhcCcc Confidence 999999999999999998877666653 345668999999998775 679999999999999999995 5678999 Q ss_pred ccchhcCCcCCC--ceeeccCCEEEEEECCEEEEccCCCCcccccccccEecCcceEEEEEcCCcEEEEEcCcEEEEEcC Q lcl|Aclame:pro 232 AQFRHLSPMPTG--KHLAYWRGRLLIARANVLRFSEALAYHLHDERYGFVQMPQRITFVQPVDGGIWVGQVDHVAFLDGA 309 (396) Q Consensus 232 l~t~~~~ppP~g--~~~~~~nGrl~~a~Gn~l~fSEp~~p~aw~~~y~~~~~~~~I~~i~~v~~gl~V~T~~~~y~l~G~ 309 (396) |+|.+|.+||++ ++|+|+||||++|.||+|||||||+|||||++| ++++++||++|++++++|||+|+++||+++|+ T Consensus 245 Lps~~w~~PP~~l~GL~~m~NGimAgF~GneV~FsEpy~PyAWP~~Y-r~t~d~dIVaiA~~gt~LVV~TkG~PYl~sG~ 323 (615) T protein:vir:51 245 LPSADWNEPPDGLAGLAEMPNGMMAAFVGRSIYFCEPYRPHAWPEKY-SRNVGSDIVGIAALGSILVVVTKGKPYLLAGT 323 (615) T ss_pred cccccccCcCcchhhhhccccceEEeecCCEEEEecCCCCcccchhc-ccCcCCCeeEEEecccEEEEEEcCceEEEEcC Confidence 999999999998 589999999999999999999999999999999 79999999999999999999999999999999 Q ss_pred chhheeeeeeccCCCcccceeecchhhhccccccCcccEEEEecCCCEEEEcCCCcEEEEecceeecc------ccccce Q lcl|Aclame:pro 310 DPASLSVSRRASRAPVPGSAVLVPAEVVGTNASPDGSPVAVWLAENGYVMGTSSGAIAEVHAGVLAGI------TGRAGT 383 (396) Q Consensus 310 ~p~~m~~~~~~~~~p~~~s~~~~~~~~~~~~~~~~~~~~~lw~s~~Glv~g~~~G~~~~lt~~~~~~~------~a~~~~ 383 (396) +|++|+++||+..|| |+++|++++++.+++|+|+||||+++++|.+.++|++.|..+ |+..- T Consensus 324 sP~sms~~kL~~~qp-----------CvS~rsiV~~~~~v~Yas~dGLV~v~~~G~a~vvT~~l~t~~qW~~l~P~ti~- 391 (615) T protein:vir:51 324 HPDSMQQQQLEENLP-----------CINARSIVDLGHAVCYASNDGLVAVRGDGSIRLVTEQLLSREKWLDLSPFTII- 391 (615) T ss_pred Chhhccccccccccc-----------cccccceeEecceEEeecCCceEEEecCCchhhhhhhccChhHHHhcCCceEE- Confidence 999999999998887 999999999999999999999999999999999999999664 33333 Q ss_pred EEEeCcEEEEEeC Q lcl|Aclame:pro 384 SVVFDRRLLTAVS 396 (396) Q Consensus 384 ~~~~~rr~v~~~~ 396 (396) +-..+.||++.-+ T Consensus 392 a~~~eG~Y~~~Y~ 404 (615) T protein:vir:51 392 GGQINGAYLLFYD 404 (615) T ss_pred EEeecCeEEEEec Confidence 3344445555443 No 3 >protein:vir:2792 Length: 567 # NCBI annotation: hypothetical protein # Family: family:all:1544 # MgeID: mge:59 # MgeName: Stx2 converting bacteriophage I # Cross-refs: genbank:acc:NP_612909;genbank:gi:20065826;genbank:GeneID:935648 Probab=100.00 E-value=2.7e-94 Score=533.55 Aligned_cols=363 Identities=15% Similarity=0.147 Sum_probs=301.2 Q ss_pred CCcccccceeccCCcCChhheeeCCCchhhheeeeeeeeeCCCccEEECCcceeecCCccccccCCcccccEEEEECCeE Q lcl|Aclame:pro 1 MATTSLVPLAGINNVAEDAALQRGGESPRLYVRDAVNIDLSPAGKAQLRASVRQVTDQPFRQLWQSPLHGDAFGALGDQW 80 (396) Q Consensus 1 m~~~~~~p~~G~nn~~~~~~L~~~~~~~~~~lrdAvNVD~~~~G~l~~R~G~~~~~~~~~~~lw~s~~~~~~~~~~dg~L 80 (396) |+-++|.-|.|.-++..++=| |-.+..-|.|+-+ +.|.|....--..+ ..+... ..+.+|...|+.| T Consensus 27 M~~i~i~~f~Ge~Prl~p~lL------P~~~a~~A~n~~~-~~G~itP~~~~~~~-----~~~~~~-~~~Tif~y~~~~W 93 (567) T protein:vir:27 27 MPYIDITTMRGMMPRVVTSML------PEHSAVLAEDCHF-RFGVITPERQISGV-----EKTFTI-KPKTIFHYRDDFW 93 (567) T ss_pred eeEEeecccccccccchhhhc------cccccceEEeeec-cCCeeeeeeccccc-----cccccc-CceeeEEEcCcEE Confidence 999999999999999999888 5666778999999 55877654322111 111111 1245666666655 Q ss_pred EEEecCCCceeecccccCcceehhhcCCeEEEEcCCcceeecCc-----------eeeeccccCCccceee--cCCCC-- Q lcl|Aclame:pro 81 GKVDPHSWTFEPLAQIGEGDLSHEVLNNRVCVAGTAGIFTYDGA-----------QAERLTLDTPAPPLLV--AGAGS-- 145 (396) Q Consensus 81 ~~i~~~~w~~~vl~~ig~gpV~~~v~n~rvy~t~~~~~~~~~g~-----------~~~~l~ip~Pa~p~~~--~~~Gs-- 145 (396) + .|+..| +..++||+.|.. +|||||||+.+|+.++. ..|+|++|+|+.++.. ++.++ T Consensus 94 ~-----~w~~~V--~~ir~PvAqD~~-~rvY~tgdg~Pk~t~~~iat~G~~~~P~~~y~LgVpaps~aP~~a~~~~~~~~ 165 (567) T protein:vir:27 94 F-----AWPDVV--DVIRSPIAQDPH-GRIYYTDGRFPKVTDATIATKGDGNHPTSSYRLGIPAPTTAPVCTVQQGGDVS 165 (567) T ss_pred E-----EeCCce--eeccCccccCCc-ceEEEecCCcceeeeeeeeecCCCCCCcchhhcccCCccccceeeecCCCCCC Confidence 5 588877 557899998644 89999999999998765 4679999999765543 34444 Q ss_pred ----cccceEEEEEEEEcCCCcccccccceeEecCC-Cc--cEEEe--ecCCCCCcceEEEEEEecC--CCeEEEEEeec Q lcl|Aclame:pro 146 ----LSQGTYGAAVAWLRGPQESAPSLIAFAEVTDA-GA--LEVTF--PLCLDASVTGARLYLTRAN--GGELLLAGDYP 214 (396) Q Consensus 146 ----l~~g~y~ya~T~V~~~gEeg~~~~~S~~vt~~-~~--~~v~l--p~~~~~~i~~~RIYrs~~~--g~~~~lv~e~~ 214 (396) +++++|+|++|||+.+||||+||++|.++++. .+ +++++ ++...++|+++|||||.++ |++|+|++|++ T Consensus 166 ~~~~~d~etr~Yv~TfVt~~GeES~PS~~S~~~~v~~pg~~V~ls~~p~~~~~~~i~~~RIYRS~tg~~gtdy~lVael~ 245 (567) T protein:vir:27 166 DDNPNDDETRFYTETFVSDYGEEGPPGPASLEVTLRTPGTAVQLTLAPVPLQNASIKRRRIYRSASGGGEADFLLVAELD 245 (567) T ss_pred CCCCcccceeEEEEEEEcCCCCcCCCcccccceeeecCCceEEEeeccCCccccccceEEEEEecCCCCceeeEEEEeec Confidence 67899999999999999999999999988874 33 44444 3456778999999998776 46999999999 Q ss_pred ceeEEEEcCCc-hhhcccccchhcCCcCCC--ceeeccCCEEEEEECCEEEEccCCCCcccccccccEecCcceEEEEEc Q lcl|Aclame:pro 215 LGAATVILPTL-PELGRPAQFRHLSPMPTG--KHLAYWRGRLLIARANVLRFSEALAYHLHDERYGFVQMPQRITFVQPV 291 (396) Q Consensus 215 ~~~~~~~d~~~-~~lg~~l~t~~~~ppP~g--~~~~~~nGrl~~a~Gn~l~fSEp~~p~aw~~~y~~~~~~~~I~~i~~v 291 (396) +++.+|+|+.. ++|+++|+|.+|.+||++ ++|+|+||||++|.||+|||||||+|||||++| ++++++||++|+++ T Consensus 246 as~~sf~D~~~~~~lg~~Lps~~w~~PP~~m~GL~~m~NGimAgF~GneV~FsEpylPyAWP~~Y-r~t~~~dIVaiA~~ 324 (567) T protein:vir:27 246 ASVLSYTDKIPAKNLGPSLATWDYLPPPENMTGLCLMANGIAAGFAGNEVMFSEAYLPYAWPEVN-RHTTAEDIVAICPL 324 (567) T ss_pred cceeeeeeccchhhcccccccccccCcCcccceeeecccceEEeecCCEEEEecCCCCcccchhh-ccCCCCCeEEEeec Confidence 99999999964 578999999999999998 599999999999999999999999999999999 69999999999999 Q ss_pred CCcEEEEEcCcEEEEEcCchhheeeeeeccCCCcccceeecchhhhccccccCcccEEEEecCCCEEEEcCCCcEEEEec Q lcl|Aclame:pro 292 DGGIWVGQVDHVAFLDGADPASLSVSRRASRAPVPGSAVLVPAEVVGTNASPDGSPVAVWLAENGYVMGTSSGAIAEVHA 371 (396) Q Consensus 292 ~~gl~V~T~~~~y~l~G~~p~~m~~~~~~~~~p~~~s~~~~~~~~~~~~~~~~~~~~~lw~s~~Glv~g~~~G~~~~lt~ 371 (396) +++|||+|+++||+++|++|++|+++||...|| |+++|++++++.+++|+|+||||+++++|++.++|+ T Consensus 325 gt~LVV~TkG~PYl~sG~sP~sms~~kL~~~qp-----------CvS~rsiV~~~g~v~Yas~dGLv~i~a~G~a~vvT~ 393 (567) T protein:vir:27 325 GTSLVVATKGEPYLFSGVSPSTISGSKIPSMQA-----------CLSRRSMVAMEGFVLYAGTNGLVSVDANGNVALATE 393 (567) T ss_pred ccEEEEEEcCceEEEEcCChhhccccccccccc-----------cccccceeEeccEEEeecCCcEEEEecCCchhhhhh Confidence 999999999999999999999999999998887 999999999999999999999999999999999999 Q ss_pred ceeecccccc------ceEEEeCcEEEEEeC Q lcl|Aclame:pro 372 GVLAGITGRA------GTSVVFDRRLLTAVS 396 (396) Q Consensus 372 ~~~~~~~a~~------~~~~~~~rr~v~~~~ 396 (396) +.|..+.+++ -.+-..+-||++.-. T Consensus 394 ~l~t~~qW~a~~~P~ti~A~~~eG~Y~a~Y~ 424 (567) T protein:vir:27 394 QIVSPEQWQSQFNPASIVAYPWRGEYIACYT 424 (567) T ss_pred hccChHHHHhcCCcceEEEEeecCeEEEEEe Confidence 9997654432 344455556666655 No 4 >protein:vir:10145 Length: 567 # NCBI annotation: hypothetical protein # Family: family:all:1544 # MgeID: mge:180 # MgeName: Stx2 converting bacteriophage II # Cross-refs: genbank:acc:NP_859275;genbank:gi:32171031;genbank:GeneID:2653447 Probab=100.00 E-value=2.7e-94 Score=533.55 Aligned_cols=363 Identities=15% Similarity=0.147 Sum_probs=301.2 Q ss_pred CCcccccceeccCCcCChhheeeCCCchhhheeeeeeeeeCCCccEEECCcceeecCCccccccCCcccccEEEEECCeE Q lcl|Aclame:pro 1 MATTSLVPLAGINNVAEDAALQRGGESPRLYVRDAVNIDLSPAGKAQLRASVRQVTDQPFRQLWQSPLHGDAFGALGDQW 80 (396) Q Consensus 1 m~~~~~~p~~G~nn~~~~~~L~~~~~~~~~~lrdAvNVD~~~~G~l~~R~G~~~~~~~~~~~lw~s~~~~~~~~~~dg~L 80 (396) |+-++|.-|.|.-++..++=| |-.+..-|.|+-+ +.|.|....--..+ ..+... ..+.+|...|+.| T Consensus 27 M~~i~i~~f~Ge~Prl~p~lL------P~~~a~~A~n~~~-~~G~itP~~~~~~~-----~~~~~~-~~~Tif~y~~~~W 93 (567) T protein:vir:10 27 MPYIDITTMRGMMPRVVTSML------PEHSAVLAEDCHF-RFGVITPERQISGV-----EKTFTI-KPKTIFHYRDDFW 93 (567) T ss_pred eeEEeecccccccccchhhhc------cccccceEEeeec-cCCeeeeeeccccc-----cccccc-CceeeEEEcCcEE Confidence 999999999999999999888 5666778999999 55877654322111 111111 1245666666655 Q ss_pred EEEecCCCceeecccccCcceehhhcCCeEEEEcCCcceeecCc-----------eeeeccccCCccceee--cCCCC-- Q lcl|Aclame:pro 81 GKVDPHSWTFEPLAQIGEGDLSHEVLNNRVCVAGTAGIFTYDGA-----------QAERLTLDTPAPPLLV--AGAGS-- 145 (396) Q Consensus 81 ~~i~~~~w~~~vl~~ig~gpV~~~v~n~rvy~t~~~~~~~~~g~-----------~~~~l~ip~Pa~p~~~--~~~Gs-- 145 (396) + .|+..| +..++||+.|.. +|||||||+.+|+.++. ..|+|++|+|+.++.. ++.++ T Consensus 94 ~-----~w~~~V--~~ir~PvAqD~~-~rvY~tgdg~Pk~t~~~iat~G~~~~P~~~y~LgVpaps~aP~~a~~~~~~~~ 165 (567) T protein:vir:10 94 F-----AWPDVV--DVIRSPIAQDPH-GRIYYTDGRFPKVTDATIATKGDGNHPTSSYRLGIPAPTTAPVCTVQQGGDVS 165 (567) T ss_pred E-----EeCCce--eeccCccccCCc-ceEEEecCCcceeeeeeeeecCCCCCCcchhhcccCCccccceeeecCCCCCC Confidence 5 588877 557899998644 89999999999998765 4679999999765543 34444 Q ss_pred ----cccceEEEEEEEEcCCCcccccccceeEecCC-Cc--cEEEe--ecCCCCCcceEEEEEEecC--CCeEEEEEeec Q lcl|Aclame:pro 146 ----LSQGTYGAAVAWLRGPQESAPSLIAFAEVTDA-GA--LEVTF--PLCLDASVTGARLYLTRAN--GGELLLAGDYP 214 (396) Q Consensus 146 ----l~~g~y~ya~T~V~~~gEeg~~~~~S~~vt~~-~~--~~v~l--p~~~~~~i~~~RIYrs~~~--g~~~~lv~e~~ 214 (396) +++++|+|++|||+.+||||+||++|.++++. .+ +++++ ++...++|+++|||||.++ |++|+|++|++ T Consensus 166 ~~~~~d~etr~Yv~TfVt~~GeES~PS~~S~~~~v~~pg~~V~ls~~p~~~~~~~i~~~RIYRS~tg~~gtdy~lVael~ 245 (567) T protein:vir:10 166 DDNPNDDETRFYTETFVSDYGEEGPPGPASLEVTLRTPGTAVQLTLAPVPLQNASIKRRRIYRSASGGGEADFLLVAELD 245 (567) T ss_pred CCCCcccceeEEEEEEEcCCCCcCCCcccccceeeecCCceEEEeeccCCccccccceEEEEEecCCCCceeeEEEEeec Confidence 67899999999999999999999999988874 33 44444 3456778999999998776 46999999999 Q ss_pred ceeEEEEcCCc-hhhcccccchhcCCcCCC--ceeeccCCEEEEEECCEEEEccCCCCcccccccccEecCcceEEEEEc Q lcl|Aclame:pro 215 LGAATVILPTL-PELGRPAQFRHLSPMPTG--KHLAYWRGRLLIARANVLRFSEALAYHLHDERYGFVQMPQRITFVQPV 291 (396) Q Consensus 215 ~~~~~~~d~~~-~~lg~~l~t~~~~ppP~g--~~~~~~nGrl~~a~Gn~l~fSEp~~p~aw~~~y~~~~~~~~I~~i~~v 291 (396) +++.+|+|+.. ++|+++|+|.+|.+||++ ++|+|+||||++|.||+|||||||+|||||++| ++++++||++|+++ T Consensus 246 as~~sf~D~~~~~~lg~~Lps~~w~~PP~~m~GL~~m~NGimAgF~GneV~FsEpylPyAWP~~Y-r~t~~~dIVaiA~~ 324 (567) T protein:vir:10 246 ASVLSYTDKIPAKNLGPSLATWDYLPPPENMTGLCLMANGIAAGFAGNEVMFSEAYLPYAWPEVN-RHTTAEDIVAICPL 324 (567) T ss_pred cceeeeeeccchhhcccccccccccCcCcccceeeecccceEEeecCCEEEEecCCCCcccchhh-ccCCCCCeEEEeec Confidence 99999999964 578999999999999998 599999999999999999999999999999999 69999999999999 Q ss_pred CCcEEEEEcCcEEEEEcCchhheeeeeeccCCCcccceeecchhhhccccccCcccEEEEecCCCEEEEcCCCcEEEEec Q lcl|Aclame:pro 292 DGGIWVGQVDHVAFLDGADPASLSVSRRASRAPVPGSAVLVPAEVVGTNASPDGSPVAVWLAENGYVMGTSSGAIAEVHA 371 (396) Q Consensus 292 ~~gl~V~T~~~~y~l~G~~p~~m~~~~~~~~~p~~~s~~~~~~~~~~~~~~~~~~~~~lw~s~~Glv~g~~~G~~~~lt~ 371 (396) +++|||+|+++||+++|++|++|+++||...|| |+++|++++++.+++|+|+||||+++++|++.++|+ T Consensus 325 gt~LVV~TkG~PYl~sG~sP~sms~~kL~~~qp-----------CvS~rsiV~~~g~v~Yas~dGLv~i~a~G~a~vvT~ 393 (567) T protein:vir:10 325 GTSLVVATKGEPYLFSGVSPSTISGSKIPSMQA-----------CLSRRSMVAMEGFVLYAGTNGLVSVDANGNVALATE 393 (567) T ss_pred ccEEEEEEcCceEEEEcCChhhccccccccccc-----------cccccceeEeccEEEeecCCcEEEEecCCchhhhhh Confidence 999999999999999999999999999998887 999999999999999999999999999999999999 Q ss_pred ceeecccccc------ceEEEeCcEEEEEeC Q lcl|Aclame:pro 372 GVLAGITGRA------GTSVVFDRRLLTAVS 396 (396) Q Consensus 372 ~~~~~~~a~~------~~~~~~~rr~v~~~~ 396 (396) +.|..+.+++ -.+-..+-||++.-. T Consensus 394 ~l~t~~qW~a~~~P~ti~A~~~eG~Y~a~Y~ 424 (567) T protein:vir:10 394 QIVSPEQWQSQFNPASIVAYPWRGEYIACYT 424 (567) T ss_pred hccChHHHHhcCCcceEEEEeecCeEEEEEe Confidence 9997654432 344455556666655 No 5 >protein:vir:9979 Length: 567 # NCBI annotation: hypothetical protein # Family: family:all:1544 # MgeID: mge:179 # MgeName: Stx1 converting bacteriophage # Cross-refs: genbank:acc:NP_859109;genbank:gi:32170864;genbank:GeneID:2653256 Probab=100.00 E-value=2.7e-94 Score=533.55 Aligned_cols=363 Identities=15% Similarity=0.147 Sum_probs=301.2 Q ss_pred CCcccccceeccCCcCChhheeeCCCchhhheeeeeeeeeCCCccEEECCcceeecCCccccccCCcccccEEEEECCeE Q lcl|Aclame:pro 1 MATTSLVPLAGINNVAEDAALQRGGESPRLYVRDAVNIDLSPAGKAQLRASVRQVTDQPFRQLWQSPLHGDAFGALGDQW 80 (396) Q Consensus 1 m~~~~~~p~~G~nn~~~~~~L~~~~~~~~~~lrdAvNVD~~~~G~l~~R~G~~~~~~~~~~~lw~s~~~~~~~~~~dg~L 80 (396) |+-++|.-|.|.-++..++=| |-.+..-|.|+-+ +.|.|....--..+ ..+... ..+.+|...|+.| T Consensus 27 M~~i~i~~f~Ge~Prl~p~lL------P~~~a~~A~n~~~-~~G~itP~~~~~~~-----~~~~~~-~~~Tif~y~~~~W 93 (567) T protein:vir:99 27 MPYIDITTMRGMMPRVVTSML------PEHSAVLAEDCHF-RFGVITPERQISGV-----EKTFTI-KPKTIFHYRDDFW 93 (567) T ss_pred eeEEeecccccccccchhhhc------cccccceEEeeec-cCCeeeeeeccccc-----cccccc-CceeeEEEcCcEE Confidence 999999999999999999888 5666778999999 55877654322111 111111 1245666666655 Q ss_pred EEEecCCCceeecccccCcceehhhcCCeEEEEcCCcceeecCc-----------eeeeccccCCccceee--cCCCC-- Q lcl|Aclame:pro 81 GKVDPHSWTFEPLAQIGEGDLSHEVLNNRVCVAGTAGIFTYDGA-----------QAERLTLDTPAPPLLV--AGAGS-- 145 (396) Q Consensus 81 ~~i~~~~w~~~vl~~ig~gpV~~~v~n~rvy~t~~~~~~~~~g~-----------~~~~l~ip~Pa~p~~~--~~~Gs-- 145 (396) + .|+..| +..++||+.|.. +|||||||+.+|+.++. ..|+|++|+|+.++.. ++.++ T Consensus 94 ~-----~w~~~V--~~ir~PvAqD~~-~rvY~tgdg~Pk~t~~~iat~G~~~~P~~~y~LgVpaps~aP~~a~~~~~~~~ 165 (567) T protein:vir:99 94 F-----AWPDVV--DVIRSPIAQDPH-GRIYYTDGRFPKVTDATIATKGDGNHPTSSYRLGIPAPTTAPVCTVQQGGDVS 165 (567) T ss_pred E-----EeCCce--eeccCccccCCc-ceEEEecCCcceeeeeeeeecCCCCCCcchhhcccCCccccceeeecCCCCCC Confidence 5 588877 557899998644 89999999999998765 4679999999765543 34444 Q ss_pred ----cccceEEEEEEEEcCCCcccccccceeEecCC-Cc--cEEEe--ecCCCCCcceEEEEEEecC--CCeEEEEEeec Q lcl|Aclame:pro 146 ----LSQGTYGAAVAWLRGPQESAPSLIAFAEVTDA-GA--LEVTF--PLCLDASVTGARLYLTRAN--GGELLLAGDYP 214 (396) Q Consensus 146 ----l~~g~y~ya~T~V~~~gEeg~~~~~S~~vt~~-~~--~~v~l--p~~~~~~i~~~RIYrs~~~--g~~~~lv~e~~ 214 (396) +++++|+|++|||+.+||||+||++|.++++. .+ +++++ ++...++|+++|||||.++ |++|+|++|++ T Consensus 166 ~~~~~d~etr~Yv~TfVt~~GeES~PS~~S~~~~v~~pg~~V~ls~~p~~~~~~~i~~~RIYRS~tg~~gtdy~lVael~ 245 (567) T protein:vir:99 166 DDNPNDDETRFYTETFVSDYGEEGPPGPASLEVTLRTPGTAVQLTLAPVPLQNASIKRRRIYRSASGGGEADFLLVAELD 245 (567) T ss_pred CCCCcccceeEEEEEEEcCCCCcCCCcccccceeeecCCceEEEeeccCCccccccceEEEEEecCCCCceeeEEEEeec Confidence 67899999999999999999999999988874 33 44444 3456778999999998776 46999999999 Q ss_pred ceeEEEEcCCc-hhhcccccchhcCCcCCC--ceeeccCCEEEEEECCEEEEccCCCCcccccccccEecCcceEEEEEc Q lcl|Aclame:pro 215 LGAATVILPTL-PELGRPAQFRHLSPMPTG--KHLAYWRGRLLIARANVLRFSEALAYHLHDERYGFVQMPQRITFVQPV 291 (396) Q Consensus 215 ~~~~~~~d~~~-~~lg~~l~t~~~~ppP~g--~~~~~~nGrl~~a~Gn~l~fSEp~~p~aw~~~y~~~~~~~~I~~i~~v 291 (396) +++.+|+|+.. ++|+++|+|.+|.+||++ ++|+|+||||++|.||+|||||||+|||||++| ++++++||++|+++ T Consensus 246 as~~sf~D~~~~~~lg~~Lps~~w~~PP~~m~GL~~m~NGimAgF~GneV~FsEpylPyAWP~~Y-r~t~~~dIVaiA~~ 324 (567) T protein:vir:99 246 ASVLSYTDKIPAKNLGPSLATWDYLPPPENMTGLCLMANGIAAGFAGNEVMFSEAYLPYAWPEVN-RHTTAEDIVAICPL 324 (567) T ss_pred cceeeeeeccchhhcccccccccccCcCcccceeeecccceEEeecCCEEEEecCCCCcccchhh-ccCCCCCeEEEeec Confidence 99999999964 578999999999999998 599999999999999999999999999999999 69999999999999 Q ss_pred CCcEEEEEcCcEEEEEcCchhheeeeeeccCCCcccceeecchhhhccccccCcccEEEEecCCCEEEEcCCCcEEEEec Q lcl|Aclame:pro 292 DGGIWVGQVDHVAFLDGADPASLSVSRRASRAPVPGSAVLVPAEVVGTNASPDGSPVAVWLAENGYVMGTSSGAIAEVHA 371 (396) Q Consensus 292 ~~gl~V~T~~~~y~l~G~~p~~m~~~~~~~~~p~~~s~~~~~~~~~~~~~~~~~~~~~lw~s~~Glv~g~~~G~~~~lt~ 371 (396) +++|||+|+++||+++|++|++|+++||...|| |+++|++++++.+++|+|+||||+++++|++.++|+ T Consensus 325 gt~LVV~TkG~PYl~sG~sP~sms~~kL~~~qp-----------CvS~rsiV~~~g~v~Yas~dGLv~i~a~G~a~vvT~ 393 (567) T protein:vir:99 325 GTSLVVATKGEPYLFSGVSPSTISGSKIPSMQA-----------CLSRRSMVAMEGFVLYAGTNGLVSVDANGNVALATE 393 (567) T ss_pred ccEEEEEEcCceEEEEcCChhhccccccccccc-----------cccccceeEeccEEEeecCCcEEEEecCCchhhhhh Confidence 999999999999999999999999999998887 999999999999999999999999999999999999 Q ss_pred ceeecccccc------ceEEEeCcEEEEEeC Q lcl|Aclame:pro 372 GVLAGITGRA------GTSVVFDRRLLTAVS 396 (396) Q Consensus 372 ~~~~~~~a~~------~~~~~~~rr~v~~~~ 396 (396) +.|..+.+++ -.+-..+-||++.-. T Consensus 394 ~l~t~~qW~a~~~P~ti~A~~~eG~Y~a~Y~ 424 (567) T protein:vir:99 394 QIVSPEQWQSQFNPASIVAYPWRGEYIACYT 424 (567) T ss_pred hccChHHHHhcCCcceEEEEeecCeEEEEEe Confidence 9997654432 344455556666655 No 6 >protein:vir:3306 Length: 567 # NCBI annotation: hypothetical protein # Family: family:all:1544 # MgeID: mge:66 # MgeName: 933W # Cross-refs: genbank:acc:NP_049522;genbank:gi:9632528;genbank:GeneID:1262016 Probab=100.00 E-value=2.7e-94 Score=533.55 Aligned_cols=363 Identities=15% Similarity=0.147 Sum_probs=301.2 Q ss_pred CCcccccceeccCCcCChhheeeCCCchhhheeeeeeeeeCCCccEEECCcceeecCCccccccCCcccccEEEEECCeE Q lcl|Aclame:pro 1 MATTSLVPLAGINNVAEDAALQRGGESPRLYVRDAVNIDLSPAGKAQLRASVRQVTDQPFRQLWQSPLHGDAFGALGDQW 80 (396) Q Consensus 1 m~~~~~~p~~G~nn~~~~~~L~~~~~~~~~~lrdAvNVD~~~~G~l~~R~G~~~~~~~~~~~lw~s~~~~~~~~~~dg~L 80 (396) |+-++|.-|.|.-++..++=| |-.+..-|.|+-+ +.|.|....--..+ ..+... ..+.+|...|+.| T Consensus 27 M~~i~i~~f~Ge~Prl~p~lL------P~~~a~~A~n~~~-~~G~itP~~~~~~~-----~~~~~~-~~~Tif~y~~~~W 93 (567) T protein:vir:33 27 MPYIDITTMRGMMPRVVTSML------PEHSAVLAEDCHF-RFGVITPERQISGV-----EKTFTI-KPKTIFHYRDDFW 93 (567) T ss_pred eeEEeecccccccccchhhhc------cccccceEEeeec-cCCeeeeeeccccc-----cccccc-CceeeEEEcCcEE Confidence 999999999999999999888 5666778999999 55877654322111 111111 1245666666655 Q ss_pred EEEecCCCceeecccccCcceehhhcCCeEEEEcCCcceeecCc-----------eeeeccccCCccceee--cCCCC-- Q lcl|Aclame:pro 81 GKVDPHSWTFEPLAQIGEGDLSHEVLNNRVCVAGTAGIFTYDGA-----------QAERLTLDTPAPPLLV--AGAGS-- 145 (396) Q Consensus 81 ~~i~~~~w~~~vl~~ig~gpV~~~v~n~rvy~t~~~~~~~~~g~-----------~~~~l~ip~Pa~p~~~--~~~Gs-- 145 (396) + .|+..| +..++||+.|.. +|||||||+.+|+.++. ..|+|++|+|+.++.. ++.++ T Consensus 94 ~-----~w~~~V--~~ir~PvAqD~~-~rvY~tgdg~Pk~t~~~iat~G~~~~P~~~y~LgVpaps~aP~~a~~~~~~~~ 165 (567) T protein:vir:33 94 F-----AWPDVV--DVIRSPIAQDPH-GRIYYTDGRFPKVTDATIATKGDGNHPTSSYRLGIPAPTTAPVCTVQQGGDVS 165 (567) T ss_pred E-----EeCCce--eeccCccccCCc-ceEEEecCCcceeeeeeeeecCCCCCCcchhhcccCCccccceeeecCCCCCC Confidence 5 588877 557899998644 89999999999998765 4679999999765543 34444 Q ss_pred ----cccceEEEEEEEEcCCCcccccccceeEecCC-Cc--cEEEe--ecCCCCCcceEEEEEEecC--CCeEEEEEeec Q lcl|Aclame:pro 146 ----LSQGTYGAAVAWLRGPQESAPSLIAFAEVTDA-GA--LEVTF--PLCLDASVTGARLYLTRAN--GGELLLAGDYP 214 (396) Q Consensus 146 ----l~~g~y~ya~T~V~~~gEeg~~~~~S~~vt~~-~~--~~v~l--p~~~~~~i~~~RIYrs~~~--g~~~~lv~e~~ 214 (396) +++++|+|++|||+.+||||+||++|.++++. .+ +++++ ++...++|+++|||||.++ |++|+|++|++ T Consensus 166 ~~~~~d~etr~Yv~TfVt~~GeES~PS~~S~~~~v~~pg~~V~ls~~p~~~~~~~i~~~RIYRS~tg~~gtdy~lVael~ 245 (567) T protein:vir:33 166 DDNPNDDETRFYTETFVSDYGEEGPPGPASLEVTLRTPGTAVQLTLAPVPLQNASIKRRRIYRSASGGGEADFLLVAELD 245 (567) T ss_pred CCCCcccceeEEEEEEEcCCCCcCCCcccccceeeecCCceEEEeeccCCccccccceEEEEEecCCCCceeeEEEEeec Confidence 67899999999999999999999999988874 33 44444 3456778999999998776 46999999999 Q ss_pred ceeEEEEcCCc-hhhcccccchhcCCcCCC--ceeeccCCEEEEEECCEEEEccCCCCcccccccccEecCcceEEEEEc Q lcl|Aclame:pro 215 LGAATVILPTL-PELGRPAQFRHLSPMPTG--KHLAYWRGRLLIARANVLRFSEALAYHLHDERYGFVQMPQRITFVQPV 291 (396) Q Consensus 215 ~~~~~~~d~~~-~~lg~~l~t~~~~ppP~g--~~~~~~nGrl~~a~Gn~l~fSEp~~p~aw~~~y~~~~~~~~I~~i~~v 291 (396) +++.+|+|+.. ++|+++|+|.+|.+||++ ++|+|+||||++|.||+|||||||+|||||++| ++++++||++|+++ T Consensus 246 as~~sf~D~~~~~~lg~~Lps~~w~~PP~~m~GL~~m~NGimAgF~GneV~FsEpylPyAWP~~Y-r~t~~~dIVaiA~~ 324 (567) T protein:vir:33 246 ASVLSYTDKIPAKNLGPSLATWDYLPPPENMTGLCLMANGIAAGFAGNEVMFSEAYLPYAWPEVN-RHTTAEDIVAICPL 324 (567) T ss_pred cceeeeeeccchhhcccccccccccCcCcccceeeecccceEEeecCCEEEEecCCCCcccchhh-ccCCCCCeEEEeec Confidence 99999999964 578999999999999998 599999999999999999999999999999999 69999999999999 Q ss_pred CCcEEEEEcCcEEEEEcCchhheeeeeeccCCCcccceeecchhhhccccccCcccEEEEecCCCEEEEcCCCcEEEEec Q lcl|Aclame:pro 292 DGGIWVGQVDHVAFLDGADPASLSVSRRASRAPVPGSAVLVPAEVVGTNASPDGSPVAVWLAENGYVMGTSSGAIAEVHA 371 (396) Q Consensus 292 ~~gl~V~T~~~~y~l~G~~p~~m~~~~~~~~~p~~~s~~~~~~~~~~~~~~~~~~~~~lw~s~~Glv~g~~~G~~~~lt~ 371 (396) +++|||+|+++||+++|++|++|+++||...|| |+++|++++++.+++|+|+||||+++++|++.++|+ T Consensus 325 gt~LVV~TkG~PYl~sG~sP~sms~~kL~~~qp-----------CvS~rsiV~~~g~v~Yas~dGLv~i~a~G~a~vvT~ 393 (567) T protein:vir:33 325 GTSLVVATKGEPYLFSGVSPSTISGSKIPSMQA-----------CLSRRSMVAMEGFVLYAGTNGLVSVDANGNVALATE 393 (567) T ss_pred ccEEEEEEcCceEEEEcCChhhccccccccccc-----------cccccceeEeccEEEeecCCcEEEEecCCchhhhhh Confidence 999999999999999999999999999998887 999999999999999999999999999999999999 Q ss_pred ceeecccccc------ceEEEeCcEEEEEeC Q lcl|Aclame:pro 372 GVLAGITGRA------GTSVVFDRRLLTAVS 396 (396) Q Consensus 372 ~~~~~~~a~~------~~~~~~~rr~v~~~~ 396 (396) +.|..+.+++ -.+-..+-||++.-. T Consensus 394 ~l~t~~qW~a~~~P~ti~A~~~eG~Y~a~Y~ 424 (567) T protein:vir:33 394 QIVSPEQWQSQFNPASIVAYPWRGEYIACYT 424 (567) T ss_pred hccChHHHHhcCCcceEEEEeecCeEEEEEe Confidence 9997654432 344455556666655 No 7 >protein:vir:827 Length: 567 # NCBI annotation: hypothetical protein # Family: family:all:1544 # MgeID: mge:16 # MgeName: VT2-Sa # Cross-refs: genbank:acc:NP_050560;genbank:gi:9633457;genbank:GeneID:1262210 Probab=100.00 E-value=4.9e-94 Score=532.16 Aligned_cols=363 Identities=15% Similarity=0.146 Sum_probs=301.4 Q ss_pred CCcccccceeccCCcCChhheeeCCCchhhheeeeeeeeeCCCccEEECCcceeecCCccccccCCcccccEEEEECCeE Q lcl|Aclame:pro 1 MATTSLVPLAGINNVAEDAALQRGGESPRLYVRDAVNIDLSPAGKAQLRASVRQVTDQPFRQLWQSPLHGDAFGALGDQW 80 (396) Q Consensus 1 m~~~~~~p~~G~nn~~~~~~L~~~~~~~~~~lrdAvNVD~~~~G~l~~R~G~~~~~~~~~~~lw~s~~~~~~~~~~dg~L 80 (396) |+-++|.-|.|.-++..++=| |-.+..-|.|+-+ +.|.|....--..+ ..+... ..+.+|...++.| T Consensus 27 M~~i~i~~f~Ge~Prl~p~lL------P~~~a~~A~n~~~-~~G~itP~~~~~~~-----~~~~~~-~~~Tif~y~~~~W 93 (567) T protein:vir:82 27 MPYIDITTMRGMMPRVVTSML------PEHSAVLAEDCHF-RFGVITPERQISGV-----EKTFTI-KPKTIFHYRDDFW 93 (567) T ss_pred eeEEeecccccccccchhhhc------cccccceEEeeee-cCCeeeeeeccccc-----cccccc-CceeeeeecCcEe Confidence 999999999999999999888 5666778999999 55877654322111 111111 1245666666655 Q ss_pred EEEecCCCceeecccccCcceehhhcCCeEEEEcCCcceeecCc-----------eeeeccccCCccceee--cCCCC-- Q lcl|Aclame:pro 81 GKVDPHSWTFEPLAQIGEGDLSHEVLNNRVCVAGTAGIFTYDGA-----------QAERLTLDTPAPPLLV--AGAGS-- 145 (396) Q Consensus 81 ~~i~~~~w~~~vl~~ig~gpV~~~v~n~rvy~t~~~~~~~~~g~-----------~~~~l~ip~Pa~p~~~--~~~Gs-- 145 (396) + .|+..| +..++||+.|.. +|||||||+.+|+.++. ..|+|++|+|+.++.. ++.++ T Consensus 94 ~-----~w~~~V--~~ir~PvAqD~~-~rvY~tgdg~Pk~t~~~iat~G~~~~P~~~y~LgVpaps~aP~~a~~~~~~~~ 165 (567) T protein:vir:82 94 F-----AWPDVV--DVIRSPIAQDPH-GRIYYTDGRFPKVTDATIATKGDGNHPTSSYRLGIPAPTTAPVCTVQQGGDVS 165 (567) T ss_pred E-----EeCCce--eeccCccccCCc-ccEEEecCCcceeeeeeeeecCCCCCCcchhhcccCCccccceeeecCCCCCC Confidence 4 588877 557899998644 89999999999998765 4679999999765543 34444 Q ss_pred ----cccceEEEEEEEEcCCCcccccccceeEecCC-Cc--cEEEe--ecCCCCCcceEEEEEEecC--CCeEEEEEeec Q lcl|Aclame:pro 146 ----LSQGTYGAAVAWLRGPQESAPSLIAFAEVTDA-GA--LEVTF--PLCLDASVTGARLYLTRAN--GGELLLAGDYP 214 (396) Q Consensus 146 ----l~~g~y~ya~T~V~~~gEeg~~~~~S~~vt~~-~~--~~v~l--p~~~~~~i~~~RIYrs~~~--g~~~~lv~e~~ 214 (396) +++++|+|++|||+.+||||+||++|.++++. .+ +++++ ++...++|+++|||||.++ |++|+|++|++ T Consensus 166 ~~~p~d~etr~Yv~TfVt~~GeES~PS~~S~~~~v~~pg~~V~ls~~p~~~~~~~i~~~RIYRS~tg~~gtdy~lVael~ 245 (567) T protein:vir:82 166 DDNPNDDETRFYTETFVSDYGEEGPPGPASLEVTLRTPGTAVQLTLAPVPLQNASIKRRRIYRSASGGGEADFLLVAELD 245 (567) T ss_pred CCCCccccceEEEEEEEcCCCCcCCCcccccceeeecCCceEEEeeccCCccccccceEEEEEecCCCCceeeEEEEeec Confidence 67789999999999999999999999988874 33 44544 3456778999999998776 46999999999 Q ss_pred ceeEEEEcCCc-hhhcccccchhcCCcCCC--ceeeccCCEEEEEECCEEEEccCCCCcccccccccEecCcceEEEEEc Q lcl|Aclame:pro 215 LGAATVILPTL-PELGRPAQFRHLSPMPTG--KHLAYWRGRLLIARANVLRFSEALAYHLHDERYGFVQMPQRITFVQPV 291 (396) Q Consensus 215 ~~~~~~~d~~~-~~lg~~l~t~~~~ppP~g--~~~~~~nGrl~~a~Gn~l~fSEp~~p~aw~~~y~~~~~~~~I~~i~~v 291 (396) +++.+|+|+.. ++|+++|+|.+|.+||++ ++|+|+||||++|.||+|||||||+|||||++| ++++++||++|+++ T Consensus 246 as~~sf~D~~~~~~lg~~Lps~~w~~PP~~m~GL~~m~NGimAgF~GneV~FsEpylPyAWP~~Y-r~t~~~dIVaiA~~ 324 (567) T protein:vir:82 246 ASVLSYTDKIPAKNLGPSLATWDYLPPPENMTGLCLMANGIAAGFAGNEVMFSEAYLPYAWPEVN-RHTTAEDIVAICPL 324 (567) T ss_pred cceeeeeeccchhhcccccccccccCcCcccceeeecccceEEeecCCEEEEecCCCCcccchhh-ccCCCCCeEEEEec Confidence 99999999964 578999999999999998 599999999999999999999999999999999 69999999999999 Q ss_pred CCcEEEEEcCcEEEEEcCchhheeeeeeccCCCcccceeecchhhhccccccCcccEEEEecCCCEEEEcCCCcEEEEec Q lcl|Aclame:pro 292 DGGIWVGQVDHVAFLDGADPASLSVSRRASRAPVPGSAVLVPAEVVGTNASPDGSPVAVWLAENGYVMGTSSGAIAEVHA 371 (396) Q Consensus 292 ~~gl~V~T~~~~y~l~G~~p~~m~~~~~~~~~p~~~s~~~~~~~~~~~~~~~~~~~~~lw~s~~Glv~g~~~G~~~~lt~ 371 (396) +++|||+|+++||+++|++|++|+++||...|| |+++|++++++.+++|+|+||||+++++|++.++|+ T Consensus 325 gt~LVV~TkG~PYl~sG~sP~sms~~kL~~~qp-----------CvS~rsiV~~~g~v~Yas~dGLv~i~a~G~a~vvT~ 393 (567) T protein:vir:82 325 RTSLVVATKGEPYLFSGVSPSTISGSKIPSMQA-----------CLSRRSMVAMEGFVLYAGTNGLVSVDANGNVALATE 393 (567) T ss_pred ccEEEEEEcCceEEEEcCChhhccccccccccc-----------cccccceeeecceEEeecCCcEEEEecCCchhhhhh Confidence 999999999999999999999999999998887 999999999999999999999999999999999999 Q ss_pred ceeecccccc------ceEEEeCcEEEEEeC Q lcl|Aclame:pro 372 GVLAGITGRA------GTSVVFDRRLLTAVS 396 (396) Q Consensus 372 ~~~~~~~a~~------~~~~~~~rr~v~~~~ 396 (396) +.|..+.+++ -.+-..+-||++.-. T Consensus 394 ~l~t~~qW~a~~~P~ti~A~~~eG~Y~a~Y~ 424 (567) T protein:vir:82 394 QIVSPEQWQSQFNPASIVAYPWRGEYIACYT 424 (567) T ss_pred hccChHHHHhcCCcceEEEEeecCeEEEEEe Confidence 9997654432 344455556666655 No 8 >protein:vir:104388 Length: 566 # NCBI annotation: hypothetical protein # Family: family:all:1544 # MgeID: mge:1471 # MgeName: 86 # Cross-refs: genbank:acc:YP_794072;genbank:gi:116222017;genbank:GeneID:4397450 Probab=100.00 E-value=7.7e-94 Score=531.07 Aligned_cols=363 Identities=15% Similarity=0.139 Sum_probs=300.5 Q ss_pred CCcccccceeccCCcCChhheeeCCCchhhheeeeeeeeeCCCccEEECCcceeecCCccccccCCcccccEEEEECCeE Q lcl|Aclame:pro 1 MATTSLVPLAGINNVAEDAALQRGGESPRLYVRDAVNIDLSPAGKAQLRASVRQVTDQPFRQLWQSPLHGDAFGALGDQW 80 (396) Q Consensus 1 m~~~~~~p~~G~nn~~~~~~L~~~~~~~~~~lrdAvNVD~~~~G~l~~R~G~~~~~~~~~~~lw~s~~~~~~~~~~dg~L 80 (396) |+-++|.-|.|+-++..++=| |-.+..-|.|+-+ +.|.|....--..+ ..+... ..+.+|...|+.| T Consensus 26 M~~i~i~~f~Ge~Pr~~p~lL------P~~~a~~A~n~~~-~~G~itP~~~~~~~-----~~~~~~-~~kTif~y~~~~W 92 (566) T protein:vir:10 26 MPYIDITTMRGMMPRVVTSML------PDHSAVLAEDCHF-RFGVITPERQISGV-----EKTFTI-KPKTIFHYRDDFW 92 (566) T ss_pred eeEEeecccccccccchhhhc------cccccceEEeeee-cCCeeeeeeccccc-----cccccc-CceeeeeecCcEe Confidence 999999999999999999888 5667778999999 55877654322111 111111 1245666666655 Q ss_pred EEEecCCCceeecccccCcceehhhcCCeEEEEcCCcceeecC-----------ceeeeccccCCcccee---ecCC--- Q lcl|Aclame:pro 81 GKVDPHSWTFEPLAQIGEGDLSHEVLNNRVCVAGTAGIFTYDG-----------AQAERLTLDTPAPPLL---VAGA--- 143 (396) Q Consensus 81 ~~i~~~~w~~~vl~~ig~gpV~~~v~n~rvy~t~~~~~~~~~g-----------~~~~~l~ip~Pa~p~~---~~~~--- 143 (396) + .|+..| ++.++||+.|.. +||||+|++.||++++ +..|+|+||+|+.+++ .++. T Consensus 93 ~-----~w~~~V--~~ir~PvAqD~~-~rvY~tg~~~Pk~t~~diAt~g~~~~pa~~y~LgVPaPs~apv~~~~~~sg~~ 164 (566) T protein:vir:10 93 F-----AWPDVV--DVIRSPVAQDNY-GRIYYTDGKFPKVTAAEIATKGEGNFPAASYRLGIPAPTTAPVCTVQKGEGAT 164 (566) T ss_pred E-----EeCCce--eeccCccccCCc-ceEEEeeCCcceeeecceeeccccccccccccccCCCCcccceeeccCCCccc Confidence 5 588877 557899998644 8999999999999875 6778999999975432 2223 Q ss_pred --CCcccceEEEEEEEEcCCCcccccccceeEecCCC-c--cEEEee--cCCCCCcceEEEEEEecC--CCeEEEEEeec Q lcl|Aclame:pro 144 --GSLSQGTYGAAVAWLRGPQESAPSLIAFAEVTDAG-A--LEVTFP--LCLDASVTGARLYLTRAN--GGELLLAGDYP 214 (396) Q Consensus 144 --Gsl~~g~y~ya~T~V~~~gEeg~~~~~S~~vt~~~-~--~~v~lp--~~~~~~i~~~RIYrs~~~--g~~~~lv~e~~ 214 (396) +.+++|+|.|++|||+.+||||+||++|.++++.. + ++|++. +...++|+++|||||.++ |++|+|++|++ T Consensus 165 ~~~~~d~~tr~Yv~TfVt~~GeES~PS~~S~~v~v~~~gs~V~ltl~~~p~~~~~i~~~RIYRS~tg~~gtdy~lVael~ 244 (566) T protein:vir:10 165 DENPNDDETRFYTETFVSAYGEEGPPGPESLEVTVGIPDTPVQLTLSPVPLQDANINRRRIYRSVSGGGEADFLLVAELE 244 (566) T ss_pred CCCCcccceeEEEEEEEcCCCCcCCCccccceeEecCCCceEEEEecCCCcCcCCceeEEEEEecCCCCceeEEEEeeec Confidence 44889999999999999999999999998888743 3 455543 346778999999998865 56999999999 Q ss_pred ceeEEEEcCCc-hhhcccccchhcCCcCCC--ceeeccCCEEEEEECCEEEEccCCCCcccccccccEecCcceEEEEEc Q lcl|Aclame:pro 215 LGAATVILPTL-PELGRPAQFRHLSPMPTG--KHLAYWRGRLLIARANVLRFSEALAYHLHDERYGFVQMPQRITFVQPV 291 (396) Q Consensus 215 ~~~~~~~d~~~-~~lg~~l~t~~~~ppP~g--~~~~~~nGrl~~a~Gn~l~fSEp~~p~aw~~~y~~~~~~~~I~~i~~v 291 (396) +++.+|+|+.. ++|+++|+|.+|.+||++ ++|+|+||||++|.||+|||||||+|||||++| ++++++||++|++. T Consensus 245 as~~sf~Dd~~~~~lg~~Lps~~w~~PP~~m~GL~~m~NGimAgF~GneV~FsEpylPyAWP~~Y-r~t~~~dIVaiA~~ 323 (566) T protein:vir:10 245 ASVLSYTDNIPAKNLGPSLATWDYLPPPENMTGLCLMANGIAAGFAGNEVMFSEAYLPYAWPEVN-RHTTAEDIVAVCPL 323 (566) T ss_pred ccceeeeccccccccCcccccccccCcCcccceeeecccceEEeecCCEEEEecCCCCcccchhh-ccCCCCCeEEEEec Confidence 99999999964 578999999999999998 599999999999999999999999999999999 69999999999999 Q ss_pred CCcEEEEEcCcEEEEEcCchhheeeeeeccCCCcccceeecchhhhccccccCcccEEEEecCCCEEEEcCCCcEEEEec Q lcl|Aclame:pro 292 DGGIWVGQVDHVAFLDGADPASLSVSRRASRAPVPGSAVLVPAEVVGTNASPDGSPVAVWLAENGYVMGTSSGAIAEVHA 371 (396) Q Consensus 292 ~~gl~V~T~~~~y~l~G~~p~~m~~~~~~~~~p~~~s~~~~~~~~~~~~~~~~~~~~~lw~s~~Glv~g~~~G~~~~lt~ 371 (396) +++|||+|+|+||+++|++|++|+++||...|| |+++|++++++.+++|+|+||||+++++|++.++|+ T Consensus 324 gt~LVV~TkG~PYl~sG~sP~sms~~kL~~~qa-----------CvS~rsiV~~~g~v~Yas~dGLv~v~a~g~a~vvT~ 392 (566) T protein:vir:10 324 GTSLVVATKGEPYLFSGVSPSTISGSKIPSMQA-----------CLSRQSMVAMEGFVLYAGTNGLVSVDANGNAALATE 392 (566) T ss_pred cceEEEEEcCceEEEEcCChhhccccccccccc-----------cccccceeeecceEEeecCCceEEEecCCChhhhhh Confidence 999999999999999999999999999998887 999999999999999999999999999999999999 Q ss_pred ceeecccccc------ceEEEeCcEEEEEeC Q lcl|Aclame:pro 372 GVLAGITGRA------GTSVVFDRRLLTAVS 396 (396) Q Consensus 372 ~~~~~~~a~~------~~~~~~~rr~v~~~~ 396 (396) +.|..+.+++ -.+-..+-||++.-+ T Consensus 393 ~l~t~~qW~~~~~P~ti~A~~~eG~Y~a~Y~ 423 (566) T protein:vir:10 393 QIISPEQWQTQFNPASIVAYPWRGEYIACYT 423 (566) T ss_pred hhcChhHHHhcCCcceEEEEeecCeEEEEEe Confidence 9997654432 234444455555544 No 9 >protein:vir:93631 Length: 580 # NCBI annotation: Bcep22gp67 # Family: family:all:1544 # MgeID: mge:1470 # MgeName: Bcep22 # Cross-refs: genbank:acc:NP_944296;genbank:gi:38640373;genbank:GeneID:2658280 Probab=100.00 E-value=4.4e-92 Score=521.44 Aligned_cols=360 Identities=18% Similarity=0.210 Sum_probs=300.7 Q ss_pred CCcccccceeccCCcCChhheeeCCCchhhheeeeeeeeeCCCccEEECCcceeecCCccccccCCcccc-cEEEEECCe Q lcl|Aclame:pro 1 MATTSLVPLAGINNVAEDAALQRGGESPRLYVRDAVNIDLSPAGKAQLRASVRQVTDQPFRQLWQSPLHG-DAFGALGDQ 79 (396) Q Consensus 1 m~~~~~~p~~G~nn~~~~~~L~~~~~~~~~~lrdAvNVD~~~~G~l~~R~G~~~~~~~~~~~lw~s~~~~-~~~~~~dg~ 79 (396) |+-|+|.-|.|.-++..++=| |-.+..-|.|+-+ +.|.|....-=..+.+ +.+-|... +.++-+|+. T Consensus 1 M~~i~i~~f~Ge~Prl~p~lL------P~~~a~~a~n~~~-~~G~i~P~~~~~~~~~-----~~~i~~~~~~t~~~~~~~ 68 (580) T protein:vir:93 1 MTIIKITGFSGEIPRLVPRLL------PDTAAQNATNARL-ESGGLTPYRKPKFITR-----ISTIPAGQIETIYRNGET 68 (580) T ss_pred CeeEeecccccccccchhhhc------cccccceEEeeec-cCCeeeeeeCchhhcc-----ccccCcCcceEEEecCce Confidence 999999999999999999888 5667778999999 5688765432111111 11223322 345556665 Q ss_pred EEEEecCCCceeecccccCcceehhhcCCeEEEEcCCcceeecCceeeeccccCCcccee--ecCCCCcccceEEEEEEE Q lcl|Aclame:pro 80 WGKVDPHSWTFEPLAQIGEGDLSHEVLNNRVCVAGTAGIFTYDGAQAERLTLDTPAPPLL--VAGAGSLSQGTYGAAVAW 157 (396) Q Consensus 80 L~~i~~~~w~~~vl~~ig~gpV~~~v~n~rvy~t~~~~~~~~~g~~~~~l~ip~Pa~p~~--~~~~Gsl~~g~y~ya~T~ 157 (396) |+ .|+..| +..++||+. |||||||++.|++..++..|+|++|.|+.+++ ..++|+++.|+|.|++|| T Consensus 69 W~-----~w~~~V--~~i~~PvA~----DRvy~Td~g~Pkvt~~g~sy~lgVpaPs~Apt~~~~g~g~l~~~~y~Yv~Tf 137 (580) T protein:vir:93 69 WM-----AWDKPV--YAAPGPVAA----DRLYVMGDGAPKMIVGGTTYPLAVPMPSAALTAATSGTGTGDVFSRVYVYTF 137 (580) T ss_pred eE-----EeCCce--eeecCcccc----ceeEEcCCcccceecCCccccccCCCcccCceeeecCCCCcCccceEEEEEE Confidence 55 588877 557899886 49999999999999999999999999976554 357889999999999999 Q ss_pred EcCCCcccccccceeEecCCCccEEEee----cCCCCCcceEEEEEEecC--CCeEEEEEeecceeEEEEcCCc-hhhcc Q lcl|Aclame:pro 158 LRGPQESAPSLIAFAEVTDAGALEVTFP----LCLDASVTGARLYLTRAN--GGELLLAGDYPLGAATVILPTL-PELGR 230 (396) Q Consensus 158 V~~~gEeg~~~~~S~~vt~~~~~~v~lp----~~~~~~i~~~RIYrs~~~--g~~~~lv~e~~~~~~~~~d~~~-~~lg~ 230 (396) |+.+||||+||++|..++++.+..|+|. +..+++|+++|||||.++ |++|||++|+++++.+|+|+.. .+|++ T Consensus 138 Vt~~GeES~PS~~S~~vtv~~g~tVtLs~~p~p~~~~~i~~~RIYRS~tG~~gtdy~lVAel~Ag~~sF~Dd~s~a~Lge 217 (580) T protein:vir:93 138 VTGFGEESEPSAISNEVNWQAGQTVTLSGFQAAPAGRNITKQRIYRSQTSLSGTDLYFIAERDASAANFVDNVPLSDQNE 217 (580) T ss_pred EcCCCCcCCCcccccceeeCCCCeEEEEecCCCCCCCccceEEEEEeccCCCceeEEEEeeeccceeeeeeccccccccc Confidence 9999999999999998887655444442 244667999999998776 5699999999999999999864 57899 Q ss_pred cccchhcCCcCCC--ceeeccCCEEEEEECCEEEEccCCCCcccccccccEecCcceEEEEEcCCcEEEEEcCcEEEEEc Q lcl|Aclame:pro 231 PAQFRHLSPMPTG--KHLAYWRGRLLIARANVLRFSEALAYHLHDERYGFVQMPQRITFVQPVDGGIWVGQVDHVAFLDG 308 (396) Q Consensus 231 ~l~t~~~~ppP~g--~~~~~~nGrl~~a~Gn~l~fSEp~~p~aw~~~y~~~~~~~~I~~i~~v~~gl~V~T~~~~y~l~G 308 (396) +|+|.+|.+||+. ++|+||||||++|.||+|||||||+|||||++| +++++++|++|+|++++|||+|+++|||++| T Consensus 218 ~Lps~~~~~PP~~m~gL~~m~nGi~agF~Gnev~fsEpy~P~AWP~~y-r~t~~~~Ivaia~~g~~LvV~T~g~pyl~~G 296 (580) T protein:vir:93 218 PLPSLEWNAPPDDLTGLISLPNGMMAAFRGKELWLCEPWRPHAWPQKY-VLTMDYNIVALGAYGTTIVVATDGQPYIVSG 296 (580) T ss_pred ccchhhccCcCCCcceEEeeccceEEEEeCCEEEEecCCCCccchhhc-CCCCCCCceeEeeeCceEEEEEcCceEEEEc Confidence 9999999999998 699999999999999999999999999999999 6999999999999999999999999999999 Q ss_pred CchhheeeeeeccCCCcccceeecchhhhccccccCcccEEEEecCCCEEEEcCCCcEEEEecceeeccccc-----cce Q lcl|Aclame:pro 309 ADPASLSVSRRASRAPVPGSAVLVPAEVVGTNASPDGSPVAVWLAENGYVMGTSSGAIAEVHAGVLAGITGR-----AGT 383 (396) Q Consensus 309 ~~p~~m~~~~~~~~~p~~~s~~~~~~~~~~~~~~~~~~~~~lw~s~~Glv~g~~~G~~~~lt~~~~~~~~a~-----~~~ 383 (396) ++|++|+++||+..|| |+++|++++++.+++|+|+||||+++++| +.++|++.|..+.++ .-. T Consensus 297 ~~P~~ms~~kL~~~q~-----------CvS~rsiV~~~~~v~Yas~dGLv~i~~~g-a~vvT~~l~t~~qW~~~~P~ti~ 364 (580) T protein:vir:93 297 ASPDAMSQEKLELNLP-----------CINARGLVDLGYAIAYPSHDGLVVASSSG-ARVVTDQLMTRNDWLKTAPGRFV 364 (580) T ss_pred cChhhccccccccccc-----------cccccceeecCceEEeecCCcEEEEeCCh-HHHHHhhccChhHHHhcCCceEE Confidence 9999999999998887 99999999999999999999999999999 799999999664322 233 Q ss_pred EEEeCcEEEEEeC Q lcl|Aclame:pro 384 SVVFDRRLLTAVS 396 (396) Q Consensus 384 ~~~~~rr~v~~~~ 396 (396) +-..+-||++.-+ T Consensus 365 a~~~eG~Y~a~Y~ 377 (580) T protein:vir:93 365 SGQFFGRYLASYE 377 (580) T ss_pred EEeecCeEEEEEc Confidence 4444555555543 No 10 >protein:vir:107423 Length: 681 # NCBI annotation: Bbp13 # Family: family:all:780 # MgeID: mge:1537 # MgeName: BPP-1 # Cross-refs: genbank:acc:NP_958682;genbank:gi:41179374;genbank:GeneID:2717217 Probab=98.79 E-value=3e-08 Score=61.84 Aligned_cols=371 Identities=11% Similarity=0.067 Sum_probs=169.5 Q ss_pred CCcccc--cce-eccCCcCChhheeeCCCc--hhhheeeeeeeeeCCCccEEECCcceeecCCcccc--cc------CCc Q lcl|Aclame:pro 1 MATTSL--VPL-AGINNVAEDAALQRGGES--PRLYVRDAVNIDLSPAGKAQLRASVRQVTDQPFRQ--LW------QSP 67 (396) Q Consensus 1 m~~~~~--~p~-~G~nn~~~~~~L~~~~~~--~~~~lrdAvNVD~~~~G~l~~R~G~~~~~~~~~~~--lw------~s~ 67 (396) |+++.+ .-| .|. +++. |.-+.|= =..-++.+-|+-+...|.+.||+|.+-+...+..+ +| +. T Consensus 1 m~~~~~~~~~f~~Ge--~~p~--l~~r~D~~~y~~~~~~~~N~~~~~~G~~~~R~g~~~~~~~~~~~~~~rlipf~~~~- 75 (681) T protein:vir:10 1 MSNVRVLQRSFGGGE--ISPE--MFGRIDDVKYQSGLAICRNFVVKPQGPAENRAGFAFVREVKDSAKKVRLIPFTYSV- 75 (681) T ss_pred CcceeEeeeecCCce--eeee--eccchhHHHHHHHHHHhcCcEEEecCCceecChhHhhhhcCCCCCcEEEEEEEeCC- Confidence 887544 223 232 1221 1111111 12357889999999999999999999877665433 11 11 Q ss_pred ccccEEEEECCeEEEEecCCCce-------eecccccC---cceehhhcCCeEEEEcCCcce----eecCcee--eeccc Q lcl|Aclame:pro 68 LHGDAFGALGDQWGKVDPHSWTF-------EPLAQIGE---GDLSHEVLNNRVCVAGTAGIF----TYDGAQA--ERLTL 131 (396) Q Consensus 68 ~~~~~~~~~dg~L~~i~~~~w~~-------~vl~~ig~---gpV~~~v~n~rvy~t~~~~~~----~~~g~~~--~~l~i 131 (396) .-.+++..-++.+ ++..++... ++---+.. --+.|.+..|.+|++....+- ++....+ ..+.+ T Consensus 76 ~~~~~l~~g~~~~-r~~~~~~~~~~~~~~~~~~tpy~~~~l~~l~~~q~aD~~~i~h~~~~p~~L~r~~~~~W~l~~~~f 154 (681) T protein:vir:10 76 TQTMVIELGAGYF-RFHTNGGTLLDGAVPYEIANPYAEADLFNIHYVQSADVLTLVHPNYAPRELRRLGATNWQLATIAF 154 (681) T ss_pred CceEEEEEeCCeE-EEEeCCcEEeeCcEeEEecCCCChhhhcCceEEEEcCEEEEECCCCcceEEEEccCCceEEEEEEe Confidence 1123332223333 343332211 11111221 346777788999999976532 2222211 12222 Q ss_pred c-CCccceeecCC--CCcccceEEEEEEEEcCCC-cccccccceeEecCCC---ccEEEeecCCCCCcceEEEEEEecCC Q lcl|Aclame:pro 132 D-TPAPPLLVAGA--GSLSQGTYGAAVAWLRGPQ-ESAPSLIAFAEVTDAG---ALEVTFPLCLDASVTGARLYLTRANG 204 (396) Q Consensus 132 p-~Pa~p~~~~~~--Gsl~~g~y~ya~T~V~~~g-Eeg~~~~~S~~vt~~~---~~~v~lp~~~~~~i~~~RIYrs~~~g 204 (396) . .|+.|...... ..-..-++.|.++-++..+ .|+.+.. .+.+++.. +-..++.........+.|||+-. + T Consensus 155 ~~~p~~p~~~~at~~~~~~~~t~~~~v~avda~t~~~s~~~~-~~tvt~~~~~~~~~~t~~w~a~~g~~~~~V~~~~--~ 231 (681) T protein:vir:10 155 TSPVATPTSVTATSNNKGTDYTYRYVVTALDAEGKTESAPSS-AGTCTNNLFTNGGANTIAWSASSGASRYNVYKEQ--G 231 (681) T ss_pred ccccccceeeeeeccCCccceeEeEEEEEeecccceeecCCc-ceEEeeeeecCCcceeEEEEecCCceeeeecccc--e Confidence 2 33333322221 1112235667777776543 3443322 12222211 11112222222233456677432 2 Q ss_pred CeEEEEEeecceeEEEEcCCchhhcccccchhcCCcCC-C---ceeeccCCEEEEE----ECCEEEEccCCCCccccccc Q lcl|Aclame:pro 205 GELLLAGDYPLGAATVILPTLPELGRPAQFRHLSPMPT-G---KHLAYWRGRLLIA----RANVLRFSEALAYHLHDERY 276 (396) Q Consensus 205 ~~~~lv~e~~~~~~~~~d~~~~~lg~~l~t~~~~ppP~-g---~~~~~~nGrl~~a----~Gn~l~fSEp~~p~aw~~~y 276 (396) +-.-+++ ...++ .+.+.........-.-+...+-.+ + ..+.++++||..+ ..+.||+|.+..++-+..+- T Consensus 232 gi~g~ig-~~~~~-~~~~~~~~~~~~~t~~~~~~~~~~~~gyP~~v~f~q~RL~f~~~~~~p~~v~~Srsgdy~nF~~~~ 309 (681) T protein:vir:10 232 GLYGYIG-QTTGT-SLVDDNIAPDLSVTPPIYDAVFNAAGDYPAAVSYFEQRRCFAGTTNKPQNIWMTRSGTESAMSYSL 309 (681) T ss_pred eEEEEee-cccee-eeeecccccCccccccccccccccCCCceEEEEEEcceEEEeeCCCCCcEEEEEcccCcccccccC Confidence 2111221 11222 223322211111111111111111 1 3588999999998 56789999999988874221 Q ss_pred -----ccE--ec----CcceEEEEEcCCcEEEEEcCcEEEEEc-----CchhheeeeeeccCCCcccceeecchhhhccc Q lcl|Aclame:pro 277 -----GFV--QM----PQRITFVQPVDGGIWVGQVDHVAFLDG-----ADPASLSVSRRASRAPVPGSAVLVPAEVVGTN 340 (396) Q Consensus 277 -----~~~--~~----~~~I~~i~~v~~gl~V~T~~~~y~l~G-----~~p~~m~~~~~~~~~p~~~s~~~~~~~~~~~~ 340 (396) +-+ ++ ...|.-+.+++ .|+|+|.+.-|++++ .+|++.++.+.+... ++. - T Consensus 310 ~~~ddD~i~~~~~~~~~~~i~~~v~~~-~lli~t~~~e~~l~~~~~~~lTP~~~~~~~~s~~g---~~~----------~ 375 (681) T protein:vir:10 310 PVRDDDRVAFRVAAREANAIRHIVPLT-ELLLLTSSGEWRVASVNSDAVTPTTISVRPQSYVG---ATD----------V 375 (681) T ss_pred CCCCCccEEEEEcCCcceeEEEEEecC-cEEEEEcCcEEEEecCCCccccceeEEEEEeeeec---ccc----------c Confidence 111 12 33478888985 599999999999998 455565555544322 111 1 Q ss_pred cccCcccEEEEecCCC-----EEEEcCCCc--EEEEe---cceeeccccccceEEEeCcE-EEEEeC Q lcl|Aclame:pro 341 ASPDGSPVAVWLAENG-----YVMGTSSGA--IAEVH---AGVLAGITGRAGTSVVFDRR-LLTAVS 396 (396) Q Consensus 341 ~~~~~~~~~lw~s~~G-----lv~g~~~G~--~~~lt---~~~~~~~~a~~~~~~~~~rr-~v~~~~ 396 (396) .....+..++|++++| +.--..++. ...+| +-.+..+.-..-+...-++. +.++++ T Consensus 376 ~Pv~vg~~v~fv~~~g~~vre~~y~~~~d~~~~~dlt~~a~Hl~~~~~i~~~a~~~~p~~~~~~v~~ 442 (681) T protein:vir:10 376 QPVVVNNTTIYGAARGGHVRELAYNWQANGFVTGDLSLRAAHLFDNLDILDMAYAKAPQPIVWFISS 442 (681) T ss_pred cceeeCCeEEEEecCCCEEEEEEEeeecCceeccchhhhhhhhcCCCCeEEEEEecCCCEEEEEEec Confidence 1234456899999998 222222221 11222 22222211011111111122 122222 No 11 >protein:vir:98487 Length: 681 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:780 # MgeID: mge:1592 # MgeName: BMP-1 # Cross-refs: genbank:acc:NP_996575;genbank:gi:45569506;genbank:GeneID:2767815 Probab=98.79 E-value=3e-08 Score=61.84 Aligned_cols=371 Identities=11% Similarity=0.067 Sum_probs=169.5 Q ss_pred CCcccc--cce-eccCCcCChhheeeCCCc--hhhheeeeeeeeeCCCccEEECCcceeecCCcccc--cc------CCc Q lcl|Aclame:pro 1 MATTSL--VPL-AGINNVAEDAALQRGGES--PRLYVRDAVNIDLSPAGKAQLRASVRQVTDQPFRQ--LW------QSP 67 (396) Q Consensus 1 m~~~~~--~p~-~G~nn~~~~~~L~~~~~~--~~~~lrdAvNVD~~~~G~l~~R~G~~~~~~~~~~~--lw------~s~ 67 (396) |+++.+ .-| .|. +++. |.-+.|= =..-++.+-|+-+...|.+.||+|.+-+...+..+ +| +. T Consensus 1 m~~~~~~~~~f~~Ge--~~p~--l~~r~D~~~y~~~~~~~~N~~~~~~G~~~~R~g~~~~~~~~~~~~~~rlipf~~~~- 75 (681) T protein:vir:98 1 MSNVRVLQRSFGGGE--ISPE--MFGRIDDVKYQSGLAICRNFVVKPQGPAENRAGFAFVREVKDSAKKVRLIPFTYSV- 75 (681) T ss_pred CcceeEeeeecCCce--eeee--eccchhHHHHHHHHHHhcCcEEEecCCceecChhHhhhhcCCCCCcEEEEEEEeCC- Confidence 887544 223 232 1221 1111111 12357889999999999999999999877665433 11 11 Q ss_pred ccccEEEEECCeEEEEecCCCce-------eecccccC---cceehhhcCCeEEEEcCCcce----eecCcee--eeccc Q lcl|Aclame:pro 68 LHGDAFGALGDQWGKVDPHSWTF-------EPLAQIGE---GDLSHEVLNNRVCVAGTAGIF----TYDGAQA--ERLTL 131 (396) Q Consensus 68 ~~~~~~~~~dg~L~~i~~~~w~~-------~vl~~ig~---gpV~~~v~n~rvy~t~~~~~~----~~~g~~~--~~l~i 131 (396) .-.+++..-++.+ ++..++... ++---+.. --+.|.+..|.+|++....+- ++....+ ..+.+ T Consensus 76 ~~~~~l~~g~~~~-r~~~~~~~~~~~~~~~~~~tpy~~~~l~~l~~~q~aD~~~i~h~~~~p~~L~r~~~~~W~l~~~~f 154 (681) T protein:vir:98 76 TQTMVIELGAGYF-RFHTNGGTLLDGAVPYEIANPYAEADLFNIHYVQSADVLTLVHPNYAPRELRRLGATNWQLATIAF 154 (681) T ss_pred CceEEEEEeCCeE-EEEeCCcEEeeCcEeEEecCCCChhhhcCceEEEEcCEEEEECCCCcceEEEEccCCceEEEEEEe Confidence 1123332223333 343332211 11111221 346777788999999976532 2222211 12222 Q ss_pred c-CCccceeecCC--CCcccceEEEEEEEEcCCC-cccccccceeEecCCC---ccEEEeecCCCCCcceEEEEEEecCC Q lcl|Aclame:pro 132 D-TPAPPLLVAGA--GSLSQGTYGAAVAWLRGPQ-ESAPSLIAFAEVTDAG---ALEVTFPLCLDASVTGARLYLTRANG 204 (396) Q Consensus 132 p-~Pa~p~~~~~~--Gsl~~g~y~ya~T~V~~~g-Eeg~~~~~S~~vt~~~---~~~v~lp~~~~~~i~~~RIYrs~~~g 204 (396) . .|+.|...... ..-..-++.|.++-++..+ .|+.+.. .+.+++.. +-..++.........+.|||+-. + T Consensus 155 ~~~p~~p~~~~at~~~~~~~~t~~~~v~avda~t~~~s~~~~-~~tvt~~~~~~~~~~t~~w~a~~g~~~~~V~~~~--~ 231 (681) T protein:vir:98 155 TSPVATPTSVTATSNNKGTDYTYRYVVTALDAEGKTESAPSS-AGTCTNNLFTNGGANTIAWSASSGASRYNVYKEQ--G 231 (681) T ss_pred ccccccceeeeeeccCCccceeEeEEEEEeecccceeecCCc-ceEEeeeeecCCcceeEEEEecCCceeeeecccc--e Confidence 2 33333322221 1112235667777776543 3443322 12222211 11112222222233456677432 2 Q ss_pred CeEEEEEeecceeEEEEcCCchhhcccccchhcCCcCC-C---ceeeccCCEEEEE----ECCEEEEccCCCCccccccc Q lcl|Aclame:pro 205 GELLLAGDYPLGAATVILPTLPELGRPAQFRHLSPMPT-G---KHLAYWRGRLLIA----RANVLRFSEALAYHLHDERY 276 (396) Q Consensus 205 ~~~~lv~e~~~~~~~~~d~~~~~lg~~l~t~~~~ppP~-g---~~~~~~nGrl~~a----~Gn~l~fSEp~~p~aw~~~y 276 (396) +-.-+++ ...++ .+.+.........-.-+...+-.+ + ..+.++++||..+ ..+.||+|.+..++-+..+- T Consensus 232 gi~g~ig-~~~~~-~~~~~~~~~~~~~t~~~~~~~~~~~~gyP~~v~f~q~RL~f~~~~~~p~~v~~Srsgdy~nF~~~~ 309 (681) T protein:vir:98 232 GLYGYIG-QTTGT-SLVDDNIAPDLSVTPPIYDAVFNAAGDYPAAVSYFEQRRCFAGTTNKPQNIWMTRSGTESAMSYSL 309 (681) T ss_pred eEEEEee-cccee-eeeecccccCccccccccccccccCCCceEEEEEEcceEEEeeCCCCCcEEEEEcccCcccccccC Confidence 2111221 11222 223322211111111111111111 1 3588999999998 56789999999988874221 Q ss_pred -----ccE--ec----CcceEEEEEcCCcEEEEEcCcEEEEEc-----CchhheeeeeeccCCCcccceeecchhhhccc Q lcl|Aclame:pro 277 -----GFV--QM----PQRITFVQPVDGGIWVGQVDHVAFLDG-----ADPASLSVSRRASRAPVPGSAVLVPAEVVGTN 340 (396) Q Consensus 277 -----~~~--~~----~~~I~~i~~v~~gl~V~T~~~~y~l~G-----~~p~~m~~~~~~~~~p~~~s~~~~~~~~~~~~ 340 (396) +-+ ++ ...|.-+.+++ .|+|+|.+.-|++++ .+|++.++.+.+... ++. - T Consensus 310 ~~~ddD~i~~~~~~~~~~~i~~~v~~~-~lli~t~~~e~~l~~~~~~~lTP~~~~~~~~s~~g---~~~----------~ 375 (681) T protein:vir:98 310 PVRDDDRVAFRVAAREANAIRHIVPLT-ELLLLTSSGEWRVASVNSDAVTPTTISVRPQSYVG---ATD----------V 375 (681) T ss_pred CCCCCccEEEEEcCCcceeEEEEEecC-cEEEEEcCcEEEEecCCCccccceeEEEEEeeeec---ccc----------c Confidence 111 12 33478888985 599999999999998 455565555544322 111 1 Q ss_pred cccCcccEEEEecCCC-----EEEEcCCCc--EEEEe---cceeeccccccceEEEeCcE-EEEEeC Q lcl|Aclame:pro 341 ASPDGSPVAVWLAENG-----YVMGTSSGA--IAEVH---AGVLAGITGRAGTSVVFDRR-LLTAVS 396 (396) Q Consensus 341 ~~~~~~~~~lw~s~~G-----lv~g~~~G~--~~~lt---~~~~~~~~a~~~~~~~~~rr-~v~~~~ 396 (396) .....+..++|++++| +.--..++. ...+| +-.+..+.-..-+...-++. +.++++ T Consensus 376 ~Pv~vg~~v~fv~~~g~~vre~~y~~~~d~~~~~dlt~~a~Hl~~~~~i~~~a~~~~p~~~~~~v~~ 442 (681) T protein:vir:98 376 QPVVVNNTTIYGAARGGHVRELAYNWQANGFVTGDLSLRAAHLFDNLDILDMAYAKAPQPIVWFISS 442 (681) T ss_pred cceeeCCeEEEEecCCCEEEEEEEeeecCceeccchhhhhhhhcCCCCeEEEEEecCCCEEEEEEec Confidence 1234456899999998 222222221 11222 22222211011111111122 122222 No 12 >protein:vir:107802 Length: 681 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:780 # MgeID: mge:1673 # MgeName: BIP-1 # Cross-refs: genbank:acc:NP_996623;genbank:gi:45580757;genbank:GeneID:2767878 Probab=98.79 E-value=3e-08 Score=61.84 Aligned_cols=371 Identities=11% Similarity=0.067 Sum_probs=169.5 Q ss_pred CCcccc--cce-eccCCcCChhheeeCCCc--hhhheeeeeeeeeCCCccEEECCcceeecCCcccc--cc------CCc Q lcl|Aclame:pro 1 MATTSL--VPL-AGINNVAEDAALQRGGES--PRLYVRDAVNIDLSPAGKAQLRASVRQVTDQPFRQ--LW------QSP 67 (396) Q Consensus 1 m~~~~~--~p~-~G~nn~~~~~~L~~~~~~--~~~~lrdAvNVD~~~~G~l~~R~G~~~~~~~~~~~--lw------~s~ 67 (396) |+++.+ .-| .|. +++. |.-+.|= =..-++.+-|+-+...|.+.||+|.+-+...+..+ +| +. T Consensus 1 m~~~~~~~~~f~~Ge--~~p~--l~~r~D~~~y~~~~~~~~N~~~~~~G~~~~R~g~~~~~~~~~~~~~~rlipf~~~~- 75 (681) T protein:vir:10 1 MSNVRVLQRSFGGGE--ISPE--MFGRIDDVKYQSGLAICRNFVVKPQGPAENRAGFAFVREVKDSAKKVRLIPFTYSV- 75 (681) T ss_pred CcceeEeeeecCCce--eeee--eccchhHHHHHHHHHHhcCcEEEecCCceecChhHhhhhcCCCCCcEEEEEEEeCC- Confidence 887544 223 232 1221 1111111 12357889999999999999999999877665433 11 11 Q ss_pred ccccEEEEECCeEEEEecCCCce-------eecccccC---cceehhhcCCeEEEEcCCcce----eecCcee--eeccc Q lcl|Aclame:pro 68 LHGDAFGALGDQWGKVDPHSWTF-------EPLAQIGE---GDLSHEVLNNRVCVAGTAGIF----TYDGAQA--ERLTL 131 (396) Q Consensus 68 ~~~~~~~~~dg~L~~i~~~~w~~-------~vl~~ig~---gpV~~~v~n~rvy~t~~~~~~----~~~g~~~--~~l~i 131 (396) .-.+++..-++.+ ++..++... ++---+.. --+.|.+..|.+|++....+- ++....+ ..+.+ T Consensus 76 ~~~~~l~~g~~~~-r~~~~~~~~~~~~~~~~~~tpy~~~~l~~l~~~q~aD~~~i~h~~~~p~~L~r~~~~~W~l~~~~f 154 (681) T protein:vir:10 76 TQTMVIELGAGYF-RFHTNGGTLLDGAVPYEIANPYAEADLFNIHYVQSADVLTLVHPNYAPRELRRLGATNWQLATIAF 154 (681) T ss_pred CceEEEEEeCCeE-EEEeCCcEEeeCcEeEEecCCCChhhhcCceEEEEcCEEEEECCCCcceEEEEccCCceEEEEEEe Confidence 1123332223333 343332211 11111221 346777788999999976532 2222211 12222 Q ss_pred c-CCccceeecCC--CCcccceEEEEEEEEcCCC-cccccccceeEecCCC---ccEEEeecCCCCCcceEEEEEEecCC Q lcl|Aclame:pro 132 D-TPAPPLLVAGA--GSLSQGTYGAAVAWLRGPQ-ESAPSLIAFAEVTDAG---ALEVTFPLCLDASVTGARLYLTRANG 204 (396) Q Consensus 132 p-~Pa~p~~~~~~--Gsl~~g~y~ya~T~V~~~g-Eeg~~~~~S~~vt~~~---~~~v~lp~~~~~~i~~~RIYrs~~~g 204 (396) . .|+.|...... ..-..-++.|.++-++..+ .|+.+.. .+.+++.. +-..++.........+.|||+-. + T Consensus 155 ~~~p~~p~~~~at~~~~~~~~t~~~~v~avda~t~~~s~~~~-~~tvt~~~~~~~~~~t~~w~a~~g~~~~~V~~~~--~ 231 (681) T protein:vir:10 155 TSPVATPTSVTATSNNKGTDYTYRYVVTALDAEGKTESAPSS-AGTCTNNLFTNGGANTIAWSASSGASRYNVYKEQ--G 231 (681) T ss_pred ccccccceeeeeeccCCccceeEeEEEEEeecccceeecCCc-ceEEeeeeecCCcceeEEEEecCCceeeeecccc--e Confidence 2 33333322221 1112235667777776543 3443322 12222211 11112222222233456677432 2 Q ss_pred CeEEEEEeecceeEEEEcCCchhhcccccchhcCCcCC-C---ceeeccCCEEEEE----ECCEEEEccCCCCccccccc Q lcl|Aclame:pro 205 GELLLAGDYPLGAATVILPTLPELGRPAQFRHLSPMPT-G---KHLAYWRGRLLIA----RANVLRFSEALAYHLHDERY 276 (396) Q Consensus 205 ~~~~lv~e~~~~~~~~~d~~~~~lg~~l~t~~~~ppP~-g---~~~~~~nGrl~~a----~Gn~l~fSEp~~p~aw~~~y 276 (396) +-.-+++ ...++ .+.+.........-.-+...+-.+ + ..+.++++||..+ ..+.||+|.+..++-+..+- T Consensus 232 gi~g~ig-~~~~~-~~~~~~~~~~~~~t~~~~~~~~~~~~gyP~~v~f~q~RL~f~~~~~~p~~v~~Srsgdy~nF~~~~ 309 (681) T protein:vir:10 232 GLYGYIG-QTTGT-SLVDDNIAPDLSVTPPIYDAVFNAAGDYPAAVSYFEQRRCFAGTTNKPQNIWMTRSGTESAMSYSL 309 (681) T ss_pred eEEEEee-cccee-eeeecccccCccccccccccccccCCCceEEEEEEcceEEEeeCCCCCcEEEEEcccCcccccccC Confidence 2111221 11222 223322211111111111111111 1 3588999999998 56789999999988874221 Q ss_pred -----ccE--ec----CcceEEEEEcCCcEEEEEcCcEEEEEc-----CchhheeeeeeccCCCcccceeecchhhhccc Q lcl|Aclame:pro 277 -----GFV--QM----PQRITFVQPVDGGIWVGQVDHVAFLDG-----ADPASLSVSRRASRAPVPGSAVLVPAEVVGTN 340 (396) Q Consensus 277 -----~~~--~~----~~~I~~i~~v~~gl~V~T~~~~y~l~G-----~~p~~m~~~~~~~~~p~~~s~~~~~~~~~~~~ 340 (396) +-+ ++ ...|.-+.+++ .|+|+|.+.-|++++ .+|++.++.+.+... ++. - T Consensus 310 ~~~ddD~i~~~~~~~~~~~i~~~v~~~-~lli~t~~~e~~l~~~~~~~lTP~~~~~~~~s~~g---~~~----------~ 375 (681) T protein:vir:10 310 PVRDDDRVAFRVAAREANAIRHIVPLT-ELLLLTSSGEWRVASVNSDAVTPTTISVRPQSYVG---ATD----------V 375 (681) T ss_pred CCCCCccEEEEEcCCcceeEEEEEecC-cEEEEEcCcEEEEecCCCccccceeEEEEEeeeec---ccc----------c Confidence 111 12 33478888985 599999999999998 455565555544322 111 1 Q ss_pred cccCcccEEEEecCCC-----EEEEcCCCc--EEEEe---cceeeccccccceEEEeCcE-EEEEeC Q lcl|Aclame:pro 341 ASPDGSPVAVWLAENG-----YVMGTSSGA--IAEVH---AGVLAGITGRAGTSVVFDRR-LLTAVS 396 (396) Q Consensus 341 ~~~~~~~~~lw~s~~G-----lv~g~~~G~--~~~lt---~~~~~~~~a~~~~~~~~~rr-~v~~~~ 396 (396) .....+..++|++++| +.--..++. ...+| +-.+..+.-..-+...-++. +.++++ T Consensus 376 ~Pv~vg~~v~fv~~~g~~vre~~y~~~~d~~~~~dlt~~a~Hl~~~~~i~~~a~~~~p~~~~~~v~~ 442 (681) T protein:vir:10 376 QPVVVNNTTIYGAARGGHVRELAYNWQANGFVTGDLSLRAAHLFDNLDILDMAYAKAPQPIVWFISS 442 (681) T ss_pred cceeeCCeEEEEecCCCEEEEEEEeeecCceeccchhhhhhhhcCCCCeEEEEEecCCCEEEEEEec Confidence 1234456899999998 222222221 11222 22222211011111111122 122222 No 13 >protein:vir:99677 Length: 794 # NCBI annotation: Tail tubular protein B # Family: family:all:825 # MgeID: mge:1523 # MgeName: VP4 # Cross-refs: genbank:acc:YP_249591;genbank:gi:68299742;genbank:GeneID:3799992 Probab=98.74 E-value=3.7e-08 Score=61.30 Aligned_cols=376 Identities=11% Similarity=0.025 Sum_probs=159.7 Q ss_pred CCcc--cccce-eccCCcCChhheeeCCCchhhheeeeeeeeeCCCccEEECCcceeecCCccc-c-----ccC----Cc Q lcl|Aclame:pro 1 MATT--SLVPL-AGINNVAEDAALQRGGESPRLYVRDAVNIDLSPAGKAQLRASVRQVTDQPFR-Q-----LWQ----SP 67 (396) Q Consensus 1 m~~~--~~~p~-~G~nn~~~~~~L~~~~~~~~~~lrdAvNVD~~~~G~l~~R~G~~~~~~~~~~-~-----lw~----s~ 67 (396) |+-+ .++=| .||--+.+..++ ...+++++|+=++..|.++||+|.+.+..++.. . .|- +- T Consensus 1 M~~i~~s~~n~~~GvS~q~D~~ry-------~~q~~~~~N~~~~~~gG~~rRpG~~fv~~l~~~~~~~~~~~l~~f~~~~ 73 (794) T protein:vir:99 1 MALISQSIKNLKGGISQQPDILRY-------SDQGSKQINGFSSEVEGLQKRPPSVHIKRLTDQFGLGQKPYCHIINRDE 73 (794) T ss_pred CceeeeecchhhcceecCCchHHh-------hhhHhhhhcceeeeccCcccCCccceeeeecCCCCCccccEEEEEEeCC Confidence 8862 45556 777766666666 346899999999999999999999988654321 1 111 00 Q ss_pred ccccEEEEECCeEEEEecCCCcee-ec---------ccccCcceehhhcCCeEEEEcCCcceeecC-------------- Q lcl|Aclame:pro 68 LHGDAFGALGDQWGKVDPHSWTFE-PL---------AQIGEGDLSHEVLNNRVCVAGTAGIFTYDG-------------- 123 (396) Q Consensus 68 ~~~~~~~~~dg~L~~i~~~~w~~~-vl---------~~ig~gpV~~~v~n~rvy~t~~~~~~~~~g-------------- 123 (396) .-.+++..-++.+.=++..+..-. +. ..-....+.|.+..|.+|+++...+-.... T Consensus 74 ~~~y~l~f~~~~irv~~~~~g~~~~v~~~~~~~y~~~~~~~~~l~~~q~aD~~fi~n~~~~p~~~~~~~~~~~~~~~~~~ 153 (794) T protein:vir:99 74 VERYAVFFTGSNIRVFDLFTGDEKTVNAPNGLSYVSSSNPRKDLRMVTVADYTFILNRNVATAQGTTNTPSGLAPFGHFG 153 (794) T ss_pred CceEEEEEcCCeEEEEECCCCeEEEeeccccccccccCCccceeeEEEEccEEEEEcCCeeeeEeeeeccccCcCCCceE Confidence 112222222333322222222111 11 111123478888899999999765332110 Q ss_pred ------c---eeeeccccCCcc-ceeecCC--C--Ccccc-------------eEEEEEEEEcCCCccccccc-ceeEec Q lcl|Aclame:pro 124 ------A---QAERLTLDTPAP-PLLVAGA--G--SLSQG-------------TYGAAVAWLRGPQESAPSLI-AFAEVT 175 (396) Q Consensus 124 ------~---~~~~l~ip~Pa~-p~~~~~~--G--sl~~g-------------~y~ya~T~V~~~gEeg~~~~-~S~~vt 175 (396) + ..+.+.+..... +..++.+ + ..+.. .-.|.++-+..+.....++. ....++ T Consensus 154 ~~~v~~g~y~~~y~v~i~gs~ta~~~tp~~~~~~~~~~~s~~~ia~~l~~~l~~~g~~v~~~~g~~~i~~~~~~~v~t~s 233 (794) T protein:vir:99 154 LVVIRGGQYGRTYRIKVNGSVEASFETPLGDQVAHAKQIDIAYIIDQLAAGLINKGWAVTKGSGYFYFSKSGSVIINSLE 233 (794) T ss_pred EEEeccCCCCceEEEEecCCcccceeeccCcccccccccchhhhhhhhHhhhhcccceEEeCCeEEEEEecCCceeEEEE Confidence 0 001122211100 0000000 0 00000 00011111111111111111 111111 Q ss_pred --C------------CCccEEEeecCCCCCcceEEEEEEecC-CCeEEEEEeecceeE---------EEEcCC-ch-hhc Q lcl|Aclame:pro 176 --D------------AGALEVTFPLCLDASVTGARLYLTRAN-GGELLLAGDYPLGAA---------TVILPT-LP-ELG 229 (396) Q Consensus 176 --~------------~~~~~v~lp~~~~~~i~~~RIYrs~~~-g~~~~lv~e~~~~~~---------~~~d~~-~~-~lg 229 (396) . .-...-+||... ++...++|=-++.. .+++|+..+..-++= ...+.. .+ .+. T Consensus 234 ~~~g~~~t~~~~~~~~v~~~~~Lp~~~-~~G~~v~v~~~~~~~~~~y~v~~~~~~~~w~e~~~~~~~~~~~~~t~p~~~v 312 (794) T protein:vir:99 234 VEDGYNGQLAWGIINDVQKTTQLPVYA-PNNYIIRVSGDPTLNQDDYYVRFDASRNVWTECPAPNIKADYNKATMPHVLI 312 (794) T ss_pred eecCCCCceeeEEeeeccceeecccCC-CCCeEEEEeccCCCCCCceEEEEEcCCceEEeeccceeecceeccceEEEEe Confidence 1 101112344322 23333333222211 123444333221110 000110 00 000 Q ss_pred c------cccchhc----------CCcCC--C---ceeeccCCEEEEEECCEEEEccCCCCcccccc-c------ccEec Q lcl|Aclame:pro 230 R------PAQFRHL----------SPMPT--G---KHLAYWRGRLLIARANVLRFSEALAYHLHDER-Y------GFVQM 281 (396) Q Consensus 230 ~------~l~t~~~----------~ppP~--g---~~~~~~nGrl~~a~Gn~l~fSEp~~p~aw~~~-y------~~~~~ 281 (396) + .+....| .|.|. | .-+.|+++||..+.++.||+|....++-|-.. . +-+.+ T Consensus 313 ~~~~~~~~~~~~~w~~r~~Gd~~tnp~psf~g~~is~v~f~q~RL~f~~~~~v~~Srtgd~~nF~~~t~~~~~DdD~I~~ 392 (794) T protein:vir:99 313 READGTFTFKQADWTHRAAGDDETNPYPSFIGNSINDIFFFRNRLGFLSGENVILSGSGNYFNFFPESVAVLTDTDPIDV 392 (794) T ss_pred ccCCCceeEeeccccccccCCcccCCCccccCcceeEEEEEeeeEEEecCCeEEEEecCCccccccccccCCCCCccEEE Confidence 0 0111122 23443 3 24789999999999999999999888776222 1 11222 Q ss_pred ------CcceEEEEEcCCcEEEEEcCcEEEEEcCc---hhheeeeeeccCCCcccceeecchhhhccccccCcccEEEEe Q lcl|Aclame:pro 282 ------PQRITFVQPVDGGIWVGQVDHVAFLDGAD---PASLSVSRRASRAPVPGSAVLVPAEVVGTNASPDGSPVAVWL 352 (396) Q Consensus 282 ------~~~I~~i~~v~~gl~V~T~~~~y~l~G~~---p~~m~~~~~~~~~p~~~s~~~~~~~~~~~~~~~~~~~~~lw~ 352 (396) ...|.-+.++...|+++|++.-|.|+|.+ |++.+..+.+... |...-..+..+..++|+ T Consensus 393 ~~~~~~~~~i~~~v~~~~~L~l~t~~~q~~l~~~~~lTP~~~~~~~~s~~~------------~~~~~~Pv~vg~~v~f~ 460 (794) T protein:vir:99 393 AVSTNRISILKYAVPFSEELILWSDQAQFVLSSDGGLTPTTIRLDLTTEFE------------VTEQARPYGIGRGVYFV 460 (794) T ss_pred EecCCcceeeEEEeecCCcEEEEecCcEEEEeCCCcccceeEEEEEEEEee------------ccCCCCceEeCCeEEEE Confidence 34466688889999999999999999854 3443333332111 22222334556789999 Q ss_pred cCCCEE--------EEc-CCC-cEEEEe---cceeeccccccceE-------EE--eCcEEEEEeC Q lcl|Aclame:pro 353 AENGYV--------MGT-SSG-AIAEVH---AGVLAGITGRAGTS-------VV--FDRRLLTAVS 396 (396) Q Consensus 353 s~~Glv--------~g~-~~G-~~~~lt---~~~~~~~~a~~~~~-------~~--~~rr~v~~~~ 396 (396) +++|=. .-. .++ +...|| +..|....-...++ +| ++..-+..+. T Consensus 461 ~~~g~~~~v~r~~~~~~~~d~y~a~Dlt~~~~hl~~~~~~~~~a~~~~~~~~v~~~~~~g~l~~~~ 526 (794) T protein:vir:99 461 SPRAKFSSVRRFYAVQDVTQVKNAEDISAHVPYYVENGVFKMSGSSTENFLTILTEGNEQRVYFYK 526 (794) T ss_pred ecCCCeeEEEEeeeeccccCceehhhHHHHHHHhcCCCeEEEEEeCCCCcEEEEEEcCCCEEEEEE Confidence 999821 111 111 001111 01111100000000 00 1111111111 No 14 >protein:vir:8837 Length: 513 # NCBI annotation: constituent protein # Family: family:all:4957 # MgeID: mge:158 # MgeName: PaP3 # Cross-refs: genbank:acc:NP_775245;genbank:gi:27476043;genbank:GeneID:2700591 Probab=98.65 E-value=2.9e-08 Score=61.94 Aligned_cols=270 Identities=11% Similarity=0.009 Sum_probs=140.7 Q ss_pred CCcccccceeccCCcCChhheeeCCCchhhheeeeeeeeeCCCccEEECCcceeecCCccccccCCcccccEEE------ Q lcl|Aclame:pro 1 MATTSLVPLAGINNVAEDAALQRGGESPRLYVRDAVNIDLSPAGKAQLRASVRQVTDQPFRQLWQSPLHGDAFG------ 74 (396) Q Consensus 1 m~~~~~~p~~G~nn~~~~~~L~~~~~~~~~~lrdAvNVD~~~~G~l~~R~G~~~~~~~~~~~lw~s~~~~~~~~------ 74 (396) |-++++----|+=+-..|.-| |.-+--+|.||=+ .+|.+.+|+||..+++. +=-++...+.|. T Consensus 3 ~~~~~~~~~~g~~~d~~p~~l------p~~a~s~~~N~~~-~~~~~~~~~g~~pv~a~----~~~~~~g~~~~~~~g~~~ 71 (513) T protein:vir:88 3 LERQEVKNPTGIVTDIAPADL------PLDKWSFGNNVRF-KNGKAQKALGHSPIFDT----AQAPILDMFPFIRNNIPY 71 (513) T ss_pred cCChhhcccccceeccChhhc------CCCcceeeeeeeE-ecceeeecCccceeeec----CCCCceeeeeeecCCCeE Confidence 667666444444444555556 3445568999999 56999999999999543 111111222222 Q ss_pred ---EECCeEEEEecCCCceeecccc-c--CcceehhhcCCeEEEEcCCc-ceeecCceeeeccccCCccceeecCCCCcc Q lcl|Aclame:pro 75 ---ALGDQWGKVDPHSWTFEPLAQI-G--EGDLSHEVLNNRVCVAGTAG-IFTYDGAQAERLTLDTPAPPLLVAGAGSLS 147 (396) Q Consensus 75 ---~~dg~L~~i~~~~w~~~vl~~i-g--~gpV~~~v~n~rvy~t~~~~-~~~~~g~~~~~l~ip~Pa~p~~~~~~Gsl~ 147 (396) +....+++.|.-+|+-.....+ | .-++.+.+.+|+++.+|+.. +..+++. T Consensus 72 ~~~~~~~~~~~~~~~t~~dvs~~~~~~~~~~~w~~~~f~~~i~a~ng~~~~q~~~~~----------------------- 128 (513) T protein:vir:88 72 WLLCSEKRLYLADGTTIIDVSPGPYSASVTNRWSVGSFNGVIFANDGVNPPHHLPPT----------------------- 128 (513) T ss_pred EEEeeceEEEEecCceeeeccccceeecccCceeeeeecCEEEEEcCCCcceEEcCC----------------------- Confidence 2222233333333221111111 0 11334444444444444211 1122110 Q ss_pred cceEEEEEEEEcCCCcccccccceeEecCCCccEEEeecCCCCCcceEEEEEEecCCCeEEEEEeecceeEEEEcCCchh Q lcl|Aclame:pro 148 QGTYGAAVAWLRGPQESAPSLIAFAEVTDAGALEVTFPLCLDASVTGARLYLTRANGGELLLAGDYPLGAATVILPTLPE 227 (396) Q Consensus 148 ~g~y~ya~T~V~~~gEeg~~~~~S~~vt~~~~~~v~lp~~~~~~i~~~RIYrs~~~g~~~~lv~e~~~~~~~~~d~~~~~ 227 (396) ...+.| T Consensus 129 ---------------------------------------------------------------------s~~f~d----- 134 (513) T protein:vir:88 129 ---------------------------------------------------------------------ESVFRV----- 134 (513) T ss_pred ---------------------------------------------------------------------Cceeee----- Confidence 000000 Q ss_pred hcccccchhcCCcCCCceeeccCCEEEEEE--------CCEEEEccCCCC----cccccc---c--ccEec---CcceEE Q lcl|Aclame:pro 228 LGRPAQFRHLSPMPTGKHLAYWRGRLLIAR--------ANVLRFSEALAY----HLHDER---Y--GFVQM---PQRITF 287 (396) Q Consensus 228 lg~~l~t~~~~ppP~g~~~~~~nGrl~~a~--------Gn~l~fSEp~~p----~aw~~~---y--~~~~~---~~~I~~ 287 (396) |+ ++-+..--.+++.++++|+... -|.|++|...-| -.|+.. . +|+.+ ...|+. T Consensus 135 l~------g~p~~~~a~~i~v~~~flv~~~~t~~~~~~PnrV~wS~~~D~~~~P~~W~~t~~t~~a~~~~l~d~~g~~v~ 208 (513) T protein:vir:88 135 LP------NFPANTTFRRLKSFKNFLIGLNVTSNSIEMPQMVWWSTSADAGGVPASWDPTDPTKDAGQNTLADTNGAIVD 208 (513) T ss_pred cc------CCCcccceEEEEEEeeEEEEeecccCcCCCCceEEEecccCCcccccccccccccCcccccccCCCccceee Confidence 00 0000001133444444444432 367888887764 566421 1 23343 377999 Q ss_pred EEEcCCcEEEEEcCcEEEEE-cCchhheeeeeeccCCCcccceeecchhhhccccccCcccEEEEecCCCEEEEcCCCcE Q lcl|Aclame:pro 288 VQPVDGGIWVGQVDHVAFLD-GADPASLSVSRRASRAPVPGSAVLVPAEVVGTNASPDGSPVAVWLAENGYVMGTSSGAI 366 (396) Q Consensus 288 i~~v~~gl~V~T~~~~y~l~-G~~p~~m~~~~~~~~~p~~~s~~~~~~~~~~~~~~~~~~~~~lw~s~~Glv~g~~~G~~ 366 (396) ..+-++.++|.++...|.++ +-+|-.++++++... -+|+...+++..+..+.|++.+|+++. .+++. T Consensus 209 g~~~g~~liif~e~~i~~m~y~g~~~if~~~~i~~~-----------~G~~~p~SI~~~~~~~ffls~~Gf~~~-~G~~~ 276 (513) T protein:vir:88 209 GVKLRDSFIIYKEDSVYSMRYIGGLYIFQFQQLFND-----------VGILGPNCAIEFDGNHFVVGHGDVYVH-NGVQK 276 (513) T ss_pred eeecccceEEEecccEEEEEecCCCceEEEEeeccc-----------ccccCCceeEEECCeEEEEeCCceEEe-cCcee Confidence 99999999999999999998 666778888886443 358888888888999999999999976 45565 Q ss_pred EEEecceeec-----ccc-c--cceEEEeCc--E-EEEEeC Q lcl|Aclame:pro 367 AEVHAGVLAG-----ITG-R--AGTSVVFDR--R-LLTAVS 396 (396) Q Consensus 367 ~~lt~~~~~~-----~~a-~--~~~~~~~~r--r-~v~~~~ 396 (396) +.+-..++.- ... . ...+...++ + +++.-+ T Consensus 277 ~~Ig~ekVdk~f~~~~n~~~~~~~~~~~d~~~~~v~~~y~s 317 (513) T protein:vir:88 277 QSVIDAQVRKFFFSDINPDNYQRTFVLADHVNTEMWVCYSS 317 (513) T ss_pred eecccchhhhhhhccCCcccceEEEEEEcCcccEEEEEecC Confidence 5554333311 000 0 111122221 1 222222 No 15 >protein:vir:80253 Length: 777 # NCBI annotation: putative tail tubular protein B # Family: family:all:825 # MgeID: mge:1879 # MgeName: LKA1 # Cross-refs: genbank:acc:YP_001522886;genbank:gi:158345179;genbank:GeneID:5687516 Probab=98.56 E-value=2.8e-07 Score=56.55 Aligned_cols=375 Identities=13% Similarity=0.067 Sum_probs=157.2 Q ss_pred CCc--ccccce-eccCCcCChhheeeCCCchhhheeeeeeeeeCCCccEEECCcceeecCCcc----cccc-----CCcc Q lcl|Aclame:pro 1 MAT--TSLVPL-AGINNVAEDAALQRGGESPRLYVRDAVNIDLSPAGKAQLRASVRQVTDQPF----RQLW-----QSPL 68 (396) Q Consensus 1 m~~--~~~~p~-~G~nn~~~~~~L~~~~~~~~~~lrdAvNVD~~~~G~l~~R~G~~~~~~~~~----~~lw-----~s~~ 68 (396) |+. ..+.-| .||--+.+..++ ...+++++|+=++..|.+++|+|.+-+..+.. +.++ +... T Consensus 1 M~~i~~~~~nf~~GvS~q~D~~ry-------~~q~~~~~N~~~~~~gG~~rRpGt~fv~~l~~~~~~~~~~~~~~~~~~~ 73 (777) T protein:vir:80 1 MSYFAGSYRQLLFGVSQQTAKDRL-------EGQVESQLNMQSDLVTGPRRRSPVHLIADAMAATDANRLAYSLATFSGR 73 (777) T ss_pred CceeeeecchhhcccccCCchHHh-------hhHHhhhhcceeeeccCceeCcchHhhhhhcCCCcccceeEEEEecCCC Confidence 887 344555 777777777666 34689999999999999999999887654321 1111 0000 Q ss_pred c-ccEEEEECCeEEEEecCCCceeecc--cc----cCcceehhhcCCeEEEEcCCcceeec------------------- Q lcl|Aclame:pro 69 H-GDAFGALGDQWGKVDPHSWTFEPLA--QI----GEGDLSHEVLNNRVCVAGTAGIFTYD------------------- 122 (396) Q Consensus 69 ~-~~~~~~~dg~L~~i~~~~w~~~vl~--~i----g~gpV~~~v~n~rvy~t~~~~~~~~~------------------- 122 (396) . .+++...+|.|.-++.++.....-. .+ -+..+.|.+.+|.+|+++...+-... T Consensus 74 e~~~~l~~g~g~irv~~~~~g~~~~~~~~~Yl~a~~~~~l~~~q~aD~~fi~n~~~~p~~~~~~~~~~~~~~~~~~~~~v 153 (777) T protein:vir:80 74 EVLLLVDTLDGTLTILDDATGEVLFTGTNSYLTAGTGRSIRFAALDDSVFVANTEVIPQTQLWSGASAYPDPTRAGYLYV 153 (777) T ss_pred eeEEEEEecCCeEEEEECCCCeEEEecCCCceeeccccceeEEEEcCEEEEEeCCccceeeecccCCCccCcccceEEEe Confidence 0 1222333444544444443322211 11 12358888899999999965433210 Q ss_pred -Cc---eeeeccccCCcccee-ecCCCCc----ccceEEEE-------------------EEEEcCCCcccccccceeEe Q lcl|Aclame:pro 123 -GA---QAERLTLDTPAPPLL-VAGAGSL----SQGTYGAA-------------------VAWLRGPQESAPSLIAFAEV 174 (396) Q Consensus 123 -g~---~~~~l~ip~Pa~p~~-~~~~Gsl----~~g~y~ya-------------------~T~V~~~gEeg~~~~~S~~v 174 (396) ++ ..|.+.+........ +...++. ..-+..|. +++.....-.--..+....+ T Consensus 154 ~~~~~g~~y~i~i~~~~~~~~~t~~~~t~~~~~~~~~~~~ia~~L~~~~~~~~~~~s~~~~~~~~~g~~~~i~~~~~~~~ 233 (777) T protein:vir:80 154 VAGAFSKQYRLSITNQVTGVTTSVDVTTSATEASQATGEYVITQLRTAAEADATIGTAAGFAYYQDGAYLYVTAPEAIAV 233 (777) T ss_pred eccCCCceeeEeecCCcCceeEEEecCCcccccccccchhhhhhhhhhhccccceeecCceEEEeCCcEEEEEecCceeE Confidence 00 011111111100000 0000000 00000000 01110000000000000111 Q ss_pred cCCCc-c------------EEEeecCCCCCcceEEEEEEecCCCeEEEEEeeccee------------------------ Q lcl|Aclame:pro 175 TDAGA-L------------EVTFPLCLDASVTGARLYLTRANGGELLLAGDYPLGA------------------------ 217 (396) Q Consensus 175 t~~~~-~------------~v~lp~~~~~~i~~~RIYrs~~~g~~~~lv~e~~~~~------------------------ 217 (396) +..++ - ..++|...+... .+.|-.++..++.+|+.-+..-+. T Consensus 234 t~~~g~~~~~~~~~~~v~~~~~lp~~~~~~~-~~~~~~~~~~~~~~y~~~~~~~~~w~e~~~~~~~~~~~t~p~~l~~~~ 312 (777) T protein:vir:80 234 STDSGSNFLRASNAASIRDAAELPAKLPADA-DGFIIATGAAKNKTYFRWVDLERKWDEDASRGAQAELIDMPLRITYSA 312 (777) T ss_pred ecCCcCccceeeeeEEEeecccccccccccc-ceEEEeCCCCCCceEEEEEccCcEEEEeecccccccccccceEEEecC Confidence 11100 0 111222211111 122222333334444322111110 Q ss_pred EEEEcCCchhhcccccchhcCCcCC--C---ceeeccCCEEEEEECCEEEEccCCCCcccccccc-------cEec---- Q lcl|Aclame:pro 218 ATVILPTLPELGRPAQFRHLSPMPT--G---KHLAYWRGRLLIARANVLRFSEALAYHLHDERYG-------FVQM---- 281 (396) Q Consensus 218 ~~~~d~~~~~lg~~l~t~~~~ppP~--g---~~~~~~nGrl~~a~Gn~l~fSEp~~p~aw~~~y~-------~~~~---- 281 (396) .+|.....++..+.+=...-.|.|. | .-+.|+++||..+.++.||+|.+..++-|-..-. -+.+ T Consensus 313 ~~~~~~~~~w~~r~~gd~~tn~~Psf~g~~i~~v~f~q~RL~f~~~~~v~~Srtgd~~nF~~~s~~~~~DdDpI~~~~ss 392 (777) T protein:vir:80 313 PNFSLTALNYERRASGDATSNPALKFTEQGISGMTTMQGRLVLLAGEYVCMSASGNPLRWFRASVSTQSDDDPIEVAATA 392 (777) T ss_pred CceEeeccCCccccccccccCCCceecCCceeEEEEEcceeeeecCCeEEEEeccCccccccccccCCCCCccEEEEEcC Confidence 0010000111111110111124442 3 2479999999999999999999999887732211 1111 Q ss_pred --CcceEEEEEcCCcEEEEEcCcEEEEEcCc---hhheeeeeeccCCCcccceeecchhhhccccccCcccEEEEecCCC Q lcl|Aclame:pro 282 --PQRITFVQPVDGGIWVGQVDHVAFLDGAD---PASLSVSRRASRAPVPGSAVLVPAEVVGTNASPDGSPVAVWLAENG 356 (396) Q Consensus 282 --~~~I~~i~~v~~gl~V~T~~~~y~l~G~~---p~~m~~~~~~~~~p~~~s~~~~~~~~~~~~~~~~~~~~~lw~s~~G 356 (396) ...|.-+.++..+|+++|++.-|.|+|.+ |++.+..+.+..- |-..-..+..+..++|+++++ T Consensus 393 ~~~~~i~~~v~~~~~L~i~T~~~e~~l~~~~~lTP~~~~~~~~s~~~------------~~~~~~Pv~vG~~v~Fv~~r~ 460 (777) T protein:vir:80 393 PVASPYEYAVAFNKDLVLFAKTHQGLVPGANLLTSRNATAAVVTEYS------------FQNSCSPVVAGRTVFFASPRS 460 (777) T ss_pred CcceeeeeeeecCCcEEEEecCceEEEeCCCcccceeEEEEEEEeec------------cCCCCCceEeCCeEEEEecCC Confidence 33466688888999999999999999854 4444433332211 111222234556788887653 Q ss_pred -----EEE----EcCCCc--EEEEe-------cceeeccc--cccceEEE--eC-cEEEEEeC Q lcl|Aclame:pro 357 -----YVM----GTSSGA--IAEVH-------AGVLAGIT--GRAGTSVV--FD-RRLLTAVS 396 (396) Q Consensus 357 -----lv~----g~~~G~--~~~lt-------~~~~~~~~--a~~~~~~~--~~-rr~v~~~~ 396 (396) +.- ...++. ...|| ++.+.-.. ..-...++ ++ ++ +..+. T Consensus 461 g~~s~v~e~~~~~~~~d~y~a~Dlt~~~~hl~~~~v~~~a~s~~p~~v~~~~~~dg~-l~~~t 522 (777) T protein:vir:80 461 GPWSAVWEMLPSQYTDAQVEASDSTSHLPKYIAGPVRFLATSSTTSIVVVGTSNLRE-LVVHE 522 (777) T ss_pred CceeEEeeeeecccccCceehhHHHHHHHHhcCCceEEEEEcCCCceEEEEEcCCCe-EEEEE Confidence 211 111111 10111 11111100 00001111 11 12 11111 No 16 >protein:vir:2203 Length: 794 # NCBI annotation: tail tubular protein B # Family: family:all:825 # MgeID: mge:49 # MgeName: T7 # Cross-refs: genbank:acc:NP_042000;swissprot:sw:p03747;genbank:gi:9627472;uniprot:P03747;genbank:GeneID:1261024 Probab=98.56 E-value=2.9e-07 Score=56.44 Aligned_cols=375 Identities=12% Similarity=0.020 Sum_probs=157.0 Q ss_pred CCcc--cccce-eccCCcCChhheeeCCCchhhheeeeeeeeeCCCccEEECCcceeecCCcc-----ccccCCc----- Q lcl|Aclame:pro 1 MATT--SLVPL-AGINNVAEDAALQRGGESPRLYVRDAVNIDLSPAGKAQLRASVRQVTDQPF-----RQLWQSP----- 67 (396) Q Consensus 1 m~~~--~~~p~-~G~nn~~~~~~L~~~~~~~~~~lrdAvNVD~~~~G~l~~R~G~~~~~~~~~-----~~lw~s~----- 67 (396) |+-+ .++=| .||--+.+..++ ...+++++|.=++..|.++||+|.+.+....- ++.|-.+ T Consensus 1 M~~i~~s~~n~~~GvSqq~D~~ry-------~~q~~~~~N~~~~~~gG~~rRpG~~fv~~l~~~~~~~~~~~l~~~~~~~ 73 (794) T protein:vir:22 1 MALISQSIKNLKGGISQQPDILRY-------PDQGSRQVNGWSSETEGLQKRPPLVFLNTLGDNGALGQAPYIHLINRDE 73 (794) T ss_pred CceeeeecchhhcccccCCchHHh-------hhHHhhhhcceeeccCCceeCCchHhhhhhcccCCCCCccEEEEEEeCC Confidence 8862 45555 777766666666 34689999999999999999999988765321 1112111 Q ss_pred ccccEEEEECCeEEEEecCCCceeec---------ccccCcceehhhcCCeEEEEcCCcceeec---C------------ Q lcl|Aclame:pro 68 LHGDAFGALGDQWGKVDPHSWTFEPL---------AQIGEGDLSHEVLNNRVCVAGTAGIFTYD---G------------ 123 (396) Q Consensus 68 ~~~~~~~~~dg~L~~i~~~~w~~~vl---------~~ig~gpV~~~v~n~rvy~t~~~~~~~~~---g------------ 123 (396) ...+++..-++.|.-++.++-...+. .......+.|.+.+|-+|+++...+-... . T Consensus 74 ~~~y~l~~~~~~irv~~~~G~~~~v~~~~~~~y~~~~~~~~~l~~~q~aD~~fi~~~~~~p~~~~~~~~~~~~~~~~~g~ 153 (794) T protein:vir:22 74 HEQYYAVFTGSGIRVFDLSGNEKQVRYPNGSNYIKTANPRNDLRMVTVADYTFIVNRNVVAQKNTKSVNLPNYNPNQDGL 153 (794) T ss_pred CcEEEEEEcCCeEEEEecCCcEEEeecCCCccceecCCCcccEEEEEEcCEEEEEcCCeeeeEeeccccCCCCCCCceEE Confidence 11122222234443333332222221 11112347888899999999976532210 0 Q ss_pred --------ceeeeccccCCccceeecCCCC-----------------------------cccceEEEEEEEEcC------ Q lcl|Aclame:pro 124 --------AQAERLTLDTPAPPLLVAGAGS-----------------------------LSQGTYGAAVAWLRG------ 160 (396) Q Consensus 124 --------~~~~~l~ip~Pa~p~~~~~~Gs-----------------------------l~~g~y~ya~T~V~~------ 160 (396) +..+.+.|............|+ ...+...+.++.... T Consensus 154 v~v~~g~y~~ty~v~I~~~~~a~~~~p~gt~~~~~~~~~~~~ia~~L~~~l~~~~~~~t~~~~~~~~~i~a~~~~~~~~~ 233 (794) T protein:vir:22 154 INVRGGQYGRELIVHINGKDVAKYKIPDGSQPEHVNNTDAQWLAEELAKQMRTNLSDWTVNVGQGFIHVTAPSGQQIDSF 233 (794) T ss_pred EEccCCccceeEEEEeccCcceEEEEcCCCccccceeechhhhhhhhhhhheeccccceEEeCCceEEEEEcCCceEEEE Confidence 0001111110000000000000 000000011111100 Q ss_pred CCcccccccceeEecCCCccEEEeecCCCCCcceEEEEEEecCC-CeEEEEEeeccee---------EEEEcC------- Q lcl|Aclame:pro 161 PQESAPSLIAFAEVTDAGALEVTFPLCLDASVTGARLYLTRANG-GELLLAGDYPLGA---------ATVILP------- 223 (396) Q Consensus 161 ~gEeg~~~~~S~~vt~~~~~~v~lp~~~~~~i~~~RIYrs~~~g-~~~~lv~e~~~~~---------~~~~d~------- 223 (396) ..+.+........+.......-+||.... +...++|=-++... ++||.-.+-..+. ....+. T Consensus 234 t~~~g~~~t~~~~~~~~~~~~~~lp~~~~-~G~~v~i~~~~~~~~~~Y~v~~~~~~~~w~e~~~~~~~~~~~~~t~p~~l 312 (794) T protein:vir:22 234 TTKDGYADQLINPVTHYAQSFSKLPPNAP-NGYMVKIVGDASKSADQYYVRYDAERKVWTETLGWNTEDQVLWETMPHAL 312 (794) T ss_pred eeecccCcceeEEEEeccccceeccccCC-CCeEEEEEeCCCCCcceeEEEEeccceEEEEeeeccceeeecccceeeEe Confidence 11111111111111100000112332211 12223332111111 1233221110000 000111 Q ss_pred -----------CchhhcccccchhcCCcCC--C---ceeeccCCEEEEEECCEEEEccCCCCcccccccccEec------ Q lcl|Aclame:pro 224 -----------TLPELGRPAQFRHLSPMPT--G---KHLAYWRGRLLIARANVLRFSEALAYHLHDERYGFVQM------ 281 (396) Q Consensus 224 -----------~~~~lg~~l~t~~~~ppP~--g---~~~~~~nGrl~~a~Gn~l~fSEp~~p~aw~~~y~~~~~------ 281 (396) ..++..+.+=-..-.|.|. | .-+.++++||..+.++.||+|....++-|...-. ++. T Consensus 313 v~~~~~~~~~~~~~w~~r~~Gd~~tnp~psf~g~~i~~v~f~q~RL~f~~~~~v~~Srtgd~~nF~~~t~-~~~~DdD~i 391 (794) T protein:vir:22 313 VRAADGNFDFKWLEWSPKSCGDVDTNPWPSFVGSSINDVFFFRNRLGFLSGENIILSRTAKYFNFYPASI-ANLSDDDPI 391 (794) T ss_pred eeccCCcEEEeeccccccccCccccCCcceecCCCcceEEEEcceEEEecCCeEEEEccCCccccccccC-cCCCCCccE Confidence 1111111111111234443 4 3478999999999999999999998887733321 111 Q ss_pred --------CcceEEEEEcCCcEEEEEcCcEEEEEcCc---hhheeeeeeccCCCcccceeecchhhhccccccCcccEEE Q lcl|Aclame:pro 282 --------PQRITFVQPVDGGIWVGQVDHVAFLDGAD---PASLSVSRRASRAPVPGSAVLVPAEVVGTNASPDGSPVAV 350 (396) Q Consensus 282 --------~~~I~~i~~v~~gl~V~T~~~~y~l~G~~---p~~m~~~~~~~~~p~~~s~~~~~~~~~~~~~~~~~~~~~l 350 (396) ...|.-+.++..+|+++|++.-|.|+|.+ |++.+..+.+... |...-..+..+..++ T Consensus 392 ~~~~ss~~~~~i~~~v~~~~~L~i~t~~~e~~l~~~~~lTP~~~~~~~~s~~~------------~~~~~~Pv~vg~~v~ 459 (794) T protein:vir:22 392 DVAVSTNRIAILKYAVPFSEELLIWSDEAQFVLTASGTLTSKSVELNLTTQFD------------VQDRARPFGIGRNVY 459 (794) T ss_pred EEEecCCcceeeEEEeecCCcEEEEecCcEEEEeCCCcccceeEEEEEEEEee------------ccCCCCceEeCCeEE Confidence 22356678888999999999999999864 3444433332211 222223344566899 Q ss_pred EecCCCE--------EEEc-CCC-cEEEEe-------cceeecccc--ccceEEEe---CcEEEEEeC Q lcl|Aclame:pro 351 WLAENGY--------VMGT-SSG-AIAEVH-------AGVLAGITG--RAGTSVVF---DRRLLTAVS 396 (396) Q Consensus 351 w~s~~Gl--------v~g~-~~G-~~~~lt-------~~~~~~~~a--~~~~~~~~---~rr~v~~~~ 396 (396) |+++.|= .... .++ +...|| ++.+....+ ..-..+++ +..-+.++. T Consensus 460 f~~~~g~~~~~~r~~~~~~~~d~y~~~Dlt~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~l~~~~ 527 (794) T protein:vir:22 460 FASPRSSFTSIHRYYAVQDVSSVKNAEDITSHVPNYIPNGVFSICGSGTENFCSVLSHGDPSKIFMYK 527 (794) T ss_pred EEecCCCeeEEEEeEeeecccCceehhhHHHHHHHhcCCceEEEEEeCCCCcEEEEEEcCCCEEEEEE Confidence 9999882 1111 111 111111 111100000 00011111 111111111 No 17 >protein:vir:103790 Length: 768 # NCBI annotation: hypothetical protein # Family: family:all:780 # MgeID: mge:1645 # MgeName: BcepC6B # Cross-refs: genbank:acc:YP_024932;genbank:gi:48697202;genbank:GeneID:2846114 Probab=98.54 E-value=3.3e-07 Score=56.15 Aligned_cols=370 Identities=15% Similarity=0.081 Sum_probs=159.3 Q ss_pred CCccc--ccce-ec-----cCCcCChhheeeCCCchhhheeeeeeeeeCCCccEEECCcceeecCCccccccCCccc--- Q lcl|Aclame:pro 1 MATTS--LVPL-AG-----INNVAEDAALQRGGESPRLYVRDAVNIDLSPAGKAQLRASVRQVTDQPFRQLWQSPLH--- 69 (396) Q Consensus 1 m~~~~--~~p~-~G-----~nn~~~~~~L~~~~~~~~~~lrdAvNVD~~~~G~l~~R~G~~~~~~~~~~~lw~s~~~--- 69 (396) |+.+. ..-| .| |--+.+-.++ ...++++.|+=+++.|.+++|+|.+-+..++..+ -..++. T Consensus 1 M~~~~~~~~~F~~GelsP~l~~r~Dl~ry-------~~~~~~~~N~~~~~~gGl~rRpGt~fv~~~~~~~-~~~~lipf~ 72 (768) T protein:vir:10 1 MPKAAPQQVSFDAGELSPLLGARVDLAKY-------PNGCQVMENFIATVQGPAIRRGGKRFVAATKDST-KQSWLLPFI 72 (768) T ss_pred CCcceeeeeeccCceechhhcccchHHHH-------HHHHhhhhcceeeecCCceecCchhhhhhhcCCC-CCeeEEEEE Confidence 99744 4456 67 6666666666 3468999999999999999999999987654222 111121 Q ss_pred -----ccEEEEECCeEEEEecCCCce-------eeccccc---------CcceehhhcCCeEEEEcCCccee---ecCce Q lcl|Aclame:pro 70 -----GDAFGALGDQWGKVDPHSWTF-------EPLAQIG---------EGDLSHEVLNNRVCVAGTAGIFT---YDGAQ 125 (396) Q Consensus 70 -----~~~~~~~dg~L~~i~~~~w~~-------~vl~~ig---------~gpV~~~v~n~rvy~t~~~~~~~---~~g~~ 125 (396) .+++..-++.| +|-..+... ++..... +-.+.|.+..|.+|+++.+.+-. ..+.. T Consensus 73 ~~~~~~y~l~fg~~~i-rv~~~~g~v~~~~~~~e~~tp~~~~~l~~~~~~~~L~~~q~aD~~~i~~~~~~p~~l~r~~~~ 151 (768) T protein:vir:10 73 VADGIAYMLEFGDHYI-RFFVNRGQLVNAGAPVEIATPYALADLTTEDGTFAIRATQSADTMYLFHGGYPTQKLLRTSAT 151 (768) T ss_pred ecCccEEEEEEcCCEE-EEEECCcEEEecCeeEEEEcCCCcceeecccccceeEEEeecCEEEEEcCCcceeEEEEecCC Confidence 23333323333 222222211 1111111 12367777889999999764321 11111 Q ss_pred ee---ecc-ccCCcc------ceeecCCCC-----------ccc----ceEEEE------------EEEEcCCCcc-ccc Q lcl|Aclame:pro 126 AE---RLT-LDTPAP------PLLVAGAGS-----------LSQ----GTYGAA------------VAWLRGPQES-APS 167 (396) Q Consensus 126 ~~---~l~-ip~Pa~------p~~~~~~Gs-----------l~~----g~y~ya------------~T~V~~~gEe-g~~ 167 (396) .. ... .+.|.. .+.....+. ... +...+. ..+.....+. ... T Consensus 152 ~w~l~~~~~~~gp~~~~n~~~~vti~~s~~~~~~T~tasa~~~~~~~v~~~~~l~~~~~~~~~~~~~~~~~g~~~~~~~~ 231 (768) T protein:vir:10 152 TFSLQPVTFVGGPFAAVNSDNNVRVHASAGTGAVTLVASASVFRPSDVGTLFYLEQEDNSFVKPWVVHQKIGPSELRRVG 231 (768) T ss_pred CceeEEeeecCccccccccceeEEEEecccceeEEEeecCCccchhhcceeeeeeeeccccccccEEEEeeeeEEEEecC Confidence 00 000 001100 000000000 000 000000 0000000000 000 Q ss_pred ccceeEecCCCc---cEEEeecCCCCCcceEEEEEEecCCCe------------EE-------EEEeeccee-EE----E Q lcl|Aclame:pro 168 LIAFAEVTDAGA---LEVTFPLCLDASVTGARLYLTRANGGE------------LL-------LAGDYPLGA-AT----V 220 (396) Q Consensus 168 ~~~S~~vt~~~~---~~v~lp~~~~~~i~~~RIYrs~~~g~~------------~~-------lv~e~~~~~-~~----~ 220 (396) ......++.+.. ...+.+ +......+.|.+..+... +. ...++.-.+ .. . T Consensus 232 ~~~~~~~~~~~~~~~~~~t~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~t~~~~~~~~ 308 (768) T protein:vir:10 232 DRVYLCTAVGTATPQVTGTET---PTHTSGSRWDGTGQDESATDEYGSIGAEWEYQHSGYGTVLITGYTNDQVVTGTVAT 308 (768) T ss_pred CceEEeeeeccccccccceec---cccccCceEEEecCcccccccccccceEEEEEEcCCceEEEEEecCCeeEEeeeee Confidence 000000010000 001111 111222233332221110 00 001100000 00 0 Q ss_pred EcC---C--chhhcccccchhcCCcCCC------ceeeccCCEEEEEECCEEEEccCCCCcccccc-c------ccEec- Q lcl|Aclame:pro 221 ILP---T--LPELGRPAQFRHLSPMPTG------KHLAYWRGRLLIARANVLRFSEALAYHLHDER-Y------GFVQM- 281 (396) Q Consensus 221 ~d~---~--~~~lg~~l~t~~~~ppP~g------~~~~~~nGrl~~a~Gn~l~fSEp~~p~aw~~~-y------~~~~~- 281 (396) .+. . ....+....+..|...+.. .++.++++||..+.++.||+|.+..++-|-.. - +-+.+ T Consensus 309 ~~~~~~~~~~~~~~~~~~t~~~~~~~~~~~~g~Ps~v~f~q~RL~f~~~~~v~~Srtgd~~nF~~~s~~~~~DdD~I~~~ 388 (768) T protein:vir:10 309 NDPADPGMLPNTVVTLTGTYKWARSLFNSTDGFPQMGTFWRNRLCLMRDRWLAMSVSADFETFKTKDADQQTDDSAIVQQ 388 (768) T ss_pred ecCcccccccccccccCCCcccccCCCcCCCCCceEEEEEeeeEEEeeCCEEEEEcccccccccccccccccCCccEEEE Confidence 000 0 0111222223333322222 46899999999999999999999888877111 0 01111 Q ss_pred -----CcceEEEEEcCCcEEEEEcCcEEEEEc------CchhheeeeeeccCCCcccceeecchhhhccccccCcccEEE Q lcl|Aclame:pro 282 -----PQRITFVQPVDGGIWVGQVDHVAFLDG------ADPASLSVSRRASRAPVPGSAVLVPAEVVGTNASPDGSPVAV 350 (396) Q Consensus 282 -----~~~I~~i~~v~~gl~V~T~~~~y~l~G------~~p~~m~~~~~~~~~p~~~s~~~~~~~~~~~~~~~~~~~~~l 350 (396) ...|.-+.++ ++|+++|++.-|.|+| .+|++.++.+.+... ++. + ..+..+..++ T Consensus 389 ~ss~~~~~i~~~v~~-~~L~i~T~~~q~~l~~~~~~~~lTP~~~~i~~~s~~g---~~~--~--------~Pv~vG~~v~ 454 (768) T protein:vir:10 389 LNARQLNKLAWMVES-DSLLIGMTGDEWVIGPANASQPVSAANLNAARRTSYG---SKR--I--------QPVQVGGTIM 454 (768) T ss_pred ecCCcceeEEEEeec-CcEEEEecCceEEEecCCCCcccccceEEEEEeehhc---ccc--c--------ccEEeCCeEE Confidence 2457888898 4799999999999987 577887777655432 111 1 1223456899 Q ss_pred EecCCCEE-----EEcCCCc--EEEEe---cceeecccc---ccc--eEEEeCcEEE-EEeC Q lcl|Aclame:pro 351 WLAENGYV-----MGTSSGA--IAEVH---AGVLAGITG---RAG--TSVVFDRRLL-TAVS 396 (396) Q Consensus 351 w~s~~Glv-----~g~~~G~--~~~lt---~~~~~~~~a---~~~--~~~~~~rr~v-~~~~ 396 (396) |++++|=. --..++. ...+| +..+..... ..- +...-.++++ ++++ T Consensus 455 fv~~~g~~vre~~y~~~~d~y~a~DlT~~a~hl~~~~~~~~~~i~~~a~~~~p~~v~~~v~~ 516 (768) T protein:vir:10 455 FVQKAGRKLRDFKYDFSSDNYVSTDVTKIADHITRGRAGTNSGIMSLCFQQEPHSVVWAARA 516 (768) T ss_pred EEcCCCCEEEEEEeeeecCceecchhhhhhhhhccccCccccceeeEEEeecCCeEEEEEec Confidence 99999921 1111111 11222 333333221 111 1111123322 2222 No 18 >protein:vir:10452 Length: 794 # NCBI annotation: tail protein # Family: family:all:825 # MgeID: mge:184 # MgeName: phiA1122 # Cross-refs: genbank:acc:NP_848299;genbank:gi:30387490;genbank:GeneID:1733952 Probab=98.52 E-value=3.8e-07 Score=55.78 Aligned_cols=375 Identities=12% Similarity=0.022 Sum_probs=161.7 Q ss_pred CCc--ccccce-eccCCcCChhheeeCCCchhhheeeeeeeeeCCCccEEECCcceeecCCcc-ccccCCccc------- Q lcl|Aclame:pro 1 MAT--TSLVPL-AGINNVAEDAALQRGGESPRLYVRDAVNIDLSPAGKAQLRASVRQVTDQPF-RQLWQSPLH------- 69 (396) Q Consensus 1 m~~--~~~~p~-~G~nn~~~~~~L~~~~~~~~~~lrdAvNVD~~~~G~l~~R~G~~~~~~~~~-~~lw~s~~~------- 69 (396) |+. -.++=| .||--+.+..++ ...+++++|.=.+..|.+++|+|.+.+..++. ..+...+.- T Consensus 1 M~~i~~s~~n~~~GvSqq~D~~Ry-------~~q~~~~~N~~~~~~gG~~rRpGt~fv~~l~~~~~~~~~~~~~~~~rd~ 73 (794) T protein:vir:10 1 MALISQSIKNLKGGISQQPDILRY-------PDQGSRQVNGWSSETEGLQKRPPLVFLKTLGDNGALGQAPYIHLINRDE 73 (794) T ss_pred CcceeeecchhhcccccCCchHHh-------hhhHhhhhcceeeeccCcccCcchhhheeccCCCccccceeeeEEecCC Confidence 886 245555 787777666666 34699999999999999999999998765532 112222111 Q ss_pred --ccEEEEECCeEEEEecCCCceeecccc---------cCcceehhhcCCeEEEEcCCcceeec---------------- Q lcl|Aclame:pro 70 --GDAFGALGDQWGKVDPHSWTFEPLAQI---------GEGDLSHEVLNNRVCVAGTAGIFTYD---------------- 122 (396) Q Consensus 70 --~~~~~~~dg~L~~i~~~~w~~~vl~~i---------g~gpV~~~v~n~rvy~t~~~~~~~~~---------------- 122 (396) .+...-.++.|.-++.++-...+.... .+..+.|.+..|.+|+++...+-..+ T Consensus 74 ~e~~~v~~~~~~irv~~~~G~~~~v~~~~~~~Y~~aa~~~~~l~~~q~aD~~fivn~~~~~~~~~~~~~~~~~~~~~~~~ 153 (794) T protein:vir:10 74 NEQYYAVFTGTGIRVFDLAGNEKQVRYPNGSNYIKTANPRSDLRMVTVADYTFIVNRNVVVQKDPNSVNLANYNPKQDGL 153 (794) T ss_pred CceEEEEEeCCeEEEEEcCCcEEEEEcCCCCcceecCCCcceEEEEEEcCEEEEEcCCeeeeeeccccccCCCCCCccEE Confidence 112222344555454454444333211 22358888999999999865432210 Q ss_pred ----Cce---eeeccccCCccceeecCCCCc------------------------------ccceEEEEEEE----Ec-C Q lcl|Aclame:pro 123 ----GAQ---AERLTLDTPAPPLLVAGAGSL------------------------------SQGTYGAAVAW----LR-G 160 (396) Q Consensus 123 ----g~~---~~~l~ip~Pa~p~~~~~~Gsl------------------------------~~g~y~ya~T~----V~-~ 160 (396) +++ .|.+.+............|+. .-+.+.|...- +. . T Consensus 154 ~~v~~g~y~r~y~i~i~~~~~at~~tpdgt~~~~~~~~s~~~ia~~L~~~l~a~~~g~t~~~~g~~i~i~a~s~~~~~t~ 233 (794) T protein:vir:10 154 INIRGGQYGRELIVHINGKDVATYKIPDGSKPEHVNNTDAQWLAERLAKQMRINLSGWTVNVGQGFIHVTAPSGQQIDSF 233 (794) T ss_pred EEecccccceEEEeccCCcceeEEEecCCCCcccceecchhhhhhhhhhhhhcccCCceEEeCCeEEEEEeccCceeccc Confidence 000 111222111000000001110 00112221100 00 0 Q ss_pred CCcccccccceeEecCCCccEEEeecCCCCCcceEEEEEEecCC-CeEEEEEeecc-----------------e------ Q lcl|Aclame:pro 161 PQESAPSLIAFAEVTDAGALEVTFPLCLDASVTGARLYLTRANG-GELLLAGDYPL-----------------G------ 216 (396) Q Consensus 161 ~gEeg~~~~~S~~vt~~~~~~v~lp~~~~~~i~~~RIYrs~~~g-~~~~lv~e~~~-----------------~------ 216 (396) ..+.+........+.......-+||.... +...++|=-++..+ +.||+..+-.- . T Consensus 234 s~~~~~~~~~~~~v~~~~~~~~~lp~~~~-~G~~v~i~~~~~~~~~~yyv~~~~~~~~w~E~~~~g~~~~~~~~tmP~~l 312 (794) T protein:vir:10 234 TTKDGYADQLINPVTHYAQSFSKLPPNAP-NGYMVKIVGDASKSADQYYVRYDAERKVWTETLGWNTENQVLLETMPHAL 312 (794) T ss_pred cccCCcCcceeEEEEeccCcceecccCCC-CCcEEEEEeCCCCCcceeEEEEEcCCcEEEEecccceeEEEecccceeEE Confidence 11111111111111100000112332211 22233331111111 12222111000 0 Q ss_pred ----eEEEEcCCchhhcccccchhcCCcCC--C---ceeeccCCEEEEEECCEEEEccCCCCcccccccccEec------ Q lcl|Aclame:pro 217 ----AATVILPTLPELGRPAQFRHLSPMPT--G---KHLAYWRGRLLIARANVLRFSEALAYHLHDERYGFVQM------ 281 (396) Q Consensus 217 ----~~~~~d~~~~~lg~~l~t~~~~ppP~--g---~~~~~~nGrl~~a~Gn~l~fSEp~~p~aw~~~y~~~~~------ 281 (396) ..++.....+...+.+=-..-.|.|. | .-+.|+++||..+.++.||+|....++-|-..-. ++. T Consensus 313 ~r~~~~t~~~~~~~w~~r~~Gd~~tnp~psf~g~~~~~v~f~q~RL~f~~~~~v~~Srtgd~~nF~~~t~-~~~~DdD~I 391 (794) T protein:vir:10 313 VRAADGNFDFKWLEWSPKSCGDVDTNPWPSFVGSSINDVFFFRNRLGFLSGENIILSRTAKYFNFYPASI-ANLSNDDPI 391 (794) T ss_pred EEeccceEEeeecccccccccccccCccCcccCCCccEEEEEcceEEEeeCCeEEEEecCCccccccccc-ccCCCCccE Confidence 01111111111111111111134444 3 2479999999999999999999988877632211 111 Q ss_pred --------CcceEEEEEcCCcEEEEEcCcEEEEEcCch---hheeeeeeccCCCcccceeecchhhhccccccCcccEEE Q lcl|Aclame:pro 282 --------PQRITFVQPVDGGIWVGQVDHVAFLDGADP---ASLSVSRRASRAPVPGSAVLVPAEVVGTNASPDGSPVAV 350 (396) Q Consensus 282 --------~~~I~~i~~v~~gl~V~T~~~~y~l~G~~p---~~m~~~~~~~~~p~~~s~~~~~~~~~~~~~~~~~~~~~l 350 (396) ...|.-+.++...|+++|++.-|.++|.++ ++.+..+.+... |-..-..+..+..++ T Consensus 392 ~~~~ss~~~~~i~~~v~~~~~L~i~T~~~q~~l~~~~~lTP~~~~~~~~s~~~------------~~~~~~Pv~vg~~v~ 459 (794) T protein:vir:10 392 DVAVSTNRIAILKYAVPFSEELLIWSDEAQFVLTASGTLTSRSVELNLTTQFD------------VQDRARPYGIGRNVY 459 (794) T ss_pred EEEecCCcceeeEEEeecCCcEEEEecCcEEEEeCCCcccceeEEEEEEEeec------------ccCCCCceEeCCeEE Confidence 223566788889999999999999998753 333333322111 222223345567899 Q ss_pred EecCCCE----E--E--E-cCCC-cEEEEec---ceeeccc----c--ccceEEEe---CcEEEEEeC Q lcl|Aclame:pro 351 WLAENGY----V--M--G-TSSG-AIAEVHA---GVLAGIT----G--RAGTSVVF---DRRLLTAVS 396 (396) Q Consensus 351 w~s~~Gl----v--~--g-~~~G-~~~~lt~---~~~~~~~----a--~~~~~~~~---~rr~v~~~~ 396 (396) |+++.|= + . - ..++ ....||. ..+.... + ..-..+++ +..-+..+. T Consensus 460 f~~~~g~~~~~~r~~~~~~~~d~y~a~Dlt~~~~hl~~~~v~~~~~~~~~~~~~~~~~~~~~~l~~~~ 527 (794) T protein:vir:10 460 FASPRSSYTSIHRYYAVQDVSSVKNSEDITSHVPNYIPNGVFSICGSGTENFCSVLSHGDPSKIFMYK 527 (794) T ss_pred EEecCCCeeEEEEEeeeccccCceehhhHHHHHHHhcCCceEEEEEeCCCCcEEEEEEcCCCEEEEEE Confidence 9999872 1 1 1 1111 1111111 1111100 0 00001111 111111111 No 19 >protein:vir:94713 Length: 785 # NCBI annotation: tail tube # Family: family:all:825 # MgeID: mge:1528 # MgeName: K1F # Cross-refs: genbank:acc:YP_338122;genbank:gi:77118200;genbank:GeneID:3707736 Probab=98.51 E-value=3.9e-07 Score=55.75 Aligned_cols=376 Identities=13% Similarity=0.070 Sum_probs=164.1 Q ss_pred CCc--ccccce-eccCCcCChhheeeCCCchhhheeeeeeeeeCCCccEEECCcceeecCCcc----ccccC----Cccc Q lcl|Aclame:pro 1 MAT--TSLVPL-AGINNVAEDAALQRGGESPRLYVRDAVNIDLSPAGKAQLRASVRQVTDQPF----RQLWQ----SPLH 69 (396) Q Consensus 1 m~~--~~~~p~-~G~nn~~~~~~L~~~~~~~~~~lrdAvNVD~~~~G~l~~R~G~~~~~~~~~----~~lw~----s~~~ 69 (396) |+. -.+.=| .||--+.+..++ ...+++++|+=++..|.++||+|.+.+...+. .-+|- +-.. T Consensus 1 M~~~~~s~~n~~~GvSqq~D~~ry-------~~q~~~~~N~~~~~~gG~~rRpG~~~v~~l~~~~~~~~~~~~f~~~~~~ 73 (785) T protein:vir:94 1 MPLITQSIKNLKGGISQQPDILRF-------SDQGEAQVNCWSSESDGLQKRPPTVFKRRLNIDVGSNPKFHLINRDEQE 73 (785) T ss_pred CcceeeecchhhcceecCCchHHh-------hhHHhhhhcceeeeccCcccCChhHhhhcccCCCCcCcEEEEEEeCCCc Confidence 886 344455 787777776666 34689999999999999999999998765422 22231 1111 Q ss_pred ccEEEEECCeEEEEecCCCceeecc--cc-----cCcceehhhcCCeEEEEcCCcceeec-C------------------ Q lcl|Aclame:pro 70 GDAFGALGDQWGKVDPHSWTFEPLA--QI-----GEGDLSHEVLNNRVCVAGTAGIFTYD-G------------------ 123 (396) Q Consensus 70 ~~~~~~~dg~L~~i~~~~w~~~vl~--~i-----g~gpV~~~v~n~rvy~t~~~~~~~~~-g------------------ 123 (396) .+++..-++.|.-++.++-...+-. .+ .+..+.|.+.+|-+|+++-..+-... . T Consensus 74 ~y~l~~~~~~irv~~~~G~~~~v~~~~~y~~~~~~~~~l~~~q~aD~~fi~n~~~~~~~~~~~~~~~~~~~~~~~~~i~~ 153 (785) T protein:vir:94 74 QYYIVFNGSNIQIVDLSGNQYSVSGSVDYVKSSNPRDDIRVVTVADYTFVVNRKVVVKGGSEKSHSGYNRKARALINLRG 153 (785) T ss_pred eEEEEEcCCeEEEEecCCcEEEEecCCCceeecCchhheeeEeeCCEEEEEcCCcceeeeeccCCcCCCCCCceEEEecc Confidence 2344444566654444432222110 11 12348888899999999865433221 0 Q ss_pred ce---eeecccc--------CCcc------ceeecCC-------CCcccceEEEE------EEEEcC---------CCcc Q lcl|Aclame:pro 124 AQ---AERLTLD--------TPAP------PLLVAGA-------GSLSQGTYGAA------VAWLRG---------PQES 164 (396) Q Consensus 124 ~~---~~~l~ip--------~Pa~------p~~~~~~-------Gsl~~g~y~ya------~T~V~~---------~gEe 164 (396) +. .|.+.+. +|+. +..++.. +++..+.-.|. +.|+.. ..+. T Consensus 154 g~y~~~y~i~i~g~~~at~~t~~~s~a~~s~~~~s~~~i~~~l~~~l~a~~t~~t~~~~g~~i~i~a~s~t~~~~~s~~~ 233 (785) T protein:vir:94 154 GQYGRTLKVGINGGVKVSHKLPAGNDAENDPPKVDAQAIGAALRDLLVTAYPTFTFDLGSGFLLITAPSGTDINSVETED 233 (785) T ss_pred cccceeEEEeeCCcceeEEEEccCccccccccccchHHHHHHHHHHhhccccceeEEecCcEEEEEecCCccccceeeec Confidence 00 0111111 0100 0000000 00000000000 011100 0000 Q ss_pred cccccceeEecCCCccEEEeecCCCCCcceEEEEEEec-CCCeEEEEEeecce---------eEEEEc------------ Q lcl|Aclame:pro 165 APSLIAFAEVTDAGALEVTFPLCLDASVTGARLYLTRA-NGGELLLAGDYPLG---------AATVIL------------ 222 (396) Q Consensus 165 g~~~~~S~~vt~~~~~~v~lp~~~~~~i~~~RIYrs~~-~g~~~~lv~e~~~~---------~~~~~d------------ 222 (396) +........+.......-+||... .+...++|--+.. .-+.+|+..+-.-+ .....+ T Consensus 234 ~~~~t~~~~~~~~~~~~~~Lp~~~-~~G~~v~v~~~~~~~~~~y~v~~~~~~g~w~e~~~~g~~~~~~~~tmp~~l~~~~ 312 (785) T protein:vir:94 234 GYANQLISPVLDTVQTISKLPLAA-PNGYIIKIQGETNSSADEYYVMYDSNTKTWKETVEPGVVTGFDNTTMPHALVRQS 312 (785) T ss_pred ccCCeEEEEEEeeccceecccccc-CCCCEEEEEccCCCCccceEEEEEcCCceEEEecccceeeeeeccccceEEEecc Confidence 000000011111111112333211 1222233321111 11123322211111 111111 Q ss_pred ------CCchhhcccccchhcCCcCC--C---ceeeccCCEEEEEECCEEEEccCCCCccccccc-------ccEec--- Q lcl|Aclame:pro 223 ------PTLPELGRPAQFRHLSPMPT--G---KHLAYWRGRLLIARANVLRFSEALAYHLHDERY-------GFVQM--- 281 (396) Q Consensus 223 ------~~~~~lg~~l~t~~~~ppP~--g---~~~~~~nGrl~~a~Gn~l~fSEp~~p~aw~~~y-------~~~~~--- 281 (396) ...++..+.+--.+-.|.|. | .-+.|+++||..+.++.||+|....++-|-..- +-+.+ T Consensus 313 ~~~~~~~~~~w~~r~~Gd~~tnp~psf~g~~~~~v~f~q~RL~f~~~~~v~~Srtgd~~nF~~~t~~~~~DdD~i~~~~~ 392 (785) T protein:vir:94 313 DGSFEFKALDWSKRGAGNDDTNPMPSFVDATINDVFFYRNRLGFLSGENVIMSRSASYFAFFPKSVATLSDDDPIDVAVS 392 (785) T ss_pred CCceEEeccccccccCCCcccCCcceecccccceEEEEeceEEEecCCeEEEEccCCcccCccccccCCCCCccEEEEec Confidence 11111111111122244443 3 347999999999999999999998888773220 12222 Q ss_pred ---CcceEEEEEcCCcEEEEEcCcEEEEEcCc---hhheeeeeeccCCCcccceeecchhhhccccccCcccEEEEecCC Q lcl|Aclame:pro 282 ---PQRITFVQPVDGGIWVGQVDHVAFLDGAD---PASLSVSRRASRAPVPGSAVLVPAEVVGTNASPDGSPVAVWLAEN 355 (396) Q Consensus 282 ---~~~I~~i~~v~~gl~V~T~~~~y~l~G~~---p~~m~~~~~~~~~p~~~s~~~~~~~~~~~~~~~~~~~~~lw~s~~ 355 (396) ...|.-+.++..+|+++|++.-|.++|.+ |++.+..+.+... |...-.....+..++|+++. T Consensus 393 ~~~~~~i~~~v~~~~~L~l~T~~~e~~l~~~~~lTP~~~~~~~~s~~~------------~~~~~~Pv~vg~~v~f~~~~ 460 (785) T protein:vir:94 393 HPRISILKYAVPFSEQLLLWSDEVQFVMTSSGVLTSKSIQLDVGSEFA------------LGDNARPFAVGRSVFFSAPR 460 (785) T ss_pred CCcceeeEEEeecCCcEEEEecCcEEEEcCCCcccceeEEEEEEEeee------------ccCCCCceEeCCeEEEEecC Confidence 34577788999999999999999999864 3444443332211 22222234456789999987 Q ss_pred CEE--------EEc-CC-CcEEEEecceeecccccc--ceEEEeCcE----------EEEEeC Q lcl|Aclame:pro 356 GYV--------MGT-SS-GAIAEVHAGVLAGITGRA--GTSVVFDRR----------LLTAVS 396 (396) Q Consensus 356 Glv--------~g~-~~-G~~~~lt~~~~~~~~a~~--~~~~~~~rr----------~v~~~~ 396 (396) |=. ... .+ .+...||.-.-..++... ..+.--+.. -+..+. T Consensus 461 g~~~~v~r~~~~~~~~d~y~~~dlt~~~~~~~~g~~~~~~a~~~~~~~~~~~~~~~g~l~~~~ 523 (785) T protein:vir:94 461 GSFTSIKRYFAVADVSDVKDADDTTGHVLSYIPNGVFDIQGTGTENYICVNSTGAYNRIYIYK 523 (785) T ss_pred CCeeEEEeeeeecccccceehhhHHHHHHHhcCCCcEEEEEecCCCcEEEEEEcCCCEEEEEE Confidence 721 111 11 122222211111111110 111111111 111111 No 20 >protein:vir:94583 Length: 792 # NCBI annotation: Tubular tail protein B # Family: family:all:825 # MgeID: mge:1516 # MgeName: Berlin # Cross-refs: genbank:acc:YP_919014;genbank:gi:119637778;genbank:GeneID:5179343 Probab=98.33 E-value=1.3e-06 Score=52.85 Aligned_cols=375 Identities=11% Similarity=0.021 Sum_probs=161.2 Q ss_pred CCcc--cccce-eccCCcCChhheeeCCCchhhheeeeeeeeeCCCccEEECCcceeecCCc-----cccccCCc----- Q lcl|Aclame:pro 1 MATT--SLVPL-AGINNVAEDAALQRGGESPRLYVRDAVNIDLSPAGKAQLRASVRQVTDQP-----FRQLWQSP----- 67 (396) Q Consensus 1 m~~~--~~~p~-~G~nn~~~~~~L~~~~~~~~~~lrdAvNVD~~~~G~l~~R~G~~~~~~~~-----~~~lw~s~----- 67 (396) |+-+ .++-| .||--+.+..++ ...+++++|.=.+..|.++||+|.+-+..+. -++.+-.| T Consensus 1 M~~i~~s~~n~~~GiSqq~D~~ry-------~~q~~~~~N~~~~~~gG~~rRpG~~fv~~l~~~~~~~~~~~l~~~~~~~ 73 (792) T protein:vir:94 1 MALISQSVKNLKGGISQQPNILRF-------PEQGSEQINGWSSETEGLQKRPPFVFTKTIGDQNALGAKPLVHLINRDS 73 (792) T ss_pred CcceeeecchhhcceecCcchHHh-------hhhhhhhhcceeeeccccccCChhHHHHhhhcCCCCCcccEEEEEEeCC Confidence 8862 45555 777766666666 3469999999999999999999987654321 11111001 Q ss_pred ccccEEEEECCeEEEEecCCCceeec-------ccccCcceehhhcCCeEEEEcCCcceee---cC-------------- Q lcl|Aclame:pro 68 LHGDAFGALGDQWGKVDPHSWTFEPL-------AQIGEGDLSHEVLNNRVCVAGTAGIFTY---DG-------------- 123 (396) Q Consensus 68 ~~~~~~~~~dg~L~~i~~~~w~~~vl-------~~ig~gpV~~~v~n~rvy~t~~~~~~~~---~g-------------- 123 (396) ...+++...++.+.-++.++-...+- ....+..+.|.+..|.+|+++.+.+-.. .+ T Consensus 74 ~q~y~l~f~~~~~rv~~~~g~~~~~~~~~~y~~~~~~~~~l~~~q~aD~~fi~n~~~~~~~~~~~~~~~~~~~~~~v~i~ 153 (792) T protein:vir:94 74 AEQYYVVFTGQGVRVFDLNGKEYDVKGDLSYVKVENPRDDLRMVTVADYTFIVNRNMVVRPDTTPLYTLKENGDCLINIR 153 (792) T ss_pred CceEEEEEcCCeEEEEecCCceEEecccCceeeecCCcceeEEEEEcCEEEEEeCCccceeEecCcCCCCCCceEEEEcc Confidence 01233333333443344343222211 1122345888889999999996543221 11 Q ss_pred -ce---eeecccc---------CCccceeecC----------------CCCcc------cceEEEEEEEEc-----CCCc Q lcl|Aclame:pro 124 -AQ---AERLTLD---------TPAPPLLVAG----------------AGSLS------QGTYGAAVAWLR-----GPQE 163 (396) Q Consensus 124 -~~---~~~l~ip---------~Pa~p~~~~~----------------~Gsl~------~g~y~ya~T~V~-----~~gE 163 (396) +. .+.+.+. .+.++..+.. ..++. .+.|.|++.--. ...+ T Consensus 154 ~g~y~~~y~i~i~~~~~~~~~~~~t~~~~~~~~~~~~i~~~l~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~ 233 (792) T protein:vir:94 154 GGMYGRTLAFTINNTKIAYEIAHGDAPEHSKQTDAQWLVKKLAGLARLNVAFKGWTFTEGPGYIHVIAPSNSQINSLSTE 233 (792) T ss_pred CCCcceeEEEEecCceeeeeeecCcccceecccchhhhhhhhhhhccccccccccEEEECCeEEEEEecCCceeeeeecc Confidence 00 0112221 1111111000 00000 011111100000 0001 Q ss_pred ccccccceeEecCCCccEEEeecCCCCCcceEEEEEEecCC-CeEEEEEeecce---------eEEEEcCC-ch-hhccc Q lcl|Aclame:pro 164 SAPSLIAFAEVTDAGALEVTFPLCLDASVTGARLYLTRANG-GELLLAGDYPLG---------AATVILPT-LP-ELGRP 231 (396) Q Consensus 164 eg~~~~~S~~vt~~~~~~v~lp~~~~~~i~~~RIYrs~~~g-~~~~lv~e~~~~---------~~~~~d~~-~~-~lg~~ 231 (396) .+...-....++..-...-.||.. ..+...++|--+...+ +.+|+..+-..+ .....+.. .+ .+.+. T Consensus 234 ~g~~~~~~~~~~~~v~~~~~lp~~-~~~G~~v~i~~~~~~~~d~y~v~~~~~~~~w~E~~~~~~~~~~~~~tmp~~lv~~ 312 (792) T protein:vir:94 234 DGYADQLMNAVMHTSQSFSRLPVE-APNGYTVKIVGDTSKTSDMFYVQYDNMKKVWKEVAGWGVQKGLNGGTMPHALVRQ 312 (792) T ss_pred cCcCcceeeeeeeccccccccccc-CCCCcEEEEEccCCCCccceEEEEEcCCceEEEecccceeeeecccccCeeEEEc Confidence 111110111111000001122221 1234445554333322 234443221111 11111111 01 11110 Q ss_pred c------cchhc----------CCcCC--C---ceeeccCCEEEEEECCEEEEccCCCCccccccc-------ccEec-- Q lcl|Aclame:pro 232 A------QFRHL----------SPMPT--G---KHLAYWRGRLLIARANVLRFSEALAYHLHDERY-------GFVQM-- 281 (396) Q Consensus 232 l------~t~~~----------~ppP~--g---~~~~~~nGrl~~a~Gn~l~fSEp~~p~aw~~~y-------~~~~~-- 281 (396) . ....| .|.|. | .-+.++++||..+.++.||+|....++-|-..- +-+.+ T Consensus 313 ~~~~~~~~~~~w~~r~~gd~~tnp~psf~g~~i~~v~f~q~RL~f~~~~~v~~Srtgd~~nF~~~t~~~~~DdD~I~~~~ 392 (792) T protein:vir:94 313 ADGSFQMQVLPWTQRTCGDMDTNPTPSIVDQKINDVFFFRNRLGFLAGENIVMSRTSKYFSLFPASVANLSDDDPIDVAV 392 (792) T ss_pred CCCcEEEEeccccccccCccccCccceeccCCcceEEEEcceEEEecCCeEEEEccCCcccCccccccCCCCCccEEEEe Confidence 0 01112 23442 2 247999999999999999999998888773221 11222 Q ss_pred ----CcceEEEEEcCCcEEEEEcCcEEEEEcCc---hhheeeeeeccCCCcccceeecchhhhccccccCcccEEEEecC Q lcl|Aclame:pro 282 ----PQRITFVQPVDGGIWVGQVDHVAFLDGAD---PASLSVSRRASRAPVPGSAVLVPAEVVGTNASPDGSPVAVWLAE 354 (396) Q Consensus 282 ----~~~I~~i~~v~~gl~V~T~~~~y~l~G~~---p~~m~~~~~~~~~p~~~s~~~~~~~~~~~~~~~~~~~~~lw~s~ 354 (396) ...|.-+.++..+|+++|++.-|.++|.+ |++.+..+.+... |...-..+..+..++|+++ T Consensus 393 ss~~~~~i~~~v~~~~~L~l~T~~~q~~l~~~~~lTP~~~~i~~~s~~~------------~~~~~~Pv~vG~~v~Fv~~ 460 (792) T protein:vir:94 393 SHNRISILKYAVPFSEELLLWSDQAQFVLSAQGILSPKSVELNLTTEFD------------VSDRARPFGVGRGVYFASP 460 (792) T ss_pred cCCcceeeeEEeecCCcEEEEecCcEEEEeCCCcccceeEEEEEEEEee------------ccCCCCceEeCCeEEEeec Confidence 35566688899999999999999999854 3444333332111 2222223445678999999 Q ss_pred CCE--------EEEc-CCC-cEEEEec---ceeeccccccceEEEeCcE----------EEEEeC Q lcl|Aclame:pro 355 NGY--------VMGT-SSG-AIAEVHA---GVLAGITGRAGTSVVFDRR----------LLTAVS 396 (396) Q Consensus 355 ~Gl--------v~g~-~~G-~~~~lt~---~~~~~~~a~~~~~~~~~rr----------~v~~~~ 396 (396) .|= ..-. .++ +...||. ..|.... ....+.--+.+ -+..+. T Consensus 461 ~g~~~~v~r~~~~~~~~d~y~a~DlT~~~~hl~~~~v-~~~~a~~~~~~~vv~~~~~~g~l~~~t 524 (792) T protein:vir:94 461 RASYTSLNRYYAVQDVSSVKSAEDMSAHVPNYIPNGV-FSIRGSSTENFISVLSSNAPSRIFLYK 524 (792) T ss_pred CCCeeEEEeeeeeccccCceehhhHHHHHHHhcCCce-EEEEEeCCCCcEEEEEEcCCCeEEEEE Confidence 882 1111 121 1111111 1111100 00000001111 111111 No 21 >protein:vir:97014 Length: 800 # NCBI annotation: 33 # Family: family:all:825 # MgeID: mge:1644 # MgeName: K1-5 # Cross-refs: genbank:acc:YP_654134;genbank:gi:108862018;genbank:GeneID:5075963 Probab=98.28 E-value=1.7e-06 Score=52.23 Aligned_cols=366 Identities=12% Similarity=0.037 Sum_probs=154.9 Q ss_pred CCc-ccccce-eccCCcCChhheeeCCCchhhheeeeeeeeeCCCccEEECCcceeecCCcc---ccccCCc----ccc- Q lcl|Aclame:pro 1 MAT-TSLVPL-AGINNVAEDAALQRGGESPRLYVRDAVNIDLSPAGKAQLRASVRQVTDQPF---RQLWQSP----LHG- 70 (396) Q Consensus 1 m~~-~~~~p~-~G~nn~~~~~~L~~~~~~~~~~lrdAvNVD~~~~G~l~~R~G~~~~~~~~~---~~lw~s~----~~~- 70 (396) |-. -.++=| .||--..+..++ ...+++++|.=.+..|.+++|+|.+.+..+.. ++.|-.- +.. T Consensus 1 ~~v~~s~~n~~~GvSqq~d~~R~-------~~q~~~~~N~~~~~~gGl~rRpGt~fva~l~~~~~~~~~~~~~~~~d~~e 73 (800) T protein:vir:97 1 MEVQGSLGRQIQGISQQPPAVRL-------DGQCTAMVNMIPDVVNGTQSRMGTTHIAKILDAGTDDMATHHYRRGDGDE 73 (800) T ss_pred CeeEeechhhhcccccCchhHhh-------hhhhhhhhcceeccccccccCCchhhheeecCCCcccceeEEEEEcCCce Confidence 665 445555 777766666666 34699999999999999999999998765432 2222110 011 Q ss_pred cEEE-EECCeEEEEecCCCc-eee---------c--ccccCcceehhhcCCeEEEEcCCcceeec--------------- Q lcl|Aclame:pro 71 DAFG-ALGDQWGKVDPHSWT-FEP---------L--AQIGEGDLSHEVLNNRVCVAGTAGIFTYD--------------- 122 (396) Q Consensus 71 ~~~~-~~dg~L~~i~~~~w~-~~v---------l--~~ig~gpV~~~v~n~rvy~t~~~~~~~~~--------------- 122 (396) ..|. ..+|+..+|...+.. ..+ + ....+..+.+.+.+|-+|+++...+-.+. T Consensus 74 q~~v~~~~~~~~rv~~~~G~~~~v~~~~~~~~y~~~~~~~~~~l~~~tvaD~~fi~n~~~~~~~~~~~~~~~~~~~~~~v 153 (800) T protein:vir:97 74 EYFFTLKKGQVPEIFDKYGRKCNVTSQDAPMTYLSEVVNPREDVQFMTIADVTFMLNRRKVVKASSRKSPKVGNKAIVFC 153 (800) T ss_pred EEEEEEEcCCEEEEEecCCcEEEEecCCcceEEEeccCCCccceeEEEEcCEEEEeeCceecccccccccCCCcceEEEE Confidence 1111 123444344322221 111 1 11123467888888999999855322111 Q ss_pred --Cce--eeeccccCCccceeecCCCC--------------------c------------ccceEEEEEEEEcCCCcccc Q lcl|Aclame:pro 123 --GAQ--AERLTLDTPAPPLLVAGAGS--------------------L------------SQGTYGAAVAWLRGPQESAP 166 (396) Q Consensus 123 --g~~--~~~l~ip~Pa~p~~~~~~Gs--------------------l------------~~g~y~ya~T~V~~~gEeg~ 166 (396) |.. .|.+.|...........+|+ + ..|.+.|.. ...+ T Consensus 154 ~~g~y~~~y~i~I~~~~~~~~~t~~~t~~~~~~~~~~~~ia~ql~~~~~~~~~~~~~t~~~~G~~~~i~-~~~~------ 226 (800) T protein:vir:97 154 AYGQYGTSYSIVINGANAASFKTPDGGSADHVEQIRTERITSELYSKLQQWSGVSDYEIQRDGTSIFIE-RRDG------ 226 (800) T ss_pred eecccceeeeeccCCcceEEEEEcCCCCcccceeccHHHHHHHHHHhhhccccccceEEEeCCcEEEEE-EcCC------ Confidence 000 01112211100000000000 0 000111100 0000 Q ss_pred cccceeEecCCCcc------------EEEeecCCCCCcceEEEEEEecCC---CeEEEEEeec-ce-------------- Q lcl|Aclame:pro 167 SLIAFAEVTDAGAL------------EVTFPLCLDASVTGARLYLTRANG---GELLLAGDYP-LG-------------- 216 (396) Q Consensus 167 ~~~~S~~vt~~~~~------------~v~lp~~~~~~i~~~RIYrs~~~g---~~~~lv~e~~-~~-------------- 216 (396) .-..++.+.+..- .-+||... ++. ..+++++.++ +.||+..|-. .+ T Consensus 227 -~~~~v~t~~g~~~~~~~~~~~~v~~~~~lp~~~-~~g--~~v~i~~~~~~~~~~y~~~~~~~~~~~~~w~e~~~~~~~~ 302 (800) T protein:vir:97 227 -ASFTITTTDGAKGKDLVAIKNKVSSTDLLPSRA-PAG--YKVQVWPTGSKPESRYWLQAEPKEGNLVSWKETIAADVLL 302 (800) T ss_pred -ceEEEEecCCcCceeeeEEeeeccchhhchhhC-CCC--cEEEEEccCCCCCceEEEEEEecccCcceEEEeecccccc Confidence 0000111111100 00111111 111 1223332221 1222211100 00 Q ss_pred -------------------eEEEEcCCchhhcccccchhcCCcCC------C---ceeeccCCEEEEEECCEEEEccCCC Q lcl|Aclame:pro 217 -------------------AATVILPTLPELGRPAQFRHLSPMPT------G---KHLAYWRGRLLIARANVLRFSEALA 268 (396) Q Consensus 217 -------------------~~~~~d~~~~~lg~~l~t~~~~ppP~------g---~~~~~~nGrl~~a~Gn~l~fSEp~~ 268 (396) ..++.....++..+.+--..-.|.|. + .-+.|+++||..+.++.||+|.... T Consensus 303 ~~~~~tmp~~~~~~~~~~~~g~~~~~~~~w~~r~~gd~~tnp~p~f~~~~~~~~~~~v~f~q~RL~f~~~~~v~~Srtgd 382 (800) T protein:vir:97 303 GFDKGTMPYIIERTDIINGIAQFKIRQGDWEDRKVGDDLTNPMPSFIDEEVPQTIGGMFMVQNRLCFTAGEAVIASRTSY 382 (800) T ss_pred ceecccceEEEEEeecccccceeEEEeccccccccCccccCccccccCCcCCCCceeEEEEeeeEEEecCCeEEEEecCC Confidence 01111111122222211122233333 1 2378999999999999999999988 Q ss_pred Ccccccc-c------ccEec------CcceEEEEEcCCcEEEEEcCcEEEEEcCc---hhheeeeeeccCCCcccceeec Q lcl|Aclame:pro 269 YHLHDER-Y------GFVQM------PQRITFVQPVDGGIWVGQVDHVAFLDGAD---PASLSVSRRASRAPVPGSAVLV 332 (396) Q Consensus 269 p~aw~~~-y------~~~~~------~~~I~~i~~v~~gl~V~T~~~~y~l~G~~---p~~m~~~~~~~~~p~~~s~~~~ 332 (396) ++-|-.. - +-+.+ ...|.-+.++..+|+++|++.-|.|+|.+ |++.+..+.+.. . T Consensus 383 ~~nF~~~t~~~~~DdD~I~~~~ss~~v~~i~~~v~~~~~L~i~T~~~q~~ls~~~~lTP~~~~~~~~s~~--------~- 453 (800) T protein:vir:97 383 FFDFFRYTVISALATDPFDIFSDASEVYQLKHAVTLDGATVLFSDKSQFILPGDKPLEKSNALLKPVTTF--------E- 453 (800) T ss_pred ccccccccccCCCCCccEEEEecCCcceeeeEEeecCCcEEEEecCcEEEEeCCCcccceeEEEEEEEee--------e- Confidence 8776221 0 12222 34577788999999999999999999864 333333332211 1 Q ss_pred chhhhccccccCcccEEEEecCCCE-------EEEcCCCc-----EEEEecceeecc----cc--ccceEEE---eCcEE Q lcl|Aclame:pro 333 PAEVVGTNASPDGSPVAVWLAENGY-------VMGTSSGA-----IAEVHAGVLAGI----TG--RAGTSVV---FDRRL 391 (396) Q Consensus 333 ~~~~~~~~~~~~~~~~~lw~s~~Gl-------v~g~~~G~-----~~~lt~~~~~~~----~a--~~~~~~~---~~rr~ 391 (396) |...-.....+..++|+++.|= .-...+.. ++.+-+..+... .+ ..-..++ ++... T Consensus 454 ---~~~~~~Pv~vG~~v~fv~~~g~~s~vre~~~~~~~d~~~a~DlT~~~~hl~~~~v~~~~~~~~~~~~v~~~~~~~~~ 530 (800) T protein:vir:97 454 ---VNNKVKPVVTGESVMFATNDGSYSGVREFYTDSYSDTKKAQAITSHVNKLIEGNITNMAASTNVNRLLVTTDKYRNI 530 (800) T ss_pred ---ccCCCCcEEeCCeEEEeeCCCCeeEEEEEeeeecccceehhhHHHHHHHhcCCceEEEEEeCCCCeEEEEEEcCCCE Confidence 2222223455678999999872 11111110 010101111110 00 0011111 11111 Q ss_pred EEEeC Q lcl|Aclame:pro 392 LTAVS 396 (396) Q Consensus 392 v~~~~ 396 (396) +..+. T Consensus 531 l~~~~ 535 (800) T protein:vir:97 531 IYCYD 535 (800) T ss_pred EEEEE Confidence 11111 No 22 >protein:vir:105647 Length: 800 # NCBI annotation: putative tail tubular B protein # Family: family:all:825 # MgeID: mge:1674 # MgeName: K1E # Cross-refs: genbank:acc:YP_425011;genbank:gi:83571759;uniprot:Q2WC41;genbank:GeneID:3837288 Probab=98.17 E-value=3.1e-06 Score=50.78 Aligned_cols=375 Identities=11% Similarity=0.042 Sum_probs=158.1 Q ss_pred CCc-ccccce-eccCCcCChhheeeCCCchhhheeeeeeeeeCCCccEEECCcceeecCCccc---cccCC-----cccc Q lcl|Aclame:pro 1 MAT-TSLVPL-AGINNVAEDAALQRGGESPRLYVRDAVNIDLSPAGKAQLRASVRQVTDQPFR---QLWQS-----PLHG 70 (396) Q Consensus 1 m~~-~~~~p~-~G~nn~~~~~~L~~~~~~~~~~lrdAvNVD~~~~G~l~~R~G~~~~~~~~~~---~lw~s-----~~~~ 70 (396) |-. -.++-| .||--.-+..++ ...+++++|+=.+..|.+++|+|.+-+..+... +++-- -... T Consensus 1 ~~v~~s~~nl~~GvSqQ~d~~R~-------~~q~~~~~N~~~~~~gGl~rRpGt~fva~l~~~~~~~~~~~~~~~~~~~~ 73 (800) T protein:vir:10 1 MEVQGSLGRQIQGISQQPPAVRL-------DGQCTTMVNMVPDVVNGTQSRMGTTHIAKLLDEGTDNMATHHYRRGEGDE 73 (800) T ss_pred CeEEeecchhcccccccchhHhh-------hhhhhhhhcceeeeccCcccCCcceEEEeecCCCCCccEEEEEecCCccc Confidence 655 344445 565544555555 346999999999999999999999888654222 11110 0001 Q ss_pred c-EEEEECCeEEEEecCCCcee-ec-----------ccccCcceehhhcCCeEEEEcCCcceeec--------------- Q lcl|Aclame:pro 71 D-AFGALGDQWGKVDPHSWTFE-PL-----------AQIGEGDLSHEVLNNRVCVAGTAGIFTYD--------------- 122 (396) Q Consensus 71 ~-~~~~~dg~L~~i~~~~w~~~-vl-----------~~ig~gpV~~~v~n~rvy~t~~~~~~~~~--------------- 122 (396) . .+....|+..+|-..+.... +. .......+.+.+..|-+|+++...+-... T Consensus 74 ~~~~~~~~g~~~rv~~~~G~~~~v~~~~~~~~~~~~~~~~~~~l~~~tvaD~tfi~n~~~~~~~~~~~~~~~~~~~~~~v 153 (800) T protein:vir:10 74 EYFFTLKKGQVPEIFDKHGRKCNVISQDAPMTYLSEVVNPREDVQFMTIADVTFMLNRRKVVKVSNRKSPKVGDKAIVFC 153 (800) T ss_pred eEEEEEEcCCeEEEEecCCcEEEeecCCcceeeeeccCCchhhEEEEEEcCEEEEecCcccccccccCCCCCCceEEEEE Confidence 1 22333444444443322111 11 01112357888888999999966432211 Q ss_pred -Cce---eeeccccCCccceeecCC--------------------------CCcc------cceEEEEEEEEcCC----- Q lcl|Aclame:pro 123 -GAQ---AERLTLDTPAPPLLVAGA--------------------------GSLS------QGTYGAAVAWLRGP----- 161 (396) Q Consensus 123 -g~~---~~~l~ip~Pa~p~~~~~~--------------------------Gsl~------~g~y~ya~T~V~~~----- 161 (396) +++ .|.+.+...+.......+ +++. .+++.|. ..++.. T Consensus 154 r~g~y~~~y~i~i~g~~~~~~~t~~~~~~~~~~~~s~~~i~~~L~~~l~~~~~~~~~t~~~~g~~i~i-~~~~~~~~~~~ 232 (800) T protein:vir:10 154 AYGQYGTSYSIIINGTTAASFKTPDGGSAEHVEQIRTERITSELYSKLQQWSGVNDYEIQRDGTSIFI-ERRDGKSFTVT 232 (800) T ss_pred eccccccceeEEeccceEEEEEecCCCcccccccccHHHHHHHHHhhhhhcCcccceEEEEcCcEEEE-EEecCCceEEE Confidence 000 011111111000000000 0000 0111111 111100 Q ss_pred CcccccccceeEecCCCccEEEeecCCCCCcceEEEEEEecCC-CeEEEEEee--------------------cce---- Q lcl|Aclame:pro 162 QESAPSLIAFAEVTDAGALEVTFPLCLDASVTGARLYLTRANG-GELLLAGDY--------------------PLG---- 216 (396) Q Consensus 162 gEeg~~~~~S~~vt~~~~~~v~lp~~~~~~i~~~RIYrs~~~g-~~~~lv~e~--------------------~~~---- 216 (396) -+++...-.-..+.......-.||.. .++...+.|..++... +.||+..|- ..+ T Consensus 233 ~~~~~~~~~~~~~~~~v~~~~~Lp~~-~~~g~~~~i~~~~~~~~~~y~~~~~~~~~~~~~w~e~~~~~~~~~~~~~tmp~ 311 (800) T protein:vir:10 233 TTDGAKGKDLVAIKNKVSSTDLLPSR-APAGYKVQVWPTGSKPESRYWLQAEPKEGNLVSWKETIAADVLLGFDKGTMPY 311 (800) T ss_pred EeecCCcceEEEEEeeccceeecccc-CCCCceEEEEcCCCCCCceeEEEEEeccccceEEEeecccCceeeeecccccE Confidence 00111100000011000111123322 1223334443332211 122222220 000 Q ss_pred ----------eEEEEcCCchhhcccccchhcCCcCC--C-------ceeeccCCEEEEEECCEEEEccCCCCccccccc- Q lcl|Aclame:pro 217 ----------AATVILPTLPELGRPAQFRHLSPMPT--G-------KHLAYWRGRLLIARANVLRFSEALAYHLHDERY- 276 (396) Q Consensus 217 ----------~~~~~d~~~~~lg~~l~t~~~~ppP~--g-------~~~~~~nGrl~~a~Gn~l~fSEp~~p~aw~~~y- 276 (396) ..++.....+...+.+--..-.|.|. | .-+.|+++||..+.++.||+|....++-|-..- T Consensus 312 ~lv~~~~~~~~~~~~~~~~~w~~r~~gd~~tnp~psf~~~~~~~~i~~v~f~q~RL~f~~~~~v~~Srtgd~~nF~~~t~ 391 (800) T protein:vir:10 312 IIERTGIIDGIAQFKIRQGDWEDRKVGDDLTNPMPSFIDEEVPQTIGGMFMVQNRLCFTAGEAVIASRTSYFFDFFRYTV 391 (800) T ss_pred EEEEeeeeecceeEEEEeccccccccCCCCCCCCchhcCCCCCCCceeEEEEeeeEEEeeCCeEEEEccCCccccccccc Confidence 01111111111112111122234443 1 237899999999999999999999888773221 Q ss_pred ------ccEec------CcceEEEEEcCCcEEEEEcCcEEEEEcCc---hhheeeeeeccCCCcccceeecchhhhcccc Q lcl|Aclame:pro 277 ------GFVQM------PQRITFVQPVDGGIWVGQVDHVAFLDGAD---PASLSVSRRASRAPVPGSAVLVPAEVVGTNA 341 (396) Q Consensus 277 ------~~~~~------~~~I~~i~~v~~gl~V~T~~~~y~l~G~~---p~~m~~~~~~~~~p~~~s~~~~~~~~~~~~~ 341 (396) +-+.+ ...|.-+.++..+|+++|++.-|.|+|.+ |++.+..+.+..- |...-. T Consensus 392 ~~~~DdD~I~~~~ss~~~~~i~~~v~~~~~L~l~T~g~q~~l~g~~~lTP~~~~i~~~s~~~------------~~~~~~ 459 (800) T protein:vir:10 392 ISALATDPFDIFSDASEVYQLKHAVTLDGATVLFSDKSQFILPGDKPLEKSNALLKPVTTFE------------VNNKVK 459 (800) T ss_pred cCCCCCccEEEEEcCCcceeeeeEeecCCcEEEEecCcEEEEeCCCcccceeEEEEEEEeee------------ccCCCC Confidence 11222 35566688899999999999999999864 3333333332111 222223 Q ss_pred ccCcccEEEEecCCCE-------EEEcCCCc-----EEEE----eccee-ecccc-ccceEEEe---CcEEEEEeC Q lcl|Aclame:pro 342 SPDGSPVAVWLAENGY-------VMGTSSGA-----IAEV----HAGVL-AGITG-RAGTSVVF---DRRLLTAVS 396 (396) Q Consensus 342 ~~~~~~~~lw~s~~Gl-------v~g~~~G~-----~~~l----t~~~~-~~~~a-~~~~~~~~---~rr~v~~~~ 396 (396) ....+..++|+++.|= .-...+.. ++.+ .++.+ ..... ..-..+++ +...+..+. T Consensus 460 Pv~vG~~v~Fv~~~g~~s~vre~~~~~~~d~~~a~DlT~~~~hl~~~~v~~~~~~~~~~~~v~~~~~~~~~l~~~~ 535 (800) T protein:vir:10 460 PVVTGESVMFATNDGSYSGVREFYTDSYSDTKKAQAITSHVNKLIEGNITNMAASTNVNRLLVTTDKYRNIIYCYD 535 (800) T ss_pred ceEeCCeEEEecCCCCeeEEEEEeeeecccceehhhHHhHHHHhcCCceEEEEEeCCCCeEEEEEEcCCCeEEEEE Confidence 3445668999999872 11111110 0001 11111 11000 00111111 111111111 No 23 >protein:vir:1543 Length: 801 # NCBI annotation: tail tubular protein B # Family: family:all:825 # MgeID: mge:31 # MgeName: phiYeO3-12 # Cross-refs: genbank:acc:NP_052111;swissprot:trembl:q9t105;genbank:gi:9634037;uniprot:Q9T105;genbank:GeneID:1262408 Probab=98.13 E-value=4e-06 Score=50.19 Aligned_cols=375 Identities=11% Similarity=0.043 Sum_probs=159.5 Q ss_pred CCcc--cccce-eccCCcCChhheeeCCCchhhheeeeeeeeeCCCccEEECCcceeecCCccc-cccC---------Cc Q lcl|Aclame:pro 1 MATT--SLVPL-AGINNVAEDAALQRGGESPRLYVRDAVNIDLSPAGKAQLRASVRQVTDQPFR-QLWQ---------SP 67 (396) Q Consensus 1 m~~~--~~~p~-~G~nn~~~~~~L~~~~~~~~~~lrdAvNVD~~~~G~l~~R~G~~~~~~~~~~-~lw~---------s~ 67 (396) |+-+ .++-| .||--..+..++ ...+++++|.=.+..|.++||+|.+.+..++.. .+.. +- T Consensus 1 M~~i~~s~~n~~~GvSqq~d~~r~-------~~q~~~~~N~~~~~~gGl~rRpGt~~va~~~~~~~~~~~~~~~~~~~~~ 73 (801) T protein:vir:15 1 MALISQSIKNLKGGISQQPDILRF-------AEQGSVQINGWSSESEGLQKRPPMIHLKTLGPAGYVGAQPYVHLINRDE 73 (801) T ss_pred CceeeeecchhhcceecCcchHhh-------hhhHhhhhcceeccccCcccCCchheeeeecCCCCcccceeEEEEEeCC Confidence 8862 45555 776666666655 346999999999999999999999887554321 1111 11 Q ss_pred ccccEEEEECCeEEEEecCCCceeecc--cc-----cCcceehhhcCCeEEEEcCCcceeecCc---------------- Q lcl|Aclame:pro 68 LHGDAFGALGDQWGKVDPHSWTFEPLA--QI-----GEGDLSHEVLNNRVCVAGTAGIFTYDGA---------------- 124 (396) Q Consensus 68 ~~~~~~~~~dg~L~~i~~~~w~~~vl~--~i-----g~gpV~~~v~n~rvy~t~~~~~~~~~g~---------------- 124 (396) ...+++..-++.|.-++.++-...+-. .+ .+..+.+.+..|-+|+++-...-..... T Consensus 74 ~e~y~l~~~~~~irv~~~~G~~~~v~~~~~y~~~~~~~~~l~~~~~aD~~fi~nr~~~~~~~~~~~~~~~~~~~~~alv~ 153 (801) T protein:vir:15 74 FEQYFVVFTGEDIKVFDLDGKEYQVRGDRSYVRTANPREDLRMITVADYTFVTNRKVVVQSNDQSVNLPGFKDQGDALIN 153 (801) T ss_pred ceEEEEEEcCCeEEEEccCCcEEEEecCCccccccCchhheeEEEEcCEEEEeeCCeeeecccCccccCccCCCCceEEE Confidence 112333333555554444432222110 11 1234788888889998885543211100 Q ss_pred -------eeeeccccCCccce-eecCCCCccc-----------------------------ceEEEEEEEEcCCCccccc Q lcl|Aclame:pro 125 -------QAERLTLDTPAPPL-LVAGAGSLSQ-----------------------------GTYGAAVAWLRGPQESAPS 167 (396) Q Consensus 125 -------~~~~l~ip~Pa~p~-~~~~~Gsl~~-----------------------------g~y~ya~T~V~~~gEeg~~ 167 (396) ..+...+.. ..+. .....|+... +...+.++....+--...| T Consensus 154 v~~~~yg~t~~I~i~g-s~~~~~t~~~gs~~~~~~~~s~~~ia~~l~~~~~~~~p~~~~~~~~~~w~~~~~~g~~~i~a~ 232 (801) T protein:vir:15 154 VRGGQYGRRLSIEFNG-AERAAVQLPDGSQPAHVNEVDGQAIAEKLAAQLRNNLGNPNNDQDPNKWRFNVGPGFIHILAP 232 (801) T ss_pred eeeccCceeEEEEeCC-cceEEEEeccCcccchhhhcceeechHHHhhhhhhccCccceeccCccEEEEecCcEEEEeCC Confidence 001111100 0000 0000111100 0001111111000000000 Q ss_pred cc---ceeEecCCCc------------cEEEeecCCCCCcceEEEEEEe-cCCCeEEEEEeecc---------------- Q lcl|Aclame:pro 168 LI---AFAEVTDAGA------------LEVTFPLCLDASVTGARLYLTR-ANGGELLLAGDYPL---------------- 215 (396) Q Consensus 168 ~~---~S~~vt~~~~------------~~v~lp~~~~~~i~~~RIYrs~-~~g~~~~lv~e~~~---------------- 215 (396) .. .+....++.+ ..-.+|.. ..+...++|=-+. .+.+.+|+-.+-.. T Consensus 233 ~~~~~~~~~t~dg~~~~~~~~~~~~v~~~~~lp~~-~~~G~~v~v~~~~~~~~~~y~v~~~~~~~~w~E~a~~g~~~~~~ 311 (801) T protein:vir:15 233 NNDNVWGLQTKDGYADQLINPVTHYTQSFQKLPIN-APDGYIVKIVGDTSKTADQYYVRFDLNRKVWVETIGWNTRTHLY 311 (801) T ss_pred CCcccceeeeccccCceeeeEEeecccceeeeeee-cCCCcEEEEEecCCCccceEEEEEEcCCeeEEeecccccceeee Confidence 00 0001111110 01122221 1122222321110 11122332211111 Q ss_pred -----------eeEEEEcCCchhhcccccchhcCCcCC--C---ceeeccCCEEEEEECCEEEEccCCCCccccccc--- Q lcl|Aclame:pro 216 -----------GAATVILPTLPELGRPAQFRHLSPMPT--G---KHLAYWRGRLLIARANVLRFSEALAYHLHDERY--- 276 (396) Q Consensus 216 -----------~~~~~~d~~~~~lg~~l~t~~~~ppP~--g---~~~~~~nGrl~~a~Gn~l~fSEp~~p~aw~~~y--- 276 (396) +..+|.....++..+.+--..-.|.|. | .-+.|+++||..+.++.||+|....++-|-..- T Consensus 312 ~~tmp~~lv~~~~~~~~~~~~~w~~r~~gd~~tnp~psf~g~~~~~v~f~q~RL~f~~~~~v~~Srtgd~~nF~~~t~~~ 391 (801) T protein:vir:15 312 YHTMPWALVRASDGNFDFKVLEWGARTVGDDTTNPYPSFTGQTINDIFFFRNRLGFLSGENIILSRTSKYFNFFPASVSN 391 (801) T ss_pred ccccceEEEeeccceEEEeccccccccCCccccCCcccccCCCceEEEEEcceEEEeeCCeEEEEecCCccccccccccC Confidence 111222222222222222222234443 2 247999999999999999999998887763221 Q ss_pred ----ccEec------CcceEEEEEcCCcEEEEEcCcEEEEEcCc---hhheeeeeeccCCCcccceeecchhhhcccccc Q lcl|Aclame:pro 277 ----GFVQM------PQRITFVQPVDGGIWVGQVDHVAFLDGAD---PASLSVSRRASRAPVPGSAVLVPAEVVGTNASP 343 (396) Q Consensus 277 ----~~~~~------~~~I~~i~~v~~gl~V~T~~~~y~l~G~~---p~~m~~~~~~~~~p~~~s~~~~~~~~~~~~~~~ 343 (396) +-+.+ ...|.-+.++..+|+++|++.-|.|+|.+ |++.+..+.+... |...-..+ T Consensus 392 ~~DdD~i~~~~~~~~~~~i~~~v~~~~~L~i~t~~~q~~ls~~~~lTP~~~~~~~~s~~~------------~~~~~~Pv 459 (801) T protein:vir:15 392 YSDDDPIDVAVSHNRVSTLKYAVPFSEELLLWSDQAQFVLTASGILSSRSVELNLTTQFD------------VQDRARPH 459 (801) T ss_pred CCCCccEEEEecCCcceeeEEEeecCCcEEEEecCcEEEEcCCCcccceeEEEEEEEeee------------ccCCCCce Confidence 11222 34566688889999999999999999864 4444433332211 11222234 Q ss_pred CcccEEEEecCCCE--------EEEc-CCC-cEEEEe---cceeec----ccc--ccceEEEe---CcEEEEEeC Q lcl|Aclame:pro 344 DGSPVAVWLAENGY--------VMGT-SSG-AIAEVH---AGVLAG----ITG--RAGTSVVF---DRRLLTAVS 396 (396) Q Consensus 344 ~~~~~~lw~s~~Gl--------v~g~-~~G-~~~~lt---~~~~~~----~~a--~~~~~~~~---~rr~v~~~~ 396 (396) ..+..++|+++.|= ..-. .++ +...|| ...+.. ..+ ..-..+.+ +..-+..+. T Consensus 460 ~vg~~v~f~~~~g~~~~~~r~~~~~~~~d~y~a~Dlt~~~~hl~~~~v~~~~~~~~~~~~~~~~~~~~~~l~~~~ 534 (801) T protein:vir:15 460 GVGRNVYFASPRASFTSINRYYAVQDVSSVKNAEDMTAHVPNYIPNGVFSISGTTAENFAAILTSGAPNRVYIYK 534 (801) T ss_pred EeCCeEEEEecCCCeeEEEEEEeecccccceehhhHHHHHHHhcCCceEEEEEeCCCCcEEEEEEcCCCEEEEEE Confidence 45668999999881 1111 111 111111 111111 000 00111111 111111111 No 24 >protein:vir:3366 Length: 801 # NCBI annotation: tail tubular protein B # Family: family:all:825 # MgeID: mge:67 # MgeName: T3 # Cross-refs: genbank:acc:NP_523337;swissprot:trembl:q8w5u3;genbank:gi:17570828;goa:Q8W5U3;uniprot:Q8W5U3;genbank:GeneID:927453 Probab=98.12 E-value=4.2e-06 Score=50.07 Aligned_cols=375 Identities=11% Similarity=0.044 Sum_probs=160.1 Q ss_pred CCc--ccccce-eccCCcCChhheeeCCCchhhheeeeeeeeeCCCccEEECCcceeecCCccc-cccCCcc-------- Q lcl|Aclame:pro 1 MAT--TSLVPL-AGINNVAEDAALQRGGESPRLYVRDAVNIDLSPAGKAQLRASVRQVTDQPFR-QLWQSPL-------- 68 (396) Q Consensus 1 m~~--~~~~p~-~G~nn~~~~~~L~~~~~~~~~~lrdAvNVD~~~~G~l~~R~G~~~~~~~~~~-~lw~s~~-------- 68 (396) |+. -.++-| .||--..+..++ ...+++++|.=.+..|.++||+|.+.+...+.. .+...|. T Consensus 1 M~~i~~~~~nl~~GvSqq~d~~r~-------~~q~~~~~N~~~~~~gG~~rRpGt~~va~~~~~~~~~~~~~~~~~~r~~ 73 (801) T protein:vir:33 1 MALISQSIKNLKGGISQQPDILRF-------TEQGSVQINGWSSESEGIQKRPPMIHLKTLGTAGYVGAQPYVHLINRDE 73 (801) T ss_pred CceeEeeccceecceeccchhHhh-------hhhHhhhhcceeecccCcccCchhHhhhhhcCCCccccceEEEEEEeCC Confidence 886 345555 676655555555 346899999999999999999999887654321 1111111 Q ss_pred -cccEEEEECCeEEEEecCCCceeecc-------cccCcceehhhcCCeEEEEcCCcceeecCceeeeccccCCccceee Q lcl|Aclame:pro 69 -HGDAFGALGDQWGKVDPHSWTFEPLA-------QIGEGDLSHEVLNNRVCVAGTAGIFTYDGAQAERLTLDTPAPPLLV 140 (396) Q Consensus 69 -~~~~~~~~dg~L~~i~~~~w~~~vl~-------~ig~gpV~~~v~n~rvy~t~~~~~~~~~g~~~~~l~ip~Pa~p~~~ 140 (396) ..+++..-++.|.-++.++-...+-. ...+..+.+.+..|-+|+++-+..-..+.- ......+.+..-..+ T Consensus 74 ~~~y~l~~~~~~irv~~~~G~~~~v~~~~~y~~~~~~~~~l~~~t~aD~~fi~nr~~~p~~~~~-~~~~~~~~~~~~~li 152 (801) T protein:vir:33 74 FEQYFVVFTGEDIKVFDLDGKEYQVRGDRSYVRTANPREDLRMVTVADYTFVTNRKVVVQSNDQ-SVNLPGFKDQGDALI 152 (801) T ss_pred ceEEEEEEcCCeEEEEccCCcEEEEecCCcceeecCcchheEEEEEcCEEEEeeCCeeecccCC-cccccccCCCcceEE Confidence 12333334555554444433222111 112345788888899999885443221110 000001111000000 Q ss_pred ------------------------cCCCCc------------------------ccce-EE--EEEEEEcCCCc--cccc Q lcl|Aclame:pro 141 ------------------------AGAGSL------------------------SQGT-YG--AAVAWLRGPQE--SAPS 167 (396) Q Consensus 141 ------------------------~~~Gsl------------------------~~g~-y~--ya~T~V~~~gE--eg~~ 167 (396) ...|+. +... +. -.+++....+- -..| T Consensus 153 ~v~~~~yg~t~~I~i~gs~~~~~~~~~gs~~~~v~~~s~~~~A~~l~~~~~~~~~~~~~~~~~~~w~~~~~~g~~~i~~p 232 (801) T protein:vir:33 153 NVRGGQYGRRLSIEFNGAERAAVQLPDGSQPAHVNEVDGQAIAEKLAAQLRNNLGNPNNDQDPNKWRFNVGPGFIHILAP 232 (801) T ss_pred EEeecccceEEEEEECCcceEEEEeeccccccccccccchhhhhhhhhhhhccCccceeeecCceEEEEecCeEEEEecC Confidence 000000 0000 00 00000000000 0000 Q ss_pred ccce---eEecCCCc------------cEEEeecCCCCCcceEEEEEE-ecCCCeEEEEE-----------------ee- Q lcl|Aclame:pro 168 LIAF---AEVTDAGA------------LEVTFPLCLDASVTGARLYLT-RANGGELLLAG-----------------DY- 213 (396) Q Consensus 168 ~~~S---~~vt~~~~------------~~v~lp~~~~~~i~~~RIYrs-~~~g~~~~lv~-----------------e~- 213 (396) .... +...++.. ..-.+|... .+...++|--+ ..+...+|.-. .+ T Consensus 233 ~~~~~~~itt~~g~~~~~~~~~~~~v~~~~~lp~~~-~~g~~v~v~~~~~~~~~~y~v~~~~~~~~w~e~~~~g~~~~~~ 311 (801) T protein:vir:33 233 NNDNVWGLQTKDGYADQLINPVTHYTQSFQKLPINA-PDGYIVKIVGDTSKTADQYYVRFDLNRKVWVETIGWNTRTHLH 311 (801) T ss_pred CCcccccccccCCccceeEEEEeecccceeeeeeec-CCCcEEEEEecCCCcccceEEEEEcCCcEEEEeeccccceeee Confidence 0000 00000000 001122211 11111222110 00011111110 00 Q ss_pred ----c-----ceeEEEEcCCchhhcccccchhcCCcCC--C---ceeeccCCEEEEEECCEEEEccCCCCccccccc--- Q lcl|Aclame:pro 214 ----P-----LGAATVILPTLPELGRPAQFRHLSPMPT--G---KHLAYWRGRLLIARANVLRFSEALAYHLHDERY--- 276 (396) Q Consensus 214 ----~-----~~~~~~~d~~~~~lg~~l~t~~~~ppP~--g---~~~~~~nGrl~~a~Gn~l~fSEp~~p~aw~~~y--- 276 (396) | .+..+|.....++..+..-..+..|.|. | .-+.|+++||..+.++.||+|....++-|-..- T Consensus 312 ~~tmp~~l~~~~~~tf~~~~~~w~~r~~gd~~tnp~psf~g~~~~~v~f~q~RL~f~~~~~v~~Srtgd~~nF~~~t~~~ 391 (801) T protein:vir:33 312 YHTMPWALVRASDGNFDFKYLEWGARTVGDDTTNPYPSFTGQTINDIFFFRNRLGFLSGENIILSRTSKYFNFFPASVSN 391 (801) T ss_pred ecccceEEEEccCceEEecccCccccccCCccccCcccccCCCceEEEEEcceEEEeeCCeEEEEecCCccccccccccC Confidence 0 0111233333333333333344456664 3 347999999999999999999998887763221 Q ss_pred ----ccEec------CcceEEEEEcCCcEEEEEcCcEEEEEcCc---hhheeeeeeccCCCcccceeecchhhhcccccc Q lcl|Aclame:pro 277 ----GFVQM------PQRITFVQPVDGGIWVGQVDHVAFLDGAD---PASLSVSRRASRAPVPGSAVLVPAEVVGTNASP 343 (396) Q Consensus 277 ----~~~~~------~~~I~~i~~v~~gl~V~T~~~~y~l~G~~---p~~m~~~~~~~~~p~~~s~~~~~~~~~~~~~~~ 343 (396) +-+.+ ...|.-+.++..+|+++|++.-|.++|.+ |++.+..+.+... |...-..+ T Consensus 392 ~~DdD~i~~~~~~~~~~~i~~~v~~~~~L~l~t~~~q~~l~~~~~lTP~~~~~~~~s~~~------------~~~~~~Pv 459 (801) T protein:vir:33 392 YSDDDPIDVAVSHDRVSTLKYAVPFSEELLLWSDQAQFVLTASDILSSRSVGLNLTTQFD------------VQDRARPH 459 (801) T ss_pred CCCCccEEEEecCCcceeeeEEeecCCcEEEEecCcEEEEeCCCcccceeEEEEEEEeec------------ccCCCCce Confidence 11222 34466688899999999999999999854 4444433332211 22222234 Q ss_pred CcccEEEEecCCCE--------EEEc-CCCcE-EEEe---ccee-----eccc-cccceEEEeC---cEEEEEeC Q lcl|Aclame:pro 344 DGSPVAVWLAENGY--------VMGT-SSGAI-AEVH---AGVL-----AGIT-GRAGTSVVFD---RRLLTAVS 396 (396) Q Consensus 344 ~~~~~~lw~s~~Gl--------v~g~-~~G~~-~~lt---~~~~-----~~~~-a~~~~~~~~~---rr~v~~~~ 396 (396) ..+..++|+++.|= ..-. .++=+ ..|| ...| .... ...-..+++. ...+.++. T Consensus 460 ~vg~~v~f~~~~g~~~~v~r~~~~~~~~d~y~~~Dlt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~ 534 (801) T protein:vir:33 460 GVGRNVYFSSPRASFTSINRYYAVQDVSSVKNAEDMTAHVPNYIPNGVFSISGTTAENFVAILTSGAPNRVYIYK 534 (801) T ss_pred EecCeEEEEecCCCeeEEEEEEeecccccceehhhHHHHHHHhcCCceEEEEEcCCCCeEEEEEecCCCEEEEEE Confidence 45668999999982 1111 22211 1111 1111 1100 0011111111 11111111 No 25 >protein:vir:6326 Length: 826 # NCBI annotation: tail tubular protein B # Family: family:all:825 # MgeID: mge:132 # MgeName: phiKMV # Cross-refs: genbank:acc:NP_877473;genbank:gi:33300845;uniprot:Q7Y2D1;genbank:GeneID:1482615 Probab=98.06 E-value=5.6e-06 Score=49.38 Aligned_cols=373 Identities=12% Similarity=0.114 Sum_probs=157.9 Q ss_pred CCc--ccccce-eccCCcCChhheeeCCCchhhheeeeeeeeeCCCccEEECCcceeecCCcccc-ccCCcc-------- Q lcl|Aclame:pro 1 MAT--TSLVPL-AGINNVAEDAALQRGGESPRLYVRDAVNIDLSPAGKAQLRASVRQVTDQPFRQ-LWQSPL-------- 68 (396) Q Consensus 1 m~~--~~~~p~-~G~nn~~~~~~L~~~~~~~~~~lrdAvNVD~~~~G~l~~R~G~~~~~~~~~~~-lw~s~~-------- 68 (396) |+. -.++-| .||--..+..++ ...+++++|.=.+..|.+++|+|...+...+..+ .+-.|. T Consensus 1 M~~i~~s~~n~~~GvSqq~d~~r~-------~~q~~~~~N~~~~~~~G~~rRpg~~~v~~~~~~~~~~~~~~~~~~~r~~ 73 (826) T protein:vir:63 1 MSYKQSAYPNLLMGVSQQVPFERL-------PGQLSEQINMVSDPVSGLRRRSGIELMAHLLHTDQPWPRPFLYHTNLGG 73 (826) T ss_pred CceeeeecchhhcceeccCchHhh-------hhhhhhhhcceeeccCCcccCchhHhhhhhccCCccccccEEEEEecCC Confidence 886 344455 777666666666 3469999999999999999999999987665422 111111 Q ss_pred --cccEEEEECCeEEEEecCCCceeec-c---c-cc---CcceehhhcCCeEEEEcCCcceeecCceeeeccccCCcc-c Q lcl|Aclame:pro 69 --HGDAFGALGDQWGKVDPHSWTFEPL-A---Q-IG---EGDLSHEVLNNRVCVAGTAGIFTYDGAQAERLTLDTPAP-P 137 (396) Q Consensus 69 --~~~~~~~~dg~L~~i~~~~w~~~vl-~---~-ig---~gpV~~~v~n~rvy~t~~~~~~~~~g~~~~~l~ip~Pa~-p 137 (396) ..+++...+|.|.-++.++....+- . . +- ...+.+.+.+|-+|+++...+-.++.-...+ ..|.. . T Consensus 74 ~~~~~~~~~~~g~irv~~~~~g~~~~~~~~~~~y~~~~~~~~l~~~t~aD~~fi~n~~~~p~~~~~~~~~---~~~~~~~ 150 (826) T protein:vir:63 74 RSIAMLVAQHRGELYLFDERDGRLLMGQPLVHDYLKANDYRQLRAATVADDLFIANLSVKPEADRTDIKG---VDPNKAG 150 (826) T ss_pred CceEEEEEecCCcEEEEEcCCCeEEEcCCCCCceeeecCccceEEEEeCCEEEEEeCCeeeeeccccccc---cCCCCcE Confidence 1233334466776566665544321 1 1 11 1247778888999999876543321110000 00100 0 Q ss_pred eeecCCCCcccceE---------------EEEEEEEcCCCccccccc------------ceeEec--------------- Q lcl|Aclame:pro 138 LLVAGAGSLSQGTY---------------GAAVAWLRGPQESAPSLI------------AFAEVT--------------- 175 (396) Q Consensus 138 ~~~~~~Gsl~~g~y---------------~ya~T~V~~~gEeg~~~~------------~S~~vt--------------- 175 (396) +.-...|.- .-+| ....||-+..+++..... +..... T Consensus 151 ~~~v~~g~Y-~~~y~vti~~~~~~~gt~~s~t~t~~t~~~~~a~~~~~~~~~~~s~~yia~~l~~~~~a~~~~~~~~~t~ 229 (826) T protein:vir:63 151 WLYIKAGQY-SKAFSMTIKVKDNATGTTYSHTATYVTPDNASTNPNLAEAPFQTSVGYIAWQLYGKFFGAPEYTLPNSTK 229 (826) T ss_pred EEEeecccc-CceEEEEEEeccccCCccccceEEEEeccCCcccccccccceeeeeeeeeeeceeeeeeccccccCCCcc Confidence 000000110 0000 011112211111110000 000000 Q ss_pred -----------------------CCCcc--------EE-----------------------EeecCCCCC---cceEE-- Q lcl|Aclame:pro 176 -----------------------DAGAL--------EV-----------------------TFPLCLDAS---VTGAR-- 196 (396) Q Consensus 176 -----------------------~~~~~--------~v-----------------------~lp~~~~~~---i~~~R-- 196 (396) ..+.+ .+ .||...+.. ...+. T Consensus 230 ~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~l~~~~p~~~~~~~~~~~~ 309 (826) T protein:vir:63 230 KYPKVDPDANAATIAGYLNQRGVQDGYIAFRGDADIHVEVSTDMGNNYGIASGGMSLNATADLPALLPGVGAPGVGVQFM 309 (826) T ss_pred ccceecCCcccceeecceeEecccccEEEEeeCCcccEEEccCCCCcceEEEEEeeccceeeccccCCCcccceEEEeeE Confidence 00000 00 000000000 00000 Q ss_pred E-EEEecC--CCeEEEEEeeccee--------------------------EEEEcCCchhhcccccchhcCCcCC--C-- Q lcl|Aclame:pro 197 L-YLTRAN--GGELLLAGDYPLGA--------------------------ATVILPTLPELGRPAQFRHLSPMPT--G-- 243 (396) Q Consensus 197 I-Yrs~~~--g~~~~lv~e~~~~~--------------------------~~~~d~~~~~lg~~l~t~~~~ppP~--g-- 243 (396) . +.-.++ .+.+|+..+-.-++ .+|.....++..+.+--..-.|.|. | T Consensus 310 ~~~~~~~g~~~d~~y~~~~~~~~~w~e~~~~~~~~~~~tmp~~l~~~~~~~~f~~~~~~w~~r~~Gd~~tnp~psf~g~~ 389 (826) T protein:vir:63 310 DGAVMATGSTKAPVYFEWDSANRRWAERAAYGTDWVLKKMPLALRWDEATDTYSLNELEYDRRGSGDEDTNPTFNFVTRG 389 (826) T ss_pred EeEEecCCCcccceEEEEEcCCceEEEEeecCcccccccceEEEEEeccCCeEEEeccccccccccccccCCCccccCCC Confidence 0 000000 01111111100000 0000000000000000011123333 3 Q ss_pred -ceeeccCCEEEEEECCEEEEccCCCCccccccc-------ccEec------CcceEEEEEcCCcEEEEEcCcEEEEEcC Q lcl|Aclame:pro 244 -KHLAYWRGRLLIARANVLRFSEALAYHLHDERY-------GFVQM------PQRITFVQPVDGGIWVGQVDHVAFLDGA 309 (396) Q Consensus 244 -~~~~~~nGrl~~a~Gn~l~fSEp~~p~aw~~~y-------~~~~~------~~~I~~i~~v~~gl~V~T~~~~y~l~G~ 309 (396) .-+.|+++||..+.++.||+|....++-|-..- +-+.+ ...|.-+.++..+|+++|++.-|.|+|. T Consensus 390 ~~~v~f~q~RL~f~~~~~v~~Srtgd~~nF~~~s~~~~~DdD~I~~~~ss~~~~~i~~~v~~~~~L~l~T~~~q~~ls~~ 469 (826) T protein:vir:63 390 ITGMTTFQGRLVLLSQEYVCMSASNNPHRWFKKSAAALNDDDPIEIAAQGSLTEPYEHAVTFNKDLIVFAKKYQAVVPGG 469 (826) T ss_pred ceEEEEEeceEEEeeCCeEEEEccCCccccccccccCCCCCccEEEEEcCCcceeeEEEeecCCcEEEEecCcEEEEeCC Confidence 247899999999999999999998888772221 11111 3456678889999999999999999985 Q ss_pred c---hhheeeeeeccCCCcccceeecchhhhccccccCcccEEEEecCCC-----E----EEEcCCCc--EEEEecceee Q lcl|Aclame:pro 310 D---PASLSVSRRASRAPVPGSAVLVPAEVVGTNASPDGSPVAVWLAENG-----Y----VMGTSSGA--IAEVHAGVLA 375 (396) Q Consensus 310 ~---p~~m~~~~~~~~~p~~~s~~~~~~~~~~~~~~~~~~~~~lw~s~~G-----l----v~g~~~G~--~~~lt~~~~~ 375 (396) + |++.+..+.+..- |-..-.....+..++|++++| + .....++. ...+|.-.-. T Consensus 470 ~~lTP~~~~i~~~s~~~------------~~~~~~Pv~vG~~v~Fv~~~g~~~s~v~e~~~~~d~~~~y~~~dlt~~~~~ 537 (826) T protein:vir:63 470 GIVTPRTAVISITTQYD------------LDTRAAPAVTGRSVYFAAERALGFMGLHEMAPSPSTDSHYVAEDVTSHIPS 537 (826) T ss_pred CcccceeEEEEEEEeec------------ccCCCCceEeCCeEEEEecCCCceeEEEEEEeeeccccceehhHHHHHHHH Confidence 4 4444443332211 112222334566899999877 2 11222322 1111111101 Q ss_pred ccccc--cceEEEeCcEE---------EEEeC Q lcl|Aclame:pro 376 GITGR--AGTSVVFDRRL---------LTAVS 396 (396) Q Consensus 376 ~~~a~--~~~~~~~~rr~---------v~~~~ 396 (396) .++.. .-+...-+..+ +.++. T Consensus 538 l~~~~v~~~a~s~~~~~v~~~~~~dg~l~~~~ 569 (826) T protein:vir:63 538 YMPGPAEYIQAAASSGYLVFGTSTADEMICHQ 569 (826) T ss_pred hcCCCeEEEEEcCCCCEEEEEEcCCCEEEEEE Confidence 11100 00111111111 11111 No 26 >protein:vir:7021 Length: 803 # NCBI annotation: tail protein # Family: family:all:825 # MgeID: mge:141 # MgeName: SP6 # Cross-refs: genbank:acc:NP_853594;genbank:gi:31711676;genbank:GeneID:1481802 Probab=98.00 E-value=7.5e-06 Score=48.67 Aligned_cols=374 Identities=12% Similarity=0.037 Sum_probs=152.0 Q ss_pred CCc-ccccce-eccCCcCChhheeeCCCchhhheeeeeeeeeCCCccEEECCcceeecCCc---cccccCC-----cccc Q lcl|Aclame:pro 1 MAT-TSLVPL-AGINNVAEDAALQRGGESPRLYVRDAVNIDLSPAGKAQLRASVRQVTDQP---FRQLWQS-----PLHG 70 (396) Q Consensus 1 m~~-~~~~p~-~G~nn~~~~~~L~~~~~~~~~~lrdAvNVD~~~~G~l~~R~G~~~~~~~~---~~~lw~s-----~~~~ 70 (396) |-. -.++-| .||--..+..++ ...+++++|.=.+..|.+++|+|.+.+..+. -++.|-. .++. T Consensus 1 ~~v~~s~~nl~~GvSqQ~d~~R~-------~~q~~~~~N~~~~~~gGl~rRpGt~~va~l~~~~~~~~~~~~~~~~~~~~ 73 (803) T protein:vir:70 1 MEVQGSLGRQIQGISQQPPAVRL-------DGQCSEMVNMVPDVVEGTKSRMGTTHIAKLLEYGEDDMAVHHYRRGGEGE 73 (803) T ss_pred CeEEeecchhccccccCchHHhh-------hhhhhhhhcceeeeccccccCChhhhhhhhcCCCcccceeeEEEecCCCc Confidence 655 344445 565544455555 3469999999999999999999998875532 1222211 1111 Q ss_pred ---cEEEEECCeEEEEecCCCceeec---------c--cccCcceehhhcCCeEEEEcCCcceeec-------------- Q lcl|Aclame:pro 71 ---DAFGALGDQWGKVDPHSWTFEPL---------A--QIGEGDLSHEVLNNRVCVAGTAGIFTYD-------------- 122 (396) Q Consensus 71 ---~~~~~~dg~L~~i~~~~w~~~vl---------~--~ig~gpV~~~v~n~rvy~t~~~~~~~~~-------------- 122 (396) +++..-++.|.=++.++-...+. . ......+.+.+..|-+|+++...+-..+ T Consensus 74 e~~~~~~~~~~~irv~~~~G~~~~v~~~~~~~~~l~~~~~~~~~l~~~tvaD~~fi~n~~~~~~~~~~~~~~~~~~~~~~ 153 (803) T protein:vir:70 74 EEYFFIMKKGQVPEIFDKQGRKCMVQSQDAPMTYLSEVTNPREDVQFMTIADVTFMLNRKKIVKARPERSPQVGSTAIVF 153 (803) T ss_pred eEEEEEEecCCeEEEEEcCCcEEEEecCCceeEEEeecCCChhheeEEEEcCEEEEecCceeeeeccccCCCCCCceEEE Confidence 11111233333222333222111 1 0112347777788889998854322110 Q ss_pred ---C--ceeeeccccCCccc-eeec-------------------------CCCCcc------cceEEEEEEEEcC---C- Q lcl|Aclame:pro 123 ---G--AQAERLTLDTPAPP-LLVA-------------------------GAGSLS------QGTYGAAVAWLRG---P- 161 (396) Q Consensus 123 ---g--~~~~~l~ip~Pa~p-~~~~-------------------------~~Gsl~------~g~y~ya~T~V~~---~- 161 (396) + +..|...|...... ..+. ..++.+ .+++.|. ...+. + T Consensus 154 vr~g~y~~~y~itIng~~~a~~~t~~~~~~~~~~~~~~~~ia~~l~~~~~~~~s~a~~~~~~~g~~~~i-~~~~~~~~~~ 232 (803) T protein:vir:70 154 MAYGQYGTHYKIIIDGVVAAGYKTRDGAEAHHIEDIRTESIAYNLYQSLQSWDKIADYEIQLDGTSIYI-TRRDGSTTFD 232 (803) T ss_pred EeecCCcceEEEEeCCcceEEEEeCCCcccccccccchhhhhhhhhhheeccccccceEEEECCcEEEE-EEcCCCCeeE Confidence 0 01111221110000 0000 000000 0111110 00000 0 Q ss_pred --CcccccccceeEecCCCccEEEeecCCCCCcceEEEEEEecCC-CeEEEEEe-------------------------- Q lcl|Aclame:pro 162 --QESAPSLIAFAEVTDAGALEVTFPLCLDASVTGARLYLTRANG-GELLLAGD-------------------------- 212 (396) Q Consensus 162 --gEeg~~~~~S~~vt~~~~~~v~lp~~~~~~i~~~RIYrs~~~g-~~~~lv~e-------------------------- 212 (396) -+++...-....+...-...-+||...+ +...+.|=.+++.. +.||...+ T Consensus 233 ~~t~~g~~~~~~~~~~~~v~~~~~Lp~~~~-~g~~v~v~~~g~~~~d~y~v~~~~~~~~~~~w~e~a~~g~~~~~~~~t~ 311 (803) T protein:vir:70 233 ITTEDGAKGKDLVAIKYKVASTDLLPSRAP-EGYKVQVWPTGSKPESRYWLQAEKQNGNIVSWKETLAADVLIGFDKSTM 311 (803) T ss_pred EEeecCcCCcEEEEEEecccceeeccccCC-CCceEEEEcCCCCCCceeeEEEEeccCCccceEeeeccceeeeeecccc Confidence 0000010011111110001112332221 11111211111100 11221111 Q ss_pred -------ecc-eeEEEEcCCchhhcccccchhcCCcCC------C---ceeeccCCEEEEEECCEEEEccCCCCcccccc Q lcl|Aclame:pro 213 -------YPL-GAATVILPTLPELGRPAQFRHLSPMPT------G---KHLAYWRGRLLIARANVLRFSEALAYHLHDER 275 (396) Q Consensus 213 -------~~~-~~~~~~d~~~~~lg~~l~t~~~~ppP~------g---~~~~~~nGrl~~a~Gn~l~fSEp~~p~aw~~~ 275 (396) ..+ +..++.....+...+.+-...-.|.|. + .-+.++++||..+.++.||+|.+..++-|-.. T Consensus 312 p~~~v~~~~~~~~~~~~~~~~~~~~r~~gdd~tnp~psf~~~~~~~~~~~v~f~q~RL~f~~~~~v~~Srtgd~~nF~~~ 391 (803) T protein:vir:70 312 PYIIERTGFVNGIAQFKIRQGDWEDRKVGDDLTNPMPSFIDEEVPQTLGGMFMVQNRLCVTAGEAVIATRTSYFFDFFRY 391 (803) T ss_pred cEEEEEEEEeecceeEEEEeeccccccccccccCccccccCccCCCCceeEEEEeceEEEeeCCeEEEEccCCccccccc Confidence 000 011222222222222111111223332 2 23789999999999999999999888776322 Q ss_pred c-------ccEec------CcceEEEEEcCCcEEEEEcCcEEEEEcCc---hhheeeeeeccCCCcccceeecchhhhcc Q lcl|Aclame:pro 276 Y-------GFVQM------PQRITFVQPVDGGIWVGQVDHVAFLDGAD---PASLSVSRRASRAPVPGSAVLVPAEVVGT 339 (396) Q Consensus 276 y-------~~~~~------~~~I~~i~~v~~gl~V~T~~~~y~l~G~~---p~~m~~~~~~~~~p~~~s~~~~~~~~~~~ 339 (396) - +-+.+ ...|.-+.++...|+++|++.-|.|+|.+ |++.+..+.+... |... T Consensus 392 t~~~~~DdD~I~~~~ss~~~~~i~~~v~~~~~L~i~T~~~q~~l~g~~~lTP~~~~i~~~s~~~------------~~~~ 459 (803) T protein:vir:70 392 TAVSAVATDPFDVFSDASEVYQLKHAVTLDGSTVLFADKSQFILPGDKPLEKSNVLLKPVTTFE------------VNNN 459 (803) T ss_pred cccCCCCCccEEEEecCCcceeeEEEeecCCcEEEEecCcEEEEeCCCcccceeEEEEEEEEee------------ccCC Confidence 1 11222 35566788999999999999999999854 3444433332211 2222 Q ss_pred ccccCcccEEEEecCCCE-------EEEcCCC-----cEEEEecceeeccccccceEEEeCcEEEEEeC Q lcl|Aclame:pro 340 NASPDGSPVAVWLAENGY-------VMGTSSG-----AIAEVHAGVLAGITGRAGTSVVFDRRLLTAVS 396 (396) Q Consensus 340 ~~~~~~~~~~lw~s~~Gl-------v~g~~~G-----~~~~lt~~~~~~~~a~~~~~~~~~rr~v~~~~ 396 (396) -..+..+..++|+++.|= .-...+. .++.+-...|... -..-++..-+..++.... T Consensus 460 ~~Pv~vg~~v~fv~~~g~~s~vre~~~~~~~d~y~a~Dlt~~a~hl~~~~-v~~~~~~~~~~~~v~~~~ 527 (803) T protein:vir:70 460 VKPVATGESVMFATSEGAYSGIREFYTDSYSDTKKAQAITSHVNKLLEGN-VIMMSASTNVNRLLVLTD 527 (803) T ss_pred CccEEeCCeEEEeccCCCeeEEEEEeccccccceehhhhhhhhHhhcCCc-eEEEEEeCCCCeEEEEEE Confidence 223445678999999872 1111110 0111111111110 000001011111111111 No 27 >protein:vir:63741 Length: 468 # NCBI annotation: Cps # Family: family:all:2450 # MgeID: mge:1517 # MgeName: P100 # Cross-refs: genbank:gi:82547622;genbank:GeneID:3783474 Probab=97.94 E-value=1.7e-07 Score=57.71 Aligned_cols=254 Identities=18% Similarity=0.174 Sum_probs=101.8 Q ss_pred CCcccccceeccCCc--CC--hhheeeCCCchhhheeeeeeeeeCCCccEEECCcceeecCCccccccCCc-ccccEEEE Q lcl|Aclame:pro 1 MATTSLVPLAGINNV--AE--DAALQRGGESPRLYVRDAVNIDLSPAGKAQLRASVRQVTDQPFRQLWQSP-LHGDAFGA 75 (396) Q Consensus 1 m~~~~~~p~~G~nn~--~~--~~~L~~~~~~~~~~lrdAvNVD~~~~G~l~~R~G~~~~~~~~~~~lw~s~-~~~~~~~~ 75 (396) |-.+-..-|=|=++. ++ +.+||- ||-. .|.|+- .+.-|+|-.+- -+.||..- ..+..|++ T Consensus 150 a~tiE~a~FyGds~l~~s~~~~~glqf--DGi~-~li~~e--------nviDa~G~~ls----~~~lneaa~~i~~gfG~ 214 (468) T protein:vir:63 150 AKTIEWASFFGDSDLSDSPEPQAGLEF--DGLA-KLINQD--------NVHDARGASLT----ESLLNQAAVMISKGYGT 214 (468) T ss_pred HHHHHHHhhhcccccccCCCccccccc--ccee-EEecCC--------ceeccCCCccC----HHHHHHHhhhccccccC Confidence 344555556555555 22 234443 3222 344433 33444444332 12232211 11233444 Q ss_pred ECCeEEEE-------ecC-CCcee-ecc---------cc-----cCcceehhhcCCeEEEEcCCcceeecCceeeecccc Q lcl|Aclame:pro 76 LGDQWGKV-------DPH-SWTFE-PLA---------QI-----GEGDLSHEVLNNRVCVAGTAGIFTYDGAQAERLTLD 132 (396) Q Consensus 76 ~dg~L~~i-------~~~-~w~~~-vl~---------~i-----g~gpV~~~v~n~rvy~t~~~~~~~~~g~~~~~l~ip 132 (396) .-+-+.-+ +.. .-+++ .++ .+ .+|.|.- ++-++.-+...+ .- .....+ T Consensus 215 ~td~~~~~~v~a~~~~~~L~~q~~v~~~n~~~~~~G~~v~g~~sa~G~I~l---~gs~il~~~~~l-~~-----~~~~~~ 285 (468) T protein:vir:63 215 PTDAYMPVGVQADFVNQQLSKQTQLVRDNGNNVSVGFNIQGFHSARGFIKL---HGSTVMENEQIL-DE-----RILALP 285 (468) T ss_pred hhhhhcchhHHhhhhhhhcCceEEEEcCCCCceeeeecccceecceeeeee---cCceeeccccCC-Cc-----cccccc Confidence 33322100 000 00001 000 00 0122221 111111111110 00 000111 Q ss_pred -CCccc-e-ee--cCC----CCcccceEEEEEEEEcCCCcccccccceeEecC---CCccEEEeecCCCCCcceEEEEEE Q lcl|Aclame:pro 133 -TPAPP-L-LV--AGA----GSLSQGTYGAAVAWLRGPQESAPSLIAFAEVTD---AGALEVTFPLCLDASVTGARLYLT 200 (396) Q Consensus 133 -~Pa~p-~-~~--~~~----Gsl~~g~y~ya~T~V~~~gEeg~~~~~S~~vt~---~~~~~v~lp~~~~~~i~~~RIYrs 200 (396) +|+++ + .+ .+. ...++++|.|++++|+..||+-++..+.+.|+- +.++++++++......+.++|||+ T Consensus 286 ~Apsp~~vsaT~~~~~~g~~~~~~~a~y~Y~v~~vs~~GES~pS~~vtvTVaa~~dg~~ltIt~~~~~~~~p~yv~IYR~ 365 (468) T protein:vir:63 286 TAPQPAKVTATQEAGKKGQFRAEDLAAHEYKVVVSSDDAESIASEVATATVTAKDDGVKLEIELAPMYSSRPQFVSIYRK 365 (468) T ss_pred ccccCCccceeeecccCCcccCCCcceEEEEEEEECCCCccccccceEEEecCcccceeEEEEecCCCCCcceEEEEEEe Confidence 12111 1 11 111 123667899999999999955554446666553 455677777666655677999999 Q ss_pred ecCCCeEEEEEeeccee-----EEEEcCC--chhhc---------ccccchhcCCc---CCCc------eeeccCCEEEE Q lcl|Aclame:pro 201 RANGGELLLAGDYPLGA-----ATVILPT--LPELG---------RPAQFRHLSPM---PTGK------HLAYWRGRLLI 255 (396) Q Consensus 201 ~~~g~~~~lv~e~~~~~-----~~~~d~~--~~~lg---------~~l~t~~~~pp---P~g~------~~~~~nGrl~~ 255 (396) ..||++||+++.++++. .+|.|.. .+.-+ ++.+...|.|| |-.. ++-+|-|-|+- T Consensus 366 ~~gg~~f~li~~va~~~a~~gt~tf~D~n~~iPgT~~~fVgem~~~~i~~~~llpm~~lplA~~n~~~~~~Vl~Ygalal 445 (468) T protein:vir:63 366 GAETGLFYLIARVPASKAENNVITFYDLNDSIPETVDVFVGEMSANVVHLFELLPMMRLPLAQINASVTFAVLWYGALAL 445 (468) T ss_pred CCCCcceeEeeeEeeeecCCCeEEEEcCCcccCCCcceeeeecChhHHHHHHHhccccCChhHhccchhhhhhhhhHHhh Confidence 99999999998877554 4677642 22111 11122233333 2111 22333333322 Q ss_pred EECCEEEEccCCCCcccccccccEecCcceEEEEEc Q lcl|Aclame:pro 256 ARANVLRFSEALAYHLHDERYGFVQMPQRITFVQPV 291 (396) Q Consensus 256 a~Gn~l~fSEp~~p~aw~~~y~~~~~~~~I~~i~~v 291 (396) + .|--|=.-+| +- ..+|--+... T Consensus 446 ~-----------~Pk~~~~ikN-v~-~~~~~~~~~~ 468 (468) T protein:vir:63 446 R-----------APKKWVRIRN-VK-YIPVKNVHSN 468 (468) T ss_pred h-----------ccccceEEEE-ee-eeeeccccCC Confidence 2 2222211111 00 0111111111 No 28 >protein:vir:80491 Length: 467 # NCBI annotation: Cps # Family: family:all:2450 # MgeID: mge:1883 # MgeName: A511 # Cross-refs: genbank:acc:YP_001468466;genbank:gi:157325041;genbank:GeneID:5601449 Probab=97.94 E-value=1.8e-07 Score=57.60 Aligned_cols=254 Identities=18% Similarity=0.174 Sum_probs=101.7 Q ss_pred CCcccccceeccCCc--CC--hhheeeCCCchhhheeeeeeeeeCCCccEEECCcceeecCCccccccCCc-ccccEEEE Q lcl|Aclame:pro 1 MATTSLVPLAGINNV--AE--DAALQRGGESPRLYVRDAVNIDLSPAGKAQLRASVRQVTDQPFRQLWQSP-LHGDAFGA 75 (396) Q Consensus 1 m~~~~~~p~~G~nn~--~~--~~~L~~~~~~~~~~lrdAvNVD~~~~G~l~~R~G~~~~~~~~~~~lw~s~-~~~~~~~~ 75 (396) |-.+-..-|=|=++. ++ +.+||- ||-. .|.|+- .+.-|+|-.+- -+.||..- ..+..|++ T Consensus 149 a~tiE~a~FyGds~l~~s~~~~~glqf--DGi~-~li~~e--------nviDa~G~~ls----~~~lneaa~~i~~gfG~ 213 (467) T protein:vir:80 149 AKTIEWASFFGDSDLSDSPEPQAGLEF--DGLA-KLINQD--------NVHDARGASLT----ESLLNQAAVMISKGYGT 213 (467) T ss_pred HHHHHHHhhhcccccccCCCccccccc--ccee-EEecCC--------ceeccCCCccC----HHHHHHHhhhccccccC Confidence 344555556555555 22 234443 3222 344433 33444444332 12232211 11233444 Q ss_pred ECCeEEEE-------ecC-CCcee-ecc---------cc-----cCcceehhhcCCeEEEEcCCcceeecCceeeecccc Q lcl|Aclame:pro 76 LGDQWGKV-------DPH-SWTFE-PLA---------QI-----GEGDLSHEVLNNRVCVAGTAGIFTYDGAQAERLTLD 132 (396) Q Consensus 76 ~dg~L~~i-------~~~-~w~~~-vl~---------~i-----g~gpV~~~v~n~rvy~t~~~~~~~~~g~~~~~l~ip 132 (396) .-+-+.-+ +.. .-+++ .++ .+ .+|.|.- ++-++.-+...+ .- .....+ T Consensus 214 ~td~~~p~~v~a~~~~~~L~~q~~v~~~n~~~~~~G~~v~g~~sa~G~I~l---~gs~il~~~~~l-~~-----~~~~~~ 284 (467) T protein:vir:80 214 PTDAYMPVGVQADFVNQQLSKQTQLVRDNGNNVSVGFNIQGFHSARGFIKL---HGSTVMENEQIL-DE-----RILALP 284 (467) T ss_pred hhhhhcchhHHhhhhhhhcCceEEEEcCCCCceeeeecccceecceeeeee---cCceeeccccCC-Cc-----cccccc Confidence 33322100 000 00001 000 00 0122221 111111111110 00 000111 Q ss_pred -CCccc-e-ee--cCC----CCcccceEEEEEEEEcCCCcccccccceeEecC---CCccEEEeecCCCCCcceEEEEEE Q lcl|Aclame:pro 133 -TPAPP-L-LV--AGA----GSLSQGTYGAAVAWLRGPQESAPSLIAFAEVTD---AGALEVTFPLCLDASVTGARLYLT 200 (396) Q Consensus 133 -~Pa~p-~-~~--~~~----Gsl~~g~y~ya~T~V~~~gEeg~~~~~S~~vt~---~~~~~v~lp~~~~~~i~~~RIYrs 200 (396) +|+++ + .+ .+. ...++++|.|++++|+..||+-++..+.+.|+- +.++++++++......+.++|||+ T Consensus 285 ~Apsp~~vsaT~~~~~~g~~~~~~~a~y~Y~v~~vs~~GES~pS~~vtvTVaa~~dg~~ltIt~~~~~~~~p~yv~IYR~ 364 (467) T protein:vir:80 285 TAPQPAKVTATQEAGKKGQFRAEDLAAHEYKVVVSSDDAESIASEVATATVTAKDDGVKLEIELAPMYSSRPQFVSIYRK 364 (467) T ss_pred ccccCCccceeeecccCCcccCCCcceEEEEEEEECCCCccccccceEEEecCcccceeEEEEecCCCCCcceEEEEEEe Confidence 12111 1 11 111 123667899999999999955554446666553 455677777666655677999999 Q ss_pred ecCCCeEEEEEeeccee-----EEEEcCC--chhhc---------ccccchhcCCc---CCCc------eeeccCCEEEE Q lcl|Aclame:pro 201 RANGGELLLAGDYPLGA-----ATVILPT--LPELG---------RPAQFRHLSPM---PTGK------HLAYWRGRLLI 255 (396) Q Consensus 201 ~~~g~~~~lv~e~~~~~-----~~~~d~~--~~~lg---------~~l~t~~~~pp---P~g~------~~~~~nGrl~~ 255 (396) ..||++||+++.++++. .+|.|.. .+.-+ ++.+...|.|| |-.. ++-+|-|-|+- T Consensus 365 ~~gg~~f~li~~va~~~a~~gt~tf~D~n~~iPgT~~~fVgem~~~~i~~~~llpm~~lplA~~n~~~~~~Vl~Ygalal 444 (467) T protein:vir:80 365 GAETGLFYLIARVPASKAENNVITFYDLNDSIPETVDVFVGEMSANVVHLFELLPMMRLPLAQINASVTFAVLWYGALAL 444 (467) T ss_pred CCCCcceeEeeeEeeeecCCCeEEEEcCCcccCCCcceeeeecChhHHHHHHHhccccCChhHhccchhhhhhhhhHHhh Confidence 99999999998877554 4677642 22111 11122233333 2111 22333333322 Q ss_pred EECCEEEEccCCCCcccccccccEecCcceEEEEEc Q lcl|Aclame:pro 256 ARANVLRFSEALAYHLHDERYGFVQMPQRITFVQPV 291 (396) Q Consensus 256 a~Gn~l~fSEp~~p~aw~~~y~~~~~~~~I~~i~~v 291 (396) + .|--|=.-+| +- ..+|--+... T Consensus 445 ~-----------~Pk~~~~ikN-v~-~~~~~~~~~~ 467 (467) T protein:vir:80 445 R-----------APKKWVRIRN-VK-YIPVKNVHSN 467 (467) T ss_pred h-----------ccccceEEEE-ee-eeeeccccCC Confidence 2 2222211111 00 0111111111 No 29 >protein:vir:78957 Length: 826 # NCBI annotation: putative tail tubular protein B # Family: family:all:825 # MgeID: mge:1860 # MgeName: LKD16 # Cross-refs: genbank:acc:YP_001522826;genbank:gi:158345061;genbank:GeneID:5687447 Probab=97.94 E-value=1e-05 Score=47.95 Aligned_cols=377 Identities=13% Similarity=0.099 Sum_probs=159.2 Q ss_pred CCc--ccccce-eccCCcCChhheeeCCCchhhheeeeeeeeeCCCccEEECCcceeecCCcccc-ccCCccc------- Q lcl|Aclame:pro 1 MAT--TSLVPL-AGINNVAEDAALQRGGESPRLYVRDAVNIDLSPAGKAQLRASVRQVTDQPFRQ-LWQSPLH------- 69 (396) Q Consensus 1 m~~--~~~~p~-~G~nn~~~~~~L~~~~~~~~~~lrdAvNVD~~~~G~l~~R~G~~~~~~~~~~~-lw~s~~~------- 69 (396) |+- -.++-| .||--..+..++ ...+++++|.=++..|.+++|+|.+.+...+..+ .+..|.. T Consensus 1 M~~i~~~~~nl~gGvSqq~d~~r~-------~~q~~~~~N~~~~~~gG~~rRpgt~~va~~~~~~~~~~~~f~~~~~r~s 73 (826) T protein:vir:78 1 MSYKQSAYPNLLMGVSQQVAFERL-------PGQLSEQINMVSDPVSGLRRRSGIELMASLLHTDQPWPRPYLYHTNLGG 73 (826) T ss_pred CcceeeecchhccceecccchHhh-------hhhhhhhhcceeccccccccCCchHhhhhhccCCcCCceeEEEEeccCC Confidence 886 345555 677666666666 3469999999999999999999999876554311 1111111 Q ss_pred ---ccEEEEECCeEEEEecCCCceeecc--------cccCcceehhhcCCeEEEEcCCcceeecCcee--eecc----cc Q lcl|Aclame:pro 70 ---GDAFGALGDQWGKVDPHSWTFEPLA--------QIGEGDLSHEVLNNRVCVAGTAGIFTYDGAQA--ERLT----LD 132 (396) Q Consensus 70 ---~~~~~~~dg~L~~i~~~~w~~~vl~--------~ig~gpV~~~v~n~rvy~t~~~~~~~~~g~~~--~~l~----ip 132 (396) .+++..-+|.|.-++.++....+.. ...+-.+.+.+.+|-+|+++...+-..+.... ..+. +. T Consensus 74 ~e~~~~l~~~~g~irv~~~~~g~~~~~~~~~~~y~~~~~~~~l~~~t~aD~~fi~n~~~~p~~~~~~~~~~~~~~~~~~~ 153 (826) T protein:vir:78 74 RSIAMLVAQHRGELYLFDEKDGRLLMGQPLVHDYLKASDYRQLRAATVADDLFIANLEVRPEADKADVLGVDPSKTGWLY 153 (826) T ss_pred cceEEEEEEcCCcEEEEECCCCEEEEecCcccceeecCCcceeEEEEEcCEEEEEcCcEeeeeccccccCCCCCceEEEE Confidence 1222334566654555555433311 11122467777888999988654322110000 0000 00 Q ss_pred CCccce----eecCCCCcccc--eEEEEEEEEcCCCcccc---ccc---------------------------------- Q lcl|Aclame:pro 133 TPAPPL----LVAGAGSLSQG--TYGAAVAWLRGPQESAP---SLI---------------------------------- 169 (396) Q Consensus 133 ~Pa~p~----~~~~~Gsl~~g--~y~ya~T~V~~~gEeg~---~~~---------------------------------- 169 (396) ..+.+. ...-+|+-..+ +..-..+|.+..+.... ... T Consensus 154 v~~g~y~~~y~v~i~~~~~~~~~~~s~t~~y~t~~~~~~~~~~~~~~~~~~~~~~a~~l~~~~~~~~~~~~~~~t~~~~~ 233 (826) T protein:vir:78 154 IKAGQYSKAFSLTIKVKDNATGTTYSHTATYVTPDNASTNPNLAEAPFQTSVGYIAWQLFGKFFGAPEYTLPNSTKKYPK 233 (826) T ss_pred ecccccCceeEEEeccceeecccccceeEEEEeccCCccccccccccceecchhhheecceeeccccceeeeccceeEee Confidence 000000 00000000000 00001111111100000 000 Q ss_pred -------------------------------ceeEecCCCc---------cEEE----eecCCCC--C-cceEEE---EE Q lcl|Aclame:pro 170 -------------------------------AFAEVTDAGA---------LEVT----FPLCLDA--S-VTGARL---YL 199 (396) Q Consensus 170 -------------------------------~S~~vt~~~~---------~~v~----lp~~~~~--~-i~~~RI---Yr 199 (396) .+...+...+ ..++ +|..++. . ...+++ +. T Consensus 234 ~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~v~~~~~l~a~~p~~~~~~~~~~~~~~~~ 313 (826) T protein:vir:78 234 VDPDPAAATVAGYLNQRGVQDGYIAFRGDGDIVVEVSTDMGNNYGIASGGMSLNATADLPALLPGAGTPGTGVQFMDGAI 313 (826) T ss_pred ccccccceeeccceeecccccceEEEecCCCeEEEeccCCCccceEEEeeEEEecccceeeeecccccceEEEEEEeeeE Confidence 0000000000 0000 0111110 0 000110 01 Q ss_pred EecCC--CeEEEEEeecce--------------------------eEEEEcCCchhhcccccchhcCCcCC--C---cee Q lcl|Aclame:pro 200 TRANG--GELLLAGDYPLG--------------------------AATVILPTLPELGRPAQFRHLSPMPT--G---KHL 246 (396) Q Consensus 200 s~~~g--~~~~lv~e~~~~--------------------------~~~~~d~~~~~lg~~l~t~~~~ppP~--g---~~~ 246 (396) ..++. +.+|+..+-..+ ..+|.....++..+.+-...-.|.|. | .-+ T Consensus 314 ~~~g~~~~~~y~~~~~~~~~w~e~a~~g~~~~~~tmp~~l~~~~~~~~f~~~~~~w~~r~~gd~~tnp~psf~g~~i~~v 393 (826) T protein:vir:78 314 MATGSTKAPVYFAWDAANRRWAERAAYGTDWVLKKMPLALRWDESTDTYSLNELEYDRRGSGDEETNPTFNFVKRGITGM 393 (826) T ss_pred ecCCCcccceeEEEEcCCceEEEeeccCcccccccccEEEEEecCCCeEEEeeccccccccCcccccCcccccCCCceEE Confidence 11111 122221111100 11111111111111111112245553 3 347 Q ss_pred eccCCEEEEEECCEEEEccCCCCccccccc-------ccEec------CcceEEEEEcCCcEEEEEcCcEEEEEcCc--- Q lcl|Aclame:pro 247 AYWRGRLLIARANVLRFSEALAYHLHDERY-------GFVQM------PQRITFVQPVDGGIWVGQVDHVAFLDGAD--- 310 (396) Q Consensus 247 ~~~nGrl~~a~Gn~l~fSEp~~p~aw~~~y-------~~~~~------~~~I~~i~~v~~gl~V~T~~~~y~l~G~~--- 310 (396) .|+++||..+.++.||+|....++-|-..- +-+.+ ...|.-+.++...|+++|++.-|.|+|.+ T Consensus 394 ~f~q~RL~f~~~~~v~~Srtgd~~nF~~~t~~~~~DdD~I~~~~~s~~~~~i~~~v~~~~~L~l~T~~~e~~l~~~~~lT 473 (826) T protein:vir:78 394 TTFQGRLVLLSQEYVCMSASNNPHRWFKKSAAALNDDDPIEIAAQGSLTEPYEHAVTFNKDLIVFAKKYQAVVPGGGIVT 473 (826) T ss_pred EEEeceEEEeeCCeEEEEeccCccccccccccCCCCCCcEEEEEccCcceeEEEEEecCCcEEEEecCcEEEEeCCCccc Confidence 899999999999999999999988773221 11111 34567788899999999999999999953 Q ss_pred hhheeeeeeccCCCcccceeecchhhhccccccCcccEEEEecCCC-----E----EEEcCCCc--EEEEecceeecccc Q lcl|Aclame:pro 311 PASLSVSRRASRAPVPGSAVLVPAEVVGTNASPDGSPVAVWLAENG-----Y----VMGTSSGA--IAEVHAGVLAGITG 379 (396) Q Consensus 311 p~~m~~~~~~~~~p~~~s~~~~~~~~~~~~~~~~~~~~~lw~s~~G-----l----v~g~~~G~--~~~lt~~~~~~~~a 379 (396) |++.+..+.+..- |-..-..+..+..++|++++| + .....++. ...||.-.-..++. T Consensus 474 P~~~~~~~~s~~~------------~~~~~~Pv~vG~~v~F~~~r~~~~s~v~e~~~~~~~~~~y~~~dlt~~~~~l~~~ 541 (826) T protein:vir:78 474 PRTAVISITTQYD------------VDTRAAPAVTGRSVYFAAERALGFMGLHEMAPSPSTDSHYVAEDVTSHIPSYMPG 541 (826) T ss_pred ceeEEEEEEEeec------------ccCCCCceEeCCeEEEEecCCCceeEEEEEEeeecccCccchHHHHHHHHHhcCC Confidence 4444443332211 122222344556899998876 2 11122321 11111111111111 Q ss_pred c--cceEEEeCcEEEE---------EeC Q lcl|Aclame:pro 380 R--AGTSVVFDRRLLT---------AVS 396 (396) Q Consensus 380 ~--~~~~~~~~rr~v~---------~~~ 396 (396) . .-+...-.+.+|- .+. T Consensus 542 ~v~~~a~s~~~~~~v~~~~~~g~l~~~t 569 (826) T protein:vir:78 542 PAEYIQAAASSGYLVFGTSAADEMICHQ 569 (826) T ss_pred CeEEEEEeCCCCeEEEEEcCCCeEEEEE Confidence 1 0011111122221 111 No 30 >protein:vir:8887 Length: 808 # NCBI annotation: tail tubular protein B # Family: family:all:825 # MgeID: mge:161 # MgeName: gh-1 # Cross-refs: genbank:acc:NP_813776;genbank:gi:29366731;genbank:GeneID:1258831 Probab=97.84 E-value=1.5e-05 Score=46.96 Aligned_cols=376 Identities=13% Similarity=0.051 Sum_probs=154.0 Q ss_pred CCc--ccccce-eccCCcCChhheeeCCCchhhheeeeeeeeeCCCccEEECCcceeecCC---cc--cc-cc----CCc Q lcl|Aclame:pro 1 MAT--TSLVPL-AGINNVAEDAALQRGGESPRLYVRDAVNIDLSPAGKAQLRASVRQVTDQ---PF--RQ-LW----QSP 67 (396) Q Consensus 1 m~~--~~~~p~-~G~nn~~~~~~L~~~~~~~~~~lrdAvNVD~~~~G~l~~R~G~~~~~~~---~~--~~-lw----~s~ 67 (396) |+. -.++-| .||--..+..++ ...+++++|.=.+..|.+++|+|.+.+... ++ ++ .| .+- T Consensus 1 M~~v~~s~~n~~~GvSqq~d~~R~-------~~q~~~~~N~~~~~~gG~~rRpgt~~v~~l~~~~~~~~~~~~~~~~~~~ 73 (808) T protein:vir:88 1 MGLVSQSVKNLKGGISQQPDILRF-------SNQGALQINGWSSETQGLQKRPPTTFTKRLQNKGFLGTKPLVHLINRDA 73 (808) T ss_pred CcceeeecchhccceeccchhHhh-------hhhhhhhhcceeeeccccccCCchheeeeeeccCCCCCCcEEEEEEeCc Confidence 886 345555 676655555555 346999999999999999999998887532 21 11 11 111 Q ss_pred ccccEEEEECCeEEEEecCCCceeec-------ccccCcceehhhcCCeEEEEcCCcceeecCc---------------- Q lcl|Aclame:pro 68 LHGDAFGALGDQWGKVDPHSWTFEPL-------AQIGEGDLSHEVLNNRVCVAGTAGIFTYDGA---------------- 124 (396) Q Consensus 68 ~~~~~~~~~dg~L~~i~~~~w~~~vl-------~~ig~gpV~~~v~n~rvy~t~~~~~~~~~g~---------------- 124 (396) ...+.....++.|.-++.++-...+- +.-....+.+.+.+|-+|+++-+.+-..... T Consensus 74 ~~~y~v~~~~~~i~v~~~~G~~~~v~~~~~y~~~~~~~~~l~~~tvaD~~fi~n~~~~~~~~~~~~~~~~~~~~~~~~~~ 153 (808) T protein:vir:88 74 QEQYFVGFSGTGLAVWDLKGNNYTVRGYNGYANCANPRTDLRLITVADYTFVVNRNTVCQMGSTLTHAAYPRLDGRAIIN 153 (808) T ss_pred CceEEEEEeCCeEEEEEcCCceEEEeecCcceEecCChhheeEEEEcCEEEEEcCCcceeecccccccCCCCCCccEEEE Confidence 11122222233443333333222211 1112345778888889999885443321000 Q ss_pred -------eeeeccccCCccce--e---ecCCCCccc---ceE------------------------------EEEEEEEc Q lcl|Aclame:pro 125 -------QAERLTLDTPAPPL--L---VAGAGSLSQ---GTY------------------------------GAAVAWLR 159 (396) Q Consensus 125 -------~~~~l~ip~Pa~p~--~---~~~~Gsl~~---g~y------------------------------~ya~T~V~ 159 (396) ..|.+.|....... . ....|+.+. ++. .|.+.... T Consensus 154 vr~g~y~~~y~i~i~g~~s~~~~t~t~~~~~~s~~~v~~~~~~~~~~~~~~~~~~~ia~~l~~~~~~~~~~~~~~~~~~~ 233 (808) T protein:vir:88 154 VRGGQYGRTLSITINGDGTGSSPQASIKMPNGSAEKVPAGDPYAGMNQVDMTDASWIAAELARQLTVSLGGSGWSFQAGT 233 (808) T ss_pred EcccccCceEEEEEecCCcceeeeEeEEEccCcccceeeccceeecccCCccccccchhhheeeeeecccccceEEEecc Confidence 00111111100000 0 000000000 000 00000000 Q ss_pred CC--Ccccccccc-eeEecCCCc--------cEE----EeecCCCCCcceEEEEEEecC-CCeEEEEEeeccee------ Q lcl|Aclame:pro 160 GP--QESAPSLIA-FAEVTDAGA--------LEV----TFPLCLDASVTGARLYLTRAN-GGELLLAGDYPLGA------ 217 (396) Q Consensus 160 ~~--gEeg~~~~~-S~~vt~~~~--------~~v----~lp~~~~~~i~~~RIYrs~~~-g~~~~lv~e~~~~~------ 217 (396) .. -........ +.....+.+ -.+ .||... ++...+.|=-+.++ .+.+|+..+-..+. T Consensus 234 ~~~~i~~~a~~~~~~~~t~~g~~~~~~~~~~~~v~~~~~lp~~~-p~g~~v~i~~~~~~~~~~~yv~~~~~~~~w~e~~~ 312 (808) T protein:vir:88 234 GWILINAPANDNVRQIATKDGYADTLLSGFIYQVQTFTKLPANA-PPGYLVEITGESARSGDNYWVQYDASGKVWKETAK 312 (808) T ss_pred ceEEEEeccCceeEEEcccCCcCcceeeeeeeeccceeeccccC-CCCcEEEEEecCCCCCceeEEEEEcCCeEEEEeee Confidence 00 000000000 000000000 011 122211 12222333222111 12233322211110 Q ss_pred ---E------------------EEEcCCchhhcccccchhcCCcCC--C---ceeeccCCEEEEEECCEEEEccCCCCcc Q lcl|Aclame:pro 218 ---A------------------TVILPTLPELGRPAQFRHLSPMPT--G---KHLAYWRGRLLIARANVLRFSEALAYHL 271 (396) Q Consensus 218 ---~------------------~~~d~~~~~lg~~l~t~~~~ppP~--g---~~~~~~nGrl~~a~Gn~l~fSEp~~p~a 271 (396) . ++.....++..+..--..-.|.|. | .-+.++++||..+.++.||+|....++- T Consensus 313 ~~~~~~~~~~tmp~~lv~~~~~~~~~~~~~w~~r~~Gd~~tnp~psf~g~~~~~v~f~q~RL~f~~~~~v~~Srtgd~~n 392 (808) T protein:vir:88 313 PKIIAGFNNATLPHALVRAADGQFDWTPLTWDGRNAGDDDTNPMPSFVGATINDVFFFRNRLGFLSGENVVMSRTSKYFN 392 (808) T ss_pred ccceeeecccceeEEEEecCCceEEEEecccccccccccccCccceecCCceeEEEEEcceEEEeeCCeEEEEeccCccc Confidence 0 111111111111111111234444 2 2478999999999999999999999887 Q ss_pred cccccc-------c--Eec----CcceEEEEEcCCcEEEEEcCcEEEEEcCch---hheeeeeeccCCCcccceeecchh Q lcl|Aclame:pro 272 HDERYG-------F--VQM----PQRITFVQPVDGGIWVGQVDHVAFLDGADP---ASLSVSRRASRAPVPGSAVLVPAE 335 (396) Q Consensus 272 w~~~y~-------~--~~~----~~~I~~i~~v~~gl~V~T~~~~y~l~G~~p---~~m~~~~~~~~~p~~~s~~~~~~~ 335 (396) |-..-. - ++. ...|.-+.++...|+++|++.-|.|+|.++ ++.+..+.+... T Consensus 393 F~~~t~~~~~DdD~i~~~~~~~~~~~i~~~v~~~~~L~i~T~~~e~~l~~~~~lTP~~~~~~~~s~~~------------ 460 (808) T protein:vir:88 393 FFPSSVATLSDDDPIDVAISHNRISILKYAVPFSEQLLLWSDQAQFVLSSKTILSSKTIELDLTTEFD------------ 460 (808) T ss_pred ccCCcccCCCCCccEEEEecCCccceeeEEeecCCcEEEEecCcEEEEeCCCcccceeEEEEEEEEec------------ Confidence 732210 0 111 233566889999999999999999998643 333333332111 Q ss_pred hhccccccCcccEEEEecCCCE--------EEEc-CCC-cEEEEe--cce-eecccc------ccceEE-E--eCcEEEE Q lcl|Aclame:pro 336 VVGTNASPDGSPVAVWLAENGY--------VMGT-SSG-AIAEVH--AGV-LAGITG------RAGTSV-V--FDRRLLT 393 (396) Q Consensus 336 ~~~~~~~~~~~~~~lw~s~~Gl--------v~g~-~~G-~~~~lt--~~~-~~~~~a------~~~~~~-~--~~rr~v~ 393 (396) |...-..+..+..++|+++.|= ..-. .++ +...|| ..+ +....- ..-.++ + +++.-+. T Consensus 461 ~~~~~~Pv~vG~~v~f~~~~g~~~~v~r~~~~~~~~d~y~~~dlt~~~~h~~~~~~~~~~~~~~~~~~~v~~~~~~g~l~ 540 (808) T protein:vir:88 461 VSDGARPYGIGRGVYFAAPRASFTSLKRYYAIQDVSDVKSAEDVSAHVPSYITNTVHAIHGSGTENFVSILSDGSPNKVF 540 (808) T ss_pred ccCCCCceEeCCeEEEEecCCCeeEEEEEEEeeeccCceehhhHHHHHHHhcCCCeEEEEEeCCCCeEEEEEEcCCCEEE Confidence 2222234455678999999982 1111 121 111111 111 111000 000111 1 1111121 Q ss_pred EeC Q lcl|Aclame:pro 394 AVS 396 (396) Q Consensus 394 ~~~ 396 (396) .+. T Consensus 541 ~~~ 543 (808) T protein:vir:88 541 IYK 543 (808) T ss_pred EEE Confidence 111 No 31 >protein:vir:96666 Length: 462 # NCBI annotation: ORF016 # Family: family:all:2450 # MgeID: mge:1623 # MgeName: Twort # Cross-refs: genbank:acc:YP_238545;genbank:gi:66391271;genbank:GeneID:5130448 Probab=97.74 E-value=4.6e-07 Score=55.35 Aligned_cols=247 Identities=19% Similarity=0.228 Sum_probs=99.1 Q ss_pred CCcccccceeccCCcCChh---heeeCCCchhhheeeeeeeeeCCCcc-EE----------ECCcceeecCCccccccCC Q lcl|Aclame:pro 1 MATTSLVPLAGINNVAEDA---ALQRGGESPRLYVRDAVNIDLSPAGK-AQ----------LRASVRQVTDQPFRQLWQS 66 (396) Q Consensus 1 m~~~~~~p~~G~nn~~~~~---~L~~~~~~~~~~lrdAvNVD~~~~G~-l~----------~R~G~~~~~~~~~~~lw~s 66 (396) |-++-..-|=|=+..+++. .|+-||= -.|-|+-|| |++.|+ |. -+++|--.+| +|=+ T Consensus 150 a~tiE~a~Fygds~l~~~~~~~gleFDGl---~~lI~~~NV-iDarG~~Ls~~~ln~aa~~i~~~fGt~TD-----~~~p 220 (462) T protein:vir:96 150 AKTIEWASFYGDASLTADPTGQGLEFDGL---AKLIDKDNV-IDAKGESLTETLLNRSAVLIGKSFGTATD-----AYMP 220 (462) T ss_pred HHHHHHHHhhhhcccCCCccccccchhhh---hhhcCCCce-eecCCCCccHHHHhhhhhhcccccCChhh-----eecc Confidence 3334444555544444422 3555442 123355554 223231 11 1122221111 1111 Q ss_pred cccccEEE--EECCeEEEEecCCCceeecccccCcceehhhcCCeEEEEcCCc-----------ceeecCceeeecccc- Q lcl|Aclame:pro 67 PLHGDAFG--ALGDQWGKVDPHSWTFEPLAQIGEGDLSHEVLNNRVCVAGTAG-----------IFTYDGAQAERLTLD- 132 (396) Q Consensus 67 ~~~~~~~~--~~dg~L~~i~~~~w~~~vl~~ig~gpV~~~v~n~rvy~t~~~~-----------~~~~~g~~~~~l~ip- 132 (396) .-.+--|- -.+.+=+-+.+...+.... +++. . |++.-+. +...+-. +...| T Consensus 221 ~~v~a~f~~~~l~~qrv~~~~n~g~~~~G---------~~v~--~-f~s~~G~I~L~~s~~m~~~~i~~~~---~~~~p~ 285 (462) T protein:vir:96 221 IGVHADFVNSVLGRQMQLMQDNSGNVNAG---------YNVQ--G-FYSSRGFIKLHGSTVMENELILDES---LQPLPN 285 (462) T ss_pred hHHHHHHHHhhcCceEEEEcCCCCceeee---------eecc--c-eeeeeeeeeeCCceecCcccccccc---cccCCC Confidence 00000000 0011111111111111111 1111 0 2222111 1112111 11111 Q ss_pred CCcccee--e--cC-C---CC-cccceEEEEEEEEcCCCcccccccceeEec---CCCccEEEeecCCCCCcceEEEEEE Q lcl|Aclame:pro 133 TPAPPLL--V--AG-A---GS-LSQGTYGAAVAWLRGPQESAPSLIAFAEVT---DAGALEVTFPLCLDASVTGARLYLT 200 (396) Q Consensus 133 ~Pa~p~~--~--~~-~---Gs-l~~g~y~ya~T~V~~~gEeg~~~~~S~~vt---~~~~~~v~lp~~~~~~i~~~RIYrs 200 (396) +|+++.+ + .+ . +. -+.++|.|+++.|+..||+.|+..+..+++ ++..++++.+.......+.++|||+ T Consensus 286 ap~~~~vsaTv~t~~~g~f~~~~d~~~y~Y~V~avs~dgeS~PS~~VtaTva~~~~gv~ltIt~~a~~~~~~~~~~IYRk 365 (462) T protein:vir:96 286 APQPATVKATVETGKKGLFTDEHDRAELTYKVVVNSDDAQSAPSEAVTATVNNATDGVKLEISVNAMYQQQPQFVSIYRQ 365 (462) T ss_pred CCCCCceeEEEEeCCCCCCCCccCceeEEEEEEEECCCCccccceeeEeeeecccccceEEEEEcCCccccceEEEEEee Confidence 2222211 1 11 1 12 257899999999999998888777666654 3344555566677777888999999 Q ss_pred ecCCCeEEEEEeecc------eeEEEEcCC--chhhc-----c----cccchhcCCc---CCC------ceeeccCCEEE Q lcl|Aclame:pro 201 RANGGELLLAGDYPL------GAATVILPT--LPELG-----R----PAQFRHLSPM---PTG------KHLAYWRGRLL 254 (396) Q Consensus 201 ~~~g~~~~lv~e~~~------~~~~~~d~~--~~~lg-----~----~l~t~~~~pp---P~g------~~~~~~nGrl~ 254 (396) ..+++.|++++.+++ ++.+|+|.. .|+-. + +++...|.|| |.. ..+-+|-|-|+ T Consensus 366 ~~~sg~y~li~rv~~~~~n~~gt~tf~D~n~~iPgt~~~fVge~~p~vi~~~qllpm~~~plA~~n~~~~waVl~yG~La 445 (462) T protein:vir:96 366 GRKTGDFYLIKRLGMKEVNDEGKLVFYDLNETIPETTDVFVGEMSPQVLHLFELLPMMKLPLAQINASVTFAVLWYGALA 445 (462) T ss_pred cCCccccceeeeeeceeecCCcceeEeeccCCCCCcccceeecCCchhhhhhhhhhhhhcCcccccchhhhhhhhhhHHH Confidence 999999999988754 355665531 22211 0 1111222222 111 01222333222 Q ss_pred EEECCEEEEccCCCCcccccccccEecCcceEEEE Q lcl|Aclame:pro 255 IARANVLRFSEALAYHLHDERYGFVQMPQRITFVQ 289 (396) Q Consensus 255 ~a~Gn~l~fSEp~~p~aw~~~y~~~~~~~~I~~i~ 289 (396) -+ .|--| ... ..|+-|. T Consensus 446 l~-----------~Pk~~------~~i-kNv~~~~ 462 (462) T protein:vir:96 446 LR-----------APKKW------VRI-KNVKYIV 462 (462) T ss_pred hh-----------ccccc------EEE-EEEEEeC Confidence 21 11111 111 1122222 No 32 >protein:vir:95324 Length: 823 # NCBI annotation: hypothetical protein # Family: family:all:780 # MgeID: mge:1564 # MgeName: phiV10 # Cross-refs: genbank:acc:YP_512269;genbank:gi:89152436;genbank:GeneID:3952993 Probab=97.69 E-value=2.9e-05 Score=45.50 Aligned_cols=368 Identities=13% Similarity=0.075 Sum_probs=151.1 Q ss_pred CCcccc--cce-ec-----cCCcCChhheeeCCCchhhheeeeeeeeeCCCccEEECCcceeecCCcccc--------cc Q lcl|Aclame:pro 1 MATTSL--VPL-AG-----INNVAEDAALQRGGESPRLYVRDAVNIDLSPAGKAQLRASVRQVTDQPFRQ--------LW 64 (396) Q Consensus 1 m~~~~~--~p~-~G-----~nn~~~~~~L~~~~~~~~~~lrdAvNVD~~~~G~l~~R~G~~~~~~~~~~~--------lw 64 (396) |+ +.+ .-| .| |--+.+-.++ ...++++.|.=+++.|.+++|+|.+-+...+..+ .. T Consensus 1 m~-i~~~q~sF~~GElsP~l~gR~Dl~ry-------~~q~~~~~N~~~~~~GGl~rRpGt~fva~~~~~~g~~rLipf~~ 72 (823) T protein:vir:95 1 MA-ISWIQPSFAGGEIGPSLYGRIDMAKY-------QVALRKCDNFIVRQYGGVENRPGTRFVGAAKYPNRKCRLIPFQF 72 (823) T ss_pred Cc-ceeechhccCceechheeccchHHHH-------HHHHhhhhCcEeeecCCceecCchhhhhhhcCCCCCeeEEEEEe Confidence 66 333 556 67 7777777777 3468999999999999999999999876443221 11 Q ss_pred CCcccccEEEEECCeEEEEecCCC-------ceeeccc-ccC---cceehhhcCCeEEEEcCCcceee---cCceeeecc Q lcl|Aclame:pro 65 QSPLHGDAFGALGDQWGKVDPHSW-------TFEPLAQ-IGE---GDLSHEVLNNRVCVAGTAGIFTY---DGAQAERLT 130 (396) Q Consensus 65 ~s~~~~~~~~~~dg~L~~i~~~~w-------~~~vl~~-ig~---gpV~~~v~n~rvy~t~~~~~~~~---~g~~~~~l~ 130 (396) +. .-.+++..-++.+ ++-.++. +...++. +.. -.+.|.+..|.+|+++.+.+-.. .+...-.+. T Consensus 73 s~-~q~y~Lefg~~~i-rV~~~~g~vv~~~~~~~ev~tPy~~~~l~~Lr~~qsaD~~fivh~~~~p~~L~r~~~~~w~l~ 150 (823) T protein:vir:95 73 ST-VQTYALEFGHQYM-RVIKDGALVLNSSNVIYEIATPYTEADLFRIKFTQSADVLTLVHPAYPPKELRRYAHDNWQLV 150 (823) T ss_pred CC-CcEEEEEEcCCeE-EEEeCCcEEEecCCceeEEecccccccccceeEEEeccEEEEEcCCccceEEEecCCCCceEE Confidence 11 0012222223333 2322211 1111110 111 25677778899999987653221 111000000 Q ss_pred ----ccCCc---------------c--ceeecCC----CCcccceEEEE----EEEEcCCCcccccccceeEe-c----- Q lcl|Aclame:pro 131 ----LDTPA---------------P--PLLVAGA----GSLSQGTYGAA----VAWLRGPQESAPSLIAFAEV-T----- 175 (396) Q Consensus 131 ----ip~Pa---------------~--p~~~~~~----Gsl~~g~y~ya----~T~V~~~gEeg~~~~~S~~v-t----- 175 (396) ...|. . ..+..+. .....|.+.|. .+.+..+..+....+..... + T Consensus 151 ~~~~~~gp~~~~~~~~t~~v~~~~~~~~~t~ta~~~~~~~d~vg~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 230 (823) T protein:vir:95 151 DVVTKNGPFEDINIDESLTVYASASTGTITLTASASIFGAEQVGKLFYLEQPAVDSVPVWETSKSTSIGDIRRADSNYYR 230 (823) T ss_pred EEEEeccccccccccceeEEeccccCceeEEeecccccchhhccceEEEeccccceeeecceeeeecccceEEeccccee Confidence 00000 0 0000000 00111222111 12222222111111111000 0 Q ss_pred -CCCccEEEeecCCCC--------------CcceEEEEEEecCCCeEEE--EEee--cceeEEEEcCCchhhcccccchh Q lcl|Aclame:pro 176 -DAGALEVTFPLCLDA--------------SVTGARLYLTRANGGELLL--AGDY--PLGAATVILPTLPELGRPAQFRH 236 (396) Q Consensus 176 -~~~~~~v~lp~~~~~--------------~i~~~RIYrs~~~g~~~~l--v~e~--~~~~~~~~d~~~~~lg~~l~t~~ 236 (396) ...+..-++.+.... .....++|.. +++.... +.+. ......+.+... ......+.. T Consensus 231 ~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~g~~~~t~v~~~~~~~~~~~~~~~~~--~~~~~~t~~ 306 (823) T protein:vir:95 231 AVTAGKTGTLRPSHTEGTSWDGWGGSGDDDTGIEWEYLHS--GFGIARITAVNGTTATAEVISYIPSQV--VGEDNASYK 306 (823) T ss_pred eeeccccceeecccCCcceEEeceecccccceeEEEEEeC--CcceEEEEeecceeeeceEeeeecccc--ccCCcCCcc Confidence 000000111111000 0111222221 1111111 1110 011111111111 111122222 Q ss_pred cCCcCC----C--ceeeccCCEEEEEEC----CEEEEccCCCCcccccccc-----cE--ec----CcceEEEEEcCCcE Q lcl|Aclame:pro 237 LSPMPT----G--KHLAYWRGRLLIARA----NVLRFSEALAYHLHDERYG-----FV--QM----PQRITFVQPVDGGI 295 (396) Q Consensus 237 ~~ppP~----g--~~~~~~nGrl~~a~G----n~l~fSEp~~p~aw~~~y~-----~~--~~----~~~I~~i~~v~~gl 295 (396) |...+= | ..+.+|++||..+.+ +.||+|.+..++-|..+-. -+ ++ ...|.-+.+.+ .| T Consensus 307 ~~~~~~~~~~g~Ps~v~f~q~RL~f~g~~~~p~~v~~Srtgd~~nF~~~~~~~DdD~I~~~~s~~~~~~i~~~v~~~-~L 385 (823) T protein:vir:95 307 WAKYAWNSVNGYPGTVVYYQQRLYFAASTAFPQTIWASRTGDYKDFGKSNPTQDDDRIIYTYAGRQVNEIRHLIDVG-SL 385 (823) T ss_pred ccccccCcCCCCccEEEEEeceEEEEEcCCCCcEEEEeccCCccccccccCCCCCCcEEEEEcCCcceEEEEEeecC-cE Confidence 221111 1 568899999998865 7999999888877653311 01 11 34577888885 79 Q ss_pred EEEEcCcEEEEEcC-----chhheeeeeeccCCCcccceeecchhhhccccccCcccEEEEecCCCEEE-----EcC-CC Q lcl|Aclame:pro 296 WVGQVDHVAFLDGA-----DPASLSVSRRASRAPVPGSAVLVPAEVVGTNASPDGSPVAVWLAENGYVM-----GTS-SG 364 (396) Q Consensus 296 ~V~T~~~~y~l~G~-----~p~~m~~~~~~~~~p~~~s~~~~~~~~~~~~~~~~~~~~~lw~s~~Glv~-----g~~-~G 364 (396) +|+|++.-|.++|. +|++.+..+.+... | ..-.....+..++|++++|=.+ -.. ++ T Consensus 386 li~t~~~e~~l~~~~~~~lTP~~~~~~~~s~~g------------~-~~~~Pv~vg~~~~Fv~~~g~~vre~~~~~~~d~ 452 (823) T protein:vir:95 386 VALTSGGEYVITGDQNKVLTPSSFAFSSQGSNG------------S-SNVPPIAVANIALFVQEKGSVVRDLAYSFDVDG 452 (823) T ss_pred EEEecCcEEEEEcCCCcccceeeEEEEEeeccc------------c-ccccceEeCCeEEEEecCCCEEEEEEEeeecCc Confidence 99999999999975 45555554443221 1 1112233456788888887221 001 11 Q ss_pred ----cEEEEecceeeccccccceEEEeCcEE-EEEeC Q lcl|Aclame:pro 365 ----AIAEVHAGVLAGITGRAGTSVVFDRRL-LTAVS 396 (396) Q Consensus 365 ----~~~~lt~~~~~~~~a~~~~~~~~~rr~-v~~~~ 396 (396) -++.+.+-.+....-..-+...-++.+ .++++ T Consensus 453 ~~~~dlT~~a~hl~~~~~i~~~a~~~~p~~~~~~v~~ 489 (823) T protein:vir:95 453 YQGNDLTILANHLFQKHSIVDWCFSIVPYSSAFCIRD 489 (823) T ss_pred eecchhhhhhhhhcCCCceEEEEEecCCCeEEEEEec Confidence 011111111111000000011111221 12222 No 33 >protein:vir:7329 Length: 825 # NCBI annotation: hypothetical protein # Family: family:all:780 # MgeID: mge:143 # MgeName: epsilon15 # Cross-refs: genbank:acc:NP_848220;genbank:gi:30387391;genbank:GeneID:2641863 Probab=97.55 E-value=4.7e-05 Score=44.31 Aligned_cols=365 Identities=12% Similarity=0.065 Sum_probs=148.2 Q ss_pred CCcccc-cce-ec-----cCCcCChhheeeCCCchhhheeeeeeeeeCCCccEEECCcceeecCCcccc--------ccC Q lcl|Aclame:pro 1 MATTSL-VPL-AG-----INNVAEDAALQRGGESPRLYVRDAVNIDLSPAGKAQLRASVRQVTDQPFRQ--------LWQ 65 (396) Q Consensus 1 m~~~~~-~p~-~G-----~nn~~~~~~L~~~~~~~~~~lrdAvNVD~~~~G~l~~R~G~~~~~~~~~~~--------lw~ 65 (396) |.--.+ .-| .| |--+.+..+. .-.++++.|.-+...|.++||+|.+.+...+..+ ..+ T Consensus 1 m~~~~~q~sF~~GElsP~l~gR~Dl~~y-------~~g~~~~~N~~~~p~Gg~~rRpGt~fva~~~~~~~~~rLipF~fs 73 (825) T protein:vir:73 1 MAFSWIQPSFAGGEIGPSLYGRIDMSKY-------QVALRKCDNFIVRQYGGVENRPGTRFVGPAKYPDRKCRLIPFQFS 73 (825) T ss_pred CccceeccccccceechhhcccchHHHH-------HHHHHHhcCcEEEecCCceecCchHHhHhhcCCCCCEEEEEEEeC Confidence 554221 455 55 5555665555 3468899999999999999999999876543211 111 Q ss_pred CcccccEEEEECCeEEEEecCCCc--------eeecccccC---cceehhhcCCeEEEEcCCcceee---cCce---eee Q lcl|Aclame:pro 66 SPLHGDAFGALGDQWGKVDPHSWT--------FEPLAQIGE---GDLSHEVLNNRVCVAGTAGIFTY---DGAQ---AER 128 (396) Q Consensus 66 s~~~~~~~~~~dg~L~~i~~~~w~--------~~vl~~ig~---gpV~~~v~n~rvy~t~~~~~~~~---~g~~---~~~ 128 (396) . .-.+++..-++.|. +..++.. .++--.+.. -.+.|.+..|.+|+++...+-.. .+-. ... T Consensus 74 ~-~q~y~Lefg~~~lr-v~~~gg~v~~~~~~~~e~~TPy~~~~l~~l~~~QsaD~~~i~h~~~pp~~L~r~~~~~W~l~~ 151 (825) T protein:vir:73 74 T-VQTYALEFGHNYMR-VIKDGAYVLTTSNVIYELAMPYADTDLFRIKFTQSADVLTLVHPAYPPKELRRYAHDNWQIVD 151 (825) T ss_pred C-CcEEEEEEeCCeEE-EEeCCceEeccCCceEEEecccchhhhhhheeeeecCEEEEEcCCCceeEEEEecCCCcEEEE Confidence 1 00122222233332 2111111 111111111 14567778899999987653321 1110 011 Q ss_pred ccc-cCCccce------eecCCCCcccceEEEEEEEEcC-CCcccccccceeEec--------------------CCCcc Q lcl|Aclame:pro 129 LTL-DTPAPPL------LVAGAGSLSQGTYGAAVAWLRG-PQESAPSLIAFAEVT--------------------DAGAL 180 (396) Q Consensus 129 l~i-p~Pa~p~------~~~~~Gsl~~g~y~ya~T~V~~-~gEeg~~~~~S~~vt--------------------~~~~~ 180 (396) ... ..|..+. .....| .+..|.+|.... ++++..+........ -..+. T Consensus 152 ~~f~~gp~~~in~~~sv~v~asg----~tg~~TiTaS~a~~~~~~vG~~i~~~~~~v~si~~~~~~~~~~~~~v~~~~~~ 227 (825) T protein:vir:73 152 VTTKNGPFEDINVDETVKVYASA----STGTITLTASSAIFGAEQVGKLFYLEQPAVDSVPVWETSKTTAINDVRRADSN 227 (825) T ss_pred EeccCCccccccccccceeeecc----cCceeEEEeeccccCchhcCeEEEEecccccccceeeeeeEEEeeeEEECCCc Confidence 100 0111000 000011 112233344322 222221111100000 00000 Q ss_pred ---------EEEeecCCC--------------CCcceEEEEEEecCCCeEEEEEee------cceeEEEEcCCchhhccc Q lcl|Aclame:pro 181 ---------EVTFPLCLD--------------ASVTGARLYLTRANGGELLLAGDY------PLGAATVILPTLPELGRP 231 (396) Q Consensus 181 ---------~v~lp~~~~--------------~~i~~~RIYrs~~~g~~~~lv~e~------~~~~~~~~d~~~~~lg~~ 231 (396) .-++++... .....++.+++ +++.+-+-+-. ......+.... ..+.. T Consensus 228 ~~~~~~~~~~~t~~~~a~~g~~~~~~~g~~~~~~~~~~~~~~~--~~g~~~it~~~~~~~~~~~~~~~~~~~~--~~~~~ 303 (825) T protein:vir:73 228 YYRANTSGKTGTLRPSHTEGMSWDGWGGTGSDDTGIQWEYLHS--GFGIAKITAVAGDGLTATADVVSFIPSQ--VVGSA 303 (825) T ss_pred eeeeecccccceeeccccCCceeEeeeeecccCCceEEEEEec--CCceEEEeeccccceeeccccceecccc--cccCC Confidence 001110000 00001112211 11111111000 00001111100 00111 Q ss_pred ccchhcCCcCC----C--ceeeccCCEEEEE----ECCEEEEccCCCCcccccccc-----cEe--c----CcceEEEEE Q lcl|Aclame:pro 232 AQFRHLSPMPT----G--KHLAYWRGRLLIA----RANVLRFSEALAYHLHDERYG-----FVQ--M----PQRITFVQP 290 (396) Q Consensus 232 l~t~~~~ppP~----g--~~~~~~nGrl~~a----~Gn~l~fSEp~~p~aw~~~y~-----~~~--~----~~~I~~i~~ 290 (396) -.+..|.-.+- | .++.|+++||..+ ..+.||+|.+..++-+..+-. -+. + ...|..+.+ T Consensus 304 ~~t~~~~~~~~~~~~gyPs~v~f~q~RL~f~g~~~~p~~v~~Srtgd~~nF~~~~~~~DdD~I~~~~s~~~~~~i~~~~~ 383 (825) T protein:vir:73 304 NASYKWAKYAWNSVNGYPSTVVYYQQRLYFAASTAYPQTIWASRTGDYKDFGKNNPIQDDDRIIYTYAGRQVNEIRHLID 383 (825) T ss_pred CCCcccccCCcccCCCCccEEEEEcceEEEeecCCCCCEEEEEccCCccccccCCCCCCCccEEEEEcCCcceeEEEEee Confidence 11222221111 1 5689999999998 679999999988877654321 011 1 345777888 Q ss_pred cCCcEEEEEcCcEEEEEcCc-----hhheeeeeeccCCCcccceeecchhhhccccccCcccEEEEecCCCEEE------ Q lcl|Aclame:pro 291 VDGGIWVGQVDHVAFLDGAD-----PASLSVSRRASRAPVPGSAVLVPAEVVGTNASPDGSPVAVWLAENGYVM------ 359 (396) Q Consensus 291 v~~gl~V~T~~~~y~l~G~~-----p~~m~~~~~~~~~p~~~s~~~~~~~~~~~~~~~~~~~~~lw~s~~Glv~------ 359 (396) .+ .|+++|++.-|.++|.. |++.+..+.+... ++. + .....+..++|.++.|=.+ T Consensus 384 ~~-~L~~~t~~~e~~l~~~~~~~lTP~~~~~~~~s~~g---~~~--~--------~Pv~vg~~~~Fv~~~g~~vre~~~~ 449 (825) T protein:vir:73 384 VG-NLVALTSGGEYTISGDQNKVLTPSAFSFSSQGNNG---SSN--V--------PPIAVANIALFIQEKGSVVRDLAYS 449 (825) T ss_pred cC-cEEEEecCceEEEecCCCcccceeeEEEEeeeeec---ccc--c--------cceEeCCeEEEEeCCCCeEEEEEEe Confidence 85 79999999999999754 4444444433221 111 1 1223446788888888211 Q ss_pred EcCCC----cEEEEecceeeccccccceEEEeCcEEEE-EeC Q lcl|Aclame:pro 360 GTSSG----AIAEVHAGVLAGITGRAGTSVVFDRRLLT-AVS 396 (396) Q Consensus 360 g~~~G----~~~~lt~~~~~~~~a~~~~~~~~~rr~v~-~~~ 396 (396) -..++ -++.+.+..+....-..-+...-+++++- +++ T Consensus 450 ~~~d~~~~~dlt~~a~hl~~~~~~~~~a~~~~p~~~~~~v~~ 491 (825) T protein:vir:73 450 FDVDGYQGTDLTILANHLFQKHSIVDWSFCIVPYSSAFCIRD 491 (825) T ss_pred eecCceeccchhhhhHhhccCCceEEEEEcCCCceEEEEEec Confidence 00111 01111111222110011111111122221 111 No 34 >protein:vir:103341 Length: 806 # NCBI annotation: tail tubular protein B-like protein # Family: family:all:825 # MgeID: mge:1609 # MgeName: Era103 # Cross-refs: genbank:acc:YP_001039670;genbank:gi:125999999;genbank:GeneID:4818417 Probab=97.40 E-value=7.6e-05 Score=43.16 Aligned_cols=374 Identities=11% Similarity=0.031 Sum_probs=152.2 Q ss_pred CCc-ccccce-eccCCcCChhheeeCCCchhhheeeeeeeeeCCCccEEECCcceeecCCc---cc-cccCCcc---cc- Q lcl|Aclame:pro 1 MAT-TSLVPL-AGINNVAEDAALQRGGESPRLYVRDAVNIDLSPAGKAQLRASVRQVTDQP---FR-QLWQSPL---HG- 70 (396) Q Consensus 1 m~~-~~~~p~-~G~nn~~~~~~L~~~~~~~~~~lrdAvNVD~~~~G~l~~R~G~~~~~~~~---~~-~lw~s~~---~~- 70 (396) |-. -.++-| .||--..+..++ ...+++++|.=.+..|.+++|+|.+.+..+. .+ .+|-.-. .. T Consensus 1 ~~v~~s~~nl~~GvSqQ~d~~R~-------~~q~~~~~N~~~~~~~Gl~rRPgt~~va~l~~~~~~~~~~~~~~~~~~~~ 73 (806) T protein:vir:10 1 MEVQGSYGRQLQGVSQQPIAVRL-------PGQVTSQLNAVPNVVDGLKTRMGSKHLARILNSLDANSLIHHYKRGDDAE 73 (806) T ss_pred CeeEeecchhccceeccChhHhh-------hhhhhhhhcceeccccccccCCchhhhhhhcCCCCccceEEEEEecCCce Confidence 654 344444 565444455555 3469999999999999999999999875441 11 1121111 11 Q ss_pred -cEEEEECCeEEEEecCCCc-ee---------ecc--cccCcceehhhcCCeEEEEcCCcceeec--------------- Q lcl|Aclame:pro 71 -DAFGALGDQWGKVDPHSWT-FE---------PLA--QIGEGDLSHEVLNNRVCVAGTAGIFTYD--------------- 122 (396) Q Consensus 71 -~~~~~~dg~L~~i~~~~w~-~~---------vl~--~ig~gpV~~~v~n~rvy~t~~~~~~~~~--------------- 122 (396) +.....+|.|.-++..+.. .. .+. ...+..+.+.+.+|-+|++|...+-... T Consensus 74 ~y~v~~~~g~i~v~~~~~G~~~~v~~~~~~~~yl~~~~~~~~~l~~~tvaD~tfi~n~~~~~~~~~~~~~~~~~~~~v~v 153 (806) T protein:vir:10 74 EYFVILQPGQVPVIFTVGGLACPVNTQGSAATYLSSSSLPRETTQLMTIGDYTFVLNRKMPVQARGDVTPSLDNKGLVYV 153 (806) T ss_pred EEEEEEcCCcEEEEEcCCCcEEEecCCCceEEEeccCCCCcceeeEEEEcCEEEEecCcEeeeecccccCCCCcceEEEE Confidence 2212334554434322221 11 111 0112347777888999999866432111 Q ss_pred -Cce---eeecccc--------CCccc-e----eecC-------CCCcccceEE-EEEEEEcC----CCcccccccceeE Q lcl|Aclame:pro 123 -GAQ---AERLTLD--------TPAPP-L----LVAG-------AGSLSQGTYG-AAVAWLRG----PQESAPSLIAFAE 173 (396) Q Consensus 123 -g~~---~~~l~ip--------~Pa~p-~----~~~~-------~Gsl~~g~y~-ya~T~V~~----~gEeg~~~~~S~~ 173 (396) +++ .+.+.|. +|+.. + .... ...+....-. =.++|... .-+.....+.+.. T Consensus 154 ~~g~y~~~y~i~Ing~~~a~~~t~~~~~~~~~~~~~~~~~a~~l~~~l~~~~~~~~~~~~~~~g~~~~i~~~~~~~~~~~ 233 (806) T protein:vir:10 154 AYANFSFTYQILINGQVAAEHKTASSEDVKNEDLVRTDYVAGKLLENFNSRTASFPGFSMYQDGNVLVVDNSNGANYALT 233 (806) T ss_pred eecccCceeeEEeccceEEEEEeccCCCcccccccchhHHHHHHHhhhcccccccceeEEEEcccEEEEecCCCCccEEE Confidence 000 0111111 01000 0 0000 0000000000 00111110 0000000011111 Q ss_pred ecCCC-cc-----------EEEeecCCCCCcceEEEEEEecCC---CeEEEEEeec------------------------ Q lcl|Aclame:pro 174 VTDAG-AL-----------EVTFPLCLDASVTGARLYLTRANG---GELLLAGDYP------------------------ 214 (396) Q Consensus 174 vt~~~-~~-----------~v~lp~~~~~~i~~~RIYrs~~~g---~~~~lv~e~~------------------------ 214 (396) ..++. +. .-.||... ++... ++++++++ +.||...+-. T Consensus 234 ~~~g~~~~~~~~~~~~v~~~~~lp~~~-~~g~~--v~i~~~~~~~~~~y~v~~~~~~~~~~~w~e~~~~~~~~~~~~~t~ 310 (806) T protein:vir:10 234 TVDGADGQDLVAIRHKVTNLDTLPNRA-PVGYK--VQVWPTGSKPESRYWLQAESQDGSKVTWVETIAPGVRKGWNAATM 310 (806) T ss_pred EeeCCCCceeEEeecccCccccCcccc-CCCcE--EEEeccCCCCCCceEEEEEeeccCceEEEeecccccccceecccc Confidence 11111 00 01122221 11211 23333222 1122111100 Q ss_pred -----------ceeEEEEcCCchhhcccccchhcCCcCC--C-------ceeeccCCEEEEEECCEEEEccCCCCccccc Q lcl|Aclame:pro 215 -----------LGAATVILPTLPELGRPAQFRHLSPMPT--G-------KHLAYWRGRLLIARANVLRFSEALAYHLHDE 274 (396) Q Consensus 215 -----------~~~~~~~d~~~~~lg~~l~t~~~~ppP~--g-------~~~~~~nGrl~~a~Gn~l~fSEp~~p~aw~~ 274 (396) .+..++.....+...+.+--..-.|.|. | .-+.|+++||..+.++.||+|.+..++-|-. T Consensus 311 p~~~v~~~~~~~~~~~~~~~~~~w~~r~~Gd~~tn~~psF~~~~~~~~it~v~f~q~RL~f~s~~~v~~Srsgd~~nF~~ 390 (806) T protein:vir:10 311 PHVLVRESLNANGSANFTYRPGEWEDRDVGDDLTNDFPSLLNDSSPQPISSMLMVQNRLMLTSGEAVVASRTSRFFDFFR 390 (806) T ss_pred ceEEEeeeeeecccceeEEEecccccccccccccCccCcccCCCCCccceEEEEEeeeEEEecCCeEEEEccCCcccCcc Confidence 0001111111111111111111122222 1 2378999999999999999999998887732 Q ss_pred cc-------ccEec------CcceEEEEEcCCcEEEEEcCcEEEEEcCc---hhheeeeeeccCCCcccceeecchhhhc Q lcl|Aclame:pro 275 RY-------GFVQM------PQRITFVQPVDGGIWVGQVDHVAFLDGAD---PASLSVSRRASRAPVPGSAVLVPAEVVG 338 (396) Q Consensus 275 ~y-------~~~~~------~~~I~~i~~v~~gl~V~T~~~~y~l~G~~---p~~m~~~~~~~~~p~~~s~~~~~~~~~~ 338 (396) +- +-+.+ ...|.-+.++..+|+++|++.-|.|+|.+ |++.+..+.+... |-. T Consensus 391 ~t~~~~~DdD~I~~~~ss~~~~~i~~~v~~~~~L~l~t~~~q~~l~~~~~lTP~~~~~~~~s~~~------------~~~ 458 (806) T protein:vir:10 391 YTVLATVDTDPFDVFADIEEVYNIRWSAQMDGDVVLFTSDQQFTLPGDKPLTPTSAVIRPVTQFK------------MTP 458 (806) T ss_pred ccccCCCCCccEEEEEcCCcceeeeeeeecCCcEEEEecCcEEEEeCCCcccceeEEEEEEEeec------------ccC Confidence 21 11222 34566788899999999999999999854 4444433332211 111 Q ss_pred cccccCcccEEEEecCCC----E---EEEcC-CCc----EEEEecceeecccccc------c-eEEE--eCcEEEEEeC Q lcl|Aclame:pro 339 TNASPDGSPVAVWLAENG----Y---VMGTS-SGA----IAEVHAGVLAGITGRA------G-TSVV--FDRRLLTAVS 396 (396) Q Consensus 339 ~~~~~~~~~~~lw~s~~G----l---v~g~~-~G~----~~~lt~~~~~~~~a~~------~-~~~~--~~rr~v~~~~ 396 (396) .-.....+..++|+++.| + .-... ++- ++.+-+..|....-.. - ..+| ++..-+..+. T Consensus 459 ~~~Pv~vG~~v~Fv~~~g~~s~vre~~y~~~~d~~~~~DlT~~~~hl~~g~~~~~~~~~~~~~~~~~~~~~dg~l~~~t 537 (806) T protein:vir:10 459 GVKPAPSGDSILFAFDQGSYSGIREFFTDSYSDTKKAQPATSHVDKYIRGKVLELSASSSFNRAFIITSPDRNILYVYD 537 (806) T ss_pred CCCceEeCCeEEEeeCCCCeeEEEEEEeeeeccceehhhHHHHHHHhcCCCeEEEEEeCCCCcEEEEEEcCCCEEEEEE Confidence 222334566899999988 1 11111 110 0000000011100000 0 0001 1111111111 No 35 >protein:vir:108312 Length: 458 # NCBI annotation: hypothetical protein # Family: family:all:1540 # MgeID: mge:2007 # MgeName: BA3 # Cross-refs: genbank:acc:YP_001552278;genbank:gi:160700603;genbank:GeneID:5758828 Probab=96.87 E-value=0.00029 Score=40.00 Aligned_cols=270 Identities=11% Similarity=0.082 Sum_probs=147.8 Q ss_pred CCcccccceeccCCcCChhheeeCCCchhhheeeeeeeeeC------CCccEEECCcceeecCCcc---ccccCCccccc Q lcl|Aclame:pro 1 MATTSLVPLAGINNVAEDAALQRGGESPRLYVRDAVNIDLS------PAGKAQLRASVRQVTDQPF---RQLWQSPLHGD 71 (396) Q Consensus 1 m~~~~~~p~~G~nn~~~~~~L~~~~~~~~~~lrdAvNVD~~------~~G~l~~R~G~~~~~~~~~---~~lw~s~~~~~ 71 (396) |+..+|+ . .--. .+.+. .-.+..||.-.. .++-|..-+|.++..+++. ..++.. .+. T Consensus 1 m~~~~ip-~--gsy~--a~~~~-------~daq~~VN~yp~~~e~g~ss~~l~~tPGl~~f~~~~~~~~~g~~~~--~g~ 66 (458) T protein:vir:10 1 MVQRQIP-L--VATT--AEGDV-------SGQEILVNVYPRKSDGGKYPFTLRHTPGLAFFCELPTFPVMAMHQN--GSR 66 (458) T ss_pred Cceeeec-e--eeee--ccccc-------ccceeeeeeeeecccccccccceEecCCceeeecCCCCceeeEEec--CCE Confidence 9998884 2 2111 22222 125567887664 3456777899999766644 444443 467 Q ss_pred EEEEECCeEEEEecCCCceeecccccCcceehhhcCCeEEEEcCCcceeecCceeeeccccCCccceeecCCCCcccceE Q lcl|Aclame:pro 72 AFGALGDQWGKVDPHSWTFEPLAQIGEGDLSHEVLNNRVCVAGTAGIFTYDGAQAERLTLDTPAPPLLVAGAGSLSQGTY 151 (396) Q Consensus 72 ~~~~~dg~L~~i~~~~w~~~vl~~ig~gpV~~~v~n~rvy~t~~~~~~~~~g~~~~~l~ip~Pa~p~~~~~~Gsl~~g~y 151 (396) +|++.+..|++|..+.-.+++..--|.|||.++-..+.+-++++.....||+.+..-.-+. .+++.+ T Consensus 67 ly~v~g~~LY~V~~~~~~~~iG~i~gsg~VsMa~ng~q~vi~~G~~gY~yd~at~~~~~i~----------d~~~~~--- 133 (458) T protein:vir:10 67 AFAVTPRDMYEISKDGTYKRLGSVDFKGRVVMEDNGKQIVMVDGEKGYYYDSETEIVQEIK----------AEGFYP--- 133 (458) T ss_pred EEEeeCceEEEEeCCceEEEEecccCceeEEEeeCCcEEEEEECCeEEEEeecccEEEecc----------CccccC--- Confidence 8999999999998664434443333678887765555666665554444433211000000 001100 Q ss_pred EEEEEEEcCCCcccccccceeEecCCCccEEEeecCCCCCcceEEEEE-EecCCCeEEEEEeecceeEEEEcCCchhhcc Q lcl|Aclame:pro 152 GAAVAWLRGPQESAPSLIAFAEVTDAGALEVTFPLCLDASVTGARLYL-TRANGGELLLAGDYPLGAATVILPTLPELGR 230 (396) Q Consensus 152 ~ya~T~V~~~gEeg~~~~~S~~vt~~~~~~v~lp~~~~~~i~~~RIYr-s~~~g~~~~lv~e~~~~~~~~~d~~~~~lg~ 230 (396) ...++..+=|- =.+.|+..+++-+|.- T Consensus 134 -------------------------------------~~~v~~~dGy~V~~~~g~~~~~is~L~d--------------- 161 (458) T protein:vir:10 134 -------------------------------------ASTVTYQDGYFIFDRKGTGQFFISELLD--------------- 161 (458) T ss_pred -------------------------------------cceEEEeCcEEEEEeeCCCEEEEEecCc--------------- Confidence 11222222221 1111222233322100 Q ss_pred cccchhcCCcCCCceeeccCCEEEEEECCEEEEccCCCCcccccccccEecCcceEEEEEcCCcEEEEEcCc--EEEEEc Q lcl|Aclame:pro 231 PAQFRHLSPMPTGKHLAYWRGRLLIARANVLRFSEALAYHLHDERYGFVQMPQRITFVQPVDGGIWVGQVDH--VAFLDG 308 (396) Q Consensus 231 ~l~t~~~~ppP~g~~~~~~nGrl~~a~Gn~l~fSEp~~p~aw~~~y~~~~~~~~I~~i~~v~~gl~V~T~~~--~y~l~G 308 (396) .+ ++| +.| +--| --|+.|++|...-+-||+.=+.. +|..+| T Consensus 162 --~s--~d~---------------------l~f-------a~Ae-----~~pD~iv~i~~~~~~i~~fG~~TiEvw~ntG 204 (458) T protein:vir:10 162 --VA--FDP---------------------LDF-------ATAE-----GQPDPLLAVLSDHREVFMFGQETIEVWYNSG 204 (458) T ss_pred --ce--eCc---------------------cee-------eeec-----CCCCceEEEEeeccEEEEEeccceEEEEecC Confidence 00 110 111 1111 24789999999999998875544 599999 Q ss_pred CchhheeeeeeccCCCcccceeecchhhhccccccCcccEEEEecCCCEEEEcCCCcEEEEecceeec-----cccc-cc Q lcl|Aclame:pro 309 ADPASLSVSRRASRAPVPGSAVLVPAEVVGTNASPDGSPVAVWLAENGYVMGTSSGAIAEVHAGVLAG-----ITGR-AG 382 (396) Q Consensus 309 ~~p~~m~~~~~~~~~p~~~s~~~~~~~~~~~~~~~~~~~~~lw~s~~Glv~g~~~G~~~~lt~~~~~~-----~~a~-~~ 382 (396) ..+ ..+++. |+ ..++-+|++..+..-.++++.|++.||.|-...+++...++--.++- ..++ .+ T Consensus 205 ~a~--fpy~r~------~g--a~i~~Gcaa~~sv~~~~~t~~~l~~d~~Vy~l~g~~~~rIST~aIE~~i~sy~~~da~a 274 (458) T protein:vir:10 205 AAD--FPFERN------QG--AFIEKGIGAPYSVAKTNNTVYFIGSDLMIYQITGYTPVRISTHAVEQTLKGVNLSDAFA 274 (458) T ss_pred CCC--cceeec------cc--ceeeecccCcchhhhhCceEEEEcCCeEEEEecCceeEEeeCHHHHHHHhcCChhheEE Confidence 965 333332 23 34577899999999999999999999999888888887665433322 1222 33 Q ss_pred eEEEeCcE---EEEEeC Q lcl|Aclame:pro 383 TSVVFDRR---LLTAVS 396 (396) Q Consensus 383 ~~~~~~rr---~v~~~~ 396 (396) .+...+.- ++++-+ T Consensus 275 ~t~~~eGH~fy~LtfP~ 291 (458) T protein:vir:10 275 YTYQSEGHLFYVLTIPG 291 (458) T ss_pred EEEEecCeEEEEEECCC Confidence 33333433 333332 No 36 >protein:vir:2625 Length: 715 # NCBI annotation: gp27 # Family: family:all:5234 # MgeID: mge:55 # MgeName: SIO1 # Cross-refs: genbank:acc:NP_064766;genbank:gi:9964636;genbank:GeneID:1263056 Probab=96.25 E-value=0.00083 Score=37.48 Aligned_cols=354 Identities=16% Similarity=0.181 Sum_probs=167.1 Q ss_pred CCc------ccccceeccCCcCChhheeeCCCchhhheeeeeeeeeCCCccEEECCcceeec----------------CC Q lcl|Aclame:pro 1 MAT------TSLVPLAGINNVAEDAALQRGGESPRLYVRDAVNIDLSPAGKAQLRASVRQVT----------------DQ 58 (396) Q Consensus 1 m~~------~~~~p~~G~nn~~~~~~L~~~~~~~~~~lrdAvNVD~~~~G~l~~R~G~~~~~----------------~~ 58 (396) |+. ++-+ +.|+-.-..+.. .|+ -..-|-+|+|+..+|.-+||.|..... -. T Consensus 1 m~~~~~~~~vNtF-v~GliTEas~lt---fpq---nasiDe~N~~l~rdG~r~RR~g~~~E~~~vls~~~vp~galv~~~ 73 (715) T protein:vir:26 1 MPQSLTQRTVNTF-IKGLITEASELT---FPE---NASVDELNCSLGRDGTRRRRKAVTLEDNHVLSDVVVPEGALVQTL 73 (715) T ss_pred CCcccchhHHhhh-hhheeecccccc---CCc---cceeeeeeeeecCCCcchhhccceeecceEEEEEeecCceeeeee Confidence 543 2222 355544433322 233 345678999999999999999987643 23 Q ss_pred ccccccCCcccccEEEEECCeEEEEecCCCceee-------------ccccc--C--cceehhhcCCeEEEEcCCc-cee Q lcl|Aclame:pro 59 PFRQLWQSPLHGDAFGALGDQWGKVDPHSWTFEP-------------LAQIG--E--GDLSHEVLNNRVCVAGTAG-IFT 120 (396) Q Consensus 59 ~~~~lw~s~~~~~~~~~~dg~L~~i~~~~w~~~v-------------l~~ig--~--gpV~~~v~n~rvy~t~~~~-~~~ 120 (396) .|.|.|.... ..++.++-|..+++--++-.... ++.+- | -.|.+.+.|+.+.+++... ++. T Consensus 74 ~W~na~G~v~-~~~livqvg~~l~f~q~t~~pLs~~n~~~svdl~~~~~~vn~SPsh~~v~v~~~~G~livanp~i~~~~ 152 (715) T protein:vir:26 74 DWYNVAGQVN-LEFLVVQVNNILYFYEKSTDPLSANKYSGSVDLNTHSASNNLSPSEERVQVTSLNGYLIVASPAINTFY 152 (715) T ss_pred chhhcccccC-cEEEEEEeccEEEEEeccCCccccCceeEEeeecceecccccccceeEEEEEEeeeEEEEecCCccEEE Confidence 3444554322 12333333333333222211100 00111 1 1344555666666666533 222 Q ss_pred --ecC-ce---eeeccccCC-----ccceee----cCCCCc--ccceE-EEEEEEEcCCCcccccccceeEecCCCccEE Q lcl|Aclame:pro 121 --YDG-AQ---AERLTLDTP-----APPLLV----AGAGSL--SQGTY-GAAVAWLRGPQESAPSLIAFAEVTDAGALEV 182 (396) Q Consensus 121 --~~g-~~---~~~l~ip~P-----a~p~~~----~~~Gsl--~~g~y-~ya~T~V~~~gEeg~~~~~S~~vt~~~~~~v 182 (396) ||. .. ..++-+-.= -.++.+ .+.|+. ++.-| .|-.+|+...| +..+...+.--| T Consensus 153 ~~~d~~t~s~t~~~ll~r~r~f~~qg~d~~~g~~y~~~gt~~tn~~iynlyN~gw~~p~g--------t~~~N~~~~yiV 224 (715) T protein:vir:26 153 LGFNTSTEAFTATSISFKERDFEWQGSDVDVTSLYFGEGTSVSNQRIYDTYNVGWVGPKG--------SAALNTYGSYIV 224 (715) T ss_pred EEecCCcceeEeeEEEEEeeeheeeccccccccccccCCcccCchhheecccceeeccee--------EEEEcCCCCceE Confidence 111 00 000000000 000000 001111 11111 13444444211 111111111000 Q ss_pred EeecCCCCCcceEEEEEEecCC-CeEEEEEeecceeEEEEcC-------C----chhhcccccchhcCCcCCCceeeccC Q lcl|Aclame:pro 183 TFPLCLDASVTGARLYLTRANG-GELLLAGDYPLGAATVILP-------T----LPELGRPAQFRHLSPMPTGKHLAYWR 250 (396) Q Consensus 183 ~lp~~~~~~i~~~RIYrs~~~g-~~~~lv~e~~~~~~~~~d~-------~----~~~lg~~l~t~~~~ppP~g~~~~~~n 250 (396) -|.. --.|-|..+. ..|-..+++.+.+.++..+ . ...+-+...++.++- .+.+| T Consensus 225 -ypa~-------s~~~~S~kd~n~afsk~ad~ei~tGt~~~~~G~yi~D~~~~g~~~leeev~k~R~rs------v~~ya 290 (715) T protein:vir:26 225 -YPAL-------THPWYSGKDANGAFNKADWLEIYTGSSLASNGHYVLDVFNKARTGLTTEVETGRFRS------VAAYA 290 (715) T ss_pred -eccc-------ccccCCCcccccccChhhccccccccccccCceEEEeeeecCCccchhhhhcCCCcc------eeeec Confidence 0000 0001010000 0122223333332222111 0 011122333343332 78899 Q ss_pred CEEEEE------ECCEEEEcc-CC------------------CCcccccccccEec--CcceEEEEEcCCcEEEEEcCcE Q lcl|Aclame:pro 251 GRLLIA------RANVLRFSE-AL------------------AYHLHDERYGFVQM--PQRITFVQPVDGGIWVGQVDHV 303 (396) Q Consensus 251 Grl~~a------~Gn~l~fSE-p~------------------~p~aw~~~y~~~~~--~~~I~~i~~v~~gl~V~T~~~~ 303 (396) ||+|-+ .|..++||. -- .|.+-+..=+++.. -++|+.|.-++..|.|.-+..+ T Consensus 291 GrV~yagiD~dkng~rilfSqLv~s~~di~nCyQd~DPTsee~~dLidTDGg~iri~gah~ii~Lv~f~~sLlvf~~NGV 370 (715) T protein:vir:26 291 GRVFYAGIDSAKNGGKVYFSRLTERMSDVGNCYQVNDPTSEVLSDLLDTDGGVVRIPDAHNIRKLHVLGASLLVFAENGV 370 (715) T ss_pred ceEEEeecccccCCCeEEEehhhcchhhcccccccCCCchhhhhhhhhcCCCEEEecCCCCceeEEEecceEEEEEecce Confidence 999999 467899996 22 23333444333333 5789999999999999999999 Q ss_pred EEEEcCch----hheeeeeeccCCCcccceeecchhhhccccccCcccEEEEecCCCEEEEcCCC-----cEEEEeccee Q lcl|Aclame:pro 304 AFLDGADP----ASLSVSRRASRAPVPGSAVLVPAEVVGTNASPDGSPVAVWLAENGYVMGTSSG-----AIAEVHAGVL 374 (396) Q Consensus 304 y~l~G~~p----~~m~~~~~~~~~p~~~s~~~~~~~~~~~~~~~~~~~~~lw~s~~Glv~g~~~G-----~~~~lt~~~~ 374 (396) |.+.|++- -+....|++.. +|-+-.|.|..+++.+|-+.+|+.....+- ...+||++.+ T Consensus 371 WAi~G~d~g~tATdY~ltKIs~v------------g~sspnSvVvv~~~i~~WsdtGIyal~~Nd~fn~~tAqNLTekTI 438 (715) T protein:vir:26 371 WAVAGVDNVFRATEYAITRISDV------------GLSNENSFVVADGIPIWWGKTGIYAVQQSENLNTPTAQNLSLSTI 438 (715) T ss_pred EEEeccCCceeeeeeEEEEeeee------------ccCCCccEEEecceEEEeeCCcEEEEEeccccCcchhhccchHHH Confidence 99988873 22455666542 377788999999999999999999876664 3456776665 Q ss_pred ec------cccccceEEEe---CcEEEEEe-C Q lcl|Aclame:pro 375 AG------ITGRAGTSVVF---DRRLLTAV-S 396 (396) Q Consensus 375 ~~------~~a~~~~~~~~---~rr~v~~~-~ 396 (396) .- .......+..+ +.|+--+. + T Consensus 439 q~~~~~I~~dk~knVtg~fd~~e~rVyW~yPn 470 (715) T protein:vir:26 439 QTLWNNISNAKKAQVTVEYDKINQRVFWFYPD 470 (715) T ss_pred HHHHhhcchhhhcceEEEEEccCCEEEEEEcC Confidence 22 23345555666 44433333 2 No 37 >protein:vir:95603 Length: 463 # NCBI annotation: ORF016 # Family: family:all:2450 # MgeID: mge:1577 # MgeName: G1 # Cross-refs: genbank:acc:YP_240903;genbank:gi:66394965;genbank:GeneID:5132544 Probab=96.13 E-value=5.6e-05 Score=43.91 Aligned_cols=255 Identities=16% Similarity=0.200 Sum_probs=97.5 Q ss_pred CCcccccceeccCCcCCh---hheeeCCCchhhheeeeeeeeeCCCcc-E----------EECCcceeecCCccccccCC Q lcl|Aclame:pro 1 MATTSLVPLAGINNVAED---AALQRGGESPRLYVRDAVNIDLSPAGK-A----------QLRASVRQVTDQPFRQLWQS 66 (396) Q Consensus 1 m~~~~~~p~~G~nn~~~~---~~L~~~~~~~~~~lrdAvNVD~~~~G~-l----------~~R~G~~~~~~~~~~~lw~s 66 (396) |-.+-..-|=|=+..++. ..|+-||= -.|-|+-|| +++.|+ | .-+.+|--.+ ++|=+ T Consensus 150 a~tiE~a~FyGds~l~~~~~~~gleFDGl---~~lId~env-iDarG~~Ls~~~ln~Aa~~i~~~fGt~T-----D~~lp 220 (463) T protein:vir:95 150 AKTIEWASFYGDASLTSEVEGEGLEFDGL---AKLIDKNNV-INAKGNQLTEKHLNEAAVRIGKGFGTAT-----DAYMP 220 (463) T ss_pred HHHHHHHHhhhhhccCCCcCccccchhhh---hhhcCCCCe-eecCCCcccHHHHhhhhhhhhcccCChh-----heecc Confidence 233333344333333331 34554332 223344443 112121 1 1112222222 12211 Q ss_pred cccccEEE--EECCeEEEEecCCCceeeccccc-----CcceehhhcCCeEEEEcCCcceeecCceeeeccccCCcccee Q lcl|Aclame:pro 67 PLHGDAFG--ALGDQWGKVDPHSWTFEPLAQIG-----EGDLSHEVLNNRVCVAGTAGIFTYDGAQAERLTLDTPAPPLL 139 (396) Q Consensus 67 ~~~~~~~~--~~dg~L~~i~~~~w~~~vl~~ig-----~gpV~~~v~n~rvy~t~~~~~~~~~g~~~~~l~ip~Pa~p~~ 139 (396) .-.+--|- -.+.+=+-+.+...+......+. +|.+.. ++-++.- .+.+.+-. -.....+|++|.+ T Consensus 221 ~~vka~f~~~~l~~qrv~~~~N~~~~~~G~~v~~f~s~~G~I~L---~~s~~m~---~~~il~~~--~~~~p~ap~~~~~ 292 (463) T protein:vir:95 221 IGVHADFVNSILGRQMQLMQDNSGNVNTGYSVNGFYSSRGFIKL---HGSTVME---NELILDES--LQPLPNAPQPAKV 292 (463) T ss_pred hHHHHHHHHHhcCceEEEEcCCCCceeeeeeccceeeeeeeeee---CCceecC---Ccccccch--hhcCCCCccCcee Confidence 10000000 00111111111111111111110 111110 0000000 11111111 0122233333322 Q ss_pred -----ecCCCC----cccceEEEEEEEEcCCCcccccccceeEec-CCCccE--EEeecCCCCCcceEEEEEEecCCCeE Q lcl|Aclame:pro 140 -----VAGAGS----LSQGTYGAAVAWLRGPQESAPSLIAFAEVT-DAGALE--VTFPLCLDASVTGARLYLTRANGGEL 207 (396) Q Consensus 140 -----~~~~Gs----l~~g~y~ya~T~V~~~gEeg~~~~~S~~vt-~~~~~~--v~lp~~~~~~i~~~RIYrs~~~g~~~ 207 (396) ....|+ -+.+.|.|++..++..||+-|+..+..++. ++.++. +++|....-..+.+-|||+..++++| T Consensus 293 tatv~~~~~~~~~~~~~~a~~~Y~vv~~s~~geS~pS~ivtaT~a~~~~gv~l~It~~a~~~~~~~~v~IYR~~~~~g~~ 372 (463) T protein:vir:95 293 TATVETKQKGAFENEEDRAGLSYKVVVNSDDAQSAPSEEVTATVSNVDDGVKLSINVNAMYQQQPQFVSIYRQGKETGMY 372 (463) T ss_pred EEEEeeccCCCCCCcccccceEEEEEEECCCCCcccchheeeeeeeccceEEEEEEecCCcccceeEEEEEeecCCCCcc Confidence 112233 355679999999999999988777766554 244554 45565566678899999999999999 Q ss_pred EEEEeecce------eEEEEcCC--chhh-----cc-cccc-hhcCCcC-----CCc------eeeccCCEEEEEECCEE Q lcl|Aclame:pro 208 LLAGDYPLG------AATVILPT--LPEL-----GR-PAQF-RHLSPMP-----TGK------HLAYWRGRLLIARANVL 261 (396) Q Consensus 208 ~lv~e~~~~------~~~~~d~~--~~~l-----g~-~l~t-~~~~ppP-----~g~------~~~~~nGrl~~a~Gn~l 261 (396) ++++.+++. +.+|+|.. .+.- ++ .-+| +.|..+| -.. .+-+|-|-|+-+ T Consensus 373 ~~i~rv~v~~an~~gttt~~D~n~~IPgt~~vfVgems~~ti~~~ellPm~klpLA~~~~~~~waVl~YGaLal~----- 447 (463) T protein:vir:95 373 FLIKRVPVKDAQEDGTIVFVDKNETLPETADVFVGEMSPQVVHLFELLPMMKLPLAQINASITFAVLWYGALALR----- 447 (463) T ss_pred eeEEEEEecccCCCceEEEeecccccCCceeEeeeccCchhhhhHhhhHhhhCCchhccchhhhHHHHhhHHHhh----- Confidence 999888554 45666631 2211 11 0112 1112222 111 123333333222 Q ss_pred EEccCCCCcccccccccEecCcceEEEEEc Q lcl|Aclame:pro 262 RFSEALAYHLHDERYGFVQMPQRITFVQPV 291 (396) Q Consensus 262 ~fSEp~~p~aw~~~y~~~~~~~~I~~i~~v 291 (396) .|--|=.-+| |.- .|| T Consensus 448 ------~Pk~~~~ikN-------v~~-~~v 463 (463) T protein:vir:95 448 ------APKKWARIKN-------VRY-IAV 463 (463) T ss_pred ------ccccceEEEE-------eeE-ecC Confidence 2222211111 111 111 No 38 >protein:vir:99311 Length: 463 # NCBI annotation: putative capsid protein # Family: family:all:2450 # MgeID: mge:1655 # MgeName: K # Cross-refs: genbank:acc:YP_024474;genbank:gi:48696433;genbank:GeneID:2948039 Probab=96.13 E-value=5.6e-05 Score=43.91 Aligned_cols=255 Identities=16% Similarity=0.200 Sum_probs=97.5 Q ss_pred CCcccccceeccCCcCCh---hheeeCCCchhhheeeeeeeeeCCCcc-E----------EECCcceeecCCccccccCC Q lcl|Aclame:pro 1 MATTSLVPLAGINNVAED---AALQRGGESPRLYVRDAVNIDLSPAGK-A----------QLRASVRQVTDQPFRQLWQS 66 (396) Q Consensus 1 m~~~~~~p~~G~nn~~~~---~~L~~~~~~~~~~lrdAvNVD~~~~G~-l----------~~R~G~~~~~~~~~~~lw~s 66 (396) |-.+-..-|=|=+..++. ..|+-||= -.|-|+-|| +++.|+ | .-+.+|--.+ ++|=+ T Consensus 150 a~tiE~a~FyGds~l~~~~~~~gleFDGl---~~lId~env-iDarG~~Ls~~~ln~Aa~~i~~~fGt~T-----D~~lp 220 (463) T protein:vir:99 150 AKTIEWASFYGDASLTSEVEGEGLEFDGL---AKLIDKNNV-INAKGNQLTEKHLNEAAVRIGKGFGTAT-----DAYMP 220 (463) T ss_pred HHHHHHHHhhhhhccCCCcCccccchhhh---hhhcCCCCe-eecCCCcccHHHHhhhhhhhhcccCChh-----heecc Confidence 233333344333333331 34554332 223344443 112121 1 1112222222 12211 Q ss_pred cccccEEE--EECCeEEEEecCCCceeeccccc-----CcceehhhcCCeEEEEcCCcceeecCceeeeccccCCcccee Q lcl|Aclame:pro 67 PLHGDAFG--ALGDQWGKVDPHSWTFEPLAQIG-----EGDLSHEVLNNRVCVAGTAGIFTYDGAQAERLTLDTPAPPLL 139 (396) Q Consensus 67 ~~~~~~~~--~~dg~L~~i~~~~w~~~vl~~ig-----~gpV~~~v~n~rvy~t~~~~~~~~~g~~~~~l~ip~Pa~p~~ 139 (396) .-.+--|- -.+.+=+-+.+...+......+. +|.+.. ++-++.- .+.+.+-. -.....+|++|.+ T Consensus 221 ~~vka~f~~~~l~~qrv~~~~N~~~~~~G~~v~~f~s~~G~I~L---~~s~~m~---~~~il~~~--~~~~p~ap~~~~~ 292 (463) T protein:vir:99 221 IGVHADFVNSILGRQMQLMQDNSGNVNTGYSVNGFYSSRGFIKL---HGSTVME---NELILDES--LQPLPNAPQPAKV 292 (463) T ss_pred hHHHHHHHHHhcCceEEEEcCCCCceeeeeeccceeeeeeeeee---CCceecC---Ccccccch--hhcCCCCccCcee Confidence 10000000 00111111111111111111110 111110 0000000 11111111 0122233333322 Q ss_pred -----ecCCCC----cccceEEEEEEEEcCCCcccccccceeEec-CCCccE--EEeecCCCCCcceEEEEEEecCCCeE Q lcl|Aclame:pro 140 -----VAGAGS----LSQGTYGAAVAWLRGPQESAPSLIAFAEVT-DAGALE--VTFPLCLDASVTGARLYLTRANGGEL 207 (396) Q Consensus 140 -----~~~~Gs----l~~g~y~ya~T~V~~~gEeg~~~~~S~~vt-~~~~~~--v~lp~~~~~~i~~~RIYrs~~~g~~~ 207 (396) ....|+ -+.+.|.|++..++..||+-|+..+..++. ++.++. +++|....-..+.+-|||+..++++| T Consensus 293 tatv~~~~~~~~~~~~~~a~~~Y~vv~~s~~geS~pS~ivtaT~a~~~~gv~l~It~~a~~~~~~~~v~IYR~~~~~g~~ 372 (463) T protein:vir:99 293 TATVETKQKGAFENEEDRAGLSYKVVVNSDDAQSAPSEEVTATVSNVDDGVKLSINVNAMYQQQPQFVSIYRQGKETGMY 372 (463) T ss_pred EEEEeeccCCCCCCcccccceEEEEEEECCCCCcccchheeeeeeeccceEEEEEEecCCcccceeEEEEEeecCCCCcc Confidence 112233 355679999999999999988777766554 244554 45565566678899999999999999 Q ss_pred EEEEeecce------eEEEEcCC--chhh-----cc-cccc-hhcCCcC-----CCc------eeeccCCEEEEEECCEE Q lcl|Aclame:pro 208 LLAGDYPLG------AATVILPT--LPEL-----GR-PAQF-RHLSPMP-----TGK------HLAYWRGRLLIARANVL 261 (396) Q Consensus 208 ~lv~e~~~~------~~~~~d~~--~~~l-----g~-~l~t-~~~~ppP-----~g~------~~~~~nGrl~~a~Gn~l 261 (396) ++++.+++. +.+|+|.. .+.- ++ .-+| +.|..+| -.. .+-+|-|-|+-+ T Consensus 373 ~~i~rv~v~~an~~gttt~~D~n~~IPgt~~vfVgems~~ti~~~ellPm~klpLA~~~~~~~waVl~YGaLal~----- 447 (463) T protein:vir:99 373 FLIKRVPVKDAQEDGTIVFVDKNETLPETADVFVGEMSPQVVHLFELLPMMKLPLAQINASITFAVLWYGALALR----- 447 (463) T ss_pred eeEEEEEecccCCCceEEEeecccccCCceeEeeeccCchhhhhHhhhHhhhCCchhccchhhhHHHHhhHHHhh----- Confidence 999888554 45666631 2211 11 0112 1112222 111 123333333222 Q ss_pred EEccCCCCcccccccccEecCcceEEEEEc Q lcl|Aclame:pro 262 RFSEALAYHLHDERYGFVQMPQRITFVQPV 291 (396) Q Consensus 262 ~fSEp~~p~aw~~~y~~~~~~~~I~~i~~v 291 (396) .|--|=.-+| |.- .|| T Consensus 448 ------~Pk~~~~ikN-------v~~-~~v 463 (463) T protein:vir:99 448 ------APKKWARIKN-------VRY-IAV 463 (463) T ss_pred ------ccccceEEEE-------eeE-ecC Confidence 2222211111 111 111 No 39 >protein:vir:102644 Length: 594 # NCBI annotation: Hypothetical protein # Family: family:all:780 # MgeID: mge:1624 # MgeName: VP2 # Cross-refs: genbank:acc:YP_024422;genbank:gi:48696643;genbank:GeneID:2948111 Probab=96.12 E-value=0.00098 Score=37.08 Aligned_cols=303 Identities=10% Similarity=0.045 Sum_probs=131.6 Q ss_pred CCcccccce-ecc-----CCcCChhheeeCCCchhhheeeeeeeeeCCCccEEECCcceeecCCccccccCCcccccEEE Q lcl|Aclame:pro 1 MATTSLVPL-AGI-----NNVAEDAALQRGGESPRLYVRDAVNIDLSPAGKAQLRASVRQVTDQPFRQLWQSPLHGDAFG 74 (396) Q Consensus 1 m~~~~~~p~-~G~-----nn~~~~~~L~~~~~~~~~~lrdAvNVD~~~~G~l~~R~G~~~~~~~~~~~lw~s~~~~~~~~ 74 (396) ||++--.-| .|. --+.+..+- ...++.+.|+-+...|.+.||+|.+.+...+.- =|...|-.+.|. T Consensus 1 m~~~~~~~F~~GelsP~l~~r~Dl~~y-------~~~~~~~~n~~~~~~G~~~rR~G~~~~~~~~~~-~~~~~lipF~~s 72 (594) T protein:vir:10 1 MADFSQTSFKGGVIAPRLQFNEYESAY-------HHSIEDAVNFVVTEQGSLITRCGSEEVGLCQDG-EVRLFRLPAVDA 72 (594) T ss_pred CceeeccccCcceecceeccchhHHHH-------HHHHhhhhceEEEecCCeecCChhHhhhhccCC-CCCEEEEEEEeC Confidence 999777777 332 122222211 235889999999999999999999998755321 111111111111 Q ss_pred EECCeEEEEecCCCceeecccccCcceehhhcCCeEEEEcCCcceeecCceeeeccccCCccceeecCCCCcccceEEEE Q lcl|Aclame:pro 75 ALGDQWGKVDPHSWTFEPLAQIGEGDLSHEVLNNRVCVAGTAGIFTYDGAQAERLTLDTPAPPLLVAGAGSLSQGTYGAA 154 (396) Q Consensus 75 ~~dg~L~~i~~~~w~~~vl~~ig~gpV~~~v~n~rvy~t~~~~~~~~~g~~~~~l~ip~Pa~p~~~~~~Gsl~~g~y~ya 154 (396) +-..-++ ..|...+ |+| .++......+++ .++.+.+|- .....+.|.. T Consensus 73 ------------~~~~~~l-e~g~~~~-------r~~-~~~~~~v~~~~~--~~~~~~tp~---~~t~~~~l~~------ 120 (594) T protein:vir:10 73 ------------PSNDVIV-EVGNTNI-------AVW-VNDVRQVVANTP--SEWRNTIDR---IQTAYDTIGD------ 120 (594) T ss_pred ------------CCCeEEE-EEcCCeE-------EEE-ecCcEEEEccCC--Ccccccccc---eeeccCCccc------ Confidence 0011111 2222211 222 222211112221 122222221 1111122321 Q ss_pred EEEEcCCCcccccccceeEecCCCccEEEeecCCCCCcceEEEEEEecCCCeEEEEEeecceeEEEEcCCchhhcccccc Q lcl|Aclame:pro 155 VAWLRGPQESAPSLIAFAEVTDAGALEVTFPLCLDASVTGARLYLTRANGGELLLAGDYPLGAATVILPTLPELGRPAQF 234 (396) Q Consensus 155 ~T~V~~~gEeg~~~~~S~~vt~~~~~~v~lp~~~~~~i~~~RIYrs~~~g~~~~lv~e~~~~~~~~~d~~~~~lg~~l~t 234 (396) +.|... +..+.+ ..++..-.||||.+ .+.+-+ ...+.....+.+. . T Consensus 121 i~~tqs------------------ad~~~~---~~~~~~p~~L~R~~--~~~w~~-~~~~~~~~p~~~~----------~ 166 (594) T protein:vir:10 121 DAGAAN------------------TGRLIM---VHPALQPKRLYRDN--NNAWQF-VNMHTGAVPAEWS----------P 166 (594) T ss_pred eEEEEE------------------eeEEEE---EcCCCCceEEEEcc--CCCceE-EecccCccccccc----------C Confidence 111110 001111 12233346788864 232222 2211111111100 0 Q ss_pred hhcCCcCCCceeeccCCEEEEEEC----CEEEEccCCCCcccccccccEec----------CcceEEEEEcCCcEEEEEc Q lcl|Aclame:pro 235 RHLSPMPTGKHLAYWRGRLLIARA----NVLRFSEALAYHLHDERYGFVQM----------PQRITFVQPVDGGIWVGQV 300 (396) Q Consensus 235 ~~~~ppP~g~~~~~~nGrl~~a~G----n~l~fSEp~~p~aw~~~y~~~~~----------~~~I~~i~~v~~gl~V~T~ 300 (396) .+| | ..+.+++.||+.+.. +.||+|.+..++-+...-. +.= +..|.-+.+...+|+++|+ T Consensus 167 ~~~---p--~~v~f~q~RL~f~~~~~~p~~v~~Srtgd~~nF~~~~~-~~ddd~i~~~~s~~~~~~~~v~~~~~L~i~t~ 240 (594) T protein:vir:10 167 SNY---P--QTVGIFQNRVWYVGSPVHRTYFWATRAGKLEDIAPSTA-NNPNDPISFVGIMEGTPCWIIASSDVLTIGTT 240 (594) T ss_pred Ccc---c--eEEEEEeeeEEEEeCCCCCceEEEEecccccccccCCC-CCCCccEEEEEecccceEEEEecCCceEEEec Confidence 111 1 567899999998876 4799999888775532211 111 2344455566778999999 Q ss_pred CcEEEEEc-----CchhheeeeeeccCCCcccceeecchhhhccccccCcccEEEEecCCC-----EEEE-----cCCCc Q lcl|Aclame:pro 301 DHVAFLDG-----ADPASLSVSRRASRAPVPGSAVLVPAEVVGTNASPDGSPVAVWLAENG-----YVMG-----TSSGA 365 (396) Q Consensus 301 ~~~y~l~G-----~~p~~m~~~~~~~~~p~~~s~~~~~~~~~~~~~~~~~~~~~lw~s~~G-----lv~g-----~~~G~ 365 (396) +.-|.|+| .+|++.+..+.+... ++ .+ .....+..++|.+++| +.-- ..+.- T Consensus 241 ~~e~~l~~~~~~~lTp~~~~~~~~s~~g---~~--~~--------~P~~vg~~~~fv~~~g~~vre~~y~~~~d~y~~~d 307 (594) T protein:vir:10 241 INDYQLAASTGVSVTAATAILRRSSVQG---TA--AV--------QGIPAEEQVIFCSRNKSKVYAMNYVREQDNWIPDE 307 (594) T ss_pred CceEEEecCCCcccccceEEEEEeeeec---cC--CC--------cceeeCCeEEEEcCCCCEEEEEEEeeccCceeccc Confidence 99999988 445555555443221 11 11 1123456788988888 1110 01111 Q ss_pred EEEEecceeec----cccccceEE--EeCcEEE-EEeC Q lcl|Aclame:pro 366 IAEVHAGVLAG----ITGRAGTSV--VFDRRLL-TAVS 396 (396) Q Consensus 366 ~~~lt~~~~~~----~~a~~~~~~--~~~rr~v-~~~~ 396 (396) ++.+.+..+.- ........+ .-+++++ ++++ T Consensus 308 lt~~a~hl~~~~~~~~~~~i~~~a~~~~p~~~~~~v~~ 345 (594) T protein:vir:10 308 MSSQAQHLFTPISSAKGASVRRVAYISDAAKSLWVVLE 345 (594) T ss_pred hhhhhhhhcCccccccCceEEEEEEecCCceEEEEEeC Confidence 12221111100 000000000 1112222 2222 No 40 >protein:vir:80835 Length: 464 # NCBI annotation: putative major capsid protein # Family: family:all:2450 # MgeID: mge:1885 # MgeName: phiEF24C # Cross-refs: genbank:acc:YP_001504125;genbank:gi:158079312;genbank:GeneID:5666484 Probab=95.31 E-value=0.00019 Score=40.93 Aligned_cols=264 Identities=15% Similarity=0.143 Sum_probs=97.0 Q ss_pred CCcccccceeccCCcCCh----hheeeCCCchhhheeeeeeeeeCCCcc-E----------EECCcceeecCCcc----c Q lcl|Aclame:pro 1 MATTSLVPLAGINNVAED----AALQRGGESPRLYVRDAVNIDLSPAGK-A----------QLRASVRQVTDQPF----R 61 (396) Q Consensus 1 m~~~~~~p~~G~nn~~~~----~~L~~~~~~~~~~lrdAvNVD~~~~G~-l----------~~R~G~~~~~~~~~----~ 61 (396) |-++-..-|=|=+..++. ..||-||= . .|-|+-|| |++.|+ | .-++||--.+|.=. | T Consensus 146 a~tiE~a~FyGds~l~~~~~~~~gleFDGl--~-~lI~~~NV-iDarG~~Ls~~~ln~Aa~~i~~~fGt~TD~~lp~~v~ 221 (464) T protein:vir:80 146 AKTIEWASFYGDSDLSENPDAGSGLEFDGL--A-KLIDKHNV-LDAKGASLTEALLNQASVLVGKGYGTPTDAYMPIGVQ 221 (464) T ss_pred HHHHHHHHhhhccccCCCCCCccccchhhh--H-hhcCCCce-eecCCCCcCHHHHhhhhhhhhcccCChhhcccchhHH Confidence 333444455554444432 33554332 1 33444454 222222 1 11122222211100 0 Q ss_pred -cc-cCCcccccEEEEECCeEEEEecCCCceeecccc-----cCcceehhhcCCeEEEEcCCcceeecCceeeeccccCC Q lcl|Aclame:pro 62 -QL-WQSPLHGDAFGALGDQWGKVDPHSWTFEPLAQI-----GEGDLSHEVLNNRVCVAGTAGIFTYDGAQAERLTLDTP 134 (396) Q Consensus 62 -~l-w~s~~~~~~~~~~dg~L~~i~~~~w~~~vl~~i-----g~gpV~~~v~n~rvy~t~~~~~~~~~g~~~~~l~ip~P 134 (396) ++ =|-....+.+..++++=. .....+ ..|.+.- ++-++ -++-.....+.. .. ..+| T Consensus 222 a~f~n~~l~~q~~~~~~n~~~~---------~~G~~v~~f~sa~G~i~L---~~s~~-m~~~~~ld~~~~--~~--~~ap 284 (464) T protein:vir:80 222 ADFVNQQLDRQVQVISDNGQNA---------TMGFNVKGFNSARGFIRL---HGSTV-MELEQILDENRM--QL--PNAP 284 (464) T ss_pred HHHHhhhcCceeEEEcCCCCcc---------eeeeecccccccccceec---cCccc-cCcccccccccc--cC--CCCc Confidence 00 000001111111111100 000000 0122221 01111 111111111110 00 1144 Q ss_pred cccee--ecCC---CCcc----cceEEEEEEEEcCCCcccccccceeEecC-CCccEEEee--cCCCCCcceEEEEEEec Q lcl|Aclame:pro 135 APPLL--VAGA---GSLS----QGTYGAAVAWLRGPQESAPSLIAFAEVTD-AGALEVTFP--LCLDASVTGARLYLTRA 202 (396) Q Consensus 135 a~p~~--~~~~---Gsl~----~g~y~ya~T~V~~~gEeg~~~~~S~~vt~-~~~~~v~lp--~~~~~~i~~~RIYrs~~ 202 (396) ++|.+ +... |... ++.|.|++..++..|||.|+..+...++. ..++.|++. ......-+.+.|||+.. T Consensus 285 aapsvt~tv~~~~~g~f~~~~~~~~~~Ykv~~vn~~GeS~ps~~~~~ti~~~~~~V~l~it~~~~~~~~p~yv~IYR~~~ 364 (464) T protein:vir:80 285 QKATVKATLEAGTKGKFRDEDLTIDTEYKVVVVSDDAESAPSDVASVVIDDKKKQVKLEITINNMYQARPQYVAIYRKGL 364 (464) T ss_pred CCceeEEEecCCcccCCccccccceeEEEEEEECCCCccccceeeeeeecCcccEEEEEEEeCCccccccceEEEEeecC Confidence 43322 1221 2222 45688999999999998888766555553 344555543 22222236799999999 Q ss_pred CCCeEEEEEeeccee-----EEEEcCC--chhhc-----cc-ccc-hhcCCcCCCce-eeccCCEEEEEECCEEEEccCC Q lcl|Aclame:pro 203 NGGELLLAGDYPLGA-----ATVILPT--LPELG-----RP-AQF-RHLSPMPTGKH-LAYWRGRLLIARANVLRFSEAL 267 (396) Q Consensus 203 ~g~~~~lv~e~~~~~-----~~~~d~~--~~~lg-----~~-l~t-~~~~ppP~g~~-~~~~nGrl~~a~Gn~l~fSEp~ 267 (396) ++++||+++.+|+.. .+|.|.. .+.-+ +. -++ +.|..+|-=++ ++..|- ....-++||--.. T Consensus 365 ~~g~f~~i~rv~~~~~~~gt~t~vD~n~~IPgt~~vfVgems~~ti~l~ellPm~rlplA~~n~---~~~waVl~YGaLa 441 (464) T protein:vir:80 365 ETGLFYQIARVPASKAVEGVITFIDVNDEIPETADVFVGELTPSVVHLFELLPMMRLPLAQVNA---SVTFAVLWYGALA 441 (464) T ss_pred CCCceeEEEEEeeccccCCceEEEecccccCCceeEeeecCCchHHHHHHHHHhhhCCchhccc---chhhhhhhhhHHh Confidence 999999999986654 4566631 22111 10 011 11111221111 111111 0111223332211 Q ss_pred CCcccccccccEecCcceEEEEEc--CC Q lcl|Aclame:pro 268 AYHLHDERYGFVQMPQRITFVQPV--DG 293 (396) Q Consensus 268 ~p~aw~~~y~~~~~~~~I~~i~~v--~~ 293 (396) +.-+++ |... ..|+.|++- ++ T Consensus 442 ---l~aPk~-~~~i-kNv~~~~~~~~~~ 464 (464) T protein:vir:80 442 ---LRAPKK-WARI-KNVKYIATGNVFN 464 (464) T ss_pred ---hhcccc-ceEE-EEEEEeecccCCC Confidence 111121 1111 234444432 22 No 41 >protein:vir:105525 Length: 472 # NCBI annotation: phage DNA stabilization protein # Family: family:all:1540 # MgeID: mge:1463 # MgeName: phiSG1 # Cross-refs: genbank:acc:YP_516194;genbank:gi:89885997;genbank:GeneID:3964385 Probab=94.75 E-value=0.0036 Score=33.99 Aligned_cols=279 Identities=19% Similarity=0.192 Sum_probs=142.8 Q ss_pred CCcccccceeccCCcCChhheeeCCCchhhheeee-eeee------eCCCccEEECCcceeecCC--ccccccCCccccc Q lcl|Aclame:pro 1 MATTSLVPLAGINNVAEDAALQRGGESPRLYVRDA-VNID------LSPAGKAQLRASVRQVTDQ--PFRQLWQSPLHGD 71 (396) Q Consensus 1 m~~~~~~p~~G~nn~~~~~~L~~~~~~~~~~lrdA-vNVD------~~~~G~l~~R~G~~~~~~~--~~~~lw~s~~~~~ 71 (396) |+-.+|+-..||-.--.+.. +..-- ||+- .+.+|-+..=+|.++..+. ..+-+|..-..+. T Consensus 1 m~~~q~pl~~g~~~~~~~~~----------~~~~lpvN~y~~p~~~~~ss~~lr~~PG~~~~~~~~g~~RG~~~~~~~~~ 70 (472) T protein:vir:10 1 MAIMQLPLLRGLGKARDDAD----------YIDALPVNMLATPKPVLNASGYLRSFPGITHKAEVAGVSRGVQYNTHEKT 70 (472) T ss_pred CCceeeecccccccCccccC----------ceeeeeeeeeeccccccccceeecccCCceeecCCCcccceeEeeeeCCe Confidence 98888865588754211110 11111 4442 2366777777899887554 5577764333467 Q ss_pred EEEEECCeEEEEecCCCceeecccc-cCcceehh--hcCCeEEEEcCCcceeecCceeeeccccCCccceeecCCCCccc Q lcl|Aclame:pro 72 AFGALGDQWGKVDPHSWTFEPLAQI-GEGDLSHE--VLNNRVCVAGTAGIFTYDGAQAERLTLDTPAPPLLVAGAGSLSQ 148 (396) Q Consensus 72 ~~~~~dg~L~~i~~~~w~~~vl~~i-g~gpV~~~--v~n~rvy~t~~~~~~~~~g~~~~~l~ip~Pa~p~~~~~~Gsl~~ 148 (396) +|.+.++.|++++.+ + ..| |.|||.+. --++.|.+.++...++|+|...+.-.- |+....+. +.+ T Consensus 71 lY~V~G~~Ly~v~~~-----v-G~iagsg~VsMa~~~~~q~v~v~g~~~~y~y~g~~~t~~~~--~~~~~it~--~dl-- 138 (472) T protein:vir:10 71 VYRGLGNQLYKGHKP-----I-ADLAGKGRISMAFSRNSQAVVAAGKMTLYRYDGTVKTLENW--PKEKKYTQ--YDI-- 138 (472) T ss_pred EEEEecceEEEEEee-----e-eeecccccEEEEecCCceEEEEecceeEEEeccchhhhhhc--cccccCCc--ccc-- Confidence 889999999998643 2 233 67777662 223345555544455554321110000 00000000 000 Q ss_pred ceEEEEEEEEcCCCcccccccceeEecCCCccEEEeecCCCCCcceEEEE-EEecCCCeEEEEEeecceeEEEEcCCchh Q lcl|Aclame:pro 149 GTYGAAVAWLRGPQESAPSLIAFAEVTDAGALEVTFPLCLDASVTGARLY-LTRANGGELLLAGDYPLGAATVILPTLPE 227 (396) Q Consensus 149 g~y~ya~T~V~~~gEeg~~~~~S~~vt~~~~~~v~lp~~~~~~i~~~RIY-rs~~~g~~~~lv~e~~~~~~~~~d~~~~~ 227 (396) + ....++..+=| +=...|+..+++-+|.-.+ +.+ T Consensus 139 ------------------------------------~--~~~~v~~~dGyfV~~~~gt~~~~iS~L~d~s--~~~----- 173 (472) T protein:vir:10 139 ------------------------------------G--NVRDMCHLRGRYVWCKDGSDIFGVTDLEDES--HPD----- 173 (472) T ss_pred ------------------------------------C--CceeEEEeCceEEEeecCCceEEEeecCCcc--cCC----- Confidence 0 00112222222 2122333333332221110 000 Q ss_pred hcccccchhcCCcCCCceeeccCCEEEEEECCEEEEccCCCCcccccccccEecCcceEEEEEcCCcEEEEEcCc--EEE Q lcl|Aclame:pro 228 LGRPAQFRHLSPMPTGKHLAYWRGRLLIARANVLRFSEALAYHLHDERYGFVQMPQRITFVQPVDGGIWVGQVDH--VAF 305 (396) Q Consensus 228 lg~~l~t~~~~ppP~g~~~~~~nGrl~~a~Gn~l~fSEp~~p~aw~~~y~~~~~~~~I~~i~~v~~gl~V~T~~~--~y~ 305 (396) ++.+++--| --|+.|++|...-+-||+.=+.. +|. T Consensus 174 --------------------------------------~~~~FatAE-----~~pD~Ivgi~~~~~~i~lfG~~TiEvw~ 210 (472) T protein:vir:10 174 --------------------------------------RYRALYRAE-----SQPDGIIGIDSWRDFIVCFGASTIEYFS 210 (472) T ss_pred --------------------------------------cccceeeec-----CCCCceEEEEeeccEEEEEeccceEEEE Confidence 222222212 23688888888888888764444 589 Q ss_pred EEcCchhheeeeeeccCCCcccceeecchhhhccccccCcccEEEEecCC----CEEEEcCCCcEEEEecc----eeecc Q lcl|Aclame:pro 306 LDGADPASLSVSRRASRAPVPGSAVLVPAEVVGTNASPDGSPVAVWLAEN----GYVMGTSSGAIAEVHAG----VLAGI 377 (396) Q Consensus 306 l~G~~p~~m~~~~~~~~~p~~~s~~~~~~~~~~~~~~~~~~~~~lw~s~~----Glv~g~~~G~~~~lt~~----~~~~~ 377 (396) .+|..+ ..+++. .++| ++-++-+|++..+..-.++++.|+|.+ |.|--..++++..++-- .++-. T Consensus 211 ntG~a~--fpf~r~-~~~p----g~~iq~Gcaa~~sv~~~~~s~~~l~~d~~g~~~V~~~~g~q~~rIST~aIE~~i~~y 283 (472) T protein:vir:10 211 LTGAAD--GQSAIY-AAQP----ALMVEKGIAGTHCKTRLGDAHVIISHQATGAPSVFLINQAQATSIATATIEKILRSY 283 (472) T ss_pred ecCCCC--cceeee-ccCc----cceeeecccCchhhhhhCceEEEEecCCCcceEEEEccCceEEEecCHHHHHHHHhC Confidence 999886 444443 2233 234566799999999999999999999 77877888888877433 23222 Q ss_pred cc-c---c-ceEEEeC-cE-EEEEeC Q lcl|Aclame:pro 378 TG-R---A-GTSVVFD-RR-LLTAVS 396 (396) Q Consensus 378 ~a-~---~-~~~~~~~-rr-~v~~~~ 396 (396) ++ + + +=+..++ +. |+-++- T Consensus 284 ~~~e~~dA~~~s~~~eGH~fy~LtfP 309 (472) T protein:vir:10 284 THDELASAVMETVRFDSHELVLIHLS 309 (472) T ss_pred CcccccceeEEEEEeCCeEEEEEEcC Confidence 21 1 1 1122222 32 222222 No 42 >protein:vir:3133 Length: 911 # NCBI annotation: hypothetical protein # Family: family:all:5234 # MgeID: mge:64 # MgeName: VpV262 # Cross-refs: genbank:acc:NP_640315;genbank:gi:21234408;genbank:GeneID:956056 Probab=92.34 E-value=0.012 Score=31.16 Aligned_cols=364 Identities=14% Similarity=0.133 Sum_probs=164.0 Q ss_pred CCc-----ccccceeccCCcCChhheeeCCCchhhheeeeeeeeeCCCccEEECCccee--------------------e Q lcl|Aclame:pro 1 MAT-----TSLVPLAGINNVAEDAALQRGGESPRLYVRDAVNIDLSPAGKAQLRASVRQ--------------------V 55 (396) Q Consensus 1 m~~-----~~~~p~~G~nn~~~~~~L~~~~~~~~~~lrdAvNVD~~~~G~l~~R~G~~~--------------------~ 55 (396) |+- -++.|+.|--.--. |.--|+ ...-|.-|.||..+|--|||-|.-- + T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~---~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 74 (911) T protein:vir:31 1 MAARKGAVNRFTPVRGWVTEGN---LANYGQ---DVALDVENMDIEKTGLTQRRFGLFAETSSEQFLSTFTATARARGLL 74 (911) T ss_pred CccccccccccccceeeeecCc---hhhcCc---eeEeeeccccchhcccchhheeeeeccchhhhhhhhhhhhhhccee Confidence 643 46788877532211 111111 1223567999999999999977421 1 Q ss_pred cCCccccccCCcccccEEEEECCeEEEEecCCCce---------------eecccccCcceehhhcCCeEEEEcCCc-ce Q lcl|Aclame:pro 56 TDQPFRQLWQSPLHGDAFGALGDQWGKVDPHSWTF---------------EPLAQIGEGDLSHEVLNNRVCVAGTAG-IF 119 (396) Q Consensus 56 ~~~~~~~lw~s~~~~~~~~~~dg~L~~i~~~~w~~---------------~vl~~ig~gpV~~~v~n~rvy~t~~~~-~~ 119 (396) .-..++.-|...+. .+++.+.|--+++--++-+- ..++.+--+||--.+--+.-.+++.+. +. T Consensus 75 ~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 153 (911) T protein:vir:31 75 AVKEWREAWGDKDV-NMLIFHAGYKVHVVQDTAPLRDANILLTIDLLEAGIKLDGVIDSPVHISVGVGFAIITNPRIEPV 153 (911) T ss_pred ehhhHHHhhCCCcc-eEEEEecCcEEEEEecccCccccceEEEeeeeccCceeeeeecCceeEEeeceEEEeecCccceE Confidence 12235666776553 33444433222221121111 011111123433222223444454321 11 Q ss_pred ee--cCceeeeccccCCc-cce---------ee------cCCCCcccce--EEEEEEE---E------------------ Q lcl|Aclame:pro 120 TY--DGAQAERLTLDTPA-PPL---------LV------AGAGSLSQGT--YGAAVAW---L------------------ 158 (396) Q Consensus 120 ~~--~g~~~~~l~ip~Pa-~p~---------~~------~~~Gsl~~g~--y~ya~T~---V------------------ 158 (396) .. |.. ..-++|+-+ .|+ .+ ..+.+|.++. ..|---| - T Consensus 154 ~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 231 (911) T protein:vir:31 154 LIKLDDV--DDEGVPTLSYEPLTLLIRTRELLTPYTTGTNYGDTLTPEEEWNLYNSGWATITRATKDKSGSGTVYVNPVQ 231 (911) T ss_pred EEEeecc--CccCcccccccceeeEeeehhhccccccccccCcccCchhhcccccccceeeeeecccCCccceEEEchhh Confidence 10 100 011111110 010 00 1111222221 0010000 0 Q ss_pred ----------------cCCCcccccccceeEecCC-CccEEEe---ecCCCCCcceEEEEEEecCCCeEEE--EEeecce Q lcl|Aclame:pro 159 ----------------RGPQESAPSLIAFAEVTDA-GALEVTF---PLCLDASVTGARLYLTRANGGELLL--AGDYPLG 216 (396) Q Consensus 159 ----------------~~~gEeg~~~~~S~~vt~~-~~~~v~l---p~~~~~~i~~~RIYrs~~~g~~~~l--v~e~~~~ 216 (396) ++..+|++-+++...|-.+ ..-.+++ .+++++-|- +.||| ++.+..+ T Consensus 232 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----------~~~~~~~~~~~~~~ 300 (911) T protein:vir:31 232 YYFDKRGVYPSHSVLYNSMKQESAKEIVALNVFSPWADEKINFGTTTPPLGRYIH-----------SAYYFDSAAILSLG 300 (911) T ss_pred eeecccCcCcchhhhhhhhhhhccceeEEEeeeccccccccccccCCCchhhhhh-----------hheeeccceeeeec Confidence 1122222222221111000 0000110 011111111 12232 2344444 Q ss_pred eEEEEcCCchhhcc-cccchhcCCcCCC----------ceee----------------ccCCEEEEEE-----CCEEEEc Q lcl|Aclame:pro 217 AATVILPTLPELGR-PAQFRHLSPMPTG----------KHLA----------------YWRGRLLIAR-----ANVLRFS 264 (396) Q Consensus 217 ~~~~~d~~~~~lg~-~l~t~~~~ppP~g----------~~~~----------------~~nGrl~~a~-----Gn~l~fS 264 (396) ....+-+..+...+ .-++..=..-|.| |+|+ +++||+|.++ ++.++|| T Consensus 301 ~~~~~~~~~~~~~~~~~p~~~e~~np~gl~~igt~~n~k~~a~~~~~~~~~~r~r~~~~yaGRVfyaD~dkngk~rIlFS 380 (911) T protein:vir:31 301 IGNLTPPTSDGTTEGSGPAEEEISNPIGLDNIGTVNNLKLIAEGTVRWTVKDRPRCSGYHNGHVYFGDRDKNGKTRILVS 380 (911) T ss_pred ccccCCCCCCCccCCCCCchhhhcCCCCcccccchhceeeeeccceeeeecccccceeeeccEEEEeeeccCcceeEEEE Confidence 44443333321100 0111111112222 3555 8999999996 5689999 Q ss_pred c-----CCCCcccccc--------------cccEec--CcceEEEEEcCCcEEEEEcCcEEEEEcCchh-----heeeee Q lcl|Aclame:pro 265 E-----ALAYHLHDER--------------YGFVQM--PQRITFVQPVDGGIWVGQVDHVAFLDGADPA-----SLSVSR 318 (396) Q Consensus 265 E-----p~~p~aw~~~--------------y~~~~~--~~~I~~i~~v~~gl~V~T~~~~y~l~G~~p~-----~m~~~~ 318 (396) . .+.++-+... =+|+.. -++|+.|..++++|.|.-+..+|-+.|.+|- +.+..| T Consensus 381 qLv~sl~di~nCYQdaDPTSeee~DLIdTDGg~vri~gah~Ii~LV~~G~sLlVFcaNGVWAI~G~d~~g~TATdy~ItK 460 (911) T protein:vir:31 381 QLVNSLDNIPKCFQDADPTAEEINDLIATDGFTMYPVGMGAPITMVEFNKRLLLLCTNGVWAIRGTSGGGATATDFTLDK 460 (911) T ss_pred eeccccccccccccCCCccccccchhhhcCCcEEecCCCCCceEEEEecCeEEEEEeCcEEEEeccCCCceeeeeeEEEE Confidence 6 2233322211 011111 2559999999999999999999999999974 457778 Q ss_pred eccCCCcccceeecchhhhccccccCcccEEEEecCCCEEEEcCCC----cEEEEecceeec----ccc--ccceEEEe- Q lcl|Aclame:pro 319 RASRAPVPGSAVLVPAEVVGTNASPDGSPVAVWLAENGYVMGTSSG----AIAEVHAGVLAG----ITG--RAGTSVVF- 387 (396) Q Consensus 319 ~~~~~p~~~s~~~~~~~~~~~~~~~~~~~~~lw~s~~Glv~g~~~G----~~~~lt~~~~~~----~~a--~~~~~~~~- 387 (396) ++.. +|-+..|.|..+...+|.|..|+|....+- ...+||++.+.. ++. ...+...+ T Consensus 461 Isdv------------GcsspNSVVvVgn~i~fWSd~GIyaLganqfnD~tAnNLTesTIQ~y~d~I~~dkIkNVtgtyd 528 (911) T protein:vir:31 461 VASV------------EFNSPQSVVDIGTAIVFWSERGIIAIGVNDFGDLTSNNLTENTIDEYYDSLDRDIIKNVKGTFI 528 (911) T ss_pred Eeee------------eeCCCCeEEEecCceEEeeCCcEEEEeecccCccccccccHHHHHHHHhhcChhhhceEEEEEE Confidence 7542 377788999999999999999999865542 233556555532 222 23444444 Q ss_pred --CcEEEEEeC Q lcl|Aclame:pro 388 --DRRLLTAVS 396 (396) Q Consensus 388 --~rr~v~~~~ 396 (396) +.|+.-++. T Consensus 529 ~de~rVyW~yP 539 (911) T protein:vir:31 529 NDENRVYWVVP 539 (911) T ss_pred ccCCEEEEEec Confidence 455555555 No 43 >protein:vir:1778 Length: 680 # NCBI annotation: tail protein A # Family: family:all:825 # MgeID: mge:38 # MgeName: P60 # Cross-refs: genbank:acc:NP_570344;genbank:gi:18640503;genbank:GeneID:932716 Probab=92.19 E-value=0.012 Score=31.03 Aligned_cols=361 Identities=12% Similarity=0.066 Sum_probs=131.6 Q ss_pred CCcccccceeccC-----------------------CcCChhheeeCCCchhhheeeeeeeeeCCCccEEECCcceeecC Q lcl|Aclame:pro 1 MATTSLVPLAGIN-----------------------NVAEDAALQRGGESPRLYVRDAVNIDLSPAGKAQLRASVRQVTD 57 (396) Q Consensus 1 m~~~~~~p~~G~n-----------------------n~~~~~~L~~~~~~~~~~lrdAvNVD~~~~G~l~~R~G~~~~~~ 57 (396) +....+ =+-|-. .-++......+.+...-.+.++++++. +++..-.+ T Consensus 166 g~ty~v-~ing~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~Ag~~t~~~~~~a~la~~l~~~~---------~~~~~~~g 235 (680) T protein:vir:17 166 GTSYIV-DFATEDSGQQRRWAVQEMQAPKTKRKKGDGSPDEAGETTVNNWNGTGLSFRVKVEA---------RAFLVDDG 235 (680) T ss_pred eeEEEE-EEeccccceeeeeeeeeeeccccccccccccCCCCcceeeeeeeeeeeeeeeeecc---------ceeeecCC Confidence 111000 001111 111111111112222222333333221 11111111 Q ss_pred Ccc-------------ccccCCcccccEEEEECC-eE-EEEecC---------------------C--Ccee-ec----c Q lcl|Aclame:pro 58 QPF-------------RQLWQSPLHGDAFGALGD-QW-GKVDPH---------------------S--WTFE-PL----A 94 (396) Q Consensus 58 ~~~-------------~~lw~s~~~~~~~~~~dg-~L-~~i~~~---------------------~--w~~~-vl----~ 94 (396) ..+ .+-|++-. .++.....| ++ ++|+.. . ++.+ ++ . T Consensus 236 ~~~~~~y~~~~~l~~tg~~~~~~~-~t~~v~~~G~~y~IsI~~~~~~~~~~~~~s~~~~t~~~~~a~~at~~~Ia~~L~~ 314 (680) T protein:vir:17 236 EEYGHNYIPYVTLLTPGNNTSPFP-DTIRVDVSGEGWDIKVTKQIQSKVYANLGTAQFTTPVDQSGGGASTSDIVTGLSA 314 (680) T ss_pred CceEEEEeeEEEEecCCccccccC-ceEEEecccceeEEEEccceeeEeccCccceeeeeccCCcccceeHHHHHHHHHH Confidence 100 12232211 111111222 11 111111 0 1111 00 0 Q ss_pred cc-cCcceehhhcCCeEEEEcCC--c--cee---ecCceee-----ecccc----CCccc---eeecCCCCcccceEEEE Q lcl|Aclame:pro 95 QI-GEGDLSHEVLNNRVCVAGTA--G--IFT---YDGAQAE-----RLTLD----TPAPP---LLVAGAGSLSQGTYGAA 154 (396) Q Consensus 95 ~i-g~gpV~~~v~n~rvy~t~~~--~--~~~---~~g~~~~-----~l~ip----~Pa~p---~~~~~~Gsl~~g~y~ya 154 (396) .+ +.+-+.+.+..+-+|+-... . ... .+|...+ .-.+. -|+.. ..+...++.......|- T Consensus 315 ~i~~~~~~~~~~~g~~i~i~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~v~~~~~Lp~~a~~g~~v~v~~~~~~~~~~Yy 394 (680) T protein:vir:17 315 AINGLGTFTAESIGNVIRVRYSDPTRTDEFTMSARGGTSGTGLESIKYSVDTLAELPTKCWNDYQVAVRNTQDTEVDDYY 394 (680) T ss_pred hhcccCcEEEEECCCEEEEEeccCCCceEEEeeccCCCCceeeeeeeeeeccccccccccCCCcEEEEEeCCCCcccceE Confidence 11 11334455566667773211 1 111 1221111 01111 12211 11111223333445566 Q ss_pred EEEEcCCCcccccccceeEecCCCccEEEeecCCCCCcceEEEEEEecCCCeEEEEEeecceeEEEEcCCchhhcccccc Q lcl|Aclame:pro 155 VAWLRGPQESAPSLIAFAEVTDAGALEVTFPLCLDASVTGARLYLTRANGGELLLAGDYPLGAATVILPTLPELGRPAQF 234 (396) Q Consensus 155 ~T~V~~~gEeg~~~~~S~~vt~~~~~~v~lp~~~~~~i~~~RIYrs~~~g~~~~lv~e~~~~~~~~~d~~~~~lg~~l~t 234 (396) +.|....+++...++.+=.-+.+.++...+ ....-..+|||.+.+.-.|- ..+.......+.+.. .|. .+| T Consensus 395 v~~~~~~~~~~~~~~~~W~E~~~~~~~~~~----~~~tmp~~l~r~~~g~f~~~-~~~~~~~~~~~~~r~---~Gd-d~t 465 (680) T protein:vir:17 395 VKFETDVEDADVPGSGYWVETVKNGDDGGL----VDDTMPHVLVRNALGDFTFS-SLNNSSYGKTWADRS---VGS-EDT 465 (680) T ss_pred EEEeccCcccCcccccceeecccCccccee----ccCcceEEEEEccCceeEEE-eeccccccccccccc---cCC-ccc Confidence 777655444433332221001112221111 12222478998752221222 111111111122211 111 111 Q ss_pred hhcCCcCC----Cc---eeeccCCEEEEEECCEEEEccCCCCccccccc-------ccEec------CcceEEEEEcCCc Q lcl|Aclame:pro 235 RHLSPMPT----GK---HLAYWRGRLLIARANVLRFSEALAYHLHDERY-------GFVQM------PQRITFVQPVDGG 294 (396) Q Consensus 235 ~~~~ppP~----g~---~~~~~nGrl~~a~Gn~l~fSEp~~p~aw~~~y-------~~~~~------~~~I~~i~~v~~g 294 (396) .|+|. |. -+.|+++||..+.++.||+|....++-|-..- +-+.+ ...|.-+.++... T Consensus 466 ---np~psF~~~G~~p~~v~f~q~RL~f~s~~~v~~Srtgd~~nF~~~t~~~~~DdD~I~~~~ss~~~~~i~~~v~~~~~ 542 (680) T protein:vir:17 466 ---NPHPTFTESGNGIYGMFMYKNRLGFLTQDAVIMSQVGDYFNFYATSGVTISDADPIDMATSDTKPVKLEAAISSTSG 542 (680) T ss_pred ---CCCcccccCCCCceEEEEEcceEEEeeCCeEEEEccCCcccccccccccCCCCccEEEEEcCCcceeeeEEeecCCc Confidence 22332 43 46899999999999999999998887763221 11222 4567778999999 Q ss_pred EEEEEcCcEEEEEcC----chhheeeeeeccCCCcccceeecchhhhccccccCcccEEEEecCCC-------EEEEcCC Q lcl|Aclame:pro 295 IWVGQVDHVAFLDGA----DPASLSVSRRASRAPVPGSAVLVPAEVVGTNASPDGSPVAVWLAENG-------YVMGTSS 363 (396) Q Consensus 295 l~V~T~~~~y~l~G~----~p~~m~~~~~~~~~p~~~s~~~~~~~~~~~~~~~~~~~~~lw~s~~G-------lv~g~~~ 363 (396) |+++|++.-|.++|. +|++.++.+.+..- |-..-..+..+..++|+++.| +.....+ T Consensus 543 L~l~t~g~q~~ls~~~~~lTP~~~~i~~~s~~~------------~~~~~~Pv~vG~~v~Fv~~~g~~s~vre~~y~~~~ 610 (680) T protein:vir:17 543 AILFGNQAQFRLSSPDESFGPKTATLDKISNYT------------YESKADPVQTGVSMIFPTNMGTYSSVYELSTESAK 610 (680) T ss_pred EEEEecCeEEEEecCCceecceeEEEEEEEeec------------ccCCCCceEeCCeEEEeecCCCcceEEEEeeeecc Confidence 999999999999884 34444444432211 111222334556899998887 1111111 Q ss_pred CcE-----EEEecceeecccc-------ccceEEEe--CcEEEEEeC Q lcl|Aclame:pro 364 GAI-----AEVHAGVLAGITG-------RAGTSVVF--DRRLLTAVS 396 (396) Q Consensus 364 G~~-----~~lt~~~~~~~~a-------~~~~~~~~--~rr~v~~~~ 396 (396) ..- +.+-+..|....- ..-..++. +...+.++. T Consensus 611 d~y~a~DlT~~a~hl~~g~v~~~~~~~~~~~~~~~~~~~~~~l~~~~ 657 (680) T protein:vir:17 611 GTPVIEDSSRVIPRLIPSGLTWSTASMNNDTVFFGNAKKGRNVYVFR 657 (680) T ss_pred CceehhhHHHHHHHhcCCceEEEEeeCCCCeEEEEEEcCCCEEEEEE Confidence 110 0010111111000 00011111 111111111 No 44 >protein:vir:2109 Length: 472 # NCBI annotation: head completion protein # Family: family:all:1540 # MgeID: mge:46 # MgeName: P22 # Cross-refs: genbank:acc:NP_059633;genbank:gi:9635541;genbank:GeneID:1262840 Probab=90.91 E-value=0.018 Score=30.09 Aligned_cols=281 Identities=15% Similarity=0.137 Sum_probs=136.0 Q ss_pred CCcccccceeccCCcCChhheeeCCCchhhheeeeeeeeeCC------CccEEECCcceeecC--CccccccCCcccccE Q lcl|Aclame:pro 1 MATTSLVPLAGINNVAEDAALQRGGESPRLYVRDAVNIDLSP------AGKAQLRASVRQVTD--QPFRQLWQSPLHGDA 72 (396) Q Consensus 1 m~~~~~~p~~G~nn~~~~~~L~~~~~~~~~~lrdAvNVD~~~------~G~l~~R~G~~~~~~--~~~~~lw~s~~~~~~ 72 (396) |+-.+++-..||---..+. ..+ -.+. ||+...+ .|-|..=||.++..+ +..+-+...-..+.+ T Consensus 1 m~~~q~Pl~~g~~~~~~~~-d~~------~~~p--VN~~a~~~~~~~s~~~lr~tPG~~~~~~~~g~~RG~~~~t~~~~l 71 (472) T protein:vir:21 1 MPIQQLPMMKGMGKDFKNA-DYI------DYLP--VNMLATPKEILNSSGYLRSFPGITKRYDMNGVSRGVEYNTAQNAV 71 (472) T ss_pred CceEEeecccccccccccc-cee------eeee--eeeeeeccCCcccceeeeecCCcceeccCCCceeeeeecccCCeE Confidence 9988886668886533332 111 1122 4544433 344666688887544 344666643334568 Q ss_pred EEEECCeEEEEecCCCceeecccc-cCcceehhhcCCeEEEEcCCcceeecCceeeeccccCCccceeecCCCCcccceE Q lcl|Aclame:pro 73 FGALGDQWGKVDPHSWTFEPLAQI-GEGDLSHEVLNNRVCVAGTAGIFTYDGAQAERLTLDTPAPPLLVAGAGSLSQGTY 151 (396) Q Consensus 73 ~~~~dg~L~~i~~~~w~~~vl~~i-g~gpV~~~v~n~rvy~t~~~~~~~~~g~~~~~l~ip~Pa~p~~~~~~Gsl~~g~y 151 (396) |.+.+..|++++.+ + ..| |.|||.++-....+-+..++.... |.+...+++... ....+..+..+ T Consensus 72 y~V~G~~LY~v~~~-----~-G~i~gsgrVsMa~n~~~~~v~~~~~~~~------Y~~~~~~~t~~~-~~~d~~f~~~d- 137 (472) T protein:vir:21 72 YRVCGGKLYKGESE-----V-GDVAGSGRVSMAHGRTSQAVGVNGQLVE------YRYDGTVKTVSN-WPADSGFTQYE- 137 (472) T ss_pred EEEeCCceEEEeee-----e-eeecccccEEEeeCCeEEEEEECCceeE------EEEecchhhhhc-ccCcccccccc- Confidence 88999999999753 1 222 566776542222222222222211 122222221100 00001110000 Q ss_pred EEEEEEEcCCCcccccccceeEecCCCccEEEeecCCCCCcceEEEE-EEecCCCeEEEEEeecceeEEEEcCCchhhcc Q lcl|Aclame:pro 152 GAAVAWLRGPQESAPSLIAFAEVTDAGALEVTFPLCLDASVTGARLY-LTRANGGELLLAGDYPLGAATVILPTLPELGR 230 (396) Q Consensus 152 ~ya~T~V~~~gEeg~~~~~S~~vt~~~~~~v~lp~~~~~~i~~~RIY-rs~~~g~~~~lv~e~~~~~~~~~d~~~~~lg~ 230 (396) + +...+++..+=| +=.+.|+..|++-++.-.+ T Consensus 138 ------------------------------l----~~~~dv~f~dGyfV~~~~gt~~f~is~l~d~~------------- 170 (472) T protein:vir:21 138 ------------------------------L----GSVRDITRLRGRYAWSKDGTDSWFITDLEDES------------- 170 (472) T ss_pred ------------------------------c----cceeEEEEecceEEEccCCcceeEEecCCCCc------------- Confidence 0 000111111111 2123333323322211110 Q ss_pred cccchhcCCcCCCceeeccCCEEEEEECCEEEEccCCCCcccccccccEecCcceEEEEEcCCcEEEEEcCc--EEEEEc Q lcl|Aclame:pro 231 PAQFRHLSPMPTGKHLAYWRGRLLIARANVLRFSEALAYHLHDERYGFVQMPQRITFVQPVDGGIWVGQVDH--VAFLDG 308 (396) Q Consensus 231 ~l~t~~~~ppP~g~~~~~~nGrl~~a~Gn~l~fSEp~~p~aw~~~y~~~~~~~~I~~i~~v~~gl~V~T~~~--~y~l~G 308 (396) . | -.|.+++--| --|+.|++|...-+-||+.=+.. +|..+| T Consensus 171 --~-------~-----------------------~~y~~FatAE-----~~pD~Iv~i~~~~~~l~lfG~~TiEvw~ntG 213 (472) T protein:vir:21 171 --H-------P-----------------------DRYSAQYRAE-----SQPDGIIGIGTWRDFIVCFGSSTIEYFSLTG 213 (472) T ss_pred --c-------c-----------------------cCCccceeec-----cCCCceEEEEeeccEEEEEeccceEEEEecC Confidence 0 0 0111112111 24789999999999999875544 599999 Q ss_pred Cch-hheeeeeeccCCCcccceeecchhhhccccccCcccEEEEecCCC----EEEEcCCCcEEEEecc----eeeccc- Q lcl|Aclame:pro 309 ADP-ASLSVSRRASRAPVPGSAVLVPAEVVGTNASPDGSPVAVWLAENG----YVMGTSSGAIAEVHAG----VLAGIT- 378 (396) Q Consensus 309 ~~p-~~m~~~~~~~~~p~~~s~~~~~~~~~~~~~~~~~~~~~lw~s~~G----lv~g~~~G~~~~lt~~----~~~~~~- 378 (396) ... ...-+++. ++..++-+|++..+..-.++++.|+|.++ .|--..+++...++-- .++--. T Consensus 214 ~ad~~~fpy~r~--------~g~~iq~Gcaa~~sv~~~~~s~~~l~~~~~g~~~V~~~~g~qa~rIST~aIE~~i~~y~~ 285 (472) T protein:vir:21 214 ATTAGAALYVAQ--------PSLMVQKGIAGTYCKTPFADSYAFISHPATGAPSVYIIGSGQASPIATASIEKIIRSYTA 285 (472) T ss_pred CCCcCcCceEEc--------CcceeeecccCcchhhecCceEEEEecCCCcccEEEEccCceeEEecCHHHHHHHHhcCC Confidence 873 44444432 33456778999999999999999999998 3555677777766322 222211 Q ss_pred -cccce---EEEeC-cE-EEEEeC Q lcl|Aclame:pro 379 -GRAGT---SVVFD-RR-LLTAVS 396 (396) Q Consensus 379 -a~~~~---~~~~~-rr-~v~~~~ 396 (396) ..+-| +..++ +. |+-++- T Consensus 286 ~e~~~A~~~t~~~eGH~fy~LtfP 309 (472) T protein:vir:21 286 EEMATGVMETLRFDSHELLIIHLP 309 (472) T ss_pred ccccceEEEEEEeCCeEEEEEEcC Confidence 11111 22233 32 222222 No 45 >protein:vir:105428 Length: 472 # NCBI annotation: gene 8 protein # Family: family:all:1540 # MgeID: mge:1556 # MgeName: Sf6 # Cross-refs: genbank:acc:NP_958184;genbank:gi:41057286;genbank:GeneID:2716675 Probab=90.32 E-value=0.021 Score=29.73 Aligned_cols=252 Identities=15% Similarity=0.119 Sum_probs=117.1 Q ss_pred CCccce-eecCCCC-cccceEEEEEEE-EcCCCcccccccceeEecCCCccEEE--eecCC-C-------C---CcceEE Q lcl|Aclame:pro 133 TPAPPL-LVAGAGS-LSQGTYGAAVAW-LRGPQESAPSLIAFAEVTDAGALEVT--FPLCL-D-------A---SVTGAR 196 (396) Q Consensus 133 ~Pa~p~-~~~~~Gs-l~~g~y~ya~T~-V~~~gEeg~~~~~S~~vt~~~~~~v~--lp~~~-~-------~---~i~~~R 196 (396) .|--.+ +..+.+. ...+++. -.+ |+.+-+++...-+|-.+...+|+..- +++.- + . -|.... T Consensus 1 m~~~~~pl~~G~~~~~~~~d~~--~~~pVN~~a~~~~~~~s~~~l~~tPGl~~~a~v~G~~RG~~~~~~~g~lY~V~G~~ 78 (472) T protein:vir:10 1 MPIQQLPLMKGVGKDFRNADYI--DYLPVNMLATPKEILNSSGYLRSFPGIAKRSDVNGVSRGVEYNMAQNAVYRVCGGK 78 (472) T ss_pred CCeeeeeeccCceeeccccchh--heeeeeeeeeccCCCcccceeecCCCceeeccCCccccceEEEeeCCeEEEEecce Confidence 221111 1111111 1222222 111 33333333333334444433343321 11110 0 0 022234 Q ss_pred EEEEec---------------CCCeEEEEEeecceeEEEEcCCchhhcccccchhcCCcCC-C--ceeeccCCEEE-EEE Q lcl|Aclame:pro 197 LYLTRA---------------NGGELLLAGDYPLGAATVILPTLPELGRPAQFRHLSPMPT-G--KHLAYWRGRLL-IAR 257 (396) Q Consensus 197 IYrs~~---------------~g~~~~lv~e~~~~~~~~~d~~~~~lg~~l~t~~~~ppP~-g--~~~~~~nGrl~-~a~ 257 (396) ||+-.+ |+...-.+.+ .-.+..+.+..+..+ ...++....|..+ | .-+++..|+.. .-. T Consensus 79 LY~v~~~iGsiag~grVsMa~n~~~~av~~~-g~~~~Y~yd~~v~t~-~~~~~d~~~p~~dlg~~~dv~f~dGyfV~~~~ 156 (472) T protein:vir:10 79 LYKGESEVGDVAGSGRVSMAHGRTSQAVGVN-GQLVEYRYDGTVKTV-SNWPTDSGFTQYELGSVRDITRLRGRYAWSKD 156 (472) T ss_pred EeeeecceecccCcccEEEecCCcEEEEEEC-CceeEEEeeccchhh-hccccccccccccccceeeeeeecceEEEecc Confidence 444111 1111111111 000111222222211 1112222223333 2 13567789944 445 Q ss_pred CCEEEEccCCCCcccccccccE----ecCcceEEEEEcCCcEEEEEcCc--EEEEEc-CchhheeeeeeccCCCccccee Q lcl|Aclame:pro 258 ANVLRFSEALAYHLHDERYGFV----QMPQRITFVQPVDGGIWVGQVDH--VAFLDG-ADPASLSVSRRASRAPVPGSAV 330 (396) Q Consensus 258 Gn~l~fSEp~~p~aw~~~y~~~----~~~~~I~~i~~v~~gl~V~T~~~--~y~l~G-~~p~~m~~~~~~~~~p~~~s~~ 330 (396) |..-||.-...+-.-|..|..+ --|+.|++|...-+-||+.=+.. +|..+| +++...-+++. ++. T Consensus 157 Gt~~~~is~l~d~~~~~~y~~fa~AE~~pD~Ivgi~~~~~~i~lfG~~TiEvw~ntG~a~~~~fpy~r~--------~g~ 228 (472) T protein:vir:10 157 GTDSWFITDLEDESHPDRYSAQYRAESQPDGIIGIGTWRDFIVCFGSSTIEYFSLTGATTAGAALYVAQ--------PSL 228 (472) T ss_pred CcceEEEeccCCccccccccccccccCCCCceEEEEeeccEEEEEeccceEEEEecCCCCcccCceeec--------ccc Confidence 7766666555554445555311 13889999999999999875544 599999 44444444442 345 Q ss_pred ecchhhhccccccCcccEEEEecCC----CEEEEcCCCcEEEEecce----eecccc----cc-ceEEEeC-cE-EEEEe Q lcl|Aclame:pro 331 LVPAEVVGTNASPDGSPVAVWLAEN----GYVMGTSSGAIAEVHAGV----LAGITG----RA-GTSVVFD-RR-LLTAV 395 (396) Q Consensus 331 ~~~~~~~~~~~~~~~~~~~lw~s~~----Glv~g~~~G~~~~lt~~~----~~~~~a----~~-~~~~~~~-rr-~v~~~ 395 (396) .++-+|++..+..-.++++.|+|.+ |.|--..++++..++--. ++-.++ ++ +=+..++ +. |+-++ T Consensus 229 ~iq~Gcaa~~sv~~~~~t~~~l~~d~~g~~~V~~~~g~~~~rIST~aIE~~i~~y~~~e~~dA~~~t~~~~GH~fy~Ltf 308 (472) T protein:vir:10 229 MVQKGIAGTYCKTPFADSYAFISNPATGAPSVYIIGSGQVSPIASASIEKILRSYTADELADGVMESLRFDAHELLIIHL 308 (472) T ss_pred eeeecccCcchhhecCceEEEEecCCccccEEEEccCceeEEecCHHHHHHHHhcCCccccceeEEEEEeCCeEEEEEEc Confidence 6677899999999999999999998 888888888888774332 222221 11 1122222 32 22222 Q ss_pred C Q lcl|Aclame:pro 396 S 396 (396) Q Consensus 396 ~ 396 (396) - T Consensus 309 P 309 (472) T protein:vir:10 309 P 309 (472) T ss_pred C Confidence 2 No 46 >protein:vir:9268 Length: 472 # NCBI annotation: 10 # Family: family:all:1540 # MgeID: mge:164 # MgeName: ST64T # Cross-refs: genbank:acc:NP_720332;genbank:gi:24371590;genbank:GeneID:955815 Probab=89.98 E-value=0.023 Score=29.53 Aligned_cols=281 Identities=14% Similarity=0.102 Sum_probs=131.0 Q ss_pred CCcccccceeccCCcCChhheeeCCCchhhheeeeeeeee------CCCccEEECCcceeecCC--ccccccCCcccccE Q lcl|Aclame:pro 1 MATTSLVPLAGINNVAEDAALQRGGESPRLYVRDAVNIDL------SPAGKAQLRASVRQVTDQ--PFRQLWQSPLHGDA 72 (396) Q Consensus 1 m~~~~~~p~~G~nn~~~~~~L~~~~~~~~~~lrdAvNVD~------~~~G~l~~R~G~~~~~~~--~~~~lw~s~~~~~~ 72 (396) |+-.+|+=-.||---..+.... -.-.||... ..++-|+.-+|.++..+. +.+-+.=.--.+.+ T Consensus 1 m~~~~ipl~~g~~~~~~~a~~~---------~~~pvn~y~~~~~~~~ss~~Lr~~pG~~~~a~~~G~~RG~~~~~~~~~l 71 (472) T protein:vir:92 1 MPIQQLPMMKGMGKDFKNADYI---------DYLPINMLATPKEVLDSSGYLRSFPGIAKRNDVNGVSRGVEYNTAQNAV 71 (472) T ss_pred CceeeccccccccccCccCcce---------eeeecccccccccccccccceeecccceeecCCCCcccceeeeeeCCeE Confidence 9998884448886532222110 000223222 245668888888886554 22222100012346 Q ss_pred EEEECCeEEEEecCCCceeecccc-cCcceehhhcCCeEEEEcCCcceeecCceeeeccccCCccceeecCCCCcccceE Q lcl|Aclame:pro 73 FGALGDQWGKVDPHSWTFEPLAQI-GEGDLSHEVLNNRVCVAGTAGIFTYDGAQAERLTLDTPAPPLLVAGAGSLSQGTY 151 (396) Q Consensus 73 ~~~~dg~L~~i~~~~w~~~vl~~i-g~gpV~~~v~n~rvy~t~~~~~~~~~g~~~~~l~ip~Pa~p~~~~~~Gsl~~g~y 151 (396) |.+.+..|+|++. ++ ..| |.|||.++-....+-+.+++.... |.+...+++... -...+..+..+ T Consensus 72 y~V~G~~Ly~v~~-----~i-G~i~gsgrVsMa~n~~~~av~~~~~~~~------Y~~~~~~~t~~~-~~~d~~f~~~d- 137 (472) T protein:vir:92 72 YRVCGGKLYKGEA-----VV-GDVAGSGRVSMAHGRTSQAVGVNGQLIE------YRYDGAVKTVSN-WPADSGFTQYE- 137 (472) T ss_pred EEEeCcceEEEEe-----eE-eeccCcccEEEecCCeEEEEEECCceeE------EEEecchhhhhc-ccCcccccccc- Confidence 7777888887754 12 222 566776643333233333322211 222222221100 00001110000 Q ss_pred EEEEEEEcCCCcccccccceeEecCCCccEEEeecCCCCCcceEEEE-EEecCCCeEEEEEeecceeEEEEcCCchhhcc Q lcl|Aclame:pro 152 GAAVAWLRGPQESAPSLIAFAEVTDAGALEVTFPLCLDASVTGARLY-LTRANGGELLLAGDYPLGAATVILPTLPELGR 230 (396) Q Consensus 152 ~ya~T~V~~~gEeg~~~~~S~~vt~~~~~~v~lp~~~~~~i~~~RIY-rs~~~g~~~~lv~e~~~~~~~~~d~~~~~lg~ 230 (396) + ....+++..+=| +=.+.|+..+++-.+. |++ T Consensus 138 ------------------------------l----~~~~dv~f~dGyfV~~~~gt~~~~iS~l~-----------d~~-- 170 (472) T protein:vir:92 138 ------------------------------L----GSVRDITRLRGRYAWSKDGTDSWFITDLE-----------DES-- 170 (472) T ss_pred ------------------------------c----cceeEEEEecceEEEccCCCceEEEeccC-----------Ccc-- Confidence 0 000111111111 2123333323322211 110 Q ss_pred cccchhcCCcCCCceeeccCCEEEEEECCEEEEccCCCCcccccccccEecCcceEEEEEcCCcEEEEEcCc--EEEEEc Q lcl|Aclame:pro 231 PAQFRHLSPMPTGKHLAYWRGRLLIARANVLRFSEALAYHLHDERYGFVQMPQRITFVQPVDGGIWVGQVDH--VAFLDG 308 (396) Q Consensus 231 ~l~t~~~~ppP~g~~~~~~nGrl~~a~Gn~l~fSEp~~p~aw~~~y~~~~~~~~I~~i~~v~~gl~V~T~~~--~y~l~G 308 (396) . |+ .|.+++ .+- --|+.|++|...-+-||+.=+.. +|..+| T Consensus 171 --~-------~~-----------------------~y~~fa--~AE---~~pD~Ivgi~~~~~~l~lfG~~TiEvw~ntG 213 (472) T protein:vir:92 171 --H-------PD-----------------------RYSAEY--RAE---SQPDGIIGIGSWRDFIVCFGSSTIEYFSLTG 213 (472) T ss_pred --c-------cc-----------------------cccccc--ccc---CCCCceEEEEeeccEEEEEeccceEEEEecC Confidence 0 00 011111 121 24789999999999999875554 599999 Q ss_pred Cch-hheeeeeeccCCCcccceeecchhhhccccccCcccEEEEecCCC----EEEEcCCCcEEEEec----ceeeccc- Q lcl|Aclame:pro 309 ADP-ASLSVSRRASRAPVPGSAVLVPAEVVGTNASPDGSPVAVWLAENG----YVMGTSSGAIAEVHA----GVLAGIT- 378 (396) Q Consensus 309 ~~p-~~m~~~~~~~~~p~~~s~~~~~~~~~~~~~~~~~~~~~lw~s~~G----lv~g~~~G~~~~lt~----~~~~~~~- 378 (396) ... ...-+++. ++..++-+|++..+..-.++++.|+|.++ .|--..+++...++- +.++--+ T Consensus 214 ~ad~~~fpy~r~--------~g~~i~~Gcaa~~sv~~~~~s~~~l~~~~~g~~~V~~~~g~qa~rIST~aIE~~i~~y~~ 285 (472) T protein:vir:92 214 ATTVGAALYVAQ--------PSLMVQKGIAGTYCKTPFADSYAFISHPATGAPSVYIIGSGQASPIATASIEKIIRSYTA 285 (472) T ss_pred CCCcCcCceEEc--------CcceeeecccCcchhhecCceEEEEecCCCcccEEEEccCceeEEecCHHHHHHHHhcCc Confidence 873 44444432 23456778999999999999999999998 355567777776632 2232222 Q ss_pred cccceEEEe-----CcE-EEEEeC Q lcl|Aclame:pro 379 GRAGTSVVF-----DRR-LLTAVS 396 (396) Q Consensus 379 a~~~~~~~~-----~rr-~v~~~~ 396 (396) .+...+.-. ++. |+-++- T Consensus 286 ~e~~~a~~~s~~~eGH~fy~LtfP 309 (472) T protein:vir:92 286 DELATGVMEALRFDSHELLIIHLP 309 (472) T ss_pred chhceeeEEEEEecCeeEEEEEcC Confidence 122222222 232 332222 No 47 >protein:vir:100851 Length: 514 # NCBI annotation: hypothetical protein # Family: family:all:2450 # MgeID: mge:1633 # MgeName: LP65 # Cross-refs: genbank:acc:YP_164744;genbank:gi:56693157;genbank:GeneID:3197484 Probab=89.92 E-value=0.0039 Score=33.76 Aligned_cols=262 Identities=15% Similarity=0.170 Sum_probs=95.5 Q ss_pred CCcccccceeccCCcCChhh---eeeCCCchhhheeeeeeeeeCCCccEEECCcceeecCCccccccCCcc-cccEEEEE Q lcl|Aclame:pro 1 MATTSLVPLAGINNVAEDAA---LQRGGESPRLYVRDAVNIDLSPAGKAQLRASVRQVTDQPFRQLWQSPL-HGDAFGAL 76 (396) Q Consensus 1 m~~~~~~p~~G~nn~~~~~~---L~~~~~~~~~~lrdAvNVD~~~~G~l~~R~G~~~~~~~~~~~lw~s~~-~~~~~~~~ 76 (396) |-.+-..-|-|=++..++.. |+.||= . .+ |...+.|-.|.++-- -+-||++-. .+..|++- T Consensus 169 a~tiE~a~FyGDs~L~s~~~~~gleFDGl--~-~l-------I~~~NvIDarG~~Ls-----~~~ln~aA~~i~~gfGt~ 233 (514) T protein:vir:10 169 IKTDEWAMFYGDADLTSGQKGEGLQFDGL--F-KL-------IAPENHIDLRGGRLS-----PAALNMAARKIGEGFGTP 233 (514) T ss_pred HHHHHHHHhhhcccCCCccccCcchhhhH--H-Hh-------hcCCCeEecCCCCcc-----HHHHhhhhhhhhcccCCh Confidence 22233334434333333222 333221 0 01 112233333333111 122222211 01223222 Q ss_pred CCeEEE--------EecCCCceeecccccCcceehhhcCCeEEEEcCC----cceeecCceeeecccc----CCccc--- Q lcl|Aclame:pro 77 GDQWGK--------VDPHSWTFEPLAQIGEGDLSHEVLNNRVCVAGTA----GIFTYDGAQAERLTLD----TPAPP--- 137 (396) Q Consensus 77 dg~L~~--------i~~~~w~~~vl~~ig~gpV~~~v~n~rvy~t~~~----~~~~~~g~~~~~l~ip----~Pa~p--- 137 (396) -+.+.- -+...-.+..++..+.+ +-..+.=++..-..+. ...+.+--...+-..+ +|++| T Consensus 234 TD~ylp~~vka~f~~~~~~~qRV~~~~n~~~-~~~G~~v~~f~s~~G~I~L~gs~im~~~n~L~~~~~~~~~Ap~~~~va 312 (514) T protein:vir:10 234 TDAYMPIGIKADFVNQHLNGQRVMLPGQTGG-MTTGLDIDKFLSAHGSIRIQGSTIMDSDNKLDFDRPVSPTAPTAPQLS 312 (514) T ss_pred hheeCchHHHHHHhhcccCcceEEeecCccc-eeeeeeccceeEeccceeecCCeeecccccCccCCccCCcCCCCCcce Confidence 222210 00111112222111000 0000000000000000 0011111111111111 23222 Q ss_pred eee-cCCCC-----------------cccc-eEEEEEEEEcCCCccccccccee-EecCCCccEEEeec-CCCCCc-ceE Q lcl|Aclame:pro 138 LLV-AGAGS-----------------LSQG-TYGAAVAWLRGPQESAPSLIAFA-EVTDAGALEVTFPL-CLDASV-TGA 195 (396) Q Consensus 138 ~~~-~~~Gs-----------------l~~g-~y~ya~T~V~~~gEeg~~~~~S~-~vt~~~~~~v~lp~-~~~~~i-~~~ 195 (396) +.+ ...++ -+.| .|.|++..++..|||.|++++-. ....+.++.|++.+ .....+ +.+ T Consensus 313 ~svT~~~~g~~~~ad~t~~~g~~~~~~~~g~~~sYaVv~~n~~GeS~ps~~vtaT~a~~~~~i~ltItp~~~~~~~p~yv 392 (514) T protein:vir:10 313 ATVTPDGGGLWHEADKTDSKGEVILNKEVGVEQSYVAVMVSRHGDSRPSLVQTATPTKKDDAITLTITPNAMQNVIPDYV 392 (514) T ss_pred EEEecCcccccCcccccccccccccccccceeEEEEEEEECCCCcccccceeeeeeeccCceEEEEEEeccCcccccceE Confidence 222 22221 1222 57899999999999998887543 33445677777663 223333 788 Q ss_pred EEEEEec--------------CCCeEEEEEeecc-----eeEEEEcCC--chhhc---------ccccchhcCCc---CC Q lcl|Aclame:pro 196 RLYLTRA--------------NGGELLLAGDYPL-----GAATVILPT--LPELG---------RPAQFRHLSPM---PT 242 (396) Q Consensus 196 RIYrs~~--------------~g~~~~lv~e~~~-----~~~~~~d~~--~~~lg---------~~l~t~~~~pp---P~ 242 (396) -|||+.. +.+.||+++++++ ++.+|+|.. ++.-+ ++.+...|.|| |- T Consensus 393 ~IYR~~~~~s~~~~~~~~~~~~tGdf~li~rv~~~~~~~gttt~~D~n~~IPgT~~vfVgemspevi~l~ellPm~klpL 472 (514) T protein:vir:10 393 AIYRKSNFDSDALEANTDASGNRGSYYLIGKVAVREQEGATITFVDTNARIAGCGDVFVIENRPETVALQEFIPLSKLNL 472 (514) T ss_pred EEEeccCCCcchhhhhccccccccceeEEEEEeeecCCCCeEEEeccccccCCcceeEEeeCchHHHHHHHHhhhhhcCh Confidence 9999853 4568999988775 677787742 23221 12222344444 11 Q ss_pred C------ceeeccCCEEEEEECCEEEEccCCCCcccccccccEecCcceEEEEEc Q lcl|Aclame:pro 243 G------KHLAYWRGRLLIARANVLRFSEALAYHLHDERYGFVQMPQRITFVQPV 291 (396) Q Consensus 243 g------~~~~~~nGrl~~a~Gn~l~fSEp~~p~aw~~~y~~~~~~~~I~~i~~v 291 (396) . ..+-+|-|-|+-+ .|--|=.-+|. .+.+|--++-. T Consensus 473 A~~na~~~waVlwYGaLal~-----------aPkr~~~IkNv--~~~~v~~~~~~ 514 (514) T protein:vir:10 473 AVTTTATSFVVLNYVALALY-----------YPKRGAVLENV--VYSRVEDLELS 514 (514) T ss_pred hhhcchHHHHHHHHhHHHhh-----------ccccceEEEee--eeeeccccccC Confidence 1 1223344433322 22222111121 11222222222 No 48 >protein:vir:177 Length: 472 # NCBI annotation: DNA stabilization protein # Family: family:all:1540 # MgeID: mge:5 # MgeName: HK620 # Cross-refs: genbank:acc:NP_112082;genbank:gi:13559872;genbank:GeneID:920982 Probab=84.53 E-value=0.06 Score=27.29 Aligned_cols=252 Identities=15% Similarity=0.117 Sum_probs=117.1 Q ss_pred CCccce-eecCCCC-cccceEEEEEEE-EcCCCcccccccceeEecCCCccEEE--eecCC-C-------C---CcceEE Q lcl|Aclame:pro 133 TPAPPL-LVAGAGS-LSQGTYGAAVAW-LRGPQESAPSLIAFAEVTDAGALEVT--FPLCL-D-------A---SVTGAR 196 (396) Q Consensus 133 ~Pa~p~-~~~~~Gs-l~~g~y~ya~T~-V~~~gEeg~~~~~S~~vt~~~~~~v~--lp~~~-~-------~---~i~~~R 196 (396) .|--.+ +..+.+. ...+++. -.+ |+.+-++....-+|-.+...+|+..- +++.- + . -|.... T Consensus 1 m~~~~~Pl~~G~~~~~~~~d~~--~~~pVN~~a~~~~~~~s~~~l~~tPGl~~~a~v~G~~RG~~~~~~~g~lY~V~G~~ 78 (472) T protein:vir:17 1 MPIQQLPLMKGVGKDFRNADYI--DYLPVNMLATPKEILNSSGYLRSFPGIAKRSDVNGVSRGVEYNMAQNAVYRVCGGK 78 (472) T ss_pred CCeeeeeeccCceeeccccchh--heeeeeeeeeccCCCcccceeecCCCceeeccCCccccceEEEeeCCeEEEEecce Confidence 221111 1111111 1222222 111 33333333333334444443443322 11110 0 0 022234 Q ss_pred EEEEec---------------CCCeEEEEEeecceeEEEEcCCchhhcccccchhcCCcCC-C--ceeeccCCEEE-EEE Q lcl|Aclame:pro 197 LYLTRA---------------NGGELLLAGDYPLGAATVILPTLPELGRPAQFRHLSPMPT-G--KHLAYWRGRLL-IAR 257 (396) Q Consensus 197 IYrs~~---------------~g~~~~lv~e~~~~~~~~~d~~~~~lg~~l~t~~~~ppP~-g--~~~~~~nGrl~-~a~ 257 (396) ||+-.+ |+..+-.+.+ ......+.+..+..+ ...++....|..+ | .-+++..|+.. .-. T Consensus 79 LY~v~~~iGsiag~grVsMa~n~~~~av~~~-g~~~~Y~y~~~v~t~-~~~~~d~~~~~~dlg~~~dv~f~dGyfV~~~~ 156 (472) T protein:vir:17 79 LYKGESEVGDVAGSGRVSMAHGRTSQAVGVN-GQLVEYRYDGTVKTV-SNWPTDSGFTQYELGSVRDITRLRGRYAWSKD 156 (472) T ss_pred EeeeecceecccCcccEEEecCCcEEEEEEC-CceeEEEeeccchhh-hccccccccccccccceeeeeeecceEEEecc Confidence 444111 1111111111 001111222222211 1111222223333 2 13567789944 445 Q ss_pred CCEEEEccCCCCcccccccccE----ecCcceEEEEEcCCcEEEEEcCc--EEEEEcCchh-heeeeeeccCCCccccee Q lcl|Aclame:pro 258 ANVLRFSEALAYHLHDERYGFV----QMPQRITFVQPVDGGIWVGQVDH--VAFLDGADPA-SLSVSRRASRAPVPGSAV 330 (396) Q Consensus 258 Gn~l~fSEp~~p~aw~~~y~~~----~~~~~I~~i~~v~~gl~V~T~~~--~y~l~G~~p~-~m~~~~~~~~~p~~~s~~ 330 (396) |..-||.-...+-.-|..|..+ --|+.|++|...-+-||+.=+.- +|..+|..+- ..-+++ .++. T Consensus 157 Gt~~~~is~l~d~~~~~~y~~fa~AE~~pD~Ivgi~~~~~~i~lfG~~TiEvw~ntG~a~~~~fpy~r--------~~g~ 228 (472) T protein:vir:17 157 GTDSWFITDLEDESHPDRYSAQYRAESQPDGIIGIGTWRDFIVCFGSSTIEYFSLTGATTVGAALYVA--------QPSL 228 (472) T ss_pred CcceEEEeccCCccccccccccccccCCCCceEEEEeeccEEEEEeccceEEEEeeCCCCCCcCceee--------cCcc Confidence 7766666555554445555311 13889999999999999875544 5999999863 233333 2344 Q ss_pred ecchhhhccccccCcccEEEEecCC----CEEEEcCCCcEEEEeccee----ecccc----cc-ceEEEeC-cE-EEEEe Q lcl|Aclame:pro 331 LVPAEVVGTNASPDGSPVAVWLAEN----GYVMGTSSGAIAEVHAGVL----AGITG----RA-GTSVVFD-RR-LLTAV 395 (396) Q Consensus 331 ~~~~~~~~~~~~~~~~~~~lw~s~~----Glv~g~~~G~~~~lt~~~~----~~~~a----~~-~~~~~~~-rr-~v~~~ 395 (396) .++-+|++..+..-.++++.|++.+ |.|--..++++..++--.+ +-.++ ++ +=+..++ +. |+-++ T Consensus 229 ~iq~Gcaa~~sv~~~~~t~~~l~~d~~g~~~V~~~~g~~~~rIST~aIE~~i~~y~~~e~~dA~~~t~~~~GH~fy~Ltf 308 (472) T protein:vir:17 229 MVQKGIAGTYCKTPFADSYAFISNPATGAPSVYIIGSGQVSPISSASIEKILRSYTADELADGVMESLRFDAHELLIIHL 308 (472) T ss_pred eeeecccCcchhhecCceEEEEecCCccccEEEEccCceeEEecCHHHHHHHHhcCCccccceeEEEEEeCCeEEEEEEc Confidence 5677899999999999999999998 8888888888887743332 22221 11 1122222 32 22222 Q ss_pred C Q lcl|Aclame:pro 396 S 396 (396) Q Consensus 396 ~ 396 (396) - T Consensus 309 P 309 (472) T protein:vir:17 309 P 309 (472) T ss_pred C Confidence 2 No 49 >protein:vir:95475 Length: 771 # NCBI annotation: hypothetical protein ORF038 # Family: family:all:5234 # MgeID: mge:1570 # MgeName: PA11 # Cross-refs: genbank:acc:YP_001294631;genbank:gi:149408197;genbank:GeneID:5237042 Probab=83.47 E-value=0.068 Score=26.98 Aligned_cols=374 Identities=14% Similarity=0.095 Sum_probs=182.7 Q ss_pred CCc------ccccceeccCCcCChhheeeCCCchhhheeeeeeeeeCCCccEEECCcceeec------------------ Q lcl|Aclame:pro 1 MAT------TSLVPLAGINNVAEDAALQRGGESPRLYVRDAVNIDLSPAGKAQLRASVRQVT------------------ 56 (396) Q Consensus 1 m~~------~~~~p~~G~nn~~~~~~L~~~~~~~~~~lrdAvNVD~~~~G~l~~R~G~~~~~------------------ 56 (396) |+. ++-+ +.|+-.-..+.. .|+ -..-|-+|+|+..+|.=+||.|..... T Consensus 1 m~~~~~~~~vNtF-v~GliTEas~lt---fpq---nasiDe~N~~l~rdG~r~RR~g~~~E~~~~~vls~~~vpa~g~~~ 73 (771) T protein:vir:95 1 MAKTTNAAEFNTF-VGGLITEASPLT---FPQ---NASIDEVNFILNRDGSRNRRNGMDFENGATKVVCNTLVPADGTIA 73 (771) T ss_pred CCcccchhHHhhh-hhheeecccccc---CCc---cceeeeeeeeecCCCcchhhceeeeecCCceEEEEEEecccceEE Confidence 543 2222 355544433322 233 345678999999999999999987654 Q ss_pred --CCccccccCCcccccEEEEECCeEEEEecCCCcee-----ec-ccccCc---ceehhhcCCeEEEEcCCc-cee--ec Q lcl|Aclame:pro 57 --DQPFRQLWQSPLHGDAFGALGDQWGKVDPHSWTFE-----PL-AQIGEG---DLSHEVLNNRVCVAGTAG-IFT--YD 122 (396) Q Consensus 57 --~~~~~~lw~s~~~~~~~~~~dg~L~~i~~~~w~~~-----vl-~~ig~g---pV~~~v~n~rvy~t~~~~-~~~--~~ 122 (396) -..|.|.|.... ..++.++-|..+++--++.... .+ +.+-.+ .+.+.+.|+.+.+++... ++. || T Consensus 74 v~~~~W~na~G~v~-~~~livqvg~~l~f~q~t~~pLs~~n~~~~a~~nlSPsh~isv~v~~G~livanp~i~~~~~~~d 152 (771) T protein:vir:95 74 VTSHNWENAGGEVG-RWISLVQVGTELKFFQTTGETLSEGNFYNYQFVNMSPSHKLSYAVVDGLLVVANGSRDIYVFEYD 152 (771) T ss_pred eeeechhhcccccC-cEEEEEEeccEEEEEecCCCcccccceeeeecceeccceeEEEEEeeeEEEEecCCccEEEEEec Confidence 123455555432 2334444444444433332111 11 222222 467888889999998754 332 22 Q ss_pred C-ce---eeeccccCCc--cceee----------cCCCCc--ccceE-EEEEEEEcCCCcccccccceeEec-------- Q lcl|Aclame:pro 123 G-AQ---AERLTLDTPA--PPLLV----------AGAGSL--SQGTY-GAAVAWLRGPQESAPSLIAFAEVT-------- 175 (396) Q Consensus 123 g-~~---~~~l~ip~Pa--~p~~~----------~~~Gsl--~~g~y-~ya~T~V~~~gEeg~~~~~S~~vt-------- 175 (396) . .. ..++-+-.=- ++.+. ...|+. ++.-| .|-.+|+-..+..+.-.+.+.+|+ T Consensus 153 ~~t~s~t~~~ll~r~rf~~q~~~~G~d~~~~~~~~~~gt~~tn~~iynlyN~gw~~pk~~~~snt~~~~iV~~y~a~~g~ 232 (771) T protein:vir:95 153 SGSVSVTTKRLLVRDLFGVQDIVNGVDLRQGNDIATRPTVQTNAHIYNLRNQTFGVPRVTWHSNEPSDPIVTFRSAASGK 232 (771) T ss_pred CCcceeEeeeeeeeehhhccccccccceecccccccCCcccCchhheeccccceeccccccccCCccccceEeeeccCCC Confidence 2 11 1111111100 11000 011221 11112 255666644333333333333332 Q ss_pred C-CCccEEEeecCC--CCCcceEEEEEEec---C--C-----CeEEEEEeecceeEEEEcCCchhhcccccchhcC---- Q lcl|Aclame:pro 176 D-AGALEVTFPLCL--DASVTGARLYLTRA---N--G-----GELLLAGDYPLGAATVILPTLPELGRPAQFRHLS---- 238 (396) Q Consensus 176 ~-~~~~~v~lp~~~--~~~i~~~RIYrs~~---~--g-----~~~~lv~e~~~~~~~~~d~~~~~lg~~l~t~~~~---- 238 (396) . ..+..+++.... .--|.....+|-.. + | -++|+..-+.-+.+..+-. -++-.+.++..+. T Consensus 233 ~pS~sd~~N~a~~k~~~~Ei~t~~~f~~~~~~~~~~Gt~~~~~G~yi~da~~~g~~~Lt~~--ve~~gr~~s~~~~~~~l 310 (771) T protein:vir:95 233 FPSNSDSVNLALSKRADVEPSTTDRFRAEDIVLNPIGTYETARGFFIIDAMARGKSRLEEI--VKLKQRYPSLSFGVSSL 310 (771) T ss_pred CcCCceeeccccchhhccceeeecccchhhhhhcccCcccccCcceeeehhhhccccccee--eeccccchhhhcccccc Confidence 1 122233332211 11122222222100 0 1 1234443332222221110 0111111111111 Q ss_pred ---CcCCC-ceeeccCCEEEEEE---------CCE------EEEcc-CC------------------CCcccccccccEe Q lcl|Aclame:pro 239 ---PMPTG-KHLAYWRGRLLIAR---------ANV------LRFSE-AL------------------AYHLHDERYGFVQ 280 (396) Q Consensus 239 ---ppP~g-~~~~~~nGrl~~a~---------Gn~------l~fSE-p~------------------~p~aw~~~y~~~~ 280 (396) --|-| ++++...||+|-+- +|- ++||. -- .|.+-+..=+|+. T Consensus 311 ~~~~t~~~~~~vaeyagRvwYag~~~~~iD~dkng~~~~~~ilfSqLv~s~~di~nCyQd~DPTsee~~dLidTDGg~ir 390 (771) T protein:vir:95 311 PQDETPGGASVVCEYAGRVWYAGFSGQIIDGDDQSPRLVSYILFSQLVDSPADIVNCYQDGDPTSTEEPELVDTDGGFIR 390 (771) T ss_pred ccccCCCCceeEEeeeeeEEEecceeEEeeccccCCceeeeEeeehhhcchhhcccccccCCCchhhhhhhhhcCCCEEE Confidence 12223 67898999988553 233 88886 11 2334444433333 Q ss_pred c--CcceEEEEEcCCcEEEEEcCcEEEE-----EcCchhheeeeeeccCCCcccceeecchhhhccccccCcccEEEEec Q lcl|Aclame:pro 281 M--PQRITFVQPVDGGIWVGQVDHVAFL-----DGADPASLSVSRRASRAPVPGSAVLVPAEVVGTNASPDGSPVAVWLA 353 (396) Q Consensus 281 ~--~~~I~~i~~v~~gl~V~T~~~~y~l-----~G~~p~~m~~~~~~~~~p~~~s~~~~~~~~~~~~~~~~~~~~~lw~s 353 (396) . -++|+.|.-++..|.|.-+..+|.+ .|..--+....|++.. +|-+-.|.|..+++.+|-| T Consensus 391 i~gah~ii~Lv~f~~sLlvfc~NGVWAi~ggsd~g~tAtdY~ltKIs~v------------g~sspnSvVvvg~~i~yws 458 (771) T protein:vir:95 391 IEGAHDIINLVNVGSAVMVVAANGIWMIQGGSDYGFTATNYLVTKISEH------------GCSSPNSVVVVDNSFMYWG 458 (771) T ss_pred ecCCCCceeEEEecceEEEEEecceEEEEeccCCceeeeeeEEEEeeee------------ccCCCccEEEecceEEEee Confidence 3 5789999999999999999999999 3333344667777542 2777889999999999999 Q ss_pred CCCEEEEcCCC----cEEEEecceeec------cccccceEEEe---CcEEEEEeC Q lcl|Aclame:pro 354 ENGYVMGTSSG----AIAEVHAGVLAG------ITGRAGTSVVF---DRRLLTAVS 396 (396) Q Consensus 354 ~~Glv~g~~~G----~~~~lt~~~~~~------~~a~~~~~~~~---~rr~v~~~~ 396 (396) .+|+.....+- ...+||++.+.- .......+..+ +.|+--+.. T Consensus 459 dtgIyal~~Ndfn~~tAqnLTekTIq~~~~~I~~dk~knVtg~fd~~e~rvyw~yP 514 (771) T protein:vir:95 459 DDGIYHLTRNQYGDYVANNLTEKTIQKYYEKIPSDAILNATGFYDSYDKKVKWLYN 514 (771) T ss_pred CCceEEEeecccCcchhhccchHHHHHHHhhcchhhhcceEEEEEccCCEEEEEec Confidence 99998765542 234566555521 23345556666 455444444 No 50 >protein:vir:100960 Length: 472 # NCBI annotation: gp10 # Family: family:all:1540 # MgeID: mge:1509 # MgeName: ST104 # Cross-refs: genbank:acc:YP_006412;genbank:gi:46358704;genbank:GeneID:2777110 Probab=83.32 E-value=0.069 Score=26.94 Aligned_cols=281 Identities=14% Similarity=0.111 Sum_probs=129.2 Q ss_pred CCcccccceeccCCcCChhheeeCCCchhhheeeeeeeee------CCCccEEECCcceeecCC--ccccccCCcccccE Q lcl|Aclame:pro 1 MATTSLVPLAGINNVAEDAALQRGGESPRLYVRDAVNIDL------SPAGKAQLRASVRQVTDQ--PFRQLWQSPLHGDA 72 (396) Q Consensus 1 m~~~~~~p~~G~nn~~~~~~L~~~~~~~~~~lrdAvNVD~------~~~G~l~~R~G~~~~~~~--~~~~lw~s~~~~~~ 72 (396) |+-.+|+=-.||---..+.... -.-.||... ..++-|+.-+|.++..+. +.+-+.=.--.+.+ T Consensus 1 m~~~~ipl~~g~~~~~~~a~~~---------~~~pvn~y~~~~~~~~ss~~Lr~~pG~~~~a~~~G~~RG~~~~~~~~~l 71 (472) T protein:vir:10 1 MPIQQLPMMKGMGKDFKNADYI---------DYLPINMLATPKEVLNSSGYLRSFPGIAKRNDVNGVSRGVEYNTAQNAV 71 (472) T ss_pred CceeecccccccccCCCcCcce---------eeeeeccccccccccccccceeecccceeecCCCCcccceeeeeeCCeE Confidence 9998884448886533332220 000233222 245667888888886554 22222100012346 Q ss_pred EEEECCeEEEEecCCCceeecccc-cCcceehhhcCCeEEEEcCCcceeecCceeeeccccCCccceeecCCCCcccceE Q lcl|Aclame:pro 73 FGALGDQWGKVDPHSWTFEPLAQI-GEGDLSHEVLNNRVCVAGTAGIFTYDGAQAERLTLDTPAPPLLVAGAGSLSQGTY 151 (396) Q Consensus 73 ~~~~dg~L~~i~~~~w~~~vl~~i-g~gpV~~~v~n~rvy~t~~~~~~~~~g~~~~~l~ip~Pa~p~~~~~~Gsl~~g~y 151 (396) |.+.+..|+|++. ++ ..| |.|||.++-....+-+..++.... |.+...+++... -...+..+..+ T Consensus 72 y~V~G~~Ly~v~~-----~i-G~i~gsgrVsMa~n~~~~~v~~~~~~~~------Y~~~~~~~t~~~-~~~d~~f~~~d- 137 (472) T protein:vir:10 72 YRVCGGKLYKGEA-----VV-GDVAGSGRVSMAHGRTSQAVGVNGQLIE------YRYDGAVKTVSN-WPADSGFTQYE- 137 (472) T ss_pred EEEeCcceEEEEe-----eE-eeccCcccEEEeeCCeEEEEEECCceeE------EEEecchhhhhc-ccCcccccccc- Confidence 7777778887653 12 222 556666542222222222222211 122222221100 00001000000 Q ss_pred EEEEEEEcCCCcccccccceeEecCCCccEEEeecCCCCCcceEEEE-EEecCCCeEEEEEeecceeEEEEcCCchhhcc Q lcl|Aclame:pro 152 GAAVAWLRGPQESAPSLIAFAEVTDAGALEVTFPLCLDASVTGARLY-LTRANGGELLLAGDYPLGAATVILPTLPELGR 230 (396) Q Consensus 152 ~ya~T~V~~~gEeg~~~~~S~~vt~~~~~~v~lp~~~~~~i~~~RIY-rs~~~g~~~~lv~e~~~~~~~~~d~~~~~lg~ 230 (396) + ....+++..+=| +=.+.|+..+++-.+. |++ T Consensus 138 ------------------------------l----~~~~dv~f~dGyfV~~~~gt~~~~iS~l~-----------d~~-- 170 (472) T protein:vir:10 138 ------------------------------L----GSVRDITRLRGRYAWSKDGTDSWFITDLE-----------DES-- 170 (472) T ss_pred ------------------------------c----cceeEEEEecceEEEccCCCceEEEeccC-----------Ccc-- Confidence 0 000111111111 2123333323322211 110 Q ss_pred cccchhcCCcCCCceeeccCCEEEEEECCEEEEccCCCCcccccccccEecCcceEEEEEcCCcEEEEEcCc--EEEEEc Q lcl|Aclame:pro 231 PAQFRHLSPMPTGKHLAYWRGRLLIARANVLRFSEALAYHLHDERYGFVQMPQRITFVQPVDGGIWVGQVDH--VAFLDG 308 (396) Q Consensus 231 ~l~t~~~~ppP~g~~~~~~nGrl~~a~Gn~l~fSEp~~p~aw~~~y~~~~~~~~I~~i~~v~~gl~V~T~~~--~y~l~G 308 (396) . |+ .|.+++ .+- --|+.|++|...-+-||+.=+.. +|..+| T Consensus 171 --~-------~~-----------------------~y~~fa--~AE---~~pD~Ivgi~~~~~~l~lfG~~TiEvw~ntG 213 (472) T protein:vir:10 171 --H-------PD-----------------------RYSAEY--RAE---SQPDGIIGIGSWRDFIVCFGSSTIEYFSLTG 213 (472) T ss_pred --c-------cc-----------------------cccccc--ccc---CCCCceEEEEeeccEEEEEeccceEEEEecC Confidence 0 00 011111 111 24789999999999999875544 599999 Q ss_pred Cch-hheeeeeeccCCCcccceeecchhhhccccccCcccEEEEecCCC----EEEEcCCCcEEEEecc----eeeccc- Q lcl|Aclame:pro 309 ADP-ASLSVSRRASRAPVPGSAVLVPAEVVGTNASPDGSPVAVWLAENG----YVMGTSSGAIAEVHAG----VLAGIT- 378 (396) Q Consensus 309 ~~p-~~m~~~~~~~~~p~~~s~~~~~~~~~~~~~~~~~~~~~lw~s~~G----lv~g~~~G~~~~lt~~----~~~~~~- 378 (396) ... ...-+++. ++..++-+|++..+..-.++++.|+|.++ .|--..+++...++-- .++--. T Consensus 214 ~ad~~~fpy~r~--------~g~~i~~Gcaa~~sv~~~~~s~~~l~~~~~g~~~V~~~~g~qa~rIST~aIE~~i~~y~~ 285 (472) T protein:vir:10 214 ATTVGAALYVAQ--------PSLMVQKGIAGTYCKTPFADSYAFISHPATGAPSVYIIGSGQASPIATASIEKIIRSYTA 285 (472) T ss_pred CCCcCcCceEEc--------CcceeeecccCcchhhecCceEEEEecCCCcccEEEEccCceeEEecCHHHHHHHHhcCC Confidence 873 44444432 23456778999999999999999999998 3555677777766322 222211 Q ss_pred -cccce---EEEeC-cE-EEEEeC Q lcl|Aclame:pro 379 -GRAGT---SVVFD-RR-LLTAVS 396 (396) Q Consensus 379 -a~~~~---~~~~~-rr-~v~~~~ 396 (396) ..+-| +..++ +. |+-++- T Consensus 286 ~e~~~A~~~t~~~~GH~fy~LtfP 309 (472) T protein:vir:10 286 EELATGVMETLRFDSHELLIIHLP 309 (472) T ss_pred ccccceEEEEEEeCCeEEEEEEcC Confidence 11111 22222 32 222222 No 51 >protein:vir:3529 Length: 477 # NCBI annotation: P28 # Family: family:all:1540 # MgeID: mge:72 # MgeName: APSE-1 # Cross-refs: genbank:acc:NP_050989;genbank:gi:9633575;genbank:GeneID:1262322 Probab=81.19 E-value=0.088 Score=26.37 Aligned_cols=280 Identities=15% Similarity=0.132 Sum_probs=137.4 Q ss_pred CCcccccceeccCCcCChhheeeCCCchhhheeeeeeeee------CCCccEEECCcceeecCC--ccccccCCcccccE Q lcl|Aclame:pro 1 MATTSLVPLAGINNVAEDAALQRGGESPRLYVRDAVNIDL------SPAGKAQLRASVRQVTDQ--PFRQLWQSPLHGDA 72 (396) Q Consensus 1 m~~~~~~p~~G~nn~~~~~~L~~~~~~~~~~lrdAvNVD~------~~~G~l~~R~G~~~~~~~--~~~~lw~s~~~~~~ 72 (396) |+-.+|+=..||-.-..+.... -+=.||... ..+|.|..-+|.++..+. +.+-+.-.-..+.+ T Consensus 7 m~~~~ipl~~g~~~~~~~~d~~---------~~~PVN~~a~p~~~~~s~~~L~~~pG~~~~~~~~G~~RG~~~~~~~g~l 77 (477) T protein:vir:35 7 MPKIQIPLAKGLVKDIKTADYI---------DALPVNMLATPKEVLNASGYLRSFPGIEKKQDAKGVSRGVHFNTKNNAL 77 (477) T ss_pred eeeeccccccccccccccccce---------eeeeeccceeeccccccccccccCCcceeeccCCccccceeEeecCCeE Confidence 8888885448886533332110 000244442 255778888898886544 34444211113678 Q ss_pred EEEECCeEEEEecCCCceeecccccCcceehhhcCCeEEEEcCCcc--eeecCceeeeccccCCccceeecCCCCcccce Q lcl|Aclame:pro 73 FGALGDQWGKVDPHSWTFEPLAQIGEGDLSHEVLNNRVCVAGTAGI--FTYDGAQAERLTLDTPAPPLLVAGAGSLSQGT 150 (396) Q Consensus 73 ~~~~dg~L~~i~~~~w~~~vl~~ig~gpV~~~v~n~rvy~t~~~~~--~~~~g~~~~~l~ip~Pa~p~~~~~~Gsl~~g~ 150 (396) |.+.+..|+|+++ + +..--|.|||.++-..+.+-+.+++.. ..|++.... ...+ T Consensus 78 Y~V~G~~LY~v~~---~--vG~I~gsg~VsMa~n~~~~aIv~~g~~~gy~y~~t~~~---~~~~---------------- 133 (477) T protein:vir:35 78 YRVCGNTLYRNDK---E--VADIAGMSRVSMSHSSHSQAICFEGKVKLYRYDGTEKA---LSNW---------------- 133 (477) T ss_pred EEEecCeeEeeee---e--eeeecccccEEEeeCCcEEEEEECCcceeEEEecccce---eeec---------------- Confidence 8999999999873 2 222226788887654445545544332 333331100 0000 Q ss_pred EEEEEEEEcCCCcccccccceeEecCCCccEEEeecCCCCCcceEEE-EEEecCCCeEEEEEeecceeEEEEcCCchhhc Q lcl|Aclame:pro 151 YGAAVAWLRGPQESAPSLIAFAEVTDAGALEVTFPLCLDASVTGARL-YLTRANGGELLLAGDYPLGAATVILPTLPELG 229 (396) Q Consensus 151 y~ya~T~V~~~gEeg~~~~~S~~vt~~~~~~v~lp~~~~~~i~~~RI-Yrs~~~g~~~~lv~e~~~~~~~~~d~~~~~lg 229 (396) .++.-|. ..+ ...-+++..+= |+=...|+..+++-+|.-. + . T Consensus 134 -----------~~~~~p~-----------~~l----~~~~~v~f~dGyfV~~~~gt~~~~iS~L~d~-------s----~ 176 (477) T protein:vir:35 134 -----------PKDKYPQ-----------YDL----GEVIDVCRNRGRYIWLQKGGERFGVTDLEDE-------S----K 176 (477) T ss_pred -----------CccccCC-----------ccc----cceeEEEeeCceEEEeecCCCeEEEeecCCc-------c----c Confidence 0000000 000 00001111121 1222233333333221111 0 0 Q ss_pred ccccchhcCCcCCCceeeccCCEEEEEECCEEEEccCCCCcccccccccEecCcceEEEEEcCCcEEEEEcCc--EEEEE Q lcl|Aclame:pro 230 RPAQFRHLSPMPTGKHLAYWRGRLLIARANVLRFSEALAYHLHDERYGFVQMPQRITFVQPVDGGIWVGQVDH--VAFLD 307 (396) Q Consensus 230 ~~l~t~~~~ppP~g~~~~~~nGrl~~a~Gn~l~fSEp~~p~aw~~~y~~~~~~~~I~~i~~v~~gl~V~T~~~--~y~l~ 307 (396) .-+|.+++--| --|+.|++|...-+-||+.=+.. +|..+ T Consensus 177 ----------------------------------~d~~~~FasAE-----~~pD~Ivgi~~~~~~i~lfG~~TiEvw~nt 217 (477) T protein:vir:35 177 ----------------------------------PDRYQPFYRAE-----SQPDGIVSVDAWRDLIVCFGSSSIEYFTLT 217 (477) T ss_pred ----------------------------------ccccccccccc-----CCCCceEEEEeeccEEEEEeccceEEEEec Confidence 11111122112 23677888888888887764443 58888 Q ss_pred cCchhheeeeeeccCCCcccceeecchhhhccccccCcccEEEEecCC----CEEEEcCCCcEEEEeccee----eccc- Q lcl|Aclame:pro 308 GADPASLSVSRRASRAPVPGSAVLVPAEVVGTNASPDGSPVAVWLAEN----GYVMGTSSGAIAEVHAGVL----AGIT- 378 (396) Q Consensus 308 G~~p~~m~~~~~~~~~p~~~s~~~~~~~~~~~~~~~~~~~~~lw~s~~----Glv~g~~~G~~~~lt~~~~----~~~~- 378 (396) |..+-+.-+.+. +| +.-++-+|++..+..-.++++.|++.+ +.|--..++++..++--.+ +-.+ T Consensus 218 G~a~f~~p~~r~---~~----~~mIq~Gcaa~~sv~~~~~t~~~l~~d~~g~~~V~~~~g~q~~rIST~aIE~~i~ay~~ 290 (477) T protein:vir:35 218 GSADTSQPLYIH---QA----AYMIQAGIAGRDCKCRYQDKYAILSHQSTGQPAVYLIGAGEKNKISTATIDKIIRYYSA 290 (477) T ss_pred CCCCCCcceeec---CC----ceeeeecccCchhhhhhCceEEEEecCCCcccEEEEccCceeEEecCHHHHHHHHhcCC Confidence 888654333433 11 122466799999999999999999997 6677788888877743332 2211 Q ss_pred cccce----EEEeC-cE-EEEEeC Q lcl|Aclame:pro 379 GRAGT----SVVFD-RR-LLTAVS 396 (396) Q Consensus 379 a~~~~----~~~~~-rr-~v~~~~ 396 (396) .+.+. +..++ +. |+-++- T Consensus 291 ~e~a~af~~t~~~eGH~fy~LtfP 314 (477) T protein:vir:35 291 DELAASFMESIRFDNHELLLLHLP 314 (477) T ss_pred cchhceeEEEEEeCCeeEEEEEcC Confidence 11122 22233 32 222222 No 52 >protein:vir:102823 Length: 470 # NCBI annotation: major structural protein # Family: family:all:2450 # MgeID: mge:1610 # MgeName: YS40 # Cross-refs: genbank:acc:YP_874086;genbank:gi:118197693;genbank:GeneID:4496015 Probab=79.76 E-value=0.018 Score=30.20 Aligned_cols=264 Identities=11% Similarity=0.034 Sum_probs=88.4 Q ss_pred CCccccccee--cc-CCcCChhheee----------------CCCc------hh----hh---eeeeeeeeeCCCccEEE Q lcl|Aclame:pro 1 MATTSLVPLA--GI-NNVAEDAALQR----------------GGES------PR----LY---VRDAVNIDLSPAGKAQL 48 (396) Q Consensus 1 m~~~~~~p~~--G~-nn~~~~~~L~~----------------~~~~------~~----~~---lrdAvNVD~~~~G~l~~ 48 (396) |+++.+.=+. .. ||+.++...+. -||+ ++ ++ ++.- +|....+.+.= T Consensus 103 ~~~~~VT~~a~~~~~n~v~d~~~~~~~dai~~ia~tiE~a~FyGDs~l~s~~~g~~~gleFDGl~~l--Id~~~~~NViD 180 (470) T protein:vir:10 103 GHRITVTELATRTTQNGVMQIDELVKREKMIAVANEFEYLAFYGDNLLGDDVPGSPNNLQQDGIINI--IKRGAPQNVLD 180 (470) T ss_pred eecchhhhhhhhhhhccccchHHHHHHHHHHHHHHHHHhhhhhhccccccccCcccCceeccchhhh--ccCCCCccccc Confidence 3333333221 11 22322222210 0111 00 00 0000 11111111111 Q ss_pred CCcceeecCCccccccCCcc---cccEEEEECCeEE------------------EEecCCCceeecccc-----cCccee Q lcl|Aclame:pro 49 RASVRQVTDQPFRQLWQSPL---HGDAFGALGDQWG------------------KVDPHSWTFEPLAQI-----GEGDLS 102 (396) Q Consensus 49 R~G~~~~~~~~~~~lw~s~~---~~~~~~~~dg~L~------------------~i~~~~w~~~vl~~i-----g~gpV~ 102 (396) -+|-.+ ..+-||+... ...-|++--+.+. -+...+........+ ..|.++ T Consensus 181 arG~~L----s~~~L~~aa~~I~~~~~fGt~TD~~lp~~vka~f~~~~~~~qRv~~~~N~~~~~~G~~v~~f~sa~G~I~ 256 (470) T protein:vir:10 181 AGGRPL----SIDLLWEAESRVVSTQAFANPTAVFISYVDKLNLQASFYQISRVMTTADRRAGLLGADAQSYIGVRGEHS 256 (470) T ss_pred cCCCCc----cHHHHHHHHhhhcccccccChhhhccchhHHHHHHHhhcCceEEEEecCCCceeeeeeccceeeeeeeee Confidence 112222 1123444321 0122322212111 111111111111111 023333 Q ss_pred hhhcCCeEEEEcC--CcceeecCceeeeccccCCccceee------------cCCCC-cccceEEEEEEEEcCCCccccc Q lcl|Aclame:pro 103 HEVLNNRVCVAGT--AGIFTYDGAQAERLTLDTPAPPLLV------------AGAGS-LSQGTYGAAVAWLRGPQESAPS 167 (396) Q Consensus 103 ~~v~n~rvy~t~~--~~~~~~~g~~~~~l~ip~Pa~p~~~------------~~~Gs-l~~g~y~ya~T~V~~~gEeg~~ 167 (396) . |.-+++=+. ..+...+.. .. .+++|....++ +..|+ .+.+-+.|+|..++..||+ +| T Consensus 257 L---~~s~~m~~~~k~~p~~l~~~-v~--~~aAP~~~~tv~~t~~~~a~~~~sk~g~~~~~~v~sy~y~v~~~~gds-~s 329 (470) T protein:vir:10 257 L---YPSQFLGDFHKFNPARFGAE-VG--DFAAPSNSWTVSTTDNFVTLPYNSGLGDPANTTVYSYAFKAANFYGES-AA 329 (470) T ss_pred e---cccccccchhhcCcccCCcc-cC--CcccCceeEEeecCCCceeecccCCCCcccCcceeEEEEEEEEecCCC-Cc Confidence 2 111221100 011122211 11 12233211111 22232 4556678999999988888 44 Q ss_pred ccceeEec---CCCccEEEeecCCCCCcceEEEEEEecCCCeEEEEEeeccee-----EEEEcCC--chh---------- Q lcl|Aclame:pro 168 LIAFAEVT---DAGALEVTFPLCLDASVTGARLYLTRANGGELLLAGDYPLGA-----ATVILPT--LPE---------- 227 (396) Q Consensus 168 ~~~S~~vt---~~~~~~v~lp~~~~~~i~~~RIYrs~~~g~~~~lv~e~~~~~-----~~~~d~~--~~~---------- 227 (396) ..++++.+ .+.++.+++... .++..+-|||+.++++.||+++.+++.. ..+.|.. .+. T Consensus 330 ~~v~vt~t~~~v~kgv~ltI~~~--~~v~yv~IYRk~~~s~~~~li~rv~v~~~ng~~~~~~D~~e~i~tt~~v~~~~~~ 407 (470) T protein:vir:10 330 KYIDVYIDSTEAGKGVRFQFHGL--VNVKWLDVYRKDPGSQEYKFYKRVKVSTVNGDFTWIDDGHETVTTPSGVYRWKKI 407 (470) T ss_pred ceEEEEEeeehhcceeEEEEecC--CCCcEEEEEeecCCCCceeEEEEEeeeeccCCEEEEecccccCCCcceeeeeccc Confidence 44443322 456777777433 3588899999999999999999988544 3344420 000 Q ss_pred -----hcc----cccchhcCCcCCCce--eeccCCEEEEEECCEEEEccCCCCcccccccccEecCcceEEEEEc Q lcl|Aclame:pro 228 -----LGR----PAQFRHLSPMPTGKH--LAYWRGRLLIARANVLRFSEALAYHLHDERYGFVQMPQRITFVQPV 291 (396) Q Consensus 228 -----lg~----~l~t~~~~ppP~g~~--~~~~nGrl~~a~Gn~l~fSEp~~p~aw~~~y~~~~~~~~I~~i~~v 291 (396) .+. +..-..|.|||--++ ...|.=.+|+. -.| ++.-+++ |... ..|.-+..| T Consensus 408 Pgt~~Vgemsp~v~sl~~~l~m~l~klp~a~~~~~v~~~v--gal--------al~aPKr-~~~I-kNV~~~~~~ 470 (470) T protein:vir:10 408 PGTGVVVGIDPNVTTMAVWIGMELYRLPPALTHDYVIWKV--ASV--------FSRAPEF-NFLI-VNVGQEPIV 470 (470) T ss_pred CcceeccccCcchhhhhhhhhhhhhhcCHHHHHHHHHHHH--HHH--------HHhcccc-ceEE-EEeeeeecC Confidence 011 111123333332221 00000000100 000 1111111 1111 112222222 No 53 >protein:vir:100022 Length: 976 # NCBI annotation: T7-like tail tubular protein B # Family: family:all:825 # MgeID: mge:1604 # MgeName: P-SSP7 # Cross-refs: genbank:acc:YP_214208;genbank:gi:61806431;genbank:GeneID:3294702 Probab=79.64 E-value=0.1 Score=26.01 Aligned_cols=314 Identities=11% Similarity=0.094 Sum_probs=105.6 Q ss_pred CCcccc-c--------------ceeccCCcCChh----hee-eCCCchhhheeeeeeeeeCCCccEEECCcceeecCCcc Q lcl|Aclame:pro 1 MATTSL-V--------------PLAGINNVAEDA----ALQ-RGGESPRLYVRDAVNIDLSPAGKAQLRASVRQVTDQPF 60 (396) Q Consensus 1 m~~~~~-~--------------p~~G~nn~~~~~----~L~-~~~~~~~~~lrdAvNVD~~~~G~l~~R~G~~~~~~~~~ 60 (396) +..+.+ . -..+..+..+.. ... ...+...-.|+.++ T Consensus 325 ~~~V~v~g~~Y~it~~~~~~~~~~a~~~~~~~~~t~~d~~~~~~~~~ia~~L~~~l------------------------ 380 (976) T protein:vir:10 325 YFYVWMKDGYYKITVEAISTANVQANLGLIRPNPTPFDTETAVTAESIIGDIRTAI------------------------ 380 (976) T ss_pred eEEEEccccceeeEEEEeeceeEEeccccccCcCCcCcccccccHHHHHHHHHHhh------------------------ Confidence 111000 0 011111110000 000 00000111111111 Q ss_pred ccccCCcccccEEEEECCeEEEEecCCCceeecccccCcceehhhcCCeEEEEcCCcceeecCceeeeccccCCcccee- Q lcl|Aclame:pro 61 RQLWQSPLHGDAFGALGDQWGKVDPHSWTFEPLAQIGEGDLSHEVLNNRVCVAGTAGIFTYDGAQAERLTLDTPAPPLL- 139 (396) Q Consensus 61 ~~lw~s~~~~~~~~~~dg~L~~i~~~~w~~~vl~~ig~gpV~~~v~n~rvy~t~~~~~~~~~g~~~~~l~ip~Pa~p~~- 139 (396) ...|+. .+..+...+..|+-..+... -.+-. .+--.+ .++++ .-.....|..-.|..-.+ T Consensus 381 ~a~~~~--~g~tv~~~g~~~~i~~~~~~-~~~s~---~~~~~~------~~~~~-------~V~~~~~LP~~~~~g~~v~ 441 (976) T protein:vir:10 381 IATGNF--TSANVQQIGTGLYVTRPSGT-FNVTA---PSSDLL------RVMSG-------EVANVDDLPSQCKHGYVVK 441 (976) T ss_pred cccccc--cceEEEEcCcEEEEEecCcc-eEecC---CCceeE------EEEEe-------eecchhhhhhhccCCcEEE Confidence 122321 11122222233321111110 00000 000000 01111 000011121112211111 Q ss_pred ecCCCCcccceEEEEEEEEcCC--CcccccccceeEecCCCccEEEeecCCCCCcceEEEEEEecCCCeEEEEEeeccee Q lcl|Aclame:pro 140 VAGAGSLSQGTYGAAVAWLRGP--QESAPSLIAFAEVTDAGALEVTFPLCLDASVTGARLYLTRANGGELLLAGDYPLGA 217 (396) Q Consensus 140 ~~~~Gsl~~g~y~ya~T~V~~~--gEeg~~~~~S~~vt~~~~~~v~lp~~~~~~i~~~RIYrs~~~g~~~~lv~e~~~~~ 217 (396) +...++ ....| -+.|.... .+++. -.+.+..+..+.++ ...+ -..|.|.+ + +.|- . T Consensus 442 V~~~~~-~~d~y--yv~~~~~~~~~~~~~---w~E~~~~g~~~g~~-----~~tm-P~~l~~~~-~-g~f~-~------- 499 (976) T protein:vir:10 442 VANSEA-DADDY--YVKFFGHNNRDGDGV---WEECAKPSRNIEFD-----KGTM-PIQLVRQA-N-GTFT-V------- 499 (976) T ss_pred EecCCC-CceeE--EEEeeccccccccce---EEEeeccccccccc-----cccc-cEEEEecc-c-CeEE-e------- Confidence 212222 22233 33333221 11110 01111111111111 0000 12444432 1 1221 1 Q ss_pred EEEEcCCchhhcccccchhcCCcCC--C---ceeeccCCEEEEEECCEEEEccCCCCcccccc-c------ccEec---- Q lcl|Aclame:pro 218 ATVILPTLPELGRPAQFRHLSPMPT--G---KHLAYWRGRLLIARANVLRFSEALAYHLHDER-Y------GFVQM---- 281 (396) Q Consensus 218 ~~~~d~~~~~lg~~l~t~~~~ppP~--g---~~~~~~nGrl~~a~Gn~l~fSEp~~p~aw~~~-y------~~~~~---- 281 (396) ...++..+.+--.+-.|.|. | .-+.|+++||..+.++.||+|.+..++-|-.. . +-+.+ T Consensus 500 -----~~~~w~~r~vGd~~tnp~psf~g~~is~v~f~q~RL~f~s~~~v~~Srtgd~~nF~~~t~~~~~DdD~I~~~~ss 574 (976) T protein:vir:10 500 -----SQATWQNAEVGDELTNPNPSFVGKTINQLVFFRNRLVFLSDENVIMSRPGEFFNFWSKTATTFTPQDVIDLSCSS 574 (976) T ss_pred -----eeccccccccCCcccCcCceecccccceEEEEcceEEEecCCeEEEEecCCccccccccccCCCCCccEEEEecC Confidence 11111111111111233333 2 23789999999999999999999888777221 0 12333 Q ss_pred --CcceEEEEEcCCcEEEEEcCcEEEEEcC----chhheeeeeeccCCCcccceeecchhhhccccccCcccEEEEecCC Q lcl|Aclame:pro 282 --PQRITFVQPVDGGIWVGQVDHVAFLDGA----DPASLSVSRRASRAPVPGSAVLVPAEVVGTNASPDGSPVAVWLAEN 355 (396) Q Consensus 282 --~~~I~~i~~v~~gl~V~T~~~~y~l~G~----~p~~m~~~~~~~~~p~~~s~~~~~~~~~~~~~~~~~~~~~lw~s~~ 355 (396) ...|.-+.++...|+++|++.-|.|+|. +|++.+..+.+... |...-.....+..++|+++. T Consensus 575 ~~~~~i~~~v~~~~~L~l~T~g~e~~lsg~~~~lTP~t~~i~~~s~~~------------~~~~v~Pv~vG~~v~Fv~~~ 642 (976) T protein:vir:10 575 TYPAIVYDGIQVNAGLLLFTKNQQFMLTTDSDILSPETAKINAVSSYN------------FNEKTHPVSLGTTVAFIDNA 642 (976) T ss_pred CcceeeEEEEecCCcEEEEecCceEEEecCCceecceeEEEEEEEeee------------ccCCCccEEeCCeEEEEecC Confidence 4667888999999999999999999983 23443333332111 22222234456789999998 Q ss_pred CEEE----E---c-CCCc-EEEEecceeeccccccc-eEEEeCcEEEEEeC Q lcl|Aclame:pro 356 GYVM----G---T-SSGA-IAEVHAGVLAGITGRAG-TSVVFDRRLLTAVS 396 (396) Q Consensus 356 Glv~----g---~-~~G~-~~~lt~~~~~~~~a~~~-~~~~~~rr~v~~~~ 396 (396) |=.+ . . .++. ...||.-.-..++...- .+.--+...++... T Consensus 643 g~~~r~~~~~~~~~~~~~~~~dlt~~~~~l~~g~~~~~a~~~~~~~vv~~~ 693 (976) T protein:vir:10 643 NQFTRFFEMSNVVRQGEPDVVDQSKVISRLLDKNISLVSVSRENSVVFFSQ 693 (976) T ss_pred CCeEEEEEEeecccccccchhHHHHHhhhhcCCceEEEEEcCCCcEEEEEE Confidence 7322 1 1 1111 11221111011111110 00011111111111 No 54 >protein:vir:78703 Length: 905 # NCBI annotation: tail tube B # Family: family:all:825 # MgeID: mge:1856 # MgeName: Syn5 # Cross-refs: genbank:acc:YP_001285450;genbank:gi:148724484;genbank:GeneID:5220174 Probab=41.34 E-value=0.93 Score=20.75 Aligned_cols=359 Identities=14% Similarity=0.116 Sum_probs=122.5 Q ss_pred CCcccccceeccCCcCChhheeeCCCchhhheeeeeeeeeCC--CccEEECC-cceeecC-------Ccccc------cc Q lcl|Aclame:pro 1 MATTSLVPLAGINNVAEDAALQRGGESPRLYVRDAVNIDLSP--AGKAQLRA-SVRQVTD-------QPFRQ------LW 64 (396) Q Consensus 1 m~~~~~~p~~G~nn~~~~~~L~~~~~~~~~~lrdAvNVD~~~--~G~l~~R~-G~~~~~~-------~~~~~------lw 64 (396) +....+ .....+....-.+....+-++-.-|.-+.. ..+++.++ +.+..++ .+.-+ -| T Consensus 181 ~~~~a~------~~s~~s~~~~~~g~~~~~~~~~~~~~t~~~~~~l~f~~~~~~~~~~~~~~~~~~~~~~~~l~~g~~~~ 254 (905) T protein:vir:78 181 YRAKAL------EISPGSFEVEDGGVCTEHDVQNYTNQTIGSSTGLAFQVRVQCAAYLENNEYRSRYNVSVVLQNGGTGF 254 (905) T ss_pred eccccc------eeccccccccccccccccceeeeecceeeccCCceeEEeeccccccCCCcccccccceeeeecccccc Confidence 111111 000001000000011111111111111100 00111110 0000000 00011 12 Q ss_pred CCcccccEE-EEECCeEEEEe--c--------------CCCceeecc------ccc---------CcceehhhcCCeEEE Q lcl|Aclame:pro 65 QSPLHGDAF-GALGDQWGKVD--P--------------HSWTFEPLA------QIG---------EGDLSHEVLNNRVCV 112 (396) Q Consensus 65 ~s~~~~~~~-~~~dg~L~~i~--~--------------~~w~~~vl~------~ig---------~gpV~~~v~n~rvy~ 112 (396) +- ++.| ....|.=++|. . ..+....-+ .|. .+-+.+.+..+.+|+ T Consensus 255 ~~---~~~~~v~~~g~~y~i~i~~~~~~~~~~~~~~~~~~t~~d~~a~~~~~~~i~~~l~~~~~~~~~~~~~~~g~~i~v 331 (905) T protein:vir:78 255 RK---GDMITVNLNGRDYNIRVTQEEFVYTYASDGTAAHTTPQDSTAGTLDIGQITAGLVNSVNLISNYSAQAVGNVIEI 331 (905) T ss_pred cc---CccEEEeeccceEEEEEecceeEEEecCCCcccccCccCccCccccHHHHHHHHHHhhcccccEEEEecCcEEEE Confidence 21 1111 21222112221 1 011100000 010 112233455556666 Q ss_pred EcCCcc----eeecCceeeec-----ccc----CCcc-c----eeecCCCCcccceEEEEEEEEcCCCcccccccceeEe Q lcl|Aclame:pro 113 AGTAGI----FTYDGAQAERL-----TLD----TPAP-P----LLVAGAGSLSQGTYGAAVAWLRGPQESAPSLIAFAEV 174 (396) Q Consensus 113 t~~~~~----~~~~g~~~~~l-----~ip----~Pa~-p----~~~~~~Gsl~~g~y~ya~T~V~~~gEeg~~~~~S~~v 174 (396) ..-..+ -..+|..-..+ .+. -|+. | +.+.+.+ ......|-+.|.+..+..+...-=.+.+ T Consensus 332 ~~~~~~~~~~~~~~g~~~~~~~~~~~~v~~~~~Lp~~~~~g~~v~v~~~~--~~~~d~yyv~~~~~~~~~~~~~~W~E~~ 409 (905) T protein:vir:78 332 ERTDGRDFNLGVRGGATNRAMTAIKGTANSIVDLPGQCFDGFELKVINTE--NAESDDYYVVFRSAAEGIPGSGSWEETV 409 (905) T ss_pred EecCCCccEEEEeccCCcceEEEEeccccccccCccccCCCcEEEEEeCC--CCCcceEEEEEEecccCCcCceeEEEec Confidence 432211 11222211111 111 1111 0 1112222 2233445566766533332221112222 Q ss_pred cCCCccEEEeecCCCCCcceEEEEEEecCCCeEEEEEeecceeEEEEcCCchhhcccccchhcCCcC--CC---ceeecc Q lcl|Aclame:pro 175 TDAGALEVTFPLCLDASVTGARLYLTRANGGELLLAGDYPLGAATVILPTLPELGRPAQFRHLSPMP--TG---KHLAYW 249 (396) Q Consensus 175 t~~~~~~v~lp~~~~~~i~~~RIYrs~~~g~~~~lv~e~~~~~~~~~d~~~~~lg~~l~t~~~~ppP--~g---~~~~~~ 249 (396) ..+-...+ +...--.+|||.+. | .|.+.+ +..+ ..+.+-..-..|. ....|.| .| .-+.|+ T Consensus 410 ~~~~~~~~------~~~tmp~~l~r~~~-g-~f~~~~-~~~~-~~~~~~~~r~~Gd----~~Tnp~psf~g~~is~v~f~ 475 (905) T protein:vir:78 410 APGIERGF------NTSTMPHALIRQAD-G-NFTLEA-LNDE-GTITGWAQREVGD----DDTNPKPSFVGRGISDMFFY 475 (905) T ss_pred cccccccc------ccccccEEEEEecC-c-eEEEEE-eccc-cccccccccccCC----cccCCCCcccCCCcceEEEE Confidence 21111111 11112367888652 2 233321 1111 1111100001121 1122333 34 347899 Q ss_pred CCEEEEEECCEEEEccCCCCccccccc-------ccEec------CcceEEEEEcCCcEEEEEcCcEEEEEcC----chh Q lcl|Aclame:pro 250 RGRLLIARANVLRFSEALAYHLHDERY-------GFVQM------PQRITFVQPVDGGIWVGQVDHVAFLDGA----DPA 312 (396) Q Consensus 250 nGrl~~a~Gn~l~fSEp~~p~aw~~~y-------~~~~~------~~~I~~i~~v~~gl~V~T~~~~y~l~G~----~p~ 312 (396) ++||..+.++.||+|.+..++-|-..- +-+.+ ...|.-+.++...|+++|++.-|.|+|. +|+ T Consensus 476 q~RL~f~s~~~v~~Srtgd~~nF~~~t~~~~~DdDpI~~~~ss~~~~~i~~~v~~~~~L~ifT~g~ef~lsg~~~~lTP~ 555 (905) T protein:vir:78 476 NNRLGFLSEDAVIMSQPGDYFNFFVTSAITISDSDPIDVTASSTKPAILRAAIGAPKGLILFAENSQFLLASQEVVFSTA 555 (905) T ss_pred cceEEEecCCeEEEEccCCccccccccccCCCCCccEEEEEcCCcceeeEEEeecCCcEEEEecCceEEEecCCccccce Confidence 999999999999999998888763221 11222 3455668888999999999999999983 345 Q ss_pred heeeeeeccCCCcccceeecchhhhccccccCcccEEEEecCCC----E---EEEcCCCc-E-EE-------Eecceeec Q lcl|Aclame:pro 313 SLSVSRRASRAPVPGSAVLVPAEVVGTNASPDGSPVAVWLAENG----Y---VMGTSSGA-I-AE-------VHAGVLAG 376 (396) Q Consensus 313 ~m~~~~~~~~~p~~~s~~~~~~~~~~~~~~~~~~~~~lw~s~~G----l---v~g~~~G~-~-~~-------lt~~~~~~ 376 (396) +.+..+.+... |-..-..+..+..++|+++.| + .-...+.. . .. +-++.+.. T Consensus 556 s~~i~~~S~~~------------~~~~v~Pv~vG~~vlFv~~~g~~s~vre~~y~~~~d~y~a~DlT~~a~hl~~g~v~~ 623 (905) T protein:vir:78 556 TIKLTEISDYF------------YRSLAKPVSTGVSIAFVSEADTYSKIFEMSIDSVDNRPQVADITRIVPEYVPTGLTW 623 (905) T ss_pred eEEEEeEEeec------------ccCCCCcEEeCCeEEEeecCCCeeEEEEEEeeecccceehhHHHHHHHHhcCCceEE Confidence 54444332211 111111233456799999987 2 11111110 0 00 11111111 Q ss_pred -cccccceEEEeC---cEEEEE--eC Q lcl|Aclame:pro 377 -ITGRAGTSVVFD---RRLLTA--VS 396 (396) Q Consensus 377 -~~a~~~~~~~~~---rr~v~~--~~ 396 (396) .....-..+++. +.+.+. +. T Consensus 624 ~~~s~~~~~v~~~~~~~~l~~ytyl~ 649 (905) T protein:vir:78 624 SVSTPNNSMMLFGDNSNTAYIFKFFN 649 (905) T ss_pred EEecCCCcEEEEEcCCCeEEEEEeec Confidence 000001112221 121111 11 Done!