Query lcl|NC_019442.1_cdsid_YP_007001480.1 [gene=F366_gp60] [protein=hypothetical protein] [protein_id=YP_007001480.1] [location=43249..44874] Match_columns 541 No_of_seqs 60 out of 85 Neff 5.1 Searched_HMMs 1612 Date Thu Nov 7 17:59:51 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_60 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_60_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:827 Length: 567 # 100.0 1E-254 7E-258 1412.7 55.7 541 1-541 27-567 (567) 2 protein:vir:2792 Length: 567 # 100.0 2E-254 1E-257 1412.0 55.9 541 1-541 27-567 (567) 3 protein:vir:9979 Length: 567 # 100.0 2E-254 1E-257 1412.0 55.9 541 1-541 27-567 (567) 4 protein:vir:10145 Length: 567 100.0 2E-254 1E-257 1412.0 55.9 541 1-541 27-567 (567) 5 protein:vir:3306 Length: 567 # 100.0 2E-254 1E-257 1412.0 55.9 541 1-541 27-567 (567) 6 protein:vir:104388 Length: 566 100.0 7E-254 4E-257 1408.5 55.5 541 1-541 26-566 (566) 7 protein:vir:93631 Length: 580 100.0 4E-220 2E-223 1223.5 52.5 514 1-541 1-579 (580) 8 protein:vir:5120 Length: 615 # 100.0 5E-219 3E-222 1217.5 48.5 515 1-541 30-613 (615) 9 protein:vir:105563 Length: 396 100.0 9E-102 5E-105 574.7 26.9 351 1-398 1-396 (396) 10 protein:vir:8837 Length: 513 # 99.4 3.4E-11 2.1E-14 78.0 37.4 419 1-537 3-513 (513) 11 protein:vir:3133 Length: 911 # 99.0 1.4E-09 8.6E-13 69.1 23.7 436 1-541 121-685 (911) 12 protein:vir:95475 Length: 771 98.8 2E-08 1.2E-11 62.8 22.6 442 1-541 117-657 (771) 13 protein:vir:2625 Length: 715 # 98.7 1.3E-07 8E-11 58.3 24.7 422 1-541 125-606 (715) 14 protein:vir:108312 Length: 458 98.6 3E-07 1.8E-10 56.4 32.8 394 127-539 1-458 (458) 15 protein:vir:9268 Length: 472 # 98.5 4.1E-07 2.6E-10 55.6 32.9 394 112-537 1-472 (472) 16 protein:vir:63741 Length: 468 98.4 6.4E-09 4E-12 65.5 10.2 249 1-301 203-468 (468) 17 protein:vir:80491 Length: 467 98.4 6.8E-09 4.2E-12 65.3 10.2 249 1-301 202-467 (467) 18 protein:vir:105525 Length: 472 98.3 1.8E-06 1.1E-09 52.0 33.2 389 112-537 1-472 (472) 19 protein:vir:96666 Length: 462 98.3 2.9E-08 1.8E-11 61.9 11.0 255 1-293 182-462 (462) 20 protein:vir:100960 Length: 472 98.2 3.1E-06 1.9E-09 50.8 33.0 394 112-537 1-472 (472) 21 protein:vir:177 Length: 472 # 98.1 5.7E-06 3.5E-09 49.4 31.4 386 112-540 1-472 (472) 22 protein:vir:105428 Length: 472 97.9 1.1E-05 6.6E-09 47.9 30.9 386 112-540 1-472 (472) 23 protein:vir:107423 Length: 681 97.9 1.3E-05 8E-09 47.4 37.0 483 1-541 36-615 (681) 24 protein:vir:98487 Length: 681 97.9 1.3E-05 8E-09 47.4 37.0 483 1-541 36-615 (681) 25 protein:vir:107802 Length: 681 97.9 1.3E-05 8E-09 47.4 37.0 483 1-541 36-615 (681) 26 protein:vir:2109 Length: 472 # 97.8 1.6E-05 9.9E-09 46.9 33.3 389 121-537 1-472 (472) 27 protein:vir:3529 Length: 477 # 97.5 4.8E-05 3E-08 44.3 33.2 396 112-537 1-477 (477) 28 protein:vir:100851 Length: 514 97.3 1.7E-05 1E-08 46.8 12.2 268 1-298 169-514 (514) 29 protein:vir:102644 Length: 594 97.3 0.00011 6.7E-08 42.3 28.6 430 1-541 1-530 (594) 30 protein:vir:80835 Length: 464 97.1 1.4E-05 9E-09 47.1 9.9 253 1-300 160-464 (464) 31 protein:vir:99311 Length: 463 96.7 3.4E-05 2.1E-08 45.1 9.3 234 1-298 202-463 (463) 32 protein:vir:95603 Length: 463 96.7 3.4E-05 2.1E-08 45.1 9.3 234 1-298 202-463 (463) 33 protein:vir:103790 Length: 768 96.7 0.00042 2.6E-07 39.1 35.8 516 1-541 1-697 (768) 34 protein:vir:1778 Length: 680 # 96.7 0.00043 2.7E-07 39.0 29.0 400 1-453 219-680 (680) 35 protein:vir:100022 Length: 976 95.6 0.0017 1.1E-06 35.7 38.4 511 1-541 216-899 (976) 36 protein:vir:8887 Length: 808 # 93.3 0.0082 5.1E-06 32.0 40.3 514 1-541 51-733 (808) 37 protein:vir:80253 Length: 777 91.0 0.018 1.1E-05 30.2 42.0 497 1-541 48-700 (777) 38 protein:vir:99677 Length: 794 90.5 0.02 1.3E-05 29.8 40.4 495 1-541 48-718 (794) 39 protein:vir:95324 Length: 823 90.3 0.022 1.3E-05 29.7 37.8 497 1-541 52-669 (823) 40 protein:vir:102823 Length: 470 87.7 0.037 2.3E-05 28.4 10.4 248 1-304 167-470 (470) 41 protein:vir:78957 Length: 826 87.3 0.04 2.5E-05 28.3 36.4 485 1-541 149-748 (826) 42 protein:vir:2203 Length: 794 # 70.7 0.21 0.00013 24.4 38.8 492 1-541 51-715 (794) 43 protein:vir:94583 Length: 792 70.6 0.21 0.00013 24.3 38.4 501 1-541 51-716 (792) 44 protein:vir:1543 Length: 801 # 68.6 0.24 0.00015 24.0 38.0 508 1-541 48-725 (801) 45 protein:vir:78703 Length: 905 40.4 0.97 0.0006 20.7 32.7 487 1-541 242-828 (905) 46 protein:vir:6326 Length: 826 # 30.2 1.6 0.00099 19.5 36.0 483 1-541 149-748 (826) 47 protein:vir:94713 Length: 785 27.7 1.8 0.0011 19.2 34.3 506 1-541 48-711 (785) 48 protein:vir:7329 Length: 825 # 24.6 2.2 0.0013 18.8 38.1 473 1-541 84-664 (825) 49 protein:vir:352 Length: 536 # 23.1 2.3 0.0015 18.6 26.7 416 71-541 1-523 (536) No 1 >protein:vir:827 Length: 567 # NCBI annotation: hypothetical protein # Family: family:all:1544 # MgeID: mge:16 # MgeName: VT2-Sa # Cross-refs: genbank:acc:NP_050560;genbank:gi:9633457;genbank:GeneID:1262210 Probab=100.00 E-value=1.2e-254 Score=1412.73 Aligned_cols=541 Identities=98% Similarity=1.515 Sum_probs=537.4 Q ss_pred CceEEecccccccccccceecccccceEEEEeeecCCeeeeeecccccCccccccceeEEEECCcEEEEeCCeEEEeeCC Q lcl|NC_019442. 1 MPYIDITTMRGMMPRVVTSMLPEHSAVLAEDCHFRFGVITPERQISGVEKTFTIKPKTIFHYRDDFWFAWPDVVDVIRSP 80 (541) Q Consensus 1 m~~i~i~~f~G~~Pr~~p~llp~~~a~~a~N~~~~~G~l~P~~~~~~v~~~~~~~~~Tif~~~~~~W~~w~~~V~vv~sp 80 (541) |++|+|++|+||+||++||||||++||+|+||||++|+|+|+|+|++++++++..+||||||+++|||+|+++||||||| T Consensus 27 M~~i~i~~f~Ge~Prl~p~lLP~~~a~~A~n~~~~~G~itP~~~~~~~~~~~~~~~~Tif~y~~~~W~~w~~~V~~ir~P 106 (567) T protein:vir:82 27 MPYIDITTMRGMMPRVVTSMLPEHSAVLAEDCHFRFGVITPERQISGVEKTFTIKPKTIFHYRDDFWFAWPDVVDVIRSP 106 (567) T ss_pred eeEEeecccccccccchhhhccccccceEEeeeecCCeeeeeecccccccccccCceeeeeecCcEeEEeCCceeeccCc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccCCCCeEEEeCCCCcceeecceeeccccCCccceeeecCCCCCccceEEecCCCCCCCCCCCcccceEEEEEEEecCC Q lcl|NC_019442. 81 IAQDPHGRIYYTDGRFPKVTDATIATKGDGNHPASSYSLGIPAPTTAPVCTVQQGGDVSDDNPNDDETRFYTETFVSDYG 160 (541) Q Consensus 81 ia~D~~~Rvy~t~~~~pk~t~~~ia~~g~g~~p~a~y~LGVp~P~~~pv~~v~~~~~~~~~~~~~~~ty~Yv~T~V~~~G 160 (541) ||||+|||||||||++||+|+++||++|+|+||.+||+||||+|+++|++++++++++++++++|+++|+|+|||||++| T Consensus 107 vAqD~~~rvY~tgdg~Pk~t~~~iat~G~~~~P~~~y~LgVpaps~aP~~a~~~~~~~~~~~p~d~etr~Yv~TfVt~~G 186 (567) T protein:vir:82 107 IAQDPHGRIYYTDGRFPKVTDATIATKGDGNHPTSSYRLGIPAPTTAPVCTVQQGGDVSDDNPNDDETRFYTETFVSDYG 186 (567) T ss_pred cccCCcccEEEecCCcceeeeeeeeecCCCCCCcchhhcccCCccccceeeecCCCCCCCCCCccccceEEEEEEEcCCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CccCccccccceeecCCCCEEEEccccCCCCccccceEEEEEeecCCCceeEEEEEeeccceEEEEecccccccCccchh Q lcl|NC_019442. 161 EEGPPGPASLEVTLRTPGTAVQLTLSPVPLQNASIKRRRIYRSASGGGEADFLLVAELDASVLSYTDKIPGKNLGPSLAT 240 (541) Q Consensus 161 eEs~Ps~~S~~vtv~~~g~~v~l~~~p~~~~~~~i~~~RIYRs~t~~~~~~~~lVael~~~~~sf~D~~~~~~L~~~L~t 240 (541) |||+|||+|.++++..+|+.|+|+++|++++++||+++|||||++|++++||+||||+++++++|+|++++++|+++||| T Consensus 187 eES~PS~~S~~~~v~~pg~~V~ls~~p~~~~~~~i~~~RIYRS~tg~~gtdy~lVael~as~~sf~D~~~~~~lg~~Lps 266 (567) T protein:vir:82 187 EEGPPGPASLEVTLRTPGTAVQLTLAPVPLQNASIKRRRIYRSASGGGEADFLLVAELDASVLSYTDKIPAKNLGPSLAT 266 (567) T ss_pred CcCCCcccccceeeecCCceEEEeeccCCccccccceEEEEEecCCCCceeeEEEEeeccceeeeeeccchhhccccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhhhCCCCCcceEEeccCcEEEEEeCCEEEEecCCCcccCchhcccccCcceEEEEEcCCcEEEEEcCCEEEEEccCccc Q lcl|NC_019442. 241 WDYLPPPENMTGLCLMANGIAAGFAGNEVMFSEAYLPYAWPEVNRHTTAEDIVAICPLGTSLVVATKGEPYLFSGVSPST 320 (541) Q Consensus 241 ~~~~~pP~~~~gL~~m~NGi~a~f~Gn~l~fSep~~P~awp~~y~~t~~~~Iv~ia~v~~~lvV~T~~~py~l~G~~p~s 320 (541) ++|++||++|+|||+|+|||||||.||+|||||||+|||||++|++++++||||||+++++|||+|+|+||+++|++|++ T Consensus 267 ~~w~~PP~~m~GL~~m~NGimAgF~GneV~FsEpylPyAWP~~Yr~t~~~dIVaiA~~gt~LVV~TkG~PYl~sG~sP~s 346 (567) T protein:vir:82 267 WDYLPPPENMTGLCLMANGIAAGFAGNEVMFSEAYLPYAWPEVNRHTTAEDIVAICPLRTSLVVATKGEPYLFSGVSPST 346 (567) T ss_pred ccccCcCcccceeeecccceEEeecCCEEEEecCCCCcccchhhccCCCCCeEEEEecccEEEEEEcCceEEEEcCChhh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ceEEeecccccccccchheeCCccEEEecCCcEEEEeCCCceEEEecccCChhHhhhhcCcceEEEEEEcCeEEEEEecC Q lcl|NC_019442. 321 ISGSRIPSMQACLSRRSMVAMEGFVLYAGTNGLVSVDVNGNTALATEKIISPEQWQSQFNPASIVAYSWRGEYIACYTKP 400 (541) Q Consensus 321 ~~~~~l~~~~pCvs~rsiv~~~~~v~y~s~dGLv~~~~~G~~~~vT~~~~~~~~W~~~l~P~ti~a~~~eG~Y~~~y~~~ 400 (541) |+++||+++|||+|+||||.++++|+|||+||||+++++|+++++|++||+|||||++|||+||+|++|||+||+||++. T Consensus 347 ms~~kL~~~qpCvS~rsiV~~~g~v~Yas~dGLv~i~a~G~a~vvT~~l~t~~qW~a~~~P~ti~A~~~eG~Y~a~Y~~~ 426 (567) T protein:vir:82 347 ISGSKIPSMQACLSRRSMVAMEGFVLYAGTNGLVSVDANGNVALATEQIVSPEQWQSQFNPASIVAYPWRGEYIACYTKP 426 (567) T ss_pred ccccccccccccccccceeeecceEEeecCCcEEEEecCCchhhhhhhccChHHHHhcCCcceEEEEeecCeEEEEEeCC Confidence 99999999999999999999999999999999999999999999999999999999889999999999999999999999 Q ss_pred CCccceEEEccCCceeEEEeecccEEEEEecCCEEEEEECCEEEEecCCCCceeEEEEcceEEeCcccceeEEEEeeCCC Q lcl|NC_019442. 401 DGKQDVFVFSPVNMDIRYLSTPFDCAWVDLAKDMMRVVTGDKMSVLAGGSLPSTIRWHSKIFSLPERTSFSCIRVKSPAP 480 (541) Q Consensus 401 ~g~~~~~i~d~~~~~~~~~~~~~d~~~~~~~~d~LY~~~g~~i~~~~~g~~~~~~~WrSk~f~~~~~~~~~~~~V~~~~~ 480 (541) +|++++|||||+++++++++++|||+|+|.++|+||+++|++||+|++|+++++++||||+|++|+++||+|+||++.++ T Consensus 427 ~g~~~~fifdp~~~~~~~i~~~~~~~~~d~~~d~Ly~~~~~~l~~~~~g~~~~~~~WrSK~f~~p~~~sf~~~rV~s~~~ 506 (567) T protein:vir:82 427 DGKQDVFVFSPVNMDIRYLSTPFDCAWVDLAKDMMRVVTGDKMSVLAGGALPSTIRWHSKIFSLPERTSFSCIRVKSPAP 506 (567) T ss_pred CCCcceEEEcccccEEEEEecCceeEEEEeecCeEEEeeCCEEeeecCCCCceeEEEecceEEecCccceeEEEEeccCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccEEEEEEECCceeEeecccccCCcceEccCcccceEEEEEEecceEEEEEeecchhhcCC Q lcl|NC_019442. 481 ERVGITIMADDVPVIHFAPGTFKGSVVRLPAATGQNWQVMVSGFGQVERITLSTSMSEMPV 541 (541) Q Consensus 481 ~~~~v~~~~d~~~~~~~~~~~~~~~~~rLP~~~~~~w~iei~g~~~V~~i~la~s~~EL~~ 541 (541) ++++|++++||..++++++++++++|+|||++++|+|||||+|+++|+||+||+||+|||| T Consensus 507 ~~v~i~~~~dg~~v~~~~~g~~~~~~~rlp~~~ar~Weveisg~~~V~~v~LA~S~~EL~~ 567 (567) T protein:vir:82 507 ERVGITIMADDVPVIHFAPGTFKGSVVRLPAATGQNWQVMVSGFGQVERITLSTSMSEMPV 567 (567) T ss_pred CceeEEEEEcCCceeecCCcccCCceeeccCcccceEEEEEEecccEEEEEEecchhhcCC Confidence 9999999999999999999999999999999999999999999999999999999999999 No 2 >protein:vir:2792 Length: 567 # NCBI annotation: hypothetical protein # Family: family:all:1544 # MgeID: mge:59 # MgeName: Stx2 converting bacteriophage I # Cross-refs: genbank:acc:NP_612909;genbank:gi:20065826;genbank:GeneID:935648 Probab=100.00 E-value=1.6e-254 Score=1412.02 Aligned_cols=541 Identities=98% Similarity=1.520 Sum_probs=537.3 Q ss_pred CceEEecccccccccccceecccccceEEEEeeecCCeeeeeecccccCccccccceeEEEECCcEEEEeCCeEEEeeCC Q lcl|NC_019442. 1 MPYIDITTMRGMMPRVVTSMLPEHSAVLAEDCHFRFGVITPERQISGVEKTFTIKPKTIFHYRDDFWFAWPDVVDVIRSP 80 (541) Q Consensus 1 m~~i~i~~f~G~~Pr~~p~llp~~~a~~a~N~~~~~G~l~P~~~~~~v~~~~~~~~~Tif~~~~~~W~~w~~~V~vv~sp 80 (541) |++|+|++|+||+||++||||||++||+|+||||++|+|+|+|+|++++++++..+||||||+++|||+|+++||||||| T Consensus 27 M~~i~i~~f~Ge~Prl~p~lLP~~~a~~A~n~~~~~G~itP~~~~~~~~~~~~~~~~Tif~y~~~~W~~w~~~V~~ir~P 106 (567) T protein:vir:27 27 MPYIDITTMRGMMPRVVTSMLPEHSAVLAEDCHFRFGVITPERQISGVEKTFTIKPKTIFHYRDDFWFAWPDVVDVIRSP 106 (567) T ss_pred eeEEeecccccccccchhhhccccccceEEeeeccCCeeeeeecccccccccccCceeeEEEcCcEEEEeCCceeeccCc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccCCCCeEEEeCCCCcceeecceeeccccCCccceeeecCCCCCccceEEecCCCCCCCCCCCcccceEEEEEEEecCC Q lcl|NC_019442. 81 IAQDPHGRIYYTDGRFPKVTDATIATKGDGNHPASSYSLGIPAPTTAPVCTVQQGGDVSDDNPNDDETRFYTETFVSDYG 160 (541) Q Consensus 81 ia~D~~~Rvy~t~~~~pk~t~~~ia~~g~g~~p~a~y~LGVp~P~~~pv~~v~~~~~~~~~~~~~~~ty~Yv~T~V~~~G 160 (541) ||||+|||||||||++||+|+++||++|+|+||.+||+||||+|+++|++++++++++++++++|+++|+|+|||||++| T Consensus 107 vAqD~~~rvY~tgdg~Pk~t~~~iat~G~~~~P~~~y~LgVpaps~aP~~a~~~~~~~~~~~~~d~etr~Yv~TfVt~~G 186 (567) T protein:vir:27 107 IAQDPHGRIYYTDGRFPKVTDATIATKGDGNHPTSSYRLGIPAPTTAPVCTVQQGGDVSDDNPNDDETRFYTETFVSDYG 186 (567) T ss_pred cccCCcceEEEecCCcceeeeeeeeecCCCCCCcchhhcccCCccccceeeecCCCCCCCCCCcccceeEEEEEEEcCCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CccCccccccceeecCCCCEEEEccccCCCCccccceEEEEEeecCCCceeEEEEEeeccceEEEEecccccccCccchh Q lcl|NC_019442. 161 EEGPPGPASLEVTLRTPGTAVQLTLSPVPLQNASIKRRRIYRSASGGGEADFLLVAELDASVLSYTDKIPGKNLGPSLAT 240 (541) Q Consensus 161 eEs~Ps~~S~~vtv~~~g~~v~l~~~p~~~~~~~i~~~RIYRs~t~~~~~~~~lVael~~~~~sf~D~~~~~~L~~~L~t 240 (541) |||+|||+|.++++..+|+.|+|+++|++++++||+++|||||++|++++||+||||+++++++|+|+.++++|+++||| T Consensus 187 eES~PS~~S~~~~v~~pg~~V~ls~~p~~~~~~~i~~~RIYRS~tg~~gtdy~lVael~as~~sf~D~~~~~~lg~~Lps 266 (567) T protein:vir:27 187 EEGPPGPASLEVTLRTPGTAVQLTLAPVPLQNASIKRRRIYRSASGGGEADFLLVAELDASVLSYTDKIPAKNLGPSLAT 266 (567) T ss_pred CcCCCcccccceeeecCCceEEEeeccCCccccccceEEEEEecCCCCceeeEEEEeeccceeeeeeccchhhccccccc Confidence 99999999999999989999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhhhCCCCCcceEEeccCcEEEEEeCCEEEEecCCCcccCchhcccccCcceEEEEEcCCcEEEEEcCCEEEEEccCccc Q lcl|NC_019442. 241 WDYLPPPENMTGLCLMANGIAAGFAGNEVMFSEAYLPYAWPEVNRHTTAEDIVAICPLGTSLVVATKGEPYLFSGVSPST 320 (541) Q Consensus 241 ~~~~~pP~~~~gL~~m~NGi~a~f~Gn~l~fSep~~P~awp~~y~~t~~~~Iv~ia~v~~~lvV~T~~~py~l~G~~p~s 320 (541) ++|++||++|+|||+|+|||||||.||+|||||||+|||||++|++++++||||||+++++|||+|+|+||+++|++|++ T Consensus 267 ~~w~~PP~~m~GL~~m~NGimAgF~GneV~FsEpylPyAWP~~Yr~t~~~dIVaiA~~gt~LVV~TkG~PYl~sG~sP~s 346 (567) T protein:vir:27 267 WDYLPPPENMTGLCLMANGIAAGFAGNEVMFSEAYLPYAWPEVNRHTTAEDIVAICPLGTSLVVATKGEPYLFSGVSPST 346 (567) T ss_pred ccccCcCcccceeeecccceEEeecCCEEEEecCCCCcccchhhccCCCCCeEEEeecccEEEEEEcCceEEEEcCChhh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ceEEeecccccccccchheeCCccEEEecCCcEEEEeCCCceEEEecccCChhHhhhhcCcceEEEEEEcCeEEEEEecC Q lcl|NC_019442. 321 ISGSRIPSMQACLSRRSMVAMEGFVLYAGTNGLVSVDVNGNTALATEKIISPEQWQSQFNPASIVAYSWRGEYIACYTKP 400 (541) Q Consensus 321 ~~~~~l~~~~pCvs~rsiv~~~~~v~y~s~dGLv~~~~~G~~~~vT~~~~~~~~W~~~l~P~ti~a~~~eG~Y~~~y~~~ 400 (541) |+++||+++|||+|+||||.++++|+|||+||||+++++|+++++|++||+|||||++|||+||+|++|||+||+||++. T Consensus 347 ms~~kL~~~qpCvS~rsiV~~~g~v~Yas~dGLv~i~a~G~a~vvT~~l~t~~qW~a~~~P~ti~A~~~eG~Y~a~Y~~~ 426 (567) T protein:vir:27 347 ISGSKIPSMQACLSRRSMVAMEGFVLYAGTNGLVSVDANGNVALATEQIVSPEQWQSQFNPASIVAYPWRGEYIACYTKP 426 (567) T ss_pred ccccccccccccccccceeEeccEEEeecCCcEEEEecCCchhhhhhhccChHHHHhcCCcceEEEEeecCeEEEEEecC Confidence 99999999999999999999999999999999999999999999999999999999889999999999999999999999 Q ss_pred CCccceEEEccCCceeEEEeecccEEEEEecCCEEEEEECCEEEEecCCCCceeEEEEcceEEeCcccceeEEEEeeCCC Q lcl|NC_019442. 401 DGKQDVFVFSPVNMDIRYLSTPFDCAWVDLAKDMMRVVTGDKMSVLAGGSLPSTIRWHSKIFSLPERTSFSCIRVKSPAP 480 (541) Q Consensus 401 ~g~~~~~i~d~~~~~~~~~~~~~d~~~~~~~~d~LY~~~g~~i~~~~~g~~~~~~~WrSk~f~~~~~~~~~~~~V~~~~~ 480 (541) +|++++|||||+++++++++++|||+|+|.++|+||+++|++||+|++|+++++++||||+|++|+++||+|+||++.++ T Consensus 427 ~g~~~~fifdp~~~~~~~i~~~~~~~~~d~~~d~Ly~~~~~~l~~~~~g~~~~~~~WrSK~f~~p~~~sf~~~rV~s~~~ 506 (567) T protein:vir:27 427 DGKQDVFVFSPVNMDIRYLSTPFDCAWVDLAKDMMRVVTGDKMSVLAGGALPSTIRWHSKIFSLPERTSFSCIRVKSPAP 506 (567) T ss_pred CCCcceEEEcccccEEEEEecCceeEEEEeecCeEEEeeCCEEeeecCCCCceeEEEecceEEecCccceeEEEEeccCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccEEEEEEECCceeEeecccccCCcceEccCcccceEEEEEEecceEEEEEeecchhhcCC Q lcl|NC_019442. 481 ERVGITIMADDVPVIHFAPGTFKGSVVRLPAATGQNWQVMVSGFGQVERITLSTSMSEMPV 541 (541) Q Consensus 481 ~~~~v~~~~d~~~~~~~~~~~~~~~~~rLP~~~~~~w~iei~g~~~V~~i~la~s~~EL~~ 541 (541) ++++|++++||..++++++++++++|+|||++++|+|||||+|+++|+||+||+||+|||| T Consensus 507 ~~v~i~~~~dg~~v~~~~~g~~~~~~~rlp~~~ar~Weveisg~~~V~~v~LA~S~~EL~~ 567 (567) T protein:vir:27 507 ERVGITIMADDVPVIHFAPGTFKGSVVRLPAATGQNWQVMVSGFGQVERITLSTSMSEMPV 567 (567) T ss_pred cceeEEEEEcCCceeecCCccccCceeecCCcccceEEEEEEecccEEEEEEecchhhcCC Confidence 9999999999999999999999999999999999999999999999999999999999999 No 3 >protein:vir:9979 Length: 567 # NCBI annotation: hypothetical protein # Family: family:all:1544 # MgeID: mge:179 # MgeName: Stx1 converting bacteriophage # Cross-refs: genbank:acc:NP_859109;genbank:gi:32170864;genbank:GeneID:2653256 Probab=100.00 E-value=1.6e-254 Score=1412.02 Aligned_cols=541 Identities=98% Similarity=1.520 Sum_probs=537.3 Q ss_pred CceEEecccccccccccceecccccceEEEEeeecCCeeeeeecccccCccccccceeEEEECCcEEEEeCCeEEEeeCC Q lcl|NC_019442. 1 MPYIDITTMRGMMPRVVTSMLPEHSAVLAEDCHFRFGVITPERQISGVEKTFTIKPKTIFHYRDDFWFAWPDVVDVIRSP 80 (541) Q Consensus 1 m~~i~i~~f~G~~Pr~~p~llp~~~a~~a~N~~~~~G~l~P~~~~~~v~~~~~~~~~Tif~~~~~~W~~w~~~V~vv~sp 80 (541) |++|+|++|+||+||++||||||++||+|+||||++|+|+|+|+|++++++++..+||||||+++|||+|+++||||||| T Consensus 27 M~~i~i~~f~Ge~Prl~p~lLP~~~a~~A~n~~~~~G~itP~~~~~~~~~~~~~~~~Tif~y~~~~W~~w~~~V~~ir~P 106 (567) T protein:vir:99 27 MPYIDITTMRGMMPRVVTSMLPEHSAVLAEDCHFRFGVITPERQISGVEKTFTIKPKTIFHYRDDFWFAWPDVVDVIRSP 106 (567) T ss_pred eeEEeecccccccccchhhhccccccceEEeeeccCCeeeeeecccccccccccCceeeEEEcCcEEEEeCCceeeccCc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccCCCCeEEEeCCCCcceeecceeeccccCCccceeeecCCCCCccceEEecCCCCCCCCCCCcccceEEEEEEEecCC Q lcl|NC_019442. 81 IAQDPHGRIYYTDGRFPKVTDATIATKGDGNHPASSYSLGIPAPTTAPVCTVQQGGDVSDDNPNDDETRFYTETFVSDYG 160 (541) Q Consensus 81 ia~D~~~Rvy~t~~~~pk~t~~~ia~~g~g~~p~a~y~LGVp~P~~~pv~~v~~~~~~~~~~~~~~~ty~Yv~T~V~~~G 160 (541) ||||+|||||||||++||+|+++||++|+|+||.+||+||||+|+++|++++++++++++++++|+++|+|+|||||++| T Consensus 107 vAqD~~~rvY~tgdg~Pk~t~~~iat~G~~~~P~~~y~LgVpaps~aP~~a~~~~~~~~~~~~~d~etr~Yv~TfVt~~G 186 (567) T protein:vir:99 107 IAQDPHGRIYYTDGRFPKVTDATIATKGDGNHPTSSYRLGIPAPTTAPVCTVQQGGDVSDDNPNDDETRFYTETFVSDYG 186 (567) T ss_pred cccCCcceEEEecCCcceeeeeeeeecCCCCCCcchhhcccCCccccceeeecCCCCCCCCCCcccceeEEEEEEEcCCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CccCccccccceeecCCCCEEEEccccCCCCccccceEEEEEeecCCCceeEEEEEeeccceEEEEecccccccCccchh Q lcl|NC_019442. 161 EEGPPGPASLEVTLRTPGTAVQLTLSPVPLQNASIKRRRIYRSASGGGEADFLLVAELDASVLSYTDKIPGKNLGPSLAT 240 (541) Q Consensus 161 eEs~Ps~~S~~vtv~~~g~~v~l~~~p~~~~~~~i~~~RIYRs~t~~~~~~~~lVael~~~~~sf~D~~~~~~L~~~L~t 240 (541) |||+|||+|.++++..+|+.|+|+++|++++++||+++|||||++|++++||+||||+++++++|+|+.++++|+++||| T Consensus 187 eES~PS~~S~~~~v~~pg~~V~ls~~p~~~~~~~i~~~RIYRS~tg~~gtdy~lVael~as~~sf~D~~~~~~lg~~Lps 266 (567) T protein:vir:99 187 EEGPPGPASLEVTLRTPGTAVQLTLAPVPLQNASIKRRRIYRSASGGGEADFLLVAELDASVLSYTDKIPAKNLGPSLAT 266 (567) T ss_pred CcCCCcccccceeeecCCceEEEeeccCCccccccceEEEEEecCCCCceeeEEEEeeccceeeeeeccchhhccccccc Confidence 99999999999999989999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhhhCCCCCcceEEeccCcEEEEEeCCEEEEecCCCcccCchhcccccCcceEEEEEcCCcEEEEEcCCEEEEEccCccc Q lcl|NC_019442. 241 WDYLPPPENMTGLCLMANGIAAGFAGNEVMFSEAYLPYAWPEVNRHTTAEDIVAICPLGTSLVVATKGEPYLFSGVSPST 320 (541) Q Consensus 241 ~~~~~pP~~~~gL~~m~NGi~a~f~Gn~l~fSep~~P~awp~~y~~t~~~~Iv~ia~v~~~lvV~T~~~py~l~G~~p~s 320 (541) ++|++||++|+|||+|+|||||||.||+|||||||+|||||++|++++++||||||+++++|||+|+|+||+++|++|++ T Consensus 267 ~~w~~PP~~m~GL~~m~NGimAgF~GneV~FsEpylPyAWP~~Yr~t~~~dIVaiA~~gt~LVV~TkG~PYl~sG~sP~s 346 (567) T protein:vir:99 267 WDYLPPPENMTGLCLMANGIAAGFAGNEVMFSEAYLPYAWPEVNRHTTAEDIVAICPLGTSLVVATKGEPYLFSGVSPST 346 (567) T ss_pred ccccCcCcccceeeecccceEEeecCCEEEEecCCCCcccchhhccCCCCCeEEEeecccEEEEEEcCceEEEEcCChhh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ceEEeecccccccccchheeCCccEEEecCCcEEEEeCCCceEEEecccCChhHhhhhcCcceEEEEEEcCeEEEEEecC Q lcl|NC_019442. 321 ISGSRIPSMQACLSRRSMVAMEGFVLYAGTNGLVSVDVNGNTALATEKIISPEQWQSQFNPASIVAYSWRGEYIACYTKP 400 (541) Q Consensus 321 ~~~~~l~~~~pCvs~rsiv~~~~~v~y~s~dGLv~~~~~G~~~~vT~~~~~~~~W~~~l~P~ti~a~~~eG~Y~~~y~~~ 400 (541) |+++||+++|||+|+||||.++++|+|||+||||+++++|+++++|++||+|||||++|||+||+|++|||+||+||++. T Consensus 347 ms~~kL~~~qpCvS~rsiV~~~g~v~Yas~dGLv~i~a~G~a~vvT~~l~t~~qW~a~~~P~ti~A~~~eG~Y~a~Y~~~ 426 (567) T protein:vir:99 347 ISGSKIPSMQACLSRRSMVAMEGFVLYAGTNGLVSVDANGNVALATEQIVSPEQWQSQFNPASIVAYPWRGEYIACYTKP 426 (567) T ss_pred ccccccccccccccccceeEeccEEEeecCCcEEEEecCCchhhhhhhccChHHHHhcCCcceEEEEeecCeEEEEEecC Confidence 99999999999999999999999999999999999999999999999999999999889999999999999999999999 Q ss_pred CCccceEEEccCCceeEEEeecccEEEEEecCCEEEEEECCEEEEecCCCCceeEEEEcceEEeCcccceeEEEEeeCCC Q lcl|NC_019442. 401 DGKQDVFVFSPVNMDIRYLSTPFDCAWVDLAKDMMRVVTGDKMSVLAGGSLPSTIRWHSKIFSLPERTSFSCIRVKSPAP 480 (541) Q Consensus 401 ~g~~~~~i~d~~~~~~~~~~~~~d~~~~~~~~d~LY~~~g~~i~~~~~g~~~~~~~WrSk~f~~~~~~~~~~~~V~~~~~ 480 (541) +|++++|||||+++++++++++|||+|+|.++|+||+++|++||+|++|+++++++||||+|++|+++||+|+||++.++ T Consensus 427 ~g~~~~fifdp~~~~~~~i~~~~~~~~~d~~~d~Ly~~~~~~l~~~~~g~~~~~~~WrSK~f~~p~~~sf~~~rV~s~~~ 506 (567) T protein:vir:99 427 DGKQDVFVFSPVNMDIRYLSTPFDCAWVDLAKDMMRVVTGDKMSVLAGGALPSTIRWHSKIFSLPERTSFSCIRVKSPAP 506 (567) T ss_pred CCCcceEEEcccccEEEEEecCceeEEEEeecCeEEEeeCCEEeeecCCCCceeEEEecceEEecCccceeEEEEeccCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccEEEEEEECCceeEeecccccCCcceEccCcccceEEEEEEecceEEEEEeecchhhcCC Q lcl|NC_019442. 481 ERVGITIMADDVPVIHFAPGTFKGSVVRLPAATGQNWQVMVSGFGQVERITLSTSMSEMPV 541 (541) Q Consensus 481 ~~~~v~~~~d~~~~~~~~~~~~~~~~~rLP~~~~~~w~iei~g~~~V~~i~la~s~~EL~~ 541 (541) ++++|++++||..++++++++++++|+|||++++|+|||||+|+++|+||+||+||+|||| T Consensus 507 ~~v~i~~~~dg~~v~~~~~g~~~~~~~rlp~~~ar~Weveisg~~~V~~v~LA~S~~EL~~ 567 (567) T protein:vir:99 507 ERVGITIMADDVPVIHFAPGTFKGSVVRLPAATGQNWQVMVSGFGQVERITLSTSMSEMPV 567 (567) T ss_pred cceeEEEEEcCCceeecCCccccCceeecCCcccceEEEEEEecccEEEEEEecchhhcCC Confidence 9999999999999999999999999999999999999999999999999999999999999 No 4 >protein:vir:10145 Length: 567 # NCBI annotation: hypothetical protein # Family: family:all:1544 # MgeID: mge:180 # MgeName: Stx2 converting bacteriophage II # Cross-refs: genbank:acc:NP_859275;genbank:gi:32171031;genbank:GeneID:2653447 Probab=100.00 E-value=1.6e-254 Score=1412.02 Aligned_cols=541 Identities=98% Similarity=1.520 Sum_probs=537.3 Q ss_pred CceEEecccccccccccceecccccceEEEEeeecCCeeeeeecccccCccccccceeEEEECCcEEEEeCCeEEEeeCC Q lcl|NC_019442. 1 MPYIDITTMRGMMPRVVTSMLPEHSAVLAEDCHFRFGVITPERQISGVEKTFTIKPKTIFHYRDDFWFAWPDVVDVIRSP 80 (541) Q Consensus 1 m~~i~i~~f~G~~Pr~~p~llp~~~a~~a~N~~~~~G~l~P~~~~~~v~~~~~~~~~Tif~~~~~~W~~w~~~V~vv~sp 80 (541) |++|+|++|+||+||++||||||++||+|+||||++|+|+|+|+|++++++++..+||||||+++|||+|+++||||||| T Consensus 27 M~~i~i~~f~Ge~Prl~p~lLP~~~a~~A~n~~~~~G~itP~~~~~~~~~~~~~~~~Tif~y~~~~W~~w~~~V~~ir~P 106 (567) T protein:vir:10 27 MPYIDITTMRGMMPRVVTSMLPEHSAVLAEDCHFRFGVITPERQISGVEKTFTIKPKTIFHYRDDFWFAWPDVVDVIRSP 106 (567) T ss_pred eeEEeecccccccccchhhhccccccceEEeeeccCCeeeeeecccccccccccCceeeEEEcCcEEEEeCCceeeccCc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccCCCCeEEEeCCCCcceeecceeeccccCCccceeeecCCCCCccceEEecCCCCCCCCCCCcccceEEEEEEEecCC Q lcl|NC_019442. 81 IAQDPHGRIYYTDGRFPKVTDATIATKGDGNHPASSYSLGIPAPTTAPVCTVQQGGDVSDDNPNDDETRFYTETFVSDYG 160 (541) Q Consensus 81 ia~D~~~Rvy~t~~~~pk~t~~~ia~~g~g~~p~a~y~LGVp~P~~~pv~~v~~~~~~~~~~~~~~~ty~Yv~T~V~~~G 160 (541) ||||+|||||||||++||+|+++||++|+|+||.+||+||||+|+++|++++++++++++++++|+++|+|+|||||++| T Consensus 107 vAqD~~~rvY~tgdg~Pk~t~~~iat~G~~~~P~~~y~LgVpaps~aP~~a~~~~~~~~~~~~~d~etr~Yv~TfVt~~G 186 (567) T protein:vir:10 107 IAQDPHGRIYYTDGRFPKVTDATIATKGDGNHPTSSYRLGIPAPTTAPVCTVQQGGDVSDDNPNDDETRFYTETFVSDYG 186 (567) T ss_pred cccCCcceEEEecCCcceeeeeeeeecCCCCCCcchhhcccCCccccceeeecCCCCCCCCCCcccceeEEEEEEEcCCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CccCccccccceeecCCCCEEEEccccCCCCccccceEEEEEeecCCCceeEEEEEeeccceEEEEecccccccCccchh Q lcl|NC_019442. 161 EEGPPGPASLEVTLRTPGTAVQLTLSPVPLQNASIKRRRIYRSASGGGEADFLLVAELDASVLSYTDKIPGKNLGPSLAT 240 (541) Q Consensus 161 eEs~Ps~~S~~vtv~~~g~~v~l~~~p~~~~~~~i~~~RIYRs~t~~~~~~~~lVael~~~~~sf~D~~~~~~L~~~L~t 240 (541) |||+|||+|.++++..+|+.|+|+++|++++++||+++|||||++|++++||+||||+++++++|+|+.++++|+++||| T Consensus 187 eES~PS~~S~~~~v~~pg~~V~ls~~p~~~~~~~i~~~RIYRS~tg~~gtdy~lVael~as~~sf~D~~~~~~lg~~Lps 266 (567) T protein:vir:10 187 EEGPPGPASLEVTLRTPGTAVQLTLAPVPLQNASIKRRRIYRSASGGGEADFLLVAELDASVLSYTDKIPAKNLGPSLAT 266 (567) T ss_pred CcCCCcccccceeeecCCceEEEeeccCCccccccceEEEEEecCCCCceeeEEEEeeccceeeeeeccchhhccccccc Confidence 99999999999999989999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhhhCCCCCcceEEeccCcEEEEEeCCEEEEecCCCcccCchhcccccCcceEEEEEcCCcEEEEEcCCEEEEEccCccc Q lcl|NC_019442. 241 WDYLPPPENMTGLCLMANGIAAGFAGNEVMFSEAYLPYAWPEVNRHTTAEDIVAICPLGTSLVVATKGEPYLFSGVSPST 320 (541) Q Consensus 241 ~~~~~pP~~~~gL~~m~NGi~a~f~Gn~l~fSep~~P~awp~~y~~t~~~~Iv~ia~v~~~lvV~T~~~py~l~G~~p~s 320 (541) ++|++||++|+|||+|+|||||||.||+|||||||+|||||++|++++++||||||+++++|||+|+|+||+++|++|++ T Consensus 267 ~~w~~PP~~m~GL~~m~NGimAgF~GneV~FsEpylPyAWP~~Yr~t~~~dIVaiA~~gt~LVV~TkG~PYl~sG~sP~s 346 (567) T protein:vir:10 267 WDYLPPPENMTGLCLMANGIAAGFAGNEVMFSEAYLPYAWPEVNRHTTAEDIVAICPLGTSLVVATKGEPYLFSGVSPST 346 (567) T ss_pred ccccCcCcccceeeecccceEEeecCCEEEEecCCCCcccchhhccCCCCCeEEEeecccEEEEEEcCceEEEEcCChhh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ceEEeecccccccccchheeCCccEEEecCCcEEEEeCCCceEEEecccCChhHhhhhcCcceEEEEEEcCeEEEEEecC Q lcl|NC_019442. 321 ISGSRIPSMQACLSRRSMVAMEGFVLYAGTNGLVSVDVNGNTALATEKIISPEQWQSQFNPASIVAYSWRGEYIACYTKP 400 (541) Q Consensus 321 ~~~~~l~~~~pCvs~rsiv~~~~~v~y~s~dGLv~~~~~G~~~~vT~~~~~~~~W~~~l~P~ti~a~~~eG~Y~~~y~~~ 400 (541) |+++||+++|||+|+||||.++++|+|||+||||+++++|+++++|++||+|||||++|||+||+|++|||+||+||++. T Consensus 347 ms~~kL~~~qpCvS~rsiV~~~g~v~Yas~dGLv~i~a~G~a~vvT~~l~t~~qW~a~~~P~ti~A~~~eG~Y~a~Y~~~ 426 (567) T protein:vir:10 347 ISGSKIPSMQACLSRRSMVAMEGFVLYAGTNGLVSVDANGNVALATEQIVSPEQWQSQFNPASIVAYPWRGEYIACYTKP 426 (567) T ss_pred ccccccccccccccccceeEeccEEEeecCCcEEEEecCCchhhhhhhccChHHHHhcCCcceEEEEeecCeEEEEEecC Confidence 99999999999999999999999999999999999999999999999999999999889999999999999999999999 Q ss_pred CCccceEEEccCCceeEEEeecccEEEEEecCCEEEEEECCEEEEecCCCCceeEEEEcceEEeCcccceeEEEEeeCCC Q lcl|NC_019442. 401 DGKQDVFVFSPVNMDIRYLSTPFDCAWVDLAKDMMRVVTGDKMSVLAGGSLPSTIRWHSKIFSLPERTSFSCIRVKSPAP 480 (541) Q Consensus 401 ~g~~~~~i~d~~~~~~~~~~~~~d~~~~~~~~d~LY~~~g~~i~~~~~g~~~~~~~WrSk~f~~~~~~~~~~~~V~~~~~ 480 (541) +|++++|||||+++++++++++|||+|+|.++|+||+++|++||+|++|+++++++||||+|++|+++||+|+||++.++ T Consensus 427 ~g~~~~fifdp~~~~~~~i~~~~~~~~~d~~~d~Ly~~~~~~l~~~~~g~~~~~~~WrSK~f~~p~~~sf~~~rV~s~~~ 506 (567) T protein:vir:10 427 DGKQDVFVFSPVNMDIRYLSTPFDCAWVDLAKDMMRVVTGDKMSVLAGGALPSTIRWHSKIFSLPERTSFSCIRVKSPAP 506 (567) T ss_pred CCCcceEEEcccccEEEEEecCceeEEEEeecCeEEEeeCCEEeeecCCCCceeEEEecceEEecCccceeEEEEeccCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccEEEEEEECCceeEeecccccCCcceEccCcccceEEEEEEecceEEEEEeecchhhcCC Q lcl|NC_019442. 481 ERVGITIMADDVPVIHFAPGTFKGSVVRLPAATGQNWQVMVSGFGQVERITLSTSMSEMPV 541 (541) Q Consensus 481 ~~~~v~~~~d~~~~~~~~~~~~~~~~~rLP~~~~~~w~iei~g~~~V~~i~la~s~~EL~~ 541 (541) ++++|++++||..++++++++++++|+|||++++|+|||||+|+++|+||+||+||+|||| T Consensus 507 ~~v~i~~~~dg~~v~~~~~g~~~~~~~rlp~~~ar~Weveisg~~~V~~v~LA~S~~EL~~ 567 (567) T protein:vir:10 507 ERVGITIMADDVPVIHFAPGTFKGSVVRLPAATGQNWQVMVSGFGQVERITLSTSMSEMPV 567 (567) T ss_pred cceeEEEEEcCCceeecCCccccCceeecCCcccceEEEEEEecccEEEEEEecchhhcCC Confidence 9999999999999999999999999999999999999999999999999999999999999 No 5 >protein:vir:3306 Length: 567 # NCBI annotation: hypothetical protein # Family: family:all:1544 # MgeID: mge:66 # MgeName: 933W # Cross-refs: genbank:acc:NP_049522;genbank:gi:9632528;genbank:GeneID:1262016 Probab=100.00 E-value=1.6e-254 Score=1412.02 Aligned_cols=541 Identities=98% Similarity=1.520 Sum_probs=537.3 Q ss_pred CceEEecccccccccccceecccccceEEEEeeecCCeeeeeecccccCccccccceeEEEECCcEEEEeCCeEEEeeCC Q lcl|NC_019442. 1 MPYIDITTMRGMMPRVVTSMLPEHSAVLAEDCHFRFGVITPERQISGVEKTFTIKPKTIFHYRDDFWFAWPDVVDVIRSP 80 (541) Q Consensus 1 m~~i~i~~f~G~~Pr~~p~llp~~~a~~a~N~~~~~G~l~P~~~~~~v~~~~~~~~~Tif~~~~~~W~~w~~~V~vv~sp 80 (541) |++|+|++|+||+||++||||||++||+|+||||++|+|+|+|+|++++++++..+||||||+++|||+|+++||||||| T Consensus 27 M~~i~i~~f~Ge~Prl~p~lLP~~~a~~A~n~~~~~G~itP~~~~~~~~~~~~~~~~Tif~y~~~~W~~w~~~V~~ir~P 106 (567) T protein:vir:33 27 MPYIDITTMRGMMPRVVTSMLPEHSAVLAEDCHFRFGVITPERQISGVEKTFTIKPKTIFHYRDDFWFAWPDVVDVIRSP 106 (567) T ss_pred eeEEeecccccccccchhhhccccccceEEeeeccCCeeeeeecccccccccccCceeeEEEcCcEEEEeCCceeeccCc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccCCCCeEEEeCCCCcceeecceeeccccCCccceeeecCCCCCccceEEecCCCCCCCCCCCcccceEEEEEEEecCC Q lcl|NC_019442. 81 IAQDPHGRIYYTDGRFPKVTDATIATKGDGNHPASSYSLGIPAPTTAPVCTVQQGGDVSDDNPNDDETRFYTETFVSDYG 160 (541) Q Consensus 81 ia~D~~~Rvy~t~~~~pk~t~~~ia~~g~g~~p~a~y~LGVp~P~~~pv~~v~~~~~~~~~~~~~~~ty~Yv~T~V~~~G 160 (541) ||||+|||||||||++||+|+++||++|+|+||.+||+||||+|+++|++++++++++++++++|+++|+|+|||||++| T Consensus 107 vAqD~~~rvY~tgdg~Pk~t~~~iat~G~~~~P~~~y~LgVpaps~aP~~a~~~~~~~~~~~~~d~etr~Yv~TfVt~~G 186 (567) T protein:vir:33 107 IAQDPHGRIYYTDGRFPKVTDATIATKGDGNHPTSSYRLGIPAPTTAPVCTVQQGGDVSDDNPNDDETRFYTETFVSDYG 186 (567) T ss_pred cccCCcceEEEecCCcceeeeeeeeecCCCCCCcchhhcccCCccccceeeecCCCCCCCCCCcccceeEEEEEEEcCCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CccCccccccceeecCCCCEEEEccccCCCCccccceEEEEEeecCCCceeEEEEEeeccceEEEEecccccccCccchh Q lcl|NC_019442. 161 EEGPPGPASLEVTLRTPGTAVQLTLSPVPLQNASIKRRRIYRSASGGGEADFLLVAELDASVLSYTDKIPGKNLGPSLAT 240 (541) Q Consensus 161 eEs~Ps~~S~~vtv~~~g~~v~l~~~p~~~~~~~i~~~RIYRs~t~~~~~~~~lVael~~~~~sf~D~~~~~~L~~~L~t 240 (541) |||+|||+|.++++..+|+.|+|+++|++++++||+++|||||++|++++||+||||+++++++|+|+.++++|+++||| T Consensus 187 eES~PS~~S~~~~v~~pg~~V~ls~~p~~~~~~~i~~~RIYRS~tg~~gtdy~lVael~as~~sf~D~~~~~~lg~~Lps 266 (567) T protein:vir:33 187 EEGPPGPASLEVTLRTPGTAVQLTLAPVPLQNASIKRRRIYRSASGGGEADFLLVAELDASVLSYTDKIPAKNLGPSLAT 266 (567) T ss_pred CcCCCcccccceeeecCCceEEEeeccCCccccccceEEEEEecCCCCceeeEEEEeeccceeeeeeccchhhccccccc Confidence 99999999999999989999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhhhCCCCCcceEEeccCcEEEEEeCCEEEEecCCCcccCchhcccccCcceEEEEEcCCcEEEEEcCCEEEEEccCccc Q lcl|NC_019442. 241 WDYLPPPENMTGLCLMANGIAAGFAGNEVMFSEAYLPYAWPEVNRHTTAEDIVAICPLGTSLVVATKGEPYLFSGVSPST 320 (541) Q Consensus 241 ~~~~~pP~~~~gL~~m~NGi~a~f~Gn~l~fSep~~P~awp~~y~~t~~~~Iv~ia~v~~~lvV~T~~~py~l~G~~p~s 320 (541) ++|++||++|+|||+|+|||||||.||+|||||||+|||||++|++++++||||||+++++|||+|+|+||+++|++|++ T Consensus 267 ~~w~~PP~~m~GL~~m~NGimAgF~GneV~FsEpylPyAWP~~Yr~t~~~dIVaiA~~gt~LVV~TkG~PYl~sG~sP~s 346 (567) T protein:vir:33 267 WDYLPPPENMTGLCLMANGIAAGFAGNEVMFSEAYLPYAWPEVNRHTTAEDIVAICPLGTSLVVATKGEPYLFSGVSPST 346 (567) T ss_pred ccccCcCcccceeeecccceEEeecCCEEEEecCCCCcccchhhccCCCCCeEEEeecccEEEEEEcCceEEEEcCChhh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ceEEeecccccccccchheeCCccEEEecCCcEEEEeCCCceEEEecccCChhHhhhhcCcceEEEEEEcCeEEEEEecC Q lcl|NC_019442. 321 ISGSRIPSMQACLSRRSMVAMEGFVLYAGTNGLVSVDVNGNTALATEKIISPEQWQSQFNPASIVAYSWRGEYIACYTKP 400 (541) Q Consensus 321 ~~~~~l~~~~pCvs~rsiv~~~~~v~y~s~dGLv~~~~~G~~~~vT~~~~~~~~W~~~l~P~ti~a~~~eG~Y~~~y~~~ 400 (541) |+++||+++|||+|+||||.++++|+|||+||||+++++|+++++|++||+|||||++|||+||+|++|||+||+||++. T Consensus 347 ms~~kL~~~qpCvS~rsiV~~~g~v~Yas~dGLv~i~a~G~a~vvT~~l~t~~qW~a~~~P~ti~A~~~eG~Y~a~Y~~~ 426 (567) T protein:vir:33 347 ISGSKIPSMQACLSRRSMVAMEGFVLYAGTNGLVSVDANGNVALATEQIVSPEQWQSQFNPASIVAYPWRGEYIACYTKP 426 (567) T ss_pred ccccccccccccccccceeEeccEEEeecCCcEEEEecCCchhhhhhhccChHHHHhcCCcceEEEEeecCeEEEEEecC Confidence 99999999999999999999999999999999999999999999999999999999889999999999999999999999 Q ss_pred CCccceEEEccCCceeEEEeecccEEEEEecCCEEEEEECCEEEEecCCCCceeEEEEcceEEeCcccceeEEEEeeCCC Q lcl|NC_019442. 401 DGKQDVFVFSPVNMDIRYLSTPFDCAWVDLAKDMMRVVTGDKMSVLAGGSLPSTIRWHSKIFSLPERTSFSCIRVKSPAP 480 (541) Q Consensus 401 ~g~~~~~i~d~~~~~~~~~~~~~d~~~~~~~~d~LY~~~g~~i~~~~~g~~~~~~~WrSk~f~~~~~~~~~~~~V~~~~~ 480 (541) +|++++|||||+++++++++++|||+|+|.++|+||+++|++||+|++|+++++++||||+|++|+++||+|+||++.++ T Consensus 427 ~g~~~~fifdp~~~~~~~i~~~~~~~~~d~~~d~Ly~~~~~~l~~~~~g~~~~~~~WrSK~f~~p~~~sf~~~rV~s~~~ 506 (567) T protein:vir:33 427 DGKQDVFVFSPVNMDIRYLSTPFDCAWVDLAKDMMRVVTGDKMSVLAGGALPSTIRWHSKIFSLPERTSFSCIRVKSPAP 506 (567) T ss_pred CCCcceEEEcccccEEEEEecCceeEEEEeecCeEEEeeCCEEeeecCCCCceeEEEecceEEecCccceeEEEEeccCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccEEEEEEECCceeEeecccccCCcceEccCcccceEEEEEEecceEEEEEeecchhhcCC Q lcl|NC_019442. 481 ERVGITIMADDVPVIHFAPGTFKGSVVRLPAATGQNWQVMVSGFGQVERITLSTSMSEMPV 541 (541) Q Consensus 481 ~~~~v~~~~d~~~~~~~~~~~~~~~~~rLP~~~~~~w~iei~g~~~V~~i~la~s~~EL~~ 541 (541) ++++|++++||..++++++++++++|+|||++++|+|||||+|+++|+||+||+||+|||| T Consensus 507 ~~v~i~~~~dg~~v~~~~~g~~~~~~~rlp~~~ar~Weveisg~~~V~~v~LA~S~~EL~~ 567 (567) T protein:vir:33 507 ERVGITIMADDVPVIHFAPGTFKGSVVRLPAATGQNWQVMVSGFGQVERITLSTSMSEMPV 567 (567) T ss_pred cceeEEEEEcCCceeecCCccccCceeecCCcccceEEEEEEecccEEEEEEecchhhcCC Confidence 9999999999999999999999999999999999999999999999999999999999999 No 6 >protein:vir:104388 Length: 566 # NCBI annotation: hypothetical protein # Family: family:all:1544 # MgeID: mge:1471 # MgeName: 86 # Cross-refs: genbank:acc:YP_794072;genbank:gi:116222017;genbank:GeneID:4397450 Probab=100.00 E-value=6.8e-254 Score=1408.54 Aligned_cols=541 Identities=87% Similarity=1.409 Sum_probs=536.7 Q ss_pred CceEEecccccccccccceecccccceEEEEeeecCCeeeeeecccccCccccccceeEEEECCcEEEEeCCeEEEeeCC Q lcl|NC_019442. 1 MPYIDITTMRGMMPRVVTSMLPEHSAVLAEDCHFRFGVITPERQISGVEKTFTIKPKTIFHYRDDFWFAWPDVVDVIRSP 80 (541) Q Consensus 1 m~~i~i~~f~G~~Pr~~p~llp~~~a~~a~N~~~~~G~l~P~~~~~~v~~~~~~~~~Tif~~~~~~W~~w~~~V~vv~sp 80 (541) |++|+|++|+||+||++||||||++||+|+||||++|+|+|+|+|++++++++.++||||||+++|||+|+++||||||| T Consensus 26 M~~i~i~~f~Ge~Pr~~p~lLP~~~a~~A~n~~~~~G~itP~~~~~~~~~~~~~~~kTif~y~~~~W~~w~~~V~~ir~P 105 (566) T protein:vir:10 26 MPYIDITTMRGMMPRVVTSMLPDHSAVLAEDCHFRFGVITPERQISGVEKTFTIKPKTIFHYRDDFWFAWPDVVDVIRSP 105 (566) T ss_pred eeEEeecccccccccchhhhccccccceEEeeeecCCeeeeeecccccccccccCceeeeeecCcEeEEeCCceeeccCc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccCCCCeEEEeCCCCcceeecceeeccccCCccceeeecCCCCCccceEEecCCCCCCCCCCCcccceEEEEEEEecCC Q lcl|NC_019442. 81 IAQDPHGRIYYTDGRFPKVTDATIATKGDGNHPASSYSLGIPAPTTAPVCTVQQGGDVSDDNPNDDETRFYTETFVSDYG 160 (541) Q Consensus 81 ia~D~~~Rvy~t~~~~pk~t~~~ia~~g~g~~p~a~y~LGVp~P~~~pv~~v~~~~~~~~~~~~~~~ty~Yv~T~V~~~G 160 (541) ||||+|||||||||++||||.++|||+|+|+||+++|+||||+|+++|++++.+++++.+++++++++|.|++||||++| T Consensus 106 vAqD~~~rvY~tg~~~Pk~t~~diAt~g~~~~pa~~y~LgVPaPs~apv~~~~~~sg~~~~~~~d~~tr~Yv~TfVt~~G 185 (566) T protein:vir:10 106 VAQDNYGRIYYTDGKFPKVTAAEIATKGEGNFPAASYRLGIPAPTTAPVCTVQKGEGATDENPNDDETRFYTETFVSAYG 185 (566) T ss_pred cccCCcceEEEeeCCcceeeecceeeccccccccccccccCCCCcccceeeccCCCcccCCCCcccceeEEEEEEEcCCC Confidence 99999999999999999999999999999999999999999999999999999888888999999999999999999999 Q ss_pred CccCccccccceeecCCCCEEEEccccCCCCccccceEEEEEeecCCCceeEEEEEeeccceEEEEecccccccCccchh Q lcl|NC_019442. 161 EEGPPGPASLEVTLRTPGTAVQLTLSPVPLQNASIKRRRIYRSASGGGEADFLLVAELDASVLSYTDKIPGKNLGPSLAT 240 (541) Q Consensus 161 eEs~Ps~~S~~vtv~~~g~~v~l~~~p~~~~~~~i~~~RIYRs~t~~~~~~~~lVael~~~~~sf~D~~~~~~L~~~L~t 240 (541) |||+|||+|.++++.++|++|+|++++++++++||++||||||++|++++||+||+|+++++++|+|++++++|+++||| T Consensus 186 eES~PS~~S~~v~v~~~gs~V~ltl~~~p~~~~~i~~~RIYRS~tg~~gtdy~lVael~as~~sf~Dd~~~~~lg~~Lps 265 (566) T protein:vir:10 186 EEGPPGPESLEVTVGIPDTPVQLTLSPVPLQDANINRRRIYRSVSGGGEADFLLVAELEASVLSYTDNIPAKNLGPSLAT 265 (566) T ss_pred CcCCCccccceeEecCCCceEEEEecCCCcCcCCceeEEEEEecCCCCceeEEEEeeecccceeeeccccccccCccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhhhCCCCCcceEEeccCcEEEEEeCCEEEEecCCCcccCchhcccccCcceEEEEEcCCcEEEEEcCCEEEEEccCccc Q lcl|NC_019442. 241 WDYLPPPENMTGLCLMANGIAAGFAGNEVMFSEAYLPYAWPEVNRHTTAEDIVAICPLGTSLVVATKGEPYLFSGVSPST 320 (541) Q Consensus 241 ~~~~~pP~~~~gL~~m~NGi~a~f~Gn~l~fSep~~P~awp~~y~~t~~~~Iv~ia~v~~~lvV~T~~~py~l~G~~p~s 320 (541) ++|++||++|+|||+|+|||||||.||+|||||||+|||||++|++++++||||||+++++|||+|+|+||+++|++|++ T Consensus 266 ~~w~~PP~~m~GL~~m~NGimAgF~GneV~FsEpylPyAWP~~Yr~t~~~dIVaiA~~gt~LVV~TkG~PYl~sG~sP~s 345 (566) T protein:vir:10 266 WDYLPPPENMTGLCLMANGIAAGFAGNEVMFSEAYLPYAWPEVNRHTTAEDIVAVCPLGTSLVVATKGEPYLFSGVSPST 345 (566) T ss_pred ccccCcCcccceeeecccceEEeecCCEEEEecCCCCcccchhhccCCCCCeEEEEeccceEEEEEcCceEEEEcCChhh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ceEEeecccccccccchheeCCccEEEecCCcEEEEeCCCceEEEecccCChhHhhhhcCcceEEEEEEcCeEEEEEecC Q lcl|NC_019442. 321 ISGSRIPSMQACLSRRSMVAMEGFVLYAGTNGLVSVDVNGNTALATEKIISPEQWQSQFNPASIVAYSWRGEYIACYTKP 400 (541) Q Consensus 321 ~~~~~l~~~~pCvs~rsiv~~~~~v~y~s~dGLv~~~~~G~~~~vT~~~~~~~~W~~~l~P~ti~a~~~eG~Y~~~y~~~ 400 (541) |+++||+++|||+|+||||.++++|+|||+||||+++++|+++++|++||+|||||++|||+||+|++|||+||+||++. T Consensus 346 ms~~kL~~~qaCvS~rsiV~~~g~v~Yas~dGLv~v~a~g~a~vvT~~l~t~~qW~~~~~P~ti~A~~~eG~Y~a~Y~~~ 425 (566) T protein:vir:10 346 ISGSKIPSMQACLSRQSMVAMEGFVLYAGTNGLVSVDANGNAALATEQIISPEQWQTQFNPASIVAYPWRGEYIACYTKP 425 (566) T ss_pred ccccccccccccccccceeeecceEEeecCCceEEEecCCChhhhhhhhcChhHHHhcCCcceEEEEeecCeEEEEEeCC Confidence 99999999999999999999999999999999999999999999999999999999889999999999999999999999 Q ss_pred CCccceEEEccCCceeEEEeecccEEEEEecCCEEEEEECCEEEEecCCCCceeEEEEcceEEeCcccceeEEEEeeCCC Q lcl|NC_019442. 401 DGKQDVFVFSPVNMDIRYLSTPFDCAWVDLAKDMMRVVTGDKMSVLAGGSLPSTIRWHSKIFSLPERTSFSCIRVKSPAP 480 (541) Q Consensus 401 ~g~~~~~i~d~~~~~~~~~~~~~d~~~~~~~~d~LY~~~g~~i~~~~~g~~~~~~~WrSk~f~~~~~~~~~~~~V~~~~~ 480 (541) +|++++|||||.++++++++++|||.|+|..+|+||+++|++||||++|+++++++||||+|.+|+++||+|+||+++++ T Consensus 426 ~g~~~~fi~dp~g~~i~~l~~~~d~~~~d~~~d~ly~~~g~~i~~~~~g~~~~~~~WrSK~f~~p~~~sf~~~rV~s~~~ 505 (566) T protein:vir:10 426 DGEKDVFVFNPAGMDIRHLSTPFDCACVDLVNDVMRVVSGQNMSAMAGGRLPSLIRWHSKVFSLPERTSFSCLRVKSPTP 505 (566) T ss_pred CCCccEEEEcccCceEEEeccccceeEEeeccCeeeeeeCCeeeeecCCCCCceEEEecceEEecCCcceeEEEeecCCc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccEEEEEEECCceeEeecccccCCcceEccCcccceEEEEEEecceEEEEEeecchhhcCC Q lcl|NC_019442. 481 ERVGITIMADDVPVIHFAPGTFKGSVVRLPAATGQNWQVMVSGFGQVERITLSTSMSEMPV 541 (541) Q Consensus 481 ~~~~v~~~~d~~~~~~~~~~~~~~~~~rLP~~~~~~w~iei~g~~~V~~i~la~s~~EL~~ 541 (541) ++++|++++||..+.++++++++++++|||++++|+|||||+|+++|+||+||+||+|||| T Consensus 506 ~~v~i~i~ad~~~v~~~a~G~~~~~~~rLp~~~~~~Wevevsg~~~V~~v~La~S~~EL~~ 566 (566) T protein:vir:10 506 ERVGITVLADDVPVIHLAPGSLSGSVVRLPAATGQNWQVLVSGFGQVERITLSTSMSELPI 566 (566) T ss_pred cceeEEEEECCEEEEEeCCCccccceeecCCCccceEEEEEEecccEEEEEEecchhhcCC Confidence 9999999999999999999999999999999999999999999999999999999999999 No 7 >protein:vir:93631 Length: 580 # NCBI annotation: Bcep22gp67 # Family: family:all:1544 # MgeID: mge:1470 # MgeName: Bcep22 # Cross-refs: genbank:acc:NP_944296;genbank:gi:38640373;genbank:GeneID:2658280 Probab=100.00 E-value=3.8e-220 Score=1223.50 Aligned_cols=514 Identities=28% Similarity=0.507 Sum_probs=481.4 Q ss_pred CceEEecccccccccccceecccccceEEEEeeecCCeeeeeecc---cccCccccccceeEEEECCcEEEEeCCeEEEe Q lcl|NC_019442. 1 MPYIDITTMRGMMPRVVTSMLPEHSAVLAEDCHFRFGVITPERQI---SGVEKTFTIKPKTIFHYRDDFWFAWPDVVDVI 77 (541) Q Consensus 1 m~~i~i~~f~G~~Pr~~p~llp~~~a~~a~N~~~~~G~l~P~~~~---~~v~~~~~~~~~Tif~~~~~~W~~w~~~V~vv 77 (541) |++|+|++|+||+||++||||||++||+|+||||++|+|+|+|+| .++.++++.++|||||+ +++||+|+++|||| T Consensus 1 M~~i~i~~f~Ge~Prl~p~lLP~~~a~~a~n~~~~~G~i~P~~~~~~~~~~~~i~~~~~~t~~~~-~~~W~~w~~~V~~i 79 (580) T protein:vir:93 1 MTIIKITGFSGEIPRLVPRLLPDTAAQNATNARLESGGLTPYRKPKFITRISTIPAGQIETIYRN-GETWMAWDKPVYAA 79 (580) T ss_pred CeeEeecccccccccchhhhccccccceEEeeeccCCeeeeeeCchhhccccccCcCcceEEEec-CceeEEeCCceeee Confidence 999999999999999999999999999999999999999999999 56788888899999986 77999999999999 Q ss_pred eCCcccCCCCeEEEeCCCCcceeecceeeccccCCccceeeecCCCCCccceEEecCCCCCCCCCCCcccceEEEEEEEe Q lcl|NC_019442. 78 RSPIAQDPHGRIYYTDGRFPKVTDATIATKGDGNHPASSYSLGIPAPTTAPVCTVQQGGDVSDDNPNDDETRFYTETFVS 157 (541) Q Consensus 78 ~spia~D~~~Rvy~t~~~~pk~t~~~ia~~g~g~~p~a~y~LGVp~P~~~pv~~v~~~~~~~~~~~~~~~ty~Yv~T~V~ 157 (541) ||||||| |||||||++||+|.. +++|+||||+|+++|++++.++ +.+++++|.|+||||| T Consensus 80 ~~PvA~D---Rvy~Td~g~Pkvt~~-----------g~sy~lgVpaPs~Apt~~~~g~------g~l~~~~y~Yv~TfVt 139 (580) T protein:vir:93 80 PGPVAAD---RLYVMGDGAPKMIVG-----------GTTYPLAVPMPSAALTAATSGT------GTGDVFSRVYVYTFVT 139 (580) T ss_pred cCccccc---eeEEcCCcccceecC-----------CccccccCCCcccCceeeecCC------CCcCccceEEEEEEEc Confidence 9999999 999999999999753 7889999999999999876543 3577899999999999 Q ss_pred cCCCccCccccccceeecCCCCEEEEccccCCCCccccceEEEEEeecCCCceeEEEEEeeccceEEEEecccccccCcc Q lcl|NC_019442. 158 DYGEEGPPGPASLEVTLRTPGTAVQLTLSPVPLQNASIKRRRIYRSASGGGEADFLLVAELDASVLSYTDKIPGKNLGPS 237 (541) Q Consensus 158 ~~GeEs~Ps~~S~~vtv~~~g~~v~l~~~p~~~~~~~i~~~RIYRs~t~~~~~~~~lVael~~~~~sf~D~~~~~~L~~~ 237 (541) ++||||+||++|..++++ +|++|+|+++|+++++++|+++|||||++|+++++|+||||+++++++|+|+.++++|+++ T Consensus 140 ~~GeES~PS~~S~~vtv~-~g~tVtLs~~p~p~~~~~i~~~RIYRS~tG~~gtdy~lVAel~Ag~~sF~Dd~s~a~Lge~ 218 (580) T protein:vir:93 140 GFGEESEPSAISNEVNWQ-AGQTVTLSGFQAAPAGRNITKQRIYRSQTSLSGTDLYFIAERDASAANFVDNVPLSDQNEP 218 (580) T ss_pred CCCCcCCCcccccceeeC-CCCeEEEEecCCCCCCCccceEEEEEeccCCCceeEEEEeeeccceeeeeecccccccccc Confidence 999999999999999997 6889999999999999999999999999999999999999999999999999999999999 Q ss_pred chhhhhhCCCCCcceEEeccCcEEEEEeCCEEEEecCCCcccCchhcccccCcceEEEEEcCCcEEEEEcCCEEEEEccC Q lcl|NC_019442. 238 LATWDYLPPPENMTGLCLMANGIAAGFAGNEVMFSEAYLPYAWPEVNRHTTAEDIVAICPLGTSLVVATKGEPYLFSGVS 317 (541) Q Consensus 238 L~t~~~~~pP~~~~gL~~m~NGi~a~f~Gn~l~fSep~~P~awp~~y~~t~~~~Iv~ia~v~~~lvV~T~~~py~l~G~~ 317 (541) |||++|++||++|+|||+|||||||+|.||+|||||||+|||||++|++++++|||||++++++|||+|+|+||+++|++ T Consensus 219 Lps~~~~~PP~~m~gL~~m~nGi~agF~Gnev~fsEpy~P~AWP~~yr~t~~~~Ivaia~~g~~LvV~T~g~pyl~~G~~ 298 (580) T protein:vir:93 219 LPSLEWNAPPDDLTGLISLPNGMMAAFRGKELWLCEPWRPHAWPQKYVLTMDYNIVALGAYGTTIVVATDGQPYIVSGAS 298 (580) T ss_pred cchhhccCcCCCcceEEeeccceEEEEeCCEEEEecCCCCccchhhcCCCCCCCceeEeeeCceEEEEEcCceEEEEccC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccceEEeecccccccccchheeCCccEEEecCCcEEEEeCCCceEEEecccCChhHhhhhcCcceEEEEEEcCeEEEEE Q lcl|NC_019442. 318 PSTISGSRIPSMQACLSRRSMVAMEGFVLYAGTNGLVSVDVNGNTALATEKIISPEQWQSQFNPASIVAYSWRGEYIACY 397 (541) Q Consensus 318 p~s~~~~~l~~~~pCvs~rsiv~~~~~v~y~s~dGLv~~~~~G~~~~vT~~~~~~~~W~~~l~P~ti~a~~~eG~Y~~~y 397 (541) |++|+++||+++|||+|+||||+++++|+|+|+||||+++++| ++++|++||+|||||+ |||+||+|++|||+||+|| T Consensus 299 P~~ms~~kL~~~q~CvS~rsiV~~~~~v~Yas~dGLv~i~~~g-a~vvT~~l~t~~qW~~-~~P~ti~a~~~eG~Y~a~Y 376 (580) T protein:vir:93 299 PDAMSQEKLELNLPCINARGLVDLGYAIAYPSHDGLVVASSSG-ARVVTDQLMTRNDWLK-TAPGRFVSGQFFGRYLASY 376 (580) T ss_pred hhhccccccccccccccccceeecCceEEeecCCcEEEEeCCh-HHHHHhhccChhHHHh-cCCceEEEEeecCeEEEEE Confidence 9999999999999999999999999999999999999999998 8999999999999996 9999999999999999999 Q ss_pred ecCC----CccceEEEccCCce--eEEEeecccEEEEEecCCEEEEEECCEEEEecCCC-CceeEEEEcceEEeCcccce Q lcl|NC_019442. 398 TKPD----GKQDVFVFSPVNMD--IRYLSTPFDCAWVDLAKDMMRVVTGDKMSVLAGGS-LPSTIRWHSKIFSLPERTSF 470 (541) Q Consensus 398 ~~~~----g~~~~~i~d~~~~~--~~~~~~~~d~~~~~~~~d~LY~~~g~~i~~~~~g~-~~~~~~WrSk~f~~~~~~~~ 470 (541) ++.+ ++++.||||++.++ +++++..+||+|+|+++|+||+++|++||||++|+ ++++++||||+|++|+++|| T Consensus 377 ~~~~~~~~~~~g~fi~d~~~~~~~~~~~~~~~d~~~~d~~~d~Ly~~~~~~i~~~~~~~~~~~~~~WrSK~f~~~~~~sf 456 (580) T protein:vir:93 377 EYIDPAGTARRGSFIIDLTGQEAFLHRTNYKADATFYDITEGKLYLCIGQDIYEWDALDSENEILVWRSKQYVVQKPTNF 456 (580) T ss_pred cccccccccccceEEEecCCCcceeEEeccccceeeeeccCCeEEEEeCCEEEEEcCCCCCcceEEEecceEEecCCcCc Confidence 9766 67899999997776 77888889999999999999999999999999986 46689999999999999999 Q ss_pred eEEEEeeCCC------------------------------------------------------ccEEEEEEECCceeEe Q lcl|NC_019442. 471 SCIRVKSPAP------------------------------------------------------ERVGITIMADDVPVIH 496 (541) Q Consensus 471 ~~~~V~~~~~------------------------------------------------------~~~~v~~~~d~~~~~~ 496 (541) +|+||++++. ..+.+++++||..+.. T Consensus 457 ~~~rV~s~~~~~~~~~~a~~~~~~~~~a~n~~~~~~~~~~~~~~~~~v~~~~i~gd~~~~~~~~~~~~~~~~adG~~~~t 536 (580) T protein:vir:93 457 GVILIEGSVLMTPEEEAAEQAAIDAAKAHNDSIFGDASIGGELNGAALNVYPIDGDALVRIESSRFVAATVYADGKAVAT 536 (580) T ss_pred eEEEEeeccccchhhhhhhhhhhhhhhhhhhhcccccccccccccccceeeeeccccccccccccceEEEEeeCCeEEEE Confidence 9999986631 1244667778865533 Q ss_pred ecccccCCcceEccCc-ccceEEEEEEecceEEEEEeecchhhcCC Q lcl|NC_019442. 497 FAPGTFKGSVVRLPAA-TGQNWQVMVSGFGQVERITLSTSMSEMPV 541 (541) Q Consensus 497 ~~~~~~~~~~~rLP~~-~~~~w~iei~g~~~V~~i~la~s~~EL~~ 541 (541) .. ..++++|||++ ++++|||||+|+++|++|+||+||+||.- T Consensus 537 ~~---~~~~~~RLPag~~a~~Wev~vsg~~~V~~v~la~s~~EL~~ 579 (580) T protein:vir:93 537 VS---KLNRMCRLPSGFLAQTWEVEVSANADIAQVTLAGTGAELAG 579 (580) T ss_pred Ee---cCCceEEccCCccccEEEEEEEeccceeEEEEecChHHHhc Confidence 32 35699999965 99999999999999999999999999998 No 8 >protein:vir:5120 Length: 615 # NCBI annotation: unknown # Family: family:all:1544 # MgeID: mge:114 # MgeName: PBC5 # Cross-refs: genbank:acc:NP_542277;genbank:gi:18071220;genbank:GeneID:929342 Probab=100.00 E-value=4.7e-219 Score=1217.55 Aligned_cols=515 Identities=30% Similarity=0.527 Sum_probs=472.4 Q ss_pred CceEEecccccccccccceecccccceEEEEeeecCCeeeeeecccccCccccccceeEEEECCcEEEEeCCeEEEeeCC Q lcl|NC_019442. 1 MPYIDITTMRGMMPRVVTSMLPEHSAVLAEDCHFRFGVITPERQISGVEKTFTIKPKTIFHYRDDFWFAWPDVVDVIRSP 80 (541) Q Consensus 1 m~~i~i~~f~G~~Pr~~p~llp~~~a~~a~N~~~~~G~l~P~~~~~~v~~~~~~~~~Tif~~~~~~W~~w~~~V~vv~sp 80 (541) |+.|+|++|+||+||++|||||+++||+|+||||++|+|+|+|+|+++++.+++.++|||||.++ ||+|+++||||||| T Consensus 30 M~~I~i~~f~Ge~Prl~P~lLP~~~A~~A~N~~~~~G~ltP~~~~~~~~~~~~~~~~Tif~~~~~-W~~w~~~V~av~sP 108 (615) T protein:vir:51 30 MVAIKISAFAGEQPMLLPRLLPETGATAAMNVRLNDGGLTPINKPIEVATIATASQKTIYRHQGS-WLSWPNVVNAVPGP 108 (615) T ss_pred eEEEeecccccccccchhhhccCcccceEEeeeecCCeeeeecCcccccccccccceeeeeecCc-eeccCCceeEccCC Confidence 99999999999999999999999999999999999999999999999999999899999999655 99999999999999 Q ss_pred cccCCCCeEEEeCCCCcceeecceeeccccCCccceeeecCCCCCccceEEecCCCCCCCCCCCcccceEEEEEEEecCC Q lcl|NC_019442. 81 IAQDPHGRIYYTDGRFPKVTDATIATKGDGNHPASSYSLGIPAPTTAPVCTVQQGGDVSDDNPNDDETRFYTETFVSDYG 160 (541) Q Consensus 81 ia~D~~~Rvy~t~~~~pk~t~~~ia~~g~g~~p~a~y~LGVp~P~~~pv~~v~~~~~~~~~~~~~~~ty~Yv~T~V~~~G 160 (541) |||| |||||||++|||+ +|+.+|+||||+|+++|++++.+++ ++++++|.|+|||||++| T Consensus 109 vA~D---Rvy~tgdg~Pkv~-----------~~~~sY~LgVpaPs~ap~~~~~g~g------~~d~etr~Yv~TfVt~~G 168 (615) T protein:vir:51 109 VAQD---RLYFTGDGAPKVK-----------IGGVDYALKVPRPTGALTAALSGTG------SGDIQSRTYVYTWVTSFG 168 (615) T ss_pred cccc---eeEEcCCCcceEe-----------ecccCccccccCCCccceEEecCCC------CccccceEEEEEEEcCCC Confidence 9999 9999999999987 4689999999999999998876643 457899999999999999 Q ss_pred CccCccccccceeecCCCCEEEEccccCCCCccccceEEEEEeecCCCceeEEEEEeeccceEEEEecccccccCccchh Q lcl|NC_019442. 161 EEGPPGPASLEVTLRTPGTAVQLTLSPVPLQNASIKRRRIYRSASGGGEADFLLVAELDASVLSYTDKIPGKNLGPSLAT 240 (541) Q Consensus 161 eEs~Ps~~S~~vtv~~~g~~v~l~~~p~~~~~~~i~~~RIYRs~t~~~~~~~~lVael~~~~~sf~D~~~~~~L~~~L~t 240 (541) |||+|||+|..++++ +|++|+|+++|++++++||+++|||||++|++++||+|||||++++++|+|+++.++|+++||| T Consensus 169 eES~PSp~S~~v~v~-~g~tVtLs~~pa~~~~~~i~~rRIYRS~tg~~gtdy~lVAel~as~~sf~D~~~~~~Lg~~Lps 247 (615) T protein:vir:51 169 EESAPCPASIIVDWK-PGQTVTLSGFAATPGGRSITTQRIYRSQTGKTGTGLYLIAERAASAGNFTDNIAVDQFQEPLPS 247 (615) T ss_pred CcCCCCccceeeEec-CCCeEEEeeccCCcCCCceeeEEEEEeccCCCceeeEEEeeecccceeeeeccchhhcCccccc Confidence 999999999999997 6889999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhhhCCCCCcceEEeccCcEEEEEeCCEEEEecCCCcccCchhcccccCcceEEEEEcCCcEEEEEcCCEEEEEccCccc Q lcl|NC_019442. 241 WDYLPPPENMTGLCLMANGIAAGFAGNEVMFSEAYLPYAWPEVNRHTTAEDIVAICPLGTSLVVATKGEPYLFSGVSPST 320 (541) Q Consensus 241 ~~~~~pP~~~~gL~~m~NGi~a~f~Gn~l~fSep~~P~awp~~y~~t~~~~Iv~ia~v~~~lvV~T~~~py~l~G~~p~s 320 (541) ++|++||++|+|||+|||||||||.||+|||||||+|||||++|++++++||||||+++++|||+|+|+||+++|++|++ T Consensus 248 ~~w~~PP~~l~GL~~m~NGimAgF~GneV~FsEpy~PyAWP~~Yr~t~d~dIVaiA~~gt~LVV~TkG~PYl~sG~sP~s 327 (615) T protein:vir:51 248 ADWNEPPDGLAGLAEMPNGMMAAFVGRSIYFCEPYRPHAWPEKYSRNVGSDIVGIAALGSILVVVTKGKPYLLAGTHPDS 327 (615) T ss_pred ccccCcCcchhhhhccccceEEeecCCEEEEecCCCCcccchhcccCcCCCeeEEEecccEEEEEEcCceEEEEcCChhh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ceEEeecccccccccchheeCCccEEEecCCcEEEEeCCCceEEEecccCChhHhhhhcCcceEEEEEEcCeEEEEEecC Q lcl|NC_019442. 321 ISGSRIPSMQACLSRRSMVAMEGFVLYAGTNGLVSVDVNGNTALATEKIISPEQWQSQFNPASIVAYSWRGEYIACYTKP 400 (541) Q Consensus 321 ~~~~~l~~~~pCvs~rsiv~~~~~v~y~s~dGLv~~~~~G~~~~vT~~~~~~~~W~~~l~P~ti~a~~~eG~Y~~~y~~~ 400 (541) |+++||+++|||+|+||||.++++|+|||+||||+++++|++++||++||+|+|||+ |||+||+|++|||+||+||++. T Consensus 328 ms~~kL~~~qpCvS~rsiV~~~~~v~Yas~dGLV~v~~~G~a~vvT~~l~t~~qW~~-l~P~ti~a~~~eG~Y~~~Y~~~ 406 (615) T protein:vir:51 328 MQQQQLEENLPCINARSIVDLGHAVCYASNDGLVAVRGDGSIRLVTEQLLSREKWLD-LSPFTIIGGQINGAYLLFYDNL 406 (615) T ss_pred ccccccccccccccccceeEecceEEeecCCceEEEecCCchhhhhhhccChhHHHh-cCCceEEEEeecCeEEEEeccC Confidence 999999999999999999999999999999999999999999999999999999996 9999999999999999999987 Q ss_pred CCcc---ceEEE-ccCCce-eEEEeecccEEEEEecCCEEEEEE-C-CEEEEecC-CCCceeEEEEcceEEeCcccceeE Q lcl|NC_019442. 401 DGKQ---DVFVF-SPVNMD-IRYLSTPFDCAWVDLAKDMMRVVT-G-DKMSVLAG-GSLPSTIRWHSKIFSLPERTSFSC 472 (541) Q Consensus 401 ~g~~---~~~i~-d~~~~~-~~~~~~~~d~~~~~~~~d~LY~~~-g-~~i~~~~~-g~~~~~~~WrSk~f~~~~~~~~~~ 472 (541) ++.+ .++|+ +..++. +.+....++|.|+|..+|+||++. | +.|++|++ +.++++++||||+|.+++++||+| T Consensus 407 ~~~g~~~~g~~~~~~~~~~f~ir~~~~~~~~~~d~~~~~Ly~l~~g~~~i~~~~a~~g~~~~~~WrSK~F~~~~p~sf~~ 486 (615) T protein:vir:51 407 SASGERIAGSISIYVDGQPFLVRSSEIASSSFFDIGDTALYFMAPGSKTIQRFDAPQGAPQTLYWRSKEFITTSPSSMGA 486 (615) T ss_pred CCCcceeeeeeEEecCCceeEEEeecccceeeeEecCceEEEEEcCCceEEEEecCCCCcceEEecCceEEccCCCcceE Confidence 7763 23443 333333 234466789999999999999765 4 67999998 556999999999999999999999 Q ss_pred EEEeeCCCccE----------------EEEEEEC--------------------------------------------Cc Q lcl|NC_019442. 473 IRVKSPAPERV----------------GITIMAD--------------------------------------------DV 492 (541) Q Consensus 473 ~~V~~~~~~~~----------------~v~~~~d--------------------------------------------~~ 492 (541) ++|++++..++ .+++++| |. T Consensus 487 ~~V~~~~~~~~~e~~~~~~~~~~~~aa~~ti~a~g~~~~~l~~~~l~~~~i~gd~~~~ip~~~t~~~~~~v~~~l~a~G~ 566 (615) T protein:vir:51 487 VLVDSGSAISLKALEALQEERNQIIAANAALFAAGDLQGGINARPLNDRSINGDDLQPVPPPPTAADAASLTVSIFADGK 566 (615) T ss_pred EEEcCCcccchhhhhhhhhhhhhccccceeEEeccccccccccccccccccCcccccccccccccccccceeEEEecCCc Confidence 99998766544 2444444 32 Q ss_pred eeEeecccccCCcceEccCc-ccceEEEEEEecceEEEEEeecchhhcCC Q lcl|NC_019442. 493 PVIHFAPGTFKGSVVRLPAA-TGQNWQVMVSGFGQVERITLSTSMSEMPV 541 (541) Q Consensus 493 ~~~~~~~~~~~~~~~rLP~~-~~~~w~iei~g~~~V~~i~la~s~~EL~~ 541 (541) .+ + + .++.|+++|||.+ ++++|||||+|+++|++|+||+||+||.- T Consensus 567 ~~-~-t-~~k~~~~~RLP~g~~ar~Wevevsg~~~V~~v~LA~S~~EL~~ 613 (615) T protein:vir:51 567 LI-Q-T-IDKVDRIARVRAGLKARKWEVAISTNMQIAQVIMAASVEELKQ 613 (615) T ss_pred ee-e-e-eccCCceeEcccCcccceEEEEEEecccEEEEEEecChHHHHh Confidence 22 1 1 1245799999986 89999999999999999999999999998 No 9 >protein:vir:105563 Length: 396 # NCBI annotation: hypothetical protein # Family: family:all:27455 # MgeID: mge:1540 # MgeName: F116 # Cross-refs: genbank:acc:YP_164316;genbank:gi:56692963;genbank:GeneID:3197174 Probab=100.00 E-value=8.6e-102 Score=574.66 Aligned_cols=351 Identities=16% Similarity=0.125 Sum_probs=306.6 Q ss_pred CceEEecccccccccccceecc--------------------cccce-EEEEeeecCCeeeeeecccccCccccccceeE Q lcl|NC_019442. 1 MPYIDITTMRGMMPRVVTSMLP--------------------EHSAV-LAEDCHFRFGVITPERQISGVEKTFTIKPKTI 59 (541) Q Consensus 1 m~~i~i~~f~G~~Pr~~p~llp--------------------~~~a~-~a~N~~~~~G~l~P~~~~~~v~~~~~~~~~Ti 59 (541) |+...|.-|.|.+.-..-+-|+ +.++| .+-|+++.+|.|+|+.+.-.-+.++..+++|| T Consensus 1 ~~~~~~~~~~ginnv~~e~~l~~~~~~~~~~~r~a~nvdi~~~G~~~~r~~~tr~~~g~l~~~~~~~~~~~~~~~~~~tl 80 (396) T protein:vir:10 1 MATTSLVPLAGINNVAEDAALQRGGESPRLYVRDAVNIDLSPAGKAQLRASVRQVTDQPFRQLWQSPLHGDAFGALGDQW 80 (396) T ss_pred CcceeeeeeecccccccccccccCCCcccceeeeeeeecccCCCchhhhccCcccCCceecccccCccccceeeeCCceE Confidence 9999999999877665555444 35666 77889999999999998656678899999999 Q ss_pred EEECCcEEEEeCCeEEEeeCCcccCCCC-eEEEeCCCCcceeecceeeccccCCccceeeecCCCCCccceEEecCCCCC Q lcl|NC_019442. 60 FHYRDDFWFAWPDVVDVIRSPIAQDPHG-RIYYTDGRFPKVTDATIATKGDGNHPASSYSLGIPAPTTAPVCTVQQGGDV 138 (541) Q Consensus 60 f~~~~~~W~~w~~~V~vv~spia~D~~~-Rvy~t~~~~pk~t~~~ia~~g~g~~p~a~y~LGVp~P~~~pv~~v~~~~~~ 138 (541) |+|.++.|+.|. .|+++++||++|+++ ||||||++.||++. |+.+||||||+|+++++++. T Consensus 81 ~~~~~~~w~~~~-~v~v~~~pva~d~~~~Rvy~t~~~~p~~~~-----------~~~~y~L~vp~P~~a~~~a~------ 142 (396) T protein:vir:10 81 GKVDPHSWTFEP-LAQIGEGDLSHEVLNNRVCVAGTAGIFTYD-----------GAQAERLTLDTPAPPLLVAG------ 142 (396) T ss_pred EEEeCCeEEEEe-eeeeccCchhccccCCeEEEEcCCCceeee-----------CCcceecCcCCCcccccccc------ Confidence 999999999997 589999999999997 99999999999864 48999999999998877753 Q ss_pred CCCCCCcccceEEEEEEEecCCCccCccccccceeecCCCCEEEEccccCCCCccccceEEEEEeecCCCceeEEEEEee Q lcl|NC_019442. 139 SDDNPNDDETRFYTETFVSDYGEEGPPGPASLEVTLRTPGTAVQLTLSPVPLQNASIKRRRIYRSASGGGEADFLLVAEL 218 (541) Q Consensus 139 ~~~~~~~~~ty~Yv~T~V~~~GeEs~Ps~~S~~vtv~~~g~~v~l~~~p~~~~~~~i~~~RIYRs~t~~~~~~~~lVael 218 (541) .+++++++|+|++||||.+||||+|+++|.+++. +++. +|++++ +.+.||+++||||| +.++++|+|++|+ T Consensus 143 --~Gsl~~~~~~Y~~t~V~~~gEEs~p~~~S~~v~~-~gg~--~vtl~~--~~~~~i~~~RiYrS--~~~G~~~~l~aE~ 213 (396) T protein:vir:10 143 --AGSLSQGTYGAAVAWLRGPQESAPSLIAFAEVTD-AGAL--EVTFPL--CLDASVTGARLYLT--RANGGELLLAGDY 213 (396) T ss_pred --cCccCCceEEEEEEEEecCCCcCcccccccccCC-CCCc--EEEEEc--ccCCCcceEEEEEe--CCChhhhhheehh Confidence 3456778999999999999999999999999983 3454 445443 34678999999997 5555689999999 Q ss_pred ccceEEEEecccccccCccchhhhhhCCCCCcceEEeccCc-EEEEE-------eCCEEEEecCCCcccCch-hcccccC Q lcl|NC_019442. 219 DASVLSYTDKIPGKNLGPSLATWDYLPPPENMTGLCLMANG-IAAGF-------AGNEVMFSEAYLPYAWPE-VNRHTTA 289 (541) Q Consensus 219 ~~~~~sf~D~~~~~~L~~~L~t~~~~~pP~~~~gL~~m~NG-i~a~f-------~Gn~l~fSep~~P~awp~-~y~~t~~ 289 (541) ++++++| +||+.+|++||++|++|+.|||| ++|+| +||+|||||||+||+|++ +++++++ T Consensus 214 ~a~~~s~-----------vlPs~~w~gpP~~~~gL~pmP~G~~~A~faGRi~~A~Gn~V~FSEp~~Ph~~~~~~~~~~~~ 282 (396) T protein:vir:10 214 PLGAATV-----------ILPTLPELGRPAQFRHLSPMPTGKHLAYWRGRLLIARANVLRFSEALAYHLHDERYGFVQMP 282 (396) T ss_pred ccceeee-----------eeecCCCCCCCccccccccCchhHhhhhhcceEEEEeCCEEEEecCCCCceecchhccCCCC Confidence 9999998 57889999999999999999999 66666 569999999999999887 4789999 Q ss_pred cceEEEEEcCCcEEEEEcCCEEEEEccCcccceEEeeccccc-----------ccccchheeCCccEEEecCCcEEEEeC Q lcl|NC_019442. 290 EDIVAICPLGTSLVVATKGEPYLFSGVSPSTISGSRIPSMQA-----------CLSRRSMVAMEGFVLYAGTNGLVSVDV 358 (541) Q Consensus 290 ~~Iv~ia~v~~~lvV~T~~~py~l~G~~p~s~~~~~l~~~~p-----------Cvs~rsiv~~~~~v~y~s~dGLv~~~~ 358 (541) +||++|++++++|||+|+++|||++|++|++|+++|+...+| |+++||++.+++.++|+|+||||++.+ T Consensus 283 ~~Iv~lapv~~gL~Vgt~~~~y~~~G~dP~sms~~~l~~~~pvp~S~v~~p~~~~s~rs~~~~~~~~lwas~dGl~~g~~ 362 (396) T protein:vir:10 283 QRITFVQPVDGGIWVGQVDHVAFLDGADPASLSVSRRASRAPVPGSAVLVPAEVVGTNASPDGSPVAVWLAENGYVMGTS 362 (396) T ss_pred CceEEEEEecCeEEEEEcCcEEEEEcCChhHcceeecccCCCcccchhcccchhhhcccccccCcEEEEccCCcEEEEcC Confidence 999999999999999999999999999999999999965544 479999999999999999999999999 Q ss_pred CCceEEEecccCChhHhhhhcCcceEEEEEE---cCeEEEEEe Q lcl|NC_019442. 359 NGNTALATEKIISPEQWQSQFNPASIVAYSW---RGEYIACYT 398 (541) Q Consensus 359 ~G~~~~vT~~~~~~~~W~~~l~P~ti~a~~~---eG~Y~~~y~ 398 (541) +|+++++|+++ ++|.+.+|+++ |.||++||. T Consensus 363 ~G~v~~l~~~~---------i~p~~~~A~~~~~~drRy~~~~~ 396 (396) T protein:vir:10 363 SGAIAEVHAGV---------LAGITGRAGTSVVFDRRLLTAVS 396 (396) T ss_pred Cceeeeecccc---------cCCCcccceEEEeecCeEEEEeC Confidence 99999998854 67778888887 999999996 No 10 >protein:vir:8837 Length: 513 # NCBI annotation: constituent protein # Family: family:all:4957 # MgeID: mge:158 # MgeName: PaP3 # Cross-refs: genbank:acc:NP_775245;genbank:gi:27476043;genbank:GeneID:2700591 Probab=99.40 E-value=3.4e-11 Score=78.00 Aligned_cols=419 Identities=13% Similarity=0.165 Sum_probs=214.2 Q ss_pred CceEEecccccccccccceecccccceEEEEeeecCCeeeeeecccccCccccccceeEEEECCcEEEEeCCeEEEeeCC Q lcl|NC_019442. 1 MPYIDITTMRGMMPRVVTSMLPEHSAVLAEDCHFRFGVITPERQISGVEKTFTIKPKTIFHYRDDFWFAWPDVVDVIRSP 80 (541) Q Consensus 1 m~~i~i~~f~G~~Pr~~p~llp~~~a~~a~N~~~~~G~l~P~~~~~~v~~~~~~~~~Tif~~~~~~W~~w~~~V~vv~sp 80 (541) |.-+++.+=-|-+.-++|-.||.++-+.|.|..+..|.++|....+.|...+...++-+|.|.++ ...+.++-+- T Consensus 3 ~~~~~~~~~~g~~~d~~p~~lp~~a~s~~~N~~~~~~~~~~~~g~~pv~a~~~~~~~g~~~~~~~-----g~~~~~~~~~ 77 (513) T protein:vir:88 3 LERQEVKNPTGIVTDIAPADLPLDKWSFGNNVRFKNGKAQKALGHSPIFDTAQAPILDMFPFIRN-----NIPYWLLCSE 77 (513) T ss_pred cCChhhcccccceeccChhhcCCCcceeeeeeeEecceeeecCccceeeecCCCCceeeeeeecC-----CCeEEEEeec Confidence 78888888899999999999999999999999999999999988888744455555666655322 1122222111 Q ss_pred cccCCCCeEEEeCCCCcceeecceeeccccCCccceeeecCCCCCccceEEecCCCCCCCCCCCcccceEEEEEEEecCC Q lcl|NC_019442. 81 IAQDPHGRIYYTDGRFPKVTDATIATKGDGNHPASSYSLGIPAPTTAPVCTVQQGGDVSDDNPNDDETRFYTETFVSDYG 160 (541) Q Consensus 81 ia~D~~~Rvy~t~~~~pk~t~~~ia~~g~g~~p~a~y~LGVp~P~~~pv~~v~~~~~~~~~~~~~~~ty~Yv~T~V~~~G 160 (541) +++|.-++. .+++.+ . +.| ..+. +.|-+..+| T Consensus 78 ------~~~~~~~~~--t~~dvs--~---~~~-----~~~~-------------------------~~~w~~~~f----- 109 (513) T protein:vir:88 78 ------KRLYLADGT--TIIDVS--P---GPY-----SASV-------------------------TNRWSVGSF----- 109 (513) T ss_pred ------eEEEEecCc--eeeecc--c---cce-----eecc-------------------------cCceeeeee----- Confidence 233332210 011100 0 000 0000 000000000 Q ss_pred CccCccccccceeecCCCCEEEEccccCCCCccccceEEEEEeecCCCceeEEEEEeeccceEEEEecccccccCccchh Q lcl|NC_019442. 161 EEGPPGPASLEVTLRTPGTAVQLTLSPVPLQNASIKRRRIYRSASGGGEADFLLVAELDASVLSYTDKIPGKNLGPSLAT 240 (541) Q Consensus 161 eEs~Ps~~S~~vtv~~~g~~v~l~~~p~~~~~~~i~~~RIYRs~t~~~~~~~~lVael~~~~~sf~D~~~~~~L~~~L~t 240 (541) +|..+- +... .+...+. ...++|+|- +. T Consensus 110 ----------------~~~i~a-~ng~--------~~~q~~~-----------------~~s~~f~dl----------~g 137 (513) T protein:vir:88 110 ----------------NGVIFA-NDGV--------NPPHHLP-----------------PTESVFRVL----------PN 137 (513) T ss_pred ----------------cCEEEE-EcCC--------CcceEEc-----------------CCCceeeec----------cC Confidence 011010 0000 0001110 000122111 00 Q ss_pred hhhhCCCCCcceEEeccCcEEEEEe--------CCEEEEecCCCcccCchhc----------cccc---CcceEEEEEcC Q lcl|NC_019442. 241 WDYLPPPENMTGLCLMANGIAAGFA--------GNEVMFSEAYLPYAWPEVN----------RHTT---AEDIVAICPLG 299 (541) Q Consensus 241 ~~~~~pP~~~~gL~~m~NGi~a~f~--------Gn~l~fSep~~P~awp~~y----------~~t~---~~~Iv~ia~v~ 299 (541) | ++.-..+ ++...++++.+.. -|.|++|...-|.-+|..+ ++.+ ...||..++.+ T Consensus 138 --~-p~~~~a~-~i~v~~~flv~~~~t~~~~~~PnrV~wS~~~D~~~~P~~W~~t~~t~~a~~~~l~d~~g~~v~g~~~g 213 (513) T protein:vir:88 138 --F-PANTTFR-RLKSFKNFLIGLNVTSNSIEMPQMVWWSTSADAGGVPASWDPTDPTKDAGQNTLADTNGAIVDGVKLR 213 (513) T ss_pred --C-CcccceE-EEEEEeeEEEEeecccCcCCCCceEEEecccCCcccccccccccccCcccccccCCCccceeeeeecc Confidence 0 0000121 2233344444331 3779999888764433332 3332 36799999999 Q ss_pred CcEEEEEcCCEEEEE-ccCcccceEEeecccccccccchheeCCccEEEecCCcEEEEeCCCceEEEecccCChhHhhhh Q lcl|NC_019442. 300 TSLVVATKGEPYLFS-GVSPSTISGSRIPSMQACLSRRSMVAMEGFVLYAGTNGLVSVDVNGNTALATEKIISPEQWQSQ 378 (541) Q Consensus 300 ~~lvV~T~~~py~l~-G~~p~s~~~~~l~~~~pCvs~rsiv~~~~~v~y~s~dGLv~~~~~G~~~~vT~~~~~~~~W~~~ 378 (541) ..++|.++...|.++ +-+|-.++..++..+-+|++++||+..++.++|++++|++++. +++.+-|.++.+.+.=+. + T Consensus 214 ~~liif~e~~i~~m~y~g~~~if~~~~i~~~~G~~~p~SI~~~~~~~ffls~~Gf~~~~-G~~~~~Ig~ekVdk~f~~-~ 291 (513) T protein:vir:88 214 DSFIIYKEDSVYSMRYIGGLYIFQFQQLFNDVGILGPNCAIEFDGNHFVVGHGDVYVHN-GVQKQSVIDAQVRKFFFS-D 291 (513) T ss_pred cceEEEecccEEEEEecCCCceEEEEeecccccccCCceeEEECCeEEEEeCCceEEec-Cceeeecccchhhhhhhc-c Confidence 999999999999997 6667788999999999999999999999999999999999885 555667777667665664 4 Q ss_pred cCcce---EEEEE--EcCeEEEEEecCCCc-----cceEEEccCCceeEEEeec------------------------cc Q lcl|NC_019442. 379 FNPAS---IVAYS--WRGEYIACYTKPDGK-----QDVFVFSPVNMDIRYLSTP------------------------FD 424 (541) Q Consensus 379 l~P~t---i~a~~--~eG~Y~~~y~~~~g~-----~~~~i~d~~~~~~~~~~~~------------------------~d 424 (541) +|+.. |.+.. ..-+|+-+|...++. ...+|||...+...-..++ || T Consensus 292 ~n~~~~~~~~~~~d~~~~~v~~~y~s~~~~~~~~~~~~lVYd~~~~~Ws~~~~p~~~~g~~g~~~~~~~~~~~~~~~~~d 371 (513) T protein:vir:88 292 INPDNYQRTFVLADHVNTEMWVCYSSTRSEPGKHCDRAIIWNWKENTWSIRDLPNVLSGAYGIIDPKTSNLWDDDSNPWD 371 (513) T ss_pred CCcccceEEEEEEcCcccEEEEEecCCCCCCCcccceEEEEEccCCeEEEEeccchhhcccccccccccceecccccccc Confidence 67666 33332 234567777765553 2458999887765322211 11 Q ss_pred EE----EE---EecCCEEEEEE--CCEEEEecCCC----CceeEEEEcceEEeCcccceeEEEEe----eCCCccEEEEE Q lcl|NC_019442. 425 CA----WV---DLAKDMMRVVT--GDKMSVLAGGS----LPSTIRWHSKIFSLPERTSFSCIRVK----SPAPERVGITI 487 (541) Q Consensus 425 ~~----~~---~~~~d~LY~~~--g~~i~~~~~g~----~~~~~~WrSk~f~~~~~~~~~~~~V~----~~~~~~~~v~~ 487 (541) .. .. +...-.||+.. ++.++.|+.+. .++..+-.|..+.+..+-.+.-++-. ..+ ..+.+.+ T Consensus 372 ~~~~~~~~~~~~~~~~sl~~~~~~~~~~~~fd~~~~f~G~~lea~~~t~~~~~~~~~~~~~i~~v~~~~t~~-g~~t~~v 450 (513) T protein:vir:88 372 TDTSVWGEGSYNPAKSSMIFTSFQDAKLFLFGETSTFSGQSFTSTLERSDIYLGDDRMMKTVSAVIPHITGN-GVCNIWV 450 (513) T ss_pred cchhhhhccccccccceeEeeeccCCceeeecccccccCCceEEEEEecCccccCchhheeeeeeeeeeecc-eEEEEEE Confidence 11 01 11123456643 45566664322 25566667777777665554333221 111 1122222 Q ss_pred EECCc---eeE-e----ec-------ccccCCcc----eEccCcccceEEEEEEecceEEEEEeecchh Q lcl|NC_019442. 488 MADDV---PVI-H----FA-------PGTFKGSV----VRLPAATGQNWQVMVSGFGQVERITLSTSMS 537 (541) Q Consensus 488 ~~d~~---~~~-~----~~-------~~~~~~~~----~rLP~~~~~~w~iei~g~~~V~~i~la~s~~ 537 (541) -..+. .+. + .. ..-.+.|- +|+|. ...|.+ .|.- | +++-...+- T Consensus 451 g~~~~~~~~~~~s~~~~~~~~~~~~~~~r~~gRy~~~ri~i~~--~~~w~~--~G~~-v-e~~~~~g~R 513 (513) T protein:vir:88 451 GNAQVQGSGIRWKGPYPYRIGQDYKIDTKHVGRYIALKFDFAS--AGDWYF--NGYT-L-EMAPKAGMR 513 (513) T ss_pred eeeccCccccccccceeeecccCceEEeccCCceEEEEEEccC--CCceEE--eeEE-E-EEecCCCCC Confidence 11110 000 0 00 00001111 23342 222321 1110 0 000000000 No 11 >protein:vir:3133 Length: 911 # NCBI annotation: hypothetical protein # Family: family:all:5234 # MgeID: mge:64 # MgeName: VpV262 # Cross-refs: genbank:acc:NP_640315;genbank:gi:21234408;genbank:GeneID:956056 Probab=99.00 E-value=1.4e-09 Score=69.14 Aligned_cols=436 Identities=13% Similarity=0.196 Sum_probs=184.1 Q ss_pred CceEEeccc----------cc----ccccccceecccccceEEEEeeecCCe----eeeeecccccCccccccceeEE-E Q lcl|NC_019442. 1 MPYIDITTM----------RG----MMPRVVTSMLPEHSAVLAEDCHFRFGV----ITPERQISGVEKTFTIKPKTIF-H 61 (541) Q Consensus 1 m~~i~i~~f----------~G----~~Pr~~p~llp~~~a~~a~N~~~~~G~----l~P~~~~~~v~~~~~~~~~Tif-~ 61 (541) -+.||+.+. -| ..||+.|-|+.-.-.- +.|. .+|+..-..... -..|+|.= . T Consensus 121 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-------~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~ 191 (911) T protein:vir:31 121 EAGIKLDGVIDSPVHISVGVGFAIITNPRIEPVLIKLDDVD-------DEGVPTLSYEPLTLLIRTRE--LLTPYTTGTN 191 (911) T ss_pred ccCceeeeeecCceeEEeeceEEEeecCccceEEEEeeccC-------ccCcccccccceeeEeeehh--hccccccccc Confidence 111221110 11 3688888887543211 1111 112111000000 00111100 1 Q ss_pred EC-----CcEEEE----eCCeEEEeeCCcccCCCCeEEEeCCCCcceeecceeeccccCCccceeeecCCCCCccceEEe Q lcl|NC_019442. 62 YR-----DDFWFA----WPDVVDVIRSPIAQDPHGRIYYTDGRFPKVTDATIATKGDGNHPASSYSLGIPAPTTAPVCTV 132 (541) Q Consensus 62 ~~-----~~~W~~----w~~~V~vv~spia~D~~~Rvy~t~~~~pk~t~~~ia~~g~g~~p~a~y~LGVp~P~~~pv~~v 132 (541) |. ...|-- |.....+-+--- --+.||..-- +-..---|+||+-+ T Consensus 192 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~--------~~~~~~~~~~~~~~---------------- 244 (911) T protein:vir:31 192 YGDTLTPEEEWNLYNSGWATITRATKDKS---GSGTVYVNPV--------QYYFDKRGVYPSHS---------------- 244 (911) T ss_pred cCcccCchhhcccccccceeeeeecccCC---ccceEEEchh--------heeecccCcCcchh---------------- Confidence 11 112322 111111111000 0012222100 00111123443332 Q ss_pred cCCCCCCCCCCCcccceEEEEEEEecCCCccCccccccceeecCC--CCEEEEccccCCCCccccceEEEEEeecCCCce Q lcl|NC_019442. 133 QQGGDVSDDNPNDDETRFYTETFVSDYGEEGPPGPASLEVTLRTP--GTAVQLTLSPVPLQNASIKRRRIYRSASGGGEA 210 (541) Q Consensus 133 ~~~~~~~~~~~~~~~ty~Yv~T~V~~~GeEs~Ps~~S~~vtv~~~--g~~v~l~~~p~~~~~~~i~~~RIYRs~t~~~~~ 210 (541) ...++..+||+---+ .+.+-++ ...+...-..+| -++-| |---|- T Consensus 245 ---------------------~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~-~~~~~-~~~~~~-------- 291 (911) T protein:vir:31 245 ---------------------VLYNSMKQESAKEIV--ALNVFSPWADEKINFGTTTPP-LGRYI-HSAYYF-------- 291 (911) T ss_pred ---------------------hhhhhhhhhccceeE--EEeeeccccccccccccCCCc-hhhhh-hhheee-------- Confidence 011222334422211 1111111 112232222222 22212 111110 Q ss_pred eEEEEEeeccceEEEEecccccccCccchhhhhhCCCCCcceEEeccCcEEEE----------------Ee--------- Q lcl|NC_019442. 211 DFLLVAELDASVLSYTDKIPGKNLGPSLATWDYLPPPENMTGLCLMANGIAAG----------------FA--------- 265 (541) Q Consensus 211 ~~~lVael~~~~~sf~D~~~~~~L~~~L~t~~~~~pP~~~~gL~~m~NGi~a~----------------f~--------- 265 (541) --.|-|..+..+.+--.++-.-.-.=|+.+-...|-+++.|-.+.|..+++ |- T Consensus 292 --~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~e~~np~gl~~igt~~n~k~~a~~~~~~~~~~r~r~~~~yaGRVfyaD~ 369 (911) T protein:vir:31 292 --DSAAILSLGIGNLTPPTSDGTTEGSGPAEEEISNPIGLDNIGTVNNLKLIAEGTVRWTVKDRPRCSGYHNGHVYFGDR 369 (911) T ss_pred --ccceeeeecccccCCCCCCCccCCCCCchhhhcCCCCcccccchhceeeeeccceeeeecccccceeeeccEEEEeee Confidence 001111222222222222211111234555556666666666555555555 21 Q ss_pred ----CCEEEEec-----CCCcccC--------chhc-------ccccC--cceEEEEEcCCcEEEEEcCCEEEEEccCcc Q lcl|NC_019442. 266 ----GNEVMFSE-----AYLPYAW--------PEVN-------RHTTA--EDIVAICPLGTSLVVATKGEPYLFSGVSPS 319 (541) Q Consensus 266 ----Gn~l~fSe-----p~~P~aw--------p~~y-------~~t~~--~~Iv~ia~v~~~lvV~T~~~py~l~G~~p~ 319 (541) ++.|+||. .+.|+-+ -+.- ++..+ ++|+.|..++++|.|++++.+|.+.|+++- T Consensus 370 dkngk~rIlFSqLv~sl~di~nCYQdaDPTSeee~DLIdTDGg~vri~gah~Ii~LV~~G~sLlVFcaNGVWAI~G~d~~ 449 (911) T protein:vir:31 370 DKNGKTRILVSQLVNSLDNIPKCFQDADPTAEEINDLIATDGFTMYPVGMGAPITMVEFNKRLLLLCTNGVWAIRGTSGG 449 (911) T ss_pred ccCcceeEEEEeeccccccccccccCCCccccccchhhhcCCcEEecCCCCCceEEEEecCeEEEEEeCcEEEEeccCCC Confidence 35888883 2333322 1111 11121 779999999999999999999999998876 Q ss_pred -----cceEEeecccccccccchheeCCccEEEecCCcEEEEeCCCceEEEecccCChhHhhhhcC--cce-E--EEEEE Q lcl|NC_019442. 320 -----TISGSRIPSMQACLSRRSMVAMEGFVLYAGTNGLVSVDVNGNTALATEKIISPEQWQSQFN--PAS-I--VAYSW 389 (541) Q Consensus 320 -----s~~~~~l~~~~pCvs~rsiv~~~~~v~y~s~dGLv~~~~~G~~~~vT~~~~~~~~W~~~l~--P~t-i--~a~~~ 389 (541) +..+.|+.. -+|.+.+|+|++|+.++|.|.+|+|.++.+ +..-.|++.+|..+.|.-++ |.+ | +.+.| T Consensus 450 g~TATdy~ItKIsd-vGcsspNSVVvVgn~i~fWSd~GIyaLgan-qfnD~tAnNLTesTIQ~y~d~I~~dkIkNVtgty 527 (911) T protein:vir:31 450 GATATDFTLDKVAS-VEFNSPQSVVDIGTAIVFWSERGIIAIGVN-DFGDLTSNNLTENTIDEYYDSLDRDIIKNVKGTF 527 (911) T ss_pred ceeeeeeEEEEEee-eeeCCCCeEEEecCceEEeeCCcEEEEeec-ccCccccccccHHHHHHHHhhcChhhhceEEEEE Confidence 457778766 599999999999999999999999999998 58889999999655554333 344 3 35666 Q ss_pred ---cCeEEEEEec-CCCc----cc---eEEEccCCceeEEEeecccEEEEEecCCEEEEEECCEEEEecCCCCceeEEEE Q lcl|NC_019442. 390 ---RGEYIACYTK-PDGK----QD---VFVFSPVNMDIRYLSTPFDCAWVDLAKDMMRVVTGDKMSVLAGGSLPSTIRWH 458 (541) Q Consensus 390 ---eG~Y~~~y~~-~~g~----~~---~~i~d~~~~~~~~~~~~~d~~~~~~~~d~LY~~~g~~i~~~~~g~~~~~~~Wr 458 (541) |+||+.+|.+ .|+. .. .|+||+..+.|+..+. .+. -| ++ . +++ T Consensus 528 d~de~rVyW~yPn~lDe~teykt~~~~ILVfdLatgaFYPwtv--s~g-------pL--l~-~------------p~y-- 581 (911) T protein:vir:31 528 INDENRVYWVVPNKQDSNGEYKTDGELVLVLNLDTGGFYKHTV--SGG-------PL--LH-A------------PFR-- 581 (911) T ss_pred EccCCEEEEEecCccCCccceeecCceEEEEEeccCcccceee--ecc-------ee--ec-c------------ccc-- Confidence 9999999985 4442 12 4777877777765321 111 11 00 0 000 Q ss_pred cceEEeCcccce------eEEEEeeCCCccEEEEEEECCce------eEeecccccCCcceEccCcccceEEEEEEecce Q lcl|NC_019442. 459 SKIFSLPERTSF------SCIRVKSPAPERVGITIMADDVP------VIHFAPGTFKGSVVRLPAATGQNWQVMVSGFGQ 526 (541) Q Consensus 459 Sk~f~~~~~~~~------~~~~V~~~~~~~~~v~~~~d~~~------~~~~~~~~~~~~~~rLP~~~~~~w~iei~g~~~ 526 (541) |.-.+.++++- .++.++.. ...+.++++.|.-. +..|.+++.. +.-=+-. -|-|-+-=.|+.+ T Consensus 582 -~Lv~TreEvtvPi~~etgaiIve~g-sdPV~~tl~vdttGvDg~ayLl~frdg~~g-~~~f~a~--~~~~~~~dw~~~~ 656 (911) T protein:vir:31 582 -RLVNTRAEVSIPITETDGTVITDTL-GDPVTVTRTVTTTGVDGLAYFASFDDGVNG-QFNFIAE--HQPWGFADWANVP 656 (911) T ss_pred -ccccccccceeeEEeecceEEEecC-CCCeEEEEeeecccccceeEEEeeccCCcc-eEEEEEe--ecCCeeeccccCc Confidence 11112222221 12223322 23444444322211 1112222211 1000000 1223333334321 Q ss_pred -EEEEEeecchhh--------------cCC Q lcl|NC_019442. 527 -VERITLSTSMSE--------------MPV 541 (541) Q Consensus 527 -V~~i~la~s~~E--------------L~~ 541 (541) .+|+-- +|+-| ||. T Consensus 657 ~~~~~~y-~s~~~~~y~~~~~~~~~~~~py 685 (911) T protein:vir:31 657 NMTRVNY-SSYVDFAYEYPEVMIGNISLPY 685 (911) T ss_pred cccccch-hHHHHhhhhhhhhhhhcccCce Confidence 111111 11111 222 No 12 >protein:vir:95475 Length: 771 # NCBI annotation: hypothetical protein ORF038 # Family: family:all:5234 # MgeID: mge:1570 # MgeName: PA11 # Cross-refs: genbank:acc:YP_001294631;genbank:gi:149408197;genbank:GeneID:5237042 Probab=98.77 E-value=2e-08 Score=62.83 Aligned_cols=442 Identities=13% Similarity=0.150 Sum_probs=184.3 Q ss_pred CceEEe--------ccccc----ccccccceecccccceEEEEee----ecCCeeeeeeccccc---------Ccccccc Q lcl|NC_019442. 1 MPYIDI--------TTMRG----MMPRVVTSMLPEHSAVLAEDCH----FRFGVITPERQISGV---------EKTFTIK 55 (541) Q Consensus 1 m~~i~i--------~~f~G----~~Pr~~p~llp~~~a~~a~N~~----~~~G~l~P~~~~~~v---------~~~~~~~ 55 (541) -+.|++ .-.-| ..||+.|-+|.-+..++|.-.. -++-.+.|+---.++ ++ +.. T Consensus 117 ~a~~nlSPsh~isv~v~~G~livanp~i~~~~~~~d~~t~s~t~~~ll~r~rf~~q~~~~G~d~~~~~~~~~~gt--~~t 194 (771) T protein:vir:95 117 YQFVNMSPSHKLSYAVVDGLLVVANGSRDIYVFEYDSGSVSVTTKRLLVRDLFGVQDIVNGVDLRQGNDIATRPT--VQT 194 (771) T ss_pred eecceeccceeEEEEEeeeEEEEecCCccEEEEEecCCcceeEeeeeeeeehhhccccccccceecccccccCCc--ccC Confidence 111111 11112 1344444444443333322110 011112222211111 11 112 Q ss_pred ceeEEEECCcEEEEeCCeEEEeeCCcccCCCCeEEEeCCCCcceeecc---------eeeccccCCccc--eeeecCCCC Q lcl|NC_019442. 56 PKTIFHYRDDFWFAWPDVVDVIRSPIAQDPHGRIYYTDGRFPKVTDAT---------IATKGDGNHPAS--SYSLGIPAP 124 (541) Q Consensus 56 ~~Tif~~~~~~W~~w~~~V~vv~spia~D~~~Rvy~t~~~~pk~t~~~---------ia~~g~g~~p~a--~y~LGVp~P 124 (541) ++.||......|- .||.+..+ ...+..|.||+- .|+|.+-.- T Consensus 195 n~~iynlyN~gw~---------------------------~pk~~~~snt~~~~iV~~y~a~~g~~pS~sd~~N~a~~k~ 247 (771) T protein:vir:95 195 NAHIYNLRNQTFG---------------------------VPRVTWHSNEPSDPIVTFRSAASGKFPSNSDSVNLALSKR 247 (771) T ss_pred chhheecccccee---------------------------ccccccccCCccccceEeeeccCCCCcCCceeeccccchh Confidence 2333333333333 23322211 122233445544 355555443 Q ss_pred CccceEEec-CCC-CCCC--CCCCcccceEEEEE-EEecCCC-----c---cCccccccceeecCCCCEEEEccccCCCC Q lcl|NC_019442. 125 TTAPVCTVQ-QGG-DVSD--DNPNDDETRFYTET-FVSDYGE-----E---GPPGPASLEVTLRTPGTAVQLTLSPVPLQ 191 (541) Q Consensus 125 ~~~pv~~v~-~~~-~~~~--~~~~~~~ty~Yv~T-~V~~~Ge-----E---s~Ps~~S~~vtv~~~g~~v~l~~~p~~~~ 191 (541) +..-..+.. ..+ .... -+....-.-+|+.. |-...+. | -.||.++....+.+.+..-.+++.- . T Consensus 248 ~~~Ei~t~~~f~~~~~~~~~~Gt~~~~~G~yi~da~~~g~~~Lt~~ve~~gr~~s~~~~~~~l~~~~t~~~~~~va---e 324 (771) T protein:vir:95 248 ADVEPSTTDRFRAEDIVLNPIGTYETARGFFIIDAMARGKSRLEEIVKLKQRYPSLSFGVSSLPQDETPGGASVVC---E 324 (771) T ss_pred hccceeeecccchhhhhhcccCcccccCcceeeehhhhcccccceeeeccccchhhhccccccccccCCCCceeEE---e Confidence 333222210 000 0000 01112222355542 2111111 1 1223333333332222111111110 0 Q ss_pred ccccceEE-EEEe-----e-cCCCc---eeEEEEEee----ccceEEEEeccccc-ccCccchhhhhhCCCCCcceEEec Q lcl|NC_019442. 192 NASIKRRR-IYRS-----A-SGGGE---ADFLLVAEL----DASVLSYTDKIPGK-NLGPSLATWDYLPPPENMTGLCLM 256 (541) Q Consensus 192 ~~~i~~~R-IYRs-----~-t~~~~---~~~~lVael----~~~~~sf~D~~~~~-~L~~~L~t~~~~~pP~~~~gL~~m 256 (541) + ..| -|-- . ...++ -.+.|+..| +.-..+|.|+.+.+ ++.+. +.. T Consensus 325 y----agRvwYag~~~~~iD~dkng~~~~~~ilfSqLv~s~~di~nCyQd~DPTsee~~dL----------------idT 384 (771) T protein:vir:95 325 Y----AGRVWYAGFSGQIIDGDDQSPRLVSYILFSQLVDSPADIVNCYQDGDPTSTEEPEL----------------VDT 384 (771) T ss_pred e----eeeEEEecceeEEeeccccCCceeeeEeeehhhcchhhcccccccCCCchhhhhhh----------------hhc Confidence 0 223 2321 0 01111 233333333 22234666655432 22222 222 Q ss_pred cCcEEEEEeCCEEEEecCCCcccCchhcccccCcceEEEEEcCCcEEEEEcCCEEEE-----EccCcccceEEeeccccc Q lcl|NC_019442. 257 ANGIAAGFAGNEVMFSEAYLPYAWPEVNRHTTAEDIVAICPLGTSLVVATKGEPYLF-----SGVSPSTISGSRIPSMQA 331 (541) Q Consensus 257 ~NGi~a~f~Gn~l~fSep~~P~awp~~y~~t~~~~Iv~ia~v~~~lvV~T~~~py~l-----~G~~p~s~~~~~l~~~~p 331 (541) ..|.| +. .=.++|+.|..++..|.|+++..+|.+ .|.+--+..+.|+.. .+ T Consensus 385 DGg~i--------ri---------------~gah~ii~Lv~f~~sLlvfc~NGVWAi~ggsd~g~tAtdY~ltKIs~-vg 440 (771) T protein:vir:95 385 DGGFI--------RI---------------EGAHDIINLVNVGSAVMVVAANGIWMIQGGSDYGFTATNYLVTKISE-HG 440 (771) T ss_pred CCCEE--------Ee---------------cCCCCceeEEEecceEEEEEecceEEEEeccCCceeeeeeEEEEeee-ec Confidence 22222 11 113889999999999999999999999 444555667778776 99 Q ss_pred ccccchheeCCccEEEecCCcEEEEeCCCceEEEecccCChhHhhhhcC--cceE---EEEEE---cCeEEEEEec-CCC Q lcl|NC_019442. 332 CLSRRSMVAMEGFVLYAGTNGLVSVDVNGNTALATEKIISPEQWQSQFN--PASI---VAYSW---RGEYIACYTK-PDG 402 (541) Q Consensus 332 Cvs~rsiv~~~~~v~y~s~dGLv~~~~~G~~~~vT~~~~~~~~W~~~l~--P~ti---~a~~~---eG~Y~~~y~~-~~g 402 (541) |.|.+|+|++++.++|.|.+|++.++.+ +...+|++++|..+.|+-++ |..+ +++.| |+||+-+|.. .|+ T Consensus 441 ~sspnSvVvvg~~i~ywsdtgIyal~~N-dfn~~tAqnLTekTIq~~~~~I~~dk~knVtg~fd~~e~rvyw~yPn~~D~ 519 (771) T protein:vir:95 441 CSSPNSVVVVDNSFMYWGDDGIYHLTRN-QYGDYVANNLTEKTIQKYYEKIPSDAILNATGFYDSYDKKVKWLYNTVLDG 519 (771) T ss_pred cCCCccEEEecceEEEeeCCceEEEeec-ccCcchhhccchHHHHHHHhhcchhhhcceEEEEEccCCEEEEEecceecC Confidence 9999999999999999999999999998 48899999999766665444 4443 57777 9999999962 122 Q ss_pred cc-ce--EEEccCCceeEEEeecccEEEEEecCCEEEEEECCEEEEecCCCCceeEEEEcceEEeCcccceeEEEEeeC- Q lcl|NC_019442. 403 KQ-DV--FVFSPVNMDIRYLSTPFDCAWVDLAKDMMRVVTGDKMSVLAGGSLPSTIRWHSKIFSLPERTSFSCIRVKSP- 478 (541) Q Consensus 403 ~~-~~--~i~d~~~~~~~~~~~~~d~~~~~~~~d~LY~~~g~~i~~~~~g~~~~~~~WrSk~f~~~~~~~~~~~~V~~~- 478 (541) .. .. ++||+.-|.|...... ++..+.|=+..|..-+-+- |...+-++++-....|.+. T Consensus 520 ~~e~~t~LV~dLalgaFYp~~i~------~~~ag~l~~~vg~~~~p~~------------~lv~T~~eV~v~~~~v~~tG 581 (771) T protein:vir:95 520 RTEPVTELVFDLALGAFYPSKIG------SLTAGRLPIPVGSVKIPPY------------KLVETGEEVTVASEQVTATG 581 (771) T ss_pred CCcceeeeeeeeccccccccccc------ccccCccceeeeeeecCcc------------ccccccceEEecceeeEecC Confidence 11 11 4555555544432111 2222333222222111000 1122223444444555433 Q ss_pred CCccEEEEEE---ECCceeEeecccccCCcceEccCcccceEEEEEEecceEEEEEeecchhh--------------cCC Q lcl|NC_019442. 479 APERVGITIM---ADDVPVIHFAPGTFKGSVVRLPAATGQNWQVMVSGFGQVERITLSTSMSE--------------MPV 541 (541) Q Consensus 479 ~~~~~~v~~~---~d~~~~~~~~~~~~~~~~~rLP~~~~~~w~iei~g~~~V~~i~la~s~~E--------------L~~ 541 (541) .+..|+.... .++..... ...++.+.|+-=+--+.|++-=.+++.=-. +=+.|.-| +|. T Consensus 582 ~~vtV~~~~r~~~~~~~~y~~---~~~dg~~g~~~Fa~~~~~~f~DW~sv~~~~-vdy~sy~~~gY~~~gd~~~~k~~PY 657 (771) T protein:vir:95 582 ELVTVKVSTRSPVIRETKYII---VEKLSSPMRISFGGYTDEEFVDWKSVDGIG-VDAPAYLLTGYLAGGDYQREKFVPY 657 (771) T ss_pred CceEEEEEEeeccccceEEEE---EEecCCCeeEEeccccCcceeecccCCCcc-cchHHHHHhhhhccchheeeeccce Confidence 2233333221 12211111 111222333322222445554444321111 00111110 111 No 13 >protein:vir:2625 Length: 715 # NCBI annotation: gp27 # Family: family:all:5234 # MgeID: mge:55 # MgeName: SIO1 # Cross-refs: genbank:acc:NP_064766;genbank:gi:9964636;genbank:GeneID:1263056 Probab=98.67 E-value=1.3e-07 Score=58.34 Aligned_cols=422 Identities=15% Similarity=0.148 Sum_probs=184.4 Q ss_pred CceEEe----ccccc----ccccccceecccccceEEEEe---eecCCee----eeeeccccc---CccccccceeEEEE Q lcl|NC_019442. 1 MPYIDI----TTMRG----MMPRVVTSMLPEHSAVLAEDC---HFRFGVI----TPERQISGV---EKTFTIKPKTIFHY 62 (541) Q Consensus 1 m~~i~i----~~f~G----~~Pr~~p~llp~~~a~~a~N~---~~~~G~l----~P~~~~~~v---~~~~~~~~~Tif~~ 62 (541) |+-..| ...-| ..||+.|-+|.-+..++|.-. -++.=.+ .|++.-.+. ++ +..++.||.. T Consensus 125 ~SPsh~~v~v~~~~G~livanp~i~~~~~~~d~~t~s~t~~~ll~r~r~f~~qg~d~~~g~~y~~~gt--~~tn~~iynl 202 (715) T protein:vir:26 125 LSPSEERVQVTSLNGYLIVASPAINTFYLGFNTSTEAFTATSISFKERDFEWQGSDVDVTSLYFGEGT--SVSNQRIYDT 202 (715) T ss_pred cccceeEEEEEEeeeEEEEecCCccEEEEEecCCcceeEeeEEEEEeeeheeeccccccccccccCCc--ccCchhheec Confidence 322111 11112 257777777665555544321 1111111 233333333 22 2345556655 Q ss_pred CCcEEEEeCCeEEEeeCCcccCCCCeEEEeCCCCcceeecceeeccccCCccceeeecCCCCCccceEEecCCCCCCCCC Q lcl|NC_019442. 63 RDDFWFAWPDVVDVIRSPIAQDPHGRIYYTDGRFPKVTDATIATKGDGNHPASSYSLGIPAPTTAPVCTVQQGGDVSDDN 142 (541) Q Consensus 63 ~~~~W~~w~~~V~vv~spia~D~~~Rvy~t~~~~pk~t~~~ia~~g~g~~p~a~y~LGVp~P~~~pv~~v~~~~~~~~~~ 142 (541) ....|-. ..+.+.-.|.. +++ || | +..+.|++-...=++=.|.+-..+. .+ T Consensus 203 yN~gw~~--p~gt~~~N~~~-~yi--Vy------p---------a~s~~~~S~kd~n~afsk~ad~ei~---------tG 253 (715) T protein:vir:26 203 YNVGWVG--PKGSAALNTYG-SYI--VY------P---------ALTHPWYSGKDANGAFNKADWLEIY---------TG 253 (715) T ss_pred ccceeec--ceeEEEEcCCC-Cce--Ee------c---------ccccccCCCcccccccChhhccccc---------cc Confidence 5555654 22222222211 111 11 0 1111222211111111111111110 01 Q ss_pred CCcccceEEEEEEEecCCCccCccccccceeec------CCCCEEEEccccCCCCccccceEEEEEeecCCCceeEEEEE Q lcl|NC_019442. 143 PNDDETRFYTETFVSDYGEEGPPGPASLEVTLR------TPGTAVQLTLSPVPLQNASIKRRRIYRSASGGGEADFLLVA 216 (541) Q Consensus 143 ~~~~~ty~Yv~T~V~~~GeEs~Ps~~S~~vtv~------~~g~~v~l~~~p~~~~~~~i~~~RIYRs~t~~~~~~~~lVa 216 (541) ...+-.-+|+ .|..-+-+ +++..+|... .=.++|-..+..-.-+ .-|||-| +||- T Consensus 254 t~~~~~G~yi---~D~~~~g~--~~leeev~k~R~rsv~~yaGrV~yagiD~dkn-----g~rilfS---------qLv~ 314 (715) T protein:vir:26 254 SSLASNGHYV---LDVFNKAR--TGLTTEVETGRFRSVAAYAGRVFYAGIDSAKN-----GGKVYFS---------RLTE 314 (715) T ss_pred cccccCceEE---EeeeecCC--ccchhhhhcCCCcceeeecceEEEeecccccC-----CCeEEEe---------hhhc Confidence 1222233565 23322222 3333444311 0122233222111111 1244432 3444 Q ss_pred eeccceEEEEeccccc-ccCccchhhhhhCCCCCcceEEeccCcEEEEEeCCEEEEecCCCcccCchhcccccCcceEEE Q lcl|NC_019442. 217 ELDASVLSYTDKIPGK-NLGPSLATWDYLPPPENMTGLCLMANGIAAGFAGNEVMFSEAYLPYAWPEVNRHTTAEDIVAI 295 (541) Q Consensus 217 el~~~~~sf~D~~~~~-~L~~~L~t~~~~~pP~~~~gL~~m~NGi~a~f~Gn~l~fSep~~P~awp~~y~~t~~~~Iv~i 295 (541) .++.-..+|.|+.+.+ ++.+.+.| ..|.| +. .=.++|+.| T Consensus 315 s~~di~nCyQd~DPTsee~~dLidT----------------DGg~i--------ri---------------~gah~ii~L 355 (715) T protein:vir:26 315 RMSDVGNCYQVNDPTSEVLSDLLDT----------------DGGVV--------RI---------------PDAHNIRKL 355 (715) T ss_pred chhhcccccccCCCchhhhhhhhhc----------------CCCEE--------Ee---------------cCCCCceeE Confidence 5544456888876643 33333322 22222 11 112889999 Q ss_pred EEcCCcEEEEEcCCEEEEEccC----cccceEEeecccccccccchheeCCccEEEecCCcEEEEeCCCceEEEecccCC Q lcl|NC_019442. 296 CPLGTSLVVATKGEPYLFSGVS----PSTISGSRIPSMQACLSRRSMVAMEGFVLYAGTNGLVSVDVNGNTALATEKIIS 371 (541) Q Consensus 296 a~v~~~lvV~T~~~py~l~G~~----p~s~~~~~l~~~~pCvs~rsiv~~~~~v~y~s~dGLv~~~~~G~~~~vT~~~~~ 371 (541) ..++..|.|+++..+|.+.|++ --...+.|+.. .+|.+.+|+|++++.++|.|.+|++.++.+-+...+|++++| T Consensus 356 v~f~~sLlvf~~NGVWAi~G~d~g~tATdY~ltKIs~-vg~sspnSvVvv~~~i~~WsdtGIyal~~Nd~fn~~tAqNLT 434 (715) T protein:vir:26 356 HVLGASLLVFAENGVWAVAGVDNVFRATEYAITRISD-VGLSNENSFVVADGIPIWWGKTGIYAVQQSENLNTPTAQNLS 434 (715) T ss_pred EEecceEEEEEecceEEEeccCCceeeeeeEEEEeee-eccCCCccEEEecceEEEeeCCcEEEEEeccccCcchhhccc Confidence 9999999999999999996665 33556777766 999999999999999999999999999999668899999999 Q ss_pred hhHhhhhcC--cceE---EEEEE---cCeEEEEEecCCC-----ccceEEEccCCceeEEEeecccEEEEEecCCEEEEE Q lcl|NC_019442. 372 PEQWQSQFN--PASI---VAYSW---RGEYIACYTKPDG-----KQDVFVFSPVNMDIRYLSTPFDCAWVDLAKDMMRVV 438 (541) Q Consensus 372 ~~~W~~~l~--P~ti---~a~~~---eG~Y~~~y~~~~g-----~~~~~i~d~~~~~~~~~~~~~d~~~~~~~~d~LY~~ 438 (541) ..+.|+-++ |..+ +++.| |+||+-+|.+.+. ..+.++||+.-|.|..-... |-++..-|++ T Consensus 435 ekTIq~~~~~I~~dk~knVtg~fd~~e~rVyW~yPn~dt~vdykyd~vLV~dLalgaFYp~~v~------~~a~~~~~~i 508 (715) T protein:vir:26 435 LSTIQTLWNNISNAKKAQVTVEYDKINQRVFWFYPDNDESVDYKYNNILVMDLALQAFYPWRVE------DEASSTSYII 508 (715) T ss_pred hHHHHHHHhhcchhhhcceEEEEEccCCEEEEEEcCCceeeceeecCeEEEEeccccccccccc------ccccccceee Confidence 666655444 4443 57777 9999999964332 12456666665555442211 1112222222 Q ss_pred ECCEEEEecCCCCceeEEEEcceEEeCcccceeEEEEeeC-CCccEEEEE----EECCceeEeecccccCCcceEccCc- Q lcl|NC_019442. 439 TGDKMSVLAGGSLPSTIRWHSKIFSLPERTSFSCIRVKSP-APERVGITI----MADDVPVIHFAPGTFKGSVVRLPAA- 512 (541) Q Consensus 439 ~g~~i~~~~~g~~~~~~~WrSk~f~~~~~~~~~~~~V~~~-~~~~~~v~~----~~d~~~~~~~~~~~~~~~~~rLP~~- 512 (541) ---.+ + ++ |......++ -+..+|... ....+++.- ..|.+- -....++.+.|+-=+ T Consensus 509 g~~~~----~---~~------~~~~t~~~v-v~~~~v~~~g~~~~v~~~~r~~~~~~~~~----~~~~~~~~~~~~~f~~ 570 (715) T protein:vir:26 509 GTSYY----G---GL------GSTSTETQV-VNGADVVVNGSDNVVATLYRDYLEGDSEI----KLLVRDGTTGKMTFAT 570 (715) T ss_pred eeeee----C---Cc------ccccchhhe-eccceEEEeccceEEEEeecccccccceE----EEEEEcCCceeEEEec Confidence 11111 0 00 000111111 112222211 111121111 011110 111223334444311 Q ss_pred -ccce---EEEEEEecceEEEEEeec--------chhhcCC Q lcl|NC_019442. 513 -TGQN---WQVMVSGFGQVERITLST--------SMSEMPV 541 (541) Q Consensus 513 -~~~~---w~iei~g~~~V~~i~la~--------s~~EL~~ 541 (541) +..+ |- .+.-..-.++. +..-.|. T Consensus 571 ~~~~~~~dw~-----s~d~~~~~~~gy~~~gd~~~~k~~py 606 (715) T protein:vir:26 571 FRGDTYLDWG-----SADYKSFAEAGYDFMGDITTFKNAPY 606 (715) T ss_pred ccCceeeecc-----ccchhhHHHhhhhhcccceeeecCce Confidence 2222 21 11000000000 0000000 No 14 >protein:vir:108312 Length: 458 # NCBI annotation: hypothetical protein # Family: family:all:1540 # MgeID: mge:2007 # MgeName: BA3 # Cross-refs: genbank:acc:YP_001552278;genbank:gi:160700603;genbank:GeneID:5758828 Probab=98.55 E-value=3e-07 Score=56.36 Aligned_cols=394 Identities=9% Similarity=0.049 Sum_probs=194.8 Q ss_pred cceEEecCCCCCCCCCCCcccceEEEEEEEecCCCccCccccccceeecCCCCEEEEccccCCCC------c--cccceE Q lcl|NC_019442. 127 APVCTVQQGGDVSDDNPNDDETRFYTETFVSDYGEEGPPGPASLEVTLRTPGTAVQLTLSPVPLQ------N--ASIKRR 198 (541) Q Consensus 127 ~pv~~v~~~~~~~~~~~~~~~ty~Yv~T~V~~~GeEs~Ps~~S~~vtv~~~g~~v~l~~~p~~~~------~--~~i~~~ 198 (541) -+..++..++.....-..+ -+-.+.+.=..+|++ -|.......||......+.+.|-. + .++.+. T Consensus 1 m~~~~ip~gsy~a~~~~~d---aq~~VN~yp~~~e~g----~ss~~l~~tPGl~~f~~~~~~~~~g~~~~~g~ly~v~g~ 73 (458) T protein:vir:10 1 MVQRQIPLVATTAEGDVSG---QEILVNVYPRKSDGG----KYPFTLRHTPGLAFFCELPTFPVMAMHQNGSRAFAVTPR 73 (458) T ss_pred Cceeeeceeeeeccccccc---ceeeeeeeeeccccc----ccccceEecCCceeeecCCCCceeeEEecCCEEEEeeCc Confidence 1111111111111110111 111222222223322 122333445776665433332211 1 245566 Q ss_pred EEEEeecCCCceeEEEEEeeccce-EEEEeccccccc--Cccc-----hh----hhhhCCCCCcceEEeccCcEEEEE-e Q lcl|NC_019442. 199 RIYRSASGGGEADFLLVAELDASV-LSYTDKIPGKNL--GPSL-----AT----WDYLPPPENMTGLCLMANGIAAGF-A 265 (541) Q Consensus 199 RIYRs~t~~~~~~~~lVael~~~~-~sf~D~~~~~~L--~~~L-----~t----~~~~~pP~~~~gL~~m~NGi~a~f-~ 265 (541) .|||=..++ .+-.+++++.+. .+.+|+....-. |+.+ .| ..|++--.+.+.++-+ .|++... . T Consensus 74 ~LY~V~~~~---~~~~iG~i~gsg~VsMa~ng~q~vi~~G~~gY~yd~at~~~~~i~d~~~~~~~~v~~~-dGy~V~~~~ 149 (458) T protein:vir:10 74 DMYEISKDG---TYKRLGSVDFKGRVVMEDNGKQIVMVDGEKGYYYDSETEIVQEIKAEGFYPASTVTYQ-DGYFIFDRK 149 (458) T ss_pred eEEEEeCCc---eEEEEecccCceeEEEeeCCcEEEEEECCeEEEEeecccEEEeccCccccCcceEEEe-CcEEEEEee Confidence 777722222 245666665432 245555421000 1100 01 1111111112233333 5555433 2 Q ss_pred -CCEEEEecCCCcccC-chhc-cc-ccCcceEEEEEcCCcEEEEEcCCE--EEEEccCcccceEEee-cccccccccchh Q lcl|NC_019442. 266 -GNEVMFSEAYLPYAW-PEVN-RH-TTAEDIVAICPLGTSLVVATKGEP--YLFSGVSPSTISGSRI-PSMQACLSRRSM 338 (541) Q Consensus 266 -Gn~l~fSep~~P~aw-p~~y-~~-t~~~~Iv~ia~v~~~lvV~T~~~p--y~l~G~~p~s~~~~~l-~~~~pCvs~rsi 338 (541) ++....|+.... .| |.+| .- --+|.|++|...-.-||++-+... |..+|..+-.++..+- ..+.+|.++.|+ T Consensus 150 g~~~~~is~L~d~-s~d~l~fa~Ae~~pD~iv~i~~~~~~i~~fG~~TiEvw~ntG~a~fpy~r~~ga~i~~Gcaa~~sv 228 (458) T protein:vir:10 150 GTGQFFISELLDV-AFDPLDFATAEGQPDPLLAVLSDHREVFMFGQETIEVWYNSGAADFPFERNQGAFIEKGIGAPYSV 228 (458) T ss_pred CCCEEEEEecCcc-eeCcceeeeecCCCCceEEEEeeccEEEEEeccceEEEEecCCCCcceeecccceeeecccCcchh Confidence 344555654332 34 3333 11 123789999999999999988776 9999998866555543 347799999999 Q ss_pred eeCCccEEEecCCcEEEEeCCCceEEEecccCChhHhhhhcCcceEEEEEE--cCe--EEEEEecCCCccceEEEccCCc Q lcl|NC_019442. 339 VAMEGFVLYAGTNGLVSVDVNGNTALATEKIISPEQWQSQFNPASIVAYSW--RGE--YIACYTKPDGKQDVFVFSPVNM 414 (541) Q Consensus 339 v~~~~~v~y~s~dGLv~~~~~G~~~~vT~~~~~~~~W~~~l~P~ti~a~~~--eG~--Y~~~y~~~~g~~~~~i~d~~~~ 414 (541) ..+++.++|+|+||.|-...+++++.|+-.=|+ +.|++ .+.++.+|..| ||. |+..+... .....||...+ T Consensus 229 ~~~~~t~~~l~~d~~Vy~l~g~~~~rIST~aIE-~~i~s-y~~~da~a~t~~~eGH~fy~LtfP~a---~~Tw~yD~~t~ 303 (458) T protein:vir:10 229 AKTNNTVYFIGSDLMIYQITGYTPVRISTHAVE-QTLKG-VNLSDAFAYTYQSEGHLFYVLTIPGK---NLTWCYDISSG 303 (458) T ss_pred hhhCceEEEEcCCeEEEEecCceeEEeeCHHHH-HHHhc-CChhheEEEEEEecCeEEEEEECCCC---CceeEEecccc Confidence 999999999999999888878778777655555 47776 78888887777 998 77776532 23689998876 Q ss_pred eeEEEee----cccEEEEEecCCEEEE--EECCEEEEecCC---CCceeEE--EEcceEEeCcc-cceeEEEEe------ Q lcl|NC_019442. 415 DIRYLST----PFDCAWVDLAKDMMRV--VTGDKMSVLAGG---SLPSTIR--WHSKIFSLPER-TSFSCIRVK------ 476 (541) Q Consensus 415 ~~~~~~~----~~d~~~~~~~~d~LY~--~~g~~i~~~~~g---~~~~~~~--WrSk~f~~~~~-~~~~~~~V~------ 476 (541) .-..+.. +|-+.-+-...+++.+ .+++.||+|+-. +...++. --++.|.-... +.+=.+.++ T Consensus 304 ~Wher~Sg~~~~~Ra~~~v~~~g~~~vGD~~ng~ly~ld~~~~td~g~~i~~~~~~p~~~~~~~rl~~~~~el~~~tGvg 383 (458) T protein:vir:10 304 SWHVRQSYQFDRHVSNNSIYFDQKTLVGDFQNGRIYIMADNYYTDDGDPVVREFILPVVNNGREFLTVDSLELDLSSGVG 383 (458) T ss_pred cceeeccCCCCceEEEEEEEeCCeEEEEEcCCCeEEEEcccCcCCCCceeeeeeeccceeCCCCeEEEEEEEEEEeccee Confidence 4222111 2333333344566666 347888888553 2223332 22444422111 111122221 Q ss_pred --eCCCccEEEEEE--ECCceeEeec-----cccc---CCc--ceEccCcccceEEEEEEecceEEEEEeecchhhc Q lcl|NC_019442. 477 --SPAPERVGITIM--ADDVPVIHFA-----PGTF---KGS--VVRLPAATGQNWQVMVSGFGQVERITLSTSMSEM 539 (541) Q Consensus 477 --~~~~~~~~v~~~--~d~~~~~~~~-----~~~~---~~~--~~rLP~~~~~~w~iei~g~~~V~~i~la~s~~EL 539 (541) .++.....|.++ -||+.-.... .+.. +-| ..|| |++|++-+||+=..+|-..-++.+++== T Consensus 384 ~~~~~~~~p~~~l~~S~d~g~~~s~~~~~~~lg~~gey~tr~~~~rl--G~ar~rvf~v~~s~p~~~~l~ga~~~~r 458 (458) T protein:vir:10 384 LTVGQGSDPELRVYFSKDNGNEYSQNFKVGKIGRKGEFLTRAKVNRF--GCARQFTFKVEISDPIPVDIGGAWVEVR 458 (458) T ss_pred eeeCCCCCceEEEEEeeCCCcccchhHHHhhcCCcchhhhhhhhhhh--ccCcceEEEEEEecchhhcceeeeEEeC Confidence 123334455554 2333221111 0111 111 1133 2455555666655665555555544322 No 15 >protein:vir:9268 Length: 472 # NCBI annotation: 10 # Family: family:all:1540 # MgeID: mge:164 # MgeName: ST64T # Cross-refs: genbank:acc:NP_720332;genbank:gi:24371590;genbank:GeneID:955815 Probab=98.51 E-value=4.1e-07 Score=55.58 Aligned_cols=394 Identities=12% Similarity=0.114 Sum_probs=186.8 Q ss_pred CccceeeecCCCCCccceEEecCCCCCCCCCCCcccceEEEEEEEecCCCccCccccccceeecCCCCEEEEccccCCCC Q lcl|NC_019442. 112 HPASSYSLGIPAPTTAPVCTVQQGGDVSDDNPNDDETRFYTETFVSDYGEEGPPGPASLEVTLRTPGTAVQLTLSPVPLQ 191 (541) Q Consensus 112 ~p~a~y~LGVp~P~~~pv~~v~~~~~~~~~~~~~~~ty~Yv~T~V~~~GeEs~Ps~~S~~vtv~~~g~~v~l~~~p~~~~ 191 (541) +|-. .||-+. +-.-..- .+.+. ..--|+.+..++.-.- |.......+|....-. .+-+.- T Consensus 1 m~~~----~ipl~~--------g~~~~~~-----~a~~~-~~~pvn~y~~~~~~~~-ss~~Lr~~pG~~~~a~-~~G~~R 60 (472) T protein:vir:92 1 MPIQ----QLPMMK--------GMGKDFK-----NADYI-DYLPINMLATPKEVLD-SSGYLRSFPGIAKRND-VNGVSR 60 (472) T ss_pred Ccee----eccccc--------cccccCc-----cCcce-eeeecccccccccccc-cccceeecccceeecC-CCCccc Confidence 1111 011111 0000000 00000 1113444433322222 2233344566544321 111111 Q ss_pred cc--ccceEEEEEeecCCCceeEEE---EEeeccce-EEEEeccccccc--Cc-----cch--hhhhhC-CCC-Cc---- Q lcl|NC_019442. 192 NA--SIKRRRIYRSASGGGEADFLL---VAELDASV-LSYTDKIPGKNL--GP-----SLA--TWDYLP-PPE-NM---- 250 (541) Q Consensus 192 ~~--~i~~~RIYRs~t~~~~~~~~l---Vael~~~~-~sf~D~~~~~~L--~~-----~L~--t~~~~~-pP~-~~---- 250 (541) |- +...=.+|| +.|+ .=|+- +++++.+. -+++||...-.. +. .+. +..... |.| .. T Consensus 61 G~~~~~~~~~ly~-V~G~--~Ly~v~~~iG~i~gsgrVsMa~n~~~~av~~~~~~~~Y~~~~~~~t~~~~~~d~~f~~~d 137 (472) T protein:vir:92 61 GVEYNTAQNAVYR-VCGG--KLYKGEAVVGDVAGSGRVSMAHGRTSQAVGVNGQLIEYRYDGAVKTVSNWPADSGFTQYE 137 (472) T ss_pred ceeeeeeCCeEEE-EeCc--ceEEEEeeEeeccCcccEEEecCCeEEEEEECCceeEEEEecchhhhhcccCcccccccc Confidence 11 112223555 2221 11221 34443221 244444321000 00 000 010111 211 11 Q ss_pred ----ceEEeccCcEEE-EEeCCEEEEecCCCcccCchhc--ccccC---cceEEEEEcCCcEEEEEcCCE--EEEEccCc Q lcl|NC_019442. 251 ----TGLCLMANGIAA-GFAGNEVMFSEAYLPYAWPEVN--RHTTA---EDIVAICPLGTSLVVATKGEP--YLFSGVSP 318 (541) Q Consensus 251 ----~gL~~m~NGi~a-~f~Gn~l~fSep~~P~awp~~y--~~t~~---~~Iv~ia~v~~~lvV~T~~~p--y~l~G~~p 318 (541) +.+|-+ .|++. .-.|...+|.-...+-.-|.+| |-+-+ |.|++|....+-||++-+... |..+|..+ T Consensus 138 l~~~~dv~f~-dGyfV~~~~gt~~~~iS~l~d~~~~~~y~~fa~AE~~pD~Ivgi~~~~~~l~lfG~~TiEvw~ntG~ad 216 (472) T protein:vir:92 138 LGSVRDITRL-RGRYAWSKDGTDSWFITDLEDESHPDRYSAEYRAESQPDGIIGIGSWRDFIVCFGSSTIEYFSLTGATT 216 (472) T ss_pred ccceeEEEEe-cceEEEccCCCceEEEeccCCccccccccccccccCCCCceEEEEeeccEEEEEeccceEEEEecCCCC Confidence 122233 46644 3335555555555555566666 44433 789999999999999988777 99999874 Q ss_pred -ccceEEeec---ccccccccchheeCCccEEEecCCc----EEEEeCCCceEEEecccCChhHhhhhcC-cce----EE Q lcl|NC_019442. 319 -STISGSRIP---SMQACLSRRSMVAMEGFVLYAGTNG----LVSVDVNGNTALATEKIISPEQWQSQFN-PAS----IV 385 (541) Q Consensus 319 -~s~~~~~l~---~~~pCvs~rsiv~~~~~v~y~s~dG----Lv~~~~~G~~~~vT~~~~~~~~W~~~l~-P~t----i~ 385 (541) ......+.+ .+.+|.++.|+..+++.++|+|+|+ .|....+++++.|+-.=|+ +.|++ .+ |+- +. T Consensus 217 ~~~fpy~r~~g~~i~~Gcaa~~sv~~~~~s~~~l~~~~~g~~~V~~~~g~qa~rIST~aIE-~~i~~-y~~~e~~~a~~~ 294 (472) T protein:vir:92 217 VGAALYVAQPSLMVQKGIAGTYCKTPFADSYAFISHPATGAPSVYIIGSGQASPIATASIE-KIIRS-YTADELATGVME 294 (472) T ss_pred cCcCceEEcCcceeeecccCcchhhecCceEEEEecCCCcccEEEEccCceeEEecCHHHH-HHHHh-cCcchhceeeEE Confidence 445555554 6789999999999999999999998 2445556678777433333 36665 44 433 45 Q ss_pred EEEEcCe--EEEEEecCCCccceEEEccCCce--eEEE--ee-----cccEEEEEecCCEEEE--EECCEEEEe--cCCC Q lcl|NC_019442. 386 AYSWRGE--YIACYTKPDGKQDVFVFSPVNMD--IRYL--ST-----PFDCAWVDLAKDMMRV--VTGDKMSVL--AGGS 450 (541) Q Consensus 386 a~~~eG~--Y~~~y~~~~g~~~~~i~d~~~~~--~~~~--~~-----~~d~~~~~~~~d~LY~--~~g~~i~~~--~~g~ 450 (541) ..+.||. |+.++.+ ..+.||..++. .+|. .. +|-+.-+....+++-+ .+++.||++ +..+ T Consensus 295 s~~~eGH~fy~LtfP~-----~Tw~yD~at~~~~e~W~~~~sg~~~~~~R~~~~~~~~g~~ivGD~~nG~ly~l~~~~~t 369 (472) T protein:vir:92 295 ALRFDSHELLIIHLPR-----HVLVYDASSSQNGPQWCVLKTGLYDDVYRAIDFMYEGNQITCGDKSEAVTGQLQFDISS 369 (472) T ss_pred EEEecCeeEEEEEcCC-----ceEEEEcccCcCCceeeeecCCCcccceeEEEEEeeCCeEEEEEcCCCeEEEEeccccc Confidence 6678898 8887752 47999988763 1222 22 1333333334444444 246777777 3322 Q ss_pred CceeEEEE---cceEEeCcccceeEEEEeeC-----CCccEEEEEEECCceeEee------cccccCCc--ceEccCccc Q lcl|NC_019442. 451 LPSTIRWH---SKIFSLPERTSFSCIRVKSP-----APERVGITIMADDVPVIHF------APGTFKGS--VVRLPAATG 514 (541) Q Consensus 451 ~~~~~~Wr---Sk~f~~~~~~~~~~~~V~~~-----~~~~~~v~~~~d~~~~~~~------~~~~~~~~--~~rLP~~~~ 514 (541) ......|+ +..|..+..--|- ..|+.. ...++-++.--||..+-+- .++..+.| -.||=..|. T Consensus 370 ~~~~~~~~~~~~P~~~~dn~R~~d-~eve~~~Gv~q~~d~v~L~wSddG~~~~~~~~~~~g~~g~~~tr~~~~RlG~~r~ 448 (472) T protein:vir:92 370 QYDKQQEHLLFTPIFKADNARCFD-LEVESSTGVAQYADRLFLSATTDGINYGREQMIEQNEPFVYDKRVLWKRVGRIRR 448 (472) T ss_pred cCCCcceEEEEeceEecCCCEEEE-EeeeccCCCCCcCceEEEEeeccccccccceeeccCCccchhcceeeeeeeeccc Confidence 22222333 4445444432232 444421 1122222222243322111 11111211 224444444 Q ss_pred ce-EEEEEEecceEEEEEeecchh Q lcl|NC_019442. 515 QN-WQVMVSGFGQVERITLSTSMS 537 (541) Q Consensus 515 ~~-w~iei~g~~~V~~i~la~s~~ 537 (541) +- .+|+++...+|.---+---+| T Consensus 449 ~v~f~~r~~~~~~~~l~g~~~~~E 472 (472) T protein:vir:92 449 LIGFKLRVITKSPVTLSGCQIRLE 472 (472) T ss_pred ceeEEEEEEecCcceeeeeEEeeC Confidence 43 888888999988777777777 No 16 >protein:vir:63741 Length: 468 # NCBI annotation: Cps # Family: family:all:2450 # MgeID: mge:1517 # MgeName: P100 # Cross-refs: genbank:gi:82547622;genbank:GeneID:3783474 Probab=98.39 E-value=6.4e-09 Score=65.49 Aligned_cols=249 Identities=12% Similarity=0.105 Sum_probs=88.4 Q ss_pred CceEEec-ccccccccccceecccccceEEEEeeecCCeeeeeecccccCccccccceeEEEECCcEEEEeCCeEEEeeC Q lcl|NC_019442. 1 MPYIDIT-TMRGMMPRVVTSMLPEHSAVLAEDCHFRFGVITPERQISGVEKTFTIKPKTIFHYRDDFWFAWPDVVDVIRS 79 (541) Q Consensus 1 m~~i~i~-~f~G~~Pr~~p~llp~~~a~~a~N~~~~~G~l~P~~~~~~v~~~~~~~~~Tif~~~~~~W~~w~~~V~vv~s 79 (541) +.++.|. +||=-. --+||..--..=+|.+|.+-..=+... ..+. +.=+. -+..++=.+.|.--+| T Consensus 203 eaa~~i~~gfG~~t----d~~~~~~v~a~~~~~~L~~q~~v~~~n-~~~~-------~~G~~--v~g~~sa~G~I~l~gs 268 (468) T protein:vir:63 203 QAAVMISKGYGTPT----DAYMPVGVQADFVNQQLSKQTQLVRDN-GNNV-------SVGFN--IQGFHSARGFIKLHGS 268 (468) T ss_pred HHhhhccccccChh----hhhcchhHHhhhhhhhcCceEEEEcCC-CCce-------eeeec--ccceecceeeeeecCc Confidence 3333222 121100 001111000001122222211111100 0000 11111 0111222222222222 Q ss_pred CcccCCCCeEEEeCCCCcceeecceeeccccCCccceeeecCCCCCccceEEecCCCCCCCCCCCcccceEEEEEEEecC Q lcl|NC_019442. 80 PIAQDPHGRIYYTDGRFPKVTDATIATKGDGNHPASSYSLGIPAPTTAPVCTVQQGGDVSDDNPNDDETRFYTETFVSDY 159 (541) Q Consensus 80 pia~D~~~Rvy~t~~~~pk~t~~~ia~~g~g~~p~a~y~LGVp~P~~~pv~~v~~~~~~~~~~~~~~~ty~Yv~T~V~~~ 159 (541) -| -|+.... .-. -+++. .-|.|.... ++...+.. +.....+..+|.|++++|+.. T Consensus 269 ~i----------l~~~~~l-~~~---------~~~~~---~Apsp~~vs-aT~~~~~~-g~~~~~~~a~y~Y~v~~vs~~ 323 (468) T protein:vir:63 269 TV----------MENEQIL-DER---------ILALP---TAPQPAKVT-ATQEAGKK-GQFRAEDLAAHEYKVVVSSDD 323 (468) T ss_pred ee----------eccccCC-Ccc---------ccccc---ccccCCccc-eeeecccC-CcccCCCcceEEEEEEEECCC Confidence 11 1110000 000 00111 112332111 12212221 222335567899999999999 Q ss_pred CCccCccccccceeecCCCCEEEEccccCCCCccccceEEEEEeecCCCceeEEEEEeeccc-----eEEEEeccccc-- Q lcl|NC_019442. 160 GEEGPPGPASLEVTLRTPGTAVQLTLSPVPLQNASIKRRRIYRSASGGGEADFLLVAELDAS-----VLSYTDKIPGK-- 232 (541) Q Consensus 160 GeEs~Ps~~S~~vtv~~~g~~v~l~~~p~~~~~~~i~~~RIYRs~t~~~~~~~~lVael~~~-----~~sf~D~~~~~-- 232 (541) | ||+||++ ..++|.+..+.+.|+.-..+..+..-+.++|||+. .++++|+|+++.+.+ +.+|.|+.+.. T Consensus 324 G-ES~pS~~-vtvTVaa~~dg~~ltIt~~~~~~~~p~yv~IYR~~--~gg~~f~li~~va~~~a~~gt~tf~D~n~~iPg 399 (468) T protein:vir:63 324 A-ESIASEV-ATATVTAKDDGVKLEIELAPMYSSRPQFVSIYRKG--AETGLFYLIARVPASKAENNVITFYDLNDSIPE 399 (468) T ss_pred C-ccccccc-eEEEecCcccceeEEEEecCCCCCcceEEEEEEeC--CCCcceeEeeeEeeeecCCCeEEEEcCCcccCC Confidence 9 9999974 46666564444444443323223334679999985 445699999877643 55899986653 Q ss_pred -------ccCccchhhhhhCCCCCcceEEeccCcEEEE-EeCCEEEEecCCCcccCchhc-ccccCcceEEEEEcCCc Q lcl|NC_019442. 233 -------NLGPSLATWDYLPPPENMTGLCLMANGIAAG-FAGNEVMFSEAYLPYAWPEVN-RHTTAEDIVAICPLGTS 301 (541) Q Consensus 233 -------~L~~~L~t~~~~~pP~~~~gL~~m~NGi~a~-f~Gn~l~fSep~~P~awp~~y-~~t~~~~Iv~ia~v~~~ 301 (541) ++.+ .+..|..-- .++.|++-.+-+ ...-++||--+..- -|+++ ++..-.. ..|.-+-.. T Consensus 400 T~~~fVgem~~--~~i~~~~ll----pm~~lplA~~n~~~~~~Vl~Ygalal~--~Pk~~~~ikNv~~-~~~~~~~~~ 468 (468) T protein:vir:63 400 TVDVFVGEMSA--NVVHLFELL----PMMRLPLAQINASVTFAVLWYGALALR--APKKWVRIRNVKY-IPVKNVHSN 468 (468) T ss_pred CcceeeeecCh--hHHHHHHHh----ccccCChhHhccchhhhhhhhhHHhhh--ccccceEEEEeee-eeeccccCC Confidence 2211 122222211 112222222111 11233333321110 12221 1111000 000111111 No 17 >protein:vir:80491 Length: 467 # NCBI annotation: Cps # Family: family:all:2450 # MgeID: mge:1883 # MgeName: A511 # Cross-refs: genbank:acc:YP_001468466;genbank:gi:157325041;genbank:GeneID:5601449 Probab=98.39 E-value=6.8e-09 Score=65.35 Aligned_cols=249 Identities=12% Similarity=0.105 Sum_probs=88.4 Q ss_pred CceEEec-ccccccccccceecccccceEEEEeeecCCeeeeeecccccCccccccceeEEEECCcEEEEeCCeEEEeeC Q lcl|NC_019442. 1 MPYIDIT-TMRGMMPRVVTSMLPEHSAVLAEDCHFRFGVITPERQISGVEKTFTIKPKTIFHYRDDFWFAWPDVVDVIRS 79 (541) Q Consensus 1 m~~i~i~-~f~G~~Pr~~p~llp~~~a~~a~N~~~~~G~l~P~~~~~~v~~~~~~~~~Tif~~~~~~W~~w~~~V~vv~s 79 (541) +.++.|. +||=-. --+||..--..=+|.+|.+-..=+... ..+. +.=+. -+..++=.+.|.--+| T Consensus 202 eaa~~i~~gfG~~t----d~~~p~~v~a~~~~~~L~~q~~v~~~n-~~~~-------~~G~~--v~g~~sa~G~I~l~gs 267 (467) T protein:vir:80 202 QAAVMISKGYGTPT----DAYMPVGVQADFVNQQLSKQTQLVRDN-GNNV-------SVGFN--IQGFHSARGFIKLHGS 267 (467) T ss_pred HHhhhccccccChh----hhhcchhHHhhhhhhhcCceEEEEcCC-CCce-------eeeec--ccceecceeeeeecCc Confidence 3333222 121100 001111000001122222211111100 0000 11111 0111222222222222 Q ss_pred CcccCCCCeEEEeCCCCcceeecceeeccccCCccceeeecCCCCCccceEEecCCCCCCCCCCCcccceEEEEEEEecC Q lcl|NC_019442. 80 PIAQDPHGRIYYTDGRFPKVTDATIATKGDGNHPASSYSLGIPAPTTAPVCTVQQGGDVSDDNPNDDETRFYTETFVSDY 159 (541) Q Consensus 80 pia~D~~~Rvy~t~~~~pk~t~~~ia~~g~g~~p~a~y~LGVp~P~~~pv~~v~~~~~~~~~~~~~~~ty~Yv~T~V~~~ 159 (541) -| -|+.... .-. -+++. .-|.|.... ++...+.. +.....+..+|.|++++|+.. T Consensus 268 ~i----------l~~~~~l-~~~---------~~~~~---~Apsp~~vs-aT~~~~~~-g~~~~~~~a~y~Y~v~~vs~~ 322 (467) T protein:vir:80 268 TV----------MENEQIL-DER---------ILALP---TAPQPAKVT-ATQEAGKK-GQFRAEDLAAHEYKVVVSSDD 322 (467) T ss_pred ee----------eccccCC-Ccc---------ccccc---ccccCCccc-eeeecccC-CcccCCCcceEEEEEEEECCC Confidence 11 1110000 000 00111 112332111 12212221 222335567899999999999 Q ss_pred CCccCccccccceeecCCCCEEEEccccCCCCccccceEEEEEeecCCCceeEEEEEeeccc-----eEEEEeccccc-- Q lcl|NC_019442. 160 GEEGPPGPASLEVTLRTPGTAVQLTLSPVPLQNASIKRRRIYRSASGGGEADFLLVAELDAS-----VLSYTDKIPGK-- 232 (541) Q Consensus 160 GeEs~Ps~~S~~vtv~~~g~~v~l~~~p~~~~~~~i~~~RIYRs~t~~~~~~~~lVael~~~-----~~sf~D~~~~~-- 232 (541) | ||+||++ ..++|.+..+.+.|+.-..+..+..-+.++|||+. .++++|+|+++.+.+ +.+|.|+.+.. T Consensus 323 G-ES~pS~~-vtvTVaa~~dg~~ltIt~~~~~~~~p~yv~IYR~~--~gg~~f~li~~va~~~a~~gt~tf~D~n~~iPg 398 (467) T protein:vir:80 323 A-ESIASEV-ATATVTAKDDGVKLEIELAPMYSSRPQFVSIYRKG--AETGLFYLIARVPASKAENNVITFYDLNDSIPE 398 (467) T ss_pred C-ccccccc-eEEEecCcccceeEEEEecCCCCCcceEEEEEEeC--CCCcceeEeeeEeeeecCCCeEEEEcCCcccCC Confidence 9 9999974 46666564444444443323223334679999985 445699999877643 55899986653 Q ss_pred -------ccCccchhhhhhCCCCCcceEEeccCcEEEE-EeCCEEEEecCCCcccCchhc-ccccCcceEEEEEcCCc Q lcl|NC_019442. 233 -------NLGPSLATWDYLPPPENMTGLCLMANGIAAG-FAGNEVMFSEAYLPYAWPEVN-RHTTAEDIVAICPLGTS 301 (541) Q Consensus 233 -------~L~~~L~t~~~~~pP~~~~gL~~m~NGi~a~-f~Gn~l~fSep~~P~awp~~y-~~t~~~~Iv~ia~v~~~ 301 (541) ++.+ .+..|..-- .++.|++-.+-+ ...-++||--+..- -|+++ ++..-.. ..|.-+-.. T Consensus 399 T~~~fVgem~~--~~i~~~~ll----pm~~lplA~~n~~~~~~Vl~Ygalal~--~Pk~~~~ikNv~~-~~~~~~~~~ 467 (467) T protein:vir:80 399 TVDVFVGEMSA--NVVHLFELL----PMMRLPLAQINASVTFAVLWYGALALR--APKKWVRIRNVKY-IPVKNVHSN 467 (467) T ss_pred CcceeeeecCh--hHHHHHHHh----ccccCChhHhccchhhhhhhhhHHhhh--ccccceEEEEeee-eeeccccCC Confidence 2211 122222211 112222222111 11233333321110 12221 1111000 000111111 No 18 >protein:vir:105525 Length: 472 # NCBI annotation: phage DNA stabilization protein # Family: family:all:1540 # MgeID: mge:1463 # MgeName: phiSG1 # Cross-refs: genbank:acc:YP_516194;genbank:gi:89885997;genbank:GeneID:3964385 Probab=98.27 E-value=1.8e-06 Score=52.02 Aligned_cols=389 Identities=12% Similarity=0.146 Sum_probs=194.7 Q ss_pred CccceeeecCCCCCccceEEecCCCCCCCCCCCcccceEEEEEE-EecCCCccCcccc--ccceeecCCCCEEEEccccC Q lcl|NC_019442. 112 HPASSYSLGIPAPTTAPVCTVQQGGDVSDDNPNDDETRFYTETF-VSDYGEEGPPGPA--SLEVTLRTPGTAVQLTLSPV 188 (541) Q Consensus 112 ~p~a~y~LGVp~P~~~pv~~v~~~~~~~~~~~~~~~ty~Yv~T~-V~~~GeEs~Ps~~--S~~vtv~~~g~~v~l~~~p~ 188 (541) +|-- -||-+. +- .++..+-.|.-.+ ||-+=+ |+++ |.-..-..||- ..+..++. T Consensus 1 m~~~----q~pl~~--------g~-------~~~~~~~~~~~~lpvN~y~~---p~~~~~ss~~lr~~PG~-~~~~~~~g 57 (472) T protein:vir:10 1 MAIM----QLPLLR--------GL-------GKARDDADYIDALPVNMLAT---PKPVLNASGYLRSFPGI-THKAEVAG 57 (472) T ss_pred CCce----eeeccc--------cc-------ccCccccCceeeeeeeeeec---cccccccceeecccCCc-eeecCCCc Confidence 1000 011111 00 0111233455455 555422 2222 11111122453 23333332 Q ss_pred CCCcc--ccceEEEEEeecCCCceeEEE----EEeeccc-eEEEEeccccc------c-----c-CccchhhhhhCCCC- Q lcl|NC_019442. 189 PLQNA--SIKRRRIYRSASGGGEADFLL----VAELDAS-VLSYTDKIPGK------N-----L-GPSLATWDYLPPPE- 248 (541) Q Consensus 189 ~~~~~--~i~~~RIYRs~t~~~~~~~~l----Vael~~~-~~sf~D~~~~~------~-----L-~~~L~t~~~~~pP~- 248 (541) +..|. +...=++|| +.|+ .++- +++++.+ --++.|+.... + . +.++...+|-.+.. T Consensus 58 ~~RG~~~~~~~~~lY~-V~G~---~Ly~v~~~vG~iagsg~VsMa~~~~~q~v~v~g~~~~y~y~g~~~t~~~~~~~~~i 133 (472) T protein:vir:10 58 VSRGVQYNTHEKTVYR-GLGN---QLYKGHKPIADLAGKGRISMAFSRNSQAVVAAGKMTLYRYDGTVKTLENWPKEKKY 133 (472) T ss_pred ccceeEeeeeCCeEEE-Eecc---eEEEEEeeeeeecccccEEEEecCCceEEEEecceeEEEeccchhhhhhccccccC Confidence 32222 222334455 1221 1111 2233211 11333322110 0 0 33333333333221 Q ss_pred ------CcceEEeccCcEE-EEEeCCEEE-EecCCCcccCchhc--cccc---CcceEEEEEcCCcEEEEEcCCE--EEE Q lcl|NC_019442. 249 ------NMTGLCLMANGIA-AGFAGNEVM-FSEAYLPYAWPEVN--RHTT---AEDIVAICPLGTSLVVATKGEP--YLF 313 (541) Q Consensus 249 ------~~~gL~~m~NGi~-a~f~Gn~l~-fSep~~P~awp~~y--~~t~---~~~Iv~ia~v~~~lvV~T~~~p--y~l 313 (541) +.+.+|-+ .|++ ..-.|...| +|.. ..-..|.+| |-+= +|.|++|...-+-||.+-+... |.. T Consensus 134 t~~dl~~~~~v~~~-dGyfV~~~~gt~~~~iS~L-~d~s~~~~~~~FatAE~~pD~Ivgi~~~~~~i~lfG~~TiEvw~n 211 (472) T protein:vir:10 134 TQYDIGNVRDMCHL-RGRYVWCKDGSDIFGVTDL-EDESHPDRYRALYRAESQPDGIIGIDSWRDFIVCFGASTIEYFSL 211 (472) T ss_pred CccccCCceeEEEe-CceEEEeecCCceEEEeec-CCcccCCcccceeeecCCCCceEEEEeeccEEEEEeccceEEEEe Confidence 12234444 4544 343454444 5543 222233343 2332 2889999999999999988776 999 Q ss_pred EccCcccceEEe-ec---ccccccccchheeCCccEEEecCC----cEEEEeCCCceEEEecccCChhHhhhhcC-cc-- Q lcl|NC_019442. 314 SGVSPSTISGSR-IP---SMQACLSRRSMVAMEGFVLYAGTN----GLVSVDVNGNTALATEKIISPEQWQSQFN-PA-- 382 (541) Q Consensus 314 ~G~~p~s~~~~~-l~---~~~pCvs~rsiv~~~~~v~y~s~d----GLv~~~~~G~~~~vT~~~~~~~~W~~~l~-P~-- 382 (541) +|..+-..+... .+ .+.+|.++.|+..+++.++|+|+| |.|....+++++.|+-.=|+ +.|++ .+ |+ T Consensus 212 tG~a~fpf~r~~~~pg~~iq~Gcaa~~sv~~~~~s~~~l~~d~~g~~~V~~~~g~q~~rIST~aIE-~~i~~-y~~~e~~ 289 (472) T protein:vir:10 212 TGAADGQSAIYAAQPALMVEKGIAGTHCKTRLGDAHVIISHQATGAPSVFLINQAQATSIATATIE-KILRS-YTHDELA 289 (472) T ss_pred cCCCCcceeeeccCccceeeecccCchhhhhhCceEEEEecCCCcceEEEEccCceEEEecCHHHH-HHHHh-CCccccc Confidence 999986665432 23 567999999999999999999999 67777778888888444443 46775 44 55 Q ss_pred e--EEEEEEcCe--EEEEEecCCCccceEEEccCCcee--EE--Eee-----cccEEEEEecCCEEEE--EECCEEEEec Q lcl|NC_019442. 383 S--IVAYSWRGE--YIACYTKPDGKQDVFVFSPVNMDI--RY--LST-----PFDCAWVDLAKDMMRV--VTGDKMSVLA 447 (541) Q Consensus 383 t--i~a~~~eG~--Y~~~y~~~~g~~~~~i~d~~~~~~--~~--~~~-----~~d~~~~~~~~d~LY~--~~g~~i~~~~ 447 (541) + ....+.||. |+.++.+ ..+.||..++.- ++ ... +|-+.-+-...+++.+ .+++.||+++ T Consensus 290 dA~~~s~~~eGH~fy~LtfP~-----~Tw~yD~at~~~~~~w~~~~~g~~~~~~Ra~~~~~~~g~~~vGD~~ng~l~~ld 364 (472) T protein:vir:10 290 SAVMETVRFDSHELVLIHLSR-----QVLCYDAAANQNGLQWSLLKTGFYHAPYRGIDFMFADHHLTCGDKNDSLLGQLD 364 (472) T ss_pred ceeEEEEEeCCeEEEEEEcCC-----eeEEEeccCCccceeeeeeecCCccCceEEEEEEEeCCeEEEEEcCCCeEEEEc Confidence 2 345668899 8888763 379999887632 11 111 1223333344566666 3578888875 Q ss_pred CCC--CceeEEE---EcceEEeCcccceeEEEEee-CCCccEE--EEEE-ECCce-------eEeecccccCCc--ceEc Q lcl|NC_019442. 448 GGS--LPSTIRW---HSKIFSLPERTSFSCIRVKS-PAPERVG--ITIM-ADDVP-------VIHFAPGTFKGS--VVRL 509 (541) Q Consensus 448 ~g~--~~~~~~W---rSk~f~~~~~~~~~~~~V~~-~~~~~~~--v~~~-~d~~~-------~~~~~~~~~~~~--~~rL 509 (541) -.. .....+| .++.|..+..--|. +.++. ....++. |-+. .||+. +........+-| -.|| T Consensus 365 ~~~~td~g~pi~~~~~tp~~~~~n~Rvfd-~el~~~tGvg~~~~~v~L~wSddg~~~~~~~~~~~~g~~~~~~r~~w~Rl 443 (472) T protein:vir:10 365 FASSAQYEKPQEHVLYTPLFKADNARVFD-FELEASTGVAHIADRLFLSATADGLHFGREQMINQNAPFAYDRRILWRRM 443 (472) T ss_pred CcCcCCCCceeEEEeeccceecCCCeEEE-EEEEeeCCcCccCceEEEEEeccccccchhHHHhhcCccchhheeeehee Confidence 422 2222333 47777777665554 33332 2222222 2222 23331 111111111212 1133 Q ss_pred cCcccc-eEEEEEEecceEEEEEeecchh Q lcl|NC_019442. 510 PAATGQ-NWQVMVSGFGQVERITLSTSMS 537 (541) Q Consensus 510 P~~~~~-~w~iei~g~~~V~~i~la~s~~ 537 (541) =..|.+ -..++|+-..+|..--++.-|| T Consensus 444 G~ar~~vgf~~rv~~s~pv~~~~~~a~~e 472 (472) T protein:vir:10 444 GRVRKNLGFKVRVITSSPVTLSGCQIRME 472 (472) T ss_pred eccccccceEEEEEEecccccccceeeeC Confidence 333333 1678899999999888888888 No 19 >protein:vir:96666 Length: 462 # NCBI annotation: ORF016 # Family: family:all:2450 # MgeID: mge:1623 # MgeName: Twort # Cross-refs: genbank:acc:YP_238545;genbank:gi:66391271;genbank:GeneID:5130448 Probab=98.27 E-value=2.9e-08 Score=61.93 Aligned_cols=255 Identities=17% Similarity=0.187 Sum_probs=98.0 Q ss_pred CceEEecccccccccccceecccccceEEEEeeecCCeeeeeecccccCcccc---ccceeEEEECCcEEEEeCCeEEEe Q lcl|NC_019442. 1 MPYIDITTMRGMMPRVVTSMLPEHSAVLAEDCHFRFGVITPERQISGVEKTFT---IKPKTIFHYRDDFWFAWPDVVDVI 77 (541) Q Consensus 1 m~~i~i~~f~G~~Pr~~p~llp~~~a~~a~N~~~~~G~l~P~~~~~~v~~~~~---~~~~Tif~~~~~~W~~w~~~V~vv 77 (541) ...=.+-+.+|+ .|...+|.+.+ .+..-..|..+=...|..+.--|. ...+..|--.+. +.+. + T Consensus 182 I~~~NViDarG~--~Ls~~~ln~aa----~~i~~~fGt~TD~~~p~~v~a~f~~~~l~~qrv~~~~n~------g~~~-~ 248 (462) T protein:vir:96 182 IDKDNVIDAKGE--SLTETLLNRSA----VLIGKSFGTATDAYMPIGVHADFVNSVLGRQMQLMQDNS------GNVN-A 248 (462) T ss_pred cCCCceeecCCC--CccHHHHhhhh----hhcccccCChhheecchHHHHHHHHhhcCceEEEEcCCC------Ccee-e Confidence 111112223332 12233333222 333334444444444433321111 011111111111 0000 0 Q ss_pred eCCcccCCCCeEEEeCCCCcceeecceeeccccCCcc--c--eeeecCCCCCccceE--EecCCCCCCCCCCCcccceEE Q lcl|NC_019442. 78 RSPIAQDPHGRIYYTDGRFPKVTDATIATKGDGNHPA--S--SYSLGIPAPTTAPVC--TVQQGGDVSDDNPNDDETRFY 151 (541) Q Consensus 78 ~spia~D~~~Rvy~t~~~~pk~t~~~ia~~g~g~~p~--a--~y~LGVp~P~~~pv~--~v~~~~~~~~~~~~~~~ty~Y 151 (541) --+|- + |++- .+.|.+-|.-.+.. - ..+.-+|.+.+++.+ ++..+....-..+.+..+|.| T Consensus 249 G~~v~-----~-f~s~-------~G~I~L~~s~~m~~~~i~~~~~~~~p~ap~~~~vsaTv~t~~~g~f~~~~d~~~y~Y 315 (462) T protein:vir:96 249 GYNVQ-----G-FYSS-------RGFIKLHGSTVMENELILDESLQPLPNAPQPATVKATVETGKKGLFTDEHDRAELTY 315 (462) T ss_pred eeecc-----c-eeee-------eeeeeeCCceecCcccccccccccCCCCCCCCceeEEEEeCCCCCCCCccCceeEEE Confidence 00000 0 2221 22222222111000 0 011112332233333 322221111123445789999 Q ss_pred EEEEEecCCCccCccccccceeecCCCCEEEEccccCCCCccccceEEEEEeecCCCceeEEEEEeec------cceEEE Q lcl|NC_019442. 152 TETFVSDYGEEGPPGPASLEVTLRTPGTAVQLTLSPVPLQNASIKRRRIYRSASGGGEADFLLVAELD------ASVLSY 225 (541) Q Consensus 152 v~T~V~~~GeEs~Ps~~S~~vtv~~~g~~v~l~~~p~~~~~~~i~~~RIYRs~t~~~~~~~~lVael~------~~~~sf 225 (541) +++.|+.+| ||.||++ ..+|+-..+..+.|++..++..+...+..+|||+. .++++|+|++.++ .++.+| T Consensus 316 ~V~avs~dg-eS~PS~~-VtaTva~~~~gv~ltIt~~a~~~~~~~~~~IYRk~--~~sg~y~li~rv~~~~~n~~gt~tf 391 (462) T protein:vir:96 316 KVVVNSDDA-QSAPSEA-VTATVNNATDGVKLEISVNAMYQQQPQFVSIYRQG--RKTGDFYLIKRLGMKEVNDEGKLVF 391 (462) T ss_pred EEEEECCCC-cccccee-eEeeeecccccceEEEEEcCCccccceEEEEEeec--CCccccceeeeeeceeecCCcceeE Confidence 999999987 8899976 45555444555666666556677778899999974 5667999999885 446688 Q ss_pred Eeccccccc------Cc-cchhhhhhC--CCCCcceEEeccCcEEE-EEeCCEEEEecCCCcccCchhc-ccccCcceE Q lcl|NC_019442. 226 TDKIPGKNL------GP-SLATWDYLP--PPENMTGLCLMANGIAA-GFAGNEVMFSEAYLPYAWPEVN-RHTTAEDIV 293 (541) Q Consensus 226 ~D~~~~~~L------~~-~L~t~~~~~--pP~~~~gL~~m~NGi~a-~f~Gn~l~fSep~~P~awp~~y-~~t~~~~Iv 293 (541) .|.-.-..- ++ .-.+..|.+ |.- .+|.-..- ....-++||--+.+ .-|+++ ++..-..|| T Consensus 392 ~D~n~~iPgt~~~fVge~~p~vi~~~qllpm~------~~plA~~n~~~~waVl~yG~Lal--~~Pk~~~~ikNv~~~~ 462 (462) T protein:vir:96 392 YDLNETIPETTDVFVGEMSPQVLHLFELLPMM------KLPLAQINASVTFAVLWYGALAL--RAPKKWVRIKNVKYIV 462 (462) T ss_pred eeccCCCCCcccceeecCCchhhhhhhhhhhh------hcCcccccchhhhhhhhhhHHHh--hcccccEEEEEEEEeC Confidence 775422100 00 001122222 110 01100000 00111222221100 013332 222212222 No 20 >protein:vir:100960 Length: 472 # NCBI annotation: gp10 # Family: family:all:1540 # MgeID: mge:1509 # MgeName: ST104 # Cross-refs: genbank:acc:YP_006412;genbank:gi:46358704;genbank:GeneID:2777110 Probab=98.17 E-value=3.1e-06 Score=50.76 Aligned_cols=394 Identities=11% Similarity=0.100 Sum_probs=185.5 Q ss_pred CccceeeecCCCCCccceEEecCCCCCCCCCCCcccceEEEEEEEecCCCccCccccccceeecCCCCEEEEccccCCCC Q lcl|NC_019442. 112 HPASSYSLGIPAPTTAPVCTVQQGGDVSDDNPNDDETRFYTETFVSDYGEEGPPGPASLEVTLRTPGTAVQLTLSPVPLQ 191 (541) Q Consensus 112 ~p~a~y~LGVp~P~~~pv~~v~~~~~~~~~~~~~~~ty~Yv~T~V~~~GeEs~Ps~~S~~vtv~~~g~~v~l~~~p~~~~ 191 (541) +|-. .||-+. +-.-.. ....+. ..--|+.+..++.-.- |.......+|....-. .+-+.- T Consensus 1 m~~~----~ipl~~--------g~~~~~-----~~a~~~-~~~pvn~y~~~~~~~~-ss~~Lr~~pG~~~~a~-~~G~~R 60 (472) T protein:vir:10 1 MPIQ----QLPMMK--------GMGKDF-----KNADYI-DYLPINMLATPKEVLN-SSGYLRSFPGIAKRND-VNGVSR 60 (472) T ss_pred Ccee----eccccc--------ccccCC-----CcCcce-eeeeeccccccccccc-cccceeecccceeecC-CCCccc Confidence 1110 111111 000000 001111 1113444443332222 2233344566544322 121111 Q ss_pred cc--ccceEEEEEeecCCCceeEEE---EEeeccc-eEEEEeccccccc--Cc-----cch--hhhhhC-CCC-Cc---- Q lcl|NC_019442. 192 NA--SIKRRRIYRSASGGGEADFLL---VAELDAS-VLSYTDKIPGKNL--GP-----SLA--TWDYLP-PPE-NM---- 250 (541) Q Consensus 192 ~~--~i~~~RIYRs~t~~~~~~~~l---Vael~~~-~~sf~D~~~~~~L--~~-----~L~--t~~~~~-pP~-~~---- 250 (541) |- +...=.+|| +.|+ .=|+- +++++.+ .-+++||...-.. +. .+. +..... |.| .. T Consensus 61 G~~~~~~~~~ly~-V~G~--~Ly~v~~~iG~i~gsgrVsMa~n~~~~~v~~~~~~~~Y~~~~~~~t~~~~~~d~~f~~~d 137 (472) T protein:vir:10 61 GVEYNTAQNAVYR-VCGG--KLYKGEAVVGDVAGSGRVSMAHGRTSQAVGVNGQLIEYRYDGAVKTVSNWPADSGFTQYE 137 (472) T ss_pred ceeeeeeCCeEEE-EeCc--ceEEEEeeEeeccCcccEEEeeCCeEEEEEECCceeEEEEecchhhhhcccCcccccccc Confidence 11 122223555 2222 11222 3444322 1244444321000 00 000 010111 222 11 Q ss_pred ----ceEEeccCcEEEEE-eCCEEEEecCCCcccCchhc--ccccC---cceEEEEEcCCcEEEEEcCCE--EEEEccCc Q lcl|NC_019442. 251 ----TGLCLMANGIAAGF-AGNEVMFSEAYLPYAWPEVN--RHTTA---EDIVAICPLGTSLVVATKGEP--YLFSGVSP 318 (541) Q Consensus 251 ----~gL~~m~NGi~a~f-~Gn~l~fSep~~P~awp~~y--~~t~~---~~Iv~ia~v~~~lvV~T~~~p--y~l~G~~p 318 (541) +.+|-+ .|++..- .|...+|.-...+-.-|.+| |-+-+ |.|++|....+-||++-+... |..+|..+ T Consensus 138 l~~~~dv~f~-dGyfV~~~~gt~~~~iS~l~d~~~~~~y~~fa~AE~~pD~Ivgi~~~~~~l~lfG~~TiEvw~ntG~ad 216 (472) T protein:vir:10 138 LGSVRDITRL-RGRYAWSKDGTDSWFITDLEDESHPDRYSAEYRAESQPDGIIGIGSWRDFIVCFGSSTIEYFSLTGATT 216 (472) T ss_pred ccceeEEEEe-cceEEEccCCCceEEEeccCCccccccccccccccCCCCceEEEEeeccEEEEEeccceEEEEecCCCC Confidence 122233 4664433 35555555555555566666 44433 789999999999999988776 99999874 Q ss_pred -ccceEEeec---ccccccccchheeCCccEEEecCCc----EEEEeCCCceEEEecccCChhHhhhhcC-cce----EE Q lcl|NC_019442. 319 -STISGSRIP---SMQACLSRRSMVAMEGFVLYAGTNG----LVSVDVNGNTALATEKIISPEQWQSQFN-PAS----IV 385 (541) Q Consensus 319 -~s~~~~~l~---~~~pCvs~rsiv~~~~~v~y~s~dG----Lv~~~~~G~~~~vT~~~~~~~~W~~~l~-P~t----i~ 385 (541) ......+.+ .+.+|.++.|+..+++.++|+|+|+ .|....+++++.|+-.=|+ +.|++ .+ |+. +. T Consensus 217 ~~~fpy~r~~g~~i~~Gcaa~~sv~~~~~s~~~l~~~~~g~~~V~~~~g~qa~rIST~aIE-~~i~~-y~~~e~~~A~~~ 294 (472) T protein:vir:10 217 VGAALYVAQPSLMVQKGIAGTYCKTPFADSYAFISHPATGAPSVYIIGSGQASPIATASIE-KIIRS-YTAEELATGVME 294 (472) T ss_pred cCcCceEEcCcceeeecccCcchhhecCceEEEEecCCCcccEEEEccCceeEEecCHHHH-HHHHh-cCCccccceEEE Confidence 445555554 6789999999999999999999998 2455556678777444443 47776 44 663 34 Q ss_pred EEEEcCe--EEEEEecCCCccceEEEccCCcee--EEEee-------cccEEEEEecCCEEEE--EECCEEEEecCCC-- Q lcl|NC_019442. 386 AYSWRGE--YIACYTKPDGKQDVFVFSPVNMDI--RYLST-------PFDCAWVDLAKDMMRV--VTGDKMSVLAGGS-- 450 (541) Q Consensus 386 a~~~eG~--Y~~~y~~~~g~~~~~i~d~~~~~~--~~~~~-------~~d~~~~~~~~d~LY~--~~g~~i~~~~~g~-- 450 (541) ..+.||. |+.++. + ..+.||..++.- ++... +|-+.-+....+++-+ .+++.||+|+-+. T Consensus 295 t~~~~GH~fy~LtfP--~---~Tw~yD~at~~w~erw~~~~~g~~~~~~R~~~~~~~~g~~ivGD~~nG~ly~ld~~~~t 369 (472) T protein:vir:10 295 TLRFDSHELLIIHLP--R---HVLVYDASSSQNGPQWCVLKTGLYDDVYRAVDFMYEGNQITCGDKSEALTGQLQFDISS 369 (472) T ss_pred EEEeCCeEEEEEEcC--C---eeEEEEcccCcccceeeeecCCCcccceeEEEEEeeCCeEEEEEcCCCeEEEEecccCC Confidence 5668898 888775 2 379999887632 22111 2323333333444433 2356666664431 Q ss_pred ---CceeEEEEcceEEeCcccceeEEEEeeC-----CCccEEEEEEECCceeEee------cccccCCc--ceEccCccc Q lcl|NC_019442. 451 ---LPSTIRWHSKIFSLPERTSFSCIRVKSP-----APERVGITIMADDVPVIHF------APGTFKGS--VVRLPAATG 514 (541) Q Consensus 451 ---~~~~~~WrSk~f~~~~~~~~~~~~V~~~-----~~~~~~v~~~~d~~~~~~~------~~~~~~~~--~~rLP~~~~ 514 (541) .+-.-+-.+..+.-+..--|- ..|+.. ...++-++.--||..+-+- .++..+.| -.||=..|. T Consensus 370 ~~g~~~~~~~~~p~l~~dn~R~~d-~eve~~~Gv~~~~d~v~L~wSddG~~~~~~~~~~~g~~g~~~tr~~~~RlG~~r~ 448 (472) T protein:vir:10 370 QYGLQQEHLLFTPLFKADNARCFD-LEVESSTGVAQYADRLFLSATTDGINYGREQMIEQNEPFVYDKRVIWKRVGRIRR 448 (472) T ss_pred CCCCcccceEEcccccCCCCEEEE-EeeeccCCCCCcCcEEEEEeeccccccccceeeccCCccchhcceeeeeeeeccc Confidence 111112233333333222221 334321 1122222222233322111 11111211 224444444 Q ss_pred ce-EEEEEEecceEEEEEeecchh Q lcl|NC_019442. 515 QN-WQVMVSGFGQVERITLSTSMS 537 (541) Q Consensus 515 ~~-w~iei~g~~~V~~i~la~s~~ 537 (541) +- .+|+++...+|.---+---+| T Consensus 449 ~v~f~~r~~~~~~~~l~g~~~~~E 472 (472) T protein:vir:10 449 LIGFKLRVITKSPVTLSGCQIRLE 472 (472) T ss_pred ceeEEEEEEecCcceeeeeEEeeC Confidence 43 888888999988777777777 No 21 >protein:vir:177 Length: 472 # NCBI annotation: DNA stabilization protein # Family: family:all:1540 # MgeID: mge:5 # MgeName: HK620 # Cross-refs: genbank:acc:NP_112082;genbank:gi:13559872;genbank:GeneID:920982 Probab=98.06 E-value=5.7e-06 Score=49.35 Aligned_cols=386 Identities=12% Similarity=0.112 Sum_probs=186.2 Q ss_pred CccceeeecCCCCCccceEEecCCCCCCCCCCCcccceEEEEEE-EecCCCccCccccccceeecCCCCEEEEccccCCC Q lcl|NC_019442. 112 HPASSYSLGIPAPTTAPVCTVQQGGDVSDDNPNDDETRFYTETF-VSDYGEEGPPGPASLEVTLRTPGTAVQLTLSPVPL 190 (541) Q Consensus 112 ~p~a~y~LGVp~P~~~pv~~v~~~~~~~~~~~~~~~ty~Yv~T~-V~~~GeEs~Ps~~S~~vtv~~~g~~v~l~~~p~~~ 190 (541) +|-- -||-+.- .+.+. ..+ .|+--+ |+-+=+|+.-. .|.-.....||-..-. ++++ T Consensus 1 m~~~----~~Pl~~G-~~~~~------------~~~--d~~~~~pVN~~a~~~~~~-~s~~~l~~tPGl~~~a---~v~G 57 (472) T protein:vir:17 1 MPIQ----QLPLMKG-VGKDF------------RNA--DYIDYLPVNMLATPKEIL-NSSGYLRSFPGIAKRS---DVNG 57 (472) T ss_pred CCee----eeeeccC-ceeec------------ccc--chhheeeeeeeeeccCCC-cccceeecCCCceeec---cCCc Confidence 1000 0111110 00000 001 222222 44333332222 2223333346654432 2222 Q ss_pred Cccc----c--------ceEEEEEeecCCCceeEEEEEeeccc-eEEEEeccccc-----c-c------Ccc-----chh Q lcl|NC_019442. 191 QNAS----I--------KRRRIYRSASGGGEADFLLVAELDAS-VLSYTDKIPGK-----N-L------GPS-----LAT 240 (541) Q Consensus 191 ~~~~----i--------~~~RIYRs~t~~~~~~~~lVael~~~-~~sf~D~~~~~-----~-L------~~~-----L~t 240 (541) -.+. + .+..|||-.+ =|++++.+ .-+++||...- . + +.+ .++ T Consensus 58 ~~RG~~~~~~~g~lY~V~G~~LY~v~~--------~iGsiag~grVsMa~n~~~~av~~~g~~~~Y~y~~~v~t~~~~~~ 129 (472) T protein:vir:17 58 VSRGVEYNMAQNAVYRVCGGKLYKGES--------EVGDVAGSGRVSMAHGRTSQAVGVNGQLVEYRYDGTVKTVSNWPT 129 (472) T ss_pred cccceEEEeeCCeEEEEecceEeeeec--------ceecccCcccEEEecCCcEEEEEECCceeEEEeeccchhhhcccc Confidence 1122 2 2344555111 02333211 11334432110 0 0 011 111 Q ss_pred hhhhCCCC--CcceEEeccCcEEE-EEeCCEEEEecCCCcccCchhc--cccc---CcceEEEEEcCCcEEEEEcCCE-- Q lcl|NC_019442. 241 WDYLPPPE--NMTGLCLMANGIAA-GFAGNEVMFSEAYLPYAWPEVN--RHTT---AEDIVAICPLGTSLVVATKGEP-- 310 (541) Q Consensus 241 ~~~~~pP~--~~~gL~~m~NGi~a-~f~Gn~l~fSep~~P~awp~~y--~~t~---~~~Iv~ia~v~~~lvV~T~~~p-- 310 (541) -...+..| +...+|-+ .|++. .-.|..-||.-...+-.-|.+| |-+- +|.|++|...-+-||++-+... T Consensus 130 d~~~~~~dlg~~~dv~f~-dGyfV~~~~Gt~~~~is~l~d~~~~~~y~~fa~AE~~pD~Ivgi~~~~~~i~lfG~~TiEv 208 (472) T protein:vir:17 130 DSGFTQYELGSVRDITRL-RGRYAWSKDGTDSWFITDLEDESHPDRYSAQYRAESQPDGIIGIGTWRDFIVCFGSSTIEY 208 (472) T ss_pred ccccccccccceeeeeee-cceEEEeccCcceEEEeccCCccccccccccccccCCCCceEEEEeeccEEEEEeccceEE Confidence 11122222 11234433 46644 3346666655555554455554 3222 2789999999999999988776 Q ss_pred EEEEccCccc-ceEEeec---ccccccccchheeCCccEEEecCC----cEEEEeCCCceEEEecccCChhHhhhhcC-c Q lcl|NC_019442. 311 YLFSGVSPST-ISGSRIP---SMQACLSRRSMVAMEGFVLYAGTN----GLVSVDVNGNTALATEKIISPEQWQSQFN-P 381 (541) Q Consensus 311 y~l~G~~p~s-~~~~~l~---~~~pCvs~rsiv~~~~~v~y~s~d----GLv~~~~~G~~~~vT~~~~~~~~W~~~l~-P 381 (541) |..+|..+-. .-.++.+ .+.+|.++.|+..+++.++|+|+| |.|-...+++++.||-.=|+ +.|++ .+ | T Consensus 209 w~ntG~a~~~~fpy~r~~g~~iq~Gcaa~~sv~~~~~t~~~l~~d~~g~~~V~~~~g~~~~rIST~aIE-~~i~~-y~~~ 286 (472) T protein:vir:17 209 FSLTGATTVGAALYVAQPSLMVQKGIAGTYCKTPFADSYAFISNPATGAPSVYIIGSGQVSPISSASIE-KILRS-YTAD 286 (472) T ss_pred EEeeCCCCCCcCceeecCcceeeecccCcchhhecCceEEEEecCCccccEEEEccCceeEEecCHHHH-HHHHh-cCCc Confidence 9999998843 4555554 678999999999999999999998 88888888888888444443 47775 54 5 Q ss_pred ce----EEEEEEcCe--EEEEEecCCCccceEEEccCCce--eEEEee-------cccEEEEEecCCEEEE--EECCEEE Q lcl|NC_019442. 382 AS----IVAYSWRGE--YIACYTKPDGKQDVFVFSPVNMD--IRYLST-------PFDCAWVDLAKDMMRV--VTGDKMS 444 (541) Q Consensus 382 ~t----i~a~~~eG~--Y~~~y~~~~g~~~~~i~d~~~~~--~~~~~~-------~~d~~~~~~~~d~LY~--~~g~~i~ 444 (541) +- ....+.||. |+.++.+ ..+.||..++. .++... +|-+.-+-...+++.+ .+++.|| T Consensus 287 e~~dA~~~t~~~~GH~fy~LtfP~-----~Tw~yD~~t~~Wherw~~~~~g~~~~~~Ra~~~~~~~g~~~vGD~~ng~ly 361 (472) T protein:vir:17 287 ELADGVMESLRFDAHELLIIHLPR-----HVLVYDASSSANGPQWCVLKTGLYDDVYRAIDFIYEGNQITCGDKLESVTG 361 (472) T ss_pred cccceeEEEEEeCCeEEEEEEcCC-----ceeEeecccccCceeeeeecCCCccCceEEEEEEEeCCeEEEEEcCCCeEE Confidence 43 345668899 8888762 47999988764 232221 2323333344566666 3578888 Q ss_pred EecCC--CCceeEEE---EcceEEeCcccceeEEEEee-----CCCccEEEEEEECCcee------EeecccccCCc--c Q lcl|NC_019442. 445 VLAGG--SLPSTIRW---HSKIFSLPERTSFSCIRVKS-----PAPERVGITIMADDVPV------IHFAPGTFKGS--V 506 (541) Q Consensus 445 ~~~~g--~~~~~~~W---rSk~f~~~~~~~~~~~~V~~-----~~~~~~~v~~~~d~~~~------~~~~~~~~~~~--~ 506 (541) +|+-. ......+| .++.|..+..-=|- +.++. ..+.++-+.--.|+... ..-+++..+.| - T Consensus 362 ~ld~~~~td~g~pi~~~~~~p~~~~~~~RV~d-~el~~~tG~~~~adp~~l~~~sDg~~~g~~~~~~~~~~g~~~~R~~~ 440 (472) T protein:vir:17 362 KLQFDISSQYDKQQEHLLFTPLFKADNARVFD-LEVESSTGVAQYADRLFLSATTDGINYGREQMIEQNEPFVYDKRVLW 440 (472) T ss_pred EEcccCcCCCCceeEEEEecceeeCCCceEEE-EEEeeeCCcccCCCceEEEcccCCcccchhhhhhhccCcccccceee Confidence 88553 22223333 34445444431111 11111 11222223233452110 00011222222 2 Q ss_pred eEccCcccce-EEEEEEecceEEEEEeecchhhcC Q lcl|NC_019442. 507 VRLPAATGQN-WQVMVSGFGQVERITLSTSMSEMP 540 (541) Q Consensus 507 ~rLP~~~~~~-w~iei~g~~~V~~i~la~s~~EL~ 540 (541) .||=..|.+- .+|+++...+|. -++.+++ +- T Consensus 441 ~RlG~~r~~v~f~~~~~~~~~~~--l~~a~~~-~e 472 (472) T protein:vir:17 441 KRVGRIRKNVGFKLRVITKSPVT--LSGCQIR-IE 472 (472) T ss_pred eeeeeccccceEEEEEeecccce--eeeeEEE-eC Confidence 2444444443 788888888876 1222211 11 No 22 >protein:vir:105428 Length: 472 # NCBI annotation: gene 8 protein # Family: family:all:1540 # MgeID: mge:1556 # MgeName: Sf6 # Cross-refs: genbank:acc:NP_958184;genbank:gi:41057286;genbank:GeneID:2716675 Probab=97.93 E-value=1.1e-05 Score=47.86 Aligned_cols=386 Identities=12% Similarity=0.104 Sum_probs=187.6 Q ss_pred CccceeeecCCCCCccceEEecCCCCCCCCCCCcccceEEEEEE-EecCCCccCccccccceeecCCCCEEEEccccCCC Q lcl|NC_019442. 112 HPASSYSLGIPAPTTAPVCTVQQGGDVSDDNPNDDETRFYTETF-VSDYGEEGPPGPASLEVTLRTPGTAVQLTLSPVPL 190 (541) Q Consensus 112 ~p~a~y~LGVp~P~~~pv~~v~~~~~~~~~~~~~~~ty~Yv~T~-V~~~GeEs~Ps~~S~~vtv~~~g~~v~l~~~p~~~ 190 (541) +|-- -||-+.- .+.+. ..+ .|+--+ |+-+=+|+.-.- |.-.....||-..-.. .+=+. T Consensus 1 m~~~----~~pl~~G-~~~~~------------~~~--d~~~~~pVN~~a~~~~~~~-s~~~l~~tPGl~~~a~-v~G~~ 59 (472) T protein:vir:10 1 MPIQ----QLPLMKG-VGKDF------------RNA--DYIDYLPVNMLATPKEILN-SSGYLRSFPGIAKRSD-VNGVS 59 (472) T ss_pred CCee----eeeeccC-ceeec------------ccc--chhheeeeeeeeeccCCCc-ccceeecCCCceeecc-CCccc Confidence 1000 0111110 00000 000 122222 443333322222 2233333466544321 11111 Q ss_pred Cc--cc--------cceEEEEEeecCCCceeEEEEEeeccc-eEEEEecc------------------cccccCccchhh Q lcl|NC_019442. 191 QN--AS--------IKRRRIYRSASGGGEADFLLVAELDAS-VLSYTDKI------------------PGKNLGPSLATW 241 (541) Q Consensus 191 ~~--~~--------i~~~RIYRs~t~~~~~~~~lVael~~~-~~sf~D~~------------------~~~~L~~~L~t~ 241 (541) -| .+ |.+..|||-.+. |++++.+ .-+++||. ....+ ...++- T Consensus 60 RG~~~~~~~g~lY~V~G~~LY~v~~~--------iGsiag~grVsMa~n~~~~av~~~g~~~~Y~yd~~v~t~-~~~~~d 130 (472) T protein:vir:10 60 RGVEYNMAQNAVYRVCGGKLYKGESE--------VGDVAGSGRVSMAHGRTSQAVGVNGQLVEYRYDGTVKTV-SNWPTD 130 (472) T ss_pred cceEEEeeCCeEEEEecceEeeeecc--------eecccCcccEEEecCCcEEEEEECCceeEEEeeccchhh-hccccc Confidence 11 11 234455551110 2222211 11333332 11110 011111 Q ss_pred hhhCCCC--CcceEEeccCcEEE-EEeCCEEEEecCCCcccCchhc--cccc---CcceEEEEEcCCcEEEEEcCCE--E Q lcl|NC_019442. 242 DYLPPPE--NMTGLCLMANGIAA-GFAGNEVMFSEAYLPYAWPEVN--RHTT---AEDIVAICPLGTSLVVATKGEP--Y 311 (541) Q Consensus 242 ~~~~pP~--~~~gL~~m~NGi~a-~f~Gn~l~fSep~~P~awp~~y--~~t~---~~~Iv~ia~v~~~lvV~T~~~p--y 311 (541) ...+..| +...+|-+ .|++. .-.|..-||.-...+-.-|.+| |-+- +|.|++|...-+-||++-+... | T Consensus 131 ~~~p~~dlg~~~dv~f~-dGyfV~~~~Gt~~~~is~l~d~~~~~~y~~fa~AE~~pD~Ivgi~~~~~~i~lfG~~TiEvw 209 (472) T protein:vir:10 131 SGFTQYELGSVRDITRL-RGRYAWSKDGTDSWFITDLEDESHPDRYSAQYRAESQPDGIIGIGTWRDFIVCFGSSTIEYF 209 (472) T ss_pred cccccccccceeeeeee-cceEEEeccCcceEEEeccCCccccccccccccccCCCCceEEEEeeccEEEEEeccceEEE Confidence 1222222 11234433 46644 3346666655555554455554 2222 2789999999999999988776 9 Q ss_pred EEEccCcc-cceEEee---cccccccccchheeCCccEEEecCC----cEEEEeCCCceEEEecccCChhHhhhhcC-cc Q lcl|NC_019442. 312 LFSGVSPS-TISGSRI---PSMQACLSRRSMVAMEGFVLYAGTN----GLVSVDVNGNTALATEKIISPEQWQSQFN-PA 382 (541) Q Consensus 312 ~l~G~~p~-s~~~~~l---~~~~pCvs~rsiv~~~~~v~y~s~d----GLv~~~~~G~~~~vT~~~~~~~~W~~~l~-P~ 382 (541) ..+|.... ....++. ..+.+|.++.|+..+++.++|+|+| |.|-...+++++.|+-.=|+ +.|++ .+ |+ T Consensus 210 ~ntG~a~~~~fpy~r~~g~~iq~Gcaa~~sv~~~~~t~~~l~~d~~g~~~V~~~~g~~~~rIST~aIE-~~i~~-y~~~e 287 (472) T protein:vir:10 210 SLTGATTAGAALYVAQPSLMVQKGIAGTYCKTPFADSYAFISNPATGAPSVYIIGSGQVSPIASASIE-KILRS-YTADE 287 (472) T ss_pred EecCCCCcccCceeecccceeeecccCcchhhecCceEEEEecCCccccEEEEccCceeEEecCHHHH-HHHHh-cCCcc Confidence 99996553 4555554 3577999999999999999999998 88888888888888444443 47775 54 54 Q ss_pred e----EEEEEEcCe--EEEEEecCCCccceEEEccCCce--eEEEee-------cccEEEEEecCCEEEE--EECCEEEE Q lcl|NC_019442. 383 S----IVAYSWRGE--YIACYTKPDGKQDVFVFSPVNMD--IRYLST-------PFDCAWVDLAKDMMRV--VTGDKMSV 445 (541) Q Consensus 383 t----i~a~~~eG~--Y~~~y~~~~g~~~~~i~d~~~~~--~~~~~~-------~~d~~~~~~~~d~LY~--~~g~~i~~ 445 (541) - ....+.||. |+.++.+ ..+.||..++. .++... +|-+.-+-...+++.+ .+++.||+ T Consensus 288 ~~dA~~~t~~~~GH~fy~LtfP~-----~Tw~yD~~t~~Wherw~~~~~g~~~~~~Ra~~~~~~~g~~~vGD~~ng~ly~ 362 (472) T protein:vir:10 288 LADGVMESLRFDAHELLIIHLPR-----HVLVYDASSSANGPQWCVLKTGLYDDVYRAIDFIYEGNQITCGDKLESVTGK 362 (472) T ss_pred ccceeEEEEEeCCeEEEEEEcCC-----ceeEeecccccCceeeeeecCCCccCceEEEEEEEeCCeEEEEEcCCCeEEE Confidence 3 345668899 8888762 47999988764 232221 2323333344566666 34788888 Q ss_pred ecCCC-----CceeEEEEcceEEeCcccceeEEEEeeC-----CCccEEEEEEECCceeEee-------cccccCCc--c Q lcl|NC_019442. 446 LAGGS-----LPSTIRWHSKIFSLPERTSFSCIRVKSP-----APERVGITIMADDVPVIHF-------APGTFKGS--V 506 (541) Q Consensus 446 ~~~g~-----~~~~~~WrSk~f~~~~~~~~~~~~V~~~-----~~~~~~v~~~~d~~~~~~~-------~~~~~~~~--~ 506 (541) |+-.. .+...+-.++.|..+..--|- +.|+.. ...++-+.--.||.. ... +++..+.| - T Consensus 363 l~~~~~td~G~~i~~~~~~p~~~~d~~Rv~d-~~ve~~~G~~~~adp~~~~~~sDg~~-~g~~~~~~~~~~g~~~~R~~~ 440 (472) T protein:vir:10 363 LQFDISSQYGLQQEHLLFTPLFKADNARCFD-LEVESSTGVAQYADRLFLSATTDGIN-YGREQMIEQNEPFVYDKRVLW 440 (472) T ss_pred EcccCcCcCCCcceEEEeccceeCCCCeEEE-EEEEeecCCCcccCceEEEeccCCcc-cchhhhhhhccCcccccceee Confidence 85531 233444456666555433232 233311 123333333345211 111 11222222 2 Q ss_pred eEccCcccce-EEEEEEecceEEEEEeecchhhcC Q lcl|NC_019442. 507 VRLPAATGQN-WQVMVSGFGQVERITLSTSMSEMP 540 (541) Q Consensus 507 ~rLP~~~~~~-w~iei~g~~~V~~i~la~s~~EL~ 540 (541) .||=..|.+- .+|+++-..+|. |..-.-+|- T Consensus 441 ~RlG~~r~~vgf~~r~~~~~~v~---l~ga~~~~e 472 (472) T protein:vir:10 441 KRVGRIRKNVGFKLRVITKSPVT---LSGAQIRIE 472 (472) T ss_pred eeeeeccccceEEEEEEeccccc---eeeeeEEeC Confidence 2443444453 788888888877 211111111 No 23 >protein:vir:107423 Length: 681 # NCBI annotation: Bbp13 # Family: family:all:780 # MgeID: mge:1537 # MgeName: BPP-1 # Cross-refs: genbank:acc:NP_958682;genbank:gi:41179374;genbank:GeneID:2717217 Probab=97.88 E-value=1.3e-05 Score=47.38 Aligned_cols=483 Identities=14% Similarity=0.104 Sum_probs=204.2 Q ss_pred CceEEeccccc-----------------ccccccceecccccceEEEEeeecCCeeeeeecccccCccccccceeEEEEC Q lcl|NC_019442. 1 MPYIDITTMRG-----------------MMPRVVTSMLPEHSAVLAEDCHFRFGVITPERQISGVEKTFTIKPKTIFHYR 63 (541) Q Consensus 1 m~~i~i~~f~G-----------------~~Pr~~p~llp~~~a~~a~N~~~~~G~l~P~~~~~~v~~~~~~~~~Tif~~~ 63 (541) |=-.-+.-.|| ..-||.|=...++|+ .=..|-+|.|+=++....+-. .+.|..| T Consensus 36 ~~N~~~~~~G~~~~R~g~~~~~~~~~~~~~~rlipf~~~~~~~---~~l~~g~~~~r~~~~~~~~~~--~~~~~~~---- 106 (681) T protein:vir:10 36 CRNFVVKPQGPAENRAGFAFVREVKDSAKKVRLIPFTYSVTQT---MVIELGAGYFRFHTNGGTLLD--GAVPYEI---- 106 (681) T ss_pred hcCcEEEecCCceecChhHhhhhcCCCCCcEEEEEEEeCCCce---EEEEEeCCeEEEEeCCcEEee--CcEeEEe---- Confidence 11111111111 123444444444433 234456666666654332211 1111111 Q ss_pred CcEEEEeC-CeEEEeeCCcccCCCCeEEEeCCCCcceeecceeeccccCCccceeeec---CC-CCCccceEEecCCCCC Q lcl|NC_019442. 64 DDFWFAWP-DVVDVIRSPIAQDPHGRIYYTDGRFPKVTDATIATKGDGNHPASSYSLG---IP-APTTAPVCTVQQGGDV 138 (541) Q Consensus 64 ~~~W~~w~-~~V~vv~spia~D~~~Rvy~t~~~~pk~t~~~ia~~g~g~~p~a~y~LG---Vp-~P~~~pv~~v~~~~~~ 138 (541) ..-|.+=. ...++.- ..| .+|++.-..|+...-. .++ ..| +|. +. .|.++...+.+.. T Consensus 107 ~tpy~~~~l~~l~~~q---~aD---~~~i~h~~~~p~~L~r---~~~----~~W-~l~~~~f~~~p~~p~~~~at~~--- 169 (681) T protein:vir:10 107 ANPYAEADLFNIHYVQ---SAD---VLTLVHPNYAPRELRR---LGA----TNW-QLATIAFTSPVATPTSVTATSN--- 169 (681) T ss_pred cCCCChhhhcCceEEE---EcC---EEEEECCCCcceEEEE---ccC----Cce-EEEEEEeccccccceeeeeecc--- Confidence 00011100 0111111 223 7788776666543221 122 233 221 22 2222222222111 Q ss_pred CCCCCCcccceEEEEEEEecCC-CccCccccccceeec--CCCCEEEEccccCCCCccccceEEEEEeecCCCceeEEEE Q lcl|NC_019442. 139 SDDNPNDDETRFYTETFVSDYG-EEGPPGPASLEVTLR--TPGTAVQLTLSPVPLQNASIKRRRIYRSASGGGEADFLLV 215 (541) Q Consensus 139 ~~~~~~~~~ty~Yv~T~V~~~G-eEs~Ps~~S~~vtv~--~~g~~v~l~~~p~~~~~~~i~~~RIYRs~t~~~~~~~~lV 215 (541) .....-+++|.++=|+..+ .|+.++.. ..++.. ..+...++...+ ..+. ...|||+. .+.-+-++ T Consensus 170 ---~~~~~~t~~~~v~avda~t~~~s~~~~~-~tvt~~~~~~~~~~t~~w~a--~~g~--~~~~V~~~----~~gi~g~i 237 (681) T protein:vir:10 170 ---NKGTDYTYRYVVTALDAEGKTESAPSSA-GTCTNNLFTNGGANTIAWSA--SSGA--SRYNVYKE----QGGLYGYI 237 (681) T ss_pred ---CCccceeEeEEEEEeecccceeecCCcc-eEEeeeeecCCcceeEEEEe--cCCc--eeeeeccc----ceeEEEEe Confidence 1122346788888888765 47766642 222221 122223333222 2222 35677763 22234444 Q ss_pred EeeccceEEEEecccccccCccchhhhhhCCCCC-cc---eEEeccCcEEEEE----eCCEEEEecCCCcccCchhcc-- Q lcl|NC_019442. 216 AELDASVLSYTDKIPGKNLGPSLATWDYLPPPEN-MT---GLCLMANGIAAGF----AGNEVMFSEAYLPYAWPEVNR-- 285 (541) Q Consensus 216 ael~~~~~sf~D~~~~~~L~~~L~t~~~~~pP~~-~~---gL~~m~NGi~a~f----~Gn~l~fSep~~P~awp~~y~-- 285 (541) .. ....++.++.-..... .+.+|...|-. .+ ..+.+.++++.-. ..+.||+|.+..++.|-.+-- T Consensus 238 g~--~~~~~~~~~~~~~~~~---~t~~~~~~~~~~~~gyP~~v~f~q~RL~f~~~~~~p~~v~~Srsgdy~nF~~~~~~~ 312 (681) T protein:vir:10 238 GQ--TTGTSLVDDNIAPDLS---VTPPIYDAVFNAAGDYPAAVSYFEQRRCFAGTTNKPQNIWMTRSGTESAMSYSLPVR 312 (681) T ss_pred ec--cceeeeeecccccCcc---ccccccccccccCCCceEEEEEEcceEEEeeCCCCCcEEEEEcccCcccccccCCCC Confidence 32 2223444443222221 12234333221 11 2334445555443 347899999999999732111 Q ss_pred ------cccC----cceEEEEEcCCcEEEEEcCCEEEEEc-----cCcccceEEeecccccccccchheeCCccEEEecC Q lcl|NC_019442. 286 ------HTTA----EDIVAICPLGTSLVVATKGEPYLFSG-----VSPSTISGSRIPSMQACLSRRSMVAMEGFVLYAGT 350 (541) Q Consensus 286 ------~t~~----~~Iv~ia~v~~~lvV~T~~~py~l~G-----~~p~s~~~~~l~~~~pCvs~rsiv~~~~~v~y~s~ 350 (541) +++. ..|.-+.+++ .|+|+|.+.-|+|++ .+|.+..+. +....+|- .=..+.+|+.++|+++ T Consensus 313 ddD~i~~~~~~~~~~~i~~~v~~~-~lli~t~~~e~~l~~~~~~~lTP~~~~~~-~~s~~g~~-~~~Pv~vg~~v~fv~~ 389 (681) T protein:vir:10 313 DDDRVAFRVAAREANAIRHIVPLT-ELLLLTSSGEWRVASVNSDAVTPTTISVR-PQSYVGAT-DVQPVVVNNTTIYGAA 389 (681) T ss_pred CCccEEEEEcCCcceeEEEEEecC-cEEEEEcCcEEEEecCCCccccceeEEEE-Eeeeeccc-cccceeeCCeEEEEec Confidence 2222 3377788885 599999999999987 455565444 34456774 3457889999999999 Q ss_pred Cc-----EEEEeCCCceE--EEe---cccCChhHhhhhcCcceEEEEEEcCeEEEEEecCCCccceEEEccCCceeEEEe Q lcl|NC_019442. 351 NG-----LVSVDVNGNTA--LAT---EKIISPEQWQSQFNPASIVAYSWRGEYIACYTKPDGKQDVFVFSPVNMDIRYLS 420 (541) Q Consensus 351 dG-----Lv~~~~~G~~~--~vT---~~~~~~~~W~~~l~P~ti~a~~~eG~Y~~~y~~~~g~~~~~i~d~~~~~~~~~~ 420 (541) .| +..-..++..+ -+| ..|+.- . .-.-+|++-+..-+.++...+|+.-++.|+...+..-|.. T Consensus 390 ~g~~vre~~y~~~~d~~~~~dlt~~a~Hl~~~------~-~i~~~a~~~~p~~~~~~v~~dg~l~~~ty~~eq~v~aW~~ 462 (681) T protein:vir:10 390 RGGHVRELAYNWQANGFVTGDLSLRAAHLFDN------L-DILDMAYAKAPQPIVWFISSSGKLLGLTYVPEQQIGAWHQ 462 (681) T ss_pred CCCEEEEEEEeeecCceeccchhhhhhhhcCC------C-CeEEEEEecCCCEEEEEEecCCcEEEEEEecccceeeEEE Confidence 99 22222222111 122 333321 1 1111455556666667777788877788886655444444 Q ss_pred ecccEEEEE------ecCCEEEEEECC-----EEEEecCCCCceeEEEEcceEEeCcccceeEE---EEeeC-CCccEEE Q lcl|NC_019442. 421 TPFDCAWVD------LAKDMMRVVTGD-----KMSVLAGGSLPSTIRWHSKIFSLPERTSFSCI---RVKSP-APERVGI 485 (541) Q Consensus 421 ~~~d~~~~~------~~~d~LY~~~g~-----~i~~~~~g~~~~~~~WrSk~f~~~~~~~~~~~---~V~~~-~~~~~~v 485 (541) ..++-.+.+ -..|.||++..+ ...-.+.=+ .....++..-|.+...+++..- .+.+- -.+--.+ T Consensus 463 ~~~~g~v~~v~~i~~~~~d~l~~vv~r~~~g~~~~yie~~~-~~~~~~~~~~~~vD~~~t~~~~~~~~~sgl~~leG~tv 541 (681) T protein:vir:10 463 HDTDGVFESCAVVAEGNEDRLYAVVRRTIGGNEVRYVERMA-SRQFDAQADAFFVDSGLTYSGEPVSHISGLEHLEGKTV 541 (681) T ss_pred EecCCcEEEEEEecCCCCcEEEEEEEecCCCCeEEEEEecC-CccccccccceEeeccccccCcceeeeccccCCCCcEE Confidence 333333222 236799997521 111111111 1223344445555555443221 11110 0122234 Q ss_pred EEEECCceeEeecccccCCcceEccCcc-----cceEEEEEE---------------ecceEEEEEee--cchhhcCC Q lcl|NC_019442. 486 TIMADDVPVIHFAPGTFKGSVVRLPAAT-----GQNWQVMVS---------------GFGQVERITLS--TSMSEMPV 541 (541) Q Consensus 486 ~~~~d~~~~~~~~~~~~~~~~~rLP~~~-----~~~w~iei~---------------g~~~V~~i~la--~s~~EL~~ 541 (541) .+..||... .+.+..+..+.|+..- +.....+++ ...+|.++.|- .|.. +-+ T Consensus 542 ~i~aDG~~~---~~~~V~~G~itl~~~~~~v~VGl~Y~s~i~~lp~~~~~~~g~~~g~~~ri~rv~lr~~~S~g-~~~ 615 (681) T protein:vir:10 542 SILADGAVH---PQRVVTDGAIDLDVEAGTVHIGLPITAELQTLPVAMQLDGSFGQGRVKNINKLWLRVHRSSG-IFA 615 (681) T ss_pred EEEeCCeec---CcEeecCcEEEeCcCCceEEEeeeceeEEEecceeeecCCcccCCceEEEEEEEEEEEcccc-eEE Confidence 555666432 1111112222333110 011111110 01222222221 1100 000 No 24 >protein:vir:98487 Length: 681 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:780 # MgeID: mge:1592 # MgeName: BMP-1 # Cross-refs: genbank:acc:NP_996575;genbank:gi:45569506;genbank:GeneID:2767815 Probab=97.88 E-value=1.3e-05 Score=47.38 Aligned_cols=483 Identities=14% Similarity=0.104 Sum_probs=204.2 Q ss_pred CceEEeccccc-----------------ccccccceecccccceEEEEeeecCCeeeeeecccccCccccccceeEEEEC Q lcl|NC_019442. 1 MPYIDITTMRG-----------------MMPRVVTSMLPEHSAVLAEDCHFRFGVITPERQISGVEKTFTIKPKTIFHYR 63 (541) Q Consensus 1 m~~i~i~~f~G-----------------~~Pr~~p~llp~~~a~~a~N~~~~~G~l~P~~~~~~v~~~~~~~~~Tif~~~ 63 (541) |=-.-+.-.|| ..-||.|=...++|+ .=..|-+|.|+=++....+-. .+.|..| T Consensus 36 ~~N~~~~~~G~~~~R~g~~~~~~~~~~~~~~rlipf~~~~~~~---~~l~~g~~~~r~~~~~~~~~~--~~~~~~~---- 106 (681) T protein:vir:98 36 CRNFVVKPQGPAENRAGFAFVREVKDSAKKVRLIPFTYSVTQT---MVIELGAGYFRFHTNGGTLLD--GAVPYEI---- 106 (681) T ss_pred hcCcEEEecCCceecChhHhhhhcCCCCCcEEEEEEEeCCCce---EEEEEeCCeEEEEeCCcEEee--CcEeEEe---- Confidence 11111111111 123444444444433 234456666666654332211 1111111 Q ss_pred CcEEEEeC-CeEEEeeCCcccCCCCeEEEeCCCCcceeecceeeccccCCccceeeec---CC-CCCccceEEecCCCCC Q lcl|NC_019442. 64 DDFWFAWP-DVVDVIRSPIAQDPHGRIYYTDGRFPKVTDATIATKGDGNHPASSYSLG---IP-APTTAPVCTVQQGGDV 138 (541) Q Consensus 64 ~~~W~~w~-~~V~vv~spia~D~~~Rvy~t~~~~pk~t~~~ia~~g~g~~p~a~y~LG---Vp-~P~~~pv~~v~~~~~~ 138 (541) ..-|.+=. ...++.- ..| .+|++.-..|+...-. .++ ..| +|. +. .|.++...+.+.. T Consensus 107 ~tpy~~~~l~~l~~~q---~aD---~~~i~h~~~~p~~L~r---~~~----~~W-~l~~~~f~~~p~~p~~~~at~~--- 169 (681) T protein:vir:98 107 ANPYAEADLFNIHYVQ---SAD---VLTLVHPNYAPRELRR---LGA----TNW-QLATIAFTSPVATPTSVTATSN--- 169 (681) T ss_pred cCCCChhhhcCceEEE---EcC---EEEEECCCCcceEEEE---ccC----Cce-EEEEEEeccccccceeeeeecc--- Confidence 00011100 0111111 223 7788776666543221 122 233 221 22 2222222222111 Q ss_pred CCCCCCcccceEEEEEEEecCC-CccCccccccceeec--CCCCEEEEccccCCCCccccceEEEEEeecCCCceeEEEE Q lcl|NC_019442. 139 SDDNPNDDETRFYTETFVSDYG-EEGPPGPASLEVTLR--TPGTAVQLTLSPVPLQNASIKRRRIYRSASGGGEADFLLV 215 (541) Q Consensus 139 ~~~~~~~~~ty~Yv~T~V~~~G-eEs~Ps~~S~~vtv~--~~g~~v~l~~~p~~~~~~~i~~~RIYRs~t~~~~~~~~lV 215 (541) .....-+++|.++=|+..+ .|+.++.. ..++.. ..+...++...+ ..+. ...|||+. .+.-+-++ T Consensus 170 ---~~~~~~t~~~~v~avda~t~~~s~~~~~-~tvt~~~~~~~~~~t~~w~a--~~g~--~~~~V~~~----~~gi~g~i 237 (681) T protein:vir:98 170 ---NKGTDYTYRYVVTALDAEGKTESAPSSA-GTCTNNLFTNGGANTIAWSA--SSGA--SRYNVYKE----QGGLYGYI 237 (681) T ss_pred ---CCccceeEeEEEEEeecccceeecCCcc-eEEeeeeecCCcceeEEEEe--cCCc--eeeeeccc----ceeEEEEe Confidence 1122346788888888765 47766642 222221 122223333222 2222 35677763 22234444 Q ss_pred EeeccceEEEEecccccccCccchhhhhhCCCCC-cc---eEEeccCcEEEEE----eCCEEEEecCCCcccCchhcc-- Q lcl|NC_019442. 216 AELDASVLSYTDKIPGKNLGPSLATWDYLPPPEN-MT---GLCLMANGIAAGF----AGNEVMFSEAYLPYAWPEVNR-- 285 (541) Q Consensus 216 ael~~~~~sf~D~~~~~~L~~~L~t~~~~~pP~~-~~---gL~~m~NGi~a~f----~Gn~l~fSep~~P~awp~~y~-- 285 (541) .. ....++.++.-..... .+.+|...|-. .+ ..+.+.++++.-. ..+.||+|.+..++.|-.+-- T Consensus 238 g~--~~~~~~~~~~~~~~~~---~t~~~~~~~~~~~~gyP~~v~f~q~RL~f~~~~~~p~~v~~Srsgdy~nF~~~~~~~ 312 (681) T protein:vir:98 238 GQ--TTGTSLVDDNIAPDLS---VTPPIYDAVFNAAGDYPAAVSYFEQRRCFAGTTNKPQNIWMTRSGTESAMSYSLPVR 312 (681) T ss_pred ec--cceeeeeecccccCcc---ccccccccccccCCCceEEEEEEcceEEEeeCCCCCcEEEEEcccCcccccccCCCC Confidence 32 2223444443222221 12234333221 11 2334445555443 347899999999999732111 Q ss_pred ------cccC----cceEEEEEcCCcEEEEEcCCEEEEEc-----cCcccceEEeecccccccccchheeCCccEEEecC Q lcl|NC_019442. 286 ------HTTA----EDIVAICPLGTSLVVATKGEPYLFSG-----VSPSTISGSRIPSMQACLSRRSMVAMEGFVLYAGT 350 (541) Q Consensus 286 ------~t~~----~~Iv~ia~v~~~lvV~T~~~py~l~G-----~~p~s~~~~~l~~~~pCvs~rsiv~~~~~v~y~s~ 350 (541) +++. ..|.-+.+++ .|+|+|.+.-|+|++ .+|.+..+. +....+|- .=..+.+|+.++|+++ T Consensus 313 ddD~i~~~~~~~~~~~i~~~v~~~-~lli~t~~~e~~l~~~~~~~lTP~~~~~~-~~s~~g~~-~~~Pv~vg~~v~fv~~ 389 (681) T protein:vir:98 313 DDDRVAFRVAAREANAIRHIVPLT-ELLLLTSSGEWRVASVNSDAVTPTTISVR-PQSYVGAT-DVQPVVVNNTTIYGAA 389 (681) T ss_pred CCccEEEEEcCCcceeEEEEEecC-cEEEEEcCcEEEEecCCCccccceeEEEE-Eeeeeccc-cccceeeCCeEEEEec Confidence 2222 3377788885 599999999999987 455565444 34456774 3457889999999999 Q ss_pred Cc-----EEEEeCCCceE--EEe---cccCChhHhhhhcCcceEEEEEEcCeEEEEEecCCCccceEEEccCCceeEEEe Q lcl|NC_019442. 351 NG-----LVSVDVNGNTA--LAT---EKIISPEQWQSQFNPASIVAYSWRGEYIACYTKPDGKQDVFVFSPVNMDIRYLS 420 (541) Q Consensus 351 dG-----Lv~~~~~G~~~--~vT---~~~~~~~~W~~~l~P~ti~a~~~eG~Y~~~y~~~~g~~~~~i~d~~~~~~~~~~ 420 (541) .| +..-..++..+ -+| ..|+.- . .-.-+|++-+..-+.++...+|+.-++.|+...+..-|.. T Consensus 390 ~g~~vre~~y~~~~d~~~~~dlt~~a~Hl~~~------~-~i~~~a~~~~p~~~~~~v~~dg~l~~~ty~~eq~v~aW~~ 462 (681) T protein:vir:98 390 RGGHVRELAYNWQANGFVTGDLSLRAAHLFDN------L-DILDMAYAKAPQPIVWFISSSGKLLGLTYVPEQQIGAWHQ 462 (681) T ss_pred CCCEEEEEEEeeecCceeccchhhhhhhhcCC------C-CeEEEEEecCCCEEEEEEecCCcEEEEEEecccceeeEEE Confidence 99 22222222111 122 333321 1 1111455556666667777788877788886655444444 Q ss_pred ecccEEEEE------ecCCEEEEEECC-----EEEEecCCCCceeEEEEcceEEeCcccceeEE---EEeeC-CCccEEE Q lcl|NC_019442. 421 TPFDCAWVD------LAKDMMRVVTGD-----KMSVLAGGSLPSTIRWHSKIFSLPERTSFSCI---RVKSP-APERVGI 485 (541) Q Consensus 421 ~~~d~~~~~------~~~d~LY~~~g~-----~i~~~~~g~~~~~~~WrSk~f~~~~~~~~~~~---~V~~~-~~~~~~v 485 (541) ..++-.+.+ -..|.||++..+ ...-.+.=+ .....++..-|.+...+++..- .+.+- -.+--.+ T Consensus 463 ~~~~g~v~~v~~i~~~~~d~l~~vv~r~~~g~~~~yie~~~-~~~~~~~~~~~~vD~~~t~~~~~~~~~sgl~~leG~tv 541 (681) T protein:vir:98 463 HDTDGVFESCAVVAEGNEDRLYAVVRRTIGGNEVRYVERMA-SRQFDAQADAFFVDSGLTYSGEPVSHISGLEHLEGKTV 541 (681) T ss_pred EecCCcEEEEEEecCCCCcEEEEEEEecCCCCeEEEEEecC-CccccccccceEeeccccccCcceeeeccccCCCCcEE Confidence 333333222 236799997521 111111111 1223344445555555443221 11110 0122234 Q ss_pred EEEECCceeEeecccccCCcceEccCcc-----cceEEEEEE---------------ecceEEEEEee--cchhhcCC Q lcl|NC_019442. 486 TIMADDVPVIHFAPGTFKGSVVRLPAAT-----GQNWQVMVS---------------GFGQVERITLS--TSMSEMPV 541 (541) Q Consensus 486 ~~~~d~~~~~~~~~~~~~~~~~rLP~~~-----~~~w~iei~---------------g~~~V~~i~la--~s~~EL~~ 541 (541) .+..||... .+.+..+..+.|+..- +.....+++ ...+|.++.|- .|.. +-+ T Consensus 542 ~i~aDG~~~---~~~~V~~G~itl~~~~~~v~VGl~Y~s~i~~lp~~~~~~~g~~~g~~~ri~rv~lr~~~S~g-~~~ 615 (681) T protein:vir:98 542 SILADGAVH---PQRVVTDGAIDLDVEAGTVHIGLPITAELQTLPVAMQLDGSFGQGRVKNINKLWLRVHRSSG-IFA 615 (681) T ss_pred EEEeCCeec---CcEeecCcEEEeCcCCceEEEeeeceeEEEecceeeecCCcccCCceEEEEEEEEEEEcccc-eEE Confidence 555666432 1111112222333110 011111110 01222222221 1100 000 No 25 >protein:vir:107802 Length: 681 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:780 # MgeID: mge:1673 # MgeName: BIP-1 # Cross-refs: genbank:acc:NP_996623;genbank:gi:45580757;genbank:GeneID:2767878 Probab=97.88 E-value=1.3e-05 Score=47.38 Aligned_cols=483 Identities=14% Similarity=0.104 Sum_probs=204.2 Q ss_pred CceEEeccccc-----------------ccccccceecccccceEEEEeeecCCeeeeeecccccCccccccceeEEEEC Q lcl|NC_019442. 1 MPYIDITTMRG-----------------MMPRVVTSMLPEHSAVLAEDCHFRFGVITPERQISGVEKTFTIKPKTIFHYR 63 (541) Q Consensus 1 m~~i~i~~f~G-----------------~~Pr~~p~llp~~~a~~a~N~~~~~G~l~P~~~~~~v~~~~~~~~~Tif~~~ 63 (541) |=-.-+.-.|| ..-||.|=...++|+ .=..|-+|.|+=++....+-. .+.|..| T Consensus 36 ~~N~~~~~~G~~~~R~g~~~~~~~~~~~~~~rlipf~~~~~~~---~~l~~g~~~~r~~~~~~~~~~--~~~~~~~---- 106 (681) T protein:vir:10 36 CRNFVVKPQGPAENRAGFAFVREVKDSAKKVRLIPFTYSVTQT---MVIELGAGYFRFHTNGGTLLD--GAVPYEI---- 106 (681) T ss_pred hcCcEEEecCCceecChhHhhhhcCCCCCcEEEEEEEeCCCce---EEEEEeCCeEEEEeCCcEEee--CcEeEEe---- Confidence 11111111111 123444444444433 234456666666654332211 1111111 Q ss_pred CcEEEEeC-CeEEEeeCCcccCCCCeEEEeCCCCcceeecceeeccccCCccceeeec---CC-CCCccceEEecCCCCC Q lcl|NC_019442. 64 DDFWFAWP-DVVDVIRSPIAQDPHGRIYYTDGRFPKVTDATIATKGDGNHPASSYSLG---IP-APTTAPVCTVQQGGDV 138 (541) Q Consensus 64 ~~~W~~w~-~~V~vv~spia~D~~~Rvy~t~~~~pk~t~~~ia~~g~g~~p~a~y~LG---Vp-~P~~~pv~~v~~~~~~ 138 (541) ..-|.+=. ...++.- ..| .+|++.-..|+...-. .++ ..| +|. +. .|.++...+.+.. T Consensus 107 ~tpy~~~~l~~l~~~q---~aD---~~~i~h~~~~p~~L~r---~~~----~~W-~l~~~~f~~~p~~p~~~~at~~--- 169 (681) T protein:vir:10 107 ANPYAEADLFNIHYVQ---SAD---VLTLVHPNYAPRELRR---LGA----TNW-QLATIAFTSPVATPTSVTATSN--- 169 (681) T ss_pred cCCCChhhhcCceEEE---EcC---EEEEECCCCcceEEEE---ccC----Cce-EEEEEEeccccccceeeeeecc--- Confidence 00011100 0111111 223 7788776666543221 122 233 221 22 2222222222111 Q ss_pred CCCCCCcccceEEEEEEEecCC-CccCccccccceeec--CCCCEEEEccccCCCCccccceEEEEEeecCCCceeEEEE Q lcl|NC_019442. 139 SDDNPNDDETRFYTETFVSDYG-EEGPPGPASLEVTLR--TPGTAVQLTLSPVPLQNASIKRRRIYRSASGGGEADFLLV 215 (541) Q Consensus 139 ~~~~~~~~~ty~Yv~T~V~~~G-eEs~Ps~~S~~vtv~--~~g~~v~l~~~p~~~~~~~i~~~RIYRs~t~~~~~~~~lV 215 (541) .....-+++|.++=|+..+ .|+.++.. ..++.. ..+...++...+ ..+. ...|||+. .+.-+-++ T Consensus 170 ---~~~~~~t~~~~v~avda~t~~~s~~~~~-~tvt~~~~~~~~~~t~~w~a--~~g~--~~~~V~~~----~~gi~g~i 237 (681) T protein:vir:10 170 ---NKGTDYTYRYVVTALDAEGKTESAPSSA-GTCTNNLFTNGGANTIAWSA--SSGA--SRYNVYKE----QGGLYGYI 237 (681) T ss_pred ---CCccceeEeEEEEEeecccceeecCCcc-eEEeeeeecCCcceeEEEEe--cCCc--eeeeeccc----ceeEEEEe Confidence 1122346788888888765 47766642 222221 122223333222 2222 35677763 22234444 Q ss_pred EeeccceEEEEecccccccCccchhhhhhCCCCC-cc---eEEeccCcEEEEE----eCCEEEEecCCCcccCchhcc-- Q lcl|NC_019442. 216 AELDASVLSYTDKIPGKNLGPSLATWDYLPPPEN-MT---GLCLMANGIAAGF----AGNEVMFSEAYLPYAWPEVNR-- 285 (541) Q Consensus 216 ael~~~~~sf~D~~~~~~L~~~L~t~~~~~pP~~-~~---gL~~m~NGi~a~f----~Gn~l~fSep~~P~awp~~y~-- 285 (541) .. ....++.++.-..... .+.+|...|-. .+ ..+.+.++++.-. ..+.||+|.+..++.|-.+-- T Consensus 238 g~--~~~~~~~~~~~~~~~~---~t~~~~~~~~~~~~gyP~~v~f~q~RL~f~~~~~~p~~v~~Srsgdy~nF~~~~~~~ 312 (681) T protein:vir:10 238 GQ--TTGTSLVDDNIAPDLS---VTPPIYDAVFNAAGDYPAAVSYFEQRRCFAGTTNKPQNIWMTRSGTESAMSYSLPVR 312 (681) T ss_pred ec--cceeeeeecccccCcc---ccccccccccccCCCceEEEEEEcceEEEeeCCCCCcEEEEEcccCcccccccCCCC Confidence 32 2223444443222221 12234333221 11 2334445555443 347899999999999732111 Q ss_pred ------cccC----cceEEEEEcCCcEEEEEcCCEEEEEc-----cCcccceEEeecccccccccchheeCCccEEEecC Q lcl|NC_019442. 286 ------HTTA----EDIVAICPLGTSLVVATKGEPYLFSG-----VSPSTISGSRIPSMQACLSRRSMVAMEGFVLYAGT 350 (541) Q Consensus 286 ------~t~~----~~Iv~ia~v~~~lvV~T~~~py~l~G-----~~p~s~~~~~l~~~~pCvs~rsiv~~~~~v~y~s~ 350 (541) +++. ..|.-+.+++ .|+|+|.+.-|+|++ .+|.+..+. +....+|- .=..+.+|+.++|+++ T Consensus 313 ddD~i~~~~~~~~~~~i~~~v~~~-~lli~t~~~e~~l~~~~~~~lTP~~~~~~-~~s~~g~~-~~~Pv~vg~~v~fv~~ 389 (681) T protein:vir:10 313 DDDRVAFRVAAREANAIRHIVPLT-ELLLLTSSGEWRVASVNSDAVTPTTISVR-PQSYVGAT-DVQPVVVNNTTIYGAA 389 (681) T ss_pred CCccEEEEEcCCcceeEEEEEecC-cEEEEEcCcEEEEecCCCccccceeEEEE-Eeeeeccc-cccceeeCCeEEEEec Confidence 2222 3377788885 599999999999987 455565444 34456774 3457889999999999 Q ss_pred Cc-----EEEEeCCCceE--EEe---cccCChhHhhhhcCcceEEEEEEcCeEEEEEecCCCccceEEEccCCceeEEEe Q lcl|NC_019442. 351 NG-----LVSVDVNGNTA--LAT---EKIISPEQWQSQFNPASIVAYSWRGEYIACYTKPDGKQDVFVFSPVNMDIRYLS 420 (541) Q Consensus 351 dG-----Lv~~~~~G~~~--~vT---~~~~~~~~W~~~l~P~ti~a~~~eG~Y~~~y~~~~g~~~~~i~d~~~~~~~~~~ 420 (541) .| +..-..++..+ -+| ..|+.- . .-.-+|++-+..-+.++...+|+.-++.|+...+..-|.. T Consensus 390 ~g~~vre~~y~~~~d~~~~~dlt~~a~Hl~~~------~-~i~~~a~~~~p~~~~~~v~~dg~l~~~ty~~eq~v~aW~~ 462 (681) T protein:vir:10 390 RGGHVRELAYNWQANGFVTGDLSLRAAHLFDN------L-DILDMAYAKAPQPIVWFISSSGKLLGLTYVPEQQIGAWHQ 462 (681) T ss_pred CCCEEEEEEEeeecCceeccchhhhhhhhcCC------C-CeEEEEEecCCCEEEEEEecCCcEEEEEEecccceeeEEE Confidence 99 22222222111 122 333321 1 1111455556666667777788877788886655444444 Q ss_pred ecccEEEEE------ecCCEEEEEECC-----EEEEecCCCCceeEEEEcceEEeCcccceeEE---EEeeC-CCccEEE Q lcl|NC_019442. 421 TPFDCAWVD------LAKDMMRVVTGD-----KMSVLAGGSLPSTIRWHSKIFSLPERTSFSCI---RVKSP-APERVGI 485 (541) Q Consensus 421 ~~~d~~~~~------~~~d~LY~~~g~-----~i~~~~~g~~~~~~~WrSk~f~~~~~~~~~~~---~V~~~-~~~~~~v 485 (541) ..++-.+.+ -..|.||++..+ ...-.+.=+ .....++..-|.+...+++..- .+.+- -.+--.+ T Consensus 463 ~~~~g~v~~v~~i~~~~~d~l~~vv~r~~~g~~~~yie~~~-~~~~~~~~~~~~vD~~~t~~~~~~~~~sgl~~leG~tv 541 (681) T protein:vir:10 463 HDTDGVFESCAVVAEGNEDRLYAVVRRTIGGNEVRYVERMA-SRQFDAQADAFFVDSGLTYSGEPVSHISGLEHLEGKTV 541 (681) T ss_pred EecCCcEEEEEEecCCCCcEEEEEEEecCCCCeEEEEEecC-CccccccccceEeeccccccCcceeeeccccCCCCcEE Confidence 333333222 236799997521 111111111 1223344445555555443221 11110 0122234 Q ss_pred EEEECCceeEeecccccCCcceEccCcc-----cceEEEEEE---------------ecceEEEEEee--cchhhcCC Q lcl|NC_019442. 486 TIMADDVPVIHFAPGTFKGSVVRLPAAT-----GQNWQVMVS---------------GFGQVERITLS--TSMSEMPV 541 (541) Q Consensus 486 ~~~~d~~~~~~~~~~~~~~~~~rLP~~~-----~~~w~iei~---------------g~~~V~~i~la--~s~~EL~~ 541 (541) .+..||... .+.+..+..+.|+..- +.....+++ ...+|.++.|- .|.. +-+ T Consensus 542 ~i~aDG~~~---~~~~V~~G~itl~~~~~~v~VGl~Y~s~i~~lp~~~~~~~g~~~g~~~ri~rv~lr~~~S~g-~~~ 615 (681) T protein:vir:10 542 SILADGAVH---PQRVVTDGAIDLDVEAGTVHIGLPITAELQTLPVAMQLDGSFGQGRVKNINKLWLRVHRSSG-IFA 615 (681) T ss_pred EEEeCCeec---CcEeecCcEEEeCcCCceEEEeeeceeEEEecceeeecCCcccCCceEEEEEEEEEEEcccc-eEE Confidence 555666432 1111112222333110 011111110 01222222221 1100 000 No 26 >protein:vir:2109 Length: 472 # NCBI annotation: head completion protein # Family: family:all:1540 # MgeID: mge:46 # MgeName: P22 # Cross-refs: genbank:acc:NP_059633;genbank:gi:9635541;genbank:GeneID:1262840 Probab=97.83 E-value=1.6e-05 Score=46.88 Aligned_cols=389 Identities=11% Similarity=0.105 Sum_probs=184.2 Q ss_pred CCCCCccceEEecCCCCCCCCCCCcccceEEEEEEEecCCCccCccccccceeecCCCCEEEEccccCCCCc-------- Q lcl|NC_019442. 121 IPAPTTAPVCTVQQGGDVSDDNPNDDETRFYTETFVSDYGEEGPPGPASLEVTLRTPGTAVQLTLSPVPLQN-------- 192 (541) Q Consensus 121 Vp~P~~~pv~~v~~~~~~~~~~~~~~~ty~Yv~T~V~~~GeEs~Ps~~S~~vtv~~~g~~v~l~~~p~~~~~-------- 192 (541) +|.=. .|++ .|. ..+ .. ...|.=.+ =|+-+=+|+.-+- |.-.....||....... |-+.-| T Consensus 1 m~~~q-~Pl~---~g~-~~~--~~-~~d~~~~~-pVN~~a~~~~~~~-s~~~lr~tPG~~~~~~~-~g~~RG~~~~t~~~ 69 (472) T protein:vir:21 1 MPIQQ-LPMM---KGM-GKD--FK-NADYIDYL-PVNMLATPKEILN-SSGYLRSFPGITKRYDM-NGVSRGVEYNTAQN 69 (472) T ss_pred CceEE-eecc---ccc-ccc--cc-ccceeeee-eeeeeeeccCCcc-cceeeeecCCcceeccC-CCceeeeeecccCC Confidence 11110 1111 010 000 00 00111011 1443333433322 22333334665444342 222111 Q ss_pred --cccceEEEEEeecCCCceeEEEEEeeccc-eEEEEeccccccc--Cc-----cch--hhhhhC-CCC-Cc-------- Q lcl|NC_019442. 193 --ASIKRRRIYRSASGGGEADFLLVAELDAS-VLSYTDKIPGKNL--GP-----SLA--TWDYLP-PPE-NM-------- 250 (541) Q Consensus 193 --~~i~~~RIYRs~t~~~~~~~~lVael~~~-~~sf~D~~~~~~L--~~-----~L~--t~~~~~-pP~-~~-------- 250 (541) ..+....|||=.+ + +++++.+ .-+++||...-.. +. .+. +..... |.| +. T Consensus 70 ~ly~V~G~~LY~v~~-----~---~G~i~gsgrVsMa~n~~~~~v~~~~~~~~Y~~~~~~~t~~~~~~d~~f~~~dl~~~ 141 (472) T protein:vir:21 70 AVYRVCGGKLYKGES-----E---VGDVAGSGRVSMAHGRTSQAVGVNGQLVEYRYDGTVKTVSNWPADSGFTQYELGSV 141 (472) T ss_pred eEEEEeCCceEEEee-----e---eeeecccccEEEeeCCeEEEEEECCceeEEEEecchhhhhcccCccccccccccce Confidence 0122334444111 0 2333221 1244444321000 00 000 010111 211 11 Q ss_pred ceEEeccCcEEEEE-eCCEEEEecCCCcccCchhc--cccc---CcceEEEEEcCCcEEEEEcCCE--EEEEccCc-ccc Q lcl|NC_019442. 251 TGLCLMANGIAAGF-AGNEVMFSEAYLPYAWPEVN--RHTT---AEDIVAICPLGTSLVVATKGEP--YLFSGVSP-STI 321 (541) Q Consensus 251 ~gL~~m~NGi~a~f-~Gn~l~fSep~~P~awp~~y--~~t~---~~~Iv~ia~v~~~lvV~T~~~p--y~l~G~~p-~s~ 321 (541) ..+|-+ .|++..- .|...+|.-...+--.|.+| |-+= +|.|++|....+-||++-+... |..+|..+ ... T Consensus 142 ~dv~f~-dGyfV~~~~gt~~f~is~l~d~~~~~~y~~FatAE~~pD~Iv~i~~~~~~l~lfG~~TiEvw~ntG~ad~~~f 220 (472) T protein:vir:21 142 RDITRL-RGRYAWSKDGTDSWFITDLEDESHPDRYSAQYRAESQPDGIIGIGTWRDFIVCFGSSTIEYFSLTGATTAGAA 220 (472) T ss_pred eEEEEe-cceEEEccCCcceeEEecCCCCccccCCccceeeccCCCceEEEEeeccEEEEEeccceEEEEecCCCCcCcC Confidence 122223 4554433 35545665444443356666 3332 3889999999999999988777 99999874 445 Q ss_pred eEEeec---ccccccccchheeCCccEEEecCCc----EEEEeCCCceEEEecccCChhHhhhhcC-cce----EEEEEE Q lcl|NC_019442. 322 SGSRIP---SMQACLSRRSMVAMEGFVLYAGTNG----LVSVDVNGNTALATEKIISPEQWQSQFN-PAS----IVAYSW 389 (541) Q Consensus 322 ~~~~l~---~~~pCvs~rsiv~~~~~v~y~s~dG----Lv~~~~~G~~~~vT~~~~~~~~W~~~l~-P~t----i~a~~~ 389 (541) ..++.+ .+.+|.++.|+..+++.++|+|+|+ .|....+++++.|+-.=|+ +.|++ .+ |+. +...+. T Consensus 221 py~r~~g~~iq~Gcaa~~sv~~~~~s~~~l~~~~~g~~~V~~~~g~qa~rIST~aIE-~~i~~-y~~~e~~~A~~~t~~~ 298 (472) T protein:vir:21 221 LYVAQPSLMVQKGIAGTYCKTPFADSYAFISHPATGAPSVYIIGSGQASPIATASIE-KIIRS-YTAEEMATGVMETLRF 298 (472) T ss_pred ceEEcCcceeeecccCcchhhecCceEEEEecCCCcccEEEEccCceeEEecCHHHH-HHHHh-cCCccccceEEEEEEe Confidence 555554 6789999999999999999999998 2445556678777444444 47776 44 663 345668 Q ss_pred cCe--EEEEEecCCCccceEEEccCCce----eEEEeec-----ccEEEEEecCCEEEE--EECCEEEE--ecCCC---C Q lcl|NC_019442. 390 RGE--YIACYTKPDGKQDVFVFSPVNMD----IRYLSTP-----FDCAWVDLAKDMMRV--VTGDKMSV--LAGGS---L 451 (541) Q Consensus 390 eG~--Y~~~y~~~~g~~~~~i~d~~~~~----~~~~~~~-----~d~~~~~~~~d~LY~--~~g~~i~~--~~~g~---~ 451 (541) ||. |+.++. + ..+.||..++. -.+...+ |-+.-+....+++-+ .+++.||+ |+... . T Consensus 299 eGH~fy~LtfP--~---~Tw~yD~at~~~~e~W~~~~sg~~~~~~R~~~~~~~~g~~ivGD~~nG~ly~L~fd~~~~~d~ 373 (472) T protein:vir:21 299 DSHELLIIHLP--R---HVLVYDASSSQNGPQWCVLKTGLYDDVYRGVDFMYEGNQITCGDKSEAVVGQLQFDISSQYDK 373 (472) T ss_pred CCeEEEEEEcC--C---eeEEEEcccCccCceeeeeccCCCcCceeEEEEEeeCCeEEEEEcCCCeEEEEEecccccCCC Confidence 898 888775 2 37999988763 1111221 222222233444433 23566665 34433 2 Q ss_pred ceeEEEEcceEEeCcccceeEEEEeeC-----CCccEEEEEEECCceeEee------cccccCCc--ceEccCcccce-E Q lcl|NC_019442. 452 PSTIRWHSKIFSLPERTSFSCIRVKSP-----APERVGITIMADDVPVIHF------APGTFKGS--VVRLPAATGQN-W 517 (541) Q Consensus 452 ~~~~~WrSk~f~~~~~~~~~~~~V~~~-----~~~~~~v~~~~d~~~~~~~------~~~~~~~~--~~rLP~~~~~~-w 517 (541) +-+..-.+..|..+..--|- ..|+.. ...++-++.--||..+-+- .++..+.| -.||=..|.+- . T Consensus 374 ~~~~~r~~p~~~~dn~R~fd-~eve~~~Gv~q~~d~v~L~wSddG~~~~~~~~~~~g~~g~~~tr~~~~RlG~~r~~v~f 452 (472) T protein:vir:21 374 QQEHLLFTPLFKADNARCFD-LEVESSTGVAQYADRLFLSATTDGINYGREQMIEQNEPFVYDKRVLWKRVGRIRRLIGF 452 (472) T ss_pred cCcEEEEccceeCCCCEEEE-EeeeccCCCCCcCcEEEEEeeccccccccceeeccCCccchhcceeeeeeeecccceeE Confidence 22223334444444432232 344421 1122222222243322111 11111211 22444444443 8 Q ss_pred EEEEEecceEEEEEeecchh Q lcl|NC_019442. 518 QVMVSGFGQVERITLSTSMS 537 (541) Q Consensus 518 ~iei~g~~~V~~i~la~s~~ 537 (541) +|+++...+|.---+---+| T Consensus 453 ~~r~~~~~~~~l~g~~~~~E 472 (472) T protein:vir:21 453 KLRVITKSPVTLSGCQIRLE 472 (472) T ss_pred EEEEEecCcceeeeeEEeeC Confidence 88888999988777777777 No 27 >protein:vir:3529 Length: 477 # NCBI annotation: P28 # Family: family:all:1540 # MgeID: mge:72 # MgeName: APSE-1 # Cross-refs: genbank:acc:NP_050989;genbank:gi:9633575;genbank:GeneID:1262322 Probab=97.54 E-value=4.8e-05 Score=44.26 Aligned_cols=396 Identities=11% Similarity=0.079 Sum_probs=189.9 Q ss_pred CccceeeecCCCCCccceEEecCCCCCCCCCCCcccceEEEEEE-EecCCCccCccccccceeecCCCCEEEEccccCCC Q lcl|NC_019442. 112 HPASSYSLGIPAPTTAPVCTVQQGGDVSDDNPNDDETRFYTETF-VSDYGEEGPPGPASLEVTLRTPGTAVQLTLSPVPL 190 (541) Q Consensus 112 ~p~a~y~LGVp~P~~~pv~~v~~~~~~~~~~~~~~~ty~Yv~T~-V~~~GeEs~Ps~~S~~vtv~~~g~~v~l~~~p~~~ 190 (541) +=++-++--++-|=..-.......+ .|+--+ |+.+.....-.-.|.. ....+|-.....+ +-+. T Consensus 1 ~~~~~~m~~~~ipl~~g~~~~~~~~-------------d~~~~~PVN~~a~p~~~~~s~~~-L~~~pG~~~~~~~-~G~~ 65 (477) T protein:vir:35 1 MLSEVFMPKIQIPLAKGLVKDIKTA-------------DYIDALPVNMLATPKEVLNASGY-LRSFPGIEKKQDA-KGVS 65 (477) T ss_pred Ccccceeeeeccccccccccccccc-------------cceeeeeeccceeeccccccccc-cccCCcceeeccC-Cccc Confidence 0011111111111100000000000 011111 3332211100111111 1112343332221 1111 Q ss_pred Ccc--ccceEEEEEeecCCCceeEE---EEEeeccc-eEEEEecc------------------cccccCccchhhhhhCC Q lcl|NC_019442. 191 QNA--SIKRRRIYRSASGGGEADFL---LVAELDAS-VLSYTDKI------------------PGKNLGPSLATWDYLPP 246 (541) Q Consensus 191 ~~~--~i~~~RIYRs~t~~~~~~~~---lVael~~~-~~sf~D~~------------------~~~~L~~~L~t~~~~~p 246 (541) -|- ++..-.+|| +.|+ +=|+ =|++++.+ .-+++||. ...++. -.| .+..+- T Consensus 66 RG~~~~~~~g~lY~-V~G~--~LY~v~~~vG~I~gsg~VsMa~n~~~~aIv~~g~~~gy~y~~t~~~~~-~~~-~~~~p~ 140 (477) T protein:vir:35 66 RGVHFNTKNNALYR-VCGN--TLYRNDKEVADIAGMSRVSMSHSSHSQAICFEGKVKLYRYDGTEKALS-NWP-KDKYPQ 140 (477) T ss_pred cceeEeecCCeEEE-EecC--eeEeeeeeeeeecccccEEEeeCCcEEEEEECCcceeEEEecccceee-ecC-ccccCC Confidence 110 111233444 1111 1111 13333211 11333332 111110 001 011111 Q ss_pred CC--CcceEEeccCcEE-EEEeC-CEEEEecCCCcccCchh-ccccc---CcceEEEEEcCCcEEEEEcCCE--EEEEcc Q lcl|NC_019442. 247 PE--NMTGLCLMANGIA-AGFAG-NEVMFSEAYLPYAWPEV-NRHTT---AEDIVAICPLGTSLVVATKGEP--YLFSGV 316 (541) Q Consensus 247 P~--~~~gL~~m~NGi~-a~f~G-n~l~fSep~~P~awp~~-y~~t~---~~~Iv~ia~v~~~lvV~T~~~p--y~l~G~ 316 (541) ++ +...+|-+ .|++ ..-.| +...+|...-+-.++.- +|-+= +|.|++|...-+-||++-+... |..+|. T Consensus 141 ~~l~~~~~v~f~-dGyfV~~~~gt~~~~iS~L~d~s~~d~~~~FasAE~~pD~Ivgi~~~~~~i~lfG~~TiEvw~ntG~ 219 (477) T protein:vir:35 141 YDLGEVIDVCRN-RGRYIWLQKGGERFGVTDLEDESKPDRYQPFYRAESQPDGIVSVDAWRDLIVCFGSSSIEYFTLTGS 219 (477) T ss_pred ccccceeEEEee-CceEEEeecCCCeEEEeecCCccccccccccccccCCCCceEEEEeeccEEEEEeccceEEEEecCC Confidence 11 11234444 4553 33344 55555865544444332 23332 3889999999999999988776 999999 Q ss_pred CcccceEEeec----ccccccccchheeCCccEEEecCC----cEEEEeCCCceEEEecccCChhHhhhhcCcceEE--- Q lcl|NC_019442. 317 SPSTISGSRIP----SMQACLSRRSMVAMEGFVLYAGTN----GLVSVDVNGNTALATEKIISPEQWQSQFNPASIV--- 385 (541) Q Consensus 317 ~p~s~~~~~l~----~~~pCvs~rsiv~~~~~v~y~s~d----GLv~~~~~G~~~~vT~~~~~~~~W~~~l~P~ti~--- 385 (541) .+-+....|.. ++.+|.++.|+..+++.++|+|+| |.|-...+++++.|+-.=|+ +.|++-=.++... T Consensus 220 a~f~~p~~r~~~~~mIq~Gcaa~~sv~~~~~t~~~l~~d~~g~~~V~~~~g~q~~rIST~aIE-~~i~ay~~~e~a~af~ 298 (477) T protein:vir:35 220 ADTSQPLYIHQAAYMIQAGIAGRDCKCRYQDKYAILSHQSTGQPAVYLIGAGEKNKISTATID-KIIRYYSADELAASFM 298 (477) T ss_pred CCCCcceeecCCceeeeecccCchhhhhhCceEEEEecCCCcccEEEEccCceeEEecCHHHH-HHHHhcCCcchhceeE Confidence 99987777776 688999999999999999999998 55777777788888444443 4677644466654 Q ss_pred -EEEEcCe--EEEEEecCCCccceEEEccCCce--eEEEee-------cccEEEEEecCCEEEE--EECCEEEEecCCC- Q lcl|NC_019442. 386 -AYSWRGE--YIACYTKPDGKQDVFVFSPVNMD--IRYLST-------PFDCAWVDLAKDMMRV--VTGDKMSVLAGGS- 450 (541) Q Consensus 386 -a~~~eG~--Y~~~y~~~~g~~~~~i~d~~~~~--~~~~~~-------~~d~~~~~~~~d~LY~--~~g~~i~~~~~g~- 450 (541) ..+.||. |+.++.+ ..+.||..++. .++... +|-+.-+-...+++.+ .+++.||+++-+. T Consensus 299 ~t~~~eGH~fy~LtfP~-----~Tw~yD~at~~w~e~W~~~~~g~~~~~~Ra~~~~~~~g~~~vGD~~ng~l~~ld~~~~ 373 (477) T protein:vir:35 299 ESIRFDNHELLLLHLPK-----HTLCFDGSASHQYSQWSLLKSGFYDEPYRAIDFMFFDNQITVGDKKEGVLGHLIFNAS 373 (477) T ss_pred EEEEeCCeeEEEEEcCC-----ceEEEecccccccceeeeeccCCccCceEEEEEEEeCCeEEEEEcCCCeEEEECCCCc Confidence 4568899 8888762 47999977752 123221 3333333345667766 4578899874211 Q ss_pred ----CceeEEEEcceEEeCcccceeEEEEeeC-----CCccEEEEEEECCceeEee------cccccCCc--ceEccCcc Q lcl|NC_019442. 451 ----LPSTIRWHSKIFSLPERTSFSCIRVKSP-----APERVGITIMADDVPVIHF------APGTFKGS--VVRLPAAT 513 (541) Q Consensus 451 ----~~~~~~WrSk~f~~~~~~~~~~~~V~~~-----~~~~~~v~~~~d~~~~~~~------~~~~~~~~--~~rLP~~~ 513 (541) .+..-+-.++.|..+..--|- ..++.. ...++-++.--||..+-+- .++..+.| -.|| ++ T Consensus 374 ~d~g~~i~~~~~~p~~~~d~~Rv~~-~el~~~tGvgq~~d~v~L~~sddG~~~~~~~~~~~g~~g~~~~r~~~~Rl--G~ 450 (477) T protein:vir:35 374 NQYEQQTEHLLYTPMIKADNARLFD-FELEASTGVAQIADKLFLSVTTDGINYSREQLIEQNSPFQYDKRILWRRI--GR 450 (477) T ss_pred ccCCCccceEEecceeeCCCCeEEE-EEEEEecCcCccCceEEEEEeccccccccceeecCCCccccccceeeeee--ee Confidence 222333346666666554343 444322 1223322222233322111 11111111 1133 24 Q ss_pred cce---EEEEEEecceEEEEEeecchh Q lcl|NC_019442. 514 GQN---WQVMVSGFGQVERITLSTSMS 537 (541) Q Consensus 514 ~~~---w~iei~g~~~V~~i~la~s~~ 537 (541) +++ .+|+++-..+|..--++.-|| T Consensus 451 ~r~~vgf~~r~~~~~pv~l~~~~~~~e 477 (477) T protein:vir:35 451 VRKNIGFKIRIITKSPVTLSDLSIRME 477 (477) T ss_pred ceeccceEEEEEecCCceeccceeEeC Confidence 443 778888888888777777777 No 28 >protein:vir:100851 Length: 514 # NCBI annotation: hypothetical protein # Family: family:all:2450 # MgeID: mge:1633 # MgeName: LP65 # Cross-refs: genbank:acc:YP_164744;genbank:gi:56693157;genbank:GeneID:3197484 Probab=97.30 E-value=1.7e-05 Score=46.77 Aligned_cols=268 Identities=17% Similarity=0.232 Sum_probs=105.9 Q ss_pred CceEEe-----------------cccccccccccceecccccceEEEEe-eecCCeeeeeecccccCccccccceeEEEE Q lcl|NC_019442. 1 MPYIDI-----------------TTMRGMMPRVVTSMLPEHSAVLAEDC-HFRFGVITPERQISGVEKTFTIKPKTIFHY 62 (541) Q Consensus 1 m~~i~i-----------------~~f~G~~Pr~~p~llp~~~a~~a~N~-~~~~G~l~P~~~~~~v~~~~~~~~~Tif~~ 62 (541) |-.|.- .-|-|.+--+. | +|+ +++.+.|++ +.++.. +..+..=|=- T Consensus 169 a~tiE~a~FyGDs~L~s~~~~~gleFDGl~~lI~----~-------~NvIDarG~~Ls~----~~ln~a-A~~i~~gfGt 232 (514) T protein:vir:10 169 IKTDEWAMFYGDADLTSGQKGEGLQFDGLFKLIA----P-------ENHIDLRGGRLSP----AALNMA-ARKIGEGFGT 232 (514) T ss_pred HHHHHHHHhhhcccCCCccccCcchhhhHHHhhc----C-------CCeEecCCCCccH----HHHhhh-hhhhhcccCC Confidence 000000 12333333332 1 222 233333331 111111 1111111222 Q ss_pred CCcEEEEeCCeEEEeeCCcccCCCCeEEEeCCCC----------cceeecceeeccccCCccceeee--cCC----CCCc Q lcl|NC_019442. 63 RDDFWFAWPDVVDVIRSPIAQDPHGRIYYTDGRF----------PKVTDATIATKGDGNHPASSYSL--GIP----APTT 126 (541) Q Consensus 63 ~~~~W~~w~~~V~vv~spia~D~~~Rvy~t~~~~----------pk~t~~~ia~~g~g~~p~a~y~L--GVp----~P~~ 126 (541) ..+-|+..-..-+.+-++ .|-+||...+++. -....++|.+-|+-... ..-.| .++ +|.+ T Consensus 233 ~TD~ylp~~vka~f~~~~---~~~qRV~~~~n~~~~~~G~~v~~f~s~~G~I~L~gs~im~-~~n~L~~~~~~~~~Ap~~ 308 (514) T protein:vir:10 233 PTDAYMPIGIKADFVNQH---LNGQRVMLPGQTGGMTTGLDIDKFLSAHGSIRIQGSTIMD-SDNKLDFDRPVSPTAPTA 308 (514) T ss_pred hhheeCchHHHHHHhhcc---cCcceEEeecCccceeeeeeccceeEeccceeecCCeeec-ccccCccCCccCCcCCCC Confidence 233333322222222222 2334555544311 11123334443332221 11112 222 4433 Q ss_pred cce-EEecCCCC---CCCCCC---------Cccc-ceEEEEEEEecCCCccCccccccceeecCCCCEEEEccccCCCCc Q lcl|NC_019442. 127 APV-CTVQQGGD---VSDDNP---------NDDE-TRFYTETFVSDYGEEGPPGPASLEVTLRTPGTAVQLTLSPVPLQN 192 (541) Q Consensus 127 ~pv-~~v~~~~~---~~~~~~---------~~~~-ty~Yv~T~V~~~GeEs~Ps~~S~~vtv~~~g~~v~l~~~p~~~~~ 192 (541) +-+ ++|+..+. .+.+.. .+.+ .|.|++..|+.+| ||+||.+ ...|+...+..|+|+.-|-+.+. T Consensus 309 ~~va~svT~~~~g~~~~ad~t~~~g~~~~~~~~g~~~sYaVv~~n~~G-eS~ps~~-vtaT~a~~~~~i~ltItp~~~~~ 386 (514) T protein:vir:10 309 PQLSATVTPDGGGLWHEADKTDSKGEVILNKEVGVEQSYVAVMVSRHG-DSRPSLV-QTATPTKKDDAITLTITPNAMQN 386 (514) T ss_pred CcceEEEecCcccccCcccccccccccccccccceeEEEEEEEECCCC-cccccce-eeeeeeccCceEEEEEEeccCcc Confidence 322 24422111 111111 3444 4779999999988 6688876 45566667888999987666665 Q ss_pred cccceEEEEEee------------cCCCceeEEEEEeec-----cceEEEEecccccc------cCcc-chhhhhhC--C Q lcl|NC_019442. 193 ASIKRRRIYRSA------------SGGGEADFLLVAELD-----ASVLSYTDKIPGKN------LGPS-LATWDYLP--P 246 (541) Q Consensus 193 ~~i~~~RIYRs~------------t~~~~~~~~lVael~-----~~~~sf~D~~~~~~------L~~~-L~t~~~~~--p 246 (541) ..-+.+.|||+. ..++.++|+++++++ .++++|+|.-.--. .++. -.+..|.. | T Consensus 387 ~~p~yv~IYR~~~~~s~~~~~~~~~~~~tGdf~li~rv~~~~~~~gttt~~D~n~~IPgT~~vfVgemspevi~l~ellP 466 (514) T protein:vir:10 387 VIPDYVAIYRKSNFDSDALEANTDASGNRGSYYLIGKVAVREQEGATITFVDTNARIAGCGDVFVIENRPETVALQEFIP 466 (514) T ss_pred cccceEEEEeccCCCcchhhhhccccccccceeEEEEEeeecCCCCeEEEeccccccCCcceeEEeeCchHHHHHHHHhh Confidence 555889999983 123557899998887 47889999643211 1111 12233322 2 Q ss_pred CCCcceEEeccCcEEEEEeCCEEEEec--CCCcccCchhcccccC--cceEEEEEc Q lcl|NC_019442. 247 PENMTGLCLMANGIAAGFAGNEVMFSE--AYLPYAWPEVNRHTTA--EDIVAICPL 298 (541) Q Consensus 247 P~~~~gL~~m~NGi~a~f~Gn~l~fSe--p~~P~awp~~y~~t~~--~~Iv~ia~v 298 (541) .-.|. | +.-|-+. ..-++||-- -+.|--|= ++..= .+|--+... T Consensus 467 m~klp-L-A~~na~~---~waVlwYGaLal~aPkr~~---~IkNv~~~~v~~~~~~ 514 (514) T protein:vir:10 467 LSKLN-L-AVTTTAT---SFVVLNYVALALYYPKRGA---VLENVVYSRVEDLELS 514 (514) T ss_pred hhhcC-h-hhhcchH---HHHHHHHhHHHhhccccce---EEEeeeeeeccccccC Confidence 21111 1 1111110 112222221 11221120 11100 111111111 No 29 >protein:vir:102644 Length: 594 # NCBI annotation: Hypothetical protein # Family: family:all:780 # MgeID: mge:1624 # MgeName: VP2 # Cross-refs: genbank:acc:YP_024422;genbank:gi:48696643;genbank:GeneID:2948111 Probab=97.28 E-value=0.00011 Score=42.34 Aligned_cols=430 Identities=10% Similarity=0.029 Sum_probs=158.5 Q ss_pred CceEEeccccc-cc-ccccceeccc---ccceEEEEeee-cCCeeeeeecccccCccccc-cceeEEEECCcEEEEeCC- Q lcl|NC_019442. 1 MPYIDITTMRG-MM-PRVVTSMLPE---HSAVLAEDCHF-RFGVITPERQISGVEKTFTI-KPKTIFHYRDDFWFAWPD- 72 (541) Q Consensus 1 m~~i~i~~f~G-~~-Pr~~p~llp~---~~a~~a~N~~~-~~G~l~P~~~~~~v~~~~~~-~~~Tif~~~~~~W~~w~~- 72 (541) |+.|.=..|.| |+ |++..|.=-+ +.++.++|--+ -.|.++=.-.-+-++...-. ..--|+-+ .++. T Consensus 1 m~~~~~~~F~~GelsP~l~~r~Dl~~y~~~~~~~~n~~~~~~G~~~rR~G~~~~~~~~~~~~~~~lipF------~~s~~ 74 (594) T protein:vir:10 1 MADFSQTSFKGGVIAPRLQFNEYESAYHHSIEDAVNFVVTEQGSLITRCGSEEVGLCQDGEVRLFRLPA------VDAPS 74 (594) T ss_pred CceeeccccCcceecceeccchhHHHHHHHHhhhhceEEEecCCeecCChhHhhhhccCCCCCEEEEEE------EeCCC Confidence 55554444432 21 2222221111 12223333221 11111100000111100000 00001000 0000 Q ss_pred eEEEeeCCcccCCCCeEEEeCCCCcceeecceeeccccCCccceeeecCCCCCccceEEecCCCCCCCCCCCcccceEEE Q lcl|NC_019442. 73 VVDVIRSPIAQDPHGRIYYTDGRFPKVTDATIATKGDGNHPASSYSLGIPAPTTAPVCTVQQGGDVSDDNPNDDETRFYT 152 (541) Q Consensus 73 ~V~vv~spia~D~~~Rvy~t~~~~pk~t~~~ia~~g~g~~p~a~y~LGVp~P~~~pv~~v~~~~~~~~~~~~~~~ty~Yv 152 (541) .-.++.. .+-+-|+|.- . T Consensus 75 ~~~~le~---g~~~~r~~~~-----------------------------------------------------------~ 92 (594) T protein:vir:10 75 NDVIVEV---GNTNIAVWVN-----------------------------------------------------------D 92 (594) T ss_pred CeEEEEE---cCCeEEEEec-----------------------------------------------------------C Confidence 0000000 1111122211 1 Q ss_pred EEEEe-cCCCccCcccc--------ccceeecCCCCEEEEccccCCCCccccceEEEEEeecCCCceeEEEEEeeccceE Q lcl|NC_019442. 153 ETFVS-DYGEEGPPGPA--------SLEVTLRTPGTAVQLTLSPVPLQNASIKRRRIYRSASGGGEADFLLVAELDASVL 223 (541) Q Consensus 153 ~T~V~-~~GeEs~Ps~~--------S~~vtv~~~g~~v~l~~~p~~~~~~~i~~~RIYRs~t~~~~~~~~lVael~~~~~ 223 (541) -+.|. ..|.|..++.+ ...+...+.++.+.+.- .+..-.||||.. .+.|.+.+ .+.... T Consensus 93 ~~~v~~~~~~~~~~~tp~~~t~~~~l~~i~~tqsad~~~~~~-------~~~~p~~L~R~~----~~~w~~~~-~~~~~~ 160 (594) T protein:vir:10 93 VRQVVANTPSEWRNTIDRIQTAYDTIGDDAGAANTGRLIMVH-------PALQPKRLYRDN----NNAWQFVN-MHTGAV 160 (594) T ss_pred cEEEEccCCCcccccccceeeccCCccceEEEEEeeEEEEEc-------CCCCceEEEEcc----CCCceEEe-cccCcc Confidence 12111 22222222110 11122223334443321 122347889942 23455543 222111 Q ss_pred EEEecccccccCccchhhhhhCCCCCcceEEeccCcEEEEEeC----CEEEEecCCCcccCchhccccc----------C Q lcl|NC_019442. 224 SYTDKIPGKNLGPSLATWDYLPPPENMTGLCLMANGIAAGFAG----NEVMFSEAYLPYAWPEVNRHTT----------A 289 (541) Q Consensus 224 sf~D~~~~~~L~~~L~t~~~~~pP~~~~gL~~m~NGi~a~f~G----n~l~fSep~~P~awp~~y~~t~----------~ 289 (541) .+.|+. .+ -| ..+.+.+.++.-... +.||+|.+...+.+-..--++- + T Consensus 161 p~~~~~-----------~~---~p----~~v~f~q~RL~f~~~~~~p~~v~~Srtgd~~nF~~~~~~~ddd~i~~~~s~~ 222 (594) T protein:vir:10 161 PAEWSP-----------SN---YP----QTVGIFQNRVWYVGSPVHRTYFWATRAGKLEDIAPSTANNPNDPISFVGIME 222 (594) T ss_pred cccccC-----------Cc---cc----eEEEEEeeeEEEEeCCCCCceEEEEecccccccccCCCCCCCccEEEEEecc Confidence 111110 11 12 334455666544332 4799999988887622111111 1 Q ss_pred cceEEEEEcCCcEEEEEcCCEEEEEcc-----CcccceEEeecccccccccchheeCCccEEEecCCc-----EEEEe-C Q lcl|NC_019442. 290 EDIVAICPLGTSLVVATKGEPYLFSGV-----SPSTISGSRIPSMQACLSRRSMVAMEGFVLYAGTNG-----LVSVD-V 358 (541) Q Consensus 290 ~~Iv~ia~v~~~lvV~T~~~py~l~G~-----~p~s~~~~~l~~~~pCvs~rsiv~~~~~v~y~s~dG-----Lv~~~-~ 358 (541) ..+.-+.+....|+++|++.-|.|+|. +|.+..+.+ ....+| +.=-.+.+|+.++|+++.| +..-. . T Consensus 223 ~~~~~~v~~~~~L~i~t~~~e~~l~~~~~~~lTp~~~~~~~-~s~~g~-~~~~P~~vg~~~~fv~~~g~~vre~~y~~~~ 300 (594) T protein:vir:10 223 GTPCWIIASSDVLTIGTTINDYQLAASTGVSVTAATAILRR-SSVQGT-AAVQGIPAEEQVIFCSRNKSKVYAMNYVREQ 300 (594) T ss_pred cceEEEEecCCceEEEecCceEEEecCCCcccccceEEEEE-eeeecc-CCCcceeeCCeEEEEcCCCCEEEEEEEeecc Confidence 334445566788999999999999874 456655544 345577 3445678899999999998 22111 1 Q ss_pred ----CCceEEEecccCChhHhhhhcCcceEEEEEE--cCeEEEEEecCCCccceEEEccCCceeEEEeecc-c------E Q lcl|NC_019442. 359 ----NGNTALATEKIISPEQWQSQFNPASIVAYSW--RGEYIACYTKPDGKQDVFVFSPVNMDIRYLSTPF-D------C 425 (541) Q Consensus 359 ----~G~~~~vT~~~~~~~~W~~~l~P~ti~a~~~--eG~Y~~~y~~~~g~~~~~i~d~~~~~~~~~~~~~-d------~ 425 (541) +...+++.+.||..-. .+....|+.+.+ +-.-+.++...||...++-|+......-|..-.+ + | T Consensus 301 d~y~~~dlt~~a~hl~~~~~---~~~~~~i~~~a~~~~p~~~~~~v~~dG~l~~~ty~~eq~v~aWs~~~~t~G~v~~va 377 (594) T protein:vir:10 301 DNWIPDEMSSQAQHLFTPIS---SAKGASVRRVAYISDAAKSLWVVLENGQINYCCFDRTTDTKAWTQLELSGGKVIDIA 377 (594) T ss_pred CceeccchhhhhhhhcCccc---cccCceEEEEEEecCCceEEEEEeCCCeEEEEEEecccceeeeEeeccCCCcEEEEE Confidence 1123344455543200 012344433333 2234455566666655666665443333332222 1 2 Q ss_pred EEEEecCCEEEEEEC--CEE------E----EecCCCCceeEEEEcceEEeCcccceeEEEEee-CCCccEEEEEEECCc Q lcl|NC_019442. 426 AWVDLAKDMMRVVTG--DKM------S----VLAGGSLPSTIRWHSKIFSLPERTSFSCIRVKS-PAPERVGITIMADDV 492 (541) Q Consensus 426 ~~~~~~~d~LY~~~g--~~i------~----~~~~g~~~~~~~WrSk~f~~~~~~~~~~~~V~~-~~~~~~~v~~~~d~~ 492 (541) +......|+||++.- +.| | .++... . ......+.+...++... .|-+ +..+--.+.+++||. T Consensus 378 ~i~~~~~d~l~~~V~R~~ti~g~~~~y~~lE~~~~~~--~--~~~~~~~~~d~~~~~~~-~vsgl~hLeg~tv~v~aDG~ 452 (594) T protein:vir:10 378 AAFNPDSDYAYVAVVRSKAINGVQKNYTVLEKISSPR--T--DWKRADGWVVAQVNQNG-DVLNLDRYIGRTAVIFSKYG 452 (594) T ss_pred EeecCCCCEEEEEEEECCccccceeeEEEeecCCCcc--c--cccccceeeeecccccc-eeecccccCCceEEEEeCCe Confidence 222334678888642 222 1 111110 0 00111111112222110 1111 111223455666665 Q ss_pred eeEeecccccCCcceEccCc--c-cceE-------------EEEE--------EecceEEEEEee--cchh------hcC Q lcl|NC_019442. 493 PVIHFAPGTFKGSVVRLPAA--T-GQNW-------------QVMV--------SGFGQVERITLS--TSMS------EMP 540 (541) Q Consensus 493 ~~~~~~~~~~~~~~~rLP~~--~-~~~w-------------~iei--------~g~~~V~~i~la--~s~~------EL~ 540 (541) .... .+..+..+.||.. . +..- -+|+ -...+|.++++. .|.. +.+ T Consensus 453 ~~~~---~~V~~g~itL~~~~~~~~~~v~VGl~Y~s~i~~lp~~~~~~~gs~~g~r~ri~r~~v~~~~S~g~~vg~~~~~ 529 (594) T protein:vir:10 453 LEAE---VEVNNIGLTHRINGYDPNTVYYVGYKMDSYFRTLTPSNGDMKKSMFGSKIRISKVQLALFDSIEPTVNGEPAD 529 (594) T ss_pred ecCC---eEEcCCeeEeeccCCCCcceEEEeeeeeEEEEeecccccCCcccccCccEEEEEEEEEEEcceeeEECCcccc Confidence 4311 1112222333311 0 1110 0110 112333443333 2211 111 Q ss_pred C Q lcl|NC_019442. 541 V 541 (541) Q Consensus 541 ~ 541 (541) . T Consensus 530 ~ 530 (594) T protein:vir:10 530 D 530 (594) T ss_pred c Confidence 1 No 30 >protein:vir:80835 Length: 464 # NCBI annotation: putative major capsid protein # Family: family:all:2450 # MgeID: mge:1885 # MgeName: phiEF24C # Cross-refs: genbank:acc:YP_001504125;genbank:gi:158079312;genbank:GeneID:5666484 Probab=97.09 E-value=1.4e-05 Score=47.11 Aligned_cols=253 Identities=15% Similarity=0.122 Sum_probs=99.0 Q ss_pred CceEEe----ccccccccccc-------------ceecccccceEEEEeeecCCeeeeeecccccCccccccceeEEEEC Q lcl|NC_019442. 1 MPYIDI----TTMRGMMPRVV-------------TSMLPEHSAVLAEDCHFRFGVITPERQISGVEKTFTIKPKTIFHYR 63 (541) Q Consensus 1 m~~i~i----~~f~G~~Pr~~-------------p~llp~~~a~~a~N~~~~~G~l~P~~~~~~v~~~~~~~~~Tif~~~ 63 (541) .+.=.. -.|-|.+-.+. ..+|. +.|.+..-..|..+=...|..+.--| T Consensus 160 l~~~~~~~~gleFDGl~~lI~~~NViDarG~~Ls~~~ln----~Aa~~i~~~fGt~TD~~lp~~v~a~f----------- 224 (464) T protein:vir:80 160 LSENPDAGSGLEFDGLAKLIDKHNVLDAKGASLTEALLN----QASVLVGKGYGTPTDAYMPIGVQADF----------- 224 (464) T ss_pred cCCCCCCccccchhhhHhhcCCCceeecCCCCcCHHHHh----hhhhhhhcccCChhhcccchhHHHHH----------- Confidence 110011 13444442222 22222 11222233333333333332221000 Q ss_pred CcEEEEeCCeEEEeeCCcccCCCCeEEEeCCCCc--------c--eeecceeeccccCCccc----eeeec-CCCCCccc Q lcl|NC_019442. 64 DDFWFAWPDVVDVIRSPIAQDPHGRIYYTDGRFP--------K--VTDATIATKGDGNHPAS----SYSLG-IPAPTTAP 128 (541) Q Consensus 64 ~~~W~~w~~~V~vv~spia~D~~~Rvy~t~~~~p--------k--~t~~~ia~~g~g~~p~a----~y~LG-Vp~P~~~p 128 (541) |+--=.| +|+.+++.+.- + -..++|.+-++-..... .-+.. .-+|+ +| T Consensus 225 ----------~n~~l~~------q~~~~~~n~~~~~~G~~v~~f~sa~G~i~L~~s~~m~~~~~ld~~~~~~~~apa-ap 287 (464) T protein:vir:80 225 ----------VNQQLDR------QVQVISDNGQNATMGFNVKGFNSARGFIRLHGSTVMELEQILDENRMQLPNAPQ-KA 287 (464) T ss_pred ----------HhhhcCc------eeEEEcCCCCcceeeeecccccccccceeccCccccCcccccccccccCCCCcC-Cc Confidence 0000000 12222221110 0 01122222221111000 00111 11342 34 Q ss_pred eEEecCCCCCC-CCCCCc-ccceEEEEEEEecCCCccCccccccceeecCCCCEEEEccccCCCCccccceEEEEEeecC Q lcl|NC_019442. 129 VCTVQQGGDVS-DDNPND-DETRFYTETFVSDYGEEGPPGPASLEVTLRTPGTAVQLTLSPVPLQNASIKRRRIYRSASG 206 (541) Q Consensus 129 v~~v~~~~~~~-~~~~~~-~~ty~Yv~T~V~~~GeEs~Ps~~S~~vtv~~~g~~v~l~~~p~~~~~~~i~~~RIYRs~t~ 206 (541) .++++..++.. ...+.+ .+.|.|++.-|+.+| ||+||.+ ..+++.+....|.|+.-+.+.-...-+.+.|||+.. T Consensus 288 svt~tv~~~~~g~f~~~~~~~~~~Ykv~~vn~~G-eS~ps~~-~~~ti~~~~~~V~l~it~~~~~~~~p~yv~IYR~~~- 364 (464) T protein:vir:80 288 TVKATLEAGTKGKFRDEDLTIDTEYKVVVVSDDA-ESAPSDV-ASVVIDDKKKQVKLEITINNMYQARPQYVAIYRKGL- 364 (464) T ss_pred eeEEEecCCcccCCccccccceeEEEEEEECCCC-cccccee-eeeeecCcccEEEEEEEeCCccccccceEEEEeecC- Confidence 44333222222 122222 456899999999988 9999874 456666777788877666443322236899999854 Q ss_pred CCceeEEEEEeecc-----ceEEEEeccccc---------ccCccchhhhhhCCCCCcceEEeccCcEEE-EEeCCEEEE Q lcl|NC_019442. 207 GGEADFLLVAELDA-----SVLSYTDKIPGK---------NLGPSLATWDYLPPPENMTGLCLMANGIAA-GFAGNEVMF 271 (541) Q Consensus 207 ~~~~~~~lVael~~-----~~~sf~D~~~~~---------~L~~~L~t~~~~~pP~~~~gL~~m~NGi~a-~f~Gn~l~f 271 (541) ++++|+|++.++. ++.+|.|.-.-- ++.+ .+-.|..-.+ +..||+-.+- ....-++|| T Consensus 365 -~~g~f~~i~rv~~~~~~~gt~t~vD~n~~IPgt~~vfVgems~--~ti~l~ellP----m~rlplA~~n~~~~waVl~Y 437 (464) T protein:vir:80 365 -ETGLFYQIARVPASKAVEGVITFIDVNDEIPETADVFVGELTP--SVVHLFELLP----MMRLPLAQVNASVTFAVLWY 437 (464) T ss_pred -CCCceeEEEEEeeccccCCceEEEecccccCCceeEeeecCCc--hHHHHHHHHH----hhhCCchhcccchhhhhhhh Confidence 3579999999954 456798853321 1111 1222221100 1122222111 112333443 Q ss_pred ecCCCcccCchhc-ccccCcceEEEEE--cCC Q lcl|NC_019442. 272 SEAYLPYAWPEVN-RHTTAEDIVAICP--LGT 300 (541) Q Consensus 272 Sep~~P~awp~~y-~~t~~~~Iv~ia~--v~~ 300 (541) --+.+ .-|+++ ++ ..|+.|++ +++ T Consensus 438 GaLal--~aPk~~~~i---kNv~~~~~~~~~~ 464 (464) T protein:vir:80 438 GALAL--RAPKKWARI---KNVKYIATGNVFN 464 (464) T ss_pred hHHhh--hccccceEE---EEEEEeecccCCC Confidence 32211 013332 11 23333332 222 No 31 >protein:vir:99311 Length: 463 # NCBI annotation: putative capsid protein # Family: family:all:2450 # MgeID: mge:1655 # MgeName: K # Cross-refs: genbank:acc:YP_024474;genbank:gi:48696433;genbank:GeneID:2948039 Probab=96.75 E-value=3.4e-05 Score=45.09 Aligned_cols=234 Identities=14% Similarity=0.171 Sum_probs=97.9 Q ss_pred CceEEecccccccc----------cccceecccccceEEEEeeecCCeeeeeecccccCccccccceeEEEECCcEEEEe Q lcl|NC_019442. 1 MPYIDITTMRGMMP----------RVVTSMLPEHSAVLAEDCHFRFGVITPERQISGVEKTFTIKPKTIFHYRDDFWFAW 70 (541) Q Consensus 1 m~~i~i~~f~G~~P----------r~~p~llp~~~a~~a~N~~~~~G~l~P~~~~~~v~~~~~~~~~Tif~~~~~~W~~w 70 (541) +++++|..==|... -+.-++||-.--.+.-|.+ .. .++ | .-+..++= T Consensus 202 ~Aa~~i~~~fGt~TD~~lp~~vka~f~~~~l~~qrv~~~~N~~----~~-------~~G----------~--~v~~f~s~ 258 (463) T protein:vir:99 202 EAAVRIGKGFGTATDAYMPIGVHADFVNSILGRQMQLMQDNSG----NV-------NTG----------Y--SVNGFYSS 258 (463) T ss_pred hhhhhhhcccCChhheecchHHHHHHHHHhcCceEEEEcCCCC----ce-------eee----------e--eccceeee Confidence 77777754222211 1122222222222111110 00 000 0 00001111 Q ss_pred CCeEEEeeCCcccCCCCeEEEeCCCCcceeecceeeccccCCccceeeecCCCCCcc-ceEEecCCCCCCCCCCCcccce Q lcl|NC_019442. 71 PDVVDVIRSPIAQDPHGRIYYTDGRFPKVTDATIATKGDGNHPASSYSLGIPAPTTA-PVCTVQQGGDVSDDNPNDDETR 149 (541) Q Consensus 71 ~~~V~vv~spia~D~~~Rvy~t~~~~pk~t~~~ia~~g~g~~p~a~y~LGVp~P~~~-pv~~v~~~~~~~~~~~~~~~ty 149 (541) .+.++.=.|-+.++ |... .++..+-.-+|.++ ++++|.......-..+.+...+ T Consensus 259 ~G~I~L~~s~~m~~------------~~il-------------~~~~~~~p~ap~~~~~tatv~~~~~~~~~~~~~~a~~ 313 (463) T protein:vir:99 259 RGFIKLHGSTVMEN------------ELIL-------------DESLQPLPNAPQPAKVTATVETKQKGAFENEEDRAGL 313 (463) T ss_pred eeeeeeCCceecCC------------cccc-------------cchhhcCCCCccCceeEEEEeeccCCCCCCcccccce Confidence 11121111111111 1100 11212233344333 2334432222222235567789 Q ss_pred EEEEEEEecCCCccCccccccceeecCCCCEEEEccccCCCCccccceEEEEEeecCCCceeEEEEEeecc------ceE Q lcl|NC_019442. 150 FYTETFVSDYGEEGPPGPASLEVTLRTPGTAVQLTLSPVPLQNASIKRRRIYRSASGGGEADFLLVAELDA------SVL 223 (541) Q Consensus 150 ~Yv~T~V~~~GeEs~Ps~~S~~vtv~~~g~~v~l~~~p~~~~~~~i~~~RIYRs~t~~~~~~~~lVael~~------~~~ 223 (541) .|++..++.+| ||+||++ ...|+-..+..|.|+.-+++..+.+.+.+.|||+. .++++|++++.++. +++ T Consensus 314 ~Y~vv~~s~~g-eS~pS~i-vtaT~a~~~~gv~l~It~~a~~~~~~~~v~IYR~~--~~~g~~~~i~rv~v~~an~~gtt 389 (463) T protein:vir:99 314 SYKVVVNSDDA-QSAPSEE-VTATVSNVDDGVKLSINVNAMYQQQPQFVSIYRQG--KETGMYFLIKRVPVKDAQEDGTI 389 (463) T ss_pred EEEEEEECCCC-Ccccchh-eeeeeeeccceEEEEEEecCCcccceeEEEEEeec--CCCCcceeEEEEEecccCCCceE Confidence 99999777655 9999987 45555555666777777667777888999999984 45679999999833 467 Q ss_pred EEEeccccc---------ccCccchhhhhhCCCCCcceEEeccCcEEEE-EeCCEEEEecCCCcccCchhc-ccccCcce Q lcl|NC_019442. 224 SYTDKIPGK---------NLGPSLATWDYLPPPENMTGLCLMANGIAAG-FAGNEVMFSEAYLPYAWPEVN-RHTTAEDI 292 (541) Q Consensus 224 sf~D~~~~~---------~L~~~L~t~~~~~pP~~~~gL~~m~NGi~a~-f~Gn~l~fSep~~P~awp~~y-~~t~~~~I 292 (541) +|+|.-.-- ++.+ .|-.|..-.+ |..||+-.+-+ ...-++||--+..- -|+++ ++.. T Consensus 390 t~~D~n~~IPgt~~vfVgems~--~ti~~~ellP----m~klpLA~~~~~~~waVl~YGaLal~--~Pk~~~~ikN---- 457 (463) T protein:vir:99 390 VFVDKNETLPETADVFVGEMSP--QVVHLFELLP----MMKLPLAQINASITFAVLWYGALALR--APKKWARIKN---- 457 (463) T ss_pred EEeecccccCCceeEeeeccCc--hhhhhHhhhH----hhhCCchhccchhhhHHHHhhHHHhh--ccccceEEEE---- Confidence 888864321 1111 1222222110 12222221111 11233333311110 12221 1111 Q ss_pred EEEEEc Q lcl|NC_019442. 293 VAICPL 298 (541) Q Consensus 293 v~ia~v 298 (541) +.-.++ T Consensus 458 v~~~~v 463 (463) T protein:vir:99 458 VRYIAV 463 (463) T ss_pred eeEecC Confidence 111111 No 32 >protein:vir:95603 Length: 463 # NCBI annotation: ORF016 # Family: family:all:2450 # MgeID: mge:1577 # MgeName: G1 # Cross-refs: genbank:acc:YP_240903;genbank:gi:66394965;genbank:GeneID:5132544 Probab=96.75 E-value=3.4e-05 Score=45.09 Aligned_cols=234 Identities=14% Similarity=0.171 Sum_probs=97.9 Q ss_pred CceEEecccccccc----------cccceecccccceEEEEeeecCCeeeeeecccccCccccccceeEEEECCcEEEEe Q lcl|NC_019442. 1 MPYIDITTMRGMMP----------RVVTSMLPEHSAVLAEDCHFRFGVITPERQISGVEKTFTIKPKTIFHYRDDFWFAW 70 (541) Q Consensus 1 m~~i~i~~f~G~~P----------r~~p~llp~~~a~~a~N~~~~~G~l~P~~~~~~v~~~~~~~~~Tif~~~~~~W~~w 70 (541) +++++|..==|... -+.-++||-.--.+.-|.+ .. .++ | .-+..++= T Consensus 202 ~Aa~~i~~~fGt~TD~~lp~~vka~f~~~~l~~qrv~~~~N~~----~~-------~~G----------~--~v~~f~s~ 258 (463) T protein:vir:95 202 EAAVRIGKGFGTATDAYMPIGVHADFVNSILGRQMQLMQDNSG----NV-------NTG----------Y--SVNGFYSS 258 (463) T ss_pred hhhhhhhcccCChhheecchHHHHHHHHHhcCceEEEEcCCCC----ce-------eee----------e--eccceeee Confidence 77777754222211 1122222222222111110 00 000 0 00001111 Q ss_pred CCeEEEeeCCcccCCCCeEEEeCCCCcceeecceeeccccCCccceeeecCCCCCcc-ceEEecCCCCCCCCCCCcccce Q lcl|NC_019442. 71 PDVVDVIRSPIAQDPHGRIYYTDGRFPKVTDATIATKGDGNHPASSYSLGIPAPTTA-PVCTVQQGGDVSDDNPNDDETR 149 (541) Q Consensus 71 ~~~V~vv~spia~D~~~Rvy~t~~~~pk~t~~~ia~~g~g~~p~a~y~LGVp~P~~~-pv~~v~~~~~~~~~~~~~~~ty 149 (541) .+.++.=.|-+.++ |... .++..+-.-+|.++ ++++|.......-..+.+...+ T Consensus 259 ~G~I~L~~s~~m~~------------~~il-------------~~~~~~~p~ap~~~~~tatv~~~~~~~~~~~~~~a~~ 313 (463) T protein:vir:95 259 RGFIKLHGSTVMEN------------ELIL-------------DESLQPLPNAPQPAKVTATVETKQKGAFENEEDRAGL 313 (463) T ss_pred eeeeeeCCceecCC------------cccc-------------cchhhcCCCCccCceeEEEEeeccCCCCCCcccccce Confidence 11121111111111 1100 11212233344333 2334432222222235567789 Q ss_pred EEEEEEEecCCCccCccccccceeecCCCCEEEEccccCCCCccccceEEEEEeecCCCceeEEEEEeecc------ceE Q lcl|NC_019442. 150 FYTETFVSDYGEEGPPGPASLEVTLRTPGTAVQLTLSPVPLQNASIKRRRIYRSASGGGEADFLLVAELDA------SVL 223 (541) Q Consensus 150 ~Yv~T~V~~~GeEs~Ps~~S~~vtv~~~g~~v~l~~~p~~~~~~~i~~~RIYRs~t~~~~~~~~lVael~~------~~~ 223 (541) .|++..++.+| ||+||++ ...|+-..+..|.|+.-+++..+.+.+.+.|||+. .++++|++++.++. +++ T Consensus 314 ~Y~vv~~s~~g-eS~pS~i-vtaT~a~~~~gv~l~It~~a~~~~~~~~v~IYR~~--~~~g~~~~i~rv~v~~an~~gtt 389 (463) T protein:vir:95 314 SYKVVVNSDDA-QSAPSEE-VTATVSNVDDGVKLSINVNAMYQQQPQFVSIYRQG--KETGMYFLIKRVPVKDAQEDGTI 389 (463) T ss_pred EEEEEEECCCC-Ccccchh-eeeeeeeccceEEEEEEecCCcccceeEEEEEeec--CCCCcceeEEEEEecccCCCceE Confidence 99999777655 9999987 45555555666777777667777888999999984 45679999999833 467 Q ss_pred EEEeccccc---------ccCccchhhhhhCCCCCcceEEeccCcEEEE-EeCCEEEEecCCCcccCchhc-ccccCcce Q lcl|NC_019442. 224 SYTDKIPGK---------NLGPSLATWDYLPPPENMTGLCLMANGIAAG-FAGNEVMFSEAYLPYAWPEVN-RHTTAEDI 292 (541) Q Consensus 224 sf~D~~~~~---------~L~~~L~t~~~~~pP~~~~gL~~m~NGi~a~-f~Gn~l~fSep~~P~awp~~y-~~t~~~~I 292 (541) +|+|.-.-- ++.+ .|-.|..-.+ |..||+-.+-+ ...-++||--+..- -|+++ ++.. T Consensus 390 t~~D~n~~IPgt~~vfVgems~--~ti~~~ellP----m~klpLA~~~~~~~waVl~YGaLal~--~Pk~~~~ikN---- 457 (463) T protein:vir:95 390 VFVDKNETLPETADVFVGEMSP--QVVHLFELLP----MMKLPLAQINASITFAVLWYGALALR--APKKWARIKN---- 457 (463) T ss_pred EEeecccccCCceeEeeeccCc--hhhhhHhhhH----hhhCCchhccchhhhHHHHhhHHHhh--ccccceEEEE---- Confidence 888864321 1111 1222222110 12222221111 11233333311110 12221 1111 Q ss_pred EEEEEc Q lcl|NC_019442. 293 VAICPL 298 (541) Q Consensus 293 v~ia~v 298 (541) +.-.++ T Consensus 458 v~~~~v 463 (463) T protein:vir:95 458 VRYIAV 463 (463) T ss_pred eeEecC Confidence 111111 No 33 >protein:vir:103790 Length: 768 # NCBI annotation: hypothetical protein # Family: family:all:780 # MgeID: mge:1645 # MgeName: BcepC6B # Cross-refs: genbank:acc:YP_024932;genbank:gi:48697202;genbank:GeneID:2846114 Probab=96.67 E-value=0.00042 Score=39.07 Aligned_cols=516 Identities=15% Similarity=0.113 Sum_probs=194.9 Q ss_pred CceEE--ecccccccccccceecc-------cccceEEEEeeec-CCeeeeeecccccCccc-cccceeEEEE---CCcE Q lcl|NC_019442. 1 MPYID--ITTMRGMMPRVVTSMLP-------EHSAVLAEDCHFR-FGVITPERQISGVEKTF-TIKPKTIFHY---RDDF 66 (541) Q Consensus 1 m~~i~--i~~f~G~~Pr~~p~llp-------~~~a~~a~N~~~~-~G~l~P~~~~~~v~~~~-~~~~~Tif~~---~~~~ 66 (541) |+.|. ++.|.|=. |.|+|.- .+++..++|+-.. .|.|+=.-.-+-++... +....-|+++ +++. T Consensus 1 M~~~~~~~~~F~~Ge--lsP~l~~r~Dl~ry~~~~~~~~N~~~~~~gGl~rRpGt~fv~~~~~~~~~~~lipf~~~~~~~ 78 (768) T protein:vir:10 1 MPKAAPQQVSFDAGE--LSPLLGARVDLAKYPNGCQVMENFIATVQGPAIRRGGKRFVAATKDSTKQSWLLPFIVADGIA 78 (768) T ss_pred CCcceeeeeeccCce--echhhcccchHHHHHHHHhhhhcceeeecCCceecCchhhhhhhcCCCCCeeEEEEEecCccE Confidence 99654 66786532 4444432 3566777787432 23332211112222111 1111223222 1222 Q ss_pred EEE---------eC--------CeEEEeeCCcccCC---------------CCeEEEeCCCCcceeec----------ce Q lcl|NC_019442. 67 WFA---------WP--------DVVDVIRSPIAQDP---------------HGRIYYTDGRFPKVTDA----------TI 104 (541) Q Consensus 67 W~~---------w~--------~~V~vv~spia~D~---------------~~Rvy~t~~~~pk~t~~----------~i 104 (541) |+- |. +..-.+..|-..+. -..+|++.-..|+-+.. .+ T Consensus 79 y~l~fg~~~irv~~~~g~v~~~~~~~e~~tp~~~~~l~~~~~~~~L~~~q~aD~~~i~~~~~~p~~l~r~~~~~w~l~~~ 158 (768) T protein:vir:10 79 YMLEFGDHYIRFFVNRGQLVNAGAPVEIATPYALADLTTEDGTFAIRATQSADTMYLFHGGYPTQKLLRTSATTFSLQPV 158 (768) T ss_pred EEEEEcCCEEEEEECCcEEEecCeeEEEEcCCCcceeecccccceeEEEeecCEEEEEcCCcceeEEEEecCCCceeEEe Confidence 221 22 11112333321111 12677777555542211 11 Q ss_pred eeccccCCccc----eeeecCCCCCccceEEecCCCCCCCCCCCcccceEEEE------------EEEecCCCccCcccc Q lcl|NC_019442. 105 ATKGDGNHPAS----SYSLGIPAPTTAPVCTVQQGGDVSDDNPNDDETRFYTE------------TFVSDYGEEGPPGPA 168 (541) Q Consensus 105 a~~g~g~~p~a----~y~LGVp~P~~~pv~~v~~~~~~~~~~~~~~~ty~Yv~------------T~V~~~GeEs~Ps~~ 168 (541) .+.+ ++|..- .+.+..+.-+..- +....+ ....+...+...|.- .+.......+.+.. T Consensus 159 ~~~~-gp~~~~n~~~~vti~~s~~~~~~--T~tasa--~~~~~~~v~~~~~l~~~~~~~~~~~~~~~~~g~~~~~~~~~- 232 (768) T protein:vir:10 159 TFVG-GPFAAVNSDNNVRVHASAGTGAV--TLVASA--SVFRPSDVGTLFYLEQEDNSFVKPWVVHQKIGPSELRRVGD- 232 (768) T ss_pred eecC-ccccccccceeEEEEecccceeE--EEeecC--CccchhhcceeeeeeeeccccccccEEEEeeeeEEEEecCC- Confidence 1110 111000 0111111111100 110000 000111111111100 00000000110000 Q ss_pred ccceeecCCCCEEEE--ccccCCCCccc----------------c-ceEEEEEeecCCCceeEEEEEeeccc-eEEEE-- Q lcl|NC_019442. 169 SLEVTLRTPGTAVQL--TLSPVPLQNAS----------------I-KRRRIYRSASGGGEADFLLVAELDAS-VLSYT-- 226 (541) Q Consensus 169 S~~vtv~~~g~~v~l--~~~p~~~~~~~----------------i-~~~RIYRs~t~~~~~~~~lVael~~~-~~sf~-- 226 (541) .....+..++...+. +..|....+.. . ..+|.++. +.+ ...+.+.... ..... T Consensus 233 ~~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~---~~~i~~~~~~t~~~~~~~ 307 (768) T protein:vir:10 233 RVYLCTAVGTATPQVTGTETPTHTSGSRWDGTGQDESATDEYGSIGAEWEYQHS--GYG---TVLITGYTNDQVVTGTVA 307 (768) T ss_pred ceEEeeeeccccccccceeccccccCceEEEecCcccccccccccceEEEEEEc--CCc---eEEEEEecCCeeEEeeee Confidence 000001111111111 11111111100 0 01222210 000 1112222111 11110 Q ss_pred -----ecccc-cccCccchhhhhhCCCCCcc----eEEeccCcEEEEEeCCEEEEecCCCcccC-chh---------ccc Q lcl|NC_019442. 227 -----DKIPG-KNLGPSLATWDYLPPPENMT----GLCLMANGIAAGFAGNEVMFSEAYLPYAW-PEV---------NRH 286 (541) Q Consensus 227 -----D~~~~-~~L~~~L~t~~~~~pP~~~~----gL~~m~NGi~a~f~Gn~l~fSep~~P~aw-p~~---------y~~ 286 (541) |+... ..+...-.++.|...+..-. ..+.+.+++|.-..++.||+|.+..++.| +.. --+ T Consensus 308 ~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~~~g~Ps~v~f~q~RL~f~~~~~v~~Srtgd~~nF~~~s~~~~~DdD~I~~ 387 (768) T protein:vir:10 308 TNDPADPGMLPNTVVTLTGTYKWARSLFNSTDGFPQMGTFWRNRLCLMRDRWLAMSVSADFETFKTKDADQQTDDSAIVQ 387 (768) T ss_pred eecCcccccccccccccCCCcccccCCCcCCCCCceEEEEEeeeEEEeeCCEEEEEcccccccccccccccccCCccEEE Confidence 11110 11122333455655543221 23456666666667999999999999997 111 111 Q ss_pred ccC----cceEEEEEcCCcEEEEEcCCEEEEEc------cCcccceEEeecccccccccchheeCCccEEEecCCc---- Q lcl|NC_019442. 287 TTA----EDIVAICPLGTSLVVATKGEPYLFSG------VSPSTISGSRIPSMQACLSRRSMVAMEGFVLYAGTNG---- 352 (541) Q Consensus 287 t~~----~~Iv~ia~v~~~lvV~T~~~py~l~G------~~p~s~~~~~l~~~~pCvs~rsiv~~~~~v~y~s~dG---- 352 (541) ++. ..|.-+.+++ .|+|+|++.-|.|+| .+|.+..+.+ ....+|- .=..+.+|+.++|+++.| T Consensus 388 ~~ss~~~~~i~~~v~~~-~L~i~T~~~q~~l~~~~~~~~lTP~~~~i~~-~s~~g~~-~~~Pv~vG~~v~fv~~~g~~vr 464 (768) T protein:vir:10 388 QLNARQLNKLAWMVESD-SLLIGMTGDEWVIGPANASQPVSAANLNAAR-RTSYGSK-RIQPVQVGGTIMFVQKAGRKLR 464 (768) T ss_pred EecCCcceeEEEEeecC-cEEEEecCceEEEecCCCCcccccceEEEEE-eehhccc-ccccEEeCCeEEEEcCCCCEEE Confidence 111 3367888884 799999999999987 4778865554 4556783 335578999999999999 Q ss_pred -EEEEeCCC-----ceEEEecccCChhHhhhhcCcceEEEEEE--cCeEEEEEecCCCccceEEEccCC--ce---eEEE Q lcl|NC_019442. 353 -LVSVDVNG-----NTALATEKIISPEQWQSQFNPASIVAYSW--RGEYIACYTKPDGKQDVFVFSPVN--MD---IRYL 419 (541) Q Consensus 353 -Lv~~~~~G-----~~~~vT~~~~~~~~W~~~l~P~ti~a~~~--eG~Y~~~y~~~~g~~~~~i~d~~~--~~---~~~~ 419 (541) +..-..++ ..+++...||.-..++ ++.|+++.+ +..=+.++.+.+|+--++-|+... ++ -.+. T Consensus 465 e~~y~~~~d~y~a~DlT~~a~hl~~~~~~~----~~~i~~~a~~~~p~~v~~~v~~dg~l~~~ty~~e~~~q~v~aW~~~ 540 (768) T protein:vir:10 465 DFKYDFSSDNYVSTDVTKIADHITRGRAGT----NSGIMSLCFQQEPHSVVWAARADGQLIGCTYDEEAGRSDVYGWHRH 540 (768) T ss_pred EEEeeeecCceecchhhhhhhhhccccCcc----ccceeeEEEeecCCeEEEEEecCCeEEEEEEecCCCceeEEeEEEE Confidence 21111111 1222335555433222 244543333 223345555666665666677653 22 2222 Q ss_pred eecc----cEE-EE--EecCCEEEEEECC----EE---EEecCCCCceeEEEEcceEEeCcccceeEEEEe---e-CCCc Q lcl|NC_019442. 420 STPF----DCA-WV--DLAKDMMRVVTGD----KM---SVLAGGSLPSTIRWHSKIFSLPERTSFSCIRVK---S-PAPE 481 (541) Q Consensus 420 ~~~~----d~~-~~--~~~~d~LY~~~g~----~i---~~~~~g~~~~~~~WrSk~f~~~~~~~~~~~~V~---~-~~~~ 481 (541) .+.. .++ .. +-.+|.||++.-+ .. -|.- ........-....|.++..+.+....+. + +-.+ T Consensus 541 ~~~~g~v~~v~~i~~~~g~~d~l~~~v~r~~~g~~~~~ie~l-~~~~~~~~~~~~~~~~D~~~~~~~~~~~~~~gl~~le 619 (768) T protein:vir:10 541 PDANGFVECVASMPAPDGASDDLWVIVRRQVNGQTVRYVEYL-NPALQDDEPQSSAFYVDAGITYNGVPTSTIAGLGHLE 619 (768) T ss_pred EcCCCEEEEEEEEecCCCCccEEEEEEEecCCCeEEEEEEec-CcccccccccccceEeccccccCCcceeeecCCCCcc Confidence 2221 111 11 2236899997532 21 1211 1111222233455666654443322221 1 1112 Q ss_pred cEEEEEEECCceeEeecccccCCcceEccCc-----ccceEEEE---------------EEecceEEEEEee--cchhhc Q lcl|NC_019442. 482 RVGITIMADDVPVIHFAPGTFKGSVVRLPAA-----TGQNWQVM---------------VSGFGQVERITLS--TSMSEM 539 (541) Q Consensus 482 ~~~v~~~~d~~~~~~~~~~~~~~~~~rLP~~-----~~~~w~ie---------------i~g~~~V~~i~la--~s~~EL 539 (541) --.+.+..||..... ....+-.+.||.- -+...+-+ ..+..+|.++.+. .|.. + T Consensus 620 g~~v~v~~dG~~~~~---~~v~~g~itl~~~~~~v~vG~~y~s~~~~~p~~~~~~~gs~~~~~~ri~r~~v~~~~S~~-~ 695 (768) T protein:vir:10 620 GVTVAVLTDGAVHPS---RTVTAGAITLDWSASIVHIGVPTTCRIQTMQLNAGAANGTAQGKTKRVTNIATRFSRSLG-G 695 (768) T ss_pred cceEEEEECCEeccC---ceecCCEEEeCCCCceEEEeEeeeEEEEecceEeecCCccccccceEEEEEEEEEecccc-e Confidence 224556677653311 1111112222210 01111111 1234445554443 2221 1 Q ss_pred CC Q lcl|NC_019442. 540 PV 541 (541) Q Consensus 540 ~~ 541 (541) -+ T Consensus 696 ~~ 697 (768) T protein:vir:10 696 VV 697 (768) T ss_pred EE Confidence 11 No 34 >protein:vir:1778 Length: 680 # NCBI annotation: tail protein A # Family: family:all:825 # MgeID: mge:38 # MgeName: P60 # Cross-refs: genbank:acc:NP_570344;genbank:gi:18640503;genbank:GeneID:932716 Probab=96.66 E-value=0.00043 Score=39.05 Aligned_cols=400 Identities=14% Similarity=0.045 Sum_probs=149.1 Q ss_pred CceEEecccccccccccceecccccceEEEEeeecCCeeeeeec--ccccCc-----cc-cccceeEEEECCcEEEEeCC Q lcl|NC_019442. 1 MPYIDITTMRGMMPRVVTSMLPEHSAVLAEDCHFRFGVITPERQ--ISGVEK-----TF-TIKPKTIFHYRDDFWFAWPD 72 (541) Q Consensus 1 m~~i~i~~f~G~~Pr~~p~llp~~~a~~a~N~~~~~G~l~P~~~--~~~v~~-----~~-~~~~~Tif~~~~~~W~~w~~ 72 (541) ++....-.+++..=...+..+. ..+....++++.-..-+. -.++.. .+ .....+-.-|.++.+..+.. T Consensus 219 la~~l~~~~~~~~~~~g~~~~~----~y~~~~~l~~tg~~~~~~~~t~~v~~~G~~y~IsI~~~~~~~~~~~~~s~~~~t 294 (680) T protein:vir:17 219 LSFRVKVEARAFLVDDGEEYGH----NYIPYVTLLTPGNNTSPFPDTIRVDVSGEGWDIKVTKQIQSKVYANLGTAQFTT 294 (680) T ss_pred eeeeeeeccceeeecCCCceEE----EEeeEEEEecCCccccccCceEEEecccceeEEEEccceeeEeccCccceeeee Confidence 2222111112111000000000 000011111110000000 000000 00 00000111122222222221 Q ss_pred eEEEee-----CCccc-------C--------CCCeEEEeCCCCcceeecceeeccccCCcc-ceeeecCCCCCccceEE Q lcl|NC_019442. 73 VVDVIR-----SPIAQ-------D--------PHGRIYYTDGRFPKVTDATIATKGDGNHPA-SSYSLGIPAPTTAPVCT 131 (541) Q Consensus 73 ~V~vv~-----spia~-------D--------~~~Rvy~t~~~~pk~t~~~ia~~g~g~~p~-a~y~LGVp~P~~~pv~~ 131 (541) .++-.. .-|++ . --..+|+-....+..+...+-+.+....-. ....-.|..++-+|..+ T Consensus 295 ~~~~~a~~at~~~Ia~~L~~~i~~~~~~~~~~~g~~i~i~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~v~~~~~Lp~~a 374 (680) T protein:vir:17 295 PVDQSGGGASTSDIVTGLSAAINGLGTFTAESIGNVIRVRYSDPTRTDEFTMSARGGTSGTGLESIKYSVDTLAELPTKC 374 (680) T ss_pred ccCCcccceeHHHHHHHHHHhhcccCcEEEEECCCEEEEEeccCCCceEEEeeccCCCCceeeeeeeeeecccccccccc Confidence 111000 00100 0 001345532222222333333322111100 01111233333222211 Q ss_pred ecCCCCCCCCCCCcccceEEEEEEEecCCCccCccccccceeecCCCCEEEEccccCCCCccccceEEEEEeecCCCcee Q lcl|NC_019442. 132 VQQGGDVSDDNPNDDETRFYTETFVSDYGEEGPPGPASLEVTLRTPGTAVQLTLSPVPLQNASIKRRRIYRSASGGGEAD 211 (541) Q Consensus 132 v~~~~~~~~~~~~~~~ty~Yv~T~V~~~GeEs~Ps~~S~~vtv~~~g~~v~l~~~p~~~~~~~i~~~RIYRs~t~~~~~~ 211 (541) - .+-.+...+........|.+.|-...+.+..+++. .=++..++|....+... -.-.+|||...+ . T Consensus 375 ~-~g~~v~v~~~~~~~~~~Yyv~~~~~~~~~~~~~~~-~W~E~~~~~~~~~~~~~--------tmp~~l~r~~~g----~ 440 (680) T protein:vir:17 375 W-NDYQVAVRNTQDTEVDDYYVKFETDVEDADVPGSG-YWVETVKNGDDGGLVDD--------TMPHVLVRNALG----D 440 (680) T ss_pred C-CCcEEEEEeCCCCcccceEEEEeccCcccCccccc-ceeecccCcccceeccC--------cceEEEEEccCc----e Confidence 0 00000000111223456777776554434333321 12223344443222221 124789995322 2 Q ss_pred EEEEE-eeccceEEEEecccccccCccchhhhh-hCCCCCcceEEeccCcEEEEEeCCEEEEecCCCcccC-chh-cccc Q lcl|NC_019442. 212 FLLVA-ELDASVLSYTDKIPGKNLGPSLATWDY-LPPPENMTGLCLMANGIAAGFAGNEVMFSEAYLPYAW-PEV-NRHT 287 (541) Q Consensus 212 ~~lVa-el~~~~~sf~D~~~~~~L~~~L~t~~~-~~pP~~~~gL~~m~NGi~a~f~Gn~l~fSep~~P~aw-p~~-y~~t 287 (541) |.+-+ +.......|.|....++.....|+..- -..| ..+ .|.++++.-..++.||+|.+..++.| +.. ..++ T Consensus 441 f~~~~~~~~~~~~~~~~r~~Gdd~tnp~psF~~~G~~p---~~v-~f~q~RL~f~s~~~v~~Srtgd~~nF~~~t~~~~~ 516 (680) T protein:vir:17 441 FTFSSLNNSSYGKTWADRSVGSEDTNPHPTFTESGNGI---YGM-FMYKNRLGFLTQDAVIMSQVGDYFNFYATSGVTIS 516 (680) T ss_pred eEEEeeccccccccccccccCCcccCCCcccccCCCCc---eEE-EEEcceEEEeeCCeEEEEccCCcccccccccccCC Confidence 44432 122223356665544444333333211 0112 223 45566665557999999999999997 331 1111 Q ss_pred cC------------cceEEEEEcCCcEEEEEcCCEEEEEcc----CcccceEEeecccccccccchheeCCccEEEecCC Q lcl|NC_019442. 288 TA------------EDIVAICPLGTSLVVATKGEPYLFSGV----SPSTISGSRIPSMQACLSRRSMVAMEGFVLYAGTN 351 (541) Q Consensus 288 ~~------------~~Iv~ia~v~~~lvV~T~~~py~l~G~----~p~s~~~~~l~~~~pCvs~rsiv~~~~~v~y~s~d 351 (541) =+ ..|.-+.+++..|+++|++.-|.++|. +|.+..+.+ ...-.|-+.=..+.+|+.++|+++. T Consensus 517 DdD~I~~~~ss~~~~~i~~~v~~~~~L~l~t~g~q~~ls~~~~~lTP~~~~i~~-~s~~~~~~~~~Pv~vG~~v~Fv~~~ 595 (680) T protein:vir:17 517 DADPIDMATSDTKPVKLEAAISSTSGAILFGNQAQFRLSSPDESFGPKTATLDK-ISNYTYESKADPVQTGVSMIFPTNM 595 (680) T ss_pred CCccEEEEEcCCcceeeeEEeecCCcEEEEecCeEEEEecCCceecceeEEEEE-EEeecccCCCCceEeCCeEEEeecC Confidence 11 446678899999999999999999873 355544332 3334576666688999999999998 Q ss_pred cEE-EEeCCCceEEEecccCChhHhhhhcCcceEEEEEE-cC-eEEEEEecCCCccceEEEccCCceeEEEeecccEEEE Q lcl|NC_019442. 352 GLV-SVDVNGNTALATEKIISPEQWQSQFNPASIVAYSW-RG-EYIACYTKPDGKQDVFVFSPVNMDIRYLSTPFDCAWV 428 (541) Q Consensus 352 GLv-~~~~~G~~~~vT~~~~~~~~W~~~l~P~ti~a~~~-eG-~Y~~~y~~~~g~~~~~i~d~~~~~~~~~~~~~d~~~~ 428 (541) |=. .+.-= .....+++--.. ..|+.+-.+ +| ..+.. .-. ..++-.+|. T Consensus 596 g~~s~vre~-~y~~~~d~y~a~--------DlT~~a~hl~~g~v~~~~-------------~~~-------~~~~~~~~~ 646 (680) T protein:vir:17 596 GTYSSVYEL-STESAKGTPVIE--------DSSRVIPRLIPSGLTWST-------------ASM-------NNDTVFFGN 646 (680) T ss_pred CCcceEEEE-eeeeccCceehh--------hHHHHHHHhcCCceEEEE-------------eeC-------CCCeEEEEE Confidence 721 01000 011111111111 111111100 11 11000 000 112234445 Q ss_pred EecCCEEEEEE------CCEEEEec----CCCCce Q lcl|NC_019442. 429 DLAKDMMRVVT------GDKMSVLA----GGSLPS 453 (541) Q Consensus 429 ~~~~d~LY~~~------g~~i~~~~----~g~~~~ 453 (541) +..+++||..+ .+++.-|. .+. .+ T Consensus 647 ~~~~~~l~~~~yl~~~~e~~v~aW~rw~~~~~-d~ 680 (680) T protein:vir:17 647 AKKGRNVYVFRFFNEGQERKVAGWTTWYYEDQ-DH 680 (680) T ss_pred EcCCCEEEEEEEeeCCCceEEEEEEEEecCCC-CC Confidence 55666776654 24455553 233 23 No 35 >protein:vir:100022 Length: 976 # NCBI annotation: T7-like tail tubular protein B # Family: family:all:825 # MgeID: mge:1604 # MgeName: P-SSP7 # Cross-refs: genbank:acc:YP_214208;genbank:gi:61806431;genbank:GeneID:3294702 Probab=95.63 E-value=0.0017 Score=35.74 Aligned_cols=511 Identities=14% Similarity=0.106 Sum_probs=178.1 Q ss_pred Cce-----EEeccccccccccccee-----------cccccceE-------EEEeeecCCeeeeeecccccCccccccce Q lcl|NC_019442. 1 MPY-----IDITTMRGMMPRVVTSM-----------LPEHSAVL-------AEDCHFRFGVITPERQISGVEKTFTIKPK 57 (541) Q Consensus 1 m~~-----i~i~~f~G~~Pr~~p~l-----------lp~~~a~~-------a~N~~~~~G~l~P~~~~~~v~~~~~~~~~ 57 (541) |.. .+..+.-...|.+...- -|.+.... .-|....++...+.+.++ .+........ T Consensus 216 ~~q~~~~~s~~~G~~~~~~~v~~~~f~~~~G~~~~i~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~-~~~~~~~~~g 294 (976) T protein:vir:10 216 SNSTRCDDSAGDGRDAYAPNVGTKVFNVTDGASLTDEANSGSYTYTIDVKDSSNNSVNRGVNLYFRIRT-VGQSVPFTTG 294 (976) T ss_pred hhhhcccccccccccccCceeeeeEEEeccCccceEcCCcceEEEEeeccccceEEeecCCceEEEEcc-ccceeecccc Confidence 111 01111111122221111 12221111 123344444555554433 2233333444 Q ss_pred eEEEECCcEEEEeCCeEEEeeCCcccCCCCeEEEeC-CCCcceeecceeeccccCCccceeeecCCCCCccceEEecCCC Q lcl|NC_019442. 58 TIFHYRDDFWFAWPDVVDVIRSPIAQDPHGRIYYTD-GRFPKVTDATIATKGDGNHPASSYSLGIPAPTTAPVCTVQQGG 136 (541) Q Consensus 58 Tif~~~~~~W~~w~~~V~vv~spia~D~~~Rvy~t~-~~~pk~t~~~ia~~g~g~~p~a~y~LGVp~P~~~pv~~v~~~~ 136 (541) |-.+-.+++++..+..+.+-.+...-.+....+..- ++....+...+... ..+...+.++|.+.|.-...... T Consensus 295 t~~~~~~~Y~~~y~~~~~v~~~~~g~~~~~~~~V~v~g~~Y~it~~~~~~~------~~~a~~~~~~~~~t~~d~~~~~~ 368 (976) T protein:vir:10 295 SGSSATTTYQARYTTTFDLLYGGTGWQEGDYFYVWMKDGYYKITVEAISTA------NVQANLGLIRPNPTPFDTETAVT 368 (976) T ss_pred cccceeeeeeEEEEeEEEEecCCCCcccCceEEEEccccceeeEEEEeece------eEEeccccccCcCCcCccccccc Confidence 444445566666665555444443222222332221 22222222111110 01111122222111110000000 Q ss_pred -----------C---CCCCC--CCcccceEEEE----EE----------------EecCCCccCccccccceeecCCCCE Q lcl|NC_019442. 137 -----------D---VSDDN--PNDDETRFYTE----TF----------------VSDYGEEGPPGPASLEVTLRTPGTA 180 (541) Q Consensus 137 -----------~---~~~~~--~~~~~ty~Yv~----T~----------------V~~~GeEs~Ps~~S~~vtv~~~g~~ 180 (541) . ....+ -...+++.|+. +| |+..++- |+.+ .+|-. T Consensus 369 ~~~ia~~L~~~l~a~~~~~g~tv~~~g~~~~i~~~~~~~~~s~~~~~~~~~~~~~V~~~~~L--P~~~-------~~g~~ 439 (976) T protein:vir:10 369 AESIIGDIRTAIIATGNFTSANVQQIGTGLYVTRPSGTFNVTAPSSDLLRVMSGEVANVDDL--PSQC-------KHGYV 439 (976) T ss_pred HHHHHHHHHHhhcccccccceEEEEcCcEEEEEecCcceEecCCCceeEEEEEeeecchhhh--hhhc-------cCCcE Confidence 0 00000 00011222221 12 3332221 1110 01111 Q ss_pred EEEccccCCCCcc--------ccceEEEEEeecCCC-ce-------eEEEEEeeccceEEEEec-------ccccccCcc Q lcl|NC_019442. 181 VQLTLSPVPLQNA--------SIKRRRIYRSASGGG-EA-------DFLLVAELDASVLSYTDK-------IPGKNLGPS 237 (541) Q Consensus 181 v~l~~~p~~~~~~--------~i~~~RIYRs~t~~~-~~-------~~~lVael~~~~~sf~D~-------~~~~~L~~~ 237 (541) |.+.......... +-..-++|+-..+-+ .. -+.|+.+ +..+|.-. ...++..-. T Consensus 440 v~V~~~~~~~d~yyv~~~~~~~~~~~~~w~E~~~~g~~~g~~~~tmP~~l~~~---~~g~f~~~~~~w~~r~vGd~~tnp 516 (976) T protein:vir:10 440 VKVANSEADADDYYVKFFGHNNRDGDGVWEECAKPSRNIEFDKGTMPIQLVRQ---ANGTFTVSQATWQNAEVGDELTNP 516 (976) T ss_pred EEEecCCCCceeEEEEeeccccccccceEEEeeccccccccccccccEEEEec---ccCeEEeeeccccccccCCcccCc Confidence 2211110000000 000011121000000 00 0112211 11111111 111111112 Q ss_pred chhhhhhCCCCCcceEEeccCcEEEEEeCCEEEEecCCCcccC-chh-------ccccc------CcceEEEEEcCCcEE Q lcl|NC_019442. 238 LATWDYLPPPENMTGLCLMANGIAAGFAGNEVMFSEAYLPYAW-PEV-------NRHTT------AEDIVAICPLGTSLV 303 (541) Q Consensus 238 L~t~~~~~pP~~~~gL~~m~NGi~a~f~Gn~l~fSep~~P~aw-p~~-------y~~t~------~~~Iv~ia~v~~~lv 303 (541) .|+..+. +| .++ .|.+++|.-..++.||+|.+..++.| +.. ..+.+ ...|.-+.+++..|+ T Consensus 517 ~psf~g~-~i---s~v-~f~q~RL~f~s~~~v~~Srtgd~~nF~~~t~~~~~DdD~I~~~~ss~~~~~i~~~v~~~~~L~ 591 (976) T protein:vir:10 517 NPSFVGK-TI---NQL-VFFRNRLVFLSDENVIMSRPGEFFNFWSKTATTFTPQDVIDLSCSSTYPAIVYDGIQVNAGLL 591 (976) T ss_pred Cceeccc-cc---ceE-EEEcceEEEecCCeEEEEecCCccccccccccCCCCCccEEEEecCCcceeeEEEEecCCcEE Confidence 2222211 11 234 44566665557999999999999997 321 11222 145778889999999 Q ss_pred EEEcCCEEEEEcc----CcccceEEeecccccccccchheeCCccEEEecCCcE----EEEeCC---CceEEEecccCCh Q lcl|NC_019442. 304 VATKGEPYLFSGV----SPSTISGSRIPSMQACLSRRSMVAMEGFVLYAGTNGL----VSVDVN---GNTALATEKIISP 372 (541) Q Consensus 304 V~T~~~py~l~G~----~p~s~~~~~l~~~~pCvs~rsiv~~~~~v~y~s~dGL----v~~~~~---G~~~~vT~~~~~~ 372 (541) ++|++.-|.|+|. +|.+.++.+ ...-.|.+.=..+.+|+.++|+++.|= .....+ .... + .-+|. T Consensus 592 l~T~g~e~~lsg~~~~lTP~t~~i~~-~s~~~~~~~v~Pv~vG~~v~Fv~~~g~~~r~~~~~~~~~~~~~~-~--~dlt~ 667 (976) T protein:vir:10 592 LFTKNQQFMLTTDSDILSPETAKINA-VSSYNFNEKTHPVSLGTTVAFIDNANQFTRFFEMSNVVRQGEPD-V--VDQSK 667 (976) T ss_pred EEecCceEEEecCCceecceeEEEEE-EEeeeccCCCccEEeCCeEEEEecCCCeEEEEEEeecccccccc-h--hHHHH Confidence 9999999999974 344544332 333457766678899999999999982 112111 1111 0 11111 Q ss_pred hHhhhhcC--cceEEEEEEcCeEEEEEecCCCccceEEE-ccCC-c---eeEEEeecccEEEEEecCCEEEEEEC----C Q lcl|NC_019442. 373 EQWQSQFN--PASIVAYSWRGEYIACYTKPDGKQDVFVF-SPVN-M---DIRYLSTPFDCAWVDLAKDMMRVVTG----D 441 (541) Q Consensus 373 ~~W~~~l~--P~ti~a~~~eG~Y~~~y~~~~g~~~~~i~-d~~~-~---~~~~~~~~~d~~~~~~~~d~LY~~~g----~ 441 (541) -- ..+- +-.+.+.+-+..-+.+.+..+|+.-++=| +... + ...++.++-...+....+|.||++.. + T Consensus 668 ~~--~~l~~g~~~~~a~~~~~~~vv~~~~~~g~l~~~ty~~~~~eq~v~aWsr~~~~G~v~sv~~i~D~ly~vV~r~~~g 745 (976) T protein:vir:10 668 VI--SRLLDKNISLVSVSRENSVVFFSQKDTDKIYCFRYFTSGEKRLLQAWTTWTITGNIQYHCMLDDALYVVTRNNNKD 745 (976) T ss_pred Hh--hhhcCCceEEEEEcCCCcEEEEEEcCCCEEEEEEEeecCCceeEEeeEEEecCCcEEEEEEeCCeEEEEEEecCCe Confidence 00 0111 23345666676667777666664222212 1122 2 12333333223333345899999752 2 Q ss_pred EEEEecCCCCceeEEEEcceEEeCcc-------------------cceeEEEEe----eCC--CccEEEEE-EECCceeE Q lcl|NC_019442. 442 KMSVLAGGSLPSTIRWHSKIFSLPER-------------------TSFSCIRVK----SPA--PERVGITI-MADDVPVI 495 (541) Q Consensus 442 ~i~~~~~g~~~~~~~WrSk~f~~~~~-------------------~~~~~~~V~----~~~--~~~~~v~~-~~d~~~~~ 495 (541) .+..+..-..+....+....|..... .++..-.+. ... .....+.+ ..|+.... T Consensus 746 ~~~r~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~lD~~~~~~~~~~t~~~~t~~t~~~~~~~~~~~~~~~~~~~d~~~~~ 825 (976) T protein:vir:10 746 QIVKYSLKLDDAGHFVTDTQGTTSTDDDSIYRVHLDHSSSVTAASNTYNTTTIKTTIPKPNGYESTKQLVAYDTDAGNDL 825 (976) T ss_pred EEEEEEEEECCccceeeeccCccccccCCcceeeeccceEEEeccccccCCceeEEeecCccccCceeEEEEecccCccc Confidence 22222110000001111111111010 000000000 000 00111111 12322211 Q ss_pred e-ecccccCCcceEccCc-ccceE------E--EE-----EE------------ecceEEEEEeecch-hhcCC Q lcl|NC_019442. 496 H-FAPGTFKGSVVRLPAA-TGQNW------Q--VM-----VS------------GFGQVERITLSTSM-SEMPV 541 (541) Q Consensus 496 ~-~~~~~~~~~~~rLP~~-~~~~w------~--ie-----i~------------g~~~V~~i~la~s~-~EL~~ 541 (541) . .+..+..+..++||+. .+..+ + || |+ |..+|+|+.+.--. ...-+ T Consensus 826 ~~~~~~~v~g~~i~l~g~~~~~~v~VGl~Y~s~~~~~~~~i~~~~g~~~~~~~~gRl~i~r~~~~~~~tg~~~v 899 (976) T protein:vir:10 826 GRYALVTVSGSNLEIPGNWSNNSFIIGYLYEMDVQLPTLYVTQQVGDKYRSDAKSSLIVHRIKFSFGPLGVYST 899 (976) T ss_pred ccceeeeecCCeeEecCCCCCCeEEEeeeeEEEEeecceeEEeCCCCcccccceeeEEEEEEEEEeecccceEE Confidence 1 1112223344666643 22222 1 11 11 22345555543211 11112 No 36 >protein:vir:8887 Length: 808 # NCBI annotation: tail tubular protein B # Family: family:all:825 # MgeID: mge:161 # MgeName: gh-1 # Cross-refs: genbank:acc:NP_813776;genbank:gi:29366731;genbank:GeneID:1258831 Probab=93.25 E-value=0.0082 Score=32.02 Aligned_cols=514 Identities=10% Similarity=0.049 Sum_probs=173.7 Q ss_pred CceEEecccccccccccceecccccceEEEEee------ecCCeeeeeecccccCccccccceeEEEE---CCcEEEEeC Q lcl|NC_019442. 1 MPYIDITTMRGMMPRVVTSMLPEHSAVLAEDCH------FRFGVITPERQISGVEKTFTIKPKTIFHY---RDDFWFAWP 71 (541) Q Consensus 1 m~~i~i~~f~G~~Pr~~p~llp~~~a~~a~N~~------~~~G~l~P~~~~~~v~~~~~~~~~Tif~~---~~~~W~~w~ 71 (541) +..++.+...|.-+++.+-...+++.-+-+... ..+|....+.....-.. +..++.=+++ +|-.++. + T Consensus 51 v~~l~~~~~~~~~~~~~~~~~~~~~~y~v~~~~~~i~v~~~~G~~~~v~~~~~y~~--~~~~~~~l~~~tvaD~~fi~-n 127 (808) T protein:vir:88 51 TKRLQNKGFLGTKPLVHLINRDAQEQYFVGFSGTGLAVWDLKGNNYTVRGYNGYAN--CANPRTDLRLITVADYTFVV-N 127 (808) T ss_pred eeeeeccCCCCCCcEEEEEEeCcCceEEEEEeCCeEEEEEcCCceEEEeecCcceE--ecCChhheeEEEEcCEEEEE-c Confidence 333445677788777777655444433332221 01233332222111100 1111110111 1111211 1 Q ss_pred CeEEEeeCCcccCCCCeEEEeCCCCcceeecceeeccccCCccceeeecCCCCCcc--ceEE--ecCCC--------CCC Q lcl|NC_019442. 72 DVVDVIRSPIAQDPHGRIYYTDGRFPKVTDATIATKGDGNHPASSYSLGIPAPTTA--PVCT--VQQGG--------DVS 139 (541) Q Consensus 72 ~~V~vv~spia~D~~~Rvy~t~~~~pk~t~~~ia~~g~g~~p~a~y~LGVp~P~~~--pv~~--v~~~~--------~~~ 139 (541) ..+-+...+-+ +..+.|+.+.--++-...|.| +..|.+-|....+. ...+ ...+. ... T Consensus 128 ~~~~~~~~~~~---------~~~~~~~~~~~~~~~vr~g~y-~~~y~i~i~g~~s~~~~t~t~~~~~~s~~~v~~~~~~~ 197 (808) T protein:vir:88 128 RNTVCQMGSTL---------THAAYPRLDGRAIINVRGGQY-GRTLSITINGDGTGSSPQASIKMPNGSAEKVPAGDPYA 197 (808) T ss_pred CCcceeecccc---------cccCCCCCCccEEEEEccccc-CceEEEEEecCCcceeeeEeEEEccCcccceeecccee Confidence 11111100000 000111111000111111222 12222222111000 0000 00000 000 Q ss_pred ------------CCCCC----cc----cceEE----EEEEEecC-C-------Cc-cCccccccceeecCCCCEEEEccc Q lcl|NC_019442. 140 ------------DDNPN----DD----ETRFY----TETFVSDY-G-------EE-GPPGPASLEVTLRTPGTAVQLTLS 186 (541) Q Consensus 140 ------------~~~~~----~~----~ty~Y----v~T~V~~~-G-------eE-s~Ps~~S~~vtv~~~g~~v~l~~~ 186 (541) ....+ .. ..+.+ -+.++... + -+ +........+.... ++.-+|+ T Consensus 198 ~~~~~~~~~~~~ia~~l~~~~~~~~~~~~~~~~~~~~~~~i~~~a~~~~~~~~t~~g~~~~~~~~~~~~v-~~~~~lp-- 274 (808) T protein:vir:88 198 GMNQVDMTDASWIAAELARQLTVSLGGSGWSFQAGTGWILINAPANDNVRQIATKDGYADTLLSGFIYQV-QTFTKLP-- 274 (808) T ss_pred ecccCCccccccchhhheeeeeecccccceEEEeccceEEEEeccCceeEEEcccCCcCcceeeeeeeec-cceeecc-- Confidence 00000 00 00100 01111110 0 00 00000000000000 0111111 Q ss_pred cCCCCccccceEEEEEeecCCCceeEEEEEeecc-----------------ceEEEEecccccccCccchhhhhhC---- Q lcl|NC_019442. 187 PVPLQNASIKRRRIYRSASGGGEADFLLVAELDA-----------------SVLSYTDKIPGKNLGPSLATWDYLP---- 245 (541) Q Consensus 187 p~~~~~~~i~~~RIYRs~t~~~~~~~~lVael~~-----------------~~~sf~D~~~~~~L~~~L~t~~~~~---- 245 (541) ...+++.. ++|=- .++.....|++..+... ++.-+.- +....-.-.+...+|.. T Consensus 275 ~~~p~g~~---v~i~~-~~~~~~~~~yv~~~~~~~~w~e~~~~~~~~~~~~~tmp~~l-v~~~~~~~~~~~~~w~~r~~G 349 (808) T protein:vir:88 275 ANAPPGYL---VEITG-ESARSGDNYWVQYDASGKVWKETAKPKIIAGFNNATLPHAL-VRAADGQFDWTPLTWDGRNAG 349 (808) T ss_pred ccCCCCcE---EEEEe-cCCCCCceeEEEEEcCCeEEEEeeeccceeeecccceeEEE-EecCCceEEEEeccccccccc Confidence 11111111 11111 11112222332222111 1100000 00000000111223432 Q ss_pred ------CCC-Ccce--EEeccCcEEEEEeCCEEEEecCCCcccC-chh-c--------ccccC----cceEEEEEcCCcE Q lcl|NC_019442. 246 ------PPE-NMTG--LCLMANGIAAGFAGNEVMFSEAYLPYAW-PEV-N--------RHTTA----EDIVAICPLGTSL 302 (541) Q Consensus 246 ------pP~-~~~g--L~~m~NGi~a~f~Gn~l~fSep~~P~aw-p~~-y--------~~t~~----~~Iv~ia~v~~~l 302 (541) .|. ..+. -+.+.+++|.-..++.||+|.+..++.| +.. - .++.. ..|.-+.+++..| T Consensus 350 d~~tnp~psf~g~~~~~v~f~q~RL~f~~~~~v~~Srtgd~~nF~~~t~~~~~DdD~i~~~~~~~~~~~i~~~v~~~~~L 429 (808) T protein:vir:88 350 DDDTNPMPSFVGATINDVFFFRNRLGFLSGENVVMSRTSKYFNFFPSSVATLSDDDPIDVAISHNRISILKYAVPFSEQL 429 (808) T ss_pred ccccCccceecCCceeEEEEEcceEEEeeCCeEEEEeccCcccccCCcccCCCCCccEEEEecCCccceeeEEeecCCcE Confidence 121 1111 2245666666557899999999999997 221 1 11211 2256688999999 Q ss_pred EEEEcCCEEEEEccC---cccceEEeecccccccccchheeCCccEEEecCCcE-------EEEeC-CCceEEEecccCC Q lcl|NC_019442. 303 VVATKGEPYLFSGVS---PSTISGSRIPSMQACLSRRSMVAMEGFVLYAGTNGL-------VSVDV-NGNTALATEKIIS 371 (541) Q Consensus 303 vV~T~~~py~l~G~~---p~s~~~~~l~~~~pCvs~rsiv~~~~~v~y~s~dGL-------v~~~~-~G~~~~vT~~~~~ 371 (541) +++|++.-|.|+|.+ |.+..+. +...-.|.+.=..+.+|+.++|+++.|= +..+- +... . .+=+| T Consensus 430 ~i~T~~~e~~l~~~~~lTP~~~~~~-~~s~~~~~~~~~Pv~vG~~v~f~~~~g~~~~v~r~~~~~~~~d~y-~--~~dlt 505 (808) T protein:vir:88 430 LLWSDQAQFVLSSKTILSSKTIELD-LTTEFDVSDGARPYGIGRGVYFAAPRASFTSLKRYYAIQDVSDVK-S--AEDVS 505 (808) T ss_pred EEEecCcEEEEeCCCcccceeEEEE-EEEEecccCCCCceEeCCeEEEEecCCCeeEEEEEEEeeeccCce-e--hhhHH Confidence 999999999999864 4444433 2334467766778899999999999982 11111 1111 1 11111 Q ss_pred hhHhhhhcCcce---EEEEEEcCeEEEEEecCCCccceEEEcc--CCce---eEEEeec----ccEEEEEecCCEEEEEE Q lcl|NC_019442. 372 PEQWQSQFNPAS---IVAYSWRGEYIACYTKPDGKQDVFVFSP--VNMD---IRYLSTP----FDCAWVDLAKDMMRVVT 439 (541) Q Consensus 372 ~~~W~~~l~P~t---i~a~~~eG~Y~~~y~~~~g~~~~~i~d~--~~~~---~~~~~~~----~d~~~~~~~~d~LY~~~ 439 (541) .-- ..+-|.. +.+.+-++.=+.+.+..+|+--++=|.- +.++ -.+.+++ .-|+.....+|.||++. T Consensus 506 ~~~--~h~~~~~~~~~~~~~~~~~~~v~~~~~~g~l~~~~y~~~~~e~~v~aW~r~~~~g~~~~~~~~~~~~~d~l~~vV 583 (808) T protein:vir:88 506 AHV--PSYITNTVHAIHGSGTENFVSILSDGSPNKVFIYKFLYLDEILQQQSFSHWEFGDAATTRVLAASCIGSYCYLMI 583 (808) T ss_pred HHH--HHhcCCCeEEEEEeCCCCeEEEEEEcCCCEEEEEEEeccCCceeEEeeEEEecCCCeeEEEEEEeccCCEEEEEE Confidence 000 0111222 2333344444445555555432222221 1121 2222322 23667777899999875 Q ss_pred CC----EEEEecCCCCceeEEEEcceEEeCcccce--------------eEEEEeeC--CCccEEEEEEECCceeEeecc Q lcl|NC_019442. 440 GD----KMSVLAGGSLPSTIRWHSKIFSLPERTSF--------------SCIRVKSP--APERVGITIMADDVPVIHFAP 499 (541) Q Consensus 440 g~----~i~~~~~g~~~~~~~WrSk~f~~~~~~~~--------------~~~~V~~~--~~~~~~v~~~~d~~~~~~~~~ 499 (541) .+ .|-.++-........-....|.+...+.+ .+.-.++. ..+...+.+..||.....-.. T Consensus 584 ~r~~~~~ler~~~~~~~~~~~~~~~~~~lD~~~~~~~g~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~dg~~~~~~~~ 663 (808) T protein:vir:88 584 DRPEGLCLERMEFTQHTIDYSIEPYRTYMDMKKTIVLGAYNIDTNLTSFDVRTAYGGTPGPESTFYTIDQQGVLIEHEAR 663 (808) T ss_pred EcCCcEEEEEEeeccCCCCCccccceeeeeeeeeeccccccCccccceeecccccccccccceeEEEEcCCceEEeeecc Confidence 32 23222111100000001111222221111 11101111 111122233334433222233 Q ss_pred cccCCcceEccCcc-cceEEE-------------EEE-------------ecceEEEEEeecc-hhhcCC Q lcl|NC_019442. 500 GTFKGSVVRLPAAT-GQNWQV-------------MVS-------------GFGQVERITLSTS-MSEMPV 541 (541) Q Consensus 500 ~~~~~~~~rLP~~~-~~~w~i-------------ei~-------------g~~~V~~i~la~s-~~EL~~ 541 (541) +.....-+++|+.+ +....| ++. |..+|+++.+.-- ..++-| T Consensus 664 ~~~~~~~~~~~~~~~~~~v~vGl~y~s~~~~~p~~~~~~~g~~~~~~~~~gr~~l~r~~~~~~~tg~~~v 733 (808) T protein:vir:88 664 DWATNPYISFVGNRAGEQMVIGKQYTFQYEFSKFLIKQTADDGSTSTEDIGRLQLRRAWLNYEESGAFEI 733 (808) T ss_pred cccCcceEEeCCCccCceEEEeeeeeEEEEecceEEecCCCCcceeecccceEEEEEEEEEeecccceEE Confidence 33334456666552 222222 221 2344544443321 122222 No 37 >protein:vir:80253 Length: 777 # NCBI annotation: putative tail tubular protein B # Family: family:all:825 # MgeID: mge:1879 # MgeName: LKA1 # Cross-refs: genbank:acc:YP_001522886;genbank:gi:158345179;genbank:GeneID:5687516 Probab=91.04 E-value=0.018 Score=30.18 Aligned_cols=497 Identities=14% Similarity=0.079 Sum_probs=165.3 Q ss_pred CceEEecccccccccccceeccc----ccceEEEEeeecCCeeeeeecccccCccccccceeEEEECCcEEEEeC-CeEE Q lcl|NC_019442. 1 MPYIDITTMRGMMPRVVTSMLPE----HSAVLAEDCHFRFGVITPERQISGVEKTFTIKPKTIFHYRDDFWFAWP-DVVD 75 (541) Q Consensus 1 m~~i~i~~f~G~~Pr~~p~llp~----~~a~~a~N~~~~~G~l~P~~~~~~v~~~~~~~~~Tif~~~~~~W~~w~-~~V~ 75 (541) |-.|.- ...+.-+...+++++. +|+-+. .+.+|.|+=+...-..-. ......++.+=+ .+ T Consensus 48 t~fv~~-l~~~~~~~~~~~~~~~~~~~e~~~~l---~~g~g~irv~~~~~g~~~---------~~~~~~Yl~a~~~~~-- 112 (777) T protein:vir:80 48 VHLIAD-AMAATDANRLAYSLATFSGREVLLLV---DTLDGTLTILDDATGEVL---------FTGTNSYLTAGTGRS-- 112 (777) T ss_pred hHhhhh-hcCCCcccceeEEEEecCCCeeEEEE---EecCCeEEEEECCCCeEE---------EecCCCceeeccccc-- Confidence 222211 0123344555566653 333332 345555554432111100 000111111100 01 Q ss_pred EeeCCcccCCCCeEEEeCCCCcceeecce-------------eeccccCCccceeeecCCCCCccceEEecCCCCCCCCC Q lcl|NC_019442. 76 VIRSPIAQDPHGRIYYTDGRFPKVTDATI-------------ATKGDGNHPASSYSLGIPAPTTAPVCTVQQGGDVSDDN 142 (541) Q Consensus 76 vv~spia~D~~~Rvy~t~~~~pk~t~~~i-------------a~~g~g~~p~a~y~LGVp~P~~~pv~~v~~~~~~~~~~ 142 (541) ++.--.+| .+|+|.-..|+...-+. .....+.| +..|.+-+-.....++.++..++...... T Consensus 113 -l~~~q~aD---~~fi~n~~~~p~~~~~~~~~~~~~~~~~~~~~v~~~~~-g~~y~i~i~~~~~~~~~t~~~~t~~~~~~ 187 (777) T protein:vir:80 113 -IRFAALDD---SVFVANTEVIPQTQLWSGASAYPDPTRAGYLYVVAGAF-SKQYRLSITNQVTGVTTSVDVTTSATEAS 187 (777) T ss_pred -eeEEEEcC---EEEEEeCCccceeeecccCCCccCcccceEEEeeccCC-CceeeEeecCCcCceeEEEecCCcccccc Confidence 11111122 55555543333221111 11112222 33445555444444444333222211110 Q ss_pred CCcccceEEE-------------------EEEEecCCCccCccccccceeecCCC--------CEE-EEcccc-CCCCcc Q lcl|NC_019442. 143 PNDDETRFYT-------------------ETFVSDYGEEGPPGPASLEVTLRTPG--------TAV-QLTLSP-VPLQNA 193 (541) Q Consensus 143 ~~~~~ty~Yv-------------------~T~V~~~GeEs~Ps~~S~~vtv~~~g--------~~v-~l~~~p-~~~~~~ 193 (541) ..+ ..|+ ++++...+-..-=.+....++..... ..+ +...+| ..+++. T Consensus 188 ~~~---~~~ia~~L~~~~~~~~~~~s~~~~~~~~~g~~~~i~~~~~~~~t~~~g~~~~~~~~~~~v~~~~~lp~~~~~~~ 264 (777) T protein:vir:80 188 QAT---GEYVITQLRTAAEADATIGTAAGFAYYQDGAYLYVTAPEAIAVSTDSGSNFLRASNAASIRDAAELPAKLPADA 264 (777) T ss_pred ccc---chhhhhhhhhhhccccceeecCceEEEeCCcEEEEEecCceeEecCCcCccceeeeeEEEeecccccccccccc Confidence 000 0000 11111000000000000011111000 001 011111 111221 Q ss_pred ccceEEEEEeecCCCceeEEEEEeeccce---------------EEEEecccccccCccchhhhhhC-----------CC Q lcl|NC_019442. 194 SIKRRRIYRSASGGGEADFLLVAELDASV---------------LSYTDKIPGKNLGPSLATWDYLP-----------PP 247 (541) Q Consensus 194 ~i~~~RIYRs~t~~~~~~~~lVael~~~~---------------~sf~D~~~~~~L~~~L~t~~~~~-----------pP 247 (541) ..|+..+++.+.+|++.-+-..+. .-+.-...++.. .+...+|.. |+ T Consensus 265 -----~~~~~~~~~~~~~~y~~~~~~~~~w~e~~~~~~~~~~~t~p~~l~~~~~~~--~~~~~~w~~r~~gd~~tn~~Ps 337 (777) T protein:vir:80 265 -----DGFIIATGAAKNKTYFRWVDLERKWDEDASRGAQAELIDMPLRITYSAPNF--SLTALNYERRASGDATSNPALK 337 (777) T ss_pred -----ceEEEeCCCCCCceEEEEEccCcEEEEeecccccccccccceEEEecCCce--EeeccCCccccccccccCCCce Confidence 234445555444444433221110 000000000000 011111222 22 Q ss_pred CCcceE--EeccCcEEEEEeCCEEEEecCCCcccC-chh-cccccC------------cceEEEEEcCCcEEEEEcCCEE Q lcl|NC_019442. 248 ENMTGL--CLMANGIAAGFAGNEVMFSEAYLPYAW-PEV-NRHTTA------------EDIVAICPLGTSLVVATKGEPY 311 (541) Q Consensus 248 ~~~~gL--~~m~NGi~a~f~Gn~l~fSep~~P~aw-p~~-y~~t~~------------~~Iv~ia~v~~~lvV~T~~~py 311 (541) -..+.+ +.|.+++|.-..++.||+|.+..++.| +.. --++=+ ..|.-+.+++..|+++|++.-| T Consensus 338 f~g~~i~~v~f~q~RL~f~~~~~v~~Srtgd~~nF~~~s~~~~~DdDpI~~~~ss~~~~~i~~~v~~~~~L~i~T~~~e~ 417 (777) T protein:vir:80 338 FTEQGISGMTTMQGRLVLLAGEYVCMSASGNPLRWFRASVSTQSDDDPIEVAATAPVASPYEYAVAFNKDLVLFAKTHQG 417 (777) T ss_pred ecCCceeEEEEEcceeeeecCCeEEEEeccCccccccccccCCCCCccEEEEEcCCcceeeeeeeecCCcEEEEecCceE Confidence 122222 255666766557899999999999997 332 112222 2356678889999999999999 Q ss_pred EEEcc---CcccceEEeecccccccccchheeCCccEEEecCC-c----EEEEeC----CCceE-----EEecccCChhH Q lcl|NC_019442. 312 LFSGV---SPSTISGSRIPSMQACLSRRSMVAMEGFVLYAGTN-G----LVSVDV----NGNTA-----LATEKIISPEQ 374 (541) Q Consensus 312 ~l~G~---~p~s~~~~~l~~~~pCvs~rsiv~~~~~v~y~s~d-G----Lv~~~~----~G~~~-----~vT~~~~~~~~ 374 (541) .|+|. +|.+..+.+ ...-.|-+.=..+.+|+.++|+++. | +.-... +++.+ ++-..+| T Consensus 418 ~l~~~~~lTP~~~~~~~-~s~~~~~~~~~Pv~vG~~v~Fv~~r~g~~s~v~e~~~~~~~~d~y~a~Dlt~~~~hl~---- 492 (777) T protein:vir:80 418 LVPGANLLTSRNATAAV-VTEYSFQNSCSPVVAGRTVFFASPRSGPWSAVWEMLPSQYTDAQVEASDSTSHLPKYI---- 492 (777) T ss_pred EEeCCCcccceeEEEEE-EEeeccCCCCCceEeCCeEEEEecCCCceeEEeeeeecccccCceehhHHHHHHHHhc---- Confidence 99985 444544332 3333576666778999999999874 3 211111 11111 1111111 Q ss_pred hhhhcCcceE--EEEEEcCeEEEEEecCCCccceEEEc--cCCc---eeEEEeecccEEEEEecCCEEEEEEC--CE--E Q lcl|NC_019442. 375 WQSQFNPASI--VAYSWRGEYIACYTKPDGKQDVFVFS--PVNM---DIRYLSTPFDCAWVDLAKDMMRVVTG--DK--M 443 (541) Q Consensus 375 W~~~l~P~ti--~a~~~eG~Y~~~y~~~~g~~~~~i~d--~~~~---~~~~~~~~~d~~~~~~~~d~LY~~~g--~~--i 443 (541) |..+ .+++-+--.+.+.+..+|+.-++=|. -+.+ .-.+.+++-...+....+|.||++.. +. | T Consensus 493 ------~~~v~~~a~s~~p~~v~~~~~~dg~l~~~ty~~~~~e~~v~aW~r~~~~g~v~~v~~i~d~l~~iv~r~~~~~l 566 (777) T protein:vir:80 493 ------AGPVRFLATSSTTSIVVVGTSNLRELVVHEYLWQGGEKVHAAWHKWSFPQDITGAYFRGDRLILLFHVAGRVIL 566 (777) T ss_pred ------CCceEEEEEcCCCceEEEEEcCCCeEEEEEEeecCCceEEEeeEEeccCCcEEEEEEECCEEEEEEEcCCeEEE Confidence 2222 22222222333333333322122221 1112 12233333223333344899999753 22 3 Q ss_pred EEecCC----CCcee---EEEEcce-EEeCcccceeEEEEee-CCCccEEEEEEECCcee--------Ee---------e Q lcl|NC_019442. 444 SVLAGG----SLPST---IRWHSKI-FSLPERTSFSCIRVKS-PAPERVGITIMADDVPV--------IH---------F 497 (541) Q Consensus 444 ~~~~~g----~~~~~---~~WrSk~-f~~~~~~~~~~~~V~~-~~~~~~~v~~~~d~~~~--------~~---------~ 497 (541) -+++-. ..... +.+.... ...... ...-.+-. ...+.....+..++... .+ . T Consensus 567 e~~~~~~~~d~~~~~~~~~D~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~v~~~~~~~~~~v 644 (777) T protein:vir:80 567 GELFMQRLGDAQSIPGGFLDLYRVGAANADEE--VAIPAFAADLYPEDSTFAYKLSGEFQSLGQRCGDRRVDGATVYIKV 644 (777) T ss_pred EEEeeccCCCCcccceeeeeeeeeeeeeeCCc--cceeEeeccccCCcceeEEEecCcccccceeeeeEEeCCceeeEEE Confidence 333211 11011 1110000 000000 00000000 00111111111111100 00 0 Q ss_pred cccccC---------CcceE-ccCc-ccceEEEEEEecceEEEEEeec--chhhcCC Q lcl|NC_019442. 498 APGTFK---------GSVVR-LPAA-TGQNWQVMVSGFGQVERITLST--SMSEMPV 541 (541) Q Consensus 498 ~~~~~~---------~~~~r-LP~~-~~~~w~iei~g~~~V~~i~la~--s~~EL~~ 541 (541) ...... ..-+. +|.. +...=.-..+|..+|+++.+.- |. .+-| T Consensus 645 ~~~~~~~~v~VGl~y~s~~~~~~~~~~~~~g~~~~~~r~~i~r~~~~~~~sg-~~~v 700 (777) T protein:vir:80 645 VGAQAGDQYRIGLRYLSKLGPTRPILRDPNGVPITTERTQLHRLTWSLDSTG-EVTF 700 (777) T ss_pred cCCCCCCEEEEeeeeEEEEEeCceEEeCCCCceeeecCeEEEEEEEEeeccc-cEEE Confidence 000000 00011 1111 1000011122444555544332 21 1111 No 38 >protein:vir:99677 Length: 794 # NCBI annotation: Tail tubular protein B # Family: family:all:825 # MgeID: mge:1523 # MgeName: VP4 # Cross-refs: genbank:acc:YP_249591;genbank:gi:68299742;genbank:GeneID:3799992 Probab=90.50 E-value=0.02 Score=29.84 Aligned_cols=495 Identities=11% Similarity=0.083 Sum_probs=179.6 Q ss_pred CceE---EecccccccccccceecccccceEEEEe-------eecCCeeeeeecccccCccccccceeEEEECCcEE--E Q lcl|NC_019442. 1 MPYI---DITTMRGMMPRVVTSMLPEHSAVLAEDC-------HFRFGVITPERQISGVEKTFTIKPKTIFHYRDDFW--F 68 (541) Q Consensus 1 m~~i---~i~~f~G~~Pr~~p~llp~~~a~~a~N~-------~~~~G~l~P~~~~~~v~~~~~~~~~Tif~~~~~~W--~ 68 (541) |-.| +-+...+..+|++|--..+.++-+-+.. ++.+|.+..+..+-... |.+..+.+ + T Consensus 48 ~~fv~~l~~~~~~~~~~~l~~f~~~~~~~y~l~f~~~~irv~~~~~g~~~~v~~~~~~~----------y~~~~~~~~~l 117 (794) T protein:vir:99 48 SVHIKRLTDQFGLGQKPYCHIINRDEVERYAVFFTGSNIRVFDLFTGDEKTVNAPNGLS----------YVSSSNPRKDL 117 (794) T ss_pred cceeeeecCCCCCccccEEEEEEeCCCceEEEEEcCCeEEEEECCCCeEEEeecccccc----------ccccCCcccee Confidence 3333 3344556778888888777776554432 22345555444332110 11111111 1 Q ss_pred EeCCeEEEeeCCcccCCCCeEEEeCCCCcceeecc-------------eeeccccCCccceeeecCCCCCccceEEecCC Q lcl|NC_019442. 69 AWPDVVDVIRSPIAQDPHGRIYYTDGRFPKVTDAT-------------IATKGDGNHPASSYSLGIPAPTTAPVCTVQQG 135 (541) Q Consensus 69 ~w~~~V~vv~spia~D~~~Rvy~t~~~~pk~t~~~-------------ia~~g~g~~p~a~y~LGVp~P~~~pv~~v~~~ 135 (541) .|. .+| | .+|++.-..|+-..-+ +.-.-.+.| +..|.+.+....++... +..+ T Consensus 118 ~~~--------q~a-D---~~fi~n~~~~p~~~~~~~~~~~~~~~~~~~~~v~~g~y-~~~y~v~i~gs~ta~~~-tp~~ 183 (794) T protein:vir:99 118 RMV--------TVA-D---YTFILNRNVATAQGTTNTPSGLAPFGHFGLVVIRGGQY-GRTYRIKVNGSVEASFE-TPLG 183 (794) T ss_pred eEE--------EEc-c---EEEEEcCCeeeeEeeeeccccCcCCCceEEEEeccCCC-CceEEEEecCCccccee-eccC Confidence 111 011 2 4444443332211100 001111233 22233433222111111 0000 Q ss_pred CCCCCCCCCc------c-----cceEEEEEEEecCCCccCcccc---------------ccceeecCCCCEEEEccccCC Q lcl|NC_019442. 136 GDVSDDNPND------D-----ETRFYTETFVSDYGEEGPPGPA---------------SLEVTLRTPGTAVQLTLSPVP 189 (541) Q Consensus 136 ~~~~~~~~~~------~-----~ty~Yv~T~V~~~GeEs~Ps~~---------------S~~vtv~~~g~~v~l~~~p~~ 189 (541) .......... + ..-.|.++-+..+-...+|+-. ...+... -++.- .++... T Consensus 184 ~~~~~~~~~s~~~ia~~l~~~l~~~g~~v~~~~g~~~i~~~~~~~v~t~s~~~g~~~t~~~~~~~~-v~~~~--~Lp~~~ 260 (794) T protein:vir:99 184 DQVAHAKQIDIAYIIDQLAAGLINKGWAVTKGSGYFYFSKSGSVIINSLEVEDGYNGQLAWGIIND-VQKTT--QLPVYA 260 (794) T ss_pred cccccccccchhhhhhhhHhhhhcccceEEeCCeEEEEEecCCceeEEEEeecCCCCceeeEEeee-cccee--ecccCC Confidence 0000000000 0 0000111111110011111100 0001000 01111 122222 Q ss_pred CCccccceEEEEEeecCCCceeEEEEEeeccc-----------------eEEEEecccccccCccchhhhhhCC------ Q lcl|NC_019442. 190 LQNASIKRRRIYRSASGGGEADFLLVAELDAS-----------------VLSYTDKIPGKNLGPSLATWDYLPP------ 246 (541) Q Consensus 190 ~~~~~i~~~RIYRs~t~~~~~~~~lVael~~~-----------------~~sf~D~~~~~~L~~~L~t~~~~~p------ 246 (541) ++|. .++|== .++....+|++..+...+ +.-... +..+.-.-.+...+|... T Consensus 261 ~~G~---~v~v~~-~~~~~~~~y~v~~~~~~~~w~e~~~~~~~~~~~~~t~p~~~-v~~~~~~~~~~~~~w~~r~~Gd~~ 335 (794) T protein:vir:99 261 PNNY---IIRVSG-DPTLNQDDYYVRFDASRNVWTECPAPNIKADYNKATMPHVL-IREADGTFTFKQADWTHRAAGDDE 335 (794) T ss_pred CCCe---EEEEec-cCCCCCCceEEEEEcCCceEEeeccceeecceeccceEEEE-eccCCCceeEeeccccccccCCcc Confidence 2221 111110 011122234433332221 111100 000010111222245331 Q ss_pred ----CC-Cc---ceEEeccCcEEEEEeCCEEEEecCCCcccC-chh-cccccC------------cceEEEEEcCCcEEE Q lcl|NC_019442. 247 ----PE-NM---TGLCLMANGIAAGFAGNEVMFSEAYLPYAW-PEV-NRHTTA------------EDIVAICPLGTSLVV 304 (541) Q Consensus 247 ----P~-~~---~gL~~m~NGi~a~f~Gn~l~fSep~~P~aw-p~~-y~~t~~------------~~Iv~ia~v~~~lvV 304 (541) |. .. .++ .|.++++.-..++.||+|....++.| +.. -.++=+ ..|.-+.+++..|++ T Consensus 336 tnp~psf~g~~is~v-~f~q~RL~f~~~~~v~~Srtgd~~nF~~~t~~~~~DdD~I~~~~~~~~~~~i~~~v~~~~~L~l 414 (794) T protein:vir:99 336 TNPYPSFIGNSINDI-FFFRNRLGFLSGENVILSGSGNYFNFFPESVAVLTDTDPIDVAVSTNRISILKYAVPFSEELIL 414 (794) T ss_pred cCCCccccCcceeEE-EEEeeeEEEecCCeEEEEecCCccccccccccCCCCCccEEEEecCCcceeeEEEeecCCcEEE Confidence 11 11 223 45566665557899999999999997 321 111111 335668888999999 Q ss_pred EEcCCEEEEEccC---cccceEEeecccccccccchheeCCccEEEecCCcE-------EEEe--CCC----ceEEEecc Q lcl|NC_019442. 305 ATKGEPYLFSGVS---PSTISGSRIPSMQACLSRRSMVAMEGFVLYAGTNGL-------VSVD--VNG----NTALATEK 368 (541) Q Consensus 305 ~T~~~py~l~G~~---p~s~~~~~l~~~~pCvs~rsiv~~~~~v~y~s~dGL-------v~~~--~~G----~~~~vT~~ 368 (541) +|++.-|.|+|.+ |.+..+.+ ...-.|.+.=..+.+|+.++|+++.|= +..+ .++ ..+++-.. T Consensus 415 ~t~~~q~~l~~~~~lTP~~~~~~~-~s~~~~~~~~~Pv~vg~~v~f~~~~g~~~~v~r~~~~~~~~d~y~a~Dlt~~~~h 493 (794) T protein:vir:99 415 WSDQAQFVLSSDGGLTPTTIRLDL-TTEFEVTEQARPYGIGRGVYFVSPRAKFSSVRRFYAVQDVTQVKNAEDISAHVPY 493 (794) T ss_pred EecCcEEEEeCCCcccceeEEEEE-EEEeeccCCCCceEeCCeEEEEecCCCeeEEEEeeeeccccCceehhhHHHHHHH Confidence 9999999999864 44444332 333357766778899999999999982 1111 111 01111122 Q ss_pred cCChhHhhhhcCc-ceEEEEEEcCeEEEEEecCCCccceEEEcc--CCc---eeEEEeec--c--cEEEEEecCCEEEEE Q lcl|NC_019442. 369 IISPEQWQSQFNP-ASIVAYSWRGEYIACYTKPDGKQDVFVFSP--VNM---DIRYLSTP--F--DCAWVDLAKDMMRVV 438 (541) Q Consensus 369 ~~~~~~W~~~l~P-~ti~a~~~eG~Y~~~y~~~~g~~~~~i~d~--~~~---~~~~~~~~--~--d~~~~~~~~d~LY~~ 438 (541) +|. ++ -.++|.+-+..=+.+.+..+|+--++=|.- +.+ .-.+++++ + .|+. ..+|.||++ T Consensus 494 l~~--------~~~~~~~a~~~~~~~~v~~~~~~g~l~~~~y~~~~~eq~v~aW~~~~~~g~~~~~~~~--~~~d~l~~~ 563 (794) T protein:vir:99 494 YVE--------NGVFKMSGSSTENFLTILTEGNEQRVYFYKFLYLQEQLVQQSWSHWDFGVNCRVLCCD--MIGAVMHLI 563 (794) T ss_pred hcC--------CCeEEEEEeCCCCcEEEEEEcCCCEEEEEEEeecCCceEEEeEEEEEcCCCeEEEEEE--EcCCEEEEE Confidence 221 01 112344444444444554455433333322 122 12233332 1 2332 357899997 Q ss_pred EC----CEEEEecCCCCceeEEEEcc--eEEeCcccceeE--EEEeeC--------------C-CccEEEEEEECCceeE Q lcl|NC_019442. 439 TG----DKMSVLAGGSLPSTIRWHSK--IFSLPERTSFSC--IRVKSP--------------A-PERVGITIMADDVPVI 495 (541) Q Consensus 439 ~g----~~i~~~~~g~~~~~~~WrSk--~f~~~~~~~~~~--~~V~~~--------------~-~~~~~v~~~~d~~~~~ 495 (541) .. ..|-.++- ...+..|.+. .+.+...+.+.. .....+ . .+--.+.+..||.... T Consensus 564 v~r~~~~~ler~~~--~~~~~~~~~~~~~~~lD~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~l~g~~v~~~~dg~~~~ 641 (794) T protein:vir:99 564 IDSPSGVLMEKIEF--TQNTKDYPDEPYRLYVDRKIEYTFPEGSYNDDDFKTRVKLKDIYGSTPANGQYVFISLGGVTFT 641 (794) T ss_pred EEeCCCEEEEEEEe--eeCCCCCCCcccceeeeeeeeeeecccccccCcceeEEeccccccccccCCceEEEEeCCceee Confidence 53 23433320 0001111111 122333322221 111100 0 0111234445554321 Q ss_pred e-eccc--ccCCcceEccCcc-cc--------eEEE-----EEE-------------ecceEEEEEeecc-hhhcCC Q lcl|NC_019442. 496 H-FAPG--TFKGSVVRLPAAT-GQ--------NWQV-----MVS-------------GFGQVERITLSTS-MSEMPV 541 (541) Q Consensus 496 ~-~~~~--~~~~~~~rLP~~~-~~--------~w~i-----ei~-------------g~~~V~~i~la~s-~~EL~~ 541 (541) . .... ...+...+||+.+ +. ..+| +++ |..+++++.+.-- -..+-| T Consensus 642 ~~~~~~~~~~~~~~~~~~~~~~~~~v~vGl~y~s~~~~~~~~~~~~~~~g~~~~~~~gr~~l~r~~~~~~~tg~~~v 718 (794) T protein:vir:99 642 FDPPAGGWQANDGLIEFDGDLRGTKFFVGEAYTFLYEFSKFLIKTTDTADGVATEDIGRLQLRRAWVNYDKSGNFRV 718 (794) T ss_pred eecccceEecCccEEEecCCCCCcEEEEeeeeeEEEeecceEEeecCCCCceeeeccceEEEEEEEEEeecccceEE Confidence 1 1111 1122334555321 11 1111 221 2335555554310 011111 No 39 >protein:vir:95324 Length: 823 # NCBI annotation: hypothetical protein # Family: family:all:780 # MgeID: mge:1564 # MgeName: phiV10 # Cross-refs: genbank:acc:YP_512269;genbank:gi:89152436;genbank:GeneID:3952993 Probab=90.31 E-value=0.022 Score=29.72 Aligned_cols=497 Identities=15% Similarity=0.154 Sum_probs=172.8 Q ss_pred CceEEecccccccccccceecccccceEEEEeeecCCeeeeeecccccCccccccceeEEEECCcEEEEeC-CeEEEeeC Q lcl|NC_019442. 1 MPYIDITTMRGMMPRVVTSMLPEHSAVLAEDCHFRFGVITPERQISGVEKTFTIKPKTIFHYRDDFWFAWP-DVVDVIRS 79 (541) Q Consensus 1 m~~i~i~~f~G~~Pr~~p~llp~~~a~~a~N~~~~~G~l~P~~~~~~v~~~~~~~~~Tif~~~~~~W~~w~-~~V~vv~s 79 (541) |-.|......+---||+|-.-. +.-+.-+-|-+|.|+=++.-..+. ....+.+... +.|-.=. .+..++- T Consensus 52 t~fva~~~~~~g~~rLipf~~s---~~q~y~Lefg~~~irV~~~~g~vv----~~~~~~~ev~-tPy~~~~l~~Lr~~q- 122 (823) T protein:vir:95 52 TRFVGAAKYPNRKCRLIPFQFS---TVQTYALEFGHQYMRVIKDGALVL----NSSNVIYEIA-TPYTEADLFRIKFTQ- 122 (823) T ss_pred hhhhhhhcCCCCCeeEEEEEeC---CCcEEEEEEcCCeEEEEeCCcEEE----ecCCceeEEe-cccccccccceeEEE- Confidence 3333322222222234443322 333445556666665554322110 0001111110 0000000 0111110 Q ss_pred CcccCCCCeEEEeCCCCcceeec----------ceeeccccCCccce--eeecCCCCCccceEEecCCCCCCCCCCCccc Q lcl|NC_019442. 80 PIAQDPHGRIYYTDGRFPKVTDA----------TIATKGDGNHPASS--YSLGIPAPTTAPVCTVQQGGDVSDDNPNDDE 147 (541) Q Consensus 80 pia~D~~~Rvy~t~~~~pk~t~~----------~ia~~g~g~~p~a~--y~LGVp~P~~~pv~~v~~~~~~~~~~~~~~~ 147 (541) ..| .+|++.-..|+.+.. .+.+. .+.|-..- ..+.+..-......+... ....-.....+ T Consensus 123 --saD---~~fivh~~~~p~~L~r~~~~~w~l~~~~~~-~gp~~~~~~~~t~~v~~~~~~~~~t~ta--~~~~~~~d~vg 194 (823) T protein:vir:95 123 --SAD---VLTLVHPAYPPKELRRYAHDNWQLVDVVTK-NGPFEDINIDESLTVYASASTGTITLTA--SASIFGAEQVG 194 (823) T ss_pred --ecc---EEEEEcCCccceEEEecCCCCceEEEEEEe-ccccccccccceeEEeccccCceeEEee--cccccchhhcc Confidence 112 445544333332211 11111 11111100 011111111111111111 11111111222 Q ss_pred ceEE----EEEEEecCCCccCccccccceeecCC-------CCEEEEccccCCCCcc---------ccceEEEEEeecCC Q lcl|NC_019442. 148 TRFY----TETFVSDYGEEGPPGPASLEVTLRTP-------GTAVQLTLSPVPLQNA---------SIKRRRIYRSASGG 207 (541) Q Consensus 148 ty~Y----v~T~V~~~GeEs~Ps~~S~~vtv~~~-------g~~v~l~~~p~~~~~~---------~i~~~RIYRs~t~~ 207 (541) ...| ..+.+..++.+.... .+.......+ +...++. |....+. .-..-+-||...++ T Consensus 195 ~~~~l~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~g~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 271 (823) T protein:vir:95 195 KLFYLEQPAVDSVPVWETSKSTS-IGDIRRADSNYYRAVTAGKTGTLR--PSHTEGTSWDGWGGSGDDDTGIEWEYLHSG 271 (823) T ss_pred ceEEEeccccceeeecceeeeec-ccceEEecccceeeeeccccceee--cccCCcceEEeceecccccceeEEEEEeCC Confidence 2222 223344333222111 1111111100 0111111 1110000 00111112222122 Q ss_pred -CceeEEEEEee--ccceEEEEecccccccCccchhhhhhCCCCC----cceEEeccCcEEEEEeC----CEEEEecCCC Q lcl|NC_019442. 208 -GEADFLLVAEL--DASVLSYTDKIPGKNLGPSLATWDYLPPPEN----MTGLCLMANGIAAGFAG----NEVMFSEAYL 276 (541) Q Consensus 208 -~~~~~~lVael--~~~~~sf~D~~~~~~L~~~L~t~~~~~pP~~----~~gL~~m~NGi~a~f~G----n~l~fSep~~ 276 (541) +..+...+... +....++.+..... ..-.+..|...|-. .-..+.+.++++.-..+ +.||+|.+.. T Consensus 272 ~g~~~~t~v~~~~~~~~~~~~~~~~~~~---~~~~t~~~~~~~~~~~~g~Ps~v~f~q~RL~f~g~~~~p~~v~~Srtgd 348 (823) T protein:vir:95 272 FGIARITAVNGTTATAEVISYIPSQVVG---EDNASYKWAKYAWNSVNGYPGTVVYYQQRLYFAASTAFPQTIWASRTGD 348 (823) T ss_pred cceEEEEeecceeeeceEeeeecccccc---CCcCCccccccccCcCCCCccEEEEEeceEEEEEcCCCCcEEEEeccCC Confidence 11111112111 12222333322111 12233445443321 11234566666653322 6899999999 Q ss_pred cccCchhccc--------ccC----cceEEEEEcCCcEEEEEcCCEEEEEcc-----CcccceEEeecccccccccchhe Q lcl|NC_019442. 277 PYAWPEVNRH--------TTA----EDIVAICPLGTSLVVATKGEPYLFSGV-----SPSTISGSRIPSMQACLSRRSMV 339 (541) Q Consensus 277 P~awp~~y~~--------t~~----~~Iv~ia~v~~~lvV~T~~~py~l~G~-----~p~s~~~~~l~~~~pCvs~rsiv 339 (541) ++.|-.+--+ ++. ..|.-+.+++ .|+|+|++.-|.++|. +|.+..+.+ ....+| +.=..+ T Consensus 349 ~~nF~~~~~~~DdD~I~~~~s~~~~~~i~~~v~~~-~Lli~t~~~e~~l~~~~~~~lTP~~~~~~~-~s~~g~-~~~~Pv 425 (823) T protein:vir:95 349 YKDFGKSNPTQDDDRIIYTYAGRQVNEIRHLIDVG-SLVALTSGGEYVITGDQNKVLTPSSFAFSS-QGSNGS-SNVPPI 425 (823) T ss_pred ccccccccCCCCCCcEEEEEcCCcceEEEEEeecC-cEEEEecCcEEEEEcCCCcccceeeEEEEE-eecccc-ccccce Confidence 9987432222 121 3367788885 7999999999999874 666765543 456677 445667 Q ss_pred eCCccEEEecCCcEEE-----EeCCC-----ceEEEecccCChhHhhhhcCcceEE--EEEEcCeEEEEEecCCCccceE Q lcl|NC_019442. 340 AMEGFVLYAGTNGLVS-----VDVNG-----NTALATEKIISPEQWQSQFNPASIV--AYSWRGEYIACYTKPDGKQDVF 407 (541) Q Consensus 340 ~~~~~v~y~s~dGLv~-----~~~~G-----~~~~vT~~~~~~~~W~~~l~P~ti~--a~~~eG~Y~~~y~~~~g~~~~~ 407 (541) .+|+.++|+++.|=.+ -+.++ ..+++...||.. ..++ +++-+..-+.++...||+.-++ T Consensus 426 ~vg~~~~Fv~~~g~~vre~~~~~~~d~~~~~dlT~~a~hl~~~---------~~i~~~a~~~~p~~~~~~v~~dG~l~~~ 496 (823) T protein:vir:95 426 AVANIALFVQEKGSVVRDLAYSFDVDGYQGNDLTILANHLFQK---------HSIVDWCFSIVPYSSAFCIRDDGKLLVM 496 (823) T ss_pred EeCCeEEEEecCCCEEEEEEEeeecCceecchhhhhhhhhcCC---------CceEEEEEecCCCeEEEEEecCCcEEEE Confidence 8999999999988211 11111 122222444432 1222 2222323444455556665556 Q ss_pred EEccCCceeEEEeeccc------EEEEEecCCEEEEEECC-----EEE---EecCCCCceeEEEEcceEEeCccccee-- Q lcl|NC_019442. 408 VFSPVNMDIRYLSTPFD------CAWVDLAKDMMRVVTGD-----KMS---VLAGGSLPSTIRWHSKIFSLPERTSFS-- 471 (541) Q Consensus 408 i~d~~~~~~~~~~~~~d------~~~~~~~~d~LY~~~g~-----~i~---~~~~g~~~~~~~WrSk~f~~~~~~~~~-- 471 (541) -|+...+..-|..-.++ |+..+...|.||++..+ .++ .++...... ....|.+...+++. T Consensus 497 ty~~~q~v~aW~~~~~~g~~~~~~~i~~~~~d~l~~~v~R~i~g~~~~yiE~~~~~~~~~----~~~~~~lD~~~s~~g~ 572 (823) T protein:vir:95 497 TYLRDQQVFAWAPQSSTGKYESTCSISEGNEDAVYFVVNRTVNGQTVRYIERLSSRLFTS----DEDAFFVDSGLSYDGR 572 (823) T ss_pred EEecccceeeeEEEecCCcEEEEEEecCCCCCEEEEEEEeccCCeEEEEEEeeccccCCC----ccceeEEEEEEEeecC Confidence 66654333333332232 23334457899997532 211 121111000 12233333322222 Q ss_pred -----EEEEeeCC--Ccc---EEEEEEECCceeEeecccccCCcceEccCcc-----c----ceEEEEEEe--cceEEEE Q lcl|NC_019442. 472 -----CIRVKSPA--PER---VGITIMADDVPVIHFAPGTFKGSVVRLPAAT-----G----QNWQVMVSG--FGQVERI 530 (541) Q Consensus 472 -----~~~V~~~~--~~~---~~v~~~~d~~~~~~~~~~~~~~~~~rLP~~~-----~----~~w~iei~g--~~~V~~i 530 (541) ..-+.... ..+ ..+.+ .||. +. ..... +..+.||--. + .+.+..+.. ...+.++ T Consensus 573 ~~~~~~~~l~~g~~~l~~l~g~~v~~-adg~-~~--~~~~v-~g~i~l~~~~~~~~vGl~~~~~i~~~~~~v~~~~a~~~ 647 (823) T protein:vir:95 573 NTSDRTMTITGGSGEWDYLAEYTISV-SGGA-YF--TSSDV-GAQLQFPYTGADPDTGYEVSKELRCDIISVTSNTAVVV 647 (823) T ss_pred cccceeeEecCCCCcccccCceEEEe-cCcc-eE--CCccc-eeEEEeCcCCCccccccceEEEEEEeeceeeCCceEEE Confidence 11111110 011 11211 3332 21 11111 2234444210 0 111111111 1122222 Q ss_pred Eeecchh-----------hcCC Q lcl|NC_019442. 531 TLSTSMS-----------EMPV 541 (541) Q Consensus 531 ~la~s~~-----------EL~~ 541 (541) +...... -+++ T Consensus 648 ~~~r~v~a~l~~~~t~~~~~~~ 669 (823) T protein:vir:95 648 RANRNVPPSLRNVATTNWQMAR 669 (823) T ss_pred ccCCcccceeeeeecccccccc Confidence 2221111 1111 No 40 >protein:vir:102823 Length: 470 # NCBI annotation: major structural protein # Family: family:all:2450 # MgeID: mge:1610 # MgeName: YS40 # Cross-refs: genbank:acc:YP_874086;genbank:gi:118197693;genbank:GeneID:4496015 Probab=87.67 E-value=0.037 Score=28.42 Aligned_cols=248 Identities=13% Similarity=0.096 Sum_probs=90.1 Q ss_pred CceE------EecccccccccccceecccccceEEEEeeecCCeeeeeecccccCcccccc---ceeEEEECCc--EEEE Q lcl|NC_019442. 1 MPYI------DITTMRGMMPRVVTSMLPEHSAVLAEDCHFRFGVITPERQISGVEKTFTIK---PKTIFHYRDD--FWFA 69 (541) Q Consensus 1 m~~i------~i~~f~G~~Pr~~p~llp~~~a~~a~N~~~~~G~l~P~~~~~~v~~~~~~~---~~Tif~~~~~--~W~~ 69 (541) -..| .+-.-+|+-+ ...+|.+.++.+= .--..|..+=...|..+.--|.-. .+..|.-.++ .-.. T Consensus 167 ~~lId~~~~~NViDarG~~L--s~~~L~~aa~~I~--~~~~fGt~TD~~lp~~vka~f~~~~~~~qRv~~~~N~~~~~~G 242 (470) T protein:vir:10 167 INIIKRGAPQNVLDAGGRPL--SIDLLWEAESRVV--STQAFANPTAVFISYVDKLNLQASFYQISRVMTTADRRAGLLG 242 (470) T ss_pred hhhccCCCCccccccCCCCc--cHHHHHHHHhhhc--ccccccChhhhccchhHHHHHHHhhcCceEEEEecCCCceeee Confidence 1111 2223445433 4445544433210 012345555555554432211111 1222221111 0000 Q ss_pred eCCeEEEeeCCcccCCCCeEEEeCCCCcceeecceeeccccCCccc----ee----eec-CCCCCccceEEecCCCC--- Q lcl|NC_019442. 70 WPDVVDVIRSPIAQDPHGRIYYTDGRFPKVTDATIATKGDGNHPAS----SY----SLG-IPAPTTAPVCTVQQGGD--- 137 (541) Q Consensus 70 w~~~V~vv~spia~D~~~Rvy~t~~~~pk~t~~~ia~~g~g~~p~a----~y----~LG-Vp~P~~~pv~~v~~~~~--- 137 (541) .+ |- + |.+ ..++|.+-++-.+|.- -. +.+ +++|+...+++.+++.. T Consensus 243 ~~---------v~-----~-f~s-------a~G~I~L~~s~~m~~~~k~~p~~l~~~v~~~aAP~~~~tv~~t~~~~a~~ 300 (470) T protein:vir:10 243 AD---------AQ-----S-YIG-------VRGEHSLYPSQFLGDFHKFNPARFGAEVGDFAAPSNSWTVSTTDNFVTLP 300 (470) T ss_pred ee---------cc-----c-eee-------eeeeeeecccccccchhhcCcccCCcccCCcccCceeEEeecCCCceeec Confidence 10 00 0 000 1122333222222210 00 111 34444222222211111 Q ss_pred --CCCCCCCcccceEEEEEEEecCCCccCccccccceeecCCCCEEEEccccCCCCccccceEEEEEeecCCCceeEEEE Q lcl|NC_019442. 138 --VSDDNPNDDETRFYTETFVSDYGEEGPPGPASLEVTLRTPGTAVQLTLSPVPLQNASIKRRRIYRSASGGGEADFLLV 215 (541) Q Consensus 138 --~~~~~~~~~~ty~Yv~T~V~~~GeEs~Ps~~S~~vtv~~~g~~v~l~~~p~~~~~~~i~~~RIYRs~t~~~~~~~~lV 215 (541) ......++.+.+.|.|..++.+| ||+|+-+...++...-+..|.|+.-.++ ++.-+.|||+.. ++.+|+|+ T Consensus 301 ~~sk~g~~~~~~v~sy~y~v~~~~g-ds~s~~v~vt~t~~~v~kgv~ltI~~~~----~v~yv~IYRk~~--~s~~~~li 373 (470) T protein:vir:10 301 YNSGLGDPANTTVYSYAFKAANFYG-ESAAKYIDVYIDSTEAGKGVRFQFHGLV----NVKWLDVYRKDP--GSQEYKFY 373 (470) T ss_pred ccCCCCcccCcceeEEEEEEEEecC-CCCcceEEEEEeeehhcceeEEEEecCC----CCcEEEEEeecC--CCCceeEE Confidence 11112334556789999999887 6654433334444444555665553332 367899999854 44699999 Q ss_pred Eeec-----cceEEEEecccccccCccchhhh-------------------------hhCCCCCcce-EEeccCcEEEEE Q lcl|NC_019442. 216 AELD-----ASVLSYTDKIPGKNLGPSLATWD-------------------------YLPPPENMTG-LCLMANGIAAGF 264 (541) Q Consensus 216 ael~-----~~~~sf~D~~~~~~L~~~L~t~~-------------------------~~~pP~~~~g-L~~m~NGi~a~f 264 (541) +.++ .+...|.|. .+.+|++- |+.|.+-++- |...+--++++. T Consensus 374 ~rv~v~~~ng~~~~~~D~------~e~i~tt~~v~~~~~~Pgt~~Vgemsp~v~sl~~~l~m~l~klp~a~~~~~v~~~v 447 (470) T protein:vir:10 374 KRVKVSTVNGDFTWIDDG------HETVTTPSGVYRWKKIPGTGVVVGIDPNVTTMAVWIGMELYRLPPALTHDYVIWKV 447 (470) T ss_pred EEEeeeeccCCEEEEecc------cccCCCcceeeeecccCcceeccccCcchhhhhhhhhhhhhhcCHHHHHHHHHHHH Confidence 9998 444456665 23344333 2222211110 000000000000 Q ss_pred eCCEEEEecCCCcccCchhcccccCcceEEEEEcCCcEEE Q lcl|NC_019442. 265 AGNEVMFSEAYLPYAWPEVNRHTTAEDIVAICPLGTSLVV 304 (541) Q Consensus 265 ~Gn~l~fSep~~P~awp~~y~~t~~~~Iv~ia~v~~~lvV 304 (541) -.|+.- -|+++ +-|.-+.---+| T Consensus 448 --galal~-------aPKr~--------~~IkNV~~~~~~ 470 (470) T protein:vir:10 448 --ASVFSR-------APEFN--------FLIVNVGQEPIV 470 (470) T ss_pred --HHHHHh-------ccccc--------eEEEEeeeeecC Confidence 000000 12221 111111111111 No 41 >protein:vir:78957 Length: 826 # NCBI annotation: putative tail tubular protein B # Family: family:all:825 # MgeID: mge:1860 # MgeName: LKD16 # Cross-refs: genbank:acc:YP_001522826;genbank:gi:158345061;genbank:GeneID:5687447 Probab=87.29 E-value=0.04 Score=28.27 Aligned_cols=485 Identities=12% Similarity=0.054 Sum_probs=161.4 Q ss_pred Cc-------------eEEecccccccccccceecccccceEEEEeeecCCeeeeeecccccCcc-ccccceeEE--EECC Q lcl|NC_019442. 1 MP-------------YIDITTMRGMMPRVVTSMLPEHSAVLAEDCHFRFGVITPERQISGVEKT-FTIKPKTIF--HYRD 64 (541) Q Consensus 1 m~-------------~i~i~~f~G~~Pr~~p~llp~~~a~~a~N~~~~~G~l~P~~~~~~v~~~-~~~~~~Tif--~~~~ 64 (541) .. .|.|.+ +...-. -+...-+.......+.-... ....+.+. .......++ ++.. T Consensus 149 ~~~~~v~~g~y~~~y~v~i~~---~~~~~~-----~~~s~t~~y~t~~~~~~~~~-~~~~~~~~~~~~~a~~l~~~~~~~ 219 (826) T protein:vir:78 149 TGWLYIKAGQYSKAFSLTIKV---KDNATG-----TTYSHTATYVTPDNASTNPN-LAEAPFQTSVGYIAWQLFGKFFGA 219 (826) T ss_pred eEEEEecccccCceeEEEecc---ceeecc-----cccceeEEEEeccCCccccc-cccccceecchhhheecceeeccc Confidence 11 223321 100000 00000011111111100000 00000000 000111122 1222 Q ss_pred cEEEEeCCeEEE---eeCCccc----------CCCCeEEEeCCCCcceeecceeeccccCCccceeeecCCC-------- Q lcl|NC_019442. 65 DFWFAWPDVVDV---IRSPIAQ----------DPHGRIYYTDGRFPKVTDATIATKGDGNHPASSYSLGIPA-------- 123 (541) Q Consensus 65 ~~W~~w~~~V~v---v~spia~----------D~~~Rvy~t~~~~pk~t~~~ia~~g~g~~p~a~y~LGVp~-------- 123 (541) ..|-.+...-.+ .+...+. .+-..+++.+++.-.+ . +.+.++.++-..+..-.|.. T Consensus 220 ~~~~~~~~t~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~--~-~~~~~g~~~~~~~~~~~v~~~~~l~a~~ 296 (826) T protein:vir:78 220 PEYTLPNSTKKYPKVDPDPAAATVAGYLNQRGVQDGYIAFRGDGDIVV--E-VSTDMGNNYGIASGGMSLNATADLPALL 296 (826) T ss_pred cceeeeccceeEeeccccccceeeccceeecccccceEEEecCCCeEE--E-eccCCCccceEEEeeEEEecccceeeee Confidence 233333221111 0011110 0111223322211111 0 11111111111111111111 Q ss_pred CCccceE---EecCCCCCCCCCCCcccceEEEEEEEecCC-CccCccccccceeecCCCCEEEEccccCCCCccccceEE Q lcl|NC_019442. 124 PTTAPVC---TVQQGGDVSDDNPNDDETRFYTETFVSDYG-EEGPPGPASLEVTLRTPGTAVQLTLSPVPLQNASIKRRR 199 (541) Q Consensus 124 P~~~pv~---~v~~~~~~~~~~~~~~~ty~Yv~T~V~~~G-eEs~Ps~~S~~vtv~~~g~~v~l~~~p~~~~~~~i~~~R 199 (541) |+.+... .+..+...+. + .....|.++|+...| .|+.. .+|..+. ...+.++- T Consensus 297 p~~~~~~~~~~~~~~~~~~~-g---~~~~~~y~~~~~~~~~w~e~a----------~~g~~~~---------~~tmp~~l 353 (826) T protein:vir:78 297 PGAGTPGTGVQFMDGAIMAT-G---STKAPVYFAWDAANRRWAERA----------AYGTDWV---------LKKMPLAL 353 (826) T ss_pred cccccceEEEEEEeeeEecC-C---CcccceeEEEEcCCceEEEee----------ccCcccc---------cccccEEE Confidence 2111100 0000000000 0 011234456665443 11111 1121111 11222333 Q ss_pred EEEeecCCCceeEEEEEeeccceEEEEecccccccCccchhhhhhCCCCCcceEEeccCcEEEEEeCCEEEEecCCCccc Q lcl|NC_019442. 200 IYRSASGGGEADFLLVAELDASVLSYTDKIPGKNLGPSLATWDYLPPPENMTGLCLMANGIAAGFAGNEVMFSEAYLPYA 279 (541) Q Consensus 200 IYRs~t~~~~~~~~lVael~~~~~sf~D~~~~~~L~~~L~t~~~~~pP~~~~gL~~m~NGi~a~f~Gn~l~fSep~~P~a 279 (541) .+|... +.|.+-. ..+.+....++..-..|++- ...| +.+ .+.+++|.-..++.||+|.+..++. T Consensus 354 ~~~~~~----~~f~~~~------~~w~~r~~gd~~tnp~psf~-g~~i---~~v-~f~q~RL~f~~~~~v~~Srtgd~~n 418 (826) T protein:vir:78 354 RWDEST----DTYSLNE------LEYDRRGSGDEETNPTFNFV-KRGI---TGM-TTFQGRLVLLSQEYVCMSASNNPHR 418 (826) T ss_pred EEecCC----CeEEEee------ccccccccCcccccCccccc-CCCc---eEE-EEEeceEEEeeCCeEEEEeccCccc Confidence 444211 1232211 12333222222211222211 0112 223 5556666555699999999999999 Q ss_pred Cchhccccc--C------------cceEEEEEcCCcEEEEEcCCEEEEEcc---CcccceEEeecccccccccchheeCC Q lcl|NC_019442. 280 WPEVNRHTT--A------------EDIVAICPLGTSLVVATKGEPYLFSGV---SPSTISGSRIPSMQACLSRRSMVAME 342 (541) Q Consensus 280 wp~~y~~t~--~------------~~Iv~ia~v~~~lvV~T~~~py~l~G~---~p~s~~~~~l~~~~pCvs~rsiv~~~ 342 (541) |-.+--.+. + ..|.-+.+++..|+++|++.-|.|+|. +|.+..+.+ ...-.|-+.=..+.+| T Consensus 419 F~~~t~~~~~DdD~I~~~~~s~~~~~i~~~v~~~~~L~l~T~~~e~~l~~~~~lTP~~~~~~~-~s~~~~~~~~~Pv~vG 497 (826) T protein:vir:78 419 WFKKSAAALNDDDPIEIAAQGSLTEPYEHAVTFNKDLIVFAKKYQAVVPGGGIVTPRTAVISI-TTQYDVDTRAAPAVTG 497 (826) T ss_pred cccccccCCCCCCcEEEEEccCcceeEEEEEecCCcEEEEecCcEEEEeCCCcccceeEEEEE-EEeecccCCCCceEeC Confidence 732221111 1 346678888999999999999999985 455544332 3344676666778999 Q ss_pred ccEEEecCCc-----EE--EEeCCCceEEEecccCChhHhhhhcCcceE--EEEEEcCeEEEEEecCCCccceEEEccCC Q lcl|NC_019442. 343 GFVLYAGTNG-----LV--SVDVNGNTALATEKIISPEQWQSQFNPASI--VAYSWRGEYIACYTKPDGKQDVFVFSPVN 413 (541) Q Consensus 343 ~~v~y~s~dG-----Lv--~~~~~G~~~~vT~~~~~~~~W~~~l~P~ti--~a~~~eG~Y~~~y~~~~g~~~~~i~d~~~ 413 (541) +.++|+++.| +. ..+.+..-+-.. +-+|.-- ..+-|..+ +|++-+-.++.+.+..+|+.-++=|.-.+ T Consensus 498 ~~v~F~~~r~~~~s~v~e~~~~~~~~~~y~~-~dlt~~~--~~l~~~~v~~~a~s~~~~~~v~~~~~~g~l~~~ty~~~~ 574 (826) T protein:vir:78 498 RSVYFAAERALGFMGLHEMAPSPSTDSHYVA-EDVTSHI--PSYMPGPAEYIQAAASSGYLVFGTSAADEMICHQYLWQG 574 (826) T ss_pred CeEEEEecCCCceeEEEEEEeeecccCccch-HHHHHHH--HHhcCCCeEEEEEeCCCCeEEEEEcCCCeEEEEEEEecC Confidence 9999999876 21 122222111111 0111100 01223333 34555666667776666654344332122 Q ss_pred --ce---eEEEeecccEEEEEecCCEEEEEEC----CEEEEec--CCCCceeEEEEcceEE------eCccc---ceeEE Q lcl|NC_019442. 414 --MD---IRYLSTPFDCAWVDLAKDMMRVVTG----DKMSVLA--GGSLPSTIRWHSKIFS------LPERT---SFSCI 473 (541) Q Consensus 414 --~~---~~~~~~~~d~~~~~~~~d~LY~~~g----~~i~~~~--~g~~~~~~~WrSk~f~------~~~~~---~~~~~ 473 (541) ++ -.+++++-...+....+|.||++.. ..+..+. ..............|. ..... .+.+- T Consensus 575 ~e~~v~aW~~~~~~g~v~~v~~i~d~l~~vv~r~~~~~~~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 654 (826) T protein:vir:78 575 NEKVQNAYHRWTLRHQIIGAYFTGDNLMVLIQKGQEIALGRMHLNSLPAREGLQYPKYDYWRRIEATVDGELELTKQHWD 654 (826) T ss_pred CcEEEEeEEEEccCCcEEEEEEECCeEEEEEEeCCCEEEEEEEEEecCCCccccccccceeEEEEEEEcceeccccceeE Confidence 11 2223332222333344899999752 2233321 1111111111111111 11100 00000 Q ss_pred EEeeCC-CccE--EEEEEECCceeEeecccccCCcceEccCcc-cceE--------E--------------EEEEecceE Q lcl|NC_019442. 474 RVKSPA-PERV--GITIMADDVPVIHFAPGTFKGSVVRLPAAT-GQNW--------Q--------------VMVSGFGQV 527 (541) Q Consensus 474 ~V~~~~-~~~~--~v~~~~d~~~~~~~~~~~~~~~~~rLP~~~-~~~w--------~--------------iei~g~~~V 527 (541) -+.... ...+ .+..+.++..+ .........-.+++|+.. +... + -...|..++ T Consensus 655 ~~~~~~~~~~~~g~~~~~~~~~~~-~~~~~~~~~~~l~~~~~~~~~~v~VGl~y~s~~~~~~~~~~~~~g~~~~~~r~~l 733 (826) T protein:vir:78 655 LIKDGAAVYQLQPQVGAYMERYQL-GVKRETSTKVFLDVPEAVVGSVYVVGCEFWSKVEFTPPVLRDHNGLPMTSTRAVL 733 (826) T ss_pred EecCCceeeeeccceeeeccccce-eccccCCCceEEEeCCCccccEEEEeeceeEEEEeCceEEecCCCcceeecceEE Confidence 000000 0000 00001111111 000001011123344221 1100 0 113455666 Q ss_pred EEEEee--cchhhcCC Q lcl|NC_019442. 528 ERITLS--TSMSEMPV 541 (541) Q Consensus 528 ~~i~la--~s~~EL~~ 541 (541) +|+.+. .| ..+.+ T Consensus 734 ~r~~~~~~~t-g~~~v 748 (826) T protein:vir:78 734 HRYNVNFGWT-GEFLW 748 (826) T ss_pred EEEEEEeecc-ccEEE Confidence 666432 22 23333 No 42 >protein:vir:2203 Length: 794 # NCBI annotation: tail tubular protein B # Family: family:all:825 # MgeID: mge:49 # MgeName: T7 # Cross-refs: genbank:acc:NP_042000;swissprot:sw:p03747;genbank:gi:9627472;uniprot:P03747;genbank:GeneID:1261024 Probab=70.72 E-value=0.21 Score=24.35 Aligned_cols=492 Identities=11% Similarity=0.083 Sum_probs=168.2 Q ss_pred CceEEecccccccccccceecccccceEEEEe------eecCCeeeeeecccccCccccccceeEEEECCcEEEEeCCeE Q lcl|NC_019442. 1 MPYIDITTMRGMMPRVVTSMLPEHSAVLAEDC------HFRFGVITPERQISGVEKTFTIKPKTIFHYRDDFWFAWPDVV 74 (541) Q Consensus 1 m~~i~i~~f~G~~Pr~~p~llp~~~a~~a~N~------~~~~G~l~P~~~~~~v~~~~~~~~~Tif~~~~~~W~~w~~~V 74 (541) |..++-+...|..+++.+--..+.++-+-+.+ -..+|....+..+-...-..+..++.=. .|.+-.| T Consensus 51 v~~l~~~~~~~~~~~l~~~~~~~~~~y~l~~~~~~irv~~~~G~~~~v~~~~~~~y~~~~~~~~~l-----~~~q~aD-- 123 (794) T protein:vir:22 51 LNTLGDNGALGQAPYIHLINRDEHEQYYAVFTGSGIRVFDLSGNEKQVRYPNGSNYIKTANPRNDL-----RMVTVAD-- 123 (794) T ss_pred hhhhcccCCCCCccEEEEEEeCCCcEEEEEEcCCeEEEEecCCcEEEeecCCCccceecCCCcccE-----EEEEEcC-- Confidence 44445556677888888866666655443221 1123333333222111110011100000 1222222 Q ss_pred EEeeCCcccCCCCeEEEeCCCCcceeecc-------------eeeccccCCccceeeecCCCCCccceEEecCCCCCCC- Q lcl|NC_019442. 75 DVIRSPIAQDPHGRIYYTDGRFPKVTDAT-------------IATKGDGNHPASSYSLGIPAPTTAPVCTVQQGGDVSD- 140 (541) Q Consensus 75 ~vv~spia~D~~~Rvy~t~~~~pk~t~~~-------------ia~~g~g~~p~a~y~LGVp~P~~~pv~~v~~~~~~~~- 140 (541) .+|++.-..|+..... +.....|.| +..|.+.+-..+. .+..+..++.... T Consensus 124 -------------~~fi~~~~~~p~~~~~~~~~~~~~~~~~g~v~v~~g~y-~~ty~v~I~~~~~-a~~~~p~gt~~~~~ 188 (794) T protein:vir:22 124 -------------YTFIVNRNVVAQKNTKSVNLPNYNPNQDGLINVRGGQY-GRELIVHINGKDV-AKYKIPDGSQPEHV 188 (794) T ss_pred -------------EEEEEcCCeeeeEeeccccCCCCCCCceEEEEccCCcc-ceeEEEEeccCcc-eEEEEcCCCccccc Confidence 2233322222111000 011112222 2233333322211 1111111111000 Q ss_pred ---------------------CCCCcc-cceEEEEEEEecC------CCccCccccccceeecCCCCEEEEccccCCCCc Q lcl|NC_019442. 141 ---------------------DNPNDD-ETRFYTETFVSDY------GEEGPPGPASLEVTLRTPGTAVQLTLSPVPLQN 192 (541) Q Consensus 141 ---------------------~~~~~~-~ty~Yv~T~V~~~------GeEs~Ps~~S~~vtv~~~g~~v~l~~~p~~~~~ 192 (541) +..... +.+.|+. ..+.. .+.+.-.-....+. ..-++.- .+++..++| T Consensus 189 ~~~~~~~ia~~L~~~l~~~~~~~t~~~~~~~~~i~-a~~~~~~~~~t~~~g~~~t~~~~~~-~~~~~~~--~lp~~~~~G 264 (794) T protein:vir:22 189 NNTDAQWLAEELAKQMRTNLSDWTVNVGQGFIHVT-APSGQQIDSFTTKDGYADQLINPVT-HYAQSFS--KLPPNAPNG 264 (794) T ss_pred eeechhhhhhhhhhhheeccccceEEeCCceEEEE-EcCCceEEEEeeecccCcceeEEEE-eccccce--eccccCCCC Confidence 000000 1112211 11110 00000000000000 0001111 122222222 Q ss_pred cccceEEEEEeecCC-CceeEEEEEeeccce----------EEEEeccccccc------CccchhhhhhCC--------- Q lcl|NC_019442. 193 ASIKRRRIYRSASGG-GEADFLLVAELDASV----------LSYTDKIPGKNL------GPSLATWDYLPP--------- 246 (541) Q Consensus 193 ~~i~~~RIYRs~t~~-~~~~~~lVael~~~~----------~sf~D~~~~~~L------~~~L~t~~~~~p--------- 246 (541) . .++|= .+++ ...+|++-.+-..+. ..+........| .-.++..+|.+. T Consensus 265 ~---~v~i~--~~~~~~~~~Y~v~~~~~~~~w~e~~~~~~~~~~~~~t~p~~lv~~~~~~~~~~~~~w~~r~~Gd~~tnp 339 (794) T protein:vir:22 265 Y---MVKIV--GDASKSADQYYVRYDAERKVWTETLGWNTEDQVLWETMPHALVRAADGNFDFKWLEWSPKSCGDVDTNP 339 (794) T ss_pred e---EEEEE--eCCCCCcceeEEEEeccceEEEEeeeccceeeecccceeeEeeeccCCcEEEeeccccccccCccccCC Confidence 1 11111 1111 222233221110000 001000000000 001122234431 Q ss_pred -CC-Ccc--eEEeccCcEEEEEeCCEEEEecCCCcccC-chh-cccccCc------------ceEEEEEcCCcEEEEEcC Q lcl|NC_019442. 247 -PE-NMT--GLCLMANGIAAGFAGNEVMFSEAYLPYAW-PEV-NRHTTAE------------DIVAICPLGTSLVVATKG 308 (541) Q Consensus 247 -P~-~~~--gL~~m~NGi~a~f~Gn~l~fSep~~P~aw-p~~-y~~t~~~------------~Iv~ia~v~~~lvV~T~~ 308 (541) |. ..+ .-+.+.++++.-..++.||+|....++.| |.. --++=++ .|.-+.+++..|+++|++ T Consensus 340 ~psf~g~~i~~v~f~q~RL~f~~~~~v~~Srtgd~~nF~~~t~~~~~DdD~i~~~~ss~~~~~i~~~v~~~~~L~i~t~~ 419 (794) T protein:vir:22 340 WPSFVGSSINDVFFFRNRLGFLSGENIILSRTAKYFNFYPASIANLSDDDPIDVAVSTNRIAILKYAVPFSEELLIWSDE 419 (794) T ss_pred cceecCCCcceEEEEcceEEEecCCeEEEEccCCccccccccCcCCCCCccEEEEecCCcceeeEEEeecCCcEEEEecC Confidence 10 011 12345566666567999999999999997 332 1122222 255578888999999999 Q ss_pred CEEEEEccC---cccceEEeecccccccccchheeCCccEEEecCCcE-------EEEe--CCC----ceEEEecccCCh Q lcl|NC_019442. 309 EPYLFSGVS---PSTISGSRIPSMQACLSRRSMVAMEGFVLYAGTNGL-------VSVD--VNG----NTALATEKIISP 372 (541) Q Consensus 309 ~py~l~G~~---p~s~~~~~l~~~~pCvs~rsiv~~~~~v~y~s~dGL-------v~~~--~~G----~~~~vT~~~~~~ 372 (541) .-|.|+|.+ |.+..+. +...-.|.+.=..+.+|+.++|+++.|= +..+ .++ ..+++-..+| T Consensus 420 ~e~~l~~~~~lTP~~~~~~-~~s~~~~~~~~~Pv~vg~~v~f~~~~g~~~~~~r~~~~~~~~d~y~~~Dlt~~~~~~~-- 496 (794) T protein:vir:22 420 AQFVLTASGTLTSKSVELN-LTTQFDVQDRARPFGIGRNVYFASPRSSFTSIHRYYAVQDVSSVKNAEDITSHVPNYI-- 496 (794) T ss_pred cEEEEeCCCcccceeEEEE-EEEEeeccCCCCceEeCCeEEEEecCCCeeEEEEeEeeecccCceehhhHHHHHHHhc-- Confidence 999998864 4444433 2333457666678899999999999882 1111 111 0111111111 Q ss_pred hHhhhhcCcce---EEEEEEcCeEEEEEecCCCccceEEEc---cCC-ce---eEEEeecc----cEEEEEecCCEEEEE Q lcl|NC_019442. 373 EQWQSQFNPAS---IVAYSWRGEYIACYTKPDGKQDVFVFS---PVN-MD---IRYLSTPF----DCAWVDLAKDMMRVV 438 (541) Q Consensus 373 ~~W~~~l~P~t---i~a~~~eG~Y~~~y~~~~g~~~~~i~d---~~~-~~---~~~~~~~~----d~~~~~~~~d~LY~~ 438 (541) |.. +.+..-++.=+.+.+..+++ .+++. .+. ++ ..+++++- .|+ ....|.||++ T Consensus 497 --------~~~~~~~~~~~~~~~~v~~~~~~~~~--l~~~~y~~~~~e~~v~aW~~~~~~g~~~~~~~--~~~~d~l~~i 564 (794) T protein:vir:22 497 --------PNGVFSICGSGTENFCSVLSHGDPSK--IFMYKFLYLNEELRQQSWSHWDFGENVQVLAC--QSISSDMYVI 564 (794) T ss_pred --------CCceEEEEEeCCCCcEEEEEEcCCCE--EEEEEEeecCCceeEEeeEEEEcCCCEEEEEE--EecCCEEEEE Confidence 111 11222222222222332332 22222 122 11 22223221 133 2457899997 Q ss_pred EC----CEEEEe----cCC----CCcee-EEEEcceEEeCccc-----ceeEEEEe---eC-CCccEEEEEEECCceeEe Q lcl|NC_019442. 439 TG----DKMSVL----AGG----SLPST-IRWHSKIFSLPERT-----SFSCIRVK---SP-APERVGITIMADDVPVIH 496 (541) Q Consensus 439 ~g----~~i~~~----~~g----~~~~~-~~WrSk~f~~~~~~-----~~~~~~V~---~~-~~~~~~v~~~~d~~~~~~ 496 (541) .. ..+-.+ +.. +.... +-++ ..|..+..+ +.+-+... +. ..+--.+.+..||..... T Consensus 565 v~r~~~~~~~r~~~~~~~~~~~~~~~~~~lD~~-~~~~~~~g~~~~~~~~t~~~~~~~~g~~~~~g~~v~~~~dg~~~~~ 643 (794) T protein:vir:22 565 LRNEFNTFLARISFTKNAIDLQGEPYRAFMDMK-IRYTIPSGTYNDDTFTTSIHIPTIYGANFGRGKITVLEPDGKITVF 643 (794) T ss_pred EEeCCCEEEEEEEEeeccccCCCccceeeeeee-EEEeeccceeecCCcceEEEcccccCcccccceEEEEEcCCceeec Confidence 63 222222 221 11110 1111 112221111 11111111 00 111223444455543211 Q ss_pred ec--ccccCCcceEccCcc-cceE------E--E-----EE-------------EecceEEEEEeecchhhcCC Q lcl|NC_019442. 497 FA--PGTFKGSVVRLPAAT-GQNW------Q--V-----MV-------------SGFGQVERITLSTSMSEMPV 541 (541) Q Consensus 497 ~~--~~~~~~~~~rLP~~~-~~~w------~--i-----ei-------------~g~~~V~~i~la~s~~EL~~ 541 (541) .. .+...+.-++||+.+ +... + | ++ .|..+++|+.+. +.+.-. T Consensus 644 ~~~~~~~~~~~~~~v~~~~~~~~v~VGl~y~s~~~~~~~~~~~~~~~~~~~~~~~grl~l~r~~~~--~~~tg~ 715 (794) T protein:vir:22 644 EQPTAGWNSDPWLRLSGNLEGRMVYIGFNINFVYEFSKFLIKQTADDGSTSTEDIGRLQLRRAWVN--YENSGT 715 (794) T ss_pred eeeeeeeeccceEEeCCCCCCcEEEEeeeeeEEEEecceEEEecCCCccceeeecceEEEEEEEEE--eccccc Confidence 11 111111233555331 1111 1 1 11 123344454443 222221 No 43 >protein:vir:94583 Length: 792 # NCBI annotation: Tubular tail protein B # Family: family:all:825 # MgeID: mge:1516 # MgeName: Berlin # Cross-refs: genbank:acc:YP_919014;genbank:gi:119637778;genbank:GeneID:5179343 Probab=70.62 E-value=0.21 Score=24.34 Aligned_cols=501 Identities=12% Similarity=0.051 Sum_probs=162.4 Q ss_pred CceEEecccccccccccceecccccceEEEEee-----e-cCCeeeeeecccccCccccccceeEEEECCcEEEEeCCeE Q lcl|NC_019442. 1 MPYIDITTMRGMMPRVVTSMLPEHSAVLAEDCH-----F-RFGVITPERQISGVEKTFTIKPKTIFHYRDDFWFAWPDVV 74 (541) Q Consensus 1 m~~i~i~~f~G~~Pr~~p~llp~~~a~~a~N~~-----~-~~G~l~P~~~~~~v~~~~~~~~~Tif~~~~~~W~~w~~~V 74 (541) +..++-....+.-+|++|=-..++++-+-+..+ + .+|..........-. .+..++.=. .|.+-.| T Consensus 51 v~~l~~~~~~~~~~~l~~~~~~~~q~y~l~f~~~~~rv~~~~g~~~~~~~~~~y~--~~~~~~~~l-----~~~q~aD-- 121 (792) T protein:vir:94 51 TKTIGDQNALGAKPLVHLINRDSAEQYYVVFTGQGVRVFDLNGKEYDVKGDLSYV--KVENPRDDL-----RMVTVAD-- 121 (792) T ss_pred HHhhhcCCCCCcccEEEEEEeCCCceEEEEEcCCeEEEEecCCceEEecccCcee--eecCCccee-----EEEEEcC-- Confidence 222222233344445554333344433322221 1 122211111110000 000111101 1111112 Q ss_pred EEeeCCcccCCCCeEEEeCCCCccee------------ecceeeccccCCccceeeec---------CCCCCccceEEec Q lcl|NC_019442. 75 DVIRSPIAQDPHGRIYYTDGRFPKVT------------DATIATKGDGNHPASSYSLG---------IPAPTTAPVCTVQ 133 (541) Q Consensus 75 ~vv~spia~D~~~Rvy~t~~~~pk~t------------~~~ia~~g~g~~p~a~y~LG---------Vp~P~~~pv~~v~ 133 (541) .+|++.-..|+-. .+-+.. ..|.| +-.|++- +|.++++..+... T Consensus 122 -------------~~fi~n~~~~~~~~~~~~~~~~~~~~~~v~i-~~g~y-~~~y~i~i~~~~~~~~~~~~t~~~~~~~~ 186 (792) T protein:vir:94 122 -------------YTFIVNRNMVVRPDTTPLYTLKENGDCLINI-RGGMY-GRTLAFTINNTKIAYEIAHGDAPEHSKQT 186 (792) T ss_pred -------------EEEEEeCCccceeEecCcCCCCCCceEEEEc-cCCCc-ceeEEEEecCceeeeeeecCcccceeccc Confidence 2333332211100 000111 11222 1112222 2333222111100 Q ss_pred CCCC----------CCCCCCC----cccceEEEEE----EEecCCCccCccccccceeecCCCCEEEEccccCCCCcccc Q lcl|NC_019442. 134 QGGD----------VSDDNPN----DDETRFYTET----FVSDYGEEGPPGPASLEVTLRTPGTAVQLTLSPVPLQNASI 195 (541) Q Consensus 134 ~~~~----------~~~~~~~----~~~ty~Yv~T----~V~~~GeEs~Ps~~S~~vtv~~~g~~v~l~~~p~~~~~~~i 195 (541) .+.- ......+ ..+.+.|++. -++.. ++.++...... ... ...++.. ..+|....+- T Consensus 187 ~~~~i~~~l~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~--~~~~g~~~~~~-~~~-~~~v~~~-~~lp~~~~~G 261 (792) T protein:vir:94 187 DAQWLVKKLAGLARLNVAFKGWTFTEGPGYIHVIAPSNSQINSL--STEDGYADQLM-NAV-MHTSQSF-SRLPVEAPNG 261 (792) T ss_pred chhhhhhhhhhhccccccccccEEEECCeEEEEEecCCceeeee--ecccCcCccee-eee-eeccccc-ccccccCCCC Confidence 0000 0000000 0112222210 00001 11111100000 000 0011100 0111111111 Q ss_pred ceEEEEEeecCCCceeEEEEEeeccce----------EEEEeccccccc-----Cc-cchhhhhhCC----------CC- Q lcl|NC_019442. 196 KRRRIYRSASGGGEADFLLVAELDASV----------LSYTDKIPGKNL-----GP-SLATWDYLPP----------PE- 248 (541) Q Consensus 196 ~~~RIYRs~t~~~~~~~~lVael~~~~----------~sf~D~~~~~~L-----~~-~L~t~~~~~p----------P~- 248 (541) ..++|-- .++.+..+|++..+-..+. ..+..+.-...| +. .+...+|... |. T Consensus 262 ~~v~i~~-~~~~~~d~y~v~~~~~~~~w~E~~~~~~~~~~~~~tmp~~lv~~~~~~~~~~~~~w~~r~~gd~~tnp~psf 340 (792) T protein:vir:94 262 YTVKIVG-DTSKTSDMFYVQYDNMKKVWKEVAGWGVQKGLNGGTMPHALVRQADGSFQMQVLPWTQRTCGDMDTNPTPSI 340 (792) T ss_pred cEEEEEc-cCCCCccceEEEEEcCCceEEEecccceeeeecccccCeeEEEcCCCcEEEEeccccccccCccccCcccee Confidence 2233321 1222333455444322110 000000000000 00 0112234331 11 Q ss_pred Cc---ceEEeccCcEEEEEeCCEEEEecCCCcccC-chh-cccccC------------cceEEEEEcCCcEEEEEcCCEE Q lcl|NC_019442. 249 NM---TGLCLMANGIAAGFAGNEVMFSEAYLPYAW-PEV-NRHTTA------------EDIVAICPLGTSLVVATKGEPY 311 (541) Q Consensus 249 ~~---~gL~~m~NGi~a~f~Gn~l~fSep~~P~aw-p~~-y~~t~~------------~~Iv~ia~v~~~lvV~T~~~py 311 (541) .. .++ .+.++++.-..++.||+|....++.| +.. ..++=+ ..|.-+.+++..|+++|++.-| T Consensus 341 ~g~~i~~v-~f~q~RL~f~~~~~v~~Srtgd~~nF~~~t~~~~~DdD~I~~~~ss~~~~~i~~~v~~~~~L~l~T~~~q~ 419 (792) T protein:vir:94 341 VDQKINDV-FFFRNRLGFLAGENIVMSRTSKYFSLFPASVANLSDDDPIDVAVSHNRISILKYAVPFSEELLLWSDQAQF 419 (792) T ss_pred ccCCcceE-EEEcceEEEecCCeEEEEccCCcccCccccccCCCCCccEEEEecCCcceeeeEEeecCCcEEEEecCcEE Confidence 01 223 45566665557999999999999997 332 111111 3456688899999999999999 Q ss_pred EEEccC---cccceEEeecccccccccchheeCCccEEEecCCcEE------EEeCC--CceE-----EEecccCChhHh Q lcl|NC_019442. 312 LFSGVS---PSTISGSRIPSMQACLSRRSMVAMEGFVLYAGTNGLV------SVDVN--GNTA-----LATEKIISPEQW 375 (541) Q Consensus 312 ~l~G~~---p~s~~~~~l~~~~pCvs~rsiv~~~~~v~y~s~dGLv------~~~~~--G~~~-----~vT~~~~~~~~W 375 (541) .|+|.+ |.+..+. +...-.|.+.=..+.+|+.++|+++.|=. ..... .... ++-..+|. T Consensus 420 ~l~~~~~lTP~~~~i~-~~s~~~~~~~~~Pv~vG~~v~Fv~~~g~~~~v~r~~~~~~~~d~y~a~DlT~~~~hl~~---- 494 (792) T protein:vir:94 420 VLSAQGILSPKSVELN-LTTEFDVSDRARPFGVGRGVYFASPRASYTSLNRYYAVQDVSSVKSAEDMSAHVPNYIP---- 494 (792) T ss_pred EEeCCCcccceeEEEE-EEEEeeccCCCCceEeCCeEEEeecCCCeeEEEeeeeeccccCceehhhHHHHHHHhcC---- Confidence 999864 4444433 23334576666778999999999998821 11111 1110 11111111 Q ss_pred hhhcCc-ceEEEEEEcCeEEEEEecCCCccceEEEc-cCC-c---eeEEEeec--ccEEEEEecCCEEEEEEC----CEE Q lcl|NC_019442. 376 QSQFNP-ASIVAYSWRGEYIACYTKPDGKQDVFVFS-PVN-M---DIRYLSTP--FDCAWVDLAKDMMRVVTG----DKM 443 (541) Q Consensus 376 ~~~l~P-~ti~a~~~eG~Y~~~y~~~~g~~~~~i~d-~~~-~---~~~~~~~~--~d~~~~~~~~d~LY~~~g----~~i 443 (541) ++ ..+.|.+-++.=+.+.+..+|+.-++=|. .+. + .-.+.+++ +-..-.+..+|+||++.. ..+ T Consensus 495 ----~~v~~~~a~~~~~~~vv~~~~~~g~l~~~ty~~~~~e~~v~aW~~~~~~g~~~~~~~~~~~D~l~~~v~r~~~~~~ 570 (792) T protein:vir:94 495 ----NGVFSIRGSSTENFISVLSSNAPSRIFLYKFLYLNEEIAQQSWSHWELGSNVTVLACDSIGSTMYLVLRNQSHTWM 570 (792) T ss_pred ----CceEEEEEeCCCCcEEEEEEcCCCeEEEEEEeecCCceEEEeEEEEEcCCcEEEEEEeecCCEEEEEEEeCCCEEE Confidence 01 11234444554455555555533222221 111 2 12222221 111112456799998652 222 Q ss_pred EEe----cCCC---CceeEEEEcc-eEEeCccc-----ceeEE---EEee-CCCccEEEEEEECCceeEe-eccccc--C Q lcl|NC_019442. 444 SVL----AGGS---LPSTIRWHSK-IFSLPERT-----SFSCI---RVKS-PAPERVGITIMADDVPVIH-FAPGTF--K 503 (541) Q Consensus 444 ~~~----~~g~---~~~~~~WrSk-~f~~~~~~-----~~~~~---~V~~-~~~~~~~v~~~~d~~~~~~-~~~~~~--~ 503 (541) .++ ++-. .+.....-++ .+..+... .+... -+.+ +..+--.+.+..||..... ...... . T Consensus 571 ~r~~~~~~~~d~~~~~~~~~lD~~~~~~~~~~~~~~~~~~T~~~~~~~~gl~~l~G~~v~v~~dG~~~~~~~~~~~~~~~ 650 (792) T protein:vir:94 571 CRAHFTKNSIDFPDEPYRLYIDNKVKYVIPEGSYNDDTYATTVKPVDVYGMKYWTGKFYIVASDGLVSWFEPPRGGWPNG 650 (792) T ss_pred EEEEEeecccccCCCcceeeeeeeeeEEecCcceecCceeeeeccccccCcccccCcEEEEEecCceeEeecccceecCC Confidence 222 1110 0111111111 12111110 00000 0000 0011123344556542110 111111 1 Q ss_pred CcceEccCcc-cceEEE--------E------------------EEecceEEEEEeecchh-hcCC Q lcl|NC_019442. 504 GSVVRLPAAT-GQNWQV--------M------------------VSGFGQVERITLSTSMS-EMPV 541 (541) Q Consensus 504 ~~~~rLP~~~-~~~w~i--------e------------------i~g~~~V~~i~la~s~~-EL~~ 541 (541) ...++||+.+ +...+| + -.|..+++|+.+.---. ++-+ T Consensus 651 ~~~i~~~g~~~a~~v~VGl~y~~~~~~~~~~~~~~~g~~~~~~~~~gr~rl~r~~~~~~~tg~~~v 716 (792) T protein:vir:94 651 VPMLTMSGNREGETIYVGLAISFRYVFSKFLIKKTADDGSIATEDIGRLQLRRAWVNYEDSGAFTV 716 (792) T ss_pred ccEEEecCCccCCeEEEeeeeeEEEEeccceeeccCCCcCccccceeeEEEEEEEEeeeccceeEE Confidence 1234555332 111111 1 11222333332221111 1111 No 44 >protein:vir:1543 Length: 801 # NCBI annotation: tail tubular protein B # Family: family:all:825 # MgeID: mge:31 # MgeName: phiYeO3-12 # Cross-refs: genbank:acc:NP_052111;swissprot:trembl:q9t105;genbank:gi:9634037;uniprot:Q9T105;genbank:GeneID:1262408 Probab=68.55 E-value=0.24 Score=24.03 Aligned_cols=508 Identities=11% Similarity=0.097 Sum_probs=164.9 Q ss_pred CceEEe---cccccccccccceecccccceEEEEeeecCCeeeeeecccccCccccccceeEEEEC-----CcEEEEeCC Q lcl|NC_019442. 1 MPYIDI---TTMRGMMPRVVTSMLPEHSAVLAEDCHFRFGVITPERQISGVEKTFTIKPKTIFHYR-----DDFWFAWPD 72 (541) Q Consensus 1 m~~i~i---~~f~G~~Pr~~p~llp~~~a~~a~N~~~~~G~l~P~~~~~~v~~~~~~~~~Tif~~~-----~~~W~~w~~ 72 (541) |-.|.- ..-.++.|.+ |++.-+..+... +-|.+|.|+=+... ...... ...+=+... +=.|.+-.| T Consensus 48 t~~va~~~~~~~~~~~~~~--~~~~~~~~e~y~-l~~~~~~irv~~~~-G~~~~v--~~~~~y~~~~~~~~~l~~~~~aD 121 (801) T protein:vir:15 48 MIHLKTLGPAGYVGAQPYV--HLINRDEFEQYF-VVFTGEDIKVFDLD-GKEYQV--RGDRSYVRTANPREDLRMITVAD 121 (801) T ss_pred hheeeeecCCCCcccceeE--EEEEeCCceEEE-EEEcCCeEEEEccC-CcEEEE--ecCCccccccCchhheeEEEEcC Confidence 333332 2333344443 222222222211 34445555544321 100000 000000000 111222222 Q ss_pred eEEEee--------------CCcccCCCCeEEEeCCCCcceeecceeeccccCCccceeeecCCCCCccc---eEEecCC Q lcl|NC_019442. 73 VVDVIR--------------SPIAQDPHGRIYYTDGRFPKVTDATIATKGDGNHPASSYSLGIPAPTTAP---VCTVQQG 135 (541) Q Consensus 73 ~V~vv~--------------spia~D~~~Rvy~t~~~~pk~t~~~ia~~g~g~~p~a~y~LGVp~P~~~p---v~~v~~~ 135 (541) .+-++. ++.+..++.=++.-+..+-+...-++ -| .. .+.+ .+|..+.+. .....+. T Consensus 122 ~~fi~nr~~~~~~~~~~~~~~~~~~~~~alv~v~~~~yg~t~~I~i--~g--s~-~~~~--t~~~gs~~~~~~~~s~~~i 194 (801) T protein:vir:15 122 YTFVTNRKVVVQSNDQSVNLPGFKDQGDALINVRGGQYGRRLSIEF--NG--AE-RAAV--QLPDGSQPAHVNEVDGQAI 194 (801) T ss_pred EEEEeeCCeeeecccCccccCccCCCCceEEEeeeccCceeEEEEe--CC--cc-eEEE--EeccCcccchhhhcceeec Confidence 111110 00111111111111111111000000 00 00 0011 111111100 0000000 Q ss_pred CCC-----CCCCCC---cccceEEEEE------EEecCC------CccCcccccccee--ecCCCCEEEEccccCCCCcc Q lcl|NC_019442. 136 GDV-----SDDNPN---DDETRFYTET------FVSDYG------EEGPPGPASLEVT--LRTPGTAVQLTLSPVPLQNA 193 (541) Q Consensus 136 ~~~-----~~~~~~---~~~ty~Yv~T------~V~~~G------eEs~Ps~~S~~vt--v~~~g~~v~l~~~p~~~~~~ 193 (541) +.. ....+. ......|.++ ++...+ .+...+-.-.... ...-.+.-+|+.. + ++|. T Consensus 195 a~~l~~~~~~~~p~~~~~~~~~~w~~~~~~g~~~i~a~~~~~~~~~~t~dg~~~~~~~~~~~~v~~~~~lp~~-~-~~G~ 272 (801) T protein:vir:15 195 AEKLAAQLRNNLGNPNNDQDPNKWRFNVGPGFIHILAPNNDNVWGLQTKDGYADQLINPVTHYTQSFQKLPIN-A-PDGY 272 (801) T ss_pred hHHHhhhhhhccCccceeccCccEEEEecCcEEEEeCCCCcccceeeeccccCceeeeEEeecccceeeeeee-c-CCCc Confidence 000 000000 0000011111 111100 0000000000000 0000011111111 1 1111 Q ss_pred ccceEEEEEee-cCCCceeEEEEEeecc---------------------------ceEEEEecccccccCccchhhhhhC Q lcl|NC_019442. 194 SIKRRRIYRSA-SGGGEADFLLVAELDA---------------------------SVLSYTDKIPGKNLGPSLATWDYLP 245 (541) Q Consensus 194 ~i~~~RIYRs~-t~~~~~~~~lVael~~---------------------------~~~sf~D~~~~~~L~~~L~t~~~~~ 245 (541) + ++| +. ++.....|++-.+... +..+|.....+-+--.+ ...+.++ T Consensus 273 -~--v~v--~~~~~~~~~~y~v~~~~~~~~w~E~a~~g~~~~~~~~tmp~~lv~~~~~~~~~~~~~w~~r~~-gd~~tnp 346 (801) T protein:vir:15 273 -I--VKI--VGDTSKTADQYYVRFDLNRKVWVETIGWNTRTHLYYHTMPWALVRASDGNFDFKVLEWGARTV-GDDTTNP 346 (801) T ss_pred -E--EEE--EecCCCccceEEEEEEcCCeeEEeecccccceeeeccccceEEEeeccceEEEeccccccccC-CccccCC Confidence 1 110 01 1111122222211110 11122221111000000 0001111 Q ss_pred CCC-CcceE--EeccCcEEEEEeCCEEEEecCCCcccC-chh-cccccC------------cceEEEEEcCCcEEEEEcC Q lcl|NC_019442. 246 PPE-NMTGL--CLMANGIAAGFAGNEVMFSEAYLPYAW-PEV-NRHTTA------------EDIVAICPLGTSLVVATKG 308 (541) Q Consensus 246 pP~-~~~gL--~~m~NGi~a~f~Gn~l~fSep~~P~aw-p~~-y~~t~~------------~~Iv~ia~v~~~lvV~T~~ 308 (541) .|. ..+++ +.+.++++.-..++.||+|....++.| +.. -.++=+ ..|.-+.+++..|+++|++ T Consensus 347 ~psf~g~~~~~v~f~q~RL~f~~~~~v~~Srtgd~~nF~~~t~~~~~DdD~i~~~~~~~~~~~i~~~v~~~~~L~i~t~~ 426 (801) T protein:vir:15 347 YPSFTGQTINDIFFFRNRLGFLSGENIILSRTSKYFNFFPASVSNYSDDDPIDVAVSHNRVSTLKYAVPFSEELLLWSDQ 426 (801) T ss_pred cccccCCCceEEEEEcceEEEeeCCeEEEEecCCccccccccccCCCCCccEEEEecCCcceeeEEEeecCCcEEEEecC Confidence 111 11222 355666665557999999999999997 321 111111 3456688889999999999 Q ss_pred CEEEEEccC---cccceEEeecccccccccchheeCCccEEEecCCcE-------EEEe--CCC----ceEEEecccCCh Q lcl|NC_019442. 309 EPYLFSGVS---PSTISGSRIPSMQACLSRRSMVAMEGFVLYAGTNGL-------VSVD--VNG----NTALATEKIISP 372 (541) Q Consensus 309 ~py~l~G~~---p~s~~~~~l~~~~pCvs~rsiv~~~~~v~y~s~dGL-------v~~~--~~G----~~~~vT~~~~~~ 372 (541) .-|.|+|.+ |.+..+.+ ...-.|.+.=..+.+|+.++|+++.|= +..+ .++ ..+++-..+| T Consensus 427 ~q~~ls~~~~lTP~~~~~~~-~s~~~~~~~~~Pv~vg~~v~f~~~~g~~~~~~r~~~~~~~~d~y~a~Dlt~~~~hl~-- 503 (801) T protein:vir:15 427 AQFVLTASGILSSRSVELNL-TTQFDVQDRARPHGVGRNVYFASPRASFTSINRYYAVQDVSSVKNAEDMTAHVPNYI-- 503 (801) T ss_pred cEEEEcCCCcccceeEEEEE-EEeeeccCCCCceEeCCeEEEEecCCCeeEEEEEEeecccccceehhhHHHHHHHhc-- Confidence 999999864 44544332 333457666677899999999999881 1111 111 1111122222 Q ss_pred hHhhhhcCcceEE---EEEEcCeEEEEEecCCCccceEEEccCC--c---eeEEEeec--ccEEEEEecCCEEEEEEC-- Q lcl|NC_019442. 373 EQWQSQFNPASIV---AYSWRGEYIACYTKPDGKQDVFVFSPVN--M---DIRYLSTP--FDCAWVDLAKDMMRVVTG-- 440 (541) Q Consensus 373 ~~W~~~l~P~ti~---a~~~eG~Y~~~y~~~~g~~~~~i~d~~~--~---~~~~~~~~--~d~~~~~~~~d~LY~~~g-- 440 (541) |..++ +..-++.=+.+.+..+++.-++=|...+ + .-.+++++ +-..-....+|.||++.. T Consensus 504 --------~~~v~~~~~~~~~~~~~~~~~~~~~~l~~~~y~~~~~e~~v~aW~~~~~~g~~~~~~~~~~~d~l~~~v~r~ 575 (801) T protein:vir:15 504 --------PNGVFSISGTTAENFAAILTSGAPNRVYIYKFLYIDEEIRQQSWSHWDFGDNVTVFAAQVINSTMTVLMGNE 575 (801) T ss_pred --------CCceEEEEEeCCCCcEEEEEEcCCCEEEEEEEecCCCceEEEeeEEEEcCCCEEEEEEEecCCEEEEEEEec Confidence 22221 1112232222333333322122221111 1 12233331 111112346799998752 Q ss_pred --CEEE--EecC------CCCceeEEEEcceEEeCccccee---------EEEEeeC-CCccEEEEEEECCceeEee--c Q lcl|NC_019442. 441 --DKMS--VLAG------GSLPSTIRWHSKIFSLPERTSFS---------CIRVKSP-APERVGITIMADDVPVIHF--A 498 (541) Q Consensus 441 --~~i~--~~~~------g~~~~~~~WrSk~f~~~~~~~~~---------~~~V~~~-~~~~~~v~~~~d~~~~~~~--~ 498 (541) ..|. ++.. ++....+.+....+..+. .++. +..+++. ..+-..+.+..||...... . T Consensus 576 ~~~~~~r~~~~~~~~~~~~~~~~~~lD~~~~~~~~~-~t~~~~~~~~~~~~~~~~gl~~l~g~~v~v~~dG~~~~~~~~~ 654 (801) T protein:vir:15 576 HAVWMGRLHFTKNSIDIPGEPYRLYIDAKRKYTIPA-GTYNDDTYQTSISLATIYGMNFTKGRVSVVFPDGKIIEVDQPI 654 (801) T ss_pred CcEEEEEEEEccccccCCCcceeeeeeeeeeEeecc-ceeccCceecccccccccccccccceEEEEEeCCceeeeeeec Confidence 2222 2221 111223344444443321 1111 1122211 1233455666777543322 2 Q ss_pred ccccCCcceEccCcc-cc--------eEEEE------------------EEecceEEEEEeecch-hhcCC Q lcl|NC_019442. 499 PGTFKGSVVRLPAAT-GQ--------NWQVM------------------VSGFGQVERITLSTSM-SEMPV 541 (541) Q Consensus 499 ~~~~~~~~~rLP~~~-~~--------~w~ie------------------i~g~~~V~~i~la~s~-~EL~~ 541 (541) .+...+..+++|+.+ +. ..+|| ..|..+|+|+.+.--. .++-+ T Consensus 655 ~g~~~~~~~~i~~~~~~~~v~vGl~y~~~~~~~~~~~~~~~~~~~~~~~~~~rl~l~r~~~~~~~tg~~~~ 725 (801) T protein:vir:15 655 NGWSSDPVLRLDGNQEGQVVYIGFNIPFTYTFSKFLIKKTAEDGSTATEDIGRLQLRRAWVNYEDSGAFTI 725 (801) T ss_pred CcccCcceEEEcCCCCCcEEEEeeeeeEEEEecceEEeccCCCCCceeeeeccEEEEEEEEEeccCcceEE Confidence 222233355666432 11 12222 2233455555543211 12222 No 45 >protein:vir:78703 Length: 905 # NCBI annotation: tail tube B # Family: family:all:825 # MgeID: mge:1856 # MgeName: Syn5 # Cross-refs: genbank:acc:YP_001285450;genbank:gi:148724484;genbank:GeneID:5220174 Probab=40.42 E-value=0.97 Score=20.65 Aligned_cols=487 Identities=12% Similarity=0.054 Sum_probs=155.4 Q ss_pred CceEEecccccccccccceecccccceEEEEeeecCCe---------eeeeeccccc-C------ccccccceeEEEECC Q lcl|NC_019442. 1 MPYIDITTMRGMMPRVVTSMLPEHSAVLAEDCHFRFGV---------ITPERQISGV-E------KTFTIKPKTIFHYRD 64 (541) Q Consensus 1 m~~i~i~~f~G~~Pr~~p~llp~~~a~~a~N~~~~~G~---------l~P~~~~~~v-~------~~~~~~~~Tif~~~~ 64 (541) -+..++ ..+|+-.+. +..+--.+-.+-.+.....+. .++-..+-+. + ...+.....+= .. T Consensus 242 ~~~~~l-~~g~~~~~~-~~~~~v~~~g~~y~i~i~~~~~~~~~~~~~~~~~~t~~d~~a~~~~~~~i~~~l~~~~~--~~ 317 (905) T protein:vir:78 242 NVSVVL-QNGGTGFRK-GDMITVNLNGRDYNIRVTQEEFVYTYASDGTAAHTTPQDSTAGTLDIGQITAGLVNSVN--LI 317 (905) T ss_pred cceeee-ecccccccc-CccEEEeeccceEEEEEecceeEEEecCCCcccccCccCccCccccHHHHHHHHHHhhc--cc Confidence 011110 001111000 000000000011122222211 1111111000 0 00000000000 00 Q ss_pred cEEEEeC--CeEEEeeCCcccCCCCeEEEeCCCCcceeecceeeccccCCccce-eeecCCCCCccce-------EEecC Q lcl|NC_019442. 65 DFWFAWP--DVVDVIRSPIAQDPHGRIYYTDGRFPKVTDATIATKGDGNHPASS-YSLGIPAPTTAPV-------CTVQQ 134 (541) Q Consensus 65 ~~W~~w~--~~V~vv~spia~D~~~Rvy~t~~~~pk~t~~~ia~~g~g~~p~a~-y~LGVp~P~~~pv-------~~v~~ 134 (541) ..|-.-- ..++ |..+-+.+ -..-|.|+..-. ..+ ..=-|..++-+|. +.+.. T Consensus 318 ~~~~~~~~g~~i~-v~~~~~~~---~~~~~~~g~~~~--------------~~~~~~~~v~~~~~Lp~~~~~g~~v~v~~ 379 (905) T protein:vir:78 318 SNYSAQAVGNVIE-IERTDGRD---FNLGVRGGATNR--------------AMTAIKGTANSIVDLPGQCFDGFELKVIN 379 (905) T ss_pred ccEEEEecCcEEE-EEecCCCc---cEEEEeccCCcc--------------eEEEEeccccccccCccccCCCcEEEEEe Confidence 1111100 1222 22221111 112222222210 000 0001222222222 22221 Q ss_pred CCCCCCCCCCcccceEEEEEEEecCCCccCccccccceeecCCCCEEEEccccCCCCccccceEEEEEeecCCCceeEEE Q lcl|NC_019442. 135 GGDVSDDNPNDDETRFYTETFVSDYGEEGPPGPASLEVTLRTPGTAVQLTLSPVPLQNASIKRRRIYRSASGGGEADFLL 214 (541) Q Consensus 135 ~~~~~~~~~~~~~ty~Yv~T~V~~~GeEs~Ps~~S~~vtv~~~g~~v~l~~~p~~~~~~~i~~~RIYRs~t~~~~~~~~l 214 (541) .+ ...+..|.+.|+++.+..++.. + =++...+|....+.... .-.+|||...+ .|.+ T Consensus 380 ~~--------~~~~d~yyv~~~~~~~~~~~~~--~-W~E~~~~~~~~~~~~~t--------mp~~l~r~~~g----~f~~ 436 (905) T protein:vir:78 380 TE--------NAESDDYYVVFRSAAEGIPGSG--S-WEETVAPGIERGFNTST--------MPHALIRQADG----NFTL 436 (905) T ss_pred CC--------CCCcceEEEEEEecccCCcCce--e-EEEeccccccccccccc--------ccEEEEEecCc----eEEE Confidence 11 1234567788888754443211 1 11122333222222111 23788985333 3555 Q ss_pred EEeecc-ceEEEEecccccccCccchhhhhhCCCCCcceEEeccCcEEEEEeCCEEEEecCCCcccC-chh-cccccC-- Q lcl|NC_019442. 215 VAELDA-SVLSYTDKIPGKNLGPSLATWDYLPPPENMTGLCLMANGIAAGFAGNEVMFSEAYLPYAW-PEV-NRHTTA-- 289 (541) Q Consensus 215 Vael~~-~~~sf~D~~~~~~L~~~L~t~~~~~pP~~~~gL~~m~NGi~a~f~Gn~l~fSep~~P~aw-p~~-y~~t~~-- 289 (541) .+--.. ....+.+....++....-|+..- .+| .++ .+.++++.-..++.||+|.+..++.| +.. -.++=+ T Consensus 437 ~~~~~~~~~~~~~~r~~Gd~~Tnp~psf~g-~~i---s~v-~f~q~RL~f~s~~~v~~Srtgd~~nF~~~t~~~~~DdDp 511 (905) T protein:vir:78 437 EALNDEGTITGWAQREVGDDDTNPKPSFVG-RGI---SDM-FFYNNRLGFLSEDAVIMSQPGDYFNFFVTSAITISDSDP 511 (905) T ss_pred EEeccccccccccccccCCcccCCCCcccC-CCc---ceE-EEEcceEEEecCCeEEEEccCCccccccccccCCCCCcc Confidence 542211 11123222221111111122211 111 233 45566665556899999999999997 331 111122 Q ss_pred ----------cceEEEEEcCCcEEEEEcCCEEEEEcc----CcccceEEeecccccccccchheeCCccEEEecCCc--- Q lcl|NC_019442. 290 ----------EDIVAICPLGTSLVVATKGEPYLFSGV----SPSTISGSRIPSMQACLSRRSMVAMEGFVLYAGTNG--- 352 (541) Q Consensus 290 ----------~~Iv~ia~v~~~lvV~T~~~py~l~G~----~p~s~~~~~l~~~~pCvs~rsiv~~~~~v~y~s~dG--- 352 (541) ..|.-+.+++..|+|+|++.-|.|+|. +|.+.++.+ ...-.|-+.=..+.+|+.++|+++.| T Consensus 512 I~~~~ss~~~~~i~~~v~~~~~L~ifT~g~ef~lsg~~~~lTP~s~~i~~-~S~~~~~~~v~Pv~vG~~vlFv~~~g~~s 590 (905) T protein:vir:78 512 IDVTASSTKPAILRAAIGAPKGLILFAENSQFLLASQEVVFSTATIKLTE-ISDYFYRSLAKPVSTGVSIAFVSEADTYS 590 (905) T ss_pred EEEEEcCCcceeeEEEeecCCcEEEEecCceEEEecCCccccceeEEEEe-EEeecccCCCCcEEeCCeEEEeecCCCee Confidence 335567888999999999999999873 455544442 33445755555688999999999987 Q ss_pred -E--EEEeCC--C----ceEEEecccC-ChhHhhhhcCcce-EEEEEEcCeEEEE-EecCCCccceEEE---ccCCceeE Q lcl|NC_019442. 353 -L--VSVDVN--G----NTALATEKII-SPEQWQSQFNPAS-IVAYSWRGEYIAC-YTKPDGKQDVFVF---SPVNMDIR 417 (541) Q Consensus 353 -L--v~~~~~--G----~~~~vT~~~~-~~~~W~~~l~P~t-i~a~~~eG~Y~~~-y~~~~g~~~~~i~---d~~~~~~~ 417 (541) + +..+.. + ..+++-..+| .+..|...-.|.+ ++...-+|+=+++ |-.....+...-. +.. +.+. T Consensus 591 ~vre~~y~~~~d~y~a~DlT~~a~hl~~g~v~~~~~s~~~~~v~~~~~~~~l~~ytyl~~~~eq~v~AWsrw~~~-G~~~ 669 (905) T protein:vir:78 591 KIFEMSIDSVDNRPQVADITRIVPEYVPTGLTWSVSTPNNSMMLFGDNSNTAYIFKFFNQGNERQVAGWSKWILP-GEQR 669 (905) T ss_pred EEEEEEeeecccceehhHHHHHHHHhcCCceEEEEecCCCcEEEEEcCCCeEEEEEeecCCCceeEEeEEEEecC-CCeE Confidence 2 122211 1 1111112222 1122433334444 3555556664332 1111111100000 000 1111 Q ss_pred EEeecccEEEEEec---CCE--EEEEECCEEEEecCCCCc--eeEEEE-cceEEeCcccc---eeE--EEEeeCCC---- Q lcl|NC_019442. 418 YLSTPFDCAWVDLA---KDM--MRVVTGDKMSVLAGGSLP--STIRWH-SKIFSLPERTS---FSC--IRVKSPAP---- 480 (541) Q Consensus 418 ~~~~~~d~~~~~~~---~d~--LY~~~g~~i~~~~~g~~~--~~~~Wr-Sk~f~~~~~~~---~~~--~~V~~~~~---- 480 (541) -...-.|..|+-+. ++. +|+.+ ..-+.+..... ..+.+. .--|....... ++. ..+..... T Consensus 670 ~~a~i~d~~~~vV~r~~~G~~~~~~~~--l~~~~~~~~~d~~~~~~~~~~d~~~~~~~~t~~~~~~~~~~~~~~~~~~~~ 747 (905) T protein:vir:78 670 MCGFFADTGYFVLYDSTTGSYVLSAME--LLDDPDSASIDTAFSSFLPRLDNYVVKSDLTVVDNGDGTLTVDLEAGQAMT 747 (905) T ss_pred EEEEEcCCEEEEEEEccCCeEEEEEEe--eccccCccccccceeeeeeccceeeecccceecccCcceEeeeccCccccc Confidence 11111122222111 111 11110 00000000000 000110 11111111000 000 00000000 Q ss_pred ccEEEEEEECCceeEeecc------------------cccCCcceEccCc-ccceEEEEEEecceEEEEEeecch-hhcC Q lcl|NC_019442. 481 ERVGITIMADDVPVIHFAP------------------GTFKGSVVRLPAA-TGQNWQVMVSGFGQVERITLSTSM-SEMP 540 (541) Q Consensus 481 ~~~~v~~~~d~~~~~~~~~------------------~~~~~~~~rLP~~-~~~~w~iei~g~~~V~~i~la~s~-~EL~ 540 (541) ....+-+..||....++.. +..-..-+.+|+. ...+-.-.-.+..+|.++.|.=-. .++- T Consensus 748 ~~~~~~~~~dG~~~~~~~~~~~~~~~~t~~~a~~v~VGl~Y~s~v~~~p~~~~~~~~s~~~~~~rI~rv~lr~~~Sg~~~ 827 (905) T protein:vir:78 748 GATPVIMFTDGPSEFAFSQPTITAGQFTVDTTDDFVVGFKYETKITLPGFFTSEENKADRVYAPIVEFLYLDLYYSGRYQ 827 (905) T ss_pred cceeEEEeeCCceeeeEEEEEeeceeeccccCCeEEEeeeeeEEEeecceEeccCCCcccccceEEEEEEEEeecceeEE Confidence 0001111222221111110 1100111223322 111111112355566666554111 1122 Q ss_pred C Q lcl|NC_019442. 541 V 541 (541) Q Consensus 541 ~ 541 (541) | T Consensus 828 v 828 (905) T protein:vir:78 828 I 828 (905) T ss_pred E Confidence 2 No 46 >protein:vir:6326 Length: 826 # NCBI annotation: tail tubular protein B # Family: family:all:825 # MgeID: mge:132 # MgeName: phiKMV # Cross-refs: genbank:acc:NP_877473;genbank:gi:33300845;uniprot:Q7Y2D1;genbank:GeneID:1482615 Probab=30.15 E-value=1.6 Score=19.47 Aligned_cols=483 Identities=11% Similarity=0.033 Sum_probs=161.6 Q ss_pred Cc-------------eEEeccccc-ccccccce---ecccccceEEEEeeecCCeeeeeecccccCccccccceeEEEEC Q lcl|NC_019442. 1 MP-------------YIDITTMRG-MMPRVVTS---MLPEHSAVLAEDCHFRFGVITPERQISGVEKTFTIKPKTIFHYR 63 (541) Q Consensus 1 m~-------------~i~i~~f~G-~~Pr~~p~---llp~~~a~~a~N~~~~~G~l~P~~~~~~v~~~~~~~~~Tif~~~ 63 (541) .. .|.|.+-.+ .-+-..+. -++++ ...++++........-. -........+ +. T Consensus 149 ~~~~~v~~g~Y~~~y~vti~~~~~~~gt~~s~t~t~~t~~~---~~a~~~~~~~~~~~s~~-----yia~~l~~~~--~a 218 (826) T protein:vir:63 149 AGWLYIKAGQYSKAFSMTIKVKDNATGTTYSHTATYVTPDN---ASTNPNLAEAPFQTSVG-----YIAWQLYGKF--FG 218 (826) T ss_pred cEEEEeeccccCceEEEEEEeccccCCccccceEEEEeccC---Ccccccccccceeeeee-----eeeeeceeee--ee Confidence 11 123321110 11111111 11111 11122222222111100 0000000111 12 Q ss_pred CcEEEEeCCeEEEeeCCcccC---------------CCCeEEEeCCCCcceeecceeeccccCCcc--------ceeeec Q lcl|NC_019442. 64 DDFWFAWPDVVDVIRSPIAQD---------------PHGRIYYTDGRFPKVTDATIATKGDGNHPA--------SSYSLG 120 (541) Q Consensus 64 ~~~W~~w~~~V~vv~spia~D---------------~~~Rvy~t~~~~pk~t~~~ia~~g~g~~p~--------a~y~LG 120 (541) ..+|....+... +..++.| +-..+|+..++.+... +.+.++.++-. ....|. T Consensus 219 ~~~~~~~~~t~~--~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~g~~~~~~~~~~~~~~~~~l~ 293 (826) T protein:vir:63 219 APEYTLPNSTKK--YPKVDPDANAATIAGYLNQRGVQDGYIAFRGDADIHVE---VSTDMGNNYGIASGGMSLNATADLP 293 (826) T ss_pred ccccccCCCccc--cceecCCcccceeecceeEecccccEEEEeeCCcccEE---EccCCCCcceEEEEEeeccceeecc Confidence 233332221110 0111110 1123444443333211 00000001101 111222 Q ss_pred CCCCCccceE-Eec--CCCCCCCCCCCcccceEEEEEEEecCCCccCccccccceeecCCCCEEEEccccCCCCccccce Q lcl|NC_019442. 121 IPAPTTAPVC-TVQ--QGGDVSDDNPNDDETRFYTETFVSDYGEEGPPGPASLEVTLRTPGTAVQLTLSPVPLQNASIKR 197 (541) Q Consensus 121 Vp~P~~~pv~-~v~--~~~~~~~~~~~~~~ty~Yv~T~V~~~GeEs~Ps~~S~~vtv~~~g~~v~l~~~p~~~~~~~i~~ 197 (541) --.|..++.. .+. .+.... . + .....|.+.|....|--. + +. .+|....+ .++.+ T Consensus 294 ~~~p~~~~~~~~~~~~~~~~~~-~--g-~~~d~~y~~~~~~~~~w~-------e-~~-~~~~~~~~---------~tmp~ 351 (826) T protein:vir:63 294 ALLPGVGAPGVGVQFMDGAVMA-T--G-STKAPVYFEWDSANRRWA-------E-RA-AYGTDWVL---------KKMPL 351 (826) T ss_pred ccCCCcccceEEEeeEEeEEec-C--C-CcccceEEEEEcCCceEE-------E-Ee-ecCccccc---------ccceE Confidence 1123222111 110 000000 0 0 111233356665443100 0 00 12211111 11212 Q ss_pred EEEEEeecCCCceeEEEEEeeccceEEEEecccccccCccchhhhhhCCCCCcceEEeccCcEEEEEeCCEEEEecCCCc Q lcl|NC_019442. 198 RRIYRSASGGGEADFLLVAELDASVLSYTDKIPGKNLGPSLATWDYLPPPENMTGLCLMANGIAAGFAGNEVMFSEAYLP 277 (541) Q Consensus 198 ~RIYRs~t~~~~~~~~lVael~~~~~sf~D~~~~~~L~~~L~t~~~~~pP~~~~gL~~m~NGi~a~f~Gn~l~fSep~~P 277 (541) +-..|... +.|-+-+ ..+.+-...++..-..|+.- ..+| .-+.|.++++.-..++.||+|....+ T Consensus 352 ~l~~~~~~----~~f~~~~------~~w~~r~~Gd~~tnp~psf~-g~~~----~~v~f~q~RL~f~~~~~v~~Srtgd~ 416 (826) T protein:vir:63 352 ALRWDEAT----DTYSLNE------LEYDRRGSGDEDTNPTFNFV-TRGI----TGMTTFQGRLVLLSQEYVCMSASNNP 416 (826) T ss_pred EEEEeccC----CeEEEec------cccccccccccccCCCcccc-CCCc----eEEEEEeceEEEeeCCeEEEEccCCc Confidence 22222111 1232211 12333222222221222211 1122 12355666665557999999999999 Q ss_pred ccC-chh-cccccC------------cceEEEEEcCCcEEEEEcCCEEEEEccC---cccceEEeecccccccccchhee Q lcl|NC_019442. 278 YAW-PEV-NRHTTA------------EDIVAICPLGTSLVVATKGEPYLFSGVS---PSTISGSRIPSMQACLSRRSMVA 340 (541) Q Consensus 278 ~aw-p~~-y~~t~~------------~~Iv~ia~v~~~lvV~T~~~py~l~G~~---p~s~~~~~l~~~~pCvs~rsiv~ 340 (541) +.| |.. .-++=+ ..|.-+.+++..|+++|++.-|.|+|.+ |.+..+.+ ...-.|-+.=..+. T Consensus 417 ~nF~~~s~~~~~DdD~I~~~~ss~~~~~i~~~v~~~~~L~l~T~~~q~~ls~~~~lTP~~~~i~~-~s~~~~~~~~~Pv~ 495 (826) T protein:vir:63 417 HRWFKKSAAALNDDDPIEIAAQGSLTEPYEHAVTFNKDLIVFAKKYQAVVPGGGIVTPRTAVISI-TTQYDLDTRAAPAV 495 (826) T ss_pred cccccccccCCCCCccEEEEEcCCcceeeEEEeecCCcEEEEecCcEEEEeCCCcccceeEEEEE-EEeecccCCCCceE Confidence 997 321 111111 3466788899999999999999998754 55544332 23335766667789 Q ss_pred CCccEEEecCCc-----EEE--E--eCCCceEEEecccCChhHhhhhcCcceE--EEEEEcCeEEEEEecCCCccceEEE Q lcl|NC_019442. 341 MEGFVLYAGTNG-----LVS--V--DVNGNTALATEKIISPEQWQSQFNPASI--VAYSWRGEYIACYTKPDGKQDVFVF 409 (541) Q Consensus 341 ~~~~v~y~s~dG-----Lv~--~--~~~G~~~~vT~~~~~~~~W~~~l~P~ti--~a~~~eG~Y~~~y~~~~g~~~~~i~ 409 (541) +|+.++|+++.| +.- . ..++.. .. +-+|.--= .+-|..+ .+++-+--++.+.+..+|+--++=| T Consensus 496 vG~~v~Fv~~~g~~~s~v~e~~~~~d~~~~y-~~--~dlt~~~~--~l~~~~v~~~a~s~~~~~v~~~~~~dg~l~~~~y 570 (826) T protein:vir:63 496 TGRSVYFAAERALGFMGLHEMAPSPSTDSHY-VA--EDVTSHIP--SYMPGPAEYIQAAASSGYLVFGTSTADEMICHQY 570 (826) T ss_pred eCCeEEEEecCCCceeEEEEEEeeeccccce-eh--hHHHHHHH--HhcCCCeEEEEEcCCCCEEEEEEcCCCEEEEEEE Confidence 999999999877 211 1 122211 11 11111000 1222222 3444444456666655553222222 Q ss_pred c-cCC-c---eeEEEeecccEEEEEecCCEEEEEE----CCEEEEe--cCCCCceeEEEEcceEEeCcccceeEEE---- Q lcl|NC_019442. 410 S-PVN-M---DIRYLSTPFDCAWVDLAKDMMRVVT----GDKMSVL--AGGSLPSTIRWHSKIFSLPERTSFSCIR---- 474 (541) Q Consensus 410 d-~~~-~---~~~~~~~~~d~~~~~~~~d~LY~~~----g~~i~~~--~~g~~~~~~~WrSk~f~~~~~~~~~~~~---- 474 (541) . .+. + .-.+++++-........+|.||++. +..+..+ +.-+....+.+..+.+...-....++.- T Consensus 571 ~~~~~e~~v~aW~~~~~~g~v~~~~~i~d~l~~iv~r~~~~~~~r~~~e~~~~~~~~~~~~~d~~~~~d~~~~~~~~~~~ 650 (826) T protein:vir:63 571 LWQGNEKVQNAFHRWTLRHQIIGAYFTGDNLMVLIQKGQEIALGRMHLNSLPAREGLQYPKYDYWRRIEATVAGELELTK 650 (826) T ss_pred eeCCCcEEEEeEEEEecCCcEEEEEEECCeEEEEEEeCCCEEEEEEEEEecCCccccccCCccceEEEEEeeeeeeccCc Confidence 1 112 2 1223333322333344489999974 2333333 1111111111111111000000000000 Q ss_pred -E---eeCCCccEEEEEEECCceeEe-ecccccCCc--ceEccCc---------ccceEEEE--------------EEec Q lcl|NC_019442. 475 -V---KSPAPERVGITIMADDVPVIH-FAPGTFKGS--VVRLPAA---------TGQNWQVM--------------VSGF 524 (541) Q Consensus 475 -V---~~~~~~~~~v~~~~d~~~~~~-~~~~~~~~~--~~rLP~~---------~~~~w~ie--------------i~g~ 524 (541) + ..+..+-..+.+++|+..... .......+. .+++|.. +.-..+|+ ..|. T Consensus 651 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~v~l~~~~~~~~~~v~VGl~y~s~~~~~~~~~~~~~g~~~~~gr 730 (826) T protein:vir:63 651 QHWDLIKDASAVYQLQPVAGAYMERTHLGVKRETNTKVFLDVPEAVVGAVYVVGCEFWSKVEFTPPVLRDHNGLPMTSTR 730 (826) T ss_pred ceeecccCcccccEEEEeeCccccCCccceEEecCCEEEEecCCCccccEEEEeeeeeEEEEecceEEEccCCCcceecc Confidence 0 000111122233333321000 000000010 1222321 11111221 2356 Q ss_pred ceEEEEEeec-chhhcCC Q lcl|NC_019442. 525 GQVERITLST-SMSEMPV 541 (541) Q Consensus 525 ~~V~~i~la~-s~~EL~~ 541 (541) .+++|+.+.- ....+-+ T Consensus 731 ~~l~r~~~~~~~tg~~~v 748 (826) T protein:vir:63 731 AVLHRYNVNFGWTGEFLW 748 (826) T ss_pred EEEEEEEEEeeccccEEE Confidence 6777766653 1122222 No 47 >protein:vir:94713 Length: 785 # NCBI annotation: tail tube # Family: family:all:825 # MgeID: mge:1528 # MgeName: K1F # Cross-refs: genbank:acc:YP_338122;genbank:gi:77118200;genbank:GeneID:3707736 Probab=27.73 E-value=1.8 Score=19.16 Aligned_cols=506 Identities=11% Similarity=0.092 Sum_probs=171.2 Q ss_pred CceEE-ecccccccccccceecccccceEEEEeeecCCeeeeeecccccCccccccceeEEEECCcEE--EEeCCeEEEe Q lcl|NC_019442. 1 MPYID-ITTMRGMMPRVVTSMLPEHSAVLAEDCHFRFGVITPERQISGVEKTFTIKPKTIFHYRDDFW--FAWPDVVDVI 77 (541) Q Consensus 1 m~~i~-i~~f~G~~Pr~~p~llp~~~a~~a~N~~~~~G~l~P~~~~~~v~~~~~~~~~Tif~~~~~~W--~~w~~~V~vv 77 (541) |-.|. +..-.+..+++.+=--.++++-+ .-|.+|.|+=++. ....... ...+-|....+.| +.|.. T Consensus 48 ~~~v~~l~~~~~~~~~~~~f~~~~~~~y~---l~~~~~~irv~~~-~G~~~~v--~~~~~y~~~~~~~~~l~~~q----- 116 (785) T protein:vir:94 48 TVFKRRLNIDVGSNPKFHLINRDEQEQYY---IVFNGSNIQIVDL-SGNQYSV--SGSVDYVKSSNPRDDIRVVT----- 116 (785) T ss_pred hHhhhcccCCCCcCcEEEEEEeCCCceEE---EEEcCCeEEEEec-CCcEEEE--ecCCCceeecCchhheeeEe----- Confidence 22222 12222333333332223333322 3344566654432 1111111 1111111122222 22221 Q ss_pred eCCcccCCCCeEEEeCCCCcceeecc------------eeeccccCCccceeeecCCCCCccceEEecCCCCCC------ Q lcl|NC_019442. 78 RSPIAQDPHGRIYYTDGRFPKVTDAT------------IATKGDGNHPASSYSLGIPAPTTAPVCTVQQGGDVS------ 139 (541) Q Consensus 78 ~spia~D~~~Rvy~t~~~~pk~t~~~------------ia~~g~g~~p~a~y~LGVp~P~~~pv~~v~~~~~~~------ 139 (541) ..| .+|++.-..|+-+..+ +.-...+.| +-.|.+.+..-... ...+..+.... T Consensus 117 ----~aD---~~fi~n~~~~~~~~~~~~~~~~~~~~~~~~~i~~g~y-~~~y~i~i~g~~~a-t~~t~~~s~a~~s~~~~ 187 (785) T protein:vir:94 117 ----VAD---YTFVVNRKVVVKGGSEKSHSGYNRKARALINLRGGQY-GRTLKVGINGGVKV-SHKLPAGNDAENDPPKV 187 (785) T ss_pred ----eCC---EEEEEcCCcceeeeeccCCcCCCCCCceEEEeccccc-ceeEEEeeCCccee-EEEEccCcccccccccc Confidence 111 3333332222111000 000111222 22233333211100 00000000000 Q ss_pred ----------------CCCCC--cccceEEEEEEEec----CCCccCcccc-ccceeecC-CCCEEEEccccCCCCcccc Q lcl|NC_019442. 140 ----------------DDNPN--DDETRFYTETFVSD----YGEEGPPGPA-SLEVTLRT-PGTAVQLTLSPVPLQNASI 195 (541) Q Consensus 140 ----------------~~~~~--~~~ty~Yv~T~V~~----~GeEs~Ps~~-S~~vtv~~-~g~~v~l~~~p~~~~~~~i 195 (541) ..+-+ ..+.+.|+. ... +.-+...+-. +....+.. -++. -.+++..++|. + T Consensus 188 s~~~i~~~l~~~l~a~~t~~t~~~~g~~i~i~--a~s~t~~~~~s~~~~~~~t~~~~~~~~~~~~--~~Lp~~~~~G~-~ 262 (785) T protein:vir:94 188 DAQAIGAALRDLLVTAYPTFTFDLGSGFLLIT--APSGTDINSVETEDGYANQLISPVLDTVQTI--SKLPLAAPNGY-I 262 (785) T ss_pred chHHHHHHHHHHhhccccceeEEecCcEEEEE--ecCCccccceeeecccCCeEEEEEEeeccce--eccccccCCCC-E Confidence 00000 011122221 111 1111111100 00000000 0111 12222222221 1 Q ss_pred ceEEEEEeecC-CCceeEEEEEeeccce----------EEEEeccc------ccccCccchhhhhhCC----------CC Q lcl|NC_019442. 196 KRRRIYRSASG-GGEADFLLVAELDASV----------LSYTDKIP------GKNLGPSLATWDYLPP----------PE 248 (541) Q Consensus 196 ~~~RIYRs~t~-~~~~~~~lVael~~~~----------~sf~D~~~------~~~L~~~L~t~~~~~p----------P~ 248 (541) ++| +.++ .....|++-.+...+. ..+....- +..-.-.+...+|... |. T Consensus 263 --v~v--~~~~~~~~~~y~v~~~~~~g~w~e~~~~g~~~~~~~~tmp~~l~~~~~~~~~~~~~~w~~r~~Gd~~tnp~ps 338 (785) T protein:vir:94 263 --IKI--QGETNSSADEYYVMYDSNTKTWKETVEPGVVTGFDNTTMPHALVRQSDGSFEFKALDWSKRGAGNDDTNPMPS 338 (785) T ss_pred --EEE--EccCCCCccceEEEEEcCCceEEEecccceeeeeeccccceEEEeccCCceEEeccccccccCCCcccCCcce Confidence 111 1111 1222333322221111 00000000 0000001222233321 11 Q ss_pred -Cc---ceEEeccCcEEEEEeCCEEEEecCCCcccC-chh-cc------ccc--C----cceEEEEEcCCcEEEEEcCCE Q lcl|NC_019442. 249 -NM---TGLCLMANGIAAGFAGNEVMFSEAYLPYAW-PEV-NR------HTT--A----EDIVAICPLGTSLVVATKGEP 310 (541) Q Consensus 249 -~~---~gL~~m~NGi~a~f~Gn~l~fSep~~P~aw-p~~-y~------~t~--~----~~Iv~ia~v~~~lvV~T~~~p 310 (541) .. +.+ .|.++++.-..++.||+|.+..++.| +.. .. +.+ . ..|.-+.+++..|+++|++.- T Consensus 339 f~g~~~~~v-~f~q~RL~f~~~~~v~~Srtgd~~nF~~~t~~~~~DdD~i~~~~~~~~~~~i~~~v~~~~~L~l~T~~~e 417 (785) T protein:vir:94 339 FVDATINDV-FFYRNRLGFLSGENVIMSRSASYFAFFPKSVATLSDDDPIDVAVSHPRISILKYAVPFSEQLLLWSDEVQ 417 (785) T ss_pred ecccccceE-EEEeceEEEecCCeEEEEccCCcccCccccccCCCCCccEEEEecCCcceeeEEEeecCCcEEEEecCcE Confidence 01 223 45566665556999999999999997 321 11 111 1 336668899999999999999 Q ss_pred EEEEccC---cccceEEeecccccccccchheeCCccEEEecCCcEE------EEeCCCceEEEecccCChhHhhhhcCc Q lcl|NC_019442. 311 YLFSGVS---PSTISGSRIPSMQACLSRRSMVAMEGFVLYAGTNGLV------SVDVNGNTALATEKIISPEQWQSQFNP 381 (541) Q Consensus 311 y~l~G~~---p~s~~~~~l~~~~pCvs~rsiv~~~~~v~y~s~dGLv------~~~~~G~~~~vT~~~~~~~~W~~~l~P 381 (541) |.|+|.+ |.+..+.+ ...-.|.+.=..+.+|+.++|+++.|=. ...+...-.-.+.++ |.--= .+=| T Consensus 418 ~~l~~~~~lTP~~~~~~~-~s~~~~~~~~~Pv~vg~~v~f~~~~g~~~~v~r~~~~~~~~d~y~~~dl-t~~~~--~~~~ 493 (785) T protein:vir:94 418 FVMTSSGVLTSKSIQLDV-GSEFALGDNARPFAVGRSVFFSAPRGSFTSIKRYFAVADVSDVKDADDT-TGHVL--SYIP 493 (785) T ss_pred EEEcCCCcccceeEEEEE-EEeeeccCCCCceEeCCeEEEEecCCCeeEEEeeeeecccccceehhhH-HHHHH--HhcC Confidence 9998854 55544332 3334577767788999999999997721 111111000111111 11000 0111 Q ss_pred c---eEEEEEEcCeEEEEEecCCCccceEEEc--cCCc---eeEEEeec--ccEEEEEecCCEEEEEE---C-CEEEEec Q lcl|NC_019442. 382 A---SIVAYSWRGEYIACYTKPDGKQDVFVFS--PVNM---DIRYLSTP--FDCAWVDLAKDMMRVVT---G-DKMSVLA 447 (541) Q Consensus 382 ~---ti~a~~~eG~Y~~~y~~~~g~~~~~i~d--~~~~---~~~~~~~~--~d~~~~~~~~d~LY~~~---g-~~i~~~~ 447 (541) . .+.|.+-+..=+.+.+..+|+.-++=|. -+.+ ...+++++ .-..+....+|.||++. | ..+...+ T Consensus 494 g~~~~~~a~~~~~~~~~~~~~~~g~l~~~~y~~~~~e~~v~aW~r~~~~~~~~~~~~~~~~d~~~~vv~r~~g~~~~~ie 573 (785) T protein:vir:94 494 NGVFDIQGTGTENYICVNSTGAYNRIYIYKFLFKDSVQLQASWSHWEFPKDDKILASASIGSTMFIVRQHQGGVDIEHLK 573 (785) T ss_pred CCcEEEEEecCCCcEEEEEEcCCCEEEEEEEeecCCceEEEEEEEEEeCCCeEEEEEEEeCCEEEEEEEcCCCEEEEEEE Confidence 2 1334444443344444444432222221 1112 12232322 12333345588999875 2 2233221 Q ss_pred ----CCCCc-eeEEE--Ecce-EEeC-------ccc-ceeEEEEeeC--CCccEEEEEEECCceeEeecccccCCcceEc Q lcl|NC_019442. 448 ----GGSLP-STIRW--HSKI-FSLP-------ERT-SFSCIRVKSP--APERVGITIMADDVPVIHFAPGTFKGSVVRL 509 (541) Q Consensus 448 ----~g~~~-~~~~W--rSk~-f~~~-------~~~-~~~~~~V~~~--~~~~~~v~~~~d~~~~~~~~~~~~~~~~~rL 509 (541) ....+ ..+++ -++. +..+ ... .+..-..+.. ..+-..+.+..||..+ ........+..++| T Consensus 574 ~~~~~~d~~~~~~~~~lD~~~~~~~~~~~~~~~~~~t~~~~~~~~~g~~~leg~~v~v~adG~~~-~~~~v~~~~~tl~~ 652 (785) T protein:vir:94 574 FIKEATDFPSEPYRLHVDSKVSMVIPIGSFNADTYKTTVDIGAAYGGNAPSPGRYYLIDSQGAYL-DLGELTSISTVITL 652 (785) T ss_pred eecccCCCCCcceeEEeeeeeEEEecCcceeccccccccccccccccCCccCCeEEEEeeCCcCc-cCceEcCCCcEEEe Confidence 11111 11110 1111 1111 100 0111111111 1122345556676432 22222223346666 Q ss_pred cCcc-------cce--EEEE-----EE------------ecceEEEEEeecc-hhhcCC Q lcl|NC_019442. 510 PAAT-------GQN--WQVM-----VS------------GFGQVERITLSTS-MSEMPV 541 (541) Q Consensus 510 P~~~-------~~~--w~ie-----i~------------g~~~V~~i~la~s-~~EL~~ 541 (541) |+.+ +.. .+|+ ++ |..+++|+.+.-- -.++-| T Consensus 653 ~g~~~~~~v~vGl~y~~~~~~~~~~~~~~~~~~~~~~~~gr~~l~r~~~~~~~sg~~~v 711 (785) T protein:vir:94 653 NGDWSGRTVFIGRSYLMSYKFSRFLIKIEDDSGTQSEDTGRLQLRRAWVNYRDTGALRL 711 (785) T ss_pred cCCCCCceEEEeeeeeEEEeecceeEEecCCCcccccccccEEEEEEEEEeecccceEE Confidence 6332 112 2222 11 3345555443221 122222 No 48 >protein:vir:7329 Length: 825 # NCBI annotation: hypothetical protein # Family: family:all:780 # MgeID: mge:143 # MgeName: epsilon15 # Cross-refs: genbank:acc:NP_848220;genbank:gi:30387391;genbank:GeneID:2641863 Probab=24.61 E-value=2.2 Score=18.75 Aligned_cols=473 Identities=15% Similarity=0.154 Sum_probs=154.9 Q ss_pred CceEEeccccccc------------ccccceecccccceEEEEeeecCCeeeeeecccccCccccccceeEEEECCcEEE Q lcl|NC_019442. 1 MPYIDITTMRGMM------------PRVVTSMLPEHSAVLAEDCHFRFGVITPERQISGVEKTFTIKPKTIFHYRDDFWF 68 (541) Q Consensus 1 m~~i~i~~f~G~~------------Pr~~p~llp~~~a~~a~N~~~~~G~l~P~~~~~~v~~~~~~~~~Tif~~~~~~W~ 68 (541) =-.|+|-..+|.+ |-..-+|-.-.++|.|-=.-+ +++ ...|++|-|+.++-|- T Consensus 84 ~~~lrv~~~gg~v~~~~~~~~e~~TPy~~~~l~~l~~~QsaD~~~i-------------~h~--~~pp~~L~r~~~~~W~ 148 (825) T protein:vir:73 84 HNYMRVIKDGAYVLTTSNVIYELAMPYADTDLFRIKFTQSADVLTL-------------VHP--AYPPKELRRYAHDNWQ 148 (825) T ss_pred CCeEEEEeCCceEeccCCceEEEecccchhhhhhheeeeecCEEEE-------------EcC--CCceeEEEEecCCCcE Confidence 1112222222211 111111111111111100000 011 2245667777666554 Q ss_pred EeCCeEEEeeCCc---ccCCCCeEEEeCC-CCcceeecceeeccccCCccceeeecCCCCCccceEEecCCCCCCCCCCC Q lcl|NC_019442. 69 AWPDVVDVIRSPI---AQDPHGRIYYTDG-RFPKVTDATIATKGDGNHPASSYSLGIPAPTTAPVCTVQQGGDVSDDNPN 144 (541) Q Consensus 69 ~w~~~V~vv~spi---a~D~~~Rvy~t~~-~~pk~t~~~ia~~g~g~~p~a~y~LGVp~P~~~pv~~v~~~~~~~~~~~~ 144 (541) .. .+.....|- +-|..-+++..+. +..++|... ..-+.-+.+...++.-....+-+..+ T Consensus 149 l~--~~~f~~gp~~~in~~~sv~v~asg~tg~~TiTaS~--a~~~~~~vG~~i~~~~~~v~si~~~~------------- 211 (825) T protein:vir:73 149 IV--DVTTKNGPFEDINVDETVKVYASASTGTITLTASS--AIFGAEQVGKLFYLEQPAVDSVPVWE------------- 211 (825) T ss_pred EE--EEeccCCccccccccccceeeecccCceeEEEeec--cccCchhcCeEEEEecccccccceee------------- Confidence 43 222222331 2233333443321 111111110 00000000011111111111110000 Q ss_pred cccceEEEE-EEEecCCCccCcccc--ccceeecC-CCCEE-EEccccCCCCccccceEEEEEeecCCCceeEEEE-E-- Q lcl|NC_019442. 145 DDETRFYTE-TFVSDYGEEGPPGPA--SLEVTLRT-PGTAV-QLTLSPVPLQNASIKRRRIYRSASGGGEADFLLV-A-- 216 (541) Q Consensus 145 ~~~ty~Yv~-T~V~~~GeEs~Ps~~--S~~vtv~~-~g~~v-~l~~~p~~~~~~~i~~~RIYRs~t~~~~~~~~lV-a-- 216 (541) ....|.. .+....+.+-..... +..+.... .+... .+.+.... +.++ .++.+|+. .+...+--+ . T Consensus 212 --~~~~~~~~~v~~~~~~~~~~~~~~~~~t~~~~a~~g~~~~~~~g~~~~--~~~~-~~~~~~~~--~g~~~it~~~~~~ 284 (825) T protein:vir:73 212 --TSKTTAINDVRRADSNYYRANTSGKTGTLRPSHTEGMSWDGWGGTGSD--DTGI-QWEYLHSG--FGIAKITAVAGDG 284 (825) T ss_pred --eeeEEEeeeEEECCCceeeeecccccceeeccccCCceeEeeeeeccc--CCce-EEEEEecC--CceEEEeeccccc Confidence 0001111 111111111110000 00000000 00000 11110000 0111 23333321 111111111 0 Q ss_pred -eeccceEEEEecccccccCccchhhhhhCCCCCcc----eEEeccCcEEEEE----eCCEEEEecCCCcccCchhcccc Q lcl|NC_019442. 217 -ELDASVLSYTDKIPGKNLGPSLATWDYLPPPENMT----GLCLMANGIAAGF----AGNEVMFSEAYLPYAWPEVNRHT 287 (541) Q Consensus 217 -el~~~~~sf~D~~~~~~L~~~L~t~~~~~pP~~~~----gL~~m~NGi~a~f----~Gn~l~fSep~~P~awp~~y~~t 287 (541) +..+....|.+... ....-.++.|...+..-. ..+.+.++++.-. ..+.||+|.+...+.|-.+--++ T Consensus 285 ~~~~~~~~~~~~~~~---~~~~~~t~~~~~~~~~~~~gyPs~v~f~q~RL~f~g~~~~p~~v~~Srtgd~~nF~~~~~~~ 361 (825) T protein:vir:73 285 LTATADVVSFIPSQV---VGSANASYKWAKYAWNSVNGYPSTVVYYQQRLYFAASTAYPQTIWASRTGDYKDFGKNNPIQ 361 (825) T ss_pred eeeccccceeccccc---ccCCCCCcccccCCcccCCCCccEEEEEcceEEEeecCCCCCEEEEEccCCccccccCCCCC Confidence 01122223333211 111222344544433211 2334555555433 46899999999999874333222 Q ss_pred cC------------cceEEEEEcCCcEEEEEcCCEEEEEcc-----CcccceEEeecccccccccchheeCCccEEEecC Q lcl|NC_019442. 288 TA------------EDIVAICPLGTSLVVATKGEPYLFSGV-----SPSTISGSRIPSMQACLSRRSMVAMEGFVLYAGT 350 (541) Q Consensus 288 ~~------------~~Iv~ia~v~~~lvV~T~~~py~l~G~-----~p~s~~~~~l~~~~pCvs~rsiv~~~~~v~y~s~ 350 (541) -+ ..|..+.+++ .|+++|++.-|.++|. +|.+..+. .....+|-. =..+.+|+.++|+++ T Consensus 362 DdD~I~~~~s~~~~~~i~~~~~~~-~L~~~t~~~e~~l~~~~~~~lTP~~~~~~-~~s~~g~~~-~~Pv~vg~~~~Fv~~ 438 (825) T protein:vir:73 362 DDDRIIYTYAGRQVNEIRHLIDVG-NLVALTSGGEYTISGDQNKVLTPSAFSFS-SQGNNGSSN-VPPIAVANIALFIQE 438 (825) T ss_pred CCccEEEEEcCCcceeEEEEeecC-cEEEEecCceEEEecCCCcccceeeEEEE-eeeeecccc-ccceEeCCeEEEEeC Confidence 22 3366778875 7999999999999875 45554433 345567743 356788999999999 Q ss_pred CcEE------EEeCCC----ceEEEecccCC---hhHhhhhcCcceE-EEEEEcCeEEEEEecCCCccceEEEccCCcee Q lcl|NC_019442. 351 NGLV------SVDVNG----NTALATEKIIS---PEQWQSQFNPASI-VAYSWRGEYIACYTKPDGKQDVFVFSPVNMDI 416 (541) Q Consensus 351 dGLv------~~~~~G----~~~~vT~~~~~---~~~W~~~l~P~ti-~a~~~eG~Y~~~y~~~~g~~~~~i~d~~~~~~ 416 (541) .|=. ....++ ..+++...||. -.+|...=+|.++ +...- +|+.-++-|+...+.. T Consensus 439 ~g~~vre~~~~~~~d~~~~~dlt~~a~hl~~~~~~~~~a~~~~p~~~~~~v~~-----------dg~l~~~ty~~~q~v~ 507 (825) T protein:vir:73 439 KGSVVRDLAYSFDVDGYQGTDLTILANHLFQKHSIVDWSFCIVPYSSAFCIRD-----------DGKLLVLTYLRDQQVF 507 (825) T ss_pred CCCeEEEEEEeeecCceeccchhhhhHhhccCCceEEEEEcCCCceEEEEEec-----------CCeEEEEEEeccccce Confidence 8811 111111 11122233332 1233222234443 33333 4443344455433322 Q ss_pred EEEeeccc------EEEEEecCCEEEEEECCE-----E---EEecCCCCceeEEEEcceEEeCcccce-------eEEEE Q lcl|NC_019442. 417 RYLSTPFD------CAWVDLAKDMMRVVTGDK-----M---SVLAGGSLPSTIRWHSKIFSLPERTSF-------SCIRV 475 (541) Q Consensus 417 ~~~~~~~d------~~~~~~~~d~LY~~~g~~-----i---~~~~~g~~~~~~~WrSk~f~~~~~~~~-------~~~~V 475 (541) -|..-.++ |+..+-..|.||++..+. + -.++...... ....|.+...+++ +..-+ T Consensus 508 aW~~~~~~g~v~~~~~i~~~~~D~l~~iV~R~~~g~~~~yiE~~~~~~~~~----~~~~~~vD~g~~~~g~~~~~~l~~l 583 (825) T protein:vir:73 508 AWAPQSSAGKYESTCSISEGSEDAVYFVVNRTINGQTVRYIERLSSRLFTN----DEDAFFVDCGLSYDGRNTSSRTMTI 583 (825) T ss_pred eeEEEecCCcEEEEEEecCCCccEEEEEEEEeeCCceEEEEEEecccccCC----CcceeEEEEEeeecccceeeceeee Confidence 23232233 222233467999975322 1 1121110000 0112222221111 11111 Q ss_pred eeCCCccEEEEEEECC--------ceeEeecccccCCcceEccCc---------ccceEEEEEEecc--eEEEEEeecch Q lcl|NC_019442. 476 KSPAPERVGITIMADD--------VPVIHFAPGTFKGSVVRLPAA---------TGQNWQVMVSGFG--QVERITLSTSM 536 (541) Q Consensus 476 ~~~~~~~~~v~~~~d~--------~~~~~~~~~~~~~~~~rLP~~---------~~~~w~iei~g~~--~V~~i~la~s~ 536 (541) ++ -.+.+.+|+ +.+..-... ..-++||-. .++.-...+.+.. .+.++...... T Consensus 584 ~g-----~tv~~~~~g~~~~~v~~g~itl~~~~---~~~i~l~~~~~~~~~~~~~~~~~~~~i~~~~~~~~v~v~~~~~~ 655 (825) T protein:vir:73 584 SG-----GTGDWSYQVDYPVTVSGGAYFVNTDV---GAQIQFPYTGTDPDTNEPVAKELRGDIISVTSNTAVVVRFNRNV 655 (825) T ss_pred CC-----ceEEEEeCCeEEEEEcCCeEEecccc---eEEEEecccCcccccccceeceeeEEEccccCceEEEEEecccc Confidence 11 112222332 221000000 012344422 1122222232222 22222221111 Q ss_pred hhc----CC Q lcl|NC_019442. 537 SEM----PV 541 (541) Q Consensus 537 ~EL----~~ 541 (541) ++- +. T Consensus 656 ~a~~~~~~~ 664 (825) T protein:vir:73 656 PPVLRNVAT 664 (825) T ss_pred cceeeeecc Confidence 110 00 No 49 >protein:vir:352 Length: 536 # NCBI annotation: hypothetical protein # Family: family:all:3197 # MgeID: mge:9 # MgeName: Mx8 # Cross-refs: genbank:acc:NP_203466;genbank:gi:15320622;genbank:GeneID:921729 Probab=23.13 E-value=2.3 Score=18.55 Aligned_cols=416 Identities=13% Similarity=0.093 Sum_probs=150.4 Q ss_pred CCeEEEeeCCcccCCCCeEEEeCCCCcceeecceeeccccCCccceeeecCCCCCccceEEecCCCCC-CCCCCCcccce Q lcl|NC_019442. 71 PDVVDVIRSPIAQDPHGRIYYTDGRFPKVTDATIATKGDGNHPASSYSLGIPAPTTAPVCTVQQGGDV-SDDNPNDDETR 149 (541) Q Consensus 71 ~~~V~vv~spia~D~~~Rvy~t~~~~pk~t~~~ia~~g~g~~p~a~y~LGVp~P~~~pv~~v~~~~~~-~~~~~~~~~ty 149 (541) --.|.+.|-|.. ...+..-+|+|....+...+-..-. .+.-.+ ++. T Consensus 1 ~~~~~a~r~~~~-------------------------------~~~~~~~~pAPv~G~~t~~~~A~m~~~~A~vl--dN~ 47 (536) T protein:vir:35 1 MMPLRARRVPPP-------------------------------PSIQEAHLPAPVGGLNTVSAGSAMPVSDCLQG--FNL 47 (536) T ss_pred CCccccccCCCC-------------------------------ccceeeeeCccccceeccchhhcCCCCceEEE--eec Confidence 112222222211 1112222444433222211100000 000000 011 Q ss_pred EEEEEEEec-CCC----ccCcccccc---------------ceeecCCCCEEEEccccCCCCccccceEEEEEeecCCCc Q lcl|NC_019442. 150 FYTETFVSD-YGE----EGPPGPASL---------------EVTLRTPGTAVQLTLSPVPLQNASIKRRRIYRSASGGGE 209 (541) Q Consensus 150 ~Yv~T~V~~-~Ge----Es~Ps~~S~---------------~vtv~~~g~~v~l~~~p~~~~~~~i~~~RIYRs~t~~~~ 209 (541) +-.-+.|+. +|. +.-+.|+.. .+.+. +-.+-+++.+-.|+ ..-+ + --.. .+.+ T Consensus 48 fpt~~g~r~R~G~~~~at~~~~~v~s~~~~~~~~~~Ga~~klf~at-~~~i~dvT~pa~p~-~~~~--~--~g~~-~g~~ 120 (536) T protein:vir:35 48 IASELGLRSRLGYREWCTGLGVPARSTLPFAGSAKSGAANRLFQTT-SEGIWDVSASSQTP-TQVL--T--FGDQ-TGDA 120 (536) T ss_pred CCChhhhhhhccchhHhcCCccceEEeeeeeeccccCcceeEEEec-ccceeeeecCCCCc-ceEE--E--eccC-CCce Confidence 111111111 010 111111111 11111 01112222211111 0000 0 0000 0001 Q ss_pred eeEEEEEee-ccceEEEEecccccccCccchhhhhhC-------------CCCCcceEEeccCcEEEEEeCCEEEEecCC Q lcl|NC_019442. 210 ADFLLVAEL-DASVLSYTDKIPGKNLGPSLATWDYLP-------------PPENMTGLCLMANGIAAGFAGNEVMFSEAY 275 (541) Q Consensus 210 ~~~~lVael-~~~~~sf~D~~~~~~L~~~L~t~~~~~-------------pP~~~~gL~~m~NGi~a~f~Gn~l~fSep~ 275 (541) ..+..|.-- ..+...|.-|..+...---+.+..|.. -|.++.+|+++ -|.|||-|-. T Consensus 121 ~~w~~v~~~~~gG~~l~~~nG~~~~~~~~gt~~~w~~v~~~t~~~~i~Gv~~~~l~~i~~~---------knRLffvq~~ 191 (536) T protein:vir:35 121 GFGVSHAFVTQRGHFLFYADETNGLFRYSESTDTWTAVAQGTGVGEIDGVNPANIVFVAVF---------KQRVWLVERD 191 (536) T ss_pred eeEEEEEecCCCceEEEEEEcCCCceEeecccCchhhcccCCcccccCCCCcccceeeeeE---------eeeEEEEEeC Confidence 111111111 112222222222222111111111221 23344555555 4555554333 Q ss_pred Cccc--------------CchhcccccCcceEEEEE--c-------CCcEEEEEcCCEEEEEccCcccc---eEE---ee Q lcl|NC_019442. 276 LPYA--------------WPEVNRHTTAEDIVAICP--L-------GTSLVVATKGEPYLFSGVSPSTI---SGS---RI 326 (541) Q Consensus 276 ~P~a--------------wp~~y~~t~~~~Iv~ia~--v-------~~~lvV~T~~~py~l~G~~p~s~---~~~---~l 326 (541) .=-+ +|..-.++..+-.+++.. . +--.||-++|.+.+.+|++|++- ++. |+ T Consensus 192 s~~awYLp~~av~G~A~~f~lg~~~~~GGsL~~~~sWS~~~G~Gl~d~~VfvSs~GeVaVyqGsdPs~s~~Wsl~giy~I 271 (536) T protein:vir:35 192 TARAWYLPAGAIAGTAQPFEMGAQFRAGGHLVGLWNWTYDGGAGMDDSLVAISGGGDVAIWQGTDPASSATFGLRGVWSL 271 (536) T ss_pred CceEEEeecccccceeeeeeccCccccCceEccceeeccccCCCcceeEEEEecCCcEEEEecCCCCcccceeEEEEEEe Confidence 2222 344434444444444432 1 12347888999999999999753 222 33 Q ss_pred cccccccccchheeCCccEEEecCCcEEEEeCCCc-----eEEEecccCChhHhhhhcC--cceE--EEEEE--cCeEEE Q lcl|NC_019442. 327 PSMQACLSRRSMVAMEGFVLYAGTNGLVSVDVNGN-----TALATEKIISPEQWQSQFN--PASI--VAYSW--RGEYIA 395 (541) Q Consensus 327 ~~~~pCvs~rsiv~~~~~v~y~s~dGLv~~~~~G~-----~~~vT~~~~~~~~W~~~l~--P~ti--~a~~~--eG~Y~~ 395 (541) ....| +-+|.++..++-++.++.|||+-++.-=+ -+.||+.|- +.|+..++ +.+. ....| ++++|- T Consensus 272 G~~pp-~G~r~~i~~G~Dl~iit~dGivplsq~~q~d~~a~~~it~~I~--~~~~~~v~~~a~~~gWq~~~~P~~n~liV 348 (536) T protein:vir:35 272 GGSPP-AGRRIATDYGGDVLVLSRLGVRPLSRLVAGEVDKDTYVTAKVS--NLFSALMLTRASLPGWSMQLHPEDNALLV 348 (536) T ss_pred ccCCC-CCceEEEeecCeeEeeecCCccchhhhhhhhhhcccCCCccch--hhHHHHHhhccCCCccEEEEccCCCeEEE Confidence 32223 45899999999999999999998765210 112444443 24643222 3322 23333 444444 Q ss_pred EEecCC-CccceEEEccCCceeEEEeecccEEEEEecCCEEEEEE-CCEEEEecC---CCCc------eeEEEEcce--- Q lcl|NC_019442. 396 CYTKPD-GKQDVFVFSPVNMDIRYLSTPFDCAWVDLAKDMMRVVT-GDKMSVLAG---GSLP------STIRWHSKI--- 461 (541) Q Consensus 396 ~y~~~~-g~~~~~i~d~~~~~~~~~~~~~d~~~~~~~~d~LY~~~-g~~i~~~~~---g~~~------~~~~WrSk~--- 461 (541) -....+ .....|+++..++.--++. +|++.-+.+..|+||+-. ++.+|++++ |++. .++.|.=+. T Consensus 349 ~~P~~~g~~~~~fV~N~~tgaW~~ft-gw~a~C~~v~~~~LyFG~~dG~v~~~da~v~g~D~~~~~ag~~I~~~~~~af~ 427 (536) T protein:vir:35 349 TVPTYPGQPTEQLVMALAGRAWFRYR-DLPIYSSAVWGGKLYFGTVDGRVCVNDGYVDGVLLSEPSAFTPVQWSLLSAFT 427 (536) T ss_pred EccCCCCCCceEEEeecccCceeeec-CCcceEEEEecCeEEEeecCCEEEecccccCccccccCcCcceeeeccccchh Confidence 332222 2345678887777665543 566665567889999965 799999885 3211 223332111 Q ss_pred -EEeCcccceeEEE---EeeCCCccEEEEEEECCceeEeeccccc--------CCc---ceEccCc--ccceEEEEEEec Q lcl|NC_019442. 462 -FSLPERTSFSCIR---VKSPAPERVGITIMADDVPVIHFAPGTF--------KGS---VVRLPAA--TGQNWQVMVSGF 524 (541) Q Consensus 462 -f~~~~~~~~~~~~---V~~~~~~~~~v~~~~d~~~~~~~~~~~~--------~~~---~~rLP~~--~~~~w~iei~g~ 524 (541) |-.|+.-++.-+| +.......+.+...+|=. ....+.... |+. .+.-.++ -.++|+= +.|. T Consensus 428 ~~G~~~~K~~~~~r~~~~s~~~~p~l~l~~~~d~D-~~~p~~~~~~~~~~~~Wd~s~Wd~~~Ws~~~~v~~~~~s-~~g~ 505 (536) T protein:vir:35 428 NLGSARQKQVQLLRPTLLSESATPSYEVQARYRYD-FAELAPVSAMGGGSGTWDGSTWDVDVWSGEYQASQQVRG-GTGV 505 (536) T ss_pred hcCchHHHHHHHhhhhhhhccCCceEEEEEEEEec-cCCCCCcCCCCCCcccCCcccCCceecCCcceeEeeeeE-eccc Confidence 1111111111111 111122222232222100 000000000 000 0011111 1122332 3333 Q ss_pred ceEEEEEe-ecchhhcCC Q lcl|NC_019442. 525 GQVERITL-STSMSEMPV 541 (541) Q Consensus 525 ~~V~~i~l-a~s~~EL~~ 541 (541) +.--.+.| |++..|.-+ T Consensus 506 G~~is~~~~g~a~~~~~~ 523 (536) T protein:vir:35 506 GVDLAIAIRGTAVARTVL 523 (536) T ss_pred eEEEEEEEeeccccceEE Confidence 32222222 222222222 Done!