Query         lcl|NC_016163.1_cdsid_YP_004934394.1 [gene=g160] [protein=hypothetical protein] [protein_id=YP_004934394.1] [location=complement(104512..107148)]
Match_columns 878
No_of_seqs    64 out of 73
Neff          1.6
Searched_HMMs 1612
Date          Thu Nov 7 15:45:07 2013
Command       /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_161 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_161_vs_rec_db.hhr

 No Hit                               Prob  E-value  P-value  Score    SS  Cols  Query HMM  Template HMM
  1 protein:vir:101806 Length: 516   100.0   5E-123   3E-126  691.0  18.9   404  341-816    1-516 (516)
  2 protein:vir:101189 Length: 516   100.0   5E-123   3E-126  691.0  18.9   404  341-816    1-516 (516)
  3 protein:vir:100598 Length: 516   100.0   4E-122   2E-125  686.4  19.2   406  341-816    1-516 (516)
  4 protein:vir:6896 Length: 523 #   100.0   1E-122   8E-126  688.7  14.4   443  310-819    1-523 (523)
  5 protein:vir:98265 Length: 524    100.0   3E-122   2E-125  686.8  16.3   422  341-819    1-524 (524)
  6 protein:vir:108049 Length: 524   100.0   4E-122   3E-125  686.1  14.8   445  306-819    1-524 (524)
  7 protein:vir:103177 Length: 533   100.0   4E-122   2E-125  686.3  14.3   446  343-878    1-533 (533)
  8 protein:vir:106282 Length: 521   100.0   1E-121   9E-125  683.2  16.3   419  318-819    1-521 (521)
  9 protein:vir:104500 Length: 537   100.0   2E-120   1E-123  677.4  21.9   489  222-878    1-536 (537)
 10 protein:vir:104892 Length: 558   100.0   1E-121   6E-125  684.0  15.0   475  290-861    1-558 (558)
 11 protein:vir:103458 Length: 524   100.0   1E-121   7E-125  683.6  14.7   433  303-819    1-524 (524)
 12 protein:vir:7208 Length: 524 #   100.0   1E-121   8E-125  683.3  14.7   433  303-819    1-524 (524)
 13 protein:vir:81017 Length: 521    100.0   3E-121   2E-124  681.3  13.4   436  325-819    1-521 (521)
 14 protein:vir:6596 Length: 521 #   100.0   4E-121   3E-124  680.4  13.3   440  325-819    1-521 (521)
 15 protein:vir:106999 Length: 564   100.0   1E-120   7E-124  678.1  14.6   483  290-878    1-560 (564)
 16 protein:vir:5665 Length: 511 #   100.0   3E-120   2E-123  676.2  13.4   401  303-816    1-511 (511)
 17 protein:vir:5839 Length: 533 #   100.0   1E-107   6E-111
607.2 18.4 472 251-878 1-524 (533) 18 protein:vir:103219 Length: 201 67.6 0.094 5.8E-05 26.2 5.2 179 588-815 1-201 (201) 19 protein:vir:107662 Length: 427 23.5 2.3 0.0014 18.6 14.7 339 391-815 1-427 (427) 20 protein:vir:80040 Length: 461 21.1 2.7 0.0017 18.3 19.4 414 231-834 1-461 (461) No 1 >protein:vir:101806 Length: 516 # NCBI annotation: gp20 # Family: family:all:1036 # MgeID: mge:1580 # MgeName: 31 # Cross-refs: genbank:acc:YP_238883;genbank:gi:66391958;genbank:GeneID:3416633 Probab=100.00 E-value=5.2e-123 Score=690.97 Aligned_cols=404 Identities=17% Similarity=0.291 Sum_probs=278.6 Q ss_pred ccccccccc-------------------ccccccccCCCC--CCCCCCc--------------------------c-cCC Q lcl|NC_016163. 341 MNISGFFGS-------------------AVSPFNATGEGN--TQPGSNR--------------------------K-LAD 372 (878) Q Consensus 341 ~~~~~~~~~-------------------~~~~~~~~~~~~--~~~~~~~--------------------------~-~~~ 372 (878) ||++-+||- .+.|-++.|.-- +|++..- . ..+ T Consensus 1 ~~~~~lf~f~~~~d~~~~~~~~~~~~~s~~~p~~~dGa~~i~~~~~~~~~~g~~~~~~~~~~~~~~~~eLI~~YR~ma~~ 80 (516) T protein:vir:10 1 MKFLDLFKFWDRVDQNEYDERLKLGHESIATPKKDDGATEIETREGEATYNAVMQQFFGIDNNISGTKDLINTYRQLINN 80 (516) T ss_pred CCchHhcccccchhhhHHhhhhcCCcCcccCCCCCCCceeeecCCCcccccceeeeeeccccccchHHHHHHHHHHHhhc Confidence 666666653 344544443310 1111100 0 112 Q ss_pred chhhhhhhhhcCCcchhhheecCCCceEEeeecc-----c--------------c----------------eeeeeeE-e Q lcl|NC_016163. 373 PEKEKILKTKLGGNEKAIVKRISPGNIVDLTFED-----N--------------I----------------LGYLYLD-I 416 (878) Q Consensus 373 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~--------------~----------------~~~~~~~-~ 416 (878) ||-+..+.--. | .|||-. 
.-+++|.|.+++ + + -|-+|+- | T Consensus 81 pEvd~Av~eIV--n-eaiv~d-~~~~pV~l~L~~~~~s~~ik~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fhKi 156 (516) T protein:vir:10 81 PEVERAVANIV--N-EAIVYE-RGHKVVSLDLDDTDFGSNVKEKILEEFDEVCRLLDASRKLDTLFRRWYVDSRIFFHKI 156 (516) T ss_pred cchhhHHHHhh--c-ceeEec-CCCceEEEEecccCcchHHHHHHHHHHHHHHHHhccchhhhHHHhhhhhcceEEEEEE Confidence 22221111100 0 122222 233445444422 1 1 1223333 2 Q ss_pred eEecCC-CcccCcccccCCCC-ceeecccccccchhHHHH----hhhcCccccccCCCccccccccccCCCCceeccccc Q lcl|NC_016163. 417 VEVDPD-GTTMPSDKVDNGNE-GYTFMPSQSVGNGNVLQN----MVYSGKDIPIDGNGGHSSGAKNVENPNGQRLDVADD 490 (878) Q Consensus 417 ~~~~~~-~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 490 (878) +. +|. |... ---+|.-+- -+...|.....+..|..+ .+|.-.+-- -+-+|+- T Consensus 157 id-~~k~GI~E-lr~lDPr~i~~vR~i~~~~~~~~~v~~~~~e~~~Y~~~~~~--------------~~~~g~~------ 214 (516) T protein:vir:10 157 MP-NPKKGIAE-LRRLDPRFMEYYREIVTSDIGGTTIVKGYREFFIYTTGNEG--------------YSYNGRI------ 214 (516) T ss_pred ec-Ccccccee-eeeeCCcceeeEeeecccccccchhhhhhhheeeeccCccc--------------cccccce------ Confidence 21 211 1110 001111100 011112222222333333 222210000 0111111 Q ss_pred hhHHHHHHHHHhhcCCcchhhhhhhhhhhhhhhhhhhhhhhhhhhccceeEEechHHHHhhhcCchhhhhhhhhhhhhhh Q lcl|NC_016163. 491 ARLQFLAAAFANRLSDESNIKLIKKSATIKQAIYNTLSIRKLTRKDKVRVIYLKPEEVVMINRGHSIFDNILFFAKIYIT 570 (878) Q Consensus 491 ~~~~~~~~~~~~~l~~~~~IKl~K~S~T~~~~~~~~l~~~~L~~k~K~RVi~~~peeIv~In~G~sI~dnil~~skiylt 570 (878) .+..+.|||.+++|| .|.|||.+.++.-+++|+|.+|+..||.+|++| T Consensus 215 -------------~~~~~~ikI~~dAI~--------y~hSGL~d~~~~~i~syLhkAiKp~NQLkm~ED----------- 262 (516) T protein:vir:10 215 -------------FEPNTRIKIPRSAVV--------YASSGLMDCSDRGIIGYLHNAVKPANQLKLLED----------- 262 (516) T ss_pred -------------eCCCcceeechhhee--------eecccceeCCCCceeeeehhhhHhHHhhHHHHh----------- Confidence 122355899999999 999999999999999999999999999999999 Q ss_pred hHHHHHHHHHhcCccceEEEEecC-CCChhHHHHHHHHH-hhcceeeecccCceeccccchhhhhhhcccccccCCCCce Q lcl|NC_016163. 
571 TLLTLLMQNVLRGAPKRAVYVEVG-LDNNPANATQQAIR-DVKSKEISSITNMDMQSIINYVGEFQDYYIPVVDGEKPIT 648 (878) Q Consensus 571 TllSLVIYRITRAPERRVFYIDVG-LPK~KAEQYMrdI~-kyKNKLVYDAsTGEVRDDrk~MSMLEDYWLPRREGGRGTE 648 (878) +|||||||||||||||||||| |||.||||||++|| +||||+|||++||+|+|++++||||||||||||||||||| T Consensus 263 ---AlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl~~im~k~kNklvYDa~TGev~ddrk~msMlEDyWLpRReGgrgTE 339 (516) T protein:vir:10 263 ---AMVIYRITRAPERRVFYIDVGNMNNRKATEYVNGIMQSLKNRVVYDSNTGTVKNQKRNLSMTEDYWLMRRDGKSVTE 339 (516) T ss_pred ---hHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeEEeCCCCeeccchhhhhhHhhhcccccCCCCccc Confidence 999999999999999999999 99999999999985 8999999999999999999999999999999999999999 Q ss_pred eeecCcccccccchHHHHHHHHHHHHHcCCCchhc---------------cccccccHHHHHHHHHHHH---HHhhhHhh Q lcl|NC_016163. 649 FETIDALDAKSLDDDFLNWLSNNIFSGMGIPSAYL---------------TEVENVDFAKTLSMQNSRF---IRDIIGDQ 710 (878) Q Consensus 649 ISTLpGqNlgei~DDvVeYFqKKLYrALNVPvSRL---------------ITRDELKFsKFI~RLR~RF---F~DlLRtQ 710 (878) |+||||+++++++|| |+||++|||+|||||+||| |||||+||+|||.|||+|| |.++||+| T Consensus 340 ItTLpGgqnlgem~D-V~YF~kkLy~aLnVP~sRl~~e~~~~~~~Gr~~EItRDEiKF~KFI~rLR~rFs~lf~~~L~~q 418 (516) T protein:vir:10 340 VSSLPGAQTMGDMDD-VRWFNKKLYEALRIPLSRIPRDDGGMVIGGQDTAITRDELDFRKFVVQLQHDFEEIFLDPLKTN 418 (516) T ss_pred eeeccccCCcChHHH-HHHHHHHHHHHhCCCcccccCCCCceeeccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 999976655555555 9999999999999999999 9999999999999999999 99999999 Q ss_pred ccchhhhhh--hhccccccccccccchhHHHHHHHhhhhHhhhhhhhhhhhhhhhHhhcchhhhheecCcccccchhhhh Q lcl|NC_016163. 
711 VILSKGYTE--LVRKIYNLNFKSNEVDDKSDPAKDEILTKNAKSYTDTQSLAKAAIKYFDINNISVKFPSPASLNMNNLS 788 (878) Q Consensus 711 LILKKiiTE--~IRniyn~nFksnEVdd~selkk~EILt~riksytd~Qsla~aa~KYFdv~yis~K~l~~t~l~m~eld 788 (878) ||||++||+ |-.-.-++.|.....+++.|++..|||..|+..+ +.++|+++|||+++|+++++|++++-+|.+.+ T Consensus 419 LilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l---~~~dpyvGky~s~~yi~k~ILr~tDeei~~e~ 495 (516) T protein:vir:10 419 LIYKRIITEDEWDEQINNIKVNFHQDSYYTELKDIETLRLRVDAL---SQIEPYVGKYVSHDYVMKNILQMTEEQIAQEE 495 (516) T ss_pred hhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHH---HHhhhhhccccchHHHHHHHhcCCHhhHHHHH Confidence 999999995 4332234555556679999999999999999988 78899999999999999999888776665555 Q ss_pred HHHHHHhhhhhhhhcCccccccCCcchH Q lcl|NC_016163. 789 EQISNVNNFVTTLTENLTFDDTIPQDDQ 816 (878) Q Consensus 789 eQIs~~~~~v~~~~~N~~F~~~~~QDDQ 816 (878) +| |.++..++.|.+.-+++|= T Consensus 496 k~-------I~~E~~~~~~~~p~~~~~f 516 (516) T protein:vir:10 496 KQ-------IEQEAGIKRFQNPENEDDF 516 (516) T ss_pred HH-------HHHhhhCCCCCCCCccccC Confidence 55 5678889999865444333 No 2 >protein:vir:101189 Length: 516 # NCBI annotation: portal vertex protein of head # Family: family:all:1036 # MgeID: mge:1582 # MgeName: 44RR2.8t # Cross-refs: genbank:acc:NP_932511;genbank:gi:37651637;genbank:GeneID:2610682 Probab=100.00 E-value=5.2e-123 Score=690.97 Aligned_cols=404 Identities=17% Similarity=0.291 Sum_probs=278.6 Q ss_pred ccccccccc-------------------ccccccccCCCC--CCCCCCc--------------------------c-cCC Q lcl|NC_016163. 341 MNISGFFGS-------------------AVSPFNATGEGN--TQPGSNR--------------------------K-LAD 372 (878) Q Consensus 341 ~~~~~~~~~-------------------~~~~~~~~~~~~--~~~~~~~--------------------------~-~~~ 372 (878) ||++-+||- .+.|-++.|.-- +|++..- . 
..+ T Consensus 1 ~~~~~lf~f~~~~d~~~~~~~~~~~~~s~~~p~~~dGa~~i~~~~~~~~~~g~~~~~~~~~~~~~~~~eLI~~YR~ma~~ 80 (516) T protein:vir:10 1 MKFLDLFKFWDRVDQNEYDERLKLGHESIATPKKDDGATEIETREGEATYNAVMQQFFGIDNNISGTKDLINTYRQLINN 80 (516) T ss_pred CCchHhcccccchhhhHHhhhhcCCcCcccCCCCCCCceeeecCCCcccccceeeeeeccccccchHHHHHHHHHHHhhc Confidence 666666653 344544443310 1111100 0 112 Q ss_pred chhhhhhhhhcCCcchhhheecCCCceEEeeecc-----c--------------c----------------eeeeeeE-e Q lcl|NC_016163. 373 PEKEKILKTKLGGNEKAIVKRISPGNIVDLTFED-----N--------------I----------------LGYLYLD-I 416 (878) Q Consensus 373 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~--------------~----------------~~~~~~~-~ 416 (878) ||-+..+.--. | .|||-. .-+++|.|.+++ + + -|-+|+- | T Consensus 81 pEvd~Av~eIV--n-eaiv~d-~~~~pV~l~L~~~~~s~~ik~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fhKi 156 (516) T protein:vir:10 81 PEVERAVANIV--N-EAIVYE-RGHKVVSLDLDDTDFGSNVKEKILEEFDEVCRLLDASRKLDTLFRRWYVDSRIFFHKI 156 (516) T ss_pred cchhhHHHHhh--c-ceeEec-CCCceEEEEecccCcchHHHHHHHHHHHHHHHHhccchhhhHHHhhhhhcceEEEEEE Confidence 22221111100 0 122222 233445444422 1 1 1223333 2 Q ss_pred eEecCC-CcccCcccccCCCC-ceeecccccccchhHHHH----hhhcCccccccCCCccccccccccCCCCceeccccc Q lcl|NC_016163. 417 VEVDPD-GTTMPSDKVDNGNE-GYTFMPSQSVGNGNVLQN----MVYSGKDIPIDGNGGHSSGAKNVENPNGQRLDVADD 490 (878) Q Consensus 417 ~~~~~~-~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 490 (878) +. +|. |... ---+|.-+- -+...|.....+..|..+ .+|.-.+-- -+-+|+- T Consensus 157 id-~~k~GI~E-lr~lDPr~i~~vR~i~~~~~~~~~v~~~~~e~~~Y~~~~~~--------------~~~~g~~------ 214 (516) T protein:vir:10 157 MP-NPKKGIAE-LRRLDPRFMEYYREIVTSDIGGTTIVKGYREFFIYTTGNEG--------------YSYNGRI------ 214 (516) T ss_pred ec-Ccccccee-eeeeCCcceeeEeeecccccccchhhhhhhheeeeccCccc--------------cccccce------ Confidence 21 211 1110 001111100 011112222222333333 222210000 0111111 Q ss_pred hhHHHHHHHHHhhcCCcchhhhhhhhhhhhhhhhhhhhhhhhhhhccceeEEechHHHHhhhcCchhhhhhhhhhhhhhh Q lcl|NC_016163. 
491 ARLQFLAAAFANRLSDESNIKLIKKSATIKQAIYNTLSIRKLTRKDKVRVIYLKPEEVVMINRGHSIFDNILFFAKIYIT 570 (878) Q Consensus 491 ~~~~~~~~~~~~~l~~~~~IKl~K~S~T~~~~~~~~l~~~~L~~k~K~RVi~~~peeIv~In~G~sI~dnil~~skiylt 570 (878) .+..+.|||.+++|| .|.|||.+.++.-+++|+|.+|+..||.+|++| T Consensus 215 -------------~~~~~~ikI~~dAI~--------y~hSGL~d~~~~~i~syLhkAiKp~NQLkm~ED----------- 262 (516) T protein:vir:10 215 -------------FEPNTRIKIPRSAVV--------YASSGLMDCSDRGIIGYLHNAVKPANQLKLLED----------- 262 (516) T ss_pred -------------eCCCcceeechhhee--------eecccceeCCCCceeeeehhhhHhHHhhHHHHh----------- Confidence 122355899999999 999999999999999999999999999999999 Q ss_pred hHHHHHHHHHhcCccceEEEEecC-CCChhHHHHHHHHH-hhcceeeecccCceeccccchhhhhhhcccccccCCCCce Q lcl|NC_016163. 571 TLLTLLMQNVLRGAPKRAVYVEVG-LDNNPANATQQAIR-DVKSKEISSITNMDMQSIINYVGEFQDYYIPVVDGEKPIT 648 (878) Q Consensus 571 TllSLVIYRITRAPERRVFYIDVG-LPK~KAEQYMrdI~-kyKNKLVYDAsTGEVRDDrk~MSMLEDYWLPRREGGRGTE 648 (878) +|||||||||||||||||||| |||.||||||++|| +||||+|||++||+|+|++++||||||||||||||||||| T Consensus 263 ---AlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl~~im~k~kNklvYDa~TGev~ddrk~msMlEDyWLpRReGgrgTE 339 (516) T protein:vir:10 263 ---AMVIYRITRAPERRVFYIDVGNMNNRKATEYVNGIMQSLKNRVVYDSNTGTVKNQKRNLSMTEDYWLMRRDGKSVTE 339 (516) T ss_pred ---hHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeEEeCCCCeeccchhhhhhHhhhcccccCCCCccc Confidence 999999999999999999999 99999999999985 8999999999999999999999999999999999999999 Q ss_pred eeecCcccccccchHHHHHHHHHHHHHcCCCchhc---------------cccccccHHHHHHHHHHHH---HHhhhHhh Q lcl|NC_016163. 
649 FETIDALDAKSLDDDFLNWLSNNIFSGMGIPSAYL---------------TEVENVDFAKTLSMQNSRF---IRDIIGDQ 710 (878) Q Consensus 649 ISTLpGqNlgei~DDvVeYFqKKLYrALNVPvSRL---------------ITRDELKFsKFI~RLR~RF---F~DlLRtQ 710 (878) |+||||+++++++|| |+||++|||+|||||+||| |||||+||+|||.|||+|| |.++||+| T Consensus 340 ItTLpGgqnlgem~D-V~YF~kkLy~aLnVP~sRl~~e~~~~~~~Gr~~EItRDEiKF~KFI~rLR~rFs~lf~~~L~~q 418 (516) T protein:vir:10 340 VSSLPGAQTMGDMDD-VRWFNKKLYEALRIPLSRIPRDDGGMVIGGQDTAITRDELDFRKFVVQLQHDFEEIFLDPLKTN 418 (516) T ss_pred eeeccccCCcChHHH-HHHHHHHHHHHhCCCcccccCCCCceeeccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 999976655555555 9999999999999999999 9999999999999999999 99999999 Q ss_pred ccchhhhhh--hhccccccccccccchhHHHHHHHhhhhHhhhhhhhhhhhhhhhHhhcchhhhheecCcccccchhhhh Q lcl|NC_016163. 711 VILSKGYTE--LVRKIYNLNFKSNEVDDKSDPAKDEILTKNAKSYTDTQSLAKAAIKYFDINNISVKFPSPASLNMNNLS 788 (878) Q Consensus 711 LILKKiiTE--~IRniyn~nFksnEVdd~selkk~EILt~riksytd~Qsla~aa~KYFdv~yis~K~l~~t~l~m~eld 788 (878) ||||++||+ |-.-.-++.|.....+++.|++..|||..|+..+ +.++|+++|||+++|+++++|++++-+|.+.+ T Consensus 419 LilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l---~~~dpyvGky~s~~yi~k~ILr~tDeei~~e~ 495 (516) T protein:vir:10 419 LIYKRIITEDEWDEQINNIKVNFHQDSYYTELKDIETLRLRVDAL---SQIEPYVGKYVSHDYVMKNILQMTEEQIAQEE 495 (516) T ss_pred hhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHH---HHhhhhhccccchHHHHHHHhcCCHhhHHHHH Confidence 999999995 4332234555556679999999999999999988 78899999999999999999888776665555 Q ss_pred HHHHHHhhhhhhhhcCccccccCCcchH Q lcl|NC_016163. 
789 EQISNVNNFVTTLTENLTFDDTIPQDDQ 816 (878) Q Consensus 789 eQIs~~~~~v~~~~~N~~F~~~~~QDDQ 816 (878) +| |.++..++.|.+.-+++|= T Consensus 496 k~-------I~~E~~~~~~~~p~~~~~f 516 (516) T protein:vir:10 496 KQ-------IEQEAGIKRFQNPENEDDF 516 (516) T ss_pred HH-------HHHhhhCCCCCCCCccccC Confidence 55 5678889999865444333 No 3 >protein:vir:100598 Length: 516 # NCBI annotation: gp20 head portal vertex protein # Family: family:all:1036 # MgeID: mge:1488 # MgeName: 25 # Cross-refs: genbank:acc:YP_656382;genbank:gi:109290133;genbank:GeneID:4156576 Probab=100.00 E-value=3.5e-122 Score=686.44 Aligned_cols=406 Identities=17% Similarity=0.271 Sum_probs=275.7 Q ss_pred ccccccccc-------------------ccccccccCCCCCCCCCCcc--------------cCCchhhhhhhhhc---- Q lcl|NC_016163. 341 MNISGFFGS-------------------AVSPFNATGEGNTQPGSNRK--------------LADPEKEKILKTKL---- 383 (878) Q Consensus 341 ~~~~~~~~~-------------------~~~~~~~~~~~~~~~~~~~~--------------~~~~~~~~~~~~~~---- 383 (878) ||++-+||- .+.|-|..|.--.+.|.|.. -..-+++-| ++-- T Consensus 1 ~~~~~lf~f~~~~d~~~~~~~~~~~~~s~~~p~~~DGa~~i~~~~~~~~~~g~~~~~~d~~~~~~~~~~LI-~~YR~ma~ 79 (516) T protein:vir:10 1 MKFLDLFKFWDRVDQNEYDERLKQGHESIATPKKDDGATEIEAREGESSYNALMQQFFGIDNNISGTKDLI-NTYRQLTN 79 (516) T ss_pred CCchHhcccccchhhHHHHhhhcCCCCcccCCCCccCceeeecCcccccccceeeeeecccCccccHHHHH-HHHHHhhh Confidence 777666654 33344433322111111110 001111111 1100 Q ss_pred -CCc--------chhhheecCCCceEEeeeccc-------------------c----------------eeeeeeE-eeE Q lcl|NC_016163. 384 -GGN--------EKAIVKRISPGNIVDLTFEDN-------------------I----------------LGYLYLD-IVE 418 (878) Q Consensus 384 -~~~--------~~~~~~~~~~~~~~~~~~~~~-------------------~----------------~~~~~~~-~~~ 418 (878) --- ..|||-. .-.++|.|..++- + -|-+|+- |+. 
T Consensus 80 ~pEvd~Av~eIvneaiv~d-~~~~pV~l~l~~~e~s~sik~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fhKiid 158 (516) T protein:vir:10 80 NPEVERAVANIVNEAVVYE-KGHKVVSLDLDDTEFSSSIKDKILEEFDEICRLLDASRKLDTLFRRWYIDSRIFFHKIMP 158 (516) T ss_pred ccchhHHHHHhhcceeEec-CCCceEEEEecccccchHHHHHHHHHHHHHHHHhccchhhhHHHHhhhhcceEEEEEEec Confidence 000 0112211 2233444432221 1 1223333 221 Q ss_pred ecCC-CcccCcccccCCCC-ceeecccccccchhHHHH----hhhcCccccccCCCccccccccccCCCCceeccccchh Q lcl|NC_016163. 419 VDPD-GTTMPSDKVDNGNE-GYTFMPSQSVGNGNVLQN----MVYSGKDIPIDGNGGHSSGAKNVENPNGQRLDVADDAR 492 (878) Q Consensus 419 ~~~~-~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 492 (878) +|. |... ---+|.-+- -+...|....+..+|..+ .+|.- | +.+. +-||+-. T Consensus 159 -~~k~GI~e-lr~lDPr~i~~vR~i~~~~~~~~~v~~~~~e~~~Y~~------~---~~~~-----~~~g~~~------- 215 (516) T protein:vir:10 159 -NPKEGIVE-LRRLDPRHVEYYREIVTSDVGGTSVVKGYREFFVYTT------G---NEGY-----AYNGRLF------- 215 (516) T ss_pred -Ccccceee-eeeeCCcceeeEEeeecccCcchhhhhceeeeeeeec------C---ccce-----ecccccc------- Confidence 211 1110 001111110 011122222222233332 23321 1 1110 1122211 Q ss_pred HHHHHHHHHhhcCCcchhhhhhhhhhhhhhhhhhhhhhhhhhhccceeEEechHHHHhhhcCchhhhhhhhhhhhhhhhH Q lcl|NC_016163. 493 LQFLAAAFANRLSDESNIKLIKKSATIKQAIYNTLSIRKLTRKDKVRVIYLKPEEVVMINRGHSIFDNILFFAKIYITTL 572 (878) Q Consensus 493 ~~~~~~~~~~~l~~~~~IKl~K~S~T~~~~~~~~l~~~~L~~k~K~RVi~~~peeIv~In~G~sI~dnil~~skiyltTl 572 (878) +.-+.|||.++++| .|.|||.+.++.-+++|+|.+|+..||.+|++| T Consensus 216 ------------~~~~~ikI~~daI~--------y~hSGl~d~~~~~i~syLhkAiKp~NQLkm~ED------------- 262 (516) T protein:vir:10 216 ------------EPNTRIKIPRSAIV--------YAHSGLQDCSDRGIVGYLHNAVKPANQLKLLED------------- 262 (516) T ss_pred ------------CCCCceecchhhee--------eeecCcccCCCCceeceehhhhHhHHhhHHHHh------------- Confidence 11234899999999 899999999999999999999999999999999 Q ss_pred HHHHHHHHhcCccceEEEEecC-CCChhHHHHHHHHH-hhcceeeecccCceeccccchhhhhhhcccccccCCCCceee Q lcl|NC_016163. 
573 LTLLMQNVLRGAPKRAVYVEVG-LDNNPANATQQAIR-DVKSKEISSITNMDMQSIINYVGEFQDYYIPVVDGEKPITFE 650 (878) Q Consensus 573 lSLVIYRITRAPERRVFYIDVG-LPK~KAEQYMrdI~-kyKNKLVYDAsTGEVRDDrk~MSMLEDYWLPRREGGRGTEIS 650 (878) +|||||||||||||||||||| |||.||||||++|| +||||+|||++||+|+|++++||||||||||||||||||||+ T Consensus 263 -AlVIYRitRAPeRRvFYIDVGnLPk~KAeqYl~~iM~k~KNklvYDa~TGev~ddrk~msMlEDyWLpRReGgrgTEIt 341 (516) T protein:vir:10 263 -ALVIYRITRAPERRVFYIDVGNMPNRKATEYVNGIMQSLKNRVVYDSNTGTVKNQKRNLSMTEDYWLMRRDGKSVTEVT 341 (516) T ss_pred -hHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeEEeCCCCeeccchhhhhhHhhhcccccCCCccccee Confidence 999999999999999999999 99999999999985 899999999999999999999999999999999999999999 Q ss_pred ecCcccccccchHHHHHHHHHHHHHcCCCchhc---------------cccccccHHHHHHHHHHHH---HHhhhHhhcc Q lcl|NC_016163. 651 TIDALDAKSLDDDFLNWLSNNIFSGMGIPSAYL---------------TEVENVDFAKTLSMQNSRF---IRDIIGDQVI 712 (878) Q Consensus 651 TLpGqNlgei~DDvVeYFqKKLYrALNVPvSRL---------------ITRDELKFsKFI~RLR~RF---F~DlLRtQLI 712 (878) ||||+++++++|| |+||++|||+|||||+||| |||||+||+|||.|||+|| |.++||+||| T Consensus 342 TLpGgqnlgem~D-V~YF~kkLy~aLnVP~SRl~~e~~~~~~~Gr~~EItRDEiKF~KFI~rLR~rFs~lF~~~L~~qLi 420 (516) T protein:vir:10 342 SLPGAQTMGEMDD-VRWFNKKLYEALRIPLSRMPRDDGGMVIGGQDMAITRDELDFRKFIVQLQHNFEEIFLDPLKTNLI 420 (516) T ss_pred eccccCCcChHHH-HHHHHHHHHHHhCCCcccccCCCCceeeccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh Confidence 9976655555555 9999999999999999999 9999999999999999999 9999999999 Q ss_pred chhhhhh--hhccccccccccccchhHHHHHHHhhhhHhhhhhhhhhhhhhhhHhhcchhhhheecCcccccchhhhhHH Q lcl|NC_016163. 
713 LSKGYTE--LVRKIYNLNFKSNEVDDKSDPAKDEILTKNAKSYTDTQSLAKAAIKYFDINNISVKFPSPASLNMNNLSEQ 790 (878) Q Consensus 713 LKKiiTE--~IRniyn~nFksnEVdd~selkk~EILt~riksytd~Qsla~aa~KYFdv~yis~K~l~~t~l~m~eldeQ 790 (878) ||++||+ |-.-.-++.|.....+++.|++..|||..|+..+ +.++|+++|||+++|+++++|++++-+|.+.++| T Consensus 421 lKgIit~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l---~~~dpyvGky~s~~yi~k~ILr~tDeei~~~~k~ 497 (516) T protein:vir:10 421 YKKIILESEWEEQINNIKVNFHQDSYYTELKDIETLRQRVDAL---SQIEPYVGKYVSHDYVMKNILQMTDEQIAQEEKQ 497 (516) T ss_pred hcCCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHH---HHhhhhhccccchHHHHHHHhcCCHhHHHHHHHH Confidence 9999995 3322234555556779999999999999999988 7889999999999999999888877665555554 Q ss_pred HHHHhhhhhhhhcCccccccCCcchH Q lcl|NC_016163. 791 ISNVNNFVTTLTENLTFDDTIPQDDQ 816 (878) Q Consensus 791 Is~~~~~v~~~~~N~~F~~~~~QDDQ 816 (878) |.++..++.|.+.-+++|- T Consensus 498 -------I~~E~~~~~~~~p~~e~~f 516 (516) T protein:vir:10 498 -------IEKEANVKRFQNPENEDDF 516 (516) T ss_pred -------HHHhhhCCCCCCCCccccC Confidence 5678889999853222222 No 4 >protein:vir:6896 Length: 523 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:140 # MgeName: RB69 # Cross-refs: genbank:acc:NP_861872;genbank:gi:32453663;genbank:GeneID:1494298 Probab=100.00 E-value=1.3e-122 Score=688.74 Aligned_cols=443 Identities=18% Similarity=0.221 Sum_probs=280.1 Q ss_pred HHH--HHhhhheeeeccchhHHHhhhhhhhhcccccccccccccccccccC----CCCCCC-------------CCCccc Q lcl|NC_016163. 
310 DKF--SEALKDTFIIGDSSHILSEYSDVLLSEDMNISGFFGSAVSPFNATG----EGNTQP-------------GSNRKL 370 (878) Q Consensus 310 ~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~-------------~~~~~~ 370 (878) -+| ++.++ |.+-+.- -...+ .++--..|++.|-+..| +.+.+- |+.-.- T Consensus 1 m~f~~~~lf~--f~~~~de--------~~~~~--~~~~~~~S~~~p~~dDGa~~i~~~~~~~~~~~~~~~q~~y~~~e~~ 68 (523) T protein:vir:68 1 MKFNILSLFA--PWAKMDE--------RDYKD--QEKENLESITSPKLDDGAKEYEVSENEAQQTYNAMFQRMFGSQEPG 68 (523) T ss_pred CCCchhhhhh--hhhhhhh--------hhhhh--hhhccCCCccccCCCCcceeeeccccccccccchhhhhhhhccccc Confidence 011 00000 0000000 00000 01112236677777665 111111 111111 Q ss_pred CCchhhhhhhhh-c---CCcchhhheecCCCceEEeeecccceeeeeeEeeEecCCCcccCcc--------------ccc Q lcl|NC_016163. 371 ADPEKEKILKTK-L---GGNEKAIVKRISPGNIVDLTFEDNILGYLYLDIVEVDPDGTTMPSD--------------KVD 432 (878) Q Consensus 371 ~~~~~~~~~~~~-~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--------------~~~ 432 (878) +-.+++.|=.-. + ---+.|+ -.||+ +-|..=-+-++|+|+-|.|-.+.. -.+ T Consensus 69 ~~~~~eLI~~YR~ma~~pEvd~Av------~eIVn----eaiv~d~~~~pV~i~Ld~~~~s~~iK~kI~eeF~~Il~ll~ 138 (523) T protein:vir:68 69 LKSTRELIDTYRNLMTNYEVDNAV------SEIVS----DAIVYEDDTEVVSINLDNTKFSPNIKSMMLDEFNEVLNHLS 138 (523) T ss_pred cchHHHHHHHHHHHhhccchhhHH------HHhhc----ceeeecCCCceEEEEecccccchHHHHHHHHHHHHHHHHhc Confidence 112222221100 0 0000010 01111 111111223445555444432211 001 Q ss_pred CCCCceeecccccccchhHHHHhhhcCcccccc-CC---Ccccc-----------ccccccCCCCceeccccchhHHHHH Q lcl|NC_016163. 433 NGNEGYTFMPSQSVGNGNVLQNMVYSGKDIPID-GN---GGHSS-----------GAKNVENPNGQRLDVADDARLQFLA 497 (878) Q Consensus 433 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~---~~~~~-----------~~~~~~~~~~~~~~~~~~~~~~~~~ 497 (878) -.+.||.+.--.-|- |. 
| |--|- || .| |=+.- --...++++|- .|-..-+..|+- T Consensus 139 F~~~~~~~fR~WYVD-gR----i-~fhKi--id~k~pk~GI~Elr~lDPr~i~~vr~i~~~~~~g~--~vi~~~~e~f~Y 208 (523) T protein:vir:68 139 FQRKGSDHFRRWYVD-SR----I-FFHKI--IDPKRPKEGIKELRRLDPRQVQYVREVITTTEAGV--KIVKGYKEYFIY 208 (523) T ss_pred cchhhhHHHHhheee-eE----E-EEEEE--eeCCCccccceeeeeeCCcceeEEEeecCCCCcch--hhhhhhhhheee Confidence 112222211110000 00 0 00000 00 00 00000 00001111111 111122222221 Q ss_pred H------HHHh-hcCCcchhhhhhhhhhhhhhhhhhhhhhhhhhhccceeEEechHHHHhhhcCchhhhhhhhhhhhhhh Q lcl|NC_016163. 498 A------AFAN-RLSDESNIKLIKKSATIKQAIYNTLSIRKLTRKDKVRVIYLKPEEVVMINRGHSIFDNILFFAKIYIT 570 (878) Q Consensus 498 ~------~~~~-~l~~~~~IKl~K~S~T~~~~~~~~l~~~~L~~k~K~RVi~~~peeIv~In~G~sI~dnil~~skiylt 570 (878) . +|-- ..+..+.|||.+++|| .|.|||.+.++.-+++|+|.+|+..||.+|++| T Consensus 209 ~~~~~~~~~~g~~~~~~~~ikI~~dAI~--------y~hSGL~d~~~~~i~gyLhkAiKp~NQLkmlED----------- 269 (523) T protein:vir:68 209 DTSHESYACDGRIYEAGTKIKIPKAAIV--------YAHSGLVDCCGKNIIGYLHRAIKPANQLKLLED----------- 269 (523) T ss_pred ccccccccccccccCCCcceecchhhee--------eeeccceeCCCCceeccchhhhHHHHhhHHHHh----------- Confidence 1 0100 1122478999999999 999999999999999999999999999999999 Q ss_pred hHHHHHHHHHhcCccceEEEEecC-CCChhHHHHHHHHH-hhcceeeecccCceeccccchhhhhhhcccccccCCCCce Q lcl|NC_016163. 
571 TLLTLLMQNVLRGAPKRAVYVEVG-LDNNPANATQQAIR-DVKSKEISSITNMDMQSIINYVGEFQDYYIPVVDGEKPIT 648 (878) Q Consensus 571 TllSLVIYRITRAPERRVFYIDVG-LPK~KAEQYMrdI~-kyKNKLVYDAsTGEVRDDrk~MSMLEDYWLPRREGGRGTE 648 (878) +|||||||||||||||||||| |||.||||||++|| +||||+|||++||+|+|++++||||||||||||||||||| T Consensus 270 ---AlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl~~im~k~kNKlvYDa~TGev~ddrk~msMlEDyWLpRReGgrgTE 346 (523) T protein:vir:68 270 ---AVVIYRITRAPDRRVWYVDTGNMPSRKAAEHMQHVMNTMKNRIAYDATTGKIKNQQHIMSMTEDYWLQRRDGKAVTE 346 (523) T ss_pred ---hHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhhcceeEEeccCCeeccchhhhhhHhhhcccccCCCcccc Confidence 999999999999999999999 99999999999985 8999999999999999999999999999999999999999 Q ss_pred eeecCcccccccchHHHHHHHHHHHHHcCCCchhc--------------cccccccHHHHHHHHHHHH---HHhhhHhhc Q lcl|NC_016163. 649 FETIDALDAKSLDDDFLNWLSNNIFSGMGIPSAYL--------------TEVENVDFAKTLSMQNSRF---IRDIIGDQV 711 (878) Q Consensus 649 ISTLpGqNlgei~DDvVeYFqKKLYrALNVPvSRL--------------ITRDELKFsKFI~RLR~RF---F~DlLRtQL 711 (878) |+||||+++++++|| |+||++|||+|||||+||| |||||+||+|||.|||+|| |.++||+|| T Consensus 347 ItTLpGgqnlgem~D-V~YF~kkLy~aLnVP~sRl~~~~~~f~~Gr~~EItRDEikF~KFI~rLR~rFs~lf~~~Lk~qL 425 (523) T protein:vir:68 347 VDTLPGADNTGNMED-VRWFRNALYMALRIPITRIPSDQGGIQFDAGTSITRDELSFGKFIRELQHKFEEIFLDPLKTNL 425 (523) T ss_pred eeeccccCCcChHHH-HHHHHHHHHHHhCCcceeecCCCcceecccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHhh Confidence 999976655455555 9999999999999999999 9999999999999999999 999999999 Q ss_pred cchhhhhh--hhccccccccccccchhHHHHHHHhhhhHhhhhhhhhhhhhhhhHhhcchhhhheecCcccccchhhhhH Q lcl|NC_016163. 
712 ILSKGYTE--LVRKIYNLNFKSNEVDDKSDPAKDEILTKNAKSYTDTQSLAKAAIKYFDINNISVKFPSPASLNMNNLSE 789 (878) Q Consensus 712 ILKKiiTE--~IRniyn~nFksnEVdd~selkk~EILt~riksytd~Qsla~aa~KYFdv~yis~K~l~~t~l~m~elde 789 (878) |||++||+ |-.-.-++.|.....+++.|++..|||..|+..+ +.++|+++|||+++|+++++|++++-+|.++++ T Consensus 426 ilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l---~~~dpyvGky~s~~yi~k~ILr~tDeei~~~~k 502 (523) T protein:vir:68 426 ILKGIITEDEWNDEINNIKIKFHRDSYFSELKDAEILERRINML---QMAEPFIGKYISHRTAMKDILQMSDEEIEQEAK 502 (523) T ss_pred hhccCCCHHHHHHHhhcceEeeeecchHHHHHHHHHHHHHHHHH---HHhhhhhcccchhHHHHHHHhccCHHHHHHHHH Confidence 99999995 4332244555556679999999999999999988 788999999999999999988887665555555 Q ss_pred HHHHHhhhhhhhhcCccccccCCcchHHHH Q lcl|NC_016163. 790 QISNVNNFVTTLTENLTFDDTIPQDDQEKL 819 (878) Q Consensus 790 QIs~~~~~v~~~~~N~~F~~~~~QDDQ~~~ 819 (878) | |.++..++.|++ |++++|.. T Consensus 503 q-------I~~E~k~~~~~~--p~~e~~~f 523 (523) T protein:vir:68 503 Q-------IEEESKEARFQD--PDQEQEDF 523 (523) T ss_pred H-------HHHHhhcCCCCC--CchhhhcC Confidence 4 567888999984 44555544 No 5 >protein:vir:98265 Length: 524 # NCBI annotation: gp20 portal vertex of the head # Family: family:all:1036 # MgeID: mge:1667 # MgeName: RB43 # Cross-refs: genbank:acc:YP_239198;genbank:gi:66391673;genbank:GeneID:3416367 Probab=100.00 E-value=3.1e-122 Score=686.77 Aligned_cols=422 Identities=17% Similarity=0.261 Sum_probs=285.1 Q ss_pred cccccccc-------------------------cccccccccCCCC-------CCCCC-C-cccCCchhhhhhhhhcCCc Q lcl|NC_016163. 341 MNISGFFG-------------------------SAVSPFNATGEGN-------TQPGS-N-RKLADPEKEKILKTKLGGN 386 (878) Q Consensus 341 ~~~~~~~~-------------------------~~~~~~~~~~~~~-------~~~~~-~-~~~~~~~~~~~~~~~~~~~ 386 (878) ||.-|||. |.+.|-+..|.-- ...|. . 
.-..|++-.- -| T Consensus 1 ~~~~~~~~~l~~~~~~~~~d~~~~~~~~~~~~~s~~~p~~~dGa~~i~~~~~~~~~~g~~~~~y~~~e~~~-------~~ 73 (524) T protein:vir:98 1 MNFLGFGNVLSFFKNFAREDEIELEQQLKNDTGSVAPPKNNDGAYEIETDLNNQKYAGVFQQFYSGQDPAI-------QN 73 (524) T ss_pred CCCcchhhHHHHhhhhhhhhhhhHhhhhcCCcccccCCCCCCCceeecCCCCcceecceeeeecccccccc-------ch Confidence 66666654 5556666644311 11110 0 0111111000 01 Q ss_pred chhhheec-----CC--CceEEeeecccceeeeeeEeeEecCCCcccCcc--------------cccCCCCceeeccccc Q lcl|NC_016163. 387 EKAIVKRI-----SP--GNIVDLTFEDNILGYLYLDIVEVDPDGTTMPSD--------------KVDNGNEGYTFMPSQS 445 (878) Q Consensus 387 ~~~~~~~~-----~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--------------~~~~~~~~~~~~~~~~ 445 (878) ++..+++- .| -+-|+--..+-|..=-+-++|+||-|.+-.+.. -.+-.+.||.+.--.- T Consensus 74 ~~eLI~~YR~ma~~pEvd~Av~eIVneaIv~~~~~~pV~l~L~~~~~s~~iK~kI~eeF~~Il~ll~F~~~~~~~fR~WY 153 (524) T protein:vir:98 74 KEQLINTYRGIMSYPEVENAVSEIIDDAIVNEQGKDIITMDLAKTNFSKAIQDKIVEEFDNVLNIYDFDNMGARLFRDWY 153 (524) T ss_pred HHHHHHHHHHHhhccchhhHHHhhhcceeEecCCCceEEEEecccccchHHHHHHHHHHHHHHHHhccchhhhHHHhhhh Confidence 22222210 01 111111111223333345677777766654322 0122334443321111 Q ss_pred ccchhHHHHhhhcCccccccCCCcccccccccc----------------C-CCCceeccccchh---------HHHHHHH Q lcl|NC_016163. 446 VGNGNVLQNMVYSGKDIPIDGNGGHSSGAKNVE----------------N-PNGQRLDVADDAR---------LQFLAAA 499 (878) Q Consensus 446 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----------------~-~~~~~~~~~~~~~---------~~~~~~~ 499 (878) |- |.+ |-- +-||.+. +-|-+-+- + ++|-+ |-..-+ ..+.+.. T Consensus 154 VD-gRi-----~fh--kiid~~~--~kGI~ELr~lDPr~i~~vr~~~~~~~~~~~~--v~~~~~e~f~Y~~~~~~~~~~g 221 (524) T protein:vir:98 154 VD-SRI-----YFH--KIMHKDE--SKGIRELRQLDPRCMELIRESITETLDGGVK--VFRGYREFFVYSAPKAGYTYNG 221 (524) T ss_pred hc-cee-----EEE--EEEcCCC--CcceeeeeeeCCccceeeeeccccccccchh--hccceeeeeeeccCCCcccccc Confidence 10 000 000 1122221 11211110 0 01111 000000 0000000 Q ss_pred HHhhcCCcchhhhhhhhhhhhhhhhhhhhhhhhhhhccceeEEechHHHHhhhcCchhhhhhhhhhhhhhhhHHHHHHHH Q lcl|NC_016163. 
500 FANRLSDESNIKLIKKSATIKQAIYNTLSIRKLTRKDKVRVIYLKPEEVVMINRGHSIFDNILFFAKIYITTLLTLLMQN 579 (878) Q Consensus 500 ~~~~l~~~~~IKl~K~S~T~~~~~~~~l~~~~L~~k~K~RVi~~~peeIv~In~G~sI~dnil~~skiyltTllSLVIYR 579 (878) .-.+.+ ..|||.++++| .|.|||.+.+|+ +|+|+|.+|+..||.+|++| +||||| T Consensus 222 ~~~~~~--~~ikI~~dAIv--------y~hSGL~d~~~~-iisyLhkAiKp~NQLkm~ED--------------AlVIYR 276 (524) T protein:vir:98 222 QIYQAN--QKIKIPRSAIV--------YAHSGLEDCSNN-IIGYLHRAVKPANQLRLLED--------------AMVIYR 276 (524) T ss_pred ceecCC--Cceeechhhee--------eeccCcccCCCC-eeeehhHhhHhHHhhHHHHh--------------hHHHHh Confidence 000122 34999999999 899999999987 88999999999999999999 999999 Q ss_pred HhcCccceEEEEecC-CCChhHHHHHHHHH-hhcceeeecccCceeccccchhhhhhhcccccccCCCCceeeecCcccc Q lcl|NC_016163. 580 VLRGAPKRAVYVEVG-LDNNPANATQQAIR-DVKSKEISSITNMDMQSIINYVGEFQDYYIPVVDGEKPITFETIDALDA 657 (878) Q Consensus 580 ITRAPERRVFYIDVG-LPK~KAEQYMrdI~-kyKNKLVYDAsTGEVRDDrk~MSMLEDYWLPRREGGRGTEISTLpGqNl 657 (878) ||||||||||||||| |||.||||||++|| +||||+|||++||+|+|+||+||||||||||||||||||||+||||+++ T Consensus 277 itRAPeRRvFYIDvGnlPk~KAeqYl~~im~k~kNklvYDa~TGevrddrk~msMlEDyWLpRReGgrgTEItTLpggqn 356 (524) T protein:vir:98 277 ITRAPERRVFYIDVGQMGGNKATQYVNNIAQGLKNRVVYDARTGTVKNQQNNLSMTEDYWLMRRDGKAITEVSTLPGGQN 356 (524) T ss_pred hhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeEeeccCceeeccccccchhhhhcccccCCCCccceeeccccCC Confidence 999999999999999 99999999999985 8999999999999999999999999999999999999999999976655 Q ss_pred cccchHHHHHHHHHHHHHcCCCchhc--------------cccccccHHHHHHHHHHHH---HHhhhHhhccchhhhhh- Q lcl|NC_016163. 
658 KSLDDDFLNWLSNNIFSGMGIPSAYL--------------TEVENVDFAKTLSMQNSRF---IRDIIGDQVILSKGYTE- 719 (878) Q Consensus 658 gei~DDvVeYFqKKLYrALNVPvSRL--------------ITRDELKFsKFI~RLR~RF---F~DlLRtQLILKKiiTE- 719 (878) ++++|| |+||++|||+|||||+||| |||||+||+|||.|||+|| |.++||+|||||++||+ T Consensus 357 lgem~D-V~YF~kkLy~aLnVP~sRl~~~~~~f~~Gr~~EItRDEiKF~KFI~rLR~rFs~lf~~~L~~qLilKgiit~e 435 (524) T protein:vir:98 357 FSDMDD-IKWFNRKLYEALRVPLSRMPRDDGGMQIGGGGEITRDELKFSKFIRTLQIQFSPVLSDPLKTNLIAKKIITED 435 (524) T ss_pred cChHHH-HHHHHHHHHHHhCCCceeccCCCCccccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcCCCHH Confidence 555555 9999999999999999999 9999999999999999999 99999999999999995 Q ss_pred -hhccccccccccccchhHHHHHHHhhhhHhhhhhhhhhhhhhhhHhhcchhhhheecCcccccchhhhhHHHHHHhhhh Q lcl|NC_016163. 720 -LVRKIYNLNFKSNEVDDKSDPAKDEILTKNAKSYTDTQSLAKAAIKYFDINNISVKFPSPASLNMNNLSEQISNVNNFV 798 (878) Q Consensus 720 -~IRniyn~nFksnEVdd~selkk~EILt~riksytd~Qsla~aa~KYFdv~yis~K~l~~t~l~m~eldeQIs~~~~~v 798 (878) |-.-.-++.|.....+++.|++..|||..|+..+ +.++|+++|||+++|+++++|++++-+|++++.| | T Consensus 436 ew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l---~~~dpyvGky~s~dyi~k~ILr~tDeei~~~~k~-------I 505 (524) T protein:vir:98 436 EWEENVSKISFVFQQDSYYAEVKDIEILERRLNLM---SQVEGVVGKYVSHKYIMKEILRMSDEDIDEQAKL-------I 505 (524) T ss_pred HHHHHhhcceEEEeecchHHHHHHHHHHHHHHHHH---HHhccccccccchHHHHHHHhccCHHHHHHHHHH-------H Confidence 4333334555556679999999999999999988 8899999999999999999988887666655555 5 Q ss_pred hhhhcCccccccCCcchHHHH Q lcl|NC_016163. 799 TTLTENLTFDDTIPQDDQEKL 819 (878) Q Consensus 799 ~~~~~N~~F~~~~~QDDQ~~~ 819 (878) .++..++.|++ |+++.+.. 
T Consensus 506 ~~E~k~~~~~~--p~~e~~~f 524 (524) T protein:vir:98 506 EEESKEERFKN--PEAEEENF 524 (524) T ss_pred HHHHhCCCCcC--CccccccC Confidence 67888999984 56666655 No 6 >protein:vir:108049 Length: 524 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:2002 # MgeName: JS98 # Cross-refs: genbank:acc:YP_001595296;genbank:gi:161622602;genbank:GeneID:5783768 Probab=100.00 E-value=4.1e-122 Score=686.09 Aligned_cols=445 Identities=18% Similarity=0.254 Sum_probs=284.2 Q ss_pred hhhHHHHHHhhhheeeeccchhHHHhhhhhhhhcccccccccccccccccccCC------CCCCCCC---------Cccc Q lcl|NC_016163. 306 GAALDKFSEALKDTFIIGDSSHILSEYSDVLLSEDMNISGFFGSAVSPFNATGE------GNTQPGS---------NRKL 370 (878) Q Consensus 306 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~~---------~~~~ 370 (878) -|..+..++.++ |++-..- .+|... + +--..|.+.|-+..|. .|+.|+. .-.- T Consensus 1 ~~~~~~~~~lf~--f~~~~de---~~~~~~-----~--~~~~~S~~~p~~~dGa~~I~~~~~~~~~~~~~q~~y~~~e~~ 68 (524) T protein:vir:10 1 MANFNTILSFLK--PWANEDE---KEYKQQ-----I--NNNLESVTAPKLDDGAREIETQEQNIPYNALMQQMFGSNEPE 68 (524) T ss_pred CCchhhHHHHhh--hhhcchh---hhhhhh-----h--ccCCCccccCCCCCCceeeccCcccccchhhhhhhhhcccch Confidence 222233333222 1111000 000000 0 1112345555555543 2232321 1110 Q ss_pred CCchhhhhhhhhcC-----CcchhhheecCCCceEEeeecccceeeeeeEeeEecCCCcccCcc--------------cc Q lcl|NC_016163. 371 ADPEKEKILKTKLG-----GNEKAIVKRISPGNIVDLTFEDNILGYLYLDIVEVDPDGTTMPSD--------------KV 431 (878) Q Consensus 371 ~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--------------~~ 431 (878) +--+++.| ++--+ --+.|+ -.||+ +-|..=-+-++|+||-|++-.+.. -. 
T Consensus 69 ~~~~~eLI-~~YR~ma~~pEvd~Av------~eIVn----eaiv~d~~~~pV~l~Ld~~~~s~siK~kI~eeF~~Il~ll 137 (524) T protein:vir:10 69 VKNTRELI-DTYRNLMNNYEVDNAV------QEIVS----DAIVYEDDKEVVALNLDGTDFSQSIKDKILAEFSEVLNLL 137 (524) T ss_pred hhhHHHHH-HHHHHHhhccchhhHH------HHhhc----ceeEecCCCceEEEEecccCcchHHHHHHHHHHHHHHHHh Confidence 11122211 11000 000010 01111 111111233455555555543221 01 Q ss_pred cCCCCceeecccccccchhHHHHhhhcCccccccCCCcccccccc----------------ccCCCCceeccccchhHHH Q lcl|NC_016163. 432 DNGNEGYTFMPSQSVGNGNVLQNMVYSGKDIPIDGNGGHSSGAKN----------------VENPNGQRLDVADDARLQF 495 (878) Q Consensus 432 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----------------~~~~~~~~~~~~~~~~~~~ 495 (878) +-.+.||.+.--.-|- |. +|--|-|-. .|. +-|-+- .+.-+| ..|-.+-+--| T Consensus 138 ~F~~~~~~~fR~WYVD-gR-----i~fHkiid~-~~p--k~GI~Elr~lDPr~i~~vr~i~~~~~~~--~~vi~~~~e~f 206 (524) T protein:vir:10 138 NFQRKGTDHFQRWYVD-SR-----IFFHKIINP-KKM--KDGVQELRRLDPRQVQYIREIVTRMEDG--VKIVDGYREFF 206 (524) T ss_pred ccchhhhHHHhhheee-ce-----EEEEEEeeC-CCc--cccceeeeeeCCccceeeeeecccCccc--chhhcchhhhe Confidence 1122233221111110 00 000000000 000 000000 011111 11122222222 Q ss_pred HHHH-------HHhhcCCcchhhhhhhhhhhhhhhhhhhhhhhhhhhccceeEEechHHHHhhhcCchhhhhhhhhhhhh Q lcl|NC_016163. 496 LAAA-------FANRLSDESNIKLIKKSATIKQAIYNTLSIRKLTRKDKVRVIYLKPEEVVMINRGHSIFDNILFFAKIY 568 (878) Q Consensus 496 ~~~~-------~~~~l~~~~~IKl~K~S~T~~~~~~~~l~~~~L~~k~K~RVi~~~peeIv~In~G~sI~dnil~~skiy 568 (878) +-.. -.+..+..+.|||.++++| .|.|||.+.++.-+++|+|.+|+..||.+|++| T Consensus 207 ~Y~~~~~~~~~~~~~~~~~~~ikI~~dAIv--------y~~SGL~d~~~~~i~syLhkAiKp~NQLkm~ED--------- 269 (524) T protein:vir:10 207 VYDTGHESYCADGRIYSAGTKVKIPRAAVV--------YAHSGLLDCCGKNIIGYLQRAIKPANQLKLMED--------- 269 (524) T ss_pred eecCCCcccccCcceecCCcceecchhhee--------eeccCcccCCCCceeccchHhhHHHHhhHHHHh--------- Confidence 2110 0112356678999999999 899999999999999999999999999999999 Q ss_pred hhhHHHHHHHHHhcCccceEEEEecC-CCChhHHHHHHHHH-hhcceeeecccCceeccccchhhhhhhcccccccCCCC Q lcl|NC_016163. 
569 ITTLLTLLMQNVLRGAPKRAVYVEVG-LDNNPANATQQAIR-DVKSKEISSITNMDMQSIINYVGEFQDYYIPVVDGEKP 646 (878) Q Consensus 569 ltTllSLVIYRITRAPERRVFYIDVG-LPK~KAEQYMrdI~-kyKNKLVYDAsTGEVRDDrk~MSMLEDYWLPRREGGRG 646 (878) +|||||||||||||||||||| |||.||||||++|| +||||+|||++||+|+|++++||||||||||||||||| T Consensus 270 -----AlVIYRitRAPeRRvFYIDVGnlPk~KAeqYl~~im~k~kNKlvYDa~TGev~ddrk~msMlEDyWLpRReGgrg 344 (524) T protein:vir:10 270 -----AMVIYRITRAPDRRVFYIDTGNMPSRKAAAQMQHIMNTMKNRVVYDASTGKIKNQQHNMSMTEDYWLQRRDGKAV 344 (524) T ss_pred -----hHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeEEeccCCeeccchhhhhhHhhhcccccCCCCc Confidence 999999999999999999999 99999999999985 89999999999999999999999999999999999999 Q ss_pred ceeeecCcccccccchHHHHHHHHHHHHHcCCCchhc---------------cccccccHHHHHHHHHHHH---HHhhhH Q lcl|NC_016163. 647 ITFETIDALDAKSLDDDFLNWLSNNIFSGMGIPSAYL---------------TEVENVDFAKTLSMQNSRF---IRDIIG 708 (878) Q Consensus 647 TEISTLpGqNlgei~DDvVeYFqKKLYrALNVPvSRL---------------ITRDELKFsKFI~RLR~RF---F~DlLR 708 (878) |||+||||+++++++|| |+||++|||+|||||+||| |||||+||+|||.|||+|| |.++|| T Consensus 345 TEItTLpGgqnlgem~D-V~YF~kkLy~aLnVP~sRl~~e~~~~f~~gr~~EItRDEiKF~KFI~rLR~rFs~lf~~~L~ 423 (524) T protein:vir:10 345 TEVDTMPGATGMSDMDD-VLYFRTALYRALRIPESRIPSESNSGVMFDAGTAITRDELKFAKWIRQLQNKFEEIFLDPLK 423 (524) T ss_pred cceeeccccCCcChHHH-HHHHHHHHHHHhCCCchhccCCCCccccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 99999976655555555 9999999999999999999 9999999999999999999 999999 Q ss_pred hhccchhhhhh--hhccccccccccccchhHHHHHHHhhhhHhhhhhhhhhhhhhhhHhhcchhhhheecCcccccchhh Q lcl|NC_016163. 
709 DQVILSKGYTE--LVRKIYNLNFKSNEVDDKSDPAKDEILTKNAKSYTDTQSLAKAAIKYFDINNISVKFPSPASLNMNN 786 (878) Q Consensus 709 tQLILKKiiTE--~IRniyn~nFksnEVdd~selkk~EILt~riksytd~Qsla~aa~KYFdv~yis~K~l~~t~l~m~e 786 (878) +|||||++||+ |-.-.-++.|.....+++.|++..|||..|+..+ +.++|+++|||+++||++++|++++-+|++ T Consensus 424 ~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l---~~~dpyvGky~s~~yi~k~ILr~tDeei~~ 500 (524) T protein:vir:10 424 TNLILKKIITEDEWEREINNIKVTFNRDSYFSEMKDAEIMERRINML---TMAEPFIGKYISHQTAMKDFLQMTDEEINQ 500 (524) T ss_pred HhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHH---HHhhhhhcccchhHHHHHHHhccCHHHHHH Confidence 99999999995 4332234555556679999999999999999988 788999999999999999988887766555 Q ss_pred hhHHHHHHhhhhhhhhcCccccccCCcchHHHH Q lcl|NC_016163. 787 LSEQISNVNNFVTTLTENLTFDDTIPQDDQEKL 819 (878) Q Consensus 787 ldeQIs~~~~~v~~~~~N~~F~~~~~QDDQ~~~ 819 (878) +++| |.++..++.|++ |+++.+.. T Consensus 501 ~~k~-------I~~E~k~~~~~~--~~~~~~~f 524 (524) T protein:vir:10 501 EAKQ-------IEEESKEARFQN--PDEEEEDF 524 (524) T ss_pred HHHH-------HHHHhhcCCCCC--CChhhhcC Confidence 5555 567888999984 34444443 No 7 >protein:vir:103177 Length: 533 # NCBI annotation: gp131 # Family: family:all:1036 # MgeID: mge:1583 # MgeName: Syn9 # Cross-refs: genbank:acc:YP_717798;genbank:gi:113200635;genbank:GeneID:4239186 Probab=100.00 E-value=3.8e-122 Score=686.27 Aligned_cols=446 Identities=17% Similarity=0.273 Sum_probs=288.5 Q ss_pred cccccc-------------cccccccccCCCCCCCCCC-cccCCchhhhhhhhhcCCcchhhheec-----CC--CceEE Q lcl|NC_016163. 
343 ISGFFG-------------SAVSPFNATGEGNTQPGSN-RKLADPEKEKILKTKLGGNEKAIVKRI-----SP--GNIVD 401 (878) Q Consensus 343 ~~~~~~-------------~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~--~~~~~ 401 (878) .|-.|| |.+.|-+..|.-..+.|.+ -...|-+- -+ -+++..|++- .| -+-|+ T Consensus 1 m~~lfg~~i~~~~~~~~~~s~~~~~~~dg~~~i~~~~~~~~~~~~e~--~~-----~~~~eLI~~YR~ma~~pEvd~Av~ 73 (533) T protein:vir:10 1 MSQLFGFSLERAKKAPKGPSFVQKDNLDGSQPVSGGGYYGYTVDFDG--QV-----RNEYQLISRYREMVLQPECDSAVD 73 (533) T ss_pred CccccccccccccccccCCCCCCCCcccccceeecccccceeeeccc--cc-----chHHHHHHHHHHHhhccchhhHHH Confidence 222333 3333333333322222111 00000000 00 0122222211 01 11111 Q ss_pred eeecccceeeeeeEeeEecCCCcccCcccc---------------cCCCCceeecccccccc---------------h-- Q lcl|NC_016163. 402 LTFEDNILGYLYLDIVEVDPDGTTMPSDKV---------------DNGNEGYTFMPSQSVGN---------------G-- 449 (878) Q Consensus 402 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---------------~~~~~~~~~~~~~~~~~---------------~-- 449 (878) --..+-|..=-+-++|+|+-|.|- -|+++ +-.+.||.+.-..-|-. | T Consensus 74 eIVneaiv~d~~~~pV~i~Ld~~~-~s~~iK~kI~eEF~~Il~ll~F~~~~~e~fR~WYVDgRi~fHkiid~~~pk~GI~ 152 (533) T protein:vir:10 74 DIVNETICGNFDDVPVSVELSNLK-VSDKIKKLIREEFGEILRLLDFENRSYEIFRRWYVDGRLFYHKVIDPDNPQGGLI 152 (533) T ss_pred HhhcceeeecCCCceEEEEecccc-cchHHHHHHHHHHHHHHHHhccchhhhHHHhhhhhcceEEEEEEecCCCccccce Confidence 111122222233456666665544 22221 22333433221111100 0 Q ss_pred hHHH----Hhhh--cCccccccCCCc---ccc---cc--ccccCCCCceeccccchhHHHHHHHHHhhcCCcchhhhhhh Q lcl|NC_016163. 450 NVLQ----NMVY--SGKDIPIDGNGG---HSS---GA--KNVENPNGQRLDVADDARLQFLAAAFANRLSDESNIKLIKK 515 (878) Q Consensus 450 ~~~~----~~~~--~~~~~~~~~~~~---~~~---~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~IKl~K~ 515 (878) .+.+ +|-| --+.-+-+|.-| .+. |. 
--+.||+|+- .+....+||.++ T Consensus 153 ELr~lDPr~i~~vr~i~~~~~~~~~~~~~~~~v~~~~~eyf~Ynp~g~~-------------------~~~~~~vkI~~d 213 (533) T protein:vir:10 153 ELRYIDPRKIRKINETEQKRPEQLRGLPLNQQLSPKSAEYFLYDPKGLK-------------------NSTTQGLKIAPD 213 (533) T ss_pred eeeeccccceeeeeeeeccCCCccceeecchhhhccceeeeeecccccc-------------------ccCCCceecchh Confidence 0000 0000 000000011000 000 00 1233555442 234667999999 Q ss_pred hhhhhhhhhhhhhhhhhhhhccceeEEechHHHHhhhcCchhhhhhhhhhhhhhhhHHHHHHHHHhcCccceEEEEecC- Q lcl|NC_016163. 516 SATIKQAIYNTLSIRKLTRKDKVRVIYLKPEEVVMINRGHSIFDNILFFAKIYITTLLTLLMQNVLRGAPKRAVYVEVG- 594 (878) Q Consensus 516 S~T~~~~~~~~l~~~~L~~k~K~RVi~~~peeIv~In~G~sI~dnil~~skiyltTllSLVIYRITRAPERRVFYIDVG- 594 (878) ++| .|.|||.+.++.-+++|+|.+|+..||.+||+| +|||||||||||||||||||| T Consensus 214 AI~--------y~hSGl~d~~~~~i~syLhkAiKp~NQLkm~ED--------------AlVIYRitRAPeRRvFYIDVGn 271 (533) T protein:vir:10 214 SIC--------YVHSGIMDLNKNMTLSHLHKAIKAVNQLRMIED--------------SLVIYRLSRAPERRIFYIDVGN 271 (533) T ss_pred hee--------eeeccceeCCCCceeccchHhHHHHHhhHHHHh--------------hHHHHhhhccccceEEEEecCC Confidence 999 899999999999999999999999999999999 999999999999999999999 Q ss_pred CCChhHHHHHHHHH-hhcceeeecccCceeccccchhhhhhhcccccccCCCCceeeecCcccccccchHHHHHHHHHHH Q lcl|NC_016163. 
595 LDNNPANATQQAIR-DVKSKEISSITNMDMQSIINYVGEFQDYYIPVVDGEKPITFETIDALDAKSLDDDFLNWLSNNIF 673 (878) Q Consensus 595 LPK~KAEQYMrdI~-kyKNKLVYDAsTGEVRDDrk~MSMLEDYWLPRREGGRGTEISTLpGqNlgei~DDvVeYFqKKLY 673 (878) |||.||||||++|| +||||+|||++||+|+|+++|||||||||||||||||||||+||||+++++++|| |+||++||| T Consensus 272 LPk~KAeqYlr~iM~k~KNklVYDa~TGev~ddrk~msMlEDyWLPRReGgrgTEItTLpGgqnLgem~D-V~YF~kKLY 350 (533) T protein:vir:10 272 LPKNKAEQYLREVMGRYRNKLVYDANTGEIKDDKKFMSMLEDFWLPRREGGRGTEITTLPGGQNLGELED-VKYFQKKLY 350 (533) T ss_pred CCchhHHHHHHHHHHhccceEEEeccCceecccchhhhhHhhhcccccCCCCccceeeccccCCcChHHH-HHHHHHHHH Confidence 99999999999985 8999999999999999999999999999999999999999999976655555555 999999999 Q ss_pred HHcCCCchhc-------------cccccccHHHHHHHHHHHH---HHhhhHhhccchhhhhh--hhccccccccccccch Q lcl|NC_016163. 674 SGMGIPSAYL-------------TEVENVDFAKTLSMQNSRF---IRDIIGDQVILSKGYTE--LVRKIYNLNFKSNEVD 735 (878) Q Consensus 674 rALNVPvSRL-------------ITRDELKFsKFI~RLR~RF---F~DlLRtQLILKKiiTE--~IRniyn~nFksnEVd 735 (878) +|||||+||| |||||+||+|||.|||+|| |.++||+|||||++||+ |-.-.-++.|.....+ T Consensus 351 ~aLnVP~SRl~~e~~f~~Gr~~EItRDEiKF~KFI~RLR~rFs~lF~~~Lk~qLiLKgiit~eeW~~i~~~I~~~f~~Dn 430 (533) T protein:vir:10 351 KSLNVPGSRLETETTFNVGRAAEITRDEVKFQKFVARLRKRFSELFTDLLKTQLVLKGVISIEEWDQMKEHIQYDYIADN 430 (533) T ss_pred HHhCCCccccCCCCcccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEeeeecc Confidence 9999999999 9999999999999999999 99999999999999995 4332244555556679 Q ss_pred hHHHHHHHhhhhHhhhhhhhhhhhhhhhHhhcchhhhheecCcccccchhhhhHHHHHHhhhhhhhhcCccccccCCcch Q lcl|NC_016163. 
736 DKSDPAKDEILTKNAKSYTDTQSLAKAAIKYFDINNISVKFPSPASLNMNNLSEQISNVNNFVTTLTENLTFDDTIPQDD 815 (878) Q Consensus 736 d~selkk~EILt~riksytd~Qsla~aa~KYFdv~yis~K~l~~t~l~m~eldeQIs~~~~~v~~~~~N~~F~~~~~QDD 815 (878) ++.|++..|||..|+..+ +.++|+++||||++||++++|++++-+|+++++| |.++..++.|.+ |+++ T Consensus 431 ~f~ElKe~Eil~~Rl~~l---~~~dpyvGky~S~dyi~k~ILr~tDeei~~~~kq-------I~~E~k~~~~~~--p~~~ 498 (533) T protein:vir:10 431 YFAELKEIEIRNERMNQV---ATMDPFVGKYFSVEYMRRQVLKQTDVEMKEIDKQ-------IESEMESGIIAD--PAAE 498 (533) T ss_pred hHHHHHHHHHHHHHHHHH---HHhhhhhccccchHHHHHHHhccCHHHHHHHHHH-------HHHHHhCCCCCC--Ccch Confidence 999999999999999988 7889999999999999999998887666665555 467888999995 4444 Q ss_pred HHHHHHHHHHHhhhccCCCCCHHHHHHHHHHHHHhhcchhhhhhhccchhhcccccccccCCC Q lcl|NC_016163. 816 QEKLKKKLIMRFTKKNLPNVDWDELDSIMDEIVREYTGEKVEKSISTSEEENEEGSEDVGGGF 878 (878) Q Consensus 816 Q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 878 (878) +|.. +..-.|..+ |..-+. -.++.-+.++..-.-| T Consensus 499 ~~~~--------~~~~~~~~~----------------~~~~~~----~~~~~~~~~~~~~~~~ 533 (533) T protein:vir:10 499 MDPA--------MAAGDPDAG----------------GAPAEE----VAPEGPDPSDERKAEF 533 (533) T ss_pred hhHH--------hcCCCCCcC----------------Cccccc----CCCCCCCcchhhccCC Confidence 4431 222344332 111110 0011111222222233 No 8 >protein:vir:106282 Length: 521 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:1474 # MgeName: Aeh1 # Cross-refs: genbank:acc:NP_944108;genbank:gi:38640152;genbank:GeneID:2658030 Probab=100.00 E-value=1.4e-121 Score=683.15 Aligned_cols=419 Identities=17% Similarity=0.265 Sum_probs=277.5 Q ss_pred heeeeccchhHHHhhhhhhhhccc-----ccccccccccccccccCCC-------CCCCCCCc--c-------------- Q lcl|NC_016163. 318 DTFIIGDSSHILSEYSDVLLSEDM-----NISGFFGSAVSPFNATGEG-------NTQPGSNR--K-------------- 369 (878) Q Consensus 318 ~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~-------~~~~~~~~--~-------------- 369 (878) =.| |+|.-+ -....+|- ++..-..|.+.|-+..|.- .+.|+..- . 
T Consensus 1 m~~------~~l~lf-~f~~k~~e~~~~~~~~~~~~s~~~p~~~dGa~~I~~~~~~~~~~~~~~~~~~~~~~~~~n~~eL 73 (521) T protein:vir:10 1 MNP------IFLKLL-QPWMKDDEKRVQSDLSDRIDSFAVPDTADGAIEVDKQIDTTAPKTAIVQSVLGYAPKIQNTKDL 73 (521) T ss_pred CCc------chhHHh-hhhhhhhhhHHhhhhccCccccccccCCCCceeeccCCCccccccchhhhhhccccccchHHHH Confidence 000 011000 00000000 0011122455555555441 11121110 0 Q ss_pred -------cCCchhhhhhhhhcCCcchhhheecCCCceEEeeecc-----c--------------c--------------- Q lcl|NC_016163. 370 -------LADPEKEKILKTKLGGNEKAIVKRISPGNIVDLTFED-----N--------------I--------------- 408 (878) Q Consensus 370 -------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~--------------~--------------- 408 (878) ..+||-+..+.--. | .|||-. .-+++|.|..++ + + T Consensus 74 I~~YR~ma~~pEvd~Av~eIv--n-eaiv~d-~~~~pV~i~Ld~~~~s~~iK~kI~eeF~~Il~ll~F~~~~~~~fR~WY 149 (521) T protein:vir:10 74 INQYRSLSKYHEVDNAIDEII--N-DAIVQE-DNRDTVYLDLDKTDWNESVKEMVREEFRTILKLLKFEREGKRHFRRWY 149 (521) T ss_pred HHHHHHHhhccchhhHHHhhh--c-ceEEec-CCCceEEEEecCcccchHHHHHHHHHHHHHHHHhccchhhhHHHhhhe Confidence 01222221111000 0 011111 112344443321 1 1 Q ss_pred -eeeeeeEeeEecCC----CcccCcccccCCCCce--eecccccccc---hhHHHHhhhcCccccccCCCcccccccccc Q lcl|NC_016163. 409 -LGYLYLDIVEVDPD----GTTMPSDKVDNGNEGY--TFMPSQSVGN---GNVLQNMVYSGKDIPIDGNGGHSSGAKNVE 478 (878) Q Consensus 409 -~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~--~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 478 (878) -|-+|+-.| |||+ |...- --+|.-+--| ...+...-|. +.+....+|..+ |+++. T Consensus 150 VDgRi~fHki-id~~~pk~GI~El-r~lDPr~i~~vr~i~k~~~~~~~v~~~~~e~f~Y~~~-------~~~~~------ 214 (521) T protein:vir:10 150 VDSRIYFHKM-IDPARPKDGIKEL-RLLDPRNVEYYRVNLKSNENGNDVYKGVKEFFTYGAT-------EDNRY------ 214 (521) T ss_pred eeeeEEEEEE-eeCCCccccceee-eeeCCcceeeeeeecCCCCCcchhhccceeeeeeccC-------CCcee------ Confidence 122333222 2221 11100 0000000000 0001110000 011222344322 11111 Q ss_pred CCCCceeccccchhHHHHHHHHHhhcCCcchhhhhhhhhhhhhhhhhhhhhhhhhhhccceeEEechHHHHhhhcCchhh Q lcl|NC_016163. 
479 NPNGQRLDVADDARLQFLAAAFANRLSDESNIKLIKKSATIKQAIYNTLSIRKLTRKDKVRVIYLKPEEVVMINRGHSIF 558 (878) Q Consensus 479 ~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~IKl~K~S~T~~~~~~~~l~~~~L~~k~K~RVi~~~peeIv~In~G~sI~ 558 (878) |.+| +..+.+||.+++|| .|.|||.+.++.-+++|+|.+|+..||.+|++ T Consensus 215 ~~~g----------------------~~~~~vkI~~daI~--------y~hSGL~d~~~~~i~syLhkAiKp~NQLkm~E 264 (521) T protein:vir:10 215 NISG----------------------NSNNLVQIPIDAIV--------YSHSGKVDIDGKTIVGYLHNVIKPANQLKMLE 264 (521) T ss_pred cCCC----------------------CCCcceeechhhee--------eecccceeCCCCceeccchhhhHhHHhhHHHH Confidence 1222 23567999999999 99999999999999999999999999999999 Q ss_pred hhhhhhhhhhhhhHHHHHHHHHhcCccceEEEEecC-CCChhHHHHHHHHH-hhcceeeecccCceeccccchhhhhhhc Q lcl|NC_016163. 559 DNILFFAKIYITTLLTLLMQNVLRGAPKRAVYVEVG-LDNNPANATQQAIR-DVKSKEISSITNMDMQSIINYVGEFQDY 636 (878) Q Consensus 559 dnil~~skiyltTllSLVIYRITRAPERRVFYIDVG-LPK~KAEQYMrdI~-kyKNKLVYDAsTGEVRDDrk~MSMLEDY 636 (878) | +|||||||||||||||||||| |||.||||||++|| +||||+|||++||+|+|++++||||||| T Consensus 265 D--------------AlVIYRitRAPeRRvFYIDvGnlpk~KAeqYl~~iM~k~kNklVYDa~TGev~ddrk~msMlEDy 330 (521) T protein:vir:10 265 D--------------AMVIYRITRAPERRVFYIDVGTMPNKKATQHLNNVMQGLKNRVVYDSSTGKVKNSSNNLAMTEDY 330 (521) T ss_pred h--------------hHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceEEEeccCceeccchhhhhhHhhh Confidence 9 999999999999999999999 99999999999986 8999999999999999999999999999 Q ss_pred ccccccCCCCceeeecCcccccccchHHHHHHHHHHHHHcCCCchhc--------------cccccccHHHHHHHHHHHH Q lcl|NC_016163. 
637 YIPVVDGEKPITFETIDALDAKSLDDDFLNWLSNNIFSGMGIPSAYL--------------TEVENVDFAKTLSMQNSRF 702 (878) Q Consensus 637 WLPRREGGRGTEISTLpGqNlgei~DDvVeYFqKKLYrALNVPvSRL--------------ITRDELKFsKFI~RLR~RF 702 (878) |||||||||||||+||||+++++++|| |+||++|||+|||||+||| |||||+||+|||.|||+|| T Consensus 331 WLpRReGgrgTEI~TLpggqnlgem~D-V~YF~kkLy~aLnVP~sRl~~e~~~f~~Gr~~EItRDEikF~KFI~rLR~rF 409 (521) T protein:vir:10 331 WLMRRDGKATTEVSTLPGAQSMGEMDD-VRWFNRKLYESMKIPLSRLPQEGAGVTFGAGNDITRDELQFTKYIRGLQQQF 409 (521) T ss_pred cccccCCCCccceeeccccCCcChHHH-HHHHHHHHHHHhCCCccccCCCCCceecccccchhHHHHHHHHHHHHHHHHH Confidence 999999999999999976655555555 9999999999999999999 9999999999999999999 Q ss_pred ---HHhhhHhhccchhhhhh--hhccccccccccccchhHHHHHHHhhhhHhhhhhhhhhhhhh--hhHhhcchhhhhee Q lcl|NC_016163. 703 ---IRDIIGDQVILSKGYTE--LVRKIYNLNFKSNEVDDKSDPAKDEILTKNAKSYTDTQSLAK--AAIKYFDINNISVK 775 (878) Q Consensus 703 ---F~DlLRtQLILKKiiTE--~IRniyn~nFksnEVdd~selkk~EILt~riksytd~Qsla~--aa~KYFdv~yis~K 775 (878) |.++||+|||||++||+ |-.-.-++.|.....+++.|++..|||..|+..+ +.++| +++|||+++|++++ T Consensus 410 s~~f~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~eil~~R~~~l---~~~dp~~yvGky~s~dyi~k~ 486 (521) T protein:vir:10 410 EPIFLNPLRTNLMLKGKMSVSEWEEQAENIKVVFSKDSYYEEIKDVEILERRVNLV---QTLASAEVTGKYLSHEYVMKN 486 (521) T ss_pred HHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHH---HhhcCccccccccchHHHHHH Confidence 99999999999999995 4332234555556679999999999999999988 78888 99999999999999 Q ss_pred cCcccccchhhhhHHHHHHhhhhhhhhcCccccccCCcchHHHH Q lcl|NC_016163. 776 FPSPASLNMNNLSEQISNVNNFVTTLTENLTFDDTIPQDDQEKL 819 (878) Q Consensus 776 ~l~~t~l~m~eldeQIs~~~~~v~~~~~N~~F~~~~~QDDQ~~~ 819 (878) +|++++-+|++.++|| .++..++.|.+ |++++|.. 
T Consensus 487 ILr~tDeeik~~~k~I-------~~E~~~~~~~~--p~~e~~df 521 (521) T protein:vir:10 487 ILRMSDEDIKTEREKI-------DGELKDSVYKN--PEDPMEEF 521 (521) T ss_pred HhcCCHhHHHHHHHHH-------HHhhhCCCCCC--CcchhhcC Confidence 9888876666655554 67788899983 56666665 No 9 >protein:vir:104500 Length: 537 # NCBI annotation: gp20 # Family: family:all:1036 # MgeID: mge:1548 # MgeName: P-SSM4 # Cross-refs: genbank:acc:YP_214665;genbank:gi:61806306;genbank:GeneID:3294555 Probab=100.00 E-value=1.6e-120 Score=677.36 Aligned_cols=489 Identities=17% Similarity=0.248 Sum_probs=284.5 Q ss_pred cCCcEEEEeecccHHHHHHHhhhhhhhhhhccccccchhhhhhccc----------------hhhhhhhcccccccccc- Q lcl|NC_016163. 222 RDGKLIYLVLSMNEEIKQMLSESASELDKVDSYKTLNLSEGLLNSA----------------DKSQLLSENASFSKQGS- 284 (878) Q Consensus 222 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----------------~~~~~~~~~~~~~~~~~- 284 (878) -.-+| +-+|..++ +.+...+|.. .-+--+|-..-. ...||..+--+...+-. T Consensus 1 ~~~~l--fg~~i~~~--~~~~~~~s~~-------~~~~~dg~~~~~~~~~~g~~~~~e~~~~~~~eLI~~YR~ma~~pEv 69 (537) T protein:vir:10 1 MAQQL--FGFSLQRA--KKVPKGPSFV-------QKDSLDGSQPIVGGGYFGYSVDFDGTIRNDHELITRYREMVLNPEC 69 (537) T ss_pred Ccccc--ccceeecc--cccccCCccc-------CCCcccccceeecccccccccccccccchHHHHHHHHHHHhhccch Confidence 00000 01111111 1112222211 111001100000 00111111111111100 Q ss_pred -hhHhhhhhhhccccccccchhhhhHHH--HHHhhhheeeeccchhHHHhhhhhhhhcccccccccccccccccccCCCC Q lcl|NC_016163. 285 -IFLEEFKDAFGIEKEATGVKVGAALDK--FSEALKDTFIIGDSSHILSEYSDVLLSEDMNISGFFGSAVSPFNATGEGN 361 (878) Q Consensus 285 -~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 361 (878) --+++.-+.. |--+...-.|.--||. 
||+++|+ -|++|+..||--=| | +-+|- T Consensus 70 d~Av~eIVnea-iv~d~~~~pV~i~Ld~~~~s~~iK~--------kI~eEF~~Il~ll~-----F--------~~~~~-- 125 (537) T protein:vir:10 70 DSAVDDVVNET-ICGNFDDVPISIDLHNLKQSEKIKK--------LIRSEFDEILRLLD-----F--------DNRAY-- 125 (537) T ss_pred hhHHHHhhcce-eEecCCCceEEEEecccccchHHHH--------HHHHHHHHHHHHhc-----c--------chhhh-- Confidence 0011111100 0001111112222222 4444443 35555555542111 1 11110 Q ss_pred CCCCCCcccCCchhhhhhhhhcCCcchhhheecCCCc----eEEeeecccceeeeeeEeeEecCCCcccCcccccCCCCc Q lcl|NC_016163. 362 TQPGSNRKLADPEKEKILKTKLGGNEKAIVKRISPGN----IVDLTFEDNILGYLYLDIVEVDPDGTTMPSDKVDNGNEG 437 (878) Q Consensus 362 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 437 (878) +.+-+--.-|. -..-|.|.|.| |..|..-|-.-=-.+--|..-+++++.. .+++.+ T Consensus 126 --------------e~fR~WYVDgR-i~fhKiid~k~pk~GI~ELr~lDPr~i~~vR~i~~~~~~~~~~----~~~~~~- 185 (537) T protein:vir:10 126 --------------EIFRRWYVDGR-LFFHKVIDPKKPRQGLVELRYVDPRKIRKVTEYEAKRPEALRT----QDLNQQ- 185 (537) T ss_pred --------------HHHhhheeeeE-EEEEEEEeCCCccccceeeeeeCCccceeeEeecccCCccceE----Eeccee- Confidence 00000000000 01111122211 2222111100000000011111211110 000000 Q ss_pred eeecccccccchhHHHHhhhcCccccccCCCccccccccccCCCCceeccccchhHHHHHHHHHhhcCCcchhhhhhhhh Q lcl|NC_016163. 438 YTFMPSQSVGNGNVLQNMVYSGKDIPIDGNGGHSSGAKNVENPNGQRLDVADDARLQFLAAAFANRLSDESNIKLIKKSA 517 (878) Q Consensus 438 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~IKl~K~S~ 517 (878) |+. |..++ -+.||+|+- .+....+||.++++ T Consensus 186 ------------------v~~-------~~~ey-----f~ynp~g~~-------------------~~~~~~vkI~~dAI 216 (537) T protein:vir:10 186 ------------------LTQ-------QSASY-----FLYNPKGLK-------------------NSTNQGMKIAPDSI 216 (537) T ss_pred ------------------eee-------cccce-----eeecccccc-------------------ccCCCceeccHhhe Confidence 000 01111 112444431 12345699999999 Q ss_pred hhhhhhhhhhhhhhhhhhccceeEEechHHHHhhhcCchhhhhhhhhhhhhhhhHHHHHHHHHhcCccceEEEEecC-CC Q lcl|NC_016163. 
518 TIKQAIYNTLSIRKLTRKDKVRVIYLKPEEVVMINRGHSIFDNILFFAKIYITTLLTLLMQNVLRGAPKRAVYVEVG-LD 596 (878) Q Consensus 518 T~~~~~~~~l~~~~L~~k~K~RVi~~~peeIv~In~G~sI~dnil~~skiyltTllSLVIYRITRAPERRVFYIDVG-LP 596 (878) | .|.|||++.++.-+++|+|.+|+..||.+||+| +|||||||||||||||||||| || T Consensus 217 ~--------y~hSGl~d~n~~~i~syLhkAiKp~NQLkm~ED--------------AlVIYRitRAPeRRvFYIDVGnLP 274 (537) T protein:vir:10 217 A--------YCHSGIQDLNKNMVLSHLHKAIKAVNQLRMIED--------------SLVIYRLSRAPERRIFYIDVGNLP 274 (537) T ss_pred e--------eecccceeCCCCeeeeeehhhhHHHHhhHHHHh--------------hHHHHhhhccccceEEEEecCCCC Confidence 9 999999999999999999999999999999999 999999999999999999999 99 Q ss_pred ChhHHHHHHHHH-hhcceeeecccCceeccccchhhhhhhcccccccCCCCceeeecCcccccccchHHHHHHHHHHHHH Q lcl|NC_016163. 597 NNPANATQQAIR-DVKSKEISSITNMDMQSIINYVGEFQDYYIPVVDGEKPITFETIDALDAKSLDDDFLNWLSNNIFSG 675 (878) Q Consensus 597 K~KAEQYMrdI~-kyKNKLVYDAsTGEVRDDrk~MSMLEDYWLPRREGGRGTEISTLpGqNlgei~DDvVeYFqKKLYrA 675 (878) |.||||||++|| +||||+|||++||+|+|+++|||||||||||||||||||||+||||+++++++|| |+||++|||+| T Consensus 275 k~KAeqYlr~iM~k~KNklVYDa~TGev~ddrk~msMlEDyWLPRReGgrgTEItTLpGgqnlgem~D-V~YF~kKLy~a 353 (537) T protein:vir:10 275 KNKAEQYLREVMGRYRNKLVYDANTGEIKDDKKFMSMLEDFWLPRREGGRGTEISTLPGGQNLGELED-VKYFQKKLYKA 353 (537) T ss_pred chhHHHHHHHHHHhccceEEEeccCceecccchhhhhhhhhcccccCCCcccceeeccccCCcChHHH-HHHHHHHHHHH Confidence 999999999985 8999999999999999999999999999999999999999999976655555555 99999999999 Q ss_pred cCCCchhc-------------cccccccHHHHHHHHHHHH---HHhhhHhhccchhhhhh--hhccccccccccccchhH Q lcl|NC_016163. 
676 MGIPSAYL-------------TEVENVDFAKTLSMQNSRF---IRDIIGDQVILSKGYTE--LVRKIYNLNFKSNEVDDK 737 (878) Q Consensus 676 LNVPvSRL-------------ITRDELKFsKFI~RLR~RF---F~DlLRtQLILKKiiTE--~IRniyn~nFksnEVdd~ 737 (878) ||||+||| |||||+||+|||.|||+|| |.++||+|||||++||+ |-.-.-++.|.....+++ T Consensus 354 LnVP~SRl~~e~~f~~Gr~~EItRDEiKF~KFI~RLR~rFs~lF~~~Lk~qLilKgiit~eeW~~i~~~I~~~f~~Dn~f 433 (537) T protein:vir:10 354 LNVPSSRLETETTFNIGRAAEITRDEVKFQKFIARLRKRFSELFVDLLKTQLILKGICSIEEWEEMKEHIQFDFIADNYF 433 (537) T ss_pred hCCCccccCCCCcccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchH Confidence 99999999 9999999999999999999 99999999999999995 433224455555667999 Q ss_pred HHHHHHhhhhHhhhhhhhhhhhhhhhHhhcchhhhheecCcccccchhhhhHHHHHHhhhhhhhhcCccccccCCcchHH Q lcl|NC_016163. 738 SDPAKDEILTKNAKSYTDTQSLAKAAIKYFDINNISVKFPSPASLNMNNLSEQISNVNNFVTTLTENLTFDDTIPQDDQE 817 (878) Q Consensus 738 selkk~EILt~riksytd~Qsla~aa~KYFdv~yis~K~l~~t~l~m~eldeQIs~~~~~v~~~~~N~~F~~~~~QDDQ~ 817 (878) .|++..|||..|+..+ +.++|+++|||+.+||++++|++++-+|+++++| |.++..+++|.+ |+++++ T Consensus 434 ~ElKe~Eil~~Rl~~l---~~~dpyvGky~s~dyi~k~ILr~tDeeI~~~~k~-------I~~E~k~~~~~~--p~~~~~ 501 (537) T protein:vir:10 434 TELKEIEIRNERMNEV---AQMDPYVGKYFSANYIRTKVLKQTESEIKEIDKE-------IKQEIADGVIMD--PQAMQA 501 (537) T ss_pred HHHHHHHHHHHHHHHH---HHhhhhhhcccchHHHHHHHhccCHHHHHHHHHH-------HHHHhhCCCCCC--cccccc Confidence 9999999999999988 7889999999999999999988887666665555 456778899996 665543 Q ss_pred HHHHHHHHHhh---hccCCCCCHHHHHHHHHHHHHhhcchhhhhhhccchhhcccccccccCCC Q lcl|NC_016163. 818 KLKKKLIMRFT---KKNLPNVDWDELDSIMDEIVREYTGEKVEKSISTSEEENEEGSEDVGGGF 878 (878) Q Consensus 818 ~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 878 (878) - .|. -..+|.. 
|+.-+ ...+ --+..++--||= T Consensus 502 ~-------~~~~~~~~~~~~~-----------------~~~~~--~~~~---~~~~~~~~~~~~ 536 (537) T protein:vir:10 502 M-------EMGIGDEEPVPEG-----------------GEEPQ--TDPN---SAVSPADQKRGE 536 (537) T ss_pred c-------ccCCCCcccCCCC-----------------CCCcc--cCCc---cCCCCCCccCCC Confidence 1 111 1112211 00000 0000 001122333333 No 10 >protein:vir:104892 Length: 558 # NCBI annotation: T4-like capsid assembly protein # Family: family:all:1036 # MgeID: mge:1630 # MgeName: P-SSM2 # Cross-refs: genbank:acc:YP_214363;genbank:gi:61806003;genbank:GeneID:3294412 Probab=100.00 E-value=9.7e-122 Score=684.02 Aligned_cols=475 Identities=17% Similarity=0.268 Sum_probs=279.8 Q ss_pred hhhhhccccccccchhhhhHHHHHHhhhheeeeccchhHHHhhhhhhhhcccccccccccccccccccCCCCCCCCCCc- Q lcl|NC_016163. 290 FKDAFGIEKEATGVKVGAALDKFSEALKDTFIIGDSSHILSEYSDVLLSEDMNISGFFGSAVSPFNATGEGNTQPGSNR- 368 (878) Q Consensus 290 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~- 368 (878) ....||.+-+.. +++ +--.-|.+.|-|..|..+..-|..- T Consensus 1 m~~lfgf~~~~~-------------------------------------~~~--~~~~~s~~~p~~ddg~~~~~~~g~~~ 41 (558) T protein:vir:10 1 MAKLFGFSIEET-------------------------------------QKK--STSIISPVPKNNEDGVDNFISSGFYG 41 (558) T ss_pred Ccchhcchhhhh-------------------------------------hhh--ccCCccccCCCccccccceeccceee Confidence 222222211100 000 0000122333333332221111110 Q ss_pred cc------CCchhhhhhhhh-c---CCcchhhheecCCCceEEeeecccceeeeeeEeeEecCCCcccCcccc------- Q lcl|NC_016163. 369 KL------ADPEKEKILKTK-L---GGNEKAIVKRISPGNIVDLTFEDNILGYLYLDIVEVDPDGTTMPSDKV------- 431 (878) Q Consensus 369 ~~------~~~~~~~~~~~~-~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------- 431 (878) .. +--+++.|=.-. + ---+.|+ -.||+ +-|..=-+-++|+|+-|+|-. 
|+.+ T Consensus 42 ~~~~~~~~~~~~~eLI~~YR~ma~~pEvd~Av------~eIVn----eaiv~d~~~~pV~i~Ld~~~~-s~~iK~kI~eE 110 (558) T protein:vir:10 42 QYVDIEGAYRSEYDLIRRYREMALHPEADGAI------EDVVN----EAIVSDLYDSPVEVELSNLNA-SNTLKKKIREE 110 (558) T ss_pred eeecccchhhhHHHHHHHHHHHhhccchhhHH------HHhhc----ceeEecCCCceEEEEecccCc-chHHHHHHHHH Confidence 00 111111110000 0 0000010 01111 111111122344444444332 2111 Q ss_pred --------cCCCCceeecccccccchhHH---------------------HHh--hhcCccccccCCCc--ccccccccc Q lcl|NC_016163. 432 --------DNGNEGYTFMPSQSVGNGNVL---------------------QNM--VYSGKDIPIDGNGG--HSSGAKNVE 478 (878) Q Consensus 432 --------~~~~~~~~~~~~~~~~~~~~~---------------------~~~--~~~~~~~~~~~~~~--~~~~~~~~~ 478 (878) +-.+.||.+.-..-|-.--.+ .+| |.--+-=|.|++-- -..++-... T Consensus 111 F~~Il~ll~F~~~~~e~fR~WYVDgRiyfHKiid~k~pk~GI~ELr~lDPr~i~~Vr~i~~~~~~~~~~~~~~~~~~~~~ 190 (558) T protein:vir:10 111 FRYIKEMMDFDKKSHEIFRNWYVDGRVFYLKVIDTKNPQEGIQDLRYIDPLKIKFIRQEKRKPGNQDPAIRVRSEQDVVP 190 (558) T ss_pred HHHHHHHhccchhhhHHHhhheeeeEEEEEEEEeCCCccccceeeeeeCcccceeeeeeccccccccceeeeecccceee Confidence 111222221110000000000 000 00000001111000 000000011 Q ss_pred CCCCceeccccchhHHHHHHHHHhhcCCcchhhhhhhhhhhhhhhhhhhhhhhhhhhccceeEEechHHHHhhhcCchhh Q lcl|NC_016163. 479 NPNGQRLDVADDARLQFLAAAFANRLSDESNIKLIKKSATIKQAIYNTLSIRKLTRKDKVRVIYLKPEEVVMINRGHSIF 558 (878) Q Consensus 479 ~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~IKl~K~S~T~~~~~~~~l~~~~L~~k~K~RVi~~~peeIv~In~G~sI~ 558 (878) ||.=+-.=+-+... 
+-..+...+++..+.+||.++++| .|.|||++.++.-+++|+|.+|+..||.+||+ T Consensus 191 ~~~~~eyy~Y~~~~--~~~~~~~~~~~~~~~vkI~~dAI~--------y~hSGL~d~~~~~i~syLhkAIKp~NQLkmlE 260 (558) T protein:vir:10 191 NPEFEEFYIYTPKV--QHPTGMVGQMGGKNSIKIAKDSIT--------MCTSGLVDRNKNRVLSYLHKAIKALNQLRMIE 260 (558) T ss_pred ccceeEeeeecCCc--ccccccceeecCCCceeechhhee--------eecccceecCCCeeeecchHhhHhHHhhHHHH Confidence 11101000111111 111233445677889999999999 99999999999999999999999999999999 Q ss_pred hhhhhhhhhhhhhHHHHHHHHHhcCccceEEEEecC-CCChhHHHHHHHHH-hhcceeeecccCceeccccchhhhhhhc Q lcl|NC_016163. 559 DNILFFAKIYITTLLTLLMQNVLRGAPKRAVYVEVG-LDNNPANATQQAIR-DVKSKEISSITNMDMQSIINYVGEFQDY 636 (878) Q Consensus 559 dnil~~skiyltTllSLVIYRITRAPERRVFYIDVG-LPK~KAEQYMrdI~-kyKNKLVYDAsTGEVRDDrk~MSMLEDY 636 (878) | +|||||||||||||||||||| |||.||||||++|| +||||+|||++||+|+|+++|||||||| T Consensus 261 D--------------AlVIYRitRAPERRvFYIDVGnLPk~KAeqYlr~iM~k~KNklVYDa~TGev~ddrk~msMlEDy 326 (558) T protein:vir:10 261 D--------------SLVIYRLSRAPERRIFYIDVGNLPKVKAEQYLKEVMSRYRNKLVYDANTGEVRDDRKFMSMMEDF 326 (558) T ss_pred h--------------hHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhccceEEEeccCceecccchhhhhHhhh Confidence 9 999999999999999999999 99999999999985 8999999999999999999999999999 Q ss_pred ccccccCCCCceeeecCcccccccchHHHHHHHHHHHHHcCCCchhc-------------cccccccHHHHHHHHHHHH- Q lcl|NC_016163. 
637 YIPVVDGEKPITFETIDALDAKSLDDDFLNWLSNNIFSGMGIPSAYL-------------TEVENVDFAKTLSMQNSRF- 702 (878) Q Consensus 637 WLPRREGGRGTEISTLpGqNlgei~DDvVeYFqKKLYrALNVPvSRL-------------ITRDELKFsKFI~RLR~RF- 702 (878) |||||||||||||+||||+++++++|| |+||++|||+|||||+||| |||||+||+|||.|||+|| T Consensus 327 WLpRReGgrgTEItTLpGgqnLgem~D-V~YF~kKLy~aLnVP~SRl~~e~~f~~Gr~~EItRDEiKF~KFI~RLR~rFs 405 (558) T protein:vir:10 327 WLPRREGGRGTEITTLPGGQNLGELSD-VDYFQKKLYRALGVPESRIAAEGGFNLGRSSEILRDELKFAKFVGRLRKRFA 405 (558) T ss_pred cccccCCCCccceeeccccCCcchHHH-HHHHHHHHHHHhCCCccccCCCCcccccccchhhHHHHHHHHHHHHHHHHHH Confidence 999999999999999975555555555 9999999999999999999 9999999999999999999 Q ss_pred --HHhhhHhhccchhhhhh--hhccccccccccccchhHHHHHHHhhhhHhhhhhhhhhhhhhhhHhhcchhhhheecCc Q lcl|NC_016163. 703 --IRDIIGDQVILSKGYTE--LVRKIYNLNFKSNEVDDKSDPAKDEILTKNAKSYTDTQSLAKAAIKYFDINNISVKFPS 778 (878) Q Consensus 703 --F~DlLRtQLILKKiiTE--~IRniyn~nFksnEVdd~selkk~EILt~riksytd~Qsla~aa~KYFdv~yis~K~l~ 778 (878) |.++||+|||||++||+ |-.-.-++.|.....+++.|++..|||..|+..+ +.++|+++||||++||++++|+ T Consensus 406 ~lF~~~Lk~qLilKgiit~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l---~~~dpyvGky~S~dyi~k~ILr 482 (558) T protein:vir:10 406 AMFNDMLKTQLVLKNIVTPEDWKTMEDHIQYDFLYDNQFAELKESELMEGRLGML---ATIEPYIGKYYSTEYVRKRVLR 482 (558) T ss_pred HHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHH---HHhhhhhccccchHHHHHHHhc Confidence 99999999999999995 4332234555556679999999999999999988 7889999999999999999998 Q ss_pred ccccchhhhhHHHHHHhhhhhhhhcCccccc----------cCCcc-hHHHHHHHHHHHhhhccCCCCCH-HHHHHHHHH Q lcl|NC_016163. 779 PASLNMNNLSEQISNVNNFVTTLTENLTFDD----------TIPQD-DQEKLKKKLIMRFTKKNLPNVDW-DELDSIMDE 846 (878) Q Consensus 779 ~t~l~m~eldeQIs~~~~~v~~~~~N~~F~~----------~~~QD-DQ~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~ 846 (878) +++-+|+++++|| .++..+++|.+ ++||. 
|-+ | .+++...- ++++---.+ T Consensus 483 ~tDeeI~~~~kqI-------~~E~k~~~~~~p~~~~~~~~~~~~~~~~~~-------~----~~~~~~~~~~~~~~~~~~ 544 (558) T protein:vir:10 483 QTDMEIEEIDTQI-------EDEIQKGIIPDPSQIDPITGEPLPQEGDPA-------M----EGMGEQPVDPDLEAQAQA 544 (558) T ss_pred cCHHHHHHHHHHH-------HHHHhCCCCCCccccChhhccccCccCCch-------h----ccCCCCCcccccccchhh Confidence 8876666666655 56666777763 22221 100 1 11111110 111111122 Q ss_pred HHHhhcchhhhhhhc Q lcl|NC_016163. 847 IVREYTGEKVEKSIS 861 (878) Q Consensus 847 ~~~~~~~~~~~~~~~ 861 (878) +-.+|.- ++.|.-- T Consensus 545 ~~~~~~~-~~~~~~~ 558 (558) T protein:vir:10 545 VDAQYSK-DTKKAEL 558 (558) T ss_pred hhhhhhh-hhhhhcC Confidence 2222211 1111111 No 11 >protein:vir:103458 Length: 524 # NCBI annotation: portal vertex of the head # Family: family:all:1036 # MgeID: mge:1542 # MgeName: RB32 # Cross-refs: genbank:acc:YP_803110;genbank:gi:116326390;genbank:GeneID:4405487 Probab=100.00 E-value=1.2e-121 Score=683.57 Aligned_cols=433 Identities=17% Similarity=0.247 Sum_probs=272.0 Q ss_pred chhhhhHHHHHHhhhheeeeccchhHHHhhhhhhhhcccccccccccccccccccCCC---------------------- Q lcl|NC_016163. 303 VKVGAALDKFSEALKDTFIIGDSSHILSEYSDVLLSEDMNISGFFGSAVSPFNATGEG---------------------- 360 (878) Q Consensus 303 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---------------------- 360 (878) .+. ..|.-|.-.-+. |-.. |...+ +--..|.+.|-+..|.. T Consensus 1 m~~-~~L~~~~~w~~~-----de~~----~~~~~-------~~~~~S~~~p~~~Dga~e~~~~~~~~a~~~~g~~~~~~g 63 (524) T protein:vir:10 1 MKF-NVLSLFAPWAKM-----DERN----FKDQE-------KEDLVSITAPKLDDGAREFEVSSNEAASPYNAAFQTIFG 63 (524) T ss_pred CCC-chhhHhhccccC-----cchh----hhhhh-------ccCCccccCccCCCCceeeeecccccccccceeeeehhc Confidence 111 011111000000 0000 00000 00111333333333211 Q ss_pred CCCC-CCCcc---------cCCchhhhhhhhhcCCcchhhheecCCCceEEeeecccceeeeeeEeeEecCCCcccCccc Q lcl|NC_016163. 
361 NTQP-GSNRK---------LADPEKEKILKTKLGGNEKAIVKRISPGNIVDLTFEDNILGYLYLDIVEVDPDGTTMPSDK 430 (878) Q Consensus 361 ~~~~-~~~~~---------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 430 (878) ++.| .+|.+ ..+||-+..+.--. | .|||-. .-.+ +|+||-|.+-.+ ++ T Consensus 64 ~~e~~~~~~~eLI~~YR~ma~~pEvd~Av~eIV--n-eaiv~d-~~~~-----------------pV~l~L~~~~~s-~~ 121 (524) T protein:vir:10 64 SYEPGMKTTRELIDTYRNLMNNYEVDNAVSEIV--S-DAIVYE-DDTE-----------------VVALNLDKSKFS-PK 121 (524) T ss_pred ccccccchHHHHHHHHHHHhhccchhhHHHHhh--c-ceeEec-CCCc-----------------eEEEEecCcCcc-hH Confidence 1111 01100 11222221111000 0 112221 2233 344433333211 11 Q ss_pred c---------------cCCCCceeecccccccchhHHHHhhhcCcccccc-CC---Ccccc------cc-----ccccCC Q lcl|NC_016163. 431 V---------------DNGNEGYTFMPSQSVGNGNVLQNMVYSGKDIPID-GN---GGHSS------GA-----KNVENP 480 (878) Q Consensus 431 ~---------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~---~~~~~------~~-----~~~~~~ 480 (878) + +-.+.||.+.--.-|- |. | |--|- || .| |=+.- -- ...+.. T Consensus 122 iK~kI~eeF~~Il~ll~F~~~~~~~fR~WYVD-gR----i-~fhKi--id~k~pk~GI~Elr~lDPr~i~~vr~i~~~~~ 193 (524) T protein:vir:10 122 IKNMMLDEFNDVLNHLSFQRKGSDHFRRWYVD-SR----I-FFHKI--IDPKRPKEGIKELRRLDPRQVQYVREIITETE 193 (524) T ss_pred HHHHHHHHHHHHHHHhccchhhhHHHhhheee-eE----E-EEEEE--eeCCCccccceeeeeeCCccceeeeeeccCCC Confidence 0 1111222111100000 00 0 00000 00 00 00000 00 000111 Q ss_pred CCceeccccchhHHHHHH------HHHhh-cCCcchhhhhhhhhhhhhhhhhhhhhhhhhhhccceeEEechHHHHhhhc Q lcl|NC_016163. 481 NGQRLDVADDARLQFLAA------AFANR-LSDESNIKLIKKSATIKQAIYNTLSIRKLTRKDKVRVIYLKPEEVVMINR 553 (878) Q Consensus 481 ~~~~~~~~~~~~~~~~~~------~~~~~-l~~~~~IKl~K~S~T~~~~~~~~l~~~~L~~k~K~RVi~~~peeIv~In~ 553 (878) +|- .|-..-+..|+-. 
.|--+ .+..+.|||.+++|| .|.|||.+.++.-+++|+|.+|+..|| T Consensus 194 ~~~--~vi~~~~e~f~Y~~~~~~y~~~g~~~~~~~~ikI~~dAI~--------y~hSGL~d~~~~~i~gyLhkAiKp~NQ 263 (524) T protein:vir:10 194 AGT--KIVKGYKEYFIYDTAHESYACDGRMYEAGTKIKIPKAAIV--------YAHSGLVDCCGKNIIGYLHRAVKPANQ 263 (524) T ss_pred ccc--hhhcchhhheeeccCccccccCccccCCCcceecchhhee--------eeeccceeCCCCceeccchhhhHHHHh Confidence 110 1111112222200 00000 122468999999999 999999999999999999999999999 Q ss_pred CchhhhhhhhhhhhhhhhHHHHHHHHHhcCccceEEEEecC-CCChhHHHHHHHHH-hhcceeeecccCceeccccchhh Q lcl|NC_016163. 554 GHSIFDNILFFAKIYITTLLTLLMQNVLRGAPKRAVYVEVG-LDNNPANATQQAIR-DVKSKEISSITNMDMQSIINYVG 631 (878) Q Consensus 554 G~sI~dnil~~skiyltTllSLVIYRITRAPERRVFYIDVG-LPK~KAEQYMrdI~-kyKNKLVYDAsTGEVRDDrk~MS 631 (878) .+|++| +|||||||||||||||||||| |||.||||||++|| +||||+|||++||+|+|++++|| T Consensus 264 LkmlED--------------AlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl~~im~k~KNklvYDa~TGev~ddrk~ms 329 (524) T protein:vir:10 264 LKLLED--------------AVVIYRITRAPDRRVWYVDTGNMPARKAAEHMQHVMNTMKNRVVYDASTGKIKNQQHNMS 329 (524) T ss_pred hhHHHh--------------hHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeEEeCCCCeeccchhhhh Confidence 999999 999999999999999999999 99999999999985 89999999999999999999999 Q ss_pred hhhhcccccccCCCCceeeecCcccccccchHHHHHHHHHHHHHcCCCchhc---------------cccccccHHHHHH Q lcl|NC_016163. 632 EFQDYYIPVVDGEKPITFETIDALDAKSLDDDFLNWLSNNIFSGMGIPSAYL---------------TEVENVDFAKTLS 696 (878) Q Consensus 632 MLEDYWLPRREGGRGTEISTLpGqNlgei~DDvVeYFqKKLYrALNVPvSRL---------------ITRDELKFsKFI~ 696 (878) ||||||||||||||||||+||||+++++++|| |+||++|||+|||||+||| |||||+||+|||. 
T Consensus 330 MlEDyWLpRReGgrgTEItTLpGgqnlgem~D-V~YF~kkLy~aLnVP~sRl~~d~~~~f~~gr~~EItRDEikF~KFI~ 408 (524) T protein:vir:10 330 MTEDYWLQRRDGKAVTEVDTLPGADNTGNMED-VRWFRQALYMALRVPLSRIPQDQQGGVMFDSGTSITRDELTFAKFIR 408 (524) T ss_pred hHhhhcccccCCCcccceeeccccCCcChHHH-HHHHHHHHHHHhCCchhhcCCCCCccccccccchhhHHHHHHHHHHH Confidence 99999999999999999999976655555555 9999999999999999999 9999999999999 Q ss_pred HHHHHH---HHhhhHhhccchhhhhh--hhccccccccccccchhHHHHHHHhhhhHhhhhhhhhhhhhhhhHhhcchhh Q lcl|NC_016163. 697 MQNSRF---IRDIIGDQVILSKGYTE--LVRKIYNLNFKSNEVDDKSDPAKDEILTKNAKSYTDTQSLAKAAIKYFDINN 771 (878) Q Consensus 697 RLR~RF---F~DlLRtQLILKKiiTE--~IRniyn~nFksnEVdd~selkk~EILt~riksytd~Qsla~aa~KYFdv~y 771 (878) |||+|| |.++||+|||||++||+ |-.-.-++.|.....+++.|++..|||..|+..+ +.++|+++|||+++| T Consensus 409 rLR~rFs~~f~~~Lk~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l---~~~dpyvGky~s~~y 485 (524) T protein:vir:10 409 ELQHKFEEVFLDPLKTNLLLKGIITEDEWNDEINNIKIEFHRDSYFTELKEAEILERRINML---TMAEPFIGKYISHRT 485 (524) T ss_pred HHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHH---HHhhhhhcccchhHH Confidence 999999 99999999999999995 4332234555556679999999999999999988 788999999999999 Q ss_pred hheecCcccccchhhhhHHHHHHhhhhhhhhcCccccccCCcchHHHH Q lcl|NC_016163. 772 ISVKFPSPASLNMNNLSEQISNVNNFVTTLTENLTFDDTIPQDDQEKL 819 (878) Q Consensus 772 is~K~l~~t~l~m~eldeQIs~~~~~v~~~~~N~~F~~~~~QDDQ~~~ 819 (878) +++++|++++-+|.++++| |.++..++.|++ |++++|.. 
T Consensus 486 i~k~ILr~tDeei~~~~k~-------I~~E~k~~~~~~--~~~~~~~f 524 (524) T protein:vir:10 486 AMKDILQMTDEEIEQEAKQ-------IEEESKEARFQD--PDQEQEDF 524 (524) T ss_pred HHHHHhccCHHHHHHHHHH-------HHHHhhcCCCCC--CchhhhcC Confidence 9999888877655555554 567888999984 44555544 No 12 >protein:vir:7208 Length: 524 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:142 # MgeName: T4 # Cross-refs: genbank:acc:NP_049782;genbank:gi:9632594;genbank:GeneID:1258582 Probab=100.00 E-value=1.3e-121 Score=683.30 Aligned_cols=433 Identities=17% Similarity=0.249 Sum_probs=272.1 Q ss_pred chhhhhHHHHHHhhhheeeeccchhHHHhhhhhhhhcccccccccccccccccccCCC---------------------- Q lcl|NC_016163. 303 VKVGAALDKFSEALKDTFIIGDSSHILSEYSDVLLSEDMNISGFFGSAVSPFNATGEG---------------------- 360 (878) Q Consensus 303 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---------------------- 360 (878) .+. ..|.-|.-.-+. |-.. |...+ +--..|.+.|-+..|.. T Consensus 1 m~~-~~L~~~~~w~~~-----de~~----~~~~~-------~~~~~S~~~p~~~Dga~e~~~~~~~~a~~~~g~~~~~~g 63 (524) T protein:vir:72 1 MKF-NVLSLFAPWAKM-----DERN----FKDQE-------KEDLVSITAPKLDDGAREFEVSSNEAASPYNAAFQTIFG 63 (524) T ss_pred CCC-chhhHhhccccC-----cchh----hhhhh-------ccCCccccCccCCCCceeeeecccccccccceeeeehhc Confidence 111 011111000000 0000 00000 00111333333333211 Q ss_pred CCCC-CCCcc---------cCCchhhhhhhhhcCCcchhhheecCCCceEEeeecccceeeeeeEeeEecCCCcccCccc Q lcl|NC_016163. 361 NTQP-GSNRK---------LADPEKEKILKTKLGGNEKAIVKRISPGNIVDLTFEDNILGYLYLDIVEVDPDGTTMPSDK 430 (878) Q Consensus 361 ~~~~-~~~~~---------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 430 (878) ++.| .+|.+ ..+||-+..+.--. | .|||-. 
.-.+ +|+||-|.+-.+ ++ T Consensus 64 ~~e~~~~~~~eLI~~YR~ma~~pEvd~Av~eIV--n-eaiv~d-~~~~-----------------pV~l~L~~~~~s-~~ 121 (524) T protein:vir:72 64 SYEPGMKTTRELIDTYRNLMNNYEVDNAVSEIV--S-DAIVYE-DDTE-----------------VVALNLDKSKFS-PK 121 (524) T ss_pred ccccccchHHHHHHHHHHHhhccchhhHHHHhh--c-ceeEec-CCCc-----------------eEEEEecCcCcc-hH Confidence 1111 01100 11222221111000 0 112221 2233 344443333211 11 Q ss_pred c---------------cCCCCceeecccccccchhHHHHhhhcCcccccc-CC---Ccccc------cc-----ccccCC Q lcl|NC_016163. 431 V---------------DNGNEGYTFMPSQSVGNGNVLQNMVYSGKDIPID-GN---GGHSS------GA-----KNVENP 480 (878) Q Consensus 431 ~---------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~---~~~~~------~~-----~~~~~~ 480 (878) + +-.+.||.+.--.-|- |.+ |--|- || .| |=+.- -- ...+.. T Consensus 122 iK~kI~eeF~~Il~ll~F~~~~~~~fR~WYVD-gRi-----~fhKi--id~k~pk~GI~Elr~lDPr~i~~vr~i~~~~~ 193 (524) T protein:vir:72 122 IKNMMLDEFSDVLNHLSFQRKGSDHFRRWYVD-SRI-----FFHKI--IDPKRPKEGIKELRRLDPRQVQYVREIITETE 193 (524) T ss_pred HHHHHHHHHHHHHHHhccchhhhHHHhhheee-eEE-----EEEEE--EeCCCccccceeeeeeCCccceeeeeeccCCC Confidence 0 1111222111100000 000 00000 00 00 00000 00 000111 Q ss_pred CCceeccccchhHHHHHH------HHHhh-cCCcchhhhhhhhhhhhhhhhhhhhhhhhhhhccceeEEechHHHHhhhc Q lcl|NC_016163. 481 NGQRLDVADDARLQFLAA------AFANR-LSDESNIKLIKKSATIKQAIYNTLSIRKLTRKDKVRVIYLKPEEVVMINR 553 (878) Q Consensus 481 ~~~~~~~~~~~~~~~~~~------~~~~~-l~~~~~IKl~K~S~T~~~~~~~~l~~~~L~~k~K~RVi~~~peeIv~In~ 553 (878) +|- .|-..-+..|+-. 
.|--+ .+..+.|||.+++|| .|.|||.+.++.-+++|+|.+|+..|| T Consensus 194 ~~~--~vi~~~~e~f~Y~~~~~~y~~~g~~~~~~~~ikI~~dAI~--------y~hSGL~d~~~~~i~gyLhkAiKp~NQ 263 (524) T protein:vir:72 194 AGT--KIVKGYKEYFIYDTAHESYACDGRMYEAGTKIKIPKAAVV--------YAHSGLVDCCGKNIIGYLHRAVKPANQ 263 (524) T ss_pred ccc--hhhcchhhheeeccCccccccCccccCCCcceecchhhee--------eeeccceeCCCCceeccchhhhHhHHh Confidence 111 1111112222200 00001 122468999999999 999999999999999999999999999 Q ss_pred CchhhhhhhhhhhhhhhhHHHHHHHHHhcCccceEEEEecC-CCChhHHHHHHHHH-hhcceeeecccCceeccccchhh Q lcl|NC_016163. 554 GHSIFDNILFFAKIYITTLLTLLMQNVLRGAPKRAVYVEVG-LDNNPANATQQAIR-DVKSKEISSITNMDMQSIINYVG 631 (878) Q Consensus 554 G~sI~dnil~~skiyltTllSLVIYRITRAPERRVFYIDVG-LPK~KAEQYMrdI~-kyKNKLVYDAsTGEVRDDrk~MS 631 (878) .+|++| +|||||||||||||||||||| |||.||||||++|| +||||+|||++||+|+|++++|| T Consensus 264 LkmlED--------------AlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl~~im~k~KNklvYDa~TGev~ddrk~ms 329 (524) T protein:vir:72 264 LKLLED--------------AVVIYRITRAPDRRVWYVDTGNMPARKAAEHMQHVMNTMKNRVVYDASTGKIKNQQHNMS 329 (524) T ss_pred hhHHHh--------------hHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeEEeCCCCeeccchhhhh Confidence 999999 999999999999999999999 99999999999985 89999999999999999999999 Q ss_pred hhhhcccccccCCCCceeeecCcccccccchHHHHHHHHHHHHHcCCCchhc---------------cccccccHHHHHH Q lcl|NC_016163. 632 EFQDYYIPVVDGEKPITFETIDALDAKSLDDDFLNWLSNNIFSGMGIPSAYL---------------TEVENVDFAKTLS 696 (878) Q Consensus 632 MLEDYWLPRREGGRGTEISTLpGqNlgei~DDvVeYFqKKLYrALNVPvSRL---------------ITRDELKFsKFI~ 696 (878) ||||||||||||||||||+||||+++++++|| |+||++|||+|||||+||| |||||+||+|||. 
T Consensus 330 MlEDyWLpRReGgrgTEItTLpGgqnlgem~D-V~YF~kkLy~aLnVP~sRl~~d~~~~f~~gr~~EItRDEikF~KFI~ 408 (524) T protein:vir:72 330 MTEDYWLQRRDGKAVTEVDTLPGADNTGNMED-IRWFRQALYMALRVPLSRIPQDQQGGVMFDSGTSITRDELTFAKFIR 408 (524) T ss_pred hHhhhcccccCCCcccceeeccccCCcChHHH-HHHHHHHHHHHhCCchhhcCCCCCccccccccchhhHHHHHHHHHHH Confidence 99999999999999999999976655555555 9999999999999999999 9999999999999 Q ss_pred HHHHHH---HHhhhHhhccchhhhhh--hhccccccccccccchhHHHHHHHhhhhHhhhhhhhhhhhhhhhHhhcchhh Q lcl|NC_016163. 697 MQNSRF---IRDIIGDQVILSKGYTE--LVRKIYNLNFKSNEVDDKSDPAKDEILTKNAKSYTDTQSLAKAAIKYFDINN 771 (878) Q Consensus 697 RLR~RF---F~DlLRtQLILKKiiTE--~IRniyn~nFksnEVdd~selkk~EILt~riksytd~Qsla~aa~KYFdv~y 771 (878) |||+|| |.++||+|||||++||+ |-.-.-++.|.....+++.|++..|||..|+..+ +.++|+++|||+++| T Consensus 409 rLR~rFs~~f~~~Lk~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l---~~~dpyvGky~s~~y 485 (524) T protein:vir:72 409 ELQHKFEEVFLDPLKTNLLLKGIITEDEWNDEINNIKIEFHRDSYFAELKEAEILERRINML---TMAEPFIGKYISHRT 485 (524) T ss_pred HHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHH---HHhhhhhcccchhHH Confidence 999999 99999999999999995 4332234555556679999999999999999988 788999999999999 Q ss_pred hheecCcccccchhhhhHHHHHHhhhhhhhhcCccccccCCcchHHHH Q lcl|NC_016163. 772 ISVKFPSPASLNMNNLSEQISNVNNFVTTLTENLTFDDTIPQDDQEKL 819 (878) Q Consensus 772 is~K~l~~t~l~m~eldeQIs~~~~~v~~~~~N~~F~~~~~QDDQ~~~ 819 (878) +++++|++++-+|.++++| |.++..++.|++ |+++++.. 
T Consensus 486 i~k~ILr~tDeei~~~~k~-------I~~E~k~~~~~~--~~~~~~~f 524 (524) T protein:vir:72 486 AMKDILQMTDEEIEQEAKQ-------IEEESKEARFQD--PDQEQEDF 524 (524) T ss_pred HHHHHhccCHHHHHHHHHH-------HHHHhhcCCCCC--CchhhhcC Confidence 9999888877655555554 567888999984 34455544 No 13 >protein:vir:81017 Length: 521 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:1888 # MgeName: Phi1 # Cross-refs: genbank:acc:YP_001469501;genbank:gi:157311458;genbank:GeneID:5602316 Probab=100.00 E-value=3e-121 Score=681.33 Aligned_cols=436 Identities=18% Similarity=0.243 Sum_probs=289.4 Q ss_pred chhHHHhhhhhhhhccc-----ccccccccccccccccCCC----------------CCCCCCCcccCCchhhhhhhhhc Q lcl|NC_016163. 325 SSHILSEYSDVLLSEDM-----NISGFFGSAVSPFNATGEG----------------NTQPGSNRKLADPEKEKILKTKL 383 (878) Q Consensus 325 ~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~----------------~~~~~~~~~~~~~~~~~~~~~~~ 383 (878) --|+|+-.+- +...|. .++.-..|.+.|-+..|.- +.+....-.-+--+++.| ++-- T Consensus 1 ~~~~l~~~~~-~~~~~~~~~~~~~~~~~~s~~~P~~~dGa~~i~~~~~~~~~~~gg~~~~~~~~e~~~~~~~eLI-~~YR 78 (521) T protein:vir:81 1 MFSRLKMLAR-WADFDNDKYEEQIKDKAESIAAPKNNDGATEVEINDNLPASAWNSLTQQFYSTDQKISTTKQLV-NTYR 78 (521) T ss_pred CcchhhhhHh-hcCchhhhHHhhhccCccccccCCCCCCceEecccCCCcceeecceeeeecccccchhhHHHHH-HHHH Confidence 2233332221 111111 1122234555555554431 111111111111122221 1100 Q ss_pred C-----CcchhhheecCCCceEEeeecccceeeeeeEeeEecCCCcccCcc--------------cccCCCCceeecccc Q lcl|NC_016163. 384 G-----GNEKAIVKRISPGNIVDLTFEDNILGYLYLDIVEVDPDGTTMPSD--------------KVDNGNEGYTFMPSQ 444 (878) Q Consensus 384 ~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--------------~~~~~~~~~~~~~~~ 444 (878) + --+.|+ -.||+ +-|..=-+-++|+||-|.+-.+.. -.+-.+.||.+.--. 
T Consensus 79 ~ma~~pEvd~Av------~eIVn----eaiv~d~~~~pV~l~L~~~~~s~~iK~kI~eeF~~Il~ll~F~~~~~~~fR~W 148 (521) T protein:vir:81 79 GLMNNHEVENAV------QNIVN----DAIVFEEGHEVVSLNLEATGFSESVKERIHEEFKDLLNTIQFDRRGQDMFRRW 148 (521) T ss_pred HHhhccchhhHH------HHhhc----ceeEecCCCceEEEEecccccchHHHHHHHHHHHHHHHHhccchhhhHHHhhh Confidence 0 000110 11221 122222234567776665543321 012223344332111 Q ss_pred cccchhHHHHhhhcCccccccCCCccccccccc----------------cCCCCceeccccchhHHHHHHH-------HH Q lcl|NC_016163. 445 SVGNGNVLQNMVYSGKDIPIDGNGGHSSGAKNV----------------ENPNGQRLDVADDARLQFLAAA-------FA 501 (878) Q Consensus 445 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----------------~~~~~~~~~~~~~~~~~~~~~~-------~~ 501 (878) -|- |.+ |--| -||.|. +-|-+-+ ++.+|- .|...-+..|+-.. -. T Consensus 149 YVD-gRi-----~fhk--iid~~p--k~GI~Elr~lDPr~i~~vr~i~k~~~~~~--~v~~~~~e~f~Y~~~~~~~~~~g 216 (521) T protein:vir:81 149 YVD-SRI-----FFHK--IIGKNP--KDGIVELRQLDPRNLEYVREIITEDTPEG--KIYKATKEYFIYTVGNSSYCAGG 216 (521) T ss_pred hhc-ceE-----EEEE--EEcCCc--cccceeeeeeCCcceeeeeeecccccCcc--ceecceeeeeeeecCCccccccc Confidence 110 100 0001 122221 1221110 122221 11222222232100 11 Q ss_pred hhcCCcchhhhhhhhhhhhhhhhhhhhhhhhhhhccceeEEechHHHHhhhcCchhhhhhhhhhhhhhhhHHHHHHHHHh Q lcl|NC_016163. 502 NRLSDESNIKLIKKSATIKQAIYNTLSIRKLTRKDKVRVIYLKPEEVVMINRGHSIFDNILFFAKIYITTLLTLLMQNVL 581 (878) Q Consensus 502 ~~l~~~~~IKl~K~S~T~~~~~~~~l~~~~L~~k~K~RVi~~~peeIv~In~G~sI~dnil~~skiyltTllSLVIYRIT 581 (878) ...+..+.+||.++++| .|.|||.+.++.-+++|+|.+|+..||.+|++| +||||||| T Consensus 217 ~~~~~~~~vkI~~dAI~--------y~hSGl~d~~~~~i~syLhkAiKp~NQLkm~ED--------------AlVIYRit 274 (521) T protein:vir:81 217 QVFSPNSRVKIPRSAIT--------YAHSGLMDCDDKYIIGYLHRAVKPANQLKLLED--------------AMVVYRIT 274 (521) T ss_pred eeecCCcceeechhhee--------eeeccceeCCCCeeeecchhhhHhHHhhHHHHh--------------hHHHHhhh Confidence 12355678999999999 899999999999999999999999999999999 99999999 Q ss_pred cCccceEEEEecC-CCChhHHHHHHHHH-hhcceeeecccCceeccccchhhhhhhcccccccCCCCceeeecCcccccc Q lcl|NC_016163. 
582 RGAPKRAVYVEVG-LDNNPANATQQAIR-DVKSKEISSITNMDMQSIINYVGEFQDYYIPVVDGEKPITFETIDALDAKS 659 (878) Q Consensus 582 RAPERRVFYIDVG-LPK~KAEQYMrdI~-kyKNKLVYDAsTGEVRDDrk~MSMLEDYWLPRREGGRGTEISTLpGqNlge 659 (878) ||||||||||||| |||.||||||++|| +||||+|||++||+|+|++++||||||||||||||||||||+||||+++++ T Consensus 275 RAPeRRvFYIDvGnlpk~KAeqYl~~im~k~kNklvYDa~TGev~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnlg 354 (521) T protein:vir:81 275 RAPERRVFFIDTGNMNNRKAAQHMNSVAQSFKNRVVYDASTGKLKNQQANLSMTEDYWLQRRDGKAITDVTTLPGASGMS 354 (521) T ss_pred ccccceEEEEecCCCCchhHHHHHHHHHHhcCceeEeecccccccccccccchhhhhcccccCCCcccceeecccCCCCC Confidence 9999999999999 99999999999985 899999999999999999999999999999999999999999997655555 Q ss_pred cchHHHHHHHHHHHHHcCCCchhc---------------cccccccHHHHHHHHHHHH---HHhhhHhhccchhhhhh-- Q lcl|NC_016163. 660 LDDDFLNWLSNNIFSGMGIPSAYL---------------TEVENVDFAKTLSMQNSRF---IRDIIGDQVILSKGYTE-- 719 (878) Q Consensus 660 i~DDvVeYFqKKLYrALNVPvSRL---------------ITRDELKFsKFI~RLR~RF---F~DlLRtQLILKKiiTE-- 719 (878) ++|| |+||++|||+|||||+||| |||||+||+|||.|||+|| |.++||+|||||++||+ T Consensus 355 em~D-V~YF~kkLy~aLnVP~sRl~~e~~~~~~~Gr~~EItRDEiKF~KFI~rLR~rFs~lf~~~L~~qLilKgiit~ee 433 (521) T protein:vir:81 355 DIDD-IRYFNRKLYEALRVPLSRSNLSDANMVIGGDGSEITRDELEFSKFIRTRQSQFSEVLRDPLKYNLILKNVITEDD 433 (521) T ss_pred hHHH-HHHHHHHHHHHhCCccccccCCCCcceeccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcCCCHHH Confidence 5555 9999999999999999999 9999999999999999999 99999999999999995 Q ss_pred hhccccccccccccchhHHHHHHHhhhhHhhhhhhhhhhhhhhhHhhcchhhhheecCcccccchhhhhHHHHHHhhhhh Q lcl|NC_016163. 720 LVRKIYNLNFKSNEVDDKSDPAKDEILTKNAKSYTDTQSLAKAAIKYFDINNISVKFPSPASLNMNNLSEQISNVNNFVT 799 (878) Q Consensus 720 ~IRniyn~nFksnEVdd~selkk~EILt~riksytd~Qsla~aa~KYFdv~yis~K~l~~t~l~m~eldeQIs~~~~~v~ 799 (878) |-.-.-++.|.....+++.|++..|||..|+..+ +.++|+++|||+++||++++|++++-+|+++++| |. 
T Consensus 434 w~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l---~~~dpyvGky~s~dyi~k~ILr~tDeei~~~~k~-------I~ 503 (521) T protein:vir:81 434 WDREINNIKVVFHRDSYYTEVKDAEILERRIGLI---ERITPYIGKYFSNQTVMRDILKYTDDQMDTEKKQ-------IE 503 (521) T ss_pred HHHHhhcceEEEeecchHHHHHHHHHHHHHHHHH---HHhhhhhccccchHHHHHHHhccCHHHHHHHHHH-------HH Confidence 4333334555556679999999999999999988 7889999999999999999888877665555554 56 Q ss_pred hhhcCccccccCCcchHHHH Q lcl|NC_016163. 800 TLTENLTFDDTIPQDDQEKL 819 (878) Q Consensus 800 ~~~~N~~F~~~~~QDDQ~~~ 819 (878) ++..++.|.+ |+++.+-. T Consensus 504 ~E~~~~~~~~--p~~~~~~f 521 (521) T protein:vir:81 504 EEANDPRFKQ--TPDEIEDF 521 (521) T ss_pred HHhhCCCCCC--CcccccCC Confidence 7888999983 56666655 No 14 >protein:vir:6596 Length: 521 # NCBI annotation: portal vertex protein of head # Family: family:all:1036 # MgeID: mge:139 # MgeName: RB49 # Cross-refs: genbank:acc:NP_891727;genbank:gi:33620636;genbank:GeneID:1725288 Probab=100.00 E-value=4.4e-121 Score=680.40 Aligned_cols=440 Identities=17% Similarity=0.240 Sum_probs=287.8 Q ss_pred chhHHHhhhhhhhhccc-----ccccccccccccccccCCCCCCC----------CCCcccCCchhhhhhhhhcCCcchh Q lcl|NC_016163. 325 SSHILSEYSDVLLSEDM-----NISGFFGSAVSPFNATGEGNTQP----------GSNRKLADPEKEKILKTKLGGNEKA 389 (878) Q Consensus 325 ~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~----------~~~~~~~~~~~~~~~~~~~~~~~~~ 389 (878) --|+|+-.+- +.++|- .++.-..|.++|-++.|.--.++ |-....-+-+- ++ -|++. T Consensus 1 ~~~~l~~~~~-~~~~d~~~~~e~~~~~~~s~~~p~~~dGa~~i~~~~~~~~~~~~g~~~~~~~~e~-~~------~~~~e 72 (521) T protein:vir:65 1 MFSRLKMLAR-WADFDNDKYEEQIKDKAESIAAPKNNDGATEVEINDNSPASSWNSLTQQFYSTDQ-KI------STTKQ 72 (521) T ss_pred Cccchhhhhh-ccCchhhHHHhhhccCCCcccCCCCCCCceeecccCCccccccccceeeeccccc-hh------hhHHH Confidence 2233332211 111111 12222344455554444321111 11111111110 00 01111 Q ss_pred hheec-----CC--CceEEeeecccceeeeeeEeeEecCCCcccCcc--------------cccCCCCceeecccccccc Q lcl|NC_016163. 
390 IVKRI-----SP--GNIVDLTFEDNILGYLYLDIVEVDPDGTTMPSD--------------KVDNGNEGYTFMPSQSVGN 448 (878) Q Consensus 390 ~~~~~-----~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--------------~~~~~~~~~~~~~~~~~~~ 448 (878) .+++- .| -+-|+--..+-|..=-+-++|+||-|.+-.+.. -.+-.+.||.+.--.-| . T Consensus 73 LI~~YR~ma~~pEvd~Av~eIVneaiv~d~~~~pV~l~L~~~~~s~~iK~kI~eeF~~Il~ll~F~~~~~~~fR~WYV-D 151 (521) T protein:vir:65 73 LVNTYRGLMNNHEVENAVQNIVNDAIVFEEGHEVVSLNLEATGFSESVKERIHEEFKDLLNTIQFDRRGQDMFRRWYV-D 151 (521) T ss_pred HHHHHHHHhhccchhhHHHHhhcceeEecCCCceEEEEecccccchHHHHHHHHHHHHHHHHhccchhhhHHHhhhhh-c Confidence 11110 00 011111111112222234566766665543321 01222334333211111 0 Q ss_pred hhHHHHhhhcCccccccCCCcccccccc----------------ccCCCCceeccccchhHHHHHH-------HHHhhcC Q lcl|NC_016163. 449 GNVLQNMVYSGKDIPIDGNGGHSSGAKN----------------VENPNGQRLDVADDARLQFLAA-------AFANRLS 505 (878) Q Consensus 449 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~----------------~~~~~~~~~~~~~~~~~~~~~~-------~~~~~l~ 505 (878) |.+ |--| -||.|. +-|-+- .++++|- .|-..-+..|+-. +-..-.+ T Consensus 152 gRi-----~fhk--iid~~p--k~GI~ELr~lDPr~i~~vr~i~k~~~~~~--~v~~~~~e~f~Y~~~~~~~~~~g~~~~ 220 (521) T protein:vir:65 152 SRI-----FFHK--IIGKNP--KDGIVELRQLDPRNLEYVREIITEDTPEG--KIYKATKEYFIYTVGNSSYCAGGQVFS 220 (521) T ss_pred cee-----EEEE--EEcCCc--cccceeeeeeCCcceeeeeeecccccCCc--ceecceeeeeeeecCCcceeccceeec Confidence 110 0001 122221 111111 0112111 1111111122210 0011234 Q ss_pred CcchhhhhhhhhhhhhhhhhhhhhhhhhhhccceeEEechHHHHhhhcCchhhhhhhhhhhhhhhhHHHHHHHHHhcCcc Q lcl|NC_016163. 
506 DESNIKLIKKSATIKQAIYNTLSIRKLTRKDKVRVIYLKPEEVVMINRGHSIFDNILFFAKIYITTLLTLLMQNVLRGAP 585 (878) Q Consensus 506 ~~~~IKl~K~S~T~~~~~~~~l~~~~L~~k~K~RVi~~~peeIv~In~G~sI~dnil~~skiyltTllSLVIYRITRAPE 585 (878) ..+.+||.++++| .|.|||.+.++..+++|+|.+|+..||.+|++| +||||||||||| T Consensus 221 ~~~~vkI~~dAI~--------y~hSGl~d~~~~~i~syLhkAiKp~NQLkm~ED--------------AlVIYRitRAPe 278 (521) T protein:vir:65 221 PNSRVKIPRSAIT--------YAHSGLMDCDDKYIIGYLHRAVKPANQLKLLED--------------AMVVYRITRAPE 278 (521) T ss_pred CCcceeechhhee--------eeeccceeCCCCeeeecchhhhHhHHhhHHHHh--------------hHHHHhhhcccc Confidence 4577999999999 899999999999999999999999999999999 999999999999 Q ss_pred ceEEEEecC-CCChhHHHHHHHHH-hhcceeeecccCceeccccchhhhhhhcccccccCCCCceeeecCcccccccchH Q lcl|NC_016163. 586 KRAVYVEVG-LDNNPANATQQAIR-DVKSKEISSITNMDMQSIINYVGEFQDYYIPVVDGEKPITFETIDALDAKSLDDD 663 (878) Q Consensus 586 RRVFYIDVG-LPK~KAEQYMrdI~-kyKNKLVYDAsTGEVRDDrk~MSMLEDYWLPRREGGRGTEISTLpGqNlgei~DD 663 (878) ||||||||| |||.||||||++|| +||||+|||++||+|+|++++||||||||||||||||||||+||||+++++++|| T Consensus 279 RRvFYIDvGnlPk~KAeqYl~~im~k~kNklvYDa~TGev~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnlgem~D 358 (521) T protein:vir:65 279 RRVFFIDTGNMNNRKAAQHMNSVAQSFKNRVVYDASTGKLKNQQANLSMTEDYWLQRRDGKAITDVTTLPGASGMSDIDD 358 (521) T ss_pred ceEEEEecCCCCchhHHHHHHHHHHhcCceeEeecccccccccccccchhhhhcccccCCCCccceeecccCCCcChHHH Confidence 999999999 99999999999985 8999999999999999999999999999999999999999999976555455555 Q ss_pred HHHHHHHHHHHHcCCCchhc---------------cccccccHHHHHHHHHHHH---HHhhhHhhccchhhhhh--hhcc Q lcl|NC_016163. 
664 FLNWLSNNIFSGMGIPSAYL---------------TEVENVDFAKTLSMQNSRF---IRDIIGDQVILSKGYTE--LVRK 723 (878) Q Consensus 664 vVeYFqKKLYrALNVPvSRL---------------ITRDELKFsKFI~RLR~RF---F~DlLRtQLILKKiiTE--~IRn 723 (878) |+||++|||+|||||+||| |||||+||+|||.|||+|| |.++||+|||||++||+ |-.- T Consensus 359 -V~YF~kkLy~aLnVP~sRl~~e~~~~~~~gr~~EItRDEiKF~KFI~rLR~rFs~lf~~~L~~qLilKgiit~eew~~i 437 (521) T protein:vir:65 359 -IRYFNRKLYEALRVPLSRSNLSDANMVIGGDGSEITRDELEFSKFIRTLQSQFSEVLRDPLKYNLILKNVITEDDWDRE 437 (521) T ss_pred -HHHHHHHHHHHhCCCceeccCCCCcceeccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcCCCHHHHHHH Confidence 9999999999999999997 9999999999999999999 99999999999999995 4333 Q ss_pred ccccccccccchhHHHHHHHhhhhHhhhhhhhhhhhhhhhHhhcchhhhheecCcccccchhhhhHHHHHHhhhhhhhhc Q lcl|NC_016163. 724 IYNLNFKSNEVDDKSDPAKDEILTKNAKSYTDTQSLAKAAIKYFDINNISVKFPSPASLNMNNLSEQISNVNNFVTTLTE 803 (878) Q Consensus 724 iyn~nFksnEVdd~selkk~EILt~riksytd~Qsla~aa~KYFdv~yis~K~l~~t~l~m~eldeQIs~~~~~v~~~~~ 803 (878) .-++.|.....+++.|++..|||..|+..+ +.++|+++||||++|+++++|++++-+|+++++| |.++.. T Consensus 438 ~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l---~~~dpyvGky~S~dyi~k~ILr~tDeei~~~~k~-------I~~E~~ 507 (521) T protein:vir:65 438 INNIKVVFHRDSYYTEVKDAEILERRIGLI---ERITPYIGKYFSNQTVMRDILKYTDDQMDTEKKQ-------IEEEAN 507 (521) T ss_pred hhcceEEeeecchHHHHHHHHHHHHHHHHH---HHhhhhhccccchHHHHHHHhccCHHHHHHHHHH-------HHHhhh Confidence 334555556679999999999999999988 7889999999999999999988877665555554 567888 Q ss_pred CccccccCCcchHHHH Q lcl|NC_016163. 804 NLTFDDTIPQDDQEKL 819 (878) Q Consensus 804 N~~F~~~~~QDDQ~~~ 819 (878) ++.|.+ |+++.|-. 
T Consensus 508 ~~~~~~--p~~~~~~f 521 (521) T protein:vir:65 508 DPRFKQ--TPDEIEDF 521 (521) T ss_pred CCCCCC--CcccccCC Confidence 999973 67777765 No 15 >protein:vir:106999 Length: 564 # NCBI annotation: portal vertex protein gp20 # Family: family:all:1036 # MgeID: mge:1459 # MgeName: S-PM2 # Cross-refs: genbank:acc:YP_195138;genbank:gi:58532915;interpro:IPR010823;uniprot:Q5GQN4;genbank:GeneID:3260496 Probab=100.00 E-value=1.2e-120 Score=678.07 Aligned_cols=483 Identities=16% Similarity=0.210 Sum_probs=283.0 Q ss_pred hhhhhccccccccchhhhhHHHHHHhhhheeeeccchhHHHhhhhhhhhcccccccccccccccccccCCCCCCCCCCc- Q lcl|NC_016163. 290 FKDAFGIEKEATGVKVGAALDKFSEALKDTFIIGDSSHILSEYSDVLLSEDMNISGFFGSAVSPFNATGEGNTQPGSNR- 368 (878) Q Consensus 290 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~- 368 (878) ....||..-+....+-|. .++-=+--|-.++. -+||+|..|--. |. +...|. T Consensus 1 m~~lfgf~i~~~~~~~~~-----------S~vpp~~~~~~~~i----------~~g~~g~~v~~~-----g~-~~~~n~~ 53 (564) T protein:vir:10 1 MSQLFGFLINEKEGQKGQ-----------SPVPPNDEASVSTV----------AGGYFGTYVDTS-----GG-QNSRNEY 53 (564) T ss_pred CcchhcceeeeeccCCCC-----------CcccCCcCCChhhh----------hccccceeeecc-----cc-cchhhHH Confidence 444555444332211000 01100000000100 134555443211 11 111111 Q ss_pred c--------cCCchhhhhhhhhcCCcchhhheecCCCceEEeeecccceeeeeeEeeEecCCCcccCcc----------- Q lcl|NC_016163. 369 K--------LADPEKEKILKTKLGGNEKAIVKRISPGNIVDLTFEDNILGYLYLDIVEVDPDGTTMPSD----------- 429 (878) Q Consensus 369 ~--------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----------- 429 (878) . ..+||-+..+.--. || |||-. -+-++|+||-+.+-.+.. 
T Consensus 54 eLI~~YR~ma~~pEVd~Av~eIV--ne-aIv~d------------------~~~~pV~vdL~~~~~s~siK~kI~eEF~~ 112 (564) T protein:vir:10 54 ELIRRYRDMSLHPEVDSAIDEIV--NE-FVVND------------------GDDKPVEVDLQNLEIGSGVKKKIRDEFNR 112 (564) T ss_pred HHHHHHHHHhhccchhhHHHHhh--cc-eeEec------------------CCCceEEEEecccCcchHHHHHHHHHHHH Confidence 1 12333332221111 11 22211 122334444433332221 Q ss_pred ---cccCCCCceeecccccccc----------hh---HHHHhhhc--------Ccccc-ccCCCc--cccccccccCCCC Q lcl|NC_016163. 430 ---KVDNGNEGYTFMPSQSVGN----------GN---VLQNMVYS--------GKDIP-IDGNGG--HSSGAKNVENPNG 482 (878) Q Consensus 430 ---~~~~~~~~~~~~~~~~~~~----------~~---~~~~~~~~--------~~~~~-~~~~~~--~~~~~~~~~~~~~ 482 (878) -.+-.+.||.+.-..-|-. .| -.+-+.|- -..++ .+-+|. -+..+--..++.+ T Consensus 113 Il~ll~F~~~~~e~fR~WYVDgRi~fHkiid~~~pk~GI~eLr~lDPr~i~~vr~i~~~~~~~~~~v~k~~~~~~~y~~~ 192 (564) T protein:vir:10 113 ILRMMNFNVNAHEIIRNWYVDGRSHYHKVIDLDNPKKGILELRYIDSLKIRKVRQKLKDVDPNRKEIEKGTALQYDYGDF 192 (564) T ss_pred HHHHhccchhhhHHHhhhhhcceEEEEEEeeCCChhhhhhhhhhhcccceeeeeeeccccccccceeeeeeeeecccccc Confidence 0011122221111000000 00 00000000 00000 000000 0000100111111 Q ss_pred ceeccccchhH--HHHHHHHHhhcCCcchhhhhhhhhhhhhhhhhhhhhhhhhhhccceeEEechHHHHhhhcCchhhhh Q lcl|NC_016163. 483 QRLDVADDARL--QFLAAAFANRLSDESNIKLIKKSATIKQAIYNTLSIRKLTRKDKVRVIYLKPEEVVMINRGHSIFDN 560 (878) Q Consensus 483 ~~~~~~~~~~~--~~~~~~~~~~l~~~~~IKl~K~S~T~~~~~~~~l~~~~L~~k~K~RVi~~~peeIv~In~G~sI~dn 560 (878) +...+-+..-. .....+=++-.+....+||.++++| .|.|||.+.++.-+++|+|.+|+..||.+||+| T Consensus 193 ~Eyy~Ynp~~~~g~~~~~~~~~~~~~~~~ikI~~daI~--------y~hSGL~d~~~~~i~gyLhkAIKp~NQLkmlED- 263 (564) T protein:vir:10 193 IEYYIYNPKGFAGNIPMVTGSMDWSNQEGIKIASDAIA--------QSTSGLMDLNKKMTLSFLHKAIKSLNQLRMIED- 263 (564) T ss_pred ccceeeccccccCcccccccccccccccceeechhhcc--------eecccceeCCCCceeccchhhhHhHHhhHHHHh- Confidence 11111110000 0000111122344567999999999 999999999999999999999999999999999 Q ss_pred hhhhhhhhhhhHHHHHHHHHhcCccceEEEEecC-CCChhHHHHHHHHH-hhcceeeecccCceeccccchhhhhhhccc Q lcl|NC_016163. 
561 ILFFAKIYITTLLTLLMQNVLRGAPKRAVYVEVG-LDNNPANATQQAIR-DVKSKEISSITNMDMQSIINYVGEFQDYYI 638 (878) Q Consensus 561 il~~skiyltTllSLVIYRITRAPERRVFYIDVG-LPK~KAEQYMrdI~-kyKNKLVYDAsTGEVRDDrk~MSMLEDYWL 638 (878) +|||||||||||||||||||| |||.||||||++|| +||||+|||++||+|+|+++|||||||||| T Consensus 264 -------------AlVIYRitRAPeRRvFYIDVGnLPk~KAeqYlr~iM~k~KNklVYDa~TGevrddrk~msMlEDyWL 330 (564) T protein:vir:10 264 -------------SLVIYRLSRAPERRIFYIDVGNLPKVKAEQYLRDVMSRYRNKLVYDGQTGEIRDDKKHMSMLEDFWL 330 (564) T ss_pred -------------hHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceEEEeccCceecccchhhhhHhhhcc Confidence 999999999999999999999 99999999999985 899999999999999999999999999999 Q ss_pred ccccCCCCceeeecCcccccccchHHHHHHHHHHHHHcCCCchhc--------------cccccccHHHHHHHHHHHH-- Q lcl|NC_016163. 639 PVVDGEKPITFETIDALDAKSLDDDFLNWLSNNIFSGMGIPSAYL--------------TEVENVDFAKTLSMQNSRF-- 702 (878) Q Consensus 639 PRREGGRGTEISTLpGqNlgei~DDvVeYFqKKLYrALNVPvSRL--------------ITRDELKFsKFI~RLR~RF-- 702 (878) |||||||||||+||||+++++++|| |+||++|||+|||||+||| |||||+||+|||.|||+|| T Consensus 331 PRReGgrgTEItTLpGgqnLgem~D-V~YF~kKLY~aLnVP~SRl~~e~~~f~~Gr~~EItRDEiKF~KFI~RLR~rFs~ 409 (564) T protein:vir:10 331 PRREGGRGTEITTLPGGQNLGELKD-VEYFKKKLYNSLNLPPSRLTDDNKAFNLGKSTEILRDELKFTKFIGRLRKRFAQ 409 (564) T ss_pred cccCCCcccceeeccccCCcchHHH-HHHHHHHHHHHhCCCcccccCCCceeecccccchhHHHHHHHHHHHHHHHHHHH Confidence 9999999999999975545445555 9999999999999999999 9999999999999999999 Q ss_pred -HHhhhHhhccchhhhhh--hhccccccccccccchhHHHHHHHhhhhHhhhhhhhhhhhhhhhHhhcchhhhheecCcc Q lcl|NC_016163. 
703 -IRDIIGDQVILSKGYTE--LVRKIYNLNFKSNEVDDKSDPAKDEILTKNAKSYTDTQSLAKAAIKYFDINNISVKFPSP 779 (878) Q Consensus 703 -F~DlLRtQLILKKiiTE--~IRniyn~nFksnEVdd~selkk~EILt~riksytd~Qsla~aa~KYFdv~yis~K~l~~ 779 (878) |.++||+|||||++||+ |-.-.-++.|.....+++.|++..|||..|+..+ +.++|+++||||++||++++|++ T Consensus 410 lF~~~Lk~qLiLKgiit~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l---~~~dpyvGky~S~dyi~k~ILr~ 486 (564) T protein:vir:10 410 LFHDILKTQLILKGIITPEDWDDMEEHIQYDFLFDNHFNELKEQEMQLQRVNLA---TQMDPFVGKYFSTEYIRRKILMQ 486 (564) T ss_pred HHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHH---HHhhhhhccccchHHHHHHHhcc Confidence 99999999999999995 4332244555556679999999999999999988 77899999999999999999999 Q ss_pred cccchhhhhHHHHHHhhhhhhhhcCccccccCCcchHHHHHHHHHHHhhhccCC--CCCHH-HHHHHHHHHHHhhcchhh Q lcl|NC_016163. 780 ASLNMNNLSEQISNVNNFVTTLTENLTFDDTIPQDDQEKLKKKLIMRFTKKNLP--NVDWD-ELDSIMDEIVREYTGEKV 856 (878) Q Consensus 780 t~l~m~eldeQIs~~~~~v~~~~~N~~F~~~~~QDDQ~~~~~~~~~~~~~~~~~--~~~~~-~~~~~~~~~~~~~~~~~~ 856 (878) ++-+|+++++||. ++-.++.|.+. +..+.+- ++| +--.. ++...++.- T Consensus 487 tDeei~~~~kqI~-------~E~k~~~~~~P---~e~~~~~----------~~~~~~~~~~p~~~~~~~~~--------- 537 (564) T protein:vir:10 487 TENEFKEIDKQMK-------SDIESGLAIDP---IQVNMLD----------DMEKQNQAFAPELQAAQDDL--------- 537 (564) T ss_pred CHHHHHHHHHHHH-------HHhhcCCCCCc---hhhhcCC----------CccCCCCcCCcchhhhcccc--------- Confidence 9888887777764 34445666632 1111110 111 10000 111111110 Q ss_pred hhhhccchhhcc----cccccccCCC Q lcl|NC_016163. 
857 EKSISTSEEENE----EGSEDVGGGF 878 (878) Q Consensus 857 ~~~~~~~~~~~~----~~~~~~~~~~ 878 (878) ....+++-- +..+...|+- T Consensus 538 ---~~~~~~~~~~~a~~~~~~~~~~~ 560 (564) T protein:vir:10 538 ---AAEREIKKLNSAPKPPPSQQSKS 560 (564) T ss_pred ---ccccChhhhccCCCCCCCCCCcC Confidence 001111111 1111122222 No 16 >protein:vir:5665 Length: 511 # NCBI annotation: portal vertex protein of head # Family: family:all:1036 # MgeID: mge:119 # MgeName: KVP40 # Cross-refs: genbank:acc:NP_899604;genbank:gi:34419591;genbank:GeneID:2546036 Probab=100.00 E-value=2.6e-120 Score=676.23 Aligned_cols=401 Identities=18% Similarity=0.262 Sum_probs=270.6 Q ss_pred chhhhhHHHHHHhhhheeeeccchhHHHhhhhhhhhcccccccccccccccccccCC-------CCCCCC---------- Q lcl|NC_016163. 303 VKVGAALDKFSEALKDTFIIGDSSHILSEYSDVLLSEDMNISGFFGSAVSPFNATGE-------GNTQPG---------- 365 (878) Q Consensus 303 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-------~~~~~~---------- 365 (878) .|.-+--|. .+|.. .++.-..|.+.|=+..|. .+.+.| T Consensus 1 ~~~w~~~de------------------~~~~~-------~~~~~~~S~~~p~~~DGa~~i~~~~~~~~~~g~~~~~~~~~ 55 (511) T protein:vir:56 1 MKFWTKEEE------------------QDIQK-------IEKNPVRSFSAPDNVDGAKEIHTNLLAPQLGHAIIPSDAQS 55 (511) T ss_pred CCCccchhh------------------hhhhh-------hccCCcccccCCCCCCCceEEecccccceecceeccccccc Confidence 111111000 00111 122233445555554442 011111 Q ss_pred ----CCccc--------CCchhhhhhhhhcCCcchhhheecCCCceEEeeecccceeeeeeEeeEecCCCcccCcc---- Q lcl|NC_016163. 366 ----SNRKL--------ADPEKEKILKTKLGGNEKAIVKRISPGNIVDLTFEDNILGYLYLDIVEVDPDGTTMPSD---- 429 (878) Q Consensus 366 ----~~~~~--------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---- 429 (878) +++.| .+||-+..+.--. | .|||-. .-.++|. ||-|++-.+.. 
T Consensus 56 ~~~~~~~eLI~~YR~ma~~pEvd~Av~eIv--n-e~iv~d-~~~~pV~-----------------l~ld~~~~s~~iK~k 114 (511) T protein:vir:56 56 EGTIPVKELIKSYRALAEYHEVDDAIQEIV--D-EAIVYE-NDKEVVW-----------------LNLDNTDFSENIKAK 114 (511) T ss_pred cCccchHHHHHHHHHHhhccchhhHHHHhh--c-ceeEec-CCCceEE-----------------EEecccCcchHHHHH Confidence 01111 1222221111000 0 122222 2233444 33333332111 Q ss_pred ----------cccCCCCceeeccccccc------------------------------------------chhHHHHhhh Q lcl|NC_016163. 430 ----------KVDNGNEGYTFMPSQSVG------------------------------------------NGNVLQNMVY 457 (878) Q Consensus 430 ----------~~~~~~~~~~~~~~~~~~------------------------------------------~~~~~~~~~~ 457 (878) -.+-.+.||.+.--.-|- -++++...+| T Consensus 115 I~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fHkiid~k~GI~eLr~lDPr~i~~vr~i~~~~~~~~~v~~~~~ey~~Y 194 (511) T protein:vir:56 115 INEEFDRVVSLLQMRKHGYKWFRKWYVDSRIYFHKILDKDNNIIELRPLNPMKMELVREIQKETIDGVEVVKGTLEYYVY 194 (511) T ss_pred HHHHHHHHHHHhccchhhhHHHhhhhhcceEEEEEEeccccceeehhhcCcccchhhhhhhcccccccccccceeeeeEe Confidence 001112222111100000 0012222233 Q ss_pred cCccccccCCCccccccccccCCCCceeccccchhHHHHHHHHHhhcCCcchhhhhhhhhhhhhhhhhhhhhhhhhh--h Q lcl|NC_016163. 458 SGKDIPIDGNGGHSSGAKNVENPNGQRLDVADDARLQFLAAAFANRLSDESNIKLIKKSATIKQAIYNTLSIRKLTR--K 535 (878) Q Consensus 458 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~IKl~K~S~T~~~~~~~~l~~~~L~~--k 535 (878) . |.|-.- +.+.. .......++||.++++| .|.|||.+ + T Consensus 195 ~---------------------~~~~~~----~~~~~-------~~~~~~~~vkI~~daI~--------y~hSGL~d~~~ 234 (511) T protein:vir:56 195 K---------------------QSDYKM----PSWMS-------ATNRAQTSFRIPKDAIV--------FAHSGLMRGCA 234 (511) T ss_pred c---------------------CCCccc----Ccccc-------cccccccceeechhhee--------eecccceeccC Confidence 2 111000 00000 00012466999999999 99999998 8 Q ss_pred ccceeEEechHHHHhhhcCchhhhhhhhhhhhhhhhHHHHHHHHHhcCccceEEEEecC-CCChhHHHHHHHHH-hhcce Q lcl|NC_016163. 
536 DKVRVIYLKPEEVVMINRGHSIFDNILFFAKIYITTLLTLLMQNVLRGAPKRAVYVEVG-LDNNPANATQQAIR-DVKSK 613 (878) Q Consensus 536 ~K~RVi~~~peeIv~In~G~sI~dnil~~skiyltTllSLVIYRITRAPERRVFYIDVG-LPK~KAEQYMrdI~-kyKNK 613 (878) |+..+++|+|.+|+..||.+|++| +|||||||||||||||||||| |||.||||||++|| +|||| T Consensus 235 ~~g~i~syLhkAiKp~NQLkm~ED--------------AlVIYRitRAPeRRvFYIDVGnLPk~KAeqYl~~iM~k~kNk 300 (511) T protein:vir:56 235 DDPYIIGYLDRAIKPANQLKMLED--------------ALVIYRLARAPERRVFYVDVGNLPTQKAQQYVNGIMQNVKNR 300 (511) T ss_pred CCCeeeccchhhhHHHHhhHHHHh--------------hHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCce Confidence 999999999999999999999999 999999999999999999999 99999999999986 89999 Q ss_pred eeecccCceeccccchhhhhhhcccccccCCCCceeeecCcccccccchHHHHHHHHHHHHHcCCCchhc---------- Q lcl|NC_016163. 614 EISSITNMDMQSIINYVGEFQDYYIPVVDGEKPITFETIDALDAKSLDDDFLNWLSNNIFSGMGIPSAYL---------- 683 (878) Q Consensus 614 LVYDAsTGEVRDDrk~MSMLEDYWLPRREGGRGTEISTLpGqNlgei~DDvVeYFqKKLYrALNVPvSRL---------- 683 (878) +|||++||+|+|++++||||||||||||||||||||+||||+++++++|| |+||++|||+|||||+||| T Consensus 301 lVYDa~TGev~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnlgem~D-V~YF~kKLy~aLnVP~SRl~~e~q~~~f~ 379 (511) T protein:vir:56 301 VVYDTQTGQVKNTTNAMSMLEDYYLPRREGSKGTEVSTLPGGQSLGDIED-VLYFNRKLYKAMRIPTSRAASEDQTGGIN 379 (511) T ss_pred EEEeccCceeccchhhhhhHhhhcccccCCCCccceeeccccCCcChHHH-HHHHHHHHHHHhCCCcccccCCCCccccc Confidence 99999999999999999999999999999999999999976655555555 9999999999999999999 Q ss_pred ------cccccccHHHHHHHHHHHH---HHhhhHhhccchhhhhh--hhccccccccccccchhHHHHHHHhhhhHhhhh Q lcl|NC_016163. 684 ------TEVENVDFAKTLSMQNSRF---IRDIIGDQVILSKGYTE--LVRKIYNLNFKSNEVDDKSDPAKDEILTKNAKS 752 (878) Q Consensus 684 ------ITRDELKFsKFI~RLR~RF---F~DlLRtQLILKKiiTE--~IRniyn~nFksnEVdd~selkk~EILt~riks 752 (878) |||||+||+|||.|||+|| |.++||+|||||++||+ |-.-.-++.|.....+++.|++..|||..|+.. 
T Consensus 380 ~Gr~~EItRDEiKF~KFI~RLR~rFs~lF~~~Lk~qLilKgiit~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~ 459 (511) T protein:vir:56 380 FGQGAEITRDELKFTKFVKRLQTKFETVITDPLKHQLIVNNIITEEEWDANHEKLYVVFNQDSYFEEAKELEILNSRMNA 459 (511) T ss_pred cccchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHH Confidence 9999999999999999999 99999999999999995 433223455555667999999999999999998 Q ss_pred hhhhhhhhhhhHhhcchhhhheecCcccccchhhhhHHHHHHhhhhhhhhcCccccccCCcchH Q lcl|NC_016163. 753 YTDTQSLAKAAIKYFDINNISVKFPSPASLNMNNLSEQISNVNNFVTTLTENLTFDDTIPQDDQ 816 (878) Q Consensus 753 ytd~Qsla~aa~KYFdv~yis~K~l~~t~l~m~eldeQIs~~~~~v~~~~~N~~F~~~~~QDDQ 816 (878) + +.++|+++||||++|+++++|++++-+|+++++| |.++..+|.|+. |.+|- T Consensus 460 l---~~~dpyvGky~S~~yi~k~ILr~tDeei~~~~k~-------I~~E~k~~~~~~--~e~~f 511 (511) T protein:vir:56 460 M---RDIQDYAGKYYSHKYIQKNILRLSDDQITAMQSE-------IDEEETNPRFQQ--DDQGF 511 (511) T ss_pred H---HHhcchhccccchHHHHHHHhccCHHHHHHHHHH-------HHHhhcCCCCCC--cccCC Confidence 8 7889999999999999999888877666555555 567888899983 33332 No 17 >protein:vir:5839 Length: 533 # NCBI annotation: similar to portal vertex protein of head # Family: family:all:1036 # MgeID: mge:123 # MgeName: RM 378 # Cross-refs: genbank:acc:NP_835625;genbank:gi:30044028 Probab=100.00 E-value=9.9e-108 Score=607.22 Aligned_cols=472 Identities=17% Similarity=0.204 Sum_probs=284.0 Q ss_pred hcccccc-chhhhhhccchhhhhhhcccccc----cccchhHhhhhhhhccccccc-cc-hhhhhHHHHHHhhhheeeec Q lcl|NC_016163. 251 VDSYKTL-NLSEGLLNSADKSQLLSENASFS----KQGSIFLEEFKDAFGIEKEAT-GV-KVGAALDKFSEALKDTFIIG 323 (878) Q Consensus 251 ~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~-~~-~~~~~~~~~~~~~~~~~~~~ 323 (878) ..|.|-. .+.|-. -+.+++|.-.||+ ..|+..++ .... +. .-|+|.. 
|+-| T Consensus 1 ~~~~~~w~~~de~~----~~~~~~~~~~~~~~p~~~dG~s~i~---------~~~~~~~~~~~~~~~---------~~gg 58 (533) T protein:vir:58 1 MPSLEKYKKLNEAV----NFTNFLSPMYGMGAPHGAGGSSMIP---------INMYHPFATAGYASR---------FYGG 58 (533) T ss_pred CCCcchhhhhhHHH----HHHHhhchhhcccCccCCCCCcccc---------CCCCcchhhhhhhhh---------hhcc Confidence 1221111 011111 1234444444442 23332221 1100 11 1122221 1223 Q ss_pred cchh---HHHhhhhhhhh--c-ccccccccccccccccccCCCCCCC----CCCcccCCchhhhhhhhhcCCcch--hhh Q lcl|NC_016163. 324 DSSH---ILSEYSDVLLS--E-DMNISGFFGSAVSPFNATGEGNTQP----GSNRKLADPEKEKILKTKLGGNEK--AIV 391 (878) Q Consensus 324 ~~~~---~~~~~~~~~~~--~-~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~--~~~ 391 (878) +... ....|...-++ | |--|.-..--|+.+.+ +..| -++-.+++.-|+||++ -++=+-+ .++ T Consensus 59 ~~~n~~eLI~~YR~ma~~~pEVd~AideIvneaiv~d~-----~~~pV~v~l~~~e~s~~iK~kI~~-lldf~~~~~~~f 132 (533) T protein:vir:58 59 IEFNRFFLYDMYDRMDYTDPLISTVLDIIADECTIPNE-----NGNIVDVVTKDIELAKAILSYLDY-VINIEKNAYPII 132 (533) T ss_pred ccccHHHHHHHHHHhhccCcchhhHHHhhhceeeEecC-----CCceeEeecccccccHHHHHHHHH-HhcchhhhhHHH Confidence 3222 45566555332 1 1122222223333322 2222 2455577777888874 1221111 122 Q ss_pred eecCCCceEEeeecccceeeeeeEeeEecCCCcccCcccccCC---------CCceeecccccccchhHHHHhhhcCccc Q lcl|NC_016163. 392 KRISPGNIVDLTFEDNILGYLYLDIVEVDPDGTTMPSDKVDNG---------NEGYTFMPSQSVGNGNVLQNMVYSGKDI 462 (878) Q Consensus 392 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---------~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 462 (878) +|.- +-|-+|+-.+.-.|++..-----+|.- .+-..|..++ + T Consensus 133 R~WY------------VDGriy~Hkiik~~k~GI~elr~lDPr~i~~vr~~~t~~eyyvy~~---------------~-- 183 (533) T protein:vir:58 133 RNMI------------KYGDMFLHILEKGSDGTIEKFQVVSPYIFSKRYNPETDTWYYVITD---------------V-- 183 (533) T ss_pred Hhhh------------hcceeEEEeccCCcccchhhheecCCeeeEEEEeeccceEEEeecc---------------c-- Confidence 2210 112233333211222222111111111 0111111111 1 Q ss_pred cccCCCccccccccccCCCCceeccccchhHHHHHHHHHhhcCCcchhhhhhhhhhhhhhhhhhhhhhhhhhhccceeEE Q lcl|NC_016163. 
463 PIDGNGGHSSGAKNVENPNGQRLDVADDARLQFLAAAFANRLSDESNIKLIKKSATIKQAIYNTLSIRKLTRKDKVRVIY 542 (878) Q Consensus 463 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~IKl~K~S~T~~~~~~~~l~~~~L~~k~K~RVi~ 542 (878) + + +-.++.+++||.++++| .|.+||.+.++.-+++ T Consensus 184 ------~-~------------------------------~~~s~~~~~kI~~daI~--------y~~SGl~d~~~~~iis 218 (533) T protein:vir:58 184 ------Y-R------------------------------NVVSGYFNEDIPEEDVI--------HFSHKIDTNFFPYGRS 218 (533) T ss_pred ------c-c------------------------------ccccCccccccchhhee--------eeeeccccCCCCceeh Confidence 0 0 01244456899999999 9999999999999999 Q ss_pred echHHHHhhhcCchhhhhhhhhhhhhhhhHHHHHHHHHhcCccceEEEEecC-CCChhHHHHHHHHH-hhcceeeecccC Q lcl|NC_016163. 543 LKPEEVVMINRGHSIFDNILFFAKIYITTLLTLLMQNVLRGAPKRAVYVEVG-LDNNPANATQQAIR-DVKSKEISSITN 620 (878) Q Consensus 543 ~~peeIv~In~G~sI~dnil~~skiyltTllSLVIYRITRAPERRVFYIDVG-LPK~KAEQYMrdI~-kyKNKLVYDAsT 620 (878) |+|.+|+..||.+||+| +|||||||||||||||||||| |||.||+|||++|| +||||+|||++| T Consensus 219 yLhkAiKp~NQLkmiED--------------AlVIYRisRAPeRRvFYIDVGNlpk~KAeqYl~~im~k~kNklvYDa~T 284 (533) T protein:vir:58 219 YLESARAIWNQLRLMED--------------ALMLYRVVRSVDRRVFYVDVGNVPPDKINEYLTNIAMQYKRDYWVRNNQ 284 (533) T ss_pred hhhHHHHHHHHHHHHHH--------------HHHHHhhcCChhheEEEEeecCCCccCHHHHHHHHHHhcccceEEeccC Confidence 99999999999999988 999999999999999999999 99999999999985 899999999999 Q ss_pred ceeccccchh---hhhhhcccccccCCCCceeeecCcccccccchHHHHHHHHHHHHHcCCCchhc-----------ccc Q lcl|NC_016163. 
621 MDMQSIINYV---GEFQDYYIPVVDGEKPITFETIDALDAKSLDDDFLNWLSNNIFSGMGIPSAYL-----------TEV 686 (878) Q Consensus 621 GEVRDDrk~M---SMLEDYWLPRREGGRGTEISTLpGqNlgei~DDvVeYFqKKLYrALNVPvSRL-----------ITR 686 (878) |+|+|++|+| |||||||||||||||||||+||||+++|++ || |+||++|||+|||||+||| ||| T Consensus 285 Gev~ddrk~m~~~sMlEDyWLpRReGgrgTEI~TLpGg~lgem-eD-V~YF~kkLy~ALnVP~sRl~~e~~fgr~~eItR 362 (533) T protein:vir:58 285 NQFLGIDNYFSIESILKDYFIPRRGDRRAVEIDILQGSKVDLA-ED-VEYMLNRLISALKVPKAFIGYEGDVNAKNTLAT 362 (533) T ss_pred CeEeeccchhhhhhhHhhhcccccCCCccceeeecCCCCCCcH-HH-HHHHHHHHHHHhCCCeeecCCCCCCccchhhhH Confidence 9999999999 999999999999999999999999886555 44 9999999999999999999 999 Q ss_pred ccccHHHHHHHHHHHHHHhhhHhhccchhhhhhhhccccccccccccchhHHHHHHHhhhhHhhhhhhhhhhhhhhhHhh Q lcl|NC_016163. 687 ENVDFAKTLSMQNSRFIRDIIGDQVILSKGYTELVRKIYNLNFKSNEVDDKSDPAKDEILTKNAKSYTDTQSLAKAAIKY 766 (878) Q Consensus 687 DELKFsKFI~RLR~RFF~DlLRtQLILKKiiTE~IRniyn~nFksnEVdd~selkk~EILt~riksytd~Qsla~aa~KY 766 (878) ||+||+|||.|||+|| .++|++|||||++||+. -|.++| ...+++.|+++.|||..|+..+ +.++||++|| T Consensus 363 DEiKF~KFI~rLR~rF-~~ll~~qLilk~iit~e---ew~~~f--~~Dn~f~ElKe~Eil~~Ri~~l---~~~dpyvgk~ 433 (533) T protein:vir:58 363 QDIKFNNTIKRIQGFF-VEELERMVRMNKEFADQ---DFRLVM--NRSNSIVEGERFAVIEQRIGIA---ERLKGWVRED 433 (533) T ss_pred HHHHHHHHHHHHHHHH-HHHHhcccccccCcchh---heeeee--eccchHHHHHHHHHHHHHHHHH---HHhcchhhHH Confidence 9999999999999985 68889999999999973 344454 4568999999999999999988 7889999997 Q ss_pred cchhhhheecCcccccchhhhhHHHHHHhhhhhhhhcCccccccCCcchHHHHHHHHHHHhhhccCCCCCHHHHHHHHH- Q lcl|NC_016163. 767 FDINNISVKFPSPASLNMNNLSEQISNVNNFVTTLTENLTFDDTIPQDDQEKLKKKLIMRFTKKNLPNVDWDELDSIMD- 845 (878) Q Consensus 767 Fdv~yis~K~l~~t~l~m~eldeQIs~~~~~v~~~~~N~~F~~~~~QDDQ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~- 845 (878) | |++++|++++ ++.+ | +.-|.++-.++.|+ +.|.+.--..+. 
| .|.+- |-+++-|+ T Consensus 434 y----i~k~ILr~td-ei~~---q----~e~ie~E~~~~~~~----~~~~~~e~~~~~--~----~~~~~-~p~~~~~~~ 490 (533) T protein:vir:58 434 W----IYSNILQIPY-DLKP---Q----EEVAEAAGGGGLFD----TGGFGEETTPAD--F----LGERG-SPIESPRGR 490 (533) T ss_pred H----HHHHHhcCCh-hhhH---H----HHHHHHhhcCCCCC----CCCcccccCCcc--c----Ccccc-CcccCCCCh Confidence 5 8888888876 3333 2 12356677788887 443332111100 0 01100 00000000 Q ss_pred -------HHHHhhcchhhhhhhccchhhcccccccccCCC Q lcl|NC_016163. 846 -------EIVREYTGEKVEKSISTSEEENEEGSEDVGGGF 878 (878) Q Consensus 846 -------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 878 (878) +--.+..|+---++ ...++. +.-|||= T Consensus 491 ~~~~~~~~~~~~~~~~~~~~~-----a~~~~~-~~~g~~~ 524 (533) T protein:vir:58 491 TEFDFGTEGGEELGGELNLGG-----AFEEFE-EETGGGE 524 (533) T ss_pred hhHhcccCCcccccccccccc-----cchhhh-hhcCCcc Confidence 00000111110000 000000 1112222 No 18 >protein:vir:103219 Length: 201 # NCBI annotation: hypothetical protein # Family: family:all:297 # MgeID: mge:1605 # MgeName: JK06 # Cross-refs: genbank:acc:YP_277473;genbank:gi:71834115;genbank:GeneID:3562330 Probab=67.57 E-value=0.094 Score=26.22 Aligned_cols=179 Identities=13% Similarity=0.114 Sum_probs=77.5 Q ss_pred EEEEecCCCChhHHHHHHHHHhhcceeeecccCceeccccchhhhhhhcccccccC----CCCceeeecCcccccccchH Q lcl|NC_016163. 588 AVYVEVGLDNNPANATQQAIRDVKSKEISSITNMDMQSIINYVGEFQDYYIPVVDG----EKPITFETIDALDAKSLDDD 663 (878) Q Consensus 588 VFYIDVGLPK~KAEQYMrdI~kyKNKLVYDAsTGEVRDDrk~MSMLEDYWLPRREG----GRGTEISTLpGqNlgei~DD 663 (878) ||-++ | +..-+ +...++ -++.|.++.++-=- ..| +.+=+++++. .+++++. 
| T Consensus 1 V~k~~-~---------------l~~~~--~~~~~~---~~~r~~~~~~~~~~-~~~~~ld~~~e~~e~~~-~~lsGl~-d 56 (201) T protein:vir:10 1 MWKAK-G---------------LADLC--DDSDGA---ARLRLAQVDNNSGV-GQAIGIDADSEEYNVLN-SDIGGID-T 56 (201) T ss_pred Cccch-H---------------HHHHh--cCChHH---HHHHHHHHHHhhhh-hhhheeecCCcceeeee-cCcCChH-H Confidence 22211 0 00000 000011 11222222111000 000 1111233331 2344544 5 Q ss_pred HHHHHHHHHHHHcCCCchhc--c-------c--cccccHHHHHHHHHHHHHHhhhHhhccchhhhhhhhccccccccccc Q lcl|NC_016163. 664 FLNWLSNNIFSGMGIPSAYL--T-------E--VENVDFAKTLSMQNSRFIRDIIGDQVILSKGYTELVRKIYNLNFKSN 732 (878) Q Consensus 664 vVeYFqKKLYrALNVPvSRL--I-------T--RDELKFsKFI~RLR~RFF~DlLRtQLILKKiiTE~IRniyn~nFksn 732 (878) ++..|...+=-+.+||+.|| + | =|.-.|..+|..+|.+.+..+|++=+= -++. -.-|.+.|.. T Consensus 57 ~l~~~~~~iaa~s~iP~t~LfG~sp~Glnatge~d~~nyyd~i~~~Qe~~l~p~le~l~~--~~~~---~~~~~~~f~p- 130 (201) T protein:vir:10 57 FLSQKFDRIVALSGIHEIILKGKNVGGVSASQNTALETFYGYVDRKRKAELLPLLEFLLP--FIVT---EQEWSVEFNP- 130 (201) T ss_pred HHHHHHHHHHhHhcCchhhhcCCCCccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHH--hhcC---CCCceEeeCC- Confidence 89999999999999999999 1 2 255579999999999887776664221 1221 1335555544 Q ss_pred cchhHHHHHHHhhhhHhhhhhhhhhhhhhhhHhhcchhhhheecCcccccchhhhhHH-----H--HHHhhhhhhhhcCc Q lcl|NC_016163. 733 EVDDKSDPAKDEILTKNAKSYTDTQSLAKAAIKYFDINNISVKFPSPASLNMNNLSEQ-----I--SNVNNFVTTLTENL 805 (878) Q Consensus 733 EVdd~selkk~EILt~riksytd~Qsla~aa~KYFdv~yis~K~l~~t~l~m~eldeQ-----I--s~~~~~v~~~~~N~ 805 (878) ...-++..+.||...++..+ .+|++- -++++...+ +.|..+ | ..+...++..++.. T Consensus 131 -L~~~s~kekAei~~~~a~a~----------~~~~~~-----g~i~~~e~r-~~L~~~~~~~~~~~~~~~~~~~~~e~~d 193 (201) T protein:vir:10 131 -LSQVSDKDKSEILEKNVNSV----------AALIAA-----GIIDADEAR-DTLRAISTEVKIGEGSIQTEVVINESED 193 (201) T ss_pred -CCCCCHHHHHHHHHHHHHHH----------HHHHHc-----CCCCHHHHH-HHHHhcCCcCCCCCCCCCccccccccCC Confidence 35556666678877776543 233321 122211111 000000 0 01111222111111 Q ss_pred cccccCCcch Q lcl|NC_016163. 
806 TFDDTIPQDD 815 (878) Q Consensus 806 ~F~~~~~QDD 815 (878) - -..|.|+ T Consensus 194 p--~~~~~~~ 201 (201) T protein:vir:10 194 P--LDVSANN 201 (201) T ss_pred C--CCCCCCC Confidence 1 1245566 No 19 >protein:vir:107662 Length: 427 # NCBI annotation: putative portal protein # Family: family:all:297 # MgeID: mge:1518 # MgeName: T1 # Cross-refs: genbank:acc:YP_003893;genbank:gi:45686310;genbank:GeneID:2773002 Probab=23.52 E-value=2.3 Score=18.61 Aligned_cols=339 Identities=17% Similarity=0.208 Sum_probs=121.8 Q ss_pred heecCCCceEEeeecccceeeeeeEeeEecCCCcccCcccccCCCCceeecccccccchhHHHHhhhcC-------cccc Q lcl|NC_016163. 391 VKRISPGNIVDLTFEDNILGYLYLDIVEVDPDGTTMPSDKVDNGNEGYTFMPSQSVGNGNVLQNMVYSG-------KDIP 463 (878) Q Consensus 391 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-------~~~~ 463 (878) +|... .| || .+++--.-+||.. .+..+. +|--+-.+ |.. -|+| T Consensus 1 ~~~~~---------~d---~~--~~~~~~~~~~~~~--------------~~~~~~-~~~~l~a~-Y~~~~l~~~~Vd~~ 50 (427) T protein:vir:10 1 MKIVK---------HD---GY--NDIFNGGADGSPK--------------PFFMSD-ASYHVGSF-YNDNATAKRIVDVI 50 (427) T ss_pred CCccc---------cc---hH--HHHhhcCCCCccc--------------CccccC-chHHHHHH-HHcCchhhhhhccc Confidence 11110 00 11 1111112233332 222222 22222222 321 2445 Q ss_pred cc---CCCccccccccccCCCCceeccccchhHH----HHHHHHHhhcCCcchhhh-hhhhhhhhhhhhhhhhhhhhh-- Q lcl|NC_016163. 464 ID---GNGGHSSGAKNVENPNGQRLDVADDARLQ----FLAAAFANRLSDESNIKL-IKKSATIKQAIYNTLSIRKLT-- 533 (878) Q Consensus 464 ~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~l~~~~~IKl-~K~S~T~~~~~~~~l~~~~L~-- 533 (878) .+ .+|-+=.|.+.- +++.-+ ..||+ +.-+....|+.|-+-|-+ +++...+.+-.-..-++.+|. 
T Consensus 51 aed~~r~g~~i~g~~~~-----~~~~~~-~~~l~~~~~l~~a~~~~rl~G~a~i~i~v~d~~~l~~p~~~~g~l~~l~v~ 124 (427) T protein:vir:10 51 PEEMVTAGFKMSGVKDE-----KEFKSL-WDSYKLDSSLVDLLCWARLYGGAAMVAIIKDNRMLTSQAKPGAKLEGVRVY 124 (427) T ss_pred hHHhhcCCccccCccHH-----HHHHHH-HHHhhHHHHHHHHHHhccccceeEEEEEecCCCccccccCCCcceeEEEEe Confidence 44 455554432210 111111 11222 223333456666554432 222222111111111111110 Q ss_pred ------------------------------------hhccceeEEechHHHHhhhc------Cchhhhhhhhhh--hhhh Q lcl|NC_016163. 534 ------------------------------------RKDKVRVIYLKPEEVVMINR------GHSIFDNILFFA--KIYI 569 (878) Q Consensus 534 ------------------------------------~k~K~RVi~~~peeIv~In~------G~sI~dnil~~s--kiyl 569 (878) .=.+-|||-+...++-..-+ |.|++...+ +- +.|. T Consensus 125 d~~~~~~~~~~~dp~s~~fg~P~~y~v~~~~~~~~~~iH~SRli~~~g~~~p~~~~~~~~~~G~S~l~~~~-~~~i~~~~ 203 (427) T protein:vir:10 125 DRFAITVEKRVTNARSPRYGEPEIYKVSPGDNMQPYLIHHSRVFIADGERVAQQARKQNQGWGASVLNKSL-IDAICDYD 203 (427) T ss_pred chhcccccccccCccccccCcceEEEEecCCCCcceEEccccEEEecCCCchhhhcccCCcccchhhhHHH-HHHHHHHH Confidence 00111444443222211111 455443221 11 2234 Q ss_pred hhHH--HHHHHHHhcCccceEEEE-ecC--CCChhHHHHH--HH--HHhhcceeeecccCceeccccchhhhhhhccccc Q lcl|NC_016163. 570 TTLL--TLLMQNVLRGAPKRAVYV-EVG--LDNNPANATQ--QA--IRDVKSKEISSITNMDMQSIINYVGEFQDYYIPV 640 (878) Q Consensus 570 tTll--SLVIYRITRAPERRVFYI-DVG--LPK~KAEQYM--rd--I~kyKNKLVYDAsTGEVRDDrk~MSMLEDYWLPR 640 (878) +|.. +.++|+- .=+|+.+ ++. +....++..+ +. +.+.|+ .+|- |.+ | T Consensus 204 ~~~~~~~~l~~k~----~~~v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~------~~~~-------~~l--~----- 259 (427) T protein:vir:10 204 YCESLATQILRRK----QQAVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSG------VGRA-------IGI--D----- 259 (427) T ss_pred HHHHHHHHHHHHh----ccccccchhHHHHhcCccchHHHHHHHHHHHHhcC------cccc-------eee--e----- Confidence 4444 4455553 2234555 333 2222222111 11 111111 1111 111 0 Q ss_pred ccCCCCceeeecCcccccccchHHHHHHHHHHHHHcCCCchhc--c---------ccccccHHHHHHHHHHHHHHhhhHh Q lcl|NC_016163. 
641 VDGEKPITFETIDALDAKSLDDDFLNWLSNNIFSGMGIPSAYL--T---------EVENVDFAKTLSMQNSRFIRDIIGD 709 (878) Q Consensus 641 REGGRGTEISTLpGqNlgei~DDvVeYFqKKLYrALNVPvSRL--I---------TRDELKFsKFI~RLR~RFF~DlLRt 709 (878) +.+-+++++. .++.++ +|++.+|...+=-+.+||+.|| + .=|.-.|..+|+.+|...+..+|.+ T Consensus 260 ---~~~e~~e~~~-~~lsgl-~~~~~~~~~~iaaa~~IP~t~L~G~sp~Glnstgd~D~~nyyd~i~~~Qe~~l~p~l~~ 334 (427) T protein:vir:10 260 ---AETEEYDVLN-SDISGV-PEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKLVDRKREEDYRPLLEF 334 (427) T ss_pred ---cCCCceeEEe-cccCCh-HHHHHHHHHHHHhhhCCCeeeeccCCccccccchhHHHHHHHHHHHHHHHHHHHHHHHH Confidence 2223455442 234444 4589999999999999999999 1 1355678899999998776666553 Q ss_pred hccchhhhhhhhccccccccccccchhHHHHHHHhhhhHhhhhh---hhhhhhhhhhHhhcchhhhheec----Cccccc Q lcl|NC_016163. 710 QVILSKGYTELVRKIYNLNFKSNEVDDKSDPAKDEILTKNAKSY---TDTQSLAKAAIKYFDINNISVKF----PSPASL 782 (878) Q Consensus 710 QLILKKiiTE~IRniyn~nFksnEVdd~selkk~EILt~riksy---td~Qsla~aa~KYFdv~yis~K~----l~~t~l 782 (878) = +.-++.. .-|.+.|.. ...-++..+.||...++.++ .+...+ +-+-++... .-.... T Consensus 335 l--~~~i~~s---~~~~~~f~p--L~~~s~kEkaei~~~~a~a~~~~~~~gvi--------~~~e~r~~L~~~~~~~~~~ 399 (427) T protein:vir:10 335 L--LPFIVDE---EEWSIEFEP--LSVPSKKEESEITKNNVESVTKAITEQII--------DLEEARDTLRSIAPEFKLK 399 (427) T ss_pred H--HHHhhcC---CCcEEEeCC--CCCCCHHHHHHHHHHHHHHHHHHHhcCCC--------CHHHHHHHHHhhhccccCC Confidence 1 1111211 235555543 33334444566665555433 111111 111111110 000011 Q ss_pred chhhhhHHHHHHhhhhhhhhcCccccccCCcch Q lcl|NC_016163. 
783 NMNNLSEQISNVNNFVTTLTENLTFDDTIPQDD 815 (878) Q Consensus 783 ~m~eldeQIs~~~~~v~~~~~N~~F~~~~~QDD 815 (878) ..++++.+ ..-...+.-|...+.-..+| T Consensus 400 ~~~~~~~e-----~~~~~~e~~p~~~e~~~d~~ 427 (427) T protein:vir:10 400 DGNNINIR-----EPEETTEPEPGLGEKLEDEN 427 (427) T ss_pred CCcccccc-----ccchhcCCCCCCCCCCCCCC Confidence 11111111 00011122222222222222 No 20 >protein:vir:80040 Length: 461 # NCBI annotation: gp3 # Family: family:all:297 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468707;genbank:gi:157325287;genbank:GeneID:5601731 Probab=21.06 E-value=2.7 Score=18.25 Aligned_cols=414 Identities=13% Similarity=0.136 Sum_probs=149.8 Q ss_pred ecccHHHHHHHhhhhhhhhhhccccccchhhhhhccchhhhhhhcccccccccchhHhhhhhhhccccccccchhhhhHH Q lcl|NC_016163. 231 LSMNEEIKQMLSESASELDKVDSYKTLNLSEGLLNSADKSQLLSENASFSKQGSIFLEEFKDAFGIEKEATGVKVGAALD 310 (878) Q Consensus 231 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 310 (878) .+-..+-+|.... .+-|+|..+...-|+-++.|+. +-+..++ +..+=.+++..++ +.+.=++.+.| T Consensus 1 ~~~~~~a~~~~~~-----~~a~~~~~~~~~~g~~~~~d~~---~~~~~~~-~~~~~~~~l~~lY-----~~~~l~r~iVd 66 (461) T protein:vir:80 1 MYSIDKAKQAKID-----SKIVNRNDFMVGHGKANSRDKL---TRQTPGN-GQKLDLKACENLY-----ASNSIAMNIVD 66 (461) T ss_pred Cccchhhhhhhhh-----hhhhhhhHHHhhcCCcchhhhh---hccccCc-ccccCHHHHHHHH-----HhCCccchhhc Confidence 2222233333211 1234555444444444444432 1122222 2223334443333 11112223333 Q ss_pred HHHHh-hhh-eeeeccchhH----HHhhh-----hhhhhcccccccccccc-cccccccCCCC-CCCCCCcccCCchhhh Q lcl|NC_016163. 311 KFSEA-LKD-TFIIGDSSHI----LSEYS-----DVLLSEDMNISGFFGSA-VSPFNATGEGN-TQPGSNRKLADPEKEK 377 (878) Q Consensus 311 ~~~~~-~~~-~~~~~~~~~~----~~~~~-----~~~~~~~~~~~~~~~~~-~~~~~~~~~~~-~~~~~~~~~~~~~~~~ 377 (878) +..+- ++. .-|-|+..-. -++.. +. +.+-.--+.+||.+ +... +.+++ ..|... 
T Consensus 67 ~~a~d~~r~g~~i~~~~~~~~~~~~~~~~~l~~~~~-l~~~~~~~rl~G~a~i~i~--v~d~~~~~~~~~---------- 133 (461) T protein:vir:80 67 IISEDMVRAGWSLKTDNKEMKKNIESKWRKLKTKDR-FQKLYADKRLYGDGFLSIG--VVSSNREQADLS---------- 133 (461) T ss_pred cchHHhhcCCeeeecCCHHHHHHHHHHHHHhhHHHH-HHHHHHhhcccccEEEEEE--eecCCccccCcc---------- Confidence 22221 111 1122332211 11110 11 11112234455532 2221 11111 111111 Q ss_pred hhhhhcCCcchhhheecCCCceEEeeecccceeeeeeEeeEecCCCcccCcccccCC-CCceeecccccccchhHHHHhh Q lcl|NC_016163. 378 ILKTKLGGNEKAIVKRISPGNIVDLTFEDNILGYLYLDIVEVDPDGTTMPSDKVDNG-NEGYTFMPSQSVGNGNVLQNMV 456 (878) Q Consensus 378 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~ 456 (878) +.+.||.+. |..||+.+-.-| +++...++. ......+|..-.=+ T Consensus 134 --------------~pl~~~~~~---------~~~~l~~~~~~~----i~~~~~~~dp~sp~fg~P~~y~i~-------- 178 (461) T protein:vir:80 134 --------------TAIDPKTIK---------SIPYINTFNTQK----VTQLYLNQDMFSEHFGEVEFFEVN-------- 178 (461) T ss_pred --------------CCccccccc---------ceeEEEeccccc----cchhhhcccCcCcccccceEEEEe-------- Confidence 112222221 222233221111 111111111 11111223211000 Q ss_pred hcCccccccCCCccccccccccCCCCceeccccchhHHHHHHHHHhhcCCcchhhhhhhhhhhhhhhhhhhhhhhhhhhc Q lcl|NC_016163. 457 YSGKDIPIDGNGGHSSGAKNVENPNGQRLDVADDARLQFLAAAFANRLSDESNIKLIKKSATIKQAIYNTLSIRKLTRKD 536 (878) Q Consensus 457 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~IKl~K~S~T~~~~~~~~l~~~~L~~k~ 536 (878) |.. +.+...+.+..|.+ ..+ +. T Consensus 179 --~~~---------~~~~~~~~~~~~~~------------------------~~~-iH---------------------- 200 (461) T protein:vir:80 179 --RVS---------QLGEEILSGTTAST------------------------SEQ-IH---------------------- 200 (461) T ss_pred --ccc---------cccccccccccCcc------------------------ceE-Ec---------------------- Confidence 000 00000000000000 000 00 Q ss_pred cceeEEechHHHHhhhcCchhhhhhhhhhhhhhhhHH--HHHHHHHhcCccceEEEEecC--CCChhHHHHHHHHHhhcc Q lcl|NC_016163. 
537 KVRVIYLKPEEVVMINRGHSIFDNILFFAKIYITTLL--TLLMQNVLRGAPKRAVYVEVG--LDNNPANATQQAIRDVKS 612 (878) Q Consensus 537 K~RVi~~~peeIv~In~G~sI~dnil~~skiyltTll--SLVIYRITRAPERRVFYIDVG--LPK~KAEQYMrdI~kyKN 612 (878) +-|||.+...++.-+=.|.|+++-++=.-+.|-+|+. +.++|+.. -.+|-+|-- +.+.-..+..+.+..++ T Consensus 201 ~SRii~~~~~~~~~~~~G~S~le~~~~~l~~~~~~~~~~~~l~~~~~----~~v~k~~~l~~~~~~~~~~~~~~~~~~~- 275 (461) T protein:vir:80 201 RSRIIHEQGLRFEGETKGRSIFESLYDIITVMDTSLWSVGQILYDFA----FKVYKTDDIDALNKDDKANLTAMLDFMF- 275 (461) T ss_pred cccEEEecCCCCCccccCcchHHHHHHHHHHHHHHHHHHHHHHHHhC----CCceecchHHhhhchHHHHHHHHHHHhc- Confidence 0144444333332223388888877666666766666 33445532 346666521 33333333333332222 Q ss_pred eeeecccCceeccccchhhhhhhcccccccCCCCceeeecCcccccccchHHHHHHHHHHHHHcCCCchhcc-------- Q lcl|NC_016163. 613 KEISSITNMDMQSIINYVGEFQDYYIPVVDGEKPITFETIDALDAKSLDDDFLNWLSNNIFSGMGIPSAYLT-------- 684 (878) Q Consensus 613 KLVYDAsTGEVRDDrk~MSMLEDYWLPRREGGRGTEISTLpGqNlgei~DDvVeYFqKKLYrALNVPvSRLI-------- 684 (878) ..+|-+ + - +++-+++++. .+++++ +|+++.|...+--+.+||..+|. T Consensus 276 -----~~~g~~---------~--------~-d~~e~~e~~~-~~lsgl-~~~l~~~~~~iaa~s~iP~t~L~G~s~g~~a 330 (461) T protein:vir:80 276 -----RTEALA---------I--------I-KGDEQLTKES-TNVSGM-KDLLDYGWDYLAGAVRMPKTVLKGQEAGTLT 330 (461) T ss_pred -----CCceEE---------E--------E-cCCcceEEEe-cCcCCH-HHHHHHHHHHHhhhhcCCeeeeecccCCccc Confidence 122211 0 1 1122344442 234454 45799999999999999999991 Q ss_pred c--cccccHHHHHHHHHHHHHHhhhHhh--ccch--hhhhhhhccc-cccccccccchhHHHHHHHhhhhHhhhhhhhhh Q lcl|NC_016163. 685 E--VENVDFAKTLSMQNSRFIRDIIGDQ--VILS--KGYTELVRKI-YNLNFKSNEVDDKSDPAKDEILTKNAKSYTDTQ 757 (878) Q Consensus 685 T--RDELKFsKFI~RLR~RFF~DlLRtQ--LILK--KiiTE~IRni-yn~nFksnEVdd~selkk~EILt~riksytd~Q 757 (878) | -|.-.|..+|.++|..++..+|.+= +|+. -++...+++. 
+.+.|..+.....++..+.|+...++.++ T Consensus 331 sge~D~~~yyd~i~~~qe~~l~p~le~l~~~i~~s~~~~~~~~~p~~~~~~i~f~~L~~~s~kekAe~~~~~a~a~---- 406 (461) T protein:vir:80 331 GAQYDVMNYYARVSSIQENRLRPQLEYLTRLLMWASDDCGPSIDPDSFEWAIEFNPLWNLDSKTDAEVRKLTAEAD---- 406 (461) T ss_pred cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccCccccceEEEeCCCCCCCHHHHHHHHHHHHHHH---- Confidence 2 3555689999999987755444421 1211 1122223332 33445555566667777788877777544 Q ss_pred hhhhhhHhhcchhhhheecCccccc--------------chhhhhHHHHHHhhhhhhhhcCccccccCCcchHHHHHHHH Q lcl|NC_016163. 758 SLAKAAIKYFDINNISVKFPSPASL--------------NMNNLSEQISNVNNFVTTLTENLTFDDTIPQDDQEKLKKKL 823 (878) Q Consensus 758 sla~aa~KYFdv~yis~K~l~~t~l--------------~m~eldeQIs~~~~~v~~~~~N~~F~~~~~QDDQ~~~~~~~ 823 (878) .+|++ .-++++.++ .+..++..+-.+.+.+.+. T Consensus 407 ------~~~~~-----~g~is~~e~r~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---------------------- 453 (461) T protein:vir:80 407 ------QIYIV-----NGVLDPDEVKETRFGRFGLENSSKFSGDSAEIDKLAKLVYDA---------------------- 453 (461) T ss_pred ------HHHHh-----cCCCCHHHHHHHHHHhcCCCCCccCCCCCchhhhhhhhcccc---------------------- Confidence 12331 112222111 1111111111121111111 Q ss_pred HHHhhhccCCC Q lcl|NC_016163. 824 IMRFTKKNLPN 834 (878) Q Consensus 824 ~~~~~~~~~~~ 834 (878) ..+.+-.. T Consensus 454 ---~~~e~~~g 461 (461) T protein:vir:80 454 ---YAKKNADG 461 (461) T ss_pred ---ccccCCCC Confidence 11111111 Done!