Query         lcl|NC_019422.1_cdsid_YP_006990564.1 [gene=D864_gp10] [protein=tail sheath] [protein_id=YP_006990564.1] [location=6334..7401]
Match_columns 355
No_of_seqs    120 out of 188
Neff          6.7
Searched_HMMs 1612
Date          Thu Nov 7 18:02:05 2013
Command       /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_10 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_10_vs_rec_db.hhr

 No Hit                             Prob  E-value  P-value  Score    SS Cols Query HMM  Template HMM
  1 protein:vir:102359 Length: 356 100.0   2E-125   1E-128  704.1  38.2  343  1-355    2-356 (356)
  2 protein:vir:78986 Length: 436  100.0   1E-100   7E-104  568.6  31.6  320  1-355   84-435 (436)
  3 protein:vir:102957 Length: 437 100.0    8E-95  4.9E-98  536.5  35.0  322  1-355   79-436 (437)
  4 protein:vir:105470 Length: 451 100.0  6.8E-95  4.2E-98  536.8  32.3  324  1-355   79-450 (451)
  5 protein:vir:96586 Length: 587  100.0  1.5E-73  9.1E-77  419.9  31.1  326  1-355  184-580 (587)
  6 protein:vir:80488 Length: 562  100.0  2.8E-72  1.7E-75  412.9  28.7  323  1-355  197-555 (562)
  7 protein:vir:63742 Length: 562  100.0  3.1E-70  1.9E-73  401.6  31.1  326  1-355  197-555 (562)
  8 protein:vir:80779 Length: 569  100.0  2.5E-69  1.5E-72  396.7  30.7  326  1-355  187-562 (569)
  9 protein:vir:99306 Length: 587  100.0    1E-67  6.5E-71  387.8  30.5  323  1-355  197-580 (587)
 10 protein:vir:100829 Length: 607 100.0  6.7E-67  4.2E-70  383.4  29.7  322  1-355  225-594 (607)
 11 protein:vir:95741 Length: 587  100.0  5.4E-67  3.3E-70  383.9  29.1  323  1-355  195-580 (587)
 12 protein:vir:7653 Length: 581 # 100.0  3.7E-57  2.3E-60  330.0  31.4  323  1-355  190-564 (581)
 13 protein:vir:107310 Length: 581 100.0    9E-57  5.6E-60  327.9  29.2  322  1-355  173-564 (581)
 14 protein:vir:102819 Length: 648 100.0  1.6E-38  9.8E-42  227.8  25.4  323  1-355  228-643 (648)
 15 protein:vir:79798 Length: 717   99.9    2E-27  1.3E-30  166.9  23.5  321  1-355  311-715 (717)
 16 protein:vir:98824 Length: 774   99.8    1E-20  6.4E-24  130.1  17.2  322  1-355  369-765 (774)
 17 protein:vir:1845 Length: 392 #  99.7  2.3E-18  1.4E-21  117.3  22.7  317  1-355    1-378 (392)
 18 protein:vir:108052 Length: 660  99.7  3.3E-17    2E-20  111.0  27.7  320  1-355  245-645 (660)
 19 protein:vir:5711 Length: 396 #  99.7  1.1E-17  6.8E-21  113.6  22.8  318  1-355    1-381 (396)
 20 protein:vir:6079 Length: 396 #  99.7  7.3E-18  4.5E-21  114.5  21.0  318  1-355    1-381 (396)
 21 protein:vir:101804 Length: 663  99.7  1.6E-16    1E-19  107.1  28.1  317  1-355  217-646 (663)
 22 protein:vir:2035 Length: 396 #  99.7  1.5E-17  9.2E-21  112.9  20.7  318  1-355    1-381 (396)
 23 protein:vir:101187 Length: 663  99.7  6.5E-16    4E-19  103.9  29.4  317  1-355  217-646 (663)
 24 protein:vir:106427 Length: 679  99.7  1.1E-16  7.1E-20  108.0  25.2  317  1-355  267-663 (679)
 25 protein:vir:4517 Length: 498 #  99.7  5.5E-17  3.4E-20  109.7  23.0  301  1-355  150-489 (498)
 26 protein:vir:96740 Length: 388   99.7  5.5E-16  3.4E-19  104.2  28.4  314  1-355    1-375 (388)
 27 protein:vir:4463 Length: 498 #  99.7  7.2E-17  4.5E-20  109.1  22.5  299  1-355  150-489 (498)
 28 protein:vir:489 Length: 498 #   99.6  8.5E-17  5.3E-20  108.7  22.3  301  1-355  150-489 (498)
 29 protein:vir:79181 Length: 390   99.6  6.4E-17    4E-20  109.3  21.4  317  1-355    4-376 (390)
 30 protein:vir:100539 Length: 663  99.6  9.2E-16  5.7E-19  103.0  27.1  318  1-355  217-646 (663)
 31 protein:vir:98553 Length: 395   99.6  9.9E-17  6.1E-20  108.3  21.4  317  1-355    1-381 (395)
 32 protein:vir:103993 Length: 390  99.6  1.1E-16  6.7E-20  108.1  21.5  317  1-355    1-376 (390)
 33 protein:vir:78206 Length: 390   99.6  1.1E-16  6.7E-20  108.1  21.5  317  1-355    1-376 (390)
 34 protein:vir:1172 Length: 391 #  99.6  9.7E-17    6E-20  108.4  21.2  318  1-355    1-377 (391)
 35 protein:vir:7206 Length: 659 #  99.6  8.5E-16  5.2E-19  103.2  25.6  321  1-355  217-644 (659)
 36 protein:vir:106984 Length: 743  99.6  2.2E-15  1.4E-18  100.9  27.7  317  1-355  332-730 (743)
 37 protein:vir:5833 Length: 742 #  99.6  6.5E-16    4E-19  103.8  24.2  318  1-355  352-734 (742)
 38 protein:vir:79141 Length: 391   99.6    2E-16  1.2E-19  106.6  21.4  318  1-355    1-376 (391)
 39 protein:vir:80984 Length: 666   99.6  4.8E-15    3E-18   99.1  28.5  321  1-355  217-649 (666)
 40 protein:vir:100323 Length: 393  99.6  3.1E-15  1.9E-18  100.2  26.6  320  1-355    1-378 (393)
 41 protein:vir:103456 Length: 659  99.6  3.1E-15  1.9E-18  100.1  26.4  311  1-355  262-644 (659)
 42 protein:vir:10336 Length: 386   99.6  2.6E-15  1.6E-18  100.5  25.5  318  1-355    1-377 (386)
 43 protein:vir:6594 Length: 666 #  99.6  4.3E-15  2.7E-18   99.4  26.0  320  1-355  259-649 (666)
 44 protein:vir:1996 Length: 495 #  99.6  1.1E-14  6.9E-18   97.1  27.4  295  1-355  153-493 (495)
 45 protein:vir:104858 Length: 729  99.6  1.6E-15  9.9E-19  101.7  22.7  316  1-355  315-715 (729)
 46 protein:vir:5663 Length: 671 #  99.5  6.5E-14    4E-17   92.9  28.9  321  1-355  221-659 (671)
 47 protein:vir:98263 Length: 664   99.5  8.3E-15  5.1E-18   97.8  23.5  324  1-355  218-648 (664)
 48 protein:vir:104477 Length: 749  99.5  3.4E-14  2.1E-17   94.4  23.8  319  1-355  336-737 (749)
 49 protein:vir:79092 Length: 477   99.5  6.6E-14  4.1E-17   92.8  25.2  317  1-355   74-465 (477)
 50 protein:vir:6894 Length: 660 #  99.5  7.2E-14  4.5E-17   92.6  25.0  324  1-355  227-644 (660)
 51 protein:vir:107865 Length: 477  99.3  1.8E-12  1.1E-15   84.9  23.6  309  1-355  111-465 (477)
 52 protein:vir:5260 Length: 502 #  99.1  4.3E-10  2.7E-13   71.9  23.3  326  1-355  102-500 (502)
 53 protein:vir:80052 Length: 331   99.0  5.8E-09  3.6E-12   65.7  25.5  313  1-355    3-329 (331)
 54 protein:vir:276 Length: 369 #   98.6  1.6E-07  9.9E-11   57.8  29.5  316  1-355    1-364 (369)
 55 protein:vir:99586 Length: 507   98.6  1.6E-07    1E-10   57.8  23.5  320  1-355  117-506 (507)
 56 protein:vir:95263 Length: 450   98.5  4.4E-07  2.7E-10   55.4  25.5  312  1-355    1-447 (450)
 57 protein:vir:3636 Length: 501 #  98.3  1.8E-06  1.1E-09   52.1  26.7  303  1-355  150-500 (501)
 58 protein:vir:106730 Length: 501  98.2  2.9E-06  1.8E-09   50.9  26.6  302  1-355  150-500 (501)
 59 protein:vir:3788 Length: 376 #  98.1  3.7E-06  2.3E-09   50.3  27.2  325  2-355    1-369 (376)
 60 protein:vir:101576 Length: 501  98.1  4.3E-06  2.7E-09   50.0  26.3  298  1-355  150-500 (501)
 61 protein:vir:78611 Length: 501   98.0  7.7E-06  4.8E-09   48.6  26.5  299  1-355  150-500 (501)
 62 protein:vir:94073 Length: 494   98.0  8.4E-06  5.2E-09   48.4  24.7  293  1-355  147-492 (494)
 63 protein:vir:96104 Length: 504   97.8  1.7E-05    1E-08   46.8  23.5  325  1-355  108-503 (504)
 64 protein:vir:78782 Length: 370   97.7  2.7E-05  1.6E-08   45.7  27.2  316  2-355    1-361 (370)
 65 protein:vir:3751 Length: 376 #  97.6  4.5E-05  2.8E-08   44.4  29.5  323  2-355    1-369 (376)
 66 protein:vir:107720 Length: 515  97.0  0.00023  1.4E-07   40.6  24.3  331  1-355  122-514 (515)
 67 protein:vir:3165 Length: 426 #  92.9   0.0097    6E-06   31.6  19.6  323  3-355    1-424 (426)
 68 protein:vir:101326 Length: 529  42.0      0.9  0.00056   20.8  21.8  321  1-355  143-527 (529)

No 1
>protein:vir:102359 Length: 356 # NCBI annotation: XkdK protein # Family: family:all:632 # MgeID: mge:1566 # MgeName: phi CD119 # Cross-refs: genbank:acc:YP_529565;genbank:gi:90592650;genbank:GeneID:3974491
Probab=100.00 E-value=2.1e-125 Score=704.11 Aligned_cols=343 Identities=42% Similarity=0.638 Sum_probs=316.5

Q ss_pred CCCCceEEEeeeeeeeeecCCCceeEEEEEecCCccceeEEEeehhhhhhhhhhHHHHHHHHhhhccc--------cceE
Q lcl|NC_019422. 1 MGLPSAIIEFQRRSRTVKFRSRRGVVALILKDSTAIKKSYSIDFLTDINETEFTKENYDYIRLAFLGK--------PSKV 72 (355)
Q Consensus 1 ~g~P~~~i~f~~~a~ta~~~~~rG~v~iil~d~~~~~~~~~~~~~~d~~~~~~~~~n~~~i~~a~~g~--------~~~v 72 (355)
-|||+|+|+|+++|+||++||.||+|+++|+|++.... ++.+.+++ +..+++.|.+|++.+|.|+ |.++
T Consensus 2 ~glp~i~i~f~~~a~ta~~~g~rGiv~~il~d~~~~~~--~~~~~~~v-~~~~~~~n~~~i~~~~~g~~~~~~~~~p~~~ 78 (356)
T protein:vir:10 2 AGLVNINIEFKELATSFIQRSKAGIVAIILKDTTKMYK--ELTSEDDI-PISLSADNKKYIKYGFVGATDNEKVLRPSKV 78 (356)
T ss_pred CCCCceeEEEeecceeeccCCccceEEEEEecCCccee--EEeccccc-hhHHHHHHHHHHHHHhhccccccccccceee
Confidence 79999999999999999999999999999999886533 45566666 4568999999999999876 3333

Q ss_pred EEEecCCCccchhHHHHHHHHHhcccceEEEEcCCChHHHHHHHHHHHHHHHhc-CCeEEEEecCCCCcCcceeEEecCC
Q lcl|NC_019422. 73 IVEVINDSVDSERSLDDALKALRENKFNYLAIPFISEEVDKTKIVNWIKTARRE-KEIYKAVLPNISDANEKAIINFATT 151 (355)
Q Consensus 73 ~l~~g~~g~~~~~~y~~al~~le~~~fn~l~~p~~~d~~~~~~~~~~ik~~r~~-g~~~~aVl~~~~~~d~egIinv~n~ 151 (355)
... +..++++|++||++||+++|||||||+. ++++|+++++||||||++ |+++++|+++.
.+||||||||+|+ T Consensus 79 ~~~----~~~t~~~y~~aL~~le~~~fn~l~~~~~-d~~~~~~~~a~ikr~r~~~~~~~~~V~~~~-~aD~EgIInv~n~ 152 (356) T protein:vir:10 79 IIS----TFTEDGKVEDILEELESVEFNYLCMPEA-IEAEKTKIVTWIKKIREEESTEAKAVLANI-KADNEAIINFTEN 152 (356) T ss_pred eee----cccCchhHHHHHHHhcCccceEEEecCC-ChHHHHHHHHHHHHHHhcCCcEEEEEecCC-CCCCceeEEeecC Confidence 332 2345789999999999999999999984 788999999999999987 79999999986 5899999999998 Q ss_pred eEecCCceecHHHHHHHHHHHhcCcccccccccccCCcccc--cCChhhHHHHHhCCeEEEEECC-cEEEEecCcccccc Q lcl|NC_019422. 152 GIKVGEKSYTTAEYTARLAGILAGISLSESCTYFILDEVTE--IEPTENPDEAVEEGKLILINNN-GIRIARGVNSLITL 228 (355) Q Consensus 152 ~i~~~~~~~~~~~~~a~vAG~~Ag~~~~~S~T~~~~~~~~~--~~~~~e~~~ai~~G~lvl~~dg-~v~I~~~INSltt~ 228 (355) ++ .+|.+|++++||+||||++||||+|+|+||++|+++.. .++++|++++|++|+|+|+++| +|||+||||||||+ T Consensus 153 ~~-~~g~~~t~~~~~~~vAG~~Ag~~~n~S~T~~~~~~~~~~~~~t~~e~~~ai~~G~lvl~~d~~~V~I~~~VNSltt~ 231 (356) T protein:vir:10 153 VV-VDGEEITAEKYTTRVASLIASTPNTQSITYAPLDEVESIVKIDKASADAKVQAGELILRRLSGKIRIARGINSLTTL 231 (356) T ss_pred eE-ecceeechhHHHHHHHHHHhccchhccccceecCCccccccCCHHHHHHHHhCCeEEEEEEcCeEEEEecCccceec Confidence 76 58999999999999999999999999999999998764 4689999999999999999875 79999999999999 Q ss_pred CCCCCchhhhhhhHhhHHHHHHHHHHHHhhccccccCCCHHHHHHHHHHHHHHHHHHHhcccccCCCCceeEeccccchh Q lcl|NC_019422. 
229 SKEDTEDLKKIKIVEAIDMIQDDILQTWNENYVGKVTNKYDNKILFLSAVNNYFKELQRDEVLDNSQEAYAQIDIEAHKK 308 (355) Q Consensus 229 ~~~k~~~f~kirvvr~~D~i~~di~~~~~~~yiGK~~N~~~gr~~~~~~i~~yl~~l~~~g~I~~~~~~~v~id~e~q~~ 308 (355) +++|+++|+|||++|+||+|.+||+++|+++||||++|+++||++||+++++||++|+++|+|+++ +++++|+|+||+ T Consensus 232 t~~k~~~f~Kirvvr~~D~i~~Di~~~f~~~yiGKv~N~~dgr~~l~~ai~~y~~~L~~~~~I~~~--~~~eid~e~q~~ 309 (356) T protein:vir:10 232 TAEKGEIFQKIKLVDTKDLISKDIKNIYVEKYLRKCPNTYDNKCLFIVAVQSYLTELAKQELIDSN--FTVEIDLEKQKE 309 (356) T ss_pred CCCCCcchhhhHHHHHHHHHHHHHHHHHhhccccccCCCHHHHHHHHHHHHHHHHHHHhCCccccC--ceeEecccchHH Confidence 999999999999999999999999999999999999999999999999999999999999999986 479999999999 Q ss_pred hhhccccccccccceeeeccCCCCEEEEEEEEEEEeeeeEEEEEEeC Q lcl|NC_019422. 309 YLKEKGIDYSEMTEQQIKEANTGSYVFVEGNITITDAMEDLKFKIYM 355 (355) Q Consensus 309 ~~~~~~~d~~~~~d~~v~~~~~~d~v~~~~~i~~vdamEkiy~tv~v 355 (355) |++++|+|+++|+|++|++++++|+||++++++|+||||||||||+| T Consensus 310 ~~~~~g~d~~~~~d~~v~~~~~~~~v~~~~~v~~vdamE~iy~ti~v 356 (356) T protein:vir:10 310 YLEGKKIAVSKMKENEIKEANTGSNGFYLINLKLVDAMEDINIRVQM 356 (356) T ss_pred HhhhccccccccccceeecccCCcEEEEEEEEEEEeeeeeEEeEEeC Confidence 99999999999999999999999999999999999999999999999 No 2 >protein:vir:78986 Length: 436 # NCBI annotation: putative sheath tail protein # Family: family:all:632 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110731;genbank:gi:134287348;genbank:GeneID:4955160 Probab=100.00 E-value=1.1e-100 Score=568.64 Aligned_cols=320 Identities=17% Similarity=0.266 Sum_probs=273.7 Q ss_pred CCCCceEEEee--------eeeeeeecCCCcee-EEEEEecCCccceeE---EEe-----------ehhhhhhhhhhHHH Q lcl|NC_019422. 1 MGLPSAIIEFQ--------RRSRTVKFRSRRGV-VALILKDSTAIKKSY---SID-----------FLTDINETEFTKEN 57 (355) Q Consensus 1 ~g~P~~~i~f~--------~~a~ta~~~~~rG~-v~iil~d~~~~~~~~---~~~-----------~~~d~~~~~~~~~n 57 (355) +|-|+.-..++ ..-++|+|+|.||+ +-+.++.+......+ ++. 
++.++++ T Consensus 84 ~~~~~tv~~yrl~~G~~a~~~v~~Aky~g~~gn~i~v~v~~~~~d~~~~dv~~~~g~~~~d~~~~~~~~~l~~------- 156 (436) T protein:vir:78 84 FKNIRLGYFYKLNKGVKASCSIATARCSGIRGNDLKVIVTTNIDDNAKFDVVTLLDNKKVDTQIAKVITELQD------- 156 (436) T ss_pred hcCCCEEEEEECCCcceeeeeeeeeecCCCCCcEEEEEecccccccCceEEEEEecchhhhhhhHHHHhhccC------- Confidence 44444422222 12368999999995 445554443222222 221 2223333 Q ss_pred HHHHHhhh---ccccceEEEEecCCCc-cchhHHHHHHHHHhcccceEEEEcCCChHHHHHHHHHHHHHHHhc-CCeEEE Q lcl|NC_019422. 58 YDYIRLAF---LGKPSKVIVEVINDSV-DSERSLDDALKALRENKFNYLAIPFISEEVDKTKIVNWIKTARRE-KEIYKA 132 (355) Q Consensus 58 ~~~i~~a~---~g~~~~v~l~~g~~g~-~~~~~y~~al~~le~~~fn~l~~p~~~d~~~~~~~~~~ik~~r~~-g~~~~a 132 (355) ++|+.+.. +...+++.|+||++|+ +++++|+++|++||+++||+||||+. ++++|+++++||||||++ |+++++ T Consensus 157 n~~V~~~~~g~la~~a~~~LtGG~dG~~~T~~dy~~al~~le~~~fn~l~~~~~-d~~~~~~~~a~ikr~re~~g~~~~a 235 (436) T protein:vir:78 157 NDYVTWKKEATLEATAGLTFTNGTNGEAVTGTEYQAFLDKIESYSFNALGCLAT-TAEIKSLFVEFTKRMRDKVGAKFQT 235 (436) T ss_pred CceEEEEecccccccceeeeeccccccccchHHHHHHHHHHcccceeEEEecCC-ChHHHHHHHHHHHHHHhhcCCeEEE Confidence 34554443 4456778899999986 68899999999999999999999984 788999999999999975 999999 Q ss_pred EecCCCCcCcceeEEecCCeEecCCceecHHHHHHHHHHHhcCcccccccccccCCcccc---cCChhhHHHHHhCCeEE Q lcl|NC_019422. 
133 VLPNISDANEKAIINFATTGIKVGEKSYTTAEYTARLAGILAGISLSESCTYFILDEVTE---IEPTENPDEAVEEGKLI 209 (355) Q Consensus 133 Vl~~~~~~d~egIinv~n~~i~~~~~~~~~~~~~a~vAG~~Ag~~~~~S~T~~~~~~~~~---~~~~~e~~~ai~~G~lv 209 (355) |+++++++||||||||+|++ +|+.|+++++|+||||++||||+++|+||++++++.+ .++++|++++|++|+|+ T Consensus 236 V~~~~~~~d~EgIInv~n~v---~g~~~~~~~~~a~vAG~~Ag~~~~~S~T~~~~~~~~~v~~~~t~~e~~~ai~~G~lv 312 (436) T protein:vir:78 236 VLYKKNDADYEGVVSVENKI---KDTGLLESSLIYWTTGAIAGCDINKSNTNKRYDGEFDVDVNYTQIHLEEALKTGKFI 312 (436) T ss_pred EecCCCCCCCceEEEeeccc---CCceechhHHHHHHHHHHhcCccccCccceecCccccccccCCHHHHHHHHhCCeEE Confidence 99998889999999999973 7899999999999999999999999999999997754 47889999999999999 Q ss_pred EEEC-CcEEEEecCccccccCCCCCchhhhhhhHhhHHHHHHHHHHHHhhccccccCCCHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_019422. 210 LINN-NGIRIARGVNSLITLSKEDTEDLKKIKIVEAIDMIQDDILQTWNENYVGKVTNKYDNKILFLSAVNNYFKELQRD 288 (355) Q Consensus 210 l~~d-g~v~I~~~INSltt~~~~k~~~f~kirvvr~~D~i~~di~~~~~~~yiGK~~N~~~gr~~~~~~i~~yl~~l~~~ 288 (355) |+++ ++|||+||||||||++++|+++|+|||++|+||+|.+||+++|+++||||++|+++||++||++|++||++|+++ T Consensus 313 l~~d~~~v~I~~~VNTltt~~~~k~~~~~kI~vir~~D~i~~di~~~~~~~yiGKv~N~~dgr~~l~~~i~~yl~~L~~~ 392 (436) T protein:vir:78 313 FHKVGDEVHVLEDINTFVSFTDEKNDDFSSNQSVRVLDQIANDIATLFNTKYLGEVPNDKSGRISFWNDVVKHHEQLQNM 392 (436) T ss_pred EEEeCCeEEEEEccccceecCCCCCcchhhhhHHHHHHHHHHHHHHHhhhccccccCCCHHHHHHHHHHHHHHHHHHHhC Confidence 9986 479999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccCCCCceeEeccccchhhhhccccccccccceeeeccCCCCEEEEEEEEEEEeeeeEEEEEEeC Q lcl|NC_019422. 289 EVLDNSQEAYAQIDIEAHKKYLKEKGIDYSEMTEQQIKEANTGSYVFVEGNITITDAMEDLKFKIYM 355 (355) Q Consensus 289 g~I~~~~~~~v~id~e~q~~~~~~~~~d~~~~~d~~v~~~~~~d~v~~~~~i~~vdamEkiy~tv~v 355 (355) |+|++|. 
++|++|.+++++++||++++++|+||||||||||+| T Consensus 393 g~I~~f~------------------------~~Dv~v~~~~~~~~v~v~~~v~pvdamekiy~ti~v 435 (436) T protein:vir:78 393 RAIEDFK------------------------ADDVSVEPGSDKKTVVVSDAVKVISAMSKLYMTVSV 435 (436) T ss_pred CcccCCC------------------------CcceEEeecCCCCEEEEEEEEEEEEeeeeEEEEEEE Confidence 9999873 368999999999999999999999999999999999 No 3 >protein:vir:102957 Length: 437 # NCBI annotation: sheath tail protein # Family: family:all:632 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945292;genbank:gi:39653727;uniprot:Q708M0;genbank:GeneID:2672871 Probab=100.00 E-value=8e-95 Score=536.47 Aligned_cols=322 Identities=20% Similarity=0.320 Sum_probs=274.8 Q ss_pred CCCC------------ceEEEee-eeeeeeecCCCcee-EEEEEecCCccceeE--------------EEeehhhhhhhh Q lcl|NC_019422. 1 MGLP------------SAIIEFQ-RRSRTVKFRSRRGV-VALILKDSTAIKKSY--------------SIDFLTDINETE 52 (355) Q Consensus 1 ~g~P------------~~~i~f~-~~a~ta~~~~~rG~-v~iil~d~~~~~~~~--------------~~~~~~d~~~~~ 52 (355) ++-+ ...++.. .++++|+|+|.+|+ +-+.+..+......+ .+.++.++.. T Consensus 79 ~~g~~~~~~~R~~~g~~a~~tl~~~~~~~A~~~G~~gn~i~v~v~~~~~d~~~~~v~~~~~~~~~d~~~v~~~~~~~~-- 156 (437) T protein:vir:10 79 FKRVSEVLLYRLNTGEKANVSLSDNVTAQAKYSGVRGNDITVTVKTNVDDPSSFDVVTFLDTVVMDLQTVKVLADLKN-- 156 (437) T ss_pred hcCCCEEEEEECCCCceeeEeeccceEEEeccCCcccceeEEEEeeccCCccceEEEEecCcceeeeeehhhhhhhhh-- Confidence 1111 1222332 47889999999995 333333322211111 1222223322 Q ss_pred hhHHHHHHHHhhh---ccccceEEEEecCCCccchhHHHHHHHHHhcccceEEEEcCCChHHHHHHHHHHHHHHHhc-CC Q lcl|NC_019422. 53 FTKENYDYIRLAF---LGKPSKVIVEVINDSVDSERSLDDALKALRENKFNYLAIPFISEEVDKTKIVNWIKTARRE-KE 128 (355) Q Consensus 53 ~~~~n~~~i~~a~---~g~~~~v~l~~g~~g~~~~~~y~~al~~le~~~fn~l~~p~~~d~~~~~~~~~~ik~~r~~-g~ 128 (355) +.|+.... +...+++.|+||++|++++++|+++|++||+++||+||||+. 
++++|+++.+|+||||++ |+ T Consensus 157 -----n~~v~~~~~~~l~~~a~~~LtGG~dg~~t~~dy~~al~~le~~~~n~l~~~~~-d~~~~t~~~~~ik~~r~~~g~ 230 (437) T protein:vir:10 157 -----NALVEFSGTGELQPVAGAKLTGGTDGAISTQDYLEYFKALETVEFNYMALPVE-DASIKKAAINFIKRMREDEGL 230 (437) T ss_pred -----hcccccccccccccccceeeeccccCCCChhHHHHHHHHhccCcceEEEecCC-ChhHHHHHHHHHHHHHhccCc Confidence 33433332 344677899999999999999999999999999999999984 788999999999999986 89 Q ss_pred eEEEEecCCCCcCcceeEEecCCeEecCCceecHHHHHHHHHHHhcCcccccccccccCCcccc---cCChhhHHHHHhC Q lcl|NC_019422. 129 IYKAVLPNISDANEKAIINFATTGIKVGEKSYTTAEYTARLAGILAGISLSESCTYFILDEVTE---IEPTENPDEAVEE 205 (355) Q Consensus 129 ~~~aVl~~~~~~d~egIinv~n~~i~~~~~~~~~~~~~a~vAG~~Ag~~~~~S~T~~~~~~~~~---~~~~~e~~~ai~~ 205 (355) ++++|+++.+ +||||||||.|+++..++..|+++++|+|+||++||||+++|+||++++++.+ .++++|++++|++ T Consensus 231 ~~~~V~~~~~-~d~e~Iin~~n~~~~~~~~~~~~~~~~a~vAG~~Ag~~~~~S~t~~~~~~~~~v~~~~t~~e~~~~i~~ 309 (437) T protein:vir:10 231 GAQLVVADSD-ADSEAVINVKNGVILSDKTVIDKTKATVWVAAASANAGVEKSLTYEKYEDSVDVVGRLSHTETEDALLK 309 (437) T ss_pred eEEEEeCCCC-CCCceEEEeecceeecCcceechhhHHHHHHHHhccCccccCccccccCCcccccccCCHHHHHHHHhC Confidence 9999999975 89999999999999999999999999999999999999999999999998765 4688999999999 Q ss_pred CeEEEEECC-cEEEEecCccccccCCCCCchhhhhhhHhhHHHHHHHHHHHHhhccccccCCCHHHHHHHHHHHHHHHHH Q lcl|NC_019422. 
206 GKLILINNN-GIRIARGVNSLITLSKEDTEDLKKIKIVEAIDMIQDDILQTWNENYVGKVTNKYDNKILFLSAVNNYFKE 284 (355) Q Consensus 206 G~lvl~~dg-~v~I~~~INSltt~~~~k~~~f~kirvvr~~D~i~~di~~~~~~~yiGK~~N~~~gr~~~~~~i~~yl~~ 284 (355) |+|+|++++ +|||+||||||||++++++++|+|||++|+||+|.+||+++|+++||||++|+++||++|+++|++||++ T Consensus 310 G~~vl~~~~~~v~i~~gInTltt~~~~~~~~~~ki~vir~~D~i~~di~~~~~~~yiGk~~N~~~~r~~~~~~i~~yl~~ 389 (437) T protein:vir:10 310 GQFVFTARRGRAVVEQDINSHVSFTIEKNQDFRKNRILRTLDDIVNDTRYAFSEYFLGKVSNNEDGRQAFKANRIRYFKD 389 (437) T ss_pred CcEEEEEeCCeEEEEEccccccccCCCCCchhhhhhHHHHHHHHHHHHHHHHHhccccccCCCHHHHHHHHHHHHHHHHH Confidence 999999875 6999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHhcccccCCCCceeEeccccchhhhhccccccccccceeeeccCCCCEEEEEEEEEEEeeeeEEEEEEeC Q lcl|NC_019422. 285 LQRDEVLDNSQEAYAQIDIEAHKKYLKEKGIDYSEMTEQQIKEANTGSYVFVEGNITITDAMEDLKFKIYM 355 (355) Q Consensus 285 l~~~g~I~~~~~~~v~id~e~q~~~~~~~~~d~~~~~d~~v~~~~~~d~v~~~~~i~~vdamEkiy~tv~v 355 (355) |+++|+|+++ .++|+++++++++|+||+++.++|+||||||||||+| T Consensus 390 l~~~g~I~~~------------------------~~~d~~v~~~~~~~~v~v~~~v~~~dame~iy~ti~v 436 (437) T protein:vir:10 390 LEARGAIEDF------------------------KVEDIEVLRGELKESVVVNVKVKPVDSMEKLYMTVTV 436 (437) T ss_pred HHhCCCccCC------------------------CceeEEeecCCCCCEEEEEEEEEEeeeeeeEEEEEEe Confidence 9999999987 4578999999999999999999999999999999999 No 4 >protein:vir:105470 Length: 451 # NCBI annotation: putative phage sheath tail protein # Family: family:all:632 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529880;genbank:gi:90592620;genbank:GeneID:3974534 Probab=100.00 E-value=6.8e-95 Score=536.85 Aligned_cols=324 Identities=19% Similarity=0.270 Sum_probs=264.4 Q ss_pred CCCCceEEEe--------------eeeeeeeecCCCcee-EEEEEecCCccceeEE----------------Eeehhhhh Q lcl|NC_019422. 
1 MGLPSAIIEF--------------QRRSRTVKFRSRRGV-VALILKDSTAIKKSYS----------------IDFLTDIN 49 (355) Q Consensus 1 ~g~P~~~i~f--------------~~~a~ta~~~~~rG~-v~iil~d~~~~~~~~~----------------~~~~~d~~ 49 (355) |+-|+.-+.+ ..++++|+|+|.||+ +.+.++.+......+. ..++.++. T Consensus 79 ~~g~~~v~~yrl~~g~~a~~t~~~~~~~~~Aky~G~~Gn~i~v~v~~~~~d~~~~~v~t~~g~~~vd~qtv~~~~~~el~ 158 (451) T protein:vir:10 79 LKGASKVLVLNPNEGTAATLTKEGLPWTVTANYPGEKGNQITVSVEVSPADQNAATVSTIFGTKLVDEQSIKFNELDKFK 158 (451) T ss_pred hcCCcEEEEEEcCCCceEEEEeecCceEEEEeeCCcCCceEEEEEecccCCcCceEEEEEECCeEEEEEEeeccchhhcc Confidence 2323322222 236789999999994 3333333222111111 12233333 Q ss_pred hhhhhHHHHHHHHhhh--ccccc---eEEEEecC-CCc--cchhHHHHHHHHHhcccceEEEEcCCC-hHHHHHHHHHHH Q lcl|NC_019422. 50 ETEFTKENYDYIRLAF--LGKPS---KVIVEVIN-DSV--DSERSLDDALKALRENKFNYLAIPFIS-EEVDKTKIVNWI 120 (355) Q Consensus 50 ~~~~~~~n~~~i~~a~--~g~~~---~v~l~~g~-~g~--~~~~~y~~al~~le~~~fn~l~~p~~~-d~~~~~~~~~~i 120 (355) + ++|+.+.. .+.+. .+.+.++. +|+ .++++|+++|++||+++||+||||+.+ ++++|+++.+|| T Consensus 159 ~-------nd~V~a~~~~~g~~~~~~~~~l~~~~~gg~~~~~~~~~~~~l~~~e~~~~n~l~~~~~~~~~~i~~~~~a~i 231 (451) T protein:vir:10 159 G-------NDYITAKVVEEGSSKPVAFTNVSGTLTGGTTTESNKVESLLNDALENEEYAVVTTAGFEPSSNMNKLVVEAV 231 (451) T ss_pred C-------CceEEEEecccccccceeeeecccccccccccCCccchHHHHHHhccceeeEEEEccCCCchHHHHHHHHHH Confidence 3 23444332 23332 22333332 222 346889999999999999999999754 467999999999 Q ss_pred HHHHh-cCCeEEEEecCCC--CcCcceeEEecCCeEecCCceecHHHHHHHHHHHhcCcccccccccccCCccccc---C Q lcl|NC_019422. 
121 KTARR-EKEIYKAVLPNIS--DANEKAIINFATTGIKVGEKSYTTAEYTARLAGILAGISLSESCTYFILDEVTEI---E 194 (355) Q Consensus 121 k~~r~-~g~~~~aVl~~~~--~~d~egIinv~n~~i~~~~~~~~~~~~~a~vAG~~Ag~~~~~S~T~~~~~~~~~~---~ 194 (355) ||||+ +|+++++|+++.+ .+||||||||+|+++..+|..|+++++|+||||++||||+++|+||++|+++.++ + T Consensus 232 k~~r~~~g~~~~aVl~~~~~~~~d~egiinv~n~~~~~dg~~~~~~~~~~~vAG~~Ag~~~~~S~T~~~~~~~~~v~~~~ 311 (451) T protein:vir:10 232 KRLRENEGRKVRGVIPTDADTTYNYEGISTVVNGYTLSDGTNVDVKDATGYFAGISASADVATSLTYFEVEDAVSAYPKF 311 (451) T ss_pred HHHHHhcCCeEEEEecCccCCCCCCcceEEeecceEecCceeechhhhHHHHHHHHcccccccCccceecCCceeeeeeC Confidence 99998 5999999999854 4899999999999999999999999999999999999999999999999987654 6 Q ss_pred ChhhHHHHHhCCeEEEE-ECC-cEEEEecCccccccCCCCCchhhhhhhHhhHHHHHHHHHHHHhhccccccCCCHHHHH Q lcl|NC_019422. 195 PTENPDEAVEEGKLILI-NNN-GIRIARGVNSLITLSKEDTEDLKKIKIVEAIDMIQDDILQTWNENYVGKVTNKYDNKI 272 (355) Q Consensus 195 ~~~e~~~ai~~G~lvl~-~dg-~v~I~~~INSltt~~~~k~~~f~kirvvr~~D~i~~di~~~~~~~yiGK~~N~~~gr~ 272 (355) +++|++++|++|+|+|+ ++| +|||+||||||||++++|+++|+|||++|+||+|.+||+++|+++||||++|+++||+ T Consensus 312 t~~e~~~~i~~G~lvl~~~~g~~v~i~~~INTltt~~~~k~~~~~ki~vir~~D~i~~di~~~~~~~yiGk~~N~~~gr~ 391 (451) T protein:vir:10 312 DNEKTIKALDAGQIVFTTRPGQRVVIEQDINSLHKFTAEKPQAFSKNRVIRTLDEIATNTENTFERTYLGNVGNNAAGRD 391 (451) T ss_pred CHHHHHHHHhCCeEEEEEEcCCeEEEEEccccceecCCCCCcchhhhhHHHHHHHHHHHHHHHhhhccceecCCCHHHHH Confidence 88999999999999995 676 6999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHhcccccCCCCceeEeccccchhhhhccccccccccceeeeccCCCCEEEEEEEEEEEeeeeEEEEE Q lcl|NC_019422. 
273 LFLSAVNNYFKELQRDEVLDNSQEAYAQIDIEAHKKYLKEKGIDYSEMTEQQIKEANTGSYVFVEGNITITDAMEDLKFK 352 (355) Q Consensus 273 ~~~~~i~~yl~~l~~~g~I~~~~~~~v~id~e~q~~~~~~~~~d~~~~~d~~v~~~~~~d~v~~~~~i~~vdamEkiy~t 352 (355) +||++|++||++|+++|+|++|.+ .|+++.+++++|+||+++.++|+||||||||| T Consensus 392 ~~~~~i~~yl~~l~~~g~i~~~~~------------------------~d~~v~~~~~~~~v~v~~~v~pvdame~iy~t 447 (451) T protein:vir:10 392 LFKADRIAYLTSLQNRNMIQSFAN------------------------TDITVEAGNDMDSIVVNLAVTPVDAMEKLYMT 447 (451) T ss_pred HHHHHHHHHHHHHHhCCCccCCCc------------------------cceEEeecCCCCEEEEEEEEEEEeeeeeEEEE Confidence 999999999999999999998843 58899999999999999999999999999999 Q ss_pred EeC Q lcl|NC_019422. 353 IYM 355 (355) Q Consensus 353 v~v 355 (355) ++| T Consensus 448 ~~v 450 (451) T protein:vir:10 448 MVV 450 (451) T ss_pred EEE Confidence 999 No 5 >protein:vir:96586 Length: 587 # NCBI annotation: ORF011 # Family: family:all:2449 # MgeID: mge:1623 # MgeName: Twort # Cross-refs: genbank:acc:YP_238553;genbank:gi:66391266;genbank:GeneID:5130368 Probab=100.00 E-value=1.5e-73 Score=419.89 Aligned_cols=326 Identities=14% Similarity=0.145 Sum_probs=247.6 Q ss_pred CCCCceEEEee-----------ee-------eeeeecCCCceeEEEE-EecCCcc--ceeEEE--eeh-hhhhh----hh Q lcl|NC_019422. 1 MGLPSAIIEFQ-----------RR-------SRTVKFRSRRGVVALI-LKDSTAI--KKSYSI--DFL-TDINE----TE 52 (355) Q Consensus 1 ~g~P~~~i~f~-----------~~-------a~ta~~~~~rG~v~ii-l~d~~~~--~~~~~~--~~~-~d~~~----~~ 52 (355) .|=|+--..|+ -+ -.+|+|+|.||.-..+ ..|.... .....+ .+. .++.. .. T Consensus 184 ~gg~~~v~~yrl~~g~~~~~~~~~~~~~~~~~~tAky~g~~~n~~~v~v~d~~~~~~~k~~~~y~~t~~~di~~~~~~~~ 263 (587) T protein:vir:96 184 KVDEKEVKAYELNGGAYSFTNEIITDINELPDFEAKLSPFGDKNLESRKLDEATDVDIKGKAVYVKAVFGDIENQTQYNQ 263 (587) T ss_pred EecCceEEEEEeCCCchhhhhhhhhhhccccceEEEeecccCceeEEEeeccccccccceEEEeehhhhhhhhhhhcccc Confidence 11111111111 11 2589999999953333 2221111 111111 111 01100 00 Q ss_pred ------hhH--H---HHHHHHh-------hh-----ccccceEEEEecCCCccchhHHHHHHHHHhcccceEEEEcCCCh Q lcl|NC_019422. 
53 ------FTK--E---NYDYIRL-------AF-----LGKPSKVIVEVINDSVDSERSLDDALKALRENKFNYLAIPFISE 109 (355) Q Consensus 53 ------~~~--~---n~~~i~~-------a~-----~g~~~~v~l~~g~~g~~~~~~y~~al~~le~~~fn~l~~p~~~d 109 (355) ... . +...... +. +...+...|+||++|++ +++|+++|++||.++||+|++++ .+ T Consensus 264 ~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~aLtGG~dG~~-~~~y~~~l~ale~~~~~~i~~~t-~d 341 (587) T protein:vir:96 264 YVKFEQLPEQASEPSDVEVHAETESATVTATSKPKAIEPFELTKLSGGTNGEP-PTSWSAKLEKFKNEGGYYIVPLT-DR 341 (587) T ss_pred ceeeccccchhhhhhcccccccccceeeeecccccccccccceeeecCCCCCC-cccHHHHHHHHhhCCcEEEEecC-CC Confidence 000 0 0000000 00 01123345899999976 56899999999999999999886 57 Q ss_pred HHHHHHHHHHHHHHHhcCCeEEEEecCCCC------------cCcceeEEecCCeEecCC----ceecHHHHHHHHHHHh Q lcl|NC_019422. 110 EVDKTKIVNWIKTARREKEIYKAVLPNISD------------ANEKAIINFATTGIKVGE----KSYTTAEYTARLAGIL 173 (355) Q Consensus 110 ~~~~~~~~~~ik~~r~~g~~~~aVl~~~~~------------~d~egIinv~n~~i~~~~----~~~~~~~~~a~vAG~~ 173 (355) +++|+++++||||||++|+++++|+++.+. +|+|||+++.++++..++ ..|+++++|+|+||++ T Consensus 342 ~ai~~~l~a~vk~~r~~gk~~~aVlg~~~~~~~~~~~~~a~~~n~e~vi~v~~~~~~~~~~~~~~~~~~~~~aa~vAG~~ 421 (587) T protein:vir:96 342 QSVHSEVATFVKNRSDAGEPMRAIVGGGTSETKEKLFGRQAILNNPRVALVANSGKFVMGNGRILQAPAYMVASAVAGLV 421 (587) T ss_pred HHHHHHHHHHHHHHHhCCCeEEEEecCCCCCCHHHHHHHHhhcCCCcEEEEecceEEecCCCceeeechhhHHHHHHHHH Confidence 889999999999999999999999987543 469999999998876544 4799999999999999 Q ss_pred cCcccccccccccCC--cccccCChhhHHHHHhCCeEEEEE--CCcEEEEecCccccccCCCCCchhhhhhhHhhHHHHH Q lcl|NC_019422. 174 AGISLSESCTYFILD--EVTEIEPTENPDEAVEEGKLILIN--NNGIRIARGVNSLITLSKEDTEDLKKIKIVEAIDMIQ 249 (355) Q Consensus 174 Ag~~~~~S~T~~~~~--~~~~~~~~~e~~~ai~~G~lvl~~--dg~v~I~~~INSltt~~~~k~~~f~kirvvr~~D~i~ 249 (355) ||+++++|+||++++ ++...++++|++++|++|+|+|++ ++.+++.|+|||+||++.+++++|+||+++|+||+|. 
T Consensus 422 Ag~~~~~S~T~~~~~~~~v~~~~t~~e~~~~i~~G~~~l~~~~~~~~~v~~~vnsitT~t~~~~~~~~~i~virv~D~i~ 501 (587) T protein:vir:96 422 SGLDIGESITFKPLFVNSLDKVYESEELDELNENGIITIEFVRNRMTTMFRIVDDVTTFPDKNDPVKSEMALGEANDFLV 501 (587) T ss_pred hcCccccCccceeeecccccccCCHHHHHHHHhCCeEEEEEecCCcEEEEEeeccceecCCCCCchhhhhhhHHHHHHHH Confidence 999999999999876 556778999999999999999975 3468999999999999999999999999999999999 Q ss_pred HHHHHHHhhccccccCCCHHHHHHHHHHHHHHHHHHHhcccccCCCCceeEeccccchhhhhccccccccccceeeeccC Q lcl|NC_019422. 250 DDILQTWNENYVGKVTNKYDNKILFLSAVNNYFKELQRDEVLDNSQEAYAQIDIEAHKKYLKEKGIDYSEMTEQQIKEAN 329 (355) Q Consensus 250 ~di~~~~~~~yiGK~~N~~~gr~~~~~~i~~yl~~l~~~g~I~~~~~~~v~id~e~q~~~~~~~~~d~~~~~d~~v~~~~ 329 (355) +||+++|++.|||| +|+.++|.+||++|++||++|+++|+|++|..+.+ ++.. T Consensus 502 ~di~~~~~~~yiGk-~nn~~~r~~v~~~i~~~L~~l~~~g~I~~~~~~dv------------------------~v~~-- 554 (587) T protein:vir:96 502 SELKILLEEQYIGT-RTINTSASQIKDFVQSYLGRKKRDNEIQDFPPEDV------------------------QVII-- 554 (587) T ss_pred HHHHHHHHhcCCcc-ccCHHHHHHHHHHHHHHHHHHHhCCcccCCCccce------------------------EEEe-- Confidence 99999999999999 79999999999999999999999999998854333 3322 Q ss_pred CCCEEEEEEEEEEEeeeeEEEEEEeC Q lcl|NC_019422. 330 TGSYVFVEGNITITDAMEDLKFKIYM 355 (355) Q Consensus 330 ~~d~v~~~~~i~~vdamEkiy~tv~v 355 (355) .+|.++|++.++|+++||+||||+.+ T Consensus 555 ~~D~~~v~~~v~Pv~~mekIy~tv~~ 580 (587) T protein:vir:96 555 EGNEARISLTIFPIRALKKISVSLVY 580 (587) T ss_pred cCCEEEEEEEEEEcccceEEEEEEEE Confidence 24678999999999999999999999 No 6 >protein:vir:80488 Length: 562 # NCBI annotation: Tsh # Family: family:all:2449 # MgeID: mge:1883 # MgeName: A511 # Cross-refs: genbank:acc:YP_001468473;genbank:gi:157325048;genbank:GeneID:5601446 Probab=100.00 E-value=2.8e-72 Score=412.87 Aligned_cols=323 Identities=12% Similarity=0.117 Sum_probs=250.4 Q ss_pred CCC-CceEEEeee----eeeeeecCCCceeEEEE-EecCCcc-c---eeEEEeeh-hhhhhhhhhHHHHHHHHhhhc--c Q lcl|NC_019422. 
1 MGL-PSAIIEFQR----RSRTVKFRSRRGVVALI-LKDSTAI-K---KSYSIDFL-TDINETEFTKENYDYIRLAFL--G 67 (355) Q Consensus 1 ~g~-P~~~i~f~~----~a~ta~~~~~rG~v~ii-l~d~~~~-~---~~~~~~~~-~d~~~~~~~~~n~~~i~~a~~--g 67 (355) .|- |.+.-.... ...+|+|.+.||...-+ ..|.... . ....+... .|+. .......|+...+. + T Consensus 197 ~g~~~~~~~l~~~i~~~~~~tAky~g~~~n~i~~~~~d~~~~~~~kt~~~~v~~~~~d~~---~~~~~n~~v~~~~~~~~ 273 (562) T protein:vir:80 197 SGAYAETNVLISDINNLPDFEAKFFPIGDKNLTTDNFDAQIDVDIKTKEAYVKAVGGDIE---KQTAYNGYVEFEFDRSK 273 (562) T ss_pred CCccchhhhhhhhhccccceEEEecccCCceeeecccccchhhhcccceeeeeehhhhhh---hcccccceEEEEeccCc Confidence 221 111100000 11589999999975433 1121111 1 11112111 1110 11111234433322 2 Q ss_pred ---ccceEEEEecCCCccchhHHHHHHHHHhcccceEEEEcCCChHHHHHHHHHHHHHHHhcCCeEEEEecCCCC----- Q lcl|NC_019422. 68 ---KPSKVIVEVINDSVDSERSLDDALKALRENKFNYLAIPFISEEVDKTKIVNWIKTARREKEIYKAVLPNISD----- 139 (355) Q Consensus 68 ---~~~~v~l~~g~~g~~~~~~y~~al~~le~~~fn~l~~p~~~d~~~~~~~~~~ik~~r~~g~~~~aVl~~~~~----- 139 (355) ..+.+.|+||++|+. +++|+++|++||.++|+++++++ .++++|+++++||||||++|+++++|+++.+. T Consensus 274 ~la~~~~~~LtGG~dG~~-~~~~~dal~~Le~~~~~~i~~~t-~d~ai~~~~~a~vkr~r~~g~~~~aVvg~~~~~~~~~ 351 (562) T protein:vir:80 274 EIANFPLTKLTGGDNGTI-PESWADKFSYFANEGGYYLVPLT-SKQAVHAEALQFVRDCSYNGNPMRVFVGGGIGESMEQ 351 (562) T ss_pred cccccceeeeeCCCCCCc-cccHHHHHHHHHhCCcEEEEecC-CChHHHHHHHHHHHHHHhCCCeEEEEecCCCCCCHHH Confidence 234678999999976 56899999999999999998875 57889999999999999999999999986542 Q ss_pred -------cCcceeEEecCCeEecCC----ceecHHHHHHHHHHHhcCcccccccccccCCc--ccccCChhhHHHHHhCC Q lcl|NC_019422. 
140 -------ANEKAIINFATTGIKVGE----KSYTTAEYTARLAGILAGISLSESCTYFILDE--VTEIEPTENPDEAVEEG 206 (355) Q Consensus 140 -------~d~egIinv~n~~i~~~~----~~~~~~~~~a~vAG~~Ag~~~~~S~T~~~~~~--~~~~~~~~e~~~ai~~G 206 (355) +|||||+++.++.+..++ ..|+++++|+|+||++||+++++|+||+++++ +...++++|+++++++| T Consensus 352 ~~~~a~~~n~e~vv~v~~~~~~~~~~~~~~~~~~~~~aa~vAGl~Ag~~~~~S~T~~~i~~~~v~~~lt~~e~~~li~~G 431 (562) T protein:vir:80 352 LFTRAIGLQNERAGLIGFSGTVKMDDGRSLKMPGYMFAAQVAGLTCGLEIGEAITFKNIAIETLDTIYEGSQLDQLNESG 431 (562) T ss_pred HHHHhhhcCCCeEEEEecCeeEECCCCceeeechhHHHHHHHHHHhcCccccCccceeeccccccccCCHHHHHHHHhCC Confidence 579999999998765443 56999999999999999999999999999874 45678999999999999 Q ss_pred eEEEEEC--CcEEEEecCccccccCCCCCchhhhhhhHhhHHHHHHHHHHHHhhccccccCCCHHHHHHHHHHHHHHHHH Q lcl|NC_019422. 207 KLILINN--NGIRIARGVNSLITLSKEDTEDLKKIKIVEAIDMIQDDILQTWNENYVGKVTNKYDNKILFLSAVNNYFKE 284 (355) Q Consensus 207 ~lvl~~d--g~v~I~~~INSltt~~~~k~~~f~kirvvr~~D~i~~di~~~~~~~yiGK~~N~~~gr~~~~~~i~~yl~~ 284 (355) +|+|+++ +.+++.|+||++||++.++++.|+||+++|++|+|.+|||++|++.|||| +||.++|.+||++|++||++ T Consensus 432 ~l~l~~~~~~~v~~~riv~~itT~t~~~~~~~~ki~viRv~D~i~~dir~~~~~~yIGk-~Nn~~~r~~v~~~i~~~L~~ 510 (562) T protein:vir:80 432 IITAEFVRNRAVTNFRIVDDVTTFNDKTDPVKSEIGVGEANDFLVSELKISLDNEYIGT-KIIDTSASLVKNFVQSFLDR 510 (562) T ss_pred eEEEEEecCCcEEEEEeeccceeccCCCCchhhhhhhhHHHHHHHHHHHHHHHhcCCcc-ccChHHHHHHHHHHHHHHHH Confidence 9999763 45889999999999999999999999999999999999999999999999 68889999999999999999 Q ss_pred HHhcccccCCCCceeEeccccchhhhhccccccccccceeeeccCCCCEEEEEEEEEEEeeeeEEEEEEeC Q lcl|NC_019422. 285 LQRDEVLDNSQEAYAQIDIEAHKKYLKEKGIDYSEMTEQQIKEANTGSYVFVEGNITITDAMEDLKFKIYM 355 (355) Q Consensus 285 l~~~g~I~~~~~~~v~id~e~q~~~~~~~~~d~~~~~d~~v~~~~~~d~v~~~~~i~~vdamEkiy~tv~v 355 (355) |+++|+|++|..+.+ ++.. 
..|.+||++.++|+++||+||||+.+ T Consensus 511 l~~~gaI~~~~~~dv------------------------~v~~--~~d~~~v~~~v~Pv~~mekIy~ti~~ 555 (562) T protein:vir:80 511 KKLAKEIQDYSPEEV------------------------QVVI--EGDIARISLTVFPIRSMKKIEVSLVY 555 (562) T ss_pred HHhCCcccCCCccce------------------------EEEe--cCCEEEEEEEEEEcccceEEEEEEEE Confidence 999999998854333 3332 24678999999999999999999999 No 7 >protein:vir:63742 Length: 562 # NCBI annotation: Tsh # Family: family:all:2449 # MgeID: mge:1517 # MgeName: P100 # Cross-refs: genbank:gi:82547629;genbank:GeneID:3783475 Probab=100.00 E-value=3.1e-70 Score=401.63 Aligned_cols=326 Identities=11% Similarity=0.106 Sum_probs=248.8 Q ss_pred CCC-CceEEEee----eeeeeeecCCCceeEEEE-EecCCccceeEEEeehhhhhhhhhh--HHHHHHHHhhhc--c--- Q lcl|NC_019422. 1 MGL-PSAIIEFQ----RRSRTVKFRSRRGVVALI-LKDSTAIKKSYSIDFLTDINETEFT--KENYDYIRLAFL--G--- 67 (355) Q Consensus 1 ~g~-P~~~i~f~----~~a~ta~~~~~rG~v~ii-l~d~~~~~~~~~~~~~~d~~~~~~~--~~n~~~i~~a~~--g--- 67 (355) .|- |.+.-... ....+|+|.+.||...-+ ..|........+..........+.. ..-..|+..... + T Consensus 197 ~g~~~~~~~l~~~in~~~~~~aky~~~~gn~i~~~~~d~~~~~~vkt~~~~v~t~~~d~~~~~~~~~~v~~~~~~~~~la 276 (562) T protein:vir:63 197 SGAYAETNVLISDINNLPDFEAKFFPIGDKNLTTDNFDAQIDVDIKTKEAYVKAVGGDIEKQTAYNGYVDFEFDRSKEIA 276 (562) T ss_pred CCccchhHHHHHhhccccceEEEeeccCCceeeeeccccccccchhhhhhhhhhhhhhhhhcccccceeeeeecccccee Confidence 110 11100000 011478899999864433 2222211111111100000000000 011234433221 2 Q ss_pred ccceEEEEecCCCccchhHHHHHHHHHhcccceEEEEcCCChHHHHHHHHHHHHHHHhcCCeEEEEecCCCC-------- Q lcl|NC_019422. 68 KPSKVIVEVINDSVDSERSLDDALKALRENKFNYLAIPFISEEVDKTKIVNWIKTARREKEIYKAVLPNISD-------- 139 (355) Q Consensus 68 ~~~~v~l~~g~~g~~~~~~y~~al~~le~~~fn~l~~p~~~d~~~~~~~~~~ik~~r~~g~~~~aVl~~~~~-------- 139 (355) ..+.+.|+||++|+. ..+|+++|++||.++|+++++++ +++++|+++.+|+||||++|+++++|+++.+. 
T Consensus 277 ~~~~~~LtGG~dGt~-~~~~~~al~ale~~~~~~i~~~t-~d~av~~~l~a~vkr~~~~g~~~~aVlg~~~~~~~~~~~~ 354 (562) T protein:vir:63 277 NFPLTKLTGGDNGTI-PESWADKFSYFANEGGYYLVPLT-SKQAVHAEALQFVRDCSYNGNPMRVFVGGGIGESMEQLFT 354 (562) T ss_pred cccceeeecCCCCCc-hhhHHHHHHHHHhCCcEEEEecC-CCHHHHHHHHHHHHHHHhCCCcEEEEecCCCCCCHHHHHH Confidence 234678999999976 56899999999999999998765 67889999999999999999999999987542 Q ss_pred ----cCcceeEEecCCeEecCC----ceecHHHHHHHHHHHhcCcccccccccccCC--cccccCChhhHHHHHhCCeEE Q lcl|NC_019422. 140 ----ANEKAIINFATTGIKVGE----KSYTTAEYTARLAGILAGISLSESCTYFILD--EVTEIEPTENPDEAVEEGKLI 209 (355) Q Consensus 140 ----~d~egIinv~n~~i~~~~----~~~~~~~~~a~vAG~~Ag~~~~~S~T~~~~~--~~~~~~~~~e~~~ai~~G~lv 209 (355) +|+|+|+++.++.+..++ ..|+++++|+|+||++|++++++|+||++++ ++...++++|+++++++|+|+ T Consensus 355 ~a~~~n~ervv~v~~~~~~~~~~~~~~~~~~~~~aa~vAGl~A~~~~~~SlT~~~i~~~~v~~~~t~~e~~~li~~Gv~~ 434 (562) T protein:vir:63 355 RAIGLQNERAGLIGFSGTVKMDDGRSLKMPGYMFAAQVAGLTCGLEIGEAITFKNIAIETLDTIYEGSQLDQLNESGIIT 434 (562) T ss_pred HhhhcCCCcEEEEecCeeEECCCCceeeechhHHHHHHHHHhhcCchhcCccceeeccccccccCCHHHHHHHHhCCeEE Confidence 579999999999876543 4699999999999999999999999999876 455678999999999999999 Q ss_pred EEEC--CcEEEEecCccccccCCCCCchhhhhhhHhhHHHHHHHHHHHHhhccccccCCCHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019422. 210 LINN--NGIRIARGVNSLITLSKEDTEDLKKIKIVEAIDMIQDDILQTWNENYVGKVTNKYDNKILFLSAVNNYFKELQR 287 (355) Q Consensus 210 l~~d--g~v~I~~~INSltt~~~~k~~~f~kirvvr~~D~i~~di~~~~~~~yiGK~~N~~~gr~~~~~~i~~yl~~l~~ 287 (355) |++. 
+.+++.|+||++||++.+++++|+||+++|+||+|.+|||.+|++.|||| +||.++|.+||++|++||++|++ T Consensus 435 l~~~~~~~v~~~~iv~~itT~t~~~~~~~~ki~viRv~D~i~~dir~~~~~~yiGk-~Nn~~~r~~v~~~i~~~L~~l~~ 513 (562) T protein:vir:63 435 AEFVRNRAVTNFRIVDDVTTFNDKTDPVKSEIGVGEANDFLVSELKISLDNEYIGT-KIIDTSASLVKNFVQSFLDRKKL 513 (562) T ss_pred EEEecCCcEEEEEeeccceecCCCCCchhhhhhhhHHHHHHHHHHHHHHHhcCCcc-ccChHHHHHHHHHHHHHHHHHHh Confidence 9753 46889999999999999999999999999999999999999999999999 68889999999999999999999 Q ss_pred cccccCCCCceeEeccccchhhhhccccccccccceeeeccCCCCEEEEEEEEEEEeeeeEEEEEEeC Q lcl|NC_019422. 288 DEVLDNSQEAYAQIDIEAHKKYLKEKGIDYSEMTEQQIKEANTGSYVFVEGNITITDAMEDLKFKIYM 355 (355) Q Consensus 288 ~g~I~~~~~~~v~id~e~q~~~~~~~~~d~~~~~d~~v~~~~~~d~v~~~~~i~~vdamEkiy~tv~v 355 (355) +|+|++|..+ |+++.. ..|.++|++.++|+++||+||||+.+ T Consensus 514 ~gaI~~~~~~------------------------dv~v~~--~~d~~~v~~~v~pv~~mekIy~ti~~ 555 (562) T protein:vir:63 514 AKEIQDYSPE------------------------EVQVVI--EGDVARISLTVFPIRSMKKIEVSLVY 555 (562) T ss_pred CCcccCCCcc------------------------ceEEEe--cCCEEEEEEEEEEcccceEEEEEEEE Confidence 9999988543 233332 24679999999999999999999999 No 8 >protein:vir:80779 Length: 569 # NCBI annotation: putative tail sheath protein # Family: family:all:2449 # MgeID: mge:1885 # MgeName: phiEF24C # Cross-refs: genbank:acc:YP_001504132;genbank:gi:158079319;genbank:GeneID:5666428 Probab=100.00 E-value=2.5e-69 Score=396.73 Aligned_cols=326 Identities=11% Similarity=0.132 Sum_probs=241.0 Q ss_pred CCCC-c--eEEEeeeee---------eeeecCCCceeEEEEEecCCccc--------eeEEEeehhhhhhh---hhhH-- Q lcl|NC_019422. 1 MGLP-S--AIIEFQRRS---------RTVKFRSRRGVVALILKDSTAIK--------KSYSIDFLTDINET---EFTK-- 55 (355) Q Consensus 1 ~g~P-~--~~i~f~~~a---------~ta~~~~~rG~v~iil~d~~~~~--------~~~~~~~~~d~~~~---~~~~-- 55 (355) .|-+ + +.+.+..++ ..+++++.+|.-+-+........ ..+.+++..+.... +... 
T Consensus 187 ~g~~~~~~~~v~~~~~~~~~~~~~~~lv~~~~~~~~f~a~~~~~~~~~~~~~~~d~~~~~~~~t~~~~~~~~~~di~~~~ 266 (569) T protein:vir:80 187 VGSEPESTTEVMKYELGQGVYSETNVLVSAINSLPDWEAKFFPIGDKNLPTDALEAVTKVDVKTEAVFVGALAGDIAKQL 266 (569) T ss_pred ecCCcceeEEEEeeccCCccchhhhhhhhhcCCccCceEEEEecCCCcceehhccchhheeccccceeeehhHHHHHHhh Confidence 1111 0 011111111 12233444443222110000000 00011111000000 0000 Q ss_pred HHHHHHHhhhcc--c---cceEEEEecCCCccchhHHHHHHHHHhcccceEEEEcCCChHHHHHHHHHHHHHHHhcCCeE Q lcl|NC_019422. 56 ENYDYIRLAFLG--K---PSKVIVEVINDSVDSERSLDDALKALRENKFNYLAIPFISEEVDKTKIVNWIKTARREKEIY 130 (355) Q Consensus 56 ~n~~~i~~a~~g--~---~~~v~l~~g~~g~~~~~~y~~al~~le~~~fn~l~~p~~~d~~~~~~~~~~ik~~r~~g~~~ 130 (355) ...+|+.+...+ . .+.+.|+||++|+. ..+|+++|++||.++||++++++ .++++|+++.+||||||++|+++ T Consensus 267 ~~~~~v~~~~~~~~~l~~~~~~~LtGG~dG~~-~~~~~~~l~~le~~~~~~i~~~t-~d~av~~~l~a~vkr~r~~g~~~ 344 (569) T protein:vir:80 267 EYNDYVTVAVDATKPVEDFELTNLTGGSDGTA-PESWANKFPLLANEGGYYLVPLT-DKQAVHSEALAFVKDRTDNGDPM 344 (569) T ss_pred cCCceEEEEecCCcceeeecceeecCCCCCCc-cchHHHHHHHHhhCCcEEEEecC-CChHHHHHHHHHHHHHHhCCCcE Confidence 012344333222 2 23456899999875 57899999999999999999876 57889999999999999999999 Q ss_pred EEEecCCC------------CcCcceeEEecCCeEecCC----ceecHHHHHHHHHHHhcCcccccccccccCC--cccc Q lcl|NC_019422. 131 KAVLPNIS------------DANEKAIINFATTGIKVGE----KSYTTAEYTARLAGILAGISLSESCTYFILD--EVTE 192 (355) Q Consensus 131 ~aVl~~~~------------~~d~egIinv~n~~i~~~~----~~~~~~~~~a~vAG~~Ag~~~~~S~T~~~~~--~~~~ 192 (355) ++|+++.+ .+|+|+++++.++...+++ ..|+++++|+|+||++||+++++|+||++++ ++.. 
T Consensus 345 ~aVvg~~~~~~~~~~~~~a~~~n~e~vv~v~~~~~~~~~~g~~~~~~~~~~aa~vAG~~A~~~~~~S~T~k~i~~~~i~~ 424 (569) T protein:vir:80 345 RIIVGGGTNETVEESITRATNLRDPRASLVGFSGTRKMDDGRLLKLPGYMMASQIAGIASGLEVGEAITFKHFNVTSVDR 424 (569) T ss_pred EEEecCCCCCCHHHHHHHHhhcCCCeEEEEecCceeecCCCcceeechhhHHHHHHHHHhcCccccCccceeeccccccc Confidence 99998643 3589999999998755543 5799999999999999999999999999886 4556 Q ss_pred cCChhhHHHHHhCCeEEEEEC--CcEEEEecCccccccCCCCCchhhhhhhHhhHHHHHHHHHHHHhhccccccCCCHHH Q lcl|NC_019422. 193 IEPTENPDEAVEEGKLILINN--NGIRIARGVNSLITLSKEDTEDLKKIKIVEAIDMIQDDILQTWNENYVGKVTNKYDN 270 (355) Q Consensus 193 ~~~~~e~~~ai~~G~lvl~~d--g~v~I~~~INSltt~~~~k~~~f~kirvvr~~D~i~~di~~~~~~~yiGK~~N~~~g 270 (355) .++++|+++++++|+|+|++. +.+++.|+||++||++.+++++|+||+++|++|+|.+|||.+|++.|||| +|+.++ T Consensus 425 ~lt~~e~~~li~~G~~~l~~~~~~~~~v~~~vn~itT~t~~~~~~~~~i~viRv~D~i~~dir~~~~~~yiGk-~nn~~~ 503 (569) T protein:vir:80 425 VFESSQLDMLNESGVISIEFVRNRTLTAFRVVQDVTTYNDKSDPVKNEMSVGEANDFLVSELKIELDNNFIGT-KVIDTS 503 (569) T ss_pred cCCHHHHHHHHhCCeEEEEEecCceEEEEEEeccceecCCCCCchhhhhhhhHHHHHHHHHHHHHHHhhcCcc-cCChhH Confidence 789999999999999999763 46889999999999999999999999999999999999999999999999 688899 Q ss_pred HHHHHHHHHHHHHHHHhcccccCCCCceeEeccccchhhhhccccccccccceeeeccCCCCEEEEEEEEEEEeeeeEEE Q lcl|NC_019422. 271 KILFLSAVNNYFKELQRDEVLDNSQEAYAQIDIEAHKKYLKEKGIDYSEMTEQQIKEANTGSYVFVEGNITITDAMEDLK 350 (355) Q Consensus 271 r~~~~~~i~~yl~~l~~~g~I~~~~~~~v~id~e~q~~~~~~~~~d~~~~~d~~v~~~~~~d~v~~~~~i~~vdamEkiy 350 (355) |.+|+++|++||++|+++|+|+++....+ ++.. .+|.+||++.++|+++||+|| T Consensus 504 r~~v~~~i~~~L~~l~~~gaI~~~~~~dv------------------------~v~~--~~d~~~v~~~v~Pv~~~ekI~ 557 (569) T protein:vir:80 504 ASLIKNFIQSFLDNKKRAREIQDYTPEEV------------------------QVVL--EGDVASISMTVMPIRSLNKIT 557 (569) T ss_pred HHHHHHHHHHHHHHHHhCCcccCCCccce------------------------EEEe--cCCEEEEEEEEEEcccccEEE Confidence 99999999999999999999998754322 3322 246799999999999999999 Q ss_pred EEEeC Q lcl|NC_019422. 
351 FKIYM 355 (355) Q Consensus 351 ~tv~v 355 (355) +++.+ T Consensus 558 ~ti~~ 562 (569) T protein:vir:80 558 VQLVY 562 (569) T ss_pred EEEEE Confidence 99999 No 9 >protein:vir:99306 Length: 587 # NCBI annotation: putative major tail sheath protein # Family: family:all:2449 # MgeID: mge:1655 # MgeName: K # Cross-refs: genbank:acc:YP_024479;genbank:gi:48696438;genbank:GeneID:2948034 Probab=100.00 E-value=1e-67 Score=387.80 Aligned_cols=323 Identities=13% Similarity=0.126 Sum_probs=234.8 Q ss_pred CCCCceEEEe-ee----eeeeeecCCCceeEEEEEe-cCCccceeEE--------------------Eeehhhhhh---h Q lcl|NC_019422. 1 MGLPSAIIEF-QR----RSRTVKFRSRRGVVALILK-DSTAIKKSYS--------------------IDFLTDINE---T 51 (355) Q Consensus 1 ~g~P~~~i~f-~~----~a~ta~~~~~rG~v~iil~-d~~~~~~~~~--------------------~~~~~d~~~---~ 51 (355) .|-....... .. .-.||+|.+.+|.-..+-. +........+ +...+.+.. . T Consensus 197 ~g~~~~~~~~~~~i~~~~~~tAky~~~~~~~i~~~~~~~~~~~~v~~~~~~v~a~~~D~~~~~~~~~~~~~~~~~g~~~~ 276 (587) T protein:vir:99 197 GGAYDYTNAIITDINQLPDFEAKLSPFGDKNLESSKLDKIENANIKDKAVYVKAVFGDLEKQTAYNGIVSFEQLNAEGEV 276 (587) T ss_pred CCchHHHHHHHhhhccccceeEEeeccCCceeEeecccccccceeeeeeeeeehhccceeeecccceeeeeeecccccch Confidence 2211110000 00 0137777777774222211 1110000000 000000000 0 Q ss_pred hhhH--HHHHHHHhh-------hccccceEEEEecCCCccchhHHHHHHHHHhcccceEEEEcCCChHHHHHHHHHHHHH Q lcl|NC_019422. 52 EFTK--ENYDYIRLA-------FLGKPSKVIVEVINDSVDSERSLDDALKALRENKFNYLAIPFISEEVDKTKIVNWIKT 122 (355) Q Consensus 52 ~~~~--~n~~~i~~a-------~~g~~~~v~l~~g~~g~~~~~~y~~al~~le~~~fn~l~~p~~~d~~~~~~~~~~ik~ 122 (355) .... ....+.... .+...+.+.|+||++|++ .++|+++|++||.++||+|+++. 
+++++|+++++|||| T Consensus 277 ~~~~~~~~~~~~~~~~~~~~~~~~a~~~~t~LtGG~dG~~-~~sy~~al~ale~~~~~~i~~~t-~d~~i~a~l~a~vk~ 354 (587) T protein:vir:99 277 PSNVEVEAGEESATVTATSPIKTIEPFELTKLKGGTNGEP-PATWADKLDKFAHEGGYYIVPLS-SKQSVHAEVASFVKE 354 (587) T ss_pred hhhhhhhhccccceeeeeccccceecccceeeecCCCCCc-cccHHHHHHHHhhCCcEEEEecC-CCHHHHHHHHHHHHH Confidence 0000 000010000 111223456899999876 47899999999999999998775 678899999999999 Q ss_pred HHhcCCeEEEEecCCC------------CcCcceeEEecCCeEec--CC--ceecHHHHHHHHHHHhcCccccccccccc Q lcl|NC_019422. 123 ARREKEIYKAVLPNIS------------DANEKAIINFATTGIKV--GE--KSYTTAEYTARLAGILAGISLSESCTYFI 186 (355) Q Consensus 123 ~r~~g~~~~aVl~~~~------------~~d~egIinv~n~~i~~--~~--~~~~~~~~~a~vAG~~Ag~~~~~S~T~~~ 186 (355) ||++|+++++|+++.. .+||||||||+++.+.. +| ..|+++++|+|+||++||+++++|+||++ T Consensus 355 ~r~~g~~~~aVlg~~~~~~~~~~~~~a~~~n~e~vi~v~~~~~~~~~dg~~~~~~~~~~aa~vAGl~Ag~~~~~SlT~~~ 434 (587) T protein:vir:99 355 RSDAGEPMRAIVGGGFNESKEQLFGRQASLSNPRVSLVANSGTFVMDDGRKNHVPAYMVAVALGGLASGLEIGESITFKP 434 (587) T ss_pred HHhCCCcEEEEecCCCCCCHHHHHHHhhhcCCCcEEEEeccceEecCCCceeeechHHHHHHHHHHHhcCchhcCcccee Confidence 9999999999998643 35899999999996543 34 45999999999999999999999999998 Q ss_pred CC--cccccCChhhHHHHHhCCeEEEEEC--C---cEEEEecCccccccCCCCCchhhhhhhHhhHHHHHHHHHHHHhhc Q lcl|NC_019422. 187 LD--EVTEIEPTENPDEAVEEGKLILINN--N---GIRIARGVNSLITLSKEDTEDLKKIKIVEAIDMIQDDILQTWNEN 259 (355) Q Consensus 187 ~~--~~~~~~~~~e~~~ai~~G~lvl~~d--g---~v~I~~~INSltt~~~~k~~~f~kirvvr~~D~i~~di~~~~~~~ 259 (355) ++ ++...++++|+++++++|+|+|+.. + .+||++||||+ +.++++.|+||+++|++|+|.+||++.|++. 
T Consensus 435 i~~~~v~~~~t~~e~e~li~~Gvl~l~~~~~~~~~~vriv~~ItT~---t~~~~~~~~~i~viRv~D~i~~di~~~~~~~ 511 (587) T protein:vir:99 435 LRVSSLDQIYESIDLDELNENGIISIEFVRNRTNTFFRIVDDVTTF---NDKSDPVKAEMAVGEANDFLVSELKVQLEDQ 511 (587) T ss_pred eecccccccCCHHHHHHHHhCCeEEEEEecCCcceEEEEeeceeec---cCCCCchhhhhhhhhhHHHHHHHHHHHHHhh Confidence 76 5667789999999999999999742 2 36788888775 5788999999999999999999999999999 Q ss_pred cccccCCCHHHHHHHHHHHHHHHHHHHhcccccCCCCceeEeccccchhhhhccccccccccceeeeccCCCCEEEEEEE Q lcl|NC_019422. 260 YVGKVTNKYDNKILFLSAVNNYFKELQRDEVLDNSQEAYAQIDIEAHKKYLKEKGIDYSEMTEQQIKEANTGSYVFVEGN 339 (355) Q Consensus 260 yiGK~~N~~~gr~~~~~~i~~yl~~l~~~g~I~~~~~~~v~id~e~q~~~~~~~~~d~~~~~d~~v~~~~~~d~v~~~~~ 339 (355) |||| +|+..+|.+||++|++||++|+++|+|++|..+ |+++.. .+|.+||++. T Consensus 512 yiGk-~Nn~~~r~~i~~~i~~~L~~l~~~gaI~~~~~~------------------------dv~v~~--~~d~~~v~~~ 564 (587) T protein:vir:99 512 FIGT-RTINTSASIIKDFIQSYLGRKKRDNEIQDFPAE------------------------DVQVIV--EGNEARISMT 564 (587) T ss_pred CCcc-ccchHHHHHHHHHHHHHHHHHHhCCcccCCCcc------------------------ceEEEe--cCCEEEEEEE Confidence 9999 688899999999999999999999999988543 333332 3467999999 Q ss_pred EEEEeeeeEEEEEEeC Q lcl|NC_019422. 340 ITITDAMEDLKFKIYM 355 (355) Q Consensus 340 i~~vdamEkiy~tv~v 355 (355) ++|+|+||+||+|+.+ T Consensus 565 v~Pv~~mekIy~tv~~ 580 (587) T protein:vir:99 565 VYPIRSFKKISVSLVY 580 (587) T ss_pred EEEcccceEEEEEEEE Confidence 9999999999999999 No 10 >protein:vir:100829 Length: 607 # NCBI annotation: hypothetical protein # Family: family:all:2449 # MgeID: mge:1633 # MgeName: LP65 # Cross-refs: genbank:acc:YP_164738;genbank:gi:56693151;genbank:GeneID:3197462 Probab=100.00 E-value=6.7e-67 Score=383.37 Aligned_cols=322 Identities=12% Similarity=0.066 Sum_probs=236.2 Q ss_pred CCCCceEEEe-eeeeeeeecCCCcee-EEE--------------EEecCCccceeEEEe-ehhhhhhhhhhHHHHHHHHh Q lcl|NC_019422. 
1 MGLPSAIIEF-QRRSRTVKFRSRRGV-VAL--------------ILKDSTAIKKSYSID-FLTDINETEFTKENYDYIRL 63 (355) Q Consensus 1 ~g~P~~~i~f-~~~a~ta~~~~~rG~-v~i--------------il~d~~~~~~~~~~~-~~~d~~~~~~~~~n~~~i~~ 63 (355) =-+|.....+ -...++++|.+.++. +-+ +++........++.. ...++. ......+...... T Consensus 225 n~~~~~~A~~~g~~~i~tky~d~~~~~i~V~~~~~iv~a~~~D~~~~~~~~~~~~~t~~~~~~~~~-~~~~~~~~~~~~~ 303 (607) T protein:vir:10 225 SATPNFSASVVGSPSVNTSYLDEVTSPVDVKTAPAVVTAKIGDAISKLGYDPYVVVTQTSNNKPIV-NGVSAGTGSATAS 303 (607) T ss_pred hcCCceEEEEecccceeeeccccccceeEEEEeeeeechhhhhhhhcccccceEEeeecccchhhh-hhhhccccceeee Confidence 1233322221 223456666666652 111 111112111111111 111111 1111111111111 Q ss_pred h---hccc---cceEEEEecCCCccchhHHHHHHHHHhcccceEEEEcCCChHHHHHHHHHHHHHHHhcCCeEEEEecCC Q lcl|NC_019422. 64 A---FLGK---PSKVIVEVINDSVDSERSLDDALKALRENKFNYLAIPFISEEVDKTKIVNWIKTARREKEIYKAVLPNI 137 (355) Q Consensus 64 a---~~g~---~~~v~l~~g~~g~~~~~~y~~al~~le~~~fn~l~~p~~~d~~~~~~~~~~ik~~r~~g~~~~aVl~~~ 137 (355) . ..+. .+.+.|+||++|+. +.+|+++|++||.++|++|++++ .++++|+++++||||||++|+++++|+++. T Consensus 304 ~~~~~~~~~a~~a~~~LtGGtdG~~-~~ty~dal~aLe~~e~~~i~~~t-~d~ai~~~l~a~vkr~~~~g~~~~aVlg~~ 381 (607) T protein:vir:10 304 VTTAPESFPANFDTAFLTGGSTGDV-PVSWADKFNGAIGNNVYYIIPLT-SEENIHAELQAFIDEQHVLGYNYHAFVGGG 381 (607) T ss_pred eeccccccccccceeeeeCCCCCCc-hhhHHHHHHHHhhcCceEEEecC-CCHHHHHHHHHHHHHHHhCCCcEEEEecCC Confidence 1 1122 34566999999975 57899999999999999999886 678899999999999999999999999875 Q ss_pred CC------------cCcceeEEecCCeEecCC---ceecHHHHHHHHHHHhcCcccccccccccCC--cccccCChhhHH Q lcl|NC_019422. 138 SD------------ANEKAIINFATTGIKVGE---KSYTTAEYTARLAGILAGISLSESCTYFILD--EVTEIEPTENPD 200 (355) Q Consensus 138 ~~------------~d~egIinv~n~~i~~~~---~~~~~~~~~a~vAG~~Ag~~~~~S~T~~~~~--~~~~~~~~~e~~ 200 (355) .. 
+|||||+|+.++....++ ..++++++|+|+||++||+++++|+||++++ ++...++++|++ T Consensus 382 ~~~t~~~~~t~a~~~N~ervv~V~~~~~~~~~G~~~~~~~~~~Aa~vAGl~Ag~~~~~SlT~k~i~~~~v~~~lt~~e~e 461 (607) T protein:vir:10 382 FAEPLEQILSRQVNINDSRFGLVGQSGHVQEGGESVHVPAYLMAAYVGGLSSSLGVAVPITNKKLALVDLDQNFSGDDLN 461 (607) T ss_pred CCCCHHHHHHHHHhhCCCcEEEEecCeeEeeCCcceeccHHHHHHHHHHHHhcCccccCcccceeccccccccCCHHHHH Confidence 43 469999999998765544 4689999999999999999999999999886 445678999999 Q ss_pred HHHhCCeEEEEEC------CcEEEEecCccccccCCCCCchhhhhhhHhhHHHHHHHHHHHHhhccccccCCCHHHHHHH Q lcl|NC_019422. 201 EAVEEGKLILINN------NGIRIARGVNSLITLSKEDTEDLKKIKIVEAIDMIQDDILQTWNENYVGKVTNKYDNKILF 274 (355) Q Consensus 201 ~ai~~G~lvl~~d------g~v~I~~~INSltt~~~~k~~~f~kirvvr~~D~i~~di~~~~~~~yiGK~~N~~~gr~~~ 274 (355) ++|++|+|+|+.+ +.|||++||||+ ++++++.|+||+++|++|+|.+|||++|++.||||++| ...|.++ T Consensus 462 ~ai~~Gv~~l~~~~~~~~~~~vrIv~~ItT~---t~~~~~~~~~i~viRv~D~i~~dir~~~~~~yIGk~nn-d~~~~~v 537 (607) T protein:vir:10 462 TLNQNGVIGIEHLVNRNATGGYYIVQDVSTN---TVSSSHVDGSLYLGELTDFLFDNLRFVLRDTYIGSNIR-STSADDI 537 (607) T ss_pred HHHhCCeEEEEEccCccccceEEEeeeeeec---cCCCCcchheeehhhhHHHHHHHHHHHHhhcCCcccCC-cchHHHH Confidence 9999999999753 248999999875 46788999999999999999999999999999999755 5678889 Q ss_pred HHHHHHHH--HHHHhcccccCCCCceeEeccccchhhhhccccccccccceeeeccCCCCEEEEEEEEEEEeeeeEEEEE Q lcl|NC_019422. 275 LSAVNNYF--KELQRDEVLDNSQEAYAQIDIEAHKKYLKEKGIDYSEMTEQQIKEANTGSYVFVEGNITITDAMEDLKFK 352 (355) Q Consensus 275 ~~~i~~yl--~~l~~~g~I~~~~~~~v~id~e~q~~~~~~~~~d~~~~~d~~v~~~~~~d~v~~~~~i~~vdamEkiy~t 352 (355) +..+..+| +.|+.+|+|++|.. +|+++.. 
.+|.++|++.++|+|+||+|||| T Consensus 538 k~~i~~~L~~~~l~~~gaI~df~~------------------------edv~v~~--~~D~v~v~~~v~Pv~~iekIyvt 591 (607) T protein:vir:10 538 KSTVASYLYSEMNNDDGLIVDFSE------------------------SDIVVTI--SGTVVYIQFAVAPTQEIKNIVVS 591 (607) T ss_pred HHHHHHHHHHHHHHhcCceeCCCc------------------------cccEEee--CCCEEEEEEEEEEcccceEEEEE Confidence 99998865 67888999998743 2444443 24689999999999999999999 Q ss_pred EeC Q lcl|NC_019422. 353 IYM 355 (355) Q Consensus 353 v~v 355 (355) +.+ T Consensus 592 v~v 594 (607) T protein:vir:10 592 GTY 594 (607) T ss_pred EEE Confidence 999 No 11 >protein:vir:95741 Length: 587 # NCBI annotation: ORF011 # Family: family:all:2449 # MgeID: mge:1577 # MgeName: G1 # Cross-refs: genbank:acc:YP_240910;genbank:gi:66394960;genbank:GeneID:5132682 Probab=100.00 E-value=5.4e-67 Score=383.90 Aligned_cols=323 Identities=13% Similarity=0.130 Sum_probs=234.4 Q ss_pred CCCCc--eEEEee----e-eeeeeecCCCceeEEEEEe-cCCccceeEE--------------------Eeehhhhh--- Q lcl|NC_019422. 1 MGLPS--AIIEFQ----R-RSRTVKFRSRRGVVALILK-DSTAIKKSYS--------------------IDFLTDIN--- 49 (355) Q Consensus 1 ~g~P~--~~i~f~----~-~a~ta~~~~~rG~v~iil~-d~~~~~~~~~--------------------~~~~~d~~--- 49 (355) ++... ...... + ...+|+|.+.||.-..+-. +......+.+ +.....+. T Consensus 195 L~~g~~~~~~~~~~~in~~~~~tAky~g~~~~~i~~~~~~~~~~~~v~~~~~~v~a~~~d~~~~~~~~~~v~~~~~~g~~ 274 (587) T protein:vir:95 195 LTGGAYDYTNAIITDINQLPDFEAKLSPFGDKNLESSKLDKIENANIKDKAVYVKAVFGDLEKQTAYNGIVSFEQLNAEG 274 (587) T ss_pred ecCCchHHHHHHHHhhccccceEEEEecccCceeEEeecCcccccceehhhhhhhhhhcceeeeeeceeeeeeecccccc Confidence 22111 000000 0 0136777777774322211 1111000000 00000000 Q ss_pred --hhhhhHHHHHHH--Hhhh--cc---ccceEEEEecCCCccchhHHHHHHHHHhcccceEEEEcCCChHHHHHHHHHHH Q lcl|NC_019422. 50 --ETEFTKENYDYI--RLAF--LG---KPSKVIVEVINDSVDSERSLDDALKALRENKFNYLAIPFISEEVDKTKIVNWI 120 (355) Q Consensus 50 --~~~~~~~n~~~i--~~a~--~g---~~~~v~l~~g~~g~~~~~~y~~al~~le~~~fn~l~~p~~~d~~~~~~~~~~i 120 (355) .......+..+. ...+ .+ ....+.|+||++|++ .++|+++|++||.++||+|+++. 
+++++|+++.+|| T Consensus 275 ~~~~~~~~~~~~~~a~~~~~~~~~~~a~~~~t~LtGG~dG~~-~~~y~~~l~ale~~~~~~i~~~t-~d~~v~a~l~a~v 352 (587) T protein:vir:95 275 EVPSNVEVEAGEESATVTATSPIKTIEPFELTKLKGGTNGEP-PATWADKLDKFAHEGGYYIVPLS-SKQSVHAEVASFV 352 (587) T ss_pred eeccchhhhhcccchheeccccccceeccceeeeecCCCCCC-cccHHHHHHHHHhCCcEEEEecC-CCHHHHHHHHHHH Confidence 000000111110 1111 11 123456999999876 57899999999999999998775 6788999999999 Q ss_pred HHHHhcCCeEEEEecCCCC------------cCcceeEEecCCeEec--CC--ceecHHHHHHHHHHHhcCccccccccc Q lcl|NC_019422. 121 KTARREKEIYKAVLPNISD------------ANEKAIINFATTGIKV--GE--KSYTTAEYTARLAGILAGISLSESCTY 184 (355) Q Consensus 121 k~~r~~g~~~~aVl~~~~~------------~d~egIinv~n~~i~~--~~--~~~~~~~~~a~vAG~~Ag~~~~~S~T~ 184 (355) ||||++|+++++|+++.+. +|+|||++|.++.... +| ..|+++++|+|+||++||+++++|+|| T Consensus 353 k~~~~~g~~~~aVvg~~~~~~~~~~~~~a~~~n~ervi~v~~~~~~~~~dg~~~~~~~~~~aa~vAGl~Ag~~~~~SlT~ 432 (587) T protein:vir:95 353 KERSDAGEPMRAIVGGGFNESKEQLFGRQESLSNPRVSLVANSGTFVMDDGRKNHVPAYMVAVALGGLASGLEIGESITF 432 (587) T ss_pred HHHHhCCCcEEEEEcCCCCCCHHHHHHHHhhcCCCcEEEecccceEecCCCceeeechHHHHHHHHHHHhcCchhcCccc Confidence 9999999999999987542 4799999999986543 34 458999999999999999999999999 Q ss_pred ccCC--cccccCChhhHHHHHhCCeEEEEEC--C---cEEEEecCccccccCCCCCchhhhhhhHhhHHHHHHHHHHHHh Q lcl|NC_019422. 185 FILD--EVTEIEPTENPDEAVEEGKLILINN--N---GIRIARGVNSLITLSKEDTEDLKKIKIVEAIDMIQDDILQTWN 257 (355) Q Consensus 185 ~~~~--~~~~~~~~~e~~~ai~~G~lvl~~d--g---~v~I~~~INSltt~~~~k~~~f~kirvvr~~D~i~~di~~~~~ 257 (355) ++++ ++...++++|+++++++|+|+|+.. 
+ .++|++||||+ +.++++.|+||+++|++|+|.+||++.|+ T Consensus 433 ~~i~~~~v~~~~t~~e~e~ai~~Gvl~l~~~~~~~~~~vriv~~itT~---t~~~d~~~~~i~viRv~D~i~~dir~~~~ 509 (587) T protein:vir:95 433 KPLRVSSLDQIYESIDLDELNENGIISIEFVRNRTNTFFRIVDDVTTF---NDKSDPVKAEMAVGEANDFLVSELKVQLE 509 (587) T ss_pred eeeecccccccCCHHHHHHHHhCCeEEEEEecCCcceEEEEeecceec---cCCCCcchhhhhhhhhHHHHHHHHHHHHH Confidence 9876 5667789999999999999999742 2 25777777764 57889999999999999999999999999 Q ss_pred hccccccCCCHHHHHHHHHHHHHHHHHHHhcccccCCCCceeEeccccchhhhhccccccccccceeeeccCCCCEEEEE Q lcl|NC_019422. 258 ENYVGKVTNKYDNKILFLSAVNNYFKELQRDEVLDNSQEAYAQIDIEAHKKYLKEKGIDYSEMTEQQIKEANTGSYVFVE 337 (355) Q Consensus 258 ~~yiGK~~N~~~gr~~~~~~i~~yl~~l~~~g~I~~~~~~~v~id~e~q~~~~~~~~~d~~~~~d~~v~~~~~~d~v~~~ 337 (355) +.|||| +|+..+|.+||++|++||++|+++|+|++|..+ |+++.. .+|.+||+ T Consensus 510 ~~~iGk-~nn~~~r~~v~~~i~~~L~~l~~~gaI~~~~~~------------------------dv~v~~--~~d~~~v~ 562 (587) T protein:vir:95 510 DQFIGT-RTINTSASIIKDFIQSYLGRKKRDNEIQDFPAE------------------------DVQVIV--EGNEARIS 562 (587) T ss_pred hhCCcc-ccchHHHHHHHHHHHHHHHHHHhCCcccCCCcc------------------------ceEEEe--cCCEEEEE Confidence 999999 688899999999999999999999999988543 333332 34679999 Q ss_pred EEEEEEeeeeEEEEEEeC Q lcl|NC_019422. 338 GNITITDAMEDLKFKIYM 355 (355) Q Consensus 338 ~~i~~vdamEkiy~tv~v 355 (355) +.++|+++||+||+|+.+ T Consensus 563 ~~v~Pv~~mekI~vt~~~ 580 (587) T protein:vir:95 563 MTVYPIRSFKKISVSLVY 580 (587) T ss_pred EEEEEcccceEEEEEEEE Confidence 999999999999999999 No 12 >protein:vir:7653 Length: 581 # NCBI annotation: gp124 # Family: family:all:1196 # MgeID: mge:148 # MgeName: Bxz1 # Cross-refs: genbank:acc:NP_818197;genbank:gi:29566631;genbank:GeneID:1259809 Probab=100.00 E-value=3.7e-57 Score=329.99 Aligned_cols=323 Identities=11% Similarity=0.072 Sum_probs=240.2 Q ss_pred CCCCceEE------Eeee--eeeeeecCCCc---eeE-EEE-EecCCccceeEEEeehhhhhhhhhhHH----H-----H Q lcl|NC_019422. 
1 MGLPSAII------EFQR--RSRTVKFRSRR---GVV-ALI-LKDSTAIKKSYSIDFLTDINETEFTKE----N-----Y 58 (355) Q Consensus 1 ~g~P~~~i------~f~~--~a~ta~~~~~r---G~v-~ii-l~d~~~~~~~~~~~~~~d~~~~~~~~~----n-----~ 58 (355) ++.+..+- ..+. .+......|.+ |.+ .+. .+.++...+++.+...++..+...... | . T Consensus 190 ~~~~~~~~~~~~~~~~~~~~~t~~~~~~g~~~~~~~i~~~~~~~~D~~~~~~v~~~~~~~~~~~~~~~~~~~g~~~~e~~ 269 (581) T protein:vir:76 190 YVVTRVNAGEDGEANTRDDLYTIQRVVDGGHIDPGDIVQLSYRYTDPNYHEVIRFTDPDDIQDFYGPAFDEAGNVQSEIT 269 (581) T ss_pred ccceeeccCcccceeeeeeeeeeEeecccccccceeEEEEEEEeecCCccceEEEecccccccceeeehhhcCccccchh Confidence 23222221 1222 33333334432 222 222 234444446666665555432110000 1 1 Q ss_pred HHHHhhhccccceEEEEecCCC---ccchhHHHHHHHHHhcccceEEEEcCCChHHHHHHHHHHHHHHHhcCCeEEE--- Q lcl|NC_019422. 59 DYIRLAFLGKPSKVIVEVINDS---VDSERSLDDALKALRENKFNYLAIPFISEEVDKTKIVNWIKTARREKEIYKA--- 132 (355) Q Consensus 59 ~~i~~a~~g~~~~v~l~~g~~g---~~~~~~y~~al~~le~~~fn~l~~p~~~d~~~~~~~~~~ik~~r~~g~~~~a--- 132 (355) .-.+.++++++. ..|.++.++ +++++||.+||++||+++|+.+++|+..++++|+++.+|+++|++.|++.++ T Consensus 270 ~~~~~~~t~~~~-~~l~~gvd~~g~tvt~~dy~~aL~ale~~~~~~ivvp~t~~~~i~a~l~ahv~~~s~~~~~~ra~ig 348 (581) T protein:vir:76 270 LCAQLAITNGAS-TILACAVDPEGDTVTMGDYQNALNKFRDEDEIAIIVAGTGAQPIQALVQQHVSAQSNNKYERRAILG 348 (581) T ss_pred hhhheeeccccc-eEEEeeecCCCCccchHHHHHHHHHHhcCCeEEEEEecCCChHHHHHHHHHHHHHHhccCCceEEEE Confidence 112233444444 556666665 5678999999999999999999999888889999999999999988766554 Q ss_pred EecCC------------CCcCcceeEEecCCeEecCCc------eecHHHHHHHHHHHhcCcccccccccccCCcccc-- Q lcl|NC_019422. 133 VLPNI------------SDANEKAIINFATTGIKVGEK------SYTTAEYTARLAGILAGISLSESCTYFILDEVTE-- 192 (355) Q Consensus 133 Vl~~~------------~~~d~egIinv~n~~i~~~~~------~~~~~~~~a~vAG~~Ag~~~~~S~T~~~~~~~~~-- 192 (355) |.++. ...|+++++++.++....++. .++++.++||+||++|++++++|+||++++++.. 
T Consensus 349 v~g~~~~~~~~~~~~~a~~~ns~Rvvlv~p~~~~~~g~~~~~~~~lp~~~~AA~vAG~~a~~~~~~slT~~~i~g~~~~~ 428 (581) T protein:vir:76 349 MDGSVTPVPSATRIANAQSIKDQRVALISPSSFVYYAPELNREVVLGGQFMAAAVAGKSVSAIAAMPLTRKVIRGFSGPA 428 (581) T ss_pred eeCCCCCchHHHHHHhhcccCCCcEEEEEcCceEeccccCCcceecchhhhhhhHHhhhhccccccCccccccccccccc Confidence 44432 246899999999987666543 5789999999999999999999999999988765 Q ss_pred -cCChhhHHHHHhCCeEEEEE--CCcEEEEecCccccccCCCCCchhhhhhhHhhHHHHHHHHHHHHh-hccccccCCCH Q lcl|NC_019422. 193 -IEPTENPDEAVEEGKLILIN--NNGIRIARGVNSLITLSKEDTEDLKKIKIVEAIDMIQDDILQTWN-ENYVGKVTNKY 268 (355) Q Consensus 193 -~~~~~e~~~ai~~G~lvl~~--dg~v~I~~~INSltt~~~~k~~~f~kirvvr~~D~i~~di~~~~~-~~yiGK~~N~~ 268 (355) .++++|+++++++|.++|.. +++|||+|||||+++ +..|++|+++|++|.+.+++|+.++ +.|||+ +|+. T Consensus 429 ~~~s~~e~e~ll~~Gv~~l~~~~~~~v~Iv~gItT~~s-----~~~~k~i~viR~~D~v~~~vr~~~~~~~fiG~-~n~~ 502 (581) T protein:vir:76 429 EVQRDGEKSRESSEGLMVIEKTPRNLVHVRHGVTTDPT-----SLHTREWNIIGQQDVMVYRIRDYLDADGLIGM-PIYD 502 (581) T ss_pred ccCCHHHHHHHHhCCeEEEEEecCCeEEEEEeeecCCC-----CCccceeeehhhhHHHHHHHHHHHhhhcCCCc-ccCh Confidence 46889999999999999974 457999999999875 3458999999999999999999986 679999 8999 Q ss_pred HHHHHHHHHHHHHHHHHHhcccccCCCCceeEeccccchhhhhccccccccccceeeeccCCCCEEEEEEEEEEEeeeeE Q lcl|NC_019422. 269 DNKILFLSAVNNYFKELQRDEVLDNSQEAYAQIDIEAHKKYLKEKGIDYSEMTEQQIKEANTGSYVFVEGNITITDAMED 348 (355) Q Consensus 269 ~gr~~~~~~i~~yl~~l~~~g~I~~~~~~~v~id~e~q~~~~~~~~~d~~~~~d~~v~~~~~~d~v~~~~~i~~vdamEk 348 (355) ++|.++++++++||.+|+++|+|+++.+..+ ..+ ...+|.+++.+.++|+++||+ T Consensus 503 ~~r~~ik~~i~~~L~~l~~~g~I~g~~~~~~-----------------------~~~--~~~~d~v~V~i~v~Pv~~ie~ 557 (581) T protein:vir:76 503 TTIVQVKASAEAALVWLVDNNIIRGYRNLKA-----------------------RQI--ERQPDVIEVRYEWRPAYPLNY 557 (581) T ss_pred HHHHHHHHHHHHHHHHHHhcCcccCccccee-----------------------eEE--ecCCCEEEEEEEEEecccceE Confidence 9999999999999999999999998753211 111 124688999999999999999 Q ss_pred EEEEEeC Q lcl|NC_019422. 
349 LKFKIYM 355 (355) Q Consensus 349 iy~tv~v 355 (355) ||+|+.. T Consensus 558 I~vt~~~ 564 (581) T protein:vir:76 558 IVVRYSI 564 (581) T ss_pred EEEEEEE Confidence 9999999 No 13 >protein:vir:107310 Length: 581 # NCBI annotation: gp123 # Family: family:all:1196 # MgeID: mge:1550 # MgeName: Catera # Cross-refs: genbank:acc:YP_656131;genbank:gi:109393333;genbank:GeneID:4156791 Probab=100.00 E-value=9e-57 Score=327.86 Aligned_cols=322 Identities=9% Similarity=0.056 Sum_probs=236.4 Q ss_pred CCCCce-----------------E------EEeee--eeeeeecCCCce---eEEEEE--ecCCccceeEEEeehhhhhh Q lcl|NC_019422. 1 MGLPSA-----------------I------IEFQR--RSRTVKFRSRRG---VVALIL--KDSTAIKKSYSIDFLTDINE 50 (355) Q Consensus 1 ~g~P~~-----------------~------i~f~~--~a~ta~~~~~rG---~v~iil--~d~~~~~~~~~~~~~~d~~~ 50 (355) -||..+ + -.+++ .+......|..+ +++-+- +.++....+..+...+++. T Consensus 173 ~~l~~~~~~~~~~~gsd~~~~~~~~~~~~~~~~~~D~~t~~~~~~g~~~~~~~v~~~~~~~~d~~~~~~v~~~~~~~~~- 251 (581) T protein:vir:10 173 IRVVNPNSGQVYVLGTDYVVTRVNAGEDGEANTRDDLYTIQRVVDGGHIDPGDIVQLSYRYTDPNYHEVIRFTDPDDIQ- 251 (581) T ss_pred ccccccccCcceeccccceeeecccCccccccccccceeeeeeecccccccceEEEEEEEeecCCcceeEEeecCcchh- Confidence 111111 1 11222 333333334332 222111 1222233455555544442 Q ss_pred hhhhH----------HHHHHHHhhhccccceEEEEecCCC---ccchhHHHHHHHHHhcccceEEEEcCCChHHHHHHHH Q lcl|NC_019422. 51 TEFTK----------ENYDYIRLAFLGKPSKVIVEVINDS---VDSERSLDDALKALRENKFNYLAIPFISEEVDKTKIV 117 (355) Q Consensus 51 ~~~~~----------~n~~~i~~a~~g~~~~v~l~~g~~g---~~~~~~y~~al~~le~~~fn~l~~p~~~d~~~~~~~~ 117 (355) +.+.. +.-...+.+++.+++ ..|.++.++ +++++||++||++||.++|+.+++|+..++++|+++. 
T Consensus 252 ~~~~~~~~~~g~~~~~~t~~~~~~~tn~~~-~~l~~gvd~~g~tvt~~dy~~Al~ale~~~~~~ivv~~t~~~~v~a~l~ 330 (581) T protein:vir:10 252 DFYGPAFDEAGNVQSEITLCAQLAITNGAS-TILACAVDPEGDTVTMGDYQNALNKFRDEDEIAIIVAGTGAQPIQALVQ 330 (581) T ss_pred hhhhhhhhccCccccchhhhheeeeecccc-eeEEeeccCCCCccchHHHHHHHHHHhcCCceEEEEeCCCCHHHHHHHH Confidence 22211 111122233444444 445566555 5678999999999999999999999888889999999 Q ss_pred HHHHHHHhcCCeEE---EEecCC------------CCcCcceeEEecCCeEecCCc------eecHHHHHHHHHHHhcCc Q lcl|NC_019422. 118 NWIKTARREKEIYK---AVLPNI------------SDANEKAIINFATTGIKVGEK------SYTTAEYTARLAGILAGI 176 (355) Q Consensus 118 ~~ik~~r~~g~~~~---aVl~~~------------~~~d~egIinv~n~~i~~~~~------~~~~~~~~a~vAG~~Ag~ 176 (355) +|+++|++.++..+ +|.++. ..+|+++++++.++.+..++. .++++++|||+||++|++ T Consensus 331 ahv~~~s~~~~~~ravigV~g~~~~~~~~~~~~~a~~~n~~Rvvlv~p~~~~~~g~~~~~~v~lp~y~~AA~vAGl~a~~ 410 (581) T protein:vir:10 331 QHVSAQSNNKYERRAILGMDGSVTPVPSATRIANAQSIKDQRVALISPSSFVYYAPELNREVVLGGQFMAAAVAGKSVSA 410 (581) T ss_pred HHHHHHHhccCCcEEEEEecCCCCCccHHHHHHhhccCCCceEEEEecCceeecCcccCceeccchhhHHHHHHHHhhcc Confidence 99999998765554 444432 246899999999988776654 489999999999999999 Q ss_pred ccccccccccCCcccc---cCChhhHHHHHhCCeEEEEE--CCcEEEEecCccccccCCCCCchhhhhhhHhhHHHHHHH Q lcl|NC_019422. 177 SLSESCTYFILDEVTE---IEPTENPDEAVEEGKLILIN--NNGIRIARGVNSLITLSKEDTEDLKKIKIVEAIDMIQDD 251 (355) Q Consensus 177 ~~~~S~T~~~~~~~~~---~~~~~e~~~ai~~G~lvl~~--dg~v~I~~~INSltt~~~~k~~~f~kirvvr~~D~i~~d 251 (355) ++++|+||++++++.. .++++|+++++++|.++|.. 
+++|||+|||||+++ +..|++|+++|++|.+.++ T Consensus 411 ~~~~slT~~~i~gi~~l~~~~s~~e~e~ll~~Gv~~l~~~~~~~v~Iv~gItT~~s-----~~~~~~i~~iR~~D~v~~~ 485 (581) T protein:vir:10 411 IAAMPLTRKVIRGFSGPAEVQRDGEKSRESSEGLMVIEKTPRNLVHVRHGVTTDPT-----SLHTREWNIIGQQDVMVYR 485 (581) T ss_pred ccccCcccccccccccccccCCHHHHHHHHhCCeEEEEEecCCeEEEEeeeecCCC-----CCcceeeeeehhhhHHHHH Confidence 9999999999987764 56889999999999999964 467999999999865 4569999999999999999 Q ss_pred HHHHHh-hccccccCCCHHHHHHHHHHHHHHHHHHHhcccccCCCCceeEeccccchhhhhccccccccccceeeeccCC Q lcl|NC_019422. 252 ILQTWN-ENYVGKVTNKYDNKILFLSAVNNYFKELQRDEVLDNSQEAYAQIDIEAHKKYLKEKGIDYSEMTEQQIKEANT 330 (355) Q Consensus 252 i~~~~~-~~yiGK~~N~~~gr~~~~~~i~~yl~~l~~~g~I~~~~~~~v~id~e~q~~~~~~~~~d~~~~~d~~v~~~~~ 330 (355) +|+.++ +.|||+ +|+..+|.++++++++||.+|+++|+|+++.+-. + ...-.. T Consensus 486 ir~~~~~~~fIG~-~n~~~~r~~ik~~i~~~L~~l~~~g~I~~~~~~~-----------------------~--~~~~~~ 539 (581) T protein:vir:10 486 IRDYLDADGLIGM-PIYDTTIVQVKASAEAALVWLVDNNIIRGYRNLK-----------------------A--RQIERQ 539 (581) T ss_pred HHHHhhhhcCCCc-ccCHHHHHHHHHHHHHHHHHHHhcCcccCCccce-----------------------e--eeeecC Confidence 999985 679999 8999999999999999999999999999874211 0 111234 Q ss_pred CCEEEEEEEEEEEeeeeEEEEEEeC Q lcl|NC_019422. 331 GSYVFVEGNITITDAMEDLKFKIYM 355 (355) Q Consensus 331 ~d~v~~~~~i~~vdamEkiy~tv~v 355 (355) ++.+++.+.++|+++||+||+++.. T Consensus 540 ~d~v~V~i~v~Pv~~i~~I~vti~~ 564 (581) T protein:vir:10 540 PDVIEVRYEWRPAYPLNYIVVRYSI 564 (581) T ss_pred CCEEEEEEEEEecccceEEEEEEEE Confidence 5889999999999999999999999 No 14 >protein:vir:102819 Length: 648 # NCBI annotation: tail sheath protein # Family: family:all:2449 # MgeID: mge:1610 # MgeName: YS40 # Cross-refs: genbank:acc:YP_874082;genbank:gi:118197689;genbank:GeneID:4496011 Probab=100.00 E-value=1.6e-38 Score=227.83 Aligned_cols=323 Identities=10% Similarity=0.070 Sum_probs=204.1 Q ss_pred CCCCceEEEeeeeeeeeecC------------CCceeEE--------EEEecCCccce-eEEEeeh-hhhhhhhhhH--- Q lcl|NC_019422. 
1 MGLPSAIIEFQRRSRTVKFR------------SRRGVVA--------LILKDSTAIKK-SYSIDFL-TDINETEFTK--- 55 (355) Q Consensus 1 ~g~P~~~i~f~~~a~ta~~~------------~~rG~v~--------iil~d~~~~~~-~~~~~~~-~d~~~~~~~~--- 55 (355) -+.=.+...+.. .++++. ...|.+. +.+........ .+...+. .+. .++.. T Consensus 228 ~~~~d~~~~~~~--~~a~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~~tp~~~~~~~~~~~~~~~~--~~~~~v~~ 303 (648) T protein:vir:10 228 TNPVDIPLGLFV--YEVLYGGLFGFTKSRLVKTSFGTVDDLLSNPLLFNLSATPFFDGSDYQDYTSLSDP--ANWFAKDA 303 (648) T ss_pred cccccccccccc--ccccchhhhcCCcchhhhhhhccccccccccceecccccccccccceeeeeccccc--cceeeeec Confidence 000000000000 001110 1112111 00000000000 0000000 010 00000 Q ss_pred -HHHHHHHhhhcc---ccceEEEEecCCCcc-----------chhHHHHHHHHHhcccceEEEEc-------------CC Q lcl|NC_019422. 56 -ENYDYIRLAFLG---KPSKVIVEVINDSVD-----------SERSLDDALKALRENKFNYLAIP-------------FI 107 (355) Q Consensus 56 -~n~~~i~~a~~g---~~~~v~l~~g~~g~~-----------~~~~y~~al~~le~~~fn~l~~p-------------~~ 107 (355) .-+..+...-.. .+.-+.|.||++|++ +..+|+++|+.|++++-.|+. | .+ T Consensus 304 ~~~~~l~~~~~~p~~~~~~~t~L~GGtdG~~p~s~~~~~~~~~~~d~~d~l~~~~~~~~~~iv-p~~~~~~~~~~~~~lt 382 (648) T protein:vir:10 304 YTINHLVDTTINPHILATRIFSLSGGTNGDDGTGYYQTAVSNYINIWSQGLATLEEEEVNFVI-PAYKFTNVTQLNDRLT 382 (648) T ss_pred cchhhcccccccCcccccccceecccccCCCcccccccccccchhhHHHHhhhccCCCceEEE-eecccccccccccccC Confidence 001111111110 111235888888876 557899999999999866654 4 34 Q ss_pred ChHHHHHHHHHHHHHHHh-----cCCeEEEEecCCCCc---Ccc-----eeEEec-----------------CCeEec-C Q lcl|NC_019422. 108 SEEVDKTKIVNWIKTARR-----EKEIYKAVLPNISDA---NEK-----AIINFA-----------------TTGIKV-G 156 (355) Q Consensus 108 ~d~~~~~~~~~~ik~~r~-----~g~~~~aVl~~~~~~---d~e-----gIinv~-----------------n~~i~~-~ 156 (355) ...++|+++.+|+++|+- ++....+.+++.+.. ++| +++|.. .+.+.. 
+ T Consensus 383 ~~q~i~a~a~shv~~~s~~~~~~~r~~~~~~vg~~~~es~~~se~~~~~~~~~~~~a~~~~~d~~~~~~~~~~~~~~~~~ 462 (648) T protein:vir:10 383 IFKGIASTFLSHVQTMSQVNRRKARVGVFGLPAPSPNESVTASEYLYNRNILNTISAMFGGTDRAQAVVFPFYSNVFNDE 462 (648) T ss_pred CccchHHHHHHHHHHhhhccccccccCeEEEeCCCCchhHHHHHHHhhhhcccccceeeeecCCceEEeecccceeECCC Confidence 557899999999999963 345577777765322 112 222221 111111 2 Q ss_pred Cce--ecHHHHHHHHHHHhcCcccccccccccCCcc--cc--cCChhhHHHHHhCCeEEEEEC---CcEEEEecCccccc Q lcl|NC_019422. 157 EKS--YTTAEYTARLAGILAGISLSESCTYFILDEV--TE--IEPTENPDEAVEEGKLILINN---NGIRIARGVNSLIT 227 (355) Q Consensus 157 ~~~--~~~~~~~a~vAG~~Ag~~~~~S~T~~~~~~~--~~--~~~~~e~~~ai~~G~lvl~~d---g~v~I~~~INSltt 227 (355) |.. ++|+..++.|||++||+++.+|+||+++.++ .. .++.+|+++++++|.+++.+. +.+...|-+..+|| T Consensus 463 G~~~~~p~~~~Aa~VAGl~a~l~~~~s~T~k~i~~~~id~~~~~t~~qld~L~~~Gv~~ie~~~~~~~~~~~rvv~gITT 542 (648) T protein:vir:10 463 GKVELLGGEFFASYVAGMHANREPQDSITFLPISGIGAEPLYNWTYTQKDDLISNRVLFVEKVKTSFGGIVYRIHHNPTT 542 (648) T ss_pred CcEEecchhhHHHHHHhhhhccccccCcccceeeccccccccCCCHHHHHHHhcCCcEEEEEecCCcceeeEEEecccee Confidence 332 7999999999999999999999999998755 33 567899999999999999752 23333333444466 Q ss_pred cCCCCCchhhhhhhHhhHHHHHHHHHHHHhhccccccCCCHHHHHHHHHHHHHHHHHHHhcccccCCCCceeEeccccch Q lcl|NC_019422. 
228 LSKEDTEDLKKIKIVEAIDMIQDDILQTWNENYVGKVTNKYDNKILFLSAVNNYFKELQRDEVLDNSQEAYAQIDIEAHK 307 (355) Q Consensus 228 ~~~~k~~~f~kirvvr~~D~i~~di~~~~~~~yiGK~~N~~~gr~~~~~~i~~yl~~l~~~g~I~~~~~~~v~id~e~q~ 307 (355) .+...+..|++|+++|+.|.+.+++|+.+.+.|||+ +|+...|.++++.|..||.++.+++.|.++.+.++..+ T Consensus 543 ~~~~~~~~~~eisv~ri~D~l~~~vr~~l~~~fIG~-~n~~~~~~~ik~~i~~~L~~~~~~~~I~~y~~~~v~~~----- 616 (648) T protein:vir:10 543 WLGPVTQGFQEFVLRRIDDFLQSYVYKNLQEQFIGR-KSYGRKTENDIKVYTEALLSNLVGKQIVAYKDVKVTSN----- 616 (648) T ss_pred ecCCCCcceeeeeeeehhhHHHHHHHHHHhhhcCcc-cccHHHHHHHHHHHHHHHhhHhhcCcccCcccceEEEE----- Confidence 667788999999999999999999999999999999 58888999999999999999999999998765443332 Q ss_pred hhhhccccccccccceeeeccCCCCEEEEEEEEEEEeeeeEEEEEEeC Q lcl|NC_019422. 308 KYLKEKGIDYSEMTEQQIKEANTGSYVFVEGNITITDAMEDLKFKIYM 355 (355) Q Consensus 308 ~~~~~~~~d~~~~~d~~v~~~~~~d~v~~~~~i~~vdamEkiy~tv~v 355 (355) ..++-+++...++|+.+|++||.++.+ T Consensus 617 ---------------------~~~~vv~V~~~v~Pv~~i~~I~vti~i 643 (648) T protein:vir:10 617 ---------------------EDKTVYYVEFFYQPVTEIKFILVTMKV 643 (648) T ss_pred ---------------------ecCCEEEEEEEEEecceeeEEEEEEEE Confidence 234778999999999999999999999 No 15 >protein:vir:79798 Length: 717 # NCBI annotation: tail sheath subunit # Family: family:all:12069 # MgeID: mge:1874 # MgeName: 0305phi8-36 # Cross-refs: genbank:acc:YP_001429630;genbank:gi:156564120;genbank:GeneID:5525563 Probab=99.93 E-value=2e-27 Score=166.91 Aligned_cols=321 Identities=10% Similarity=0.094 Sum_probs=168.6 Q ss_pred CCC------CceEEEeeeeeeeeecCCCcee------------EEEEEec--CCccceeEEEeehhhhhhhhhhHHHHHH Q lcl|NC_019422. 1 MGL------PSAIIEFQRRSRTVKFRSRRGV------------VALILKD--STAIKKSYSIDFLTDINETEFTKENYDY 60 (355) Q Consensus 1 ~g~------P~~~i~f~~~a~ta~~~~~rG~------------v~iil~d--~~~~~~~~~~~~~~d~~~~~~~~~n~~~ 60 (355) +-+ +....+++ .+.--..+|-..- ..+.++. +.+..++....+..+.... 
+......- T Consensus 311 n~~~~~v~~~D~~~~~~-~t~~~~~~g~~~~~pl~~ts~dy~~~~~~vdgI~~~~~~~V~~~g~~s~a~a~-~~~g~~s~ 388 (717) T protein:vir:79 311 NDIMRKVESKDGAVTVT-ITKPESKRGMISEDPLVFKSGDYTNFKMLVDAINNHPFNNVVRARTKPEFEAT-FTSTLQAA 388 (717) T ss_pred eeeeeEEecCCceEEEE-EecccccCcceeccccccccCceeeeeeeecccccCchhheeeeeccccccee-eeecccCc Confidence 110 11111110 0000000000000 0000000 0011111111111111000 00000000 Q ss_pred HHhhhccccceEEE-----E---ecCCCccchhHHHHHHHHHhcccceEEEEcCCChH--------HHHHHHHHHHHHHH Q lcl|NC_019422. 61 IRLAFLGKPSKVIV-----E---VINDSVDSERSLDDALKALRENKFNYLAIPFISEE--------VDKTKIVNWIKTAR 124 (355) Q Consensus 61 i~~a~~g~~~~v~l-----~---~g~~g~~~~~~y~~al~~le~~~fn~l~~p~~~d~--------~~~~~~~~~ik~~r 124 (355) -...|.|+..+..+ + |+.......-.=..++..||++++|++++|+...+ ..+..+.+|+..+. T Consensus 389 d~a~f~Gg~dgl~~~~ee~Y~~lGgk~~d~g~lt~~aays~LE~~dVDlVil~ga~adtt~ga~~d~va~alad~caalS 468 (717) T protein:vir:79 389 ADAKFSGGKDELSLDKEEMYKRLGGEKNEEGFVTKQGAYQYLENYEVDYVIPLGVHADTKLIGKYDDFAYQLALACAVMS 468 (717) T ss_pred hhhccCCCccccccchhhhhccccccccccccccchhhhhhcCcceeEEEEecCccccccccchhhhHHHHHHHHHHHhh Confidence 00112222111111 0 11100000000013555666666666666543211 23445555555444 Q ss_pred hcCCeEEEEecCCCC--------------------------------c-------CcceeEE-e-cCCeEecC--Cceec Q lcl|NC_019422. 125 REKEIYKAVLPNISD--------------------------------A-------NEKAIIN-F-ATTGIKVG--EKSYT 161 (355) Q Consensus 125 ~~g~~~~aVl~~~~~--------------------------------~-------d~egIin-v-~n~~i~~~--~~~~~ 161 (355) ...+....|++-... . |.-...+ + ....+..+ +..+ T Consensus 469 al~r~ai~VI~l~sp~D~~~AtVe~~~~kLs~~Aaa~~~~d~~~a~a~~~~~~~idis~y~~vv~~~~~iv~~~~~~~~- 547 (717) T protein:vir:79 469 HYNSVTIGIIPTTTPSDISLAGVEEHVKKLENYANEFYMRDRFGNIIFDADRNKIDLGQFIEVVAGPDFIVRNTRLGQM- 547 (717) T ss_pred hccccceeeeccccccccchhhHHHHHHHHHhhhhhhhhhcchhccccccccccccccceeeeeecceeEEEcCCCcee- Confidence 322222222221000 0 0000000 0 01111101 1111 Q ss_pred HHHHHHHHHHHhcCcccccccccccCCccccc---CChhhHHHHHhCCeEEEEE--CCcEEEEecCccccccCCCCCchh Q lcl|NC_019422. 
162 TAEYTARLAGILAGISLSESCTYFILDEVTEI---EPTENPDEAVEEGKLILIN--NNGIRIARGVNSLITLSKEDTEDL 236 (355) Q Consensus 162 ~~~~~a~vAG~~Ag~~~~~S~T~~~~~~~~~~---~~~~e~~~ai~~G~lvl~~--dg~v~I~~~INSltt~~~~k~~~f 236 (355) ....++++||+.|++++.+|++|+++.|+... ++++|++.+.++|..+|.+ ++++++..++|+ ...+.+| T Consensus 548 ~~p~AG~vAGldA~rGVwkSPANk~I~GVvgLa~~lT~sE~d~Ln~aGIntIr~~~GrGirVWGaRTt-----asd~sdW 622 (717) T protein:vir:79 548 ASTPDASYIGMVSQLKTQSAPTNKPLPSVTALRYTYSANQLNRLTKARFATFKYKQDGSIGVVDAPTS-----AHAGSDY 622 (717) T ss_pred ecCHHHHHHHHHhcCCcccccccceecccccCcccCCHHHHHHHhhCCeEEEEEeCCceEEEEeeeec-----CCCCccc Confidence 12347999999999999999999999877653 6889999999999999864 448999999986 3445689 Q ss_pred hhhhhHhhHHHHHHHHHHHHhhccccccCCCHHHHHHHHHHHHHHHHHHHhcccccCCCCceeEeccccchhhhhccccc Q lcl|NC_019422. 237 KKIKIVEAIDMIQDDILQTWNENYVGKVTNKYDNKILFLSAVNNYFKELQRDEVLDNSQEAYAQIDIEAHKKYLKEKGID 316 (355) Q Consensus 237 ~kirvvr~~D~i~~di~~~~~~~yiGK~~N~~~gr~~~~~~i~~yl~~l~~~g~I~~~~~~~v~id~e~q~~~~~~~~~d 316 (355) ++|+++|++|.|.+.|++.++. |+|+ ||+...|..+++.|..||.+|+++|+|..|. +++... T Consensus 623 ryInVRRl~D~Ie~sIr~al~~-yVgE-PNd~~tr~~Ik~sI~afL~~L~r~GAI~Gyk---vdvtnT------------ 685 (717) T protein:vir:79 623 TRLSTARIVKEAVNAVREVADP-FIGE-PNDTGNRNALTAAVDKRLSKMIENKALLGFD---FRLVVT------------ 685 (717) T ss_pred ceeehhhhHHHHHHHHHHHHHH-hccc-cCCHHHHHHHHHHHHHHHHHHHhcCceecce---eeEecC------------ Confidence 9999999999999999999885 9999 7999999999999999999999999999863 222111 Q ss_pred cccccceeeeccCCCCEEEEEEEEEEEeeeeEEEEEEeC Q lcl|NC_019422. 
317 YSEMTEQQIKEANTGSYVFVEGNITITDAMEDLKFKIYM 355 (355) Q Consensus 317 ~~~~~d~~v~~~~~~d~v~~~~~i~~vdamEkiy~tv~v 355 (355) ...+ ....+++.+.+.|+..||+||+++++ T Consensus 686 -----~~di----~~G~l~V~I~vaPv~PaEfI~ititI 715 (717) T protein:vir:79 686 -----PQQE----LLGEGSIELSLEAPNELRRLTTIVSL 715 (717) T ss_pred -----hhHh----hCCEEEEEEEEEecCcccEEEEEEEE Confidence 1111 12368999999999999999999999 No 16 >protein:vir:98824 Length: 774 # NCBI annotation: putative phage tail sheath protein # Family: family:all:661 # MgeID: mge:1530 # MgeName: Ma-LMM01 # Cross-refs: genbank:acc:YP_851105;genbank:gi:117530262;genbank:GeneID:4484488 Probab=99.77 E-value=1e-20 Score=130.15 Aligned_cols=322 Identities=14% Similarity=0.083 Sum_probs=187.9 Q ss_pred CCCCc--eEEEeee--------------------------eeeeeecCCCceeEEEE-----EecC--Ccc----ceeEE Q lcl|NC_019422. 1 MGLPS--AIIEFQR--------------------------RSRTVKFRSRRGVVALI-----LKDS--TAI----KKSYS 41 (355) Q Consensus 1 ~g~P~--~~i~f~~--------------------------~a~ta~~~~~rG~v~ii-----l~d~--~~~----~~~~~ 41 (355) =|..+ +.+.+.. .+.+.......|.+.-. +.+. ... ...-. T Consensus 369 pGawGN~ItV~I~~~t~~~~~l~v~~~~~s~f~~~~a~e~~tv~~~~~~~~~~v~e~~dn~~i~~~~~~~~~~~in~vs~ 448 (774) T protein:vir:98 369 EGNWGNQVTVSIYPVNNSEFRLNVQDLNGSAFNPPLADEVYTVKLGDTNESGELNALLDSKFIRGFFLPKSIDSINYDAA 448 (774) T ss_pred cCcCCCceEEEEEecCCceeEEEEEecCCccccccccceeEEEecccccccceeeeeeceeeEeeccccccccccccccc Confidence 11111 1111110 01111111111100000 0000 000 00000 Q ss_pred EeehhhhhhhhhhHHHHHHHH---hh-hccc--cceEEEEecCCCcc-chhHHHHHHHHHhcccceEEEEcCCChHHHHH Q lcl|NC_019422. 42 IDFLTDINETEFTKENYDYIR---LA-FLGK--PSKVIVEVINDSVD-SERSLDDALKALRENKFNYLAIPFISEEVDKT 114 (355) Q Consensus 42 ~~~~~d~~~~~~~~~n~~~i~---~a-~~g~--~~~v~l~~g~~g~~-~~~~y~~al~~le~~~fn~l~~p~~~d~~~~~ 114 (355) +...+.+.... ......-+. .. ..+. ...+.+.+|.++.. +..+|..+++.++...++.|+.+. .+..++. 
T Consensus 449 lv~~~~~~~a~-~d~~~~~~~~~~~~~~~~~~~~v~v~lagG~Dg~~tt~~~igg~~~~~~~tgi~aLl~a~-~~~~V~~ 526 (774) T protein:vir:98 449 LVRQSPLRLAP-PDESETDVENPAHVDFYGPNVLVDVTLENGYDGPPVTNDDYVSIIRTLENQPVHILLVGT-TNVGVQQ 526 (774) T ss_pred ccccchhcccc-cccccccccccccccccCCcceEEEeecCCCCcccccchheecccccccccceeEEEcCc-cchhhHH Confidence 00000000000 000000000 00 0001 11233556666654 346799999999999999998775 4667899 Q ss_pred HHHHHHHHHHhcCCeEEEEecCC------------CCcCcceeEEecCCeEecC---Cc--eecHHHHHHHHHHHhcCcc Q lcl|NC_019422. 115 KIVNWIKTARREKEIYKAVLPNI------------SDANEKAIINFATTGIKVG---EK--SYTTAEYTARLAGILAGIS 177 (355) Q Consensus 115 ~~~~~ik~~r~~g~~~~aVl~~~------------~~~d~egIinv~n~~i~~~---~~--~~~~~~~~a~vAG~~Ag~~ 177 (355) .+.++++++...++...+++... ..+|++..+.+.+.....+ +. .+++ ++++||++|.++ T Consensus 527 aii~~~e~~~~~~~~r~avid~p~g~t~~~Ai~~r~~f~S~~aal~~Pwvkv~D~~~g~~~~vPp---Sg~vAGl~ArtD 603 (774) T protein:vir:98 527 ALITEAERASDSDGLRIAVLAAPPRTTPTLAASVTRGFNSTRAVMVAGWFTYAGQPNSSRYGVPG---AAVYAGKLAAID 603 (774) T ss_pred HHHHHHHHhhhcccceEEEEECCCCCCHHHHHHHHhccCCceEEEEeCcEEEeccCCCceeecCh---hHHHHHHHHhcC Confidence 99999999988777666777432 1345667777766654332 21 2333 799999999999 Q ss_pred cccccccccCCccccc---------CChhhHHHHHhCCeEEE---EECCcEEEEecCccccccCCCCCchhhhhhhHhhH Q lcl|NC_019422. 178 LSESCTYFILDEVTEI---------EPTENPDEAVEEGKLIL---INNNGIRIARGVNSLITLSKEDTEDLKKIKIVEAI 245 (355) Q Consensus 178 ~~~S~T~~~~~~~~~~---------~~~~e~~~ai~~G~lvl---~~dg~v~I~~~INSltt~~~~k~~~f~kirvvr~~ 245 (355) ..+|+.|+++.|+... .+..|.+.+-..|--++ +..+++++- |-.|+. 
.+..|+.|.++|++ T Consensus 604 v~kSPANk~I~Givg~ai~~~l~~~~t~ae~d~Ln~~gIN~i~itt~g~G~rvW-G~RTls-----sDp~wr~InVRRlf 677 (774) T protein:vir:98 604 FFVSPAARSLVGPLFNIIESDTDNYTSRSNQDIYSAARLEVLSLDTVDRTYRFA-SGVTLS-----TDPAWERIYLRRVH 677 (774) T ss_pred cccccCCceeecceeccccccccccccchhhhhhcccccceeEEEEcCCcEEEE-cccccC-----CCcccceEeehhhH Confidence 9999999998876521 23445555555554332 223455443 233432 24679999999999 Q ss_pred HHHHHHHHHHHhhccccccCCCHHHHHHHHHHHHHHHHHHHhcccccCCCCceeEeccccchhhhhccccccccccceee Q lcl|NC_019422. 246 DMIQDDILQTWNENYVGKVTNKYDNKILFLSAVNNYFKELQRDEVLDNSQEAYAQIDIEAHKKYLKEKGIDYSEMTEQQI 325 (355) Q Consensus 246 D~i~~di~~~~~~~yiGK~~N~~~gr~~~~~~i~~yl~~l~~~g~I~~~~~~~v~id~e~q~~~~~~~~~d~~~~~d~~v 325 (355) |.|.+.|++.... |+|+ +|+...|..++..++.||.+|.++|+|..+.. +..|.+- +....+ T Consensus 678 d~Ie~SI~~~~~~-~VfE-PNd~~l~~~I~~sI~~fL~~L~~~GaL~G~~~--V~~D~et--------------Nt~~dI 739 (774) T protein:vir:98 678 DVVRQGAHAILRN-YVAM-PNSRLVRNQIAAALNAFMGELKRNGNIVSFRP--AIIDGSN--------------NSTAAY 739 (774) T ss_pred HHHHHHHHHHHHH-hccC-CCCHHHHHHHHHHHHHHHHHHHhCCceecceE--EEEcCCC--------------CCHHHh Confidence 9999999998876 9999 89999999999999999999999999987642 2233221 111111 Q ss_pred eccCCCCEEEEEEEEEEEeeeeEEEEEEeC Q lcl|NC_019422. 326 KEANTGSYVFVEGNITITDAMEDLKFKIYM 355 (355) Q Consensus 326 ~~~~~~d~v~~~~~i~~vdamEkiy~tv~v 355 (355) ....+++.+.+.|+..+|.|++++.- T Consensus 740 ----~~G~l~i~I~vaP~~PAEfIilri~q 765 (774) T protein:vir:98 740 ----FSRELYVSLQFQPLYSADYIYVTISR 765 (774) T ss_pred ----hCCEEEEEEEEEecCCcceEEEEEEE Confidence 12478999999999999999999988 No 17 >protein:vir:1845 Length: 392 # NCBI annotation: tail sheath protein FI # Family: family:all:115 # MgeID: mge:324 # MgeName: 186 # Cross-refs: genbank:acc:NP_052269;genbank:gi:9634076;genbank:GeneID:1262448 Probab=99.72 E-value=2.3e-18 Score=117.31 Aligned_cols=317 Identities=15% Similarity=0.102 Sum_probs=198.1 Q ss_pred CC--CCceEEEeeeeeeeeecCCCceeEEEEEecCCccc------eeEEEeehhhhhhhhhhH-HHHHHHHhhhcc-ccc Q lcl|NC_019422. 
1 MG--LPSAIIEFQRRSRTVKFRSRRGVVALILKDSTAIK------KSYSIDFLTDINETEFTK-ENYDYIRLAFLG-KPS 70 (355) Q Consensus 1 ~g--~P~~~i~f~~~a~ta~~~~~rG~v~iil~d~~~~~------~~~~~~~~~d~~~~~~~~-~n~~~i~~a~~g-~~~ 70 (355) |. +|+++++=..-+..++....-+++.++..-..... ....+.+.++........ .-...+...|.. ++. T Consensus 1 m~~~~~Gv~v~e~~~g~~~i~~~~tav~g~vgta~~~~~~~~~~~~p~~its~~~~~~~~g~~gtl~~al~~~~~ngg~~ 80 (392) T protein:vir:18 1 MSDFHHGTKVIEINDGTRVISTVATAIVGMVWTASDADAETFPLNEPVLITNVQSAIAKAGKKGTLSASLQAIADQSKPV 80 (392) T ss_pred CCCCCCCeEEEEcCCCceeeeccCcceeEEEEeccCCCCcccccccceEeechHHHHhhcCCCcchHHHHHHhhcccCce Confidence 55 48888887777777776666666666654321111 112233333332211100 011223333332 211 Q ss_pred eEEE------------------EecCCCccchhHHHHHHHHHhc---ccceEEEEcCCChHHHHHHHHHHHHHHHhcCCe Q lcl|NC_019422. 71 KVIV------------------EVINDSVDSERSLDDALKALRE---NKFNYLAIPFISEEVDKTKIVNWIKTARREKEI 129 (355) Q Consensus 71 ~v~l------------------~~g~~g~~~~~~y~~al~~le~---~~fn~l~~p~~~d~~~~~~~~~~ik~~r~~g~~ 129 (355) .+.+ .|+.+.+. ...-..+|...+. ..++.+.+|+.++..+++.+.++++++| T Consensus 81 ~~vv~v~~~~~~~~~~~t~~dliG~~~~~~-~~tg~~al~~~~~~~~~~p~il~ap~~~~~~v~~~l~~~~~~~~----- 154 (392) T protein:vir:18 81 TVVVRVAEGTGDDAEAQTTSNIIGGTDENG-KYTGIKALLTAEAVTGVKPRILGVPGLDTQEVATALASVCISLR----- 154 (392) T ss_pred EEEecccccccccccccchhhheecccccc-hhhhHHHHHhhhhhhceeehhcccCccchHHHHHHHHHHHhhcC----- Confidence 1111 11111110 0111123333332 3567888998877778888888887765 Q ss_pred EEEEecC------------CCCcCcceeEEecCCeEecC----Cc-eecHHHHHHHHHHHhcCccc----ccccccccCC Q lcl|NC_019422. 130 YKAVLPN------------ISDANEKAIINFATTGIKVG----EK-SYTTAEYTARLAGILAGISL----SESCTYFILD 188 (355) Q Consensus 130 ~~aVl~~------------~~~~d~egIinv~n~~i~~~----~~-~~~~~~~~a~vAG~~Ag~~~----~~S~T~~~~~ 188 (355) ..+++.. ....++...+.+.+.....+ .. .++ .++++||+.|.+.. -+|+.|+++. 
T Consensus 155 ~~~~~d~~~~~~~~~a~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p---~s~~~AG~~a~~d~~~g~~~spaN~~l~ 231 (392) T protein:vir:18 155 AFGYVSAWGCKTISEAMAYRENFSQRELMVIWPDFLAWDTTANATATAY---ATARALGLRAYIDQTIGWHKTLSNVGVQ 231 (392) T ss_pred cEEEEecCCCCCHHHHHHHHhhccCceEEEEeCceeeecccCCceEEec---hHHHHHHHHHhhhccCCceEccCCceee Confidence 2333322 11234556666665544322 11 223 38999999998874 4589999888 Q ss_pred ccccc---------CChhhHHHHHhCCeEEEEECCcEEEEecCccccccCCCCCchhhhhhhHhhHHHHHHHHHHHHhhc Q lcl|NC_019422. 189 EVTEI---------EPTENPDEAVEEGKLILINNNGIRIARGVNSLITLSKEDTEDLKKIKIVEAIDMIQDDILQTWNEN 259 (355) Q Consensus 189 ~~~~~---------~~~~e~~~ai~~G~lvl~~dg~v~I~~~INSltt~~~~k~~~f~kirvvr~~D~i~~di~~~~~~~ 259 (355) |+... ++..|.+.+-.+|.-.++++++.++-- -.|+. .+..|+.|.++|++|.|.+.|++.+.. T Consensus 232 gi~~~~~~~~~~~~~~~~~~~~Ln~~gI~t~~~~~G~~~wG-~rT~~-----~d~~~~~i~~rR~~~~i~~~i~~~~~~- 304 (392) T protein:vir:18 232 GVTGISASVFWDLQASGTDADLLNEAGVTTLVRKDGFRFWG-NRTCS-----DDPLFLFENYTRTAQVLADTMAEAHMW- 304 (392) T ss_pred ceeecceecccccCCCcchhhhhhhcCceEEEcCCCEEEEc-ccccC-----CCcccceeehhhHHHHHHHHHHHHHHH- Confidence 77542 234577778899999988877777654 44442 245799999999999999999999876 Q ss_pred cccccCCCHHHHHHHHHHHHHHHHHHHhcccccCCCCceeEeccccchhhhhccccccccccceeeeccCCCCEEEEEEE Q lcl|NC_019422. 260 YVGKVTNKYDNKILFLSAVNNYFKELQRDEVLDNSQEAYAQIDIEAHKKYLKEKGIDYSEMTEQQIKEANTGSYVFVEGN 339 (355) Q Consensus 260 yiGK~~N~~~gr~~~~~~i~~yl~~l~~~g~I~~~~~~~v~id~e~q~~~~~~~~~d~~~~~d~~v~~~~~~d~v~~~~~ 339 (355) |+++ +|+..-|..++..++.||.+|.++|+|..+ .+..|.+ .+....+.+ -.+++.+. T Consensus 305 ~v~e-~n~~~~~~~i~~~i~~~L~~l~~~gal~g~---~v~~d~~--------------~nt~~~i~~----G~~~~~v~ 362 (392) T protein:vir:18 305 AVDK-PITASLIRDIVDGINAKFRELKSNGYIVDG---ECWFDEE--------------SNDKETLKA----GKLYIDYD 362 (392) T ss_pred hccC-CCCHHHHHHHHHHHHHHHHHHHhcCcccce---EEEEecC--------------CCCHHHhhC----CeEEEEEE Confidence 9999 899999999999999999999999999886 3444432 222222322 34899999 Q ss_pred EEEEeeeeEEEEEEeC Q lcl|NC_019422. 
340 ITITDAMEDLKFKIYM 355 (355) Q Consensus 340 i~~vdamEkiy~tv~v 355 (355) +.|+-.||.|.++++. T Consensus 363 ~~p~~p~e~I~~~~~~ 378 (392) T protein:vir:18 363 YTPVPPLESLTLRQRI 378 (392) T ss_pred EEecCCcceEEEEEEE Confidence 9999999999999888 No 18 >protein:vir:108052 Length: 660 # NCBI annotation: gp18 tail sheath protein # Family: family:all:661 # MgeID: mge:2002 # MgeName: JS98 # Cross-refs: genbank:acc:YP_001595294;genbank:gi:161622600;genbank:GeneID:5783771 Probab=99.71 E-value=3.3e-17 Score=110.96 Aligned_cols=320 Identities=11% Similarity=0.097 Sum_probs=187.2 Q ss_pred CCCCceEEEeeeeee----ee----ecCCCce-eEEEEEecCCccceeEEEeehhhhhh-hh--------hhHHHHHHHH Q lcl|NC_019422. 1 MGLPSAIIEFQRRSR----TV----KFRSRRG-VVALILKDSTAIKKSYSIDFLTDINE-TE--------FTKENYDYIR 62 (355) Q Consensus 1 ~g~P~~~i~f~~~a~----ta----~~~~~rG-~v~iil~d~~~~~~~~~~~~~~d~~~-~~--------~~~~n~~~i~ 62 (355) .+=|.+.+.....+. ++ -..+... ...++++.+.....++.+........ .. +......++. T Consensus 245 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~ 324 (660) T protein:vir:10 245 EAGSSKMLDVYPGGGTRASIAKAVFNYGPQTDDQYAIIVRRDGAIVESVVLSTKEGEKDVYGNNIYLDDYFAKGTSNYIY 324 (660) T ss_pred CCcceeEEeeeeccceeeEEeeeecccccccccccccccccCCcccceeeeeccccccccccceeeeehhhcCCCccEEE Confidence 322222222211110 01 1111111 12333333333333333322111000 00 0000011111 Q ss_pred hhhcccc----ceEEEEecCCCc--cchhHHHHHHHHHhc---ccceEEEEcCCC------hHHHHHHHHHHHHHHHhcC Q lcl|NC_019422. 63 LAFLGKP----SKVIVEVINDSV--DSERSLDDALKALRE---NKFNYLAIPFIS------EEVDKTKIVNWIKTARREK 127 (355) Q Consensus 63 ~a~~g~~----~~v~l~~g~~g~--~~~~~y~~al~~le~---~~fn~l~~p~~~------d~~~~~~~~~~ik~~r~~g 127 (355) ....+.| ..+.+.++.++. ++..++..++++|+. ..++.+++|+.. 
..++++.+.++++++|+ T Consensus 325 ~~~~~~~~~~~~~~~l~gg~~~~~~~t~~~~~~~~~~~~~~~~~~~~~l~~p~~~~~~~~~~~~v~~al~~~~~~~~~-- 402 (660) T protein:vir:10 325 ATSLNWPKGFSGIINLSGGISANDKVTAGDLMQGWDLFADREALHINLLIAGAVAGEGDEVASTVQKHVVSIADERQD-- 402 (660) T ss_pred EEeccCCCCcccceeeeccccCccccccchhhhhhhhhhhhhhcccceEEEcCcCCCchhhhHHHHHHHHHHHHhhCC-- Confidence 1111111 223466666543 345677777777754 468999998643 23467778888887753 Q ss_pred CeEEEEecCC------------------------------CCcCcceeEEecCCeEecC---C--ceecHHHHHHHHHHH Q lcl|NC_019422. 128 EIYKAVLPNI------------------------------SDANEKAIINFATTGIKVG---E--KSYTTAEYTARLAGI 172 (355) Q Consensus 128 ~~~~aVl~~~------------------------------~~~d~egIinv~n~~i~~~---~--~~~~~~~~~a~vAG~ 172 (355) +.+++.-. ...|+...+.+.+-....+ + ..+++ ++.+||+ T Consensus 403 --~~aiid~p~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~---sg~~AGl 477 (660) T protein:vir:10 403 --CLAFISPPKGLLVNVPLTRAVDNLIDWRTGAGTFDANNMNISTTYAAIDGNYKYQYDKYNDVNRWVPL---AADLAGL 477 (660) T ss_pred --EEEEEecCcccccccccccCHHHHHHHHhhcccccccccccCcceEEEEcCceEEecccCCceeEech---hHHHHHH Confidence 44554210 0123344443433322211 1 12344 7999999 Q ss_pred hcCccccc----ccccccCCcc---cc---cCChhhHHHHHhCCeEEEE--EC-CcEEEEecCccccccCCCCCchhhhh Q lcl|NC_019422. 173 LAGISLSE----SCTYFILDEV---TE---IEPTENPDEAVEEGKLILI--NN-NGIRIARGVNSLITLSKEDTEDLKKI 239 (355) Q Consensus 173 ~Ag~~~~~----S~T~~~~~~~---~~---~~~~~e~~~ai~~G~lvl~--~d-g~v~I~~~INSltt~~~~k~~~f~ki 239 (355) .|..+.++ |+.++++.++ .+ .+++.|.+.+..+|.-++. .+ +++++ -|-.|+. 
..+.+|+.| T Consensus 478 ~Ar~D~~~g~~~sPan~~~~~i~g~~~~~~~~~~~e~~~Ln~~gIn~i~~~~~~~G~~~-wG~rT~~----~~~s~~~~i 552 (660) T protein:vir:10 478 CARTDDVSQPWMSPAGYNRGQILNVLKLAIEPRQAQRDRMYQEAINPVVGFAGGDGFVL-FGDKTAT----KVPSPMDHI 552 (660) T ss_pred HHHhhccCCcEEccCCeeeceeeccceeeecCChhhHHhHhhCCceEEEEeeCCCcEEE-EcccccC----CCCcccceE Confidence 99887554 8888865543 22 2577899999999976654 34 35554 6777752 334689999 Q ss_pred hhHhhHHHHHHHHHHHHhhccccccCCCHHHHHHHHHHHHHHHHHHHhcccccCCCCceeEeccccchhhhhcccccccc Q lcl|NC_019422. 240 KIVEAIDMIQDDILQTWNENYVGKVTNKYDNKILFLSAVNNYFKELQRDEVLDNSQEAYAQIDIEAHKKYLKEKGIDYSE 319 (355) Q Consensus 240 rvvr~~D~i~~di~~~~~~~yiGK~~N~~~gr~~~~~~i~~yl~~l~~~g~I~~~~~~~v~id~e~q~~~~~~~~~d~~~ 319 (355) .++|++|.|.+-+++.... |+++ +|++.-|..++..++.||.+|.++|+|..| .+..|.+... T Consensus 553 ~vrR~~~~i~~si~~~~~~-~v~e-pn~~~l~~~i~~~i~~fL~~l~~~gal~g~---~V~~d~~~nt------------ 615 (660) T protein:vir:10 553 NVRRLFNMLKKNIGDASKY-KLFE-LNDNFTRSSFRMEVSQYLDGIKALGGIYEG---RVVCDTTVNT------------ 615 (660) T ss_pred ehhhHHHHHHHHHHHHHHH-hccC-CCCHHHHHHHHHHHHHHHHHHHhCCceeee---EEEEcCCCCC------------ Confidence 9999999999999998765 9999 799999999999999999999999999986 4666644211 Q ss_pred ccceeeeccCCCCEEEEEEEEEEEeeeeEEEEEEeC Q lcl|NC_019422. 
320 MTEQQIKEANTGSYVFVEGNITITDAMEDLKFKIYM 355 (355) Q Consensus 320 ~~d~~v~~~~~~d~v~~~~~i~~vdamEkiy~tv~v 355 (355) ...+ ..-.+++.+.++|+-.||.|.+++.- T Consensus 616 --~~di----~~G~~~~~i~~~P~~pae~I~~~~~~ 645 (660) T protein:vir:10 616 --PAVI----DRNEFIANIYVKPARSINYITLNFVA 645 (660) T ss_pred --HHHh----hCCeEEEEEEEEecCCccEEEEEEEE Confidence 1111 13468899999999999999999887 No 19 >protein:vir:5711 Length: 396 # NCBI annotation: gpFI # Family: family:all:115 # MgeID: mge:120 # MgeName: L-413C # Cross-refs: genbank:acc:NP_839870;genbank:gi:30065725;genbank:GeneID:1260618 Probab=99.69 E-value=1.1e-17 Score=113.57 Aligned_cols=318 Identities=14% Similarity=0.123 Sum_probs=192.9 Q ss_pred CC--CCceEEEeeeeeeeeecCCCceeEEEEEecCCccce------eEEEeehhhhhhhhhh-HHHHHHHHhhhcc-ccc Q lcl|NC_019422. 1 MG--LPSAIIEFQRRSRTVKFRSRRGVVALILKDSTAIKK------SYSIDFLTDINETEFT-KENYDYIRLAFLG-KPS 70 (355) Q Consensus 1 ~g--~P~~~i~f~~~a~ta~~~~~rG~v~iil~d~~~~~~------~~~~~~~~d~~~~~~~-~~n~~~i~~a~~g-~~~ 70 (355) |- +|+++|+-..-....+.....++++++......... ...+.+..+....... ..-...+...+.. ++. T Consensus 1 m~~~~~GV~v~e~~~g~~~i~~v~tav~~~vg~a~~~d~~~~~~~~pv~i~s~~~~~~~~g~~~tl~~al~~~~~~~~~~ 80 (396) T protein:vir:57 1 MSDYHHGVQVLEINDGTRVISTVSTAIVGMVCTASDADAETFPLNKPVLITNVQSAIAKAGKKGTLAASLQAIADQSKPV 80 (396) T ss_pred CCCCCCceEEEEcCCCcccccccCCceEEEEEeccCCCcccccCccCeEeecchhhhhhcccccchHHHHHHhhhcCCce Confidence 43 478887776666655655555666666543221111 1223333332211100 0001112212211 111 Q ss_pred eEEEE--ecC---------------CCccchhHHHHHHHHHhcc------cceEEEEcCCChHHHHHHHHHHHHHHHhcC Q lcl|NC_019422. 71 KVIVE--VIN---------------DSVDSERSLDDALKALREN------KFNYLAIPFISEEVDKTKIVNWIKTARREK 127 (355) Q Consensus 71 ~v~l~--~g~---------------~g~~~~~~y~~al~~le~~------~fn~l~~p~~~d~~~~~~~~~~ik~~r~~g 127 (355) -+.+. .+. -|++........+.+|+.. 
..+.+++|+......++.+.+++++++ T Consensus 81 ~~vv~~~~~~~~~~~~~~a~t~~~iiG~~~~~~~~tgl~al~~~~~~~~~~p~i~~ap~~~~~~v~~al~~~~~~~~--- 157 (396) T protein:vir:57 81 TVVVRVEDGTGDDEETKLAQTVSNIIGTTDENGQYTGLKALMGAESVTGVKPRILGVPGLDTKEVAVALASVCQELN--- 157 (396) T ss_pred eEeeeccccccccccccccccceeeeeeccccccchhhhhhhhcccceeEEeccccCcccchhHHHHHHHHHhhhCc--- Confidence 11100 000 0001111122333343322 245666777666667778888887764 Q ss_pred CeEEEEecC------------CCCcCcceeEEecCCeEecC-----CceecHHHHHHHHHHHhcCccccc----cccccc Q lcl|NC_019422. 128 EIYKAVLPN------------ISDANEKAIINFATTGIKVG-----EKSYTTAEYTARLAGILAGISLSE----SCTYFI 186 (355) Q Consensus 128 ~~~~aVl~~------------~~~~d~egIinv~n~~i~~~-----~~~~~~~~~~a~vAG~~Ag~~~~~----S~T~~~ 186 (355) ..+++-. ....+++..+.+.+.....+ ...+++ ++++||+.|.+...+ |+.|++ T Consensus 158 --~~~~~d~p~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~---s~~~Ag~~a~~d~~~g~~~spaN~~ 232 (396) T protein:vir:57 158 --AFGYISAWGCKTISEVKAYRQNFSQRELMVIWPDFLAWDTVTSTTATAYA---TARALGLRAKIDQEQGWHKTLSNVG 232 (396) T ss_pred --eEEEEcCCCCCCHHHHHHHHhccCCceEEEEcceeeeecccCCceeEEeh---hHHHHHHHHHhhhccCcEeccCCce Confidence 3344322 11245666666666543322 123443 799999999888554 999998 Q ss_pred CCccccc---------CChhhHHHHHhCCeEEEEECCcEEEEecCccccccCCCCCchhhhhhhHhhHHHHHHHHHHHHh Q lcl|NC_019422. 187 LDEVTEI---------EPTENPDEAVEEGKLILINNNGIRIARGVNSLITLSKEDTEDLKKIKIVEAIDMIQDDILQTWN 257 (355) Q Consensus 187 ~~~~~~~---------~~~~e~~~ai~~G~lvl~~dg~v~I~~~INSltt~~~~k~~~f~kirvvr~~D~i~~di~~~~~ 257 (355) +.|+... ++..|.+.+-.+|.-+++++++.++--+ .|+. .+..|+.|.++|++|.|.+.|++.+. 
T Consensus 233 l~gi~~~~~~~~~~~~~~~~~~~~Ln~~gi~t~~~~~G~~~wG~-rT~~-----~d~~~~~i~vrR~~~~i~~~i~~~~~ 306 (396) T protein:vir:57 233 VNGVTGISASVFWDLQKPGTDADLLNEAGVTTLVRRDGFRFWGN-RTCS-----DDPLFLFESYTRTAQVLADTMAEAHM 306 (396) T ss_pred eccccccceecccccCCcchhhhhhhhcCcEEEEcCCCEEEEcc-cccC-----CCcccceeehhhHHHHHHHHHHHHHH Confidence 8876532 2346777888999999988777766543 3442 24569999999999999999999987 Q ss_pred hccccccCCCHHHHHHHHHHHHHHHHHHHhcccccCCCCceeEeccccchhhhhccccccccccceeeeccCCCCEEEEE Q lcl|NC_019422. 258 ENYVGKVTNKYDNKILFLSAVNNYFKELQRDEVLDNSQEAYAQIDIEAHKKYLKEKGIDYSEMTEQQIKEANTGSYVFVE 337 (355) Q Consensus 258 ~~yiGK~~N~~~gr~~~~~~i~~yl~~l~~~g~I~~~~~~~v~id~e~q~~~~~~~~~d~~~~~d~~v~~~~~~d~v~~~ 337 (355) . |+++ +|+..-|..++..++.||.+|.++|+|..+ .+..|.+. +....+.+ -.+++. T Consensus 307 ~-~v~e-~n~~~~~~~i~~~i~~~l~~l~~~gal~g~---~v~~d~~~--------------n~~~~i~~----G~~~~~ 363 (396) T protein:vir:57 307 W-AIDK-PITATLIRDIIDGINAKFRELKNNGYIVDG---TCWFSEES--------------NDAETLKA----GKLYID 363 (396) T ss_pred H-hccC-CCCHHHHHHHHHHHHHHHHHHHhCCceece---EEEEecCC--------------CCHHHhhC----CeEEEE Confidence 6 9999 899999999999999999999999999986 35555432 22222222 358999 Q ss_pred EEEEEEeeeeEEEEEEeC Q lcl|NC_019422. 338 GNITITDAMEDLKFKIYM 355 (355) Q Consensus 338 ~~i~~vdamEkiy~tv~v 355 (355) +.+.|+-.+|.|.++++. T Consensus 364 v~~~p~~p~e~I~~~~~~ 381 (396) T protein:vir:57 364 YDYTPVPPLENLTLRQRI 381 (396) T ss_pred EEEEecCCcceEEEEEEE Confidence 999999999999999888 No 20 >protein:vir:6079 Length: 396 # NCBI annotation: gpFI # Family: family:all:115 # MgeID: mge:126 # MgeName: WPhi # Cross-refs: genbank:acc:NP_878220;genbank:gi:33438919;genbank:GeneID:1457754 Probab=99.69 E-value=7.3e-18 Score=114.54 Aligned_cols=318 Identities=16% Similarity=0.140 Sum_probs=198.2 Q ss_pred CCC--CceEEEeeeeeeeeecCCCceeEEEEEecCCccc------eeEEEeehhhhhhhhhhH-HHHHHHHhhhc-cccc Q lcl|NC_019422. 
1 MGL--PSAIIEFQRRSRTVKFRSRRGVVALILKDSTAIK------KSYSIDFLTDINETEFTK-ENYDYIRLAFL-GKPS 70 (355) Q Consensus 1 ~g~--P~~~i~f~~~a~ta~~~~~rG~v~iil~d~~~~~------~~~~~~~~~d~~~~~~~~-~n~~~i~~a~~-g~~~ 70 (355) |.- |+++|+=..-+...+.....++.+++..-..... ....+.+..+.....-.. .-...+...+. ++.. T Consensus 1 m~~~~~Gv~v~e~~~~~~~v~~~~tav~~fvGta~~~~~~~~p~~~p~~v~s~~~~~~~~g~~~tl~~a~~~~~~~gg~~ 80 (396) T protein:vir:60 1 MSDYHHGVQVLEINEGTRVISTVSTAIVGMVCTASDADAEIFPLNKPVLITNVQSAIAKAGKKGTLAASLQAIADQSKPV 80 (396) T ss_pred CCCCCCCeEEEEcCCCcccccccCceeEEEEecccccccccccCccCeEeechHHHHHhhcCcchhHHHHHHHhhccCce Confidence 554 8999888777777777777777776653221111 122233333321111000 01112222222 1111 Q ss_pred eEEEEe--cCCC---------------ccchhHHHHHHHHHhc------ccceEEEEcCCChHHHHHHHHHHHHHHHhcC Q lcl|NC_019422. 71 KVIVEV--INDS---------------VDSERSLDDALKALRE------NKFNYLAIPFISEEVDKTKIVNWIKTARREK 127 (355) Q Consensus 71 ~v~l~~--g~~g---------------~~~~~~y~~al~~le~------~~fn~l~~p~~~d~~~~~~~~~~ik~~r~~g 127 (355) -+++.. +.+. .........++.+|+. ...+.+.+|+..+..++..+.++++++| T Consensus 81 ~~vv~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~tg~~al~~~~~~~~~~~~il~ap~~~~~~v~~al~~~~~~~~--- 157 (396) T protein:vir:60 81 TVVVRVEDGTGEDEETKLAQTVSNIIGTTDENGQYTGLKALLAAESVTGVKPRILGVPGLDTKEVAVALASVCQKLR--- 157 (396) T ss_pred EEEEecccccccccccccccccccccccccccccccchhhhhhcccceeeeeeeccccccccHHHHHHHHHHhccCC--- Confidence 111110 0000 0000111223333332 2346677787767778888888887665 Q ss_pred CeEEEEecCC------------CCcCcceeEEecCCeEecC---C--ceecHHHHHHHHHHHhcCccccc----cccccc Q lcl|NC_019422. 128 EIYKAVLPNI------------SDANEKAIINFATTGIKVG---E--KSYTTAEYTARLAGILAGISLSE----SCTYFI 186 (355) Q Consensus 128 ~~~~aVl~~~------------~~~d~egIinv~n~~i~~~---~--~~~~~~~~~a~vAG~~Ag~~~~~----S~T~~~ 186 (355) ..+++... 
...++...+.+.+.....+ + ..+++ ++++||++|.+...+ |+.|++ T Consensus 158 --~~~i~d~p~~~~~~~a~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~---s~~~AG~~a~~d~~~g~~~spaN~~ 232 (396) T protein:vir:60 158 --AFGYISAWGCKTISEVKAYRQNFSQRELMVIWPDFLAWDTVASTTATAYA---TARALGLRAKIDQEQGWHKTLSNVG 232 (396) T ss_pred --eEEEEeCCCCCCHHHHHHHHhhcCCceEEEEeCceeeecccCCceeEEch---hHHHHHHHHHhhhccCcEeCcCCce Confidence 23333211 1234556666666544322 1 12343 799999999988665 899998 Q ss_pred CCccccc---------CChhhHHHHHhCCeEEEEECCcEEEEecCccccccCCCCCchhhhhhhHhhHHHHHHHHHHHHh Q lcl|NC_019422. 187 LDEVTEI---------EPTENPDEAVEEGKLILINNNGIRIARGVNSLITLSKEDTEDLKKIKIVEAIDMIQDDILQTWN 257 (355) Q Consensus 187 ~~~~~~~---------~~~~e~~~ai~~G~lvl~~dg~v~I~~~INSltt~~~~k~~~f~kirvvr~~D~i~~di~~~~~ 257 (355) +.|+... ++..|.+.+-.+|.-+++++++.++- |-.|+.+ +..|+.|.++|++|.|.+.+++.+. T Consensus 233 l~gi~~~~~~~~~~~~~~~~~~~~Ln~~gI~~~~~~~G~~~w-G~rT~~~-----d~~~~~i~~rR~~~~i~~~i~~~~~ 306 (396) T protein:vir:60 233 VNGVTGISASVFWDLQESGTDADLLNESGVTTLIRRDGFRFW-GNRTCSD-----DPLFLFENYTRTAQVLADTMAEAHM 306 (396) T ss_pred ecceeeceeecccccCCCcchhhhhhhcCcEEEEcCCCEEEE-cccccCC-----CcccceeehhhHHHHHHHHHHHHHH Confidence 8876532 24467888889999999887777654 5556532 3469999999999999999999987 Q ss_pred hccccccCCCHHHHHHHHHHHHHHHHHHHhcccccCCCCceeEeccccchhhhhccccccccccceeeeccCCCCEEEEE Q lcl|NC_019422. 258 ENYVGKVTNKYDNKILFLSAVNNYFKELQRDEVLDNSQEAYAQIDIEAHKKYLKEKGIDYSEMTEQQIKEANTGSYVFVE 337 (355) Q Consensus 258 ~~yiGK~~N~~~gr~~~~~~i~~yl~~l~~~g~I~~~~~~~v~id~e~q~~~~~~~~~d~~~~~d~~v~~~~~~d~v~~~ 337 (355) . |+++ +|+..-|..++..++.||.+|.++|+|..+ .+..|.+ .++...+.+ -.+++. 
T Consensus 307 ~-~v~e-~n~~~~~~~i~~~i~~~l~~l~~~gal~g~---~~~~d~~--------------~nt~~~i~~----G~~~~~ 363 (396) T protein:vir:60 307 W-AVDK-PITATLIRDIVDGINAKFRELKTNGYIVDA---TCWFSEE--------------SNDAETLKA----GKLYID 363 (396) T ss_pred H-hccC-CCCHHHHHHHHHHHHHHHHHHHhCCceece---EEEEecC--------------CCCHHHhhC----CEEEEE Confidence 6 9999 899999999999999999999999999876 3444432 223333333 358999 Q ss_pred EEEEEEeeeeEEEEEEeC Q lcl|NC_019422. 338 GNITITDAMEDLKFKIYM 355 (355) Q Consensus 338 ~~i~~vdamEkiy~tv~v 355 (355) +.+.|+-.+|.|.++++. T Consensus 364 i~~~p~~pae~I~~~~~~ 381 (396) T protein:vir:60 364 YDYTPVPPLENLTLRQRI 381 (396) T ss_pred EEEEecCCcceEEEEEEE Confidence 999999999999999988 No 21 >protein:vir:101804 Length: 663 # NCBI annotation: gp18 # Family: family:all:661 # MgeID: mge:1580 # MgeName: 31 # Cross-refs: genbank:acc:YP_238881;genbank:gi:66391956;genbank:GeneID:3416631 Probab=99.68 E-value=1.6e-16 Score=107.12 Aligned_cols=317 Identities=11% Similarity=0.070 Sum_probs=179.5 Q ss_pred CCCCceEE----------EeeeeeeeeecCCC---------------c---------e-eEEEEEecCCccceeEEEeeh Q lcl|NC_019422. 1 MGLPSAII----------EFQRRSRTVKFRSR---------------R---------G-VVALILKDSTAIKKSYSIDFL 45 (355) Q Consensus 1 ~g~P~~~i----------~f~~~a~ta~~~~~---------------r---------G-~v~iil~d~~~~~~~~~~~~~ 45 (355) .+.|.+.. ...-.+.++...+. + . ...++...+......+.++.. T Consensus 217 ~~~~~i~A~~~G~~Gn~i~V~i~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~ 296 (663) T protein:vir:10 217 FGMPLISAVYPGEIGSTVEVEIVSKTAFNSGAQQTIYPFGGTRTSNARGVIQYGPMTDDQFAIIVRRDGIVVESTVLSTR 296 (663) T ss_pred cccceEEeccCCcccceeeeeeccccccccccccceecccccccccccceeecccccccceeeEeecCCcceeeeccccc Confidence 33333221 11111111110000 0 0 011111111111111111110 Q ss_pred hhhhhhhhhHHHHHHHHhhhc------------ccc---c-eEEEEecCCCc--cchhHHHHHHHHHhcc---cceEEEE Q lcl|NC_019422. 
46 TDINETEFTKENYDYIRLAFL------------GKP---S-KVIVEVINDSV--DSERSLDDALKALREN---KFNYLAI 104 (355) Q Consensus 46 ~d~~~~~~~~~n~~~i~~a~~------------g~~---~-~v~l~~g~~g~--~~~~~y~~al~~le~~---~fn~l~~ 104 (355) .+... ... ...++...+. +.| + -+.+.+|.++. .+..+|..+++.|+.. ..+.+++ T Consensus 297 ~~~~~--~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~gg~d~~~~~~~~~~~~~~~~l~~~~~~~~~~~i~ 373 (663) T protein:vir:10 297 KGDRD--VYG-SNIFMDDYFRNGGSNFIFASSEGWPAGFTGIIQLGGGTSANADVGADELIKGWDLFSDREALHVNLMIA 373 (663) T ss_pred ccccc--ccc-chhhhhhhhcCCcceEEEEeecccCccccceeEeccccCCccccchhhhHHHHHhhhcccccceeEEEe Confidence 01000 000 0111111111 111 1 13456666654 3457788888887654 5666666 Q ss_pred cCCC-h-----HHHHHHHHHHHHHHHhcCCeEEEEecCC---------------------------------CCcCccee Q lcl|NC_019422. 105 PFIS-E-----EVDKTKIVNWIKTARREKEIYKAVLPNI---------------------------------SDANEKAI 145 (355) Q Consensus 105 p~~~-d-----~~~~~~~~~~ik~~r~~g~~~~aVl~~~---------------------------------~~~d~egI 145 (355) |... + ..+++.+.++++++|+ +.+++... ...+++.. T Consensus 374 ~~~~~~~~~~~~~v~~~l~~~a~~~~~----~~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~ 449 (663) T protein:vir:10 374 GACGSDGAEIASTVQKYVVSLADDRQD----CVAIVNPPAELMVGIPTSTAVKNIVEWRNGMTGSGEVVDNNMNISSTYA 449 (663) T ss_pred ccCCCCchhhHHHHHHHHHHHHHhhCC----EEEEEecCcccccccccccchHHHHHHHHhccccccchhhhcccCccce Confidence 5311 1 2356777777777653 33333211 01344555 Q ss_pred EEecCCeEec---CCc--eecHHHHHHHHHHHhcCccccc----ccccccCC---cccc---cCChhhHHHHHhCCeEEE Q lcl|NC_019422. 146 INFATTGIKV---GEK--SYTTAEYTARLAGILAGISLSE----SCTYFILD---EVTE---IEPTENPDEAVEEGKLIL 210 (355) Q Consensus 146 inv~n~~i~~---~~~--~~~~~~~~a~vAG~~Ag~~~~~----S~T~~~~~---~~~~---~~~~~e~~~ai~~G~lvl 210 (355) +.+.+-.... ++. .+++ ++.+||++|.+...+ |+-++++. ++.. 
.+++.|.+.+..+|.-++ T Consensus 450 ~~~~p~~~~~d~~~~~~~~~p~---sg~vAGl~Ar~D~~~g~~~sPan~~~~~i~g~~~~~~~~~~~e~~~Ln~~gin~i 526 (663) T protein:vir:10 450 FIIGNYKYQYDKYNDINRWVPL---AADIAGLCAYTDQVSHPWMSPAGYRRGQIRNCIKLAIEPKQSMRDTMYQVAINPV 526 (663) T ss_pred EEEcCceEEecccCCceEEech---hHHHHHHHHHhhccCCceEccCCceeccccccccceeecChhHHHHHhhCCceEE Confidence 5554443322 222 2344 799999999888665 66666533 4332 357789999999997555 Q ss_pred E--EC-CcEEEEecCccccccCCCCCchhhhhhhHhhHHHHHHHHHHHHhhccccccCCCHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019422. 211 I--NN-NGIRIARGVNSLITLSKEDTEDLKKIKIVEAIDMIQDDILQTWNENYVGKVTNKYDNKILFLSAVNNYFKELQR 287 (355) Q Consensus 211 ~--~d-g~v~I~~~INSltt~~~~k~~~f~kirvvr~~D~i~~di~~~~~~~yiGK~~N~~~gr~~~~~~i~~yl~~l~~ 287 (355) . .+ +++++ -|-.|+. ..+.+|+.|.++|++|.|.+-|++.... |+++ +|++.-|..++..|+.||.+|.+ T Consensus 527 ~~~~~~~G~~~-wG~rT~~----~~~s~~~~i~vrR~~~~i~~si~~~~~~-~v~e-pn~~~l~~~i~~~i~~~L~~l~~ 599 (663) T protein:vir:10 527 TGFAGGDGFVL-FGDKMAT----QVPSPFDRINVRRLFNMLKKNIGDTSKY-ELFE-NNDAFTRQSFRMETSQYLDGIRS 599 (663) T ss_pred EEEeCCCcEEE-EcccccC----CCCcccceEehhhHHHHHHHHHHHHHHH-hccC-CCCHHHHHHHHHHHHHHHHHHHh Confidence 3 34 34444 4556652 2335799999999999999999998775 9999 79999999999999999999999 Q ss_pred cccccCCCCceeEeccccchhhhhccccccccccceeeeccCCCCEEEEEEEEEEEeeeeEEEEEEeC Q lcl|NC_019422. 288 DEVLDNSQEAYAQIDIEAHKKYLKEKGIDYSEMTEQQIKEANTGSYVFVEGNITITDAMEDLKFKIYM 355 (355) Q Consensus 288 ~g~I~~~~~~~v~id~e~q~~~~~~~~~d~~~~~d~~v~~~~~~d~v~~~~~i~~vdamEkiy~tv~v 355 (355) +|+|..| .+.+|.+-. ....+. .-.+++.+.++|+-.+|.|.++|.. 
T Consensus 600 ~gal~g~---~v~~d~~~n--------------t~~~i~----~G~~~~~i~~~p~~pae~i~~~~~~ 646 (663) T protein:vir:10 600 LGGCYDF---RVVCDTTNN--------------TPNVID----RNEFVGTIYVKPPRSINYITLNMVA 646 (663) T ss_pred CCceeee---EEEEcCCCC--------------CHHHhh----CCeEEEEEEEEecCCcceEEEEEEE Confidence 9999986 466665422 111111 2467999999999999999999887 No 22 >protein:vir:2035 Length: 396 # NCBI annotation: gpFI # Family: family:all:115 # MgeID: mge:315 # MgeName: P2 # Cross-refs: genbank:acc:NP_046778;genbank:gi:9630349;genbank:GeneID:1261516 Probab=99.67 E-value=1.5e-17 Score=112.85 Aligned_cols=318 Identities=15% Similarity=0.127 Sum_probs=192.4 Q ss_pred CCC--CceEEEeeeeeeeeecCCCceeEEEEEecCCccc------eeEEEeehhhhhhhhhh-HHHHHHHHhhhccc-cc Q lcl|NC_019422. 1 MGL--PSAIIEFQRRSRTVKFRSRRGVVALILKDSTAIK------KSYSIDFLTDINETEFT-KENYDYIRLAFLGK-PS 70 (355) Q Consensus 1 ~g~--P~~~i~f~~~a~ta~~~~~rG~v~iil~d~~~~~------~~~~~~~~~d~~~~~~~-~~n~~~i~~a~~g~-~~ 70 (355) |.= |+++|+-..-+...+......+++++-....... ...-+.+..+.....-. ..-...+...|..+ .. T Consensus 1 m~~~~~GV~v~e~~~g~~~i~~v~tav~~~vg~a~~a~~~~~~l~~pvlvts~~~~~~~~g~~~tL~~al~~~~~ngg~~ 80 (396) T protein:vir:20 1 MSDYHHGVQVLEINEGTRVISTVSTAIVGMVCTASDADAETFPLNKPVLITNVQSAISKAGKKGTLAASLQAIADQSKPV 80 (396) T ss_pred CCCCCCCeEEEEcCCCcceeeecCCceeEEEeeeccCCCccccCccCEEeechHHHHhhcccccchhhhhhhhhccCcee Confidence 654 9999998888777776666666666543211111 12223333332211100 00112222222221 11 Q ss_pred eEEEEec--C----------------CCcc--chhHHHHHHHHHhcc---cceEEEEcCCChHHHHHHHHHHHHHHHhcC Q lcl|NC_019422. 71 KVIVEVI--N----------------DSVD--SERSLDDALKALREN---KFNYLAIPFISEEVDKTKIVNWIKTARREK 127 (355) Q Consensus 71 ~v~l~~g--~----------------~g~~--~~~~y~~al~~le~~---~fn~l~~p~~~d~~~~~~~~~~ik~~r~~g 127 (355) .+.+..+ . ++.. .......+|...+.. 
....+..|+.....+++.+.+++.++| T Consensus 81 ~~v~~~~~~~~~~~~~~~a~t~~~~~~~~~~~~~~tg~~al~~~~~~~~~~p~i~~ap~~~~~~v~~al~~~~~~~~--- 157 (396) T protein:vir:20 81 TVVMRVEDGTGDDEETKLAQTVSNIIGTTDENGQYTGLKAMLAAESVTGVKPRILGVPGLDTKEVAVALASVCQKLR--- 157 (396) T ss_pred EEEEeccccccccccccccccccccccccccccccchhhhhhhhccccccchhhhhhhhhccHHHHHHHHHHHhcCC--- Confidence 1111100 0 0000 001112223222221 223444565555567777777776654 Q ss_pred CeEEEEecC------------CCCcCcceeEEecCCeEecCC-----ceecHHHHHHHHHHHhcCcccc----ccccccc Q lcl|NC_019422. 128 EIYKAVLPN------------ISDANEKAIINFATTGIKVGE-----KSYTTAEYTARLAGILAGISLS----ESCTYFI 186 (355) Q Consensus 128 ~~~~aVl~~------------~~~~d~egIinv~n~~i~~~~-----~~~~~~~~~a~vAG~~Ag~~~~----~S~T~~~ 186 (355) ..+++-. ....+++..+.+.+.....+. ..+++ ++++||++|..... .|+.|++ T Consensus 158 --~~~~iD~p~~~~~~~a~~~r~~~~s~~~~~~~P~~~~~d~~~~~~~~~p~---s~~~Ag~~a~~d~~~g~~~spaN~~ 232 (396) T protein:vir:20 158 --AFGYISAWGCKTISEVKAYRQNFSQRELMVIWPDFLAWDTVTSTTATAYA---TARALGLRAKIDQEQGWHKTLSNVG 232 (396) T ss_pred --cEEEEecCCCCCHHHHHHHhhCCCCceEEEEcCccccccCcCCcceeech---hHHHHHHHHHhhhhcCcEeccCCce Confidence 2333321 123456666666555433221 12333 79999999987755 4888998 Q ss_pred CCccccc---------CChhhHHHHHhCCeEEEEECCcEEEEecCccccccCCCCCchhhhhhhHhhHHHHHHHHHHHHh Q lcl|NC_019422. 187 LDEVTEI---------EPTENPDEAVEEGKLILINNNGIRIARGVNSLITLSKEDTEDLKKIKIVEAIDMIQDDILQTWN 257 (355) Q Consensus 187 ~~~~~~~---------~~~~e~~~ai~~G~lvl~~dg~v~I~~~INSltt~~~~k~~~f~kirvvr~~D~i~~di~~~~~ 257 (355) +.|+... ++..|.+.+-.+|...++++++.++- |-.|+. .+..|+.|.++|++|.|.+.+++.+. 
T Consensus 233 l~gi~~~~~~~~~~~~~~~~~~~~Ln~~gi~~~~~~~G~~~w-G~rT~s-----~d~~~~~i~~rR~~~~i~~~~~~~~~ 306 (396) T protein:vir:20 233 VNGVTGISASVFWDLQESGTDADLLNESGVTTLIRRDGFRFW-GNRTCS-----DDPLFLFENYTRTAQVVADTMAEAHM 306 (396) T ss_pred eccceecceecccccCCCcchhhhhhhcCcEEEEcCCCEEEE-cccccC-----CCcccceeehhhHHHHHHHHHHHHHH Confidence 8876532 23567888889999999887776554 555543 24569999999999999999999987 Q ss_pred hccccccCCCHHHHHHHHHHHHHHHHHHHhcccccCCCCceeEeccccchhhhhccccccccccceeeeccCCCCEEEEE Q lcl|NC_019422. 258 ENYVGKVTNKYDNKILFLSAVNNYFKELQRDEVLDNSQEAYAQIDIEAHKKYLKEKGIDYSEMTEQQIKEANTGSYVFVE 337 (355) Q Consensus 258 ~~yiGK~~N~~~gr~~~~~~i~~yl~~l~~~g~I~~~~~~~v~id~e~q~~~~~~~~~d~~~~~d~~v~~~~~~d~v~~~ 337 (355) . |+++ +|+..-+..++..++.||.+|.++|+|..+ .+..|.+. +....+.+ -.+++. T Consensus 307 ~-~v~e-~~~~~~~~~i~~~i~~~L~~l~~~G~l~g~---~v~~d~~~--------------nt~~~i~~----G~~~~~ 363 (396) T protein:vir:20 307 W-AVDK-PITATLIRDIVDGINAKFRELKTNGYIVDA---TCWFSEES--------------NDAETLKA----GKLYID 363 (396) T ss_pred H-hccC-CCCHHHHHHHHHHHHHHHHHHHhCcceece---EEEEecCC--------------CCHHHhhC----CEEEEE Confidence 5 9999 899999999999999999999999999876 45555432 22222222 458999 Q ss_pred EEEEEEeeeeEEEEEEeC Q lcl|NC_019422. 338 GNITITDAMEDLKFKIYM 355 (355) Q Consensus 338 ~~i~~vdamEkiy~tv~v 355 (355) +.+.|+-.+|.|.++++. T Consensus 364 i~~~p~~p~e~i~~~~~~ 381 (396) T protein:vir:20 364 YDYTPVPPLENLTLRQRI 381 (396) T ss_pred EEEEecCCcceEEEEEEE Confidence 999999999999999888 No 23 >protein:vir:101187 Length: 663 # NCBI annotation: tail sheath protein # Family: family:all:661 # MgeID: mge:1582 # MgeName: 44RR2.8t # Cross-refs: genbank:acc:NP_932509;genbank:gi:37651635;genbank:GeneID:2610680 Probab=99.67 E-value=6.5e-16 Score=103.86 Aligned_cols=317 Identities=10% Similarity=0.057 Sum_probs=179.3 Q ss_pred CCCCce----------EEEeeeeeeeeecC------------------------CCc-eeEEEEEecCCccceeEEEeeh Q lcl|NC_019422. 
1 MGLPSA----------IIEFQRRSRTVKFR------------------------SRR-GVVALILKDSTAIKKSYSIDFL 45 (355) Q Consensus 1 ~g~P~~----------~i~f~~~a~ta~~~------------------------~~r-G~v~iil~d~~~~~~~~~~~~~ 45 (355) .+.|.+ .+...--...+... ... ....++...+............ T Consensus 217 ~~~~~~~a~~~G~~Gn~i~v~i~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~ 296 (663) T protein:vir:10 217 FGMPLVSAVYPGEIGSTVEVEIVSKTAFNSGAQQTIYPFGGTRTSNARGVIQYGPMTDDQFAIIVRRDGIVVESTVLSTR 296 (663) T ss_pred ccceeeeeecccccccceeEEecccccccccccccccccccccccccceeeeeccccccceeEEEecCCcceeeeeeeec Confidence 111111 11111000000000 000 0111222222211111111111 Q ss_pred hhhhhhhhhHHHHHHHHhhhc------------ccc---c-eEEEEecCCCc--cchhHHHHHHHHHhc---ccceEEEE Q lcl|NC_019422. 46 TDINETEFTKENYDYIRLAFL------------GKP---S-KVIVEVINDSV--DSERSLDDALKALRE---NKFNYLAI 104 (355) Q Consensus 46 ~d~~~~~~~~~n~~~i~~a~~------------g~~---~-~v~l~~g~~g~--~~~~~y~~al~~le~---~~fn~l~~ 104 (355) .+.. . ... ...++...+. +.| + -+.+.+|.++. ++..+|..+++.|+. ...+.+.+ T Consensus 297 ~~~~-~-~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~gg~d~~~~~~~~~~~~~~~~l~~~~~~~~~~~i~ 373 (663) T protein:vir:10 297 KGDR-D-VYG-SNIFMDDYFRNGGSNFIFASSEGWPAGFTGIIQLGGGTSANADVGADELIKGWDLFSDREALHVNLMIA 373 (663) T ss_pred cccc-c-cch-hhhhhhhhhccCcceEEEEeecccCccccceeEcccccCCCccccchhhHHHHHhhhcccccceeEEEe Confidence 1110 0 000 0111222111 111 1 13466666653 345778888887765 35666666 Q ss_pred cCC--Ch----HHHHHHHHHHHHHHHhcCCeEEEEecCC---------------------------------CCcCccee Q lcl|NC_019422. 105 PFI--SE----EVDKTKIVNWIKTARREKEIYKAVLPNI---------------------------------SDANEKAI 145 (355) Q Consensus 105 p~~--~d----~~~~~~~~~~ik~~r~~g~~~~aVl~~~---------------------------------~~~d~egI 145 (355) |.. .+ .++++.+.++++++|+ +.+++... ..+|++.. 
T Consensus 374 ~~~~~~~~~~~~~v~~al~~~a~~~~~----~~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~ 449 (663) T protein:vir:10 374 GACGSDGAEIASTVQKYVVSLADDRQD----CVAIVNPPAELMVGIPTSTAVKNIVEWRNGMTGSGEVVDNNMNISSTYA 449 (663) T ss_pred ccCCCCchhhHHHHHHHHHHHHHhhCC----EEEEEecCcccccccccccchHHHHHHHHhccccccchhhccccCcceE Confidence 421 11 3456677777777653 44444211 11344545 Q ss_pred EEecCCeEec---CCc--eecHHHHHHHHHHHhcCccccc----ccccccCC---cccc---cCChhhHHHHHhCCeEEE Q lcl|NC_019422. 146 INFATTGIKV---GEK--SYTTAEYTARLAGILAGISLSE----SCTYFILD---EVTE---IEPTENPDEAVEEGKLIL 210 (355) Q Consensus 146 inv~n~~i~~---~~~--~~~~~~~~a~vAG~~Ag~~~~~----S~T~~~~~---~~~~---~~~~~e~~~ai~~G~lvl 210 (355) +.+.+-.... ++. .+++ ++.+||+.|.....+ |+.++++. |+.. .+++.|.+.+..+|.-++ T Consensus 450 ~l~~P~~~~~d~~~~~~~~~p~---s~~vAGl~Ar~D~~~g~~~sPan~~~~~i~g~~~~~~~~~~~~~~~Ln~~gin~i 526 (663) T protein:vir:10 450 FIIGNYKYQYDKYNDINRWVPL---AADIAGLCAYTDQVSHPWMSPAGYRRGQIRNCIKLAIEPKQSMRDTMYQVAINPV 526 (663) T ss_pred EEEccceEEecccCCceEEech---hHHHHHHHHHhhccCCceEccCCceeccccccccceeccChhHHHHHhhCCceEE Confidence 5444433222 222 2344 799999999887665 55555433 3332 357789999999997555 Q ss_pred E--EC-CcEEEEecCccccccCCCCCchhhhhhhHhhHHHHHHHHHHHHhhccccccCCCHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019422. 211 I--NN-NGIRIARGVNSLITLSKEDTEDLKKIKIVEAIDMIQDDILQTWNENYVGKVTNKYDNKILFLSAVNNYFKELQR 287 (355) Q Consensus 211 ~--~d-g~v~I~~~INSltt~~~~k~~~f~kirvvr~~D~i~~di~~~~~~~yiGK~~N~~~gr~~~~~~i~~yl~~l~~ 287 (355) . .+ +++++ -|-.|+. ..+.+|+.|.++|++|+|.+-|++.... 
|+++ +|+..-|..++..|+.||.+|.+ T Consensus 527 ~~~~~~~G~~~-wG~rT~s----~~~s~~~~i~vrR~~~~i~~si~~~~~~-~v~e-~n~~~l~~~i~~~i~~~L~~l~~ 599 (663) T protein:vir:10 527 TGFAGGDGFVL-FGDKMAT----QVPSPFDRINVRRLFNMLKKNIGDTSKY-ELFE-NNDAFTRQSFRMETSQYLDGIRS 599 (663) T ss_pred EEEeCCCcEEE-EcccccC----CCCcccceEehhhHHHHHHHHHHHHHHH-hccC-CCCHHHHHHHHHHHHHHHHHHHh Confidence 3 34 45655 5667752 2335799999999999999999998775 9999 79999999999999999999999 Q ss_pred cccccCCCCceeEeccccchhhhhccccccccccceeeeccCCCCEEEEEEEEEEEeeeeEEEEEEeC Q lcl|NC_019422. 288 DEVLDNSQEAYAQIDIEAHKKYLKEKGIDYSEMTEQQIKEANTGSYVFVEGNITITDAMEDLKFKIYM 355 (355) Q Consensus 288 ~g~I~~~~~~~v~id~e~q~~~~~~~~~d~~~~~d~~v~~~~~~d~v~~~~~i~~vdamEkiy~tv~v 355 (355) +|+|..| .+.+|.+-... ..+ ..-.+++.+.++|+-.+|.|.++|.. T Consensus 600 ~gal~g~---~v~~d~~~nt~--------------~~i----~~G~~~~~i~~~p~~pae~i~~~~~~ 646 (663) T protein:vir:10 600 LGGCYDF---RVVCDTTNNTP--------------NVI----DRNEFVGTIYVKPPRSINYITLNMVA 646 (663) T ss_pred cCceeee---EEEEcCCCCCH--------------HHh----hCCeEEEEEEEEecCCcceEEEEEEE Confidence 9999986 56666553211 111 13468999999999999999999887 No 24 >protein:vir:106427 Length: 679 # NCBI annotation: gp18 tail sheath protein # Family: family:all:661 # MgeID: mge:1474 # MgeName: Aeh1 # Cross-refs: genbank:acc:NP_944106;genbank:gi:38640150;genbank:GeneID:2658188 Probab=99.67 E-value=1.1e-16 Score=108.00 Aligned_cols=317 Identities=10% Similarity=0.048 Sum_probs=180.5 Q ss_pred CCCCceEEEeeeeeeeeecCCCce---eEEEEEecCCccceeEEEeehhhhhhhhhhHHHHHHHHhhhc----------- Q lcl|NC_019422. 1 MGLPSAIIEFQRRSRTVKFRSRRG---VVALILKDSTAIKKSYSIDFLTDINETEFTKENYDYIRLAFL----------- 66 (355) Q Consensus 1 ~g~P~~~i~f~~~a~ta~~~~~rG---~v~iil~d~~~~~~~~~~~~~~d~~~~~~~~~n~~~i~~a~~----------- 66 (355) .............+.++...+... -..++...+....+.+.+......... .....++...+. 
T Consensus 267 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vvv~~~g~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~v~~~~ 343 (679) T protein:vir:10 267 PKVFPTGLDYGNVTPSSYLEFGPQNESQFAFIVFNNGVAVESKILSTKPGDRDI---YGTSIYINEYFGNGYSSFVQGVA 343 (679) T ss_pred ccccccccccccceeeeecccccccccceeeEEecccccccceeeecccccccc---cchhhhhhhhhcCcccceeeecc Confidence 000000000111111111111000 122222222222222222221111110 001112222111 Q ss_pred -c---ccc-eEEEEecCCCc--cchhHHHHHHHHH---hcccceEEEEcCCCh------HHHHHHHHHHHHHHHhcCCeE Q lcl|NC_019422. 67 -G---KPS-KVIVEVINDSV--DSERSLDDALKAL---RENKFNYLAIPFISE------EVDKTKIVNWIKTARREKEIY 130 (355) Q Consensus 67 -g---~~~-~v~l~~g~~g~--~~~~~y~~al~~l---e~~~fn~l~~p~~~d------~~~~~~~~~~ik~~r~~g~~~ 130 (355) + +++ -+.+.+|.++. .+..++..++..+ +.+.+|.|++|+... .++++.+.++++++|+ + T Consensus 344 ~~~~~~~~~~~~~~gg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~p~~~~~~~~~~~~v~~~l~~~~~~~~~----~ 419 (679) T protein:vir:10 344 ESWPVGYTGVLAFGGGQSSNTDISAAEFMKGWDMFADREHTDVNLFIAGAVAGEGAQIASTVQKAVVAIADERRD----C 419 (679) T ss_pred ccccccccceeeccCCccCCCccchhhhhhhhhhhhcccccccceEEecCCCCCchhhhHHHHHHHHHHHHhhCC----e Confidence 1 112 23355666553 2345666666655 445789999997542 3477888888888874 3 Q ss_pred EEEecCC---------------------------------CCcCcceeEEecCCeEec---CCc--eecHHHHHHHHHHH Q lcl|NC_019422. 131 KAVLPNI---------------------------------SDANEKAIINFATTGIKV---GEK--SYTTAEYTARLAGI 172 (355) Q Consensus 131 ~aVl~~~---------------------------------~~~d~egIinv~n~~i~~---~~~--~~~~~~~~a~vAG~ 172 (355) .+++-.. ..+|+.....+.+-.... ++. .+++ ++++||+ T Consensus 420 ~ai~d~p~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~---sg~vAGl 496 (679) T protein:vir:10 420 LVLISPPREYMINQPAASVVRKLVDWRRGVNQAGISLDDNMNIGTTYASVDGNYKYQYDKYNDVNRWIPL---AADIAGL 496 (679) T ss_pred EEEEeccccccccccccchHHHHHHHHhhcccccchhhhhhccCcceEEEEccceeeecccCCceEEech---HHHHHHH Confidence 3333110 012333333333322221 122 2343 7999999 Q ss_pred hcCccc----ccccccccCCccc---c---cCChhhHHHHHhCCeEEEEE--CCcEEEEecCccccccCCCCCchhhhhh Q lcl|NC_019422. 
173 LAGISL----SESCTYFILDEVT---E---IEPTENPDEAVEEGKLILIN--NNGIRIARGVNSLITLSKEDTEDLKKIK 240 (355) Q Consensus 173 ~Ag~~~----~~S~T~~~~~~~~---~---~~~~~e~~~ai~~G~lvl~~--dg~v~I~~~INSltt~~~~k~~~f~kir 240 (355) +|.+.. -+|+.++++.++. + .+++.|.+.+..+|.-++.+ +++.++- |-.|+. ..+.+|+.|. T Consensus 497 ~Ar~D~~~g~~~sPan~~~~~i~g~~~~~~~~~~~~~~~Ln~~gin~i~~~~g~G~~~w-G~rT~~----~~~s~~~~i~ 571 (679) T protein:vir:10 497 CARTDTVGQPWQSPAGFNRGQIVNVIKLAVDTRQAHRDEMYTNGINPIVGFAGQGYILY-GDKTAS----QAPTPFDRIN 571 (679) T ss_pred HHHhhccCCcEECcCCeeeccccccccceeecChhhHHhhhhCCceEEEEecCCeEEEE-cccccC----CCCcccceEe Confidence 998864 4577777655443 2 24678999999999877653 4466663 334541 2235899999 Q ss_pred hHhhHHHHHHHHHHHHhhccccccCCCHHHHHHHHHHHHHHHHHHHhcccccCCCCceeEeccccchhhhhccccccccc Q lcl|NC_019422. 241 IVEAIDMIQDDILQTWNENYVGKVTNKYDNKILFLSAVNNYFKELQRDEVLDNSQEAYAQIDIEAHKKYLKEKGIDYSEM 320 (355) Q Consensus 241 vvr~~D~i~~di~~~~~~~yiGK~~N~~~gr~~~~~~i~~yl~~l~~~g~I~~~~~~~v~id~e~q~~~~~~~~~d~~~~ 320 (355) ++|++|+|.+.|++.... |+++ +|+..-|..++..|..||.+|.++|+|..| .+.+|.+- | T Consensus 572 vrR~~~~i~~si~~~~~~-~v~e-pn~~~~~~~i~~~i~~fL~~l~~~gal~gf---~v~~d~~~--------------n 632 (679) T protein:vir:10 572 VRRLFNLLKKSISESAKY-KLFE-LNDAFTRSSFRSEVGSYLDTIRSLGGIYDF---RVVCDESN--------------N 632 (679) T ss_pred hhhHHHHHHHHHHHHHHH-hccC-CCCHHHHHHHHHHHHHHHHHHHhCCceeee---EEEEcCCC--------------C Confidence 999999999999998775 9999 799999999999999999999999999986 46665432 1 Q ss_pred cceeeeccCCCCEEEEEEEEEEEeeeeEEEEEEeC Q lcl|NC_019422. 321 TEQQIKEANTGSYVFVEGNITITDAMEDLKFKIYM 355 (355) Q Consensus 321 ~d~~v~~~~~~d~v~~~~~i~~vdamEkiy~tv~v 355 (355) ....+. 
.-.+++.+.++|+-.||.|.++|.- T Consensus 633 t~~~i~----~G~~~~~i~~~p~~pae~i~~~~~~ 663 (679) T protein:vir:10 633 TPAVID----RNEFVATILIKPARSINYITLSFVA 663 (679) T ss_pred CHHHhh----CCeEEEEEEEEecCCccEEEEEEEE Confidence 111222 2457999999999999999999887 No 25 >protein:vir:4517 Length: 498 # NCBI annotation: tail sheath protein # Family: family:all:369 # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599043;genbank:gi:19549001;genbank:GeneID:935236 Probab=99.66 E-value=5.5e-17 Score=109.71 Aligned_cols=301 Identities=12% Similarity=0.084 Sum_probs=186.5 Q ss_pred CCCCce-EEEeeeeeeeeecCCCceeEE-EEEecCCccceeEEEeehhhhhhhhhhHHHHHHHHhhhccccceE--EEEe Q lcl|NC_019422. 1 MGLPSA-IIEFQRRSRTVKFRSRRGVVA-LILKDSTAIKKSYSIDFLTDINETEFTKENYDYIRLAFLGKPSKV--IVEV 76 (355) Q Consensus 1 ~g~P~~-~i~f~~~a~ta~~~~~rG~v~-iil~d~~~~~~~~~~~~~~d~~~~~~~~~n~~~i~~a~~g~~~~v--~l~~ 76 (355) -.||-. ...--..+.||++.|.-|+-+ +-+. +.-....|. .|.++ .++. T Consensus 150 ~~lPVTA~~~~~~VtlTAr~kG~~GN~I~l~~~--------~~~~~~ge~-------------------~p~Glt~~ita 202 (498) T protein:vir:45 150 PTLPFTASSSAGVVTLTARHKGLCGNEIPVSLN--------YYGFGGGEV-------------------LPAGVQIAVAT 202 (498) T ss_pred CCCceEEEecCceEEEEeeccCccccceeEEEe--------ecccccccc-------------------ccceeeEEEEc Confidence 455521 122223444555555555321 1110 000000111 23333 2444 Q ss_pred cCCCccchhHHHHHHHHHhcccceEEEEcCCChHHHHHHHHHHHHH----HH-hcCCeEEEEecCC----------CCcC Q lcl|NC_019422. 77 INDSVDSERSLDDALKALRENKFNYLAIPFISEEVDKTKIVNWIKT----AR-REKEIYKAVLPNI----------SDAN 141 (355) Q Consensus 77 g~~g~~~~~~y~~al~~le~~~fn~l~~p~~~d~~~~~~~~~~ik~----~r-~~g~~~~aVl~~~----------~~~d 141 (355) .++|+. +.|++++|.+|....||++++|-. |++--+.+.+|+.. +. -+.+..+++.+.. 
...| T Consensus 203 magGag-~PD~a~alaal~~~~~~~I~~p~~-D~asL~al~~~L~~~sgRw~~~~q~~g~~~~a~~gT~~~l~t~g~~~N 280 (498) T protein:vir:45 203 GTAGTG-APVLTGAVAAMADEPFDYIGLPFN-DTASVNTLVTEMNDTSGRWSYARQLYGHVYTAKTGTLSELVNAGDQFN 280 (498) T ss_pred cCCCcc-CchhHHHHHHhccCCccEEEEeeC-CHHHHHHHHHHHhhhhhhhhHHhhcCeEEEEeccCCHHHHHHhhhccC Confidence 455543 469999999999999999999975 44445688888863 22 2455566655531 2347 Q ss_pred cceeEEecCCeEecCCceecHHHHHHHHHHHhc---CcccccccccccCCccc-----ccCChhhHHHHHhCCeEEEE-E Q lcl|NC_019422. 142 EKAIINFATTGIKVGEKSYTTAEYTARLAGILA---GISLSESCTYFILDEVT-----EIEPTENPDEAVEEGKLILI-N 212 (355) Q Consensus 142 ~egIinv~n~~i~~~~~~~~~~~~~a~vAG~~A---g~~~~~S~T~~~~~~~~-----~~~~~~e~~~ai~~G~lvl~-~ 212 (355) +++|.-.... .+..-++++++|..||.+| ..+..+++.-..|+|+. ++++.+|.+.++.+|.-.|+ . T Consensus 281 ~~~it~~~~~----~~~~sp~~~~AAa~aa~~A~~l~~DPArPL~tl~L~Gi~~p~~~~r~~~~ern~LL~~Gist~~V~ 356 (498) T protein:vir:45 281 QQHITLAGYE----KETQTPADELAASRTARAAVFIRNDPARPTQTGELVGMLPAPKGKRFTMTEQQTLLSHGVATAYVE 356 (498) T ss_pred CceEEEEecC----CCCCChHHHHHHHHHHHHHHHhhcccccccCceeecceecCCchhcCChHHHHHHHhCCcceEEEc Confidence 7777654321 1334567899999998888 45666777777787776 56788999999999988885 5 Q ss_pred CCcEEEEecCccccccC-CCCCchhhhhhhHhhHHHHHHHHHHHHhhcccc-ccCCCHH---------HHHHHHHHHHHH Q lcl|NC_019422. 213 NNGIRIARGVNSLITLS-KEDTEDLKKIKIVEAIDMIQDDILQTWNENYVG-KVTNKYD---------NKILFLSAVNNY 281 (355) Q Consensus 213 dg~v~I~~~INSltt~~-~~k~~~f~kirvvr~~D~i~~di~~~~~~~yiG-K~~N~~~---------gr~~~~~~i~~y 281 (355) +|.|.|+|.|+|.++=. ...+..|..|.+++++|.+.+.+|..+..+|=+ |+.++.. 
-...+++++..- T Consensus 357 ~G~V~I~R~ITTY~~n~~G~~D~syLDi~T~~tl~yvr~~~r~~i~~kfpR~KLa~dg~~~~~gq~IvTp~~ir~ell~~ 436 (498) T protein:vir:45 357 SGVLRIQRDVTTYRKNAYGVADNSYLDSETLHTSAYVLRKLKSVITSKYGRHKLASDGTRFGPGQAIVTPAVIKGELLAT 436 (498) T ss_pred CCeEEEEeeeeeeeecCCCCcchhhhhhhhHHHHHHHHHHHHHHhhhhcCCeeecccCcccCCCCcccchHHHHHHHHHH Confidence 67899999999977653 567789999999999999999999999999954 4433311 135678999888 Q ss_pred HHHHHhcccccCCCCceeEeccccchhhhhccccccccccceeeeccCCCCEEEEEEEEEEEeeeeEEEEEEeC Q lcl|NC_019422. 282 FKELQRDEVLDNSQEAYAQIDIEAHKKYLKEKGIDYSEMTEQQIKEANTGSYVFVEGNITITDAMEDLKFKIYM 355 (355) Q Consensus 282 l~~l~~~g~I~~~~~~~v~id~e~q~~~~~~~~~d~~~~~d~~v~~~~~~d~v~~~~~i~~vdamEkiy~tv~v 355 (355) +++|+..|++++++.+.-.+-+|.+ ++..+.|=+.+....++-+--|=..+.. T Consensus 437 y~~le~~givEn~~~~~~~LiVerd---------------------~~dpnRln~~~p~d~vn~L~V~A~~~~f 489 (498) T protein:vir:45 437 YRQLERAGIVENYELFKQYLVVERD---------------------ASVPNRLNTLFPPDYVNQLRVFAVVNQF 489 (498) T ss_pred HHhhhhhccccChhhhcceeEEEEC---------------------CCCCcEEEEEecccccCchhhhhhhhhh Confidence 8999999999998543322222211 1111333333333333332222111111 No 26 >protein:vir:96740 Length: 388 # NCBI annotation: phage tail sheath protein FI-like # Family: family:all:115 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039839;genbank:gi:126010913;genbank:GeneID:5076250 Probab=99.66 E-value=5.5e-16 Score=104.23 Aligned_cols=314 Identities=12% Similarity=0.061 Sum_probs=197.4 Q ss_pred CC-----CCceEEEeeeeeeeeecCCCceeEEEEEecCCccc-----eeEEEeehhhhhhh--hh--hHHHHHHHHhhhc Q lcl|NC_019422. 1 MG-----LPSAIIEFQRRSRTVKFRSRRGVVALILKDSTAIK-----KSYSIDFLTDINET--EF--TKENYDYIRLAFL 66 (355) Q Consensus 1 ~g-----~P~~~i~f~~~a~ta~~~~~rG~v~iil~d~~~~~-----~~~~~~~~~d~~~~--~~--~~~n~~~i~~a~~ 66 (355) |= +|++++.=-..+...+.....+++++|........ ....+.+..+.... .. .......+...+. 
T Consensus 1 m~~~~~~~hGv~v~ev~~g~~~i~~~~tavi~~Vgta~~ad~~~p~~~~~~i~~~~d~~~~~~~~~~~gtl~~al~~~~~ 80 (388) T protein:vir:96 1 MPVIDQFEHNGISIETHEPPPPMGPPGDNVVAWVVTAPDKHADVAFSVPFRVANTADAQYLDSTGNELGTGWHAASETLK 80 (388) T ss_pred CCCCCCCCCceEEEEcCCCcccccccCcceeEEEEecCCCccccccccceeeecchhhhhhhccccccccchhhhHhhhc Confidence 33 57888876667777777777777777765322111 11112222221100 00 0011122333333 Q ss_pred cccceEEEEe---cCCCccchh----------HHHHHHHHHhcc--cceEEEEcCCCh-HHHHHHHHHHHHHHHhcCCeE Q lcl|NC_019422. 67 GKPSKVIVEV---INDSVDSER----------SLDDALKALREN--KFNYLAIPFISE-EVDKTKIVNWIKTARREKEIY 130 (355) Q Consensus 67 g~~~~v~l~~---g~~g~~~~~----------~y~~al~~le~~--~fn~l~~p~~~d-~~~~~~~~~~ik~~r~~g~~~ 130 (355) .+...+.+.. +.++..+.+ .-...|.+++.. ..+.|++|+.++ ..+++.+.++++++| . T Consensus 81 ~~~~~~~vv~v~~g~~~~at~a~iig~~~~~tg~~~gl~al~~~~~~p~il~aPg~s~~~~v~~al~~~~~~~~-----~ 155 (388) T protein:vir:96 81 KTSVPQYFIVVPEGADDAATMANIIGGIDPTTGRRTGIAALTECTERPTLIGAPGFSQNKAVIDALASMAKRLK-----C 155 (388) T ss_pred cCCceEEEEEeccccccccccceeeeecccccchhhHHHHhhhcccceeEEEeeccccchHHHHHHHHHHhhcC-----c Confidence 3333322221 222211111 112456666543 348999998644 467889999998875 2 Q ss_pred EEEecC----------------CCCcCcceeEEecCCeEecCC-----ceecHHHHHHHHHHHhcCcccccccccccC-- Q lcl|NC_019422. 131 KAVLPN----------------ISDANEKAIINFATTGIKVGE-----KSYTTAEYTARLAGILAGISLSESCTYFIL-- 187 (355) Q Consensus 131 ~aVl~~----------------~~~~d~egIinv~n~~i~~~~-----~~~~~~~~~a~vAG~~Ag~~~~~S~T~~~~-- 187 (355) .+++-. ....+++..+.+.+.....+. ..+++ ++.+||+.|.....+|+.|+++ T Consensus 156 ~~i~D~p~~~~~~~~~~~~~~~~~~~~s~~~~~~~P~~~~~d~~~~~~~~~p~---s~~~AG~~a~~D~~~spaN~~i~i 232 (388) T protein:vir:96 156 RAVIDGPSGSTQDAIDLSGLLGGEGTGHDRVYMVDPMPAIYSRKAQGNIYVPP---STIAMGAVAAVKPWESPGNQGVLI 232 (388) T ss_pred EEEEeccCCchhHHHHHHhhhhccCcCcceEEEEeCceeeecccCCceeeech---HHHHHHHHHhhcCcccccCeeEEe Confidence 344321 112455666666665443221 22333 7999999999999999999876 Q ss_pred Cccccc------CChhhHHHHHhCCeEEEEE--CCcEEEEecCccccccCCCCCchhhhhhhHhhHHHHHHHHHHHHhhc Q lcl|NC_019422. 
188 DEVTEI------EPTENPDEAVEEGKLILIN--NNGIRIARGVNSLITLSKEDTEDLKKIKIVEAIDMIQDDILQTWNEN 259 (355) Q Consensus 188 ~~~~~~------~~~~e~~~ai~~G~lvl~~--dg~v~I~~~INSltt~~~~k~~~f~kirvvr~~D~i~~di~~~~~~~ 259 (355) .|+... ++..|.+.+-++|.-++.+ +++.++--+- |+ .|+.|.++|+.|.|.+.|++.+.. T Consensus 233 ~g~~~~~~~~~~~~~~~~~~Ln~~gI~~i~~~~~~G~~~wG~r-T~---------~~~~i~vrR~~~~i~~si~~~~~~- 301 (388) T protein:vir:96 233 QDVARVIDYNILDKSTEGDLLNRNGVSYFARTSMGGFSLIGNR-TV---------TGKFISFVGLEDAIARKLEAASQR- 301 (388) T ss_pred eeecccccccccCChhhHHhhhhcCceEEEEecCCcEEEEccc-cc---------CCcceeehhhHHHHHHHHHHHHHH- Confidence 233211 2456888888999888765 4566664332 32 389999999999999999999876 Q ss_pred cccccCCCHHHHHHHHHHHHHHHHHHHhcccccCCCCceeEeccccchhhhhccccccccccceeeeccCCCCEEEEEEE Q lcl|NC_019422. 260 YVGKVTNKYDNKILFLSAVNNYFKELQRDEVLDNSQEAYAQIDIEAHKKYLKEKGIDYSEMTEQQIKEANTGSYVFVEGN 339 (355) Q Consensus 260 yiGK~~N~~~gr~~~~~~i~~yl~~l~~~g~I~~~~~~~v~id~e~q~~~~~~~~~d~~~~~d~~v~~~~~~d~v~~~~~ 339 (355) |+++ ||+..-|..++..++.||.+|.++|+|..+ .+..|.+ .+....+.+| .+++.+. T Consensus 302 ~v~e-pn~~~~~~~i~~~i~~fL~~l~~~Gal~g~---~~~~d~~--------------~nt~~~i~~G----~~~~~i~ 359 (388) T protein:vir:96 302 AMSK-QLTKSFMEQEIKKINLFMQDLVAAEIIPGG---EVYLHPT--------------LNTVERYKNG----SWYIVID 359 (388) T ss_pred hccC-CCCHHHHHHHHHHHHHHHHHHHhCCceeee---EEEEecC--------------CCCHHHhhCC----EEEEEEE Confidence 9999 899999999999999999999999999875 3444433 2222333332 5899999 Q ss_pred EEEEeeeeEEEEEEeC Q lcl|NC_019422. 340 ITITDAMEDLKFKIYM 355 (355) Q Consensus 340 i~~vdamEkiy~tv~v 355 (355) +.|+-.+|.|.+.++. 
T Consensus 360 ~~p~~pae~I~~~~~~ 375 (388) T protein:vir:96 360 YGRYSPNEHMIFHLNA 375 (388) T ss_pred EEecCCcceEEEEEEE Confidence 9999999999999998 No 27 >protein:vir:4463 Length: 498 # NCBI annotation: Tail Sheath protein # Family: family:all:369 # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700386;genbank:gi:23505458;genbank:GeneID:955665 Probab=99.65 E-value=7.2e-17 Score=109.09 Aligned_cols=299 Identities=12% Similarity=0.102 Sum_probs=186.7 Q ss_pred CCCCc-eEEEeeeeeeeeecCCCceeEEEEEecCCccceeEEEee--hhhhhhhhhhHHHHHHHHhhhccccceE--EEE Q lcl|NC_019422. 1 MGLPS-AIIEFQRRSRTVKFRSRRGVVALILKDSTAIKKSYSIDF--LTDINETEFTKENYDYIRLAFLGKPSKV--IVE 75 (355) Q Consensus 1 ~g~P~-~~i~f~~~a~ta~~~~~rG~v~iil~d~~~~~~~~~~~~--~~d~~~~~~~~~n~~~i~~a~~g~~~~v--~l~ 75 (355) -.||- ....--..+.||++.|.-|+-+-+ ...+.. ..|. .|.++ .++ T Consensus 150 ~~lPVTA~~~~~~vtlTAr~kG~~GN~I~l---------~~~~~~~~~ge~-------------------~p~Glt~tit 201 (498) T protein:vir:44 150 PDLPFTATSEAGVVTLTARHKGLYGNEIPV---------TLNYYGFGGGEV-------------------LPAGVNITVA 201 (498) T ss_pred CCCceEEeeccceEEEEEeccCcccCcceE---------EEeeccCccccc-------------------cccceeEEEE Confidence 45662 222233445566666655532111 000000 0111 13332 244 Q ss_pred ecCCCccchhHHHHHHHHHhcccceEEEEcCCChHHHHHHHHHHHHH----HHh-cCCeEEEEecCC----------CCc Q lcl|NC_019422. 76 VINDSVDSERSLDDALKALRENKFNYLAIPFISEEVDKTKIVNWIKT----ARR-EKEIYKAVLPNI----------SDA 140 (355) Q Consensus 76 ~g~~g~~~~~~y~~al~~le~~~fn~l~~p~~~d~~~~~~~~~~ik~----~r~-~g~~~~aVl~~~----------~~~ 140 (355) ..++|+. +.|++++|.+|....||++++|.. |++--+.+.+|+.. +.- +.+...++.+.. ... 
T Consensus 202 amsgGag-~PDia~alaal~~~~~~~i~~p~~-D~asl~al~~~L~~~sgRw~~~~q~~g~~~~a~~gT~a~l~t~g~~~ 279 (498) T protein:vir:44 202 SGVKGAG-APALNDAVAAMGDEPFDYIGLPFN-DTASVNSMATEMNDSSGRWSYVRQLYGHVYTAKTGTLSELVAAGDQF 279 (498) T ss_pred cccCCcc-CchhHHHHHhhccCCccEEEEeec-CHHHHHHHHHHHhhhhcchHHHhhcCeEEEEeccCCHHHHHHhhhcc Confidence 4444543 469999999999999999999975 44445678888853 222 344555555421 234 Q ss_pred CcceeEEecCCeEecCCceecHHHHHHHHHHHhc---CcccccccccccCCccc-----ccCChhhHHHHHhCCeEEEE- Q lcl|NC_019422. 141 NEKAIINFATTGIKVGEKSYTTAEYTARLAGILA---GISLSESCTYFILDEVT-----EIEPTENPDEAVEEGKLILI- 211 (355) Q Consensus 141 d~egIinv~n~~i~~~~~~~~~~~~~a~vAG~~A---g~~~~~S~T~~~~~~~~-----~~~~~~e~~~ai~~G~lvl~- 211 (355) |+++|...... ++..-++++++|..||.+| ..+..+++.-..|+|+. ++++.+|.+.++.+|.-.++ T Consensus 280 N~~~it~~~~~----~~~~sp~~~~AAa~a~~aA~~l~~DPArPL~tl~L~Gi~~p~~~~r~~~~ern~LL~~Gist~~V 355 (498) T protein:vir:44 280 NLQHITLAGYE----KDTQTPADELAASRTARAAVFIRNDPARPTQTGELVDMLPAPKGKRFTTTEQQTLLSHGVATAYV 355 (498) T ss_pred CCceEEEEecC----CCCCCHHHHHHHHHHHHHHHHhhcccccccCceeecccccCCchhcCChHHHHHHHhcCcceEEE Confidence 77777644321 1233466888888888888 45666777777787776 56788999999999988885 Q ss_pred ECCcEEEEecCcccccc-CCCCCchhhhhhhHhhHHHHHHHHHHHHhhcccc-ccCCCHH---------HHHHHHHHHHH Q lcl|NC_019422. 212 NNNGIRIARGVNSLITL-SKEDTEDLKKIKIVEAIDMIQDDILQTWNENYVG-KVTNKYD---------NKILFLSAVNN 280 (355) Q Consensus 212 ~dg~v~I~~~INSltt~-~~~k~~~f~kirvvr~~D~i~~di~~~~~~~yiG-K~~N~~~---------gr~~~~~~i~~ 280 (355) .+|.|.|+|.|+|.++= ....+..|..|.+++++|.+.+.+|..+..+|=+ |+.++.. -...+++++.. 
T Consensus 356 ~~G~V~I~R~ITTY~~n~~G~~D~syLDi~T~~tl~yvr~~~r~~i~~kfpR~KLa~d~~~~~~gq~IvTp~~ir~eli~ 435 (498) T protein:vir:44 356 ESGVLRIQRDITTYRKNAYGVADNSYLDSETLHTSAYVLRRLKSVITSKYGRHKLANDGTRFGSGQAIVTPAVIRGELGS 435 (498) T ss_pred cCCeEEEEeeeeeeeecCCCCcchhhhhhhhHHHHHHHHHHHHHHhhhhcCCcccccCCcccCCCcccccHHHHHHHHHH Confidence 56789999999997755 3567789999999999999999999999999965 5544411 23467889988 Q ss_pred HHHHHHhcccccCCCCceeEeccccchhhhhccccccccccceeeecc-CCCCEEEEEEEEEEEeeeeEEEEEEeC Q lcl|NC_019422. 281 YFKELQRDEVLDNSQEAYAQIDIEAHKKYLKEKGIDYSEMTEQQIKEA-NTGSYVFVEGNITITDAMEDLKFKIYM 355 (355) Q Consensus 281 yl~~l~~~g~I~~~~~~~v~id~e~q~~~~~~~~~d~~~~~d~~v~~~-~~~d~v~~~~~i~~vdamEkiy~tv~v 355 (355) -+++|+..|++++++.+.-.+- |++. +..+.|=+.+..+.++-+--|=..+.. T Consensus 436 ~y~~le~~givEn~~~~~~~Li----------------------Verd~~dpnRln~~~p~d~vn~L~V~A~~~~f 489 (498) T protein:vir:44 436 TYRQMEREGIVENFDLFQQHLI----------------------VERNANDSNRLDVLFPPDYVNQLRVFAVLNQF 489 (498) T ss_pred HHHhhhhhccccChhhhcceeE----------------------EEECCCCCcEEEEEecccccCchhhhhhhhhh Confidence 8899999999999854332222 2222 122333333333333332222111111 No 28 >protein:vir:489 Length: 498 # NCBI annotation: putative sheath protein # Family: family:all:369 # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543096;swissprot:trembl:q8w623;genbank:gi:18249908;uniprot:Q8W623;genbank:GeneID:929698 Probab=99.65 E-value=8.5e-17 Score=108.68 Aligned_cols=301 Identities=10% Similarity=0.061 Sum_probs=185.4 Q ss_pred CCCCc-eEEEeeeeeeeeecCCCceeEEEEEecCCccceeEEEeehhhhhhhhhhHHHHHHHHhhhccccceE--EEEec Q lcl|NC_019422. 1 MGLPS-AIIEFQRRSRTVKFRSRRGVVALILKDSTAIKKSYSIDFLTDINETEFTKENYDYIRLAFLGKPSKV--IVEVI 77 (355) Q Consensus 1 ~g~P~-~~i~f~~~a~ta~~~~~rG~v~iil~d~~~~~~~~~~~~~~d~~~~~~~~~n~~~i~~a~~g~~~~v--~l~~g 77 (355) ..||- ..+.--..+.||++.|.-|+-+-+ ...+. .+ ......|.++ .++.. 
T Consensus 150 ~~lPVTA~~~~~~VtlTAr~kG~~GN~I~l---------~~~~~--~~---------------~~ge~~p~Glt~~itam 203 (498) T protein:vir:48 150 ITLPFAASSDAGVVTLTARHKGLYGNELPV---------CLNYY--GS---------------GGGEILPAGLQVVTEAG 203 (498) T ss_pred CCcceEEEecCcEEEEEeeeccccccccee---------eeeec--cC---------------cccccccceeeEEEEcc Confidence 66662 222223344455555554421111 00000 00 0001123333 33444 Q ss_pred CCCccchhHHHHHHHHHhcccceEEEEcCCChHHHHHHHHHHHHH----HH-hcCCeEEEEecCC----------CCcCc Q lcl|NC_019422. 78 NDSVDSERSLDDALKALRENKFNYLAIPFISEEVDKTKIVNWIKT----AR-REKEIYKAVLPNI----------SDANE 142 (355) Q Consensus 78 ~~g~~~~~~y~~al~~le~~~fn~l~~p~~~d~~~~~~~~~~ik~----~r-~~g~~~~aVl~~~----------~~~d~ 142 (355) ++|+. +.|++++|.+|....||++++|-. |++--+.+.+|+.. +. -+.+..+++.+.. ...|+ T Consensus 204 sgGag-~PDia~aLaal~~~~~~~I~~p~~-D~asl~al~~~L~~~sgRw~~~~q~~g~~~~a~~gT~~~l~t~g~~~N~ 281 (498) T protein:vir:48 204 TAGSG-APDLTAAVAAMGDEAFDFIGLPFN-DAASINMMMTEMNDSSGRWSYARQLYGHVYTAKLGTLSELVNAGDMHNQ 281 (498) T ss_pred cCCcc-CcchHHHHHhhccCCccEEEEeec-CHHHHHHHHHHHhhhhhhhhHHhhcCeEEEEeccCCHHHHHHhhhccCC Confidence 44443 468999999999999999999975 44445678888863 22 2445556655531 23477 Q ss_pred ceeEEecCCeEecCCceecHHHHHHHHHHHhc---CcccccccccccCCccc-----ccCChhhHHHHHhCCeEEEE-EC Q lcl|NC_019422. 143 KAIINFATTGIKVGEKSYTTAEYTARLAGILA---GISLSESCTYFILDEVT-----EIEPTENPDEAVEEGKLILI-NN 213 (355) Q Consensus 143 egIinv~n~~i~~~~~~~~~~~~~a~vAG~~A---g~~~~~S~T~~~~~~~~-----~~~~~~e~~~ai~~G~lvl~-~d 213 (355) ++|.-.... ++.+.+++++++..||+.| ..+..+++.-..|+|+. 
++++.+|.+.++.+|.-.|+ .+ T Consensus 282 ~~it~~~~~----~~~~~p~~~~AAa~a~~aA~~l~~DPArPLqtl~L~Gi~~p~~~~r~~~~ern~LL~~Gist~~V~~ 357 (498) T protein:vir:48 282 QHITLAGYE----KETQSPVDELVASRLAREAVFIRNDPARPTQTGELVGMLPAPKGKRFIMTEQQTLLSHGVATAYVEG 357 (498) T ss_pred ceEEEEecC----CCCCChHHHHHHHHHHHHHHhhhccccccccceeeeccccCCchhcCChHHHHHHHhcCcceEEEcC Confidence 777644311 2334567788888888887 45566666666777765 46788999999999988885 56 Q ss_pred CcEEEEecCccccccC-CCCCchhhhhhhHhhHHHHHHHHHHHHhhcccc-ccCCCHH---------HHHHHHHHHHHHH Q lcl|NC_019422. 214 NGIRIARGVNSLITLS-KEDTEDLKKIKIVEAIDMIQDDILQTWNENYVG-KVTNKYD---------NKILFLSAVNNYF 282 (355) Q Consensus 214 g~v~I~~~INSltt~~-~~k~~~f~kirvvr~~D~i~~di~~~~~~~yiG-K~~N~~~---------gr~~~~~~i~~yl 282 (355) |.|.|+|.|+|.++=. ...+..|..|.+++++|.+.+++|..+..+|=+ |..++-. -...+++++..-+ T Consensus 358 G~V~I~R~ITTY~~n~~G~~D~syLDi~T~~tl~yvr~~~r~~i~~kfpR~KLa~dg~~~~~gq~IvTp~~ir~eli~~y 437 (498) T protein:vir:48 358 GTLRIQRSVTTYKKNAYGVADNSYLDSETLHTSAYVLRKLKSVITSKYGRHKLANDGTRFGPGQAIVTPAVIKGELLATY 437 (498) T ss_pred CeEEEEeeeeeeeecCCCCcchhhhhhhhHHHHHHHHHHHHHHhhhhcCCceecccCcccCCCCcccchHHHHHHHHHHH Confidence 7899999999987653 567789999999999999999999999999954 4433311 2356789998888 Q ss_pred HHHHhcccccCCCCceeEeccccchhhhhccccccccccceeeeccCC-CCEEEEEEEEEEEeeeeEEEEEEeC Q lcl|NC_019422. 283 KELQRDEVLDNSQEAYAQIDIEAHKKYLKEKGIDYSEMTEQQIKEANT-GSYVFVEGNITITDAMEDLKFKIYM 355 (355) Q Consensus 283 ~~l~~~g~I~~~~~~~v~id~e~q~~~~~~~~~d~~~~~d~~v~~~~~-~d~v~~~~~i~~vdamEkiy~tv~v 355 (355) ++|+..|++++.+.+.-.+-+ ++... .+.|=+.+....++-+--|=..+.. 
T Consensus 438 ~~le~~given~~~~~~~LiV----------------------erd~~dpnRln~~~p~d~vn~L~V~A~~~~f 489 (498) T protein:vir:48 438 RQMERAGIVENYDLFKQYLIV----------------------ERDADNPNRLNTLFPPDYVNQLRVFAVVNQF 489 (498) T ss_pred HhhhhhccccChhhhcceeEE----------------------EECCCCCcEEEEEecccccCchhhhhhhhhh Confidence 999999999998543322222 22111 1333333333333332222111111 No 29 >protein:vir:79181 Length: 390 # NCBI annotation: gp30, phage tail sheath protein # Family: family:all:115 # MgeID: mge:1866 # MgeName: phiE202 # Cross-refs: genbank:acc:YP_001111061;genbank:gi:134288772;genbank:GeneID:4960700 Probab=99.64 E-value=6.4e-17 Score=109.35 Aligned_cols=317 Identities=14% Similarity=0.079 Sum_probs=193.7 Q ss_pred CCCCceEEEeeeeeeeeecCCCceeEEEEEecCCcccee------EEEeehhhhhhhhhhHH--HHHHHHhhhccccceE Q lcl|NC_019422. 1 MGLPSAIIEFQRRSRTVKFRSRRGVVALILKDSTAIKKS------YSIDFLTDINETEFTKE--NYDYIRLAFLGKPSKV 72 (355) Q Consensus 1 ~g~P~~~i~f~~~a~ta~~~~~rG~v~iil~d~~~~~~~------~~~~~~~d~~~~~~~~~--n~~~i~~a~~g~~~~v 72 (355) -=+|+++|.=...+...+.....+++.++..-....... ..+.+..+.... +... -...++..|..+...+ T Consensus 4 ~~~~Gv~v~e~~~~~~~i~~~~tav~~~vg~a~dad~~~~p~n~pv~its~~~~~~~-~g~~~tL~~al~~~~~~~~~~~ 82 (390) T protein:vir:79 4 DYHHGVRVIEINEGGRPIRSVSTAVLGVVCTAADADASAFPLNTPVLLTNVVAALGK-AGKKGTLRRTLDAIGKQTKPLT 82 (390) T ss_pred ccCCCeEEEEcCCCcccccccCCceeEEEEecCCCCccccccccceEeecHHHHHHh-cCCCccchhhhhhhcccccceE Confidence 137899998766777777777667777666433221112 223332222111 1000 0112222222222211 Q ss_pred E---EEecCCCccch---------hHHHHHHHHHhc------ccceEEEEcCCChHHHHHHHHHHHHHHHhcCCeEEEEe Q lcl|NC_019422. 73 I---VEVINDSVDSE---------RSLDDALKALRE------NKFNYLAIPFISEEVDKTKIVNWIKTARREKEIYKAVL 134 (355) Q Consensus 73 ~---l~~g~~g~~~~---------~~y~~al~~le~------~~fn~l~~p~~~d~~~~~~~~~~ik~~r~~g~~~~aVl 134 (355) . +..+.++..+. ......|.+|+. 
...+.++.|+.+...+++.+..+.++++ ..+++ T Consensus 83 ~vv~v~~~~~~~~~~~~~ig~~~~~~~~tgl~al~~~~~~~~~~p~il~ap~~~~~~v~~~l~~~a~~~~-----~~ai~ 157 (390) T protein:vir:79 83 VVVRVAEGKDADETTSNVIGTVTPDGKYTGIKALLAAQGALGVKPRILAAPGLDTQPVAAALAATAQSLR-----AMAYV 157 (390) T ss_pred EEEeeccccccccccceeeecccccccchhhhhhhhhhhhhccccccccCCcccchHHHHHHHHhhhhcc-----eEEEE Confidence 1 22222221111 112233444432 2347788888776667777777776654 34444 Q ss_pred cC------------CCCcCcceeEEecCCeEecC----C-ceecHHHHHHHHHHHhcCcccc----cccccccCCccccc Q lcl|NC_019422. 135 PN------------ISDANEKAIINFATTGIKVG----E-KSYTTAEYTARLAGILAGISLS----ESCTYFILDEVTEI 193 (355) Q Consensus 135 ~~------------~~~~d~egIinv~n~~i~~~----~-~~~~~~~~~a~vAG~~Ag~~~~----~S~T~~~~~~~~~~ 193 (355) -. ....++.....+.+.....+ . ..++ .++.+||+.|.+... +|+.|+++.|+... T Consensus 158 D~p~~~t~~~a~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p---~s~~~Ag~~a~~D~~~g~~~spsN~~i~gi~~~ 234 (390) T protein:vir:79 158 SASGCKTKEEAAAYRRQFGQREIMVIWPDWLGWDDTTNSTAVIP---APAIAAGLRAKIDNDIGWHKTISNVVVNGVSGI 234 (390) T ss_pred EccCCCCHHHHHHHhcCCCCceEEEEcCceeecccccCceeEee---hHHHHHHHHHhhhccCCcEEccCCceeecccee Confidence 21 11234555555555543322 1 1233 478999999988844 59999988776533 Q ss_pred ---CC------hhhHHHHHhCCeEEEEECCcEEEEecCccccccCCCCCchhhhhhhHhhHHHHHHHHHHHHhhcccccc Q lcl|NC_019422. 194 ---EP------TENPDEAVEEGKLILINNNGIRIARGVNSLITLSKEDTEDLKKIKIVEAIDMIQDDILQTWNENYVGKV 264 (355) Q Consensus 194 ---~~------~~e~~~ai~~G~lvl~~dg~v~I~~~INSltt~~~~k~~~f~kirvvr~~D~i~~di~~~~~~~yiGK~ 264 (355) .+ ..|.+.+-.+|...++++++.++--+- |+. .+..|+.|.++|++|.|.+.+++.... 
|+++ T Consensus 235 ~~~~~~~~~~~~~~a~~Ln~~gi~t~~~~~G~~~wG~r-T~~-----~d~~~~~i~vrR~~~~i~~~i~~~~~~-~v~e- 306 (390) T protein:vir:79 235 SADVSWDLQDPATDAGYLNEHEVTTLVNRNGFRFWGER-TCS-----DDPKFAFENYTRTAQVAADSIAEAQMP-VVDG- 306 (390) T ss_pred eeeccccccccchhhhhhhhcCcEEEEcCCCEEEEecc-ccC-----CCcccceeeehhhHHHHHHHHHHHHHH-hccC- Confidence 22 234455667888888888777765443 432 235699999999999999999999875 9999 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHhcccccCCCCceeEeccccchhhhhccccccccccceeeeccCCCCEEEEEEEEEEEe Q lcl|NC_019422. 265 TNKYDNKILFLSAVNNYFKELQRDEVLDNSQEAYAQIDIEAHKKYLKEKGIDYSEMTEQQIKEANTGSYVFVEGNITITD 344 (355) Q Consensus 265 ~N~~~gr~~~~~~i~~yl~~l~~~g~I~~~~~~~v~id~e~q~~~~~~~~~d~~~~~d~~v~~~~~~d~v~~~~~i~~vd 344 (355) +|+..-|..++..++.||.+|.++|+|..| .+..|.+.. ....+. .-.+++.+.+.|+- T Consensus 307 ~~~~~~~~~i~~~i~~~L~~l~~~gal~g~---~v~~d~~~n--------------t~~~i~----~G~~~~~i~~~p~~ 365 (390) T protein:vir:79 307 PLNPSLARDIVESINGWFRQQVANGYLIGG---SAWIDPEPN--------------TADILA----SGKAYIDYDYTPVP 365 (390) T ss_pred CCCHHHHHHHHHHHHHHHHHHHhCCceeee---EEEEecCCC--------------CHHHhh----CCEEEEEEEEEecC Confidence 899999999999999999999999999986 455554321 111122 24679999999999 Q ss_pred eeeEEEEEEeC Q lcl|NC_019422. 345 AMEDLKFKIYM 355 (355) Q Consensus 345 amEkiy~tv~v 355 (355) .+|.|.+.++. T Consensus 366 p~e~i~~~~~~ 376 (390) T protein:vir:79 366 PLENLVLRQRI 376 (390) T ss_pred CcceEEEEEEE Confidence 99999988888 No 30 >protein:vir:100539 Length: 663 # NCBI annotation: gp18 tail sheath monomer # Family: family:all:661 # MgeID: mge:1488 # MgeName: 25 # Cross-refs: genbank:acc:YP_656380;genbank:gi:109290131;genbank:GeneID:4156517 Probab=99.64 E-value=9.2e-16 Score=103.01 Aligned_cols=318 Identities=11% Similarity=0.058 Sum_probs=177.4 Q ss_pred CCCCceEEEeeeee-----ee---------------eecCCCce--------------e-EEEEEecCCccceeEEEeeh Q lcl|NC_019422. 
1 MGLPSAIIEFQRRS-----RT---------------VKFRSRRG--------------V-VALILKDSTAIKKSYSIDFL 45 (355) Q Consensus 1 ~g~P~~~i~f~~~a-----~t---------------a~~~~~rG--------------~-v~iil~d~~~~~~~~~~~~~ 45 (355) .++|.+-....+.. +. +...+.++ . ..++...+....+.+.+... T Consensus 217 ~~~~~~~a~~~g~~G~~i~v~~~~~~~~~~~~~~~v~~~~g~~~~~~~~~~~~g~~~~~~~~~~~~~~g~~~e~~~ls~~ 296 (663) T protein:vir:10 217 YGMPLISAVYPGEIGSTVEVEVISKTAFQSGAAQPIYPFGGTRASNARSVIQYGPMTDDQFAIIVRRDGIVVESTVLSTR 296 (663) T ss_pred cccceeeeecccccCcceeEeecccccccccceeeecccCcccccccccccccccccchhhcccccCCCcccceeeeecc Confidence 33333322211100 00 01111111 0 11111111111122221111 Q ss_pred hhhhhhhhhHHHHHHHHhhh------------ccccc----eEEEEecCCCc--cchhHHHHHHHHHhc---ccce-EEE Q lcl|NC_019422. 46 TDINETEFTKENYDYIRLAF------------LGKPS----KVIVEVINDSV--DSERSLDDALKALRE---NKFN-YLA 103 (355) Q Consensus 46 ~d~~~~~~~~~n~~~i~~a~------------~g~~~----~v~l~~g~~g~--~~~~~y~~al~~le~---~~fn-~l~ 103 (355) .+.. .......++...+ .+.|+ .+.+.+|.++. .+..+|..+++.|.. .+.+ .++ T Consensus 297 ~~~~---~~~~~~~~~~~~~~~~~s~~v~~~~~~~~~~~~~~~~l~gg~~~~~~~~~~d~~~~~~~l~~~~~~~~~~~i~ 373 (663) T protein:vir:10 297 RGDR---DVYGNNIFMDDYFRNGSSNFIYASSVNWPAGFTGIIQLGGGASANNAVGSDELIAGWDLFADREALHVNLMIA 373 (663) T ss_pred cccc---ccchhhhhhhhhhcCcccceeEeeccccCcccceeEEecccccCcccchhhhhhhHHhhhccccccCceEEEe Confidence 1110 0000111222221 11121 12456666543 355678877777643 3444 444 Q ss_pred EcCCCh-----HHHHHHHHHHHHHHHhcCCeEEEEecCC---------------------------------CCcCccee Q lcl|NC_019422. 104 IPFISE-----EVDKTKIVNWIKTARREKEIYKAVLPNI---------------------------------SDANEKAI 145 (355) Q Consensus 104 ~p~~~d-----~~~~~~~~~~ik~~r~~g~~~~aVl~~~---------------------------------~~~d~egI 145 (355) .|..++ .++++.+.++++++| .+.+++... ..++++.. 
T Consensus 374 ~~~~~~~~~~~~~v~~~l~~~~~~~~----~~~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~ 449 (663) T protein:vir:10 374 GACKSDGVAVASTVQKHVVALADDRQ----DCVAFVNPPSELLVGVPTTQAVKNIVEWRNGVTTGGEVVDNNMNISSTYA 449 (663) T ss_pred ecCCCCchhhHHHHHHHHHHHHHhhC----CEEEEEecCcccccccchhhhHHHHHHHhhhccccchhhhhhcccCcceE Confidence 444333 234555556665554 355555321 01344555 Q ss_pred EEecCCeEecC---C--ceecHHHHHHHHHHHhcCccccc----ccccccCCcccc------cCChhhHHHHHhCCeEEE Q lcl|NC_019422. 146 INFATTGIKVG---E--KSYTTAEYTARLAGILAGISLSE----SCTYFILDEVTE------IEPTENPDEAVEEGKLIL 210 (355) Q Consensus 146 inv~n~~i~~~---~--~~~~~~~~~a~vAG~~Ag~~~~~----S~T~~~~~~~~~------~~~~~e~~~ai~~G~lvl 210 (355) +.+.+-....+ + ..+++ ++.+||++|.++.++ |+.++++.++.. .+++.|.+.+..+|.-++ T Consensus 450 ~l~~p~~~~~d~~~~~~~~~p~---s~~vAGl~Ar~D~~~g~~~span~~~~~i~g~~~~~~~~~~~~~~~Ln~~gin~i 526 (663) T protein:vir:10 450 FISGNYKYQYDKYNDINRWVPL---SADIAGLCAYTDQVGHPWMSPAGYRRGQLRNTIKLAIEPKQSLRDTMYQVSINPV 526 (663) T ss_pred EEEecceeEecccCCceEEech---HHHHHHHHHHhhccCCcEEccCCeeecceeccccceeecCchhHHHHHhCCCcEE Confidence 54544332221 2 12343 789999999887554 555554443322 246779999999997665 Q ss_pred E--ECCcEEEEecCccccccCCCCCchhhhhhhHhhHHHHHHHHHHHHhhccccccCCCHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_019422. 211 I--NNNGIRIARGVNSLITLSKEDTEDLKKIKIVEAIDMIQDDILQTWNENYVGKVTNKYDNKILFLSAVNNYFKELQRD 288 (355) Q Consensus 211 ~--~dg~v~I~~~INSltt~~~~k~~~f~kirvvr~~D~i~~di~~~~~~~yiGK~~N~~~gr~~~~~~i~~yl~~l~~~ 288 (355) . .+++=.+.-|-.|+ ...+..|+.|.++|++|+|.+.|++.... 
|+++ +|+..-|..++..|+.||.+|.++ T Consensus 527 ~~~~~~~G~~~wG~rT~----s~~~s~~~~i~vrR~~~~i~~si~~~~~~-~v~e-pn~~~l~~~i~~~i~~~L~~l~~~ 600 (663) T protein:vir:10 527 TGFAGGDGFVLFGDKMA----TQVPSPFDRINVRRLFNMLKKNIGDTSKY-ELFE-NNDAFTRQSFRMEVSQYLDNIRSL 600 (663) T ss_pred EEeeCCCcEEEEccccc----CCCCcccceEehhhHHHHHHHHHHHHHHH-hccC-CCCHHHHHHHHHHHHHHHHHHHhC Confidence 3 34444455666665 22335799999999999999999999875 9999 799999999999999999999999 Q ss_pred ccccCCCCceeEeccccchhhhhccccccccccceeeeccCCCCEEEEEEEEEEEeeeeEEEEEEeC Q lcl|NC_019422. 289 EVLDNSQEAYAQIDIEAHKKYLKEKGIDYSEMTEQQIKEANTGSYVFVEGNITITDAMEDLKFKIYM 355 (355) Q Consensus 289 g~I~~~~~~~v~id~e~q~~~~~~~~~d~~~~~d~~v~~~~~~d~v~~~~~i~~vdamEkiy~tv~v 355 (355) |+|..| .+.+|.+.. ....+. .-.+++.+.++|+-.+|.|.+++.. T Consensus 601 gal~gf---~V~~d~~~n--------------t~~~i~----~G~~~~~i~~~p~~pae~I~~~~~~ 646 (663) T protein:vir:10 601 GGVYDF---RVVCDTTNN--------------TPQVID----SNEFVATIYIKAPRSINYITLNFVA 646 (663) T ss_pred Cceeee---EEEEcCCCC--------------CHHHhh----CCeEEEEEEEEecCCcceEEEEEEE Confidence 999986 466664421 111122 2467999999999999999999888 No 31 >protein:vir:98553 Length: 395 # NCBI annotation: gp21 # Family: family:all:115 # MgeID: mge:1533 # MgeName: PSP3 # Cross-refs: genbank:acc:NP_958076;genbank:gi:41057373;genbank:GeneID:2744224 Probab=99.63 E-value=9.9e-17 Score=108.33 Aligned_cols=317 Identities=16% Similarity=0.141 Sum_probs=194.6 Q ss_pred CC--CCceEEEeeeeeeeeecCCCceeEEEEEecCCcc------ceeEEEeehhhhhhhhhhH-HHHHHHHhhhc-cccc Q lcl|NC_019422. 1 MG--LPSAIIEFQRRSRTVKFRSRRGVVALILKDSTAI------KKSYSIDFLTDINETEFTK-ENYDYIRLAFL-GKPS 70 (355) Q Consensus 1 ~g--~P~~~i~f~~~a~ta~~~~~rG~v~iil~d~~~~------~~~~~~~~~~d~~~~~~~~-~n~~~i~~a~~-g~~~ 70 (355) |. +|+++++=..-+...+.....++++++....... .....+.+.++.....-.. .-...+...+. ++.. 
T Consensus 1 m~~~~~GV~v~e~~~g~~~v~~v~tav~~~vgta~~~~~~~~p~~~pv~v~s~~~~~~~~g~~~tl~~al~~~~~~~~~~ 80 (395) T protein:vir:98 1 MSDFHHGTQVIEINDGTRVISTVATAVVGMVCTASDADATLFPLNEPVLITNVQSAIAKAGKKGTLAASLQAIADQSKPV 80 (395) T ss_pred CCCCCCCeEEEEcCCCcccccccCcceEEEEeeccCCCccccccccceEeechHHhHhhcccccchhhHHHHHhhccCce Confidence 54 4888888766666666666677777665432111 1122333333332111000 00011111221 1111 Q ss_pred eEEEEecC------------------CCccchhHHHHHHHHHhc------ccceEEEEcCCChHHHHHHHHHHHHHHHhc Q lcl|NC_019422. 71 KVIVEVIN------------------DSVDSERSLDDALKALRE------NKFNYLAIPFISEEVDKTKIVNWIKTARRE 126 (355) Q Consensus 71 ~v~l~~g~------------------~g~~~~~~y~~al~~le~------~~fn~l~~p~~~d~~~~~~~~~~ik~~r~~ 126 (355) -+++..+. ++......+ ..|.+|+. .....+.+|+..+..+.+.+.++.+++|. T Consensus 81 ~~vv~~~~~~~~~~~~~~a~~~~~i~g~~~~~~~~-Tgl~al~~~~~~~~~~p~il~ap~~~~~~v~~al~~~~~~~~~- 158 (395) T protein:vir:98 81 TVVVRVEDGTGDDEEAALAQTVSNIIGGTDENGKY-TGIKALLTAQAVTGVKPRILGVPGLDTKEVAVALASAAIKLRA- 158 (395) T ss_pred EEEeeccccccccccccccccccccccccccccch-hHHHHHhhhhhhhccchhhcccccccccHHHHHHHHHhhhcCc- Confidence 11111100 000001112 23444432 24567778887777788888888887652 Q ss_pred CCeEEEEecC------------CCCcCcceeEEecCCeEecCC-----ceecHHHHHHHHHHHhcCcc----cccccccc Q lcl|NC_019422. 127 KEIYKAVLPN------------ISDANEKAIINFATTGIKVGE-----KSYTTAEYTARLAGILAGIS----LSESCTYF 185 (355) Q Consensus 127 g~~~~aVl~~------------~~~~d~egIinv~n~~i~~~~-----~~~~~~~~~a~vAG~~Ag~~----~~~S~T~~ 185 (355) .+++-. ....+++..+.+.+.....+. ..++ .++++||++|... +-.|+.|+ T Consensus 159 ----~~~~d~p~~~t~~~a~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p---~s~~~AG~~a~~d~~~g~~~spaN~ 231 (395) T protein:vir:98 159 ----FAYVSAWGCKTISEAMEYRKNFSQRELMVIWPDFLAWDTVKNTTATAY---ATARALGLRAYIDQTVGWHKTLSNV 231 (395) T ss_pred ----EEEEEcCCCCCHHHHHHHHhccCCceEEEEecceeEecccCCceeeec---hHHHHHHHHHHhhcccCcEeccCCc Confidence 333321 112345555556555433221 1223 4799999999775 55688888 Q ss_pred cCCccccc---------CChhhHHHHHhCCeEEEEECCcEEEEecCccccccCCCCCchhhhhhhHhhHHHHHHHHHHHH Q lcl|NC_019422. 
186 ILDEVTEI---------EPTENPDEAVEEGKLILINNNGIRIARGVNSLITLSKEDTEDLKKIKIVEAIDMIQDDILQTW 256 (355) Q Consensus 186 ~~~~~~~~---------~~~~e~~~ai~~G~lvl~~dg~v~I~~~INSltt~~~~k~~~f~kirvvr~~D~i~~di~~~~ 256 (355) ++.|+... ++..|.+.+-.+|...+++++++++- |-.|+. .+..|+.|.++|++|.|.+.+++.. T Consensus 232 ~i~gi~~~~~~~~~~~~~~~~~~~~Ln~~gI~~~~~~~G~~~w-G~rT~s-----~d~~~~~i~~rR~~~~i~~~i~~~~ 305 (395) T protein:vir:98 232 GVQGVTGISASVFWDLQASGTDADLLNEAGVTTLVRKDGFRFW-GNRTCS-----DDPLFLFENYTRTAQVLADTMAEAH 305 (395) T ss_pred eeecccccceecccccCCCcchHHhhhhcCcEEEEcCCCEEEE-cccccC-----CCcccceeehhhHHHHHHHHHHHHH Confidence 87665432 24568888889999999887776654 555552 1457999999999999999999998 Q ss_pred hhccccccCCCHHHHHHHHHHHHHHHHHHHhcccccCCCCceeEeccccchhhhhccccccccccceeeeccCCCCEEEE Q lcl|NC_019422. 257 NENYVGKVTNKYDNKILFLSAVNNYFKELQRDEVLDNSQEAYAQIDIEAHKKYLKEKGIDYSEMTEQQIKEANTGSYVFV 336 (355) Q Consensus 257 ~~~yiGK~~N~~~gr~~~~~~i~~yl~~l~~~g~I~~~~~~~v~id~e~q~~~~~~~~~d~~~~~d~~v~~~~~~d~v~~ 336 (355) .. |+++ +|+..-|..++..++.||.+|.++|+|..+ .+..|.+. +....+.+ -.+++ T Consensus 306 ~~-~v~e-~~~~~~~~~i~~~i~~~L~~l~~~g~l~g~---~v~~d~~~--------------nt~~~i~~----G~~~~ 362 (395) T protein:vir:98 306 MW-AVDK-PITATLIRDIVDGINAKFRELKSNGYIVEG---KCWFDEES--------------NDKETLKA----GKLYI 362 (395) T ss_pred HH-hccC-CCCHHHHHHHHHHHHHHHHHHHhCCceece---EEEEecCC--------------CCHHHhhC----CeEEE Confidence 75 9999 899999999999999999999999999876 34444432 22222222 35899 Q ss_pred EEEEEEEeeeeEEEEEEeC Q lcl|NC_019422. 337 EGNITITDAMEDLKFKIYM 355 (355) Q Consensus 337 ~~~i~~vdamEkiy~tv~v 355 (355) .+.+.|+-.+|.|.++++. 
T Consensus 363 ~i~~~p~~p~e~I~~~~~~ 381 (395) T protein:vir:98 363 DYDYTPVPPLESLTLRQRI 381 (395) T ss_pred EEEEEecCCcceEEEEEEE Confidence 9999999999999999988 No 32 >protein:vir:103993 Length: 390 # NCBI annotation: phage tail sheath protein # Family: family:all:115 # MgeID: mge:1665 # MgeName: phi52237 # Cross-refs: genbank:acc:YP_293729;genbank:gi:72537699;genbank:GeneID:3608123 Probab=99.63 E-value=1.1e-16 Score=108.12 Aligned_cols=317 Identities=14% Similarity=0.100 Sum_probs=190.0 Q ss_pred CC---CCceEEEeeeeeeeeecCCCceeEEEEEecCCccce------eEEEeehhhhhhhhhhHH--HHHHHHhhhcccc Q lcl|NC_019422. 1 MG---LPSAIIEFQRRSRTVKFRSRRGVVALILKDSTAIKK------SYSIDFLTDINETEFTKE--NYDYIRLAFLGKP 69 (355) Q Consensus 1 ~g---~P~~~i~f~~~a~ta~~~~~rG~v~iil~d~~~~~~------~~~~~~~~d~~~~~~~~~--n~~~i~~a~~g~~ 69 (355) |. +|+++|+=..-+...+......+..++..-...... ...+.+..+.... +... -...+...+..+. T Consensus 1 M~~~~~~Gv~v~e~~~g~~~i~~~~tav~g~vg~a~~ad~~~~pln~pv~i~s~~~~~~~-~g~~gtL~~al~~~~~~gg 79 (390) T protein:vir:10 1 MPQDYHHGVRVIEINEGGRPIRSVSTAVLGVVCTAADADASAFPLNTPVLLTNVVAALGK-AGKKGTLRRTLDAIGKQTK 79 (390) T ss_pred CcccccCCeEEEEcCCCcccccccCcceeEEEEcccCcCccccccccceEeccHHHHHhh-cCCCceehhhhhhhccccC Confidence 33 568887766665555555555666665532111111 1223332222111 1100 0112222222222 Q ss_pred ceEE-EE--ecCCCccchhHH---------HHHHHHHhc------ccceEEEEcCCChHHHHHHHHHHHHHHHhcCCeEE Q lcl|NC_019422. 70 SKVI-VE--VINDSVDSERSL---------DDALKALRE------NKFNYLAIPFISEEVDKTKIVNWIKTARREKEIYK 131 (355) Q Consensus 70 ~~v~-l~--~g~~g~~~~~~y---------~~al~~le~------~~fn~l~~p~~~d~~~~~~~~~~ik~~r~~g~~~~ 131 (355) ...+ +. .+.+...+..++ ...+.+|+. ...+.+.+|+.+...+.+.+.+..+++| .. 
T Consensus 80 ~~~~vv~v~~~~~~~~~~~~~ig~~~~~~~~tg~~al~~~~~~~~~~p~il~ap~~~~~~v~~~l~~~a~~~~-----~~ 154 (390) T protein:vir:10 80 PLTVVVRVAEGKDADETTSNVIGTVTPDGKYTGIKALLAAQGALGVKPRILAAPGLDTQPVAAALAATAQSLR-----AM 154 (390) T ss_pred ceEEEEEecccccccccccccccccccccccchhhhhhhhhhhhcceehhhcccccchHHHHHHHHHhhcccc-----eE Confidence 1122 11 122222111111 122333322 2346677787766667777777776654 34 Q ss_pred EEecCC------------CCcCcceeEEecCCeEecCC-----ceecHHHHHHHHHHHhcCcccc----cccccccCCcc Q lcl|NC_019422. 132 AVLPNI------------SDANEKAIINFATTGIKVGE-----KSYTTAEYTARLAGILAGISLS----ESCTYFILDEV 190 (355) Q Consensus 132 aVl~~~------------~~~d~egIinv~n~~i~~~~-----~~~~~~~~~a~vAG~~Ag~~~~----~S~T~~~~~~~ 190 (355) +++-.. ...+++..+.+.+.....+. ..+++ ++.+||++|.+... +|+.|+++.|+ T Consensus 155 aivD~p~~~t~~~a~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~---s~~~Agl~a~~D~~~g~~~spaN~~l~gi 231 (390) T protein:vir:10 155 AYVSASGCKTKEEAAAYRKQFGQREIMVIWPDWLGWDDTTNSTAVIPA---PAIAAGLRAKIDNDIGWHKTISNVVVNGV 231 (390) T ss_pred EEEecCCCCCHHHHHHHhhccCCceEEEEcCceEeecccCCcccccch---HHHHHHHHHHhhcCCCcEECcCCceeece Confidence 444321 12345566666665443221 23444 79999999988754 58899988877 Q ss_pred ccc---CCh------hhHHHHHhCCeEEEEECCcEEEEecCccccccCCCCCchhhhhhhHhhHHHHHHHHHHHHhhccc Q lcl|NC_019422. 191 TEI---EPT------ENPDEAVEEGKLILINNNGIRIARGVNSLITLSKEDTEDLKKIKIVEAIDMIQDDILQTWNENYV 261 (355) Q Consensus 191 ~~~---~~~------~e~~~ai~~G~lvl~~dg~v~I~~~INSltt~~~~k~~~f~kirvvr~~D~i~~di~~~~~~~yi 261 (355) ... .+. .|...+-.+|...++++++.++- |-.|+. .+..|+.|.++|+.|.|.+.+++.+.. 
|+ T Consensus 232 ~~~~~~~~~~~~~~~~~~~~ln~~gi~t~~~~~G~~~w-G~rT~s-----~d~~~~~i~~rR~~~~i~~~i~~~~~~-~v 304 (390) T protein:vir:10 232 SGISADVSWDLQDPATDAGYLNEHEVTTLVNRNGFRFW-GERTCS-----DDPKFAFENYTRTAQVAGDSIAEAQMP-VV 304 (390) T ss_pred eecceecccccccccchhhhhhhcCcEEEEcCCCEEEE-cccccC-----CCcccceeehhhHHHHHHHHHHHHHHH-hc Confidence 643 222 23455667788888887777664 555542 235699999999999999999999876 99 Q ss_pred cccCCCHHHHHHHHHHHHHHHHHHHhcccccCCCCceeEeccccchhhhhccccccccccceeeeccCCCCEEEEEEEEE Q lcl|NC_019422. 262 GKVTNKYDNKILFLSAVNNYFKELQRDEVLDNSQEAYAQIDIEAHKKYLKEKGIDYSEMTEQQIKEANTGSYVFVEGNIT 341 (355) Q Consensus 262 GK~~N~~~gr~~~~~~i~~yl~~l~~~g~I~~~~~~~v~id~e~q~~~~~~~~~d~~~~~d~~v~~~~~~d~v~~~~~i~ 341 (355) ++ +|+..-|..++..++.||.+|.++|+|..+ .+..|.+. +....+. .-.+++.+.+. T Consensus 305 ~e-~n~~~~~~~i~~~i~~~L~~l~~~g~l~g~---~v~~d~~~--------------nt~~~i~----~G~~~~~v~~~ 362 (390) T protein:vir:10 305 DG-PLNPSLARDIVESINGWFRQQVANGYLIGG---SAWIDPEP--------------NTADILA----SGKAYIDYDYT 362 (390) T ss_pred cC-CCCHHHHHHHHHHHHHHHHHHHhCCceeee---EEEEccCC--------------CCHHHhh----CCeEEEEEEEE Confidence 99 899999999999999999999999999876 35555432 2212222 24689999999 Q ss_pred EEeeeeEEEEEEeC Q lcl|NC_019422. 342 ITDAMEDLKFKIYM 355 (355) Q Consensus 342 ~vdamEkiy~tv~v 355 (355) |+-.+|.|.++++. T Consensus 363 p~~pae~I~~~~~~ 376 (390) T protein:vir:10 363 PVPPLENLVLRQRI 376 (390) T ss_pred ecCCcceEEEEEEE Confidence 99999999888888 No 33 >protein:vir:78206 Length: 390 # NCBI annotation: gp33, phage tail sheath protein # Family: family:all:115 # MgeID: mge:1848 # MgeName: phiE12-2 # Cross-refs: genbank:acc:YP_001111183;genbank:gi:134288694;genbank:GeneID:4960666 Probab=99.63 E-value=1.1e-16 Score=108.12 Aligned_cols=317 Identities=14% Similarity=0.100 Sum_probs=190.0 Q ss_pred CC---CCceEEEeeeeeeeeecCCCceeEEEEEecCCccce------eEEEeehhhhhhhhhhHH--HHHHHHhhhcccc Q lcl|NC_019422. 
1 MG---LPSAIIEFQRRSRTVKFRSRRGVVALILKDSTAIKK------SYSIDFLTDINETEFTKE--NYDYIRLAFLGKP 69 (355) Q Consensus 1 ~g---~P~~~i~f~~~a~ta~~~~~rG~v~iil~d~~~~~~------~~~~~~~~d~~~~~~~~~--n~~~i~~a~~g~~ 69 (355) |. +|+++|+=..-+...+......+..++..-...... ...+.+..+.... +... -...+...+..+. T Consensus 1 M~~~~~~Gv~v~e~~~g~~~i~~~~tav~g~vg~a~~ad~~~~pln~pv~i~s~~~~~~~-~g~~gtL~~al~~~~~~gg 79 (390) T protein:vir:78 1 MPQDYHHGVRVIEINEGGRPIRSVSTAVLGVVCTAADADASAFPLNTPVLLTNVVAALGK-AGKKGTLRRTLDAIGKQTK 79 (390) T ss_pred CcccccCCeEEEEcCCCcccccccCcceeEEEEcccCcCccccccccceEeccHHHHHhh-cCCCceehhhhhhhccccC Confidence 33 568887766665555555555666665532111111 1223332222111 1100 0112222222222 Q ss_pred ceEE-EE--ecCCCccchhHH---------HHHHHHHhc------ccceEEEEcCCChHHHHHHHHHHHHHHHhcCCeEE Q lcl|NC_019422. 70 SKVI-VE--VINDSVDSERSL---------DDALKALRE------NKFNYLAIPFISEEVDKTKIVNWIKTARREKEIYK 131 (355) Q Consensus 70 ~~v~-l~--~g~~g~~~~~~y---------~~al~~le~------~~fn~l~~p~~~d~~~~~~~~~~ik~~r~~g~~~~ 131 (355) ...+ +. .+.+...+..++ ...+.+|+. ...+.+.+|+.+...+.+.+.+..+++| .. T Consensus 80 ~~~~vv~v~~~~~~~~~~~~~ig~~~~~~~~tg~~al~~~~~~~~~~p~il~ap~~~~~~v~~~l~~~a~~~~-----~~ 154 (390) T protein:vir:78 80 PLTVVVRVAEGKDADETTSNVIGTVTPDGKYTGIKALLAAQGALGVKPRILAAPGLDTQPVAAALAATAQSLR-----AM 154 (390) T ss_pred ceEEEEEecccccccccccccccccccccccchhhhhhhhhhhhcceehhhcccccchHHHHHHHHHhhcccc-----eE Confidence 1122 11 122222111111 122333322 2346677787766667777777776654 34 Q ss_pred EEecCC------------CCcCcceeEEecCCeEecCC-----ceecHHHHHHHHHHHhcCcccc----cccccccCCcc Q lcl|NC_019422. 132 AVLPNI------------SDANEKAIINFATTGIKVGE-----KSYTTAEYTARLAGILAGISLS----ESCTYFILDEV 190 (355) Q Consensus 132 aVl~~~------------~~~d~egIinv~n~~i~~~~-----~~~~~~~~~a~vAG~~Ag~~~~----~S~T~~~~~~~ 190 (355) +++-.. ...+++..+.+.+.....+. ..+++ ++.+||++|.+... 
+|+.|+++.|+ T Consensus 155 aivD~p~~~t~~~a~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~---s~~~Agl~a~~D~~~g~~~spaN~~l~gi 231 (390) T protein:vir:78 155 AYVSASGCKTKEEAAAYRKQFGQREIMVIWPDWLGWDDTTNSTAVIPA---PAIAAGLRAKIDNDIGWHKTISNVVVNGV 231 (390) T ss_pred EEEecCCCCCHHHHHHHhhccCCceEEEEcCceEeecccCCcccccch---HHHHHHHHHHhhcCCCcEECcCCceeece Confidence 444321 12345566666665443221 23444 79999999988754 58899988877 Q ss_pred ccc---CCh------hhHHHHHhCCeEEEEECCcEEEEecCccccccCCCCCchhhhhhhHhhHHHHHHHHHHHHhhccc Q lcl|NC_019422. 191 TEI---EPT------ENPDEAVEEGKLILINNNGIRIARGVNSLITLSKEDTEDLKKIKIVEAIDMIQDDILQTWNENYV 261 (355) Q Consensus 191 ~~~---~~~------~e~~~ai~~G~lvl~~dg~v~I~~~INSltt~~~~k~~~f~kirvvr~~D~i~~di~~~~~~~yi 261 (355) ... .+. .|...+-.+|...++++++.++- |-.|+. .+..|+.|.++|+.|.|.+.+++.+.. |+ T Consensus 232 ~~~~~~~~~~~~~~~~~~~~ln~~gi~t~~~~~G~~~w-G~rT~s-----~d~~~~~i~~rR~~~~i~~~i~~~~~~-~v 304 (390) T protein:vir:78 232 SGISADVSWDLQDPATDAGYLNEHEVTTLVNRNGFRFW-GERTCS-----DDPKFAFENYTRTAQVAGDSIAEAQMP-VV 304 (390) T ss_pred eecceecccccccccchhhhhhhcCcEEEEcCCCEEEE-cccccC-----CCcccceeehhhHHHHHHHHHHHHHHH-hc Confidence 643 222 23455667788888887777664 555542 235699999999999999999999876 99 Q ss_pred cccCCCHHHHHHHHHHHHHHHHHHHhcccccCCCCceeEeccccchhhhhccccccccccceeeeccCCCCEEEEEEEEE Q lcl|NC_019422. 262 GKVTNKYDNKILFLSAVNNYFKELQRDEVLDNSQEAYAQIDIEAHKKYLKEKGIDYSEMTEQQIKEANTGSYVFVEGNIT 341 (355) Q Consensus 262 GK~~N~~~gr~~~~~~i~~yl~~l~~~g~I~~~~~~~v~id~e~q~~~~~~~~~d~~~~~d~~v~~~~~~d~v~~~~~i~ 341 (355) ++ +|+..-|..++..++.||.+|.++|+|..+ .+..|.+. +....+. .-.+++.+.+. T Consensus 305 ~e-~n~~~~~~~i~~~i~~~L~~l~~~g~l~g~---~v~~d~~~--------------nt~~~i~----~G~~~~~v~~~ 362 (390) T protein:vir:78 305 DG-PLNPSLARDIVESINGWFRQQVANGYLIGG---SAWIDPEP--------------NTADILA----SGKAYIDYDYT 362 (390) T ss_pred cC-CCCHHHHHHHHHHHHHHHHHHHhCCceeee---EEEEccCC--------------CCHHHhh----CCeEEEEEEEE Confidence 99 899999999999999999999999999876 35555432 2212222 24689999999 Q ss_pred EEeeeeEEEEEEeC Q lcl|NC_019422. 
342 ITDAMEDLKFKIYM 355 (355) Q Consensus 342 ~vdamEkiy~tv~v 355 (355) |+-.+|.|.++++. T Consensus 363 p~~pae~I~~~~~~ 376 (390) T protein:vir:78 363 PVPPLENLVLRQRI 376 (390) T ss_pred ecCCcceEEEEEEE Confidence 99999999888888 No 34 >protein:vir:1172 Length: 391 # NCBI annotation: hypothetical protein # Family: family:all:115 # MgeID: mge:24 # MgeName: phi CTX # Cross-refs: genbank:acc:NP_490621;genbank:gi:17313241;genbank:GeneID:927300 Probab=99.63 E-value=9.7e-17 Score=108.38 Aligned_cols=318 Identities=14% Similarity=0.093 Sum_probs=193.0 Q ss_pred CC----CCceEEEeeeeeeeeecCCCceeEEEEEecCCccc------eeEEEeehhhhhhhhhh-HHHHHHHHhhhcccc Q lcl|NC_019422. 1 MG----LPSAIIEFQRRSRTVKFRSRRGVVALILKDSTAIK------KSYSIDFLTDINETEFT-KENYDYIRLAFLGKP 69 (355) Q Consensus 1 ~g----~P~~~i~f~~~a~ta~~~~~rG~v~iil~d~~~~~------~~~~~~~~~d~~~~~~~-~~n~~~i~~a~~g~~ 69 (355) |- .|+++|+=..-....+....-++++++.+-..... ....+.+..+.....-. ..-...+...|..+. T Consensus 1 M~~~~~~~GV~v~e~~~~~~~i~~v~tavig~vg~a~~a~~~~~~~~~p~~v~s~~~~~~~~g~~~tl~~al~~~~~~~g 80 (391) T protein:vir:11 1 MAADQYHHGVRVQEINDGTRPIRTIATAIIGLVATAEDADATAFPLDTPVLITNVQAAIGKAGTSGTLPASLQAIADQAN 80 (391) T ss_pred CCCCcCCCcEEEEECCCCcceecccCCceeEEEEecCCCCCccccccccEEEecchhhheecCCCccchhhhhhhhcccc Confidence 33 89999986666666666555566666654321111 11223333332111000 001112333333333 Q ss_pred ceEEEE---ecCCCccchhHHH---------HHHHHH-hcc-----cceEEEEcCCChHHHHHHHHHHHHHHHhcCCeEE Q lcl|NC_019422. 70 SKVIVE---VINDSVDSERSLD---------DALKAL-REN-----KFNYLAIPFISEEVDKTKIVNWIKTARREKEIYK 131 (355) Q Consensus 70 ~~v~l~---~g~~g~~~~~~y~---------~al~~l-e~~-----~fn~l~~p~~~d~~~~~~~~~~ik~~r~~g~~~~ 131 (355) ..+.+. ++.....+..++. ..+.++ +.+ ....+.+|+.+..++++.+.+...++ ... 
T Consensus 81 ~~~~vv~~~~~~~~~~t~~d~~g~~~a~~~~~g~~a~~~~~~~~~~~p~~~~ap~~~~~~v~~al~~~~~~~-----~~~ 155 (391) T protein:vir:11 81 AATVVVRVKPGEDEAATNSAVIGGVSADGKYTGMKALLAAKARLGVVPRILGVPGLDTQPVATALIAIAQQL-----RAF 155 (391) T ss_pred ceeEEeeecccccccccchhhhcccccccchhhhhhhhhhhhhheeccccccccccccHHHHHHHHHhhccc-----ceE Confidence 222222 2222211111111 122222 211 23455667666666777777776554 234 Q ss_pred EEec--C----------CCCcCcceeEEecCCeEecC---Cc--eecHHHHHHHHHHHhcCcc----cccccccccCCcc Q lcl|NC_019422. 132 AVLP--N----------ISDANEKAIINFATTGIKVG---EK--SYTTAEYTARLAGILAGIS----LSESCTYFILDEV 190 (355) Q Consensus 132 aVl~--~----------~~~~d~egIinv~n~~i~~~---~~--~~~~~~~~a~vAG~~Ag~~----~~~S~T~~~~~~~ 190 (355) +++- + ....+++..+.+.+.....+ +. .++ .++++||++|.++ +-+|+.|+++.|+ T Consensus 156 ~i~D~p~~~t~~~a~~~r~~~~s~~~~~~~p~~~~~~~~~~~~~~~p---~s~~~ag~~a~~d~~~g~~~span~~l~gi 232 (391) T protein:vir:11 156 AYVSASGCKTKEEATAYRENFAAREAMVIWPDFLTWSTVVNQTVPAP---AVAQALGLRARIDQEVGWHKTLSNVAVNGV 232 (391) T ss_pred EEEEcCCCCCHHHHHHHhhhcCCceEEEEcCcceecccccCceEEec---hHHHHHHHHHHhhccCCcEEccCCceeece Confidence 4442 1 11235556666666543322 11 233 4889999999877 5578889988876 Q ss_pred ccc---------CChhhHHHHHhCCeEEEEECCcEEEEecCccccccCCCCCchhhhhhhHhhHHHHHHHHHHHHhhccc Q lcl|NC_019422. 191 TEI---------EPTENPDEAVEEGKLILINNNGIRIARGVNSLITLSKEDTEDLKKIKIVEAIDMIQDDILQTWNENYV 261 (355) Q Consensus 191 ~~~---------~~~~e~~~ai~~G~lvl~~dg~v~I~~~INSltt~~~~k~~~f~kirvvr~~D~i~~di~~~~~~~yi 261 (355) ... ++..|.+.+-.+|..+++++++.++- |-.|+.+ +..|+.|.++|++|.|.+.+++.... 
|+ T Consensus 233 ~~~~~~~~~~~~~~~~~~~~Ln~~gi~~~~~~~G~~~w-G~rT~~~-----d~~~~~i~vrR~~~~i~~~~~~~~~~-~v 305 (391) T protein:vir:11 233 TGISADVFWDLQSPSTDANYLNENEVTTLVQEGGFRFW-GSRTCSD-----DPLFAFENYTRTAQVLADTIAEAHMW-AV 305 (391) T ss_pred eecccccccccCCCcchhhhhhhcCcEEEEcCCCEEEE-cccccCC-----CcccceeehhhHHHHHHHHHHHHHHH-hc Confidence 643 13457778889999998887776655 4445432 34699999999999999999999875 99 Q ss_pred cccCCCHHHHHHHHHHHHHHHHHHHhcccccCCCCceeEeccccchhhhhccccccccccceeeeccCCCCEEEEEEEEE Q lcl|NC_019422. 262 GKVTNKYDNKILFLSAVNNYFKELQRDEVLDNSQEAYAQIDIEAHKKYLKEKGIDYSEMTEQQIKEANTGSYVFVEGNIT 341 (355) Q Consensus 262 GK~~N~~~gr~~~~~~i~~yl~~l~~~g~I~~~~~~~v~id~e~q~~~~~~~~~d~~~~~d~~v~~~~~~d~v~~~~~i~ 341 (355) ++ +|+..-|..++..++.||.+|.++|+|..+ .+..|.+. +....+. .-.+++++.+. T Consensus 306 ~e-~n~~~~~~~i~~~i~~~l~~l~~~g~l~g~---~~~~~~~~--------------n~~~~i~----~G~~~~~i~~~ 363 (391) T protein:vir:11 306 DK-PMHPSLVRDILEGVNAKFRELKGLGLIIDA---QAWYDPNV--------------NDKDTLK----AGKLRITYDYT 363 (391) T ss_pred cC-CCCHHHHHHHHHHHHHHHHHHHhccceece---EEEEecCC--------------CCHHHhh----CCeEEEEEEEE Confidence 99 899999999999999999999999999976 34444332 2222222 24689999999 Q ss_pred EEeeeeEEEEEEeC Q lcl|NC_019422. 342 ITDAMEDLKFKIYM 355 (355) Q Consensus 342 ~vdamEkiy~tv~v 355 (355) |+-.+|.|.+.++. T Consensus 364 p~~p~e~i~~~~~~ 377 (391) T protein:vir:11 364 PVPPLEDLTFFQKI 377 (391) T ss_pred ecCCcceEEEEEEE Confidence 99999999999998 No 35 >protein:vir:7206 Length: 659 # NCBI annotation: gp18 tail sheath protein # Family: family:all:661 # MgeID: mge:142 # MgeName: T4 # Cross-refs: genbank:acc:NP_049780;genbank:gi:9632592;genbank:GeneID:1258597 Probab=99.63 E-value=8.5e-16 Score=103.22 Aligned_cols=321 Identities=12% Similarity=0.071 Sum_probs=178.9 Q ss_pred CCCCc----------eEEEeeeeeeeeecCCCce-----------------e----------EEEEEecCCccceeEEEe Q lcl|NC_019422. 
1 MGLPS----------AIIEFQRRSRTVKFRSRRG-----------------V----------VALILKDSTAIKKSYSID 43 (355) Q Consensus 1 ~g~P~----------~~i~f~~~a~ta~~~~~rG-----------------~----------v~iil~d~~~~~~~~~~~ 43 (355) .+.|. -.++..-.+.+........ . ..++..........+.+. T Consensus 217 ~~~~~~~a~~~gt~g~~~tv~i~~~~~~~~~~~~~v~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 296 (659) T protein:vir:72 217 YGIPGVVALYPGELGDKIEIEIVSKADYAKGASALLPIYPGGGTRASTAKAVFGYGPQTDSQYAIIVRRNDAIVQSVVLS 296 (659) T ss_pred cccceeeeccccccccceeEEEccccccccceeeeeecccccccccccceeeeeeecccccccceeeecccceeeeeeee Confidence 11111 1111111100000000000 0 001111111111111111 Q ss_pred ehhhhhhhhhhHHHHHHHHhhhcccc---------------c-eEEEEecCCCc--cchhHHHHHHHHHh---cccceEE Q lcl|NC_019422. 44 FLTDINETEFTKENYDYIRLAFLGKP---------------S-KVIVEVINDSV--DSERSLDDALKALR---ENKFNYL 102 (355) Q Consensus 44 ~~~d~~~~~~~~~n~~~i~~a~~g~~---------------~-~v~l~~g~~g~--~~~~~y~~al~~le---~~~fn~l 102 (355) .... ...... ...++...+..+. + .+.+.+|.++. .+..++..++..|+ ..+++.| T Consensus 297 ~~~~--~~~~~~-~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~l~gg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l 373 (659) T protein:vir:72 297 TKRG--EKDIYD-SNIYIDDFFAKGGSEYIFATAQNWPEGFSGILTLSGGLSSNAEVTAGDLMEAWDFFADRESVDVQLF 373 (659) T ss_pred eccc--cccccc-hhhhhhhhhhcCCceEEEEEecccCCcccccccccccccccccccchhHHHHHHHhhhccccceeEE Confidence 1000 000000 0112222221111 1 12344554432 34566777877774 4579999 Q ss_pred EEcCCCh------HHHHHHHHHHHHHHHhc----CCeEEEEec---C-------------------CCCcCcceeEEecC Q lcl|NC_019422. 103 AIPFISE------EVDKTKIVNWIKTARRE----KEIYKAVLP---N-------------------ISDANEKAIINFAT 150 (355) Q Consensus 103 ~~p~~~d------~~~~~~~~~~ik~~r~~----g~~~~aVl~---~-------------------~~~~d~egIinv~n 150 (355) ++|+... .++++.+.++++++|+- ......++. 
+ ...+++...+.+.+ T Consensus 374 ~~p~~~~~~~~~~~~v~~~l~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~s~~~~~~~p 453 (659) T protein:vir:72 374 IAGSCAGESLETASTVQKHVVSIGDARQDCLVLCSPPRETVVGIPVTRAVDNLVNWRTAAGSYTDNNFNISSTYAAIDGN 453 (659) T ss_pred EecCCCCcchhhhHHHHHHHHHHHhhhCCEEEEEcCccccccCCCcccCHHHHHHHHhhccccccccccccceeEEEEcC Confidence 9997532 34677788888877741 111111111 0 00134555555555 Q ss_pred CeEecC---Cc--eecHHHHHHHHHHHhcCccccc----ccccccCCccc---c---cCChhhHHHHHhCCeEEEEE--C Q lcl|NC_019422. 151 TGIKVG---EK--SYTTAEYTARLAGILAGISLSE----SCTYFILDEVT---E---IEPTENPDEAVEEGKLILIN--N 213 (355) Q Consensus 151 ~~i~~~---~~--~~~~~~~~a~vAG~~Ag~~~~~----S~T~~~~~~~~---~---~~~~~e~~~ai~~G~lvl~~--d 213 (355) -....+ +. .++ .++++||++|.+..++ |+-++++.++. + .+++.|.+.+-.+|.-++.+ + T Consensus 454 ~~~~~d~~~~~~~~~p---~sg~vAGl~Ar~D~~~G~~~span~~~~~i~g~~~~~~~~~~~~~~~Ln~~gIn~i~~~~g 530 (659) T protein:vir:72 454 HKYQYDKYNDVNRWVP---LAADIAGLCARTDNVSQTWMSPAGYNRGQILNVIKLAIETRQAQRDRLYQEAINPVTGTGG 530 (659) T ss_pred ceeeccccCCceEEec---hHHHHHHHHHHhhccCCcEEccCCeeeceeeccccccccCChhHHHHHhhCCceEEEEecC Confidence 433222 11 233 3889999999877654 56666544333 2 24678999999999877754 3 Q ss_pred CcEEEEecCccccccCCCCCchhhhhhhHhhHHHHHHHHHHHHhhccccccCCCHHHHHHHHHHHHHHHHHHHhcccccC Q lcl|NC_019422. 214 NGIRIARGVNSLITLSKEDTEDLKKIKIVEAIDMIQDDILQTWNENYVGKVTNKYDNKILFLSAVNNYFKELQRDEVLDN 293 (355) Q Consensus 214 g~v~I~~~INSltt~~~~k~~~f~kirvvr~~D~i~~di~~~~~~~yiGK~~N~~~gr~~~~~~i~~yl~~l~~~g~I~~ 293 (355) ++.++--+ .|+. ..+.+|+.|.++|++|.|.+-+++.... |+++ +|++.-|..++..|..||.+|.++|+|.. 
T Consensus 531 ~G~~~wG~-rT~~----~~~s~~~~i~vrR~~~~i~~si~~~~~~-~v~e-~n~~~l~~~i~~~i~~fL~~l~~~gal~~ 603 (659) T protein:vir:72 531 DGYVLYGD-KTAT----SVPSPFDRINVRRLFNMLKTNIGRSSKY-RLFE-LNNAFTRSSFRTETAQYLQGNKALGGIYE 603 (659) T ss_pred CeEEEEcc-cccC----CCCcccceEeehhHHHHHHHHHHHHHHH-hhcC-CCCHHHHHHHHHHHHHHHHHHHhcCceee Confidence 45655433 4442 2346799999999999999999999765 9999 79999999999999999999999999987 Q ss_pred CCCceeEeccccchhhhhccccccccccceeeeccCCCCEEEEEEEEEEEeeeeEEEEEEeC Q lcl|NC_019422. 294 SQEAYAQIDIEAHKKYLKEKGIDYSEMTEQQIKEANTGSYVFVEGNITITDAMEDLKFKIYM 355 (355) Q Consensus 294 ~~~~~v~id~e~q~~~~~~~~~d~~~~~d~~v~~~~~~d~v~~~~~i~~vdamEkiy~tv~v 355 (355) | .|..|.+-. ....+. .-.+++.+.++|+-.+|.|.+++.- T Consensus 604 ~---~V~~d~~~n--------------t~~~i~----~G~~~~~i~~~p~~pae~I~~~~~~ 644 (659) T protein:vir:72 604 Y---RVVCDTTNN--------------TPSVID----RNEFVATFYIQPARSINYITLNFVA 644 (659) T ss_pred E---EEEEcCCCC--------------CHHHhh----CCeEEEEEEEEecCCccEEEEEEEE Confidence 6 466654422 111111 2468999999999999999999877 No 36 >protein:vir:106984 Length: 743 # NCBI annotation: contractile sheath protein gp18 # Family: family:all:661 # MgeID: mge:1459 # MgeName: S-PM2 # Cross-refs: genbank:acc:YP_195136;genbank:gi:58532913;uniprot:Q5GQN6;genbank:GeneID:3260481 Probab=99.62 E-value=2.2e-15 Score=100.92 Aligned_cols=317 Identities=12% Similarity=0.092 Sum_probs=180.3 Q ss_pred CCCC------------------ceEEEeeeeeeeeecCCCcee---EEEEEecCCccceeEEEee---hhhhhhhhhhHH Q lcl|NC_019422. 1 MGLP------------------SAIIEFQRRSRTVKFRSRRGV---VALILKDSTAIKKSYSIDF---LTDINETEFTKE 56 (355) Q Consensus 1 ~g~P------------------~~~i~f~~~a~ta~~~~~rG~---v~iil~d~~~~~~~~~~~~---~~d~~~~~~~~~ 56 (355) ..-+ .+-..|..+....-.+..-|. +.-.+.+.... .+.... ........+.. 
T Consensus 332 ~~~d~~~v~v~~~~~~~~~~~~~v~~~~~~~s~~~~~~~~~~~~~~~~~~~~~~s~~--~~~~~~~~~~~~~~~~~~~~- 408 (743) T protein:vir:10 332 ITDDQVHFAVIDTTGELTGTANTIVERLTYLSKLSDARSEENANIYYKNVINEQSAY--LYHGNDAAVQIAASGEAWGQ- 408 (743) T ss_pred ccccceEEEEecCcceeeeccCceeEEEeeeecccccccccCcceeecceeccccce--eeccCcccceeeeccccCcc- Confidence 1111 111112211111000111111 11111111000 000000 00000000000 Q ss_pred HHHHHHhh----hc-cccceEEEEecCCCc-cchhHHHHHHHHHh---cccceEEEEcCCC-----hHHHHHHHHHHHHH Q lcl|NC_019422. 57 NYDYIRLA----FL-GKPSKVIVEVINDSV-DSERSLDDALKALR---ENKFNYLAIPFIS-----EEVDKTKIVNWIKT 122 (355) Q Consensus 57 n~~~i~~a----~~-g~~~~v~l~~g~~g~-~~~~~y~~al~~le---~~~fn~l~~p~~~-----d~~~~~~~~~~ik~ 122 (355) ...++... .. .....+.+.||.++. .+..++..++..|+ ...++.|++|+.. ..++++.+.+++++ T Consensus 409 ~~~~~~~~~~~~~~~~~~~~~~~~gG~d~~~~~~~~~~~~~~~~~~~~~~~~~ll~~p~~~~~~~~~~~v~~a~~~~~~~ 488 (743) T protein:vir:10 409 SSDQVLADAGTAFSRTTGYWVNLAGGNDDFAYDAGEFGAAMDLFLDTEETEIDFVLMGGSMADEADTKSKATKVIAIAAS 488 (743) T ss_pred ccceeeeecccccccccceEEEeecCccccccchhHHHHHHHHhhhccccCcceEEecCcccCccchHHHHHHHHHHHHh Confidence 00010000 00 112234567777764 34567887777775 4467999999642 24567778888877 Q ss_pred HHhcCCeEEEEecCC---------------------------CCcCcceeEEecCCeEecC---C--ceecHHHHHHHHH Q lcl|NC_019422. 123 ARREKEIYKAVLPNI---------------------------SDANEKAIINFATTGIKVG---E--KSYTTAEYTARLA 170 (355) Q Consensus 123 ~r~~g~~~~aVl~~~---------------------------~~~d~egIinv~n~~i~~~---~--~~~~~~~~~a~vA 170 (355) +|+ +.+++-.- ...+++..+.+.+-....+ + ..+++ ++++| T Consensus 489 ~~~----~~a~~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~---s~~~A 561 (743) T protein:vir:10 489 RKD----ALAFVSPHKGNQIASTGNVALSSAQQKENTIAFFSDLTSTSYAVFDSGYKYVYDRFTDKYRYIPC---NGDVA 561 (743) T ss_pred hCC----eEEEEecCCCccccccccccccccccchHHHHHHHhccCCeeEEEEccceeeeccccCceeEech---hHHHH Confidence 763 33443110 0123444444433322211 1 12344 79999 Q ss_pred HHhcCcccc----cccccccCCcccc------cCChhhHHHHHhCCeEEEEE--CCcEEEEecCccccccCCCCCchhhh Q lcl|NC_019422. 
171 GILAGISLS----ESCTYFILDEVTE------IEPTENPDEAVEEGKLILIN--NNGIRIARGVNSLITLSKEDTEDLKK 238 (355) Q Consensus 171 G~~Ag~~~~----~S~T~~~~~~~~~------~~~~~e~~~ai~~G~lvl~~--dg~v~I~~~INSltt~~~~k~~~f~k 238 (355) |++|.+..+ .|+.++++.++.. .+++.|.+.+-.+|.-++.+ +++.++- |-.|+. ..+..|+. T Consensus 562 Gl~a~~D~~~g~~~span~~~~gi~g~~~~~~~~~~~~~~~Ln~~gIn~i~~~~~~G~~~w-G~rT~~----s~d~~~~~ 636 (743) T protein:vir:10 562 GLCVQTSNQLDDWYSPAGLNRGGILNAVKLAYNPNKADRDELYQNRINPVVSLRGQGITLF-GDKTAL----AAPSAFDR 636 (743) T ss_pred HHHHHhhccCCcEEccCCeeeeeeeccccceecCChhHHHhHhhCCceEEEEecCCeEEEE-cccccC----CCCcccce Confidence 999988654 4777877665532 24677889999999887754 4466554 444542 33468999 Q ss_pred hhhHhhHHHHHHHHHHHHhhccccccCCCHHHHHHHHHHHHHHHHHHHhcccccCCCCceeEeccccchhhhhccccccc Q lcl|NC_019422. 239 IKIVEAIDMIQDDILQTWNENYVGKVTNKYDNKILFLSAVNNYFKELQRDEVLDNSQEAYAQIDIEAHKKYLKEKGIDYS 318 (355) Q Consensus 239 irvvr~~D~i~~di~~~~~~~yiGK~~N~~~gr~~~~~~i~~yl~~l~~~g~I~~~~~~~v~id~e~q~~~~~~~~~d~~ 318 (355) |.++|++|.|.+.|++.... |+++ +|+..-|..++..++.||.+|.++|+|..| .|.+|.+.. T Consensus 637 i~vrR~~~~i~~si~~~~~~-~v~e-~n~~~~~~~i~~~i~~fL~~l~~~gal~~~---~V~~d~~~n------------ 699 (743) T protein:vir:10 637 INVRRLFLNLEKRARRLAEG-VLFE-QNDATTRAGFSSALNSYLSEVQARRGVTDY---LVICDESNN------------ 699 (743) T ss_pred EeehhhHHHHHHHHHHHHHH-hccC-CCCHHHHHHHHHHHHHHHHHHHhcCceeee---EEEEcCCCC------------ Confidence 99999999999999999875 9999 799999999999999999999999999876 466664422 Q ss_pred cccceeeeccCCCCEEEEEEEEEEEeeeeEEEEEEeC Q lcl|NC_019422. 319 EMTEQQIKEANTGSYVFVEGNITITDAMEDLKFKIYM 355 (355) Q Consensus 319 ~~~d~~v~~~~~~d~v~~~~~i~~vdamEkiy~tv~v 355 (355) ....+. 
.-.+++.+.++|+-.||.|.+++.= T Consensus 700 --t~~~i~----~G~~~~~i~~~p~~pae~I~~~~~~ 730 (743) T protein:vir:10 700 --TPDIID----RNEFVAEVYVKPTRSINFITITFTA 730 (743) T ss_pred --CHHHhh----CCeEEEEEEEEecCCcceEEEEEEE Confidence 111122 2468999999999999999999874 No 37 >protein:vir:5833 Length: 742 # NCBI annotation: similar to tail sheath protein # Family: family:all:661 # MgeID: mge:123 # MgeName: RM 378 # Cross-refs: genbank:acc:NP_835619;genbank:gi:30044022 Probab=99.62 E-value=6.5e-16 Score=103.84 Aligned_cols=318 Identities=11% Similarity=0.092 Sum_probs=172.1 Q ss_pred CCCCceEEEeeeeeee---------ee--cCCCceeEEEEEecCCcc---c--eeEE---EeehhhhhhhhhhHHHH--- Q lcl|NC_019422. 1 MGLPSAIIEFQRRSRT---------VK--FRSRRGVVALILKDSTAI---K--KSYS---IDFLTDINETEFTKENY--- 58 (355) Q Consensus 1 ~g~P~~~i~f~~~a~t---------a~--~~~~rG~v~iil~d~~~~---~--~~~~---~~~~~d~~~~~~~~~n~--- 58 (355) =.++++.+-....+.. ++ ..|. .+.++....... . ..+. +.....+.......... T Consensus 352 ~~~~~v~~~~t~~~~~pp~~~~~~e~v~~ngG~--~f~v~s~~~~g~~i~~~~as~~~s~ln~~~~V~Gt~aa~~~~d~~ 429 (742) T protein:vir:58 352 NELTNVSIPVTDSAIIPPMRFTRIEQITLSGGA--SFSVISNQPYGFNIQDSRHSYWLSPFKDDELIIGTELVLPALDVS 429 (742) T ss_pred cccccceeeccccccCCcccccccceeecccCc--ceEEEEecccCcceeccCcceEEeccCCceEEEeehhhccccccc Confidence 1122222222111100 01 1111 111211110000 0 0000 00000000000100000 Q ss_pred -HHHHhhhccc----cceEEEEecCCCcc------------------chhHHHHHHHHH-hcccceEEEEcCCChHHHHH Q lcl|NC_019422. 59 -DYIRLAFLGK----PSKVIVEVINDSVD------------------SERSLDDALKAL-RENKFNYLAIPFISEEVDKT 114 (355) Q Consensus 59 -~~i~~a~~g~----~~~v~l~~g~~g~~------------------~~~~y~~al~~l-e~~~fn~l~~p~~~d~~~~~ 114 (355) .+......++ ...+.+.||.++.+ ..++++ .|.+| +..+++.|++|+.++...+. 
T Consensus 430 t~~~v~s~~~alp~~a~sv~laGG~dg~v~v~~~~~D~iG~~~~~d~~~adrT-GL~ALlev~eVtILiAPG~t~~~v~a 508 (742) T protein:vir:58 430 TEFGVSSWEEALPEFSFLMPFQGGSDGYIRVDENEPDTIGRVKITPALLANYE-RLLPLLTEDQFDLVLTPYLTFADHAG 508 (742) T ss_pred hheeccccccccceeeEEEeecCCccccccccCCCcccccccccccccccchh-HHHHhhhcCCCcEEEEcCCCchHHHH Confidence 0111111111 11222333333311 013344 35555 55689999999988777788 Q ss_pred HHHHHHHHHHhcCCeEEEEecCCC-----------CcCcceeEEecCCeEecCC---ceecHHHHHHHHHHHhcCccccc Q lcl|NC_019422. 115 KIVNWIKTARREKEIYKAVLPNIS-----------DANEKAIINFATTGIKVGE---KSYTTAEYTARLAGILAGISLSE 180 (355) Q Consensus 115 ~~~~~ik~~r~~g~~~~aVl~~~~-----------~~d~egIinv~n~~i~~~~---~~~~~~~~~a~vAG~~Ag~~~~~ 180 (355) .+.+++..+++. ..+.+-.++.. ..++...+.+.+-....++ ..++ .++++||++|.++..+ T Consensus 509 av~A~la~a~~R-l~vL~D~P~~~tt~~~A~a~r~~~nSsraaly~PwVkv~d~~~~r~vP---pSgaIAGL~ARtD~er 584 (742) T protein:vir:58 509 TVNAFINRAENR-FLYLFDIAGDDDTENLAISLAGYINSSFATTFFPWVRRLTNKGMRTVP---ASLAAYRSIRTTDPET 584 (742) T ss_pred HHHHHHHhhcCC-eEEEEecCCCCchHHHHHHHHhccCCceEEEEeceeeeccCCcceeec---hHHHHHHHHHHhccCC Confidence 888888876642 12222223221 1233333333332221111 1223 3789999999887543 Q ss_pred ----ccccccCCcccccCChhhHHHHHhCCeEEEEEC-CcEEEEecCccccccCCCCCchhhhhhhHhhHHHHHHHHHHH Q lcl|NC_019422. 181 ----SCTYFILDEVTEIEPTENPDEAVEEGKLILINN-NGIRIARGVNSLITLSKEDTEDLKKIKIVEAIDMIQDDILQT 255 (355) Q Consensus 181 ----S~T~~~~~~~~~~~~~~e~~~ai~~G~lvl~~d-g~v~I~~~INSltt~~~~k~~~f~kirvvr~~D~i~~di~~~ 255 (355) |+-+..+-+.. .....|.+.+.++|.-++.+- +++++- |-.|+. ..+..|+.|.++|++|.|.+.|++. 
T Consensus 585 Gvw~SPANrgii~~~-~~s~se~d~LN~~GINtIrsfG~G~rlW-GnRTla----ssDs~wryInVRRlfd~Ie~SI~~a 658 (742) T protein:vir:58 585 GLAPVGARRGVVTGE-PVRQVDWEDLYNNRINPIVRVGNDVLLF-GQKTML----NVNSALNRINVRRLLIVMRNRISQI 658 (742) T ss_pred ceEecCCcceeeecc-ccchhhHHHHhhCCceEEEECCCcEEEE-cceecC----CCCcccceEeehhhHHHHHHHHHHH Confidence 55444332221 235678888999998777654 355554 556653 2346899999999999999999998 Q ss_pred HhhccccccCCCHHHHHHHHHHHHHHHHHHHhcccccCCCCceeEeccccchhhhhccccccccccceeeeccCCCCEEE Q lcl|NC_019422. 256 WNENYVGKVTNKYDNKILFLSAVNNYFKELQRDEVLDNSQEAYAQIDIEAHKKYLKEKGIDYSEMTEQQIKEANTGSYVF 335 (355) Q Consensus 256 ~~~~yiGK~~N~~~gr~~~~~~i~~yl~~l~~~g~I~~~~~~~v~id~e~q~~~~~~~~~d~~~~~d~~v~~~~~~d~v~ 335 (355) ... |+++ +|+..-|..++..++.||..|.++|+|..| .|..|.+ ++...+. .-.++ T Consensus 659 ~q~-~VfE-PNd~~L~~sIk~sInafL~~L~aqGALlGf---rV~lDet---------------NTpeDI~----~Gklv 714 (742) T protein:vir:58 659 LSS-YLFE-NNTSENRLRAEALVRQYLESLRLRGAVTDY---EVAIDSV---------------TTPTDID----NNTLR 714 (742) T ss_pred HHH-hccC-CCCHHHHHHHHHHHHHHHHHHHhCCceeee---EEEEcCC---------------CCHHHhh----CCEEE Confidence 765 9999 799999999999999999999999999875 4555522 1111121 23589 Q ss_pred EEEEEEEEeeeeEEEEEEeC Q lcl|NC_019422. 336 VEGNITITDAMEDLKFKIYM 355 (355) Q Consensus 336 ~~~~i~~vdamEkiy~tv~v 355 (355) +.+.+.|+-.||.|.+++.. T Consensus 715 v~I~vAP~~PAEfI~lrf~i 734 (742) T protein:vir:58 715 ARVTVQPARSIEYIDITFVI 734 (742) T ss_pred EEEEEEccCCcceEEEEEEE Confidence 99999999999999988877 No 38 >protein:vir:79141 Length: 391 # NCBI annotation: major tail seath protein gpFI # Family: family:all:115 # MgeID: mge:1863 # MgeName: RSA1 # Cross-refs: genbank:acc:YP_001165276;genbank:gi:145708101;genbank:GeneID:5247152 Probab=99.62 E-value=2e-16 Score=106.64 Aligned_cols=318 Identities=14% Similarity=0.094 Sum_probs=190.9 Q ss_pred CC---CCceEEEeeeeeeeeecCCCceeEEEEEecCCc------cceeEEEeehhhhhhhhhhHHH-HHHHHhhhc-ccc Q lcl|NC_019422. 
1 MG---LPSAIIEFQRRSRTVKFRSRRGVVALILKDSTA------IKKSYSIDFLTDINETEFTKEN-YDYIRLAFL-GKP 69 (355) Q Consensus 1 ~g---~P~~~i~f~~~a~ta~~~~~rG~v~iil~d~~~------~~~~~~~~~~~d~~~~~~~~~n-~~~i~~a~~-g~~ 69 (355) |- .|+++|.=.......+......+++++..-... .+....+.+.++.......... ...+...+. ++. T Consensus 1 M~~~~~pGv~v~e~~~~~~~i~~~~tav~~~vg~a~~a~~~~~p~n~pv~iss~~~~~~~~g~~gtl~~al~~~~~~gg~ 80 (391) T protein:vir:79 1 MPTDYHHGVRVVELNDGTRPIRTIETAVAGIVCTADDADAATFPLDTPVLLTNPQAYIGKAGDKGTLAHTLDAITDQTNP 80 (391) T ss_pred CCCCCCCCeEEEECCCCcccccccCCceEEEEeecccccccccccccCEEeccHHHHHHhcCCccccchhhhhhhccccc Confidence 33 488888655555555555666666666543211 1222334444443221111000 111222222 222 Q ss_pred ceEEEEecCCC-----------ccchhHHHHHHHHHhcc------cceEEEEcCCChHHHHHHHHHHHHHHHhcCCeEEE Q lcl|NC_019422. 70 SKVIVEVINDS-----------VDSERSLDDALKALREN------KFNYLAIPFISEEVDKTKIVNWIKTARREKEIYKA 132 (355) Q Consensus 70 ~~v~l~~g~~g-----------~~~~~~y~~al~~le~~------~fn~l~~p~~~d~~~~~~~~~~ik~~r~~g~~~~a 132 (355) ..+.+..+... ......-..++..|+.. ....+++|+.+..++++.+.+++.++| ..+ T Consensus 81 ~~~vv~~~~~~~~~~~~~~~~g~~~~~~~~tGl~~l~~~~~~~~~~p~~l~~p~~~~~~v~~al~~~~~~~~-----~~a 155 (391) T protein:vir:79 81 LTVVVRVAGGASEAETTSNLIGTTNAAGRYTGMKALLTARNRFGVAPRILAVPGLDSLPVGTELVTIAQKLR-----AFA 155 (391) T ss_pred ceeeeccccccccccccccccccccchhhhHHHhhhhhhhhhhcccchhhcCCccchhHHHHHHHHHHhhcC-----cEE Confidence 22222111100 00011223344444332 235666777766677778888776655 233 Q ss_pred Ee--cCCC----------CcCcceeEEecCCeEecCC-----ceecHHHHHHHHHHHhcCccc----ccccccccCCccc Q lcl|NC_019422. 133 VL--PNIS----------DANEKAIINFATTGIKVGE-----KSYTTAEYTARLAGILAGISL----SESCTYFILDEVT 191 (355) Q Consensus 133 Vl--~~~~----------~~d~egIinv~n~~i~~~~-----~~~~~~~~~a~vAG~~Ag~~~----~~S~T~~~~~~~~ 191 (355) ++ +... ..++.....+.+.....+. ..+++ ++.+||++|.+.- -+|+.|+++.|+. 
T Consensus 156 i~d~p~~~t~~~a~~~~~~~~s~~~a~~~P~~~~~d~~~~~~~~~p~---s~~~AG~~a~~D~~~g~~~spaN~~l~gi~ 232 (391) T protein:vir:79 156 YLSAYGCQTKEEAVAYRSNFGQREAMVMWPDFVGWDTAANAETTLWA---TARAVGLRAKIDNDTGWHKTLSNVAVGGVT 232 (391) T ss_pred EEECCCCCCHHHHHHHHhccCCceeEEecceeeeecCcCCceeeech---HHHHHHHHHHhhhcccceeccCCceehhhh Confidence 33 2211 1234444444444332221 22343 6999999998874 4788888887765 Q ss_pred cc---C------ChhhHHHHHhCCeEEEEECCcEEEEecCccccccCCCCCchhhhhhhHhhHHHHHHHHHHHHhhcccc Q lcl|NC_019422. 192 EI---E------PTENPDEAVEEGKLILINNNGIRIARGVNSLITLSKEDTEDLKKIKIVEAIDMIQDDILQTWNENYVG 262 (355) Q Consensus 192 ~~---~------~~~e~~~ai~~G~lvl~~dg~v~I~~~INSltt~~~~k~~~f~kirvvr~~D~i~~di~~~~~~~yiG 262 (355) .. . +..|.+.+-.+|...++++++.++--+ -|+. .+..|+.|.++|++|.|.+.|++.+.. |++ T Consensus 233 ~~~~~~~~~~~~~~~~~~~Ln~~~I~t~~~~~G~~~wG~-rT~~-----~d~~~~~i~~rR~~~~i~~~i~~~~~~-~v~ 305 (391) T protein:vir:79 233 GLSRDVFWDLQDPATDAGYLNANEVTTLVHRDGYRFWGS-RTCS-----ADPLFAFENYTRTAQVLADTMAEAHMW-AND 305 (391) T ss_pred ccccccccccccccchhhhhhhcCceEEECCCcEEEEcc-cccC-----CCcccceeehhhHHHHHHHHHHHHHHH-hcc Confidence 32 1 223556677889988887777766544 3442 245799999999999999999999876 999 Q ss_pred ccCCCHHHHHHHHHHHHHHHHHHHhcccccCCCCceeEeccccchhhhhccccccccccceeeeccCCCCEEEEEEEEEE Q lcl|NC_019422. 263 KVTNKYDNKILFLSAVNNYFKELQRDEVLDNSQEAYAQIDIEAHKKYLKEKGIDYSEMTEQQIKEANTGSYVFVEGNITI 342 (355) Q Consensus 263 K~~N~~~gr~~~~~~i~~yl~~l~~~g~I~~~~~~~v~id~e~q~~~~~~~~~d~~~~~d~~v~~~~~~d~v~~~~~i~~ 342 (355) + +|+..-|..++..++.||.+|.++|+|..+ .+..|.+ .++...+.+| .+++.+.+.| T Consensus 306 e-pn~~~~~~~i~~~i~~~l~~l~~~g~l~g~---~v~~~~~--------------~nt~~~i~~G----~~~~~i~~~p 363 (391) T protein:vir:79 306 L-PMTPTLVRDLLEGINAKLRMLTRNGYLLGG---AAWFDAD--------------ANSKDTLKAG----QLAIDYDYTP 363 (391) T ss_pred C-CCCHHHHHHHHHHHHHHHHHHHhCCceece---EEEEecC--------------CCCHHHhhCC----EEEEEEEEEe Confidence 9 899999999999999999999999999876 3444432 3333334433 4799999999 Q ss_pred EeeeeEEEEEEeC Q lcl|NC_019422. 
343 TDAMEDLKFKIYM 355 (355) Q Consensus 343 vdamEkiy~tv~v 355 (355) +-.||.|.++++. T Consensus 364 ~~p~e~i~~~~~~ 376 (391) T protein:vir:79 364 VPPLENLTFRQRI 376 (391) T ss_pred cCCcceEEEEEEE Confidence 9999999999888 No 39 >protein:vir:80984 Length: 666 # NCBI annotation: gp18 tail sheath protein # Family: family:all:661 # MgeID: mge:1888 # MgeName: Phi1 # Cross-refs: genbank:acc:YP_001469499;genbank:gi:157311456;genbank:GeneID:5602117 Probab=99.61 E-value=4.8e-15 Score=99.08 Aligned_cols=321 Identities=11% Similarity=0.041 Sum_probs=180.2 Q ss_pred CCCC-----------------------------ceEEEeeee---eeee---ecCCCceeEEEEEecCCccceeEEEeeh Q lcl|NC_019422. 1 MGLP-----------------------------SAIIEFQRR---SRTV---KFRSRRGVVALILKDSTAIKKSYSIDFL 45 (355) Q Consensus 1 ~g~P-----------------------------~~~i~f~~~---a~ta---~~~~~rG~v~iil~d~~~~~~~~~~~~~ 45 (355) .+.| .+.+.-..- ...+ ......+-..+++.......+.+.+... T Consensus 217 ~~~~a~~a~~~g~~g~~l~v~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~e~~~~~~~ 296 (666) T protein:vir:80 217 YDMPAVSAIYAGEIGNSLEVEILARSAFKNTAPDLTMYPYGGERTAARNLIPYAPQNDNQYAFIVRRDGVVVESYVLSTL 296 (666) T ss_pred ccchhhhhhcccccccceeeeeccccccccccccceeeeccccccccceeeeeccccccceeeEeccCCccceeeecccc Confidence 1111 111100000 0000 0000001112233333333334444333 Q ss_pred hhhhhhhhhHHHHHHHHhhhc------------cc---cce-EEEEecCCCccc----------h---hHHHHHHHHHhc Q lcl|NC_019422. 46 TDINETEFTKENYDYIRLAFL------------GK---PSK-VIVEVINDSVDS----------E---RSLDDALKALRE 96 (355) Q Consensus 46 ~d~~~~~~~~~n~~~i~~a~~------------g~---~~~-v~l~~g~~g~~~----------~---~~y~~al~~le~ 96 (355) .+..... ....|+...+. +. .+. +.+.+|.++... . ..-...+...|. 
T Consensus 297 ~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~ 373 (666) T protein:vir:80 297 KGDKDVY---GNSIYMDDFFGRGSSQYIYATAQGWVDGFSGIISLAGGVSANEATTGGVGADPFIGAMMQGWGLFAERES 373 (666) T ss_pred ccccccc---chhhhhhhhhccccceeeeecccccccccceEEEecCCCCcccccccccccccccccchhhhhhhhhhcc Confidence 2221111 11112221111 11 111 123333322110 0 112234455567 Q ss_pred ccceEEEEcCCC-----hHHHHHHHHHHHHHHHhc----CCeEEEEecCC----------------------CCcCccee Q lcl|NC_019422. 97 NKFNYLAIPFIS-----EEVDKTKIVNWIKTARRE----KEIYKAVLPNI----------------------SDANEKAI 145 (355) Q Consensus 97 ~~fn~l~~p~~~-----d~~~~~~~~~~ik~~r~~----g~~~~aVl~~~----------------------~~~d~egI 145 (355) +.++.+++|+.. ..+++..+.+++.++|+- .-...+++... ..+++... T Consensus 374 ~~~~~l~~p~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~vd~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~ 453 (666) T protein:vir:80 374 IHVNLLIAGACAGEGDAFSTVQKHAVSIGDERQDCLVMVSPPRSTVVNIPVTTAIDNLIAWREGSGNYNENNMNINTTYA 453 (666) T ss_pred cccceEeecCcCCcccchHHHHHHHHHHHHhhcceEEEeecceeEEeecCCCCCHHHHHHHHHhcccchhhhcccCcceE Confidence 789999998754 345788888888888742 12222332210 01344444 Q ss_pred EEecCCeEecC---Cc--eecHHHHHHHHHHHhcCccccc----ccccccCCccc---c---cCChhhHHHHHhCCeEEE Q lcl|NC_019422. 146 INFATTGIKVG---EK--SYTTAEYTARLAGILAGISLSE----SCTYFILDEVT---E---IEPTENPDEAVEEGKLIL 210 (355) Q Consensus 146 inv~n~~i~~~---~~--~~~~~~~~a~vAG~~Ag~~~~~----S~T~~~~~~~~---~---~~~~~e~~~ai~~G~lvl 210 (355) +.+.+-....+ +. .+++ ++.+||+.|.....+ |+.++++.++. . .+++.|.+.+-.+|.-++ T Consensus 454 ~l~~p~~~~~d~~~~~~~~~p~---sg~~AGl~Ar~D~~~g~~~sPan~~~~~i~g~~~~~~~~~~~~~~~Ln~~gIn~i 530 (666) T protein:vir:80 454 VIDGNYKYQYDKYNDVNRWVPL---AADIAGLCARTDAVSQPWMSPAGYNRGQIMNVVKLAIEPRKAHRDRLYQAAINPV 530 (666) T ss_pred EEEcCceEEecccCCceeEech---HHHHHHHHHHHhhcCCceEccCCeecceeeccccceeecChhHHHhhhhCCeeEE Confidence 44444332222 11 2333 799999999876555 77777655443 2 246789999999998776 Q ss_pred EE--CCcEEEEecCccccccCCCCCchhhhhhhHhhHHHHHHHHHHHHhhccccccCCCHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_019422. 
211 IN--NNGIRIARGVNSLITLSKEDTEDLKKIKIVEAIDMIQDDILQTWNENYVGKVTNKYDNKILFLSAVNNYFKELQRD 288 (355) Q Consensus 211 ~~--dg~v~I~~~INSltt~~~~k~~~f~kirvvr~~D~i~~di~~~~~~~yiGK~~N~~~gr~~~~~~i~~yl~~l~~~ 288 (355) .+ ++++++--+ .|+ ...+.+|+.|.++|++|.|.+-|++.... |+++ +|+..-|..++..++.||.+|.++ T Consensus 531 ~~~~g~G~~~wG~-rT~----~~~~s~~~~i~vRRl~~~i~~si~~~~~~-~v~e-pn~~~l~~~i~~~i~~~L~~l~~~ 603 (666) T protein:vir:80 531 IGAGGEGFILMGD-KTA----TTVPSPFDRINVRRLFNMLKKNIGDSSKY-KLFE-NNDNFTRASFRMEVSQYLSTIRSL 603 (666) T ss_pred EEeCCCeEEEEcc-ccC----CCCCcccceeehhhHHHHHHHHHHHHHHH-hccC-CCCHHHHHHHHHHHHHHHHHHHhc Confidence 53 446776644 444 12335799999999999999999998875 9999 799999999999999999999999 Q ss_pred ccccCCCCceeEeccccchhhhhccccccccccceeeeccCCCCEEEEEEEEEEEeeeeEEEEEEeC Q lcl|NC_019422. 289 EVLDNSQEAYAQIDIEAHKKYLKEKGIDYSEMTEQQIKEANTGSYVFVEGNITITDAMEDLKFKIYM 355 (355) Q Consensus 289 g~I~~~~~~~v~id~e~q~~~~~~~~~d~~~~~d~~v~~~~~~d~v~~~~~i~~vdamEkiy~tv~v 355 (355) |+|..| .+..|.+-.. ...+. .-.+++.+.++|+-.||.|.+++.- T Consensus 604 gal~g~---~V~~d~~~nt--------------~~di~----~G~~~~~i~~~P~~Pae~I~~~~~~ 649 (666) T protein:vir:80 604 GGIYDF---RVQCDTTNNT--------------PDVID----RNEFVASMFIKPAKSINYIMLNFTA 649 (666) T ss_pred Cceeee---EEEEcCCCCC--------------HHHhh----CCeEEEEEEEEecCCcceEEEEEEE Confidence 999986 4666644221 11111 2568999999999999999999876 No 40 >protein:vir:100323 Length: 393 # NCBI annotation: tail sheath protein FI # Family: family:all:115 # MgeID: mge:1484 # MgeName: phi-MhaA1-PHL101 # Cross-refs: genbank:acc:YP_655492;genbank:gi:109289960;genbank:GeneID:4157366 Probab=99.60 E-value=3.1e-15 Score=100.16 Aligned_cols=320 Identities=15% Similarity=0.092 Sum_probs=189.7 Q ss_pred CC-----CCceEEEeeeeeeeeecCCCceeEEEEEecCCcccee------EEEeehhhhhhhhhhH-HHHHHHHhhhccc Q lcl|NC_019422. 
1 MG-----LPSAIIEFQRRSRTVKFRSRRGVVALILKDSTAIKKS------YSIDFLTDINETEFTK-ENYDYIRLAFLGK 68 (355) Q Consensus 1 ~g-----~P~~~i~f~~~a~ta~~~~~rG~v~iil~d~~~~~~~------~~~~~~~d~~~~~~~~-~n~~~i~~a~~g~ 68 (355) |- +|++++.=..-+..++....-.++++|..-....... ..+.+..+........ .-...+...+..+ T Consensus 1 m~m~~~~~~GV~v~e~~~g~~~i~~~~tav~~~vgta~~~~~~~~pln~pv~i~s~~~~~~~~g~~g~L~~al~~~~~~~ 80 (393) T protein:vir:10 1 MSILDTYLHGVEVVEVNAGGVTISTAATSVIGVVCTGDQADAETFPLNTPVLITNPLNYLEKAGSTGTLRRTLNSIGSIV 80 (393) T ss_pred CCCCCccCCCeEEEEcCCCcceecccCcceeEEEeeccCcCcccccCccceEecchHHHHHhhCCccchhhhhhhhhccc Confidence 43 4688777666666666555555666665322221112 2233333332211100 0112222233333 Q ss_pred cceEEEEecCCC---ccc--------hhHHHHHHHHHh---c---ccceEEEEcCCChHHHHHHHHHHHHHHHhcCCeEE Q lcl|NC_019422. 69 PSKVIVEVINDS---VDS--------ERSLDDALKALR---E---NKFNYLAIPFISEEVDKTKIVNWIKTARREKEIYK 131 (355) Q Consensus 69 ~~~v~l~~g~~g---~~~--------~~~y~~al~~le---~---~~fn~l~~p~~~d~~~~~~~~~~ik~~r~~g~~~~ 131 (355) ...+++.....+ ..+ .......+.+|+ . ...+.++.|+.+..+..+.+.+.+++++. .-+. T Consensus 81 ~~~~~vv~v~~~~~~~~t~~~iig~~~~~~~tgl~al~~~~~~~~~~p~li~apg~~~~~~~~al~~~~~~~~~--~~~v 158 (393) T protein:vir:10 81 KTPTVIVRVAESDDSDTLTANIVGTQENGKFTGIKALLTAQSTVFVKPKLLCVPQHDNQAVATELLSVAKKLNA--FAFI 158 (393) T ss_pred CceEEEeecccCccccccccccccccccchhhHHHHHHhhhhhcceeeeeeeeccccchHHHHHHHHHhhccCc--EEEE Confidence 333332222111 100 011223344443 2 23477888987777788888888887652 1111 Q ss_pred EEecCCC---------CcCcceeEEecCCeEecC---C--ceecHHHHHHHHHHHhcCccc----ccccccccCCccccc Q lcl|NC_019422. 132 AVLPNIS---------DANEKAIINFATTGIKVG---E--KSYTTAEYTARLAGILAGISL----SESCTYFILDEVTEI 193 (355) Q Consensus 132 aVl~~~~---------~~d~egIinv~n~~i~~~---~--~~~~~~~~~a~vAG~~Ag~~~----~~S~T~~~~~~~~~~ 193 (355) ...++.+ ..++.....+.+.....+ + ..++ .++.+||++|.+.- -+|+.|+++.|+... 
T Consensus 159 ~d~~~~t~~~ai~~~~~~~s~~~~~~~P~~~~~d~~~~~~~~~p---~s~~~Ag~~a~~d~~~G~~~spaN~~l~gi~~~ 235 (393) T protein:vir:10 159 SDNGATTKEQAYTYRQNFSQREGMMIFGDWKSYNTDKKAYDTDY---AVARACALQAYIDKTVGWHKNISNVELDGVTGI 235 (393) T ss_pred EcCCCCCHHHHHHHhhhcCCceEEEEecccccccccCCceeEee---hhHHHHHHHHHhhcCCCcEEccCCceeeceeec Confidence 1112211 122323333333322111 1 2233 36899999998764 468888887776542 Q ss_pred ---------CChhhHHHHHhCCeEEEEECCcEEEEecCccccccCCCCCchhhhhhhHhhHHHHHHHHHHHHhhcccccc Q lcl|NC_019422. 194 ---------EPTENPDEAVEEGKLILINNNGIRIARGVNSLITLSKEDTEDLKKIKIVEAIDMIQDDILQTWNENYVGKV 264 (355) Q Consensus 194 ---------~~~~e~~~ai~~G~lvl~~dg~v~I~~~INSltt~~~~k~~~f~kirvvr~~D~i~~di~~~~~~~yiGK~ 264 (355) ++..|.+.+-++|.-+++++++.++- |--|+. .+..|+.|.++|+.|.|.+.+++.+.. |++| T Consensus 236 ~~~~~~~~~~~~~~~~~Ln~~gI~t~~~~~G~~~w-G~rT~s-----~d~~~~~i~vrR~~~~i~~~i~~~~~~-~v~e- 307 (393) T protein:vir:10 236 TKAVEFDINESSTEANYLNEKGITICLNHNGFRYW-GSRTLA-----TDTRWAFQQSVRTAQIIKETIGAGLAW-AVDM- 307 (393) T ss_pred ceecccccCCCcchhHhHhhcCceEEEcCCCEEEE-cccccC-----CCcccceeehhhHHHHHHHHHHHHHHH-hccC- Confidence 23567787888998888887777665 333442 245699999999999999999999876 9999 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHhcc--cccCCCCceeEeccccchhhhhccccccccccceeeeccCCCCEEEEEEEEEE Q lcl|NC_019422. 265 TNKYDNKILFLSAVNNYFKELQRDE--VLDNSQEAYAQIDIEAHKKYLKEKGIDYSEMTEQQIKEANTGSYVFVEGNITI 342 (355) Q Consensus 265 ~N~~~gr~~~~~~i~~yl~~l~~~g--~I~~~~~~~v~id~e~q~~~~~~~~~d~~~~~d~~v~~~~~~d~v~~~~~i~~ 342 (355) +|+..-+..++..++.||.+|++.| +|..+ .+..|.+ ++...+.. -.+++++.+.| T Consensus 308 ~~~~~~~~~i~~~i~~~L~~l~~~g~~al~g~---~v~~~~~---------------nt~~~i~~----G~~~~~i~~~p 365 (393) T protein:vir:10 308 PLTPLRVKTMLEAINNKLRSWASGDDPRILGA---RVWVAEE---------------ITADIIKS----GKFVIKYDYHW 365 (393) T ss_pred CCCHHHHHHHHHHHHHHHHHHHhccccccccc---eEEecCC---------------CCHHHhhC----CEEEEEEEEEe Confidence 8999999999999999999999855 78776 3433332 22233333 36899999999 Q ss_pred EeeeeEEEEEEeC Q lcl|NC_019422. 
343 TDAMEDLKFKIYM 355 (355) Q Consensus 343 vdamEkiy~tv~v 355 (355) +-.||.|.++++. T Consensus 366 ~~p~e~I~~~~~~ 378 (393) T protein:vir:10 366 IPSLESLGLEQRV 378 (393) T ss_pred cCCcceEEEEEEE Confidence 9999999999998 No 41 >protein:vir:103456 Length: 659 # NCBI annotation: tail sheath monomer # Family: family:all:661 # MgeID: mge:1542 # MgeName: RB32 # Cross-refs: genbank:acc:YP_803108;genbank:gi:116326388;genbank:GeneID:4405485 Probab=99.60 E-value=3.1e-15 Score=100.13 Aligned_cols=311 Identities=10% Similarity=0.051 Sum_probs=178.2 Q ss_pred CCCCceEEEeeeeeeeeecCCCcee-EEEEEecCCccceeEEEeehhhhh----hhhhh-----HHHHHHHHhhhcc--- Q lcl|NC_019422. 1 MGLPSAIIEFQRRSRTVKFRSRRGV-VALILKDSTAIKKSYSIDFLTDIN----ETEFT-----KENYDYIRLAFLG--- 67 (355) Q Consensus 1 ~g~P~~~i~f~~~a~ta~~~~~rG~-v~iil~d~~~~~~~~~~~~~~d~~----~~~~~-----~~n~~~i~~a~~g--- 67 (355) -+.+...+.+. +.... ..+++.........+.+....... ...+. .....|+...... T Consensus 262 a~~~t~~~~~~---------~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~ 332 (659) T protein:vir:10 262 ASTAKAVFGYG---------PQTDSQYAIIVRRNDAIVQSVVLSTKRGEKDIYDSNIYIDDFFAKGGSEYIFATAQNWPE 332 (659) T ss_pred cccceeeeeec---------cccccchhhccccccceeeeeeeeccccccccccchhhhhhhhccCcccEEEEeecccCC Confidence 11111111111 11110 111111111111111111100000 00000 0001111111111 Q ss_pred ccc-eEEEEecCCCc--cchhHHHHHHHHHh---cccceEEEEcCCCh------HHHHHHHHHHHHHHHhcCCeEEEEec Q lcl|NC_019422. 68 KPS-KVIVEVINDSV--DSERSLDDALKALR---ENKFNYLAIPFISE------EVDKTKIVNWIKTARREKEIYKAVLP 135 (355) Q Consensus 68 ~~~-~v~l~~g~~g~--~~~~~y~~al~~le---~~~fn~l~~p~~~d------~~~~~~~~~~ik~~r~~g~~~~aVl~ 135 (355) ..+ -+.+.+|.++. .+..++..++.+|+ ..+++.|++|+... .++++.+.++++++|+ +.+++. 
T Consensus 333 ~~~~~~~l~gg~~~~~~~~~~~~~~~~~~~~~~~~~~~~il~~p~~~~~~~~~~~~v~~al~~~~~~~~~----~~~~~d 408 (659) T protein:vir:10 333 GFSGILTLSGGLSSNAEVTAGDLMEAWDFFADRESVDVQLFIAGSCAGESLETASTVQKHVVSIGDARQD----CLVLCS 408 (659) T ss_pred CccceeeecccccccccccchhHHHHHHHhhhccccceeEEEecCCCCcchhhhHHHHHHHHHHHHhhCC----eEEEEc Confidence 112 23456665542 34456777777664 55799999997532 3467777888887763 222221 Q ss_pred --------C---C-------------------CCcCcceeEEecCCeEecC---Cc--eecHHHHHHHHHHHhcCccccc Q lcl|NC_019422. 136 --------N---I-------------------SDANEKAIINFATTGIKVG---EK--SYTTAEYTARLAGILAGISLSE 180 (355) Q Consensus 136 --------~---~-------------------~~~d~egIinv~n~~i~~~---~~--~~~~~~~~a~vAG~~Ag~~~~~ 180 (355) . . ..+|+.....+.+-....+ +. .++ .++++||++|.+..++ T Consensus 409 ~p~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~s~~~~l~~p~~~~~d~~~~~~~~~p---~sg~~AGl~Ar~D~~~ 485 (659) T protein:vir:10 409 PPRETVVGIPVTRAVDNLVNWRTAAGSYTDNNFNISSTYAAIDGNYKYQYDKYNDVNRWVP---LAADIAGLCARTDNVS 485 (659) T ss_pred CccccccCCCcccCHHHHHHHHHhcccccccccccCcceEEEEeCcEEEecccCCceEEec---hHHHHHHHHHHHhccC Confidence 0 0 0134555555544433222 22 233 3899999999876655 Q ss_pred ----ccccccCC---cccc---cCChhhHHHHHhCCeEEEEE--CCcEEEEecCccccccCCCCCchhhhhhhHhhHHHH Q lcl|NC_019422. 181 ----SCTYFILD---EVTE---IEPTENPDEAVEEGKLILIN--NNGIRIARGVNSLITLSKEDTEDLKKIKIVEAIDMI 248 (355) Q Consensus 181 ----S~T~~~~~---~~~~---~~~~~e~~~ai~~G~lvl~~--dg~v~I~~~INSltt~~~~k~~~f~kirvvr~~D~i 248 (355) |+-++++. ++.+ .+++.|.+.+-.+|.-++.+ +++.++--+ .|+ ...+.+|+.|.++|++|+| T Consensus 486 g~~~span~~~~~i~g~~~~~~~~~~~~~~~Ln~~gIn~i~~~~~~G~~~wG~-rT~----~~~~s~~~~i~vrR~~~~i 560 (659) T protein:vir:10 486 QTWMSPAGYNRGQILNVIKLAIETRQAQRDRLYQEAINPVTGTGGDGYVLYGD-KTA----TSVPSPFDRINVRRLFNML 560 (659) T ss_pred CceEccCCceeeeeeccccceecCCHhHHHHHhhCCeeEEEEeCCCeEEEEcc-ccc----CCCCcccceEehhhHHHHH Confidence 56565443 3332 25678999999999877754 446666544 333 1223579999999999999 Q ss_pred HHHHHHHHhhccccccCCCHHHHHHHHHHHHHHHHHHHhcccccCCCCceeEeccccchhhhhccccccccccceeeecc Q lcl|NC_019422. 
249 QDDILQTWNENYVGKVTNKYDNKILFLSAVNNYFKELQRDEVLDNSQEAYAQIDIEAHKKYLKEKGIDYSEMTEQQIKEA 328 (355) Q Consensus 249 ~~di~~~~~~~yiGK~~N~~~gr~~~~~~i~~yl~~l~~~g~I~~~~~~~v~id~e~q~~~~~~~~~d~~~~~d~~v~~~ 328 (355) .+-|++.... |+++ +|++.-|..++..|+.||.+|.++|+|..| .|.+|.+.... ..+. T Consensus 561 ~~si~~~~~~-~v~e-~n~~~l~~~i~~~i~~fL~~l~~~gal~~~---~V~~d~~~nt~--------------~~i~-- 619 (659) T protein:vir:10 561 KTNIGRSSKY-RLFE-LNNAFTRSSFRTETAQYLQGIKALGGIYEY---RVVCDTTNNTP--------------SVID-- 619 (659) T ss_pred HHHHHHHHHH-hccC-CCCHHHHHHHHHHHHHHHHHHHhcCceeeE---EEEEcCCCCCH--------------HHhh-- Confidence 9999999766 9999 799999999999999999999999999876 46665442211 1111 Q ss_pred CCCCEEEEEEEEEEEeeeeEEEEEEeC Q lcl|NC_019422. 329 NTGSYVFVEGNITITDAMEDLKFKIYM 355 (355) Q Consensus 329 ~~~d~v~~~~~i~~vdamEkiy~tv~v 355 (355) .-.+++.+.+.|+-.+|.|.+++.- T Consensus 620 --~G~~~~~i~~~p~~pae~i~~~~~~ 644 (659) T protein:vir:10 620 --RNEFVATFYIQPARSINYITLNFVA 644 (659) T ss_pred --CCeEEEEEEEEecCCcceEEEEEEE Confidence 2468999999999999999999887 No 42 >protein:vir:10336 Length: 386 # NCBI annotation: ORF39 # Family: family:all:115 # MgeID: mge:182 # MgeName: VHML # Cross-refs: genbank:acc:NP_758931;genbank:gi:27311206;genbank:GeneID:956116 Probab=99.60 E-value=2.6e-15 Score=100.54 Aligned_cols=318 Identities=17% Similarity=0.087 Sum_probs=183.8 Q ss_pred CC---CCceEEEeeeeeeeeecCCCceeEEEEEecCCccce------eEEEeehhhhhhhhhhHH-HHHHHHhhhccccc Q lcl|NC_019422. 1 MG---LPSAIIEFQRRSRTVKFRSRRGVVALILKDSTAIKK------SYSIDFLTDINETEFTKE-NYDYIRLAFLGKPS 70 (355) Q Consensus 1 ~g---~P~~~i~f~~~a~ta~~~~~rG~v~iil~d~~~~~~------~~~~~~~~d~~~~~~~~~-n~~~i~~a~~g~~~ 70 (355) |. .|+++++=..-...++.....+++.+|......... ...+.+..+......... -...++..+..++. 
T Consensus 1 M~~~~~~Gv~v~ev~~~~~~i~~v~tav~~~vg~a~~a~~~~~~~~~pv~i~s~~~~~~~~g~~~tl~~a~~~~~~~gg~ 80 (386) T protein:vir:10 1 MAEQYLHGAEVVEIDNGARPIRTAQSGVIGLVGTAPDADATAFPLNTPVLIAGSRREAAKLGAGGTLPQAIDGIFDQTGA 80 (386) T ss_pred CccccCCCeEEEEcCCCcccccccCcceeEEEEecCCCCCcccccccceEecchHHHHhhcCCCcchhHHHHHHhccCce Confidence 33 478888866666666766667777777643221111 122233332211111110 01223333333322 Q ss_pred eEEEEe---cCCCccc----------hhHHHHHHHHHhcccc------eEEEEcCCCh-HHHHHHHHHHHHHHHhcCCeE Q lcl|NC_019422. 71 KVIVEV---INDSVDS----------ERSLDDALKALRENKF------NYLAIPFISE-EVDKTKIVNWIKTARREKEIY 130 (355) Q Consensus 71 ~v~l~~---g~~g~~~----------~~~y~~al~~le~~~f------n~l~~p~~~d-~~~~~~~~~~ik~~r~~g~~~ 130 (355) ...+.. +.++..+ .......+.+|..... +.+..|+..+ ..+.+.+.+..+++ .. T Consensus 81 ~~~vv~~~~~~~~~~t~~~~ig~~~~~t~~~tgl~~l~~~~~~~~~~p~i~~ap~~~~~~~v~~~l~~~~~~~-----~~ 155 (386) T protein:vir:10 81 VVVVIRVDEGVDSAATQSNVIGKVDADTEQYTGILALLSAENTVKVQPRILIAPGFSNQKAVADQLVSVADTA-----AW 155 (386) T ss_pred eEEEeeccccccccccchhhhcccccccchhhhhHHhhhhcccccccccccccccccchhHHHHHHHHhhcce-----EE Confidence 222222 1111111 1122334555543322 3333333221 11222222222222 22 Q ss_pred EEEecCCC-----------CcCcceeEEecCCeEec-----CCceecHHHHHHHHHHHhcCcc----cccccccccCCcc Q lcl|NC_019422. 131 KAVLPNIS-----------DANEKAIINFATTGIKV-----GEKSYTTAEYTARLAGILAGIS----LSESCTYFILDEV 190 (355) Q Consensus 131 ~aVl~~~~-----------~~d~egIinv~n~~i~~-----~~~~~~~~~~~a~vAG~~Ag~~----~~~S~T~~~~~~~ 190 (355) .++..... ..++.....+.+..... ....+++ ++++||+.|.+. 
+-+|+.|+++.|+ T Consensus 156 ~~~~~~~~~~~~~a~~~~~~~~s~~~~~~~p~~~v~~~~~~~~~~~p~---s~~~ag~~a~~D~~~G~~~spaN~~l~gv 232 (386) T protein:vir:10 156 LCHSGWSNTTDAAAITYRELFGSRRCEVVDPWYKVWDVETSAHIIQPP---SARHAGVMAKVHNTLGFWWSNSNQEILGI 232 (386) T ss_pred EEEeCCCCCchHHHHHhhhcccccceEEecCceeeeccccccceeech---HHHHHHHHHHhhhcCCcEEccCCceeecc Confidence 23333211 12333333333332221 1233444 789999999886 4458889888776 Q ss_pred ccc---------CChhhHHHHHhCCeEEEEECCcEEEEecCccccccCCCCCchhhhhhhHhhHHHHHHHHHHHHhhccc Q lcl|NC_019422. 191 TEI---------EPTENPDEAVEEGKLILINNNGIRIARGVNSLITLSKEDTEDLKKIKIVEAIDMIQDDILQTWNENYV 261 (355) Q Consensus 191 ~~~---------~~~~e~~~ai~~G~lvl~~dg~v~I~~~INSltt~~~~k~~~f~kirvvr~~D~i~~di~~~~~~~yi 261 (355) ... ++..|.+.+-.+|...+++++++++- |--|+. -+..|+.|.++|++|.|.+.+++.+.. |+ T Consensus 233 ~~~~~~~~~~~~~~~~~~~~l~~~gi~~~~~~~G~~~w-G~rT~~-----~d~~~~~i~vrR~~~~i~~~~~~~~~~-~v 305 (386) T protein:vir:10 233 DGLCRPVDFKLDDPTCRANLLNAKEVTTTIQQNGFRVW-GDRTCS-----ADSKWAFKNVVITNDMIADSLVRNHLW-AV 305 (386) T ss_pred cccceecccccccCcchhhhhhhcCcEEEEcCCCEEEE-cccccC-----CCcccceeehhhHHHHHHHHHHHHHHH-hc Confidence 532 13457788889999999887776665 444542 245799999999999999999999876 99 Q ss_pred cccCCCHHHHHHHHHHHHHHHHHHHhcccccCCCCceeEeccccchhhhhccccccccccceeeeccCCCCEEEEEEEEE Q lcl|NC_019422. 262 GKVTNKYDNKILFLSAVNNYFKELQRDEVLDNSQEAYAQIDIEAHKKYLKEKGIDYSEMTEQQIKEANTGSYVFVEGNIT 341 (355) Q Consensus 262 GK~~N~~~gr~~~~~~i~~yl~~l~~~g~I~~~~~~~v~id~e~q~~~~~~~~~d~~~~~d~~v~~~~~~d~v~~~~~i~ 341 (355) ++ +|+..-+..++..++.||.+|.++|+|..| .|.+|.+- +....+. .-.+++.+.+. T Consensus 306 ~e-~~~~~~~~~i~~~i~~~L~~l~~~g~l~g~---~v~~d~~~--------------nt~~~~~----~G~~~~~i~~~ 363 (386) T protein:vir:10 306 DR-NITKTYVEDVTEGVNNYLRHLKNIGAIAGG---ECWVDPEL--------------NSPDQIQ----QGKVYFDYDFS 363 (386) T ss_pred cC-CCCHHHHHHHHHHHHHHHHHHHhCCceeee---EEEEcccC--------------CCHHHhh----CCeEEEEEEEE Confidence 99 799999999999999999999999999986 46665442 2222222 34689999999 Q ss_pred EEeeeeEEEEEEeC Q lcl|NC_019422. 
342 ITDAMEDLKFKIYM 355 (355) Q Consensus 342 ~vdamEkiy~tv~v 355 (355) |+--+|.|.++++. T Consensus 364 p~~p~e~i~~~~~~ 377 (386) T protein:vir:10 364 AYAPAEHITFRSHM 377 (386) T ss_pred ecCCceeEEEEEEE Confidence 99999999999988 No 43 >protein:vir:6594 Length: 666 # NCBI annotation: tail sheath protein # Family: family:all:661 # MgeID: mge:139 # MgeName: RB49 # Cross-refs: genbank:acc:NP_891725;genbank:gi:33620670;genbank:GeneID:1725344 Probab=99.59 E-value=4.3e-15 Score=99.35 Aligned_cols=320 Identities=10% Similarity=0.032 Sum_probs=175.3 Q ss_pred CC-CCceEEEeee-----eeeeeecCCCceeEEEEEecCCccceeEEEeehhhhhhhhhhHHHHHHHHhhhc----cccc Q lcl|NC_019422. 1 MG-LPSAIIEFQR-----RSRTVKFRSRRGVVALILKDSTAIKKSYSIDFLTDINETEFTKENYDYIRLAFL----GKPS 70 (355) Q Consensus 1 ~g-~P~~~i~f~~-----~a~ta~~~~~rG~v~iil~d~~~~~~~~~~~~~~d~~~~~~~~~n~~~i~~a~~----g~~~ 70 (355) .+ .....+.... -+......|.---...+.................+... .....++..... +... T Consensus 259 ~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~e~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~v~~~~~~~~~~~~~ 334 (666) T protein:vir:65 259 ERTAARNLIPYAPQNDNQYAFIVRRDGVVVESYVLSTLKGDKDVYGNSIYMDDFFA----RGSSQYIYATAQGWVDGFSG 334 (666) T ss_pred ccccceeeecccccccccceeeeecCCcccceeecccCcccccccchhhhhhhhhc----ccccceeeeecccccccccc Confidence 00 0011111100 01111111110000011111100000000000001000 000111111111 1111 Q ss_pred eEEEEecCCCcc----------chhHHHHHHHHHhc---ccceEEEEcCCC-----hHHHHHHHHHHHHHHHhc----CC Q lcl|NC_019422. 71 KVIVEVINDSVD----------SERSLDDALKALRE---NKFNYLAIPFIS-----EEVDKTKIVNWIKTARRE----KE 128 (355) Q Consensus 71 ~v~l~~g~~g~~----------~~~~y~~al~~le~---~~fn~l~~p~~~-----d~~~~~~~~~~ik~~r~~----g~ 128 (355) .+.+.+|.++.. ...++..++++|+. 
+.++.+++|+.+ +.++++.+.++++++|+- +- T Consensus 335 ~~~~~~g~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~l~~p~~~~~~~~~~~v~~~l~~~~~~~~~~~a~~d~ 414 (666) T protein:vir:65 335 IISLAGGVSANEATTGGVGADPFIGAMMQGWDLFAERESIHVNLLIAGACAGEGDAFSTVQKHAVSIGDERQDCLVMVSP 414 (666) T ss_pred eEEccCCCCcCcccccccccccccccHHHHHHHHhhhhhccCCceeecCcCCccchhHHHHHHHHHHHhhccceEEEecc Confidence 222333333211 12345666666654 468999988643 356788888888888752 11 Q ss_pred eEEEEecC---C-------------------CCcCcceeEEecCCeEecC---Cc--eecHHHHHHHHHHHhcCccccc- Q lcl|NC_019422. 129 IYKAVLPN---I-------------------SDANEKAIINFATTGIKVG---EK--SYTTAEYTARLAGILAGISLSE- 180 (355) Q Consensus 129 ~~~aVl~~---~-------------------~~~d~egIinv~n~~i~~~---~~--~~~~~~~~a~vAG~~Ag~~~~~- 180 (355) ...+++.. . ...+++..+.+.+-....+ +. .+++ ++++||++|....++ T Consensus 415 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~---sg~vAGl~Ar~D~~~g 491 (666) T protein:vir:65 415 PRSTVVNIPVTTAIDNLIAWREGSGNYNENNMNINTTYAVIDGNYKYQYDKYNDVNRWVPL---AADIAGLCARTDAVSQ 491 (666) T ss_pred ccceeeecCCCCCHHHHHHHHHhcccccccccccCcceEEEEcCceEEecccCCceeEech---HHHHHHHHHHHhccCC Confidence 11121111 0 0133444444444332221 22 2343 799999999886554 Q ss_pred ---ccccccCCcc---cc---cCChhhHHHHHhCCeEEEEE--CCcEEEEecCccccccCCCCCchhhhhhhHhhHHHHH Q lcl|NC_019422. 181 ---SCTYFILDEV---TE---IEPTENPDEAVEEGKLILIN--NNGIRIARGVNSLITLSKEDTEDLKKIKIVEAIDMIQ 249 (355) Q Consensus 181 ---S~T~~~~~~~---~~---~~~~~e~~~ai~~G~lvl~~--dg~v~I~~~INSltt~~~~k~~~f~kirvvr~~D~i~ 249 (355) |+.++++.++ .+ .+++.|.+.+..+|.-++.+ ++++++--+ .|+ ...+..|+.|.++|++|.|. 
T Consensus 492 ~~~span~~~~~i~g~~~~~~~~~~~~~~~Ln~~gIn~i~~~~~~G~~~wG~-rT~----~~~~s~~~~i~vrR~~~~i~ 566 (666) T protein:vir:65 492 PWMSPAGYNRGQIMNVVKLAIEPRKAHRDRLYQAAINPVIGAGGEGFILMGD-KTA----TTVPSPFDRINVRRLFNMLK 566 (666) T ss_pred cEEccCCeecceeeccccceeecChhHHHhhhhCCceEEEEeCCCeEEEEec-ccC----CCCCcccceEehhhHHHHHH Confidence 6777655443 22 24678999999999887765 346766544 333 22335799999999999999 Q ss_pred HHHHHHHhhccccccCCCHHHHHHHHHHHHHHHHHHHhcccccCCCCceeEeccccchhhhhccccccccccceeeeccC Q lcl|NC_019422. 250 DDILQTWNENYVGKVTNKYDNKILFLSAVNNYFKELQRDEVLDNSQEAYAQIDIEAHKKYLKEKGIDYSEMTEQQIKEAN 329 (355) Q Consensus 250 ~di~~~~~~~yiGK~~N~~~gr~~~~~~i~~yl~~l~~~g~I~~~~~~~v~id~e~q~~~~~~~~~d~~~~~d~~v~~~~ 329 (355) +.|++.... |+++ +|+..-|..++..|+.||.+|.++|+|..| .+.+|.+-. ....+ T Consensus 567 ~si~~~~~~-~v~e-pn~~~l~~~i~~~i~~~L~~l~~~gal~g~---~V~~d~~~n--------------t~~~i---- 623 (666) T protein:vir:65 567 KNIGDSSKY-KLFE-NNDNFTRASFRMEVSQYLSTIRSLGGIYDF---RVQCDTTNN--------------TPDVI---- 623 (666) T ss_pred HHHHHHHHH-hccC-CCCHHHHHHHHHHHHHHHHHHHhCCceeee---EEEEcCCCC--------------CHHHh---- Confidence 999999876 9999 799999999999999999999999999986 466654422 11111 Q ss_pred CCCEEEEEEEEEEEeeeeEEEEEEeC Q lcl|NC_019422. 330 TGSYVFVEGNITITDAMEDLKFKIYM 355 (355) Q Consensus 330 ~~d~v~~~~~i~~vdamEkiy~tv~v 355 (355) ..-.+++.+.++|+-.||.|.+++.- T Consensus 624 ~~G~~~~~i~~~p~~pae~i~~~~~~ 649 (666) T protein:vir:65 624 DRNEFVASMFIKPAKSINYIMLNFTA 649 (666) T ss_pred hCCeEEEEEEEEecCCcceEEEEEEE Confidence 13467999999999999999999887 No 44 >protein:vir:1996 Length: 495 # NCBI annotation: major tail subunit # Family: family:all:369 # MgeID: mge:320 # MgeName: Mu # Cross-refs: genbank:acc:NP_050643;genbank:gi:9633530;genbank:GeneID:2636269 Probab=99.58 E-value=1.1e-14 Score=97.09 Aligned_cols=295 Identities=9% Similarity=0.117 Sum_probs=188.6 Q ss_pred CCCCceEEEe----------eeeeeeeecCCCceeEEEEEecCCccceeEEEeehhhhhhhhhhHHHHHHHHhhhccccc Q lcl|NC_019422. 
1 MGLPSAIIEF----------QRRSRTVKFRSRRGVVALILKDSTAIKKSYSIDFLTDINETEFTKENYDYIRLAFLGKPS 70 (355) Q Consensus 1 ~g~P~~~i~f----------~~~a~ta~~~~~rG~v~iil~d~~~~~~~~~~~~~~d~~~~~~~~~n~~~i~~a~~g~~~ 70 (355) -.||-. ... -..+.||++.|+ |+- .+....+.. ....|. T Consensus 153 ~~lPvT-A~~~~~~~~~~a~~~VtlTAr~kG~-~n~---------idi~~~~~~--------------------ge~~p~ 201 (495) T protein:vir:19 153 PDLPVT-AEVRADSGDDDTHADVVLSAKFTGA-LSA---------VDVRWNYYA--------------------GETTPY 201 (495) T ss_pred ccCceE-EEeeccCCCCcCceeEEEEEeeccc-ccc---------ceeEEEeec--------------------cccccc Confidence 345522 110 122233333332 100 000011110 111233 Q ss_pred eE--EEEecCCCccchhHHHHHHHHHhcccceEEEEcCCChHHHHHHHHHHHHHHHh--cCCeEEEEecCC--------- Q lcl|NC_019422. 71 KV--IVEVINDSVDSERSLDDALKALRENKFNYLAIPFISEEVDKTKIVNWIKTARR--EKEIYKAVLPNI--------- 137 (355) Q Consensus 71 ~v--~l~~g~~g~~~~~~y~~al~~le~~~fn~l~~p~~~d~~~~~~~~~~ik~~r~--~g~~~~aVl~~~--------- 137 (355) ++ .++..++|+. +.|++++|.+|....||++++|-.+.++ -+.+.+|+..... +.+..+++.+.. T Consensus 202 Glt~titamsgGag-~PDia~alaal~~~~~~~I~~P~tD~as-L~al~~~l~~rw~~~~q~~g~~~~a~~gT~~~l~t~ 279 (495) T protein:vir:19 202 GIITAFKAASGKNG-NPDISASIAGMGDLQYKYIVMPYTDEPN-LNLLRTELQERWGPVNQADGFAVTVLSGTYGDISTF 279 (495) T ss_pred ceeEEEEecCCCCC-CcchHHHHHHhccCCCcEEEEecCcHHH-HHHHHHHHHHhhhHHHhcCeEEEEeecCCHHHHHHh Confidence 33 3444555543 5689999999999999999999754444 4689999987444 356666666532 Q ss_pred -CCcCcceeEEecCCeEecCCceecHHHHHHHHHHHhc---CcccccccccccCCccc-----ccCChhhHHHHHhCCeE Q lcl|NC_019422. 138 -SDANEKAIINFATTGIKVGEKSYTTAEYTARLAGILA---GISLSESCTYFILDEVT-----EIEPTENPDEAVEEGKL 208 (355) Q Consensus 138 -~~~d~egIinv~n~~i~~~~~~~~~~~~~a~vAG~~A---g~~~~~S~T~~~~~~~~-----~~~~~~e~~~ai~~G~l 208 (355) ...|+++|--+. .+|..-++++++|..||.+| ..+..+++.--.|+|+. 
++++.+|.+.++.+|.- T Consensus 280 g~~~N~~~it~~~-----~~gsp~~~~~~AAA~aa~~A~~l~~DPArPL~tl~L~Gi~~p~~~~r~~~~ern~LL~~Gis 354 (495) T protein:vir:19 280 GVSRNDHLISCMG-----IAGAPEPSYLYAATLCAVASQALSIDPARPLQTLTLPGRMPPAVGDRFTWSERNALLFDGIS 354 (495) T ss_pred hhccCCceEEEEe-----cCCCCCcHHHHHHHHHHHHHHHhhcccccccCceeecceecCCccccCChHHHHHHHhCCcc Confidence 234777776543 24566678888888888876 35566777777777765 46788999999999999 Q ss_pred EEEE--CCcEEEEecCccccccC-CCCCchhhhhhhHhhHHHHHHHHHHHHhhcccc-ccCCCHHH---------HHHHH Q lcl|NC_019422. 209 ILIN--NNGIRIARGVNSLITLS-KEDTEDLKKIKIVEAIDMIQDDILQTWNENYVG-KVTNKYDN---------KILFL 275 (355) Q Consensus 209 vl~~--dg~v~I~~~INSltt~~-~~k~~~f~kirvvr~~D~i~~di~~~~~~~yiG-K~~N~~~g---------r~~~~ 275 (355) .|.- +|.|.|+|.|+|.++=. ...+..|..|.+++++|.+.+++|.-+..+|=+ |+.++..+ -..++ T Consensus 355 t~~V~~~G~V~I~R~ITTY~~n~~G~~D~syLDi~T~~tl~yvr~~~r~~i~~kfpR~KLa~d~~~~~~gq~IvTp~~ir 434 (495) T protein:vir:19 355 TFNVNDGGEMQIERMITMYRTNKYGDSDPSYLNVNTIATLSYLRYSLRTRITQKFPNYKLASDGTRFATGQAVVTPSVIK 434 (495) T ss_pred eEEECCCCeEEEEeeeeeeeecCCCCcchhhhhhHHHHHHHHHHHHHHHHHhhhcCCcccccCCCCCCCcccccChHHHH Confidence 9864 56899999999987664 577889999999999999999999999998965 45444111 23578 Q ss_pred HHHHHHHHHHHhcccccCCCCceeEeccccchhhhhccccccccccceeeeccCC-CCEEEEEEEEEEEeeeeEEEEEEe Q lcl|NC_019422. 276 SAVNNYFKELQRDEVLDNSQEAYAQIDIEAHKKYLKEKGIDYSEMTEQQIKEANT-GSYVFVEGNITITDAMEDLKFKIY 354 (355) Q Consensus 276 ~~i~~yl~~l~~~g~I~~~~~~~v~id~e~q~~~~~~~~~d~~~~~d~~v~~~~~-~d~v~~~~~i~~vdamEkiy~tv~ 354 (355) +++..-+++|+..|++++.+.+.-.+-+ ++... .+-|-+.+....++-+--|=..+. T Consensus 435 ~ell~~~~~le~~given~~~~~~~LiV----------------------erd~~dpnRln~~~p~d~vn~L~V~A~~i~ 492 (495) T protein:vir:19 435 TELLALFEEWENAGLVEDFDTFKEELYV----------------------ARNKDDKDRLDVLCGPNLINQFRIFAAQVQ 492 (495) T ss_pred HHHHHHHHhhhhhccccChhhhcceeEE----------------------EECCCCCcEEEEEecceeeCceeeeeeeee Confidence 8998888999999999998544322222 22222 234445554444444443333333 Q ss_pred C Q lcl|NC_019422. 
355 M 355 (355) Q Consensus 355 v 355 (355) . T Consensus 493 f 493 (495) T protein:vir:19 493 F 493 (495) T ss_pred e Confidence 3 No 45 >protein:vir:104858 Length: 729 # NCBI annotation: T4-like tail sheath protein # Family: family:all:661 # MgeID: mge:1630 # MgeName: P-SSM2 # Cross-refs: genbank:acc:YP_214361;genbank:gi:61806001;genbank:GeneID:3294378 Probab=99.58 E-value=1.6e-15 Score=101.70 Aligned_cols=316 Identities=9% Similarity=0.048 Sum_probs=171.7 Q ss_pred CCCCceEE---EeeeeeeeeecCCCcee-EEEEEecCCccceeEEEeeh--hhhh---------hhhhhHHHHHHHHhh- Q lcl|NC_019422. 1 MGLPSAII---EFQRRSRTVKFRSRRGV-VALILKDSTAIKKSYSIDFL--TDIN---------ETEFTKENYDYIRLA- 64 (355) Q Consensus 1 ~g~P~~~i---~f~~~a~ta~~~~~rG~-v~iil~d~~~~~~~~~~~~~--~d~~---------~~~~~~~n~~~i~~a- 64 (355) .|-|+.-. .+-+.+..+.......+ ...++.... .+.+... .... ..........+.... T Consensus 315 ~~~~g~vve~~~~~s~~~~~~~~~~~~~~~~~vi~~~s----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~ 390 (729) T protein:vir:10 315 TGNSGTILEKHLSLSKAKDAEYSVGSSSYWRDFLATNS----KYIFGGGATSGITTTGYSVSSTNTLDTDSGWDQNAEGV 390 (729) T ss_pred ccCcccceeeeeeeeeccccccccccccccceeecccc----ceeeecccccccccccccccccceeccccccccccccc Confidence 23332111 11111111111000000 000110000 0000000 0000 000000000000000 Q ss_pred --hccccceEEEEecCCCc------------cchhHHHHHHHHHhcc---cceEEEEc-----CCChHHHHHHHHHHHHH Q lcl|NC_019422. 65 --FLGKPSKVIVEVINDSV------------DSERSLDDALKALREN---KFNYLAIP-----FISEEVDKTKIVNWIKT 122 (355) Q Consensus 65 --~~g~~~~v~l~~g~~g~------------~~~~~y~~al~~le~~---~fn~l~~p-----~~~d~~~~~~~~~~ik~ 122 (355) ....+..+.+.++.++. ....++..++.+|+.. 
.++.+.++ +.....++..+.+++++ T Consensus 391 ~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~g~~~l~~~~~~~~~~~~~~~~~~~~~~~~~v~~a~~~~~~~ 470 (729) T protein:vir:10 391 NFGASGVATLTLAGGTNYGDKTDLTTSGALSSGVDDIISGYTLFENTEEIEVDFILMGAAHHPKEQSQAVAEKVTAVAEA 470 (729) T ss_pred cccccceeEEEeecccccccccccccccccccchhHHHHHHHHhhcccccccceeeecCCCCCccchHHHHHHHHHHHHh Confidence 01112223344443321 1234567788888654 34544443 22345677888888888 Q ss_pred HHhcCCeEEEEecCC------------------C------------CcCcceeEEecCCeEec---CC--ceecHHHHHH Q lcl|NC_019422. 123 ARREKEIYKAVLPNI------------------S------------DANEKAIINFATTGIKV---GE--KSYTTAEYTA 167 (355) Q Consensus 123 ~r~~g~~~~aVl~~~------------------~------------~~d~egIinv~n~~i~~---~~--~~~~~~~~~a 167 (355) +|+ +.+++... . -.+++....+.+-.... ++ ..+++ ++ T Consensus 471 ~~~----~~a~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~d~~~~~~~~~p~---s~ 543 (729) T protein:vir:10 471 RKD----AVAFISPYRQAFLNDTVSGTVTVSNIDQTTENVVGFYAPLSSSTYSVFDSGYKYMFDRFNNTFRYVPL---NG 543 (729) T ss_pred cCC----eEEEecccccccccccccccccccccchhhHHHHHHHhhccCCceEEEEcCeeEEecccCCceEEech---hH Confidence 764 22222100 0 01223333333322211 12 12343 89 Q ss_pred HHHHHhcCccccc----ccccccCCccc---c---cCChhhHHHHHhCCeEEEEE--CCcEEEEecCccccccCCCCCch Q lcl|NC_019422. 168 RLAGILAGISLSE----SCTYFILDEVT---E---IEPTENPDEAVEEGKLILIN--NNGIRIARGVNSLITLSKEDTED 235 (355) Q Consensus 168 ~vAG~~Ag~~~~~----S~T~~~~~~~~---~---~~~~~e~~~ai~~G~lvl~~--dg~v~I~~~INSltt~~~~k~~~ 235 (355) ++||++|.+...+ |+.++++.++. . .+++.|.+.+-.+|.-++.+ ++++++--+ .|+. ..+.. 
T Consensus 544 ~~aGl~a~~d~~~g~~~span~~~~~i~g~~~~~~~~~~~~~~~Ln~~gIn~i~~~~~~G~~~wG~-rT~~----~~d~~ 618 (729) T protein:vir:10 544 DIAGTCARTDIEQFPWFSPAGTARGPILNSVKLVYNPGKKQRDILYSNRINPVILSPGAGIILFGD-KTGF----GKSSA 618 (729) T ss_pred HHHHHHHHhhccCCcEEccCCccccceecccceeeecChhhHhhhhhCCceEEEEecCCeEEEEcc-eecC----CCCcc Confidence 9999999987654 78887765443 2 24678888888999877764 446666544 4442 23468 Q ss_pred hhhhhhHhhHHHHHHHHHHHHhhccccccCCCHHHHHHHHHHHHHHHHHHHhcccccCCCCceeEeccccchhhhhcccc Q lcl|NC_019422. 236 LKKIKIVEAIDMIQDDILQTWNENYVGKVTNKYDNKILFLSAVNNYFKELQRDEVLDNSQEAYAQIDIEAHKKYLKEKGI 315 (355) Q Consensus 236 f~kirvvr~~D~i~~di~~~~~~~yiGK~~N~~~gr~~~~~~i~~yl~~l~~~g~I~~~~~~~v~id~e~q~~~~~~~~~ 315 (355) |+.|.++|++|+|.+-|++.+.. |+++ +|+..-|..++..|+.||.+|.++|+|..| .|..|.+-.. T Consensus 619 ~~~i~vrR~~~~i~~si~~~~~~-~v~e-pn~~~~~~~i~~~i~~~L~~l~~~g~l~g~---~v~~d~~~nt-------- 685 (729) T protein:vir:10 619 FDRINVRRLFIYLEDAISAAAKD-QLFE-FNDELTRTNFVNIVEPFLRDVQAKRGIFDF---VVICDETNNT-------- 685 (729) T ss_pred cceeehhhhHHHHHHHHHHHHHH-hhcC-CCCHHHHHHHHHHHHHHHHHHHhccceeee---EEEEcCCCCC-------- Confidence 99999999999999999999876 9999 799999999999999999999999999886 4555543221 Q ss_pred ccccccceeeeccCCCCEEEEEEEEEEEeeeeEEEEEEeC Q lcl|NC_019422. 316 DYSEMTEQQIKEANTGSYVFVEGNITITDAMEDLKFKIYM 355 (355) Q Consensus 316 d~~~~~d~~v~~~~~~d~v~~~~~i~~vdamEkiy~tv~v 355 (355) ...+. 
.-.+++.+.+.|+-.||.|.+++.- T Consensus 686 ------~~~i~----~G~~~~~v~~~p~~p~e~i~~~~~~ 715 (729) T protein:vir:10 686 ------AAVID----SNEFVADIFIKPARSINFIGLTFVA 715 (729) T ss_pred ------HHHhh----CCeEEEEEEEEecCCccEEEEEEEE Confidence 11111 2468999999999999999999876 No 46 >protein:vir:5663 Length: 671 # NCBI annotation: tail sheath protein # Family: family:all:661 # MgeID: mge:119 # MgeName: KVP40 # Cross-refs: genbank:acc:NP_899602;genbank:gi:34419589;genbank:GeneID:2545776 Probab=99.55 E-value=6.5e-14 Score=92.89 Aligned_cols=321 Identities=10% Similarity=0.055 Sum_probs=176.2 Q ss_pred CCCCceEEEeeee----------------eeeeecCCCceeE------------------------------EEEEecCC Q lcl|NC_019422. 1 MGLPSAIIEFQRR----------------SRTVKFRSRRGVV------------------------------ALILKDST 34 (355) Q Consensus 1 ~g~P~~~i~f~~~----------------a~ta~~~~~rG~v------------------------------~iil~d~~ 34 (355) +|.|.+...+..- ..++..++..+.+ .+++..+. T Consensus 221 ~~~~~~~a~~~g~~g~~~~v~v~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g 300 (671) T protein:vir:56 221 QGFPRLSARYVGDFGDAISVEIINYADYQTAFAFAAGHTLGDIELPIYPDGGTRSINLSSYFTFGPSNSNQYAVIVRVSG 300 (671) T ss_pred ccccccccccccccCcceEEEEecccccccccccccceeeeeccccccccccccccccceeecccccccccceeEEeecC Confidence 3333333222110 0011111000000 01111111 Q ss_pred ccceeEEEeehhhhhhhhhhHHHHHHHHhhhc-c--------------ccceEEEEecCCCccchhHHHHHHHHHhcc-- Q lcl|NC_019422. 35 AIKKSYSIDFLTDINETEFTKENYDYIRLAFL-G--------------KPSKVIVEVINDSVDSERSLDDALKALREN-- 97 (355) Q Consensus 35 ~~~~~~~~~~~~d~~~~~~~~~n~~~i~~a~~-g--------------~~~~v~l~~g~~g~~~~~~y~~al~~le~~-- 97 (355) .....+.+......... .....++...+. + .+....+.+|.+++....++.+++++++.. 
T Consensus 301 ~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~d~~~~~~~~~~~~~~~~~~~~ 377 (671) T protein:vir:56 301 EVEEAFIVSTNPGDKDV---NGQSIFIDEYFENSGSAYITAIAEGWKTESGAYNFGGGSDANAGADDWMFGLDMLSDPEV 377 (671) T ss_pred ccceeEEEeeccccccc---chhhhhhhhhhcccCceEEEecCcccCCccccccccCccccccchhHHHHHHHhhhhccc Confidence 11111111110000000 000111111110 0 111223556777766667888899998753 Q ss_pred -cceEEEEcCCCh-HH--HHHHHHHHHHHHHhcCCeEEEEecCC----------------------------------CC Q lcl|NC_019422. 98 -KFNYLAIPFISE-EV--DKTKIVNWIKTARREKEIYKAVLPNI----------------------------------SD 139 (355) Q Consensus 98 -~fn~l~~p~~~d-~~--~~~~~~~~ik~~r~~g~~~~aVl~~~----------------------------------~~ 139 (355) ..+++..|.... +. .+....+.+..+.+..+.+.+++..- .. T Consensus 378 ~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 457 (671) T protein:vir:56 378 LYTNLVIAGNAAAEEVSIASTVQKYAIDSVGNVRQDCVVFVSPPQSLIVNKQAGTAVANIQGWRTGIDPTNGQAVVDNLN 457 (671) T ss_pred cceeEEEcCCCCCccchhHHHHHHHHHHHHHhhcCCEEEEEeccccccccccccccHHHHHHHhhhccccchhhhhhhcc Confidence 456666653221 11 12233333444444444566655310 01 Q ss_pred cCcceeEEecCCeEecC---Cc--eecHHHHHHHHHHHhcCccccc----ccccccCCcc---cc---cCChhhHHHHHh Q lcl|NC_019422. 140 ANEKAIINFATTGIKVG---EK--SYTTAEYTARLAGILAGISLSE----SCTYFILDEV---TE---IEPTENPDEAVE 204 (355) Q Consensus 140 ~d~egIinv~n~~i~~~---~~--~~~~~~~~a~vAG~~Ag~~~~~----S~T~~~~~~~---~~---~~~~~e~~~ai~ 204 (355) .++...+.+.+-....+ +. .+++ ++++||+.|.....+ |+.++++.++ .. .+++.|.+.+.. T Consensus 458 ~~s~~~~~~~p~~~~~d~~~~~~~~~p~---s~~~AGl~Ar~D~~~g~~~span~~~~~i~g~~~~~~~~~~~~~~~Ln~ 534 (671) T protein:vir:56 458 VSTTYAVIDGNYKYQYDKYNDRNRWVPL---AGDIAGLCAYTDQVSQPWMSPAGFNRGQIKGVNRLAVDLRRAHRDALYQ 534 (671) T ss_pred CCcceEEEecCceEEecccCCceeEech---HHHHHHHHHHhhccCCcEECcCCceeccccccccceeecChhHHHHHhh Confidence 12333333333322211 11 2343 799999999887554 8888765433 22 246789999999 Q ss_pred CCeEEEEE--CCcEEEEecCccccccCCCCCchhhhhhhHhhHHHHHHHHHHHHhhccccccCCCHHHHHHHHHHHHHHH Q lcl|NC_019422. 
205 EGKLILIN--NNGIRIARGVNSLITLSKEDTEDLKKIKIVEAIDMIQDDILQTWNENYVGKVTNKYDNKILFLSAVNNYF 282 (355) Q Consensus 205 ~G~lvl~~--dg~v~I~~~INSltt~~~~k~~~f~kirvvr~~D~i~~di~~~~~~~yiGK~~N~~~gr~~~~~~i~~yl 282 (355) +|.-++.+ +++.++--+ .|+ ...+..|+.|.++|++|+|.+.|++.... |+++ +|+..-|..++..|+.|| T Consensus 535 ~gIn~i~~~~~~G~~~wG~-rT~----~~~~~~~~~i~vrR~~~~i~~si~~~~~~-~v~e-pn~~~~~~~i~~~i~~fL 607 (671) T protein:vir:56 535 IGINPVVGFAGQGFVLYGD-KTA----TQQASAFDRINVRRLFNLLKKAISDAAKY-RLFE-LNDEFTRSSFKSEIDAYL 607 (671) T ss_pred CCceEEEEecCCeEEEEcc-eec----CCCCcccceEehhhHHHHHHHHHHHHHHH-hcCC-CCCHHHHHHHHHHHHHHH Confidence 99777654 445555333 444 22346799999999999999999998775 9999 799999999999999999 Q ss_pred HHHHhcccccCCCCceeEeccccchhhhhccccccccccceeeeccCCCCEEEEEEEEEEEeeeeEEEEEEeC Q lcl|NC_019422. 283 KELQRDEVLDNSQEAYAQIDIEAHKKYLKEKGIDYSEMTEQQIKEANTGSYVFVEGNITITDAMEDLKFKIYM 355 (355) Q Consensus 283 ~~l~~~g~I~~~~~~~v~id~e~q~~~~~~~~~d~~~~~d~~v~~~~~~d~v~~~~~i~~vdamEkiy~tv~v 355 (355) .+|.++|+|..| .|.+|.+-.. ...+. .-.+++.+.++|+-.+|.|.+++.- T Consensus 608 ~~l~~~gal~g~---~v~~d~~~nt--------------~~~i~----~G~~~~~i~~~p~~Pae~I~~~~~~ 659 (671) T protein:vir:56 608 TNIQDLGGVYDF---RVVCDETNNP--------------GSVID----RNEFVASIYVKPAKSINFITLNFVA 659 (671) T ss_pred HHHHhCCceeee---EEEEcCCCCC--------------HHHhh----CCeEEEEEEEEecCCcceEEEEEEE Confidence 999999999986 4666644221 11111 2467999999999999999999876 No 47 >protein:vir:98263 Length: 664 # NCBI annotation: gp18 tail sheath monomer # Family: family:all:661 # MgeID: mge:1667 # MgeName: RB43 # Cross-refs: genbank:acc:YP_239196;genbank:gi:66391671;genbank:GeneID:3416365 Probab=99.54 E-value=8.3e-15 Score=97.79 Aligned_cols=324 Identities=9% Similarity=0.026 Sum_probs=176.2 Q ss_pred CCCCce----------EEEeeeeeeeeecC------------------------CCce-eEEEEEecCCccceeEEEeeh Q lcl|NC_019422. 
1 MGLPSA----------IIEFQRRSRTVKFR------------------------SRRG-VVALILKDSTAIKKSYSIDFL 45 (355) Q Consensus 1 ~g~P~~----------~i~f~~~a~ta~~~------------------------~~rG-~v~iil~d~~~~~~~~~~~~~ 45 (355) .+.|.+ .+++.-.+..+... .... ...++...+....+++.+... T Consensus 218 ~~~~~~~a~~~G~~Gn~isv~i~s~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~ 297 (664) T protein:vir:98 218 YQIPSVVALYPGELGSTVQVEIISKAAYDTGAMISGYPSGISVKNSGRSVMTYGPQTDNQYAFVVRRGGIVQESFIVSTD 297 (664) T ss_pred cccceeeeeecccccceeeeeecccccccCcceEeeccCceecccceeeeeeccccCccceeEEEecCCceeeeEEeecc Confidence 111111 11110000000000 0000 011112222222222222211 Q ss_pred hhhhhhhhhHH---------HHHHHHhhhccccc---e-EEEEecCCCc--cchhHHHHHHHHHhc---ccceEEEEcCC Q lcl|NC_019422. 46 TDINETEFTKE---------NYDYIRLAFLGKPS---K-VIVEVINDSV--DSERSLDDALKALRE---NKFNYLAIPFI 107 (355) Q Consensus 46 ~d~~~~~~~~~---------n~~~i~~a~~g~~~---~-v~l~~g~~g~--~~~~~y~~al~~le~---~~fn~l~~p~~ 107 (355) .+......... ...++...-.+.|. . ..+.+|.+.. .+..++..+|.+|+. +..+.|++|+. T Consensus 298 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~~~~~~g~~~~~tgl~~l~~~~~~~~~ll~~p~~ 377 (664) T protein:vir:98 298 KTDKDIYGVNIYMDDFFANGGSQYVFGTSMNWPKGFSGILEFGGGLSSNDTVGADELMTGWDMFADREALHVPLLIAGGC 377 (664) T ss_pred cCcccceeeeeechhheecccceeeeeecccCCcccceeEeccCccccccccCchhHHHHHHhhhcccccccceEEecCC Confidence 11111000000 00111111111111 1 1233444332 234556677777764 45789999974 Q ss_pred Ch------HHHHHHHHHHHHHHHhc----CCeEEEEecCC--------------------------CCcCcceeEEecCC Q lcl|NC_019422. 108 SE------EVDKTKIVNWIKTARRE----KEIYKAVLPNI--------------------------SDANEKAIINFATT 151 (355) Q Consensus 108 ~d------~~~~~~~~~~ik~~r~~----g~~~~aVl~~~--------------------------~~~d~egIinv~n~ 151 (355) +. .+++..+.++++++|+- .-...+++... 
..+|++..+.+.+- T Consensus 378 ~~~~~~~~~~v~~al~~~a~~~~~~~a~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~l~~p~ 457 (664) T protein:vir:98 378 AGESVEIASTVQKHVISIGDERQDCTVFVSPPRSLLVNIPLATAVDNIVEWRTGYKISGGTPVDNNLNVSSSYGFLDGNY 457 (664) T ss_pred CCCcHHHHHHHHHHHHHHHHhcCCeEEEEccccceeccCCccccHHHHHHHhhhccccccchhhhhcCCccceEEEEcCe Confidence 32 13667777777777632 11112222110 11344444444443 Q ss_pred eEecC---Cc--eecHHHHHHHHHHHhcCcccc----cccccccCCccc---c---cCChhhHHHHHhCCeEEEE--ECC Q lcl|NC_019422. 152 GIKVG---EK--SYTTAEYTARLAGILAGISLS----ESCTYFILDEVT---E---IEPTENPDEAVEEGKLILI--NNN 214 (355) Q Consensus 152 ~i~~~---~~--~~~~~~~~a~vAG~~Ag~~~~----~S~T~~~~~~~~---~---~~~~~e~~~ai~~G~lvl~--~dg 214 (355) ....+ +. .++ .++.+||+.|.+... +|+.++++.++. + .+++.|.+.+-.+|.-++. .++ T Consensus 458 ~~~~d~~~~~~~~~p---~sg~~AGl~A~~D~~~g~~~span~~~~~i~g~~~~~~~~~~~~~~~Ln~~gIn~i~~~~~~ 534 (664) T protein:vir:98 458 KYQYDKYNDVNRWVP---LAGDIAGLCVYTDSVANPWMSPAGYNRGQIRNCIKLAIEPRTAHRDAMYQVQINPVTGFAGG 534 (664) T ss_pred EEEecccCCceEEec---hHHHHHHHHHHhhhcCCcEECcCCceeeeeeccccceeecChhhHHHHHhCCCeEEEEeeCC Confidence 32222 11 133 388999999988754 477777654433 2 2467899999999975553 343 Q ss_pred -cEEEEecCccccccCCCCCchhhhhhhHhhHHHHHHHHHHHHhhccccccCCCHHHHHHHHHHHHHHHHHHHhcccccC Q lcl|NC_019422. 215 -GIRIARGVNSLITLSKEDTEDLKKIKIVEAIDMIQDDILQTWNENYVGKVTNKYDNKILFLSAVNNYFKELQRDEVLDN 293 (355) Q Consensus 215 -~v~I~~~INSltt~~~~k~~~f~kirvvr~~D~i~~di~~~~~~~yiGK~~N~~~gr~~~~~~i~~yl~~l~~~g~I~~ 293 (355) +.++ -|-.|+. ..+..|+.|.++|++|.|.+.|++.... |+++ +|+..-|..++..|+.||.+|.++|+|.. 
T Consensus 535 ~G~~~-wG~rT~~----~~~s~~~~i~vrR~~~~i~~si~~~~~~-~v~e-pn~~~l~~~i~~~i~~~L~~l~~~gal~g 607 (664) T protein:vir:98 535 SGFVL-YGDKTLT----SVPSPFDRINVRRLFNMIKKDIGDNAKY-KLFE-NNDDFTRASFRMDTGQYMTNIRALGGCYD 607 (664) T ss_pred CcEEE-EcccccC----CCCcccceEeehhHHHHHHHHHHHHHHH-hhcC-CCCHHHHHHHHHHHHHHHHHHHhcCceee Confidence 4444 3445552 2335799999999999999999998775 9999 79999999999999999999999999998 Q ss_pred CCCceeEeccccchhhhhccccccccccceeeeccCCCCEEEEEEEEEEEeeeeEEEEEEeC Q lcl|NC_019422. 294 SQEAYAQIDIEAHKKYLKEKGIDYSEMTEQQIKEANTGSYVFVEGNITITDAMEDLKFKIYM 355 (355) Q Consensus 294 ~~~~~v~id~e~q~~~~~~~~~d~~~~~d~~v~~~~~~d~v~~~~~i~~vdamEkiy~tv~v 355 (355) | .|.+|.+-. ....+. .-.+++.+.++|+-.+|.|.+++.- T Consensus 608 ~---~V~~d~~~n--------------t~~~i~----~G~~~~~i~~~p~~pae~I~~~~~q 648 (664) T protein:vir:98 608 Y---RVICDTTNN--------------TPDVID----RNEFVATVYVKPPRSINYITLNFVA 648 (664) T ss_pred e---EEEEcCCCC--------------CHHHhh----CCeEEEEEEEEecCCcceEEEEEEE Confidence 6 466664422 111122 2467999999999999999999887 No 48 >protein:vir:104477 Length: 749 # NCBI annotation: gp18 # Family: family:all:661 # MgeID: mge:1548 # MgeName: P-SSM4 # Cross-refs: genbank:acc:YP_214663;genbank:gi:61806304;genbank:GeneID:3294532 Probab=99.50 E-value=3.4e-14 Score=94.40 Aligned_cols=319 Identities=11% Similarity=0.083 Sum_probs=171.8 Q ss_pred CCCCceEEE-eeee--eeeeecCCCce-eEEEEEecCCc---------cceeEEEeehhhh-hhh--------hhh--HH Q lcl|NC_019422. 1 MGLPSAIIE-FQRR--SRTVKFRSRRG-VVALILKDSTA---------IKKSYSIDFLTDI-NET--------EFT--KE 56 (355) Q Consensus 1 ~g~P~~~i~-f~~~--a~ta~~~~~rG-~v~iil~d~~~---------~~~~~~~~~~~d~-~~~--------~~~--~~ 56 (355) -|.|+.-.. |-.+ +..++...... .+.-++..... .....+. +..+- ... .+. .. 
T Consensus 336 ~~~~g~v~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~v~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~ 414 (749) T protein:vir:10 336 TGTVGALLERYIDVSKASDAKTSVGETNYYAEVIKQKSEFIYWAEHESTLYAATS-SASDGLFGQTAANRQFNLFRSAAG 414 (749) T ss_pred eecccceeeeeeeccccccccccccccchhhhhhccCCCEEEEEecccccccccc-cccccccccccccceeeccccccc Confidence 233332211 1111 11111111000 00111111000 0000000 00000 000 000 00 Q ss_pred HHHHHHh--hhccccceE---EEEecCCCc-------cchhHHHHHHHHHh---cccceEEEEc--CCCh---HHHHHHH Q lcl|NC_019422. 57 NYDYIRL--AFLGKPSKV---IVEVINDSV-------DSERSLDDALKALR---ENKFNYLAIP--FISE---EVDKTKI 116 (355) Q Consensus 57 n~~~i~~--a~~g~~~~v---~l~~g~~g~-------~~~~~y~~al~~le---~~~fn~l~~p--~~~d---~~~~~~~ 116 (355) ...+... .....+... .+.++.+++ .+..++..+++.|. ...++.++++ +.++ .+++..+ T Consensus 415 ~~~~~~~~~~~~~~~~~~~~~~~~gg~d~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~li~~~~~~~~~~~~~v~~al 494 (749) T protein:vir:10 415 SVDYPAGVTTLGSKNNATYYYRLSGGVNYTVSAGQYTITNTDIGSAYELIGDPESQIVDFIISGPSGTSDANALAKITSL 494 (749) T ss_pred cceeccccccccccCCcEEEEEccCCcccccccccccccchhHHHHHHHhhhhhhcccceEEEecCCCCcchhHHHHHHH Confidence 0000000 000011121 123333322 23457787777764 3467877653 2222 2466777 Q ss_pred HHHHHHHHhcCCeEEEEecCCC----------------------CcCcceeEEecCCeEec---CCc--eecHHHHHHHH Q lcl|NC_019422. 117 VNWIKTARREKEIYKAVLPNIS----------------------DANEKAIINFATTGIKV---GEK--SYTTAEYTARL 169 (355) Q Consensus 117 ~~~ik~~r~~g~~~~aVl~~~~----------------------~~d~egIinv~n~~i~~---~~~--~~~~~~~~a~v 169 (355) .++++++|+ +.+++.-.. ..+++..+.+.+-.... ++. .+++ ++++ T Consensus 495 ~~~~~~~~~----~~~~~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~---s~~v 567 (749) T protein:vir:10 495 VNIAEERRD----CMVFVSPRRGNVIGISNTTTITTNIVDFFKKLPSSSYMVFDSGYKYIYDKYNDVYRYIPC---NGDT 567 (749) T ss_pred HHHHhhcCC----EEEEEcCCCCcccccccchhhhhHHHHHHhhccCceeEEEEccceeeeccccCceEEech---HHHH Confidence 788877764 344442100 01233333333322211 222 2343 8999 Q ss_pred HHHhcCcccc----cccccccCC---cccc---cCChhhHHHHHhCCeEEEEE--CCcEEEEecCccccccCCCCCchhh Q lcl|NC_019422. 
170 AGILAGISLS----ESCTYFILD---EVTE---IEPTENPDEAVEEGKLILIN--NNGIRIARGVNSLITLSKEDTEDLK 237 (355) Q Consensus 170 AG~~Ag~~~~----~S~T~~~~~---~~~~---~~~~~e~~~ai~~G~lvl~~--dg~v~I~~~INSltt~~~~k~~~f~ 237 (355) ||++|.+... .|+.++++. |+.. .+++.|.+.+-.+|.-++.+ +.++++--+ .|+. ..+..|+ T Consensus 568 AGl~Ar~D~~~g~~~SPan~~~~~i~g~~~~~~~~~~~e~~~Ln~~gIn~i~~~~g~G~~~wG~-rT~~----s~d~~~~ 642 (749) T protein:vir:10 568 AGLCLQTNEISEPWFSPAGFQRGVLRNAIKLAYTPNKAQRDQLYANRVNPIVSFPGQGVVLYGD-KTAL----GFASAFD 642 (749) T ss_pred HHHHHHhhccCCcEECcCCceeeeeeccccceeecChhHHHhhhhCCceEEEEecCCeEEEEcc-eecC----CCCcccc Confidence 9999988744 478887643 4433 24678889999999776653 345655443 4442 2234799 Q ss_pred hhhhHhhHHHHHHHHHHHHhhccccccCCCHHHHHHHHHHHHHHHHHHHhcccccCCCCceeEeccccchhhhhcccccc Q lcl|NC_019422. 238 KIKIVEAIDMIQDDILQTWNENYVGKVTNKYDNKILFLSAVNNYFKELQRDEVLDNSQEAYAQIDIEAHKKYLKEKGIDY 317 (355) Q Consensus 238 kirvvr~~D~i~~di~~~~~~~yiGK~~N~~~gr~~~~~~i~~yl~~l~~~g~I~~~~~~~v~id~e~q~~~~~~~~~d~ 317 (355) .|.++|++|.|.+.|++.... |+++ ||+..-|..++..+..||.+|.++|+|+.| .|.+|.+.... T Consensus 643 ~i~vRRl~~~ie~si~~~~~~-~v~e-pn~~~l~~~i~~~i~~fL~~l~~~G~i~~f---~V~~d~~~Nt~--------- 708 (749) T protein:vir:10 643 RINIRRLFLTVERVISTAAKA-QLFE-QNDEAQRSLFINIVEPYLRDVQGRRGVVDF---LVKCDSTNNTP--------- 708 (749) T ss_pred eeehhhhHHHHHHHHHHHHHH-hhcC-CCCHHHHHHHHHHHHHHHHHHHhcCCeeee---EEEEcCCCCCH--------- Confidence 999999999999999998775 9998 799999999999999999999999999877 46666543221 Q ss_pred ccccceeeeccCCCCEEEEEEEEEEEeeeeEEEEEEeC Q lcl|NC_019422. 318 SEMTEQQIKEANTGSYVFVEGNITITDAMEDLKFKIYM 355 (355) Q Consensus 318 ~~~~d~~v~~~~~~d~v~~~~~i~~vdamEkiy~tv~v 355 (355) ..+. 
.-.+++.+.++|+-.+|.|.+++.- T Consensus 709 -----~~i~----~G~~~~~i~~~P~~pae~I~~~~~~ 737 (749) T protein:vir:10 709 -----EAVD----RGEFYAEVFLKPTRTINYVQLTFVA 737 (749) T ss_pred -----HHhh----CCEEEEEEEEEecCCccEEEEEEEE Confidence 1111 2468999999999999999999876 No 49 >protein:vir:79092 Length: 477 # NCBI annotation: gp13, phage tail sheath protein # Family: family:all:115 # MgeID: mge:1862 # MgeName: phiE255 # Cross-refs: genbank:acc:YP_001111213;genbank:gi:134288804;genbank:GeneID:4960766 Probab=99.50 E-value=6.6e-14 Score=92.83 Aligned_cols=317 Identities=14% Similarity=0.119 Sum_probs=169.5 Q ss_pred CCCCc-eEEEeeeee----ee-----------eecCCCce-eEEEEEecCCccceeEEEee--hhhhhhhhhhHHHHHH- Q lcl|NC_019422. 1 MGLPS-AIIEFQRRS----RT-----------VKFRSRRG-VVALILKDSTAIKKSYSIDF--LTDINETEFTKENYDY- 60 (355) Q Consensus 1 ~g~P~-~~i~f~~~a----~t-----------a~~~~~rG-~v~iil~d~~~~~~~~~~~~--~~d~~~~~~~~~n~~~- 60 (355) =|-+. +-+....-+ .. ........ ...+.+...... ....... ..+.... ....... T Consensus 74 ngg~~~~vvrV~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~-~~~~~~~~~~~~~~~~--~~~~~~~~ 150 (477) T protein:vir:79 74 YGSGTVIVINVLDPAVHKSNAASESVTFDAATGRAKLAHPAAANLVLKNDSGG-TTYTEGTDYAVDLING--VITRIKTG 150 (477) T ss_pred cCCceEEEEeccCCccccccccccccccccccccccccccccceeEEeecccc-cccccCccccccccch--hhhhhhcc Confidence 11111 111111100 00 00111100 111111111100 0000000 0000000 0000000 Q ss_pred --------HHhhhc-cccce---EEEEecCCCccchhHHHHHHHHHhcc---cceEEEEcCCC-hHHHHHHHHHHHHHHH Q lcl|NC_019422. 61 --------IRLAFL-GKPSK---VIVEVINDSVDSERSLDDALKALREN---KFNYLAIPFIS-EEVDKTKIVNWIKTAR 124 (355) Q Consensus 61 --------i~~a~~-g~~~~---v~l~~g~~g~~~~~~y~~al~~le~~---~fn~l~~p~~~-d~~~~~~~~~~ik~~r 124 (355) +...+. +.+.. ..+.+..+. ........+|...+.. 
..+.++.|+.+ +..+++.+.+.++++| T Consensus 151 ~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~a-~~~~tg~~al~~~~~~~~~~~~iv~apg~~~~~~v~~~l~~~~~~~~ 229 (477) T protein:vir:79 151 TIPAAATAAKATYDYADPTKVTAADIIGAVNA-AGMRTGMKALKDTYNLYGYFSKILIAPAYCTQNSVSVELEAMAVQLG 229 (477) T ss_pred ccccccceeeceeccCCcccceeeeecccccc-cccchhhhhhhhhhhhcccccceeeccccccchhHHHHHHHHHhhcC Confidence 000000 00110 011111111 1112233344444332 34667778643 3457777777777664 Q ss_pred hcCCeEEEEecCC-------------------CCcCcceeEEecCCeEecC---C--ceecHHHHHHHHHHHhcCcccc- Q lcl|NC_019422. 125 REKEIYKAVLPNI-------------------SDANEKAIINFATTGIKVG---E--KSYTTAEYTARLAGILAGISLS- 179 (355) Q Consensus 125 ~~g~~~~aVl~~~-------------------~~~d~egIinv~n~~i~~~---~--~~~~~~~~~a~vAG~~Ag~~~~- 179 (355) ..+++-.. ..++++.++.+.+-....+ + ..++ .++.+||++|..+.. T Consensus 230 -----~~a~~d~p~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~~~~~~~~~~~p---~s~~~ag~~a~~d~~~ 301 (477) T protein:vir:79 230 -----AIAYIDAPIGTTLAQALAGRGPAGTINFNTSSDRVRLCYPHVKVYDIATNAERLEP---LSSRAAGLRARVDLDK 301 (477) T ss_pred -----eEEEEecCCCCChHHHhhhhhhccccccccccceEEEEcCeeEEecccCCceeeec---hHHHHHHHHHHhhccC Confidence 33444211 1134555555554432211 1 2234 388999999987644 Q ss_pred ---cccccccCCccccc---------CChhhHHHHHhCCeEEEEE--CCcEEEEecCccccccCCCCCchhhhhhhHhhH Q lcl|NC_019422. 180 ---ESCTYFILDEVTEI---------EPTENPDEAVEEGKLILIN--NNGIRIARGVNSLITLSKEDTEDLKKIKIVEAI 245 (355) Q Consensus 180 ---~S~T~~~~~~~~~~---------~~~~e~~~ai~~G~lvl~~--dg~v~I~~~INSltt~~~~k~~~f~kirvvr~~ 245 (355) +|+.++++.|+... ++..|.+.+-++|.-++.+ +++.++--+-+. 
..+..+..|+.|.++|++ T Consensus 302 g~~~span~~~~gv~~~~~~~~~~~~~~~~~~~~L~~~~i~~i~~~~~~G~~~wG~rT~---~~~~~~~~~~~i~vrR~~ 378 (477) T protein:vir:79 302 GYWWSSSNQQLVGVTGVERPLSAMIDDPQSDVNMLNEQGITTVFSSYGSGLRLWGNRTA---AWPTVTHMRNFENVRRTG 378 (477) T ss_pred CceEccCCceeecceecccccccccCCChhhHHHHhhCCceEEEEecCCcEEEEccccc---CCCCCCccceeeehhhHH Confidence 58888888876532 1346788888999877754 457776554322 234555679999999999 Q ss_pred HHHHHHHHHHHhhccccccCCCHHHHHHHHHHHHHHHHHHHhcccccCCCCceeEeccccchhhhhccccccccccceee Q lcl|NC_019422. 246 DMIQDDILQTWNENYVGKVTNKYDNKILFLSAVNNYFKELQRDEVLDNSQEAYAQIDIEAHKKYLKEKGIDYSEMTEQQI 325 (355) Q Consensus 246 D~i~~di~~~~~~~yiGK~~N~~~gr~~~~~~i~~yl~~l~~~g~I~~~~~~~v~id~e~q~~~~~~~~~d~~~~~d~~v 325 (355) |.|.+.+++.... |+++ +|+..-+..++..|+.||.+|.++|+|..| .+.+|.+ .|....+ T Consensus 379 ~~i~~~~~~~~~~-~v~e-~~~~~~~~~i~~~i~~~l~~l~~~g~l~g~---~v~~~~~--------------~nt~~~i 439 (477) T protein:vir:79 379 DVINESLRYFSQQ-FVDA-PIDQGLIDSLVESVNGFGRKLIGDGALLGF---KAWFDPA--------------RNPKEEL 439 (477) T ss_pred HHHHHHHHHHHHH-hccC-CCCHHHHHHHHHHHHHHHHHHHhCCceeee---EEEEecC--------------CCCHHHh Confidence 9999999999876 9999 799999999999999999999999999876 3444432 2222222 Q ss_pred eccCCCCEEEEEEEEEEEeeeeEEEEEEeC Q lcl|NC_019422. 326 KEANTGSYVFVEGNITITDAMEDLKFKIYM 355 (355) Q Consensus 326 ~~~~~~d~v~~~~~i~~vdamEkiy~tv~v 355 (355) .+ ..+++.+.+.|+-.+|.|.+++.. T Consensus 440 ~~----G~~~~~i~~~p~~p~e~i~~~~~~ 465 (477) T protein:vir:79 440 AA----GHLLINYKYTVPPPLERLTYETEI 465 (477) T ss_pred hC----CeEEEEEEEEecCCceeEEEEEEE Confidence 22 358999999999999999999888 No 50 >protein:vir:6894 Length: 660 # NCBI annotation: gp18 tail sheath protein # Family: family:all:661 # MgeID: mge:140 # MgeName: RB69 # Cross-refs: genbank:acc:NP_861870;genbank:gi:32453661;genbank:GeneID:1494296 Probab=99.49 E-value=7.2e-14 Score=92.62 Aligned_cols=324 Identities=10% Similarity=0.043 Sum_probs=175.9 Q ss_pred CCCCceEEEeee-------------------------eeeeee-cCCCc-eeEEEEEecCCccceeEEEeehhhhhhhhh Q lcl|NC_019422. 
1 MGLPSAIIEFQR-------------------------RSRTVK-FRSRR-GVVALILKDSTAIKKSYSIDFLTDINETEF 53 (355) Q Consensus 1 ~g~P~~~i~f~~-------------------------~a~ta~-~~~~r-G~v~iil~d~~~~~~~~~~~~~~d~~~~~~ 53 (355) -|--+-.|++.- .+.... ..+.. +...++...+......+.+....+...... T Consensus 227 ~g~~G~~i~v~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~ 306 (660) T protein:vir:68 227 PGELGDQLEIEIVSKADYDKGASAQLKIYPDGGTRYSTAKAIFGYGPQTDDQYAIIVRRNDSVVQSVVLSTKRGERDIYG 306 (660) T ss_pred ccccccceEEEEeccccccccccccceeeecccccccceeeEeecccccccceeeeeecCCcceeeeeeecccccccccc Confidence 111111111100 000000 00011 112222222211122222221111110000 Q ss_pred h---------HHHHHHHHhhhccccc----eEEEEecCCCc--cchhHHHHHHHH---HhcccceEEEEcCCCh------ Q lcl|NC_019422. 54 T---------KENYDYIRLAFLGKPS----KVIVEVINDSV--DSERSLDDALKA---LRENKFNYLAIPFISE------ 109 (355) Q Consensus 54 ~---------~~n~~~i~~a~~g~~~----~v~l~~g~~g~--~~~~~y~~al~~---le~~~fn~l~~p~~~d------ 109 (355) . ..-..++.....+.|. .+.+.+|.++. .+..++..+++. ++.+..+.+.+++... T Consensus 307 ~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~gg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~ 386 (660) T protein:vir:68 307 SNIFIDDFFAKGASNYIFATAQGWPKGFSGVIKLNGGLSSNETVEAGDLMEAWDLFADRESVNAQLFIAGSCAGESLEVA 386 (660) T ss_pred cceeeehhhccCcccEEEEeecCCCccccceeeeccccccccccccchhhhHHHHhhhhhccccceeeccccCCCchHHH Confidence 0 0001111111111111 12355555442 223455555544 4555666555543222 Q ss_pred HHHHHHHHHHHHHHHhc----CCeEEEEecCC----------------------CCcCcceeEEecCCeEecC---Cc-- Q lcl|NC_019422. 110 EVDKTKIVNWIKTARRE----KEIYKAVLPNI----------------------SDANEKAIINFATTGIKVG---EK-- 158 (355) Q Consensus 110 ~~~~~~~~~~ik~~r~~----g~~~~aVl~~~----------------------~~~d~egIinv~n~~i~~~---~~-- 158 (355) .++++.+.++++++|+- .....+++... ..+|+...+.+.+-....+ +. 
T Consensus 387 ~~v~~~l~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~ 466 (660) T protein:vir:68 387 STVQKHVVAIGDSRQDCLVLCSPPRAAVVGIPVNRAVDNLVDWRTASGTYTDNNFNISSTYAAIDGNYKYQYDKYNDVNR 466 (660) T ss_pred HHHHHHHHHHHHhhCCeEEEEcccceeEecCCCCCCHHHHHHHHhhcccccccccccCcceEEEEcCceEEecccCCceE Confidence 24677888888887642 11122222110 0135555555555433222 21 Q ss_pred eecHHHHHHHHHHHhcCcccc----cccccccCCccc---c---cCChhhHHHHHhCCeEEEE--ECCcEEEEecCcccc Q lcl|NC_019422. 159 SYTTAEYTARLAGILAGISLS----ESCTYFILDEVT---E---IEPTENPDEAVEEGKLILI--NNNGIRIARGVNSLI 226 (355) Q Consensus 159 ~~~~~~~~a~vAG~~Ag~~~~----~S~T~~~~~~~~---~---~~~~~e~~~ai~~G~lvl~--~dg~v~I~~~INSlt 226 (355) .+++ ++.+||+.|.+..+ +|+-++++.++. . .+++.|.+.+-.+|.-++. .+++.++--+ .|+ T Consensus 467 ~~p~---sg~~AGl~Ar~d~~~g~~~span~~~~~i~g~~~~~~~~~~~~~~~Ln~~gIn~i~~~~~~G~~~wG~-rT~- 541 (660) T protein:vir:68 467 WVPL---AADIAGLCARTDNISQPWMSPAGYNRGQILNVIKLAIETRQAQRDRLYQEAINPVTGTGGDGYVLYGD-KTA- 541 (660) T ss_pred Eech---hHHHHHHHHHHhccCCcEEccCCeeeceeeccceeeecCChhHHHHHhhCCceEEEEecCCeEEEEcc-eec- Confidence 2333 88999999988744 477777655443 2 2477899999999987764 3446665444 343 Q ss_pred ccCCCCCchhhhhhhHhhHHHHHHHHHHHHhhccccccCCCHHHHHHHHHHHHHHHHHHHhcccccCCCCceeEeccccc Q lcl|NC_019422. 227 TLSKEDTEDLKKIKIVEAIDMIQDDILQTWNENYVGKVTNKYDNKILFLSAVNNYFKELQRDEVLDNSQEAYAQIDIEAH 306 (355) Q Consensus 227 t~~~~k~~~f~kirvvr~~D~i~~di~~~~~~~yiGK~~N~~~gr~~~~~~i~~yl~~l~~~g~I~~~~~~~v~id~e~q 306 (355) ...+.+|+-|.++|+++.|.+.+++.... |+++ +|+..-|..++..|+.||.+|.++|+|..| .|..|.+-. 
T Consensus 542 ---~~~~s~~~~i~vrR~~~~i~~si~~~~~~-~v~e-pn~~~~~~~i~~~i~~~L~~l~~~gal~gf---~V~~d~~~n 613 (660) T protein:vir:68 542 ---TSVPSPFDRINVRRLFNMVKTNIGSASKY-RLFE-LNNAFTRSSFRTETSQYLQGIKALGGVYNF---KVVCDTTNN 613 (660) T ss_pred ---CCCCcccceEehhhHHHHHHHHHHHHHHH-hccC-CCCHHHHHHHHHHHHHHHHHHHhcCceeee---EEEEecCCC Confidence 23346899999999999999999998875 9998 799999999999999999999999999986 355554422 Q ss_pred hhhhhccccccccccceeeeccCCCCEEEEEEEEEEEeeeeEEEEEEeC Q lcl|NC_019422. 307 KKYLKEKGIDYSEMTEQQIKEANTGSYVFVEGNITITDAMEDLKFKIYM 355 (355) Q Consensus 307 ~~~~~~~~~d~~~~~d~~v~~~~~~d~v~~~~~i~~vdamEkiy~tv~v 355 (355) . ...+. .-.+++.+.+.|+-.||.|.+++.- T Consensus 614 t--------------~~~i~----~G~~~~~i~~~p~~pae~i~l~~~~ 644 (660) T protein:vir:68 614 T--------------PAVID----RNEFVATFYLQPARSINYITLNFVA 644 (660) T ss_pred C--------------HHHhh----CCeEEEEEEEEecCCcceEEEEEEE Confidence 1 11111 2468899999999999999999876 No 51 >protein:vir:107865 Length: 477 # NCBI annotation: gp39 # Family: family:all:115 # MgeID: mge:1565 # MgeName: BcepMu # Cross-refs: genbank:acc:YP_024712;genbank:gi:48696949;genbank:GeneID:2845953 Probab=99.35 E-value=1.8e-12 Score=84.91 Aligned_cols=309 Identities=13% Similarity=0.075 Sum_probs=153.8 Q ss_pred CCCCceEEEeeeeeeeeecCCCceeEEEEEecCCccceeEEEeehhhhhhhhhhHHHHHHHHhhhccccceE---EEEec Q lcl|NC_019422. 1 MGLPSAIIEFQRRSRTVKFRSRRGVVALILKDSTAIKKSYSIDFLTDINETEFTKENYDYIRLAFLGKPSKV---IVEVI 77 (355) Q Consensus 1 ~g~P~~~i~f~~~a~ta~~~~~rG~v~iil~d~~~~~~~~~~~~~~d~~~~~~~~~n~~~i~~a~~g~~~~v---~l~~g 77 (355) -+.+.....+...+..+..........-...+. ............... .......+.+..+ .+.+. 
T Consensus 111 ~~~~~~~~~v~~~a~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~---------~~~~~~~~~~~~~~~~~~~g~ 179 (477) T protein:vir:10 111 AHPAAANLVLKNDSGGTTYAEGTDYAVDLINGV--ITRIKTGTIPPGATA---------AKATYDYADPTKVTAADIIGA 179 (477) T ss_pred cccccccccccccccccccccchhhhhhhcccc--ceeccccccccccee---------eeecccccccccccccccccc Confidence 111111111111111110000000000000000 000000000000000 0000000000000 00011 Q ss_pred CCCccchhHHHHHHHHHhcc---cceEEEEcCCC-hHHHHHHHHHHHHHHHhcCCeEEEEecC----------------- Q lcl|NC_019422. 78 NDSVDSERSLDDALKALREN---KFNYLAIPFIS-EEVDKTKIVNWIKTARREKEIYKAVLPN----------------- 136 (355) Q Consensus 78 ~~g~~~~~~y~~al~~le~~---~fn~l~~p~~~-d~~~~~~~~~~ik~~r~~g~~~~aVl~~----------------- 136 (355) .+.+ .....-.+|...+.. ....++.|+.. +..+++.+.+.++++| ..+++-. T Consensus 180 ~~~~-~~~tGl~al~~~~~~~~~~~~~l~apg~~~~~~v~~~l~~~~~~~~-----~~~~~d~p~~~~~~~~~~~~~~~~ 253 (477) T protein:vir:10 180 VNAA-GMRTGMKALKDTYNLYGYFSKILIAPAYCTQNSVSVELEAMAVQLG-----AIAYIDAPIGTTLAQALAGRGPAG 253 (477) T ss_pred cccc-chhhhhhhhhhhhhhcchhcccccccccccchhhHHHHHHHHhhCC-----EEEEEecCCCCCHHHHHhhhhhcc Confidence 0000 000111112111111 11344444422 2234444555444443 1222211 Q ss_pred --CCCcCcceeEEecCCeEecC---C--ceecHHHHHHHHHHHhcCcc----cccccccccCCccccc---------CCh Q lcl|NC_019422. 137 --ISDANEKAIINFATTGIKVG---E--KSYTTAEYTARLAGILAGIS----LSESCTYFILDEVTEI---------EPT 196 (355) Q Consensus 137 --~~~~d~egIinv~n~~i~~~---~--~~~~~~~~~a~vAG~~Ag~~----~~~S~T~~~~~~~~~~---------~~~ 196 (355) ...++++.++.+.+-....+ + ..+++ ++.+||++|..+ .-+|+.++++.|+... ++. 
T Consensus 254 ~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~---s~~~ag~~a~~d~~~g~~~span~~~~gi~~~~~~~~~~~~~~~ 330 (477) T protein:vir:10 254 TINFNTSSDRVRLCYPHVKVYDTATNAERLEPL---SSRAAGLRARVDLDKGYWWSSSNQQLVGVTGVERPLSAMIDDPQ 330 (477) T ss_pred ccccccccceEEEEcCeEEEecccCCceeEEch---HHHHHHHHHHhhhcCCceeccCCceeccccccccccccccCCCh Confidence 01234455554444332111 1 23444 789999999876 4458888888776543 134 Q ss_pred hhHHHHHhCCeEEEEE--CCcEEEEecCccccccCCCCCchhhhhhhHhhHHHHHHHHHHHHhhccccccCCCHHHHHHH Q lcl|NC_019422. 197 ENPDEAVEEGKLILIN--NNGIRIARGVNSLITLSKEDTEDLKKIKIVEAIDMIQDDILQTWNENYVGKVTNKYDNKILF 274 (355) Q Consensus 197 ~e~~~ai~~G~lvl~~--dg~v~I~~~INSltt~~~~k~~~f~kirvvr~~D~i~~di~~~~~~~yiGK~~N~~~gr~~~ 274 (355) .|.+.+-.+|.-++.+ +++.++--+ .|+ -.+..+..|+.|.++|++|.|.+.+++.+.. |+++ +|+..-+..+ T Consensus 331 ~~~~~L~~~gi~~i~~~~~~G~~~wG~-rT~--~~~~~~~~~~~~~vrR~~~~i~~~~~~~~~~-~v~~-~~~~~~~~~i 405 (477) T protein:vir:10 331 SDVNMLNEQGITTVFSSYGSGLRLWGN-RTA--AWPTVTHMRNFENVRRTGDVINESLRYFSQQ-FVDA-PIDQGLIDSL 405 (477) T ss_pred hhHHHHhhCCceEEEEecCCcEEEEcc-ccc--CCCCCCcccceeehhhHHHHHHHHHHHHHHH-hccC-CCCHHHHHHH Confidence 5788888999887764 456766544 222 1234456799999999999999999999876 9999 7999999999 Q ss_pred HHHHHHHHHHHHhcccccCCCCceeEeccccchhhhhccccccccccceeeeccCCCCEEEEEEEEEEEeeeeEEEEEEe Q lcl|NC_019422. 275 LSAVNNYFKELQRDEVLDNSQEAYAQIDIEAHKKYLKEKGIDYSEMTEQQIKEANTGSYVFVEGNITITDAMEDLKFKIY 354 (355) Q Consensus 275 ~~~i~~yl~~l~~~g~I~~~~~~~v~id~e~q~~~~~~~~~d~~~~~d~~v~~~~~~d~v~~~~~i~~vdamEkiy~tv~ 354 (355) +..++.||.+|.++|+|..| .+.+|.+. +....+. .-.+++.+.+.|+-.+|.|.++++ T Consensus 406 ~~~i~~~l~~l~~~g~l~g~---~v~~~~~~--------------nt~~~i~----~G~~~~~i~~~p~~p~e~i~~~~~ 464 (477) T protein:vir:10 406 VESVNGFGRKLIGDGALLGF---KAWFDPAR--------------NPKEELA----AGHLLINYKYTVPPPLERLTYETE 464 (477) T ss_pred HHHHHHHHHHHHhCCceeee---EEEEecCC--------------CCHHHhh----CCeEEEEEEEEecCCcceEEEEEE Confidence 99999999999999999876 35554432 1111222 236799999999999999988888 Q ss_pred C Q lcl|NC_019422. 
355 M 355 (355) Q Consensus 355 v 355 (355) . T Consensus 465 ~ 465 (477) T protein:vir:10 465 I 465 (477) T ss_pred E Confidence 8 No 52 >protein:vir:5260 Length: 502 # NCBI annotation: hypothetical protein # Family: family:all:396 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852765;genbank:gi:31544040;uniprot:Q7Y5T5;genbank:GeneID:2753559 Probab=99.06 E-value=4.3e-10 Score=71.94 Aligned_cols=326 Identities=11% Similarity=0.095 Sum_probs=173.5 Q ss_pred CCC--------------CceEEEeeeeeeee-----------------ecC--CCceeEEEEEecCCccceeEEEeehhh Q lcl|NC_019422. 1 MGL--------------PSAIIEFQRRSRTV-----------------KFR--SRRGVVALILKDSTAIKKSYSIDFLTD 47 (355) Q Consensus 1 ~g~--------------P~~~i~f~~~a~ta-----------------~~~--~~rG~v~iil~d~~~~~~~~~~~~~~d 47 (355) -|. ....+++.+-..+. +.- +.-|....+..|.+. ..+.+.+.+. T Consensus 102 ~~~~~~~~~~~~~~~~~G~l~i~i~g~~~t~~~i~lS~~ts~~~vA~~i~~~l~~~~~~~tv~~d~~~--~~F~i~s~tt 179 (502) T protein:vir:52 102 SGATLSDDLERFKSVVNGRFSLTIGGDVKKVDGLSFARLADFNAVATKIQEKLTTLSVAVSIAYDETG--NRFIVSANVA 179 (502) T ss_pred hhhhhHHhHHHhhhhcCceeEEEecceeeeeeccccccccchhHHHHHHHhhhcccccceEEEEecCC--ceEEEEeccC Confidence 011 11223332222211 111 111222333333332 2233322221 Q ss_pred hhhhh----hh---HHHHHHHHhhh--ccccceEEEEecCCCccchhHHHHHHHHHhcccceEE--EEcCCChHHHHHHH Q lcl|NC_019422. 48 INETE----FT---KENYDYIRLAF--LGKPSKVIVEVINDSVDSERSLDDALKALRENKFNYL--AIPFISEEVDKTKI 116 (355) Q Consensus 48 ~~~~~----~~---~~n~~~i~~a~--~g~~~~v~l~~g~~g~~~~~~y~~al~~le~~~fn~l--~~p~~~d~~~~~~~ 116 (355) ..+.. ++ .....++...+ ...++.+.+..... ....+++.++|+++.....||. 
.++...+++.+..+ T Consensus 180 g~~~~~~~~~a~~~~~~gt~~a~~l~l~~~~~av~v~~~~~-g~~aet~~~al~a~~~~~~~w~~~~~a~~~~~~~~la~ 258 (502) T protein:vir:52 180 GEDKKTEIDYAIDEGGEGEYIGALLKLENGQASRKVGKNSV-SLKKETLGEALFNVAEVNNTWYGFTVAAQLTDSEVEAA 258 (502) T ss_pred CCcceeEEEEeecCCcchhHHHHHhccccccceeeeeeecc-cccccCHHHHHHHHHhccCceEEEEEeecCChhHHHHH Confidence 11100 00 00112232222 23334443322222 2234679999999987765554 45444345568899 Q ss_pred HHHHHHHHhcCCeEEEEecCCC--------------CcCcceeEEecCCeEecCCceecHHHHHHHHHHHhcCccc---c Q lcl|NC_019422. 117 VNWIKTARREKEIYKAVLPNIS--------------DANEKAIINFATTGIKVGEKSYTTAEYTARLAGILAGISL---S 179 (355) Q Consensus 117 ~~~ik~~r~~g~~~~aVl~~~~--------------~~d~egIinv~n~~i~~~~~~~~~~~~~a~vAG~~Ag~~~---~ 179 (355) ..|+.. .++.+........ ..++.+.+-+-+. ...+ ..+.+.|..|+... + T Consensus 259 a~~iea---~~~~f~~~~~d~~~~~~~~~~i~~~l~a~~~~~t~~~y~~-----~~~~----~~aa~~g~~as~~f~~~~ 326 (502) T protein:vir:52 259 AKYAQA---NTKLFGANVIRAEQIEWSADNIYKKLYDAGLDHTLAMFDK-----NDMY----PVSSALARLLSTNFAANN 326 (502) T ss_pred HHHHhh---cCcEEEEEecCcceeccccchHHHHHHhccCceeEEEecC-----Ccch----hHHHHHHHHHhcCCCcCc Confidence 999974 3444433222111 1122222222221 1112 22334566666643 4 Q ss_pred cccc--cccCCccc-ccCChhhHHHHHhCCeEEEEE-CCcEEEEecCccccccCCCCCchhhhhhhHhhHHHHHHHHHHH Q lcl|NC_019422. 180 ESCT--YFILDEVT-EIEPTENPDEAVEEGKLILIN-NNGIRIARGVNSLITLSKEDTEDLKKIKIVEAIDMIQDDILQT 255 (355) Q Consensus 180 ~S~T--~~~~~~~~-~~~~~~e~~~ai~~G~lvl~~-dg~v~I~~~INSltt~~~~k~~~f~kirvvr~~D~i~~di~~~ 255 (355) .++| |+.++|+. +.++..|+..+.++|.-++.+ ++.-.+.+|+.. + -+| |-.++-+|-+.+.++.. 
T Consensus 327 g~iT~~fk~l~GV~~~~lt~t~~~al~~~~~N~y~~~~~~~~~~~G~~~----~----G~~--iD~~~~~~Wl~~~lq~~ 396 (502) T protein:vir:52 327 STLTLKFKQQPTITADEITATEFAKAKRLGINVYTYFDDVAMIAEGTVI----G----GKF--ADEIVILDWFVDAVQKE 396 (502) T ss_pred ceeeecccccCCcccCcCCHHHHHHHHhcCceEEEEecCeeEEecCeee----C----Cch--hhHHHHHHHHHHHHHHH Confidence 4555 56899998 467889999999998777754 455567777644 2 235 77888889888888666 Q ss_pred Hhh-cc--ccccCCCHHHHHHHHHHHHHHHHHHHhcccccCCCCceeEeccccc-----hhhhhccccccccccceeeec Q lcl|NC_019422. 256 WNE-NY--VGKVTNKYDNKILFLSAVNNYFKELQRDEVLDNSQEAYAQIDIEAH-----KKYLKEKGIDYSEMTEQQIKE 327 (355) Q Consensus 256 ~~~-~y--iGK~~N~~~gr~~~~~~i~~yl~~l~~~g~I~~~~~~~v~id~e~q-----~~~~~~~~~d~~~~~d~~v~~ 327 (355) +-. -| -+|+|=+.+|..++.+.++.-+++..+.|+|.++....-....... +-|+...+ .++++.-.. T Consensus 397 l~~~L~~s~~kIPy~~~G~~~l~a~i~~~l~~a~~~G~I~~G~~~~~~~g~~~~~d~~~~gy~v~~~----~~~~~s~~d 472 (502) T protein:vir:52 397 VFARLYKSPTKIPLTDKGQAILIAAVEKVCLEGINNGAFAPGKWTGAGFGNLSTGDYLDKGFYVWAA----PMDTLSDSD 472 (502) T ss_pred HHHHHHhcCCCcccChhHHHHHHHHHHHHHHHHHhcCccccccccCcccceeeecccccCceEEEeC----chhhCCHHH Confidence 543 23 2799999999999999999999999999999876432111100001 12322221 222333222 Q ss_pred cCCCCEEEEEEEEEEEeeeeEEEEEEeC Q lcl|NC_019422. 328 ANTGSYVFVEGNITITDAMEDLKFKIYM 355 (355) Q Consensus 328 ~~~~d~v~~~~~i~~vdamEkiy~tv~v 355 (355) ...+..--+++.+++-.|+..+.+.++| T Consensus 473 r~~R~~~~~~~~~~~aGaIh~v~i~~nv 500 (502) T protein:vir:52 473 RQARRATPIQTAVKLAGAIHSSDVIVNY 500 (502) T ss_pred HHcccCCCeEEEEEECceEEEEEEEEEE Confidence 3333444588888999999999999999 No 53 >protein:vir:80052 Length: 331 # NCBI annotation: gp14 # Family: family:all:396 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468718;genbank:gi:157325298;genbank:GeneID:5601743 Probab=98.95 E-value=5.8e-09 Score=65.75 Aligned_cols=313 Identities=16% Similarity=0.147 Sum_probs=176.1 Q ss_pred CCCCceEEEeeeeeeeeecCCCceeEEEEEecCCccceeEEEeehhhhhhhhhhHHHHHHHH--hhhccccceEEEEecC Q lcl|NC_019422. 
1 MGLPSAIIEFQRRSRTVKFRSRRGVVALILKDSTAIKKSYSIDFLTDINETEFTKENYDYIR--LAFLGKPSKVIVEVIN 78 (355) Q Consensus 1 ~g~P~~~i~f~~~a~ta~~~~~rG~v~iil~d~~~~~~~~~~~~~~d~~~~~~~~~n~~~i~--~a~~g~~~~v~l~~g~ 78 (355) =.+-+++|+....+. .-+..+|.-.++..++.. ....|.+++++ +.++.+....|.. ..|-.+|....++++. T Consensus 3 ~~iv~V~v~~~~~~~--~~~~~~~~~~~~~~~t~~--~~~~y~s~~~v-~~d~~~~~~~Ykaa~~~f~Q~~~~~~i~v~~ 77 (331) T protein:vir:80 3 ETITDVRVHISVLYP--SPRIGLGRPAIFVKGTAM--GYKEYTTLEEL-KDTFADNTEVYAKAKAVFLQKDRPDTVAVIT 77 (331) T ss_pred cceecceeeeccccc--ccccccCcceeEEecccc--ceEEEechhhh-ccCCCCCcHHHHHHHHHHhccCccceEEEec Confidence 233344444443322 233444554444444332 23356666665 5567665555544 3344455544444443 Q ss_pred CCccchhHHHHHHHHHhcccceEEEEcCCChHHHHHHHHHHHHHHHhcCCeEEEEecCCC-----CcCcceeEEecCCeE Q lcl|NC_019422. 79 DSVDSERSLDDALKALRENKFNYLAIPFISEEVDKTKIVNWIKTARREKEIYKAVLPNIS-----DANEKAIINFATTGI 153 (355) Q Consensus 79 ~g~~~~~~y~~al~~le~~~fn~l~~p~~~d~~~~~~~~~~ik~~r~~g~~~~aVl~~~~-----~~d~egIinv~n~~i 153 (355) ... ...+...+..+ ...|=.+++... +.+.+..+..|+.. .++++..+-.... .....+.+.+... T Consensus 78 ~~~--~~~~~a~~a~~-~~~w~~~~~~~~-~~~~~~a~a~~~~a---~~~~f~~~~~~~~~~~~~~~~~~~t~~~~~~-- 148 (331) T protein:vir:80 78 YED--TKLLEAAEAYF-LKSWHFALLAEF-KAADALALSNLIEE---QKFKFAVFQVTAVADITPLAKNTRTIAIVHS-- 148 (331) T ss_pred cch--HHHHHHHHHhc-cCceeEEEeecC-CHHHHHHHHHHHhh---CCcEEEEEecCchHHHHHhhccccEEEEEcC-- Confidence 221 12233333333 333445555544 44566789999963 4555554433210 0112222222111 Q ss_pred ecCCceecHHHHHHHHHHHhcCcccccccc--cc-cCCccc-ccCChhhHHHHHhCCeEEEEE-CCcEEEEecCcccccc Q lcl|NC_019422. 154 KVGEKSYTTAEYTARLAGILAGISLSESCT--YF-ILDEVT-EIEPTENPDEAVEEGKLILIN-NNGIRIARGVNSLITL 228 (355) Q Consensus 154 ~~~~~~~~~~~~~a~vAG~~Ag~~~~~S~T--~~-~~~~~~-~~~~~~e~~~ai~~G~lvl~~-dg~v~I~~~INSltt~ 228 (355) +.. ++ .++.+.|..+..... |.| |+ +++|+. 
+.++..|+..+.++|.-++.+ +|.-.+.+|+.+ T Consensus 149 --~~~---~~-~~aa~~g~~~~~~~g-~~t~~fk~~l~GV~~~~lt~t~~~al~~~~~N~y~~~~~~~~~~~G~~~---- 217 (331) T protein:vir:80 149 --KTG---EK-LDAALIGNVASLPVG-SATWKGRHGLAGITSEELKVSEIDAIQKAGGMCYIEKAGIAQTSEGKTV---- 217 (331) T ss_pred --Ccc---ch-hHHHHHHHHHhcCcc-ceeeeeecccCCCCCCCCCHHHHHHHHhcCceEEEEecCeeEEecceEe---- Confidence 111 22 234445556666553 455 55 489998 478999999999999888865 455677888632 Q ss_pred CCCCCchhhhhhhHhhHHHHHHHHHHHHhhccc--cccCCCHHHHHHHHHHHHHHHHHHHhcccccCCCCceeEeccccc Q lcl|NC_019422. 229 SKEDTEDLKKIKIVEAIDMIQDDILQTWNENYV--GKVTNKYDNKILFLSAVNNYFKELQRDEVLDNSQEAYAQIDIEAH 306 (355) Q Consensus 229 ~~~k~~~f~kirvvr~~D~i~~di~~~~~~~yi--GK~~N~~~gr~~~~~~i~~yl~~l~~~g~I~~~~~~~v~id~e~q 306 (355) +. +| |-.++-.|-+.+.++..+-..++ +|+|=+.+|..++.+.++.-+++..+.|+|.++..+. .. T Consensus 218 ~G----~~--iD~~~~~dWl~~~lq~~l~~ll~~~~kiPy~~~G~~~l~a~i~~~~~~av~~G~I~~g~~~~-----~~- 285 (331) T protein:vir:80 218 SG----EF--IDSIHGDDWIKATIETRLQKLLTETDKLTFDARGIALLQSELTTVLNEGFANGIIDSNDETG-----EP- 285 (331) T ss_pred Cc----hh--HHHHHHHHHHHHHHHHHHHHHHHhCCCCccChhhHHHHHHHHHHHHHHHHhCCceecCccCC-----Cc- Confidence 22 34 77888888877777766555444 7999999999999999999999999999998764320 00 Q ss_pred hhhhhccccccccccceeeeccCCCCEEEEEEEEEEEeeeeEEEEEEeC Q lcl|NC_019422. 307 KKYLKEKGIDYSEMTEQQIKEANTGSYVFVEGNITITDAMEDLKFKIYM 355 (355) Q Consensus 307 ~~~~~~~~~d~~~~~d~~v~~~~~~d~v~~~~~i~~vdamEkiy~tv~v 355 (355) .|.. 
-...++++.-.....+..--+.+.+++-.|+..+.+.++| T Consensus 286 -~~~v----~~~~~~~~s~~dr~~R~~~~~~~~~~~~gaI~~v~i~~~v 329 (331) T protein:vir:80 286 -NFSI----TALQRSDLNDDDIAKRNYKGLSFRYKRSGAIHSVDVYGEV 329 (331) T ss_pred -ceEE----EeCchhcCCHHHHhccCCCCeEEEEEEcceEEEEEEEEEE Confidence 1110 0111122211112222223377788999999999998888 No 54 >protein:vir:276 Length: 369 # NCBI annotation: putative tail sheath protein # Family: family:all:669 # MgeID: mge:7 # MgeName: K139 # Cross-refs: genbank:acc:NP_536656;genbank:gi:17975134;genbank:GeneID:929090 Probab=98.64 E-value=1.6e-07 Score=57.84 Aligned_cols=316 Identities=13% Similarity=0.091 Sum_probs=175.6 Q ss_pred CCCCceEEEeeeeeeeeecCCCceeEEEEEec-CCccceeEEEeehhhhhhhhhhH---HHHHHHHhhhccccceEEEEe Q lcl|NC_019422. 1 MGLPSAIIEFQRRSRTVKFRSRRGVVALILKD-STAIKKSYSIDFLTDINETEFTK---ENYDYIRLAFLGKPSKVIVEV 76 (355) Q Consensus 1 ~g~P~~~i~f~~~a~ta~~~~~rG~v~iil~d-~~~~~~~~~~~~~~d~~~~~~~~---~n~~~i~~a~~g~~~~v~l~~ 76 (355) |-+|.+.|.=-.+..-....-+|=.+.+..-. .+...+.+.+.+.+|+..- +.+ .-+..+..+...++..-..+. T Consensus 1 m~~~~V~in~~n~~qg~~~~ver~~lfig~g~~~~~~g~~~~~~~~sdld~~-lg~~ds~lk~~v~aa~~naG~~w~a~~ 79 (369) T protein:vir:27 1 MAWPTVIIKILNLMNGPIADIECHFLFVIRGTVSGEVRNLIMVDSTSDLDDV-LAEASAEGLAIVKAAQLNGKQAWTAGV 79 (369) T ss_pred CCCCceEEecccccCCCcccccceEEEEEeccccccccceEEecCccchHhh-cCCcChhHHHHHHHHHhCCCCceEEEE Confidence 99999998655544433333333333332221 2344566778777787433 322 224456666665543322211 Q ss_pred cCCCccchhHHHHHHHHH-hcccceEEEEcCC-ChHHHHHHHHHHHHHHHhc-CCeEEEEecCC----CC---------- Q lcl|NC_019422. 77 INDSVDSERSLDDALKAL-RENKFNYLAIPFI-SEEVDKTKIVNWIKTARRE-KEIYKAVLPNI----SD---------- 139 (355) Q Consensus 77 g~~g~~~~~~y~~al~~l-e~~~fn~l~~p~~-~d~~~~~~~~~~ik~~r~~-g~~~~aVl~~~----~~---------- 139 (355) . +..+..+|.+|++.. +.+.|.++++-+. ++.+.-+...+-...+-.. |+.+..++.-. .. 
T Consensus 80 ~--p~~~~~~~~~Av~~a~~~~s~E~V~v~~p~t~~a~i~aaq~~a~el~~~~~R~vffi~e~~~~~~~~~~~e~w~dy~ 157 (369) T protein:vir:27 80 M--ILSEEDNWQDAVKKANEVSSFEFVVLGFDAETKAMIEDAITLRTELKNSLGREVGVLCQLPAINNDPTNGQTWSEWL 157 (369) T ss_pred E--EeCCchhHHHHHHhhhhhCCccEEEEecCcccHHHHHHHHHHHHHHHHhcCCeEEEEEeccccCCCccccCCHHHHH Confidence 1 122346799988766 4566776665442 3323334444444455444 77777776411 01 Q ss_pred ---------cCcceeEEecCCeEecCCceecHHHHHHHHHHHhc--Ccccccccccc---cCCcccc--------cCChh Q lcl|NC_019422. 140 ---------ANEKAIINFATTGIKVGEKSYTTAEYTARLAGILA--GISLSESCTYF---ILDEVTE--------IEPTE 197 (355) Q Consensus 140 ---------~d~egIinv~n~~i~~~~~~~~~~~~~a~vAG~~A--g~~~~~S~T~~---~~~~~~~--------~~~~~ 197 (355) ..++.|--| .... ..| ....-+||.+| +.++-.|+--. ++.|+.. .++.+ T Consensus 158 a~l~al~~g~a~~~V~vv-~~~~-~~g------n~~G~~aGRl~n~aVsIadsp~RVktG~l~g~~~~p~d~~g~~l~~a 229 (369) T protein:vir:27 158 ADTVDIPKDVASEYISVV-PNVH-AAG------DTLGKYAGRLANKEVSIADSPARVQTGSVLGNTELMKDKAGKALDLA 229 (369) T ss_pred HHHHHHhhccCcccceee-eeec-ccc------chHHHHHHHHHhcccchhcCcceeeecccccccccccCCCCcccCHH Confidence 112222222 1111 011 12555667764 45566666322 2233321 23456 Q ss_pred hHHHHHhCCeEEEE-ECC--cEEEEecCccccccCCCCCchhhhhhhHhhHHHHHHHHHHHHhhcccc-ccCCCHHHH-H Q lcl|NC_019422. 198 NPDEAVEEGKLILI-NNN--GIRIARGVNSLITLSKEDTEDLKKIKIVEAIDMIQDDILQTWNENYVG-KVTNKYDNK-I 272 (355) Q Consensus 198 e~~~ai~~G~lvl~-~dg--~v~I~~~INSltt~~~~k~~~f~kirvvr~~D~i~~di~~~~~~~yiG-K~~N~~~gr-~ 272 (355) .+.++=++|-.++. ..| ++-+..| ++| ...+.||+.|.-+|++|.+.+.+|...=+ +|+ ..-|+..+- . 
T Consensus 230 ~l~aLd~agysvp~~Y~gy~G~Yw~d~-~tl----~~~gsDYq~iE~~RVvdKa~R~vR~~Ai~-~i~Dr~lnstp~sia 303 (369) T protein:vir:27 230 TLKALESNRIAVPMWYPDYPGQYWTTG-RTL----DVPGGDYQDIRHIRVAMKAARKVRIRAIA-RIADRTLNSTPQSIA 303 (369) T ss_pred HHHHHHhCCCeEEEeeCCCCceEEeCc-eEe----ccCCCCeehhhhhhHHHHHHHHHHHHHHH-HhcCcccccChhHHH Confidence 67777789988884 333 6666655 444 56678999999999999999999988777 455 444555544 3 Q ss_pred HHHHHHHHHHHHHHhcccccCCCCceeEeccccchhhhhccccccccccceeeeccCCCCEEEEEEEEEEEeeeeEEEEE Q lcl|NC_019422. 273 LFLSAVNNYFKELQRDEVLDNSQEAYAQIDIEAHKKYLKEKGIDYSEMTEQQIKEANTGSYVFVEGNITITDAMEDLKFK 352 (355) Q Consensus 273 ~~~~~i~~yl~~l~~~g~I~~~~~~~v~id~e~q~~~~~~~~~d~~~~~d~~v~~~~~~d~v~~~~~i~~vdamEkiy~t 352 (355) ....-+..=|++|++.. -|+ . |.+. .=.||.|... ++..|-+.+.++|.++=-+|... T Consensus 304 ~~~~~~~~pLr~M~ks~--fpg---e--i~~P--------------~d~dI~i~w~-~k~~V~I~~~vrP~~~pk~it~~ 361 (369) T protein:vir:27 304 AAKLYFTQDLRTMALTG--VPG---E--IYPP--------------EDEDIQIKWV-NSTDVEIYMSVQPYECPVKITIA 361 (369) T ss_pred HHHHHHhhHHHHHHhhc--CCe---E--EecC--------------CCCceEEEee-ccceEEEEEEEeeccCCceEEEE Confidence 33333344555776652 111 1 2111 1134544443 34578888899999999999999 Q ss_pred EeC Q lcl|NC_019422. 353 IYM 355 (355) Q Consensus 353 v~v 355 (355) |.+ T Consensus 362 I~l 364 (369) T protein:vir:27 362 ISV 364 (369) T ss_pred EEE Confidence 999 No 55 >protein:vir:99586 Length: 507 # NCBI annotation: hypothetical protein # Family: family:all:396 # MgeID: mge:1544 # MgeName: BcepF1 # Cross-refs: genbank:acc:YP_001039795;genbank:gi:126011045;genbank:GeneID:4818281 Probab=98.63 E-value=1.6e-07 Score=57.79 Aligned_cols=320 Identities=11% Similarity=0.093 Sum_probs=166.6 Q ss_pred CCCCceEEEeeeeeeeeecC---------CCce-------eEEEEEecCCc-----------cceeEEEeehhhhhhhhh Q lcl|NC_019422. 1 MGLPSAIIEFQRRSRTVKFR---------SRRG-------VVALILKDSTA-----------IKKSYSIDFLTDINETEF 53 (355) Q Consensus 1 ~g~P~~~i~f~~~a~ta~~~---------~~rG-------~v~iil~d~~~-----------~~~~~~~~~~~d~~~~~~ 53 (355) +-.=+...++.++-.++... 
...+ ..+.+..|.+. ......+.+..+ T Consensus 117 i~v~G~~~t~~~i~lS~~ts~~~vAs~i~~~l~a~~~~~~~~~tv~~d~~~~~F~v~s~~tG~~s~i~~at~~~------ 190 (507) T protein:vir:99 117 LSIGGTVVPIAGIDLTAALTLTDVAATLQTKIRASANAELATATVTFNTTTNQFVLNGTTTGALAPTITAVRTD------ 190 (507) T ss_pred EEEcCceeEeccccccccCCHHHHHHHHHHhhhccccccccceEEEEecCCceEEEEeeeccccceeEEEEcCC------ Confidence 11111222222210000000 0000 00122222221 111111211100 Q ss_pred hHHHHHHHH-hhhccccceEEEEecCCCccchhHHHHHHHHHhcccceEEEE-----cCCChHHHHHHHHHHHHHHHhcC Q lcl|NC_019422. 54 TKENYDYIR-LAFLGKPSKVIVEVINDSVDSERSLDDALKALRENKFNYLAI-----PFISEEVDKTKIVNWIKTARREK 127 (355) Q Consensus 54 ~~~n~~~i~-~a~~g~~~~v~l~~g~~g~~~~~~y~~al~~le~~~fn~l~~-----p~~~d~~~~~~~~~~ik~~r~~g 127 (355) .-..+. +.++.++..+++. |.+ .++..++++++.....||..+ |+.+ +..+..+..|+.. ++ T Consensus 191 ---~gt~~s~l~~~~~~~a~~~~-g~~----aet~~~a~~a~~~~~~nW~~~~~a~~~~~t-d~~~lalA~wiea---~~ 258 (507) T protein:vir:99 191 ---PATDISSLLGWTNTGTVFVK-GQA----AETPDTSISKSAAISTNFGSFIYTSTPALT-NDQITAVASWNAS---QN 258 (507) T ss_pred ---chhhHHHHhccccccceEee-ccc----ccCHHHHHHHHHhhcCCeEEEEEEeccccC-hHHHHHHHHHHhh---cC Confidence 001111 1222233344443 332 245778999998877677532 3323 4467899999983 44 Q ss_pred CeEEEEecCCCCcCc--------ceeEEecCCeEecCCceecHHHHHHHHHHHhcCcc---cccccc--cccCCccc-cc Q lcl|NC_019422. 128 EIYKAVLPNISDANE--------KAIINFATTGIKVGEKSYTTAEYTARLAGILAGIS---LSESCT--YFILDEVT-EI 193 (355) Q Consensus 128 ~~~~aVl~~~~~~d~--------egIinv~n~~i~~~~~~~~~~~~~a~vAG~~Ag~~---~~~S~T--~~~~~~~~-~~ 193 (355) +++..+.... .++. .+-....+. .....+....++.+.|..|+.. .+.++| |+.++|+. 
++ T Consensus 259 ~~f~~~~~~~-~a~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~aa~~g~~as~nf~~~ng~~T~~fk~l~GV~a~~ 333 (507) T protein:vir:99 259 NMYMYSVPTT-IANIGTLYAAVKGFSGCALNI----TSDSLPVDYIEQSPCEILAATDYTRVNATQNYMYYQFPSRNITV 333 (507) T ss_pred cEEEEEEecC-chhhhhhhhhhhhcceeEEEe----ecccccchhHHHHHHHHHHhhccCcCccceeecccccCCccccc Confidence 5554433321 1111 110111111 0112232334556666676654 445555 46899998 46 Q ss_pred CChhhHHHHHhCCeEEEEE--C--CcE-EEEecCccccccCCCCCc-hhhhhhhHhhHHHHHHHHHHHHhhccc--cccC Q lcl|NC_019422. 194 EPTENPDEAVEEGKLILIN--N--NGI-RIARGVNSLITLSKEDTE-DLKKIKIVEAIDMIQDDILQTWNENYV--GKVT 265 (355) Q Consensus 194 ~~~~e~~~ai~~G~lvl~~--d--g~v-~I~~~INSltt~~~~k~~-~f~kirvvr~~D~i~~di~~~~~~~yi--GK~~ 265 (355) ++..|.+.+.++|.-++.+ + ... .+.+|+.+ + ++ +|.-+.+.+-.|-+.+.++..+-+-|. +|+| T Consensus 334 lt~t~a~al~~~n~N~y~~~a~~~~~~~~~~~G~~~----g---G~~~fid~d~~~~~~WL~~~iq~~l~~l~~~~~kIP 406 (507) T protein:vir:99 334 SDDTTANLVDANRGNYIGQTQSAGQSLAFYQRGILC----G---GPNDAVDMNIYANEIWLKSAISAQILSLFLNVPRVP 406 (507) T ss_pred CCHHHHHHHHhcCCeEEEEeccccceeeEEecCeee----C---CcccceeeeeecchHHHHHHHHHHHHHHHhcCCCCc Confidence 7889999999999887743 2 234 45677655 1 22 565566655544444444444333332 8999 Q ss_pred CCHHHHHHHHHHHHHHHHHHHhcccccCCCCceeEeccccchhhhhcccccc---------------ccccceeeeccCC Q lcl|NC_019422. 266 NKYDNKILFLSAVNNYFKELQRDEVLDNSQEAYAQIDIEAHKKYLKEKGIDY---------------SEMTEQQIKEANT 330 (355) Q Consensus 266 N~~~gr~~~~~~i~~yl~~l~~~g~I~~~~~~~v~id~e~q~~~~~~~~~d~---------------~~~~d~~v~~~~~ 330 (355) =+.+|..++.+.++.-|++-.+.|+|.++.. .-..|+.+|....-+. ..++.+.-..... T Consensus 407 yt~~G~~~l~a~i~~~l~~av~nG~I~~Gvt-----l~~~q~~~in~~~~~~~~~~~~~~~Gyy~~~~~~s~~~~~~r~~ 481 (507) T protein:vir:99 407 ANETGESMLLSVIQSVVNTAKNNGTISAGKN-----LNVIQQQYITQISGDANAWRQVANIGYWLNITFSSYTNPNTQLT 481 (507) T ss_pred cChhhHHHHHHHHHHHHHHHHhccccccCCc-----ccccchheecccccccccccceeccceEEEeCChHhcChhhhhc Confidence 9999999999999999999999999999742 1234444444333221 1222333233445 Q ss_pred CCEEEEEEEEEEEeeeeEEEEEEeC Q lcl|NC_019422. 
331 GSYVFVEGNITITDAMEDLKFKIYM 355 (355) Q Consensus 331 ~d~v~~~~~i~~vdamEkiy~tv~v 355 (355) +.+.-+....+-=+|+-++.|.-++ T Consensus 482 r~~~~~~~~y~~~gaI~~v~~~~~~ 506 (507) T protein:vir:99 482 EWKASYQLIYSKDDAIRFVEGTDTL 506 (507) T ss_pred cccceEEEEEEeCCeEEEEEeeeec Confidence 5577788888888999999999888 No 56 >protein:vir:95263 Length: 450 # NCBI annotation: Phage conserved protein # Family: family:all:396 # MgeID: mge:1561 # MgeName: Felix 01 # Cross-refs: genbank:acc:NP_944896;genbank:gi:38707836;genbank:GeneID:2744049 Probab=98.50 E-value=4.4e-07 Score=55.43 Aligned_cols=312 Identities=12% Similarity=0.091 Sum_probs=179.0 Q ss_pred CCCCceEEEeeeeeeeeecCCCceeEEEEEecCCccceeEEEeehhhhhhhhhhHHHHHHHH--hhhcccc--------- Q lcl|NC_019422. 1 MGLPSAIIEFQRRSRTVKFRSRRGVVALILKDSTAIKKSYSIDFLTDINETEFTKENYDYIR--LAFLGKP--------- 69 (355) Q Consensus 1 ~g~P~~~i~f~~~a~ta~~~~~rG~v~iil~d~~~~~~~~~~~~~~d~~~~~~~~~n~~~i~--~a~~g~~--------- 69 (355) |=-+=+++...-.+.. ..+..-|.+.++-...........|.+.+++ ..+|......|.. ..|.+.| T Consensus 1 ~~s~iVnV~i~~~~~a-~~~~~f~~~l~~~~~~~~~~r~~~yss~~~V-~~~FG~~S~ey~aA~~yF~q~p~p~~l~igr 78 (450) T protein:vir:95 1 MWNPIVNVDITLNTAG-TTREGFGLPLFLASTDNFEERVRGYTSLTEV-AEDFDENTAAYKAAKQLWSQTPKVTQLYIGR 78 (450) T ss_pred CCCceEEEeecccccc-cccccceeEEEEcCCCCCccceeeecCHHHH-HHhcCCCcHHHHHHHHHHhCCCcccEEEEEe Confidence 7666666666554333 3333334444444443334445566666664 3444433222211 1111100 Q ss_pred -----------------ceEE---------------E------------------------------Eec---------- Q lcl|NC_019422. 
70 -----------------SKVI---------------V------------------------------EVI---------- 77 (355) Q Consensus 70 -----------------~~v~---------------l------------------------------~~g---------- 77 (355) .+++ + ..+ T Consensus 79 ~~~~~t~~~~~~~~~~~~g~lt~tv~G~~~~~~~i~~s~a~s~~~va~~~~tai~~~~~~~~~~~~~s~g~~~~~t~~~~ 158 (450) T protein:vir:95 79 RAMQYTVSIPDAVTESTDYSITVAAGGGISQPYQYTAQSSDTAENVLQQFKTQIEADPTIKDKVSVNVTGSNGSATMIIA 158 (450) T ss_pred eccchhhhhhhhhccccceeEEEEecceeeeeeEEEEEecCChhhHHHHhhhhhcccceeeeeeeeeeecccceeeeeee Confidence 0000 0 000 Q ss_pred CCC-----------------ccchhHHHHHHHHHhcccceE--EEEcCCChHHHHHHHHHHHHHHHhcCCeEEEEecCCC Q lcl|NC_019422. 78 NDS-----------------VDSERSLDDALKALRENKFNY--LAIPFISEEVDKTKIVNWIKTARREKEIYKAVLPNIS 138 (355) Q Consensus 78 ~~g-----------------~~~~~~y~~al~~le~~~fn~--l~~p~~~d~~~~~~~~~~ik~~r~~g~~~~aVl~~~~ 138 (355) ..+ ....+...++++++.....|| ++++.. +++.+..+..|+.. .++.+..+..+.. T Consensus 159 ~~~~~~~~~l~~~~~~~~~~g~~aet~~~a~~a~~~~~~~w~~~~~~~~-~~~~i~a~a~w~~a---~~~~f~~~~~~~~ 234 (450) T protein:vir:95 159 KAGDNDFVKVTTTAQTVYIASTTADTASTALAAIEAYSTDWYFIAAEDR-TQQFVLAMASEIQA---RKKIFFTANSDVT 234 (450) T ss_pred ccccchhhccccccceeEecccccccHHHHHHHHHHhhCCeEEEEecCC-CHHHHHHHHHHHhh---cCcEEEEEcCCch Confidence 000 001245788898888664444 455554 34567889999985 3455544433311 Q ss_pred ------------------CcCcceeEEecCCeEecCCceecHHHHHHHHHHHhcCcccccccc--cccCCcccc------ Q lcl|NC_019422. 139 ------------------DANEKAIINFATTGIKVGEKSYTTAEYTARLAGILAGISLSESCT--YFILDEVTE------ 192 (355) Q Consensus 139 ------------------~~d~egIinv~n~~i~~~~~~~~~~~~~a~vAG~~Ag~~~~~S~T--~~~~~~~~~------ 192 (355) ..++.+.+.+-++. -.....+++++|..+.. .+.|+| |+.++|+.. 
T Consensus 235 ~~~~~~~~~~~~i~~~l~~~~~~~t~~~y~~~-------~~~~~~~aa~~g~~~~~-~~g~~T~~fk~l~Gv~~~v~~~~ 306 (450) T protein:vir:95 235 ALQGTELASANDVPAQLAKNMYTRTVCLWHHA-------AAEDYPEMAYIAYGAPY-DAGSIAWGNAQLTGVAASLQPSN 306 (450) T ss_pred hhhhhhhhcccchHHHHHhccCCeeEEEeeCC-------CchhHHHHHHHHHhhhc-ccceeeeccccccceeeeccCcc Confidence 11112222222110 11223456666765554 334566 568899874 Q ss_pred --cCChhhHHHHHhCCeEEEEE-CCcEEEEecCccccccCCCCCchhhhhhhHhhHHHHHHHHHHHHhhcc----ccccC Q lcl|NC_019422. 193 --IEPTENPDEAVEEGKLILIN-NNGIRIARGVNSLITLSKEDTEDLKKIKIVEAIDMIQDDILQTWNENY----VGKVT 265 (355) Q Consensus 193 --~~~~~e~~~ai~~G~lvl~~-dg~v~I~~~INSltt~~~~k~~~f~kirvvr~~D~i~~di~~~~~~~y----iGK~~ 265 (355) .++..|.+.+.++|.-++.+ ++.-.+.+|+.+ + -+| |-+++.+|-+.+.++..+-+.+ .||+| T Consensus 307 ~~~lt~~~~~al~~~~~n~y~~~~~~~~~~~G~~~----~----G~~--iD~~~~~~wl~~~iq~~l~~ll~~~~~~KiP 376 (450) T protein:vir:95 307 QRPLTSIQKSALDVRHCNFIDLDGGVPVVRRGITS----G----GEW--IDIIRGVDWLESDLKTSLRDLLINQKGGKIT 376 (450) T ss_pred ccccchHHHHHHHhCCcEEEEEecCceeeeCCeee----C----cch--hHHHHHHHHHHHHHHHHHHHHHHhcCCCCCc Confidence 35778899898888776654 455567788744 1 235 7788999999999988876644 27999 Q ss_pred CCHHHHHHHHHHHHHHHHHHHhcccccCCCCceeEeccccchhhhhccccccccccceeeeccCCCCEEEEEEEEEEEee Q lcl|NC_019422. 266 NKYDNKILFLSAVNNYFKELQRDEVLDNSQEAYAQIDIEAHKKYLKEKGIDYSEMTEQQIKEANTGSYVFVEGNITITDA 345 (355) Q Consensus 266 N~~~gr~~~~~~i~~yl~~l~~~g~I~~~~~~~v~id~e~q~~~~~~~~~d~~~~~d~~v~~~~~~d~v~~~~~i~~vda 345 (355) =+.+|..++.+.|+.-+++..+.|+|..++ +.. ..+.++......-+-.--+.+.++.-.| T Consensus 377 y~~~G~~~i~a~i~~~l~~a~~~G~Ia~~~---V~~----------------~~~~~~~~~dr~~R~~~~i~~~~~laGA 437 (450) T protein:vir:95 377 YDDTGITRIRQVIETSLQRAVNRNFLSSYT---VNV----------------PKASQVALADKKARILKDVTFAGILAGA 437 (450) T ss_pred cChhhHHHHHHHHHHHHHHHHhcCccccee---Eec----------------CChHhcCHHHHhccCCCCeeEEEEEccc Confidence 999999999999999999999999997542 221 1122222222222223338888999999 Q ss_pred eeEEEEEEeC Q lcl|NC_019422. 
346 MEDLKFKIYM 355 (355) Q Consensus 346 mEkiy~tv~v 355 (355) +..+.+.++| T Consensus 438 Ih~~~i~~~v 447 (450) T protein:vir:95 438 ILDVDLKGTV 447 (450) T ss_pred eEEEEEEEEE Confidence 9999999999 No 57 >protein:vir:3636 Length: 501 # NCBI annotation: gp04 # Family: family:all:396 # MgeID: mge:75 # MgeName: Bcep781 # Cross-refs: genbank:acc:NP_705631;genbank:gi:23752316;genbank:GeneID:955753 Probab=98.27 E-value=1.8e-06 Score=52.09 Aligned_cols=303 Identities=10% Similarity=0.079 Sum_probs=160.3 Q ss_pred CCCCceEEEeeeeeeeeecCCCceeEEEEEecCCccceeEEEeehhhhhhhhhhHHHHHHHHhhhccccceEEEEecCCC Q lcl|NC_019422. 1 MGLPSAIIEFQRRSRTVKFRSRRGVVALILKDSTAIKKSYSIDFLTDINETEFTKENYDYIRLAFLGKPSKVIVEVINDS 80 (355) Q Consensus 1 ~g~P~~~i~f~~~a~ta~~~~~rG~v~iil~d~~~~~~~~~~~~~~d~~~~~~~~~n~~~i~~a~~g~~~~v~l~~g~~g 80 (355) ++.|++.+++.+... ..++...++....+..+.+... ++++ . +.+..+....+...+.+ T Consensus 150 l~~~~~tv~~d~~~~----------~f~i~s~t~G~~~~i~~~t~~~----~ia~----~--l~Lt~~~~a~v~~~g~~- 208 (501) T protein:vir:36 150 FTSPDFVVAYDALRN----------RFTVVTNATGTAAAISAVTGTN----NFAD----E--IGLSAAAGATLQAAGVA- 208 (501) T ss_pred hcCcceEEEEcCcce----------eEEEEeccCCcceeeEeeeccc----chhh----h--hcccccCcceEEecccc- Confidence 333332222221111 1111111122112222222111 0100 0 11222222222223322 Q ss_pred ccchhHHHHHHHHHhcccceEEE--EcCCChHHHHHHHHHHHHHHHhcCCeEEEEecCCC-----------------CcC Q lcl|NC_019422. 81 VDSERSLDDALKALRENKFNYLA--IPFISEEVDKTKIVNWIKTARREKEIYKAVLPNIS-----------------DAN 141 (355) Q Consensus 81 ~~~~~~y~~al~~le~~~fn~l~--~p~~~d~~~~~~~~~~ik~~r~~g~~~~aVl~~~~-----------------~~d 141 (355) .+...++|+++.....||.. +....+++.+..+..|+. .+++++..+..... 
..+ T Consensus 209 ---~et~~~al~a~~~~s~~Wy~f~~a~~~~~~~~la~A~wie---a~~~~f~~~~~~~~~~~~~~~~~~~i~~~l~~~~ 282 (501) T protein:vir:36 209 ---ADTPASAMNRAVGLSRNWATFTTAWTAVIADRLAFASWNS---GQAYKYMYVAPDLEAASIVSNNAASFGAQVFAAP 282 (501) T ss_pred ---cccHHHHHHHHHhccCceEEEEEecCCChHHHHHHHHHHh---hcCceEEEEEecCchhhhhccchhhHHHHHHhcC Confidence 24578999999988878743 222234556789999997 44555554443210 012 Q ss_pred cceeEEecCCeEecCCceecHHHHHHHHHHHhcCcccc---ccccc--ccC-Ccccc-cCChhhHHHHHhCCeEEEE--- Q lcl|NC_019422. 142 EKAIINFATTGIKVGEKSYTTAEYTARLAGILAGISLS---ESCTY--FIL-DEVTE-IEPTENPDEAVEEGKLILI--- 211 (355) Q Consensus 142 ~egIinv~n~~i~~~~~~~~~~~~~a~vAG~~Ag~~~~---~S~T~--~~~-~~~~~-~~~~~e~~~ai~~G~lvl~--- 211 (355) +.+.+-+ |.....++.+-|..|+...+ .++|+ +.+ +|+.. +++.+|.+.+.++|.-++. T Consensus 283 y~~t~~~-----------y~~~~~~aa~~g~~as~nf~~~~g~~T~~fkq~~~Gi~a~~l~~t~a~al~~~~~N~y~~~~ 351 (501) T protein:vir:36 283 YQGTLPL-----------YGDQATAGAVMGYAASINFQLRNGRTVLAFRQFNAGVPATVHDLPTANALRSNNYTYIGAYA 351 (501) T ss_pred CCcEEEE-----------cCCCCHHHHHHHHHHhcCcccCcceeeeeccccCCCcCcCcCCHHHHHHHHhcCCcEEEEEe Confidence 2222222 11222234456666666543 45665 566 68875 6788999999999977652 Q ss_pred E-CCcEE-EEecCccccccCCCCCchhhhhhhHhhHHHHHHHHHHHHhhccc--cccCCCHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019422. 212 N-NNGIR-IARGVNSLITLSKEDTEDLKKIKIVEAIDMIQDDILQTWNENYV--GKVTNKYDNKILFLSAVNNYFKELQR 287 (355) Q Consensus 212 ~-dg~v~-I~~~INSltt~~~~k~~~f~kirvvr~~D~i~~di~~~~~~~yi--GK~~N~~~gr~~~~~~i~~yl~~l~~ 287 (355) . ++... +.+|+=| -+|.-|.+++-.|-+.+.++..+-+-+. 
+|+|=+.+|..++.+.+..-+++-.+ T Consensus 352 ~~~~~~~~~~~G~~s---------G~~~wiD~~~g~dWL~~~iq~~l~~ll~~~~KIPytd~G~~~l~a~i~~~l~~av~ 422 (501) T protein:vir:36 352 NAANNYTIAYDGKLS---------GKFLWVDTYLDQIYLNAELQRAEFEAMLAYNSLPYNEDGYTGLYRAGVDVIDAAVT 422 (501) T ss_pred cccceeeEEEcCeee---------ccchhhhHHHhHHHHHHHHHHHHHHHHhcCCCCccChhhHHHHHHHHHHHHHHHHh Confidence 2 33454 4667322 1455688888888888777766655443 79999999999999999999999999 Q ss_pred cccccCCCCce----eEec------cc----cchhhhhccccccccccceeeeccCCCCEEEEEEEEEEEeeeeEEEE-E Q lcl|NC_019422. 288 DEVLDNSQEAY----AQID------IE----AHKKYLKEKGIDYSEMTEQQIKEANTGSYVFVEGNITITDAMEDLKF-K 352 (355) Q Consensus 288 ~g~I~~~~~~~----v~id------~e----~q~~~~~~~~~d~~~~~d~~v~~~~~~d~v~~~~~i~~vdamEkiy~-t 352 (355) .|+|.++.... .+|+ +. +.|-|+...+ .+.... .....+...-+....+-=+|+-.+.+ + T Consensus 423 nG~I~~Gv~~~~~q~~~I~~~~g~~~~~~~v~~~Gyy~~~~--~~~~~~---~~R~~R~~p~~~~~y~~~gaIh~v~i~s 497 (501) T protein:vir:36 423 SGIIRAGVTLTNSQLQQIDKAARVAGAGQLVQTRGWYFLIG--DPANPG---QARQNRTTPACTLWYSDGGSIQSLTIGS 497 (501) T ss_pred CceeecCCCCCcccceeecccccccccccceeccceEEeeC--cccCCh---hhhhhcccCcEEEEEEeCCceeEEEeee Confidence 99999874321 1111 11 1233433322 111111 12334445556777777788888874 3 Q ss_pred EeC Q lcl|NC_019422. 353 IYM 355 (355) Q Consensus 353 v~v 355 (355) +.| T Consensus 498 ~~v 500 (501) T protein:vir:36 498 NAV 500 (501) T ss_pred eee Confidence 344 No 58 >protein:vir:106730 Length: 501 # NCBI annotation: gp04 # Family: family:all:396 # MgeID: mge:1599 # MgeName: Bcep1 # Cross-refs: genbank:acc:NP_944312;genbank:gi:38638611;genbank:GeneID:2657359 Probab=98.19 E-value=2.9e-06 Score=50.94 Aligned_cols=302 Identities=10% Similarity=0.072 Sum_probs=160.5 Q ss_pred CCCCceEEEeeeeeeeeecCCCceeEEEEEecCCccceeEEEeehhhhhhhhhhHHHHHHHHhhhcc-ccceEEEEecCC Q lcl|NC_019422. 1 MGLPSAIIEFQRRSRTVKFRSRRGVVALILKDSTAIKKSYSIDFLTDINETEFTKENYDYIRLAFLG-KPSKVIVEVIND 79 (355) Q Consensus 1 ~g~P~~~i~f~~~a~ta~~~~~rG~v~iil~d~~~~~~~~~~~~~~d~~~~~~~~~n~~~i~~a~~g-~~~~v~l~~g~~ 79 (355) ++.|++.++|.+.+... 
++-..++.......+.+... +.+ .. +.+.. .++.+... |.. T Consensus 150 l~~~~~tv~~d~~~~~f----------~i~~~t~G~~~~i~~~t~~~----d~a----~~--l~Lt~~~~a~v~~~-g~~ 208 (501) T protein:vir:10 150 FTSPDFVVAYDALRNRF----------TVVTNTTGTAAAISAVTGTN----NLA----DE--LGLSAAAGATLQAA-GVA 208 (501) T ss_pred hcCCceEEEEecccceE----------EEEecccCcceeEEEeeccc----cch----hh--hcccccCceeEEec-Ccc Confidence 44444443333222111 11112222222333322111 010 01 11222 22333332 222 Q ss_pred CccchhHHHHHHHHHhcccceEEEE--cCCChHHHHHHHHHHHHHHHhcCCeEEEEecCCC-C----------------c Q lcl|NC_019422. 80 SVDSERSLDDALKALRENKFNYLAI--PFISEEVDKTKIVNWIKTARREKEIYKAVLPNIS-D----------------A 140 (355) Q Consensus 80 g~~~~~~y~~al~~le~~~fn~l~~--p~~~d~~~~~~~~~~ik~~r~~g~~~~aVl~~~~-~----------------~ 140 (355) .+.-.++|+++.....||..+ ....+++.+..+..|+. .+++++..+..... . . T Consensus 209 ----aet~~~Al~a~~~~~~~Wy~f~~a~~~~~~~~la~A~wi~---a~~~~f~~~~~~~~~~~~~~~~~~~i~~~l~~~ 281 (501) T protein:vir:10 209 ----ADTPASAMNRAVGLSRNWATFTTAWTAVIADRLAFAAWNS---GQAYKYMYVAPDLEAASIVTNNAASFGAQVFAA 281 (501) T ss_pred ----cccHHHHHHHHHhcccceEEEEEEecCChHHHHHHHHHHH---hcCceEEEEEecCcceeeecccchhHHHHHHhc Confidence 245779999999888887532 22234556789999997 44555555543211 0 1 Q ss_pred CcceeEEecCCeEecCCceecHHHHHHHHHHHhcCcccc---ccccc--ccC-Ccccc-cCChhhHHHHHhCCeEEEE-- Q lcl|NC_019422. 141 NEKAIINFATTGIKVGEKSYTTAEYTARLAGILAGISLS---ESCTY--FIL-DEVTE-IEPTENPDEAVEEGKLILI-- 211 (355) Q Consensus 141 d~egIinv~n~~i~~~~~~~~~~~~~a~vAG~~Ag~~~~---~S~T~--~~~-~~~~~-~~~~~e~~~ai~~G~lvl~-- 211 (355) ++.+.+-+ |.....++.+-|..|+...+ .++|+ +.+ +|+.. +++..|.+.+.++|.-++. 
T Consensus 282 ~y~~t~~~-----------y~~~~~~aa~~g~~as~nf~~~~g~~T~~fkql~~Gv~a~~l~~t~a~al~~~~~N~y~~~ 350 (501) T protein:vir:10 282 PYQGTLPL-----------YGDQATAGAVMGYAASINFQLRNGRTVLAFRQFNAGVPATAHDLPTANALRSNNYTYIGAY 350 (501) T ss_pred CCCceEEE-----------CCCCCHHHHHHHHHHhcCcccCcceeeeeecccCCCcCcccCCHHHHHHHHhcCCeEEEEE Confidence 11121111 22222344556666766543 45565 566 78875 6888999999999987652 Q ss_pred -E-CCcEE-EEecCccccccCCCCCchhhhhhhHhhHHHHHHHHHHHHhhccc--cccCCCHHHHHHHHHHHHHHHHHHH Q lcl|NC_019422. 212 -N-NNGIR-IARGVNSLITLSKEDTEDLKKIKIVEAIDMIQDDILQTWNENYV--GKVTNKYDNKILFLSAVNNYFKELQ 286 (355) Q Consensus 212 -~-dg~v~-I~~~INSltt~~~~k~~~f~kirvvr~~D~i~~di~~~~~~~yi--GK~~N~~~gr~~~~~~i~~yl~~l~ 286 (355) . +.... +.+|+-| -+|.-|.+++-.|-+.+.++..+-+-+. +|+|=+..|..++.+.+..-+++-. T Consensus 351 ~~~~~~~~~~~~G~~s---------G~~~wiD~~~g~dWl~~~iq~~l~~ll~~~~kIPyt~~G~~~l~a~i~~~l~~av 421 (501) T protein:vir:10 351 ANAANNYTIAYDGKLS---------GKFLWVDTYLDQIYLNAELQRAEFEAMLAYNSLPYNEDGYTANYRAGVDVIDAAV 421 (501) T ss_pred ecccceeeEEEcceee---------ccceehhhHhhHHHHHHHHHHHHHHHHhcCCCcccCHHHHHHHHHHHHHHHHHHH Confidence 2 23444 4567422 1455577777777666666555444333 7999999999999999999999999 Q ss_pred hcccccCCCCce----eEe------ccc----cchhhhhccccccccccceeeeccCCCCEEEEEEEEEEEeeeeEEEEE Q lcl|NC_019422. 287 RDEVLDNSQEAY----AQI------DIE----AHKKYLKEKGIDYSEMTEQQIKEANTGSYVFVEGNITITDAMEDLKFK 352 (355) Q Consensus 287 ~~g~I~~~~~~~----v~i------d~e----~q~~~~~~~~~d~~~~~d~~v~~~~~~d~v~~~~~i~~vdamEkiy~t 352 (355) +.|+|.++...+ .++ |+. +.|-|+...+. ..... .....+...-+.+.++-=+|+-.+.+- T Consensus 422 ~nG~Ia~Gv~l~~~q~~~i~~~~g~~~~~~~v~~~Gyy~~~~~--~~~~~---~~R~~R~~p~~~~~y~~~gaIh~v~i~ 496 (501) T protein:vir:10 422 TSGIIRAGVTLTNSQLQQIDAVAGVAGAGALVQTRGWYFLIGN--PANPG---QARQNRTSPACTLWYSDGGSIQELTIG 496 (501) T ss_pred hCcceecCcccCcccceeecccccccccccceeccceEEeeCc--ccCCh---hhhhhcccCceEEEEEeCCceeEEEee Confidence 999999874211 011 111 12233332221 11111 113334455567777777888888743 Q ss_pred -EeC Q lcl|NC_019422. 
353 -IYM 355 (355) Q Consensus 353 -v~v 355 (355) +.| T Consensus 497 s~~v 500 (501) T protein:vir:10 497 SNAV 500 (501) T ss_pred eeec Confidence 333 No 59 >protein:vir:3788 Length: 376 # NCBI annotation: tail sheath # Family: family:all:669 # MgeID: mge:328 # MgeName: HP2 # Cross-refs: genbank:acc:NP_536828;genbank:gi:17981837;genbank:GeneID:929216 Probab=98.14 E-value=3.7e-06 Score=50.33 Aligned_cols=325 Identities=11% Similarity=0.086 Sum_probs=176.4 Q ss_pred CCCceEEEeeeeeeeeecCCCceeEEEEEecCCccceeEEEeehhhhhhhhhh---HHHHHHHHhhhccccceEEEEecC Q lcl|NC_019422. 2 GLPSAIIEFQRRSRTVKFRSRRGVVALILKDSTAIKKSYSIDFLTDINETEFT---KENYDYIRLAFLGKPSKVIVEVIN 78 (355) Q Consensus 2 g~P~~~i~f~~~a~ta~~~~~rG~v~iil~d~~~~~~~~~~~~~~d~~~~~~~---~~n~~~i~~a~~g~~~~v~l~~g~ 78 (355) =+|.+.|.=-.+..-....-+|=.+.+- ...+..++.+.+.+.+|+ +..+. +.-+..++.+.+.++..-.. ... T Consensus 1 ~~~~v~vn~~n~~~g~~~~~er~~Lfig-~~~~~~~~~~~~~~~sdl-d~~lg~~~~~lk~~v~aa~~naG~~~~~-~~~ 77 (376) T protein:vir:37 1 MFPSVQINALNQLSGETKEIERHALFVG-VGTTNQGKLLALTPDSDF-DKVFGETDTDLKKQVRAAMLNAGQNWFA-HVY 77 (376) T ss_pred CCCeEEEecccccCCCcccccceEEeec-cccccccceeeecCccch-HhhhCCCchHHHHHHHHHHhCCCCcEEE-EEE Confidence 4677666544333322222222222221 333445566677777887 33332 23355677777654433211 111 Q ss_pred CCccchhHHHHHHHHH-hcccceEEEEcCC--ChHHHHHHHHHHHHHHHhc-CCeEEEEecCCC-CcC------------ Q lcl|NC_019422. 79 DSVDSERSLDDALKAL-RENKFNYLAIPFI--SEEVDKTKIVNWIKTARRE-KEIYKAVLPNIS-DAN------------ 141 (355) Q Consensus 79 ~g~~~~~~y~~al~~l-e~~~fn~l~~p~~--~d~~~~~~~~~~ik~~r~~-g~~~~aVl~~~~-~~d------------ 141 (355) ....++++|.+|++.. +.+.|.++++-+. ++.+.-..+.+-...+-.. ++.+..++.... 
..| T Consensus 78 ~~~~~~~~~~~Av~~a~~~~s~E~V~v~~pv~t~~a~i~aa~~~a~el~~~~~Rpv~file~r~~~~~~~~~e~w~~y~~ 157 (376) T protein:vir:37 78 IAQEDGYDFVECVKKANQTASFEYCVNTRYLGVDKASIGKLQECYAELLAKFGRRTFFIQAVQGINHDQSDGETWDQYVQ 157 (376) T ss_pred eecCCchHHHHHHHHhhhhcCceEEEEeccccccHHHHHHHHHHHHHHHHhcCCeEEEEEeccCcCcccccccCHHHHHH Confidence 1233457899998664 6677776665432 2333334444444445444 788877776431 001 Q ss_pred -----cceeEEecCCeEecCCceecHHHHHHHHHHHh--cCcccccccccc---cCCcccc----------cCChhhHHH Q lcl|NC_019422. 142 -----EKAIINFATTGIKVGEKSYTTAEYTARLAGIL--AGISLSESCTYF---ILDEVTE----------IEPTENPDE 201 (355) Q Consensus 142 -----~egIinv~n~~i~~~~~~~~~~~~~a~vAG~~--Ag~~~~~S~T~~---~~~~~~~----------~~~~~e~~~ 201 (355) -.||.+-.-..+- ..+ ...+..+||.+ |+.++.+|+--. ++.++.+ -++...+++ T Consensus 158 ~~~al~~gia~~~V~~V~---~~~--gn~~G~~aGRl~~aaVsVadspgRV~tG~l~gl~~~~lp~d~~~~~l~~a~l~a 232 (376) T protein:vir:37 158 KLTTLQQTIVADHVCLVP---LLF--GNETGVLAGRLANRAVTVADSPARVQTGALVSLGSANKPLDKDRNELTLAHLKS 232 (376) T ss_pred HHHHhhcccccccceeee---eeh--hhhHHHHHHHHhhcccchhhCccceeccccccccccccccCcCcccCCHHHHHH Confidence 0122111000000 000 12377889987 466777887432 3444422 234556777 Q ss_pred HHhCCeEEEE-ECC--cEEEEecCccccccCCCCCchhhhhhhHhhHHHHHHHHHHHHhhccccccCCCHHHHH-HHHHH Q lcl|NC_019422. 202 AVEEGKLILI-NNN--GIRIARGVNSLITLSKEDTEDLKKIKIVEAIDMIQDDILQTWNENYVGKVTNKYDNKI-LFLSA 277 (355) Q Consensus 202 ai~~G~lvl~-~dg--~v~I~~~INSltt~~~~k~~~f~kirvvr~~D~i~~di~~~~~~~yiGK~~N~~~gr~-~~~~~ 277 (355) +=++|-.++. ..| ++-+..| |+| ...+.||+.|.-+|++|.+.+.+|...=.+...+.-|+..+-. 
..++- T Consensus 233 Ld~agy~vp~~Y~gy~G~Y~~d~-~tl----~~~gsDY~~ie~~RVvdKa~R~vR~~ai~~i~D~~lnst~~sia~~~~y 307 (376) T protein:vir:37 233 LETARYSVPMWYPDYDGYYWADG-RTL----DVEGGDYQVIENLRVVDKVARKVRLLAIGKIADRSFNSTTSSTEYHKNY 307 (376) T ss_pred HHhCCCeEEEeeCCCCceEEeCc-eEe----ccCCCChhhhhhhhHHHHHHHHHHHHHHHHhCCcccCcchhhHHHHHHH Confidence 7789988874 333 6666554 444 5667899999999999999999998877644445546655432 22333 Q ss_pred HHHHHHHHHhcccccCCCCceeEeccccchhhhhccccccccccceeeeccCCCCEEEEEEEEEEEeeeeEEEEEEeC Q lcl|NC_019422. 278 VNNYFKELQRDEVLDNSQEAYAQIDIEAHKKYLKEKGIDYSEMTEQQIKEANTGSYVFVEGNITITDAMEDLKFKIYM 355 (355) Q Consensus 278 i~~yl~~l~~~g~I~~~~~~~v~id~e~q~~~~~~~~~d~~~~~d~~v~~~~~~d~v~~~~~i~~vdamEkiy~tv~v 355 (355) +..=|++|.+...|..-. +-=+|.++.+ .|+.+.- ..+..|-+.+.++|.++-.+|...|.+ T Consensus 308 i~~pLr~M~~s~~i~g~~-fpGeI~~p~d--------------~Di~i~w-~s~~~V~I~~~v~P~~~pk~Itv~I~L 369 (376) T protein:vir:37 308 FAKPLRDMSKSATINGKD-FPGECMPPKD--------------DAITIVW-QSKTKVTIYIKVRPYDCPKEITANIFL 369 (376) T ss_pred HHHHHHHHHhcchhcccc-ccceeecCCC--------------CCceEEe-eccceEEEEEEEEeccCCceEEEEEEe Confidence 334456776665553311 0001222111 1333332 234678899999999999999888888 No 60 >protein:vir:101576 Length: 501 # NCBI annotation: gp04 # Family: family:all:396 # MgeID: mge:1477 # MgeName: Bcep43 # Cross-refs: genbank:acc:NP_958108;genbank:gi:41057654;genbank:GeneID:2716834 Probab=98.11 E-value=4.3e-06 Score=50.02 Aligned_cols=298 Identities=9% Similarity=0.045 Sum_probs=154.9 Q ss_pred CCCCceEEEeeeeeeeeecCCCceeEEEEEecCCccceeEEEeehhhhhhhhhhHHHHHHHHhh--hccccceEEEEecC Q lcl|NC_019422. 1 MGLPSAIIEFQRRSRTVKFRSRRGVVALILKDSTAIKKSYSIDFLTDINETEFTKENYDYIRLA--FLGKPSKVIVEVIN 78 (355) Q Consensus 1 ~g~P~~~i~f~~~a~ta~~~~~rG~v~iil~d~~~~~~~~~~~~~~d~~~~~~~~~n~~~i~~a--~~g~~~~v~l~~g~ 78 (355) ++.+++.++|.+.. . ..++.-..+....+.++.+.. .++... +..+...++...|. 
T Consensus 150 l~~~~~tv~~d~~~---------~-~f~its~ttG~~~~i~~~~~~------------~~la~~l~Lt~~~~a~v~~~g~ 207 (501) T protein:vir:10 150 FTSPDFVVAYDALR---------N-RFTVVTNATGTAAAISAVTGT------------NNLADELGLSAAAGATLQAAGV 207 (501) T ss_pred ccCCceEEEEcccC---------c-eEEEEeeccCCceeEEEeeCc------------hhhhhhcCccccccceEEecCc Confidence 22332222222111 1 111111112222222332211 122111 11111122222222 Q ss_pred CCccchhHHHHHHHHHhcccceEEE---EcCCChHHHHHHHHHHHHHHHhcCCeEEEEecCCC----------------- Q lcl|NC_019422. 79 DSVDSERSLDDALKALRENKFNYLA---IPFISEEVDKTKIVNWIKTARREKEIYKAVLPNIS----------------- 138 (355) Q Consensus 79 ~g~~~~~~y~~al~~le~~~fn~l~---~p~~~d~~~~~~~~~~ik~~r~~g~~~~aVl~~~~----------------- 138 (355) . .+.-.++++++.....||.. ++. .+++.+..+..|+.. +++++..+..... T Consensus 208 ~----aet~~~a~~a~~~~~~~Wy~f~~a~~-~~~~~~la~A~wiea---~~~~f~~~~~~~~~~~~~~~~~~~i~~~l~ 279 (501) T protein:vir:10 208 A----ADTPASAMNRAVGLSRNWATFTTAWT-AVIADRLAFAAWNSG---QAYKYMYVAPDLEAASIVTNNAASFGAQVF 279 (501) T ss_pred c----cccHHHHHHHHHhccCceEEEEEecC-CChHHHHHHHHHHHh---cCceEEEEEecCchhhhhhhhhhhHHHHHH Confidence 2 24578999999988777754 333 345567889999973 4555555443211 Q ss_pred CcCcceeEEecCCeEecCCceecHHHHHHHHHHHhcCccc---cccccc--ccCC-cccc-cCChhhHHHHHhCCeEEEE Q lcl|NC_019422. 139 DANEKAIINFATTGIKVGEKSYTTAEYTARLAGILAGISL---SESCTY--FILD-EVTE-IEPTENPDEAVEEGKLILI 211 (355) Q Consensus 139 ~~d~egIinv~n~~i~~~~~~~~~~~~~a~vAG~~Ag~~~---~~S~T~--~~~~-~~~~-~~~~~e~~~ai~~G~lvl~ 211 (355) ..++.+.+-+ |.....++.+.|..|+... +.++|+ +.++ |+.. +++..|.+.+.++|.-++. T Consensus 280 ~~~y~~t~~~-----------y~~~~~~aa~~g~~as~nf~~~~g~~T~~fkq~~~Gi~a~~lt~t~a~al~~~~~N~y~ 348 (501) T protein:vir:10 280 AAPYQGTLPL-----------YGDQATAGAVMGYAASINFQLRNGRTVLAFRQFNAGVPATAHDLPTANALRSNNYTYIG 348 (501) T ss_pred hcCCCceEEE-----------CCCCcHHHHHHHHHHhhCcccCccceeeeccccCCCcCcccCCHHHHHHHHhcCCeEEE Confidence 0112222211 1112233444666666544 345554 5676 7864 6889999999999988874 Q ss_pred E----CCcEE-EEecCccccccCCCCCchhhhhhhHhhHH----HHHHHHHHHHhhccccccCCCHHHHHHHHHHHHHHH Q lcl|NC_019422. 
212 N----NNGIR-IARGVNSLITLSKEDTEDLKKIKIVEAID----MIQDDILQTWNENYVGKVTNKYDNKILFLSAVNNYF 282 (355) Q Consensus 212 ~----dg~v~-I~~~INSltt~~~~k~~~f~kirvvr~~D----~i~~di~~~~~~~yiGK~~N~~~gr~~~~~~i~~yl 282 (355) + +.... +.+|+-| -+|.-|.+++-.| .+...+...|-.. +|+|=+..|..++.+.+..-+ T Consensus 349 ~~~~~~~~~~~~~~G~~s---------G~~~wiD~~~~~~Wl~~~iq~~l~~ll~~~--~kIPyt~~G~~~l~a~v~~~l 417 (501) T protein:vir:10 349 AYANAANNYTIAYDGKLS---------GKFLWVDTYLDQIYLNAELQRAEFEAMLAY--NSLPYNEDGYTALYRAGVDVI 417 (501) T ss_pred EeccccceeeEEecCeee---------ccceeehhhhhHHHHHHHHHHHHHHHHHhc--CCcccCHHHHHHHHHHHHHHH Confidence 3 22343 4577422 1355577766444 4445555555442 899999999999999999999 Q ss_pred HHHHhcccccCCCCce----eEecc------c----cchhhhhccccccccccceeeeccCCCCEEEEEEEEEEEeeeeE Q lcl|NC_019422. 283 KELQRDEVLDNSQEAY----AQIDI------E----AHKKYLKEKGIDYSEMTEQQIKEANTGSYVFVEGNITITDAMED 348 (355) Q Consensus 283 ~~l~~~g~I~~~~~~~----v~id~------e----~q~~~~~~~~~d~~~~~d~~v~~~~~~d~v~~~~~i~~vdamEk 348 (355) ++-.+.|+|.++.... .+|+- . +.|-|+...+. ..... .....+...-+.+..+-=+|+-. T Consensus 418 ~~av~nG~I~~Gv~l~~~q~~~i~~~~g~~~~~~~v~~~Gyy~~~~~--~~~~~---~~R~~R~~p~~~~~y~~~gaIh~ 492 (501) T protein:vir:10 418 DAAVTSGIIRAGVTLTNSQLQQIDAAAGVAGAGQLVQTRGWYFLIGD--PANPG---QARQNRTTPACTLWYSDGGSIQQ 492 (501) T ss_pred HHHHhCceeecCCCCCcccceeeccccCccccccceeccceeEeecc--ccCCh---hhhhhccccceEEEEEeCCceeE Confidence 9999999999874311 11111 1 12334333221 11111 12334445567777777788888 Q ss_pred EEEE-EeC Q lcl|NC_019422. 
349 LKFK-IYM 355 (355) Q Consensus 349 iy~t-v~v 355 (355) +.+- +.| T Consensus 493 v~i~s~~v 500 (501) T protein:vir:10 493 LTIGSNAV 500 (501) T ss_pred EEeeeeec Confidence 8743 334 No 61 >protein:vir:78611 Length: 501 # NCBI annotation: BcepNY3gp03 # Family: family:all:396 # MgeID: mge:1854 # MgeName: BcepNY3 # Cross-refs: genbank:acc:YP_001294840;genbank:gi:149882903;genbank:GeneID:5291082 Probab=98.00 E-value=7.7e-06 Score=48.62 Aligned_cols=299 Identities=10% Similarity=0.059 Sum_probs=155.1 Q ss_pred CCCCceEEEeeeeeeeeecCCCceeEEEEEecCCccceeEEEeehhhhhhhhhhHHHHHHHHhhhc-cccceEEEEecCC Q lcl|NC_019422. 1 MGLPSAIIEFQRRSRTVKFRSRRGVVALILKDSTAIKKSYSIDFLTDINETEFTKENYDYIRLAFL-GKPSKVIVEVIND 79 (355) Q Consensus 1 ~g~P~~~i~f~~~a~ta~~~~~rG~v~iil~d~~~~~~~~~~~~~~d~~~~~~~~~n~~~i~~a~~-g~~~~v~l~~g~~ 79 (355) ++.+++.++|.+..... ++-...+....+..+.+.. .+++ .. +.+. +.++.+...| .. T Consensus 150 l~a~~~tv~~ds~~~~f----------~its~t~G~~~~i~~~t~~----~~~a----~~--l~Lt~~~~a~v~~~g-~~ 208 (501) T protein:vir:78 150 FTSPDFVVSYDALRNRF----------VVNTNATGTAAAISAVTGT----NNLA----DE--LGLSAAAGASLQAAG-VA 208 (501) T ss_pred hcCcceEEEEccccceE----------EEEeeecCCceeEEEEecc----cchh----hh--hcccccCceeeEecc-cc Confidence 44444444433322111 1111111111222222210 0000 01 1111 2233333322 22 Q ss_pred CccchhHHHHHHHHHhcccceEEE---EcCCChHHHHHHHHHHHHHHHhcCCeEEEEecCCC-----------------C Q lcl|NC_019422. 80 SVDSERSLDDALKALRENKFNYLA---IPFISEEVDKTKIVNWIKTARREKEIYKAVLPNIS-----------------D 139 (355) Q Consensus 80 g~~~~~~y~~al~~le~~~fn~l~---~p~~~d~~~~~~~~~~ik~~r~~g~~~~aVl~~~~-----------------~ 139 (355) .+...++++++.....||.. ++. .+++.+..+..|+.. +++++..+..... . 
T Consensus 209 ----aet~~~a~~a~~~~~~~Wy~f~~a~~-~~~~~~lalA~wiea---~~~~f~~~~~~~~~~~~~~~~~~~i~~~l~a 280 (501) T protein:vir:78 209 ----ADTPASAMNRAVGLSRNWATFTTAWT-AVIADRLALASWNSG---QAYKYMYVAPDLEPASIVTNNSASFGAQVFA 280 (501) T ss_pred ----ccCHHHHHHHHHhccCceEEEEEecC-CCHHHHHHHHHHHHh---cCceEEEEEecCCcceeecccchhHHHHHhh Confidence 24578999999988878753 333 345567899999983 4555554443211 0 Q ss_pred cCcceeEEecCCeEecCCceecHHHHHHHHHHHhcCccc---cccccc--ccC-Ccccc-cCChhhHHHHHhCCeEEEE- Q lcl|NC_019422. 140 ANEKAIINFATTGIKVGEKSYTTAEYTARLAGILAGISL---SESCTY--FIL-DEVTE-IEPTENPDEAVEEGKLILI- 211 (355) Q Consensus 140 ~d~egIinv~n~~i~~~~~~~~~~~~~a~vAG~~Ag~~~---~~S~T~--~~~-~~~~~-~~~~~e~~~ai~~G~lvl~- 211 (355) .++.+.+-+ |.....++.+.|..|+... +.++|+ +.+ +|+.. +++..|.+.+.++|.-++. T Consensus 281 ~~y~~t~~~-----------y~~~~~~aa~~g~~as~nf~~~~g~~T~~fkq~~~Gv~a~~l~~t~a~al~~~~~N~y~~ 349 (501) T protein:vir:78 281 APYQGTLPL-----------YGDQATAGAVMGYAASINFQLRNGRTVLAFRQFNAGVPATAHDLGTANALRSNNYTYIGA 349 (501) T ss_pred cCCCceEEE-----------cCCcchHHHHHHHHHhcCcccCcceeeeeccccCCCcCcccCCHHHHHHHHhcCCeEEEE Confidence 112222211 1222234455666666554 345555 465 78875 6788999999999987653 Q ss_pred --E-CCcEE-EEecCccccccCCCCCchhhhhhhHhhHHHHHHH----HHHHHhhccccccCCCHHHHHHHHHHHHHHHH Q lcl|NC_019422. 212 --N-NNGIR-IARGVNSLITLSKEDTEDLKKIKIVEAIDMIQDD----ILQTWNENYVGKVTNKYDNKILFLSAVNNYFK 283 (355) Q Consensus 212 --~-dg~v~-I~~~INSltt~~~~k~~~f~kirvvr~~D~i~~d----i~~~~~~~yiGK~~N~~~gr~~~~~~i~~yl~ 283 (355) . +.... +.+|+-| -+|.-|.+++-.|-+.+. +...|... 
+|+|=+..|..++.+.+..-++ T Consensus 350 ~~~~~~~~~~~~~G~~s---------G~~~wiD~~~~~~Wl~~~iq~~l~~ll~~~--~kIPyt~~G~~~l~a~v~~~l~ 418 (501) T protein:vir:78 350 YANAANNYTIAYDGKLS---------GKFLWVDTYLDQIYLNAELQRAEFEAMLAY--NSLPYNEDGYTALYRAGVDVID 418 (501) T ss_pred EecccceeeEEEcCeee---------ccceeehhhhhHHHHHHHHHHHHHHHHHhC--CCcccCHHHHHHHHHHHHHHHH Confidence 1 23444 4567422 135557777655544444 44444332 8999999999999999999999 Q ss_pred HHHhcccccCCCCce----eEe------ccc----cchhhhhccccccccccceeeeccCCCCEEEEEEEEEEEeeeeEE Q lcl|NC_019422. 284 ELQRDEVLDNSQEAY----AQI------DIE----AHKKYLKEKGIDYSEMTEQQIKEANTGSYVFVEGNITITDAMEDL 349 (355) Q Consensus 284 ~l~~~g~I~~~~~~~----v~i------d~e----~q~~~~~~~~~d~~~~~d~~v~~~~~~d~v~~~~~i~~vdamEki 349 (355) +-.+.|+|.++.... .+| |+. +.|-|+...+. ..... .....+...-+...++-=+|+-.+ T Consensus 419 ~av~nG~I~~Gv~l~~~q~~~I~~~~g~~~~~~~~~~~Gyy~~~~~--~~~~~---~~R~~R~~p~~~~~y~~~gaIh~v 493 (501) T protein:vir:78 419 AAVTSGIIRAGVTLTNSQLQQIDAAAGVAGAGQLVQMRGWYFLIGD--PANPG---QARQNRTTPTCTLWYSDGGSIQEL 493 (501) T ss_pred HHHhCceeecCCCCCCccceeeccccCccccccceeccceEEeecc--ccCCh---hhhhhcccCcEEEEEEeCCceeEE Confidence 999999999874311 111 111 12233332221 11111 113334445566777777888888 Q ss_pred EEE-EeC Q lcl|NC_019422. 350 KFK-IYM 355 (355) Q Consensus 350 y~t-v~v 355 (355) .+- +.| T Consensus 494 ~i~s~~v 500 (501) T protein:vir:78 494 TIGSNAV 500 (501) T ss_pred Eeeeeec Confidence 743 334 No 62 >protein:vir:94073 Length: 494 # NCBI annotation: hypothetical protein # Family: family:all:396 # MgeID: mge:1493 # MgeName: OP2 # Cross-refs: genbank:acc:YP_453619;genbank:gi:84662655;genbank:GeneID:5142583 Probab=97.98 E-value=8.4e-06 Score=48.42 Aligned_cols=293 Identities=10% Similarity=0.046 Sum_probs=146.1 Q ss_pred CCCCceEEEeeeeeeeeecCCCceeEEEEEecCCccceeEEEeehhhhhhhhhhHHHHHHHHhhhc-cccceEEEEecCC Q lcl|NC_019422. 
1 MGLPSAIIEFQRRSRTVKFRSRRGVVALILKDSTAIKKSYSIDFLTDINETEFTKENYDYIRLAFL-GKPSKVIVEVIND 79 (355) Q Consensus 1 ~g~P~~~i~f~~~a~ta~~~~~rG~v~iil~d~~~~~~~~~~~~~~d~~~~~~~~~n~~~i~~a~~-g~~~~v~l~~g~~ 79 (355) ++.++..++|..... |. ++....+...-...+.+ .++ + .. +.+. ++++.+.. .|.+ T Consensus 147 i~~a~~~v~~d~~~~-------~f---~v~s~ttG~~s~is~~t-~~~-----a----~~--l~lt~~~~a~v~~-~g~~ 203 (494) T protein:vir:94 147 FTTPNFAITYDAQRR-------RF---VLSTTATGTTASVSAVT-GTL-----A----DG--VGLSTASGAYVEG-SGLA 203 (494) T ss_pred hccccceEEEcccCc-------EE---EEEEccCCceeEEEEec-cch-----h----hh--hhhhccccceEee-cCcc Confidence 333332222221110 00 11111111111112221 010 0 00 1112 12333333 2222 Q ss_pred CccchhHHHHHHHHHhcccceEE--EEcCCChHHHHHHHHHHHHHHHhcCCeEEEEecCC-C----------------Cc Q lcl|NC_019422. 80 SVDSERSLDDALKALRENKFNYL--AIPFISEEVDKTKIVNWIKTARREKEIYKAVLPNI-S----------------DA 140 (355) Q Consensus 80 g~~~~~~y~~al~~le~~~fn~l--~~p~~~d~~~~~~~~~~ik~~r~~g~~~~aVl~~~-~----------------~~ 140 (355) .+...++++++....-||. .+....+.+.+..+.+|+.. .++++..+.... + .. T Consensus 204 ----aet~~~a~~a~~~~~~~Wy~f~~~~~~~~~~ilalA~wiea---~~~~~~~~~~~~d~~~~~~~~~~~i~~~l~~~ 276 (494) T protein:vir:94 204 ----ADTAASALDRLAASSSTWAIFTTAWAASLSDRTALAQWTSD---QVFRRIYAAWDQDAAGLSVNNVSSFGNIVKTT 276 (494) T ss_pred ----cccHHHHHHHHHhccCceEEEEEecCCCHHHHHHHHHHHhh---cCccEEEEEecCCcceeecccchhHHHHHHhh Confidence 3567899999987765664 33333445678899999984 344444433211 0 11 Q ss_pred CcceeEEecCCeEecCCceecHHHHHHHHHHHhcCcccc-----cccccc-cCCcccc-cCChhhHHHHHhCCeEEEEE- Q lcl|NC_019422. 141 NEKAIINFATTGIKVGEKSYTTAEYTARLAGILAGISLS-----ESCTYF-ILDEVTE-IEPTENPDEAVEEGKLILIN- 212 (355) Q Consensus 141 d~egIinv~n~~i~~~~~~~~~~~~~a~vAG~~Ag~~~~-----~S~T~~-~~~~~~~-~~~~~e~~~ai~~G~lvl~~- 212 (355) ++++.+-+-+ ...-.+.+-|+.|+..++ .+++|+ .++++.. 
+++.+|.+.+.++|.-++.+ T Consensus 277 ~y~~t~~~y~-----------~~~~~aa~~g~~aa~~~~~~~g~~T~~~k~q~~gi~~~~l~~t~a~al~~~~~N~y~~~ 345 (494) T protein:vir:94 277 PFSNTIPVYG-----------LLANAMIVLAWGASTNLQIAEGRTTLALRSPVSSAGVRVDNLANANALLSNGYTYLGKY 345 (494) T ss_pred cCCceEEEcC-----------CCChHHHHHHHHHhccccccCcceeEEeeccCCCCCCccCCHHHHHHHHhcCCeEEEEe Confidence 2223332222 111223445566666653 455565 5788875 56778999999999888753 Q ss_pred C--C-cEEEEecCccccccCCCCCchhhhhhhHh----hHHHHHHHHHHHHhhccccccCCCHHHHHHHHHHHHHHHHHH Q lcl|NC_019422. 213 N--N-GIRIARGVNSLITLSKEDTEDLKKIKIVE----AIDMIQDDILQTWNENYVGKVTNKYDNKILFLSAVNNYFKEL 285 (355) Q Consensus 213 d--g-~v~I~~~INSltt~~~~k~~~f~kirvvr----~~D~i~~di~~~~~~~yiGK~~N~~~gr~~~~~~i~~yl~~l 285 (355) . + ...+.+|.. +. -+|.-|...+ .-..+..++...|-.. +|+|=+..|..++.+.+..-+++- T Consensus 346 ~~~~~~~~~~~gg~-~s-------G~~~~id~~~~~~WL~~~iq~~l~~ll~~~--~KIPytd~G~~~l~a~i~~~l~~a 415 (494) T protein:vir:94 346 ASATNTYTVTYNGA-IG-------GQFLWADTALGWIALRRNLQQALFETLLAY--RSLPYNADGYNALYQGAQDVVSQF 415 (494) T ss_pred cccCceEEEecCce-ec-------cccceeeeeccHHHHHHHHHHHHHHHHHhC--CCcccChhhHHHHHHHHHHHHHHH Confidence 2 2 345555542 21 1121122221 2344566666666643 899999999999999999999999 Q ss_pred HhcccccCCCCceeEeccccchhhh------------------hccccccccccceeeeccCCCCEEEEEEEEEEEeeee Q lcl|NC_019422. 286 QRDEVLDNSQEAYAQIDIEAHKKYL------------------KEKGIDYSEMTEQQIKEANTGSYVFVEGNITITDAME 347 (355) Q Consensus 286 ~~~g~I~~~~~~~v~id~e~q~~~~------------------~~~~~d~~~~~d~~v~~~~~~d~v~~~~~i~~vdamE 347 (355) .+.|+|.++.... +.|+.++ ... .++....+. .+.....-.|+.. -=+|+. T Consensus 416 v~nG~I~~Gv~~~-----~~q~~~i~~~~G~~~~~~~~~kGyy~~~-~~~~s~~~r--a~R~~~~~~~~y~---~~GAIh 484 (494) T protein:vir:94 416 VAAGVIRAGVALS-----ASQRAQIDQAAGVPISGDVVDKGWYLQV-IDPITTTVR--TDRGSPTVNFWYC---DGGSIQ 484 (494) T ss_pred HhCceeecccccC-----cchhhhhhhhhcCccccceeccceeeec-cCCCChhhh--hccccCCceEEEE---ecCcEE Confidence 9999999864321 2222222 111 122222222 2222223333332 267788 Q ss_pred EEEEEEeC Q lcl|NC_019422. 
348 DLKFKIYM 355 (355) Q Consensus 348 kiy~tv~v 355 (355) .+.+..++ T Consensus 485 ~v~i~~~~ 492 (494) T protein:vir:94 485 RVVVSATT 492 (494) T ss_pred EEEEeeEE Confidence 88887777 No 63 >protein:vir:96104 Length: 504 # NCBI annotation: hypothetical protein ORF029 # Family: family:all:396 # MgeID: mge:1597 # MgeName: F8 # Cross-refs: genbank:acc:YP_001294446;genbank:gi:149408343;genbank:GeneID:5237223 Probab=97.82 E-value=1.7e-05 Score=46.75 Aligned_cols=325 Identities=11% Similarity=0.055 Sum_probs=166.0 Q ss_pred CCCC--ceEEEeeeeeeee-----------------ecCCCcee------EEEEEecCCccceeEEEeehhhhhhh---h Q lcl|NC_019422. 1 MGLP--SAIIEFQRRSRTV-----------------KFRSRRGV------VALILKDSTAIKKSYSIDFLTDINET---E 52 (355) Q Consensus 1 ~g~P--~~~i~f~~~a~ta-----------------~~~~~rG~------v~iil~d~~~~~~~~~~~~~~d~~~~---~ 52 (355) ..++ .+.|++.+...+. +.-+.+-- .+.+..|.+. ..+++.+.+.-... . T Consensus 108 ~~i~~G~lsitv~G~~~~~~~i~~S~~ts~~~vA~~i~~al~~~~~~~~~~~tv~~d~~~--~~f~its~~tg~~~~~~~ 185 (504) T protein:vir:96 108 AGFSAGVLTIMVGAAEKNITAIDTSAATSMDNVASIIQTEIRKNTDPQLAQATVTWNPNT--NQFTLVGATIGTGVLAVA 185 (504) T ss_pred hhhhceEEEEEEcceeeeecccccccccchHHHHHHHHhhhhcccccccccceEEEeccC--CeEEEEeeccccceeEEE Confidence 2222 2333333322111 00000000 0112222221 11222111110000 0 Q ss_pred hhHHHHHHHHhh-hccccceEEEEecCCCccchhHHHHHHHHHhcccceEE--EEcCC-ChHHHHHHHHHHHHHHHhcCC Q lcl|NC_019422. 53 FTKENYDYIRLA-FLGKPSKVIVEVINDSVDSERSLDDALKALRENKFNYL--AIPFI-SEEVDKTKIVNWIKTARREKE 128 (355) Q Consensus 53 ~~~~n~~~i~~a-~~g~~~~v~l~~g~~g~~~~~~y~~al~~le~~~fn~l--~~p~~-~d~~~~~~~~~~ik~~r~~g~ 128 (355) .++.- -.+... .+..+..+++.| .+ .+..+++|.++....-||. .+... .++..+..+..|+.. 
.++ T Consensus 186 ~~a~~-~~~~~~lgl~~~~~~~v~g-~~----aet~~~al~al~~~~~~Wy~f~~a~~~~~dd~ilalA~w~ea---~~~ 256 (504) T protein:vir:96 186 KSADP-QDMSTALGWSTSNVVNVAG-QA----ADLPDAAVAKSTNVSNNFGSFLFAGATLDNDQIKAVSAWNAA---QNN 256 (504) T ss_pred eeccc-cchhhhhhcccccceEEee-cc----cccHHHHHHHHHhhcCCeEEEEEEeccCCHHHHHHHHHHHhh---cCc Confidence 00000 001111 112233344333 22 2457789999987765554 33221 223345688899983 455 Q ss_pred eEEEEecCCCCcCcc-----------eeEEecCCeEecCCceecHHHHHHHHHHHhcCccc---ccccc--cccCCccc- Q lcl|NC_019422. 129 IYKAVLPNISDANEK-----------AIINFATTGIKVGEKSYTTAEYTARLAGILAGISL---SESCT--YFILDEVT- 191 (355) Q Consensus 129 ~~~aVl~~~~~~d~e-----------gIinv~n~~i~~~~~~~~~~~~~a~vAG~~Ag~~~---~~S~T--~~~~~~~~- 191 (355) ++..++.... ++.+ ..+++.... ....+.+ +...|..|+... +.++| |+.++||. T Consensus 257 ~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~----~~~~~~~as~~f~~~ng~~T~~fk~l~GVta 328 (504) T protein:vir:96 257 QFIYTVATSL-ANLGALFDLVKGNSGTALNVLSAT---ASNDFVE----QCPSEILAATNYDEPGASQNYMYYQFPGRNI 328 (504) T ss_pred eEEEEEeecc-cchhhHHHhhhhcceeEEEEeecC---ccchhHH----HHHHHHHHhcCcCcccccccccccccCCcCc Confidence 6665554321 1211 111111110 1112222 223444444432 33444 56899998 Q ss_pred ccCChhhHHHHHhCCeEEEEE---CC-cE-EEEecCccccccCCCCCc-hhhhhhhHhhHHHHHHHHHHHHhhccc--cc Q lcl|NC_019422. 192 EIEPTENPDEAVEEGKLILIN---NN-GI-RIARGVNSLITLSKEDTE-DLKKIKIVEAIDMIQDDILQTWNENYV--GK 263 (355) Q Consensus 192 ~~~~~~e~~~ai~~G~lvl~~---dg-~v-~I~~~INSltt~~~~k~~-~f~kirvvr~~D~i~~di~~~~~~~yi--GK 263 (355) ++++..|.+.+.++|.-++.. .+ .. .+.+|+-+ + ++ +|.-|.+++-.|-+.+.++..+-+-|. 
+| T Consensus 329 ~~lt~t~~~aL~~~~~N~y~~~a~~~~~~~~~~~G~~~------g-G~~~~~wiDv~~~~~WL~~~lq~~l~~l~~~~~k 401 (504) T protein:vir:96 329 TVSDDTAANTVDKSRGNYIGVTQANGQQLAFYQRGILC------G-GPTDAVDMNVYANEIWLKSAIAQALLDLFLNVNA 401 (504) T ss_pred ccCCHHHHHHHHhcCCeEEEEeecccceeeEEecCeee------C-CccccchhhhhhhHHHHHHHHHHHHHHHHhcCCC Confidence 467889999999998776632 22 33 35677654 2 34 677788888877777777665554343 79 Q ss_pred cCCCHHHHHHHHHHHHHHHHHHHhcccccCCCCcee----Eec------cc----cchhhhhccccccccccceeeeccC Q lcl|NC_019422. 264 VTNKYDNKILFLSAVNNYFKELQRDEVLDNSQEAYA----QID------IE----AHKKYLKEKGIDYSEMTEQQIKEAN 329 (355) Q Consensus 264 ~~N~~~gr~~~~~~i~~yl~~l~~~g~I~~~~~~~v----~id------~e----~q~~~~~~~~~d~~~~~d~~v~~~~ 329 (355) +|=+.+|..++.+.++.-+++-.+.|+|.++..... .|+ .. +.|-|+.. ...++++.-.... T Consensus 402 IPyt~~Gi~~l~a~i~~vl~~av~~G~I~~Gv~~~~~q~~~I~~~~~~d~~~~~~~~~GYyv~----~~~~s~~s~~~r~ 477 (504) T protein:vir:96 402 VPASMVGEAMTLAVLQPVLDKATSNGTFTYGKDISAVQQQYITQITGDRRAWRQVQTLGYWIN----ITFSSYTNSNTGL 477 (504) T ss_pred cccCHhhHHHHHHHHHHHHHHHHhcceeccCccCCccchheecccccccccccceeccceEEE----ecChhccChhHhh Confidence 999999999999999999999999999998743221 011 11 01223322 2233344334445 Q ss_pred CCCEEEEEEEEEEEeeeeEEEEEEeC Q lcl|NC_019422. 330 TGSYVFVEGNITITDAMEDLKFKIYM 355 (355) Q Consensus 330 ~~d~v~~~~~i~~vdamEkiy~tv~v 355 (355) .+-..-+.+..+.=+|+.++.+.-.+ T Consensus 478 ~R~~~~~~~~y~~~gaI~~v~~~~~~ 503 (504) T protein:vir:96 478 TEWKANYTLIYSKGDAIRFVEGSDVM 503 (504) T ss_pred hccccceEEEEEECCeEEEEEecccc Confidence 55666788888889999999988777 No 64 >protein:vir:78782 Length: 370 # NCBI annotation: putative tail sheath protein # Family: family:all:669 # MgeID: mge:1857 # MgeName: phiO18P # Cross-refs: genbank:acc:YP_001285652;genbank:gi:148727158;genbank:GeneID:5220132 Probab=97.71 E-value=2.7e-05 Score=45.67 Aligned_cols=316 Identities=13% Similarity=0.070 Sum_probs=169.4 Q ss_pred CCCceEEEeeeeeeeeecCCCceeEEEEE-ecCCccceeEEEeehhhhhhhhhhH---HHHHHHHhhhccccceEEEEec Q lcl|NC_019422. 
2 GLPSAIIEFQRRSRTVKFRSRRGVVALIL-KDSTAIKKSYSIDFLTDINETEFTK---ENYDYIRLAFLGKPSKVIVEVI 77 (355) Q Consensus 2 g~P~~~i~f~~~a~ta~~~~~rG~v~iil-~d~~~~~~~~~~~~~~d~~~~~~~~---~n~~~i~~a~~g~~~~v~l~~g 77 (355) =.|.+.|.=-.+..-....-+| ..+.+ ...+...+.+.+.+.+|+..- +.+ .-+..+..+..++...-..+.. T Consensus 1 ~~~~v~vn~~n~~~g~~~~~er--~~lfig~~~~~~g~~~~~~~~sdld~~-l~~~ds~lk~~v~aa~~naG~~~~~~~~ 77 (370) T protein:vir:78 1 MWPYVQIYNLNQMQGPVTEVER--HLLFIGSAASNTGKLLSLNAQSDFDQL-LGAADSELKANLLAARDNAGQNWSAAAY 77 (370) T ss_pred CCceEEEeeccccCCCcCccce--eEEEEecccccccceEeecCccCHHHh-cCCcChhHHHHHHHHHhCCCCceEEEEE Confidence 3566555433332222222222 22222 333444556667777776433 322 2245566676655433322111 Q ss_pred CCCccchhHHHHHHHHH-hcccceEEEEcCC-ChHHHHHHHHHHHHHHHhc-CCeEEEEecCCCCcCc------------ Q lcl|NC_019422. 78 NDSVDSERSLDDALKAL-RENKFNYLAIPFI-SEEVDKTKIVNWIKTARRE-KEIYKAVLPNISDANE------------ 142 (355) Q Consensus 78 ~~g~~~~~~y~~al~~l-e~~~fn~l~~p~~-~d~~~~~~~~~~ik~~r~~-g~~~~aVl~~~~~~d~------------ 142 (355) +..+..+|.+|++.. +.+.|-++++-+. ++.+.-+.+.+....+-.. ||.+..++......+. T Consensus 78 --p~~~~~d~~~Av~~a~~~~s~E~V~v~~~~s~~a~~~a~~~~a~el~n~~~Rpv~file~~~~~~~e~w~~y~~~l~a 155 (370) T protein:vir:78 78 --VLPTDKPWLDAARDAQQTQSFEGVVVLGQEWHQAAINAAHALNQELIAKWGRWQFMLLAVPAIADEQDWATYEAELAT 155 (370) T ss_pred --EecCchhHHHHHHHHHhhCCccEEEEecCcchHHHHHHHHHHHHHHHHhcCCeEEEEEeecCCCCcCCHHHHHHHHHH Confidence 122356899999666 5566666665542 3434445555555555554 7888877764221111 Q ss_pred --ceeE----EecCCeEecCCceecHHHHHHHHHHHhcCc--ccccccc----cc-------cCCcccccCChhhHHHHH Q lcl|NC_019422. 143 --KAII----NFATTGIKVGEKSYTTAEYTARLAGILAGI--SLSESCT----YF-------ILDEVTEIEPTENPDEAV 203 (355) Q Consensus 143 --egIi----nv~n~~i~~~~~~~~~~~~~a~vAG~~Ag~--~~~~S~T----~~-------~~~~~~~~~~~~e~~~ai 203 (355) .|+. .+... +.|. ....+||.++.. .+..|+- .. 
|.+.-..-++...++++- T Consensus 156 l~~gia~~~V~vvp~---~~g~------~~G~~aGRL~naavsVadsP~Rv~tG~l~gl~~~p~d~~~~~l~~a~l~aLd 226 (370) T protein:vir:78 156 LQDGIAASSVSLIPQ---LWPT------LAGAYAGRLCNRAVSIADSPCRVKTGALVGLGNKPVGKDGIPLPLATLQTLE 226 (370) T ss_pred hhhccccccceEEee---eccc------cHHHHHHHHhcCeeeecccceeeeccccccccccccccCCcccCHHHHHHHH Confidence 1221 11111 1111 245567765322 2233332 11 222222234567888888 Q ss_pred hCCeEEEE-ECC--cEEEEecCccccccCCCCCchhhhhhhHhhHHHHHHHHHHHHhhccccccCCCHHHHHHHHHHHHH Q lcl|NC_019422. 204 EEGKLILI-NNN--GIRIARGVNSLITLSKEDTEDLKKIKIVEAIDMIQDDILQTWNENYVGKVTNKYDNKILFLSAVNN 280 (355) Q Consensus 204 ~~G~lvl~-~dg--~v~I~~~INSltt~~~~k~~~f~kirvvr~~D~i~~di~~~~~~~yiGK~~N~~~gr~~~~~~i~~ 280 (355) ++|-.++. ..| ++-+..| |+| ...+.||+.|..+|++|.+.+-+|.+.=++.-.+.-|+..|-. ++.+. T Consensus 227 ~agy~vp~~Y~gy~G~Y~~d~-~tl----~~~gsDYq~ie~~RVvdKa~R~vR~~ai~~i~D~~lnst~gsi---a~~~~ 298 (370) T protein:vir:78 227 ANRYSVPMWYPDYDGIYWADG-RTL----DAEGGDYQVIENLRIAYKVARRMRLRAIARIGDRSFNSTPGST---AAAIT 298 (370) T ss_pred hCCCeEEEeeCCCCceEEeCc-eEe----ccCCCChhhhhhhhHHHHHHHHHHHHHHHHhCCcccCCCCcch---hHHHH Confidence 99988884 333 6666554 444 5667899999999999999999997776645556668877665 33444 Q ss_pred HH----HHHHhcccccCCCCceeEeccccchhhhhccccccccccceeeeccCCCCEEEEEEEEEEEeeeeEEEEEEeC Q lcl|NC_019422. 281 YF----KELQRDEVLDNSQEAYAQIDIEAHKKYLKEKGIDYSEMTEQQIKEANTGSYVFVEGNITITDAMEDLKFKIYM 355 (355) Q Consensus 281 yl----~~l~~~g~I~~~~~~~v~id~e~q~~~~~~~~~d~~~~~d~~v~~~~~~d~v~~~~~i~~vdamEkiy~tv~v 355 (355) || .+|.+.+-|..- .+.-+|.++.. 
.|+.+.- ..+..|-+.+.++|.++=-+|...|.+ T Consensus 299 ~~~~~L~ema~s~~i~~~-~fpgeI~~p~d--------------~Di~i~w-~s~~~v~I~~~v~P~~~pk~Itv~I~L 361 (370) T protein:vir:78 299 YFGKDLREMAKSTTINGQ-PFPGDIASPQD--------------GDIRIQW-VAKNLVSVFVVVRTVDCPKGITVNIML 361 (370) T ss_pred HHHhhHHHHHhhhhhccc-ccceeEeccCC--------------CcceEEe-eccceEEEEEEEEeccCCceEEEEEEE Confidence 43 344445544321 11112222111 1222222 245678899999999999999988888 No 65 >protein:vir:3751 Length: 376 # NCBI annotation: orf23 # Family: family:all:669 # MgeID: mge:79 # MgeName: HP1 # Cross-refs: genbank:acc:NP_043492;genbank:gi:9628627;genbank:GeneID:1261129 Probab=97.56 E-value=4.5e-05 Score=44.43 Aligned_cols=323 Identities=11% Similarity=0.086 Sum_probs=177.8 Q ss_pred CCCceEEEeeeeeeeeecCCCceeEEEEEecCCccceeEEEeehhhhhhhhhhH---HHHHHHHhhhccccceEEEEecC Q lcl|NC_019422. 2 GLPSAIIEFQRRSRTVKFRSRRGVVALILKDSTAIKKSYSIDFLTDINETEFTK---ENYDYIRLAFLGKPSKVIVEVIN 78 (355) Q Consensus 2 g~P~~~i~f~~~a~ta~~~~~rG~v~iil~d~~~~~~~~~~~~~~d~~~~~~~~---~n~~~i~~a~~g~~~~v~l~~g~ 78 (355) =+|.+.|.=..+..-.+..-+|=.+.+-. ..+...+.+.+.+-+|+. ..+.+ .-+.-+..+..+++..-.... + T Consensus 1 ~~~~v~vn~ln~~qg~~~~ver~~lfig~-~~~~~~~~~~~~~~sdld-~~lg~~ds~lk~~v~aa~~naG~~w~a~~-~ 77 (376) T protein:vir:37 1 MFPSVQINALNQLSGETKEIERHALFVGV-GTTNQGKLLALTPDSDFD-KVFGETDTDLKKQVRAAMLNAGQNWFAHV-Y 77 (376) T ss_pred CCCeEEEeeeeccCCCcccccceEEEeec-cccccCceEEecCCCChH-HhhCCCchhHHHHHHHHHhCCCCceEEEE-E Confidence 46777665444433333223333333322 233445666777777763 33322 234556777776544332211 1 Q ss_pred CCccchhHHHHHHHHH-hcccceEEEEcC--CChHHHHHHHHHHHHHHHhc-CCeEEEEecCCC----CcCc-------- Q lcl|NC_019422. 79 DSVDSERSLDDALKAL-RENKFNYLAIPF--ISEEVDKTKIVNWIKTARRE-KEIYKAVLPNIS----DANE-------- 142 (355) Q Consensus 79 ~g~~~~~~y~~al~~l-e~~~fn~l~~p~--~~d~~~~~~~~~~ik~~r~~-g~~~~aVl~~~~----~~d~-------- 142 (355) ....++++|.+|++.. +.+.|-++++-+ .++.+.-+.+.+-...+... ||.+..++.... ..+. 
T Consensus 78 ~p~~~~~~~~~Av~~a~~~~s~E~V~v~~p~~t~~a~i~a~qa~a~el~~~~~R~vffile~~g~d~~~~~ge~w~~y~~ 157 (376) T protein:vir:37 78 IAQEDGYDFVECVKKANQTASFEYCVNTRYLGVDKASIGKLQECYAELLAKFGRRTFFIQAVQGINHDQSDGETWDQYVQ 157 (376) T ss_pred ecCCChhhHHHHHHHHHhhCCeeEEEEecCcchhHHHHHHHHHHHHHHHHhcCCeEEEEEeccCCCCcccccCCHHHHHH Confidence 1223457899999777 455666666543 22333333344434444444 777777775321 0111 Q ss_pred ------ceeEEecCCeEecCCceecHHHHHHHHHHHhc--Ccccccccccc---cCCcccc----------cCChhhHHH Q lcl|NC_019422. 143 ------KAIINFATTGIKVGEKSYTTAEYTARLAGILA--GISLSESCTYF---ILDEVTE----------IEPTENPDE 201 (355) Q Consensus 143 ------egIinv~n~~i~~~~~~~~~~~~~a~vAG~~A--g~~~~~S~T~~---~~~~~~~----------~~~~~e~~~ 201 (355) +|+.+-.-..+- .+-+ .....+||.+| +.++-+|+--. ++.++.. -++.+-+.+ T Consensus 158 ~l~a~~~gia~~~V~vV~----~~~g-n~~G~~aGRl~naaVsVadspgRV~tGai~gl~~~~~p~d~~g~el~~a~l~a 232 (376) T protein:vir:37 158 KLTTLQQTIVADHVCLVP----LLFG-NETGVLAGRLANRAVTVADSPARVQTGALVSLGSANKPLDKDGNELTLAHLKS 232 (376) T ss_pred HHHHHhccccccceeeee----eecc-chHHHHHHHHHhCCcchhcCccceeecccccccccccccccCCcccchHHHHH Confidence 132211000000 0000 13667788885 55666777543 2333321 123456667 Q ss_pred HHhCCeEEEE-ECC--cEEEEecCccccccCCCCCchhhhhhhHhhHHHHHHHHHHHHhhcccc-ccCCCHHHH-HHHHH Q lcl|NC_019422. 202 AVEEGKLILI-NNN--GIRIARGVNSLITLSKEDTEDLKKIKIVEAIDMIQDDILQTWNENYVG-KVTNKYDNK-ILFLS 276 (355) Q Consensus 202 ai~~G~lvl~-~dg--~v~I~~~INSltt~~~~k~~~f~kirvvr~~D~i~~di~~~~~~~yiG-K~~N~~~gr-~~~~~ 276 (355) +=++|-.++. ..| ++-+..| |+| ...+.||+.|.-+|++|.+.+.+|...=. +|+ +.-|+..+- ...+. 
T Consensus 233 Ld~arysvpr~Y~gydG~Yw~dg-~tl----~~~gsDYq~ie~~RVvdKa~R~vR~~Ai~-~i~Dr~lnstp~sia~~~~ 306 (376) T protein:vir:37 233 LETARYSVPMWYPDYDGYYRADG-RTL----DVEGGDYQVIENLRVVDKVARKVRLLAIG-KIADRSFNSTTSSTEYHKN 306 (376) T ss_pred HHhCCCeEEEeeCCCCceEEeCC-eEe----ccCCCCeeeehhchHHHHHHHHHHHHHHH-HhcCccccCChhHHHHHHH Confidence 7788988874 333 6666655 444 56778999999999999999999988776 555 445655544 33333 Q ss_pred HHHHHHHHHHhcccccCCC-CceeEeccccchhhhhccccccccccceeeeccCCCCEEEEEEEEEEEeeeeEEEEEEeC Q lcl|NC_019422. 277 AVNNYFKELQRDEVLDNSQ-EAYAQIDIEAHKKYLKEKGIDYSEMTEQQIKEANTGSYVFVEGNITITDAMEDLKFKIYM 355 (355) Q Consensus 277 ~i~~yl~~l~~~g~I~~~~-~~~v~id~e~q~~~~~~~~~d~~~~~d~~v~~~~~~d~v~~~~~i~~vdamEkiy~tv~v 355 (355) -+..=|++|.+.+-|.... .-.+.+ ..=.||.|.- .++..|-+.+.++|.++=.+|...|.+ T Consensus 307 ~~~~pLr~M~ks~ei~g~~fpgei~~----------------P~d~dI~i~w-~sk~~V~I~~~vrPy~cpk~i~~~I~L 369 (376) T protein:vir:37 307 YFAKPLRDMSKSATINGKDFPGECMP----------------PKDDAITIVW-QSKTKVTIYIKVRPYDCPKEITANIFL 369 (376) T ss_pred HHhHHHHHHHhhhhhccccccceeec----------------CCCCceEEEe-ccCceEEEEEEEeeecCcceeEEEEEE Confidence 3444566787776654320 111111 1112444432 345789999999999999999999988 No 66 >protein:vir:107720 Length: 515 # NCBI annotation: gp17 # Family: family:all:396 # MgeID: mge:1520 # MgeName: BcepB1A # Cross-refs: genbank:acc:YP_024865;genbank:gi:48697507;genbank:GeneID:2948336 Probab=96.98 E-value=0.00023 Score=40.56 Aligned_cols=331 Identities=11% Similarity=0.006 Sum_probs=157.0 Q ss_pred CCCCc---eEEEeeee-----eeeeecCCCce------eEEEEEecCCccceeEEEeehhhhhhh----hhhH--HHHHH Q lcl|NC_019422. 1 MGLPS---AIIEFQRR-----SRTVKFRSRRG------VVALILKDSTAIKKSYSIDFLTDINET----EFTK--ENYDY 60 (355) Q Consensus 1 ~g~P~---~~i~f~~~-----a~ta~~~~~rG------~v~iil~d~~~~~~~~~~~~~~d~~~~----~~~~--~n~~~ 60 (355) =|-.. .-|.|... +++.+.-+.+. -.+.+..|++. ..+++.+-+-.... .+.. 
..-.+ T Consensus 122 dG~~~~t~s~i~~S~ats~~~vAs~i~tal~~~~~~~~~~~tv~~d~~~--~~F~v~s~~tG~~~~is~~~~t~~~~~t~ 199 (515) T protein:vir:10 122 GGATTVTVSGISFSAATSLADVASELQTALRANADANLATCTVSYDPVG--ARFNFAGSPSDDTVQESISIVPQSNPAID 199 (515) T ss_pred cceEEEEeeccccccccCHHHHHHHHHhhhccccccccceeEEEEecCC--CeEEEEEeecCCceeEEEEEecCCCchhh Confidence 02211 11111111 01111111111 01223333332 12222211100000 0000 00112 Q ss_pred HHhhh-ccccceEEEEecCCCccchhHHHHHHHHHhcccceEEEE--cCC----ChHHHHHHHHHHHHHHHhcCCeEEEE Q lcl|NC_019422. 61 IRLAF-LGKPSKVIVEVINDSVDSERSLDDALKALRENKFNYLAI--PFI----SEEVDKTKIVNWIKTARREKEIYKAV 133 (355) Q Consensus 61 i~~a~-~g~~~~v~l~~g~~g~~~~~~y~~al~~le~~~fn~l~~--p~~----~d~~~~~~~~~~ik~~r~~g~~~~aV 133 (355) +..++ +....+.....|.+ .+...++|.++....-||.++ ... ..++....+.+|+.+ .++.+..- T Consensus 200 ~a~~lglt~~~~av~~~g~a----aet~~~a~~a~~~~s~nWy~f~~a~~~~~~~~~a~~~a~a~~~e~---~~~~~~~~ 272 (515) T protein:vir:10 200 VAQLLGWNSAQGASYIAASP----VVSPVDTLIASVAGNNNFGSILFTKNGGTGITLSDAEAIALQNQS---YNVAYKFQ 272 (515) T ss_pred HHHHhccccccceEEecccc----cccHHHHHHHHHhccCCeEEEEEeecCccccchhHHHHHHHHHhh---cCceEEEE Confidence 22222 11233333334433 245789999998876676543 221 123455677777753 22222211 Q ss_pred ecCC------CCcCcceeEEecCCeE-ecCCceecHHHHHHHHHHHhcCccc---ccccc--cccCCccc-ccCChhhHH Q lcl|NC_019422. 134 LPNI------SDANEKAIINFATTGI-KVGEKSYTTAEYTARLAGILAGISL---SESCT--YFILDEVT-EIEPTENPD 200 (355) Q Consensus 134 l~~~------~~~d~egIinv~n~~i-~~~~~~~~~~~~~a~vAG~~Ag~~~---~~S~T--~~~~~~~~-~~~~~~e~~ 200 (355) .+-. .......+.+...... ..-...|. ++.+.|..|+++. +-++| |+.+||+. 
++++..|.+ T Consensus 273 ~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~----~a~~~g~~asvnf~~~ng~iT~kfKq~~Gita~~lt~t~a~ 348 (515) T protein:vir:10 273 VGVDDTTYSSWQAALAAIGGVNMIYSPVALAAEYH----DMQDGIIEAATDFTQQGGATGYMYVQFNNQTPAVNDDTLSG 348 (515) T ss_pred eccCccceechhhhhhhhhhcCceEEEEeccCcch----HHHHHHHHHhcCCCccchhheeccccCCCCccccCCHHHHH Confidence 1100 0011111111110000 00111222 3345566677654 34455 45899998 678888999 Q ss_pred HHHhCCeEEEE--E--CCcEEE-EecCccccccCCCCCchhhhhhhHhhHHHHHHHHHHHHhhccc--cccCCCHHHHHH Q lcl|NC_019422. 201 EAVEEGKLILI--N--NNGIRI-ARGVNSLITLSKEDTEDLKKIKIVEAIDMIQDDILQTWNENYV--GKVTNKYDNKIL 273 (355) Q Consensus 201 ~ai~~G~lvl~--~--dg~v~I-~~~INSltt~~~~k~~~f~kirvvr~~D~i~~di~~~~~~~yi--GK~~N~~~gr~~ 273 (355) .+.++|.-++. . +..+.+ .+|+-+ ...-+|+=|-+++-+|-+.+.++..+-+-+. +|+|=+.+|..+ T Consensus 349 al~~~~~N~Y~~~~~~~~~~~~~~~G~~~------gG~~~~~WiD~~~g~~WL~~~iq~~l~~L~~s~~KIPytd~G~a~ 422 (515) T protein:vir:10 349 ILDDLNINYYGQTQVNGTNLSFYQDGVMM------GGPTDPRDSNVYANEQWLKSYAGASFMSLQLAQGKIPANIEGRGL 422 (515) T ss_pred HHHhcCCeEEEEEeccCceEEEEeCCeee------CCccchhHHHHHhhHHHHHHHHHHHHHHHHhcCCCCccChhhHHH Confidence 99999887774 2 234554 567644 1112566688888888888888776665444 699999999999 Q ss_pred HHHHH-HHHHHHHHhcccccCCCCceeE----------ecc----ccchhhhhccccccccccceeeeccCCCCEEEEEE Q lcl|NC_019422. 274 FLSAV-NNYFKELQRDEVLDNSQEAYAQ----------IDI----EAHKKYLKEKGIDYSEMTEQQIKEANTGSYVFVEG 338 (355) Q Consensus 274 ~~~~i-~~yl~~l~~~g~I~~~~~~~v~----------id~----e~q~~~~~~~~~d~~~~~d~~v~~~~~~d~v~~~~ 338 (355) +.+++ +.-+++-.+-|+|.++-..+.. .|. -..|-|+...+...... .. ......-.+.=| T Consensus 423 i~a~v~q~vl~~av~nG~I~~Gv~ls~~Q~~~i~~~~g~d~~~~~~~~~Gyy~~~~~~~~~~-~~---~r~~~~~~~~~~ 498 (515) T protein:vir:10 423 LLGKMTKDIIPAAKLNGTFSIGKTLTVDQQLFVTELTGDDTAWQKVQNLGYWYDVQISSFVD-TG---GTTKYQAVYSLV 498 (515) T ss_pred HHHHHHHHHHHHHHhCCeeecCcccchhHHHHHHhhhcCcccccchhhcceeEecCcCCCCC-cc---cccccCceeEEE Confidence 99988 5799999999999998531110 111 12233333333221110 00 111111221222 Q ss_pred EEEEEeeeeEEEEEEeC Q lcl|NC_019422. 
339 NITITDAMEDLKFKIYM 355 (355) Q Consensus 339 ~i~~vdamEkiy~tv~v 355 (355) ..+ =++|.+|.|+-++ T Consensus 499 y~~-g~~i~~i~~~~~~ 514 (515) T protein:vir:10 499 YSK-DDLIRKVVGTHTL 514 (515) T ss_pred EEc-CceEEEEEeeeec Confidence 222 5899999999988 No 67 >protein:vir:3165 Length: 426 # NCBI annotation: capsid protein CP67 # Family: family:all:28419 # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665936;genbank:gi:22091122;genbank:GeneID:951267 Probab=92.87 E-value=0.0097 Score=31.63 Aligned_cols=323 Identities=11% Similarity=0.070 Sum_probs=150.1 Q ss_pred CCc--eEEEeeeeeeeeecCCCceeEEEEE-ecCCc----cceeEEEeehhhhhhhhhhHHHHHHH--Hhhhccc----- Q lcl|NC_019422. 3 LPS--AIIEFQRRSRTVKFRSRRGVVALIL-KDSTA----IKKSYSIDFLTDINETEFTKENYDYI--RLAFLGK----- 68 (355) Q Consensus 3 ~P~--~~i~f~~~a~ta~~~~~rG~v~iil-~d~~~----~~~~~~~~~~~d~~~~~~~~~n~~~i--~~a~~g~----- 68 (355) ||+ ++|.. ++..+|+.+-.-|.+.+|- ++..+ ..+.-.|.+++++. .+|....-.|. ...|--+ T Consensus 1 m~~~iVnV~I-s~~t~A~~~~~Fg~~liigs~~~~~p~~~f~~~~~Yss~~~V~-~Dfg~~s~~Y~AA~~~f~Q~~~~~r 78 (426) T protein:vir:31 1 MPKQIVEIEL-TAEIADRPQETFTDAAIVGTAEEEPPDAEFGEVNQYSTSTSVG-DDYGEDSDVYTASEAIEEMGAEQWR 78 (426) T ss_pred CCcceEEEEe-ecccccccccccceeeeeeeccccccccccchhhhhhhHHHHH-hcCCCChHHHHHHHHHHhCCceeEE Confidence 775 33333 3334444444446555553 22222 12333477776653 46654433332 2223111 Q ss_pred -------------------cceEEEEecCCCccchhHHHHHHHHH-hcccceEEEEcCCC-------------------- Q lcl|NC_019422. 69 -------------------PSKVIVEVINDSVDSERSLDDALKAL-RENKFNYLAIPFIS-------------------- 108 (355) Q Consensus 69 -------------------~~~v~l~~g~~g~~~~~~y~~al~~l-e~~~fn~l~~p~~~-------------------- 108 (355) -++++.++..+...+.++....++.- +...++...+.... 
T Consensus 79 ~~v~~at~~~~~~~t~~~tv~g~~~s~~a~~~~~a~~i~~~~~~~~~~~~~~~~~~~~t~~g~~t~~~~~~~~~~s~~dw 158 (426) T protein:vir:31 79 VMVLEATEVTEEELSDGDTIDKVPILGNHEVESPDGDIEFTTDDDPDVEDFDAEIVINSATGDVATSEDSIELTYFHADW 158 (426) T ss_pred eeccccceeeeccCCcceeecceeeeecccCcchHHHHHHhhccccccccceeeeEeccccceeeccccceeeeeccCcc Confidence 12233333333332233333322111 11122111110000 Q ss_pred ----------hH----------HHHHHHHHHHHHHHhcCCeEEEEecCCCCcCccee---EEecCCeE-ecC-----Cc- Q lcl|NC_019422. 109 ----------EE----------VDKTKIVNWIKTARREKEIYKAVLPNISDANEKAI---INFATTGI-KVG-----EK- 158 (355) Q Consensus 109 ----------d~----------~~~~~~~~~ik~~r~~g~~~~aVl~~~~~~d~egI---inv~n~~i-~~~-----~~- 158 (355) +. ..+..+..|..-. +..++..|..........++ +-+....- .+. +. T Consensus 159 ~~~~~~~s~~~~~~ia~~~~~~~~~~~~~~~~~wa--~~~~i~~va~~~e~~~~~~~~~~~a~~~~~~~y~p~~~~~~~~ 236 (426) T protein:vir:31 159 SQLDEFPSDVNNFAVADRRFDLKGVGVLDETHSWA--SDEDMGMIANGVNVDDYDSVDEAMDVAHEVAGYVPSGDLMMIV 236 (426) T ss_pred hhhhcccccchhhhhhccccchhhhhhhHhhhhhh--hhcceeeeeeccchhhhcchhhhhhhhhcccccccchhheeeh Confidence 00 0011111111111 22233333332211111111 10100000 000 00 Q ss_pred eecHHHHHHHHHHHhcCcccccccccccCCcccccCC----------hhhHHHHHhCCeEE-EE-ECCcEEEEecCcccc Q lcl|NC_019422. 159 SYTTAEYTARLAGILAGISLSESCTYFILDEVTEIEP----------TENPDEAVEEGKLI-LI-NNNGIRIARGVNSLI 226 (355) Q Consensus 159 ~~~~~~~~a~vAG~~Ag~~~~~S~T~~~~~~~~~~~~----------~~e~~~ai~~G~lv-l~-~dg~v~I~~~INSlt 226 (355) .-......+|++|.++....-.+++...+++.+.... -+.-++|-.++..- |+ .+|...|-+++.| T Consensus 237 ~~~~~~~~~~~~~~~aa~~~~~~~~~~~~~~~~~~~~~~~~~gv~~t~~~~~~A~~~~~~n~~~~~~~~~~i~~~~~~-- 314 (426) T protein:vir:31 237 DASDDDLAAYQLGKFAVSEPWYNPLWNELPAGETVSKNVGDPEEQGTFEGGDEAEGEGPVNVLIDVSDANRVSNAVTT-- 314 (426) T ss_pred hccccchhhHHhhhhhhhccccchhhhhccccccceeeccccccccccchhhhhhhcCCceEEEEecCceeeecceee-- Confidence 0111234789999999988777777666665543211 11112222234333 33 3567888887633 Q ss_pred ccCCCCCchhhhhhhHhhHHHHHHHHHHHHhhccc--cccCCCHHHHHHHHHHHHHHHHHHHhc-ccccCCCCceeEecc Q lcl|NC_019422. 
227 TLSKEDTEDLKKIKIVEAIDMIQDDILQTWNENYV--GKVTNKYDNKILFLSAVNNYFKELQRD-EVLDNSQEAYAQIDI 303 (355) Q Consensus 227 t~~~~k~~~f~kirvvr~~D~i~~di~~~~~~~yi--GK~~N~~~gr~~~~~~i~~yl~~l~~~-g~I~~~~~~~v~id~ 303 (355) ..+.-+-.-|-++|.+|-+.++++..+-+-.+ .|+|=+..|..++.+.|.+-|++..+. |.+-+.+. +. T Consensus 315 ---~G~~~~G~~iD~~~g~dwl~~~iq~~l~~ll~~~~KIpyt~~Gi~~I~~~i~~~L~~~v~~~g~~~~~y~----v~- 386 (426) T protein:vir:31 315 ---AGADSDTSFFDIRRTKVYTAEMLELDLESLQVSDDDVPFTEDGQAMIEDAIKGTMSGLTGSVGQPLAEYE----VD- 386 (426) T ss_pred ---cccccchhhhhhHHHHHHHHHHHHHHHHHHhhcCCCCccchhHHHHHHHHHHHHHHHHhcCCCcccccee----ec- Confidence 23333333499999999999999988877666 599999999999999999999887764 33333221 11 Q ss_pred ccchhhhhccccccccccceeeeccCCCCEEE--EEEEEEEEeeeeEEEEEEeC Q lcl|NC_019422. 304 EAHKKYLKEKGIDYSEMTEQQIKEANTGSYVF--VEGNITITDAMEDLKFKIYM 355 (355) Q Consensus 304 e~q~~~~~~~~~d~~~~~d~~v~~~~~~d~v~--~~~~i~~vdamEkiy~tv~v 355 (355) ...++++.- +-..-++ +.+.++.-.|+-.+.+.++| T Consensus 387 -------------~P~~~~~~~---dra~R~~~~i~~~~~laGAIh~v~I~g~v 424 (426) T protein:vir:31 387 -------------VPEWDDDDV---DRVNRNWGGIDLDARLAQRAHTFSLGLNV 424 (426) T ss_pred -------------CCCccccch---hhhhhccCCceEEEEEeCcEEEEEEEEEE Confidence 112223221 1112223 67788888888888777777 No 68 >protein:vir:101326 Length: 529 # NCBI annotation: gp22 # Family: family:all:9453 # MgeID: mge:1512 # MgeName: P1 # Cross-refs: genbank:acc:YP_006525;genbank:gi:46401679;genbank:GeneID:2777386 Probab=41.97 E-value=0.9 Score=20.82 Aligned_cols=321 Identities=11% Similarity=0.100 Sum_probs=151.5 Q ss_pred CCCCceEEEeeeeeeeeecCCCceeEEEEEe--cCCc---cceeEEEeehhhhhhhhhhHHHHHHHHhhhcc-------- Q lcl|NC_019422. 1 MGLPSAIIEFQRRSRTVKFRSRRGVVALILK--DSTA---IKKSYSIDFLTDINETEFTKENYDYIRLAFLG-------- 67 (355) Q Consensus 1 ~g~P~~~i~f~~~a~ta~~~~~rG~v~iil~--d~~~---~~~~~~~~~~~d~~~~~~~~~n~~~i~~a~~g-------- 67 (355) -=+|+...+++...+++.... | ..+.++ .+.. ..++++.+=..+.++. +.. .-|+-.++.. 
T Consensus 143 ~~s~~~~l~i~~~~ads~g~e-~--~~l~~~~~~~~g~~~~let~~~sl~~~a~dd-~G~--~~yl~svle~~s~~l~ai 216 (529) T protein:vir:10 143 CISPTRELTIETATADSAGNE-R--FLLKLTQTTSLGVVTTLETHTVSLAEEAKDD-MGR--LCYLPTALEARSKYLRAV 216 (529) T ss_pred ccCCceEEEEEeeccccCCCc-c--ceeeEEEEeecCCceEEEEEEeeeeechhhh-cCC--ccchhHHHhhccCceeee Confidence 226777777776555543221 1 222221 1111 1223332211111111 000 0111111100 Q ss_pred ------------ccceEEEEecCCCc---cchhHHHHHHHHHhccc--ceEEEEcCCChHHHHHHHHHHH-HHHHhc--- Q lcl|NC_019422. 68 ------------KPSKVIVEVINDSV---DSERSLDDALKALRENK--FNYLAIPFISEEVDKTKIVNWI-KTARRE--- 126 (355) Q Consensus 68 ------------~~~~v~l~~g~~g~---~~~~~y~~al~~le~~~--fn~l~~p~~~d~~~~~~~~~~i-k~~r~~--- 126 (355) +-....+++|+||. ..+++|..|+++|++-. |.++--.+.-+.+.-+.+...+ +++|+= T Consensus 217 ~~~e~~~t~~~~t~~d~~f~~GtdG~~~~i~s~~y~~A~~~L~n~p~d~~~il~~g~y~~a~I~~L~~ic~~~~~d~f~D 296 (529) T protein:vir:10 217 VNEELISTAKVTNKKSLAFTGGTNGDQSKISTAAYLRAVKVLNNAPYMYTAVLGLGCYDNAAITALGKICADRLIDGFFD 296 (529) T ss_pred eeeccccccchhhhhhhhccCCccccccccchHHHHHHHHHhcCCcceeeeeeccCCccHHHHHHHHHHHhhhhhcEEEc Confidence 01123466777764 34678999999998754 4433323333566666777777 444421 Q ss_pred --C-CeEEEEec---CCCCcCccee----EE---ecCCeEecCCce-ecHHHHHHHHHHHhcCcccccccc--cccCC-- Q lcl|NC_019422. 127 --K-EIYKAVLP---NISDANEKAI----IN---FATTGIKVGEKS-YTTAEYTARLAGILAGISLSESCT--YFILD-- 188 (355) Q Consensus 127 --g-~~~~aVl~---~~~~~d~egI----in---v~n~~i~~~~~~-~~~~~~~a~vAG~~Ag~~~~~S~T--~~~~~-- 188 (355) | ....++.. +....+.+++ .. ..++.. .+++. +...- .|++|+.=+ ...|.-+- |++.. 
T Consensus 297 V~~~LT~~aA~~~~e~~gl~~~~~~~~s~y~~P~~~~D~~-tg~k~~~GlsG-~A~~akarg-v~~na~v~g~hY~pAGe 373 (529) T protein:vir:10 297 VKPTLTYAEALPAVEDTGLLGTDYVSCSVYHYPFSCKDKW-TQSRVVFGLSG-VAYAAKARG-VKKNSDVGGWHYSPAGE 373 (529) T ss_pred CCCCcCHHHHHHHHHhcCccccCceeeEEEEcceeecccc-ccCceeeCCCc-ceeeccccc-eeecccccccccccCCC Confidence 0 11111110 0000011212 11 122211 11111 11111 133332211 01111111 22222 Q ss_pred --------cccccCChhhHHH-HHhCCeE--EEE-ECCcEEEEecCccccccCCCCCchhhhhhhHhhHHHHHHHHHHHH Q lcl|NC_019422. 189 --------EVTEIEPTENPDE-AVEEGKL--ILI-NNNGIRIARGVNSLITLSKEDTEDLKKIKIVEAIDMIQDDILQTW 256 (355) Q Consensus 189 --------~~~~~~~~~e~~~-ai~~G~l--vl~-~dg~v~I~~~INSltt~~~~k~~~f~kirvvr~~D~i~~di~~~~ 256 (355) ++.+.++.++++. ++-.+.+ +.. ..|.-.|... ||... |+.-||-+.+.+.|..|.+-.-+.. T Consensus 374 ~r~~inr~~I~~ly~~d~~e~~~lv~~riNPV~~~~~g~~~idDs---Lt~~~--knny~R~~hv~~lmn~I~~~~~k~a 448 (529) T protein:vir:10 374 ERAVIARASIQPLYPEDTPDEEAMVKGRLNKVSVGTSGQMIIDDA---LTCCT--QDNYLHFQHVPSLMNAISRFFVQLA 448 (529) T ss_pred ccceeecccceeccCCCccCHHHHHhhccCeeeeeccCcceeeee---eceee--eCCchhhhhHHHHHHHHHHHHHHHH Confidence 3445555543322 3333332 222 2345555544 44444 5889999999999999988887764 Q ss_pred hhccccccCCCHHHHHHHHHHHHHHHHHHHhcccccCCCCceeEeccccchhhhhccccccccccceeeeccCCCCEEEE Q lcl|NC_019422. 257 NENYVGKVTNKYDNKILFLSAVNNYFKELQRDEVLDNSQEAYAQIDIEAHKKYLKEKGIDYSEMTEQQIKEANTGSYVFV 336 (355) Q Consensus 257 ~~~yiGK~~N~~~gr~~~~~~i~~yl~~l~~~g~I~~~~~~~v~id~e~q~~~~~~~~~d~~~~~d~~v~~~~~~d~v~~ 336 (355) .. .+ .-|+...=|. +..-+...|+.+...|+|....+ +|.++..-| -++|.+.+. |..-+ T Consensus 449 ~~-~~-~~Pd~it~~g-l~~~l~~~L~r~~asgalv~prd----p~~~G~epy------------~~~V~q~d~-D~~~v 508 (529) T protein:vir:10 449 RQ-MK-HSPDGITAAG-LTKGMTKLLDRFVASGALVAPRD----PDADGTEPY------------VLKVTQAEF-DKWEV 508 (529) T ss_pred HH-Hh-hCCChHHHHH-HHHhHHHHHHHHHhcCceecccC----ccCCCCCce------------EEEEeeccc-CeEEE Confidence 32 32 3367766665 77788899999999999976432 233333333 234444444 66777 Q ss_pred EEEEEEEeeeeEEEEEEeC Q lcl|NC_019422. 
337 EGNITITDAMEDLKFKIYM 355 (355) Q Consensus 337 ~~~i~~vdamEkiy~tv~v 355 (355) .|++-|.-.--.|+..=++ T Consensus 509 ~~~~~ptGv~Rri~~~p~l 527 (529) T protein:vir:10 509 VWACCPTGVARRIQGVPLL 527 (529) T ss_pred EEEeecCCceeeEEeeeee Confidence 7888887777777777666 Done!