Query lcl|Aclame:protein:vir:4517|NCBI_annot:tail sheath protein|genbank:acc:NP_599043;genbank:gi:19549001;genbank:GeneID:935236 Match_columns 498 No_of_seqs 143 out of 220 Neff 6.1 Searched_HMMs 1612 Date Sat Nov 30 04:17:12 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_11 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_11_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:4517 Length: 498 # 100.0 4E-235 2E-238 1305.9 52.6 498 1-498 1-498 (498) 2 protein:vir:4463 Length: 498 # 100.0 4E-235 2E-238 1305.6 51.4 498 1-498 1-498 (498) 3 protein:vir:489 Length: 498 # 100.0 3E-232 2E-235 1290.2 51.1 498 1-498 1-498 (498) 4 protein:vir:1996 Length: 495 # 100.0 3E-223 2E-226 1240.9 54.4 483 1-491 1-495 (495) 5 protein:vir:100829 Length: 607 100.0 4.5E-49 2.8E-52 285.6 44.6 462 1-498 1-603 (607) 6 protein:vir:95741 Length: 587 100.0 7.5E-50 4.6E-53 289.9 39.7 446 1-496 1-587 (587) 7 protein:vir:99306 Length: 587 100.0 2.4E-48 1.5E-51 281.7 45.6 448 1-496 1-587 (587) 8 protein:vir:96586 Length: 587 100.0 1.1E-48 6.8E-52 283.5 43.3 460 1-496 1-587 (587) 9 protein:vir:63742 Length: 562 100.0 1.8E-48 1.1E-51 282.4 43.4 460 1-496 1-562 (562) 10 protein:vir:80488 Length: 562 100.0 1.7E-46 1.1E-49 271.5 42.9 448 1-496 1-562 (562) 11 protein:vir:80779 Length: 569 100.0 1.2E-44 7.3E-48 261.4 43.8 455 1-496 1-569 (569) 12 protein:vir:102957 Length: 437 100.0 2.2E-43 1.4E-46 254.5 38.9 421 1-490 1-437 (437) 13 protein:vir:107310 Length: 581 100.0 1E-39 6.2E-43 234.4 32.6 423 1-498 1-569 (581) 14 protein:vir:7653 Length: 581 # 100.0 2.7E-39 1.7E-42 232.0 35.0 423 1-498 1-569 (581) 15 protein:vir:105470 Length: 451 100.0 3.7E-36 2.3E-39 214.9 37.9 427 1-490 1-451 (451) 16 protein:vir:78986 Length: 436 100.0 6.7E-34 4.2E-37 202.4 36.8 415 1-490 1-436 (436) 17 protein:vir:102819 Length: 648 100.0 4.6E-31 2.9E-34 186.9 35.5 459 1-492 1-648 (648) 18 protein:vir:102359 Length: 356 99.9 2.8E-24 1.7E-27 149.7 23.7 307 149-489 1-356 (356) 19 protein:vir:95263 Length: 450 99.8 3.8E-19 2.4E-22 121.6 31.5 423 12-494 1-450 (450) 20 protein:vir:5260 Length: 502 # 99.7 4.9E-15 3.1E-18 99.0 37.6 440 1-495 1-502 (502) 21 protein:vir:78611 Length: 501 99.7 4.3E-15 2.7E-18 99.3 34.9 444 3-498 1-489 (501) 22 protein:vir:3636 Length: 501 # 99.6 8.9E-14 5.5E-17 92.1 38.7 443 3-498 1-489 (501) 23 protein:vir:106730 Length: 501 99.6 2.2E-13 1.4E-16 90.0 36.0 446 3-498 1-489 (501) 24 protein:vir:79092 Length: 477 99.6 3.4E-15 2.1E-18 99.9 25.6 429 8-498 1-474 (477) 25 protein:vir:98824 Length: 774 99.6 6.3E-15 3.9E-18 98.4 26.3 448 1-498 271-774 (774) 26 protein:vir:101576 Length: 501 99.6 2.7E-13 1.7E-16 89.5 34.0 443 3-498 1-489 (501) 27 protein:vir:107865 Length: 477 99.6 2.7E-14 1.7E-17 95.0 28.1 432 8-498 1-470 (477) 28 protein:vir:96104 Length: 504 99.5 6.7E-13 4.2E-16 87.3 34.3 446 2-498 1-493 (504) 29 protein:vir:101187 Length: 663 99.5 3.5E-13 2.2E-16 88.9 30.3 443 10-498 1-651 (663) 30 protein:vir:99586 Length: 507 99.5 1.5E-12 9.1E-16 85.5 33.5 445 2-498 1-496 (507) 31 protein:vir:98263 Length: 664 99.5 9.2E-13 5.7E-16 86.6 30.2 450 1-498 1-653 (664) 32 protein:vir:94073 Length: 494 99.5 4.4E-12 2.7E-15 82.9 33.8 435 1-491 1-494 (494) 33 protein:vir:103456 Length: 659 99.5 1.4E-12 8.6E-16 85.6 30.3 443 10-498 1-649 (659) 34 protein:vir:79798 Length: 717 99.5 6.3E-12 3.9E-15 82.0 33.5 448 1-495 1-717 (717) 35 protein:vir:108052 Length: 660 99.5 2.8E-12 1.7E-15 83.9 31.4 442 10-498 1-650 (660) 36 protein:vir:6894 Length: 660 # 99.4 2.8E-12 1.7E-15 83.9 29.9 452 10-498 1-649 (660) 37 protein:vir:7206 Length: 659 # 99.4 1E-11 6.2E-15 80.9 32.8 443 10-498 1-649 (659) 38 protein:vir:101804 Length: 663 99.4 4.2E-12 2.6E-15 83.0 30.1 440 10-498 1-651 (663) 39 protein:vir:80984 Length: 666 99.4 1.2E-11 7.4E-15 80.5 32.0 452 10-498 1-654 (666) 40 protein:vir:6079 Length: 396 # 99.4 6.7E-13 4.2E-16 87.3 24.0 360 1-498 1-386 (396) 41 protein:vir:98553 Length: 395 99.4 5.5E-13 3.4E-16 87.8 23.4 355 8-498 1-386 (395) 42 protein:vir:2035 Length: 396 # 99.4 8.4E-13 5.2E-16 86.8 23.6 360 1-498 1-386 (396) 43 protein:vir:79141 Length: 391 99.4 4.9E-13 3E-16 88.1 21.9 353 1-498 1-381 (391) 44 protein:vir:5663 Length: 671 # 99.4 5.1E-11 3.2E-14 77.0 32.8 448 10-498 1-664 (671) 45 protein:vir:6594 Length: 666 # 99.4 3.4E-11 2.1E-14 78.0 31.7 452 10-498 1-654 (666) 46 protein:vir:1172 Length: 391 # 99.4 4.8E-13 2.9E-16 88.1 21.1 361 1-498 1-382 (391) 47 protein:vir:5711 Length: 396 # 99.4 1.8E-12 1.1E-15 85.0 23.8 360 1-498 1-386 (396) 48 protein:vir:100539 Length: 663 99.3 3.6E-11 2.2E-14 77.8 29.4 447 10-498 1-651 (663) 49 protein:vir:79181 Length: 390 99.3 4.3E-12 2.7E-15 82.9 24.2 355 1-498 1-381 (390) 50 protein:vir:107720 Length: 515 99.3 1.4E-10 8.8E-14 74.6 33.5 450 1-498 1-504 (515) 51 protein:vir:106427 Length: 679 99.3 2.5E-10 1.6E-13 73.2 35.4 444 10-498 1-668 (679) 52 protein:vir:1845 Length: 392 # 99.3 1.3E-11 8E-15 80.3 23.9 357 1-498 1-383 (392) 53 protein:vir:104858 Length: 729 99.3 2.5E-10 1.6E-13 73.2 30.7 451 8-498 1-720 (729) 54 protein:vir:104477 Length: 749 99.3 2.8E-10 1.7E-13 73.0 32.5 449 8-498 1-742 (749) 55 protein:vir:100323 Length: 393 99.2 1.1E-10 6.6E-14 75.3 25.0 359 1-498 1-383 (393) 56 protein:vir:103993 Length: 390 99.1 9.9E-11 6.2E-14 75.4 22.6 351 1-498 1-381 (390) 57 protein:vir:78206 Length: 390 99.1 9.9E-11 6.2E-14 75.4 22.6 351 1-498 1-381 (390) 58 protein:vir:106984 Length: 743 99.0 4E-09 2.5E-12 66.6 32.5 442 8-498 1-735 (743) 59 protein:vir:96740 Length: 388 99.0 3.7E-09 2.3E-12 66.8 25.3 347 1-498 1-380 (388) 60 protein:vir:10336 Length: 386 98.9 5.6E-09 3.5E-12 65.8 24.4 358 1-498 1-382 (386) 61 protein:vir:5833 Length: 742 # 98.3 1.4E-06 8.7E-10 52.7 22.0 424 1-498 254-739 (742) 62 protein:vir:103168 Length: 641 97.2 3.9E-05 2.4E-08 44.8 12.9 398 1-498 1-458 (641) 63 protein:vir:3165 Length: 426 # 97.1 0.00015 9.6E-08 41.5 24.0 407 8-491 1-426 (426) 64 protein:vir:3788 Length: 376 # 96.9 0.00026 1.6E-07 40.3 31.2 346 13-495 1-376 (376) 65 protein:vir:78782 Length: 370 96.5 0.00059 3.6E-07 38.3 30.5 345 13-498 1-370 (370) 66 protein:vir:80052 Length: 331 96.2 0.00087 5.4E-07 37.4 22.4 300 138-491 1-331 (331) 67 protein:vir:276 Length: 369 # 95.1 0.0028 1.7E-06 34.6 31.6 343 1-498 1-369 (369) 68 protein:vir:3751 Length: 376 # 94.9 0.0032 2E-06 34.3 29.7 335 13-495 1-376 (376) No 1 >protein:vir:4517 Length: 498 # NCBI annotation: tail sheath protein # Family: family:all:369 # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599043;genbank:gi:19549001;genbank:GeneID:935236 Probab=100.00 E-value=3.6e-235 Score=1305.88 Aligned_cols=498 Identities=100% Similarity=1.388 Sum_probs=496.0 Q ss_pred CccchhhcCcccccCeEEEEEecCCCCCCCCCccEEEEEecCCCCccccceeEEecChHHHHHhhCcCcHHHHHHHHHHH Q lcl|Aclame:pro 1 MTISFNTIPSNTLVPLFYAEMDNQAANTAQDSGASLLIGHANNGAEIVANSLVLMPSADYARQICGAGSQLARMVEAYRQ 80 (498) Q Consensus 1 M~i~f~~Ip~~~rvPg~y~E~dns~a~~~~~~~~vLliGq~~~~g~~~~~~~~~v~s~~~A~~~fG~GS~l~~M~~a~~~ 80 (498) |+|+||+||+||||||+|+|||||+|++|.++||||||||++++|++++++|++|+|++||+++||+|||+|+|+++||| T Consensus 1 M~IsF~~IP~~iRvP~~y~E~dns~A~~~~~~q~vLiiGq~la~gs~~~~~~v~v~s~~~a~~lfG~GSml~~M~~a~~~ 80 (498) T protein:vir:45 1 MTISFNTIPSNTLVPLFYAEMDNQAANTAQDSGASLLIGHANNGAEIVANSLVLMPSADYARQICGAGSQLARMVEAYRQ 80 (498) T ss_pred CCCchhhcCcccccCeEEEEEeCCCCCCCCCCcceEEEEecCCccccccceeEEecCHHHHHHhcCcCcHHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hCCCceEEEEEecCCccceeEEEEEEeeeccCCcEEEEEEccEEEEEEeecCCCHHHHHHHHHHHHhcCCCceEEEeecc Q lcl|Aclame:pro 81 TDPFGELYVIAVPEATGAAATVTLTVTGEATESGTVNVYVGRTRVQAPVTNGDNVTTIASSIQDAINAVPTLPFTASSSA 160 (498) Q Consensus 81 ~n~~~~l~~i~l~d~ag~aatg~ititgtat~~G~l~l~I~g~~v~v~V~~gdtaa~iA~~l~~aIn~~~~lpVtA~~~~ 160 (498) +||++|||+|+++|++|++|+|+||++|+||++|+++|||||++|+++|.+|||+++||++|+++||+.++|||||++++ T Consensus 81 ~n~~~~l~~i~~~d~aG~aA~g~it~tg~at~~G~l~l~Igg~~v~v~V~~gdTaa~vA~al~aaina~~~lPVTA~~~~ 160 (498) T protein:vir:45 81 TDPFGELYVIAVPEATGAAATVTLTVTGEATESGTVNVYVGRTRVQAPVTNGDNVTTIASSIQDAINAVPTLPFTASSSA 160 (498) T ss_pred hCCcceEEEEeeCCcccceeEEEEEeecccCCCcEEEEEECCEEEEEEecCCCCHHHHHHHHHHHHhCCCCCceEEEecC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ceEEEeeccCcccccceeEEEEecccCcccccccceeeeecccCCCcCcchhhhHHHhhccCcceEEEecCCChHHHHHH Q lcl|Aclame:pro 161 GVVTLTARHKGLCGNEIPVSLNYYGFGGGEVLPAGVQIAVATGTAGTGAPVLTGAVAAMADEPFDYIGLPFNDTASVNTL 240 (498) Q Consensus 161 ~~VtlTAk~kG~~gN~i~l~~~~~~~~~ge~~p~Glt~tit~~agGag~pD~~~alaalg~~~~~~I~~p~tD~a~l~al 240 (498) ++|||||||||++||+|+|++|||+..+||.+|+||++++++|+||+||||++++|++||++|||+|+|||+|+++|++| T Consensus 161 ~~VtlTAr~kG~~GN~I~l~~~~~~~~~ge~~p~Glt~~itamagGag~PD~a~alaal~~~~~~~I~~p~~D~asL~al 240 (498) T protein:vir:45 161 GVVTLTARHKGLCGNEIPVSLNYYGFGGGEVLPAGVQIAVATGTAGTGAPVLTGAVAAMADEPFDYIGLPFNDTASVNTL 240 (498) T ss_pred ceEEEEeeccCccccceeEEEeeccccccccccceeeEEEEccCCCccCchhHHHHHHhccCCccEEEEeeCCHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHhhhhhhhhhhhheeeEEEEeccCCHHHHHhhhhccCcceEEEEecCCCCCCcHHHHHHHHHHHhhhhhccCccccc Q lcl|Aclame:pro 241 VTEMNDTSGRWSYARQLYGHVYTAKTGTLSELVNAGDQFNQQHITLAGYEKETQTPADELAASRTARAAVFIRNDPARPT 320 (498) Q Consensus 241 ~~~l~~~s~r~~~~~q~~g~~~~~~~gt~~~~~t~g~~~N~~~~t~~~~~~~~~~p~~~~AAa~~a~~a~~l~~DPArpl 320 (498) ++||+++++||+|++|++||+|++++||++++++||..|||+|++|+|+++++++|+|+|||++++++|.++++|||||| T Consensus 241 ~~~L~~~sgRw~~~~q~~g~~~~a~~gT~~~l~t~g~~~N~~~it~~~~~~~~~sp~~~~AAa~aa~~A~~l~~DPArPL 320 (498) T protein:vir:45 241 VTEMNDTSGRWSYARQLYGHVYTAKTGTLSELVNAGDQFNQQHITLAGYEKETQTPADELAASRTARAAVFIRNDPARPT 320 (498) T ss_pred HHHHhhhhhhhhHHhhcCeEEEEeccCCHHHHHHhhhccCCceEEEEecCCCCCChHHHHHHHHHHHHHHHhhccccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cceEEeccccCCCccccChHHHHHHHhCCeeEEEEcCCeEEEEeeeeeeeecCCCCCCchhhhhhhHHHHHHHHHHHHHH Q lcl|Aclame:pro 321 QTGELVGMLPAPKGKRFTMTEQQTLLSHGVATAYVESGVLRIQRDVTTYRKNAYGVADNSYLDSETLHTSAYVLRKLKSV 400 (498) Q Consensus 321 ~tl~L~Gl~~p~~~~r~~~~er~~lL~~Gist~~v~~G~v~IeR~ITTY~~n~~G~~D~s~ldi~t~~tl~yv~~~~r~~ 400 (498) |+|+|+|++||+.++||+++|||+||++||+|++|++|+|+|||.|||||+|++|.+|+|||||||++|++|+||.+|++ T Consensus 321 ~tl~L~Gi~~p~~~~r~~~~ern~LL~~Gist~~V~~G~V~I~R~ITTY~~n~~G~~D~syLDi~T~~tl~yvr~~~r~~ 400 (498) T protein:vir:45 321 QTGELVGMLPAPKGKRFTMTEQQTLLSHGVATAYVESGVLRIQRDVTTYRKNAYGVADNSYLDSETLHTSAYVLRKLKSV 400 (498) T ss_pred CceeecceecCCchhcCChHHHHHHHhCCcceEEEcCCeEEEEeeeeeeeecCCCCcchhhhhhhhHHHHHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HhhhcCCceeccCCCCcCCCcccccHHHHHHHHHHHHHHHhhcccccchhhhcCeEEEEEcCCCCeEEEEEeeeEEecCe Q lcl|Aclame:pro 401 ITSKYGRHKLASDGTRFGPGQAIVTPAVIKGELLATYRQLERAGIVENYELFKQYLVVERDASVPNRLNTLFPPDYVNQL 480 (498) Q Consensus 401 ~~~~~~r~kla~dg~~~~~g~~ivTp~~ikaeli~~~~~le~~given~~~~~~~lvVerd~~d~nRvn~~~p~~~vn~l 480 (498) |++|||||||++||+|++|||+||||++||+||+++|++||++|||||+|.||++|+||||++||||||+++|+|+|||| T Consensus 401 i~~kfpR~KLa~dg~~~~~gq~IvTp~~ir~ell~~y~~le~~givEn~~~~~~~LiVerd~~dpnRln~~~p~d~vn~L 480 (498) T protein:vir:45 401 ITSKYGRHKLASDGTRFGPGQAIVTPAVIKGELLATYRQLERAGIVENYELFKQYLVVERDASVPNRLNTLFPPDYVNQL 480 (498) T ss_pred hhhhcCCeeecccCcccCCCCcccchHHHHHHHHHHHHhhhhhccccChhhhcceeEEEECCCCCcEEEEEecccccCch Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEeeeeeeEEEecccCC Q lcl|Aclame:pro 481 RVFAVVNQFRLQYSEESA 498 (498) Q Consensus 481 ~v~A~~~~f~lq~~~~~~ 498 (498) ||||+++||||||++++| T Consensus 481 ~V~A~~~~f~lq~~~~~~ 498 (498) T protein:vir:45 481 RVFAVVNQFRLQYSEESA 498 (498) T ss_pred hhhhhhhhhheehhhcCC Confidence 999999999999999999 No 2 >protein:vir:4463 Length: 498 # NCBI annotation: Tail Sheath protein # Family: family:all:369 # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700386;genbank:gi:23505458;genbank:GeneID:955665 Probab=100.00 E-value=4e-235 Score=1305.65 Aligned_cols=498 Identities=85% Similarity=1.243 Sum_probs=496.0 Q ss_pred CccchhhcCcccccCeEEEEEecCCCCCCCCCccEEEEEecCCCCccccceeEEecChHHHHHhhCcCcHHHHHHHHHHH Q lcl|Aclame:pro 1 MTISFNTIPSNTLVPLFYAEMDNQAANTAQDSGASLLIGHANNGAEIVANSLVLMPSADYARQICGAGSQLARMVEAYRQ 80 (498) Q Consensus 1 M~i~f~~Ip~~~rvPg~y~E~dns~a~~~~~~~~vLliGq~~~~g~~~~~~~~~v~s~~~A~~~fG~GS~l~~M~~a~~~ 80 (498) |+|+||+||+||||||+|+|||||+||+|.++||||||||++++|++++++|++|+|.+||+++||+|||+|+|+++||| T Consensus 1 M~IsF~~IP~~iRvP~~y~E~dns~A~~~~~~q~vLiiGq~la~gs~~~~~~v~v~s~~~a~~~fG~GSml~~M~~a~~~ 80 (498) T protein:vir:44 1 MAISFNSIPSDTRVPLFYAEMDNSAANTARDSGASLLIGHASNDASIAVNSLVLVSSVDYARQICGAGSQLARMVGAYRK 80 (498) T ss_pred CCCchhhcCcccccCeEEEEEeCCCCCCCcCCcceEEEEecCcccccccceeEeecCHHHHHHhcCcccHHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hCCCceEEEEEecCCccceeEEEEEEeeeccCCcEEEEEEccEEEEEEeecCCCHHHHHHHHHHHHhcCCCceEEEeecc Q lcl|Aclame:pro 81 TDPFGELYVIAVPEATGAAATVTLTVTGEATESGTVNVYVGRTRVQAPVTNGDNVTTIASSIQDAINAVPTLPFTASSSA 160 (498) Q Consensus 81 ~n~~~~l~~i~l~d~ag~aatg~ititgtat~~G~l~l~I~g~~v~v~V~~gdtaa~iA~~l~~aIn~~~~lpVtA~~~~ 160 (498) +||+++||+|+++|++|++|+|+||++|++|++|++.|||||++|+++|.+|||+++||++|+++||+.++|||||++++ T Consensus 81 ~n~~~~l~~i~~~D~aG~aAtg~it~tg~at~~G~l~l~Igg~~v~v~V~~gdTaa~vA~al~aaina~~~lPVTA~~~~ 160 (498) T protein:vir:44 81 TDPFGELYVIAVPESTGAAATVALTVTGEATETGTVNVYTGRTRVQAPVTSGDDAAAVAVSIKDAVNANPDLPFTATSEA 160 (498) T ss_pred hCCCceeEEEecCCcccceeEEEEEeecccCCCcEEEEEECCEEEEEEecCCCCHHHHHHHHHHHHhCCCCCceEEeecc Confidence 99999999999999899999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ceEEEeeccCcccccceeEEEEecccCcccccccceeeeecccCCCcCcchhhhHHHhhccCcceEEEecCCChHHHHHH Q lcl|Aclame:pro 161 GVVTLTARHKGLCGNEIPVSLNYYGFGGGEVLPAGVQIAVATGTAGTGAPVLTGAVAAMADEPFDYIGLPFNDTASVNTL 240 (498) Q Consensus 161 ~~VtlTAk~kG~~gN~i~l~~~~~~~~~ge~~p~Glt~tit~~agGag~pD~~~alaalg~~~~~~I~~p~tD~a~l~al 240 (498) ++|||||||||++||+|++++|||+..+||.+|+||++++++|+||+||||++++|++||++|||+|+|||+|+++|++| T Consensus 161 ~~vtlTAr~kG~~GN~I~l~~~~~~~~~ge~~p~Glt~titamsgGag~PDia~alaal~~~~~~~i~~p~~D~asl~al 240 (498) T protein:vir:44 161 GVVTLTARHKGLYGNEIPVTLNYYGFGGGEVLPAGVNITVASGVKGAGAPALNDAVAAMGDEPFDYIGLPFNDTASVNSM 240 (498) T ss_pred ceEEEEEeccCcccCcceEEEeeccCccccccccceeEEEEcccCCccCchhHHHHHhhccCCccEEEEeecCHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHhhhhhhhhhhhheeeEEEEeccCCHHHHHhhhhccCcceEEEEecCCCCCCcHHHHHHHHHHHhhhhhccCccccc Q lcl|Aclame:pro 241 VTEMNDTSGRWSYARQLYGHVYTAKTGTLSELVNAGDQFNQQHITLAGYEKETQTPADELAASRTARAAVFIRNDPARPT 320 (498) Q Consensus 241 ~~~l~~~s~r~~~~~q~~g~~~~~~~gt~~~~~t~g~~~N~~~~t~~~~~~~~~~p~~~~AAa~~a~~a~~l~~DPArpl 320 (498) ++||+++++||+|++|++||+|++++||++++++||..|||+|++|+|+++++++|+|+|||++++++|.++++|||||| T Consensus 241 ~~~L~~~sgRw~~~~q~~g~~~~a~~gT~a~l~t~g~~~N~~~it~~~~~~~~~sp~~~~AAa~a~~aA~~l~~DPArPL 320 (498) T protein:vir:44 241 ATEMNDSSGRWSYVRQLYGHVYTAKTGTLSELVAAGDQFNLQHITLAGYEKDTQTPADELAASRTARAAVFIRNDPARPT 320 (498) T ss_pred HHHHhhhhcchHHHhhcCeEEEEeccCCHHHHHHhhhccCCceEEEEecCCCCCCHHHHHHHHHHHHHHHHhhccccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cceEEeccccCCCccccChHHHHHHHhCCeeEEEEcCCeEEEEeeeeeeeecCCCCCCchhhhhhhHHHHHHHHHHHHHH Q lcl|Aclame:pro 321 QTGELVGMLPAPKGKRFTMTEQQTLLSHGVATAYVESGVLRIQRDVTTYRKNAYGVADNSYLDSETLHTSAYVLRKLKSV 400 (498) Q Consensus 321 ~tl~L~Gl~~p~~~~r~~~~er~~lL~~Gist~~v~~G~v~IeR~ITTY~~n~~G~~D~s~ldi~t~~tl~yv~~~~r~~ 400 (498) |+|+|+|++||+.++||+++|||+||++||||++|++|+|+|||.|||||+|++|.+|+|||||+|++|++|+||.+|++ T Consensus 321 ~tl~L~Gi~~p~~~~r~~~~ern~LL~~Gist~~V~~G~V~I~R~ITTY~~n~~G~~D~syLDi~T~~tl~yvr~~~r~~ 400 (498) T protein:vir:44 321 QTGELVDMLPAPKGKRFTTTEQQTLLSHGVATAYVESGVLRIQRDITTYRKNAYGVADNSYLDSETLHTSAYVLRRLKSV 400 (498) T ss_pred CceeecccccCCchhcCChHHHHHHHhcCcceEEEcCCeEEEEeeeeeeeecCCCCcchhhhhhhhHHHHHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HhhhcCCceeccCCCCcCCCcccccHHHHHHHHHHHHHHHhhcccccchhhhcCeEEEEEcCCCCeEEEEEeeeEEecCe Q lcl|Aclame:pro 401 ITSKYGRHKLASDGTRFGPGQAIVTPAVIKGELLATYRQLERAGIVENYELFKQYLVVERDASVPNRLNTLFPPDYVNQL 480 (498) Q Consensus 401 ~~~~~~r~kla~dg~~~~~g~~ivTp~~ikaeli~~~~~le~~given~~~~~~~lvVerd~~d~nRvn~~~p~~~vn~l 480 (498) |++|||||||++||+|+.|||+||||++||+||+++|++||++|||||+|.||++|+||||++||||||+++|+|+|||| T Consensus 401 i~~kfpR~KLa~d~~~~~~gq~IvTp~~ir~eli~~y~~le~~givEn~~~~~~~LiVerd~~dpnRln~~~p~d~vn~L 480 (498) T protein:vir:44 401 ITSKYGRHKLANDGTRFGSGQAIVTPAVIRGELGSTYRQMEREGIVENFDLFQQHLIVERNANDSNRLDVLFPPDYVNQL 480 (498) T ss_pred hhhhcCCcccccCCcccCCCcccccHHHHHHHHHHHHHhhhhhccccChhhhcceeEEEECCCCCcEEEEEecccccCch Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEeeeeeeEEEecccCC Q lcl|Aclame:pro 481 RVFAVVNQFRLQYSEESA 498 (498) Q Consensus 481 ~v~A~~~~f~lq~~~~~~ 498 (498) ||||+++||||||++++| T Consensus 481 ~V~A~~~~f~lq~~~~~~ 498 (498) T protein:vir:44 481 RVFAVLNQFRLQYSEEAA 498 (498) T ss_pred hhhhhhhhhhhhhhhhcC Confidence 999999999999999999 No 3 >protein:vir:489 Length: 498 # NCBI annotation: putative sheath protein # Family: family:all:369 # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543096;swissprot:trembl:q8w623;genbank:gi:18249908;uniprot:Q8W623;genbank:GeneID:929698 Probab=100.00 E-value=2.6e-232 Score=1290.21 Aligned_cols=498 Identities=83% Similarity=1.233 Sum_probs=494.4 Q ss_pred CccchhhcCcccccCeEEEEEecCCCCCCCCCccEEEEEecCCCCccccceeEEecChHHHHHhhCcCcHHHHHHHHHHH Q lcl|Aclame:pro 1 MTISFNTIPSNTLVPLFYAEMDNQAANTAQDSGASLLIGHANNGAEIVANSLVLMPSADYARQICGAGSQLARMVEAYRQ 80 (498) Q Consensus 1 M~i~f~~Ip~~~rvPg~y~E~dns~a~~~~~~~~vLliGq~~~~g~~~~~~~~~v~s~~~A~~~fG~GS~l~~M~~a~~~ 80 (498) |+|+||+||+|+||||+|+|||||+|+....+||||||||++++|++++++|++|+|++||+++||+|||+|+|+++|+| T Consensus 1 M~IsF~~IP~~iRvP~~y~E~dns~A~~~~~~qrvLiiGq~la~gt~~~~~~v~v~s~~~a~~~fG~GS~l~~M~~a~~~ 80 (498) T protein:vir:48 1 MTISFSAVPSDTLVPLFYAEMDNSAANTAVTSAPALLIGHASNDAAIEVNSLVLMPSADYARQICGAGSQLARMVDVYRQ 80 (498) T ss_pred CCccccccCcccccceEEEEEecCCCccccCCcceEEEeecCccccccccceEEecCHHHHHHhcCcccHHHHHHHHHHH Confidence 99999999999999999999999999877778999999999999999999999999999999999999999999999999 Q ss_pred hCCCceEEEEEecCCccceeEEEEEEeeeccCCcEEEEEEccEEEEEEeecCCCHHHHHHHHHHHHhcCCCceEEEeecc Q lcl|Aclame:pro 81 TDPFGELYVIAVPEATGAAATVTLTVTGEATESGTVNVYVGRTRVQAPVTNGDNVTTIASSIQDAINAVPTLPFTASSSA 160 (498) Q Consensus 81 ~n~~~~l~~i~l~d~ag~aatg~ititgtat~~G~l~l~I~g~~v~v~V~~gdtaa~iA~~l~~aIn~~~~lpVtA~~~~ 160 (498) +||+++||+|+++|++|++|+|+|||+|+||++|++.|||||++|+++|.+|||+++||++|+++||+.++|||||++++ T Consensus 81 ~n~~~~l~~i~~~D~ag~aA~g~it~tg~at~~G~l~l~Igg~~v~v~V~~gdTaa~vA~al~aai~a~~~lPVTA~~~~ 160 (498) T protein:vir:48 81 TDPFGELYVIAVPEARGAAATVRVTVTGEAEESGTLSLYVGRSSVQVPVVNGDDATAVATAIKEAVNGVITLPFAASSDA 160 (498) T ss_pred hCCCceeEEEeeCCcccceeEEEEEecccccCCceEEEEECCEEEEEeecCCCCHHHHHHHHHHHHhCCCCcceEEEecC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ceEEEeeccCcccccceeEEEEecccCcccccccceeeeecccCCCcCcchhhhHHHhhccCcceEEEecCCChHHHHHH Q lcl|Aclame:pro 161 GVVTLTARHKGLCGNEIPVSLNYYGFGGGEVLPAGVQIAVATGTAGTGAPVLTGAVAAMADEPFDYIGLPFNDTASVNTL 240 (498) Q Consensus 161 ~~VtlTAk~kG~~gN~i~l~~~~~~~~~ge~~p~Glt~tit~~agGag~pD~~~alaalg~~~~~~I~~p~tD~a~l~al 240 (498) ++|||||||||++||+|++++|||+..+||.+|+||++++|+|+||+||||++++|++||++|||+|+|||+|+++|++| T Consensus 161 ~~VtlTAr~kG~~GN~I~l~~~~~~~~~ge~~p~Glt~~itamsgGag~PDia~aLaal~~~~~~~I~~p~~D~asl~al 240 (498) T protein:vir:48 161 GVVTLTARHKGLYGNELPVCLNYYGSGGGEILPAGLQVVTEAGTAGSGAPDLTAAVAAMGDEAFDFIGLPFNDAASINMM 240 (498) T ss_pred cEEEEEeeecccccccceeeeeeccCcccccccceeeEEEEcccCCccCcchHHHHHhhccCCccEEEEeecCHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHhhhhhhhhhhhheeeEEEEeccCCHHHHHhhhhccCcceEEEEecCCCCCCcHHHHHHHHHHHhhhhhccCccccc Q lcl|Aclame:pro 241 VTEMNDTSGRWSYARQLYGHVYTAKTGTLSELVNAGDQFNQQHITLAGYEKETQTPADELAASRTARAAVFIRNDPARPT 320 (498) Q Consensus 241 ~~~l~~~s~r~~~~~q~~g~~~~~~~gt~~~~~t~g~~~N~~~~t~~~~~~~~~~p~~~~AAa~~a~~a~~l~~DPArpl 320 (498) ++||+++++||+|++|++||+|++++||++++++||..|||+|++|+|+++++++|+|+|||++++++|.++++|||||| T Consensus 241 ~~~L~~~sgRw~~~~q~~g~~~~a~~gT~~~l~t~g~~~N~~~it~~~~~~~~~~p~~~~AAa~a~~aA~~l~~DPArPL 320 (498) T protein:vir:48 241 MTEMNDSSGRWSYARQLYGHVYTAKLGTLSELVNAGDMHNQQHITLAGYEKETQSPVDELVASRLAREAVFIRNDPARPT 320 (498) T ss_pred HHHHhhhhhhhhHHhhcCeEEEEeccCCHHHHHHhhhccCCceEEEEecCCCCCChHHHHHHHHHHHHHHhhhccccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cceEEeccccCCCccccChHHHHHHHhCCeeEEEEcCCeEEEEeeeeeeeecCCCCCCchhhhhhhHHHHHHHHHHHHHH Q lcl|Aclame:pro 321 QTGELVGMLPAPKGKRFTMTEQQTLLSHGVATAYVESGVLRIQRDVTTYRKNAYGVADNSYLDSETLHTSAYVLRKLKSV 400 (498) Q Consensus 321 ~tl~L~Gl~~p~~~~r~~~~er~~lL~~Gist~~v~~G~v~IeR~ITTY~~n~~G~~D~s~ldi~t~~tl~yv~~~~r~~ 400 (498) |+|+|+|++||+.++||+++|||+||++||||++|++|+|+|||.|||||+|++|.+|+|||||||++|++|+||.+|++ T Consensus 321 qtl~L~Gi~~p~~~~r~~~~ern~LL~~Gist~~V~~G~V~I~R~ITTY~~n~~G~~D~syLDi~T~~tl~yvr~~~r~~ 400 (498) T protein:vir:48 321 QTGELVGMLPAPKGKRFIMTEQQTLLSHGVATAYVEGGTLRIQRSVTTYKKNAYGVADNSYLDSETLHTSAYVLRKLKSV 400 (498) T ss_pred cceeeeccccCCchhcCChHHHHHHHhcCcceEEEcCCeEEEEeeeeeeeecCCCCcchhhhhhhhHHHHHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HhhhcCCceeccCCCCcCCCcccccHHHHHHHHHHHHHHHhhcccccchhhhcCeEEEEEcCCCCeEEEEEeeeEEecCe Q lcl|Aclame:pro 401 ITSKYGRHKLASDGTRFGPGQAIVTPAVIKGELLATYRQLERAGIVENYELFKQYLVVERDASVPNRLNTLFPPDYVNQL 480 (498) Q Consensus 401 ~~~~~~r~kla~dg~~~~~g~~ivTp~~ikaeli~~~~~le~~given~~~~~~~lvVerd~~d~nRvn~~~p~~~vn~l 480 (498) |++|||||||++||+++.|||+||||++||+||+++|++||++|||||+|.||++|+||||++||||||+++|+|+|||| T Consensus 401 i~~kfpR~KLa~dg~~~~~gq~IvTp~~ir~eli~~y~~le~~given~~~~~~~LiVerd~~dpnRln~~~p~d~vn~L 480 (498) T protein:vir:48 401 ITSKYGRHKLANDGTRFGPGQAIVTPAVIKGELLATYRQMERAGIVENYDLFKQYLIVERDADNPNRLNTLFPPDYVNQL 480 (498) T ss_pred hhhhcCCceecccCcccCCCCcccchHHHHHHHHHHHHhhhhhccccChhhhcceeEEEECCCCCcEEEEEecccccCch Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEeeeeeeEEEecccCC Q lcl|Aclame:pro 481 RVFAVVNQFRLQYSEESA 498 (498) Q Consensus 481 ~v~A~~~~f~lq~~~~~~ 498 (498) ||||+++||||||++++| T Consensus 481 ~V~A~~~~f~lq~~~~~~ 498 (498) T protein:vir:48 481 RVFAVVNQFRLQYSEESA 498 (498) T ss_pred hhhhhhhhhhhhhhhcCC Confidence 999999999999999999 No 4 >protein:vir:1996 Length: 495 # NCBI annotation: major tail subunit # Family: family:all:369 # MgeID: mge:320 # MgeName: Mu # Cross-refs: genbank:acc:NP_050643;genbank:gi:9633530;genbank:GeneID:2636269 Probab=100.00 E-value=2.6e-223 Score=1240.91 Aligned_cols=483 Identities=37% Similarity=0.581 Sum_probs=474.8 Q ss_pred Cc-cchhhcCcccccCeEEEEEecCCC--CCCCCCccEEEEEecCCCCccccceeEEecChHHHHHhhCcCcHHHHHHHH Q lcl|Aclame:pro 1 MT-ISFNTIPSNTLVPLFYAEMDNQAA--NTAQDSGASLLIGHANNGAEIVANSLVLMPSADYARQICGAGSQLARMVEA 77 (498) Q Consensus 1 M~-i~f~~Ip~~~rvPg~y~E~dns~a--~~~~~~~~vLliGq~~~~g~~~~~~~~~v~s~~~A~~~fG~GS~l~~M~~a 77 (498) |+ |+||+||++|||||+|+|||||+| |+|.++||||||||++++|++++++|++|+|.+||+++||+|||+|+|+++ T Consensus 1 m~~i~F~~IP~~iRvP~~y~E~dns~A~~g~~~~~q~vLiiGq~la~gs~~~~~pv~v~s~~~a~~~fG~GS~la~M~~a 80 (495) T protein:vir:19 1 MSDISFNAIPSDVRVPLTYIEFDNSNAVSGTPAPRQRVLMFGQSGSKASAAPNVPVRIRSGSQASAAFGQGSMLALMADA 80 (495) T ss_pred CCCCchhhCCcccccCeEEEEEccCCCCcCCcCCCceEEEEEecCcccccccceeEEecCHHHHHHhcCcCcHHHHHHHH Confidence 98 999999999999999999999999 689999999999999999999999999999999999999999999999999 Q ss_pred HHHhCCCceEEEEEecCCccceeEEEEEEeeeccCCcEEEEEEccEEEEEEeecCCCHHHHHHHHHHHHhcCCCceEEEe Q lcl|Aclame:pro 78 YRQTDPFGELYVIAVPEATGAAATVTLTVTGEATESGTVNVYVGRTRVQAPVTNGDNVTTIASSIQDAINAVPTLPFTAS 157 (498) Q Consensus 78 ~~~~n~~~~l~~i~l~d~ag~aatg~ititgtat~~G~l~l~I~g~~v~v~V~~gdtaa~iA~~l~~aIn~~~~lpVtA~ 157 (498) |+|+||+++||+|+++|++|++|+|+||++|++|++|+++|||||++|+++|.+|||+++||++|+++||+.++|||||+ T Consensus 81 ~~~~n~~~~l~~i~~~D~aG~aA~g~it~tg~at~~G~l~l~I~g~~v~v~V~~gdTaa~vA~al~aaina~~~lPvTA~ 160 (495) T protein:vir:19 81 FLNANRVAELWCIPQGNGTGNAAVGEISLSGTAGENGSLVTYIAGQRLAVSVAAGATGAALADLLVARIKGQPDLPVTAE 160 (495) T ss_pred HHHhCCcceEEEEeeCChhhceeEEEEEEeecCCCCcEEEEEECCEEEEEEecCCCCHHHHHHHHHHHhcCCccCceEEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred e--------ccceEEEeeccCcccccceeEEEEecccCcccccccceeeeecccCCCcCcchhhhHHHhhccCcceEEEe Q lcl|Aclame:pro 158 S--------SAGVVTLTARHKGLCGNEIPVSLNYYGFGGGEVLPAGVQIAVATGTAGTGAPVLTGAVAAMADEPFDYIGL 229 (498) Q Consensus 158 ~--------~~~~VtlTAk~kG~~gN~i~l~~~~~~~~~ge~~p~Glt~tit~~agGag~pD~~~alaalg~~~~~~I~~ 229 (498) + +.++|||||||||+ +|+|++++|||+ ||.+|+||++++|+|+||+||||++++|++||++||++|+| T Consensus 161 ~~~~~~~~~a~~~VtlTAr~kG~-~n~idi~~~~~~---ge~~p~Glt~titamsgGag~PDia~alaal~~~~~~~I~~ 236 (495) T protein:vir:19 161 VRADSGDDDTHADVVLSAKFTGA-LSAVDVRWNYYA---GETTPYGIITAFKAASGKNGNPDISASIAGMGDLQYKYIVM 236 (495) T ss_pred eeccCCCCcCceeEEEEEeeccc-cccceeEEEeec---ccccccceeEEEEecCCCCCCcchHHHHHHhccCCCcEEEE Confidence 8 55799999999998 599999999996 89999999999999999999999999999999999999999 Q ss_pred cCCChHHHHHHHHHHhhhhhhhhhhhheeeEEEEeccCCHHHHHhhhhccCcceEEEEecCCCCCCcHHHHHHHHHHHhh Q lcl|Aclame:pro 230 PFNDTASVNTLVTEMNDTSGRWSYARQLYGHVYTAKTGTLSELVNAGDQFNQQHITLAGYEKETQTPADELAASRTARAA 309 (498) Q Consensus 230 p~tD~a~l~al~~~l~~~s~r~~~~~q~~g~~~~~~~gt~~~~~t~g~~~N~~~~t~~~~~~~~~~p~~~~AAa~~a~~a 309 (498) ||+|+++|++|++||++ ||+|++|++||+|++++||++++++||..|||+|++|+|+++ +++|+|+|||++++++| T Consensus 237 P~tD~asL~al~~~l~~---rw~~~~q~~g~~~~a~~gT~~~l~t~g~~~N~~~it~~~~~g-sp~~~~~~AAA~aa~~A 312 (495) T protein:vir:19 237 PYTDEPNLNLLRTELQE---RWGPVNQADGFAVTVLSGTYGDISTFGVSRNDHLISCMGIAG-APEPSYLYAATLCAVAS 312 (495) T ss_pred ecCcHHHHHHHHHHHHH---hhhHHHhcCeEEEEeecCCHHHHHHhhhccCCceEEEEecCC-CCCcHHHHHHHHHHHHH Confidence 99999999999999976 999999999999999999999999999999999999999976 55788999999999999 Q ss_pred hhhccCccccccceEEeccccCCCccccChHHHHHHHhCCeeEEEEc-CCeEEEEeeeeeeeecCCCCCCchhhhhhhHH Q lcl|Aclame:pro 310 VFIRNDPARPTQTGELVGMLPAPKGKRFTMTEQQTLLSHGVATAYVE-SGVLRIQRDVTTYRKNAYGVADNSYLDSETLH 388 (498) Q Consensus 310 ~~l~~DPArpl~tl~L~Gl~~p~~~~r~~~~er~~lL~~Gist~~v~-~G~v~IeR~ITTY~~n~~G~~D~s~ldi~t~~ 388 (498) .++++|||||||+|+|+|++||+.++||+++|||+||++||||++|+ +|+|+|||.|||||+|++|.+|+|||||||++ T Consensus 313 ~~l~~DPArPL~tl~L~Gi~~p~~~~r~~~~ern~LL~~Gist~~V~~~G~V~I~R~ITTY~~n~~G~~D~syLDi~T~~ 392 (495) T protein:vir:19 313 QALSIDPARPLQTLTLPGRMPPAVGDRFTWSERNALLFDGISTFNVNDGGEMQIERMITMYRTNKYGDSDPSYLNVNTIA 392 (495) T ss_pred HHhhcccccccCceeecceecCCccccCChHHHHHHHhCCcceEEECCCCeEEEEeeeeeeeecCCCCcchhhhhhHHHH Confidence 99999999999999999999999999999999999999999999996 78999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHhhhcCCceeccCCCCcCCCcccccHHHHHHHHHHHHHHHhhcccccchhhhcCeEEEEEcCCCCeEE Q lcl|Aclame:pro 389 TSAYVLRKLKSVITSKYGRHKLASDGTRFGPGQAIVTPAVIKGELLATYRQLERAGIVENYELFKQYLVVERDASVPNRL 468 (498) Q Consensus 389 tl~yv~~~~r~~~~~~~~r~kla~dg~~~~~g~~ivTp~~ikaeli~~~~~le~~given~~~~~~~lvVerd~~d~nRv 468 (498) ||+|+||++|+++++|||||||++||+++.|||+||||++||+||+++|++||++|||||+|.||++|+||||++||||| T Consensus 393 tl~yvr~~~r~~i~~kfpR~KLa~d~~~~~~gq~IvTp~~ir~ell~~~~~le~~given~~~~~~~LiVerd~~dpnRl 472 (495) T protein:vir:19 393 TLSYLRYSLRTRITQKFPNYKLASDGTRFATGQAVVTPSVIKTELLALFEEWENAGLVEDFDTFKEELYVARNKDDKDRL 472 (495) T ss_pred HHHHHHHHHHHHHhhhcCCcccccCCCCCCCcccccChHHHHHHHHHHHHhhhhhccccChhhhcceeEEEECCCCCcEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEeeeEEecCeEEEeeeeeeEE Q lcl|Aclame:pro 469 NTLFPPDYVNQLRVFAVVNQFRL 491 (498) Q Consensus 469 n~~~p~~~vn~l~v~A~~~~f~l 491 (498) |+++|+|+||||||||+++|||| T Consensus 473 n~~~p~d~vn~L~V~A~~i~f~L 495 (495) T protein:vir:19 473 DVLCGPNLINQFRIFAAQVQFIL 495 (495) T ss_pred EEEecceeeCceeeeeeeeeeeC Confidence 99999999999999999999999 No 5 >protein:vir:100829 Length: 607 # NCBI annotation: hypothetical protein # Family: family:all:2449 # MgeID: mge:1633 # MgeName: LP65 # Cross-refs: genbank:acc:YP_164738;genbank:gi:56693151;genbank:GeneID:3197462 Probab=100.00 E-value=4.5e-49 Score=285.62 Aligned_cols=462 Identities=16% Similarity=0.179 Sum_probs=324.8 Q ss_pred Ccc------chhhcCcccc--cCeEEEEEecCCC-CCCCCCccEE-EEEecCCCCccccceeEEecChHHHHHhhCcCcH Q lcl|Aclame:pro 1 MTI------SFNTIPSNTL--VPLFYAEMDNQAA-NTAQDSGASL-LIGHANNGAEIVANSLVLMPSADYARQICGAGSQ 70 (498) Q Consensus 1 M~i------~f~~Ip~~~r--vPg~y~E~dns~a-~~~~~~~~vL-liGq~~~~g~~~~~~~~~v~s~~~A~~~fG~GS~ 70 (498) |+- |+..|-..-+ -||+|+|+|.|.. +.+...-+++ +||... .+++++|+++++.+||+.+||-|= T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~pgv~~~~~~~~~~~~~~~~~~~~~~iG~a~---~G~~~~~~~~~~~~~a~~~f~~g~- 76 (607) T protein:vir:10 1 MTTTITSAESYKRIYPLFYDSRPHVETNFDDSRLSNTASDSAKNIFMLGSAT---NGDPTKVYEIRTSQQATKIFGSGD- 76 (607) T ss_pred CcceecchhhHHHHhCCCCccCCceEEEEecCcCcCCCCCCcceEEEEEEeC---CCCCceEEEEcchhHHHHhhcCcc- Confidence 652 3444443333 4999999999887 4555555555 588653 346799999999999999999876 Q ss_pred HHHHHHHHHH-----hCCCceEEEEEecCC-ccceeEEEEEEeee--ccCCcEEE------------------------- Q lcl|Aclame:pro 71 LARMVEAYRQ-----TDPFGELYVIAVPEA-TGAAATVTLTVTGE--ATESGTVN------------------------- 117 (498) Q Consensus 71 l~~M~~a~~~-----~n~~~~l~~i~l~d~-ag~aatg~ititgt--at~~G~l~------------------------- 117 (498) |.++++.++. .|.-+.++++.+.++ +.++..+.++++.+ .+.++.++ T Consensus 77 l~~a~~~a~~~~~~~~~g~~~~~~~rv~~~~~a~~~~~~~~~~~~~~~~~~~~i~~~l~~~~~~~~~~~~~~~~d~~~~~ 156 (607) T protein:vir:10 77 LVDGIKLAFDPTGNSVTNGGTVYALRVDNAKQASLVKDGLTFTSSIFGTNANQVSVALDNDVFGVPRITVNYSPDNYERT 156 (607) T ss_pred hHHHHHHhhccccCCccCCceEEEEeCCCccccceecccccccccccccCCCceEEEEEecCCCccceeEEeecccceee Confidence 5555556654 566788999998654 22222332222211 11222221 Q ss_pred ------------------------EEEccEEEEEEeecCCCHHH---------------HHHHHHHHHhcCCCceEEEee Q lcl|Aclame:pro 118 ------------------------VYVGRTRVQAPVTNGDNVTT---------------IASSIQDAINAVPTLPFTASS 158 (498) Q Consensus 118 ------------------------l~I~g~~v~v~V~~gdtaa~---------------iA~~l~~aIn~~~~lpVtA~~ 158 (498) ++-.|+.+.++...|+.+.. -+..+...||..++.-..+. T Consensus 157 ~~n~g~~~~i~y~g~~~~a~~~v~~~~~g~~~~lt~~~~~~~~~~~~V~~~~l~~~~~~t~~~l~~din~~~~~~A~~~- 235 (607) T protein:vir:10 157 YTNIGQMFSITYSGKSASAGYTVSHDTDGKAILLTLGSGDSIDKLTNVATFDLTMSKYDTIAKLMQAISATPNFSASVV- 235 (607) T ss_pred eeeccceeecccCcccccccceeeecCCCceeEEEecCCCccceeeeeecccccccccchHHHHHHHhhcCCceEEEEe- Confidence 11225555555555554322 34557778888877544443 Q ss_pred ccceEEEeeccCcccccceeEEEEec--ccCccc------------------c--c------------------ccce-- Q lcl|Aclame:pro 159 SAGVVTLTARHKGLCGNEIPVSLNYY--GFGGGE------------------V--L------------------PAGV-- 196 (498) Q Consensus 159 ~~~~VtlTAk~kG~~gN~i~l~~~~~--~~~~ge------------------~--~------------------p~Gl-- 196 (498) +...+++|+++..+++++++.... +..-++ . . +.|. T Consensus 236 --g~~~i~tky~d~~~~~i~V~~~~~iv~a~~~D~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a 313 (607) T protein:vir:10 236 --GSPSVNTSYLDEVTSPVDVKTAPAVVTAKIGDAISKLGYDPYVVVTQTSNNKPIVNGVSAGTGSATASVTTAPESFPA 313 (607) T ss_pred --cccceeeeccccccceeEEEEeeeeechhhhhhhhcccccceEEeeecccchhhhhhhhccccceeeeeecccccccc Confidence 455689999999999888875322 111111 0 0 0011 Q ss_pred eeeecccCCCc---CcchhhhHHHhhccCcceEEEecCCChHHHHHHHHHHhhhhhhhhhhhheeeEEEEeccCCHHHHH Q lcl|Aclame:pro 197 QIAVATGTAGT---GAPVLTGAVAAMADEPFDYIGLPFNDTASVNTLVTEMNDTSGRWSYARQLYGHVYTAKTGTLSELV 273 (498) Q Consensus 197 t~tit~~agGa---g~pD~~~alaalg~~~~~~I~~p~tD~a~l~al~~~l~~~s~r~~~~~q~~g~~~~~~~gt~~~~~ 273 (498) ++.-++++||. .+.+++++|++|..+.+++++++..|++-+.++++|++. +....+++.++.......++.++. T Consensus 314 ~~a~~~LtGGtdG~~~~ty~dal~aLe~~e~~~i~~~t~d~ai~~~l~a~vkr---~~~~g~~~~aVlg~~~~~t~~~~~ 390 (607) T protein:vir:10 314 NFDTAFLTGGSTGDVPVSWADKFNGAIGNNVYYIIPLTSEENIHAELQAFIDE---QHVLGYNYHAFVGGGFAEPLEQIL 390 (607) T ss_pred ccceeeeeCCCCCCchhhHHHHHHHHhhcCceEEEecCCCHHHHHHHHHHHHH---HHhCCCcEEEEecCCCCCCHHHHH Confidence 12224566665 345789999999999999999999999989999999976 344567788888888889999999 Q ss_pred hhhhccCcceEEEEecCC-------CCCCcHHHHHHHHHHHhhhhhccCccccccceEEeccccCCCccccChHHHHHHH Q lcl|Aclame:pro 274 NAGDQFNQQHITLAGYEK-------ETQTPADELAASRTARAAVFIRNDPARPTQTGELVGMLPAPKGKRFTMTEQQTLL 346 (498) Q Consensus 274 t~g~~~N~~~~t~~~~~~-------~~~~p~~~~AAa~~a~~a~~l~~DPArpl~tl~L~Gl~~p~~~~r~~~~er~~lL 346 (498) ++....|++++..++..+ ....|++.+|+++|+..| ..+|++++...++.+. +...||+.+|++.|| T Consensus 391 t~a~~~N~ervv~V~~~~~~~~~G~~~~~~~~~~Aa~vAGl~A---g~~~~~SlT~k~i~~~---~v~~~lt~~e~e~ai 464 (607) T protein:vir:10 391 SRQVNINDSRFGLVGQSGHVQEGGESVHVPAYLMAAYVGGLSS---SLGVAVPITNKKLALV---DLDQNFSGDDLNTLN 464 (607) T ss_pred HHHHhhCCCcEEEEecCeeEeeCCcceeccHHHHHHHHHHHHh---cCccccCcccceeccc---cccccCCHHHHHHHH Confidence 999999999998775421 245778887888777777 6789999988887633 567799999999999 Q ss_pred hCCeeEEEEcC-----CeEEEEeeeeeeeecCCCCCCchhhhhhhHHHHHHHHHHHHHHHhhhcCCceeccCCCCcCCCc Q lcl|Aclame:pro 347 SHGVATAYVES-----GVLRIQRDVTTYRKNAYGVADNSYLDSETLHTSAYVLRKLKSVITSKYGRHKLASDGTRFGPGQ 421 (498) Q Consensus 347 ~~Gist~~v~~-----G~v~IeR~ITTY~~n~~G~~D~s~ldi~t~~tl~yv~~~~r~~~~~~~~r~kla~dg~~~~~g~ 421 (498) ++|+.+|+++. |.++|+|.||||... .|+.|++|.++|++||+.+++|..+..+|+|++++++ T Consensus 465 ~~Gv~~l~~~~~~~~~~~vrIv~~ItT~t~~----~~~~~~~i~viRv~D~i~~dir~~~~~~yIGk~nnd~-------- 532 (607) T protein:vir:10 465 QNGVIGIEHLVNRNATGGYYIVQDVSTNTVS----SSHVDGSLYLGELTDFLFDNLRFVLRDTYIGSNIRST-------- 532 (607) T ss_pred hCCeEEEEEccCccccceEEEeeeeeeccCC----CCcchheeehhhhHHHHHHHHHHHHhhcCCcccCCcc-------- Confidence 99999998753 348999999998643 5889999999999999999999999999999986664 Q ss_pred ccccHHHHHHHHHHHH--HHHhhcccccchhhhcCeEEEEEcCCCCeEEEEEeeeEEecCeEEEeeeeeeEEEecccCC Q lcl|Aclame:pro 422 AIVTPAVIKGELLATY--RQLERAGIVENYELFKQYLVVERDASVPNRLNTLFPPDYVNQLRVFAVVNQFRLQYSEESA 498 (498) Q Consensus 422 ~ivTp~~ikaeli~~~--~~le~~given~~~~~~~lvVerd~~d~nRvn~~~p~~~vn~l~v~A~~~~f~lq~~~~~~ 498 (498) +.+.+|.++.+.+ ++|++.|+|+|++. +++.|++++ |. +.+.+....|+.|+.|=..+.|+=|--++.- T Consensus 533 ---~~~~vk~~i~~~L~~~~l~~~gaI~df~~--edv~v~~~~-D~--v~v~~~v~Pv~~iekIyvtv~v~~~~~~~~~ 603 (607) T protein:vir:10 533 ---SADDIKSTVASYLYSEMNNDDGLIVDFSE--SDIVVTISG-TV--VYIQFAVAPTQEIKNIVVSGTYSNYSATSED 603 (607) T ss_pred ---hHHHHHHHHHHHHHHHHHHhcCceeCCCc--cccEEeeCC-CE--EEEEEEEEEcccceEEEEEEEEEEEEEeecc Confidence 3368999999875 45788999999853 678999865 44 4455555556666666566666555443333 No 6 >protein:vir:95741 Length: 587 # NCBI annotation: ORF011 # Family: family:all:2449 # MgeID: mge:1577 # MgeName: G1 # Cross-refs: genbank:acc:YP_240910;genbank:gi:66394960;genbank:GeneID:5132682 Probab=100.00 E-value=7.5e-50 Score=289.91 Aligned_cols=446 Identities=15% Similarity=0.178 Sum_probs=315.4 Q ss_pred Cccc-hhhcCcccccCeEEEEEecCCC-CCCCCC-ccEEEEEecCCCCccccceeEEecChHHHHHhhCcCcHHHHHHHH Q lcl|Aclame:pro 1 MTIS-FNTIPSNTLVPLFYAEMDNQAA-NTAQDS-GASLLIGHANNGAEIVANSLVLMPSADYARQICGAGSQLARMVEA 77 (498) Q Consensus 1 M~i~-f~~Ip~~~rvPg~y~E~dns~a-~~~~~~-~~vLliGq~~~~g~~~~~~~~~v~s~~~A~~~fG~GS~l~~M~~a 77 (498) |.|+ |+.=| +-.||+|+|+|.|.. +..... ..+.+||... .+++++|+++++.+||+.+||-|- |..+++. T Consensus 1 ~a~~~~~~~~--~~~pgv~~~~~~~~~~~~~~~~~~~~~~~g~a~---~G~~~~~~~~~~~~~~~~~~~~g~-l~~~~~~ 74 (587) T protein:vir:95 1 MAVEPFPRRP--ITRPHASIEVDTSGIGGSAGSSEKVFCLIGQAE---GGEPNTVYELRNYSQAKRLFRSGE-LLDAIEL 74 (587) T ss_pred CcccccCCcc--cccCceEEEEecCCccccCCCCCceEEEEEEeC---CCCCceeEEeccHHHHHHHhcCcc-hHHHHHH Confidence 9865 44444 567999999999876 344444 4455688643 447899999999999999999776 5555666 Q ss_pred HHH---hCCCceEEEEEecCC-ccceeEEEEEEee--eccCCcEEEEEE-----cc-EEEEEEeec-------------- Q lcl|Aclame:pro 78 YRQ---TDPFGELYVIAVPEA-TGAAATVTLTVTG--EATESGTVNVYV-----GR-TRVQAPVTN-------------- 131 (498) Q Consensus 78 ~~~---~n~~~~l~~i~l~d~-ag~aatg~ititg--tat~~G~l~l~I-----~g-~~v~v~V~~-------------- 131 (498) ++. .|.-++++++.+.++ ++++..+.++|+- +.+.+..|++.+ .+ ++.++...+ T Consensus 75 a~~~~~~~g~~~~~~~rv~~~~~a~~~~~~l~~~a~~~G~~gN~i~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~v~ 154 (587) T protein:vir:95 75 AWGSNPNYTAGRILAMRIEDAKPASAEIGGLKITSKIYGNVANNIQVGLEKNTLSDSLRLRVIFQDDRFNEVYDNIGNIF 154 (587) T ss_pred HhccccCCCceEEEEEEcCCCceeEEEecCeEEEEecccccccceEEEEecCCCCCceeEEEEEecccceeeeeecccee Confidence 654 567789999999765 3334345455552 223444444432 11 223222111 Q ss_pred -----CCCHH------------------------------------HHHHHHHHHHhcCCCceEEEeeccceEEEeeccC Q lcl|Aclame:pro 132 -----GDNVT------------------------------------TIASSIQDAINAVPTLPFTASSSAGVVTLTARHK 170 (498) Q Consensus 132 -----gdtaa------------------------------------~iA~~l~~aIn~~~~lpVtA~~~~~~VtlTAk~k 170 (498) |..+. ..+..+.+.||. .+.+|||.. T Consensus 155 si~y~g~~~~~~~~v~~~~~t~~a~~~~l~~g~~~v~~yrL~~g~~~~~~~~~~~in~-------------~~~~tAky~ 221 (587) T protein:vir:95 155 TIKYKGEEANATFSVEHDEETQKASRLVLKVGDQEVKSYDLTGGAYDYTNAIITDINQ-------------LPDFEAKLS 221 (587) T ss_pred eeeeeccccccceeeeecccceeeeeeeeecCCceEEEEEecCCchHHHHHHHHhhcc-------------ccceEEEEe Confidence 11111 011112222221 223566666 Q ss_pred cccccceeEEEEecccC------------------------------------ccccc------------------ccce Q lcl|Aclame:pro 171 GLCGNEIPVSLNYYGFG------------------------------------GGEVL------------------PAGV 196 (498) Q Consensus 171 G~~gN~i~l~~~~~~~~------------------------------------~ge~~------------------p~Gl 196 (498) |..||++.+.. +... .++.. +.|. T Consensus 222 g~~~~~i~~~~--~~~~~~~~v~~~~~~v~a~~~d~~~~~~~~~~v~~~~~~g~~~~~~~~~~~~~~~~a~~~~~~~~~~ 299 (587) T protein:vir:95 222 PFGDKNLESSK--LDKIENANIKDKAVYVKAVFGDLEKQTAYNGIVSFEQLNAEGEVPSNVEVEAGEESATVTATSPIKT 299 (587) T ss_pred cccCceeEEee--cCcccccceehhhhhhhhhhcceeeeeeceeeeeeecccccceeccchhhhhcccchheeccccccc Confidence 66666555432 1100 00000 0011 Q ss_pred --eeeecccCCCc---CcchhhhHHHhhccCcceEEEecCCChHHHHHHHHHHhhhhhhhhhhhheeeEEEEeccCCHHH Q lcl|Aclame:pro 197 --QIAVATGTAGT---GAPVLTGAVAAMADEPFDYIGLPFNDTASVNTLVTEMNDTSGRWSYARQLYGHVYTAKTGTLSE 271 (498) Q Consensus 197 --t~tit~~agGa---g~pD~~~alaalg~~~~~~I~~p~tD~a~l~al~~~l~~~s~r~~~~~q~~g~~~~~~~gt~~~ 271 (498) .+..++++||+ .+.|++++|++|..+.|++|+++.+|++.+.++++|++.. .+..+++.++.......++++ T Consensus 300 ~a~~~~t~LtGG~dG~~~~~y~~~l~ale~~~~~~i~~~t~d~~v~a~l~a~vk~~---~~~g~~~~aVvg~~~~~~~~~ 376 (587) T protein:vir:95 300 IEPFELTKLKGGTNGEPPATWADKLDKFAHEGGYYIVPLSSKQSVHAEVASFVKER---SDAGEPMRAIVGGGFNESKEQ 376 (587) T ss_pred eeccceeeeecCCCCCCcccHHHHHHHHHhCCcEEEEecCCCHHHHHHHHHHHHHH---HhCCCcEEEEEcCCCCCCHHH Confidence 12234577765 3568999999999999999999988888889999999764 345567778877788899999 Q ss_pred HHhhhhccCcceEEEEecCCC--------CCCcHHHHHHHHHHHhhhhhccCccccccceEEeccccCCCccccChHHHH Q lcl|Aclame:pro 272 LVNAGDQFNQQHITLAGYEKE--------TQTPADELAASRTARAAVFIRNDPARPTQTGELVGMLPAPKGKRFTMTEQQ 343 (498) Q Consensus 272 ~~t~g~~~N~~~~t~~~~~~~--------~~~p~~~~AAa~~a~~a~~l~~DPArpl~tl~L~Gl~~p~~~~r~~~~er~ 343 (498) +.++....|++++..++..+. ...|++++||.+|+..| ..||++++...++.+. ....+|+.+|++ T Consensus 377 ~~~~a~~~n~ervi~v~~~~~~~~~dg~~~~~~~~~~aa~vAGl~A---g~~~~~SlT~~~i~~~---~v~~~~t~~e~e 450 (587) T protein:vir:95 377 LFGRQESLSNPRVSLVANSGTFVMDDGRKNHVPAYMVAVALGGLAS---GLEIGESITFKPLRVS---SLDQIYESIDLD 450 (587) T ss_pred HHHHHhhcCCCcEEEecccceEecCCCceeeechHHHHHHHHHHHh---cCchhcCccceeeecc---cccccCCHHHHH Confidence 999999999999988764311 23578888888888777 6789999998887643 456799999999 Q ss_pred HHHhCCeeEEEEcCC----eEEEEeeeeeeeecCCCCCCchhhhhhhHHHHHHHHHHHHHHHhhhcCCceeccCCCCcCC Q lcl|Aclame:pro 344 TLLSHGVATAYVESG----VLRIQRDVTTYRKNAYGVADNSYLDSETLHTSAYVLRKLKSVITSKYGRHKLASDGTRFGP 419 (498) Q Consensus 344 ~lL~~Gist~~v~~G----~v~IeR~ITTY~~n~~G~~D~s~ldi~t~~tl~yv~~~~r~~~~~~~~r~kla~dg~~~~~ 419 (498) .|+++|+.++++..+ .++|+|.||||+. ..|+.|++|.++|++||+.+++|..+..+|+|+|+.++ T Consensus 451 ~ai~~Gvl~l~~~~~~~~~~vriv~~itT~t~----~~d~~~~~i~viRv~D~i~~dir~~~~~~~iGk~nn~~------ 520 (587) T protein:vir:95 451 ELNENGIISIEFVRNRTNTFFRIVDDVTTFND----KSDPVKAEMAVGEANDFLVSELKVQLEDQFIGTRTINT------ 520 (587) T ss_pred HHHhCCeEEEEEecCCcceEEEEeecceeccC----CCCcchhhhhhhhhHHHHHHHHHHHHHhhCCccccchH------ Confidence 999999999988543 3799999999874 36899999999999999999999999999999986654 Q ss_pred CcccccHHHHHHHHHHHHHHHhhcccccchhhhcCeEEEEEcCCCCeEEEEEeeeEEecCeEEEeeeeeeEEEeccc Q lcl|Aclame:pro 420 GQAIVTPAVIKGELLATYRQLERAGIVENYELFKQYLVVERDASVPNRLNTLFPPDYVNQLRVFAVVNQFRLQYSEE 496 (498) Q Consensus 420 g~~ivTp~~ikaeli~~~~~le~~given~~~~~~~lvVerd~~d~nRvn~~~p~~~vn~l~v~A~~~~f~lq~~~~ 496 (498) +...+|+++.+++++|++.|+|+|++. +.+.|++..+ ++.+.+....++.|+-|=..+.|+-|.-++ T Consensus 521 -----~r~~v~~~i~~~L~~l~~~gaI~~~~~--~dv~v~~~~d---~~~v~~~v~Pv~~mekI~vt~~~~~~~~~~ 587 (587) T protein:vir:95 521 -----SASIIKDFIQSYLGRKKRDNEIQDFPA--EDVQVIVEGN---EARISMTVYPIRSFKKISVSLVYKQQTLQA 587 (587) T ss_pred -----HHHHHHHHHHHHHHHHHhCCcccCCCc--cceEEEecCC---EEEEEEEEEEcccceEEEEEEEEeeeeecC Confidence 237899999999999999999999966 6788887553 566777777788888877788887777776 No 7 >protein:vir:99306 Length: 587 # NCBI annotation: putative major tail sheath protein # Family: family:all:2449 # MgeID: mge:1655 # MgeName: K # Cross-refs: genbank:acc:YP_024479;genbank:gi:48696438;genbank:GeneID:2948034 Probab=100.00 E-value=2.4e-48 Score=281.67 Aligned_cols=448 Identities=15% Similarity=0.171 Sum_probs=313.2 Q ss_pred Cccc-hhhcCcccccCeEEEEEecCCC-CCCCCC-ccEEEEEecCCCCccccceeEEecChHHHHHhhCcCcHHHHHHHH Q lcl|Aclame:pro 1 MTIS-FNTIPSNTLVPLFYAEMDNQAA-NTAQDS-GASLLIGHANNGAEIVANSLVLMPSADYARQICGAGSQLARMVEA 77 (498) Q Consensus 1 M~i~-f~~Ip~~~rvPg~y~E~dns~a-~~~~~~-~~vLliGq~~~~g~~~~~~~~~v~s~~~A~~~fG~GS~l~~M~~a 77 (498) |.|+ |+.=| +-.||+|+|+|.|.. +..... ..+.+||... .+++++|+++++.+||+.+||-|- |..+++. T Consensus 1 ~a~~~~~~~~--~~~pgv~~~~~~~~~~~~~~~~~~~~~~~g~a~---~G~~~~~~~~~~~~~~~~~~~~g~-l~~~~~~ 74 (587) T protein:vir:99 1 MAVEPFPRRP--ITRPHASIEVDTSGIGGSAGSSEKVFCLIGQAE---GGEPNTVYELRNYSQAKRLFRSGE-LLDAIEL 74 (587) T ss_pred CcccccCCcc--cccCceEEEEecCCccccCCCCCceEEEEEEec---CCccceeEEeccHHHHHHHhcCcc-hHHHHHH Confidence 9865 44433 567999999999876 344444 4455688643 347799999999999999999766 7777777 Q ss_pred HHH---hCCCceEEEEEecCC-ccceeEEEEEEee--eccCCcEEEEEE-----cc-EEEEEEeec-------------- Q lcl|Aclame:pro 78 YRQ---TDPFGELYVIAVPEA-TGAAATVTLTVTG--EATESGTVNVYV-----GR-TRVQAPVTN-------------- 131 (498) Q Consensus 78 ~~~---~n~~~~l~~i~l~d~-ag~aatg~ititg--tat~~G~l~l~I-----~g-~~v~v~V~~-------------- 131 (498) ++. .|.-++++++++.++ ++++..+.++|+- +...+..|++.+ .+ ++.++...+ T Consensus 75 a~~~~~~~g~~~~~~~rv~~~~~a~~~~~~l~~~a~~~G~~gN~i~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~v~ 154 (587) T protein:vir:99 75 AWGSNPNYTAGRILAMRIEDAKPASAEIGGLKITSKIYGNVANNIQVGLEKNTLSDSLRLRVIFQDDRFNEVYDNIGNIF 154 (587) T ss_pred HhccccCCCceEEEEEEcCCCceeEEEecCeEEEEeeccccccceEEEEccCCCCcceeEEEEEecccceeeeeecccee Confidence 774 466689999999765 3344445555552 122333333311 11 122211111 Q ss_pred -----CCCH------------------------------------HHHHHHHHHHHhcCCCceEEEeeccceEEEeeccC Q lcl|Aclame:pro 132 -----GDNV------------------------------------TTIASSIQDAINAVPTLPFTASSSAGVVTLTARHK 170 (498) Q Consensus 132 -----gdta------------------------------------a~iA~~l~~aIn~~~~lpVtA~~~~~~VtlTAk~k 170 (498) |..+ ...+..+.+.||.. +.+|||.+ T Consensus 155 ~i~y~g~~~~a~~~v~~~~~t~~a~~~~l~~g~~~v~~yrL~~g~~~~~~~~~~~i~~~-------------~~~tAky~ 221 (587) T protein:vir:99 155 TIKYKGEEANATFSVEHDEETQKASRLVLKVGDQEVKSYDLTGGAYDYTNAIITDINQL-------------PDFEAKLS 221 (587) T ss_pred eEEeecccccceeeEeecCcceeeeeeeeecCCceeEEEEecCCchHHHHHHHhhhccc-------------cceeEEee Confidence 1111 11122333333322 22355555 Q ss_pred cccccceeEEEE-------------ecccC---------------------ccc------------------cccc--ce Q lcl|Aclame:pro 171 GLCGNEIPVSLN-------------YYGFG---------------------GGE------------------VLPA--GV 196 (498) Q Consensus 171 G~~gN~i~l~~~-------------~~~~~---------------------~ge------------------~~p~--Gl 196 (498) |.-+|++..+.. +-+.. .++ ..+. .. T Consensus 222 ~~~~~~i~~~~~~~~~~~~v~~~~~~v~a~~~D~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~a 301 (587) T protein:vir:99 222 PFGDKNLESSKLDKIENANIKDKAVYVKAVFGDLEKQTAYNGIVSFEQLNAEGEVPSNVEVEAGEESATVTATSPIKTIE 301 (587) T ss_pred ccCCceeEeecccccccceeeeeeeeeehhccceeeecccceeeeeeecccccchhhhhhhhhccccceeeeecccccee Confidence 544443332110 00000 000 0000 11 Q ss_pred eeeecccCCCc---CcchhhhHHHhhccCcceEEEecCCChHHHHHHHHHHhhhhhhhhhhhheeeEEEEeccCCHHHHH Q lcl|Aclame:pro 197 QIAVATGTAGT---GAPVLTGAVAAMADEPFDYIGLPFNDTASVNTLVTEMNDTSGRWSYARQLYGHVYTAKTGTLSELV 273 (498) Q Consensus 197 t~tit~~agGa---g~pD~~~alaalg~~~~~~I~~p~tD~a~l~al~~~l~~~s~r~~~~~q~~g~~~~~~~gt~~~~~ 273 (498) .+..++++||+ .+.+++++|++|..+.|++|+++.+|++.+.++++|++.. .+..+++.++.......++.++. T Consensus 302 ~~~~t~LtGG~dG~~~~sy~~al~ale~~~~~~i~~~t~d~~i~a~l~a~vk~~---r~~g~~~~aVlg~~~~~~~~~~~ 378 (587) T protein:vir:99 302 PFELTKLKGGTNGEPPATWADKLDKFAHEGGYYIVPLSSKQSVHAEVASFVKER---SDAGEPMRAIVGGGFNESKEQLF 378 (587) T ss_pred cccceeeecCCCCCccccHHHHHHHHhhCCcEEEEecCCCHHHHHHHHHHHHHH---HhCCCcEEEEecCCCCCCHHHHH Confidence 22234566764 3568999999999999999999988888889999999764 24456778888778889999999 Q ss_pred hhhhccCcceEEEEecCCC--------CCCcHHHHHHHHHHHhhhhhccCccccccceEEeccccCCCccccChHHHHHH Q lcl|Aclame:pro 274 NAGDQFNQQHITLAGYEKE--------TQTPADELAASRTARAAVFIRNDPARPTQTGELVGMLPAPKGKRFTMTEQQTL 345 (498) Q Consensus 274 t~g~~~N~~~~t~~~~~~~--------~~~p~~~~AAa~~a~~a~~l~~DPArpl~tl~L~Gl~~p~~~~r~~~~er~~l 345 (498) ++....|++++..++..+. ...|++.+|+.+|+..| ..+|++++...+|.+. +...+|+.+|++.| T Consensus 379 ~~a~~~n~e~vi~v~~~~~~~~~dg~~~~~~~~~~aa~vAGl~A---g~~~~~SlT~~~i~~~---~v~~~~t~~e~e~l 452 (587) T protein:vir:99 379 GRQASLSNPRVSLVANSGTFVMDDGRKNHVPAYMVAVALGGLAS---GLEIGESITFKPLRVS---SLDQIYESIDLDEL 452 (587) T ss_pred HHhhhcCCCcEEEEeccceEecCCCceeeechHHHHHHHHHHHh---cCchhcCccceeeecc---cccccCCHHHHHHH Confidence 9999999999988765321 33578888778777777 6789999988887633 55679999999999 Q ss_pred HhCCeeEEEEcCC----eEEEEeeeeeeeecCCCCCCchhhhhhhHHHHHHHHHHHHHHHhhhcCCceeccCCCCcCCCc Q lcl|Aclame:pro 346 LSHGVATAYVESG----VLRIQRDVTTYRKNAYGVADNSYLDSETLHTSAYVLRKLKSVITSKYGRHKLASDGTRFGPGQ 421 (498) Q Consensus 346 L~~Gist~~v~~G----~v~IeR~ITTY~~n~~G~~D~s~ldi~t~~tl~yv~~~~r~~~~~~~~r~kla~dg~~~~~g~ 421 (498) +++|+.++++..+ .++|+|.||||+.. .|+.|++|.++|++||+.+++|..+..+|+|+|+.+++ T Consensus 453 i~~Gvl~l~~~~~~~~~~vriv~~ItT~t~~----~~~~~~~i~viRv~D~i~~di~~~~~~~yiGk~Nn~~~------- 521 (587) T protein:vir:99 453 NENGIISIEFVRNRTNTFFRIVDDVTTFNDK----SDPVKAEMAVGEANDFLVSELKVQLEDQFIGTRTINTS------- 521 (587) T ss_pred HhCCeEEEEEecCCcceEEEEeeceeeccCC----CCchhhhhhhhhhHHHHHHHHHHHHHhhCCccccchHH------- Confidence 9999999988543 37999999998743 58999999999999999999999999999999866542 Q ss_pred ccccHHHHHHHHHHHHHHHhhcccccchhhhcCeEEEEEcCCCCeEEEEEeeeEEecCeEEEeeeeeeEEEeccc Q lcl|Aclame:pro 422 AIVTPAVIKGELLATYRQLERAGIVENYELFKQYLVVERDASVPNRLNTLFPPDYVNQLRVFAVVNQFRLQYSEE 496 (498) Q Consensus 422 ~ivTp~~ikaeli~~~~~le~~given~~~~~~~lvVerd~~d~nRvn~~~p~~~vn~l~v~A~~~~f~lq~~~~ 496 (498) ...+|++|.+++++|++.|+|+|++. +.+.|++..+ ++.|.+....++.++-|=..+.|+.|.-++ T Consensus 522 ----r~~i~~~i~~~L~~l~~~gaI~~~~~--~dv~v~~~~d---~~~v~~~v~Pv~~mekIy~tv~~~~~~~~~ 587 (587) T protein:vir:99 522 ----ASIIKDFIQSYLGRKKRDNEIQDFPA--EDVQVIVEGN---EARISMTVYPIRSFKKISVSLVYKQQTLQA 587 (587) T ss_pred ----HHHHHHHHHHHHHHHHhCCcccCCCc--cceEEEecCC---EEEEEEEEEEcccceEEEEEEEEEeeeecC Confidence 37899999999999999999999866 6788887553 466677777788888888888888887777 No 8 >protein:vir:96586 Length: 587 # NCBI annotation: ORF011 # Family: family:all:2449 # MgeID: mge:1623 # MgeName: Twort # Cross-refs: genbank:acc:YP_238553;genbank:gi:66391266;genbank:GeneID:5130368 Probab=100.00 E-value=1.1e-48 Score=283.51 Aligned_cols=460 Identities=15% Similarity=0.151 Sum_probs=318.5 Q ss_pred CccchhhcCc-ccccCeEEEEEecCCC-CCCCCCccE-EEEEecCCCCccccceeEEecChHHHHHhhCcCcHHHHHHHH Q lcl|Aclame:pro 1 MTISFNTIPS-NTLVPLFYAEMDNQAA-NTAQDSGAS-LLIGHANNGAEIVANSLVLMPSADYARQICGAGSQLARMVEA 77 (498) Q Consensus 1 M~i~f~~Ip~-~~rvPg~y~E~dns~a-~~~~~~~~v-LliGq~~~~g~~~~~~~~~v~s~~~A~~~fG~GS~l~~M~~a 77 (498) |.|++ .|. .+-.||+|+|++.|.. +.+...-++ .+||... .+++++|+++++.+||+.+||-|. |.+++++ T Consensus 1 ~~~~~--~~~~~~~~Pgv~~~~~~~~~~~~~~~~~~~~~~iG~a~---~g~~~~~~~~~~~~~~~~~~g~G~-l~~ai~~ 74 (587) T protein:vir:96 1 MAKDI--FPRRPIQRPHASIEVDSSGIGGSASNSEKILCLIGKAE---GGEPNTVYQVRNYAQAKSVFRSGE-LLDAIEL 74 (587) T ss_pred Ceeee--eCCCcccCCceEEEEecCCccCCCCCCCceEEEEEEec---CCCCceeEEEcChHHHHHhhcCCc-HHHHHHH Confidence 99987 453 3456999999998877 455555444 4577543 336799999999999999999998 5566666 Q ss_pred HHH---hCCCceEEEEEecCC-ccceeEEEEEEeeec--cCCcEEEEEEc------cEEEEEEee--------------- Q lcl|Aclame:pro 78 YRQ---TDPFGELYVIAVPEA-TGAAATVTLTVTGEA--TESGTVNVYVG------RTRVQAPVT--------------- 130 (498) Q Consensus 78 ~~~---~n~~~~l~~i~l~d~-ag~aatg~ititgta--t~~G~l~l~I~------g~~v~v~V~--------------- 130 (498) ++. .|..++++++.+.++ +++...+.++++++. +.++.+.+.+- -++.++... T Consensus 75 a~~~~~~~g~~~~~a~rv~~~~~a~~~~~~~~~~~~~~g~~~n~i~v~~~~~~~~~~~~~~~~~~~~~~~~~~~n~G~v~ 154 (587) T protein:vir:96 75 AWGSNPQYTAGKILAMRVEDAKASQLEKGGLRVTSKIFGSVSNDIQVALEKNTITDSLRLRVVFQKDNYQEVFDNLGNIF 154 (587) T ss_pred HhccCcCCCceEEEEEecCCCccceeecccccccccccCCCCceEEEEEEeccCCCccceEEEEecCCceeeccccCceE Confidence 665 455578999988765 333444556665542 45555555551 112222111 Q ss_pred ----cCCCHHHHHHH---------HHHHHhcCCCceE---E------------EeeccceEEEeeccCcccccceeEEEE Q lcl|Aclame:pro 131 ----NGDNVTTIASS---------IQDAINAVPTLPF---T------------ASSSAGVVTLTARHKGLCGNEIPVSLN 182 (498) Q Consensus 131 ----~gdtaa~iA~~---------l~~aIn~~~~lpV---t------------A~~~~~~VtlTAk~kG~~gN~i~l~~~ 182 (498) +|+.+.++-.. ..-++...+. .| . .....++..+|||+.|..||++.++.. T Consensus 155 ~i~y~g~~~~a~~~~~~~~~~~~A~~l~l~gg~~-~v~~yrl~~g~~~~~~~~~~~~~~~~~~tAky~g~~~n~~~v~v~ 233 (587) T protein:vir:96 155 SINYKGEGEKATFSVEKDKETQEAKRLVLKVDEK-EVKAYELNGGAYSFTNEIITDINELPDFEAKLSPFGDKNLESRKL 233 (587) T ss_pred EEEecccccceeEeeccCcccceeeeeEEEecCc-eEEEEEeCCCchhhhhhhhhhhccccceEEEeecccCceeEEEee Confidence 11111111000 0000111100 00 0 000112335789999999998887642 Q ss_pred ------------ecccC-ccc----------ccccce-------------------------------eeeecccCCCc- Q lcl|Aclame:pro 183 ------------YYGFG-GGE----------VLPAGV-------------------------------QIAVATGTAGT- 207 (498) Q Consensus 183 ------------~~~~~-~ge----------~~p~Gl-------------------------------t~tit~~agGa- 207 (498) +|... .++ ....|+ .+..++++||. T Consensus 234 d~~~~~~~k~~~~y~~t~~~di~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~aLtGG~d 313 (587) T protein:vir:96 234 DEATDVDIKGKAVYVKAVFGDIENQTQYNQYVKFEQLPEQASEPSDVEVHAETESATVTATSKPKAIEPFELTKLSGGTN 313 (587) T ss_pred ccccccccceEEEeehhhhhhhhhhhccccceeeccccchhhhhhcccccccccceeeeecccccccccccceeeecCCC Confidence 11100 000 000110 11123355653 Q ss_pred --CcchhhhHHHhhccCcceEEEecCCChHHHHHHHHHHhhhhhhhhhhhheeeEEEEeccCCHHHHHhhhhccCcceEE Q lcl|Aclame:pro 208 --GAPVLTGAVAAMADEPFDYIGLPFNDTASVNTLVTEMNDTSGRWSYARQLYGHVYTAKTGTLSELVNAGDQFNQQHIT 285 (498) Q Consensus 208 --g~pD~~~alaalg~~~~~~I~~p~tD~a~l~al~~~l~~~s~r~~~~~q~~g~~~~~~~gt~~~~~t~g~~~N~~~~t 285 (498) .+.+++++|++|..+.|++|+++..|++.+.++++|++.. .+..+++.++.......+++++.+....+|++++. T Consensus 314 G~~~~~y~~~l~ale~~~~~~i~~~t~d~ai~~~l~a~vk~~---r~~gk~~~aVlg~~~~~~~~~~~~~a~~~n~e~vi 390 (587) T protein:vir:96 314 GEPPTSWSAKLEKFKNEGGYYIVPLTDRQSVHSEVATFVKNR---SDAGEPMRAIVGGGTSETKEKLFGRQAILNNPRVA 390 (587) T ss_pred CCCcccHHHHHHHHhhCCcEEEEecCCCHHHHHHHHHHHHHH---HhCCCeEEEEecCCCCCCHHHHHHHHhhcCCCcEE Confidence 3458899999999999999999998888889999999764 24567788888888889999999999999999998 Q ss_pred EEecCC--------CCCCcHHHHHHHHHHHhhhhhccCccccccceEEeccccCCCccccChHHHHHHHhCCeeEEEE-c Q lcl|Aclame:pro 286 LAGYEK--------ETQTPADELAASRTARAAVFIRNDPARPTQTGELVGMLPAPKGKRFTMTEQQTLLSHGVATAYV-E 356 (498) Q Consensus 286 ~~~~~~--------~~~~p~~~~AAa~~a~~a~~l~~DPArpl~tl~L~Gl~~p~~~~r~~~~er~~lL~~Gist~~v-~ 356 (498) +++..+ ....|++.+|+++|+..| ..+|..++...++.+. +...+|+.+|++.|+++|+.+++. + T Consensus 391 ~v~~~~~~~~~~~~~~~~~~~~~aa~vAG~~A---g~~~~~S~T~~~~~~~---~v~~~~t~~e~~~~i~~G~~~l~~~~ 464 (587) T protein:vir:96 391 LVANSGKFVMGNGRILQAPAYMVASAVAGLVS---GLDIGESITFKPLFVN---SLDKVYESEELDELNENGIITIEFVR 464 (587) T ss_pred EEecceEEecCCCceeeechhhHHHHHHHHHh---cCccccCccceeeecc---cccccCCHHHHHHHHhCCeEEEEEec Confidence 876432 234577888878787777 6788888888888754 456699999999999999999986 4 Q ss_pred CCe---EEEEeeeeeeeecCCCCCCchhhhhhhHHHHHHHHHHHHHHHhhhcCCceeccCCCCcCCCcccccHHHHHHHH Q lcl|Aclame:pro 357 SGV---LRIQRDVTTYRKNAYGVADNSYLDSETLHTSAYVLRKLKSVITSKYGRHKLASDGTRFGPGQAIVTPAVIKGEL 433 (498) Q Consensus 357 ~G~---v~IeR~ITTY~~n~~G~~D~s~ldi~t~~tl~yv~~~~r~~~~~~~~r~kla~dg~~~~~g~~ivTp~~ikael 433 (498) ++. .+++|.||||+ ...|+.|++|.++|++||+.+++|..+..+|+|+++.+++ ...+|+++ T Consensus 465 ~~~~~v~~~vnsitT~t----~~~~~~~~~i~virv~D~i~~di~~~~~~~yiGk~nn~~~-----------r~~v~~~i 529 (587) T protein:vir:96 465 NRMTTMFRIVDDVTTFP----DKNDPVKSEMALGEANDFLVSELKILLEEQYIGTRTINTS-----------ASQIKDFV 529 (587) T ss_pred CCcEEEEEeeccceecC----CCCCchhhhhhhHHHHHHHHHHHHHHHHhcCCccccCHHH-----------HHHHHHHH Confidence 443 36778888876 3568899999999999999999999999999999766553 37899999 Q ss_pred HHHHHHHhhcccccchhhhcCeEEEEEcCCCCeEEEEEeeeEEecCeEEEeeeeeeEEEeccc Q lcl|Aclame:pro 434 LATYRQLERAGIVENYELFKQYLVVERDASVPNRLNTLFPPDYVNQLRVFAVVNQFRLQYSEE 496 (498) Q Consensus 434 i~~~~~le~~given~~~~~~~lvVerd~~d~nRvn~~~p~~~vn~l~v~A~~~~f~lq~~~~ 496 (498) .+++++|++.|+|||++. +.+.|++..+ ++.|.+-...++.++-|=..+.++-|.-++ T Consensus 530 ~~~L~~l~~~g~I~~~~~--~dv~v~~~~D---~~~v~~~v~Pv~~mekIy~tv~~~~~~~~~ 587 (587) T protein:vir:96 530 QSYLGRKKRDNEIQDFPP--EDVQVIIEGN---EARISLTIFPIRALKKISVSLVYRQQTLQA 587 (587) T ss_pred HHHHHHHHhCCcccCCCc--cceEEEecCC---EEEEEEEEEEcccceEEEEEEEEEeeeecC Confidence 999999999999999965 6788887653 466666677777777777777777777666 No 9 >protein:vir:63742 Length: 562 # NCBI annotation: Tsh # Family: family:all:2449 # MgeID: mge:1517 # MgeName: P100 # Cross-refs: genbank:gi:82547629;genbank:GeneID:3783475 Probab=100.00 E-value=1.8e-48 Score=282.38 Aligned_cols=460 Identities=14% Similarity=0.147 Sum_probs=322.7 Q ss_pred Cccc-hhhcCcccccCeEEEEEecCCC-CCCCCCccEE-EEEecCCCCccccceeEEecChHHHHHhhCcCcHHHHHHHH Q lcl|Aclame:pro 1 MTIS-FNTIPSNTLVPLFYAEMDNQAA-NTAQDSGASL-LIGHANNGAEIVANSLVLMPSADYARQICGAGSQLARMVEA 77 (498) Q Consensus 1 M~i~-f~~Ip~~~rvPg~y~E~dns~a-~~~~~~~~vL-liGq~~~~g~~~~~~~~~v~s~~~A~~~fG~GS~l~~M~~a 77 (498) |.|+ |+ -.++--||+|+|+|.|.. +.....-+++ +||... .+++++|++++|-+|++.+||-|. +..+++. T Consensus 1 ~~~~~~~--~~~~~~Pgv~~~~~~s~~~~~~~~~~~~~~~iG~a~---~G~~~~~~~~~~~~~~~~~fg~g~-l~~~i~~ 74 (562) T protein:vir:63 1 MAIEIYP--RKPVSRPHTEISVDTSGIGGSSSGSEKILCLVGSAT---GGKPNAVYKVRNYSQAKSVFRSGE-LLDAIER 74 (562) T ss_pred CeeeeeC--CCcccCCceEEEEecCCCcccCCCCCceEEEEEEeC---CCCCceeEEEccHHHHHHHhcCCc-hHHHHHH Confidence 8876 43 223556999999999888 4555555555 488643 347799999999999999999988 6677777 Q ss_pred HHH---hCCCceEEEEEecCC-ccceeEEEEEEeee--ccCCcEEEEEE------ccEEEEEEeecCCCHHHHHHHHHHH Q lcl|Aclame:pro 78 YRQ---TDPFGELYVIAVPEA-TGAAATVTLTVTGE--ATESGTVNVYV------GRTRVQAPVTNGDNVTTIASSIQDA 145 (498) Q Consensus 78 ~~~---~n~~~~l~~i~l~d~-ag~aatg~ititgt--at~~G~l~l~I------~g~~v~v~V~~gdtaa~iA~~l~~a 145 (498) +.. .|.-+++|++.+.++ +++...+.++|+.. .+.++.+.+.+ +-++.++. -.++-..+++..|-.. T Consensus 75 a~~~~~~~g~~~~~~~rv~~a~~a~~~~~~~~~~a~~~g~~~n~i~v~~~~~~~~~~~~~~v~-~~~~~~~ev~~~~g~V 153 (562) T protein:vir:63 75 AWNPGEGTGAGDILAMRVEEAKEATFEAEGVKVSSTIYGADANDIQVALEDNTITGTKRLSIV-FAKERVNQVYDNLGSI 153 (562) T ss_pred hccccccCCceEEEEEEcCCCccceeEecceeEEEeecccCCCeEEEEEecCCCCCCcceEEE-ecCCCcchhhhhccce Confidence 774 566689999999765 44455566666543 35555555555 22344442 2333333333332100 Q ss_pred --H---------------hcCC--C---------ceEEE---eec------------cceEEEeeccCcccccceeEEEE Q lcl|Aclame:pro 146 --I---------------NAVP--T---------LPFTA---SSS------------AGVVTLTARHKGLCGNEIPVSLN 182 (498) Q Consensus 146 --I---------------n~~~--~---------lpVtA---~~~------------~~~VtlTAk~kG~~gN~i~l~~~ 182 (498) | +... . -+|.+ ... .....++||..|..||++.+... T Consensus 154 ~~i~y~g~~~~~~~~v~~~~~~~~a~~l~~~~g~~~v~~~~L~~g~~~~~~~l~~~in~~~~~~aky~~~~gn~i~~~~~ 233 (562) T protein:vir:63 154 FSIKYKGTEASATFTVAVDPVTFKATKLTLKAGDKTVKEYDLGSGAYAETNVLISDINNLPDFEAKFFPIGDKNLTTDNF 233 (562) T ss_pred eeeeeecccccceEEEEecCcceeEEEEEeecCCcceeEEEecCCccchhHHHHHhhccccceEEEeeccCCceeeeecc Confidence 0 0000 0 00000 000 01123678888888887765321 Q ss_pred ecccCc--------------------------------ccccccceeeeecccCCCcCcchhhhHHHhhccCcceEEEec Q lcl|Aclame:pro 183 YYGFGG--------------------------------GEVLPAGVQIAVATGTAGTGAPVLTGAVAAMADEPFDYIGLP 230 (498) Q Consensus 183 ~~~~~~--------------------------------ge~~p~Glt~tit~~agGag~pD~~~alaalg~~~~~~I~~p 230 (498) -...+. .+.+.......++..+.|+.+.+++++|++|....|++|+++ T Consensus 234 d~~~~~~vkt~~~~v~t~~~d~~~~~~~~~~v~~~~~~~~~la~~~~~~LtGG~dGt~~~~~~~al~ale~~~~~~i~~~ 313 (562) T protein:vir:63 234 DAQIDVDIKTKEAYVKAVGGDIEKQTAYNGYVDFEFDRSKEIANFPLTKLTGGDNGTIPESWADKFSYFANEGGYYLVPL 313 (562) T ss_pred ccccccchhhhhhhhhhhhhhhhhcccccceeeeeeccccceecccceeeecCCCCCchhhHHHHHHHHHhCCcEEEEec Confidence 000000 000000011223333445556689999999999999998877 Q ss_pred CCChHHHHHHHHHHhhhhhhhhhhhheeeEEEEeccCCHHHHHhhhhccCcceEEEEecCC--------CCCCcHHHHHH Q lcl|Aclame:pro 231 FNDTASVNTLVTEMNDTSGRWSYARQLYGHVYTAKTGTLSELVNAGDQFNQQHITLAGYEK--------ETQTPADELAA 302 (498) Q Consensus 231 ~tD~a~l~al~~~l~~~s~r~~~~~q~~g~~~~~~~gt~~~~~t~g~~~N~~~~t~~~~~~--------~~~~p~~~~AA 302 (498) -.|.+-+.++++|++.. .+..+++.++.......++.++.+.....|++++..++..+ ....|++++++ T Consensus 314 t~d~av~~~l~a~vkr~---~~~g~~~~aVlg~~~~~~~~~~~~~a~~~n~ervv~v~~~~~~~~~~~~~~~~~~~~~aa 390 (562) T protein:vir:63 314 TSKQAVHAEALQFVRDC---SYNGNPMRVFVGGGIGESMEQLFTRAIGLQNERAGLIGFSGTVKMDDGRSLKMPGYMFAA 390 (562) T ss_pred CCCHHHHHHHHHHHHHH---HhCCCcEEEEecCCCCCCHHHHHHHhhhcCCCcEEEEecCeeEECCCCceeeechhHHHH Confidence 67777778899999764 34556777888778889999999999999999998876432 23468888888 Q ss_pred HHHHHhhhhhccCccccccceEEeccccCCCccccChHHHHHHHhCCeeEEEEc-CCe---EEEEeeeeeeeecCCCCCC Q lcl|Aclame:pro 303 SRTARAAVFIRNDPARPTQTGELVGMLPAPKGKRFTMTEQQTLLSHGVATAYVE-SGV---LRIQRDVTTYRKNAYGVAD 378 (498) Q Consensus 303 a~~a~~a~~l~~DPArpl~tl~L~Gl~~p~~~~r~~~~er~~lL~~Gist~~v~-~G~---v~IeR~ITTY~~n~~G~~D 378 (498) .+|+..| ..||..++...+|.+ .+...||+.+|++.|+.+|+.+++.. ++. ++|+|.||||. ...| T Consensus 391 ~vAGl~A---~~~~~~SlT~~~i~~---~~v~~~~t~~e~~~li~~Gv~~l~~~~~~~v~~~~iv~~itT~t----~~~~ 460 (562) T protein:vir:63 391 QVAGLTC---GLEIGEAITFKNIAI---ETLDTIYEGSQLDQLNESGIITAEFVRNRAVTNFRIVDDVTTFN----DKTD 460 (562) T ss_pred HHHHHhh---cCchhcCccceeecc---ccccccCCHHHHHHHHhCCeEEEEEecCCcEEEEEeeccceecC----CCCC Confidence 8887777 678888888877753 35567999999999999999999874 332 36888999986 3468 Q ss_pred chhhhhhhHHHHHHHHHHHHHHHhhhcCCceeccCCCCcCCCcccccHHHHHHHHHHHHHHHhhcccccchhhhcCeEEE Q lcl|Aclame:pro 379 NSYLDSETLHTSAYVLRKLKSVITSKYGRHKLASDGTRFGPGQAIVTPAVIKGELLATYRQLERAGIVENYELFKQYLVV 458 (498) Q Consensus 379 ~s~ldi~t~~tl~yv~~~~r~~~~~~~~r~kla~dg~~~~~g~~ivTp~~ikaeli~~~~~le~~given~~~~~~~lvV 458 (498) +.|++|.++|++||+.+++|..+..+|+|+++.+++ ...+|+++.+++++|++.|+|+|++. +.+.| T Consensus 461 ~~~~ki~viRv~D~i~~dir~~~~~~yiGk~Nn~~~-----------r~~v~~~i~~~L~~l~~~gaI~~~~~--~dv~v 527 (562) T protein:vir:63 461 PVKSEIGVGEANDFLVSELKISLDNEYIGTKIIDTS-----------ASLVKNFVQSFLDRKKLAKEIQDYSP--EEVQV 527 (562) T ss_pred chhhhhhhhHHHHHHHHHHHHHHHhcCCccccChHH-----------HHHHHHHHHHHHHHHHhCCcccCCCc--cceEE Confidence 999999999999999999999999999999866553 26899999999999999999999975 56788 Q ss_pred EEcCCCCeEEEEEeeeEEecCeEEEeeeeeeEEEeccc Q lcl|Aclame:pro 459 ERDASVPNRLNTLFPPDYVNQLRVFAVVNQFRLQYSEE 496 (498) Q Consensus 459 erd~~d~nRvn~~~p~~~vn~l~v~A~~~~f~lq~~~~ 496 (498) +++. | ++.|.+....|+.|+.|=..+-++.|--++ T Consensus 528 ~~~~-d--~~~v~~~v~pv~~mekIy~ti~~~~~~~~~ 562 (562) T protein:vir:63 528 VIEG-D--VARISLTVFPIRSMKKIEVSLVYRQQILTA 562 (562) T ss_pred EecC-C--EEEEEEEEEEcccceEEEEEEEEeeeeecC Confidence 8764 3 466677789999999998999988888777 No 10 >protein:vir:80488 Length: 562 # NCBI annotation: Tsh # Family: family:all:2449 # MgeID: mge:1883 # MgeName: A511 # Cross-refs: genbank:acc:YP_001468473;genbank:gi:157325048;genbank:GeneID:5601446 Probab=100.00 E-value=1.7e-46 Score=271.49 Aligned_cols=448 Identities=15% Similarity=0.166 Sum_probs=311.5 Q ss_pred Cccc-hhhcCcccccCeEEEEEecCCC-CCCCCCccEEE-EEecCCCCccccceeEEecChHHHHHhhCcCcHHHHHHHH Q lcl|Aclame:pro 1 MTIS-FNTIPSNTLVPLFYAEMDNQAA-NTAQDSGASLL-IGHANNGAEIVANSLVLMPSADYARQICGAGSQLARMVEA 77 (498) Q Consensus 1 M~i~-f~~Ip~~~rvPg~y~E~dns~a-~~~~~~~~vLl-iGq~~~~g~~~~~~~~~v~s~~~A~~~fG~GS~l~~M~~a 77 (498) |.|+ |. -..+-.||+|+|+|.|.. +.+...-+++. ||.. ..+++++|++++|-+||+.+||-|. |..+++. T Consensus 1 ~~~~~~~--~~~~~~pgv~~~~~~s~~~~~~~~~~~~~~~ig~a---~~G~~~~~~~~~~~~~~~~~f~~g~-l~~~i~~ 74 (562) T protein:vir:80 1 MAIEIYP--RKPVSRPHTEISVDTSGIGGSSSGSEKILCLVGSA---TGGKPNAVYKVRNYSQAKSVFRSGE-LLDAIER 74 (562) T ss_pred CeeeeeC--CCcccCCceEEEEecCCcccCCCCCCceEEEEEEe---CCCCcceeEEEccHHHHHHHhcCCC-hHHHHHH Confidence 9876 33 223567999999999888 45566656555 8864 3446799999999999999999998 5566666 Q ss_pred HHH---hCCCceEEEEEecCC-ccceeEEEEEEeee--ccCCcEEEEEE------ccEEEEE------------------ Q lcl|Aclame:pro 78 YRQ---TDPFGELYVIAVPEA-TGAAATVTLTVTGE--ATESGTVNVYV------GRTRVQA------------------ 127 (498) Q Consensus 78 ~~~---~n~~~~l~~i~l~d~-ag~aatg~ititgt--at~~G~l~l~I------~g~~v~v------------------ 127 (498) +.. .|.-+++|++.+.++ +++...+.++|+.. .+.++.+.+.+ +-++.++ T Consensus 75 a~~~~~~~g~~~~~~~rv~~a~~a~~~~~~~~~~~~~~g~~~n~i~v~~~~~~~~~~~~~~v~~~~~~~~ev~~~~g~v~ 154 (562) T protein:vir:80 75 AWNPGEGTGAGDILAMRVEEAKEATFEAEGVKVSSTIYGADANDIQVALEDNTITGTKRLSIVFAKERVNQVYDNLGSIF 154 (562) T ss_pred hcccccccCceEEEEEEcCCCCcceEEecceEEEEeecccCCCceEEEEecCCCCCCcceEEEecCCcceEEeeccCcee Confidence 664 456689999999765 34444555555432 23333333322 1111111 Q ss_pred ---------------------------EeecCC-CH---------HHHHHHHHHHHhcCCCceEEEeeccceEEEeeccC Q lcl|Aclame:pro 128 ---------------------------PVTNGD-NV---------TTIASSIQDAINAVPTLPFTASSSAGVVTLTARHK 170 (498) Q Consensus 128 ---------------------------~V~~gd-ta---------a~iA~~l~~aIn~~~~lpVtA~~~~~~VtlTAk~k 170 (498) .+..|. +. ..-+..++.+||. ...+|||+. T Consensus 155 ~i~y~g~~~~a~~~i~~~~~~~~a~~l~~~~g~~~v~~~~l~~g~~~~~~~l~~~i~~-------------~~~~tAky~ 221 (562) T protein:vir:80 155 SIKYKGTEASATFTVAVDPVTFKATKLTLKAGDKTVKEYDLGSGAYAETNVLISDINN-------------LPDFEAKFF 221 (562) T ss_pred eeeeccccccceeEEEecCccceEEEEEEecCCcceeEEEeCCCccchhhhhhhhhcc-------------ccceEEEec Confidence 111110 00 0011122223332 123566677 Q ss_pred cccccceeEEEE-------------ecccCcc-------------------cccccceeeeecccCCCcCcchhhhHHHh Q lcl|Aclame:pro 171 GLCGNEIPVSLN-------------YYGFGGG-------------------EVLPAGVQIAVATGTAGTGAPVLTGAVAA 218 (498) Q Consensus 171 G~~gN~i~l~~~-------------~~~~~~g-------------------e~~p~Glt~tit~~agGag~pD~~~alaa 218 (498) |..||++.+..- |.....| ..+.......++..+.|..+.++.++|++ T Consensus 222 g~~~n~i~~~~~d~~~~~~~kt~~~~v~~~~~d~~~~~~~n~~v~~~~~~~~~la~~~~~~LtGG~dG~~~~~~~dal~~ 301 (562) T protein:vir:80 222 PIGDKNLTTDNFDAQIDVDIKTKEAYVKAVGGDIEKQTAYNGYVEFEFDRSKEIANFPLTKLTGGDNGTIPESWADKFSY 301 (562) T ss_pred ccCCceeeecccccchhhhcccceeeeeehhhhhhhcccccceEEEEeccCccccccceeeeeCCCCCCccccHHHHHHH Confidence 666666654211 0000000 01111112233333444556689999999 Q ss_pred hccCcceEEEecCCChHHHHHHHHHHhhhhhhhhhhhheeeEEEEeccCCHHHHHhhhhccCcceEEEEecCC------- Q lcl|Aclame:pro 219 MADEPFDYIGLPFNDTASVNTLVTEMNDTSGRWSYARQLYGHVYTAKTGTLSELVNAGDQFNQQHITLAGYEK------- 291 (498) Q Consensus 219 lg~~~~~~I~~p~tD~a~l~al~~~l~~~s~r~~~~~q~~g~~~~~~~gt~~~~~t~g~~~N~~~~t~~~~~~------- 291 (498) |..+.+++++++-.|.+-..++.+|++.. .+..+++.++.......++.++.+.....|++++..++..+ T Consensus 302 Le~~~~~~i~~~t~d~ai~~~~~a~vkr~---r~~g~~~~aVvg~~~~~~~~~~~~~a~~~n~e~vv~v~~~~~~~~~~~ 378 (562) T protein:vir:80 302 FANEGGYYLVPLTSKQAVHAEALQFVRDC---SYNGNPMRVFVGGGIGESMEQLFTRAIGLQNERAGLIGFSGTVKMDDG 378 (562) T ss_pred HHhCCcEEEEecCCChHHHHHHHHHHHHH---HhCCCeEEEEecCCCCCCHHHHHHHhhhcCCCeEEEEecCeeEECCCC Confidence 99999999888777777788899998753 24456777777777888999999999999999998875431 Q ss_pred -CCCCcHHHHHHHHHHHhhhhhccCccccccceEEeccccCCCccccChHHHHHHHhCCeeEEEEc-CCe---EEEEeee Q lcl|Aclame:pro 292 -ETQTPADELAASRTARAAVFIRNDPARPTQTGELVGMLPAPKGKRFTMTEQQTLLSHGVATAYVE-SGV---LRIQRDV 366 (498) Q Consensus 292 -~~~~p~~~~AAa~~a~~a~~l~~DPArpl~tl~L~Gl~~p~~~~r~~~~er~~lL~~Gist~~v~-~G~---v~IeR~I 366 (498) ....|++++++.+|+..| ..+|..++...+|.+. +...+|+.+|++.|+.+|+.+++.. ++. .+|++.| T Consensus 379 ~~~~~~~~~~aa~vAGl~A---g~~~~~S~T~~~i~~~---~v~~~lt~~e~~~li~~G~l~l~~~~~~~v~~~riv~~i 452 (562) T protein:vir:80 379 RSLKMPGYMFAAQVAGLTC---GLEIGEAITFKNIAIE---TLDTIYEGSQLDQLNESGIITAEFVRNRAVTNFRIVDDV 452 (562) T ss_pred ceeeechhHHHHHHHHHHh---cCccccCccceeeccc---cccccCCHHHHHHHHhCCeEEEEEecCCcEEEEEeeccc Confidence 234678888888888887 6788888888887754 4567999999999999999999874 332 3688899 Q ss_pred eeeeecCCCCCCchhhhhhhHHHHHHHHHHHHHHHhhhcCCceeccCCCCcCCCcccccHHHHHHHHHHHHHHHhhcccc Q lcl|Aclame:pro 367 TTYRKNAYGVADNSYLDSETLHTSAYVLRKLKSVITSKYGRHKLASDGTRFGPGQAIVTPAVIKGELLATYRQLERAGIV 446 (498) Q Consensus 367 TTY~~n~~G~~D~s~ldi~t~~tl~yv~~~~r~~~~~~~~r~kla~dg~~~~~g~~ivTp~~ikaeli~~~~~le~~giv 446 (498) |||..+ .|+.|.+|.++|++||+.+++|..+..+|.++++.+++ | +.+|.++.+++++|++.|+| T Consensus 453 tT~t~~----~~~~~~ki~viRv~D~i~~dir~~~~~~yIGk~Nn~~~-r----------~~v~~~i~~~L~~l~~~gaI 517 (562) T protein:vir:80 453 TTFNDK----TDPVKSEIGVGEANDFLVSELKISLDNEYIGTKIIDTS-A----------SLVKNFVQSFLDRKKLAKEI 517 (562) T ss_pred eeccCC----CCchhhhhhhhHHHHHHHHHHHHHHHhcCCccccChHH-H----------HHHHHHHHHHHHHHHhCCcc Confidence 998654 58999999999999999999999999999999866553 2 68999999999999999999 Q ss_pred cchhhhcCeEEEEEcCCCCeEEEEEeeeEEecCeEEEeeeeeeEEEeccc Q lcl|Aclame:pro 447 ENYELFKQYLVVERDASVPNRLNTLFPPDYVNQLRVFAVVNQFRLQYSEE 496 (498) Q Consensus 447 en~~~~~~~lvVerd~~d~nRvn~~~p~~~vn~l~v~A~~~~f~lq~~~~ 496 (498) +|++. +++.|+++. | ++-|.+....++.++-|=..+-++.|--++ T Consensus 518 ~~~~~--~dv~v~~~~-d--~~~v~~~v~Pv~~mekIy~ti~~~~~~~~~ 562 (562) T protein:vir:80 518 QDYSP--EEVQVVIEG-D--IARISLTVFPIRSMKKIEVSLVYRQQILTA 562 (562) T ss_pred cCCCc--cceEEEecC-C--EEEEEEEEEEcccceEEEEEEEEEeeeecC Confidence 99875 567888754 3 466667778888888888888888887777 No 11 >protein:vir:80779 Length: 569 # NCBI annotation: putative tail sheath protein # Family: family:all:2449 # MgeID: mge:1885 # MgeName: phiEF24C # Cross-refs: genbank:acc:YP_001504132;genbank:gi:158079319;genbank:GeneID:5666428 Probab=100.00 E-value=1.2e-44 Score=261.42 Aligned_cols=455 Identities=15% Similarity=0.182 Sum_probs=310.4 Q ss_pred CccchhhcCcc-cccCeEEEEEecCCC-CCCCCCccEE-EEEecCCCCccccceeEEecChHHHHHhhCcCcHHHHHHHH Q lcl|Aclame:pro 1 MTISFNTIPSN-TLVPLFYAEMDNQAA-NTAQDSGASL-LIGHANNGAEIVANSLVLMPSADYARQICGAGSQLARMVEA 77 (498) Q Consensus 1 M~i~f~~Ip~~-~rvPg~y~E~dns~a-~~~~~~~~vL-liGq~~~~g~~~~~~~~~v~s~~~A~~~fG~GS~l~~M~~a 77 (498) |.|++ +|.. +-.||+|+|+|.|.. +.+...-+++ +||... .+++++|+++++-+||+.+||-|. |..+++. T Consensus 1 ~~~~~--~~~~~~~~Pgv~~~~~~~~~~~~~~~~~~~~~~ig~a~---~G~~~~~~~~~~~~~~~~~f~~g~-l~~a~~~ 74 (569) T protein:vir:80 1 MAVEQ--FPRKKVSRPHTEITVDTSGIGGSSSSSDKTLMLVGSAK---GGKPDTVYRFRNYQQAKQVLRSGD-LLDAIEL 74 (569) T ss_pred Ceeee--ecCCccccCceEEEEecCCCcCCCCCCceeEEEEEEeC---CCCCceeEEecCHHHHHHHhcCCc-hhHHHHh Confidence 99988 7763 335999999998877 4555555544 588643 336799999999999999999988 6666666 Q ss_pred HHHh-----CCCceEEEEEecCC-ccceeEEEEEEeeec--cCCcEEEEEEc------cEEEE----------------- Q lcl|Aclame:pro 78 YRQT-----DPFGELYVIAVPEA-TGAAATVTLTVTGEA--TESGTVNVYVG------RTRVQ----------------- 126 (498) Q Consensus 78 ~~~~-----n~~~~l~~i~l~d~-ag~aatg~ititgta--t~~G~l~l~I~------g~~v~----------------- 126 (498) +... |.-+.++++.+.++ ++.+..+.+++++.. +.++.+.+.+- -++++ T Consensus 75 a~~~~~~~~~~~~~~~~~rv~~a~~a~~~~~~~~~~a~~~g~~~n~i~v~l~~~~~~~~~~~~v~~~~~~~~~~~~~ig~ 154 (569) T protein:vir:80 75 AWNASDVNTASAGDILAVRVEDAKNATLTKGGLTFASTIYGVDANEIQVALEDNNLTHTKRLTVAFSKDGYKKVFDNLGK 154 (569) T ss_pred hccCccccccCceEEEEEEcCCCeeeeeeccceeeeeeeccCCCceEEEEEecCcCCcceeeEEeeecCCCccccccccc Confidence 6532 33346888887553 223333555555543 23333333331 11111 Q ss_pred ----------------------------EEeecCC---------------CHHHHHHHHHHHHhcCCCceEEEeeccce- Q lcl|Aclame:pro 127 ----------------------------APVTNGD---------------NVTTIASSIQDAINAVPTLPFTASSSAGV- 162 (498) Q Consensus 127 ----------------------------v~V~~gd---------------taa~iA~~l~~aIn~~~~lpVtA~~~~~~- 162 (498) +....|+ .....+.++.++||...+.-++....++. T Consensus 155 v~si~ytg~~~~a~~~~~~~~~~~~a~~l~~~~g~~~~~~~~v~~~~~~~~~~~~~~~lv~~~~~~~~f~a~~~~~~~~~ 234 (569) T protein:vir:80 155 IFSIQYKGSEAQANFTIAQDSISKKATTLTLNVGSEPESTTEVMKYELGQGVYSETNVLVSAINSLPDWEAKFFPIGDKN 234 (569) T ss_pred eeeEEEeeccccceEEeecCcCcceeEEEEEEecCCcceeEEEEeeccCCccchhhhhhhhhcCCccCceEEEEecCCCc Confidence 1111111 11222335666666655544443221111 Q ss_pred EE-----------Ee------eccCccccccee----EEEEecccCcccccccceeeeecccCCCc---CcchhhhHHHh Q lcl|Aclame:pro 163 VT-----------LT------ARHKGLCGNEIP----VSLNYYGFGGGEVLPAGVQIAVATGTAGT---GAPVLTGAVAA 218 (498) Q Consensus 163 Vt-----------lT------Ak~kG~~gN~i~----l~~~~~~~~~ge~~p~Glt~tit~~agGa---g~pD~~~alaa 218 (498) +. .+ .-.+|+.-+.+. +.+... | .-....++-++|+||. .+.++.++|++ T Consensus 235 ~~~~~~d~~~~~~~~t~~~~~~~~~~di~~~~~~~~~v~~~~~----~--~~~l~~~~~~~LtGG~dG~~~~~~~~~l~~ 308 (569) T protein:vir:80 235 LPTDALEAVTKVDVKTEAVFVGALAGDIAKQLEYNDYVTVAVD----A--TKPVEDFELTNLTGGSDGTAPESWANKFPL 308 (569) T ss_pred ceehhccchhheeccccceeeehhHHHHHHhhcCCceEEEEec----C--CcceeeecceeecCCCCCCccchHHHHHHH Confidence 11 10 001111111110 111110 0 1112234445676764 45589999999 Q ss_pred hccCcceEEEecCCChHHHHHHHHHHhhhhhhhhhhhheeeEEEEeccCCHHHHHhhhhccCcceEEEEecCC------- Q lcl|Aclame:pro 219 MADEPFDYIGLPFNDTASVNTLVTEMNDTSGRWSYARQLYGHVYTAKTGTLSELVNAGDQFNQQHITLAGYEK------- 291 (498) Q Consensus 219 lg~~~~~~I~~p~tD~a~l~al~~~l~~~s~r~~~~~q~~g~~~~~~~gt~~~~~t~g~~~N~~~~t~~~~~~------- 291 (498) |..+.|++|+++..|.+-+.++.+|++.. .+..+++.++.......++.++.++....|++++.+++..+ T Consensus 309 le~~~~~~i~~~t~d~av~~~l~a~vkr~---r~~g~~~~aVvg~~~~~~~~~~~~~a~~~n~e~vv~v~~~~~~~~~~g 385 (569) T protein:vir:80 309 LANEGGYYLVPLTDKQAVHSEALAFVKDR---TDNGDPMRIIVGGGTNETVEESITRATNLRDPRASLVGFSGTRKMDDG 385 (569) T ss_pred HhhCCcEEEEecCCChHHHHHHHHHHHHH---HhCCCcEEEEecCCCCCCHHHHHHHHhhcCCCeEEEEecCceeecCCC Confidence 99999999998888888889999999753 23456777777777888999999999999999998876421 Q ss_pred -CCCCcHHHHHHHHHHHhhhhhccCccccccceEEeccccCCCccccChHHHHHHHhCCeeEEEEcC-Ce---EEEEeee Q lcl|Aclame:pro 292 -ETQTPADELAASRTARAAVFIRNDPARPTQTGELVGMLPAPKGKRFTMTEQQTLLSHGVATAYVES-GV---LRIQRDV 366 (498) Q Consensus 292 -~~~~p~~~~AAa~~a~~a~~l~~DPArpl~tl~L~Gl~~p~~~~r~~~~er~~lL~~Gist~~v~~-G~---v~IeR~I 366 (498) ....|++++++.+|+..| ..+|.+++...++.+ .....+|+.+|++.|+.+|+.+++..+ +. .++.+.| T Consensus 386 ~~~~~~~~~~aa~vAG~~A---~~~~~~S~T~k~i~~---~~i~~~lt~~e~~~li~~G~~~l~~~~~~~~~v~~~vn~i 459 (569) T protein:vir:80 386 RLLKLPGYMMASQIAGIAS---GLEVGEAITFKHFNV---TSVDRVFESSQLDMLNESGVISIEFVRNRTLTAFRVVQDV 459 (569) T ss_pred cceeechhhHHHHHHHHHh---cCccccCccceeecc---ccccccCCHHHHHHHHhCCeEEEEEecCceEEEEEEeccc Confidence 235677888888888777 677887777666652 356779999999999999999998753 33 3677888 Q ss_pred eeeeecCCCCCCchhhhhhhHHHHHHHHHHHHHHHhhhcCCceeccCCCCcCCCcccccHHHHHHHHHHHHHHHhhcccc Q lcl|Aclame:pro 367 TTYRKNAYGVADNSYLDSETLHTSAYVLRKLKSVITSKYGRHKLASDGTRFGPGQAIVTPAVIKGELLATYRQLERAGIV 446 (498) Q Consensus 367 TTY~~n~~G~~D~s~ldi~t~~tl~yv~~~~r~~~~~~~~r~kla~dg~~~~~g~~ivTp~~ikaeli~~~~~le~~giv 446 (498) |||.. ..|+.|++|.++|++||+.+++|..+..+|+|+++.+++ | +.+|.++.+++++|+++|+| T Consensus 460 tT~t~----~~~~~~~~i~viRv~D~i~~dir~~~~~~yiGk~nn~~~-r----------~~v~~~i~~~L~~l~~~gaI 524 (569) T protein:vir:80 460 TTYND----KSDPVKNEMSVGEANDFLVSELKIELDNNFIGTKVIDTS-A----------SLIKNFIQSFLDNKKRAREI 524 (569) T ss_pred eecCC----CCCchhhhhhhhHHHHHHHHHHHHHHHhhcCcccCChhH-H----------HHHHHHHHHHHHHHHhCCcc Confidence 98864 468999999999999999999999999999999866553 3 68999999999999999999 Q ss_pred cchhhhcCeEEEEEcCCCCeEEEEEeeeEEecCeEEEeeeeeeEEEeccc Q lcl|Aclame:pro 447 ENYELFKQYLVVERDASVPNRLNTLFPPDYVNQLRVFAVVNQFRLQYSEE 496 (498) Q Consensus 447 en~~~~~~~lvVerd~~d~nRvn~~~p~~~vn~l~v~A~~~~f~lq~~~~ 496 (498) ++++. +++.++++. .++.|.+-...++.++-|=.++-++.|--++ T Consensus 525 ~~~~~--~dv~v~~~~---d~~~v~~~v~Pv~~~ekI~~ti~~~~~~~~~ 569 (569) T protein:vir:80 525 QDYTP--EEVQVVLEG---DVASISMTVMPIRSLNKITVQLVYKQQILTA 569 (569) T ss_pred cCCCc--cceEEEecC---CEEEEEEEEEEcccccEEEEEEEEeeeeecC Confidence 99874 568888764 3577777778888888888888888877776 No 12 >protein:vir:102957 Length: 437 # NCBI annotation: sheath tail protein # Family: family:all:632 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945292;genbank:gi:39653727;uniprot:Q708M0;genbank:GeneID:2672871 Probab=100.00 E-value=2.2e-43 Score=254.47 Aligned_cols=421 Identities=13% Similarity=0.152 Sum_probs=261.4 Q ss_pred Cc-cchhhcCcccccCeEEEEEecCCCC-CC-CCCccEEEEEecCCCCccccceeEEecChHHHHHhhCcCcH--HHHHH Q lcl|Aclame:pro 1 MT-ISFNTIPSNTLVPLFYAEMDNQAAN-TA-QDSGASLLIGHANNGAEIVANSLVLMPSADYARQICGAGSQ--LARMV 75 (498) Q Consensus 1 M~-i~f~~Ip~~~rvPg~y~E~dns~a~-~~-~~~~~vLliGq~~~~g~~~~~~~~~v~s~~~A~~~fG~GS~--l~~M~ 75 (498) |+ =.|+. .+-..||+|+||.++... .. .....+.++|. ...++.++|+.|+|.++...+||.... ...++ T Consensus 1 m~gg~~~~--~~k~~PGvYi~~~~~~~~~i~~~~~~~~a~~~~---~~~Gp~~~~~~i~s~~d~~~~fG~~~~~~~~~~~ 75 (437) T protein:vir:10 1 MAGGIWKR--QNKVRPGAYINVKSKDIAMTRLGGDGVVTVPLA---LSFGQSKKLMKIRRGEDLFKKLGYEQESPQLLLL 75 (437) T ss_pred CCcceecc--cceecCceeEEEecCCcceeeccCCcEEEEEEE---ecCCCCceeEEEecHHHHHHHcCCccchhHHHHH Confidence 98 46653 455789999999776552 22 33444555553 245578999999999999999996432 22344 Q ss_pred HHHHHhCCCceEEEEEecCCccceeEEEEE--EeeeccCCcEEEEEEccEEEEEEeecCCCHHHHHHHHHHHHhcCCCce Q lcl|Aclame:pro 76 EAYRQTDPFGELYVIAVPEATGAAATVTLT--VTGEATESGTVNVYVGRTRVQAPVTNGDNVTTIASSIQDAINAVPTLP 153 (498) Q Consensus 76 ~a~~~~n~~~~l~~i~l~d~ag~aatg~it--itgtat~~G~l~l~I~g~~v~v~V~~gdtaa~iA~~l~~aIn~~~~lp 153 (498) +.++ +.-..++++.|.+ |++|+.++. ++-+|.-.|.. |..++|.|...- ......- T Consensus 76 ~~~~--~g~~~~~~~R~~~--g~~a~~tl~~~~~~~A~~~G~~-----gn~i~v~v~~~~-------------~d~~~~~ 133 (437) T protein:vir:10 76 NEAF--KRVSEVLLYRLNT--GEKANVSLSDNVTAQAKYSGVR-----GNDITVTVKTNV-------------DDPSSFD 133 (437) T ss_pred HHHh--cCCCEEEEEECCC--CceeeEeeccceEEEeccCCcc-----cceeEEEEeecc-------------CCccceE Confidence 4444 3345899999875 444443321 11122222221 222333333210 0001111 Q ss_pred EEEeeccc---eEEEeeccCcccccceeEEEEecccCccccccc-ceeeeecccCCCc-CcchhhhHHHhhccCcceEEE Q lcl|Aclame:pro 154 FTASSSAG---VVTLTARHKGLCGNEIPVSLNYYGFGGGEVLPA-GVQIAVATGTAGT-GAPVLTGAVAAMADEPFDYIG 228 (498) Q Consensus 154 VtA~~~~~---~VtlTAk~kG~~gN~i~l~~~~~~~~~ge~~p~-Glt~tit~~agGa-g~pD~~~alaalg~~~~~~I~ 228 (498) |.--..+. .-++ ....+...|+. +.. .++.++. .-...++..+.|+ .+-|+.++|+++....|++|+ T Consensus 134 v~~~~~~~~~d~~~v-~~~~~~~~n~~-v~~------~~~~~l~~~a~~~LtGG~dg~~t~~dy~~al~~le~~~~n~l~ 205 (437) T protein:vir:10 134 VVTFLDTVVMDLQTV-KVLADLKNNAL-VEF------SGTGELQPVAGAKLTGGTDGAISTQDYLEYFKALETVEFNYMA 205 (437) T ss_pred EEEecCcceeeeeeh-hhhhhhhhhcc-ccc------ccccccccccceeeeccccCCCChhHHHHHHHHhccCcceEEE Confidence 11100010 0011 11111122221 111 1222211 1112233332222 345899999999999999999 Q ss_pred ecCCChHHHHHHHHHHhhhhhhhhhhhheeeEEEEeccCCHHHHHhhhhccCcceEEEEec----CCCCCCcHHHHHHHH Q lcl|Aclame:pro 229 LPFNDTASVNTLVTEMNDTSGRWSYARQLYGHVYTAKTGTLSELVNAGDQFNQQHITLAGY----EKETQTPADELAASR 304 (498) Q Consensus 229 ~p~tD~a~l~al~~~l~~~s~r~~~~~q~~g~~~~~~~gt~~~~~t~g~~~N~~~~t~~~~----~~~~~~p~~~~AAa~ 304 (498) +|..|.+.++++.+|++.. |-. ......++.+.. .-|++++.-+.. .++...+++++++.+ T Consensus 206 ~~~~d~~~~t~~~~~ik~~--r~~--~g~~~~~V~~~~-----------~~d~e~Iin~~n~~~~~~~~~~~~~~~~a~v 270 (437) T protein:vir:10 206 LPVEDASIKKAAINFIKRM--RED--EGLGAQLVVADS-----------DADSEAVINVKNGVILSDKTVIDKTKATVWV 270 (437) T ss_pred ecCCChhHHHHHHHHHHHH--Hhc--cCceEEEEeCCC-----------CCCCceEEEeecceeecCcceechhhHHHHH Confidence 9999999999999998753 211 111122222211 125555543221 234446778888888 Q ss_pred HHHhhhhhccCccccccceEEeccccCCCccccChHHHHHHHhCCeeEEEEcCCeEEEEeeeeeeeecCCCCCCchhhhh Q lcl|Aclame:pro 305 TARAAVFIRNDPARPTQTGELVGMLPAPKGKRFTMTEQQTLLSHGVATAYVESGVLRIQRDVTTYRKNAYGVADNSYLDS 384 (498) Q Consensus 305 ~a~~a~~l~~DPArpl~tl~L~Gl~~p~~~~r~~~~er~~lL~~Gist~~v~~G~v~IeR~ITTY~~n~~G~~D~s~ldi 384 (498) |+..| ..++.+++....|.|+ .....||+.+|++.|+++|+..+..++|+|+|+|+|+||++.... .|++|.+| T Consensus 271 AG~~A---g~~~~~S~t~~~~~~~--~~v~~~~t~~e~~~~i~~G~~vl~~~~~~v~i~~gInTltt~~~~-~~~~~~ki 344 (437) T protein:vir:10 271 AAASA---NAGVEKSLTYEKYEDS--VDVVGRLSHTETEDALLKGQFVFTARRGRAVVEQDINSHVSFTIE-KNQDFRKN 344 (437) T ss_pred HHHhc---cCccccCccccccCCc--ccccccCCHHHHHHHHhCCcEEEEEeCCeEEEEEccccccccCCC-CCchhhhh Confidence 88888 4577777877777765 345669999999999999999998888999999999999877554 58899999 Q ss_pred hhHHHHHHHHHHHHHHHhhhcCCceeccCCCCcCCCcccccHHHHHHHHHHHHHHHhhcccccchhhhcCeEEEEEcCCC Q lcl|Aclame:pro 385 ETLHTSAYVLRKLKSVITSKYGRHKLASDGTRFGPGQAIVTPAVIKGELLATYRQLERAGIVENYELFKQYLVVERDASV 464 (498) Q Consensus 385 ~t~~tl~yv~~~~r~~~~~~~~r~kla~dg~~~~~g~~ivTp~~ikaeli~~~~~le~~given~~~~~~~lvVerd~~d 464 (498) +++|++||+.+++|..+..+|.++..++...| ..+++++.+++++|+.+|+|++++. +...+ ..+++ T Consensus 345 ~vir~~D~i~~di~~~~~~~yiGk~~N~~~~r----------~~~~~~i~~yl~~l~~~g~I~~~~~--~d~~v-~~~~~ 411 (437) T protein:vir:10 345 RILRTLDDIVNDTRYAFSEYFLGKVSNNEDGR----------QAFKANRIRYFKDLEARGAIEDFKV--EDIEV-LRGEL 411 (437) T ss_pred hHHHHHHHHHHHHHHHHHhccccccCCCHHHH----------HHHHHHHHHHHHHHHhCCCccCCCc--eeEEe-ecCCC Confidence 99999999999999999999998654444333 5799999999999999999999876 33333 23344 Q ss_pred CeEEEEEeeeEEecCeEEEeeeeeeE Q lcl|Aclame:pro 465 PNRLNTLFPPDYVNQLRVFAVVNQFR 490 (498) Q Consensus 465 ~nRvn~~~p~~~vn~l~v~A~~~~f~ 490 (498) ...+-+.+..+.++.+.-|=..+... T Consensus 412 ~~~v~v~~~v~~~dame~iy~ti~v~ 437 (437) T protein:vir:10 412 KESVVVNVKVKPVDSMEKLYMTVTVE 437 (437) T ss_pred CCEEEEEEEEEEeeeeeeEEEEEEec Confidence 55666776666666666654444433 No 13 >protein:vir:107310 Length: 581 # NCBI annotation: gp123 # Family: family:all:1196 # MgeID: mge:1550 # MgeName: Catera # Cross-refs: genbank:acc:YP_656131;genbank:gi:109393333;genbank:GeneID:4156791 Probab=100.00 E-value=1e-39 Score=234.40 Aligned_cols=423 Identities=14% Similarity=0.133 Sum_probs=262.4 Q ss_pred CccchhhcCcccccCeEEEE-EecCCCCCCCCCccEEEEEecCCCCccccceeE--EecChHHHHHhhCcCcHHHHHHHH Q lcl|Aclame:pro 1 MTISFNTIPSNTLVPLFYAE-MDNQAANTAQDSGASLLIGHANNGAEIVANSLV--LMPSADYARQICGAGSQLARMVEA 77 (498) Q Consensus 1 M~i~f~~Ip~~~rvPg~y~E-~dns~a~~~~~~~~vLliGq~~~~g~~~~~~~~--~v~s~~~A~~~fG~GS~l~~M~~a 77 (498) |.|.|...- .+|.|.+ ||+...|-.. .+|+ .++.- .+|+ ..+ .. T Consensus 1 ~~~~~~~~~----~~~~~~~~~~~~~~g~~~-------------------~~~~~~~i~g~-----~~g~--~g~---~~ 47 (581) T protein:vir:10 1 MAIDFSQYQ----TPGVYTEAVGAPQLGIRS-------------------SVPTAVAIFGT-----AVGY--QTY---RE 47 (581) T ss_pred Ceeeecccc----ccchhhhhccccccceee-------------------eeccccccccc-----cccc--ccc---cc Confidence 777765432 3444422 3332222111 0111 11100 0111 000 01 Q ss_pred HHHhCCCceEEEEEecCCccceeEEEEEEeeeccCCcEEEEEEccEEEEEEeecCCCHHHHHHHHHHHHhc--------- Q lcl|Aclame:pro 78 YRQTDPFGELYVIAVPEATGAAATVTLTVTGEATESGTVNVYVGRTRVQAPVTNGDNVTTIASSIQDAINA--------- 148 (498) Q Consensus 78 ~~~~n~~~~l~~i~l~d~ag~aatg~ititgtat~~G~l~l~I~g~~v~v~V~~gdtaa~iA~~l~~aIn~--------- 148 (498) .++.+| +....-...+|++++.+ .+|+.+|..+|+.- ..+.-+.|+++|..+|.+.=+- T Consensus 48 s~~~~p----------~~~~~~e~q~v~~~~~~-t~GtFtLsf~G~tT-~~I~~~asa~~v~~AL~~L~~i~~~~v~v~g 115 (581) T protein:vir:10 48 SIRINP----------DTGETITTQILALVGEP-TGGSFKLSLAGEPT-GNIPFNATQGQVQSALRALPNVEDDEVTVLG 115 (581) T ss_pred ccccCC----------CCCCccceEEEEEEecC-CCceEEEEeCceec-ccccccCCHHHHHHHHhccCCCCcceEEEEC Confidence 111222 11112333455555543 33566666666431 1233334555555555431000 Q ss_pred -----------------------------------------------------------C------------CCceEEEe Q lcl|Aclame:pro 149 -----------------------------------------------------------V------------PTLPFTAS 157 (498) Q Consensus 149 -----------------------------------------------------------~------------~~lpVtA~ 157 (498) . .++.++.. T Consensus 116 ~~g~~~~VtF~g~~~~l~~~~~~lt~g~~~~vtV~~~~~g~~~~~~~~s~~gi~~~~~~l~~~~~~~~~~~gsd~~~~~~ 195 (581) T protein:vir:10 116 DPGGPWTVTFTKAVAALTKDVTGLTGGDDPDLNIASEQTGVPAMNRALAKKGIKTDTIRVVNPNSGQVYVLGTDYVVTRV 195 (581) T ss_pred CCCceEEEEEcCCccceeeeeceecCCCceeEEEeccccCcccccccccccccccccccccccccCcceeccccceeeec Confidence 0 00011000 Q ss_pred ec---------cceEEEeeccCcc---cccceeEEEEecccCccccccc------------------ceeeeec------ Q lcl|Aclame:pro 158 SS---------AGVVTLTARHKGL---CGNEIPVSLNYYGFGGGEVLPA------------------GVQIAVA------ 201 (498) Q Consensus 158 ~~---------~~~VtlTAk~kG~---~gN~i~l~~~~~~~~~ge~~p~------------------Glt~tit------ 201 (498) -. .+..++.....|. .|+-|.++.+|.+...+|..-- |..-.++ T Consensus 196 ~~~~~~~~~~~~D~~t~~~~~~g~~~~~~~v~~~~~~~~d~~~~~~v~~~~~~~~~~~~~~~~~~~g~~~~~~t~~~~~~ 275 (581) T protein:vir:10 196 NAGEDGEANTRDDLYTIQRVVDGGHIDPGDIVQLSYRYTDPNYHEVIRFTDPDDIQDFYGPAFDEAGNVQSEITLCAQLA 275 (581) T ss_pred ccCccccccccccceeeeeeecccccccceEEEEEEEeecCCcceeEEeecCcchhhhhhhhhhccCccccchhhhheee Confidence 00 0001111112221 2333445555444211110000 1111111 Q ss_pred --------------ccCCCcCcchhhhHHHhhccCcceEEEecCCChHHH-HHHHHHHhhhhhhhhhhhheeeEEEEecc Q lcl|Aclame:pro 202 --------------TGTAGTGAPVLTGAVAAMADEPFDYIGLPFNDTASV-NTLVTEMNDTSGRWSYARQLYGHVYTAKT 266 (498) Q Consensus 202 --------------~~agGag~pD~~~alaalg~~~~~~I~~p~tD~a~l-~al~~~l~~~s~r~~~~~q~~g~~~~~~~ 266 (498) +..+...++|+.++|++|.++.|+.|++|+++++++ .++++|++..+..-.++|.+.|+...... T Consensus 276 ~tn~~~~~l~~gvd~~g~tvt~~dy~~Al~ale~~~~~~ivv~~t~~~~v~a~l~ahv~~~s~~~~~~ravigV~g~~~~ 355 (581) T protein:vir:10 276 ITNGASTILACAVDPEGDTVTMGDYQNALNKFRDEDEIAIIVAGTGAQPIQALVQQHVSAQSNNKYERRAILGMDGSVTP 355 (581) T ss_pred eecccceeEEeeccCCCCccchHHHHHHHHHHhcCCceEEEEeCCCCHHHHHHHHHHHHHHHhccCCcEEEEEecCCCCC Confidence 111123567999999999999999999999998875 66999999876655556666665555555 Q ss_pred CCHHHHHhhhhccCcceEEEEecC----------CCCCCcHHHHHHHHHHHhhhhhccCccccccceEEeccccCCCccc Q lcl|Aclame:pro 267 GTLSELVNAGDQFNQQHITLAGYE----------KETQTPADELAASRTARAAVFIRNDPARPTQTGELVGMLPAPKGKR 336 (498) Q Consensus 267 gt~~~~~t~g~~~N~~~~t~~~~~----------~~~~~p~~~~AAa~~a~~a~~l~~DPArpl~tl~L~Gl~~p~~~~r 336 (498) .+..+.++++...||+++.+++.. +....|++..||++|+..+ +.||++|+...+|+|+. ....+ T Consensus 356 ~~~~~~~~~a~~~n~~Rvvlv~p~~~~~~g~~~~~~v~lp~y~~AA~vAGl~a---~~~~~~slT~~~i~gi~--~l~~~ 430 (581) T protein:vir:10 356 VPSATRIANAQSIKDQRVALISPSSFVYYAPELNREVVLGGQFMAAAVAGKSV---SAIAAMPLTRKVIRGFS--GPAEV 430 (581) T ss_pred ccHHHHHHhhccCCCceEEEEecCceeecCcccCceeccchhhHHHHHHHHhh---ccccccCcccccccccc--ccccc Confidence 678888999999999999987631 2223688888888888887 78999999999999985 55779 Q ss_pred cChHHHHHHHhCCeeEEEEc-CCeEEEEeeeeeeeecCCCCCCchhhhhhhHHHHHHHHHHHHHHHh-hhcCCceeccCC Q lcl|Aclame:pro 337 FTMTEQQTLLSHGVATAYVE-SGVLRIQRDVTTYRKNAYGVADNSYLDSETLHTSAYVLRKLKSVIT-SKYGRHKLASDG 414 (498) Q Consensus 337 ~~~~er~~lL~~Gist~~v~-~G~v~IeR~ITTY~~n~~G~~D~s~ldi~t~~tl~yv~~~~r~~~~-~~~~r~kla~dg 414 (498) |+.+|++.|+++|+.+++.. ++.|+|+|.||||.+ |++|++|.++|++||+++.+|..+. .+|+|+|+.++ T Consensus 431 ~s~~e~e~ll~~Gv~~l~~~~~~~v~Iv~gItT~~s------~~~~~~i~~iR~~D~v~~~ir~~~~~~~fIG~~n~~~- 503 (581) T protein:vir:10 431 QRDGEKSRESSEGLMVIEKTPRNLVHVRHGVTTDPT------SLHTREWNIIGQQDVMVYRIRDYLDADGLIGMPIYDT- 503 (581) T ss_pred CCHHHHHHHHhCCeEEEEEecCCeEEEEeeeecCCC------CCcceeeeeehhhhHHHHHHHHHhhhhcCCCcccCHH- Confidence 99999999999999999974 667999999999964 5679999999999999999999995 78999987776 Q ss_pred CCcCCCcccccHHHHHHHHHHHHHHHhhcccccchhhhcCeEEEEEcCCCCeEEEEEeeeEEecCeEEEeeeeeeEEEec Q lcl|Aclame:pro 415 TRFGPGQAIVTPAVIKGELLATYRQLERAGIVENYELFKQYLVVERDASVPNRLNTLFPPDYVNQLRVFAVVNQFRLQYS 494 (498) Q Consensus 415 ~~~~~g~~ivTp~~ikaeli~~~~~le~~given~~~~~~~lvVerd~~d~nRvn~~~p~~~vn~l~v~A~~~~f~lq~~ 494 (498) +.+.||+++.+.+.+|++.|+|++++. +.+.+...++.++.+.|....+..|+.| .++.++. T Consensus 504 ----------~r~~ik~~i~~~L~~l~~~g~I~~~~~----~~~~~~~~~~d~v~V~i~v~Pv~~i~~I----~vti~~~ 565 (581) T protein:vir:10 504 ----------TIVQVKASAEAALVWLVDNNIIRGYRN----LKARQIERQPDVIEVRYEWRPAYPLNYI----VVRYSIA 565 (581) T ss_pred ----------HHHHHHHHHHHHHHHHHhcCcccCCcc----ceeeeeecCCCEEEEEEEEEecccceEE----EEEEEEe Confidence 448999999999999999999999853 3455666677889998888888888765 4455555 Q ss_pred ccCC Q lcl|Aclame:pro 495 EESA 498 (498) Q Consensus 495 ~~~~ 498 (498) -+.- T Consensus 566 p~~~ 569 (581) T protein:vir:10 566 PETG 569 (581) T ss_pred cCCC Confidence 5544 No 14 >protein:vir:7653 Length: 581 # NCBI annotation: gp124 # Family: family:all:1196 # MgeID: mge:148 # MgeName: Bxz1 # Cross-refs: genbank:acc:NP_818197;genbank:gi:29566631;genbank:GeneID:1259809 Probab=100.00 E-value=2.7e-39 Score=232.01 Aligned_cols=423 Identities=12% Similarity=0.093 Sum_probs=265.4 Q ss_pred CccchhhcCcccccCeEEEE-EecCCCCCC-CCCccEEEEEecCCCCccccceeEEecChHHHHHhhCcCcHHHHHHHHH Q lcl|Aclame:pro 1 MTISFNTIPSNTLVPLFYAE-MDNQAANTA-QDSGASLLIGHANNGAEIVANSLVLMPSADYARQICGAGSQLARMVEAY 78 (498) Q Consensus 1 M~i~f~~Ip~~~rvPg~y~E-~dns~a~~~-~~~~~vLliGq~~~~g~~~~~~~~~v~s~~~A~~~fG~GS~l~~M~~a~ 78 (498) |.|.|..-. .+|.|.| +|.-..|.. ..+- .+.++... +|... + T Consensus 1 ~~~~~~~~~----~~~~~t~~~~~~~~g~~~~~~~------------------~~~i~g~~-----~g~~g--~------ 45 (581) T protein:vir:76 1 MAIDFSQYQ----TPGVYTEAVGAPQLGIRSSVPT------------------AVAIFGTA-----VGYQT--Y------ 45 (581) T ss_pred Ccccccccc----cchhhhhhccccccCcceeeee------------------eeeecccc-----ccccc--c------ Confidence 887775433 5676666 444333211 1111 11222211 12100 0 Q ss_pred HHhCCCceEEEEEecCCccceeEEEEEEeeeccCCcEEEEEEccEEEEEEeecCCCHHHHHHHHHHHHhcCC-CceEE-- Q lcl|Aclame:pro 79 RQTDPFGELYVIAVPEATGAAATVTLTVTGEATESGTVNVYVGRTRVQAPVTNGDNVTTIASSIQDAINAVP-TLPFT-- 155 (498) Q Consensus 79 ~~~n~~~~l~~i~l~d~ag~aatg~ititgtat~~G~l~l~I~g~~v~v~V~~gdtaa~iA~~l~~aIn~~~-~lpVt-- 155 (498) .-.++..| +....-..-+|+++|.+ .+|+.+|..+|+.- ..+.-+.|+++|..+|.+.-+-.. +.-|+ T Consensus 46 -----~~s~r~~p--~~~~~~evq~v~~~~~~-t~G~ftLt~~g~tT-~~I~~~asa~~v~~AL~~L~~i~~~~v~vtg~ 116 (581) T protein:vir:76 46 -----RESIRINP--DTGETITTQILALVGEP-TGGSFKLSLAGEPT-GNIPFNATQGQVQSALRALPNVEDDEVTVLGD 116 (581) T ss_pred -----cceeeecC--CCCCCCceEEEEEeecC-CcceEEEEeCceec-cccccCCCHHHHHHHHhhccCCCCceEEEEcC Confidence 01223222 22234455678888876 47999999999642 234556688899888876432211 01111 Q ss_pred -----------------Ee-----e-ccceEEEeeccCcccccc------------------------------------ Q lcl|Aclame:pro 156 -----------------AS-----S-SAGVVTLTARHKGLCGNE------------------------------------ 176 (498) Q Consensus 156 -----------------A~-----~-~~~~VtlTAk~kG~~gN~------------------------------------ 176 (498) +. . .+..++++-..+|..+-+ T Consensus 117 ~~~~~~V~F~g~~~~~~~~~~~ltg~~~~~~~V~~~~~G~~~~~~~l~~~g~~~~~~~~~~s~~~~~~~l~~~~~~~~~~ 196 (581) T protein:vir:76 117 PGGPWTVTFTKAVAALTKDVTGLTGGDNPDLNIASEQTGVPAMNRALAKKGIKTDTIRVVNPNSGQVYVLGTDYVVTRVN 196 (581) T ss_pred CCceEEEEEcCCccceeEeeeeeecCCcceeEEEEEecCcCCcCceeeeccccccccceeecCCcceeeecccccceeec Confidence 00 0 001122222222211100 Q ss_pred ------------------------------eeEEEEecccCcc--------cccccceeeeecccCC------------- Q lcl|Aclame:pro 177 ------------------------------IPVSLNYYGFGGG--------EVLPAGVQIAVATGTA------------- 205 (498) Q Consensus 177 ------------------------------i~l~~~~~~~~~g--------e~~p~Glt~tit~~ag------------- 205 (498) |.+..+|.+...+ +.....+ .-.+.++| T Consensus 197 ~~~~~~~~~~~~~~t~~~~~~g~~~~~~~i~~~~~~~~D~~~~~~v~~~~~~~~~~~~-~~~~~~~g~~~~e~~~~~~~~ 275 (581) T protein:vir:76 197 AGEDGEANTRDDLYTIQRVVDGGHIDPGDIVQLSYRYTDPNYHEVIRFTDPDDIQDFY-GPAFDEAGNVQSEITLCAQLA 275 (581) T ss_pred cCcccceeeeeeeeeeEeecccccccceeEEEEEEEeecCCccceEEEecccccccce-eeehhhcCccccchhhhhhee Confidence 1111111110000 0000000 00111111 Q ss_pred ------------------CcCcchhhhHHHhhccCcceEEEecCCChHHH-HHHHHHHhhhhhhhhhhhheeeEEEEecc Q lcl|Aclame:pro 206 ------------------GTGAPVLTGAVAAMADEPFDYIGLPFNDTASV-NTLVTEMNDTSGRWSYARQLYGHVYTAKT 266 (498) Q Consensus 206 ------------------Gag~pD~~~alaalg~~~~~~I~~p~tD~a~l-~al~~~l~~~s~r~~~~~q~~g~~~~~~~ 266 (498) ...++|+.++|++|+++.|++|++|+++++.+ .++++|++..+..-.++|.+.|+...... T Consensus 276 ~t~~~~~~l~~gvd~~g~tvt~~dy~~aL~ale~~~~~~ivvp~t~~~~i~a~l~ahv~~~s~~~~~~ra~igv~g~~~~ 355 (581) T protein:vir:76 276 ITNGASTILACAVDPEGDTVTMGDYQNALNKFRDEDEIAIIVAGTGAQPIQALVQQHVSAQSNNKYERRAILGMDGSVTP 355 (581) T ss_pred eccccceEEEeeecCCCCccchHHHHHHHHHHhcCCeEEEEEecCCChHHHHHHHHHHHHHHhccCCceEEEEeeCCCCC Confidence 13556999999999999999999999998876 56999998776544455555555544455 Q ss_pred CCHHHHHhhhhccCcceEEEEecC----------CCCCCcHHHHHHHHHHHhhhhhccCccccccceEEeccccCCCccc Q lcl|Aclame:pro 267 GTLSELVNAGDQFNQQHITLAGYE----------KETQTPADELAASRTARAAVFIRNDPARPTQTGELVGMLPAPKGKR 336 (498) Q Consensus 267 gt~~~~~t~g~~~N~~~~t~~~~~----------~~~~~p~~~~AAa~~a~~a~~l~~DPArpl~tl~L~Gl~~p~~~~r 336 (498) .++.+.++....+||+|+.+++.. +....|++..||++|+.++ +.+|++|+..++|+|+. ....+ T Consensus 356 ~~~~~~~~~a~~~ns~Rvvlv~p~~~~~~g~~~~~~~~lp~~~~AA~vAG~~a---~~~~~~slT~~~i~g~~--~~~~~ 430 (581) T protein:vir:76 356 VPSATRIANAQSIKDQRVALISPSSFVYYAPELNREVVLGGQFMAAAVAGKSV---SAIAAMPLTRKVIRGFS--GPAEV 430 (581) T ss_pred chHHHHHHhhcccCCCcEEEEEcCceEeccccCCcceecchhhhhhhHHhhhh---ccccccCcccccccccc--ccccc Confidence 578888999999999999988631 2234577777777766665 78999999999999984 56779 Q ss_pred cChHHHHHHHhCCeeEEEE-cCCeEEEEeeeeeeeecCCCCCCchhhhhhhHHHHHHHHHHHHHHHh-hhcCCceeccCC Q lcl|Aclame:pro 337 FTMTEQQTLLSHGVATAYV-ESGVLRIQRDVTTYRKNAYGVADNSYLDSETLHTSAYVLRKLKSVIT-SKYGRHKLASDG 414 (498) Q Consensus 337 ~~~~er~~lL~~Gist~~v-~~G~v~IeR~ITTY~~n~~G~~D~s~ldi~t~~tl~yv~~~~r~~~~-~~~~r~kla~dg 414 (498) |+.+|++.|+++|+.+|++ .++.|+|+|.||||.+ |+.|++|+++|++||+++.+|.++. .+|+|+|+.++ T Consensus 431 ~s~~e~e~ll~~Gv~~l~~~~~~~v~Iv~gItT~~s------~~~~k~i~viR~~D~v~~~vr~~~~~~~fiG~~n~~~- 503 (581) T protein:vir:76 431 QRDGEKSRESSEGLMVIEKTPRNLVHVRHGVTTDPT------SLHTREWNIIGQQDVMVYRIRDYLDADGLIGMPIYDT- 503 (581) T ss_pred CCHHHHHHHHhCCeEEEEEecCCeEEEEEeeecCCC------CCccceeeehhhhHHHHHHHHHHHhhhcCCCcccChH- Confidence 9999999999999999997 4667999999999964 5779999999999999999999995 78999987776 Q ss_pred CCcCCCcccccHHHHHHHHHHHHHHHhhcccccchhhhcCeEEEEEcCCCCeEEEEEeeeEEecCeEEEeeeeeeEEEec Q lcl|Aclame:pro 415 TRFGPGQAIVTPAVIKGELLATYRQLERAGIVENYELFKQYLVVERDASVPNRLNTLFPPDYVNQLRVFAVVNQFRLQYS 494 (498) Q Consensus 415 ~~~~~g~~ivTp~~ikaeli~~~~~le~~given~~~~~~~lvVerd~~d~nRvn~~~p~~~vn~l~v~A~~~~f~lq~~ 494 (498) +.+.||+++.+.+++|++.|+|++++..+ ...++++ .|..+|++. ...+..|+- |.|+.++. T Consensus 504 ----------~r~~ik~~i~~~L~~l~~~g~I~g~~~~~-~~~~~~~-~d~v~V~i~--v~Pv~~ie~----I~vt~~~~ 565 (581) T protein:vir:76 504 ----------TIVQVKASAEAALVWLVDNNIIRGYRNLK-ARQIERQ-PDVIEVRYE--WRPAYPLNY----IVVRYSIA 565 (581) T ss_pred ----------HHHHHHHHHHHHHHHHHhcCcccCcccce-eeEEecC-CCEEEEEEE--EEecccceE----EEEEEEEe Confidence 45899999999999999999999998654 3455554 445555554 333444433 34444554 Q ss_pred ccCC Q lcl|Aclame:pro 495 EESA 498 (498) Q Consensus 495 ~~~~ 498 (498) -+.- T Consensus 566 p~~~ 569 (581) T protein:vir:76 566 PETG 569 (581) T ss_pred eCCC Confidence 4444 No 15 >protein:vir:105470 Length: 451 # NCBI annotation: putative phage sheath tail protein # Family: family:all:632 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529880;genbank:gi:90592620;genbank:GeneID:3974534 Probab=100.00 E-value=3.7e-36 Score=214.85 Aligned_cols=427 Identities=12% Similarity=0.101 Sum_probs=260.5 Q ss_pred Ccc-chhhcCcccccCeEEEEEecCCC--CCCCCCccEEEEEecCCCCccccceeEEecChHHHHHhhCcC--cHHHHHH Q lcl|Aclame:pro 1 MTI-SFNTIPSNTLVPLFYAEMDNQAA--NTAQDSGASLLIGHANNGAEIVANSLVLMPSADYARQICGAG--SQLARMV 75 (498) Q Consensus 1 M~i-~f~~Ip~~~rvPg~y~E~dns~a--~~~~~~~~vLliGq~~~~g~~~~~~~~~v~s~~~A~~~fG~G--S~l~~M~ 75 (498) |+= .|.. .+ =..||+|+||-++.. -......-++.||.....| ++.|+.+.|.++....||.. +....++ T Consensus 1 magg~~~~-~~-K~~PGvYi~~~~~~~~~~~~~~~~~~~~i~~~~~~g---~~~~v~i~~~~d~~~~fG~~~~~~~~~~~ 75 (451) T protein:vir:10 1 MAGGTWKA-QD-KRRPGTYINVVGNGQREAASSLGRVLLIRDKGLGWG---KNGVIEVEANSDFTKKLGTTLDDPSLTAL 75 (451) T ss_pred CCceeecc-ce-eecCceEEEEeccCcceeeccCCcEEEEEeeecCCC---CcccEEeecHHHHHHHcCCcccchhHHHH Confidence 883 4543 33 347999999988643 2344567788888765443 56789999999999999964 3444567 Q ss_pred HHHHHhCCCceEEEEEecCCccceeEEEEEEee---eccCCcEEEEEEccEEEEEEeecCCCHHHHHHHHHHHHhcCCCc Q lcl|Aclame:pro 76 EAYRQTDPFGELYVIAVPEATGAAATVTLTVTG---EATESGTVNVYVGRTRVQAPVTNGDNVTTIASSIQDAINAVPTL 152 (498) Q Consensus 76 ~a~~~~n~~~~l~~i~l~d~ag~aatg~ititg---tat~~G~l~l~I~g~~v~v~V~~gdtaa~iA~~l~~aIn~~~~l 152 (498) +.+++. -..++++.|.+ |++++.+++..+ +|.-.|+ -|-.+.|.|... +...... T Consensus 76 ~~~~~g--~~~v~~yrl~~--g~~a~~t~~~~~~~~~Aky~G~-----~Gn~i~v~v~~~-------------~~d~~~~ 133 (451) T protein:vir:10 76 KETLKG--ASKVLVLNPNE--GTAATLTKEGLPWTVTANYPGE-----KGNQITVSVEVS-------------PADQNAA 133 (451) T ss_pred HHHhcC--CcEEEEEEcCC--CceEEEEeecCceEEEEeeCCc-----CCceEEEEEecc-------------cCCcCce Confidence 777753 34688888864 455555443211 1111111 133455554331 1111222 Q ss_pred eEEEeeccceEEE-eec---cCcccccc-eeEEEEecccCcccccccceeeeec-ccCC---CcCcchhhhHHHhhccCc Q lcl|Aclame:pro 153 PFTASSSAGVVTL-TAR---HKGLCGNE-IPVSLNYYGFGGGEVLPAGVQIAVA-TGTA---GTGAPVLTGAVAAMADEP 223 (498) Q Consensus 153 pVtA~~~~~~Vtl-TAk---~kG~~gN~-i~l~~~~~~~~~ge~~p~Glt~tit-~~ag---Gag~pD~~~alaalg~~~ 223 (498) -|+--..+..+.. +.+ -.....|+ +++.... .+...+... ..++ ..+| +..+.|+.++|+++.... T Consensus 134 ~v~t~~g~~~vd~qtv~~~~~~el~~nd~V~a~~~~----~g~~~~~~~-~~l~~~~~gg~~~~~~~~~~~~l~~~e~~~ 208 (451) T protein:vir:10 134 TVSTIFGTKLVDEQSIKFNELDKFKGNDYITAKVVE----EGSSKPVAF-TNVSGTLTGGTTTESNKVESLLNDALENEE 208 (451) T ss_pred EEEEEECCeEEEEEEeeccchhhccCCceEEEEecc----cccccceee-eecccccccccccCCccchHHHHHHhccce Confidence 2221111111111 110 01111122 1111111 111111111 1111 1123 234568999999999999 Q ss_pred ceEEEecCCCh--HHHHHHHHHHhhhhhhhhhhhheeeEEEEeccCCHHHHHhhhhccCcceEEEEec----CCCCCCcH Q lcl|Aclame:pro 224 FDYIGLPFNDT--ASVNTLVTEMNDTSGRWSYARQLYGHVYTAKTGTLSELVNAGDQFNQQHITLAGY----EKETQTPA 297 (498) Q Consensus 224 ~~~I~~p~tD~--a~l~al~~~l~~~s~r~~~~~q~~g~~~~~~~gt~~~~~t~g~~~N~~~~t~~~~----~~~~~~p~ 297 (498) ||++++|..|. +-...+.+|++.. |-+.. ...+++.+... ....|++++..+.. .++...++ T Consensus 209 ~n~l~~~~~~~~~~i~~~~~a~ik~~--r~~~g--~~~~aVl~~~~--------~~~~d~egiinv~n~~~~~dg~~~~~ 276 (451) T protein:vir:10 209 YAVVTTAGFEPSSNMNKLVVEAVKRL--RENEG--RKVRGVIPTDA--------DTTYNYEGISTVVNGYTLSDGTNVDV 276 (451) T ss_pred eeEEEEccCCCchHHHHHHHHHHHHH--HHhcC--CeEEEEecCcc--------CCCCCCcceEEeecceEecCceeech Confidence 99999987653 3456778888642 22222 22233322110 01135555543321 23444577 Q ss_pred HHHHHHHHHHhhhhhccCccccccceEEeccccCCCccccChHHHHHHHhCCeeEEEEcC-CeEEEEeeeeeeeecCCCC Q lcl|Aclame:pro 298 DELAASRTARAAVFIRNDPARPTQTGELVGMLPAPKGKRFTMTEQQTLLSHGVATAYVES-GVLRIQRDVTTYRKNAYGV 376 (498) Q Consensus 298 ~~~AAa~~a~~a~~l~~DPArpl~tl~L~Gl~~p~~~~r~~~~er~~lL~~Gist~~v~~-G~v~IeR~ITTY~~n~~G~ 376 (498) +++++.+|+..|. -+..+.+....++|+ .....||+.+|++.++.+|+..+..++ +.|+|+|+|+|+++-.. . T Consensus 277 ~~~~~~vAG~~Ag---~~~~~S~T~~~~~~~--~~v~~~~t~~e~~~~i~~G~lvl~~~~g~~v~i~~~INTltt~~~-~ 350 (451) T protein:vir:10 277 KDATGYFAGISAS---ADVATSLTYFEVEDA--VSAYPKFDNEKTIKALDAGQIVFTTRPGQRVVIEQDINSLHKFTA-E 350 (451) T ss_pred hhhHHHHHHHHcc---cccccCccceecCCc--eeeeeeCCHHHHHHHHhCCeEEEEEEcCCeEEEEEccccceecCC-C Confidence 8888888877774 233344444555554 345679999999999999999998755 47999999999988754 4 Q ss_pred CCchhhhhhhHHHHHHHHHHHHHHHhhhcCCceeccCCCCcCCCcccccHHHHHHHHHHHHHHHhhcccccchhhhcCeE Q lcl|Aclame:pro 377 ADNSYLDSETLHTSAYVLRKLKSVITSKYGRHKLASDGTRFGPGQAIVTPAVIKGELLATYRQLERAGIVENYELFKQYL 456 (498) Q Consensus 377 ~D~s~ldi~t~~tl~yv~~~~r~~~~~~~~r~kla~dg~~~~~g~~ivTp~~ikaeli~~~~~le~~given~~~~~~~l 456 (498) .+..|.+|+++|++|++..++|..+..+|.++..++...| ..+++++.+++++|+.+|.|+|++. ..+ T Consensus 351 k~~~~~ki~vir~~D~i~~di~~~~~~~yiGk~~N~~~gr----------~~~~~~i~~yl~~l~~~g~i~~~~~--~d~ 418 (451) T protein:vir:10 351 KPQAFSKNRVIRTLDEIATNTENTFERTYLGNVGNNAAGR----------DLFKADRIAYLTSLQNRNMIQSFAN--TDI 418 (451) T ss_pred CCcchhhhhHHHHHHHHHHHHHHHhhhccceecCCCHHHH----------HHHHHHHHHHHHHHHhCCCccCCCc--cce Confidence 5778999999999999999999999999998654554444 5799999999999999999999874 455 Q ss_pred EEEEcCCCCeEEEEEeeeEEecCeEEEeeeeeeE Q lcl|Aclame:pro 457 VVERDASVPNRLNTLFPPDYVNQLRVFAVVNQFR 490 (498) Q Consensus 457 vVerd~~d~nRvn~~~p~~~vn~l~v~A~~~~f~ 490 (498) .|+... +...+-+.+..+.++-+.-|=..+.+| T Consensus 419 ~v~~~~-~~~~v~v~~~v~pvdame~iy~t~~v~ 451 (451) T protein:vir:10 419 TVEAGN-DMDSIVVNLAVTPVDAMEKLYMTMVVR 451 (451) T ss_pred EEeecC-CCCEEEEEEEEEEEeeeeeEEEEEEEc Confidence 565432 334566777778888888777777777 No 16 >protein:vir:78986 Length: 436 # NCBI annotation: putative sheath tail protein # Family: family:all:632 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110731;genbank:gi:134287348;genbank:GeneID:4955160 Probab=100.00 E-value=6.7e-34 Score=202.45 Aligned_cols=415 Identities=12% Similarity=0.134 Sum_probs=261.2 Q ss_pred Cccc---hhhcCcccccCeEEEEEecCCC-CC-CCCCccEEEEEecCCCCccccceeEEecCh---HHHHHhhCcCcHHH Q lcl|Aclame:pro 1 MTIS---FNTIPSNTLVPLFYAEMDNQAA-NT-AQDSGASLLIGHANNGAEIVANSLVLMPSA---DYARQICGAGSQLA 72 (498) Q Consensus 1 M~i~---f~~Ip~~~rvPg~y~E~dns~a-~~-~~~~~~vLliGq~~~~g~~~~~~~~~v~s~---~~A~~~fG~GS~l~ 72 (498) |+|. |..-. =..||+|++|-+... .. ...+-.+.|. . ...=+++++++.|.+. .+...+||..-... T Consensus 1 ~~magg~~~~~~--K~~PG~Y~n~~~~~~~~~~~~~rGi~a~p-~--~~~wGp~~~v~~i~~~~~~~~~~~~~G~~~~~~ 75 (436) T protein:vir:78 1 MALGGGTFVTQN--KVLPGSYINFVSATRATSSLSDRGIVAMP-L--ELDWGIDEEVFQVTSDDFEKYSTKYFGYDYTHE 75 (436) T ss_pred Ccccceeeccce--eecCceEEEEEecCcceeeccCCeEEEEE-E--EecCCCCceeEEeecccchHHHHHHhcCccchH Confidence 6653 64433 357999999975433 22 3333434443 3 3456688999999985 46778899853222 Q ss_pred --HHHHHHHHhCCCceEEEEEecCCccceeEEEEEEeeeccCCc-EEEEEEccE-----EEEEEeecCCCHHHHHHHHHH Q lcl|Aclame:pro 73 --RMVEAYRQTDPFGELYVIAVPEATGAAATVTLTVTGEATESG-TVNVYVGRT-----RVQAPVTNGDNVTTIASSIQD 144 (498) Q Consensus 73 --~M~~a~~~~n~~~~l~~i~l~d~ag~aatg~ititgtat~~G-~l~l~I~g~-----~v~v~V~~gdtaa~iA~~l~~ 144 (498) .+++.+++.. ..++++.|.+ |++|++++.-.-.+...| .+++.|-.. ...+..-.|.+.-+. ..++ T Consensus 76 ~~~~l~~~~~~~--~tv~~yrl~~--G~~a~~~v~~Aky~g~~gn~i~v~v~~~~~d~~~~dv~~~~g~~~~d~--~~~~ 149 (436) T protein:vir:78 76 KLKGLRDLFKNI--RLGYFYKLNK--GVKASCSIATARCSGIRGNDLKVIVTTNIDDNAKFDVVTLLDNKKVDT--QIAK 149 (436) T ss_pred HHHHHHHHhcCC--CEEEEEECCC--cceeeeeeeeeecCCCCCcEEEEEecccccccCceEEEEEecchhhhh--hhHH Confidence 2355455433 3478888864 677777765444444444 677777322 333333233322221 2223 Q ss_pred HHhcCCCceEEEeeccceEEEeeccCcccccceeEEEEecccCcccccccceeeeecccCCCc-----CcchhhhHHHhh Q lcl|Aclame:pro 145 AINAVPTLPFTASSSAGVVTLTARHKGLCGNEIPVSLNYYGFGGGEVLPAGVQIAVATGTAGT-----GAPVLTGAVAAM 219 (498) Q Consensus 145 aIn~~~~lpVtA~~~~~~VtlTAk~kG~~gN~i~l~~~~~~~~~ge~~p~Glt~tit~~agGa-----g~pD~~~alaal 219 (498) .|+...+ ..-|+++ ..|+..+ ++-++++||+ .+-|+.++|+++ T Consensus 150 ~~~~l~~--------n~~V~~~--~~g~la~----------------------~a~~~LtGG~dG~~~T~~dy~~al~~l 197 (436) T protein:vir:78 150 VITELQD--------NDYVTWK--KEATLEA----------------------TAGLTFTNGTNGEAVTGTEYQAFLDKI 197 (436) T ss_pred HHhhccC--------CceEEEE--ecccccc----------------------cceeeeeccccccccchHHHHHHHHHH Confidence 3332211 1122222 1222111 1112333332 245899999999 Q ss_pred ccCcceEEEecCCChHHHHHHHHHHhhhhhhhhhhhheeeEEEEeccCCHHHHHhhhhccCcceEEEEecCCCCCCcHHH Q lcl|Aclame:pro 220 ADEPFDYIGLPFNDTASVNTLVTEMNDTSGRWSYARQLYGHVYTAKTGTLSELVNAGDQFNQQHITLAGYEKETQTPADE 299 (498) Q Consensus 220 g~~~~~~I~~p~tD~a~l~al~~~l~~~s~r~~~~~q~~g~~~~~~~gt~~~~~t~g~~~N~~~~t~~~~~~~~~~p~~~ 299 (498) ....||+|++|..|++....+.+|++.. |-+..+++.++...+...++..++.++... .+....+++ T Consensus 198 e~~~fn~l~~~~~d~~~~~~~~a~ikr~--re~~g~~~~aV~~~~~~~d~EgIInv~n~v-----------~g~~~~~~~ 264 (436) T protein:vir:78 198 ESYSFNALGCLATTAEIKSLFVEFTKRM--RDKVGAKFQTVLYKKNDADYEGVVSVENKI-----------KDTGLLESS 264 (436) T ss_pred cccceeEEEecCCChHHHHHHHHHHHHH--HhhcCCeEEEEecCCCCCCCceEEEeeccc-----------CCceechhH Confidence 9999999999999999999999998753 222233444444333344544444443321 233345567 Q ss_pred HHHHHHHHhhhhhccCccccccceEEeccccCCCccccChHHHHHHHhCCeeEEEEcCCeEEEEeeeeeeeecCCCCCCc Q lcl|Aclame:pro 300 LAASRTARAAVFIRNDPARPTQTGELVGMLPAPKGKRFTMTEQQTLLSHGVATAYVESGVLRIQRDVTTYRKNAYGVADN 379 (498) Q Consensus 300 ~AAa~~a~~a~~l~~DPArpl~tl~L~Gl~~p~~~~r~~~~er~~lL~~Gist~~v~~G~v~IeR~ITTY~~n~~G~~D~ 379 (498) +++.+|+..| .-+..+++....++|+ .....||+.+|++.++.+|...++.+++.|+|+|+|+|+++-.. ..+. T Consensus 265 ~~a~vAG~~A---g~~~~~S~T~~~~~~~--~~v~~~~t~~e~~~ai~~G~lvl~~d~~~v~I~~~VNTltt~~~-~k~~ 338 (436) T protein:vir:78 265 LIYWTTGAIA---GCDINKSNTNKRYDGE--FDVDVNYTQIHLEEALKTGKFIFHKVGDEVHVLEDINTFVSFTD-EKND 338 (436) T ss_pred HHHHHHHHHh---cCccccCccceecCcc--ccccccCCHHHHHHHHhCCeEEEEEeCCeEEEEEccccceecCC-CCCc Confidence 7777777777 3445566666777665 35566999999999999999999988889999999999887754 4567 Q ss_pred hhhhhhhHHHHHHHHHHHHHHHhhhcCCceeccCCCCcCCCcccccHHHHHHHHHHHHHHHhhcccccchhhhcCeEEEE Q lcl|Aclame:pro 380 SYLDSETLHTSAYVLRKLKSVITSKYGRHKLASDGTRFGPGQAIVTPAVIKGELLATYRQLERAGIVENYELFKQYLVVE 459 (498) Q Consensus 380 s~ldi~t~~tl~yv~~~~r~~~~~~~~r~kla~dg~~~~~g~~ivTp~~ikaeli~~~~~le~~given~~~~~~~lvVe 459 (498) .|.+|+++|++|++.+++|..+..+|.++..++...| ..+++.+..++++|+.+|.|+||+. +.+.|+ T Consensus 339 ~~~kI~vir~~D~i~~di~~~~~~~yiGKv~N~~dgr----------~~l~~~i~~yl~~L~~~g~I~~f~~--~Dv~v~ 406 (436) T protein:vir:78 339 DFSSNQSVRVLDQIANDIATLFNTKYLGEVPNDKSGR----------ISFWNDVVKHHEQLQNMRAIEDFKA--DDVSVE 406 (436) T ss_pred chhhhhHHHHHHHHHHHHHHHhhhccccccCCCHHHH----------HHHHHHHHHHHHHHHhCCcccCCCC--cceEEe Confidence 8999999999999999999999999999654444333 5799999999999999999999985 355555 Q ss_pred EcCCCCeEEEEEeeeEEecCeEEEeeeeeeE Q lcl|Aclame:pro 460 RDASVPNRLNTLFPPDYVNQLRVFAVVNQFR 490 (498) Q Consensus 460 rd~~d~nRvn~~~p~~~vn~l~v~A~~~~f~ 490 (498) ...+ ...+-+.+..+.++-+.-|=..+..- T Consensus 407 ~~~~-~~~v~v~~~v~pvdamekiy~ti~v~ 436 (436) T protein:vir:78 407 PGSD-KKTVVVSDAVKVISAMSKLYMTVSVS 436 (436) T ss_pred ecCC-CCEEEEEEEEEEEEeeeeEEEEEEEC Confidence 4332 23344554444444444333222222 No 17 >protein:vir:102819 Length: 648 # NCBI annotation: tail sheath protein # Family: family:all:2449 # MgeID: mge:1610 # MgeName: YS40 # Cross-refs: genbank:acc:YP_874082;genbank:gi:118197689;genbank:GeneID:4496011 Probab=100.00 E-value=4.6e-31 Score=186.90 Aligned_cols=459 Identities=13% Similarity=0.130 Sum_probs=272.5 Q ss_pred Cccc--hhhcCcccccCeEEEEEecCCCC--CCCCCccEEEEEecCCCCccccceeEEecChHHHHHhhCcCcHHHHHHH Q lcl|Aclame:pro 1 MTIS--FNTIPSNTLVPLFYAEMDNQAAN--TAQDSGASLLIGHANNGAEIVANSLVLMPSADYARQICGAGSQLARMVE 76 (498) Q Consensus 1 M~i~--f~~Ip~~~rvPg~y~E~dns~a~--~~~~~~~vLliGq~~~~g~~~~~~~~~v~s~~~A~~~fG~GS~l~~M~~ 76 (498) |.|+ |+.=| +-.||+|+|.+.|... ......-..+||.. ..+++++|++|+|-+|+...|| |+.+..+++ T Consensus 1 ma~~~yf~~~~--~~~PGVyvee~~sg~~~i~gv~tsva~fvG~a---~~Gp~~~p~~v~s~~~~~~~fg-gg~l~~av~ 74 (648) T protein:vir:10 1 MAISVYFDGKL--IKQLGAYVKTDLSAVKQINGVGTGIVALLGLA---EGGETYKPYRLTSFAEAVSIFK-GGPLLEHIK 74 (648) T ss_pred CeeeeeeCCCC--ccCCceEEEEeccccccccCCCCceEEEEEee---CCCCCceeEEecCHHHHHHHhc-CccHHHHHH Confidence 7755 55443 4679999999988763 22334556788854 3557899999999999999999 566788888 Q ss_pred HHHHhCCCceEEEEEecCC-ccceeEEEEEEeee--ccCCcEEEEEEc--------cEEEEEEeec-C------------ Q lcl|Aclame:pro 77 AYRQTDPFGELYVIAVPEA-TGAAATVTLTVTGE--ATESGTVNVYVG--------RTRVQAPVTN-G------------ 132 (498) Q Consensus 77 a~~~~n~~~~l~~i~l~d~-ag~aatg~ititgt--at~~G~l~l~I~--------g~~v~v~V~~-g------------ 132 (498) .|+ .|.-..+|++.+.++ .+++..+.++++.. ...+..+++.+. +.++++...+ + T Consensus 75 ~~F-~nGg~~~~~vRv~~~~~a~~~~~~~~~~a~~~g~~gn~i~~~v~~~~~~~~~~~~l~v~~~~~~~~~d~~v~~i~~ 153 (648) T protein:vir:10 75 AAF-IGGAGEVVAVRIGNPTTASVSIPVAQNTSDTSPANLNFVSYEASTRSNQIYVSFDLDENFTSANEADDTIIFTIYQ 153 (648) T ss_pred HHH-hCCCcEEEEEEcCCCcccceecceeEEeecccCCCCCceEEEEEEcCCCcCceeEEEEEecCCCcccceeEEEecc Confidence 888 566789999999775 33444444554432 122222322221 1122111000 0 Q ss_pred ------------------CC---------------HHHHH----------HHHHHHHhcCC--C-----ceEE----Eee Q lcl|Aclame:pro 133 ------------------DN---------------VTTIA----------SSIQDAINAVP--T-----LPFT----ASS 158 (498) Q Consensus 133 ------------------dt---------------aa~iA----------~~l~~aIn~~~--~-----lpVt----A~~ 158 (498) ++ ...+. +.+...||... + .|++ ..+ T Consensus 154 ~~~~y~gt~~~~t~~v~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~s~~~~~d~ 233 (648) T protein:vir:10 154 KHPDFSVTRETFTFPRKFTTPTVLVKRGSTLFFVDRSIVNAALAAGPAFQTALINLLKEQLQPTDVVQIFDASDTNPVDI 233 (648) T ss_pred CCCcccccceeccccccccccccccccccceeecCccchhhhhccCccchhhhhhchhhhhhhhhhheeccccccccccc Confidence 00 00000 11222222221 0 0100 000 Q ss_pred cc---------------ceEEEeeccCcccccceeEEEEec-------------------------------------cc Q lcl|Aclame:pro 159 SA---------------GVVTLTARHKGLCGNEIPVSLNYY-------------------------------------GF 186 (498) Q Consensus 159 ~~---------------~~VtlTAk~kG~~gN~i~l~~~~~-------------------------------------~~ 186 (498) .. ....++..+.| +++-...|. .. T Consensus 234 ~~~~~~~~a~~~~~~~~~~~~~~~~~~g----d~~~~~~~~~~~~~~~tp~~~~~~~~~~~~~~~~~~~~~v~~~~~~~l 309 (648) T protein:vir:10 234 PLGLFVYEVLYGGLFGFTKSRLVKTSFG----TVDDLLSNPLLFNLSATPFFDGSDYQDYTSLSDPANWFAKDAYTINHL 309 (648) T ss_pred ccccccccccchhhhcCCcchhhhhhhc----cccccccccceecccccccccccceeeeeccccccceeeeeccchhhc Confidence 00 00001111111 222111110 11 Q ss_pred CcccccccceeeeecccCCCcCc---------------chhhhHHHhhccCcceEEEec------------CCChHHH-H Q lcl|Aclame:pro 187 GGGEVLPAGVQIAVATGTAGTGA---------------PVLTGAVAAMADEPFDYIGLP------------FNDTASV-N 238 (498) Q Consensus 187 ~~ge~~p~Glt~tit~~agGag~---------------pD~~~alaalg~~~~~~I~~p------------~tD~a~l-~ 238 (498) ...+++|..+...+|..+||+.- .|++++|+.+.++.-.+|+.. .+|.... . T Consensus 310 ~~~~~~p~~~~~~~t~L~GGtdG~~p~s~~~~~~~~~~~d~~d~l~~~~~~~~~~ivp~~~~~~~~~~~~~lt~~q~i~a 389 (648) T protein:vir:10 310 VDTTINPHILATRIFSLSGGTNGDDGTGYYQTAVSNYINIWSQGLATLEEEEVNFVIPAYKFTNVTQLNDRLTIFKGIAS 389 (648) T ss_pred ccccccCcccccccceecccccCCCcccccccccccchhhHHHHhhhccCCCceEEEeecccccccccccccCCccchHH Confidence 12245666777778889888742 368899999988888777752 3555444 3 Q ss_pred HHHHHHhhhh--hhhhhhhheeeEEEEeccCCHHH--HHhhhhccCcceEEEEec----------C-------C-CCCCc Q lcl|Aclame:pro 239 TLVTEMNDTS--GRWSYARQLYGHVYTAKTGTLSE--LVNAGDQFNQQHITLAGY----------E-------K-ETQTP 296 (498) Q Consensus 239 al~~~l~~~s--~r~~~~~q~~g~~~~~~~gt~~~--~~t~g~~~N~~~~t~~~~----------~-------~-~~~~p 296 (498) ...+|.++.| +|.+.++...|.+..+..++..+ ..-.-..+|+.+..+++. . + ....| T Consensus 390 ~a~shv~~~s~~~~~~~r~~~~~~vg~~~~es~~~se~~~~~~~~~~~~a~~~~~d~~~~~~~~~~~~~~~~~G~~~~~p 469 (648) T protein:vir:10 390 TFLSHVQTMSQVNRRKARVGVFGLPAPSPNESVTASEYLYNRNILNTISAMFGGTDRAQAVVFPFYSNVFNDEGKVELLG 469 (648) T ss_pred HHHHHHHHhhhccccccccCeEEEeCCCCchhHHHHHHHhhhhcccccceeeeecCCceEEeecccceeECCCCcEEecc Confidence 4446766554 33333333444444455555422 222222345544333221 1 1 11258 Q ss_pred HHHHHHHHHHHhhhhhccCccccccceEEeccccCCCccccChHHHHHHHhCCeeEEEEc-C--C--eEEEEeeeeeeee Q lcl|Aclame:pro 297 ADELAASRTARAAVFIRNDPARPTQTGELVGMLPAPKGKRFTMTEQQTLLSHGVATAYVE-S--G--VLRIQRDVTTYRK 371 (498) Q Consensus 297 ~~~~AAa~~a~~a~~l~~DPArpl~tl~L~Gl~~p~~~~r~~~~er~~lL~~Gist~~v~-~--G--~v~IeR~ITTY~~ 371 (498) ++..|+++|+..+ ...|..|+..-.|.+..-.. ..+|+.+|++.|+.+|+.+++.. . + .++|.+.||||.. T Consensus 470 ~~~~Aa~VAGl~a---~l~~~~s~T~k~i~~~~id~-~~~~t~~qld~L~~~Gv~~ie~~~~~~~~~~~rvv~gITT~~~ 545 (648) T protein:vir:10 470 GEFFASYVAGMHA---NREPQDSITFLPISGIGAEP-LYNWTYTQKDDLISNRVLFVEKVKTSFGGIVYRIHHNPTTWLG 545 (648) T ss_pred hhhHHHHHHhhhh---ccccccCcccceeecccccc-ccCCCHHHHHHHhcCCcEEEEEecCCcceeeEEEeccceeecC Confidence 8999999888887 56677777776665442111 35799999999999999999864 2 2 3779999999976 Q ss_pred cCCCCCCchhhhhhhHHHHHHHHHHHHHHHhhhcCCceeccCCCCcCCCcccccHHHHHHHHHHHHHHHhhcccccchhh Q lcl|Aclame:pro 372 NAYGVADNSYLDSETLHTSAYVLRKLKSVITSKYGRHKLASDGTRFGPGQAIVTPAVIKGELLATYRQLERAGIVENYEL 451 (498) Q Consensus 372 n~~G~~D~s~ldi~t~~tl~yv~~~~r~~~~~~~~r~kla~dg~~~~~g~~ivTp~~ikaeli~~~~~le~~given~~~ 451 (498) . .|+.|.+|.+.|+.||+.+.+|..+..+|.|+|..++ +-+.+|..+.+.+.+++..+-|+++.. T Consensus 546 ~----~~~~~~eisv~ri~D~l~~~vr~~l~~~fIG~~n~~~-----------~~~~ik~~i~~~L~~~~~~~~I~~y~~ 610 (648) T protein:vir:10 546 P----VTQGFQEFVLRRIDDFLQSYVYKNLQEQFIGRKSYGR-----------KTENDIKVYTEALLSNLVGKQIVAYKD 610 (648) T ss_pred C----CCcceeeeeeeehhhHHHHHHHHHHhhhcCcccccHH-----------HHHHHHHHHHHHHhhHhhcCcccCccc Confidence 5 4899999999999999999999999999999887765 337899999999888888777888865 Q ss_pred hcCeEEEEEcCCCCeEEEEEeeeEEecCeEEEeeeeeeEEE Q lcl|Aclame:pro 452 FKQYLVVERDASVPNRLNTLFPPDYVNQLRVFAVVNQFRLQ 492 (498) Q Consensus 452 ~~~~lvVerd~~d~nRvn~~~p~~~vn~l~v~A~~~~f~lq 492 (498) .+ +.+..+ +|.-||++.+.|..-=....+-.++-|-|+ T Consensus 611 ~~--v~~~~~-~~vv~V~~~v~Pv~~i~~I~vti~it~~~~ 648 (648) T protein:vir:10 611 VK--VTSNED-KTVYYVEFFYQPVTEIKFILVTMKVTFDLE 648 (648) T ss_pred ce--EEEEec-CCEEEEEEEEEecceeeEEEEEEEEEeccC Confidence 44 333333 366666666554433333344455555665 No 18 >protein:vir:102359 Length: 356 # NCBI annotation: XkdK protein # Family: family:all:632 # MgeID: mge:1566 # MgeName: phi CD119 # Cross-refs: genbank:acc:YP_529565;genbank:gi:90592650;genbank:GeneID:3974491 Probab=99.89 E-value=2.8e-24 Score=149.70 Aligned_cols=307 Identities=10% Similarity=0.100 Sum_probs=199.4 Q ss_pred CCCceEEEeeccceEEEeeccCcccccceeEEEEecccCcc-------cccccce--------eeee----------ccc Q lcl|Aclame:pro 149 VPTLPFTASSSAGVVTLTARHKGLCGNEIPVSLNYYGFGGG-------EVLPAGV--------QIAV----------ATG 203 (498) Q Consensus 149 ~~~lpVtA~~~~~~VtlTAk~kG~~gN~i~l~~~~~~~~~g-------e~~p~Gl--------t~ti----------t~~ 203 (498) ..+||-. -..-...+.||+++|+.|.-+-+ .. +...+ +..|.-. ..++ .+. T Consensus 1 ~~glp~i-~i~f~~~a~ta~~~g~rGiv~~i--l~-d~~~~~~~~~~~~~v~~~~~~~n~~~i~~~~~g~~~~~~~~~p~ 76 (356) T protein:vir:10 1 MAGLVNI-NIEFKELATSFIQRSKAGIVAII--LK-DTTKMYKELTSEDDIPISLSADNKKYIKYGFVGATDNEKVLRPS 76 (356) T ss_pred CCCCCce-eEEEeecceeeccCCccceEEEE--Ee-cCCcceeEEeccccchhHHHHHHHHHHHHHhhccccccccccce Confidence 5666632 22334467778888888754422 11 11111 1111111 0111 011 Q ss_pred C----CCcCcchhhhHHHhhccCcceEEEecCCChHHHHHHHHHHhhhhhhhhhhhheeeEEEEeccCCHHHHHhhhhcc Q lcl|Aclame:pro 204 T----AGTGAPVLTGAVAAMADEPFDYIGLPFNDTASVNTLVTEMNDTSGRWSYARQLYGHVYTAKTGTLSELVNAGDQF 279 (498) Q Consensus 204 a----gGag~pD~~~alaalg~~~~~~I~~p~tD~a~l~al~~~l~~~s~r~~~~~q~~g~~~~~~~gt~~~~~t~g~~~ 279 (498) . ++..+.|+.++|+++....||++++|..|++..+.+.+|++.. | +........+...+..++..++.++... T Consensus 77 ~~~~~~~~t~~~y~~aL~~le~~~fn~l~~~~~d~~~~~~~~a~ikr~--r-~~~~~~~~~V~~~~~aD~EgIInv~n~~ 153 (356) T protein:vir:10 77 KVIISTFTEDGKVEDILEELESVEFNYLCMPEAIEAEKTKIVTWIKKI--R-EEESTEAKAVLANIKADNEAIINFTENV 153 (356) T ss_pred eeeeecccCchhHHHHHHHhcCccceEEEecCCChHHHHHHHHHHHHH--H-hcCCcEEEEEecCCCCCCceeEEeecCe Confidence 1 1234789999999999999999999999999999999998742 2 2222333344444455555555553211 Q ss_pred CcceEEEEecCCCCCCcHHHHHHHHHHHhhhhhccCccccccceEEeccccCCCccccChHHHHHHHhCCeeEEEEcCCe Q lcl|Aclame:pro 280 NQQHITLAGYEKETQTPADELAASRTARAAVFIRNDPARPTQTGELVGMLPAPKGKRFTMTEQQTLLSHGVATAYVESGV 359 (498) Q Consensus 280 N~~~~t~~~~~~~~~~p~~~~AAa~~a~~a~~l~~DPArpl~tl~L~Gl~~p~~~~r~~~~er~~lL~~Gist~~v~~G~ 359 (498) + . .+....++++++.+|+..|. -.-.+.+...+++++. .-.||+.+|.+.++.+|...|..+++. T Consensus 154 ----~-~----~g~~~t~~~~~~~vAG~~Ag---~~~n~S~T~~~~~~~~---~~~~~t~~e~~~ai~~G~lvl~~d~~~ 218 (356) T protein:vir:10 154 ----V-V----DGEEITAEKYTTRVASLIAS---TPNTQSITYAPLDEVE---SIVKIDKASADAKVQAGELILRRLSGK 218 (356) T ss_pred ----E-e----cceeechhHHHHHHHHHHhc---cchhccccceecCCcc---ccccCCHHHHHHHHhCCeEEEEEEcCe Confidence 1 1 22233456777777777763 2223445666677653 345899999999999999999988899 Q ss_pred EEEEeeeeeeeecCCCCCCchhhhhhhHHHHHHHHHHHHHHHhhhcCCceeccCCCCcCCCcccccHHHHHHHHHHHHHH Q lcl|Aclame:pro 360 LRIQRDVTTYRKNAYGVADNSYLDSETLHTSAYVLRKLKSVITSKYGRHKLASDGTRFGPGQAIVTPAVIKGELLATYRQ 439 (498) Q Consensus 360 v~IeR~ITTY~~n~~G~~D~s~ldi~t~~tl~yv~~~~r~~~~~~~~r~kla~dg~~~~~g~~ivTp~~ikaeli~~~~~ 439 (498) |+|+|+|+|+++-. ...+..|.+|+++|++|.+..++|..+...|.++.-++...| ..+++.+..++++ T Consensus 219 V~I~~~VNSltt~t-~~k~~~f~Kirvvr~~D~i~~Di~~~f~~~yiGKv~N~~dgr----------~~l~~ai~~y~~~ 287 (356) T protein:vir:10 219 IRIARGINSLTTLT-AEKGEIFQKIKLVDTKDLISKDIKNIYVEKYLRKCPNTYDNK----------CLFIVAVQSYLTE 287 (356) T ss_pred EEEEecCccceecC-CCCCcchhhhHHHHHHHHHHHHHHHHHhhccccccCCCHHHH----------HHHHHHHHHHHHH Confidence 99999999987764 456788999999999999999999999999998765554444 5799999999999 Q ss_pred Hhhcccccch-------hhh-------------cCeEEEEEcCCCCeEEEEEeeeEEecCeEEEeeeeee Q lcl|Aclame:pro 440 LERAGIVENY-------ELF-------------KQYLVVERDASVPNRLNTLFPPDYVNQLRVFAVVNQF 489 (498) Q Consensus 440 le~~given~-------~~~-------------~~~lvVerd~~d~nRvn~~~p~~~vn~l~v~A~~~~f 489 (498) |+..|++++. |.. .+...|.+...+ ..|-+.+..+.++-+.-|=..+.. T Consensus 288 L~~~~~I~~~~~~eid~e~q~~~~~~~g~d~~~~~d~~v~~~~~~-~~v~~~~~v~~vdamE~iy~ti~v 356 (356) T protein:vir:10 288 LAKQELIDSNFTVEIDLEKQKEYLEGKKIAVSKMKENEIKEANTG-SNGFYLINLKLVDAMEDINIRVQM 356 (356) T ss_pred HHhCCccccCceeEecccchHHHhhhccccccccccceeecccCC-cEEEEEEEEEEEeeeeeEEeEEeC Confidence 9999999853 111 112222222222 223366666666666655555555 No 19 >protein:vir:95263 Length: 450 # NCBI annotation: Phage conserved protein # Family: family:all:396 # MgeID: mge:1561 # MgeName: Felix 01 # Cross-refs: genbank:acc:NP_944896;genbank:gi:38707836;genbank:GeneID:2744049 Probab=99.81 E-value=3.8e-19 Score=121.55 Aligned_cols=423 Identities=14% Similarity=0.141 Sum_probs=248.4 Q ss_pred cccCeEEEEEecCCC-CCCCCCccEEEEEecCCCCccccceeEEecChHHHHHhhCcCcHHHHHHHHHHHhCCCceEEEE Q lcl|Aclame:pro 12 TLVPLFYAEMDNQAA-NTAQDSGASLLIGHANNGAEIVANSLVLMPSADYARQICGAGSQLARMVEAYRQTDPFGELYVI 90 (498) Q Consensus 12 ~rvPg~y~E~dns~a-~~~~~~~~vLliGq~~~~g~~~~~~~~~v~s~~~A~~~fG~GS~l~~M~~a~~~~n~~~~l~~i 90 (498) .--+-+-+.++-..+ ..+..-.-.||+|- ...+.+......|.+++.+.||.+|..+.|+++|++..|......| T Consensus 1 ~~s~iVnV~i~~~~~a~~~~~f~~~l~~~~----~~~~~~r~~~yss~~~V~~~FG~~S~ey~aA~~yF~q~p~p~~l~i 76 (450) T protein:vir:95 1 MWNPIVNVDITLNTAGTTREGFGLPLFLAS----TDNFEERVRGYTSLTEVAEDFDENTAAYKAAKQLWSQTPKVTQLYI 76 (450) T ss_pred CCCceEEEeecccccccccccceeEEEEcC----CCCCccceeeecCHHHHHHhcCCCcHHHHHHHHHHhCCCcccEEEE Confidence 112223233322222 33333344556653 2333444444457788899999999999999999999887643333 Q ss_pred EecCCccceeEEEEEEeeeccCCcEEEEEEccEE---EEEEeecCCCHHHHHHHHHHHHhcCCCc----eEEEeeccceE Q lcl|Aclame:pro 91 AVPEATGAAATVTLTVTGEATESGTVNVYVGRTR---VQAPVTNGDNVTTIASSIQDAINAVPTL----PFTASSSAGVV 163 (498) Q Consensus 91 ~l~d~ag~aatg~ititgtat~~G~l~l~I~g~~---v~v~V~~gdtaa~iA~~l~~aIn~~~~l----pVtA~~~~~~V 163 (498) .=- ...++..+...+++..+|++++.|+|.. ..+.....++.+.+|+.+..+|++.+.. .++....+..+ T Consensus 77 gr~---~~~~t~~~~~~~~~~~~g~lt~tv~G~~~~~~~i~~s~a~s~~~va~~~~tai~~~~~~~~~~~~~s~g~~~~~ 153 (450) T protein:vir:95 77 GRR---AMQYTVSIPDAVTESTDYSITVAAGGGISQPYQYTAQSSDTAENVLQQFKTQIEADPTIKDKVSVNVTGSNGSA 153 (450) T ss_pred Eee---ccchhhhhhhhhccccceeEEEEecceeeeeeEEEEEecCChhhHHHHhhhhhcccceeeeeeeeeeeccccee Confidence 311 1223334445567788999999999964 5677888999999999999999876432 12333333455 Q ss_pred EEeeccCcccccceeEEEEecccCcccccccceeeeecccCCCcCcchhhhHHHhhc---cCcceEEEecCCChHHHHHH Q lcl|Aclame:pro 164 TLTARHKGLCGNEIPVSLNYYGFGGGEVLPAGVQIAVATGTAGTGAPVLTGAVAAMA---DEPFDYIGLPFNDTASVNTL 240 (498) Q Consensus 164 tlTAk~kG~~gN~i~l~~~~~~~~~ge~~p~Glt~tit~~agGag~pD~~~alaalg---~~~~~~I~~p~tD~a~l~al 240 (498) +++-..+|... .. .++.+....+. +|.....+.++++++- ..|| +++++..|.+...++ T Consensus 154 t~~~~~~~~~~-~~-------------~l~~~~~~~~~---~g~~aet~~~a~~a~~~~~~~w~-~~~~~~~~~~~i~a~ 215 (450) T protein:vir:95 154 TMIIAKAGDND-FV-------------KVTTTAQTVYI---ASTTADTASTALAAIEAYSTDWY-FIAAEDRTQQFVLAM 215 (450) T ss_pred eeeeeccccch-hh-------------ccccccceeEe---cccccccHHHHHHHHHHhhCCeE-EEEecCCCHHHHHHH Confidence 55555444321 00 11111211111 2222334666666654 4566 455777777888888 Q ss_pred HHHHhhhhhhhhhhhheeeEEEEeccC---C-HHHHHhhhhcc---CcceEEEEecCCCCCCcHHHHHHHHHHHhhhhhc Q lcl|Aclame:pro 241 VTEMNDTSGRWSYARQLYGHVYTAKTG---T-LSELVNAGDQF---NQQHITLAGYEKETQTPADELAASRTARAAVFIR 313 (498) Q Consensus 241 ~~~l~~~s~r~~~~~q~~g~~~~~~~g---t-~~~~~t~g~~~---N~~~~t~~~~~~~~~~p~~~~AAa~~a~~a~~l~ 313 (498) ..|.+..+ ++++........ + .+....++..+ |..|..++... ..+.+ -..+++++.+. . T Consensus 216 a~w~~a~~-------~~f~~~~~~~~~~~~~~~~~~~~i~~~l~~~~~~~t~~~y~~-~~~~~--~~~aa~~g~~~---~ 282 (450) T protein:vir:95 216 ASEIQARK-------KIFFTANSDVTALQGTELASANDVPAQLAKNMYTRTVCLWHH-AAAED--YPEMAYIAYGA---P 282 (450) T ss_pred HHHHhhcC-------cEEEEEcCCchhhhhhhhhcccchHHHHHhccCCeeEEEeeC-CCchh--HHHHHHHHHhh---h Confidence 88887532 344433222111 1 11222344433 44566666542 22221 22344444443 4 Q ss_pred cCccc-cccceEEecccc---CCCccccChHHHHHHHhCCeeEEEEcCCeEEEEeeeeeeeecCCCCCCchhhhhhhHHH Q lcl|Aclame:pro 314 NDPAR-PTQTGELVGMLP---APKGKRFTMTEQQTLLSHGVATAYVESGVLRIQRDVTTYRKNAYGVADNSYLDSETLHT 389 (498) Q Consensus 314 ~DPAr-pl~tl~L~Gl~~---p~~~~r~~~~er~~lL~~Gist~~v~~G~v~IeR~ITTY~~n~~G~~D~s~ldi~t~~t 389 (498) .+|.| .+.--.|+||.| |....-++.+|.+.|-.+|+..+...+|+-.+.+.+|+ .| .|.| .++- T Consensus 283 ~~~g~~T~~fk~l~Gv~~~v~~~~~~~lt~~~~~al~~~~~n~y~~~~~~~~~~~G~~~-----~G----~~iD--~~~~ 351 (450) T protein:vir:95 283 YDAGSIAWGNAQLTGVAASLQPSNQRPLTSIQKSALDVRHCNFIDLDGGVPVVRRGITS-----GG----EWID--IIRG 351 (450) T ss_pred cccceeeeccccccceeeeccCccccccchHHHHHHHhCCcEEEEEecCceeeeCCeee-----Cc----chhH--HHHH Confidence 55654 222234566653 11122478899999999999977766787788888876 23 4877 7889 Q ss_pred HHHHHHHHHHHHhhhcCCc---eeccCCCCcCCCcccccHHHHHHHHHHHHHHHhhcccccchhhhcCeEE--EEEcCCC Q lcl|Aclame:pro 390 SAYVLRKLKSVITSKYGRH---KLASDGTRFGPGQAIVTPAVIKGELLATYRQLERAGIVENYELFKQYLV--VERDASV 464 (498) Q Consensus 390 l~yv~~~~r~~~~~~~~r~---kla~dg~~~~~g~~ivTp~~ikaeli~~~~~le~~given~~~~~~~lv--Verd~~d 464 (498) ++|+...++..+...+.+. |+--+.. -..+|++.+.+.+++....|+|-.+..+....- -..|-.+ T Consensus 352 ~~wl~~~iq~~l~~ll~~~~~~KiPy~~~---------G~~~i~a~i~~~l~~a~~~G~Ia~~~V~~~~~~~~~~~dr~~ 422 (450) T protein:vir:95 352 VDWLESDLKTSLRDLLINQKGGKITYDDT---------GITRIRQVIETSLQRAVNRNFLSSYTVNVPKASQVALADKKA 422 (450) T ss_pred HHHHHHHHHHHHHHHHHhcCCCCCccChh---------hHHHHHHHHHHHHHHHHhcCcccceeEecCChHhcCHHHHhc Confidence 9999999999998776432 3322211 237899999999999999999976543322210 0111122 Q ss_pred CeEEEEEeeeEEecCeEEEeeeeeeEEEec Q lcl|Aclame:pro 465 PNRLNTLFPPDYVNQLRVFAVVNQFRLQYS 494 (498) Q Consensus 465 ~nRvn~~~p~~~vn~l~v~A~~~~f~lq~~ 494 (498) +.--++.+-..+-+..|.+ .|+.-|-|+ T Consensus 423 R~~~~i~~~~~laGAIh~~--~i~~~v~~~ 450 (450) T protein:vir:95 423 RILKDVTFAGILAGAILDV--DLKGTVAYE 450 (450) T ss_pred cCCCCeeEEEEEccceEEE--EEEEEEEeC Confidence 3334577888888988855 456666777 No 20 >protein:vir:5260 Length: 502 # NCBI annotation: hypothetical protein # Family: family:all:396 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852765;genbank:gi:31544040;uniprot:Q7Y5T5;genbank:GeneID:2753559 Probab=99.70 E-value=4.9e-15 Score=99.03 Aligned_cols=440 Identities=13% Similarity=0.126 Sum_probs=244.7 Q ss_pred CccchhhcCcccccCeEEEEEecC-CCCCCCCCccEEEEEecCCCCccccceeEEec-ChHHHHHhhCcCcHHHHHHHHH Q lcl|Aclame:pro 1 MTISFNTIPSNTLVPLFYAEMDNQ-AANTAQDSGASLLIGHANNGAEIVANSLVLMP-SADYARQICGAGSQLARMVEAY 78 (498) Q Consensus 1 M~i~f~~Ip~~~rvPg~y~E~dns-~a~~~~~~~~vLliGq~~~~g~~~~~~~~~v~-s~~~A~~~fG~GS~l~~M~~a~ 78 (498) |+|+.++|=+ +-++-+ .+....+-.-.||+|......-..+...++.+ |.++..+.||..|....+++.| T Consensus 1 msip~s~ivn--------V~i~~~~~a~~~~~f~~~l~l~~~~~~~~~~~~~r~~~~~s~~~V~~~FG~~s~ey~aA~~y 72 (502) T protein:vir:52 1 MALSISHIVN--------VQLNTVPKSAARKSFGIVALFTPEAGQAFADEKTRYVYVENQRDVEQLFGTNSETAKAAQPF 72 (502) T ss_pred CCCCccceeE--------EeeccccccccccccCceEEEeeccCccccCCccceEEecCHHHHHHhcCCChHHHHHHHHH Confidence 9999988631 222212 11222333455888877655444455556554 7788999999999999999999 Q ss_pred HHhCCCc-eEEEEEecCCc-cceeEEEEEEee-e---------ccCCcEEEEEEccEEEE---EEeecCCCHHHHHHHHH Q lcl|Aclame:pro 79 RQTDPFG-ELYVIAVPEAT-GAAATVTLTVTG-E---------ATESGTVNVYVGRTRVQ---APVTNGDNVTTIASSIQ 143 (498) Q Consensus 79 ~~~n~~~-~l~~i~l~d~a-g~aatg~ititg-t---------at~~G~l~l~I~g~~v~---v~V~~gdtaa~iA~~l~ 143 (498) +..-|-. .|++-.-...+ ....++. .+.| + +-.+|++++.|+|...+ +.++...+.+.+|..|. T Consensus 73 F~q~p~P~~l~igR~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~G~l~i~i~g~~~t~~~i~lS~~ts~~~vA~~i~ 151 (502) T protein:vir:52 73 FAQSPRAKQLIVARWQKSASTIEATKN-TLSGATLSDDLERFKSVVNGRFSLTIGGDVKKVDGLSFARLADFNAVATKIQ 151 (502) T ss_pred hcCCCccceEEEEeccccccceeechh-hhhhhhhHHhHHHhhhhcCceeEEEecceeeeeeccccccccchhHHHHHHH Confidence 9887764 55554433222 1122211 1111 1 22589999999998776 55677888899999999 Q ss_pred HHHhcCCC-ceEEEeeccceEEEeeccCcccccceeEEEEeccc--Ccccccccce--e-----eeecccCCCcCcchhh Q lcl|Aclame:pro 144 DAINAVPT-LPFTASSSAGVVTLTARHKGLCGNEIPVSLNYYGF--GGGEVLPAGV--Q-----IAVATGTAGTGAPVLT 213 (498) Q Consensus 144 ~aIn~~~~-lpVtA~~~~~~VtlTAk~kG~~gN~i~l~~~~~~~--~~ge~~p~Gl--t-----~tit~~agGag~pD~~ 213 (498) +++.+... +-|+-...+...+++....|.. ..+ .+.|-.. ..|.-+.+.+ + +.+..-+.|....++. T Consensus 152 ~~l~~~~~~~tv~~d~~~~~F~i~s~ttg~~-~~~--~~~~a~~~~~~gt~~a~~l~l~~~~~av~v~~~~~g~~aet~~ 228 (502) T protein:vir:52 152 EKLTTLSVAVSIAYDETGNRFIVSANVAGED-KKT--EIDYAIDEGGEGEYIGALLKLENGQASRKVGKNSVSLKKETLG 228 (502) T ss_pred hhhcccccceEEEEecCCceEEEEeccCCCc-cee--EEEEeecCCcchhHHHHHhccccccceeeeeeecccccccCHH Confidence 99876543 2233335566777777665522 122 2222111 1111122221 1 1111122344455777 Q ss_pred hHHHhhcc---CcceEEEecCCChHHHHHHHHHHhhhhhhhhhhhheeeEEEE---eccCCHHHHHhhhhccCcceEEEE Q lcl|Aclame:pro 214 GAVAAMAD---EPFDYIGLPFNDTASVNTLVTEMNDTSGRWSYARQLYGHVYT---AKTGTLSELVNAGDQFNQQHITLA 287 (498) Q Consensus 214 ~alaalg~---~~~~~I~~p~tD~a~l~al~~~l~~~s~r~~~~~q~~g~~~~---~~~gt~~~~~t~g~~~N~~~~t~~ 287 (498) ++|+++-+ .||-+++..-.+.+...++..|.+... ++++.... ...++...+...-...|..+..++ T Consensus 229 ~al~a~~~~~~~w~~~~~a~~~~~~~~la~a~~iea~~-------~~f~~~~~d~~~~~~~~~~i~~~l~a~~~~~t~~~ 301 (502) T protein:vir:52 229 EALFNVAEVNNTWYGFTVAAQLTDSEVEAAAKYAQANT-------KLFGANVIRAEQIEWSADNIYKKLYDAGLDHTLAM 301 (502) T ss_pred HHHHHHHhccCceEEEEEeecCChhHHHHHHHHHhhcC-------cEEEEEecCcceeccccchHHHHHHhccCceeEEE Confidence 88877754 456555443334556678888876432 23332111 011111112222223455666666 Q ss_pred ecCCCCCCcHHHHHHHHHHHhhhhhccCcccc-----ccceEEeccccCCCccccChHHHHHHHhCCeeEEEEcCCeEEE Q lcl|Aclame:pro 288 GYEKETQTPADELAASRTARAAVFIRNDPARP-----TQTGELVGMLPAPKGKRFTMTEQQTLLSHGVATAYVESGVLRI 362 (498) Q Consensus 288 ~~~~~~~~p~~~~AAa~~a~~a~~l~~DPArp-----l~tl~L~Gl~~p~~~~r~~~~er~~lL~~Gist~~v~~G~v~I 362 (498) ... .. + -..+++++.++ ..|+.+. +.--.|+|+.|. .++.+|.+.|-.+|+..+.--+|+-.+ T Consensus 302 y~~-~~--~--~~~aa~~g~~a---s~~f~~~~g~iT~~fk~l~GV~~~----~lt~t~~~al~~~~~N~y~~~~~~~~~ 369 (502) T protein:vir:52 302 FDK-ND--M--YPVSSALARLL---STNFAANNSTLTLKFKQQPTITAD----EITATEFAKAKRLGINVYTYFDDVAMI 369 (502) T ss_pred ecC-Cc--c--hhHHHHHHHHH---hcCCCcCcceeeecccccCCcccC----cCCHHHHHHHHhcCceEEEEecCeeEE Confidence 532 22 2 13344555554 4554433 333356788653 489999999999999998665777777 Q ss_pred EeeeeeeeecCCCCCCchhhhhhhHHHHHHHHHHHHHHHhh-hcC-CceeccCCCCcCCCcccccHHHHHHHHHHHHHHH Q lcl|Aclame:pro 363 QRDVTTYRKNAYGVADNSYLDSETLHTSAYVLRKLKSVITS-KYG-RHKLASDGTRFGPGQAIVTPAVIKGELLATYRQL 440 (498) Q Consensus 363 eR~ITTY~~n~~G~~D~s~ldi~t~~tl~yv~~~~r~~~~~-~~~-r~kla~dg~~~~~g~~ivTp~~ikaeli~~~~~l 440 (498) .+.+++ .| .|.| .++-++++...++..+.. .|- -.|+--+... -.+|++.+-+.+++. T Consensus 370 ~~G~~~-----~G----~~iD--~~~~~~Wl~~~lq~~l~~~L~~s~~kIPy~~~G---------~~~l~a~i~~~l~~a 429 (502) T protein:vir:52 370 AEGTVI-----GG----KFAD--EIVILDWFVDAVQKEVFARLYKSPTKIPLTDKG---------QAILIAAVEKVCLEG 429 (502) T ss_pred ecCeee-----CC----chhh--HHHHHHHHHHHHHHHHHHHHHhcCCCcccChhH---------HHHHHHHHHHHHHHH Confidence 777776 23 4777 778899999999888854 342 1233322111 268999999999999 Q ss_pred hhccccc-chh---hh---------cCeEEEE---EcCC------CCeEEEEEeeeEEecCeEEEeeeeeeEEEecc Q lcl|Aclame:pro 441 ERAGIVE-NYE---LF---------KQYLVVE---RDAS------VPNRLNTLFPPDYVNQLRVFAVVNQFRLQYSE 495 (498) Q Consensus 441 e~~give-n~~---~~---------~~~lvVe---rd~~------d~nRvn~~~p~~~vn~l~v~A~~~~f~lq~~~ 495 (498) ...|+|. ++. .+ .+.-.|. ++.+ ++.--.+.|-..+-+.+|.+-. .+..++ T Consensus 430 ~~~G~I~~G~~~~~~~g~~~~~d~~~~gy~v~~~~~~~~s~~dr~~R~~~~~~~~~~~aGaIh~v~i----~~nv~~ 502 (502) T protein:vir:52 430 INNGAFAPGKWTGAGFGNLSTGDYLDKGFYVWAAPMDTLSDSDRQARRATPIQTAVKLAGAIHSSDV----IVNYNR 502 (502) T ss_pred HhcCccccccccCcccceeeecccccCceEEEeCchhhCCHHHHHcccCCCeEEEEEECceEEEEEE----EEEEeC Confidence 9999874 211 11 1111111 1111 1112234455555555554433 333334 No 21 >protein:vir:78611 Length: 501 # NCBI annotation: BcepNY3gp03 # Family: family:all:396 # MgeID: mge:1854 # MgeName: BcepNY3 # Cross-refs: genbank:acc:YP_001294840;genbank:gi:149882903;genbank:GeneID:5291082 Probab=99.68 E-value=4.3e-15 Score=99.34 Aligned_cols=444 Identities=14% Similarity=0.082 Sum_probs=245.7 Q ss_pred cchhhcCcccccCeEEEEEecCCCCCCCCC--ccEEEEEecCCCCccccceeEEecChHHHHHhhCcCcHHHHHHHHHHH Q lcl|Aclame:pro 3 ISFNTIPSNTLVPLFYAEMDNQAANTAQDS--GASLLIGHANNGAEIVANSLVLMPSADYARQICGAGSQLARMVEAYRQ 80 (498) Q Consensus 3 i~f~~Ip~~~rvPg~y~E~dns~a~~~~~~--~~vLliGq~~~~g~~~~~~~~~v~s~~~A~~~fG~GS~l~~M~~a~~~ 80 (498) |+|+.||-+- ++.++-+....+..+ .-.||++ .....+.+.+....|.+++.+.||.+|.-..|++.|+. T Consensus 1 m~~~~ip~s~-----iV~V~~~v~~~~~~~~~~~~lll~---~~~~~~~~r~~~y~s~~~V~~~FG~~S~ey~aA~~yFs 72 (501) T protein:vir:78 1 MPTTTIPIDQ-----IVQMLPGVIGAGGAPGRLTGLVLT---QDTSIQPGQLADFFQKTDVENWFGGLSNEAVIADAYFP 72 (501) T ss_pred CCcCccccce-----EEEEeeecccCCCcceeeeeEEEe---cCCCCCccceeeecCHHHHHHhcCCChHHHHHHHHHhh Confidence 7899999763 455655543222221 2234443 22334566666666889999999999999999999996 Q ss_pred ----hCCC-ceEEEEEecCCcccee--EEEEE---EeeeccCCcEEEEEEccEEEE--EEeecCCCHHHHHHHHHHHHhc Q lcl|Aclame:pro 81 ----TDPF-GELYVIAVPEATGAAA--TVTLT---VTGEATESGTVNVYVGRTRVQ--APVTNGDNVTTIASSIQDAINA 148 (498) Q Consensus 81 ----~n~~-~~l~~i~l~d~ag~aa--tg~it---itgtat~~G~l~l~I~g~~v~--v~V~~gdtaa~iA~~l~~aIn~ 148 (498) ..|. ..|++-.-...+..+. .+.++ ++--..-+|+++|.|+|+... +..+...+.+++|+.|..+|++ T Consensus 73 ~~~~q~~~P~~l~igR~~~~a~~~~l~g~~l~~~~la~~~~~~G~l~iti~g~~~~~~i~~S~~ts~~~vA~~i~~al~a 152 (501) T protein:vir:78 73 GIVNGGQLPYDLKFARYVAADAPASVYGIPLTGVTLTQLQGYSGTLTVTTAAQHVSSNISLAAATSFANAATLIEAAFTS 152 (501) T ss_pred cCCCCCcccceEEEEeecccCcceeEeccceeccchhhhceeeeEEEEEeccceeeeccccccccCHHHHHHHHHhhhcC Confidence 5555 4677666544332222 12222 111112359999999997654 4445788889999999999975 Q ss_pred CCCceEEEeeccceEEEeeccCcccccceeEEEEecccCcccccccceeeeeccc----CCCcCcchhhhHHHhh---cc Q lcl|Aclame:pro 149 VPTLPFTASSSAGVVTLTARHKGLCGNEIPVSLNYYGFGGGEVLPAGVQIAVATG----TAGTGAPVLTGAVAAM---AD 221 (498) Q Consensus 149 ~~~lpVtA~~~~~~VtlTAk~kG~~gN~i~l~~~~~~~~~ge~~p~Glt~tit~~----agGag~pD~~~alaal---g~ 221 (498) .+..|+-.+.....++++...|+.+ .|..... +--+.+.+.++-... +.|+....+.++++++ .. T Consensus 153 -~~~tv~~ds~~~~f~its~t~G~~~-~i~~~t~------~~~~a~~l~Lt~~~~a~v~~~g~~aet~~~a~~a~~~~~~ 224 (501) T protein:vir:78 153 -PDFVVSYDALRNRFVVNTNATGTAA-AISAVTG------TNNLADELGLSAAAGASLQAAGVAADTPASAMNRAVGLSR 224 (501) T ss_pred -cceEEEEccccceEEEEeeecCCce-eEEEEec------ccchhhhhcccccCceeeEeccccccCHHHHHHHHHhccC Confidence 3467888888888999999888754 3333321 111223333332221 2333333456666555 45 Q ss_pred CcceEEEecCCChHHHHHHHHHHhhhhhhhhhhhheeeEEEEeccCC---HHHHHhhhh---ccCcceEEEEecCCCCCC Q lcl|Aclame:pro 222 EPFDYIGLPFNDTASVNTLVTEMNDTSGRWSYARQLYGHVYTAKTGT---LSELVNAGD---QFNQQHITLAGYEKETQT 295 (498) Q Consensus 222 ~~~~~I~~p~tD~a~l~al~~~l~~~s~r~~~~~q~~g~~~~~~~gt---~~~~~t~g~---~~N~~~~t~~~~~~~~~~ 295 (498) .||-|..+.--+.+...++.+|.+....|+ ..+....... .+.-..++. ..|..|..++.. . T Consensus 225 ~Wy~f~~a~~~~~~~~lalA~wiea~~~~f-------~~~~~~~~~~~~~~~~~~~i~~~l~a~~y~~t~~~y~---~-- 292 (501) T protein:vir:78 225 NWATFTTAWTAVIADRLALASWNSGQAYKY-------MYVAPDLEPASIVTNNSASFGAQVFAAPYQGTLPLYG---D-- 292 (501) T ss_pred ceEEEEEecCCCHHHHHHHHHHHHhcCceE-------EEEEecCCcceeecccchhHHHHHhhcCCCceEEEcC---C-- Confidence 677776554344555667787776533222 1121111110 001112222 336666666542 1 Q ss_pred cHHHHHHHHHHHhhhhhccCccccccceEEecc--ccCCCccccChHHHHHHHhCCeeEEEE--cCC-e-EEEEeeeeee Q lcl|Aclame:pro 296 PADELAASRTARAAVFIRNDPARPTQTGELVGM--LPAPKGKRFTMTEQQTLLSHGVATAYV--ESG-V-LRIQRDVTTY 369 (498) Q Consensus 296 p~~~~AAa~~a~~a~~l~~DPArpl~tl~L~Gl--~~p~~~~r~~~~er~~lL~~Gist~~v--~~G-~-v~IeR~ITTY 369 (498) + -.++++++.++ ..|+.+..-+..++-- .+.-..+..+.+|.+.|..+|+..+.. ..| + -.+.+.+.. T Consensus 293 ~--~~~aa~~g~~a---s~nf~~~~g~~T~~fkq~~~Gv~a~~l~~t~a~al~~~~~N~y~~~~~~~~~~~~~~~G~~s- 366 (501) T protein:vir:78 293 Q--ATAGAVMGYAA---SINFQLRNGRTVLAFRQFNAGVPATAHDLGTANALRSNNYTYIGAYANAANNYTIAYDGKLS- 366 (501) T ss_pred c--chHHHHHHHHH---hcCcccCcceeeeeccccCCCcCcccCCHHHHHHHHhcCCeEEEEEecccceeeEEEcCeee- Confidence 1 13456666666 4577666655555432 222335678999999999999998865 223 2 334555432 Q ss_pred eecCCCCCCchhhhhhhHHHHHHHHHHHHHHHhhhcCC-ceeccCCCCcCCCcccccHHHHHHHHHHHHHHHhhcccccc Q lcl|Aclame:pro 370 RKNAYGVADNSYLDSETLHTSAYVLRKLKSVITSKYGR-HKLASDGTRFGPGQAIVTPAVIKGELLATYRQLERAGIVEN 448 (498) Q Consensus 370 ~~n~~G~~D~s~ldi~t~~tl~yv~~~~r~~~~~~~~r-~kla~dg~~~~~g~~ivTp~~ikaeli~~~~~le~~given 448 (498) | .|.||..++=.+++...++..+-.-|-. .|+--+.. | -.++++.+-+.+++-...|+|.- T Consensus 367 -----G----~~~wiD~~~~~~Wl~~~iq~~l~~ll~~~~kIPyt~~----G-----~~~l~a~v~~~l~~av~nG~I~~ 428 (501) T protein:vir:78 367 -----G----KFLWVDTYLDQIYLNAELQRAEFEAMLAYNSLPYNED----G-----YTALYRAGVDVIDAAVTSGIIRA 428 (501) T ss_pred -----c----cceeehhhhhHHHHHHHHHHHHHHHHHhCCCcccCHH----H-----HHHHHHHHHHHHHHHHhCceeec Confidence 2 4888999888899999998888655421 13222211 1 26799999999999999999853 Q ss_pred hhhhcCeEEEEEc--------CCCC-eEEEEEee--eEEecCeEEEeeeeeeEEEecccCC Q lcl|Aclame:pro 449 YELFKQYLVVERD--------ASVP-NRLNTLFP--PDYVNQLRVFAVVNQFRLQYSEESA 498 (498) Q Consensus 449 ~~~~~~~lvVerd--------~~d~-nRvn~~~p--~~~vn~l~v~A~~~~f~lq~~~~~~ 498 (498) =...-..-..+.+ ..|. +|--++.+ ++-.-|-|.--..-.+.|-|...-| T Consensus 429 Gv~l~~~q~~~I~~~~g~~~~~~~~~~~Gyy~~~~~~~~~~~~R~~R~~p~~~~~y~~~ga 489 (501) T protein:vir:78 429 GVTLTNSQLQQIDAAAGVAGAGQLVQMRGWYFLIGDPANPGQARQNRTTPTCTLWYSDGGS 489 (501) T ss_pred CCCCCCccceeeccccCccccccceeccceEEeeccccCChhhhhhcccCcEEEEEEeCCc Confidence 1110001111111 1110 11111111 1111122222222334444444444 No 22 >protein:vir:3636 Length: 501 # NCBI annotation: gp04 # Family: family:all:396 # MgeID: mge:75 # MgeName: Bcep781 # Cross-refs: genbank:acc:NP_705631;genbank:gi:23752316;genbank:GeneID:955753 Probab=99.65 E-value=8.9e-14 Score=92.14 Aligned_cols=443 Identities=15% Similarity=0.093 Sum_probs=240.2 Q ss_pred cchhhcCcccccCeEEEEEecCCC---CCCCCCccEEEEEecCCCCccccceeEEecChHHHHHhhCcCcHHHHHHHHHH Q lcl|Aclame:pro 3 ISFNTIPSNTLVPLFYAEMDNQAA---NTAQDSGASLLIGHANNGAEIVANSLVLMPSADYARQICGAGSQLARMVEAYR 79 (498) Q Consensus 3 i~f~~Ip~~~rvPg~y~E~dns~a---~~~~~~~~vLliGq~~~~g~~~~~~~~~v~s~~~A~~~fG~GS~l~~M~~a~~ 79 (498) |+|+.||-+- ++.++-+.. +.+.+ .-.||++. ....+++.+....|.+++.+.||.+|.-..|++.|+ T Consensus 1 m~~~~ip~s~-----iV~V~~~v~~~~~~~~~-~~~lllt~---~~~~~~~r~~~y~s~~~V~~~FG~~S~ey~aA~~yF 71 (501) T protein:vir:36 1 MPTTTIPIDQ-----IVQMLPGVIGAGGAPGR-LTGLVLTQ---DTSVQPGQLADFFQETDVENWFGALSNEAKIADAYF 71 (501) T ss_pred CCcCCcccce-----EEEEeeeeccCCCccee-eeeEEEec---cCCCCCcceeeecCHHHHHHhcCCChHHHHHHHHHh Confidence 7899999763 445554443 33222 22344432 233456666666688999999999999999999999 Q ss_pred H----hCCC-ceEEEEEecCCcccee-E-EEEEE---eeeccCCcEEEEEEccEEEEEE--eecCCCHHHHHHHHHHHHh Q lcl|Aclame:pro 80 Q----TDPF-GELYVIAVPEATGAAA-T-VTLTV---TGEATESGTVNVYVGRTRVQAP--VTNGDNVTTIASSIQDAIN 147 (498) Q Consensus 80 ~----~n~~-~~l~~i~l~d~ag~aa-t-g~iti---tgtat~~G~l~l~I~g~~v~v~--V~~gdtaa~iA~~l~~aIn 147 (498) . ..|. ..|++-.-...+..+. . ++++- .--..-.|++++.|+|+..... .+...+.+++|+.|..+|. T Consensus 72 s~~~~q~~~P~~l~igR~~~~a~~~~l~g~~l~~~~~a~~~~~sg~l~vti~g~~~~~~i~lS~~ts~~~vA~~i~~al~ 151 (501) T protein:vir:36 72 PGIVNGGQLPYDLKFARYVAADAPASVYGIPLTGVTLAQLQGYSGTLTVTTAAQHVSANISLAAATSFANAATLIEAAFT 151 (501) T ss_pred hcccCCCccccEEEEEeecCcCcceeEeccchhhhhhhhccceeEEEEEEecceeeeeecccccccCHHHHHHHHhhhhc Confidence 5 5554 4777766554332222 1 22221 1111235899999999876543 4577788899999999997 Q ss_pred cCCCceEEEeeccceEEEeeccCcccccceeEEEEecccCcccccccceeeeeccc----CCCcCcchhhhHHHhh---c Q lcl|Aclame:pro 148 AVPTLPFTASSSAGVVTLTARHKGLCGNEIPVSLNYYGFGGGEVLPAGVQIAVATG----TAGTGAPVLTGAVAAM---A 220 (498) Q Consensus 148 ~~~~lpVtA~~~~~~VtlTAk~kG~~gN~i~l~~~~~~~~~ge~~p~Glt~tit~~----agGag~pD~~~alaal---g 220 (498) . .+..|+-.......+++....|... .|.... .+....+.+.++-... ..|+....+.++++++ . T Consensus 152 ~-~~~tv~~d~~~~~f~i~s~t~G~~~-~i~~~t------~~~~ia~~l~Lt~~~~a~v~~~g~~~et~~~al~a~~~~s 223 (501) T protein:vir:36 152 S-PDFVVAYDALRNRFTVVTNATGTAA-AISAVT------GTNNFADEIGLSAAAGATLQAAGVAADTPASAMNRAVGLS 223 (501) T ss_pred C-cceEEEEcCcceeEEEEeccCCcce-eeEeee------cccchhhhhcccccCcceEEecccccccHHHHHHHHHhcc Confidence 5 3456666667778888888877522 232221 1111233333333322 2333333455556555 4 Q ss_pred cCcceEEEecCCChHHHHHHHHHHhhhhhhhhhhhheeeEEEEeccCCH--H-HHHhhh---hccCcceEEEEecCCCCC Q lcl|Aclame:pro 221 DEPFDYIGLPFNDTASVNTLVTEMNDTSGRWSYARQLYGHVYTAKTGTL--S-ELVNAG---DQFNQQHITLAGYEKETQ 294 (498) Q Consensus 221 ~~~~~~I~~p~tD~a~l~al~~~l~~~s~r~~~~~q~~g~~~~~~~gt~--~-~~~t~g---~~~N~~~~t~~~~~~~~~ 294 (498) ..||-|.++.--+.+...++..|.+....|+ +++ ........ . .-..++ -..|..|..++.. .+ T Consensus 224 ~~Wy~f~~a~~~~~~~~la~A~wiea~~~~f-----~~~--~~~~~~~~~~~~~~~~i~~~l~~~~y~~t~~~y~---~~ 293 (501) T protein:vir:36 224 RNWATFTTAWTAVIADRLAFASWNSGQAYKY-----MYV--APDLEAASIVSNNAASFGAQVFAAPYQGTLPLYG---DQ 293 (501) T ss_pred CceEEEEEecCCChHHHHHHHHHHhhcCceE-----EEE--EecCchhhhhccchhhHHHHHHhcCCCcEEEEcC---CC Confidence 5677766554333444557777776432221 112 11111000 0 111222 2346777777752 22 Q ss_pred CcHHHHHHHHHHHhhhhhccCccccccceEEeccc--cCCCccccChHHHHHHHhCCeeEEEE--cCC-e-EEEEeeeee Q lcl|Aclame:pro 295 TPADELAASRTARAAVFIRNDPARPTQTGELVGML--PAPKGKRFTMTEQQTLLSHGVATAYV--ESG-V-LRIQRDVTT 368 (498) Q Consensus 295 ~p~~~~AAa~~a~~a~~l~~DPArpl~tl~L~Gl~--~p~~~~r~~~~er~~lL~~Gist~~v--~~G-~-v~IeR~ITT 368 (498) .| ++++.+.++ ..|..+..-+..++--. +.-..+..+.+|.+.|..+|+..+.+ ..| + -.+.+.+.+ T Consensus 294 ~~----~aa~~g~~a---s~nf~~~~g~~T~~fkq~~~Gi~a~~l~~t~a~al~~~~~N~y~~~~~~~~~~~~~~~G~~s 366 (501) T protein:vir:36 294 AT----AGAVMGYAA---SINFQLRNGRTVLAFRQFNAGVPATVHDLPTANALRSNNYTYIGAYANAANNYTIAYDGKLS 366 (501) T ss_pred CH----HHHHHHHHH---hcCcccCcceeeeeccccCCCcCcCcCCHHHHHHHHhcCCcEEEEEecccceeeEEEcCeee Confidence 22 234555555 45665555555554322 22234578999999999999998854 222 3 345566433 Q ss_pred eeecCCCCCCchhhhhhhHHHHHHHHHHHHHHHhhhcCC-ceeccCCCCcCCCcccccHHHHHHHHHHHHHHHhhccccc Q lcl|Aclame:pro 369 YRKNAYGVADNSYLDSETLHTSAYVLRKLKSVITSKYGR-HKLASDGTRFGPGQAIVTPAVIKGELLATYRQLERAGIVE 447 (498) Q Consensus 369 Y~~n~~G~~D~s~ldi~t~~tl~yv~~~~r~~~~~~~~r-~kla~dg~~~~~g~~ivTp~~ikaeli~~~~~le~~give 447 (498) | .|.+|..++=.+++...++..+-.-|-. .|+--+.. | -.++++.+-+.+++-...|+|. T Consensus 367 ------G----~~~wiD~~~g~dWL~~~iq~~l~~ll~~~~KIPytd~----G-----~~~l~a~i~~~l~~av~nG~I~ 427 (501) T protein:vir:36 367 ------G----KFLWVDTYLDQIYLNAELQRAEFEAMLAYNSLPYNED----G-----YTGLYRAGVDVIDAAVTSGIIR 427 (501) T ss_pred ------c----cchhhhHHHhHHHHHHHHHHHHHHHHhcCCCCccChh----h-----HHHHHHHHHHHHHHHHhCceee Confidence 2 3777888889999999999888665522 23222211 1 2689999999999999999985 Q ss_pred chhhhcCeEEEEEcC--------CC-CeEEEEEeeeE--EecCeEEEeeeeeeEEEecccCC Q lcl|Aclame:pro 448 NYELFKQYLVVERDA--------SV-PNRLNTLFPPD--YVNQLRVFAVVNQFRLQYSEESA 498 (498) Q Consensus 448 n~~~~~~~lvVerd~--------~d-~nRvn~~~p~~--~vn~l~v~A~~~~f~lq~~~~~~ 498 (498) -=...-..-..+.+. +| .+|--++.++. -.-|-|---..-.+.+-|...-| T Consensus 428 ~Gv~~~~~q~~~I~~~~g~~~~~~~v~~~Gyy~~~~~~~~~~~~R~~R~~p~~~~~y~~~ga 489 (501) T protein:vir:36 428 AGVTLTNSQLQQIDKAARVAGAGQLVQTRGWYFLIGDPANPGQARQNRTTPACTLWYSDGGS 489 (501) T ss_pred cCCCCCcccceeecccccccccccceeccceEEeeCcccCChhhhhhcccCcEEEEEEeCCc Confidence 311111111111111 11 11111111111 11112222222233334444444 No 23 >protein:vir:106730 Length: 501 # NCBI annotation: gp04 # Family: family:all:396 # MgeID: mge:1599 # MgeName: Bcep1 # Cross-refs: genbank:acc:NP_944312;genbank:gi:38638611;genbank:GeneID:2657359 Probab=99.59 E-value=2.2e-13 Score=89.96 Aligned_cols=446 Identities=15% Similarity=0.084 Sum_probs=246.1 Q ss_pred cchhhcCcccccCeEEEEEecCCCCCC--CCCccEEEEEecCCCCccccceeEEecChHHHHHhhCcCcHHHHHHHHHHH Q lcl|Aclame:pro 3 ISFNTIPSNTLVPLFYAEMDNQAANTA--QDSGASLLIGHANNGAEIVANSLVLMPSADYARQICGAGSQLARMVEAYRQ 80 (498) Q Consensus 3 i~f~~Ip~~~rvPg~y~E~dns~a~~~--~~~~~vLliGq~~~~g~~~~~~~~~v~s~~~A~~~fG~GS~l~~M~~a~~~ 80 (498) |+|+.||-+- ++.++-+....+ ....-.||++ .....+++.+....|.+++.+.||..|.-..+++.|+. T Consensus 1 m~~~~ip~s~-----iV~V~~~v~~~~~~~~~f~~lll~---~~~~~~~~r~~~y~s~~~V~~~FG~~S~ey~aA~~yFs 72 (501) T protein:vir:10 1 MPTTTIPIDQ-----IVQMLPGVIGAGGAPGRLTGLVLT---QDTSVQPGQLADFFQKTDVENWFGALSNEAKIADAYFP 72 (501) T ss_pred CCcCccccce-----EEEEeeecccCCCcccccceEEEe---cccCCCccceeeecCHHHHHHhcCCChHHHHHHHHHhh Confidence 7899999763 445555443222 2223355553 23345667777777999999999999999999999995 Q ss_pred ----hCCC-ceEEEEEecCCcccee--EEEEEE---eeeccCCcEEEEEEccEEEEEE--eecCCCHHHHHHHHHHHHhc Q lcl|Aclame:pro 81 ----TDPF-GELYVIAVPEATGAAA--TVTLTV---TGEATESGTVNVYVGRTRVQAP--VTNGDNVTTIASSIQDAINA 148 (498) Q Consensus 81 ----~n~~-~~l~~i~l~d~ag~aa--tg~iti---tgtat~~G~l~l~I~g~~v~v~--V~~gdtaa~iA~~l~~aIn~ 148 (498) .-|. ..|++-.-...+..+. .++++- +--..-+|+++|.|+|+..... .+...+.+++|+.|..+|+. T Consensus 73 g~~~q~p~P~~l~igR~~~~~~~~~l~g~~l~~~~la~~~~~~g~l~i~i~g~~~~~~i~~s~ats~~~vA~~i~~al~~ 152 (501) T protein:vir:10 73 GIVNGGQLPYDLKFARYVAADAPASVYGIPLTGITLAQLQGYSGTLTVTTAAQHVSANISLAAATSFANAATLIEAAFTS 152 (501) T ss_pred hhcCCCccccEEEEEeecccCccceeeeceehhhhhhhhhheeeEEEEeeccceeeeccccccccCHHHHHHHHHHhhcC Confidence 4554 4777776554332222 222321 1112346999999999765443 56777889999999999975 Q ss_pred CCCceEEEeeccceEEEeeccCcccccceeEEEEecccCcccccccceeeeeccc----CCCcCcchhhhHHHhhc---c Q lcl|Aclame:pro 149 VPTLPFTASSSAGVVTLTARHKGLCGNEIPVSLNYYGFGGGEVLPAGVQIAVATG----TAGTGAPVLTGAVAAMA---D 221 (498) Q Consensus 149 ~~~lpVtA~~~~~~VtlTAk~kG~~gN~i~l~~~~~~~~~ge~~p~Glt~tit~~----agGag~pD~~~alaalg---~ 221 (498) ...+|+........+++....|+.. .|.... ..+ -+.+++.++-... ..|+....+.++++++- . T Consensus 153 -~~~tv~~d~~~~~f~i~~~t~G~~~-~i~~~t-----~~~-d~a~~l~Lt~~~~a~v~~~g~~aet~~~Al~a~~~~~~ 224 (501) T protein:vir:10 153 -PDFVVAYDALRNRFTVVTNTTGTAA-AISAVT-----GTN-NLADELGLSAAAGATLQAAGVAADTPASAMNRAVGLSR 224 (501) T ss_pred -CceEEEEecccceEEEEecccCcce-eEEEee-----ccc-cchhhhcccccCceeEEecCcccccHHHHHHHHHhccc Confidence 4457888788888999988777542 233221 111 1233333332211 22333334566666654 5 Q ss_pred CcceEEEecCCChHHHHHHHHHHhhhhhhhhhhhheeeEEEEec----cCCHHHHHhhhhccCcceEEEEecCCCCCCcH Q lcl|Aclame:pro 222 EPFDYIGLPFNDTASVNTLVTEMNDTSGRWSYARQLYGHVYTAK----TGTLSELVNAGDQFNQQHITLAGYEKETQTPA 297 (498) Q Consensus 222 ~~~~~I~~p~tD~a~l~al~~~l~~~s~r~~~~~q~~g~~~~~~----~gt~~~~~t~g~~~N~~~~t~~~~~~~~~~p~ 297 (498) .||-|..+.--+.+...++.+|.+....|+- ++..-... .+.-..+...--..|..|..++.. .+.| T Consensus 225 ~Wy~f~~a~~~~~~~~la~A~wi~a~~~~f~-----~~~~~~~~~~~~~~~~~~i~~~l~~~~y~~t~~~y~---~~~~- 295 (501) T protein:vir:10 225 NWATFTTAWTAVIADRLAFAAWNSGQAYKYM-----YVAPDLEAASIVTNNAASFGAQVFAAPYQGTLPLYG---DQAT- 295 (501) T ss_pred ceEEEEEEecCChHHHHHHHHHHHhcCceEE-----EEEecCcceeeecccchhHHHHHHhcCCCceEEECC---CCCH- Confidence 6666665543445556678877765433221 22110000 000011111122336667766642 2222 Q ss_pred HHHHHHHHHHhhhhhccCccccccceEEecc--ccCCCccccChHHHHHHHhCCeeEEEE--cCC-e-EEEEeeeeeeee Q lcl|Aclame:pro 298 DELAASRTARAAVFIRNDPARPTQTGELVGM--LPAPKGKRFTMTEQQTLLSHGVATAYV--ESG-V-LRIQRDVTTYRK 371 (498) Q Consensus 298 ~~~AAa~~a~~a~~l~~DPArpl~tl~L~Gl--~~p~~~~r~~~~er~~lL~~Gist~~v--~~G-~-v~IeR~ITTY~~ 371 (498) ++++.+.++ ..|+.+..-+..++-- .+.-..+..+.+|.+.|..+|+..+.. ..| + -.+.+.+.+ T Consensus 296 ---~aa~~g~~a---s~nf~~~~g~~T~~fkql~~Gv~a~~l~~t~a~al~~~~~N~y~~~~~~~~~~~~~~~G~~s--- 366 (501) T protein:vir:10 296 ---AGAVMGYAA---SINFQLRNGRTVLAFRQFNAGVPATAHDLPTANALRSNNYTYIGAYANAANNYTIAYDGKLS--- 366 (501) T ss_pred ---HHHHHHHHH---hcCcccCcceeeeeecccCCCcCcccCCHHHHHHHHhcCCeEEEEEecccceeeEEEcceee--- Confidence 335555555 4566665554444432 222235578999999999999998855 223 2 344555433 Q ss_pred cCCCCCCchhhhhhhHHHHHHHHHHHHHHHhhhcCC-ceeccCCCCcCCCcccccHHHHHHHHHHHHHHHhhcccccc-h Q lcl|Aclame:pro 372 NAYGVADNSYLDSETLHTSAYVLRKLKSVITSKYGR-HKLASDGTRFGPGQAIVTPAVIKGELLATYRQLERAGIVEN-Y 449 (498) Q Consensus 372 n~~G~~D~s~ldi~t~~tl~yv~~~~r~~~~~~~~r-~kla~dg~~~~~g~~ivTp~~ikaeli~~~~~le~~given-~ 449 (498) | .|.+|..++=.+++...++..+-.-|-. .|+--+.. --.++++.+-+.+++-...|+|.- + T Consensus 367 ---G----~~~wiD~~~g~dWl~~~iq~~l~~ll~~~~kIPyt~~---------G~~~l~a~i~~~l~~av~nG~Ia~Gv 430 (501) T protein:vir:10 367 ---G----KFLWVDTYLDQIYLNAELQRAEFEAMLAYNSLPYNED---------GYTANYRAGVDVIDAAVTSGIIRAGV 430 (501) T ss_pred ---c----cceehhhHhhHHHHHHHHHHHHHHHHhcCCCcccCHH---------HHHHHHHHHHHHHHHHHhCcceecCc Confidence 2 3778888899999999999888665522 23222211 126899999999999999998843 2 Q ss_pred hhh-cCeEEEE------EcCCC-CeEEEEEee--eEEecCeEEEeeeeeeEEEecccCC Q lcl|Aclame:pro 450 ELF-KQYLVVE------RDASV-PNRLNTLFP--PDYVNQLRVFAVVNQFRLQYSEESA 498 (498) Q Consensus 450 ~~~-~~~lvVe------rd~~d-~nRvn~~~p--~~~vn~l~v~A~~~~f~lq~~~~~~ 498 (498) +.. +....+. ....| .+|--++.+ ++-.-|-|---..-.+.+-|...-| T Consensus 431 ~l~~~q~~~i~~~~g~~~~~~~v~~~Gyy~~~~~~~~~~~~R~~R~~p~~~~~y~~~ga 489 (501) T protein:vir:10 431 TLTNSQLQQIDAVAGVAGAGALVQTRGWYFLIGNPANPGQARQNRTSPACTLWYSDGGS 489 (501) T ss_pred ccCcccceeecccccccccccceeccceEEeeCcccCChhhhhhcccCceEEEEEeCCc Confidence 111 1111111 11111 111111111 1111122322222334444444444 No 24 >protein:vir:79092 Length: 477 # NCBI annotation: gp13, phage tail sheath protein # Family: family:all:115 # MgeID: mge:1862 # MgeName: phiE255 # Cross-refs: genbank:acc:YP_001111213;genbank:gi:134288804;genbank:GeneID:4960766 Probab=99.59 E-value=3.4e-15 Score=99.94 Aligned_cols=429 Identities=14% Similarity=0.079 Sum_probs=205.7 Q ss_pred cCcccccCeEEEEEecCCCCCCC--CCccEEEEEecCCCCccccceeEEecChHHHHHhhCcC--cHHHHHHHHHHHhCC Q lcl|Aclame:pro 8 IPSNTLVPLFYAEMDNQAANTAQ--DSGASLLIGHANNGAEIVANSLVLMPSADYARQICGAG--SQLARMVEAYRQTDP 83 (498) Q Consensus 8 Ip~~~rvPg~y~E~dns~a~~~~--~~~~vLliGq~~~~g~~~~~~~~~v~s~~~A~~~fG~G--S~l~~M~~a~~~~n~ 83 (498) .|... .||+|+|--++.+.+.. ..--..+||- +-.++.++|++|+|..++..+||.+ +-+..-+++++. |. T Consensus 1 M~~~~-~pGVyv~E~~~g~~~I~~v~Tsv~~~VG~---a~~~p~n~pv~its~~d~~~~g~~~~~~tL~~Av~~~f~-ng 75 (477) T protein:vir:79 1 MAANY-LHGVETIEKETGSRPVKVVKSAVIGLIGT---APIGPVNTPVQSLSDVDAAQFGPQLAGFTIPQALDAVYD-YG 75 (477) T ss_pred CcCCC-CCCeEEEEecCCcccccccCCceEEEEee---cccCCCcccEEEccHHHHHHhcCCCCCCcHHHHHHHHhh-cC Confidence 44444 69999987666554322 2234567773 4456789999999999999888764 667777888886 67 Q ss_pred CceEEEEEecCCccceeEEEEEEeeeccCCcEEEEEEccEEEEEEeecCCCHHHHH---HHHHHHHhcCCCceEEEeecc Q lcl|Aclame:pro 84 FGELYVIAVPEATGAAATVTLTVTGEATESGTVNVYVGRTRVQAPVTNGDNVTTIA---SSIQDAINAVPTLPFTASSSA 160 (498) Q Consensus 84 ~~~l~~i~l~d~ag~aatg~ititgtat~~G~l~l~I~g~~v~v~V~~gdtaa~iA---~~l~~aIn~~~~lpVtA~~~~ 160 (498) -+.+|++.+.++...+.+......+......... ........+.+.......... ......++.. ..++..... T Consensus 76 g~~~~vvrV~~~~~~~~~~a~~~~~~~~~~~~~~-~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~- 152 (477) T protein:vir:79 76 SGTVIVINVLDPAVHKSNAASESVTFDAATGRAK-LAHPAAANLVLKNDSGGTTYTEGTDYAVDLINGV-ITRIKTGTI- 152 (477) T ss_pred CceEEEEeccCCcccccccccccccccccccccc-ccccccceeEEeecccccccccCccccccccchh-hhhhhcccc- Confidence 7899999997754322221111111111111100 000111111111110000000 0000000000 000000000 Q ss_pred ceEEEeeccCcccccceeEEEEecccCcccccccceeeeecccCCCcCcchhhhHHHhhc---c---CcceEEEecCCC- Q lcl|Aclame:pro 161 GVVTLTARHKGLCGNEIPVSLNYYGFGGGEVLPAGVQIAVATGTAGTGAPVLTGAVAAMA---D---EPFDYIGLPFND- 233 (498) Q Consensus 161 ~~VtlTAk~kG~~gN~i~l~~~~~~~~~ge~~p~Glt~tit~~agGag~pD~~~alaalg---~---~~~~~I~~p~tD- 233 (498) +....... ..+.. ..+... ......|.....+..+.+.++. . .-.+++..|.-+ T Consensus 153 ----------~~~~~~~~--~~~~~-----~~~~~~--~~~~~~g~~~a~~~~tg~~al~~~~~~~~~~~~iv~apg~~~ 213 (477) T protein:vir:79 153 ----------PAAATAAK--ATYDY-----ADPTKV--TAADIIGAVNAAGMRTGMKALKDTYNLYGYFSKILIAPAYCT 213 (477) T ss_pred ----------ccccceee--ceecc-----CCcccc--eeeeecccccccccchhhhhhhhhhhhcccccceeecccccc Confidence 00000000 00000 000000 0000001111111112222221 1 122445445422 Q ss_pred -hHHHHHHHHHHhhhhhhhhhhhheeeEEEE-eccCC-HHHHHhhh-------hccCcceEEEEecC--------C-CCC Q lcl|Aclame:pro 234 -TASVNTLVTEMNDTSGRWSYARQLYGHVYT-AKTGT-LSELVNAG-------DQFNQQHITLAGYE--------K-ETQ 294 (498) Q Consensus 234 -~a~l~al~~~l~~~s~r~~~~~q~~g~~~~-~~~gt-~~~~~t~g-------~~~N~~~~t~~~~~--------~-~~~ 294 (498) .+-..+|.++.+ ++ .++++. ...++ .+++.++- ...|+.++.+.+.. + ... T Consensus 214 ~~~v~~~l~~~~~----~~------~~~a~~d~p~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~~~~~~~~~~ 283 (477) T protein:vir:79 214 QNSVSVELEAMAV----QL------GAIAYIDAPIGTTLAQALAGRGPAGTINFNTSSDRVRLCYPHVKVYDIATNAERL 283 (477) T ss_pred chhHHHHHHHHHh----hc------CeEEEEecCCCCChHHHhhhhhhccccccccccceEEEEcCeeEEecccCCceee Confidence 223444444432 22 233333 22233 44433322 23467676654321 0 111 Q ss_pred CcHHHHHHHHHHHhhhh-hccCccccccceEEeccccCCC----ccccChHHHHHHHhCCeeEEEEc--CCeEEEEeeee Q lcl|Aclame:pro 295 TPADELAASRTARAAVF-IRNDPARPTQTGELVGMLPAPK----GKRFTMTEQQTLLSHGVATAYVE--SGVLRIQRDVT 367 (498) Q Consensus 295 ~p~~~~AAa~~a~~a~~-l~~DPArpl~tl~L~Gl~~p~~----~~r~~~~er~~lL~~Gist~~v~--~G~v~IeR~IT 367 (498) .|+ ++.+|++.|.- .+..|...--...|.|+..+.. ....+..|++.|..+||.++..- .| .++--..| T Consensus 284 ~p~---s~~~ag~~a~~d~~~g~~~span~~~~gv~~~~~~~~~~~~~~~~~~~~L~~~~i~~i~~~~~~G-~~~wG~rT 359 (477) T protein:vir:79 284 EPL---SSRAAGLRARVDLDKGYWWSSSNQQLVGVTGVERPLSAMIDDPQSDVNMLNEQGITTVFSSYGSG-LRLWGNRT 359 (477) T ss_pred ech---HHHHHHHHHHhhccCCceEccCCceeecceecccccccccCCChhhHHHHhhCCceEEEEecCCc-EEEEcccc Confidence 243 23334444321 1222322222235566643332 22335689999999999998653 34 55555555 Q ss_pred eeeecCCCCCCchhhhhhhHHHHHHHHHHHHHHHhhhcCCceeccCCCCcCCCcccccHHHHHHHHHHHHHHHhhccccc Q lcl|Aclame:pro 368 TYRKNAYGVADNSYLDSETLHTSAYVLRKLKSVITSKYGRHKLASDGTRFGPGQAIVTPAVIKGELLATYRQLERAGIVE 447 (498) Q Consensus 368 TY~~n~~G~~D~s~ldi~t~~tl~yv~~~~r~~~~~~~~r~kla~dg~~~~~g~~ivTp~~ikaeli~~~~~le~~give 447 (498) .- ....|+.|+.+.+.|+.+|+.+.++.... .|-.+.+... |-+.||..+=+.+++|..+|.+. T Consensus 360 ~~----~~~~~~~~~~i~vrR~~~~i~~~~~~~~~-~~v~e~~~~~-----------~~~~i~~~i~~~l~~l~~~g~l~ 423 (477) T protein:vir:79 360 AA----WPTVTHMRNFENVRRTGDVINESLRYFSQ-QFVDAPIDQG-----------LIDSLVESVNGFGRKLIGDGALL 423 (477) T ss_pred cC----CCCCCccceeeehhhHHHHHHHHHHHHHH-HhccCCCCHH-----------HHHHHHHHHHHHHHHHHhCCcee Confidence 41 12345679999999999999999999775 4544443222 45788999999999999999999 Q ss_pred chhhhcCeEEEEEcC-----CCCeEEEEEeeeEEecCeEEEeeeeeeEEEecccCC Q lcl|Aclame:pro 448 NYELFKQYLVVERDA-----SVPNRLNTLFPPDYVNQLRVFAVVNQFRLQYSEESA 498 (498) Q Consensus 448 n~~~~~~~lvVerd~-----~d~nRvn~~~p~~~vn~l~v~A~~~~f~lq~~~~~~ 498 (498) .+. +.+-++. -+.+|+.+.+-...+..++-|-.++++..+|=++-+ T Consensus 424 g~~-----v~~~~~~nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~~~ 474 (477) T protein:vir:79 424 GFK-----AWFDPARNPKEELAAGHLLINYKYTVPPPLERLTYETEITSEYLLTLK 474 (477) T ss_pred eeE-----EEEecCCCCHHHhhCCeEEEEEEEEecCCceeEEEEEEEechHHhhhc Confidence 853 3332221 123678888888888888776555555555444333 No 25 >protein:vir:98824 Length: 774 # NCBI annotation: putative phage tail sheath protein # Family: family:all:661 # MgeID: mge:1530 # MgeName: Ma-LMM01 # Cross-refs: genbank:acc:YP_851105;genbank:gi:117530262;genbank:GeneID:4484488 Probab=99.58 E-value=6.3e-15 Score=98.45 Aligned_cols=448 Identities=12% Similarity=0.043 Sum_probs=224.3 Q ss_pred Cc-cchhhcCcccccCeEEEEEecCCCCCCCC---CccEEEEEecCCCCccccceeEEecChHHHHHhhCc--CcHHHHH Q lcl|Aclame:pro 1 MT-ISFNTIPSNTLVPLFYAEMDNQAANTAQD---SGASLLIGHANNGAEIVANSLVLMPSADYARQICGA--GSQLARM 74 (498) Q Consensus 1 M~-i~f~~Ip~~~rvPg~y~E~dns~a~~~~~---~~~vLliGq~~~~g~~~~~~~~~v~s~~~A~~~fG~--GS~l~~M 74 (498) -. =.||+|--++.=||+|+|.=.|...+.+. .--.-++| .+..++.++|+.|+|-.|....||+ |.. ... T Consensus 271 ~~~~~~~~~~~~v~~~GVYVEEVpSGvrtIeGGV~TSVAAFVG---~A~rGPvn~PvlITS~aD~~~~Fg~~~GGl-~Ga 346 (774) T protein:vir:98 271 AGVEPFGEITRNVEDNGVVIQLEPALTGSISNRFSFYVTANDN---TANRGFTTSPALVTTIPDPAIHFTSFQGGL-DGP 346 (774) T ss_pred cccccccceEEEEecCceEEEEeCCCCccccccccceeeeecc---cccCCCCCcCEEEeehhHhhhhhccccCCc-ccc Confidence 11 24899999999999999976655433222 23345555 4556778999999999998888875 110 000 Q ss_pred HHHHHHhCCCceEEEEEecCCccceeEEEEEEeeeccCCcEEEEEEc---cEEEEEEee--cCCCHHH-HHHHHHHHHhc Q lcl|Aclame:pro 75 VEAYRQTDPFGELYVIAVPEATGAAATVTLTVTGEATESGTVNVYVG---RTRVQAPVT--NGDNVTT-IASSIQDAINA 148 (498) Q Consensus 75 ~~a~~~~n~~~~l~~i~l~d~ag~aatg~ititgtat~~G~l~l~I~---g~~v~v~V~--~gdtaa~-iA~~l~~aIn~ 148 (498) .++++ .....+....-+++...+.+.+..+++.|- ..+..+.+. .++.-.. .+......... T Consensus 347 ssA~r------------~~~~~sG~~~L~i~A~~pGawGN~ItV~I~~~t~~~~~l~v~~~~~s~f~~~~a~e~~tv~~~ 414 (774) T protein:vir:98 347 RSAFR------------DFYTFNGTPLLRLQAVSEGNWGNQVTVSIYPVNNSEFRLNVQDLNGSAFNPPLADEVYTVKLG 414 (774) T ss_pred ceeee------------eeeeecccceEEEEEeecCcCCCceEEEEEecCCceeEEEEEecCCccccccccceeEEEecc Confidence 11111 000011111112222222233333444431 111111111 1110000 00000000000 Q ss_pred CCCceEEEeeccceEEE---eeccCcccccceeE--EEEe---cccC---------ccc-ccccceeeeecccCCCcCcc Q lcl|Aclame:pro 149 VPTLPFTASSSAGVVTL---TARHKGLCGNEIPV--SLNY---YGFG---------GGE-VLPAGVQIAVATGTAGTGAP 210 (498) Q Consensus 149 ~~~lpVtA~~~~~~Vtl---TAk~kG~~gN~i~l--~~~~---~~~~---------~ge-~~p~Glt~tit~~agGag~p 210 (498) ..+.........+.+.+ ...++....|...- +... ...+ ..+ ...... +....++||...+ T Consensus 415 ~~~~~~~v~e~~dn~~i~~~~~~~~~~~in~vs~lv~~~~~~~a~~d~~~~~~~~~~~~~~~~~~~-~v~v~lagG~Dg~ 493 (774) T protein:vir:98 415 DTNESGELNALLDSKFIRGFFLPKSIDSINYDAALVRQSPLRLAPPDESETDVENPAHVDFYGPNV-LVDVTLENGYDGP 493 (774) T ss_pred cccccceeeeeeceeeEeecccccccccccccccccccchhcccccccccccccccccccccCCcc-eEEEeecCCCCcc Confidence 00000000000000000 00011110000000 0000 0000 000 000111 1222345554332 Q ss_pred -----hhhhHHHhhccCcceEEEecCCChHHHHHHHHHHhhhhhhhhhhhheeeEEEEeccCCHHHHHhhhhccCcceEE Q lcl|Aclame:pro 211 -----VLTGAVAAMADEPFDYIGLPFNDTASVNTLVTEMNDTSGRWSYARQLYGHVYTAKTGTLSELVNAGDQFNQQHIT 285 (498) Q Consensus 211 -----D~~~alaalg~~~~~~I~~p~tD~a~l~al~~~l~~~s~r~~~~~q~~g~~~~~~~gt~~~~~t~g~~~N~~~~t 285 (498) ++..+++..+....+.++.+..+..-..++-.|++.... ..+.+.++.-.+..-+..++.++....||.|.. T Consensus 494 ~tt~~~igg~~~~~~~tgi~aLl~a~~~~~V~~aii~~~e~~~~---~~~~r~avid~p~g~t~~~Ai~~r~~f~S~~aa 570 (774) T protein:vir:98 494 PVTNDDYVSIIRTLENQPVHILLVGTTNVGVQQALITEAERASD---SDGLRIAVLAAPPRTTPTLAASVTRGFNSTRAV 570 (774) T ss_pred cccchheecccccccccceeEEEcCccchhhHHHHHHHHHHhhh---cccceEEEEECCCCCCHHHHHHHHhccCCceEE Confidence 455566777788889999999998888888888765321 112344444445555677888998999999988 Q ss_pred EEecCC-------C--CCCcHHHHHHHHHHHhhhhhccCccccccceEEeccccC----CCccccChHHHHHHHhCCeeE Q lcl|Aclame:pro 286 LAGYEK-------E--TQTPADELAASRTARAAVFIRNDPARPTQTGELVGMLPA----PKGKRFTMTEQQTLLSHGVAT 352 (498) Q Consensus 286 ~~~~~~-------~--~~~p~~~~AAa~~a~~a~~l~~DPArpl~tl~L~Gl~~p----~~~~r~~~~er~~lL~~Gist 352 (498) +++... + ...|+ ++.+|+..| +.||.-..-.-.|.|+.-+ ...+..+..|++.|...|+-. T Consensus 571 l~~Pwvkv~D~~~g~~~~vPp---Sg~vAGl~A---rtDv~kSPANk~I~Givg~ai~~~l~~~~t~ae~d~Ln~~gIN~ 644 (774) T protein:vir:98 571 MVAGWFTYAGQPNSSRYGVPG---AAVYAGKLA---AIDFFVSPAARSLVGPLFNIIESDTDNYTSRSNQDIYSAARLEV 644 (774) T ss_pred EEeCcEEEeccCCCceeecCh---hHHHHHHHH---hcCcccccCCceeecceeccccccccccccchhhhhhcccccce Confidence 765310 0 11233 344445554 4566433333456666432 345667888898898888766 Q ss_pred EEE---cCCeEEEEeeeeeeeecCCCCCCchhhhhhhHHHHHHHHHHHHHHHhhhcCCceeccCCCCcCCCcccccHHHH Q lcl|Aclame:pro 353 AYV---ESGVLRIQRDVTTYRKNAYGVADNSYLDSETLHTSAYVLRKLKSVITSKYGRHKLASDGTRFGPGQAIVTPAVI 429 (498) Q Consensus 353 ~~v---~~G~v~IeR~ITTY~~n~~G~~D~s~ldi~t~~tl~yv~~~~r~~~~~~~~r~kla~dg~~~~~g~~ivTp~~i 429 (498) ..+ ..| .++--..|. ..|+.|+.|.+.|+.+|+.+.++.... .|-.+.+..+ +-+.| T Consensus 645 i~itt~g~G-~rvWG~RTl-------ssDp~wr~InVRRlfd~Ie~SI~~~~~-~~VfEPNd~~-----------l~~~I 704 (774) T protein:vir:98 645 LSLDTVDRT-YRFASGVTL-------STDPAWERIYLRRVHDVVRQGAHAILR-NYVAMPNSRL-----------VRNQI 704 (774) T ss_pred eEEEEcCCc-EEEEccccc-------CCCcccceEeehhhHHHHHHHHHHHHH-HhccCCCCHH-----------HHHHH Confidence 532 344 233323332 247889999999999999999999664 4544443322 33678 Q ss_pred HHHHHHHHHHHhhcccccchhhhcCeEEEEEc--CC---CCeEEEEEeeeEEecCeEEEeeeeeeEEEecccCC Q lcl|Aclame:pro 430 KGELLATYRQLERAGIVENYELFKQYLVVERD--AS---VPNRLNTLFPPDYVNQLRVFAVVNQFRLQYSEESA 498 (498) Q Consensus 430 kaeli~~~~~le~~given~~~~~~~lvVerd--~~---d~nRvn~~~p~~~vn~l~v~A~~~~f~lq~~~~~~ 498 (498) |..+-+.+++|..+|.+.++.. ..+..+ ++ +.+|+.+.+-..-+-.++-|=.++++.-|+.+-.- T Consensus 705 ~~sI~~fL~~L~~~GaL~G~~~----V~~D~etNt~~dI~~G~l~i~I~vaP~~PAEfIilri~q~t~~~~l~E 774 (774) T protein:vir:98 705 AAALNAFMGELKRNGNIVSFRP----AIIDGSNNSTAAYFSRELYVSLQFQPLYSADYIYVTISRDTETSPLGE 774 (774) T ss_pred HHHHHHHHHHHHhCCceecceE----EEEcCCCCCHHHhhCCEEEEEEEEEecCCcceEEEEEEEeecceeccC Confidence 9999999999999999987632 233322 21 33466666665555555544334444444333222 No 26 >protein:vir:101576 Length: 501 # NCBI annotation: gp04 # Family: family:all:396 # MgeID: mge:1477 # MgeName: Bcep43 # Cross-refs: genbank:acc:NP_958108;genbank:gi:41057654;genbank:GeneID:2716834 Probab=99.57 E-value=2.7e-13 Score=89.51 Aligned_cols=443 Identities=15% Similarity=0.073 Sum_probs=246.8 Q ss_pred cchhhcCcccccCeEEEEEecCCC---CCCCCCccEEEEEecCCCCccccceeEEecChHHHHHhhCcCcHHHHHHHHHH Q lcl|Aclame:pro 3 ISFNTIPSNTLVPLFYAEMDNQAA---NTAQDSGASLLIGHANNGAEIVANSLVLMPSADYARQICGAGSQLARMVEAYR 79 (498) Q Consensus 3 i~f~~Ip~~~rvPg~y~E~dns~a---~~~~~~~~vLliGq~~~~g~~~~~~~~~v~s~~~A~~~fG~GS~l~~M~~a~~ 79 (498) |+|+.||-+- ++.++-+.. +.+ ...-.||++ .....+++...+.+|.+++.+.||..|.-..+++.|+ T Consensus 1 m~~~~ip~s~-----iV~V~~~v~~~~~~~-~~~~~l~l~---~~~~~~~~~~~~~~s~~~V~~~FG~~S~ey~aA~~yF 71 (501) T protein:vir:10 1 MPTTTIPIDQ-----IVQMLPGVIGAGGAP-GRLTGLVLT---QDTSVQPGQLADFFQETDVENWFGALSNEAKIADAYF 71 (501) T ss_pred CCCCCcccce-----EEEEeeecccCCCcc-ccceeEEEe---ccCCCCccceEEecCHHHHHHhcCCChHHHHHHHHHh Confidence 7888888753 445555443 333 223455664 3345678888899999999999999999999999999 Q ss_pred H----hCCC-ceEEEEEecCCcc-ceeEE-EEE---EeeeccCCcEEEEEEccEEEEE--EeecCCCHHHHHHHHHHHHh Q lcl|Aclame:pro 80 Q----TDPF-GELYVIAVPEATG-AAATV-TLT---VTGEATESGTVNVYVGRTRVQA--PVTNGDNVTTIASSIQDAIN 147 (498) Q Consensus 80 ~----~n~~-~~l~~i~l~d~ag-~aatg-~it---itgtat~~G~l~l~I~g~~v~v--~V~~gdtaa~iA~~l~~aIn 147 (498) + .-|. ..|++-.-...+. ..-.| +++ ++---.-+|+++|.|+|+.... ..+...+.+++|+.|..+|+ T Consensus 72 sg~~~q~p~P~~l~igR~~~~~~~~~l~g~~l~~~~la~~~~~sg~l~vti~g~~~~~~i~ls~ats~~~vAs~i~~al~ 151 (501) T protein:vir:10 72 PGIVNGGQLPYDLKFARYVAADAPASVYGIPLTGVTLAQLQGYSGTLTVTTAAQHVSANISLAAATSFANAATLIEAAFT 151 (501) T ss_pred hhhcCCCccccEEEEEeecCCCccceEeccchhhhhhhhcceeeeEEEEeeccceeecccccccccCHHHHHHHHhhhcc Confidence 6 5555 4777766543322 11122 222 1111223599999999986553 35577788999999999997 Q ss_pred cCCCceEEEeeccceEEEeeccCcccccceeEEEEecccCcccccccceeeeeccc----CCCcCcchhhhHHHhhc--- Q lcl|Aclame:pro 148 AVPTLPFTASSSAGVVTLTARHKGLCGNEIPVSLNYYGFGGGEVLPAGVQIAVATG----TAGTGAPVLTGAVAAMA--- 220 (498) Q Consensus 148 ~~~~lpVtA~~~~~~VtlTAk~kG~~gN~i~l~~~~~~~~~ge~~p~Glt~tit~~----agGag~pD~~~alaalg--- 220 (498) .. +.+|+-.+.....++++...|.. ..|..... +.-+.+.|.++-... ..|+....+.++++++- T Consensus 152 ~~-~~tv~~d~~~~~f~its~ttG~~-~~i~~~~~------~~~la~~l~Lt~~~~a~v~~~g~~aet~~~a~~a~~~~~ 223 (501) T protein:vir:10 152 SP-DFVVAYDALRNRFTVVTNATGTA-AAISAVTG------TNNLADELGLSAAAGATLQAAGVAADTPASAMNRAVGLS 223 (501) T ss_pred CC-ceEEEEcccCceEEEEeeccCCc-eeEEEeeC------chhhhhhcCccccccceEEecCcccccHHHHHHHHHhcc Confidence 63 46777777888899999888853 33433321 111233333332211 22333334666666654 Q ss_pred cCcceEEEecCCChHHHHHHHHHHhhhhhhhhhhhheeeEEEEeccCCHH---HHHhhhh---ccCcceEEEEecCCCCC Q lcl|Aclame:pro 221 DEPFDYIGLPFNDTASVNTLVTEMNDTSGRWSYARQLYGHVYTAKTGTLS---ELVNAGD---QFNQQHITLAGYEKETQ 294 (498) Q Consensus 221 ~~~~~~I~~p~tD~a~l~al~~~l~~~s~r~~~~~q~~g~~~~~~~gt~~---~~~t~g~---~~N~~~~t~~~~~~~~~ 294 (498) ..||-|..+.--+.+...++.+|.+... +++..+........- .-..++. .-|..|..++.. .+ T Consensus 224 ~~Wy~f~~a~~~~~~~~la~A~wiea~~-------~~f~~~~~~~~~~~~~~~~~~~i~~~l~~~~y~~t~~~y~---~~ 293 (501) T protein:vir:10 224 RNWATFTTAWTAVIADRLAFAAWNSGQA-------YKYMYVAPDLEAASIVTNNAASFGAQVFAAPYQGTLPLYG---DQ 293 (501) T ss_pred CceEEEEEecCCChHHHHHHHHHHHhcC-------ceEEEEEecCchhhhhhhhhhhHHHHHHhcCCCceEEECC---CC Confidence 5676665544334455557777765432 222222222211111 1112222 235566655542 21 Q ss_pred CcHHHHHHHHHHHhhhhhccCccccccceEEeccccC-CC-ccccChHHHHHHHhCCeeEEEEc--CC-e-EEEEeeeee Q lcl|Aclame:pro 295 TPADELAASRTARAAVFIRNDPARPTQTGELVGMLPA-PK-GKRFTMTEQQTLLSHGVATAYVE--SG-V-LRIQRDVTT 368 (498) Q Consensus 295 ~p~~~~AAa~~a~~a~~l~~DPArpl~tl~L~Gl~~p-~~-~~r~~~~er~~lL~~Gist~~v~--~G-~-v~IeR~ITT 368 (498) . .++++++.++ ..|+.+..-+..++.-..| .. .+.++.+|.+.|..+|+..+..- .| + -.+++.+.+ T Consensus 294 ~----~~aa~~g~~a---s~nf~~~~g~~T~~fkq~~~Gi~a~~lt~t~a~al~~~~~N~y~~~~~~~~~~~~~~~G~~s 366 (501) T protein:vir:10 294 A----TAGAVMGYAA---SINFQLRNGRTVLAFRQFNAGVPATAHDLPTANALRSNNYTYIGAYANAANNYTIAYDGKLS 366 (501) T ss_pred c----HHHHHHHHHH---hhCcccCccceeeeccccCCCcCcccCCHHHHHHHHhcCCeEEEEeccccceeeEEecCeee Confidence 2 2345566655 4577666655555433222 22 45789999999999999998542 23 2 234555443 Q ss_pred eeecCCCCCCchhhhhhhHHHHHHHHHHHHHHHhhhc-CCceeccCCCCcCCCcccccHHHHHHHHHHHHHHHhhccccc Q lcl|Aclame:pro 369 YRKNAYGVADNSYLDSETLHTSAYVLRKLKSVITSKY-GRHKLASDGTRFGPGQAIVTPAVIKGELLATYRQLERAGIVE 447 (498) Q Consensus 369 Y~~n~~G~~D~s~ldi~t~~tl~yv~~~~r~~~~~~~-~r~kla~dg~~~~~g~~ivTp~~ikaeli~~~~~le~~give 447 (498) | .|.|+..++=.+++...++..+-..+ .-.|+--+.. | -.++++.+-+.+++-...|+|. T Consensus 367 ------G----~~~wiD~~~~~~Wl~~~iq~~l~~ll~~~~kIPyt~~----G-----~~~l~a~v~~~l~~av~nG~I~ 427 (501) T protein:vir:10 367 ------G----KFLWVDTYLDQIYLNAELQRAEFEAMLAYNSLPYNED----G-----YTALYRAGVDVIDAAVTSGIIR 427 (501) T ss_pred ------c----cceeehhhhhHHHHHHHHHHHHHHHHHhcCCcccCHH----H-----HHHHHHHHHHHHHHHHhCceee Confidence 2 37888888888988888888875444 2223332221 1 2679999999999999999884 Q ss_pred chhhhcCeEEEEEc--------CCC-CeEEEEEeeeE--EecCeEEEeeeeeeEEEecccCC Q lcl|Aclame:pro 448 NYELFKQYLVVERD--------ASV-PNRLNTLFPPD--YVNQLRVFAVVNQFRLQYSEESA 498 (498) Q Consensus 448 n~~~~~~~lvVerd--------~~d-~nRvn~~~p~~--~vn~l~v~A~~~~f~lq~~~~~~ 498 (498) -=...-..-..+.+ .+| .+|--++.+.. -.-|-|.--..-.+.|-|...-| T Consensus 428 ~Gv~l~~~q~~~i~~~~g~~~~~~~v~~~Gyy~~~~~~~~~~~~R~~R~~p~~~~~y~~~ga 489 (501) T protein:vir:10 428 AGVTLTNSQLQQIDAAAGVAGAGQLVQTRGWYFLIGDPANPGQARQNRTTPACTLWYSDGGS 489 (501) T ss_pred cCCCCCcccceeeccccCccccccceeccceeEeeccccCChhhhhhccccceEEEEEeCCc Confidence 31111111111111 111 11211111111 11122222222333444444444 No 27 >protein:vir:107865 Length: 477 # NCBI annotation: gp39 # Family: family:all:115 # MgeID: mge:1565 # MgeName: BcepMu # Cross-refs: genbank:acc:YP_024712;genbank:gi:48696949;genbank:GeneID:2845953 Probab=99.56 E-value=2.7e-14 Score=94.95 Aligned_cols=432 Identities=14% Similarity=0.083 Sum_probs=203.0 Q ss_pred cCcccccCeEEEEEecCCCCCCC--CCccEEEEEecCCCCccccceeEEecChHHHHHhhCc--CcHHHHHHHHHHHhCC Q lcl|Aclame:pro 8 IPSNTLVPLFYAEMDNQAANTAQ--DSGASLLIGHANNGAEIVANSLVLMPSADYARQICGA--GSQLARMVEAYRQTDP 83 (498) Q Consensus 8 Ip~~~rvPg~y~E~dns~a~~~~--~~~~vLliGq~~~~g~~~~~~~~~v~s~~~A~~~fG~--GS~l~~M~~a~~~~n~ 83 (498) .|... .||+|+|-.++.+.+.. ..--..+||- +-.++.++|++|+|..++..++|. .+-++.-+++++. |. T Consensus 1 M~~~~-~pGVyv~E~~~~~~~i~~v~T~v~~~VG~---a~~gp~n~pv~its~~d~~~~g~~~~~~tL~~Av~~~f~-nG 75 (477) T protein:vir:10 1 MAANY-LHGVETIEKETGSRPVKVVKSAVIGLIGT---APIGPVNTPVQSLSDVDAAQFGPQLAGFTIPQALDAVYD-YG 75 (477) T ss_pred CcccC-CCCeEEEEccCCcccccccCCceeEEEec---ccCCCCCcCEEEccHHHHHHhccCCCCCcHHHHHHHHHh-cc Confidence 55544 59999997776664332 2334567773 445678999999999999876664 3566777777776 66 Q ss_pred CceEEEEEecCCccceeEEEEEEeeeccCCcEEE-EEEccEEEEEEeecC-CCHHHHHHHHHHHHhcCCCceEEEeeccc Q lcl|Aclame:pro 84 FGELYVIAVPEATGAAATVTLTVTGEATESGTVN-VYVGRTRVQAPVTNG-DNVTTIASSIQDAINAVPTLPFTASSSAG 161 (498) Q Consensus 84 ~~~l~~i~l~d~ag~aatg~ititgtat~~G~l~-l~I~g~~v~v~V~~g-dtaa~iA~~l~~aIn~~~~lpVtA~~~~~ 161 (498) -...|++.+.+....+++-.-...+.....+... ..+......+....+ .+.........+.++.............+ T Consensus 76 g~~~~vVrV~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 155 (477) T protein:vir:10 76 SGTVIVINVLDPAVHKSNAANEPVTFDAATGRAKLAHPAAANLVLKNDSGGTTYAEGTDYAVDLINGVITRIKTGTIPPG 155 (477) T ss_pred ceEEEEEecCccccccccccccccccccccceecccccccccccccccccccccccchhhhhhhccccceeccccccccc Confidence 7899999997653322221111111111111111 001111111111111 11111111111111111110000000111 Q ss_pred eEEEeeccCcccccceeEEEEecccCcccccccceeeeecccCCCcCcchhhhHHHhhccCcceEEEecC-CCh-HHHHH Q lcl|Aclame:pro 162 VVTLTARHKGLCGNEIPVSLNYYGFGGGEVLPAGVQIAVATGTAGTGAPVLTGAVAAMADEPFDYIGLPF-NDT-ASVNT 239 (498) Q Consensus 162 ~VtlTAk~kG~~gN~i~l~~~~~~~~~ge~~p~Glt~tit~~agGag~pD~~~alaalg~~~~~~I~~p~-tD~-a~l~a 239 (498) ...+......... ....+.-+.-........+|-..+..+....+... .+++.|. +.. +-..+ T Consensus 156 ~~~~~~~~~~~~~--------------~~~~~~~~~g~~~~~~~~tGl~al~~~~~~~~~~~-~~l~apg~~~~~~v~~~ 220 (477) T protein:vir:10 156 ATAAKATYDYADP--------------TKVTAADIIGAVNAAGMRTGMKALKDTYNLYGYFS-KILIAPAYCTQNSVSVE 220 (477) T ss_pred ceeeeeccccccc--------------cccccccccccccccchhhhhhhhhhhhhhcchhc-ccccccccccchhhHHH Confidence 1111111100000 00000000000000000011111222222222222 3333332 221 22333 Q ss_pred HHHHHhhhhhhhhhhhheeeEEEEecc-CC-HHHHHhhh-------hccCcceEEEEecC--------C-CCCCcHHHHH Q lcl|Aclame:pro 240 LVTEMNDTSGRWSYARQLYGHVYTAKT-GT-LSELVNAG-------DQFNQQHITLAGYE--------K-ETQTPADELA 301 (498) Q Consensus 240 l~~~l~~~s~r~~~~~q~~g~~~~~~~-gt-~~~~~t~g-------~~~N~~~~t~~~~~--------~-~~~~p~~~~A 301 (498) |.++.+ ++ .++++.... ++ .+++.++- ...|+.++.+.+.. + ....|+ + T Consensus 221 l~~~~~----~~------~~~~~~d~p~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~---s 287 (477) T protein:vir:10 221 LEAMAV----QL------GAIAYIDAPIGTTLAQALAGRGPAGTINFNTSSDRVRLCYPHVKVYDTATNAERLEPL---S 287 (477) T ss_pred HHHHHh----hC------CEEEEEecCCCCCHHHHHhhhhhccccccccccceEEEEcCeEEEecccCCceeEEch---H Confidence 444332 22 234443332 22 33333322 24567777665421 0 011233 2 Q ss_pred HHHHHHhhhh-hccCccccccceEEeccccCCC----ccccChHHHHHHHhCCeeEEEEc--CCeEEEEeeeeeeeecCC Q lcl|Aclame:pro 302 ASRTARAAVF-IRNDPARPTQTGELVGMLPAPK----GKRFTMTEQQTLLSHGVATAYVE--SGVLRIQRDVTTYRKNAY 374 (498) Q Consensus 302 Aa~~a~~a~~-l~~DPArpl~tl~L~Gl~~p~~----~~r~~~~er~~lL~~Gist~~v~--~G~v~IeR~ITTY~~n~~ 374 (498) +.+|++.|.- .+..|...--...|.|+.-+.. ....+..|++.|..+||.++.-- .| .++--..|.- . T Consensus 288 ~~~ag~~a~~d~~~g~~~span~~~~gi~~~~~~~~~~~~~~~~~~~~L~~~gi~~i~~~~~~G-~~~wG~rT~~----~ 362 (477) T protein:vir:10 288 SRAAGLRARVDLDKGYWWSSSNQQLVGVTGVERPLSAMIDDPQSDVNMLNEQGITTVFSSYGSG-LRLWGNRTAA----W 362 (477) T ss_pred HHHHHHHHHhhhcCCceeccCCceeccccccccccccccCCChhhHHHHhhCCceEEEEecCCc-EEEEcccccC----C Confidence 3333333321 1222311111234455543322 23346789999999999998753 34 5555555541 1 Q ss_pred CCCCchhhhhhhHHHHHHHHHHHHHHHhhhcCCceeccCCCCcCCCcccccHHHHHHHHHHHHHHHhhcccccchhhhcC Q lcl|Aclame:pro 375 GVADNSYLDSETLHTSAYVLRKLKSVITSKYGRHKLASDGTRFGPGQAIVTPAVIKGELLATYRQLERAGIVENYELFKQ 454 (498) Q Consensus 375 G~~D~s~ldi~t~~tl~yv~~~~r~~~~~~~~r~kla~dg~~~~~g~~ivTp~~ikaeli~~~~~le~~given~~~~~~ 454 (498) ...|+.|+.+.+.|+.+|+.+.++..+. .|-.+.+... |-+.|+..+=+.++.|..+|.+..+. T Consensus 363 ~~~~~~~~~~~vrR~~~~i~~~~~~~~~-~~v~~~~~~~-----------~~~~i~~~i~~~l~~l~~~g~l~g~~---- 426 (477) T protein:vir:10 363 PTVTHMRNFENVRRTGDVINESLRYFSQ-QFVDAPIDQG-----------LIDSLVESVNGFGRKLIGDGALLGFK---- 426 (477) T ss_pred CCCCcccceeehhhHHHHHHHHHHHHHH-HhccCCCCHH-----------HHHHHHHHHHHHHHHHHhCCceeeeE---- Confidence 2346779999999999999999999775 3443333221 45788899999999999999998753 Q ss_pred eEEEEEcCC-----CCeEEEEEeeeEEecCeEEEeeeeeeEEEecccCC Q lcl|Aclame:pro 455 YLVVERDAS-----VPNRLNTLFPPDYVNQLRVFAVVNQFRLQYSEESA 498 (498) Q Consensus 455 ~lvVerd~~-----d~nRvn~~~p~~~vn~l~v~A~~~~f~lq~~~~~~ 498 (498) +.+.++.+ +.+++.+.+-...+..++- |.|+++++.+.- T Consensus 427 -v~~~~~~nt~~~i~~G~~~~~i~~~p~~p~e~----i~~~~~~~~~~~ 470 (477) T protein:vir:10 427 -AWFDPARNPKEELAAGHLLINYKYTVPPPLER----LTYETEITSEYL 470 (477) T ss_pred -EEEecCCCCHHHhhCCeEEEEEEEEecCCcce----EEEEEEEcchHH Confidence 33333321 2257888877777777665 455555544333 No 28 >protein:vir:96104 Length: 504 # NCBI annotation: hypothetical protein ORF029 # Family: family:all:396 # MgeID: mge:1597 # MgeName: F8 # Cross-refs: genbank:acc:YP_001294446;genbank:gi:149408343;genbank:GeneID:5237223 Probab=99.54 E-value=6.7e-13 Score=87.32 Aligned_cols=446 Identities=13% Similarity=0.037 Sum_probs=231.7 Q ss_pred ccchhhcCcccccCeEEEEEecCCCCCCCC---CccEEEEEecCCCCccccceeEEecChHHHHHhhCcCcHHHHHHHHH Q lcl|Aclame:pro 2 TISFNTIPSNTLVPLFYAEMDNQAANTAQD---SGASLLIGHANNGAEIVANSLVLMPSADYARQICGAGSQLARMVEAY 78 (498) Q Consensus 2 ~i~f~~Ip~~~rvPg~y~E~dns~a~~~~~---~~~vLliGq~~~~g~~~~~~~~~v~s~~~A~~~fG~GS~l~~M~~a~ 78 (498) =|+.+. ++.++-+....+.. ---.|+++.. .-.+.+......|.++..+.||..|.-..+++.| T Consensus 1 mip~s~----------iV~V~~~v~~~~~~~~~~~~~l~l~~~---~~~~~~r~~~y~s~~~V~~~FG~~S~ey~aA~~y 67 (504) T protein:vir:96 1 MISQSR----------YIRIISGVGAGAPVAGRKLILRVMTTN---NVIPPGIVIEFDNANAVLSYFGAQSEEYQRAAAY 67 (504) T ss_pred CCCccc----------eeEeeecccccccccccccceeEeecc---cCCCccceEEecCHHHHHHhcCCChHHHHHHHHH Confidence 123322 34454444433333 3335666542 2344577777778899999999999999999999 Q ss_pred HHhCC-----CceEEEEEecCCcccee--EEEEEEe---eeccCCcEEEEEEccEEEEEE---eecCCCHHHHHHHHHHH Q lcl|Aclame:pro 79 RQTDP-----FGELYVIAVPEATGAAA--TVTLTVT---GEATESGTVNVYVGRTRVQAP---VTNGDNVTTIASSIQDA 145 (498) Q Consensus 79 ~~~n~-----~~~l~~i~l~d~ag~aa--tg~itit---gtat~~G~l~l~I~g~~v~v~---V~~gdtaa~iA~~l~~a 145 (498) ++..| -..|++-.-.+.+..+. .+.++-+ =.+-.+|+++|.|+|....+. .....+-+.+|+.|..+ T Consensus 68 F~~~~~~~~~P~~l~igR~~~~a~~~~l~g~~~~~~~~~~~~i~~G~lsitv~G~~~~~~~i~~S~~ts~~~vA~~i~~a 147 (504) T protein:vir:96 68 FKFISKSVNSPSSISFARWVNTAIAPMVVGDNLPKTIADFAGFSAGVLTIMVGAAEKNITAIDTSAATSMDNVASIIQTE 147 (504) T ss_pred hhcCCCCCccccEEEEEeecCcCccceEEechhHHHHHHHhhhhceEEEEEEcceeeeecccccccccchHHHHHHHHhh Confidence 99865 35777766443221111 1111100 012356999999999877644 44555567888888888 Q ss_pred HhcCCC-----ceEEEeeccceEEEeeccCcccccceeEEEEecccCcccccccceeeeecccCCCcCcchhhhHHHhhc Q lcl|Aclame:pro 146 INAVPT-----LPFTASSSAGVVTLTARHKGLCGNEIPVSLNYYGFGGGEVLPAGVQIAVATGTAGTGAPVLTGAVAAMA 220 (498) Q Consensus 146 In~~~~-----lpVtA~~~~~~VtlTAk~kG~~gN~i~l~~~~~~~~~ge~~p~Glt~tit~~agGag~pD~~~alaalg 220 (498) +++... ..|+....++..++|..-.|+.. ...........-...-|++..-.....|..-..+.++++++- T Consensus 148 l~~~~~~~~~~~tv~~d~~~~~f~its~~tg~~~----~~~~~~a~~~~~~~~lgl~~~~~~~v~g~~aet~~~al~al~ 223 (504) T protein:vir:96 148 IRKNTDPQLAQATVTWNPNTNQFTLVGATIGTGV----LAVAKSADPQDMSTALGWSTSNVVNVAGQAADLPDAAVAKST 223 (504) T ss_pred hhcccccccccceEEEeccCCeEEEEeeccccce----eEEEeeccccchhhhhhcccccceEEeecccccHHHHHHHHH Confidence 876643 24566667788888887766432 222111100000111133322111123444445666776665 Q ss_pred c---CcceEEEe-cCCChHHHHHHHHHHhhhhhhhhhhhheeeEEEEeccCCHHHHHhhhhccCcceEEEEecCCCCCCc Q lcl|Aclame:pro 221 D---EPFDYIGL-PFNDTASVNTLVTEMNDTSGRWSYARQLYGHVYTAKTGTLSELVNAGDQFNQQHITLAGYEKETQTP 296 (498) Q Consensus 221 ~---~~~~~I~~-p~tD~a~l~al~~~l~~~s~r~~~~~q~~g~~~~~~~gt~~~~~t~g~~~N~~~~t~~~~~~~~~~p 296 (498) + .||-+.+. ...+.+...++..|.+... +++..+.....++-......-.. +..+..++....... + T Consensus 224 ~~~~~Wy~f~~a~~~~~dd~ilalA~w~ea~~-------~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~-~ 294 (504) T protein:vir:96 224 NVSNNFGSFLFAGATLDNDQIKAVSAWNAAQN-------NQFIYTVATSLANLGALFDLVKG-NSGTALNVLSATASN-D 294 (504) T ss_pred hhcCCeEEEEEEeccCCHHHHHHHHHHHhhcC-------ceEEEEEeecccchhhHHHhhhh-cceeEEEEeecCccc-h Confidence 4 45555443 2233445567777765422 23333333333333322222122 233333433322221 1 Q ss_pred HHHHHHHHHHHhhhhhccCccc-----cccceEEeccccCCCccccChHHHHHHHhCCeeEEEEc--CC--eEEEEeeee Q lcl|Aclame:pro 297 ADELAASRTARAAVFIRNDPAR-----PTQTGELVGMLPAPKGKRFTMTEQQTLLSHGVATAYVE--SG--VLRIQRDVT 367 (498) Q Consensus 297 ~~~~AAa~~a~~a~~l~~DPAr-----pl~tl~L~Gl~~p~~~~r~~~~er~~lL~~Gist~~v~--~G--~v~IeR~IT 367 (498) +..+ ..++.++ ..|+.+ ++.--.|+|+.|. .++.+|.+.|..+|+..+..- .| .-.+.+.++ T Consensus 295 -~~~~-~~~~~~a---s~~f~~~ng~~T~~fk~l~GVta~----~lt~t~~~aL~~~~~N~y~~~a~~~~~~~~~~~G~~ 365 (504) T protein:vir:96 295 -FVEQ-CPSEILA---ATNYDEPGASQNYMYYQFPGRNIT----VSDDTAANTVDKSRGNYIGVTQANGQQLAFYQRGIL 365 (504) T ss_pred -hHHH-HHHHHHH---hcCcCcccccccccccccCCcCcc----cCCHHHHHHHHhcCCeEEEEeecccceeeEEecCee Confidence 1211 1122222 344433 3444456777643 589999999999999998542 23 234667777 Q ss_pred eeeecCCCCCCchhhhhhhHHHHHHHHHHHHHHHhhhcCC-ceeccCCCCcCCCcccccHHHHHHHHHHHHHHHhhcccc Q lcl|Aclame:pro 368 TYRKNAYGVADNSYLDSETLHTSAYVLRKLKSVITSKYGR-HKLASDGTRFGPGQAIVTPAVIKGELLATYRQLERAGIV 446 (498) Q Consensus 368 TY~~n~~G~~D~s~ldi~t~~tl~yv~~~~r~~~~~~~~r-~kla~dg~~~~~g~~ivTp~~ikaeli~~~~~le~~giv 446 (498) + .|. ..|.||..++-.+++...++..+-.-|-. .|+-=+.. --.+|++.+-+.+++-...|+| T Consensus 366 ~-----gG~--~~~~wiDv~~~~~WL~~~lq~~l~~l~~~~~kIPyt~~---------Gi~~l~a~i~~vl~~av~~G~I 429 (504) T protein:vir:96 366 C-----GGP--TDAVDMNVYANEIWLKSAIAQALLDLFLNVNAVPASMV---------GEAMTLAVLQPVLDKATSNGTF 429 (504) T ss_pred e-----CCc--cccchhhhhhhHHHHHHHHHHHHHHHHhcCCCcccCHh---------hHHHHHHHHHHHHHHHHhccee Confidence 6 121 14788888889899999999888655522 13222211 1268999999999999999987 Q ss_pred cc-hh---hhcCeEEEEEcCC----CC-eEEE-EEeee--EEecCeEEEeeeeeeEEEecccCC Q lcl|Aclame:pro 447 EN-YE---LFKQYLVVERDAS----VP-NRLN-TLFPP--DYVNQLRVFAVVNQFRLQYSEESA 498 (498) Q Consensus 447 en-~~---~~~~~lvVerd~~----d~-nRvn-~~~p~--~~vn~l~v~A~~~~f~lq~~~~~~ 498 (498) .- +. ..+..+.-+...+ |- +|-- +.+|+ +.--+-|---..-.+.+-|...-| T Consensus 430 ~~Gv~~~~~q~~~I~~~~~~d~~~~~~~~~GYyv~~~~~s~~s~~~r~~R~~~~~~~~y~~~ga 493 (504) T protein:vir:96 430 TYGKDISAVQQQYITQITGDRRAWRQVQTLGYWINITFSSYTNSNTGLTEWKANYTLIYSKGDA 493 (504) T ss_pred ccCccCCccchheecccccccccccceeccceEEEecChhccChhHhhhccccceEEEEEECCe Confidence 32 11 1112222111111 00 1111 22222 111122222223334444555544 No 29 >protein:vir:101187 Length: 663 # NCBI annotation: tail sheath protein # Family: family:all:661 # MgeID: mge:1582 # MgeName: 44RR2.8t # Cross-refs: genbank:acc:NP_932509;genbank:gi:37651635;genbank:GeneID:2610680 Probab=99.51 E-value=3.5e-13 Score=88.88 Aligned_cols=443 Identities=11% Similarity=0.045 Sum_probs=221.1 Q ss_pred cccccCeEEEE-EecCCCCCCCCCccEEEEEecCCCCccccceeEEecChHHHHHhhCc---CcHHHHHHHHHHHhCCCc Q lcl|Aclame:pro 10 SNTLVPLFYAE-MDNQAANTAQDSGASLLIGHANNGAEIVANSLVLMPSADYARQICGA---GSQLARMVEAYRQTDPFG 85 (498) Q Consensus 10 ~~~rvPg~y~E-~dns~a~~~~~~~~vLliGq~~~~g~~~~~~~~~v~s~~~A~~~fG~---GS~l~~M~~a~~~~n~~~ 85 (498) -+...||+|+| +|-++.-.........++|. ...++.++|++|+|..|-...||. ++-+..|++.|+. |--. T Consensus 1 ~~~~~PgVyv~e~~~~~~i~~v~ts~~~fvG~---~~~Gp~~~p~~i~s~~d~~~~fG~~~~~~~~~~~v~~~f~-ngg~ 76 (663) T protein:vir:10 1 MALLSPGIEMKETSINSTVVRSATGRAAIVGK---FAWGPAYEVRQVTNEVELVDMFGSPDNVTAPYFMSAMNFL-QYGN 76 (663) T ss_pred CceecCceEEEEecCcccccccCccceeEEee---eccCCCCccEEecCHHHHHHHhCCcCccchhHHHHHHHHH-hCCC Confidence 45788999998 43233312223446677774 456788999999999999999998 7888888988885 4556 Q ss_pred eEEEEEecCCccceeE------EEEEEeeecc---CCcEEEEEE----------------ccEEEEEEeecC-------- Q lcl|Aclame:pro 86 ELYVIAVPEATGAAAT------VTLTVTGEAT---ESGTVNVYV----------------GRTRVQAPVTNG-------- 132 (498) Q Consensus 86 ~l~~i~l~d~ag~aat------g~ititgtat---~~G~l~l~I----------------~g~~v~v~V~~g-------- 132 (498) .+|++.+.+....+++ ..+++.+... .+-.+.+.+ ++.++.+.+... T Consensus 77 ~~~vvRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~g~~~~v~~~~~~~~~~~~~~~~~~n~~~~~v~~~~a~~~~~~~~ 156 (663) T protein:vir:10 77 DLRLVRVIDMEKAKNASPLVNQVSVTITTEGQGYTVGDAITVKYNNATITEAGKVTAVDSDGKIKSLFVPTAEIIAKTRQ 156 (663) T ss_pred eEEEEEccCCcccccccccCCcceeeeeccccCccccccccccccccccccccceeeeccCCceEEEEeccccccccccc Confidence 8999999764222111 1222221110 000111111 111111111100 Q ss_pred ------------------CCHHHHHHHHHHHHhcCCCceE---------------EEeeccceEEEeeccCcccccceeE Q lcl|Aclame:pro 133 ------------------DNVTTIASSIQDAINAVPTLPF---------------TASSSAGVVTLTARHKGLCGNEIPV 179 (498) Q Consensus 133 ------------------dtaa~iA~~l~~aIn~~~~lpV---------------tA~~~~~~VtlTAk~kG~~gN~i~l 179 (498) .....-+..+...++....+.. +.+.......++++..|.+||.+.+ T Consensus 157 v~~~~~~~~~~~~~~~~~~~~~~~~~~v~~vv~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~a~~~G~~Gn~i~v 236 (663) T protein:vir:10 157 LGTYPTLGDNWRIDVSGASGGSAAALALGNIVVDSGVTFGNSEDAPAVMTSPAVMEKYAKFGMPLVSAVYPGEIGSTVEV 236 (663) T ss_pred cceeeeccccceeEeeeccccccccccccceecccceeeEeeccccccccccchhhhcccccceeeeeecccccccceeE Confidence 0000000011111221111110 0111223456788888999888776 Q ss_pred EEEecccCc----------------------------------------------------cc------------ccccc Q lcl|Aclame:pro 180 SLNYYGFGG----------------------------------------------------GE------------VLPAG 195 (498) Q Consensus 180 ~~~~~~~~~----------------------------------------------------ge------------~~p~G 195 (498) .+....... +. ....| T Consensus 237 ~i~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~~~~~~~~~~~~~~~~ 316 (663) T protein:vir:10 237 EIVSKTAFNSGAQQTIYPFGGTRTSNARGVIQYGPMTDDQFAIIVRRDGIVVESTVLSTRKGDRDVYGSNIFMDDYFRNG 316 (663) T ss_pred EecccccccccccccccccccccccccceeeeeccccccceeEEEecCCcceeeeeeeecccccccchhhhhhhhhhccC Confidence 533210000 00 00000 Q ss_pred ee--------------eeecccCCCcC------cchhhhHHHhhccC---cceEEEecCCC---hH----HHHHHHHHHh Q lcl|Aclame:pro 196 VQ--------------IAVATGTAGTG------APVLTGAVAAMADE---PFDYIGLPFND---TA----SVNTLVTEMN 245 (498) Q Consensus 196 lt--------------~tit~~agGag------~pD~~~alaalg~~---~~~~I~~p~tD---~a----~l~al~~~l~ 245 (498) .. ......+||.. +.|+..+++.+.+. ..+++++|-.+ .. -..+|-.|.+ T Consensus 317 ~~~~~~~~~~~~~~~~~~~~~l~gg~d~~~~~~~~~~~~~~~~l~~~~~~~~~~~i~~~~~~~~~~~~~~v~~al~~~a~ 396 (663) T protein:vir:10 317 GSNFIFASSEGWPAGFTGIIQLGGGTSANADVGADELIKGWDLFSDREALHVNLMIAGACGSDGAEIASTVQKYVVSLAD 396 (663) T ss_pred cceEEEEeecccCccccceeEcccccCCCccccchhhHHHHHhhhcccccceeEEEeccCCCCchhhHHHHHHHHHHHHH Confidence 00 00112334443 23455566666542 44566664322 11 2333444433 Q ss_pred hhhhhhhhhhheeeEEEEeccC---------CHHHHHhh-------------hhccCcceEEEEecC--------CC-CC Q lcl|Aclame:pro 246 DTSGRWSYARQLYGHVYTAKTG---------TLSELVNA-------------GDQFNQQHITLAGYE--------KE-TQ 294 (498) Q Consensus 246 ~~s~r~~~~~q~~g~~~~~~~g---------t~~~~~t~-------------g~~~N~~~~t~~~~~--------~~-~~ 294 (498) . | |.+.++. .+..+ ++.++.++ ...+++.+..+.+.. +. .. T Consensus 397 ~---~----~~~~ai~-d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~l~~P~~~~~d~~~~~~~~ 468 (663) T protein:vir:10 397 D---R----QDCVAIV-NPPAELMVGIPTSTAVKNIVEWRNGMTGSGEVVDNNMNISSTYAFIIGNYKYQYDKYNDINRW 468 (663) T ss_pred h---h----CCEEEEE-ecCcccccccccccchHHHHHHHHhccccccchhhccccCcceEEEEccceEEecccCCceEE Confidence 2 1 1122221 22222 23333333 223456666655421 11 11 Q ss_pred CcHHHHHHHHHHHhhhh-hccCcc-cccc-ce-EEeccccCCCccccChHHHHHHHhCCeeEEEE--c-CCeEEEEeeee Q lcl|Aclame:pro 295 TPADELAASRTARAAVF-IRNDPA-RPTQ-TG-ELVGMLPAPKGKRFTMTEQQTLLSHGVATAYV--E-SGVLRIQRDVT 367 (498) Q Consensus 295 ~p~~~~AAa~~a~~a~~-l~~DPA-rpl~-tl-~L~Gl~~p~~~~r~~~~er~~lL~~Gist~~v--~-~G~v~IeR~IT 367 (498) .|+.. .+|+..|.. .+..|- .|-+ .+ .+.|+ ...+..++..|++.|..+||.++.. + +| .++-=.-| T Consensus 469 ~p~s~---~vAGl~Ar~D~~~g~~~sPan~~~~~i~g~--~~~~~~~~~~~~~~Ln~~gin~i~~~~~~~G-~~~wG~rT 542 (663) T protein:vir:10 469 VPLAA---DIAGLCAYTDQVSHPWMSPAGYRRGQIRNC--IKLAIEPKQSMRDTMYQVAINPVTGFAGGDG-FVLFGDKM 542 (663) T ss_pred echhH---HHHHHHHHhhccCCceEccCCceecccccc--ccceeccChhHHHHHhhCCceEEEEEeCCCc-EEEEcccc Confidence 34433 333443321 111221 1211 11 23343 3446678999999999999987653 2 35 33333333 Q ss_pred eeeecCCCCCC-chhhhhhhHHHHHHHHHHHHHHHhhhcCCceeccCCCCcCCCcccccHHHHHHHHHHHHHHHhhcccc Q lcl|Aclame:pro 368 TYRKNAYGVAD-NSYLDSETLHTSAYVLRKLKSVITSKYGRHKLASDGTRFGPGQAIVTPAVIKGELLATYRQLERAGIV 446 (498) Q Consensus 368 TY~~n~~G~~D-~s~ldi~t~~tl~yv~~~~r~~~~~~~~r~kla~dg~~~~~g~~ivTp~~ikaeli~~~~~le~~giv 446 (498) . ..| ..|+.|...|+.+|+++.++.... .|-.+.+... +-+.||..+=..+++|..+|.+ T Consensus 543 ~-------s~~~s~~~~i~vrR~~~~i~~si~~~~~-~~v~e~n~~~-----------l~~~i~~~i~~~L~~l~~~gal 603 (663) T protein:vir:10 543 A-------TQVPSPFDRINVRRLFNMLKKNIGDTSK-YELFENNDAF-----------TRQSFRMETSQYLDGIRSLGGC 603 (663) T ss_pred c-------CCCCcccceEehhhHHHHHHHHHHHHHH-HhccCCCCHH-----------HHHHHHHHHHHHHHHHHhcCce Confidence 2 123 369999999999999999998664 4444433221 3467899999999999999999 Q ss_pred cchhhhcCeEEEEE--cCC---CCeEEEEEeeeEEecCeEEEeeeeeeEEEecccCC Q lcl|Aclame:pro 447 ENYELFKQYLVVER--DAS---VPNRLNTLFPPDYVNQLRVFAVVNQFRLQYSEESA 498 (498) Q Consensus 447 en~~~~~~~lvVer--d~~---d~nRvn~~~p~~~vn~l~v~A~~~~f~lq~~~~~~ 498 (498) ..+. +.+-+ |++ +.+|+.+.+-...+...+-| .|++|....-+ T Consensus 604 ~g~~-----v~~d~~~nt~~~i~~G~~~~~i~~~p~~pae~i----~~~~~~~~~~~ 651 (663) T protein:vir:10 604 YDFR-----VVCDTTNNTPNVIDRNEFVGTIYVKPPRSINYI----TLNMVATSTGA 651 (663) T ss_pred eeeE-----EEEcCCCCCHHHhhCCeEEEEEEEEecCCcceE----EEEEEEeecCc Confidence 8853 23322 122 34677777766666665543 45555333333 No 30 >protein:vir:99586 Length: 507 # NCBI annotation: hypothetical protein # Family: family:all:396 # MgeID: mge:1544 # MgeName: BcepF1 # Cross-refs: genbank:acc:YP_001039795;genbank:gi:126011045;genbank:GeneID:4818281 Probab=99.51 E-value=1.5e-12 Score=85.46 Aligned_cols=445 Identities=14% Similarity=0.092 Sum_probs=240.7 Q ss_pred ccchhhcCcccccCeEEEEEecCCC---CCCCCCccEEEEEecCCCCccccceeEEecChHHHHHhhCcCcHHHHHHHHH Q lcl|Aclame:pro 2 TISFNTIPSNTLVPLFYAEMDNQAA---NTAQDSGASLLIGHANNGAEIVANSLVLMPSADYARQICGAGSQLARMVEAY 78 (498) Q Consensus 2 ~i~f~~Ip~~~rvPg~y~E~dns~a---~~~~~~~~vLliGq~~~~g~~~~~~~~~v~s~~~A~~~fG~GS~l~~M~~a~ 78 (498) =|+.+. ++.++-+.. +.+...--.||++. ....+.+......|.++..+.||..|.-..+++.| T Consensus 1 mip~s~----------iVnV~~~v~~~a~~~~~~~~~lilt~---~~~~~~~r~~~y~s~~~V~~~FG~~S~ey~aA~~y 67 (507) T protein:vir:99 1 MISQSR----------YVRIVSGVGAGAPVAQRRLIMRVMTT---NAVLPPGVVFESSSADAVGAYFGMASEEYKRAKAY 67 (507) T ss_pred CCCccc----------eeEEeeeccccCcccccccceeeecc---ccCCCccceEeecCHHHHHHhcCCChHHHHHHHHH Confidence 122222 344444433 34444456677754 22345677777789999999999999999999999 Q ss_pred HHhCC-----CceEEEEEecCCccce--eEEEEE---EeeeccCCcEEEEEEccEEEEE---EeecCCCHHHHHHHHHHH Q lcl|Aclame:pro 79 RQTDP-----FGELYVIAVPEATGAA--ATVTLT---VTGEATESGTVNVYVGRTRVQA---PVTNGDNVTTIASSIQDA 145 (498) Q Consensus 79 ~~~n~-----~~~l~~i~l~d~ag~a--atg~it---itgtat~~G~l~l~I~g~~v~v---~V~~gdtaa~iA~~l~~a 145 (498) ++..| -..|++-.-...+-.+ ..++++ ..=.+-.+|+++|.|+|..+++ ......+.+++|+.|..+ T Consensus 68 Fsq~p~~~~~P~~L~igR~~~~~~~a~l~g~~~~~~l~~~~~~~~G~lti~v~G~~~t~~~i~lS~~ts~~~vAs~i~~~ 147 (507) T protein:vir:99 68 MSFISKSINSPSYISFARWVNAAIASMIVGDSLVKNLPALKAVATPTLSLSIGGTVVPIAGIDLTAALTLTDVAATLQTK 147 (507) T ss_pred hccCCCCCcccceEEEEeecCccccceeecchhhhhHHHHhhhcceeEEEEEcCceeEeccccccccCCHHHHHHHHHHh Confidence 99887 4577776654322111 112222 1112346899999999987664 557888999999999999 Q ss_pred HhcCCCc-----eEEEeeccceEEEeeccCcccccceeEEEEecccCccccccc--ceeeeecccCCCcCcchhhhHHHh Q lcl|Aclame:pro 146 INAVPTL-----PFTASSSAGVVTLTARHKGLCGNEIPVSLNYYGFGGGEVLPA--GVQIAVATGTAGTGAPVLTGAVAA 218 (498) Q Consensus 146 In~~~~l-----pVtA~~~~~~VtlTAk~kG~~gN~i~l~~~~~~~~~ge~~p~--Glt~tit~~agGag~pD~~~alaa 218 (498) |.+..++ -|+-...+...++++...|+.. .++...... .|..... |++-+-.....|+...++.+++++ T Consensus 148 l~a~~~~~~~~~tv~~d~~~~~F~v~s~~tG~~s-~i~~at~~~---~gt~~s~l~~~~~~~a~~~~g~~aet~~~a~~a 223 (507) T protein:vir:99 148 IRASANAELATATVTFNTTTNQFVLNGTTTGALA-PTITAVRTD---PATDISSLLGWTNTGTVFVKGQAAETPDTSISK 223 (507) T ss_pred hhccccccccceEEEEecCCceEEEEeeeccccc-eeEEEEcCC---chhhHHHHhccccccceEeecccccCHHHHHHH Confidence 9877543 1333345566778877777442 344433211 1111111 111111112334445577888888 Q ss_pred hcc---CcceEEEe--cCCChHHHHHHHHHHhhhhhhhhhhhheeeEEEEeccCCHHHHHhhhhccCcceEEEEecCCCC Q lcl|Aclame:pro 219 MAD---EPFDYIGL--PFNDTASVNTLVTEMNDTSGRWSYARQLYGHVYTAKTGTLSELVNAGDQFNQQHITLAGYEKET 293 (498) Q Consensus 219 lg~---~~~~~I~~--p~tD~a~l~al~~~l~~~s~r~~~~~q~~g~~~~~~~gt~~~~~t~g~~~N~~~~t~~~~~~~~ 293 (498) +-+ .||-++.. |--+.+.+.++.+|.+... +++. .....+-+...+.....+..-....... .. T Consensus 224 ~~~~~~nW~~~~~a~~~~~td~~~lalA~wiea~~-------~~f~---~~~~~~~a~~~~~~~~~~~~~~~~~~~~-~~ 292 (507) T protein:vir:99 224 SAAISTNFGSFIYTSTPALTNDQITAVASWNASQN-------NMYM---YSVPTTIANIGTLYAAVKGFSGCALNIT-SD 292 (507) T ss_pred HHhhcCCeEEEEEEeccccChHHHHHHHHHHhhcC-------cEEE---EEEecCchhhhhhhhhhhhcceeEEEee-cc Confidence 755 45444332 2113345566777765422 2222 1122233444445444433322222211 12 Q ss_pred CCcHHHHHHHHHHHhhhhhccCccccccce-----EEeccccCCCccccChHHHHHHHhCCeeEEEE--cCC--eEEEEe Q lcl|Aclame:pro 294 QTPADELAASRTARAAVFIRNDPARPTQTG-----ELVGMLPAPKGKRFTMTEQQTLLSHGVATAYV--ESG--VLRIQR 364 (498) Q Consensus 294 ~~p~~~~AAa~~a~~a~~l~~DPArpl~tl-----~L~Gl~~p~~~~r~~~~er~~lL~~Gist~~v--~~G--~v~IeR 364 (498) ..|.|-.+|++++.++ ..|+.++.-+. .|+|+.| +.++.+|.+.|..+|+..+.. ..| .-.+.+ T Consensus 293 ~~~~~~~~aa~~g~~a---s~nf~~~ng~~T~~fk~l~GV~a----~~lt~t~a~al~~~n~N~y~~~a~~~~~~~~~~~ 365 (507) T protein:vir:99 293 SLPVDYIEQSPCEILA---ATDYTRVNATQNYMYYQFPSRNI----TVSDDTTANLVDANRGNYIGQTQSAGQSLAFYQR 365 (507) T ss_pred cccchhHHHHHHHHHH---hhccCcCccceeecccccCCccc----ccCCHHHHHHHHhcCCeEEEEeccccceeeEEec Confidence 2344455677777765 45665554444 3455554 358999999999999999853 222 345677 Q ss_pred eeeeeeecCCCCCCchhhhhhhHHHHHHHHHHHHHHHhhhcCCc-eeccCCCCcCCCcccccHHHHHHHHHHHHHHHhhc Q lcl|Aclame:pro 365 DVTTYRKNAYGVADNSYLDSETLHTSAYVLRKLKSVITSKYGRH-KLASDGTRFGPGQAIVTPAVIKGELLATYRQLERA 443 (498) Q Consensus 365 ~ITTY~~n~~G~~D~s~ldi~t~~tl~yv~~~~r~~~~~~~~r~-kla~dg~~~~~g~~ivTp~~ikaeli~~~~~le~~ 443 (498) .+++ .|. -.|.|+...+=.+++...++..+-.-|-.- |+--+.. --.+|++.+-+.+++-... T Consensus 366 G~~~-----gG~--~~fid~d~~~~~~WL~~~iq~~l~~l~~~~~kIPyt~~---------G~~~l~a~i~~~l~~av~n 429 (507) T protein:vir:99 366 GILC-----GGP--NDAVDMNIYANEIWLKSAISAQILSLFLNVPRVPANET---------GESMLLSVIQSVVNTAKNN 429 (507) T ss_pred Ceee-----CCc--ccceeeeeecchHHHHHHHHHHHHHHHhcCCCCccChh---------hHHHHHHHHHHHHHHHHhc Confidence 7776 111 147778777777788888888776555222 2222111 1268999999999999999 Q ss_pred ccccc-hhhhcC-eEEEE---------EcCCCCeEEEEEeee-EE-ecCeEEEeeeeeeEEEecccCC Q lcl|Aclame:pro 444 GIVEN-YELFKQ-YLVVE---------RDASVPNRLNTLFPP-DY-VNQLRVFAVVNQFRLQYSEESA 498 (498) Q Consensus 444 given-~~~~~~-~lvVe---------rd~~d~nRvn~~~p~-~~-vn~l~v~A~~~~f~lq~~~~~~ 498 (498) |+|.- ++.... ...+. ++-.++.. -+.+|+ .. -.+-|.=-..-.+.|=|...-+ T Consensus 430 G~I~~Gvtl~~~q~~~in~~~~~~~~~~~~~~~Gy-y~~~~~~s~~~~~~r~~r~~~~~~~~y~~~ga 496 (507) T protein:vir:99 430 GTISAGKNLNVIQQQYITQISGDANAWRQVANIGY-WLNITFSSYTNPNTQLTEWKASYQLIYSKDDA 496 (507) T ss_pred cccccCCcccccchheecccccccccccceeccce-EEEeCChHhcChhhhhccccceEEEEEEeCCe Confidence 98853 211000 00111 11111110 112221 11 1112222222333344444444 No 31 >protein:vir:98263 Length: 664 # NCBI annotation: gp18 tail sheath monomer # Family: family:all:661 # MgeID: mge:1667 # MgeName: RB43 # Cross-refs: genbank:acc:YP_239196;genbank:gi:66391671;genbank:GeneID:3416365 Probab=99.48 E-value=9.2e-13 Score=86.57 Aligned_cols=450 Identities=11% Similarity=0.042 Sum_probs=217.2 Q ss_pred CccchhhcCcccccCeEEEE-EecCCCCCCCCCccEEEEEecCCCCccccceeEEecChHHHHHhhCc---CcHHHHHHH Q lcl|Aclame:pro 1 MTISFNTIPSNTLVPLFYAE-MDNQAANTAQDSGASLLIGHANNGAEIVANSLVLMPSADYARQICGA---GSQLARMVE 76 (498) Q Consensus 1 M~i~f~~Ip~~~rvPg~y~E-~dns~a~~~~~~~~vLliGq~~~~g~~~~~~~~~v~s~~~A~~~fG~---GS~l~~M~~ 76 (498) |+ + ..||+|+| +|-++.=.........++|. ...++.++|++|+|..|-...||. .+-+..|++ T Consensus 1 ma--~-------~~PgVyv~E~~~~~~i~~~~ts~~~~vG~---~~~Gp~~~p~~i~s~~d~~~~fG~~~~~~~~~~~v~ 68 (664) T protein:vir:98 1 MA--L-------QSPGIETKETSVQSTVVRNSTGRAAIVGK---FSWGPAYQIRQISNEVELVNYFGAPDNLTADYFMSA 68 (664) T ss_pred Cc--e-------ecCceEEEecCCCcccccccccceEEEee---ccCCCCCccEEecCHHHHHHhcCCccccchhHHHHH Confidence 66 2 46999998 55333322233456777875 446788999999999999999996 688889999 Q ss_pred HHHHhCCCceEEEEEecCCccc-ee---EEEEEEe-------------------ee-----------ccCCcEEEEEEcc Q lcl|Aclame:pro 77 AYRQTDPFGELYVIAVPEATGA-AA---TVTLTVT-------------------GE-----------ATESGTVNVYVGR 122 (498) Q Consensus 77 a~~~~n~~~~l~~i~l~d~ag~-aa---tg~itit-------------------gt-----------at~~G~l~l~I~g 122 (498) .|+. |--..+|++.+.+.... .+ .+.+.++ ++ ...+..+.+.|-. T Consensus 69 ~~f~-ngg~~~~vvRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~gn~~~v~i~~ 147 (664) T protein:vir:98 69 VNFL-QYGNDLRLVRVVDKDAAKNASAIFNQIKTTIASQGSNYNVGDVIKVKYSNTVVEENGKVTQVDSNGKILAVTIPK 147 (664) T ss_pred HHHH-hcCCeEEEEEecCccccccccccccccceeeccCCcccccccceeEeecCcccccceeeccCCCCCceeeEeecc Confidence 9985 55567999999653211 11 1111111 11 1111112221100 Q ss_pred EEEE-EEe-------------------ecCCCHHHHHHHHHHHHhcC--------------CCc-eEEEeeccceEEEee Q lcl|Aclame:pro 123 TRVQ-APV-------------------TNGDNVTTIASSIQDAINAV--------------PTL-PFTASSSAGVVTLTA 167 (498) Q Consensus 123 ~~v~-v~V-------------------~~gdtaa~iA~~l~~aIn~~--------------~~l-pVtA~~~~~~VtlTA 167 (498) .... +.. .++..... ..+...+... ... ++..........+++ T Consensus 148 ~~~~~~~~~~~~~~~~~~~~~~~s~~~~s~g~a~a--~~v~~v~~d~~~~~~~~~~a~~~i~~~~~~~~~~~~~~~~~~a 225 (664) T protein:vir:98 148 RKKSLLVLNRSVLTQIFLLVGTTEIVSQSSGVSAS--ITIDGIESDSGITLLNLDIAKETIQGTSFQTLTQKYQIPSVVA 225 (664) T ss_pred Cccceeecccccccccceecccceeeeeeccccee--eecccccccceeeccccceeeeccccccceeeeeccccceeee Confidence 0000 000 00000000 0000000000 000 001111112234445 Q ss_pred ccCcccccceeEEEEeccc-Ccc--------------------------------------------------------- Q lcl|Aclame:pro 168 RHKGLCGNEIPVSLNYYGF-GGG--------------------------------------------------------- 189 (498) Q Consensus 168 k~kG~~gN~i~l~~~~~~~-~~g--------------------------------------------------------- 189 (498) +..|..||.+++.+.-... ..+ T Consensus 226 ~~~G~~Gn~isv~i~s~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~ 305 (664) T protein:vir:98 226 LYPGELGSTVQVEIISKAAYDTGAMISGYPSGISVKNSGRSVMTYGPQTDNQYAFVVRRGGIVQESFIVSTDKTDKDIYG 305 (664) T ss_pred eecccccceeeeeecccccccCcceEeeccCceecccceeeeeeccccCccceeEEEecCCceeeeEEeecccCccccee Confidence 5556666655543321000 000 Q ss_pred ---------------------cccccceeeeecccCCCc------CcchhhhHHHhhccC---cceEEEecCCCh----- Q lcl|Aclame:pro 190 ---------------------EVLPAGVQIAVATGTAGT------GAPVLTGAVAAMADE---PFDYIGLPFNDT----- 234 (498) Q Consensus 190 ---------------------e~~p~Glt~tit~~agGa------g~pD~~~alaalg~~---~~~~I~~p~tD~----- 234 (498) +..|.+.+. .....||. |+.+..++|.++.+. ..++|++|--+. T Consensus 306 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~gg~~~~~~~g~~~~~tgl~~l~~~~~~~~~ll~~p~~~~~~~~~ 384 (664) T protein:vir:98 306 VNIYMDDFFANGGSQYVFGTSMNWPKGFSG-ILEFGGGLSSNDTVGADELMTGWDMFADREALHVPLLIAGGCAGESVEI 384 (664) T ss_pred eeeechhheecccceeeeeecccCCcccce-eEeccCccccccccCchhHHHHHHhhhcccccccceEEecCCCCCcHHH Confidence 000011110 11122332 334455677777543 358888875332 Q ss_pred --HHHHHHHHHHhhhhhhhhhhhheeeEEEEe-ccCCHHHHHhhhh--------------ccCcceEEEEecC------- Q lcl|Aclame:pro 235 --ASVNTLVTEMNDTSGRWSYARQLYGHVYTA-KTGTLSELVNAGD--------------QFNQQHITLAGYE------- 290 (498) Q Consensus 235 --a~l~al~~~l~~~s~r~~~~~q~~g~~~~~-~~gt~~~~~t~g~--------------~~N~~~~t~~~~~------- 290 (498) +-..++.++++....|+.-.....+..... ..-+..++..+-. .+++.+..+.+.. T Consensus 385 ~~~v~~al~~~a~~~~~~~a~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~l~~p~~~~~d~~ 464 (664) T protein:vir:98 385 ASTVQKHVISIGDERQDCTVFVSPPRSLLVNIPLATAVDNIVEWRTGYKISGGTPVDNNLNVSSSYGFLDGNYKYQYDKY 464 (664) T ss_pred HHHHHHHHHHHHHhcCCeEEEEccccceeccCCccccHHHHHHHhhhccccccchhhhhcCCccceEEEEcCeEEEeccc Confidence 123444444433211221110001111111 1123444444432 3566666554321 Q ss_pred -CC-CCCcHHHHHHHHHHHhhhhhccCcc-ccccc--eEEeccccCCCccccChHHHHHHHhCCeeEEEE--c-CCeEEE Q lcl|Aclame:pro 291 -KE-TQTPADELAASRTARAAVFIRNDPA-RPTQT--GELVGMLPAPKGKRFTMTEQQTLLSHGVATAYV--E-SGVLRI 362 (498) Q Consensus 291 -~~-~~~p~~~~AAa~~a~~a~~l~~DPA-rpl~t--l~L~Gl~~p~~~~r~~~~er~~lL~~Gist~~v--~-~G~v~I 362 (498) +. ...|+....|.+.|.... +..|. .|-+. ..+.|. .+....++..|++.|..+||.++.. + .| .++ T Consensus 465 ~~~~~~~p~sg~~AGl~A~~D~--~~g~~~span~~~~~i~g~--~~~~~~~~~~~~~~Ln~~gIn~i~~~~~~~G-~~~ 539 (664) T protein:vir:98 465 NDVNRWVPLAGDIAGLCVYTDS--VANPWMSPAGYNRGQIRNC--IKLAIEPRTAHRDAMYQVQINPVTGFAGGSG-FVL 539 (664) T ss_pred CCceEEechHHHHHHHHHHhhh--cCCcEECcCCceeeeeecc--ccceeecChhhHHHHHhCCCeEEEEeeCCCc-EEE Confidence 11 113443333333333321 22231 12221 124444 3456678899999999999988754 2 34 333 Q ss_pred EeeeeeeeecCCCCCCchhhhhhhHHHHHHHHHHHHHHHhhhcCCceeccCCCCcCCCcccccHHHHHHHHHHHHHHHhh Q lcl|Aclame:pro 363 QRDVTTYRKNAYGVADNSYLDSETLHTSAYVLRKLKSVITSKYGRHKLASDGTRFGPGQAIVTPAVIKGELLATYRQLER 442 (498) Q Consensus 363 eR~ITTY~~n~~G~~D~s~ldi~t~~tl~yv~~~~r~~~~~~~~r~kla~dg~~~~~g~~ivTp~~ikaeli~~~~~le~ 442 (498) -=.-|. .+ .|..|+.|.+.|+.+|+.+.++.... .|--+.+.. -+-+.||..+-..+++|.. T Consensus 540 wG~rT~-----~~-~~s~~~~i~vrR~~~~i~~si~~~~~-~~v~epn~~-----------~l~~~i~~~i~~~L~~l~~ 601 (664) T protein:vir:98 540 YGDKTL-----TS-VPSPFDRINVRRLFNMIKKDIGDNAK-YKLFENNDD-----------FTRASFRMDTGQYMTNIRA 601 (664) T ss_pred Eccccc-----CC-CCcccceEeehhHHHHHHHHHHHHHH-HhhcCCCCH-----------HHHHHHHHHHHHHHHHHHh Confidence 333332 11 23469999999999999999998765 343333222 1457789999999999999 Q ss_pred cccccchhhhcCeEEEEE--cCC---CCeEEEEEeeeEEecCeEEEeeeeeeEEEecccCC Q lcl|Aclame:pro 443 AGIVENYELFKQYLVVER--DAS---VPNRLNTLFPPDYVNQLRVFAVVNQFRLQYSEESA 498 (498) Q Consensus 443 ~given~~~~~~~lvVer--d~~---d~nRvn~~~p~~~vn~l~v~A~~~~f~lq~~~~~~ 498 (498) +|.+..+. +.+.+ |++ +.+|+.+.+-...+...+- |.|++|...+-+ T Consensus 602 ~gal~g~~-----V~~d~~~nt~~~i~~G~~~~~i~~~p~~pae~----I~~~~~q~~~~~ 653 (664) T protein:vir:98 602 LGGCYDYR-----VICDTTNNTPDVIDRNEFVATVYVKPPRSINY----ITLNFVATSTGA 653 (664) T ss_pred cCceeeeE-----EEEcCCCCCHHHhhCCeEEEEEEEEecCCcce----EEEEEEEeecCc Confidence 99998852 33332 222 2467777776666666543 445555443333 No 32 >protein:vir:94073 Length: 494 # NCBI annotation: hypothetical protein # Family: family:all:396 # MgeID: mge:1493 # MgeName: OP2 # Cross-refs: genbank:acc:YP_453619;genbank:gi:84662655;genbank:GeneID:5142583 Probab=99.48 E-value=4.4e-12 Score=82.87 Aligned_cols=435 Identities=15% Similarity=0.125 Sum_probs=228.5 Q ss_pred Cc-cchhhcCcccccCeEEEEEecCCC---CCCCCCccEEEEEecCCCCccccceeEEecChHHHHHhhCcCcHHHHHHH Q lcl|Aclame:pro 1 MT-ISFNTIPSNTLVPLFYAEMDNQAA---NTAQDSGASLLIGHANNGAEIVANSLVLMPSADYARQICGAGSQLARMVE 76 (498) Q Consensus 1 M~-i~f~~Ip~~~rvPg~y~E~dns~a---~~~~~~~~vLliGq~~~~g~~~~~~~~~v~s~~~A~~~fG~GS~l~~M~~ 76 (498) |. |+.+.|= .++-+.. +.+ ...-.||+. .....+.+......|.+++.+.||.+|.-..|++ T Consensus 1 m~~ip~s~iV----------~V~~~v~~~~~~~-~~f~~~l~~---~~~~~~~~r~~~y~s~~~V~~~FG~~S~ey~aA~ 66 (494) T protein:vir:94 1 MPNIPISQIV----------SINPQVVSAGGTQ-GTLDGLLLT---QATGFPVTQPQVYFSAADVGTAFGLTSDEYNAAL 66 (494) T ss_pred CCCCCcccEE----------EeeeeccccCCcc-cccceeEee---cCccCCccceeeecCHHHHHHhcCCChHHHHHHH Confidence 77 7776653 3333332 222 222244442 2334455555566688899999999999999999 Q ss_pred HHHH----hCCCc-eEEEEEecCCccc-eeEE---EEEEeeeccCCcEEEEEEccEEEEEEe--ecCCCHHHHHHHHHHH Q lcl|Aclame:pro 77 AYRQ----TDPFG-ELYVIAVPEATGA-AATV---TLTVTGEATESGTVNVYVGRTRVQAPV--TNGDNVTTIASSIQDA 145 (498) Q Consensus 77 a~~~----~n~~~-~l~~i~l~d~ag~-aatg---~ititgtat~~G~l~l~I~g~~v~v~V--~~gdtaa~iA~~l~~a 145 (498) .|++ ..|.. .|++-.-...+.. .-.| +.+++.-..-+|++++.|+|++....+ ....+.+++|+.|..+ T Consensus 67 ~yFs~~~~q~p~P~~l~igR~~~~a~~~~l~g~~~~~tl~~~~~~~g~l~iti~g~~~~~~i~lS~~ts~~~vA~~i~~a 146 (494) T protein:vir:94 67 VYFAGILGGGQQPASLTIGRYASAATSAAVFGAPLTLSLAQLQTLSGTLIVTTDTQRTSAAINLSGATSFANAASLMTSG 146 (494) T ss_pred HHhhhccCCCccccEEEEEeecCccccceeeccchhhhHHhhhhcceEEEEEEcceEEEeeecccccCChhhHHHHHhhh Confidence 9998 66664 6776665432211 1112 233444455689999999997665443 4667789999999999 Q ss_pred HhcCCCceEEEeeccceEEEeeccCcccccceeEEEEecccCcccccccceeeeeccc----CCCcCcchhhhHHHhh-- Q lcl|Aclame:pro 146 INAVPTLPFTASSSAGVVTLTARHKGLCGNEIPVSLNYYGFGGGEVLPAGVQIAVATG----TAGTGAPVLTGAVAAM-- 219 (498) Q Consensus 146 In~~~~lpVtA~~~~~~VtlTAk~kG~~gN~i~l~~~~~~~~~ge~~p~Glt~tit~~----agGag~pD~~~alaal-- 219 (498) |.. .+.+|+........++++...|... .|... . | .+..++-++-... ..|+....+.++++++ T Consensus 147 i~~-a~~~v~~d~~~~~f~v~s~ttG~~s-~is~~----t---~-~~a~~l~lt~~~~a~v~~~g~~aet~~~a~~a~~~ 216 (494) T protein:vir:94 147 FTT-PNFAITYDAQRRRFVLSTTATGTTA-SVSAV----T---G-TLADGVGLSTASGAYVEGSGLAADTAASALDRLAA 216 (494) T ss_pred hcc-ccceEEEcccCcEEEEEEccCCcee-EEEEe----c---c-chhhhhhhhccccceEeecCcccccHHHHHHHHHh Confidence 964 4567888888888999888777532 22221 1 1 1233332222221 2444445567777776 Q ss_pred -ccCcceEEEecCCChHHHHHHHHHHhhhhhhhhhhhheeeEEEEecc-C-CHHHHHhhhhc---cCcceEEEEecCCCC Q lcl|Aclame:pro 220 -ADEPFDYIGLPFNDTASVNTLVTEMNDTSGRWSYARQLYGHVYTAKT-G-TLSELVNAGDQ---FNQQHITLAGYEKET 293 (498) Q Consensus 220 -g~~~~~~I~~p~tD~a~l~al~~~l~~~s~r~~~~~q~~g~~~~~~~-g-t~~~~~t~g~~---~N~~~~t~~~~~~~~ 293 (498) ...||-+++..--+.+...++.+|.+....|+ +++. ++... . ..+.-..++.. .|..|..++.. . T Consensus 217 ~~~~Wy~f~~~~~~~~~~ilalA~wiea~~~~~-----~~~~-~~~d~~~~~~~~~~~i~~~l~~~~y~~t~~~y~-~-- 287 (494) T protein:vir:94 217 SSSTWAIFTTAWAASLSDRTALAQWTSDQVFRR-----IYAA-WDQDAAGLSVNNVSSFGNIVKTTPFSNTIPVYG-L-- 287 (494) T ss_pred ccCceEEEEEecCCCHHHHHHHHHHHhhcCccE-----EEEE-ecCCcceeecccchhHHHHHHhhcCCceEEEcC-C-- Confidence 45677777664444566667888876532221 2221 11100 0 01112233332 36667766653 2 Q ss_pred CCcHHHHHHHHHHHhhhhhccCccccccceEEeccccCCCccccChHHHHHHHhCCeeEEEEc--CC-eEEEEeeeeeee Q lcl|Aclame:pro 294 QTPADELAASRTARAAVFIRNDPARPTQTGELVGMLPAPKGKRFTMTEQQTLLSHGVATAYVE--SG-VLRIQRDVTTYR 370 (498) Q Consensus 294 ~~p~~~~AAa~~a~~a~~l~~DPArpl~tl~L~Gl~~p~~~~r~~~~er~~lL~~Gist~~v~--~G-~v~IeR~ITTY~ 370 (498) ..|. ++..+..++......|.+ .++.++.-++.-..+..+.+|.+.|..+|+..|..- .+ +..+-. T Consensus 288 ~~~~---aa~~g~~aa~~~~~~~g~--~T~~~k~q~~gi~~~~l~~t~a~al~~~~~N~y~~~~~~~~~~~~~~------ 356 (494) T protein:vir:94 288 LANA---MIVLAWGASTNLQIAEGR--TTLALRSPVSSAGVRVDNLANANALLSNGYTYLGKYASATNTYTVTY------ 356 (494) T ss_pred CChH---HHHHHHHHhccccccCcc--eeEEeeccCCCCCCccCCHHHHHHHHhcCCeEEEEecccCceEEEec------ Confidence 2232 223333222222222332 344444323333355689999999999999998542 23 222211 Q ss_pred ecCCCCCCchhhhhhhHHHHHHHHHHHHHHHhhhc-CCceeccCCCCcCCCcccccHHHHHHHHHHHHHHHhhcccccc- Q lcl|Aclame:pro 371 KNAYGVADNSYLDSETLHTSAYVLRKLKSVITSKY-GRHKLASDGTRFGPGQAIVTPAVIKGELLATYRQLERAGIVEN- 448 (498) Q Consensus 371 ~n~~G~~D~s~ldi~t~~tl~yv~~~~r~~~~~~~-~r~kla~dg~~~~~g~~ivTp~~ikaeli~~~~~le~~given- 448 (498) .|...-.|.|+...+=.+++...++..+-..+ .-.|+--+... -.++++.+-+.+++-...|+|.- T Consensus 357 ---gg~~sG~~~~id~~~~~~WL~~~iq~~l~~ll~~~~KIPytd~G---------~~~l~a~i~~~l~~av~nG~I~~G 424 (494) T protein:vir:94 357 ---NGAIGGQFLWADTALGWIALRRNLQQALFETLLAYRSLPYNADG---------YNALYQGAQDVVSQFVAAGVIRAG 424 (494) T ss_pred ---CceeccccceeeeeccHHHHHHHHHHHHHHHHHhCCCcccChhh---------HHHHHHHHHHHHHHHHhCceeecc Confidence 12223346666665555566777666664333 21233222211 26899999999999999999942 Q ss_pred ---hhhhcCeE-------------------E-E-EEcCCCC-eEE--EEEeeeEEecCeEEEeeeeeeEE Q lcl|Aclame:pro 449 ---YELFKQYL-------------------V-V-ERDASVP-NRL--NTLFPPDYVNQLRVFAVVNQFRL 491 (498) Q Consensus 449 ---~~~~~~~l-------------------v-V-erd~~d~-nRv--n~~~p~~~vn~l~v~A~~~~f~l 491 (498) -+..+..+ . + ....+++ +|. .+.+=...-+-.|.+-...-+.+ T Consensus 425 v~~~~~q~~~i~~~~G~~~~~~~~~kGyy~~~~~~~s~~~ra~R~~~~~~~~y~~~GAIh~v~i~~~~v~ 494 (494) T protein:vir:94 425 VALSASQRAQIDQAAGVPISGDVVDKGWYLQVIDPITTTVRTDRGSPTVNFWYCDGGSIQRVVVSATTVI 494 (494) T ss_pred cccCcchhhhhhhhhcCccccceeccceeeeccCCCChhhhhccccCCceEEEEecCcEEEEEEeeEEeC Confidence 11100000 0 0 1111111 111 11111122222333333323333 No 33 >protein:vir:103456 Length: 659 # NCBI annotation: tail sheath monomer # Family: family:all:661 # MgeID: mge:1542 # MgeName: RB32 # Cross-refs: genbank:acc:YP_803108;genbank:gi:116326388;genbank:GeneID:4405485 Probab=99.47 E-value=1.4e-12 Score=85.61 Aligned_cols=443 Identities=14% Similarity=0.085 Sum_probs=217.3 Q ss_pred cccccCeEEEEEecCCCC-CCCCCccEEEEEecCCCCccccceeEEecChHHHHHhhCc---CcHHHHHHHHHHHhCCCc Q lcl|Aclame:pro 10 SNTLVPLFYAEMDNQAAN-TAQDSGASLLIGHANNGAEIVANSLVLMPSADYARQICGA---GSQLARMVEAYRQTDPFG 85 (498) Q Consensus 10 ~~~rvPg~y~E~dns~a~-~~~~~~~vLliGq~~~~g~~~~~~~~~v~s~~~A~~~fG~---GS~l~~M~~a~~~~n~~~ 85 (498) =+...||+|+|--.+... .........++| ....++.++|++|+|..|-...||. .+-+..|++.|+. |--. T Consensus 1 ~~~~~PgVyv~e~~~~~~~~~~~ts~~~fvG---~~~~Gp~~~p~~i~s~~d~~~~fG~~~~~~~~~~~v~~~f~-ngg~ 76 (659) T protein:vir:10 1 MTLLSPGIELKETTVQSTVVNNSTGTAALAG---KFQWGPAFQIKQVTNEVDLVNTFGQPTAETADYFMSAMNFL-QYGN 76 (659) T ss_pred CceecCceEEEEecCCceecccCccceEEEe---cccCCCCCccEEecCHHHHHHHcCCcCCCcchhHHHHHHHh-hCCC Confidence 446889999985455442 112355677777 4557788999999999999999997 4778888999885 5556 Q ss_pred eEEEEEecCCccceeE----EEEEE----eee----------------ccCCcEEEE-EEccEEEEEEeec--------- Q lcl|Aclame:pro 86 ELYVIAVPEATGAAAT----VTLTV----TGE----------------ATESGTVNV-YVGRTRVQAPVTN--------- 131 (498) Q Consensus 86 ~l~~i~l~d~ag~aat----g~iti----tgt----------------at~~G~l~l-~I~g~~v~v~V~~--------- 131 (498) .+|++.+.+.....++ +.+.+ .|. ....+.+.+ ...+..-.+.+.. T Consensus 77 ~~~vvRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~v~~vd~~~~~~~~~i~~~~~~~~~~~ 156 (659) T protein:vir:10 77 DLRVVRAVDRDTAKNSSPIAGNIEYTISTPGSNYAVGDKITVKYVSDAIETEGKITEVDTDGKIKKINIPTAKIIAKAKE 156 (659) T ss_pred eEEEEEccCcccccccccccccceeeEeecccccccccceeeeecCCCccccceeeEEecccccceeeeccccccccccc Confidence 8999998653211111 11111 110 000111100 0001100000000 Q ss_pred -CCCHHHH--------------HHHH-HHHHhcCC-Cce--------------EEEe-eccceEEEeeccCcccccceeE Q lcl|Aclame:pro 132 -GDNVTTI--------------ASSI-QDAINAVP-TLP--------------FTAS-SSAGVVTLTARHKGLCGNEIPV 179 (498) Q Consensus 132 -gdtaa~i--------------A~~l-~~aIn~~~-~lp--------------VtA~-~~~~~VtlTAk~kG~~gN~i~l 179 (498) |+...-+ +..+ ...++... .+. +... ...+.-.++++..|+.||.+.+ T Consensus 157 ~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~a~t~~~~~~~~~~~~~~~v~a~~~G~~g~~~tv 236 (659) T protein:vir:10 157 VGEYPTLGSNWTAEISSSSSGLAAVITLGKIITDSGILLAEIENAEAAMTAVDFQANLKKYGIPGVVALYPGELGDKIEI 236 (659) T ss_pred ccccceeeeeeeeeeeeeccccceeeEEeeeecCCceeEEeeccccccccccccccceeecccccccccccceecccceE Confidence 0000000 0000 00000000 000 0000 0000112234444555554444 Q ss_pred EEEecccCc------------------------------------------------------cc--------------- Q lcl|Aclame:pro 180 SLNYYGFGG------------------------------------------------------GE--------------- 190 (498) Q Consensus 180 ~~~~~~~~~------------------------------------------------------ge--------------- 190 (498) ......... |. T Consensus 237 ~~~~~a~~~~~~~v~v~~~~~~~~~a~~~t~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 316 (659) T protein:vir:10 237 EIVSKADYAKGASALLPIYPGGGTRASTAKAVFGYGPQTDSQYAIIVRRNDAIVQSVVLSTKRGEKDIYDSNIYIDDFFA 316 (659) T ss_pred EEechhhccccceeeeeeeeecccccccceeeeeeccccccchhhccccccceeeeeeeeccccccccccchhhhhhhhc Confidence 321110000 00 Q ss_pred ------------ccccceeeeecccCCCcC------cchhhhHHHhhc---cCcceEEEecCCC-------hHHHHHHHH Q lcl|Aclame:pro 191 ------------VLPAGVQIAVATGTAGTG------APVLTGAVAAMA---DEPFDYIGLPFND-------TASVNTLVT 242 (498) Q Consensus 191 ------------~~p~Glt~tit~~agGag------~pD~~~alaalg---~~~~~~I~~p~tD-------~a~l~al~~ 242 (498) ..|.+. ......+||.. .+|+..++.++. ...++++++|--. .+...++.. T Consensus 317 ~~~~~~v~~~~~~~~~~~-~~~~~l~gg~~~~~~~~~~~~~~~~~~~~~~~~~~~~il~~p~~~~~~~~~~~~v~~al~~ 395 (659) T protein:vir:10 317 KGGSEYIFATAQNWPEGF-SGILTLSGGLSSNAEVTAGDLMEAWDFFADRESVDVQLFIAGSCAGESLETASTVQKHVVS 395 (659) T ss_pred cCcccEEEEeecccCCCc-cceeeecccccccccccchhHHHHHHHhhhccccceeEEEecCCCCcchhhhHHHHHHHHH Confidence 000000 00112334322 234555555554 3467899887432 233455555 Q ss_pred HHhhhhhhhhhhhheeeEEEEe---------c-cCCHHHHHhhhhc----------cCcceEEEEec--------CCC-C Q lcl|Aclame:pro 243 EMNDTSGRWSYARQLYGHVYTA---------K-TGTLSELVNAGDQ----------FNQQHITLAGY--------EKE-T 293 (498) Q Consensus 243 ~l~~~s~r~~~~~q~~g~~~~~---------~-~gt~~~~~t~g~~----------~N~~~~t~~~~--------~~~-~ 293 (498) |++. | +..+++.. . .-+..++..+-.. +||.+..+.+. .+. . T Consensus 396 ~~~~---~------~~~~~~~d~p~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~s~~~~l~~p~~~~~d~~~~~~~ 466 (659) T protein:vir:10 396 IGDA---R------QDCLVLCSPPRETVVGIPVTRAVDNLVNWRTAAGSYTDNNFNISSTYAAIDGNYKYQYDKYNDVNR 466 (659) T ss_pred HHHh---h------CCeEEEEcCccccccCCCcccCHHHHHHHHHhcccccccccccCcceEEEEeCcEEEecccCCceE Confidence 5543 2 12222221 1 1234555555442 46767665432 111 1 Q ss_pred CCcHHHHHHHHHHHhhhhhccCcc-cccc-c-eEEeccccCCCccccChHHHHHHHhCCeeEEEEc--CCeEEEEeeeee Q lcl|Aclame:pro 294 QTPADELAASRTARAAVFIRNDPA-RPTQ-T-GELVGMLPAPKGKRFTMTEQQTLLSHGVATAYVE--SGVLRIQRDVTT 368 (498) Q Consensus 294 ~~p~~~~AAa~~a~~a~~l~~DPA-rpl~-t-l~L~Gl~~p~~~~r~~~~er~~lL~~Gist~~v~--~G~v~IeR~ITT 368 (498) ..|+....|.+.|+.-. +..|- .|-+ . ..+.|+. +..-+++..|++.|..+||.++..- .| .++--..|. T Consensus 467 ~~p~sg~~AGl~Ar~D~--~~g~~~span~~~~~i~g~~--~~~~~~~~~~~~~Ln~~gIn~i~~~~~~G-~~~wG~rT~ 541 (659) T protein:vir:10 467 WVPLAADIAGLCARTDN--VSQTWMSPAGYNRGQILNVI--KLAIETRQAQRDRLYQEAINPVTGTGGDG-YVLYGDKTA 541 (659) T ss_pred EechHHHHHHHHHHHhc--cCCceEccCCceeeeeeccc--cceecCCHhHHHHHhhCCeeEEEEeCCCe-EEEEccccc Confidence 13543433333333321 12220 1222 1 1244442 3455778999999999999998653 35 555555554 Q ss_pred eeecCCCCCCchhhhhhhHHHHHHHHHHHHHHHhhhcCCceeccCCCCcCCCcccccHHHHHHHHHHHHHHHhhcccccc Q lcl|Aclame:pro 369 YRKNAYGVADNSYLDSETLHTSAYVLRKLKSVITSKYGRHKLASDGTRFGPGQAIVTPAVIKGELLATYRQLERAGIVEN 448 (498) Q Consensus 369 Y~~n~~G~~D~s~ldi~t~~tl~yv~~~~r~~~~~~~~r~kla~dg~~~~~g~~ivTp~~ikaeli~~~~~le~~given 448 (498) .+ .+..|+.|...|+.+|+.+.++.... .|-.+.+... +-+.||..+-..+++|..+|.++. T Consensus 542 -----~~-~~s~~~~i~vrR~~~~i~~si~~~~~-~~v~e~n~~~-----------l~~~i~~~i~~fL~~l~~~gal~~ 603 (659) T protein:vir:10 542 -----TS-VPSPFDRINVRRLFNMLKTNIGRSSK-YRLFELNNAF-----------TRSSFRTETAQYLQGIKALGGIYE 603 (659) T ss_pred -----CC-CCcccceEehhhHHHHHHHHHHHHHH-HhccCCCCHH-----------HHHHHHHHHHHHHHHHHhcCceee Confidence 11 22458999999999999999998664 4444433221 447788999999999999999986 Q ss_pred hhhhcCeEEEEEc--CC---CCeEEEEEeeeEEecCeEEEeeeeeeEEEecccCC Q lcl|Aclame:pro 449 YELFKQYLVVERD--AS---VPNRLNTLFPPDYVNQLRVFAVVNQFRLQYSEESA 498 (498) Q Consensus 449 ~~~~~~~lvVerd--~~---d~nRvn~~~p~~~vn~l~v~A~~~~f~lq~~~~~~ 498 (498) | .+.|..+ ++ +.+|+.+.+-...+..++- |.|++|....-+ T Consensus 604 ~-----~V~~d~~~nt~~~i~~G~~~~~i~~~p~~pae~----i~~~~~~~~~~~ 649 (659) T protein:vir:10 604 Y-----RVVCDTTNNTPSVIDRNEFVATFYIQPARSINY----ITLNFVATATGA 649 (659) T ss_pred E-----EEEEcCCCCCHHHhhCCeEEEEEEEEecCCcce----EEEEEEEEecCc Confidence 5 2444332 22 3457777777766666554 344444332222 No 34 >protein:vir:79798 Length: 717 # NCBI annotation: tail sheath subunit # Family: family:all:12069 # MgeID: mge:1874 # MgeName: 0305phi8-36 # Cross-refs: genbank:acc:YP_001429630;genbank:gi:156564120;genbank:GeneID:5525563 Probab=99.46 E-value=6.3e-12 Score=82.00 Aligned_cols=448 Identities=16% Similarity=0.149 Sum_probs=233.6 Q ss_pred CccchhhcCcccccCeEEEEEecCCC-----CCCCCCccEEEEEecCCCCccccceeEEecChHHHHHhhCc-------- Q lcl|Aclame:pro 1 MTISFNTIPSNTLVPLFYAEMDNQAA-----NTAQDSGASLLIGHANNGAEIVANSLVLMPSADYARQICGA-------- 67 (498) Q Consensus 1 M~i~f~~Ip~~~rvPg~y~E~dns~a-----~~~~~~~~vLliGq~~~~g~~~~~~~~~v~s~~~A~~~fG~-------- 67 (498) |+ .|++.. -.||.-+.|...+- -.+...|.|||+|... .++.++||+|+ ...|..+||. T Consensus 1 ~~-~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~ 72 (717) T protein:vir:79 1 MA-GFDQYQ---AIPGHNARFKDGNLNLKSDPNPRETESVVLLGTAT---DGPVMQPVRVT-PETAYNIFGKVAHENGVY 72 (717) T ss_pred CC-chhhhh---cCCCceeeeecCceecCCCCCccccceEEEEeecc---CCcccCceeeC-hhHHHhhhhhhhhhcccc Confidence 87 898887 57899888754333 2445568999999665 34678999997 6679999996 Q ss_pred --CcHHHHHHHHHHHhCCCceEEEEE--------------------ecCC-ccceeEEEEEEeeeccCCcEEE------- Q lcl|Aclame:pro 68 --GSQLARMVEAYRQTDPFGELYVIA--------------------VPEA-TGAAATVTLTVTGEATESGTVN------- 117 (498) Q Consensus 68 --GS~l~~M~~a~~~~n~~~~l~~i~--------------------l~d~-ag~aatg~ititgtat~~G~l~------- 117 (498) |..+-.|.++++.-|....|.-+. ++|- .+..+-|.+.+|-+-..+|.+. T Consensus 73 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 152 (717) T protein:vir:79 73 NGATLLPKFEELWAAGNRDIRLMRTTGVNAVSSLLGTSYSKNSKEVAEDKLGGAQARGNVAATFTLPNGGIVEATFLLKA 152 (717) T ss_pred cchhhhHHHHHHHhcCCcceEEEEecchhHHHHHhhcccccchhhHHHHhhcccccccceEEEEEcCCCceeeeeeeeee Confidence 678889999999988876654332 1121 1122334444443322222211 Q ss_pred -----------EEEc-----------------------cEEEEEEe--------ecCCCHH-HHHHHHHHHHhcCCCceE Q lcl|Aclame:pro 118 -----------VYVG-----------------------RTRVQAPV--------TNGDNVT-TIASSIQDAINAVPTLPF 154 (498) Q Consensus 118 -----------l~I~-----------------------g~~v~v~V--------~~gdtaa-~iA~~l~~aIn~~~~lpV 154 (498) |-+| ...-.++| +.|+|-. ++-.+ =.....-|+ T Consensus 153 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~ 228 (717) T protein:vir:79 153 RGVIIPPNNYTLDVGTEEDMKAGTQPTFAQVLLNENVADMESEITVSYEFTYKDAQGETKTSEVLDN----NTDKDGKPM 228 (717) T ss_pred cceEeCCCcceEeccChhhhhcCCCchhhhhhhccchhhccceeEEEEEEEeecccCcchhhhhhcC----CCCCCCcee Confidence 1111 00001111 1122111 00000 000001122 Q ss_pred EEe--------------------------------ecc------------------------------ceEEEeeccCcc Q lcl|Aclame:pro 155 TAS--------------------------------SSA------------------------------GVVTLTARHKGL 172 (498) Q Consensus 155 tA~--------------------------------~~~------------------------------~~VtlTAk~kG~ 172 (498) -+. .++ -++.|-+-..|+ T Consensus 229 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~ 308 (717) T protein:vir:79 229 IAKGADVTIKLEHVALAGLKLYADGIEVVDAKAFTVAGDQLTIHSNSKMKLGASLEAQYAYNLVEVIQPVIELESIFGGG 308 (717) T ss_pred EEecccceeehhhhhhhhhHHhhcchhhhhhhheeeecceEEEEecCCcccchhhHHHHHhhHHHhhccceEEeecccCc Confidence 111 011 124455555677 Q ss_pred cccceeEEEEecccC-----------cc-----------------cccccce-------------------ee------- Q lcl|Aclame:pro 173 CGNEIPVSLNYYGFG-----------GG-----------------EVLPAGV-------------------QI------- 198 (498) Q Consensus 173 ~gN~i~l~~~~~~~~-----------~g-----------------e~~p~Gl-------------------t~------- 198 (498) .-|++.+.++..+.. -| ....+|| ++ T Consensus 309 ~~n~~~~~v~~~D~~~~~~~t~~~~~~g~~~~~pl~~ts~dy~~~~~~vdgI~~~~~~~V~~~g~~s~a~a~~~~g~~s~ 388 (717) T protein:vir:79 309 VYNDIMRKVESKDGAVTVTITKPESKRGMISEDPLVFKSGDYTNFKMLVDAINNHPFNNVVRARTKPEFEATFTSTLQAA 388 (717) T ss_pred eeeeeeeEEecCCceEEEEEecccccCcceeccccccccCceeeeeeeecccccCchhheeeeecccccceeeeecccCc Confidence 777776666544320 00 0001121 00 Q ss_pred eecccCCCcCc--ch-----------------h--hhHHHhhccCcceEEEecCC--ChH-------HHHHHHHHHhhhh Q lcl|Aclame:pro 199 AVATGTAGTGA--PV-----------------L--TGAVAAMADEPFDYIGLPFN--DTA-------SVNTLVTEMNDTS 248 (498) Q Consensus 199 tit~~agGag~--pD-----------------~--~~alaalg~~~~~~I~~p~t--D~a-------~l~al~~~l~~~s 248 (498) .-..+.||..- |+ + ..+++.+..+..++++.|-. |+. -..++.+|+...+ T Consensus 389 d~a~f~Gg~dgl~~~~ee~Y~~lGgk~~d~g~lt~~aays~LE~~dVDlVil~ga~adtt~ga~~d~va~alad~caalS 468 (717) T protein:vir:79 389 ADAKFSGGKDELSLDKEEMYKRLGGEKNEEGFVTKQGAYQYLENYEVDYVIPLGVHADTKLIGKYDDFAYQLALACAVMS 468 (717) T ss_pred hhhccCCCccccccchhhhhccccccccccccccchhhhhhcCcceeEEEEecCccccccccchhhhHHHHHHHHHHHhh Confidence 00111111100 11 1 13556666677888887743 221 1345666665432 Q ss_pred hhhhhhhheeeEE-E-EeccCCHHHH-------Hhhhh---------------ccC----cceE-------EEEe-cCCC Q lcl|Aclame:pro 249 GRWSYARQLYGHV-Y-TAKTGTLSEL-------VNAGD---------------QFN----QQHI-------TLAG-YEKE 292 (498) Q Consensus 249 ~r~~~~~q~~g~~-~-~~~~gt~~~~-------~t~g~---------------~~N----~~~~-------t~~~-~~~~ 292 (498) . .. +...++. . .+..-+.+.. ..++. ..| +.+. .++. .... T Consensus 469 a-l~--r~ai~VI~l~sp~D~~~AtVe~~~~kLs~~Aaa~~~~d~~~a~a~~~~~~~idis~y~~vv~~~~~iv~~~~~~ 545 (717) T protein:vir:79 469 H-YN--SVTIGIIPTTTPSDISLAGVEEHVKKLENYANEFYMRDRFGNIIFDADRNKIDLGQFIEVVAGPDFIVRNTRLG 545 (717) T ss_pred h-cc--ccceeeeccccccccchhhHHHHHHHHHhhhhhhhhhcchhccccccccccccccceeeeeecceeEEEcCCCc Confidence 1 11 1111111 1 1111111111 11100 000 1111 1111 1111 Q ss_pred CCCcHHHHHHHHHHHhhhhhccCccccccceEEeccccCCCccccChHHHHHHHhCCeeEEEEcCC-eEEEEeeeeeeee Q lcl|Aclame:pro 293 TQTPADELAASRTARAAVFIRNDPARPTQTGELVGMLPAPKGKRFTMTEQQTLLSHGVATAYVESG-VLRIQRDVTTYRK 371 (498) Q Consensus 293 ~~~p~~~~AAa~~a~~a~~l~~DPArpl~tl~L~Gl~~p~~~~r~~~~er~~lL~~Gist~~v~~G-~v~IeR~ITTY~~ 371 (498) ...+ -.|+.+|+..+ +.+|....-...|.|+.- ....++..|++.|..+||.++....| -+++...+|+-. T Consensus 546 ~~~~--p~AG~vAGldA---~rGVwkSPANk~I~GVvg--La~~lT~sE~d~Ln~aGIntIr~~~GrGirVWGaRTtas- 617 (717) T protein:vir:79 546 QMAS--TPDASYIGMVS---QLKTQSAPTNKPLPSVTA--LRYTYSANQLNRLTKARFATFKYKQDGSIGVVDAPTSAH- 617 (717) T ss_pred eeec--CHHHHHHHHHh---cCCcccccccceeccccc--CcccCCHHHHHHHhhCCeEEEEEeCCceEEEEeeeecCC- Confidence 1111 12455555554 334433333446777743 45579999999999999999876433 489999999821 Q ss_pred cCCCCCCchhhhhhhHHHHHHHHHHHHHHHhhhcCCceeccCCCCcCCCcccccHHHHHHHHHHHHHHHhhcccccchhh Q lcl|Aclame:pro 372 NAYGVADNSYLDSETLHTSAYVLRKLKSVITSKYGRHKLASDGTRFGPGQAIVTPAVIKGELLATYRQLERAGIVENYEL 451 (498) Q Consensus 372 n~~G~~D~s~ldi~t~~tl~yv~~~~r~~~~~~~~r~kla~dg~~~~~g~~ivTp~~ikaeli~~~~~le~~given~~~ 451 (498) .+..|+.|.+.|+.+++.+.+|..+. .|-.+.+..+ +-..+|..+-+.+++|..+|.|.+++. T Consensus 618 -----d~sdWryInVRRl~D~Ie~sIr~al~-~yVgEPNd~~-----------tr~~Ik~sI~afL~~L~r~GAI~Gykv 680 (717) T protein:vir:79 618 -----AGSDYTRLSTARIVKEAVNAVREVAD-PFIGEPNDTG-----------NRNALTAAVDKRLSKMIENKALLGFDF 680 (717) T ss_pred -----CCcccceeehhhhHHHHHHHHHHHHH-HhccccCCHH-----------HHHHHHHHHHHHHHHHHhcCceeccee Confidence 12349999999999999999999775 6887764433 347899999999999999999998753 Q ss_pred hcCeEEEEEcCCCCeEEEEEeeeEEecCeEEEeeeeeeEEEecc Q lcl|Aclame:pro 452 FKQYLVVERDASVPNRLNTLFPPDYVNQLRVFAVVNQFRLQYSE 495 (498) Q Consensus 452 ~~~~lvVerd~~d~nRvn~~~p~~~vn~l~v~A~~~~f~lq~~~ 495 (498) .+....+..+.+++.+.+....+..++.|= |++...- T Consensus 681 ---dvtnT~~di~~G~l~V~I~vaPv~PaEfI~----ititITA 717 (717) T protein:vir:79 681 ---RLVVTPQQELLGEGSIELSLEAPNELRRLT----TIVSLSA 717 (717) T ss_pred ---eEecChhHhhCCEEEEEEEEEecCcccEEE----EEEEEeC Confidence 344444555566777777666666666552 2222222 No 35 >protein:vir:108052 Length: 660 # NCBI annotation: gp18 tail sheath protein # Family: family:all:661 # MgeID: mge:2002 # MgeName: JS98 # Cross-refs: genbank:acc:YP_001595294;genbank:gi:161622600;genbank:GeneID:5783771 Probab=99.46 E-value=2.8e-12 Score=83.92 Aligned_cols=442 Identities=11% Similarity=0.035 Sum_probs=222.1 Q ss_pred cccccCeEEEE-EecCCCCCCCCCccEEEEEecCCCCccccceeEEecChHHHHHhhCc---CcHHHHHHHHHHHhCCCc Q lcl|Aclame:pro 10 SNTLVPLFYAE-MDNQAANTAQDSGASLLIGHANNGAEIVANSLVLMPSADYARQICGA---GSQLARMVEAYRQTDPFG 85 (498) Q Consensus 10 ~~~rvPg~y~E-~dns~a~~~~~~~~vLliGq~~~~g~~~~~~~~~v~s~~~A~~~fG~---GS~l~~M~~a~~~~n~~~ 85 (498) -++..||+|+| +|-++.=.........++|. ...++.++|++|+|..|-...||. .+-+..|++.|+-+ --+ T Consensus 1 ~~~~~Pgvyv~e~~~~~~i~~~~t~~~~~vg~---~~~gp~~~p~~v~s~~~~~~~fg~~~~~~~~~~~~~~~f~~-~g~ 76 (660) T protein:vir:10 1 MALLSPGIELKETSVQSTVVRNATGRAALVGK---FQWGPAFQVTQITNEVELVDLFGGPNNEVADYFMSGMNFLQ-YGN 76 (660) T ss_pred CceecCceEEEeecCCccccCCCcccceEEee---cCCCCCccCeEcCCHHHHHHHcCCcCCCchhHHHHHHHHHh-CCc Confidence 56789999998 75444322233456678885 446788999999999999999996 35667778887754 455 Q ss_pred eEEEEEecCCcc----ceeEEEEEEeee-----ccCCcEEEEEEccEEEE------------------------------ Q lcl|Aclame:pro 86 ELYVIAVPEATG----AAATVTLTVTGE-----ATESGTVNVYVGRTRVQ------------------------------ 126 (498) Q Consensus 86 ~l~~i~l~d~ag----~aatg~ititgt-----at~~G~l~l~I~g~~v~------------------------------ 126 (498) .+|++.+.+..- ......+.++.. .+.+..+.+.+.+.... T Consensus 77 ~~~vvrv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~v~~~~~~~~~~~~v~~~~a~g~~~~~~~~ta~~~~~a~~ 156 (660) T protein:vir:10 77 DLRTVRVVSREFAKNASPIAGNIETTITTAGSNYAVGDKINIKYNQTVVESEGRVTSVDTDGKILSVFIPSAKIIAYARS 156 (660) T ss_pred eEEEEEecccccccccccccccceeEEeeccccccccceeeEeeccccccccccceeeccccceeeeccccccccccccc Confidence 799999965421 111112222211 12233344443322110 Q ss_pred ------------EEeecCCCHHHHHHHHHHHHh-cCCCceEEEe--------------eccceEEEeeccCcccccceeE Q lcl|Aclame:pro 127 ------------APVTNGDNVTTIASSIQDAIN-AVPTLPFTAS--------------SSAGVVTLTARHKGLCGNEIPV 179 (498) Q Consensus 127 ------------v~V~~gdtaa~iA~~l~~aIn-~~~~lpVtA~--------------~~~~~VtlTAk~kG~~gN~i~l 179 (498) ..+....+....+..+...++ +...++++.. .......++++..|..||.+.+ T Consensus 157 v~~~~~~~~~~~~~~~~~~~~~~~a~sv~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~g~~G~~i~v 236 (660) T protein:vir:10 157 LNQYPTLGPAWTAEVTSASSGVSGTITVGKIVTDSGILLTEAENSEEAITSLEFQAALKKFAMPGVVALYPGEIGSTLEV 236 (660) T ss_pred cccccccccceeEEEecccCccccceeeeeeeccCcceEEeeeccccccccccceeeccccccceeeeecccccCcceeE Confidence 000000000000100000000 0001111110 0112245566677777776655 Q ss_pred EEEecccCccc------cc-----------------------------ccce---------------------------- Q lcl|Aclame:pro 180 SLNYYGFGGGE------VL-----------------------------PAGV---------------------------- 196 (498) Q Consensus 180 ~~~~~~~~~ge------~~-----------------------------p~Gl---------------------------- 196 (498) .+.......+. .. ..|. T Consensus 237 ~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 316 (660) T protein:vir:10 237 EIVSKAAYEAGSSKMLDVYPGGGTRASIAKAVFNYGPQTDDQYAIIVRRDGAIVESVVLSTKEGEKDVYGNNIYLDDYFA 316 (660) T ss_pred EEeeccccCCcceeEEeeeeccceeeEEeeeecccccccccccccccccCCcccceeeeeccccccccccceeeeehhhc Confidence 44211000000 00 0000 Q ss_pred ---e--------------eeecccCCCcC------cchhhhHHHhhcc---CcceEEEecCCC----h---HHHHHHHHH Q lcl|Aclame:pro 197 ---Q--------------IAVATGTAGTG------APVLTGAVAAMAD---EPFDYIGLPFND----T---ASVNTLVTE 243 (498) Q Consensus 197 ---t--------------~tit~~agGag------~pD~~~alaalg~---~~~~~I~~p~tD----~---a~l~al~~~ 243 (498) . ......+||.+ ..|+..++.++.+ ..++++++|-.. . +-..+|.+| T Consensus 317 ~~~~~~v~~~~~~~~~~~~~~~~l~gg~~~~~~~t~~~~~~~~~~~~~~~~~~~~~l~~p~~~~~~~~~~~~v~~al~~~ 396 (660) T protein:vir:10 317 KGTSNYIYATSLNWPKGFSGIINLSGGISANDKVTAGDLMQGWDLFADREALHINLLIAGAVAGEGDEVASTVQKHVVSI 396 (660) T ss_pred CCCccEEEEEeccCCCCcccceeeeccccCccccccchhhhhhhhhhhhhhcccceEEEcCcCCCchhhhHHHHHHHHHH Confidence 0 00011222222 1244556666654 357888876422 1 223444444 Q ss_pred HhhhhhhhhhhhheeeEEEEe-c---------cCCHHHHHhhhhc----------cCcceEEEEecCC---------CCC Q lcl|Aclame:pro 244 MNDTSGRWSYARQLYGHVYTA-K---------TGTLSELVNAGDQ----------FNQQHITLAGYEK---------ETQ 294 (498) Q Consensus 244 l~~~s~r~~~~~q~~g~~~~~-~---------~gt~~~~~t~g~~----------~N~~~~t~~~~~~---------~~~ 294 (498) .+. |. +.+++.. . .-++.++..+-.. +++.+..+.+... ... T Consensus 397 ~~~---~~------~~~aiid~p~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~ 467 (660) T protein:vir:10 397 ADE---RQ------DCLAFISPPKGLLVNVPLTRAVDNLIDWRTGAGTFDANNMNISTTYAAIDGNYKYQYDKYNDVNRW 467 (660) T ss_pred HHh---hC------CEEEEEecCcccccccccccCHHHHHHHHhhcccccccccccCcceEEEEcCceEEecccCCceeE Confidence 432 21 2222222 1 1256676666542 4577766654211 011 Q ss_pred CcHHHHHHHHHHHhhhh-hccCcc-ccccc-e-EEeccccCCCccccChHHHHHHHhCCeeEEEE--c-CCeEEEEeeee Q lcl|Aclame:pro 295 TPADELAASRTARAAVF-IRNDPA-RPTQT-G-ELVGMLPAPKGKRFTMTEQQTLLSHGVATAYV--E-SGVLRIQRDVT 367 (498) Q Consensus 295 ~p~~~~AAa~~a~~a~~-l~~DPA-rpl~t-l-~L~Gl~~p~~~~r~~~~er~~lL~~Gist~~v--~-~G~v~IeR~IT 367 (498) .|+. +.+|+..|.- .+..|- .|-+. + .+.|+ ...+-.++..|++.|..+||.++.. . .| .++-=.-| T Consensus 468 ~p~s---g~~AGl~Ar~D~~~g~~~sPan~~~~~i~g~--~~~~~~~~~~e~~~Ln~~gIn~i~~~~~~~G-~~~wG~rT 541 (660) T protein:vir:10 468 VPLA---ADLAGLCARTDDVSQPWMSPAGYNRGQILNV--LKLAIEPRQAQRDRMYQEAINPVVGFAGGDG-FVLFGDKT 541 (660) T ss_pred echh---HHHHHHHHHhhccCCcEEccCCeeeceeecc--ceeeecCChhhHHhHhhCCceEEEEeeCCCc-EEEEcccc Confidence 2433 3334444421 112221 12221 1 23343 3345578999999999999988754 2 34 44433333 Q ss_pred eeeecCCCCCCc-hhhhhhhHHHHHHHHHHHHHHHhhhcCCceeccCCCCcCCCcccccHHHHHHHHHHHHHHHhhcccc Q lcl|Aclame:pro 368 TYRKNAYGVADN-SYLDSETLHTSAYVLRKLKSVITSKYGRHKLASDGTRFGPGQAIVTPAVIKGELLATYRQLERAGIV 446 (498) Q Consensus 368 TY~~n~~G~~D~-s~ldi~t~~tl~yv~~~~r~~~~~~~~r~kla~dg~~~~~g~~ivTp~~ikaeli~~~~~le~~giv 446 (498) . ..|+ .|+.|.+.|..+|+.+.++.... .|-.+.+... +-+.||..+-+.+++|..+|.+ T Consensus 542 ~-------~~~~s~~~~i~vrR~~~~i~~si~~~~~-~~v~epn~~~-----------l~~~i~~~i~~fL~~l~~~gal 602 (660) T protein:vir:10 542 A-------TKVPSPMDHINVRRLFNMLKKNIGDASK-YKLFELNDNF-----------TRSSFRMEVSQYLDGIKALGGI 602 (660) T ss_pred c-------CCCCcccceEehhhHHHHHHHHHHHHHH-HhccCCCCHH-----------HHHHHHHHHHHHHHHHHhCCce Confidence 2 2344 58999999999999999998664 4544433222 4578999999999999999999 Q ss_pred cchhhhcCeEEEEEc--CC---CCeEEEEEeeeEEecCeEEEeeeeeeEEEecccCC Q lcl|Aclame:pro 447 ENYELFKQYLVVERD--AS---VPNRLNTLFPPDYVNQLRVFAVVNQFRLQYSEESA 498 (498) Q Consensus 447 en~~~~~~~lvVerd--~~---d~nRvn~~~p~~~vn~l~v~A~~~~f~lq~~~~~~ 498 (498) ..+. +.|..+ ++ +.+|+.+.+-...+..++- |.|++|....-+ T Consensus 603 ~g~~-----V~~d~~~nt~~di~~G~~~~~i~~~P~~pae~----I~~~~~~~~~~~ 650 (660) T protein:vir:10 603 YEGR-----VVCDTTVNTPAVIDRNEFIANIYVKPARSINY----ITLNFVATSTGA 650 (660) T ss_pred eeeE-----EEEcCCCCCHHHhhCCeEEEEEEEEecCCccE----EEEEEEEeecCc Confidence 8842 333322 11 2457777776666666554 445554333333 No 36 >protein:vir:6894 Length: 660 # NCBI annotation: gp18 tail sheath protein # Family: family:all:661 # MgeID: mge:140 # MgeName: RB69 # Cross-refs: genbank:acc:NP_861870;genbank:gi:32453661;genbank:GeneID:1494296 Probab=99.43 E-value=2.8e-12 Score=83.95 Aligned_cols=452 Identities=12% Similarity=0.043 Sum_probs=215.7 Q ss_pred cccccCeEEEE-EecCCCCCCCCCccEEEEEecCCCCccccceeEEecChHHHHHhhCc---CcHHHHHHHHHHHhCCCc Q lcl|Aclame:pro 10 SNTLVPLFYAE-MDNQAANTAQDSGASLLIGHANNGAEIVANSLVLMPSADYARQICGA---GSQLARMVEAYRQTDPFG 85 (498) Q Consensus 10 ~~~rvPg~y~E-~dns~a~~~~~~~~vLliGq~~~~g~~~~~~~~~v~s~~~A~~~fG~---GS~l~~M~~a~~~~n~~~ 85 (498) -+..-||+|+| +|-+..-.........+||. ...++.++|++|+|..|-...||. .+.+..|++.++-++ -+ T Consensus 1 ~~~~~PgVyv~e~~~~~~i~~v~ts~~~fvG~---~~~Gp~~~p~~i~s~~~~~~~fG~~~~~~~~~~~~~~~f~~~-g~ 76 (660) T protein:vir:68 1 MALLSPGVELKETTVQSTVVNNSTGTAALAGK---FQWGPAFQIKQITDEVALVDMFGTPNTDTADYFMSAMNFLQY-GN 76 (660) T ss_pred CccccCceEEEEecCCcccccCCCcceeEEec---ccCCCCccCEEecCHHHHHHhcCCccCccchhHHHHHHHHhC-CC Confidence 45788999998 54333311222446677874 446788999999999999999994 677888888888654 45 Q ss_pred eEEEEEecCCcc----ceeEEEEEEeeecc-----CCcEEEEEEcc----------------EEEEE------------- Q lcl|Aclame:pro 86 ELYVIAVPEATG----AAATVTLTVTGEAT-----ESGTVNVYVGR----------------TRVQA------------- 127 (498) Q Consensus 86 ~l~~i~l~d~ag----~aatg~ititgtat-----~~G~l~l~I~g----------------~~v~v------------- 127 (498) .+|++.+.+... .+..+.+.++-... ....+.+.+++ ....+ T Consensus 77 ~~~vvRv~~~~~~~~~~~~~~~~~~t~~~~g~~~~~g~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~ta~~~~~a~~ 156 (660) T protein:vir:68 77 DLRVVRAVDRDTAKNSSPVAGNINFTISSAGTNYRVGDKVVVKYSTDIIEPDGEVTSVDSDGKILNIFIPSGKIIAKAKE 156 (660) T ss_pred eEEEEEecccccccccccccccceeeeeccCcceeeeeeeeeecccccccccccceeeeecCceeeeeecccccccccee Confidence 899999875321 11122222221110 01111111111 10100 Q ss_pred -------------EeecCCCHHHHHHHHHHHHhcCCC--ceEEEee--------------ccceEEEeeccCccccccee Q lcl|Aclame:pro 128 -------------PVTNGDNVTTIASSIQDAINAVPT--LPFTASS--------------SAGVVTLTARHKGLCGNEIP 178 (498) Q Consensus 128 -------------~V~~gdtaa~iA~~l~~aIn~~~~--lpVtA~~--------------~~~~VtlTAk~kG~~gN~i~ 178 (498) .+.++......+...... ..+.. +++..++ ..+..-+.++..|+.||.|. T Consensus 157 ~~~~~~~~~~~~~~v~~~~~~~~~~~~v~~~-~~d~~~~~~~~~ta~~~~~~~~~~~~~~~~~~~~~~A~~~g~~G~~i~ 235 (660) T protein:vir:68 157 IGEYPELGSNWTAEMSGSSSGLSAVITIDSV-VMDSGILLTEVETSEEAITSLTFQESIKKYGVPGVVALYPGELGDQLE 235 (660) T ss_pred eccccccccceeEEeecccccceeeeeeccc-cccccceeeeeccccccccccceeeeecccCccccccccccccccceE Confidence 000000000000000000 00000 0010000 00111133556677777776 Q ss_pred EEEEecccCcc-------------------------------c-----------------------------------cc Q lcl|Aclame:pro 179 VSLNYYGFGGG-------------------------------E-----------------------------------VL 192 (498) Q Consensus 179 l~~~~~~~~~g-------------------------------e-----------------------------------~~ 192 (498) +.+-....... + .. T Consensus 236 v~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 315 (660) T protein:vir:68 236 IEIVSKADYDKGASAQLKIYPDGGTRYSTAKAIFGYGPQTDDQYAIIVRRNDSVVQSVVLSTKRGERDIYGSNIFIDDFF 315 (660) T ss_pred EEEeccccccccccccceeeecccccccceeeEeecccccccceeeeeecCCcceeeeeeecccccccccccceeeehhh Confidence 54311100000 0 00 Q ss_pred cccee----ee----------ecccCCCcC------cchhhhHHHhh---ccCcceEEEecCCCh-------HHHHHHHH Q lcl|Aclame:pro 193 PAGVQ----IA----------VATGTAGTG------APVLTGAVAAM---ADEPFDYIGLPFNDT-------ASVNTLVT 242 (498) Q Consensus 193 p~Glt----~t----------it~~agGag------~pD~~~alaal---g~~~~~~I~~p~tD~-------a~l~al~~ 242 (498) ..|.. .. .....||.. +-++..++..+ +....++++++.... +-..++.. T Consensus 316 ~~~~~~~v~~~~~~~~~~~~~~~~~~gg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~v~~~l~~ 395 (660) T protein:vir:68 316 AKGASNYIFATAQGWPKGFSGVIKLNGGLSSNETVEAGDLMEAWDLFADRESVNAQLFIAGSCAGESLEVASTVQKHVVA 395 (660) T ss_pred ccCcccEEEEeecCCCccccceeeeccccccccccccchhhhHHHHhhhhhccccceeeccccCCCchHHHHHHHHHHHH Confidence 00000 00 001223321 11233443333 334445555544322 12334444 Q ss_pred HHhhhhhhhhhhh-heeeEEEEeccCCHHHHHhhhhc----------cCcceEEEEecC--------CC-CCCcHHHHHH Q lcl|Aclame:pro 243 EMNDTSGRWSYAR-QLYGHVYTAKTGTLSELVNAGDQ----------FNQQHITLAGYE--------KE-TQTPADELAA 302 (498) Q Consensus 243 ~l~~~s~r~~~~~-q~~g~~~~~~~gt~~~~~t~g~~----------~N~~~~t~~~~~--------~~-~~~p~~~~AA 302 (498) |.+...+|.--.. .+..+.-....-++.++..+-.. .|+.+..+.+.. +. ...|+....| T Consensus 396 ~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~sg~~A 475 (660) T protein:vir:68 396 IGDSRQDCLVLCSPPRAAVVGIPVNRAVDNLVDWRTASGTYTDNNFNISSTYAAIDGNYKYQYDKYNDVNRWVPLAADIA 475 (660) T ss_pred HHHhhCCeEEEEcccceeEecCCCCCCHHHHHHHHhhcccccccccccCcceEEEEcCceEEecccCCceEEechhHHHH Confidence 4432111110000 01111111223455666655442 467777665321 00 1124433333 Q ss_pred HHHHHhhhhhccCccc-ccc-ce-EEeccccCCCccccChHHHHHHHhCCeeEEEEcCCe-EEEEeeeeeeeecCCCCCC Q lcl|Aclame:pro 303 SRTARAAVFIRNDPAR-PTQ-TG-ELVGMLPAPKGKRFTMTEQQTLLSHGVATAYVESGV-LRIQRDVTTYRKNAYGVAD 378 (498) Q Consensus 303 a~~a~~a~~l~~DPAr-pl~-tl-~L~Gl~~p~~~~r~~~~er~~lL~~Gist~~v~~G~-v~IeR~ITTY~~n~~G~~D 378 (498) .+.|+... +..|-. |-+ .+ .+.|.. .....++..|++.|..+||.++..-.|. .++--.-|. ..| T Consensus 476 Gl~Ar~d~--~~g~~~span~~~~~i~g~~--~~~~~~~~~~~~~Ln~~gIn~i~~~~~~G~~~wG~rT~-------~~~ 544 (660) T protein:vir:68 476 GLCARTDN--ISQPWMSPAGYNRGQILNVI--KLAIETRQAQRDRLYQEAINPVTGTGGDGYVLYGDKTA-------TSV 544 (660) T ss_pred HHHHHHhc--cCCcEEccCCeeeceeeccc--eeeecCChhHHHHHhhCCceEEEEecCCeEEEEcceec-------CCC Confidence 33333321 222311 212 11 233432 3345678999999999999998753322 555555553 134 Q ss_pred c-hhhhhhhHHHHHHHHHHHHHHHhhhcCCceeccCCCCcCCCcccccHHHHHHHHHHHHHHHhhcccccchhhhcCeEE Q lcl|Aclame:pro 379 N-SYLDSETLHTSAYVLRKLKSVITSKYGRHKLASDGTRFGPGQAIVTPAVIKGELLATYRQLERAGIVENYELFKQYLV 457 (498) Q Consensus 379 ~-s~ldi~t~~tl~yv~~~~r~~~~~~~~r~kla~dg~~~~~g~~ivTp~~ikaeli~~~~~le~~given~~~~~~~lv 457 (498) + .|+.|.+.|..+|+.+.++.... .|--+.+... +-+.||..+-..+++|.++|.+..|. +. T Consensus 545 ~s~~~~i~vrR~~~~i~~si~~~~~-~~v~epn~~~-----------~~~~i~~~i~~~L~~l~~~gal~gf~-----V~ 607 (660) T protein:vir:68 545 PSPFDRINVRRLFNMVKTNIGSASK-YRLFELNNAF-----------TRSSFRTETSQYLQGIKALGGVYNFK-----VV 607 (660) T ss_pred CcccceEehhhHHHHHHHHHHHHHH-HhccCCCCHH-----------HHHHHHHHHHHHHHHHHhcCceeeeE-----EE Confidence 4 59999999999999999999765 4443433221 45788999999999999999999842 33 Q ss_pred EEEcC--C---CCeEEEEEeeeEEecCeEEEeeeeeeEEEecccCC Q lcl|Aclame:pro 458 VERDA--S---VPNRLNTLFPPDYVNQLRVFAVVNQFRLQYSEESA 498 (498) Q Consensus 458 Verd~--~---d~nRvn~~~p~~~vn~l~v~A~~~~f~lq~~~~~~ 498 (498) +.++. + +.+|+.+.+-...+..++-| .|++|.....+ T Consensus 608 ~d~~~nt~~~i~~G~~~~~i~~~p~~pae~i----~l~~~~~~~~~ 649 (660) T protein:vir:68 608 CDTTNNTPAVIDRNEFVATFYLQPARSINYI----TLNFVATATGA 649 (660) T ss_pred EecCCCCHHHhhCCeEEEEEEEEecCCcceE----EEEEEEeecCc Confidence 43321 1 23677777777776666543 34544332322 No 37 >protein:vir:7206 Length: 659 # NCBI annotation: gp18 tail sheath protein # Family: family:all:661 # MgeID: mge:142 # MgeName: T4 # Cross-refs: genbank:acc:NP_049780;genbank:gi:9632592;genbank:GeneID:1258597 Probab=99.43 E-value=1e-11 Score=80.90 Aligned_cols=443 Identities=13% Similarity=0.062 Sum_probs=210.1 Q ss_pred cccccCeEEEEEecCCCCCC-CCCccEEEEEecCCCCccccceeEEecChHHHHHhhC---cCcHHHHHHHHHHHhCCCc Q lcl|Aclame:pro 10 SNTLVPLFYAEMDNQAANTA-QDSGASLLIGHANNGAEIVANSLVLMPSADYARQICG---AGSQLARMVEAYRQTDPFG 85 (498) Q Consensus 10 ~~~rvPg~y~E~dns~a~~~-~~~~~vLliGq~~~~g~~~~~~~~~v~s~~~A~~~fG---~GS~l~~M~~a~~~~n~~~ 85 (498) =+.+.||+|+|--.+..... .......+||. ...++.++|++|+|..|-...|| ..+-+..|++.|+.++-. T Consensus 1 ~~~~~PgVyvee~~~~~~~~~~~ts~~~fvG~---~~~Gp~~~p~~i~s~~~~~~~fG~~~~~~~~~~~~~~~f~ngg~- 76 (659) T protein:vir:72 1 MTLLSPGIELKETTVQSTVVNNSTGTAALAGK---FQWGPAFQIKQVTNEVDLVNTFGQPTAETADYFMSAMNFLQYGN- 76 (659) T ss_pred CceecCceEEEEecCCcccccCCCcceEEEee---cCCCCCcccEEecCHHHHHHHcCCcCCCCchhHHHHHHHHhCCc- Confidence 45789999999544544222 23456677774 45678899999999999999999 467888899999865554 Q ss_pred eEEEEEecCCc-cceeE---EEEEEe---------------ee--c---cCCcEEEE-EEccEEEEEEeecCCCHH---- Q lcl|Aclame:pro 86 ELYVIAVPEAT-GAAAT---VTLTVT---------------GE--A---TESGTVNV-YVGRTRVQAPVTNGDNVT---- 136 (498) Q Consensus 86 ~l~~i~l~d~a-g~aat---g~itit---------------gt--a---t~~G~l~l-~I~g~~v~v~V~~gdtaa---- 136 (498) .+|++.+.+.. ...++ ..+.++ +. + -..|.+.. .-.+....+.+......+ T Consensus 77 ~~~vvRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~g~v~~~~~~~~~~~~~v~t~~~~a~~~~ 156 (659) T protein:vir:72 77 DLRVVRAVDRDTAKNSSPIAGNIDYTISTPGSNYAVGDKITVKYVSDDIETEGKITEVDADGKIKKINIPTGKNYAKAKE 156 (659) T ss_pred eEEEEEccCCcccccccccccccceeecccccccccceeeeeeeccccccccceEEEeeccccceeeeeccccccccccc Confidence 79999986521 11111 111111 10 0 00001000 000000000000000000 Q ss_pred ------HHHH---HHHHHHhc-CCCceEEEe----------ec-----------------cceEEEeeccCcccccceeE Q lcl|Aclame:pro 137 ------TIAS---SIQDAINA-VPTLPFTAS----------SS-----------------AGVVTLTARHKGLCGNEIPV 179 (498) Q Consensus 137 ------~iA~---~l~~aIn~-~~~lpVtA~----------~~-----------------~~~VtlTAk~kG~~gN~i~l 179 (498) .+.. .+.+.... ...+.+... .. .+.-.+.+...|..++.+.+ T Consensus 157 ~~~~~~~~~~~~~~~~~~~~~~a~~~~~v~v~~~~~~~~~~v~~~~~a~~~~~~~~~v~~~~~~~~~a~~~gt~g~~~tv 236 (659) T protein:vir:72 157 VGEYPTLGSNWTAEISSSSSGLAAVITLGKIITDSGILLAEIENAEAAMTAVDFQANLKKYGIPGVVALYPGELGDKIEI 236 (659) T ss_pred cccccccccceeeEEeeccccccceEEEEEeecCcceeeeeccccchhhhcccccccccccccceeeeccccccccceeE Confidence 0000 00000000 000000000 00 00000111122222222211 Q ss_pred EEE-------------------------------ecc-----------------------cCcccc-------------- Q lcl|Aclame:pro 180 SLN-------------------------------YYG-----------------------FGGGEV-------------- 191 (498) Q Consensus 180 ~~~-------------------------------~~~-----------------------~~~ge~-------------- 191 (498) .+. +-. ...+.. T Consensus 237 ~i~~~~~~~~~~~~~v~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 316 (659) T protein:vir:72 237 EIVSKADYAKGASALLPIYPGGGTRASTAKAVFGYGPQTDSQYAIIVRRNDAIVQSVVLSTKRGEKDIYDSNIYIDDFFA 316 (659) T ss_pred EEccccccccceeeeeecccccccccccceeeeeeecccccccceeeecccceeeeeeeeeccccccccchhhhhhhhhh Confidence 110 000 000000 Q ss_pred -------------cccceeeeecccCCCc------CcchhhhHHHhhcc---CcceEEEecCCCh-------HHHHHHHH Q lcl|Aclame:pro 192 -------------LPAGVQIAVATGTAGT------GAPVLTGAVAAMAD---EPFDYIGLPFNDT-------ASVNTLVT 242 (498) Q Consensus 192 -------------~p~Glt~tit~~agGa------g~pD~~~alaalg~---~~~~~I~~p~tD~-------a~l~al~~ 242 (498) .|.+. ......+||. ...|+..++.++.. ..++++++|--.. +...++.+ T Consensus 317 ~~~~~~v~~~~~~~~~~~-~~~~~l~gg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~p~~~~~~~~~~~~v~~~l~~ 395 (659) T protein:vir:72 317 KGGSEYIFATAQNWPEGF-SGILTLSGGLSSNAEVTAGDLMEAWDFFADRESVDVQLFIAGSCAGESLETASTVQKHVVS 395 (659) T ss_pred cCCceEEEEEecccCCcc-cccccccccccccccccchhHHHHHHHhhhccccceeEEEecCCCCcchhhhHHHHHHHHH Confidence 00000 0111233443 22355667776653 3588998874321 22334444 Q ss_pred HHhhhhhhhhhhhheeeEEEEe----------ccCCHHHHHhhhhc----------cCcceEEEEecC--------C-CC Q lcl|Aclame:pro 243 EMNDTSGRWSYARQLYGHVYTA----------KTGTLSELVNAGDQ----------FNQQHITLAGYE--------K-ET 293 (498) Q Consensus 243 ~l~~~s~r~~~~~q~~g~~~~~----------~~gt~~~~~t~g~~----------~N~~~~t~~~~~--------~-~~ 293 (498) +.+. +++..++.. ...+..++..+-.. .|+.+..+.+.. + .. T Consensus 396 ~~~~---------~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~ 466 (659) T protein:vir:72 396 IGDA---------RQDCLVLCSPPRETVVGIPVTRAVDNLVNWRTAAGSYTDNNFNISSTYAAIDGNHKYQYDKYNDVNR 466 (659) T ss_pred HHhh---------hCCEEEEEcCccccccCCCcccCHHHHHHHHhhccccccccccccceeEEEEcCceeeccccCCceE Confidence 4432 222222221 12334555554332 467776654321 1 11 Q ss_pred CCcHHHHHHHHHHHhhhhhccCcc-ccccceEEeccc-cCCCccccChHHHHHHHhCCeeEEEEc--CCeEEEEeeeeee Q lcl|Aclame:pro 294 QTPADELAASRTARAAVFIRNDPA-RPTQTGELVGML-PAPKGKRFTMTEQQTLLSHGVATAYVE--SGVLRIQRDVTTY 369 (498) Q Consensus 294 ~~p~~~~AAa~~a~~a~~l~~DPA-rpl~tl~L~Gl~-~p~~~~r~~~~er~~lL~~Gist~~v~--~G~v~IeR~ITTY 369 (498) ..|+.-..|.+.|+.-. +..|- .|-+ ..+.|+. +-+..-.++..|++.|..+||.++..- .| .++--.-|. T Consensus 467 ~~p~sg~vAGl~Ar~D~--~~G~~~span-~~~~~i~g~~~~~~~~~~~~~~~Ln~~gIn~i~~~~g~G-~~~wG~rT~- 541 (659) T protein:vir:72 467 WVPLAADIAGLCARTDN--VSQTWMSPAG-YNRGQILNVIKLAIETRQAQRDRLYQEAINPVTGTGGDG-YVLYGDKTA- 541 (659) T ss_pred EechHHHHHHHHHHhhc--cCCcEEccCC-eeeceeeccccccccCChhHHHHHhhCCceEEEEecCCe-EEEEccccc- Confidence 12443333333333321 11220 1222 2233332 223455678999999999999998653 45 344444443 Q ss_pred eecCCCCCCc-hhhhhhhHHHHHHHHHHHHHHHhhhcCCceeccCCCCcCCCcccccHHHHHHHHHHHHHHHhhcccccc Q lcl|Aclame:pro 370 RKNAYGVADN-SYLDSETLHTSAYVLRKLKSVITSKYGRHKLASDGTRFGPGQAIVTPAVIKGELLATYRQLERAGIVEN 448 (498) Q Consensus 370 ~~n~~G~~D~-s~ldi~t~~tl~yv~~~~r~~~~~~~~r~kla~dg~~~~~g~~ivTp~~ikaeli~~~~~le~~given 448 (498) ..|+ .|+.|...|+.+|+.+.++.... .|-.+.+... +-+.||..+-+.+++|..+|.++. T Consensus 542 ------~~~~s~~~~i~vrR~~~~i~~si~~~~~-~~v~e~n~~~-----------l~~~i~~~i~~fL~~l~~~gal~~ 603 (659) T protein:vir:72 542 ------TSVPSPFDRINVRRLFNMLKTNIGRSSK-YRLFELNNAF-----------TRSSFRTETAQYLQGNKALGGIYE 603 (659) T ss_pred ------CCCCcccceEeehhHHHHHHHHHHHHHH-HhhcCCCCHH-----------HHHHHHHHHHHHHHHHHhcCceee Confidence 1233 69999999999999999998664 4444433221 457789999999999999999986 Q ss_pred hhhhcCeEEEEEcC--C---CCeEEEEEeeeEEecCeEEEeeeeeeEEEecccCC Q lcl|Aclame:pro 449 YELFKQYLVVERDA--S---VPNRLNTLFPPDYVNQLRVFAVVNQFRLQYSEESA 498 (498) Q Consensus 449 ~~~~~~~lvVerd~--~---d~nRvn~~~p~~~vn~l~v~A~~~~f~lq~~~~~~ 498 (498) | .+.|..+. + +.+|+.+.+-...+-.++- |.|++|..-..+ T Consensus 604 ~-----~V~~d~~~nt~~~i~~G~~~~~i~~~p~~pae~----I~~~~~~~~~~~ 649 (659) T protein:vir:72 604 Y-----RVVCDTTNNTPSVIDRNEFVATFYIQPARSINY----ITLNFVATATGA 649 (659) T ss_pred E-----EEEEcCCCCCHHHhhCCeEEEEEEEEecCCccE----EEEEEEEeecCc Confidence 5 24443221 1 3357777777666666554 344555322322 No 38 >protein:vir:101804 Length: 663 # NCBI annotation: gp18 # Family: family:all:661 # MgeID: mge:1580 # MgeName: 31 # Cross-refs: genbank:acc:YP_238881;genbank:gi:66391956;genbank:GeneID:3416631 Probab=99.42 E-value=4.2e-12 Score=82.96 Aligned_cols=440 Identities=12% Similarity=0.056 Sum_probs=219.1 Q ss_pred cccccCeEEEE-EecCCCCCCCCCccEEEEEecCCCCccccceeEEecChHHHHHhhCc---CcHHHHHHHHHHHhCCCc Q lcl|Aclame:pro 10 SNTLVPLFYAE-MDNQAANTAQDSGASLLIGHANNGAEIVANSLVLMPSADYARQICGA---GSQLARMVEAYRQTDPFG 85 (498) Q Consensus 10 ~~~rvPg~y~E-~dns~a~~~~~~~~vLliGq~~~~g~~~~~~~~~v~s~~~A~~~fG~---GS~l~~M~~a~~~~n~~~ 85 (498) -....||+|+| +|-++.=.........++|. ...++.++|++|+|..|-...||. .+-+..|++.|+. |--+ T Consensus 1 ~~~~~Pgvyv~e~~~~~~i~~~~t~~~~~vG~---~~~Gp~~~p~~v~~~~~~~~~fg~~~~~~~~~~~~~~~f~-ngg~ 76 (663) T protein:vir:10 1 MALLSPGIEMKETSINSTVVRSATGRAAIVGK---FAWGPAYEVRQVTNEVELVDMFGSPDNVTAPYFMSAMNFL-QYGN 76 (663) T ss_pred CceecCceEEEEecCCccccccCcccceeEee---cccCCCCccEEecCHHHHHHhcCCcCCcchhHHHHHHHHH-hCCC Confidence 45788999998 54333312222446677875 446778999999999999999996 5677788888875 4456 Q ss_pred eEEEEEecCCcccee------EEEEEEeeec---cCCcEEEEEEcc----------------EEEEEEee---------- Q lcl|Aclame:pro 86 ELYVIAVPEATGAAA------TVTLTVTGEA---TESGTVNVYVGR----------------TRVQAPVT---------- 130 (498) Q Consensus 86 ~l~~i~l~d~ag~aa------tg~ititgta---t~~G~l~l~I~g----------------~~v~v~V~---------- 130 (498) .+|++.+.+....++ ...+++.... +.+-.+.+.+.+ .++.+... T Consensus 77 ~~~vvRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~g~~i~v~~~~~~~~~~~~~~~~~~~~~~~~v~~~ta~~~~~~~~ 156 (663) T protein:vir:10 77 DLRLVRVIDMEKAKNASPLVNQVSVTITTEGQGYTVGDAITVKYNNATITEAGKVTAVDSDGKIKSLFVPTAEIIAKTRQ 156 (663) T ss_pred eEEEEEccCCccccccccccccceeEEeecccccccccccccccccccccccccceeeecccceEEEeeccccccccccc Confidence 899999865321111 1122222111 011112221111 11111110 Q ss_pred ------------------cCCCHHHHHHHHHHHHhcCCCceEEE----------------eeccceEEEeeccCcccccc Q lcl|Aclame:pro 131 ------------------NGDNVTTIASSIQDAINAVPTLPFTA----------------SSSAGVVTLTARHKGLCGNE 176 (498) Q Consensus 131 ------------------~gdtaa~iA~~l~~aIn~~~~lpVtA----------------~~~~~~VtlTAk~kG~~gN~ 176 (498) +++.... ......++ +....+.. ........++++..|.+||. T Consensus 157 v~~~~~~~~~~~~~~s~~s~~~~~a--~~v~~v~~-d~~~~v~~~~~a~~~~t~~~~~~~~~~~~~~~i~A~~~G~~Gn~ 233 (663) T protein:vir:10 157 LGTYPTLGDNWRIDVSGASGGSAAA--LALGNIVV-DSGVTFGNSEDAPAVMTSPAVMEKYAKFGMPLISAVYPGEIGST 233 (663) T ss_pred cccceeeccceeeEeeeccCccccc--cccceecc-ccceEEeeccccccccccccccccccccccceEEeccCCcccce Confidence 0000000 00001111 11111111 01112457889999999999 Q ss_pred eeEEEEecccC-c--------------------------------------c---c----------c------------c Q lcl|Aclame:pro 177 IPVSLNYYGFG-G--------------------------------------G---E----------V------------L 192 (498) Q Consensus 177 i~l~~~~~~~~-~--------------------------------------g---e----------~------------~ 192 (498) +.+.+...... . | | . . T Consensus 234 i~V~i~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~~~~~~~~~~~~~ 313 (663) T protein:vir:10 234 VEVEIVSKTAFNSGAQQTIYPFGGTRTSNARGVIQYGPMTDDQFAIIVRRDGIVVESTVLSTRKGDRDVYGSNIFMDDYF 313 (663) T ss_pred eeeeeccccccccccccceecccccccccccceeecccccccceeeEeecCCcceeeecccccccccccccchhhhhhhh Confidence 88765321100 0 0 0 0 0 Q ss_pred cccee--------------eeecccCCCcCc------chhhhHHHhhccC---cceEEEecCCCh---H----HHHHHHH Q lcl|Aclame:pro 193 PAGVQ--------------IAVATGTAGTGA------PVLTGAVAAMADE---PFDYIGLPFNDT---A----SVNTLVT 242 (498) Q Consensus 193 p~Glt--------------~tit~~agGag~------pD~~~alaalg~~---~~~~I~~p~tD~---a----~l~al~~ 242 (498) ..|.+ ..+...+||... .|+..+++.+.+. ..+++++|-.+. . ...++-. T Consensus 314 ~~~~~~~~~~~~~~~~~~~~~~~~l~gg~d~~~~~~~~~~~~~~~~l~~~~~~~~~~~i~~~~~~~~~~~~~~v~~~l~~ 393 (663) T protein:vir:10 314 RNGGSNFIFASSEGWPAGFTGIIQLGGGTSANADVGADELIKGWDLFSDREALHVNLMIAGACGSDGAEIASTVQKYVVS 393 (663) T ss_pred cCCcceEEEEeecccCccccceeEeccccCCccccchhhhHHHHHhhhcccccceeEEEeccCCCCchhhHHHHHHHHHH Confidence 00000 001123344332 2455666666543 455666653322 1 1233333 Q ss_pred HHhhhhhhhhhhhheeeEEE-EeccC---------CHHHHHhhh-------------hccCcceEEEEecC--------C Q lcl|Aclame:pro 243 EMNDTSGRWSYARQLYGHVY-TAKTG---------TLSELVNAG-------------DQFNQQHITLAGYE--------K 291 (498) Q Consensus 243 ~l~~~s~r~~~~~q~~g~~~-~~~~g---------t~~~~~t~g-------------~~~N~~~~t~~~~~--------~ 291 (498) |.+. +++.+++ .+.++ +..++..+- ..+++.+..+.+.. + T Consensus 394 ~a~~---------~~~~~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~ 464 (663) T protein:vir:10 394 LADD---------RQDCVAIVNPPAELMVGIPTSTAVKNIVEWRNGMTGSGEVVDNNMNISSTYAFIIGNYKYQYDKYND 464 (663) T ss_pred HHHh---------hCCEEEEEecCcccccccccccchHHHHHHHHhccccccchhhhcccCccceEEEcCceEEecccCC Confidence 3322 1122222 12211 223333332 23466666665321 1 Q ss_pred C-CCCcHHHHHHHHHHHhhhhhccCcc-cccc-ce-EEeccccCCCccccChHHHHHHHhCCeeEEEE-c--CCeEEEEe Q lcl|Aclame:pro 292 E-TQTPADELAASRTARAAVFIRNDPA-RPTQ-TG-ELVGMLPAPKGKRFTMTEQQTLLSHGVATAYV-E--SGVLRIQR 364 (498) Q Consensus 292 ~-~~~p~~~~AAa~~a~~a~~l~~DPA-rpl~-tl-~L~Gl~~p~~~~r~~~~er~~lL~~Gist~~v-~--~G~v~IeR 364 (498) . ...|+....|.+.|+.-. +..|- .|-+ .+ .+.|+ ...+..++..|++.|..+||.++.. . .| .++-= T Consensus 465 ~~~~~p~sg~vAGl~Ar~D~--~~g~~~sPan~~~~~i~g~--~~~~~~~~~~e~~~Ln~~gin~i~~~~~~~G-~~~wG 539 (663) T protein:vir:10 465 INRWVPLAADIAGLCAYTDQ--VSHPWMSPAGYRRGQIRNC--IKLAIEPKQSMRDTMYQVAINPVTGFAGGDG-FVLFG 539 (663) T ss_pred ceEEechhHHHHHHHHHhhc--cCCceEccCCceecccccc--ccceeecChhHHHHHhhCCceEEEEEeCCCc-EEEEc Confidence 1 113443333333332221 11121 1222 11 23343 3446678999999999999987653 2 35 33333 Q ss_pred eeeeeeecCCCCCC-chhhhhhhHHHHHHHHHHHHHHHhhhcCCceeccCCCCcCCCcccccHHHHHHHHHHHHHHHhhc Q lcl|Aclame:pro 365 DVTTYRKNAYGVAD-NSYLDSETLHTSAYVLRKLKSVITSKYGRHKLASDGTRFGPGQAIVTPAVIKGELLATYRQLERA 443 (498) Q Consensus 365 ~ITTY~~n~~G~~D-~s~ldi~t~~tl~yv~~~~r~~~~~~~~r~kla~dg~~~~~g~~ivTp~~ikaeli~~~~~le~~ 443 (498) .-|. ..| ..|+.|...|+.+|+.+.++.... .|-.+.+... +-..||..+=..+++|..+ T Consensus 540 ~rT~-------~~~~s~~~~i~vrR~~~~i~~si~~~~~-~~v~epn~~~-----------l~~~i~~~i~~~L~~l~~~ 600 (663) T protein:vir:10 540 DKMA-------TQVPSPFDRINVRRLFNMLKKNIGDTSK-YELFENNDAF-----------TRQSFRMETSQYLDGIRSL 600 (663) T ss_pred cccc-------CCCCcccceEehhhHHHHHHHHHHHHHH-HhccCCCCHH-----------HHHHHHHHHHHHHHHHHhC Confidence 3232 123 369999999999999999998664 4444433221 3467899999999999999 Q ss_pred ccccchhhhcCeEEEEEc--CC---CCeEEEEEeeeEEecCeEEEeeeeeeEEEecccCC Q lcl|Aclame:pro 444 GIVENYELFKQYLVVERD--AS---VPNRLNTLFPPDYVNQLRVFAVVNQFRLQYSEESA 498 (498) Q Consensus 444 given~~~~~~~lvVerd--~~---d~nRvn~~~p~~~vn~l~v~A~~~~f~lq~~~~~~ 498 (498) |.+..|. +.+-++ ++ +.+|+.+.+-...+...+-| .|++|....-+ T Consensus 601 gal~g~~-----v~~d~~~nt~~~i~~G~~~~~i~~~p~~pae~i----~~~~~~~~~~~ 651 (663) T protein:vir:10 601 GGCYDFR-----VVCDTTNNTPNVIDRNEFVGTIYVKPPRSINYI----TLNMVATSTGA 651 (663) T ss_pred CceeeeE-----EEEcCCCCCHHHhhCCeEEEEEEEEecCCcceE----EEEEEEeecCc Confidence 9998852 333222 11 34778888777666665544 44444332222 No 39 >protein:vir:80984 Length: 666 # NCBI annotation: gp18 tail sheath protein # Family: family:all:661 # MgeID: mge:1888 # MgeName: Phi1 # Cross-refs: genbank:acc:YP_001469499;genbank:gi:157311456;genbank:GeneID:5602117 Probab=99.41 E-value=1.2e-11 Score=80.48 Aligned_cols=452 Identities=12% Similarity=0.046 Sum_probs=211.5 Q ss_pred cccccCeEEEE-EecCCCCCCCCCccEEEEEecCCCCccccceeEEecChHHHHHhhCc---CcHHHHHHHHHHHhCCCc Q lcl|Aclame:pro 10 SNTLVPLFYAE-MDNQAANTAQDSGASLLIGHANNGAEIVANSLVLMPSADYARQICGA---GSQLARMVEAYRQTDPFG 85 (498) Q Consensus 10 ~~~rvPg~y~E-~dns~a~~~~~~~~vLliGq~~~~g~~~~~~~~~v~s~~~A~~~fG~---GS~l~~M~~a~~~~n~~~ 85 (498) -+...||+|+| +|-+..-.........++|. ...++.++|++|+|..|-...||. .+-+.-|+.+++-+ --+ T Consensus 1 ~~~~~Pgvyv~e~~~~~~~~~~~t~~~~~vg~---~~~gp~~~p~~i~~~~~~~~~fg~~~~~~~~~~~~~~~f~~-~g~ 76 (666) T protein:vir:80 1 MTLLSPGFETKETTLSTTIVQSATGRAALVGK---FQWGPAFQIIQVTNEVELVNKFGQPDNNTADYFMSGANFLQ-YGN 76 (666) T ss_pred CceecCceEEEEecCCccccccCcccceEEec---cccCCCccceEecCHHHHHHhcCCccCccchHHHHHHHHhc-CCC Confidence 55788999998 54333312223446677775 446678999999999999999995 45666778888764 444 Q ss_pred eEEEEEecCCcc----ceeEEEEEEeeecc---------------------CCcEEEEEEccEEEEEEeec--------- Q lcl|Aclame:pro 86 ELYVIAVPEATG----AAATVTLTVTGEAT---------------------ESGTVNVYVGRTRVQAPVTN--------- 131 (498) Q Consensus 86 ~l~~i~l~d~ag----~aatg~ititgtat---------------------~~G~l~l~I~g~~v~v~V~~--------- 131 (498) .+|++.+.+... ....+.+.++.... ..++..+...++-..+.+.. T Consensus 77 ~~~v~R~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~i~~~~~~~~~~~~ta~~~~~a~~ 156 (666) T protein:vir:80 77 DLRVVRVLNKEKAKNATALAGNIEFEITNEGSNYEVGDTIKIKHNRQDIETAGKVTKVDGDGKVKGVFIPTGKIIAHAKA 156 (666) T ss_pred eEEEEEecCccccccccccccceeEEEeeccccccccccccccccCcccccCcceEEEeecceeeeeecchhhhcccccc Confidence 899999965321 11112222221100 01111111111100000000 Q ss_pred -CCCH-------HHH---------HHHHHHHHhcCCCc-eEEEee--------------ccceEEEeeccCcccccceeE Q lcl|Aclame:pro 132 -GDNV-------TTI---------ASSIQDAINAVPTL-PFTASS--------------SAGVVTLTARHKGLCGNEIPV 179 (498) Q Consensus 132 -gdta-------a~i---------A~~l~~aIn~~~~l-pVtA~~--------------~~~~VtlTAk~kG~~gN~i~l 179 (498) +... .++ +-.+..-++..... +....+ ..+.-.+.++..|..|+.+.+ T Consensus 157 ~~~~~~v~~~~~~~~~~~~~~~~~a~~V~~~~~~~~~~~~~~~~a~~~~t~~~~~~~~~~~~~~a~~a~~~g~~g~~l~v 236 (666) T protein:vir:80 157 IGVYPELDGDWTAEFTSSSGNGSAALSVTKIVTDSGLLLTDLETSRANITNQTFLTKLQKYDMPAVSAIYAGEIGNSLEV 236 (666) T ss_pred ccccceeeccceeeeccccccceeeeeeeeeecCCccceeeeccccccccccccccccccccchhhhhhcccccccceee Confidence 0000 000 00000000000000 000000 000000112222233332222 Q ss_pred EEEec--------------------------------------------------------------------cc---Cc Q lcl|Aclame:pro 180 SLNYY--------------------------------------------------------------------GF---GG 188 (498) Q Consensus 180 ~~~~~--------------------------------------------------------------------~~---~~ 188 (498) .+.-. .. .+ T Consensus 237 ~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~ 316 (666) T protein:vir:80 237 EILARSAFKNTAPDLTMYPYGGERTAARNLIPYAPQNDNQYAFIVRRDGVVVESYVLSTLKGDKDVYGNSIYMDDFFGRG 316 (666) T ss_pred eeccccccccccccceeeeccccccccceeeeeccccccceeeEeccCCccceeeecccccccccccchhhhhhhhhccc Confidence 11000 00 00 Q ss_pred --------ccccccceeeeecccCCCcC----------cc----h---hhhHHHhhccCcceEEEecCC------ChHHH Q lcl|Aclame:pro 189 --------GEVLPAGVQIAVATGTAGTG----------AP----V---LTGAVAAMADEPFDYIGLPFN------DTASV 237 (498) Q Consensus 189 --------ge~~p~Glt~tit~~agGag----------~p----D---~~~alaalg~~~~~~I~~p~t------D~a~l 237 (498) .+..+.+.+. ...+.||.. .. + ....++......+++++.|.- +.+-. T Consensus 317 ~~~~~~~~~~~~~~~~~~-~~~~~~g~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~l~~p~~~~~~~~~~~v~ 395 (666) T protein:vir:80 317 SSQYIYATAQGWVDGFSG-IISLAGGVSANEATTGGVGADPFIGAMMQGWGLFAERESIHVNLLIAGACAGEGDAFSTVQ 395 (666) T ss_pred cceeeeecccccccccce-EEEecCCCCcccccccccccccccccchhhhhhhhhhcccccceEeecCcCCcccchHHHH Confidence 0000011110 011222211 00 1 111223334456778877643 12333 Q ss_pred HHHHHHHhhhhhhhhhhhh-eeeEEEEeccCCHHHHHhhhhc----------cCcceEEEEecC--------C-CCCCcH Q lcl|Aclame:pro 238 NTLVTEMNDTSGRWSYARQ-LYGHVYTAKTGTLSELVNAGDQ----------FNQQHITLAGYE--------K-ETQTPA 297 (498) Q Consensus 238 ~al~~~l~~~s~r~~~~~q-~~g~~~~~~~gt~~~~~t~g~~----------~N~~~~t~~~~~--------~-~~~~p~ 297 (498) .++.+|.+....++.-... +..++-....-++.++..+-.. ++|.|..+.+.. + ....|+ T Consensus 396 ~~~~~~~~~~~~~~~~~~~~~~~~vd~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~l~~p~~~~~d~~~~~~~~~p~ 475 (666) T protein:vir:80 396 KHAVSIGDERQDCLVMVSPPRSTVVNIPVTTAIDNLIAWREGSGNYNENNMNINTTYAVIDGNYKYQYDKYNDVNRWVPL 475 (666) T ss_pred HHHHHHHHhhcceEEEeecceeEEeecCCCCCHHHHHHHHHhcccchhhhcccCcceEEEEcCceEEecccCCceeEech Confidence 4455554332222211110 1111111223466677666543 467776664321 1 111354 Q ss_pred HHHHHHHHHHhhhhhccCcc-ccccc--eEEeccccCCCccccChHHHHHHHhCCeeEEEEc--CCeEEEEeeeeeeeec Q lcl|Aclame:pro 298 DELAASRTARAAVFIRNDPA-RPTQT--GELVGMLPAPKGKRFTMTEQQTLLSHGVATAYVE--SGVLRIQRDVTTYRKN 372 (498) Q Consensus 298 ~~~AAa~~a~~a~~l~~DPA-rpl~t--l~L~Gl~~p~~~~r~~~~er~~lL~~Gist~~v~--~G~v~IeR~ITTY~~n 372 (498) ....|.+.|+.-. +..|- .|-+. ..+.|+. +..-+++..|++.|..+||.++.-- .| .++--..|. T Consensus 476 sg~~AGl~Ar~D~--~~g~~~sPan~~~~~i~g~~--~~~~~~~~~~~~~Ln~~gIn~i~~~~g~G-~~~wG~rT~---- 546 (666) T protein:vir:80 476 AADIAGLCARTDA--VSQPWMSPAGYNRGQIMNVV--KLAIEPRKAHRDRLYQAAINPVIGAGGEG-FILMGDKTA---- 546 (666) T ss_pred HHHHHHHHHHHhh--cCCceEccCCeecceeeccc--cceeecChhHHHhhhhCCeeEEEEeCCCe-EEEEccccC---- Confidence 4444444343321 22331 13222 1344443 3456788999999999999998653 35 555555554 Q ss_pred CCCCCCchhhhhhhHHHHHHHHHHHHHHHhhhcCCceeccCCCCcCCCcccccHHHHHHHHHHHHHHHhhcccccchhhh Q lcl|Aclame:pro 373 AYGVADNSYLDSETLHTSAYVLRKLKSVITSKYGRHKLASDGTRFGPGQAIVTPAVIKGELLATYRQLERAGIVENYELF 452 (498) Q Consensus 373 ~~G~~D~s~ldi~t~~tl~yv~~~~r~~~~~~~~r~kla~dg~~~~~g~~ivTp~~ikaeli~~~~~le~~given~~~~ 452 (498) .+ .|..|+.|.+.|..+|+.+.++.... .|-.+.+... +-+.||..+=+.+++|.++|.+..|. T Consensus 547 -~~-~~s~~~~i~vRRl~~~i~~si~~~~~-~~v~epn~~~-----------l~~~i~~~i~~~L~~l~~~gal~g~~-- 610 (666) T protein:vir:80 547 -TT-VPSPFDRINVRRLFNMLKKNIGDSSK-YKLFENNDNF-----------TRASFRMEVSQYLSTIRSLGGIYDFR-- 610 (666) T ss_pred -CC-CCcccceeehhhHHHHHHHHHHHHHH-HhccCCCCHH-----------HHHHHHHHHHHHHHHHHhcCceeeeE-- Confidence 11 23469999999999999999998665 4444433221 44778999999999999999999853 Q ss_pred cCeEEEE--EcCC---CCeEEEEEeeeEEecCeEEEeeeeeeEEEecccCC Q lcl|Aclame:pro 453 KQYLVVE--RDAS---VPNRLNTLFPPDYVNQLRVFAVVNQFRLQYSEESA 498 (498) Q Consensus 453 ~~~lvVe--rd~~---d~nRvn~~~p~~~vn~l~v~A~~~~f~lq~~~~~~ 498 (498) +.+- .|++ +.+|+.+.+-.......+-| .|++|.-..-+ T Consensus 611 ---V~~d~~~nt~~di~~G~~~~~i~~~P~~Pae~I----~~~~~~~~~~~ 654 (666) T protein:vir:80 611 ---VQCDTTNNTPDVIDRNEFVASMFIKPAKSINYI----MLNFTAVATGS 654 (666) T ss_pred ---EEEcCCCCCHHHhhCCeEEEEEEEEecCCcceE----EEEEEEeecCc Confidence 2332 2222 35788888777777766544 45555332222 No 40 >protein:vir:6079 Length: 396 # NCBI annotation: gpFI # Family: family:all:115 # MgeID: mge:126 # MgeName: WPhi # Cross-refs: genbank:acc:NP_878220;genbank:gi:33438919;genbank:GeneID:1457754 Probab=99.39 E-value=6.7e-13 Score=87.31 Aligned_cols=360 Identities=16% Similarity=0.135 Sum_probs=194.1 Q ss_pred CccchhhcCcccccCeEEEEEecCCCCCC--CCCccEEEEEecCC--CCccccceeEEecChHHHHHhhCcCcHHHHHHH Q lcl|Aclame:pro 1 MTISFNTIPSNTLVPLFYAEMDNQAANTA--QDSGASLLIGHANN--GAEIVANSLVLMPSADYARQICGAGSQLARMVE 76 (498) Q Consensus 1 M~i~f~~Ip~~~rvPg~y~E~dns~a~~~--~~~~~vLliGq~~~--~g~~~~~~~~~v~s~~~A~~~fG~GS~l~~M~~ 76 (498) |+ . ..||+|+|-.++.+.+. ....-..+||-.-. .+..+.++|+++.|..++...||.++.++..++ T Consensus 1 m~-------~--~~~Gv~v~e~~~~~~~v~~~~tav~~fvGta~~~~~~~~p~~~p~~v~s~~~~~~~~g~~~tl~~a~~ 71 (396) T protein:vir:60 1 MS-------D--YHHGVQVLEINEGTRVISTVSTAIVGMVCTASDADAEIFPLNKPVLITNVQSAIAKAGKKGTLAASLQ 71 (396) T ss_pred CC-------C--CCCCeEEEEcCCCcccccccCceeEEEEecccccccccccCccCeEeechHHHHHhhcCcchhHHHHH Confidence 43 3 35999999766655322 23344567774422 245567899999999999999999999988888 Q ss_pred HHHHhCCCceEEEEEecCCccceeEEEEEEeeeccCCcEEEEEEccEEEEEEeecCCCHHHHHHHHHHHHhcCCCceEEE Q lcl|Aclame:pro 77 AYRQTDPFGELYVIAVPEATGAAATVTLTVTGEATESGTVNVYVGRTRVQAPVTNGDNVTTIASSIQDAINAVPTLPFTA 156 (498) Q Consensus 77 a~~~~n~~~~l~~i~l~d~ag~aatg~ititgtat~~G~l~l~I~g~~v~v~V~~gdtaa~iA~~l~~aIn~~~~lpVtA 156 (498) ++..+-. ...+++.+............ .++. T Consensus 72 ~~~~~gg-~~~~vv~~~~~~~~~~~~~~------------------------------------------------~~~~ 102 (396) T protein:vir:60 72 AIADQSK-PVTVVVRVEDGTGEDEETKL------------------------------------------------AQTV 102 (396) T ss_pred HHhhccC-ceEEEEeccccccccccccc------------------------------------------------cccc Confidence 8864422 12222222211000000000 0000 Q ss_pred eeccceEEEeeccCcccccceeEEEEecccCcccccccceeeeecccCCCcCcchhhhHHHhhc---cCcceEEEec-CC Q lcl|Aclame:pro 157 SSSAGVVTLTARHKGLCGNEIPVSLNYYGFGGGEVLPAGVQIAVATGTAGTGAPVLTGAVAAMA---DEPFDYIGLP-FN 232 (498) Q Consensus 157 ~~~~~~VtlTAk~kG~~gN~i~l~~~~~~~~~ge~~p~Glt~tit~~agGag~pD~~~alaalg---~~~~~~I~~p-~t 232 (498) ..-.+.+.-+.+.+|. .+|.... ....++++.| ++ T Consensus 103 ~~~~~~~d~~~~~tg~-----------------------------------------~al~~~~~~~~~~~~il~ap~~~ 141 (396) T protein:vir:60 103 SNIIGTTDENGQYTGL-----------------------------------------KALLAAESVTGVKPRILGVPGLD 141 (396) T ss_pred ccccccccccccccch-----------------------------------------hhhhhcccceeeeeeeccccccc Confidence 0000000000111110 0011000 1112233333 22 Q ss_pred ChHHHHHHHHHHhhhhhhhhhhhheeeEEEEeccCCHHHHHhhhhccCcceEEEEecCC---------CCCCcHHHHHHH Q lcl|Aclame:pro 233 DTASVNTLVTEMNDTSGRWSYARQLYGHVYTAKTGTLSELVNAGDQFNQQHITLAGYEK---------ETQTPADELAAS 303 (498) Q Consensus 233 D~a~l~al~~~l~~~s~r~~~~~q~~g~~~~~~~gt~~~~~t~g~~~N~~~~t~~~~~~---------~~~~p~~~~AAa 303 (498) +.....+|..+.+. + ...+..-.....+++++.++-...|+.+..+.+... ....|+. +. T Consensus 142 ~~~v~~al~~~~~~----~----~~~~i~d~p~~~~~~~a~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s---~~ 210 (396) T protein:vir:60 142 TKEVAVALASVCQK----L----RAFGYISAWGCKTISEVKAYRQNFSQRELMVIWPDFLAWDTVASTTATAYAT---AR 210 (396) T ss_pred cHHHHHHHHHHhcc----C----CeEEEEeCCCCCCHHHHHHHHhhcCCceEEEEeCceeeecccCCceeEEchh---HH Confidence 33333444444321 1 122222222334777777777777887776644211 0112432 23 Q ss_pred HHHHhhhh-hccCcc-ccccceEEeccccCCC----ccccChHHHHHHHhCCeeEEEEcCCeEEEEeeeeeeeecCCCCC Q lcl|Aclame:pro 304 RTARAAVF-IRNDPA-RPTQTGELVGMLPAPK----GKRFTMTEQQTLLSHGVATAYVESGVLRIQRDVTTYRKNAYGVA 377 (498) Q Consensus 304 ~~a~~a~~-l~~DPA-rpl~tl~L~Gl~~p~~----~~r~~~~er~~lL~~Gist~~v~~G~v~IeR~ITTY~~n~~G~~ 377 (498) +|++.|.- .+..|- .|-| ..|.|+.-+.. ...++..|++.|..+||.++.-+.| .++--..|. .. T Consensus 211 ~AG~~a~~d~~~g~~~spaN-~~l~gi~~~~~~~~~~~~~~~~~~~~Ln~~gI~~~~~~~G-~~~wG~rT~-------~~ 281 (396) T protein:vir:60 211 ALGLRAKIDQEQGWHKTLSN-VGVNGVTGISASVFWDLQESGTDADLLNESGVTTLIRRDG-FRFWGNRTC-------SD 281 (396) T ss_pred HHHHHHHhhhccCcEeCcCC-ceecceeeceeecccccCCCcchhhhhhhcCcEEEEcCCC-EEEEccccc-------CC Confidence 33333321 122221 1332 34566654433 3345778999999999999865677 444455553 23 Q ss_pred CchhhhhhhHHHHHHHHHHHHHHHhhhcCCceeccCCCCcCCCcccccHHHHHHHHHHHHHHHhhcccccchhhhcCeEE Q lcl|Aclame:pro 378 DNSYLDSETLHTSAYVLRKLKSVITSKYGRHKLASDGTRFGPGQAIVTPAVIKGELLATYRQLERAGIVENYELFKQYLV 457 (498) Q Consensus 378 D~s~ldi~t~~tl~yv~~~~r~~~~~~~~r~kla~dg~~~~~g~~ivTp~~ikaeli~~~~~le~~given~~~~~~~lv 457 (498) |+.|+.|.+.|+.+|+.+.++..+. .|-.+.+... |-+.++..+-+.+++|..+|.+..++.+- . T Consensus 282 d~~~~~i~~rR~~~~i~~~i~~~~~-~~v~e~n~~~-----------~~~~i~~~i~~~l~~l~~~gal~g~~~~~---d 346 (396) T protein:vir:60 282 DPLFLFENYTRTAQVLADTMAEAHM-WAVDKPITAT-----------LIRDIVDGINAKFRELKTNGYIVDATCWF---S 346 (396) T ss_pred CcccceeehhhHHHHHHHHHHHHHH-HhccCCCCHH-----------HHHHHHHHHHHHHHHHHhCCceeceEEEE---e Confidence 7889999999999999999999775 4554443332 55788999999999999999999875432 2 Q ss_pred EEEcCC---CCeEEEEEeeeEEecCeEEEeeeeeeEEEecccCC Q lcl|Aclame:pro 458 VERDAS---VPNRLNTLFPPDYVNQLRVFAVVNQFRLQYSEESA 498 (498) Q Consensus 458 Verd~~---d~nRvn~~~p~~~vn~l~v~A~~~~f~lq~~~~~~ 498 (498) -+.|.. +.+|+.+.+-...+..++ .|.|+++++.+-- T Consensus 347 ~~~nt~~~i~~G~~~~~i~~~p~~pae----~I~~~~~~~~~~~ 386 (396) T protein:vir:60 347 EESNDAETLKAGKLYIDYDYTPVPPLE----NLTLRQRITDKYL 386 (396) T ss_pred cCCCCHHHhhCCEEEEEEEEEecCCcc----eEEEEEEEchHHH Confidence 223322 247777777777776655 5666666665522 No 41 >protein:vir:98553 Length: 395 # NCBI annotation: gp21 # Family: family:all:115 # MgeID: mge:1533 # MgeName: PSP3 # Cross-refs: genbank:acc:NP_958076;genbank:gi:41057373;genbank:GeneID:2744224 Probab=99.39 E-value=5.5e-13 Score=87.80 Aligned_cols=355 Identities=15% Similarity=0.123 Sum_probs=195.3 Q ss_pred cCcccccCeEEEEEecCCCCCC--CCCccEEEEEecCC--CCccccceeEEecChHHHHHhhCcCcHHHHHHHHHHHhCC Q lcl|Aclame:pro 8 IPSNTLVPLFYAEMDNQAANTA--QDSGASLLIGHANN--GAEIVANSLVLMPSADYARQICGAGSQLARMVEAYRQTDP 83 (498) Q Consensus 8 Ip~~~rvPg~y~E~dns~a~~~--~~~~~vLliGq~~~--~g~~~~~~~~~v~s~~~A~~~fG~GS~l~~M~~a~~~~n~ 83 (498) ++. ..||+|+|-.++.+... ....-+.+||..-. .+..+.++|+.+.|..+....||.++.++..+..+..+. T Consensus 1 m~~--~~~GV~v~e~~~g~~~v~~v~tav~~~vgta~~~~~~~~p~~~pv~v~s~~~~~~~~g~~~tl~~al~~~~~~~- 77 (395) T protein:vir:98 1 MSD--FHHGTQVIEINDGTRVISTVATAVVGMVCTASDADATLFPLNEPVLITNVQSAIAKAGKKGTLAASLQAIADQS- 77 (395) T ss_pred CCC--CCCCeEEEEcCCCcccccccCcceEEEEeeccCCCccccccccceEeechHHhHhhcccccchhhHHHHHhhcc- Confidence 333 36999998766555322 22334567775432 233356899999999999999999888777776665331 Q ss_pred CceEEEEEecCCccceeEEEEEEeeeccCCcEEEEEEccEEEEEEeecCCCHHHHHHHHHHHHhcCCCceEEEeeccceE Q lcl|Aclame:pro 84 FGELYVIAVPEATGAAATVTLTVTGEATESGTVNVYVGRTRVQAPVTNGDNVTTIASSIQDAINAVPTLPFTASSSAGVV 163 (498) Q Consensus 84 ~~~l~~i~l~d~ag~aatg~ititgtat~~G~l~l~I~g~~v~v~V~~gdtaa~iA~~l~~aIn~~~~lpVtA~~~~~~V 163 (498) -...+++.+....... .. ..+.++ T Consensus 78 ~~~~~vv~~~~~~~~~------------~~------------------------------------~~~a~~-------- 101 (395) T protein:vir:98 78 KPVTVVVRVEDGTGDD------------EE------------------------------------AALAQT-------- 101 (395) T ss_pred CceEEEeecccccccc------------cc------------------------------------cccccc-------- Confidence 1111221111100000 00 000000 Q ss_pred EEeeccCcccccceeEEEEecccCcccccccceeeeecccCCCcCcchhhhHHHhhc------cCcceEEEec-CCChHH Q lcl|Aclame:pro 164 TLTARHKGLCGNEIPVSLNYYGFGGGEVLPAGVQIAVATGTAGTGAPVLTGAVAAMA------DEPFDYIGLP-FNDTAS 236 (498) Q Consensus 164 tlTAk~kG~~gN~i~l~~~~~~~~~ge~~p~Glt~tit~~agGag~pD~~~alaalg------~~~~~~I~~p-~tD~a~ 236 (498) .....|+.......+.+.++. .....++++| |++... T Consensus 102 ------------------------------------~~~i~g~~~~~~~~Tgl~al~~~~~~~~~~p~il~ap~~~~~~v 145 (395) T protein:vir:98 102 ------------------------------------VSNIIGGTDENGKYTGIKALLTAQAVTGVKPRILGVPGLDTKEV 145 (395) T ss_pred ------------------------------------ccccccccccccchhHHHHHhhhhhhhccchhhcccccccccHH Confidence 000001100111111221111 1233445554 444455 Q ss_pred HHHHHHHHhhhhhhhhhhhheeeEEEEeccCCHHHHHhhhhccCcceEEEEecCCC---------CCCcHHHHHHHHHHH Q lcl|Aclame:pro 237 VNTLVTEMNDTSGRWSYARQLYGHVYTAKTGTLSELVNAGDQFNQQHITLAGYEKE---------TQTPADELAASRTAR 307 (498) Q Consensus 237 l~al~~~l~~~s~r~~~~~q~~g~~~~~~~gt~~~~~t~g~~~N~~~~t~~~~~~~---------~~~p~~~~AAa~~a~ 307 (498) ..++..+.+ ++ ...++.-.....++.++..+-...++.|..+.+.... ...|+ ++.+|+. T Consensus 146 ~~al~~~~~----~~----~~~~~~d~p~~~t~~~a~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~---s~~~AG~ 214 (395) T protein:vir:98 146 AVALASAAI----KL----RAFAYVSAWGCKTISEAMEYRKNFSQRELMVIWPDFLAWDTVKNTTATAYA---TARALGL 214 (395) T ss_pred HHHHHHHhh----hc----CcEEEEEcCCCCCHHHHHHHHhccCCceEEEEecceeEecccCCceeeech---HHHHHHH Confidence 556665543 22 2334443334448889998988889988776643111 11243 2333333 Q ss_pred hhhhh-ccCcc-ccccceEEeccccCCC----ccccChHHHHHHHhCCeeEEEEcCCeEEEEeeeeeeeecCCCCCCchh Q lcl|Aclame:pro 308 AAVFI-RNDPA-RPTQTGELVGMLPAPK----GKRFTMTEQQTLLSHGVATAYVESGVLRIQRDVTTYRKNAYGVADNSY 381 (498) Q Consensus 308 ~a~~l-~~DPA-rpl~tl~L~Gl~~p~~----~~r~~~~er~~lL~~Gist~~v~~G~v~IeR~ITTY~~n~~G~~D~s~ 381 (498) .|..- +..|- .|-+ ..|.|+.-+.. ....+..|++.|..+||.++.-+.| .++--..|. ..|+.| T Consensus 215 ~a~~d~~~g~~~spaN-~~i~gi~~~~~~~~~~~~~~~~~~~~Ln~~gI~~~~~~~G-~~~wG~rT~-------s~d~~~ 285 (395) T protein:vir:98 215 RAYIDQTVGWHKTLSN-VGVQGVTGISASVFWDLQASGTDADLLNEAGVTTLVRKDG-FRFWGNRTC-------SDDPLF 285 (395) T ss_pred HHHhhcccCcEeccCC-ceeecccccceecccccCCCcchHHhhhhcCcEEEEcCCC-EEEEccccc-------CCCccc Confidence 33211 12231 1322 34556644332 2344688999999999999966677 444444443 247889 Q ss_pred hhhhhHHHHHHHHHHHHHHHhhhcCCceeccCCCCcCCCcccccHHHHHHHHHHHHHHHhhcccccchhhhcCeEEEEEc Q lcl|Aclame:pro 382 LDSETLHTSAYVLRKLKSVITSKYGRHKLASDGTRFGPGQAIVTPAVIKGELLATYRQLERAGIVENYELFKQYLVVERD 461 (498) Q Consensus 382 ldi~t~~tl~yv~~~~r~~~~~~~~r~kla~dg~~~~~g~~ivTp~~ikaeli~~~~~le~~given~~~~~~~lvVerd 461 (498) +.|.+.|+.+|+.+.++.... .|-.+.+... |-+.|+..+-..+++|..+|.+..++.+ +.++ T Consensus 286 ~~i~~rR~~~~i~~~i~~~~~-~~v~e~~~~~-----------~~~~i~~~i~~~L~~l~~~g~l~g~~v~-----~d~~ 348 (395) T protein:vir:98 286 LFENYTRTAQVLADTMAEAHM-WAVDKPITAT-----------LIRDIVDGINAKFRELKSNGYIVEGKCW-----FDEE 348 (395) T ss_pred ceeehhhHHHHHHHHHHHHHH-HhccCCCCHH-----------HHHHHHHHHHHHHHHHHhCCceeceEEE-----EecC Confidence 999999999999999999775 4544433222 4578899999999999999999987532 3222 Q ss_pred CC-----CCeEEEEEeeeEEecCeEEEeeeeeeEEEecccCC Q lcl|Aclame:pro 462 AS-----VPNRLNTLFPPDYVNQLRVFAVVNQFRLQYSEESA 498 (498) Q Consensus 462 ~~-----d~nRvn~~~p~~~vn~l~v~A~~~~f~lq~~~~~~ 498 (498) .+ +.+|+.+.+-......++ .|.|+++++.+-- T Consensus 349 ~nt~~~i~~G~~~~~i~~~p~~p~e----~I~~~~~~~~~~~ 386 (395) T protein:vir:98 349 SNDKETLKAGKLYIDYDYTPVPPLE----SLTLRQRITDKYL 386 (395) T ss_pred CCCHHHhhCCeEEEEEEEEecCCcc----eEEEEEEEchHHH Confidence 21 125677777666666644 4566666655442 No 42 >protein:vir:2035 Length: 396 # NCBI annotation: gpFI # Family: family:all:115 # MgeID: mge:315 # MgeName: P2 # Cross-refs: genbank:acc:NP_046778;genbank:gi:9630349;genbank:GeneID:1261516 Probab=99.38 E-value=8.4e-13 Score=86.79 Aligned_cols=360 Identities=16% Similarity=0.147 Sum_probs=193.4 Q ss_pred CccchhhcCcccccCeEEEEEecCCCCCCCC--CccEEEEEecC--CCCccccceeEEecChHHHHHhhCcCcHHHHHHH Q lcl|Aclame:pro 1 MTISFNTIPSNTLVPLFYAEMDNQAANTAQD--SGASLLIGHAN--NGAEIVANSLVLMPSADYARQICGAGSQLARMVE 76 (498) Q Consensus 1 M~i~f~~Ip~~~rvPg~y~E~dns~a~~~~~--~~~vLliGq~~--~~g~~~~~~~~~v~s~~~A~~~fG~GS~l~~M~~ 76 (498) |+ . ..||+|+|-.++.+..... .--+-+||..- .++..+.++|++++|..+....||.++.++..++ T Consensus 1 m~-------~--~~~GV~v~e~~~g~~~i~~v~tav~~~vg~a~~a~~~~~~l~~pvlvts~~~~~~~~g~~~tL~~al~ 71 (396) T protein:vir:20 1 MS-------D--YHHGVQVLEINEGTRVISTVSTAIVGMVCTASDADAETFPLNKPVLITNVQSAISKAGKKGTLAASLQ 71 (396) T ss_pred CC-------C--CCCCeEEEEcCCCcceeeecCCceeEEEeeeccCCCccccCccCEEeechHHHHhhcccccchhhhhh Confidence 44 2 3599999987766632221 34456777542 2235567899999999999999999988887777 Q ss_pred HHHHhCCCceEEEEEecCCccceeEEEEEEeeeccCCcEEEEEEccEEEEEEeecCCCHHHHHHHHHHHHhcCCCceEEE Q lcl|Aclame:pro 77 AYRQTDPFGELYVIAVPEATGAAATVTLTVTGEATESGTVNVYVGRTRVQAPVTNGDNVTTIASSIQDAINAVPTLPFTA 156 (498) Q Consensus 77 a~~~~n~~~~l~~i~l~d~ag~aatg~ititgtat~~G~l~l~I~g~~v~v~V~~gdtaa~iA~~l~~aIn~~~~lpVtA 156 (498) +++.+. ....+++.+.......... ...++. T Consensus 72 ~~~~ng-g~~~~v~~~~~~~~~~~~~------------------------------------------------~~a~t~ 102 (396) T protein:vir:20 72 AIADQS-KPVTVVMRVEDGTGDDEET------------------------------------------------KLAQTV 102 (396) T ss_pred hhhccC-ceeEEEEeccccccccccc------------------------------------------------cccccc Confidence 665332 2222222221111000000 000000 Q ss_pred eeccceEEEeeccCcccccceeEEEEecccCcccccccceeeeecccCCCcCcchhhhHHHhhccCcceEEEec-CCChH Q lcl|Aclame:pro 157 SSSAGVVTLTARHKGLCGNEIPVSLNYYGFGGGEVLPAGVQIAVATGTAGTGAPVLTGAVAAMADEPFDYIGLP-FNDTA 235 (498) Q Consensus 157 ~~~~~~VtlTAk~kG~~gN~i~l~~~~~~~~~ge~~p~Glt~tit~~agGag~pD~~~alaalg~~~~~~I~~p-~tD~a 235 (498) ..-.+.+.-+.+.+|... + ..+.. +.....+++..| +.+.. T Consensus 103 ~~~~~~~~~~~~~tg~~a-----------------l--------------------~~~~~-~~~~~p~i~~ap~~~~~~ 144 (396) T protein:vir:20 103 SNIIGTTDENGQYTGLKA-----------------M--------------------LAAES-VTGVKPRILGVPGLDTKE 144 (396) T ss_pred cccccccccccccchhhh-----------------h--------------------hhhcc-ccccchhhhhhhhhccHH Confidence 000000000000111000 0 00000 000011112222 22233 Q ss_pred HHHHHHHHHhhhhhhhhhhhheeeEEEEeccCCHHHHHhhhhccCcceEEEEecCCC---------CCCcHHHHHHHHHH Q lcl|Aclame:pro 236 SVNTLVTEMNDTSGRWSYARQLYGHVYTAKTGTLSELVNAGDQFNQQHITLAGYEKE---------TQTPADELAASRTA 306 (498) Q Consensus 236 ~l~al~~~l~~~s~r~~~~~q~~g~~~~~~~gt~~~~~t~g~~~N~~~~t~~~~~~~---------~~~p~~~~AAa~~a 306 (498) ...+|.++.+. . ...+..-.....+++++.++-...|+.|..+.+.... ...|+ ++.+|+ T Consensus 145 v~~al~~~~~~----~----~~~~~iD~p~~~~~~~a~~~r~~~~s~~~~~~~P~~~~~d~~~~~~~~~p~---s~~~Ag 213 (396) T protein:vir:20 145 VAVALASVCQK----L----RAFGYISAWGCKTISEVKAYRQNFSQRELMVIWPDFLAWDTVTSTTATAYA---TARALG 213 (396) T ss_pred HHHHHHHHHhc----C----CcEEEEecCCCCCHHHHHHHhhCCCCceEEEEcCccccccCcCCcceeech---hHHHHH Confidence 34444444321 1 1223333333447888888888888888876543111 11233 233333 Q ss_pred HhhhhhccCccc-----cccceEEeccccCCC----ccccChHHHHHHHhCCeeEEEEcCCeEEEEeeeeeeeecCCCCC Q lcl|Aclame:pro 307 RAAVFIRNDPAR-----PTQTGELVGMLPAPK----GKRFTMTEQQTLLSHGVATAYVESGVLRIQRDVTTYRKNAYGVA 377 (498) Q Consensus 307 ~~a~~l~~DPAr-----pl~tl~L~Gl~~p~~----~~r~~~~er~~lL~~Gist~~v~~G~v~IeR~ITTY~~n~~G~~ 377 (498) ..| +.|..+ |-| ..|.|+..+.. ...++..|.+.|..+||.++.-++| .++--..|. .. T Consensus 214 ~~a---~~d~~~g~~~spaN-~~l~gi~~~~~~~~~~~~~~~~~~~~Ln~~gi~~~~~~~G-~~~wG~rT~-------s~ 281 (396) T protein:vir:20 214 LRA---KIDQEQGWHKTLSN-VGVNGVTGISASVFWDLQESGTDADLLNESGVTTLIRRDG-FRFWGNRTC-------SD 281 (396) T ss_pred HHH---HhhhhcCcEeccCC-ceeccceecceecccccCCCcchhhhhhhcCcEEEEcCCC-EEEEccccc-------CC Confidence 433 233211 333 25666654433 2345678999999999999866677 444344443 24 Q ss_pred CchhhhhhhHHHHHHHHHHHHHHHhhhcCCceeccCCCCcCCCcccccHHHHHHHHHHHHHHHhhcccccchhhhcCeEE Q lcl|Aclame:pro 378 DNSYLDSETLHTSAYVLRKLKSVITSKYGRHKLASDGTRFGPGQAIVTPAVIKGELLATYRQLERAGIVENYELFKQYLV 457 (498) Q Consensus 378 D~s~ldi~t~~tl~yv~~~~r~~~~~~~~r~kla~dg~~~~~g~~ivTp~~ikaeli~~~~~le~~given~~~~~~~lv 457 (498) |+.|+.|.+.|+.+|+.+.++..+. .|-.+.+... |-+.||..+=+.+++|..+|.+..+..+- . T Consensus 282 d~~~~~i~~rR~~~~i~~~~~~~~~-~~v~e~~~~~-----------~~~~i~~~i~~~L~~l~~~G~l~g~~v~~---d 346 (396) T protein:vir:20 282 DPLFLFENYTRTAQVVADTMAEAHM-WAVDKPITAT-----------LIRDIVDGINAKFRELKTNGYIVDATCWF---S 346 (396) T ss_pred CcccceeehhhHHHHHHHHHHHHHH-HhccCCCCHH-----------HHHHHHHHHHHHHHHHHhCcceeceEEEE---e Confidence 7899999999999999999999775 4544433222 45789999999999999999998864321 1 Q ss_pred EEEcCC---CCeEEEEEeeeEEecCeEEEeeeeeeEEEecccCC Q lcl|Aclame:pro 458 VERDAS---VPNRLNTLFPPDYVNQLRVFAVVNQFRLQYSEESA 498 (498) Q Consensus 458 Verd~~---d~nRvn~~~p~~~vn~l~v~A~~~~f~lq~~~~~~ 498 (498) -+.|+. +.+|+.+.+-......++ .|.|+++|+.+-- T Consensus 347 ~~~nt~~~i~~G~~~~~i~~~p~~p~e----~i~~~~~~~~~~~ 386 (396) T protein:vir:20 347 EESNDAETLKAGKLYIDYDYTPVPPLE----NLTLRQRITDKYL 386 (396) T ss_pred cCCCCHHHhhCCEEEEEEEEEecCCcc----eEEEEEEEchHHH Confidence 122222 237788877777777765 5566666554433 No 43 >protein:vir:79141 Length: 391 # NCBI annotation: major tail seath protein gpFI # Family: family:all:115 # MgeID: mge:1863 # MgeName: RSA1 # Cross-refs: genbank:acc:YP_001165276;genbank:gi:145708101;genbank:GeneID:5247152 Probab=99.37 E-value=4.9e-13 Score=88.07 Aligned_cols=353 Identities=14% Similarity=0.079 Sum_probs=184.0 Q ss_pred CccchhhcCcccccCeEEEEEecCCCCCC--CCCccEEEEEecC--CCCccccceeEEecChHHHHHhhCcCcHHHHHHH Q lcl|Aclame:pro 1 MTISFNTIPSNTLVPLFYAEMDNQAANTA--QDSGASLLIGHAN--NGAEIVANSLVLMPSADYARQICGAGSQLARMVE 76 (498) Q Consensus 1 M~i~f~~Ip~~~rvPg~y~E~dns~a~~~--~~~~~vLliGq~~--~~g~~~~~~~~~v~s~~~A~~~fG~GS~l~~M~~ 76 (498) |. ... .||+|+|=.++..... ....-+-+||-.- .+...+.++|+++.|..+....||.+.-++.-++ T Consensus 1 M~-------~~~-~pGv~v~e~~~~~~~i~~~~tav~~~vg~a~~a~~~~~p~n~pv~iss~~~~~~~~g~~gtl~~al~ 72 (391) T protein:vir:79 1 MP-------TDY-HHGVRVVELNDGTRPIRTIETAVAGIVCTADDADAATFPLDTPVLLTNPQAYIGKAGDKGTLAHTLD 72 (391) T ss_pred CC-------CCC-CCCeEEEECCCCcccccccCCceEEEEeecccccccccccccCEEeccHHHHHHhcCCccccchhhh Confidence 44 433 7999998655544211 2233445565322 1234456899999999999888885222211111 Q ss_pred HHHHhCCCceEEEEEecCCccceeEEEEEEeeeccCCcEEEEEEccEEEEEEeecCCCHHHHHHHHHHHHhcCCCceEEE Q lcl|Aclame:pro 77 AYRQTDPFGELYVIAVPEATGAAATVTLTVTGEATESGTVNVYVGRTRVQAPVTNGDNVTTIASSIQDAINAVPTLPFTA 156 (498) Q Consensus 77 a~~~~n~~~~l~~i~l~d~ag~aatg~ititgtat~~G~l~l~I~g~~v~v~V~~gdtaa~iA~~l~~aIn~~~~lpVtA 156 (498) .+.++ . ++..+.+.+...+ T Consensus 73 ~~~~~-g-------------------------------------g~~~~vv~~~~~~----------------------- 91 (391) T protein:vir:79 73 AITDQ-T-------------------------------------NPLTVVVRVAGGA----------------------- 91 (391) T ss_pred hhhcc-c-------------------------------------ccceeeecccccc----------------------- Confidence 11100 0 0000001000000 Q ss_pred eeccceEEEeeccCcccccceeEEEEecccCcccccccceeeeecccCCCcCcchhhhHHHhhccCc------ceEEEec Q lcl|Aclame:pro 157 SSSAGVVTLTARHKGLCGNEIPVSLNYYGFGGGEVLPAGVQIAVATGTAGTGAPVLTGAVAAMADEP------FDYIGLP 230 (498) Q Consensus 157 ~~~~~~VtlTAk~kG~~gN~i~l~~~~~~~~~ge~~p~Glt~tit~~agGag~pD~~~alaalg~~~------~~~I~~p 230 (498) +... +.....|+.+.....++++++.+.. ..++++| T Consensus 92 ------------------------------------~~~~--~~~~~~g~~~~~~~~tGl~~l~~~~~~~~~~p~~l~~p 133 (391) T protein:vir:79 92 ------------------------------------SEAE--TTSNLIGTTNAAGRYTGMKALLTARNRFGVAPRILAVP 133 (391) T ss_pred ------------------------------------cccc--ccccccccccchhhhHHHhhhhhhhhhhcccchhhcCC Confidence 0000 0001112222222222333322211 1233333 Q ss_pred CC-ChHHHHHHHHHHhhhhhhhhhhhheeeEEEEeccCCHHHHHhhhhccCcceEEEEecCC---------CCCCcHHHH Q lcl|Aclame:pro 231 FN-DTASVNTLVTEMNDTSGRWSYARQLYGHVYTAKTGTLSELVNAGDQFNQQHITLAGYEK---------ETQTPADEL 300 (498) Q Consensus 231 ~t-D~a~l~al~~~l~~~s~r~~~~~q~~g~~~~~~~gt~~~~~t~g~~~N~~~~t~~~~~~---------~~~~p~~~~ 300 (498) .. +.....++..+.+ ++ +..++.-.....+..++..+....++.+..+.+... ....|+. T Consensus 134 ~~~~~~v~~al~~~~~----~~----~~~ai~d~p~~~t~~~a~~~~~~~~s~~~a~~~P~~~~~d~~~~~~~~~p~s-- 203 (391) T protein:vir:79 134 GLDSLPVGTELVTIAQ----KL----RAFAYLSAYGCQTKEEAVAYRSNFGQREAMVMWPDFVGWDTAANAETTLWAT-- 203 (391) T ss_pred ccchhHHHHHHHHHHh----hc----CcEEEEECCCCCCHHHHHHHHhccCCceeEEecceeeeecCcCCceeeechH-- Confidence 32 2333334443332 22 122233333345678888888888888776543211 1112332 Q ss_pred HHHHHHHhhhh-hccCccccccceEEeccccCCC----ccccChHHHHHHHhCCeeEEEEcCCeEEEEeeeeeeeecCCC Q lcl|Aclame:pro 301 AASRTARAAVF-IRNDPARPTQTGELVGMLPAPK----GKRFTMTEQQTLLSHGVATAYVESGVLRIQRDVTTYRKNAYG 375 (498) Q Consensus 301 AAa~~a~~a~~-l~~DPArpl~tl~L~Gl~~p~~----~~r~~~~er~~lL~~Gist~~v~~G~v~IeR~ITTY~~n~~G 375 (498) +.+|+..|.- .+..|--.--...|.|+.-+.. ..+....|.+.|..+||.++.-..| .++--..|+ T Consensus 204 -~~~AG~~a~~D~~~g~~~spaN~~l~gi~~~~~~~~~~~~~~~~~~~~Ln~~~I~t~~~~~G-~~~wG~rT~------- 274 (391) T protein:vir:79 204 -ARAVGLRAKIDNDTGWHKTLSNVAVGGVTGLSRDVFWDLQDPATDAGYLNANEVTTLVHRDG-YRFWGSRTC------- 274 (391) T ss_pred -HHHHHHHHHhhhcccceeccCCceehhhhccccccccccccccchhhhhhhcCceEEECCCc-EEEEccccc------- Confidence 3333333311 1112311111235666654332 3344567889999999999865677 455555554 Q ss_pred CCCchhhhhhhHHHHHHHHHHHHHHHhhhcCCceeccCCCCcCCCcccccHHHHHHHHHHHHHHHhhcccccchhhhcCe Q lcl|Aclame:pro 376 VADNSYLDSETLHTSAYVLRKLKSVITSKYGRHKLASDGTRFGPGQAIVTPAVIKGELLATYRQLERAGIVENYELFKQY 455 (498) Q Consensus 376 ~~D~s~ldi~t~~tl~yv~~~~r~~~~~~~~r~kla~dg~~~~~g~~ivTp~~ikaeli~~~~~le~~given~~~~~~~ 455 (498) ..|+.|+.|.+.|+.+|+.+.++.... .|-.+++... |-+.|+..+=..+++|..+|.+..++.|-+. T Consensus 275 ~~d~~~~~i~~rR~~~~i~~~i~~~~~-~~v~epn~~~-----------~~~~i~~~i~~~l~~l~~~g~l~g~~v~~~~ 342 (391) T protein:vir:79 275 SADPLFAFENYTRTAQVLADTMAEAHM-WANDLPMTPT-----------LVRDLLEGINAKLRMLTRNGYLLGGAAWFDA 342 (391) T ss_pred CCCcccceeehhhHHHHHHHHHHHHHH-HhccCCCCHH-----------HHHHHHHHHHHHHHHHHhCCceeceEEEEec Confidence 237899999999999999999999775 5655554432 4477899999999999999999998654321 Q ss_pred EEEEEcCC---CCeEEEEEeeeEEecCeEEEeeeeeeEEEecccCC Q lcl|Aclame:pro 456 LVVERDAS---VPNRLNTLFPPDYVNQLRVFAVVNQFRLQYSEESA 498 (498) Q Consensus 456 lvVerd~~---d~nRvn~~~p~~~vn~l~v~A~~~~f~lq~~~~~~ 498 (498) +.|.+ +.+|+.+.+-...+-.+. .|.|+++++.+-- T Consensus 343 ---~~nt~~~i~~G~~~~~i~~~p~~p~e----~i~~~~~~~~~~~ 381 (391) T protein:vir:79 343 ---DANSKDTLKAGQLAIDYDYTPVPPLE----NLTFRQRITDRYL 381 (391) T ss_pred ---CCCCHHHhhCCEEEEEEEEEecCCcc----eEEEEEEEchHHH Confidence 22222 235777776666666644 5667777665543 No 44 >protein:vir:5663 Length: 671 # NCBI annotation: tail sheath protein # Family: family:all:661 # MgeID: mge:119 # MgeName: KVP40 # Cross-refs: genbank:acc:NP_899602;genbank:gi:34419589;genbank:GeneID:2545776 Probab=99.37 E-value=5.1e-11 Score=77.01 Aligned_cols=448 Identities=14% Similarity=0.073 Sum_probs=207.8 Q ss_pred cccccCeEEEEEec-CCCCCCCCCccEEEEEecCCCCccccceeEEecChHHHHHhhCc---CcHHHHHHHHHHHhCCCc Q lcl|Aclame:pro 10 SNTLVPLFYAEMDN-QAANTAQDSGASLLIGHANNGAEIVANSLVLMPSADYARQICGA---GSQLARMVEAYRQTDPFG 85 (498) Q Consensus 10 ~~~rvPg~y~E~dn-s~a~~~~~~~~vLliGq~~~~g~~~~~~~~~v~s~~~A~~~fG~---GS~l~~M~~a~~~~n~~~ 85 (498) -+...||+|+|--. +..=.........+||. ...++.++|++|+|..|-...||. +|-+..|++.|+. |--. T Consensus 1 ~~~~~Pgvyv~e~~~~~~i~~v~t~~~~fvG~---~~~Gp~~~p~~v~s~~~~~~~fG~~~~~~~~~~~v~~~f~-ngg~ 76 (671) T protein:vir:56 1 MTLLSPGIENKEINLASAIGRAATGRAAMVGK---FEWGPAYSITQVTSESDLVTIFGRPNDYTAASFMTANNFL-KYGN 76 (671) T ss_pred CceecCceEEEeecCcccccccCcccceEEec---ccCCCCccCEEcCCHHHHHHHcCCcCCCcchhHHHHHHHH-hcCC Confidence 45788999998442 22211222346677774 556788999999999999999997 7888899999985 4455 Q ss_pred eEEEEEecCCccceeE----E----------------EEEEeee--c--cCCcEEEEEEcc---E----------EEEEE Q lcl|Aclame:pro 86 ELYVIAVPEATGAAAT----V----------------TLTVTGE--A--TESGTVNVYVGR---T----------RVQAP 128 (498) Q Consensus 86 ~l~~i~l~d~ag~aat----g----------------~ititgt--a--t~~G~l~l~I~g---~----------~v~v~ 128 (498) .+|++.+.+....+++ . ++.++.. + ...+.+...-.+ . -+.+. T Consensus 77 ~~~vvrv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~v~~~ 156 (671) T protein:vir:56 77 DLRLVRICDATTAQNATPLYNAVEYTIGASNGCVVGDDITITYSGVGALTAKGKVLEVDAGNNNAASKIFLPSAEIVAAA 156 (671) T ss_pred eEEEEEecCccccccchhhccccccccccCcceeeceeeeeecCcccccccCcceeEEeeeccceeeeeeccceeEEEee Confidence 7999999764221111 0 1111110 0 011111100000 0 00000 Q ss_pred -----------eecCCCHHHHHH------------------------H---------HHHHHhc--------------CC Q lcl|Aclame:pro 129 -----------VTNGDNVTTIAS------------------------S---------IQDAINA--------------VP 150 (498) Q Consensus 129 -----------V~~gdtaa~iA~------------------------~---------l~~aIn~--------------~~ 150 (498) +..+.....++. . ....... .. T Consensus 157 ~~~~~~~~~~~~t~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~g~~g~ 236 (671) T protein:vir:56 157 KSDGNYPSVGTITLQPTQGDIALTNIEIIDTGSVYFPNIELAFDALTAIETEGGALKYADLIEKQGFPRLSARYVGDFGD 236 (671) T ss_pred eccccccccccccccccccceeeeeecccccceEEEeccccccccccccccccccccchhhhhcccccccccccccccCc Confidence 000000000000 0 0000000 00 Q ss_pred CceEEEeeccceEEEeeccCcccccceeEEEEecccCc--------------------------------------ccc- Q lcl|Aclame:pro 151 TLPFTASSSAGVVTLTARHKGLCGNEIPVSLNYYGFGG--------------------------------------GEV- 191 (498) Q Consensus 151 ~lpVtA~~~~~~VtlTAk~kG~~gN~i~l~~~~~~~~~--------------------------------------ge~- 191 (498) .+.|..........+++...|..+|.++......+... ++. T Consensus 237 ~~~v~v~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~~~~~~~~~~~~~~ 316 (671) T protein:vir:56 237 AISVEIINYADYQTAFAFAAGHTLGDIELPIYPDGGTRSINLSSYFTFGPSNSNQYAVIVRVSGEVEEAFIVSTNPGDKD 316 (671) T ss_pred ceEEEEecccccccccccccceeeeeccccccccccccccccceeecccccccccceeEEeecCccceeEEEeecccccc Confidence 00111100011122233333333333332211100000 000 Q ss_pred -----------cccceee-------------eecccCCC----cCcchhhhHHHhhccC-c--ceEEEecCCChHH---- Q lcl|Aclame:pro 192 -----------LPAGVQI-------------AVATGTAG----TGAPVLTGAVAAMADE-P--FDYIGLPFNDTAS---- 236 (498) Q Consensus 192 -----------~p~Glt~-------------tit~~agG----ag~pD~~~alaalg~~-~--~~~I~~p~tD~a~---- 236 (498) .+.|-.. +.....|| .+..++..++.++.+. . .++++.|...... T Consensus 317 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~d~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~ 396 (671) T protein:vir:56 317 VNGQSIFIDEYFENSGSAYITAIAEGWKTESGAYNFGGGSDANAGADDWMFGLDMLSDPEVLYTNLVIAGNAAAEEVSIA 396 (671) T ss_pred cchhhhhhhhhhcccCceEEEecCcccCCccccccccCccccccchhHHHHHHHhhhhccccceeEEEcCCCCCccchhH Confidence 0000000 00111232 2334556677777642 2 3455554322211 Q ss_pred HHHHHHHHhhhhhhhhhhhheeeEEEEe-c---------cCCHHHHHhhhh--------------ccCcceEEEEecC-- Q lcl|Aclame:pro 237 VNTLVTEMNDTSGRWSYARQLYGHVYTA-K---------TGTLSELVNAGD--------------QFNQQHITLAGYE-- 290 (498) Q Consensus 237 l~al~~~l~~~s~r~~~~~q~~g~~~~~-~---------~gt~~~~~t~g~--------------~~N~~~~t~~~~~-- 290 (498) .....+.+.... +. +....++.. . ..++.++..+-. .+++.+..+.+.. T Consensus 397 ~~~~~~~~~~~~---~~--~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~ 471 (671) T protein:vir:56 397 STVQKYAIDSVG---NV--RQDCVVFVSPPQSLIVNKQAGTAVANIQGWRTGIDPTNGQAVVDNLNVSTTYAVIDGNYKY 471 (671) T ss_pred HHHHHHHHHHHH---hh--cCCEEEEEeccccccccccccccHHHHHHHhhhccccchhhhhhhccCCcceEEEecCceE Confidence 111122222211 10 111222221 1 234444443332 3456666544321 Q ss_pred ------C-CCCCcHHHHHHHHHHHhhhhhccCccc-ccc-c-eEEeccccCCCccccChHHHHHHHhCCeeEEEEc--CC Q lcl|Aclame:pro 291 ------K-ETQTPADELAASRTARAAVFIRNDPAR-PTQ-T-GELVGMLPAPKGKRFTMTEQQTLLSHGVATAYVE--SG 358 (498) Q Consensus 291 ------~-~~~~p~~~~AAa~~a~~a~~l~~DPAr-pl~-t-l~L~Gl~~p~~~~r~~~~er~~lL~~Gist~~v~--~G 358 (498) + ....|+....|.+.|+.-. +..|-. |-+ . ..|.|+. ....+++..|++.|..+||.++..- .| T Consensus 472 ~~d~~~~~~~~~p~s~~~AGl~Ar~D~--~~g~~~span~~~~~i~g~~--~~~~~~~~~~~~~Ln~~gIn~i~~~~~~G 547 (671) T protein:vir:56 472 QYDKYNDRNRWVPLAGDIAGLCAYTDQ--VSQPWMSPAGFNRGQIKGVN--RLAVDLRRAHRDALYQIGINPVVGFAGQG 547 (671) T ss_pred EecccCCceeEechHHHHHHHHHHhhc--cCCcEECcCCceeccccccc--cceeecChhHHHHHhhCCceEEEEecCCe Confidence 1 1123443333333332221 122211 222 1 1244543 3355788999999999999998753 45 Q ss_pred eEEEEeeeeeeeecCCCCCCchhhhhhhHHHHHHHHHHHHHHHhhhcCCceeccCCCCcCCCcccccHHHHHHHHHHHHH Q lcl|Aclame:pro 359 VLRIQRDVTTYRKNAYGVADNSYLDSETLHTSAYVLRKLKSVITSKYGRHKLASDGTRFGPGQAIVTPAVIKGELLATYR 438 (498) Q Consensus 359 ~v~IeR~ITTY~~n~~G~~D~s~ldi~t~~tl~yv~~~~r~~~~~~~~r~kla~dg~~~~~g~~ivTp~~ikaeli~~~~ 438 (498) .++--.-|. . ..|..|+.|.+.|+.+|+.+.++.... .|-.+++... +-+.||..+-..++ T Consensus 548 -~~~wG~rT~-----~-~~~~~~~~i~vrR~~~~i~~si~~~~~-~~v~epn~~~-----------~~~~i~~~i~~fL~ 608 (671) T protein:vir:56 548 -FVLYGDKTA-----T-QQASAFDRINVRRLFNLLKKAISDAAK-YRLFELNDEF-----------TRSSFKSEIDAYLT 608 (671) T ss_pred -EEEEcceec-----C-CCCcccceEehhhHHHHHHHHHHHHHH-HhcCCCCCHH-----------HHHHHHHHHHHHHH Confidence 344333332 1 234579999999999999999998664 4554443221 55788999999999 Q ss_pred HHhhcccccchhhhcCeEEEEEcCC-----CCeEEEEEeeeEEecCeEEEeeeeeeEEEecccCC Q lcl|Aclame:pro 439 QLERAGIVENYELFKQYLVVERDAS-----VPNRLNTLFPPDYVNQLRVFAVVNQFRLQYSEESA 498 (498) Q Consensus 439 ~le~~given~~~~~~~lvVerd~~-----d~nRvn~~~p~~~vn~l~v~A~~~~f~lq~~~~~~ 498 (498) .|..+|.+..|. +.+.++.+ +.+|+.+.+-...+..++- |.|++|..-..+ T Consensus 609 ~l~~~gal~g~~-----v~~d~~~nt~~~i~~G~~~~~i~~~p~~Pae~----I~~~~~~~~~~~ 664 (671) T protein:vir:56 609 NIQDLGGVYDFR-----VVCDETNNPGSVIDRNEFVASIYVKPAKSINF----ITLNFVATSTDA 664 (671) T ss_pred HHHhCCceeeeE-----EEEcCCCCCHHHhhCCeEEEEEEEEecCCcce----EEEEEEEeecCc Confidence 999999999842 33332211 3457777777766666554 344554333333 No 45 >protein:vir:6594 Length: 666 # NCBI annotation: tail sheath protein # Family: family:all:661 # MgeID: mge:139 # MgeName: RB49 # Cross-refs: genbank:acc:NP_891725;genbank:gi:33620670;genbank:GeneID:1725344 Probab=99.37 E-value=3.4e-11 Score=78.01 Aligned_cols=452 Identities=12% Similarity=0.066 Sum_probs=214.0 Q ss_pred cccccCeEEEE-EecCCCCCCCCCccEEEEEecCCCCccccceeEEecChHHHHHhhCc---CcHHHHHHHHHHHhCCCc Q lcl|Aclame:pro 10 SNTLVPLFYAE-MDNQAANTAQDSGASLLIGHANNGAEIVANSLVLMPSADYARQICGA---GSQLARMVEAYRQTDPFG 85 (498) Q Consensus 10 ~~~rvPg~y~E-~dns~a~~~~~~~~vLliGq~~~~g~~~~~~~~~v~s~~~A~~~fG~---GS~l~~M~~a~~~~n~~~ 85 (498) -+...||+|+| +|-+..-.........+||. ...++.++|++|+|..|-...||. .+-+..|+++|+. |--. T Consensus 1 ~~~~~PgVyv~e~~~~~~i~~v~ts~~~fvG~---~~~Gp~~~p~~v~s~~~~~~~fG~~~~~~~~~~~~~~~f~-ngg~ 76 (666) T protein:vir:65 1 MTLLSPGFETKETTLSTTIVQSETGRAALVGK---FQWGPAFQIIQVTNEVELVNKFGQPDNNTADYFMSGANFL-QYGN 76 (666) T ss_pred CceecCceEEEEecCcccccccCcccceEEec---ccCCCCccCEEecCHHHHHHHcCCccccchhHHHHHHHHH-hcCc Confidence 45788999998 54333322233457788884 446688999999999999999995 5667788888885 5556 Q ss_pred eEEEEEecCCccc-eeE---E-----------------EEEEee--e-ccCCc-EEE----------------------- Q lcl|Aclame:pro 86 ELYVIAVPEATGA-AAT---V-----------------TLTVTG--E-ATESG-TVN----------------------- 117 (498) Q Consensus 86 ~l~~i~l~d~ag~-aat---g-----------------~ititg--t-at~~G-~l~----------------------- 117 (498) .+|++.+.+.... ++. + ++++.. . ...++ +.. T Consensus 77 ~~~vvrv~~~~~~~~~~~~~~~~~~~~~~~g~~~~~g~~~~V~~~~~~~~~~~~~~~~~~~~~~~g~~~~t~~~~~~~~~ 156 (666) T protein:vir:65 77 DLRVVRVLNKEKAKNATALAGNVEFEITNEGSNYEVGDTIKIKHNRQDIETAGKVTKVDGDGKVKGVFIPTGKIIAHAKA 156 (666) T ss_pred eEEEEEccCcccccccccccCceeeeEeeccccccccceEEEEecccccccccccccccccccccccccccceeeccccc Confidence 7999998553211 110 0 111110 0 00000 000 Q ss_pred ------EEEccEEEEEE-e--------ecCCCHH--------------HHHHHHHHHHhcCCCceEEEeec----c---- Q lcl|Aclame:pro 118 ------VYVGRTRVQAP-V--------TNGDNVT--------------TIASSIQDAINAVPTLPFTASSS----A---- 160 (498) Q Consensus 118 ------l~I~g~~v~v~-V--------~~gdtaa--------------~iA~~l~~aIn~~~~lpVtA~~~----~---- 160 (498) +..+. ...+. + ...++.. .+............++|.++... + T Consensus 157 ~g~~~~l~~~~-~~~~~~~~~~~~~a~sv~~~~~~g~~~~~~~~~a~~~~~~~~~~~~~~~~~~~a~~A~~~g~~g~~i~ 235 (666) T protein:vir:65 157 IGVYPELDGGW-TAEFTSSSGNGSAALSVTKIVTDSGLLLTDLETSRANITNQTFLTKLKKYDMPAVSAIYAGEIGNSLE 235 (666) T ss_pred cCcceeEeecc-ceeecccCcccccceeeeecccccceeeeeecccccccccccccccccccccceeeeeecccccccee Confidence 00000 00000 0 0000000 00000000000001111111000 0 Q ss_pred ---------------------------------------ceEEEeeccCcccccceeEEEE-----------ecc----c Q lcl|Aclame:pro 161 ---------------------------------------GVVTLTARHKGLCGNEIPVSLN-----------YYG----F 186 (498) Q Consensus 161 ---------------------------------------~~VtlTAk~kG~~gN~i~l~~~-----------~~~----~ 186 (498) ...+++.+.+|..-...++... |+. . T Consensus 236 v~i~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~ 315 (666) T protein:vir:65 236 VEILARSAFKNTAPDLTMYPYGGERTAARNLIPYAPQNDNQYAFIVRRDGVVVESYVLSTLKGDKDVYGNSIYMDDFFAR 315 (666) T ss_pred EEeecccccccccccccccccccccccceeeecccccccccceeeeecCCcccceeecccCcccccccchhhhhhhhhcc Confidence 0011222222322111111100 000 0 Q ss_pred Cc-------ccccccceeeeecccCCCcC--------------cchhhhHHHhhcc---CcceEEEecCCC------hHH Q lcl|Aclame:pro 187 GG-------GEVLPAGVQIAVATGTAGTG--------------APVLTGAVAAMAD---EPFDYIGLPFND------TAS 236 (498) Q Consensus 187 ~~-------ge~~p~Glt~tit~~agGag--------------~pD~~~alaalg~---~~~~~I~~p~tD------~a~ 236 (498) .. ....+.+.+..+ .+++|.. ..+...++.++.+ ..+++++.|... .+. T Consensus 316 ~~~~~v~~~~~~~~~~~~~~~-~~~~g~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~l~~p~~~~~~~~~~~v 394 (666) T protein:vir:65 316 GSSQYIYATAQGWVDGFSGII-SLAGGVSANEATTGGVGADPFIGAMMQGWDLFAERESIHVNLLIAGACAGEGDAFSTV 394 (666) T ss_pred cccceeeeecccccccccceE-EccCCCCcCcccccccccccccccHHHHHHHHhhhhhccCCceeecCcCCccchhHHH Confidence 00 001111222111 1222211 1234456666654 357888877532 344 Q ss_pred HHHHHHHHhhhhhhhhhhhh-eeeEEEEeccCCHHHHHhhhhc----------cCcceEEEEecC--------CC-CCCc Q lcl|Aclame:pro 237 VNTLVTEMNDTSGRWSYARQ-LYGHVYTAKTGTLSELVNAGDQ----------FNQQHITLAGYE--------KE-TQTP 296 (498) Q Consensus 237 l~al~~~l~~~s~r~~~~~q-~~g~~~~~~~gt~~~~~t~g~~----------~N~~~~t~~~~~--------~~-~~~p 296 (498) ..++.++.+....++..... +....-....-+..++.++-.. +||.|..+.+.. +. ...| T Consensus 395 ~~~l~~~~~~~~~~~a~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p 474 (666) T protein:vir:65 395 QKHAVSIGDERQDCLVMVSPPRSTVVNIPVTTAIDNLIAWREGSGNYNENNMNINTTYAVIDGNYKYQYDKYNDVNRWVP 474 (666) T ss_pred HHHHHHHHhhccceEEEeccccceeeecCCCCCHHHHHHHHHhcccccccccccCcceEEEEcCceEEecccCCceeEec Confidence 45556555432222110000 0011111223456666665443 567776655321 11 1124 Q ss_pred HHHHHHHHHHHhhhhhccCcc-ccccc--eEEeccccCCCccccChHHHHHHHhCCeeEEEEcCCe-EEEEeeeeeeeec Q lcl|Aclame:pro 297 ADELAASRTARAAVFIRNDPA-RPTQT--GELVGMLPAPKGKRFTMTEQQTLLSHGVATAYVESGV-LRIQRDVTTYRKN 372 (498) Q Consensus 297 ~~~~AAa~~a~~a~~l~~DPA-rpl~t--l~L~Gl~~p~~~~r~~~~er~~lL~~Gist~~v~~G~-v~IeR~ITTY~~n 372 (498) +....|.+.|+.. .+..|- .|-+. ..+.|+. +..-.++..|++.|..+||.++..-.|. .++--.-|. T Consensus 475 ~sg~vAGl~Ar~D--~~~g~~~span~~~~~i~g~~--~~~~~~~~~~~~~Ln~~gIn~i~~~~~~G~~~wG~rT~---- 546 (666) T protein:vir:65 475 LAADIAGLCARTD--AVSQPWMSPAGYNRGQIMNVV--KLAIEPRKAHRDRLYQAAINPVIGAGGEGFILMGDKTA---- 546 (666) T ss_pred hHHHHHHHHHHHh--ccCCcEEccCCeecceeeccc--cceeecChhHHHhhhhCCceEEEEeCCCeEEEEecccC---- Confidence 4333333333322 122231 12221 1244442 3455778999999999999998764332 666555554 Q ss_pred CCCCCCchhhhhhhHHHHHHHHHHHHHHHhhhcCCceeccCCCCcCCCcccccHHHHHHHHHHHHHHHhhcccccchhhh Q lcl|Aclame:pro 373 AYGVADNSYLDSETLHTSAYVLRKLKSVITSKYGRHKLASDGTRFGPGQAIVTPAVIKGELLATYRQLERAGIVENYELF 452 (498) Q Consensus 373 ~~G~~D~s~ldi~t~~tl~yv~~~~r~~~~~~~~r~kla~dg~~~~~g~~ivTp~~ikaeli~~~~~le~~given~~~~ 452 (498) . ..|..|+.|.+.|..+|+.+.++.... .|-.+.+... +-+.||..+-..+++|..+|.+..|. T Consensus 547 -~-~~~s~~~~i~vrR~~~~i~~si~~~~~-~~v~epn~~~-----------l~~~i~~~i~~~L~~l~~~gal~g~~-- 610 (666) T protein:vir:65 547 -T-TVPSPFDRINVRRLFNMLKKNIGDSSK-YKLFENNDNF-----------TRASFRMEVSQYLSTIRSLGGIYDFR-- 610 (666) T ss_pred -C-CCCcccceEehhhHHHHHHHHHHHHHH-HhccCCCCHH-----------HHHHHHHHHHHHHHHHHhCCceeeeE-- Confidence 1 123469999999999999999998664 4544443221 44788999999999999999999853 Q ss_pred cCeEEEEEc--CC---CCeEEEEEeeeEEecCeEEEeeeeeeEEEecccCC Q lcl|Aclame:pro 453 KQYLVVERD--AS---VPNRLNTLFPPDYVNQLRVFAVVNQFRLQYSEESA 498 (498) Q Consensus 453 ~~~lvVerd--~~---d~nRvn~~~p~~~vn~l~v~A~~~~f~lq~~~~~~ 498 (498) +.|-.+ ++ +.+|+.+.+-......++-| .|++|....-+ T Consensus 611 ---V~~d~~~nt~~~i~~G~~~~~i~~~p~~pae~i----~~~~~~~~~~~ 654 (666) T protein:vir:65 611 ---VQCDTTNNTPDVIDRNEFVASMFIKPAKSINYI----MLNFTAVATGS 654 (666) T ss_pred ---EEEcCCCCCHHHhhCCeEEEEEEEEecCCcceE----EEEEEEeecCc Confidence 333222 21 24777777766665555443 34444332222 No 46 >protein:vir:1172 Length: 391 # NCBI annotation: hypothetical protein # Family: family:all:115 # MgeID: mge:24 # MgeName: phi CTX # Cross-refs: genbank:acc:NP_490621;genbank:gi:17313241;genbank:GeneID:927300 Probab=99.36 E-value=4.8e-13 Score=88.14 Aligned_cols=361 Identities=15% Similarity=0.121 Sum_probs=188.9 Q ss_pred CccchhhcCcccccCeEEEEEecCCCCC--CCCCccEEEEEecCC--CCccccceeEEecChHHHHHhhCcCcHHHHHHH Q lcl|Aclame:pro 1 MTISFNTIPSNTLVPLFYAEMDNQAANT--AQDSGASLLIGHANN--GAEIVANSLVLMPSADYARQICGAGSQLARMVE 76 (498) Q Consensus 1 M~i~f~~Ip~~~rvPg~y~E~dns~a~~--~~~~~~vLliGq~~~--~g~~~~~~~~~v~s~~~A~~~fG~GS~l~~M~~ 76 (498) |..+. ..||+|+|-.++.... ....--+.++|-.-. +...+.++|+++.|..++...||.+..+...++ T Consensus 1 M~~~~-------~~~GV~v~e~~~~~~~i~~v~tavig~vg~a~~a~~~~~~~~~p~~v~s~~~~~~~~g~~~tl~~al~ 73 (391) T protein:vir:11 1 MAADQ-------YHHGVRVQEINDGTRPIRTIATAIIGLVATAEDADATAFPLDTPVLITNVQAAIGKAGTSGTLPASLQ 73 (391) T ss_pred CCCCc-------CCCcEEEEECCCCcceecccCCceeEEEEecCCCCCccccccccEEEecchhhheecCCCccchhhhh Confidence 65543 5799999876654421 122334455554321 124456899999999999999997665554444 Q ss_pred HHHHhCCCceEEEEEecCCccceeEEEEEEeeeccCCcEEEEEEccEEEEEEeecCCCHHHHHHHHHHHHhcCCCceEEE Q lcl|Aclame:pro 77 AYRQTDPFGELYVIAVPEATGAAATVTLTVTGEATESGTVNVYVGRTRVQAPVTNGDNVTTIASSIQDAINAVPTLPFTA 156 (498) Q Consensus 77 a~~~~n~~~~l~~i~l~d~ag~aatg~ititgtat~~G~l~l~I~g~~v~v~V~~gdtaa~iA~~l~~aIn~~~~lpVtA 156 (498) .+.++ .-...+++.+. .+ .+.++++ T Consensus 74 ~~~~~-~g~~~~vv~~~-------------------------------------~~-----------------~~~~~t~ 98 (391) T protein:vir:11 74 AIADQ-ANAATVVVRVK-------------------------------------PG-----------------EDEAATN 98 (391) T ss_pred hhhcc-ccceeEEeeec-------------------------------------cc-----------------ccccccc Confidence 44321 11112222111 10 0001111 Q ss_pred eeccceEEEeeccCcccccceeEEEEecccCcccccccceeeeecccCCCcCcchhhhHHHhhccCcceEEEec-CCChH Q lcl|Aclame:pro 157 SSSAGVVTLTARHKGLCGNEIPVSLNYYGFGGGEVLPAGVQIAVATGTAGTGAPVLTGAVAAMADEPFDYIGLP-FNDTA 235 (498) Q Consensus 157 ~~~~~~VtlTAk~kG~~gN~i~l~~~~~~~~~ge~~p~Glt~tit~~agGag~pD~~~alaalg~~~~~~I~~p-~tD~a 235 (498) ....+.+.-++..+|.. .+..+...++.. ..++..| +++.+ T Consensus 99 ~d~~g~~~a~~~~~g~~-------------------------------------a~~~~~~~~~~~-p~~~~ap~~~~~~ 140 (391) T protein:vir:11 99 SAVIGGVSADGKYTGMK-------------------------------------ALLAAKARLGVV-PRILGVPGLDTQP 140 (391) T ss_pred hhhhcccccccchhhhh-------------------------------------hhhhhhhhheec-cccccccccccHH Confidence 11111111111111100 001111111111 1222222 22233 Q ss_pred HHHHHHHHHhhhhhhhhhhhheeeEEEEeccCCHHHHHhhhhccCcceEEEEecCC---------CCCCcHHHHHHHHHH Q lcl|Aclame:pro 236 SVNTLVTEMNDTSGRWSYARQLYGHVYTAKTGTLSELVNAGDQFNQQHITLAGYEK---------ETQTPADELAASRTA 306 (498) Q Consensus 236 ~l~al~~~l~~~s~r~~~~~q~~g~~~~~~~gt~~~~~t~g~~~N~~~~t~~~~~~---------~~~~p~~~~AAa~~a 306 (498) ...++..+.+ +. +..++.-.....++.++..+-...++.|..+.+... ....|+...+|.+.+ T Consensus 141 v~~al~~~~~----~~----~~~~i~D~p~~~t~~~a~~~r~~~~s~~~~~~~p~~~~~~~~~~~~~~~p~s~~~ag~~a 212 (391) T protein:vir:11 141 VATALIAIAQ----QL----RAFAYVSASGCKTKEEATAYRENFAAREAMVIWPDFLTWSTVVNQTVPAPAVAQALGLRA 212 (391) T ss_pred HHHHHHHhhc----cc----ceEEEEEcCCCCCHHHHHHHhhhcCCceEEEEcCcceecccccCceEEechHHHHHHHHH Confidence 3344444332 11 222222222334788888888888888877664211 111344333333333 Q ss_pred HhhhhhccCccccccceEEeccccCCC----ccccChHHHHHHHhCCeeEEEEcCCeEEEEeeeeeeeecCCCCCCchhh Q lcl|Aclame:pro 307 RAAVFIRNDPARPTQTGELVGMLPAPK----GKRFTMTEQQTLLSHGVATAYVESGVLRIQRDVTTYRKNAYGVADNSYL 382 (498) Q Consensus 307 ~~a~~l~~DPArpl~tl~L~Gl~~p~~----~~r~~~~er~~lL~~Gist~~v~~G~v~IeR~ITTY~~n~~G~~D~s~l 382 (498) ..-. +..|.-.--...|.|+..+.. +...+..|.+.|...||.++.-+.| .++--..|. ..|+.|+ T Consensus 213 ~~d~--~~g~~~span~~l~gi~~~~~~~~~~~~~~~~~~~~Ln~~gi~~~~~~~G-~~~wG~rT~-------~~d~~~~ 282 (391) T protein:vir:11 213 RIDQ--EVGWHKTLSNVAVNGVTGISADVFWDLQSPSTDANYLNENEVTTLVQEGG-FRFWGSRTC-------SDDPLFA 282 (391) T ss_pred Hhhc--cCCcEEccCCceeeceeecccccccccCCCcchhhhhhhcCcEEEEcCCC-EEEEccccc-------CCCcccc Confidence 3221 111211111234555544333 2334678899999999999865677 444455553 2478899 Q ss_pred hhhhHHHHHHHHHHHHHHHhhhcCCceeccCCCCcCCCcccccHHHHHHHHHHHHHHHhhcccccchhhhcCeEEEEEcC Q lcl|Aclame:pro 383 DSETLHTSAYVLRKLKSVITSKYGRHKLASDGTRFGPGQAIVTPAVIKGELLATYRQLERAGIVENYELFKQYLVVERDA 462 (498) Q Consensus 383 di~t~~tl~yv~~~~r~~~~~~~~r~kla~dg~~~~~g~~ivTp~~ikaeli~~~~~le~~given~~~~~~~lvVerd~ 462 (498) .|.+.|+.+|+.+.++..+. .|-.+.+... |-+.|+..+=..+++|..+|.+.+++.+-+ -+.|+ T Consensus 283 ~i~vrR~~~~i~~~~~~~~~-~~v~e~n~~~-----------~~~~i~~~i~~~l~~l~~~g~l~g~~~~~~---~~~n~ 347 (391) T protein:vir:11 283 FENYTRTAQVLADTIAEAHM-WAVDKPMHPS-----------LVRDILEGVNAKFRELKGLGLIIDAQAWYD---PNVND 347 (391) T ss_pred eeehhhHHHHHHHHHHHHHH-HhccCCCCHH-----------HHHHHHHHHHHHHHHHHhccceeceEEEEe---cCCCC Confidence 99999999999999999775 4444433222 457789999999999999999999754321 12222 Q ss_pred CC---CeEEEEEeeeEEecCeEEEeeeeeeEEEecccCC Q lcl|Aclame:pro 463 SV---PNRLNTLFPPDYVNQLRVFAVVNQFRLQYSEESA 498 (498) Q Consensus 463 ~d---~nRvn~~~p~~~vn~l~v~A~~~~f~lq~~~~~~ 498 (498) ++ .+|+.+.+-...+..++ .|.|+++|+.+-- T Consensus 348 ~~~i~~G~~~~~i~~~p~~p~e----~i~~~~~~~~~~~ 382 (391) T protein:vir:11 348 KDTLKAGKLRITYDYTPVPPLE----DLTFFQKITDSYL 382 (391) T ss_pred HHHhhCCeEEEEEEEEecCCcc----eEEEEEEEchHHH Confidence 22 36777777766666644 5666666665543 No 47 >protein:vir:5711 Length: 396 # NCBI annotation: gpFI # Family: family:all:115 # MgeID: mge:120 # MgeName: L-413C # Cross-refs: genbank:acc:NP_839870;genbank:gi:30065725;genbank:GeneID:1260618 Probab=99.35 E-value=1.8e-12 Score=85.01 Aligned_cols=360 Identities=16% Similarity=0.118 Sum_probs=192.7 Q ss_pred CccchhhcCcccccCeEEEEEecCCCCCC--CCCccEEEEEecCC--CCccccceeEEecChHHHHHhhCcCcHHHHHHH Q lcl|Aclame:pro 1 MTISFNTIPSNTLVPLFYAEMDNQAANTA--QDSGASLLIGHANN--GAEIVANSLVLMPSADYARQICGAGSQLARMVE 76 (498) Q Consensus 1 M~i~f~~Ip~~~rvPg~y~E~dns~a~~~--~~~~~vLliGq~~~--~g~~~~~~~~~v~s~~~A~~~fG~GS~l~~M~~ 76 (498) |+ . ..||+|+|-.++.+.+. ....-+.+||..-. ....+.++|++++|..+....||..+-++..++ T Consensus 1 m~-------~--~~~GV~v~e~~~g~~~i~~v~tav~~~vg~a~~~d~~~~~~~~pv~i~s~~~~~~~~g~~~tl~~al~ 71 (396) T protein:vir:57 1 MS-------D--YHHGVQVLEINDGTRVISTVSTAIVGMVCTASDADAETFPLNKPVLITNVQSAIAKAGKKGTLAASLQ 71 (396) T ss_pred CC-------C--CCCceEEEEcCCCcccccccCCceEEEEEeccCCCcccccCccCeEeecchhhhhhcccccchHHHHH Confidence 44 2 56999999776655322 33445677776432 224467899999999999999999888877777 Q ss_pred HHHHhCCCceEEEEEecCCccceeEEEEEEeeeccCCcEEEEEEccEEEEEEeecCCCHHHHHHHHHHHHhcCCCceEEE Q lcl|Aclame:pro 77 AYRQTDPFGELYVIAVPEATGAAATVTLTVTGEATESGTVNVYVGRTRVQAPVTNGDNVTTIASSIQDAINAVPTLPFTA 156 (498) Q Consensus 77 a~~~~n~~~~l~~i~l~d~ag~aatg~ititgtat~~G~l~l~I~g~~v~v~V~~gdtaa~iA~~l~~aIn~~~~lpVtA 156 (498) .+... .-...+++.+............ ..+. .-.|| T Consensus 72 ~~~~~-~~~~~~vv~~~~~~~~~~~~~~----a~t~----~~iiG----------------------------------- 107 (396) T protein:vir:57 72 AIADQ-SKPVTVVVRVEDGTGDDEETKL----AQTV----SNIIG----------------------------------- 107 (396) T ss_pred Hhhhc-CCceeEeeeccccccccccccc----cccc----eeeee----------------------------------- Confidence 66533 2222333322211100000000 0000 00000 Q ss_pred eeccceEEEeeccCcccccceeEEEEecccCcccccccceeeeecccCCCcCcchhhhHHHhhc---cCcceEEEecCCC Q lcl|Aclame:pro 157 SSSAGVVTLTARHKGLCGNEIPVSLNYYGFGGGEVLPAGVQIAVATGTAGTGAPVLTGAVAAMA---DEPFDYIGLPFND 233 (498) Q Consensus 157 ~~~~~~VtlTAk~kG~~gN~i~l~~~~~~~~~ge~~p~Glt~tit~~agGag~pD~~~alaalg---~~~~~~I~~p~tD 233 (498) .+.-+.+.. |+ .+|.... ....+++++|..+ T Consensus 108 -----~~~~~~~~t------------------------gl-----------------~al~~~~~~~~~~p~i~~ap~~~ 141 (396) T protein:vir:57 108 -----TTDENGQYT------------------------GL-----------------KALMGAESVTGVKPRILGVPGLD 141 (396) T ss_pred -----eccccccch------------------------hh-----------------hhhhhcccceeEEeccccCcccc Confidence 000000000 11 0111100 0112333444333 Q ss_pred h-HHHHHHHHHHhhhhhhhhhhhheeeEEEEeccCCHHHHHhhhhccCcceEEEEecCC---------CCCCcHHHHHHH Q lcl|Aclame:pro 234 T-ASVNTLVTEMNDTSGRWSYARQLYGHVYTAKTGTLSELVNAGDQFNQQHITLAGYEK---------ETQTPADELAAS 303 (498) Q Consensus 234 ~-a~l~al~~~l~~~s~r~~~~~q~~g~~~~~~~gt~~~~~t~g~~~N~~~~t~~~~~~---------~~~~p~~~~AAa 303 (498) . ....+|..+.+ +. ...+..-....-+..++.++...+|+.|..+.+... ....|+. +. T Consensus 142 ~~~v~~al~~~~~----~~----~~~~~~d~p~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s---~~ 210 (396) T protein:vir:57 142 TKEVAVALASVCQ----EL----NAFGYISAWGCKTISEVKAYRQNFSQRELMVIWPDFLAWDTVTSTTATAYAT---AR 210 (396) T ss_pred hhHHHHHHHHHhh----hC----ceEEEEcCCCCCCHHHHHHHHhccCCceEEEEcceeeeecccCCceeEEehh---HH Confidence 2 23344444432 11 122222222234677888888888998887764211 1113432 33 Q ss_pred HHHHhhhh-hccCcc-ccccceEEeccccCCC----ccccChHHHHHHHhCCeeEEEEcCCeEEEEeeeeeeeecCCCCC Q lcl|Aclame:pro 304 RTARAAVF-IRNDPA-RPTQTGELVGMLPAPK----GKRFTMTEQQTLLSHGVATAYVESGVLRIQRDVTTYRKNAYGVA 377 (498) Q Consensus 304 ~~a~~a~~-l~~DPA-rpl~tl~L~Gl~~p~~----~~r~~~~er~~lL~~Gist~~v~~G~v~IeR~ITTY~~n~~G~~ 377 (498) +|+..|.- .+..|- .|-+ ..|.|+..+.. .......|.+.|..+||.++.-+.| .++--..|. .. T Consensus 211 ~Ag~~a~~d~~~g~~~spaN-~~l~gi~~~~~~~~~~~~~~~~~~~~Ln~~gi~t~~~~~G-~~~wG~rT~-------~~ 281 (396) T protein:vir:57 211 ALGLRAKIDQEQGWHKTLSN-VGVNGVTGISASVFWDLQKPGTDADLLNEAGVTTLVRRDG-FRFWGNRTC-------SD 281 (396) T ss_pred HHHHHHHhhhccCcEeccCC-ceeccccccceecccccCCcchhhhhhhhcCcEEEEcCCC-EEEEccccc-------CC Confidence 33333321 122221 1322 35666654332 2234578899999999999865677 455455553 24 Q ss_pred CchhhhhhhHHHHHHHHHHHHHHHhhhcCCceeccCCCCcCCCcccccHHHHHHHHHHHHHHHhhcccccchhhhcCeEE Q lcl|Aclame:pro 378 DNSYLDSETLHTSAYVLRKLKSVITSKYGRHKLASDGTRFGPGQAIVTPAVIKGELLATYRQLERAGIVENYELFKQYLV 457 (498) Q Consensus 378 D~s~ldi~t~~tl~yv~~~~r~~~~~~~~r~kla~dg~~~~~g~~ivTp~~ikaeli~~~~~le~~given~~~~~~~lv 457 (498) |+.|+.|.+.|+.+|+.+.++..+. .|-.+.+... +-+.|+..+=..+++|..+|.+..++.+-+ T Consensus 282 d~~~~~i~vrR~~~~i~~~i~~~~~-~~v~e~n~~~-----------~~~~i~~~i~~~l~~l~~~gal~g~~v~~d--- 346 (396) T protein:vir:57 282 DPLFLFESYTRTAQVLADTMAEAHM-WAIDKPITAT-----------LIRDIIDGINAKFRELKNNGYIVDGTCWFS--- 346 (396) T ss_pred CcccceeehhhHHHHHHHHHHHHHH-HhccCCCCHH-----------HHHHHHHHHHHHHHHHHhCCceeceEEEEe--- Confidence 7889999999999999999999775 4554443322 457899999999999999999998753321 Q ss_pred EEEcCC---CCeEEEEEeeeEEecCeEEEeeeeeeEEEecccCC Q lcl|Aclame:pro 458 VERDAS---VPNRLNTLFPPDYVNQLRVFAVVNQFRLQYSEESA 498 (498) Q Consensus 458 Verd~~---d~nRvn~~~p~~~vn~l~v~A~~~~f~lq~~~~~~ 498 (498) -+.|.+ +.+|+.+.+-...+..++ .|.|++++..+-- T Consensus 347 ~~~n~~~~i~~G~~~~~v~~~p~~p~e----~I~~~~~~~~~~~ 386 (396) T protein:vir:57 347 EESNDAETLKAGKLYIDYDYTPVPPLE----NLTLRQRITSRYL 386 (396) T ss_pred cCCCCHHHhhCCeEEEEEEEEecCCcc----eEEEEEEEchHHH Confidence 122222 236666666666666655 5566666655443 No 48 >protein:vir:100539 Length: 663 # NCBI annotation: gp18 tail sheath monomer # Family: family:all:661 # MgeID: mge:1488 # MgeName: 25 # Cross-refs: genbank:acc:YP_656380;genbank:gi:109290131;genbank:GeneID:4156517 Probab=99.32 E-value=3.6e-11 Score=77.83 Aligned_cols=447 Identities=11% Similarity=0.051 Sum_probs=210.8 Q ss_pred cccccCeEEEE-EecCCCCCCCCCccEEEEEecCCCCccccceeEEecChHHHHHhhCc---CcHHHHHHHHHHHhCCCc Q lcl|Aclame:pro 10 SNTLVPLFYAE-MDNQAANTAQDSGASLLIGHANNGAEIVANSLVLMPSADYARQICGA---GSQLARMVEAYRQTDPFG 85 (498) Q Consensus 10 ~~~rvPg~y~E-~dns~a~~~~~~~~vLliGq~~~~g~~~~~~~~~v~s~~~A~~~fG~---GS~l~~M~~a~~~~n~~~ 85 (498) -+...||+|+| ++-+..-.........++|. ...++.++|++|+|..|-...||. .+-+..|++.|+. |--+ T Consensus 1 ~~~~~Pgvyv~e~~~~~~~~~v~t~~~~fvG~---~~~gp~~~p~~i~s~~~~~~~fG~~~~~~~~~~~v~~~f~-ngg~ 76 (663) T protein:vir:10 1 MALLSPGIEMKETSINSTVVRSATGRAALVGK---FAWGPAYEIRQVTNEVELVDMFGSPDNVTAPYFMSAMNFL-QYGN 76 (663) T ss_pred CccccCceEEEEecCcccccccccccceeeec---cccCCCCcCEEecCHHHHHHHcCCcccccchHHHHHHHHH-hCCC Confidence 55788999998 43222211222445677775 446688999999999999999997 6788899999984 4556 Q ss_pred eEEEEEecCCc-ccee-----EEEEEEe--eec-cCCcEEEEE----------------EccEEEEEEeecCC------- Q lcl|Aclame:pro 86 ELYVIAVPEAT-GAAA-----TVTLTVT--GEA-TESGTVNVY----------------VGRTRVQAPVTNGD------- 133 (498) Q Consensus 86 ~l~~i~l~d~a-g~aa-----tg~itit--gta-t~~G~l~l~----------------I~g~~v~v~V~~gd------- 133 (498) .+|++.+.+.. +.++ ..+.++. +++ +.+..+.+. -.|..+.+.+..+. T Consensus 77 ~~~vvRv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~v~~~~~~~~~~~~~~~~~~~g~~~~~~~~~a~~~~~a~~ 156 (663) T protein:vir:10 77 DLRLVRVIDMEQAKNASPLFNQIEVTITTEGQGYTVGDTVSIKHNTTTVTEEGKVTKVDADGKIKALFVPSSAVIAKAKQ 156 (663) T ss_pred eEEEEecCCcccccccccccccceeeEeecccCccccceeeecccccccccCcceeeeccCCceeEEEeccccccccccc Confidence 89999997631 1111 1111111 111 000111111 11111111111100 Q ss_pred -------------------CHHHHHHHHHHHHhcCC-CceEEEee--------------ccceEEEeeccCcccccceeE Q lcl|Aclame:pro 134 -------------------NVTTIASSIQDAINAVP-TLPFTASS--------------SAGVVTLTARHKGLCGNEIPV 179 (498) Q Consensus 134 -------------------taa~iA~~l~~aIn~~~-~lpVtA~~--------------~~~~VtlTAk~kG~~gN~i~l 179 (498) ....-+..+...++... ..+....+ ..+...+.++..|+.||.+.+ T Consensus 157 ~~~~~~~~~a~~~~v~~~~~~~~~a~av~~i~~dg~vt~~~~~~a~~~~~~~~~~~~~~~~~~~~~~a~~~g~~G~~i~v 236 (663) T protein:vir:10 157 LGTYPVLGDNWRAEVSGASGGSAATLTLGGIVVDSGVTFGNSEEAPDVMTSTKVLANFAKYGMPLISAVYPGEIGSTVEV 236 (663) T ss_pred cccccccccceeeEEeeccccccccceeEeeecCCceeEEeeeccccccccceeeeeccccccceeeeecccccCcceeE Confidence 00000000000000000 00000000 001122335555666666655 Q ss_pred EEEecccCc----------------------------------------------------------------------- Q lcl|Aclame:pro 180 SLNYYGFGG----------------------------------------------------------------------- 188 (498) Q Consensus 180 ~~~~~~~~~----------------------------------------------------------------------- 188 (498) .+..+.... T Consensus 237 ~~~~~~~~~~~~~~~v~~~~g~~~~~~~~~~~~g~~~~~~~~~~~~~~g~~~e~~~ls~~~~~~~~~~~~~~~~~~~~~~ 316 (663) T protein:vir:10 237 EVISKTAFQSGAAQPIYPFGGTRASNARSVIQYGPMTDDQFAIIVRRDGIVVESTVLSTRRGDRDVYGNNIFMDDYFRNG 316 (663) T ss_pred eecccccccccceeeecccCcccccccccccccccccchhhcccccCCCcccceeeeeccccccccchhhhhhhhhhcCc Confidence 432211000 Q ss_pred --------ccccccceeeeecccCCCcCcc------hhhhHHHhhcc---CcceEEEecCCChH---HHHHHHHHHhhhh Q lcl|Aclame:pro 189 --------GEVLPAGVQIAVATGTAGTGAP------VLTGAVAAMAD---EPFDYIGLPFNDTA---SVNTLVTEMNDTS 248 (498) Q Consensus 189 --------ge~~p~Glt~tit~~agGag~p------D~~~alaalg~---~~~~~I~~p~tD~a---~l~al~~~l~~~s 248 (498) .+..|.+.+ .....+||...| |+..+++.+.+ ....+++++...+. ....+..+|.... T Consensus 317 ~s~~v~~~~~~~~~~~~-~~~~l~gg~~~~~~~~~~d~~~~~~~l~~~~~~~~~~~i~~~~~~~~~~~~~~v~~~l~~~~ 395 (663) T protein:vir:10 317 SSNFIYASSVNWPAGFT-GIIQLGGGASANNAVGSDELIAGWDLFADREALHVNLMIAGACKSDGVAVASTVQKHVVALA 395 (663) T ss_pred ccceeEeeccccCcccc-eeEEecccccCcccchhhhhhhHHhhhccccccCceEEEeecCCCCchhhHHHHHHHHHHHH Confidence 000011111 011334554333 44444555432 23344444322221 1122333332211 Q ss_pred hhhhhhhheeeEEEE-eccCC---------HHHHHhh-------------hhccCcceEEEEecC--------C-CCCCc Q lcl|Aclame:pro 249 GRWSYARQLYGHVYT-AKTGT---------LSELVNA-------------GDQFNQQHITLAGYE--------K-ETQTP 296 (498) Q Consensus 249 ~r~~~~~q~~g~~~~-~~~gt---------~~~~~t~-------------g~~~N~~~~t~~~~~--------~-~~~~p 296 (498) .+ +++.+++. +.++. +.++..+ -..+++.+..+.+.. + ....| T Consensus 396 ~~-----~~~~~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~l~~p~~~~~d~~~~~~~~~p 470 (663) T protein:vir:10 396 DD-----RQDCVAFVNPPSELLVGVPTTQAVKNIVEWRNGVTTGGEVVDNNMNISSTYAFISGNYKYQYDKYNDINRWVP 470 (663) T ss_pred Hh-----hCCEEEEEecCcccccccchhhhHHHHHHHhhhccccchhhhhhcccCcceEEEEecceeEecccCCceEEec Confidence 11 12222222 22221 1222222 223466666665421 1 11124 Q ss_pred HHHHHHHHHHHhhhh-hccCcc-ccccceEEeccc-cCCCccccChHHHHHHHhCCeeEEEE-c--CCeEEEEeeeeeee Q lcl|Aclame:pro 297 ADELAASRTARAAVF-IRNDPA-RPTQTGELVGML-PAPKGKRFTMTEQQTLLSHGVATAYV-E--SGVLRIQRDVTTYR 370 (498) Q Consensus 297 ~~~~AAa~~a~~a~~-l~~DPA-rpl~tl~L~Gl~-~p~~~~r~~~~er~~lL~~Gist~~v-~--~G~v~IeR~ITTY~ 370 (498) +.. .+|+..|.- .+..|. .|-+ ..+.|+. +.+....++..|++.|..+||.++.. . .| .++-=.-|. T Consensus 471 ~s~---~vAGl~Ar~D~~~g~~~span-~~~~~i~g~~~~~~~~~~~~~~~Ln~~gin~i~~~~~~~G-~~~wG~rT~-- 543 (663) T protein:vir:10 471 LSA---DIAGLCAYTDQVGHPWMSPAG-YRRGQLRNTIKLAIEPKQSLRDTMYQVSINPVTGFAGGDG-FVLFGDKMA-- 543 (663) T ss_pred hHH---HHHHHHHHhhccCCcEEccCC-eeecceeccccceeecCchhHHHHHhCCCcEEEEeeCCCc-EEEEccccc-- Confidence 423 333333321 122231 1222 2223332 23445678999999999999988754 2 34 334333342 Q ss_pred ecCCCCCCchhhhhhhHHHHHHHHHHHHHHHhhhcCCceeccCCCCcCCCcccccHHHHHHHHHHHHHHHhhcccccchh Q lcl|Aclame:pro 371 KNAYGVADNSYLDSETLHTSAYVLRKLKSVITSKYGRHKLASDGTRFGPGQAIVTPAVIKGELLATYRQLERAGIVENYE 450 (498) Q Consensus 371 ~n~~G~~D~s~ldi~t~~tl~yv~~~~r~~~~~~~~r~kla~dg~~~~~g~~ivTp~~ikaeli~~~~~le~~given~~ 450 (498) .+ .|..|+.|.+.|+.+|+.+.++.... .|-.+.+... +-+.||..+-+.+++|..+|.+..|. T Consensus 544 ---s~-~~s~~~~i~vrR~~~~i~~si~~~~~-~~v~epn~~~-----------l~~~i~~~i~~~L~~l~~~gal~gf~ 607 (663) T protein:vir:10 544 ---TQ-VPSPFDRINVRRLFNMLKKNIGDTSK-YELFENNDAF-----------TRQSFRMEVSQYLDNIRSLGGVYDFR 607 (663) T ss_pred ---CC-CCcccceEehhhHHHHHHHHHHHHHH-HhccCCCCHH-----------HHHHHHHHHHHHHHHHHhCCceeeeE Confidence 11 23469999999999999999999664 4544433221 44778999999999999999998842 Q ss_pred hhcCeEEEEEcC--C---CCeEEEEEeeeEEecCeEEEeeeeeeEEEecccCC Q lcl|Aclame:pro 451 LFKQYLVVERDA--S---VPNRLNTLFPPDYVNQLRVFAVVNQFRLQYSEESA 498 (498) Q Consensus 451 ~~~~~lvVerd~--~---d~nRvn~~~p~~~vn~l~v~A~~~~f~lq~~~~~~ 498 (498) +.+-.+. + +.+|+.+.+-...+..++-| .|+++...+-+ T Consensus 608 -----V~~d~~~nt~~~i~~G~~~~~i~~~p~~pae~I----~~~~~~~~~~~ 651 (663) T protein:vir:10 608 -----VVCDTTNNTPQVIDSNEFVATIYIKAPRSINYI----TLNFVATSTGA 651 (663) T ss_pred -----EEEcCCCCCHHHhhCCeEEEEEEEEecCCcceE----EEEEEEEecCc Confidence 3333221 1 25677777777766665543 34544433333 No 49 >protein:vir:79181 Length: 390 # NCBI annotation: gp30, phage tail sheath protein # Family: family:all:115 # MgeID: mge:1866 # MgeName: phiE202 # Cross-refs: genbank:acc:YP_001111061;genbank:gi:134288772;genbank:GeneID:4960700 Probab=99.32 E-value=4.3e-12 Score=82.88 Aligned_cols=355 Identities=15% Similarity=0.084 Sum_probs=182.2 Q ss_pred CccchhhcCcccccCeEEEEEecCCCC--CCCCCccEEEEEecCCC--CccccceeEEecChHHHHHhhCcCcHHHHHHH Q lcl|Aclame:pro 1 MTISFNTIPSNTLVPLFYAEMDNQAAN--TAQDSGASLLIGHANNG--AEIVANSLVLMPSADYARQICGAGSQLARMVE 76 (498) Q Consensus 1 M~i~f~~Ip~~~rvPg~y~E~dns~a~--~~~~~~~vLliGq~~~~--g~~~~~~~~~v~s~~~A~~~fG~GS~l~~M~~ 76 (498) |. +.. .||+|+|-.++.+. ......-+.++|..-.. ...+.++|++++|..+....||.+--+...++ T Consensus 1 M~-------~~~-~~Gv~v~e~~~~~~~i~~~~tav~~~vg~a~dad~~~~p~n~pv~its~~~~~~~~g~~~tL~~al~ 72 (390) T protein:vir:79 1 MP-------QDY-HHGVRVIEINEGGRPIRSVSTAVLGVVCTAADADASAFPLNTPVLLTNVVAALGKAGKKGTLRRTLD 72 (390) T ss_pred Cc-------ccc-CCCeEEEEcCCCcccccccCCceeEEEEecCCCCccccccccceEeecHHHHHHhcCCCccchhhhh Confidence 44 433 69999976555443 12223445566644322 23467999999999999999986433322222 Q ss_pred HHHHhCCCceEEEEEecCCccceeEEEEEEeeeccCCcEEEEEEccEEEEEEeecCCCHHHHHHHHHHHHhcCCCceEEE Q lcl|Aclame:pro 77 AYRQTDPFGELYVIAVPEATGAAATVTLTVTGEATESGTVNVYVGRTRVQAPVTNGDNVTTIASSIQDAINAVPTLPFTA 156 (498) Q Consensus 77 a~~~~n~~~~l~~i~l~d~ag~aatg~ititgtat~~G~l~l~I~g~~v~v~V~~gdtaa~iA~~l~~aIn~~~~lpVtA 156 (498) .+.++ .-...+ .+.+..++. T Consensus 73 ~~~~~-~~~~~~-------------------------------------vv~v~~~~~---------------------- 92 (390) T protein:vir:79 73 AIGKQ-TKPLTV-------------------------------------VVRVAEGKD---------------------- 92 (390) T ss_pred hhccc-ccceEE-------------------------------------EEeeccccc---------------------- Confidence 22111 000111 111111110 Q ss_pred eeccceEEEeeccCcccccceeEEEEecccCcccccccceeeeecccCCCcCcchhhhHHHhhccCcceEEEecCCC-hH Q lcl|Aclame:pro 157 SSSAGVVTLTARHKGLCGNEIPVSLNYYGFGGGEVLPAGVQIAVATGTAGTGAPVLTGAVAAMADEPFDYIGLPFND-TA 235 (498) Q Consensus 157 ~~~~~~VtlTAk~kG~~gN~i~l~~~~~~~~~ge~~p~Glt~tit~~agGag~pD~~~alaalg~~~~~~I~~p~tD-~a 235 (498) +... ...+.....+....+|-.-+...-.. -....++++.|..+ .. T Consensus 93 -----------------~~~~---------------~~~~ig~~~~~~~~tgl~al~~~~~~-~~~~p~il~ap~~~~~~ 139 (390) T protein:vir:79 93 -----------------ADET---------------TSNVIGTVTPDGKYTGIKALLAAQGA-LGVKPRILAAPGLDTQP 139 (390) T ss_pred -----------------cccc---------------cceeeecccccccchhhhhhhhhhhh-hccccccccCCcccchH Confidence 0000 00000000000000000001111111 12233555554433 33 Q ss_pred HHHHHHHHHhhhhhhhhhhhheeeEEEEe--ccCCHHHHHhhhhccCcceEEEEecCC---------CCCCcHHHHHHHH Q lcl|Aclame:pro 236 SVNTLVTEMNDTSGRWSYARQLYGHVYTA--KTGTLSELVNAGDQFNQQHITLAGYEK---------ETQTPADELAASR 304 (498) Q Consensus 236 ~l~al~~~l~~~s~r~~~~~q~~g~~~~~--~~gt~~~~~t~g~~~N~~~~t~~~~~~---------~~~~p~~~~AAa~ 304 (498) ..+++..+.+ + +.++++.. ..-+..++.++-...|+.|..+.+... ....|+....|.+ T Consensus 140 v~~~l~~~a~----~------~~~~ai~D~p~~~t~~~a~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~~Ag~ 209 (390) T protein:vir:79 140 VAAALAATAQ----S------LRAMAYVSASGCKTKEEAAAYRRQFGQREIMVIWPDWLGWDDTTNSTAVIPAPAIAAGL 209 (390) T ss_pred HHHHHHHhhh----h------cceEEEEEccCCCCHHHHHHHhcCCCCceEEEEcCceeecccccCceeEeehHHHHHHH Confidence 3444444432 1 22333333 233677888888888998887764211 1112432322222 Q ss_pred HHHhhhhhccCcc-ccccceEEeccccCCC----ccccChHHHHHHHhCCeeEEEEcCCeEEEEeeeeeeeecCCCCCCc Q lcl|Aclame:pro 305 TARAAVFIRNDPA-RPTQTGELVGMLPAPK----GKRFTMTEQQTLLSHGVATAYVESGVLRIQRDVTTYRKNAYGVADN 379 (498) Q Consensus 305 ~a~~a~~l~~DPA-rpl~tl~L~Gl~~p~~----~~r~~~~er~~lL~~Gist~~v~~G~v~IeR~ITTY~~n~~G~~D~ 379 (498) .|+... +..|. .|- ...|.|+..+.. .......|.+.|..+||.|+.-+.| .++--..|. ..|+ T Consensus 210 ~a~~D~--~~g~~~sps-N~~i~gi~~~~~~~~~~~~~~~~~a~~Ln~~gi~t~~~~~G-~~~wG~rT~-------~~d~ 278 (390) T protein:vir:79 210 RAKIDN--DIGWHKTIS-NVVVNGVSGISADVSWDLQDPATDAGYLNEHEVTTLVNRNG-FRFWGERTC-------SDDP 278 (390) T ss_pred HHhhhc--cCCcEEccC-CceeeccceeeeeccccccccchhhhhhhhcCcEEEEcCCC-EEEEecccc-------CCCc Confidence 222221 11131 122 234556644332 2344566778899999999865667 444444443 2478 Q ss_pred hhhhhhhHHHHHHHHHHHHHHHhhhcCCceeccCCCCcCCCcccccHHHHHHHHHHHHHHHhhcccccchhhhcCeEEEE Q lcl|Aclame:pro 380 SYLDSETLHTSAYVLRKLKSVITSKYGRHKLASDGTRFGPGQAIVTPAVIKGELLATYRQLERAGIVENYELFKQYLVVE 459 (498) Q Consensus 380 s~ldi~t~~tl~yv~~~~r~~~~~~~~r~kla~dg~~~~~g~~ivTp~~ikaeli~~~~~le~~given~~~~~~~lvVe 459 (498) .|+.|.+.|..+|+.+.++.... .|-.+++..+ |-+.|+..+=..++.|..+|.+..++ +.+. T Consensus 279 ~~~~i~vrR~~~~i~~~i~~~~~-~~v~e~~~~~-----------~~~~i~~~i~~~L~~l~~~gal~g~~-----v~~d 341 (390) T protein:vir:79 279 KFAFENYTRTAQVAADSIAEAQM-PVVDGPLNPS-----------LARDIVESINGWFRQQVANGYLIGGS-----AWID 341 (390) T ss_pred ccceeeehhhHHHHHHHHHHHHH-HhccCCCCHH-----------HHHHHHHHHHHHHHHHHhCCceeeeE-----EEEe Confidence 99999999999999999999775 4554444332 45788999999999999999999864 3332 Q ss_pred EcCC-----CCeEEEEEeeeEEecCeEEEeeeeeeEEEecccCC Q lcl|Aclame:pro 460 RDAS-----VPNRLNTLFPPDYVNQLRVFAVVNQFRLQYSEESA 498 (498) Q Consensus 460 rd~~-----d~nRvn~~~p~~~vn~l~v~A~~~~f~lq~~~~~~ 498 (498) .+.+ +.+|+.+.+-...... +..|.|+++|+.+-. T Consensus 342 ~~~nt~~~i~~G~~~~~i~~~p~~p----~e~i~~~~~~~~~~~ 381 (390) T protein:vir:79 342 PEPNTADILASGKAYIDYDYTPVPP----LENLVLRQRITDRFL 381 (390) T ss_pred cCCCCHHHhhCCEEEEEEEEEecCC----cceEEEEEEEchHHH Confidence 2211 1245555554444444 346777777766553 No 50 >protein:vir:107720 Length: 515 # NCBI annotation: gp17 # Family: family:all:396 # MgeID: mge:1520 # MgeName: BcepB1A # Cross-refs: genbank:acc:YP_024865;genbank:gi:48697507;genbank:GeneID:2948336 Probab=99.31 E-value=1.4e-10 Score=74.58 Aligned_cols=450 Identities=11% Similarity=-0.010 Sum_probs=223.7 Q ss_pred CccchhhcCcccccCeEEEEEecCC-C--CCCCCCccEEEEEecCCCCccccceeEEecChHHHHHhhCcCcHHHHHHHH Q lcl|Aclame:pro 1 MTISFNTIPSNTLVPLFYAEMDNQA-A--NTAQDSGASLLIGHANNGAEIVANSLVLMPSADYARQICGAGSQLARMVEA 77 (498) Q Consensus 1 M~i~f~~Ip~~~rvPg~y~E~dns~-a--~~~~~~~~vLliGq~~~~g~~~~~~~~~v~s~~~A~~~fG~GS~l~~M~~a 77 (498) |+|+. -+|+-|+.+- + +.+....-.||++. ....+.+......|.++..+.||..|.-..|++. T Consensus 1 m~I~~----------~~~V~i~~~v~aa~~~~~~~f~~li~t~---~~~~p~~r~~~y~s~~~V~~~FG~~S~ey~aA~~ 67 (515) T protein:vir:10 1 MPISF----------DKYVAITSGVAAQQQIAARSFAIRVYTP---NPMVSVDRLITATSAADVGAYFGTASEEYKRAVK 67 (515) T ss_pred CCCCc----------eeEEEeecccccCCccccccceeeeeec---ccCCCccceeeecCHHHHHHhcCCChHHHHHHHH Confidence 77764 2466776532 2 33333333666643 3344556666777889999999999999999999 Q ss_pred HHH----hCCC-ceEEEEEecCCcc-ceeEE-EEE---Ee-eeccCCcEEEEEEccEEE-EE---EeecCCCHHHHHHHH Q lcl|Aclame:pro 78 YRQ----TDPF-GELYVIAVPEATG-AAATV-TLT---VT-GEATESGTVNVYVGRTRV-QA---PVTNGDNVTTIASSI 142 (498) Q Consensus 78 ~~~----~n~~-~~l~~i~l~d~ag-~aatg-~it---it-gtat~~G~l~l~I~g~~v-~v---~V~~gdtaa~iA~~l 142 (498) |+. .-|. ..|++-.-...+. ....| .+. ++ -.+-.+|+++|.|+|..+ .+ ......+.+++|+.| T Consensus 68 yFsg~~~q~p~P~~L~igR~~~~a~~~~l~g~~~~~~~l~~~~~is~G~ltitidG~~~~t~s~i~~S~ats~~~vAs~i 147 (515) T protein:vir:10 68 NFGFISKKTRRPTSIQFARWQREAGPVAIYGGAKKAAALATLQAVTAGAISFLFGGATTVTVSGISFSAATSLADVASEL 147 (515) T ss_pred HhhhccCCcccccEEEEEeccCcccceEEEeccchhhhHHhhhcccceeEEEEEcceEEEEeeccccccccCHHHHHHHH Confidence 996 4444 5777655433211 11111 110 10 012247999999999764 33 335667788999999 Q ss_pred HHHHhcCCC-----ceEEEeeccceEEEeeccCcccccceeEEEEecccCcccccccceeeee---cccCCCcCcchhhh Q lcl|Aclame:pro 143 QDAINAVPT-----LPFTASSSAGVVTLTARHKGLCGNEIPVSLNYYGFGGGEVLPAGVQIAV---ATGTAGTGAPVLTG 214 (498) Q Consensus 143 ~~aIn~~~~-----lpVtA~~~~~~VtlTAk~kG~~gN~i~l~~~~~~~~~ge~~p~Glt~ti---t~~agGag~pD~~~ 214 (498) ..+|.+..+ .+|+-...++..++++.-.|... .|.+.......+ |.-+.+.|-++- .-..+|+....+.+ T Consensus 148 ~tal~~~~~~~~~~~tv~~d~~~~~F~v~s~~tG~~~-~is~~~~t~~~~-~t~~a~~lglt~~~~av~~~g~aaet~~~ 225 (515) T protein:vir:10 148 QTALRANADANLATCTVSYDPVGARFNFAGSPSDDTV-QESISIVPQSNP-AIDVAQLLGWNSAQGASYIAASPVVSPVD 225 (515) T ss_pred HhhhccccccccceeEEEEecCCCeEEEEEeecCCce-eEEEEEecCCCc-hhhHHHHhccccccceEEecccccccHHH Confidence 999987654 35555667788888888776432 233322211111 111122222221 12234444455777 Q ss_pred HHHhhc---cCcceEEEecCCC----hHHHHHHHHHHhhhhhhhhhhhheeeEEEEeccCCHH-HHHhhhhccCcceEEE Q lcl|Aclame:pro 215 AVAAMA---DEPFDYIGLPFND----TASVNTLVTEMNDTSGRWSYARQLYGHVYTAKTGTLS-ELVNAGDQFNQQHITL 286 (498) Q Consensus 215 alaalg---~~~~~~I~~p~tD----~a~l~al~~~l~~~s~r~~~~~q~~g~~~~~~~gt~~-~~~t~g~~~N~~~~t~ 286 (498) +++++. ..||-+.+..--| .+...++..+.+. .+. +.++++... ..+.. .-..+....+ ...+. T Consensus 226 a~~a~~~~s~nWy~f~~a~~~~~~~~~a~~~a~a~~~e~----~~~-~~~~~~~~~--~~~~~~~~a~~~~~~~-~~~~~ 297 (515) T protein:vir:10 226 TLIASVAGNNNFGSILFTKNGGTGITLSDAEAIALQNQS----YNV-AYKFQVGVD--DTTYSSWQAALAAIGG-VNMIY 297 (515) T ss_pred HHHHHHhccCCeEEEEEeecCccccchhHHHHHHHHHhh----cCc-eEEEEeccC--ccceechhhhhhhhhh-cCceE Confidence 777776 4566554432111 2233333333321 111 112222111 11111 1111111111 12122 Q ss_pred EecCCCCCCcHHHHHHHHHHHhhhhhccCccccccceEEeccccCCC-ccccChHHHHHHHhCCeeEEEE--cCC-e-EE Q lcl|Aclame:pro 287 AGYEKETQTPADELAASRTARAAVFIRNDPARPTQTGELVGMLPAPK-GKRFTMTEQQTLLSHGVATAYV--ESG-V-LR 361 (498) Q Consensus 287 ~~~~~~~~~p~~~~AAa~~a~~a~~l~~DPArpl~tl~L~Gl~~p~~-~~r~~~~er~~lL~~Gist~~v--~~G-~-v~ 361 (498) ..+.... + + .+++.++.++ ..|..++.-+..++.-..|.. .+-++.+|.+.|..+|+..+.- ..| . -. T Consensus 298 ~~~~~~~--~-~-~~a~~~g~~a---svnf~~~ng~iT~kfKq~~Gita~~lt~t~a~al~~~~~N~Y~~~~~~~~~~~~ 370 (515) T protein:vir:10 298 SPVALAA--E-Y-HDMQDGIIEA---ATDFTQQGGATGYMYVQFNNQTPAVNDDTLSGILDDLNINYYGQTQVNGTNLSF 370 (515) T ss_pred EEEeccC--c-c-hHHHHHHHHH---hcCCCccchhheeccccCCCCccccCCHHHHHHHHhcCCeEEEEEeccCceEEE Confidence 2211111 1 2 2344455554 566655544333332222332 3458999999999999999843 222 3 34 Q ss_pred EEeeeeeeeecCCCCCCchhhhhhhHHHHHHHHHHHHHHHhhhcCCc-e--eccCCCCcCCCcccccHHHHHHHHH-HHH Q lcl|Aclame:pro 362 IQRDVTTYRKNAYGVADNSYLDSETLHTSAYVLRKLKSVITSKYGRH-K--LASDGTRFGPGQAIVTPAVIKGELL-ATY 437 (498) Q Consensus 362 IeR~ITTY~~n~~G~~D~s~ldi~t~~tl~yv~~~~r~~~~~~~~r~-k--la~dg~~~~~g~~ivTp~~ikaeli-~~~ 437 (498) +.+.+++ .|.-| |.+|..++=++++...++..+-.-|-.. | ..++|. .++++.+. +.+ T Consensus 371 ~~~G~~~-----gG~~~--~~WiD~~~g~~WL~~~iq~~l~~L~~s~~KIPytd~G~-----------a~i~a~v~q~vl 432 (515) T protein:vir:10 371 YQDGVMM-----GGPTD--PRDSNVYANEQWLKSYAGASFMSLQLAQGKIPANIEGR-----------GLLLGKMTKDII 432 (515) T ss_pred EeCCeee-----CCccc--hhHHHHHhhHHHHHHHHHHHHHHHHhcCCCCccChhhH-----------HHHHHHHHHHHH Confidence 5677776 22223 5566778899999999998886655322 2 222222 47888876 578 Q ss_pred HHHhhcccccc-hh---hhcCeEE----EEEcCCC-CeEEEEEeeeEEecCeEEEee--eeeeEEEecccCC Q lcl|Aclame:pro 438 RQLERAGIVEN-YE---LFKQYLV----VERDASV-PNRLNTLFPPDYVNQLRVFAV--VNQFRLQYSEESA 498 (498) Q Consensus 438 ~~le~~given-~~---~~~~~lv----Verd~~d-~nRvn~~~p~~~vn~l~v~A~--~~~f~lq~~~~~~ 498 (498) ++-...|+|.= +. ..|..+. +..-.+| -+|--++..+...-|-|.=.. .....|=|.+-.. T Consensus 433 ~~av~nG~I~~Gv~ls~~Q~~~i~~~~g~d~~~~~~~~~Gyy~~~~~~~~~~~~~r~~~~~~~~~~y~~g~~ 504 (515) T protein:vir:10 433 PAAKLNGTFSIGKTLTVDQQLFVTELTGDDTAWQKVQNLGYWYDVQISSFVDTGGTTKYQAVYSLVYSKDDL 504 (515) T ss_pred HHHHhCCeeecCcccchhHHHHHHhhhcCcccccchhhcceeEecCcCCCCCcccccccCceeEEEEEcCce Confidence 88777777653 21 0000000 0000000 122222222222223222111 1122333333333 No 51 >protein:vir:106427 Length: 679 # NCBI annotation: gp18 tail sheath protein # Family: family:all:661 # MgeID: mge:1474 # MgeName: Aeh1 # Cross-refs: genbank:acc:NP_944106;genbank:gi:38640150;genbank:GeneID:2658188 Probab=99.27 E-value=2.5e-10 Score=73.21 Aligned_cols=444 Identities=13% Similarity=0.076 Sum_probs=211.4 Q ss_pred cccccCeEEEE-EecCCCCCCCCCccEEEEEecCCCCccccceeEEecChHHHHHhhCc---CcHHHHHHHHHHHhCCCc Q lcl|Aclame:pro 10 SNTLVPLFYAE-MDNQAANTAQDSGASLLIGHANNGAEIVANSLVLMPSADYARQICGA---GSQLARMVEAYRQTDPFG 85 (498) Q Consensus 10 ~~~rvPg~y~E-~dns~a~~~~~~~~vLliGq~~~~g~~~~~~~~~v~s~~~A~~~fG~---GS~l~~M~~a~~~~n~~~ 85 (498) -+...||+|+| +|-+..=.........++|. ...++.++|++|+|..|-...||. .+-+..|+..|+. |--. T Consensus 1 ~~~~~Pgvyv~e~~~~~~i~~~~t~~~~~vg~---~~~gp~~~p~~i~s~~~~~~~fg~~~~~~~~~~~~~~~f~-~gg~ 76 (679) T protein:vir:10 1 MTLLSPGVETKEINLQTTIARSSTGRAALVGK---FNWGPAYQISQVVSEVDLVDKFGRPDDQTADSFFSGVNFL-NYGN 76 (679) T ss_pred CceecCceEEEeecCCcccccCccccceeeec---ccCCCCccCEEecCHHHHHHHcCCcccccchHHHHHHHHH-hCCC Confidence 55788999998 44333312222346677775 446788999999999999999996 5778888999985 5556 Q ss_pred eEEEEEecCCccc----eeEEEEEEe-------------------eeccCCcEE-EEEEccEEEEEEeecCC--CHHHHH Q lcl|Aclame:pro 86 ELYVIAVPEATGA----AATVTLTVT-------------------GEATESGTV-NVYVGRTRVQAPVTNGD--NVTTIA 139 (498) Q Consensus 86 ~l~~i~l~d~ag~----aatg~itit-------------------gtat~~G~l-~l~I~g~~v~v~V~~gd--taa~iA 139 (498) .+|++.+.+.... +..+.+.++ ++....+.. .+...+..+.+.+.... ..+..+ T Consensus 77 ~~~vvrv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~s~~~~~~~~~~~~~~~~~~~~v~~~~~~~~a~~~ 156 (679) T protein:vir:10 77 DLRLVRVLNETKSRNSSALYQSLSYTITSPGVDYKVGDVVNVLQGGNVIATGKVTVVNASGGIVAFYVPTAAIIDKAKSL 156 (679) T ss_pred eEEEEEccCcccccccccccccccccccccccccccccceeeeeCCCcccceeEEEeeccCceeeeeecccccccccccc Confidence 6999998653211 111111111 111111111 12222222222211100 000000 Q ss_pred HH---HH----------------------HHHhcCCCceEEEee---------------------ccceEEEeeccCccc Q lcl|Aclame:pro 140 SS---IQ----------------------DAINAVPTLPFTASS---------------------SAGVVTLTARHKGLC 173 (498) Q Consensus 140 ~~---l~----------------------~aIn~~~~lpVtA~~---------------------~~~~VtlTAk~kG~~ 173 (498) .. +. ..+.......+.... ..+...+.++..|.. T Consensus 157 ~~~~~l~~a~~~~~~~~t~~~g~~~~~~v~~v~~~~~~~~~~~~~a~~~i~~~~~~~~t~~~~~~~~~~~~~~A~~~g~~ 236 (679) T protein:vir:10 157 NDYPALDNAWQIQFAAGGPGAGQAATATVVGINLDSTIFVPNDEYAMSAISERSETKRTFIDICEEMKVPAIVARYAGTY 236 (679) T ss_pred cccceecccceeeeeeccccccceeeeeeeeeccCCceeeccccccccccccccccchhhhhhhhccccceeeeeccccc Confidence 00 00 000000000000000 000112233444444 Q ss_pred ccceeEEEEeccc-----Cc-----------------------------------------------------------c Q lcl|Aclame:pro 174 GNEIPVSLNYYGF-----GG-----------------------------------------------------------G 189 (498) Q Consensus 174 gN~i~l~~~~~~~-----~~-----------------------------------------------------------g 189 (498) ||.+.+....... .. + T Consensus 237 gn~i~v~~va~~~~~~~~~~~a~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vvv~~~g~~~~~~~~~~~~~ 316 (679) T protein:vir:10 237 GDNIKVLMIAYKDYYKFNEAGKIVSVNTINPKVFPTGLDYGNVTPSSYLEFGPQNESQFAFIVFNNGVAVESKILSTKPG 316 (679) T ss_pred CCcceEEEEeecccccccccccccccccccccccccccccccceeeeecccccccccceeeEEecccccccceeeecccc Confidence 4444432211100 00 0 Q ss_pred ccccccee--------------------------eeecccCCCcC-cc-----hhhhHHHhhc---cCcceEEEecCCCh Q lcl|Aclame:pro 190 EVLPAGVQ--------------------------IAVATGTAGTG-AP-----VLTGAVAAMA---DEPFDYIGLPFNDT 234 (498) Q Consensus 190 e~~p~Glt--------------------------~tit~~agGag-~p-----D~~~alaalg---~~~~~~I~~p~tD~ 234 (498) +..+.+.. ......+||.. ++ ++..++..+. ...++++++|-... T Consensus 317 ~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~gg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~p~~~~ 396 (679) T protein:vir:10 317 DRDIYGTSIYINEYFGNGYSSFVQGVAESWPVGYTGVLAFGGGQSSNTDISAAEFMKGWDMFADREHTDVNLFIAGAVAG 396 (679) T ss_pred cccccchhhhhhhhhcCcccceeeeccccccccccceeeccCCccCCCccchhhhhhhhhhhhcccccccceEEecCCCC Confidence 00000000 00011123321 12 2233333333 23568888764321 Q ss_pred -------HHHHHHHHHHhhhhhhhhhhhheeeEEEEe----------ccCCHHHHHhhhhc-------------cCcceE Q lcl|Aclame:pro 235 -------ASVNTLVTEMNDTSGRWSYARQLYGHVYTA----------KTGTLSELVNAGDQ-------------FNQQHI 284 (498) Q Consensus 235 -------a~l~al~~~l~~~s~r~~~~~q~~g~~~~~----------~~gt~~~~~t~g~~-------------~N~~~~ 284 (498) +-..++..|.+. | ++..++.. ...+..++..+-.. ++|.+. T Consensus 397 ~~~~~~~~v~~~l~~~~~~---~------~~~~ai~d~p~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~s~~~ 467 (679) T protein:vir:10 397 EGAQIASTVQKAVVAIADE---R------RDCLVLISPPREYMINQPAASVVRKLVDWRRGVNQAGISLDDNMNIGTTYA 467 (679) T ss_pred CchhhhHHHHHHHHHHHHh---h------CCeEEEEeccccccccccccchHHHHHHHHhhcccccchhhhhhccCcceE Confidence 123334444432 1 11222211 12334455554432 345555 Q ss_pred EEEec--------CCC-CCCcHHHHHHHHHHHhhhh-hccCcc-ccccceEEeccc-cCCCccccChHHHHHHHhCCeeE Q lcl|Aclame:pro 285 TLAGY--------EKE-TQTPADELAASRTARAAVF-IRNDPA-RPTQTGELVGML-PAPKGKRFTMTEQQTLLSHGVAT 352 (498) Q Consensus 285 t~~~~--------~~~-~~~p~~~~AAa~~a~~a~~-l~~DPA-rpl~tl~L~Gl~-~p~~~~r~~~~er~~lL~~Gist 352 (498) .+.+. .+. ...|+. +.+|+..|.- .+..|- .|-+ ..+.|+. +-+..-.++..|++.|..+||.+ T Consensus 468 ~~~~p~~~~~d~~~~~~~~~p~s---g~vAGl~Ar~D~~~g~~~sPan-~~~~~i~g~~~~~~~~~~~~~~~Ln~~gin~ 543 (679) T protein:vir:10 468 SVDGNYKYQYDKYNDVNRWIPLA---ADIAGLCARTDTVGQPWQSPAG-FNRGQIVNVIKLAVDTRQAHRDEMYTNGINP 543 (679) T ss_pred EEEccceeeecccCCceEEechH---HHHHHHHHHhhccCCcEECcCC-eeeccccccccceeecChhhHHhhhhCCceE Confidence 44321 111 113443 3344444421 112221 1322 3333332 22345578899999999999999 Q ss_pred EEEc--CCeEEEEeeeeeeeecCCCCCCchhhhhhhHHHHHHHHHHHHHHHhhhcCCceeccCCCCcCCCcccccHHHHH Q lcl|Aclame:pro 353 AYVE--SGVLRIQRDVTTYRKNAYGVADNSYLDSETLHTSAYVLRKLKSVITSKYGRHKLASDGTRFGPGQAIVTPAVIK 430 (498) Q Consensus 353 ~~v~--~G~v~IeR~ITTY~~n~~G~~D~s~ldi~t~~tl~yv~~~~r~~~~~~~~r~kla~dg~~~~~g~~ivTp~~ik 430 (498) +..- .| .++--.-|. .+ .|..|+.|.+.|+.+|+.+.++.... .|-.+.+... +-..|| T Consensus 544 i~~~~g~G-~~~wG~rT~-----~~-~~s~~~~i~vrR~~~~i~~si~~~~~-~~v~epn~~~-----------~~~~i~ 604 (679) T protein:vir:10 544 IVGFAGQG-YILYGDKTA-----SQ-APTPFDRINVRRLFNLLKKSISESAK-YKLFELNDAF-----------TRSSFR 604 (679) T ss_pred EEEecCCe-EEEEccccc-----CC-CCcccceEehhhHHHHHHHHHHHHHH-HhccCCCCHH-----------HHHHHH Confidence 8753 35 445444443 11 12359999999999999999998664 4554443222 457899 Q ss_pred HHHHHHHHHHhhcccccchhhhcCeEEEEEcCC-----CCeEEEEEeeeEEecCeEEEeeeeeeEEEecccCC Q lcl|Aclame:pro 431 GELLATYRQLERAGIVENYELFKQYLVVERDAS-----VPNRLNTLFPPDYVNQLRVFAVVNQFRLQYSEESA 498 (498) Q Consensus 431 aeli~~~~~le~~given~~~~~~~lvVerd~~-----d~nRvn~~~p~~~vn~l~v~A~~~~f~lq~~~~~~ 498 (498) ..+-+.+++|.++|.+..| .+.|.++.+ +.+|+.+.+-...+..++-| .|++|....-+ T Consensus 605 ~~i~~fL~~l~~~gal~gf-----~v~~d~~~nt~~~i~~G~~~~~i~~~p~~pae~i----~~~~~~~~~~~ 668 (679) T protein:vir:10 605 SEVGSYLDTIRSLGGIYDF-----RVVCDESNNTPAVIDRNEFVATILIKPARSINYI----TLSFVATSTGA 668 (679) T ss_pred HHHHHHHHHHHhCCceeee-----EEEEcCCCCCHHHhhCCeEEEEEEEEecCCccEE----EEEEEEeecCc Confidence 9999999999999999984 244543322 44778888766666666554 33433222222 No 52 >protein:vir:1845 Length: 392 # NCBI annotation: tail sheath protein FI # Family: family:all:115 # MgeID: mge:324 # MgeName: 186 # Cross-refs: genbank:acc:NP_052269;genbank:gi:9634076;genbank:GeneID:1262448 Probab=99.26 E-value=1.3e-11 Score=80.28 Aligned_cols=357 Identities=16% Similarity=0.122 Sum_probs=190.3 Q ss_pred CccchhhcCcccccCeEEEEEecCCCCCC--CCCccEEEEEecCCC--CccccceeEEecChHHHHHhhCcCcHHHHHHH Q lcl|Aclame:pro 1 MTISFNTIPSNTLVPLFYAEMDNQAANTA--QDSGASLLIGHANNG--AEIVANSLVLMPSADYARQICGAGSQLARMVE 76 (498) Q Consensus 1 M~i~f~~Ip~~~rvPg~y~E~dns~a~~~--~~~~~vLliGq~~~~--g~~~~~~~~~v~s~~~A~~~fG~GS~l~~M~~ 76 (498) |+ . ..||+|+|-.++.+... .....+-++|..-.. ...+.++|+.+.|..+....||.++-+...++ T Consensus 1 m~-------~--~~~Gv~v~e~~~g~~~i~~~~tav~g~vgta~~~~~~~~~~~~p~~its~~~~~~~~g~~gtl~~al~ 71 (392) T protein:vir:18 1 MS-------D--FHHGTKVIEINDGTRVISTVATAIVGMVWTASDADAETFPLNEPVLITNVQSAIAKAGKKGTLSASLQ 71 (392) T ss_pred CC-------C--CCCCeEEEEcCCCceeeeccCcceeEEEEeccCCCCcccccccceEeechHHHHhhcCCCcchHHHHH Confidence 43 3 35999988766554211 223445566654322 23357899999999999999999887777766 Q ss_pred HHHHhCCCceEEEEEecCCccceeEEEEEEeeeccCCcEEEEEEccEEEEEEeecCCCHHHHHHHHHHHHhcCCCceEEE Q lcl|Aclame:pro 77 AYRQTDPFGELYVIAVPEATGAAATVTLTVTGEATESGTVNVYVGRTRVQAPVTNGDNVTTIASSIQDAINAVPTLPFTA 156 (498) Q Consensus 77 a~~~~n~~~~l~~i~l~d~ag~aatg~ititgtat~~G~l~l~I~g~~v~v~V~~gdtaa~iA~~l~~aIn~~~~lpVtA 156 (498) .+..+- -...+++.+.. +... ....++. T Consensus 72 ~~~~ng-g~~~~vv~v~~-------------------------------------~~~~--------------~~~~~t~ 99 (392) T protein:vir:18 72 AIADQS-KPVTVVVRVAE-------------------------------------GTGD--------------DAEAQTT 99 (392) T ss_pred Hhhccc-CceEEEecccc-------------------------------------cccc--------------cccccch Confidence 665321 11111111110 0000 0000000 Q ss_pred eeccceEEEeeccCcccccceeEEEEecccCcccccccceeeeecccCCCcCcchhhhHHHhhccCcceEEEecC-CChH Q lcl|Aclame:pro 157 SSSAGVVTLTARHKGLCGNEIPVSLNYYGFGGGEVLPAGVQIAVATGTAGTGAPVLTGAVAAMADEPFDYIGLPF-NDTA 235 (498) Q Consensus 157 ~~~~~~VtlTAk~kG~~gN~i~l~~~~~~~~~ge~~p~Glt~tit~~agGag~pD~~~alaalg~~~~~~I~~p~-tD~a 235 (498) ..--+.+..+++.+|.. -+..+.... ...++++++|. ++.. T Consensus 100 ~dliG~~~~~~~~tg~~-------------------------------------al~~~~~~~-~~~p~il~ap~~~~~~ 141 (392) T protein:vir:18 100 SNIIGGTDENGKYTGIK-------------------------------------ALLTAEAVT-GVKPRILGVPGLDTQE 141 (392) T ss_pred hhheecccccchhhhHH-------------------------------------HHHhhhhhh-ceeehhcccCccchHH Confidence 00000000011111100 000000000 11234444433 2333 Q ss_pred HHHHHHHHHhhhhhhhhhhhheeeEEEEeccCCHHHHHhhhhccCcceEEEEecCC---------CCCCcHHHHHHHHHH Q lcl|Aclame:pro 236 SVNTLVTEMNDTSGRWSYARQLYGHVYTAKTGTLSELVNAGDQFNQQHITLAGYEK---------ETQTPADELAASRTA 306 (498) Q Consensus 236 ~l~al~~~l~~~s~r~~~~~q~~g~~~~~~~gt~~~~~t~g~~~N~~~~t~~~~~~---------~~~~p~~~~AAa~~a 306 (498) ..++|.++.+. + ...+..-.....+..++.++-...+|.+..+.+... ....|+ ++.+|+ T Consensus 142 v~~~l~~~~~~----~----~~~~~~d~~~~~~~~~a~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~---s~~~AG 210 (392) T protein:vir:18 142 VATALASVCIS----L----RAFGYVSAWGCKTISEAMAYRENFSQRELMVIWPDFLAWDTTANATATAYA---TARALG 210 (392) T ss_pred HHHHHHHHHhh----c----CcEEEEecCCCCCHHHHHHHHhhccCceEEEEeCceeeecccCCceEEech---HHHHHH Confidence 34444444321 1 122222222333667777777777777776654211 111243 233333 Q ss_pred HhhhhhccCccc-----cccceEEeccccCCC----ccccChHHHHHHHhCCeeEEEEcCCeEEEEeeeeeeeecCCCCC Q lcl|Aclame:pro 307 RAAVFIRNDPAR-----PTQTGELVGMLPAPK----GKRFTMTEQQTLLSHGVATAYVESGVLRIQRDVTTYRKNAYGVA 377 (498) Q Consensus 307 ~~a~~l~~DPAr-----pl~tl~L~Gl~~p~~----~~r~~~~er~~lL~~Gist~~v~~G~v~IeR~ITTY~~n~~G~~ 377 (498) ..+ ..|..+ |-| ..|.|+..+.. ....+..|.+.|-.+||.++.-+.| .++--..|. .. T Consensus 211 ~~a---~~d~~~g~~~spaN-~~l~gi~~~~~~~~~~~~~~~~~~~~Ln~~gI~t~~~~~G-~~~wG~rT~-------~~ 278 (392) T protein:vir:18 211 LRA---YIDQTIGWHKTLSN-VGVQGVTGISASVFWDLQASGTDADLLNEAGVTTLVRKDG-FRFWGNRTC-------SD 278 (392) T ss_pred HHH---hhhccCCceEccCC-ceeeceeecceecccccCCCcchhhhhhhcCceEEEcCCC-EEEEccccc-------CC Confidence 333 233222 333 35666654433 2345678999999999999865677 556566664 24 Q ss_pred CchhhhhhhHHHHHHHHHHHHHHHhhhcCCceeccCCCCcCCCcccccHHHHHHHHHHHHHHHhhcccccchhhhcCeEE Q lcl|Aclame:pro 378 DNSYLDSETLHTSAYVLRKLKSVITSKYGRHKLASDGTRFGPGQAIVTPAVIKGELLATYRQLERAGIVENYELFKQYLV 457 (498) Q Consensus 378 D~s~ldi~t~~tl~yv~~~~r~~~~~~~~r~kla~dg~~~~~g~~ivTp~~ikaeli~~~~~le~~given~~~~~~~lv 457 (498) |+.|+.|.+.|+.+|+.+.++..+. .|-.+.+..+ +-+.||..+=+.+++|..+|.+..++.|-+. T Consensus 279 d~~~~~i~~rR~~~~i~~~i~~~~~-~~v~e~n~~~-----------~~~~i~~~i~~~L~~l~~~gal~g~~v~~d~-- 344 (392) T protein:vir:18 279 DPLFLFENYTRTAQVLADTMAEAHM-WAVDKPITAS-----------LIRDIVDGINAKFRELKSNGYIVDGECWFDE-- 344 (392) T ss_pred CcccceeehhhHHHHHHHHHHHHHH-HhccCCCCHH-----------HHHHHHHHHHHHHHHHHhcCcccceEEEEec-- Confidence 7889999999999999999999764 4554443332 4477889999999999999999997644322 Q ss_pred EEEcCC---CCeEEEEEeeeEEecCeEEEeeeeeeEEEecccCC Q lcl|Aclame:pro 458 VERDAS---VPNRLNTLFPPDYVNQLRVFAVVNQFRLQYSEESA 498 (498) Q Consensus 458 Verd~~---d~nRvn~~~p~~~vn~l~v~A~~~~f~lq~~~~~~ 498 (498) +.|++ +.+|+.+.+-......++ .|.|+++|+.+-- T Consensus 345 -~~nt~~~i~~G~~~~~v~~~p~~p~e----~I~~~~~~~~~~~ 383 (392) T protein:vir:18 345 -ESNDKETLKAGKLYIDYDYTPVPPLE----SLTLRQRITDKYL 383 (392) T ss_pred -CCCCHHHhhCCeEEEEEEEEecCCcc----eEEEEEEEchHHH Confidence 12221 236677777766666655 4677777765543 No 53 >protein:vir:104858 Length: 729 # NCBI annotation: T4-like tail sheath protein # Family: family:all:661 # MgeID: mge:1630 # MgeName: P-SSM2 # Cross-refs: genbank:acc:YP_214361;genbank:gi:61806001;genbank:GeneID:3294378 Probab=99.26 E-value=2.5e-10 Score=73.22 Aligned_cols=451 Identities=12% Similarity=0.068 Sum_probs=202.7 Q ss_pred cCcccccCeEEEEEecCCC-C-CCCCCccEEEEEecCCCCccccceeEEecChHHHHHhhCc-----CcHHHHHHHHHHH Q lcl|Aclame:pro 8 IPSNTLVPLFYAEMDNQAA-N-TAQDSGASLLIGHANNGAEIVANSLVLMPSADYARQICGA-----GSQLARMVEAYRQ 80 (498) Q Consensus 8 Ip~~~rvPg~y~E~dns~a-~-~~~~~~~vLliGq~~~~g~~~~~~~~~v~s~~~A~~~fG~-----GS~l~~M~~a~~~ 80 (498) .|.+...||+|+|--.+.. . .........+||. ...++.++|++|+|..|-...||. .+.+..+++.|+. T Consensus 1 m~~~~~~PgVyv~e~~~~~~~i~~v~ts~~~fvG~---~~~Gp~~~p~~i~s~~~~~~~fG~~~~~~~~~~~~~~~~~f~ 77 (729) T protein:vir:10 1 MPLNLASPGIVVREVDLTIGRVDPTSGSIGALVAP---FAKGPVNDPQLIESEEDLLQTFGQPYSTDKHYEYWMVASSYL 77 (729) T ss_pred CCccccCCceEEEEecCCCcccccccccceeEEec---cccCCCccCeEcCCHHHHHHHcCccccCCcchhHHHHHHHHH Confidence 6767899999999655543 1 2222345667874 456788999999999999999996 4566778888887 Q ss_pred hCCCceEEEEEecCCccceeEEE------------------------------------EEEee--eccCCcEEEEEEc- Q lcl|Aclame:pro 81 TDPFGELYVIAVPEATGAAATVT------------------------------------LTVTG--EATESGTVNVYVG- 121 (498) Q Consensus 81 ~n~~~~l~~i~l~d~ag~aatg~------------------------------------ititg--tat~~G~l~l~I~- 121 (498) +.-. .+|++.+.+....++++. +++.. +...+..+.+.|- T Consensus 78 ngg~-~~~vvRv~~~~~~~a~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~G~~gn~~~v~v~~ 156 (729) T protein:vir:10 78 AYGG-TMQVVRADDYNTQTGVGLKNAFVTGGVGVGATMLRITSNTHYNQLGYDENTISGVSVAAKNPGTWANGIKVAIID 156 (729) T ss_pred hCCc-eEEEEecCcccccccccccccccccccccccccccccccccccccccccCCCcceEEEEeccCccccceeeEEec Confidence 5555 799999876432222111 11110 0011111222220 Q ss_pred -c--EEEEE------------------------------------EeecCCCHHH---HHHH------HHHHH--hc--- Q lcl|Aclame:pro 122 -R--TRVQA------------------------------------PVTNGDNVTT---IASS------IQDAI--NA--- 148 (498) Q Consensus 122 -g--~~v~v------------------------------------~V~~gdtaa~---iA~~------l~~aI--n~--- 148 (498) + +.+.+ .....+.... +... +.... +. T Consensus 157 ~~~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~s~~~~~~~~~~~~~~~~~~~~ 236 (729) T protein:vir:10 157 GKADQILTVASGNTTAVGSAVTQSISKTIGTATGTTTIDGVLKGIVTGSTDTTLEVKVISHISAAGVETAVEYQQNGTYT 236 (729) T ss_pred ccCcceeeeeccccccceeeeeeeccccccccccceeeeeeecccccccccccccceecccccccccceeccccccceee Confidence 0 00000 0000000000 0000 00000 00 Q ss_pred ---CCCceEEEee---ccceE---------------------------------------------------EEeeccCc Q lcl|Aclame:pro 149 ---VPTLPFTASS---SAGVV---------------------------------------------------TLTARHKG 171 (498) Q Consensus 149 ---~~~lpVtA~~---~~~~V---------------------------------------------------tlTAk~kG 171 (498) ....+..+.. .+... .......| T Consensus 237 ~~~~~s~~~~a~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~~d~~~~~~~d~~~~~~~ 316 (729) T protein:vir:10 237 FDNSGSVNVIAAGSSGSGSAKSYTAQTDWFESQNIVLSNSTLEWDSIADAPGTSTYVSTRGGKNDEIHVLVIDDKGTITG 316 (729) T ss_pred ecccCccceeeeccccccccccceeeeccccccccccccccccccccccccccccccccccccccccceeeecccccccc Confidence 0000000000 00000 00000011 Q ss_pred ccccceeEEEEecccCccc-------------------------------------------------------ccccce Q lcl|Aclame:pro 172 LCGNEIPVSLNYYGFGGGE-------------------------------------------------------VLPAGV 196 (498) Q Consensus 172 ~~gN~i~l~~~~~~~~~ge-------------------------------------------------------~~p~Gl 196 (498) ..|+-+....+........ ..+.+. T Consensus 317 ~~g~vve~~~~~s~~~~~~~~~~~~~~~~~vi~~~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~ 396 (729) T protein:vir:10 317 NSGTILEKHLSLSKAKDAEYSVGSSSYWRDFLATNSKYIFGGGATSGITTTGYSVSSTNTLDTDSGWDQNAEGVNFGASG 396 (729) T ss_pred Ccccceeeeeeeeeccccccccccccccceeeccccceeeecccccccccccccccccceeccccccccccccccccccc Confidence 1111111100000000000 000000 Q ss_pred eeeecccCCCc----------------CcchhhhHHHhhccC---cceEEEec--C-CCh---HHHHHHHHHHhhhhhhh Q lcl|Aclame:pro 197 QIAVATGTAGT----------------GAPVLTGAVAAMADE---PFDYIGLP--F-NDT---ASVNTLVTEMNDTSGRW 251 (498) Q Consensus 197 t~tit~~agGa----------------g~pD~~~alaalg~~---~~~~I~~p--~-tD~---a~l~al~~~l~~~s~r~ 251 (498) . .....++|. +.+++.+++.++.+. .++.++++ + .+. ....++..|.+....+. T Consensus 397 ~-~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~g~~~l~~~~~~~~~~~~~~~~~~~~~~~~~v~~a~~~~~~~~~~~~ 475 (729) T protein:vir:10 397 V-ATLTLAGGTNYGDKTDLTTSGALSSGVDDIISGYTLFENTEEIEVDFILMGAAHHPKEQSQAVAEKVTAVAEARKDAV 475 (729) T ss_pred e-eEEEeecccccccccccccccccccchhHHHHHHHHhhcccccccceeeecCCCCCccchHHHHHHHHHHHHhcCCeE Confidence 0 001112221 223455677777653 23333332 1 221 22334444443211110 Q ss_pred hhh---h-hee-----eE-EEEeccCCHHHHHhhhhccCc-ceEEEEec--------CC-CCCCcHHHHHHHHHHHhhhh Q lcl|Aclame:pro 252 SYA---R-QLY-----GH-VYTAKTGTLSELVNAGDQFNQ-QHITLAGY--------EK-ETQTPADELAASRTARAAVF 311 (498) Q Consensus 252 ~~~---~-q~~-----g~-~~~~~~gt~~~~~t~g~~~N~-~~~t~~~~--------~~-~~~~p~~~~AAa~~a~~a~~ 311 (498) -.. + ... +. .......+..+...+....++ .+..+.+. .+ ....|+.. .+|+..| T Consensus 476 a~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~d~~~~~~~~~p~s~---~~aGl~a-- 550 (729) T protein:vir:10 476 AFISPYRQAFLNDTVSGTVTVSNIDQTTENVVGFYAPLSSSTYSVFDSGYKYMFDRFNNTFRYVPLNG---DIAGTCA-- 550 (729) T ss_pred EEecccccccccccccccccccccchhhHHHHHHHhhccCCceEEEEcCeeEEecccCCceEEechhH---HHHHHHH-- Confidence 000 0 000 00 011111223344444444433 33333210 11 11234433 3333433 Q ss_pred hccCccc-----cccceEEeccc-cCCCccccChHHHHHHHhCCeeEEEEc--CCeEEEEeeeeeeeecCCCCCCchhhh Q lcl|Aclame:pro 312 IRNDPAR-----PTQTGELVGML-PAPKGKRFTMTEQQTLLSHGVATAYVE--SGVLRIQRDVTTYRKNAYGVADNSYLD 383 (498) Q Consensus 312 l~~DPAr-----pl~tl~L~Gl~-~p~~~~r~~~~er~~lL~~Gist~~v~--~G~v~IeR~ITTY~~n~~G~~D~s~ld 383 (498) +.|..+ |-+ ..+.|+. +-.....++..|++.|..+||.++..- .| .++--.-|. -..|+.|+. T Consensus 551 -~~d~~~g~~~span-~~~~~i~g~~~~~~~~~~~~~~~Ln~~gIn~i~~~~~~G-~~~wG~rT~------~~~d~~~~~ 621 (729) T protein:vir:10 551 -RTDIEQFPWFSPAG-TARGPILNSVKLVYNPGKKQRDILYSNRINPVILSPGAG-IILFGDKTG------FGKSSAFDR 621 (729) T ss_pred -HhhccCCcEEccCC-ccccceecccceeeecChhhHhhhhhCCceEEEEecCCe-EEEEcceec------CCCCcccce Confidence 233222 222 2233332 223345678899999999999998653 45 444444442 135788999 Q ss_pred hhhHHHHHHHHHHHHHHHhhhcCCceeccCCCCcCCCcccccHHHHHHHHHHHHHHHhhcccccchhhhcCeEEEEE--c Q lcl|Aclame:pro 384 SETLHTSAYVLRKLKSVITSKYGRHKLASDGTRFGPGQAIVTPAVIKGELLATYRQLERAGIVENYELFKQYLVVER--D 461 (498) Q Consensus 384 i~t~~tl~yv~~~~r~~~~~~~~r~kla~dg~~~~~g~~ivTp~~ikaeli~~~~~le~~given~~~~~~~lvVer--d 461 (498) |.+.|+.+|+.+.++..+. .|-.+++... +-+.||..+-+.+++|..+|.+..|. +.+.. | T Consensus 622 i~vrR~~~~i~~si~~~~~-~~v~epn~~~-----------~~~~i~~~i~~~L~~l~~~g~l~g~~-----v~~d~~~n 684 (729) T protein:vir:10 622 INVRRLFIYLEDAISAAAK-DQLFEFNDEL-----------TRTNFVNIVEPFLRDVQAKRGIFDFV-----VICDETNN 684 (729) T ss_pred eehhhhHHHHHHHHHHHHH-HhhcCCCCHH-----------HHHHHHHHHHHHHHHHHhccceeeeE-----EEEcCCCC Confidence 9999999999999999774 4554443222 44788999999999999999998852 23322 2 Q ss_pred CC---CCeEEEEEeeeEEecCeEEEeeeeeeEEEecccCC Q lcl|Aclame:pro 462 AS---VPNRLNTLFPPDYVNQLRVFAVVNQFRLQYSEESA 498 (498) Q Consensus 462 ~~---d~nRvn~~~p~~~vn~l~v~A~~~~f~lq~~~~~~ 498 (498) ++ +.+|+.+.+-......++-| .|++|-....+ T Consensus 685 t~~~i~~G~~~~~v~~~p~~p~e~i----~~~~~~~~~~~ 720 (729) T protein:vir:10 685 TAAVIDSNEFVADIFIKPARSINFI----GLTFVATRTGV 720 (729) T ss_pred CHHHhhCCeEEEEEEEEecCCccEE----EEEEEEeecCc Confidence 22 23567777666666655433 34444333333 No 54 >protein:vir:104477 Length: 749 # NCBI annotation: gp18 # Family: family:all:661 # MgeID: mge:1548 # MgeName: P-SSM4 # Cross-refs: genbank:acc:YP_214663;genbank:gi:61806304;genbank:GeneID:3294532 Probab=99.26 E-value=2.8e-10 Score=72.96 Aligned_cols=449 Identities=12% Similarity=0.101 Sum_probs=206.9 Q ss_pred cCcccccCeEEEEEecCCCC-CCCCCccEEEEEecCCCCccccceeEEecChHHHHHhhCc---CcHHHHHHHHHHHhCC Q lcl|Aclame:pro 8 IPSNTLVPLFYAEMDNQAAN-TAQDSGASLLIGHANNGAEIVANSLVLMPSADYARQICGA---GSQLARMVEAYRQTDP 83 (498) Q Consensus 8 Ip~~~rvPg~y~E~dns~a~-~~~~~~~vLliGq~~~~g~~~~~~~~~v~s~~~A~~~fG~---GS~l~~M~~a~~~~n~ 83 (498) .|.+..-||+|+|--.+... .....-...++|. ...++.++|++|+|..|-...||. ++-+..|++.|+.+.- T Consensus 1 M~~~~~~PgVyv~e~~~~~~~~~~~t~~~~fvG~---~~~Gp~~~p~~v~s~~~~~~~fG~~~~~~~~~~~v~~~F~ngg 77 (749) T protein:vir:10 1 MATNQSSPGVVIQERDLTTVSTIPTANVGVIAAP---FTKGPVEEVIEITSERQLAEKFGEPNESNYEYWFSAAQFLSYG 77 (749) T ss_pred CCccccCCeeEEEEecCCcccccccCceeEEEec---cCCCCCccCEEcCCHHHHHHHcCCccCCcccHHHHHHHHhhcC Confidence 55566889999985333331 1122356777774 456788999999999999999997 6778889999996554 Q ss_pred CceEEEEEecCCccceeE----------------------EEEEEe--eeccCCcEEEEEEc--cEE------------- Q lcl|Aclame:pro 84 FGELYVIAVPEATGAAAT----------------------VTLTVT--GEATESGTVNVYVG--RTR------------- 124 (498) Q Consensus 84 ~~~l~~i~l~d~ag~aat----------------------g~itit--gtat~~G~l~l~I~--g~~------------- 124 (498) ..+|++.+.......++ ..+++. .+-+.+..|.+.|. +.. T Consensus 78 -~~~~vvRv~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~pG~~gn~l~v~v~~~~~~~~~~~~~~~~~~~ 156 (749) T protein:vir:10 78 -GLLKTIRVNSSSLKNAVDTGTAPLVKNLQDYETSIEDASNNFSWVARTPGDTGNSIGIFVTDAGADQVVVVPAPGSGNE 156 (749) T ss_pred -CeEEEEEccCccccccccccccccccccccccccccccccceEEEeccCCCcCCceEEEEEcCCCceeeeeecCCccce Confidence 47999998542111100 011111 11222333333331 000 Q ss_pred -------------------------EEE-------------EeecCCCHHH----------------------------- Q lcl|Aclame:pro 125 -------------------------VQA-------------PVTNGDNVTT----------------------------- 137 (498) Q Consensus 125 -------------------------v~v-------------~V~~gdtaa~----------------------------- 137 (498) +.+ ++..+++... T Consensus 157 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~a~~~~~~~~~~~~~~~~~~s~~~~~~~a~~ 236 (749) T protein:vir:10 157 HEFVADAAVSAASGAAGKVFKYSIILTIDDVVGTFAPGSATTITIGGSAESVNVLAYDATNKKLEIGLPSGGVTGILADN 236 (749) T ss_pred eeEEeeecccccccccccccccceeeeeccccceeecccceeeeccCcccccccccccCCcceEEEeeecccccceeeee Confidence 000 0000000000 Q ss_pred ---------------HHHHHHHHHhcCCCceEEEe-----eccceEEEeec--------------c-------------- Q lcl|Aclame:pro 138 ---------------IASSIQDAINAVPTLPFTAS-----SSAGVVTLTAR--------------H-------------- 169 (498) Q Consensus 138 ---------------iA~~l~~aIn~~~~lpVtA~-----~~~~~VtlTAk--------------~-------------- 169 (498) +........++.. ..+++. ..+..++++.+ + T Consensus 237 ~~v~~~~~~~~~~~~i~~~~~~~~~~~~-~~~a~~~~~~~~~g~~~~it~v~~~~~~~~~~t~~~~~~~a~~~gt~~~~~ 315 (749) T protein:vir:10 237 QVITQGTNTAKINVTIERKLLVALNKSS-IEFAASDVVQDTNSTNITITSVRDEYTEREYLPGVKWINVAPRPGTSLYAN 315 (749) T ss_pred ecccccccccccccccccchhhhhcccc-ceeeccccccCCccceeEEEeeeccccccccccceeeccccccccceeeee Confidence 0000000001000 000000 00011111111 0 Q ss_pred -CcccccceeEEEEe------------------cccCcccc----cc--------------------------------- Q lcl|Aclame:pro 170 -KGLCGNEIPVSLNY------------------YGFGGGEV----LP--------------------------------- 193 (498) Q Consensus 170 -kG~~gN~i~l~~~~------------------~~~~~ge~----~p--------------------------------- 193 (498) +|-..+.+.+.+.- ........ ++ T Consensus 316 ~~~g~~D~~~v~v~~~~g~~~~~~g~v~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~v~~~~~~~~~~~~~~~~~~~ 395 (749) T protein:vir:10 316 GVGGHRDEMHVILVDIDGGVTGTVGALLERYIDVSKASDAKTSVGETNYYAEVIKQKSEFIYWAEHESTLYAATSSASDG 395 (749) T ss_pred cccCCCCceEEEEecCCCeeeecccceeeeeeeccccccccccccccchhhhhhccCCCEEEEEeccccccccccccccc Confidence 00011111111100 00000000 00 Q ss_pred ------------------------ccee-------ee-ecccCCCc-----------CcchhhhHHHhhcc---CcceEE Q lcl|Aclame:pro 194 ------------------------AGVQ-------IA-VATGTAGT-----------GAPVLTGAVAAMAD---EPFDYI 227 (498) Q Consensus 194 ------------------------~Glt-------~t-it~~agGa-----------g~pD~~~alaalg~---~~~~~I 227 (498) .+.. .. +....||. ...|+..+++++.+ ..++++ T Consensus 396 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~d~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~l 475 (749) T protein:vir:10 396 LFGQTAANRQFNLFRSAAGSVDYPAGVTTLGSKNNATYYYRLSGGVNYTVSAGQYTITNTDIGSAYELIGDPESQIVDFI 475 (749) T ss_pred ccccccccceeeccccccccceeccccccccccCCcEEEEEccCCcccccccccccccchhHHHHHHHhhhhhhcccceE Confidence 0000 00 00011111 12356666666654 346666 Q ss_pred Ee--c-CCChHHHHHHHHHHhhhhhhhhhhhheeeEEEEe-ccCC----------HHHHHhhhh-ccCcceEEEEec--- Q lcl|Aclame:pro 228 GL--P-FNDTASVNTLVTEMNDTSGRWSYARQLYGHVYTA-KTGT----------LSELVNAGD-QFNQQHITLAGY--- 289 (498) Q Consensus 228 ~~--p-~tD~a~l~al~~~l~~~s~r~~~~~q~~g~~~~~-~~gt----------~~~~~t~g~-~~N~~~~t~~~~--- 289 (498) ++ | ++|......+.+.+....+| +..+++.. .+++ ..+...+-. ..++.+..+.+. T Consensus 476 i~~~~~~~~~~~~~v~~al~~~~~~~------~~~~~~~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~ 549 (749) T protein:vir:10 476 ISGPSGTSDANALAKITSLVNIAEER------RDCMVFVSPRRGNVIGISNTTTITTNIVDFFKKLPSSSYMVFDSGYKY 549 (749) T ss_pred EEecCCCCcchhHHHHHHHHHHHhhc------CCEEEEEcCCCCcccccccchhhhhHHHHHHhhccCceeEEEEcccee Confidence 54 2 33333333333333222222 22233322 2221 112222222 234444444321 Q ss_pred -----CCC-CCCcHHHHHHHHHHHhhhhhccCcc-cccc-c-eEEeccccCCCccccChHHHHHHHhCCeeEEEEcCCe- Q lcl|Aclame:pro 290 -----EKE-TQTPADELAASRTARAAVFIRNDPA-RPTQ-T-GELVGMLPAPKGKRFTMTEQQTLLSHGVATAYVESGV- 359 (498) Q Consensus 290 -----~~~-~~~p~~~~AAa~~a~~a~~l~~DPA-rpl~-t-l~L~Gl~~p~~~~r~~~~er~~lL~~Gist~~v~~G~- 359 (498) ++. ...|+.-..|.+.++.-. +..|- .|-+ . ..|.|+. +.+.+++..|++.|..+||.++..-.|+ T Consensus 550 ~~d~~~~~~~~~p~s~~vAGl~Ar~D~--~~g~~~SPan~~~~~i~g~~--~~~~~~~~~e~~~Ln~~gIn~i~~~~g~G 625 (749) T protein:vir:10 550 IYDKYNDVYRYIPCNGDTAGLCLQTNE--ISEPWFSPAGFQRGVLRNAI--KLAYTPNKAQRDQLYANRVNPIVSFPGQG 625 (749) T ss_pred eeccccCceEEechHHHHHHHHHHhhc--cCCcEECcCCceeeeeeccc--cceeecChhHHHhhhhCCceEEEEecCCe Confidence 110 113543333333333321 11221 1222 1 2355553 4456788999999999999998764332 Q ss_pred EEEEeeeeeeeecCCCCCCchhhhhhhHHHHHHHHHHHHHHHhhhcCCceeccCCCCcCCCcccccHHHHHHHHHHHHHH Q lcl|Aclame:pro 360 LRIQRDVTTYRKNAYGVADNSYLDSETLHTSAYVLRKLKSVITSKYGRHKLASDGTRFGPGQAIVTPAVIKGELLATYRQ 439 (498) Q Consensus 360 v~IeR~ITTY~~n~~G~~D~s~ldi~t~~tl~yv~~~~r~~~~~~~~r~kla~dg~~~~~g~~ivTp~~ikaeli~~~~~ 439 (498) .++--.-|.- ..|+.|+.|.+.|+.+|+.+.++.... .|-.+.+... +-+.||..+-..+++ T Consensus 626 ~~~wG~rT~~------s~d~~~~~i~vRRl~~~ie~si~~~~~-~~v~epn~~~-----------l~~~i~~~i~~fL~~ 687 (749) T protein:vir:10 626 VVLYGDKTAL------GFASAFDRINIRRLFLTVERVISTAAK-AQLFEQNDEA-----------QRSLFINIVEPYLRD 687 (749) T ss_pred EEEEcceecC------CCCcccceeehhhhHHHHHHHHHHHHH-HhhcCCCCHH-----------HHHHHHHHHHHHHHH Confidence 5555555531 246789999999999999999998664 4544433221 447789999999999 Q ss_pred HhhcccccchhhhcCeEEEEEc--CC---CCeEEEEEeeeEEecCeEEEeeeeeeEEEecccCC Q lcl|Aclame:pro 440 LERAGIVENYELFKQYLVVERD--AS---VPNRLNTLFPPDYVNQLRVFAVVNQFRLQYSEESA 498 (498) Q Consensus 440 le~~given~~~~~~~lvVerd--~~---d~nRvn~~~p~~~vn~l~v~A~~~~f~lq~~~~~~ 498 (498) |..+|.++.|. +.+.++ ++ +.+|+.+.+-......++-| .|+++-....+ T Consensus 688 l~~~G~i~~f~-----V~~d~~~Nt~~~i~~G~~~~~i~~~P~~pae~I----~~~~~~~~~~~ 742 (749) T protein:vir:10 688 VQGRRGVVDFL-----VKCDSTNNTPEAVDRGEFYAEVFLKPTRTINYV----QLTFVATRTGV 742 (749) T ss_pred HHhcCCeeeeE-----EEEcCCCCCHHHhhCCEEEEEEEEEecCCccEE----EEEEEEeecCc Confidence 99999997663 444322 21 22577777766666665543 33333111111 No 55 >protein:vir:100323 Length: 393 # NCBI annotation: tail sheath protein FI # Family: family:all:115 # MgeID: mge:1484 # MgeName: phi-MhaA1-PHL101 # Cross-refs: genbank:acc:YP_655492;genbank:gi:109289960;genbank:GeneID:4157366 Probab=99.18 E-value=1.1e-10 Score=75.25 Aligned_cols=359 Identities=13% Similarity=0.043 Sum_probs=178.4 Q ss_pred CccchhhcCcccccCeEEEEEecCCCCCC--CCCccEEEEEecCCC--CccccceeEEecChHHHHHhhCcCcHHHHHHH Q lcl|Aclame:pro 1 MTISFNTIPSNTLVPLFYAEMDNQAANTA--QDSGASLLIGHANNG--AEIVANSLVLMPSADYARQICGAGSQLARMVE 76 (498) Q Consensus 1 M~i~f~~Ip~~~rvPg~y~E~dns~a~~~--~~~~~vLliGq~~~~--g~~~~~~~~~v~s~~~A~~~fG~GS~l~~M~~ 76 (498) |+|. .. -.||+|+|-.++..... ....-+.+||..-.. ...+.++|++|.|..+....||....+..-++ T Consensus 1 m~m~-----~~-~~~GV~v~e~~~g~~~i~~~~tav~~~vgta~~~~~~~~pln~pv~i~s~~~~~~~~g~~g~L~~al~ 74 (393) T protein:vir:10 1 MSIL-----DT-YLHGVEVVEVNAGGVTISTAATSVIGVVCTGDQADAETFPLNTPVLITNPLNYLEKAGSTGTLRRTLN 74 (393) T ss_pred CCCC-----Cc-cCCCeEEEEcCCCcceecccCcceeEEEeeccCcCcccccCccceEecchHHHHHhhCCccchhhhhh Confidence 6654 11 23999998766555322 223445678864422 23367999999999999999998777666666 Q ss_pred HHHHhCCCceEEEEEecCCccceeEEEEEEeeeccCCcEEEEEEccEEEEEEeecCCCHHHHHHHHHHHHhcCCCceEEE Q lcl|Aclame:pro 77 AYRQTDPFGELYVIAVPEATGAAATVTLTVTGEATESGTVNVYVGRTRVQAPVTNGDNVTTIASSIQDAINAVPTLPFTA 156 (498) Q Consensus 77 a~~~~n~~~~l~~i~l~d~ag~aatg~ititgtat~~G~l~l~I~g~~v~v~V~~gdtaa~iA~~l~~aIn~~~~lpVtA 156 (498) .+.+ +.-...+++.+.+....+.+. ...|++. ..+ ..+-..+|..+-+ .+ T Consensus 75 ~~~~-~~~~~~~vv~v~~~~~~~~t~--------------~~iig~~------~~~--~~tgl~al~~~~~---~~---- 124 (393) T protein:vir:10 75 SIGS-IVKTPTVIVRVAESDDSDTLT--------------ANIVGTQ------ENG--KFTGIKALLTAQS---TV---- 124 (393) T ss_pred hhhc-ccCceEEEeecccCccccccc--------------ccccccc------ccc--hhhHHHHHHhhhh---hc---- Confidence 5543 444555555554332222111 1111111 000 0011112211111 00 Q ss_pred eeccceEEEeeccCcccccceeEEEEecccCcccccccceeeeecccCCCcCcchhhhHHHhhccCcce-EEE-ecCCCh Q lcl|Aclame:pro 157 SSSAGVVTLTARHKGLCGNEIPVSLNYYGFGGGEVLPAGVQIAVATGTAGTGAPVLTGAVAAMADEPFD-YIG-LPFNDT 234 (498) Q Consensus 157 ~~~~~~VtlTAk~kG~~gN~i~l~~~~~~~~~ge~~p~Glt~tit~~agGag~pD~~~alaalg~~~~~-~I~-~p~tD~ 234 (498) +...++.+ ..|. .+.....+|..+.+.... +++ .|. T Consensus 125 -----------------~~~p~li~-----------apg~-----------~~~~~~~al~~~~~~~~~~~~v~d~~--- 162 (393) T protein:vir:10 125 -----------------FVKPKLLC-----------VPQH-----------DNQAVATELLSVAKKLNAFAFISDNG--- 162 (393) T ss_pred -----------------ceeeeeee-----------eccc-----------cchHHHHHHHHHhhccCcEEEEEcCC--- Confidence 00011110 0111 111222333333322111 111 222 Q ss_pred HHHHHHHHHHhhhhhhhhhhhheeeEEEEeccCCHHHHHhhhhccCcceEEEEecCC---------CCCCcHHHHHHHHH Q lcl|Aclame:pro 235 ASVNTLVTEMNDTSGRWSYARQLYGHVYTAKTGTLSELVNAGDQFNQQHITLAGYEK---------ETQTPADELAASRT 305 (498) Q Consensus 235 a~l~al~~~l~~~s~r~~~~~q~~g~~~~~~~gt~~~~~t~g~~~N~~~~t~~~~~~---------~~~~p~~~~AAa~~ 305 (498) ..|..+...+....++.+..+.+... ....|+...+|.+. T Consensus 163 -------------------------------~~t~~~ai~~~~~~~s~~~~~~~P~~~~~d~~~~~~~~~p~s~~~Ag~~ 211 (393) T protein:vir:10 163 -------------------------------ATTKEQAYTYRQNFSQREGMMIFGDWKSYNTDKKAYDTDYAVARACALQ 211 (393) T ss_pred -------------------------------CCCHHHHHHHhhhcCCceEEEEecccccccccCCceeEeehhHHHHHHH Confidence 22333333333333333333322100 01123322222222 Q ss_pred HHhhhhhccCcc-ccccceEEeccccCCCccc----cChHHHHHHHhCCeeEEEEcCCeEEEEeeeeeeeecCCCCCCch Q lcl|Aclame:pro 306 ARAAVFIRNDPA-RPTQTGELVGMLPAPKGKR----FTMTEQQTLLSHGVATAYVESGVLRIQRDVTTYRKNAYGVADNS 380 (498) Q Consensus 306 a~~a~~l~~DPA-rpl~tl~L~Gl~~p~~~~r----~~~~er~~lL~~Gist~~v~~G~v~IeR~ITTY~~n~~G~~D~s 380 (498) +... .+..|. .|-| ..|.|+.-+..... ....|++.|..+||.++.-+.| .++--.-|. ..|+. T Consensus 212 a~~d--~~~G~~~spaN-~~l~gi~~~~~~~~~~~~~~~~~~~~Ln~~gI~t~~~~~G-~~~wG~rT~-------s~d~~ 280 (393) T protein:vir:10 212 AYID--KTVGWHKNISN-VELDGVTGITKAVEFDINESSTEANYLNEKGITICLNHNG-FRYWGSRTL-------ATDTR 280 (393) T ss_pred HHhh--cCCCcEEccCC-ceeeceeecceecccccCCCcchhHhHhhcCceEEEcCCC-EEEEccccc-------CCCcc Confidence 2221 122221 1333 35667765444322 3577899999999999854567 444444442 24788 Q ss_pred hhhhhhHHHHHHHHHHHHHHHhhhcCCceeccCCCCcCCCcccccHHHHHHHHHHHHHHHhhcc--cccchhhhcCeEEE Q lcl|Aclame:pro 381 YLDSETLHTSAYVLRKLKSVITSKYGRHKLASDGTRFGPGQAIVTPAVIKGELLATYRQLERAG--IVENYELFKQYLVV 458 (498) Q Consensus 381 ~ldi~t~~tl~yv~~~~r~~~~~~~~r~kla~dg~~~~~g~~ivTp~~ikaeli~~~~~le~~g--iven~~~~~~~lvV 458 (498) |+.|.+.|+.+|+.+.++.... .|--+.+... |-+.|+..+=..++.|...| .+..++.|.+. T Consensus 281 ~~~i~vrR~~~~i~~~i~~~~~-~~v~e~~~~~-----------~~~~i~~~i~~~L~~l~~~g~~al~g~~v~~~~--- 345 (393) T protein:vir:10 281 WAFQQSVRTAQIIKETIGAGLA-WAVDMPLTPL-----------RVKTMLEAINNKLRSWASGDDPRILGARVWVAE--- 345 (393) T ss_pred cceeehhhHHHHHHHHHHHHHH-HhccCCCCHH-----------HHHHHHHHHHHHHHHHHhccccccccceEEecC--- Confidence 9999999999999999999775 4444443322 44678888888999888755 67777543321 Q ss_pred EEcCC--CCeEEEEEeeeEEecCeEEEeeeeeeEEEecccCC Q lcl|Aclame:pro 459 ERDAS--VPNRLNTLFPPDYVNQLRVFAVVNQFRLQYSEESA 498 (498) Q Consensus 459 erd~~--d~nRvn~~~p~~~vn~l~v~A~~~~f~lq~~~~~~ 498 (498) +.+.+ +.+|+.+.+-......++ .|.|+++|+.+-- T Consensus 346 ~nt~~~i~~G~~~~~i~~~p~~p~e----~I~~~~~~~~~~~ 383 (393) T protein:vir:10 346 EITADIIKSGKFVIKYDYHWIPSLE----SLGLEQRVNDEYV 383 (393) T ss_pred CCCHHHhhCCEEEEEEEEEecCCcc----eEEEEEEEchHHH Confidence 11111 235566666555555544 4677777765533 No 56 >protein:vir:103993 Length: 390 # NCBI annotation: phage tail sheath protein # Family: family:all:115 # MgeID: mge:1665 # MgeName: phi52237 # Cross-refs: genbank:acc:YP_293729;genbank:gi:72537699;genbank:GeneID:3608123 Probab=99.13 E-value=9.9e-11 Score=75.43 Aligned_cols=351 Identities=16% Similarity=0.105 Sum_probs=186.0 Q ss_pred CccchhhcCcccccCeEEEEEecCCCCC--CCCCccEEEEEecCC--CCccccceeEEecChHHHHHhhCcCcHHHHHHH Q lcl|Aclame:pro 1 MTISFNTIPSNTLVPLFYAEMDNQAANT--AQDSGASLLIGHANN--GAEIVANSLVLMPSADYARQICGAGSQLARMVE 76 (498) Q Consensus 1 M~i~f~~Ip~~~rvPg~y~E~dns~a~~--~~~~~~vLliGq~~~--~g~~~~~~~~~v~s~~~A~~~fG~GS~l~~M~~ 76 (498) |. +. ..||+|+|--++.... .....-+.++|..-. ....+.++|+++.|..+....||.++.+...++ T Consensus 1 M~-------~~-~~~Gv~v~e~~~g~~~i~~~~tav~g~vg~a~~ad~~~~pln~pv~i~s~~~~~~~~g~~gtL~~al~ 72 (390) T protein:vir:10 1 MP-------QD-YHHGVRVIEINEGGRPIRSVSTAVLGVVCTAADADASAFPLNTPVLLTNVVAALGKAGKKGTLRRTLD 72 (390) T ss_pred Cc-------cc-ccCCeEEEEcCCCcccccccCcceeEEEEcccCcCccccccccceEeccHHHHHhhcCCCceehhhhh Confidence 44 33 3699997765544422 122344556665321 223356899999999999999997655555444 Q ss_pred HHHHhCCCceEEEEEecCCccceeEEEEEEeeeccCCcEEEEEEccEEEEEEeecCCCHHHHHHHHHHHHhcCCCceEEE Q lcl|Aclame:pro 77 AYRQTDPFGELYVIAVPEATGAAATVTLTVTGEATESGTVNVYVGRTRVQAPVTNGDNVTTIASSIQDAINAVPTLPFTA 156 (498) Q Consensus 77 a~~~~n~~~~l~~i~l~d~ag~aatg~ititgtat~~G~l~l~I~g~~v~v~V~~gdtaa~iA~~l~~aIn~~~~lpVtA 156 (498) .+..+.. ...++ +.|..++..+.. +. T Consensus 73 ~~~~~gg-~~~~v-------------------------------------v~v~~~~~~~~~----------------~~ 98 (390) T protein:vir:10 73 AIGKQTK-PLTVV-------------------------------------VRVAEGKDADET----------------TS 98 (390) T ss_pred hhccccC-ceEEE-------------------------------------EEeccccccccc----------------cc Confidence 4432111 11111 122211111000 00 Q ss_pred eeccceEEEeeccCcccccceeEEEEecccCcccccccceeeeecccCCCcCcchhhhHHHhhccCcceEEEecCCCh-H Q lcl|Aclame:pro 157 SSSAGVVTLTARHKGLCGNEIPVSLNYYGFGGGEVLPAGVQIAVATGTAGTGAPVLTGAVAAMADEPFDYIGLPFNDT-A 235 (498) Q Consensus 157 ~~~~~~VtlTAk~kG~~gN~i~l~~~~~~~~~ge~~p~Glt~tit~~agGag~pD~~~alaalg~~~~~~I~~p~tD~-a 235 (498) ...++ +.-+... .|+.. +..+.. .-....++++.|..+. + T Consensus 99 ~~ig~-~~~~~~~------------------------tg~~a-------------l~~~~~-~~~~~p~il~ap~~~~~~ 139 (390) T protein:vir:10 99 NVIGT-VTPDGKY------------------------TGIKA-------------LLAAQG-ALGVKPRILAAPGLDTQP 139 (390) T ss_pred ccccc-ccccccc------------------------chhhh-------------hhhhhh-hhcceehhhcccccchHH Confidence 00111 0001111 11100 011111 1122234455554333 2 Q ss_pred HHHHHHHHHhhhhhhhhhhhheeeEEEEe--ccCCHHHHHhhhhccCcceEEEEecCC---------CCCCcHHHHHHHH Q lcl|Aclame:pro 236 SVNTLVTEMNDTSGRWSYARQLYGHVYTA--KTGTLSELVNAGDQFNQQHITLAGYEK---------ETQTPADELAASR 304 (498) Q Consensus 236 ~l~al~~~l~~~s~r~~~~~q~~g~~~~~--~~gt~~~~~t~g~~~N~~~~t~~~~~~---------~~~~p~~~~AAa~ 304 (498) ...++..+. .+ +.++++.. ...+..++..+-...++.+..+.+... ....|+. +.+ T Consensus 140 v~~~l~~~a----~~------~~~~aivD~p~~~t~~~a~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s---~~~ 206 (390) T protein:vir:10 140 VAAALAATA----QS------LRAMAYVSASGCKTKEEAAAYRKQFGQREIMVIWPDWLGWDDTTNSTAVIPAP---AIA 206 (390) T ss_pred HHHHHHHhh----cc------cceEEEEecCCCCCHHHHHHHhhccCCceEEEEcCceEeecccCCcccccchH---HHH Confidence 223333332 22 22333333 345677888888888888887754211 1113432 333 Q ss_pred HHHhhhhhccCccc-----cccceEEeccccCCC----ccccChHHHHHHHhCCeeEEEEcCCeEEEEeeeeeeeecCCC Q lcl|Aclame:pro 305 TARAAVFIRNDPAR-----PTQTGELVGMLPAPK----GKRFTMTEQQTLLSHGVATAYVESGVLRIQRDVTTYRKNAYG 375 (498) Q Consensus 305 ~a~~a~~l~~DPAr-----pl~tl~L~Gl~~p~~----~~r~~~~er~~lL~~Gist~~v~~G~v~IeR~ITTY~~n~~G 375 (498) |+..+ ..|..+ |-| ..|.|+.-+.. .......|.+.|..+||.++.-+.| .++--..|. T Consensus 207 Agl~a---~~D~~~g~~~spaN-~~l~gi~~~~~~~~~~~~~~~~~~~~ln~~gi~t~~~~~G-~~~wG~rT~------- 274 (390) T protein:vir:10 207 AGLRA---KIDNDIGWHKTISN-VVVNGVSGISADVSWDLQDPATDAGYLNEHEVTTLVNRNG-FRFWGERTC------- 274 (390) T ss_pred HHHHH---HhhcCCCcEECcCC-ceeeceeecceecccccccccchhhhhhhcCcEEEEcCCC-EEEEccccc------- Confidence 33333 223211 222 34556654332 2333566778899999999866677 344455553 Q ss_pred CCCchhhhhhhHHHHHHHHHHHHHHHhhhcCCceeccCCCCcCCCcccccHHHHHHHHHHHHHHHhhcccccchhhhcCe Q lcl|Aclame:pro 376 VADNSYLDSETLHTSAYVLRKLKSVITSKYGRHKLASDGTRFGPGQAIVTPAVIKGELLATYRQLERAGIVENYELFKQY 455 (498) Q Consensus 376 ~~D~s~ldi~t~~tl~yv~~~~r~~~~~~~~r~kla~dg~~~~~g~~ivTp~~ikaeli~~~~~le~~given~~~~~~~ 455 (498) ..|+.|+.|.+.|+.+|+.+.++..+. .|-.+.+... |-+.||..+-+.+++|..+|.+..++ T Consensus 275 s~d~~~~~i~~rR~~~~i~~~i~~~~~-~~v~e~n~~~-----------~~~~i~~~i~~~L~~l~~~g~l~g~~----- 337 (390) T protein:vir:10 275 SDDPKFAFENYTRTAQVAGDSIAEAQM-PVVDGPLNPS-----------LARDIVESINGWFRQQVANGYLIGGS----- 337 (390) T ss_pred CCCcccceeehhhHHHHHHHHHHHHHH-HhccCCCCHH-----------HHHHHHHHHHHHHHHHHhCCceeeeE----- Confidence 247889999999999999999999775 4554444332 45788999999999999999998864 Q ss_pred EEEE--EcCC---CCeEEEEEeeeEEecCeEEEeeeeeeEEEecccCC Q lcl|Aclame:pro 456 LVVE--RDAS---VPNRLNTLFPPDYVNQLRVFAVVNQFRLQYSEESA 498 (498) Q Consensus 456 lvVe--rd~~---d~nRvn~~~p~~~vn~l~v~A~~~~f~lq~~~~~~ 498 (498) +.+. .|++ +.+|+.+.+-...+...+ .|.|+++|+.+-- T Consensus 338 v~~d~~~nt~~~i~~G~~~~~v~~~p~~pae----~I~~~~~~~~~~~ 381 (390) T protein:vir:10 338 AWIDPEPNTADILASGKAYIDYDYTPVPPLE----NLVLRQRITDRFL 381 (390) T ss_pred EEEccCCCCHHHhhCCeEEEEEEEEecCCcc----eEEEEEEEchHHH Confidence 2222 2221 125666666666555544 5777777776633 No 57 >protein:vir:78206 Length: 390 # NCBI annotation: gp33, phage tail sheath protein # Family: family:all:115 # MgeID: mge:1848 # MgeName: phiE12-2 # Cross-refs: genbank:acc:YP_001111183;genbank:gi:134288694;genbank:GeneID:4960666 Probab=99.13 E-value=9.9e-11 Score=75.43 Aligned_cols=351 Identities=16% Similarity=0.105 Sum_probs=186.0 Q ss_pred CccchhhcCcccccCeEEEEEecCCCCC--CCCCccEEEEEecCC--CCccccceeEEecChHHHHHhhCcCcHHHHHHH Q lcl|Aclame:pro 1 MTISFNTIPSNTLVPLFYAEMDNQAANT--AQDSGASLLIGHANN--GAEIVANSLVLMPSADYARQICGAGSQLARMVE 76 (498) Q Consensus 1 M~i~f~~Ip~~~rvPg~y~E~dns~a~~--~~~~~~vLliGq~~~--~g~~~~~~~~~v~s~~~A~~~fG~GS~l~~M~~ 76 (498) |. +. ..||+|+|--++.... .....-+.++|..-. ....+.++|+++.|..+....||.++.+...++ T Consensus 1 M~-------~~-~~~Gv~v~e~~~g~~~i~~~~tav~g~vg~a~~ad~~~~pln~pv~i~s~~~~~~~~g~~gtL~~al~ 72 (390) T protein:vir:78 1 MP-------QD-YHHGVRVIEINEGGRPIRSVSTAVLGVVCTAADADASAFPLNTPVLLTNVVAALGKAGKKGTLRRTLD 72 (390) T ss_pred Cc-------cc-ccCCeEEEEcCCCcccccccCcceeEEEEcccCcCccccccccceEeccHHHHHhhcCCCceehhhhh Confidence 44 33 3699997765544422 122344556665321 223356899999999999999997655555444 Q ss_pred HHHHhCCCceEEEEEecCCccceeEEEEEEeeeccCCcEEEEEEccEEEEEEeecCCCHHHHHHHHHHHHhcCCCceEEE Q lcl|Aclame:pro 77 AYRQTDPFGELYVIAVPEATGAAATVTLTVTGEATESGTVNVYVGRTRVQAPVTNGDNVTTIASSIQDAINAVPTLPFTA 156 (498) Q Consensus 77 a~~~~n~~~~l~~i~l~d~ag~aatg~ititgtat~~G~l~l~I~g~~v~v~V~~gdtaa~iA~~l~~aIn~~~~lpVtA 156 (498) .+..+.. ...++ +.|..++..+.. +. T Consensus 73 ~~~~~gg-~~~~v-------------------------------------v~v~~~~~~~~~----------------~~ 98 (390) T protein:vir:78 73 AIGKQTK-PLTVV-------------------------------------VRVAEGKDADET----------------TS 98 (390) T ss_pred hhccccC-ceEEE-------------------------------------EEeccccccccc----------------cc Confidence 4432111 11111 122211111000 00 Q ss_pred eeccceEEEeeccCcccccceeEEEEecccCcccccccceeeeecccCCCcCcchhhhHHHhhccCcceEEEecCCCh-H Q lcl|Aclame:pro 157 SSSAGVVTLTARHKGLCGNEIPVSLNYYGFGGGEVLPAGVQIAVATGTAGTGAPVLTGAVAAMADEPFDYIGLPFNDT-A 235 (498) Q Consensus 157 ~~~~~~VtlTAk~kG~~gN~i~l~~~~~~~~~ge~~p~Glt~tit~~agGag~pD~~~alaalg~~~~~~I~~p~tD~-a 235 (498) ...++ +.-+... .|+.. +..+.. .-....++++.|..+. + T Consensus 99 ~~ig~-~~~~~~~------------------------tg~~a-------------l~~~~~-~~~~~p~il~ap~~~~~~ 139 (390) T protein:vir:78 99 NVIGT-VTPDGKY------------------------TGIKA-------------LLAAQG-ALGVKPRILAAPGLDTQP 139 (390) T ss_pred ccccc-ccccccc------------------------chhhh-------------hhhhhh-hhcceehhhcccccchHH Confidence 00111 0001111 11100 011111 1122234455554333 2 Q ss_pred HHHHHHHHHhhhhhhhhhhhheeeEEEEe--ccCCHHHHHhhhhccCcceEEEEecCC---------CCCCcHHHHHHHH Q lcl|Aclame:pro 236 SVNTLVTEMNDTSGRWSYARQLYGHVYTA--KTGTLSELVNAGDQFNQQHITLAGYEK---------ETQTPADELAASR 304 (498) Q Consensus 236 ~l~al~~~l~~~s~r~~~~~q~~g~~~~~--~~gt~~~~~t~g~~~N~~~~t~~~~~~---------~~~~p~~~~AAa~ 304 (498) ...++..+. .+ +.++++.. ...+..++..+-...++.+..+.+... ....|+. +.+ T Consensus 140 v~~~l~~~a----~~------~~~~aivD~p~~~t~~~a~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s---~~~ 206 (390) T protein:vir:78 140 VAAALAATA----QS------LRAMAYVSASGCKTKEEAAAYRKQFGQREIMVIWPDWLGWDDTTNSTAVIPAP---AIA 206 (390) T ss_pred HHHHHHHhh----cc------cceEEEEecCCCCCHHHHHHHhhccCCceEEEEcCceEeecccCCcccccchH---HHH Confidence 223333332 22 22333333 345677888888888888887754211 1113432 333 Q ss_pred HHHhhhhhccCccc-----cccceEEeccccCCC----ccccChHHHHHHHhCCeeEEEEcCCeEEEEeeeeeeeecCCC Q lcl|Aclame:pro 305 TARAAVFIRNDPAR-----PTQTGELVGMLPAPK----GKRFTMTEQQTLLSHGVATAYVESGVLRIQRDVTTYRKNAYG 375 (498) Q Consensus 305 ~a~~a~~l~~DPAr-----pl~tl~L~Gl~~p~~----~~r~~~~er~~lL~~Gist~~v~~G~v~IeR~ITTY~~n~~G 375 (498) |+..+ ..|..+ |-| ..|.|+.-+.. .......|.+.|..+||.++.-+.| .++--..|. T Consensus 207 Agl~a---~~D~~~g~~~spaN-~~l~gi~~~~~~~~~~~~~~~~~~~~ln~~gi~t~~~~~G-~~~wG~rT~------- 274 (390) T protein:vir:78 207 AGLRA---KIDNDIGWHKTISN-VVVNGVSGISADVSWDLQDPATDAGYLNEHEVTTLVNRNG-FRFWGERTC------- 274 (390) T ss_pred HHHHH---HhhcCCCcEECcCC-ceeeceeecceecccccccccchhhhhhhcCcEEEEcCCC-EEEEccccc------- Confidence 33333 223211 222 34556654332 2333566778899999999866677 344455553 Q ss_pred CCCchhhhhhhHHHHHHHHHHHHHHHhhhcCCceeccCCCCcCCCcccccHHHHHHHHHHHHHHHhhcccccchhhhcCe Q lcl|Aclame:pro 376 VADNSYLDSETLHTSAYVLRKLKSVITSKYGRHKLASDGTRFGPGQAIVTPAVIKGELLATYRQLERAGIVENYELFKQY 455 (498) Q Consensus 376 ~~D~s~ldi~t~~tl~yv~~~~r~~~~~~~~r~kla~dg~~~~~g~~ivTp~~ikaeli~~~~~le~~given~~~~~~~ 455 (498) ..|+.|+.|.+.|+.+|+.+.++..+. .|-.+.+... |-+.||..+-+.+++|..+|.+..++ T Consensus 275 s~d~~~~~i~~rR~~~~i~~~i~~~~~-~~v~e~n~~~-----------~~~~i~~~i~~~L~~l~~~g~l~g~~----- 337 (390) T protein:vir:78 275 SDDPKFAFENYTRTAQVAGDSIAEAQM-PVVDGPLNPS-----------LARDIVESINGWFRQQVANGYLIGGS----- 337 (390) T ss_pred CCCcccceeehhhHHHHHHHHHHHHHH-HhccCCCCHH-----------HHHHHHHHHHHHHHHHHhCCceeeeE----- Confidence 247889999999999999999999775 4554444332 45788999999999999999998864 Q ss_pred EEEE--EcCC---CCeEEEEEeeeEEecCeEEEeeeeeeEEEecccCC Q lcl|Aclame:pro 456 LVVE--RDAS---VPNRLNTLFPPDYVNQLRVFAVVNQFRLQYSEESA 498 (498) Q Consensus 456 lvVe--rd~~---d~nRvn~~~p~~~vn~l~v~A~~~~f~lq~~~~~~ 498 (498) +.+. .|++ +.+|+.+.+-...+...+ .|.|+++|+.+-- T Consensus 338 v~~d~~~nt~~~i~~G~~~~~v~~~p~~pae----~I~~~~~~~~~~~ 381 (390) T protein:vir:78 338 AWIDPEPNTADILASGKAYIDYDYTPVPPLE----NLVLRQRITDRFL 381 (390) T ss_pred EEEccCCCCHHHhhCCeEEEEEEEEecCCcc----eEEEEEEEchHHH Confidence 2222 2221 125666666666555544 5777777776633 No 58 >protein:vir:106984 Length: 743 # NCBI annotation: contractile sheath protein gp18 # Family: family:all:661 # MgeID: mge:1459 # MgeName: S-PM2 # Cross-refs: genbank:acc:YP_195136;genbank:gi:58532913;uniprot:Q5GQN6;genbank:GeneID:3260481 Probab=99.04 E-value=4e-09 Score=66.62 Aligned_cols=442 Identities=11% Similarity=0.072 Sum_probs=208.3 Q ss_pred cCcccccCeEEEEEecCCCCC-C-CCCccEEEEEecCCCCccccceeEEecChHHHHHhhCc---CcHHHHHHHHHHHhC Q lcl|Aclame:pro 8 IPSNTLVPLFYAEMDNQAANT-A-QDSGASLLIGHANNGAEIVANSLVLMPSADYARQICGA---GSQLARMVEAYRQTD 82 (498) Q Consensus 8 Ip~~~rvPg~y~E~dns~a~~-~-~~~~~vLliGq~~~~g~~~~~~~~~v~s~~~A~~~fG~---GS~l~~M~~a~~~~n 82 (498) .|.- ..||+|+|--++.... . .......++|. ...++.++|++|+|..|-...||. +|-+..+++.|+.+. T Consensus 1 m~~~-~~Pgvyv~e~~~~~~~i~~~~t~~~~~vg~---~~~Gp~~~p~~i~s~~~~~~~fG~~~~~~~~~~~v~~~f~ng 76 (743) T protein:vir:10 1 MASQ-VSPGILIKERDLTNAVVTGALQIRAAHAST---FAKGPIGDIVNINTQKELVSVFGEPKEDNAEDWMVASEFLNY 76 (743) T ss_pred Cccc-cCCceEEEEecCCCceeccCCcceeEEEEe---ccCCCCCcCEEecCHHHHHHHcCCccCCcchHHHHHHHHHhC Confidence 4543 3699999976655522 2 22334566764 456788999999999999999995 688899999999777 Q ss_pred CCceEEEEEecCCccceeE--E-----------------EEEEe--eeccCCcEEEEEEc-------------------c Q lcl|Aclame:pro 83 PFGELYVIAVPEATGAAAT--V-----------------TLTVT--GEATESGTVNVYVG-------------------R 122 (498) Q Consensus 83 ~~~~l~~i~l~d~ag~aat--g-----------------~itit--gtat~~G~l~l~I~-------------------g 122 (498) - ..+|++.+.+.....++ + .++++ .+.+.+..+.+.|- + T Consensus 77 g-~~~~vvrv~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~G~~gN~i~V~v~~~~~d~~~~~~~~~~~~~~~ 155 (743) T protein:vir:10 77 G-GRLAVVRAETTGVLNATTGSAGVLVKNRESWDAGSGNGEVFVARTAGSWGNSLMGVLVDRGADYIVTFAATPTDTAVG 155 (743) T ss_pred C-ceEEEEEccCccccccccccccccccccccccccccceeEEEEeeccccccceEEEEecCCCcceeeeeccccccccc Confidence 5 89999999753211111 0 11111 01111111111110 0 Q ss_pred EEE-----------------------EEEee-------------cCCCH-----------------------------HH Q lcl|Aclame:pro 123 TRV-----------------------QAPVT-------------NGDNV-----------------------------TT 137 (498) Q Consensus 123 ~~v-----------------------~v~V~-------------~gdta-----------------------------a~ 137 (498) ..+ .+.+. .++.+ .. T Consensus 156 ~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 235 (743) T protein:vir:10 156 TQLLFSYSGTLVTGEILSYDSATNTATITASGTLTSQYLLDTPEQGLIGSFTDNSTTEVGRTPGTYSNVPASGGTGTGAT 235 (743) T ss_pred eeeeecccccccccceeeeeecCcceeeeeccccceeeecccccccccccccccccccccccccceeeEEeccccccccc Confidence 000 00000 00000 00 Q ss_pred HH-----------HHH-------HHHHhcCCCceE--------------EEeeccceEEEeec----------------- Q lcl|Aclame:pro 138 IA-----------SSI-------QDAINAVPTLPF--------------TASSSAGVVTLTAR----------------- 168 (498) Q Consensus 138 iA-----------~~l-------~~aIn~~~~lpV--------------tA~~~~~~VtlTAk----------------- 168 (498) .. ..+ ...++....+.+ .+....+.+.+++. T Consensus 236 ~tv~v~~~~~~vg~~v~~~~~~~~~~~~~~~~~~v~~~~~~~~t~~~~~~~~~~~g~~~~~a~~~~~~~~~~~~~~~~~~ 315 (743) T protein:vir:10 236 FNVVVADAGGGVGGSVVVTLANPGTGYNQGETLTIASAATGDGTDILVTVATLSDGTIAITELKDWYLNTEIGSTGIKLG 315 (743) T ss_pred ccccccccccccccccccccccccceeeeccccccccccccccccchhheecccccceeeeecccccccchhhccccccc Confidence 00 000 000000000000 00000001111100 Q ss_pred -------------cCcccccceeEE------------------EEecccCcccccccce--------------------- Q lcl|Aclame:pro 169 -------------HKGLCGNEIPVS------------------LNYYGFGGGEVLPAGV--------------------- 196 (498) Q Consensus 169 -------------~kG~~gN~i~l~------------------~~~~~~~~ge~~p~Gl--------------------- 196 (498) ..+..+..+.+. ............+.|- T Consensus 316 ~~~~~~~t~~~~~~~~~~~d~~~v~v~~~~~~~~~~~~~v~~~~~~~s~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~~~ 395 (743) T protein:vir:10 316 DIGPRPGTSQFATDNGITDDQVHFAVIDTTGELTGTANTIVERLTYLSKLSDARSEENANIYYKNVINEQSAYLYHGNDA 395 (743) T ss_pred cccccceeeeccccccccccceEEEEecCcceeeeccCceeEEEeeeecccccccccCcceeecceeccccceeeccCcc Confidence 000000000000 0000000000000000 Q ss_pred -----------------------------eeeecccCCCcCcc-----hhhhHHHhhcc---CcceEEEecCC-----C- Q lcl|Aclame:pro 197 -----------------------------QIAVATGTAGTGAP-----VLTGAVAAMAD---EPFDYIGLPFN-----D- 233 (498) Q Consensus 197 -----------------------------t~tit~~agGag~p-----D~~~alaalg~---~~~~~I~~p~t-----D- 233 (498) ..+...+.||.... ++..+++.+.. ..++++++|-. | T Consensus 396 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gG~d~~~~~~~~~~~~~~~~~~~~~~~~~ll~~p~~~~~~~~~ 475 (743) T protein:vir:10 396 AVQIAASGEAWGQSSDQVLADAGTAFSRTTGYWVNLAGGNDDFAYDAGEFGAAMDLFLDTEETEIDFVLMGGSMADEADT 475 (743) T ss_pred cceeeeccccCccccceeeeecccccccccceEEEeecCccccccchhHHHHHHHHhhhccccCcceEEecCcccCccch Confidence 00112334554321 34556666543 45688888732 1 Q ss_pred hHHHHHHHHHHhhhhhhhhhhhheeeEEEEec-----------------cCCHHHHHhhhhccCcceEEEEecC------ Q lcl|Aclame:pro 234 TASVNTLVTEMNDTSGRWSYARQLYGHVYTAK-----------------TGTLSELVNAGDQFNQQHITLAGYE------ 290 (498) Q Consensus 234 ~a~l~al~~~l~~~s~r~~~~~q~~g~~~~~~-----------------~gt~~~~~t~g~~~N~~~~t~~~~~------ 290 (498) .+...++-++.+ .|+. .+++... .........+....++.+..+.+.. T Consensus 476 ~~v~~a~~~~~~---~~~~------~~a~~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~ 546 (743) T protein:vir:10 476 KSKATKVIAIAA---SRKD------ALAFVSPHKGNQIASTGNVALSSAQQKENTIAFFSDLTSTSYAVFDSGYKYVYDR 546 (743) T ss_pred HHHHHHHHHHHH---hhCC------eEEEEecCCCccccccccccccccccchHHHHHHHhccCCeeEEEEccceeeecc Confidence 222334444432 2332 2222211 1112233344444566666554311 Q ss_pred --CC-CCCcHHHHHHHHHHHhhhhhccCccc-----cccceEEecccc-CCCccccChHHHHHHHhCCeeEEEEc--CCe Q lcl|Aclame:pro 291 --KE-TQTPADELAASRTARAAVFIRNDPAR-----PTQTGELVGMLP-APKGKRFTMTEQQTLLSHGVATAYVE--SGV 359 (498) Q Consensus 291 --~~-~~~p~~~~AAa~~a~~a~~l~~DPAr-----pl~tl~L~Gl~~-p~~~~r~~~~er~~lL~~Gist~~v~--~G~ 359 (498) +. ...|+.. .+|+..| +.|..| |-+ ..+.|+.- -+..-.++..|++.|..+||.++..- .| T Consensus 547 ~~~~~~~~p~s~---~~AGl~a---~~D~~~g~~~span-~~~~gi~g~~~~~~~~~~~~~~~Ln~~gIn~i~~~~~~G- 618 (743) T protein:vir:10 547 FTDKYRYIPCNG---DVAGLCV---QTSNQLDDWYSPAG-LNRGGILNAVKLAYNPNKADRDELYQNRINPVVSLRGQG- 618 (743) T ss_pred ccCceeEechhH---HHHHHHH---HhhccCCcEEccCC-eeeeeeeccccceecCChhHHHhHhhCCceEEEEecCCe- Confidence 10 1124433 3333433 233222 332 33444432 23345678899999999999998753 45 Q ss_pred EEEEeeeeeeeecCCCCCCchhhhhhhHHHHHHHHHHHHHHHhhhcCCceeccCCCCcCCCcccccHHHHHHHHHHHHHH Q lcl|Aclame:pro 360 LRIQRDVTTYRKNAYGVADNSYLDSETLHTSAYVLRKLKSVITSKYGRHKLASDGTRFGPGQAIVTPAVIKGELLATYRQ 439 (498) Q Consensus 360 v~IeR~ITTY~~n~~G~~D~s~ldi~t~~tl~yv~~~~r~~~~~~~~r~kla~dg~~~~~g~~ivTp~~ikaeli~~~~~ 439 (498) .++--.-|. . ..|+.|+.|.+.|+.+|+.+.++.... .|--+.+.. .+-+.||..+=+.+++ T Consensus 619 ~~~wG~rT~---~---s~d~~~~~i~vrR~~~~i~~si~~~~~-~~v~e~n~~-----------~~~~~i~~~i~~fL~~ 680 (743) T protein:vir:10 619 ITLFGDKTA---L---AAPSAFDRINVRRLFLNLEKRARRLAE-GVLFEQNDA-----------TTRAGFSSALNSYLSE 680 (743) T ss_pred EEEEccccc---C---CCCcccceEeehhhHHHHHHHHHHHHH-HhccCCCCH-----------HHHHHHHHHHHHHHHH Confidence 444444443 1 357889999999999999999999774 444443322 1447788999999999 Q ss_pred HhhcccccchhhhcCeEEEEEcCC-----CCeEEEEEeeeEEecCeEEEeeeeeeEEEecccCC Q lcl|Aclame:pro 440 LERAGIVENYELFKQYLVVERDAS-----VPNRLNTLFPPDYVNQLRVFAVVNQFRLQYSEESA 498 (498) Q Consensus 440 le~~given~~~~~~~lvVerd~~-----d~nRvn~~~p~~~vn~l~v~A~~~~f~lq~~~~~~ 498 (498) |..+|.++.+. +.|-++.+ +.+|+.+.+-....-.++- |.|++|..-.-+ T Consensus 681 l~~~gal~~~~-----V~~d~~~nt~~~i~~G~~~~~i~~~p~~pae~----I~~~~~~~~~~~ 735 (743) T protein:vir:10 681 VQARRGVTDYL-----VICDESNNTPDIIDRNEFVAEVYVKPTRSINF----ITITFTATKTGV 735 (743) T ss_pred HHhcCceeeeE-----EEEcCCCCCHHHhhCCeEEEEEEEEecCCcce----EEEEEEEeecCc Confidence 99999997663 44432211 2366777766666665543 334554332332 No 59 >protein:vir:96740 Length: 388 # NCBI annotation: phage tail sheath protein FI-like # Family: family:all:115 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039839;genbank:gi:126010913;genbank:GeneID:5076250 Probab=98.98 E-value=3.7e-09 Score=66.79 Aligned_cols=347 Identities=14% Similarity=0.086 Sum_probs=176.4 Q ss_pred Ccc--chhhcCcccccCeEEEEEecCCCC--CCCCCccEEEEEecCC-CCccccceeEEecChHHHHHhhCcCc---HHH Q lcl|Aclame:pro 1 MTI--SFNTIPSNTLVPLFYAEMDNQAAN--TAQDSGASLLIGHANN-GAEIVANSLVLMPSADYARQICGAGS---QLA 72 (498) Q Consensus 1 M~i--~f~~Ip~~~rvPg~y~E~dns~a~--~~~~~~~vLliGq~~~-~g~~~~~~~~~v~s~~~A~~~fG~GS---~l~ 72 (498) |.+ +| .||+|++-.++... .....-.+-+||-.-. ++..+.++|+.+.+..+...++|.+. .+. T Consensus 1 m~~~~~~--------~hGv~v~ev~~g~~~i~~~~tavi~~Vgta~~ad~~~p~~~~~~i~~~~d~~~~~~~~~~~gtl~ 72 (388) T protein:vir:96 1 MPVIDQF--------EHNGISIETHEPPPPMGPPGDNVVAWVVTAPDKHADVAFSVPFRVANTADAQYLDSTGNELGTGW 72 (388) T ss_pred CCCCCCC--------CCceEEEEcCCCcccccccCcceeEEEEecCCCccccccccceeeecchhhhhhhccccccccch Confidence 653 33 38999886555442 2233445566665432 34567889999999988888887653 333 Q ss_pred HHHHHHHHhCCCceEEEEEecCCccceeEEEEEEeeeccCCcEEEEEEccEEEEEEeecCCCHHHHHHHHHHHHhcCCCc Q lcl|Aclame:pro 73 RMVEAYRQTDPFGELYVIAVPEATGAAATVTLTVTGEATESGTVNVYVGRTRVQAPVTNGDNVTTIASSIQDAINAVPTL 152 (498) Q Consensus 73 ~M~~a~~~~n~~~~l~~i~l~d~ag~aatg~ititgtat~~G~l~l~I~g~~v~v~V~~gdtaa~iA~~l~~aIn~~~~l 152 (498) ..+..+. .+.-..++++.+.+.....++.+- .|++ T Consensus 73 ~al~~~~-~~~~~~~~vv~v~~g~~~~at~a~--------------iig~------------------------------ 107 (388) T protein:vir:96 73 HAASETL-KKTSVPQYFIVVPEGADDAATMAN--------------IIGG------------------------------ 107 (388) T ss_pred hhhHhhh-ccCCceEEEEEeccccccccccce--------------eeee------------------------------ Confidence 3333322 112223333333221111110000 0000 Q ss_pred eEEEeeccceEEEeeccCcccccceeEEEEecccCcccccccceeeeecccCCCcCcchhhhHHHhhccCcceEEEecC- Q lcl|Aclame:pro 153 PFTASSSAGVVTLTARHKGLCGNEIPVSLNYYGFGGGEVLPAGVQIAVATGTAGTGAPVLTGAVAAMADEPFDYIGLPF- 231 (498) Q Consensus 153 pVtA~~~~~~VtlTAk~kG~~gN~i~l~~~~~~~~~ge~~p~Glt~tit~~agGag~pD~~~alaalg~~~~~~I~~p~- 231 (498) ....+|......++..++ ...++|+.|. T Consensus 108 --------------------------------------------------~~~~tg~~~gl~al~~~~-~~p~il~aPg~ 136 (388) T protein:vir:96 108 --------------------------------------------------IDPTTGRRTGIAALTECT-ERPTLIGAPGF 136 (388) T ss_pred --------------------------------------------------cccccchhhHHHHhhhcc-cceeEEEeecc Confidence 000001111111222111 1234555443 Q ss_pred CCh-HHHHHHHHHHhhhhhhhhhhhheeeEEE-EeccCCHHHHHhhhh-----ccCcceEEEEecC--------C-CCCC Q lcl|Aclame:pro 232 NDT-ASVNTLVTEMNDTSGRWSYARQLYGHVY-TAKTGTLSELVNAGD-----QFNQQHITLAGYE--------K-ETQT 295 (498) Q Consensus 232 tD~-a~l~al~~~l~~~s~r~~~~~q~~g~~~-~~~~gt~~~~~t~g~-----~~N~~~~t~~~~~--------~-~~~~ 295 (498) +|. .-.++|..+++ ++ .++++ .+..++..+...+.. ..||.+..+.+.. + .... T Consensus 137 s~~~~v~~al~~~~~----~~------~~~~i~D~p~~~~~~~~~~~~~~~~~~~~s~~~~~~~P~~~~~d~~~~~~~~~ 206 (388) T protein:vir:96 137 SQNKAVIDALASMAK----RL------KCRAVIDGPSGSTQDAIDLSGLLGGEGTGHDRVYMVDPMPAIYSRKAQGNIYV 206 (388) T ss_pred ccchHHHHHHHHHHh----hc------CcEEEEeccCCchhHHHHHHhhhhccCcCcceEEEEeCceeeecccCCceeee Confidence 222 22333444432 11 12222 222333333332221 2466666655421 0 1112 Q ss_pred cHHHHHHHHHHHhhhhhccCcc-cccc-ceEEeccccCCC-ccccChHHHHHHHhCCeeEEEEc--CCeEEEEeeeeeee Q lcl|Aclame:pro 296 PADELAASRTARAAVFIRNDPA-RPTQ-TGELVGMLPAPK-GKRFTMTEQQTLLSHGVATAYVE--SGVLRIQRDVTTYR 370 (498) Q Consensus 296 p~~~~AAa~~a~~a~~l~~DPA-rpl~-tl~L~Gl~~p~~-~~r~~~~er~~lL~~Gist~~v~--~G~v~IeR~ITTY~ 370 (498) |+ ++.+|+..| +.||- .|-| .+.+.|+.-+-. ....+..|.+.|..+||.++.-. .| .++--..| T Consensus 207 p~---s~~~AG~~a---~~D~~~spaN~~i~i~g~~~~~~~~~~~~~~~~~~Ln~~gI~~i~~~~~~G-~~~wG~rT--- 276 (388) T protein:vir:96 207 PP---STIAMGAVA---AVKPWESPGNQGVLIQDVARVIDYNILDKSTEGDLLNRNGVSYFARTSMGG-FSLIGNRT--- 276 (388) T ss_pred ch---HHHHHHHHH---hhcCcccccCeeEEeeeecccccccccCChhhHHhhhhcCceEEEEecCCc-EEEEcccc--- Confidence 33 233344444 45652 2333 355666643222 23346789999999999998653 45 33333333 Q ss_pred ecCCCCCCchhhhhhhHHHHHHHHHHHHHHHhhhcCCceeccCCCCcCCCcccccHHHHHHHHHHHHHHHhhcccccchh Q lcl|Aclame:pro 371 KNAYGVADNSYLDSETLHTSAYVLRKLKSVITSKYGRHKLASDGTRFGPGQAIVTPAVIKGELLATYRQLERAGIVENYE 450 (498) Q Consensus 371 ~n~~G~~D~s~ldi~t~~tl~yv~~~~r~~~~~~~~r~kla~dg~~~~~g~~ivTp~~ikaeli~~~~~le~~given~~ 450 (498) -+|+.|.+.|+.+|+.+.++.... .|-.+.+... +-+.|+..+-..+++|..+|.+..++ T Consensus 277 --------~~~~~i~vrR~~~~i~~si~~~~~-~~v~epn~~~-----------~~~~i~~~i~~fL~~l~~~Gal~g~~ 336 (388) T protein:vir:96 277 --------VTGKFISFVGLEDAIARKLEAASQ-RAMSKQLTKS-----------FMEQEIKKINLFMQDLVAAEIIPGGE 336 (388) T ss_pred --------cCCcceeehhhHHHHHHHHHHHHH-HhccCCCCHH-----------HHHHHHHHHHHHHHHHHhCCceeeeE Confidence 248899999999999999999775 4544443322 55788999999999999999998865 Q ss_pred hhcCeEEEEEcCCC---CeEEEEEeeeEEecCeEEEeeeeeeEEEecccCC Q lcl|Aclame:pro 451 LFKQYLVVERDASV---PNRLNTLFPPDYVNQLRVFAVVNQFRLQYSEESA 498 (498) Q Consensus 451 ~~~~~lvVerd~~d---~nRvn~~~p~~~vn~l~v~A~~~~f~lq~~~~~~ 498 (498) .|- .-+.|+++ .+|+.+.+-...+..++ .|.|+++|+.+-- T Consensus 337 ~~~---d~~~nt~~~i~~G~~~~~i~~~p~~pae----~I~~~~~~~~~~~ 380 (388) T protein:vir:96 337 VYL---HPTLNTVERYKNGSWYIVIDYGRYSPNE----HMIFHLNAVDRIV 380 (388) T ss_pred EEE---ecCCCCHHHhhCCEEEEEEEEEecCCcc----eEEEEEEEchHHH Confidence 332 12222222 34777777777666655 4667777665544 No 60 >protein:vir:10336 Length: 386 # NCBI annotation: ORF39 # Family: family:all:115 # MgeID: mge:182 # MgeName: VHML # Cross-refs: genbank:acc:NP_758931;genbank:gi:27311206;genbank:GeneID:956116 Probab=98.92 E-value=5.6e-09 Score=65.82 Aligned_cols=358 Identities=15% Similarity=0.095 Sum_probs=179.3 Q ss_pred CccchhhcCcccccCeEEEEEecCCCC--CCCCCccEEEEEecCC--CCccccceeEEecChHHHHHhhCcCcHHHHHHH Q lcl|Aclame:pro 1 MTISFNTIPSNTLVPLFYAEMDNQAAN--TAQDSGASLLIGHANN--GAEIVANSLVLMPSADYARQICGAGSQLARMVE 76 (498) Q Consensus 1 M~i~f~~Ip~~~rvPg~y~E~dns~a~--~~~~~~~vLliGq~~~--~g~~~~~~~~~v~s~~~A~~~fG~GS~l~~M~~ 76 (498) |. .. ..||+|+|-.++.+. ......-+.+||-.-. .+..+.++|+.+.|..+....||.+-.+...++ T Consensus 1 M~-------~~-~~~Gv~v~ev~~~~~~i~~v~tav~~~vg~a~~a~~~~~~~~~pv~i~s~~~~~~~~g~~~tl~~a~~ 72 (386) T protein:vir:10 1 MA-------EQ-YLHGAEVVEIDNGARPIRTAQSGVIGLVGTAPDADATAFPLNTPVLIAGSRREAAKLGAGGTLPQAID 72 (386) T ss_pred Cc-------cc-cCCCeEEEEcCCCcccccccCcceeEEEEecCCCCCcccccccceEecchHHHHhhcCCCcchhHHHH Confidence 44 43 469999986655442 1122344566774322 234467899999999999999998766666666 Q ss_pred HHHHhCCCceEEEEEecCCccceeEEEEEEeeeccCCcEEEEEEccEEEEEEeecCCCHHHHHHHHHHHHhcCCCceEEE Q lcl|Aclame:pro 77 AYRQTDPFGELYVIAVPEATGAAATVTLTVTGEATESGTVNVYVGRTRVQAPVTNGDNVTTIASSIQDAINAVPTLPFTA 156 (498) Q Consensus 77 a~~~~n~~~~l~~i~l~d~ag~aatg~ititgtat~~G~l~l~I~g~~v~v~V~~gdtaa~iA~~l~~aIn~~~~lpVtA 156 (498) .++++ .-...+++.+........+. . T Consensus 73 ~~~~~-gg~~~~vv~~~~~~~~~~t~-----------------------------------------------------~ 98 (386) T protein:vir:10 73 GIFDQ-TGAVVVVIRVDEGVDSAATQ-----------------------------------------------------S 98 (386) T ss_pred HHhcc-CceeEEEeeccccccccccc-----------------------------------------------------h Confidence 65532 22233333332211000000 0 Q ss_pred eeccceEEEeeccCcccccceeEEEEecccCcccccccceeeeecccCCCcCcchhhhHHHhhccCcceEEEecCCChHH Q lcl|Aclame:pro 157 SSSAGVVTLTARHKGLCGNEIPVSLNYYGFGGGEVLPAGVQIAVATGTAGTGAPVLTGAVAAMADEPFDYIGLPFNDTAS 236 (498) Q Consensus 157 ~~~~~~VtlTAk~kG~~gN~i~l~~~~~~~~~ge~~p~Glt~tit~~agGag~pD~~~alaalg~~~~~~I~~p~tD~a~ 236 (498) ...++....+.+..|... +.-.+ ...|+...+..-.+......+.+++..+.+..--+...+ T Consensus 99 ~~ig~~~~~t~~~tgl~~----l~~~~--------~~~~~~p~i~~ap~~~~~~~v~~~l~~~~~~~~~~~~~~------ 160 (386) T protein:vir:10 99 NVIGKVDADTEQYTGILA----LLSAE--------NTVKVQPRILIAPGFSNQKAVADQLVSVADTAAWLCHSG------ 160 (386) T ss_pred hhhcccccccchhhhhHH----hhhhc--------ccccccccccccccccchhHHHHHHHHhhcceEEEEEeC------ Confidence 000000000111111000 00000 000000000000011112233444444443322222221 Q ss_pred HHHHHHHHhhhhhhhhhhhheeeEEEEeccCCHHHHHhhhhccCcceEEEEec--------C-CCCCCcHHHHHHHHHHH Q lcl|Aclame:pro 237 VNTLVTEMNDTSGRWSYARQLYGHVYTAKTGTLSELVNAGDQFNQQHITLAGY--------E-KETQTPADELAASRTAR 307 (498) Q Consensus 237 l~al~~~l~~~s~r~~~~~q~~g~~~~~~~gt~~~~~t~g~~~N~~~~t~~~~--------~-~~~~~p~~~~AAa~~a~ 307 (498) +...+..+..++....++.+..+.+. . +....|+. +.+|+. T Consensus 161 ---------------------------~~~~~~~~a~~~~~~~~s~~~~~~~p~~~v~~~~~~~~~~~p~s---~~~ag~ 210 (386) T protein:vir:10 161 ---------------------------WSNTTDAAAITYRELFGSRRCEVVDPWYKVWDVETSAHIIQPPS---ARHAGV 210 (386) T ss_pred ---------------------------CCCCchHHHHHhhhcccccceEEecCceeeeccccccceeechH---HHHHHH Confidence 11222223333333334444333221 0 11112432 233333 Q ss_pred hhhh-hccCcc-ccccceEEeccccCCC----ccccChHHHHHHHhCCeeEEEEcCCeEEEEeeeeeeeecCCCCCCchh Q lcl|Aclame:pro 308 AAVF-IRNDPA-RPTQTGELVGMLPAPK----GKRFTMTEQQTLLSHGVATAYVESGVLRIQRDVTTYRKNAYGVADNSY 381 (498) Q Consensus 308 ~a~~-l~~DPA-rpl~tl~L~Gl~~p~~----~~r~~~~er~~lL~~Gist~~v~~G~v~IeR~ITTY~~n~~G~~D~s~ 381 (498) .|.- .+..|. .|-+ ..|.|+.-+.. ....+..|++.|..+||.++.-+.| .++--.-|. ..|+.| T Consensus 211 ~a~~D~~~G~~~spaN-~~l~gv~~~~~~~~~~~~~~~~~~~~l~~~gi~~~~~~~G-~~~wG~rT~-------~~d~~~ 281 (386) T protein:vir:10 211 MAKVHNTLGFWWSNSN-QEILGIDGLCRPVDFKLDDPTCRANLLNAKEVTTTIQQNG-FRVWGDRTC-------SADSKW 281 (386) T ss_pred HHHhhhcCCcEEccCC-ceeecccccceecccccccCcchhhhhhhcCcEEEEcCCC-EEEEccccc-------CCCccc Confidence 3311 122231 1333 35666654433 2334678999999999999876677 555555553 237789 Q ss_pred hhhhhHHHHHHHHHHHHHHHhhhcCCceeccCCCCcCCCcccccHHHHHHHHHHHHHHHhhcccccchhhhcCeEEEEE- Q lcl|Aclame:pro 382 LDSETLHTSAYVLRKLKSVITSKYGRHKLASDGTRFGPGQAIVTPAVIKGELLATYRQLERAGIVENYELFKQYLVVER- 460 (498) Q Consensus 382 ldi~t~~tl~yv~~~~r~~~~~~~~r~kla~dg~~~~~g~~ivTp~~ikaeli~~~~~le~~given~~~~~~~lvVer- 460 (498) +.|.+.|+.+|+.+.++..+. .|-.+.+... |-+.|+..+=+.++.|..+|.+..++ +.+.+ T Consensus 282 ~~i~vrR~~~~i~~~~~~~~~-~~v~e~~~~~-----------~~~~i~~~i~~~L~~l~~~g~l~g~~-----v~~d~~ 344 (386) T protein:vir:10 282 AFKNVVITNDMIADSLVRNHL-WAVDRNITKT-----------YVEDVTEGVNNYLRHLKNIGAIAGGE-----CWVDPE 344 (386) T ss_pred ceeehhhHHHHHHHHHHHHHH-HhccCCCCHH-----------HHHHHHHHHHHHHHHHHhCCceeeeE-----EEEccc Confidence 999999999999999999775 4554443322 55788899999999999999999864 22322 Q ss_pred -cCC---CCeEEEEEeeeEEecCeEEEeeeeeeEEEecccCC Q lcl|Aclame:pro 461 -DAS---VPNRLNTLFPPDYVNQLRVFAVVNQFRLQYSEESA 498 (498) Q Consensus 461 -d~~---d~nRvn~~~p~~~vn~l~v~A~~~~f~lq~~~~~~ 498 (498) |++ +.+|+.+.+-...+-.++- |.|+++|+.+.- T Consensus 345 ~nt~~~~~~G~~~~~i~~~p~~p~e~----i~~~~~~~~~~~ 382 (386) T protein:vir:10 345 LNSPDQIQQGKVYFDYDFSAYAPAEH----ITFRSHMVNGYL 382 (386) T ss_pred CCCHHHhhCCeEEEEEEEEecCCcee----EEEEEEEehhHH Confidence 221 2377777777777776654 456666665544 No 61 >protein:vir:5833 Length: 742 # NCBI annotation: similar to tail sheath protein # Family: family:all:661 # MgeID: mge:123 # MgeName: RM 378 # Cross-refs: genbank:acc:NP_835619;genbank:gi:30044022 Probab=98.31 E-value=1.4e-06 Score=52.67 Aligned_cols=424 Identities=12% Similarity=0.101 Sum_probs=173.8 Q ss_pred CccchhhcCcccccC---eEEEEEecCCCCCCCCCccEEEEEecCC--------CCccccce-eEEecChHHHHHhhCcC Q lcl|Aclame:pro 1 MTISFNTIPSNTLVP---LFYAEMDNQAANTAQDSGASLLIGHANN--------GAEIVANS-LVLMPSADYARQICGAG 68 (498) Q Consensus 1 M~i~f~~Ip~~~rvP---g~y~E~dns~a~~~~~~~~vLliGq~~~--------~g~~~~~~-~~~v~s~~~A~~~fG~G 68 (498) .-|.|..| |-| --||.|.+-..+.+..++-.-++|-++. .|..-+|. |.+..+..+....-|.| T Consensus 254 ~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~n~~~~~~~~~~~~~~~~~~~ 329 (742) T protein:vir:58 254 VVVHFRDI----RGVSANTEYIRFRQVNLNPESPNYIERVIGNMTFEFDGERIVTGGEYPNQVPFLRVVVSQDIKQNVAG 329 (742) T ss_pred EEEEEeec----cCCCCCccceeeeeeecCCCCcceeeecccceeeeeccceeeecccccccccceeeEeccccCcCccc Confidence 22444432 111 1244444444444433333333433322 11111122 33333333332222221 Q ss_pred -cHHHHH-HHHHHHhCCCceEEEE---------EecCCcccee-----EEEEEEe--------eeccCCcEEEEEEccEE Q lcl|Aclame:pro 69 -SQLARM-VEAYRQTDPFGELYVI---------AVPEATGAAA-----TVTLTVT--------GEATESGTVNVYVGRTR 124 (498) Q Consensus 69 -S~l~~M-~~a~~~~n~~~~l~~i---------~l~d~ag~aa-----tg~itit--------gtat~~G~l~l~I~g~~ 124 (498) |.+... ...+ .+.+++.++ ++.+.+-... .-.+++. +..+.+.++. .+... T Consensus 330 ~s~~~~~~~~~~---~~v~d~~~~~~~~~~v~~~~t~~~~~pp~~~~~~e~v~~ngG~~f~v~s~~~~g~~i~--~~~as 404 (742) T protein:vir:58 330 VEKWVPVGFEGI---YSVGDFTVIVNELTNVSIPVTDSAIIPPMRFTRIEQITLSGGASFSVISNQPYGFNIQ--DSRHS 404 (742) T ss_pred eeEEEecccccc---ccccceeeeccccccceeeccccccCCcccccccceeecccCcceEEEEecccCccee--ccCcc Confidence 111100 0001 111211111 2211110000 0011111 1111111000 00000 Q ss_pred EEEEeecCCCHHHHH--HHHHHHHhcCCCceEEEeec-cceEEEeeccCcccccceeEEEEecccCcccccccceeeeec Q lcl|Aclame:pro 125 VQAPVTNGDNVTTIA--SSIQDAINAVPTLPFTASSS-AGVVTLTARHKGLCGNEIPVSLNYYGFGGGEVLPAGVQIAVA 201 (498) Q Consensus 125 v~v~V~~gdtaa~iA--~~l~~aIn~~~~lpVtA~~~-~~~VtlTAk~kG~~gN~i~l~~~~~~~~~ge~~p~Glt~tit 201 (498) ..+. .-+....|. .....+.+...+.-++.... -..+.++....| |.+-.+..... +....|- ...+ T Consensus 405 ~~~s--~ln~~~~V~Gt~aa~~~~d~~t~~~v~s~~~alp~~a~sv~laG--G~dg~v~v~~~-----~~D~iG~-~~~~ 474 (742) T protein:vir:58 405 YWLS--PFKDDELIIGTELVLPALDVSTEFGVSSWEEALPEFSFLMPFQG--GSDGYIRVDEN-----EPDTIGR-VKIT 474 (742) T ss_pred eEEe--ccCCceEEEeehhhccccccchheeccccccccceeeEEEeecC--CccccccccCC-----Ccccccc-cccc Confidence 0000 000000000 00000000000000000000 001111211111 11111111000 0000000 0000 Q ss_pred ccCCCcCcchhhhHHHhh-ccCcceEEEec-CCChHHHHHHHHHHhhhhhhhhhhhheeeEEEEeccC--CHHHHHhhhh Q lcl|Aclame:pro 202 TGTAGTGAPVLTGAVAAM-ADEPFDYIGLP-FNDTASVNTLVTEMNDTSGRWSYARQLYGHVYTAKTG--TLSELVNAGD 277 (498) Q Consensus 202 ~~agGag~pD~~~alaal-g~~~~~~I~~p-~tD~a~l~al~~~l~~~s~r~~~~~q~~g~~~~~~~g--t~~~~~t~g~ 277 (498) ... .-|. +.|.++ ..+.+++|+.| |++.....++.++++....|+. .+. ....+ +..+...+.. T Consensus 475 d~~----~adr-TGL~ALlev~eVtILiAPG~t~~~v~aav~A~la~a~~Rl~------vL~-D~P~~~tt~~~A~a~r~ 542 (742) T protein:vir:58 475 PAL----LANY-ERLLPLLTEDQFDLVLTPYLTFADHAGTVNAFINRAENRFL------YLF-DIAGDDDTENLAISLAG 542 (742) T ss_pred ccc----ccch-hHHHHhhhcCCCcEEEEcCCCchHHHHHHHHHHHhhcCCeE------EEE-ecCCCCchHHHHHHHHh Confidence 000 0122 234444 34678999887 6666667788888876555543 222 22222 2345666777 Q ss_pred ccCcceEEEEecCC-------CCCCcHHHHHHHHHHHhhhhhccCccc-----cccceEEeccccCCCccccChHHHHHH Q lcl|Aclame:pro 278 QFNQQHITLAGYEK-------ETQTPADELAASRTARAAVFIRNDPAR-----PTQTGELVGMLPAPKGKRFTMTEQQTL 345 (498) Q Consensus 278 ~~N~~~~t~~~~~~-------~~~~p~~~~AAa~~a~~a~~l~~DPAr-----pl~tl~L~Gl~~p~~~~r~~~~er~~l 345 (498) ..|+.|..+.+..- ....|+.. .+|+..| +.|..+ |.|...+.++ .....|++.| T Consensus 543 ~~nSsraaly~PwVkv~d~~~~r~vPpSg---aIAGL~A---RtD~erGvw~SPANrgii~~~-------~~s~se~d~L 609 (742) T protein:vir:58 543 YINSSFATTFFPWVRRLTNKGMRTVPASL---AAYRSIR---TTDPETGLAPVGARRGVVTGE-------PVRQVDWEDL 609 (742) T ss_pred ccCCceEEEEeceeeeccCCcceeechHH---HHHHHHH---HhccCCceEecCCcceeeecc-------ccchhhHHHH Confidence 78888887764210 01124332 3344443 233222 3343222222 2456899999 Q ss_pred HhCCeeEEEE-cCCeEEEEeeeeeeeecCCCCCCchhhhhhhHHHHHHHHHHHHHHHhhhcCCceeccCCCCcCCCcccc Q lcl|Aclame:pro 346 LSHGVATAYV-ESGVLRIQRDVTTYRKNAYGVADNSYLDSETLHTSAYVLRKLKSVITSKYGRHKLASDGTRFGPGQAIV 424 (498) Q Consensus 346 L~~Gist~~v-~~G~v~IeR~ITTY~~n~~G~~D~s~ldi~t~~tl~yv~~~~r~~~~~~~~r~kla~dg~~~~~g~~iv 424 (498) ..+||.++.- +.| .++--.-|. . ..|+.|+.|.+.|+.+|+.+.++.... .|-.+.+.. . T Consensus 610 N~~GINtIrsfG~G-~rlWGnRTl-----a-ssDs~wryInVRRlfd~Ie~SI~~a~q-~~VfEPNd~-----------~ 670 (742) T protein:vir:58 610 YNNRINPIVRVGND-VLLFGQKTM-----L-NVNSALNRINVRRLLIVMRNRISQILS-SYLFENNTS-----------E 670 (742) T ss_pred hhCCceEEEECCCc-EEEEcceec-----C-CCCcccceEeehhhHHHHHHHHHHHHH-HhccCCCCH-----------H Confidence 9999999864 346 444343342 1 347889999999999999999998664 443332211 1 Q ss_pred cHHHHHHHHHHHHHHHhhcccccchhhhcCeEEEEEcCCC------CeEEEEEeeeEEecCeEEEeeeeeeEEEecccCC Q lcl|Aclame:pro 425 TPAVIKGELLATYRQLERAGIVENYELFKQYLVVERDASV------PNRLNTLFPPDYVNQLRVFAVVNQFRLQYSEESA 498 (498) Q Consensus 425 Tp~~ikaeli~~~~~le~~given~~~~~~~lvVerd~~d------~nRvn~~~p~~~vn~l~v~A~~~~f~lq~~~~~~ 498 (498) +-..||..+-+.++.|..+|.+.++.. .-|.+| .+|+.+.+-...+...+ .|.|++...+.-+ T Consensus 671 L~~sIk~sInafL~~L~aqGALlGfrV-------~lDetNTpeDI~~Gklvv~I~vAP~~PAE----fI~lrf~it~tga 739 (742) T protein:vir:58 671 NRLRAEALVRQYLESLRLRGAVTDYEV-------AIDSVTTPTDIDNNTLRARVTVQPARSIE----YIDITFVITPTGV 739 (742) T ss_pred HHHHHHHHHHHHHHHHHhCCceeeeEE-------EEcCCCCHHHhhCCEEEEEEEEEccCCcc----eEEEEEEEEeccc Confidence 447789999999999999999998532 222222 23566655555555543 3345555555555 No 62 >protein:vir:103168 Length: 641 # NCBI annotation: gp122 # Family: family:all:661 # MgeID: mge:1583 # MgeName: Syn9 # Cross-refs: genbank:acc:YP_717789;genbank:gi:113200626;genbank:GeneID:4239177 Probab=97.17 E-value=3.9e-05 Score=44.75 Aligned_cols=398 Identities=14% Similarity=0.079 Sum_probs=124.0 Q ss_pred CccchhhcCcccccCeEEEEEecCCC-CCCCCCccEEEEEecCCCCccccceeEEecChHHHHHhhCc---CcHHHHHHH Q lcl|Aclame:pro 1 MTISFNTIPSNTLVPLFYAEMDNQAA-NTAQDSGASLLIGHANNGAEIVANSLVLMPSADYARQICGA---GSQLARMVE 76 (498) Q Consensus 1 M~i~f~~Ip~~~rvPg~y~E~dns~a-~~~~~~~~vLliGq~~~~g~~~~~~~~~v~s~~~A~~~fG~---GS~l~~M~~ 76 (498) |.|.. ..-||+|+|--.+.. -.........+||. ...++.++|++|+|..|-...||. ++-+..|++ T Consensus 1 ~~m~~------~~sPGVyv~E~~~~~~i~~v~tsvaafvG~---~~~GP~~~p~~v~s~~d~~~~FG~~~~~~~l~~av~ 71 (641) T protein:vir:10 1 MSVSN------QLSPGVVIQERDLTAVTTPIGLNVGVLAAP---FTKGPVEEIFEVSTERDLASVFGEPNDYNYEYWFTA 71 (641) T ss_pred CCCcc------ccCCceEEEEecCCCcccccCCccceEEec---ccCCCCCccEEecCHHHHHHHcCCcCCCcchHHHHH Confidence 77662 467999998544332 22233457778874 457788999999999999999996 688888999 Q ss_pred HHHHhCCCceEEEEEecCCccceeEEEEEEeeeccCCcEEEEEEccEEEEEEeecCCCHHHHHHHHHHHHhcCCCceEE- Q lcl|Aclame:pro 77 AYRQTDPFGELYVIAVPEATGAAATVTLTVTGEATESGTVNVYVGRTRVQAPVTNGDNVTTIASSIQDAINAVPTLPFT- 155 (498) Q Consensus 77 a~~~~n~~~~l~~i~l~d~ag~aatg~ititgtat~~G~l~l~I~g~~v~v~V~~gdtaa~iA~~l~~aIn~~~~lpVt- 155 (498) .|+.+. -..+|++.+.+.....++.. ..+.+ |.......-+ T Consensus 72 ~fF~ng-G~~~~vvRv~~~~~~~a~~~--------~~~~~-----------------------------~~~~~~~~~~~ 113 (641) T protein:vir:10 72 SQFLSY-GGVLKAIRLNAASLKNSVDS--------GTAPL-----------------------------IKNLQEYETTY 113 (641) T ss_pred HHHHhc-CCEEEEEEecCccccccccc--------cchhh-----------------------------ccccccccccc Confidence 999655 57899999865332222110 00000 0001111111 Q ss_pred EeeccceEEEeeccCcccccceeEEEEecccCccccc---------ccceeeeecccCCCcCcchh-hhHHHh---hcc- Q lcl|Aclame:pro 156 ASSSAGVVTLTARHKGLCGNEIPVSLNYYGFGGGEVL---------PAGVQIAVATGTAGTGAPVL-TGAVAA---MAD- 221 (498) Q Consensus 156 A~~~~~~VtlTAk~kG~~gN~i~l~~~~~~~~~ge~~---------p~Glt~tit~~agGag~pD~-~~alaa---lg~- 221 (498) .......++++||+.|.+||.|.+.+.-.+.+..... -......++..+++++.++- ...+.. ++. T Consensus 114 ~~~~~~~~~~~A~~pG~~gn~l~v~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~ 193 (641) T protein:vir:10 114 ESSNSNTFKFASRDAGALGNSVGIFITDAGPDQIAVLPAPGTGNEWEFVADEAVTAASGASAKVYRYSIRLTLTNVVGTF 193 (641) T ss_pred cCcCccccEEEeccCCCcCCceEEEEEcCCCcceeeeecccccccceeccceeeeeccCcccccccccccccccceeeec Confidence 1123456899999999999999998753332110000 00111223333444444321 111111 110 Q ss_pred --CcceEEEecCCChHHHHHHHHHHhhhhhhhhhhhheeeEEEEeccCCHHHHHhhhhccCcceEEEEecCCCCCCcHHH Q lcl|Aclame:pro 222 --EPFDYIGLPFNDTASVNTLVTEMNDTSGRWSYARQLYGHVYTAKTGTLSELVNAGDQFNQQHITLAGYEKETQTPADE 299 (498) Q Consensus 222 --~~~~~I~~p~tD~a~l~al~~~l~~~s~r~~~~~q~~g~~~~~~~gt~~~~~t~g~~~N~~~~t~~~~~~~~~~p~~~ 299 (498) ..-..|...-+|... ..+. ...+++... +.. .-....|.......+..+-+ ..++.. . +.+.-.+. T Consensus 194 ~~~~~~~i~~a~~~~~~-~~~~----~~a~~~~~~--i~~-~~~g~~g~~~~~~~~t~gt~--~~t~a~-~-g~~~~~~~ 261 (641) T protein:vir:10 194 VPGGATTISISGSDESV-DVLA----WDAGNKYLE--IAL-PAGGVTGIFADAQVVTQGTN--TAAIAS-S-GIERRLYI 261 (641) T ss_pred ccCCcceeEeccccccc-cccc----ccCCcceee--eee-cCCcceeeeeeeeeccCCcc--ceeeec-c-cchhhhhh Confidence 111122222222211 1110 011111100 000 00000000000000000000 001100 0 00000000 Q ss_pred H--H--HHHHHHhhhhhccCccccccceE----------EeccccCCCccccCh----HHHHHHHhCCeeEEEEc-CCe- Q lcl|Aclame:pro 300 L--A--ASRTARAAVFIRNDPARPTQTGE----------LVGMLPAPKGKRFTM----TEQQTLLSHGVATAYVE-SGV- 359 (498) Q Consensus 300 ~--A--Aa~~a~~a~~l~~DPArpl~tl~----------L~Gl~~p~~~~r~~~----~er~~lL~~Gist~~v~-~G~- 359 (498) . + .++++..+......++ ...+. ++|..-+....|-.- .++ ....+-...+.++ +|. T Consensus 262 ~~~~~~ia~aat~ag~~g~~~~--v~~~~v~d~~~a~~~~~g~~~~~va~~~gts~~a~~~-g~~~D~~~~lv~d~~~~~ 338 (641) T protein:vir:10 262 GKDSGSINFAATDAVVDTNATS--ATISSVRNEYAEREYLPGSKWVNVAARPGTSLYANSV-GGVNDELHVLVIDVDGKI 338 (641) T ss_pred ccccccceeeeeccccccccee--eEeeeeeeeecccccccccccccccccchhhhhhhhc-CCcccceEEEEEeeccee Confidence 0 0 0000000000000000 00000 000000000000000 000 0000111111121 110 Q ss_pred -----EEEEeeeeeeeecCCCCCCchhhhhhhHHHHHHHHHHHHHHHhhhcCCceeccCCCCcCCCcccccHHHHHHHHH Q lcl|Aclame:pro 360 -----LRIQRDVTTYRKNAYGVADNSYLDSETLHTSAYVLRKLKSVITSKYGRHKLASDGTRFGPGQAIVTPAVIKGELL 434 (498) Q Consensus 360 -----v~IeR~ITTY~~n~~G~~D~s~ldi~t~~tl~yv~~~~r~~~~~~~~r~kla~dg~~~~~g~~ivTp~~ikaeli 434 (498) ..+|+-+.. +-+...++ +.-+..|+...++. .++|-+++-..... T Consensus 339 ~g~~g~v~e~~~~~--s~~~~~~~-------~~~~~~~~~~~~~~--~s~~v~~~~~~~~~------------------- 388 (641) T protein:vir:10 339 TGNPGSVLERFIGV--SKASDAKT-------SIGEVNYYKEVIKQ--QSAYVYWGSHETAP------------------- 388 (641) T ss_pred eccccceeeeeecc--cccCCccc-------ccccceeeeeeecc--ccceEEEecccccc------------------- Confidence 112222211 00000000 00011122222221 12222111110000 Q ss_pred HHHHHHhhcccccchhhhcCeEEEEEcCCCCeEEEEEeeeEEecCeEEEeeeee----eEEEecc----------cCC Q lcl|Aclame:pro 435 ATYRQLERAGIVENYELFKQYLVVERDASVPNRLNTLFPPDYVNQLRVFAVVNQ----FRLQYSE----------ESA 498 (498) Q Consensus 435 ~~~~~le~~given~~~~~~~lvVerd~~d~nRvn~~~p~~~vn~l~v~A~~~~----f~lq~~~----------~~~ 498 (498) |......-...++.. ..+.+.... .+. ..-......+-..+....+ +.|.=-. ..+ T Consensus 389 --~~~~~~~~~~~~~~~----~~~~~~~~~-~~~-~~~~~~~~~~~~~~~~~~~~~~~~~l~gG~d~~~~~~~~~~~~ 458 (641) T protein:vir:10 389 --FLGTAANAAAGDWGA----SALNRRYNL-LRS-TAGTTSFPAGAVTVGSKNNATHYYRLANGADYSASGALYNLSN 458 (641) T ss_pred --ccccccccccccccc----ccccccccc-ccc-ccccccccccccccCCCCcceeEEEeecCcccccccccccccc Confidence 000000000111110 000000000 000 0000011112222222221 2221000 000 No 63 >protein:vir:3165 Length: 426 # NCBI annotation: capsid protein CP67 # Family: family:all:28419 # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665936;genbank:gi:22091122;genbank:GeneID:951267 Probab=97.14 E-value=0.00015 Score=41.48 Aligned_cols=407 Identities=13% Similarity=0.089 Sum_probs=180.9 Q ss_pred cCcccccCeEEEEEecCCCCCCCCC-ccEEEEEecCC-CCccccceeEEecChHHHHHhhCcCcHHHHHHHHHHHhCCCc Q lcl|Aclame:pro 8 IPSNTLVPLFYAEMDNQAANTAQDS-GASLLIGHANN-GAEIVANSLVLMPSADYARQICGAGSQLARMVEAYRQTDPFG 85 (498) Q Consensus 8 Ip~~~rvPg~y~E~dns~a~~~~~~-~~vLliGq~~~-~g~~~~~~~~~v~s~~~A~~~fG~GS~l~~M~~a~~~~n~~~ 85 (498) .|. +-+-+.++-+-+..+... .-+||||-... .-+..-+.+.+.+|.+++.+=||.+|..++++++++++.+.. T Consensus 1 m~~----~iVnV~Is~~t~A~~~~~Fg~~liigs~~~~~p~~~f~~~~~Yss~~~V~~Dfg~~s~~Y~AA~~~f~Q~~~~ 76 (426) T protein:vir:31 1 MPK----QIVEIELTAEIADRPQETFTDAAIVGTAEEEPPDAEFGEVNQYSTSTSVGDDYGEDSDVYTASEAIEEMGAEQ 76 (426) T ss_pred CCc----ceEEEEeecccccccccccceeeeeeeccccccccccchhhhhhhHHHHHhcCCCChHHHHHHHHHHhCCcee Confidence 331 333344433223333332 34888885532 111123456778899999999999999999999999986442 Q ss_pred eEEEEEecCCccceeEEEEEEeeeccCCcEEEEEEccEEEEEEeecCCCHHHHHHHHHHHHhcCCC-ceEEEeeccceEE Q lcl|Aclame:pro 86 ELYVIAVPEATGAAATVTLTVTGEATESGTVNVYVGRTRVQAPVTNGDNVTTIASSIQDAINAVPT-LPFTASSSAGVVT 164 (498) Q Consensus 86 ~l~~i~l~d~ag~aatg~ititgtat~~G~l~l~I~g~~v~v~V~~gdtaa~iA~~l~~aIn~~~~-lpVtA~~~~~~Vt 164 (498) . ..+.+ ....++-.+ + +-...|+|..++..-...++++.|...+.+..+..+. .++.-....+.++ T Consensus 77 ~-r~~v~-------~at~~~~~~-~----t~~~tv~g~~~s~~a~~~~~a~~i~~~~~~~~~~~~~~~~~~~~t~~g~~t 143 (426) T protein:vir:31 77 W-RVMVL-------EATEVTEEE-L----SDGDTIDKVPILGNHEVESPDGDIEFTTDDDPDVEDFDAEIVINSATGDVA 143 (426) T ss_pred E-Eeecc-------ccceeeecc-C----CcceeecceeeeecccCcchHHHHHHhhccccccccceeeeEeccccceee Confidence 1 11111 111111111 1 2224578887776666667777777666665443321 1111111111111 Q ss_pred EeeccCcccccceeEEEEecccCcccccccceeeeecccCCCcCcchhhhHHHhhccCcceEEEecCCChHHHHHHHHHH Q lcl|Aclame:pro 165 LTARHKGLCGNEIPVSLNYYGFGGGEVLPAGVQIAVATGTAGTGAPVLTGAVAAMADEPFDYIGLPFNDTASVNTLVTEM 244 (498) Q Consensus 165 lTAk~kG~~gN~i~l~~~~~~~~~ge~~p~Glt~tit~~agGag~pD~~~alaalg~~~~~~I~~p~tD~a~l~al~~~l 244 (498) .+ ... +. +.+... ....-++++ . ++. .+.+-+..+.+ -+...++.+. T Consensus 144 ~~--~~~-----~~--~~~s~~--dw~~~~~~~-------s-----~~~--~~~ia~~~~~~-----~~~~~~~~~~--- 190 (426) T protein:vir:31 144 TS--EDS-----IE--LTYFHA--DWSQLDEFP-------S-----DVN--NFAVADRRFDL-----KGVGVLDETH--- 190 (426) T ss_pred cc--ccc-----ee--eeeccC--cchhhhccc-------c-----cch--hhhhhccccch-----hhhhhhHhhh--- Confidence 11 000 00 000000 000000010 0 000 01111111110 0001111122 Q ss_pred hhhhhhhhhhhheeeEEEEeccCCHHHH-HhhhhccCcceE----EEEecCCCCCCcHHHHHHHHHHHhhhhhccCcccc Q lcl|Aclame:pro 245 NDTSGRWSYARQLYGHVYTAKTGTLSEL-VNAGDQFNQQHI----TLAGYEKETQTPADELAASRTARAAVFIRNDPARP 319 (498) Q Consensus 245 ~~~s~r~~~~~q~~g~~~~~~~gt~~~~-~t~g~~~N~~~~----t~~~~~~~~~~p~~~~AAa~~a~~a~~l~~DPArp 319 (498) -|.....+..++.+.-..++... ..++..+...-. ..+.+. ....|. . .+......+ .++|-++ T Consensus 191 -----~wa~~~~i~~va~~~e~~~~~~~~~~~a~~~~~~~y~p~~~~~~~~-~~~~~~-~-~~~~~~~~a---a~~~~~~ 259 (426) T protein:vir:31 191 -----SWASDEDMGMIANGVNVDDYDSVDEAMDVAHEVAGYVPSGDLMMIV-DASDDD-L-AAYQLGKFA---VSEPWYN 259 (426) T ss_pred -----hhhhhcceeeeeeccchhhhcchhhhhhhhhcccccccchhheeeh-hccccc-h-hhHHhhhhh---hhccccc Confidence 22222223233222222222221 222222211111 122211 111111 1 111111111 3444333 Q ss_pred cc-------ceEEeccccCCCccccChHHHHHHHhCCeeEEEEcCCeEEEEeeeeeeeecCCCCCCchhhhhhhHHHHHH Q lcl|Aclame:pro 320 TQ-------TGELVGMLPAPKGKRFTMTEQQTLLSHGVATAYVESGVLRIQRDVTTYRKNAYGVADNSYLDSETLHTSAY 392 (498) Q Consensus 320 l~-------tl~L~Gl~~p~~~~r~~~~er~~lL~~Gist~~v~~G~v~IeR~ITTY~~n~~G~~D~s~ldi~t~~tl~y 392 (498) .- +....-+.=|... +.-..+....+.+-+-.|+.-+|.++|-|.+||= -+..|-.|.| ++|.+|| T Consensus 260 ~~~~~~~~~~~~~~~~~~~gv~-~t~~~~~~A~~~~~~n~~~~~~~~~~i~~~~~~~----G~~~~G~~iD--~~~g~dw 332 (426) T protein:vir:31 260 PLWNELPAGETVSKNVGDPEEQ-GTFEGGDEAEGEGPVNVLIDVSDANRVSNAVTTA----GADSDTSFFD--IRRTKVY 332 (426) T ss_pred hhhhhccccccceeeccccccc-cccchhhhhhhcCCceEEEEecCceeeecceeec----ccccchhhhh--hHHHHHH Confidence 21 1111111112222 2223334456665566666667889999998872 2344555655 5789999 Q ss_pred HHHHHHHHHhhhcC-CceeccCCCCcCCCcccccHHHHHHHHHHHHHHHhhc--ccccchhhhcCeE-EEEEcCCCCeEE Q lcl|Aclame:pro 393 VLRKLKSVITSKYG-RHKLASDGTRFGPGQAIVTPAVIKGELLATYRQLERA--GIVENYELFKQYL-VVERDASVPNRL 468 (498) Q Consensus 393 v~~~~r~~~~~~~~-r~kla~dg~~~~~g~~ivTp~~ikaeli~~~~~le~~--given~~~~~~~l-vVerd~~d~nRv 468 (498) +...+|..+.+.+- ..|+.=+.. | -.+|++.+-..+++.... .++..+....-.. ....|-.+|+-= T Consensus 333 l~~~iq~~l~~ll~~~~KIpyt~~----G-----i~~I~~~i~~~L~~~v~~~g~~~~~y~v~~P~~~~~~~dra~R~~~ 403 (426) T protein:vir:31 333 TAEMLELDLESLQVSDDDVPFTED----G-----QAMIEDAIKGTMSGLTGSVGQPLAEYEVDVPEWDDDDVDRVNRNWG 403 (426) T ss_pred HHHHHHHHHHHHhhcCCCCccchh----H-----HHHHHHHHHHHHHHHhcCCCccccceeecCCCccccchhhhhhccC Confidence 99999999976653 334333222 1 257888888877655443 2222222111100 011233344445 Q ss_pred EEEeeeEEecCeEEEeeeeeeEE Q lcl|Aclame:pro 469 NTLFPPDYVNQLRVFAVVNQFRL 491 (498) Q Consensus 469 n~~~p~~~vn~l~v~A~~~~f~l 491 (498) ++.|-..+.+-+|.+-.+..+.| T Consensus 404 ~i~~~~~laGAIh~v~I~g~v~v 426 (426) T protein:vir:31 404 GIDLDARLAQRAHTFSLGLNVSV 426 (426) T ss_pred CceEEEEEeCcEEEEEEEEEEeC Confidence 68889999999999988888888 No 64 >protein:vir:3788 Length: 376 # NCBI annotation: tail sheath # Family: family:all:669 # MgeID: mge:328 # MgeName: HP2 # Cross-refs: genbank:acc:NP_536828;genbank:gi:17981837;genbank:GeneID:929216 Probab=96.92 E-value=0.00026 Score=40.25 Aligned_cols=346 Identities=16% Similarity=0.193 Sum_probs=181.3 Q ss_pred ccCeEEEEEecCCC---CCCCCCccEEEEEecCCCCccccceeEEecChHHHHHhhCcC-cHHHHHHHHHHHhCCCceEE Q lcl|Aclame:pro 13 LVPLFYAEMDNQAA---NTAQDSGASLLIGHANNGAEIVANSLVLMPSADYARQICGAG-SQLARMVEAYRQTDPFGELY 88 (498) Q Consensus 13 rvPg~y~E~dns~a---~~~~~~~~vLliGq~~~~g~~~~~~~~~v~s~~~A~~~fG~G-S~l~~M~~a~~~~n~~~~l~ 88 (498) --|= +.+|+-+- .+++-...+|+||+... ..+....+.+.++-+.++|.. |.|-..++|++.+ T Consensus 1 ~~~~--v~vn~~n~~~g~~~~~er~~Lfig~~~~----~~~~~~~~~~~sdld~~lg~~~~~lk~~v~aa~~n------- 67 (376) T protein:vir:37 1 MFPS--VQINALNQLSGETKEIERHALFVGVGTT----NQGKLLALTPDSDFDKVFGETDTDLKKQVRAAMLN------- 67 (376) T ss_pred CCCe--EEEecccccCCCcccccceEEeeccccc----cccceeeecCccchHhhhCCCchHHHHHHHHHHhC------- Confidence 2233 45555322 46677899999998543 356777888888899999987 8898889888733 Q ss_pred EEEecCCccceeEEEEEEeeeccCCcEEEEEEccEEEEEEeecCCCHHHHHHHHHHHHhcCCCceEEEeeccceEEEeec Q lcl|Aclame:pro 89 VIAVPEATGAAATVTLTVTGEATESGTVNVYVGRTRVQAPVTNGDNVTTIASSIQDAINAVPTLPFTASSSAGVVTLTAR 168 (498) Q Consensus 89 ~i~l~d~ag~aatg~ititgtat~~G~l~l~I~g~~v~v~V~~gdtaa~iA~~l~~aIn~~~~lpVtA~~~~~~VtlTAk 168 (498) +|...+..++. + ..+.+++.+++..+ |..-+ + T Consensus 68 -------aG~~~~~~~~~--~----------------------~~~~~~~~~Av~~a-~~~~s--~-------------- 99 (376) T protein:vir:37 68 -------AGQNWFAHVYI--A----------------------QEDGYDFVECVKKA-NQTAS--F-------------- 99 (376) T ss_pred -------CCCcEEEEEEe--e----------------------cCCchHHHHHHHHh-hhhcC--c-------------- Confidence 22222211111 1 11223344444433 21111 0 Q ss_pred cCcccccceeEEEEecccCcccccccceeeeecccCCCcCcchhhhHHHhhccCcceEEEecC-CChHHH---HHHHHHH Q lcl|Aclame:pro 169 HKGLCGNEIPVSLNYYGFGGGEVLPAGVQIAVATGTAGTGAPVLTGAVAAMADEPFDYIGLPF-NDTASV---NTLVTEM 244 (498) Q Consensus 169 ~kG~~gN~i~l~~~~~~~~~ge~~p~Glt~tit~~agGag~pD~~~alaalg~~~~~~I~~p~-tD~a~l---~al~~~l 244 (498) .++ +++.|- +|.+.. .+++++| T Consensus 100 -------------------------E~V-----------------------------~v~~pv~t~~a~i~aa~~~a~el 125 (376) T protein:vir:37 100 -------------------------EYC-----------------------------VNTRYLGVDKASIGKLQECYAEL 125 (376) T ss_pred -------------------------eEE-----------------------------EEeccccccHHHHHHHHHHHHHH Confidence 001 011111 122222 2223333 Q ss_pred hhhhhhhhhhhheeeEEEEe---ccCCHH----HHHhhhhccCcceEEEEecCCCCCCcHHHHHHHHHHHhhhhhccCcc Q lcl|Aclame:pro 245 NDTSGRWSYARQLYGHVYTA---KTGTLS----ELVNAGDQFNQQHITLAGYEKETQTPADELAASRTARAAVFIRNDPA 317 (498) Q Consensus 245 ~~~s~r~~~~~q~~g~~~~~---~~gt~~----~~~t~g~~~N~~~~t~~~~~~~~~~p~~~~AAa~~a~~a~~l~~DPA 317 (498) ...-+||-+. -+.+.++.. ...|+. .+.+....+.+.++.++|..-+ ......+..+| .++.+.+.+|+ T Consensus 126 ~~~~~Rpv~f-ile~r~~~~~~~~~e~w~~y~~~~~al~~gia~~~V~~V~~~~g--n~~G~~aGRl~-~aaVsVadspg 201 (376) T protein:vir:37 126 LAKFGRRTFF-IQAVQGINHDQSDGETWDQYVQKLTTLQQTIVADHVCLVPLLFG--NETGVLAGRLA-NRAVTVADSPA 201 (376) T ss_pred HHhcCCeEEE-EEeccCcCcccccccCHHHHHHHHHHhhcccccccceeeeeehh--hhHHHHHHHHh-hcccchhhCcc Confidence 2222233211 011111100 001222 4455555667777776654211 11122222222 24555678999 Q ss_pred ccccceEEecc----ccCCC-ccccChHHHHHHHhCCeeEEEE--c-CCeEEEEeeeeeeeecCCCCCCchhhhhhhHHH Q lcl|Aclame:pro 318 RPTQTGELVGM----LPAPK-GKRFTMTEQQTLLSHGVATAYV--E-SGVLRIQRDVTTYRKNAYGVADNSYLDSETLHT 389 (498) Q Consensus 318 rpl~tl~L~Gl----~~p~~-~~r~~~~er~~lL~~Gist~~v--~-~G~v~IeR~ITTY~~n~~G~~D~s~ldi~t~~t 389 (498) | ..+..|.|+ +|++. ...++....+.|=..|.+++.. + +| +++-+.-|. -.+...|..||..|+ T Consensus 202 R-V~tG~l~gl~~~~lp~d~~~~~l~~a~l~aLd~agy~vp~~Y~gy~G-~Y~~d~~tl------~~~gsDY~~ie~~RV 273 (376) T protein:vir:37 202 R-VQTGALVSLGSANKPLDKDRNELTLAHLKSLETARYSVPMWYPDYDG-YYWADGRTL------DVEGGDYQVIENLRV 273 (376) T ss_pred c-eeccccccccccccccCcCcccCCHHHHHHHHhCCCeEEEeeCCCCc-eEEeCceEe------ccCCCChhhhhhhhH Confidence 8 778888887 24433 3578999999999999999865 4 56 777777665 234456999999999 Q ss_pred HHHHHHHHHHHHhhhcCCceeccCCCCcCCCcccccHHHHHHHHHHHHHHHhhcccccchhhhcCeEEEEEcC------C Q lcl|Aclame:pro 390 SAYVLRKLKSVITSKYGRHKLASDGTRFGPGQAIVTPAVIKGELLATYRQLERAGIVENYELFKQYLVVERDA------S 463 (498) Q Consensus 390 l~yv~~~~r~~~~~~~~r~kla~dg~~~~~g~~ivTp~~ikaeli~~~~~le~~given~~~~~~~lvVerd~------~ 463 (498) .+-..|.+|...-.+.....|.+. ||. -...|.-+..-+++|.+...+... .|...+.+..|. . T Consensus 274 vdKa~R~vR~~ai~~i~D~~lnst-----~~s----ia~~~~yi~~pLr~M~~s~~i~g~-~fpGeI~~p~d~Di~i~w~ 343 (376) T protein:vir:37 274 VDKVARKVRLLAIGKIADRSFNST-----TSS----TEYHKNYFAKPLRDMSKSATINGK-DFPGECMPPKDDAITIVWQ 343 (376) T ss_pred HHHHHHHHHHHHHHHhCCcccCcc-----hhh----HHHHHHHHHHHHHHHHhcchhccc-cccceeecCCCCCceEEee Confidence 999999999876655444444321 111 123344455667777665554443 344555443332 2 Q ss_pred CCeEEEEEeeeEEecCeEEEeeeeeeEEE-ecc Q lcl|Aclame:pro 464 VPNRLNTLFPPDYVNQLRVFAVVNQFRLQ-YSE 495 (498) Q Consensus 464 d~nRvn~~~p~~~vn~l~v~A~~~~f~lq-~~~ 495 (498) .+.+|.+.+-..-.|=-.-|=..|.|-|. +-| T Consensus 344 s~~~V~I~~~v~P~~~pk~Itv~I~Ldlsn~~~ 376 (376) T protein:vir:37 344 SKTKVTIYIKVRPYDCPKEITANIFLDLDSLGE 376 (376) T ss_pred ccceEEEEEEEEeccCCceEEEEEEeecCCCCC Confidence 34455555543333333344444445444 222 No 65 >protein:vir:78782 Length: 370 # NCBI annotation: putative tail sheath protein # Family: family:all:669 # MgeID: mge:1857 # MgeName: phiO18P # Cross-refs: genbank:acc:YP_001285652;genbank:gi:148727158;genbank:GeneID:5220132 Probab=96.48 E-value=0.00059 Score=38.30 Aligned_cols=345 Identities=19% Similarity=0.209 Sum_probs=175.7 Q ss_pred ccCeEEEEEecCCC---CCCCCCccEEEEEecCCCCccccceeEEecChHHHHHhhCcC-cHHHHHHHHHHHhCCCceEE Q lcl|Aclame:pro 13 LVPLFYAEMDNQAA---NTAQDSGASLLIGHANNGAEIVANSLVLMPSADYARQICGAG-SQLARMVEAYRQTDPFGELY 88 (498) Q Consensus 13 rvPg~y~E~dns~a---~~~~~~~~vLliGq~~~~g~~~~~~~~~v~s~~~A~~~fG~G-S~l~~M~~a~~~~n~~~~l~ 88 (498) --| ++.+|+-+- .+++-...+|+||... ...++...+...++-.+++|.. |.|-..++||+.+- ++-| T Consensus 1 ~~~--~v~vn~~n~~~g~~~~~er~~lfig~~~----~~~g~~~~~~~~sdld~~l~~~ds~lk~~v~aa~~na--G~~~ 72 (370) T protein:vir:78 1 MWP--YVQIYNLNQMQGPVTEVERHLLFIGSAA----SNTGKLLSLNAQSDFDQLLGAADSELKANLLAARDNA--GQNW 72 (370) T ss_pred CCc--eEEEeeccccCCCcCccceeEEEEeccc----ccccceEeecCccCHHHhcCCcChhHHHHHHHHHhCC--CCce Confidence 223 345555333 4667788999999754 3457778888888999999987 89999998887432 2222 Q ss_pred EEEecCCccceeEEEEEEeeeccCCcEEEEEEccEEEEEEeecCCCHHHHHHHHHHHHhcCCCceEEEeeccceEEEeec Q lcl|Aclame:pro 89 VIAVPEATGAAATVTLTVTGEATESGTVNVYVGRTRVQAPVTNGDNVTTIASSIQDAINAVPTLPFTASSSAGVVTLTAR 168 (498) Q Consensus 89 ~i~l~d~ag~aatg~ititgtat~~G~l~l~I~g~~v~v~V~~gdtaa~iA~~l~~aIn~~~~lpVtA~~~~~~VtlTAk 168 (498) .+++.. -.+..++.+++..+ |+.- ++ T Consensus 73 ------------~~~~~p-------------------------~~~~~d~~~Av~~a-~~~~--s~-------------- 98 (370) T protein:vir:78 73 ------------SAAAYV-------------------------LPTDKPWLDAARDA-QQTQ--SF-------------- 98 (370) T ss_pred ------------EEEEEE-------------------------ecCchhHHHHHHHH-HhhC--Cc-------------- Confidence 111100 01122344444333 2211 11 Q ss_pred cCcccccceeEEEEecccCcccccccceeeeecccCCCcCcchhhhHHHhhccCcceEEEecCCChHHHHHHHHHHhhhh Q lcl|Aclame:pro 169 HKGLCGNEIPVSLNYYGFGGGEVLPAGVQIAVATGTAGTGAPVLTGAVAAMADEPFDYIGLPFNDTASVNTLVTEMNDTS 248 (498) Q Consensus 169 ~kG~~gN~i~l~~~~~~~~~ge~~p~Glt~tit~~agGag~pD~~~alaalg~~~~~~I~~p~tD~a~l~al~~~l~~~s 248 (498) .|=+|+-|-+|.+...++.+...+.. T Consensus 99 ------------------------------------------------------E~V~v~~~~s~~a~~~a~~~~a~el~ 124 (370) T protein:vir:78 99 ------------------------------------------------------EGVVVLGQEWHQAAINAAHALNQELI 124 (370) T ss_pred ------------------------------------------------------cEEEEecCcchHHHHHHHHHHHHHHH Confidence 01111222233333333333322222 Q ss_pred hhhhhhhheeeEEEEeccC-----CHH----HHHhhhhccCcceEEEEecCCCCCCcHHHHHHHHHHHhhhhhccCcccc Q lcl|Aclame:pro 249 GRWSYARQLYGHVYTAKTG-----TLS----ELVNAGDQFNQQHITLAGYEKETQTPADELAASRTARAAVFIRNDPARP 319 (498) Q Consensus 249 ~r~~~~~q~~g~~~~~~~g-----t~~----~~~t~g~~~N~~~~t~~~~~~~~~~p~~~~AAa~~a~~a~~l~~DPArp 319 (498) .++. ++-......++ |.+ .+.+.-....+.++.+++.-.+.. ....+-.+|- ++.+.+.-|+| T Consensus 125 n~~~----Rpv~file~~~~~~~e~w~~y~~~l~al~~gia~~~V~vvp~~~g~~--~G~~aGRL~n-aavsVadsP~R- 196 (370) T protein:vir:78 125 AKWG----RWQFMLLAVPAIADEQDWATYEAELATLQDGIAASSVSLIPQLWPTL--AGAYAGRLCN-RAVSIADSPCR- 196 (370) T ss_pred HhcC----CeEEEEEeecCCCCcCCHHHHHHHHHHhhhccccccceEEeeecccc--HHHHHHHHhc-Ceeeeccccee- Confidence 2221 11122222222 322 344444455566777765432221 1222222222 34445677887 Q ss_pred ccceEEeccc-cCCC--ccccChHHHHHHHhCCeeEEEE--c-CCeEEEEeeeeeeeecCCCCCCchhhhhhhHHHHHHH Q lcl|Aclame:pro 320 TQTGELVGML-PAPK--GKRFTMTEQQTLLSHGVATAYV--E-SGVLRIQRDVTTYRKNAYGVADNSYLDSETLHTSAYV 393 (498) Q Consensus 320 l~tl~L~Gl~-~p~~--~~r~~~~er~~lL~~Gist~~v--~-~G~v~IeR~ITTY~~n~~G~~D~s~ldi~t~~tl~yv 393 (498) .++..|.|+- -|-+ ...++....+.|=.+|-+++.. + +| ++.-+.-|. -.+...|..|+..|+.+-. T Consensus 197 v~tG~l~gl~~~p~d~~~~~l~~a~l~aLd~agy~vp~~Y~gy~G-~Y~~d~~tl------~~~gsDYq~ie~~RVvdKa 269 (370) T protein:vir:78 197 VKTGALVGLGNKPVGKDGIPLPLATLQTLEANRYSVPMWYPDYDG-IYWADGRTL------DAEGGDYQVIENLRIAYKV 269 (370) T ss_pred eeccccccccccccccCCcccCHHHHHHHHhCCCeEEEeeCCCCc-eEEeCceEe------ccCCCChhhhhhhhHHHHH Confidence 5555555542 2332 3457899999999999999865 4 56 777777665 2344569999999999999 Q ss_pred HHHHHHHHhhhcCCceeccCCCCcCCCcccccHHHHHHHHHHHHHHHhhcccccchhhhcCeEEEEEcC------CCCeE Q lcl|Aclame:pro 394 LRKLKSVITSKYGRHKLASDGTRFGPGQAIVTPAVIKGELLATYRQLERAGIVENYELFKQYLVVERDA------SVPNR 467 (498) Q Consensus 394 ~~~~r~~~~~~~~r~kla~dg~~~~~g~~ivTp~~ikaeli~~~~~le~~given~~~~~~~lvVerd~------~d~nR 467 (498) .|.+|...-.+....++.+- +|.- .-.+..+..-++++...+-+-.. .|...+.+..|. ....+ T Consensus 270 ~R~vR~~ai~~i~D~~lnst-----~gsi----a~~~~~~~~~L~ema~s~~i~~~-~fpgeI~~p~d~Di~i~w~s~~~ 339 (370) T protein:vir:78 270 ARRMRLRAIARIGDRSFNST-----PGST----AAAITYFGKDLREMAKSTTINGQ-PFPGDIASPQDGDIRIQWVAKNL 339 (370) T ss_pred HHHHHHHHHHHhCCcccCCC-----Ccch----hHHHHHHHhhHHHHHhhhhhccc-ccceeEeccCCCcceEEeeccce Confidence 99999766555544444321 1110 11223333334444444444333 244444333222 23345 Q ss_pred EEEEeeeEEecCeEEEeeeeeeEEEecccCC Q lcl|Aclame:pro 468 LNTLFPPDYVNQLRVFAVVNQFRLQYSEESA 498 (498) Q Consensus 468 vn~~~p~~~vn~l~v~A~~~~f~lq~~~~~~ 498 (498) |.+.+-..-.|--.-|=..|.|-|-..+..- T Consensus 340 v~I~~~v~P~~~pk~Itv~I~LDls~e~~~~ 370 (370) T protein:vir:78 340 VSVFVVVRTVDCPKGITVNIMLDLSLNNGEG 370 (370) T ss_pred EEEEEEEEeccCCceEEEEEEEeeccccCCC Confidence 5555554444444444445555554443333 No 66 >protein:vir:80052 Length: 331 # NCBI annotation: gp14 # Family: family:all:396 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468718;genbank:gi:157325298;genbank:GeneID:5601743 Probab=96.21 E-value=0.00087 Score=37.37 Aligned_cols=300 Identities=13% Similarity=0.091 Sum_probs=148.7 Q ss_pred HHHHHHHHHhcCCCceEEEeec----cceEEEeeccCcccccceeEEEEecc----cCcccccc----cceeeee----c Q lcl|Aclame:pro 138 IASSIQDAINAVPTLPFTASSS----AGVVTLTARHKGLCGNEIPVSLNYYG----FGGGEVLP----AGVQIAV----A 201 (498) Q Consensus 138 iA~~l~~aIn~~~~lpVtA~~~----~~~VtlTAk~kG~~gN~i~l~~~~~~----~~~ge~~p----~Glt~ti----t 201 (498) +..+|++ -.-.+++++... +--..+ ...++ ..++..... .+-++..| ++..+.- . T Consensus 1 ~~~~iv~---V~v~~~~~~~~~~~~~~~~~~~-~~~t~-----~~~~~y~s~~~v~~d~~~~~~~Ykaa~~~f~Q~~~~~ 71 (331) T protein:vir:80 1 MVETITD---VRVHISVLYPSPRIGLGRPAIF-VKGTA-----MGYKEYTTLEELKDTFADNTEVYAKAKAVFLQKDRPD 71 (331) T ss_pred Cccceec---ceeeecccccccccccCcceeE-Eeccc-----cceEEEechhhhccCCCCCcHHHHHHHHHHhccCccc Confidence 2222222 122222222110 111111 11111 112221100 00011111 0000000 0 Q ss_pred ccCCCcCcc-hhhh-HHHhhccCcceEEEecCCChHHHHHHHHHHhhhhhhhhhhhheeeEEEEeccCCHHHHHhhhhcc Q lcl|Aclame:pro 202 TGTAGTGAP-VLTG-AVAAMADEPFDYIGLPFNDTASVNTLVTEMNDTSGRWSYARQLYGHVYTAKTGTLSELVNAGDQF 279 (498) Q Consensus 202 ~~agGag~p-D~~~-alaalg~~~~~~I~~p~tD~a~l~al~~~l~~~s~r~~~~~q~~g~~~~~~~gt~~~~~t~g~~~ 279 (498) ....+...+ +... ..+.+...|| ++++.-.|.+.+.++..|++.... ++. .....+.+.+..... T Consensus 72 ~i~v~~~~~~~~~~a~~a~~~~~w~-~~~~~~~~~~~~~a~a~~~~a~~~-------~f~---~~~~~~~~~~~~~~~-- 138 (331) T protein:vir:80 72 TVAVITYEDTKLLEAAEAYFLKSWH-FALLAEFKAADALALSNLIEEQKF-------KFA---VFQVTAVADITPLAK-- 138 (331) T ss_pred eEEEeccchHHHHHHHHHhccCcee-EEEeecCCHHHHHHHHHHHhhCCc-------EEE---EEecCchHHHHHhhc-- Confidence 001111111 2223 3333445566 555555566666788888764322 111 222344445544432 Q ss_pred CcceEEEEecCCCCCCcHHHHHHHHHHHhhhhhccCcccc-ccce-EEeccccCCCccccChHHHHHHHhCCeeEEEEcC Q lcl|Aclame:pro 280 NQQHITLAGYEKETQTPADELAASRTARAAVFIRNDPARP-TQTG-ELVGMLPAPKGKRFTMTEQQTLLSHGVATAYVES 357 (498) Q Consensus 280 N~~~~t~~~~~~~~~~p~~~~AAa~~a~~a~~l~~DPArp-l~tl-~L~Gl~~p~~~~r~~~~er~~lL~~Gist~~v~~ 357 (498) | .+..++.+.. . .+ -.++++.+.++ ..||.+- +.-. .|+|+.|+ .++.+|.+.|..+|+..+.-.+ T Consensus 139 ~-~~t~~~~~~~-~-~~--~~~aa~~g~~~---~~~~g~~t~~fk~~l~GV~~~----~lt~t~~~al~~~~~N~y~~~~ 206 (331) T protein:vir:80 139 N-TRTIAIVHSK-T-GE--KLDAALIGNVA---SLPVGSATWKGRHGLAGITSE----ELKVSEIDAIQKAGGMCYIEKA 206 (331) T ss_pred c-ccEEEEEcCC-c-cc--hhHHHHHHHHH---hcCccceeeeeecccCCCCCC----CCCHHHHHHHHhcCceEEEEec Confidence 3 3444444332 2 22 23455555554 5788653 2222 37788764 4899999999999999997767 Q ss_pred CeEEEEeeeeeeeecCCCCCCchhhhhhhHHHHHHHHHHHHHHHhhhcCCc-eeccCCCCcCCCcccccHHHHHHHHHHH Q lcl|Aclame:pro 358 GVLRIQRDVTTYRKNAYGVADNSYLDSETLHTSAYVLRKLKSVITSKYGRH-KLASDGTRFGPGQAIVTPAVIKGELLAT 436 (498) Q Consensus 358 G~v~IeR~ITTY~~n~~G~~D~s~ldi~t~~tl~yv~~~~r~~~~~~~~r~-kla~dg~~~~~g~~ivTp~~ikaeli~~ 436 (498) |+-.+.+.+|+ .| . .|..++-++++...++..+...+-.. |+-=+.. | -.+|++.+-+. T Consensus 207 ~~~~~~~G~~~-----~G----~--~iD~~~~~dWl~~~lq~~l~~ll~~~~kiPy~~~----G-----~~~l~a~i~~~ 266 (331) T protein:vir:80 207 GIAQTSEGKTV-----SG----E--FIDSIHGDDWIKATIETRLQKLLTETDKLTFDAR----G-----IALLQSELTTV 266 (331) T ss_pred CeeEEecceEe-----Cc----h--hHHHHHHHHHHHHHHHHHHHHHHHhCCCCccChh----h-----HHHHHHHHHHH Confidence 77777777764 34 2 46778899999999999997666332 2211111 1 26899999999 Q ss_pred HHHHhhcccccchhhh-cCeEEEE------EcCCC---CeEEEEEeeeEEecCeEEEeeeeeeEE Q lcl|Aclame:pro 437 YRQLERAGIVENYELF-KQYLVVE------RDASV---PNRLNTLFPPDYVNQLRVFAVVNQFRL 491 (498) Q Consensus 437 ~~~le~~given~~~~-~~~lvVe------rd~~d---~nRvn~~~p~~~vn~l~v~A~~~~f~l 491 (498) +++....|+|.--... +..-.|. ...+| +.--.+.|-..+.+..|.+-..+...| T Consensus 267 ~~~av~~G~I~~g~~~~~~~~~v~~~~~~~~s~~dr~~R~~~~~~~~~~~~gaI~~v~i~~~v~~ 331 (331) T protein:vir:80 267 LNEGFANGIIDSNDETGEPNFSITALQRSDLNDDDIAKRNYKGLSFRYKRSGAIHSVDVYGEVEV 331 (331) T ss_pred HHHHHhCCceecCccCCCcceEEEeCchhcCCHHHHhccCCCCeEEEEEEcceEEEEEEEEEEeC Confidence 9999999988521110 1111121 12222 111336677788888888776666666 No 67 >protein:vir:276 Length: 369 # NCBI annotation: putative tail sheath protein # Family: family:all:669 # MgeID: mge:7 # MgeName: K139 # Cross-refs: genbank:acc:NP_536656;genbank:gi:17975134;genbank:GeneID:929090 Probab=95.10 E-value=0.0028 Score=34.60 Aligned_cols=343 Identities=16% Similarity=0.149 Sum_probs=173.1 Q ss_pred CccchhhcCcccccCeEEEEEecCCC---CCCCCCccEEEEEecCCCCccccceeEEecChHHHHHhhCcC-cHHHHHHH Q lcl|Aclame:pro 1 MTISFNTIPSNTLVPLFYAEMDNQAA---NTAQDSGASLLIGHANNGAEIVANSLVLMPSADYARQICGAG-SQLARMVE 76 (498) Q Consensus 1 M~i~f~~Ip~~~rvPg~y~E~dns~a---~~~~~~~~vLliGq~~~~g~~~~~~~~~v~s~~~A~~~fG~G-S~l~~M~~ 76 (498) |+. |= +.+|+-+. .+++-...+|+||.... ....++...+...++-+.++|.. |.|+..++ T Consensus 1 m~~-----------~~--V~in~~n~~qg~~~~ver~~lfig~g~~--~~~~g~~~~~~~~sdld~~lg~~ds~lk~~v~ 65 (369) T protein:vir:27 1 MAW-----------PT--VIIKILNLMNGPIADIECHFLFVIRGTV--SGEVRNLIMVDSTSDLDDVLAEASAEGLAIVK 65 (369) T ss_pred CCC-----------Cc--eEEecccccCCCcccccceEEEEEeccc--cccccceEEecCccchHhhcCCcChhHHHHHH Confidence 442 22 34454333 35566888999986543 23567788888889999999987 99999999 Q ss_pred HHHHhCCCceEEEEEecCCccceeEEEEEEeeeccCCcEEEEEEccEEEEEEeecCCCHHHHHHHHHHHHhcCCCceEEE Q lcl|Aclame:pro 77 AYRQTDPFGELYVIAVPEATGAAATVTLTVTGEATESGTVNVYVGRTRVQAPVTNGDNVTTIASSIQDAINAVPTLPFTA 156 (498) Q Consensus 77 a~~~~n~~~~l~~i~l~d~ag~aatg~ititgtat~~G~l~l~I~g~~v~v~V~~gdtaa~iA~~l~~aIn~~~~lpVtA 156 (498) ||+.+- |....+++... .+..+..+++..+ |... T Consensus 66 aa~~na--------------G~~w~a~~~p~-------------------------~~~~~~~~Av~~a-~~~~------ 99 (369) T protein:vir:27 66 AAQLNG--------------KQAWTAGVMIL-------------------------SEEDNWQDAVKKA-NEVS------ 99 (369) T ss_pred HHHhCC--------------CCceEEEEEEe-------------------------CCchhHHHHHHhh-hhhC------ Confidence 987432 22222222110 0111222333221 1100 Q ss_pred eeccceEEEeeccCcccccceeEEEEecccCcccccccceeeeecccCCCcCcchhhhHHHhhccCcceEEEecCCChHH Q lcl|Aclame:pro 157 SSSAGVVTLTARHKGLCGNEIPVSLNYYGFGGGEVLPAGVQIAVATGTAGTGAPVLTGAVAAMADEPFDYIGLPFNDTAS 236 (498) Q Consensus 157 ~~~~~~VtlTAk~kG~~gN~i~l~~~~~~~~~ge~~p~Glt~tit~~agGag~pD~~~alaalg~~~~~~I~~p~tD~a~ 236 (498) ...|=+|+-|-+|.+. T Consensus 100 ----------------------------------------------------------------s~E~V~v~~p~t~~a~ 115 (369) T protein:vir:27 100 ----------------------------------------------------------------SFEFVVLGFDAETKAM 115 (369) T ss_pred ----------------------------------------------------------------CccEEEEecCcccHHH Confidence 0011112223233332 Q ss_pred ---HHHHHHHHhhhhhhhhhhhheeeEEEE---eccCCH----HHHHhhhhccCcceEEEEecCCCCCCcHHHHHHHHHH Q lcl|Aclame:pro 237 ---VNTLVTEMNDTSGRWSYARQLYGHVYT---AKTGTL----SELVNAGDQFNQQHITLAGYEKETQTPADELAASRTA 306 (498) Q Consensus 237 ---l~al~~~l~~~s~r~~~~~q~~g~~~~---~~~gt~----~~~~t~g~~~N~~~~t~~~~~~~~~~p~~~~AAa~~a 306 (498) ..+++++|...-+||-+.. +.+.++. .-..|+ ..+.+.-..+.+.++++++.-..-..-.+..+..+|. T Consensus 116 i~aaq~~a~el~~~~~R~vffi-~e~~~~~~~~~~~e~w~dy~a~l~al~~g~a~~~V~vv~~~~~~gn~~G~~aGRl~n 194 (369) T protein:vir:27 116 IEDAITLRTELKNSLGREVGVL-CQLPAINNDPTNGQTWSEWLADTVDIPKDVASEYISVVPNVHAAGDTLGKYAGRLAN 194 (369) T ss_pred HHHHHHHHHHHHHhcCCeEEEE-EeccccCCCccccCCHHHHHHHHHHHhhccCcccceeeeeeccccchHHHHHHHHHh Confidence 2333444433233443211 0000000 001122 2344444455677888765321101112333333332 Q ss_pred HhhhhhccCccccccceEEeccc-cCCC--ccccChHHHHHHHhCCeeEEEE--c-CCeEEEEeeeeeeeecCCCCCCch Q lcl|Aclame:pro 307 RAAVFIRNDPARPTQTGELVGML-PAPK--GKRFTMTEQQTLLSHGVATAYV--E-SGVLRIQRDVTTYRKNAYGVADNS 380 (498) Q Consensus 307 ~~a~~l~~DPArpl~tl~L~Gl~-~p~~--~~r~~~~er~~lL~~Gist~~v--~-~G~v~IeR~ITTY~~n~~G~~D~s 380 (498) ++.+.+..|+| ..+..|.|+. -|.+ ..+|+..-...|=..|.+++.. + +| +++-+.-|. -.+..- T Consensus 195 -~aVsIadsp~R-VktG~l~g~~~~p~d~~g~~l~~a~l~aLd~agysvp~~Y~gy~G-~Yw~d~~tl------~~~gsD 265 (369) T protein:vir:27 195 -KEVSIADSPAR-VQTGSVLGNTELMKDKAGKALDLATLKALESNRIAVPMWYPDYPG-QYWTTGRTL------DVPGGD 265 (369) T ss_pred -cccchhcCcce-eeecccccccccccCCCCcccCHHHHHHHHhCCCeEEEeeCCCCc-eEEeCceEe------ccCCCC Confidence 34456788988 6777777763 2222 3458899999999999999865 4 56 777777765 234455 Q ss_pred hhhhhhHHHHHHHHHHHHHHHhhhcCCceeccCCCCcCCCcccccHHH---HHHHHHHHHHHHhhc---ccccchhhhcC Q lcl|Aclame:pro 381 YLDSETLHTSAYVLRKLKSVITSKYGRHKLASDGTRFGPGQAIVTPAV---IKGELLATYRQLERA---GIVENYELFKQ 454 (498) Q Consensus 381 ~ldi~t~~tl~yv~~~~r~~~~~~~~r~kla~dg~~~~~g~~ivTp~~---ikaeli~~~~~le~~---given~~~~~~ 454 (498) |..||.+|+.+-+.|.+|-..-.+..-..|.+ ||.- .+.-+..-+++|... |-|+-.+. + T Consensus 266 Yq~iE~~RVvdKa~R~vR~~Ai~~i~Dr~lns------------tp~sia~~~~~~~~pLr~M~ks~fpgei~~P~d--~ 331 (369) T protein:vir:27 266 YQDIRHIRVAMKAARKVRIRAIARIADRTLNS------------TPQSIAAAKLYFTQDLRTMALTGVPGEIYPPED--E 331 (369) T ss_pred eehhhhhhHHHHHHHHHHHHHHHHhcCccccc------------ChhHHHHHHHHHhhHHHHHHhhcCCeEEecCCC--C Confidence 99999999999999999987766654333332 3333 344555566777643 22222221 1 Q ss_pred eEEEEEcCCCCeEEEEEeeeEEecCeEEEeeeeeeEEEecccCC Q lcl|Aclame:pro 455 YLVVERDASVPNRLNTLFPPDYVNQLRVFAVVNQFRLQYSEESA 498 (498) Q Consensus 455 ~lvVerd~~d~nRvn~~~p~~~vn~l~v~A~~~~f~lq~~~~~~ 498 (498) .+..... ...+|.+.+-..-.|--.-|=..|.+-| + +- T Consensus 332 dI~i~w~--~k~~V~I~~~vrP~~~pk~it~~I~ldl--~--~~ 369 (369) T protein:vir:27 332 DIQIKWV--NSTDVEIYMSVQPYECPVKITIAISVKQ--G--DY 369 (369) T ss_pred ceEEEee--ccceEEEEEEEeeccCCceEEEEEEEec--c--CC Confidence 2221111 2244554444333332222222222222 1 11 No 68 >protein:vir:3751 Length: 376 # NCBI annotation: orf23 # Family: family:all:669 # MgeID: mge:79 # MgeName: HP1 # Cross-refs: genbank:acc:NP_043492;genbank:gi:9628627;genbank:GeneID:1261129 Probab=94.92 E-value=0.0032 Score=34.28 Aligned_cols=335 Identities=17% Similarity=0.218 Sum_probs=184.4 Q ss_pred ccCeEEEEEecC--CC-CCCCCCccEEEEEecCCCCccccceeEEecChHHHHHhhCcC-cHHHHHHHHHHHhCCCceEE Q lcl|Aclame:pro 13 LVPLFYAEMDNQ--AA-NTAQDSGASLLIGHANNGAEIVANSLVLMPSADYARQICGAG-SQLARMVEAYRQTDPFGELY 88 (498) Q Consensus 13 rvPg~y~E~dns--~a-~~~~~~~~vLliGq~~~~g~~~~~~~~~v~s~~~A~~~fG~G-S~l~~M~~a~~~~n~~~~l~ 88 (498) --|=+ .+|+- .. ..++-....|+||+... ..+....+...++-..++|.. |.|-..+.|++.+- T Consensus 1 ~~~~v--~vn~ln~~qg~~~~ver~~lfig~~~~----~~~~~~~~~~~sdld~~lg~~ds~lk~~v~aa~~na------ 68 (376) T protein:vir:37 1 MFPSV--QINALNQLSGETKEIERHALFVGVGTT----NQGKLLALTPDSDFDKVFGETDTDLKKQVRAAMLNA------ 68 (376) T ss_pred CCCeE--EEeeeeccCCCcccccceEEEeecccc----ccCceEEecCCCChHHhhCCCchhHHHHHHHHHhCC------ Confidence 22333 45443 22 46677899999998542 367888888888999999998 89999999997432 Q ss_pred EEEecCCccceeEEEEEEeeeccCCcEEEEEEccEEEEEEeecCCCHHHHHHHHHHHHhcCCCceEEEeeccceEEEeec Q lcl|Aclame:pro 89 VIAVPEATGAAATVTLTVTGEATESGTVNVYVGRTRVQAPVTNGDNVTTIASSIQDAINAVPTLPFTASSSAGVVTLTAR 168 (498) Q Consensus 89 ~i~l~d~ag~aatg~ititgtat~~G~l~l~I~g~~v~v~V~~gdtaa~iA~~l~~aIn~~~~lpVtA~~~~~~VtlTAk 168 (498) |..-+..++. + +.+.+++.+++..+ |+ T Consensus 69 --------G~~w~a~~~~--p----------------------~~~~~~~~~Av~~a-~~-------------------- 95 (376) T protein:vir:37 69 --------GQNWFAHVYI--A----------------------QEDGYDFVECVKKA-NQ-------------------- 95 (376) T ss_pred --------CCceEEEEEe--c----------------------CCChhhHHHHHHHH-Hh-------------------- Confidence 2222111111 1 11222333333332 21 Q ss_pred cCcccccceeEEEEecccCcccccccceeeeecccCCCcCcchhhhHHHhhccCcceEEE--ecC-CChHH---HHHHHH Q lcl|Aclame:pro 169 HKGLCGNEIPVSLNYYGFGGGEVLPAGVQIAVATGTAGTGAPVLTGAVAAMADEPFDYIG--LPF-NDTAS---VNTLVT 242 (498) Q Consensus 169 ~kG~~gN~i~l~~~~~~~~~ge~~p~Glt~tit~~agGag~pD~~~alaalg~~~~~~I~--~p~-tD~a~---l~al~~ 242 (498) ...|.+|+ -|- +|.+. +.++++ T Consensus 96 ----------------------------------------------------~~s~E~V~v~~p~~t~~a~i~a~qa~a~ 123 (376) T protein:vir:37 96 ----------------------------------------------------TASFEYCVNTRYLGVDKASIGKLQECYA 123 (376) T ss_pred ----------------------------------------------------hCCeeEEEEecCcchhHHHHHHHHHHHH Confidence 12222222 121 23333 334455 Q ss_pred HHhhhhhhhhhhhheeeEEEEeccC---------CH----HHHHhhhhccCcceEEEEecCCCCCCcHHHHHHHHHHHhh Q lcl|Aclame:pro 243 EMNDTSGRWSYARQLYGHVYTAKTG---------TL----SELVNAGDQFNQQHITLAGYEKETQTPADELAASRTARAA 309 (498) Q Consensus 243 ~l~~~s~r~~~~~q~~g~~~~~~~g---------t~----~~~~t~g~~~N~~~~t~~~~~~~~~~p~~~~AAa~~a~~a 309 (498) +|...-+||-+ .....+| |. ..+.+.-....+.++.+++.--+ ......+..+|. ++ T Consensus 124 el~~~~~R~vf-------file~~g~d~~~~~ge~w~~y~~~l~a~~~gia~~~V~vV~~~~g--n~~G~~aGRl~n-aa 193 (376) T protein:vir:37 124 ELLAKFGRRTF-------FIQAVQGINHDQSDGETWDQYVQKLTTLQQTIVADHVCLVPLLFG--NETGVLAGRLAN-RA 193 (376) T ss_pred HHHHhcCCeEE-------EEEeccCCCCcccccCCHHHHHHHHHHHhccccccceeeeeeecc--chHHHHHHHHHh-CC Confidence 55543345542 2223221 33 34455555667778887764211 122333333332 34 Q ss_pred hhhccCccccccceEEeccc---cCCC--ccccChHHHHHHHhCCeeEEEE--c-CCeEEEEeeeeeeeecCCCCCCchh Q lcl|Aclame:pro 310 VFIRNDPARPTQTGELVGML---PAPK--GKRFTMTEQQTLLSHGVATAYV--E-SGVLRIQRDVTTYRKNAYGVADNSY 381 (498) Q Consensus 310 ~~l~~DPArpl~tl~L~Gl~---~p~~--~~r~~~~er~~lL~~Gist~~v--~-~G~v~IeR~ITTY~~n~~G~~D~s~ 381 (498) .+.+.+|.| ..+..|.|+- .|-+ ...++..-...|=..|.+++.. + +| +++-+.-|. -.+..-| T Consensus 194 VsVadspgR-V~tGai~gl~~~~~p~d~~g~el~~a~l~aLd~arysvpr~Y~gydG-~Yw~dg~tl------~~~gsDY 265 (376) T protein:vir:37 194 VTVADSPAR-VQTGALVSLGSANKPLDKDGNELTLAHLKSLETARYSVPMWYPDYDG-YYRADGRTL------DVEGGDY 265 (376) T ss_pred cchhcCccc-eeecccccccccccccccCCcccchHHHHHHHhCCCeEEEeeCCCCc-eEEeCCeEe------ccCCCCe Confidence 455789998 7778888873 1222 3346888999999999999976 4 56 777777775 2344559 Q ss_pred hhhhhHHHHHHHHHHHHHHHhhhcCCceeccCCCCcCCCcccccHHHHHHHHHHHHHHHhhc----cc-----ccchhhh Q lcl|Aclame:pro 382 LDSETLHTSAYVLRKLKSVITSKYGRHKLASDGTRFGPGQAIVTPAVIKGELLATYRQLERA----GI-----VENYELF 452 (498) Q Consensus 382 ldi~t~~tl~yv~~~~r~~~~~~~~r~kla~dg~~~~~g~~ivTp~~ikaeli~~~~~le~~----gi-----ven~~~~ 452 (498) ..||.+|+.+-+.|.+|-..-.+-....|++. |+ .-.-.|.-+..-+++|... |. |+-++. T Consensus 266 q~ie~~RVvdKa~R~vR~~Ai~~i~Dr~lnst-----p~----sia~~~~~~~~pLr~M~ks~ei~g~~fpgei~~P~d- 335 (376) T protein:vir:37 266 QVIENLRVVDKVARKVRLLAIGKIADRSFNST-----TS----STEYHKNYFAKPLRDMSKSATINGKDFPGECMPPKD- 335 (376) T ss_pred eeehhchHHHHHHHHHHHHHHHHhcCccccCC-----hh----HHHHHHHHHhHHHHHHHhhhhhccccccceeecCCC- Confidence 99999999999999999877655544445432 11 1233455566666777544 22 222222 Q ss_pred cCeEEEEEcCCCCeEEEEEeeeEEecCeEEEeeeeeeEEE-ecc Q lcl|Aclame:pro 453 KQYLVVERDASVPNRLNTLFPPDYVNQLRVFAVVNQFRLQ-YSE 495 (498) Q Consensus 453 ~~~lvVerd~~d~nRvn~~~p~~~vn~l~v~A~~~~f~lq-~~~ 495 (498) +.+++.- ..+.+|.+.+-+.-.|-=.-|=..|.+-|. +-| T Consensus 336 -~dI~i~w--~sk~~V~I~~~vrPy~cpk~i~~~I~LDls~~~~ 376 (376) T protein:vir:37 336 -DAITIVW--QSKTKVTIYIKVRPYDCPKEITANIFLDLDSLGE 376 (376) T ss_pred -CceEEEe--ccCceEEEEEEEeeecCcceeEEEEEEecCCCCC Confidence 2222222 223556666554444433333334444444 222 Done!