Query lcl|NC_013597.1_cdsid_YP_003344796.1 [gene=D11S_2223] [protein=hypothetical protein] [protein_id=YP_003344796.1] [location=complement(10693..12201)] Match_columns 502 No_of_seqs 161 out of 229 Neff 8.1 Searched_HMMs 1612 Date Thu Nov 7 13:23:37 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_16 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_16_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:5260 Length: 502 # 100.0 3E-168 2E-171 939.0 52.7 502 1-502 1-502 (502) 2 protein:vir:106730 Length: 501 100.0 2E-140 1E-143 786.6 48.6 476 1-502 1-501 (501) 3 protein:vir:78611 Length: 501 100.0 2E-140 1E-143 786.1 48.2 476 1-502 1-501 (501) 4 protein:vir:99586 Length: 507 100.0 2E-140 1E-143 786.9 46.3 480 1-501 1-507 (507) 5 protein:vir:3636 Length: 501 # 100.0 6E-140 4E-143 783.7 48.6 476 1-502 1-501 (501) 6 protein:vir:101576 Length: 501 100.0 7E-140 4E-143 783.6 48.6 476 1-502 1-501 (501) 7 protein:vir:96104 Length: 504 100.0 2E-137 1E-140 769.7 46.4 476 1-501 1-504 (504) 8 protein:vir:94073 Length: 494 100.0 3E-136 2E-139 763.2 47.2 472 1-502 1-494 (494) 9 protein:vir:107720 Length: 515 100.0 4E-133 2E-136 746.6 47.0 483 1-501 1-515 (515) 10 protein:vir:95263 Length: 450 100.0 5E-122 3E-125 685.5 43.9 435 4-502 1-449 (450) 11 protein:vir:80052 Length: 331 100.0 1.7E-93 1.1E-96 529.2 38.0 327 4-502 1-331 (331) 12 protein:vir:3165 Length: 426 # 100.0 6.9E-74 4.2E-77 421.7 29.6 411 1-502 1-426 (426) 13 protein:vir:4517 Length: 498 # 99.6 3.7E-13 2.3E-16 88.8 35.2 444 1-490 1-498 (498) 14 protein:vir:489 Length: 498 # 99.6 4.3E-13 2.7E-16 88.4 34.5 444 1-490 1-498 (498) 15 protein:vir:4463 Length: 498 # 99.5 1.5E-12 9.3E-16 85.4 33.5 442 1-490 1-498 (498) 16 protein:vir:107865 Length: 477 99.3 1.7E-10 1.1E-13 74.1 30.4 428 1-502 1-467 (477) 17 protein:vir:79092 Length: 477 99.2 4.3E-10 2.7E-13 71.9 33.0 423 1-502 1-467 (477) 18 protein:vir:1996 Length: 495 # 99.2 5.4E-10 3.4E-13 71.4 36.7 438 1-502 1-495 (495) 19 protein:vir:102957 Length: 437 99.1 1.2E-09 7.6E-13 69.4 32.0 399 1-501 1-437 (437) 20 protein:vir:99306 Length: 587 99.1 2.4E-09 1.5E-12 67.8 33.4 451 1-502 1-582 (587) 21 protein:vir:95741 Length: 587 99.0 3.8E-09 2.4E-12 66.7 36.0 453 1-502 1-582 (587) 22 protein:vir:6079 Length: 396 # 99.0 4.2E-09 2.6E-12 66.5 31.7 368 1-502 1-383 (396) 23 protein:vir:1845 Length: 392 # 99.0 5.8E-09 3.6E-12 65.8 29.7 365 1-502 1-380 (392) 24 protein:vir:5711 Length: 396 # 98.9 1.3E-08 8E-12 63.8 32.4 369 1-502 1-383 (396) 25 protein:vir:98553 Length: 395 98.9 1.3E-08 8.1E-12 63.8 30.9 364 1-502 1-383 (395) 26 protein:vir:78986 Length: 436 98.9 1.4E-08 8.7E-12 63.6 32.0 398 1-501 1-436 (436) 27 protein:vir:2035 Length: 396 # 98.9 2.4E-08 1.5E-11 62.4 30.1 368 1-502 1-383 (396) 28 protein:vir:96586 Length: 587 98.8 4.7E-08 2.9E-11 60.7 38.5 453 1-502 1-582 (587) 29 protein:vir:80488 Length: 562 98.8 6E-08 3.7E-11 60.2 36.4 449 1-502 1-557 (562) 30 protein:vir:96740 Length: 388 98.7 6.7E-08 4.2E-11 59.9 27.5 320 110-502 1-377 (388) 31 protein:vir:78206 Length: 390 98.7 7.9E-08 4.9E-11 59.5 26.6 350 68-502 1-378 (390) 32 protein:vir:103993 Length: 390 98.7 7.9E-08 4.9E-11 59.5 26.6 350 68-502 1-378 (390) 33 protein:vir:105470 Length: 451 98.7 8.9E-08 5.5E-11 59.2 31.2 406 1-501 1-451 (451) 34 protein:vir:79181 Length: 390 98.6 1.5E-07 9.1E-11 58.1 29.8 360 1-502 1-378 (390) 35 protein:vir:63742 Length: 562 98.6 1.7E-07 1.1E-10 57.6 37.8 449 1-502 1-557 (562) 36 protein:vir:79141 Length: 391 98.6 1.9E-07 1.2E-10 57.5 25.5 349 68-502 1-378 (391) 37 protein:vir:104858 Length: 729 98.6 2E-07 1.2E-10 57.3 27.4 456 1-502 165-717 (729) 38 protein:vir:1172 Length: 391 # 98.6 2.1E-07 1.3E-10 57.2 28.9 362 1-502 1-379 (391) 39 protein:vir:5833 Length: 742 # 98.6 2.7E-07 1.7E-10 56.6 27.2 436 1-502 220-736 (742) 40 protein:vir:107310 Length: 581 98.5 3.4E-07 2.1E-10 56.0 27.6 423 1-502 101-566 (581) 41 protein:vir:80984 Length: 666 98.4 6.2E-07 3.9E-10 54.6 27.3 440 1-502 143-651 (666) 42 protein:vir:80779 Length: 569 98.3 1.2E-06 7.2E-10 53.1 34.4 449 1-502 1-564 (569) 43 protein:vir:7206 Length: 659 # 98.3 1.5E-06 9.5E-10 52.5 36.6 465 1-502 1-646 (659) 44 protein:vir:98824 Length: 774 98.3 1.8E-06 1.1E-09 52.1 28.8 442 1-502 279-767 (774) 45 protein:vir:102359 Length: 356 98.2 2.9E-06 1.8E-09 50.9 25.5 326 113-500 1-356 (356) 46 protein:vir:103456 Length: 659 98.1 4.2E-06 2.6E-09 50.1 37.0 463 1-502 1-646 (659) 47 protein:vir:100323 Length: 393 98.1 4.4E-06 2.7E-09 49.9 30.3 360 1-502 1-380 (393) 48 protein:vir:108052 Length: 660 98.1 5E-06 3.1E-09 49.6 35.5 464 1-502 1-647 (660) 49 protein:vir:104477 Length: 749 98.1 5.1E-06 3.1E-09 49.6 28.8 470 1-502 180-739 (749) 50 protein:vir:106984 Length: 743 98.1 5.5E-06 3.4E-09 49.4 26.9 463 1-502 181-732 (743) 51 protein:vir:10336 Length: 386 98.0 7.7E-06 4.8E-09 48.6 30.1 363 1-502 1-379 (386) 52 protein:vir:7653 Length: 581 # 98.0 8.1E-06 5E-09 48.5 31.2 439 1-502 51-566 (581) 53 protein:vir:101187 Length: 663 98.0 9.1E-06 5.6E-09 48.2 38.6 466 1-502 1-648 (663) 54 protein:vir:101804 Length: 663 97.9 1.1E-05 6.7E-09 47.8 36.1 465 1-502 1-648 (663) 55 protein:vir:6594 Length: 666 # 97.8 2.2E-05 1.4E-08 46.1 35.9 465 1-502 1-651 (666) 56 protein:vir:6894 Length: 660 # 97.6 3.7E-05 2.3E-08 44.9 35.8 465 1-502 1-646 (660) 57 protein:vir:100539 Length: 663 97.6 3.7E-05 2.3E-08 44.9 35.4 465 1-502 1-648 (663) 58 protein:vir:5663 Length: 671 # 97.6 4.2E-05 2.6E-08 44.6 31.5 440 1-502 145-661 (671) 59 protein:vir:100829 Length: 607 97.3 9.2E-05 5.7E-08 42.7 35.1 453 1-502 1-596 (607) 60 protein:vir:106427 Length: 679 97.3 0.00011 6.9E-08 42.3 35.3 465 1-502 1-665 (679) 61 protein:vir:98263 Length: 664 97.0 0.00023 1.5E-07 40.5 35.3 465 1-502 1-650 (664) 62 protein:vir:79798 Length: 717 96.4 0.00066 4.1E-07 38.0 23.5 371 1-502 311-717 (717) 63 protein:vir:3788 Length: 376 # 92.8 0.0099 6.2E-06 31.6 24.1 319 156-502 1-371 (376) 64 protein:vir:102819 Length: 648 91.0 0.018 1.1E-05 30.2 28.5 452 1-502 1-645 (648) 65 protein:vir:78782 Length: 370 89.4 0.027 1.7E-05 29.2 27.3 332 119-502 1-363 (370) 66 protein:vir:276 Length: 369 # 55.5 0.48 0.0003 22.3 28.6 328 121-502 1-366 (369) 67 protein:vir:3751 Length: 376 # 30.3 1.6 0.00099 19.5 23.7 315 156-502 1-371 (376) No 1 >protein:vir:5260 Length: 502 # NCBI annotation: hypothetical protein # Family: family:all:396 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852765;genbank:gi:31544040;uniprot:Q7Y5T5;genbank:GeneID:2753559 Probab=100.00 E-value=3e-168 Score=938.99 Aligned_cols=502 Identities=100% Similarity=1.372 Sum_probs=494.1 Q ss_pred CCcCcCceeEEeecccccccccccccceEEEecccccccccCccceEEecCHHHHHhhcCCCcHHHHHHHHHhcCCCCcc Q lcl|NC_013597. 1 MALSISHIVNVQLNTVPKSAARKSFGIVALFTPEAGQAFADEKTRYVYVENQRDVEQLFGTNSETAKAAQPFFAQSPRAK 80 (502) Q Consensus 1 Msip~s~iV~V~i~~~~~~~~~~~f~~~lil~~~~~~~~~~~~~r~~~y~s~~~v~~~fg~~s~ey~aA~~~F~q~p~P~ 80 (502) ||||+||||||+|++++.++++++||++|||++++..+++++.+|+|.|+|+++|++|||.+|||||||++||+|+|||+ T Consensus 1 msip~s~ivnV~i~~~~~a~~~~~f~~~l~l~~~~~~~~~~~~~r~~~~~s~~~V~~~FG~~s~ey~aA~~yF~q~p~P~ 80 (502) T protein:vir:52 1 MALSISHIVNVQLNTVPKSAARKSFGIVALFTPEAGQAFADEKTRYVYVENQRDVEQLFGTNSETAKAAQPFFAQSPRAK 80 (502) T ss_pred CCCCccceeEEeeccccccccccccCceEEEeeccCccccCCccceEEecCHHHHHHhcCCChHHHHHHHHHhcCCCccc Confidence 99999999999999999999999999999999999988999999999999999999999999999999999999999999 Q ss_pred eEEEEEeecccccceeeeeeccchhhhHHHHHhhcccceeEEEEecCccccccccccccccchhhHHHHHHhhhcccccc Q lcl|NC_013597. 81 QLIVARWQKSASTIEATKNTLSGATLSDDLERFKSVVNGRFSLTIGGDVKKVDGLSFARLADFNAVATKIQEKLTTLSVA 160 (502) Q Consensus 81 ~l~igr~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~iti~g~~~~~~~i~~s~~ts~~~vA~~i~aal~~a~~~ 160 (502) +|+||||++++..++++++++++.++++.+.+|+++.+|+|+++|+|+.+++++||||.+++++++|+.+++++.+.+.. T Consensus 81 ~l~igR~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~G~l~i~i~g~~~t~~~i~lS~~ts~~~vA~~i~~~l~~~~~~ 160 (502) T protein:vir:52 81 QLIVARWQKSASTIEATKNTLSGATLSDDLERFKSVVNGRFSLTIGGDVKKVDGLSFARLADFNAVATKIQEKLTTLSVA 160 (502) T ss_pred eEEEEeccccccceeechhhhhhhhhHHhHHHhhhhcCceeEEEecceeeeeeccccccccchhHHHHHHHhhhcccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999998889 Q ss_pred eeEEEecccceeeEeeecccccccceeeeeeccccchhhhhhhhhhcccccceeeeeccccccccCHHHHHHHHHhccCc Q lcl|NC_013597. 161 VSIAYDETGNRFIVSANVAGEDKKTEIDYAIDEGGEGEYIGALLKLENGQASRKVGKNSVSLKKETLGEALFNVAEVNNT 240 (502) Q Consensus 161 ~tv~~~~~~~~f~~~s~ttG~~~~v~~~~a~~~~~t~t~~aa~l~~t~~~~~~~v~v~~~~~~~et~~~al~al~~~~~~ 240 (502) ++|+||+.+++|.++++++|..+++.+.++..++.++++++++++++..+++..++....+..+|+|.++|+++.+.+++ T Consensus 161 ~tv~~d~~~~~F~i~s~ttg~~~~~~~~~a~~~~~~gt~~a~~l~l~~~~~av~v~~~~~g~~aet~~~al~a~~~~~~~ 240 (502) T protein:vir:52 161 VSIAYDETGNRFIVSANVAGEDKKTEIDYAIDEGGEGEYIGALLKLENGQASRKVGKNSVSLKKETLGEALFNVAEVNNT 240 (502) T ss_pred eEEEEecCCceEEEEeccCCCcceeEEEEeecCCcchhHHHHHhccccccceeeeeeecccccccCHHHHHHHHHhccCc Confidence 99999999999999999999999999999999999999999999999999999998888899999999999999999999 Q ss_pred eeEEEEecCCChhHHHHHHHHHhhcCCEEEEEecCchhcccchhHHHHHHHHccCCceEEEecCCccchHHHHHHHHHhc Q lcl|NC_013597. 241 WYGFTVAAQLTDSEVEAAAKYAQANTKLFGANVIRAEQIEWSADNIYKKLYDAGLDHTLAMFDKNDMYPVSSALARLLST 320 (502) Q Consensus 241 w~~~~~~~~~~~~~~~a~a~w~~a~~~~~~~~~~~~~~~~~~~~~i~~~l~~~~~~~t~~~y~~~~~~~~aa~~g~~as~ 320 (502) ||+|++++++++++++++|+|+|+|+|+|++++++++++....++++++|+.++|+||+++||++++|+++++||+++++ T Consensus 241 w~~~~~a~~~~~~~~la~a~~iea~~~~f~~~~~d~~~~~~~~~~i~~~l~a~~~~~t~~~y~~~~~~~~aa~~g~~as~ 320 (502) T protein:vir:52 241 WYGFTVAAQLTDSEVEAAAKYAQANTKLFGANVIRAEQIEWSADNIYKKLYDAGLDHTLAMFDKNDMYPVSSALARLLST 320 (502) T ss_pred eEEEEEeecCChhHHHHHHHHHhhcCcEEEEEecCcceeccccchHHHHHHhccCceeEEEecCCcchhHHHHHHHHHhc Confidence 99999999999999999999999999999999999999988889999999999999999999999999999999999999 Q ss_pred CCCCCCceeeEeeeecCccccCCCCHHHHHHHHhCCceEEEEEcCceEEecCEeecCeehhHHHHHHHHHHHHHHHHHHH Q lcl|NC_013597. 321 NFAANNSTLTLKFKQQPTITADEITATEFAKAKRLGINVYTYFDDVAMIAEGTVIGGKFADEIVILDWFVDAVQKEVFAR 400 (502) Q Consensus 321 n~~~~~g~~T~~fk~~~Gv~~~~lt~t~~~~l~~~~~n~y~~~~~~~~~~~G~~~~G~~iD~~~~~dwl~~~iq~~l~~~ 400 (502) ||++.||++|||||+++||+|++++.+|+++|+++|||||+++++.+++++|++++|+|||++||+|||+++||++|+++ T Consensus 321 ~f~~~~g~iT~~fk~l~GV~~~~lt~t~~~al~~~~~N~y~~~~~~~~~~~G~~~~G~~iD~~~~~~Wl~~~lq~~l~~~ 400 (502) T protein:vir:52 321 NFAANNSTLTLKFKQQPTITADEITATEFAKAKRLGINVYTYFDDVAMIAEGTVIGGKFADEIVILDWFVDAVQKEVFAR 400 (502) T ss_pred CCCcCcceeeecccccCCcccCcCCHHHHHHHHhcCceEEEEecCeeEEecCeeeCCchhhHHHHHHHHHHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHhcCCCCccCHHHHHHHHHHHHHHHHHHHHcCccccccccCccccccccccccccceEEEcCchhcCCHHHHhhcccCc Q lcl|NC_013597. 401 LYKSPTKIPLTDKGQAILIAAVEKVCLEGINNGAFAPGKWTGAGFGNLSTGDYLDKGFYVWAAPMDTLSDSDRQARRATP 480 (502) Q Consensus 401 l~~~~~kiPyt~~G~~~l~~~v~~vl~~a~~~G~I~~g~~~~~~~g~~~~~~~~~~gy~v~~~~~~~~s~~dra~R~~~~ 480 (502) |+++++|||||+.|+++|+++|+++|+|+++||+|+||+|+++++|.+..+|++.+||||++|++++|+++||++|++|+ T Consensus 401 L~~s~~kIPy~~~G~~~l~a~i~~~l~~a~~~G~I~~G~~~~~~~g~~~~~d~~~~gy~v~~~~~~~~s~~dr~~R~~~~ 480 (502) T protein:vir:52 401 LYKSPTKIPLTDKGQAILIAAVEKVCLEGINNGAFAPGKWTGAGFGNLSTGDYLDKGFYVWAAPMDTLSDSDRQARRATP 480 (502) T ss_pred HHhcCCCcccChhHHHHHHHHHHHHHHHHHhcCccccccccCcccceeeecccccCceEEEeCchhhCCHHHHHcccCCC Confidence 99988999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eEEEEEECceEEEEEEEEEEeC Q lcl|NC_013597. 481 IQTAVKLAGAIHSSDVIVNYNR 502 (502) Q Consensus 481 i~~~~~~aGaIh~v~i~~~v~~ 502 (502) |+|+|+++||||+|+|++|||| T Consensus 481 ~~~~~~~aGaIh~v~i~~nv~~ 502 (502) T protein:vir:52 481 IQTAVKLAGAIHSSDVIVNYNR 502 (502) T ss_pred eEEEEEECceEEEEEEEEEEeC Confidence 9999999999999999999999 No 2 >protein:vir:106730 Length: 501 # NCBI annotation: gp04 # Family: family:all:396 # MgeID: mge:1599 # MgeName: Bcep1 # Cross-refs: genbank:acc:NP_944312;genbank:gi:38638611;genbank:GeneID:2657359 Probab=100.00 E-value=1.9e-140 Score=786.61 Aligned_cols=476 Identities=18% Similarity=0.211 Sum_probs=425.4 Q ss_pred CC---cCcCceeEEeecccccccccccccceEEEecccccccccCccceEEecCHHHHHhhcCCCcHHHHHHHHHhc--- Q lcl|NC_013597. 1 MA---LSISHIVNVQLNTVPKSAARKSFGIVALFTPEAGQAFADEKTRYVYVENQRDVEQLFGTNSETAKAAQPFFA--- 74 (502) Q Consensus 1 Ms---ip~s~iV~V~i~~~~~~~~~~~f~~~lil~~~~~~~~~~~~~r~~~y~s~~~v~~~fg~~s~ey~aA~~~F~--- 74 (502) |+ ||+||||||+|+|.++++.+++|+ +|||+++...| .+|+|.|+++++|++|||.+|||||||++||+ T Consensus 1 m~~~~ip~s~iV~V~~~v~~~~~~~~~f~-~lll~~~~~~~----~~r~~~y~s~~~V~~~FG~~S~ey~aA~~yFsg~~ 75 (501) T protein:vir:10 1 MPTTTIPIDQIVQMLPGVIGAGGAPGRLT-GLVLTQDTSVQ----PGQLADFFQKTDVENWFGALSNEAKIADAYFPGIV 75 (501) T ss_pred CCcCccccceEEEEeeecccCCCcccccc-eEEEecccCCC----ccceeeecCHHHHHHhcCCChHHHHHHHHHhhhhc Confidence 98 999999999999999999999999 66777776554 37999999999999999999999999999998 Q ss_pred -CCCCcceEEEEEeecccccceeeeeeccchhhhHHHHHhhcccceeEEEEecCccccccccccccccchhhHHHHHHhh Q lcl|NC_013597. 75 -QSPRAKQLIVARWQKSASTIEATKNTLSGATLSDDLERFKSVVNGRFSLTIGGDVKKVDGLSFARLADFNAVATKIQEK 153 (502) Q Consensus 75 -q~p~P~~l~igr~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~iti~g~~~~~~~i~~s~~ts~~~vA~~i~aa 153 (502) |+|||++|+||||++++.++.++++.+.+.+ ++.++.+ +|+|+|+++|+.+. .+||||.+++++++|+.|+++ T Consensus 76 ~q~p~P~~l~igR~~~~~~~~~l~g~~l~~~~----la~~~~~-~g~l~i~i~g~~~~-~~i~~s~ats~~~vA~~i~~a 149 (501) T protein:vir:10 76 NGGQLPYDLKFARYVAADAPASVYGIPLTGIT----LAQLQGY-SGTLTVTTAAQHVS-ANISLAAATSFANAATLIEAA 149 (501) T ss_pred CCCccccEEEEEeecccCccceeeeceehhhh----hhhhhhe-eeEEEEeeccceee-eccccccccCHHHHHHHHHHh Confidence 9999999999999999999999999887644 4567776 49999999998654 679999999999999999999 Q ss_pred hcccccceeEEEecccceeeEeeecccccccceeeeeeccccchhhhhhhhhhcccccceeeeeccccccccCHHHHHHH Q lcl|NC_013597. 154 LTTLSVAVSIAYDETGNRFIVSANVAGEDKKTEIDYAIDEGGEGEYIGALLKLENGQASRKVGKNSVSLKKETLGEALFN 233 (502) Q Consensus 154 l~~a~~~~tv~~~~~~~~f~~~s~ttG~~~~v~~~~a~~~~~t~t~~aa~l~~t~~~~~~~v~v~~~~~~~et~~~al~a 233 (502) |+.. .++|+||+...+|+++++++|..+.+++... +.+++..++++.+.+...+ +.+.++|+|.++|++ T Consensus 150 l~~~--~~tv~~d~~~~~f~i~~~t~G~~~~i~~~t~------~~d~a~~l~Lt~~~~a~v~---~~g~~aet~~~Al~a 218 (501) T protein:vir:10 150 FTSP--DFVVAYDALRNRFTVVTNTTGTAAAISAVTG------TNNLADELGLSAAAGATLQ---AAGVAADTPASAMNR 218 (501) T ss_pred hcCC--ceEEEEecccceEEEEecccCcceeEEEeec------cccchhhhcccccCceeEE---ecCcccccHHHHHHH Confidence 9763 4689999999999999999999988776432 2478999999988765432 467889999999999 Q ss_pred HHhccCceeEEEEecCCChhHHHHHHHHHhhcCCEEEEEecCchh--cc-cchhHHHHHHHHccCCceEEEecCCccchH Q lcl|NC_013597. 234 VAEVNNTWYGFTVAAQLTDSEVEAAAKYAQANTKLFGANVIRAEQ--IE-WSADNIYKKLYDAGLDHTLAMFDKNDMYPV 310 (502) Q Consensus 234 l~~~~~~w~~~~~~~~~~~~~~~a~a~w~~a~~~~~~~~~~~~~~--~~-~~~~~i~~~l~~~~~~~t~~~y~~~~~~~~ 310 (502) +.+.+++||+|.+++++++++++++|+|+|+|+++|++..++.+. +. ...++++++|++++|+||+++||+ ++++ T Consensus 219 ~~~~~~~Wy~f~~a~~~~~~~~la~A~wi~a~~~~f~~~~~~~~~~~~~~~~~~~i~~~l~~~~y~~t~~~y~~--~~~~ 296 (501) T protein:vir:10 219 AVGLSRNWATFTTAWTAVIADRLAFAAWNSGQAYKYMYVAPDLEAASIVTNNAASFGAQVFAAPYQGTLPLYGD--QATA 296 (501) T ss_pred HHhcccceEEEEEEecCChHHHHHHHHHHHhcCceEEEEEecCcceeeecccchhHHHHHHhcCCCceEEECCC--CCHH Confidence 999999999999999999999999999999999999888777654 33 457899999999999999999995 4568 Q ss_pred HHHHHHHHhcCCCCCCceeeEeeeec-CccccCCCCHHHHHHHHhCCceEEEEEcC----ceEEecCEeecC-eehhHHH Q lcl|NC_013597. 311 SSALARLLSTNFAANNSTLTLKFKQQ-PTITADEITATEFAKAKRLGINVYTYFDD----VAMIAEGTVIGG-KFADEIV 384 (502) Q Consensus 311 aa~~g~~as~n~~~~~g~~T~~fk~~-~Gv~~~~lt~t~~~~l~~~~~n~y~~~~~----~~~~~~G~~~~G-~~iD~~~ 384 (502) ++++|+++++||++.||++||||||+ +||+|++++++|+++|+++|||||+.+.+ ..++++|+|++| +|||+++ T Consensus 297 aa~~g~~as~nf~~~~g~~T~~fkql~~Gv~a~~l~~t~a~al~~~~~N~y~~~~~~~~~~~~~~~G~~sG~~~wiD~~~ 376 (501) T protein:vir:10 297 GAVMGYAASINFQLRNGRTVLAFRQFNAGVPATAHDLPTANALRSNNYTYIGAYANAANNYTIAYDGKLSGKFLWVDTYL 376 (501) T ss_pred HHHHHHHHhcCcccCcceeeeeecccCCCcCcccCCHHHHHHHHhcCCeEEEEEecccceeeEEEcceeeccceehhhHh Confidence 89999999999999999999999997 89999999999999999999999999965 468999999988 8999999 Q ss_pred HHHHHHHHHHHHHHHHHHhcCCCCccCHHHHHHHHHHHHHHHHHHHHcCccccccccCccc---------cccccccccc Q lcl|NC_013597. 385 ILDWFVDAVQKEVFARLYKSPTKIPLTDKGQAILIAAVEKVCLEGINNGAFAPGKWTGAGF---------GNLSTGDYLD 455 (502) Q Consensus 385 ~~dwl~~~iq~~l~~~l~~~~~kiPyt~~G~~~l~~~v~~vl~~a~~~G~I~~g~~~~~~~---------g~~~~~~~~~ 455 (502) |+|||+++||+++|++| ++++|||||+.|+++|++.|+++|+|+++||+|+||+++++.+ +...++|.++ T Consensus 377 g~dWl~~~iq~~l~~ll-~~~~kIPyt~~G~~~l~a~i~~~l~~av~nG~Ia~Gv~l~~~q~~~i~~~~g~~~~~~~v~~ 455 (501) T protein:vir:10 377 DQIYLNAELQRAEFEAM-LAYNSLPYNEDGYTANYRAGVDVIDAAVTSGIIRAGVTLTNSQLQQIDAVAGVAGAGALVQT 455 (501) T ss_pred hHHHHHHHHHHHHHHHH-hcCCCcccCHHHHHHHHHHHHHHHHHHHhCcceecCcccCcccceeecccccccccccceec Confidence 99999999999999976 5578999999999999999999999999999999999665554 4566789999 Q ss_pred cceEEEcCchhcCCHHHHhhcccCceEEEEEECceEEEEEEEEEEeC Q lcl|NC_013597. 456 KGFYVWAAPMDTLSDSDRQARRATPIQTAVKLAGAIHSSDVIVNYNR 502 (502) Q Consensus 456 ~gy~v~~~~~~~~s~~dra~R~~~~i~~~~~~aGaIh~v~i~~~v~~ 502 (502) +|||+++++++.++ ++|++|++|+++|+|+++|+||+|+|.-+.+= T Consensus 456 ~Gyy~~~~~~~~~~-~~R~~R~~p~~~~~y~~~gaIh~v~i~s~~v~ 501 (501) T protein:vir:10 456 RGWYFLIGNPANPG-QARQNRTSPACTLWYSDGGSIQELTIGSNAVI 501 (501) T ss_pred cceEEeeCcccCCh-hhhhhcccCceEEEEEeCCceeEEEeeeeecC Confidence 99999999988655 67999999999999999999999999544444 No 3 >protein:vir:78611 Length: 501 # NCBI annotation: BcepNY3gp03 # Family: family:all:396 # MgeID: mge:1854 # MgeName: BcepNY3 # Cross-refs: genbank:acc:YP_001294840;genbank:gi:149882903;genbank:GeneID:5291082 Probab=100.00 E-value=2.3e-140 Score=786.14 Aligned_cols=476 Identities=17% Similarity=0.215 Sum_probs=427.8 Q ss_pred CC---cCcCceeEEeecccccccccccccceEEEecccccccccCccceEEecCHHHHHhhcCCCcHHHHHHHHHhc--- Q lcl|NC_013597. 1 MA---LSISHIVNVQLNTVPKSAARKSFGIVALFTPEAGQAFADEKTRYVYVENQRDVEQLFGTNSETAKAAQPFFA--- 74 (502) Q Consensus 1 Ms---ip~s~iV~V~i~~~~~~~~~~~f~~~lil~~~~~~~~~~~~~r~~~y~s~~~v~~~fg~~s~ey~aA~~~F~--- 74 (502) |+ ||+||||||+|+|.++++.+++|+ +|||+++...| .+|+|.|+++++|++|||.+||||+||++||+ T Consensus 1 m~~~~ip~s~iV~V~~~v~~~~~~~~~~~-~lll~~~~~~~----~~r~~~y~s~~~V~~~FG~~S~ey~aA~~yFs~~~ 75 (501) T protein:vir:78 1 MPTTTIPIDQIVQMLPGVIGAGGAPGRLT-GLVLTQDTSIQ----PGQLADFFQKTDVENWFGGLSNEAVIADAYFPGIV 75 (501) T ss_pred CCcCccccceEEEEeeecccCCCcceeee-eEEEecCCCCC----ccceeeecCHHHHHHhcCCChHHHHHHHHHhhcCC Confidence 98 999999999999999999999998 67788776554 37999999999999999999999999999999 Q ss_pred -CCCCcceEEEEEeecccccceeeeeeccchhhhHHHHHhhcccceeEEEEecCccccccccccccccchhhHHHHHHhh Q lcl|NC_013597. 75 -QSPRAKQLIVARWQKSASTIEATKNTLSGATLSDDLERFKSVVNGRFSLTIGGDVKKVDGLSFARLADFNAVATKIQEK 153 (502) Q Consensus 75 -q~p~P~~l~igr~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~iti~g~~~~~~~i~~s~~ts~~~vA~~i~aa 153 (502) |+|||++|+||||++++.++.++++++.+.+ ++.|+++ +|+|+|+++|+ +...+||||.+++++++|+.|+++ T Consensus 76 ~q~~~P~~l~igR~~~~a~~~~l~g~~l~~~~----la~~~~~-~G~l~iti~g~-~~~~~i~~S~~ts~~~vA~~i~~a 149 (501) T protein:vir:78 76 NGGQLPYDLKFARYVAADAPASVYGIPLTGVT----LTQLQGY-SGTLTVTTAAQ-HVSSNISLAAATSFANAATLIEAA 149 (501) T ss_pred CCCcccceEEEEeecccCcceeEeccceeccc----hhhhcee-eeEEEEEeccc-eeeeccccccccCHHHHHHHHHhh Confidence 9999999999999999999999998887644 5578888 59999999997 556789999999999999999999 Q ss_pred hcccccceeEEEecccceeeEeeecccccccceeeeeeccccchhhhhhhhhhcccccceeeeeccccccccCHHHHHHH Q lcl|NC_013597. 154 LTTLSVAVSIAYDETGNRFIVSANVAGEDKKTEIDYAIDEGGEGEYIGALLKLENGQASRKVGKNSVSLKKETLGEALFN 233 (502) Q Consensus 154 l~~a~~~~tv~~~~~~~~f~~~s~ttG~~~~v~~~~a~~~~~t~t~~aa~l~~t~~~~~~~v~v~~~~~~~et~~~al~a 233 (502) |++. .++|+||+...+|+++++++|..+++++.. .+++++..++|+...++..+ +.+.++|+|.++|++ T Consensus 150 l~a~--~~tv~~ds~~~~f~its~t~G~~~~i~~~t------~~~~~a~~l~Lt~~~~a~v~---~~g~~aet~~~a~~a 218 (501) T protein:vir:78 150 FTSP--DFVVSYDALRNRFVVNTNATGTAAAISAVT------GTNNLADELGLSAAAGASLQ---AAGVAADTPASAMNR 218 (501) T ss_pred hcCc--ceEEEEccccceEEEEeeecCCceeEEEEe------cccchhhhhcccccCceeeE---eccccccCHHHHHHH Confidence 9864 468999999999999999999988776643 35688999999988766543 457789999999999 Q ss_pred HHhccCceeEEEEecCCChhHHHHHHHHHhhcCCEEEEEecCch--hcc-cchhHHHHHHHHccCCceEEEecCCccchH Q lcl|NC_013597. 234 VAEVNNTWYGFTVAAQLTDSEVEAAAKYAQANTKLFGANVIRAE--QIE-WSADNIYKKLYDAGLDHTLAMFDKNDMYPV 310 (502) Q Consensus 234 l~~~~~~w~~~~~~~~~~~~~~~a~a~w~~a~~~~~~~~~~~~~--~~~-~~~~~i~~~l~~~~~~~t~~~y~~~~~~~~ 310 (502) +.+.+++||+|.+++++++++++++|+|+|+|+++|++..++.+ .+. ...++++++|++++|.||+++|| +++++ T Consensus 219 ~~~~~~~Wy~f~~a~~~~~~~~lalA~wiea~~~~f~~~~~~~~~~~~~~~~~~~i~~~l~a~~y~~t~~~y~--~~~~~ 296 (501) T protein:vir:78 219 AVGLSRNWATFTTAWTAVIADRLALASWNSGQAYKYMYVAPDLEPASIVTNNSASFGAQVFAAPYQGTLPLYG--DQATA 296 (501) T ss_pred HHhccCceEEEEEecCCCHHHHHHHHHHHHhcCceEEEEEecCCcceeecccchhHHHHHhhcCCCceEEEcC--CcchH Confidence 99999999999999999999999999999999999888776655 333 45788999999999999999999 57889 Q ss_pred HHHHHHHHhcCCCCCCceeeEeeeec-CccccCCCCHHHHHHHHhCCceEEEEEcC----ceEEecCEeecC-eehhHHH Q lcl|NC_013597. 311 SSALARLLSTNFAANNSTLTLKFKQQ-PTITADEITATEFAKAKRLGINVYTYFDD----VAMIAEGTVIGG-KFADEIV 384 (502) Q Consensus 311 aa~~g~~as~n~~~~~g~~T~~fk~~-~Gv~~~~lt~t~~~~l~~~~~n~y~~~~~----~~~~~~G~~~~G-~~iD~~~ 384 (502) +++||+++++||++.+|++||||||+ +||+|++++++|+++|+++|||||+.+.+ ..++++|+|+++ +|||.++ T Consensus 297 aa~~g~~as~nf~~~~g~~T~~fkq~~~Gv~a~~l~~t~a~al~~~~~N~y~~~~~~~~~~~~~~~G~~sG~~~wiD~~~ 376 (501) T protein:vir:78 297 GAVMGYAASINFQLRNGRTVLAFRQFNAGVPATAHDLGTANALRSNNYTYIGAYANAANNYTIAYDGKLSGKFLWVDTYL 376 (501) T ss_pred HHHHHHHHhcCcccCcceeeeeccccCCCcCcccCCHHHHHHHHhcCCeEEEEEecccceeeEEEcCeeeccceeehhhh Confidence 99999999999999999999999996 89999999999999999999999999965 458999999877 7999999 Q ss_pred HHHHHHHHHHHHHHHHHHhcCCCCccCHHHHHHHHHHHHHHHHHHHHcCccccccccCccc---------cccccccccc Q lcl|NC_013597. 385 ILDWFVDAVQKEVFARLYKSPTKIPLTDKGQAILIAAVEKVCLEGINNGAFAPGKWTGAGF---------GNLSTGDYLD 455 (502) Q Consensus 385 ~~dwl~~~iq~~l~~~l~~~~~kiPyt~~G~~~l~~~v~~vl~~a~~~G~I~~g~~~~~~~---------g~~~~~~~~~ 455 (502) |+|||+++||+++|++| .+++|||||+.|+++|+++|+++|+++++||+|+||+|+++.+ +...++|+++ T Consensus 377 ~~~Wl~~~iq~~l~~ll-~~~~kIPyt~~G~~~l~a~v~~~l~~av~nG~I~~Gv~l~~~q~~~I~~~~g~~~~~~~~~~ 455 (501) T protein:vir:78 377 DQIYLNAELQRAEFEAM-LAYNSLPYNEDGYTALYRAGVDVIDAAVTSGIIRAGVTLTNSQLQQIDAAAGVAGAGQLVQM 455 (501) T ss_pred hHHHHHHHHHHHHHHHH-HhCCCcccCHHHHHHHHHHHHHHHHHHHhCceeecCCCCCCccceeeccccCccccccceec Confidence 99999999999999976 5578999999999999999999999999999999999987765 4466788999 Q ss_pred cceEEEcCchhcCCHHHHhhcccCceEEEEEECceEEEEEEEEEEeC Q lcl|NC_013597. 456 KGFYVWAAPMDTLSDSDRQARRATPIQTAVKLAGAIHSSDVIVNYNR 502 (502) Q Consensus 456 ~gy~v~~~~~~~~s~~dra~R~~~~i~~~~~~aGaIh~v~i~~~v~~ 502 (502) +|||+++++++.++ ++|++|++|+++|+|+++|+||+|+|.-+.+= T Consensus 456 ~Gyy~~~~~~~~~~-~~R~~R~~p~~~~~y~~~gaIh~v~i~s~~v~ 501 (501) T protein:vir:78 456 RGWYFLIGDPANPG-QARQNRTTPTCTLWYSDGGSIQELTIGSNAVI 501 (501) T ss_pred cceEEeeccccCCh-hhhhhcccCcEEEEEEeCCceeEEEeeeeecC Confidence 99999999988655 67999999999999999999999999544444 No 4 >protein:vir:99586 Length: 507 # NCBI annotation: hypothetical protein # Family: family:all:396 # MgeID: mge:1544 # MgeName: BcepF1 # Cross-refs: genbank:acc:YP_001039795;genbank:gi:126011045;genbank:GeneID:4818281 Probab=100.00 E-value=1.7e-140 Score=786.90 Aligned_cols=480 Identities=17% Similarity=0.193 Sum_probs=427.9 Q ss_pred CCcCcCceeEEeecccccccccccccceEEEecccccccccCccceEEecCHHHHHhhcCCCcHHHHHHHHHhcCCC--- Q lcl|NC_013597. 1 MALSISHIVNVQLNTVPKSAARKSFGIVALFTPEAGQAFADEKTRYVYVENQRDVEQLFGTNSETAKAAQPFFAQSP--- 77 (502) Q Consensus 1 Msip~s~iV~V~i~~~~~~~~~~~f~~~lil~~~~~~~~~~~~~r~~~y~s~~~v~~~fg~~s~ey~aA~~~F~q~p--- 77 (502) |- |+||||||+|+|.++++.+++|+.+|||++++..| .||+|.|+++++|++|||.+|||||||++||+|+| T Consensus 1 mi-p~s~iVnV~~~v~~~a~~~~~~~~~lilt~~~~~~----~~r~~~y~s~~~V~~~FG~~S~ey~aA~~yFsq~p~~~ 75 (507) T protein:vir:99 1 MI-SQSRYVRIVSGVGAGAPVAQRRLIMRVMTTNAVLP----PGVVFESSSADAVGAYFGMASEEYKRAKAYMSFISKSI 75 (507) T ss_pred CC-CccceeEEeeeccccCcccccccceeeeccccCCC----ccceEeecCHHHHHHhcCCChHHHHHHHHHhccCCCCC Confidence 74 99999999999999999999999999999997654 38999999999999999999999999999999999 Q ss_pred -CcceEEEEEeecccccceeeeeeccchhhhHHHHHhhcccceeEEEEecCccccccccccccccchhhHHHHHHhhhcc Q lcl|NC_013597. 78 -RAKQLIVARWQKSASTIEATKNTLSGATLSDDLERFKSVVNGRFSLTIGGDVKKVDGLSFARLADFNAVATKIQEKLTT 156 (502) Q Consensus 78 -~P~~l~igr~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~iti~g~~~~~~~i~~s~~ts~~~vA~~i~aal~~ 156 (502) +|++|+||||++++..+.++++ ++++.+..++++++|+|+|+|||+.+++++||||.+++++++|+.|+++|++ T Consensus 76 ~~P~~L~igR~~~~~~~a~l~g~-----~~~~~l~~~~~~~~G~lti~v~G~~~t~~~i~lS~~ts~~~vAs~i~~~l~a 150 (507) T protein:vir:99 76 NSPSYISFARWVNAAIASMIVGD-----SLVKNLPALKAVATPTLSLSIGGTVVPIAGIDLTAALTLTDVAATLQTKIRA 150 (507) T ss_pred cccceEEEEeecCccccceeecc-----hhhhhHHHHhhhcceeEEEEEcCceeEeccccccccCCHHHHHHHHHHhhhc Confidence 6999999999998876666554 4556788999999999999999999999999999999999999999999987 Q ss_pred ccc----ceeEEEecccceeeEeeecccccccceeeeeeccccchhhhhhhhhhcccccceeeeeccccccccCHHHHHH Q lcl|NC_013597. 157 LSV----AVSIAYDETGNRFIVSANVAGEDKKTEIDYAIDEGGEGEYIGALLKLENGQASRKVGKNSVSLKKETLGEALF 232 (502) Q Consensus 157 a~~----~~tv~~~~~~~~f~~~s~ttG~~~~v~~~~a~~~~~t~t~~aa~l~~t~~~~~~~v~v~~~~~~~et~~~al~ 232 (502) ... .++|+||.++++|+++++++|..+++.+..+ ..++++++.+++++.. ++.. +.+.++|+|.++++ T Consensus 151 ~~~~~~~~~tv~~d~~~~~F~v~s~~tG~~s~i~~at~---~~~gt~~s~l~~~~~~-~a~~----~~g~~aet~~~a~~ 222 (507) T protein:vir:99 151 SANAELATATVTFNTTTNQFVLNGTTTGALAPTITAVR---TDPATDISSLLGWTNT-GTVF----VKGQAAETPDTSIS 222 (507) T ss_pred cccccccceEEEEecCCceEEEEeeeccccceeEEEEc---CCchhhHHHHhccccc-cceE----eecccccCHHHHHH Confidence 643 4789999999999999999999888776544 2357888988888744 3332 45778999999999 Q ss_pred HHHhccCceeEEEEecC--CChhHHHHHHHHHhhcCCEEEEEecCchhcccchhHHHHHHHHccCCceEEEecCCccchH Q lcl|NC_013597. 233 NVAEVNNTWYGFTVAAQ--LTDSEVEAAAKYAQANTKLFGANVIRAEQIEWSADNIYKKLYDAGLDHTLAMFDKNDMYPV 310 (502) Q Consensus 233 al~~~~~~w~~~~~~~~--~~~~~~~a~a~w~~a~~~~~~~~~~~~~~~~~~~~~i~~~l~~~~~~~t~~~y~~~~~~~~ 310 (502) ++.+.+++||+|++++. +++++++++|+|+|+|+++|++..++.+. ....+.+..++...+.++...+....+|++ T Consensus 223 a~~~~~~nW~~~~~a~~~~~td~~~lalA~wiea~~~~f~~~~~~~~a--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 300 (507) T protein:vir:99 223 KSAAISTNFGSFIYTSTPALTNDQITAVASWNASQNNMYMYSVPTTIA--NIGTLYAAVKGFSGCALNITSDSLPVDYIE 300 (507) T ss_pred HHHhhcCCeEEEEEEeccccChHHHHHHHHHHhhcCcEEEEEEecCch--hhhhhhhhhhhcceeEEEeecccccchhHH Confidence 99999999999988764 68999999999999999999988776543 223455666666666666655555568999 Q ss_pred HHHHHHHHhcCCCCCCceeeEeeeecCccccCCCCHHHHHHHHhCCceEEEEEcC----ceEEecCEeecCe----ehhH Q lcl|NC_013597. 311 SSALARLLSTNFAANNSTLTLKFKQQPTITADEITATEFAKAKRLGINVYTYFDD----VAMIAEGTVIGGK----FADE 382 (502) Q Consensus 311 aa~~g~~as~n~~~~~g~~T~~fk~~~Gv~~~~lt~t~~~~l~~~~~n~y~~~~~----~~~~~~G~~~~G~----~iD~ 382 (502) +++||+++++||++.||++|||||+++||+|++++++|+++|++||||||+.+++ ..|+++|+|++|+ |+|. T Consensus 301 aa~~g~~as~nf~~~ng~~T~~fk~l~GV~a~~lt~t~a~al~~~n~N~y~~~a~~~~~~~~~~~G~~~gG~~~fid~d~ 380 (507) T protein:vir:99 301 QSPCEILAATDYTRVNATQNYMYYQFPSRNITVSDDTTANLVDANRGNYIGQTQSAGQSLAFYQRGILCGGPNDAVDMNI 380 (507) T ss_pred HHHHHHHHhhccCcCccceeecccccCCcccccCCHHHHHHHHhcCCeEEEEeccccceeeEEecCeeeCCcccceeeee Confidence 9999999999999999999999999999999999999999999999999999976 4689999999995 5566 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCCCCccCHHHHHHHHHHHHHHHHHHHHcCcccccc---------ccCccccccccccc Q lcl|NC_013597. 383 IVILDWFVDAVQKEVFARLYKSPTKIPLTDKGQAILIAAVEKVCLEGINNGAFAPGK---------WTGAGFGNLSTGDY 453 (502) Q Consensus 383 ~~~~dwl~~~iq~~l~~~l~~~~~kiPyt~~G~~~l~~~v~~vl~~a~~~G~I~~g~---------~~~~~~g~~~~~~~ 453 (502) ++++|||+++||++||++| .+++|||||+.|+++|+++|+++|+++++||+|+||+ |++...+.+.++|+ T Consensus 381 ~~~~~WL~~~iq~~l~~l~-~~~~kIPyt~~G~~~l~a~i~~~l~~av~nG~I~~Gvtl~~~q~~~in~~~~~~~~~~~~ 459 (507) T protein:vir:99 381 YANEIWLKSAISAQILSLF-LNVPRVPANETGESMLLSVIQSVVNTAKNNGTISAGKNLNVIQQQYITQISGDANAWRQV 459 (507) T ss_pred ecchHHHHHHHHHHHHHHH-hcCCCCccChhhHHHHHHHHHHHHHHHHhccccccCCcccccchheecccccccccccce Confidence 7799999999999999966 5578999999999999999999999999999999999 66666678899999 Q ss_pred cccceEEEcCchhcCCHHHHhhcccCceEEEEEECceEEEEEEEEEEe Q lcl|NC_013597. 454 LDKGFYVWAAPMDTLSDSDRQARRATPIQTAVKLAGAIHSSDVIVNYN 501 (502) Q Consensus 454 ~~~gy~v~~~~~~~~s~~dra~R~~~~i~~~~~~aGaIh~v~i~~~v~ 501 (502) +.+|||+++|+++.+++++|++|++|+++|+|+++|+||+|+|..+.+ T Consensus 460 ~~~Gyy~~~~~~s~~~~~~r~~r~~~~~~~~y~~~gaI~~v~~~~~~v 507 (507) T protein:vir:99 460 ANIGYWLNITFSSYTNPNTQLTEWKASYQLIYSKDDAIRFVEGTDTLI 507 (507) T ss_pred eccceEEEeCChHhcChhhhhccccceEEEEEEeCCeEEEEEeeeecC Confidence 999999999999999999999999999999999999999999999999 No 5 >protein:vir:3636 Length: 501 # NCBI annotation: gp04 # Family: family:all:396 # MgeID: mge:75 # MgeName: Bcep781 # Cross-refs: genbank:acc:NP_705631;genbank:gi:23752316;genbank:GeneID:955753 Probab=100.00 E-value=6.4e-140 Score=783.68 Aligned_cols=476 Identities=18% Similarity=0.213 Sum_probs=425.9 Q ss_pred CC---cCcCceeEEeecccccccccccccceEEEecccccccccCccceEEecCHHHHHhhcCCCcHHHHHHHHHhc--- Q lcl|NC_013597. 1 MA---LSISHIVNVQLNTVPKSAARKSFGIVALFTPEAGQAFADEKTRYVYVENQRDVEQLFGTNSETAKAAQPFFA--- 74 (502) Q Consensus 1 Ms---ip~s~iV~V~i~~~~~~~~~~~f~~~lil~~~~~~~~~~~~~r~~~y~s~~~v~~~fg~~s~ey~aA~~~F~--- 74 (502) |+ ||+||||||+|+|.++++.+++|+ +|||+++...| .+|+|.|+++++|++|||.+||||+||++||+ T Consensus 1 m~~~~ip~s~iV~V~~~v~~~~~~~~~~~-~lllt~~~~~~----~~r~~~y~s~~~V~~~FG~~S~ey~aA~~yFs~~~ 75 (501) T protein:vir:36 1 MPTTTIPIDQIVQMLPGVIGAGGAPGRLT-GLVLTQDTSVQ----PGQLADFFQETDVENWFGALSNEAKIADAYFPGIV 75 (501) T ss_pred CCcCCcccceEEEEeeeeccCCCcceeee-eEEEeccCCCC----CcceeeecCHHHHHHhcCCChHHHHHHHHHhhccc Confidence 98 999999999999999999999998 67888887654 37999999999999999999999999999998 Q ss_pred -CCCCcceEEEEEeecccccceeeeeeccchhhhHHHHHhhcccceeEEEEecCccccccccccccccchhhHHHHHHhh Q lcl|NC_013597. 75 -QSPRAKQLIVARWQKSASTIEATKNTLSGATLSDDLERFKSVVNGRFSLTIGGDVKKVDGLSFARLADFNAVATKIQEK 153 (502) Q Consensus 75 -q~p~P~~l~igr~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~iti~g~~~~~~~i~~s~~ts~~~vA~~i~aa 153 (502) |+|||++|+||||++++.++.++++++.+.+ ++.++.+ +|+|+++++|+.+ .++||||.+++++++|+.|+++ T Consensus 76 ~q~~~P~~l~igR~~~~a~~~~l~g~~l~~~~----~a~~~~~-sg~l~vti~g~~~-~~~i~lS~~ts~~~vA~~i~~a 149 (501) T protein:vir:36 76 NGGQLPYDLKFARYVAADAPASVYGIPLTGVT----LAQLQGY-SGTLTVTTAAQHV-SANISLAAATSFANAATLIEAA 149 (501) T ss_pred CCCccccEEEEEeecCcCcceeEeccchhhhh----hhhccce-eEEEEEEecceee-eeecccccccCHHHHHHHHhhh Confidence 9999999999999999999999999887644 4566776 5999999999965 5789999999999999999999 Q ss_pred hcccccceeEEEecccceeeEeeecccccccceeeeeeccccchhhhhhhhhhcccccceeeeeccccccccCHHHHHHH Q lcl|NC_013597. 154 LTTLSVAVSIAYDETGNRFIVSANVAGEDKKTEIDYAIDEGGEGEYIGALLKLENGQASRKVGKNSVSLKKETLGEALFN 233 (502) Q Consensus 154 l~~a~~~~tv~~~~~~~~f~~~s~ttG~~~~v~~~~a~~~~~t~t~~aa~l~~t~~~~~~~v~v~~~~~~~et~~~al~a 233 (502) |+.. .++|+|++...+|+++++++|..+.+++..+ ++++++.++++...+...+ +.+..+|+|.++|++ T Consensus 150 l~~~--~~tv~~d~~~~~f~i~s~t~G~~~~i~~~t~------~~~ia~~l~Lt~~~~a~v~---~~g~~~et~~~al~a 218 (501) T protein:vir:36 150 FTSP--DFVVAYDALRNRFTVVTNATGTAAAISAVTG------TNNFADEIGLSAAAGATLQ---AAGVAADTPASAMNR 218 (501) T ss_pred hcCc--ceEEEEcCcceeEEEEeccCCcceeeEeeec------ccchhhhhcccccCcceEE---ecccccccHHHHHHH Confidence 9864 4689999999999999999998887776542 4578999999998876433 467788999999999 Q ss_pred HHhccCceeEEEEecCCChhHHHHHHHHHhhcCCEEEEEecCch--hcc-cchhHHHHHHHHccCCceEEEecCCccchH Q lcl|NC_013597. 234 VAEVNNTWYGFTVAAQLTDSEVEAAAKYAQANTKLFGANVIRAE--QIE-WSADNIYKKLYDAGLDHTLAMFDKNDMYPV 310 (502) Q Consensus 234 l~~~~~~w~~~~~~~~~~~~~~~a~a~w~~a~~~~~~~~~~~~~--~~~-~~~~~i~~~l~~~~~~~t~~~y~~~~~~~~ 310 (502) +.+.+++||+|.+++++++++++++|+|+|+|+++|++..++.+ .+. ...++++++|+.++|+||+++||+ ++++ T Consensus 219 ~~~~s~~Wy~f~~a~~~~~~~~la~A~wiea~~~~f~~~~~~~~~~~~~~~~~~~i~~~l~~~~y~~t~~~y~~--~~~~ 296 (501) T protein:vir:36 219 AVGLSRNWATFTTAWTAVIADRLAFASWNSGQAYKYMYVAPDLEAASIVSNNAASFGAQVFAAPYQGTLPLYGD--QATA 296 (501) T ss_pred HHhccCceEEEEEecCCChHHHHHHHHHHhhcCceEEEEEecCchhhhhccchhhHHHHHHhcCCCcEEEEcCC--CCHH Confidence 99999999999999999999999999999999999887776654 333 357889999999999999999995 4567 Q ss_pred HHHHHHHHhcCCCCCCceeeEeeeec-CccccCCCCHHHHHHHHhCCceEEEEEcC----ceEEecCEeecC-eehhHHH Q lcl|NC_013597. 311 SSALARLLSTNFAANNSTLTLKFKQQ-PTITADEITATEFAKAKRLGINVYTYFDD----VAMIAEGTVIGG-KFADEIV 384 (502) Q Consensus 311 aa~~g~~as~n~~~~~g~~T~~fk~~-~Gv~~~~lt~t~~~~l~~~~~n~y~~~~~----~~~~~~G~~~~G-~~iD~~~ 384 (502) ++++|+++++||++.||++||||||+ +||+|++++++|+++|+++|||||+.+.+ ..++++|+|++| +|||++| T Consensus 297 aa~~g~~as~nf~~~~g~~T~~fkq~~~Gi~a~~l~~t~a~al~~~~~N~y~~~~~~~~~~~~~~~G~~sG~~~wiD~~~ 376 (501) T protein:vir:36 297 GAVMGYAASINFQLRNGRTVLAFRQFNAGVPATVHDLPTANALRSNNYTYIGAYANAANNYTIAYDGKLSGKFLWVDTYL 376 (501) T ss_pred HHHHHHHHhcCcccCcceeeeeccccCCCcCcCcCCHHHHHHHHhcCCcEEEEEecccceeeEEEcCeeeccchhhhHHH Confidence 89999999999999999999999997 89999999999999999999999999865 468999999987 8999999 Q ss_pred HHHHHHHHHHHHHHHHHHhcCCCCccCHHHHHHHHHHHHHHHHHHHHcCccccccccCccc---------cccccccccc Q lcl|NC_013597. 385 ILDWFVDAVQKEVFARLYKSPTKIPLTDKGQAILIAAVEKVCLEGINNGAFAPGKWTGAGF---------GNLSTGDYLD 455 (502) Q Consensus 385 ~~dwl~~~iq~~l~~~l~~~~~kiPyt~~G~~~l~~~v~~vl~~a~~~G~I~~g~~~~~~~---------g~~~~~~~~~ 455 (502) |+||||++||++||++| .+++|||||+.|+++|+++|+++|+|+++||+|+||+|+++.+ +...+++.++ T Consensus 377 g~dWL~~~iq~~l~~ll-~~~~KIPytd~G~~~l~a~i~~~l~~av~nG~I~~Gv~~~~~q~~~I~~~~g~~~~~~~v~~ 455 (501) T protein:vir:36 377 DQIYLNAELQRAEFEAM-LAYNSLPYNEDGYTGLYRAGVDVIDAAVTSGIIRAGVTLTNSQLQQIDKAARVAGAGQLVQT 455 (501) T ss_pred hHHHHHHHHHHHHHHHH-hcCCCCccChhhHHHHHHHHHHHHHHHHhCceeecCCCCCcccceeecccccccccccceec Confidence 99999999999999977 5578999999999999999999999999999999999977655 4466778899 Q ss_pred cceEEEcCchhcCCHHHHhhcccCceEEEEEECceEEEEEEEEEEeC Q lcl|NC_013597. 456 KGFYVWAAPMDTLSDSDRQARRATPIQTAVKLAGAIHSSDVIVNYNR 502 (502) Q Consensus 456 ~gy~v~~~~~~~~s~~dra~R~~~~i~~~~~~aGaIh~v~i~~~v~~ 502 (502) +|||+++++++. +++||++|++|+++|+|+++|+||+|+|.-+.+= T Consensus 456 ~Gyy~~~~~~~~-~~~~R~~R~~p~~~~~y~~~gaIh~v~i~s~~v~ 501 (501) T protein:vir:36 456 RGWYFLIGDPAN-PGQARQNRTTPACTLWYSDGGSIQSLTIGSNAVI 501 (501) T ss_pred cceEEeeCcccC-ChhhhhhcccCcEEEEEEeCCceeEEEeeeeeeC Confidence 999999988875 5679999999999999999999999999544444 No 6 >protein:vir:101576 Length: 501 # NCBI annotation: gp04 # Family: family:all:396 # MgeID: mge:1477 # MgeName: Bcep43 # Cross-refs: genbank:acc:NP_958108;genbank:gi:41057654;genbank:GeneID:2716834 Probab=100.00 E-value=6.7e-140 Score=783.57 Aligned_cols=476 Identities=18% Similarity=0.212 Sum_probs=427.5 Q ss_pred CC---cCcCceeEEeecccccccccccccceEEEecccccccccCccceEEecCHHHHHhhcCCCcHHHHHHHHHhc--- Q lcl|NC_013597. 1 MA---LSISHIVNVQLNTVPKSAARKSFGIVALFTPEAGQAFADEKTRYVYVENQRDVEQLFGTNSETAKAAQPFFA--- 74 (502) Q Consensus 1 Ms---ip~s~iV~V~i~~~~~~~~~~~f~~~lil~~~~~~~~~~~~~r~~~y~s~~~v~~~fg~~s~ey~aA~~~F~--- 74 (502) |+ ||+||||||+|+|.++++.+++|+ +|||++++..|+ +|++.|+|+++|++|||.+|||||||++||+ T Consensus 1 m~~~~ip~s~iV~V~~~v~~~~~~~~~~~-~l~l~~~~~~~~----~~~~~~~s~~~V~~~FG~~S~ey~aA~~yFsg~~ 75 (501) T protein:vir:10 1 MPTTTIPIDQIVQMLPGVIGAGGAPGRLT-GLVLTQDTSVQP----GQLADFFQETDVENWFGALSNEAKIADAYFPGIV 75 (501) T ss_pred CCCCCcccceEEEEeeecccCCCccccce-eEEEeccCCCCc----cceEEecCHHHHHHhcCCChHHHHHHHHHhhhhc Confidence 99 999999999999999999999998 678888876654 5667799999999999999999999999999 Q ss_pred -CCCCcceEEEEEeecccccceeeeeeccchhhhHHHHHhhcccceeEEEEecCccccccccccccccchhhHHHHHHhh Q lcl|NC_013597. 75 -QSPRAKQLIVARWQKSASTIEATKNTLSGATLSDDLERFKSVVNGRFSLTIGGDVKKVDGLSFARLADFNAVATKIQEK 153 (502) Q Consensus 75 -q~p~P~~l~igr~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~iti~g~~~~~~~i~~s~~ts~~~vA~~i~aa 153 (502) |+|||++|+||||++++.++.++++++.+. .+++|++++ |+|+|+++|+.+. .+||||.+++++++|+.|+++ T Consensus 76 ~q~p~P~~l~igR~~~~~~~~~l~g~~l~~~----~la~~~~~s-g~l~vti~g~~~~-~~i~ls~ats~~~vAs~i~~a 149 (501) T protein:vir:10 76 NGGQLPYDLKFARYVAADAPASVYGIPLTGV----TLAQLQGYS-GTLTVTTAAQHVS-ANISLAAATSFANAATLIEAA 149 (501) T ss_pred CCCccccEEEEEeecCCCccceEeccchhhh----hhhhcceee-eEEEEeeccceee-cccccccccCHHHHHHHHhhh Confidence 999999999999999999999998888654 356788885 9999999998654 689999999999999999999 Q ss_pred hcccccceeEEEecccceeeEeeecccccccceeeeeeccccchhhhhhhhhhcccccceeeeeccccccccCHHHHHHH Q lcl|NC_013597. 154 LTTLSVAVSIAYDETGNRFIVSANVAGEDKKTEIDYAIDEGGEGEYIGALLKLENGQASRKVGKNSVSLKKETLGEALFN 233 (502) Q Consensus 154 l~~a~~~~tv~~~~~~~~f~~~s~ttG~~~~v~~~~a~~~~~t~t~~aa~l~~t~~~~~~~v~v~~~~~~~et~~~al~a 233 (502) |+.. .++|+||+...+|+++++++|..+++++.. .+++++..|+++...+...+ +.+..+|+|.+++++ T Consensus 150 l~~~--~~tv~~d~~~~~f~its~ttG~~~~i~~~~------~~~~la~~l~Lt~~~~a~v~---~~g~~aet~~~a~~a 218 (501) T protein:vir:10 150 FTSP--DFVVAYDALRNRFTVVTNATGTAAAISAVT------GTNNLADELGLSAAAGATLQ---AAGVAADTPASAMNR 218 (501) T ss_pred ccCC--ceEEEEcccCceEEEEeeccCCceeEEEee------CchhhhhhcCccccccceEE---ecCcccccHHHHHHH Confidence 9864 478999999999999999999988877653 35689999999998876532 567789999999999 Q ss_pred HHhccCceeEEEEecCCChhHHHHHHHHHhhcCCEEEEEecCch--hcc-cchhHHHHHHHHccCCceEEEecCCccchH Q lcl|NC_013597. 234 VAEVNNTWYGFTVAAQLTDSEVEAAAKYAQANTKLFGANVIRAE--QIE-WSADNIYKKLYDAGLDHTLAMFDKNDMYPV 310 (502) Q Consensus 234 l~~~~~~w~~~~~~~~~~~~~~~a~a~w~~a~~~~~~~~~~~~~--~~~-~~~~~i~~~l~~~~~~~t~~~y~~~~~~~~ 310 (502) +.+.+++||+|.+++++++++++++|+|+|+|+++|++..++.+ .+. ...++++++|+.++|.|++++||+ ++++ T Consensus 219 ~~~~~~~Wy~f~~a~~~~~~~~la~A~wiea~~~~f~~~~~~~~~~~~~~~~~~~i~~~l~~~~y~~t~~~y~~--~~~~ 296 (501) T protein:vir:10 219 AVGLSRNWATFTTAWTAVIADRLAFAAWNSGQAYKYMYVAPDLEAASIVTNNAASFGAQVFAAPYQGTLPLYGD--QATA 296 (501) T ss_pred HHhccCceEEEEEecCCChHHHHHHHHHHHhcCceEEEEEecCchhhhhhhhhhhHHHHHHhcCCCceEEECCC--CcHH Confidence 99999999999999999999999999999999999887766655 343 356889999999999999999994 6788 Q ss_pred HHHHHHHHhcCCCCCCceeeEeeeecC-ccccCCCCHHHHHHHHhCCceEEEEEcCc----eEEecCEeecC-eehhHHH Q lcl|NC_013597. 311 SSALARLLSTNFAANNSTLTLKFKQQP-TITADEITATEFAKAKRLGINVYTYFDDV----AMIAEGTVIGG-KFADEIV 384 (502) Q Consensus 311 aa~~g~~as~n~~~~~g~~T~~fk~~~-Gv~~~~lt~t~~~~l~~~~~n~y~~~~~~----~~~~~G~~~~G-~~iD~~~ 384 (502) +++||+++++||++.||++||||||++ ||+|++++++|+++|+++|||||+.|++. .++++|+|+++ +|||.++ T Consensus 297 aa~~g~~as~nf~~~~g~~T~~fkq~~~Gi~a~~lt~t~a~al~~~~~N~y~~~~~~~~~~~~~~~G~~sG~~~wiD~~~ 376 (501) T protein:vir:10 297 GAVMGYAASINFQLRNGRTVLAFRQFNAGVPATAHDLPTANALRSNNYTYIGAYANAANNYTIAYDGKLSGKFLWVDTYL 376 (501) T ss_pred HHHHHHHHhhCcccCccceeeeccccCCCcCcccCCHHHHHHHHhcCCeEEEEeccccceeeEEecCeeeccceeehhhh Confidence 999999999999999999999999986 99999999999999999999999999654 58899999988 8999999 Q ss_pred HHHHHHHHHHHHHHHHHHhcCCCCccCHHHHHHHHHHHHHHHHHHHHcCccccccccCccc---------cccccccccc Q lcl|NC_013597. 385 ILDWFVDAVQKEVFARLYKSPTKIPLTDKGQAILIAAVEKVCLEGINNGAFAPGKWTGAGF---------GNLSTGDYLD 455 (502) Q Consensus 385 ~~dwl~~~iq~~l~~~l~~~~~kiPyt~~G~~~l~~~v~~vl~~a~~~G~I~~g~~~~~~~---------g~~~~~~~~~ 455 (502) |+|||+++||+++|++|++ ++|||||+.|+++|+++|+++|+++++||+|+||+|+++.| +...++|.++ T Consensus 377 ~~~Wl~~~iq~~l~~ll~~-~~kIPyt~~G~~~l~a~v~~~l~~av~nG~I~~Gv~l~~~q~~~i~~~~g~~~~~~~v~~ 455 (501) T protein:vir:10 377 DQIYLNAELQRAEFEAMLA-YNSLPYNEDGYTALYRAGVDVIDAAVTSGIIRAGVTLTNSQLQQIDAAAGVAGAGQLVQT 455 (501) T ss_pred hHHHHHHHHHHHHHHHHHh-cCCcccCHHHHHHHHHHHHHHHHHHHhCceeecCCCCCcccceeeccccCccccccceec Confidence 9999999999999998765 58999999999999999999999999999999999987766 4466788999 Q ss_pred cceEEEcCchhcCCHHHHhhcccCceEEEEEECceEEEEEEEEEEeC Q lcl|NC_013597. 456 KGFYVWAAPMDTLSDSDRQARRATPIQTAVKLAGAIHSSDVIVNYNR 502 (502) Q Consensus 456 ~gy~v~~~~~~~~s~~dra~R~~~~i~~~~~~aGaIh~v~i~~~v~~ 502 (502) +|||+++++++. +++||++|++|+++|+|+++|+||+|+|.-+.+= T Consensus 456 ~Gyy~~~~~~~~-~~~~R~~R~~p~~~~~y~~~gaIh~v~i~s~~v~ 501 (501) T protein:vir:10 456 RGWYFLIGDPAN-PGQARQNRTTPACTLWYSDGGSIQQLTIGSNAVI 501 (501) T ss_pred cceeEeeccccC-ChhhhhhccccceEEEEEeCCceeEEEeeeeecC Confidence 999999998875 5569999999999999999999999999544444 No 7 >protein:vir:96104 Length: 504 # NCBI annotation: hypothetical protein ORF029 # Family: family:all:396 # MgeID: mge:1597 # MgeName: F8 # Cross-refs: genbank:acc:YP_001294446;genbank:gi:149408343;genbank:GeneID:5237223 Probab=100.00 E-value=2.3e-137 Score=769.71 Aligned_cols=476 Identities=14% Similarity=0.179 Sum_probs=419.7 Q ss_pred CCcCcCceeEEeecccccccccccccceEEEecccccccccCccceEEecCHHHHHhhcCCCcHHHHHHHHHhcCCC--- Q lcl|NC_013597. 1 MALSISHIVNVQLNTVPKSAARKSFGIVALFTPEAGQAFADEKTRYVYVENQRDVEQLFGTNSETAKAAQPFFAQSP--- 77 (502) Q Consensus 1 Msip~s~iV~V~i~~~~~~~~~~~f~~~lil~~~~~~~~~~~~~r~~~y~s~~~v~~~fg~~s~ey~aA~~~F~q~p--- 77 (502) |- |+||||||+|+|.++++.+++|+.+|||++|+.+|+ ||+|+|+|+++|++|||.+|||||||++||+|+| T Consensus 1 mi-p~s~iV~V~~~v~~~~~~~~~~~~~l~l~~~~~~~~----~r~~~y~s~~~V~~~FG~~S~ey~aA~~yF~~~~~~~ 75 (504) T protein:vir:96 1 MI-SQSRYIRIISGVGAGAPVAGRKLILRVMTTNNVIPP----GIVIEFDNANAVLSYFGAQSEEYQRAAAYFKFISKSV 75 (504) T ss_pred CC-CccceeEeeecccccccccccccceeEeecccCCCc----cceEEecCHHHHHHhcCCChHHHHHHHHHhhcCCCCC Confidence 75 999999999999999999999999999999986643 8999999999999999999999999999999988 Q ss_pred -CcceEEEEEeecccccceeeeeeccchhhhHHHHHhhcccceeEEEEecCccccccccccccccchhhHHHHHHhhhcc Q lcl|NC_013597. 78 -RAKQLIVARWQKSASTIEATKNTLSGATLSDDLERFKSVVNGRFSLTIGGDVKKVDGLSFARLADFNAVATKIQEKLTT 156 (502) Q Consensus 78 -~P~~l~igr~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~iti~g~~~~~~~i~~s~~ts~~~vA~~i~aal~~ 156 (502) ||++|+||||++++..+.+++ .++++.+.+++++++|+|+|+|+|+.+++++||||.+++|+++|+.|++++++ T Consensus 76 ~~P~~l~igR~~~~a~~~~l~g-----~~~~~~~~~~~~i~~G~lsitv~G~~~~~~~i~~S~~ts~~~vA~~i~~al~~ 150 (504) T protein:vir:96 76 NSPSSISFARWVNTAIAPMVVG-----DNLPKTIADFAGFSAGVLTIMVGAAEKNITAIDTSAATSMDNVASIIQTEIRK 150 (504) T ss_pred ccccEEEEEeecCcCccceEEe-----chhHHHHHHHhhhhceEEEEEEcceeeeecccccccccchHHHHHHHHhhhhc Confidence 999999999999877655554 55567888999999999999999999999999999999999999999999987 Q ss_pred cc----cceeEEEecccceeeEeeecccccccceeeeeeccccchhhhhhhhhhcccccceeeeeccccccccCHHHHHH Q lcl|NC_013597. 157 LS----VAVSIAYDETGNRFIVSANVAGEDKKTEIDYAIDEGGEGEYIGALLKLENGQASRKVGKNSVSLKKETLGEALF 232 (502) Q Consensus 157 a~----~~~tv~~~~~~~~f~~~s~ttG~~~~v~~~~a~~~~~t~t~~aa~l~~t~~~~~~~v~v~~~~~~~et~~~al~ 232 (502) .. ..++|+||...++|+++++++|..+... ...+++++++++++++..+... ..+.++|+|.++|+ T Consensus 151 ~~~~~~~~~tv~~d~~~~~f~its~~tg~~~~~~-----~~~a~~~~~~~~lgl~~~~~~~-----v~g~~aet~~~al~ 220 (504) T protein:vir:96 151 NTDPQLAQATVTWNPNTNQFTLVGATIGTGVLAV-----AKSADPQDMSTALGWSTSNVVN-----VAGQAADLPDAAVA 220 (504) T ss_pred ccccccccceEEEeccCCeEEEEeeccccceeEE-----EeeccccchhhhhhcccccceE-----EeecccccHHHHHH Confidence 54 3478999999999999999998754332 2345677899999998654432 34678899999999 Q ss_pred HHHhccCceeEEEEecC-CChhHHHHHHHHHhhcCCEEEEEecCchhcccchhHHHHHHHHccCCceEEEecCC--ccch Q lcl|NC_013597. 233 NVAEVNNTWYGFTVAAQ-LTDSEVEAAAKYAQANTKLFGANVIRAEQIEWSADNIYKKLYDAGLDHTLAMFDKN--DMYP 309 (502) Q Consensus 233 al~~~~~~w~~~~~~~~-~~~~~~~a~a~w~~a~~~~~~~~~~~~~~~~~~~~~i~~~l~~~~~~~t~~~y~~~--~~~~ 309 (502) ++.+++++||+|.++++ .++++++++|+|+|+|+++|++.+++... ...+. ..+...++.+++.+||.. .+|+ T Consensus 221 al~~~~~~Wy~f~~a~~~~~dd~ilalA~w~ea~~~~~~~~~~~~~~---~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~ 296 (504) T protein:vir:96 221 KSTNVSNNFGSFLFAGATLDNDQIKAVSAWNAAQNNQFIYTVATSLA---NLGAL-FDLVKGNSGTALNVLSATASNDFV 296 (504) T ss_pred HHHhhcCCeEEEEEEeccCCHHHHHHHHHHHhhcCceEEEEEeeccc---chhhH-HHhhhhcceeEEEEeecCccchhH Confidence 99999999999998876 67899999999999999999988776432 12222 234445556677777654 4688 Q ss_pred HHHHHHHHHhcCCCCCCceeeEeeeecCccccCCCCHHHHHHHHhCCceEEEEEcCc----eEEecCEeecCe----ehh Q lcl|NC_013597. 310 VSSALARLLSTNFAANNSTLTLKFKQQPTITADEITATEFAKAKRLGINVYTYFDDV----AMIAEGTVIGGK----FAD 381 (502) Q Consensus 310 ~aa~~g~~as~n~~~~~g~~T~~fk~~~Gv~~~~lt~t~~~~l~~~~~n~y~~~~~~----~~~~~G~~~~G~----~iD 381 (502) ++..+|+++++||++.||++|||||+++||+|++++++|+++|+++|||||+.+.+. .+++||+|++|+ ||| T Consensus 297 ~~~~~~~~as~~f~~~ng~~T~~fk~l~GVta~~lt~t~~~aL~~~~~N~y~~~a~~~~~~~~~~~G~~~gG~~~~~wiD 376 (504) T protein:vir:96 297 EQCPSEILAATNYDEPGASQNYMYYQFPGRNITVSDDTAANTVDKSRGNYIGVTQANGQQLAFYQRGILCGGPTDAVDMN 376 (504) T ss_pred HHHHHHHHHhcCcCcccccccccccccCCcCcccCCHHHHHHHHhcCCeEEEEeecccceeeEEecCeeeCCccccchhh Confidence 899999999999999999999999999999999999999999999999999999753 588999999997 799 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcCCCCccCHHHHHHHHHHHHHHHHHHHHcCccccccccCcccc---------cccccc Q lcl|NC_013597. 382 EIVILDWFVDAVQKEVFARLYKSPTKIPLTDKGQAILIAAVEKVCLEGINNGAFAPGKWTGAGFG---------NLSTGD 452 (502) Q Consensus 382 ~~~~~dwl~~~iq~~l~~~l~~~~~kiPyt~~G~~~l~~~v~~vl~~a~~~G~I~~g~~~~~~~g---------~~~~~~ 452 (502) ++++++|||++||++|+++| ++++|||||+.|+++|+++|+++|+++++||+|+||+|++..+. ....+| T Consensus 377 v~~~~~WL~~~lq~~l~~l~-~~~~kIPyt~~Gi~~l~a~i~~vl~~av~~G~I~~Gv~~~~~q~~~I~~~~~~d~~~~~ 455 (504) T protein:vir:96 377 VYANEIWLKSAIAQALLDLF-LNVNAVPASMVGEAMTLAVLQPVLDKATSNGTFTYGKDISAVQQQYITQITGDRRAWRQ 455 (504) T ss_pred hhhhHHHHHHHHHHHHHHHH-hcCCCcccCHhhHHHHHHHHHHHHHHHHhcceeccCccCCccchheecccccccccccc Confidence 99999999999999999965 65789999999999999999999999999999999999886543 344578 Q ss_pred ccccceEEEcCchhcCCHHHHhhcccCceEEEEEECceEEEEEEEEEEe Q lcl|NC_013597. 453 YLDKGFYVWAAPMDTLSDSDRQARRATPIQTAVKLAGAIHSSDVIVNYN 501 (502) Q Consensus 453 ~~~~gy~v~~~~~~~~s~~dra~R~~~~i~~~~~~aGaIh~v~i~~~v~ 501 (502) .+.+||||++|++++++++||++|++|+|+|+|+++|+||+|+|..+.+ T Consensus 456 ~~~~GYyv~~~~~s~~s~~~r~~R~~~~~~~~y~~~gaI~~v~~~~~~v 504 (504) T protein:vir:96 456 VQTLGYWINITFSSYTNSNTGLTEWKANYTLIYSKGDAIRFVEGSDVMI 504 (504) T ss_pred eeccceEEEecChhccChhHhhhccccceEEEEEECCeEEEEEeccccC Confidence 8999999999999999999999999999999999999999999999988 No 8 >protein:vir:94073 Length: 494 # NCBI annotation: hypothetical protein # Family: family:all:396 # MgeID: mge:1493 # MgeName: OP2 # Cross-refs: genbank:acc:YP_453619;genbank:gi:84662655;genbank:GeneID:5142583 Probab=100.00 E-value=3.5e-136 Score=763.21 Aligned_cols=472 Identities=18% Similarity=0.223 Sum_probs=421.8 Q ss_pred CC-cCcCceeEEeecccccccccccccceEEEecccccccccCccceEEecCHHHHHhhcCCCcHHHHHHHHHhc----C Q lcl|NC_013597. 1 MA-LSISHIVNVQLNTVPKSAARKSFGIVALFTPEAGQAFADEKTRYVYVENQRDVEQLFGTNSETAKAAQPFFA----Q 75 (502) Q Consensus 1 Ms-ip~s~iV~V~i~~~~~~~~~~~f~~~lil~~~~~~~~~~~~~r~~~y~s~~~v~~~fg~~s~ey~aA~~~F~----q 75 (502) |+ ||+||||||+|+|.++++.+++|+.+||++.+. . +.||+|.|+++++|++|||.+|||||||++||+ | T Consensus 1 m~~ip~s~iV~V~~~v~~~~~~~~~f~~~l~~~~~~-~----~~~r~~~y~s~~~V~~~FG~~S~ey~aA~~yFs~~~~q 75 (494) T protein:vir:94 1 MPNIPISQIVSINPQVVSAGGTQGTLDGLLLTQATG-F----PVTQPQVYFSAADVGTAFGLTSDEYNAALVYFAGILGG 75 (494) T ss_pred CCCCCcccEEEeeeeccccCCcccccceeEeecCcc-C----CccceeeecCHHHHHHhcCCChHHHHHHHHHhhhccCC Confidence 99 999999999999999999999999777766553 2 248999999999999999999999999999999 9 Q ss_pred CCCcceEEEEEeecccccceeeeeeccchhhhHHHHHhhcccceeEEEEecCccccccccccccccchhhHHHHHHhhhc Q lcl|NC_013597. 76 SPRAKQLIVARWQKSASTIEATKNTLSGATLSDDLERFKSVVNGRFSLTIGGDVKKVDGLSFARLADFNAVATKIQEKLT 155 (502) Q Consensus 76 ~p~P~~l~igr~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~iti~g~~~~~~~i~~s~~ts~~~vA~~i~aal~ 155 (502) +|||++|+||||++++.++.+++ .+++..++.++.+ +|+|+++++|+ ++..+||||.+++++++|+.|+++|+ T Consensus 76 ~p~P~~l~igR~~~~a~~~~l~g-----~~~~~tl~~~~~~-~g~l~iti~g~-~~~~~i~lS~~ts~~~vA~~i~~ai~ 148 (494) T protein:vir:94 76 GQQPASLTIGRYASAATSAAVFG-----APLTLSLAQLQTL-SGTLIVTTDTQ-RTSAAINLSGATSFANAASLMTSGFT 148 (494) T ss_pred CccccEEEEEeecCccccceeec-----cchhhhHHhhhhc-ceEEEEEEcce-EEEeeecccccCChhhHHHHHhhhhc Confidence 99999999999999877766654 4555677788887 69999999995 67899999999999999999999998 Q ss_pred ccccceeEEEecccceeeEeeecccccccceeeeeeccccchhhhhhhhhhcccccceeeeeccccccccCHHHHHHHHH Q lcl|NC_013597. 156 TLSVAVSIAYDETGNRFIVSANVAGEDKKTEIDYAIDEGGEGEYIGALLKLENGQASRKVGKNSVSLKKETLGEALFNVA 235 (502) Q Consensus 156 ~a~~~~tv~~~~~~~~f~~~s~ttG~~~~v~~~~a~~~~~t~t~~aa~l~~t~~~~~~~v~v~~~~~~~et~~~al~al~ 235 (502) .. .++|+||+...+|.++++++|..+.+++.. .+++..++++...++..+ ..+.++|+|.++++++. T Consensus 149 ~a--~~~v~~d~~~~~f~v~s~ttG~~s~is~~t--------~~~a~~l~lt~~~~a~v~---~~g~~aet~~~a~~a~~ 215 (494) T protein:vir:94 149 TP--NFAITYDAQRRRFVLSTTATGTTASVSAVT--------GTLADGVGLSTASGAYVE---GSGLAADTAASALDRLA 215 (494) T ss_pred cc--cceEEEcccCcEEEEEEccCCceeEEEEec--------cchhhhhhhhccccceEe---ecCcccccHHHHHHHHH Confidence 64 357999999999999999999877655432 257889999987765433 46778999999999999 Q ss_pred hccCceeEEEEecCCChhHHHHHHHHHhhcCCEEEEEecCch--hcc-cchhHHHHHHHHccCCceEEEecCCccchHHH Q lcl|NC_013597. 236 EVNNTWYGFTVAAQLTDSEVEAAAKYAQANTKLFGANVIRAE--QIE-WSADNIYKKLYDAGLDHTLAMFDKNDMYPVSS 312 (502) Q Consensus 236 ~~~~~w~~~~~~~~~~~~~~~a~a~w~~a~~~~~~~~~~~~~--~~~-~~~~~i~~~l~~~~~~~t~~~y~~~~~~~~aa 312 (502) +.+++||+|++++++++++++++|+|+|+|+++|++..++.+ .++ ...++++++|++++|+||+++||+..+ +++ T Consensus 216 ~~~~~Wy~f~~~~~~~~~~ilalA~wiea~~~~~~~~~~~~d~~~~~~~~~~~i~~~l~~~~y~~t~~~y~~~~~--~aa 293 (494) T protein:vir:94 216 ASSSTWAIFTTAWAASLSDRTALAQWTSDQVFRRIYAAWDQDAAGLSVNNVSSFGNIVKTTPFSNTIPVYGLLAN--AMI 293 (494) T ss_pred hccCceEEEEEecCCCHHHHHHHHHHHhhcCccEEEEEecCCcceeecccchhHHHHHHhhcCCceEEEcCCCCh--HHH Confidence 999999999999999999999999999999998877665554 443 457899999999999999999997654 688 Q ss_pred HHHHHHhcCCCCCCceeeEeee-ecCccccCCCCHHHHHHHHhCCceEEEEEcCc---eEEecCEeecCe--ehhHHHHH Q lcl|NC_013597. 313 ALARLLSTNFAANNSTLTLKFK-QQPTITADEITATEFAKAKRLGINVYTYFDDV---AMIAEGTVIGGK--FADEIVIL 386 (502) Q Consensus 313 ~~g~~as~n~~~~~g~~T~~fk-~~~Gv~~~~lt~t~~~~l~~~~~n~y~~~~~~---~~~~~G~~~~G~--~iD~~~~~ 386 (502) ++|++++.+|+..+|++||||| +++|++|++++.+|+++|+++|||||+.|++. ..+++|++++|+ |||.++++ T Consensus 294 ~~g~~aa~~~~~~~g~~T~~~k~q~~gi~~~~l~~t~a~al~~~~~N~y~~~~~~~~~~~~~~gg~~sG~~~~id~~~~~ 373 (494) T protein:vir:94 294 VLAWGASTNLQIAEGRTTLALRSPVSSAGVRVDNLANANALLSNGYTYLGKYASATNTYTVTYNGAIGGQFLWADTALGW 373 (494) T ss_pred HHHHHHhccccccCcceeEEeeccCCCCCCccCCHHHHHHHHhcCCeEEEEecccCceEEEecCceeccccceeeeeccH Confidence 9999999999999999999999 68999999999999999999999999999753 466788888997 68888899 Q ss_pred HHHHHHHHHHHHHHHHhcCCCCccCHHHHHHHHHHHHHHHHHHHHcCccccccccCcc--------ccccccccccccce Q lcl|NC_013597. 387 DWFVDAVQKEVFARLYKSPTKIPLTDKGQAILIAAVEKVCLEGINNGAFAPGKWTGAG--------FGNLSTGDYLDKGF 458 (502) Q Consensus 387 dwl~~~iq~~l~~~l~~~~~kiPyt~~G~~~l~~~v~~vl~~a~~~G~I~~g~~~~~~--------~g~~~~~~~~~~gy 458 (502) +|||++||++||++|++ ++|||||+.|+++|+++|+++|+|+++||+|+||+|++++ .|.+..++++++|| T Consensus 374 ~WL~~~iq~~l~~ll~~-~~KIPytd~G~~~l~a~i~~~l~~av~nG~I~~Gv~~~~~q~~~i~~~~G~~~~~~~~~kGy 452 (494) T protein:vir:94 374 IALRRNLQQALFETLLA-YRSLPYNADGYNALYQGAQDVVSQFVAAGVIRAGVALSASQRAQIDQAAGVPISGDVVDKGW 452 (494) T ss_pred HHHHHHHHHHHHHHHHh-CCCcccChhhHHHHHHHHHHHHHHHHhCceeecccccCcchhhhhhhhhcCccccceeccce Confidence 99999999999998875 5899999999999999999999999999999999999997 68899999999999 Q ss_pred EEEcCchhcCCHHHHhhcccCceEEEEEECceEEEEEEEEEEeC Q lcl|NC_013597. 459 YVWAAPMDTLSDSDRQARRATPIQTAVKLAGAIHSSDVIVNYNR 502 (502) Q Consensus 459 ~v~~~~~~~~s~~dra~R~~~~i~~~~~~aGaIh~v~i~~~v~~ 502 (502) |++.. +.+++++|++|.+|+++|+|+++||||+|+|+++++- T Consensus 453 y~~~~--~~~s~~~ra~R~~~~~~~~y~~~GAIh~v~i~~~~v~ 494 (494) T protein:vir:94 453 YLQVI--DPITTTVRTDRGSPTVNFWYCDGGSIQRVVVSATTVI 494 (494) T ss_pred eeecc--CCCChhhhhccccCCceEEEEecCcEEEEEEeeEEeC Confidence 99962 4578899999999999999999999999999999999 No 9 >protein:vir:107720 Length: 515 # NCBI annotation: gp17 # Family: family:all:396 # MgeID: mge:1520 # MgeName: BcepB1A # Cross-refs: genbank:acc:YP_024865;genbank:gi:48697507;genbank:GeneID:2948336 Probab=100.00 E-value=3.8e-133 Score=746.57 Aligned_cols=483 Identities=18% Similarity=0.224 Sum_probs=422.9 Q ss_pred CCcCcCceeEEeeccc-ccccccccccceEEEecccccccccCccceEEecCHHHHHhhcCCCcHHHHHHHHHhc----C Q lcl|NC_013597. 1 MALSISHIVNVQLNTV-PKSAARKSFGIVALFTPEAGQAFADEKTRYVYVENQRDVEQLFGTNSETAKAAQPFFA----Q 75 (502) Q Consensus 1 Msip~s~iV~V~i~~~-~~~~~~~~f~~~lil~~~~~~~~~~~~~r~~~y~s~~~v~~~fg~~s~ey~aA~~~F~----q 75 (502) |+||++++|+|++++. +.+...++|+ +|||+++...|+ ||+|+|+|+++|++|||.+|||||||++||+ | T Consensus 1 m~I~~~~~V~i~~~v~aa~~~~~~~f~-~li~t~~~~~p~----~r~~~y~s~~~V~~~FG~~S~ey~aA~~yFsg~~~q 75 (515) T protein:vir:10 1 MPISFDKYVAITSGVAAQQQIAARSFA-IRVYTPNPMVSV----DRLITATSAADVGAYFGTASEEYKRAVKNFGFISKK 75 (515) T ss_pred CCCCceeEEEeecccccCCccccccce-eeeeecccCCCc----cceeeecCHHHHHHhcCCChHHHHHHHHHhhhccCC Confidence 9999999999999874 4556677999 688888876654 7999999999999999999999999999999 9 Q ss_pred CCCcceEEEEEeecccccceeeeeeccchhhhHHHHHhhcccceeEEEEecCccc-cccccccccccchhhHHHHHHhhh Q lcl|NC_013597. 76 SPRAKQLIVARWQKSASTIEATKNTLSGATLSDDLERFKSVVNGRFSLTIGGDVK-KVDGLSFARLADFNAVATKIQEKL 154 (502) Q Consensus 76 ~p~P~~l~igr~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~iti~g~~~-~~~~i~~s~~ts~~~vA~~i~aal 154 (502) +|||++|+||||++++.++.++++.+.+. .++.|+.+++|+|+|+|||+.+ ++++||||.+++++++|+.|+++| T Consensus 76 ~p~P~~L~igR~~~~a~~~~l~g~~~~~~----~l~~~~~is~G~ltitidG~~~~t~s~i~~S~ats~~~vAs~i~tal 151 (515) T protein:vir:10 76 TRRPTSIQFARWQREAGPVAIYGGAKKAA----ALATLQAVTAGAISFLFGGATTVTVSGISFSAATSLADVASELQTAL 151 (515) T ss_pred cccccEEEEEeccCcccceEEEeccchhh----hHHhhhcccceeEEEEEcceEEEEeeccccccccCHHHHHHHHHhhh Confidence 99999999999999999999999988754 5678999999999999999886 688999999999999999999999 Q ss_pred cccc----cceeEEEecccceeeEeeecccccccceeeeeeccccchhhhhhhhhhcccccceeeeeccccccccCHHHH Q lcl|NC_013597. 155 TTLS----VAVSIAYDETGNRFIVSANVAGEDKKTEIDYAIDEGGEGEYIGALLKLENGQASRKVGKNSVSLKKETLGEA 230 (502) Q Consensus 155 ~~a~----~~~tv~~~~~~~~f~~~s~ttG~~~~v~~~~a~~~~~t~t~~aa~l~~t~~~~~~~v~v~~~~~~~et~~~a 230 (502) .... ..++|+||+..++|+++++++|..+++++.+... +.++++++++|+++++.++.. +.+.++|+|.++ T Consensus 152 ~~~~~~~~~~~tv~~d~~~~~F~v~s~~tG~~~~is~~~~t~-~~~~t~~a~~lglt~~~~av~----~~g~aaet~~~a 226 (515) T protein:vir:10 152 RANADANLATCTVSYDPVGARFNFAGSPSDDTVQESISIVPQ-SNPAIDVAQLLGWNSAQGASY----IAASPVVSPVDT 226 (515) T ss_pred ccccccccceeEEEEecCCCeEEEEEeecCCceeEEEEEecC-CCchhhHHHHhccccccceEE----ecccccccHHHH Confidence 8754 3478999999999999999999999988877654 356899999999999888765 467889999999 Q ss_pred HHHHHhccCceeEEEEecC----CChhHHHHHHHHHhhcCCEEEEEecCchhcccchhHHHHHHHHccCCceEEEecCCc Q lcl|NC_013597. 231 LFNVAEVNNTWYGFTVAAQ----LTDSEVEAAAKYAQANTKLFGANVIRAEQIEWSADNIYKKLYDAGLDHTLAMFDKND 306 (502) Q Consensus 231 l~al~~~~~~w~~~~~~~~----~~~~~~~a~a~w~~a~~~~~~~~~~~~~~~~~~~~~i~~~l~~~~~~~t~~~y~~~~ 306 (502) |+++.+.++|||+|+++++ .++++++++++|+|+++++|++.+.+...... ..+. .....+.+.++...++... T Consensus 227 ~~a~~~~s~nWy~f~~a~~~~~~~~~a~~~a~a~~~e~~~~~~~~~~~~~~~~~~-~~~a-~~~~~~~~~~~~~~~~~~~ 304 (515) T protein:vir:10 227 LIASVAGNNNFGSILFTKNGGTGITLSDAEAIALQNQSYNVAYKFQVGVDDTTYS-SWQA-ALAAIGGVNMIYSPVALAA 304 (515) T ss_pred HHHHHhccCCeEEEEEeecCccccchhHHHHHHHHHhhcCceEEEEeccCcccee-chhh-hhhhhhhcCceEEEEeccC Confidence 9999999999999999864 45899999999999999999887755443222 2222 2234456788898888888 Q ss_pred cchHHHHHHHHHhcCCCCCCceeeEeeeecCccccCCCCHHHHHHHHhCCceEEEEEcC----ceEEecCEeecCe---- Q lcl|NC_013597. 307 MYPVSSALARLLSTNFAANNSTLTLKFKQQPTITADEITATEFAKAKRLGINVYTYFDD----VAMIAEGTVIGGK---- 378 (502) Q Consensus 307 ~~~~aa~~g~~as~n~~~~~g~~T~~fk~~~Gv~~~~lt~t~~~~l~~~~~n~y~~~~~----~~~~~~G~~~~G~---- 378 (502) +|++++++|+++++||++.+|++||||||+|||+|++++++|+++|++||||||+.|.+ ..|++||+|++|+ T Consensus 305 ~~~~a~~~g~~asvnf~~~ng~iT~kfKq~~Gita~~lt~t~a~al~~~~~N~Y~~~~~~~~~~~~~~~G~~~gG~~~~~ 384 (515) T protein:vir:10 305 EYHDMQDGIIEAATDFTQQGGATGYMYVQFNNQTPAVNDDTLSGILDDLNINYYGQTQVNGTNLSFYQDGVMMGGPTDPR 384 (515) T ss_pred cchHHHHHHHHHhcCCCccchhheeccccCCCCccccCCHHHHHHHHhcCCeEEEEEeccCceEEEEeCCeeeCCccchh Confidence 89999999999999999999999999999999999999999999999999999999965 5689999999997 Q ss_pred ehhHHHHHHHHHHHHHHHHHHHHHhcCCCCccCHHHHHHHHHHH-HHHHHHHHHcCccccccccCccc--------ccc- Q lcl|NC_013597. 379 FADEIVILDWFVDAVQKEVFARLYKSPTKIPLTDKGQAILIAAV-EKVCLEGINNGAFAPGKWTGAGF--------GNL- 448 (502) Q Consensus 379 ~iD~~~~~dwl~~~iq~~l~~~l~~~~~kiPyt~~G~~~l~~~v-~~vl~~a~~~G~I~~g~~~~~~~--------g~~- 448 (502) |||++||+|||+++||++||++| .+++|||||+.|+++|++.| +++|++|++||+|+||+++++.| |.. T Consensus 385 WiD~~~g~~WL~~~iq~~l~~L~-~s~~KIPytd~G~a~i~a~v~q~vl~~av~nG~I~~Gv~ls~~Q~~~i~~~~g~d~ 463 (515) T protein:vir:10 385 DSNVYANEQWLKSYAGASFMSLQ-LAQGKIPANIEGRGLLLGKMTKDIIPAAKLNGTFSIGKTLTVDQQLFVTELTGDDT 463 (515) T ss_pred HHHHHhhHHHHHHHHHHHHHHHH-hcCCCCccChhhHHHHHHHHHHHHHHHHHhCCeeecCcccchhHHHHHHhhhcCcc Confidence 79999999999999999999976 66789999999999999987 57999999999999999999886 333 Q ss_pred ccccccccceEEEcCchhcCCHHHHhhcccCceEEEEEECceEEEEEEEEEEe Q lcl|NC_013597. 449 STGDYLDKGFYVWAAPMDTLSDSDRQARRATPIQTAVKLAGAIHSSDVIVNYN 501 (502) Q Consensus 449 ~~~~~~~~gy~v~~~~~~~~s~~dra~R~~~~i~~~~~~aGaIh~v~i~~~v~ 501 (502) .+++.+.+|||+++++.+.+.+.+|..++. ++.|||+++|+||+|+++-+.+ T Consensus 464 ~~~~~~~~Gyy~~~~~~~~~~~~~r~~~~~-~~~~~y~~g~~i~~i~~~~~~v 515 (515) T protein:vir:10 464 AWQKVQNLGYWYDVQISSFVDTGGTTKYQA-VYSLVYSKDDLIRKVVGTHTLI 515 (515) T ss_pred cccchhhcceeEecCcCCCCCcccccccCc-eeEEEEEcCceEEEEEeeeecC Confidence 578899999999999987666444443333 4589999999999999999999 No 10 >protein:vir:95263 Length: 450 # NCBI annotation: Phage conserved protein # Family: family:all:396 # MgeID: mge:1561 # MgeName: Felix 01 # Cross-refs: genbank:acc:NP_944896;genbank:gi:38707836;genbank:GeneID:2744049 Probab=100.00 E-value=5.3e-122 Score=685.48 Aligned_cols=435 Identities=18% Similarity=0.225 Sum_probs=377.1 Q ss_pred CcCceeEEeecccccccccccccceEEEecccccccccCccceEEecCHHHHHhhcCCCcHHHHHHHHHhcCCCCcceEE Q lcl|NC_013597. 4 SISHIVNVQLNTVPKSAARKSFGIVALFTPEAGQAFADEKTRYVYVENQRDVEQLFGTNSETAKAAQPFFAQSPRAKQLI 83 (502) Q Consensus 4 p~s~iV~V~i~~~~~~~~~~~f~~~lil~~~~~~~~~~~~~r~~~y~s~~~v~~~fg~~s~ey~aA~~~F~q~p~P~~l~ 83 (502) --||||||+|+++++++.+++|+.+||++++... .||+|.|+++++|++|||.+|||||||++||+|+|+|++|+ T Consensus 1 ~~s~iVnV~i~~~~~a~~~~~f~~~l~~~~~~~~-----~~r~~~yss~~~V~~~FG~~S~ey~aA~~yF~q~p~p~~l~ 75 (450) T protein:vir:95 1 MWNPIVNVDITLNTAGTTREGFGLPLFLASTDNF-----EERVRGYTSLTEVAEDFDENTAAYKAAKQLWSQTPKVTQLY 75 (450) T ss_pred CCCceEEEeecccccccccccceeEEEEcCCCCC-----ccceeeecCHHHHHHhcCCCcHHHHHHHHHHhCCCcccEEE Confidence 2499999999999999999999999999988632 48999999999999999999999999999999999999999 Q ss_pred EEEeecccccceeeeeeccchhhhHHHHHhhcccceeEEEEecCccccccccccccccchhhHHHHHHhhhcccccceeE Q lcl|NC_013597. 84 VARWQKSASTIEATKNTLSGATLSDDLERFKSVVNGRFSLTIGGDVKKVDGLSFARLADFNAVATKIQEKLTTLSVAVSI 163 (502) Q Consensus 84 igr~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~iti~g~~~~~~~i~~s~~ts~~~vA~~i~aal~~a~~~~tv 163 (502) ||||+++.... .+..+.++++|+|+++++|+.++..+++++.+++++++|+.+++++.... + T Consensus 76 igr~~~~~t~~--------------~~~~~~~~~~g~lt~tv~G~~~~~~~i~~s~a~s~~~va~~~~tai~~~~----~ 137 (450) T protein:vir:95 76 IGRRAMQYTVS--------------IPDAVTESTDYSITVAAGGGISQPYQYTAQSSDTAENVLQQFKTQIEADP----T 137 (450) T ss_pred EEeeccchhhh--------------hhhhhccccceeEEEEecceeeeeeEEEEEecCChhhHHHHhhhhhcccc----e Confidence 99999875432 23345677899999999999999999999999999999999999997642 3 Q ss_pred EEecccceeeEeeecccccccceeeeeeccccchhhhhhhhhhcccccceeeeeccccccccCHHHHHHHHHhccCceeE Q lcl|NC_013597. 164 AYDETGNRFIVSANVAGEDKKTEIDYAIDEGGEGEYIGALLKLENGQASRKVGKNSVSLKKETLGEALFNVAEVNNTWYG 243 (502) Q Consensus 164 ~~~~~~~~f~~~s~ttG~~~~v~~~~a~~~~~t~t~~aa~l~~t~~~~~~~v~v~~~~~~~et~~~al~al~~~~~~w~~ 243 (502) .++ .|.+.+. |....+++..+... .+..++++...... ...+..+|++.++|+++.+.+++||+ T Consensus 138 ~~~----~~~~~s~--g~~~~~t~~~~~~~------~~~~~~l~~~~~~~----~~~g~~aet~~~a~~a~~~~~~~w~~ 201 (450) T protein:vir:95 138 IKD----KVSVNVT--GSNGSATMIIAKAG------DNDFVKVTTTAQTV----YIASTTADTASTALAAIEAYSTDWYF 201 (450) T ss_pred eee----eeeeeee--cccceeeeeeeccc------cchhhcccccccee----EecccccccHHHHHHHHHHhhCCeEE Confidence 333 3444433 33333333333211 12344444433332 24567889999999999999999998 Q ss_pred EEEecCCChhHHHHHHHHHhhcCCEEEEEecCchhcc----cchhHHHHHHHHccCCceEEEecCC--ccchHHHHHHHH Q lcl|NC_013597. 244 FTVAAQLTDSEVEAAAKYAQANTKLFGANVIRAEQIE----WSADNIYKKLYDAGLDHTLAMFDKN--DMYPVSSALARL 317 (502) Q Consensus 244 ~~~~~~~~~~~~~a~a~w~~a~~~~~~~~~~~~~~~~----~~~~~i~~~l~~~~~~~t~~~y~~~--~~~~~aa~~g~~ 317 (502) |.+. +.++++++++|+|+++|+|+|++++++.+.+. ...++++++|+.++|+||+++||++ .+|++++++|++ T Consensus 202 ~~~~-~~~~~~i~a~a~w~~a~~~~f~~~~~~~~~~~~~~~~~~~~i~~~l~~~~~~~t~~~y~~~~~~~~~~aa~~g~~ 280 (450) T protein:vir:95 202 IAAE-DRTQQFVLAMASEIQARKKIFFTANSDVTALQGTELASANDVPAQLAKNMYTRTVCLWHHAAAEDYPEMAYIAYG 280 (450) T ss_pred EEec-CCCHHHHHHHHHHHhhcCcEEEEEcCCchhhhhhhhhcccchHHHHHhccCCeeEEEeeCCCchhHHHHHHHHHh Confidence 8765 47899999999999999999999999988764 3578899999999999999999976 457888888776 Q ss_pred HhcCCCCCCceeeEeeeecCccccC-------CCCHHHHHHHHhCCceEEEEEcCceEEecCEeecCeehhHHHHHHHHH Q lcl|NC_013597. 318 LSTNFAANNSTLTLKFKQQPTITAD-------EITATEFAKAKRLGINVYTYFDDVAMIAEGTVIGGKFADEIVILDWFV 390 (502) Q Consensus 318 as~n~~~~~g~~T~~fk~~~Gv~~~-------~lt~t~~~~l~~~~~n~y~~~~~~~~~~~G~~~~G~~iD~~~~~dwl~ 390 (502) ++ ..+|++|||||+++||+++ +|+.+|+++|+++|||||+++++.+++++|++++|+|||++||+|||+ T Consensus 281 ~~----~~~g~~T~~fk~l~Gv~~~v~~~~~~~lt~~~~~al~~~~~n~y~~~~~~~~~~~G~~~~G~~iD~~~~~~wl~ 356 (450) T protein:vir:95 281 AP----YDAGSIAWGNAQLTGVAASLQPSNQRPLTSIQKSALDVRHCNFIDLDGGVPVVRRGITSGGEWIDIIRGVDWLE 356 (450) T ss_pred hh----cccceeeeccccccceeeeccCccccccchHHHHHHHhCCcEEEEEecCceeeeCCeeeCcchhHHHHHHHHHH Confidence 54 6789999999999999996 589999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHhcC-CCCccCHHHHHHHHHHHHHHHHHHHHcCccccccccCccccccccccccccceEEEcCchhcCC Q lcl|NC_013597. 391 DAVQKEVFARLYKSP-TKIPLTDKGQAILIAAVEKVCLEGINNGAFAPGKWTGAGFGNLSTGDYLDKGFYVWAAPMDTLS 469 (502) Q Consensus 391 ~~iq~~l~~~l~~~~-~kiPyt~~G~~~l~~~v~~vl~~a~~~G~I~~g~~~~~~~g~~~~~~~~~~gy~v~~~~~~~~s 469 (502) ++||++|+++|++++ +|||||+.|+++|+++|+++|+|+++||+|+ ||+|+.|++++++ T Consensus 357 ~~iq~~l~~ll~~~~~~KiPy~~~G~~~i~a~i~~~l~~a~~~G~Ia--------------------~~~V~~~~~~~~~ 416 (450) T protein:vir:95 357 SDLKTSLRDLLINQKGGKITYDDTGITRIRQVIETSLQRAVNRNFLS--------------------SYTVNVPKASQVA 416 (450) T ss_pred HHHHHHHHHHHHhcCCCCCccChhhHHHHHHHHHHHHHHHHhcCccc--------------------ceeEecCChHhcC Confidence 999999999999875 6999999999999999999999999999995 5999999999999 Q ss_pred HHHHhhcccCceEEEEEECceEEEEEEEEEEeC Q lcl|NC_013597. 470 DSDRQARRATPIQTAVKLAGAIHSSDVIVNYNR 502 (502) Q Consensus 470 ~~dra~R~~~~i~~~~~~aGaIh~v~i~~~v~~ 502 (502) ++||++|++|+++|+|+|+||||.++|+|+|-- T Consensus 417 ~~dr~~R~~~~i~~~~~laGAIh~~~i~~~v~~ 449 (450) T protein:vir:95 417 LADKKARILKDVTFAGILAGAILDVDLKGTVAY 449 (450) T ss_pred HHHHhccCCCCeeEEEEEccceEEEEEEEEEEe Confidence 999999999999999999999999999999988 No 11 >protein:vir:80052 Length: 331 # NCBI annotation: gp14 # Family: family:all:396 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468718;genbank:gi:157325298;genbank:GeneID:5601743 Probab=100.00 E-value=1.7e-93 Score=529.17 Aligned_cols=327 Identities=20% Similarity=0.266 Sum_probs=284.3 Q ss_pred CcCceeEEeeccccc-ccccccccceEEEecccccccccCccceEEecCHHHHHhhcCCCcHHHHHHHHHhcCCCCcceE Q lcl|NC_013597. 4 SISHIVNVQLNTVPK-SAARKSFGIVALFTPEAGQAFADEKTRYVYVENQRDVEQLFGTNSETAKAAQPFFAQSPRAKQL 82 (502) Q Consensus 4 p~s~iV~V~i~~~~~-~~~~~~f~~~lil~~~~~~~~~~~~~r~~~y~s~~~v~~~fg~~s~ey~aA~~~F~q~p~P~~l 82 (502) =+||||+|+|.+... +.++.+||.+++|...+ .+|+|.|+++++|+.|||.++++||+|.++|+|.|+|.++ T Consensus 1 ~~~~iv~V~v~~~~~~~~~~~~~~~~~~~~~~t-------~~~~~~y~s~~~v~~d~~~~~~~Ykaa~~~f~Q~~~~~~i 73 (331) T protein:vir:80 1 MVETITDVRVHISVLYPSPRIGLGRPAIFVKGT-------AMGYKEYTTLEELKDTFADNTEVYAKAKAVFLQKDRPDTV 73 (331) T ss_pred CccceecceeeecccccccccccCcceeEEecc-------ccceEEEechhhhccCCCCCcHHHHHHHHHHhccCccceE Confidence 579999999999844 34555666666665443 3689999999999999999999999999999999999999 Q ss_pred EEEEeecccccceeeeeeccchhhhHHHHHhhcccceeEEEEecCccccccccccccccchhhHHHHHHhhhccccccee Q lcl|NC_013597. 83 IVARWQKSASTIEATKNTLSGATLSDDLERFKSVVNGRFSLTIGGDVKKVDGLSFARLADFNAVATKIQEKLTTLSVAVS 162 (502) Q Consensus 83 ~igr~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~iti~g~~~~~~~i~~s~~ts~~~vA~~i~aal~~a~~~~t 162 (502) ++++|..+. T Consensus 74 ~v~~~~~~~----------------------------------------------------------------------- 82 (331) T protein:vir:80 74 AVITYEDTK----------------------------------------------------------------------- 82 (331) T ss_pred EEeccchHH----------------------------------------------------------------------- Confidence 998653210 Q ss_pred EEEecccceeeEeeecccccccceeeeeeccccchhhhhhhhhhcccccceeeeeccccccccCHHHHHHHHH-hccCce Q lcl|NC_013597. 163 IAYDETGNRFIVSANVAGEDKKTEIDYAIDEGGEGEYIGALLKLENGQASRKVGKNSVSLKKETLGEALFNVA-EVNNTW 241 (502) Q Consensus 163 v~~~~~~~~f~~~s~ttG~~~~v~~~~a~~~~~t~t~~aa~l~~t~~~~~~~v~v~~~~~~~et~~~al~al~-~~~~~w 241 (502) .+.++. ..+++| T Consensus 83 -------------------------------------------------------------------~~~a~~a~~~~~w 95 (331) T protein:vir:80 83 -------------------------------------------------------------------LLEAAEAYFLKSW 95 (331) T ss_pred -------------------------------------------------------------------HHHHHHHhccCce Confidence 001111 124678 Q ss_pred eEEEEecCCChhHHHHHHHHHhhcCCEEEEEecCchhcccchhHHHHHHHHccCCceEEEecC-CccchHHHHHHHHHhc Q lcl|NC_013597. 242 YGFTVAAQLTDSEVEAAAKYAQANTKLFGANVIRAEQIEWSADNIYKKLYDAGLDHTLAMFDK-NDMYPVSSALARLLST 320 (502) Q Consensus 242 ~~~~~~~~~~~~~~~a~a~w~~a~~~~~~~~~~~~~~~~~~~~~i~~~l~~~~~~~t~~~y~~-~~~~~~aa~~g~~as~ 320 (502) |+++ +.++++++++++|+|+|+++++|++..++ +.+..++..++.|++++||+ +++|++++++|+++++ T Consensus 96 ~~~~-~~~~~~~~~~a~a~~~~a~~~~f~~~~~~---------~~~~~~~~~~~~~t~~~~~~~~~~~~~aa~~g~~~~~ 165 (331) T protein:vir:80 96 HFAL-LAEFKAADALALSNLIEEQKFKFAVFQVT---------AVADITPLAKNTRTIAIVHSKTGEKLDAALIGNVASL 165 (331) T ss_pred eEEE-eecCCHHHHHHHHHHHhhCCcEEEEEecC---------chHHHHHhhccccEEEEEcCCccchhHHHHHHHHHhc Confidence 8554 44578999999999999999999887543 33455666778899988885 5789999999999998 Q ss_pred CCCCCCceeeEeeee-cCccccCCCCHHHHHHHHhCCceEEEEEcCceEEecCEeecCeehhHHHHHHHHHHHHHHHHHH Q lcl|NC_013597. 321 NFAANNSTLTLKFKQ-QPTITADEITATEFAKAKRLGINVYTYFDDVAMIAEGTVIGGKFADEIVILDWFVDAVQKEVFA 399 (502) Q Consensus 321 n~~~~~g~~T~~fk~-~~Gv~~~~lt~t~~~~l~~~~~n~y~~~~~~~~~~~G~~~~G~~iD~~~~~dwl~~~iq~~l~~ 399 (502) +| |++|||||+ |+||+|++++.+|+++|+++|||||++++|.+++++|++++|+|||++||+|||+++||++|++ T Consensus 166 ~~----g~~t~~fk~~l~GV~~~~lt~t~~~al~~~~~N~y~~~~~~~~~~~G~~~~G~~iD~~~~~dWl~~~lq~~l~~ 241 (331) T protein:vir:80 166 PV----GSATWKGRHGLAGITSEELKVSEIDAIQKAGGMCYIEKAGIAQTSEGKTVSGEFIDSIHGDDWIKATIETRLQK 241 (331) T ss_pred Cc----cceeeeeecccCCCCCCCCCHHHHHHHHhcCceEEEEecCeeEEecceEeCchhHHHHHHHHHHHHHHHHHHHH Confidence 77 789999997 8999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHhcCCCCccCHHHHHHHHHHHHHHHHHHHHcCccccccccCccccccccccccccceEEEcCchhcCCHHHHhhcccC Q lcl|NC_013597. 400 RLYKSPTKIPLTDKGQAILIAAVEKVCLEGINNGAFAPGKWTGAGFGNLSTGDYLDKGFYVWAAPMDTLSDSDRQARRAT 479 (502) Q Consensus 400 ~l~~~~~kiPyt~~G~~~l~~~v~~vl~~a~~~G~I~~g~~~~~~~g~~~~~~~~~~gy~v~~~~~~~~s~~dra~R~~~ 479 (502) +|.+ ++|||||+.|+++|+++|+++|+++++||+|+||+|+++ +||+|+.|++++++++||++|++| T Consensus 242 ll~~-~~kiPy~~~G~~~l~a~i~~~~~~av~~G~I~~g~~~~~------------~~~~v~~~~~~~~s~~dr~~R~~~ 308 (331) T protein:vir:80 242 LLTE-TDKLTFDARGIALLQSELTTVLNEGFANGIIDSNDETGE------------PNFSITALQRSDLNDDDIAKRNYK 308 (331) T ss_pred HHHh-CCCCccChhhHHHHHHHHHHHHHHHHhCCceecCccCCC------------cceEEEeCchhcCCHHHHhccCCC Confidence 8876 479999999999999999999999999999999999875 379999999999999999999999 Q ss_pred ceEEEEEECceEEEEEEEEEEeC Q lcl|NC_013597. 480 PIQTAVKLAGAIHSSDVIVNYNR 502 (502) Q Consensus 480 ~i~~~~~~aGaIh~v~i~~~v~~ 502 (502) +++|+|+++|+||+|+|+|||+= T Consensus 309 ~~~~~~~~~gaI~~v~i~~~v~~ 331 (331) T protein:vir:80 309 GLSFRYKRSGAIHSVDVYGEVEV 331 (331) T ss_pred CeEEEEEEcceEEEEEEEEEEeC Confidence 99999999999999999999999 No 12 >protein:vir:3165 Length: 426 # NCBI annotation: capsid protein CP67 # Family: family:all:28419 # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665936;genbank:gi:22091122;genbank:GeneID:951267 Probab=100.00 E-value=6.9e-74 Score=421.70 Aligned_cols=411 Identities=14% Similarity=0.034 Sum_probs=288.1 Q ss_pred CCcCcCceeEEeecccccccccccccceEEEecccccccccCccceEEecCHHHHHhhcCCCcHHHHHHHHHhcCCCCcc Q lcl|NC_013597. 1 MALSISHIVNVQLNTVPKSAARKSFGIVALFTPEAGQAFADEKTRYVYVENQRDVEQLFGTNSETAKAAQPFFAQSPRAK 80 (502) Q Consensus 1 Msip~s~iV~V~i~~~~~~~~~~~f~~~lil~~~~~~~~~~~~~r~~~y~s~~~v~~~fg~~s~ey~aA~~~F~q~p~P~ 80 (502) |. .+||||+|+++++++.+++||.+||||.|+..++...-+|+|.|+++++|++|||.+||+||||.+||+|.++ T Consensus 1 m~---~~iVnV~Is~~t~A~~~~~Fg~~liigs~~~~~p~~~f~~~~~Yss~~~V~~Dfg~~s~~Y~AA~~~f~Q~~~-- 75 (426) T protein:vir:31 1 MP---KQIVEIELTAEIADRPQETFTDAAIVGTAEEEPPDAEFGEVNQYSTSTSVGDDYGEDSDVYTASEAIEEMGAE-- 75 (426) T ss_pred CC---cceEEEEeecccccccccccceeeeeeeccccccccccchhhhhhhHHHHHhcCCCChHHHHHHHHHHhCCce-- Confidence 77 7999999999999999999999999999987655332357777999999999999999999999999999855 Q ss_pred eEEEEEeecccccceeeeeeccchhhhHHHHHhhcccceeEEEEecCccccccccccccccchhhHHHHHHhhhcccccc Q lcl|NC_013597. 81 QLIVARWQKSASTIEATKNTLSGATLSDDLERFKSVVNGRFSLTIGGDVKKVDGLSFARLADFNAVATKIQEKLTTLSVA 160 (502) Q Consensus 81 ~l~igr~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~iti~g~~~~~~~i~~s~~ts~~~vA~~i~aal~~a~~~ 160 (502) +||+..-..+ .+. . +. -.++.+++| ++++......+++..+..++...-. T Consensus 76 ---~~r~~v~~at------~~~-----~------~~--~t~~~tv~g-------~~~s~~a~~~~~a~~i~~~~~~~~~- 125 (426) T protein:vir:31 76 ---QWRVMVLEAT------EVT-----E------EE--LSDGDTIDK-------VPILGNHEVESPDGDIEFTTDDDPD- 125 (426) T ss_pred ---eEEeeccccc------eee-----e------cc--CCcceeecc-------eeeeecccCcchHHHHHHhhccccc- Confidence 5555322111 000 0 11 123345554 4445555667777777777755321 Q ss_pred eeEEEecccceeeEeeecccccccceeeeeeccccchhhhhhhhhhcccccceeeeeccccccccCHHHHHHHHHhccCc Q lcl|NC_013597. 161 VSIAYDETGNRFIVSANVAGEDKKTEIDYAIDEGGEGEYIGALLKLENGQASRKVGKNSVSLKKETLGEALFNVAEVNNT 240 (502) Q Consensus 161 ~tv~~~~~~~~f~~~s~ttG~~~~v~~~~a~~~~~t~t~~aa~l~~t~~~~~~~v~v~~~~~~~et~~~al~al~~~~~~ 240 (502) ..+. ...+..++.....+++...+...-. ..+.. .-.+...++-.+....... T Consensus 126 ---~~~~---~~~~~~~t~~g~~t~~~~~~~~~~s-~~dw~--------------------~~~~~~s~~~~~~ia~~~~ 178 (426) T protein:vir:31 126 ---VEDF---DAEIVINSATGDVATSEDSIELTYF-HADWS--------------------QLDEFPSDVNNFAVADRRF 178 (426) T ss_pred ---cccc---eeeeEeccccceeeccccceeeeec-cCcch--------------------hhhcccccchhhhhhcccc Confidence 1111 1111111111111111111100000 00000 0001111111122333445 Q ss_pred eeEEEEecCCChhHHHHHHHHHhhcCCEEEEEecCchhcccchhHHHHHHHHccCCceE--EEecC-CccchHHHHHHHH Q lcl|NC_013597. 241 WYGFTVAAQLTDSEVEAAAKYAQANTKLFGANVIRAEQIEWSADNIYKKLYDAGLDHTL--AMFDK-NDMYPVSSALARL 317 (502) Q Consensus 241 w~~~~~~~~~~~~~~~a~a~w~~a~~~~~~~~~~~~~~~~~~~~~i~~~l~~~~~~~t~--~~y~~-~~~~~~aa~~g~~ 317 (502) ||.. ..+..++..|.+++.++++.+..+.+......++++.+++.++|.+.. +.|.. .++...+..++.+ T Consensus 179 ~~~~-------~~~~~~~~~wa~~~~i~~va~~~e~~~~~~~~~~~a~~~~~~~y~p~~~~~~~~~~~~~~~~~~~~~~~ 251 (426) T protein:vir:31 179 DLKG-------VGVLDETHSWASDEDMGMIANGVNVDDYDSVDEAMDVAHEVAGYVPSGDLMMIVDASDDDLAAYQLGKF 251 (426) T ss_pred chhh-------hhhhHhhhhhhhhcceeeeeeccchhhhcchhhhhhhhhcccccccchhheeehhccccchhhHHhhhh Confidence 5522 223446789999999999998888777777778999999999996554 44433 2455678889988 Q ss_pred HhcCC-------CCCCceeeEeeeecCccccCCCCHHHHHHHHhCCceEEEEEcCc-----eEEecCEeecCeehhHHHH Q lcl|NC_013597. 318 LSTNF-------AANNSTLTLKFKQQPTITADEITATEFAKAKRLGINVYTYFDDV-----AMIAEGTVIGGKFADEIVI 385 (502) Q Consensus 318 as~n~-------~~~~g~~T~~fk~~~Gv~~~~lt~t~~~~l~~~~~n~y~~~~~~-----~~~~~G~~~~G~~iD~~~~ 385 (502) ++.++ +..++...++|++.+|+... +..+++..+ ++++|+|..+.+. .++++|++++|+|||++|| T Consensus 252 aa~~~~~~~~~~~~~~~~~~~~~~~~~gv~~t-~~~~~~A~~-~~~~n~~~~~~~~~~i~~~~~~~G~~~~G~~iD~~~g 329 (426) T protein:vir:31 252 AVSEPWYNPLWNELPAGETVSKNVGDPEEQGT-FEGGDEAEG-EGPVNVLIDVSDANRVSNAVTTAGADSDTSFFDIRRT 329 (426) T ss_pred hhhccccchhhhhccccccceeeccccccccc-cchhhhhhh-cCCceEEEEecCceeeecceeecccccchhhhhhHHH Confidence 88875 34456677788889988843 333344444 5889999999875 4577899999999999999 Q ss_pred HHHHHHHHHHHHHHHHHhcCCCCccCHHHHHHHHHHHHHHHHHHHHcCccccccccCccccccccccccccceEEEcCch Q lcl|NC_013597. 386 LDWFVDAVQKEVFARLYKSPTKIPLTDKGQAILIAAVEKVCLEGINNGAFAPGKWTGAGFGNLSTGDYLDKGFYVWAAPM 465 (502) Q Consensus 386 ~dwl~~~iq~~l~~~l~~~~~kiPyt~~G~~~l~~~v~~vl~~a~~~G~I~~g~~~~~~~g~~~~~~~~~~gy~v~~~~~ 465 (502) +|||+++||++|+++|.+ ++|||||+.|+++|++.|+.+|+++++.|. ...++|+|..|++ T Consensus 330 ~dwl~~~iq~~l~~ll~~-~~KIpyt~~Gi~~I~~~i~~~L~~~v~~~g------------------~~~~~y~v~~P~~ 390 (426) T protein:vir:31 330 KVYTAEMLELDLESLQVS-DDDVPFTEDGQAMIEDAIKGTMSGLTGSVG------------------QPLAEYEVDVPEW 390 (426) T ss_pred HHHHHHHHHHHHHHHhhc-CCCCccchhHHHHHHHHHHHHHHHHhcCCC------------------ccccceeecCCCc Confidence 999999999999998866 579999999999999999999999987542 3455799999998 Q ss_pred hcCCHHHHhhcccCceEEEEEECceEEEEEEEEEEeC Q lcl|NC_013597. 466 DTLSDSDRQARRATPIQTAVKLAGAIHSSDVIVNYNR 502 (502) Q Consensus 466 ~~~s~~dra~R~~~~i~~~~~~aGaIh~v~i~~~v~~ 502 (502) ++++ +||++|++++|+|.|+|+||||.++|+|+|.= T Consensus 391 ~~~~-~dra~R~~~~i~~~~~laGAIh~v~I~g~v~v 426 (426) T protein:vir:31 391 DDDD-VDRVNRNWGGIDLDARLAQRAHTFSLGLNVSV 426 (426) T ss_pred cccc-hhhhhhccCCceEEEEEeCcEEEEEEEEEEeC Confidence 8865 69999999999999999999999999999988 No 13 >protein:vir:4517 Length: 498 # NCBI annotation: tail sheath protein # Family: family:all:369 # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599043;genbank:gi:19549001;genbank:GeneID:935236 Probab=99.57 E-value=3.7e-13 Score=88.77 Aligned_cols=444 Identities=12% Similarity=0.096 Sum_probs=240.0 Q ss_pred CCcCcCce----------eEEeecccccccccccccceEEEecccccccccCccceEEecCHHHHHhhcCCCcHHHHHHH Q lcl|NC_013597. 1 MALSISHI----------VNVQLNTVPKSAARKSFGIVALFTPEAGQAFADEKTRYVYVENQRDVEQLFGTNSETAKAAQ 70 (502) Q Consensus 1 Msip~s~i----------V~V~i~~~~~~~~~~~f~~~lil~~~~~~~~~~~~~r~~~y~s~~~v~~~fg~~s~ey~aA~ 70 (502) |+|+.++| +.++-+... ...+---.||+|......-..+...++.+ |.++..+.||..|-...|++ T Consensus 1 M~IsF~~IP~~iRvP~~y~E~dns~A~---~~~~~q~vLiiGq~la~gs~~~~~~v~v~-s~~~a~~lfG~GSml~~M~~ 76 (498) T protein:vir:45 1 MTISFNTIPSNTLVPLFYAEMDNQAAN---TAQDSGASLLIGHANNGAEIVANSLVLMP-SADYARQICGAGSQLARMVE 76 (498) T ss_pred CCCchhhcCcccccCeEEEEEeCCCCC---CCCCCcceEEEEecCCccccccceeEEec-CHHHHHHhcCcCcHHHHHHH Confidence 99988876 233333332 22233358899887554344455677776 67889999999999999999 Q ss_pred HHhcCCCCcceEE-EEEeecccccceeeeeeccchhhhHHHHHhhcccceeEEEEecCccccccccccccccchhhHHHH Q lcl|NC_013597. 71 PFFAQSPRAKQLI-VARWQKSASTIEATKNTLSGATLSDDLERFKSVVNGRFSLTIGGDVKKVDGLSFARLADFNAVATK 149 (502) Q Consensus 71 ~~F~q~p~P~~l~-igr~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~iti~g~~~~~~~i~~s~~ts~~~vA~~ 149 (502) .|..--|-- .|+ |+- .. +....+++..-..++ +-..|.+.+.|+|....+. +....+.+.+|+. T Consensus 77 a~~~~n~~~-~l~~i~~-~d-~aG~aA~g~it~tg~---------at~~G~l~l~Igg~~v~v~---V~~gdTaa~vA~a 141 (498) T protein:vir:45 77 AYRQTDPFG-ELYVIAV-PE-ATGAAATVTLTVTGE---------ATESGTVNVYVGRTRVQAP---VTNGDNVTTIASS 141 (498) T ss_pred HHHHhCCcc-eEEEEee-CC-cccceeEEEEEeecc---------cCCCcEEEEEECCEEEEEE---ecCCCCHHHHHHH Confidence 999876643 343 332 22 222333332211222 2257999999999887753 4566678889999 Q ss_pred HHhhhcccccceeEEEecccceeeEeeeccccccc-ceeeeeeccccchhhhhhhhhhcccccceeeee--ccccccccC Q lcl|NC_013597. 150 IQEKLTTLSVAVSIAYDETGNRFIVSANVAGEDKK-TEIDYAIDEGGEGEYIGALLKLENGQASRKVGK--NSVSLKKET 226 (502) Q Consensus 150 i~aal~~a~~~~tv~~~~~~~~f~~~s~ttG~~~~-v~~~~a~~~~~t~t~~aa~l~~t~~~~~~~v~v--~~~~~~~et 226 (502) +++++.+. ....|+-........++.+-.|.... +.+....-....++.. +.-..+.+ -+.|....+ T Consensus 142 l~aaina~-~~lPVTA~~~~~~VtlTAr~kG~~GN~I~l~~~~~~~~~ge~~---------p~Glt~~itamagGag~PD 211 (498) T protein:vir:45 142 IQDAINAV-PTLPFTASSSAGVVTLTARHKGLCGNEIPVSLNYYGFGGGEVL---------PAGVQIAVATGTAGTGAPV 211 (498) T ss_pred HHHHHhCC-CCCceEEEecCceEEEEeeccCccccceeEEEeeccccccccc---------cceeeEEEEccCCCccCch Confidence 99998763 33444444555666666666554321 1111100000000000 11111111 123444557 Q ss_pred HHHHHHHHHhccCceeEEEEecCCChhHHHHHHHHHhh-------cCCEEEEEecCchhcccchhHHHHHHHHccCCceE Q lcl|NC_013597. 227 LGEALFNVAEVNNTWYGFTVAAQLTDSEVEAAAKYAQA-------NTKLFGANVIRAEQIEWSADNIYKKLYDAGLDHTL 299 (502) Q Consensus 227 ~~~al~al~~~~~~w~~~~~~~~~~~~~~~a~a~w~~a-------~~~~~~~~~~~~~~~~~~~~~i~~~l~~~~~~~t~ 299 (502) +..+|+++.+.. |-+++..-.+.+.+.++.++++. -+++++....- ..++-......-...|..|.. T Consensus 212 ~a~alaal~~~~---~~~I~~p~~D~asL~al~~~L~~~sgRw~~~~q~~g~~~~a---~~gT~~~l~t~g~~~N~~~it 285 (498) T protein:vir:45 212 LTGAVAAMADEP---FDYIGLPFNDTASVNTLVTEMNDTSGRWSYARQLYGHVYTA---KTGTLSELVNAGDQFNQQHIT 285 (498) T ss_pred hHHHHHHhccCC---ccEEEEeeCCHHHHHHHHHHHhhhhhhhhHHhhcCeEEEEe---ccCCHHHHHHhhhccCCceEE Confidence 788887776554 44433333456677778777764 22334332211 112223333344455677776 Q ss_pred EEec-CCcc----chHHHHHHHHH---hcCCCCCCceeeEeeeecCccccC----CCCHHHHHHHHhCCceEEEEEcCce Q lcl|NC_013597. 300 AMFD-KNDM----YPVSSALARLL---STNFAANNSTLTLKFKQQPTITAD----EITATEFAKAKRLGINVYTYFDDVA 367 (502) Q Consensus 300 ~~y~-~~~~----~~~aa~~g~~a---s~n~~~~~g~~T~~fk~~~Gv~~~----~lt~t~~~~l~~~~~n~y~~~~~~~ 367 (502) ++.+ ...+ -.+|++.|+++ ..|+.+.-. --.|+||.|. .++.+|.+.|..+|+..+..-.|.- T Consensus 286 ~~~~~~~~~sp~~~~AAa~aa~~A~~l~~DPArPL~-----tl~L~Gi~~p~~~~r~~~~ern~LL~~Gist~~V~~G~V 360 (498) T protein:vir:45 286 LAGYEKETQTPADELAASRTARAAVFIRNDPARPTQ-----TGELVGMLPAPKGKRFTMTEQQTLLSHGVATAYVESGVL 360 (498) T ss_pred EEecCCCCCChHHHHHHHHHHHHHHHhhcccccccC-----ceeecceecCCchhcCChHHHHHHHhCCcceEEEcCCeE Confidence 6643 3322 23455566665 456644333 2357788755 3689999999999999886656765 Q ss_pred EEecCEe-----ecC----eehh--HHHHHHHHHHHHHHHHHHHHHhcCCCCccCHHH---------HHHHHHHHHHHHH Q lcl|NC_013597. 368 MIAEGTV-----IGG----KFAD--EIVILDWFVDAVQKEVFARLYKSPTKIPLTDKG---------QAILIAAVEKVCL 427 (502) Q Consensus 368 ~~~~G~~-----~~G----~~iD--~~~~~dwl~~~iq~~l~~~l~~~~~kiPyt~~G---------~~~l~~~v~~vl~ 427 (502) .+.+..+ ..| .|.| .++-.+++...++..+-.+ |-+ .|+--+... -..|++.+-.+++ T Consensus 361 ~I~R~ITTY~~n~~G~~D~syLDi~T~~tl~yvr~~~r~~i~~k-fpR-~KLa~dg~~~~~gq~IvTp~~ir~ell~~y~ 438 (498) T protein:vir:45 361 RIQRDVTTYRKNAYGVADNSYLDSETLHTSAYVLRKLKSVITSK-YGR-HKLASDGTRFGPGQAIVTPAVIKGELLATYR 438 (498) T ss_pred EEEeeeeeeeecCCCCcchhhhhhhhHHHHHHHHHHHHHHhhhh-cCC-eeecccCcccCCCCcccchHHHHHHHHHHHH Confidence 5555444 345 3877 7889999999999998544 343 465433222 2578899999999 Q ss_pred HHHHcCccccccccCcccccccccccccc-ceEEEcCchhcCCHHHHhhcccCceEEEEEECce Q lcl|NC_013597. 428 EGINNGAFAPGKWTGAGFGNLSTGDYLDK-GFYVWAAPMDTLSDSDRQARRATPIQTAVKLAGA 490 (502) Q Consensus 428 ~a~~~G~I~~g~~~~~~~g~~~~~~~~~~-gy~v~~~~~~~~s~~dra~R~~~~i~~~~~~aGa 490 (502) +....|++. .... .........|-.++ -..+.+|+ +.-..=|-=-..-.+.+.|.-++| T Consensus 439 ~le~~givE-n~~~-~~~~LiVerd~~dpnRln~~~p~--d~vn~L~V~A~~~~f~lq~~~~~~ 498 (498) T protein:vir:45 439 QLERAGIVE-NYEL-FKQYLVVERDASVPNRLNTLFPP--DYVNQLRVFAVVNQFRLQYSEESA 498 (498) T ss_pred hhhhhcccc-Chhh-hcceeEEEECCCCCcEEEEEecc--cccCchhhhhhhhhhheehhhcCC Confidence 999999985 2110 00000111110000 01112211 000000111111123344444444 No 14 >protein:vir:489 Length: 498 # NCBI annotation: putative sheath protein # Family: family:all:369 # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543096;swissprot:trembl:q8w623;genbank:gi:18249908;uniprot:Q8W623;genbank:GeneID:929698 Probab=99.56 E-value=4.3e-13 Score=88.37 Aligned_cols=444 Identities=14% Similarity=0.099 Sum_probs=238.3 Q ss_pred CCcCcCce----------eEEeecccccccccccccceEEEecccccccccCccceEEecCHHHHHhhcCCCcHHHHHHH Q lcl|NC_013597. 1 MALSISHI----------VNVQLNTVPKSAARKSFGIVALFTPEAGQAFADEKTRYVYVENQRDVEQLFGTNSETAKAAQ 70 (502) Q Consensus 1 Msip~s~i----------V~V~i~~~~~~~~~~~f~~~lil~~~~~~~~~~~~~r~~~y~s~~~v~~~fg~~s~ey~aA~ 70 (502) |+|+.++| +.++-+....+... .-.||+|......-..+...++.+ |.++..+.||..|-...|++ T Consensus 1 M~IsF~~IP~~iRvP~~y~E~dns~A~~~~~~---qrvLiiGq~la~gt~~~~~~v~v~-s~~~a~~~fG~GS~l~~M~~ 76 (498) T protein:vir:48 1 MTISFSAVPSDTLVPLFYAEMDNSAANTAVTS---APALLIGHASNDAAIEVNSLVLMP-SADYARQICGAGSQLARMVD 76 (498) T ss_pred CCccccccCcccccceEEEEEecCCCccccCC---cceEEEeecCccccccccceEEec-CHHHHHHhcCcccHHHHHHH Confidence 99988876 22322222222111 248888887654344455677776 66888999999999999999 Q ss_pred HHhcCCCCcceEEEEEeecccccceeeeeeccchhhhHHHHHhhcccceeEEEEecCccccccccccccccchhhHHHHH Q lcl|NC_013597. 71 PFFAQSPRAKQLIVARWQKSASTIEATKNTLSGATLSDDLERFKSVVNGRFSLTIGGDVKKVDGLSFARLADFNAVATKI 150 (502) Q Consensus 71 ~~F~q~p~P~~l~igr~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~iti~g~~~~~~~i~~s~~ts~~~vA~~i 150 (502) .|...-|--.--.|+- .. +....+++..-..++ +-..|.+.+.|+|....+. +....+.+.+|+.+ T Consensus 77 a~~~~n~~~~l~~i~~-~D-~ag~aA~g~it~tg~---------at~~G~l~l~Igg~~v~v~---V~~gdTaa~vA~al 142 (498) T protein:vir:48 77 VYRQTDPFGELYVIAV-PE-ARGAAATVRVTVTGE---------AEESGTLSLYVGRSSVQVP---VVNGDDATAVATAI 142 (498) T ss_pred HHHHhCCCceeEEEee-CC-cccceeEEEEEeccc---------ccCCceEEEEECCEEEEEe---ecCCCCHHHHHHHH Confidence 9987765433333332 21 222333332211222 2257999999999988753 45566778899999 Q ss_pred HhhhcccccceeEEEecccceeeEeeecccccc-cceeeeeeccccchhhhhhhhhhcccccceeeee--ccccccccCH Q lcl|NC_013597. 151 QEKLTTLSVAVSIAYDETGNRFIVSANVAGEDK-KTEIDYAIDEGGEGEYIGALLKLENGQASRKVGK--NSVSLKKETL 227 (502) Q Consensus 151 ~aal~~a~~~~tv~~~~~~~~f~~~s~ttG~~~-~v~~~~a~~~~~t~t~~aa~l~~t~~~~~~~v~v--~~~~~~~et~ 227 (502) .+++.+. ....|+-...+....++.+-.|... .+.+.........++. .+.-..+.+ -+.|....++ T Consensus 143 ~aai~a~-~~lPVTA~~~~~~VtlTAr~kG~~GN~I~l~~~~~~~~~ge~---------~p~Glt~~itamsgGag~PDi 212 (498) T protein:vir:48 143 KEAVNGV-ITLPFAASSDAGVVTLTARHKGLYGNELPVCLNYYGSGGGEI---------LPAGLQVVTEAGTAGSGAPDL 212 (498) T ss_pred HHHHhCC-CCcceEEEecCcEEEEEeeecccccccceeeeeeccCccccc---------ccceeeEEEEcccCCccCcch Confidence 9888763 2344454445566666666555432 1111110000000011 111111111 1234445567 Q ss_pred HHHHHHHHhccCceeEEEEecCCChhHHHHHHHHHhh-------cCCEEEEEecCchhcccchhHHHHHHHHccCCceEE Q lcl|NC_013597. 228 GEALFNVAEVNNTWYGFTVAAQLTDSEVEAAAKYAQA-------NTKLFGANVIRAEQIEWSADNIYKKLYDAGLDHTLA 300 (502) Q Consensus 228 ~~al~al~~~~~~w~~~~~~~~~~~~~~~a~a~w~~a-------~~~~~~~~~~~~~~~~~~~~~i~~~l~~~~~~~t~~ 300 (502) ..+|+++.+.. |-+++.--.+.+.+.++.++++. -+++++....- ..++-......-...|..|..+ T Consensus 213 a~aLaal~~~~---~~~I~~p~~D~asl~al~~~L~~~sgRw~~~~q~~g~~~~a---~~gT~~~l~t~g~~~N~~~it~ 286 (498) T protein:vir:48 213 TAAVAAMGDEA---FDFIGLPFNDAASINMMMTEMNDSSGRWSYARQLYGHVYTA---KLGTLSELVNAGDMHNQQHITL 286 (498) T ss_pred HHHHHhhccCC---ccEEEEeecCHHHHHHHHHHHhhhhhhhhHHhhcCeEEEEe---ccCCHHHHHHhhhccCCceEEE Confidence 78777776554 44433332456677778777753 22344332211 1122233333444556677766 Q ss_pred Ee-cCCc--c-c-hHHHHHHHHH---hcCCCCCCceeeEeeeecCccccCC----CCHHHHHHHHhCCceEEEEEcCceE Q lcl|NC_013597. 301 MF-DKND--M-Y-PVSSALARLL---STNFAANNSTLTLKFKQQPTITADE----ITATEFAKAKRLGINVYTYFDDVAM 368 (502) Q Consensus 301 ~y-~~~~--~-~-~~aa~~g~~a---s~n~~~~~g~~T~~fk~~~Gv~~~~----lt~t~~~~l~~~~~n~y~~~~~~~~ 368 (502) +. ++.. + + .+|++.++++ ..|+.+.-.+ -.|+||.|.. ++.+|.+.|..+|+..+..-+|.-. T Consensus 287 ~~~~~~~~~p~~~~AAa~a~~aA~~l~~DPArPLqt-----l~L~Gi~~p~~~~r~~~~ern~LL~~Gist~~V~~G~V~ 361 (498) T protein:vir:48 287 AGYEKETQSPVDELVASRLAREAVFIRNDPARPTQT-----GELVGMLPAPKGKRFIMTEQQTLLSHGVATAYVEGGTLR 361 (498) T ss_pred EecCCCCCChHHHHHHHHHHHHHHhhhccccccccc-----eeeeccccCCchhcCChHHHHHHHhcCcceEEEcCCeEE Confidence 54 3322 2 2 3455666655 5566543333 3567887654 5899999999999998876566554 Q ss_pred EecCEe-----ecC----eehh--HHHHHHHHHHHHHHHHHHHHHhcCCCCccCHHH---------HHHHHHHHHHHHHH Q lcl|NC_013597. 369 IAEGTV-----IGG----KFAD--EIVILDWFVDAVQKEVFARLYKSPTKIPLTDKG---------QAILIAAVEKVCLE 428 (502) Q Consensus 369 ~~~G~~-----~~G----~~iD--~~~~~dwl~~~iq~~l~~~l~~~~~kiPyt~~G---------~~~l~~~v~~vl~~ 428 (502) +.+..+ ..| .|.| .++-.+++...++..+-.. |-+ .|+--+..+ -..|++.+-.++++ T Consensus 362 I~R~ITTY~~n~~G~~D~syLDi~T~~tl~yvr~~~r~~i~~k-fpR-~KLa~dg~~~~~gq~IvTp~~ir~eli~~y~~ 439 (498) T protein:vir:48 362 IQRSVTTYKKNAYGVADNSYLDSETLHTSAYVLRKLKSVITSK-YGR-HKLANDGTRFGPGQAIVTPAVIKGELLATYRQ 439 (498) T ss_pred EEeeeeeeeecCCCCcchhhhhhhhHHHHHHHHHHHHHHhhhh-cCC-ceecccCcccCCCCcccchHHHHHHHHHHHHh Confidence 444433 345 3877 7889999999999998544 343 466543222 25788999999999 Q ss_pred HHHcCccccccccCcccccccccccccc-ceEEEcCc-hhcCCHHHHhhcccCceEEEEEECce Q lcl|NC_013597. 429 GINNGAFAPGKWTGAGFGNLSTGDYLDK-GFYVWAAP-MDTLSDSDRQARRATPIQTAVKLAGA 490 (502) Q Consensus 429 a~~~G~I~~g~~~~~~~g~~~~~~~~~~-gy~v~~~~-~~~~s~~dra~R~~~~i~~~~~~aGa 490 (502) ....|++. .... .........|-.++ -..+.+|+ .-++ =|-=-..-.+.+.|.-++| T Consensus 440 le~~give-n~~~-~~~~LiVerd~~dpnRln~~~p~d~vn~---L~V~A~~~~f~lq~~~~~~ 498 (498) T protein:vir:48 440 MERAGIVE-NYDL-FKQYLIVERDADNPNRLNTLFPPDYVNQ---LRVFAVVNQFRLQYSEESA 498 (498) T ss_pred hhhhcccc-Chhh-hcceeEEEECCCCCcEEEEEecccccCc---hhhhhhhhhhhhhhhhcCC Confidence 99999985 2110 00000011110000 01111111 1111 0111111123333444444 No 15 >protein:vir:4463 Length: 498 # NCBI annotation: Tail Sheath protein # Family: family:all:369 # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700386;genbank:gi:23505458;genbank:GeneID:955665 Probab=99.51 E-value=1.5e-12 Score=85.41 Aligned_cols=442 Identities=14% Similarity=0.098 Sum_probs=237.1 Q ss_pred CCcCcCce----------eEEeecccccccccccccceEEEecccccccccCccceEEecCHHHHHhhcCCCcHHHHHHH Q lcl|NC_013597. 1 MALSISHI----------VNVQLNTVPKSAARKSFGIVALFTPEAGQAFADEKTRYVYVENQRDVEQLFGTNSETAKAAQ 70 (502) Q Consensus 1 Msip~s~i----------V~V~i~~~~~~~~~~~f~~~lil~~~~~~~~~~~~~r~~~y~s~~~v~~~fg~~s~ey~aA~ 70 (502) |+|+.++| +.++-+.. ....+---.||+|......-..+...++.+ |.++..+.||..|-...|++ T Consensus 1 M~IsF~~IP~~iRvP~~y~E~dns~A---~~~~~~q~vLiiGq~la~gs~~~~~~v~v~-s~~~a~~~fG~GSml~~M~~ 76 (498) T protein:vir:44 1 MAISFNSIPSDTRVPLFYAEMDNSAA---NTARDSGASLLIGHASNDASIAVNSLVLVS-SVDYARQICGAGSQLARMVG 76 (498) T ss_pred CCCchhhcCcccccCeEEEEEeCCCC---CCCcCCcceEEEEecCcccccccceeEeec-CHHHHHHhcCcccHHHHHHH Confidence 99988876 23322332 233333358899987654444455677776 77889999999999999999 Q ss_pred HHhcCCCCcceEEEEEeecccccceeeeeeccchhhhHHHHHhhcccceeEEEEecCccccccccccccccchhhHHHHH Q lcl|NC_013597. 71 PFFAQSPRAKQLIVARWQKSASTIEATKNTLSGATLSDDLERFKSVVNGRFSLTIGGDVKKVDGLSFARLADFNAVATKI 150 (502) Q Consensus 71 ~~F~q~p~P~~l~igr~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~iti~g~~~~~~~i~~s~~ts~~~vA~~i 150 (502) .|..--|--.--.|+- .. +....+++..-..++ +-..|.+.+.|+|....+. +....+.+.+|+.+ T Consensus 77 a~~~~n~~~~l~~i~~-~D-~aG~aAtg~it~tg~---------at~~G~l~l~Igg~~v~v~---V~~gdTaa~vA~al 142 (498) T protein:vir:44 77 AYRKTDPFGELYVIAV-PE-STGAAATVALTVTGE---------ATETGTVNVYTGRTRVQAP---VTSGDDAAAVAVSI 142 (498) T ss_pred HHHHhCCCceeEEEec-CC-cccceeEEEEEeecc---------cCCCcEEEEEECCEEEEEE---ecCCCCHHHHHHHH Confidence 9998765433333332 22 223333332221222 2257999999999888743 45667788899999 Q ss_pred HhhhcccccceeEEEecccceeeEeeecccccc---cceeeeeeccccchhhhhhhhhhcccccceeeee--cccccccc Q lcl|NC_013597. 151 QEKLTTLSVAVSIAYDETGNRFIVSANVAGEDK---KTEIDYAIDEGGEGEYIGALLKLENGQASRKVGK--NSVSLKKE 225 (502) Q Consensus 151 ~aal~~a~~~~tv~~~~~~~~f~~~s~ttG~~~---~v~~~~a~~~~~t~t~~aa~l~~t~~~~~~~v~v--~~~~~~~e 225 (502) ++++.+. ....|+-........++.+-.|... .+...+-...+ ++.. +.-..+.+ -+.|.... T Consensus 143 ~aaina~-~~lPVTA~~~~~~vtlTAr~kG~~GN~I~l~~~~~~~~~--ge~~---------p~Glt~titamsgGag~P 210 (498) T protein:vir:44 143 KDAVNAN-PDLPFTATSEAGVVTLTARHKGLYGNEIPVTLNYYGFGG--GEVL---------PAGVNITVASGVKGAGAP 210 (498) T ss_pred HHHHhCC-CCCceEEeeccceEEEEEeccCcccCcceEEEeeccCcc--cccc---------ccceeEEEEcccCCccCc Confidence 9988763 2334444444555666666555432 11111100000 0000 11111111 12344445 Q ss_pred CHHHHHHHHHhccCceeEEEEecCCChhHHHHHHHHHhh-------cCCEEEEEecCchhcccchhHHHHHHHHccCCce Q lcl|NC_013597. 226 TLGEALFNVAEVNNTWYGFTVAAQLTDSEVEAAAKYAQA-------NTKLFGANVIRAEQIEWSADNIYKKLYDAGLDHT 298 (502) Q Consensus 226 t~~~al~al~~~~~~w~~~~~~~~~~~~~~~a~a~w~~a-------~~~~~~~~~~~~~~~~~~~~~i~~~l~~~~~~~t 298 (502) ++..+|+++.+...+|. +.--.+.+.+.++.++++. -+.+++..... ..++-......-...|..|. T Consensus 211 Dia~alaal~~~~~~~i---~~p~~D~asl~al~~~L~~~sgRw~~~~q~~g~~~~a---~~gT~a~l~t~g~~~N~~~i 284 (498) T protein:vir:44 211 ALNDAVAAMGDEPFDYI---GLPFNDTASVNSMATEMNDSSGRWSYVRQLYGHVYTA---KTGTLSELVAAGDQFNLQHI 284 (498) T ss_pred hhHHHHHhhccCCccEE---EEeecCHHHHHHHHHHHhhhhcchHHHhhcCeEEEEe---ccCCHHHHHHhhhccCCceE Confidence 67788777766544443 3322356677777777754 22344332211 11222233333344566777 Q ss_pred EEEec-CCc--c-c-hHHHHHHHHH---hcCCCCCCceeeEeeeecCccccCC----CCHHHHHHHHhCCceEEEEEcCc Q lcl|NC_013597. 299 LAMFD-KND--M-Y-PVSSALARLL---STNFAANNSTLTLKFKQQPTITADE----ITATEFAKAKRLGINVYTYFDDV 366 (502) Q Consensus 299 ~~~y~-~~~--~-~-~~aa~~g~~a---s~n~~~~~g~~T~~fk~~~Gv~~~~----lt~t~~~~l~~~~~n~y~~~~~~ 366 (502) .++.+ +.. + + .+|++.++++ ..|+.+.- .--.|+||.|.. ++.+|.+.|..+|+..+..-.|. T Consensus 285 t~~~~~~~~~sp~~~~AAa~a~~aA~~l~~DPArPL-----~tl~L~Gi~~p~~~~r~~~~ern~LL~~Gist~~V~~G~ 359 (498) T protein:vir:44 285 TLAGYEKDTQTPADELAASRTARAAVFIRNDPARPT-----QTGELVDMLPAPKGKRFTTTEQQTLLSHGVATAYVESGV 359 (498) T ss_pred EEEecCCCCCCHHHHHHHHHHHHHHHHhhccccccc-----CceeecccccCCchhcCChHHHHHHHhcCcceEEEcCCe Confidence 66644 322 2 2 3445566665 45664433 333577888653 68999999999999988665676 Q ss_pred eEEecCEe-----ecC----eehh--HHHHHHHHHHHHHHHHHHHHHhcCCCCccCH----HH-----HHHHHHHHHHHH Q lcl|NC_013597. 367 AMIAEGTV-----IGG----KFAD--EIVILDWFVDAVQKEVFARLYKSPTKIPLTD----KG-----QAILIAAVEKVC 426 (502) Q Consensus 367 ~~~~~G~~-----~~G----~~iD--~~~~~dwl~~~iq~~l~~~l~~~~~kiPyt~----~G-----~~~l~~~v~~vl 426 (502) -.+.+..+ ..| .|.| .++-.+++...++..+-.. |-+ .|+-=+. .| -..|++.+-+++ T Consensus 360 V~I~R~ITTY~~n~~G~~D~syLDi~T~~tl~yvr~~~r~~i~~k-fpR-~KLa~d~~~~~~gq~IvTp~~ir~eli~~y 437 (498) T protein:vir:44 360 LRIQRDITTYRKNAYGVADNSYLDSETLHTSAYVLRRLKSVITSK-YGR-HKLANDGTRFGSGQAIVTPAVIRGELGSTY 437 (498) T ss_pred EEEEeeeeeeeecCCCCcchhhhhhhhHHHHHHHHHHHHHHhhhh-cCC-cccccCCcccCCCcccccHHHHHHHHHHHH Confidence 55555444 345 3877 7889999999999998544 343 4543221 12 247889999999 Q ss_pred HHHHHcCccccccccCcccccccccccccc-ceEEEcCc-hhcCCHHHHhhcccCceEEEEEECce Q lcl|NC_013597. 427 LEGINNGAFAPGKWTGAGFGNLSTGDYLDK-GFYVWAAP-MDTLSDSDRQARRATPIQTAVKLAGA 490 (502) Q Consensus 427 ~~a~~~G~I~~g~~~~~~~g~~~~~~~~~~-gy~v~~~~-~~~~s~~dra~R~~~~i~~~~~~aGa 490 (502) ++....|++. .... .........|-.++ -..+.+|+ .-++ =|-=-..-.+.+.|.-++| T Consensus 438 ~~le~~givE-n~~~-~~~~LiVerd~~dpnRln~~~p~d~vn~---L~V~A~~~~f~lq~~~~~~ 498 (498) T protein:vir:44 438 RQMEREGIVE-NFDL-FQQHLIVERNANDSNRLDVLFPPDYVNQ---LRVFAVLNQFRLQYSEEAA 498 (498) T ss_pred Hhhhhhcccc-Chhh-hcceeEEEECCCCCcEEEEEecccccCc---hhhhhhhhhhhhhhhhhcC Confidence 9999999985 2210 00000000010000 01111111 1000 0111111122333333333 No 16 >protein:vir:107865 Length: 477 # NCBI annotation: gp39 # Family: family:all:115 # MgeID: mge:1565 # MgeName: BcepMu # Cross-refs: genbank:acc:YP_024712;genbank:gi:48696949;genbank:GeneID:2845953 Probab=99.27 E-value=1.7e-10 Score=74.09 Aligned_cols=428 Identities=10% Similarity=-0.017 Sum_probs=197.7 Q ss_pred CCcCcCceeEEeecc-cccccccccccceEEEecccccccccCccceEEecCHHHHHhhcC--CCcHHHHHHHHHhcCCC Q lcl|NC_013597. 1 MALSISHIVNVQLNT-VPKSAARKSFGIVALFTPEAGQAFADEKTRYVYVENQRDVEQLFG--TNSETAKAAQPFFAQSP 77 (502) Q Consensus 1 Msip~s~iV~V~i~~-~~~~~~~~~f~~~lil~~~~~~~~~~~~~r~~~y~s~~~v~~~fg--~~s~ey~aA~~~F~q~p 77 (502) |+=-...=|-|.-.- .+.++....-+.+.|+|.....| .+..++..+- .+....+| .....+.|...||.+-. T Consensus 1 M~~~~~pGVyv~E~~~~~~~i~~v~T~v~~~VG~a~~gp---~n~pv~its~-~d~~~~g~~~~~~tL~~Av~~~f~nGg 76 (477) T protein:vir:10 1 MAANYLHGVETIEKETGSRPVKVVKSAVIGLIGTAPIGP---VNTPVQSLSD-VDAAQFGPQLAGFTIPQALDAVYDYGS 76 (477) T ss_pred CcccCCCCeEEEEccCCcccccccCCceeEEEecccCCC---CCcCEEEccH-HHHHHhccCCCCCcHHHHHHHHHhccc Confidence 883233334444322 24566677777888998665433 3345666544 44444333 46788999999999754 Q ss_pred CcceEEEEEeecccccceeeeeeccchhhhHHHHHhhcccceeEEEEecCccccccccccccccchhhHHHHHHhhhccc Q lcl|NC_013597. 78 RAKQLIVARWQKSASTIEATKNTLSGATLSDDLERFKSVVNGRFSLTIGGDVKKVDGLSFARLADFNAVATKIQEKLTTL 157 (502) Q Consensus 78 ~P~~l~igr~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~iti~g~~~~~~~i~~s~~ts~~~vA~~i~aal~~a 157 (502) . .+++-|-...............+...................+......... ................ T Consensus 77 ~--~~~vVrV~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~a~~~~~-------~~~~~~~~~~~~~~~~-- 145 (477) T protein:vir:10 77 G--TVIVINVLDPAVHKSNAANEPVTFDAATGRAKLAHPAAANLVLKNDSGGTTY-------AEGTDYAVDLINGVIT-- 145 (477) T ss_pred e--EEEEEecCccccccccccccccccccccceeccccccccccccccccccccc-------ccchhhhhhhccccce-- Confidence 3 4444444322111110000000000000000000000000000000000000 0000000000000000 Q ss_pred ccceeEEEecccceeeEeeecccccccceeeeeeccccchhhhhhhhhhcccccceeeeeccccccccCHHHHHHHHHhc Q lcl|NC_013597. 158 SVAVSIAYDETGNRFIVSANVAGEDKKTEIDYAIDEGGEGEYIGALLKLENGQASRKVGKNSVSLKKETLGEALFNVAEV 237 (502) Q Consensus 158 ~~~~tv~~~~~~~~f~~~s~ttG~~~~v~~~~a~~~~~t~t~~aa~l~~t~~~~~~~v~v~~~~~~~et~~~al~al~~~ 237 (502) .... ...+.......... ...++........... .......+-.+++...... T Consensus 146 -------------~~~~--~~~~~~~~~~~~~~--~~~~~~~~~~~~~~g~----------~~~~~~~tGl~al~~~~~~ 198 (477) T protein:vir:10 146 -------------RIKT--GTIPPGATAAKATY--DYADPTKVTAADIIGA----------VNAAGMRTGMKALKDTYNL 198 (477) T ss_pred -------------eccc--ccccccceeeeecc--cccccccccccccccc----------ccccchhhhhhhhhhhhhh Confidence 0000 00000000000000 0000000000000000 0000000111111111111 Q ss_pred cC-ceeEEEEecCC-ChhHHHHHHHHHhhcCCEEEEEecCchhcccchhHHHHHHH-------HccCCceEEEec----- Q lcl|NC_013597. 238 NN-TWYGFTVAAQL-TDSEVEAAAKYAQANTKLFGANVIRAEQIEWSADNIYKKLY-------DAGLDHTLAMFD----- 303 (502) Q Consensus 238 ~~-~w~~~~~~~~~-~~~~~~a~a~w~~a~~~~~~~~~~~~~~~~~~~~~i~~~l~-------~~~~~~t~~~y~----- 303 (502) .. -...+..+... ..+-..+|...++.- +.+.+.-...+. . ..++..... ..+.+|..+.|. T Consensus 199 ~~~~~~~l~apg~~~~~~v~~~l~~~~~~~-~~~~~~d~p~~~-~--~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~ 274 (477) T protein:vir:10 199 YGYFSKILIAPAYCTQNSVSVELEAMAVQL-GAIAYIDAPIGT-T--LAQALAGRGPAGTINFNTSSDRVRLCYPHVKVY 274 (477) T ss_pred cchhcccccccccccchhhHHHHHHHHhhC-CEEEEEecCCCC-C--HHHHHhhhhhccccccccccceEEEEcCeEEEe Confidence 00 00111111111 111222233333322 233222111000 0 001000000 011223333321 Q ss_pred -CCc-----cchHHHHHHHHHhcCCCCCCceeeEeeeecCccccC---C-----CCHHHHHHHHhCCceEEEEEcCc-eE Q lcl|NC_013597. 304 -KND-----MYPVSSALARLLSTNFAANNSTLTLKFKQQPTITAD---E-----ITATEFAKAKRLGINVYTYFDDV-AM 368 (502) Q Consensus 304 -~~~-----~~~~aa~~g~~as~n~~~~~g~~T~~fk~~~Gv~~~---~-----lt~t~~~~l~~~~~n~y~~~~~~-~~ 368 (502) +.+ -.+.+.++|.++.+|-.+.+ -.....|.+.||..- . .+++|.+.|.++++|.+.++.+. .. T Consensus 275 d~~~~~~~~~p~s~~~ag~~a~~d~~~g~-~~span~~~~gi~~~~~~~~~~~~~~~~~~~~L~~~gi~~i~~~~~~G~~ 353 (477) T protein:vir:10 275 DTATNAERLEPLSSRAAGLRARVDLDKGY-WWSSSNQQLVGVTGVERPLSAMIDDPQSDVNMLNEQGITTVFSSYGSGLR 353 (477) T ss_pred cccCCceeEEchHHHHHHHHHHhhhcCCc-eeccCCceeccccccccccccccCCChhhHHHHhhCCceEEEEecCCcEE Confidence 111 13457778888887743321 123334555554322 1 35689999999999999999765 46 Q ss_pred EecCEeecC-------eehhHHHHHHHHHHHHHHHHHHHHHhcCCCCccCHHHHHHHHHHHHHHHHHHHHcCcccccccc Q lcl|NC_013597. 369 IAEGTVIGG-------KFADEIVILDWFVDAVQKEVFARLYKSPTKIPLTDKGQAILIAAVEKVCLEGINNGAFAPGKWT 441 (502) Q Consensus 369 ~~~G~~~~G-------~~iD~~~~~dwl~~~iq~~l~~~l~~~~~kiPyt~~G~~~l~~~v~~vl~~a~~~G~I~~g~~~ 441 (502) ++.++++.+ .||-+.+-.+|+...|+..+...++. |.+..-...|+..|+.-|++.++.|.|. T Consensus 354 ~wG~rT~~~~~~~~~~~~~~vrR~~~~i~~~~~~~~~~~v~~-----~~~~~~~~~i~~~i~~~l~~l~~~g~l~----- 423 (477) T protein:vir:10 354 LWGNRTAAWPTVTHMRNFENVRRTGDVINESLRYFSQQFVDA-----PIDQGLIDSLVESVNGFGRKLIGDGALL----- 423 (477) T ss_pred EEcccccCCCCCCcccceeehhhHHHHHHHHHHHHHHHhccC-----CCCHHHHHHHHHHHHHHHHHHHhCCcee----- Confidence 888898854 26778889999999999998665432 5678888999999999999999999985 Q ss_pred CccccccccccccccceEEEcCchhcCCHHHHhhcccCceEEEEEECceEEEEEEEEEEeC Q lcl|NC_013597. 442 GAGFGNLSTGDYLDKGFYVWAAPMDTLSDSDRQARRATPIQTAVKLAGAIHSSDVIVNYNR 502 (502) Q Consensus 442 ~~~~g~~~~~~~~~~gy~v~~~~~~~~s~~dra~R~~~~i~~~~~~aGaIh~v~i~~~v~~ 502 (502) ||++.+. .++.|++|+.+++. .+.+.+.....+++|.+....+. T Consensus 424 ---------------g~~v~~~-~~~nt~~~i~~G~~-~~~i~~~p~~p~e~i~~~~~~~~ 467 (477) T protein:vir:10 424 ---------------GFKAWFD-PARNPKEELAAGHL-LINYKYTVPPPLERLTYETEITS 467 (477) T ss_pred ---------------eeEEEEe-cCCCCHHHhhCCeE-EEEEEEEecCCcceEEEEEEEcc Confidence 4788884 67889999999999 59999999999999999988887 No 17 >protein:vir:79092 Length: 477 # NCBI annotation: gp13, phage tail sheath protein # Family: family:all:115 # MgeID: mge:1862 # MgeName: phiE255 # Cross-refs: genbank:acc:YP_001111213;genbank:gi:134288804;genbank:GeneID:4960766 Probab=99.23 E-value=4.3e-10 Score=71.94 Aligned_cols=423 Identities=12% Similarity=0.049 Sum_probs=202.0 Q ss_pred CCcCcCceeEEeec-ccccccccccccceEEEecccccccccCccceEEecCHHHHHhhcC--CCcHHHHHHHHHhcCCC Q lcl|NC_013597. 1 MALSISHIVNVQLN-TVPKSAARKSFGIVALFTPEAGQAFADEKTRYVYVENQRDVEQLFG--TNSETAKAAQPFFAQSP 77 (502) Q Consensus 1 Msip~s~iV~V~i~-~~~~~~~~~~f~~~lil~~~~~~~~~~~~~r~~~y~s~~~v~~~fg--~~s~ey~aA~~~F~q~p 77 (502) |+=....=|-|.-. -.+.++....-+.+.|+|.....|+ ..+++.. |..+....|| .....+.+...||.+-. T Consensus 1 M~~~~~pGVyv~E~~~g~~~I~~v~Tsv~~~VG~a~~~p~---n~pv~it-s~~d~~~~g~~~~~~tL~~Av~~~f~ngg 76 (477) T protein:vir:79 1 MAANYLHGVETIEKETGSRPVKVVKSAVIGLIGTAPIGPV---NTPVQSL-SDVDAAQFGPQLAGFTIPQALDAVYDYGS 76 (477) T ss_pred CcCCCCCCeEEEEecCCcccccccCCceEEEEeecccCCC---cccEEEc-cHHHHHHhcCCCCCCcHHHHHHHHhhcCC Confidence 88323333444332 2245666777778889987654433 3555554 4444444444 55778899999998744 Q ss_pred CcceEEEEEeecccccceeeeeeccchhhhHH--HHHhhcccceeEEEEecCccccccccccccccchhhH---HHHHHh Q lcl|NC_013597. 78 RAKQLIVARWQKSASTIEATKNTLSGATLSDD--LERFKSVVNGRFSLTIGGDVKKVDGLSFARLADFNAV---ATKIQE 152 (502) Q Consensus 78 ~P~~l~igr~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~g~~~iti~g~~~~~~~i~~s~~ts~~~v---A~~i~a 152 (502) .+++|=|-............ ........ ...........+.+..+....... .........+ ...+.. T Consensus 77 --~~~~vvrV~~~~~~~~~~a~--~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~ 149 (477) T protein:vir:79 77 --GTVIVINVLDPAVHKSNAAS--ESVTFDAATGRAKLAHPAAANLVLKNDSGGTTYT---EGTDYAVDLINGVITRIKT 149 (477) T ss_pred --ceEEEEeccCCccccccccc--cccccccccccccccccccceeEEeecccccccc---cCccccccccchhhhhhhc Confidence 34555453222111111000 00000000 000000011111111111100000 0000000000 000000 Q ss_pred -hhcccccceeEEEecccceeeEeeecccccccceeeeeeccccchhhhhhhhhhcccccceeeeeccccccccCHHHHH Q lcl|NC_013597. 153 -KLTTLSVAVSIAYDETGNRFIVSANVAGEDKKTEIDYAIDEGGEGEYIGALLKLENGQASRKVGKNSVSLKKETLGEAL 231 (502) Q Consensus 153 -al~~a~~~~tv~~~~~~~~f~~~s~ttG~~~~v~~~~a~~~~~t~t~~aa~l~~t~~~~~~~v~v~~~~~~~et~~~al 231 (502) ..........+.++.....-.......|. .......+..+++ T Consensus 150 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~-------------------------------------~~a~~~~tg~~al 192 (477) T protein:vir:79 150 GTIPAAATAAKATYDYADPTKVTAADIIGA-------------------------------------VNAAGMRTGMKAL 192 (477) T ss_pred cccccccceeeceeccCCcccceeeeeccc-------------------------------------ccccccchhhhhh Confidence 00000000000000000000000000000 0000011112222 Q ss_pred HHHHhccCceeEEEEecC--CChhHHHHHHHHHhhcCCEEEEEecCchhcccchhHHHHHHH------HccCCceEEEe- Q lcl|NC_013597. 232 FNVAEVNNTWYGFTVAAQ--LTDSEVEAAAKYAQANTKLFGANVIRAEQIEWSADNIYKKLY------DAGLDHTLAMF- 302 (502) Q Consensus 232 ~al~~~~~~w~~~~~~~~--~~~~~~~a~a~w~~a~~~~~~~~~~~~~~~~~~~~~i~~~l~------~~~~~~t~~~y- 302 (502) .........--.++.... ....-..++...++.. +.+.+.-..... .. .+-+...-. ..+..|..+.| T Consensus 193 ~~~~~~~~~~~~iv~apg~~~~~~v~~~l~~~~~~~-~~~a~~d~p~~~-~~-~~~~~~~~~~~~~~~~~~s~~~~~~~p 269 (477) T protein:vir:79 193 KDTYNLYGYFSKILIAPAYCTQNSVSVELEAMAVQL-GAIAYIDAPIGT-TL-AQALAGRGPAGTINFNTSSDRVRLCYP 269 (477) T ss_pred hhhhhhcccccceeeccccccchhHHHHHHHHHhhc-CeEEEEecCCCC-Ch-HHHhhhhhhccccccccccceEEEEcC Confidence 222211110011111111 1112222333333322 233322111000 00 000000000 01122333332 Q ss_pred -----cCCc-----cchHHHHHHHHHhcCCCCCCceeeEeeeecCcccc---C-----CCCHHHHHHHHhCCceEEEEEc Q lcl|NC_013597. 303 -----DKND-----MYPVSSALARLLSTNFAANNSTLTLKFKQQPTITA---D-----EITATEFAKAKRLGINVYTYFD 364 (502) Q Consensus 303 -----~~~~-----~~~~aa~~g~~as~n~~~~~g~~T~~fk~~~Gv~~---~-----~lt~t~~~~l~~~~~n~y~~~~ 364 (502) ++.+ -.+.+.++|.++.+|-.+.+ -.....|.+.||.. + ..+++|.+.|.++|+|.+.++. T Consensus 270 ~~~~~~~~~~~~~~~p~s~~~ag~~a~~d~~~g~-~~span~~~~gv~~~~~~~~~~~~~~~~~~~~L~~~~i~~i~~~~ 348 (477) T protein:vir:79 270 HVKVYDIATNAERLEPLSSRAAGLRARVDLDKGY-WWSSSNQQLVGVTGVERPLSAMIDDPQSDVNMLNEQGITTVFSSY 348 (477) T ss_pred eeEEecccCCceeeechHHHHHHHHHHhhccCCc-eEccCCceeecceecccccccccCCChhhHHHHhhCCceEEEEec Confidence 1111 12467788888887753321 12333455555542 2 2356899999999999999997 Q ss_pred Cce-EEecCEeecC-------eehhHHHHHHHHHHHHHHHHHHHHHhcCCCCccCHHHHHHHHHHHHHHHHHHHHcCccc Q lcl|NC_013597. 365 DVA-MIAEGTVIGG-------KFADEIVILDWFVDAVQKEVFARLYKSPTKIPLTDKGQAILIAAVEKVCLEGINNGAFA 436 (502) Q Consensus 365 ~~~-~~~~G~~~~G-------~~iD~~~~~dwl~~~iq~~l~~~l~~~~~kiPyt~~G~~~l~~~v~~vl~~a~~~G~I~ 436 (502) +.+ .++.++++.+ .||-+.+-.+|+...|+..+...++. |-+..=...|+..|+.-|++.++.|.|. T Consensus 349 ~~G~~~wG~rT~~~~~~~~~~~~i~vrR~~~~i~~~~~~~~~~~v~e-----~~~~~~~~~i~~~i~~~l~~l~~~g~l~ 423 (477) T protein:vir:79 349 GSGLRLWGNRTAAWPTVTHMRNFENVRRTGDVINESLRYFSQQFVDA-----PIDQGLIDSLVESVNGFGRKLIGDGALL 423 (477) T ss_pred CCcEEEEcccccCCCCCCccceeeehhhHHHHHHHHHHHHHHHhccC-----CCCHHHHHHHHHHHHHHHHHHHhCCcee Confidence 754 6788888843 26778889999999999998765543 5577778999999999999999999985 Q ss_pred cccccCccccccccccccccceEEEcCchhcCCHHHHhhcccCceEEEEEECceEEEEEEEEEEeC Q lcl|NC_013597. 437 PGKWTGAGFGNLSTGDYLDKGFYVWAAPMDTLSDSDRQARRATPIQTAVKLAGAIHSSDVIVNYNR 502 (502) Q Consensus 437 ~g~~~~~~~g~~~~~~~~~~gy~v~~~~~~~~s~~dra~R~~~~i~~~~~~aGaIh~v~i~~~v~~ 502 (502) ||.+.+ .+++.+++|+.+.+. -+.+.+.....+++|.++...+. T Consensus 424 --------------------g~~v~~-~~~~nt~~~i~~G~~-~~~i~~~p~~p~e~i~~~~~~~~ 467 (477) T protein:vir:79 424 --------------------GFKAWF-DPARNPKEELAAGHL-LINYKYTVPPPLERLTYETEITS 467 (477) T ss_pred --------------------eeEEEE-ecCCCCHHHhhCCeE-EEEEEEEecCCceeEEEEEEEec Confidence 478888 467889999999998 59999999999999999999988 No 18 >protein:vir:1996 Length: 495 # NCBI annotation: major tail subunit # Family: family:all:369 # MgeID: mge:320 # MgeName: Mu # Cross-refs: genbank:acc:NP_050643;genbank:gi:9633530;genbank:GeneID:2636269 Probab=99.21 E-value=5.4e-10 Score=71.39 Aligned_cols=438 Identities=15% Similarity=0.118 Sum_probs=233.8 Q ss_pred CC------cCcCc-----eeEEeecccccccccccccceEEEecccccccccCccceEEecCHHHHHhhcCCCcHHHHHH Q lcl|NC_013597. 1 MA------LSISH-----IVNVQLNTVPKSAARKSFGIVALFTPEAGQAFADEKTRYVYVENQRDVEQLFGTNSETAKAA 69 (502) Q Consensus 1 Ms------ip~s~-----iV~V~i~~~~~~~~~~~f~~~lil~~~~~~~~~~~~~r~~~y~s~~~v~~~fg~~s~ey~aA 69 (502) |+ ||-|- ++.++.+..- .....+---.||+|......-..+...++.+ |.++..+.||..|-...|+ T Consensus 1 m~~i~F~~IP~~iRvP~~y~E~dns~A~-~g~~~~~q~vLiiGq~la~gs~~~~~pv~v~-s~~~a~~~fG~GS~la~M~ 78 (495) T protein:vir:19 1 MSDISFNAIPSDVRVPLTYIEFDNSNAV-SGTPAPRQRVLMFGQSGSKASAAPNVPVRIR-SGSQASAAFGQGSMLALMA 78 (495) T ss_pred CCCCchhhCCcccccCeEEEEEccCCCC-cCCcCCCceEEEEEecCcccccccceeEEec-CHHHHHHhcCcCcHHHHHH Confidence 43 44331 2333333321 1222333347899886544344455677877 5678899999999999999 Q ss_pred HHHhcCCCCcceEEEEEeecccccceeeee-eccchhhhHHHHHhhcccceeEEEEecCccccccccccccccchhhHHH Q lcl|NC_013597. 70 QPFFAQSPRAKQLIVARWQKSASTIEATKN-TLSGATLSDDLERFKSVVNGRFSLTIGGDVKKVDGLSFARLADFNAVAT 148 (502) Q Consensus 70 ~~~F~q~p~P~~l~igr~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~g~~~iti~g~~~~~~~i~~s~~ts~~~vA~ 148 (502) +.|...-|--.--.|+- .. +....+++. .+.|. +-..|.+.+.|+|....+. +....+.+.+|+ T Consensus 79 ~a~~~~n~~~~l~~i~~-~D-~aG~aA~g~it~tg~----------at~~G~l~l~I~g~~v~v~---V~~gdTaa~vA~ 143 (495) T protein:vir:19 79 DAFLNANRVAELWCIPQ-GN-GTGNAAVGEISLSGT----------AGENGSLVTYIAGQRLAVS---VAAGATGAALAD 143 (495) T ss_pred HHHHHhCCcceEEEEee-CC-hhhceeEEEEEEeec----------CCCCcEEEEEECCEEEEEE---ecCCCCHHHHHH Confidence 99997655433333332 22 222333332 22222 2247999999999988753 456677888999 Q ss_pred HHHhhhcccc-cceeEEEec------ccceeeEeeecccccccceeeeeeccccchhhhhhhhhhcccccceeee--ecc Q lcl|NC_013597. 149 KIQEKLTTLS-VAVSIAYDE------TGNRFIVSANVAGEDKKTEIDYAIDEGGEGEYIGALLKLENGQASRKVG--KNS 219 (502) Q Consensus 149 ~i~aal~~a~-~~~tv~~~~------~~~~f~~~s~ttG~~~~v~~~~a~~~~~t~t~~aa~l~~t~~~~~~~v~--v~~ 219 (502) .+.+++.+.. ..++.+.+. ......++.+-.|+...+.+....-. ++ ..+.-..+. --+ T Consensus 144 al~aaina~~~lPvTA~~~~~~~~~~a~~~VtlTAr~kG~~n~idi~~~~~~---ge---------~~p~Glt~titams 211 (495) T protein:vir:19 144 LLVARIKGQPDLPVTAEVRADSGDDDTHADVVLSAKFTGALSAVDVRWNYYA---GE---------TTPYGIITAFKAAS 211 (495) T ss_pred HHHHHhcCCccCceEEEeeccCCCCcCceeEEEEEeeccccccceeEEEeec---cc---------ccccceeEEEEecC Confidence 9999887642 223322211 23456666666665433332211100 00 011111111 112 Q ss_pred ccccccCHHHHHHHHHhccCceeEEEEecCCChhHHHHHHHHHhhc----CCEEEEEecCchhcccchhHHHHHHHHccC Q lcl|NC_013597. 220 VSLKKETLGEALFNVAEVNNTWYGFTVAAQLTDSEVEAAAKYAQAN----TKLFGANVIRAEQIEWSADNIYKKLYDAGL 295 (502) Q Consensus 220 ~~~~~et~~~al~al~~~~~~w~~~~~~~~~~~~~~~a~a~w~~a~----~~~~~~~~~~~~~~~~~~~~i~~~l~~~~~ 295 (502) .|....++..+|+++.+. ||-+++..-.+.+.+.+|-++++.- +++++....- ..++-......-...|. T Consensus 212 gGag~PDia~alaal~~~---~~~~I~~P~tD~asL~al~~~l~~rw~~~~q~~g~~~~a---~~gT~~~l~t~g~~~N~ 285 (495) T protein:vir:19 212 GKNGNPDISASIAGMGDL---QYKYIVMPYTDEPNLNLLRTELQERWGPVNQADGFAVTV---LSGTYGDISTFGVSRND 285 (495) T ss_pred CCCCCcchHHHHHHhccC---CCcEEEEecCcHHHHHHHHHHHHHhhhHHHhcCeEEEEe---ecCCHHHHHHhhhccCC Confidence 344455678888877654 5544444334566777888888762 2333322110 11222333333344566 Q ss_pred CceEEEecCCcc---c-hHHHHHHHHH---hcCCCCCCceeeEeeeecCccccCC----CCHHHHHHHHhCCceEEEE-E Q lcl|NC_013597. 296 DHTLAMFDKNDM---Y-PVSSALARLL---STNFAANNSTLTLKFKQQPTITADE----ITATEFAKAKRLGINVYTY-F 363 (502) Q Consensus 296 ~~t~~~y~~~~~---~-~~aa~~g~~a---s~n~~~~~g~~T~~fk~~~Gv~~~~----lt~t~~~~l~~~~~n~y~~-~ 363 (502) .|..++.....+ + .+|++.++++ ..|+.+. +.--.|+||.|.. ++.+|.+.|..+|+..|.. - T Consensus 286 ~~it~~~~~gsp~~~~~~AAA~aa~~A~~l~~DPArP-----L~tl~L~Gi~~p~~~~r~~~~ern~LL~~Gist~~V~~ 360 (495) T protein:vir:19 286 HLISCMGIAGAPEPSYLYAATLCAVASQALSIDPARP-----LQTLTLPGRMPPAVGDRFTWSERNALLFDGISTFNVND 360 (495) T ss_pred ceEEEEecCCCCCcHHHHHHHHHHHHHHHhhcccccc-----cCceeecceecCCccccCChHHHHHHHhCCcceEEECC Confidence 777666543322 2 2234444443 4565443 3334577888554 6899999999999998875 4 Q ss_pred cCceEEecCEee-----cC----eehh--HHHHHHHHHHHHHHHHHHHHHhcCCCCccCHHH---------HHHHHHHHH Q lcl|NC_013597. 364 DDVAMIAEGTVI-----GG----KFAD--EIVILDWFVDAVQKEVFARLYKSPTKIPLTDKG---------QAILIAAVE 423 (502) Q Consensus 364 ~~~~~~~~G~~~-----~G----~~iD--~~~~~dwl~~~iq~~l~~~l~~~~~kiPyt~~G---------~~~l~~~v~ 423 (502) +|.-.+.+..+. .| .|.| .++-++++...++..+-..+ -+ .|+-=+..+ -..|++.+- T Consensus 361 ~G~V~I~R~ITTY~~n~~G~~D~syLDi~T~~tl~yvr~~~r~~i~~kf-pR-~KLa~d~~~~~~gq~IvTp~~ir~ell 438 (495) T protein:vir:19 361 GGEMQIERMITMYRTNKYGDSDPSYLNVNTIATLSYLRYSLRTRITQKF-PN-YKLASDGTRFATGQAVVTPSVIKTELL 438 (495) T ss_pred CCeEEEEeeeeeeeecCCCCcchhhhhhHHHHHHHHHHHHHHHHHhhhc-CC-cccccCCCCCCCcccccChHHHHHHHH Confidence 566544444333 45 3877 78899999999999985543 43 455443222 247889999 Q ss_pred HHHHHHHHcCccccccccCccccccccccccccceEEEcCchhcCCHHHHhhcccCceEEEEEECceEEEEEEEEEEeC Q lcl|NC_013597. 424 KVCLEGINNGAFAPGKWTGAGFGNLSTGDYLDKGFYVWAAPMDTLSDSDRQARRATPIQTAVKLAGAIHSSDVIVNYNR 502 (502) Q Consensus 424 ~vl~~a~~~G~I~~g~~~~~~~g~~~~~~~~~~gy~v~~~~~~~~s~~dra~R~~~~i~~~~~~aGaIh~v~i~~~v~~ 502 (502) +++++....|++. .. +.+. +. +.+ -|+...+ .|-+=..| ..+-...|.+-.++.+-= T Consensus 439 ~~~~~le~~give-n~---~~~~---------~~--LiV-erd~~dp-nRln~~~p-----~d~vn~L~V~A~~i~f~L 495 (495) T protein:vir:19 439 ALFEEWENAGLVE-DF---DTFK---------EE--LYV-ARNKDDK-DRLDVLCG-----PNLINQFRIFAAQVQFIL 495 (495) T ss_pred HHHHhhhhhcccc-Ch---hhhc---------ce--eEE-EECCCCC-cEEEEEec-----ceeeCceeeeeeeeeeeC Confidence 9999999999985 11 1110 00 111 1111111 13222222 222222222222111111 No 19 >protein:vir:102957 Length: 437 # NCBI annotation: sheath tail protein # Family: family:all:632 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945292;genbank:gi:39653727;uniprot:Q708M0;genbank:GeneID:2672871 Probab=99.15 E-value=1.2e-09 Score=69.45 Aligned_cols=399 Identities=14% Similarity=0.106 Sum_probs=206.1 Q ss_pred CC--------cCc-CceeEEeecccccccccccccceEEEecccccccccCccceEEecCHHHHHhhcCCC--cHHHHHH Q lcl|NC_013597. 1 MA--------LSI-SHIVNVQLNTVPKSAARKSFGIVALFTPEAGQAFADEKTRYVYVENQRDVEQLFGTN--SETAKAA 69 (502) Q Consensus 1 Ms--------ip~-s~iV~V~i~~~~~~~~~~~f~~~lil~~~~~~~~~~~~~r~~~y~s~~~v~~~fg~~--s~ey~aA 69 (502) |+ ..+ --+|++...- ...+....=+...|++.-.- ++..+.+.. .|.++....||.. .+.|++. T Consensus 1 m~gg~~~~~~k~~PGvYi~~~~~~-~~~i~~~~~~~~a~~~~~~~---Gp~~~~~~i-~s~~d~~~~fG~~~~~~~~~~~ 75 (437) T protein:vir:10 1 MAGGIWKRQNKVRPGAYINVKSKD-IAMTRLGGDGVVTVPLALSF---GQSKKLMKI-RRGEDLFKKLGYEQESPQLLLL 75 (437) T ss_pred CCcceecccceecCceeEEEecCC-cceeeccCCcEEEEEEEecC---CCCceeEEE-ecHHHHHHHcCCccchhHHHHH Confidence 32 111 2345543222 22232333345555555433 333344555 5668999999954 5677777 Q ss_pred HHHhcCCCCcceEEEEEeecccccceeeeeeccc-hhhhHHHHHhhcccceeEEEEecCccccccccccccccchhhHHH Q lcl|NC_013597. 70 QPFFAQSPRAKQLIVARWQKSASTIEATKNTLSG-ATLSDDLERFKSVVNGRFSLTIGGDVKKVDGLSFARLADFNAVAT 148 (502) Q Consensus 70 ~~~F~q~p~P~~l~igr~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~g~~~iti~g~~~~~~~i~~s~~ts~~~vA~ 148 (502) +.+|. .++++++-|... +..+..+ +.+ .++.+ .......-.++++|.......+..++. T Consensus 76 ~~~~~---g~~~~~~~R~~~-g~~a~~t---l~~~~~~~A---~~~G~~gn~i~v~v~~~~~d~~~~~v~---------- 135 (437) T protein:vir:10 76 NEAFK---RVSEVLLYRLNT-GEKANVS---LSDNVTAQA---KYSGVRGNDITVTVKTNVDDPSSFDVV---------- 135 (437) T ss_pred HHHhc---CCCEEEEEECCC-CceeeEe---eccceEEEe---ccCCcccceeEEEEeeccCCccceEEE---------- Confidence 77774 488999988653 2222211 111 00000 001111112444433221111111110 Q ss_pred HHHhhhcccccceeEEEecccceeeEeeecccccccceeeeeeccccchhhhhhhhhhcccccceeeeecccc-ccccCH Q lcl|NC_013597. 149 KIQEKLTTLSVAVSIAYDETGNRFIVSANVAGEDKKTEIDYAIDEGGEGEYIGALLKLENGQASRKVGKNSVS-LKKETL 227 (502) Q Consensus 149 ~i~aal~~a~~~~tv~~~~~~~~f~~~s~ttG~~~~v~~~~a~~~~~t~t~~aa~l~~t~~~~~~~v~v~~~~-~~~et~ 227 (502) ++..............-+........ ..++.. + ...+..++.....+ ...++. T Consensus 136 ---------------~~~~~~~~d~~~v~~~~~~~~n~~v~-----~~~~~~-----l-~~~a~~~LtGG~dg~~t~~dy 189 (437) T protein:vir:10 136 ---------------TFLDTVVMDLQTVKVLADLKNNALVE-----FSGTGE-----L-QPVAGAKLTGGTDGAISTQDY 189 (437) T ss_pred ---------------EecCcceeeeeehhhhhhhhhhcccc-----cccccc-----c-ccccceeeeccccCCCChhHH Confidence 00000000000000000000000000 000000 0 00000111111111 234567 Q ss_pred HHHHHHHHhccCceeEEEEecCCChhHHHHHHHHHhhc----CCEEEEEecCchhcccchhHHHHHHHHccCCceEEE-- Q lcl|NC_013597. 228 GEALFNVAEVNNTWYGFTVAAQLTDSEVEAAAKYAQAN----TKLFGANVIRAEQIEWSADNIYKKLYDAGLDHTLAM-- 301 (502) Q Consensus 228 ~~al~al~~~~~~w~~~~~~~~~~~~~~~a~a~w~~a~----~~~~~~~~~~~~~~~~~~~~i~~~l~~~~~~~t~~~-- 301 (502) .++|+++.....+|. +++ ..+.+.+.++..|++.. .+.+......... .+.+.+.+ T Consensus 190 ~~al~~le~~~~n~l--~~~-~~d~~~~t~~~~~ik~~r~~~g~~~~~V~~~~~~---------------d~e~Iin~~n 251 (437) T protein:vir:10 190 LEYFKALETVEFNYM--ALP-VEDASIKKAAINFIKRMREDEGLGAQLVVADSDA---------------DSEAVINVKN 251 (437) T ss_pred HHHHHHhccCcceEE--Eec-CCChhHHHHHHHHHHHHHhccCceEEEEeCCCCC---------------CCceEEEeec Confidence 899999987654443 333 34678899999998852 3444333222110 11222211 Q ss_pred ----ecC---CccchHHHHHHHHHhcCCCCCCceeeEeeeecCccc-c-CCCCHHHHHHHHhCCceEEEEEcCceEEecC Q lcl|NC_013597. 302 ----FDK---NDMYPVSSALARLLSTNFAANNSTLTLKFKQQPTIT-A-DEITATEFAKAKRLGINVYTYFDDVAMIAEG 372 (502) Q Consensus 302 ----y~~---~~~~~~aa~~g~~as~n~~~~~g~~T~~fk~~~Gv~-~-~~lt~t~~~~l~~~~~n~y~~~~~~~~~~~G 372 (502) .+. +.....+.+.|.+|+.+++. .+-||.++|+. . ..++.+|++.+.++|...+.+.++.-.+.+| T Consensus 252 ~~~~~~~~~~~~~~~~a~vAG~~Ag~~~~~-----S~t~~~~~~~~~v~~~~t~~e~~~~i~~G~~vl~~~~~~v~i~~g 326 (437) T protein:vir:10 252 GVILSDKTVIDKTKATVWVAAASANAGVEK-----SLTYEKYEDSVDVVGRLSHTETEDALLKGQFVFTARRGRAVVEQD 326 (437) T ss_pred ceeecCcceechhhHHHHHHHHhccCcccc-----CccccccCCcccccccCCHHHHHHHHhCCcEEEEEeCCeEEEEEc Confidence 111 11123455566667665543 24578899874 3 4789999999999999998877666666666 Q ss_pred Eee----c---C-e--ehhHHHHHHHHHHHHHHHHHHHHHhcCCCCccCHHHHHHHHHHHHHHHHHHHHcCccccccccC Q lcl|NC_013597. 373 TVI----G---G-K--FADEIVILDWFVDAVQKEVFARLYKSPTKIPLTDKGQAILIAAVEKVCLEGINNGAFAPGKWTG 442 (502) Q Consensus 373 ~~~----~---G-~--~iD~~~~~dwl~~~iq~~l~~~l~~~~~kiPyt~~G~~~l~~~v~~vl~~a~~~G~I~~g~~~~ 442 (502) ... + + + .|-.++-.|.+.+.++..+ +..+. +|+|=+..|...+++.|+..|++..+.|.|.+.... T Consensus 327 InTltt~~~~~~~~~~ki~vir~~D~i~~di~~~~-~~~yi--Gk~~N~~~~r~~~~~~i~~yl~~l~~~g~I~~~~~~- 402 (437) T protein:vir:10 327 INSHVSFTIEKNQDFRKNRILRTLDDIVNDTRYAF-SEYFL--GKVSNNEDGRQAFKANRIRYFKDLEARGAIEDFKVE- 402 (437) T ss_pred cccccccCCCCCchhhhhhHHHHHHHHHHHHHHHH-Hhccc--cccCCCHHHHHHHHHHHHHHHHHHHhCCCccCCCce- Confidence 533 1 1 2 3667888889998887764 43344 689988899999999999999999999999643211 Q ss_pred ccccccccccccccceEEEcCchhcCCHHHHhhcccCceEEEEEECceEEEEEEEEEEe Q lcl|NC_013597. 443 AGFGNLSTGDYLDKGFYVWAAPMDTLSDSDRQARRATPIQTAVKLAGAIHSSDVIVNYN 501 (502) Q Consensus 443 ~~~g~~~~~~~~~~gy~v~~~~~~~~s~~dra~R~~~~i~~~~~~aGaIh~v~i~~~v~ 501 (502) .+.+. + .+ .+..--+++.++.-.++.++.+.+.|+ T Consensus 403 ----------------d~~v~---~--~~---~~~~v~v~~~v~~~dame~iy~ti~v~ 437 (437) T protein:vir:10 403 ----------------DIEVL---R--GE---LKESVVVNVKVKPVDSMEKLYMTVTVE 437 (437) T ss_pred ----------------eEEee---c--CC---CCCEEEEEEEEEEeeeeeeEEEEEEec Confidence 11111 0 01 122334899999999999999999999 No 20 >protein:vir:99306 Length: 587 # NCBI annotation: putative major tail sheath protein # Family: family:all:2449 # MgeID: mge:1655 # MgeName: K # Cross-refs: genbank:acc:YP_024479;genbank:gi:48696438;genbank:GeneID:2948034 Probab=99.09 E-value=2.4e-09 Score=67.81 Aligned_cols=451 Identities=13% Similarity=0.104 Sum_probs=225.5 Q ss_pred CCcCc---Cce----eEEeeccc-ccccccccccceEEEecccccccccCccceEEecCHHHHHhhcCCCcHHHHHHHHH Q lcl|NC_013597. 1 MALSI---SHI----VNVQLNTV-PKSAARKSFGIVALFTPEAGQAFADEKTRYVYVENQRDVEQLFGTNSETAKAAQPF 72 (502) Q Consensus 1 Msip~---s~i----V~V~i~~~-~~~~~~~~f~~~lil~~~~~~~~~~~~~r~~~y~s~~~v~~~fg~~s~ey~aA~~~ 72 (502) |++.+ -++ |-|.+.-+ ..+....+.+.+.|+|....-++ .++..+++.++...-||... .-.+.... T Consensus 1 ~a~~~~~~~~~~~pgv~~~~~~~~~~~~~~~~~~~~~~~g~a~~G~~----~~~~~~~~~~~~~~~~~~g~-l~~~~~~a 75 (587) T protein:vir:99 1 MAVEPFPRRPITRPHASIEVDTSGIGGSAGSSEKVFCLIGQAEGGEP----NTVYELRNYSQAKRLFRSGE-LLDAIELA 75 (587) T ss_pred CcccccCCcccccCceEEEEecCCccccCCCCCceEEEEEEecCCcc----ceeEEeccHHHHHHHhcCcc-hHHHHHHH Confidence 88754 222 33333333 34455666778889998876655 34566888899999998854 66777888 Q ss_pred hcCCC--CcceEEEEEeecccccceeeeeecc------------------chhhhHHH------------HHhhccccee Q lcl|NC_013597. 73 FAQSP--RAKQLIVARWQKSASTIEATKNTLS------------------GATLSDDL------------ERFKSVVNGR 120 (502) Q Consensus 73 F~q~p--~P~~l~igr~~~~~~~~~~~~~~~~------------------~~~~~~~~------------~~~~~~~~g~ 120 (502) |.+++ .++++|+-|.. .+..+.++.+.|. ..++.... +.+..+- -. T Consensus 76 ~~~~~~~g~~~~~~~rv~-~~~~a~~~~~~l~~~a~~~G~~gN~i~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g-~v 153 (587) T protein:vir:99 76 WGSNPNYTAGRILAMRIE-DAKPASAEIGGLKITSKIYGNVANNIQVGLEKNTLSDSLRLRVIFQDDRFNEVYDNIG-NI 153 (587) T ss_pred hccccCCCceEEEEEEcC-CCceeEEEecCeEEEEeeccccccceEEEEccCCCCcceeEEEEEecccceeeeeecc-ce Confidence 87755 57888888763 3334444433222 00000000 0000000 01 Q ss_pred EEEEecCcccc-------------ccccccccc---------cch-hhHHHHHHhhhcccccceeEEEec-ccc------ Q lcl|NC_013597. 121 FSLTIGGDVKK-------------VDGLSFARL---------ADF-NAVATKIQEKLTTLSVAVSIAYDE-TGN------ 170 (502) Q Consensus 121 ~~iti~g~~~~-------------~~~i~~s~~---------ts~-~~vA~~i~aal~~a~~~~tv~~~~-~~~------ 170 (502) |+|.-.|.... ...+-+... ++. ...+..+...+..- ...+..+-. ..+ T Consensus 154 ~~i~y~g~~~~a~~~v~~~~~t~~a~~~~l~~g~~~v~~yrL~~g~~~~~~~~~~~i~~~-~~~tAky~~~~~~~i~~~~ 232 (587) T protein:vir:99 154 FTIKYKGEEANATFSVEHDEETQKASRLVLKVGDQEVKSYDLTGGAYDYTNAIITDINQL-PDFEAKLSPFGDKNLESSK 232 (587) T ss_pred eeEEeecccccceeeEeecCcceeeeeeeeecCCceeEEEEecCCchHHHHHHHhhhccc-cceeEEeeccCCceeEeec Confidence 22222222111 000111000 000 11111121111110 000111100 000 Q ss_pred -----eeeEeeec------cccc----ccceeeeeeccccch-----------hhhhhhhhhcc-----cccceeeeecc Q lcl|NC_013597. 171 -----RFIVSANV------AGED----KKTEIDYAIDEGGEG-----------EYIGALLKLEN-----GQASRKVGKNS 219 (502) Q Consensus 171 -----~f~~~s~t------tG~~----~~v~~~~a~~~~~t~-----------t~~aa~l~~t~-----~~~~~~v~v~~ 219 (502) .+...... .++. ....+........++ .+.+....... ......+.-.. T Consensus 233 ~~~~~~~~v~~~~~~v~a~~~D~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~t~LtGG~ 312 (587) T protein:vir:99 233 LDKIENANIKDKAVYVKAVFGDLEKQTAYNGIVSFEQLNAEGEVPSNVEVEAGEESATVTATSPIKTIEPFELTKLKGGT 312 (587) T ss_pred ccccccceeeeeeeeeehhccceeeecccceeeeeeecccccchhhhhhhhhccccceeeeeccccceecccceeeecCC Confidence 01110000 0000 000000000000000 00000000000 00001122222 Q ss_pred ccccccCHHHHHHHHHhccCceeEEEEecCCChhHHHHHHHHHhhc----CCEEEEEecCchhcccchhHHHHHHHHccC Q lcl|NC_013597. 220 VSLKKETLGEALFNVAEVNNTWYGFTVAAQLTDSEVEAAAKYAQAN----TKLFGANVIRAEQIEWSADNIYKKLYDAGL 295 (502) Q Consensus 220 ~~~~~et~~~al~al~~~~~~w~~~~~~~~~~~~~~~a~a~w~~a~----~~~~~~~~~~~~~~~~~~~~i~~~l~~~~~ 295 (502) .|...++..++|+++... +|+.++. ...+.+.+.++.+|++.. +++..+..... ..+...+....+..++ T Consensus 313 dG~~~~sy~~al~ale~~--~~~~i~~-~t~d~~i~a~l~a~vk~~r~~g~~~~aVlg~~~---~~~~~~~~~~a~~~n~ 386 (587) T protein:vir:99 313 NGEPPATWADKLDKFAHE--GGYYIVP-LSSKQSVHAEVASFVKERSDAGEPMRAIVGGGF---NESKEQLFGRQASLSN 386 (587) T ss_pred CCCccccHHHHHHHHhhC--CcEEEEe-cCCCHHHHHHHHHHHHHHHhCCCcEEEEecCCC---CCCHHHHHHHhhhcCC Confidence 333345678899998775 5665543 333456667899999753 23443332221 1123334455566788 Q ss_pred CceEEEecC------C---ccch----HHHHHHHHHhcCCCCCCceeeEeeeecC--ccccCCCCHHHHHHHHhCCceEE Q lcl|NC_013597. 296 DHTLAMFDK------N---DMYP----VSSALARLLSTNFAANNSTLTLKFKQQP--TITADEITATEFAKAKRLGINVY 360 (502) Q Consensus 296 ~~t~~~y~~------~---~~~~----~aa~~g~~as~n~~~~~g~~T~~fk~~~--Gv~~~~lt~t~~~~l~~~~~n~y 360 (502) .|.+.+... + ..++ .+.+.|..+..+++..+ | ||.++ ++. ..++.+|++.+..+|++.+ T Consensus 387 e~vi~v~~~~~~~~~dg~~~~~~~~~~aa~vAGl~Ag~~~~~Sl---T--~~~i~~~~v~-~~~t~~e~e~li~~Gvl~l 460 (587) T protein:vir:99 387 PRVSLVANSGTFVMDDGRKNHVPAYMVAVALGGLASGLEIGESI---T--FKPLRVSSLD-QIYESIDLDELNENGIISI 460 (587) T ss_pred CcEEEEeccceEecCCCceeeechHHHHHHHHHHHhcCchhcCc---c--ceeeeccccc-ccCCHHHHHHHHhCCeEEE Confidence 887665332 0 1122 34555777777665433 3 34444 443 3689999999999999998 Q ss_pred EEEcCc----eEEecCEeec----C-ee--hhHHHHHHHHHHHHHHHHHHHHHhcCCCCccCHHHHHHHHHHHHHHHHHH Q lcl|NC_013597. 361 TYFDDV----AMIAEGTVIG----G-KF--ADEIVILDWFVDAVQKEVFARLYKSPTKIPLTDKGQAILIAAVEKVCLEG 429 (502) Q Consensus 361 ~~~~~~----~~~~~G~~~~----G-~~--iD~~~~~dwl~~~iq~~l~~~l~~~~~kiPyt~~G~~~l~~~v~~vl~~a 429 (502) ....+. -..-+|.+.= + .| |-.++-.|.+...++..+-+. +- +| |=++.|...|++.|++.|++. T Consensus 461 ~~~~~~~~~~vriv~~ItT~t~~~~~~~~~i~viRv~D~i~~di~~~~~~~-yi--Gk-~Nn~~~r~~i~~~i~~~L~~l 536 (587) T protein:vir:99 461 EFVRNRTNTFFRIVDDVTTFNDKSDPVKAEMAVGEANDFLVSELKVQLEDQ-FI--GT-RTINTSASIIKDFIQSYLGRK 536 (587) T ss_pred EEecCCcceEEEEeeceeeccCCCCchhhhhhhhhhHHHHHHHHHHHHHhh-CC--cc-ccchHHHHHHHHHHHHHHHHH Confidence 765432 1233444442 1 25 568889999999998886333 33 45 678889999999999999999 Q ss_pred HHcCccccccccCccccccccccccccceEEEcCchhcCCHHHHhhcccCceEEEEEECceEEEEEEEEEEeC Q lcl|NC_013597. 430 INNGAFAPGKWTGAGFGNLSTGDYLDKGFYVWAAPMDTLSDSDRQARRATPIQTAVKLAGAIHSSDVIVNYNR 502 (502) Q Consensus 430 ~~~G~I~~g~~~~~~~g~~~~~~~~~~gy~v~~~~~~~~s~~dra~R~~~~i~~~~~~aGaIh~v~i~~~v~~ 502 (502) .+.|.|.... . . + +.+. ..+|+ . -+++.++.--++++|.+++.+.+ T Consensus 537 ~~~gaI~~~~-~-~--------d-------v~v~-----~~~d~---~--~v~~~v~Pv~~mekIy~tv~~~~ 582 (587) T protein:vir:99 537 KRDNEIQDFP-A-E--------D-------VQVI-----VEGNE---A--RISMTVYPIRSFKKISVSLVYKQ 582 (587) T ss_pred HhCCcccCCC-c-c--------c-------eEEE-----ecCCE---E--EEEEEEEEcccceEEEEEEEEEe Confidence 9999995211 0 0 0 1111 11222 2 48899999999999999999987 No 21 >protein:vir:95741 Length: 587 # NCBI annotation: ORF011 # Family: family:all:2449 # MgeID: mge:1577 # MgeName: G1 # Cross-refs: genbank:acc:YP_240910;genbank:gi:66394960;genbank:GeneID:5132682 Probab=99.05 E-value=3.8e-09 Score=66.73 Aligned_cols=453 Identities=12% Similarity=0.084 Sum_probs=225.8 Q ss_pred CCcCc---Cce----eEEeeccc-ccccccccccceEEEecccccccccCccceEEecCHHHHHhhcCCCcHHHHHHHHH Q lcl|NC_013597. 1 MALSI---SHI----VNVQLNTV-PKSAARKSFGIVALFTPEAGQAFADEKTRYVYVENQRDVEQLFGTNSETAKAAQPF 72 (502) Q Consensus 1 Msip~---s~i----V~V~i~~~-~~~~~~~~f~~~lil~~~~~~~~~~~~~r~~~y~s~~~v~~~fg~~s~ey~aA~~~ 72 (502) |++.+ -++ |-|.+.-+ ..+......+.+.|+|....-++ .++..+++.++...-||... .-.+.... T Consensus 1 ~a~~~~~~~~~~~pgv~~~~~~~~~~~~~~~~~~~~~~~g~a~~G~~----~~~~~~~~~~~~~~~~~~g~-l~~~~~~a 75 (587) T protein:vir:95 1 MAVEPFPRRPITRPHASIEVDTSGIGGSAGSSEKVFCLIGQAEGGEP----NTVYELRNYSQAKRLFRSGE-LLDAIELA 75 (587) T ss_pred CcccccCCcccccCceEEEEecCCccccCCCCCceEEEEEEeCCCCC----ceeEEeccHHHHHHHhcCcc-hHHHHHHH Confidence 88754 222 33333333 34456666778889998876655 34566888899999998854 55677888 Q ss_pred hcCCC--CcceEEEEEeecccccceeeeeecc-----------chhhhH---HHHH---hhc--ccce----------eE Q lcl|NC_013597. 73 FAQSP--RAKQLIVARWQKSASTIEATKNTLS-----------GATLSD---DLER---FKS--VVNG----------RF 121 (502) Q Consensus 73 F~q~p--~P~~l~igr~~~~~~~~~~~~~~~~-----------~~~~~~---~~~~---~~~--~~~g----------~~ 121 (502) |.|++ .++++|+-|. ..+..+.++.+.|. .-.+.. .+.. ++. ..++ .| T Consensus 76 ~~~~~~~g~~~~~~~rv-~~~~~a~~~~~~l~~~a~~~G~~gN~i~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~v~ 154 (587) T protein:vir:95 76 WGSNPNYTAGRILAMRI-EDAKPASAEIGGLKITSKIYGNVANNIQVGLEKNTLSDSLRLRVIFQDDRFNEVYDNIGNIF 154 (587) T ss_pred hccccCCCceEEEEEEc-CCCceeEEEecCeEEEEecccccccceEEEEecCCCCCceeEEEEEecccceeeeeecccee Confidence 87755 5788888885 44444444444222 000000 0000 000 0000 12 Q ss_pred EEEecCcccc-------------cccccccc---------ccch-hhHHHHHHhhhccc-----------ccceeEEEec Q lcl|NC_013597. 122 SLTIGGDVKK-------------VDGLSFAR---------LADF-NAVATKIQEKLTTL-----------SVAVSIAYDE 167 (502) Q Consensus 122 ~iti~g~~~~-------------~~~i~~s~---------~ts~-~~vA~~i~aal~~a-----------~~~~tv~~~~ 167 (502) +|.-.|.... ...+-+.. .++. ...+..+...+..- +..-.+.+.. T Consensus 155 si~y~g~~~~~~~~v~~~~~t~~a~~~~l~~g~~~v~~yrL~~g~~~~~~~~~~~in~~~~~tAky~g~~~~~i~~~~~~ 234 (587) T protein:vir:95 155 TIKYKGEEANATFSVEHDEETQKASRLVLKVGDQEVKSYDLTGGAYDYTNAIITDINQLPDFEAKLSPFGDKNLESSKLD 234 (587) T ss_pred eeeeeccccccceeeeecccceeeeeeeeecCCceEEEEEecCCchHHHHHHHHhhccccceEEEEecccCceeEEeecC Confidence 2221121110 00000000 0000 01111121111110 0000011101 Q ss_pred ccceeeEeeec------cccc----ccceeeeeeccc-----------cchhhhhhhhhhccc-----ccceeeeecccc Q lcl|NC_013597. 168 TGNRFIVSANV------AGED----KKTEIDYAIDEG-----------GEGEYIGALLKLENG-----QASRKVGKNSVS 221 (502) Q Consensus 168 ~~~~f~~~s~t------tG~~----~~v~~~~a~~~~-----------~t~t~~aa~l~~t~~-----~~~~~v~v~~~~ 221 (502) ....+.++... .++. ....+....... ....+.+....+... .....+.-...| T Consensus 235 ~~~~~~v~~~~~~v~a~~~d~~~~~~~~~~v~~~~~~g~~~~~~~~~~~~~~~~a~~~~~~~~~~~a~~~~t~LtGG~dG 314 (587) T protein:vir:95 235 KIENANIKDKAVYVKAVFGDLEKQTAYNGIVSFEQLNAEGEVPSNVEVEAGEESATVTATSPIKTIEPFELTKLKGGTNG 314 (587) T ss_pred cccccceehhhhhhhhhhcceeeeeeceeeeeeecccccceeccchhhhhcccchheeccccccceeccceeeeecCCCC Confidence 00011111000 0000 000000000000 000000000000000 000112222233 Q ss_pred ccccCHHHHHHHHHhccCceeEEEEecCCChhHHHHHHHHHhhc----CCEEEEEecCchhcccchhHHHHHHHHccCCc Q lcl|NC_013597. 222 LKKETLGEALFNVAEVNNTWYGFTVAAQLTDSEVEAAAKYAQAN----TKLFGANVIRAEQIEWSADNIYKKLYDAGLDH 297 (502) Q Consensus 222 ~~~et~~~al~al~~~~~~w~~~~~~~~~~~~~~~a~a~w~~a~----~~~~~~~~~~~~~~~~~~~~i~~~l~~~~~~~ 297 (502) ...++..++|+++... +|+.++... .+.+.+.++.+|++.. +++..+..... ..+...+....+..++.| T Consensus 315 ~~~~~y~~~l~ale~~--~~~~i~~~t-~d~~v~a~l~a~vk~~~~~g~~~~aVvg~~~---~~~~~~~~~~a~~~n~er 388 (587) T protein:vir:95 315 EPPATWADKLDKFAHE--GGYYIVPLS-SKQSVHAEVASFVKERSDAGEPMRAIVGGGF---NESKEQLFGRQESLSNPR 388 (587) T ss_pred CCcccHHHHHHHHHhC--CcEEEEecC-CCHHHHHHHHHHHHHHHhCCCcEEEEEcCCC---CCCHHHHHHHHhhcCCCc Confidence 3345678899998775 566554333 3456667899999753 23443332221 112334445556678888 Q ss_pred eEEEecC------C---ccch----HHHHHHHHHhcCCCCCCceeeEeeeecC--ccccCCCCHHHHHHHHhCCceEEEE Q lcl|NC_013597. 298 TLAMFDK------N---DMYP----VSSALARLLSTNFAANNSTLTLKFKQQP--TITADEITATEFAKAKRLGINVYTY 362 (502) Q Consensus 298 t~~~y~~------~---~~~~----~aa~~g~~as~n~~~~~g~~T~~fk~~~--Gv~~~~lt~t~~~~l~~~~~n~y~~ 362 (502) .+.+... + ..++ .+.+.|..+..+++..+ | ||.++ ++. ..++.+|++.+..+|++.... T Consensus 389 vi~v~~~~~~~~~dg~~~~~~~~~~aa~vAGl~Ag~~~~~Sl---T--~~~i~~~~v~-~~~t~~e~e~ai~~Gvl~l~~ 462 (587) T protein:vir:95 389 VSLVANSGTFVMDDGRKNHVPAYMVAVALGGLASGLEIGESI---T--FKPLRVSSLD-QIYESIDLDELNENGIISIEF 462 (587) T ss_pred EEEecccceEecCCCceeeechHHHHHHHHHHHhcCchhcCc---c--ceeeeccccc-ccCCHHHHHHHHhCCeEEEEE Confidence 8766432 1 1122 34556777777665433 3 34444 443 368999999999999999876 Q ss_pred EcCce----EEecCEeec----C-ee--hhHHHHHHHHHHHHHHHHHHHHHhcCCCCccCHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013597. 363 FDDVA----MIAEGTVIG----G-KF--ADEIVILDWFVDAVQKEVFARLYKSPTKIPLTDKGQAILIAAVEKVCLEGIN 431 (502) Q Consensus 363 ~~~~~----~~~~G~~~~----G-~~--iD~~~~~dwl~~~iq~~l~~~l~~~~~kiPyt~~G~~~l~~~v~~vl~~a~~ 431 (502) ..+.. ..-+|.+.= + .| |=.++-.|.+...++..+-+. +- +| |=++.|...|++.|++.|++..+ T Consensus 463 ~~~~~~~~vriv~~itT~t~~~d~~~~~i~viRv~D~i~~dir~~~~~~-~i--Gk-~nn~~~r~~v~~~i~~~L~~l~~ 538 (587) T protein:vir:95 463 VRNRTNTFFRIVDDVTTFNDKSDPVKAEMAVGEANDFLVSELKVQLEDQ-FI--GT-RTINTSASIIKDFIQSYLGRKKR 538 (587) T ss_pred ecCCcceEEEEeecceeccCCCCcchhhhhhhhhHHHHHHHHHHHHHhh-CC--cc-ccchHHHHHHHHHHHHHHHHHHh Confidence 54321 222444431 1 35 668889999999998886333 33 45 67889999999999999999999 Q ss_pred cCccccccccCccccccccccccccceEEEcCchhcCCHHHHhhcccCceEEEEEECceEEEEEEEEEEeC Q lcl|NC_013597. 432 NGAFAPGKWTGAGFGNLSTGDYLDKGFYVWAAPMDTLSDSDRQARRATPIQTAVKLAGAIHSSDVIVNYNR 502 (502) Q Consensus 432 ~G~I~~g~~~~~~~g~~~~~~~~~~gy~v~~~~~~~~s~~dra~R~~~~i~~~~~~aGaIh~v~i~~~v~~ 502 (502) .|.|.-... . + +.+. ..+|+ --++|.++..-++++|.+++.+.+ T Consensus 539 ~gaI~~~~~-~---------d-------v~v~-----~~~d~-----~~v~~~v~Pv~~mekI~vt~~~~~ 582 (587) T protein:vir:95 539 DNEIQDFPA-E---------D-------VQVI-----VEGNE-----ARISMTVYPIRSFKKISVSLVYKQ 582 (587) T ss_pred CCcccCCCc-c---------c-------eEEE-----ecCCE-----EEEEEEEEEcccceEEEEEEEEee Confidence 999952110 0 0 1111 11222 257899999999999999999988 No 22 >protein:vir:6079 Length: 396 # NCBI annotation: gpFI # Family: family:all:115 # MgeID: mge:126 # MgeName: WPhi # Cross-refs: genbank:acc:NP_878220;genbank:gi:33438919;genbank:GeneID:1457754 Probab=99.04 E-value=4.2e-09 Score=66.51 Aligned_cols=368 Identities=13% Similarity=0.067 Sum_probs=203.6 Q ss_pred CCcCcCceeEEeecccccccccccccceEEEeccccccc--ccCccceEEecCHHHHHhhcCCCcHHHHHHHHHhcCCCC Q lcl|NC_013597. 1 MALSISHIVNVQLNTVPKSAARKSFGIVALFTPEAGQAF--ADEKTRYVYVENQRDVEQLFGTNSETAKAAQPFFAQSPR 78 (502) Q Consensus 1 Msip~s~iV~V~i~~~~~~~~~~~f~~~lil~~~~~~~~--~~~~~r~~~y~s~~~v~~~fg~~s~ey~aA~~~F~q~p~ 78 (502) |+-++--+==+.+.-.+.++...+.+.+.|++....... .....++ ..++..+-...||.++..+.+...+|.+... T Consensus 1 m~~~~~Gv~v~e~~~~~~~v~~~~tav~~fvGta~~~~~~~~p~~~p~-~v~s~~~~~~~~g~~~tl~~a~~~~~~~gg~ 79 (396) T protein:vir:60 1 MSDYHHGVQVLEINEGTRVISTVSTAIVGMVCTASDADAEIFPLNKPV-LITNVQSAIAKAGKKGTLAASLQAIADQSKP 79 (396) T ss_pred CCCCCCCeEEEEcCCCcccccccCceeEEEEecccccccccccCccCe-EeechHHHHHhhcCcchhHHHHHHHhhccCc Confidence 998764433333455577888899998999886532211 1112334 4456677788899999999999999987543 Q ss_pred cceEEEEEeecccccceeeeeeccchhhhHHHHHhhcccceeEEEEecCccccccccccccccchhhHHHHHHhhhcccc Q lcl|NC_013597. 79 AKQLIVARWQKSASTIEATKNTLSGATLSDDLERFKSVVNGRFSLTIGGDVKKVDGLSFARLADFNAVATKIQEKLTTLS 158 (502) Q Consensus 79 P~~l~igr~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~iti~g~~~~~~~i~~s~~ts~~~vA~~i~aal~~a~ 158 (502) . .++-+....... ..... ...+..++.......+ ..+++.. T Consensus 80 ~--~~vv~~~~~~~~-----------~~~~~--------------------~~~~~~~~~~~~d~~~----~~tg~~a-- 120 (396) T protein:vir:60 80 V--TVVVRVEDGTGE-----------DEETK--------------------LAQTVSNIIGTTDENG----QYTGLKA-- 120 (396) T ss_pred e--EEEEeccccccc-----------ccccc--------------------cccccccccccccccc----cccchhh-- Confidence 2 222221110000 00000 0000000000000000 0000000 Q ss_pred cceeEEEecccceeeEeeecccccccceeeeeeccccchhhhhhhhhhcccccceeeeeccccccccCHHHHHHHHHhcc Q lcl|NC_013597. 159 VAVSIAYDETGNRFIVSANVAGEDKKTEIDYAIDEGGEGEYIGALLKLENGQASRKVGKNSVSLKKETLGEALFNVAEVN 238 (502) Q Consensus 159 ~~~tv~~~~~~~~f~~~s~ttG~~~~v~~~~a~~~~~t~t~~aa~l~~t~~~~~~~v~v~~~~~~~et~~~al~al~~~~ 238 (502) ....+ ...+ . .+. + ....+........++..+.. T Consensus 121 ---------l~~~~----~~~~------~-----------------------~~~-i-l~ap~~~~~~v~~al~~~~~-- 154 (396) T protein:vir:60 121 ---------LLAAE----SVTG------V-----------------------KPR-I-LGVPGLDTKEVAVALASVCQ-- 154 (396) T ss_pred ---------hhhcc----ccee------e-----------------------eee-e-ccccccccHHHHHHHHHHhc-- Confidence 00000 0000 0 000 0 00111111112223332222 Q ss_pred CceeEEEEecCCChhHHHHHHHHHhhcCCEEEEEecC-chhcccchhHHHHHHHHccCCceEEEecCCccchHHHHHHHH Q lcl|NC_013597. 239 NTWYGFTVAAQLTDSEVEAAAKYAQANTKLFGANVIR-AEQIEWSADNIYKKLYDAGLDHTLAMFDKNDMYPVSSALARL 317 (502) Q Consensus 239 ~~w~~~~~~~~~~~~~~~a~a~w~~a~~~~~~~~~~~-~~~~~~~~~~i~~~l~~~~~~~t~~~y~~~~~~~~aa~~g~~ 317 (502) ....+.+.+.....+..++-+|.+.-+..+....+. ....+. +...... ..+.+.++|.+ T Consensus 155 -~~~~~~i~d~p~~~~~~~a~~~~~~~~s~~~~~~~p~~~~~d~-------------~~~~~~~-----~p~s~~~AG~~ 215 (396) T protein:vir:60 155 -KLRAFGYISAWGCKTISEVKAYRQNFSQRELMVIWPDFLAWDT-------------VASTTAT-----AYATARALGLR 215 (396) T ss_pred -cCCeEEEEeCCCCCCHHHHHHHHhhcCCceEEEEeCceeeecc-------------cCCceeE-----EchhHHHHHHH Confidence 223333444433333444445555321111111100 000000 0000111 12457788888 Q ss_pred HhcCCCCCCceeeEeeeecCccccC--------CCCHHHHHHHHhCCceEEEEEcCceEEecCEeecCe----ehhHHHH Q lcl|NC_013597. 318 LSTNFAANNSTLTLKFKQQPTITAD--------EITATEFAKAKRLGINVYTYFDDVAMIAEGTVIGGK----FADEIVI 385 (502) Q Consensus 318 as~n~~~~~g~~T~~fk~~~Gv~~~--------~lt~t~~~~l~~~~~n~y~~~~~~~~~~~G~~~~G~----~iD~~~~ 385 (502) +.+|..... -.....|.+.||..- ..+.+|++.|..+|+|......| ..++.+++++++ ||-+.+- T Consensus 216 a~~d~~~g~-~~spaN~~l~gi~~~~~~~~~~~~~~~~~~~~Ln~~gI~~~~~~~G-~~~wG~rT~~~d~~~~~i~~rR~ 293 (396) T protein:vir:60 216 AKIDQEQGW-HKTLSNVGVNGVTGISASVFWDLQESGTDADLLNESGVTTLIRRDG-FRFWGNRTCSDDPLFLFENYTRT 293 (396) T ss_pred HHhhhccCc-EeCcCCceecceeeceeecccccCCCcchhhhhhhcCcEEEEcCCC-EEEEcccccCCCcccceeehhhH Confidence 888765422 112235666665432 34678999999999999966444 467899999984 7889999 Q ss_pred HHHHHHHHHHHHHHHHHhcCCCCccCHHHHHHHHHHHHHHHHHHHHcCccccccccCccccccccccccccceEEEcCch Q lcl|NC_013597. 386 LDWFVDAVQKEVFARLYKSPTKIPLTDKGQAILIAAVEKVCLEGINNGAFAPGKWTGAGFGNLSTGDYLDKGFYVWAAPM 465 (502) Q Consensus 386 ~dwl~~~iq~~l~~~l~~~~~kiPyt~~G~~~l~~~v~~vl~~a~~~G~I~~g~~~~~~~g~~~~~~~~~~gy~v~~~~~ 465 (502) .+|++..|+..+...++. |-+..-...|+..|+.-|+...++|.|. ||.+++. . T Consensus 294 ~~~i~~~i~~~~~~~v~e-----~n~~~~~~~i~~~i~~~l~~l~~~gal~--------------------g~~~~~d-~ 347 (396) T protein:vir:60 294 AQVLADTMAEAHMWAVDK-----PITATLIRDIVDGINAKFRELKTNGYIV--------------------DATCWFS-E 347 (396) T ss_pred HHHHHHHHHHHHHHhccC-----CCCHHHHHHHHHHHHHHHHHHHhCCcee--------------------ceEEEEe-c Confidence 999999999998765533 6788889999999999999999999985 3667765 4 Q ss_pred hcCCHHHHhhcccCceEEEEEECceEEEEEEEEEEeC Q lcl|NC_013597. 466 DTLSDSDRQARRATPIQTAVKLAGAIHSSDVIVNYNR 502 (502) Q Consensus 466 ~~~s~~dra~R~~~~i~~~~~~aGaIh~v~i~~~v~~ 502 (502) +..+++|+.+.+. -+.+.+.....+++|.++...+. T Consensus 348 ~~nt~~~i~~G~~-~~~i~~~p~~pae~I~~~~~~~~ 383 (396) T protein:vir:60 348 ESNDAETLKAGKL-YIDYDYTPVPPLENLTLRQRITD 383 (396) T ss_pred CCCCHHHhhCCEE-EEEEEEEecCCcceEEEEEEEch Confidence 6888999988888 58999999999999999999988 No 23 >protein:vir:1845 Length: 392 # NCBI annotation: tail sheath protein FI # Family: family:all:115 # MgeID: mge:324 # MgeName: 186 # Cross-refs: genbank:acc:NP_052269;genbank:gi:9634076;genbank:GeneID:1262448 Probab=99.01 E-value=5.8e-09 Score=65.76 Aligned_cols=365 Identities=12% Similarity=0.043 Sum_probs=204.9 Q ss_pred CCcCcCceeEEeecccccccccccccceEEEeccccccc--ccCccceEEecCHHHHHhhcCCCcHHHHHHHHHhcCCCC Q lcl|NC_013597. 1 MALSISHIVNVQLNTVPKSAARKSFGIVALFTPEAGQAF--ADEKTRYVYVENQRDVEQLFGTNSETAKAAQPFFAQSPR 78 (502) Q Consensus 1 Msip~s~iV~V~i~~~~~~~~~~~f~~~lil~~~~~~~~--~~~~~r~~~y~s~~~v~~~fg~~s~ey~aA~~~F~q~p~ 78 (502) |+=++--+--+.+.-.+.++...+-+.+-+++....... ....+++ ..++..+-...||.+.....+...+|.+... T Consensus 1 m~~~~~Gv~v~e~~~g~~~i~~~~tav~g~vgta~~~~~~~~~~~~p~-~its~~~~~~~~g~~gtl~~al~~~~~ngg~ 79 (392) T protein:vir:18 1 MSDFHHGTKVIEINDGTRVISTVATAIVGMVWTASDADAETFPLNEPV-LITNVQSAIAKAGKKGTLSASLQAIADQSKP 79 (392) T ss_pred CCCCCCCeEEEEcCCCceeeeccCcceeEEEEeccCCCCcccccccce-EeechHHHHhhcCCCcchHHHHHHhhcccCc Confidence 998776655555556667777777777777776532110 0112334 4567777777889888888888888877543 Q ss_pred cceEEEEEeecccccceeeeeeccchhhhHHHHHhhcccceeEEEEecCccccccccccccccchhhHHHHHHhhhcccc Q lcl|NC_013597. 79 AKQLIVARWQKSASTIEATKNTLSGATLSDDLERFKSVVNGRFSLTIGGDVKKVDGLSFARLADFNAVATKIQEKLTTLS 158 (502) Q Consensus 79 P~~l~igr~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~iti~g~~~~~~~i~~s~~ts~~~vA~~i~aal~~a~ 158 (502) +. ++-+....... .+. ..+..++-....... ......+|.... T Consensus 80 ~~--~vv~v~~~~~~----------~~~------------------------~~t~~dliG~~~~~~-~~tg~~al~~~~ 122 (392) T protein:vir:18 80 VT--VVVRVAEGTGD----------DAE------------------------AQTTSNIIGGTDENG-KYTGIKALLTAE 122 (392) T ss_pred eE--EEecccccccc----------ccc------------------------ccchhhheecccccc-hhhhHHHHHhhh Confidence 32 22111000000 000 000000000000000 000000111100 Q ss_pred cceeEEEecccceeeEeeecccccccceeeeeeccccchhhhhhhhhhcccccceeeeeccccccccCHHHHHHHHHhcc Q lcl|NC_013597. 159 VAVSIAYDETGNRFIVSANVAGEDKKTEIDYAIDEGGEGEYIGALLKLENGQASRKVGKNSVSLKKETLGEALFNVAEVN 238 (502) Q Consensus 159 ~~~tv~~~~~~~~f~~~s~ttG~~~~v~~~~a~~~~~t~t~~aa~l~~t~~~~~~~v~v~~~~~~~et~~~al~al~~~~ 238 (502) ....+. ..+ . ...+........+|..+.+ T Consensus 123 ~~~~~~-------p~i-----------l-------------------------------~ap~~~~~~v~~~l~~~~~-- 151 (392) T protein:vir:18 123 AVTGVK-------PRI-----------L-------------------------------GVPGLDTQEVATALASVCI-- 151 (392) T ss_pred hhhcee-------ehh-----------c-------------------------------ccCccchHHHHHHHHHHHh-- Confidence 000000 000 0 0000011111222222222 Q ss_pred CceeEEEEecCCChhHHHHHHHHHhhcCCEEEEEecC-chhcccchhHHHHHHHHccCCceEEEecCCccchHHHHHHHH Q lcl|NC_013597. 239 NTWYGFTVAAQLTDSEVEAAAKYAQANTKLFGANVIR-AEQIEWSADNIYKKLYDAGLDHTLAMFDKNDMYPVSSALARL 317 (502) Q Consensus 239 ~~w~~~~~~~~~~~~~~~a~a~w~~a~~~~~~~~~~~-~~~~~~~~~~i~~~l~~~~~~~t~~~y~~~~~~~~aa~~g~~ 317 (502) .+..+.+.+.....+..++.+|.+.-...+....+. ....+. .+.... ...+.+.+.|.. T Consensus 152 -~~~~~~~~d~~~~~~~~~a~~~~~~~~s~~~~~~~p~~~~~d~------------~~~~~~------~~p~s~~~AG~~ 212 (392) T protein:vir:18 152 -SLRAFGYVSAWGCKTISEAMAYRENFSQRELMVIWPDFLAWDT------------TANATA------TAYATARALGLR 212 (392) T ss_pred -hcCcEEEEecCCCCCHHHHHHHHhhccCceEEEEeCceeeecc------------cCCceE------EechHHHHHHHH Confidence 233344455433445555556766422111111110 000000 000000 112467778888 Q ss_pred HhcCCCCCCceeeEeeeecCccccC--------CCCHHHHHHHHhCCceEEEEEcCceEEecCEeecCe----ehhHHHH Q lcl|NC_013597. 318 LSTNFAANNSTLTLKFKQQPTITAD--------EITATEFAKAKRLGINVYTYFDDVAMIAEGTVIGGK----FADEIVI 385 (502) Q Consensus 318 as~n~~~~~g~~T~~fk~~~Gv~~~--------~lt~t~~~~l~~~~~n~y~~~~~~~~~~~G~~~~G~----~iD~~~~ 385 (502) +.+|.+..+ -.....|.+.||..- ..+..|++.|..+|+|.+....| ..+|.+++++++ ||-+.+- T Consensus 213 a~~d~~~g~-~~spaN~~l~gi~~~~~~~~~~~~~~~~~~~~Ln~~gI~t~~~~~G-~~~wG~rT~~~d~~~~~i~~rR~ 290 (392) T protein:vir:18 213 AYIDQTIGW-HKTLSNVGVQGVTGISASVFWDLQASGTDADLLNEAGVTTLVRKDG-FRFWGNRTCSDDPLFLFENYTRT 290 (392) T ss_pred HhhhccCCc-eEccCCceeeceeecceecccccCCCcchhhhhhhcCceEEEcCCC-EEEEcccccCCCcccceeehhhH Confidence 888754422 122345566665432 24678999999999999976444 578899999985 8889999 Q ss_pred HHHHHHHHHHHHHHHHHhcCCCCccCHHHHHHHHHHHHHHHHHHHHcCccccccccCccccccccccccccceEEEcCch Q lcl|NC_013597. 386 LDWFVDAVQKEVFARLYKSPTKIPLTDKGQAILIAAVEKVCLEGINNGAFAPGKWTGAGFGNLSTGDYLDKGFYVWAAPM 465 (502) Q Consensus 386 ~dwl~~~iq~~l~~~l~~~~~kiPyt~~G~~~l~~~v~~vl~~a~~~G~I~~g~~~~~~~g~~~~~~~~~~gy~v~~~~~ 465 (502) .+|++..|+..+...+. + |-++.-...|+..++.-|++.+++|.|. ||.+++. + T Consensus 291 ~~~i~~~i~~~~~~~v~----e-~n~~~~~~~i~~~i~~~L~~l~~~gal~--------------------g~~v~~d-~ 344 (392) T protein:vir:18 291 AQVLADTMAEAHMWAVD----K-PITASLIRDIVDGINAKFRELKSNGYIV--------------------DGECWFD-E 344 (392) T ss_pred HHHHHHHHHHHHHHhcc----C-CCCHHHHHHHHHHHHHHHHHHHhcCccc--------------------ceEEEEe-c Confidence 99999999999866543 3 7899999999999999999999999985 3667764 4 Q ss_pred hcCCHHHHhhcccCceEEEEEECceEEEEEEEEEEeC Q lcl|NC_013597. 466 DTLSDSDRQARRATPIQTAVKLAGAIHSSDVIVNYNR 502 (502) Q Consensus 466 ~~~s~~dra~R~~~~i~~~~~~aGaIh~v~i~~~v~~ 502 (502) +..+++|+.+.+. -+.+.+.....+++|.++...+. T Consensus 345 ~~nt~~~i~~G~~-~~~v~~~p~~p~e~I~~~~~~~~ 380 (392) T protein:vir:18 345 ESNDKETLKAGKL-YIDYDYTPVPPLESLTLRQRITD 380 (392) T ss_pred CCCCHHHhhCCeE-EEEEEEEecCCcceEEEEEEEch Confidence 6788999999988 58999999999999999999888 No 24 >protein:vir:5711 Length: 396 # NCBI annotation: gpFI # Family: family:all:115 # MgeID: mge:120 # MgeName: L-413C # Cross-refs: genbank:acc:NP_839870;genbank:gi:30065725;genbank:GeneID:1260618 Probab=98.93 E-value=1.3e-08 Score=63.84 Aligned_cols=369 Identities=14% Similarity=0.097 Sum_probs=205.6 Q ss_pred CCcCcCceeEEeecccccccccccccceEEEecccccc--cccCccceEEecCHHHHHhhcCCCcHHHHHHHHHhcCCCC Q lcl|NC_013597. 1 MALSISHIVNVQLNTVPKSAARKSFGIVALFTPEAGQA--FADEKTRYVYVENQRDVEQLFGTNSETAKAAQPFFAQSPR 78 (502) Q Consensus 1 Msip~s~iV~V~i~~~~~~~~~~~f~~~lil~~~~~~~--~~~~~~r~~~y~s~~~v~~~fg~~s~ey~aA~~~F~q~p~ 78 (502) |+-+.--+.=+.+.-.+.++...+.+.+.+++...... ..+..++++. ++..+....||.+...+.+-..+|.+... T Consensus 1 m~~~~~GV~v~e~~~g~~~i~~v~tav~~~vg~a~~~d~~~~~~~~pv~i-~s~~~~~~~~g~~~tl~~al~~~~~~~~~ 79 (396) T protein:vir:57 1 MSDYHHGVQVLEINDGTRVISTVSTAIVGMVCTASDADAETFPLNKPVLI-TNVQSAIAKAGKKGTLAASLQAIADQSKP 79 (396) T ss_pred CCCCCCceEEEEcCCCcccccccCCceEEEEEeccCCCcccccCccCeEe-ecchhhhhhcccccchHHHHHHhhhcCCc Confidence 99988766555666667788888888888887653211 1112244544 56667777788888777777777766433 Q ss_pred cceEEEEEeecccccceeeeeeccchhhhHHHHHhhcccceeEEEEecCccccccccccccccchhhHHHHHHhhhcccc Q lcl|NC_013597. 79 AKQLIVARWQKSASTIEATKNTLSGATLSDDLERFKSVVNGRFSLTIGGDVKKVDGLSFARLADFNAVATKIQEKLTTLS 158 (502) Q Consensus 79 P~~l~igr~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~iti~g~~~~~~~i~~s~~ts~~~vA~~i~aal~~a~ 158 (502) +.. +-+......... .... + ....-.+++... ....++ + .++. T Consensus 80 ~~~--vv~~~~~~~~~~-------~~~~--------a---~t~~~iiG~~~~---------~~~~tg----l-~al~--- 122 (396) T protein:vir:57 80 VTV--VVRVEDGTGDDE-------ETKL--------A---QTVSNIIGTTDE---------NGQYTG----L-KALM--- 122 (396) T ss_pred eeE--eeeccccccccc-------cccc--------c---ccceeeeeeccc---------cccchh----h-hhhh--- Confidence 222 111110000000 0000 0 000000000000 000000 0 0000 Q ss_pred cceeEEEecccceeeEeeecccccccceeeeeeccccchhhhhhhhhhcccccceeeeeccccccccCHHHHHHHHHhcc Q lcl|NC_013597. 159 VAVSIAYDETGNRFIVSANVAGEDKKTEIDYAIDEGGEGEYIGALLKLENGQASRKVGKNSVSLKKETLGEALFNVAEVN 238 (502) Q Consensus 159 ~~~tv~~~~~~~~f~~~s~ttG~~~~v~~~~a~~~~~t~t~~aa~l~~t~~~~~~~v~v~~~~~~~et~~~al~al~~~~ 238 (502) .....+... +.. +...+........+|..+.+. T Consensus 123 --------~~~~~~~~~------------------------------------p~i--~~ap~~~~~~v~~al~~~~~~- 155 (396) T protein:vir:57 123 --------GAESVTGVK------------------------------------PRI--LGVPGLDTKEVAVALASVCQE- 155 (396) T ss_pred --------hcccceeEE------------------------------------ecc--ccCcccchhHHHHHHHHHhhh- Confidence 000000000 000 001111112223333333332 Q ss_pred CceeEEEEecCCChhHHHHHHHHHhhcCCEEEEEecCchhcccchhHHHHHHHHccCCceEEEecCCccchHHHHHHHHH Q lcl|NC_013597. 239 NTWYGFTVAAQLTDSEVEAAAKYAQANTKLFGANVIRAEQIEWSADNIYKKLYDAGLDHTLAMFDKNDMYPVSSALARLL 318 (502) Q Consensus 239 ~~w~~~~~~~~~~~~~~~a~a~w~~a~~~~~~~~~~~~~~~~~~~~~i~~~l~~~~~~~t~~~y~~~~~~~~aa~~g~~a 318 (502) -..+.+.+.....+..++.+|-+.-+..+....+ +-..... .+.+....+ .+.+.+.|.++ T Consensus 156 --~~~~~~~d~p~~~~~~~~~~~~~~~~s~~~~~~~-p~~~~~d-----------~~~~~~~~~-----p~s~~~Ag~~a 216 (396) T protein:vir:57 156 --LNAFGYISAWGCKTISEVKAYRQNFSQRELMVIW-PDFLAWD-----------TVTSTTATA-----YATARALGLRA 216 (396) T ss_pred --CceEEEEcCCCCCCHHHHHHHHhccCCceEEEEc-ceeeeec-----------ccCCceeEE-----ehhHHHHHHHH Confidence 2344445543333445555666642211111111 0000000 001111111 24567778888 Q ss_pred hcCCCCCCceeeEeeeecCccccC--------CCCHHHHHHHHhCCceEEEEEcCceEEecCEeecCe----ehhHHHHH Q lcl|NC_013597. 319 STNFAANNSTLTLKFKQQPTITAD--------EITATEFAKAKRLGINVYTYFDDVAMIAEGTVIGGK----FADEIVIL 386 (502) Q Consensus 319 s~n~~~~~g~~T~~fk~~~Gv~~~--------~lt~t~~~~l~~~~~n~y~~~~~~~~~~~G~~~~G~----~iD~~~~~ 386 (502) .+|.... --.....|.+.||... ..+.+|++.|..+|+|......| ..++.+++++++ ||-+.+-. T Consensus 217 ~~d~~~g-~~~spaN~~l~gi~~~~~~~~~~~~~~~~~~~~Ln~~gi~t~~~~~G-~~~wG~rT~~~d~~~~~i~vrR~~ 294 (396) T protein:vir:57 217 KIDQEQG-WHKTLSNVGVNGVTGISASVFWDLQKPGTDADLLNEAGVTTLVRRDG-FRFWGNRTCSDDPLFLFESYTRTA 294 (396) T ss_pred HhhhccC-cEeccCCceeccccccceecccccCCcchhhhhhhhcCcEEEEcCCC-EEEEcccccCCCcccceeehhhHH Confidence 8775442 1223345677776532 23578999999999999976444 578899999985 78889999 Q ss_pred HHHHHHHHHHHHHHHHhcCCCCccCHHHHHHHHHHHHHHHHHHHHcCccccccccCccccccccccccccceEEEcCchh Q lcl|NC_013597. 387 DWFVDAVQKEVFARLYKSPTKIPLTDKGQAILIAAVEKVCLEGINNGAFAPGKWTGAGFGNLSTGDYLDKGFYVWAAPMD 466 (502) Q Consensus 387 dwl~~~iq~~l~~~l~~~~~kiPyt~~G~~~l~~~v~~vl~~a~~~G~I~~g~~~~~~~g~~~~~~~~~~gy~v~~~~~~ 466 (502) +|++..|+..+...+.. |-+..=...|+..|+.-|+..+++|.|. ||.+.+. ++ T Consensus 295 ~~i~~~i~~~~~~~v~e-----~n~~~~~~~i~~~i~~~l~~l~~~gal~--------------------g~~v~~d-~~ 348 (396) T protein:vir:57 295 QVLADTMAEAHMWAIDK-----PITATLIRDIIDGINAKFRELKNNGYIV--------------------DGTCWFS-EE 348 (396) T ss_pred HHHHHHHHHHHHHhccC-----CCCHHHHHHHHHHHHHHHHHHHhCCcee--------------------ceEEEEe-cC Confidence 99999999998765532 6788889999999999999999999985 3667775 46 Q ss_pred cCCHHHHhhcccCceEEEEEECceEEEEEEEEEEeC Q lcl|NC_013597. 467 TLSDSDRQARRATPIQTAVKLAGAIHSSDVIVNYNR 502 (502) Q Consensus 467 ~~s~~dra~R~~~~i~~~~~~aGaIh~v~i~~~v~~ 502 (502) ..+++++.+.+. -+.+.+.....+++|.++...+. T Consensus 349 ~n~~~~i~~G~~-~~~v~~~p~~p~e~I~~~~~~~~ 383 (396) T protein:vir:57 349 SNDAETLKAGKL-YIDYDYTPVPPLENLTLRQRITS 383 (396) T ss_pred CCCHHHhhCCeE-EEEEEEEecCCcceEEEEEEEch Confidence 788999998888 58999999999999999999888 No 25 >protein:vir:98553 Length: 395 # NCBI annotation: gp21 # Family: family:all:115 # MgeID: mge:1533 # MgeName: PSP3 # Cross-refs: genbank:acc:NP_958076;genbank:gi:41057373;genbank:GeneID:2744224 Probab=98.93 E-value=1.3e-08 Score=63.81 Aligned_cols=364 Identities=13% Similarity=0.089 Sum_probs=199.6 Q ss_pred CCcCcCceeEEeecccccccccccccceEEEecccccc--cccCccceEEecCHHHHHhhcCCCcHHHHHHHHHhcCCCC Q lcl|NC_013597. 1 MALSISHIVNVQLNTVPKSAARKSFGIVALFTPEAGQA--FADEKTRYVYVENQRDVEQLFGTNSETAKAAQPFFAQSPR 78 (502) Q Consensus 1 Msip~s~iV~V~i~~~~~~~~~~~f~~~lil~~~~~~~--~~~~~~r~~~y~s~~~v~~~fg~~s~ey~aA~~~F~q~p~ 78 (502) |+-++--+-=+.+.-.+.+......+.+.|+|...... ..+-.++++ .++..+....||.....+.+-..+|.+... T Consensus 1 m~~~~~GV~v~e~~~g~~~v~~v~tav~~~vgta~~~~~~~~p~~~pv~-v~s~~~~~~~~g~~~tl~~al~~~~~~~~~ 79 (395) T protein:vir:98 1 MSDFHHGTQVIEINDGTRVISTVATAVVGMVCTASDADATLFPLNEPVL-ITNVQSAIAKAGKKGTLAASLQAIADQSKP 79 (395) T ss_pred CCCCCCCeEEEEcCCCcccccccCcceEEEEeeccCCCccccccccceE-eechHHhHhhcccccchhhHHHHHhhccCc Confidence 99887655445555666677777777777777543211 111224444 366677777788877777777777776543 Q ss_pred cceEEEEEeecccccceeeeeeccchhhhHHHHHhhcccceeEEEEecCccccccccccccccchhhHHHHHHhhhcccc Q lcl|NC_013597. 79 AKQLIVARWQKSASTIEATKNTLSGATLSDDLERFKSVVNGRFSLTIGGDVKKVDGLSFARLADFNAVATKIQEKLTTLS 158 (502) Q Consensus 79 P~~l~igr~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~iti~g~~~~~~~i~~s~~ts~~~vA~~i~aal~~a~ 158 (502) +..+. +....... .. ...+ .+ T Consensus 80 ~~~vv--~~~~~~~~---------~~-------------~~~~-----------------a~------------------ 100 (395) T protein:vir:98 80 VTVVV--RVEDGTGD---------DE-------------EAAL-----------------AQ------------------ 100 (395) T ss_pred eEEEe--eccccccc---------cc-------------cccc-----------------cc------------------ Confidence 32211 11000000 00 0000 00 Q ss_pred cceeEEEecccceeeEeeecccccccceeeeeeccccchhhhhhhhhhcccccceeeeeccccccccCHHHHHHHHHhcc Q lcl|NC_013597. 159 VAVSIAYDETGNRFIVSANVAGEDKKTEIDYAIDEGGEGEYIGALLKLENGQASRKVGKNSVSLKKETLGEALFNVAEVN 238 (502) Q Consensus 159 ~~~tv~~~~~~~~f~~~s~ttG~~~~v~~~~a~~~~~t~t~~aa~l~~t~~~~~~~v~v~~~~~~~et~~~al~al~~~~ 238 (502) ..+ ...|.... ....+.+.++.......+..+--+...+-.......+|..+...- T Consensus 101 -----~~~----------~i~g~~~~---------~~~~Tgl~al~~~~~~~~~~p~il~ap~~~~~~v~~al~~~~~~~ 156 (395) T protein:vir:98 101 -----TVS----------NIIGGTDE---------NGKYTGIKALLTAQAVTGVKPRILGVPGLDTKEVAVALASAAIKL 156 (395) T ss_pred -----ccc----------cccccccc---------ccchhHHHHHhhhhhhhccchhhcccccccccHHHHHHHHHhhhc Confidence 000 00000000 000000111110000000000000111222223334444333332 Q ss_pred CceeEEEEecCCChhHHHHHHHHHhhcCCEEEEEecCchhcccchhHHHHHHHHccCCceEEEecCC-----ccchHHHH Q lcl|NC_013597. 239 NTWYGFTVAAQLTDSEVEAAAKYAQANTKLFGANVIRAEQIEWSADNIYKKLYDAGLDHTLAMFDKN-----DMYPVSSA 313 (502) Q Consensus 239 ~~w~~~~~~~~~~~~~~~a~a~w~~a~~~~~~~~~~~~~~~~~~~~~i~~~l~~~~~~~t~~~y~~~-----~~~~~aa~ 313 (502) . .+.+.+........++-+|.+.-...+....+. - +.++++. ...+.+.+ T Consensus 157 ~---~~~~~d~p~~~t~~~a~~~~~~~~s~~~~~~~p-~---------------------~~~~d~~~~~~~~~p~s~~~ 211 (395) T protein:vir:98 157 R---AFAYVSAWGCKTISEAMEYRKNFSQRELMVIWP-D---------------------FLAWDTVKNTTATAYATARA 211 (395) T ss_pred C---cEEEEEcCCCCCHHHHHHHHhccCCceEEEEec-c---------------------eeEecccCCceeeechHHHH Confidence 2 233333322223334445554311111111100 0 0011111 01256777 Q ss_pred HHHHHhcCCCCCCceeeEeeeecCccccC--------CCCHHHHHHHHhCCceEEEEEcCceEEecCEeecCe----ehh Q lcl|NC_013597. 314 LARLLSTNFAANNSTLTLKFKQQPTITAD--------EITATEFAKAKRLGINVYTYFDDVAMIAEGTVIGGK----FAD 381 (502) Q Consensus 314 ~g~~as~n~~~~~g~~T~~fk~~~Gv~~~--------~lt~t~~~~l~~~~~n~y~~~~~~~~~~~G~~~~G~----~iD 381 (502) .|.++.+|..+.. -.....|.+.||..- ..+.+|++.|..+|+|.+....| -.++.+++++++ ||- T Consensus 212 AG~~a~~d~~~g~-~~spaN~~i~gi~~~~~~~~~~~~~~~~~~~~Ln~~gI~~~~~~~G-~~~wG~rT~s~d~~~~~i~ 289 (395) T protein:vir:98 212 LGLRAYIDQTVGW-HKTLSNVGVQGVTGISASVFWDLQASGTDADLLNEAGVTTLVRKDG-FRFWGNRTCSDDPLFLFEN 289 (395) T ss_pred HHHHHHhhcccCc-EeccCCceeecccccceecccccCCCcchHHhhhhcCcEEEEcCCC-EEEEcccccCCCcccceee Confidence 8888887754421 112234555554322 24688999999999999966433 577899999984 788 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcCCCCccCHHHHHHHHHHHHHHHHHHHHcCccccccccCccccccccccccccceEEE Q lcl|NC_013597. 382 EIVILDWFVDAVQKEVFARLYKSPTKIPLTDKGQAILIAAVEKVCLEGINNGAFAPGKWTGAGFGNLSTGDYLDKGFYVW 461 (502) Q Consensus 382 ~~~~~dwl~~~iq~~l~~~l~~~~~kiPyt~~G~~~l~~~v~~vl~~a~~~G~I~~g~~~~~~~g~~~~~~~~~~gy~v~ 461 (502) +.+-.+|+...|+..+...+.. |-++.=...|+..|+.-|++.+++|.|. ||.+. T Consensus 290 ~rR~~~~i~~~i~~~~~~~v~e-----~~~~~~~~~i~~~i~~~L~~l~~~g~l~--------------------g~~v~ 344 (395) T protein:vir:98 290 YTRTAQVLADTMAEAHMWAVDK-----PITATLIRDIVDGINAKFRELKSNGYIV--------------------EGKCW 344 (395) T ss_pred hhhHHHHHHHHHHHHHHHhccC-----CCCHHHHHHHHHHHHHHHHHHHhCCcee--------------------ceEEE Confidence 8899999999999998765533 6788888999999999999999999985 46677 Q ss_pred cCchhcCCHHHHhhcccCceEEEEEECceEEEEEEEEEEeC Q lcl|NC_013597. 462 AAPMDTLSDSDRQARRATPIQTAVKLAGAIHSSDVIVNYNR 502 (502) Q Consensus 462 ~~~~~~~s~~dra~R~~~~i~~~~~~aGaIh~v~i~~~v~~ 502 (502) +. ++..+++|+.+.+. -+.+.+.....+++|+++...+. T Consensus 345 ~d-~~~nt~~~i~~G~~-~~~i~~~p~~p~e~I~~~~~~~~ 383 (395) T protein:vir:98 345 FD-EESNDKETLKAGKL-YIDYDYTPVPPLESLTLRQRITD 383 (395) T ss_pred Ee-cCCCCHHHhhCCeE-EEEEEEEecCCcceEEEEEEEch Confidence 75 36788999999998 58999999999999999999988 No 26 >protein:vir:78986 Length: 436 # NCBI annotation: putative sheath tail protein # Family: family:all:632 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110731;genbank:gi:134287348;genbank:GeneID:4955160 Probab=98.92 E-value=1.4e-08 Score=63.63 Aligned_cols=398 Identities=13% Similarity=0.152 Sum_probs=214.3 Q ss_pred CCc------CcCc-----eeEEeecccc-cccccccccceEEEecccccccccCccceEEecC--HHHHHhhcCCC--cH Q lcl|NC_013597. 1 MAL------SISH-----IVNVQLNTVP-KSAARKSFGIVALFTPEAGQAFADEKTRYVYVEN--QRDVEQLFGTN--SE 64 (502) Q Consensus 1 Msi------p~s~-----iV~V~i~~~~-~~~~~~~f~~~lil~~~~~~~~~~~~~r~~~y~s--~~~v~~~fg~~--s~ 64 (502) |++ -.+| ++|+...-.. .+.+.++.- .+.+. ..-++..+.+...++ ..++...||.+ .+ T Consensus 1 ~~magg~~~~~~K~~PG~Y~n~~~~~~~~~~~~~rGi~-a~p~~----~~wGp~~~v~~i~~~~~~~~~~~~~G~~~~~~ 75 (436) T protein:vir:78 1 MALGGGTFVTQNKVLPGSYINFVSATRATSSLSDRGIV-AMPLE----LDWGIDEEVFQVTSDDFEKYSTKYFGYDYTHE 75 (436) T ss_pred CcccceeeccceeecCceEEEEEecCcceeeccCCeEE-EEEEE----ecCCCCceeEEeecccchHHHHHHhcCccchH Confidence 443 1233 3555432222 234444432 22222 234555566666654 45788889974 44 Q ss_pred HHHHHHHHhcCCCCcceEEEEEeecccccceeeeeeccchhhhHHHHHhhcccceeEEEEecCccccccccccccccchh Q lcl|NC_013597. 65 TAKAAQPFFAQSPRAKQLIVARWQKSASTIEATKNTLSGATLSDDLERFKSVVNGRFSLTIGGDVKKVDGLSFARLADFN 144 (502) Q Consensus 65 ey~aA~~~F~q~p~P~~l~igr~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~iti~g~~~~~~~i~~s~~ts~~ 144 (502) ..+..+..| .+|++|++-|... +..+..+ + ..+.........++++|.......+..|+.....-. T Consensus 76 ~~~~l~~~~---~~~~tv~~yrl~~-G~~a~~~---v-------~~Aky~g~~gn~i~v~v~~~~~d~~~~dv~~~~g~~ 141 (436) T protein:vir:78 76 KLKGLRDLF---KNIRLGYFYKLNK-GVKASCS---I-------ATARCSGIRGNDLKVIVTTNIDDNAKFDVVTLLDNK 141 (436) T ss_pred HHHHHHHHh---cCCCEEEEEECCC-cceeeee---e-------eeeecCCCCCcEEEEEecccccccCceEEEEEecch Confidence 445566677 4688899999753 2221111 1 112233333336788876554444444443321111 Q ss_pred hHHHHHHhhhcccccceeEEEecccceeeEeeecccccccceeeeeeccccchhhhhhhhhhcccccceeeeeccccccc Q lcl|NC_013597. 145 AVATKIQEKLTTLSVAVSIAYDETGNRFIVSANVAGEDKKTEIDYAIDEGGEGEYIGALLKLENGQASRKVGKNSVSLKK 224 (502) Q Consensus 145 ~vA~~i~aal~~a~~~~tv~~~~~~~~f~~~s~ttG~~~~v~~~~a~~~~~t~t~~aa~l~~t~~~~~~~v~v~~~~~~~ 224 (502) .+=......+..-....-|.+ ...|....+. ...|+.+. ....... T Consensus 142 ~~d~~~~~~~~~l~~n~~V~~-----------~~~g~la~~a----------------~~~LtGG~-------dG~~~T~ 187 (436) T protein:vir:78 142 KVDTQIAKVITELQDNDYVTW-----------KKEATLEATA----------------GLTFTNGT-------NGEAVTG 187 (436) T ss_pred hhhhhhHHHHhhccCCceEEE-----------Eecccccccc----------------eeeeeccc-------cccccch Confidence 110011011110000011111 1111110000 00111110 0111235 Q ss_pred cCHHHHHHHHHhccCceeEEEEecCCChhHHHHHHHHHhhc----CCEEEEEecCchhcccchhHHHHHHHHccCCceEE Q lcl|NC_013597. 225 ETLGEALFNVAEVNNTWYGFTVAAQLTDSEVEAAAKYAQAN----TKLFGANVIRAEQIEWSADNIYKKLYDAGLDHTLA 300 (502) Q Consensus 225 et~~~al~al~~~~~~w~~~~~~~~~~~~~~~a~a~w~~a~----~~~~~~~~~~~~~~~~~~~~i~~~l~~~~~~~t~~ 300 (502) ++..++|+++.... |..+.++. .+.+.+..+..|+... .+.+-.........+ +.+.+- T Consensus 188 ~dy~~al~~le~~~--fn~l~~~~-~d~~~~~~~~a~ikr~re~~g~~~~aV~~~~~~~d--------------~EgIIn 250 (436) T protein:vir:78 188 TEYQAFLDKIESYS--FNALGCLA-TTAEIKSLFVEFTKRMRDKVGAKFQTVLYKKNDAD--------------YEGVVS 250 (436) T ss_pred HHHHHHHHHHcccc--eeEEEecC-CChHHHHHHHHHHHHHHhhcCCeEEEEecCCCCCC--------------CceEEE Confidence 67889999987775 55444444 4778889999999852 345544332211111 111111 Q ss_pred Eec---C---CccchHHHHHHHHHhcCCCCCCceeeEeeeecCccc-c-CCCCHHHHHHHHhCCceEEEEEcCceEEecC Q lcl|NC_013597. 301 MFD---K---NDMYPVSSALARLLSTNFAANNSTLTLKFKQQPTIT-A-DEITATEFAKAKRLGINVYTYFDDVAMIAEG 372 (502) Q Consensus 301 ~y~---~---~~~~~~aa~~g~~as~n~~~~~g~~T~~fk~~~Gv~-~-~~lt~t~~~~l~~~~~n~y~~~~~~~~~~~G 372 (502) +-. . +.....+.+.|..|+..++.. +-|+.++|+. . ..++.+|++.+.++|.-.+..-++.-.+.+| T Consensus 251 v~n~v~g~~~~~~~~~a~vAG~~Ag~~~~~S-----~T~~~~~~~~~v~~~~t~~e~~~ai~~G~lvl~~d~~~v~I~~~ 325 (436) T protein:vir:78 251 VENKIKDTGLLESSLIYWTTGAIAGCDINKS-----NTNKRYDGEFDVDVNYTQIHLEEALKTGKFIFHKVGDEVHVLED 325 (436) T ss_pred eecccCCceechhHHHHHHHHHHhcCccccC-----ccceecCccccccccCCHHHHHHHHhCCeEEEEEeCCeEEEEEc Confidence 111 0 011234555666666655433 3477888873 4 4689999999999999888766666666666 Q ss_pred Eee----c----Cee--hhHHHHHHHHHHHHHHHHHHHHHhcCCCCccCHHHHHHHHHHHHHHHHHHHHcCccccccccC Q lcl|NC_013597. 373 TVI----G----GKF--ADEIVILDWFVDAVQKEVFARLYKSPTKIPLTDKGQAILIAAVEKVCLEGINNGAFAPGKWTG 442 (502) Q Consensus 373 ~~~----~----G~~--iD~~~~~dwl~~~iq~~l~~~l~~~~~kiPyt~~G~~~l~~~v~~vl~~a~~~G~I~~g~~~~ 442 (502) ... + -+| |-.++..|-+.+.++..+ +.-+. +|+|=+..|..++.+.|+.-|++..+.|.|..-. .. T Consensus 326 VNTltt~~~~k~~~~~kI~vir~~D~i~~di~~~~-~~~yi--GKv~N~~dgr~~l~~~i~~yl~~L~~~g~I~~f~-~~ 401 (436) T protein:vir:78 326 INTFVSFTDEKNDDFSSNQSVRVLDQIANDIATLF-NTKYL--GEVPNDKSGRISFWNDVVKHHEQLQNMRAIEDFK-AD 401 (436) T ss_pred cccceecCCCCCcchhhhhHHHHHHHHHHHHHHHh-hhccc--cccCCCHHHHHHHHHHHHHHHHHHHhCCcccCCC-Cc Confidence 533 1 123 778888899998887664 43333 6999999999999999999999999999995311 00 Q ss_pred ccccccccccccccceEEEcCchhcCCHHHHhhcccCceEEEEEECceEEEEEEEEEEe Q lcl|NC_013597. 443 AGFGNLSTGDYLDKGFYVWAAPMDTLSDSDRQARRATPIQTAVKLAGAIHSSDVIVNYN 501 (502) Q Consensus 443 ~~~g~~~~~~~~~~gy~v~~~~~~~~s~~dra~R~~~~i~~~~~~aGaIh~v~i~~~v~ 501 (502) .+.+. +. ..+..--+++..+.-.|+.++.++++|. T Consensus 402 ----------------Dv~v~------~~--~~~~~v~v~~~v~pvdamekiy~ti~v~ 436 (436) T protein:vir:78 402 ----------------DVSVE------PG--SDKKTVVVSDAVKVISAMSKLYMTVSVS 436 (436) T ss_pred ----------------ceEEe------ec--CCCCEEEEEEEEEEEEeeeeEEEEEEEC Confidence 11111 11 1122334888899999999999999999 No 27 >protein:vir:2035 Length: 396 # NCBI annotation: gpFI # Family: family:all:115 # MgeID: mge:315 # MgeName: P2 # Cross-refs: genbank:acc:NP_046778;genbank:gi:9630349;genbank:GeneID:1261516 Probab=98.87 E-value=2.4e-08 Score=62.37 Aligned_cols=368 Identities=13% Similarity=0.074 Sum_probs=202.2 Q ss_pred CCcCcCceeEEeecccccccccccccceEEEecccccc--cccCccceEEecCHHHHHhhcCCCcHHHHHHHHHhcCCCC Q lcl|NC_013597. 1 MALSISHIVNVQLNTVPKSAARKSFGIVALFTPEAGQA--FADEKTRYVYVENQRDVEQLFGTNSETAKAAQPFFAQSPR 78 (502) Q Consensus 1 Msip~s~iV~V~i~~~~~~~~~~~f~~~lil~~~~~~~--~~~~~~r~~~y~s~~~v~~~fg~~s~ey~aA~~~F~q~p~ 78 (502) |+-+.--+-=+.+.-.+.++.....+.+.++|...... ..+..+.+ ..++..+....||.....+.+...+|.+... T Consensus 1 m~~~~~GV~v~e~~~g~~~i~~v~tav~~~vg~a~~a~~~~~~l~~pv-lvts~~~~~~~~g~~~tL~~al~~~~~ngg~ 79 (396) T protein:vir:20 1 MSDYHHGVQVLEINEGTRVISTVSTAIVGMVCTASDADAETFPLNKPV-LITNVQSAISKAGKKGTLAASLQAIADQSKP 79 (396) T ss_pred CCCCCCCeEEEEcCCCcceeeecCCceeEEEeeeccCCCccccCccCE-EeechHHHHhhcccccchhhhhhhhhccCce Confidence 99876544444455556677777777777887543221 11112334 4467778788899888888877777765422 Q ss_pred cceEEEEEeecccccceeeeeeccchhhhHHHHHhhcccceeEEEEecCccccccccccccccchhhHHHHHHhhhcccc Q lcl|NC_013597. 79 AKQLIVARWQKSASTIEATKNTLSGATLSDDLERFKSVVNGRFSLTIGGDVKKVDGLSFARLADFNAVATKIQEKLTTLS 158 (502) Q Consensus 79 P~~l~igr~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~iti~g~~~~~~~i~~s~~ts~~~vA~~i~aal~~a~ 158 (502) .. ++-+..... ..... .. . ..+ ..++.......+-.+.+. ++... T Consensus 80 ~~--~v~~~~~~~---------~~~~~--~~-------------~-----a~t--~~~~~~~~~~~~~~tg~~-al~~~- 124 (396) T protein:vir:20 80 VT--VVMRVEDGT---------GDDEE--TK-------------L-----AQT--VSNIIGTTDENGQYTGLK-AMLAA- 124 (396) T ss_pred eE--EEEeccccc---------ccccc--cc-------------c-----ccc--ccccccccccccccchhh-hhhhh- Confidence 21 111110000 00000 00 0 000 000000000000000000 00000 Q ss_pred cceeEEEecccceeeEeeecccccccceeeeeeccccchhhhhhhhhhcccccceeeeeccccccccCHHHHHHHHHhcc Q lcl|NC_013597. 159 VAVSIAYDETGNRFIVSANVAGEDKKTEIDYAIDEGGEGEYIGALLKLENGQASRKVGKNSVSLKKETLGEALFNVAEVN 238 (502) Q Consensus 159 ~~~tv~~~~~~~~f~~~s~ttG~~~~v~~~~a~~~~~t~t~~aa~l~~t~~~~~~~v~v~~~~~~~et~~~al~al~~~~ 238 (502) . ...+ ..+.-+ ...+........+|..+.+.. T Consensus 125 ------~-----------~~~~--------------~~p~i~-----------------~ap~~~~~~v~~al~~~~~~~ 156 (396) T protein:vir:20 125 ------E-----------SVTG--------------VKPRIL-----------------GVPGLDTKEVAVALASVCQKL 156 (396) T ss_pred ------c-----------cccc--------------cchhhh-----------------hhhhhccHHHHHHHHHHHhcC Confidence 0 0000 000000 001111122334444444433 Q ss_pred CceeEEEEecCCChhHHHHHHHHHhhcCCEEEEEecC-chhcccchhHHHHHHHHccCCceEEEecCCccchHHHHHHHH Q lcl|NC_013597. 239 NTWYGFTVAAQLTDSEVEAAAKYAQANTKLFGANVIR-AEQIEWSADNIYKKLYDAGLDHTLAMFDKNDMYPVSSALARL 317 (502) Q Consensus 239 ~~w~~~~~~~~~~~~~~~a~a~w~~a~~~~~~~~~~~-~~~~~~~~~~i~~~l~~~~~~~t~~~y~~~~~~~~aa~~g~~ 317 (502) .. +.+.+.....+..++.+|.+.-+..+....+. ....+ ...+.... ..+.+.+.|.+ T Consensus 157 ~~---~~~iD~p~~~~~~~a~~~r~~~~s~~~~~~~P~~~~~d-------------~~~~~~~~-----~p~s~~~Ag~~ 215 (396) T protein:vir:20 157 RA---FGYISAWGCKTISEVKAYRQNFSQRELMVIWPDFLAWD-------------TVTSTTAT-----AYATARALGLR 215 (396) T ss_pred Cc---EEEEecCCCCCHHHHHHHhhCCCCceEEEEcCcccccc-------------CcCCccee-----echhHHHHHHH Confidence 33 33344433334445556665422222211111 00000 00001111 12456777888 Q ss_pred HhcCCCCCCceeeEeeeecCccccC--------CCCHHHHHHHHhCCceEEEEEcCceEEecCEeecCe----ehhHHHH Q lcl|NC_013597. 318 LSTNFAANNSTLTLKFKQQPTITAD--------EITATEFAKAKRLGINVYTYFDDVAMIAEGTVIGGK----FADEIVI 385 (502) Q Consensus 318 as~n~~~~~g~~T~~fk~~~Gv~~~--------~lt~t~~~~l~~~~~n~y~~~~~~~~~~~G~~~~G~----~iD~~~~ 385 (502) +.+|..+.. -.....|.+.||... ..+++|++.|..+|+|......| ..++.+++++++ ||-+.+- T Consensus 216 a~~d~~~g~-~~spaN~~l~gi~~~~~~~~~~~~~~~~~~~~Ln~~gi~~~~~~~G-~~~wG~rT~s~d~~~~~i~~rR~ 293 (396) T protein:vir:20 216 AKIDQEQGW-HKTLSNVGVNGVTGISASVFWDLQESGTDADLLNESGVTTLIRRDG-FRFWGNRTCSDDPLFLFENYTRT 293 (396) T ss_pred HHhhhhcCc-EeccCCceeccceecceecccccCCCcchhhhhhhcCcEEEEcCCC-EEEEcccccCCCcccceeehhhH Confidence 887764421 123345566665432 25678999999999999966443 578899999985 7888899 Q ss_pred HHHHHHHHHHHHHHHHHhcCCCCccCHHHHHHHHHHHHHHHHHHHHcCccccccccCccccccccccccccceEEEcCch Q lcl|NC_013597. 386 LDWFVDAVQKEVFARLYKSPTKIPLTDKGQAILIAAVEKVCLEGINNGAFAPGKWTGAGFGNLSTGDYLDKGFYVWAAPM 465 (502) Q Consensus 386 ~dwl~~~iq~~l~~~l~~~~~kiPyt~~G~~~l~~~v~~vl~~a~~~G~I~~g~~~~~~~g~~~~~~~~~~gy~v~~~~~ 465 (502) .+|+...|+..+...+.. |-+..=...|+..++.-|++..+.|.|. ||.+.+. + T Consensus 294 ~~~i~~~~~~~~~~~v~e-----~~~~~~~~~i~~~i~~~L~~l~~~G~l~--------------------g~~v~~d-~ 347 (396) T protein:vir:20 294 AQVVADTMAEAHMWAVDK-----PITATLIRDIVDGINAKFRELKTNGYIV--------------------DATCWFS-E 347 (396) T ss_pred HHHHHHHHHHHHHHhccC-----CCCHHHHHHHHHHHHHHHHHHHhCccee--------------------ceEEEEe-c Confidence 999999999998765532 6788889999999999999999999985 4677775 5 Q ss_pred hcCCHHHHhhcccCceEEEEEECceEEEEEEEEEEeC Q lcl|NC_013597. 466 DTLSDSDRQARRATPIQTAVKLAGAIHSSDVIVNYNR 502 (502) Q Consensus 466 ~~~s~~dra~R~~~~i~~~~~~aGaIh~v~i~~~v~~ 502 (502) ++.|++|+.+.+. -+.+.+.....+++|.++...+. T Consensus 348 ~~nt~~~i~~G~~-~~~i~~~p~~p~e~i~~~~~~~~ 383 (396) T protein:vir:20 348 ESNDAETLKAGKL-YIDYDYTPVPPLENLTLRQRITD 383 (396) T ss_pred CCCCHHHhhCCEE-EEEEEEEecCCcceEEEEEEEch Confidence 7888999999998 58999999999999999998888 No 28 >protein:vir:96586 Length: 587 # NCBI annotation: ORF011 # Family: family:all:2449 # MgeID: mge:1623 # MgeName: Twort # Cross-refs: genbank:acc:YP_238553;genbank:gi:66391266;genbank:GeneID:5130368 Probab=98.79 E-value=4.7e-08 Score=60.74 Aligned_cols=453 Identities=13% Similarity=0.090 Sum_probs=220.6 Q ss_pred CCcCc---Cce----eEEeecccccc-cccccccceEEEecccccccccCccceEEecCHHHHHhhcCCCcHHHHHHHHH Q lcl|NC_013597. 1 MALSI---SHI----VNVQLNTVPKS-AARKSFGIVALFTPEAGQAFADEKTRYVYVENQRDVEQLFGTNSETAKAAQPF 72 (502) Q Consensus 1 Msip~---s~i----V~V~i~~~~~~-~~~~~f~~~lil~~~~~~~~~~~~~r~~~y~s~~~v~~~fg~~s~ey~aA~~~ 72 (502) |+|.+ .++ |.|.+.-+... ....+.+.+.|+|....-|+ .++..+++.++..+.||.. +.+.|..+. T Consensus 1 ~~~~~~~~~~~~~Pgv~~~~~~~~~~~~~~~~~~~~~~iG~a~~g~~----~~~~~~~~~~~~~~~~g~G-~l~~ai~~a 75 (587) T protein:vir:96 1 MAKDIFPRRPIQRPHASIEVDSSGIGGSASNSEKILCLIGKAEGGEP----NTVYQVRNYAQAKSVFRSG-ELLDAIELA 75 (587) T ss_pred CeeeeeCCCcccCCceEEEEecCCccCCCCCCCceEEEEEEecCCCC----ceeEEEcChHHHHHhhcCC-cHHHHHHHH Confidence 99866 555 44444444332 44455677888888776655 3445578888889999887 477888888 Q ss_pred hcCCC--CcceEEEEEeecccccceeeeeeccc---------hhhhHHHH--------Hhhc--ccce----------eE Q lcl|NC_013597. 73 FAQSP--RAKQLIVARWQKSASTIEATKNTLSG---------ATLSDDLE--------RFKS--VVNG----------RF 121 (502) Q Consensus 73 F~q~p--~P~~l~igr~~~~~~~~~~~~~~~~~---------~~~~~~~~--------~~~~--~~~g----------~~ 121 (502) |...+ ...+++.=| +..+..+.++.+.+.- -.+.-.++ .+.. ..++ -+ T Consensus 76 ~~~~~~~g~~~~~a~r-v~~~~~a~~~~~~~~~~~~~~g~~~n~i~v~~~~~~~~~~~~~~~~~~~~~~~~~~~n~G~v~ 154 (587) T protein:vir:96 76 WGSNPQYTAGKILAMR-VEDAKASQLEKGGLRVTSKIFGSVSNDIQVALEKNTITDSLRLRVVFQKDNYQEVFDNLGNIF 154 (587) T ss_pred hccCcCCCceEEEEEe-cCCCccceeecccccccccccCCCCceEEEEEEeccCCCccceEEEEecCCceeeccccCceE Confidence 86433 234444323 2333333222221110 00000000 0100 0001 12 Q ss_pred EEEecCcccc-----------cccccc--cc----------ccchhhHHHHHHhhhcccccceeEEEe-cccceeeEeee Q lcl|NC_013597. 122 SLTIGGDVKK-----------VDGLSF--AR----------LADFNAVATKIQEKLTTLSVAVSIAYD-ETGNRFIVSAN 177 (502) Q Consensus 122 ~iti~g~~~~-----------~~~i~~--s~----------~ts~~~vA~~i~aal~~a~~~~tv~~~-~~~~~f~~~s~ 177 (502) +|...|+... ..+..+ .. .......+..+...+..- ...+..|- ..++.+.+... T Consensus 155 ~i~y~g~~~~a~~~~~~~~~~~~A~~l~l~gg~~~v~~yrl~~g~~~~~~~~~~~~~~~-~~~tAky~g~~~n~~~v~v~ 233 (587) T protein:vir:96 155 SINYKGEGEKATFSVEKDKETQEAKRLVLKVDEKEVKAYELNGGAYSFTNEIITDINEL-PDFEAKLSPFGDKNLESRKL 233 (587) T ss_pred EEEecccccceeEeeccCcccceeeeeEEEecCceEEEEEeCCCchhhhhhhhhhhccc-cceEEEeecccCceeEEEee Confidence 2322222111 000000 00 000001111111111100 00000110 00111111100 Q ss_pred --ccc-ccccceee------------------eeecccc---chhh-----h--hhhhhhcccc------cceeeeeccc Q lcl|NC_013597. 178 --VAG-EDKKTEID------------------YAIDEGG---EGEY-----I--GALLKLENGQ------ASRKVGKNSV 220 (502) Q Consensus 178 --ttG-~~~~v~~~------------------~a~~~~~---t~t~-----~--aa~l~~t~~~------~~~~v~v~~~ 220 (502) ... +.....+- ....... ...+ . .......... .-..+.-... T Consensus 234 d~~~~~~~k~~~~y~~t~~~di~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~aLtGG~d 313 (587) T protein:vir:96 234 DEATDVDIKGKAVYVKAVFGDIENQTQYNQYVKFEQLPEQASEPSDVEVHAETESATVTATSKPKAIEPFELTKLSGGTN 313 (587) T ss_pred ccccccccceEEEeehhhhhhhhhhhccccceeeccccchhhhhhcccccccccceeeeecccccccccccceeeecCCC Confidence 000 00000000 0000000 0000 0 0000000000 0001111122 Q ss_pred cccccCHHHHHHHHHhccCceeEEEEecCCChhHHHHHHHHHhhc----CCEEEEEecCchhcccchhHHHHHHHHccCC Q lcl|NC_013597. 221 SLKKETLGEALFNVAEVNNTWYGFTVAAQLTDSEVEAAAKYAQAN----TKLFGANVIRAEQIEWSADNIYKKLYDAGLD 296 (502) Q Consensus 221 ~~~~et~~~al~al~~~~~~w~~~~~~~~~~~~~~~a~a~w~~a~----~~~~~~~~~~~~~~~~~~~~i~~~l~~~~~~ 296 (502) |...++..++++++..+ +|+.+.+ ...+.+.+..+.+|++.. +++..+..... ......+....+..++. T Consensus 314 G~~~~~y~~~l~ale~~--~~~~i~~-~t~d~ai~~~l~a~vk~~r~~gk~~~aVlg~~~---~~~~~~~~~~a~~~n~e 387 (587) T protein:vir:96 314 GEPPTSWSAKLEKFKNE--GGYYIVP-LTDRQSVHSEVATFVKNRSDAGEPMRAIVGGGT---SETKEKLFGRQAILNNP 387 (587) T ss_pred CCCcccHHHHHHHHhhC--CcEEEEe-cCCCHHHHHHHHHHHHHHHhCCCeEEEEecCCC---CCCHHHHHHHHhhcCCC Confidence 33345678899998876 4554443 333456677899999752 23444332221 12334445556677888 Q ss_pred ceEEEecC------C------ccc-hHHHHHHHHHhcCCCCCCceeeEeeeecCccc-cCCCCHHHHHHHHhCCceEEEE Q lcl|NC_013597. 297 HTLAMFDK------N------DMY-PVSSALARLLSTNFAANNSTLTLKFKQQPTIT-ADEITATEFAKAKRLGINVYTY 362 (502) Q Consensus 297 ~t~~~y~~------~------~~~-~~aa~~g~~as~n~~~~~g~~T~~fk~~~Gv~-~~~lt~t~~~~l~~~~~n~y~~ 362 (502) |.+.+.+. + ..| ..+.+.|..++.+++.. +| ||.++++. ...++.+|++.+.++|+..+.. T Consensus 388 ~vi~v~~~~~~~~~~~~~~~~~~~~~aa~vAG~~Ag~~~~~S---~T--~~~~~~~~v~~~~t~~e~~~~i~~G~~~l~~ 462 (587) T protein:vir:96 388 RVALVANSGKFVMGNGRILQAPAYMVASAVAGLVSGLDIGES---IT--FKPLFVNSLDKVYESEELDELNENGIITIEF 462 (587) T ss_pred cEEEEecceEEecCCCceeeechhhHHHHHHHHHhcCccccC---cc--ceeeecccccccCCHHHHHHHHhCCeEEEEE Confidence 87766542 1 112 34556677777766543 33 44554332 2368999999999999999987 Q ss_pred EcCce-E---EecCEeecC-----ee--hhHHHHHHHHHHHHHHHHHHHHHhcCCCCccCHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013597. 363 FDDVA-M---IAEGTVIGG-----KF--ADEIVILDWFVDAVQKEVFARLYKSPTKIPLTDKGQAILIAAVEKVCLEGIN 431 (502) Q Consensus 363 ~~~~~-~---~~~G~~~~G-----~~--iD~~~~~dwl~~~iq~~l~~~l~~~~~kiPyt~~G~~~l~~~v~~vl~~a~~ 431 (502) ..+.. . .-++.+.-. .| |-.++-.|.+...++..+-+. +- +| |=++.|...|++.|++.|++..+ T Consensus 463 ~~~~~~~v~~~vnsitT~t~~~~~~~~~i~virv~D~i~~di~~~~~~~-yi--Gk-~nn~~~r~~v~~~i~~~L~~l~~ 538 (587) T protein:vir:96 463 VRNRMTTMFRIVDDVTTFPDKNDPVKSEMALGEANDFLVSELKILLEEQ-YI--GT-RTINTSASQIKDFVQSYLGRKKR 538 (587) T ss_pred ecCCcEEEEEeeccceecCCCCCchhhhhhhHHHHHHHHHHHHHHHHhc-CC--cc-ccCHHHHHHHHHHHHHHHHHHHh Confidence 65432 1 223443322 24 668888999999998775333 33 56 67889999999999999999999 Q ss_pred cCccccccccCccccccccccccccceEEEcCchhcCCHHHHhhcccCceEEEEEECceEEEEEEEEEEeC Q lcl|NC_013597. 432 NGAFAPGKWTGAGFGNLSTGDYLDKGFYVWAAPMDTLSDSDRQARRATPIQTAVKLAGAIHSSDVIVNYNR 502 (502) Q Consensus 432 ~G~I~~g~~~~~~~g~~~~~~~~~~gy~v~~~~~~~~s~~dra~R~~~~i~~~~~~aGaIh~v~i~~~v~~ 502 (502) .|.|.-. .. + + ..+.. .+|+ --+++.++..-++++|.+++++.+ T Consensus 539 ~g~I~~~-~~-~--------d-----v~v~~-------~~D~-----~~v~~~v~Pv~~mekIy~tv~~~~ 582 (587) T protein:vir:96 539 DNEIQDF-PP-E--------D-----VQVII-------EGNE-----ARISLTIFPIRALKKISVSLVYRQ 582 (587) T ss_pred CCcccCC-Cc-c--------c-----eEEEe-------cCCE-----EEEEEEEEEcccceEEEEEEEEEe Confidence 9999521 10 0 0 11211 1222 248999999999999999999988 No 29 >protein:vir:80488 Length: 562 # NCBI annotation: Tsh # Family: family:all:2449 # MgeID: mge:1883 # MgeName: A511 # Cross-refs: genbank:acc:YP_001468473;genbank:gi:157325048;genbank:GeneID:5601446 Probab=98.76 E-value=6e-08 Score=60.18 Aligned_cols=449 Identities=12% Similarity=0.084 Sum_probs=219.6 Q ss_pred CCcCc---Cce----eEEeeccc-ccccccccccceEEEecccccccccCccceEEecCHHHHHhhcCCCcHHHHHHHHH Q lcl|NC_013597. 1 MALSI---SHI----VNVQLNTV-PKSAARKSFGIVALFTPEAGQAFADEKTRYVYVENQRDVEQLFGTNSETAKAAQPF 72 (502) Q Consensus 1 Msip~---s~i----V~V~i~~~-~~~~~~~~f~~~lil~~~~~~~~~~~~~r~~~y~s~~~v~~~fg~~s~ey~aA~~~ 72 (502) |++.+ .++ |-|.+.-+ ..+....+.+.+.|+|....-|+ .++..+++.++...-||... .-.+.... T Consensus 1 ~~~~~~~~~~~~~pgv~~~~~~s~~~~~~~~~~~~~~~ig~a~~G~~----~~~~~~~~~~~~~~~f~~g~-l~~~i~~a 75 (562) T protein:vir:80 1 MAIEIYPRKPVSRPHTEISVDTSGIGGSSSGSEKILCLVGSATGGKP----NAVYKVRNYSQAKSVFRSGE-LLDAIERA 75 (562) T ss_pred CeeeeeCCCcccCCceEEEEecCCcccCCCCCCceEEEEEEeCCCCc----ceeEEEccHHHHHHHhcCCC-hHHHHHHh Confidence 88754 222 23333322 23455666777888888766554 45677788889899998754 44556777 Q ss_pred hcCCC--CcceEEEEEeecccccceeeeeecc------------------chhhh--HHH----------HHhhccccee Q lcl|NC_013597. 73 FAQSP--RAKQLIVARWQKSASTIEATKNTLS------------------GATLS--DDL----------ERFKSVVNGR 120 (502) Q Consensus 73 F~q~p--~P~~l~igr~~~~~~~~~~~~~~~~------------------~~~~~--~~~----------~~~~~~~~g~ 120 (502) |..++ .-.++|+=|- ..+..+.++.+.+. ..++. ..+ +.+..+ .-. T Consensus 76 ~~~~~~~g~~~~~~~rv-~~a~~a~~~~~~~~~~~~~~g~~~n~i~v~~~~~~~~~~~~~~v~~~~~~~~ev~~~~-g~v 153 (562) T protein:vir:80 76 WNPGEGTGAGDILAMRV-EEAKEATFEAEGVKVSSTIYGADANDIQVALEDNTITGTKRLSIVFAKERVNQVYDNL-GSI 153 (562) T ss_pred cccccccCceEEEEEEc-CCCCcceEEecceEEEEeecccCCCceEEEEecCCCCCCcceEEEecCCcceEEeecc-Cce Confidence 75433 1234554443 22333333332111 00000 000 000000 001 Q ss_pred EEE-------------------------EecCccccccccccccccchhhHHHHHHhhhcccccceeEEEec-ccceeeE Q lcl|NC_013597. 121 FSL-------------------------TIGGDVKKVDGLSFARLADFNAVATKIQEKLTTLSVAVSIAYDE-TGNRFIV 174 (502) Q Consensus 121 ~~i-------------------------ti~g~~~~~~~i~~s~~ts~~~vA~~i~aal~~a~~~~tv~~~~-~~~~f~~ 174 (502) |+| .+.+..+++..+.+... ....+..+..++..- ...+..|.. .++.+.+ T Consensus 154 ~~i~y~g~~~~a~~~i~~~~~~~~a~~l~~~~g~~~v~~~~l~~g--~~~~~~~l~~~i~~~-~~~tAky~g~~~n~i~~ 230 (562) T protein:vir:80 154 FSIKYKGTEASATFTVAVDPVTFKATKLTLKAGDKTVKEYDLGSG--AYAETNVLISDINNL-PDFEAKFFPIGDKNLTT 230 (562) T ss_pred eeeeeccccccceeEEEecCccceEEEEEEecCCcceeEEEeCCC--ccchhhhhhhhhccc-cceEEEecccCCceeee Confidence 222 11111111111111111 001111222222210 001111100 0111100 Q ss_pred eeec--c-cccccceeeeeeccccch---hhhhhhhhhcc---c----ccceeeeeccccccccCHHHHHHHHHhccCce Q lcl|NC_013597. 175 SANV--A-GEDKKTEIDYAIDEGGEG---EYIGALLKLEN---G----QASRKVGKNSVSLKKETLGEALFNVAEVNNTW 241 (502) Q Consensus 175 ~s~t--t-G~~~~v~~~~a~~~~~t~---t~~aa~l~~t~---~----~~~~~v~v~~~~~~~et~~~al~al~~~~~~w 241 (502) .... + .+..+. ..+......+- ........+.. . .....+.-...|...++..+++++|... +| T Consensus 231 ~~~d~~~~~~~kt~-~~~v~~~~~d~~~~~~~n~~v~~~~~~~~~la~~~~~~LtGG~dG~~~~~~~dal~~Le~~--~~ 307 (562) T protein:vir:80 231 DNFDAQIDVDIKTK-EAYVKAVGGDIEKQTAYNGYVEFEFDRSKEIANFPLTKLTGGDNGTIPESWADKFSYFANE--GG 307 (562) T ss_pred cccccchhhhcccc-eeeeeehhhhhhhcccccceEEEEeccCccccccceeeeeCCCCCCccccHHHHHHHHHhC--Cc Confidence 0000 0 000000 00000000000 00000000000 0 0111222222333445678899998875 56 Q ss_pred eEEEEecCCChhHHHHHHHHHhhc---C-CEEEEEecCchhcccchhHHHHHHHHccCCceEEEecCC---------ccc Q lcl|NC_013597. 242 YGFTVAAQLTDSEVEAAAKYAQAN---T-KLFGANVIRAEQIEWSADNIYKKLYDAGLDHTLAMFDKN---------DMY 308 (502) Q Consensus 242 ~~~~~~~~~~~~~~~a~a~w~~a~---~-~~~~~~~~~~~~~~~~~~~i~~~l~~~~~~~t~~~y~~~---------~~~ 308 (502) +.+.+. ..+.+.+.++..|++.. . ++..+..... ......+....+..++.|.+.+..+- ..| T Consensus 308 ~~i~~~-t~d~ai~~~~~a~vkr~r~~g~~~~aVvg~~~---~~~~~~~~~~a~~~n~e~vv~v~~~~~~~~~~~~~~~~ 383 (562) T protein:vir:80 308 YYLVPL-TSKQAVHAEALQFVRDCSYNGNPMRVFVGGGI---GESMEQLFTRAIGLQNERAGLIGFSGTVKMDDGRSLKM 383 (562) T ss_pred EEEEec-CCChHHHHHHHHHHHHHHhCCCeEEEEecCCC---CCCHHHHHHHhhhcCCCeEEEEecCeeEECCCCceeee Confidence 655433 33456678899999752 2 3433332221 11233444555667888887765420 122 Q ss_pred ----hHHHHHHHHHhcCCCCCCceeeEeeeecCcccc-CCCCHHHHHHHHhCCceEEEEEcCceE----EecCEeecC-- Q lcl|NC_013597. 309 ----PVSSALARLLSTNFAANNSTLTLKFKQQPTITA-DEITATEFAKAKRLGINVYTYFDDVAM----IAEGTVIGG-- 377 (502) Q Consensus 309 ----~~aa~~g~~as~n~~~~~g~~T~~fk~~~Gv~~-~~lt~t~~~~l~~~~~n~y~~~~~~~~----~~~G~~~~G-- 377 (502) ..+.+.|..+..+++. + .-||.++++.. ..++.+|++.+..+|++.+....+... .-++.+.-+ T Consensus 384 ~~~~~aa~vAGl~Ag~~~~~---S--~T~~~i~~~~v~~~lt~~e~~~li~~G~l~l~~~~~~~v~~~riv~~itT~t~~ 458 (562) T protein:vir:80 384 PGYMFAAQVAGLTCGLEIGE---A--ITFKNIAIETLDTIYEGSQLDQLNESGIITAEFVRNRAVTNFRIVDDVTTFNDK 458 (562) T ss_pred chhHHHHHHHHHHhcCcccc---C--ccceeeccccccccCCHHHHHHHHhCCeEEEEEecCCcEEEEEeeccceeccCC Confidence 3446667777766543 2 34456665432 368999999999999999987654321 223433322 Q ss_pred ---ee--hhHHHHHHHHHHHHHHHHHHHHHhcCCCCccCHHHHHHHHHHHHHHHHHHHHcCccccccccCcccccccccc Q lcl|NC_013597. 378 ---KF--ADEIVILDWFVDAVQKEVFARLYKSPTKIPLTDKGQAILIAAVEKVCLEGINNGAFAPGKWTGAGFGNLSTGD 452 (502) Q Consensus 378 ---~~--iD~~~~~dwl~~~iq~~l~~~l~~~~~kiPyt~~G~~~l~~~v~~vl~~a~~~G~I~~g~~~~~~~g~~~~~~ 452 (502) .| |-.++-.|.+.+.++..+-+. +- +| |=++.|...|++.+++.|++..+.|.|.-. ... + T Consensus 459 ~~~~~~ki~viRv~D~i~~dir~~~~~~-yI--Gk-~Nn~~~r~~v~~~i~~~L~~l~~~gaI~~~-~~~---------d 524 (562) T protein:vir:80 459 TDPVKSEIGVGEANDFLVSELKISLDNE-YI--GT-KIIDTSASLVKNFVQSFLDRKKLAKEIQDY-SPE---------E 524 (562) T ss_pred CCchhhhhhhhHHHHHHHHHHHHHHHhc-CC--cc-ccChHHHHHHHHHHHHHHHHHHhCCcccCC-Ccc---------c Confidence 24 668888999999988775332 33 56 678899999999999999999999999521 100 0 Q ss_pred ccccceEEEcCchhcCCHHHHhhcccCceEEEEEECceEEEEEEEEEEeC Q lcl|NC_013597. 453 YLDKGFYVWAAPMDTLSDSDRQARRATPIQTAVKLAGAIHSSDVIVNYNR 502 (502) Q Consensus 453 ~~~~gy~v~~~~~~~~s~~dra~R~~~~i~~~~~~aGaIh~v~i~~~v~~ 502 (502) +.+. ..+|+ . -+++.+...-++++|.+++.+.+ T Consensus 525 -------v~v~-----~~~d~---~--~v~~~v~Pv~~mekIy~ti~~~~ 557 (562) T protein:vir:80 525 -------VQVV-----IEGDI---A--RISLTVFPIRSMKKIEVSLVYRQ 557 (562) T ss_pred -------eEEE-----ecCCE---E--EEEEEEEEcccceEEEEEEEEEe Confidence 1111 12222 2 47899999999999999999999 No 30 >protein:vir:96740 Length: 388 # NCBI annotation: phage tail sheath protein FI-like # Family: family:all:115 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039839;genbank:gi:126010913;genbank:GeneID:5076250 Probab=98.75 E-value=6.7e-08 Score=59.90 Aligned_cols=320 Identities=10% Similarity=-0.019 Sum_probs=169.5 Q ss_pred HHHhhcccceeEEEEecCccccccccccc------------------cc---cchhhHHHHHHhhhcccccceeEEEecc Q lcl|NC_013597. 110 LERFKSVVNGRFSLTIGGDVKKVDGLSFA------------------RL---ADFNAVATKIQEKLTTLSVAVSIAYDET 168 (502) Q Consensus 110 ~~~~~~~~~g~~~iti~g~~~~~~~i~~s------------------~~---ts~~~vA~~i~aal~~a~~~~tv~~~~~ 168 (502) |+......-|...+.++....++...+-+ .. .+..+.+. +...... T Consensus 1 m~~~~~~~hGv~v~ev~~g~~~i~~~~tavi~~Vgta~~ad~~~p~~~~~~i~~~~d~~~-~~~~~~~------------ 67 (388) T protein:vir:96 1 MPVIDQFEHNGISIETHEPPPPMGPPGDNVVAWVVTAPDKHADVAFSVPFRVANTADAQY-LDSTGNE------------ 67 (388) T ss_pred CCCCCCCCCceEEEEcCCCcccccccCcceeEEEEecCCCccccccccceeeecchhhhh-hhccccc------------ Confidence 11111111233333333333222211100 00 00011100 0000000 Q ss_pred cceeeEeeecccccccceeeeeeccccchhh-hhhhhhhcccccceeeeeccc-------------ccccc-CHHHHHHH Q lcl|NC_013597. 169 GNRFIVSANVAGEDKKTEIDYAIDEGGEGEY-IGALLKLENGQASRKVGKNSV-------------SLKKE-TLGEALFN 233 (502) Q Consensus 169 ~~~f~~~s~ttG~~~~v~~~~a~~~~~t~t~-~aa~l~~t~~~~~~~v~v~~~-------------~~~~e-t~~~al~a 233 (502) .++. .+....+.+......+-.... +.+.. .....+.+ T Consensus 68 ---------------------------~gtl~~al~~~~~~~~~~~~vv~v~~g~~~~at~a~iig~~~~~tg~~~gl~a 120 (388) T protein:vir:96 68 ---------------------------LGTGWHAASETLKKTSVPQYFIVVPEGADDAATMANIIGGIDPTTGRRTGIAA 120 (388) T ss_pred ---------------------------cccchhhhHhhhccCCceEEEEEeccccccccccceeeeecccccchhhHHHH Confidence 0000 000111111111111110000 01111 11233444 Q ss_pred HHhccCceeEEEEecCCC--hhHHHHHHHHHhhcCCEEEEEecCchhcccchhHHHHHHHHccC--CceEEEec------ Q lcl|NC_013597. 234 VAEVNNTWYGFTVAAQLT--DSEVEAAAKYAQANTKLFGANVIRAEQIEWSADNIYKKLYDAGL--DHTLAMFD------ 303 (502) Q Consensus 234 l~~~~~~w~~~~~~~~~~--~~~~~a~a~w~~a~~~~~~~~~~~~~~~~~~~~~i~~~l~~~~~--~~t~~~y~------ 303 (502) +...... ..++++-+.+ ..-..++...++.- +.|.+.-...+ ......+........++ .|.++.|- T Consensus 121 l~~~~~~-p~il~aPg~s~~~~v~~al~~~~~~~-~~~~i~D~p~~-~~~~~~~~~~~~~~~~~~s~~~~~~~P~~~~~d 197 (388) T protein:vir:96 121 LTECTER-PTLIGAPGFSQNKAVIDALASMAKRL-KCRAVIDGPSG-STQDAIDLSGLLGGEGTGHDRVYMVDPMPAIYS 197 (388) T ss_pred hhhcccc-eeEEEeeccccchHHHHHHHHHHhhc-CcEEEEeccCC-chhHHHHHHhhhhccCcCcceEEEEeCceeeec Confidence 4443322 2344432222 23345555555543 33443321111 00111111122222233 34443331 Q ss_pred CCc-----cchHHHHHHHHHhcCCCCCCceeeEeeeecCcccc-----CCCCHHHHHHHHhCCceEEEEEcCc-eEEecC Q lcl|NC_013597. 304 KND-----MYPVSSALARLLSTNFAANNSTLTLKFKQQPTITA-----DEITATEFAKAKRLGINVYTYFDDV-AMIAEG 372 (502) Q Consensus 304 ~~~-----~~~~aa~~g~~as~n~~~~~g~~T~~fk~~~Gv~~-----~~lt~t~~~~l~~~~~n~y~~~~~~-~~~~~G 372 (502) +.. -.+.+.+.|..+.+|+...+.-..+. +.|+.. ...+.+|++.|..+|+|.+.++.+. ..+|.+ T Consensus 198 ~~~~~~~~~p~s~~~AG~~a~~D~~~spaN~~i~---i~g~~~~~~~~~~~~~~~~~~Ln~~gI~~i~~~~~~G~~~wG~ 274 (388) T protein:vir:96 198 RKAQGNIYVPPSTIAMGAVAAVKPWESPGNQGVL---IQDVARVIDYNILDKSTEGDLLNRNGVSYFARTSMGGFSLIGN 274 (388) T ss_pred ccCCceeeechHHHHHHHHHhhcCcccccCeeEE---eeeecccccccccCChhhHHhhhhcCceEEEEecCCcEEEEcc Confidence 111 24667888999998875544433222 344432 2346789999999999999998665 468999 Q ss_pred EeecCeehhHHHHHHHHHHHHHHHHHHHHHhcCCCCccCHHHHHHHHHHHHHHHHHHHHcCccccccccCcccccccccc Q lcl|NC_013597. 373 TVIGGKFADEIVILDWFVDAVQKEVFARLYKSPTKIPLTDKGQAILIAAVEKVCLEGINNGAFAPGKWTGAGFGNLSTGD 452 (502) Q Consensus 373 ~~~~G~~iD~~~~~dwl~~~iq~~l~~~l~~~~~kiPyt~~G~~~l~~~v~~vl~~a~~~G~I~~g~~~~~~~g~~~~~~ 452 (502) ++++..||-+.+-.+|++..|+..+...+ .+ |.++.=...|+..|+.-|++-++.|.|. T Consensus 275 rT~~~~~i~vrR~~~~i~~si~~~~~~~v----~e-pn~~~~~~~i~~~i~~fL~~l~~~Gal~---------------- 333 (388) T protein:vir:96 275 RTVTGKFISFVGLEDAIARKLEAASQRAM----SK-QLTKSFMEQEIKKINLFMQDLVAAEIIP---------------- 333 (388) T ss_pred cccCCcceeehhhHHHHHHHHHHHHHHhc----cC-CCCHHHHHHHHHHHHHHHHHHHhCCcee---------------- Confidence 99999999999999999999999986654 23 7788889999999999999999999985 Q ss_pred ccccceEEEcCchhcCCHHHHhhcccCceEEEEEECceEEEEEEEEEEeC Q lcl|NC_013597. 453 YLDKGFYVWAAPMDTLSDSDRQARRATPIQTAVKLAGAIHSSDVIVNYNR 502 (502) Q Consensus 453 ~~~~gy~v~~~~~~~~s~~dra~R~~~~i~~~~~~aGaIh~v~i~~~v~~ 502 (502) ||.+++ .++..+++|+.+.+. -+.+.+.....+++|+++...+. T Consensus 334 ----g~~~~~-d~~~nt~~~i~~G~~-~~~i~~~p~~pae~I~~~~~~~~ 377 (388) T protein:vir:96 334 ----GGEVYL-HPTLNTVERYKNGSW-YIVIDYGRYSPNEHMIFHLNAVD 377 (388) T ss_pred ----eeEEEE-ecCCCCHHHhhCCEE-EEEEEEEecCCcceEEEEEEEch Confidence 355666 356788999998888 58999999999999999998888 No 31 >protein:vir:78206 Length: 390 # NCBI annotation: gp33, phage tail sheath protein # Family: family:all:115 # MgeID: mge:1848 # MgeName: phiE12-2 # Cross-refs: genbank:acc:YP_001111183;genbank:gi:134288694;genbank:GeneID:4960666 Probab=98.73 E-value=7.9e-08 Score=59.52 Aligned_cols=350 Identities=10% Similarity=0.032 Sum_probs=154.6 Q ss_pred HHHHHhcCCCCcceEEEEEeecccccceeeeeeccchhhhHHHHHhhcccceeEEEEecCccccc--cccccccccchhh Q lcl|NC_013597. 68 AAQPFFAQSPRAKQLIVARWQKSASTIEATKNTLSGATLSDDLERFKSVVNGRFSLTIGGDVKKV--DGLSFARLADFNA 145 (502) Q Consensus 68 aA~~~F~q~p~P~~l~igr~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~iti~g~~~~~--~~i~~s~~ts~~~ 145 (502) |+..|.-+ ++|=+.+....+.. .+ ......+ -|+.... +...+..+..+++ T Consensus 1 M~~~~~~G------v~v~e~~~g~~~i~----~~---------------~tav~g~--vg~a~~ad~~~~pln~pv~i~s 53 (390) T protein:vir:78 1 MPQDYHHG------VRVIEINEGGRPIR----SV---------------STAVLGV--VCTAADADASAFPLNTPVLLTN 53 (390) T ss_pred CcccccCC------eEEEEcCCCccccc----cc---------------CcceeEE--EEcccCcCccccccccceEecc Confidence 33222111 11111110000000 00 0000000 0000000 0001111111111 Q ss_pred HHHHHHhhhccccc---ceeEEEecccceeeEeeecccccccceeeeeeccccchhhhhhhhhhcccccceeeeeccccc Q lcl|NC_013597. 146 VATKIQEKLTTLSV---AVSIAYDETGNRFIVSANVAGEDKKTEIDYAIDEGGEGEYIGALLKLENGQASRKVGKNSVSL 222 (502) Q Consensus 146 vA~~i~aal~~a~~---~~tv~~~~~~~~f~~~s~ttG~~~~v~~~~a~~~~~t~t~~aa~l~~t~~~~~~~v~v~~~~~ 222 (502) ....+.. ++..+. .....++..+..-.+.+-..+........... +. .... T Consensus 54 ~~~~~~~-~g~~gtL~~al~~~~~~gg~~~~vv~v~~~~~~~~~~~~~i------------------------g~-~~~~ 107 (390) T protein:vir:78 54 VVAALGK-AGKKGTLRRTLDAIGKQTKPLTVVVRVAEGKDADETTSNVI------------------------GT-VTPD 107 (390) T ss_pred HHHHHhh-cCCCceehhhhhhhccccCceEEEEEecccccccccccccc------------------------cc-cccc Confidence 1111110 000000 00000111011101111011111110000000 00 0000 Q ss_pred cccCHHHHHHHHHhccCceeEEEEecCCChhHHHHHHHHHhhcCCEEEEEecCchhcccchhHHHHHHHHccCCceEEEe Q lcl|NC_013597. 223 KKETLGEALFNVAEVNNTWYGFTVAAQLTDSEVEAAAKYAQANTKLFGANVIRAEQIEWSADNIYKKLYDAGLDHTLAMF 302 (502) Q Consensus 223 ~~et~~~al~al~~~~~~w~~~~~~~~~~~~~~~a~a~w~~a~~~~~~~~~~~~~~~~~~~~~i~~~l~~~~~~~t~~~y 302 (502) ...+..+++........-=-..+.....+...+.+....+..+-+.+++.-.... .............+..|..+.| T Consensus 108 ~~~tg~~al~~~~~~~~~~p~il~ap~~~~~~v~~~l~~~a~~~~~~aivD~p~~---~t~~~a~~~~~~~~s~~~~~~~ 184 (390) T protein:vir:78 108 GKYTGIKALLAAQGALGVKPRILAAPGLDTQPVAAALAATAQSLRAMAYVSASGC---KTKEEAAAYRKQFGQREIMVIW 184 (390) T ss_pred cccchhhhhhhhhhhhcceehhhcccccchHHHHHHHHHhhcccceEEEEecCCC---CCHHHHHHHhhccCCceEEEEc Confidence 0011111111111110000001111111111111111111112222222211000 0001111111111222333222 Q ss_pred ------cCC-----ccchHHHHHHHHHhcCCCCCCceeeEeeeecCccccC--------CCCHHHHHHHHhCCceEEEEE Q lcl|NC_013597. 303 ------DKN-----DMYPVSSALARLLSTNFAANNSTLTLKFKQQPTITAD--------EITATEFAKAKRLGINVYTYF 363 (502) Q Consensus 303 ------~~~-----~~~~~aa~~g~~as~n~~~~~g~~T~~fk~~~Gv~~~--------~lt~t~~~~l~~~~~n~y~~~ 363 (502) ++. .-.+.+.+.|.++.+|.++-+ -.....|.+.|+.-- ..+..|.+.|..+|+|.+... T Consensus 185 p~~~~~d~~~~~~~~~p~s~~~Agl~a~~D~~~g~-~~spaN~~l~gi~~~~~~~~~~~~~~~~~~~~ln~~gi~t~~~~ 263 (390) T protein:vir:78 185 PDWLGWDDTTNSTAVIPAPAIAAGLRAKIDNDIGW-HKTISNVVVNGVSGISADVSWDLQDPATDAGYLNEHEVTTLVNR 263 (390) T ss_pred CceEeecccCCcccccchHHHHHHHHHHhhcCCCc-EECcCCceeeceeecceecccccccccchhhhhhhcCcEEEEcC Confidence 111 013467888888888854321 122334566665532 235667889999999999775 Q ss_pred cCceEEecCEeecCe----ehhHHHHHHHHHHHHHHHHHHHHHhcCCCCccCHHHHHHHHHHHHHHHHHHHHcCcccccc Q lcl|NC_013597. 364 DDVAMIAEGTVIGGK----FADEIVILDWFVDAVQKEVFARLYKSPTKIPLTDKGQAILIAAVEKVCLEGINNGAFAPGK 439 (502) Q Consensus 364 ~~~~~~~~G~~~~G~----~iD~~~~~dwl~~~iq~~l~~~l~~~~~kiPyt~~G~~~l~~~v~~vl~~a~~~G~I~~g~ 439 (502) .| ..+|.+++++++ ||-+.+-.+|+...|+..+...++ + |.++.-...|+..++.-|+..+++|.|. T Consensus 264 ~G-~~~wG~rT~s~d~~~~~i~~rR~~~~i~~~i~~~~~~~v~----e-~n~~~~~~~i~~~i~~~L~~l~~~g~l~--- 334 (390) T protein:vir:78 264 NG-FRFWGERTCSDDPKFAFENYTRTAQVAGDSIAEAQMPVVD----G-PLNPSLARDIVESINGWFRQQVANGYLI--- 334 (390) T ss_pred CC-EEEEcccccCCCcccceeehhhHHHHHHHHHHHHHHHhcc----C-CCCHHHHHHHHHHHHHHHHHHHhCCcee--- Confidence 55 467899999885 888999999999999999876543 3 7899999999999999999999999985 Q ss_pred ccCccccccccccccccceEEEcCchhcCCHHHHhhcccCceEEEEEECceEEEEEEEEEEeC Q lcl|NC_013597. 440 WTGAGFGNLSTGDYLDKGFYVWAAPMDTLSDSDRQARRATPIQTAVKLAGAIHSSDVIVNYNR 502 (502) Q Consensus 440 ~~~~~~g~~~~~~~~~~gy~v~~~~~~~~s~~dra~R~~~~i~~~~~~aGaIh~v~i~~~v~~ 502 (502) ||.+.+. .+..+++|+.+.+. -+.+.+.....+++|.++...+. T Consensus 335 -----------------g~~v~~d-~~~nt~~~i~~G~~-~~~v~~~p~~pae~I~~~~~~~~ 378 (390) T protein:vir:78 335 -----------------GGSAWID-PEPNTADILASGKA-YIDYDYTPVPPLENLVLRQRITD 378 (390) T ss_pred -----------------eeEEEEc-cCCCCHHHhhCCeE-EEEEEEEecCCcceEEEEEEEch Confidence 4778886 46889999998888 58999999999999999988888 No 32 >protein:vir:103993 Length: 390 # NCBI annotation: phage tail sheath protein # Family: family:all:115 # MgeID: mge:1665 # MgeName: phi52237 # Cross-refs: genbank:acc:YP_293729;genbank:gi:72537699;genbank:GeneID:3608123 Probab=98.73 E-value=7.9e-08 Score=59.52 Aligned_cols=350 Identities=10% Similarity=0.032 Sum_probs=154.6 Q ss_pred HHHHHhcCCCCcceEEEEEeecccccceeeeeeccchhhhHHHHHhhcccceeEEEEecCccccc--cccccccccchhh Q lcl|NC_013597. 68 AAQPFFAQSPRAKQLIVARWQKSASTIEATKNTLSGATLSDDLERFKSVVNGRFSLTIGGDVKKV--DGLSFARLADFNA 145 (502) Q Consensus 68 aA~~~F~q~p~P~~l~igr~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~iti~g~~~~~--~~i~~s~~ts~~~ 145 (502) |+..|.-+ ++|=+.+....+.. .+ ......+ -|+.... +...+..+..+++ T Consensus 1 M~~~~~~G------v~v~e~~~g~~~i~----~~---------------~tav~g~--vg~a~~ad~~~~pln~pv~i~s 53 (390) T protein:vir:10 1 MPQDYHHG------VRVIEINEGGRPIR----SV---------------STAVLGV--VCTAADADASAFPLNTPVLLTN 53 (390) T ss_pred CcccccCC------eEEEEcCCCccccc----cc---------------CcceeEE--EEcccCcCccccccccceEecc Confidence 33222111 11111110000000 00 0000000 0000000 0001111111111 Q ss_pred HHHHHHhhhccccc---ceeEEEecccceeeEeeecccccccceeeeeeccccchhhhhhhhhhcccccceeeeeccccc Q lcl|NC_013597. 146 VATKIQEKLTTLSV---AVSIAYDETGNRFIVSANVAGEDKKTEIDYAIDEGGEGEYIGALLKLENGQASRKVGKNSVSL 222 (502) Q Consensus 146 vA~~i~aal~~a~~---~~tv~~~~~~~~f~~~s~ttG~~~~v~~~~a~~~~~t~t~~aa~l~~t~~~~~~~v~v~~~~~ 222 (502) ....+.. ++..+. .....++..+..-.+.+-..+........... +. .... T Consensus 54 ~~~~~~~-~g~~gtL~~al~~~~~~gg~~~~vv~v~~~~~~~~~~~~~i------------------------g~-~~~~ 107 (390) T protein:vir:10 54 VVAALGK-AGKKGTLRRTLDAIGKQTKPLTVVVRVAEGKDADETTSNVI------------------------GT-VTPD 107 (390) T ss_pred HHHHHhh-cCCCceehhhhhhhccccCceEEEEEecccccccccccccc------------------------cc-cccc Confidence 1111110 000000 00000111011101111011111110000000 00 0000 Q ss_pred cccCHHHHHHHHHhccCceeEEEEecCCChhHHHHHHHHHhhcCCEEEEEecCchhcccchhHHHHHHHHccCCceEEEe Q lcl|NC_013597. 223 KKETLGEALFNVAEVNNTWYGFTVAAQLTDSEVEAAAKYAQANTKLFGANVIRAEQIEWSADNIYKKLYDAGLDHTLAMF 302 (502) Q Consensus 223 ~~et~~~al~al~~~~~~w~~~~~~~~~~~~~~~a~a~w~~a~~~~~~~~~~~~~~~~~~~~~i~~~l~~~~~~~t~~~y 302 (502) ...+..+++........-=-..+.....+...+.+....+..+-+.+++.-.... .............+..|..+.| T Consensus 108 ~~~tg~~al~~~~~~~~~~p~il~ap~~~~~~v~~~l~~~a~~~~~~aivD~p~~---~t~~~a~~~~~~~~s~~~~~~~ 184 (390) T protein:vir:10 108 GKYTGIKALLAAQGALGVKPRILAAPGLDTQPVAAALAATAQSLRAMAYVSASGC---KTKEEAAAYRKQFGQREIMVIW 184 (390) T ss_pred cccchhhhhhhhhhhhcceehhhcccccchHHHHHHHHHhhcccceEEEEecCCC---CCHHHHHHHhhccCCceEEEEc Confidence 0011111111111110000001111111111111111111112222222211000 0001111111111222333222 Q ss_pred ------cCC-----ccchHHHHHHHHHhcCCCCCCceeeEeeeecCccccC--------CCCHHHHHHHHhCCceEEEEE Q lcl|NC_013597. 303 ------DKN-----DMYPVSSALARLLSTNFAANNSTLTLKFKQQPTITAD--------EITATEFAKAKRLGINVYTYF 363 (502) Q Consensus 303 ------~~~-----~~~~~aa~~g~~as~n~~~~~g~~T~~fk~~~Gv~~~--------~lt~t~~~~l~~~~~n~y~~~ 363 (502) ++. .-.+.+.+.|.++.+|.++-+ -.....|.+.|+.-- ..+..|.+.|..+|+|.+... T Consensus 185 p~~~~~d~~~~~~~~~p~s~~~Agl~a~~D~~~g~-~~spaN~~l~gi~~~~~~~~~~~~~~~~~~~~ln~~gi~t~~~~ 263 (390) T protein:vir:10 185 PDWLGWDDTTNSTAVIPAPAIAAGLRAKIDNDIGW-HKTISNVVVNGVSGISADVSWDLQDPATDAGYLNEHEVTTLVNR 263 (390) T ss_pred CceEeecccCCcccccchHHHHHHHHHHhhcCCCc-EECcCCceeeceeecceecccccccccchhhhhhhcCcEEEEcC Confidence 111 013467888888888854321 122334566665532 235667889999999999775 Q ss_pred cCceEEecCEeecCe----ehhHHHHHHHHHHHHHHHHHHHHHhcCCCCccCHHHHHHHHHHHHHHHHHHHHcCcccccc Q lcl|NC_013597. 364 DDVAMIAEGTVIGGK----FADEIVILDWFVDAVQKEVFARLYKSPTKIPLTDKGQAILIAAVEKVCLEGINNGAFAPGK 439 (502) Q Consensus 364 ~~~~~~~~G~~~~G~----~iD~~~~~dwl~~~iq~~l~~~l~~~~~kiPyt~~G~~~l~~~v~~vl~~a~~~G~I~~g~ 439 (502) .| ..+|.+++++++ ||-+.+-.+|+...|+..+...++ + |.++.-...|+..++.-|+..+++|.|. T Consensus 264 ~G-~~~wG~rT~s~d~~~~~i~~rR~~~~i~~~i~~~~~~~v~----e-~n~~~~~~~i~~~i~~~L~~l~~~g~l~--- 334 (390) T protein:vir:10 264 NG-FRFWGERTCSDDPKFAFENYTRTAQVAGDSIAEAQMPVVD----G-PLNPSLARDIVESINGWFRQQVANGYLI--- 334 (390) T ss_pred CC-EEEEcccccCCCcccceeehhhHHHHHHHHHHHHHHHhcc----C-CCCHHHHHHHHHHHHHHHHHHHhCCcee--- Confidence 55 467899999885 888999999999999999876543 3 7899999999999999999999999985 Q ss_pred ccCccccccccccccccceEEEcCchhcCCHHHHhhcccCceEEEEEECceEEEEEEEEEEeC Q lcl|NC_013597. 440 WTGAGFGNLSTGDYLDKGFYVWAAPMDTLSDSDRQARRATPIQTAVKLAGAIHSSDVIVNYNR 502 (502) Q Consensus 440 ~~~~~~g~~~~~~~~~~gy~v~~~~~~~~s~~dra~R~~~~i~~~~~~aGaIh~v~i~~~v~~ 502 (502) ||.+.+. .+..+++|+.+.+. -+.+.+.....+++|.++...+. T Consensus 335 -----------------g~~v~~d-~~~nt~~~i~~G~~-~~~v~~~p~~pae~I~~~~~~~~ 378 (390) T protein:vir:10 335 -----------------GGSAWID-PEPNTADILASGKA-YIDYDYTPVPPLENLVLRQRITD 378 (390) T ss_pred -----------------eeEEEEc-cCCCCHHHhhCCeE-EEEEEEEecCCcceEEEEEEEch Confidence 4778886 46889999998888 58999999999999999988888 No 33 >protein:vir:105470 Length: 451 # NCBI annotation: putative phage sheath tail protein # Family: family:all:632 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529880;genbank:gi:90592620;genbank:GeneID:3974534 Probab=98.71 E-value=8.9e-08 Score=59.24 Aligned_cols=406 Identities=12% Similarity=0.014 Sum_probs=204.7 Q ss_pred CCc----CcCc-----eeEEeecc-cc-cccccccccceEEEecccccccccCccceEEecCHHHHHhhcC--CCcHHHH Q lcl|NC_013597. 1 MAL----SISH-----IVNVQLNT-VP-KSAARKSFGIVALFTPEAGQAFADEKTRYVYVENQRDVEQLFG--TNSETAK 67 (502) Q Consensus 1 Msi----p~s~-----iV~V~i~~-~~-~~~~~~~f~~~lil~~~~~~~~~~~~~r~~~y~s~~~v~~~fg--~~s~ey~ 67 (502) |+= -.+| +|++..+- .+ .+++.+ ..++++....-++ .+.+. -.+.+++..-|| .+++.|+ T Consensus 1 magg~~~~~~K~~PGvYi~~~~~~~~~~~~~~~~---~~~~i~~~~~~g~---~~~v~-i~~~~d~~~~fG~~~~~~~~~ 73 (451) T protein:vir:10 1 MAGGTWKAQDKRRPGTYINVVGNGQREAASSLGR---VLLIRDKGLGWGK---NGVIE-VEANSDFTKKLGTTLDDPSLT 73 (451) T ss_pred CCceeeccceeecCceEEEEeccCcceeeccCCc---EEEEEeeecCCCC---cccEE-eecHHHHHHHcCCcccchhHH Confidence 331 1223 35554432 22 122222 3455554433222 23344 455588999999 5667788 Q ss_pred HHHHHhcCCCCcceEEEEEeecccccceeeeeeccchhhhHHHHHhhcccce----eEEEEecCccccccccccccccch Q lcl|NC_013597. 68 AAQPFFAQSPRAKQLIVARWQKSASTIEATKNTLSGATLSDDLERFKSVVNG----RFSLTIGGDVKKVDGLSFARLADF 143 (502) Q Consensus 68 aA~~~F~q~p~P~~l~igr~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g----~~~iti~g~~~~~~~i~~s~~ts~ 143 (502) +-+.+|. .|+++++-|-.. +..+.++.. ...+ .+++.-.| .++++|.......+.-+++. T Consensus 74 ~~~~~~~---g~~~v~~yrl~~-g~~a~~t~~---~~~~-----~~~Aky~G~~Gn~i~v~v~~~~~d~~~~~v~t---- 137 (451) T protein:vir:10 74 ALKETLK---GASKVLVLNPNE-GTAATLTKE---GLPW-----TVTANYPGEKGNQITVSVEVSPADQNAATVST---- 137 (451) T ss_pred HHHHHhc---CCcEEEEEEcCC-CceEEEEee---cCce-----EEEEeeCCcCCceEEEEEecccCCcCceEEEE---- Confidence 8888875 488999988643 222222211 1000 11222222 35555433222211111110 Q ss_pred hhHHHHHHhhhcccccceeEEEecccceeeEee--ecccccccceeeeeeccccchhhhhhhhhhcccccceeeeecccc Q lcl|NC_013597. 144 NAVATKIQEKLTTLSVAVSIAYDETGNRFIVSA--NVAGEDKKTEIDYAIDEGGEGEYIGALLKLENGQASRKVGKNSVS 221 (502) Q Consensus 144 ~~vA~~i~aal~~a~~~~tv~~~~~~~~f~~~s--~ttG~~~~v~~~~a~~~~~t~t~~aa~l~~t~~~~~~~v~v~~~~ 221 (502) +.........+. ...-......+..................++... .....+ T Consensus 138 ---------------------~~g~~~vd~qtv~~~~~~el~~nd~V~a~~~~~g~~~~~~~~~l~~~~-----~gg~~~ 191 (451) T protein:vir:10 138 ---------------------IFGTKLVDEQSIKFNELDKFKGNDYITAKVVEEGSSKPVAFTNVSGTL-----TGGTTT 191 (451) T ss_pred ---------------------EECCeEEEEEEeeccchhhccCCceEEEEecccccccceeeeeccccc-----cccccc Confidence 000000000000 0000000000000000000000000000011000 001122 Q ss_pred ccccCHHHHHHHHHhccCceeEEEEecCCChhHHHHHHHHHhh---c-CCEEEEEecCchhcccchhHHHHHHHHccCCc Q lcl|NC_013597. 222 LKKETLGEALFNVAEVNNTWYGFTVAAQLTDSEVEAAAKYAQA---N-TKLFGANVIRAEQIEWSADNIYKKLYDAGLDH 297 (502) Q Consensus 222 ~~~et~~~al~al~~~~~~w~~~~~~~~~~~~~~~a~a~w~~a---~-~~~~~~~~~~~~~~~~~~~~i~~~l~~~~~~~ 297 (502) ...+...++|+++.....+|..+.. .+.+.+.+..+.+|+.. + .+.+.......... ..++.+ T Consensus 192 ~~~~~~~~~l~~~e~~~~n~l~~~~-~~~~~~i~~~~~a~ik~~r~~~g~~~~aVl~~~~~~------------~~d~eg 258 (451) T protein:vir:10 192 ESNKVESLLNDALENEEYAVVTTAG-FEPSSNMNKLVVEAVKRLRENEGRKVRGVIPTDADT------------TYNYEG 258 (451) T ss_pred CCccchHHHHHHhccceeeEEEEcc-CCCchHHHHHHHHHHHHHHHhcCCeEEEEecCccCC------------CCCCcc Confidence 3456677888888887666543322 22334567788999985 2 34443332111000 012222 Q ss_pred eEEEecC---------CccchHHHHHHHHHhcCCCCCCceeeEeeeecCccc-c-CCCCHHHHHHHHhCCceEEEEEcCc Q lcl|NC_013597. 298 TLAMFDK---------NDMYPVSSALARLLSTNFAANNSTLTLKFKQQPTIT-A-DEITATEFAKAKRLGINVYTYFDDV 366 (502) Q Consensus 298 t~~~y~~---------~~~~~~aa~~g~~as~n~~~~~g~~T~~fk~~~Gv~-~-~~lt~t~~~~l~~~~~n~y~~~~~~ 366 (502) .+.+-+. +.....+.+.|..|+..++. ..-||.++|+. . ..++.+|++.+.++|...+....|. T Consensus 259 iinv~n~~~~~dg~~~~~~~~~~~vAG~~Ag~~~~~-----S~T~~~~~~~~~v~~~~t~~e~~~~i~~G~lvl~~~~g~ 333 (451) T protein:vir:10 259 ISTVVNGYTLSDGTNVDVKDATGYFAGISASADVAT-----SLTYFEVEDAVSAYPKFDNEKTIKALDAGQIVFTTRPGQ 333 (451) T ss_pred eEEeecceEecCceeechhhhHHHHHHHHccccccc-----CccceecCCceeeeeeCCHHHHHHHHhCCeEEEEEEcCC Confidence 2222111 11233456667777765543 23566888763 3 4789999999999999877544443 Q ss_pred -eEEecCEee----c---C-e--ehhHHHHHHHHHHHHHHHHHHHHHhcCCCCccCHHHHHHHHHHHHHHHHHHHHcCcc Q lcl|NC_013597. 367 -AMIAEGTVI----G---G-K--FADEIVILDWFVDAVQKEVFARLYKSPTKIPLTDKGQAILIAAVEKVCLEGINNGAF 435 (502) Q Consensus 367 -~~~~~G~~~----~---G-~--~iD~~~~~dwl~~~iq~~l~~~l~~~~~kiPyt~~G~~~l~~~v~~vl~~a~~~G~I 435 (502) -.+.+|..+ + + . .|-.++-.|-+.+.++..+ +..+. +|+|=+..|..++.+.|+.-|++..+.|.| T Consensus 334 ~v~i~~~INTltt~~~~k~~~~~ki~vir~~D~i~~di~~~~-~~~yi--Gk~~N~~~gr~~~~~~i~~yl~~l~~~g~i 410 (451) T protein:vir:10 334 RVVIEQDINSLHKFTAEKPQAFSKNRVIRTLDEIATNTENTF-ERTYL--GNVGNNAAGRDLFKADRIAYLTSLQNRNMI 410 (451) T ss_pred eEEEEEccccceecCCCCCcchhhhhHHHHHHHHHHHHHHHh-hhccc--eecCCCHHHHHHHHHHHHHHHHHHHhCCCc Confidence 345566433 1 1 2 4778888999998887764 43333 689999999999999999999999999999 Q ss_pred ccccccCccccccccccccccceEEEcCchhcCCHHHHhhcccCceEEEEEECceEEEEEEEEEEe Q lcl|NC_013597. 436 APGKWTGAGFGNLSTGDYLDKGFYVWAAPMDTLSDSDRQARRATPIQTAVKLAGAIHSSDVIVNYN 501 (502) Q Consensus 436 ~~g~~~~~~~g~~~~~~~~~~gy~v~~~~~~~~s~~dra~R~~~~i~~~~~~aGaIh~v~i~~~v~ 501 (502) ..+... .+.+.. . ..+..--+++..+.-.+|.++.+++.|. T Consensus 411 ~~~~~~-----------------d~~v~~---~-----~~~~~v~v~~~v~pvdame~iy~t~~v~ 451 (451) T protein:vir:10 411 QSFANT-----------------DITVEA---G-----NDMDSIVVNLAVTPVDAMEKLYMTMVVR 451 (451) T ss_pred cCCCcc-----------------ceEEee---c-----CCCCEEEEEEEEEEEeeeeeEEEEEEEc Confidence 643211 111110 0 1123345899999999999999999998 No 34 >protein:vir:79181 Length: 390 # NCBI annotation: gp30, phage tail sheath protein # Family: family:all:115 # MgeID: mge:1866 # MgeName: phiE202 # Cross-refs: genbank:acc:YP_001111061;genbank:gi:134288772;genbank:GeneID:4960700 Probab=98.65 E-value=1.5e-07 Score=58.05 Aligned_cols=360 Identities=12% Similarity=0.070 Sum_probs=199.5 Q ss_pred CCcCcCceeEE-eecccccccccccccceEEEecccccc--cccCccceEEecCHHHHHhhcCCCcHHHHHHHHHhcCCC Q lcl|NC_013597. 1 MALSISHIVNV-QLNTVPKSAARKSFGIVALFTPEAGQA--FADEKTRYVYVENQRDVEQLFGTNSETAKAAQPFFAQSP 77 (502) Q Consensus 1 Msip~s~iV~V-~i~~~~~~~~~~~f~~~lil~~~~~~~--~~~~~~r~~~y~s~~~v~~~fg~~s~ey~aA~~~F~q~p 77 (502) |+--.-.=|.| .+.-.+.++.......+.|++...... ..+-...++. ++..+....||.....+.+...+|.+.. T Consensus 1 M~~~~~~Gv~v~e~~~~~~~i~~~~tav~~~vg~a~dad~~~~p~n~pv~i-ts~~~~~~~~g~~~tL~~al~~~~~~~~ 79 (390) T protein:vir:79 1 MPQDYHHGVRVIEINEGGRPIRSVSTAVLGVVCTAADADASAFPLNTPVLL-TNVVAALGKAGKKGTLRRTLDAIGKQTK 79 (390) T ss_pred CccccCCCeEEEEcCCCcccccccCCceeEEEEecCCCCccccccccceEe-ecHHHHHHhcCCCccchhhhhhhccccc Confidence 76444333444 344556667777777777777553221 1112244554 5666667779988888888888888755 Q ss_pred CcceEEEEEeecccccceeeeeeccchhhhHHHHHhhcccceeEEEEecCccccccccccccccchhhHHHHHHhhhccc Q lcl|NC_013597. 78 RAKQLIVARWQKSASTIEATKNTLSGATLSDDLERFKSVVNGRFSLTIGGDVKKVDGLSFARLADFNAVATKIQEKLTTL 157 (502) Q Consensus 78 ~P~~l~igr~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~iti~g~~~~~~~i~~s~~ts~~~vA~~i~aal~~a 157 (502) .+ .++-+....... . . + ..-.+++.. .....++ +. ++... T Consensus 80 ~~--~~vv~v~~~~~~-~------------~--------~---~~~~ig~~~---------~~~~~tg----l~-al~~~ 119 (390) T protein:vir:79 80 PL--TVVVRVAEGKDA-D------------E--------T---TSNVIGTVT---------PDGKYTG----IK-ALLAA 119 (390) T ss_pred ce--EEEEeecccccc-c------------c--------c---cceeeeccc---------ccccchh----hh-hhhhh Confidence 43 333332211000 0 0 0 000000000 0000000 00 00000 Q ss_pred ccceeEEEecccceeeEeeecccccccceeeeeeccccchhhhhhhhhhcccccceeeeeccccccccCHHHHHHHHHhc Q lcl|NC_013597. 158 SVAVSIAYDETGNRFIVSANVAGEDKKTEIDYAIDEGGEGEYIGALLKLENGQASRKVGKNSVSLKKETLGEALFNVAEV 237 (502) Q Consensus 158 ~~~~tv~~~~~~~~f~~~s~ttG~~~~v~~~~a~~~~~t~t~~aa~l~~t~~~~~~~v~v~~~~~~~et~~~al~al~~~ 237 (502) . ..+.. . +.. +...+........ ++... T Consensus 120 ~-----------~~~~~-------~-----------------------------p~i--l~ap~~~~~~v~~---~l~~~ 147 (390) T protein:vir:79 120 Q-----------GALGV-------K-----------------------------PRI--LAAPGLDTQPVAA---ALAAT 147 (390) T ss_pred h-----------hhhcc-------c-----------------------------ccc--ccCCcccchHHHH---HHHHh Confidence 0 00000 0 000 0000111111122 22223 Q ss_pred cCceeEEEEecCCChhHHHHHHHHHhhcCCEEEEEecCchhcccchhHHHHHHHHccCCceEEEecCCccchHHHHHHHH Q lcl|NC_013597. 238 NNTWYGFTVAAQLTDSEVEAAAKYAQANTKLFGANVIRAEQIEWSADNIYKKLYDAGLDHTLAMFDKNDMYPVSSALARL 317 (502) Q Consensus 238 ~~~w~~~~~~~~~~~~~~~a~a~w~~a~~~~~~~~~~~~~~~~~~~~~i~~~l~~~~~~~t~~~y~~~~~~~~aa~~g~~ 317 (502) ...+..+.+.+..+.....++.+|.+.-+-.+....+. -..... .....-.. ..+.+.+.|.+ T Consensus 148 a~~~~~~ai~D~p~~~t~~~a~~~~~~~~s~~~~~~~p-~~~~~d-----------~~~~~~~~-----~p~s~~~Ag~~ 210 (390) T protein:vir:79 148 AQSLRAMAYVSASGCKTKEEAAAYRRQFGQREIMVIWP-DWLGWD-----------DTTNSTAV-----IPAPAIAAGLR 210 (390) T ss_pred hhhcceEEEEEccCCCCHHHHHHHhcCCCCceEEEEcC-ceeecc-----------cccCceeE-----eehHHHHHHHH Confidence 33444555555443333444556655422111111110 000000 00000011 12467778888 Q ss_pred HhcCCCCCCceeeEe---eeecCccccC--------CCCHHHHHHHHhCCceEEEEEcCceEEecCEeecCe----ehhH Q lcl|NC_013597. 318 LSTNFAANNSTLTLK---FKQQPTITAD--------EITATEFAKAKRLGINVYTYFDDVAMIAEGTVIGGK----FADE 382 (502) Q Consensus 318 as~n~~~~~g~~T~~---fk~~~Gv~~~--------~lt~t~~~~l~~~~~n~y~~~~~~~~~~~G~~~~G~----~iD~ 382 (502) +.+|..+ | -|+ .|.+.|+..- ..+..|++.|..+|+|......| ..+|.+++++++ ||-+ T Consensus 211 a~~D~~~--g--~~~spsN~~i~gi~~~~~~~~~~~~~~~~~a~~Ln~~gi~t~~~~~G-~~~wG~rT~~~d~~~~~i~v 285 (390) T protein:vir:79 211 AKIDNDI--G--WHKTISNVVVNGVSGISADVSWDLQDPATDAGYLNEHEVTTLVNRNG-FRFWGERTCSDDPKFAFENY 285 (390) T ss_pred HhhhccC--C--cEEccCCceeeccceeeeeccccccccchhhhhhhhcCcEEEEcCCC-EEEEeccccCCCcccceeee Confidence 8887433 2 233 5566565322 23566888899999999866433 567899999884 7889 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCCCCccCHHHHHHHHHHHHHHHHHHHHcCccccccccCccccccccccccccceEEEc Q lcl|NC_013597. 383 IVILDWFVDAVQKEVFARLYKSPTKIPLTDKGQAILIAAVEKVCLEGINNGAFAPGKWTGAGFGNLSTGDYLDKGFYVWA 462 (502) Q Consensus 383 ~~~~dwl~~~iq~~l~~~l~~~~~kiPyt~~G~~~l~~~v~~vl~~a~~~G~I~~g~~~~~~~g~~~~~~~~~~gy~v~~ 462 (502) .+-.+|+...|+..+...++ + |.+..=...|+..++.-|++.+++|.|. ||.+.+ T Consensus 286 rR~~~~i~~~i~~~~~~~v~----e-~~~~~~~~~i~~~i~~~L~~l~~~gal~--------------------g~~v~~ 340 (390) T protein:vir:79 286 TRTAQVAADSIAEAQMPVVD----G-PLNPSLARDIVESINGWFRQQVANGYLI--------------------GGSAWI 340 (390) T ss_pred hhhHHHHHHHHHHHHHHhcc----C-CCCHHHHHHHHHHHHHHHHHHHhCCcee--------------------eeEEEE Confidence 99999999999999876553 2 7788889999999999999999999985 477877 Q ss_pred CchhcCCHHHHhhcccCceEEEEEECceEEEEEEEEEEeC Q lcl|NC_013597. 463 APMDTLSDSDRQARRATPIQTAVKLAGAIHSSDVIVNYNR 502 (502) Q Consensus 463 ~~~~~~s~~dra~R~~~~i~~~~~~aGaIh~v~i~~~v~~ 502 (502) . .+..+++|+.+.+. -+.+.+.....+++|+++...+. T Consensus 341 d-~~~nt~~~i~~G~~-~~~i~~~p~~p~e~i~~~~~~~~ 378 (390) T protein:vir:79 341 D-PEPNTADILASGKA-YIDYDYTPVPPLENLVLRQRITD 378 (390) T ss_pred e-cCCCCHHHhhCCEE-EEEEEEEecCCcceEEEEEEEch Confidence 6 46788999998888 58999999999999999999888 No 35 >protein:vir:63742 Length: 562 # NCBI annotation: Tsh # Family: family:all:2449 # MgeID: mge:1517 # MgeName: P100 # Cross-refs: genbank:gi:82547629;genbank:GeneID:3783475 Probab=98.63 E-value=1.7e-07 Score=57.63 Aligned_cols=449 Identities=12% Similarity=0.087 Sum_probs=218.8 Q ss_pred CCc---CcCce----eEEeeccc-ccccccccccceEEEecccccccccCccceEEecCHHHHHhhcCCCcHHHHHHHHH Q lcl|NC_013597. 1 MAL---SISHI----VNVQLNTV-PKSAARKSFGIVALFTPEAGQAFADEKTRYVYVENQRDVEQLFGTNSETAKAAQPF 72 (502) Q Consensus 1 Msi---p~s~i----V~V~i~~~-~~~~~~~~f~~~lil~~~~~~~~~~~~~r~~~y~s~~~v~~~fg~~s~ey~aA~~~ 72 (502) |++ |..++ |-|.+.-+ ..+....+.+.+.|+|....-|+ .+...+++.++...-||... .-.+...+ T Consensus 1 ~~~~~~~~~~~~~Pgv~~~~~~s~~~~~~~~~~~~~~~iG~a~~G~~----~~~~~~~~~~~~~~~fg~g~-l~~~i~~a 75 (562) T protein:vir:63 1 MAIEIYPRKPVSRPHTEISVDTSGIGGSSSGSEKILCLVGSATGGKP----NAVYKVRNYSQAKSVFRSGE-LLDAIERA 75 (562) T ss_pred CeeeeeCCCcccCCceEEEEecCCCcccCCCCCceEEEEEEeCCCCC----ceeEEEccHHHHHHHhcCCc-hHHHHHHh Confidence 887 33333 23333322 34566677778889988776554 45677788889899998754 55666777 Q ss_pred hcCCC--CcceEEEEEeecccccceeeeeecc------------------chhhh--HH----------HHHhhccccee Q lcl|NC_013597. 73 FAQSP--RAKQLIVARWQKSASTIEATKNTLS------------------GATLS--DD----------LERFKSVVNGR 120 (502) Q Consensus 73 F~q~p--~P~~l~igr~~~~~~~~~~~~~~~~------------------~~~~~--~~----------~~~~~~~~~g~ 120 (502) |..++ .-.++|+-|- ..+..+.++.+.+. ..++. .. .+.+..+ ... T Consensus 76 ~~~~~~~g~~~~~~~rv-~~a~~a~~~~~~~~~~a~~~g~~~n~i~v~~~~~~~~~~~~~~v~~~~~~~~ev~~~~-g~V 153 (562) T protein:vir:63 76 WNPGEGTGAGDILAMRV-EEAKEATFEAEGVKVSSTIYGADANDIQVALEDNTITGTKRLSIVFAKERVNQVYDNL-GSI 153 (562) T ss_pred ccccccCCceEEEEEEc-CCCccceeEecceeEEEeecccCCCeEEEEEecCCCCCCcceEEEecCCCcchhhhhc-cce Confidence 75433 2345666565 32333333333211 00000 00 0000000 001 Q ss_pred EEEEecC-------------------------ccccccccccccccchhhHHHHHHhhhcccccceeEEEec-ccceeeE Q lcl|NC_013597. 121 FSLTIGG-------------------------DVKKVDGLSFARLADFNAVATKIQEKLTTLSVAVSIAYDE-TGNRFIV 174 (502) Q Consensus 121 ~~iti~g-------------------------~~~~~~~i~~s~~ts~~~vA~~i~aal~~a~~~~tv~~~~-~~~~f~~ 174 (502) |+|+..| ..+++..+.+... ....+..+..++... ...+..|.. .++.+.+ T Consensus 154 ~~i~y~g~~~~~~~~v~~~~~~~~a~~l~~~~g~~~v~~~~L~~g--~~~~~~~l~~~in~~-~~~~aky~~~~gn~i~~ 230 (562) T protein:vir:63 154 FSIKYKGTEASATFTVAVDPVTFKATKLTLKAGDKTVKEYDLGSG--AYAETNVLISDINNL-PDFEAKFFPIGDKNLTT 230 (562) T ss_pred eeeeeecccccceEEEEecCcceeEEEEEeecCCcceeEEEecCC--ccchhHHHHHhhccc-cceEEEeeccCCceeee Confidence 2221111 1111111111110 011122222222211 011111110 0111111 Q ss_pred eeecccccccc--eeeeee-------ccccchhhhhh----hhhhcccccceeeeeccccccccCHHHHHHHHHhccCce Q lcl|NC_013597. 175 SANVAGEDKKT--EIDYAI-------DEGGEGEYIGA----LLKLENGQASRKVGKNSVSLKKETLGEALFNVAEVNNTW 241 (502) Q Consensus 175 ~s~ttG~~~~v--~~~~a~-------~~~~t~t~~aa----~l~~t~~~~~~~v~v~~~~~~~et~~~al~al~~~~~~w 241 (502) .....-....+ ...+.. .......++.. ...+... ....+.-...|...++..++++++... +| T Consensus 231 ~~~d~~~~~~vkt~~~~v~t~~~d~~~~~~~~~~v~~~~~~~~~la~~-~~~~LtGG~dGt~~~~~~~al~ale~~--~~ 307 (562) T protein:vir:63 231 DNFDAQIDVDIKTKEAYVKAVGGDIEKQTAYNGYVDFEFDRSKEIANF-PLTKLTGGDNGTIPESWADKFSYFANE--GG 307 (562) T ss_pred eccccccccchhhhhhhhhhhhhhhhhcccccceeeeeeccccceecc-cceeeecCCCCCchhhHHHHHHHHHhC--Cc Confidence 00000000000 000000 00000000000 0000000 011111222333344567888888765 56 Q ss_pred eEEEEecCCChhHHHHHHHHHhhc---C-CEEEEEecCchhcccchhHHHHHHHHccCCceEEEecCC---------ccc Q lcl|NC_013597. 242 YGFTVAAQLTDSEVEAAAKYAQAN---T-KLFGANVIRAEQIEWSADNIYKKLYDAGLDHTLAMFDKN---------DMY 308 (502) Q Consensus 242 ~~~~~~~~~~~~~~~a~a~w~~a~---~-~~~~~~~~~~~~~~~~~~~i~~~l~~~~~~~t~~~y~~~---------~~~ 308 (502) +.+.+ ...+.+-+.++.+|++.. . ++..+..... ......+....+..++.|.+.+...- -.+ T Consensus 308 ~~i~~-~t~d~av~~~l~a~vkr~~~~g~~~~aVlg~~~---~~~~~~~~~~a~~~n~ervv~v~~~~~~~~~~~~~~~~ 383 (562) T protein:vir:63 308 YYLVP-LTSKQAVHAEALQFVRDCSYNGNPMRVFVGGGI---GESMEQLFTRAIGLQNERAGLIGFSGTVKMDDGRSLKM 383 (562) T ss_pred EEEEe-cCCCHHHHHHHHHHHHHHHhCCCcEEEEecCCC---CCCHHHHHHHhhhcCCCcEEEEecCeeEECCCCceeee Confidence 65443 333445567899999752 2 3333332111 11223444555667888887765431 123 Q ss_pred h----HHHHHHHHHhcCCCCCCceeeEeeeecCccc-cCCCCHHHHHHHHhCCceEEEEEcCceE----EecCEeecC-- Q lcl|NC_013597. 309 P----VSSALARLLSTNFAANNSTLTLKFKQQPTIT-ADEITATEFAKAKRLGINVYTYFDDVAM----IAEGTVIGG-- 377 (502) Q Consensus 309 ~----~aa~~g~~as~n~~~~~g~~T~~fk~~~Gv~-~~~lt~t~~~~l~~~~~n~y~~~~~~~~----~~~G~~~~G-- 377 (502) + .+.+.|..+..+++. ++ -||.++++. ...++.+|++.+..+|++.+....+... .-++.+.-+ T Consensus 384 ~~~~~aa~vAGl~A~~~~~~---Sl--T~~~i~~~~v~~~~t~~e~~~li~~Gv~~l~~~~~~~v~~~~iv~~itT~t~~ 458 (562) T protein:vir:63 384 PGYMFAAQVAGLTCGLEIGE---AI--TFKNIAIETLDTIYEGSQLDQLNESGIITAEFVRNRAVTNFRIVDDVTTFNDK 458 (562) T ss_pred chhHHHHHHHHHhhcCchhc---Cc--cceeeccccccccCCHHHHHHHHhCCeEEEEEecCCcEEEEEeeccceecCCC Confidence 3 445666666665543 33 334554332 2478999999999999999987654321 123433322 Q ss_pred ---ee--hhHHHHHHHHHHHHHHHHHHHHHhcCCCCccCHHHHHHHHHHHHHHHHHHHHcCccccccccCcccccccccc Q lcl|NC_013597. 378 ---KF--ADEIVILDWFVDAVQKEVFARLYKSPTKIPLTDKGQAILIAAVEKVCLEGINNGAFAPGKWTGAGFGNLSTGD 452 (502) Q Consensus 378 ---~~--iD~~~~~dwl~~~iq~~l~~~l~~~~~kiPyt~~G~~~l~~~v~~vl~~a~~~G~I~~g~~~~~~~g~~~~~~ 452 (502) .| |=.++-.|.+...++..+-+. +- +| |=++.|...|++.|++.|++..+.|.|. +.... + T Consensus 459 ~~~~~~ki~viRv~D~i~~dir~~~~~~-yi--Gk-~Nn~~~r~~v~~~i~~~L~~l~~~gaI~-~~~~~---------d 524 (562) T protein:vir:63 459 TDPVKSEIGVGEANDFLVSELKISLDNE-YI--GT-KIIDTSASLVKNFVQSFLDRKKLAKEIQ-DYSPE---------E 524 (562) T ss_pred CCchhhhhhhhHHHHHHHHHHHHHHHhc-CC--cc-ccChHHHHHHHHHHHHHHHHHHhCCccc-CCCcc---------c Confidence 24 668889999999998775333 33 56 6788999999999999999999999995 21100 0 Q ss_pred ccccceEEEcCchhcCCHHHHhhcccCceEEEEEECceEEEEEEEEEEeC Q lcl|NC_013597. 453 YLDKGFYVWAAPMDTLSDSDRQARRATPIQTAVKLAGAIHSSDVIVNYNR 502 (502) Q Consensus 453 ~~~~gy~v~~~~~~~~s~~dra~R~~~~i~~~~~~aGaIh~v~i~~~v~~ 502 (502) +.+. ..+|+ --+++.+...-++|+|.+++++.+ T Consensus 525 -------v~v~-----~~~d~-----~~v~~~v~pv~~mekIy~ti~~~~ 557 (562) T protein:vir:63 525 -------VQVV-----IEGDV-----ARISLTVFPIRSMKKIEVSLVYRQ 557 (562) T ss_pred -------eEEE-----ecCCE-----EEEEEEEEEcccceEEEEEEEEee Confidence 1111 11222 247889999999999999999999 No 36 >protein:vir:79141 Length: 391 # NCBI annotation: major tail seath protein gpFI # Family: family:all:115 # MgeID: mge:1863 # MgeName: RSA1 # Cross-refs: genbank:acc:YP_001165276;genbank:gi:145708101;genbank:GeneID:5247152 Probab=98.62 E-value=1.9e-07 Score=57.46 Aligned_cols=349 Identities=11% Similarity=0.024 Sum_probs=152.8 Q ss_pred HHHHHhcCCCCcceEEEEEeecccccceeeeeeccchhhhHHHHHhhcccceeEEEEecCccccccccccccccchhhHH Q lcl|NC_013597. 68 AAQPFFAQSPRAKQLIVARWQKSASTIEATKNTLSGATLSDDLERFKSVVNGRFSLTIGGDVKKVDGLSFARLADFNAVA 147 (502) Q Consensus 68 aA~~~F~q~p~P~~l~igr~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~iti~g~~~~~~~i~~s~~ts~~~vA 147 (502) |+. .+.|.- +|=+.+....+ ...+... + .+ |..+........ ..+..+.-.++.+ T Consensus 1 M~~-----~~~pGv-~v~e~~~~~~~----i~~~~ta-----------v-~~-~vg~a~~a~~~~--~p~n~pv~iss~~ 55 (391) T protein:vir:79 1 MPT-----DYHHGV-RVVELNDGTRP----IRTIETA-----------V-AG-IVCTADDADAAT--FPLDTPVLLTNPQ 55 (391) T ss_pred CCC-----CCCCCe-EEEECCCCccc----ccccCCc-----------e-EE-EEeecccccccc--cccccCEEeccHH Confidence 221 111211 22111110000 0000000 0 00 000000000000 0011111111111 Q ss_pred HHHHhhhccccccee--EEEecccceeeEeeecccccccceeeeeeccccchhhhhhhhhhcccccceeeeecccccccc Q lcl|NC_013597. 148 TKIQEKLTTLSVAVS--IAYDETGNRFIVSANVAGEDKKTEIDYAIDEGGEGEYIGALLKLENGQASRKVGKNSVSLKKE 225 (502) Q Consensus 148 ~~i~aal~~a~~~~t--v~~~~~~~~f~~~s~ttG~~~~v~~~~a~~~~~t~t~~aa~l~~t~~~~~~~v~v~~~~~~~e 225 (502) ..+............ ..++..+..-....-............. + .+.... T Consensus 56 ~~~~~~g~~gtl~~al~~~~~~gg~~~~vv~~~~~~~~~~~~~~~------------------------~----g~~~~~ 107 (391) T protein:vir:79 56 AYIGKAGDKGTLAHTLDAITDQTNPLTVVVRVAGGASEAETTSNL------------------------I----GTTNAA 107 (391) T ss_pred HHHHhcCCccccchhhhhhhcccccceeeeccccccccccccccc------------------------c----ccccch Confidence 111110000000000 0000000000000000000000000000 0 000000 Q ss_pred CHHHHHHHHHhccCcee---EEE-EecCCChhHHHHHHHHHhhcCCEEEEEecCchhcccchhHHHHHHHHccCCceEEE Q lcl|NC_013597. 226 TLGEALFNVAEVNNTWY---GFT-VAAQLTDSEVEAAAKYAQANTKLFGANVIRAEQIEWSADNIYKKLYDAGLDHTLAM 301 (502) Q Consensus 226 t~~~al~al~~~~~~w~---~~~-~~~~~~~~~~~a~a~w~~a~~~~~~~~~~~~~~~~~~~~~i~~~l~~~~~~~t~~~ 301 (502) .....+..+.+...... ..+ ..+........++...++....+.+.-.... ....+........+..|..+. T Consensus 108 ~~~tGl~~l~~~~~~~~~~p~~l~~p~~~~~~v~~al~~~~~~~~~~ai~d~p~~----~t~~~a~~~~~~~~s~~~a~~ 183 (391) T protein:vir:79 108 GRYTGMKALLTARNRFGVAPRILAVPGLDSLPVGTELVTIAQKLRAFAYLSAYGC----QTKEEAVAYRSNFGQREAMVM 183 (391) T ss_pred hhhHHHhhhhhhhhhhcccchhhcCCccchhHHHHHHHHHHhhcCcEEEEECCCC----CCHHHHHHHHhccCCceeEEe Confidence 00111111111100000 000 0111112222233333332222211111000 011111111111222232222 Q ss_pred e------cCC-----ccchHHHHHHHHHhcCCCCCCceeeEeeeecCccccC--------CCCHHHHHHHHhCCceEEEE Q lcl|NC_013597. 302 F------DKN-----DMYPVSSALARLLSTNFAANNSTLTLKFKQQPTITAD--------EITATEFAKAKRLGINVYTY 362 (502) Q Consensus 302 y------~~~-----~~~~~aa~~g~~as~n~~~~~g~~T~~fk~~~Gv~~~--------~lt~t~~~~l~~~~~n~y~~ 362 (502) | ++. ...+.+.+.|.++.+|.++.+ -.....|.+.|+... ..+.+|.+.|..+++|.+.. T Consensus 184 ~P~~~~~d~~~~~~~~~p~s~~~AG~~a~~D~~~g~-~~spaN~~l~gi~~~~~~~~~~~~~~~~~~~~Ln~~~I~t~~~ 262 (391) T protein:vir:79 184 WPDFVGWDTAANAETTLWATARAVGLRAKIDNDTGW-HKTLSNVAVGGVTGLSRDVFWDLQDPATDAGYLNANEVTTLVH 262 (391) T ss_pred cceeeeecCcCCceeeechHHHHHHHHHHhhhcccc-eeccCCceehhhhccccccccccccccchhhhhhhcCceEEEC Confidence 2 111 123567888999988854321 112223566665422 23566788999999999865 Q ss_pred EcCceEEecCEeecCe----ehhHHHHHHHHHHHHHHHHHHHHHhcCCCCccCHHHHHHHHHHHHHHHHHHHHcCccccc Q lcl|NC_013597. 363 FDDVAMIAEGTVIGGK----FADEIVILDWFVDAVQKEVFARLYKSPTKIPLTDKGQAILIAAVEKVCLEGINNGAFAPG 438 (502) Q Consensus 363 ~~~~~~~~~G~~~~G~----~iD~~~~~dwl~~~iq~~l~~~l~~~~~kiPyt~~G~~~l~~~v~~vl~~a~~~G~I~~g 438 (502) ..| ..+|.+++++++ ||-+.+-.+|+...|+..+...++ + |.++.-...|+..|+.-|++.+++|.|. T Consensus 263 ~~G-~~~wG~rT~~~d~~~~~i~~rR~~~~i~~~i~~~~~~~v~----e-pn~~~~~~~i~~~i~~~l~~l~~~g~l~-- 334 (391) T protein:vir:79 263 RDG-YRFWGSRTCSADPLFAFENYTRTAQVLADTMAEAHMWAND----L-PMTPTLVRDLLEGINAKLRMLTRNGYLL-- 334 (391) T ss_pred CCc-EEEEcccccCCCcccceeehhhHHHHHHHHHHHHHHHhcc----C-CCCHHHHHHHHHHHHHHHHHHHhCCcee-- Confidence 433 468899999985 888999999999999999876553 3 7889999999999999999999999985 Q ss_pred cccCccccccccccccccceEEEcCchhcCCHHHHhhcccCceEEEEEECceEEEEEEEEEEeC Q lcl|NC_013597. 439 KWTGAGFGNLSTGDYLDKGFYVWAAPMDTLSDSDRQARRATPIQTAVKLAGAIHSSDVIVNYNR 502 (502) Q Consensus 439 ~~~~~~~g~~~~~~~~~~gy~v~~~~~~~~s~~dra~R~~~~i~~~~~~aGaIh~v~i~~~v~~ 502 (502) ||.+.+. .+..+++|+.+.+. -+.+.+...-.+++|+++...++ T Consensus 335 ------------------g~~v~~~-~~~nt~~~i~~G~~-~~~i~~~p~~p~e~i~~~~~~~~ 378 (391) T protein:vir:79 335 ------------------GGAAWFD-ADANSKDTLKAGQL-AIDYDYTPVPPLENLTFRQRITD 378 (391) T ss_pred ------------------ceEEEEe-cCCCCHHHhhCCEE-EEEEEEEecCCcceEEEEEEEch Confidence 3566664 46788999888887 58999999999999999999988 No 37 >protein:vir:104858 Length: 729 # NCBI annotation: T4-like tail sheath protein # Family: family:all:661 # MgeID: mge:1630 # MgeName: P-SSM2 # Cross-refs: genbank:acc:YP_214361;genbank:gi:61806001;genbank:GeneID:3294378 Probab=98.61 E-value=2e-07 Score=57.31 Aligned_cols=456 Identities=9% Similarity=-0.007 Sum_probs=175.1 Q ss_pred CCcCc----CceeEEeecccc----cccccccccceEEEe-cccccccccCccceEEecCHHHHHhhcCCCcH---HHHH Q lcl|NC_013597. 1 MALSI----SHIVNVQLNTVP----KSAARKSFGIVALFT-PEAGQAFADEKTRYVYVENQRDVEQLFGTNSE---TAKA 68 (502) Q Consensus 1 Msip~----s~iV~V~i~~~~----~~~~~~~f~~~lil~-~~~~~~~~~~~~r~~~y~s~~~v~~~fg~~s~---ey~a 68 (502) -..+- ...++..+.-.. ......+-...++.. ..... .+...+........+..... .+-+ T Consensus 165 ~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-------~v~~~s~~~~~~~~~~~~~~~~~~~~~ 237 (729) T protein:vir:10 165 VASGNTTAVGSAVTQSISKTIGTATGTTTIDGVLKGIVTGSTDTTL-------EVKVISHISAAGVETAVEYQQNGTYTF 237 (729) T ss_pred eeccccccceeeeeeeccccccccccceeeeeeecccccccccccc-------cceecccccccccceeccccccceeee Confidence 00000 000000000000 000000000000000 00000 00000000000000000000 0000 Q ss_pred HHH---------Hhc-CCCCcceEEEEEeecccccceeeee-----eccchhhhHHHHHhhcccceeEEEE-------ec Q lcl|NC_013597. 69 AQP---------FFA-QSPRAKQLIVARWQKSASTIEATKN-----TLSGATLSDDLERFKSVVNGRFSLT-------IG 126 (502) Q Consensus 69 A~~---------~F~-q~p~P~~l~igr~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~g~~~it-------i~ 126 (502) ... .-+ ...++..... .|.... ...+... .+...+...............+... +. T Consensus 238 ~~~~s~~~~a~~~~~~~~~~~~t~~~-~~~~~~-~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~~d~~~~~~~d~~~~~~ 315 (729) T protein:vir:10 238 DNSGSVNVIAAGSSGSGSAKSYTAQT-DWFESQ-NIVLSNSTLEWDSIADAPGTSTYVSTRGGKNDEIHVLVIDDKGTIT 315 (729) T ss_pred cccCccceeeeccccccccccceeee-cccccc-ccccccccccccccccccccccccccccccccccceeeeccccccc Confidence 000 000 0000000000 000000 0000000 0000000000000000000000000 00 Q ss_pred Ccccccc--ccccccccch---hhHHHHHHhhhcccccceeEEEecccceeeEeeecccccccceeeeeeccccchhhhh Q lcl|NC_013597. 127 GDVKKVD--GLSFARLADF---NAVATKIQEKLTTLSVAVSIAYDETGNRFIVSANVAGEDKKTEIDYAIDEGGEGEYIG 201 (502) Q Consensus 127 g~~~~~~--~i~~s~~ts~---~~vA~~i~aal~~a~~~~tv~~~~~~~~f~~~s~ttG~~~~v~~~~a~~~~~t~t~~a 201 (502) +...... ...++...+. .+.......-+.. ....+....... ............ .... ......+.+ T Consensus 316 ~~~g~vve~~~~~s~~~~~~~~~~~~~~~~~vi~~--~s~~~~~~~~~~-~~~~~~~~~~~~-~~~~----~~~~~~~~a 387 (729) T protein:vir:10 316 GNSGTILEKHLSLSKAKDAEYSVGSSSYWRDFLAT--NSKYIFGGGATS-GITTTGYSVSST-NTLD----TDSGWDQNA 387 (729) T ss_pred cCcccceeeeeeeeeccccccccccccccceeecc--ccceeeeccccc-cccccccccccc-ceec----ccccccccc Confidence 0000000 0000000000 0000000000000 000000000000 000000000000 0000 000000000 Q ss_pred hhhhhcccccceeeeec---------------cccccccCHHHHHHHHHhccC-ceeEEEEe-----cCCChhHHHHHHH Q lcl|NC_013597. 202 ALLKLENGQASRKVGKN---------------SVSLKKETLGEALFNVAEVNN-TWYGFTVA-----AQLTDSEVEAAAK 260 (502) Q Consensus 202 a~l~~t~~~~~~~v~v~---------------~~~~~~et~~~al~al~~~~~-~w~~~~~~-----~~~~~~~~~a~a~ 260 (502) ......... ...+... ......+....++.++.+... .....+.. +..+.....++.. T Consensus 388 ~~~~~~~~~-~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~g~~~l~~~~~~~~~~~~~~~~~~~~~~~~~v~~a~~~ 466 (729) T protein:vir:10 388 EGVNFGASG-VATLTLAGGTNYGDKTDLTTSGALSSGVDDIISGYTLFENTEEIEVDFILMGAAHHPKEQSQAVAEKVTA 466 (729) T ss_pred ccccccccc-eeEEEeecccccccccccccccccccchhHHHHHHHHhhcccccccceeeecCCCCCccchHHHHHHHHH Confidence 000000000 0000000 000111223455655554321 12211111 1123445567777 Q ss_pred HHhhcCCEEEEEecCc--------------hhcccchhHHHHHHHHccCCceEEEecC-------C-c----cchHHHHH Q lcl|NC_013597. 261 YAQANTKLFGANVIRA--------------EQIEWSADNIYKKLYDAGLDHTLAMFDK-------N-D----MYPVSSAL 314 (502) Q Consensus 261 w~~a~~~~~~~~~~~~--------------~~~~~~~~~i~~~l~~~~~~~t~~~y~~-------~-~----~~~~aa~~ 314 (502) .++....++.+..... +.......+........+.++-..+|++ . . -.+.+.++ T Consensus 467 ~~~~~~~~~a~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~d~~~~~~~~~p~s~~~a 546 (729) T protein:vir:10 467 VAEARKDAVAFISPYRQAFLNDTVSGTVTVSNIDQTTENVVGFYAPLSSSTYSVFDSGYKYMFDRFNNTFRYVPLNGDIA 546 (729) T ss_pred HHHhcCCeEEEecccccccccccccccccccccchhhHHHHHHHhhccCCceEEEEcCeeEEecccCCceEEechhHHHH Confidence 7877665655442110 0011111222222222222333444443 1 1 12456788 Q ss_pred HHHHhcCCCCCCceeeEeeeecCcccc-----CCCCHHHHHHHHhCCceEEEEEcCc-eEEecCEeecC-----eehhHH Q lcl|NC_013597. 315 ARLLSTNFAANNSTLTLKFKQQPTITA-----DEITATEFAKAKRLGINVYTYFDDV-AMIAEGTVIGG-----KFADEI 383 (502) Q Consensus 315 g~~as~n~~~~~g~~T~~fk~~~Gv~~-----~~lt~t~~~~l~~~~~n~y~~~~~~-~~~~~G~~~~G-----~~iD~~ 383 (502) |.++.+|.++.+ ......|.+.||.- ..+++.|++.|..+|+|.+.++.+. ..+|.++++.+ .||-+. T Consensus 547 Gl~a~~d~~~g~-~~span~~~~~i~g~~~~~~~~~~~~~~~Ln~~gIn~i~~~~~~G~~~wG~rT~~~~d~~~~~i~vr 625 (729) T protein:vir:10 547 GTCARTDIEQFP-WFSPAGTARGPILNSVKLVYNPGKKQRDILYSNRINPVILSPGAGIILFGDKTGFGKSSAFDRINVR 625 (729) T ss_pred HHHHHhhccCCc-EEccCCccccceecccceeeecChhhHhhhhhCCceEEEEecCCeEEEEcceecCCCCcccceeehh Confidence 888988865432 12233444444422 3578999999999999999999765 56888888755 478899 Q ss_pred HHHHHHHHHHHHHHHHHHHhcCCCCccCHHHHHHHHHHHHHHHHHHHHcCccccccccCccccccccccccccceEEEcC Q lcl|NC_013597. 384 VILDWFVDAVQKEVFARLYKSPTKIPLTDKGQAILIAAVEKVCLEGINNGAFAPGKWTGAGFGNLSTGDYLDKGFYVWAA 463 (502) Q Consensus 384 ~~~dwl~~~iq~~l~~~l~~~~~kiPyt~~G~~~l~~~v~~vl~~a~~~G~I~~g~~~~~~~g~~~~~~~~~~gy~v~~~ 463 (502) +-.+|++..|+..+...++. |.++.=...|+..|+.-|+..+++|.|. ||.|.+. T Consensus 626 R~~~~i~~si~~~~~~~v~e-----pn~~~~~~~i~~~i~~~L~~l~~~g~l~--------------------g~~v~~d 680 (729) T protein:vir:10 626 RLFIYLEDAISAAAKDQLFE-----FNDELTRTNFVNIVEPFLRDVQAKRGIF--------------------DFVVICD 680 (729) T ss_pred hhHHHHHHHHHHHHHHhhcC-----CCCHHHHHHHHHHHHHHHHHHHhcccee--------------------eeEEEEc Confidence 99999999999998776543 6688889999999999999999999984 4899987 Q ss_pred chhcCCHHHHhhcccCceEEEEEECceEEEEEEEEEEeC Q lcl|NC_013597. 464 PMDTLSDSDRQARRATPIQTAVKLAGAIHSSDVIVNYNR 502 (502) Q Consensus 464 ~~~~~s~~dra~R~~~~i~~~~~~aGaIh~v~i~~~v~~ 502 (502) .+..+++|+.+.+. .+.+.+.....+++|.+++.-.| T Consensus 681 -~~~nt~~~i~~G~~-~~~v~~~p~~p~e~i~~~~~~~~ 717 (729) T protein:vir:10 681 -ETNNTAAVIDSNEF-VADIFIKPARSINFIGLTFVATR 717 (729) T ss_pred -CCCCCHHHhhCCeE-EEEEEEEecCCccEEEEEEEEee Confidence 67889999999888 59999999999999999876666 No 38 >protein:vir:1172 Length: 391 # NCBI annotation: hypothetical protein # Family: family:all:115 # MgeID: mge:24 # MgeName: phi CTX # Cross-refs: genbank:acc:NP_490621;genbank:gi:17313241;genbank:GeneID:927300 Probab=98.60 E-value=2.1e-07 Score=57.17 Aligned_cols=362 Identities=12% Similarity=0.047 Sum_probs=193.7 Q ss_pred CC--cCcCceeEEeecccccccccccccceEEEecccccc--cccCccceEEecCHHHHHhhcCCCcHHHHHHHHHhcCC Q lcl|NC_013597. 1 MA--LSISHIVNVQLNTVPKSAARKSFGIVALFTPEAGQA--FADEKTRYVYVENQRDVEQLFGTNSETAKAAQPFFAQS 76 (502) Q Consensus 1 Ms--ip~s~iV~V~i~~~~~~~~~~~f~~~lil~~~~~~~--~~~~~~r~~~y~s~~~v~~~fg~~s~ey~aA~~~F~q~ 76 (502) |+ .+.--+-=+.+.-.+.++.....+.+.+++...... ..+..+.++. ++..+....||.....+.+...+|.+. T Consensus 1 M~~~~~~~GV~v~e~~~~~~~i~~v~tavig~vg~a~~a~~~~~~~~~p~~v-~s~~~~~~~~g~~~tl~~al~~~~~~~ 79 (391) T protein:vir:11 1 MAADQYHHGVRVQEINDGTRPIRTIATAIIGLVATAEDADATAFPLDTPVLI-TNVQAAIGKAGTSGTLPASLQAIADQA 79 (391) T ss_pred CCCCcCCCcEEEEECCCCcceecccCCceeEEEEecCCCCCccccccccEEE-ecchhhheecCCCccchhhhhhhhccc Confidence 44 222122222333445566666777777776654211 1112233443 455565667898888888888888865 Q ss_pred CCcceEEEEEeecccccceeeeeeccchhhhHHHHHhhcccceeEEEEecCccccccccccccccchhhHHHHHHhhhcc Q lcl|NC_013597. 77 PRAKQLIVARWQKSASTIEATKNTLSGATLSDDLERFKSVVNGRFSLTIGGDVKKVDGLSFARLADFNAVATKIQEKLTT 156 (502) Q Consensus 77 p~P~~l~igr~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~iti~g~~~~~~~i~~s~~ts~~~vA~~i~aal~~ 156 (502) ..+ .++-+....+ . ...+..++....+.......+...+.. T Consensus 80 g~~--~~vv~~~~~~-------------~------------------------~~~t~~d~~g~~~a~~~~~g~~a~~~~ 120 (391) T protein:vir:11 80 NAA--TVVVRVKPGE-------------D------------------------EAATNSAVIGGVSADGKYTGMKALLAA 120 (391) T ss_pred cce--eEEeeecccc-------------c------------------------ccccchhhhcccccccchhhhhhhhhh Confidence 443 2222211000 0 000000000000000000000000000 Q ss_pred cccceeEEEecccceeeEeeecccccccceeeeeeccccchhhhhhhhhhcccccceeeeeccccccccCHHHHHHHHHh Q lcl|NC_013597. 157 LSVAVSIAYDETGNRFIVSANVAGEDKKTEIDYAIDEGGEGEYIGALLKLENGQASRKVGKNSVSLKKETLGEALFNVAE 236 (502) Q Consensus 157 a~~~~tv~~~~~~~~f~~~s~ttG~~~~v~~~~a~~~~~t~t~~aa~l~~t~~~~~~~v~v~~~~~~~et~~~al~al~~ 236 (502) + ...+.... . +...+........++. . T Consensus 121 ---------------~----~~~~~~p~--~-----------------------------~~ap~~~~~~v~~al~---~ 147 (391) T protein:vir:11 121 ---------------K----ARLGVVPR--I-----------------------------LGVPGLDTQPVATALI---A 147 (391) T ss_pred ---------------h----hhheeccc--c-----------------------------ccccccccHHHHHHHH---H Confidence 0 00000000 0 0001111111222232 2 Q ss_pred ccCceeEEEEecCCChhHHHHHHHHHhhcCCEEEEEecC-chhcccchhHHHHHHHHccCCceEEEecCCccchHHHHHH Q lcl|NC_013597. 237 VNNTWYGFTVAAQLTDSEVEAAAKYAQANTKLFGANVIR-AEQIEWSADNIYKKLYDAGLDHTLAMFDKNDMYPVSSALA 315 (502) Q Consensus 237 ~~~~w~~~~~~~~~~~~~~~a~a~w~~a~~~~~~~~~~~-~~~~~~~~~~i~~~l~~~~~~~t~~~y~~~~~~~~aa~~g 315 (502) .....-.|.+.+........++-+|-+.-+-.+....+. ....+ ..++... ...+.+.+.| T Consensus 148 ~~~~~~~~~i~D~p~~~t~~~a~~~r~~~~s~~~~~~~p~~~~~~------------~~~~~~~------~~p~s~~~ag 209 (391) T protein:vir:11 148 IAQQLRAFAYVSASGCKTKEEATAYRENFAAREAMVIWPDFLTWS------------TVVNQTV------PAPAVAQALG 209 (391) T ss_pred hhcccceEEEEEcCCCCCHHHHHHHhhhcCCceEEEEcCcceecc------------cccCceE------EechHHHHHH Confidence 223334455555433333444445555322111111110 00000 0001000 1124667778 Q ss_pred HHHhcCCCCCCceeeEeeeecCccccC--------CCCHHHHHHHHhCCceEEEEEcCceEEecCEeecCe----ehhHH Q lcl|NC_013597. 316 RLLSTNFAANNSTLTLKFKQQPTITAD--------EITATEFAKAKRLGINVYTYFDDVAMIAEGTVIGGK----FADEI 383 (502) Q Consensus 316 ~~as~n~~~~~g~~T~~fk~~~Gv~~~--------~lt~t~~~~l~~~~~n~y~~~~~~~~~~~G~~~~G~----~iD~~ 383 (502) ..+.+|.+..+ -.....|.+.||..- ..+..|.+.|..+|+|......| ..+|.+++++++ ||-+. T Consensus 210 ~~a~~d~~~g~-~~span~~l~gi~~~~~~~~~~~~~~~~~~~~Ln~~gi~~~~~~~G-~~~wG~rT~~~d~~~~~i~vr 287 (391) T protein:vir:11 210 LRARIDQEVGW-HKTLSNVAVNGVTGISADVFWDLQSPSTDANYLNENEVTTLVQEGG-FRFWGSRTCSDDPLFAFENYT 287 (391) T ss_pred HHHHhhccCCc-EEccCCceeeceeecccccccccCCCcchhhhhhhcCcEEEEcCCC-EEEEcccccCCCcccceeehh Confidence 88877754321 112234566655532 23578999999999999865333 578899999985 78899 Q ss_pred HHHHHHHHHHHHHHHHHHHhcCCCCccCHHHHHHHHHHHHHHHHHHHHcCccccccccCccccccccccccccceEEEcC Q lcl|NC_013597. 384 VILDWFVDAVQKEVFARLYKSPTKIPLTDKGQAILIAAVEKVCLEGINNGAFAPGKWTGAGFGNLSTGDYLDKGFYVWAA 463 (502) Q Consensus 384 ~~~dwl~~~iq~~l~~~l~~~~~kiPyt~~G~~~l~~~v~~vl~~a~~~G~I~~g~~~~~~~g~~~~~~~~~~gy~v~~~ 463 (502) +-.+|++..|+..+...++. |-++.=...|+..|+.-|++.+++|.|. ||.+.+. T Consensus 288 R~~~~i~~~~~~~~~~~v~e-----~n~~~~~~~i~~~i~~~l~~l~~~g~l~--------------------g~~~~~~ 342 (391) T protein:vir:11 288 RTAQVLADTIAEAHMWAVDK-----PMHPSLVRDILEGVNAKFRELKGLGLII--------------------DAQAWYD 342 (391) T ss_pred hHHHHHHHHHHHHHHHhccC-----CCCHHHHHHHHHHHHHHHHHHHhcccee--------------------ceEEEEe Confidence 99999999999998765533 6788889999999999999999999985 3566664 Q ss_pred chhcCCHHHHhhcccCceEEEEEECceEEEEEEEEEEeC Q lcl|NC_013597. 464 PMDTLSDSDRQARRATPIQTAVKLAGAIHSSDVIVNYNR 502 (502) Q Consensus 464 ~~~~~s~~dra~R~~~~i~~~~~~aGaIh~v~i~~~v~~ 502 (502) .+..+++|+.+.+. -+.+.+.....+++|.++...+. T Consensus 343 -~~~n~~~~i~~G~~-~~~i~~~p~~p~e~i~~~~~~~~ 379 (391) T protein:vir:11 343 -PNVNDKDTLKAGKL-RITYDYTPVPPLEDLTFFQKITD 379 (391) T ss_pred -cCCCCHHHhhCCeE-EEEEEEEecCCcceEEEEEEEch Confidence 46788999999888 58999999999999999999988 No 39 >protein:vir:5833 Length: 742 # NCBI annotation: similar to tail sheath protein # Family: family:all:661 # MgeID: mge:123 # MgeName: RM 378 # Cross-refs: genbank:acc:NP_835619;genbank:gi:30044022 Probab=98.57 E-value=2.7e-07 Score=56.62 Aligned_cols=436 Identities=12% Similarity=0.111 Sum_probs=185.3 Q ss_pred CCcCc---Cc-----eeEEeecccccccccccccceEEEecccccccccCccceEEecCHHHHHhhcCCCcHHHHH---- Q lcl|NC_013597. 1 MALSI---SH-----IVNVQLNTVPKSAARKSFGIVALFTPEAGQAFADEKTRYVYVENQRDVEQLFGTNSETAKA---- 68 (502) Q Consensus 1 Msip~---s~-----iV~V~i~~~~~~~~~~~f~~~lil~~~~~~~~~~~~~r~~~y~s~~~v~~~fg~~s~ey~a---- 68 (502) --|.+ .| |=+++++..+....-..|....-|-.--+.+......|.|. -++-..||-|.- T Consensus 220 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--------~~~~~~~~~~~~~~~~ 291 (742) T protein:vir:58 220 EGIDLNSFNKQFVVSIENITVNREKGQVLYPSFDVVVHFRDIRGVSANTEYIRFRQ--------VNLNPESPNYIERVIG 291 (742) T ss_pred cccccCcccceeeEEEeeeeecccCCceeccceeEEEEEeeccCCCCCccceeeee--------eecCCCCcceeeeccc Confidence 00000 00 22334444444444445543333322111111111122221 122333333321 Q ss_pred -------------HHHHhcCCC----------CcceEEEEEeecccccceeeeeeccchhhhHHHHHhhcccceeEEEEe Q lcl|NC_013597. 69 -------------AQPFFAQSP----------RAKQLIVARWQKSASTIEATKNTLSGATLSDDLERFKSVVNGRFSLTI 125 (502) Q Consensus 69 -------------A~~~F~q~p----------~P~~l~igr~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~iti 125 (502) .-.|+.+-| ++.-.-+.+|...... + .....+|..+.+....+++ T Consensus 292 ~~~~~~~~~~~~~g~~~~n~~~~~~~~~~~~~~~~~~~~s~~~~~~~~---------~---~~~v~d~~~~~~~~~~v~~ 359 (742) T protein:vir:58 292 NMTFEFDGERIVTGGEYPNQVPFLRVVVSQDIKQNVAGVEKWVPVGFE---------G---IYSVGDFTVIVNELTNVSI 359 (742) T ss_pred ceeeeeccceeeecccccccccceeeEeccccCcCccceeEEEecccc---------c---cccccceeeecccccccee Confidence 111111111 0000001111110000 0 0001111111111112222 Q ss_pred cCccccccccccccccchhhHHHHHHhhhcccccceeE------------------EEecccceeeEeeeccccccccee Q lcl|NC_013597. 126 GGDVKKVDGLSFARLADFNAVATKIQEKLTTLSVAVSI------------------AYDETGNRFIVSANVAGEDKKTEI 187 (502) Q Consensus 126 ~g~~~~~~~i~~s~~ts~~~vA~~i~aal~~a~~~~tv------------------~~~~~~~~f~~~s~ttG~~~~v~~ 187 (502) ..+..... +.+..+..+.+ .+. .+...++ .........+.-+........... T Consensus 360 ~~t~~~~~--pp~~~~~~e~v-----~~n--gG~~f~v~s~~~~g~~i~~~~as~~~s~ln~~~~V~Gt~aa~~~~d~~t 430 (742) T protein:vir:58 360 PVTDSAII--PPMRFTRIEQI-----TLS--GGASFSVISNQPYGFNIQDSRHSYWLSPFKDDELIIGTELVLPALDVST 430 (742) T ss_pred eccccccC--Cccccccccee-----ecc--cCcceEEEEecccCcceeccCcceEEeccCCceEEEeehhhccccccch Confidence 21111100 00000000000 000 0000000 000000000000000000000000 Q ss_pred eeeeccccc-hhhhhhhhhhcccccc---------eeeeeccccccc-cCHHHHHHHHHhccCceeEEEEecCCC-hhHH Q lcl|NC_013597. 188 DYAIDEGGE-GEYIGALLKLENGQAS---------RKVGKNSVSLKK-ETLGEALFNVAEVNNTWYGFTVAAQLT-DSEV 255 (502) Q Consensus 188 ~~a~~~~~t-~t~~aa~l~~t~~~~~---------~~v~v~~~~~~~-et~~~al~al~~~~~~w~~~~~~~~~~-~~~~ 255 (502) .+....... ....+....+..+... ..++.. ...+. ..--+.|.++.+. .+. .++++-+.+ .+.. T Consensus 431 ~~~v~s~~~alp~~a~sv~laGG~dg~v~v~~~~~D~iG~~-~~~d~~~adrTGL~ALlev-~eV-tILiAPG~t~~~v~ 507 (742) T protein:vir:58 431 EFGVSSWEEALPEFSFLMPFQGGSDGYIRVDENEPDTIGRV-KITPALLANYERLLPLLTE-DQF-DLVLTPYLTFADHA 507 (742) T ss_pred heeccccccccceeeEEEeecCCccccccccCCCccccccc-ccccccccchhHHHHhhhc-CCC-cEEEEcCCCchHHH Confidence 000000000 0000000111110000 000000 00000 0011234444433 122 344443333 3344 Q ss_pred HHHHHHHhh-cCCEEEEEecCchhcccchhHHHHHHHHccCCceEEEec----CCc-----cchHHHHHHHHHhcCCCCC Q lcl|NC_013597. 256 EAAAKYAQA-NTKLFGANVIRAEQIEWSADNIYKKLYDAGLDHTLAMFD----KND-----MYPVSSALARLLSTNFAAN 325 (502) Q Consensus 256 ~a~a~w~~a-~~~~~~~~~~~~~~~~~~~~~i~~~l~~~~~~~t~~~y~----~~~-----~~~~aa~~g~~as~n~~~~ 325 (502) .++.+.++. +++++.+...+ ................+..|..+.|- .+. ..+.++++|.++.+|.+ T Consensus 508 aav~A~la~a~~Rl~vL~D~P--~~~tt~~~A~a~r~~~nSsraaly~PwVkv~d~~~~r~vPpSgaIAGL~ARtD~e-- 583 (742) T protein:vir:58 508 GTVNAFINRAENRFLYLFDIA--GDDDTENLAISLAGYINSSFATTFFPWVRRLTNKGMRTVPASLAAYRSIRTTDPE-- 583 (742) T ss_pred HHHHHHHHhhcCCeEEEEecC--CCCchHHHHHHHHhccCCceEEEEeceeeeccCCcceeechHHHHHHHHHHhccC-- Confidence 566666664 34444332111 11111122222233334556555442 111 12456788888888753 Q ss_pred CceeeEeee-ecCccccCCCCHHHHHHHHhCCceEEEEEcCceEEecCEeecC-----eehhHHHHHHHHHHHHHHHHHH Q lcl|NC_013597. 326 NSTLTLKFK-QQPTITADEITATEFAKAKRLGINVYTYFDDVAMIAEGTVIGG-----KFADEIVILDWFVDAVQKEVFA 399 (502) Q Consensus 326 ~g~~T~~fk-~~~Gv~~~~lt~t~~~~l~~~~~n~y~~~~~~~~~~~G~~~~G-----~~iD~~~~~dwl~~~iq~~l~~ 399 (502) +| + |+-- ....+.....+++|++.|..+++|...++.+.-.+|.++++.+ .||-+.+-.+|++..|+..+.. T Consensus 584 rG-v-w~SPANrgii~~~~~s~se~d~LN~~GINtIrsfG~G~rlWGnRTlassDs~wryInVRRlfd~Ie~SI~~a~q~ 661 (742) T protein:vir:58 584 TG-L-APVGARRGVVTGEPVRQVDWEDLYNNRINPIVRVGNDVLLFGQKTMLNVNSALNRINVRRLLIVMRNRISQILSS 661 (742) T ss_pred Cc-e-EecCCcceeeeccccchhhHHHHhhCCceEEEECCCcEEEEcceecCCCCcccceEeehhhHHHHHHHHHHHHHH Confidence 33 1 2211 1122333456889999999999999998754457888888855 3788999999999999998865 Q ss_pred HHHhcCCCCccCHHHHHHHHHHHHHHHHHHHHcCccccccccCccccccccccccccceEEEcCchhcCCHHHHhhcccC Q lcl|NC_013597. 400 RLYKSPTKIPLTDKGQAILIAAVEKVCLEGINNGAFAPGKWTGAGFGNLSTGDYLDKGFYVWAAPMDTLSDSDRQARRAT 479 (502) Q Consensus 400 ~l~~~~~kiPyt~~G~~~l~~~v~~vl~~a~~~G~I~~g~~~~~~~g~~~~~~~~~~gy~v~~~~~~~~s~~dra~R~~~ 479 (502) .++. |-++.-...|+..|+.-|+..+++|.|. ||.|.+. ++.++.|+.+.+. T Consensus 662 ~VfE-----PNd~~L~~sIk~sInafL~~L~aqGALl--------------------GfrV~lD--etNTpeDI~~Gkl- 713 (742) T protein:vir:58 662 YLFE-----NNTSENRLRAEALVRQYLESLRLRGAVT--------------------DYEVAID--SVTTPTDIDNNTL- 713 (742) T ss_pred hccC-----CCCHHHHHHHHHHHHHHHHHHHhCCcee--------------------eeEEEEc--CCCCHHHhhCCEE- Confidence 5432 6788889999999999999999999985 4888886 3577888888887 Q ss_pred ceEEEEEECceEEEEEEEEEEeC Q lcl|NC_013597. 480 PIQTAVKLAGAIHSSDVIVNYNR 502 (502) Q Consensus 480 ~i~~~~~~aGaIh~v~i~~~v~~ 502 (502) -+.+.+...-.+++|+++....| T Consensus 714 vv~I~vAP~~PAEfI~lrf~it~ 736 (742) T protein:vir:58 714 RARVTVQPARSIEYIDITFVITP 736 (742) T ss_pred EEEEEEEccCCcceEEEEEEEEe Confidence 58999999999999999999988 No 40 >protein:vir:107310 Length: 581 # NCBI annotation: gp123 # Family: family:all:1196 # MgeID: mge:1550 # MgeName: Catera # Cross-refs: genbank:acc:YP_656131;genbank:gi:109393333;genbank:GeneID:4156791 Probab=98.53 E-value=3.4e-07 Score=56.02 Aligned_cols=423 Identities=10% Similarity=0.011 Sum_probs=181.7 Q ss_pred CCcCcCceeEEeecccccccccccccceEEEecccccccccCccceEEecC-HHHHHhhcCCCcHHHHHHHHHhcCCCCc Q lcl|NC_013597. 1 MALSISHIVNVQLNTVPKSAARKSFGIVALFTPEAGQAFADEKTRYVYVEN-QRDVEQLFGTNSETAKAAQPFFAQSPRA 79 (502) Q Consensus 1 Msip~s~iV~V~i~~~~~~~~~~~f~~~lil~~~~~~~~~~~~~r~~~y~s-~~~v~~~fg~~s~ey~aA~~~F~q~p~P 79 (502) ++||-=.-.+|+++ ..++ ..+ - ..|.. ...+..++..-. ....| T Consensus 101 ~~L~~i~~~~v~v~--g~~g--~~~----~----------------VtF~g~~~~l~~~~~~lt-----------~g~~~ 145 (581) T protein:vir:10 101 RALPNVEDDEVTVL--GDPG--GPW----T----------------VTFTKAVAALTKDVTGLT-----------GGDDP 145 (581) T ss_pred hccCCCCcceEEEE--CCCC--ceE----E----------------EEEcCCccceeeeeceec-----------CCCce Confidence 55432222333332 1110 011 0 11110 000000000000 00011 Q ss_pred ceEEEEEeecccccceeeeeeccchhhhHHHHHhhcccceeEEEEecCcccccccccccc---ccchhhHH---HHHHhh Q lcl|NC_013597. 80 KQLIVARWQKSASTIEATKNTLSGATLSDDLERFKSVVNGRFSLTIGGDVKKVDGLSFAR---LADFNAVA---TKIQEK 153 (502) Q Consensus 80 ~~l~igr~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~iti~g~~~~~~~i~~s~---~ts~~~vA---~~i~aa 153 (502) .|.|++-.+.......++.. .-+......+.....+. +++.|........++.. +..-.|+- ..+-++ T Consensus 146 -~vtV~~~~~g~~~~~~~~s~---~gi~~~~~~l~~~~~~~--~~~~gsd~~~~~~~~~~~~~~~~~~D~~t~~~~~~g~ 219 (581) T protein:vir:10 146 -DLNIASEQTGVPAMNRALAK---KGIKTDTIRVVNPNSGQ--VYVLGTDYVVTRVNAGEDGEANTRDDLYTIQRVVDGG 219 (581) T ss_pred -eEEEeccccCcccccccccc---cccccccccccccccCc--ceeccccceeeecccCccccccccccceeeeeeeccc Confidence 23333211110000000000 00111111111111111 22233333332222211 11111111 111111 Q ss_pred hcccccceeEEE--ecccceeeEeeeccccccc-ceeeeeeccccch-hhhhhhhhhcccccceeeeecccc----cccc Q lcl|NC_013597. 154 LTTLSVAVSIAY--DETGNRFIVSANVAGEDKK-TEIDYAIDEGGEG-EYIGALLKLENGQASRKVGKNSVS----LKKE 225 (502) Q Consensus 154 l~~a~~~~tv~~--~~~~~~f~~~s~ttG~~~~-v~~~~a~~~~~t~-t~~aa~l~~t~~~~~~~v~v~~~~----~~~e 225 (502) +...+....+.+ .+.+.+-++....+-.... +.-.+....+... .-..+.+.++.... ..+.....+ ...+ T Consensus 220 ~~~~~~v~~~~~~~~d~~~~~~v~~~~~~~~~~~~~~~~~~~g~~~~~~t~~~~~~~tn~~~-~~l~~gvd~~g~tvt~~ 298 (581) T protein:vir:10 220 HIDPGDIVQLSYRYTDPNYHEVIRFTDPDDIQDFYGPAFDEAGNVQSEITLCAQLAITNGAS-TILACAVDPEGDTVTMG 298 (581) T ss_pred ccccceEEEEEEEeecCCcceeEEeecCcchhhhhhhhhhccCccccchhhhheeeeecccc-eeEEeeccCCCCccchH Confidence 111111111111 1111111111111100000 0000000001110 01111222232222 112211111 2233 Q ss_pred CHHHHHHHHHhccCceeEEEEecCCChhHHHHHHHHHhhcC-----CEEEEEecCchhcccchhHHHHHHHHccCCceEE Q lcl|NC_013597. 226 TLGEALFNVAEVNNTWYGFTVAAQLTDSEVEAAAKYAQANT-----KLFGANVIRAEQIEWSADNIYKKLYDAGLDHTLA 300 (502) Q Consensus 226 t~~~al~al~~~~~~w~~~~~~~~~~~~~~~a~a~w~~a~~-----~~~~~~~~~~~~~~~~~~~i~~~l~~~~~~~t~~ 300 (502) +..++|+++.++. ...+++++..+.+-+.+|..|++..+ ++-...+-... -...........+..+..|... T Consensus 299 dy~~Al~ale~~~--~~~ivv~~t~~~~v~a~l~ahv~~~s~~~~~~ravigV~g~~-~~~~~~~~~~~a~~~n~~Rvvl 375 (581) T protein:vir:10 299 DYQNALNKFRDED--EIAIIVAGTGAQPIQALVQQHVSAQSNNKYERRAILGMDGSV-TPVPSATRIANAQSIKDQRVAL 375 (581) T ss_pred HHHHHHHHHhcCC--ceEEEEeCCCCHHHHHHHHHHHHHHHhccCCcEEEEEecCCC-CCccHHHHHHhhccCCCceEEE Confidence 5678888888753 22234555433343466888886531 22222111000 0011112222333456677776 Q ss_pred EecC------C--------ccc-hHHHHHHHHHhcCCCCCCceeeEeeeecCccccC--CCCHHHHHHHHhCCceEEEEE Q lcl|NC_013597. 301 MFDK------N--------DMY-PVSSALARLLSTNFAANNSTLTLKFKQQPTITAD--EITATEFAKAKRLGINVYTYF 363 (502) Q Consensus 301 ~y~~------~--------~~~-~~aa~~g~~as~n~~~~~g~~T~~fk~~~Gv~~~--~lt~t~~~~l~~~~~n~y~~~ 363 (502) ++.. + ..| .++.+.|..+..++. ..+-||.++|+..- .++.+|++.|..+|++.+... T Consensus 376 v~p~~~~~~g~~~~~~v~lp~y~~AA~vAGl~a~~~~~-----~slT~~~i~gi~~l~~~~s~~e~e~ll~~Gv~~l~~~ 450 (581) T protein:vir:10 376 ISPSSFVYYAPELNREVVLGGQFMAAAVAGKSVSAIAA-----MPLTRKVIRGFSGPAEVQRDGEKSRESSEGLMVIEKT 450 (581) T ss_pred EecCceeecCcccCceeccchhhHHHHHHHHhhccccc-----cCcccccccccccccccCCHHHHHHHHhCCeEEEEEe Confidence 6531 1 112 345555666665543 34677888887633 679999999999999999986 Q ss_pred cCce-EEecCEeec-----CeehhHHHHHHHHHHHHHHHHHHHHHhcCCCCccCHHHHHHHHHHHHHHHHHHHHcCcccc Q lcl|NC_013597. 364 DDVA-MIAEGTVIG-----GKFADEIVILDWFVDAVQKEVFARLYKSPTKIPLTDKGQAILIAAVEKVCLEGINNGAFAP 437 (502) Q Consensus 364 ~~~~-~~~~G~~~~-----G~~iD~~~~~dwl~~~iq~~l~~~l~~~~~kiPyt~~G~~~l~~~v~~vl~~a~~~G~I~~ 437 (502) .+.. .+.+|...- .+.|-.++-.|.+...+++.+-...|- +| |=++.|...|++.+++.|.+..++|+|.. T Consensus 451 ~~~~v~Iv~gItT~~s~~~~~~i~~iR~~D~v~~~ir~~~~~~~fI--G~-~n~~~~r~~ik~~i~~~L~~l~~~g~I~~ 527 (581) T protein:vir:10 451 PRNLVHVRHGVTTDPTSLHTREWNIIGQQDVMVYRIRDYLDADGLI--GM-PIYDTTIVQVKASAEAALVWLVDNNIIRG 527 (581) T ss_pred cCCeEEEEeeeecCCCCCcceeeeeehhhhHHHHHHHHHhhhhcCC--Cc-ccCHHHHHHHHHHHHHHHHHHHhcCcccC Confidence 5543 455666542 245778999999999999887322233 34 78899999999999999999999999962 Q ss_pred ccccCccccccccccccccceEEEcCchhcCCHHHHhhcccCceEEEEEECceEEEEEEEEEEeC Q lcl|NC_013597. 438 GKWTGAGFGNLSTGDYLDKGFYVWAAPMDTLSDSDRQARRATPIQTAVKLAGAIHSSDVIVNYNR 502 (502) Q Consensus 438 g~~~~~~~g~~~~~~~~~~gy~v~~~~~~~~s~~dra~R~~~~i~~~~~~aGaIh~v~i~~~v~~ 502 (502) |... + .++.++.. -.-.+.|.+...-+|++|.+++.++- T Consensus 528 --------------------~~~~----~-~~~~~~~~-d~v~V~i~v~Pv~~i~~I~vti~~~p 566 (581) T protein:vir:10 528 --------------------YRNL----K-ARQIERQP-DVIEVRYEWRPAYPLNYIVVRYSIAP 566 (581) T ss_pred --------------------Cccc----e-eeeeecCC-CEEEEEEEEEecccceEEEEEEEEec Confidence 2100 0 01112221 12258899999999999999999888 No 41 >protein:vir:80984 Length: 666 # NCBI annotation: gp18 tail sheath protein # Family: family:all:661 # MgeID: mge:1888 # MgeName: Phi1 # Cross-refs: genbank:acc:YP_001469499;genbank:gi:157311456;genbank:GeneID:5602117 Probab=98.45 E-value=6.2e-07 Score=54.61 Aligned_cols=440 Identities=10% Similarity=0.005 Sum_probs=171.8 Q ss_pred CCcC----------cCceeEE----eeccccccccc-ccccceEEEecccccccccCccceEEecCHHHHHhhcCCCcHH Q lcl|NC_013597. 1 MALS----------ISHIVNV----QLNTVPKSAAR-KSFGIVALFTPEAGQAFADEKTRYVYVENQRDVEQLFGTNSET 65 (502) Q Consensus 1 Msip----------~s~iV~V----~i~~~~~~~~~-~~f~~~lil~~~~~~~~~~~~~r~~~y~s~~~v~~~fg~~s~e 65 (502) ...| +-.+-.+ +......+.+. ......-++.......+ .........-...+....+. T Consensus 143 ~~~~ta~~~~~a~~~~~~~~v~~~~~~~~~~~~~~~~~a~~V~~~~~~~~~~~~------~~~~a~~~~t~~~~~~~~~~ 216 (666) T protein:vir:80 143 VFIPTGKIIAHAKAIGVYPELDGDWTAEFTSSSGNGSAALSVTKIVTDSGLLLT------DLETSRANITNQTFLTKLQK 216 (666) T ss_pred eecchhhhccccccccccceeeccceeeeccccccceeeeeeeeeecCCcccee------eecccccccccccccccccc Confidence 1101 0000000 00000000000 00000000000000000 00000000000000000000 Q ss_pred H---HHHHHHhcCCCCcceEEEEEeeccc---ccceeeeeeccchhhh-HHHHHhhcccceeEEEEecCccccccccccc Q lcl|NC_013597. 66 A---KAAQPFFAQSPRAKQLIVARWQKSA---STIEATKNTLSGATLS-DDLERFKSVVNGRFSLTIGGDVKKVDGLSFA 138 (502) Q Consensus 66 y---~aA~~~F~q~p~P~~l~igr~~~~~---~~~~~~~~~~~~~~~~-~~~~~~~~~~~g~~~iti~g~~~~~~~i~~s 138 (502) . ..+.++-+.. ...+.+.-..... ....++...+...... ..........+..+.+.+.......+...++ T Consensus 217 ~~~~a~~a~~~g~~--g~~l~v~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~e~~~~~ 294 (666) T protein:vir:80 217 YDMPAVSAIYAGEI--GNSLEVEILARSAFKNTAPDLTMYPYGGERTAARNLIPYAPQNDNQYAFIVRRDGVVVESYVLS 294 (666) T ss_pred ccchhhhhhccccc--ccceeeeeccccccccccccceeeeccccccccceeeeeccccccceeeEeccCCccceeeecc Confidence 0 0011111111 1111111000000 0000000000000000 0000000111222222221111111111111 Q ss_pred cccchhhH---HHHHHhhhcccccceeEEEecccceeeEeeecccccccceeeeeeccccchhhhhhhhhhcccccceee Q lcl|NC_013597. 139 RLADFNAV---ATKIQEKLTTLSVAVSIAYDETGNRFIVSANVAGEDKKTEIDYAIDEGGEGEYIGALLKLENGQASRKV 215 (502) Q Consensus 139 ~~ts~~~v---A~~i~aal~~a~~~~tv~~~~~~~~f~~~s~ttG~~~~v~~~~a~~~~~t~t~~aa~l~~t~~~~~~~v 215 (502) ........ ...+...+.. +....+.. ...+.............+.+.... ....+ T Consensus 295 ~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~-----------~~~~~~~~~~~~~~~~~g~~~~~~----------~~~~~ 352 (666) T protein:vir:80 295 TLKGDKDVYGNSIYMDDFFGR-GSSQYIYA-----------TAQGWVDGFSGIISLAGGVSANEA----------TTGGV 352 (666) T ss_pred cccccccccchhhhhhhhhcc-ccceeeee-----------cccccccccceEEEecCCCCcccc----------ccccc Confidence 11110000 0000000000 00000000 000000000000000000000000 00000 Q ss_pred eeccccccccCHHHHHHHHHhccCceeEEEEecCC------ChhHHHHHHHHHhhcCCEEEEEecCc----hhc-ccchh Q lcl|NC_013597. 216 GKNSVSLKKETLGEALFNVAEVNNTWYGFTVAAQL------TDSEVEAAAKYAQANTKLFGANVIRA----EQI-EWSAD 284 (502) Q Consensus 216 ~v~~~~~~~et~~~al~al~~~~~~w~~~~~~~~~------~~~~~~a~a~w~~a~~~~~~~~~~~~----~~~-~~~~~ 284 (502) +............. +-++.+. .++. ++++... ...-..++...++....+|.+..... +.. ..... T Consensus 353 ~~~~~~g~~~~~~~-~~~~~~~-~~~~-~l~~p~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~vd~~~~~~~~ 429 (666) T protein:vir:80 353 GADPFIGAMMQGWG-LFAERES-IHVN-LLIAGACAGEGDAFSTVQKHAVSIGDERQDCLVMVSPPRSTVVNIPVTTAID 429 (666) T ss_pred ccccccccchhhhh-hhhhhcc-cccc-eEeecCcCCcccchHHHHHHHHHHHHhhcceEEEeecceeEEeecCCCCCHH Confidence 00000000001111 1122221 1222 2333211 12333455566665544443221100 000 11122 Q ss_pred HHHHHHHHc--------cC--CceEEEecC-------Cc-----cchHHHHHHHHHhcCCCCCCceeeEeeeecCccc-- Q lcl|NC_013597. 285 NIYKKLYDA--------GL--DHTLAMFDK-------ND-----MYPVSSALARLLSTNFAANNSTLTLKFKQQPTIT-- 340 (502) Q Consensus 285 ~i~~~l~~~--------~~--~~t~~~y~~-------~~-----~~~~aa~~g~~as~n~~~~~g~~T~~fk~~~Gv~-- 340 (502) ++....... ++ .|.. +|++ .+ -.+.+.++|.++.+|..+.+. .....|.+.||. T Consensus 430 ~~~~~~~~~~~~~~~~~~~~s~~~~-l~~p~~~~~d~~~~~~~~~p~sg~~AGl~Ar~D~~~g~~-~sPan~~~~~i~g~ 507 (666) T protein:vir:80 430 NLIAWREGSGNYNENNMNINTTYAV-IDGNYKYQYDKYNDVNRWVPLAADIAGLCARTDAVSQPW-MSPAGYNRGQIMNV 507 (666) T ss_pred HHHHHHHhcccchhhhcccCcceEE-EEcCceEEecccCCceeEechHHHHHHHHHHHhhcCCce-EccCCeecceeecc Confidence 333222221 12 2333 3332 11 124567788888887644211 112244444442 Q ss_pred ---cCCCCHHHHHHHHhCCceEEEEEcCc-eEEecCEeecCe-----ehhHHHHHHHHHHHHHHHHHHHHHhcCCCCccC Q lcl|NC_013597. 341 ---ADEITATEFAKAKRLGINVYTYFDDV-AMIAEGTVIGGK-----FADEIVILDWFVDAVQKEVFARLYKSPTKIPLT 411 (502) Q Consensus 341 ---~~~lt~t~~~~l~~~~~n~y~~~~~~-~~~~~G~~~~G~-----~iD~~~~~dwl~~~iq~~l~~~l~~~~~kiPyt 411 (502) .-.+++.|.+.|..+|+|.+.++.+. ..+|.++++++. ||-+.+-.+|+...|+..+...++. |.+ T Consensus 508 ~~~~~~~~~~~~~~Ln~~gIn~i~~~~g~G~~~wG~rT~~~~~s~~~~i~vRRl~~~i~~si~~~~~~~v~e-----pn~ 582 (666) T protein:vir:80 508 VKLAIEPRKAHRDRLYQAAINPVIGAGGEGFILMGDKTATTVPSPFDRINVRRLFNMLKKNIGDSSKYKLFE-----NND 582 (666) T ss_pred ccceeecChhHHHhhhhCCeeEEEEeCCCeEEEEccccCCCCCcccceeehhhHHHHHHHHHHHHHHHhccC-----CCC Confidence 12568999999999999999999875 578899998872 6788889999999999998765543 667 Q ss_pred HHHHHHHHHHHHHHHHHHHHcCccccccccCccccccccccccccceEEEcCchhcCCHHHHhhcccCceEEEEEECceE Q lcl|NC_013597. 412 DKGQAILIAAVEKVCLEGINNGAFAPGKWTGAGFGNLSTGDYLDKGFYVWAAPMDTLSDSDRQARRATPIQTAVKLAGAI 491 (502) Q Consensus 412 ~~G~~~l~~~v~~vl~~a~~~G~I~~g~~~~~~~g~~~~~~~~~~gy~v~~~~~~~~s~~dra~R~~~~i~~~~~~aGaI 491 (502) +.=...|+..|+.-|++.+++|.|. ||.|.++ .++.+++|+.+.+. -+.+.++..-.+ T Consensus 583 ~~l~~~i~~~i~~~L~~l~~~gal~--------------------g~~V~~d-~~~nt~~di~~G~~-~~~i~~~P~~Pa 640 (666) T protein:vir:80 583 NFTRASFRMEVSQYLSTIRSLGGIY--------------------DFRVQCD-TTNNTPDVIDRNEF-VASMFIKPAKSI 640 (666) T ss_pred HHHHHHHHHHHHHHHHHHHhcCcee--------------------eeEEEEc-CCCCCHHHhhCCeE-EEEEEEEecCCc Confidence 8888999999999999999999985 4899998 67889999999998 699999999999 Q ss_pred EEEEEEEEEeC Q lcl|NC_013597. 492 HSSDVIVNYNR 502 (502) Q Consensus 492 h~v~i~~~v~~ 502 (502) ++|.+++.-.| T Consensus 641 e~I~~~~~~~~ 651 (666) T protein:vir:80 641 NYIMLNFTAVA 651 (666) T ss_pred ceEEEEEEEee Confidence 99999987555 No 42 >protein:vir:80779 Length: 569 # NCBI annotation: putative tail sheath protein # Family: family:all:2449 # MgeID: mge:1885 # MgeName: phiEF24C # Cross-refs: genbank:acc:YP_001504132;genbank:gi:158079319;genbank:GeneID:5666428 Probab=98.35 E-value=1.2e-06 Score=53.12 Aligned_cols=449 Identities=10% Similarity=0.071 Sum_probs=218.1 Q ss_pred CCcCc---Cce----eEEeeccc-ccccccccccceEEEecccccccccCccceEEecCHHHHHhhcCCCcHHHHHHHHH Q lcl|NC_013597. 1 MALSI---SHI----VNVQLNTV-PKSAARKSFGIVALFTPEAGQAFADEKTRYVYVENQRDVEQLFGTNSETAKAAQPF 72 (502) Q Consensus 1 Msip~---s~i----V~V~i~~~-~~~~~~~~f~~~lil~~~~~~~~~~~~~r~~~y~s~~~v~~~fg~~s~ey~aA~~~ 72 (502) |++.+ .++ |-|.+.-+ ..+....+.+.+.|+|....-++ .++..+++.++..+-||... .-.|..++ T Consensus 1 ~~~~~~~~~~~~~Pgv~~~~~~~~~~~~~~~~~~~~~~ig~a~~G~~----~~~~~~~~~~~~~~~f~~g~-l~~a~~~a 75 (569) T protein:vir:80 1 MAVEQFPRKKVSRPHTEITVDTSGIGGSSSSSDKTLMLVGSAKGGKP----DTVYRFRNYQQAKQVLRSGD-LLDAIELA 75 (569) T ss_pred CeeeeecCCccccCceEEEEecCCCcCCCCCCceeEEEEEEeCCCCC----ceeEEecCHHHHHHHhcCCc-hhHHHHhh Confidence 88732 222 22333222 34466667778888988766554 35677788889899998754 66777888 Q ss_pred hcCC----CCcceEEEEEeecccccceeeeeecc---------chhhhHHHHHhhccccee------------------- Q lcl|NC_013597. 73 FAQS----PRAKQLIVARWQKSASTIEATKNTLS---------GATLSDDLERFKSVVNGR------------------- 120 (502) Q Consensus 73 F~q~----p~P~~l~igr~~~~~~~~~~~~~~~~---------~~~~~~~~~~~~~~~~g~------------------- 120 (502) |+-. -.|.++|+=|-.. +..+.++.+.+. +..+.-.+.. .-..++ T Consensus 76 ~~~~~~~~~~~~~~~~~rv~~-a~~a~~~~~~~~~~a~~~g~~~n~i~v~l~~--~~~~~~~~~~v~~~~~~~~~~~~~i 152 (569) T protein:vir:80 76 WNASDVNTASAGDILAVRVED-AKNATLTKGGLTFASTIYGVDANEIQVALED--NNLTHTKRLTVAFSKDGYKKVFDNL 152 (569) T ss_pred ccCccccccCceEEEEEEcCC-CeeeeeeccceeeeeeeccCCCceEEEEEec--CcCCcceeeEEeeecCCCccccccc Confidence 8643 3577888776533 223333322111 0000000000 000011 Q ss_pred ---EEEEecCcccccc-----------c--cccccccc------h---------hhHHHHHHhhhcc-cccceeEEEecc Q lcl|NC_013597. 121 ---FSLTIGGDVKKVD-----------G--LSFARLAD------F---------NAVATKIQEKLTT-LSVAVSIAYDET 168 (502) Q Consensus 121 ---~~iti~g~~~~~~-----------~--i~~s~~ts------~---------~~vA~~i~aal~~-a~~~~tv~~~~~ 168 (502) ++|+..|+..... + +-+..... + ...+..+.+.+.. .++.+.+... . T Consensus 153 g~v~si~ytg~~~~a~~~~~~~~~~~~a~~l~~~~g~~~~~~~~v~~~~~~~~~~~~~~~lv~~~~~~~~f~a~~~~~-~ 231 (569) T protein:vir:80 153 GKIFSIQYKGSEAQANFTIAQDSISKKATTLTLNVGSEPESTTEVMKYELGQGVYSETNVLVSAINSLPDWEAKFFPI-G 231 (569) T ss_pred cceeeEEEeeccccceEEeecCcCcceeEEEEEEecCCcceeEEEEeeccCCccchhhhhhhhhcCCccCceEEEEec-C Confidence 1121111111100 0 00000000 0 0000011111100 0000100000 0 Q ss_pred cceeeEeeecccccccceeeee-eccccchhhhhhh------hhhcc-cccc------eeeeeccccccccCHHHHHHHH Q lcl|NC_013597. 169 GNRFIVSANVAGEDKKTEIDYA-IDEGGEGEYIGAL------LKLEN-GQAS------RKVGKNSVSLKKETLGEALFNV 234 (502) Q Consensus 169 ~~~f~~~s~ttG~~~~v~~~~a-~~~~~t~t~~aa~------l~~t~-~~~~------~~v~v~~~~~~~et~~~al~al 234 (502) ++.... ..-.....+.+... ........++... ..++. +.++ ..+.-...+...++..++|+++ T Consensus 232 ~~~~~~--~~~d~~~~~~~~t~~~~~~~~~~di~~~~~~~~~v~~~~~~~~~l~~~~~~~LtGG~dG~~~~~~~~~l~~l 309 (569) T protein:vir:80 232 DKNLPT--DALEAVTKVDVKTEAVFVGALAGDIAKQLEYNDYVTVAVDATKPVEDFELTNLTGGSDGTAPESWANKFPLL 309 (569) T ss_pred CCccee--hhccchhheeccccceeeehhHHHHHHhhcCCceEEEEecCCcceeeecceeecCCCCCCccchHHHHHHHH Confidence 000000 00000000000000 0000000011110 00000 0000 0111112233344678899988 Q ss_pred HhccCceeEEEEecCCChhHHHHHHHHHhhc----CCEEEEEecCchhcccchhHHHHHHHHccCCceEEEecC------ Q lcl|NC_013597. 235 AEVNNTWYGFTVAAQLTDSEVEAAAKYAQAN----TKLFGANVIRAEQIEWSADNIYKKLYDAGLDHTLAMFDK------ 304 (502) Q Consensus 235 ~~~~~~w~~~~~~~~~~~~~~~a~a~w~~a~----~~~~~~~~~~~~~~~~~~~~i~~~l~~~~~~~t~~~y~~------ 304 (502) ... +|+.+.+ ...+.+.+.++..|++.. +++..+...... .....+....+..++.|.+.++.. T Consensus 310 e~~--~~~~i~~-~t~d~av~~~l~a~vkr~r~~g~~~~aVvg~~~~---~~~~~~~~~a~~~n~e~vv~v~~~~~~~~~ 383 (569) T protein:vir:80 310 ANE--GGYYLVP-LTDKQAVHSEALAFVKDRTDNGDPMRIIVGGGTN---ETVEESITRATNLRDPRASLVGFSGTRKMD 383 (569) T ss_pred hhC--CcEEEEe-cCCChHHHHHHHHHHHHHHhCCCcEEEEecCCCC---CCHHHHHHHHhhcCCCeEEEEecCceeecC Confidence 875 4554443 333556678899999853 234444332211 122334445556677777665421 Q ss_pred C---ccc----hHHHHHHHHHhcCCCCCCceeeEeeeecCcccc-CCCCHHHHHHHHhCCceEEEEEcCce----EEecC Q lcl|NC_013597. 305 N---DMY----PVSSALARLLSTNFAANNSTLTLKFKQQPTITA-DEITATEFAKAKRLGINVYTYFDDVA----MIAEG 372 (502) Q Consensus 305 ~---~~~----~~aa~~g~~as~n~~~~~g~~T~~fk~~~Gv~~-~~lt~t~~~~l~~~~~n~y~~~~~~~----~~~~G 372 (502) + ..| ..+.+.|..++.+++. ++| ||.++++.. ..++.+|++.+..+|++.+....+.. ..-++ T Consensus 384 ~g~~~~~~~~~~aa~vAG~~A~~~~~~---S~T--~k~i~~~~i~~~lt~~e~~~li~~G~~~l~~~~~~~~~v~~~vn~ 458 (569) T protein:vir:80 384 DGRLLKLPGYMMASQIAGIASGLEVGE---AIT--FKHFNVTSVDRVFESSQLDMLNESGVISIEFVRNRTLTAFRVVQD 458 (569) T ss_pred CCcceeechhhHHHHHHHHHhcCcccc---Ccc--ceeeccccccccCCHHHHHHHHhCCeEEEEEecCceEEEEEEecc Confidence 0 122 2345556667766544 333 455553332 36899999999999999998765432 12244 Q ss_pred EeecC-----ee--hhHHHHHHHHHHHHHHHHHHHHHhcCCCCccCHHHHHHHHHHHHHHHHHHHHcCccccccccCccc Q lcl|NC_013597. 373 TVIGG-----KF--ADEIVILDWFVDAVQKEVFARLYKSPTKIPLTDKGQAILIAAVEKVCLEGINNGAFAPGKWTGAGF 445 (502) Q Consensus 373 ~~~~G-----~~--iD~~~~~dwl~~~iq~~l~~~l~~~~~kiPyt~~G~~~l~~~v~~vl~~a~~~G~I~~g~~~~~~~ 445 (502) .+.-+ .| |-.++-.|.+...++..+-+. +- +| |=++.|...|++.+++.|++..+.|.|. |.... T Consensus 459 itT~t~~~~~~~~~i~viRv~D~i~~dir~~~~~~-yi--Gk-~nn~~~r~~v~~~i~~~L~~l~~~gaI~-~~~~~--- 530 (569) T protein:vir:80 459 VTTYNDKSDPVKNEMSVGEANDFLVSELKIELDNN-FI--GT-KVIDTSASLIKNFIQSFLDNKKRAREIQ-DYTPE--- 530 (569) T ss_pred ceecCCCCCchhhhhhhhHHHHHHHHHHHHHHHhh-cC--cc-cCChhHHHHHHHHHHHHHHHHHhCCccc-CCCcc--- Confidence 44422 24 778889999999998775333 33 45 6788999999999999999999999995 21100 Q ss_pred cccccccccccceEEEcCchhcCCHHHHhhcccCceEEEEEECceEEEEEEEEEEeC Q lcl|NC_013597. 446 GNLSTGDYLDKGFYVWAAPMDTLSDSDRQARRATPIQTAVKLAGAIHSSDVIVNYNR 502 (502) Q Consensus 446 g~~~~~~~~~~gy~v~~~~~~~~s~~dra~R~~~~i~~~~~~aGaIh~v~i~~~v~~ 502 (502) + ..+.. .+| |. -+.|.+..--++++|.+++++.+ T Consensus 531 ------d-----v~v~~-------~~d---~~--~v~~~v~Pv~~~ekI~~ti~~~~ 564 (569) T protein:vir:80 531 ------E-----VQVVL-------EGD---VA--SISMTVMPIRSLNKITVQLVYKQ 564 (569) T ss_pred ------c-----eEEEe-------cCC---EE--EEEEEEEEcccccEEEEEEEEee Confidence 0 11111 112 22 48899999999999999999999 No 43 >protein:vir:7206 Length: 659 # NCBI annotation: gp18 tail sheath protein # Family: family:all:661 # MgeID: mge:142 # MgeName: T4 # Cross-refs: genbank:acc:NP_049780;genbank:gi:9632592;genbank:GeneID:1258597 Probab=98.30 E-value=1.5e-06 Score=52.47 Aligned_cols=465 Identities=14% Similarity=0.068 Sum_probs=218.3 Q ss_pred CCcCcCceeEEeecccccccccccccceEEEecccccccccCccceEEecCHHHHHhhcC---CCcHHHHHHHHHhcCCC Q lcl|NC_013597. 1 MALSISHIVNVQLNTVPKSAARKSFGIVALFTPEAGQAFADEKTRYVYVENQRDVEQLFG---TNSETAKAAQPFFAQSP 77 (502) Q Consensus 1 Msip~s~iV~V~i~~~~~~~~~~~f~~~lil~~~~~~~~~~~~~r~~~y~s~~~v~~~fg---~~s~ey~aA~~~F~q~p 77 (502) |.+ ++.=|-|.---.+.......-+...|+|...--|+. + -+.-+|..|....|| ..+.++.+...||-+-- T Consensus 1 ~~~-~~PgVyvee~~~~~~~~~~~ts~~~fvG~~~~Gp~~---~-p~~i~s~~~~~~~fG~~~~~~~~~~~~~~~f~ngg 75 (659) T protein:vir:72 1 MTL-LSPGIELKETTVQSTVVNNSTGTAALAGKFQWGPAF---Q-IKQVTNEVDLVNTFGQPTAETADYFMSAMNFLQYG 75 (659) T ss_pred Cce-ecCceEEEEecCCcccccCCCcceEEEeecCCCCCc---c-cEEecCHHHHHHHcCCcCCCCchhHHHHHHHHhCC Confidence 887 566555544333333333356678898887655542 3 455667999999999 56778888888886544 Q ss_pred CcceEEEEEeecccccceee--eeecc------ch--hhhHHHH-Hhhc---ccceeEE-EEecCcccccc---cccccc Q lcl|NC_013597. 78 RAKQLIVARWQKSASTIEAT--KNTLS------GA--TLSDDLE-RFKS---VVNGRFS-LTIGGDVKKVD---GLSFAR 139 (502) Q Consensus 78 ~P~~l~igr~~~~~~~~~~~--~~~~~------~~--~~~~~~~-~~~~---~~~g~~~-iti~g~~~~~~---~i~~s~ 139 (502) . ++||-|........... ...+. +. ....... .+.. ...+... +..++...... ..+++. T Consensus 76 ~--~~~vvRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~g~v~~~~~~~~~~~~~v~t~~~~a~ 153 (659) T protein:vir:72 76 N--DLRVVRAVDRDTAKNSSPIAGNIDYTISTPGSNYAVGDKITVKYVSDDIETEGKITEVDADGKIKKINIPTGKNYAK 153 (659) T ss_pred c--eEEEEEccCCcccccccccccccceeecccccccccceeeeeeeccccccccceEEEeeccccceeeeecccccccc Confidence 3 57777764322111100 00000 00 0000000 0000 0001110 00111000000 000000 Q ss_pred ccchhhHHHHH-------HhhhcccccceeEE--EecccceeeEee----------------------ecccccccc--e Q lcl|NC_013597. 140 LADFNAVATKI-------QEKLTTLSVAVSIA--YDETGNRFIVSA----------------------NVAGEDKKT--E 186 (502) Q Consensus 140 ~ts~~~vA~~i-------~aal~~a~~~~tv~--~~~~~~~f~~~s----------------------~ttG~~~~v--~ 186 (502) ........... ............+. ............ ......... . T Consensus 154 ~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~v~v~~~~~~~~~~v~~~~~a~~~~~~~~~v~~~~~~~~~a~~~gt~g~~ 233 (659) T protein:vir:72 154 AKEVGEYPTLGSNWTAEISSSSSGLAAVITLGKIITDSGILLAEIENAEAAMTAVDFQANLKKYGIPGVVALYPGELGDK 233 (659) T ss_pred ccccccccccccceeeEEeeccccccceEEEEEeecCcceeeeeccccchhhhcccccccccccccceeeeccccccccc Confidence 00000000000 00000000000000 000000000000 000000000 0 Q ss_pred eeeeeccccch---------------h---hhhh--------------------------------------------hh Q lcl|NC_013597. 187 IDYAIDEGGEG---------------E---YIGA--------------------------------------------LL 204 (502) Q Consensus 187 ~~~a~~~~~t~---------------t---~~aa--------------------------------------------~l 204 (502) ........... . .... .. T Consensus 234 ~tv~i~~~~~~~~~~~~~v~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 313 (659) T protein:vir:72 234 IEIEIVSKADYAKGASALLPIYPGGGTRASTAKAVFGYGPQTDSQYAIIVRRNDAIVQSVVLSTKRGEKDIYDSNIYIDD 313 (659) T ss_pred eeEEEccccccccceeeeeecccccccccccceeeeeeecccccccceeeecccceeeeeeeeeccccccccchhhhhhh Confidence 00000000000 0 0000 00 Q ss_pred hhcccccceeeeecc--------------ccc------cccCHHHHHHHHHhcc-CceeEEEEecC--CChhH----HHH Q lcl|NC_013597. 205 KLENGQASRKVGKNS--------------VSL------KKETLGEALFNVAEVN-NTWYGFTVAAQ--LTDSE----VEA 257 (502) Q Consensus 205 ~~t~~~~~~~v~v~~--------------~~~------~~et~~~al~al~~~~-~~w~~~~~~~~--~~~~~----~~a 257 (502) .+.. .+...+.... .+. ...+...++..+.... .+...+++... .+.++ ..+ T Consensus 314 ~~~~-~~~~~v~~~~~~~~~~~~~~~~l~gg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~p~~~~~~~~~~~~v~~~ 392 (659) T protein:vir:72 314 FFAK-GGSEYIFATAQNWPEGFSGILTLSGGLSSNAEVTAGDLMEAWDFFADRESVDVQLFIAGSCAGESLETASTVQKH 392 (659) T ss_pred hhhc-CCceEEEEEecccCCcccccccccccccccccccchhHHHHHHHhhhccccceeEEEecCCCCcchhhhHHHHHH Confidence 0000 0000000000 000 0011223333333221 12322222221 11122 233 Q ss_pred HHHHHhhcCCEEEEEecCchh-c---c-cchhHHHHHHHHc--------c--CCceEEEecC-------Cc-----cchH Q lcl|NC_013597. 258 AAKYAQANTKLFGANVIRAEQ-I---E-WSADNIYKKLYDA--------G--LDHTLAMFDK-------ND-----MYPV 310 (502) Q Consensus 258 ~a~w~~a~~~~~~~~~~~~~~-~---~-~~~~~i~~~l~~~--------~--~~~t~~~y~~-------~~-----~~~~ 310 (502) +...++....+|.+....... + . ...+++....... + ..|. .+|++ .+ -.+. T Consensus 393 l~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~s~~~-~~~~p~~~~~d~~~~~~~~~p~s 471 (659) T protein:vir:72 393 VVSIGDARQDCLVLCSPPRETVVGIPVTRAVDNLVNWRTAAGSYTDNNFNISSTYA-AIDGNHKYQYDKYNDVNRWVPLA 471 (659) T ss_pred HHHHHhhhCCEEEEEcCccccccCCCcccCHHHHHHHHhhccccccccccccceeE-EEEcCceeeccccCCceEEechH Confidence 445555555555544211111 0 1 1122222222111 1 2233 34443 11 1245 Q ss_pred HHHHHHHHhcCCCCCCceeeEeeeecCccc-----cCCCCHHHHHHHHhCCceEEEEEcCc-eEEecCEeecCe-----e Q lcl|NC_013597. 311 SSALARLLSTNFAANNSTLTLKFKQQPTIT-----ADEITATEFAKAKRLGINVYTYFDDV-AMIAEGTVIGGK-----F 379 (502) Q Consensus 311 aa~~g~~as~n~~~~~g~~T~~fk~~~Gv~-----~~~lt~t~~~~l~~~~~n~y~~~~~~-~~~~~G~~~~G~-----~ 379 (502) +.++|.++.+|.++.+ ......|.+.||. ...+++.|.+.|..+++|...++.+. ..+|..++++++ | T Consensus 472 g~vAGl~Ar~D~~~G~-~~span~~~~~i~g~~~~~~~~~~~~~~~Ln~~gIn~i~~~~g~G~~~wG~rT~~~~~s~~~~ 550 (659) T protein:vir:72 472 ADIAGLCARTDNVSQT-WMSPAGYNRGQILNVIKLAIETRQAQRDRLYQEAINPVTGTGGDGYVLYGDKTATSVPSPFDR 550 (659) T ss_pred HHHHHHHHHhhccCCc-EEccCCeeeceeeccccccccCChhHHHHHhhCCceEEEEecCCeEEEEcccccCCCCcccce Confidence 7788899988764421 1222344444432 22468999999999999999998765 477888888762 7 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHhcCCCCccCHHHHHHHHHHHHHHHHHHHHcCccccccccCccccccccccccccceE Q lcl|NC_013597. 380 ADEIVILDWFVDAVQKEVFARLYKSPTKIPLTDKGQAILIAAVEKVCLEGINNGAFAPGKWTGAGFGNLSTGDYLDKGFY 459 (502) Q Consensus 380 iD~~~~~dwl~~~iq~~l~~~l~~~~~kiPyt~~G~~~l~~~v~~vl~~a~~~G~I~~g~~~~~~~g~~~~~~~~~~gy~ 459 (502) |-+.+-.+|+...|+..+...++. |.++.=...|+..|+.-|++.+++|.|. +|. T Consensus 551 i~vrR~~~~i~~si~~~~~~~v~e-----~n~~~l~~~i~~~i~~fL~~l~~~gal~--------------------~~~ 605 (659) T protein:vir:72 551 INVRRLFNMLKTNIGRSSKYRLFE-----LNNAFTRSSFRTETAQYLQGNKALGGIY--------------------EYR 605 (659) T ss_pred EeehhHHHHHHHHHHHHHHHhhcC-----CCCHHHHHHHHHHHHHHHHHHHhcCcee--------------------eEE Confidence 888899999999999998665533 6788888999999999999999999983 589 Q ss_pred EEcCchhcCCHHHHhhcccCceEEEEEECceEEEEEEEEEEeC Q lcl|NC_013597. 460 VWAAPMDTLSDSDRQARRATPIQTAVKLAGAIHSSDVIVNYNR 502 (502) Q Consensus 460 v~~~~~~~~s~~dra~R~~~~i~~~~~~aGaIh~v~i~~~v~~ 502 (502) |.++ .++.+++|+.+.+. .+.+.+...-.+++|.+++.-.| T Consensus 606 V~~d-~~~nt~~~i~~G~~-~~~i~~~p~~pae~I~~~~~~~~ 646 (659) T protein:vir:72 606 VVCD-TTNNTPSVIDRNEF-VATFYIQPARSINYITLNFVATA 646 (659) T ss_pred EEEc-CCCCCHHHhhCCeE-EEEEEEEecCCccEEEEEEEEee Confidence 9998 67889999999988 59999999999999999987555 No 44 >protein:vir:98824 Length: 774 # NCBI annotation: putative phage tail sheath protein # Family: family:all:661 # MgeID: mge:1530 # MgeName: Ma-LMM01 # Cross-refs: genbank:acc:YP_851105;genbank:gi:117530262;genbank:GeneID:4484488 Probab=98.27 E-value=1.8e-06 Score=52.10 Aligned_cols=442 Identities=10% Similarity=-0.007 Sum_probs=189.7 Q ss_pred CCcCcCc---eeEEeecccccccc-cccccceEEEecccccccccCccceEEecCHHHHHhhc----CCCcHHHHHHHHH Q lcl|NC_013597. 1 MALSISH---IVNVQLNTVPKSAA-RKSFGIVALFTPEAGQAFADEKTRYVYVENQRDVEQLF----GTNSETAKAAQPF 72 (502) Q Consensus 1 Msip~s~---iV~V~i~~~~~~~~-~~~f~~~lil~~~~~~~~~~~~~r~~~y~s~~~v~~~f----g~~s~ey~aA~~~ 72 (502) |+-.+++ +|...++- ..++. ....+...++|...--|+. ++++ -+|..|....| |.......|-..+ T Consensus 279 ~~~~v~~~GVYVEEVpSG-vrtIeGGV~TSVAAFVG~A~rGPvn---~Pvl-ITS~aD~~~~Fg~~~GGl~GassA~r~~ 353 (774) T protein:vir:98 279 ITRNVEDNGVVIQLEPAL-TGSISNRFSFYVTANDNTANRGFTT---SPAL-VTTIPDPAIHFTSFQGGLDGPRSAFRDF 353 (774) T ss_pred eEEEEecCceEEEEeCCC-CccccccccceeeeecccccCCCCC---cCEE-EeehhHhhhhhccccCCccccceeeeee Confidence 4444333 23332222 12332 2455677777766544432 3333 34455544444 3221111110111 Q ss_pred hcCCCCcceEEEEEeecccccceeeeeeccchhhhHHHHHhhcccceeEEEEecCccccc-cccccccccchhhHHHHHH Q lcl|NC_013597. 73 FAQSPRAKQLIVARWQKSASTIEATKNTLSGATLSDDLERFKSVVNGRFSLTIGGDVKKV-DGLSFARLADFNAVATKIQ 151 (502) Q Consensus 73 F~q~p~P~~l~igr~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~iti~g~~~~~-~~i~~s~~ts~~~vA~~i~ 151 (502) |.-.-.| .|.+-....-...-.++... ....++.+.+.+.-...+. ........-+......... T Consensus 354 ~~~sG~~-~L~i~A~~pGawGN~ItV~I-------------~~~t~~~~~l~v~~~~~s~f~~~~a~e~~tv~~~~~~~~ 419 (774) T protein:vir:98 354 YTFNGTP-LLRLQAVSEGNWGNQVTVSI-------------YPVNNSEFRLNVQDLNGSAFNPPLADEVYTVKLGDTNES 419 (774) T ss_pred eeecccc-eEEEEEeecCcCCCceEEEE-------------EecCCceeEEEEEecCCccccccccceeEEEeccccccc Confidence 1111111 11110000000000000000 0000111111111000000 0000000000000000000 Q ss_pred hhhcccccceeEEEecccceeeEeeecccccccc----eeeeeecc-ccchhhhhhhhhhcccccceeeeeccccccccC Q lcl|NC_013597. 152 EKLTTLSVAVSIAYDETGNRFIVSANVAGEDKKT----EIDYAIDE-GGEGEYIGALLKLENGQASRKVGKNSVSLKKET 226 (502) Q Consensus 152 aal~~a~~~~tv~~~~~~~~f~~~s~ttG~~~~v----~~~~a~~~-~~t~t~~aa~l~~t~~~~~~~v~v~~~~~~~et 226 (502) ..+...... ............ ......+.. .+..+... .....................+.+.......++ T Consensus 420 ~~v~e~~dn-~~i~~~~~~~~~---~~in~vs~lv~~~~~~~a~~d~~~~~~~~~~~~~~~~~~~~v~v~lagG~Dg~~t 495 (774) T protein:vir:98 420 GELNALLDS-KFIRGFFLPKSI---DSINYDAALVRQSPLRLAPPDESETDVENPAHVDFYGPNVLVDVTLENGYDGPPV 495 (774) T ss_pred ceeeeeece-eeEeeccccccc---ccccccccccccchhcccccccccccccccccccccCCcceEEEeecCCCCcccc Confidence 000000000 000000000000 000000000 00000000 000000000000000111111111111111111 Q ss_pred HHHHHHHHHh--ccCceeEEEEecCCChhHHHHHHHHHhhc----CCEEEEEecCchhcccchhHHHHHHHHccCCceEE Q lcl|NC_013597. 227 LGEALFNVAE--VNNTWYGFTVAAQLTDSEVEAAAKYAQAN----TKLFGANVIRAEQIEWSADNIYKKLYDAGLDHTLA 300 (502) Q Consensus 227 ~~~al~al~~--~~~~w~~~~~~~~~~~~~~~a~a~w~~a~----~~~~~~~~~~~~~~~~~~~~i~~~l~~~~~~~t~~ 300 (502) ..+.+....+ ....++.++. .........++..+++.. ..++.+...... .+...........+..|..+ T Consensus 496 t~~~igg~~~~~~~tgi~aLl~-a~~~~~V~~aii~~~e~~~~~~~~r~avid~p~g---~t~~~Ai~~r~~f~S~~aal 571 (774) T protein:vir:98 496 TNDDYVSIIRTLENQPVHILLV-GTTNVGVQQALITEAERASDSDGLRIAVLAAPPR---TTPTLAASVTRGFNSTRAVM 571 (774) T ss_pred cchheecccccccccceeEEEc-CccchhhHHHHHHHHHHhhhcccceEEEEECCCC---CCHHHHHHHHhccCCceEEE Confidence 1222211111 1245554443 333344555666666642 334444322211 11222222223333345443 Q ss_pred EecC-------Cc-----cchHHHHHHHHHhcCCCCCCceeeEeeeecCccc--------cCCCCHHHHHHHHhCCceEE Q lcl|NC_013597. 301 MFDK-------ND-----MYPVSSALARLLSTNFAANNSTLTLKFKQQPTIT--------ADEITATEFAKAKRLGINVY 360 (502) Q Consensus 301 ~y~~-------~~-----~~~~aa~~g~~as~n~~~~~g~~T~~fk~~~Gv~--------~~~lt~t~~~~l~~~~~n~y 360 (502) |++ .+ -.+.+.++|.++.+|+...+ ..|.+.|+. .+..++.+.+.|..+++|.. T Consensus 572 -~~Pwvkv~D~~~g~~~~vPpSg~vAGl~ArtDv~kSP-----ANk~I~Givg~ai~~~l~~~~t~ae~d~Ln~~gIN~i 645 (774) T protein:vir:98 572 -VAGWFTYAGQPNSSRYGVPGAAVYAGKLAAIDFFVSP-----AARSLVGPLFNIIESDTDNYTSRSNQDIYSAARLEVL 645 (774) T ss_pred -EeCcEEEeccCCCceeecChhHHHHHHHHhcCccccc-----CCceeecceeccccccccccccchhhhhhccccccee Confidence 333 11 13568889999998864443 345566653 22346788889999999988 Q ss_pred EE-EcCce-EEecCEeecCe----ehhHHHHHHHHHHHHHHHHHHHHHhcCCCCccCHHHHHHHHHHHHHHHHHHHHcCc Q lcl|NC_013597. 361 TY-FDDVA-MIAEGTVIGGK----FADEIVILDWFVDAVQKEVFARLYKSPTKIPLTDKGQAILIAAVEKVCLEGINNGA 434 (502) Q Consensus 361 ~~-~~~~~-~~~~G~~~~G~----~iD~~~~~dwl~~~iq~~l~~~l~~~~~kiPyt~~G~~~l~~~v~~vl~~a~~~G~ 434 (502) +. .-+.+ .+|.+++++++ ||-+.+-.+|++..|+..+...+ .+ |.++.....|+..++.-|+..++.|. T Consensus 646 ~itt~g~G~rvWG~RTlssDp~wr~InVRRlfd~Ie~SI~~~~~~~V----fE-PNd~~l~~~I~~sI~~fL~~L~~~Ga 720 (774) T protein:vir:98 646 SLDTVDRTYRFASGVTLSTDPAWERIYLRRVHDVVRQGAHAILRNYV----AM-PNSRLVRNQIAAALNAFMGELKRNGN 720 (774) T ss_pred EEEEcCCcEEEEcccccCCCcccceEeehhhHHHHHHHHHHHHHHhc----cC-CCCHHHHHHHHHHHHHHHHHHHhCCc Confidence 73 43433 57788888874 78888999999999999876544 34 78999999999999999999999999 Q ss_pred cccccccCccccccccccccccceE-EEcCchhcCCHHHHhhcccCceEEEEEECceEEEEEEEEEEeC Q lcl|NC_013597. 435 FAPGKWTGAGFGNLSTGDYLDKGFY-VWAAPMDTLSDSDRQARRATPIQTAVKLAGAIHSSDVIVNYNR 502 (502) Q Consensus 435 I~~g~~~~~~~g~~~~~~~~~~gy~-v~~~~~~~~s~~dra~R~~~~i~~~~~~aGaIh~v~i~~~v~~ 502 (502) |. ||+ +.+. .+..+++++.+.+. -+.+.+.....+++|.+++.... T Consensus 721 L~--------------------G~~~V~~D-~etNt~~dI~~G~l-~i~I~vaP~~PAEfIilri~q~t 767 (774) T protein:vir:98 721 IV--------------------SFRPAIID-GSNNSTAAYFSREL-YVSLQFQPLYSADYIYVTISRDT 767 (774) T ss_pred ee--------------------cceEEEEc-CCCCCHHHhhCCEE-EEEEEEEecCCcceEEEEEEEee Confidence 85 243 4444 56678889888877 58999999999999999887766 No 45 >protein:vir:102359 Length: 356 # NCBI annotation: XkdK protein # Family: family:all:632 # MgeID: mge:1566 # MgeName: phi CD119 # Cross-refs: genbank:acc:YP_529565;genbank:gi:90592650;genbank:GeneID:3974491 Probab=98.18 E-value=2.9e-06 Score=50.90 Aligned_cols=326 Identities=12% Similarity=0.091 Sum_probs=172.5 Q ss_pred hhcccceeEEEEecCccccccccccccccchhhHHHHHHhhhcccccceeEEEecccceeeEeeecccccccceeeeeec Q lcl|NC_013597. 113 FKSVVNGRFSLTIGGDVKKVDGLSFARLADFNAVATKIQEKLTTLSVAVSIAYDETGNRFIVSANVAGEDKKTEIDYAID 192 (502) Q Consensus 113 ~~~~~~g~~~iti~g~~~~~~~i~~s~~ts~~~vA~~i~aal~~a~~~~tv~~~~~~~~f~~~s~ttG~~~~v~~~~a~~ 192 (502) ..+++. ..|.+ .....++ ++.+.+ |..+.+..+.......++ ... .+... . T Consensus 1 ~~glp~--i~i~f--~~~a~ta---------------~~~g~r--Giv~~il~d~~~~~~~~~-~~~----~v~~~--~- 51 (356) T protein:vir:10 1 MAGLVN--INIEF--KELATSF---------------IQRSKA--GIVAIILKDTTKMYKELT-SED----DIPIS--L- 51 (356) T ss_pred CCCCCc--eeEEE--eecceee---------------ccCCcc--ceEEEEEecCCcceeEEe-ccc----cchhH--H- Confidence 112221 12222 1111111 111111 222333344333222221 111 11110 0 Q ss_pred cccchhhhhhhhhhcc----cccceeeeeccccccccCHHHHHHHHHhccCceeEEEEecCCChhHHHHHHHHHhh---- Q lcl|NC_013597. 193 EGGEGEYIGALLKLEN----GQASRKVGKNSVSLKKETLGEALFNVAEVNNTWYGFTVAAQLTDSEVEAAAKYAQA---- 264 (502) Q Consensus 193 ~~~t~t~~aa~l~~t~----~~~~~~v~v~~~~~~~et~~~al~al~~~~~~w~~~~~~~~~~~~~~~a~a~w~~a---- 264 (502) ......++...+.... ...|..+.+... ...++..++|+++..+..||. .++. .+++++..++.|+.. T Consensus 52 ~~~n~~~i~~~~~g~~~~~~~~~p~~~~~~~~-~t~~~y~~aL~~le~~~fn~l--~~~~-~d~~~~~~~~a~ikr~r~~ 127 (356) T protein:vir:10 52 SADNKKYIKYGFVGATDNEKVLRPSKVIISTF-TEDGKVEDILEELESVEFNYL--CMPE-AIEAEKTKIVTWIKKIREE 127 (356) T ss_pred HHHHHHHHHHHhhccccccccccceeeeeecc-cCchhHHHHHHHhcCccceEE--EecC-CChHHHHHHHHHHHHHHhc Confidence 0111122322221110 112333333322 345789999999988776664 4444 567899999999985 Q ss_pred cCCEEEEEecCchhcccchhHHHHHHHHccCCceEE-----EecCC---ccchHHHHHHHHHhcCCCCCCceeeEeeeec Q lcl|NC_013597. 265 NTKLFGANVIRAEQIEWSADNIYKKLYDAGLDHTLA-----MFDKN---DMYPVSSALARLLSTNFAANNSTLTLKFKQQ 336 (502) Q Consensus 265 ~~~~~~~~~~~~~~~~~~~~~i~~~l~~~~~~~t~~-----~y~~~---~~~~~aa~~g~~as~n~~~~~g~~T~~fk~~ 336 (502) ..+++........+ .+.+.+- ++... ..-..+.+.|..|+...++. +-|+.+ T Consensus 128 ~~~~~~~V~~~~~a---------------D~EgIInv~n~~~~~g~~~t~~~~~~~vAG~~Ag~~~n~S-----~T~~~~ 187 (356) T protein:vir:10 128 ESTEAKAVLANIKA---------------DNEAIINFTENVVVDGEEITAEKYTTRVASLIASTPNTQS-----ITYAPL 187 (356) T ss_pred CCcEEEEEecCCCC---------------CCceeEEeecCeEecceeechhHHHHHHHHHHhccchhcc-----ccceec Confidence 23555544332211 1111111 11111 11234466677777655443 345567 Q ss_pred CccccC-CCCHHHHHHHHhCCceEEEEEcCceEEecCE----eecCee------hhHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_013597. 337 PTITAD-EITATEFAKAKRLGINVYTYFDDVAMIAEGT----VIGGKF------ADEIVILDWFVDAVQKEVFARLYKSP 405 (502) Q Consensus 337 ~Gv~~~-~lt~t~~~~l~~~~~n~y~~~~~~~~~~~G~----~~~G~~------iD~~~~~dwl~~~iq~~l~~~l~~~~ 405 (502) +++... .++.+|++.+.++|.-.+..-++.-.+-+|. +.+.+. |-.++..|-+.+.++.. |+.-+. T Consensus 188 ~~~~~~~~~t~~e~~~ai~~G~lvl~~d~~~V~I~~~VNSltt~t~~k~~~f~Kirvvr~~D~i~~Di~~~-f~~~yi-- 264 (356) T protein:vir:10 188 DEVESIVKIDKASADAKVQAGELILRRLSGKIRIARGINSLTTLTAEKGEIFQKIKLVDTKDLISKDIKNI-YVEKYL-- 264 (356) T ss_pred CCccccccCCHHHHHHHHhCCeEEEEEEcCeEEEEecCccceecCCCCCcchhhhHHHHHHHHHHHHHHHH-Hhhccc-- Confidence 776543 5889999999999999987776666666775 224433 78888999999988765 443333 Q ss_pred CCCccCHHHHHHHHHHHHHHHHHHHHcCccccccccC---ccccccccccccccceEEEcCchhcCCHHHHhhcccCceE Q lcl|NC_013597. 406 TKIPLTDKGQAILIAAVEKVCLEGINNGAFAPGKWTG---AGFGNLSTGDYLDKGFYVWAAPMDTLSDSDRQARRATPIQ 482 (502) Q Consensus 406 ~kiPyt~~G~~~l~~~v~~vl~~a~~~G~I~~g~~~~---~~~g~~~~~~~~~~gy~v~~~~~~~~s~~dra~R~~~~i~ 482 (502) +|+|=+..|..++.+.++.-+++..+.|+|.++..-. +.|.. -...+|- ....++++.-....-+..--++ T Consensus 265 GKv~N~~dgr~~l~~ai~~y~~~L~~~~~I~~~~~~eid~e~q~~----~~~~~g~--d~~~~~d~~v~~~~~~~~v~~~ 338 (356) T protein:vir:10 265 RKCPNTYDNKCLFIVAVQSYLTELAKQELIDSNFTVEIDLEKQKE----YLEGKKI--AVSKMKENEIKEANTGSNGFYL 338 (356) T ss_pred cccCCCHHHHHHHHHHHHHHHHHHHhCCccccCceeEecccchHH----Hhhhccc--cccccccceeecccCCcEEEEE Confidence 7999999999999999999999999999997652100 00000 0001111 1111111111111222333478 Q ss_pred EEEEECceEEEEEEEEEE Q lcl|NC_013597. 483 TAVKLAGAIHSSDVIVNY 500 (502) Q Consensus 483 ~~~~~aGaIh~v~i~~~v 500 (502) +.++.-.|+..+.+.++| T Consensus 339 ~~v~~vdamE~iy~ti~v 356 (356) T protein:vir:10 339 INLKLVDAMEDINIRVQM 356 (356) T ss_pred EEEEEEeeeeeEEeEEeC Confidence 888999999999999999 No 46 >protein:vir:103456 Length: 659 # NCBI annotation: tail sheath monomer # Family: family:all:661 # MgeID: mge:1542 # MgeName: RB32 # Cross-refs: genbank:acc:YP_803108;genbank:gi:116326388;genbank:GeneID:4405485 Probab=98.12 E-value=4.2e-06 Score=50.05 Aligned_cols=463 Identities=12% Similarity=0.034 Sum_probs=218.7 Q ss_pred CCcCcCceeEEeecccccccccccccceEEEecccccccccCccceEEecCHHHHHhhcC---CCcHHHHHHHHHhcCCC Q lcl|NC_013597. 1 MALSISHIVNVQLNTVPKSAARKSFGIVALFTPEAGQAFADEKTRYVYVENQRDVEQLFG---TNSETAKAAQPFFAQSP 77 (502) Q Consensus 1 Msip~s~iV~V~i~~~~~~~~~~~f~~~lil~~~~~~~~~~~~~r~~~y~s~~~v~~~fg---~~s~ey~aA~~~F~q~p 77 (502) |.+ ++.=|-|..--++.......-+...|+|...--|+. + -..-+|..|....|| ..+.++.+...||-+-- T Consensus 1 ~~~-~~PgVyv~e~~~~~~~~~~~ts~~~fvG~~~~Gp~~---~-p~~i~s~~d~~~~fG~~~~~~~~~~~v~~~f~ngg 75 (659) T protein:vir:10 1 MTL-LSPGIELKETTVQSTVVNNSTGTAALAGKFQWGPAF---Q-IKQVTNEVDLVNTFGQPTAETADYFMSAMNFLQYG 75 (659) T ss_pred Cce-ecCceEEEEecCCceecccCccceEEEecccCCCCC---c-cEEecCHHHHHHHcCCcCCCcchhHHHHHHHhhCC Confidence 987 566555554444333334467788899887655543 3 355667899999998 45566777888885432 Q ss_pred CcceEEEEEeecccccceee--ee------eccchh---------------------h-----hHHHHHhh--------- Q lcl|NC_013597. 78 RAKQLIVARWQKSASTIEAT--KN------TLSGAT---------------------L-----SDDLERFK--------- 114 (502) Q Consensus 78 ~P~~l~igr~~~~~~~~~~~--~~------~~~~~~---------------------~-----~~~~~~~~--------- 114 (502) . ++||-|........... .. ...+.. + ......+. T Consensus 76 ~--~~~vvRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~v~~vd~~~~~~~~~i~~~~~~~~ 153 (659) T protein:vir:10 76 N--DLRVVRAVDRDTAKNSSPIAGNIEYTISTPGSNYAVGDKITVKYVSDAIETEGKITEVDTDGKIKKINIPTAKIIAK 153 (659) T ss_pred C--eEEEEEccCcccccccccccccceeeEeecccccccccceeeeecCCCccccceeeEEecccccceeeecccccccc Confidence 2 44444432211000000 00 000000 0 00000000 Q ss_pred c---------ccceeEEEEecC--cccccc--ccccccccchhhHH--------HHHHhhh-------------cccccc Q lcl|NC_013597. 115 S---------VVNGRFSLTIGG--DVKKVD--GLSFARLADFNAVA--------TKIQEKL-------------TTLSVA 160 (502) Q Consensus 115 ~---------~~~g~~~iti~g--~~~~~~--~i~~s~~ts~~~vA--------~~i~aal-------------~~a~~~ 160 (502) . ..++...++-.+ ....+. .+.......+.++. ...+... ...+.. T Consensus 154 ~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~a~t~~~~~~~~~~~~~~~v~a~~~G~~g~~ 233 (659) T protein:vir:10 154 AKEVGEYPTLGSNWTAEISSSSSGLAAVITLGKIITDSGILLAEIENAEAAMTAVDFQANLKKYGIPGVVALYPGELGDK 233 (659) T ss_pred cccccccceeeeeeeeeeeeeccccceeeEEeeeecCCceeEEeeccccccccccccccceeecccccccccccceeccc Confidence 0 000000000000 000000 00000000000000 0000000 000000 Q ss_pred eeEEEeccc------ceeeEeeecc--------------cccc-------------cceeeeeeccccchhh----hhhh Q lcl|NC_013597. 161 VSIAYDETG------NRFIVSANVA--------------GEDK-------------KTEIDYAIDEGGEGEY----IGAL 203 (502) Q Consensus 161 ~tv~~~~~~------~~f~~~s~tt--------------G~~~-------------~v~~~~a~~~~~t~t~----~aa~ 203 (502) .++...... .......... +... ...+..... ..++.+ .... T Consensus 234 ~tv~~~~~a~~~~~~~v~v~~~~~~~~~a~~~t~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~ 312 (659) T protein:vir:10 234 IEIEIVSKADYAKGASALLPIYPGGGTRASTAKAVFGYGPQTDSQYAIIVRRNDAIVQSVVLSTK-RGEKDIYDSNIYID 312 (659) T ss_pred ceEEEechhhccccceeeeeeeeecccccccceeeeeeccccccchhhccccccceeeeeeeecc-ccccccccchhhhh Confidence 001000000 0000000000 0000 000000000 000000 0000 Q ss_pred hhhccccccee--------------eeecccccc------ccCHHHHHHHHHhcc-CceeEEEEecCC---C----hhHH Q lcl|NC_013597. 204 LKLENGQASRK--------------VGKNSVSLK------KETLGEALFNVAEVN-NTWYGFTVAAQL---T----DSEV 255 (502) Q Consensus 204 l~~t~~~~~~~--------------v~v~~~~~~------~et~~~al~al~~~~-~~w~~~~~~~~~---~----~~~~ 255 (502) ..+........ +.+ ..+.+ ..+...++..+.... .+.. +++.... + ..-. T Consensus 313 ~~~~~~~~~~v~~~~~~~~~~~~~~~~l-~gg~~~~~~~~~~~~~~~~~~~~~~~~~~~~-il~~p~~~~~~~~~~~~v~ 390 (659) T protein:vir:10 313 DFFAKGGSEYIFATAQNWPEGFSGILTL-SGGLSSNAEVTAGDLMEAWDFFADRESVDVQ-LFIAGSCAGESLETASTVQ 390 (659) T ss_pred hhhccCcccEEEEeecccCCCccceeee-cccccccccccchhHHHHHHHhhhcccccee-EEEecCCCCcchhhhHHHH Confidence 01111111000 000 00111 111234444443332 2333 3333221 1 1224 Q ss_pred HHHHHHHhhcCCEEEEEecCchh-c---c-cchhHHHHHHHHc--------cC--CceEEEecC-------Cc-----cc Q lcl|NC_013597. 256 EAAAKYAQANTKLFGANVIRAEQ-I---E-WSADNIYKKLYDA--------GL--DHTLAMFDK-------ND-----MY 308 (502) Q Consensus 256 ~a~a~w~~a~~~~~~~~~~~~~~-~---~-~~~~~i~~~l~~~--------~~--~~t~~~y~~-------~~-----~~ 308 (502) .++...++....+|.+..-.... + . ....++....... ++ .| +.+|++ .+ -. T Consensus 391 ~al~~~~~~~~~~~~~~d~p~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~s~~-~~l~~p~~~~~d~~~~~~~~~p 469 (659) T protein:vir:10 391 KHVVSIGDARQDCLVLCSPPRETVVGIPVTRAVDNLVNWRTAAGSYTDNNFNISSTY-AAIDGNYKYQYDKYNDVNRWVP 469 (659) T ss_pred HHHHHHHHhhCCeEEEEcCccccccCCCcccCHHHHHHHHHhcccccccccccCcce-EEEEeCcEEEecccCCceEEec Confidence 45566666666666554221111 1 1 1122332222221 12 23 344443 11 13 Q ss_pred hHHHHHHHHHhcCCCCCCceeeEeeeecCccc-----cCCCCHHHHHHHHhCCceEEEEEcCc-eEEecCEeecC----- Q lcl|NC_013597. 309 PVSSALARLLSTNFAANNSTLTLKFKQQPTIT-----ADEITATEFAKAKRLGINVYTYFDDV-AMIAEGTVIGG----- 377 (502) Q Consensus 309 ~~aa~~g~~as~n~~~~~g~~T~~fk~~~Gv~-----~~~lt~t~~~~l~~~~~n~y~~~~~~-~~~~~G~~~~G----- 377 (502) +.+.++|.++.+|..+.+. .....|.+.||. ...+++.|++.|..+++|.+.++.+. ..+|..+++++ T Consensus 470 ~sg~~AGl~Ar~D~~~g~~-~span~~~~~i~g~~~~~~~~~~~~~~~Ln~~gIn~i~~~~~~G~~~wG~rT~~~~~s~~ 548 (659) T protein:vir:10 470 LAADIAGLCARTDNVSQTW-MSPAGYNRGQILNVIKLAIETRQAQRDRLYQEAINPVTGTGGDGYVLYGDKTATSVPSPF 548 (659) T ss_pred hHHHHHHHHHHHhccCCce-EccCCceeeeeeccccceecCCHhHHHHHhhCCeeEEEEeCCCeEEEEcccccCCCCccc Confidence 4578889999887655321 122334433332 22578999999999999999998775 57888998876 Q ss_pred eehhHHHHHHHHHHHHHHHHHHHHHhcCCCCccCHHHHHHHHHHHHHHHHHHHHcCccccccccCccccccccccccccc Q lcl|NC_013597. 378 KFADEIVILDWFVDAVQKEVFARLYKSPTKIPLTDKGQAILIAAVEKVCLEGINNGAFAPGKWTGAGFGNLSTGDYLDKG 457 (502) Q Consensus 378 ~~iD~~~~~dwl~~~iq~~l~~~l~~~~~kiPyt~~G~~~l~~~v~~vl~~a~~~G~I~~g~~~~~~~g~~~~~~~~~~g 457 (502) .||-+.+-.+|+...|+..+...++. |.++.=...|+..|+.-|++.+++|.|. | T Consensus 549 ~~i~vrR~~~~i~~si~~~~~~~v~e-----~n~~~l~~~i~~~i~~fL~~l~~~gal~--------------------~ 603 (659) T protein:vir:10 549 DRINVRRLFNMLKTNIGRSSKYRLFE-----LNNAFTRSSFRTETAQYLQGIKALGGIY--------------------E 603 (659) T ss_pred ceEehhhHHHHHHHHHHHHHHHhccC-----CCCHHHHHHHHHHHHHHHHHHHhcCcee--------------------e Confidence 26788889999999999998665533 6788888999999999999999999984 5 Q ss_pred eEEEcCchhcCCHHHHhhcccCceEEEEEECceEEEEEEEEEEeC Q lcl|NC_013597. 458 FYVWAAPMDTLSDSDRQARRATPIQTAVKLAGAIHSSDVIVNYNR 502 (502) Q Consensus 458 y~v~~~~~~~~s~~dra~R~~~~i~~~~~~aGaIh~v~i~~~v~~ 502 (502) |.|.++ .+..|++|+.+.+. -+.+.+...-.+++|.+++.-.| T Consensus 604 ~~V~~d-~~~nt~~~i~~G~~-~~~i~~~p~~pae~i~~~~~~~~ 646 (659) T protein:vir:10 604 YRVVCD-TTNNTPSVIDRNEF-VATFYIQPARSINYITLNFVATA 646 (659) T ss_pred EEEEEc-CCCCCHHHhhCCeE-EEEEEEEecCCcceEEEEEEEEe Confidence 899998 47889999999888 59999999999999999887666 No 47 >protein:vir:100323 Length: 393 # NCBI annotation: tail sheath protein FI # Family: family:all:115 # MgeID: mge:1484 # MgeName: phi-MhaA1-PHL101 # Cross-refs: genbank:acc:YP_655492;genbank:gi:109289960;genbank:GeneID:4157366 Probab=98.11 E-value=4.4e-06 Score=49.95 Aligned_cols=360 Identities=13% Similarity=0.102 Sum_probs=189.9 Q ss_pred CCcCcC---ceeEEeecccccccccccccceEEEeccccccc--ccCccceEEecCHHHHHhhcCCCcHHHHHHHHHhcC Q lcl|NC_013597. 1 MALSIS---HIVNVQLNTVPKSAARKSFGIVALFTPEAGQAF--ADEKTRYVYVENQRDVEQLFGTNSETAKAAQPFFAQ 75 (502) Q Consensus 1 Msip~s---~iV~V~i~~~~~~~~~~~f~~~lil~~~~~~~~--~~~~~r~~~y~s~~~v~~~fg~~s~ey~aA~~~F~q 75 (502) |+.+-. =|--+.+.-.+.++.....+.+.|+|....... .+-...++. ++..+....||.....+.+...+|.+ T Consensus 1 m~m~~~~~~GV~v~e~~~g~~~i~~~~tav~~~vgta~~~~~~~~pln~pv~i-~s~~~~~~~~g~~g~L~~al~~~~~~ 79 (393) T protein:vir:10 1 MSILDTYLHGVEVVEVNAGGVTISTAATSVIGVVCTGDQADAETFPLNTPVLI-TNPLNYLEKAGSTGTLRRTLNSIGSI 79 (393) T ss_pred CCCCCccCCCeEEEEcCCCcceecccCcceeEEEeeccCcCcccccCccceEe-cchHHHHHhhCCccchhhhhhhhhcc Confidence 554422 222233444456666777777777776543211 011234444 56667778899888888888888877 Q ss_pred CCCcceEEEEEeecccccceeeeeeccchhhhHHHHHhhcccceeEEEEecCccccccccccccccchhhHHHHHHhhhc Q lcl|NC_013597. 76 SPRAKQLIVARWQKSASTIEATKNTLSGATLSDDLERFKSVVNGRFSLTIGGDVKKVDGLSFARLADFNAVATKIQEKLT 155 (502) Q Consensus 76 ~p~P~~l~igr~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~iti~g~~~~~~~i~~s~~ts~~~vA~~i~aal~ 155 (502) ...+. ++-+....+.. +......+++.. ....++ +. +|. T Consensus 80 ~~~~~--~vv~v~~~~~~------------------------~~t~~~iig~~~----------~~~~tg----l~-al~ 118 (393) T protein:vir:10 80 VKTPT--VIVRVAESDDS------------------------DTLTANIVGTQE----------NGKFTG----IK-ALL 118 (393) T ss_pred cCceE--EEeecccCccc------------------------cccccccccccc----------cchhhH----HH-HHH Confidence 53322 22222111000 000000000000 000011 11 111 Q ss_pred ccccceeEEEecccceeeEeeecccccccceeeeeeccccchhhhhhhhhhcccccceeeeeccccccccCHHHHHHHHH Q lcl|NC_013597. 156 TLSVAVSIAYDETGNRFIVSANVAGEDKKTEIDYAIDEGGEGEYIGALLKLENGQASRKVGKNSVSLKKETLGEALFNVA 235 (502) Q Consensus 156 ~a~~~~tv~~~~~~~~f~~~s~ttG~~~~v~~~~a~~~~~t~t~~aa~l~~t~~~~~~~v~v~~~~~~~et~~~al~al~ 235 (502) ..... +... ..... ..+-.......++..+. T Consensus 119 ~~~~~-----------~~~~-------p~li~-------------------------------apg~~~~~~~~al~~~~ 149 (393) T protein:vir:10 119 TAQST-----------VFVK-------PKLLC-------------------------------VPQHDNQAVATELLSVA 149 (393) T ss_pred hhhhh-----------ccee-------eeeee-------------------------------eccccchHHHHHHHHHh Confidence 10000 0000 00000 00111111223333333 Q ss_pred hccCceeEEEEecCCChhHHHHHHHHHhhcCCEEEEEecC-chhcccchhHHHHHHHHccCCceEEEecCCccchHHHHH Q lcl|NC_013597. 236 EVNNTWYGFTVAAQLTDSEVEAAAKYAQANTKLFGANVIR-AEQIEWSADNIYKKLYDAGLDHTLAMFDKNDMYPVSSAL 314 (502) Q Consensus 236 ~~~~~w~~~~~~~~~~~~~~~a~a~w~~a~~~~~~~~~~~-~~~~~~~~~~i~~~l~~~~~~~t~~~y~~~~~~~~aa~~ 314 (502) +.-++-+.+.... .....++-.|.+.-+..+....+. ....+. . .+.-.. ..+.+.++ T Consensus 150 ~~~~~~~~v~d~~---~~t~~~ai~~~~~~~s~~~~~~~P~~~~~d~----------~---~~~~~~-----~p~s~~~A 208 (393) T protein:vir:10 150 KKLNAFAFISDNG---ATTKEQAYTYRQNFSQREGMMIFGDWKSYNT----------D---KKAYDT-----DYAVARAC 208 (393) T ss_pred hccCcEEEEEcCC---CCCHHHHHHHhhhcCCceEEEEecccccccc----------c---CCceeE-----eehhHHHH Confidence 3333332222221 222334445655422212111110 000000 0 000111 12456777 Q ss_pred HHHHhcCCCCCCceeeEeeeecCccccC--------CCCHHHHHHHHhCCceEEEEEcCceEEecCEeecCe----ehhH Q lcl|NC_013597. 315 ARLLSTNFAANNSTLTLKFKQQPTITAD--------EITATEFAKAKRLGINVYTYFDDVAMIAEGTVIGGK----FADE 382 (502) Q Consensus 315 g~~as~n~~~~~g~~T~~fk~~~Gv~~~--------~lt~t~~~~l~~~~~n~y~~~~~~~~~~~G~~~~G~----~iD~ 382 (502) |.++.+|-.+-+ ......|.+.||..- .++++|++.|..+|+|.+....| ..+|.+++++++ ||-+ T Consensus 209 g~~a~~d~~~G~-~~spaN~~l~gi~~~~~~~~~~~~~~~~~~~~Ln~~gI~t~~~~~G-~~~wG~rT~s~d~~~~~i~v 286 (393) T protein:vir:10 209 ALQAYIDKTVGW-HKNISNVELDGVTGITKAVEFDINESSTEANYLNEKGITICLNHNG-FRYWGSRTLATDTRWAFQQS 286 (393) T ss_pred HHHHHhhcCCCc-EEccCCceeeceeecceecccccCCCcchhHhHhhcCceEEEcCCC-EEEEcccccCCCcccceeeh Confidence 888888754321 122345566665532 24588999999999999865433 467888998883 7889 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCCCCccCHHHHHHHHHHHHHHHHHHHHcC--ccccccccCccccccccccccccceEE Q lcl|NC_013597. 383 IVILDWFVDAVQKEVFARLYKSPTKIPLTDKGQAILIAAVEKVCLEGINNG--AFAPGKWTGAGFGNLSTGDYLDKGFYV 460 (502) Q Consensus 383 ~~~~dwl~~~iq~~l~~~l~~~~~kiPyt~~G~~~l~~~v~~vl~~a~~~G--~I~~g~~~~~~~g~~~~~~~~~~gy~v 460 (502) .+-.+|++..|+..+...++ | |.++.=...++..++.-|+.-+++| .|. ||.+ T Consensus 287 rR~~~~i~~~i~~~~~~~v~----e-~~~~~~~~~i~~~i~~~L~~l~~~g~~al~--------------------g~~v 341 (393) T protein:vir:10 287 VRTAQIIKETIGAGLAWAVD----M-PLTPLRVKTMLEAINNKLRSWASGDDPRIL--------------------GARV 341 (393) T ss_pred hhHHHHHHHHHHHHHHHhcc----C-CCCHHHHHHHHHHHHHHHHHHHhccccccc--------------------cceE Confidence 99999999999999876553 3 7788888889999999998887766 232 3566 Q ss_pred EcCchhcCCHHHHhhcccCceEEEEEECceEEEEEEEEEEeC Q lcl|NC_013597. 461 WAAPMDTLSDSDRQARRATPIQTAVKLAGAIHSSDVIVNYNR 502 (502) Q Consensus 461 ~~~~~~~~s~~dra~R~~~~i~~~~~~aGaIh~v~i~~~v~~ 502 (502) ...+ +.+++|..+.+. -+.+.+...-.+++|+++...++ T Consensus 342 ~~~~--~nt~~~i~~G~~-~~~i~~~p~~p~e~I~~~~~~~~ 380 (393) T protein:vir:10 342 WVAE--EITADIIKSGKF-VIKYDYHWIPSLESLGLEQRVND 380 (393) T ss_pred EecC--CCCHHHhhCCEE-EEEEEEEecCCcceEEEEEEEch Confidence 6653 477888888777 58999999999999999999998 No 48 >protein:vir:108052 Length: 660 # NCBI annotation: gp18 tail sheath protein # Family: family:all:661 # MgeID: mge:2002 # MgeName: JS98 # Cross-refs: genbank:acc:YP_001595294;genbank:gi:161622600;genbank:GeneID:5783771 Probab=98.08 E-value=5e-06 Score=49.64 Aligned_cols=464 Identities=11% Similarity=0.016 Sum_probs=220.1 Q ss_pred CCcCcCceeEEe-ecccccccccccccceEEEecccccccccCccceEEecCHHHHHhhcCC---CcHHHHHHHHHhcCC Q lcl|NC_013597. 1 MALSISHIVNVQ-LNTVPKSAARKSFGIVALFTPEAGQAFADEKTRYVYVENQRDVEQLFGT---NSETAKAAQPFFAQS 76 (502) Q Consensus 1 Msip~s~iV~V~-i~~~~~~~~~~~f~~~lil~~~~~~~~~~~~~r~~~y~s~~~v~~~fg~---~s~ey~aA~~~F~q~ 76 (502) |+| ++.=|-|. ++ .+.++....-+...|+|...--|+. + -...+|..|....||. .+.++.++..||-+- T Consensus 1 ~~~-~~Pgvyv~e~~-~~~~i~~~~t~~~~~vg~~~~gp~~---~-p~~v~s~~~~~~~fg~~~~~~~~~~~~~~~f~~~ 74 (660) T protein:vir:10 1 MAL-LSPGIELKETS-VQSTVVRNATGRAALVGKFQWGPAF---Q-VTQITNEVELVDLFGGPNNEVADYFMSGMNFLQY 74 (660) T ss_pred Cce-ecCceEEEeec-CCccccCCCcccceEEeecCCCCCc---c-CeEcCCHHHHHHHcCCcCCCchhHHHHHHHHHhC Confidence 886 77777776 44 3567777778888899887665543 3 3455779999999983 355666777777542 Q ss_pred CCcceEEEEEeecccccce---------eee---e-----------eccchhhhH------H------------------ Q lcl|NC_013597. 77 PRAKQLIVARWQKSASTIE---------ATK---N-----------TLSGATLSD------D------------------ 109 (502) Q Consensus 77 p~P~~l~igr~~~~~~~~~---------~~~---~-----------~~~~~~~~~------~------------------ 109 (502) -. ++||-|......... .+. + ...+..... . T Consensus 75 g~--~~~vvrv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~v~~~~~~~~~~~~v~~~~a~g~~~~~~~~ta~~~~ 152 (660) T protein:vir:10 75 GN--DLRTVRVVSREFAKNASPIAGNIETTITTAGSNYAVGDKINIKYNQTVVESEGRVTSVDTDGKILSVFIPSAKIIA 152 (660) T ss_pred Cc--eEEEEEecccccccccccccccceeEEeeccccccccceeeEeeccccccccccceeeccccceeeeccccccccc Confidence 21 344444432211000 000 0 000000000 0 Q ss_pred ----HHHhhcccceeEEEEecCcccccc------------ccccc----cccchhhHHHHH----------Hh-hhcccc Q lcl|NC_013597. 110 ----LERFKSVVNGRFSLTIGGDVKKVD------------GLSFA----RLADFNAVATKI----------QE-KLTTLS 158 (502) Q Consensus 110 ----~~~~~~~~~g~~~iti~g~~~~~~------------~i~~s----~~ts~~~vA~~i----------~a-al~~a~ 158 (502) ......... .....+.+...... ++.+. ........+... .+ .....+ T Consensus 153 ~a~~v~~~~~~~~-~~~~~~~~~~~~~~~a~sv~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~g~~G 231 (660) T protein:vir:10 153 YARSLNQYPTLGP-AWTAEVTSASSGVSGTITVGKIVTDSGILLTEAENSEEAITSLEFQAALKKFAMPGVVALYPGEIG 231 (660) T ss_pred ccccccccccccc-ceeEEEecccCccccceeeeeeeccCcceEEeeeccccccccccceeeccccccceeeeecccccC Confidence 000000000 00000000000000 00000 000000000000 00 000000 Q ss_pred cceeEEEe---c-------------ccceeeEeeecccc-c--c----cc----------eeeeeecccc---chhhhhh Q lcl|NC_013597. 159 VAVSIAYD---E-------------TGNRFIVSANVAGE-D--K----KT----------EIDYAIDEGG---EGEYIGA 202 (502) Q Consensus 159 ~~~tv~~~---~-------------~~~~f~~~s~ttG~-~--~----~v----------~~~~a~~~~~---t~t~~aa 202 (502) ....+... . .............. . . .. .+......+. .++..-. T Consensus 232 ~~i~v~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~~~~~~~~~~~~~~~~~~~~~~ 311 (660) T protein:vir:10 232 STLEVEIVSKAAYEAGSSKMLDVYPGGGTRASIAKAVFNYGPQTDDQYAIIVRRDGAIVESVVLSTKEGEKDVYGNNIYL 311 (660) T ss_pred cceeEEEeeccccCCcceeEEeeeeccceeeEEeeeecccccccccccccccccCCcccceeeeeccccccccccceeee Confidence 00000000 0 00000000000000 0 0 00 0000000000 0000000 Q ss_pred hhhhcccccceeeeec-------------c---cccc---ccCHHHHHHHHHhccC-ceeEEEEecC--CCh----hHHH Q lcl|NC_013597. 203 LLKLENGQASRKVGKN-------------S---VSLK---KETLGEALFNVAEVNN-TWYGFTVAAQ--LTD----SEVE 256 (502) Q Consensus 203 ~l~~t~~~~~~~v~v~-------------~---~~~~---~et~~~al~al~~~~~-~w~~~~~~~~--~~~----~~~~ 256 (502) ...+..+......... . .+.+ ..+....+..+.+... ++-.++.... ..+ .-.. T Consensus 312 ~~~~~~~~~~~v~~~~~~~~~~~~~~~~l~gg~~~~~~~t~~~~~~~~~~~~~~~~~~~~~l~~p~~~~~~~~~~~~v~~ 391 (660) T protein:vir:10 312 DDYFAKGTSNYIYATSLNWPKGFSGIINLSGGISANDKVTAGDLMQGWDLFADREALHINLLIAGAVAGEGDEVASTVQK 391 (660) T ss_pred ehhhcCCCccEEEEEeccCCCCcccceeeeccccCccccccchhhhhhhhhhhhhhcccceEEEcCcCCCchhhhHHHHH Confidence 0000111111000000 0 0001 1112234444443322 2322222221 112 2334 Q ss_pred HHHHHHhhcCCEEEEEecCch----hccc-chhHHHHHHHHc--------cC--CceEEEecC-------Cc-----cch Q lcl|NC_013597. 257 AAAKYAQANTKLFGANVIRAE----QIEW-SADNIYKKLYDA--------GL--DHTLAMFDK-------ND-----MYP 309 (502) Q Consensus 257 a~a~w~~a~~~~~~~~~~~~~----~~~~-~~~~i~~~l~~~--------~~--~~t~~~y~~-------~~-----~~~ 309 (502) +|...++..+.+|.+.-.... .... ...++....... ++ .+ ..+|++ .+ -.+ T Consensus 392 al~~~~~~~~~~~aiid~p~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~s~~-~~~~~p~~~~~d~~~~~~~~~p~ 470 (660) T protein:vir:10 392 HVVSIADERQDCLAFISPPKGLLVNVPLTRAVDNLIDWRTGAGTFDANNMNISTTY-AAIDGNYKYQYDKYNDVNRWVPL 470 (660) T ss_pred HHHHHHHhhCCEEEEEecCcccccccccccCHHHHHHHHhhcccccccccccCcce-EEEEcCceEEecccCCceeEech Confidence 556666666656655421111 0111 112222222111 12 22 233433 11 135 Q ss_pred HHHHHHHHHhcCCCCCCceeeEeeeecCccc-----cCCCCHHHHHHHHhCCceEEEEEcC-c-eEEecCEeecC---e- Q lcl|NC_013597. 310 VSSALARLLSTNFAANNSTLTLKFKQQPTIT-----ADEITATEFAKAKRLGINVYTYFDD-V-AMIAEGTVIGG---K- 378 (502) Q Consensus 310 ~aa~~g~~as~n~~~~~g~~T~~fk~~~Gv~-----~~~lt~t~~~~l~~~~~n~y~~~~~-~-~~~~~G~~~~G---~- 378 (502) .+.++|.++.+|.++.+. ....+|.+.||. ...+++.|.+.|..+|+|.+.++-+ . -.+|..+++++ + T Consensus 471 sg~~AGl~Ar~D~~~g~~-~sPan~~~~~i~g~~~~~~~~~~~e~~~Ln~~gIn~i~~~~~~~G~~~wG~rT~~~~~s~~ 549 (660) T protein:vir:10 471 AADLAGLCARTDDVSQPW-MSPAGYNRGQILNVLKLAIEPRQAQRDRMYQEAINPVVGFAGGDGFVLFGDKTATKVPSPM 549 (660) T ss_pred hHHHHHHHHHhhccCCcE-EccCCeeeceeeccceeeecCChhhHHhHhhCCceEEEEeeCCCcEEEEcccccCCCCccc Confidence 678889999887554221 112345544442 1357899999999999999998744 3 46788888876 2 Q ss_pred -ehhHHHHHHHHHHHHHHHHHHHHHhcCCCCccCHHHHHHHHHHHHHHHHHHHHcCccccccccCccccccccccccccc Q lcl|NC_013597. 379 -FADEIVILDWFVDAVQKEVFARLYKSPTKIPLTDKGQAILIAAVEKVCLEGINNGAFAPGKWTGAGFGNLSTGDYLDKG 457 (502) Q Consensus 379 -~iD~~~~~dwl~~~iq~~l~~~l~~~~~kiPyt~~G~~~l~~~v~~vl~~a~~~G~I~~g~~~~~~~g~~~~~~~~~~g 457 (502) ||-+.+-.+|+.+.|+......++. |.++.-...|+..|+.-|+..+++|.|. | T Consensus 550 ~~i~vrR~~~~i~~si~~~~~~~v~e-----pn~~~l~~~i~~~i~~fL~~l~~~gal~--------------------g 604 (660) T protein:vir:10 550 DHINVRRLFNMLKKNIGDASKYKLFE-----LNDNFTRSSFRMEVSQYLDGIKALGGIY--------------------E 604 (660) T ss_pred ceEehhhHHHHHHHHHHHHHHHhccC-----CCCHHHHHHHHHHHHHHHHHHHhCCcee--------------------e Confidence 5778889999999999998765533 6788899999999999999999999995 4 Q ss_pred eEEEcCchhcCCHHHHhhcccCceEEEEEECceEEEEEEEEEEeC Q lcl|NC_013597. 458 FYVWAAPMDTLSDSDRQARRATPIQTAVKLAGAIHSSDVIVNYNR 502 (502) Q Consensus 458 y~v~~~~~~~~s~~dra~R~~~~i~~~~~~aGaIh~v~i~~~v~~ 502 (502) |+|.++ .++.+++|+.+.+. -+.+.++..-.+++|.+++.-.| T Consensus 605 ~~V~~d-~~~nt~~di~~G~~-~~~i~~~P~~pae~I~~~~~~~~ 647 (660) T protein:vir:10 605 GRVVCD-TTVNTPAVIDRNEF-IANIYVKPARSINYITLNFVATS 647 (660) T ss_pred eEEEEc-CCCCCHHHhhCCeE-EEEEEEEecCCccEEEEEEEEee Confidence 889998 67889999999988 59999999999999999988766 No 49 >protein:vir:104477 Length: 749 # NCBI annotation: gp18 # Family: family:all:661 # MgeID: mge:1548 # MgeName: P-SSM4 # Cross-refs: genbank:acc:YP_214663;genbank:gi:61806304;genbank:GeneID:3294532 Probab=98.08 E-value=5.1e-06 Score=49.61 Aligned_cols=470 Identities=11% Similarity=0.060 Sum_probs=179.9 Q ss_pred CCcCcCceeEEeec-c---cccccccccccc--------eEEEecccccccccCccceEE-e-cCHHHHHhhcCCCcHHH Q lcl|NC_013597. 1 MALSISHIVNVQLN-T---VPKSAARKSFGI--------VALFTPEAGQAFADEKTRYVY-V-ENQRDVEQLFGTNSETA 66 (502) Q Consensus 1 Msip~s~iV~V~i~-~---~~~~~~~~~f~~--------~lil~~~~~~~~~~~~~r~~~-y-~s~~~v~~~fg~~s~ey 66 (502) ..+-+++++...+. . ...+......+. .+.+........+....+... . .+...... ....+.+ T Consensus 180 ~~~~~~~~~~~~~~~~~~t~~~~~~~~a~~~~~~~~~~~~~~~~~~s~~~~~~~a~~~~v~~~~~~~~~~~--~i~~~~~ 257 (749) T protein:vir:10 180 IILTIDDVVGTFAPGSATTITIGGSAESVNVLAYDATNKKLEIGLPSGGVTGILADNQVITQGTNTAKINV--TIERKLL 257 (749) T ss_pred eeeeeccccceeecccceeeeccCcccccccccccCCcceEEEeeecccccceeeeeeccccccccccccc--ccccchh Confidence 11111222111000 0 000000000000 011110000000000000000 0 00000000 0000011 Q ss_pred HHH---HHHhcC--CC---CcceEEEEEeecccccceeeee----eccchhhhHHHHHhhcccceeEEE---EecCcccc Q lcl|NC_013597. 67 KAA---QPFFAQ--SP---RAKQLIVARWQKSASTIEATKN----TLSGATLSDDLERFKSVVNGRFSL---TIGGDVKK 131 (502) Q Consensus 67 ~aA---~~~F~q--~p---~P~~l~igr~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~g~~~i---ti~g~~~~ 131 (502) ... ...|.- .. ...++-+..-...........+ .+...+.......-.......+.+ ..+|.... T Consensus 258 ~~~~~~~~~~a~~~~~~~~~g~~~~it~v~~~~~~~~~~t~~~~~~~a~~~gt~~~~~~~~g~~D~~~v~v~~~~g~~~~ 337 (749) T protein:vir:10 258 VALNKSSIEFAASDVVQDTNSTNITITSVRDEYTEREYLPGVKWINVAPRPGTSLYANGVGGHRDEMHVILVDIDGGVTG 337 (749) T ss_pred hhhccccceeeccccccCCccceeEEEeeeccccccccccceeeccccccccceeeeecccCCCCceEEEEecCCCeeee Confidence 100 000110 00 0111211111100000000000 000000000000000000001111 11111110 Q ss_pred c-----c-ccccccccchh---hHHHHHHhhhcccccceeEEEecccce-eeEeeecc-c----ccccceeeeeeccccc Q lcl|NC_013597. 132 V-----D-GLSFARLADFN---AVATKIQEKLTTLSVAVSIAYDETGNR-FIVSANVA-G----EDKKTEIDYAIDEGGE 196 (502) Q Consensus 132 ~-----~-~i~~s~~ts~~---~vA~~i~aal~~a~~~~tv~~~~~~~~-f~~~s~tt-G----~~~~v~~~~a~~~~~t 196 (502) . + -++++.+.+.. +...-+...+..... .+.+...... +..+.... + ...............+ T Consensus 338 ~~g~v~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~--~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 415 (749) T protein:vir:10 338 TVGALLERYIDVSKASDAKTSVGETNYYAEVIKQKSE--FIYWAEHESTLYAATSSASDGLFGQTAANRQFNLFRSAAGS 415 (749) T ss_pred cccceeeeeeeccccccccccccccchhhhhhccCCC--EEEEEecccccccccccccccccccccccceeecccccccc Confidence 0 0 01111111100 000000001111000 0110000000 00000000 0 0000000000000000 Q ss_pred hhhhhhhhhhcccccceeeeeccc-----------cccccCHHHHHHHHHhccCceeEEEEe--cCCC----hhHHHHHH Q lcl|NC_013597. 197 GEYIGALLKLENGQASRKVGKNSV-----------SLKKETLGEALFNVAEVNNTWYGFTVA--AQLT----DSEVEAAA 259 (502) Q Consensus 197 ~t~~aa~l~~t~~~~~~~v~v~~~-----------~~~~et~~~al~al~~~~~~w~~~~~~--~~~~----~~~~~a~a 259 (502) ..+......+....+......... .........++..+......-.-+++. ...+ .....++. T Consensus 416 ~~~~~~~~~~~~~~~~~~~~~~~gg~d~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~li~~~~~~~~~~~~~v~~al~ 495 (749) T protein:vir:10 416 VDYPAGVTTLGSKNNATYYYRLSGGVNYTVSAGQYTITNTDIGSAYELIGDPESQIVDFIISGPSGTSDANALAKITSLV 495 (749) T ss_pred ceeccccccccccCCcEEEEEccCCcccccccccccccchhHHHHHHHhhhhhhcccceEEEecCCCCcchhHHHHHHHH Confidence 001111111111111111110010 111223345555554433222223222 1122 23455666 Q ss_pred HHHhhcCCEEEEEecCch-hcc-----cchhHHHHHHHHccCCceEEEecC-------Cc-----cchHHHHHHHHHhcC Q lcl|NC_013597. 260 KYAQANTKLFGANVIRAE-QIE-----WSADNIYKKLYDAGLDHTLAMFDK-------ND-----MYPVSSALARLLSTN 321 (502) Q Consensus 260 ~w~~a~~~~~~~~~~~~~-~~~-----~~~~~i~~~l~~~~~~~t~~~y~~-------~~-----~~~~aa~~g~~as~n 321 (502) ..++....++.+..-... .+. ....+.....+.....+-..+|++ .+ --+.+.++|.++.+| T Consensus 496 ~~~~~~~~~~~~~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~vAGl~Ar~D 575 (749) T protein:vir:10 496 NIAEERRDCMVFVSPRRGNVIGISNTTTITTNIVDFFKKLPSSSYMVFDSGYKYIYDKYNDVYRYIPCNGDTAGLCLQTN 575 (749) T ss_pred HHHhhcCCEEEEEcCCCCcccccccchhhhhHHHHHHhhccCceeEEEEccceeeeccccCceEEechHHHHHHHHHHhh Confidence 777766665554321111 111 011122222122121222334432 11 124677889999888 Q ss_pred CCCCCceeeEeeeecC---ccc--cCCCCHHHHHHHHhCCceEEEEEcCc-eEEecCEeecC-----eehhHHHHHHHHH Q lcl|NC_013597. 322 FAANNSTLTLKFKQQP---TIT--ADEITATEFAKAKRLGINVYTYFDDV-AMIAEGTVIGG-----KFADEIVILDWFV 390 (502) Q Consensus 322 ~~~~~g~~T~~fk~~~---Gv~--~~~lt~t~~~~l~~~~~n~y~~~~~~-~~~~~G~~~~G-----~~iD~~~~~dwl~ 390 (502) ..+.+. .....|++. |+. ...+++.|.+.|..+|+|....+.+. ..+|.++++.+ .||-+.+-.+|++ T Consensus 576 ~~~g~~-~SPan~~~~~i~g~~~~~~~~~~~e~~~Ln~~gIn~i~~~~g~G~~~wG~rT~~s~d~~~~~i~vRRl~~~ie 654 (749) T protein:vir:10 576 EISEPW-FSPAGFQRGVLRNAIKLAYTPNKAQRDQLYANRVNPIVSFPGQGVVLYGDKTALGFASAFDRINIRRLFLTVE 654 (749) T ss_pred ccCCcE-ECcCCceeeeeeccccceeecChhHHHhhhhCCceEEEEecCCeEEEEcceecCCCCcccceeehhhhHHHHH Confidence 654311 111244433 332 23568999999999999999998776 46788888754 3788889999999 Q ss_pred HHHHHHHHHHHHhcCCCCccCHHHHHHHHHHHHHHHHHHHHcCccccccccCccccccccccccccceEEEcCchhcCCH Q lcl|NC_013597. 391 DAVQKEVFARLYKSPTKIPLTDKGQAILIAAVEKVCLEGINNGAFAPGKWTGAGFGNLSTGDYLDKGFYVWAAPMDTLSD 470 (502) Q Consensus 391 ~~iq~~l~~~l~~~~~kiPyt~~G~~~l~~~v~~vl~~a~~~G~I~~g~~~~~~~g~~~~~~~~~~gy~v~~~~~~~~s~ 470 (502) ..|+..+...++. |.++.=...|+..|+.-|+..+++|.|. ||.|.+. .+..++ T Consensus 655 ~si~~~~~~~v~e-----pn~~~l~~~i~~~i~~fL~~l~~~G~i~--------------------~f~V~~d-~~~Nt~ 708 (749) T protein:vir:10 655 RVISTAAKAQLFE-----QNDEAQRSLFINIVEPYLRDVQGRRGVV--------------------DFLVKCD-STNNTP 708 (749) T ss_pred HHHHHHHHHhhcC-----CCCHHHHHHHHHHHHHHHHHHHhcCCee--------------------eeEEEEc-CCCCCH Confidence 9999998665543 6788888999999999999999999873 5899998 778899 Q ss_pred HHHhhcccCceEEEEEECceEEEEEEEEEEeC Q lcl|NC_013597. 471 SDRQARRATPIQTAVKLAGAIHSSDVIVNYNR 502 (502) Q Consensus 471 ~dra~R~~~~i~~~~~~aGaIh~v~i~~~v~~ 502 (502) +|+.+.+. .+.+.++....+++|.+++.-.| T Consensus 709 ~~i~~G~~-~~~i~~~P~~pae~I~~~~~~~~ 739 (749) T protein:vir:10 709 EAVDRGEF-YAEVFLKPTRTINYVQLTFVATR 739 (749) T ss_pred HHhhCCEE-EEEEEEEecCCccEEEEEEEEee Confidence 99999888 69999999999999999876444 No 50 >protein:vir:106984 Length: 743 # NCBI annotation: contractile sheath protein gp18 # Family: family:all:661 # MgeID: mge:1459 # MgeName: S-PM2 # Cross-refs: genbank:acc:YP_195136;genbank:gi:58532913;uniprot:Q5GQN6;genbank:GeneID:3260481 Probab=98.06 E-value=5.5e-06 Score=49.41 Aligned_cols=463 Identities=9% Similarity=-0.013 Sum_probs=172.8 Q ss_pred CCcCcCce-eEEeecccccccccccccc--eEEEecccccccccCccceEEecCHHHHH-------hhcC---CCcHHHH Q lcl|NC_013597. 1 MALSISHI-VNVQLNTVPKSAARKSFGI--VALFTPEAGQAFADEKTRYVYVENQRDVE-------QLFG---TNSETAK 67 (502) Q Consensus 1 Msip~s~i-V~V~i~~~~~~~~~~~f~~--~lil~~~~~~~~~~~~~r~~~y~s~~~v~-------~~fg---~~s~ey~ 67 (502) -++..+.. +...+.-...+.....+.. ....+.... .............. .+.+ ....... T Consensus 181 ~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~------~~~~~~~~~~~~~~~~~tv~v~~~~~~vg~~v~~~ 254 (743) T protein:vir:10 181 ATITASGTLTSQYLLDTPEQGLIGSFTDNSTTEVGRTPG------TYSNVPASGGTGTGATFNVVVADAGGGVGGSVVVT 254 (743) T ss_pred eeeeeccccceeeeccccccccccccccccccccccccc------ceeeEEecccccccccccccccccccccccccccc Confidence 00000000 0000000000000000000 000000000 00000000000000 0000 0000000 Q ss_pred HHHHHhc--C----------------------CCCcceEEEEEeecccc-cc-eeeeeec---cchhhhHHHHHhhcccc Q lcl|NC_013597. 68 AAQPFFA--Q----------------------SPRAKQLIVARWQKSAS-TI-EATKNTL---SGATLSDDLERFKSVVN 118 (502) Q Consensus 68 aA~~~F~--q----------------------~p~P~~l~igr~~~~~~-~~-~~~~~~~---~~~~~~~~~~~~~~~~~ 118 (502) ++..-+. + ......+.+........ .. .+++..+ ...+.......-..... T Consensus 255 ~~~~~~~~~~~~~~~v~~~~~~~~t~~~~~~~~~~~g~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~~ 334 (743) T protein:vir:10 255 LANPGTGYNQGETLTIASAATGDGTDILVTVATLSDGTIAITELKDWYLNTEIGSTGIKLGDIGPRPGTSQFATDNGITD 334 (743) T ss_pred cccccceeeeccccccccccccccccchhheecccccceeeeecccccccchhhccccccccccccceeeeccccccccc Confidence 0000000 0 00000000000000000 00 0000000 00000000000000000 Q ss_pred eeEEEEecCccccccccccccccchhhHHHHHHhhhccccccee-EEEecccceeeEeeecccccccceeeeeeccccch Q lcl|NC_013597. 119 GRFSLTIGGDVKKVDGLSFARLADFNAVATKIQEKLTTLSVAVS-IAYDETGNRFIVSANVAGEDKKTEIDYAIDEGGEG 197 (502) Q Consensus 119 g~~~iti~g~~~~~~~i~~s~~ts~~~vA~~i~aal~~a~~~~t-v~~~~~~~~f~~~s~ttG~~~~v~~~~a~~~~~t~ 197 (502) ..+.+.+-.....+..-.-.....+..+......+ ...+.... ..+-.....+.......+... ............. T Consensus 335 d~~~v~v~~~~~~~~~~~~~v~~~~~~~s~~~~~~-~~~~~~~~~~~~~~~~s~~~~~~~~~~~~~-~~~~~~~~~~~~~ 412 (743) T protein:vir:10 335 DQVHFAVIDTTGELTGTANTIVERLTYLSKLSDAR-SEENANIYYKNVINEQSAYLYHGNDAAVQI-AASGEAWGQSSDQ 412 (743) T ss_pred cceEEEEecCcceeeeccCceeEEEeeeecccccc-cccCcceeecceeccccceeeccCccccee-eeccccCccccce Confidence 00000000000000000000000000000000000 00000000 000000000000000000000 0000000000000 Q ss_pred hhhhhhhhhcccccceeeeeccccc-----cccCHHHHHHHHHhccCceeEEEEecC--C----ChhHHHHHHHHHhhcC Q lcl|NC_013597. 198 EYIGALLKLENGQASRKVGKNSVSL-----KKETLGEALFNVAEVNNTWYGFTVAAQ--L----TDSEVEAAAKYAQANT 266 (502) Q Consensus 198 t~~aa~l~~t~~~~~~~v~v~~~~~-----~~et~~~al~al~~~~~~w~~~~~~~~--~----~~~~~~a~a~w~~a~~ 266 (502) .+..... .........+.+ ..|. .......++..+.....-...++++.. . ...-+.++.+.++... T Consensus 413 ~~~~~~~-~~~~~~~~~~~~-~gG~d~~~~~~~~~~~~~~~~~~~~~~~~~ll~~p~~~~~~~~~~~v~~a~~~~~~~~~ 490 (743) T protein:vir:10 413 VLADAGT-AFSRTTGYWVNL-AGGNDDFAYDAGEFGAAMDLFLDTEETEIDFVLMGGSMADEADTKSKATKVIAIAASRK 490 (743) T ss_pred eeeeccc-ccccccceEEEe-ecCccccccchhHHHHHHHHhhhccccCcceEEecCcccCccchHHHHHHHHHHHHhhC Confidence 0000000 000000111111 1111 112233444544443222223444321 1 1334556677777666 Q ss_pred CEEEEEecCchh---------c---ccchhHHHHHHHHccCCceEEEecC-------Cc-----cchHHHHHHHHHhcCC Q lcl|NC_013597. 267 KLFGANVIRAEQ---------I---EWSADNIYKKLYDAGLDHTLAMFDK-------ND-----MYPVSSALARLLSTNF 322 (502) Q Consensus 267 ~~~~~~~~~~~~---------~---~~~~~~i~~~l~~~~~~~t~~~y~~-------~~-----~~~~aa~~g~~as~n~ 322 (502) .+|.+....... . ....+.+...-.....+|.+ +|++ .+ ..+.+.++|.++.+|. T Consensus 491 ~~~a~~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~-~~~p~~~~~d~~~~~~~~~p~s~~~AGl~a~~D~ 569 (743) T protein:vir:10 491 DALAFVSPHKGNQIASTGNVALSSAQQKENTIAFFSDLTSTSYAV-FDSGYKYVYDRFTDKYRYIPCNGDVAGLCVQTSN 569 (743) T ss_pred CeEEEEecCCCccccccccccccccccchHHHHHHHhccCCeeEE-EEccceeeeccccCceeEechhHHHHHHHHHhhc Confidence 566554321110 0 00112222221111223333 3332 11 1345778888898876 Q ss_pred CCCCceeeEeeeecCcccc-----CCCCHHHHHHHHhCCceEEEEEcCc-eEEecCEeecC-----eehhHHHHHHHHHH Q lcl|NC_013597. 323 AANNSTLTLKFKQQPTITA-----DEITATEFAKAKRLGINVYTYFDDV-AMIAEGTVIGG-----KFADEIVILDWFVD 391 (502) Q Consensus 323 ~~~~g~~T~~fk~~~Gv~~-----~~lt~t~~~~l~~~~~n~y~~~~~~-~~~~~G~~~~G-----~~iD~~~~~dwl~~ 391 (502) ++.+ -.....|.+.||.- ..+++.|++.|..+++|.+..+.+. ..+|..+++.+ .||-+.+-.+|++. T Consensus 570 ~~g~-~~span~~~~gi~g~~~~~~~~~~~~~~~Ln~~gIn~i~~~~~~G~~~wG~rT~~s~d~~~~~i~vrR~~~~i~~ 648 (743) T protein:vir:10 570 QLDD-WYSPAGLNRGGILNAVKLAYNPNKADRDELYQNRINPVVSLRGQGITLFGDKTALAAPSAFDRINVRRLFLNLEK 648 (743) T ss_pred cCCc-EEccCCeeeeeeeccccceecCChhHHHhHhhCCceEEEEecCCeEEEEcccccCCCCcccceEeehhhHHHHHH Confidence 5432 12334455555532 2478999999999999999998765 57888888866 27888899999999 Q ss_pred HHHHHHHHHHHhcCCCCccCHHHHHHHHHHHHHHHHHHHHcCccccccccCccccccccccccccceEEEcCchhcCCHH Q lcl|NC_013597. 392 AVQKEVFARLYKSPTKIPLTDKGQAILIAAVEKVCLEGINNGAFAPGKWTGAGFGNLSTGDYLDKGFYVWAAPMDTLSDS 471 (502) Q Consensus 392 ~iq~~l~~~l~~~~~kiPyt~~G~~~l~~~v~~vl~~a~~~G~I~~g~~~~~~~g~~~~~~~~~~gy~v~~~~~~~~s~~ 471 (502) .|+..+...++. |.++.=...|+..|+.-|++.+++|.|. ||.|.+. .+..+++ T Consensus 649 si~~~~~~~v~e-----~n~~~~~~~i~~~i~~fL~~l~~~gal~--------------------~~~V~~d-~~~nt~~ 702 (743) T protein:vir:10 649 RARRLAEGVLFE-----QNDATTRAGFSSALNSYLSEVQARRGVT--------------------DYLVICD-ESNNTPD 702 (743) T ss_pred HHHHHHHHhccC-----CCCHHHHHHHHHHHHHHHHHHHhcCcee--------------------eeEEEEc-CCCCCHH Confidence 999998776644 5688889999999999999999999872 5899997 6889999 Q ss_pred HHhhcccCceEEEEEECceEEEEEEEEEEeC Q lcl|NC_013597. 472 DRQARRATPIQTAVKLAGAIHSSDVIVNYNR 502 (502) Q Consensus 472 dra~R~~~~i~~~~~~aGaIh~v~i~~~v~~ 502 (502) |+.+.+. -+.+.++....+++|.+++.-.| T Consensus 703 ~i~~G~~-~~~i~~~p~~pae~I~~~~~~~~ 732 (743) T protein:vir:10 703 IIDRNEF-VAEVYVKPTRSINFITITFTATK 732 (743) T ss_pred HhhCCeE-EEEEEEEecCCcceEEEEEEEee Confidence 9999998 59999999999999999987544 No 51 >protein:vir:10336 Length: 386 # NCBI annotation: ORF39 # Family: family:all:115 # MgeID: mge:182 # MgeName: VHML # Cross-refs: genbank:acc:NP_758931;genbank:gi:27311206;genbank:GeneID:956116 Probab=98.00 E-value=7.7e-06 Score=48.61 Aligned_cols=363 Identities=12% Similarity=0.044 Sum_probs=196.2 Q ss_pred CCcCcCceeEEe-ecccccccccccccceEEEeccccc--ccccCccceEEecCHHHHHhhcCCCcHHHHHHHHHhcCCC Q lcl|NC_013597. 1 MALSISHIVNVQ-LNTVPKSAARKSFGIVALFTPEAGQ--AFADEKTRYVYVENQRDVEQLFGTNSETAKAAQPFFAQSP 77 (502) Q Consensus 1 Msip~s~iV~V~-i~~~~~~~~~~~f~~~lil~~~~~~--~~~~~~~r~~~y~s~~~v~~~fg~~s~ey~aA~~~F~q~p 77 (502) |+--...=|.|. +.-.+.++....-..+.|++..... ...+..+.++ .++..+....||..-..+++...+|.+.. T Consensus 1 M~~~~~~Gv~v~ev~~~~~~i~~v~tav~~~vg~a~~a~~~~~~~~~pv~-i~s~~~~~~~~g~~~tl~~a~~~~~~~gg 79 (386) T protein:vir:10 1 MAEQYLHGAEVVEIDNGARPIRTAQSGVIGLVGTAPDADATAFPLNTPVL-IAGSRREAAKLGAGGTLPQAIDGIFDQTG 79 (386) T ss_pred CccccCCCeEEEEcCCCcccccccCcceeEEEEecCCCCCcccccccceE-ecchHHHHhhcCCCcchhHHHHHHhccCc Confidence 885443333333 2222344555566667777754321 1111123344 45556666778888899999999998764 Q ss_pred CcceEEEEEeecccccceeeeeeccchhhhHHHHHhhcccceeEEEEecCccccccccccccccchhhHHHHHHhhhccc Q lcl|NC_013597. 78 RAKQLIVARWQKSASTIEATKNTLSGATLSDDLERFKSVVNGRFSLTIGGDVKKVDGLSFARLADFNAVATKIQEKLTTL 157 (502) Q Consensus 78 ~P~~l~igr~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~iti~g~~~~~~~i~~s~~ts~~~vA~~i~aal~~a 157 (502) .+ .++-+..... ....+... .+++... .+ ...+.+.. +... T Consensus 80 ~~--~~vv~~~~~~-~~~~t~~~-----------------------~ig~~~~---------~t---~~~tgl~~-l~~~ 120 (386) T protein:vir:10 80 AV--VVVIRVDEGV-DSAATQSN-----------------------VIGKVDA---------DT---EQYTGILA-LLSA 120 (386) T ss_pred ee--EEEeeccccc-cccccchh-----------------------hhccccc---------cc---chhhhhHH-hhhh Confidence 33 3332221110 00000000 0000000 00 00000000 0000 Q ss_pred ccceeEEEecccceeeEeeecccccccceeeeeeccccchhhhhhhhhhcccccceeeeeccccccccCHHHHHHHHHhc Q lcl|NC_013597. 158 SVAVSIAYDETGNRFIVSANVAGEDKKTEIDYAIDEGGEGEYIGALLKLENGQASRKVGKNSVSLKKETLGEALFNVAEV 237 (502) Q Consensus 158 ~~~~tv~~~~~~~~f~~~s~ttG~~~~v~~~~a~~~~~t~t~~aa~l~~t~~~~~~~v~v~~~~~~~et~~~al~al~~~ 237 (502) ... +... ..+.. ..+. .......+.+... T Consensus 121 ~~~-----------~~~~-------p~i~~-------------------------------ap~~--~~~~~v~~~l~~~ 149 (386) T protein:vir:10 121 ENT-----------VKVQ-------PRILI-------------------------------APGF--SNQKAVADQLVSV 149 (386) T ss_pred ccc-----------cccc-------ccccc-------------------------------cccc--cchhHHHHHHHHh Confidence 000 0000 00000 0000 0011122233333 Q ss_pred cCceeEEEEecCCChhHHHHHHHHHhhcCCEEEEEecC-chhcccchhHHHHHHHHccCCceEEEecCCccchHHHHHHH Q lcl|NC_013597. 238 NNTWYGFTVAAQLTDSEVEAAAKYAQANTKLFGANVIR-AEQIEWSADNIYKKLYDAGLDHTLAMFDKNDMYPVSSALAR 316 (502) Q Consensus 238 ~~~w~~~~~~~~~~~~~~~a~a~w~~a~~~~~~~~~~~-~~~~~~~~~~i~~~l~~~~~~~t~~~y~~~~~~~~aa~~g~ 316 (502) ...+-.+...+... .......+|.+.-...+....+. ....+. ......++ .+.+.++|. T Consensus 150 ~~~~~~~~~~~~~~-~~~~~a~~~~~~~~s~~~~~~~p~~~v~~~-------------~~~~~~~~-----p~s~~~ag~ 210 (386) T protein:vir:10 150 ADTAAWLCHSGWSN-TTDAAAITYRELFGSRRCEVVDPWYKVWDV-------------ETSAHIIQ-----PPSARHAGV 210 (386) T ss_pred hcceEEEEEeCCCC-CchHHHHHhhhcccccceEEecCceeeecc-------------ccccceee-----chHHHHHHH Confidence 33444444444322 22233334554322111111100 000000 00001111 245677888 Q ss_pred HHhcCCCCCCceeeEeeeecCccccC--------CCCHHHHHHHHhCCceEEEEEcCceEEecCEeecCe----ehhHHH Q lcl|NC_013597. 317 LLSTNFAANNSTLTLKFKQQPTITAD--------EITATEFAKAKRLGINVYTYFDDVAMIAEGTVIGGK----FADEIV 384 (502) Q Consensus 317 ~as~n~~~~~g~~T~~fk~~~Gv~~~--------~lt~t~~~~l~~~~~n~y~~~~~~~~~~~G~~~~G~----~iD~~~ 384 (502) ++.+|....+ -.....|.+.||.-- ..+..|.+.|..+|+|....-.| ..+|.+++++++ ||-+.+ T Consensus 211 ~a~~D~~~G~-~~spaN~~l~gv~~~~~~~~~~~~~~~~~~~~l~~~gi~~~~~~~G-~~~wG~rT~~~d~~~~~i~vrR 288 (386) T protein:vir:10 211 MAKVHNTLGF-WWSNSNQEILGIDGLCRPVDFKLDDPTCRANLLNAKEVTTTIQQNG-FRVWGDRTCSADSKWAFKNVVI 288 (386) T ss_pred HHHhhhcCCc-EEccCCceeecccccceecccccccCcchhhhhhhcCcEEEEcCCC-EEEEcccccCCCcccceeehhh Confidence 8888764421 123345666665422 24688999999999998865434 578899998875 788888 Q ss_pred HHHHHHHHHHHHHHHHHHhcCCCCccCHHHHHHHHHHHHHHHHHHHHcCccccccccCccccccccccccccceEEEcCc Q lcl|NC_013597. 385 ILDWFVDAVQKEVFARLYKSPTKIPLTDKGQAILIAAVEKVCLEGINNGAFAPGKWTGAGFGNLSTGDYLDKGFYVWAAP 464 (502) Q Consensus 385 ~~dwl~~~iq~~l~~~l~~~~~kiPyt~~G~~~l~~~v~~vl~~a~~~G~I~~g~~~~~~~g~~~~~~~~~~gy~v~~~~ 464 (502) -.+|+...|+..+...++. |.+..=...|+..++.-|+..+++|.|. ||.|.+. T Consensus 289 ~~~~i~~~~~~~~~~~v~e-----~~~~~~~~~i~~~i~~~L~~l~~~g~l~--------------------g~~v~~d- 342 (386) T protein:vir:10 289 TNDMIADSLVRNHLWAVDR-----NITKTYVEDVTEGVNNYLRHLKNIGAIA--------------------GGECWVD- 342 (386) T ss_pred HHHHHHHHHHHHHHHhccC-----CCCHHHHHHHHHHHHHHHHHHHhCCcee--------------------eeEEEEc- Confidence 9999999999998665532 7788889999999999999999999985 4788887 Q ss_pred hhcCCHHHHhhcccCceEEEEEECceEEEEEEEEEEeC Q lcl|NC_013597. 465 MDTLSDSDRQARRATPIQTAVKLAGAIHSSDVIVNYNR 502 (502) Q Consensus 465 ~~~~s~~dra~R~~~~i~~~~~~aGaIh~v~i~~~v~~ 502 (502) ++..+++|+.+.+. -+.+.+.....+++|.++...+. T Consensus 343 ~~~nt~~~~~~G~~-~~~i~~~p~~p~e~i~~~~~~~~ 379 (386) T protein:vir:10 343 PELNSPDQIQQGKV-YFDYDFSAYAPAEHITFRSHMVN 379 (386) T ss_pred ccCCCHHHhhCCeE-EEEEEEEecCCceeEEEEEEEeh Confidence 67889999999998 59999999999999999998887 No 52 >protein:vir:7653 Length: 581 # NCBI annotation: gp124 # Family: family:all:1196 # MgeID: mge:148 # MgeName: Bxz1 # Cross-refs: genbank:acc:NP_818197;genbank:gi:29566631;genbank:GeneID:1259809 Probab=97.98 E-value=8.1e-06 Score=48.49 Aligned_cols=439 Identities=12% Similarity=0.025 Sum_probs=181.5 Q ss_pred CC---cCcCceeEEeecccccccccccccceEEEecccccccccCccceEEecCHHHHHhhcCCCcHHHHHHHHHhcCCC Q lcl|NC_013597. 1 MA---LSISHIVNVQLNTVPKSAARKSFGIVALFTPEAGQAFADEKTRYVYVENQRDVEQLFGTNSETAKAAQPFFAQSP 77 (502) Q Consensus 1 Ms---ip~s~iV~V~i~~~~~~~~~~~f~~~lil~~~~~~~~~~~~~r~~~y~s~~~v~~~fg~~s~ey~aA~~~F~q~p 77 (502) +. -+.+.+-.|++.-.+ ....| -|-|..++.. .+-.-.++.+| |.|-.-...+ T Consensus 51 ~~p~~~~~~evq~v~~~~~~---t~G~f--tLt~~g~tT~-------~I~~~asa~~v-----------~~AL~~L~~i- 106 (581) T protein:vir:76 51 INPDTGETITTQILALVGEP---TGGSF--KLSLAGEPTG-------NIPFNATQGQV-----------QSALRALPNV- 106 (581) T ss_pred ecCCCCCCCceEEEEEeecC---CcceE--EEEeCceecc-------ccccCCCHHHH-----------HHHHhhccCC- Confidence 11 112222222221111 11223 3333333211 12222334444 3333333221 Q ss_pred CcceEEEE---------Eeecccccceeeeeeccchhh-hHHHH-HhhcccceeEEEEecCccccccccccc-----ccc Q lcl|NC_013597. 78 RAKQLIVA---------RWQKSASTIEATKNTLSGATL-SDDLE-RFKSVVNGRFSLTIGGDVKKVDGLSFA-----RLA 141 (502) Q Consensus 78 ~P~~l~ig---------r~~~~~~~~~~~~~~~~~~~~-~~~~~-~~~~~~~g~~~iti~g~~~~~~~i~~s-----~~t 141 (502) .+..+.+- .+...-.........|.+..- ..... .-+.+....++++.+|...+.....-+ .+. T Consensus 107 ~~~~v~vtg~~~~~~~V~F~g~~~~~~~~~~~ltg~~~~~~~V~~~~~G~~~~~~~l~~~g~~~~~~~~~~s~~~~~~~l 186 (581) T protein:vir:76 107 EDDEVTVLGDPGGPWTVTFTKAVAALTKDVTGLTGGDNPDLNIASEQTGVPAMNRALAKKGIKTDTIRVVNPNSGQVYVL 186 (581) T ss_pred CCceEEEEcCCCceEEEEEcCCccceeEeeeeeecCCcceeEEEEEecCcCCcCceeeeccccccccceeecCCcceeee Confidence 11111110 010000000010111111100 00000 001111223444444433221100000 000 Q ss_pred chhhHHHHH------HhhhcccccceeEEEecc----cceeeEeeecccccccceeeee------------ec--cccch Q lcl|NC_013597. 142 DFNAVATKI------QEKLTTLSVAVSIAYDET----GNRFIVSANVAGEDKKTEIDYA------------ID--EGGEG 197 (502) Q Consensus 142 s~~~vA~~i------~aal~~a~~~~tv~~~~~----~~~f~~~s~ttG~~~~v~~~~a------------~~--~~~t~ 197 (502) +-....... ++.++..........+.. ...+..............+.+. .. .+... T Consensus 187 ~~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~g~~~~~~~i~~~~~~~~D~~~~~~v~~~~~~~~~~~~~~~~~~~g~~~~ 266 (581) T protein:vir:76 187 GTDYVVTRVNAGEDGEANTRDDLYTIQRVVDGGHIDPGDIVQLSYRYTDPNYHEVIRFTDPDDIQDFYGPAFDEAGNVQS 266 (581) T ss_pred cccccceeeccCcccceeeeeeeeeeEeecccccccceeEEEEEEEeecCCccceEEEecccccccceeeehhhcCcccc Confidence 000000000 000000000000000000 0000000000000000001110 00 00000 Q ss_pred -hhhhhhhhhcccccceeeeecccc----ccccCHHHHHHHHHhccCceeEEEEecCCChhHHHHHHHHHhhcC---C-E Q lcl|NC_013597. 198 -EYIGALLKLENGQASRKVGKNSVS----LKKETLGEALFNVAEVNNTWYGFTVAAQLTDSEVEAAAKYAQANT---K-L 268 (502) Q Consensus 198 -t~~aa~l~~t~~~~~~~v~v~~~~----~~~et~~~al~al~~~~~~w~~~~~~~~~~~~~~~a~a~w~~a~~---~-~ 268 (502) .-+.+.+.++.... ..+.....+ ...++..++|+++.++. +..+++++..+.+-+.++..|++..+ + + T Consensus 267 e~~~~~~~~~t~~~~-~~l~~gvd~~g~tvt~~dy~~aL~ale~~~--~~~ivvp~t~~~~i~a~l~ahv~~~s~~~~~~ 343 (581) T protein:vir:76 267 EITLCAQLAITNGAS-TILACAVDPEGDTVTMGDYQNALNKFRDED--EIAIIVAGTGAQPIQALVQQHVSAQSNNKYER 343 (581) T ss_pred chhhhhheeeccccc-eEEEeeecCCCCccchHHHHHHHHHHhcCC--eEEEEEecCCChHHHHHHHHHHHHHHhccCCc Confidence 01112222333222 222211111 22334678888887753 33334544433333345778886532 1 1 Q ss_pred E-EEEe-cCchhcccchhHHHHHHHHccCCceEEEec------CC--------ccc-hHHHHHHHHHhcCCCCCCceeeE Q lcl|NC_013597. 269 F-GANV-IRAEQIEWSADNIYKKLYDAGLDHTLAMFD------KN--------DMY-PVSSALARLLSTNFAANNSTLTL 331 (502) Q Consensus 269 ~-~~~~-~~~~~~~~~~~~i~~~l~~~~~~~t~~~y~------~~--------~~~-~~aa~~g~~as~n~~~~~g~~T~ 331 (502) . ...+ -... ..............+..|...++. .+ ..| .++.+.|..+..++.. .+ T Consensus 344 ra~igv~g~~~--~~~~~~~~~~a~~~ns~Rvvlv~p~~~~~~g~~~~~~~~lp~~~~AA~vAG~~a~~~~~~-----sl 416 (581) T protein:vir:76 344 RAILGMDGSVT--PVPSATRIANAQSIKDQRVALISPSSFVYYAPELNREVVLGGQFMAAAVAGKSVSAIAAM-----PL 416 (581) T ss_pred eEEEEeeCCCC--CchHHHHHHhhcccCCCcEEEEEcCceEeccccCCcceecchhhhhhhHHhhhhcccccc-----Cc Confidence 1 1111 1100 011112222334456678776653 11 112 2344445555555433 45 Q ss_pred eeeecCccccC--CCCHHHHHHHHhCCceEEEEEcCce-EEecCEeec---C--eehhHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_013597. 332 KFKQQPTITAD--EITATEFAKAKRLGINVYTYFDDVA-MIAEGTVIG---G--KFADEIVILDWFVDAVQKEVFARLYK 403 (502) Q Consensus 332 ~fk~~~Gv~~~--~lt~t~~~~l~~~~~n~y~~~~~~~-~~~~G~~~~---G--~~iD~~~~~dwl~~~iq~~l~~~l~~ 403 (502) -||.++|+..- .++.+|++.|..+|++.+....+.. .+.+|...- . +.|-.++-.|.+...+++.+-..-|- T Consensus 417 T~~~i~g~~~~~~~~s~~e~e~ll~~Gv~~l~~~~~~~v~Iv~gItT~~s~~~~k~i~viR~~D~v~~~vr~~~~~~~fi 496 (581) T protein:vir:76 417 TRKVIRGFSGPAEVQRDGEKSRESSEGLMVIEKTPRNLVHVRHGVTTDPTSLHTREWNIIGQQDVMVYRIRDYLDADGLI 496 (581) T ss_pred ccccccccccccccCCHHHHHHHHhCCeEEEEEecCCeEEEEEeeecCCCCCccceeeehhhhHHHHHHHHHHHhhhcCC Confidence 67888888743 6799999999999999999765543 345665442 2 45778899999999998886322233 Q ss_pred cCCCCccCHHHHHHHHHHHHHHHHHHHHcCccccccccCccccccccccccccceEEEcCchhcCCHHHHhhcccCceEE Q lcl|NC_013597. 404 SPTKIPLTDKGQAILIAAVEKVCLEGINNGAFAPGKWTGAGFGNLSTGDYLDKGFYVWAAPMDTLSDSDRQARRATPIQT 483 (502) Q Consensus 404 ~~~kiPyt~~G~~~l~~~v~~vl~~a~~~G~I~~g~~~~~~~g~~~~~~~~~~gy~v~~~~~~~~s~~dra~R~~~~i~~ 483 (502) ++ |=++.|...|++.+++.|.+..++|+|..-. .. .. +..+++ +..--+.+ T Consensus 497 --G~-~n~~~~r~~ik~~i~~~L~~l~~~g~I~g~~-~~----------------~~--------~~~~~~-~d~v~V~i 547 (581) T protein:vir:76 497 --GM-PIYDTTIVQVKASAEAALVWLVDNNIIRGYR-NL----------------KA--------RQIERQ-PDVIEVRY 547 (581) T ss_pred --Cc-ccChHHHHHHHHHHHHHHHHHHhcCcccCcc-cc----------------ee--------eEEecC-CCEEEEEE Confidence 34 7889999999999999999999999996211 00 00 011111 11124788 Q ss_pred EEEECceEEEEEEEEEEeC Q lcl|NC_013597. 484 AVKLAGAIHSSDVIVNYNR 502 (502) Q Consensus 484 ~~~~aGaIh~v~i~~~v~~ 502 (502) .++..-+|.+|.+++.++- T Consensus 548 ~v~Pv~~ie~I~vt~~~~p 566 (581) T protein:vir:76 548 EWRPAYPLNYIVVRYSIAP 566 (581) T ss_pred EEEecccceEEEEEEEEee Confidence 8999999999999999888 No 53 >protein:vir:101187 Length: 663 # NCBI annotation: tail sheath protein # Family: family:all:661 # MgeID: mge:1582 # MgeName: 44RR2.8t # Cross-refs: genbank:acc:NP_932509;genbank:gi:37651635;genbank:GeneID:2610680 Probab=97.96 E-value=9.1e-06 Score=48.22 Aligned_cols=466 Identities=12% Similarity=0.032 Sum_probs=220.4 Q ss_pred CCcCcCceeEEeecccccccccccccceEEEecccccccccCccceEEecCHHHHHhhcCC---CcHHHHHHHHHhcCCC Q lcl|NC_013597. 1 MALSISHIVNVQLNTVPKSAARKSFGIVALFTPEAGQAFADEKTRYVYVENQRDVEQLFGT---NSETAKAAQPFFAQSP 77 (502) Q Consensus 1 Msip~s~iV~V~i~~~~~~~~~~~f~~~lil~~~~~~~~~~~~~r~~~y~s~~~v~~~fg~---~s~ey~aA~~~F~q~p 77 (502) |++ ++.=|-|.--=.+.++....-+...|+|...--|+. ++ ..-+|..|....||. .+.++.+...||-+-- T Consensus 1 ~~~-~~PgVyv~e~~~~~~i~~v~ts~~~fvG~~~~Gp~~---~p-~~i~s~~d~~~~fG~~~~~~~~~~~v~~~f~ngg 75 (663) T protein:vir:10 1 MAL-LSPGIEMKETSINSTVVRSATGRAAIVGKFAWGPAY---EV-RQVTNEVELVDMFGSPDNVTAPYFMSAMNFLQYG 75 (663) T ss_pred Cce-ecCceEEEEecCcccccccCccceeEEeeeccCCCC---cc-EEecCHHHHHHHhCCcCccchhHHHHHHHHHhCC Confidence 886 666666654334566667777788899887766553 33 345678999999996 6777788888886432 Q ss_pred CcceEEEEEeecccccceeee----e---e-------ccchhhhHH-----H---HHhhcc-cce-eEEEEec------- Q lcl|NC_013597. 78 RAKQLIVARWQKSASTIEATK----N---T-------LSGATLSDD-----L---ERFKSV-VNG-RFSLTIG------- 126 (502) Q Consensus 78 ~P~~l~igr~~~~~~~~~~~~----~---~-------~~~~~~~~~-----~---~~~~~~-~~g-~~~iti~------- 126 (502) +++||-|.........+.. . . ..+..+... . .....+ .++ ...+.+. T Consensus 76 --~~~~vvRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~g~~~~v~~~~~~~~~~~~~~~~~~n~~~~~v~~~~a~~~~~ 153 (663) T protein:vir:10 76 --NDLRLVRVIDMEKAKNASPLVNQVSVTITTEGQGYTVGDAITVKYNNATITEAGKVTAVDSDGKIKSLFVPTAEIIAK 153 (663) T ss_pred --CeEEEEEccCCcccccccccCCcceeeeeccccCccccccccccccccccccccceeeeccCCceEEEEecccccccc Confidence 2555555432111000000 0 0 000000000 0 000000 000 0000000 Q ss_pred -----------------------Ccccccccccccc---------ccchhhH-HHHHHh--------hhc-----ccccc Q lcl|NC_013597. 127 -----------------------GDVKKVDGLSFAR---------LADFNAV-ATKIQE--------KLT-----TLSVA 160 (502) Q Consensus 127 -----------------------g~~~~~~~i~~s~---------~ts~~~v-A~~i~a--------al~-----~a~~~ 160 (502) +........++.. .+...+. +..... .+. ..+.. T Consensus 154 ~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~v~~vv~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~a~~~G~~Gn~ 233 (663) T protein:vir:10 154 TRQLGTYPTLGDNWRIDVSGASGGSAAALALGNIVVDSGVTFGNSEDAPAVMTSPAVMEKYAKFGMPLVSAVYPGEIGST 233 (663) T ss_pred ccccceeeeccccceeEeeeccccccccccccceecccceeeEeeccccccccccchhhhcccccceeeeeecccccccc Confidence 0000000000000 0000000 000000 000 00000 Q ss_pred eeEEEeccc--------------------------------ceeeEeeecccccccceeeeee---ccccchhhhhhhhh Q lcl|NC_013597. 161 VSIAYDETG--------------------------------NRFIVSANVAGEDKKTEIDYAI---DEGGEGEYIGALLK 205 (502) Q Consensus 161 ~tv~~~~~~--------------------------------~~f~~~s~ttG~~~~v~~~~a~---~~~~t~t~~aa~l~ 205 (502) ..+...... ..+.......+.... ...... .....++....... T Consensus 234 i~v~i~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~s~~~~~~~~~~~~~~~~~~ 312 (663) T protein:vir:10 234 VEVEIVSKTAFNSGAQQTIYPFGGTRTSNARGVIQYGPMTDDQFAIIVRRDGIVVE-STVLSTRKGDRDVYGSNIFMDDY 312 (663) T ss_pred eeEEecccccccccccccccccccccccccceeeeeccccccceeEEEecCCccee-eeeeeecccccccchhhhhhhhh Confidence 000000000 000000000000000 000000 00000111111111 Q ss_pred hcccccceeee-------------ecccccc------ccCHHHHHHHHHhccCceeEEEEecC---CChhH----HHHHH Q lcl|NC_013597. 206 LENGQASRKVG-------------KNSVSLK------KETLGEALFNVAEVNNTWYGFTVAAQ---LTDSE----VEAAA 259 (502) Q Consensus 206 ~t~~~~~~~v~-------------v~~~~~~------~et~~~al~al~~~~~~w~~~~~~~~---~~~~~----~~a~a 259 (502) +..+....... ....+.+ ..+...++..+.+...-.-.++++.. .+.++ ..++. T Consensus 313 ~~~~~~~~~~~~~~~~~~~~~~~~~l~gg~d~~~~~~~~~~~~~~~~l~~~~~~~~~~~i~~~~~~~~~~~~~~v~~al~ 392 (663) T protein:vir:10 313 FRNGGSNFIFASSEGWPAGFTGIIQLGGGTSANADVGADELIKGWDLFSDREALHVNLMIAGACGSDGAEIASTVQKYVV 392 (663) T ss_pred hccCcceEEEEeecccCccccceeEcccccCCCccccchhhHHHHHhhhcccccceeEEEeccCCCCchhhHHHHHHHHH Confidence 11111110000 0011111 12233445545443221112233221 12222 33444 Q ss_pred HHHhhcCCEEEEEecCchhc-----ccchhHHHHHHHH-----------ccCCce-EEEecC-------Cc-----cchH Q lcl|NC_013597. 260 KYAQANTKLFGANVIRAEQI-----EWSADNIYKKLYD-----------AGLDHT-LAMFDK-------ND-----MYPV 310 (502) Q Consensus 260 ~w~~a~~~~~~~~~~~~~~~-----~~~~~~i~~~l~~-----------~~~~~t-~~~y~~-------~~-----~~~~ 310 (502) ..++....+|.+........ .....++...... .+++.. ..+|++ .+ -.+. T Consensus 393 ~~a~~~~~~~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~l~~P~~~~~d~~~~~~~~~p~s 472 (663) T protein:vir:10 393 SLADDRQDCVAIVNPPAELMVGIPTSTAVKNIVEWRNGMTGSGEVVDNNMNISSTYAFIIGNYKYQYDKYNDINRWVPLA 472 (663) T ss_pred HHHHhhCCEEEEEecCcccccccccccchHHHHHHHHhccccccchhhccccCcceEEEEccceEEecccCCceEEechh Confidence 55555444554432111110 0011121111111 112211 234443 11 1356 Q ss_pred HHHHHHHHhcCCCCCCceeeEeeeecCccc-----cCCCCHHHHHHHHhCCceEEEEEcC-c-eEEecCEeecC---e-- Q lcl|NC_013597. 311 SSALARLLSTNFAANNSTLTLKFKQQPTIT-----ADEITATEFAKAKRLGINVYTYFDD-V-AMIAEGTVIGG---K-- 378 (502) Q Consensus 311 aa~~g~~as~n~~~~~g~~T~~fk~~~Gv~-----~~~lt~t~~~~l~~~~~n~y~~~~~-~-~~~~~G~~~~G---~-- 378 (502) +.++|.++.+|..+.+ ......|.+.+|. ...+++.|++.|..+|+|.+..+-+ . ..+|..+++++ + T Consensus 473 ~~vAGl~Ar~D~~~g~-~~sPan~~~~~i~g~~~~~~~~~~~~~~~Ln~~gin~i~~~~~~~G~~~wG~rT~s~~~s~~~ 551 (663) T protein:vir:10 473 ADIAGLCAYTDQVSHP-WMSPAGYRRGQIRNCIKLAIEPKQSMRDTMYQVAINPVTGFAGGDGFVLFGDKMATQVPSPFD 551 (663) T ss_pred HHHHHHHHHhhccCCc-eEccCCceeccccccccceeccChhHHHHHhhCCceEEEEEeCCCcEEEEcccccCCCCcccc Confidence 7788999988865421 1112234333332 2357999999999999999998754 3 45888888876 2 Q ss_pred ehhHHHHHHHHHHHHHHHHHHHHHhcCCCCccCHHHHHHHHHHHHHHHHHHHHcCccccccccCccccccccccccccce Q lcl|NC_013597. 379 FADEIVILDWFVDAVQKEVFARLYKSPTKIPLTDKGQAILIAAVEKVCLEGINNGAFAPGKWTGAGFGNLSTGDYLDKGF 458 (502) Q Consensus 379 ~iD~~~~~dwl~~~iq~~l~~~l~~~~~kiPyt~~G~~~l~~~v~~vl~~a~~~G~I~~g~~~~~~~g~~~~~~~~~~gy 458 (502) ||-+.+-.+|+.+.|+..+...++. |.++.-...|+..|+.-|++.+++|.|. || T Consensus 552 ~i~vrR~~~~i~~si~~~~~~~v~e-----~n~~~l~~~i~~~i~~~L~~l~~~gal~--------------------g~ 606 (663) T protein:vir:10 552 RINVRRLFNMLKKNIGDTSKYELFE-----NNDAFTRQSFRMETSQYLDGIRSLGGCY--------------------DF 606 (663) T ss_pred eEehhhHHHHHHHHHHHHHHHhccC-----CCCHHHHHHHHHHHHHHHHHHHhcCcee--------------------ee Confidence 5778889999999999998765432 6788889999999999999999999985 48 Q ss_pred EEEcCchhcCCHHHHhhcccCceEEEEEECceEEEEEEEEEEeC Q lcl|NC_013597. 459 YVWAAPMDTLSDSDRQARRATPIQTAVKLAGAIHSSDVIVNYNR 502 (502) Q Consensus 459 ~v~~~~~~~~s~~dra~R~~~~i~~~~~~aGaIh~v~i~~~v~~ 502 (502) .|.++ .+..|++|+.+.+. -+.+.++....+++|.+++.-.+ T Consensus 607 ~v~~d-~~~nt~~~i~~G~~-~~~i~~~p~~pae~i~~~~~~~~ 648 (663) T protein:vir:10 607 RVVCD-TTNNTPNVIDRNEF-VGTIYVKPPRSINYITLNMVATS 648 (663) T ss_pred EEEEc-CCCCCHHHhhCCeE-EEEEEEEecCCcceEEEEEEEee Confidence 99998 67889999999998 59999999999999999877666 No 54 >protein:vir:101804 Length: 663 # NCBI annotation: gp18 # Family: family:all:661 # MgeID: mge:1580 # MgeName: 31 # Cross-refs: genbank:acc:YP_238881;genbank:gi:66391956;genbank:GeneID:3416631 Probab=97.92 E-value=1.1e-05 Score=47.80 Aligned_cols=465 Identities=12% Similarity=0.027 Sum_probs=219.7 Q ss_pred CCcCcCceeEEeecccccccccccccceEEEecccccccccCccceEEecCHHHHHhhcC---CCcHHHHHHHHHhcCCC Q lcl|NC_013597. 1 MALSISHIVNVQLNTVPKSAARKSFGIVALFTPEAGQAFADEKTRYVYVENQRDVEQLFG---TNSETAKAAQPFFAQSP 77 (502) Q Consensus 1 Msip~s~iV~V~i~~~~~~~~~~~f~~~lil~~~~~~~~~~~~~r~~~y~s~~~v~~~fg---~~s~ey~aA~~~F~q~p 77 (502) |++ ++.=|-|.--=.+.++....-+...|+|...--|+. +. +.-+|..|....|| ..+.++.+...||-+-- T Consensus 1 ~~~-~~Pgvyv~e~~~~~~i~~~~t~~~~~vG~~~~Gp~~---~p-~~v~~~~~~~~~fg~~~~~~~~~~~~~~~f~ngg 75 (663) T protein:vir:10 1 MAL-LSPGIEMKETSINSTVVRSATGRAAIVGKFAWGPAY---EV-RQVTNEVELVDMFGSPDNVTAPYFMSAMNFLQYG 75 (663) T ss_pred Cce-ecCceEEEEecCCccccccCcccceeEeecccCCCC---cc-EEecCHHHHHHhcCCcCCcchhHHHHHHHHHhCC Confidence 886 677666653334667777778888999988766653 33 34466899999998 45567778888886533 Q ss_pred CcceEEEEEeeccccccee---------eeee-----ccchhhhHHHH--------Hhhcc-cce-eEEEEecCcccccc Q lcl|NC_013597. 78 RAKQLIVARWQKSASTIEA---------TKNT-----LSGATLSDDLE--------RFKSV-VNG-RFSLTIGGDVKKVD 133 (502) Q Consensus 78 ~P~~l~igr~~~~~~~~~~---------~~~~-----~~~~~~~~~~~--------~~~~~-~~g-~~~iti~g~~~~~~ 133 (502) . ++||-|.......... +... ..|..+..... ....+ .++ ...+.+. +..... T Consensus 76 ~--~~~vvRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~g~~i~v~~~~~~~~~~~~~~~~~~~~~~~~v~~~-ta~~~~ 152 (663) T protein:vir:10 76 N--DLRLVRVIDMEKAKNASPLVNQVSVTITTEGQGYTVGDAITVKYNNATITEAGKVTAVDSDGKIKSLFVP-TAEIIA 152 (663) T ss_pred C--eEEEEEccCCccccccccccccceeEEeecccccccccccccccccccccccccceeeecccceEEEeec-cccccc Confidence 2 5566665321110000 0000 00000000000 00000 000 0001000 000000 Q ss_pred ---------------ccccccccchhhHHHHHHhhhcccccce-------------------------eEEEe---cccc Q lcl|NC_013597. 134 ---------------GLSFARLADFNAVATKIQEKLTTLSVAV-------------------------SIAYD---ETGN 170 (502) Q Consensus 134 ---------------~i~~s~~ts~~~vA~~i~aal~~a~~~~-------------------------tv~~~---~~~~ 170 (502) .+.++........+..+...+...+... .++-. ..++ T Consensus 153 ~~~~v~~~~~~~~~~~~~~s~~s~~~~~a~~v~~v~~d~~~~v~~~~~a~~~~t~~~~~~~~~~~~~~~i~A~~~G~~Gn 232 (663) T protein:vir:10 153 KTRQLGTYPTLGDNWRIDVSGASGGSAAALALGNIVVDSGVTFGNSEDAPAVMTSPAVMEKYAKFGMPLISAVYPGEIGS 232 (663) T ss_pred cccccccceeeccceeeEeeeccCccccccccceeccccceEEeeccccccccccccccccccccccceEEeccCCcccc Confidence 0000000000000000000000000000 00000 0000 Q ss_pred ee--eEeeeccccc------------------------------ccceeeee--------e--c---cccchhhhhhhhh Q lcl|NC_013597. 171 RF--IVSANVAGED------------------------------KKTEIDYA--------I--D---EGGEGEYIGALLK 205 (502) Q Consensus 171 ~f--~~~s~ttG~~------------------------------~~v~~~~a--------~--~---~~~t~t~~aa~l~ 205 (502) .. .+........ ..+..... . . ....++....... T Consensus 233 ~i~V~i~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~~~~~~~~~~~~ 312 (663) T protein:vir:10 233 TVEVEIVSKTAFNSGAQQTIYPFGGTRTSNARGVIQYGPMTDDQFAIIVRRDGIVVESTVLSTRKGDRDVYGSNIFMDDY 312 (663) T ss_pred eeeeeeccccccccccccceecccccccccccceeecccccccceeeEeecCCcceeeecccccccccccccchhhhhhh Confidence 00 0000000000 00000000 0 0 0000000000000 Q ss_pred hcccccce-------------eeeecccccc------ccCHHHHHHHHHhccCceeEEEEecC---CChh----HHHHHH Q lcl|NC_013597. 206 LENGQASR-------------KVGKNSVSLK------KETLGEALFNVAEVNNTWYGFTVAAQ---LTDS----EVEAAA 259 (502) Q Consensus 206 ~t~~~~~~-------------~v~v~~~~~~------~et~~~al~al~~~~~~w~~~~~~~~---~~~~----~~~a~a 259 (502) +.++.... .+.....+.+ ..+...++..+.+...-.-.++++.. .+.+ -..++. T Consensus 313 ~~~~~~~~~~~~~~~~~~~~~~~~~l~gg~d~~~~~~~~~~~~~~~~l~~~~~~~~~~~i~~~~~~~~~~~~~~v~~~l~ 392 (663) T protein:vir:10 313 FRNGGSNFIFASSEGWPAGFTGIIQLGGGTSANADVGADELIKGWDLFSDREALHVNLMIAGACGSDGAEIASTVQKYVV 392 (663) T ss_pred hcCCcceEEEEeecccCccccceeEeccccCCccccchhhhHHHHHhhhcccccceeEEEeccCCCCchhhHHHHHHHHH Confidence 11110000 0000011111 11223344444443222222333321 1111 233455 Q ss_pred HHHhhcCCEEEEEecCchhc-----ccchhHHHHHHHH-----------ccC--CceEEEecC-------Cc-----cch Q lcl|NC_013597. 260 KYAQANTKLFGANVIRAEQI-----EWSADNIYKKLYD-----------AGL--DHTLAMFDK-------ND-----MYP 309 (502) Q Consensus 260 ~w~~a~~~~~~~~~~~~~~~-----~~~~~~i~~~l~~-----------~~~--~~t~~~y~~-------~~-----~~~ 309 (502) ..++....+|.+........ .....++...... .++ .+.. +|++ .+ -.+ T Consensus 393 ~~a~~~~~~~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~-~~~p~~~~~d~~~~~~~~~p~ 471 (663) T protein:vir:10 393 SLADDRQDCVAIVNPPAELMVGIPTSTAVKNIVEWRNGMTGSGEVVDNNMNISSTYAF-IIGNYKYQYDKYNDINRWVPL 471 (663) T ss_pred HHHHhhCCEEEEEecCcccccccccccchHHHHHHHHhccccccchhhhcccCccceE-EEcCceEEecccCCceEEech Confidence 55555554554432111100 0111222111111 122 2333 3332 11 134 Q ss_pred HHHHHHHHHhcCCCCCCceeeEeeeecC---ccc--cCCCCHHHHHHHHhCCceEEEEEcC-c-eEEecCEeecC---e- Q lcl|NC_013597. 310 VSSALARLLSTNFAANNSTLTLKFKQQP---TIT--ADEITATEFAKAKRLGINVYTYFDD-V-AMIAEGTVIGG---K- 378 (502) Q Consensus 310 ~aa~~g~~as~n~~~~~g~~T~~fk~~~---Gv~--~~~lt~t~~~~l~~~~~n~y~~~~~-~-~~~~~G~~~~G---~- 378 (502) .+.++|.++.+|..+.+ -.....|.+. |+. ...+++.|.+.|..+|+|....+.+ . -.+|..+++++ + T Consensus 472 sg~vAGl~Ar~D~~~g~-~~sPan~~~~~i~g~~~~~~~~~~~e~~~Ln~~gin~i~~~~~~~G~~~wG~rT~~~~~s~~ 550 (663) T protein:vir:10 472 AADIAGLCAYTDQVSHP-WMSPAGYRRGQIRNCIKLAIEPKQSMRDTMYQVAINPVTGFAGGDGFVLFGDKMATQVPSPF 550 (663) T ss_pred hHHHHHHHHHhhccCCc-eEccCCceeccccccccceeecChhHHHHHhhCCceEEEEEeCCCcEEEEcccccCCCCccc Confidence 56788999998865532 1122334433 332 2357999999999999999988754 3 45888888776 2 Q ss_pred -ehhHHHHHHHHHHHHHHHHHHHHHhcCCCCccCHHHHHHHHHHHHHHHHHHHHcCccccccccCccccccccccccccc Q lcl|NC_013597. 379 -FADEIVILDWFVDAVQKEVFARLYKSPTKIPLTDKGQAILIAAVEKVCLEGINNGAFAPGKWTGAGFGNLSTGDYLDKG 457 (502) Q Consensus 379 -~iD~~~~~dwl~~~iq~~l~~~l~~~~~kiPyt~~G~~~l~~~v~~vl~~a~~~G~I~~g~~~~~~~g~~~~~~~~~~g 457 (502) ||-+.+-.+|+...|++.+...++. |.++.=...|+..|+.-|++.+++|.|. | T Consensus 551 ~~i~vrR~~~~i~~si~~~~~~~v~e-----pn~~~l~~~i~~~i~~~L~~l~~~gal~--------------------g 605 (663) T protein:vir:10 551 DRINVRRLFNMLKKNIGDTSKYELFE-----NNDAFTRQSFRMETSQYLDGIRSLGGCY--------------------D 605 (663) T ss_pred ceEehhhHHHHHHHHHHHHHHHhccC-----CCCHHHHHHHHHHHHHHHHHHHhCCcee--------------------e Confidence 5778889999999999998765533 6788889999999999999999999985 4 Q ss_pred eEEEcCchhcCCHHHHhhcccCceEEEEEECceEEEEEEEEEEeC Q lcl|NC_013597. 458 FYVWAAPMDTLSDSDRQARRATPIQTAVKLAGAIHSSDVIVNYNR 502 (502) Q Consensus 458 y~v~~~~~~~~s~~dra~R~~~~i~~~~~~aGaIh~v~i~~~v~~ 502 (502) |.|.++ .+..|++|+.+.+. -+.+.+.....+++|.+++.-.+ T Consensus 606 ~~v~~d-~~~nt~~~i~~G~~-~~~i~~~p~~pae~i~~~~~~~~ 648 (663) T protein:vir:10 606 FRVVCD-TTNNTPNVIDRNEF-VGTIYVKPPRSINYITLNMVATS 648 (663) T ss_pred eEEEEc-CCCCCHHHhhCCeE-EEEEEEEecCCcceEEEEEEEee Confidence 899998 67889999999998 69999999999999999877665 No 55 >protein:vir:6594 Length: 666 # NCBI annotation: tail sheath protein # Family: family:all:661 # MgeID: mge:139 # MgeName: RB49 # Cross-refs: genbank:acc:NP_891725;genbank:gi:33620670;genbank:GeneID:1725344 Probab=97.75 E-value=2.2e-05 Score=46.08 Aligned_cols=465 Identities=10% Similarity=0.014 Sum_probs=223.0 Q ss_pred CCcCcCceeEEeecccccccccccccceEEEecccccccccCccceEEecCHHHHHhhcC---CCcHHHHHHHHHhcCCC Q lcl|NC_013597. 1 MALSISHIVNVQLNTVPKSAARKSFGIVALFTPEAGQAFADEKTRYVYVENQRDVEQLFG---TNSETAKAAQPFFAQSP 77 (502) Q Consensus 1 Msip~s~iV~V~i~~~~~~~~~~~f~~~lil~~~~~~~~~~~~~r~~~y~s~~~v~~~fg---~~s~ey~aA~~~F~q~p 77 (502) |.+ ++.=|-|.--=.+.++....-+...|+|...--|+. + -+.-+|..|....|| ..+.++.+...||-+-- T Consensus 1 ~~~-~~PgVyv~e~~~~~~i~~v~ts~~~fvG~~~~Gp~~---~-p~~v~s~~~~~~~fG~~~~~~~~~~~~~~~f~ngg 75 (666) T protein:vir:65 1 MTL-LSPGFETKETTLSTTIVQSETGRAALVGKFQWGPAF---Q-IIQVTNEVELVNKFGQPDNNTADYFMSGANFLQYG 75 (666) T ss_pred Cce-ecCceEEEEecCcccccccCcccceEEecccCCCCc---c-CEEecCHHHHHHHcCCccccchhHHHHHHHHHhcC Confidence 885 666665554434556677777788999987766553 3 344566899999999 56677888888885322 Q ss_pred CcceEEEEEeeccccc--ce---------e-eee---------eccc--hhhhH--HHHHh--hc--------------- Q lcl|NC_013597. 78 RAKQLIVARWQKSAST--IE---------A-TKN---------TLSG--ATLSD--DLERF--KS--------------- 115 (502) Q Consensus 78 ~P~~l~igr~~~~~~~--~~---------~-~~~---------~~~~--~~~~~--~~~~~--~~--------------- 115 (502) ++++|-|....... +. . ..+ .+.. ..+.. ....+ .+ T Consensus 76 --~~~~vvrv~~~~~~~~~~~~~~~~~~~~~~~g~~~~~g~~~~V~~~~~~~~~~~~~~~~~~~~~~~g~~~~t~~~~~~ 153 (666) T protein:vir:65 76 --NDLRVVRVLNKEKAKNATALAGNVEFEITNEGSNYEVGDTIKIKHNRQDIETAGKVTKVDGDGKVKGVFIPTGKIIAH 153 (666) T ss_pred --ceEEEEEccCcccccccccccCceeeeEeeccccccccceEEEEecccccccccccccccccccccccccccceeecc Confidence 13444343211000 00 0 000 0000 00000 00000 00 Q ss_pred -ccceeEEEEecCcccccccc------ccccccchhh----------H----HH-HHHh-----------hhc--ccccc Q lcl|NC_013597. 116 -VVNGRFSLTIGGDVKKVDGL------SFARLADFNA----------V----AT-KIQE-----------KLT--TLSVA 160 (502) Q Consensus 116 -~~~g~~~iti~g~~~~~~~i------~~s~~ts~~~----------v----A~-~i~a-----------al~--~a~~~ 160 (502) ...+.....+.+........ .++......+ . .. .... +.. ..+.. T Consensus 154 ~~~~g~~~~l~~~~~~~~~~~~~~~~~a~sv~~~~~~g~~~~~~~~~a~~~~~~~~~~~~~~~~~~~a~~A~~~g~~g~~ 233 (666) T protein:vir:65 154 AKAIGVYPELDGGWTAEFTSSSGNGSAALSVTKIVTDSGLLLTDLETSRANITNQTFLTKLKKYDMPAVSAIYAGEIGNS 233 (666) T ss_pred ccccCcceeEeeccceeecccCcccccceeeeecccccceeeeeecccccccccccccccccccccceeeeeeccccccc Confidence 00000000000000000000 0000000000 0 00 0000 000 00000 Q ss_pred eeEEEecc--------------------------------cceeeEeeecccccccceeeeeecc---ccch--hhhhhh Q lcl|NC_013597. 161 VSIAYDET--------------------------------GNRFIVSANVAGEDKKTEIDYAIDE---GGEG--EYIGAL 203 (502) Q Consensus 161 ~tv~~~~~--------------------------------~~~f~~~s~ttG~~~~v~~~~a~~~---~~t~--t~~aa~ 203 (502) ..+..... ...|.+.....|... ..+...... ...+ .+.... T Consensus 234 i~v~i~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~-e~~~~~~~~~~~~~~~~~~~~~~~ 312 (666) T protein:vir:65 234 LEVEILARSAFKNTAPDLTMYPYGGERTAARNLIPYAPQNDNQYAFIVRRDGVVV-ESYVLSTLKGDKDVYGNSIYMDDF 312 (666) T ss_pred eeEEeecccccccccccccccccccccccceeeecccccccccceeeeecCCccc-ceeecccCcccccccchhhhhhhh Confidence 00000000 000111111111000 000000000 0000 010000 Q ss_pred hhhcc-----------cccceeeeecccccc--------------ccCHHHHHHHHHhccCceeEEEEecC------CCh Q lcl|NC_013597. 204 LKLEN-----------GQASRKVGKNSVSLK--------------KETLGEALFNVAEVNNTWYGFTVAAQ------LTD 252 (502) Q Consensus 204 l~~t~-----------~~~~~~v~v~~~~~~--------------~et~~~al~al~~~~~~w~~~~~~~~------~~~ 252 (502) +.-.. ......+.....+.+ ..+....+..+.+.......+++... ... T Consensus 313 ~~~~~~~~v~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~l~~p~~~~~~~~~~ 392 (666) T protein:vir:65 313 FARGSSQYIYATAQGWVDGFSGIISLAGGVSANEATTGGVGADPFIGAMMQGWDLFAERESIHVNLLIAGACAGEGDAFS 392 (666) T ss_pred hcccccceeeeecccccccccceEEccCCCCcCcccccccccccccccHHHHHHHHhhhhhccCCceeecCcCCccchhH Confidence 00000 000000000001111 11234455555554332333343321 123 Q ss_pred hHHHHHHHHHhhcCCEEEEEecCch----hc-ccchhHHHHHHHHc--------cC--CceEEEecC-------Cc---- Q lcl|NC_013597. 253 SEVEAAAKYAQANTKLFGANVIRAE----QI-EWSADNIYKKLYDA--------GL--DHTLAMFDK-------ND---- 306 (502) Q Consensus 253 ~~~~a~a~w~~a~~~~~~~~~~~~~----~~-~~~~~~i~~~l~~~--------~~--~~t~~~y~~-------~~---- 306 (502) .-..++...++..+.+|.+...... .. .....++....... ++ .|. .+|++ .+ T Consensus 393 ~v~~~l~~~~~~~~~~~a~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~-~~~~p~~~~~d~~~~~~~ 471 (666) T protein:vir:65 393 TVQKHAVSIGDERQDCLVMVSPPRSTVVNIPVTTAIDNLIAWREGSGNYNENNMNINTTYA-VIDGNYKYQYDKYNDVNR 471 (666) T ss_pred HHHHHHHHHHhhccceEEEeccccceeeecCCCCCHHHHHHHHHhcccccccccccCcceE-EEEcCceEEecccCCcee Confidence 4455666666666556544321111 00 11122222222211 12 232 23432 11 Q ss_pred -cchHHHHHHHHHhcCCCCCCceeeEeeeecCccc-----cCCCCHHHHHHHHhCCceEEEEEcCc-eEEecCEeecCe- Q lcl|NC_013597. 307 -MYPVSSALARLLSTNFAANNSTLTLKFKQQPTIT-----ADEITATEFAKAKRLGINVYTYFDDV-AMIAEGTVIGGK- 378 (502) Q Consensus 307 -~~~~aa~~g~~as~n~~~~~g~~T~~fk~~~Gv~-----~~~lt~t~~~~l~~~~~n~y~~~~~~-~~~~~G~~~~G~- 378 (502) -.+.+.+.|.++.+|..+.+ -.....|.+.||. ...+++.|++.|..+|+|.+.++.+. ..+|.++++++. T Consensus 472 ~~p~sg~vAGl~Ar~D~~~g~-~~span~~~~~i~g~~~~~~~~~~~~~~~Ln~~gIn~i~~~~~~G~~~wG~rT~~~~~ 550 (666) T protein:vir:65 472 WVPLAADIAGLCARTDAVSQP-WMSPAGYNRGQIMNVVKLAIEPRKAHRDRLYQAAINPVIGAGGEGFILMGDKTATTVP 550 (666) T ss_pred EechHHHHHHHHHHHhccCCc-EEccCCeecceeeccccceeecChhHHHhhhhCCceEEEEeCCCeEEEEecccCCCCC Confidence 12457778888888764421 1122344444432 12468899999999999999999775 578899998872 Q ss_pred ----ehhHHHHHHHHHHHHHHHHHHHHHhcCCCCccCHHHHHHHHHHHHHHHHHHHHcCccccccccCcccccccccccc Q lcl|NC_013597. 379 ----FADEIVILDWFVDAVQKEVFARLYKSPTKIPLTDKGQAILIAAVEKVCLEGINNGAFAPGKWTGAGFGNLSTGDYL 454 (502) Q Consensus 379 ----~iD~~~~~dwl~~~iq~~l~~~l~~~~~kiPyt~~G~~~l~~~v~~vl~~a~~~G~I~~g~~~~~~~g~~~~~~~~ 454 (502) ||-+.+-.+|++..|+......++. |.++.=...|+..|+.-|++.+++|.|. T Consensus 551 s~~~~i~vrR~~~~i~~si~~~~~~~v~e-----pn~~~l~~~i~~~i~~~L~~l~~~gal~------------------ 607 (666) T protein:vir:65 551 SPFDRINVRRLFNMLKKNIGDSSKYKLFE-----NNDNFTRASFRMEVSQYLSTIRSLGGIY------------------ 607 (666) T ss_pred cccceEehhhHHHHHHHHHHHHHHHhccC-----CCCHHHHHHHHHHHHHHHHHHHhCCcee------------------ Confidence 6788889999999999998766543 6788889999999999999999999985 Q ss_pred ccceEEEcCchhcCCHHHHhhcccCceEEEEEECceEEEEEEEEEEeC Q lcl|NC_013597. 455 DKGFYVWAAPMDTLSDSDRQARRATPIQTAVKLAGAIHSSDVIVNYNR 502 (502) Q Consensus 455 ~~gy~v~~~~~~~~s~~dra~R~~~~i~~~~~~aGaIh~v~i~~~v~~ 502 (502) ||.|.++ .++.|++|+.+.+. -+.+.++....+++|.+++.-.+ T Consensus 608 --g~~V~~d-~~~nt~~~i~~G~~-~~~i~~~p~~pae~i~~~~~~~~ 651 (666) T protein:vir:65 608 --DFRVQCD-TTNNTPDVIDRNEF-VASMFIKPAKSINYIMLNFTAVA 651 (666) T ss_pred --eeEEEEc-CCCCCHHHhhCCeE-EEEEEEEecCCcceEEEEEEEee Confidence 4899998 67889999999988 69999999999999999987666 No 56 >protein:vir:6894 Length: 660 # NCBI annotation: gp18 tail sheath protein # Family: family:all:661 # MgeID: mge:140 # MgeName: RB69 # Cross-refs: genbank:acc:NP_861870;genbank:gi:32453661;genbank:GeneID:1494296 Probab=97.62 E-value=3.7e-05 Score=44.89 Aligned_cols=465 Identities=12% Similarity=0.013 Sum_probs=219.7 Q ss_pred CCcCcCceeEEeecccccccccccccceEEEecccccccccCccceEEecCHHHHHhhcC---CCcHHHHHHHHHhcCCC Q lcl|NC_013597. 1 MALSISHIVNVQLNTVPKSAARKSFGIVALFTPEAGQAFADEKTRYVYVENQRDVEQLFG---TNSETAKAAQPFFAQSP 77 (502) Q Consensus 1 Msip~s~iV~V~i~~~~~~~~~~~f~~~lil~~~~~~~~~~~~~r~~~y~s~~~v~~~fg---~~s~ey~aA~~~F~q~p 77 (502) |++ ++.=|-|.--=.+.++....-+...|+|...--|+. +. ..-+|..|....|| ..+.++.+...+|-+-- T Consensus 1 ~~~-~~PgVyv~e~~~~~~i~~v~ts~~~fvG~~~~Gp~~---~p-~~i~s~~~~~~~fG~~~~~~~~~~~~~~~f~~~g 75 (660) T protein:vir:68 1 MAL-LSPGVELKETTVQSTVVNNSTGTAALAGKFQWGPAF---QI-KQITDEVALVDMFGTPNTDTADYFMSAMNFLQYG 75 (660) T ss_pred Ccc-ccCceEEEEecCCcccccCCCcceeEEecccCCCCc---cC-EEecCHHHHHHhcCCccCccchhHHHHHHHHhCC Confidence 986 666655543323566777777889999987766553 33 44556899999999 56677778888885422 Q ss_pred CcceEEEEEeecccccce---------------e----ee---------------eecc-----ch---------hh--- Q lcl|NC_013597. 78 RAKQLIVARWQKSASTIE---------------A----TK---------------NTLS-----GA---------TL--- 106 (502) Q Consensus 78 ~P~~l~igr~~~~~~~~~---------------~----~~---------------~~~~-----~~---------~~--- 106 (502) .++||-|......... . .+ +... +. .. T Consensus 76 --~~~~vvRv~~~~~~~~~~~~~~~~~~t~~~~g~~~~~g~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~ta~~~~~ 153 (660) T protein:vir:68 76 --NDLRVVRAVDRDTAKNSSPVAGNINFTISSAGTNYRVGDKVVVKYSTDIIEPDGEVTSVDSDGKILNIFIPSGKIIAK 153 (660) T ss_pred --CeEEEEEecccccccccccccccceeeeeccCcceeeeeeeeeecccccccccccceeeeecCceeeeeecccccccc Confidence 1334444321110000 0 00 0000 00 00 Q ss_pred hHHHHHhhcc-cceeE-----------EEEecCccc-----------------cccccccccccchhhH-H--------- Q lcl|NC_013597. 107 SDDLERFKSV-VNGRF-----------SLTIGGDVK-----------------KVDGLSFARLADFNAV-A--------- 147 (502) Q Consensus 107 ~~~~~~~~~~-~~g~~-----------~iti~g~~~-----------------~~~~i~~s~~ts~~~v-A--------- 147 (502) .......... .+... .+.+.+... ..+............+ | T Consensus 154 a~~~~~~~~~~~~~~~~v~~~~~~~~~~~~v~~~~~d~~~~~~~~~ta~~~~~~~~~~~~~~~~~~~~~~A~~~g~~G~~ 233 (660) T protein:vir:68 154 AKEIGEYPELGSNWTAEMSGSSSGLSAVITIDSVVMDSGILLTEVETSEEAITSLTFQESIKKYGVPGVVALYPGELGDQ 233 (660) T ss_pred ceeeccccccccceeEEeecccccceeeeeeccccccccceeeeeccccccccccceeeeecccCccccccccccccccc Confidence 0000000000 00000 011111000 0000000000000000 0 Q ss_pred ------------HHHHhhhc--------ccccceeEEEec-ccceeeEeeecccccccceeeeeecc---ccchhhhhhh Q lcl|NC_013597. 148 ------------TKIQEKLT--------TLSVAVSIAYDE-TGNRFIVSANVAGEDKKTEIDYAIDE---GGEGEYIGAL 203 (502) Q Consensus 148 ------------~~i~aal~--------~a~~~~tv~~~~-~~~~f~~~s~ttG~~~~v~~~~a~~~---~~t~t~~aa~ 203 (502) ......+. .......+.... ....|.+.....+... ..+...... ...+...... T Consensus 234 i~v~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~ 312 (660) T protein:vir:68 234 LEIEIVSKADYDKGASAQLKIYPDGGTRYSTAKAIFGYGPQTDDQYAIIVRRNDSVV-QSVVLSTKRGERDIYGSNIFID 312 (660) T ss_pred eEEEEeccccccccccccceeeecccccccceeeEeecccccccceeeeeecCCcce-eeeeeecccccccccccceeee Confidence 00000000 000000000000 0011111111111000 000000000 0000000000 Q ss_pred hhhcccccceeeee--------------ccccc------cccCHHHHHHHHHhcc-CceeEEEEecC--CChhH----HH Q lcl|NC_013597. 204 LKLENGQASRKVGK--------------NSVSL------KKETLGEALFNVAEVN-NTWYGFTVAAQ--LTDSE----VE 256 (502) Q Consensus 204 l~~t~~~~~~~v~v--------------~~~~~------~~et~~~al~al~~~~-~~w~~~~~~~~--~~~~~----~~ 256 (502) ..+... ....+.. ...+. ...+...++..+.+.. .....++.... .+.++ +. T Consensus 313 ~~~~~~-~~~~v~~~~~~~~~~~~~~~~~~gg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~v~~ 391 (660) T protein:vir:68 313 DFFAKG-ASNYIFATAQGWPKGFSGVIKLNGGLSSNETVEAGDLMEAWDLFADRESVNAQLFIAGSCAGESLEVASTVQK 391 (660) T ss_pred hhhccC-cccEEEEeecCCCccccceeeeccccccccccccchhhhHHHHhhhhhccccceeeccccCCCchHHHHHHHH Confidence 000000 0000000 00111 1112333444433322 12222222211 12233 34 Q ss_pred HHHHHHhhcCCEEEEEecCc----hhcc-cchhHHHHHHHHc--------cCCc-eEEEecC-------Cc-----cchH Q lcl|NC_013597. 257 AAAKYAQANTKLFGANVIRA----EQIE-WSADNIYKKLYDA--------GLDH-TLAMFDK-------ND-----MYPV 310 (502) Q Consensus 257 a~a~w~~a~~~~~~~~~~~~----~~~~-~~~~~i~~~l~~~--------~~~~-t~~~y~~-------~~-----~~~~ 310 (502) +|...++..+.+|.+..... +... ...+++....... +++. =+.+|++ .. -.+. T Consensus 392 ~l~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s 471 (660) T protein:vir:68 392 HVVAIGDSRQDCLVLCSPPRAAVVGIPVNRAVDNLVDWRTASGTYTDNNFNISSTYAAIDGNYKYQYDKYNDVNRWVPLA 471 (660) T ss_pred HHHHHHHhhCCeEEEEcccceeEecCCCCCCHHHHHHHHhhcccccccccccCcceEEEEcCceEEecccCCceEEechh Confidence 45555665555554432111 1111 1122333222221 1221 1334443 11 1356 Q ss_pred HHHHHHHHhcCCCCCCceeeEeeeecCcccc-----CCCCHHHHHHHHhCCceEEEEEcCc-eEEecCEeecCe-----e Q lcl|NC_013597. 311 SSALARLLSTNFAANNSTLTLKFKQQPTITA-----DEITATEFAKAKRLGINVYTYFDDV-AMIAEGTVIGGK-----F 379 (502) Q Consensus 311 aa~~g~~as~n~~~~~g~~T~~fk~~~Gv~~-----~~lt~t~~~~l~~~~~n~y~~~~~~-~~~~~G~~~~G~-----~ 379 (502) +.++|.++.+|-++.+ -.....|.+.||.- ..++..|++.|-.+++|....+.+. ..+|.+++++++ | T Consensus 472 g~~AGl~Ar~d~~~g~-~~span~~~~~i~g~~~~~~~~~~~~~~~Ln~~gIn~i~~~~~~G~~~wG~rT~~~~~s~~~~ 550 (660) T protein:vir:68 472 ADIAGLCARTDNISQP-WMSPAGYNRGQILNVIKLAIETRQAQRDRLYQEAINPVTGTGGDGYVLYGDKTATSVPSPFDR 550 (660) T ss_pred HHHHHHHHHHhccCCc-EEccCCeeeceeeccceeeecCChhHHHHHhhCCceEEEEecCCeEEEEcceecCCCCcccce Confidence 7788999988754421 11222444444421 1468999999999999999999876 578899998872 6 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHhcCCCCccCHHHHHHHHHHHHHHHHHHHHcCccccccccCccccccccccccccceE Q lcl|NC_013597. 380 ADEIVILDWFVDAVQKEVFARLYKSPTKIPLTDKGQAILIAAVEKVCLEGINNGAFAPGKWTGAGFGNLSTGDYLDKGFY 459 (502) Q Consensus 380 iD~~~~~dwl~~~iq~~l~~~l~~~~~kiPyt~~G~~~l~~~v~~vl~~a~~~G~I~~g~~~~~~~g~~~~~~~~~~gy~ 459 (502) |-+.+-.+|+...|+..+...++. |.++.=...|+..|+.-|++.+++|.|. ||. T Consensus 551 i~vrR~~~~i~~si~~~~~~~v~e-----pn~~~~~~~i~~~i~~~L~~l~~~gal~--------------------gf~ 605 (660) T protein:vir:68 551 INVRRLFNMVKTNIGSASKYRLFE-----LNNAFTRSSFRTETSQYLQGIKALGGVY--------------------NFK 605 (660) T ss_pred EehhhHHHHHHHHHHHHHHHhccC-----CCCHHHHHHHHHHHHHHHHHHHhcCcee--------------------eeE Confidence 778889999999999998766543 5677778999999999999999999985 488 Q ss_pred EEcCchhcCCHHHHhhcccCceEEEEEECceEEEEEEEEEEeC Q lcl|NC_013597. 460 VWAAPMDTLSDSDRQARRATPIQTAVKLAGAIHSSDVIVNYNR 502 (502) Q Consensus 460 v~~~~~~~~s~~dra~R~~~~i~~~~~~aGaIh~v~i~~~v~~ 502 (502) |.+ ..+..+++|+.+.+. -+.+.+.....+++|.+++.=.| T Consensus 606 V~~-d~~~nt~~~i~~G~~-~~~i~~~p~~pae~i~l~~~~~~ 646 (660) T protein:vir:68 606 VVC-DTTNNTPAVIDRNEF-VATFYLQPARSINYITLNFVATA 646 (660) T ss_pred EEE-ecCCCCHHHhhCCeE-EEEEEEEecCCcceEEEEEEEee Confidence 988 477899999999988 59999999999999999887666 No 57 >protein:vir:100539 Length: 663 # NCBI annotation: gp18 tail sheath monomer # Family: family:all:661 # MgeID: mge:1488 # MgeName: 25 # Cross-refs: genbank:acc:YP_656380;genbank:gi:109290131;genbank:GeneID:4156517 Probab=97.61 E-value=3.7e-05 Score=44.86 Aligned_cols=465 Identities=11% Similarity=0.008 Sum_probs=217.6 Q ss_pred CCcCcCceeEEeecccccccccccccceEEEecccccccccCccceEEecCHHHHHhhcC---CCcHHHHHHHHHhcCCC Q lcl|NC_013597. 1 MALSISHIVNVQLNTVPKSAARKSFGIVALFTPEAGQAFADEKTRYVYVENQRDVEQLFG---TNSETAKAAQPFFAQSP 77 (502) Q Consensus 1 Msip~s~iV~V~i~~~~~~~~~~~f~~~lil~~~~~~~~~~~~~r~~~y~s~~~v~~~fg---~~s~ey~aA~~~F~q~p 77 (502) |.| ++.=|-|.---.+..+....-+...|+|...--|+. + -+.-+|..+....|| ..+.++.+.+.||-+-- T Consensus 1 ~~~-~~Pgvyv~e~~~~~~~~~v~t~~~~fvG~~~~gp~~---~-p~~i~s~~~~~~~fG~~~~~~~~~~~v~~~f~ngg 75 (663) T protein:vir:10 1 MAL-LSPGIEMKETSINSTVVRSATGRAALVGKFAWGPAY---E-IRQVTNEVELVDMFGSPDNVTAPYFMSAMNFLQYG 75 (663) T ss_pred Ccc-ccCceEEEEecCcccccccccccceeeeccccCCCC---c-CEEecCHHHHHHHcCCcccccchHHHHHHHHHhCC Confidence 886 666666653333455566666778899887765553 3 345566899999999 46678889999996432 Q ss_pred CcceEEEEEeeccccc--cee-------------ee------------------------eeccchhhh----------- Q lcl|NC_013597. 78 RAKQLIVARWQKSAST--IEA-------------TK------------------------NTLSGATLS----------- 107 (502) Q Consensus 78 ~P~~l~igr~~~~~~~--~~~-------------~~------------------------~~~~~~~~~----------- 107 (502) . ++||-|....... +.+ +. ....+.... T Consensus 76 ~--~~~vvRv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~v~~~~~~~~~~~~~~~~~~~g~~~~~~~~~a~~~~~ 153 (663) T protein:vir:10 76 N--DLRLVRVIDMEQAKNASPLFNQIEVTITTEGQGYTVGDTVSIKHNTTTVTEEGKVTKVDADGKIKALFVPSSAVIAK 153 (663) T ss_pred C--eEEEEecCCcccccccccccccceeeEeecccCccccceeeecccccccccCcceeeeccCCceeEEEecccccccc Confidence 2 4444443221100 000 00 000000000 Q ss_pred -HHHHHhhcc-cceeEEE-----------EecCccc----cccccc--cccccchh--------hHHHHHHhhhcccccc Q lcl|NC_013597. 108 -DDLERFKSV-VNGRFSL-----------TIGGDVK----KVDGLS--FARLADFN--------AVATKIQEKLTTLSVA 160 (502) Q Consensus 108 -~~~~~~~~~-~~g~~~i-----------ti~g~~~----~~~~i~--~s~~ts~~--------~vA~~i~aal~~a~~~ 160 (502) ......... .+....+ ++.+... ...... ....++.. +............+.. T Consensus 154 a~~~~~~~~~~~a~~~~v~~~~~~~~~a~av~~i~~dg~vt~~~~~~a~~~~~~~~~~~~~~~~~~~~~~a~~~g~~G~~ 233 (663) T protein:vir:10 154 AKQLGTYPVLGDNWRAEVSGASGGSAATLTLGGIVVDSGVTFGNSEEAPDVMTSTKVLANFAKYGMPLISAVYPGEIGST 233 (663) T ss_pred ccccccccccccceeeEEeeccccccccceeEeeecCCceeEEeeeccccccccceeeeeccccccceeeeecccccCcc Confidence 000000000 0000000 0000000 000000 00000000 0000000000000000 Q ss_pred eeEEE---ecc-----------------------------cceeeEeeecccccccceeeeeec---cccchhhhhhhhh Q lcl|NC_013597. 161 VSIAY---DET-----------------------------GNRFIVSANVAGEDKKTEIDYAID---EGGEGEYIGALLK 205 (502) Q Consensus 161 ~tv~~---~~~-----------------------------~~~f~~~s~ttG~~~~v~~~~a~~---~~~t~t~~aa~l~ 205 (502) ..+.. ... ..++.+.....|.... ....... ....+...-.... T Consensus 234 i~v~~~~~~~~~~~~~~~v~~~~g~~~~~~~~~~~~g~~~~~~~~~~~~~~g~~~e-~~~ls~~~~~~~~~~~~~~~~~~ 312 (663) T protein:vir:10 234 VEVEVISKTAFQSGAAQPIYPFGGTRASNARSVIQYGPMTDDQFAIIVRRDGIVVE-STVLSTRRGDRDVYGNNIFMDDY 312 (663) T ss_pred eeEeecccccccccceeeecccCcccccccccccccccccchhhcccccCCCcccc-eeeeeccccccccchhhhhhhhh Confidence 00000 000 0000000000000000 0000000 0000000000001 Q ss_pred hccccc------------c-eeeeeccccccc------cCHHHHHHHHHhc-cCceeEEEEecCC--Chh----HHHHHH Q lcl|NC_013597. 206 LENGQA------------S-RKVGKNSVSLKK------ETLGEALFNVAEV-NNTWYGFTVAAQL--TDS----EVEAAA 259 (502) Q Consensus 206 ~t~~~~------------~-~~v~v~~~~~~~------et~~~al~al~~~-~~~w~~~~~~~~~--~~~----~~~a~a 259 (502) +..... . ..+.....+.+. .+....++.+.+. ..+...+++.... ..+ -+.++. T Consensus 313 ~~~~~s~~v~~~~~~~~~~~~~~~~l~gg~~~~~~~~~~d~~~~~~~l~~~~~~~~~~~i~~~~~~~~~~~~~~v~~~l~ 392 (663) T protein:vir:10 313 FRNGSSNFIYASSVNWPAGFTGIIQLGGGASANNAVGSDELIAGWDLFADREALHVNLMIAGACKSDGVAVASTVQKHVV 392 (663) T ss_pred hcCcccceeEeeccccCcccceeEEecccccCcccchhhhhhhHHhhhccccccCceEEEeecCCCCchhhHHHHHHHHH Confidence 111000 0 000000111111 1122233333332 2344443333211 112 233445 Q ss_pred HHHhhcCCEEEEEecCchhcc--cchhHHH---HHH-----------HHccC--CceEEEecC-------Cc-----cch Q lcl|NC_013597. 260 KYAQANTKLFGANVIRAEQIE--WSADNIY---KKL-----------YDAGL--DHTLAMFDK-------ND-----MYP 309 (502) Q Consensus 260 ~w~~a~~~~~~~~~~~~~~~~--~~~~~i~---~~l-----------~~~~~--~~t~~~y~~-------~~-----~~~ 309 (502) ..++....+|.+.-....... .....+. ... ...++ .|..+ |++ .+ -.+ T Consensus 393 ~~~~~~~~~~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~l-~~p~~~~~d~~~~~~~~~p~ 471 (663) T protein:vir:10 393 ALADDRQDCVAFVNPPSELLVGVPTTQAVKNIVEWRNGVTTGGEVVDNNMNISSTYAFI-SGNYKYQYDKYNDINRWVPL 471 (663) T ss_pred HHHHhhCCEEEEEecCcccccccchhhhHHHHHHHhhhccccchhhhhhcccCcceEEE-EecceeEecccCCceEEech Confidence 555555445554422111110 0011111 100 01122 23333 332 11 135 Q ss_pred HHHHHHHHHhcCCCCCCceeeEeeeecCcccc-----CCCCHHHHHHHHhCCceEEEEEcC-ce-EEecCEeecCe---- Q lcl|NC_013597. 310 VSSALARLLSTNFAANNSTLTLKFKQQPTITA-----DEITATEFAKAKRLGINVYTYFDD-VA-MIAEGTVIGGK---- 378 (502) Q Consensus 310 ~aa~~g~~as~n~~~~~g~~T~~fk~~~Gv~~-----~~lt~t~~~~l~~~~~n~y~~~~~-~~-~~~~G~~~~G~---- 378 (502) .+.++|.++.+|.++.+ ......|.+.||.- ..+++.|.+.|..+|+|.+..+-+ .+ .+|..++++++ T Consensus 472 s~~vAGl~Ar~D~~~g~-~~span~~~~~i~g~~~~~~~~~~~~~~~Ln~~gin~i~~~~~~~G~~~wG~rT~s~~~s~~ 550 (663) T protein:vir:10 472 SADIAGLCAYTDQVGHP-WMSPAGYRRGQLRNTIKLAIEPKQSLRDTMYQVSINPVTGFAGGDGFVLFGDKMATQVPSPF 550 (663) T ss_pred HHHHHHHHHHhhccCCc-EEccCCeeecceeccccceeecCchhHHHHHhCCCcEEEEeeCCCcEEEEcccccCCCCccc Confidence 67888988888754421 11223344434332 257899999999999999998754 44 58888988873 Q ss_pred -ehhHHHHHHHHHHHHHHHHHHHHHhcCCCCccCHHHHHHHHHHHHHHHHHHHHcCccccccccCccccccccccccccc Q lcl|NC_013597. 379 -FADEIVILDWFVDAVQKEVFARLYKSPTKIPLTDKGQAILIAAVEKVCLEGINNGAFAPGKWTGAGFGNLSTGDYLDKG 457 (502) Q Consensus 379 -~iD~~~~~dwl~~~iq~~l~~~l~~~~~kiPyt~~G~~~l~~~v~~vl~~a~~~G~I~~g~~~~~~~g~~~~~~~~~~g 457 (502) ||-+.+-.+|+...|+..+...++. |.++.-...|+..|+.-|++.+++|.|. | T Consensus 551 ~~i~vrR~~~~i~~si~~~~~~~v~e-----pn~~~l~~~i~~~i~~~L~~l~~~gal~--------------------g 605 (663) T protein:vir:10 551 DRINVRRLFNMLKKNIGDTSKYELFE-----NNDAFTRQSFRMEVSQYLDNIRSLGGVY--------------------D 605 (663) T ss_pred ceEehhhHHHHHHHHHHHHHHHhccC-----CCCHHHHHHHHHHHHHHHHHHHhCCcee--------------------e Confidence 6788889999999999998665432 7788899999999999999999999985 4 Q ss_pred eEEEcCchhcCCHHHHhhcccCceEEEEEECceEEEEEEEEEEeC Q lcl|NC_013597. 458 FYVWAAPMDTLSDSDRQARRATPIQTAVKLAGAIHSSDVIVNYNR 502 (502) Q Consensus 458 y~v~~~~~~~~s~~dra~R~~~~i~~~~~~aGaIh~v~i~~~v~~ 502 (502) |.|.++ .+..+++|+.+.+. -+.+.++....+++|.+++.-.| T Consensus 606 f~V~~d-~~~nt~~~i~~G~~-~~~i~~~p~~pae~I~~~~~~~~ 648 (663) T protein:vir:10 606 FRVVCD-TTNNTPQVIDSNEF-VATIYIKAPRSINYITLNFVATS 648 (663) T ss_pred eEEEEc-CCCCCHHHhhCCeE-EEEEEEEecCCcceEEEEEEEEe Confidence 889998 67889999998888 69999999999999999888776 No 58 >protein:vir:5663 Length: 671 # NCBI annotation: tail sheath protein # Family: family:all:661 # MgeID: mge:119 # MgeName: KVP40 # Cross-refs: genbank:acc:NP_899602;genbank:gi:34419589;genbank:GeneID:2545776 Probab=97.58 E-value=4.2e-05 Score=44.60 Aligned_cols=440 Identities=9% Similarity=-0.026 Sum_probs=161.9 Q ss_pred CCcCcCceeEEeecccc------cccccccccceEEEecccccccccCccceEEecCHHHHH---h-hcCCCcHHHHHHH Q lcl|NC_013597. 1 MALSISHIVNVQLNTVP------KSAARKSFGIVALFTPEAGQAFADEKTRYVYVENQRDVE---Q-LFGTNSETAKAAQ 70 (502) Q Consensus 1 Msip~s~iV~V~i~~~~------~~~~~~~f~~~lil~~~~~~~~~~~~~r~~~y~s~~~v~---~-~fg~~s~ey~aA~ 70 (502) +..+-.+.+.+...... .......-...+.....+.. ...........+. . ........|.-+. T Consensus 145 ~~~~~~~~v~~~~~~~~~~~~~~~t~~~~~~~~~v~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 218 (671) T protein:vir:56 145 IFLPSAEIVAAAKSDGNYPSVGTITLQPTQGDIALTNIEIIDT------GSVYFPNIELAFDALTAIETEGGALKYADLI 218 (671) T ss_pred eeccceeEEEeeeccccccccccccccccccceeeeeeccccc------ceEEEeccccccccccccccccccccchhhh Confidence 22222222222111100 00000000000000000000 0000000000000 0 0000011111110 Q ss_pred HHhc---------CCC-CcceEEEEE-eecccccceeeeeeccchhhhHHHHHhhcccceeEEEEecCccccccccc-cc Q lcl|NC_013597. 71 PFFA---------QSP-RAKQLIVAR-WQKSASTIEATKNTLSGATLSDDLERFKSVVNGRFSLTIGGDVKKVDGLS-FA 138 (502) Q Consensus 71 ~~F~---------q~p-~P~~l~igr-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~iti~g~~~~~~~i~-~s 138 (502) ..+. +.. ..-.+.+.. ....... .. ........-...++.++......... +. T Consensus 219 ~~~~~~~~~a~~~g~~g~~~~v~v~~~~~~~~~~-a~--------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 283 (671) T protein:vir:56 219 EKQGFPRLSARYVGDFGDAISVEIINYADYQTAF-AF--------------AAGHTLGDIELPIYPDGGTRSINLSSYFT 283 (671) T ss_pred hcccccccccccccccCcceEEEEeccccccccc-cc--------------ccceeeeeccccccccccccccccceeec Confidence 0000 000 000111100 0000000 00 00000000000011111000000000 00 Q ss_pred c-ccchhhHHHHHHhhhcccccceeEEEecccceeeE---eeecccccccceeeeeeccccchhhhhhhhhhccccccee Q lcl|NC_013597. 139 R-LADFNAVATKIQEKLTTLSVAVSIAYDETGNRFIV---SANVAGEDKKTEIDYAIDEGGEGEYIGALLKLENGQASRK 214 (502) Q Consensus 139 ~-~ts~~~vA~~i~aal~~a~~~~tv~~~~~~~~f~~---~s~ttG~~~~v~~~~a~~~~~t~t~~aa~l~~t~~~~~~~ 214 (502) . ....+.....+.. -........++.......... ............+..... ...+. .. ..... T Consensus 284 ~~~~~~~~~~~~v~~-~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~-~~--------~~~~~ 352 (671) T protein:vir:56 284 FGPSNSNQYAVIVRV-SGEVEEAFIVSTNPGDKDVNGQSIFIDEYFENSGSAYITAIA-EGWKT-ES--------GAYNF 352 (671) T ss_pred ccccccccceeEEee-cCccceeEEEeecccccccchhhhhhhhhhcccCceEEEecC-cccCC-cc--------ccccc Confidence 0 0000000000000 000000000000000000000 000000000000000000 00000 00 00000 Q ss_pred eeeccccccccCHHHHHHHHHhccCcee-EEEEecC-CC-h---h---HHHHHHHHHhhcCCEEEEEecCchh-c----c Q lcl|NC_013597. 215 VGKNSVSLKKETLGEALFNVAEVNNTWY-GFTVAAQ-LT-D---S---EVEAAAKYAQANTKLFGANVIRAEQ-I----E 280 (502) Q Consensus 215 v~v~~~~~~~et~~~al~al~~~~~~w~-~~~~~~~-~~-~---~---~~~a~a~w~~a~~~~~~~~~~~~~~-~----~ 280 (502) ++-........+...++.++.+.. ... .++.+.. .. . . ....+..-++....++.+....... + . T Consensus 353 ~gg~d~~~~~~~~~~~~~~~~~~~-~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~ 431 (671) T protein:vir:56 353 GGGSDANAGADDWMFGLDMLSDPE-VLYTNLVIAGNAAAEEVSIASTVQKYAIDSVGNVRQDCVVFVSPPQSLIVNKQAG 431 (671) T ss_pred cCccccccchhHHHHHHHhhhhcc-ccceeEEEcCCCCCccchhHHHHHHHHHHHHHhhcCCEEEEEecccccccccccc Confidence 000000011122333343333321 111 1122111 00 0 0 0111222222333333332111000 0 0 Q ss_pred cchhHHHHHHH------------Hcc--CCceEEEecC-------Cc-----cchHHHHHHHHHhcCCCCCCceeeEeee Q lcl|NC_013597. 281 WSADNIYKKLY------------DAG--LDHTLAMFDK-------ND-----MYPVSSALARLLSTNFAANNSTLTLKFK 334 (502) Q Consensus 281 ~~~~~i~~~l~------------~~~--~~~t~~~y~~-------~~-----~~~~aa~~g~~as~n~~~~~g~~T~~fk 334 (502) ....++..... ..+ ..+.. +|++ .. -.+.+.++|.++.+|.++.+. .....| T Consensus 432 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~-~~~p~~~~~d~~~~~~~~~p~s~~~AGl~Ar~D~~~g~~-~span~ 509 (671) T protein:vir:56 432 TAVANIQGWRTGIDPTNGQAVVDNLNVSTTYAV-IDGNYKYQYDKYNDRNRWVPLAGDIAGLCAYTDQVSQPW-MSPAGF 509 (671) T ss_pred ccHHHHHHHhhhccccchhhhhhhccCCcceEE-EecCceEEecccCCceeEechHHHHHHHHHHhhccCCcE-ECcCCc Confidence 00111111110 011 12222 2322 11 125678889999888654211 111234 Q ss_pred ecCc---cc--cCCCCHHHHHHHHhCCceEEEEEcCc-eEEecCEeecC-----eehhHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_013597. 335 QQPT---IT--ADEITATEFAKAKRLGINVYTYFDDV-AMIAEGTVIGG-----KFADEIVILDWFVDAVQKEVFARLYK 403 (502) Q Consensus 335 ~~~G---v~--~~~lt~t~~~~l~~~~~n~y~~~~~~-~~~~~G~~~~G-----~~iD~~~~~dwl~~~iq~~l~~~l~~ 403 (502) .+.+ +. ...+++.|.+.|..+|+|...++.+. ..+|.++++++ .||-+.+-.+|+...|+..++..++. T Consensus 510 ~~~~i~g~~~~~~~~~~~~~~~Ln~~gIn~i~~~~~~G~~~wG~rT~~~~~~~~~~i~vrR~~~~i~~si~~~~~~~v~e 589 (671) T protein:vir:56 510 NRGQIKGVNRLAVDLRRAHRDALYQIGINPVVGFAGQGFVLYGDKTATQQASAFDRINVRRLFNLLKKAISDAAKYRLFE 589 (671) T ss_pred eeccccccccceeecChhHHHHHhhCCceEEEEecCCeEEEEcceecCCCCcccceEehhhHHHHHHHHHHHHHHHhcCC Confidence 3333 22 23578999999999999999998765 56788898876 27888999999999999998765543 Q ss_pred cCCCCccCHHHHHHHHHHHHHHHHHHHHcCccccccccCccccccccccccccceEEEcCchhcCCHHHHhhcccCceEE Q lcl|NC_013597. 404 SPTKIPLTDKGQAILIAAVEKVCLEGINNGAFAPGKWTGAGFGNLSTGDYLDKGFYVWAAPMDTLSDSDRQARRATPIQT 483 (502) Q Consensus 404 ~~~kiPyt~~G~~~l~~~v~~vl~~a~~~G~I~~g~~~~~~~g~~~~~~~~~~gy~v~~~~~~~~s~~dra~R~~~~i~~ 483 (502) |.++.=...|+..|+.-|+..+++|.|. ||.|.+. .++.|++|+.+.+. -+.+ T Consensus 590 -----pn~~~~~~~i~~~i~~fL~~l~~~gal~--------------------g~~v~~d-~~~nt~~~i~~G~~-~~~i 642 (671) T protein:vir:56 590 -----LNDEFTRSSFKSEIDAYLTNIQDLGGVY--------------------DFRVVCD-ETNNPGSVIDRNEF-VASI 642 (671) T ss_pred -----CCCHHHHHHHHHHHHHHHHHHHhCCcee--------------------eeEEEEc-CCCCCHHHhhCCeE-EEEE Confidence 6677778899999999999999999985 4899998 67899999999988 6999 Q ss_pred EEEECceEEEEEEEEEEeC Q lcl|NC_013597. 484 AVKLAGAIHSSDVIVNYNR 502 (502) Q Consensus 484 ~~~~aGaIh~v~i~~~v~~ 502 (502) .++....+++|.+++.=.+ T Consensus 643 ~~~p~~Pae~I~~~~~~~~ 661 (671) T protein:vir:56 643 YVKPAKSINFITLNFVATS 661 (671) T ss_pred EEEecCCcceEEEEEEEee Confidence 9999999999999886555 No 59 >protein:vir:100829 Length: 607 # NCBI annotation: hypothetical protein # Family: family:all:2449 # MgeID: mge:1633 # MgeName: LP65 # Cross-refs: genbank:acc:YP_164738;genbank:gi:56693151;genbank:GeneID:3197462 Probab=97.33 E-value=9.2e-05 Score=42.71 Aligned_cols=453 Identities=10% Similarity=0.048 Sum_probs=209.9 Q ss_pred CCc------------CcCce----eEEeecccc-cccccccccceEEEecccccccccCccceEEecCHHHHHhhcCCCc Q lcl|NC_013597. 1 MAL------------SISHI----VNVQLNTVP-KSAARKSFGIVALFTPEAGQAFADEKTRYVYVENQRDVEQLFGTNS 63 (502) Q Consensus 1 Msi------------p~s~i----V~V~i~~~~-~~~~~~~f~~~lil~~~~~~~~~~~~~r~~~y~s~~~v~~~fg~~s 63 (502) |+- |..++ |-|.+.-+. .+....+.+.+.++|....-|+ .++..+++.++...-||.. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~pgv~~~~~~~~~~~~~~~~~~~~~~iG~a~~G~~----~~~~~~~~~~~a~~~f~~g- 75 (607) T protein:vir:10 1 MTTTITSAESYKRIYPLFYDSRPHVETNFDDSRLSNTASDSAKNIFMLGSATNGDP----TKVYEIRTSQQATKIFGSG- 75 (607) T ss_pred CcceecchhhHHHHhCCCCccCCceEEEEecCcCcCCCCCCcceEEEEEEeCCCCC----ceEEEEcchhHHHHhhcCc- Confidence 321 22222 333333332 3355666778888998876665 3567777888889999874 Q ss_pred HHHHHHHHHhcC----CCCcceEEEEEeecccccceeeeeecc-----------chhhhH--HHH---Hhh--------- Q lcl|NC_013597. 64 ETAKAAQPFFAQ----SPRAKQLIVARWQKSASTIEATKNTLS-----------GATLSD--DLE---RFK--------- 114 (502) Q Consensus 64 ~ey~aA~~~F~q----~p~P~~l~igr~~~~~~~~~~~~~~~~-----------~~~~~~--~~~---~~~--------- 114 (502) +...|..+.|.- .-.++.+|.=|-.. +..+.++.+.+. ...++. ... .++ T Consensus 76 ~l~~a~~~a~~~~~~~~~g~~~~~~~rv~~-~~~a~~~~~~~~~~~~~~~~~~~~i~~~l~~~~~~~~~~~~~~~~d~~~ 154 (607) T protein:vir:10 76 DLVDGIKLAFDPTGNSVTNGGTVYALRVDN-AKQASLVKDGLTFTSSIFGTNANQVSVALDNDVFGVPRITVNYSPDNYE 154 (607) T ss_pred chHHHHHHhhccccCCccCCceEEEEeCCC-ccccceecccccccccccccCCCceEEEEEecCCCccceeEEeecccce Confidence 466667788842 13456666555322 111111111000 000000 000 000 Q ss_pred cc----------------cceeEEEEe--cCcccccccccc----cccc---------chhhHHHHHHhhhccc-cccee Q lcl|NC_013597. 115 SV----------------VNGRFSLTI--GGDVKKVDGLSF----ARLA---------DFNAVATKIQEKLTTL-SVAVS 162 (502) Q Consensus 115 ~~----------------~~g~~~iti--~g~~~~~~~i~~----s~~t---------s~~~vA~~i~aal~~a-~~~~t 162 (502) .+ ....+++.. +|..+.++ +.. .... ..-..+..+...|.+- .+.+. T Consensus 155 ~~~~n~g~~~~i~y~g~~~~a~~~v~~~~~g~~~~lt-~~~~~~~~~~~~V~~~~l~~~~~~t~~~l~~din~~~~~~A~ 233 (607) T protein:vir:10 155 RTYTNIGQMFSITYSGKSASAGYTVSHDTDGKAILLT-LGSGDSIDKLTNVATFDLTMSKYDTIAKLMQAISATPNFSAS 233 (607) T ss_pred eeeeeccceeecccCcccccccceeeecCCCceeEEE-ecCCCccceeeeeecccccccccchHHHHHHHhhcCCceEEE Confidence 00 000111221 13322221 000 0000 0001111111111110 00010 Q ss_pred --------EE-EecccceeeEeeeccc--ccc-cc-------e-eeeeeccccchhhh--------hhhhhhccc---c- Q lcl|NC_013597. 163 --------IA-YDETGNRFIVSANVAG--EDK-KT-------E-IDYAIDEGGEGEYI--------GALLKLENG---Q- 210 (502) Q Consensus 163 --------v~-~~~~~~~f~~~s~ttG--~~~-~v-------~-~~~a~~~~~t~t~~--------aa~l~~t~~---~- 210 (502) .. .+.....+.+.....- ... .+ . +............. ......+.. + T Consensus 234 ~~g~~~i~tky~d~~~~~i~V~~~~~iv~a~~~D~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a 313 (607) T protein:vir:10 234 VVGSPSVNTSYLDEVTSPVDVKTAPAVVTAKIGDAISKLGYDPYVVVTQTSNNKPIVNGVSAGTGSATASVTTAPESFPA 313 (607) T ss_pred EecccceeeeccccccceeEEEEeeeeechhhhhhhhcccccceEEeeecccchhhhhhhhccccceeeeeecccccccc Confidence 00 1122222222221110 000 00 0 00000000000000 000000000 0 Q ss_pred --cceeeeeccccccccCHHHHHHHHHhccCceeEEEEecCCChhHHHHHHHHHhhc---CCEEEEEecCchhcccchhH Q lcl|NC_013597. 211 --ASRKVGKNSVSLKKETLGEALFNVAEVNNTWYGFTVAAQLTDSEVEAAAKYAQAN---TKLFGANVIRAEQIEWSADN 285 (502) Q Consensus 211 --~~~~v~v~~~~~~~et~~~al~al~~~~~~w~~~~~~~~~~~~~~~a~a~w~~a~---~~~~~~~~~~~~~~~~~~~~ 285 (502) +...+.-...|...++..+++++|... +|+.+.+.. .+.+.+.++.+|++.. .+.+........ ...... T Consensus 314 ~~a~~~LtGGtdG~~~~ty~dal~aLe~~--e~~~i~~~t-~d~ai~~~l~a~vkr~~~~g~~~~aVlg~~~--~~t~~~ 388 (607) T protein:vir:10 314 NFDTAFLTGGSTGDVPVSWADKFNGAIGN--NVYYIIPLT-SEENIHAELQAFIDEQHVLGYNYHAFVGGGF--AEPLEQ 388 (607) T ss_pred ccceeeeeCCCCCCchhhHHHHHHHHhhc--CceEEEecC-CCHHHHHHHHHHHHHHHhCCCcEEEEecCCC--CCCHHH Confidence 001111112233334567888888875 466555443 3456678899999752 333333222111 122344 Q ss_pred HHHHHHHccCCceEEEecC-----C---ccc----hHHHHHHHHHhcCCCCCCceeeEeeeecC--ccccCCCCHHHHHH Q lcl|NC_013597. 286 IYKKLYDAGLDHTLAMFDK-----N---DMY----PVSSALARLLSTNFAANNSTLTLKFKQQP--TITADEITATEFAK 351 (502) Q Consensus 286 i~~~l~~~~~~~t~~~y~~-----~---~~~----~~aa~~g~~as~n~~~~~g~~T~~fk~~~--Gv~~~~lt~t~~~~ 351 (502) +....+..++.|.+.+... . ..+ ..+.+.|..++.+++.. +| ||.++ ++.+ .++.+|++. T Consensus 389 ~~t~a~~~N~ervv~V~~~~~~~~~G~~~~~~~~~~Aa~vAGl~Ag~~~~~S---lT--~k~i~~~~v~~-~lt~~e~e~ 462 (607) T protein:vir:10 389 ILSRQVNINDSRFGLVGQSGHVQEGGESVHVPAYLMAAYVGGLSSSLGVAVP---IT--NKKLALVDLDQ-NFSGDDLNT 462 (607) T ss_pred HHHHHHhhCCCcEEEEecCeeEeeCCcceeccHHHHHHHHHHHHhcCccccC---cc--cceeccccccc-cCCHHHHHH Confidence 5556667788887765421 0 112 23445577776665443 33 34444 4443 699999999 Q ss_pred HHhCCceEEEEEcCc-----eEEecCEeecC-----ee--hhHHHHHHHHHHHHHHHHHHHHHhcCCCCccCHHHHHHHH Q lcl|NC_013597. 352 AKRLGINVYTYFDDV-----AMIAEGTVIGG-----KF--ADEIVILDWFVDAVQKEVFARLYKSPTKIPLTDKGQAILI 419 (502) Q Consensus 352 l~~~~~n~y~~~~~~-----~~~~~G~~~~G-----~~--iD~~~~~dwl~~~iq~~l~~~l~~~~~kiPyt~~G~~~l~ 419 (502) +..+|+..+....+. -.+.+|.+.-+ .| |=.++-.|.+...++..+-+. +- +|++.+ .....++ T Consensus 463 ai~~Gv~~l~~~~~~~~~~~vrIv~~ItT~t~~~~~~~~~i~viRv~D~i~~dir~~~~~~-yI--Gk~nnd-~~~~~vk 538 (607) T protein:vir:10 463 LNQNGVIGIEHLVNRNATGGYYIVQDVSTNTVSSSHVDGSLYLGELTDFLFDNLRFVLRDT-YI--GSNIRS-TSADDIK 538 (607) T ss_pred HHhCCeEEEEEccCccccceEEEeeeeeeccCCCCcchheeehhhhHHHHHHHHHHHHhhc-CC--cccCCc-chHHHHH Confidence 999999988654322 23445555532 24 668889999999998876433 33 565444 5667788 Q ss_pred HHHHHHHHH--HHHcCccccccccCccccccccccccccceEEEcCchhcCCHHHHhhcccCceEEEEEECceEEEEEEE Q lcl|NC_013597. 420 AAVEKVCLE--GINNGAFAPGKWTGAGFGNLSTGDYLDKGFYVWAAPMDTLSDSDRQARRATPIQTAVKLAGAIHSSDVI 497 (502) Q Consensus 420 ~~v~~vl~~--a~~~G~I~~g~~~~~~~g~~~~~~~~~~gy~v~~~~~~~~s~~dra~R~~~~i~~~~~~aGaIh~v~i~ 497 (502) ..+...|.. -...|.|. +.... | +.+. ...| | --+++.++.-.+|++|.++ T Consensus 539 ~~i~~~L~~~~l~~~gaI~-df~~e---------d-------v~v~-----~~~D---~--v~v~~~v~Pv~~iekIyvt 591 (607) T protein:vir:10 539 STVASYLYSEMNNDDGLIV-DFSES---------D-------IVVT-----ISGT---V--VYIQFAVAPTQEIKNIVVS 591 (607) T ss_pred HHHHHHHHHHHHHhcCcee-CCCcc---------c-------cEEe-----eCCC---E--EEEEEEEEEcccceEEEEE Confidence 888888743 34457774 21100 0 1111 1122 2 2488999999999999999 Q ss_pred EEEeC Q lcl|NC_013597. 498 VNYNR 502 (502) Q Consensus 498 ~~v~~ 502 (502) +.+.. T Consensus 592 v~v~~ 596 (607) T protein:vir:10 592 GTYSN 596 (607) T ss_pred EEEEE Confidence 99998 No 60 >protein:vir:106427 Length: 679 # NCBI annotation: gp18 tail sheath protein # Family: family:all:661 # MgeID: mge:1474 # MgeName: Aeh1 # Cross-refs: genbank:acc:NP_944106;genbank:gi:38640150;genbank:GeneID:2658188 Probab=97.27 E-value=0.00011 Score=42.27 Aligned_cols=465 Identities=13% Similarity=0.044 Sum_probs=219.1 Q ss_pred CCcCcCceeEEeecccccccccccccceEEEecccccccccCccceEEecCHHHHHhhcC---CCcHHHHHHHHHhcCCC Q lcl|NC_013597. 1 MALSISHIVNVQLNTVPKSAARKSFGIVALFTPEAGQAFADEKTRYVYVENQRDVEQLFG---TNSETAKAAQPFFAQSP 77 (502) Q Consensus 1 Msip~s~iV~V~i~~~~~~~~~~~f~~~lil~~~~~~~~~~~~~r~~~y~s~~~v~~~fg---~~s~ey~aA~~~F~q~p 77 (502) |.+ ++.=|-|.--=.+.++....-+...|+|...--|+. ++ +.-+|..+....|| ..+.++.++..||-+-- T Consensus 1 ~~~-~~Pgvyv~e~~~~~~i~~~~t~~~~~vg~~~~gp~~---~p-~~i~s~~~~~~~fg~~~~~~~~~~~~~~~f~~gg 75 (679) T protein:vir:10 1 MTL-LSPGVETKEINLQTTIARSSTGRAALVGKFNWGPAY---QI-SQVVSEVDLVDKFGRPDDQTADSFFSGVNFLNYG 75 (679) T ss_pred Cce-ecCceEEEeecCCcccccCccccceeeecccCCCCc---cC-EEecCHHHHHHHcCCcccccchHHHHHHHHHhCC Confidence 885 666665554434567777778889999988766553 33 44566899999999 46678889999996544 Q ss_pred CcceEEEEEeecccccceee--eee------------ccchhhhHHHHHhhccccee-EEEEecCccccc---------- Q lcl|NC_013597. 78 RAKQLIVARWQKSASTIEAT--KNT------------LSGATLSDDLERFKSVVNGR-FSLTIGGDVKKV---------- 132 (502) Q Consensus 78 ~P~~l~igr~~~~~~~~~~~--~~~------------~~~~~~~~~~~~~~~~~~g~-~~iti~g~~~~~---------- 132 (502) . ++||-|........... ... ..+..+...... .....+. ..+..++..... T Consensus 76 ~--~~~vvrv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~-s~~~~~~~~~~~~~~~~~~~~v~~~~~~~~ 152 (679) T protein:vir:10 76 N--DLRLVRVLNETKSRNSSALYQSLSYTITSPGVDYKVGDVVNVLQGG-NVIATGKVTVVNASGGIVAFYVPTAAIIDK 152 (679) T ss_pred C--eEEEEEccCcccccccccccccccccccccccccccccceeeeeCC-CcccceeEEEeeccCceeeeeecccccccc Confidence 3 36666653321110000 000 000000000000 0000000 011111100000 Q ss_pred -------ccc------ccccccchhh-HH-HHHHhhhcccc-------cceeEEEe----------------------cc Q lcl|NC_013597. 133 -------DGL------SFARLADFNA-VA-TKIQEKLTTLS-------VAVSIAYD----------------------ET 168 (502) Q Consensus 133 -------~~i------~~s~~ts~~~-vA-~~i~aal~~a~-------~~~tv~~~----------------------~~ 168 (502) ..+ .++..+...+ +. ..+.......+ ........ .. T Consensus 153 a~~~~~~~~l~~a~~~~~~~~t~~~g~~~~~~v~~v~~~~~~~~~~~~~a~~~i~~~~~~~~t~~~~~~~~~~~~~~A~~ 232 (679) T protein:vir:10 153 AKSLNDYPALDNAWQIQFAAGGPGAGQAATATVVGINLDSTIFVPNDEYAMSAISERSETKRTFIDICEEMKVPAIVARY 232 (679) T ss_pred cccccccceecccceeeeeeccccccceeeeeeeeeccCCceeeccccccccccccccccchhhhhhhhccccceeeeec Confidence 000 0000000000 00 00000000000 00000000 00 Q ss_pred ----cc--eeeEeeecc-----cccc-------------------cceeee------------ee--------------- Q lcl|NC_013597. 169 ----GN--RFIVSANVA-----GEDK-------------------KTEIDY------------AI--------------- 191 (502) Q Consensus 169 ----~~--~f~~~s~tt-----G~~~-------------------~v~~~~------------a~--------------- 191 (502) ++ ......... .... ...... .. T Consensus 233 ~g~~gn~i~v~~va~~~~~~~~~~~a~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vvv~~~g~~~~~~~~~ 312 (679) T protein:vir:10 233 AGTYGDNIKVLMIAYKDYYKFNEAGKIVSVNTINPKVFPTGLDYGNVTPSSYLEFGPQNESQFAFIVFNNGVAVESKILS 312 (679) T ss_pred ccccCCcceEEEEeecccccccccccccccccccccccccccccccceeeeecccccccccceeeEEecccccccceeee Confidence 00 000000000 0000 000000 00 Q ss_pred ccccchhhhh----hhhhhcccccceeee-e-------------ccccccc------cCHHHHHHHHHhccCceeEEEEe Q lcl|NC_013597. 192 DEGGEGEYIG----ALLKLENGQASRKVG-K-------------NSVSLKK------ETLGEALFNVAEVNNTWYGFTVA 247 (502) Q Consensus 192 ~~~~t~t~~a----a~l~~t~~~~~~~v~-v-------------~~~~~~~------et~~~al~al~~~~~~w~~~~~~ 247 (502) ....+..... ....+..+. ...+. . ...+.+. ......+..+......-..+++. T Consensus 313 ~~~~~~~~~~~~~~~~~~~~~~~-~~~v~~~~~~~~~~~~~~~~~~gg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~ 391 (679) T protein:vir:10 313 TKPGDRDIYGTSIYINEYFGNGY-SSFVQGVAESWPVGYTGVLAFGGGQSSNTDISAAEFMKGWDMFADREHTDVNLFIA 391 (679) T ss_pred cccccccccchhhhhhhhhcCcc-cceeeeccccccccccceeeccCCccCCCccchhhhhhhhhhhhcccccccceEEe Confidence 0000000000 000000000 00000 0 0001110 11111222111111111123333 Q ss_pred cCCC-------hhHHHHHHHHHhhcCCEEEEEecCc-hhc---ccc-hhHHHHHHH-----------HccCCce-EEEec Q lcl|NC_013597. 248 AQLT-------DSEVEAAAKYAQANTKLFGANVIRA-EQI---EWS-ADNIYKKLY-----------DAGLDHT-LAMFD 303 (502) Q Consensus 248 ~~~~-------~~~~~a~a~w~~a~~~~~~~~~~~~-~~~---~~~-~~~i~~~l~-----------~~~~~~t-~~~y~ 303 (502) .... ..-+.++...++....+|.+..--. ... ... ...+..... ..+++.. ..+|+ T Consensus 392 p~~~~~~~~~~~~v~~~l~~~~~~~~~~~ai~d~p~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~s~~~~~~~ 471 (679) T protein:vir:10 392 GAVAGEGAQIASTVQKAVVAIADERRDCLVLISPPREYMINQPAASVVRKLVDWRRGVNQAGISLDDNMNIGTTYASVDG 471 (679) T ss_pred cCCCCCchhhhHHHHHHHHHHHHhhCCeEEEEeccccccccccccchHHHHHHHHhhcccccchhhhhhccCcceEEEEc Confidence 2211 1234455666676666665542111 101 111 111111110 0112211 22343 Q ss_pred C-------Cc-----cchHHHHHHHHHhcCCCCCCceeeEeeeecCccc-----cCCCCHHHHHHHHhCCceEEEEEcCc Q lcl|NC_013597. 304 K-------ND-----MYPVSSALARLLSTNFAANNSTLTLKFKQQPTIT-----ADEITATEFAKAKRLGINVYTYFDDV 366 (502) Q Consensus 304 ~-------~~-----~~~~aa~~g~~as~n~~~~~g~~T~~fk~~~Gv~-----~~~lt~t~~~~l~~~~~n~y~~~~~~ 366 (502) + .+ -.+.+.+.|.++.+|..+.+. .....|++.||. .-.+++.|++.|..+|+|...++.+. T Consensus 472 p~~~~~d~~~~~~~~~p~sg~vAGl~Ar~D~~~g~~-~sPan~~~~~i~g~~~~~~~~~~~~~~~Ln~~gin~i~~~~g~ 550 (679) T protein:vir:10 472 NYKYQYDKYNDVNRWIPLAADIAGLCARTDTVGQPW-QSPAGFNRGQIVNVIKLAVDTRQAHRDEMYTNGINPIVGFAGQ 550 (679) T ss_pred cceeeecccCCceEEechHHHHHHHHHHhhccCCcE-ECcCCeeeccccccccceeecChhhHHhhhhCCceEEEEecCC Confidence 3 11 124577888888887544221 122344444442 12478999999999999999998775 Q ss_pred -eEEecCEeecCe-----ehhHHHHHHHHHHHHHHHHHHHHHhcCCCCccCHHHHHHHHHHHHHHHHHHHHcCccccccc Q lcl|NC_013597. 367 -AMIAEGTVIGGK-----FADEIVILDWFVDAVQKEVFARLYKSPTKIPLTDKGQAILIAAVEKVCLEGINNGAFAPGKW 440 (502) Q Consensus 367 -~~~~~G~~~~G~-----~iD~~~~~dwl~~~iq~~l~~~l~~~~~kiPyt~~G~~~l~~~v~~vl~~a~~~G~I~~g~~ 440 (502) ..+|..+++++. ||-+.+-.+|++..|+......++. |.++.=...|+..|..-|++.+++|.|. T Consensus 551 G~~~wG~rT~~~~~s~~~~i~vrR~~~~i~~si~~~~~~~v~e-----pn~~~~~~~i~~~i~~fL~~l~~~gal~---- 621 (679) T protein:vir:10 551 GYILYGDKTASQAPTPFDRINVRRLFNLLKKSISESAKYKLFE-----LNDAFTRSSFRSEVGSYLDTIRSLGGIY---- 621 (679) T ss_pred eEEEEcccccCCCCcccceEehhhHHHHHHHHHHHHHHHhccC-----CCCHHHHHHHHHHHHHHHHHHHhCCcee---- Confidence 467899998872 6778889999999999998765543 6677788999999999999999999985 Q ss_pred cCccccccccccccccceEEEcCchhcCCHHHHhhcccCceEEEEEECceEEEEEEEEEEeC Q lcl|NC_013597. 441 TGAGFGNLSTGDYLDKGFYVWAAPMDTLSDSDRQARRATPIQTAVKLAGAIHSSDVIVNYNR 502 (502) Q Consensus 441 ~~~~~g~~~~~~~~~~gy~v~~~~~~~~s~~dra~R~~~~i~~~~~~aGaIh~v~i~~~v~~ 502 (502) ||.|.+. .++.+++|+.+.+. -+.+.+...-.+++|.+++.=.| T Consensus 622 ----------------gf~v~~d-~~~nt~~~i~~G~~-~~~i~~~p~~pae~i~~~~~~~~ 665 (679) T protein:vir:10 622 ----------------DFRVVCD-ESNNTPAVIDRNEF-VATILIKPARSINYITLSFVATS 665 (679) T ss_pred ----------------eeEEEEc-CCCCCHHHhhCCeE-EEEEEEEecCCccEEEEEEEEee Confidence 4899998 68899999999988 69999999999999999877555 No 61 >protein:vir:98263 Length: 664 # NCBI annotation: gp18 tail sheath monomer # Family: family:all:661 # MgeID: mge:1667 # MgeName: RB43 # Cross-refs: genbank:acc:YP_239196;genbank:gi:66391671;genbank:GeneID:3416365 Probab=96.96 E-value=0.00023 Score=40.49 Aligned_cols=465 Identities=13% Similarity=0.036 Sum_probs=220.4 Q ss_pred CCcCcCceeEEeecccccccccccccceEEEecccccccccCccceEEecCHHHHHhhcC---CCcHHHHHHHHHhcCCC Q lcl|NC_013597. 1 MALSISHIVNVQLNTVPKSAARKSFGIVALFTPEAGQAFADEKTRYVYVENQRDVEQLFG---TNSETAKAAQPFFAQSP 77 (502) Q Consensus 1 Msip~s~iV~V~i~~~~~~~~~~~f~~~lil~~~~~~~~~~~~~r~~~y~s~~~v~~~fg---~~s~ey~aA~~~F~q~p 77 (502) |++ ++.=|-|.-.=.+.++....-+...|+|...--|+. + -...+|..|...-|| ..+.++.+...||-+-- T Consensus 1 ma~-~~PgVyv~E~~~~~~i~~~~ts~~~~vG~~~~Gp~~---~-p~~i~s~~d~~~~fG~~~~~~~~~~~v~~~f~ngg 75 (664) T protein:vir:98 1 MAL-QSPGIETKETSVQSTVVRNSTGRAAIVGKFSWGPAY---Q-IRQISNEVELVNYFGAPDNLTADYFMSAVNFLQYG 75 (664) T ss_pred Cce-ecCceEEEecCCCcccccccccceEEEeeccCCCCC---c-cEEecCHHHHHHhcCCccccchhHHHHHHHHHhcC Confidence 996 788887773224677888888889999987765553 3 355667899999999 56677888888885422 Q ss_pred CcceEEEEEeecccccceee--eee------------ccchhhhH------HHHHhhcc---cce-eEEEEe-------- Q lcl|NC_013597. 78 RAKQLIVARWQKSASTIEAT--KNT------------LSGATLSD------DLERFKSV---VNG-RFSLTI-------- 125 (502) Q Consensus 78 ~P~~l~igr~~~~~~~~~~~--~~~------------~~~~~~~~------~~~~~~~~---~~g-~~~iti-------- 125 (502) +++||-|........... .+. ..+..+.. ....+... ..| .+.+.+ T Consensus 76 --~~~~vvRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~gn~~~v~i~~~~~~~~ 153 (664) T protein:vir:98 76 --NDLRLVRVVDKDAAKNASAIFNQIKTTIASQGSNYNVGDVIKVKYSNTVVEENGKVTQVDSNGKILAVTIPKRKKSLL 153 (664) T ss_pred --CeEEEEEecCccccccccccccccceeeccCCcccccccceeEeecCcccccceeeccCCCCCceeeEeeccCcccee Confidence 234555543211000000 000 00000000 00000000 000 000000 Q ss_pred -----------------------cCcccccc--c-c-ccccccchhhHH-HHHH--------h--------hh--ccccc Q lcl|NC_013597. 126 -----------------------GGDVKKVD--G-L-SFARLADFNAVA-TKIQ--------E--------KL--TTLSV 159 (502) Q Consensus 126 -----------------------~g~~~~~~--~-i-~~s~~ts~~~vA-~~i~--------a--------al--~~a~~ 159 (502) .+...... . + +.+......+.+ ..+. . +. ...+. T Consensus 154 ~~~~~~~~~~~~~~~~~s~~~~s~g~a~a~~v~~v~~d~~~~~~~~~~a~~~i~~~~~~~~~~~~~~~~~~a~~~G~~Gn 233 (664) T protein:vir:98 154 VLNRSVLTQIFLLVGTTEIVSQSSGVSASITIDGIESDSGITLLNLDIAKETIQGTSFQTLTQKYQIPSVVALYPGELGS 233 (664) T ss_pred ecccccccccceecccceeeeeecccceeeecccccccceeeccccceeeeccccccceeeeeccccceeeeeecccccc Confidence 00000000 0 0 000000000000 0000 0 00 00000 Q ss_pred --------------ceeEEEecc------------------cceeeEeeecccccccceeeeeeccccchhhhhh----h Q lcl|NC_013597. 160 --------------AVSIAYDET------------------GNRFIVSANVAGEDKKTEIDYAIDEGGEGEYIGA----L 203 (502) Q Consensus 160 --------------~~tv~~~~~------------------~~~f~~~s~ttG~~~~v~~~~a~~~~~t~t~~aa----~ 203 (502) ...+..... ...+.++....+... ..+....... ...+... . T Consensus 234 ~isv~i~s~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-e~~~~~~~~~-~~~~~~~~~~~~ 311 (664) T protein:vir:98 234 TVQVEIISKAAYDTGAMISGYPSGISVKNSGRSVMTYGPQTDNQYAFVVRRGGIVQ-ESFIVSTDKT-DKDIYGVNIYMD 311 (664) T ss_pred eeeeeecccccccCcceEeeccCceecccceeeeeeccccCccceeEEEecCCcee-eeEEeecccC-cccceeeeeech Confidence 000000000 000100000000000 0000000000 0000000 0 Q ss_pred -hhhccc-----------c-cceeeeecccccc------ccCHHHHHHHHHhcc-CceeEEEEecC--CChh----HHHH Q lcl|NC_013597. 204 -LKLENG-----------Q-ASRKVGKNSVSLK------KETLGEALFNVAEVN-NTWYGFTVAAQ--LTDS----EVEA 257 (502) Q Consensus 204 -l~~t~~-----------~-~~~~v~v~~~~~~------~et~~~al~al~~~~-~~w~~~~~~~~--~~~~----~~~a 257 (502) ...... + +...+.....+.+ .+.....+.++.+.. .+.-.+++... .+.+ -+.+ T Consensus 312 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~~~~~~g~~~~~tgl~~l~~~~~~~~~ll~~p~~~~~~~~~~~~v~~a 391 (664) T protein:vir:98 312 DFFANGGSQYVFGTSMNWPKGFSGILEFGGGLSSNDTVGADELMTGWDMFADREALHVPLLIAGGCAGESVEIASTVQKH 391 (664) T ss_pred hheecccceeeeeecccCCcccceeEeccCccccccccCchhHHHHHHhhhcccccccceEEecCCCCCcHHHHHHHHHH Confidence 000000 0 0000000001111 122234455554432 12211223321 1122 2334 Q ss_pred HHHHHhhcCCEEEEEecCchh-cc---c-chhHHHHHHH------------HccCCce-EEEecC-------Cc-----c Q lcl|NC_013597. 258 AAKYAQANTKLFGANVIRAEQ-IE---W-SADNIYKKLY------------DAGLDHT-LAMFDK-------ND-----M 307 (502) Q Consensus 258 ~a~w~~a~~~~~~~~~~~~~~-~~---~-~~~~i~~~l~------------~~~~~~t-~~~y~~-------~~-----~ 307 (502) +...++..+.+|.+...-... ++ . ...++..... ..+++.. ..+|++ .. - T Consensus 392 l~~~a~~~~~~~a~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~l~~p~~~~~d~~~~~~~~~ 471 (664) T protein:vir:98 392 VISIGDERQDCTVFVSPPRSLLVNIPLATAVDNIVEWRTGYKISGGTPVDNNLNVSSSYGFLDGNYKYQYDKYNDVNRWV 471 (664) T ss_pred HHHHHHhcCCeEEEEccccceeccCCccccHHHHHHHhhhccccccchhhhhcCCccceEEEEcCeEEEecccCCceEEe Confidence 555555555566543211110 00 1 1111111111 1122211 234443 11 1 Q ss_pred chHHHHHHHHHhcCCCCCCceeeEeeeecCccc-----cCCCCHHHHHHHHhCCceEEEEEcC-ce-EEecCEeecC--- Q lcl|NC_013597. 308 YPVSSALARLLSTNFAANNSTLTLKFKQQPTIT-----ADEITATEFAKAKRLGINVYTYFDD-VA-MIAEGTVIGG--- 377 (502) Q Consensus 308 ~~~aa~~g~~as~n~~~~~g~~T~~fk~~~Gv~-----~~~lt~t~~~~l~~~~~n~y~~~~~-~~-~~~~G~~~~G--- 377 (502) .+.+.++|.++.+|..+.+ -.....|.+.||. ...+++.|.+.|-.+|+|....+-+ .+ .+|.++++++ T Consensus 472 p~sg~~AGl~A~~D~~~g~-~~span~~~~~i~g~~~~~~~~~~~~~~~Ln~~gIn~i~~~~~~~G~~~wG~rT~~~~~s 550 (664) T protein:vir:98 472 PLAGDIAGLCVYTDSVANP-WMSPAGYNRGQIRNCIKLAIEPRTAHRDAMYQVQINPVTGFAGGSGFVLYGDKTLTSVPS 550 (664) T ss_pred chHHHHHHHHHHhhhcCCc-EECcCCceeeeeeccccceeecChhhHHHHHhCCCeEEEEeeCCCcEEEEcccccCCCCc Confidence 2567788999988754421 1122234433332 2347889999999999999998765 34 6889998876 Q ss_pred e--ehhHHHHHHHHHHHHHHHHHHHHHhcCCCCccCHHHHHHHHHHHHHHHHHHHHcCccccccccCccccccccccccc Q lcl|NC_013597. 378 K--FADEIVILDWFVDAVQKEVFARLYKSPTKIPLTDKGQAILIAAVEKVCLEGINNGAFAPGKWTGAGFGNLSTGDYLD 455 (502) Q Consensus 378 ~--~iD~~~~~dwl~~~iq~~l~~~l~~~~~kiPyt~~G~~~l~~~v~~vl~~a~~~G~I~~g~~~~~~~g~~~~~~~~~ 455 (502) + ||-+.+-.+|+...|+..+...++. |.++.=...|+..|+.-|+..+++|.|. T Consensus 551 ~~~~i~vrR~~~~i~~si~~~~~~~v~e-----pn~~~l~~~i~~~i~~~L~~l~~~gal~------------------- 606 (664) T protein:vir:98 551 PFDRINVRRLFNMIKKDIGDNAKYKLFE-----NNDDFTRASFRMDTGQYMTNIRALGGCY------------------- 606 (664) T ss_pred ccceEeehhHHHHHHHHHHHHHHHhhcC-----CCCHHHHHHHHHHHHHHHHHHHhcCcee------------------- Confidence 2 5778889999999999998765533 6788888999999999999999999985 Q ss_pred cceEEEcCchhcCCHHHHhhcccCceEEEEEECceEEEEEEEEEEeC Q lcl|NC_013597. 456 KGFYVWAAPMDTLSDSDRQARRATPIQTAVKLAGAIHSSDVIVNYNR 502 (502) Q Consensus 456 ~gy~v~~~~~~~~s~~dra~R~~~~i~~~~~~aGaIh~v~i~~~v~~ 502 (502) ||+|.+. .++.|++|+.+.+. -+.+.++..-.+++|.+++.-.+ T Consensus 607 -g~~V~~d-~~~nt~~~i~~G~~-~~~i~~~p~~pae~I~~~~~q~~ 650 (664) T protein:vir:98 607 -DYRVICD-TTNNTPDVIDRNEF-VATVYVKPPRSINYITLNFVATS 650 (664) T ss_pred -eeEEEEc-CCCCCHHHhhCCeE-EEEEEEEecCCcceEEEEEEEee Confidence 4899998 77889999999998 69999999999999999977665 No 62 >protein:vir:79798 Length: 717 # NCBI annotation: tail sheath subunit # Family: family:all:12069 # MgeID: mge:1874 # MgeName: 0305phi8-36 # Cross-refs: genbank:acc:YP_001429630;genbank:gi:156564120;genbank:GeneID:5525563 Probab=96.40 E-value=0.00066 Score=38.02 Aligned_cols=371 Identities=13% Similarity=0.112 Sum_probs=144.6 Q ss_pred CCc--Cc-CceeEEeecccccccccccc--cceEEEecccccccccCccceEEecCH-HHHHhh---------------- Q lcl|NC_013597. 1 MAL--SI-SHIVNVQLNTVPKSAARKSF--GIVALFTPEAGQAFADEKTRYVYVENQ-RDVEQL---------------- 58 (502) Q Consensus 1 Msi--p~-s~iV~V~i~~~~~~~~~~~f--~~~lil~~~~~~~~~~~~~r~~~y~s~-~~v~~~---------------- 58 (502) |=| .+ +++.+++-.+ |.+-..++| ..+|.++... +..|..+ +.|..+ T Consensus 311 n~~~~~v~~~D~~~~~~~-t~~~~~~g~~~~~pl~~ts~d----------y~~~~~~vdgI~~~~~~~V~~~g~~s~a~a 379 (717) T protein:vir:79 311 NDIMRKVESKDGAVTVTI-TKPESKRGMISEDPLVFKSGD----------YTNFKMLVDAINNHPFNNVVRARTKPEFEA 379 (717) T ss_pred eeeeeEEecCCceEEEEE-ecccccCcceeccccccccCc----------eeeeeeeecccccCchhheeeeecccccce Confidence 111 00 1111111111 111111111 0122221111 2222221 111111 Q ss_pred -cCCCcHHHHHHHHHhcCCCCcceEEEEEeecccccceeeeeeccchhhhHHHHHhhcccceeEEEEecCcccccccccc Q lcl|NC_013597. 59 -FGTNSETAKAAQPFFAQSPRAKQLIVARWQKSASTIEATKNTLSGATLSDDLERFKSVVNGRFSLTIGGDVKKVDGLSF 137 (502) Q Consensus 59 -fg~~s~ey~aA~~~F~q~p~P~~l~igr~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~iti~g~~~~~~~i~~ 137 (502) |-..-. .+...+|++-+..-.+..-. +-...+|.... . T Consensus 380 ~~~~g~~--s~d~a~f~Gg~dgl~~~~ee----------------------------------~Y~~lGgk~~d-----~ 418 (717) T protein:vir:79 380 TFTSTLQ--AAADAKFSGGKDELSLDKEE----------------------------------MYKRLGGEKNE-----E 418 (717) T ss_pred eeeeccc--CchhhccCCCccccccchhh----------------------------------hhccccccccc-----c Confidence 000000 01122233222111110000 00000011000 0 Q ss_pred ccccchhhHHHHHHhhhcccccceeEEEecccceeeEeeecccccccceeeeeeccccchhhhhhhhhhcccccceeeee Q lcl|NC_013597. 138 ARLADFNAVATKIQEKLTTLSVAVSIAYDETGNRFIVSANVAGEDKKTEIDYAIDEGGEGEYIGALLKLENGQASRKVGK 217 (502) Q Consensus 138 s~~ts~~~vA~~i~aal~~a~~~~tv~~~~~~~~f~~~s~ttG~~~~v~~~~a~~~~~t~t~~aa~l~~t~~~~~~~v~v 217 (502) +..+. ..+...|... .+.+.+.. ................ .+-..++.....+. ..+...+.. T Consensus 419 g~lt~-----~aays~LE~~--dVDlVil~---------ga~adtt~ga~~d~va-~alad~caalSal~-r~ai~VI~l 480 (717) T protein:vir:79 419 GFVTK-----QGAYQYLENY--EVDYVIPL---------GVHADTKLIGKYDDFA-YQLALACAVMSHYN-SVTIGIIPT 480 (717) T ss_pred ccccc-----hhhhhhcCcc--eeEEEEec---------CccccccccchhhhHH-HHHHHHHHHhhhcc-ccceeeecc Confidence 00000 0000111100 00000000 0000000000000000 00000000000000 000000000 Q ss_pred cccc-ccccCHHHHHHHHHhccCceeEEEEecCCChhHHHHHHHHHhhcCCEEEEEecCchhcccchhHHHHHHHHccCC Q lcl|NC_013597. 218 NSVS-LKKETLGEALFNVAEVNNTWYGFTVAAQLTDSEVEAAAKYAQANTKLFGANVIRAEQIEWSADNIYKKLYDAGLD 296 (502) Q Consensus 218 ~~~~-~~~et~~~al~al~~~~~~w~~~~~~~~~~~~~~~a~a~w~~a~~~~~~~~~~~~~~~~~~~~~i~~~l~~~~~~ 296 (502) .... ....+..+.++.+.....- .+.|.......+.. .....++...+... +. T Consensus 481 ~sp~D~~~AtVe~~~~kLs~~Aaa-----------------~~~~d~~~a~a~~~--------~~~~idis~y~~vv-~~ 534 (717) T protein:vir:79 481 TTPSDISLAGVEEHVKKLENYANE-----------------FYMRDRFGNIIFDA--------DRNKIDLGQFIEVV-AG 534 (717) T ss_pred ccccccchhhHHHHHHHHHhhhhh-----------------hhhhcchhcccccc--------ccccccccceeeee-ec Confidence 0000 0000001111111110000 00010000000000 00000000000000 01 Q ss_pred ceEEEecCCc----cchHHHHHHHHHhcCCCCCCceeeEeeeecCccc--cCCCCHHHHHHHHhCCceEEEEEcCce-EE Q lcl|NC_013597. 297 HTLAMFDKND----MYPVSSALARLLSTNFAANNSTLTLKFKQQPTIT--ADEITATEFAKAKRLGINVYTYFDDVA-MI 369 (502) Q Consensus 297 ~t~~~y~~~~----~~~~aa~~g~~as~n~~~~~g~~T~~fk~~~Gv~--~~~lt~t~~~~l~~~~~n~y~~~~~~~-~~ 369 (502) +-..+.+... ..+++.+.|..+...+...+ .+|.+.|+. ...++..|++.|..+|+|.+..+.|.+ .+ T Consensus 535 ~~~iv~~~~~~~~~~p~AG~vAGldA~rGVwkSP-----ANk~I~GVvgLa~~lT~sE~d~Ln~aGIntIr~~~GrGirV 609 (717) T protein:vir:79 535 PDFIVRNTRLGQMASTPDASYIGMVSQLKTQSAP-----TNKPLPSVTALRYTYSANQLNRLTKARFATFKYKQDGSIGV 609 (717) T ss_pred ceeEEEcCCCceeecCHHHHHHHHHhcCCccccc-----ccceecccccCcccCCHHHHHHHhhCCeEEEEEeCCceEEE Confidence 1112222111 12345566666666554443 366777765 346899999999999999999887655 57 Q ss_pred ecCEeecCe-----ehhHHHHHHHHHHHHHHHHHHHHHhcCCCCccCHHHHHHHHHHHHHHHHHHHHcCccccccccCcc Q lcl|NC_013597. 370 AEGTVIGGK-----FADEIVILDWFVDAVQKEVFARLYKSPTKIPLTDKGQAILIAAVEKVCLEGINNGAFAPGKWTGAG 444 (502) Q Consensus 370 ~~G~~~~G~-----~iD~~~~~dwl~~~iq~~l~~~l~~~~~kiPyt~~G~~~l~~~v~~vl~~a~~~G~I~~g~~~~~~ 444 (502) +.+++++++ ||-+++-.|++...|+..+...+ .+ |-+..+...|+..|+.-|++..+.|.|. T Consensus 610 WGaRTtasd~sdWryInVRRl~D~Ie~sIr~al~~yV----gE-PNd~~tr~~Ik~sI~afL~~L~r~GAI~-------- 676 (717) T protein:vir:79 610 VDAPTSAHAGSDYTRLSTARIVKEAVNAVREVADPFI----GE-PNDTGNRNALTAAVDKRLSKMIENKALL-------- 676 (717) T ss_pred EeeeecCCCCcccceeehhhhHHHHHHHHHHHHHHhc----cc-cCCHHHHHHHHHHHHHHHHHHHhcCcee-------- Confidence 899988762 57889999999999988875433 33 7788899999999999999999999995 Q ss_pred ccccccccccccceEEEcCchhcCCHHHHhhcccCceEEEEEECceEEEEEEEEEEeC Q lcl|NC_013597. 445 FGNLSTGDYLDKGFYVWAAPMDTLSDSDRQARRATPIQTAVKLAGAIHSSDVIVNYNR 502 (502) Q Consensus 445 ~g~~~~~~~~~~gy~v~~~~~~~~s~~dra~R~~~~i~~~~~~aGaIh~v~i~~~v~~ 502 (502) ||.+.+ .++++|..+-+. -+.+.+.....+++|.|++++-= T Consensus 677 ------------Gykvdv----tnT~~di~~G~l-~V~I~vaPv~PaEfI~ititITA 717 (717) T protein:vir:79 677 ------------GFDFRL----VVTPQQELLGEG-SIELSLEAPNELRRLTTIVSLSA 717 (717) T ss_pred ------------cceeeE----ecChhHhhCCEE-EEEEEEEecCcccEEEEEEEEeC Confidence 244332 356666665444 48999999999999999998888 No 63 >protein:vir:3788 Length: 376 # NCBI annotation: tail sheath # Family: family:all:669 # MgeID: mge:328 # MgeName: HP2 # Cross-refs: genbank:acc:NP_536828;genbank:gi:17981837;genbank:GeneID:929216 Probab=92.80 E-value=0.0099 Score=31.57 Aligned_cols=319 Identities=10% Similarity=0.033 Sum_probs=147.2 Q ss_pred ccccceeEEEec----------ccceeeEee-ecccccccceeeeeec----cccchhhhhhhhhhcccc-cce-eeeec Q lcl|NC_013597. 156 TLSVAVSIAYDE----------TGNRFIVSA-NVAGEDKKTEIDYAID----EGGEGEYIGALLKLENGQ-ASR-KVGKN 218 (502) Q Consensus 156 ~a~~~~tv~~~~----------~~~~f~~~s-~ttG~~~~v~~~~a~~----~~~t~t~~aa~l~~t~~~-~~~-~v~v~ 218 (502) ..+ .|..+. .-.+|-+.. ..+..++...+..-.+ .+.....+...+...+.. +.. ...+. T Consensus 1 ~~~---~v~vn~~n~~~g~~~~~er~~Lfig~~~~~~~~~~~~~~~sdld~~lg~~~~~lk~~v~aa~~naG~~~~~~~~ 77 (376) T protein:vir:37 1 MFP---SVQINALNQLSGETKEIERHALFVGVGTTNQGKLLALTPDSDFDKVFGETDTDLKKQVRAAMLNAGQNWFAHVY 77 (376) T ss_pred CCC---eEEEecccccCCCcccccceEEeeccccccccceeeecCccchHhhhCCCchHHHHHHHHHHhCCCCcEEEEEE Confidence 111 122211 112333321 1122222222211111 000111222222111211 111 12222 Q ss_pred cccccccCHHHHHHHHHhccCceeEEEEecC--CChhHHHHHHHHH---hhcCC--EEEEE-ecCchhc--cc-chhHHH Q lcl|NC_013597. 219 SVSLKKETLGEALFNVAEVNNTWYGFTVAAQ--LTDSEVEAAAKYA---QANTK--LFGAN-VIRAEQI--EW-SADNIY 287 (502) Q Consensus 219 ~~~~~~et~~~al~al~~~~~~w~~~~~~~~--~~~~~~~a~a~w~---~a~~~--~~~~~-~~~~~~~--~~-~~~~i~ 287 (502) ....+.++..+++.... ...++.+..++.. .+.+++.++.+-. ..+.+ +|+.. ..+.+.. .. .=++.. T Consensus 78 ~~~~~~~~~~~Av~~a~-~~~s~E~V~v~~pv~t~~a~i~aa~~~a~el~~~~~Rpv~file~r~~~~~~~~~e~w~~y~ 156 (376) T protein:vir:37 78 IAQEDGYDFVECVKKAN-QTASFEYCVNTRYLGVDKASIGKLQECYAELLAKFGRRTFFIQAVQGINHDQSDGETWDQYV 156 (376) T ss_pred eecCCchHHHHHHHHhh-hhcCceEEEEeccccccHHHHHHHHHHHHHHHHhcCCeEEEEEeccCcCcccccccCHHHHH Confidence 33344456667766543 3334444444443 2455666554444 34433 34333 2211111 11 112222 Q ss_pred HHHHHc--c--CCce--EEEecCCccchHHHHHHHH--HhcCCCCCCceeeEeeeecCc---------cccCCCCHHHHH Q lcl|NC_013597. 288 KKLYDA--G--LDHT--LAMFDKNDMYPVSSALARL--LSTNFAANNSTLTLKFKQQPT---------ITADEITATEFA 350 (502) Q Consensus 288 ~~l~~~--~--~~~t--~~~y~~~~~~~~aa~~g~~--as~n~~~~~g~~T~~fk~~~G---------v~~~~lt~t~~~ 350 (502) ..+.+. + ..++ ++..|. .....++||+ +++-....+|++.-- .+.| .....++...+. T Consensus 157 ~~~~al~~gia~~~V~~V~~~~g---n~~G~~aGRl~~aaVsVadspgRV~tG--~l~gl~~~~lp~d~~~~~l~~a~l~ 231 (376) T protein:vir:37 157 QKLTTLQQTIVADHVCLVPLLFG---NETGVLAGRLANRAVTVADSPARVQTG--ALVSLGSANKPLDKDRNELTLAHLK 231 (376) T ss_pred HHHHHhhcccccccceeeeeehh---hhHHHHHHHHhhcccchhhCccceecc--ccccccccccccCcCcccCCHHHHH Confidence 333331 1 2232 232232 2356778997 455444556554211 1223 233467899999 Q ss_pred HHHhCCceEEEEEcCce--EEecCEeec---CeehhHHHHHHHHHHH--HHHHHHHHHHhcCCCCccCHHHHHHHHHHHH Q lcl|NC_013597. 351 KAKRLGINVYTYFDDVA--MIAEGTVIG---GKFADEIVILDWFVDA--VQKEVFARLYKSPTKIPLTDKGQAILIAAVE 423 (502) Q Consensus 351 ~l~~~~~n~y~~~~~~~--~~~~G~~~~---G~~iD~~~~~dwl~~~--iq~~l~~~l~~~~~kiPyt~~G~~~l~~~v~ 423 (502) +|+++|+.+...|.|.. ++.+|+|+. |+|=-.-+.+.+-|.. ++......+.. ..+=-+..+++..+..+. T Consensus 232 aLd~agy~vp~~Y~gy~G~Y~~d~~tl~~~gsDY~~ie~~RVvdKa~R~vR~~ai~~i~D--~~lnst~~sia~~~~yi~ 309 (376) T protein:vir:37 232 SLETARYSVPMWYPDYDGYYWADGRTLDVEGGDYQVIENLRVVDKVARKVRLLAIGKIAD--RSFNSTTSSTEYHKNYFA 309 (376) T ss_pred HHHhCCCeEEEeeCCCCceEEeCceEeccCCCChhhhhhhhHHHHHHHHHHHHHHHHhCC--cccCcchhhHHHHHHHHH Confidence 99999999999998853 556899986 4554444455555554 33333333311 123234467888888899 Q ss_pred HHHHHHHHcCccccccccCccccccccccccccceEEEcCchhcCCHHHHhhcccCceEEEEEECceEEEEEEEEEEeC Q lcl|NC_013597. 424 KVCLEGINNGAFAPGKWTGAGFGNLSTGDYLDKGFYVWAAPMDTLSDSDRQARRATPIQTAVKLAGAIHSSDVIVNYNR 502 (502) Q Consensus 424 ~vl~~a~~~G~I~~g~~~~~~~g~~~~~~~~~~gy~v~~~~~~~~s~~dra~R~~~~i~~~~~~aGaIh~v~i~~~v~~ 502 (502) .+|++..+...|. |+ ..|| .|..|.-.++..+-....+. .|.+..+.=|.-..+++++-..= T Consensus 310 ~pLr~M~~s~~i~-g~--------------~fpG-eI~~p~d~Di~i~w~s~~~V-~I~~~v~P~~~pk~Itv~I~Ldl 371 (376) T protein:vir:37 310 KPLRDMSKSATIN-GK--------------DFPG-ECMPPKDDAITIVWQSKTKV-TIYIKVRPYDCPKEITANIFLDL 371 (376) T ss_pred HHHHHHHhcchhc-cc--------------cccc-eeecCCCCCceEEeeccceE-EEEEEEEeccCCceEEEEEEeec Confidence 9999998877765 21 1233 46666544444443333333 45665556666666665544444 No 64 >protein:vir:102819 Length: 648 # NCBI annotation: tail sheath protein # Family: family:all:2449 # MgeID: mge:1610 # MgeName: YS40 # Cross-refs: genbank:acc:YP_874082;genbank:gi:118197689;genbank:GeneID:4496011 Probab=91.04 E-value=0.018 Score=30.18 Aligned_cols=452 Identities=12% Similarity=0.022 Sum_probs=196.3 Q ss_pred CCcCc--C------ceeEEeeccc-ccccccccccceEEEecccccccccCccceEEecCHHHHHhhcCCCcHHHHHHHH Q lcl|NC_013597. 1 MALSI--S------HIVNVQLNTV-PKSAARKSFGIVALFTPEAGQAFADEKTRYVYVENQRDVEQLFGTNSETAKAAQP 71 (502) Q Consensus 1 Msip~--s------~iV~V~i~~~-~~~~~~~~f~~~lil~~~~~~~~~~~~~r~~~y~s~~~v~~~fg~~s~ey~aA~~ 71 (502) |++.+ . .=|-|..--+ ..++....=+...|+|....-|+ ......+|.++...-||.. +.-.|... T Consensus 1 ma~~~yf~~~~~~~PGVyvee~~sg~~~i~gv~tsva~fvG~a~~Gp~----~~p~~v~s~~~~~~~fggg-~l~~av~~ 75 (648) T protein:vir:10 1 MAISVYFDGKLIKQLGAYVKTDLSAVKQINGVGTGIVALLGLAEGGET----YKPYRLTSFAEAVSIFKGG-PLLEHIKA 75 (648) T ss_pred CeeeeeeCCCCccCCceEEEEeccccccccCCCCceEEEEEeeCCCCC----ceeEEecCHHHHHHHhcCc-cHHHHHHH Confidence 88744 1 1122222222 23455566667889988766554 2356778889999999864 56678889 Q ss_pred HhcCCCCcceEEEEEeecccccceeeeeecc------ch---hhhHHHHH-hhccccee------------------EEE Q lcl|NC_013597. 72 FFAQSPRAKQLIVARWQKSASTIEATKNTLS------GA---TLSDDLER-FKSVVNGR------------------FSL 123 (502) Q Consensus 72 ~F~q~p~P~~l~igr~~~~~~~~~~~~~~~~------~~---~~~~~~~~-~~~~~~g~------------------~~i 123 (502) ||.+- -+++|+-|.... ..+.++.+.+. |. .+...... -..+ .+. +++ T Consensus 76 ~F~nG--g~~~~~vRv~~~-~~a~~~~~~~~~~a~~~g~~gn~i~~~v~~~~~~~-~~~~~l~v~~~~~~~~~d~~v~~i 151 (648) T protein:vir:10 76 AFIGG--AGEVVAVRIGNP-TTASVSIPVAQNTSDTSPANLNFVSYEASTRSNQI-YVSFDLDENFTSANEADDTIIFTI 151 (648) T ss_pred HHhCC--CcEEEEEEcCCC-cccceecceeEEeecccCCCCCceEEEEEEcCCCc-CceeEEEEEecCCCcccceeEEEe Confidence 99653 355555554321 11111110000 00 00000000 0000 011 111 Q ss_pred E-----ecCccccccccccccc-------------------------cchhhHHHHHHhhhcccc-cceeEE-------- Q lcl|NC_013597. 124 T-----IGGDVKKVDGLSFARL-------------------------ADFNAVATKIQEKLTTLS-VAVSIA-------- 164 (502) Q Consensus 124 t-----i~g~~~~~~~i~~s~~-------------------------ts~~~vA~~i~aal~~a~-~~~tv~-------- 164 (502) . -.|+....+ ...... .+.+..+..+ ..+.... .-..+. T Consensus 152 ~~~~~~y~gt~~~~t-~~v~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~v-~~~~~~~~~~~~~~~~~~s~~~ 229 (648) T protein:vir:10 152 YQKHPDFSVTRETFT-FPRKFTTPTVLVKRGSTLFFVDRSIVNAALAAGPAFQTALI-NLLKEQLQPTDVVQIFDASDTN 229 (648) T ss_pred ccCCCcccccceecc-ccccccccccccccccceeecCccchhhhhccCccchhhhh-hchhhhhhhhhhheeccccccc Confidence 0 001110000 000000 0000000000 0000000 000000 Q ss_pred -EecccceeeEee---------------ecccccc-----cceeeeeeccccchhh--------------------hhhh Q lcl|NC_013597. 165 -YDETGNRFIVSA---------------NVAGEDK-----KTEIDYAIDEGGEGEY--------------------IGAL 203 (502) Q Consensus 165 -~~~~~~~f~~~s---------------~ttG~~~-----~v~~~~a~~~~~t~t~--------------------~aa~ 203 (502) .|.......... ...|+.. ..-+....++..+... ..++ T Consensus 230 ~~d~~~~~~~~~a~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~~tp~~~~~~~~~~~~~~~~~~~~~v~~~~~~~l 309 (648) T protein:vir:10 230 PVDIPLGLFVYEVLYGGLFGFTKSRLVKTSFGTVDDLLSNPLLFNLSATPFFDGSDYQDYTSLSDPANWFAKDAYTINHL 309 (648) T ss_pred ccccccccccccccchhhhcCCcchhhhhhhccccccccccceecccccccccccceeeeeccccccceeeeeccchhhc Confidence 000000000000 0000000 0000000000000000 0000 Q ss_pred hhhcccccc--eeee---ecccccc------------ccCHHHHHHHHHhccCceeEEEE----------ecCCC--hhH Q lcl|NC_013597. 204 LKLENGQAS--RKVG---KNSVSLK------------KETLGEALFNVAEVNNTWYGFTV----------AAQLT--DSE 254 (502) Q Consensus 204 l~~t~~~~~--~~v~---v~~~~~~------------~et~~~al~al~~~~~~w~~~~~----------~~~~~--~~~ 254 (502) ...+..+.. ..+. -...|.. ..+..++++.+++....|- +- .+..+ ..- T Consensus 310 ~~~~~~p~~~~~~~t~L~GGtdG~~p~s~~~~~~~~~~~d~~d~l~~~~~~~~~~i--vp~~~~~~~~~~~~~lt~~q~i 387 (648) T protein:vir:10 310 VDTTINPHILATRIFSLSGGTNGDDGTGYYQTAVSNYINIWSQGLATLEEEEVNFV--IPAYKFTNVTQLNDRLTIFKGI 387 (648) T ss_pred ccccccCcccccccceecccccCCCcccccccccccchhhHHHHhhhccCCCceEE--EeecccccccccccccCCccch Confidence 000000000 0010 0111222 2335677777766543332 21 01111 222 Q ss_pred HHHHHHHHhhcC---------CEEEEEecCchhcccchhHHHHHHHHccCCceEE----------------EecCC---- Q lcl|NC_013597. 255 VEAAAKYAQANT---------KLFGANVIRAEQIEWSADNIYKKLYDAGLDHTLA----------------MFDKN---- 305 (502) Q Consensus 255 ~~a~a~w~~a~~---------~~~~~~~~~~~~~~~~~~~i~~~l~~~~~~~t~~----------------~y~~~---- 305 (502) +.++-.|+...+ .+++.....+........-+.... ..+..|... .|+++ T Consensus 388 ~a~a~shv~~~s~~~~~~~r~~~~~~vg~~~~es~~~se~~~~~~-~~~~~~a~~~~~d~~~~~~~~~~~~~~~~~G~~~ 466 (648) T protein:vir:10 388 ASTFLSHVQTMSQVNRRKARVGVFGLPAPSPNESVTASEYLYNRN-ILNTISAMFGGTDRAQAVVFPFYSNVFNDEGKVE 466 (648) T ss_pred HHHHHHHHHHhhhccccccccCeEEEeCCCCchhHHHHHHHhhhh-cccccceeeeecCCceEEeecccceeECCCCcEE Confidence 333335665432 134433222111100000011000 011111111 11111 Q ss_pred ---ccchHHHHHHHHHhcCCCCCCceeeEeeeecCccc--c-CCCCHHHHHHHHhCCceEEEEEcCce-----EEecCEe Q lcl|NC_013597. 306 ---DMYPVSSALARLLSTNFAANNSTLTLKFKQQPTIT--A-DEITATEFAKAKRLGINVYTYFDDVA-----MIAEGTV 374 (502) Q Consensus 306 ---~~~~~aa~~g~~as~n~~~~~g~~T~~fk~~~Gv~--~-~~lt~t~~~~l~~~~~n~y~~~~~~~-----~~~~G~~ 374 (502) ..|.++++.|..+.+++...+ -||.+.++. + ..++++|++.|..+|++++....+.. ..-.|.+ T Consensus 467 ~~p~~~~Aa~VAGl~a~l~~~~s~-----T~k~i~~~~id~~~~~t~~qld~L~~~Gv~~ie~~~~~~~~~~~rvv~gIT 541 (648) T protein:vir:10 467 LLGGEFFASYVAGMHANREPQDSI-----TFLPISGIGAEPLYNWTYTQKDDLISNRVLFVEKVKTSFGGIVYRIHHNPT 541 (648) T ss_pred ecchhhHHHHHHhhhhccccccCc-----ccceeeccccccccCCCHHHHHHHhcCCcEEEEEecCCcceeeEEEeccce Confidence 124467888888887665543 466665443 3 47899999999999999998875421 2346777 Q ss_pred ecCe-------ehhHHHHHHHHHHHHHHHHHHHHHhcCCCCccCHHHHHHHHHHHHHHHHHHHHcCccccccccCccccc Q lcl|NC_013597. 375 IGGK-------FADEIVILDWFVDAVQKEVFARLYKSPTKIPLTDKGQAILIAAVEKVCLEGINNGAFAPGKWTGAGFGN 447 (502) Q Consensus 375 ~~G~-------~iD~~~~~dwl~~~iq~~l~~~l~~~~~kiPyt~~G~~~l~~~v~~vl~~a~~~G~I~~g~~~~~~~g~ 447 (502) ..+. -|=.++-.|.+...++..+.+.++- | |=++.....|++.+.+-|.+-++.+.|.+ ... T Consensus 542 T~~~~~~~~~~eisv~ri~D~l~~~vr~~l~~~fIG---~-~n~~~~~~~ik~~i~~~L~~~~~~~~I~~-y~~------ 610 (648) T protein:vir:10 542 TWLGPVTQGFQEFVLRRIDDFLQSYVYKNLQEQFIG---R-KSYGRKTENDIKVYTEALLSNLVGKQIVA-YKD------ 610 (648) T ss_pred eecCCCCcceeeeeeeehhhHHHHHHHHHHhhhcCc---c-cccHHHHHHHHHHHHHHHhhHhhcCcccC-ccc------ Confidence 6652 4667888999999999988776533 2 55777899999999999888888777752 111 Q ss_pred cccccccccceEEEcCchhcCCHHHHhhcccCceEEEEEECceEEEEEEEEEEeC Q lcl|NC_013597. 448 LSTGDYLDKGFYVWAAPMDTLSDSDRQARRATPIQTAVKLAGAIHSSDVIVNYNR 502 (502) Q Consensus 448 ~~~~~~~~~gy~v~~~~~~~~s~~dra~R~~~~i~~~~~~aGaIh~v~i~~~v~~ 502 (502) ..+++.. ..|| . .|.|.+....+|++|.+++.|.- T Consensus 611 ----------~~v~~~~-----~~~v---v--~V~~~v~Pv~~i~~I~vti~it~ 645 (648) T protein:vir:10 611 ----------VKVTSNE-----DKTV---Y--YVEFFYQPVTEIKFILVTMKVTF 645 (648) T ss_pred ----------ceEEEEe-----cCCE---E--EEEEEEEecceeeEEEEEEEEEe Confidence 1132210 1123 2 68999999999999999888877 No 65 >protein:vir:78782 Length: 370 # NCBI annotation: putative tail sheath protein # Family: family:all:669 # MgeID: mge:1857 # MgeName: phiO18P # Cross-refs: genbank:acc:YP_001285652;genbank:gi:148727158;genbank:GeneID:5220132 Probab=89.39 E-value=0.027 Score=29.22 Aligned_cols=332 Identities=12% Similarity=0.048 Sum_probs=146.6 Q ss_pred eeEEEEecCccccccccccccccchhhHHHHHHhhhcccccceeEEE---ecccceeeEeeecccccccceeeeeecccc Q lcl|NC_013597. 119 GRFSLTIGGDVKKVDGLSFARLADFNAVATKIQEKLTTLSVAVSIAY---DETGNRFIVSANVAGEDKKTEIDYAIDEGG 195 (502) Q Consensus 119 g~~~iti~g~~~~~~~i~~s~~ts~~~vA~~i~aal~~a~~~~tv~~---~~~~~~f~~~s~ttG~~~~v~~~~a~~~~~ 195 (502) =|=+++ +..+|+... .+.. ..+.+-| .+......+.-++.-+-..+ . +. T Consensus 1 ~~~~v~-------vn~~n~~~g------------~~~~--~er~~lfig~~~~~~g~~~~~~~~sdld~~-l------~~ 52 (370) T protein:vir:78 1 MWPYVQ-------IYNLNQMQG------------PVTE--VERHLLFIGSAASNTGKLLSLNAQSDFDQL-L------GA 52 (370) T ss_pred CCceEE-------EeeccccCC------------CcCc--cceeEEEEecccccccceEeecCccCHHHh-c------CC Confidence 111111 112221110 0000 0001111 00000000111111111000 0 00 Q ss_pred chhhhhhhhhhcc-cccceeeeeccccccccCHHHHHHHHHhccCceeEEEEecCC-ChhHHHHHHHHHhh---c--CCE Q lcl|NC_013597. 196 EGEYIGALLKLEN-GQASRKVGKNSVSLKKETLGEALFNVAEVNNTWYGFTVAAQL-TDSEVEAAAKYAQA---N--TKL 268 (502) Q Consensus 196 t~t~~aa~l~~t~-~~~~~~v~v~~~~~~~et~~~al~al~~~~~~w~~~~~~~~~-~~~~~~a~a~w~~a---~--~~~ 268 (502) ....+...+...+ +.+..--....+....++..+|+..+. ....+.+..+++.. +.+++.++...++. . ..+ T Consensus 53 ~ds~lk~~v~aa~~naG~~~~~~~~p~~~~~d~~~Av~~a~-~~~s~E~V~v~~~~s~~a~~~a~~~~a~el~n~~~Rpv 131 (370) T protein:vir:78 53 ADSELKANLLAARDNAGQNWSAAAYVLPTDKPWLDAARDAQ-QTQSFEGVVVLGQEWHQAAINAAHALNQELIAKWGRWQ 131 (370) T ss_pred cChhHHHHHHHHHhCCCCceEEEEEEecCchhHHHHHHHHH-hhCCccEEEEecCcchHHHHHHHHHHHHHHHHhcCCeE Confidence 1111222222222 122111111122233445666665553 34455555555543 35777776665553 1 234 Q ss_pred EEEE-ecCchhcccchhHHHHHHHH----ccCCc--eEEEecCCccchHHHHHHHHH--hcCCCCCCceeeE-eeee--- Q lcl|NC_013597. 269 FGAN-VIRAEQIEWSADNIYKKLYD----AGLDH--TLAMFDKNDMYPVSSALARLL--STNFAANNSTLTL-KFKQ--- 335 (502) Q Consensus 269 ~~~~-~~~~~~~~~~~~~i~~~l~~----~~~~~--t~~~y~~~~~~~~aa~~g~~a--s~n~~~~~g~~T~-~fk~--- 335 (502) |+.. ..+.+.-. .-++....+++ -...+ .++.+|..+ ...++||++ ++.....++++.- .-+. T Consensus 132 ~file~~~~~~~e-~w~~y~~~l~al~~gia~~~V~vvp~~~g~~---~G~~aGRL~naavsVadsP~Rv~tG~l~gl~~ 207 (370) T protein:vir:78 132 FMLLAVPAIADEQ-DWATYEAELATLQDGIAASSVSLIPQLWPTL---AGAYAGRLCNRAVSIADSPCRVKTGALVGLGN 207 (370) T ss_pred EEEEeecCCCCcC-CHHHHHHHHHHhhhccccccceEEeeecccc---HHHHHHHHhcCeeeecccceeeeccccccccc Confidence 4433 22222111 11122222222 12233 344456432 567788863 3333333433211 1111 Q ss_pred cC-ccccCCCCHHHHHHHHhCCceEEEEEcCce--EEecCEeec---CeehhHHHHHHHHHHHHHHHH--HHHHHhcCCC Q lcl|NC_013597. 336 QP-TITADEITATEFAKAKRLGINVYTYFDDVA--MIAEGTVIG---GKFADEIVILDWFVDAVQKEV--FARLYKSPTK 407 (502) Q Consensus 336 ~~-Gv~~~~lt~t~~~~l~~~~~n~y~~~~~~~--~~~~G~~~~---G~~iD~~~~~dwl~~~iq~~l--~~~l~~~~~k 407 (502) +| .-....++...+++|+++|+.+...|.|.. ++.+|+|+. |+|=-.-+.+.+-|..-+.++ ...+.. .+ T Consensus 208 ~p~d~~~~~l~~a~l~aLd~agy~vp~~Y~gy~G~Y~~d~~tl~~~gsDYq~ie~~RVvdKa~R~vR~~ai~~i~D--~~ 285 (370) T protein:vir:78 208 KPVGKDGIPLPLATLQTLEANRYSVPMWYPDYDGIYWADGRTLDAEGGDYQVIENLRIAYKVARRMRLRAIARIGD--RS 285 (370) T ss_pred cccccCCcccCHHHHHHHHhCCCeEEEeeCCCCceEEeCceEeccCCCChhhhhhhhHHHHHHHHHHHHHHHHhCC--cc Confidence 11 012234788999999999999999998853 556899985 455555555555555544443 232211 12 Q ss_pred CccCHHHHHHHHHHHHHHHHHHHHcCccccccccCccccccccccccccceEEEcCchhcCCHHHHhhcccCceEEEEEE Q lcl|NC_013597. 408 IPLTDKGQAILIAAVEKVCLEGINNGAFAPGKWTGAGFGNLSTGDYLDKGFYVWAAPMDTLSDSDRQARRATPIQTAVKL 487 (502) Q Consensus 408 iPyt~~G~~~l~~~v~~vl~~a~~~G~I~~g~~~~~~~g~~~~~~~~~~gy~v~~~~~~~~s~~dra~R~~~~i~~~~~~ 487 (502) +==+...++..+......|++..+.+-|.. + ..+| +|..|.-.++..+-.+.++. .|.+..+. T Consensus 286 lnst~gsia~~~~~~~~~L~ema~s~~i~~-~--------------~fpg-eI~~p~d~Di~i~w~s~~~v-~I~~~v~P 348 (370) T protein:vir:78 286 FNSTPGSTAAAITYFGKDLREMAKSTTING-Q--------------PFPG-DIASPQDGDIRIQWVAKNLV-SVFVVVRT 348 (370) T ss_pred cCCCCcchhHHHHHHHhhHHHHHhhhhhcc-c--------------ccce-eEeccCCCcceEEeeccceE-EEEEEEEe Confidence 222335688888899999999999888862 1 1233 46655533444443344433 46666666 Q ss_pred CceEEEEEEEEEEeC Q lcl|NC_013597. 488 AGAIHSSDVIVNYNR 502 (502) Q Consensus 488 aGaIh~v~i~~~v~~ 502 (502) =|.-..+++.+-++= T Consensus 349 ~~~pk~Itv~I~LDl 363 (370) T protein:vir:78 349 VDCPKGITVNIMLDL 363 (370) T ss_pred ccCCceEEEEEEEee Confidence 666556655554433 No 66 >protein:vir:276 Length: 369 # NCBI annotation: putative tail sheath protein # Family: family:all:669 # MgeID: mge:7 # MgeName: K139 # Cross-refs: genbank:acc:NP_536656;genbank:gi:17975134;genbank:GeneID:929090 Probab=55.55 E-value=0.48 Score=22.35 Aligned_cols=328 Identities=10% Similarity=0.049 Sum_probs=141.0 Q ss_pred EEEEecCccccccccccccccchhhHHHHHHhhhcccccceeEEEecc-----cceeeEeeecccccccceeeeeecccc Q lcl|NC_013597. 121 FSLTIGGDVKKVDGLSFARLADFNAVATKIQEKLTTLSVAVSIAYDET-----GNRFIVSANVAGEDKKTEIDYAIDEGG 195 (502) Q Consensus 121 ~~iti~g~~~~~~~i~~s~~ts~~~vA~~i~aal~~a~~~~tv~~~~~-----~~~f~~~s~ttG~~~~v~~~~a~~~~~ 195 (502) |. .. .+++..+|+. |..+.. ..+.+-|--. .....+.-+++-+-..+. + ... T Consensus 1 m~--~~--~V~in~~n~~------------qg~~~~--ver~~lfig~g~~~~~~g~~~~~~~~sdld~~l-g----~~d 57 (369) T protein:vir:27 1 MA--WP--TVIIKILNLM------------NGPIAD--IECHFLFVIRGTVSGEVRNLIMVDSTSDLDDVL-A----EAS 57 (369) T ss_pred CC--CC--ceEEeccccc------------CCCccc--ccceEEEEEeccccccccceEEecCccchHhhc-C----CcC Confidence 11 10 0111112211 111110 0001111100 000001111111111110 0 011 Q ss_pred chhhhhhhhhhcc-cccceeeeeccccccccCHHHHHHHHHhccCceeEEEEecC-CChhHHHHHHHHHhh---c--CCE Q lcl|NC_013597. 196 EGEYIGALLKLEN-GQASRKVGKNSVSLKKETLGEALFNVAEVNNTWYGFTVAAQ-LTDSEVEAAAKYAQA---N--TKL 268 (502) Q Consensus 196 t~t~~aa~l~~t~-~~~~~~v~v~~~~~~~et~~~al~al~~~~~~w~~~~~~~~-~~~~~~~a~a~w~~a---~--~~~ 268 (502) ..+...+...+ +.+..--.........++..+|+..... .-.+.+..++.. .+.+++.++.+..+. + ..+ T Consensus 58 --s~lk~~v~aa~~naG~~w~a~~~p~~~~~~~~~Av~~a~~-~~s~E~V~v~~p~t~~a~i~aaq~~a~el~~~~~R~v 134 (369) T protein:vir:27 58 --AEGLAIVKAAQLNGKQAWTAGVMILSEEDNWQDAVKKANE-VSSFEFVVLGFDAETKAMIEDAITLRTELKNSLGREV 134 (369) T ss_pred --hhHHHHHHHHHhCCCCceEEEEEEeCCchhHHHHHHhhhh-hCCccEEEEecCcccHHHHHHHHHHHHHHHHhcCCeE Confidence 11222111111 1111111111223344556666665432 344554555554 334666666555543 2 234 Q ss_pred EEEEe-cC--chhcccc-hhHHHHHH----HHccCCceEE--EecCCccchHHHHHHHHH--hcCCCCCCceeeEeeeec Q lcl|NC_013597. 269 FGANV-IR--AEQIEWS-ADNIYKKL----YDAGLDHTLA--MFDKNDMYPVSSALARLL--STNFAANNSTLTLKFKQQ 336 (502) Q Consensus 269 ~~~~~-~~--~~~~~~~-~~~i~~~l----~~~~~~~t~~--~y~~~~~~~~aa~~g~~a--s~n~~~~~g~~T~~fk~~ 336 (502) |+..- .. .+..++. =++....+ +.-...++.+ .+|...+ ....++||++ ++-....++++-- -.+ T Consensus 135 ffi~e~~~~~~~~~~~e~w~dy~a~l~al~~g~a~~~V~vv~~~~~~gn-~~G~~aGRl~n~aVsIadsp~RVkt--G~l 211 (369) T protein:vir:27 135 GVLCQLPAINNDPTNGQTWSEWLADTVDIPKDVASEYISVVPNVHAAGD-TLGKYAGRLANKEVSIADSPARVQT--GSV 211 (369) T ss_pred EEEEeccccCCCccccCCHHHHHHHHHHHhhccCcccceeeeeeccccc-hHHHHHHHHHhcccchhcCcceeee--ccc Confidence 44432 11 1111111 12222222 2223444443 3453222 2455678864 4444445555411 123 Q ss_pred Cccc-----cC--CCCHHHHHHHHhCCceEEEEEcCc-e-EEecCEeec---CeehhHHHHHHHHHHHHHHH--HHHHHH Q lcl|NC_013597. 337 PTIT-----AD--EITATEFAKAKRLGINVYTYFDDV-A-MIAEGTVIG---GKFADEIVILDWFVDAVQKE--VFARLY 402 (502) Q Consensus 337 ~Gv~-----~~--~lt~t~~~~l~~~~~n~y~~~~~~-~-~~~~G~~~~---G~~iD~~~~~dwl~~~iq~~--l~~~l~ 402 (502) .|+. ++ .++.+.+.+|+++|+.+...|.|. + ++.+|+++. |+|=-.-+.+.+-|..=+.+ ....+ T Consensus 212 ~g~~~~p~d~~g~~l~~a~l~aLd~agysvp~~Y~gy~G~Yw~d~~tl~~~gsDYq~iE~~RVvdKa~R~vR~~Ai~~i- 290 (369) T protein:vir:27 212 LGNTELMKDKAGKALDLATLKALESNRIAVPMWYPDYPGQYWTTGRTLDVPGGDYQDIRHIRVAMKAARKVRIRAIARI- 290 (369) T ss_pred ccccccccCCCCcccCHHHHHHHHhCCCeEEEeeCCCCceEEeCceEeccCCCCeehhhhhhHHHHHHHHHHHHHHHHh- Confidence 3322 11 277899999999999999999885 3 556899985 45544445555555544444 33333 Q ss_pred hcCCCCccCHHHHHHHHHHHHHHHHHHHHcCccccccccCccccccccccccccceEEEcCchhcCCHHHHhhcccCceE Q lcl|NC_013597. 403 KSPTKIPLTDKGQAILIAAVEKVCLEGINNGAFAPGKWTGAGFGNLSTGDYLDKGFYVWAAPMDTLSDSDRQARRATPIQ 482 (502) Q Consensus 403 ~~~~kiPyt~~G~~~l~~~v~~vl~~a~~~G~I~~g~~~~~~~g~~~~~~~~~~gy~v~~~~~~~~s~~dra~R~~~~i~ 482 (502) . ...+.-+..+++..+..+..+|++..+-+ -||+ |..|...++.-+ ...|.--.|. T Consensus 291 ~-Dr~lnstp~sia~~~~~~~~pLr~M~ks~--fpge--------------------i~~P~d~dI~i~-w~~k~~V~I~ 346 (369) T protein:vir:27 291 A-DRTLNSTPQSIAAAKLYFTQDLRTMALTG--VPGE--------------------IYPPEDEDIQIK-WVNSTDVEIY 346 (369) T ss_pred c-CcccccChhHHHHHHHHHhhHHHHHHhhc--CCeE--------------------EecCCCCceEEE-eeccceEEEE Confidence 2 34577888999999999999999987653 3554 344433333211 1112222333 Q ss_pred EEEEECceEEEEEEEEEEeC Q lcl|NC_013597. 483 TAVKLAGAIHSSDVIVNYNR 502 (502) Q Consensus 483 ~~~~~aGaIh~v~i~~~v~~ 502 (502) +..+.=+.=..+++++-++- T Consensus 347 ~~vrP~~~pk~it~~I~ldl 366 (369) T protein:vir:27 347 MSVQPYECPVKITIAISVKQ 366 (369) T ss_pred EEEeeccCCceEEEEEEEec Confidence 33333344445555555555 No 67 >protein:vir:3751 Length: 376 # NCBI annotation: orf23 # Family: family:all:669 # MgeID: mge:79 # MgeName: HP1 # Cross-refs: genbank:acc:NP_043492;genbank:gi:9628627;genbank:GeneID:1261129 Probab=30.27 E-value=1.6 Score=19.48 Aligned_cols=315 Identities=11% Similarity=0.075 Sum_probs=143.3 Q ss_pred ccccceeEEEecc----------cceeeEeeec-ccccccceeeeeec----cccchhhhhhhhhhccccc-cee-eeec Q lcl|NC_013597. 156 TLSVAVSIAYDET----------GNRFIVSANV-AGEDKKTEIDYAID----EGGEGEYIGALLKLENGQA-SRK-VGKN 218 (502) Q Consensus 156 ~a~~~~tv~~~~~----------~~~f~~~s~t-tG~~~~v~~~~a~~----~~~t~t~~aa~l~~t~~~~-~~~-v~v~ 218 (502) ..+ .|..+.. -.+|-+.... ...++...+..-.+ .+.....+...+...+..+ ..- ..+. T Consensus 1 ~~~---~v~vn~ln~~qg~~~~ver~~lfig~~~~~~~~~~~~~~~sdld~~lg~~ds~lk~~v~aa~~naG~~w~a~~~ 77 (376) T protein:vir:37 1 MFP---SVQINALNQLSGETKEIERHALFVGVGTTNQGKLLALTPDSDFDKVFGETDTDLKKQVRAAMLNAGQNWFAHVY 77 (376) T ss_pred CCC---eEEEeeeeccCCCcccccceEEEeeccccccCceEEecCCCChHHhhCCCchhHHHHHHHHHhCCCCceEEEEE Confidence 111 1222111 1123222211 11112212211110 0111122222222222221 111 1222 Q ss_pred cccccccCHHHHHHHHHhccCceeEEEEecC--CChhHHHHHHHHHh---hcC--CEEEEE-ecCchhcccch---hHHH Q lcl|NC_013597. 219 SVSLKKETLGEALFNVAEVNNTWYGFTVAAQ--LTDSEVEAAAKYAQ---ANT--KLFGAN-VIRAEQIEWSA---DNIY 287 (502) Q Consensus 219 ~~~~~~et~~~al~al~~~~~~w~~~~~~~~--~~~~~~~a~a~w~~---a~~--~~~~~~-~~~~~~~~~~~---~~i~ 287 (502) ..+.+.++..+|+..+. ..-.+.+..++.. .+.+++.++....+ .+. .+|+.. +...+...... ++.. T Consensus 78 ~p~~~~~~~~~Av~~a~-~~~s~E~V~v~~p~~t~~a~i~a~qa~a~el~~~~~R~vffile~~g~d~~~~~ge~w~~y~ 156 (376) T protein:vir:37 78 IAQEDGYDFVECVKKAN-QTASFEYCVNTRYLGVDKASIGKLQECYAELLAKFGRRTFFIQAVQGINHDQSDGETWDQYV 156 (376) T ss_pred ecCCChhhHHHHHHHHH-hhCCeeEEEEecCcchhHHHHHHHHHHHHHHHHhcCCeEEEEEeccCCCCcccccCCHHHHH Confidence 33444566777777663 3445555555553 34667666655443 322 344443 22221111111 2222 Q ss_pred HHHHH----ccCCceE--EEecCCccchHHHHHHHHH--hcCCCCCCceeeEeeeecCccc----cC-----CCCHHHHH Q lcl|NC_013597. 288 KKLYD----AGLDHTL--AMFDKNDMYPVSSALARLL--STNFAANNSTLTLKFKQQPTIT----AD-----EITATEFA 350 (502) Q Consensus 288 ~~l~~----~~~~~t~--~~y~~~~~~~~aa~~g~~a--s~n~~~~~g~~T~~fk~~~Gv~----~~-----~lt~t~~~ 350 (502) ..+.+ ....++. +.+|.+ ....++||++ ++-....+|++--- -+.|+- |. .++...+. T Consensus 157 ~~l~a~~~gia~~~V~vV~~~~gn---~~G~~aGRl~naaVsVadspgRV~tG--ai~gl~~~~~p~d~~g~el~~a~l~ 231 (376) T protein:vir:37 157 QKLTTLQQTIVADHVCLVPLLFGN---ETGVLAGRLANRAVTVADSPARVQTG--ALVSLGSANKPLDKDGNELTLAHLK 231 (376) T ss_pred HHHHHHhccccccceeeeeeeccc---hHHHHHHHHHhCCcchhcCccceeec--ccccccccccccccCCcccchHHHH Confidence 22222 2333443 334442 3567889974 44445566665211 233332 11 36889999 Q ss_pred HHHhCCceEEEEEcCc-e-EEecCEeec---Ce--ehhHHHHHHHHHHHHHHHHHHHHHhcCCCCccCHHHHHHHHHHHH Q lcl|NC_013597. 351 KAKRLGINVYTYFDDV-A-MIAEGTVIG---GK--FADEIVILDWFVDAVQKEVFARLYKSPTKIPLTDKGQAILIAAVE 423 (502) Q Consensus 351 ~l~~~~~n~y~~~~~~-~-~~~~G~~~~---G~--~iD~~~~~dwl~~~iq~~l~~~l~~~~~kiPyt~~G~~~l~~~v~ 423 (502) +|+++|+.+...|.|. + ++..|+++. |+ +|-.++=.|=...+++......+. ...+.-+..+++..+..+. T Consensus 232 aLd~arysvpr~Y~gydG~Yw~dg~tl~~~gsDYq~ie~~RVvdKa~R~vR~~Ai~~i~--Dr~lnstp~sia~~~~~~~ 309 (376) T protein:vir:37 232 SLETARYSVPMWYPDYDGYYRADGRTLDVEGGDYQVIENLRVVDKVARKVRLLAIGKIA--DRSFNSTTSSTEYHKNYFA 309 (376) T ss_pred HHHhCCCeEEEeeCCCCceEEeCCeEeccCCCCeeeehhchHHHHHHHHHHHHHHHHhc--CccccCChhHHHHHHHHHh Confidence 9999999999999885 3 456899985 34 566666666666666655555442 2347777889999999999 Q ss_pred HHHHHHHHcCcc----ccccccCccccccccccccccceEEEcCchhcCCHHHHhhcccCceEEEEEECceEEEEEEEEE Q lcl|NC_013597. 424 KVCLEGINNGAF----APGKWTGAGFGNLSTGDYLDKGFYVWAAPMDTLSDSDRQARRATPIQTAVKLAGAIHSSDVIVN 499 (502) Q Consensus 424 ~vl~~a~~~G~I----~~g~~~~~~~g~~~~~~~~~~gy~v~~~~~~~~s~~dra~R~~~~i~~~~~~aGaIh~v~i~~~ 499 (502) .+|++..+.+=| -||+..- |+-.++.-. =..|..-.|.+..+.=+.=..+++++- T Consensus 310 ~pLr~M~ks~ei~g~~fpgei~~--------------------P~d~dI~i~-w~sk~~V~I~~~vrPy~cpk~i~~~I~ 368 (376) T protein:vir:37 310 KPLRDMSKSATINGKDFPGECMP--------------------PKDDAITIV-WQSKTKVTIYIKVRPYDCPKEITANIF 368 (376) T ss_pred HHHHHHHhhhhhccccccceeec--------------------CCCCceEEE-eccCceEEEEEEEeeecCcceeEEEEE Confidence 999998766544 3343332 221111100 001111112222222222222222222 Q ss_pred EeC Q lcl|NC_013597. 500 YNR 502 (502) Q Consensus 500 v~~ 502 (502) .+= T Consensus 369 LDl 371 (376) T protein:vir:37 369 LDL 371 (376) T ss_pred Eec Confidence 222 Done!