Query         lcl|NC_019918.1_cdsid_YP_007236878.1 [gene=BN405_2-10_Ab1_orf_57] [protein=hypothetical protein] [protein_id=YP_007236878.1] [location=32436..33722]
Match_columns 428
No_of_seqs    157 out of 227
Neff          8.5
Searched_HMMs 1612
Date          Thu Nov 7 16:47:57 2013
Command       /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_57 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_57_vs_rec_db.hhr

 No Hit                             Prob E-value P-value Score   SS Cols Query HMM Template HMM
  1 protein:vir:5260 Length: 502 # 100.0  1E-121  9E-125 683.1 43.8  426     1-428 1-502 (502)
  2 protein:vir:95263 Length: 450  100.0  7E-118  5E-121 662.7 43.5  422     3-428 1-449 (450)
  3 protein:vir:106730 Length: 501 100.0  3E-117  2E-120 659.3 42.5  426     1-428 1-500 (501)
  4 protein:vir:3636 Length: 501 # 100.0  8E-117  5E-120 657.1 42.7  426     1-428 1-500 (501)
  5 protein:vir:101576 Length: 501 100.0  6E-116  4E-119 652.1 42.1  426     1-428 1-500 (501)
  6 protein:vir:78611 Length: 501  100.0  9E-116  6E-119 651.2 41.8  426     1-428 1-500 (501)
  7 protein:vir:94073 Length: 494  100.0  8E-114  5E-117 640.6 41.0  426     1-428 1-494 (494)
  8 protein:vir:96104 Length: 504  100.0  1E-113  9E-117 639.3 39.7  422     1-427 1-504 (504)
  9 protein:vir:99586 Length: 507  100.0  9E-114  5E-117 640.5 38.0  425     1-427 1-507 (507)
 10 protein:vir:107720 Length: 515 100.0  2E-104  1E-107 589.6 39.3  422     1-427 1-515 (515)
 11 protein:vir:80052 Length: 331  100.0   3E-94 1.8E-97 533.3 38.0  321     3-428 1-331 (331)
 12 protein:vir:3165 Length: 426 # 100.0 8.8E-76 5.5E-79 432.1 30.2  400     1-428 1-426 (426)
 13 protein:vir:102957 Length: 437  99.4 6.3E-11 3.9E-14  76.5 34.1  396     1-427 1-437 (437)
 14 protein:vir:79092 Length: 477   99.3 2.1E-10 1.3E-13  73.6 31.3  415     1-428 1-467 (477)
 15 protein:vir:107865 Length: 477  99.3   2E-10 1.3E-13  73.7 30.8  411     1-428 1-467 (477)
 16 protein:vir:6079 Length: 396 #  99.3 1.7E-10   1E-13  74.2 29.7  360     1-428 1-383 (396)
 17 protein:vir:1845 Length: 392 #  99.2 8.4E-11 5.2E-14  75.8 26.8  358     1-428 1-380 (392)
 18 protein:vir:1996 Length: 495 #  99.2 4.2E-10 2.6E-13  72.0 30.2  410     1-428 1-495 (495)
 19 protein:vir:2035 Length: 396 #  99.2 9.5E-11 5.9E-14  75.5 26.5  360     1-428 1-383 (396)
 20 protein:vir:5711 Length: 396 #  99.2 1.6E-10   1E-13  74.3 27.6  361     1-428 1-383 (396)
 21 protein:vir:4517 Length: 498 #  99.2   6E-10 3.7E-13  71.1 29.9  407     1-416 1-498 (498)
 22 protein:vir:98553 Length: 395   99.2 2.2E-10 1.3E-13  73.6 27.5  358     1-428 1-383 (395)
 23 protein:vir:105470 Length: 451  99.2 6.8E-10 4.2E-13  70.8 34.3  403     1-427 1-451 (451)
 24 protein:vir:489 Length: 498 #   99.2 3.5E-10 2.2E-13  72.4 27.7  411     1-428 1-491 (498)
 25 protein:vir:10336 Length: 386   99.2 4.6E-10 2.9E-13  71.8 27.2  353     1-428 1-379 (386)
 26 protein:vir:4463 Length: 498 #  99.1 1.3E-09 8.3E-13  69.2 27.6  414     1-428 1-491 (498)
 27 protein:vir:78206 Length: 390   99.1 8.1E-10   5E-13  70.4 25.6  352     1-428 1-378 (390)
 28 protein:vir:103993 Length: 390  99.1 8.1E-10   5E-13  70.4 25.6  352     1-428 1-378 (390)
 29 protein:vir:96740 Length: 388   99.1 2.7E-09 1.6E-12  67.6 30.4  348     1-428 1-377 (388)
 30 protein:vir:79181 Length: 390   99.0 2.3E-09 1.4E-12  68.0 26.8  350     1-428 1-378 (390)
 31 protein:vir:1172 Length: 391 #  99.0 1.5E-09 9.2E-13  69.0 25.5  355     1-428 1-379 (391)
 32 protein:vir:79141 Length: 391   99.0   4E-09 2.5E-12  66.6 25.1  350     1-428 1-378 (391)
 33 protein:vir:99306 Length: 587   98.9 2.1E-08 1.3E-11  62.6 36.4  409     1-428 1-582 (587)
 34 protein:vir:96586 Length: 587   98.9 2.6E-08 1.6E-11  62.2 30.4  404     1-428 111-582 (587)
 35 protein:vir:107310 Length: 581  98.8 3.5E-08 2.2E-11  61.5 24.9  394     1-428 103-566 (581)
 36 protein:vir:100323 Length: 393  98.8 4.5E-08 2.8E-11  60.9 27.2  351     1-428 1-380 (393)
 37 protein:vir:78986 Length: 436   98.7   7E-08 4.4E-11  59.8 35.1  384     1-427 3-436 (436)
 38 protein:vir:104858 Length: 729  98.7 7.5E-08 4.6E-11  59.7 28.3  389     1-428 277-717 (729)
 39 protein:vir:95741 Length: 587   98.7 8.9E-08 5.5E-11  59.2 36.0  409     1-428 1-582 (587)
 40 protein:vir:80488 Length: 562   98.6 2.5E-07 1.5E-10  56.8 38.2  409     1-428 1-557 (562)
 41 protein:vir:80779 Length: 569   98.5 5.4E-07 3.4E-10  54.9 37.1  409     1-428 1-564 (569)
 42 protein:vir:102359 Length: 356  98.4 6.2E-07 3.8E-10  54.6 25.5  323    67-426 1-356 (356)
 43 protein:vir:104477 Length: 749  98.4 6.3E-07 3.9E-10  54.6 30.2  397     1-428 272-739 (749)
 44 protein:vir:7653 Length: 581 #  98.4   9E-07 5.6E-10  53.7 29.5  334     1-428 198-566 (581)
 45 protein:vir:63742 Length: 562   98.3 1.5E-06 9.5E-10  52.5 38.4  409     1-428 1-557 (562)
 46 protein:vir:5833 Length: 742 #  98.2 2.1E-06 1.3E-09  51.7 27.8  387     1-428 308-736 (742)
 47 protein:vir:102819 Length: 648  98.1   5E-06 3.1E-09  49.6 20.3  408     1-428 137-645 (648)
 48 protein:vir:106984 Length: 743  98.1 5.5E-06 3.4E-09  49.4 30.5  414     1-428 216-732 (743)
 49 protein:vir:80984 Length: 666   98.0 7.5E-06 4.7E-09  48.7 24.7  412     1-428 153-651 (666)
 50 protein:vir:6894 Length: 660 #  98.0   8E-06 4.9E-09  48.5 24.8  401     1-428 171-646 (660)
 51 protein:vir:103456 Length: 659  97.9 9.8E-06 6.1E-09  48.0 28.1  408     1-428 153-646 (659)
 52 protein:vir:98824 Length: 774   97.9 1.1E-05 6.8E-09  47.8 26.0  411     1-428 273-767 (774)
 53 protein:vir:108052 Length: 660  97.8 1.7E-05 1.1E-08  46.7 29.5  413     1-428 120-647 (660)
 54 protein:vir:6594 Length: 666 #  97.8 1.9E-05 1.2E-08  46.4 34.4  416     1-428 1-651 (666)
 55 protein:vir:100829 Length: 607  97.6 3.5E-05 2.1E-08  45.0 28.6  388     1-428 138-596 (607)
 56 protein:vir:79798 Length: 717   97.6 4.4E-05 2.8E-08  44.4 23.5  390     1-428 287-717 (717)
 57 protein:vir:7206 Length: 659 #  97.5 4.9E-05   3E-08  44.2 36.0  416     1-428 1-646 (659)
 58 protein:vir:106427 Length: 679  97.2 0.00012 7.4E-08  42.1 25.6  398     1-428 182-665 (679)
 59 protein:vir:5663 Length: 671 #  97.0  0.0002 1.2E-07  40.9 36.8  416     1-428 1-661 (671)
 60 protein:vir:98263 Length: 664   96.9 0.00025 1.5E-07  40.3 35.2  416     1-428 1-650 (664)
 61 protein:vir:100539 Length: 663  96.8 0.00034 2.1E-07  39.6 34.4  417     1-428 1-648 (663)
 62 protein:vir:101804 Length: 663  96.7 0.00043 2.6E-07  39.1 30.4  385     1-428 176-648 (663)
 63 protein:vir:101187 Length: 663  96.3  0.0008 4.9E-07  37.6 36.3  417     1-428
1-648 (663) 64 protein:vir:78782 Length: 370 95.5 0.0019 1.2E-06 35.5 22.8 337 1-428 1-363 (370) 65 protein:vir:3788 Length: 376 # 92.8 0.0098 6.1E-06 31.6 28.2 331 1-428 1-371 (376) 66 protein:vir:276 Length: 369 # 83.0 0.072 4.5E-05 26.8 25.4 323 71-428 1-366 (369) 67 protein:vir:3751 Length: 376 # 81.9 0.082 5.1E-05 26.6 27.7 328 1-428 1-371 (376) No 1 >protein:vir:5260 Length: 502 # NCBI annotation: hypothetical protein # Family: family:all:396 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852765;genbank:gi:31544040;uniprot:Q7Y5T5;genbank:GeneID:2753559 Probab=100.00 E-value=1.4e-121 Score=683.14 Aligned_cols=426 Identities=20% Similarity=0.315 Sum_probs=362.5 Q ss_pred CC-CCCceEEEeeeeecccccccccceEEEEcccC-----CCccceEEeeCHHHHHhhcCCChHHHHHHHHHHhcCCccc Q lcl|NC_019918. 1 MT-VLTDVIDIQISRETAAVAQTNFNVPLFIASHT-----NFSERARVYNSLKGVAEDFGESDPTYLAAVRYFGQALKPR 74 (428) Q Consensus 1 M~-~is~iV~V~i~~~~~~~~~~~f~~~li~~~~~-----~~~~~~~~y~s~~~V~~~fg~~s~eY~aA~~~F~q~p~P~ 74 (428) |+ ||||||||+|++.+.++++++|+.+||++++. ++.+|+|.|+|+++|++|||++|||||||++||+|+|||. T Consensus 1 msip~s~ivnV~i~~~~~a~~~~~f~~~l~l~~~~~~~~~~~~~r~~~~~s~~~V~~~FG~~s~ey~aA~~yF~q~p~P~ 80 (502) T protein:vir:52 1 MALSISHIVNVQLNTVPKSAARKSFGIVALFTPEAGQAFADEKTRYVYVENQRDVEQLFGTNSETAKAAQPFFAQSPRAK 80 (502) T ss_pred CCCCccceeEEeeccccccccccccCceEEEeeccCccccCCccceEEecCHHHHHHhcCCChHHHHHHHHHhcCCCccc Confidence 99 99999999999999999999999999998764 3568999999999999999999999999999999999999 Q ss_pred EEEEEeeecccccccchhee----------------eccccc-------ccccceeeeeeecccchhhhhhhhhee---- Q lcl|NC_019918. 75 SLVIGRRQVPSATVSVSVVQ----------------EGQSYV-------LTVNGLPVSYVSHQDDTATLIATGLKA---- 127 (428) Q Consensus 75 ~l~igr~~~~~~~~~~~~~~----------------~~~~~~-------~~v~g~~~s~~~~~~~~a~~i~a~l~~---- 127 (428) +|+||||+++.....+...+ ..+... 
.++.+++++..++++++++.+++++.+ T Consensus 81 ~l~igR~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~G~l~i~i~g~~~t~~~i~lS~~ts~~~vA~~i~~~l~~~~~~ 160 (502) T protein:vir:52 81 QLIVARWQKSASTIEATKNTLSGATLSDDLERFKSVVNGRFSLTIGGDVKKVDGLSFARLADFNAVATKIQEKLTTLSVA 160 (502) T ss_pred eEEEEeccccccceeechhhhhhhhhHHhHHHhhhhcCceeEEEecceeeeeeccccccccchhHHHHHHHhhhcccccc Confidence 99999999877654332111 011122 345567888899999999999887753 Q ss_pred ---eeccc--ceEEEEeeccccceeeeeccc--c-ccccc---------cceEE---EEeeccccCHHHHHHHHHhcccC Q lcl|NC_019918. 128 ---AYDVT--PVVGVTVTDNEDGTLTVASNG--D-WSLKV---------SSNLT---MAAAPSTEGWPATITAVQGENDE 187 (428) Q Consensus 128 ---a~~~~--~~~~~~~tt~~~~~~t~as~~--~-~~~~~---------s~~~~---~~~~~aa~~~~~al~~~~~~~~~ 187 (428) .++.+ ++...+.+++....++..... . .++.. +.... ...+.+++++.++|.++.+.+++ T Consensus 161 ~tv~~d~~~~~F~i~s~ttg~~~~~~~~~a~~~~~~gt~~a~~l~l~~~~~av~v~~~~~g~~aet~~~al~a~~~~~~~ 240 (502) T protein:vir:52 161 VSIAYDETGNRFIVSANVAGEDKKTEIDYAIDEGGEGEYIGALLKLENGQASRKVGKNSVSLKKETLGEALFNVAEVNNT 240 (502) T ss_pred eEEEEecCCceEEEEeccCCCcceeEEEEeecCCcchhHHHHHhccccccceeeeeeecccccccCHHHHHHHHHhccCc Confidence 34443 455555565555554332221 1 11111 11111 23467789999999999999999 Q ss_pred ceEEEEe-cCCHHHHHHHHHHHhhhCCEEEEEecCcccccchhHHHHHHHHhcccCceEEEecCCccchhHHHHHHHHHh Q lcl|NC_019918. 188 WYALSID-SHADDDIMAVATHIEGTKKVFIGATAQANTKTSAENDIASRLVAAGFQRTALIYHPNADAQFPECAWVGYQL 266 (428) Q Consensus 188 w~~~~~~-~~~~~~~~ala~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~t~~~y~~~~~~~~~~a~~~~~~~ 266 (428) ||++.+. +.++++++++|+|+|+++|+|++++.+.+......++++++|++++|.||+++||++. 
++++++++|+++ T Consensus 241 w~~~~~a~~~~~~~~la~a~~iea~~~~f~~~~~d~~~~~~~~~~i~~~l~a~~~~~t~~~y~~~~--~~~~aa~~g~~a 318 (502) T protein:vir:52 241 WYGFTVAAQLTDSEVEAAAKYAQANTKLFGANVIRAEQIEWSADNIYKKLYDAGLDHTLAMFDKND--MYPVSSALARLL 318 (502) T ss_pred eEEEEEeecCChhHHHHHHHHHhhcCcEEEEEecCcceeccccchHHHHHHhccCceeEEEecCCc--chhHHHHHHHHH Confidence 9987665 6689999999999999999999999999888888899999999999999999999864 466666666655 Q ss_pred c----cCCCceeeeeeeecCccccCCCHHHHHHHHhCCceEEEEEcCceeeecCEecCCchhHHHHHHHHHHHHHHHHHH Q lcl|NC_019918. 267 Q----EQPGSNTWTHKALAAVDAYRLTPTESTNLKNKNVTTFERVGGVNRTFGGAMAGGEWIDVMIFVDWLEARMTERLW 342 (428) Q Consensus 267 ~----~~~g~~t~~fk~~~Gv~~~~~t~t~~~~l~~~~~n~y~~~~~~~~~~~G~~~~G~~iD~~~~~dwl~~~lq~~l~ 342 (428) + ..+|++|||||+++||+|++++.+|+++|+++|||||+++.+..++++|+|++|+|||++||+|||+++||++|+ T Consensus 319 s~~f~~~~g~iT~~fk~l~GV~~~~lt~t~~~al~~~~~N~y~~~~~~~~~~~G~~~~G~~iD~~~~~~Wl~~~lq~~l~ 398 (502) T protein:vir:52 319 STNFAANNSTLTLKFKQQPTITADEITATEFAKAKRLGINVYTYFDDVAMIAEGTVIGGKFADEIVILDWFVDAVQKEVF 398 (502) T ss_pred hcCCCcCcceeeecccccCCcccCcCCHHHHHHHHhcCceEEEEecCeeEEecCeeeCCchhhHHHHHHHHHHHHHHHHH Confidence 4 568999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHh-cCCCCcCHhHHHHHHHHHHHHHHHHHhcCceecC-----------------CceEEEeCchHhCCHHHHhcccc Q lcl|NC_019918. 
343 FRMAN-SKKIPYDAVGATILESEIRAQLNEGIRVGGLAEA-----------------PAPKVFVPDVLSMSPNMRAQRIF 404 (428) Q Consensus 343 ~ll~~-~~kip~~~~G~~~i~~~i~~~~~~~~~~G~I~~g-----------------~~~~v~~~~~~~~~~~dra~R~~ 404 (428) ++|.+ ++|||||+.|+++|+++|+++|+++++||+|+|| +||+|+.|++++|+++||++|++ T Consensus 399 ~~L~~s~~kIPy~~~G~~~l~a~i~~~l~~a~~~G~I~~G~~~~~~~g~~~~~d~~~~gy~v~~~~~~~~s~~dr~~R~~ 478 (502) T protein:vir:52 399 ARLYKSPTKIPLTDKGQAILIAAVEKVCLEGINNGAFAPGKWTGAGFGNLSTGDYLDKGFYVWAAPMDTLSDSDRQARRA 478 (502) T ss_pred HHHHhcCCCcccChhHHHHHHHHHHHHHHHHHhcCccccccccCcccceeeecccccCceEEEeCchhhCCHHHHHcccC Confidence 98865 5799999999999999999999999999999996 47999999999999999999999 Q ss_pred CCeEEEEEECceEEEEEEEEEEec Q lcl|NC_019918. 405 EGIEFEARLAGAIHFVHIRGTVTV 428 (428) Q Consensus 405 ~~i~~~~~~agaih~v~i~~~v~~ 428 (428) |+++|+|+++|+||+|+|+++|+= T Consensus 479 ~~~~~~~~~aGaIh~v~i~~nv~~ 502 (502) T protein:vir:52 479 TPIQTAVKLAGAIHSSDVIVNYNR 502 (502) T ss_pred CCeEEEEEECceEEEEEEEEEEeC Confidence 999999999999999999988888 No 2 >protein:vir:95263 Length: 450 # NCBI annotation: Phage conserved protein # Family: family:all:396 # MgeID: mge:1561 # MgeName: Felix 01 # Cross-refs: genbank:acc:NP_944896;genbank:gi:38707836;genbank:GeneID:2744049 Probab=100.00 E-value=7.4e-118 Score=662.73 Aligned_cols=422 Identities=27% Similarity=0.440 Sum_probs=369.3 Q ss_pred CCCceEEEeeeeecccccccccceEEEEcccCCCccceEEeeCHHHHHhhcCCChHHHHHHHHHHhcCCcccEEEEEeee Q lcl|NC_019918. 
3 VLTDVIDIQISRETAAVAQTNFNVPLFIASHTNFSERARVYNSLKGVAEDFGESDPTYLAAVRYFGQALKPRSLVIGRRQ 82 (428) Q Consensus 3 ~is~iV~V~i~~~~~~~~~~~f~~~li~~~~~~~~~~~~~y~s~~~V~~~fg~~s~eY~aA~~~F~q~p~P~~l~igr~~ 82 (428) --||||||+|++.++++++++|+.+||++++..++||+|.|+++++|++|||.+|||||||++||+|+|+|.+|+||||+ T Consensus 1 ~~s~iVnV~i~~~~~a~~~~~f~~~l~~~~~~~~~~r~~~yss~~~V~~~FG~~S~ey~aA~~yF~q~p~p~~l~igr~~ 80 (450) T protein:vir:95 1 MWNPIVNVDITLNTAGTTREGFGLPLFLASTDNFEERVRGYTSLTEVAEDFDENTAAYKAAKQLWSQTPKVTQLYIGRRA 80 (450) T ss_pred CCCceEEEeecccccccccccceeEEEEcCCCCCccceeeecCHHHHHHhcCCCcHHHHHHHHHHhCCCcccEEEEEeec Confidence 45999999999999999999999999999999999999999999999999999999999999999999999999999998 Q ss_pred cccccccchheeeccc--cc-------ccccceeeeeeecccchhhhhhhhheeeecccceEEEEeeccc--cceeeeec Q lcl|NC_019918. 83 VPSATVSVSVVQEGQS--YV-------LTVNGLPVSYVSHQDDTATLIATGLKAAYDVTPVVGVTVTDNE--DGTLTVAS 151 (428) Q Consensus 83 ~~~~~~~~~~~~~~~~--~~-------~~v~g~~~s~~~~~~~~a~~i~a~l~~a~~~~~~~~~~~tt~~--~~~~t~as 151 (428) ...+...+...+.... .+ .+..+++++..+++.++++.+.+++.... .....+.....+. ..+++++. T Consensus 81 ~~~t~~~~~~~~~~~~g~lt~tv~G~~~~~~~i~~s~a~s~~~va~~~~tai~~~~-~~~~~~~~~s~g~~~~~t~~~~~ 159 (450) T protein:vir:95 81 MQYTVSIPDAVTESTDYSITVAAGGGISQPYQYTAQSSDTAENVLQQFKTQIEADP-TIKDKVSVNVTGSNGSATMIIAK 159 (450) T ss_pred cchhhhhhhhhccccceeEEEEecceeeeeeEEEEEecCChhhHHHHhhhhhcccc-eeeeeeeeeeecccceeeeeeec Confidence 8765554443332222 22 23455788888999999999988886542 2222222333332 33333333 Q ss_pred ccc---ccccccceEEEEeeccccCHHHHHHHHHhcccCceEEEEecCCHHHHHHHHHHHhhhCCEEEEEecCccccc-- Q lcl|NC_019918. 152 NGD---WSLKVSSNLTMAAAPSTEGWPATITAVQGENDEWYALSIDSHADDDIMAVATHIEGTKKVFIGATAQANTKT-- 226 (428) Q Consensus 152 ~~~---~~~~~s~~~~~~~~~aa~~~~~al~~~~~~~~~w~~~~~~~~~~~~~~ala~~~~a~~~~~~~~~~~~~~~~-- 226 (428) .+. .............+.++|++.+++.++.+.+++||++.+++++++++++||+|+++++++|+++.++..... 
T Consensus 160 ~~~~~~~~l~~~~~~~~~~g~~aet~~~a~~a~~~~~~~w~~~~~~~~~~~~i~a~a~w~~a~~~~f~~~~~~~~~~~~~ 239 (450) T protein:vir:95 160 AGDNDFVKVTTTAQTVYIASTTADTASTALAAIEAYSTDWYFIAAEDRTQQFVLAMASEIQARKKIFFTANSDVTALQGT 239 (450) T ss_pred cccchhhccccccceeEecccccccHHHHHHHHHHhhCCeEEEEecCCCHHHHHHHHHHHhhcCcEEEEEcCCchhhhhh Confidence 322 122334566777788999999999999999999999999999999999999999999999999998877543 Q ss_pred --chhHHHHHHHHhcccCceEEEecCCccchhHHHHHHHHHhccCCCceeeeeeeecCcccc-------CCCHHHHHHHH Q lcl|NC_019918. 227 --SAENDIASRLVAAGFQRTALIYHPNADAQFPECAWVGYQLQEQPGSNTWTHKALAAVDAY-------RLTPTESTNLK 297 (428) Q Consensus 227 --~~~~~~~~~l~~~~~~~t~~~y~~~~~~~~~~a~~~~~~~~~~~g~~t~~fk~~~Gv~~~-------~~t~t~~~~l~ 297 (428) ...++++++|+.++|.||+++||+..+.++++++++|++++.+||++|||||+++||+|+ +|+.+|+++|+ T Consensus 240 ~~~~~~~i~~~l~~~~~~~t~~~y~~~~~~~~~~aa~~g~~~~~~~g~~T~~fk~l~Gv~~~v~~~~~~~lt~~~~~al~ 319 (450) T protein:vir:95 240 ELASANDVPAQLAKNMYTRTVCLWHHAAAEDYPEMAYIAYGAPYDAGSIAWGNAQLTGVAASLQPSNQRPLTSIQKSALD 319 (450) T ss_pred hhhcccchHHHHHhccCCeeEEEeeCCCchhHHHHHHHHHhhhcccceeeeccccccceeeeccCccccccchHHHHHHH Confidence 457899999999999999999999888899999999999999999999999999999996 58999999999 Q ss_pred hCCceEEEEEcCceeeecCEecCCchhHHHHHHHHHHHHHHHHHHHHHHhc--CCCCcCHhHHHHHHHHHHHHHHHHHhc Q lcl|NC_019918. 
298 NKNVTTFERVGGVNRTFGGAMAGGEWIDVMIFVDWLEARMTERLWFRMANS--KKIPYDAVGATILESEIRAQLNEGIRV 375 (428) Q Consensus 298 ~~~~n~y~~~~~~~~~~~G~~~~G~~iD~~~~~dwl~~~lq~~l~~ll~~~--~kip~~~~G~~~i~~~i~~~~~~~~~~ 375 (428) ++|||||+.+.+.+++++|+|++|+|||++||+|||+++||++|++||+++ +|||||+.|+++|+++|+++|+++++| T Consensus 320 ~~~~n~y~~~~~~~~~~~G~~~~G~~iD~~~~~~wl~~~iq~~l~~ll~~~~~~KiPy~~~G~~~i~a~i~~~l~~a~~~ 399 (450) T protein:vir:95 320 VRHCNFIDLDGGVPVVRRGITSGGEWIDIIRGVDWLESDLKTSLRDLLINQKGGKITYDDTGITRIRQVIETSLQRAVNR 399 (450) T ss_pred hCCcEEEEEecCceeeeCCeeeCcchhHHHHHHHHHHHHHHHHHHHHHHhcCCCCCccChhhHHHHHHHHHHHHHHHHhc Confidence 999999999999999999999999999999999999999999999999875 489999999999999999999999999 Q ss_pred CceecCCceEEEeCchHhCCHHHHhccccCCeEEEEEECceEEEEEEEEEEec Q lcl|NC_019918. 376 GGLAEAPAPKVFVPDVLSMSPNMRAQRIFEGIEFEARLAGAIHFVHIRGTVTV 428 (428) Q Consensus 376 G~I~~g~~~~v~~~~~~~~~~~dra~R~~~~i~~~~~~agaih~v~i~~~v~~ 428 (428) |+|+ +|+|+.|++++|+++||++|++|+++|+|+|+||||.++|+++|+- T Consensus 400 G~Ia---~~~V~~~~~~~~~~~dr~~R~~~~i~~~~~laGAIh~~~i~~~v~~ 449 (450) T protein:vir:95 400 NFLS---SYTVNVPKASQVALADKKARILKDVTFAGILAGAILDVDLKGTVAY 449 (450) T ss_pred Cccc---ceeEecCChHhcCHHHHhccCCCCeeEEEEEccceEEEEEEEEEEe Confidence 9997 7999999999999999999999999999999999999999999999 No 3 >protein:vir:106730 Length: 501 # NCBI annotation: gp04 # Family: family:all:396 # MgeID: mge:1599 # MgeName: Bcep1 # Cross-refs: genbank:acc:NP_944312;genbank:gi:38638611;genbank:GeneID:2657359 Probab=100.00 E-value=3.1e-117 Score=659.32 Aligned_cols=426 Identities=16% Similarity=0.192 Sum_probs=368.4 Q ss_pred CC----CCCceEEEeeeeecccccccccceEEEEcccCCCccceEEeeCHHHHHhhcCCChHHHHHHHHHHh----cCCc Q lcl|NC_019918. 
1 MT----VLTDVIDIQISRETAAVAQTNFNVPLFIASHTNFSERARVYNSLKGVAEDFGESDPTYLAAVRYFG----QALK 72 (428) Q Consensus 1 M~----~is~iV~V~i~~~~~~~~~~~f~~~li~~~~~~~~~~~~~y~s~~~V~~~fg~~s~eY~aA~~~F~----q~p~ 72 (428) || |+||||||+|+|.++++.+++|+++||+.++..|++|+|.|+++++|++|||.+|||||||++||+ |+|| T Consensus 1 m~~~~ip~s~iV~V~~~v~~~~~~~~~f~~lll~~~~~~~~~r~~~y~s~~~V~~~FG~~S~ey~aA~~yFsg~~~q~p~ 80 (501) T protein:vir:10 1 MPTTTIPIDQIVQMLPGVIGAGGAPGRLTGLVLTQDTSVQPGQLADFFQKTDVENWFGALSNEAKIADAYFPGIVNGGQL 80 (501) T ss_pred CCcCccccceEEEEeeecccCCCcccccceEEEecccCCCccceeeecCHHHHHHhcCCChHHHHHHHHHhhhhcCCCcc Confidence 99 599999999999999999999999999999999999999999999999999999999999999998 9999 Q ss_pred ccEEEEEeeecccccccchheee-----------ccccccc------ccceeeeeeecccchhhhhhhhhee-----eec Q lcl|NC_019918. 73 PRSLVIGRRQVPSATVSVSVVQE-----------GQSYVLT------VNGLPVSYVSHQDDTATLIATGLKA-----AYD 130 (428) Q Consensus 73 P~~l~igr~~~~~~~~~~~~~~~-----------~~~~~~~------v~g~~~s~~~~~~~~a~~i~a~l~~-----a~~ 130 (428) |.+|+||||+++.....+.+.+. .....++ ..+++++.++++.++|+.|++++.. .++ T Consensus 81 P~~l~igR~~~~~~~~~l~g~~l~~~~la~~~~~~g~l~i~i~g~~~~~~i~~s~ats~~~vA~~i~~al~~~~~tv~~d 160 (501) T protein:vir:10 81 PYDLKFARYVAADAPASVYGIPLTGITLAQLQGYSGTLTVTTAAQHVSANISLAAATSFANAATLIEAAFTSPDFVVAYD 160 (501) T ss_pred ccEEEEEeecccCccceeeeceehhhhhhhhhheeeEEEEeeccceeeeccccccccCHHHHHHHHHHhhcCCceEEEEe Confidence 99999999998766544321111 1122222 3467889999999999999998864 244 Q ss_pred c--cceEEEEeeccccceeeeecccc-c--cccc--c-ceEEEEeeccccCHHHHHHHHHhcccCceEEE-EecCCHHHH Q lcl|NC_019918. 131 V--TPVVGVTVTDNEDGTLTVASNGD-W--SLKV--S-SNLTMAAAPSTEGWPATITAVQGENDEWYALS-IDSHADDDI 201 (428) Q Consensus 131 ~--~~~~~~~~tt~~~~~~t~as~~~-~--~~~~--s-~~~~~~~~~aa~~~~~al~~~~~~~~~w~~~~-~~~~~~~~~ 201 (428) . .++...+.+++..+++++++... . ...+ . +..+...+.++++|.+++.++.+.+++||++. 
+++++++|+ T Consensus 161 ~~~~~f~i~~~t~G~~~~i~~~t~~~d~a~~l~Lt~~~~a~v~~~g~~aet~~~Al~a~~~~~~~Wy~f~~a~~~~~~~~ 240 (501) T protein:vir:10 161 ALRNRFTVVTNTTGTAAAISAVTGTNNLADELGLSAAAGATLQAAGVAADTPASAMNRAVGLSRNWATFTTAWTAVIADR 240 (501) T ss_pred cccceEEEEecccCcceeEEEeeccccchhhhcccccCceeEEecCcccccHHHHHHHHHhcccceEEEEEEecCChHHH Confidence 3 36777777888888888776542 1 1222 2 23355778889999999999999999999764 567999999 Q ss_pred HHHHHHHhhhCCEEEEEecCcccc---cchhHHHHHHHHhcccCceEEEecCCccchhHHHHHHHHHhccCCCceeeeee Q lcl|NC_019918. 202 MAVATHIEGTKKVFIGATAQANTK---TSAENDIASRLVAAGFQRTALIYHPNADAQFPECAWVGYQLQEQPGSNTWTHK 278 (428) Q Consensus 202 ~ala~~~~a~~~~~~~~~~~~~~~---~~~~~~~~~~l~~~~~~~t~~~y~~~~~~~~~~a~~~~~~~~~~~g~~t~~fk 278 (428) +++|+|+|+++++|+|..++.+.. ....++++++|++++|.||+++||+.++.+++++++++.+|+..+|++||||| T Consensus 241 la~A~wi~a~~~~f~~~~~~~~~~~~~~~~~~~i~~~l~~~~y~~t~~~y~~~~~~aa~~g~~as~nf~~~~g~~T~~fk 320 (501) T protein:vir:10 241 LAFAAWNSGQAYKYMYVAPDLEAASIVTNNAASFGAQVFAAPYQGTLPLYGDQATAGAVMGYAASINFQLRNGRTVLAFR 320 (501) T ss_pred HHHHHHHHhcCceEEEEEecCcceeeecccchhHHHHHHhcCCCceEEECCCCCHHHHHHHHHHhcCcccCcceeeeeec Confidence 999999999999998887766543 34678999999999999999999998888888889999999999999999999 Q ss_pred ee-cCccccCCCHHHHHHHHhCCceEEEEEcCc----eeeecCEecCC-chhHHHHHHHHHHHHHHHHHHHHHHhcCCCC Q lcl|NC_019918. 279 AL-AAVDAYRLTPTESTNLKNKNVTTFERVGGV----NRTFGGAMAGG-EWIDVMIFVDWLEARMTERLWFRMANSKKIP 352 (428) Q Consensus 279 ~~-~Gv~~~~~t~t~~~~l~~~~~n~y~~~~~~----~~~~~G~~~~G-~~iD~~~~~dwl~~~lq~~l~~ll~~~~kip 352 (428) ++ +||+|++++++|+++|+++|||||+.+.+. 
.++++|++++| +|||+++|+|||+++||.+|++||.+++||| T Consensus 321 ql~~Gv~a~~l~~t~a~al~~~~~N~y~~~~~~~~~~~~~~~G~~sG~~~wiD~~~g~dWl~~~iq~~l~~ll~~~~kIP 400 (501) T protein:vir:10 321 QFNAGVPATAHDLPTANALRSNNYTYIGAYANAANNYTIAYDGKLSGKFLWVDTYLDQIYLNAELQRAEFEAMLAYNSLP 400 (501) T ss_pred ccCCCcCcccCCHHHHHHHHhcCCeEEEEEecccceeeEEEcceeeccceehhhHhhHHHHHHHHHHHHHHHHhcCCCcc Confidence 97 899999999999999999999999999863 37789999987 8999999999999999999999999999999 Q ss_pred cCHhHHHHHHHHHHHHHHHHHhcCceecC--------------------------CceEEEeCchHhCCHHHHhccccCC Q lcl|NC_019918. 353 YDAVGATILESEIRAQLNEGIRVGGLAEA--------------------------PAPKVFVPDVLSMSPNMRAQRIFEG 406 (428) Q Consensus 353 ~~~~G~~~i~~~i~~~~~~~~~~G~I~~g--------------------------~~~~v~~~~~~~~~~~dra~R~~~~ 406 (428) ||+.|+++|+++|+++|+++++||+|+|| +||+++.++++. ++++|++|++|+ T Consensus 401 yt~~G~~~l~a~i~~~l~~av~nG~Ia~Gv~l~~~q~~~i~~~~g~~~~~~~v~~~Gyy~~~~~~~~-~~~~R~~R~~p~ 479 (501) T protein:vir:10 401 YNEDGYTANYRAGVDVIDAAVTSGIIRAGVTLTNSQLQQIDAVAGVAGAGALVQTRGWYFLIGNPAN-PGQARQNRTSPA 479 (501) T ss_pred cCHHHHHHHHHHHHHHHHHHHhCcceecCcccCcccceeecccccccccccceeccceEEeeCcccC-ChhhhhhcccCc Confidence 99999999999999999999999999997 479999998876 447899999999 Q ss_pred eEEEEEECceEEEEEEEEEEec Q lcl|NC_019918. 407 IEFEARLAGAIHFVHIRGTVTV 428 (428) Q Consensus 407 i~~~~~~agaih~v~i~~~v~~ 428 (428) ++|+|+++|+||+|+| ++++| T Consensus 480 ~~~~y~~~gaIh~v~i-~s~~v 500 (501) T protein:vir:10 480 CTLWYSDGGSIQELTI-GSNAV 500 (501) T ss_pred eEEEEEeCCceeEEEe-eeeec Confidence 9999999999999999 55555 No 4 >protein:vir:3636 Length: 501 # NCBI annotation: gp04 # Family: family:all:396 # MgeID: mge:75 # MgeName: Bcep781 # Cross-refs: genbank:acc:NP_705631;genbank:gi:23752316;genbank:GeneID:955753 Probab=100.00 E-value=8e-117 Score=657.07 Aligned_cols=426 Identities=16% Similarity=0.193 Sum_probs=366.7 Q ss_pred CC----CCCceEEEeeeeecccccccccceEEEEcccCCCccceEEeeCHHHHHhhcCCChHHHHHHHHHHh----cCCc Q lcl|NC_019918. 
1 MT----VLTDVIDIQISRETAAVAQTNFNVPLFIASHTNFSERARVYNSLKGVAEDFGESDPTYLAAVRYFG----QALK 72 (428) Q Consensus 1 M~----~is~iV~V~i~~~~~~~~~~~f~~~li~~~~~~~~~~~~~y~s~~~V~~~fg~~s~eY~aA~~~F~----q~p~ 72 (428) || |+||||||+|+|.++++.+++|+++||+.++..|++|+|.|+++++|++|||.+|||||||++||+ |+|| T Consensus 1 m~~~~ip~s~iV~V~~~v~~~~~~~~~~~~lllt~~~~~~~~r~~~y~s~~~V~~~FG~~S~ey~aA~~yFs~~~~q~~~ 80 (501) T protein:vir:36 1 MPTTTIPIDQIVQMLPGVIGAGGAPGRLTGLVLTQDTSVQPGQLADFFQETDVENWFGALSNEAKIADAYFPGIVNGGQL 80 (501) T ss_pred CCcCCcccceEEEEeeeeccCCCcceeeeeEEEeccCCCCCcceeeecCHHHHHHhcCCChHHHHHHHHHhhcccCCCcc Confidence 99 599999999999999999999999988889999999999999999999999999999999999998 9999 Q ss_pred ccEEEEEeeecccccccchheee-----------cccccccc------cceeeeeeecccchhhhhhhhheee-----ec Q lcl|NC_019918. 73 PRSLVIGRRQVPSATVSVSVVQE-----------GQSYVLTV------NGLPVSYVSHQDDTATLIATGLKAA-----YD 130 (428) Q Consensus 73 P~~l~igr~~~~~~~~~~~~~~~-----------~~~~~~~v------~g~~~s~~~~~~~~a~~i~a~l~~a-----~~ 130 (428) |++|+||||++++....+.+.+. .....+++ .+++++.++++.++|+.|++++..+ ++ T Consensus 81 P~~l~igR~~~~a~~~~l~g~~l~~~~~a~~~~~sg~l~vti~g~~~~~~i~lS~~ts~~~vA~~i~~al~~~~~tv~~d 160 (501) T protein:vir:36 81 PYDLKFARYVAADAPASVYGIPLTGVTLAQLQGYSGTLTVTTAAQHVSANISLAAATSFANAATLIEAAFTSPDFVVAYD 160 (501) T ss_pred ccEEEEEeecCcCcceeEeccchhhhhhhhccceeEEEEEEecceeeeeecccccccCHHHHHHHHhhhhcCcceEEEEc Confidence 99999999997766554322110 11233333 3578888888999999999988643 44 Q ss_pred cc--ceEEEEeeccccceeeeecccc-c----ccccc-ceEEEEeeccccCHHHHHHHHHhcccCceEEE-EecCCHHHH Q lcl|NC_019918. 131 VT--PVVGVTVTDNEDGTLTVASNGD-W----SLKVS-SNLTMAAAPSTEGWPATITAVQGENDEWYALS-IDSHADDDI 201 (428) Q Consensus 131 ~~--~~~~~~~tt~~~~~~t~as~~~-~----~~~~s-~~~~~~~~~aa~~~~~al~~~~~~~~~w~~~~-~~~~~~~~~ 201 (428) .. ++...+.+++..+++++++.+. . ++... +..+...+.++++|.+++.++.+.+++||++. 
+++++++|+ T Consensus 161 ~~~~~f~i~s~t~G~~~~i~~~t~~~~ia~~l~Lt~~~~a~v~~~g~~~et~~~al~a~~~~s~~Wy~f~~a~~~~~~~~ 240 (501) T protein:vir:36 161 ALRNRFTVVTNATGTAAAISAVTGTNNFADEIGLSAAAGATLQAAGVAADTPASAMNRAVGLSRNWATFTTAWTAVIADR 240 (501) T ss_pred CcceeEEEEeccCCcceeeEeeecccchhhhhcccccCcceEEecccccccHHHHHHHHHhccCceEEEEEecCCChHHH Confidence 33 5667777777777888776542 1 11112 23455778889999999999999999999764 567999999 Q ss_pred HHHHHHHhhhCCEEEEEecCcccc---cchhHHHHHHHHhcccCceEEEecCCccchhHHHHHHHHHhccCCCceeeeee Q lcl|NC_019918. 202 MAVATHIEGTKKVFIGATAQANTK---TSAENDIASRLVAAGFQRTALIYHPNADAQFPECAWVGYQLQEQPGSNTWTHK 278 (428) Q Consensus 202 ~ala~~~~a~~~~~~~~~~~~~~~---~~~~~~~~~~l~~~~~~~t~~~y~~~~~~~~~~a~~~~~~~~~~~g~~t~~fk 278 (428) +++|+|+|+++++|+|..++.+.. ....++|+++|+.++|.||+++||+..+.+++++++++.+|+..+|++||||| T Consensus 241 la~A~wiea~~~~f~~~~~~~~~~~~~~~~~~~i~~~l~~~~y~~t~~~y~~~~~~aa~~g~~as~nf~~~~g~~T~~fk 320 (501) T protein:vir:36 241 LAFASWNSGQAYKYMYVAPDLEAASIVSNNAASFGAQVFAAPYQGTLPLYGDQATAGAVMGYAASINFQLRNGRTVLAFR 320 (501) T ss_pred HHHHHHHhhcCceEEEEEecCchhhhhccchhhHHHHHHhcCCCcEEEEcCCCCHHHHHHHHHHhcCcccCcceeeeecc Confidence 999999999999998887765532 34578999999999999999999998888888899999999999999999999 Q ss_pred ee-cCccccCCCHHHHHHHHhCCceEEEEEcCc----eeeecCEecCC-chhHHHHHHHHHHHHHHHHHHHHHHhcCCCC Q lcl|NC_019918. 279 AL-AAVDAYRLTPTESTNLKNKNVTTFERVGGV----NRTFGGAMAGG-EWIDVMIFVDWLEARMTERLWFRMANSKKIP 352 (428) Q Consensus 279 ~~-~Gv~~~~~t~t~~~~l~~~~~n~y~~~~~~----~~~~~G~~~~G-~~iD~~~~~dwl~~~lq~~l~~ll~~~~kip 352 (428) ++ +||+|++++++|+++|+++|||||+.+.+. 
.++++|+++++ +|||+++|+||||++||++|++||.+++||| T Consensus 321 q~~~Gi~a~~l~~t~a~al~~~~~N~y~~~~~~~~~~~~~~~G~~sG~~~wiD~~~g~dWL~~~iq~~l~~ll~~~~KIP 400 (501) T protein:vir:36 321 QFNAGVPATVHDLPTANALRSNNYTYIGAYANAANNYTIAYDGKLSGKFLWVDTYLDQIYLNAELQRAEFEAMLAYNSLP 400 (501) T ss_pred ccCCCcCcCcCCHHHHHHHHhcCCcEEEEEecccceeeEEEcCeeeccchhhhHHHhHHHHHHHHHHHHHHHHhcCCCCc Confidence 97 799999999999999999999999999763 47799999877 8999999999999999999999999999999 Q ss_pred cCHhHHHHHHHHHHHHHHHHHhcCceecC--------------------------CceEEEeCchHhCCHHHHhccccCC Q lcl|NC_019918. 353 YDAVGATILESEIRAQLNEGIRVGGLAEA--------------------------PAPKVFVPDVLSMSPNMRAQRIFEG 406 (428) Q Consensus 353 ~~~~G~~~i~~~i~~~~~~~~~~G~I~~g--------------------------~~~~v~~~~~~~~~~~dra~R~~~~ 406 (428) ||+.|+++|+++|+++|+++++||+|+|| +||+++.++++ ++++||++|++|+ T Consensus 401 ytd~G~~~l~a~i~~~l~~av~nG~I~~Gv~~~~~q~~~I~~~~g~~~~~~~v~~~Gyy~~~~~~~-~~~~~R~~R~~p~ 479 (501) T protein:vir:36 401 YNEDGYTGLYRAGVDVIDAAVTSGIIRAGVTLTNSQLQQIDKAARVAGAGQLVQTRGWYFLIGDPA-NPGQARQNRTTPA 479 (501) T ss_pred cChhhHHHHHHHHHHHHHHHHhCceeecCCCCCcccceeecccccccccccceeccceEEeeCccc-CChhhhhhcccCc Confidence 99999999999999999999999999997 58999988777 4667999999999 Q ss_pred eEEEEEECceEEEEEEEEEEec Q lcl|NC_019918. 407 IEFEARLAGAIHFVHIRGTVTV 428 (428) Q Consensus 407 i~~~~~~agaih~v~i~~~v~~ 428 (428) ++|+|+++|+||+|+| ++++| T Consensus 480 ~~~~y~~~gaIh~v~i-~s~~v 500 (501) T protein:vir:36 480 CTLWYSDGGSIQSLTI-GSNAV 500 (501) T ss_pred EEEEEEeCCceeEEEe-eeeee Confidence 9999999999999999 55555 No 5 >protein:vir:101576 Length: 501 # NCBI annotation: gp04 # Family: family:all:396 # MgeID: mge:1477 # MgeName: Bcep43 # Cross-refs: genbank:acc:NP_958108;genbank:gi:41057654;genbank:GeneID:2716834 Probab=100.00 E-value=6.4e-116 Score=652.13 Aligned_cols=426 Identities=16% Similarity=0.192 Sum_probs=368.4 Q ss_pred CC----CCCceEEEeeeeecccccccccceEEEEcccCCCccceEEeeCHHHHHhhcCCChHHHHHHHHHHh----cCCc Q lcl|NC_019918. 
1 MT----VLTDVIDIQISRETAAVAQTNFNVPLFIASHTNFSERARVYNSLKGVAEDFGESDPTYLAAVRYFG----QALK 72 (428) Q Consensus 1 M~----~is~iV~V~i~~~~~~~~~~~f~~~li~~~~~~~~~~~~~y~s~~~V~~~fg~~s~eY~aA~~~F~----q~p~ 72 (428) || |+||||||+|+|.++++.+++|+++||..++..|++|++.|+|+++|++|||.+|||||||++||+ |+|+ T Consensus 1 m~~~~ip~s~iV~V~~~v~~~~~~~~~~~~l~l~~~~~~~~~~~~~~~s~~~V~~~FG~~S~ey~aA~~yFsg~~~q~p~ 80 (501) T protein:vir:10 1 MPTTTIPIDQIVQMLPGVIGAGGAPGRLTGLVLTQDTSVQPGQLADFFQETDVENWFGALSNEAKIADAYFPGIVNGGQL 80 (501) T ss_pred CCCCCcccceEEEEeeecccCCCccccceeEEEeccCCCCccceEEecCHHHHHHhcCCChHHHHHHHHHhhhhcCCCcc Confidence 99 599999999999999999999999999999999999999999999999999999999999999999 9999 Q ss_pred ccEEEEEeeecccccccchheeec-----------cccccc------ccceeeeeeecccchhhhhhhhheee-----ec Q lcl|NC_019918. 73 PRSLVIGRRQVPSATVSVSVVQEG-----------QSYVLT------VNGLPVSYVSHQDDTATLIATGLKAA-----YD 130 (428) Q Consensus 73 P~~l~igr~~~~~~~~~~~~~~~~-----------~~~~~~------v~g~~~s~~~~~~~~a~~i~a~l~~a-----~~ 130 (428) |.+|+||||++++....+.+.+.. ....++ ..+++++.++++.++|+.|++++... ++ T Consensus 81 P~~l~igR~~~~~~~~~l~g~~l~~~~la~~~~~sg~l~vti~g~~~~~~i~ls~ats~~~vAs~i~~al~~~~~tv~~d 160 (501) T protein:vir:10 81 PYDLKFARYVAADAPASVYGIPLTGVTLAQLQGYSGTLTVTTAAQHVSANISLAAATSFANAATLIEAAFTSPDFVVAYD 160 (501) T ss_pred ccEEEEEeecCCCccceEeccchhhhhhhhcceeeeEEEEeeccceeecccccccccCHHHHHHHHhhhccCCceEEEEc Confidence 999999999987665543221110 112223 34678888999999999999988643 33 Q ss_pred c--cceEEEEeeccccceeeeeccccc---ccccc---ceEEEEeeccccCHHHHHHHHHhcccCceEEE-EecCCHHHH Q lcl|NC_019918. 131 V--TPVVGVTVTDNEDGTLTVASNGDW---SLKVS---SNLTMAAAPSTEGWPATITAVQGENDEWYALS-IDSHADDDI 201 (428) Q Consensus 131 ~--~~~~~~~~tt~~~~~~t~as~~~~---~~~~s---~~~~~~~~~aa~~~~~al~~~~~~~~~w~~~~-~~~~~~~~~ 201 (428) . .++...+.+++..+++++++++.. ...++ +..+...+.+++++.+++.++.+.+++||++. 
+++++++|+ T Consensus 161 ~~~~~f~its~ttG~~~~i~~~~~~~~la~~l~Lt~~~~a~v~~~g~~aet~~~a~~a~~~~~~~Wy~f~~a~~~~~~~~ 240 (501) T protein:vir:10 161 ALRNRFTVVTNATGTAAAISAVTGTNNLADELGLSAAAGATLQAAGVAADTPASAMNRAVGLSRNWATFTTAWTAVIADR 240 (501) T ss_pred ccCceEEEEeeccCCceeEEEeeCchhhhhhcCccccccceEEecCcccccHHHHHHHHHhccCceEEEEEecCCChHHH Confidence 3 467777788888888888766431 12222 23355778899999999999999999999764 578999999 Q ss_pred HHHHHHHhhhCCEEEEEecCcccc---cchhHHHHHHHHhcccCceEEEecCCccchhHHHHHHHHHhccCCCceeeeee Q lcl|NC_019918. 202 MAVATHIEGTKKVFIGATAQANTK---TSAENDIASRLVAAGFQRTALIYHPNADAQFPECAWVGYQLQEQPGSNTWTHK 278 (428) Q Consensus 202 ~ala~~~~a~~~~~~~~~~~~~~~---~~~~~~~~~~l~~~~~~~t~~~y~~~~~~~~~~a~~~~~~~~~~~g~~t~~fk 278 (428) +++|+|+|+++++|+|..++.+.. ....++|+++|+.++|.||+++||+..+.+++++++++.+|+..+|++||||| T Consensus 241 la~A~wiea~~~~f~~~~~~~~~~~~~~~~~~~i~~~l~~~~y~~t~~~y~~~~~~aa~~g~~as~nf~~~~g~~T~~fk 320 (501) T protein:vir:10 241 LAFAAWNSGQAYKYMYVAPDLEAASIVTNNAASFGAQVFAAPYQGTLPLYGDQATAGAVMGYAASINFQLRNGRTVLAFR 320 (501) T ss_pred HHHHHHHHhcCceEEEEEecCchhhhhhhhhhhHHHHHHhcCCCceEEECCCCcHHHHHHHHHHhhCcccCccceeeecc Confidence 999999999999998887766532 34578999999999999999999998888888899999999999999999999 Q ss_pred eec-CccccCCCHHHHHHHHhCCceEEEEEcCce----eeecCEecCC-chhHHHHHHHHHHHHHHHHHHHHHHhcCCCC Q lcl|NC_019918. 
279 ALA-AVDAYRLTPTESTNLKNKNVTTFERVGGVN----RTFGGAMAGG-EWIDVMIFVDWLEARMTERLWFRMANSKKIP 352 (428) Q Consensus 279 ~~~-Gv~~~~~t~t~~~~l~~~~~n~y~~~~~~~----~~~~G~~~~G-~~iD~~~~~dwl~~~lq~~l~~ll~~~~kip 352 (428) +++ ||+|++++++|+++|+++|||||+.+++.+ ++++|+++++ +|||.++|+|||+++||.++++||.+++||| T Consensus 321 q~~~Gi~a~~lt~t~a~al~~~~~N~y~~~~~~~~~~~~~~~G~~sG~~~wiD~~~~~~Wl~~~iq~~l~~ll~~~~kIP 400 (501) T protein:vir:10 321 QFNAGVPATAHDLPTANALRSNNYTYIGAYANAANNYTIAYDGKLSGKFLWVDTYLDQIYLNAELQRAEFEAMLAYNSLP 400 (501) T ss_pred ccCCCcCcccCCHHHHHHHHhcCCeEEEEeccccceeeEEecCeeeccceeehhhhhHHHHHHHHHHHHHHHHHhcCCcc Confidence 986 999999999999999999999999998653 6789999887 8999999999999999999999999999999 Q ss_pred cCHhHHHHHHHHHHHHHHHHHhcCceecC--------------------------CceEEEeCchHhCCHHHHhccccCC Q lcl|NC_019918. 353 YDAVGATILESEIRAQLNEGIRVGGLAEA--------------------------PAPKVFVPDVLSMSPNMRAQRIFEG 406 (428) Q Consensus 353 ~~~~G~~~i~~~i~~~~~~~~~~G~I~~g--------------------------~~~~v~~~~~~~~~~~dra~R~~~~ 406 (428) ||+.|+++|+++|+++|+++++||+|+|| +||+++.++++. +++||++|++|+ T Consensus 401 yt~~G~~~l~a~v~~~l~~av~nG~I~~Gv~l~~~q~~~i~~~~g~~~~~~~v~~~Gyy~~~~~~~~-~~~~R~~R~~p~ 479 (501) T protein:vir:10 401 YNEDGYTALYRAGVDVIDAAVTSGIIRAGVTLTNSQLQQIDAAAGVAGAGQLVQTRGWYFLIGDPAN-PGQARQNRTTPA 479 (501) T ss_pred cCHHHHHHHHHHHHHHHHHHHhCceeecCCCCCcccceeeccccCccccccceeccceeEeeccccC-Chhhhhhccccc Confidence 99999999999999999999999999996 489999888874 667999999999 Q ss_pred eEEEEEECceEEEEEEEEEEec Q lcl|NC_019918. 
407 IEFEARLAGAIHFVHIRGTVTV 428 (428) Q Consensus 407 i~~~~~~agaih~v~i~~~v~~ 428 (428) ++|+|+++|+||+|+| ++++| T Consensus 480 ~~~~y~~~gaIh~v~i-~s~~v 500 (501) T protein:vir:10 480 CTLWYSDGGSIQQLTI-GSNAV 500 (501) T ss_pred eEEEEEeCCceeEEEe-eeeec Confidence 9999999999999999 55555 No 6 >protein:vir:78611 Length: 501 # NCBI annotation: BcepNY3gp03 # Family: family:all:396 # MgeID: mge:1854 # MgeName: BcepNY3 # Cross-refs: genbank:acc:YP_001294840;genbank:gi:149882903;genbank:GeneID:5291082 Probab=100.00 E-value=9.5e-116 Score=651.18 Aligned_cols=426 Identities=17% Similarity=0.194 Sum_probs=366.2 Q ss_pred CC----CCCceEEEeeeeecccccccccceEEEEcccCCCccceEEeeCHHHHHhhcCCChHHHHHHHHHHh----cCCc Q lcl|NC_019918. 1 MT----VLTDVIDIQISRETAAVAQTNFNVPLFIASHTNFSERARVYNSLKGVAEDFGESDPTYLAAVRYFG----QALK 72 (428) Q Consensus 1 M~----~is~iV~V~i~~~~~~~~~~~f~~~li~~~~~~~~~~~~~y~s~~~V~~~fg~~s~eY~aA~~~F~----q~p~ 72 (428) || |+||||||+|+|.++++.+++|+++||+.++..|++|+|.|+++++|++|||.+||||+||++||+ |+|+ T Consensus 1 m~~~~ip~s~iV~V~~~v~~~~~~~~~~~~lll~~~~~~~~~r~~~y~s~~~V~~~FG~~S~ey~aA~~yFs~~~~q~~~ 80 (501) T protein:vir:78 1 MPTTTIPIDQIVQMLPGVIGAGGAPGRLTGLVLTQDTSIQPGQLADFFQKTDVENWFGGLSNEAVIADAYFPGIVNGGQL 80 (501) T ss_pred CCcCccccceEEEEeeecccCCCcceeeeeEEEecCCCCCccceeeecCHHHHHHhcCCChHHHHHHHHHhhcCCCCCcc Confidence 99 599999999999999999999999999999999999999999999999999999999999999999 9999 Q ss_pred ccEEEEEeeecccccccchheee-----------ccccccccc------ceeeeeeecccchhhhhhhhheee-----ec Q lcl|NC_019918. 73 PRSLVIGRRQVPSATVSVSVVQE-----------GQSYVLTVN------GLPVSYVSHQDDTATLIATGLKAA-----YD 130 (428) Q Consensus 73 P~~l~igr~~~~~~~~~~~~~~~-----------~~~~~~~v~------g~~~s~~~~~~~~a~~i~a~l~~a-----~~ 130 (428) |.+|+||||++++....+.+.+. 
.....++++ +++++..+++.++++.|.+++..+ ++ T Consensus 81 P~~l~igR~~~~a~~~~l~g~~l~~~~la~~~~~~G~l~iti~g~~~~~~i~~S~~ts~~~vA~~i~~al~a~~~tv~~d 160 (501) T protein:vir:78 81 PYDLKFARYVAADAPASVYGIPLTGVTLTQLQGYSGTLTVTTAAQHVSSNISLAAATSFANAATLIEAAFTSPDFVVSYD 160 (501) T ss_pred cceEEEEeecccCcceeEeccceeccchhhhceeeeEEEEEeccceeeeccccccccCHHHHHHHHHhhhcCcceEEEEc Confidence 99999999998765544322111 012233333 478888899999999999988643 44 Q ss_pred cc--ceEEEEeeccccceeeeeccccc---ccccc---ceEEEEeeccccCHHHHHHHHHhcccCceEEE-EecCCHHHH Q lcl|NC_019918. 131 VT--PVVGVTVTDNEDGTLTVASNGDW---SLKVS---SNLTMAAAPSTEGWPATITAVQGENDEWYALS-IDSHADDDI 201 (428) Q Consensus 131 ~~--~~~~~~~tt~~~~~~t~as~~~~---~~~~s---~~~~~~~~~aa~~~~~al~~~~~~~~~w~~~~-~~~~~~~~~ 201 (428) .. ++...+.+++..+++++++++.. ...++ .......+.+++++.+++.++.+.+++||++. +++++++|+ T Consensus 161 s~~~~f~its~t~G~~~~i~~~t~~~~~a~~l~Lt~~~~a~v~~~g~~aet~~~a~~a~~~~~~~Wy~f~~a~~~~~~~~ 240 (501) T protein:vir:78 161 ALRNRFVVNTNATGTAAAISAVTGTNNLADELGLSAAAGASLQAAGVAADTPASAMNRAVGLSRNWATFTTAWTAVIADR 240 (501) T ss_pred cccceEEEEeeecCCceeEEEEecccchhhhhcccccCceeeEeccccccCHHHHHHHHHhccCceEEEEEecCCCHHHH Confidence 43 56677777888888888776431 12222 23345778899999999999999999999765 567999999 Q ss_pred HHHHHHHhhhCCEEEEEecCcccc---cchhHHHHHHHHhcccCceEEEecCCccchhHHHHHHHHHhccCCCceeeeee Q lcl|NC_019918. 202 MAVATHIEGTKKVFIGATAQANTK---TSAENDIASRLVAAGFQRTALIYHPNADAQFPECAWVGYQLQEQPGSNTWTHK 278 (428) Q Consensus 202 ~ala~~~~a~~~~~~~~~~~~~~~---~~~~~~~~~~l~~~~~~~t~~~y~~~~~~~~~~a~~~~~~~~~~~g~~t~~fk 278 (428) +++|+|+|+++++|+|..++.+.. 
....++++++|+.++|.||+++||+....+..++++++.+|+..+|++||||| T Consensus 241 lalA~wiea~~~~f~~~~~~~~~~~~~~~~~~~i~~~l~a~~y~~t~~~y~~~~~~aa~~g~~as~nf~~~~g~~T~~fk 320 (501) T protein:vir:78 241 LALASWNSGQAYKYMYVAPDLEPASIVTNNSASFGAQVFAAPYQGTLPLYGDQATAGAVMGYAASINFQLRNGRTVLAFR 320 (501) T ss_pred HHHHHHHHhcCceEEEEEecCCcceeecccchhHHHHHhhcCCCceEEEcCCcchHHHHHHHHHhcCcccCcceeeeecc Confidence 999999999999998887665533 34578999999999999999999987766777888888889999999999999 Q ss_pred ee-cCccccCCCHHHHHHHHhCCceEEEEEcCc----eeeecCEecCC-chhHHHHHHHHHHHHHHHHHHHHHHhcCCCC Q lcl|NC_019918. 279 AL-AAVDAYRLTPTESTNLKNKNVTTFERVGGV----NRTFGGAMAGG-EWIDVMIFVDWLEARMTERLWFRMANSKKIP 352 (428) Q Consensus 279 ~~-~Gv~~~~~t~t~~~~l~~~~~n~y~~~~~~----~~~~~G~~~~G-~~iD~~~~~dwl~~~lq~~l~~ll~~~~kip 352 (428) ++ +||+|++++++|+++|+++|||||+.+.+. .++++|+++++ +|||.++|+|||+++||.++++||.+++||| T Consensus 321 q~~~Gv~a~~l~~t~a~al~~~~~N~y~~~~~~~~~~~~~~~G~~sG~~~wiD~~~~~~Wl~~~iq~~l~~ll~~~~kIP 400 (501) T protein:vir:78 321 QFNAGVPATAHDLGTANALRSNNYTYIGAYANAANNYTIAYDGKLSGKFLWVDTYLDQIYLNAELQRAEFEAMLAYNSLP 400 (501) T ss_pred ccCCCcCcccCCHHHHHHHHhcCCeEEEEEecccceeeEEEcCeeeccceeehhhhhHHHHHHHHHHHHHHHHHhCCCcc Confidence 96 899999999999999999999999999863 37789999877 7999999999999999999999999999999 Q ss_pred cCHhHHHHHHHHHHHHHHHHHhcCceecC--------------------------CceEEEeCchHhCCHHHHhccccCC Q lcl|NC_019918. 353 YDAVGATILESEIRAQLNEGIRVGGLAEA--------------------------PAPKVFVPDVLSMSPNMRAQRIFEG 406 (428) Q Consensus 353 ~~~~G~~~i~~~i~~~~~~~~~~G~I~~g--------------------------~~~~v~~~~~~~~~~~dra~R~~~~ 406 (428) ||+.|+++|+++|+++|+++++||+|+|| +||+++.++++. 
++++|++|++|+ T Consensus 401 yt~~G~~~l~a~v~~~l~~av~nG~I~~Gv~l~~~q~~~I~~~~g~~~~~~~~~~~Gyy~~~~~~~~-~~~~R~~R~~p~ 479 (501) T protein:vir:78 401 YNEDGYTALYRAGVDVIDAAVTSGIIRAGVTLTNSQLQQIDAAAGVAGAGQLVQMRGWYFLIGDPAN-PGQARQNRTTPT 479 (501) T ss_pred cCHHHHHHHHHHHHHHHHHHHhCceeecCCCCCCccceeeccccCccccccceeccceEEeeccccC-ChhhhhhcccCc Confidence 99999999999999999999999999996 489999998876 457899999999 Q ss_pred eEEEEEECceEEEEEEEEEEec Q lcl|NC_019918. 407 IEFEARLAGAIHFVHIRGTVTV 428 (428) Q Consensus 407 i~~~~~~agaih~v~i~~~v~~ 428 (428) ++|+|+++|+||+|+| ++++| T Consensus 480 ~~~~y~~~gaIh~v~i-~s~~v 500 (501) T protein:vir:78 480 CTLWYSDGGSIQELTI-GSNAV 500 (501) T ss_pred EEEEEEeCCceeEEEe-eeeec Confidence 9999999999999999 55555 No 7 >protein:vir:94073 Length: 494 # NCBI annotation: hypothetical protein # Family: family:all:396 # MgeID: mge:1493 # MgeName: OP2 # Cross-refs: genbank:acc:YP_453619;genbank:gi:84662655;genbank:GeneID:5142583 Probab=100.00 E-value=8.2e-114 Score=640.55 Aligned_cols=426 Identities=16% Similarity=0.197 Sum_probs=362.9 Q ss_pred CC--CCCceEEEeeeeecccccccccceEEEEcccCCCccceEEeeCHHHHHhhcCCChHHHHHHHHHHh----cCCccc Q lcl|NC_019918. 1 MT--VLTDVIDIQISRETAAVAQTNFNVPLFIASHTNFSERARVYNSLKGVAEDFGESDPTYLAAVRYFG----QALKPR 74 (428) Q Consensus 1 M~--~is~iV~V~i~~~~~~~~~~~f~~~li~~~~~~~~~~~~~y~s~~~V~~~fg~~s~eY~aA~~~F~----q~p~P~ 74 (428) || ||||||||+|+|.++++++++|+.+||++++..|.||+|.|+++++|++|||.+|||||||++||+ |+|+|. T Consensus 1 m~~ip~s~iV~V~~~v~~~~~~~~~f~~~l~~~~~~~~~~r~~~y~s~~~V~~~FG~~S~ey~aA~~yFs~~~~q~p~P~ 80 (494) T protein:vir:94 1 MPNIPISQIVSINPQVVSAGGTQGTLDGLLLTQATGFPVTQPQVYFSAADVGTAFGLTSDEYNAALVYFAGILGGGQQPA 80 (494) T ss_pred CCCCCcccEEEeeeeccccCCcccccceeEeecCccCCccceeeecCHHHHHHhcCCChHHHHHHHHHhhhccCCCcccc Confidence 99 889999999999999999999999999999999999999999999999999999999999999999 999999 Q ss_pred EEEEEeeecccccccchheee----------cccccc------cccceeeeeeecccchhhhhhhhheee-----eccc- Q lcl|NC_019918. 
75 SLVIGRRQVPSATVSVSVVQE----------GQSYVL------TVNGLPVSYVSHQDDTATLIATGLKAA-----YDVT- 132 (428) Q Consensus 75 ~l~igr~~~~~~~~~~~~~~~----------~~~~~~------~v~g~~~s~~~~~~~~a~~i~a~l~~a-----~~~~- 132 (428) +|+||||++++....+.+... ...+.+ ++.+++++..++++++|+.|++++..+ ++.. T Consensus 81 ~l~igR~~~~a~~~~l~g~~~~~tl~~~~~~~g~l~iti~g~~~~~~i~lS~~ts~~~vA~~i~~ai~~a~~~v~~d~~~ 160 (494) T protein:vir:94 81 SLTIGRYASAATSAAVFGAPLTLSLAQLQTLSGTLIVTTDTQRTSAAINLSGATSFANAASLMTSGFTTPNFAITYDAQR 160 (494) T ss_pred EEEEEeecCccccceeeccchhhhHHhhhhcceEEEEEEcceEEEeeecccccCChhhHHHHHhhhhccccceEEEcccC Confidence 999999997654433321110 112223 345578889999999999999888642 4433 Q ss_pred -ceEEEEeeccccceeeeecccccc-ccc---cceEEEEeeccccCHHHHHHHHHhcccCceEEEE-ecCCHHHHHHHHH Q lcl|NC_019918. 133 -PVVGVTVTDNEDGTLTVASNGDWS-LKV---SSNLTMAAAPSTEGWPATITAVQGENDEWYALSI-DSHADDDIMAVAT 206 (428) Q Consensus 133 -~~~~~~~tt~~~~~~t~as~~~~~-~~~---s~~~~~~~~~aa~~~~~al~~~~~~~~~w~~~~~-~~~~~~~~~ala~ 206 (428) ++...+.+++..+++++++..... ... ....+...+.++|+|.+++.++.+.+++||++.+ ++++++|+++||+ T Consensus 161 ~~f~v~s~ttG~~s~is~~t~~~a~~l~lt~~~~a~v~~~g~~aet~~~a~~a~~~~~~~Wy~f~~~~~~~~~~ilalA~ 240 (494) T protein:vir:94 161 RRFVLSTTATGTTASVSAVTGTLADGVGLSTASGAYVEGSGLAADTAASALDRLAASSSTWAIFTTAWAASLSDRTALAQ 240 (494) T ss_pred cEEEEEEccCCceeEEEEeccchhhhhhhhccccceEeecCcccccHHHHHHHHHhccCceEEEEEecCCCHHHHHHHHH Confidence 566666677777777776643221 111 2223456788899999999999999999997765 4678999999999 Q ss_pred HHhhhCCEEEEEecCcccc---cchhHHHHHHHHhcccCceEEEecCCccchhHHHHHHHHHhccCCCceeeeee-eecC Q lcl|NC_019918. 207 HIEGTKKVFIGATAQANTK---TSAENDIASRLVAAGFQRTALIYHPNADAQFPECAWVGYQLQEQPGSNTWTHK-ALAA 282 (428) Q Consensus 207 ~~~a~~~~~~~~~~~~~~~---~~~~~~~~~~l~~~~~~~t~~~y~~~~~~~~~~a~~~~~~~~~~~g~~t~~fk-~~~G 282 (428) |+|+++++|+|..++.+.. 
....++|+++|+.++|.||+++||+..+...+++++++.+++..+|++||+|| +++| T Consensus 241 wiea~~~~~~~~~~~~d~~~~~~~~~~~i~~~l~~~~y~~t~~~y~~~~~~aa~~g~~aa~~~~~~~g~~T~~~k~q~~g 320 (494) T protein:vir:94 241 WTSDQVFRRIYAAWDQDAAGLSVNNVSSFGNIVKTTPFSNTIPVYGLLANAMIVLAWGASTNLQIAEGRTTLALRSPVSS 320 (494) T ss_pred HHhhcCccEEEEEecCCcceeecccchhHHHHHHhhcCCceEEEcCCCChHHHHHHHHHhccccccCcceeEEeeccCCC Confidence 9999999998876655432 34578999999999999999999998887778888888888999999999999 6899 Q ss_pred ccccCCCHHHHHHHHhCCceEEEEEcCce---eeecCEecCCc--hhHHHHHHHHHHHHHHHHHHHHHHhcCCCCcCHhH Q lcl|NC_019918. 283 VDAYRLTPTESTNLKNKNVTTFERVGGVN---RTFGGAMAGGE--WIDVMIFVDWLEARMTERLWFRMANSKKIPYDAVG 357 (428) Q Consensus 283 v~~~~~t~t~~~~l~~~~~n~y~~~~~~~---~~~~G~~~~G~--~iD~~~~~dwl~~~lq~~l~~ll~~~~kip~~~~G 357 (428) ++|++++.+|+++|+++|||||+.+++.+ .+++|++++|+ |||.+++++|||++||.+|++||.+++|||||+.| T Consensus 321 i~~~~l~~t~a~al~~~~~N~y~~~~~~~~~~~~~~gg~~sG~~~~id~~~~~~WL~~~iq~~l~~ll~~~~KIPytd~G 400 (494) T protein:vir:94 321 AGVRVDNLANANALLSNGYTYLGKYASATNTYTVTYNGAIGGQFLWADTALGWIALRRNLQQALFETLLAYRSLPYNADG 400 (494) T ss_pred CCCccCCHHHHHHHHhcCCeEEEEecccCceEEEecCceeccccceeeeeccHHHHHHHHHHHHHHHHHhCCCcccChhh Confidence 99999999999999999999999998754 34566677886 68899999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHhcCceecC-------------------------CceEEEeCchHhCCHHHHhccccCCeEEEEE Q lcl|NC_019918. 358 ATILESEIRAQLNEGIRVGGLAEA-------------------------PAPKVFVPDVLSMSPNMRAQRIFEGIEFEAR 412 (428) Q Consensus 358 ~~~i~~~i~~~~~~~~~~G~I~~g-------------------------~~~~v~~~~~~~~~~~dra~R~~~~i~~~~~ 412 (428) +++|+++|+++|+++++||+|+|| +|||++.. 
..+++++|++|.+|+++|+|+ T Consensus 401 ~~~l~a~i~~~l~~av~nG~I~~Gv~~~~~q~~~i~~~~G~~~~~~~~~kGyy~~~~--~~~s~~~ra~R~~~~~~~~y~ 478 (494) T protein:vir:94 401 YNALYQGAQDVVSQFVAAGVIRAGVALSASQRAQIDQAAGVPISGDVVDKGWYLQVI--DPITTTVRTDRGSPTVNFWYC 478 (494) T ss_pred HHHHHHHHHHHHHHHHhCceeecccccCcchhhhhhhhhcCccccceeccceeeecc--CCCChhhhhccccCCceEEEE Confidence 999999999999999999999996 57887752 346899999999999999999 Q ss_pred ECceEEEEEEEEEEec Q lcl|NC_019918. 413 LAGAIHFVHIRGTVTV 428 (428) Q Consensus 413 ~agaih~v~i~~~v~~ 428 (428) ++|+||+|+|+++..+ T Consensus 479 ~~GAIh~v~i~~~~v~ 494 (494) T protein:vir:94 479 DGGSIQRVVVSATTVI 494 (494) T ss_pred ecCcEEEEEEeeEEeC Confidence 9999999999999999 No 8 >protein:vir:96104 Length: 504 # NCBI annotation: hypothetical protein ORF029 # Family: family:all:396 # MgeID: mge:1597 # MgeName: F8 # Cross-refs: genbank:acc:YP_001294446;genbank:gi:149408343;genbank:GeneID:5237223 Probab=100.00 E-value=1.4e-113 Score=639.27 Aligned_cols=422 Identities=15% Similarity=0.144 Sum_probs=348.1 Q ss_pred CCCCCceEEEeeeeecccccccccceEEEEcccCC-CccceEEeeCHHHHHhhcCCChHHHHHHHHHHhcCC----cccE Q lcl|NC_019918. 1 MTVLTDVIDIQISRETAAVAQTNFNVPLFIASHTN-FSERARVYNSLKGVAEDFGESDPTYLAAVRYFGQAL----KPRS 75 (428) Q Consensus 1 M~~is~iV~V~i~~~~~~~~~~~f~~~li~~~~~~-~~~~~~~y~s~~~V~~~fg~~s~eY~aA~~~F~q~p----~P~~ 75 (428) |.|+||||||+|+|.++++.+++|+.+||++.+.. |+||+|.|+|+++|++|||.+|||||||++||+|.| +|++ T Consensus 1 mip~s~iV~V~~~v~~~~~~~~~~~~~l~l~~~~~~~~~r~~~y~s~~~V~~~FG~~S~ey~aA~~yF~~~~~~~~~P~~ 80 (504) T protein:vir:96 1 MISQSRYIRIISGVGAGAPVAGRKLILRVMTTNNVIPPGIVIEFDNANAVLSYFGAQSEEYQRAAAYFKFISKSVNSPSS 80 (504) T ss_pred CCCccceeEeeecccccccccccccceeEeecccCCCccceEEecCHHHHHHhcCCChHHHHHHHHHhhcCCCCCccccE Confidence 99999999999999999999999999999999875 469999999999999999999999999999999988 9999 Q ss_pred EEEEeeecccccccchh------------eeecccccc-------cccceeeeeeecccchhhhhhhhhee--------- Q lcl|NC_019918. 
76 LVIGRRQVPSATVSVSV------------VQEGQSYVL-------TVNGLPVSYVSHQDDTATLIATGLKA--------- 127 (428) Q Consensus 76 l~igr~~~~~~~~~~~~------------~~~~~~~~~-------~v~g~~~s~~~~~~~~a~~i~a~l~~--------- 127 (428) |+||||++++....+.+ ++++ ..++ ++.+++++..+++.++|+.|++++.+ T Consensus 81 l~igR~~~~a~~~~l~g~~~~~~~~~~~~i~~G-~lsitv~G~~~~~~~i~~S~~ts~~~vA~~i~~al~~~~~~~~~~~ 159 (504) T protein:vir:96 81 ISFARWVNTAIAPMVVGDNLPKTIADFAGFSAG-VLTIMVGAAEKNITAIDTSAATSMDNVASIIQTEIRKNTDPQLAQA 159 (504) T ss_pred EEEEeecCcCccceEEechhHHHHHHHhhhhce-EEEEEEcceeeeecccccccccchHHHHHHHHhhhhcccccccccc Confidence 99999997766544322 1222 2233 34567889999999999999988864 Q ss_pred --eecccc--eEEEEeeccccceeeeecccc--cc--ccc-cceEEEEeeccccCHHHHHHHHHhcccCceEEEEe--cC Q lcl|NC_019918. 128 --AYDVTP--VVGVTVTDNEDGTLTVASNGD--WS--LKV-SSNLTMAAAPSTEGWPATITAVQGENDEWYALSID--SH 196 (428) Q Consensus 128 --a~~~~~--~~~~~~tt~~~~~~t~as~~~--~~--~~~-s~~~~~~~~~aa~~~~~al~~~~~~~~~w~~~~~~--~~ 196 (428) .+|... |...+.+++..+.....+... .+ ..+ .+......+.+++++.+++.++.+++++||++++. .+ T Consensus 160 tv~~d~~~~~f~its~~tg~~~~~~~~~a~~~~~~~~lgl~~~~~~~v~g~~aet~~~al~al~~~~~~Wy~f~~a~~~~ 239 (504) T protein:vir:96 160 TVTWNPNTNQFTLVGATIGTGVLAVAKSADPQDMSTALGWSTSNVVNVAGQAADLPDAAVAKSTNVSNNFGSFLFAGATL 239 (504) T ss_pred eEEEeccCCeEEEEeeccccceeEEEeeccccchhhhhhcccccceEEeecccccHHHHHHHHHhhcCCeEEEEEEeccC Confidence 344443 444444444433333322211 11 111 13344566778999999999999999999987664 36 Q ss_pred CHHHHHHHHHHHhhhCCEEEEEecCcccccchhHHHHHHHHhcccCceEEEecCCccchh----HHHHHHHHHhccCCCc Q lcl|NC_019918. 197 ADDDIMAVATHIEGTKKVFIGATAQANTKTSAENDIASRLVAAGFQRTALIYHPNADAQF----PECAWVGYQLQEQPGS 272 (428) Q Consensus 197 ~~~~~~ala~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~t~~~y~~~~~~~~----~~a~~~~~~~~~~~g~ 272 (428) ++++++++|+|+|+++++|+|..++.... ..+... 
+...++.+++.+||...+.++ +++++++.+|+..+|+ T Consensus 240 ~dd~ilalA~w~ea~~~~~~~~~~~~~~~---~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~as~~f~~~ng~ 315 (504) T protein:vir:96 240 DNDQIKAVSAWNAAQNNQFIYTVATSLAN---LGALFD-LVKGNSGTALNVLSATASNDFVEQCPSEILAATNYDEPGAS 315 (504) T ss_pred CHHHHHHHHHHHhhcCceEEEEEeecccc---hhhHHH-hhhhcceeEEEEeecCccchhHHHHHHHHHHhcCcCccccc Confidence 88999999999999999999887654322 122233 344455667777776554444 4677788889999999 Q ss_pred eeeeeeeecCccccCCCHHHHHHHHhCCceEEEEEcCc----eeeecCEecCCc----hhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019918. 273 NTWTHKALAAVDAYRLTPTESTNLKNKNVTTFERVGGV----NRTFGGAMAGGE----WIDVMIFVDWLEARMTERLWFR 344 (428) Q Consensus 273 ~t~~fk~~~Gv~~~~~t~t~~~~l~~~~~n~y~~~~~~----~~~~~G~~~~G~----~iD~~~~~dwl~~~lq~~l~~l 344 (428) +|||||+++||+|++++++|+++|+++|||||+.+++. .++++|+|++|+ |||++++++||+++||++|++| T Consensus 316 ~T~~fk~l~GVta~~lt~t~~~aL~~~~~N~y~~~a~~~~~~~~~~~G~~~gG~~~~~wiDv~~~~~WL~~~lq~~l~~l 395 (504) T protein:vir:96 316 QNYMYYQFPGRNITVSDDTAANTVDKSRGNYIGVTQANGQQLAFYQRGILCGGPTDAVDMNVYANEIWLKSAIAQALLDL 395 (504) T ss_pred ccccccccCCcCcccCCHHHHHHHHhcCCeEEEEeecccceeeEEecCeeeCCccccchhhhhhhHHHHHHHHHHHHHHH Confidence 99999999999999999999999999999999999865 367999999997 7999999999999999999999 Q ss_pred HHhcCCCCcCHhHHHHHHHHHHHHHHHHHhcCceecC--------------------------CceEEEeCchHhCCHHH Q lcl|NC_019918. 
345 MANSKKIPYDAVGATILESEIRAQLNEGIRVGGLAEA--------------------------PAPKVFVPDVLSMSPNM 398 (428) Q Consensus 345 l~~~~kip~~~~G~~~i~~~i~~~~~~~~~~G~I~~g--------------------------~~~~v~~~~~~~~~~~d 398 (428) |.+++|||||+.|+++|+++|+++|++|++||+|+|| +||+++.|++++++++| T Consensus 396 ~~~~~kIPyt~~Gi~~l~a~i~~vl~~av~~G~I~~Gv~~~~~q~~~I~~~~~~d~~~~~~~~~GYyv~~~~~s~~s~~~ 475 (504) T protein:vir:96 396 FLNVNAVPASMVGEAMTLAVLQPVLDKATSNGTFTYGKDISAVQQQYITQITGDRRAWRQVQTLGYWINITFSSYTNSNT 475 (504) T ss_pred HhcCCCcccCHhhHHHHHHHHHHHHHHHHhcceeccCccCCccchheecccccccccccceeccceEEEecChhccChhH Confidence 9999999999999999999999999999999999996 47999999999999999 Q ss_pred HhccccCCeEEEEEECceEEEEEEEEEEe Q lcl|NC_019918. 399 RAQRIFEGIEFEARLAGAIHFVHIRGTVT 427 (428) Q Consensus 399 ra~R~~~~i~~~~~~agaih~v~i~~~v~ 427 (428) |++|++|+++|+|+++|+||+|+|..++. T Consensus 476 r~~R~~~~~~~~y~~~gaI~~v~~~~~~v 504 (504) T protein:vir:96 476 GLTEWKANYTLIYSKGDAIRFVEGSDVMI 504 (504) T ss_pred hhhccccceEEEEEECCeEEEEEeccccC Confidence 99999999999999999999999988887 No 9 >protein:vir:99586 Length: 507 # NCBI annotation: hypothetical protein # Family: family:all:396 # MgeID: mge:1544 # MgeName: BcepF1 # Cross-refs: genbank:acc:YP_001039795;genbank:gi:126011045;genbank:GeneID:4818281 Probab=100.00 E-value=8.5e-114 Score=640.47 Aligned_cols=425 Identities=13% Similarity=0.111 Sum_probs=351.6 Q ss_pred CCCCCceEEEeeeeecccccccccceEEEEcccC-CCccceEEeeCHHHHHhhcCCChHHHHHHHHHHhcCC----cccE Q lcl|NC_019918. 1 MTVLTDVIDIQISRETAAVAQTNFNVPLFIASHT-NFSERARVYNSLKGVAEDFGESDPTYLAAVRYFGQAL----KPRS 75 (428) Q Consensus 1 M~~is~iV~V~i~~~~~~~~~~~f~~~li~~~~~-~~~~~~~~y~s~~~V~~~fg~~s~eY~aA~~~F~q~p----~P~~ 75 (428) |.||||||||+|+|.++++.+++|+.+||++++. 
.|.||+|.|+++++|++|||.+|||||||++||+|.| +|++ T Consensus 1 mip~s~iVnV~~~v~~~a~~~~~~~~~lilt~~~~~~~~r~~~y~s~~~V~~~FG~~S~ey~aA~~yFsq~p~~~~~P~~ 80 (507) T protein:vir:99 1 MISQSRYVRIVSGVGAGAPVAQRRLIMRVMTTNAVLPPGVVFESSSADAVGAYFGMASEEYKRAKAYMSFISKSINSPSY 80 (507) T ss_pred CCCccceeEEeeeccccCcccccccceeeeccccCCCccceEeecCHHHHHHhcCCChHHHHHHHHHhccCCCCCcccce Confidence 9999999999999999999999998888887775 4679999999999999999999999999999999999 7999 Q ss_pred EEEEeeecccccccchhee-----------eccccccccc-------ceeeeeeecccchhhhhhhhhee---------- Q lcl|NC_019918. 76 LVIGRRQVPSATVSVSVVQ-----------EGQSYVLTVN-------GLPVSYVSHQDDTATLIATGLKA---------- 127 (428) Q Consensus 76 l~igr~~~~~~~~~~~~~~-----------~~~~~~~~v~-------g~~~s~~~~~~~~a~~i~a~l~~---------- 127 (428) |+||||+++.....+.+.+ ..+..+++|+ +++++..++++++|+.|++++.. T Consensus 81 L~igR~~~~~~~a~l~g~~~~~~l~~~~~~~~G~lti~v~G~~~t~~~i~lS~~ts~~~vAs~i~~~l~a~~~~~~~~~t 160 (507) T protein:vir:99 81 ISFARWVNAAIASMIVGDSLVKNLPALKAVATPTLSLSIGGTVVPIAGIDLTAALTLTDVAATLQTKIRASANAELATAT 160 (507) T ss_pred EEEEeecCccccceeecchhhhhHHHHhhhcceeEEEEEcCceeEeccccccccCCHHHHHHHHHHhhhccccccccceE Confidence 9999998766554332211 1123344444 46788899999999999998874 Q ss_pred -eecc--cceEEEEeeccccceeeeeccccccccc-------cceEEEEeeccccCHHHHHHHHHhcccCceEEEEe--- Q lcl|NC_019918. 128 -AYDV--TPVVGVTVTDNEDGTLTVASNGDWSLKV-------SSNLTMAAAPSTEGWPATITAVQGENDEWYALSID--- 194 (428) Q Consensus 128 -a~~~--~~~~~~~~tt~~~~~~t~as~~~~~~~~-------s~~~~~~~~~aa~~~~~al~~~~~~~~~w~~~~~~--- 194 (428) .+|. .+|...+.+++.++++.++.....++.. ..+..+..+.+++++.+++.++.+.+++||++.+. 
T Consensus 161 v~~d~~~~~F~v~s~~tG~~s~i~~at~~~~gt~~s~l~~~~~~~a~~~~g~~aet~~~a~~a~~~~~~nW~~~~~a~~~ 240 (507) T protein:vir:99 161 VTFNTTTNQFVLNGTTTGALAPTITAVRTDPATDISSLLGWTNTGTVFVKGQAAETPDTSISKSAAISTNFGSFIYTSTP 240 (507) T ss_pred EEEecCCceEEEEeeeccccceeEEEEcCCchhhHHHHhccccccceEeecccccCHHHHHHHHHhhcCCeEEEEEEecc Confidence 3433 3567777788888888887754433322 23556778889999999999999999999987653 Q ss_pred cCCHHHHHHHHHHHhhhCCEEEEEecCcccccchhHHHHHHHHhcccCceEEEecC--CccchhHHHHHHHHHhccCCCc Q lcl|NC_019918. 195 SHADDDIMAVATHIEGTKKVFIGATAQANTKTSAENDIASRLVAAGFQRTALIYHP--NADAQFPECAWVGYQLQEQPGS 272 (428) Q Consensus 195 ~~~~~~~~ala~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~t~~~y~~--~~~~~~~~a~~~~~~~~~~~g~ 272 (428) +++++++++||+|+|+++++|+|..++.+.. ...+.+..++...+.++...+.. ..+...+++.+++.+|+..+|+ T Consensus 241 ~~td~~~lalA~wiea~~~~f~~~~~~~~a~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~aa~~g~~as~nf~~~ng~ 318 (507) T protein:vir:99 241 ALTNDQITAVASWNASQNNMYMYSVPTTIAN--IGTLYAAVKGFSGCALNITSDSLPVDYIEQSPCEILAATDYTRVNAT 318 (507) T ss_pred ccChHHHHHHHHHHhhcCcEEEEEEecCchh--hhhhhhhhhhcceeEEEeecccccchhHHHHHHHHHHhhccCcCccc Confidence 3689999999999999999999987765432 23344455555555444332211 1122344445555557788999 Q ss_pred eeeeeeeecCccccCCCHHHHHHHHhCCceEEEEEcCc----eeeecCEecCCc----hhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019918. 273 NTWTHKALAAVDAYRLTPTESTNLKNKNVTTFERVGGV----NRTFGGAMAGGE----WIDVMIFVDWLEARMTERLWFR 344 (428) Q Consensus 273 ~t~~fk~~~Gv~~~~~t~t~~~~l~~~~~n~y~~~~~~----~~~~~G~~~~G~----~iD~~~~~dwl~~~lq~~l~~l 344 (428) +|||||+++||+|++++++|+++|+++|||||+.+++. 
.++++|+|++|+ |+|.++++|||+++||++|++| T Consensus 319 ~T~~fk~l~GV~a~~lt~t~a~al~~~n~N~y~~~a~~~~~~~~~~~G~~~gG~~~fid~d~~~~~~WL~~~iq~~l~~l 398 (507) T protein:vir:99 319 QNYMYYQFPSRNITVSDDTTANLVDANRGNYIGQTQSAGQSLAFYQRGILCGGPNDAVDMNIYANEIWLKSAISAQILSL 398 (507) T ss_pred eeecccccCCcccccCCHHHHHHHHhcCCeEEEEeccccceeeEEecCeeeCCcccceeeeeecchHHHHHHHHHHHHHH Confidence 99999999999999999999999999999999999873 478999999995 5667889999999999999999 Q ss_pred HHhcCCCCcCHhHHHHHHHHHHHHHHHHHhcCceecC--------------------------CceEEEeCchHhCCHHH Q lcl|NC_019918. 345 MANSKKIPYDAVGATILESEIRAQLNEGIRVGGLAEA--------------------------PAPKVFVPDVLSMSPNM 398 (428) Q Consensus 345 l~~~~kip~~~~G~~~i~~~i~~~~~~~~~~G~I~~g--------------------------~~~~v~~~~~~~~~~~d 398 (428) |.+++|||||+.|+++|+++|+++|++|++||+|+|| +||+++.|+++.|+++| T Consensus 399 ~~~~~kIPyt~~G~~~l~a~i~~~l~~av~nG~I~~Gvtl~~~q~~~in~~~~~~~~~~~~~~~Gyy~~~~~~s~~~~~~ 478 (507) T protein:vir:99 399 FLNVPRVPANETGESMLLSVIQSVVNTAKNNGTISAGKNLNVIQQQYITQISGDANAWRQVANIGYWLNITFSSYTNPNT 478 (507) T ss_pred HhcCCCCccChhhHHHHHHHHHHHHHHHHhccccccCCcccccchheecccccccccccceeccceEEEeCChHhcChhh Confidence 9999999999999999999999999999999999997 57999999999999999 Q ss_pred HhccccCCeEEEEEECceEEEEEEEEEEe Q lcl|NC_019918. 399 RAQRIFEGIEFEARLAGAIHFVHIRGTVT 427 (428) Q Consensus 399 ra~R~~~~i~~~~~~agaih~v~i~~~v~ 427 (428) |++|++|+++|||+++|+||+|+|..++. T Consensus 479 r~~r~~~~~~~~y~~~gaI~~v~~~~~~v 507 (507) T protein:vir:99 479 QLTEWKASYQLIYSKDDAIRFVEGTDTLI 507 (507) T ss_pred hhccccceEEEEEEeCCeEEEEEeeeecC Confidence 99999999999999999999999998888 No 10 >protein:vir:107720 Length: 515 # NCBI annotation: gp17 # Family: family:all:396 # MgeID: mge:1520 # MgeName: BcepB1A # Cross-refs: genbank:acc:YP_024865;genbank:gi:48697507;genbank:GeneID:2948336 Probab=100.00 E-value=1.6e-104 Score=589.65 Aligned_cols=422 Identities=15% Similarity=0.105 Sum_probs=337.2 Q ss_pred CC-CCCceEEEeeee-ecccccccccceEEEEcccCCCccceEEeeCHHHHHhhcCCChHHHHHHHHHHh----cCCccc Q lcl|NC_019918. 
1 MT-VLTDVIDIQISR-ETAAVAQTNFNVPLFIASHTNFSERARVYNSLKGVAEDFGESDPTYLAAVRYFG----QALKPR 74 (428) Q Consensus 1 M~-~is~iV~V~i~~-~~~~~~~~~f~~~li~~~~~~~~~~~~~y~s~~~V~~~fg~~s~eY~aA~~~F~----q~p~P~ 74 (428) || |.+++|+|++++ .+.+..+++|+++||+.++..|.||+|+|+|+++|++|||++|||||||++||+ |+|||+ T Consensus 1 m~I~~~~~V~i~~~v~aa~~~~~~~f~~li~t~~~~~p~~r~~~y~s~~~V~~~FG~~S~ey~aA~~yFsg~~~q~p~P~ 80 (515) T protein:vir:10 1 MPISFDKYVAITSGVAAQQQIAARSFAIRVYTPNPMVSVDRLITATSAADVGAYFGTASEEYKRAVKNFGFISKKTRRPT 80 (515) T ss_pred CCCCceeEEEeecccccCCccccccceeeeeecccCCCccceeeecCHHHHHHhcCCChHHHHHHHHHhhhccCCccccc Confidence 98 555555555554 445667789999999999999999999999999999999999999999999999 999999 Q ss_pred EEEEEeeecccccccchheee------------cccccccc--------cceeeeeeecccchhhhhhhhhee------- Q lcl|NC_019918. 75 SLVIGRRQVPSATVSVSVVQE------------GQSYVLTV--------NGLPVSYVSHQDDTATLIATGLKA------- 127 (428) Q Consensus 75 ~l~igr~~~~~~~~~~~~~~~------------~~~~~~~v--------~g~~~s~~~~~~~~a~~i~a~l~~------- 127 (428) +|+||||+++.....+.+... .+..+++| .+++++..++++++|+.|++++.. T Consensus 81 ~L~igR~~~~a~~~~l~g~~~~~~~l~~~~~is~G~ltitidG~~~~t~s~i~~S~ats~~~vAs~i~tal~~~~~~~~~ 160 (515) T protein:vir:10 81 SIQFARWQREAGPVAIYGGAKKAAALATLQAVTAGAISFLFGGATTVTVSGISFSAATSLADVASELQTALRANADANLA 160 (515) T ss_pred EEEEEeccCcccceEEEeccchhhhHHhhhcccceeEEEEEcceEEEEeeccccccccCHHHHHHHHHhhhccccccccc Confidence 999999998776654432211 12233344 456678888999999999999864 Q ss_pred ----eecc--cceEEEEeeccccceeeeecccc----------ccccccceEEEEeeccccCHHHHHHHHHhcccCceEE Q lcl|NC_019918. 128 ----AYDV--TPVVGVTVTDNEDGTLTVASNGD----------WSLKVSSNLTMAAAPSTEGWPATITAVQGENDEWYAL 191 (428) Q Consensus 128 ----a~~~--~~~~~~~~tt~~~~~~t~as~~~----------~~~~~s~~~~~~~~~aa~~~~~al~~~~~~~~~w~~~ 191 (428) .++. .+|...+.+++..++++++.... 
.+.....+.++..+.++|++.++|.++.+.+++||++ T Consensus 161 ~~tv~~d~~~~~F~v~s~~tG~~~~is~~~~t~~~~~t~~a~~lglt~~~~av~~~g~aaet~~~a~~a~~~~s~nWy~f 240 (515) T protein:vir:10 161 TCTVSYDPVGARFNFAGSPSDDTVQESISIVPQSNPAIDVAQLLGWNSAQGASYIAASPVVSPVDTLIASVAGNNNFGSI 240 (515) T ss_pred eeEEEEecCCCeEEEEEeecCCceeEEEEEecCCCchhhHHHHhccccccceEEecccccccHHHHHHHHHhccCCeEEE Confidence 3433 35666666777777776554332 1222334567788899999999999999999999988 Q ss_pred EEec-----CCHHHHHHHHHHHhhhCCEEEEEecCcccccchhHHHHHHHHhcccCceEEEecCC--ccchhHHHHHHHH Q lcl|NC_019918. 192 SIDS-----HADDDIMAVATHIEGTKKVFIGATAQANTKTSAENDIASRLVAAGFQRTALIYHPN--ADAQFPECAWVGY 264 (428) Q Consensus 192 ~~~~-----~~~~~~~ala~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~t~~~y~~~--~~~~~~~a~~~~~ 264 (428) ++.+ .+++++++++.|+++++++|++.+...........+.. .....+.++...++.. .+...+++++++. T Consensus 241 ~~a~~~~~~~~~a~~~a~a~~~e~~~~~~~~~~~~~~~~~~~~~a~~--~~~~~~~~~~~~~~~~~~~~~a~~~g~~asv 318 (515) T protein:vir:10 241 LFTKNGGTGITLSDAEAIALQNQSYNVAYKFQVGVDDTTYSSWQAAL--AAIGGVNMIYSPVALAAEYHDMQDGIIEAAT 318 (515) T ss_pred EEeecCccccchhHHHHHHHHHhhcCceEEEEeccCccceechhhhh--hhhhhcCceEEEEeccCcchHHHHHHHHHhc Confidence 7753 45789999999999999999988765554433332222 2334556666665442 2344566666777 Q ss_pred HhccCCCceeeeeeeecCccccCCCHHHHHHHHhCCceEEEEEcCc----eeeecCEecCCc----hhHHHHHHHHHHHH Q lcl|NC_019918. 265 QLQEQPGSNTWTHKALAAVDAYRLTPTESTNLKNKNVTTFERVGGV----NRTFGGAMAGGE----WIDVMIFVDWLEAR 336 (428) Q Consensus 265 ~~~~~~g~~t~~fk~~~Gv~~~~~t~t~~~~l~~~~~n~y~~~~~~----~~~~~G~~~~G~----~iD~~~~~dwl~~~ 336 (428) +|+..+|++|||||+++||+|++++++|+++|++||||||+.+.+. 
.+++||+|++|+ |||++||+|||+++ T Consensus 319 nf~~~ng~iT~kfKq~~Gita~~lt~t~a~al~~~~~N~Y~~~~~~~~~~~~~~~G~~~gG~~~~~WiD~~~g~~WL~~~ 398 (515) T protein:vir:10 319 DFTQQGGATGYMYVQFNNQTPAVNDDTLSGILDDLNINYYGQTQVNGTNLSFYQDGVMMGGPTDPRDSNVYANEQWLKSY 398 (515) T ss_pred CCCccchhheeccccCCCCccccCCHHHHHHHHhcCCeEEEEEeccCceEEEEeCCeeeCCccchhHHHHHhhHHHHHHH Confidence 7888899999999999999999999999999999999999999763 477999999986 79999999999999 Q ss_pred HHHHHHHHHHhcCCCCcCHhHHHHHHHHH-HHHHHHHHhcCceecC--------------------------CceEEEeC Q lcl|NC_019918. 337 MTERLWFRMANSKKIPYDAVGATILESEI-RAQLNEGIRVGGLAEA--------------------------PAPKVFVP 389 (428) Q Consensus 337 lq~~l~~ll~~~~kip~~~~G~~~i~~~i-~~~~~~~~~~G~I~~g--------------------------~~~~v~~~ 389 (428) ||++|++||.+++|||||+.|+++|+++| +++|++|++||+|+|| +|||++.+ T Consensus 399 iq~~l~~L~~s~~KIPytd~G~a~i~a~v~q~vl~~av~nG~I~~Gv~ls~~Q~~~i~~~~g~d~~~~~~~~~Gyy~~~~ 478 (515) T protein:vir:10 399 AGASFMSLQLAQGKIPANIEGRGLLLGKMTKDIIPAAKLNGTFSIGKTLTVDQQLFVTELTGDDTAWQKVQNLGYWYDVQ 478 (515) T ss_pred HHHHHHHHHhcCCCCccChhhHHHHHHHHHHHHHHHHHhCCeeecCcccchhHHHHHHhhhcCcccccchhhcceeEecC Confidence 99999999999999999999999999987 5799999999999998 68999999 Q ss_pred chHhCCHHHHhccccC--CeEEEEEECceEEEEEEEEEEe Q lcl|NC_019918. 390 DVLSMSPNMRAQRIFE--GIEFEARLAGAIHFVHIRGTVT 427 (428) Q Consensus 390 ~~~~~~~~dra~R~~~--~i~~~~~~agaih~v~i~~~v~ 427 (428) +...++ |..|..+ ++.|||+++|+||+|++..++. 
T Consensus 479 ~~~~~~---~~~r~~~~~~~~~~y~~g~~i~~i~~~~~~v 515 (515) T protein:vir:10 479 ISSFVD---TGGTTKYQAVYSLVYSKDDLIRKVVGTHTLI 515 (515) T ss_pred cCCCCC---cccccccCceeEEEEEcCceEEEEEeeeecC Confidence 886655 4555544 4579999999999999988888 No 11 >protein:vir:80052 Length: 331 # NCBI annotation: gp14 # Family: family:all:396 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468718;genbank:gi:157325298;genbank:GeneID:5601743 Probab=100.00 E-value=3e-94 Score=533.34 Aligned_cols=321 Identities=22% Similarity=0.322 Sum_probs=287.0 Q ss_pred CCCceEEEeeeeecc---cccccccceEEEEcccCCCccceEEeeCHHHHHhhcCCChHHHHHHHHHHhcCCcccEEEEE Q lcl|NC_019918. 3 VLTDVIDIQISRETA---AVAQTNFNVPLFIASHTNFSERARVYNSLKGVAEDFGESDPTYLAAVRYFGQALKPRSLVIG 79 (428) Q Consensus 3 ~is~iV~V~i~~~~~---~~~~~~f~~~li~~~~~~~~~~~~~y~s~~~V~~~fg~~s~eY~aA~~~F~q~p~P~~l~ig 79 (428) =|||||+|+|.+... +...++|+.+++++. .+|+|.|+++++|+.|||.++|+||+|.++|+|.|+|.+++++ T Consensus 1 ~~~~iv~V~v~~~~~~~~~~~~~~~~~~~~~~t----~~~~~~y~s~~~v~~d~~~~~~~Ykaa~~~f~Q~~~~~~i~v~ 76 (331) T protein:vir:80 1 MVETITDVRVHISVLYPSPRIGLGRPAIFVKGT----AMGYKEYTTLEELKDTFADNTEVYAKAKAVFLQKDRPDTVAVI 76 (331) T ss_pred CccceecceeeecccccccccccCcceeEEecc----ccceEEEechhhhccCCCCCcHHHHHHHHHHhccCccceEEEe Confidence 689999999998743 344456666666554 3789999999999999999999999999999999999999998 Q ss_pred eeecccccccchheeecccccccccceeeeeeecccchhhhhhhhheeeecccceEEEEeeccccceeeeeccccccccc Q lcl|NC_019918. 
80 RRQVPSATVSVSVVQEGQSYVLTVNGLPVSYVSHQDDTATLIATGLKAAYDVTPVVGVTVTDNEDGTLTVASNGDWSLKV 159 (428) Q Consensus 80 r~~~~~~~~~~~~~~~~~~~~~~v~g~~~s~~~~~~~~a~~i~a~l~~a~~~~~~~~~~~tt~~~~~~t~as~~~~~~~~ 159 (428) ++..+ T Consensus 77 ~~~~~--------------------------------------------------------------------------- 81 (331) T protein:vir:80 77 TYEDT--------------------------------------------------------------------------- 81 (331) T ss_pred ccchH--------------------------------------------------------------------------- Confidence 65311 Q ss_pred cceEEEEeeccccCHHHHHHHHH-hcccCceEEEEecCCHHHHHHHHHHHhhhCCEEEEEecCcccccchhHHHHHHHHh Q lcl|NC_019918. 160 SSNLTMAAAPSTEGWPATITAVQ-GENDEWYALSIDSHADDDIMAVATHIEGTKKVFIGATAQANTKTSAENDIASRLVA 238 (428) Q Consensus 160 s~~~~~~~~~aa~~~~~al~~~~-~~~~~w~~~~~~~~~~~~~~ala~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~l~~ 238 (428) +.+.++. ..+++|||+.+.++++++++++|+|+++++++|++..++ +.++.++. T Consensus 82 ----------------~~~~a~~a~~~~~w~~~~~~~~~~~~~~a~a~~~~a~~~~f~~~~~~---------~~~~~~~~ 136 (331) T protein:vir:80 82 ----------------KLLEAAEAYFLKSWHFALLAEFKAADALALSNLIEEQKFKFAVFQVT---------AVADITPL 136 (331) T ss_pred ----------------HHHHHHHHhccCceeEEEeecCCHHHHHHHHHHHhhCCcEEEEEecC---------chHHHHHh Confidence 0112222 235779999999999999999999999999999887643 33455666 Q ss_pred cccCceEEEecCCccchhHHHHHHHHHhccCCCceeeeeee-ecCccccCCCHHHHHHHHhCCceEEEEEcCceeeecCE Q lcl|NC_019918. 
239 AGFQRTALIYHPNADAQFPECAWVGYQLQEQPGSNTWTHKA-LAAVDAYRLTPTESTNLKNKNVTTFERVGGVNRTFGGA 317 (428) Q Consensus 239 ~~~~~t~~~y~~~~~~~~~~a~~~~~~~~~~~g~~t~~fk~-~~Gv~~~~~t~t~~~~l~~~~~n~y~~~~~~~~~~~G~ 317 (428) .++.|++++||+..+ ++++++++|++++.+||++|||||+ ++||+|++++.+|+++|+++|||||+++++..++++|+ T Consensus 137 ~~~~~t~~~~~~~~~-~~~~aa~~g~~~~~~~g~~t~~fk~~l~GV~~~~lt~t~~~al~~~~~N~y~~~~~~~~~~~G~ 215 (331) T protein:vir:80 137 AKNTRTIAIVHSKTG-EKLDAALIGNVASLPVGSATWKGRHGLAGITSEELKVSEIDAIQKAGGMCYIEKAGIAQTSEGK 215 (331) T ss_pred hccccEEEEEcCCcc-chhHHHHHHHHHhcCccceeeeeecccCCCCCCCCCHHHHHHHHhcCceEEEEecCeeEEecce Confidence 778999999998664 6889999999999999999999997 89999999999999999999999999999999999999 Q ss_pred ecCCchhHHHHHHHHHHHHHHHHHHHHHHhcCCCCcCHhHHHHHHHHHHHHHHHHHhcCceecC-----CceEEEeCchH Q lcl|NC_019918. 318 MAGGEWIDVMIFVDWLEARMTERLWFRMANSKKIPYDAVGATILESEIRAQLNEGIRVGGLAEA-----PAPKVFVPDVL 392 (428) Q Consensus 318 ~~~G~~iD~~~~~dwl~~~lq~~l~~ll~~~~kip~~~~G~~~i~~~i~~~~~~~~~~G~I~~g-----~~~~v~~~~~~ 392 (428) |++|+|||++||+|||+++||++|++||++++|||||+.|+++|+++|+++|+++++||+|+|| ++|+|+.|+++ T Consensus 216 ~~~G~~iD~~~~~dWl~~~lq~~l~~ll~~~~kiPy~~~G~~~l~a~i~~~~~~av~~G~I~~g~~~~~~~~~v~~~~~~ 295 (331) T protein:vir:80 216 TVSGEFIDSIHGDDWIKATIETRLQKLLTETDKLTFDARGIALLQSELTTVLNEGFANGIIDSNDETGEPNFSITALQRS 295 (331) T ss_pred EeCchhHHHHHHHHHHHHHHHHHHHHHHHhCCCCccChhhHHHHHHHHHHHHHHHHhCCceecCccCCCcceEEEeCchh Confidence 9999999999999999999999999999999999999999999999999999999999999998 48999999999 Q ss_pred hCCHHHHhccccCCeEEEEEECceEEEEEEEEEEec Q lcl|NC_019918. 
393 SMSPNMRAQRIFEGIEFEARLAGAIHFVHIRGTVTV 428 (428) Q Consensus 393 ~~~~~dra~R~~~~i~~~~~~agaih~v~i~~~v~~ 428 (428) +|+++||++|++||++|+|+++|+||+|+|+++|+| T Consensus 296 ~~s~~dr~~R~~~~~~~~~~~~gaI~~v~i~~~v~~ 331 (331) T protein:vir:80 296 DLNDDDIAKRNYKGLSFRYKRSGAIHSVDVYGEVEV 331 (331) T ss_pred cCCHHHHhccCCCCeEEEEEEcceEEEEEEEEEEeC Confidence 999999999999999999999999999999999999 No 12 >protein:vir:3165 Length: 426 # NCBI annotation: capsid protein CP67 # Family: family:all:28419 # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665936;genbank:gi:22091122;genbank:GeneID:951267 Probab=100.00 E-value=8.8e-76 Score=432.06 Aligned_cols=400 Identities=16% Similarity=0.146 Sum_probs=298.4 Q ss_pred CCCCCceEEEeeeeecccccccccceEEEEcccCCCc-----cceEEeeCHHHHHhhcCCChHHHHHHHHHHhcCCcccE Q lcl|NC_019918. 1 MTVLTDVIDIQISRETAAVAQTNFNVPLFIASHTNFS-----ERARVYNSLKGVAEDFGESDPTYLAAVRYFGQALKPRS 75 (428) Q Consensus 1 M~~is~iV~V~i~~~~~~~~~~~f~~~li~~~~~~~~-----~~~~~y~s~~~V~~~fg~~s~eY~aA~~~F~q~p~P~~ 75 (428) || .+||||+|+++++++.+++|+.+||+|.|...+ +|++.|+|+++|++|||.+||+||||.++|+|.++ T Consensus 1 m~--~~iVnV~Is~~t~A~~~~~Fg~~liigs~~~~~p~~~f~~~~~Yss~~~V~~Dfg~~s~~Y~AA~~~f~Q~~~--- 75 (426) T protein:vir:31 1 MP--KQIVEIELTAEIADRPQETFTDAAIVGTAEEEPPDAEFGEVNQYSTSTSVGDDYGEDSDVYTASEAIEEMGAE--- 75 (426) T ss_pred CC--cceEEEEeecccccccccccceeeeeeeccccccccccchhhhhhhHHHHHhcCCCChHHHHHHHHHHhCCce--- Confidence 99 699999999999999999999999999997532 47888999999999999999999999999999865 Q ss_pred EEEEeeecccccccchheeecccccccccceeeeeeecccchhhhhhhhheeeecccceEEEEeeccccceeeeeccccc Q lcl|NC_019918. 76 LVIGRRQVPSATVSVSVVQEGQSYVLTVNGLPVSYVSHQDDTATLIATGLKAAYDVTPVVGVTVTDNEDGTLTVASNGDW 155 (428) Q Consensus 76 l~igr~~~~~~~~~~~~~~~~~~~~~~v~g~~~s~~~~~~~~a~~i~a~l~~a~~~~~~~~~~~tt~~~~~~t~as~~~~ 155 (428) +||+....++.. +.....+..+|.|+.++....+.++++.+...+.+..+.............+++.+....... 
T Consensus 76 --~~r~~v~~at~~---~~~~~t~~~tv~g~~~s~~a~~~~~a~~i~~~~~~~~~~~~~~~~~~~~t~~g~~t~~~~~~~ 150 (426) T protein:vir:31 76 --QWRVMVLEATEV---TEEELSDGDTIDKVPILGNHEVESPDGDIEFTTDDDPDVEDFDAEIVINSATGDVATSEDSIE 150 (426) T ss_pred --eEEeecccccee---eeccCCcceeecceeeeecccCcchHHHHHHhhccccccccceeeeEeccccceeecccccee Confidence 555433322221 223445667899999999999999999999999888877766666555544444443322211 Q ss_pred cccccceEEEEee--ccc-cCHHHHHHHHHhcccCceEEEEecCCHHHHHHHHHHHhhhCCEEEEEecCcccccchhHHH Q lcl|NC_019918. 156 SLKVSSNLTMAAA--PST-EGWPATITAVQGENDEWYALSIDSHADDDIMAVATHIEGTKKVFIGATAQANTKTSAENDI 232 (428) Q Consensus 156 ~~~~s~~~~~~~~--~aa-~~~~~al~~~~~~~~~w~~~~~~~~~~~~~~ala~~~~a~~~~~~~~~~~~~~~~~~~~~~ 232 (428) . ....+ ... +....+-.........|+... .+..++..|.+++.++++.............+++ T Consensus 151 ~-------~~s~~dw~~~~~~~s~~~~~~ia~~~~~~~~~------~~~~~~~~wa~~~~i~~va~~~e~~~~~~~~~~~ 217 (426) T protein:vir:31 151 L-------TYFHADWSQLDEFPSDVNNFAVADRRFDLKGV------GVLDETHSWASDEDMGMIANGVNVDDYDSVDEAM 217 (426) T ss_pred e-------eeccCcchhhhcccccchhhhhhccccchhhh------hhhHhhhhhhhhcceeeeeeccchhhhcchhhhh Confidence 0 00000 000 000111111122233454221 2234678899999999998887777777778899 Q ss_pred HHHHHhcccCceE--EEecCCccchhHHHHHHHHHhccC-----------CCceeeeeeeecCccccCCCHHHHHHHHhC Q lcl|NC_019918. 233 ASRLVAAGFQRTA--LIYHPNADAQFPECAWVGYQLQEQ-----------PGSNTWTHKALAAVDAYRLTPTESTNLKNK 299 (428) Q Consensus 233 ~~~l~~~~~~~t~--~~y~~~~~~~~~~a~~~~~~~~~~-----------~g~~t~~fk~~~Gv~~~~~t~t~~~~l~~~ 299 (428) +.+++..+|.++. ..|..... ....++.++.....+ .+...++|++.+|+... +...++.. 
.++ T Consensus 218 a~~~~~~~y~p~~~~~~~~~~~~-~~~~~~~~~~~aa~~~~~~~~~~~~~~~~~~~~~~~~~gv~~t-~~~~~~A~-~~~ 294 (426) T protein:vir:31 218 DVAHEVAGYVPSGDLMMIVDASD-DDLAAYQLGKFAVSEPWYNPLWNELPAGETVSKNVGDPEEQGT-FEGGDEAE-GEG 294 (426) T ss_pred hhhhcccccccchhheeehhccc-cchhhHHhhhhhhhccccchhhhhccccccceeeccccccccc-cchhhhhh-hcC Confidence 9999999996554 44433322 233566666555443 23455678888888844 33334444 458 Q ss_pred CceEEEEEcCcee-----eecCEecCCchhHHHHHHHHHHHHHHHHHHHHHHhcCCCCcCHhHHHHHHHHHHHHHHHHHh Q lcl|NC_019918. 300 NVTTFERVGGVNR-----TFGGAMAGGEWIDVMIFVDWLEARMTERLWFRMANSKKIPYDAVGATILESEIRAQLNEGIR 374 (428) Q Consensus 300 ~~n~y~~~~~~~~-----~~~G~~~~G~~iD~~~~~dwl~~~lq~~l~~ll~~~~kip~~~~G~~~i~~~i~~~~~~~~~ 374 (428) ++|+|..+.+.+. +.+|++++|+|||++|++|||+++||++|++||.+++|||||+.|++||++.|+++|+++++ T Consensus 295 ~~n~~~~~~~~~~i~~~~~~~G~~~~G~~iD~~~g~dwl~~~iq~~l~~ll~~~~KIpyt~~Gi~~I~~~i~~~L~~~v~ 374 (426) T protein:vir:31 295 PVNVLIDVSDANRVSNAVTTAGADSDTSFFDIRRTKVYTAEMLELDLESLQVSDDDVPFTEDGQAMIEDAIKGTMSGLTG 374 (426) T ss_pred CceEEEEecCceeeecceeecccccchhhhhhHHHHHHHHHHHHHHHHHHhhcCCCCccchhHHHHHHHHHHHHHHHHhc Confidence 8999999987754 56799999999999999999999999999999999999999999999999999999999998 Q ss_pred cCceecCCceEEEeCchHhCCHHHHhccccCCeEEEEEECceEEEEEEEEEEec Q lcl|NC_019918. 375 VGGLAEAPAPKVFVPDVLSMSPNMRAQRIFEGIEFEARLAGAIHFVHIRGTVTV 428 (428) Q Consensus 375 ~G~I~~g~~~~v~~~~~~~~~~~dra~R~~~~i~~~~~~agaih~v~i~~~v~~ 428 (428) .|. 
.+-++|+|+.|.+++++ +||++|++++|+|.++|+||||.++|+|+|+| T Consensus 375 ~~g-~~~~~y~v~~P~~~~~~-~dra~R~~~~i~~~~~laGAIh~v~I~g~v~v 426 (426) T protein:vir:31 375 SVG-QPLAEYEVDVPEWDDDD-VDRVNRNWGGIDLDARLAQRAHTFSLGLNVSV 426 (426) T ss_pred CCC-ccccceeecCCCccccc-hhhhhhccCCceEEEEEeCcEEEEEEEEEEeC Confidence 653 33457999999988865 69999999999999999999999999999999 No 13 >protein:vir:102957 Length: 437 # NCBI annotation: sheath tail protein # Family: family:all:632 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945292;genbank:gi:39653727;uniprot:Q708M0;genbank:GeneID:2672871 Probab=99.36 E-value=6.3e-11 Score=76.50 Aligned_cols=396 Identities=13% Similarity=0.057 Sum_probs=209.2 Q ss_pred CC--------CC--CceEEEeeeeecccccccccceEEEEcc-cCCCccceEEeeCHHHHHhhcCCC--hHHHHHHHHHH Q lcl|NC_019918. 1 MT--------VL--TDVIDIQISRETAAVAQTNFNVPLFIAS-HTNFSERARVYNSLKGVAEDFGES--DPTYLAAVRYF 67 (428) Q Consensus 1 M~--------~i--s~iV~V~i~~~~~~~~~~~f~~~li~~~-~~~~~~~~~~y~s~~~V~~~fg~~--s~eY~aA~~~F 67 (428) |+ .+ .-++++... ....+....=+...|.+. .-.|.+.....+|-+|....||.. .+.|++.+.+| T Consensus 1 m~gg~~~~~~k~~PGvYi~~~~~-~~~~i~~~~~~~~a~~~~~~~Gp~~~~~~i~s~~d~~~~fG~~~~~~~~~~~~~~~ 79 (437) T protein:vir:10 1 MAGGIWKRQNKVRPGAYINVKSK-DIAMTRLGGDGVVTVPLALSFGQSKKLMKIRRGEDLFKKLGYEQESPQLLLLNEAF 79 (437) T ss_pred CCcceecccceecCceeEEEecC-CcceeeccCCcEEEEEEEecCCCCceeEEEecHHHHHHHcCCccchhHHHHHHHHh Confidence 44 11 112333211 111222222234344433 346677777778889999999964 45677777777 Q ss_pred hcCCcccEEEEEeeec-ccccccchh-eeeccccccccc-ceeeeeeecccc-hhhhhhhhheeeecccceEEEEeeccc Q lcl|NC_019918. 68 GQALKPRSLVIGRRQV-PSATVSVSV-VQEGQSYVLTVN-GLPVSYVSHQDD-TATLIATGLKAAYDVTPVVGVTVTDNE 143 (428) Q Consensus 68 ~q~p~P~~l~igr~~~-~~~~~~~~~-~~~~~~~~~~v~-g~~~s~~~~~~~-~a~~i~a~l~~a~~~~~~~~~~~tt~~ 143 (428) .+ ++++++.|-.. +.+..++.. ++.-..+...+. .+.+.......+ ....+. ........-........ 
T Consensus 80 ~g---~~~~~~~R~~~g~~a~~tl~~~~~~~A~~~G~~gn~i~v~v~~~~~d~~~~~v~----~~~~~~~~d~~~v~~~~ 152 (437) T protein:vir:10 80 KR---VSEVLLYRLNTGEKANVSLSDNVTAQAKYSGVRGNDITVTVKTNVDDPSSFDVV----TFLDTVVMDLQTVKVLA 152 (437) T ss_pred cC---CCEEEEEECCCCceeeEeeccceEEEeccCCcccceeEEEEeeccCCccceEEE----EecCcceeeeeehhhhh Confidence 53 67899988532 111111111 111000111111 011111111110 000000 00000000000000000 Q ss_pred c---ceeeeeccccccccccceEEEEeec----cccCHHHHHHHHHhcccCceEEEEecCCHHHHHHHHHHHhhh----C Q lcl|NC_019918. 144 D---GTLTVASNGDWSLKVSSNLTMAAAP----STEGWPATITAVQGENDEWYALSIDSHADDDIMAVATHIEGT----K 212 (428) Q Consensus 144 ~---~~~t~as~~~~~~~~s~~~~~~~~~----aa~~~~~al~~~~~~~~~w~~~~~~~~~~~~~~ala~~~~a~----~ 212 (428) + ....... ....+....+..++.+. ..+++.++|+++... +|..+.+...+.+.+.++..|++.. + T Consensus 153 ~~~~n~~v~~~-~~~~l~~~a~~~LtGG~dg~~t~~dy~~al~~le~~--~~n~l~~~~~d~~~~t~~~~~ik~~r~~~g 229 (437) T protein:vir:10 153 DLKNNALVEFS-GTGELQPVAGAKLTGGTDGAISTQDYLEYFKALETV--EFNYMALPVEDASIKKAAINFIKRMREDEG 229 (437) T ss_pred hhhhhcccccc-cccccccccceeeeccccCCCChhHHHHHHHHhccC--cceEEEecCCChhHHHHHHHHHHHHHhccC Confidence 0 0000000 00111111222233222 235678888888755 4555666777778889999998753 3 Q ss_pred CEEEEEecCcccccchhHHHHHHHHhcccCceEEEecC-CccchhHHHHHHHHHhccCCCceeeeeeeecCcc-c-cCCC Q lcl|NC_019918. 213 KVFIGATAQANTKTSAENDIASRLVAAGFQRTALIYHP-NADAQFPECAWVGYQLQEQPGSNTWTHKALAAVD-A-YRLT 289 (428) Q Consensus 213 ~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~t~~~y~~-~~~~~~~~a~~~~~~~~~~~g~~t~~fk~~~Gv~-~-~~~t 289 (428) +++..+...... ..+.+. +........+. .-+....+++++|......+. ..+-||.++|+. . 
..++ T Consensus 230 ~~~~~V~~~~~~---d~e~Ii------n~~n~~~~~~~~~~~~~~~~a~vAG~~Ag~~~~-~S~t~~~~~~~~~v~~~~t 299 (437) T protein:vir:10 230 LGAQLVVADSDA---DSEAVI------NVKNGVILSDKTVIDKTKATVWVAAASANAGVE-KSLTYEKYEDSVDVVGRLS 299 (437) T ss_pred ceEEEEeCCCCC---CCceEE------EeecceeecCcceechhhHHHHHHHHhccCccc-cCccccccCCcccccccCC Confidence 344333222211 011010 01111111111 011223456777777776553 456788899874 3 5789 Q ss_pred HHHHHHHHhCCceEEEEEcCceeeecCEec----------CCchhHHHHHHHHHHHHHHHHHHHHHHhcCCCCcCHhHHH Q lcl|NC_019918. 290 PTESTNLKNKNVTTFERVGGVNRTFGGAMA----------GGEWIDVMIFVDWLEARMTERLWFRMANSKKIPYDAVGAT 359 (428) Q Consensus 290 ~t~~~~l~~~~~n~y~~~~~~~~~~~G~~~----------~G~~iD~~~~~dwl~~~lq~~l~~ll~~~~kip~~~~G~~ 359 (428) .+|++.+.++|...+.+.++.-.+-+|+.+ +...|-.++-.|.+...++..+-+.++ +|+|=+..|.. T Consensus 300 ~~e~~~~i~~G~~vl~~~~~~v~i~~gInTltt~~~~~~~~~~ki~vir~~D~i~~di~~~~~~~yi--Gk~~N~~~~r~ 377 (437) T protein:vir:10 300 HTETEDALLKGQFVFTARRGRAVVEQDINSHVSFTIEKNQDFRKNRILRTLDDIVNDTRYAFSEYFL--GKVSNNEDGRQ 377 (437) T ss_pred HHHHHHHHhCCcEEEEEeCCeEEEEEccccccccCCCCCchhhhhhHHHHHHHHHHHHHHHHHhccc--cccCCCHHHHH Confidence 999999999999999877766666666532 112455777788888777776555444 58998999999 Q ss_pred HHHHHHHHHHHHHHhcCceecCCceEEEeCchHhCCHHHHhccccCCeEEEEEECceEEEEEEEEEEe Q lcl|NC_019918. 360 ILESEIRAQLNEGIRVGGLAEAPAPKVFVPDVLSMSPNMRAQRIFEGIEFEARLAGAIHFVHIRGTVT 427 (428) Q Consensus 360 ~i~~~i~~~~~~~~~~G~I~~g~~~~v~~~~~~~~~~~dra~R~~~~i~~~~~~agaih~v~i~~~v~ 427 (428) .+++.|+..|++..+.|.|.+.....+..... . .+..--+++.+++-.++..+.+.++|. 
T Consensus 378 ~~~~~i~~yl~~l~~~g~I~~~~~~d~~v~~~---~-----~~~~v~v~~~v~~~dame~iy~ti~v~ 437 (437) T protein:vir:10 378 AFKANRIRYFKDLEARGAIEDFKVEDIEVLRG---E-----LKESVVVNVKVKPVDSMEKLYMTVTVE 437 (437) T ss_pred HHHHHHHHHHHHHHhCCCccCCCceeEEeecC---C-----CCCEEEEEEEEEEeeeeeeEEEEEEec Confidence 99999999999999999998754433332211 1 122223899999999999999999999 No 14 >protein:vir:79092 Length: 477 # NCBI annotation: gp13, phage tail sheath protein # Family: family:all:115 # MgeID: mge:1862 # MgeName: phiE255 # Cross-refs: genbank:acc:YP_001111213;genbank:gi:134288804;genbank:GeneID:4960766 Probab=99.28 E-value=2.1e-10 Score=73.62 Aligned_cols=415 Identities=12% Similarity=-0.008 Sum_probs=201.4 Q ss_pred CC-CCCceEEEe-eeeecccccccccceEEEEcccCC-CccceEEeeCHHHHHhhcC--CChHHHHHHHHHHhcCCcccE Q lcl|NC_019918. 1 MT-VLTDVIDIQ-ISRETAAVAQTNFNVPLFIASHTN-FSERARVYNSLKGVAEDFG--ESDPTYLAAVRYFGQALKPRS 75 (428) Q Consensus 1 M~-~is~iV~V~-i~~~~~~~~~~~f~~~li~~~~~~-~~~~~~~y~s~~~V~~~fg--~~s~eY~aA~~~F~q~p~P~~ 75 (428) || +...=|-|. +.-.+.++...+-+.+.|+|.+.. +...-...+|..|-...|| .+...+.+...+|.+.. .+ T Consensus 1 M~~~~~pGVyv~E~~~g~~~I~~v~Tsv~~~VG~a~~~p~n~pv~its~~d~~~~g~~~~~~tL~~Av~~~f~ngg--~~ 78 (477) T protein:vir:79 1 MAANYLHGVETIEKETGSRPVKVVKSAVIGLIGTAPIGPVNTPVQSLSDVDAAQFGPQLAGFTIPQALDAVYDYGS--GT 78 (477) T ss_pred CcCCCCCCeEEEEecCCcccccccCCceEEEEeecccCCCcccEEEccHHHHHHhcCCCCCCcHHHHHHHHhhcCC--ce Confidence 99 343334442 222345677777788888887643 3333344556666655333 55778899999998754 34 Q ss_pred EEEEeeeccc---ccccchheeec-ccccccccceeeee--eecccchhhhhhhhheeeecccc---eEEEEeeccccce Q lcl|NC_019918. 76 LVIGRRQVPS---ATVSVSVVQEG-QSYVLTVNGLPVSY--VSHQDDTATLIATGLKAAYDVTP---VVGVTVTDNEDGT 146 (428) Q Consensus 76 l~igr~~~~~---~~~~~~~~~~~-~~~~~~v~g~~~s~--~~~~~~~a~~i~a~l~~a~~~~~---~~~~~~tt~~~~~ 146 (428) +++-|-.... .......+... ........+..... ....... ...........+... ............. 
T Consensus 79 ~~vvrV~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 157 (477) T protein:vir:79 79 VIVINVLDPAVHKSNAASESVTFDAATGRAKLAHPAAANLVLKNDSGG-TTYTEGTDYAVDLINGVITRIKTGTIPAAAT 157 (477) T ss_pred EEEEeccCCccccccccccccccccccccccccccccceeEEeecccc-cccccCccccccccchhhhhhhccccccccc Confidence 4554421111 11000000000 00000000000000 0000000 000000000000000 0000000000000 Q ss_pred eeeecccccccc-ccc--eEEEEeeccccCHHHHHHHHHhcccCce-EEEEecC--CHHHHHHHHHHHhhhCCEEEEEec Q lcl|NC_019918. 147 LTVASNGDWSLK-VSS--NLTMAAAPSTEGWPATITAVQGENDEWY-ALSIDSH--ADDDIMAVATHIEGTKKVFIGATA 220 (428) Q Consensus 147 ~t~as~~~~~~~-~s~--~~~~~~~~aa~~~~~al~~~~~~~~~w~-~~~~~~~--~~~~~~ala~~~~a~~~~~~~~~~ 220 (428) ............ ... ...........+..+++.........-. .+..... ...-..+|...++.. +.+.+.-. T Consensus 158 ~~~~~~~~~~~~~~~~~~~~g~~~a~~~~tg~~al~~~~~~~~~~~~iv~apg~~~~~~v~~~l~~~~~~~-~~~a~~d~ 236 (477) T protein:vir:79 158 AAKATYDYADPTKVTAADIIGAVNAAGMRTGMKALKDTYNLYGYFSKILIAPAYCTQNSVSVELEAMAVQL-GAIAYIDA 236 (477) T ss_pred eeeceeccCCcccceeeeecccccccccchhhhhhhhhhhhcccccceeeccccccchhHHHHHHHHHhhc-CeEEEEec Confidence 000000000000 000 0000001111222223332322211111 1111121 112223344434332 23333222 Q ss_pred CcccccchhHHHHHHH----HhcccCceEEEec------CCc---cchhHHHHHHHHHhccCCCc---eeeeeeeecCcc Q lcl|NC_019918. 221 QANTKTSAENDIASRL----VAAGFQRTALIYH------PNA---DAQFPECAWVGYQLQEQPGS---NTWTHKALAAVD 284 (428) Q Consensus 221 ~~~~~~~~~~~~~~~l----~~~~~~~t~~~y~------~~~---~~~~~~a~~~~~~~~~~~g~---~t~~fk~~~Gv~ 284 (428) ..........+.-..+ ...+..|..+.|. ... ....+.+.++|.+...+.-. .....|.+.||. 
T Consensus 237 p~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~~~~~~~~~~~p~s~~~ag~~a~~d~~~g~~~span~~~~gv~ 316 (477) T protein:vir:79 237 PIGTTLAQALAGRGPAGTINFNTSSDRVRLCYPHVKVYDIATNAERLEPLSSRAAGLRARVDLDKGYWWSSSNQQLVGVT 316 (477) T ss_pred CCCCChHHHhhhhhhccccccccccceEEEEcCeeEEecccCCceeeechHHHHHHHHHHhhccCCceEccCCceeecce Confidence 1111111111100111 0112333343332 111 11245677777766554322 344566666655 Q ss_pred c--------cCCCHHHHHHHHhCCceEEEEEcCce-eeecCEecCC-------chhHHHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_019918. 285 A--------YRLTPTESTNLKNKNVTTFERVGGVN-RTFGGAMAGG-------EWIDVMIFVDWLEARMTERLWFRMANS 348 (428) Q Consensus 285 ~--------~~~t~t~~~~l~~~~~n~y~~~~~~~-~~~~G~~~~G-------~~iD~~~~~dwl~~~lq~~l~~ll~~~ 348 (428) . ...+++|.+.|.++|+|.+.++.+.+ .++.++|+.+ .|+-+.+-.+|+...|+..+..++-. T Consensus 317 ~~~~~~~~~~~~~~~~~~~L~~~~i~~i~~~~~~G~~~wG~rT~~~~~~~~~~~~i~vrR~~~~i~~~~~~~~~~~v~e- 395 (477) T protein:vir:79 317 GVERPLSAMIDDPQSDVNMLNEQGITTVFSSYGSGLRLWGNRTAAWPTVTHMRNFENVRRTGDVINESLRYFSQQFVDA- 395 (477) T ss_pred ecccccccccCCChhhHHHHhhCCceEEEEecCCcEEEEcccccCCCCCCccceeeehhhHHHHHHHHHHHHHHHhccC- Confidence 2 12457899999999999999998766 5688888743 25678888889988888888765543 Q ss_pred CCCCcCHhHHHHHHHHHHHHHHHHHhcCceecCCceEEEeCchHhCCHHHHhccccCCeEEEEEECceEEEEEEEEEEec Q lcl|NC_019918. 349 KKIPYDAVGATILESEIRAQLNEGIRVGGLAEAPAPKVFVPDVLSMSPNMRAQRIFEGIEFEARLAGAIHFVHIRGTVTV 428 (428) Q Consensus 349 ~kip~~~~G~~~i~~~i~~~~~~~~~~G~I~~g~~~~v~~~~~~~~~~~dra~R~~~~i~~~~~~agaih~v~i~~~v~~ 428 (428) |.+..-...|+..|+.-|++.++.|.|. ||.|.+ +.++.+++|+.+++.. +.+.+.....+++|.++...+. 
T Consensus 396 ---~~~~~~~~~i~~~i~~~l~~l~~~g~l~---g~~v~~-~~~~nt~~~i~~G~~~-~~i~~~p~~p~e~i~~~~~~~~ 467 (477) T protein:vir:79 396 ---PIDQGLIDSLVESVNGFGRKLIGDGALL---GFKAWF-DPARNPKEELAAGHLL-INYKYTVPPPLERLTYETEITS 467 (477) T ss_pred ---CCCHHHHHHHHHHHHHHHHHHHhCCcee---eeEEEE-ecCCCCHHHhhCCeEE-EEEEEEecCCceeEEEEEEEec Confidence 6688889999999999999999999997 588887 5667899999999986 9999999999999999999888 No 15 >protein:vir:107865 Length: 477 # NCBI annotation: gp39 # Family: family:all:115 # MgeID: mge:1565 # MgeName: BcepMu # Cross-refs: genbank:acc:YP_024712;genbank:gi:48696949;genbank:GeneID:2845953 Probab=99.27 E-value=2e-10 Score=73.71 Aligned_cols=411 Identities=11% Similarity=-0.003 Sum_probs=198.8 Q ss_pred CCC-CCceEEEee-eeecccccccccceEEEEcccC-CCccceEEeeCHHHHHhhc--CCChHHHHHHHHHHhcCCcccE Q lcl|NC_019918. 1 MTV-LTDVIDIQI-SRETAAVAQTNFNVPLFIASHT-NFSERARVYNSLKGVAEDF--GESDPTYLAAVRYFGQALKPRS 75 (428) Q Consensus 1 M~~-is~iV~V~i-~~~~~~~~~~~f~~~li~~~~~-~~~~~~~~y~s~~~V~~~f--g~~s~eY~aA~~~F~q~p~P~~ 75 (428) ||. ...=|-|.- .-.+.++...+-+.+.|+|... .|...-...+|..+....+ ..++..+.+...+|.+... . T Consensus 1 M~~~~~pGVyv~E~~~~~~~i~~v~T~v~~~VG~a~~gp~n~pv~its~~d~~~~g~~~~~~tL~~Av~~~f~nGg~--~ 78 (477) T protein:vir:10 1 MAANYLHGVETIEKETGSRPVKVVKSAVIGLIGTAPIGPVNTPVQSLSDVDAAQFGPQLAGFTIPQALDAVYDYGSG--T 78 (477) T ss_pred CcccCCCCeEEEEccCCcccccccCCceeEEEecccCCCCCcCEEEccHHHHHHhccCCCCCcHHHHHHHHHhccce--E Confidence 993 433344432 2233456677778888888654 2333334455666665432 3567888999999998654 3 Q ss_pred EEEEeeec---ccccccchhe--eecccc--c--ccccceeeeee---ecccchhhhhhhhheeeecccceEEEEeeccc Q lcl|NC_019918. 76 LVIGRRQV---PSATVSVSVV--QEGQSY--V--LTVNGLPVSYV---SHQDDTATLIATGLKAAYDVTPVVGVTVTDNE 143 (428) Q Consensus 76 l~igr~~~---~~~~~~~~~~--~~~~~~--~--~~v~g~~~s~~---~~~~~~a~~i~a~l~~a~~~~~~~~~~~tt~~ 143 (428) +++-|-.. .......... ...... . .......+... ................. ........... 
T Consensus 79 ~~vVrV~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~a~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~ 154 (477) T protein:vir:10 79 VIVINVLDPAVHKSNAANEPVTFDAATGRAKLAHPAAANLVLKNDSGGTTYAEGTDYAVDLINGV----ITRIKTGTIPP 154 (477) T ss_pred EEEEecCccccccccccccccccccccceecccccccccccccccccccccccchhhhhhhcccc----ceecccccccc Confidence 34433211 1111110000 000000 0 00000000000 00000000000000000 00000000000 Q ss_pred cceeeeeccccccc-cccceEEEEeeccccCHHHHHHHHHhcc--cCc--eEEEEecC-CHH-HHHHHHHHHhhhCCEEE Q lcl|NC_019918. 144 DGTLTVASNGDWSL-KVSSNLTMAAAPSTEGWPATITAVQGEN--DEW--YALSIDSH-ADD-DIMAVATHIEGTKKVFI 216 (428) Q Consensus 144 ~~~~t~as~~~~~~-~~s~~~~~~~~~aa~~~~~al~~~~~~~--~~w--~~~~~~~~-~~~-~~~ala~~~~a~~~~~~ 216 (428) .............. .......+. ..........+..+.... ..+ ..+..... ... -..+|...++.. +.+. T Consensus 155 ~~~~~~~~~~~~~~~~~~~~~~~g-~~~~~~~~tGl~al~~~~~~~~~~~~~l~apg~~~~~~v~~~l~~~~~~~-~~~~ 232 (477) T protein:vir:10 155 GATAAKATYDYADPTKVTAADIIG-AVNAAGMRTGMKALKDTYNLYGYFSKILIAPAYCTQNSVSVELEAMAVQL-GAIA 232 (477) T ss_pred cceeeeeccccccccccccccccc-cccccchhhhhhhhhhhhhhcchhcccccccccccchhhHHHHHHHHhhC-CEEE Confidence 00000000000000 000000000 001111111222222211 111 11111111 111 122344433322 2333 Q ss_pred EEecCcccccchhHHHHHHHH----hcccCceEEEec------CCc---cchhHHHHHHHHHhccCCCc---eeeeeeee Q lcl|NC_019918. 217 GATAQANTKTSAENDIASRLV----AAGFQRTALIYH------PNA---DAQFPECAWVGYQLQEQPGS---NTWTHKAL 280 (428) Q Consensus 217 ~~~~~~~~~~~~~~~~~~~l~----~~~~~~t~~~y~------~~~---~~~~~~a~~~~~~~~~~~g~---~t~~fk~~ 280 (428) +.-................+. ..+..|..+.|. ... 
.-..+.+.++|.....+..+ .+..+|.+ T Consensus 233 ~~d~p~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~~ag~~a~~d~~~g~~~span~~~ 312 (477) T protein:vir:10 233 YIDAPIGTTLAQALAGRGPAGTINFNTSSDRVRLCYPHVKVYDTATNAERLEPLSSRAAGLRARVDLDKGYWWSSSNQQL 312 (477) T ss_pred EEecCCCCCHHHHHhhhhhccccccccccceEEEEcCeEEEecccCCceeEEchHHHHHHHHHHhhhcCCceeccCCcee Confidence 322111111111100000000 112333444332 110 11235566777666554322 34455666 Q ss_pred cCccc---c-----CCCHHHHHHHHhCCceEEEEEcCce-eeecCEecCC-------chhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019918. 281 AAVDA---Y-----RLTPTESTNLKNKNVTTFERVGGVN-RTFGGAMAGG-------EWIDVMIFVDWLEARMTERLWFR 344 (428) Q Consensus 281 ~Gv~~---~-----~~t~t~~~~l~~~~~n~y~~~~~~~-~~~~G~~~~G-------~~iD~~~~~dwl~~~lq~~l~~l 344 (428) .||.. . ..+++|.+.|.++|+|.+.++.+.+ .++.++|+.+ .|+-+.+-.+|+...|+..+... T Consensus 313 ~gi~~~~~~~~~~~~~~~~~~~~L~~~gi~~i~~~~~~G~~~wG~rT~~~~~~~~~~~~~~vrR~~~~i~~~~~~~~~~~ 392 (477) T protein:vir:10 313 VGVTGVERPLSAMIDDPQSDVNMLNEQGITTVFSSYGSGLRLWGNRTAAWPTVTHMRNFENVRRTGDVINESLRYFSQQF 392 (477) T ss_pred ccccccccccccccCCChhhHHHHhhCCceEEEEecCCcEEEEcccccCCCCCCcccceeehhhHHHHHHHHHHHHHHHh Confidence 65542 1 2367899999999999999998766 5688888754 25677788888888888887765 Q ss_pred HHhcCCCCcCHhHHHHHHHHHHHHHHHHHhcCceecCCceEEEeCchHhCCHHHHhccccCCeEEEEEECceEEEEEEEE Q lcl|NC_019918. 345 MANSKKIPYDAVGATILESEIRAQLNEGIRVGGLAEAPAPKVFVPDVLSMSPNMRAQRIFEGIEFEARLAGAIHFVHIRG 424 (428) Q Consensus 345 l~~~~kip~~~~G~~~i~~~i~~~~~~~~~~G~I~~g~~~~v~~~~~~~~~~~dra~R~~~~i~~~~~~agaih~v~i~~ 424 (428) +-. |.+..-...|+..|+.-|++.++.|.|. ||+|.+ +.++.|++|+.+++.. +.+.+.....+++|.+.. 
T Consensus 393 v~~----~~~~~~~~~i~~~i~~~l~~l~~~g~l~---g~~v~~-~~~~nt~~~i~~G~~~-~~i~~~p~~p~e~i~~~~ 463 (477) T protein:vir:10 393 VDA----PIDQGLIDSLVESVNGFGRKLIGDGALL---GFKAWF-DPARNPKEELAAGHLL-INYKYTVPPPLERLTYET 463 (477) T ss_pred ccC----CCCHHHHHHHHHHHHHHHHHHHhCCcee---eeEEEE-ecCCCCHHHhhCCeEE-EEEEEEecCCcceEEEEE Confidence 433 6688889999999999999999999997 588888 4567899999999997 999999999999999999 Q ss_pred EEec Q lcl|NC_019918. 425 TVTV 428 (428) Q Consensus 425 ~v~~ 428 (428) .... T Consensus 464 ~~~~ 467 (477) T protein:vir:10 464 EITS 467 (477) T ss_pred EEcc Confidence 9888 No 16 >protein:vir:6079 Length: 396 # NCBI annotation: gpFI # Family: family:all:115 # MgeID: mge:126 # MgeName: WPhi # Cross-refs: genbank:acc:NP_878220;genbank:gi:33438919;genbank:GeneID:1457754 Probab=99.26 E-value=1.7e-10 Score=74.18 Aligned_cols=360 Identities=12% Similarity=0.093 Sum_probs=199.7 Q ss_pred CCC-CCceEEE-eeeeecccccccccceEEEEcccCC------CccceEEeeCHHHHHhhcCCChHHHHHHHHHHhcCCc Q lcl|NC_019918. 1 MTV-LTDVIDI-QISRETAAVAQTNFNVPLFIASHTN------FSERARVYNSLKGVAEDFGESDPTYLAAVRYFGQALK 72 (428) Q Consensus 1 M~~-is~iV~V-~i~~~~~~~~~~~f~~~li~~~~~~------~~~~~~~y~s~~~V~~~fg~~s~eY~aA~~~F~q~p~ 72 (428) |++ +=- |.| .+.-.+.++...+...+.|+|.... +...-...++..+-...||.++..+.+...+|.+... T Consensus 1 m~~~~~G-v~v~e~~~~~~~v~~~~tav~~fvGta~~~~~~~~p~~~p~~v~s~~~~~~~~g~~~tl~~a~~~~~~~gg~ 79 (396) T protein:vir:60 1 MSDYHHG-VQVLEINEGTRVISTVSTAIVGMVCTASDADAEIFPLNKPVLITNVQSAIAKAGKKGTLAASLQAIADQSKP 79 (396) T ss_pred CCCCCCC-eEEEEcCCCcccccccCceeEEEEecccccccccccCccCeEeechHHHHHhhcCcchhHHHHHHHhhccCc Confidence 985 422 333 2234556778888898999886632 2233455677888888899999999999999988643 Q ss_pred ccEEEEEeeecccccccchheeecccccccccceeeeeeecccchhhhhhhhheeeecccceEEEEeeccccceeeeecc Q lcl|NC_019918. 73 PRSLVIGRRQVPSATVSVSVVQEGQSYVLTVNGLPVSYVSHQDDTATLIATGLKAAYDVTPVVGVTVTDNEDGTLTVASN 152 (428) Q Consensus 73 P~~l~igr~~~~~~~~~~~~~~~~~~~~~~v~g~~~s~~~~~~~~a~~i~a~l~~a~~~~~~~~~~~tt~~~~~~t~as~ 152 (428) . .++-+.................... 
..+.+ ..... .....+....+ T Consensus 80 ~--~~vv~~~~~~~~~~~~~~~~~~~~~--~~~~d------~~~~~-tg~~al~~~~~---------------------- 126 (396) T protein:vir:60 80 V--TVVVRVEDGTGEDEETKLAQTVSNI--IGTTD------ENGQY-TGLKALLAAES---------------------- 126 (396) T ss_pred e--EEEEecccccccccccccccccccc--ccccc------ccccc-cchhhhhhccc---------------------- Confidence 2 2332211110000000000000000 00000 00000 00000000000 Q ss_pred ccccccccceEEEEeeccccCHHHHHHHHHhcccCceEEEEecCCHHHHHHHHHHHhhhCCEEEEEecCcccccchhHHH Q lcl|NC_019918. 153 GDWSLKVSSNLTMAAAPSTEGWPATITAVQGENDEWYALSIDSHADDDIMAVATHIEGTKKVFIGATAQANTKTSAENDI 232 (428) Q Consensus 153 ~~~~~~~s~~~~~~~~~aa~~~~~al~~~~~~~~~w~~~~~~~~~~~~~~ala~~~~a~~~~~~~~~~~~~~~~~~~~~~ 232 (428) .......+.+..+........++..+...-+ .+..++.....+..++-+|-+.-+..+......--..... T Consensus 127 ---~~~~~~~il~ap~~~~~~v~~al~~~~~~~~--~~~i~d~p~~~~~~~a~~~~~~~~s~~~~~~~p~~~~~d~---- 197 (396) T protein:vir:60 127 ---VTGVKPRILGVPGLDTKEVAVALASVCQKLR--AFGYISAWGCKTISEVKAYRQNFSQRELMVIWPDFLAWDT---- 197 (396) T ss_pred ---ceeeeeeeccccccccHHHHHHHHHHhccCC--eEEEEeCCCCCCHHHHHHHHhhcCCceEEEEeCceeeecc---- Confidence 0000011111111222223333333332211 1222333222223333345443222222211110000000 Q ss_pred HHHHHhcccCceEEEecCCccchhHHHHHHHHHhccCCCc---eeeeeeeecCccc--------cCCCHHHHHHHHhCCc Q lcl|NC_019918. 233 ASRLVAAGFQRTALIYHPNADAQFPECAWVGYQLQEQPGS---NTWTHKALAAVDA--------YRLTPTESTNLKNKNV 301 (428) Q Consensus 233 ~~~l~~~~~~~t~~~y~~~~~~~~~~a~~~~~~~~~~~g~---~t~~fk~~~Gv~~--------~~~t~t~~~~l~~~~~ 301 (428) ....- ....+.+.++|.....+..+ .....|.+.|+.. 
...+.+|++.|..+|+ T Consensus 198 --------~~~~~-------~~~p~s~~~AG~~a~~d~~~g~~~spaN~~l~gi~~~~~~~~~~~~~~~~~~~~Ln~~gI 262 (396) T protein:vir:60 198 --------VASTT-------ATAYATARALGLRAKIDQEQGWHKTLSNVGVNGVTGISASVFWDLQESGTDADLLNESGV 262 (396) T ss_pred --------cCCce-------eEEchhHHHHHHHHHhhhccCcEeCcCCceecceeeceeecccccCCCcchhhhhhhcCc Confidence 00000 01234566666655444322 2334677777652 2357889999999999 Q ss_pred eEEEEEcCceeeecCEecCCc----hhHHHHHHHHHHHHHHHHHHHHHHhcCCCCcCHhHHHHHHHHHHHHHHHHHhcCc Q lcl|NC_019918. 302 TTFERVGGVNRTFGGAMAGGE----WIDVMIFVDWLEARMTERLWFRMANSKKIPYDAVGATILESEIRAQLNEGIRVGG 377 (428) Q Consensus 302 n~y~~~~~~~~~~~G~~~~G~----~iD~~~~~dwl~~~lq~~l~~ll~~~~kip~~~~G~~~i~~~i~~~~~~~~~~G~ 377 (428) |+.....| ..++.++|++++ ||-+.+-.+|+...|+..+..++-. |.+..-...|+..|+.-|+...++|. T Consensus 263 ~~~~~~~G-~~~wG~rT~~~d~~~~~i~~rR~~~~i~~~i~~~~~~~v~e----~n~~~~~~~i~~~i~~~l~~l~~~ga 337 (396) T protein:vir:60 263 TTLIRRDG-FRFWGNRTCSDDPLFLFENYTRTAQVLADTMAEAHMWAVDK----PITATLIRDIVDGINAKFRELKTNGY 337 (396) T ss_pred EEEEcCCC-EEEEcccccCCCcccceeehhhHHHHHHHHHHHHHHHhccC----CCCHHHHHHHHHHHHHHHHHHHhCCc Confidence 99965333 467899999984 7778888899999998888875543 77889999999999999999999999 Q ss_pred eecCCceEEEeCchHhCCHHHHhccccCCeEEEEEECceEEEEEEEEEEec Q lcl|NC_019918. 378 LAEAPAPKVFVPDVLSMSPNMRAQRIFEGIEFEARLAGAIHFVHIRGTVTV 428 (428) Q Consensus 378 I~~g~~~~v~~~~~~~~~~~dra~R~~~~i~~~~~~agaih~v~i~~~v~~ 428 (428) |. ||++++. .++.+++|+.+++.. +.+.+.+...++.|.++...+. 
T Consensus 338 l~---g~~~~~d-~~~nt~~~i~~G~~~-~~i~~~p~~pae~I~~~~~~~~ 383 (396) T protein:vir:60 338 IV---DATCWFS-EESNDAETLKAGKLY-IDYDYTPVPPLENLTLRQRITD 383 (396) T ss_pred ee---ceEEEEe-cCCCCHHHhhCCEEE-EEEEEEecCCcceEEEEEEEch Confidence 97 5777774 467899999999887 9999999999999999999988 No 17 >protein:vir:1845 Length: 392 # NCBI annotation: tail sheath protein FI # Family: family:all:115 # MgeID: mge:324 # MgeName: 186 # Cross-refs: genbank:acc:NP_052269;genbank:gi:9634076;genbank:GeneID:1262448 Probab=99.23 E-value=8.4e-11 Score=75.81 Aligned_cols=358 Identities=12% Similarity=0.067 Sum_probs=199.4 Q ss_pred CCC-CCceEEEeeeeecccccccccceEEEEcccCCC------ccceEEeeCHHHHHhhcCCChHHHHHHHHHHhcCCcc Q lcl|NC_019918. 1 MTV-LTDVIDIQISRETAAVAQTNFNVPLFIASHTNF------SERARVYNSLKGVAEDFGESDPTYLAAVRYFGQALKP 73 (428) Q Consensus 1 M~~-is~iV~V~i~~~~~~~~~~~f~~~li~~~~~~~------~~~~~~y~s~~~V~~~fg~~s~eY~aA~~~F~q~p~P 73 (428) |+. +--+-=+.+.-.+.++...+-..+-+++..... ...-...++..+-...||.++....+...+|.+...+ T Consensus 1 m~~~~~Gv~v~e~~~g~~~i~~~~tav~g~vgta~~~~~~~~~~~~p~~its~~~~~~~~g~~gtl~~al~~~~~ngg~~ 80 (392) T protein:vir:18 1 MSDFHHGTKVIEINDGTRVISTVATAIVGMVWTASDADAETFPLNEPVLITNVQSAIAKAGKKGTLSASLQAIADQSKPV 80 (392) T ss_pred CCCCCCCeEEEEcCCCceeeeccCcceeEEEEeccCCCCcccccccceEeechHHHHhhcCCCcchHHHHHHhhcccCce Confidence 984 644332334445566777776777777765422 2233456788888888999988888888999876433 Q ss_pred cEEEEEeeecccccccchheeecccccccccceeeeeeecccchhhhhhhhheeeecccceEEEEeeccccceeeeeccc Q lcl|NC_019918. 74 RSLVIGRRQVPSATVSVSVVQEGQSYVLTVNGLPVSYVSHQDDTATLIATGLKAAYDVTPVVGVTVTDNEDGTLTVASNG 153 (428) Q Consensus 74 ~~l~igr~~~~~~~~~~~~~~~~~~~~~~v~g~~~s~~~~~~~~a~~i~a~l~~a~~~~~~~~~~~tt~~~~~~t~as~~ 153 (428) .. +.+-. .........++. ..+-|. ..... .......+........ 
T Consensus 81 ~~--vv~v~-~~~~~~~~~~t~-----~dliG~-----~~~~~-~~tg~~al~~~~~~~~-------------------- 126 (392) T protein:vir:18 81 TV--VVRVA-EGTGDDAEAQTT-----SNIIGG-----TDENG-KYTGIKALLTAEAVTG-------------------- 126 (392) T ss_pred EE--Eeccc-ccccccccccch-----hhheec-----ccccc-hhhhHHHHHhhhhhhc-------------------- Confidence 22 21100 000000000000 000000 00000 0000000111000000 Q ss_pred cccccccceEEEEeeccccCHHHHHHHHHhcccCceEEEEecCCHHHHHHHHHHHhhhCCEEEEEecCcccccchhHHHH Q lcl|NC_019918. 154 DWSLKVSSNLTMAAAPSTEGWPATITAVQGENDEWYALSIDSHADDDIMAVATHIEGTKKVFIGATAQANTKTSAENDIA 233 (428) Q Consensus 154 ~~~~~~s~~~~~~~~~aa~~~~~al~~~~~~~~~w~~~~~~~~~~~~~~ala~~~~a~~~~~~~~~~~~~~~~~~~~~~~ 233 (428) ....+.+..+........++..+.+.-. .+..++.....+..++.+|.+..+..+......--..... T Consensus 127 -----~~p~il~ap~~~~~~v~~~l~~~~~~~~--~~~~~d~~~~~~~~~a~~~~~~~~s~~~~~~~p~~~~~d~----- 194 (392) T protein:vir:18 127 -----VKPRILGVPGLDTQEVATALASVCISLR--AFGYVSAWGCKTISEAMAYRENFSQRELMVIWPDFLAWDT----- 194 (392) T ss_pred -----eeehhcccCccchHHHHHHHHHHHhhcC--cEEEEecCCCCCHHHHHHHHhhccCceEEEEeCceeeecc----- Confidence 0001111111112223333333333221 1223333233333444456554333222221110000000 Q ss_pred HHHHhcccCceEEEecCCccchhHHHHHHHHHhccCCC---ceeeeeeeecCccc--------cCCCHHHHHHHHhCCce Q lcl|NC_019918. 234 SRLVAAGFQRTALIYHPNADAQFPECAWVGYQLQEQPG---SNTWTHKALAAVDA--------YRLTPTESTNLKNKNVT 302 (428) Q Consensus 234 ~~l~~~~~~~t~~~y~~~~~~~~~~a~~~~~~~~~~~g---~~t~~fk~~~Gv~~--------~~~t~t~~~~l~~~~~n 302 (428) .++. ..-..|.+.++|.....+.. .....+|.+.|+.. ...+..|++.|..+|+| T Consensus 195 -------------~~~~-~~~~p~s~~~AG~~a~~d~~~g~~~spaN~~l~gi~~~~~~~~~~~~~~~~~~~~Ln~~gI~ 260 (392) T protein:vir:18 195 -------------TANA-TATAYATARALGLRAYIDQTIGWHKTLSNVGVQGVTGISASVFWDLQASGTDADLLNEAGVT 260 (392) T ss_pred -------------cCCc-eEEechHHHHHHHHHhhhccCCceEccCCceeeceeecceecccccCCCcchhhhhhhcCce Confidence 0000 01123456666665544432 23445677777652 23467899999999999 Q ss_pred EEEEEcCceeeecCEecCCc----hhHHHHHHHHHHHHHHHHHHHHHHhcCCCCcCHhHHHHHHHHHHHHHHHHHhcCce Q lcl|NC_019918. 
303 TFERVGGVNRTFGGAMAGGE----WIDVMIFVDWLEARMTERLWFRMANSKKIPYDAVGATILESEIRAQLNEGIRVGGL 378 (428) Q Consensus 303 ~y~~~~~~~~~~~G~~~~G~----~iD~~~~~dwl~~~lq~~l~~ll~~~~kip~~~~G~~~i~~~i~~~~~~~~~~G~I 378 (428) .+....| ..++.+++++++ ||-+.+-.+|+...|+..+...+-. |.++.-...|+..++.-|++.++.|.| T Consensus 261 t~~~~~G-~~~wG~rT~~~d~~~~~i~~rR~~~~i~~~i~~~~~~~v~e----~n~~~~~~~i~~~i~~~L~~l~~~gal 335 (392) T protein:vir:18 261 TLVRKDG-FRFWGNRTCSDDPLFLFENYTRTAQVLADTMAEAHMWAVDK----PITASLIRDIVDGINAKFRELKSNGYI 335 (392) T ss_pred EEEcCCC-EEEEcccccCCCcccceeehhhHHHHHHHHHHHHHHHhccC----CCCHHHHHHHHHHHHHHHHHHHhcCcc Confidence 9965433 567899999985 7888888899998888887765543 889999999999999999999999999 Q ss_pred ecCCceEEEeCchHhCCHHHHhccccCCeEEEEEECceEEEEEEEEEEec Q lcl|NC_019918. 379 AEAPAPKVFVPDVLSMSPNMRAQRIFEGIEFEARLAGAIHFVHIRGTVTV 428 (428) Q Consensus 379 ~~g~~~~v~~~~~~~~~~~dra~R~~~~i~~~~~~agaih~v~i~~~v~~ 428 (428) . ||++++. ..+.+++|+.+++.. +.+.+.+...+++|+++...+. T Consensus 336 ~---g~~v~~d-~~~nt~~~i~~G~~~-~~v~~~p~~p~e~I~~~~~~~~ 380 (392) T protein:vir:18 336 V---DGECWFD-EESNDKETLKAGKLY-IDYDYTPVPPLESLTLRQRITD 380 (392) T ss_pred c---ceEEEEe-cCCCCHHHhhCCeEE-EEEEEEecCCcceEEEEEEEch Confidence 7 4677774 467899999999987 9999999999999999999888 No 18 >protein:vir:1996 Length: 495 # NCBI annotation: major tail subunit # Family: family:all:369 # MgeID: mge:320 # MgeName: Mu # Cross-refs: genbank:acc:NP_050643;genbank:gi:9633530;genbank:GeneID:2636269 Probab=99.22 E-value=4.2e-10 Score=72.01 Aligned_cols=410 Identities=14% Similarity=0.139 Sum_probs=225.9 Q ss_pred CCCCC--ce----------EEEeeeeecccccccccceEEEEcccC----CCccceEEeeCHHHHHhhcCCChHHHHHHH Q lcl|NC_019918. 1 MTVLT--DV----------IDIQISRETAAVAQTNFNVPLFIASHT----NFSERARVYNSLKGVAEDFGESDPTYLAAV 64 (428) Q Consensus 1 M~~is--~i----------V~V~i~~~~~~~~~~~f~~~li~~~~~----~~~~~~~~y~s~~~V~~~fg~~s~eY~aA~ 64 (428) |+.|+ +| +.++.+..- .....+-.-.||+|... 
.++.......|.++..+.||..|-...|++ T Consensus 1 m~~i~F~~IP~~iRvP~~y~E~dns~A~-~g~~~~~q~vLiiGq~la~gs~~~~~pv~v~s~~~a~~~fG~GS~la~M~~ 79 (495) T protein:vir:19 1 MSDISFNAIPSDVRVPLTYIEFDNSNAV-SGTPAPRQRVLMFGQSGSKASAAPNVPVRIRSGSQASAAFGQGSMLALMAD 79 (495) T ss_pred CCCCchhhCCcccccCeEEEEEccCCCC-cCCcCCCceEEEEEecCcccccccceeEEecCHHHHHHhcCcCcHHHHHHH Confidence 87653 22 112222111 11223334556776543 233344445588899999999999999999 Q ss_pred HHHhcCCcccEEEEEeeeccccc----ccchh-eeecccccccccceeeeeeecccchhhhhhhhheeeecccceEEEEe Q lcl|NC_019918. 65 RYFGQALKPRSLVIGRRQVPSAT----VSVSV-VQEGQSYVLTVNGLPVSYVSHQDDTATLIATGLKAAYDVTPVVGVTV 139 (428) Q Consensus 65 ~~F~q~p~P~~l~igr~~~~~~~----~~~~~-~~~~~~~~~~v~g~~~s~~~~~~~~a~~i~a~l~~a~~~~~~~~~~~ 139 (428) .|....|--.--+|+--+.+... +++++ .+..+.....|.|.-+.......++++.+++++.+++++....-.+. T Consensus 80 a~~~~n~~~~l~~i~~~D~aG~aA~g~it~tg~at~~G~l~l~I~g~~v~v~V~~gdTaa~vA~al~aaina~~~lPvTA 159 (495) T protein:vir:19 80 AFLNANRVAELWCIPQGNGTGNAAVGEISLSGTAGENGSLVTYIAGQRLAVSVAAGATGAALADLLVARIKGQPDLPVTA 159 (495) T ss_pred HHHHhCCcceEEEEeeCChhhceeEEEEEEeecCCCCcEEEEEECCEEEEEEecCCCCHHHHHHHHHHHhcCCccCceEE Confidence 99987664333334322211111 11111 12344556778888888888889999999998888777654433333 Q ss_pred ec-------cccceeeeec--cccc-ccc----------ccceEEE-----EeeccccCHHHHHHHHHhcccCce-EEEE Q lcl|NC_019918. 140 TD-------NEDGTLTVAS--NGDW-SLK----------VSSNLTM-----AAAPSTEGWPATITAVQGENDEWY-ALSI 193 (428) Q Consensus 140 tt-------~~~~~~t~as--~~~~-~~~----------~s~~~~~-----~~~~aa~~~~~al~~~~~~~~~w~-~~~~ 193 (428) .. ...+.++... .+.. +.. ...++.+ +.++...++.++++++. 
+.|| ++++ T Consensus 160 ~~~~~~~~~~a~~~VtlTAr~kG~~n~idi~~~~~~ge~~p~Glt~titamsgGag~PDia~alaal~---~~~~~~I~~ 236 (495) T protein:vir:19 160 EVRADSGDDDTHADVVLSAKFTGALSAVDVRWNYYAGETTPYGIITAFKAASGKNGNPDISASIAGMG---DLQYKYIVM 236 (495) T ss_pred EeeccCCCCcCceeEEEEEeeccccccceeEEEeecccccccceeEEEEecCCCCCCcchHHHHHHhc---cCCCcEEEE Confidence 22 1112222111 1111 000 0111211 22444456777777776 4565 4444 Q ss_pred ecCCHHHHHHHHHHHhhh----CCEE---EEEecCcccccchhHHHHHHHHhcccCceEEEec-CCccchh-HHHHHHHH Q lcl|NC_019918. 194 DSHADDDIMAVATHIEGT----KKVF---IGATAQANTKTSAENDIASRLVAAGFQRTALIYH-PNADAQF-PECAWVGY 264 (428) Q Consensus 194 ~~~~~~~~~ala~~~~a~----~~~~---~~~~~~~~~~~~~~~~~~~~l~~~~~~~t~~~y~-~~~~~~~-~~a~~~~~ 264 (428) ...+.+...+|-.+++.- +.++ +....+ +...+...-...|..|..++.. ..+...+ ..|+++++ T Consensus 237 P~tD~asL~al~~~l~~rw~~~~q~~g~~~~a~~g------T~~~l~t~g~~~N~~~it~~~~~gsp~~~~~~AAA~aa~ 310 (495) T protein:vir:19 237 PYTDEPNLNLLRTELQERWGPVNQADGFAVTVLSG------TYGDISTFGVSRNDHLISCMGIAGAPEPSYLYAATLCAV 310 (495) T ss_pred ecCcHHHHHHHHHHHHHhhhHHHhcCeEEEEeecC------CHHHHHHhhhccCCceEEEEecCCCCCcHHHHHHHHHHH Confidence 445555666777777652 2222 222221 1122333334456666555443 3332221 12344343 Q ss_pred H---hccCCCceeeeeeeecCcccc----CCCHHHHHHHHhCCceEEEE-EcCceeeecCEec-----CC----chhH-- Q lcl|NC_019918. 265 Q---LQEQPGSNTWTHKALAAVDAY----RLTPTESTNLKNKNVTTFER-VGGVNRTFGGAMA-----GG----EWID-- 325 (428) Q Consensus 265 ~---~~~~~g~~t~~fk~~~Gv~~~----~~t~t~~~~l~~~~~n~y~~-~~~~~~~~~G~~~-----~G----~~iD-- 325 (428) + ...+| .-.+.--.|+||.|. .++.+|.+.|..+|+..+.. .+|.-.+.+.++. 
.| .|.| T Consensus 311 ~A~~l~~DP-ArPL~tl~L~Gi~~p~~~~r~~~~ern~LL~~Gist~~V~~~G~V~I~R~ITTY~~n~~G~~D~syLDi~ 389 (495) T protein:vir:19 311 ASQALSIDP-ARPLQTLTLPGRMPPAVGDRFTWSERNALLFDGISTFNVNDGGEMQIERMITMYRTNKYGDSDPSYLNVN 389 (495) T ss_pred HHHHhhccc-ccccCceeecceecCCccccCChHHHHHHHhCCcceEEECCCCeEEEEeeeeeeeecCCCCcchhhhhhH Confidence 3 35666 445555577888853 47899999999999999875 4566666555543 35 3776 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcCCCCcCHh---------HHHHHHHHHHHHHHHHHhcCceecCCceEEEe-CchHhCC Q lcl|NC_019918. 326 VMIFVDWLEARMTERLWFRMANSKKIPYDAV---------GATILESEIRAQLNEGIRVGGLAEAPAPKVFV-PDVLSMS 395 (428) Q Consensus 326 ~~~~~dwl~~~lq~~l~~ll~~~~kip~~~~---------G~~~i~~~i~~~~~~~~~~G~I~~g~~~~v~~-~~~~~~~ 395 (428) .++-+++++..++..+...|-. .|+.-+.. --..|++.+-..+++....|++..-+.|.-.. -.+.. . T Consensus 390 T~~tl~yvr~~~r~~i~~kfpR-~KLa~d~~~~~~gq~IvTp~~ir~ell~~~~~le~~given~~~~~~~LiVerd~-~ 467 (495) T protein:vir:19 390 TIATLSYLRYSLRTRITQKFPN-YKLASDGTRFATGQAVVTPSVIKTELLALFEEWENAGLVEDFDTFKEELYVARNK-D 467 (495) T ss_pred HHHHHHHHHHHHHHHHhhhcCC-cccccCCCCCCCcccccChHHHHHHHHHHHHhhhhhccccChhhhcceeEEEECC-C Confidence 8899999999999998876644 23332211 23578999999999999999997654433111 11211 1 Q ss_pred HHHHhccccCCeEEEEEECceEEEEEEEEEEec Q lcl|NC_019918. 396 PNMRAQRIFEGIEFEARLAGAIHFVHIRGTVTV 428 (428) Q Consensus 396 ~~dra~R~~~~i~~~~~~agaih~v~i~~~v~~ 428 (428) .-+|-+ +.+-..+-...|.+-.+...-| T Consensus 468 dpnRln-----~~~p~d~vn~L~V~A~~i~f~L 495 (495) T protein:vir:19 468 DKDRLD-----VLCGPNLINQFRIFAAQVQFIL 495 (495) T ss_pred CCcEEE-----EEecceeeCceeeeeeeeeeeC Confidence 112333 3333455555665555555555 No 19 >protein:vir:2035 Length: 396 # NCBI annotation: gpFI # Family: family:all:115 # MgeID: mge:315 # MgeName: P2 # Cross-refs: genbank:acc:NP_046778;genbank:gi:9630349;genbank:GeneID:1261516 Probab=99.22 E-value=9.5e-11 Score=75.53 Aligned_cols=360 Identities=12% Similarity=0.102 Sum_probs=198.2 Q ss_pred CCC-CCceEEE-eeeeecccccccccceEEEEcccCC------CccceEEeeCHHHHHhhcCCChHHHHHHHHHHhcCCc Q lcl|NC_019918. 
1 MTV-LTDVIDI-QISRETAAVAQTNFNVPLFIASHTN------FSERARVYNSLKGVAEDFGESDPTYLAAVRYFGQALK 72 (428) Q Consensus 1 M~~-is~iV~V-~i~~~~~~~~~~~f~~~li~~~~~~------~~~~~~~y~s~~~V~~~fg~~s~eY~aA~~~F~q~p~ 72 (428) |+. .=- |.| .+.-.+.++.......+.++|.+.. +...-...++..+-...||.....+.+...+|.+... T Consensus 1 m~~~~~G-V~v~e~~~g~~~i~~v~tav~~~vg~a~~a~~~~~~l~~pvlvts~~~~~~~~g~~~tL~~al~~~~~ngg~ 79 (396) T protein:vir:20 1 MSDYHHG-VQVLEINEGTRVISTVSTAIVGMVCTASDADAETFPLNKPVLITNVQSAISKAGKKGTLAASLQAIADQSKP 79 (396) T ss_pred CCCCCCC-eEEEEcCCCcceeeecCCceeEEEeeeccCCCccccCccCEEeechHHHHhhcccccchhhhhhhhhccCce Confidence 984 422 222 2333445566666677777775532 2234456788888888999999988888888877532 Q ss_pred ccEEEEEeeecccccccchheeecccccccccceeeeeeecccchhhhhhhhheeeecccceEEEEeeccccceeeeecc Q lcl|NC_019918. 73 PRSLVIGRRQVPSATVSVSVVQEGQSYVLTVNGLPVSYVSHQDDTATLIATGLKAAYDVTPVVGVTVTDNEDGTLTVASN 152 (428) Q Consensus 73 P~~l~igr~~~~~~~~~~~~~~~~~~~~~~v~g~~~s~~~~~~~~a~~i~a~l~~a~~~~~~~~~~~tt~~~~~~t~as~ 152 (428) .. ++-+.............. .....+.+. ....+.. .....+..+.+ T Consensus 80 ~~--~v~~~~~~~~~~~~~~~a---~t~~~~~~~-----~~~~~~~-tg~~al~~~~~---------------------- 126 (396) T protein:vir:20 80 VT--VVMRVEDGTGDDEETKLA---QTVSNIIGT-----TDENGQY-TGLKAMLAAES---------------------- 126 (396) T ss_pred eE--EEEecccccccccccccc---ccccccccc-----ccccccc-chhhhhhhhcc---------------------- Confidence 22 221110000000000000 000000000 0000000 00000000000 Q ss_pred ccccccccceEEEEeeccccCHHHHHHHHHhcccCceEEEEecCCHHHHHHHHHHHhhhCCEEEEEecCcccccchhHHH Q lcl|NC_019918. 153 GDWSLKVSSNLTMAAAPSTEGWPATITAVQGENDEWYALSIDSHADDDIMAVATHIEGTKKVFIGATAQANTKTSAENDI 232 (428) Q Consensus 153 ~~~~~~~s~~~~~~~~~aa~~~~~al~~~~~~~~~w~~~~~~~~~~~~~~ala~~~~a~~~~~~~~~~~~~~~~~~~~~~ 232 (428) .......+.+..+........++..+.+.-.. +..++.....+..++.+|-+.-+..+......--..... 
T Consensus 127 ---~~~~~p~i~~ap~~~~~~v~~al~~~~~~~~~--~~~iD~p~~~~~~~a~~~r~~~~s~~~~~~~P~~~~~d~---- 197 (396) T protein:vir:20 127 ---VTGVKPRILGVPGLDTKEVAVALASVCQKLRA--FGYISAWGCKTISEVKAYRQNFSQRELMVIWPDFLAWDT---- 197 (396) T ss_pred ---ccccchhhhhhhhhccHHHHHHHHHHHhcCCc--EEEEecCCCCCHHHHHHHhhCCCCceEEEEcCccccccC---- Confidence 00000011111122222334444444433222 223333322233344456554433333222211000000 Q ss_pred HHHHHhcccCceEEEecCCccchhHHHHHHHHHhccCC--C-ceeeeeeeecCccc--------cCCCHHHHHHHHhCCc Q lcl|NC_019918. 233 ASRLVAAGFQRTALIYHPNADAQFPECAWVGYQLQEQP--G-SNTWTHKALAAVDA--------YRLTPTESTNLKNKNV 301 (428) Q Consensus 233 ~~~l~~~~~~~t~~~y~~~~~~~~~~a~~~~~~~~~~~--g-~~t~~fk~~~Gv~~--------~~~t~t~~~~l~~~~~ 301 (428) ...... ...+.+.++|.....+. | .....+|.+.||.. ...++.|++.|.++|+ T Consensus 198 --------~~~~~~-------~~p~s~~~Ag~~a~~d~~~g~~~spaN~~l~gi~~~~~~~~~~~~~~~~~~~~Ln~~gi 262 (396) T protein:vir:20 198 --------VTSTTA-------TAYATARALGLRAKIDQEQGWHKTLSNVGVNGVTGISASVFWDLQESGTDADLLNESGV 262 (396) T ss_pred --------cCCcce-------eechhHHHHHHHHHhhhhcCcEeccCCceeccceecceecccccCCCcchhhhhhhcCc Confidence 000011 12345555555553332 2 13445667777652 2356889999999999 Q ss_pred eEEEEEcCceeeecCEecCCc----hhHHHHHHHHHHHHHHHHHHHHHHhcCCCCcCHhHHHHHHHHHHHHHHHHHhcCc Q lcl|NC_019918. 302 TTFERVGGVNRTFGGAMAGGE----WIDVMIFVDWLEARMTERLWFRMANSKKIPYDAVGATILESEIRAQLNEGIRVGG 377 (428) Q Consensus 302 n~y~~~~~~~~~~~G~~~~G~----~iD~~~~~dwl~~~lq~~l~~ll~~~~kip~~~~G~~~i~~~i~~~~~~~~~~G~ 377 (428) |......| ..++.++|++++ ||-+.+-.+|+...|+..+...+-. |.+..-...|+..++.-|++.++.|. 
T Consensus 263 ~~~~~~~G-~~~wG~rT~s~d~~~~~i~~rR~~~~i~~~~~~~~~~~v~e----~~~~~~~~~i~~~i~~~L~~l~~~G~ 337 (396) T protein:vir:20 263 TTLIRRDG-FRFWGNRTCSDDPLFLFENYTRTAQVVADTMAEAHMWAVDK----PITATLIRDIVDGINAKFRELKTNGY 337 (396) T ss_pred EEEEcCCC-EEEEcccccCCCcccceeehhhHHHHHHHHHHHHHHHhccC----CCCHHHHHHHHHHHHHHHHHHHhCcc Confidence 99965333 467899999885 7778888899888888887765543 77888899999999999999999999 Q ss_pred eecCCceEEEeCchHhCCHHHHhccccCCeEEEEEECceEEEEEEEEEEec Q lcl|NC_019918. 378 LAEAPAPKVFVPDVLSMSPNMRAQRIFEGIEFEARLAGAIHFVHIRGTVTV 428 (428) Q Consensus 378 I~~g~~~~v~~~~~~~~~~~dra~R~~~~i~~~~~~agaih~v~i~~~v~~ 428 (428) |. ||.+.+. .++.|++|+.+++.. +.+.+.+...++.|.++...+. T Consensus 338 l~---g~~v~~d-~~~nt~~~i~~G~~~-~~i~~~p~~p~e~i~~~~~~~~ 383 (396) T protein:vir:20 338 IV---DATCWFS-EESNDAETLKAGKLY-IDYDYTPVPPLENLTLRQRITD 383 (396) T ss_pred ee---ceEEEEe-cCCCCHHHhhCCEEE-EEEEEEecCCcceEEEEEEEch Confidence 97 5778875 567899999999987 9999999999999999999888 No 20 >protein:vir:5711 Length: 396 # NCBI annotation: gpFI # Family: family:all:115 # MgeID: mge:120 # MgeName: L-413C # Cross-refs: genbank:acc:NP_839870;genbank:gi:30065725;genbank:GeneID:1260618 Probab=99.22 E-value=1.6e-10 Score=74.27 Aligned_cols=361 Identities=13% Similarity=0.106 Sum_probs=198.0 Q ss_pred CCC-CCceEEEeeeeecccccccccceEEEEcccCCC------ccceEEeeCHHHHHhhcCCChHHHHHHHHHHhcCCcc Q lcl|NC_019918. 1 MTV-LTDVIDIQISRETAAVAQTNFNVPLFIASHTNF------SERARVYNSLKGVAEDFGESDPTYLAAVRYFGQALKP 73 (428) Q Consensus 1 M~~-is~iV~V~i~~~~~~~~~~~f~~~li~~~~~~~------~~~~~~y~s~~~V~~~fg~~s~eY~aA~~~F~q~p~P 73 (428) |+. .--+.=+.+.-.+.++...+.+.+.+++...+. 
...-...++..+....||.++..+.+-..+|.+...+ T Consensus 1 m~~~~~GV~v~e~~~g~~~i~~v~tav~~~vg~a~~~d~~~~~~~~pv~i~s~~~~~~~~g~~~tl~~al~~~~~~~~~~ 80 (396) T protein:vir:57 1 MSDYHHGVQVLEINDGTRVISTVSTAIVGMVCTASDADAETFPLNKPVLITNVQSAIAKAGKKGTLAASLQAIADQSKPV 80 (396) T ss_pred CCCCCCceEEEEcCCCcccccccCCceEEEEEeccCCCcccccCccCeEeecchhhhhhcccccchHHHHHHhhhcCCce Confidence 984 433322233345566777888888888765432 1222345677788888999998888888888876433 Q ss_pred cEEEEEeeecccccccchheeecccccccccceeeeeeecccchhhhhhhhheeeecccceEEEEeeccccceeeeeccc Q lcl|NC_019918. 74 RSLVIGRRQVPSATVSVSVVQEGQSYVLTVNGLPVSYVSHQDDTATLIATGLKAAYDVTPVVGVTVTDNEDGTLTVASNG 153 (428) Q Consensus 74 ~~l~igr~~~~~~~~~~~~~~~~~~~~~~v~g~~~s~~~~~~~~a~~i~a~l~~a~~~~~~~~~~~tt~~~~~~t~as~~ 153 (428) .. +-+................. ..+-|. ....+..+.+ .++....+. . T Consensus 81 ~~--vv~~~~~~~~~~~~~~a~t~---~~iiG~-----~~~~~~~tgl-~al~~~~~~--------------~------- 128 (396) T protein:vir:57 81 TV--VVRVEDGTGDDEETKLAQTV---SNIIGT-----TDENGQYTGL-KALMGAESV--------------T------- 128 (396) T ss_pred eE--eeeccccccccccccccccc---eeeeee-----ccccccchhh-hhhhhcccc--------------e------- Confidence 22 21111000000000000000 000000 0000000000 000000000 0 Q ss_pred cccccccceEEEEeeccccCHHHHHHHHHhcccCceEEEEecCCHHHHHHHHHHHhhhCCEEEEEecCcccccchhHHHH Q lcl|NC_019918. 154 DWSLKVSSNLTMAAAPSTEGWPATITAVQGENDEWYALSIDSHADDDIMAVATHIEGTKKVFIGATAQANTKTSAENDIA 233 (428) Q Consensus 154 ~~~~~~s~~~~~~~~~aa~~~~~al~~~~~~~~~w~~~~~~~~~~~~~~ala~~~~a~~~~~~~~~~~~~~~~~~~~~~~ 233 (428) .....+.+..+........++..+.+.-+ .+..++.....+..++-+|-+.-+..+......--.... 
T Consensus 129 ----~~~p~i~~ap~~~~~~v~~al~~~~~~~~--~~~~~d~p~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d------ 196 (396) T protein:vir:57 129 ----GVKPRILGVPGLDTKEVAVALASVCQELN--AFGYISAWGCKTISEVKAYRQNFSQRELMVIWPDFLAWD------ 196 (396) T ss_pred ----eEEeccccCcccchhHHHHHHHHHhhhCc--eEEEEcCCCCCCHHHHHHHHhccCCceEEEEcceeeeec------ Confidence 00001111122222233344444443221 223333322222333445655433333222211000000 Q ss_pred HHHHhcccCceEEEecCCccchhHHHHHHHHHhccCC--C-ceeeeeeeecCcccc--------CCCHHHHHHHHhCCce Q lcl|NC_019918. 234 SRLVAAGFQRTALIYHPNADAQFPECAWVGYQLQEQP--G-SNTWTHKALAAVDAY--------RLTPTESTNLKNKNVT 302 (428) Q Consensus 234 ~~l~~~~~~~t~~~y~~~~~~~~~~a~~~~~~~~~~~--g-~~t~~fk~~~Gv~~~--------~~t~t~~~~l~~~~~n 302 (428) ...+.... ..+.+.++|.....+. | .....+|.+.||..- ..+++|++.|..+|+| T Consensus 197 ------~~~~~~~~-------~p~s~~~Ag~~a~~d~~~g~~~spaN~~l~gi~~~~~~~~~~~~~~~~~~~~Ln~~gi~ 263 (396) T protein:vir:57 197 ------TVTSTTAT-------AYATARALGLRAKIDQEQGWHKTLSNVGVNGVTGISASVFWDLQKPGTDADLLNEAGVT 263 (396) T ss_pred ------ccCCceeE-------EehhHHHHHHHHHhhhccCcEeccCCceeccccccceecccccCCcchhhhhhhhcCcE Confidence 00001111 2234555555543332 2 234556777776532 3467899999999999 Q ss_pred EEEEEcCceeeecCEecCCc----hhHHHHHHHHHHHHHHHHHHHHHHhcCCCCcCHhHHHHHHHHHHHHHHHHHhcCce Q lcl|NC_019918. 303 TFERVGGVNRTFGGAMAGGE----WIDVMIFVDWLEARMTERLWFRMANSKKIPYDAVGATILESEIRAQLNEGIRVGGL 378 (428) Q Consensus 303 ~y~~~~~~~~~~~G~~~~G~----~iD~~~~~dwl~~~lq~~l~~ll~~~~kip~~~~G~~~i~~~i~~~~~~~~~~G~I 378 (428) +.....| ..++.+++++++ ||-+.+-.+|++..|+..+...+-. 
|.++.-...|+..|+.-|+..++.|.| T Consensus 264 t~~~~~G-~~~wG~rT~~~d~~~~~i~vrR~~~~i~~~i~~~~~~~v~e----~n~~~~~~~i~~~i~~~l~~l~~~gal 338 (396) T protein:vir:57 264 TLVRRDG-FRFWGNRTCSDDPLFLFESYTRTAQVLADTMAEAHMWAIDK----PITATLIRDIIDGINAKFRELKNNGYI 338 (396) T ss_pred EEEcCCC-EEEEcccccCCCcccceeehhhHHHHHHHHHHHHHHHhccC----CCCHHHHHHHHHHHHHHHHHHHhCCce Confidence 9865433 467899999885 7778888889888888887765443 778999999999999999999999999 Q ss_pred ecCCceEEEeCchHhCCHHHHhccccCCeEEEEEECceEEEEEEEEEEec Q lcl|NC_019918. 379 AEAPAPKVFVPDVLSMSPNMRAQRIFEGIEFEARLAGAIHFVHIRGTVTV 428 (428) Q Consensus 379 ~~g~~~~v~~~~~~~~~~~dra~R~~~~i~~~~~~agaih~v~i~~~v~~ 428 (428) . ||.+.+. .++.+++|+.+++.. +.+.+.+...+++|.++...+. T Consensus 339 ~---g~~v~~d-~~~n~~~~i~~G~~~-~~v~~~p~~p~e~I~~~~~~~~ 383 (396) T protein:vir:57 339 V---DGTCWFS-EESNDAETLKAGKLY-IDYDYTPVPPLENLTLRQRITS 383 (396) T ss_pred e---ceEEEEe-cCCCCHHHhhCCeEE-EEEEEEecCCcceEEEEEEEch Confidence 7 4677775 467799999999887 9999999999999999999988 No 21 >protein:vir:4517 Length: 498 # NCBI annotation: tail sheath protein # Family: family:all:369 # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599043;genbank:gi:19549001;genbank:GeneID:935236 Probab=99.20 E-value=6e-10 Score=71.14 Aligned_cols=407 Identities=15% Similarity=0.148 Sum_probs=218.2 Q ss_pred CC-CCCce---EEE---eeeeec-ccccccccceEEEEcccC----CCccceEEeeCHHHHHhhcCCChHHHHHHHHHHh Q lcl|NC_019918. 1 MT-VLTDV---IDI---QISRET-AAVAQTNFNVPLFIASHT----NFSERARVYNSLKGVAEDFGESDPTYLAAVRYFG 68 (428) Q Consensus 1 M~-~is~i---V~V---~i~~~~-~~~~~~~f~~~li~~~~~----~~~~~~~~y~s~~~V~~~fg~~s~eY~aA~~~F~ 68 (428) |. ..+.| +.| -+.+.. .+....+-.-.||+|... .++.......|.++..+.||..|-...|++.|.. 
T Consensus 1 M~IsF~~IP~~iRvP~~y~E~dns~A~~~~~~q~vLiiGq~la~gs~~~~~~v~v~s~~~a~~lfG~GSml~~M~~a~~~ 80 (498) T protein:vir:45 1 MTISFNTIPSNTLVPLFYAEMDNQAANTAQDSGASLLIGHANNGAEIVANSLVLMPSADYARQICGAGSQLARMVEAYRQ 80 (498) T ss_pred CCCchhhcCcccccCeEEEEEeCCCCCCCCCCcceEEEEecCCccccccceeEEecCHHHHHHhcCcCcHHHHHHHHHHH Confidence 44 22222 111 112211 222233334567776543 2333444445889999999999999999999998 Q ss_pred cCCcccEEEEEeeecccc----cccchh-eeecccccccccceeeeeeecccchhhhhhhhheeeecccceEEEEeeccc Q lcl|NC_019918. 69 QALKPRSLVIGRRQVPSA----TVSVSV-VQEGQSYVLTVNGLPVSYVSHQDDTATLIATGLKAAYDVTPVVGVTVTDNE 143 (428) Q Consensus 69 q~p~P~~l~igr~~~~~~----~~~~~~-~~~~~~~~~~v~g~~~s~~~~~~~~a~~i~a~l~~a~~~~~~~~~~~tt~~ 143 (428) -.|--.--+|+--+.+.. .+++++ .+..+.....|.|.-+.......++++.+++++.+++++....-.++.... T Consensus 81 ~n~~~~l~~i~~~d~aG~aA~g~it~tg~at~~G~l~l~Igg~~v~v~V~~gdTaa~vA~al~aaina~~~lPVTA~~~~ 160 (498) T protein:vir:45 81 TDPFGELYVIAVPEATGAAATVTLTVTGEATESGTVNVYVGRTRVQAPVTNGDNVTTIASSIQDAINAVPTLPFTASSSA 160 (498) T ss_pred hCCcceEEEEeeCCcccceeEEEEEeecccCCCcEEEEEECCEEEEEEecCCCCHHHHHHHHHHHHhCCCCCceEEEecC Confidence 876533223332211111 112211 123445566788888888888888999999888887776544433333222 Q ss_pred -cceeeeeccccccc---------------cccceEE--EE---eeccccCHHHHHHHHHhcccCce-EEEEecCCHHHH Q lcl|NC_019918. 144 -DGTLTVASNGDWSL---------------KVSSNLT--MA---AAPSTEGWPATITAVQGENDEWY-ALSIDSHADDDI 201 (428) Q Consensus 144 -~~~~t~as~~~~~~---------------~~s~~~~--~~---~~~aa~~~~~al~~~~~~~~~w~-~~~~~~~~~~~~ 201 (428) ..+.|..-.+..+- ....++. +. .+....++.++++++. 
+.|| ++.+...+.+.+ T Consensus 161 ~~VtlTAr~kG~~GN~I~l~~~~~~~~~ge~~p~Glt~~itamagGag~PD~a~alaal~---~~~~~~I~~p~~D~asL 237 (498) T protein:vir:45 161 GVVTLTARHKGLCGNEIPVSLNYYGFGGGEVLPAGVQIAVATGTAGTGAPVLTGAVAAMA---DEPFDYIGLPFNDTASV 237 (498) T ss_pred ceEEEEeeccCccccceeEEEeeccccccccccceeeEEEEccCCCccCchhHHHHHHhc---cCCccEEEEeeCCHHHH Confidence 11222211111110 0112222 22 2333446666777665 4454 455555556666 Q ss_pred HHHHHHHhhh-------CCEEEEEecCcccccchhHHHHHHHHhcccCceEEEec-CCccch--hHHHHHHHHHh---cc Q lcl|NC_019918. 202 MAVATHIEGT-------KKVFIGATAQANTKTSAENDIASRLVAAGFQRTALIYH-PNADAQ--FPECAWVGYQL---QE 268 (428) Q Consensus 202 ~ala~~~~a~-------~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~t~~~y~-~~~~~~--~~~a~~~~~~~---~~ 268 (428) .++..+.+.- +.++....... .++...+...-...|..|..++.+ ...+.. ...|++++++. .. T Consensus 238 ~al~~~L~~~sgRw~~~~q~~g~~~~a~---~gT~~~l~t~g~~~N~~~it~~~~~~~~~sp~~~~AAa~aa~~A~~l~~ 314 (498) T protein:vir:45 238 NTLVTEMNDTSGRWSYARQLYGHVYTAK---TGTLSELVNAGDQFNQQHITLAGYEKETQTPADELAASRTARAAVFIRN 314 (498) T ss_pred HHHHHHHhhhhhhhhHHhhcCeEEEEec---cCCHHHHHHhhhccCCceEEEEecCCCCCChHHHHHHHHHHHHHHHhhc Confidence 6777777542 22332221111 111223333444556677665543 332222 23455555554 56 Q ss_pred CCCceeeeeeeecCcccc----CCCHHHHHHHHhCCceEEEEEcCceeeecCEec-----CC----chhH--HHHHHHHH Q lcl|NC_019918. 269 QPGSNTWTHKALAAVDAY----RLTPTESTNLKNKNVTTFERVGGVNRTFGGAMA-----GG----EWID--VMIFVDWL 333 (428) Q Consensus 269 ~~g~~t~~fk~~~Gv~~~----~~t~t~~~~l~~~~~n~y~~~~~~~~~~~G~~~-----~G----~~iD--~~~~~dwl 333 (428) +|. -.+.=-.|+|+.|. .++.+|.+.|..+|+..+..-.|.-.+.+.++. 
.| .|.| .++-.+++ T Consensus 315 DPA-rPL~tl~L~Gi~~p~~~~r~~~~ern~LL~~Gist~~V~~G~V~I~R~ITTY~~n~~G~~D~syLDi~T~~tl~yv 393 (498) T protein:vir:45 315 DPA-RPTQTGELVGMLPAPKGKRFTMTEQQTLLSHGVATAYVESGVLRIQRDVTTYRKNAYGVADNSYLDSETLHTSAYV 393 (498) T ss_pred ccc-cccCceeecceecCCchhcCChHHHHHHHhCCcceEEEcCCeEEEEeeeeeeeecCCCCcchhhhhhhhHHHHHHH Confidence 674 44444567888854 478999999999999998766665555666553 45 3776 88999999 Q ss_pred HHHHHHHHHHHHHhcCCCCcCHh---------HHHHHHHHHHHHHHHHHhcCceecCCceEEEe-CchHhCCHHHHhccc Q lcl|NC_019918. 334 EARMTERLWFRMANSKKIPYDAV---------GATILESEIRAQLNEGIRVGGLAEAPAPKVFV-PDVLSMSPNMRAQRI 403 (428) Q Consensus 334 ~~~lq~~l~~ll~~~~kip~~~~---------G~~~i~~~i~~~~~~~~~~G~I~~g~~~~v~~-~~~~~~~~~dra~R~ 403 (428) ...++..+..-|-. .|+.-+.. --.+|++.+-..+++....|++..-+.|.-.. -.+.. ..-+|-+=. T Consensus 394 r~~~r~~i~~kfpR-~KLa~dg~~~~~gq~IvTp~~ir~ell~~y~~le~~givEn~~~~~~~LiVerd~-~dpnRln~~ 471 (498) T protein:vir:45 394 LRKLKSVITSKYGR-HKLASDGTRFGPGQAIVTPAVIKGELLATYRQLERAGIVENYELFKQYLVVERDA-SVPNRLNTL 471 (498) T ss_pred HHHHHHHhhhhcCC-eeecccCcccCCCCcccchHHHHHHHHHHHHhhhhhccccChhhhcceeEEEECC-CCCcEEEEE Confidence 99999988876633 34332222 24688999999999999999997543322100 01110 001111111 Q ss_pred cC--------------CeEEEEEECce Q lcl|NC_019918. 404 FE--------------GIEFEARLAGA 416 (428) Q Consensus 404 ~~--------------~i~~~~~~aga 416 (428) .| .+.+.|.-+++ T Consensus 472 ~p~d~vn~L~V~A~~~~f~lq~~~~~~ 498 (498) T protein:vir:45 472 FPPDYVNQLRVFAVVNQFRLQYSEESA 498 (498) T ss_pred ecccccCchhhhhhhhhhheehhhcCC Confidence 11 12223333333 No 22 >protein:vir:98553 Length: 395 # NCBI annotation: gp21 # Family: family:all:115 # MgeID: mge:1533 # MgeName: PSP3 # Cross-refs: genbank:acc:NP_958076;genbank:gi:41057373;genbank:GeneID:2744224 Probab=99.20 E-value=2.2e-10 Score=73.56 Aligned_cols=358 Identities=12% Similarity=0.089 Sum_probs=198.7 Q ss_pred CCC-CCceEEEeeeeecccccccccceEEEEcccCCC------ccceEEeeCHHHHHhhcCCChHHHHHHHHHHhcCCcc Q lcl|NC_019918. 
1 MTV-LTDVIDIQISRETAAVAQTNFNVPLFIASHTNF------SERARVYNSLKGVAEDFGESDPTYLAAVRYFGQALKP 73 (428) Q Consensus 1 M~~-is~iV~V~i~~~~~~~~~~~f~~~li~~~~~~~------~~~~~~y~s~~~V~~~fg~~s~eY~aA~~~F~q~p~P 73 (428) |++ +--+-=+.+.-.+.++...+...+.++|.+.+. ...-...++..+....||.....+.+-..+|.+...+ T Consensus 1 m~~~~~GV~v~e~~~g~~~v~~v~tav~~~vgta~~~~~~~~p~~~pv~v~s~~~~~~~~g~~~tl~~al~~~~~~~~~~ 80 (395) T protein:vir:98 1 MSDFHHGTQVIEINDGTRVISTVATAVVGMVCTASDADATLFPLNEPVLITNVQSAIAKAGKKGTLAASLQAIADQSKPV 80 (395) T ss_pred CCCCCCCeEEEEcCCCcccccccCcceEEEEeeccCCCccccccccceEeechHHhHhhcccccchhhHHHHHhhccCce Confidence 994 543321233445556677777777777765321 1122356788888888999999999889999887544 Q ss_pred cEEEEEeeecccccccchheeecccccccccceeeeeeecccchhhhhhhhheeeecccceEEEEeeccccceeeeeccc Q lcl|NC_019918. 74 RSLVIGRRQVPSATVSVSVVQEGQSYVLTVNGLPVSYVSHQDDTATLIATGLKAAYDVTPVVGVTVTDNEDGTLTVASNG 153 (428) Q Consensus 74 ~~l~igr~~~~~~~~~~~~~~~~~~~~~~v~g~~~s~~~~~~~~a~~i~a~l~~a~~~~~~~~~~~tt~~~~~~t~as~~ 153 (428) ..+. +............... ....+.|. ....+..+.+. ++..... T Consensus 81 ~~vv--~~~~~~~~~~~~~~a~---~~~~i~g~-----~~~~~~~Tgl~-al~~~~~----------------------- 126 (395) T protein:vir:98 81 TVVV--RVEDGTGDDEEAALAQ---TVSNIIGG-----TDENGKYTGIK-ALLTAQA----------------------- 126 (395) T ss_pred EEEe--eccccccccccccccc---cccccccc-----cccccchhHHH-HHhhhhh----------------------- Confidence 3322 2111000000000000 00000000 00000000000 0000000 Q ss_pred cccccccceEEEEeeccccCHHHHHHHHHhcccCceEEEEecCCHHHHHHHHHHHhhhCCEEEEEecCcccccchhHHHH Q lcl|NC_019918. 154 DWSLKVSSNLTMAAAPSTEGWPATITAVQGENDEWYALSIDSHADDDIMAVATHIEGTKKVFIGATAQANTKTSAENDIA 233 (428) Q Consensus 154 ~~~~~~s~~~~~~~~~aa~~~~~al~~~~~~~~~w~~~~~~~~~~~~~~ala~~~~a~~~~~~~~~~~~~~~~~~~~~~~ 233 (428) .......+.+..+........++..+...-. .+..++.....+..++-+|.+.-+..+....+.--. 
T Consensus 127 --~~~~~p~il~ap~~~~~~v~~al~~~~~~~~--~~~~~d~p~~~t~~~a~~~~~~~~s~~~~~~~p~~~--------- 193 (395) T protein:vir:98 127 --VTGVKPRILGVPGLDTKEVAVALASAAIKLR--AFAYVSAWGCKTISEAMEYRKNFSQRELMVIWPDFL--------- 193 (395) T ss_pred --hhccchhhcccccccccHHHHHHHHHhhhcC--cEEEEEcCCCCCHHHHHHHHhccCCceEEEEeccee--------- Confidence 0000111111222222333444444443322 223333322222333344544333323222211000 Q ss_pred HHHHhcccCceEEEecCC---ccchhHHHHHHHHHhccCCCc---eeeeeeeecCccc--------cCCCHHHHHHHHhC Q lcl|NC_019918. 234 SRLVAAGFQRTALIYHPN---ADAQFPECAWVGYQLQEQPGS---NTWTHKALAAVDA--------YRLTPTESTNLKNK 299 (428) Q Consensus 234 ~~l~~~~~~~t~~~y~~~---~~~~~~~a~~~~~~~~~~~g~---~t~~fk~~~Gv~~--------~~~t~t~~~~l~~~ 299 (428) +|++. .-...+.+.++|.....+..+ ..-..|.+.|+.. ...+.+|++.|.++ T Consensus 194 -------------~~d~~~~~~~~~p~s~~~AG~~a~~d~~~g~~~spaN~~i~gi~~~~~~~~~~~~~~~~~~~~Ln~~ 260 (395) T protein:vir:98 194 -------------AWDTVKNTTATAYATARALGLRAYIDQTVGWHKTLSNVGVQGVTGISASVFWDLQASGTDADLLNEA 260 (395) T ss_pred -------------EecccCCceeeechHHHHHHHHHHhhcccCcEeccCCceeecccccceecccccCCCcchHHhhhhc Confidence 11110 001234566666555444222 2335666666542 23468899999999 Q ss_pred CceEEEEEcCceeeecCEecCCc----hhHHHHHHHHHHHHHHHHHHHHHHhcCCCCcCHhHHHHHHHHHHHHHHHHHhc Q lcl|NC_019918. 300 NVTTFERVGGVNRTFGGAMAGGE----WIDVMIFVDWLEARMTERLWFRMANSKKIPYDAVGATILESEIRAQLNEGIRV 375 (428) Q Consensus 300 ~~n~y~~~~~~~~~~~G~~~~G~----~iD~~~~~dwl~~~lq~~l~~ll~~~~kip~~~~G~~~i~~~i~~~~~~~~~~ 375 (428) |+|.+....| ..++.+++++++ ||-+.+-.+|+...|+..+...+-. |.++.=...|+..|+.-|++.+++ T Consensus 261 gI~~~~~~~G-~~~wG~rT~s~d~~~~~i~~rR~~~~i~~~i~~~~~~~v~e----~~~~~~~~~i~~~i~~~L~~l~~~ 335 (395) T protein:vir:98 261 GVTTLVRKDG-FRFWGNRTCSDDPLFLFENYTRTAQVLADTMAEAHMWAVDK----PITATLIRDIVDGINAKFRELKSN 335 (395) T ss_pred CcEEEEcCCC-EEEEcccccCCCcccceeehhhHHHHHHHHHHHHHHHhccC----CCCHHHHHHHHHHHHHHHHHHHhC Confidence 9999965333 467889998884 7778888899888888887765543 778888999999999999999999 Q ss_pred CceecCCceEEEeCchHhCCHHHHhccccCCeEEEEEECceEEEEEEEEEEec Q lcl|NC_019918. 
376 GGLAEAPAPKVFVPDVLSMSPNMRAQRIFEGIEFEARLAGAIHFVHIRGTVTV 428 (428) Q Consensus 376 G~I~~g~~~~v~~~~~~~~~~~dra~R~~~~i~~~~~~agaih~v~i~~~v~~ 428 (428) |.|. ||++.+. .++.+++|+.+++.. +.+.+.+...+++|+++...+. T Consensus 336 g~l~---g~~v~~d-~~~nt~~~i~~G~~~-~~i~~~p~~p~e~I~~~~~~~~ 383 (395) T protein:vir:98 336 GYIV---EGKCWFD-EESNDKETLKAGKLY-IDYDYTPVPPLESLTLRQRITD 383 (395) T ss_pred Ccee---ceEEEEe-cCCCCHHHhhCCeEE-EEEEEEecCCcceEEEEEEEch Confidence 9997 4778774 467799999999987 9999999999999999999988 No 23 >protein:vir:105470 Length: 451 # NCBI annotation: putative phage sheath tail protein # Family: family:all:632 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529880;genbank:gi:90592620;genbank:GeneID:3974534 Probab=99.19 E-value=6.8e-10 Score=70.83 Aligned_cols=403 Identities=10% Similarity=-0.003 Sum_probs=200.7 Q ss_pred CCC-----CCceE-EEeeeeeccc---ccccccceEEEEcccC-CCccceEEeeCHHHHHhhcC--CChHHHHHHHHHHh Q lcl|NC_019918. 1 MTV-----LTDVI-DIQISRETAA---VAQTNFNVPLFIASHT-NFSERARVYNSLKGVAEDFG--ESDPTYLAAVRYFG 68 (428) Q Consensus 1 M~~-----is~iV-~V~i~~~~~~---~~~~~f~~~li~~~~~-~~~~~~~~y~s~~~V~~~fg--~~s~eY~aA~~~F~ 68 (428) |+= -+|+. -|.|+..+.+ +...+=...++++-.. -++.....-.+-+|...-|| .+++.|++-+.+|. T Consensus 1 magg~~~~~~K~~PGvYi~~~~~~~~~~~~~~~~~~~~i~~~~~~g~~~~v~i~~~~d~~~~fG~~~~~~~~~~~~~~~~ 80 (451) T protein:vir:10 1 MAGGTWKAQDKRRPGTYINVVGNGQREAASSLGRVLLIRDKGLGWGKNGVIEVEANSDFTKKLGTTLDDPSLTALKETLK 80 (451) T ss_pred CCceeeccceeecCceEEEEeccCcceeeccCCcEEEEEeeecCCCCcccEEeecHHHHHHHcCCcccchhHHHHHHHhc Confidence 551 12221 1222222222 1122223455555332 22333445667788988898 45667887777775 Q ss_pred cCCcccEEEEEeeecc-cccccch--heeecccccccccceeeeeee--cccchhhhhhhhheeeecccceEEEEe---- Q lcl|NC_019918. 69 QALKPRSLVIGRRQVP-SATVSVS--VVQEGQSYVLTVNGLPVSYVS--HQDDTATLIATGLKAAYDVTPVVGVTV---- 139 (428) Q Consensus 69 q~p~P~~l~igr~~~~-~~~~~~~--~~~~~~~~~~~v~g~~~s~~~--~~~~~a~~i~a~l~~a~~~~~~~~~~~---- 139 (428) + |+++++.|-... .+..+.. .+..-..+-. .-|-++..+. ...+..... +..-.+....-..+. 
T Consensus 81 g---~~~v~~yrl~~g~~a~~t~~~~~~~~~Aky~G-~~Gn~i~v~v~~~~~d~~~~~---v~t~~g~~~vd~qtv~~~~ 153 (451) T protein:vir:10 81 G---ASKVLVLNPNEGTAATLTKEGLPWTVTANYPG-EKGNQITVSVEVSPADQNAAT---VSTIFGTKLVDEQSIKFNE 153 (451) T ss_pred C---CcEEEEEEcCCCceEEEEeecCceEEEEeeCC-cCCceEEEEEecccCCcCceE---EEEEECCeEEEEEEeeccc Confidence 3 688999885321 1111110 0000000000 1111122211 111110000 001000111000000 Q ss_pred -ecc-ccceeeeeccccccccccceEEE-------EeeccccCHHHHHHHHHhcccCceEEEEecCCHHHHHHHHHHHhh Q lcl|NC_019918. 140 -TDN-EDGTLTVASNGDWSLKVSSNLTM-------AAAPSTEGWPATITAVQGENDEWYALSIDSHADDDIMAVATHIEG 210 (428) Q Consensus 140 -tt~-~~~~~t~as~~~~~~~~s~~~~~-------~~~~aa~~~~~al~~~~~~~~~w~~~~~~~~~~~~~~ala~~~~a 210 (428) ... ....+.................+ ....+.+...+++..+.....+|..+...+.+.+.+..+..|+.. T Consensus 154 ~~el~~nd~V~a~~~~~g~~~~~~~~~l~~~~~gg~~~~~~~~~~~~l~~~e~~~~n~l~~~~~~~~~~i~~~~~a~ik~ 233 (451) T protein:vir:10 154 LDKFKGNDYITAKVVEEGSSKPVAFTNVSGTLTGGTTTESNKVESLLNDALENEEYAVVTTAGFEPSSNMNKLVVEAVKR 233 (451) T ss_pred hhhccCCceEEEEecccccccceeeeecccccccccccCCccchHHHHHHhccceeeEEEEccCCCchHHHHHHHHHHHH Confidence 000 01111111000111111111111 112345667778887776654443333334445567788999985 Q ss_pred h----CCEEEEEecCcccccchhHHHHHHHHhcccCceEEEecC-CccchhHHHHHHHHHhccCCCceeeeeeeecCcc- Q lcl|NC_019918. 211 T----KKVFIGATAQANTKTSAENDIASRLVAAGFQRTALIYHP-NADAQFPECAWVGYQLQEQPGSNTWTHKALAAVD- 284 (428) Q Consensus 211 ~----~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~t~~~y~~-~~~~~~~~a~~~~~~~~~~~g~~t~~fk~~~Gv~- 284 (428) . ++++..+...........+.+. +....+...+. .-+....+++++|..+.... ....-||.++|+. 
T Consensus 234 ~r~~~g~~~~aVl~~~~~~~~d~egii------nv~n~~~~~dg~~~~~~~~~~~vAG~~Ag~~~-~~S~T~~~~~~~~~ 306 (451) T protein:vir:10 234 LRENEGRKVRGVIPTDADTTYNYEGIS------TVVNGYTLSDGTNVDVKDATGYFAGISASADV-ATSLTYFEVEDAVS 306 (451) T ss_pred HHHhcCCeEEEEecCccCCCCCCcceE------EeecceEecCceeechhhhHHHHHHHHccccc-ccCccceecCCcee Confidence 3 3444333322111111111110 01111111111 01223346777777776654 3556677888764 Q ss_pred c-cCCCHHHHHHHHhCCceEEEEEcCc-eeeecCEec----------CCchhHHHHHHHHHHHHHHHHHHHHHHhcCCCC Q lcl|NC_019918. 285 A-YRLTPTESTNLKNKNVTTFERVGGV-NRTFGGAMA----------GGEWIDVMIFVDWLEARMTERLWFRMANSKKIP 352 (428) Q Consensus 285 ~-~~~t~t~~~~l~~~~~n~y~~~~~~-~~~~~G~~~----------~G~~iD~~~~~dwl~~~lq~~l~~ll~~~~kip 352 (428) . ..++.+|++.+.++|..++....|. -.+-+|+.+ +...|-.++-.|-+.+.++..+-+.++ +|+| T Consensus 307 v~~~~t~~e~~~~i~~G~lvl~~~~g~~v~i~~~INTltt~~~~k~~~~~ki~vir~~D~i~~di~~~~~~~yi--Gk~~ 384 (451) T protein:vir:10 307 AYPKFDNEKTIKALDAGQIVFTTRPGQRVVIEQDINSLHKFTAEKPQAFSKNRVIRTLDEIATNTENTFERTYL--GNVG 384 (451) T ss_pred eeeeCCHHHHHHHHhCCeEEEEEEcCCeEEEEEccccceecCCCCCcchhhhhHHHHHHHHHHHHHHHhhhccc--eecC Confidence 2 5699999999999999877544443 334456532 123466777777777777665443333 6999 Q ss_pred cCHhHHHHHHHHHHHHHHHHHhcCceecCCceEEEeCchHhCCHHHHhccccCCeEEEEEECceEEEEEEEEEEe Q lcl|NC_019918. 353 YDAVGATILESEIRAQLNEGIRVGGLAEAPAPKVFVPDVLSMSPNMRAQRIFEGIEFEARLAGAIHFVHIRGTVT 427 (428) Q Consensus 353 ~~~~G~~~i~~~i~~~~~~~~~~G~I~~g~~~~v~~~~~~~~~~~dra~R~~~~i~~~~~~agaih~v~i~~~v~ 427 (428) =+.+|...+.+.|+.-|++..+.|.|.++....+....-. .+..--+++.+++-.++..+.+.+.|. 
T Consensus 385 N~~~gr~~~~~~i~~yl~~l~~~g~i~~~~~~d~~v~~~~--------~~~~v~v~~~v~pvdame~iy~t~~v~ 451 (451) T protein:vir:10 385 NNAAGRDLFKADRIAYLTSLQNRNMIQSFANTDITVEAGN--------DMDSIVVNLAVTPVDAMEKLYMTMVVR 451 (451) T ss_pred CCHHHHHHHHHHHHHHHHHHHhCCCccCCCccceEEeecC--------CCCEEEEEEEEEEEeeeeeEEEEEEEc Confidence 9999999999999999999999999987655444432211 133344899999999999999988888 No 24 >protein:vir:489 Length: 498 # NCBI annotation: putative sheath protein # Family: family:all:369 # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543096;swissprot:trembl:q8w623;genbank:gi:18249908;uniprot:Q8W623;genbank:GeneID:929698 Probab=99.18 E-value=3.5e-10 Score=72.43 Aligned_cols=411 Identities=15% Similarity=0.155 Sum_probs=217.0 Q ss_pred CC-CCCce----------EEEeeeeecccccccccceEEEEcccC----CCccceEEeeCHHHHHhhcCCChHHHHHHHH Q lcl|NC_019918. 1 MT-VLTDV----------IDIQISRETAAVAQTNFNVPLFIASHT----NFSERARVYNSLKGVAEDFGESDPTYLAAVR 65 (428) Q Consensus 1 M~-~is~i----------V~V~i~~~~~~~~~~~f~~~li~~~~~----~~~~~~~~y~s~~~V~~~fg~~s~eY~aA~~ 65 (428) |. ..+.| +.++-+....+... .-.|++|... .++.......|.++..+.||..|-...|++. T Consensus 1 M~IsF~~IP~~iRvP~~y~E~dns~A~~~~~~---qrvLiiGq~la~gt~~~~~~v~v~s~~~a~~~fG~GS~l~~M~~a 77 (498) T protein:vir:48 1 MTISFSAVPSDTLVPLFYAEMDNSAANTAVTS---APALLIGHASNDAAIEVNSLVLMPSADYARQICGAGSQLARMVDV 77 (498) T ss_pred CCccccccCcccccceEEEEEecCCCccccCC---cceEEEeecCccccccccceEEecCHHHHHHhcCcccHHHHHHHH Confidence 54 23322 22222221111111 2467776543 2333334445889999999999999999999 Q ss_pred HHhcCCcccEEEEEeeeccccc----ccchh-eeecccccccccceeeeeeecccchhhhhhhhheeeecccceEEEEee Q lcl|NC_019918. 66 YFGQALKPRSLVIGRRQVPSAT----VSVSV-VQEGQSYVLTVNGLPVSYVSHQDDTATLIATGLKAAYDVTPVVGVTVT 140 (428) Q Consensus 66 ~F~q~p~P~~l~igr~~~~~~~----~~~~~-~~~~~~~~~~v~g~~~s~~~~~~~~a~~i~a~l~~a~~~~~~~~~~~t 140 (428) |....|--.--+|+--+.+... +++++ .+..+.....|.|.-+.......++++.+++++..++++....-.++. 
T Consensus 78 ~~~~n~~~~l~~i~~~D~ag~aA~g~it~tg~at~~G~l~l~Igg~~v~v~V~~gdTaa~vA~al~aai~a~~~lPVTA~ 157 (498) T protein:vir:48 78 YRQTDPFGELYVIAVPEARGAAATVRVTVTGEAEESGTLSLYVGRSSVQVPVVNGDDATAVATAIKEAVNGVITLPFAAS 157 (498) T ss_pred HHHhCCCceeEEEeeCCcccceeEEEEEecccccCCceEEEEECCEEEEEeecCCCCHHHHHHHHHHHHhCCCCcceEEE Confidence 9988764433334322211111 11111 123445566788888888788888999998888877766544433333 Q ss_pred ccc-cceeeeecccccccc---------------ccceEEE-----EeeccccCHHHHHHHHHhcccCce-EEEEecCCH Q lcl|NC_019918. 141 DNE-DGTLTVASNGDWSLK---------------VSSNLTM-----AAAPSTEGWPATITAVQGENDEWY-ALSIDSHAD 198 (428) Q Consensus 141 t~~-~~~~t~as~~~~~~~---------------~s~~~~~-----~~~~aa~~~~~al~~~~~~~~~w~-~~~~~~~~~ 198 (428) ... ..+.+..-.+..+-. ...++.+ +.+....++.++++.+.+ .|| ++.+...+. T Consensus 158 ~~~~~VtlTAr~kG~~GN~I~l~~~~~~~~~ge~~p~Glt~~itamsgGag~PDia~aLaal~~---~~~~~I~~p~~D~ 234 (498) T protein:vir:48 158 SDAGVVTLTARHKGLYGNELPVCLNYYGSGGGEILPAGLQVVTEAGTAGSGAPDLTAAVAAMGD---EAFDFIGLPFNDA 234 (498) T ss_pred ecCcEEEEEeeecccccccceeeeeeccCcccccccceeeEEEEcccCCccCcchHHHHHhhcc---CCccEEEEeecCH Confidence 222 112221111111100 0112221 223444566767776654 454 555555666 Q ss_pred HHHHHHHHHHhh-------hCCEEEEEecCcccccchhHHHHHHHHhcccCceEEEe-cCCccch--hHHHHHHHHHh-- Q lcl|NC_019918. 199 DDIMAVATHIEG-------TKKVFIGATAQANTKTSAENDIASRLVAAGFQRTALIY-HPNADAQ--FPECAWVGYQL-- 266 (428) Q Consensus 199 ~~~~ala~~~~a-------~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~t~~~y-~~~~~~~--~~~a~~~~~~~-- 266 (428) +.+.++..+.+. -+.++....... .++...+...-...|..|..++. +.....+ ...+++++++. 
T Consensus 235 asl~al~~~L~~~sgRw~~~~q~~g~~~~a~---~gT~~~l~t~g~~~N~~~it~~~~~~~~~~p~~~~AAa~a~~aA~~ 311 (498) T protein:vir:48 235 ASINMMMTEMNDSSGRWSYARQLYGHVYTAK---LGTLSELVNAGDMHNQQHITLAGYEKETQSPVDELVASRLAREAVF 311 (498) T ss_pred HHHHHHHHHHhhhhhhhhHHhhcCeEEEEec---cCCHHHHHHhhhccCCceEEEEecCCCCCChHHHHHHHHHHHHHHh Confidence 667777777753 223332221111 11122333344455666665554 4333212 23445555544 Q ss_pred -ccCCCceeeeeeeecCcccc----CCCHHHHHHHHhCCceEEEEEcCceeeecCEec-----CC----chhH--HHHHH Q lcl|NC_019918. 267 -QEQPGSNTWTHKALAAVDAY----RLTPTESTNLKNKNVTTFERVGGVNRTFGGAMA-----GG----EWID--VMIFV 330 (428) Q Consensus 267 -~~~~g~~t~~fk~~~Gv~~~----~~t~t~~~~l~~~~~n~y~~~~~~~~~~~G~~~-----~G----~~iD--~~~~~ 330 (428) ..+|. -.+.=-.|+||.|. .++.+|.+.|..+|+..+...+|.-.+.+.++. .| .|.| .++-. T Consensus 312 l~~DPA-rPLqtl~L~Gi~~p~~~~r~~~~ern~LL~~Gist~~V~~G~V~I~R~ITTY~~n~~G~~D~syLDi~T~~tl 390 (498) T protein:vir:48 312 IRNDPA-RPTQTGELVGMLPAPKGKRFIMTEQQTLLSHGVATAYVEGGTLRIQRSVTTYKKNAYGVADNSYLDSETLHTS 390 (498) T ss_pred hhcccc-ccccceeeeccccCCchhcCChHHHHHHHhcCcceEEEcCCeEEEEeeeeeeeecCCCCcchhhhhhhhHHHH Confidence 66674 34444567888854 468999999999999998766666555555543 45 3766 88999 Q ss_pred HHHHHHHHHHHHHHHHhcCCCCcCHh---------HHHHHHHHHHHHHHHHHhcCceecCCceEEEe-CchHhCCHHHHh Q lcl|NC_019918. 331 DWLEARMTERLWFRMANSKKIPYDAV---------GATILESEIRAQLNEGIRVGGLAEAPAPKVFV-PDVLSMSPNMRA 400 (428) Q Consensus 331 dwl~~~lq~~l~~ll~~~~kip~~~~---------G~~~i~~~i~~~~~~~~~~G~I~~g~~~~v~~-~~~~~~~~~dra 400 (428) +++...++..+...|-. .|+.-+.. --.+|++.+-..+++....|++..-+.|.-.. -.+.. ..-+|- T Consensus 391 ~yvr~~~r~~i~~kfpR-~KLa~dg~~~~~gq~IvTp~~ir~eli~~y~~le~~given~~~~~~~LiVerd~-~dpnRl 468 (498) T protein:vir:48 391 AYVLRKLKSVITSKYGR-HKLANDGTRFGPGQAIVTPAVIKGELLATYRQMERAGIVENYDLFKQYLIVERDA-DNPNRL 468 (498) T ss_pred HHHHHHHHHHhhhhcCC-ceecccCcccCCCCcccchHHHHHHHHHHHHhhhhhccccChhhhcceeEEEECC-CCCcEE Confidence 99999999998876633 34333222 23678999999999999999997553322100 01110 000111 Q ss_pred ccccCCeEEEEEECceEEEEEEEEEEec Q lcl|NC_019918. 
401 QRIFEGIEFEARLAGAIHFVHIRGTVTV 428 (428) Q Consensus 401 ~R~~~~i~~~~~~agaih~v~i~~~v~~ 428 (428) +=..| ..+-...|-+-.+...-| T Consensus 469 n~~~p-----~d~vn~L~V~A~~~~f~l 491 (498) T protein:vir:48 469 NTLFP-----PDYVNQLRVFAVVNQFRL 491 (498) T ss_pred EEEec-----ccccCchhhhhhhhhhhh Confidence 11111 111122221111111111 No 25 >protein:vir:10336 Length: 386 # NCBI annotation: ORF39 # Family: family:all:115 # MgeID: mge:182 # MgeName: VHML # Cross-refs: genbank:acc:NP_758931;genbank:gi:27311206;genbank:GeneID:956116 Probab=99.15 E-value=4.6e-10 Score=71.78 Aligned_cols=353 Identities=14% Similarity=0.107 Sum_probs=197.2 Q ss_pred CC-CCCceEEEe-eeeecccccccccceEEEEcccCC------CccceEEeeCHHHHHhhcCCChHHHHHHHHHHhcCCc Q lcl|NC_019918. 1 MT-VLTDVIDIQ-ISRETAAVAQTNFNVPLFIASHTN------FSERARVYNSLKGVAEDFGESDPTYLAAVRYFGQALK 72 (428) Q Consensus 1 M~-~is~iV~V~-i~~~~~~~~~~~f~~~li~~~~~~------~~~~~~~y~s~~~V~~~fg~~s~eY~aA~~~F~q~p~ 72 (428) || ....=|.|. +.-.+.++...+-..+.|++.+.. +...-...++..+....||.....+.+...+|.+... T Consensus 1 M~~~~~~Gv~v~ev~~~~~~i~~v~tav~~~vg~a~~a~~~~~~~~~pv~i~s~~~~~~~~g~~~tl~~a~~~~~~~gg~ 80 (386) T protein:vir:10 1 MAEQYLHGAEVVEIDNGARPIRTAQSGVIGLVGTAPDADATAFPLNTPVLIAGSRREAAKLGAGGTLPQAIDGIFDQTGA 80 (386) T ss_pred CccccCCCeEEEEcCCCcccccccCcceeEEEEecCCCCCcccccccceEecchHHHHhhcCCCcchhHHHHHHhccCce Confidence 99 343333332 222344556666677777775432 2233455667777777899999999999999988754 Q ss_pred ccEEEEEeeecccccccchheeecccccccccceeeeeeecccchhhhhhhhheeeecccceEEEEeeccccceeeeecc Q lcl|NC_019918. 73 PRSLVIGRRQVPSATVSVSVVQEGQSYVLTVNGLPVSYVSHQDDTATLIATGLKAAYDVTPVVGVTVTDNEDGTLTVASN 152 (428) Q Consensus 73 P~~l~igr~~~~~~~~~~~~~~~~~~~~~~v~g~~~s~~~~~~~~a~~i~a~l~~a~~~~~~~~~~~tt~~~~~~t~as~ 152 (428) + +++-+....... .. +. ..+.|.....+ ...+.+. .+..... ... . 
T Consensus 81 ~--~~vv~~~~~~~~-~~---t~-----~~~ig~~~~~t----~~~tgl~-~l~~~~~---~~~------------~--- 126 (386) T protein:vir:10 81 V--VVVIRVDEGVDS-AA---TQ-----SNVIGKVDADT----EQYTGIL-ALLSAEN---TVK------------V--- 126 (386) T ss_pred e--EEEeeccccccc-cc---cc-----hhhhccccccc----chhhhhH-Hhhhhcc---ccc------------c--- Confidence 3 333221110000 00 00 00000000000 0000000 0000000 000 0 Q ss_pred ccccccccceEEEEeeccccCHHHHHHHHHhcccCceEEEEecCCHHHHHHHHHHHhhhCCEEEEEecCcccccchhHHH Q lcl|NC_019918. 153 GDWSLKVSSNLTMAAAPSTEGWPATITAVQGENDEWYALSIDSHADDDIMAVATHIEGTKKVFIGATAQANTKTSAENDI 232 (428) Q Consensus 153 ~~~~~~~s~~~~~~~~~aa~~~~~al~~~~~~~~~w~~~~~~~~~~~~~~ala~~~~a~~~~~~~~~~~~~~~~~~~~~~ 232 (428) ...+.. ...-.......+.+....+.+-++...+..........+|.+.-+..+......--. T Consensus 127 -------~p~i~~--ap~~~~~~~v~~~l~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~s~~~~~~~p~~~-------- 189 (386) T protein:vir:10 127 -------QPRILI--APGFSNQKAVADQLVSVADTAAWLCHSGWSNTTDAAAITYRELFGSRRCEVVDPWYK-------- 189 (386) T ss_pred -------cccccc--cccccchhHHHHHHHHhhcceEEEEEeCCCCCchHHHHHhhhcccccceEEecCcee-------- Confidence 000000 000111122333333333445444443332222223334555433333222111000 Q ss_pred HHHHHhcccCceEEEecC---CccchhHHHHHHHHHhccCC--C-ceeeeeeeecCccc--------cCCCHHHHHHHHh Q lcl|NC_019918. 233 ASRLVAAGFQRTALIYHP---NADAQFPECAWVGYQLQEQP--G-SNTWTHKALAAVDA--------YRLTPTESTNLKN 298 (428) Q Consensus 233 ~~~l~~~~~~~t~~~y~~---~~~~~~~~a~~~~~~~~~~~--g-~~t~~fk~~~Gv~~--------~~~t~t~~~~l~~ 298 (428) +|++ ...-..+.+.++|.....+. | ......|.+.||.- ...++.|.+.|.+ T Consensus 190 --------------v~~~~~~~~~~~p~s~~~ag~~a~~D~~~G~~~spaN~~l~gv~~~~~~~~~~~~~~~~~~~~l~~ 255 (386) T protein:vir:10 190 --------------VWDVETSAHIIQPPSARHAGVMAKVHNTLGFWWSNSNQEILGIDGLCRPVDFKLDDPTCRANLLNA 255 (386) T ss_pred --------------eeccccccceeechHHHHHHHHHHhhhcCCcEEccCCceeecccccceecccccccCcchhhhhhh Confidence 0110 00111244555555544332 2 23445677776652 2346889999999 Q ss_pred CCceEEEEEcCceeeecCEecCCc----hhHHHHHHHHHHHHHHHHHHHHHHhcCCCCcCHhHHHHHHHHHHHHHHHHHh Q lcl|NC_019918. 
299 KNVTTFERVGGVNRTFGGAMAGGE----WIDVMIFVDWLEARMTERLWFRMANSKKIPYDAVGATILESEIRAQLNEGIR 374 (428) Q Consensus 299 ~~~n~y~~~~~~~~~~~G~~~~G~----~iD~~~~~dwl~~~lq~~l~~ll~~~~kip~~~~G~~~i~~~i~~~~~~~~~ 374 (428) +|+|....-.| ..++.+++++++ ||-+.+-.+|+...|+..+...+-. |.+..-...|+..|+.-|+..++ T Consensus 256 ~gi~~~~~~~G-~~~wG~rT~~~d~~~~~i~vrR~~~~i~~~~~~~~~~~v~e----~~~~~~~~~i~~~i~~~L~~l~~ 330 (386) T protein:vir:10 256 KEVTTTIQQNG-FRVWGDRTCSADSKWAFKNVVITNDMIADSLVRNHLWAVDR----NITKTYVEDVTEGVNNYLRHLKN 330 (386) T ss_pred cCcEEEEcCCC-EEEEcccccCCCcccceeehhhHHHHHHHHHHHHHHHhccC----CCCHHHHHHHHHHHHHHHHHHHh Confidence 99998865333 567889998875 6778888888888888877765443 78999999999999999999999 Q ss_pred cCceecCCceEEEeCchHhCCHHHHhccccCCeEEEEEECceEEEEEEEEEEec Q lcl|NC_019918. 375 VGGLAEAPAPKVFVPDVLSMSPNMRAQRIFEGIEFEARLAGAIHFVHIRGTVTV 428 (428) Q Consensus 375 ~G~I~~g~~~~v~~~~~~~~~~~dra~R~~~~i~~~~~~agaih~v~i~~~v~~ 428 (428) .|.|. ||.|.+. .++.+++|+.+++.. +.+.+.....+++|.++...+. T Consensus 331 ~g~l~---g~~v~~d-~~~nt~~~~~~G~~~-~~i~~~p~~p~e~i~~~~~~~~ 379 (386) T protein:vir:10 331 IGAIA---GGECWVD-PELNSPDQIQQGKVY-FDYDFSAYAPAEHITFRSHMVN 379 (386) T ss_pred CCcee---eeEEEEc-ccCCCHHHhhCCeEE-EEEEEEecCCceeEEEEEEEeh Confidence 99997 5888887 678899999999988 9999999999999999999888 No 26 >protein:vir:4463 Length: 498 # NCBI annotation: Tail Sheath protein # Family: family:all:369 # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700386;genbank:gi:23505458;genbank:GeneID:955665 Probab=99.10 E-value=1.3e-09 Score=69.23 Aligned_cols=414 Identities=14% Similarity=0.148 Sum_probs=217.0 Q ss_pred CC-CCCce---EEE---eeeee-cccccccccceEEEEcccC----CCccceEEeeCHHHHHhhcCCChHHHHHHHHHHh Q lcl|NC_019918. 1 MT-VLTDV---IDI---QISRE-TAAVAQTNFNVPLFIASHT----NFSERARVYNSLKGVAEDFGESDPTYLAAVRYFG 68 (428) Q Consensus 1 M~-~is~i---V~V---~i~~~-~~~~~~~~f~~~li~~~~~----~~~~~~~~y~s~~~V~~~fg~~s~eY~aA~~~F~ 68 (428) |. ..+.| +.| -+.+. ..+....+-.-.||+|... .++......+|.++..+.||..|-...|++.|.. 
T Consensus 1 M~IsF~~IP~~iRvP~~y~E~dns~A~~~~~~q~vLiiGq~la~gs~~~~~~v~v~s~~~a~~~fG~GSml~~M~~a~~~ 80 (498) T protein:vir:44 1 MAISFNSIPSDTRVPLFYAEMDNSAANTARDSGASLLIGHASNDASIAVNSLVLVSSVDYARQICGAGSQLARMVGAYRK 80 (498) T ss_pred CCCchhhcCcccccCeEEEEEeCCCCCCCcCCcceEEEEecCcccccccceeEeecCHHHHHHhcCcccHHHHHHHHHHH Confidence 44 22222 111 11221 1223333334567776543 2333444446889999999999999999999998 Q ss_pred cCCcccEEEEEeeeccccc----ccchh-eeecccccccccceeeeeeecccchhhhhhhhheeeecccceEEEEeeccc Q lcl|NC_019918. 69 QALKPRSLVIGRRQVPSAT----VSVSV-VQEGQSYVLTVNGLPVSYVSHQDDTATLIATGLKAAYDVTPVVGVTVTDNE 143 (428) Q Consensus 69 q~p~P~~l~igr~~~~~~~----~~~~~-~~~~~~~~~~v~g~~~s~~~~~~~~a~~i~a~l~~a~~~~~~~~~~~tt~~ 143 (428) -.|--.--+|+--+.+... +++++ .+..+.....|.|.-+.......++++.+++++.+++++....-.+++... T Consensus 81 ~n~~~~l~~i~~~D~aG~aAtg~it~tg~at~~G~l~l~Igg~~v~v~V~~gdTaa~vA~al~aaina~~~lPVTA~~~~ 160 (498) T protein:vir:44 81 TDPFGELYVIAVPESTGAAATVALTVTGEATETGTVNVYTGRTRVQAPVTSGDDAAAVAVSIKDAVNANPDLPFTATSEA 160 (498) T ss_pred hCCCceeEEEecCCcccceeEEEEEeecccCCCcEEEEEECCEEEEEEecCCCCHHHHHHHHHHHHhCCCCCceEEeecc Confidence 8764333333322211111 11111 123344556788888888888888999999988887776544433333222 Q ss_pred -cceeeeeccccccc---------------cccceEE--EE---eeccccCHHHHHHHHHhcccCce-EEEEecCCHHHH Q lcl|NC_019918. 144 -DGTLTVASNGDWSL---------------KVSSNLT--MA---AAPSTEGWPATITAVQGENDEWY-ALSIDSHADDDI 201 (428) Q Consensus 144 -~~~~t~as~~~~~~---------------~~s~~~~--~~---~~~aa~~~~~al~~~~~~~~~w~-~~~~~~~~~~~~ 201 (428) ..+.|..-.+..+- ....++. +. .+....++.++++.+. +.|| ++.+...+.+.. 
T Consensus 161 ~~vtlTAr~kG~~GN~I~l~~~~~~~~~ge~~p~Glt~titamsgGag~PDia~alaal~---~~~~~~i~~p~~D~asl 237 (498) T protein:vir:44 161 GVVTLTARHKGLYGNEIPVTLNYYGFGGGEVLPAGVNITVASGVKGAGAPALNDAVAAMG---DEPFDYIGLPFNDTASV 237 (498) T ss_pred ceEEEEEeccCcccCcceEEEeeccCccccccccceeEEEEcccCCccCchhHHHHHhhc---cCCccEEEEeecCHHHH Confidence 12222111111110 0112222 22 2333345666666665 4454 445545566667 Q ss_pred HHHHHHHhhh-------CCEEEEEecCcccccchhHHHHHHHHhcccCceEEEec-CCccc--hhHHHHHHHHHh---cc Q lcl|NC_019918. 202 MAVATHIEGT-------KKVFIGATAQANTKTSAENDIASRLVAAGFQRTALIYH-PNADA--QFPECAWVGYQL---QE 268 (428) Q Consensus 202 ~ala~~~~a~-------~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~t~~~y~-~~~~~--~~~~a~~~~~~~---~~ 268 (428) .++..+.+.- +.++........ ++...+...-...|..|..++.+ ...+. ....|++++++. .. T Consensus 238 ~al~~~L~~~sgRw~~~~q~~g~~~~a~~---gT~a~l~t~g~~~N~~~it~~~~~~~~~sp~~~~AAa~a~~aA~~l~~ 314 (498) T protein:vir:44 238 NSMATEMNDSSGRWSYVRQLYGHVYTAKT---GTLSELVAAGDQFNLQHITLAGYEKDTQTPADELAASRTARAAVFIRN 314 (498) T ss_pred HHHHHHHhhhhcchHHHhhcCeEEEEecc---CCHHHHHHhhhccCCceEEEEecCCCCCCHHHHHHHHHHHHHHHHhhc Confidence 7777777531 223322221111 11122223334456666655544 33222 233445555554 66 Q ss_pred CCCceeeeeeeecCcccc----CCCHHHHHHHHhCCceEEEEEcCceeeecCEe-----cCC----chhH--HHHHHHHH Q lcl|NC_019918. 269 QPGSNTWTHKALAAVDAY----RLTPTESTNLKNKNVTTFERVGGVNRTFGGAM-----AGG----EWID--VMIFVDWL 333 (428) Q Consensus 269 ~~g~~t~~fk~~~Gv~~~----~~t~t~~~~l~~~~~n~y~~~~~~~~~~~G~~-----~~G----~~iD--~~~~~dwl 333 (428) +|. -.+.=-.|+|+.|. 
.++.+|.+.|..+|+..+..-.|.-.+.+.++ ..| .|.| .++-.+++ T Consensus 315 DPA-rPL~tl~L~Gi~~p~~~~r~~~~ern~LL~~Gist~~V~~G~V~I~R~ITTY~~n~~G~~D~syLDi~T~~tl~yv 393 (498) T protein:vir:44 315 DPA-RPTQTGELVDMLPAPKGKRFTTTEQQTLLSHGVATAYVESGVLRIQRDITTYRKNAYGVADNSYLDSETLHTSAYV 393 (498) T ss_pred ccc-cccCceeecccccCCchhcCChHHHHHHHhcCcceEEEcCCeEEEEeeeeeeeecCCCCcchhhhhhhhHHHHHHH Confidence 674 44455567888854 47899999999999999876666555566555 345 3776 88999999 Q ss_pred HHHHHHHHHHHHHhcCCCCcCH----h-----HHHHHHHHHHHHHHHHHhcCceecCCceEEEe-CchHhCCHHHHhccc Q lcl|NC_019918. 334 EARMTERLWFRMANSKKIPYDA----V-----GATILESEIRAQLNEGIRVGGLAEAPAPKVFV-PDVLSMSPNMRAQRI 403 (428) Q Consensus 334 ~~~lq~~l~~ll~~~~kip~~~----~-----G~~~i~~~i~~~~~~~~~~G~I~~g~~~~v~~-~~~~~~~~~dra~R~ 403 (428) ...++..+..-|-. .|+.=+. . --..|++.+-..+++....|++..-+.|.-.. -.+.. ..-+|-+=. T Consensus 394 r~~~r~~i~~kfpR-~KLa~d~~~~~~gq~IvTp~~ir~eli~~y~~le~~givEn~~~~~~~LiVerd~-~dpnRln~~ 471 (498) T protein:vir:44 394 LRRLKSVITSKYGR-HKLANDGTRFGSGQAIVTPAVIRGELGSTYRQMEREGIVENFDLFQQHLIVERNA-NDSNRLDVL 471 (498) T ss_pred HHHHHHHhhhhcCC-cccccCCcccCCCcccccHHHHHHHHHHHHHhhhhhccccChhhhcceeEEEECC-CCCcEEEEE Confidence 99999999765533 3333221 1 23578999999999999999997543322100 01110 111111111 Q ss_pred cCCeEEEEEECceEEEEEEEEEEec Q lcl|NC_019918. 
404 FEGIEFEARLAGAIHFVHIRGTVTV 428 (428) Q Consensus 404 ~~~i~~~~~~agaih~v~i~~~v~~ 428 (428) .| ..+-+..|-+-.+...-| T Consensus 472 ~p-----~d~vn~L~V~A~~~~f~l 491 (498) T protein:vir:44 472 FP-----PDYVNQLRVFAVLNQFRL 491 (498) T ss_pred ec-----ccccCchhhhhhhhhhhh Confidence 11 112222222211111111 No 27 >protein:vir:78206 Length: 390 # NCBI annotation: gp33, phage tail sheath protein # Family: family:all:115 # MgeID: mge:1848 # MgeName: phiE12-2 # Cross-refs: genbank:acc:YP_001111183;genbank:gi:134288694;genbank:GeneID:4960666 Probab=99.08 E-value=8.1e-10 Score=70.43 Aligned_cols=352 Identities=13% Similarity=0.066 Sum_probs=198.9 Q ss_pred CCC-CCceEEE-eeeeecccccccccceEEEEcccCC------CccceEEeeCHHHHHhhcCCChHHHHHHHHHHhcCCc Q lcl|NC_019918. 1 MTV-LTDVIDI-QISRETAAVAQTNFNVPLFIASHTN------FSERARVYNSLKGVAEDFGESDPTYLAAVRYFGQALK 72 (428) Q Consensus 1 M~~-is~iV~V-~i~~~~~~~~~~~f~~~li~~~~~~------~~~~~~~y~s~~~V~~~fg~~s~eY~aA~~~F~q~p~ 72 (428) ||. .-.=|.| .+...+.++...+...+.|++...+ +-..-...++..+....||.....+.+...+|.+... T Consensus 1 M~~~~~~Gv~v~e~~~g~~~i~~~~tav~g~vg~a~~ad~~~~pln~pv~i~s~~~~~~~~g~~gtL~~al~~~~~~gg~ 80 (390) T protein:vir:78 1 MPQDYHHGVRVIEINEGGRPIRSVSTAVLGVVCTAADADASAFPLNTPVLLTNVVAALGKAGKKGTLRRTLDAIGKQTKP 80 (390) T ss_pred CcccccCCeEEEEcCCCcccccccCcceeEEEEcccCcCccccccccceEeccHHHHHhhcCCCceehhhhhhhccccCc Confidence 883 2111333 3344556677777777778775432 2222345678888888899999999999999988765 Q ss_pred ccEEEEEeeecccccccchheeecccccccccceeeeeeecccchhhhhhhhheeeecccceEEEEeeccccceeeeecc Q lcl|NC_019918. 73 PRSLVIGRRQVPSATVSVSVVQEGQSYVLTVNGLPVSYVSHQDDTATLIATGLKAAYDVTPVVGVTVTDNEDGTLTVASN 152 (428) Q Consensus 73 P~~l~igr~~~~~~~~~~~~~~~~~~~~~~v~g~~~s~~~~~~~~a~~i~a~l~~a~~~~~~~~~~~tt~~~~~~t~as~ 152 (428) +. ++-+.... .....+ ...+.|.... .+..+.+ ..+...... 
T Consensus 81 ~~--~vv~v~~~-~~~~~~--------~~~~ig~~~~-----~~~~tg~-~al~~~~~~--------------------- 122 (390) T protein:vir:78 81 LT--VVVRVAEG-KDADET--------TSNVIGTVTP-----DGKYTGI-KALLAAQGA--------------------- 122 (390) T ss_pred eE--EEEEeccc-cccccc--------cccccccccc-----ccccchh-hhhhhhhhh--------------------- Confidence 43 33221100 000000 0000000000 0000000 000000000 Q ss_pred ccccccccceEEEEeeccccCHHHHHHHHHhcccCceEEEEecCCHHHHHHHHHHHhhhCCEEEEEecCcccccchhHHH Q lcl|NC_019918. 153 GDWSLKVSSNLTMAAAPSTEGWPATITAVQGENDEWYALSIDSHADDDIMAVATHIEGTKKVFIGATAQANTKTSAENDI 232 (428) Q Consensus 153 ~~~~~~~s~~~~~~~~~aa~~~~~al~~~~~~~~~w~~~~~~~~~~~~~~ala~~~~a~~~~~~~~~~~~~~~~~~~~~~ 232 (428) ......+.+..+........++..+.+.-. .+..++........++.+|.+..+..+....+.-- T Consensus 123 ----~~~~p~il~ap~~~~~~v~~~l~~~a~~~~--~~aivD~p~~~t~~~a~~~~~~~~s~~~~~~~p~~--------- 187 (390) T protein:vir:78 123 ----LGVKPRILAAPGLDTQPVAAALAATAQSLR--AMAYVSASGCKTKEEAAAYRKQFGQREIMVIWPDW--------- 187 (390) T ss_pred ----hcceehhhcccccchHHHHHHHHHhhcccc--eEEEEecCCCCCHHHHHHHhhccCCceEEEEcCce--------- Confidence 000001111111112222233333322111 22333333233333444555543333322221100 Q ss_pred HHHHHhcccCceEEEecCC---ccchhHHHHHHHHHhccCCCc---eeeeeeeecCccc--------cCCCHHHHHHHHh Q lcl|NC_019918. 233 ASRLVAAGFQRTALIYHPN---ADAQFPECAWVGYQLQEQPGS---NTWTHKALAAVDA--------YRLTPTESTNLKN 298 (428) Q Consensus 233 ~~~l~~~~~~~t~~~y~~~---~~~~~~~a~~~~~~~~~~~g~---~t~~fk~~~Gv~~--------~~~t~t~~~~l~~ 298 (428) .+|++. .....+.+.++|.....+.-. ....+|.+.|+.- ...+..|.+.|.. T Consensus 188 -------------~~~d~~~~~~~~~p~s~~~Agl~a~~D~~~g~~~spaN~~l~gi~~~~~~~~~~~~~~~~~~~~ln~ 254 (390) T protein:vir:78 188 -------------LGWDDTTNSTAVIPAPAIAAGLRAKIDNDIGWHKTISNVVVNGVSGISADVSWDLQDPATDAGYLNE 254 (390) T ss_pred -------------EeecccCCcccccchHHHHHHHHHHhhcCCCcEECcCCceeeceeecceecccccccccchhhhhhh Confidence 011110 011234566666655444322 3345677776663 2345677889999 Q ss_pred CCceEEEEEcCceeeecCEecCCc----hhHHHHHHHHHHHHHHHHHHHHHHhcCCCCcCHhHHHHHHHHHHHHHHHHHh Q lcl|NC_019918. 
299 KNVTTFERVGGVNRTFGGAMAGGE----WIDVMIFVDWLEARMTERLWFRMANSKKIPYDAVGATILESEIRAQLNEGIR 374 (428) Q Consensus 299 ~~~n~y~~~~~~~~~~~G~~~~G~----~iD~~~~~dwl~~~lq~~l~~ll~~~~kip~~~~G~~~i~~~i~~~~~~~~~ 374 (428) +|+|.+....| ..++.+++++++ ||-+.+-.+|+...|+..+...+-. |.++.-...|+..++.-|+..++ T Consensus 255 ~gi~t~~~~~G-~~~wG~rT~s~d~~~~~i~~rR~~~~i~~~i~~~~~~~v~e----~n~~~~~~~i~~~i~~~L~~l~~ 329 (390) T protein:vir:78 255 HEVTTLVNRNG-FRFWGERTCSDDPKFAFENYTRTAQVAGDSIAEAQMPVVDG----PLNPSLARDIVESINGWFRQQVA 329 (390) T ss_pred cCcEEEEcCCC-EEEEcccccCCCcccceeehhhHHHHHHHHHHHHHHHhccC----CCCHHHHHHHHHHHHHHHHHHHh Confidence 99999976444 467899998874 7888888899999888887765433 88999999999999999999999 Q ss_pred cCceecCCceEEEeCchHhCCHHHHhccccCCeEEEEEECceEEEEEEEEEEec Q lcl|NC_019918. 375 VGGLAEAPAPKVFVPDVLSMSPNMRAQRIFEGIEFEARLAGAIHFVHIRGTVTV 428 (428) Q Consensus 375 ~G~I~~g~~~~v~~~~~~~~~~~dra~R~~~~i~~~~~~agaih~v~i~~~v~~ 428 (428) .|.|. ||.|.+. .++.+++|+.+.+.. +.+.+.....+++|+++...+. T Consensus 330 ~g~l~---g~~v~~d-~~~nt~~~i~~G~~~-~~v~~~p~~pae~I~~~~~~~~ 378 (390) T protein:vir:78 330 NGYLI---GGSAWID-PEPNTADILASGKAY-IDYDYTPVPPLENLVLRQRITD 378 (390) T ss_pred CCcee---eeEEEEc-cCCCCHHHhhCCeEE-EEEEEEecCCcceEEEEEEEch Confidence 99997 5888886 457899999999988 9999999999999999999888 No 28 >protein:vir:103993 Length: 390 # NCBI annotation: phage tail sheath protein # Family: family:all:115 # MgeID: mge:1665 # MgeName: phi52237 # Cross-refs: genbank:acc:YP_293729;genbank:gi:72537699;genbank:GeneID:3608123 Probab=99.08 E-value=8.1e-10 Score=70.43 Aligned_cols=352 Identities=13% Similarity=0.066 Sum_probs=198.9 Q ss_pred CCC-CCceEEE-eeeeecccccccccceEEEEcccCC------CccceEEeeCHHHHHhhcCCChHHHHHHHHHHhcCCc Q lcl|NC_019918. 1 MTV-LTDVIDI-QISRETAAVAQTNFNVPLFIASHTN------FSERARVYNSLKGVAEDFGESDPTYLAAVRYFGQALK 72 (428) Q Consensus 1 M~~-is~iV~V-~i~~~~~~~~~~~f~~~li~~~~~~------~~~~~~~y~s~~~V~~~fg~~s~eY~aA~~~F~q~p~ 72 (428) ||. .-.=|.| .+...+.++...+...+.|++...+ +-..-...++..+....||.....+.+...+|.+... 
T Consensus 1 M~~~~~~Gv~v~e~~~g~~~i~~~~tav~g~vg~a~~ad~~~~pln~pv~i~s~~~~~~~~g~~gtL~~al~~~~~~gg~ 80 (390) T protein:vir:10 1 MPQDYHHGVRVIEINEGGRPIRSVSTAVLGVVCTAADADASAFPLNTPVLLTNVVAALGKAGKKGTLRRTLDAIGKQTKP 80 (390) T ss_pred CcccccCCeEEEEcCCCcccccccCcceeEEEEcccCcCccccccccceEeccHHHHHhhcCCCceehhhhhhhccccCc Confidence 883 2111333 3344556677777777778775432 2222345678888888899999999999999988765 Q ss_pred ccEEEEEeeecccccccchheeecccccccccceeeeeeecccchhhhhhhhheeeecccceEEEEeeccccceeeeecc Q lcl|NC_019918. 73 PRSLVIGRRQVPSATVSVSVVQEGQSYVLTVNGLPVSYVSHQDDTATLIATGLKAAYDVTPVVGVTVTDNEDGTLTVASN 152 (428) Q Consensus 73 P~~l~igr~~~~~~~~~~~~~~~~~~~~~~v~g~~~s~~~~~~~~a~~i~a~l~~a~~~~~~~~~~~tt~~~~~~t~as~ 152 (428) +. ++-+.... .....+ ...+.|.... .+..+.+ ..+...... T Consensus 81 ~~--~vv~v~~~-~~~~~~--------~~~~ig~~~~-----~~~~tg~-~al~~~~~~--------------------- 122 (390) T protein:vir:10 81 LT--VVVRVAEG-KDADET--------TSNVIGTVTP-----DGKYTGI-KALLAAQGA--------------------- 122 (390) T ss_pred eE--EEEEeccc-cccccc--------cccccccccc-----ccccchh-hhhhhhhhh--------------------- Confidence 43 33221100 000000 0000000000 0000000 000000000 Q ss_pred ccccccccceEEEEeeccccCHHHHHHHHHhcccCceEEEEecCCHHHHHHHHHHHhhhCCEEEEEecCcccccchhHHH Q lcl|NC_019918. 153 GDWSLKVSSNLTMAAAPSTEGWPATITAVQGENDEWYALSIDSHADDDIMAVATHIEGTKKVFIGATAQANTKTSAENDI 232 (428) Q Consensus 153 ~~~~~~~s~~~~~~~~~aa~~~~~al~~~~~~~~~w~~~~~~~~~~~~~~ala~~~~a~~~~~~~~~~~~~~~~~~~~~~ 232 (428) ......+.+..+........++..+.+.-. .+..++........++.+|.+..+..+....+.-- T Consensus 123 ----~~~~p~il~ap~~~~~~v~~~l~~~a~~~~--~~aivD~p~~~t~~~a~~~~~~~~s~~~~~~~p~~--------- 187 (390) T protein:vir:10 123 ----LGVKPRILAAPGLDTQPVAAALAATAQSLR--AMAYVSASGCKTKEEAAAYRKQFGQREIMVIWPDW--------- 187 (390) T ss_pred ----hcceehhhcccccchHHHHHHHHHhhcccc--eEEEEecCCCCCHHHHHHHhhccCCceEEEEcCce--------- Confidence 000001111111112222233333322111 22333333233333444555543333322221100 Q ss_pred HHHHHhcccCceEEEecCC---ccchhHHHHHHHHHhccCCCc---eeeeeeeecCccc--------cCCCHHHHHHHHh Q lcl|NC_019918. 
233 ASRLVAAGFQRTALIYHPN---ADAQFPECAWVGYQLQEQPGS---NTWTHKALAAVDA--------YRLTPTESTNLKN 298 (428) Q Consensus 233 ~~~l~~~~~~~t~~~y~~~---~~~~~~~a~~~~~~~~~~~g~---~t~~fk~~~Gv~~--------~~~t~t~~~~l~~ 298 (428) .+|++. .....+.+.++|.....+.-. ....+|.+.|+.- ...+..|.+.|.. T Consensus 188 -------------~~~d~~~~~~~~~p~s~~~Agl~a~~D~~~g~~~spaN~~l~gi~~~~~~~~~~~~~~~~~~~~ln~ 254 (390) T protein:vir:10 188 -------------LGWDDTTNSTAVIPAPAIAAGLRAKIDNDIGWHKTISNVVVNGVSGISADVSWDLQDPATDAGYLNE 254 (390) T ss_pred -------------EeecccCCcccccchHHHHHHHHHHhhcCCCcEECcCCceeeceeecceecccccccccchhhhhhh Confidence 011110 011234566666655444322 3345677776663 2345677889999 Q ss_pred CCceEEEEEcCceeeecCEecCCc----hhHHHHHHHHHHHHHHHHHHHHHHhcCCCCcCHhHHHHHHHHHHHHHHHHHh Q lcl|NC_019918. 299 KNVTTFERVGGVNRTFGGAMAGGE----WIDVMIFVDWLEARMTERLWFRMANSKKIPYDAVGATILESEIRAQLNEGIR 374 (428) Q Consensus 299 ~~~n~y~~~~~~~~~~~G~~~~G~----~iD~~~~~dwl~~~lq~~l~~ll~~~~kip~~~~G~~~i~~~i~~~~~~~~~ 374 (428) +|+|.+....| ..++.+++++++ ||-+.+-.+|+...|+..+...+-. |.++.-...|+..++.-|+..++ T Consensus 255 ~gi~t~~~~~G-~~~wG~rT~s~d~~~~~i~~rR~~~~i~~~i~~~~~~~v~e----~n~~~~~~~i~~~i~~~L~~l~~ 329 (390) T protein:vir:10 255 HEVTTLVNRNG-FRFWGERTCSDDPKFAFENYTRTAQVAGDSIAEAQMPVVDG----PLNPSLARDIVESINGWFRQQVA 329 (390) T ss_pred cCcEEEEcCCC-EEEEcccccCCCcccceeehhhHHHHHHHHHHHHHHHhccC----CCCHHHHHHHHHHHHHHHHHHHh Confidence 99999976444 467899998874 7888888899999888887765433 88999999999999999999999 Q ss_pred cCceecCCceEEEeCchHhCCHHHHhccccCCeEEEEEECceEEEEEEEEEEec Q lcl|NC_019918. 375 VGGLAEAPAPKVFVPDVLSMSPNMRAQRIFEGIEFEARLAGAIHFVHIRGTVTV 428 (428) Q Consensus 375 ~G~I~~g~~~~v~~~~~~~~~~~dra~R~~~~i~~~~~~agaih~v~i~~~v~~ 428 (428) .|.|. ||.|.+. .++.+++|+.+.+.. +.+.+.....+++|+++...+. 
T Consensus 330 ~g~l~---g~~v~~d-~~~nt~~~i~~G~~~-~~v~~~p~~pae~I~~~~~~~~ 378 (390) T protein:vir:10 330 NGYLI---GGSAWID-PEPNTADILASGKAY-IDYDYTPVPPLENLVLRQRITD 378 (390) T ss_pred CCcee---eeEEEEc-cCCCCHHHhhCCeEE-EEEEEEecCCcceEEEEEEEch Confidence 99997 5888886 457899999999988 9999999999999999999888 No 29 >protein:vir:96740 Length: 388 # NCBI annotation: phage tail sheath protein FI-like # Family: family:all:115 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039839;genbank:gi:126010913;genbank:GeneID:5076250 Probab=99.08 E-value=2.7e-09 Score=67.60 Aligned_cols=348 Identities=11% Similarity=0.050 Sum_probs=194.1 Q ss_pred CCCCCce---EE-EeeeeecccccccccceEEEEcccCCC-----ccceEEeeCHHHHHhh---cCCChHHHHHHHHHHh Q lcl|NC_019918. 1 MTVLTDV---ID-IQISRETAAVAQTNFNVPLFIASHTNF-----SERARVYNSLKGVAED---FGESDPTYLAAVRYFG 68 (428) Q Consensus 1 M~~is~i---V~-V~i~~~~~~~~~~~f~~~li~~~~~~~-----~~~~~~y~s~~~V~~~---fg~~s~eY~aA~~~F~ 68 (428) ||-.++. |. +.+.-.+.++...+...+.+++...+. ...-....+..+.... .+.....+.+...+|. T Consensus 1 m~~~~~~~hGv~v~ev~~g~~~i~~~~tavi~~Vgta~~ad~~~p~~~~~~i~~~~d~~~~~~~~~~~gtl~~al~~~~~ 80 (388) T protein:vir:96 1 MPVIDQFEHNGISIETHEPPPPMGPPGDNVVAWVVTAPDKHADVAFSVPFRVANTADAQYLDSTGNELGTGWHAASETLK 80 (388) T ss_pred CCCCCCCCCceEEEEcCCCcccccccCcceeEEEEecCCCccccccccceeeecchhhhhhhccccccccchhhhHhhhc Confidence 8866663 22 244445567777777877787765332 1111222333444333 3445666788888887 Q ss_pred cCCcccEEEEEeeecccc-cccchheeecccccccccceeeeeeecccchhhhhhhhheeeecccceEEEEeecccccee Q lcl|NC_019918. 69 QALKPRSLVIGRRQVPSA-TVSVSVVQEGQSYVLTVNGLPVSYVSHQDDTATLIATGLKAAYDVTPVVGVTVTDNEDGTL 147 (428) Q Consensus 69 q~p~P~~l~igr~~~~~~-~~~~~~~~~~~~~~~~v~g~~~s~~~~~~~~a~~i~a~l~~a~~~~~~~~~~~tt~~~~~~ 147 (428) +...+ +++-+....+. ..+. ..+.|. .....+ ..+.+.+... 
T Consensus 81 ~~~~~--~~vv~v~~g~~~~at~----------a~iig~----~~~~tg----~~~gl~al~~----------------- 123 (388) T protein:vir:96 81 KTSVP--QYFIVVPEGADDAATM----------ANIIGG----IDPTTG----RRTGIAALTE----------------- 123 (388) T ss_pred cCCce--EEEEEecccccccccc----------ceeeee----cccccc----hhhHHHHhhh----------------- Confidence 76433 23322111000 0000 000000 000000 0000000000 Q ss_pred eeeccccccccccceEEEEeec-cccCHHHHHHHHHhcccCceEEEEecC--CHHHHHHHHHHHhhhC--CEEEEEecCc Q lcl|NC_019918. 148 TVASNGDWSLKVSSNLTMAAAP-STEGWPATITAVQGENDEWYALSIDSH--ADDDIMAVATHIEGTK--KVFIGATAQA 222 (428) Q Consensus 148 t~as~~~~~~~~s~~~~~~~~~-aa~~~~~al~~~~~~~~~w~~~~~~~~--~~~~~~ala~~~~a~~--~~~~~~~~~~ 222 (428) ......+.+..+. .......+|..+.+.-+ .+.+++.. +..+..+...|....+ ..+....+.- T Consensus 124 ---------~~~~p~il~aPg~s~~~~v~~al~~~~~~~~--~~~i~D~p~~~~~~~~~~~~~~~~~~~~s~~~~~~~P~ 192 (388) T protein:vir:96 124 ---------CTERPTLIGAPGFSQNKAVIDALASMAKRLK--CRAVIDGPSGSTQDAIDLSGLLGGEGTGHDRVYMVDPM 192 (388) T ss_pred ---------cccceeEEEeeccccchHHHHHHHHHHhhcC--cEEEEeccCCchhHHHHHHhhhhccCcCcceEEEEeCc Confidence 0000111222221 11233344444443222 23333322 3333333344433221 1222211110 Q ss_pred ccccchhHHHHHHHHhcccCceEEEecCC---ccchhHHHHHHHHHhccCCCceeeeeeee--cCccc-----cCCCHHH Q lcl|NC_019918. 223 NTKTSAENDIASRLVAAGFQRTALIYHPN---ADAQFPECAWVGYQLQEQPGSNTWTHKAL--AAVDA-----YRLTPTE 292 (428) Q Consensus 223 ~~~~~~~~~~~~~l~~~~~~~t~~~y~~~---~~~~~~~a~~~~~~~~~~~g~~t~~fk~~--~Gv~~-----~~~t~t~ 292 (428) -. +|++. .-...+.+.++|.....++ ......|.+ .|+.- ...+.+| T Consensus 193 ~~----------------------~~d~~~~~~~~~p~s~~~AG~~a~~D~-~~spaN~~i~i~g~~~~~~~~~~~~~~~ 249 (388) T protein:vir:96 193 PA----------------------IYSRKAQGNIYVPPSTIAMGAVAAVKP-WESPGNQGVLIQDVARVIDYNILDKSTE 249 (388) T ss_pred ee----------------------eecccCCceeeechHHHHHHHHHhhcC-cccccCeeEEeeeecccccccccCChhh Confidence 00 11110 0112356677777766665 333334443 34431 2347789 Q ss_pred HHHHHhCCceEEEEEcCce-eeecCEecCCchhHHHHHHHHHHHHHHHHHHHHHHhcCCCCcCHhHHHHHHHHHHHHHHH Q lcl|NC_019918. 
293 STNLKNKNVTTFERVGGVN-RTFGGAMAGGEWIDVMIFVDWLEARMTERLWFRMANSKKIPYDAVGATILESEIRAQLNE 371 (428) Q Consensus 293 ~~~l~~~~~n~y~~~~~~~-~~~~G~~~~G~~iD~~~~~dwl~~~lq~~l~~ll~~~~kip~~~~G~~~i~~~i~~~~~~ 371 (428) ++.|..+|+|.+.++.+.+ .++.+++++..||-+.+-.+|++..|+..+...+- + |.++.=...|+..|+.-|+. T Consensus 250 ~~~Ln~~gI~~i~~~~~~G~~~wG~rT~~~~~i~vrR~~~~i~~si~~~~~~~v~---e-pn~~~~~~~i~~~i~~fL~~ 325 (388) T protein:vir:96 250 GDLLNRNGVSYFARTSMGGFSLIGNRTVTGKFISFVGLEDAIARKLEAASQRAMS---K-QLTKSFMEQEIKKINLFMQD 325 (388) T ss_pred HHhhhhcCceEEEEecCCcEEEEcccccCCcceeehhhHHHHHHHHHHHHHHhcc---C-CCCHHHHHHHHHHHHHHHHH Confidence 9999999999999997765 57999999999999999999999999888776543 3 77888899999999999999 Q ss_pred HHhcCceecCCceEEEeCchHhCCHHHHhccccCCeEEEEEECceEEEEEEEEEEec Q lcl|NC_019918. 372 GIRVGGLAEAPAPKVFVPDVLSMSPNMRAQRIFEGIEFEARLAGAIHFVHIRGTVTV 428 (428) Q Consensus 372 ~~~~G~I~~g~~~~v~~~~~~~~~~~dra~R~~~~i~~~~~~agaih~v~i~~~v~~ 428 (428) -++.|.|. ||.+.+ +.++.+++|+.+.+.. +.+.+.....+++|+++...+. T Consensus 326 l~~~Gal~---g~~~~~-d~~~nt~~~i~~G~~~-~~i~~~p~~pae~I~~~~~~~~ 377 (388) T protein:vir:96 326 LVAAEIIP---GGEVYL-HPTLNTVERYKNGSWY-IVIDYGRYSPNEHMIFHLNAVD 377 (388) T ss_pred HHhCCcee---eeEEEE-ecCCCCHHHhhCCEEE-EEEEEEecCCcceEEEEEEEch Confidence 99999997 467777 4567899999999887 9999999999999999999988 No 30 >protein:vir:79181 Length: 390 # NCBI annotation: gp30, phage tail sheath protein # Family: family:all:115 # MgeID: mge:1866 # MgeName: phiE202 # Cross-refs: genbank:acc:YP_001111061;genbank:gi:134288772;genbank:GeneID:4960700 Probab=99.05 E-value=2.3e-09 Score=67.96 Aligned_cols=350 Identities=11% Similarity=0.049 Sum_probs=196.1 Q ss_pred CC-CCCceEEE-eeeeecccccccccceEEEEcccCCC------ccceEEeeCHHHHHhhcCCChHHHHHHHHHHhcCCc Q lcl|NC_019918. 1 MT-VLTDVIDI-QISRETAAVAQTNFNVPLFIASHTNF------SERARVYNSLKGVAEDFGESDPTYLAAVRYFGQALK 72 (428) Q Consensus 1 M~-~is~iV~V-~i~~~~~~~~~~~f~~~li~~~~~~~------~~~~~~y~s~~~V~~~fg~~s~eY~aA~~~F~q~p~ 72 (428) |+ +.-.=|.| .+.-.+.++.......+.|++...+. 
...-...++..+....||.+.-.+.+...+|.+... T Consensus 1 M~~~~~~Gv~v~e~~~~~~~i~~~~tav~~~vg~a~dad~~~~p~n~pv~its~~~~~~~~g~~~tL~~al~~~~~~~~~ 80 (390) T protein:vir:79 1 MPQDYHHGVRVIEINEGGRPIRSVSTAVLGVVCTAADADASAFPLNTPVLLTNVVAALGKAGKKGTLRRTLDAIGKQTKP 80 (390) T ss_pred CccccCCCeEEEEcCCCcccccccCCceeEEEEecCCCCccccccccceEeecHHHHHHhcCCCccchhhhhhhcccccc Confidence 88 34322344 33445566777777777777765432 222244567777777799988888888888988654 Q ss_pred ccEEEEEeeecccc-cccchheeecccccccccceeeeeeecccchhhhhhhhheeeecccceEEEEeeccccceeeeec Q lcl|NC_019918. 73 PRSLVIGRRQVPSA-TVSVSVVQEGQSYVLTVNGLPVSYVSHQDDTATLIATGLKAAYDVTPVVGVTVTDNEDGTLTVAS 151 (428) Q Consensus 73 P~~l~igr~~~~~~-~~~~~~~~~~~~~~~~v~g~~~s~~~~~~~~a~~i~a~l~~a~~~~~~~~~~~tt~~~~~~t~as 151 (428) + +++-+...... ...... . +.+.+ .. ...+++...... T Consensus 81 ~--~~vv~v~~~~~~~~~~~~------~---ig~~~------~~----~~~tgl~al~~~-------------------- 119 (390) T protein:vir:79 81 L--TVVVRVAEGKDADETTSN------V---IGTVT------PD----GKYTGIKALLAA-------------------- 119 (390) T ss_pred e--EEEEeeccccccccccce------e---eeccc------cc----ccchhhhhhhhh-------------------- Confidence 3 33332211100 000000 0 00000 00 000000000000 Q ss_pred cccccccccceEEEEeeccccCHHHHHHHHHhcccCce-EEEEecCCHHHHHHHHHHHhhhCCEEEEEecCcccccchhH Q lcl|NC_019918. 152 NGDWSLKVSSNLTMAAAPSTEGWPATITAVQGENDEWY-ALSIDSHADDDIMAVATHIEGTKKVFIGATAQANTKTSAEN 230 (428) Q Consensus 152 ~~~~~~~~s~~~~~~~~~aa~~~~~al~~~~~~~~~w~-~~~~~~~~~~~~~ala~~~~a~~~~~~~~~~~~~~~~~~~~ 230 (428) .........+.+..+.......+++..+.+ .+. +.+++........++.+|.+.-+..+......--. 
T Consensus 120 --~~~~~~~p~il~ap~~~~~~v~~~l~~~a~---~~~~~ai~D~p~~~t~~~a~~~~~~~~s~~~~~~~p~~~------ 188 (390) T protein:vir:79 120 --QGALGVKPRILAAPGLDTQPVAAALAATAQ---SLRAMAYVSASGCKTKEEAAAYRRQFGQREIMVIWPDWL------ 188 (390) T ss_pred --hhhhccccccccCCcccchHHHHHHHHhhh---hcceEEEEEccCCCCHHHHHHHhcCCCCceEEEEcCcee------ Confidence 000000011111122222223333333332 232 22333222222233445554433333222211000 Q ss_pred HHHHHHHhcccCceEEEecC---CccchhHHHHHHHHHhccCCCceee---eeeeecCccc--------cCCCHHHHHHH Q lcl|NC_019918. 231 DIASRLVAAGFQRTALIYHP---NADAQFPECAWVGYQLQEQPGSNTW---THKALAAVDA--------YRLTPTESTNL 296 (428) Q Consensus 231 ~~~~~l~~~~~~~t~~~y~~---~~~~~~~~a~~~~~~~~~~~g~~t~---~fk~~~Gv~~--------~~~t~t~~~~l 296 (428) +|++ ..-...+.+.++|.+...+.-.-.| ..|.+.|+.. ...+..|++.| T Consensus 189 ----------------~~d~~~~~~~~~p~s~~~Ag~~a~~D~~~g~~~spsN~~i~gi~~~~~~~~~~~~~~~~~a~~L 252 (390) T protein:vir:79 189 ----------------GWDDTTNSTAVIPAPAIAAGLRAKIDNDIGWHKTISNVVVNGVSGISADVSWDLQDPATDAGYL 252 (390) T ss_pred ----------------ecccccCceeEeehHHHHHHHHHhhhccCCcEEccCCceeeccceeeeeccccccccchhhhhh Confidence 0000 0001224566666655555432233 3666766542 23456788899 Q ss_pred HhCCceEEEEEcCceeeecCEecCCc----hhHHHHHHHHHHHHHHHHHHHHHHhcCCCCcCHhHHHHHHHHHHHHHHHH Q lcl|NC_019918. 297 KNKNVTTFERVGGVNRTFGGAMAGGE----WIDVMIFVDWLEARMTERLWFRMANSKKIPYDAVGATILESEIRAQLNEG 372 (428) Q Consensus 297 ~~~~~n~y~~~~~~~~~~~G~~~~G~----~iD~~~~~dwl~~~lq~~l~~ll~~~~kip~~~~G~~~i~~~i~~~~~~~ 372 (428) ..+|+|...... -..++.+++++++ ||-+.+-.+|+...|+..+...+-. |.+..-...|+..++.-|++. T Consensus 253 n~~gi~t~~~~~-G~~~wG~rT~~~d~~~~~i~vrR~~~~i~~~i~~~~~~~v~e----~~~~~~~~~i~~~i~~~L~~l 327 (390) T protein:vir:79 253 NEHEVTTLVNRN-GFRFWGERTCSDDPKFAFENYTRTAQVAADSIAEAQMPVVDG----PLNPSLARDIVESINGWFRQQ 327 (390) T ss_pred hhcCcEEEEcCC-CEEEEeccccCCCcccceeeehhhHHHHHHHHHHHHHHhccC----CCCHHHHHHHHHHHHHHHHHH Confidence 999999986533 3467899998885 7778888889888888887765443 888999999999999999999 Q ss_pred HhcCceecCCceEEEeCchHhCCHHHHhccccCCeEEEEEECceEEEEEEEEEEec Q lcl|NC_019918. 
373 IRVGGLAEAPAPKVFVPDVLSMSPNMRAQRIFEGIEFEARLAGAIHFVHIRGTVTV 428 (428) Q Consensus 373 ~~~G~I~~g~~~~v~~~~~~~~~~~dra~R~~~~i~~~~~~agaih~v~i~~~v~~ 428 (428) +++|.|. ||.|.+. .++.+++|+.+.+.. +.+.+.....+++|+++...+. T Consensus 328 ~~~gal~---g~~v~~d-~~~nt~~~i~~G~~~-~~i~~~p~~p~e~i~~~~~~~~ 378 (390) T protein:vir:79 328 VANGYLI---GGSAWID-PEPNTADILASGKAY-IDYDYTPVPPLENLVLRQRITD 378 (390) T ss_pred HhCCcee---eeEEEEe-cCCCCHHHhhCCEEE-EEEEEEecCCcceEEEEEEEch Confidence 9999997 5778876 557799999999988 9999999999999999999888 No 31 >protein:vir:1172 Length: 391 # NCBI annotation: hypothetical protein # Family: family:all:115 # MgeID: mge:24 # MgeName: phi CTX # Cross-refs: genbank:acc:NP_490621;genbank:gi:17313241;genbank:GeneID:927300 Probab=99.04 E-value=1.5e-09 Score=68.99 Aligned_cols=355 Identities=12% Similarity=0.061 Sum_probs=194.7 Q ss_pred CCCCCceEEE---eeeeecccccccccceEEEEcccCCC------ccceEEeeCHHHHHhhcCCChHHHHHHHHHHhcCC Q lcl|NC_019918. 1 MTVLTDVIDI---QISRETAAVAQTNFNVPLFIASHTNF------SERARVYNSLKGVAEDFGESDPTYLAAVRYFGQAL 71 (428) Q Consensus 1 M~~is~iV~V---~i~~~~~~~~~~~f~~~li~~~~~~~------~~~~~~y~s~~~V~~~fg~~s~eY~aA~~~F~q~p 71 (428) |++-...--| .+.-.+.++.......+.+++..... ...-...++..+-...||.....+.+...+|.+.. T Consensus 1 M~~~~~~~GV~v~e~~~~~~~i~~v~tavig~vg~a~~a~~~~~~~~~p~~v~s~~~~~~~~g~~~tl~~al~~~~~~~g 80 (391) T protein:vir:11 1 MAADQYHHGVRVQEINDGTRPIRTIATAIIGLVATAEDADATAFPLDTPVLITNVQAAIGKAGTSGTLPASLQAIADQAN 80 (391) T ss_pred CCCCcCCCcEEEEECCCCcceecccCCceeEEEEecCCCCCccccccccEEEecchhhheecCCCccchhhhhhhhcccc Confidence 6653332222 22234455667777777777766432 12224556777777779999999999999998765 Q ss_pred cccEEEEEeeecccccccchheeecccccccccceeeeeeecccchhhhhhhhheeeecccceEEEEeeccccceeeeec Q lcl|NC_019918. 72 KPRSLVIGRRQVPSATVSVSVVQEGQSYVLTVNGLPVSYVSHQDDTATLIATGLKAAYDVTPVVGVTVTDNEDGTLTVAS 151 (428) Q Consensus 72 ~P~~l~igr~~~~~~~~~~~~~~~~~~~~~~v~g~~~s~~~~~~~~a~~i~a~l~~a~~~~~~~~~~~tt~~~~~~t~as 151 (428) .. +++-+.... ..... +. ..+.|. .........+.. +....... 
T Consensus 81 ~~--~~vv~~~~~-~~~~~---t~-----~d~~g~-----~~a~~~~~g~~a-~~~~~~~~------------------- 124 (391) T protein:vir:11 81 AA--TVVVRVKPG-EDEAA---TN-----SAVIGG-----VSADGKYTGMKA-LLAAKARL------------------- 124 (391) T ss_pred ce--eEEeeeccc-ccccc---cc-----hhhhcc-----cccccchhhhhh-hhhhhhhh------------------- Confidence 44 233221100 00000 00 000000 000000000000 00000000 Q ss_pred cccccccccceEEEEeeccccCHHHHHHHHHhcccCceEEEEecCCHHHHHHHHHHHhhhCCEEEEEecCcccccchhHH Q lcl|NC_019918. 152 NGDWSLKVSSNLTMAAAPSTEGWPATITAVQGENDEWYALSIDSHADDDIMAVATHIEGTKKVFIGATAQANTKTSAEND 231 (428) Q Consensus 152 ~~~~~~~~s~~~~~~~~~aa~~~~~al~~~~~~~~~w~~~~~~~~~~~~~~ala~~~~a~~~~~~~~~~~~~~~~~~~~~ 231 (428) .....+....+........++..+.+.-+ .+..++........++-+|-+.-+..+......--..... T Consensus 125 ------~~~p~~~~ap~~~~~~v~~al~~~~~~~~--~~~i~D~p~~~t~~~a~~~r~~~~s~~~~~~~p~~~~~~~--- 193 (391) T protein:vir:11 125 ------GVVPRILGVPGLDTQPVATALIAIAQQLR--AFAYVSASGCKTKEEATAYRENFAAREAMVIWPDFLTWST--- 193 (391) T ss_pred ------eeccccccccccccHHHHHHHHHhhcccc--eEEEEEcCCCCCHHHHHHHhhhcCCceEEEEcCcceeccc--- Confidence 00000111111112223334443333221 2233332222223333445543333332222110000000 Q ss_pred HHHHHHhcccCceEEEecCCccchhHHHHHHHHHhccCCCc---eeeeeeeecCccc--------cCCCHHHHHHHHhCC Q lcl|NC_019918. 232 IASRLVAAGFQRTALIYHPNADAQFPECAWVGYQLQEQPGS---NTWTHKALAAVDA--------YRLTPTESTNLKNKN 300 (428) Q Consensus 232 ~~~~l~~~~~~~t~~~y~~~~~~~~~~a~~~~~~~~~~~g~---~t~~fk~~~Gv~~--------~~~t~t~~~~l~~~~ 300 (428) .+... --..+.++++|.....+... .....|.+.|+.. ...++.|.+.|..+| T Consensus 194 --------~~~~~--------~~~p~s~~~ag~~a~~d~~~g~~~span~~l~gi~~~~~~~~~~~~~~~~~~~~Ln~~g 257 (391) T protein:vir:11 194 --------VVNQT--------VPAPAVAQALGLRARIDQEVGWHKTLSNVAVNGVTGISADVFWDLQSPSTDANYLNENE 257 (391) T ss_pred --------ccCce--------EEechHHHHHHHHHHhhccCCcEEccCCceeeceeecccccccccCCCcchhhhhhhcC Confidence 00001 01235566666665544322 3334567766653 224678999999999 Q ss_pred ceEEEEEcCceeeecCEecCCc----hhHHHHHHHHHHHHHHHHHHHHHHhcCCCCcCHhHHHHHHHHHHHHHHHHHhcC Q lcl|NC_019918. 
301 VTTFERVGGVNRTFGGAMAGGE----WIDVMIFVDWLEARMTERLWFRMANSKKIPYDAVGATILESEIRAQLNEGIRVG 376 (428) Q Consensus 301 ~n~y~~~~~~~~~~~G~~~~G~----~iD~~~~~dwl~~~lq~~l~~ll~~~~kip~~~~G~~~i~~~i~~~~~~~~~~G 376 (428) +|......| ..++.+++++++ ||-+.+-.+|+...|+..+...+-. |.++.-...|+..|+.-|++.++.| T Consensus 258 i~~~~~~~G-~~~wG~rT~~~d~~~~~i~vrR~~~~i~~~~~~~~~~~v~e----~n~~~~~~~i~~~i~~~l~~l~~~g 332 (391) T protein:vir:11 258 VTTLVQEGG-FRFWGSRTCSDDPLFAFENYTRTAQVLADTIAEAHMWAVDK----PMHPSLVRDILEGVNAKFRELKGLG 332 (391) T ss_pred cEEEEcCCC-EEEEcccccCCCcccceeehhhHHHHHHHHHHHHHHHhccC----CCCHHHHHHHHHHHHHHHHHHHhcc Confidence 999854322 467899998885 7778888899888888877755443 7788889999999999999999999 Q ss_pred ceecCCceEEEeCchHhCCHHHHhccccCCeEEEEEECceEEEEEEEEEEec Q lcl|NC_019918. 377 GLAEAPAPKVFVPDVLSMSPNMRAQRIFEGIEFEARLAGAIHFVHIRGTVTV 428 (428) Q Consensus 377 ~I~~g~~~~v~~~~~~~~~~~dra~R~~~~i~~~~~~agaih~v~i~~~v~~ 428 (428) .|. ||.+.+. .++.+++|+.+++.. +.+.+.+...++.|.++...+. T Consensus 333 ~l~---g~~~~~~-~~~n~~~~i~~G~~~-~~i~~~p~~p~e~i~~~~~~~~ 379 (391) T protein:vir:11 333 LII---DAQAWYD-PNVNDKDTLKAGKLR-ITYDYTPVPPLEDLTFFQKITD 379 (391) T ss_pred cee---ceEEEEe-cCCCCHHHhhCCeEE-EEEEEEecCCcceEEEEEEEch Confidence 997 4677764 567899999998888 9999999999999999999888 No 32 >protein:vir:79141 Length: 391 # NCBI annotation: major tail seath protein gpFI # Family: family:all:115 # MgeID: mge:1863 # MgeName: RSA1 # Cross-refs: genbank:acc:YP_001165276;genbank:gi:145708101;genbank:GeneID:5247152 Probab=98.97 E-value=4e-09 Score=66.61 Aligned_cols=350 Identities=11% Similarity=0.068 Sum_probs=195.1 Q ss_pred CCC-CCceEEE-eeeeecccccccccceEEEEcccCC------CccceEEeeCHHHHHhhcCCChHHHHHHHHHHhcCCc Q lcl|NC_019918. 1 MTV-LTDVIDI-QISRETAAVAQTNFNVPLFIASHTN------FSERARVYNSLKGVAEDFGESDPTYLAAVRYFGQALK 72 (428) Q Consensus 1 M~~-is~iV~V-~i~~~~~~~~~~~f~~~li~~~~~~------~~~~~~~y~s~~~V~~~fg~~s~eY~aA~~~F~q~p~ 72 (428) |+- ...=|.| .+.-.+.++.......+.|++.+.. +...-...++..+-...||...-.+.+-..+|.+.-. 
T Consensus 1 M~~~~~pGv~v~e~~~~~~~i~~~~tav~~~vg~a~~a~~~~~p~n~pv~iss~~~~~~~~g~~gtl~~al~~~~~~gg~ 80 (391) T protein:vir:79 1 MPTDYHHGVRVVELNDGTRPIRTIETAVAGIVCTADDADAATFPLDTPVLLTNPQAYIGKAGDKGTLAHTLDAITDQTNP 80 (391) T ss_pred CCCCCCCCeEEEECCCCcccccccCCceEEEEeecccccccccccccCEEeccHHHHHHhcCCccccchhhhhhhccccc Confidence 873 2221333 2233455677777778888876532 2223346788888888899988888888888887644 Q ss_pred ccEEEEE-eeecccccccchheeecccccccccceeeeeeecccchhhhhhhhheeeecccceEEEEeeccccceeeeec Q lcl|NC_019918. 73 PRSLVIG-RRQVPSATVSVSVVQEGQSYVLTVNGLPVSYVSHQDDTATLIATGLKAAYDVTPVVGVTVTDNEDGTLTVAS 151 (428) Q Consensus 73 P~~l~ig-r~~~~~~~~~~~~~~~~~~~~~~v~g~~~s~~~~~~~~a~~i~a~l~~a~~~~~~~~~~~tt~~~~~~t~as 151 (428) +..+... +........ ..+.|. ....+..+.+. .+........ T Consensus 81 ~~~vv~~~~~~~~~~~~------------~~~~g~-----~~~~~~~tGl~-~l~~~~~~~~------------------ 124 (391) T protein:vir:79 81 LTVVVRVAGGASEAETT------------SNLIGT-----TNAAGRYTGMK-ALLTARNRFG------------------ 124 (391) T ss_pred ceeeecccccccccccc------------cccccc-----ccchhhhHHHh-hhhhhhhhhc------------------ Confidence 4332221 111000000 000000 00000000000 0000000000 Q ss_pred cccccccccceEEEEeeccccCHHHHHHHHHhcccCce-EEEEecCCHHHHHHHHHHHhhhCCEEEEEecCcccccchhH Q lcl|NC_019918. 152 NGDWSLKVSSNLTMAAAPSTEGWPATITAVQGENDEWY-ALSIDSHADDDIMAVATHIEGTKKVFIGATAQANTKTSAEN 230 (428) Q Consensus 152 ~~~~~~~~s~~~~~~~~~aa~~~~~al~~~~~~~~~w~-~~~~~~~~~~~~~ala~~~~a~~~~~~~~~~~~~~~~~~~~ 230 (428) ....+.+..+........++..+. ..+. +.+++........+.-+|.+.-+..+......--. T Consensus 125 -------~~p~~l~~p~~~~~~v~~al~~~~---~~~~~~ai~d~p~~~t~~~a~~~~~~~~s~~~a~~~P~~~------ 188 (391) T protein:vir:79 125 -------VAPRILAVPGLDSLPVGTELVTIA---QKLRAFAYLSAYGCQTKEEAVAYRSNFGQREAMVMWPDFV------ 188 (391) T ss_pred -------ccchhhcCCccchhHHHHHHHHHH---hhcCcEEEEECCCCCCHHHHHHHHhccCCceeEEecceee------ Confidence 000000011111112222333222 2232 23333322222233445555433333222211000 Q ss_pred HHHHHHHhcccCceEEEecCC---ccchhHHHHHHHHHhccCCCc---eeeeeeeecCccc--------cCCCHHHHHHH Q lcl|NC_019918. 
231 DIASRLVAAGFQRTALIYHPN---ADAQFPECAWVGYQLQEQPGS---NTWTHKALAAVDA--------YRLTPTESTNL 296 (428) Q Consensus 231 ~~~~~l~~~~~~~t~~~y~~~---~~~~~~~a~~~~~~~~~~~g~---~t~~fk~~~Gv~~--------~~~t~t~~~~l 296 (428) +|++. .-...+.+.++|.....+.-. .....|.+.|+.. ...+.+|.+.| T Consensus 189 ----------------~~d~~~~~~~~~p~s~~~AG~~a~~D~~~g~~~spaN~~l~gi~~~~~~~~~~~~~~~~~~~~L 252 (391) T protein:vir:79 189 ----------------GWDTAANAETTLWATARAVGLRAKIDNDTGWHKTLSNVAVGGVTGLSRDVFWDLQDPATDAGYL 252 (391) T ss_pred ----------------eecCcCCceeeechHHHHHHHHHHhhhcccceeccCCceehhhhccccccccccccccchhhhh Confidence 11110 011234566666655554322 3334567777652 23466788899 Q ss_pred HhCCceEEEEEcCceeeecCEecCCc----hhHHHHHHHHHHHHHHHHHHHHHHhcCCCCcCHhHHHHHHHHHHHHHHHH Q lcl|NC_019918. 297 KNKNVTTFERVGGVNRTFGGAMAGGE----WIDVMIFVDWLEARMTERLWFRMANSKKIPYDAVGATILESEIRAQLNEG 372 (428) Q Consensus 297 ~~~~~n~y~~~~~~~~~~~G~~~~G~----~iD~~~~~dwl~~~lq~~l~~ll~~~~kip~~~~G~~~i~~~i~~~~~~~ 372 (428) ..+++|.+....| ..++.+++++++ ||-+.+-.+|+...|+..+...+-. |.++.-...|+..|+.-|++- T Consensus 253 n~~~I~t~~~~~G-~~~wG~rT~~~d~~~~~i~~rR~~~~i~~~i~~~~~~~v~e----pn~~~~~~~i~~~i~~~l~~l 327 (391) T protein:vir:79 253 NANEVTTLVHRDG-YRFWGSRTCSADPLFAFENYTRTAQVLADTMAEAHMWANDL----PMTPTLVRDLLEGINAKLRML 327 (391) T ss_pred hhcCceEEECCCc-EEEEcccccCCCcccceeehhhHHHHHHHHHHHHHHHhccC----CCCHHHHHHHHHHHHHHHHHH Confidence 9999999854322 467899999885 7888888899999998888865543 889999999999999999999 Q ss_pred HhcCceecCCceEEEeCchHhCCHHHHhccccCCeEEEEEECceEEEEEEEEEEec Q lcl|NC_019918. 373 IRVGGLAEAPAPKVFVPDVLSMSPNMRAQRIFEGIEFEARLAGAIHFVHIRGTVTV 428 (428) Q Consensus 373 ~~~G~I~~g~~~~v~~~~~~~~~~~dra~R~~~~i~~~~~~agaih~v~i~~~v~~ 428 (428) ++.|.|. ||++.+. .+..+++|+.+.+.. +.+.+.....+++|+++...+. 
T Consensus 328 ~~~g~l~---g~~v~~~-~~~nt~~~i~~G~~~-~~i~~~p~~p~e~i~~~~~~~~ 378 (391) T protein:vir:79 328 TRNGYLL---GGAAWFD-ADANSKDTLKAGQLA-IDYDYTPVPPLENLTFRQRITD 378 (391) T ss_pred HhCCcee---ceEEEEe-cCCCCHHHhhCCEEE-EEEEEEecCCcceEEEEEEEch Confidence 9999997 4677774 567899999998887 9999999999999999999888 No 33 >protein:vir:99306 Length: 587 # NCBI annotation: putative major tail sheath protein # Family: family:all:2449 # MgeID: mge:1655 # MgeName: K # Cross-refs: genbank:acc:YP_024479;genbank:gi:48696438;genbank:GeneID:2948034 Probab=98.88 E-value=2.1e-08 Score=62.62 Aligned_cols=409 Identities=13% Similarity=0.096 Sum_probs=216.6 Q ss_pred CC-------CCCce-EEEeeeee-cccccccccceEEEEcccC-CCccceEEeeCHHHHHhhcCCChHHHHHHHHHHhcC Q lcl|NC_019918. 1 MT-------VLTDV-IDIQISRE-TAAVAQTNFNVPLFIASHT-NFSERARVYNSLKGVAEDFGESDPTYLAAVRYFGQA 70 (428) Q Consensus 1 M~-------~is~i-V~V~i~~~-~~~~~~~~f~~~li~~~~~-~~~~~~~~y~s~~~V~~~fg~~s~eY~aA~~~F~q~ 70 (428) |+ |+.+= |-|.+.-+ ..+....+.+.+.|+|... .+++++..+++.++...-||... .-.+....|.|. T Consensus 1 ~a~~~~~~~~~~~pgv~~~~~~~~~~~~~~~~~~~~~~~g~a~~G~~~~~~~~~~~~~~~~~~~~g~-l~~~~~~a~~~~ 79 (587) T protein:vir:99 1 MAVEPFPRRPITRPHASIEVDTSGIGGSAGSSEKVFCLIGQAEGGEPNTVYELRNYSQAKRLFRSGE-LLDAIELAWGSN 79 (587) T ss_pred CcccccCCcccccCceEEEEecCCccccCCCCCceEEEEEEecCCccceeEEeccHHHHHHHhcCcc-hHHHHHHHhccc Confidence 55 33332 22322222 2455667778888888764 56788888999999999998844 556677788764 Q ss_pred C--cccEEEEEeeec-ccccccchhee-----------------------ecc-----------c---------cccccc Q lcl|NC_019918. 71 L--KPRSLVIGRRQV-PSATVSVSVVQ-----------------------EGQ-----------S---------YVLTVN 104 (428) Q Consensus 71 p--~P~~l~igr~~~-~~~~~~~~~~~-----------------------~~~-----------~---------~~~~v~ 104 (428) + .++++++.|-.. ..+.++...++ ... . +.++.. 
T Consensus 80 ~~~g~~~~~~~rv~~~~~a~~~~~~l~~~a~~~G~~gN~i~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~v~~i~y~ 159 (587) T protein:vir:99 80 PNYTAGRILAMRIEDAKPASAEIGGLKITSKIYGNVANNIQVGLEKNTLSDSLRLRVIFQDDRFNEVYDNIGNIFTIKYK 159 (587) T ss_pred cCCCceEEEEEEcCCCceeEEEecCeEEEEeeccccccceEEEEccCCCCcceeEEEEEecccceeeeeeccceeeEEee Confidence 3 567788776421 11111111110 000 0 000000 Q ss_pred ceeeeeeec--------------------------c-cc---hhhhhhhhh------eeeecccce-EEE---------- Q lcl|NC_019918. 105 GLPVSYVSH--------------------------Q-DD---TATLIATGL------KAAYDVTPV-VGV---------- 137 (428) Q Consensus 105 g~~~s~~~~--------------------------~-~~---~a~~i~a~l------~~a~~~~~~-~~~---------- 137 (428) |...+.... + ++ ++......+ ++.+.+... .+. T Consensus 160 g~~~~a~~~v~~~~~t~~a~~~~l~~g~~~v~~yrL~~g~~~~~~~~~~~i~~~~~~tAky~~~~~~~i~~~~~~~~~~~ 239 (587) T protein:vir:99 160 GEEANATFSVEHDEETQKASRLVLKVGDQEVKSYDLTGGAYDYTNAIITDINQLPDFEAKLSPFGDKNLESSKLDKIENA 239 (587) T ss_pred cccccceeeEeecCcceeeeeeeeecCCceeEEEEecCCchHHHHHHHhhhccccceeEEeeccCCceeEeecccccccc Confidence 100000000 0 00 000000000 111110000 000 Q ss_pred Eeecc--------cc--------cee--eeecc-----------------------ccccccccceEEEEee---ccccC Q lcl|NC_019918. 138 TVTDN--------ED--------GTL--TVASN-----------------------GDWSLKVSSNLTMAAA---PSTEG 173 (428) Q Consensus 138 ~~tt~--------~~--------~~~--t~as~-----------------------~~~~~~~s~~~~~~~~---~aa~~ 173 (428) ...+. .+ ... +...+ ............+..+ ....+ T Consensus 240 ~v~~~~~~v~a~~~D~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~t~LtGG~dG~~~~s 319 (587) T protein:vir:99 240 NIKDKAVYVKAVFGDLEKQTAYNGIVSFEQLNAEGEVPSNVEVEAGEESATVTATSPIKTIEPFELTKLKGGTNGEPPAT 319 (587) T ss_pred eeeeeeeeeehhccceeeecccceeeeeeecccccchhhhhhhhhccccceeeeeccccceecccceeeecCCCCCcccc Confidence 00000 00 000 00000 0000000011112232 23456 Q ss_pred HHHHHHHHHhcccCceEEEEecCCHHHHHHHHHHHhhh---CCEEEEEecCcccccchhHHHHHHHHhcccCceEEEecC Q lcl|NC_019918. 
174 WPATITAVQGENDEWYALSIDSHADDDIMAVATHIEGT---KKVFIGATAQANTKTSAENDIASRLVAAGFQRTALIYHP 250 (428) Q Consensus 174 ~~~al~~~~~~~~~w~~~~~~~~~~~~~~ala~~~~a~---~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~t~~~y~~ 250 (428) ..++++++... +|+.+.+...+.+-+.++..|++.. .+....+..... ..+...+....+..+++|.+.+... T Consensus 320 y~~al~ale~~--~~~~i~~~t~d~~i~a~l~a~vk~~r~~g~~~~aVlg~~~--~~~~~~~~~~a~~~n~e~vi~v~~~ 395 (587) T protein:vir:99 320 WADKLDKFAHE--GGYYIVPLSSKQSVHAEVASFVKERSDAGEPMRAIVGGGF--NESKEQLFGRQASLSNPRVSLVANS 395 (587) T ss_pred HHHHHHHHhhC--CcEEEEecCCCHHHHHHHHHHHHHHHhCCCcEEEEecCCC--CCCHHHHHHHhhhcCCCcEEEEecc Confidence 78899988764 5666655555556667899998753 233333322211 1223334445567788887655332 Q ss_pred ------C-----ccchhHHHHHHHHHhccCCCceeeeeeeecCccc-cCCCHHHHHHHHhCCceEEEEEcCce----eee Q lcl|NC_019918. 251 ------N-----ADAQFPECAWVGYQLQEQPGSNTWTHKALAAVDA-YRLTPTESTNLKNKNVTTFERVGGVN----RTF 314 (428) Q Consensus 251 ------~-----~~~~~~~a~~~~~~~~~~~g~~t~~fk~~~Gv~~-~~~t~t~~~~l~~~~~n~y~~~~~~~----~~~ 314 (428) + .+....+++++|......+. ..+.||.++++.. ..++.+|++.+.++|++.+....+.. .+- T Consensus 396 ~~~~~~dg~~~~~~~~~~aa~vAGl~Ag~~~~-~SlT~~~i~~~~v~~~~t~~e~e~li~~Gvl~l~~~~~~~~~~vriv 474 (587) T protein:vir:99 396 GTFVMDDGRKNHVPAYMVAVALGGLASGLEIG-ESITFKPLRVSSLDQIYESIDLDELNENGIISIEFVRNRTNTFFRIV 474 (587) T ss_pred ceEecCCCceeeechHHHHHHHHHHHhcCchh-cCccceeeecccccccCCHHHHHHHHhCCeEEEEEecCCcceEEEEe Confidence 0 11233456777777777654 3444555553332 36999999999999999987654432 223 Q ss_pred cCEecC----C-ch--hHHHHHHHHHHHHHHHHHHHHHHhcCCCCcCHhHHHHHHHHHHHHHHHHHhcCceecCCceEEE Q lcl|NC_019918. 315 GGAMAG----G-EW--IDVMIFVDWLEARMTERLWFRMANSKKIPYDAVGATILESEIRAQLNEGIRVGGLAEAPAPKVF 387 (428) Q Consensus 315 ~G~~~~----G-~~--iD~~~~~dwl~~~lq~~l~~ll~~~~kip~~~~G~~~i~~~i~~~~~~~~~~G~I~~g~~~~v~ 387 (428) +|++.- + .| |-.++-.|.+...++..+-+.++- | |=++.|...|++.|++.|++..+.|.|...+.-.+. 
T Consensus 475 ~~ItT~t~~~~~~~~~i~viRv~D~i~~di~~~~~~~yiG--k-~Nn~~~r~~i~~~i~~~L~~l~~~gaI~~~~~~dv~ 551 (587) T protein:vir:99 475 DDVTTFNDKSDPVKAEMAVGEANDFLVSELKVQLEDQFIG--T-RTINTSASIIKDFIQSYLGRKKRDNEIQDFPAEDVQ 551 (587) T ss_pred eceeeccCCCCchhhhhhhhhhHHHHHHHHHHHHHhhCCc--c-ccchHHHHHHHHHHHHHHHHHHhCCcccCCCccceE Confidence 454431 1 24 568888899988888887766654 3 567889999999999999999999999743211122 Q ss_pred eCchHhCCHHHHhccccCCeEEEEEECceEEEEEEEEEEec Q lcl|NC_019918. 388 VPDVLSMSPNMRAQRIFEGIEFEARLAGAIHFVHIRGTVTV 428 (428) Q Consensus 388 ~~~~~~~~~~dra~R~~~~i~~~~~~agaih~v~i~~~v~~ 428 (428) .. ..+|+ --+++.+++--++++|.+++.+.- T Consensus 552 v~-----~~~d~-----~~v~~~v~Pv~~mekIy~tv~~~~ 582 (587) T protein:vir:99 552 VI-----VEGNE-----ARISMTVYPIRSFKKISVSLVYKQ 582 (587) T ss_pred EE-----ecCCE-----EEEEEEEEEcccceEEEEEEEEEe Confidence 21 11222 237889999999999999888866 No 34 >protein:vir:96586 Length: 587 # NCBI annotation: ORF011 # Family: family:all:2449 # MgeID: mge:1623 # MgeName: Twort # Cross-refs: genbank:acc:YP_238553;genbank:gi:66391266;genbank:GeneID:5130368 Probab=98.86 E-value=2.6e-08 Score=62.15 Aligned_cols=404 Identities=12% Similarity=0.042 Sum_probs=184.3 Q ss_pred CCC-CCceEEEeeeeecccccccccceEEEEcccC---CCccceE--Eee--CHHHHHhhcCCChHHHHHHHHHHhcCCc Q lcl|NC_019918. 1 MTV-LTDVIDIQISRETAAVAQTNFNVPLFIASHT---NFSERAR--VYN--SLKGVAEDFGESDPTYLAAVRYFGQALK 72 (428) Q Consensus 1 M~~-is~iV~V~i~~~~~~~~~~~f~~~li~~~~~---~~~~~~~--~y~--s~~~V~~~fg~~s~eY~aA~~~F~q~p~ 72 (428) -+. -.+-++|.+.-.+.+.. ..|-.....+... +.-+.+. .|. +..++..-|+. 
+..|.|....+.+.+ T Consensus 111 ~~g~~~n~i~v~~~~~~~~~~-~~~~~~~~~~~~~~~~~n~G~v~~i~y~g~~~~a~~~~~~~-~~~~~A~~l~l~gg~- 187 (587) T protein:vir:96 111 IFGSVSNDIQVALEKNTITDS-LRLRVVFQKDNYQEVFDNLGNIFSINYKGEGEKATFSVEKD-KETQEAKRLVLKVDE- 187 (587) T ss_pred ccCCCCceEEEEEEeccCCCc-cceEEEEecCCceeeccccCceEEEEecccccceeEeeccC-cccceeeeeEEEecC- Confidence 111 23334444432222222 2221111111110 0011111 111 11122222332 222333333333322 Q ss_pred ccEEEEEeeecccccccc---hheeec---ccccccccceeeeeeec------------ccchhh--hhhhhh------- Q lcl|NC_019918. 73 PRSLVIGRRQVPSATVSV---SVVQEG---QSYVLTVNGLPVSYVSH------------QDDTAT--LIATGL------- 125 (428) Q Consensus 73 P~~l~igr~~~~~~~~~~---~~~~~~---~~~~~~v~g~~~s~~~~------------~~~~a~--~i~a~l------- 125 (428) ..|+-.|-......... ..+... ..-....-+-++..... ....+. .+.... T Consensus 188 -~~v~~yrl~~g~~~~~~~~~~~~~~~~~~tAky~g~~~n~~~v~v~d~~~~~~~k~~~~y~~t~~~di~~~~~~~~~~~ 266 (587) T protein:vir:96 188 -KEVKAYELNGGAYSFTNEIITDINELPDFEAKLSPFGDKNLESRKLDEATDVDIKGKAVYVKAVFGDIENQTQYNQYVK 266 (587) T ss_pred -ceEEEEEeCCCchhhhhhhhhhhccccceEEEeecccCceeEEEeeccccccccceEEEeehhhhhhhhhhhcccccee Confidence 23444443211100000 000000 00000000101100000 000000 000000 Q ss_pred ----eeeecccceEEEEeeccccceeeeeccccccccccceEEEE---eeccccCHHHHHHHHHhcccCceEEEEecCCH Q lcl|NC_019918. 126 ----KAAYDVTPVVGVTVTDNEDGTLTVASNGDWSLKVSSNLTMA---AAPSTEGWPATITAVQGENDEWYALSIDSHAD 198 (428) Q Consensus 126 ----~~a~~~~~~~~~~~tt~~~~~~t~as~~~~~~~~s~~~~~~---~~~aa~~~~~al~~~~~~~~~w~~~~~~~~~~ 198 (428) ............+.... ...+....+...........+. .+....+..++++++..+ +|+.+.+...+. 
T Consensus 267 ~~~~~~~~~~~~~~~v~~~~~--~~~~~~~~~~~~~~~~~~~aLtGG~dG~~~~~y~~~l~ale~~--~~~~i~~~t~d~ 342 (587) T protein:vir:96 267 FEQLPEQASEPSDVEVHAETE--SATVTATSKPKAIEPFELTKLSGGTNGEPPTSWSAKLEKFKNE--GGYYIVPLTDRQ 342 (587) T ss_pred eccccchhhhhhccccccccc--ceeeeecccccccccccceeeecCCCCCCcccHHHHHHHHhhC--CcEEEEecCCCH Confidence 00000000000000000 0000000000000001111122 233345678899998765 566666655566 Q ss_pred HHHHHHHHHHhhh---CCEEEEEecCcccccchhHHHHHHHHhcccCceEEEecC------C-----ccchhHHHHHHHH Q lcl|NC_019918. 199 DDIMAVATHIEGT---KKVFIGATAQANTKTSAENDIASRLVAAGFQRTALIYHP------N-----ADAQFPECAWVGY 264 (428) Q Consensus 199 ~~~~ala~~~~a~---~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~t~~~y~~------~-----~~~~~~~a~~~~~ 264 (428) +.+.++.+|++.. .+.+..+..... ..+.+.+....+..+++|.+.+.+. + .+....+++++|. T Consensus 343 ai~~~l~a~vk~~r~~gk~~~aVlg~~~--~~~~~~~~~~a~~~n~e~vi~v~~~~~~~~~~~~~~~~~~~~~aa~vAG~ 420 (587) T protein:vir:96 343 SVHSEVATFVKNRSDAGEPMRAIVGGGT--SETKEKLFGRQAILNNPRVALVANSGKFVMGNGRILQAPAYMVASAVAGL 420 (587) T ss_pred HHHHHHHHHHHHHHhCCCeEEEEecCCC--CCCHHHHHHHHhhcCCCcEEEEecceEEecCCCceeeechhhHHHHHHHH Confidence 6677899999753 333433332211 1233444556677788887665442 1 1123345677777 Q ss_pred HhccCCCceeeeeeeecCccc-cCCCHHHHHHHHhCCceEEEEEcCcee----eecCEecCC-------chhHHHHHHHH Q lcl|NC_019918. 265 QLQEQPGSNTWTHKALAAVDA-YRLTPTESTNLKNKNVTTFERVGGVNR----TFGGAMAGG-------EWIDVMIFVDW 332 (428) Q Consensus 265 ~~~~~~g~~t~~fk~~~Gv~~-~~~t~t~~~~l~~~~~n~y~~~~~~~~----~~~G~~~~G-------~~iD~~~~~dw 332 (428) .....+. ..+.||.++++.. ..++.+|++.+.++|+.++....+... .-++.+.-. ..|-.++-.|. 
T Consensus 421 ~Ag~~~~-~S~T~~~~~~~~v~~~~t~~e~~~~i~~G~~~l~~~~~~~~~v~~~vnsitT~t~~~~~~~~~i~virv~D~ 499 (587) T protein:vir:96 421 VSGLDIG-ESITFKPLFVNSLDKVYESEELDELNENGIITIEFVRNRMTTMFRIVDDVTTFPDKNDPVKSEMALGEANDF 499 (587) T ss_pred HhcCccc-cCccceeeecccccccCCHHHHHHHHhCCeEEEEEecCCcEEEEEeeccceecCCCCCchhhhhhhHHHHHH Confidence 7766653 4445666654432 369999999999999999987655421 223444321 14667888888 Q ss_pred HHHHHHHHHHHHHHhcCCCCcCHhHHHHHHHHHHHHHHHHHhcCceecCCceEEEeCchHhCCHHHHhccccCCeEEEEE Q lcl|NC_019918. 333 LEARMTERLWFRMANSKKIPYDAVGATILESEIRAQLNEGIRVGGLAEAPAPKVFVPDVLSMSPNMRAQRIFEGIEFEAR 412 (428) Q Consensus 333 l~~~lq~~l~~ll~~~~kip~~~~G~~~i~~~i~~~~~~~~~~G~I~~g~~~~v~~~~~~~~~~~dra~R~~~~i~~~~~ 412 (428) +...++..+-+.++- | |=++.|...|++.|++.|++..+.|.|...+.-.+.+. ..+|+ --+.+.++ T Consensus 500 i~~di~~~~~~~yiG--k-~nn~~~r~~v~~~i~~~L~~l~~~g~I~~~~~~dv~v~-----~~~D~-----~~v~~~v~ 566 (587) T protein:vir:96 500 LVSELKILLEEQYIG--T-RTINTSASQIKDFVQSYLGRKKRDNEIQDFPPEDVQVI-----IEGNE-----ARISLTIF 566 (587) T ss_pred HHHHHHHHHHhcCCc--c-ccCHHHHHHHHHHHHHHHHHHHhCCcccCCCccceEEE-----ecCCE-----EEEEEEEE Confidence 888887776655544 4 56889999999999999999999999974321112211 11222 23788999 Q ss_pred ECceEEEEEEEEEEec Q lcl|NC_019918. 413 LAGAIHFVHIRGTVTV 428 (428) Q Consensus 413 ~agaih~v~i~~~v~~ 428 (428) +.-++++|.+++++.- T Consensus 567 Pv~~mekIy~tv~~~~ 582 (587) T protein:vir:96 567 PIRALKKISVSLVYRQ 582 (587) T ss_pred EcccceEEEEEEEEEe Confidence 9999999999888866 No 35 >protein:vir:107310 Length: 581 # NCBI annotation: gp123 # Family: family:all:1196 # MgeID: mge:1550 # MgeName: Catera # Cross-refs: genbank:acc:YP_656131;genbank:gi:109393333;genbank:GeneID:4156791 Probab=98.81 E-value=3.5e-08 Score=61.46 Aligned_cols=394 Identities=12% Similarity=0.062 Sum_probs=175.9 Q ss_pred CCCCCceEEEeeeeeccccccc----ccceEEEEcccCCC--ccceEEeeCHHHHHh-hc-----CCChHHHH-----HH Q lcl|NC_019918. 
1 MTVLTDVIDIQISRETAAVAQT----NFNVPLFIASHTNF--SERARVYNSLKGVAE-DF-----GESDPTYL-----AA 63 (428) Q Consensus 1 M~~is~iV~V~i~~~~~~~~~~----~f~~~li~~~~~~~--~~~~~~y~s~~~V~~-~f-----g~~s~eY~-----aA 63 (428) ++.|. -.+|+++=.+...-.. +|+.+-........ .-.+..-.....+.. .+ |..+.... .+ T Consensus 103 L~~i~-~~~v~v~g~~g~~~~VtF~g~~~~l~~~~~~lt~g~~~~vtV~~~~~g~~~~~~~~s~~gi~~~~~~l~~~~~~ 181 (581) T protein:vir:10 103 LPNVE-DDEVTVLGDPGGPWTVTFTKAVAALTKDVTGLTGGDDPDLNIASEQTGVPAMNRALAKKGIKTDTIRVVNPNSG 181 (581) T ss_pred cCCCC-cceEEEECCCCceEEEEEcCCccceeeeeceecCCCceeEEEeccccCcccccccccccccccccccccccccC Confidence 44443 1334333111111111 11111110000000 001111111111100 00 00000000 00 Q ss_pred HHHHhcCCcccEEEEEeeecc-cccccchheeecccccccccceeeeeeecccchhhhhhhhheee-ecccceEEEEeec Q lcl|NC_019918. 64 VRYFGQALKPRSLVIGRRQVP-SATVSVSVVQEGQSYVLTVNGLPVSYVSHQDDTATLIATGLKAA-YDVTPVVGVTVTD 141 (428) Q Consensus 64 ~~~F~q~p~P~~l~igr~~~~-~~~~~~~~~~~~~~~~~~v~g~~~s~~~~~~~~a~~i~a~l~~a-~~~~~~~~~~~tt 141 (428) ..++.++ .+-+++-+.. .+..... .+.......+.|.-... . .+ ..+... .|....-...+.. T Consensus 182 ~~~~~gs----d~~~~~~~~~~~~~~~~~--~D~~t~~~~~~g~~~~~-------~-~v-~~~~~~~~d~~~~~~v~~~~ 246 (581) T protein:vir:10 182 QVYVLGT----DYVVTRVNAGEDGEANTR--DDLYTIQRVVDGGHIDP-------G-DI-VQLSYRYTDPNYHEVIRFTD 246 (581) T ss_pred cceeccc----cceeeecccCcccccccc--ccceeeeeeeccccccc-------c-eE-EEEEEEeecCCcceeEEeec Confidence 0000000 0111110000 0000000 00000011111100000 0 00 000000 0111000000100 Q ss_pred c----------------ccceeeeeccccccccccceEEEEeec-------cccCHHHHHHHHHhcccCceEEEEecCCH Q lcl|NC_019918. 142 N----------------EDGTLTVASNGDWSLKVSSNLTMAAAP-------STEGWPATITAVQGENDEWYALSIDSHAD 198 (428) Q Consensus 142 ~----------------~~~~~t~as~~~~~~~~s~~~~~~~~~-------aa~~~~~al~~~~~~~~~w~~~~~~~~~~ 198 (428) . ..+.++....... .......+..+. ..+++.++++++.++. .+..+ +...+. 
T Consensus 247 ~~~~~~~~~~~~~~~g~~~~~~t~~~~~~~--tn~~~~~l~~gvd~~g~tvt~~dy~~Al~ale~~~-~~~iv-v~~t~~ 322 (581) T protein:vir:10 247 PDDIQDFYGPAFDEAGNVQSEITLCAQLAI--TNGASTILACAVDPEGDTVTMGDYQNALNKFRDED-EIAII-VAGTGA 322 (581) T ss_pred CcchhhhhhhhhhccCccccchhhhheeee--ecccceeEEeeccCCCCccchHHHHHHHHHHhcCC-ceEEE-EeCCCC Confidence 0 0011110000000 001111122221 2235778888887653 23333 433444 Q ss_pred HH-HHHHHHHHhhhC---C-EE-EEEecCcccccchhHHHHHHHHhcccCceEEEecC------C-------ccchhHHH Q lcl|NC_019918. 199 DD-IMAVATHIEGTK---K-VF-IGATAQANTKTSAENDIASRLVAAGFQRTALIYHP------N-------ADAQFPEC 259 (428) Q Consensus 199 ~~-~~ala~~~~a~~---~-~~-~~~~~~~~~~~~~~~~~~~~l~~~~~~~t~~~y~~------~-------~~~~~~~a 259 (428) +. +.+|..|++... + ++ ....... .......+.....+..+..|.+.+++. . -+..+.+| T Consensus 323 ~~v~a~l~ahv~~~s~~~~~~ravigV~g~-~~~~~~~~~~~~a~~~n~~Rvvlv~p~~~~~~g~~~~~~v~lp~y~~AA 401 (581) T protein:vir:10 323 QPIQALVQQHVSAQSNNKYERRAILGMDGS-VTPVPSATRIANAQSIKDQRVALISPSSFVYYAPELNREVVLGGQFMAA 401 (581) T ss_pred HHHHHHHHHHHHHHHhccCCcEEEEEecCC-CCCccHHHHHHhhccCCCceEEEEecCceeecCcccCceeccchhhHHH Confidence 44 456888876531 2 22 2222111 111112222233445567777766532 1 12334566 Q ss_pred HHHHHHhccCCCceeeeeeeecCccc--cCCCHHHHHHHHhCCceEEEEEcCce-eeecCEecC-----CchhHHHHHHH Q lcl|NC_019918. 260 AWVGYQLQEQPGSNTWTHKALAAVDA--YRLTPTESTNLKNKNVTTFERVGGVN-RTFGGAMAG-----GEWIDVMIFVD 331 (428) Q Consensus 260 ~~~~~~~~~~~g~~t~~fk~~~Gv~~--~~~t~t~~~~l~~~~~n~y~~~~~~~-~~~~G~~~~-----G~~iD~~~~~d 331 (428) +++|......+ ...+-||.++|+.. ..++.+|++.|.++|++.+....+.. 
.+-+|++.- .+.|-.++-.| T Consensus 402 ~vAGl~a~~~~-~~slT~~~i~gi~~l~~~~s~~e~e~ll~~Gv~~l~~~~~~~v~Iv~gItT~~s~~~~~~i~~iR~~D 480 (581) T protein:vir:10 402 AVAGKSVSAIA-AMPLTRKVIRGFSGPAEVQRDGEKSRESSEGLMVIEKTPRNLVHVRHGVTTDPTSLHTREWNIIGQQD 480 (581) T ss_pred HHHHHhhcccc-ccCcccccccccccccccCCHHHHHHHHhCCeEEEEEecCCeEEEEeeeecCCCCCcceeeeeehhhh Confidence 77777777766 46788999998874 46899999999999999999866554 355676542 24577899999 Q ss_pred HHHHHHHHHHH-HHHHhcCCCCcCHhHHHHHHHHHHHHHHHHHhcCceecCCceEEEeCchHhCCHHHHhccccCCeEEE Q lcl|NC_019918. 332 WLEARMTERLW-FRMANSKKIPYDAVGATILESEIRAQLNEGIRVGGLAEAPAPKVFVPDVLSMSPNMRAQRIFEGIEFE 410 (428) Q Consensus 332 wl~~~lq~~l~-~ll~~~~kip~~~~G~~~i~~~i~~~~~~~~~~G~I~~g~~~~v~~~~~~~~~~~dra~R~~~~i~~~ 410 (428) .+...+++.+. ..|+- + |=++.|.+.|++.+++.|.+.+++|.|...+..+. +..++.. ..--+.|. T Consensus 481 ~v~~~ir~~~~~~~fIG--~-~n~~~~r~~ik~~i~~~L~~l~~~g~I~~~~~~~~--------~~~~~~~-d~v~V~i~ 548 (581) T protein:vir:10 481 VMVYRIRDYLDADGLIG--M-PIYDTTIVQVKASAEAALVWLVDNNIIRGYRNLKA--------RQIERQP-DVIEVRYE 548 (581) T ss_pred HHHHHHHHHhhhhcCCC--c-ccCHHHHHHHHHHHHHHHHHHHhcCcccCCcccee--------eeeecCC-CEEEEEEE Confidence 99999998886 34553 4 77889999999999999999999999985332221 2222222 22348899 Q ss_pred EEECceEEEEEEEEEEec Q lcl|NC_019918. 411 ARLAGAIHFVHIRGTVTV 428 (428) Q Consensus 411 ~~~agaih~v~i~~~v~~ 428 (428) +.+..+|++|.++..++= T Consensus 549 v~Pv~~i~~I~vti~~~p 566 (581) T protein:vir:10 549 WRPAYPLNYIVVRYSIAP 566 (581) T ss_pred EEecccceEEEEEEEEec Confidence 999999999999877765 No 36 >protein:vir:100323 Length: 393 # NCBI annotation: tail sheath protein FI # Family: family:all:115 # MgeID: mge:1484 # MgeName: phi-MhaA1-PHL101 # Cross-refs: genbank:acc:YP_655492;genbank:gi:109289960;genbank:GeneID:4157366 Probab=98.80 E-value=4.5e-08 Score=60.88 Aligned_cols=351 Identities=11% Similarity=0.108 Sum_probs=189.4 Q ss_pred CCCCCce---EEE-eeeeecccccccccceEEEEcccCCC------ccceEEeeCHHHHHhhcCCChHHHHHHHHHHhcC Q lcl|NC_019918. 
1 MTVLTDV---IDI-QISRETAAVAQTNFNVPLFIASHTNF------SERARVYNSLKGVAEDFGESDPTYLAAVRYFGQA 70 (428) Q Consensus 1 M~~is~i---V~V-~i~~~~~~~~~~~f~~~li~~~~~~~------~~~~~~y~s~~~V~~~fg~~s~eY~aA~~~F~q~ 70 (428) ||=.+.+ |.| .+.-.+.++...+...+.|+|.+... -..-...++..+....||.....+.+-..+|.|. T Consensus 1 m~m~~~~~~GV~v~e~~~g~~~i~~~~tav~~~vgta~~~~~~~~pln~pv~i~s~~~~~~~~g~~g~L~~al~~~~~~~ 80 (393) T protein:vir:10 1 MSILDTYLHGVEVVEVNAGGVTISTAATSVIGVVCTGDQADAETFPLNTPVLITNPLNYLEKAGSTGTLRRTLNSIGSIV 80 (393) T ss_pred CCCCCccCCCeEEEEcCCCcceecccCcceeEEEeeccCcCcccccCccceEecchHHHHHhhCCccchhhhhhhhhccc Confidence 5532322 222 33344556667777778788765432 2233455777888888999999999999999876 Q ss_pred CcccEEEEEeeecccccccchheeecccccccccceeeeeeecccchhhhhhhhheeeecccceEEEEeeccccceeeee Q lcl|NC_019918. 71 LKPRSLVIGRRQVPSATVSVSVVQEGQSYVLTVNGLPVSYVSHQDDTATLIATGLKAAYDVTPVVGVTVTDNEDGTLTVA 150 (428) Q Consensus 71 p~P~~l~igr~~~~~~~~~~~~~~~~~~~~~~v~g~~~s~~~~~~~~a~~i~a~l~~a~~~~~~~~~~~tt~~~~~~t~a 150 (428) ..+. ++-+....... ..+ ...+.|..- ....+.+ ..+........ T Consensus 81 ~~~~--~vv~v~~~~~~-~~t--------~~~iig~~~------~~~~tgl-~al~~~~~~~~----------------- 125 (393) T protein:vir:10 81 KTPT--VIVRVAESDDS-DTL--------TANIVGTQE------NGKFTGI-KALLTAQSTVF----------------- 125 (393) T ss_pred CceE--EEeecccCccc-ccc--------ccccccccc------cchhhHH-HHHHhhhhhcc----------------- Confidence 4222 22221111000 000 000000000 0000000 11111000000 Q ss_pred ccccccccccceEEEEeeccccCHHHHHHHHHhcccCceEEEEe--cCCHHHHHHHHHHHhhhCCEEEEEecCcccccch Q lcl|NC_019918. 151 SNGDWSLKVSSNLTMAAAPSTEGWPATITAVQGENDEWYALSID--SHADDDIMAVATHIEGTKKVFIGATAQANTKTSA 228 (428) Q Consensus 151 s~~~~~~~~s~~~~~~~~~aa~~~~~al~~~~~~~~~w~~~~~~--~~~~~~~~ala~~~~a~~~~~~~~~~~~~~~~~~ 228 (428) ....+.+..+........++..+.+.-..- ++..+ ..+.+ +.-.|.+.-+..+......--..... 
T Consensus 126 --------~~p~li~apg~~~~~~~~al~~~~~~~~~~-~~v~d~~~~t~~---~ai~~~~~~~s~~~~~~~P~~~~~d~ 193 (393) T protein:vir:10 126 --------VKPKLLCVPQHDNQAVATELLSVAKKLNAF-AFISDNGATTKE---QAYTYRQNFSQREGMMIFGDWKSYNT 193 (393) T ss_pred --------eeeeeeeeccccchHHHHHHHHHhhccCcE-EEEEcCCCCCHH---HHHHHhhhcCCceEEEEecccccccc Confidence 001111222222222333444443332222 22222 22333 33355554333332221110000000 Q ss_pred hHHHHHHHHhcccCceEEEecCCccchhHHHHHHHHHhccCCCc---eeeeeeeecCcccc--------CCCHHHHHHHH Q lcl|NC_019918. 229 ENDIASRLVAAGFQRTALIYHPNADAQFPECAWVGYQLQEQPGS---NTWTHKALAAVDAY--------RLTPTESTNLK 297 (428) Q Consensus 229 ~~~~~~~l~~~~~~~t~~~y~~~~~~~~~~a~~~~~~~~~~~g~---~t~~fk~~~Gv~~~--------~~t~t~~~~l~ 297 (428) ..+. . .. .++.+.++|.....++.. ..-..|.+.|+..- .++++|++.|. T Consensus 194 ---------~~~~--~-~~-------~p~s~~~Ag~~a~~d~~~G~~~spaN~~l~gi~~~~~~~~~~~~~~~~~~~~Ln 254 (393) T protein:vir:10 194 ---------DKKA--Y-DT-------DYAVARACALQAYIDKTVGWHKNISNVELDGVTGITKAVEFDINESSTEANYLN 254 (393) T ss_pred ---------cCCc--e-eE-------eehhHHHHHHHHHhhcCCCcEEccCCceeeceeecceecccccCCCcchhHhHh Confidence 0000 0 11 224455555555444322 34456677776632 34688999999 Q ss_pred hCCceEEEEEcCceeeecCEecCCc----hhHHHHHHHHHHHHHHHHHHHHHHhcCCCCcCHhHHHHHHHHHHHHHHHHH Q lcl|NC_019918. 
298 NKNVTTFERVGGVNRTFGGAMAGGE----WIDVMIFVDWLEARMTERLWFRMANSKKIPYDAVGATILESEIRAQLNEGI 373 (428) Q Consensus 298 ~~~~n~y~~~~~~~~~~~G~~~~G~----~iD~~~~~dwl~~~lq~~l~~ll~~~~kip~~~~G~~~i~~~i~~~~~~~~ 373 (428) .+|+|++....| ..++.+++++++ |+-+.+-.+|++..|+..+..++- + |.++.=...++..++.-|+.-+ T Consensus 255 ~~gI~t~~~~~G-~~~wG~rT~s~d~~~~~i~vrR~~~~i~~~i~~~~~~~v~---e-~~~~~~~~~i~~~i~~~L~~l~ 329 (393) T protein:vir:10 255 EKGITICLNHNG-FRYWGSRTLATDTRWAFQQSVRTAQIIKETIGAGLAWAVD---M-PLTPLRVKTMLEAINNKLRSWA 329 (393) T ss_pred hcCceEEEcCCC-EEEEcccccCCCcccceeehhhHHHHHHHHHHHHHHHhcc---C-CCCHHHHHHHHHHHHHHHHHHH Confidence 999999854322 457888998874 677888888888888877776443 3 7888889999999999999888 Q ss_pred hcC--ceecCCceEEEeCchHhCCHHHHhccccCCeEEEEEECceEEEEEEEEEEec Q lcl|NC_019918. 374 RVG--GLAEAPAPKVFVPDVLSMSPNMRAQRIFEGIEFEARLAGAIHFVHIRGTVTV 428 (428) Q Consensus 374 ~~G--~I~~g~~~~v~~~~~~~~~~~dra~R~~~~i~~~~~~agaih~v~i~~~v~~ 428 (428) +.| .|. |+.+...+ +.+++|..+.+.. +.+.+.....+++|+++...+. T Consensus 330 ~~g~~al~---g~~v~~~~--~nt~~~i~~G~~~-~~i~~~p~~p~e~I~~~~~~~~ 380 (393) T protein:vir:10 330 SGDDPRIL---GARVWVAE--EITADIIKSGKFV-IKYDYHWIPSLESLGLEQRVND 380 (393) T ss_pred hccccccc---cceEEecC--CCCHHHhhCCEEE-EEEEEEecCCcceEEEEEEEch Confidence 766 343 46776654 4788898888877 8999999999999999999987 No 37 >protein:vir:78986 Length: 436 # NCBI annotation: putative sheath tail protein # Family: family:all:632 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110731;genbank:gi:134287348;genbank:GeneID:4955160 Probab=98.74 E-value=7e-08 Score=59.80 Aligned_cols=384 Identities=12% Similarity=0.052 Sum_probs=203.7 Q ss_pred CCC-----CCc-----eEEEeee-eecccccccccceEEEEcccCCCccceEEeeC---HHHHHhhcCCChH--HHHHHH Q lcl|NC_019918. 1 MTV-----LTD-----VIDIQIS-RETAAVAQTNFNVPLFIASHTNFSERARVYNS---LKGVAEDFGESDP--TYLAAV 64 (428) Q Consensus 1 M~~-----is~-----iV~V~i~-~~~~~~~~~~f~~~li~~~~~~~~~~~~~y~s---~~~V~~~fg~~s~--eY~aA~ 64 (428) |+= .++ ++|+.-. ....+.+.|+.-.+.+.. .=.|++.+...++ ..++..-||.+-. 
..++.+ T Consensus 3 magg~~~~~~K~~PG~Y~n~~~~~~~~~~~~~rGi~a~p~~~-~wGp~~~v~~i~~~~~~~~~~~~~G~~~~~~~~~~l~ 81 (436) T protein:vir:78 3 LGGGTFVTQNKVLPGSYINFVSATRATSSLSDRGIVAMPLEL-DWGIDEEVFQVTSDDFEKYSTKYFGYDYTHEKLKGLR 81 (436) T ss_pred ccceeeccceeecCceEEEEEecCcceeeccCCeEEEEEEEe-cCCCCceeEEeecccchHHHHHHhcCccchHHHHHHH Confidence 331 122 2333211 122334555544433333 3345556655555 3467777887543 345566 Q ss_pred HHHhcCCcccEEEEEeeecccccccchheeec------cccccccc-------ceeeeeeecccchhhhhhhhheeeecc Q lcl|NC_019918. 65 RYFGQALKPRSLVIGRRQVPSATVSVSVVQEG------QSYVLTVN-------GLPVSYVSHQDDTATLIATGLKAAYDV 131 (428) Q Consensus 65 ~~F~q~p~P~~l~igr~~~~~~~~~~~~~~~~------~~~~~~v~-------g~~~s~~~~~~~~a~~i~a~l~~a~~~ 131 (428) ..|. .|+.|++.|-.. .+......+++. ....++|. ..++..-.........+...+. +. T Consensus 82 ~~~~---~~~tv~~yrl~~-G~~a~~~v~~Aky~g~~gn~i~v~v~~~~~d~~~~dv~~~~g~~~~d~~~~~~~~---~l 154 (436) T protein:vir:78 82 DLFK---NIRLGYFYKLNK-GVKASCSIATARCSGIRGNDLKVIVTTNIDDNAKFDVVTLLDNKKVDTQIAKVIT---EL 154 (436) T ss_pred HHhc---CCCEEEEEECCC-cceeeeeeeeeecCCCCCcEEEEEecccccccCceEEEEEecchhhhhhhHHHHh---hc Confidence 6773 568899998642 111111111211 11111111 1111111000011111111000 00 Q ss_pred cceEEEEeeccccceeeeeccccccccccceEEEEe---e--ccccCHHHHHHHHHhcccCceEEEEecCCHHHHHHHHH Q lcl|NC_019918. 132 TPVVGVTVTDNEDGTLTVASNGDWSLKVSSNLTMAA---A--PSTEGWPATITAVQGENDEWYALSIDSHADDDIMAVAT 206 (428) Q Consensus 132 ~~~~~~~~tt~~~~~~t~as~~~~~~~~s~~~~~~~---~--~aa~~~~~al~~~~~~~~~w~~~~~~~~~~~~~~ala~ 206 (428) ...-+..+.. .+. +...++..+.. + .+.+++.++++.+... .|..+.+...+.+.+..+.. 
T Consensus 155 ~~n~~V~~~~--~g~----------la~~a~~~LtGG~dG~~~T~~dy~~al~~le~~--~fn~l~~~~~d~~~~~~~~a 220 (436) T protein:vir:78 155 QDNDYVTWKK--EAT----------LEATAGLTFTNGTNGEAVTGTEYQAFLDKIESY--SFNALGCLATTAEIKSLFVE 220 (436) T ss_pred cCCceEEEEe--ccc----------ccccceeeeeccccccccchHHHHHHHHHHccc--ceeEEEecCCChHHHHHHHH Confidence 0011111110 000 11111111221 2 2446788888888665 57777777778888899999 Q ss_pred HHhhh----CCEEEEEecCcccccchhHHHHHHHHhcccCceEEEecCCccchhHHHHHHHHHhccCCCceeeeeeeecC Q lcl|NC_019918. 207 HIEGT----KKVFIGATAQANTKTSAENDIASRLVAAGFQRTALIYHPNADAQFPECAWVGYQLQEQPGSNTWTHKALAA 282 (428) Q Consensus 207 ~~~a~----~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~t~~~y~~~~~~~~~~a~~~~~~~~~~~g~~t~~fk~~~G 282 (428) |+... ++++-.......... .+.+.+.- .++. +......-.+++++|....... ..++-|+.++| T Consensus 221 ~ikr~re~~g~~~~aV~~~~~~~d--~EgIInv~--n~v~------g~~~~~~~~~a~vAG~~Ag~~~-~~S~T~~~~~~ 289 (436) T protein:vir:78 221 FTKRMRDKVGAKFQTVLYKKNDAD--YEGVVSVE--NKIK------DTGLLESSLIYWTTGAIAGCDI-NKSNTNKRYDG 289 (436) T ss_pred HHHHHHhhcCCeEEEEecCCCCCC--CceEEEee--cccC------CceechhHHHHHHHHHHhcCcc-ccCccceecCc Confidence 99853 345533332211110 01110000 0011 1111122245677777666654 35566888888 Q ss_pred cc-c-cCCCHHHHHHHHhCCceEEEEEcCceeeecCEec----C------CchhHHHHHHHHHHHHHHHHHHHHHHhcCC Q lcl|NC_019918. 283 VD-A-YRLTPTESTNLKNKNVTTFERVGGVNRTFGGAMA----G------GEWIDVMIFVDWLEARMTERLWFRMANSKK 350 (428) Q Consensus 283 v~-~-~~~t~t~~~~l~~~~~n~y~~~~~~~~~~~G~~~----~------G~~iD~~~~~dwl~~~lq~~l~~ll~~~~k 350 (428) +. . 
..++.+|++.+.++|.-++...++.-.+-+|+.+ + ..-|-.++-.|-+.+.++..+-+.++ +| T Consensus 290 ~~~v~~~~t~~e~~~ai~~G~lvl~~d~~~v~I~~~VNTltt~~~~k~~~~~kI~vir~~D~i~~di~~~~~~~yi--GK 367 (436) T protein:vir:78 290 EFDVDVNYTQIHLEEALKTGKFIFHKVGDEVHVLEDINTFVSFTDEKNDDFSSNQSVRVLDQIANDIATLFNTKYL--GE 367 (436) T ss_pred cccccccCCHHHHHHHHhCCeEEEEEeCCeEEEEEccccceecCCCCCcchhhhhHHHHHHHHHHHHHHHhhhccc--cc Confidence 63 3 4599999999999999888776665556666532 1 12466777777777777665544343 59 Q ss_pred CCcCHhHHHHHHHHHHHHHHHHHhcCceecCCceEEEeCchHhCCHHHHhccccCCeEEEEEECceEEEEEEEEEEe Q lcl|NC_019918. 351 IPYDAVGATILESEIRAQLNEGIRVGGLAEAPAPKVFVPDVLSMSPNMRAQRIFEGIEFEARLAGAIHFVHIRGTVT 427 (428) Q Consensus 351 ip~~~~G~~~i~~~i~~~~~~~~~~G~I~~g~~~~v~~~~~~~~~~~dra~R~~~~i~~~~~~agaih~v~i~~~v~ 427 (428) +|=+.+|..++.+.|+.-|++..+.|.|.+.....+.... . ..+..--+++.+++-.++..+.+.++|. T Consensus 368 v~N~~dgr~~l~~~i~~yl~~L~~~g~I~~f~~~Dv~v~~------~--~~~~~v~v~~~v~pvdamekiy~ti~v~ 436 (436) T protein:vir:78 368 VPNDKSGRISFWNDVVKHHEQLQNMRAIEDFKADDVSVEP------G--SDKKTVVVSDAVKVISAMSKLYMTVSVS 436 (436) T ss_pred cCCCHHHHHHHHHHHHHHHHHHHhCCcccCCCCcceEEee------c--CCCCEEEEEEEEEEEEeeeeEEEEEEEC Confidence 9999999999999999999999999999865433333321 1 1122233888899999999999999999 No 38 >protein:vir:104858 Length: 729 # NCBI annotation: T4-like tail sheath protein # Family: family:all:661 # MgeID: mge:1630 # MgeName: P-SSM2 # Cross-refs: genbank:acc:YP_214361;genbank:gi:61806001;genbank:GeneID:3294378 Probab=98.73 E-value=7.5e-08 Score=59.65 Aligned_cols=389 Identities=12% Similarity=0.057 Sum_probs=175.2 Q ss_pred CCCCCce--EEEeeeeecccccccccceEEEEccc---CCCccc-eEEeeCHHHHHhh-cCCChHHHHHHHHHHhcCCcc Q lcl|NC_019918. 1 MTVLTDV--IDIQISRETAAVAQTNFNVPLFIASH---TNFSER-ARVYNSLKGVAED-FGESDPTYLAAVRYFGQALKP 73 (428) Q Consensus 1 M~~is~i--V~V~i~~~~~~~~~~~f~~~li~~~~---~~~~~~-~~~y~s~~~V~~~-fg~~s~eY~aA~~~F~q~p~P 73 (428) ......+ ...+...........+.....+.... ...++. +..|..+.-.... -....+.|.. ..+.+ .. 
T Consensus 277 ~~~~~~~~~~~~t~~~~~~~~~~~d~~~~~~~d~~~~~~~~~g~vve~~~~~s~~~~~~~~~~~~~~~~--~vi~~--~s 352 (729) T protein:vir:10 277 TLEWDSIADAPGTSTYVSTRGGKNDEIHVLVIDDKGTITGNSGTILEKHLSLSKAKDAEYSVGSSSYWR--DFLAT--NS 352 (729) T ss_pred cccccccccccccccccccccccccccceeeeccccccccCcccceeeeeeeeeccccccccccccccc--eeecc--cc Confidence 1001110 11111111111111111111111111 111111 2222221111000 0111111110 00000 11 Q ss_pred cEEEEEeeecccccccchheeecccccccccceeeeeeecccchhhhhhhhheeeecccceEEEEeeccccceeeeeccc Q lcl|NC_019918. 74 RSLVIGRRQVPSATVSVSVVQEGQSYVLTVNGLPVSYVSHQDDTATLIATGLKAAYDVTPVVGVTVTDNEDGTLTVASNG 153 (428) Q Consensus 74 ~~l~igr~~~~~~~~~~~~~~~~~~~~~~v~g~~~s~~~~~~~~a~~i~a~l~~a~~~~~~~~~~~tt~~~~~~t~as~~ 153 (428) ..+..+.-. . ..... . ....... ......... ... ...............+.+. T Consensus 353 ~~~~~~~~~-~-~~~~~-~--------~~~~~~~-~~~~~~~~~--~~a------------~~~~~~~~~~~~~~~~~g~ 406 (729) T protein:vir:10 353 KYIFGGGAT-S-GITTT-G--------YSVSSTN-TLDTDSGWD--QNA------------EGVNFGASGVATLTLAGGT 406 (729) T ss_pred ceeeecccc-c-ccccc-c--------ccccccc-eeccccccc--ccc------------ccccccccceeEEEeeccc Confidence 111111100 0 00000 0 0000000 000000000 000 0000000000011111111 Q ss_pred cccccccceEEEEeeccccCHHHHHHHHHhccc-CceEEEEec------CCHHHHHHHHHHHhhhCCEEEEEecCcc--- Q lcl|NC_019918. 154 DWSLKVSSNLTMAAAPSTEGWPATITAVQGEND-EWYALSIDS------HADDDIMAVATHIEGTKKVFIGATAQAN--- 223 (428) Q Consensus 154 ~~~~~~s~~~~~~~~~aa~~~~~al~~~~~~~~-~w~~~~~~~------~~~~~~~ala~~~~a~~~~~~~~~~~~~--- 223 (428) +.................+.....+..+.+... ....+.... .+..-..++...++....++.+...... 
T Consensus 407 ~~~~~~~~~~~~~~~~~~~~~~~g~~~l~~~~~~~~~~~~~~~~~~~~~~~~~v~~a~~~~~~~~~~~~a~~~~~~~~~i 486 (729) T protein:vir:10 407 NYGDKTDLTTSGALSSGVDDIISGYTLFENTEEIEVDFILMGAAHHPKEQSQAVAEKVTAVAEARKDAVAFISPYRQAFL 486 (729) T ss_pred ccccccccccccccccchhHHHHHHHHhhcccccccceeeecCCCCCccchHHHHHHHHHHHHhcCCeEEEecccccccc Confidence 110000000000111122233445555543321 122222211 2233445666777766555544322110 Q ss_pred -----------cccchhHHHHHHHHhcccCceEEEecCC---------c-cchhHHHHHHHHHhccCCCc---eeeeeee Q lcl|NC_019918. 224 -----------TKTSAENDIASRLVAAGFQRTALIYHPN---------A-DAQFPECAWVGYQLQEQPGS---NTWTHKA 279 (428) Q Consensus 224 -----------~~~~~~~~~~~~l~~~~~~~t~~~y~~~---------~-~~~~~~a~~~~~~~~~~~g~---~t~~fk~ 279 (428) .......+.....+..+..+-..+|++- . ....+.+.++|.+...+..+ ....+|. T Consensus 487 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~d~~~~~~~~~p~s~~~aGl~a~~d~~~g~~~span~~ 566 (729) T protein:vir:10 487 NDTVSGTVTVSNIDQTTENVVGFYAPLSSSTYSVFDSGYKYMFDRFNNTFRYVPLNGDIAGTCARTDIEQFPWFSPAGTA 566 (729) T ss_pred cccccccccccccchhhHHHHHHHhhccCCceEEEEcCeeEEecccCCceEEechhHHHHHHHHHhhccCCcEEccCCcc Confidence 0111122222222222222223344321 1 12235567777766555432 2344555 Q ss_pred ecCccc-----cCCCHHHHHHHHhCCceEEEEEcCce-eeecCEecCC-----chhHHHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_019918. 280 LAAVDA-----YRLTPTESTNLKNKNVTTFERVGGVN-RTFGGAMAGG-----EWIDVMIFVDWLEARMTERLWFRMANS 348 (428) Q Consensus 280 ~~Gv~~-----~~~t~t~~~~l~~~~~n~y~~~~~~~-~~~~G~~~~G-----~~iD~~~~~dwl~~~lq~~l~~ll~~~ 348 (428) +.||.- ..+++.|++.|..+|+|++.++.+.+ .++.++++.+ .||-+.+-.+|++..|+..++..+-. 
T Consensus 567 ~~~i~g~~~~~~~~~~~~~~~Ln~~gIn~i~~~~~~G~~~wG~rT~~~~d~~~~~i~vrR~~~~i~~si~~~~~~~v~e- 645 (729) T protein:vir:10 567 RGPILNSVKLVYNPGKKQRDILYSNRINPVILSPGAGIILFGDKTGFGKSSAFDRINVRRLFIYLEDAISAAAKDQLFE- 645 (729) T ss_pred ccceecccceeeecChhhHhhhhhCCceEEEEecCCeEEEEcceecCCCCcccceeehhhhHHHHHHHHHHHHHHhhcC- Confidence 555432 35789999999999999999998765 5688888754 47888888999999999888765543 Q ss_pred CCCCcCHhHHHHHHHHHHHHHHHHHhcCceecCCceEEEeCchHhCCHHHHhccccCCeEEEEEECceEEEEEEEEEEec Q lcl|NC_019918. 349 KKIPYDAVGATILESEIRAQLNEGIRVGGLAEAPAPKVFVPDVLSMSPNMRAQRIFEGIEFEARLAGAIHFVHIRGTVTV 428 (428) Q Consensus 349 ~kip~~~~G~~~i~~~i~~~~~~~~~~G~I~~g~~~~v~~~~~~~~~~~dra~R~~~~i~~~~~~agaih~v~i~~~v~~ 428 (428) |.++.=...|+..|+.-|+..+++|.|. ||.|... .++.+++|+.+++.. +.+.+.+...+++|.++..-.- T Consensus 646 ---pn~~~~~~~i~~~i~~~L~~l~~~g~l~---g~~v~~d-~~~nt~~~i~~G~~~-~~v~~~p~~p~e~i~~~~~~~~ 717 (729) T protein:vir:10 646 ---FNDELTRTNFVNIVEPFLRDVQAKRGIF---DFVVICD-ETNNTAAVIDSNEFV-ADIFIKPARSINFIGLTFVATR 717 (729) T ss_pred ---CCCHHHHHHHHHHHHHHHHHHHhcccee---eeEEEEc-CCCCCHHHhhCCeEE-EEEEEEecCCccEEEEEEEEee Confidence 7788889999999999999999999996 5999886 677899999999988 9999999999999999865554 No 39 >protein:vir:95741 Length: 587 # NCBI annotation: ORF011 # Family: family:all:2449 # MgeID: mge:1577 # MgeName: G1 # Cross-refs: genbank:acc:YP_240910;genbank:gi:66394960;genbank:GeneID:5132682 Probab=98.71 E-value=8.9e-08 Score=59.24 Aligned_cols=409 Identities=12% Similarity=0.090 Sum_probs=215.6 Q ss_pred CC-------CCCce-EEEeeeee-cccccccccceEEEEcccC-CCccceEEeeCHHHHHhhcCCChHHHHHHHHHHhcC Q lcl|NC_019918. 1 MT-------VLTDV-IDIQISRE-TAAVAQTNFNVPLFIASHT-NFSERARVYNSLKGVAEDFGESDPTYLAAVRYFGQA 70 (428) Q Consensus 1 M~-------~is~i-V~V~i~~~-~~~~~~~~f~~~li~~~~~-~~~~~~~~y~s~~~V~~~fg~~s~eY~aA~~~F~q~ 70 (428) |+ |+++= |-|.+.-+ ..+....+.+.+.|+|... .+++++..+++.++...-||... .-.+....|.|. 
T Consensus 1 ~a~~~~~~~~~~~pgv~~~~~~~~~~~~~~~~~~~~~~~g~a~~G~~~~~~~~~~~~~~~~~~~~g~-l~~~~~~a~~~~ 79 (587) T protein:vir:95 1 MAVEPFPRRPITRPHASIEVDTSGIGGSAGSSEKVFCLIGQAEGGEPNTVYELRNYSQAKRLFRSGE-LLDAIELAWGSN 79 (587) T ss_pred CcccccCCcccccCceEEEEecCCccccCCCCCceEEEEEEeCCCCCceeEEeccHHHHHHHhcCcc-hHHHHHHHhccc Confidence 54 33332 22222222 2445666778888888764 66788888999999999998844 556667777664 Q ss_pred C--cccEEEEEeeec-ccccccchhee-----------------------ecc-----------c---------cccccc Q lcl|NC_019918. 71 L--KPRSLVIGRRQV-PSATVSVSVVQ-----------------------EGQ-----------S---------YVLTVN 104 (428) Q Consensus 71 p--~P~~l~igr~~~-~~~~~~~~~~~-----------------------~~~-----------~---------~~~~v~ 104 (428) + .++++++.|-.. +.+.++...+. ... . +.+... T Consensus 80 ~~~g~~~~~~~rv~~~~~a~~~~~~l~~~a~~~G~~gN~i~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~v~si~y~ 159 (587) T protein:vir:95 80 PNYTAGRILAMRIEDAKPASAEIGGLKITSKIYGNVANNIQVGLEKNTLSDSLRLRVIFQDDRFNEVYDNIGNIFTIKYK 159 (587) T ss_pred cCCCceEEEEEEcCCCceeEEEecCeEEEEecccccccceEEEEecCCCCCceeEEEEEecccceeeeeeccceeeeeee Confidence 3 467777776421 11111111000 000 0 000000 Q ss_pred ceeee----------------------------eeecccc--hhhhhhhhh------eeeecccce-EEEEe--eccccc Q lcl|NC_019918. 105 GLPVS----------------------------YVSHQDD--TATLIATGL------KAAYDVTPV-VGVTV--TDNEDG 145 (428) Q Consensus 105 g~~~s----------------------------~~~~~~~--~a~~i~a~l------~~a~~~~~~-~~~~~--tt~~~~ 145 (428) |...+ +...... ++......+ ++.+.+... .+.+. ....+. T Consensus 160 g~~~~~~~~v~~~~~t~~a~~~~l~~g~~~v~~yrL~~g~~~~~~~~~~~in~~~~~tAky~g~~~~~i~~~~~~~~~~~ 239 (587) T protein:vir:95 160 GEEANATFSVEHDEETQKASRLVLKVGDQEVKSYDLTGGAYDYTNAIITDINQLPDFEAKLSPFGDKNLESSKLDKIENA 239 (587) T ss_pred ccccccceeeeecccceeeeeeeeecCCceEEEEEecCCchHHHHHHHHhhccccceEEEEecccCceeEEeecCccccc Confidence 10000 0000000 000000000 111100000 00000 000000 Q ss_pred e------------------------ee--eecc-----------------------ccccccccceEEEEee---ccccC Q lcl|NC_019918. 
146 T------------------------LT--VASN-----------------------GDWSLKVSSNLTMAAA---PSTEG 173 (428) Q Consensus 146 ~------------------------~t--~as~-----------------------~~~~~~~s~~~~~~~~---~aa~~ 173 (428) . .. ...+ ............+..+ ....+ T Consensus 240 ~v~~~~~~v~a~~~d~~~~~~~~~~v~~~~~~g~~~~~~~~~~~~~~~~a~~~~~~~~~~~a~~~~t~LtGG~dG~~~~~ 319 (587) T protein:vir:95 240 NIKDKAVYVKAVFGDLEKQTAYNGIVSFEQLNAEGEVPSNVEVEAGEESATVTATSPIKTIEPFELTKLKGGTNGEPPAT 319 (587) T ss_pred ceehhhhhhhhhhcceeeeeeceeeeeeecccccceeccchhhhhcccchheeccccccceeccceeeeecCCCCCCccc Confidence 0 00 0000 0000000001113332 23456 Q ss_pred HHHHHHHHHhcccCceEEEEecCCHHHHHHHHHHHhhh---CCEEEEEecCcccccchhHHHHHHHHhcccCceEEEecC Q lcl|NC_019918. 174 WPATITAVQGENDEWYALSIDSHADDDIMAVATHIEGT---KKVFIGATAQANTKTSAENDIASRLVAAGFQRTALIYHP 250 (428) Q Consensus 174 ~~~al~~~~~~~~~w~~~~~~~~~~~~~~ala~~~~a~---~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~t~~~y~~ 250 (428) ..++++++... +|+.+.+...+.+-+.++..|++.. .+....+..... ..+...+....+..+++|.+.+.+. T Consensus 320 y~~~l~ale~~--~~~~i~~~t~d~~v~a~l~a~vk~~~~~g~~~~aVvg~~~--~~~~~~~~~~a~~~n~ervi~v~~~ 395 (587) T protein:vir:95 320 WADKLDKFAHE--GGYYIVPLSSKQSVHAEVASFVKERSDAGEPMRAIVGGGF--NESKEQLFGRQESLSNPRVSLVANS 395 (587) T ss_pred HHHHHHHHHhC--CcEEEEecCCCHHHHHHHHHHHHHHHhCCCcEEEEEcCCC--CCCHHHHHHHHhhcCCCcEEEeccc Confidence 78899988764 5666655555556667899998753 233333322211 1223344555667788888765432 Q ss_pred ------Cc-----cchhHHHHHHHHHhccCCCceeeeeeeecCccc-cCCCHHHHHHHHhCCceEEEEEcCce----eee Q lcl|NC_019918. 251 ------NA-----DAQFPECAWVGYQLQEQPGSNTWTHKALAAVDA-YRLTPTESTNLKNKNVTTFERVGGVN----RTF 314 (428) Q Consensus 251 ------~~-----~~~~~~a~~~~~~~~~~~g~~t~~fk~~~Gv~~-~~~t~t~~~~l~~~~~n~y~~~~~~~----~~~ 314 (428) +. +....+++++|......+. ..+.||.++++.. ..++.+|++.+.++|++.+....+.. 
..- T Consensus 396 ~~~~~~dg~~~~~~~~~~aa~vAGl~Ag~~~~-~SlT~~~i~~~~v~~~~t~~e~e~ai~~Gvl~l~~~~~~~~~~vriv 474 (587) T protein:vir:95 396 GTFVMDDGRKNHVPAYMVAVALGGLASGLEIG-ESITFKPLRVSSLDQIYESIDLDELNENGIISIEFVRNRTNTFFRIV 474 (587) T ss_pred ceEecCCCceeeechHHHHHHHHHHHhcCchh-cCccceeeecccccccCCHHHHHHHHhCCeEEEEEecCCcceEEEEe Confidence 10 1223356677777777654 3444555553332 36899999999999999987654432 223 Q ss_pred cCEecC-----Cch--hHHHHHHHHHHHHHHHHHHHHHHhcCCCCcCHhHHHHHHHHHHHHHHHHHhcCceecCCceEEE Q lcl|NC_019918. 315 GGAMAG-----GEW--IDVMIFVDWLEARMTERLWFRMANSKKIPYDAVGATILESEIRAQLNEGIRVGGLAEAPAPKVF 387 (428) Q Consensus 315 ~G~~~~-----G~~--iD~~~~~dwl~~~lq~~l~~ll~~~~kip~~~~G~~~i~~~i~~~~~~~~~~G~I~~g~~~~v~ 387 (428) +|.+.- -.| |-.++-.|.+...++..+-+.+.- | |=++.|...|++.|++.|++..+.|.|...+.-.+. T Consensus 475 ~~itT~t~~~d~~~~~i~viRv~D~i~~dir~~~~~~~iG--k-~nn~~~r~~v~~~i~~~L~~l~~~gaI~~~~~~dv~ 551 (587) T protein:vir:95 475 DDVTTFNDKSDPVKAEMAVGEANDFLVSELKVQLEDQFIG--T-RTINTSASIIKDFIQSYLGRKKRDNEIQDFPAEDVQ 551 (587) T ss_pred ecceeccCCCCcchhhhhhhhhHHHHHHHHHHHHHhhCCc--c-ccchHHHHHHHHHHHHHHHHHHhCCcccCCCccceE Confidence 444431 124 668888888888888887766654 4 568899999999999999999999999743221111 Q ss_pred eCchHhCCHHHHhccccCCeEEEEEECceEEEEEEEEEEec Q lcl|NC_019918. 388 VPDVLSMSPNMRAQRIFEGIEFEARLAGAIHFVHIRGTVTV 428 (428) Q Consensus 388 ~~~~~~~~~~dra~R~~~~i~~~~~~agaih~v~i~~~v~~ 428 (428) .. 
..+|+ --++|.+++.-++++|.+++.+.- T Consensus 552 v~-----~~~d~-----~~v~~~v~Pv~~mekI~vt~~~~~ 582 (587) T protein:vir:95 552 VI-----VEGNE-----ARISMTVYPIRSFKKISVSLVYKQ 582 (587) T ss_pred EE-----ecCCE-----EEEEEEEEEcccceEEEEEEEEee Confidence 11 11122 247889999999999999888866 No 40 >protein:vir:80488 Length: 562 # NCBI annotation: Tsh # Family: family:all:2449 # MgeID: mge:1883 # MgeName: A511 # Cross-refs: genbank:acc:YP_001468473;genbank:gi:157325048;genbank:GeneID:5601446 Probab=98.58 E-value=2.5e-07 Score=56.81 Aligned_cols=409 Identities=11% Similarity=0.055 Sum_probs=212.7 Q ss_pred CC-------CCCce-EEEeeee-ecccccccccceEEEEcccC-CCccceEEeeCHHHHHhhcCCChHHHHHHHHHHhcC Q lcl|NC_019918. 1 MT-------VLTDV-IDIQISR-ETAAVAQTNFNVPLFIASHT-NFSERARVYNSLKGVAEDFGESDPTYLAAVRYFGQA 70 (428) Q Consensus 1 M~-------~is~i-V~V~i~~-~~~~~~~~~f~~~li~~~~~-~~~~~~~~y~s~~~V~~~fg~~s~eY~aA~~~F~q~ 70 (428) |+ ++++= |-|.+-- ...+....+.+.+.|+|... .+++++..+++.++...-||... .-.+....|..+ T Consensus 1 ~~~~~~~~~~~~~pgv~~~~~~s~~~~~~~~~~~~~~~ig~a~~G~~~~~~~~~~~~~~~~~f~~g~-l~~~i~~a~~~~ 79 (562) T protein:vir:80 1 MAIEIYPRKPVSRPHTEISVDTSGIGGSSSGSEKILCLVGSATGGKPNAVYKVRNYSQAKSVFRSGE-LLDAIERAWNPG 79 (562) T ss_pred CeeeeeCCCcccCCceEEEEecCCcccCCCCCCceEEEEEEeCCCCcceeEEEccHHHHHHHhcCCC-hHHHHHHhcccc Confidence 55 22221 2222222 22445667778888888764 66789999999999999997744 333455556422 Q ss_pred C--cccEEEEEeee-cccccccchhee-----------------------ecc--------------------cccc--- Q lcl|NC_019918. 71 L--KPRSLVIGRRQ-VPSATVSVSVVQ-----------------------EGQ--------------------SYVL--- 101 (428) Q Consensus 71 p--~P~~l~igr~~-~~~~~~~~~~~~-----------------------~~~--------------------~~~~--- 101 (428) + --+++|+-|-. ...+..+...+. ... 
-+.+ T Consensus 80 ~~~g~~~~~~~rv~~a~~a~~~~~~~~~~~~~~g~~~n~i~v~~~~~~~~~~~~~~v~~~~~~~~ev~~~~g~v~~i~y~ 159 (562) T protein:vir:80 80 EGTGAGDILAMRVEEAKEATFEAEGVKVSSTIYGADANDIQVALEDNTITGTKRLSIVFAKERVNQVYDNLGSIFSIKYK 159 (562) T ss_pred cccCceEEEEEEcCCCCcceEEecceEEEEeecccCCCceEEEEecCCCCCCcceEEEecCCcceEEeeccCceeeeeec Confidence 1 11234433321 111111100000 000 0000 Q ss_pred --------cccc-----ee------------eeeeeccc--chhhhhhhhhe------eeecccceEEEE---------- Q lcl|NC_019918. 102 --------TVNG-----LP------------VSYVSHQD--DTATLIATGLK------AAYDVTPVVGVT---------- 138 (428) Q Consensus 102 --------~v~g-----~~------------~s~~~~~~--~~a~~i~a~l~------~a~~~~~~~~~~---------- 138 (428) ++.+ .. ..+..... .....+..++. +.+.+.+.-... T Consensus 160 g~~~~a~~~i~~~~~~~~a~~l~~~~g~~~v~~~~l~~g~~~~~~~l~~~i~~~~~~tAky~g~~~n~i~~~~~d~~~~~ 239 (562) T protein:vir:80 160 GTEASATFTVAVDPVTFKATKLTLKAGDKTVKEYDLGSGAYAETNVLISDINNLPDFEAKFFPIGDKNLTTDNFDAQIDV 239 (562) T ss_pred cccccceeEEEecCccceEEEEEEecCCcceeEEEeCCCccchhhhhhhhhccccceEEEecccCCceeeecccccchhh Confidence 0000 00 00000000 00111111111 111110000000 Q ss_pred ---------eeccc--------cceeeeeccccccccccceEEEEeec---cccCHHHHHHHHHhcccCceEEEEecCCH Q lcl|NC_019918. 139 ---------VTDNE--------DGTLTVASNGDWSLKVSSNLTMAAAP---STEGWPATITAVQGENDEWYALSIDSHAD 198 (428) Q Consensus 139 ---------~tt~~--------~~~~t~as~~~~~~~~s~~~~~~~~~---aa~~~~~al~~~~~~~~~w~~~~~~~~~~ 198 (428) .+... ...+..............+..++.+. ..++..++++++... +|+.+.+...+. T Consensus 240 ~~kt~~~~v~~~~~d~~~~~~~n~~v~~~~~~~~~la~~~~~~LtGG~dG~~~~~~~dal~~Le~~--~~~~i~~~t~d~ 317 (562) T protein:vir:80 240 DIKTKEAYVKAVGGDIEKQTAYNGYVEFEFDRSKEIANFPLTKLTGGDNGTIPESWADKFSYFANE--GGYYLVPLTSKQ 317 (562) T ss_pred hcccceeeeeehhhhhhhcccccceEEEEeccCccccccceeeeeCCCCCCccccHHHHHHHHHhC--CcEEEEecCCCh Confidence 00000 00000000000001111223333333 245678889988764 566665555556 Q ss_pred HHHHHHHHHHhhh---CCEEEEEecCcccccchhHHHHHHHHhcccCceEEEecCC-----------ccchhHHHHHHHH Q lcl|NC_019918. 
199 DDIMAVATHIEGT---KKVFIGATAQANTKTSAENDIASRLVAAGFQRTALIYHPN-----------ADAQFPECAWVGY 264 (428) Q Consensus 199 ~~~~ala~~~~a~---~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~t~~~y~~~-----------~~~~~~~a~~~~~ 264 (428) +.+.++..|++.. ++....+..... ..+...+....+..+++|.+.+.+.- .+....+++++|. T Consensus 318 ai~~~~~a~vkr~r~~g~~~~aVvg~~~--~~~~~~~~~~a~~~n~e~vv~v~~~~~~~~~~~~~~~~~~~~~aa~vAGl 395 (562) T protein:vir:80 318 AVHAEALQFVRDCSYNGNPMRVFVGGGI--GESMEQLFTRAIGLQNERAGLIGFSGTVKMDDGRSLKMPGYMFAAQVAGL 395 (562) T ss_pred HHHHHHHHHHHHHHhCCCeEEEEecCCC--CCCHHHHHHHhhhcCCCeEEEEecCeeEECCCCceeeechhHHHHHHHHH Confidence 6677899999753 333333332221 12234455566677888887765431 1122346677777 Q ss_pred HhccCCCceeeeeeeecCccc-cCCCHHHHHHHHhCCceEEEEEcCcee----eecCEecCC-----c--hhHHHHHHHH Q lcl|NC_019918. 265 QLQEQPGSNTWTHKALAAVDA-YRLTPTESTNLKNKNVTTFERVGGVNR----TFGGAMAGG-----E--WIDVMIFVDW 332 (428) Q Consensus 265 ~~~~~~g~~t~~fk~~~Gv~~-~~~t~t~~~~l~~~~~n~y~~~~~~~~----~~~G~~~~G-----~--~iD~~~~~dw 332 (428) .....+. ..+.||.++|+.. ..++.+|++.+.++|++.+....+... .-++.+.-. . .|-.++-.|. T Consensus 396 ~Ag~~~~-~S~T~~~i~~~~v~~~lt~~e~~~li~~G~l~l~~~~~~~v~~~riv~~itT~t~~~~~~~~ki~viRv~D~ 474 (562) T protein:vir:80 396 TCGLEIG-EAITFKNIAIETLDTIYEGSQLDQLNESGIITAEFVRNRAVTNFRIVDDVTTFNDKTDPVKSEIGVGEANDF 474 (562) T ss_pred HhcCccc-cCccceeeccccccccCCHHHHHHHHhCCeEEEEEecCCcEEEEEeeccceeccCCCCchhhhhhhhHHHHH Confidence 7776653 5556677775432 368999999999999999987654432 223444321 2 4667777888 Q ss_pred HHHHHHHHHHHHHHhcCCCCcCHhHHHHHHHHHHHHHHHHHhcCceecCCceEEEeCchHhCCHHHHhccccCCeEEEEE Q lcl|NC_019918. 333 LEARMTERLWFRMANSKKIPYDAVGATILESEIRAQLNEGIRVGGLAEAPAPKVFVPDVLSMSPNMRAQRIFEGIEFEAR 412 (428) Q Consensus 333 l~~~lq~~l~~ll~~~~kip~~~~G~~~i~~~i~~~~~~~~~~G~I~~g~~~~v~~~~~~~~~~~dra~R~~~~i~~~~~ 412 (428) +...++..+-+.++- | |=|+.|...|++.++..|++..+.|.|.....-.+... ..+|+ --+.+.+. 
T Consensus 475 i~~dir~~~~~~yIG--k-~Nn~~~r~~v~~~i~~~L~~l~~~gaI~~~~~~dv~v~-----~~~d~-----~~v~~~v~ 541 (562) T protein:vir:80 475 LVSELKISLDNEYIG--T-KIIDTSASLVKNFVQSFLDRKKLAKEIQDYSPEEVQVV-----IEGDI-----ARISLTVF 541 (562) T ss_pred HHHHHHHHHHhcCCc--c-ccChHHHHHHHHHHHHHHHHHHhCCcccCCCccceEEE-----ecCCE-----EEEEEEEE Confidence 877777766665544 4 56889999999999999999999999964321112211 12222 13788999 Q ss_pred ECceEEEEEEEEEEec Q lcl|NC_019918. 413 LAGAIHFVHIRGTVTV 428 (428) Q Consensus 413 ~agaih~v~i~~~v~~ 428 (428) +.-++++|.+++.+.- T Consensus 542 Pv~~mekIy~ti~~~~ 557 (562) T protein:vir:80 542 PIRSMKKIEVSLVYRQ 557 (562) T ss_pred EcccceEEEEEEEEEe Confidence 9999999999888877 No 41 >protein:vir:80779 Length: 569 # NCBI annotation: putative tail sheath protein # Family: family:all:2449 # MgeID: mge:1885 # MgeName: phiEF24C # Cross-refs: genbank:acc:YP_001504132;genbank:gi:158079319;genbank:GeneID:5666428 Probab=98.47 E-value=5.4e-07 Score=54.94 Aligned_cols=409 Identities=11% Similarity=0.068 Sum_probs=216.4 Q ss_pred CC-------CCCce-EEEeeee-ecccccccccceEEEEcccC-CCccceEEeeCHHHHHhhcCCChHHHHHHHHHHhcC Q lcl|NC_019918. 1 MT-------VLTDV-IDIQISR-ETAAVAQTNFNVPLFIASHT-NFSERARVYNSLKGVAEDFGESDPTYLAAVRYFGQA 70 (428) Q Consensus 1 M~-------~is~i-V~V~i~~-~~~~~~~~~f~~~li~~~~~-~~~~~~~~y~s~~~V~~~fg~~s~eY~aA~~~F~q~ 70 (428) |+ +++|= |-|.+.- ...+....+.+.+.++|... .+++++..+++.++..+-||... .-.+..+.|+-. T Consensus 1 ~~~~~~~~~~~~~Pgv~~~~~~~~~~~~~~~~~~~~~~ig~a~~G~~~~~~~~~~~~~~~~~f~~g~-l~~a~~~a~~~~ 79 (569) T protein:vir:80 1 MAVEQFPRKKVSRPHTEITVDTSGIGGSSSSSDKTLMLVGSAKGGKPDTVYRFRNYQQAKQVLRSGD-LLDAIELAWNAS 79 (569) T ss_pred CeeeeecCCccccCceEEEEecCCCcCCCCCCceeEEEEEEeCCCCCceeEEecCHHHHHHHhcCCc-hhHHHHhhccCc Confidence 55 33331 2232222 23456677888888898764 56788989999999999998743 555666777533 Q ss_pred ----CcccEEEEEeeec-ccccccchhee-----------------------ecccc---------------ccccccee Q lcl|NC_019918. 
71 ----LKPRSLVIGRRQV-PSATVSVSVVQ-----------------------EGQSY---------------VLTVNGLP 107 (428) Q Consensus 71 ----p~P~~l~igr~~~-~~~~~~~~~~~-----------------------~~~~~---------------~~~v~g~~ 107 (428) -.|+++++-|-.. ..+.++...++ ..... +.++-++. T Consensus 80 ~~~~~~~~~~~~~rv~~a~~a~~~~~~~~~~a~~~g~~~n~i~v~l~~~~~~~~~~~~v~~~~~~~~~~~~~ig~v~si~ 159 (569) T protein:vir:80 80 DVNTASAGDILAVRVEDAKNATLTKGGLTFASTIYGVDANEIQVALEDNNLTHTKRLTVAFSKDGYKKVFDNLGKIFSIQ 159 (569) T ss_pred cccccCceEEEEEEcCCCeeeeeeccceeeeeeeccCCCceEEEEEecCcCCcceeeEEeeecCCCccccccccceeeEE Confidence 3466777766421 11111100000 00000 00000001 Q ss_pred eeee-----ecc-----cchhh--------------------------hhhhhheeeecccceEEEEe-eccc------- Q lcl|NC_019918. 108 VSYV-----SHQ-----DDTAT--------------------------LIATGLKAAYDVTPVVGVTV-TDNE------- 143 (428) Q Consensus 108 ~s~~-----~~~-----~~~a~--------------------------~i~a~l~~a~~~~~~~~~~~-tt~~------- 143 (428) ++.. ... ...+. .....+.+.++......... .... T Consensus 160 ytg~~~~a~~~~~~~~~~~~a~~l~~~~g~~~~~~~~v~~~~~~~~~~~~~~~lv~~~~~~~~f~a~~~~~~~~~~~~~~ 239 (569) T protein:vir:80 160 YKGSEAQANFTIAQDSISKKATTLTLNVGSEPESTTEVMKYELGQGVYSETNVLVSAINSLPDWEAKFFPIGDKNLPTDA 239 (569) T ss_pred EeeccccceEEeecCcCcceeEEEEEEecCCcceeEEEEeeccCCccchhhhhhhhhcCCccCceEEEEecCCCcceehh Confidence 1000 000 00000 00001111111111000000 0000 Q ss_pred -----------------------------cceeeeeccccccccccceEEEEee---ccccCHHHHHHHHHhcccCceEE Q lcl|NC_019918. 144 -----------------------------DGTLTVASNGDWSLKVSSNLTMAAA---PSTEGWPATITAVQGENDEWYAL 191 (428) Q Consensus 144 -----------------------------~~~~t~as~~~~~~~~s~~~~~~~~---~aa~~~~~al~~~~~~~~~w~~~ 191 (428) ...+.....+........+..++.+ ....+..++++++... 
+|..+ T Consensus 240 ~d~~~~~~~~t~~~~~~~~~~di~~~~~~~~~v~~~~~~~~~l~~~~~~~LtGG~dG~~~~~~~~~l~~le~~--~~~~i 317 (569) T protein:vir:80 240 LEAVTKVDVKTEAVFVGALAGDIAKQLEYNDYVTVAVDATKPVEDFELTNLTGGSDGTAPESWANKFPLLANE--GGYYL 317 (569) T ss_pred ccchhheeccccceeeehhHHHHHHhhcCCceEEEEecCCcceeeecceeecCCCCCCccchHHHHHHHHhhC--CcEEE Confidence 0000000000000000111122222 2334678888888764 46556 Q ss_pred EEecCCHHHHHHHHHHHhhh---CCEEEEEecCcccccchhHHHHHHHHhcccCceEEEecC-----------CccchhH Q lcl|NC_019918. 192 SIDSHADDDIMAVATHIEGT---KKVFIGATAQANTKTSAENDIASRLVAAGFQRTALIYHP-----------NADAQFP 257 (428) Q Consensus 192 ~~~~~~~~~~~ala~~~~a~---~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~t~~~y~~-----------~~~~~~~ 257 (428) .+...+.+.+.++..|++.. ++..+.+...... ...+.+....+..+++|.+.++.. ..+.... T Consensus 318 ~~~t~d~av~~~l~a~vkr~r~~g~~~~aVvg~~~~--~~~~~~~~~a~~~n~e~vv~v~~~~~~~~~~g~~~~~~~~~~ 395 (569) T protein:vir:80 318 VPLTDKQAVHSEALAFVKDRTDNGDPMRIIVGGGTN--ETVEESITRATNLRDPRASLVGFSGTRKMDDGRLLKLPGYMM 395 (569) T ss_pred EecCCChHHHHHHHHHHHHHHhCCCcEEEEecCCCC--CCHHHHHHHHhhcCCCeEEEEecCceeecCCCcceeechhhH Confidence 66566666678899999864 2333333222111 123344455666778877665432 1112334 Q ss_pred HHHHHHHHhccCCCceeeeeeeecCccc-cCCCHHHHHHHHhCCceEEEEEcCcee----eecCEecCC-----ch--hH Q lcl|NC_019918. 258 ECAWVGYQLQEQPGSNTWTHKALAAVDA-YRLTPTESTNLKNKNVTTFERVGGVNR----TFGGAMAGG-----EW--ID 325 (428) Q Consensus 258 ~a~~~~~~~~~~~g~~t~~fk~~~Gv~~-~~~t~t~~~~l~~~~~n~y~~~~~~~~----~~~G~~~~G-----~~--iD 325 (428) +++++|......+. ..+.||.++++.. ..++.+|++.+.++|++.+....+... .-++.+.-. 
.| |- T Consensus 396 aa~vAG~~A~~~~~-~S~T~k~i~~~~i~~~lt~~e~~~li~~G~~~l~~~~~~~~~v~~~vn~itT~t~~~~~~~~~i~ 474 (569) T protein:vir:80 396 ASQIAGIASGLEVG-EAITFKHFNVTSVDRVFESSQLDMLNESGVISIEFVRNRTLTAFRVVQDVTTYNDKSDPVKNEMS 474 (569) T ss_pred HHHHHHHHhcCccc-cCccceeeccccccccCCHHHHHHHHhCCeEEEEEecCceEEEEEEeccceecCCCCCchhhhhh Confidence 66777777776654 3455666664332 358999999999999999987654321 224444422 24 67 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcCCCCcCHhHHHHHHHHHHHHHHHHHhcCceecCCceEEEeCchHhCCHHHHhccccC Q lcl|NC_019918. 326 VMIFVDWLEARMTERLWFRMANSKKIPYDAVGATILESEIRAQLNEGIRVGGLAEAPAPKVFVPDVLSMSPNMRAQRIFE 405 (428) Q Consensus 326 ~~~~~dwl~~~lq~~l~~ll~~~~kip~~~~G~~~i~~~i~~~~~~~~~~G~I~~g~~~~v~~~~~~~~~~~dra~R~~~ 405 (428) .++-.|.+...++..+-+.++- | |-++.|...|++.++..|++..+.|.|.....-.+... ..+| | - T Consensus 475 viRv~D~i~~dir~~~~~~yiG--k-~nn~~~r~~v~~~i~~~L~~l~~~gaI~~~~~~dv~v~-----~~~d---~--~ 541 (569) T protein:vir:80 475 VGEANDFLVSELKIELDNNFIG--T-KVIDTSASLIKNFIQSFLDNKKRAREIQDYTPEEVQVV-----LEGD---V--A 541 (569) T ss_pred hhHHHHHHHHHHHHHHHhhcCc--c-cCChhHHHHHHHHHHHHHHHHHhCCcccCCCccceEEE-----ecCC---E--E Confidence 8888888888887776665543 4 67889999999999999999999999963321112111 1112 2 2 Q ss_pred CeEEEEEECceEEEEEEEEEEec Q lcl|NC_019918. 406 GIEFEARLAGAIHFVHIRGTVTV 428 (428) Q Consensus 406 ~i~~~~~~agaih~v~i~~~v~~ 428 (428) -+.+.+.+--++++|.+++.+.- T Consensus 542 ~v~~~v~Pv~~~ekI~~ti~~~~ 564 (569) T protein:vir:80 542 SISMTVMPIRSLNKITVQLVYKQ 564 (569) T ss_pred EEEEEEEEcccccEEEEEEEEee Confidence 37888999999999999998887 No 42 >protein:vir:102359 Length: 356 # NCBI annotation: XkdK protein # Family: family:all:632 # MgeID: mge:1566 # MgeName: phi CD119 # Cross-refs: genbank:acc:YP_529565;genbank:gi:90592650;genbank:GeneID:3974491 Probab=98.45 E-value=6.2e-07 Score=54.62 Aligned_cols=323 Identities=11% Similarity=0.060 Sum_probs=164.6 Q ss_pred HhcCCcccEEEEEeeecccccccch--heeec--ccccccccceeeeeeecccchhhhhhhhheeeecccceEEEEeecc Q lcl|NC_019918. 
67 FGQALKPRSLVIGRRQVPSATVSVS--VVQEG--QSYVLTVNGLPVSYVSHQDDTATLIATGLKAAYDVTPVVGVTVTDN 142 (428) Q Consensus 67 F~q~p~P~~l~igr~~~~~~~~~~~--~~~~~--~~~~~~v~g~~~s~~~~~~~~a~~i~a~l~~a~~~~~~~~~~~tt~ 142 (428) ..+-|...-.+-..- .+....+ ++... ......+ ..++ ..........+....... ..+. . T Consensus 1 ~~glp~i~i~f~~~a---~ta~~~g~rGiv~~il~d~~~~~--~~~~---~~~~v~~~~~~~n~~~i~-~~~~--g---- 65 (356) T protein:vir:10 1 MAGLVNINIEFKELA---TSFIQRSKAGIVAIILKDTTKMY--KELT---SEDDIPISLSADNKKYIK-YGFV--G---- 65 (356) T ss_pred CCCCCceeEEEeecc---eeeccCCccceEEEEEecCCcce--eEEe---ccccchhHHHHHHHHHHH-HHhh--c---- Confidence 444443333332211 1111100 11100 0000000 0111 111111111110000000 0000 0 Q ss_pred ccceeeeeccccccccccceEEEEeeccccCHHHHHHHHHhcccCceEEEEecCCHHHHHHHHHHHhhh----CCEEEEE Q lcl|NC_019918. 143 EDGTLTVASNGDWSLKVSSNLTMAAAPSTEGWPATITAVQGENDEWYALSIDSHADDDIMAVATHIEGT----KKVFIGA 218 (428) Q Consensus 143 ~~~~~t~as~~~~~~~~s~~~~~~~~~aa~~~~~al~~~~~~~~~w~~~~~~~~~~~~~~ala~~~~a~----~~~~~~~ 218 (428) +.. +......+.. ........+++.++|..+.... |-.+.+.+.+++++..++.|+... ++++-.. T Consensus 66 --~~~-----~~~~~~p~~~-~~~~~~t~~~y~~aL~~le~~~--fn~l~~~~~d~~~~~~~~a~ikr~r~~~~~~~~~V 135 (356) T protein:vir:10 66 --ATD-----NEKVLRPSKV-IISTFTEDGKVEDILEELESVE--FNYLCMPEAIEAEKTKIVTWIKKIREEESTEAKAV 135 (356) T ss_pred --ccc-----ccccccceee-eeecccCchhHHHHHHHhcCcc--ceEEEecCCChHHHHHHHHHHHHHHhcCCcEEEEE Confidence 000 0000001111 1112234578999999997654 444667777888899999999853 3444333 Q ss_pred ecCcccccchhHHHHHHHHhcccCceEEEecCCccchhHHHHHHHHHhccCCCceeeeeeeecCcccc-CCCHHHHHHHH Q lcl|NC_019918. 219 TAQANTKTSAENDIASRLVAAGFQRTALIYHPNADAQFPECAWVGYQLQEQPGSNTWTHKALAAVDAY-RLTPTESTNLK 297 (428) Q Consensus 219 ~~~~~~~~~~~~~~~~~l~~~~~~~t~~~y~~~~~~~~~~a~~~~~~~~~~~g~~t~~fk~~~Gv~~~-~~t~t~~~~l~ 297 (428) ...... ..+.+.+ +...+..-+......-.+++++|..+.... ..++-|+.++++... .++.+|++... 
T Consensus 136 ~~~~~a---D~EgIIn------v~n~~~~~g~~~t~~~~~~~vAG~~Ag~~~-n~S~T~~~~~~~~~~~~~t~~e~~~ai 205 (356) T protein:vir:10 136 LANIKA---DNEAIIN------FTENVVVDGEEITAEKYTTRVASLIASTPN-TQSITYAPLDEVESIVKIDKASADAKV 205 (356) T ss_pred ecCCCC---CCceeEE------eecCeEecceeechhHHHHHHHHHHhccch-hccccceecCCccccccCCHHHHHHHH Confidence 322211 1111111 111111111111122235677777666654 345567777776643 58999999999 Q ss_pred hCCceEEEEEcCceeeecCEe----cCCc------hhHHHHHHHHHHHHHHHHHHHHHHhcCCCCcCHhHHHHHHHHHHH Q lcl|NC_019918. 298 NKNVTTFERVGGVNRTFGGAM----AGGE------WIDVMIFVDWLEARMTERLWFRMANSKKIPYDAVGATILESEIRA 367 (428) Q Consensus 298 ~~~~n~y~~~~~~~~~~~G~~----~~G~------~iD~~~~~dwl~~~lq~~l~~ll~~~~kip~~~~G~~~i~~~i~~ 367 (428) ++|.-++..-++.-.+-+|+. .+.+ -|-.++..|-+.+.++..+-+.++ +|+|=+.+|..++.+.++. T Consensus 206 ~~G~lvl~~d~~~V~I~~~VNSltt~t~~k~~~f~Kirvvr~~D~i~~Di~~~f~~~yi--GKv~N~~dgr~~l~~ai~~ 283 (356) T protein:vir:10 206 QAGELILRRLSGKIRIARGINSLTTLTAEKGEIFQKIKLVDTKDLISKDIKNIYVEKYL--RKCPNTYDNKCLFIVAVQS 283 (356) T ss_pred hCCeEEEEEEcCeEEEEecCccceecCCCCCcchhhhHHHHHHHHHHHHHHHHHhhccc--cccCCCHHHHHHHHHHHHH Confidence 999998877766655666652 2332 377788877777777654433332 7999999999999999999 Q ss_pred HHHHHHhcCceecCCceEEEeCch-----------HhCCHHH---HhccccCCeEEEEEECceEEEEEEEEEE Q lcl|NC_019918. 368 QLNEGIRVGGLAEAPAPKVFVPDV-----------LSMSPNM---RAQRIFEGIEFEARLAGAIHFVHIRGTV 426 (428) Q Consensus 368 ~~~~~~~~G~I~~g~~~~v~~~~~-----------~~~~~~d---ra~R~~~~i~~~~~~agaih~v~i~~~v 426 (428) -+++..+.|.|.++...++..... 
++++..+ ..-+..-=+++.+++-.++..+.+.++| T Consensus 284 y~~~L~~~~~I~~~~~~eid~e~q~~~~~~~g~d~~~~~d~~v~~~~~~~~v~~~~~v~~vdamE~iy~ti~v 356 (356) T protein:vir:10 284 YLTELAKQELIDSNFTVEIDLEKQKEYLEGKKIAVSKMKENEIKEANTGSNGFYLINLKLVDAMEDINIRVQM 356 (356) T ss_pred HHHHHHhCCccccCceeEecccchHHHhhhccccccccccceeecccCCcEEEEEEEEEEEeeeeeEEeEEeC Confidence 999999999998764333333211 1111111 1112223377888999999999998888 No 43 >protein:vir:104477 Length: 749 # NCBI annotation: gp18 # Family: family:all:661 # MgeID: mge:1548 # MgeName: P-SSM4 # Cross-refs: genbank:acc:YP_214663;genbank:gi:61806304;genbank:GeneID:3294532 Probab=98.44 E-value=6.3e-07 Score=54.56 Aligned_cols=397 Identities=12% Similarity=0.098 Sum_probs=177.1 Q ss_pred CCC--CCceEEEe-----------------eeeeccccc----------ccccceEEEEccc---CCCccc-eEEeeCHH Q lcl|NC_019918. 1 MTV--LTDVIDIQ-----------------ISRETAAVA----------QTNFNVPLFIASH---TNFSER-ARVYNSLK 47 (428) Q Consensus 1 M~~--is~iV~V~-----------------i~~~~~~~~----------~~~f~~~li~~~~---~~~~~~-~~~y~s~~ 47 (428) ..+ ....+.++ +.+.++++. ...+-. ++.... ....+. +..|..++ T Consensus 272 ~~~~~~g~~~~it~v~~~~~~~~~~t~~~~~~~a~~~gt~~~~~~~~g~~D~~~v-~v~~~~g~~~~~~g~v~e~~~~~~ 350 (749) T protein:vir:10 272 VVQDTNSTNITITSVRDEYTEREYLPGVKWINVAPRPGTSLYANGVGGHRDEMHV-ILVDIDGGVTGTVGALLERYIDVS 350 (749) T ss_pred cccCCccceeEEEeeeccccccccccceeeccccccccceeeeecccCCCCceEE-EEecCCCeeeecccceeeeeeecc Confidence 000 00000010 001111100 000000 011000 011111 12222222 Q ss_pred HHH-hhcCCChHHHHHHHHHHhcCCcccEEEEEeeecccccccchheeecccccccccceeeeeeecccchhhhhhhhhe Q lcl|NC_019918. 48 GVA-EDFGESDPTYLAAVRYFGQALKPRSLVIGRRQVPSATVSVSVVQEGQSYVLTVNGLPVSYVSHQDDTATLIATGLK 126 (428) Q Consensus 48 ~V~-~~fg~~s~eY~aA~~~F~q~p~P~~l~igr~~~~~~~~~~~~~~~~~~~~~~v~g~~~s~~~~~~~~a~~i~a~l~ 126 (428) .-. ..+...++.|- ...+.+ ....++++............ ... ..+.....+..+.......... ....... 
T Consensus 351 ~~~~~~~~~~~~~~~--~~~~~~--~s~~v~~~~~~~~~~~~~~~-~~~-~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~ 423 (749) T protein:vir:10 351 KASDAKTSVGETNYY--AEVIKQ--KSEFIYWAEHESTLYAATSS-ASD-GLFGQTAANRQFNLFRSAAGSV-DYPAGVT 423 (749) T ss_pred ccccccccccccchh--hhhhcc--CCCEEEEEeccccccccccc-ccc-cccccccccceeeccccccccc-eeccccc Confidence 110 01222333332 122222 12334444321110000000 000 0000000000000000000000 0000000 Q ss_pred eeecccceEEEEeeccccceeeeeccccccccccceEEEEeeccccCHHHHHHHHHhcc-cCceEEEEe--cCCH----H Q lcl|NC_019918. 127 AAYDVTPVVGVTVTDNEDGTLTVASNGDWSLKVSSNLTMAAAPSTEGWPATITAVQGEN-DEWYALSID--SHAD----D 199 (428) Q Consensus 127 ~a~~~~~~~~~~~tt~~~~~~t~as~~~~~~~~s~~~~~~~~~aa~~~~~al~~~~~~~-~~w~~~~~~--~~~~----~ 199 (428) ...+. ......+..........+.. ............+..+.+.. .....+.+. ..+. . T Consensus 424 ~~~~~-----------~~~~~~~~~~gg~d~~~~~~---~~~~~~~~~~~~~~~l~~~~~~~~~~li~~~~~~~~~~~~~ 489 (749) T protein:vir:10 424 TLGSK-----------NNATYYYRLSGGVNYTVSAG---QYTITNTDIGSAYELIGDPESQIVDFIISGPSGTSDANALA 489 (749) T ss_pred ccccc-----------CCcEEEEEccCCcccccccc---cccccchhHHHHHHHhhhhhhcccceEEEecCCCCcchhHH Confidence 00000 00011110000000000000 00011223334444444332 122222221 2222 2 Q ss_pred HHHHHHHHHhhhCCEEEEEecCcccc------cchhHHHHHHHHhcccCceEEEecC-------Ccc---chhHHHHHHH Q lcl|NC_019918. 200 DIMAVATHIEGTKKVFIGATAQANTK------TSAENDIASRLVAAGFQRTALIYHP-------NAD---AQFPECAWVG 263 (428) Q Consensus 200 ~~~ala~~~~a~~~~~~~~~~~~~~~------~~~~~~~~~~l~~~~~~~t~~~y~~-------~~~---~~~~~a~~~~ 263 (428) ...++...++....++.+.-...... 
.....+.....+.....+-..+|++ ..+ ...+.+.++| T Consensus 490 v~~al~~~~~~~~~~~~~~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~vAG 569 (749) T protein:vir:10 490 KITSLVNIAEERRDCMVFVSPRRGNVIGISNTTTITTNIVDFFKKLPSSSYMVFDSGYKYIYDKYNDVYRYIPCNGDTAG 569 (749) T ss_pred HHHHHHHHHhhcCCEEEEEcCCCCcccccccchhhhhHHHHHHhhccCceeEEEEccceeeeccccCceEEechHHHHHH Confidence 34566667776666555443222111 1111222222222221122333332 111 1346677888 Q ss_pred HHhccCCCceee---eeeeecCcc-----ccCCCHHHHHHHHhCCceEEEEEcCce-eeecCEecCC-----chhHHHHH Q lcl|NC_019918. 264 YQLQEQPGSNTW---THKALAAVD-----AYRLTPTESTNLKNKNVTTFERVGGVN-RTFGGAMAGG-----EWIDVMIF 329 (428) Q Consensus 264 ~~~~~~~g~~t~---~fk~~~Gv~-----~~~~t~t~~~~l~~~~~n~y~~~~~~~-~~~~G~~~~G-----~~iD~~~~ 329 (428) .+...+..+--| .+|++.|+. ...+++.|.+.|..+|+|....+.+.+ .++..+|+.+ .||-+.+- T Consensus 570 l~Ar~D~~~g~~~SPan~~~~~i~g~~~~~~~~~~~e~~~Ln~~gIn~i~~~~g~G~~~wG~rT~~s~d~~~~~i~vRRl 649 (749) T protein:vir:10 570 LCLQTNEISEPWFSPAGFQRGVLRNAIKLAYTPNKAQRDQLYANRVNPIVSFPGQGVVLYGDKTALGFASAFDRINIRRL 649 (749) T ss_pred HHHHhhccCCcEECcCCceeeeeeccccceeecChhHHHhhhhCCceEEEEecCCeEEEEcceecCCCCcccceeehhhh Confidence 777666443233 355544332 235789999999999999999998875 5688888744 36778888 Q ss_pred HHHHHHHHHHHHHHHHHhcCCCCcCHhHHHHHHHHHHHHHHHHHhcCceecCCceEEEeCchHhCCHHHHhccccCCeEE Q lcl|NC_019918. 330 VDWLEARMTERLWFRMANSKKIPYDAVGATILESEIRAQLNEGIRVGGLAEAPAPKVFVPDVLSMSPNMRAQRIFEGIEF 409 (428) Q Consensus 330 ~dwl~~~lq~~l~~ll~~~~kip~~~~G~~~i~~~i~~~~~~~~~~G~I~~g~~~~v~~~~~~~~~~~dra~R~~~~i~~ 409 (428) .+|++..|+..+...+-. |.++.=...|+..|+.-|+..++.|.|. +|.|... .++.+++|+.+++.. 
+.+ T Consensus 650 ~~~ie~si~~~~~~~v~e----pn~~~l~~~i~~~i~~fL~~l~~~G~i~---~f~V~~d-~~~Nt~~~i~~G~~~-~~i 720 (749) T protein:vir:10 650 FLTVERVISTAAKAQLFE----QNDEAQRSLFINIVEPYLRDVQGRRGVV---DFLVKCD-STNNTPEAVDRGEFY-AEV 720 (749) T ss_pred HHHHHHHHHHHHHHhhcC----CCCHHHHHHHHHHHHHHHHHHHhcCCee---eeEEEEc-CCCCCHHHhhCCEEE-EEE Confidence 889888888877765443 7788889999999999999999999884 6899887 777899999998886 999 Q ss_pred EEEECceEEEEEEEEEEec Q lcl|NC_019918. 410 EARLAGAIHFVHIRGTVTV 428 (428) Q Consensus 410 ~~~~agaih~v~i~~~v~~ 428 (428) .+++...+++|+++..-.- T Consensus 721 ~~~P~~pae~I~~~~~~~~ 739 (749) T protein:vir:10 721 FLKPTRTINYVQLTFVATR 739 (749) T ss_pred EEEecCCccEEEEEEEEee Confidence 9999999999999876554 No 44 >protein:vir:7653 Length: 581 # NCBI annotation: gp124 # Family: family:all:1196 # MgeID: mge:148 # MgeName: Bxz1 # Cross-refs: genbank:acc:NP_818197;genbank:gi:29566631;genbank:GeneID:1259809 Probab=98.39 E-value=9e-07 Score=53.73 Aligned_cols=334 Identities=12% Similarity=0.055 Sum_probs=168.0 Q ss_pred CC----CCCceEEEeeeeecccccccccceEEEEcc--cCCCc-cceEEeeCHHHHHhhcCCChHHHHHHHHHHhcCCcc Q lcl|NC_019918. 1 MT----VLTDVIDIQISRETAAVAQTNFNVPLFIAS--HTNFS-ERARVYNSLKGVAEDFGESDPTYLAAVRYFGQALKP 73 (428) Q Consensus 1 M~----~is~iV~V~i~~~~~~~~~~~f~~~li~~~--~~~~~-~~~~~y~s~~~V~~~fg~~s~eY~aA~~~F~q~p~P 73 (428) +. +++..+ ..+..+...........+-.. -.+|. +.+..|.+-++..+-|+....+- T Consensus 198 ~~~~~~~~~~~~---~t~~~~~~g~~~~~~~i~~~~~~~~D~~~~~~v~~~~~~~~~~~~~~~~~~~------------- 261 (581) T protein:vir:76 198 GEDGEANTRDDL---YTIQRVVDGGHIDPGDIVQLSYRYTDPNYHEVIRFTDPDDIQDFYGPAFDEA------------- 261 (581) T ss_pred Ccccceeeeeee---eeeEeecccccccceeEEEEEEEeecCCccceEEEecccccccceeeehhhc------------- Confidence 11 111110 011111111111111111000 01111 12333433333322222111100 Q ss_pred cEEEEEeeecccccccchheeecccccccccceeeeeeecccchhhhhhhhheeeecccceEEEEeeccccceeeeeccc Q lcl|NC_019918. 
74 RSLVIGRRQVPSATVSVSVVQEGQSYVLTVNGLPVSYVSHQDDTATLIATGLKAAYDVTPVVGVTVTDNEDGTLTVASNG 153 (428) Q Consensus 74 ~~l~igr~~~~~~~~~~~~~~~~~~~~~~v~g~~~s~~~~~~~~a~~i~a~l~~a~~~~~~~~~~~tt~~~~~~t~as~~ 153 (428) | .+.+. + . .... ..-.++.........+ + T Consensus 262 -----g-------------~~~~e-----~---~------------~~~~--~~~t~~~~~~l~~gvd-----------~ 290 (581) T protein:vir:76 262 -----G-------------NVQSE-----I---T------------LCAQ--LAITNGASTILACAVD-----------P 290 (581) T ss_pred -----C-------------ccccc-----h---h------------hhhh--eeeccccceEEEeeec-----------C Confidence 0 00000 0 0 0000 0000000000000000 0 Q ss_pred cccccccceEEEEeeccccCHHHHHHHHHhcccCceEEEEecCCHHHH-HHHHHHHhhhC---C-EE-EEEecCcccccc Q lcl|NC_019918. 154 DWSLKVSSNLTMAAAPSTEGWPATITAVQGENDEWYALSIDSHADDDI-MAVATHIEGTK---K-VF-IGATAQANTKTS 227 (428) Q Consensus 154 ~~~~~~s~~~~~~~~~aa~~~~~al~~~~~~~~~w~~~~~~~~~~~~~-~ala~~~~a~~---~-~~-~~~~~~~~~~~~ 227 (428) .. .....+.+.++++++.++. ...+++....++.+ .++..|++... + ++ ...... ..... T Consensus 291 ~g-----------~tvt~~dy~~aL~ale~~~--~~~ivvp~t~~~~i~a~l~ahv~~~s~~~~~~ra~igv~g-~~~~~ 356 (581) T protein:vir:76 291 EG-----------DTVTMGDYQNALNKFRDED--EIAIIVAGTGAQPIQALVQQHVSAQSNNKYERRAILGMDG-SVTPV 356 (581) T ss_pred CC-----------CccchHHHHHHHHHHhcCC--eEEEEEecCCChHHHHHHHHHHHHHHhccCCceEEEEeeC-CCCCc Confidence 00 0112335778888887653 33333444444444 44777776542 2 22 111111 11111 Q ss_pred hhHHHHHHHHhcccCceEEEecC------C-------ccchhHHHHHHHHHhccCCCceeeeeeeecCccc--cCCCHHH Q lcl|NC_019918. 228 AENDIASRLVAAGFQRTALIYHP------N-------ADAQFPECAWVGYQLQEQPGSNTWTHKALAAVDA--YRLTPTE 292 (428) Q Consensus 228 ~~~~~~~~l~~~~~~~t~~~y~~------~-------~~~~~~~a~~~~~~~~~~~g~~t~~fk~~~Gv~~--~~~t~t~ 292 (428) ...+.....+..+..|.+.+++. . .+..+..++++|......+ ...+-||.++|+.. 
..++.+| T Consensus 357 ~~~~~~~~a~~~ns~Rvvlv~p~~~~~~g~~~~~~~~lp~~~~AA~vAG~~a~~~~-~~slT~~~i~g~~~~~~~~s~~e 435 (581) T protein:vir:76 357 PSATRIANAQSIKDQRVALISPSSFVYYAPELNREVVLGGQFMAAAVAGKSVSAIA-AMPLTRKVIRGFSGPAEVQRDGE 435 (581) T ss_pred hHHHHHHhhcccCCCcEEEEEcCceEeccccCCcceecchhhhhhhHHhhhhcccc-ccCcccccccccccccccCCHHH Confidence 22222334455577787766532 1 1123345666667766666 45778999998874 4689999 Q ss_pred HHHHHhCCceEEEEEcCce-eeecCEec---CC--chhHHHHHHHHHHHHHHHHHHH-HHHhcCCCCcCHhHHHHHHHHH Q lcl|NC_019918. 293 STNLKNKNVTTFERVGGVN-RTFGGAMA---GG--EWIDVMIFVDWLEARMTERLWF-RMANSKKIPYDAVGATILESEI 365 (428) Q Consensus 293 ~~~l~~~~~n~y~~~~~~~-~~~~G~~~---~G--~~iD~~~~~dwl~~~lq~~l~~-ll~~~~kip~~~~G~~~i~~~i 365 (428) ++.+.++|++.+....+.. .+-+|++. +. +.|-.++-.|.+...+++.+.. .|.. + |=++.|...|++.+ T Consensus 436 ~e~ll~~Gv~~l~~~~~~~v~Iv~gItT~~s~~~~k~i~viR~~D~v~~~vr~~~~~~~fiG--~-~n~~~~r~~ik~~i 512 (581) T protein:vir:76 436 KSRESSEGLMVIEKTPRNLVHVRHGVTTDPTSLHTREWNIIGQQDVMVYRIRDYLDADGLIG--M-PIYDTTIVQVKASA 512 (581) T ss_pred HHHHHhCCeEEEEEecCCeEEEEEeeecCCCCCccceeeehhhhHHHHHHHHHHHhhhcCCC--c-ccChHHHHHHHHHH Confidence 9999999999999766554 34567654 23 4567888899999888888753 3543 3 77889999999999 Q ss_pred HHHHHHHHhcCceecCCceEEEeCchHhCCHHHHhccccCCeEEEEEECceEEEEEEEEEEec Q lcl|NC_019918. 366 RAQLNEGIRVGGLAEAPAPKVFVPDVLSMSPNMRAQRIFEGIEFEARLAGAIHFVHIRGTVTV 428 (428) Q Consensus 366 ~~~~~~~~~~G~I~~g~~~~v~~~~~~~~~~~dra~R~~~~i~~~~~~agaih~v~i~~~v~~ 428 (428) ++.|.+..++|.|......+... .++. 
+..--+.+.+++.-++.+|.++..+.= T Consensus 513 ~~~L~~l~~~g~I~g~~~~~~~~--------~~~~-~d~v~V~i~v~Pv~~ie~I~vt~~~~p 566 (581) T protein:vir:76 513 EAALVWLVDNNIIRGYRNLKARQ--------IERQ-PDVIEVRYEWRPAYPLNYIVVRYSIAP 566 (581) T ss_pred HHHHHHHHhcCcccCcccceeeE--------EecC-CCEEEEEEEEEecccceEEEEEEEEee Confidence 99999999999997543222211 1122 112347889999999999998877765 No 45 >protein:vir:63742 Length: 562 # NCBI annotation: Tsh # Family: family:all:2449 # MgeID: mge:1517 # MgeName: P100 # Cross-refs: genbank:gi:82547629;genbank:GeneID:3783475 Probab=98.30 E-value=1.5e-06 Score=52.45 Aligned_cols=409 Identities=11% Similarity=0.068 Sum_probs=210.2 Q ss_pred CC-------CCCce-EEEeeee-ecccccccccceEEEEcccC-CCccceEEeeCHHHHHhhcCCChHHHHHHHHHHhcC Q lcl|NC_019918. 1 MT-------VLTDV-IDIQISR-ETAAVAQTNFNVPLFIASHT-NFSERARVYNSLKGVAEDFGESDPTYLAAVRYFGQA 70 (428) Q Consensus 1 M~-------~is~i-V~V~i~~-~~~~~~~~~f~~~li~~~~~-~~~~~~~~y~s~~~V~~~fg~~s~eY~aA~~~F~q~ 70 (428) |+ +.+|= |-|.+-- ...+....+.+.+.|+|-.. .++++...+++.++-..-||... .-.+...+|... T Consensus 1 ~~~~~~~~~~~~~Pgv~~~~~~s~~~~~~~~~~~~~~~iG~a~~G~~~~~~~~~~~~~~~~~fg~g~-l~~~i~~a~~~~ 79 (562) T protein:vir:63 1 MAIEIYPRKPVSRPHTEISVDTSGIGGSSSGSEKILCLVGSATGGKPNAVYKVRNYSQAKSVFRSGE-LLDAIERAWNPG 79 (562) T ss_pred CeeeeeCCCcccCCceEEEEecCCCcccCCCCCceEEEEEEeCCCCCceeEEEccHHHHHHHhcCCc-hHHHHHHhcccc Confidence 44 43332 2232222 33456677788888998764 67788999999999999998754 334455556322 Q ss_pred C--cccEEEEEeeec-ccccccchhee-----------------------e--------------------ccccccccc Q lcl|NC_019918. 71 L--KPRSLVIGRRQV-PSATVSVSVVQ-----------------------E--------------------GQSYVLTVN 104 (428) Q Consensus 71 p--~P~~l~igr~~~-~~~~~~~~~~~-----------------------~--------------------~~~~~~~v~ 104 (428) + --.++|+-|-.. ..+..+...+. . +.-+.++.. 
T Consensus 80 ~~~g~~~~~~~rv~~a~~a~~~~~~~~~~a~~~g~~~n~i~v~~~~~~~~~~~~~~v~~~~~~~~ev~~~~g~V~~i~y~ 159 (562) T protein:vir:63 80 EGTGAGDILAMRVEEAKEATFEAEGVKVSSTIYGADANDIQVALEDNTITGTKRLSIVFAKERVNQVYDNLGSIFSIKYK 159 (562) T ss_pred ccCCceEEEEEEcCCCccceeEecceeEEEeecccCCCeEEEEEecCCCCCCcceEEEecCCCcchhhhhccceeeeeee Confidence 1 113455544211 11111110000 0 000000000 Q ss_pred ce----------------ee------------eee--ecccchhhhhhhhhe------eeecccceEEEEeec---cc-- Q lcl|NC_019918. 105 GL----------------PV------------SYV--SHQDDTATLIATGLK------AAYDVTPVVGVTVTD---NE-- 143 (428) Q Consensus 105 g~----------------~~------------s~~--~~~~~~a~~i~a~l~------~a~~~~~~~~~~~tt---~~-- 143 (428) |. .+ .+. .........+..++. +.+.+.+........ .. T Consensus 160 g~~~~~~~~v~~~~~~~~a~~l~~~~g~~~v~~~~L~~g~~~~~~~l~~~in~~~~~~aky~~~~gn~i~~~~~d~~~~~ 239 (562) T protein:vir:63 160 GTEASATFTVAVDPVTFKATKLTLKAGDKTVKEYDLGSGAYAETNVLISDINNLPDFEAKFFPIGDKNLTTDNFDAQIDV 239 (562) T ss_pred cccccceEEEEecCcceeEEEEEeecCCcceeEEEecCCccchhHHHHHhhccccceEEEeeccCCceeeeecccccccc Confidence 00 00 000 000000111111111 111111000000000 00 Q ss_pred ----------------------cceeeeeccccccccccceEEEEeec---cccCHHHHHHHHHhcccCceEEEEecCCH Q lcl|NC_019918. 144 ----------------------DGTLTVASNGDWSLKVSSNLTMAAAP---STEGWPATITAVQGENDEWYALSIDSHAD 198 (428) Q Consensus 144 ----------------------~~~~t~as~~~~~~~~s~~~~~~~~~---aa~~~~~al~~~~~~~~~w~~~~~~~~~~ 198 (428) ...+..............+..++.+. ...+..++++++... +|+.+.+...+. T Consensus 240 ~vkt~~~~v~t~~~d~~~~~~~~~~v~~~~~~~~~la~~~~~~LtGG~dGt~~~~~~~al~ale~~--~~~~i~~~t~d~ 317 (562) T protein:vir:63 240 DIKTKEAYVKAVGGDIEKQTAYNGYVDFEFDRSKEIANFPLTKLTGGDNGTIPESWADKFSYFANE--GGYYLVPLTSKQ 317 (562) T ss_pred chhhhhhhhhhhhhhhhhcccccceeeeeeccccceecccceeeecCCCCCchhhHHHHHHHHHhC--CcEEEEecCCCH Confidence 00000000000000001122222222 234567788888754 466555555555 Q ss_pred HHHHHHHHHHhhh---CCEEEEEecCcccccchhHHHHHHHHhcccCceEEEecCC-----------ccchhHHHHHHHH Q lcl|NC_019918. 
199 DDIMAVATHIEGT---KKVFIGATAQANTKTSAENDIASRLVAAGFQRTALIYHPN-----------ADAQFPECAWVGY 264 (428) Q Consensus 199 ~~~~ala~~~~a~---~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~t~~~y~~~-----------~~~~~~~a~~~~~ 264 (428) +-+.++..|++.. ++....+..... ..+...+....+..+++|.+.+.+.- .+....+++++|. T Consensus 318 av~~~l~a~vkr~~~~g~~~~aVlg~~~--~~~~~~~~~~a~~~n~ervv~v~~~~~~~~~~~~~~~~~~~~~aa~vAGl 395 (562) T protein:vir:63 318 AVHAEALQFVRDCSYNGNPMRVFVGGGI--GESMEQLFTRAIGLQNERAGLIGFSGTVKMDDGRSLKMPGYMFAAQVAGL 395 (562) T ss_pred HHHHHHHHHHHHHHhCCCcEEEEecCCC--CCCHHHHHHHhhhcCCCcEEEEecCeeEECCCCceeeechhHHHHHHHHH Confidence 5567899999643 233333222111 12334455566667888887765431 1123345677777 Q ss_pred HhccCCCceeeeeeeecCccc-cCCCHHHHHHHHhCCceEEEEEcCcee----eecCEecCC-----c--hhHHHHHHHH Q lcl|NC_019918. 265 QLQEQPGSNTWTHKALAAVDA-YRLTPTESTNLKNKNVTTFERVGGVNR----TFGGAMAGG-----E--WIDVMIFVDW 332 (428) Q Consensus 265 ~~~~~~g~~t~~fk~~~Gv~~-~~~t~t~~~~l~~~~~n~y~~~~~~~~----~~~G~~~~G-----~--~iD~~~~~dw 332 (428) .....+. ..+.||.++++.. ..++.+|++.+.++|++.+....+... .-++.+.-+ . .|-+++-.|. T Consensus 396 ~A~~~~~-~SlT~~~i~~~~v~~~~t~~e~~~li~~Gv~~l~~~~~~~v~~~~iv~~itT~t~~~~~~~~ki~viRv~D~ 474 (562) T protein:vir:63 396 TCGLEIG-EAITFKNIAIETLDTIYEGSQLDQLNESGIITAEFVRNRAVTNFRIVDDVTTFNDKTDPVKSEIGVGEANDF 474 (562) T ss_pred hhcCchh-cCccceeeccccccccCCHHHHHHHHhCCeEEEEEecCCcEEEEEeeccceecCCCCCchhhhhhhhHHHHH Confidence 7766653 4455566654332 469999999999999999987654432 123443321 2 4667888888 Q ss_pred HHHHHHHHHHHHHHhcCCCCcCHhHHHHHHHHHHHHHHHHHhcCceecCCceEEEeCchHhCCHHHHhccccCCeEEEEE Q lcl|NC_019918. 333 LEARMTERLWFRMANSKKIPYDAVGATILESEIRAQLNEGIRVGGLAEAPAPKVFVPDVLSMSPNMRAQRIFEGIEFEAR 412 (428) Q Consensus 333 l~~~lq~~l~~ll~~~~kip~~~~G~~~i~~~i~~~~~~~~~~G~I~~g~~~~v~~~~~~~~~~~dra~R~~~~i~~~~~ 412 (428) +...++..+-+.++- | |=++.|...|++.|++.|++..+.|.|.....-.+... ..+|+ --+.+.+. 
T Consensus 475 i~~dir~~~~~~yiG--k-~Nn~~~r~~v~~~i~~~L~~l~~~gaI~~~~~~dv~v~-----~~~d~-----~~v~~~v~ 541 (562) T protein:vir:63 475 LVSELKISLDNEYIG--T-KIIDTSASLVKNFVQSFLDRKKLAKEIQDYSPEEVQVV-----IEGDV-----ARISLTVF 541 (562) T ss_pred HHHHHHHHHHhcCCc--c-ccChHHHHHHHHHHHHHHHHHHhCCcccCCCccceEEE-----ecCCE-----EEEEEEEE Confidence 888777766655544 4 66889999999999999999999999964321112111 11222 24688899 Q ss_pred ECceEEEEEEEEEEec Q lcl|NC_019918. 413 LAGAIHFVHIRGTVTV 428 (428) Q Consensus 413 ~agaih~v~i~~~v~~ 428 (428) +.-++|+|.+++++.- T Consensus 542 pv~~mekIy~ti~~~~ 557 (562) T protein:vir:63 542 PIRSMKKIEVSLVYRQ 557 (562) T ss_pred EcccceEEEEEEEEee Confidence 9999999999988877 No 46 >protein:vir:5833 Length: 742 # NCBI annotation: similar to tail sheath protein # Family: family:all:661 # MgeID: mge:123 # MgeName: RM 378 # Cross-refs: genbank:acc:NP_835619;genbank:gi:30044022 Probab=98.25 E-value=2.1e-06 Score=51.72 Aligned_cols=387 Identities=12% Similarity=0.058 Sum_probs=166.0 Q ss_pred CCCC--CceEEEeeeeec--cc---ccccccceEEEEcccCCCccc---------------eEEeeCHHHHHhhcCCChH Q lcl|NC_019918. 1 MTVL--TDVIDIQISRET--AA---VAQTNFNVPLFIASHTNFSER---------------ARVYNSLKGVAEDFGESDP 58 (428) Q Consensus 1 M~~i--s~iV~V~i~~~~--~~---~~~~~f~~~li~~~~~~~~~~---------------~~~y~s~~~V~~~fg~~s~ 58 (428) .++. +..+.|.-+... .+ ...-+|..+.-+++...+... -..+...+.+...=|.. T Consensus 308 ~~n~~~~~~~~~~~~~~~~~~~~s~~~~~~~~~~~~v~d~~~~~~~~~~v~~~~t~~~~~pp~~~~~~e~v~~ngG~~-- 385 (742) T protein:vir:58 308 YPNQVPFLRVVVSQDIKQNVAGVEKWVPVGFEGIYSVGDFTVIVNELTNVSIPVTDSAIIPPMRFTRIEQITLSGGAS-- 385 (742) T ss_pred ccccccceeeEeccccCcCccceeEEEeccccccccccceeeeccccccceeeccccccCCcccccccceeecccCcc-- Confidence 2211 111112111100 00 001122222222221111000 01111111111000000 Q ss_pred HHHHHHHHHhcCCcccEEEEEeeecccccccchheeecccccccccceeeeeeecccchhhhhhhhheee-ecccce--E Q lcl|NC_019918. 
59 TYLAAVRYFGQALKPRSLVIGRRQVPSATVSVSVVQEGQSYVLTVNGLPVSYVSHQDDTATLIATGLKAA-YDVTPV--V 135 (428) Q Consensus 59 eY~aA~~~F~q~p~P~~l~igr~~~~~~~~~~~~~~~~~~~~~~v~g~~~s~~~~~~~~a~~i~a~l~~a-~~~~~~--~ 135 (428) +..+ +..|..-.+.-++......+... ...+.|.+..... .......... ...... . T Consensus 386 -f~v~----s~~~~g~~i~~~~as~~~s~ln~---------~~~V~Gt~aa~~~------~d~~t~~~v~s~~~alp~~a 445 (742) T protein:vir:58 386 -FSVI----SNQPYGFNIQDSRHSYWLSPFKD---------DELIIGTELVLPA------LDVSTEFGVSSWEEALPEFS 445 (742) T ss_pred -eEEE----EecccCcceeccCcceEEeccCC---------ceEEEeehhhccc------cccchheeccccccccceee Confidence 0000 00000000000100000000000 0000011100000 0000000000 000000 0 Q ss_pred -EEEeeccccceeeeeccccccccccceEEEEeeccccCHHHHHHHHHhcccCceEEEEecCCHHH-HHHHHHHHhhhCC Q lcl|NC_019918. 136 -GVTVTDNEDGTLTVASNGDWSLKVSSNLTMAAAPSTEGWPATITAVQGENDEWYALSIDSHADDD-IMAVATHIEGTKK 213 (428) Q Consensus 136 -~~~~tt~~~~~~t~as~~~~~~~~s~~~~~~~~~aa~~~~~al~~~~~~~~~w~~~~~~~~~~~~-~~ala~~~~a~~~ 213 (428) ......+.+.......... ............. .-.+.+.++.+. .+.-.+.+.+.+..+ ..++.+.++..++ T Consensus 446 ~sv~laGG~dg~v~v~~~~~---D~iG~~~~~d~~~--adrTGL~ALlev-~eVtILiAPG~t~~~v~aav~A~la~a~~ 519 (742) T protein:vir:58 446 FLMPFQGGSDGYIRVDENEP---DTIGRVKITPALL--ANYERLLPLLTE-DQFDLVLTPYLTFADHAGTVNAFINRAEN 519 (742) T ss_pred EEEeecCCccccccccCCCc---ccccccccccccc--cchhHHHHhhhc-CCCcEEEEcCCCchHHHHHHHHHHHhhcC Confidence 0000111111110000000 0000000000000 111233333322 122233443444333 3455555554333 Q ss_pred EEEEEecCcccccchhHHHHHHHHhcccCceEEEecC----C--c-cchhHHHHHHHHHhccCCCceeee---eeeecCc Q lcl|NC_019918. 214 VFIGATAQANTKTSAENDIASRLVAAGFQRTALIYHP----N--A-DAQFPECAWVGYQLQEQPGSNTWT---HKALAAV 283 (428) Q Consensus 214 ~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~t~~~y~~----~--~-~~~~~~a~~~~~~~~~~~g~~t~~---fk~~~Gv 283 (428) +++... +........+.........+..|.++.|.- . . 
--..+.++++|.+...+.-.--|+ .|.+ + T Consensus 520 Rl~vL~-D~P~~~tt~~~A~a~r~~~nSsraaly~PwVkv~d~~~~r~vPpSgaIAGL~ARtD~erGvw~SPANrgi--i 596 (742) T protein:vir:58 520 RFLYLF-DIAGDDDTENLAISLAGYINSSFATTFFPWVRRLTNKGMRTVPASLAAYRSIRTTDPETGLAPVGARRGV--V 596 (742) T ss_pred CeEEEE-ecCCCCchHHHHHHHHhccCCceEEEEeceeeeccCCcceeechHHHHHHHHHHhccCCceEecCCccee--e Confidence 433322 111111111222233333445555544421 0 0 112356777777765554221121 2222 2 Q ss_pred cccCCCHHHHHHHHhCCceEEEEEcCceeeecCEecCC-----chhHHHHHHHHHHHHHHHHHHHHHHhcCCCCcCHhHH Q lcl|NC_019918. 284 DAYRLTPTESTNLKNKNVTTFERVGGVNRTFGGAMAGG-----EWIDVMIFVDWLEARMTERLWFRMANSKKIPYDAVGA 358 (428) Q Consensus 284 ~~~~~t~t~~~~l~~~~~n~y~~~~~~~~~~~G~~~~G-----~~iD~~~~~dwl~~~lq~~l~~ll~~~~kip~~~~G~ 358 (428) .....+++|++.|..+++|++.++++-..++.++++.+ .||-+.+-.+|+...|+..+...+-. |.|+.-. T Consensus 597 ~~~~~s~se~d~LN~~GINtIrsfG~G~rlWGnRTlassDs~wryInVRRlfd~Ie~SI~~a~q~~VfE----PNd~~L~ 672 (742) T protein:vir:58 597 TGEPVRQVDWEDLYNNRINPIVRVGNDVLLFGQKTMLNVNSALNRINVRRLLIVMRNRISQILSSYLFE----NNTSENR 672 (742) T ss_pred eccccchhhHHHHhhCCceEEEECCCcEEEEcceecCCCCcccceEeehhhHHHHHHHHHHHHHHhccC----CCCHHHH Confidence 23356789999999999999988754456788888755 46888888999998888887765433 7788999 Q ss_pred HHHHHHHHHHHHHHHhcCceecCCceEEEeCchHhCCHHHHhccccCCeEEEEEECceEEEEEEEEEEec Q lcl|NC_019918. 359 TILESEIRAQLNEGIRVGGLAEAPAPKVFVPDVLSMSPNMRAQRIFEGIEFEARLAGAIHFVHIRGTVTV 428 (428) Q Consensus 359 ~~i~~~i~~~~~~~~~~G~I~~g~~~~v~~~~~~~~~~~dra~R~~~~i~~~~~~agaih~v~i~~~v~~ 428 (428) ..|+..|+.-|+..+++|.|. ||.|... ++.+++|+.+.+.. 
+.+.+.+...++.|+++..++- T Consensus 673 ~sIk~sInafL~~L~aqGALl---GfrV~lD--etNTpeDI~~Gklv-v~I~vAP~~PAEfI~lrf~it~ 736 (742) T protein:vir:58 673 LRAEALVRQYLESLRLRGAVT---DYEVAID--SVTTPTDIDNNTLR-ARVTVQPARSIEYIDITFVITP 736 (742) T ss_pred HHHHHHHHHHHHHHHhCCcee---eeEEEEc--CCCCHHHhhCCEEE-EEEEEEccCCcceEEEEEEEEe Confidence 999999999999999999997 5889886 35788999988876 9999999999999998877766 No 47 >protein:vir:102819 Length: 648 # NCBI annotation: tail sheath protein # Family: family:all:2449 # MgeID: mge:1610 # MgeName: YS40 # Cross-refs: genbank:acc:YP_874082;genbank:gi:118197689;genbank:GeneID:4496011 Probab=98.08 E-value=5e-06 Score=49.63 Aligned_cols=408 Identities=14% Similarity=0.077 Sum_probs=171.1 Q ss_pred CC-C---CCce-EEEeeeeecccccccccceEEEEcccC---CCccceEEeeCHHHHHhhcCC-------------ChHH Q lcl|NC_019918. 1 MT-V---LTDV-IDIQISRETAAVAQTNFNVPLFIASHT---NFSERARVYNSLKGVAEDFGE-------------SDPT 59 (428) Q Consensus 1 M~-~---is~i-V~V~i~~~~~~~~~~~f~~~li~~~~~---~~~~~~~~y~s~~~V~~~fg~-------------~s~e 59 (428) .. + -+++ +++......-......+...+-..... .+.+....+. .+.+...-.. .... T Consensus 137 ~~~~~~~~d~~v~~i~~~~~~y~gt~~~~t~~v~~~~~~~~~~~~~~~~~~~-~~~v~~~~~~~~~~~~~~v~~~~~~~~ 215 (648) T protein:vir:10 137 FTSANEADDTIIFTIYQKHPDFSVTRETFTFPRKFTTPTVLVKRGSTLFFVD-RSIVNAALAAGPAFQTALINLLKEQLQ 215 (648) T ss_pred ecCCCcccceeEEEeccCCCcccccceeccccccccccccccccccceeecC-ccchhhhhccCccchhhhhhchhhhhh Confidence 11 1 2333 222111111112222222222111111 0011111111 1222111000 1111 Q ss_pred HHHHHHHHhcCCc-ccEEEEEeeecccccccchheeec-ccccccccc-----------eeeeeeecccchhhhhhhhhe Q lcl|NC_019918. 60 YLAAVRYFGQALK-PRSLVIGRRQVPSATVSVSVVQEG-QSYVLTVNG-----------LPVSYVSHQDDTATLIATGLK 126 (428) Q Consensus 60 Y~aA~~~F~q~p~-P~~l~igr~~~~~~~~~~~~~~~~-~~~~~~v~g-----------~~~s~~~~~~~~a~~i~a~l~ 126 (428) |......|.-++. |..+-.+....+.. ........ ...+.++.| ..++.+.......-...+.+. 
T Consensus 216 ~~~~~~~~~~s~~~~~d~~~~~~~~~a~--~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~~tp~~~~~~~~~~~~~~ 293 (648) T protein:vir:10 216 PTDVVQIFDASDTNPVDIPLGLFVYEVL--YGGLFGFTKSRLVKTSFGTVDDLLSNPLLFNLSATPFFDGSDYQDYTSLS 293 (648) T ss_pred hhhhheeccccccccccccccccccccc--chhhhcCCcchhhhhhhccccccccccceecccccccccccceeeeeccc Confidence 1111222221110 11010000000000 00000000 000011111 001100000000000000000 Q ss_pred eeecc------cceEEEEeec---cccceeeeeccccccccccceEEEEeeccccCHHHHHHHHHhcccCceEEEE---- Q lcl|NC_019918. 127 AAYDV------TPVVGVTVTD---NEDGTLTVASNGDWSLKVSSNLTMAAAPSTEGWPATITAVQGENDEWYALSI---- 193 (428) Q Consensus 127 ~a~~~------~~~~~~~~tt---~~~~~~t~as~~~~~~~~s~~~~~~~~~aa~~~~~al~~~~~~~~~w~~~~~---- 193 (428) ...+. ........+. .....++..+++.-+....+-..-.......++.++++.+++....| ++. T Consensus 294 ~~~~~~~v~~~~~~~l~~~~~~p~~~~~~~t~L~GGtdG~~p~s~~~~~~~~~~~d~~d~l~~~~~~~~~~--ivp~~~~ 371 (648) T protein:vir:10 294 DPANWFAKDAYTINHLVDTTINPHILATRIFSLSGGTNGDDGTGYYQTAVSNYINIWSQGLATLEEEEVNF--VIPAYKF 371 (648) T ss_pred cccceeeeeccchhhcccccccCcccccccceecccccCCCcccccccccccchhhHHHHhhhccCCCceE--EEeeccc Confidence 00000 0000000000 00001111111111111111111112223455778888877654333 222 Q ss_pred ---ec----CCHHH--HHHHHHHHhhhC--C-------EEEEEecCcccccchhHHHHHHHHhcccCceEE--------- Q lcl|NC_019918. 194 ---DS----HADDD--IMAVATHIEGTK--K-------VFIGATAQANTKTSAENDIASRLVAAGFQRTAL--------- 246 (428) Q Consensus 194 ---~~----~~~~~--~~ala~~~~a~~--~-------~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~t~~--------- 246 (428) +. .++++ +.++-.|+.... + ++..............+-+..... .+..|... 
T Consensus 372 ~~~~~~~~~lt~~q~i~a~a~shv~~~s~~~~~~~r~~~~~~vg~~~~es~~~se~~~~~~~-~~~~~a~~~~~d~~~~~ 450 (648) T protein:vir:10 372 TNVTQLNDRLTIFKGIASTFLSHVQTMSQVNRRKARVGVFGLPAPSPNESVTASEYLYNRNI-LNTISAMFGGTDRAQAV 450 (648) T ss_pred ccccccccccCCccchHHHHHHHHHHhhhccccccccCeEEEeCCCCchhHHHHHHHhhhhc-ccccceeeeecCCceEE Confidence 11 12222 222334554321 1 222222221221111111111111 11112111 Q ss_pred -------EecCCc-----cchhHHHHHHHHHhccCCCceeeeeeeecCcc--c-cCCCHHHHHHHHhCCceEEEEEcCce Q lcl|NC_019918. 247 -------IYHPNA-----DAQFPECAWVGYQLQEQPGSNTWTHKALAAVD--A-YRLTPTESTNLKNKNVTTFERVGGVN 311 (428) Q Consensus 247 -------~y~~~~-----~~~~~~a~~~~~~~~~~~g~~t~~fk~~~Gv~--~-~~~t~t~~~~l~~~~~n~y~~~~~~~ 311 (428) .|+++. ...+.+++++|......++ ...-||.+.++. + ..++++|++.|.++|+.++....+.+ T Consensus 451 ~~~~~~~~~~~~G~~~~~p~~~~Aa~VAGl~a~l~~~-~s~T~k~i~~~~id~~~~~t~~qld~L~~~Gv~~ie~~~~~~ 529 (648) T protein:vir:10 451 VFPFYSNVFNDEGKVELLGGEFFASYVAGMHANREPQ-DSITFLPISGIGAEPLYNWTYTQKDDLISNRVLFVEKVKTSF 529 (648) T ss_pred eecccceeECCCCcEEecchhhHHHHHHhhhhccccc-cCcccceeeccccccccCCCHHHHHHHhcCCcEEEEEecCCc Confidence 122221 2344567888888887775 556777776553 3 47899999999999999998876542 Q ss_pred -----eeecCEecCCc-------hhHHHHHHHHHHHHHHHHHHHHHHhcCCCCcCHhHHHHHHHHHHHHHHHHHhcCcee Q lcl|NC_019918. 312 -----RTFGGAMAGGE-------WIDVMIFVDWLEARMTERLWFRMANSKKIPYDAVGATILESEIRAQLNEGIRVGGLA 379 (428) Q Consensus 312 -----~~~~G~~~~G~-------~iD~~~~~dwl~~~lq~~l~~ll~~~~kip~~~~G~~~i~~~i~~~~~~~~~~G~I~ 379 (428) .+-.|++..+. -|-+.+-.|.+...++..+.+.|+-. |=++.....|++.+.+-|.+-++.+-|. T Consensus 530 ~~~~~rvv~gITT~~~~~~~~~~eisv~ri~D~l~~~vr~~l~~~fIG~---~n~~~~~~~ik~~i~~~L~~~~~~~~I~ 606 (648) T protein:vir:10 530 GGIVYRIHHNPTTWLGPVTQGFQEFVLRRIDDFLQSYVYKNLQEQFIGR---KSYGRKTENDIKVYTEALLSNLVGKQIV 606 (648) T ss_pred ceeeEEEeccceeecCCCCcceeeeeeeehhhHHHHHHHHHHhhhcCcc---cccHHHHHHHHHHHHHHHhhHhhcCccc Confidence 13457777662 56788899999999999999988874 5567789999999999999888888887 Q ss_pred cCCceEEEeCchHhCCHHHHhccccCCeEEEEEECceEEEEEEEEEEec Q lcl|NC_019918. 
380 EAPAPKVFVPDVLSMSPNMRAQRIFEGIEFEARLAGAIHFVHIRGTVTV 428 (428) Q Consensus 380 ~g~~~~v~~~~~~~~~~~dra~R~~~~i~~~~~~agaih~v~i~~~v~~ 428 (428) +-...++++.. ..|| --|.|.+.+..+|+.|.+++.|+- T Consensus 607 ~y~~~~v~~~~-----~~~v-----v~V~~~v~Pv~~i~~I~vti~it~ 645 (648) T protein:vir:10 607 AYKDVKVTSNE-----DKTV-----YYVEFFYQPVTEIKFILVTMKVTF 645 (648) T ss_pred CcccceEEEEe-----cCCE-----EEEEEEEEecceeeEEEEEEEEEe Confidence 65555666532 1133 369999999999999999888888 No 48 >protein:vir:106984 Length: 743 # NCBI annotation: contractile sheath protein gp18 # Family: family:all:661 # MgeID: mge:1459 # MgeName: S-PM2 # Cross-refs: genbank:acc:YP_195136;genbank:gi:58532913;uniprot:Q5GQN6;genbank:GeneID:3260481 Probab=98.06 E-value=5.5e-06 Score=49.41 Aligned_cols=414 Identities=13% Similarity=0.064 Sum_probs=177.6 Q ss_pred CCCCCce-EE----------EeeeeecccccccccceEEE-----------EcccCC-CccceEEeeCHH---------H Q lcl|NC_019918. 1 MTVLTDV-ID----------IQISRETAAVAQTNFNVPLF-----------IASHTN-FSERARVYNSLK---------G 48 (428) Q Consensus 1 M~~is~i-V~----------V~i~~~~~~~~~~~f~~~li-----------~~~~~~-~~~~~~~y~s~~---------~ 48 (428) |.+.... +. +++.+... ....+..+- .+.... ........+... . T Consensus 216 ~~~~~~~~~~~~~~~~~~~~~tv~v~~~---~~~vg~~v~~~~~~~~~~~~~~~~~~v~~~~~~~~t~~~~~~~~~~~g~ 292 (743) T protein:vir:10 216 RTPGTYSNVPASGGTGTGATFNVVVADA---GGGVGGSVVVTLANPGTGYNQGETLTIASAATGDGTDILVTVATLSDGT 292 (743) T ss_pred ccccceeeEEeccccccccccccccccc---ccccccccccccccccceeeeccccccccccccccccchhheecccccc Confidence 3321110 00 00000000 000000000 000000 000000000000 0 Q ss_pred H--Hh-hcCCChHHHHHHHHHHhc-CCcccEEEEEeeec-ccccccchheeecccccccccc-e--eeeeeecccchhhh Q lcl|NC_019918. 49 V--AE-DFGESDPTYLAAVRYFGQ-ALKPRSLVIGRRQV-PSATVSVSVVQEGQSYVLTVNG-L--PVSYVSHQDDTATL 120 (428) Q Consensus 49 V--~~-~fg~~s~eY~aA~~~F~q-~p~P~~l~igr~~~-~~~~~~~~~~~~~~~~~~~v~g-~--~~s~~~~~~~~a~~ 120 (428) + .+ .....++.+......+.. .++|..+.++.-.. .........+.. ........+ + .+...+...+.... 
T Consensus 293 ~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~~d~~~v~v~~~-~~~~~~~~~~v~~~~~~~s~~~~~~~~ 371 (743) T protein:vir:10 293 IAITELKDWYLNTEIGSTGIKLGDIGPRPGTSQFATDNGITDDQVHFAVIDT-TGELTGTANTIVERLTYLSKLSDARSE 371 (743) T ss_pred eeeeecccccccchhhccccccccccccceeeeccccccccccceEEEEecC-cceeeeccCceeEEEeeeecccccccc Confidence 0 00 001122333332222222 23344333321000 000000000000 000000000 0 00000000000000 Q ss_pred hhh--hheeeecc-cceEE--------EEeec-c--ccceeeeeccccc-cccccceEEEEeecc-----ccCHHHHHHH Q lcl|NC_019918. 121 IAT--GLKAAYDV-TPVVG--------VTVTD-N--EDGTLTVASNGDW-SLKVSSNLTMAAAPS-----TEGWPATITA 180 (428) Q Consensus 121 i~a--~l~~a~~~-~~~~~--------~~~tt-~--~~~~~t~as~~~~-~~~~s~~~~~~~~~a-----a~~~~~al~~ 180 (428) ... ........ ..... ..... . ............. .........+..+.. .......+.. T Consensus 372 ~~~~~~~~~~~~~~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gG~d~~~~~~~~~~~~~~~ 451 (743) T protein:vir:10 372 ENANIYYKNVINEQSAYLYHGNDAAVQIAASGEAWGQSSDQVLADAGTAFSRTTGYWVNLAGGNDDFAYDAGEFGAAMDL 451 (743) T ss_pred cCcceeecceeccccceeeccCcccceeeeccccCccccceeeeecccccccccceEEEeecCccccccchhHHHHHHHH Confidence 000 00000000 00000 00000 0 0000000000000 000011112222222 1223334444 Q ss_pred HHhcccCc-eEEEEec--C----CHHHHHHHHHHHhhhCCEEEEEecCcccc------------cchhHHHHHHHHhccc Q lcl|NC_019918. 181 VQGENDEW-YALSIDS--H----ADDDIMAVATHIEGTKKVFIGATAQANTK------------TSAENDIASRLVAAGF 241 (428) Q Consensus 181 ~~~~~~~w-~~~~~~~--~----~~~~~~ala~~~~a~~~~~~~~~~~~~~~------------~~~~~~~~~~l~~~~~ 241 (428) +.....-. -.+.+.. . ...-+.++...++...+++.+.-...... ....+.+..+-...+. 
T Consensus 452 ~~~~~~~~~~ll~~p~~~~~~~~~~~v~~a~~~~~~~~~~~~a~~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s 531 (743) T protein:vir:10 452 FLDTEETEIDFVLMGGSMADEADTKSKATKVIAIAASRKDALAFVSPHKGNQIASTGNVALSSAQQKENTIAFFSDLTST 531 (743) T ss_pred hhhccccCcceEEecCcccCccchHHHHHHHHHHHHhhCCeEEEEecCCCccccccccccccccccchHHHHHHHhccCC Confidence 44332211 2233322 1 12334566667776665665543221110 0111222222111222 Q ss_pred CceEEEecC-------Cc---cchhHHHHHHHHHhccCCCc---eeeeeeeecCccc-----cCCCHHHHHHHHhCCceE Q lcl|NC_019918. 242 QRTALIYHP-------NA---DAQFPECAWVGYQLQEQPGS---NTWTHKALAAVDA-----YRLTPTESTNLKNKNVTT 303 (428) Q Consensus 242 ~~t~~~y~~-------~~---~~~~~~a~~~~~~~~~~~g~---~t~~fk~~~Gv~~-----~~~t~t~~~~l~~~~~n~ 303 (428) .+.+ +|++ .. ....+.+.++|.+...+.-+ ....+|.+.||.- -.+++.|++.|..+++|+ T Consensus 532 ~~~~-~~~p~~~~~d~~~~~~~~~p~s~~~AGl~a~~D~~~g~~~span~~~~gi~g~~~~~~~~~~~~~~~Ln~~gIn~ 610 (743) T protein:vir:10 532 SYAV-FDSGYKYVYDRFTDKYRYIPCNGDVAGLCVQTSNQLDDWYSPAGLNRGGILNAVKLAYNPNKADRDELYQNRINP 610 (743) T ss_pred eeEE-EEccceeeeccccCceeEechhHHHHHHHHHhhccCCcEEccCCeeeeeeeccccceecCChhHHHhHhhCCceE Confidence 3333 3332 11 11345677777766555322 3345566666531 247899999999999999 Q ss_pred EEEEcCce-eeecCEecCC-----chhHHHHHHHHHHHHHHHHHHHHHHhcCCCCcCHhHHHHHHHHHHHHHHHHHhcCc Q lcl|NC_019918. 304 FERVGGVN-RTFGGAMAGG-----EWIDVMIFVDWLEARMTERLWFRMANSKKIPYDAVGATILESEIRAQLNEGIRVGG 377 (428) Q Consensus 304 y~~~~~~~-~~~~G~~~~G-----~~iD~~~~~dwl~~~lq~~l~~ll~~~~kip~~~~G~~~i~~~i~~~~~~~~~~G~ 377 (428) +..+.+.+ .++..+++.+ .||-+.+-.+|++..|+..++..+-. |.|+.=...|+..|+.-|++.++.|. 
T Consensus 611 i~~~~~~G~~~wG~rT~~s~d~~~~~i~vrR~~~~i~~si~~~~~~~v~e----~n~~~~~~~i~~~i~~fL~~l~~~ga 686 (743) T protein:vir:10 611 VVSLRGQGITLFGDKTALAAPSAFDRINVRRLFLNLEKRARRLAEGVLFE----QNDATTRAGFSSALNSYLSEVQARRG 686 (743) T ss_pred EEEecCCeEEEEcccccCCCCcccceEeehhhHHHHHHHHHHHHHHhccC----CCCHHHHHHHHHHHHHHHHHHHhcCc Confidence 99988765 5688888765 26778888899999999888765543 66888899999999999999999999 Q ss_pred eecCCceEEEeCchHhCCHHHHhccccCCeEEEEEECceEEEEEEEEEEec Q lcl|NC_019918. 378 LAEAPAPKVFVPDVLSMSPNMRAQRIFEGIEFEARLAGAIHFVHIRGTVTV 428 (428) Q Consensus 378 I~~g~~~~v~~~~~~~~~~~dra~R~~~~i~~~~~~agaih~v~i~~~v~~ 428 (428) |. +|.|... .++.+++|+.+++.. +.+.+++...+++|.++..-.- T Consensus 687 l~---~~~V~~d-~~~nt~~~i~~G~~~-~~i~~~p~~pae~I~~~~~~~~ 732 (743) T protein:vir:10 687 VT---DYLVICD-ESNNTPDIIDRNEFV-AEVYVKPTRSINFITITFTATK 732 (743) T ss_pred ee---eeEEEEc-CCCCCHHHhhCCeEE-EEEEEEecCCcceEEEEEEEee Confidence 84 7899997 678899999999988 9999999999999999876544 No 49 >protein:vir:80984 Length: 666 # NCBI annotation: gp18 tail sheath protein # Family: family:all:661 # MgeID: mge:1888 # MgeName: Phi1 # Cross-refs: genbank:acc:YP_001469499;genbank:gi:157311456;genbank:GeneID:5602117 Probab=98.00 E-value=7.5e-06 Score=48.67 Aligned_cols=412 Identities=10% Similarity=0.052 Sum_probs=164.5 Q ss_pred CC-CCCceEEE----eeeeecccccccccceEE-EEc---ccCCCccceEEeeCHHHH-HhhcCCChHHH---HHHH--- Q lcl|NC_019918. 1 MT-VLTDVIDI----QISRETAAVAQTNFNVPL-FIA---SHTNFSERARVYNSLKGV-AEDFGESDPTY---LAAV--- 64 (428) Q Consensus 1 M~-~is~iV~V----~i~~~~~~~~~~~f~~~l-i~~---~~~~~~~~~~~y~s~~~V-~~~fg~~s~eY---~aA~--- 64 (428) -. .+-.+-.+ .......+. +-...+ +.+ ............ .... ...+....+.. ..+. 
T Consensus 153 ~a~~~~~~~~v~~~~~~~~~~~~~---~~~~a~~V~~~~~~~~~~~~~~~~a--~~~~t~~~~~~~~~~~~~~a~~a~~~ 227 (666) T protein:vir:80 153 HAKAIGVYPELDGDWTAEFTSSSG---NGSAALSVTKIVTDSGLLLTDLETS--RANITNQTFLTKLQKYDMPAVSAIYA 227 (666) T ss_pred ccccccccceeeccceeeeccccc---cceeeeeeeeeecCCccceeeeccc--cccccccccccccccccchhhhhhcc Confidence 00 00000000 000110000 000011 100 000000000000 0000 00000000000 0000 Q ss_pred ---------------HHHhcCCcccEEEEEee-ecccccccchheeecccc--cccccceee-eeeecccchhhhhhh-- Q lcl|NC_019918. 65 ---------------RYFGQALKPRSLVIGRR-QVPSATVSVSVVQEGQSY--VLTVNGLPV-SYVSHQDDTATLIAT-- 123 (428) Q Consensus 65 ---------------~~F~q~p~P~~l~igr~-~~~~~~~~~~~~~~~~~~--~~~v~g~~~-s~~~~~~~~a~~i~a-- 123 (428) .+++..|.|..-..+.. ........... .....+ +....|... ++.......+..... T Consensus 228 g~~g~~l~v~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~v~~~g~~~e~~~~~~~~~~~~~~~~~ 306 (666) T protein:vir:80 228 GEIGNSLEVEILARSAFKNTAPDLTMYPYGGERTAARNLIPYAP-QNDNQYAFIVRRDGVVVESYVLSTLKGDKDVYGNS 306 (666) T ss_pred cccccceeeeeccccccccccccceeeeccccccccceeeeecc-ccccceeeEeccCCccceeeecccccccccccchh Confidence 01111111110000000 00000000000 000000 000001000 000000000000000 Q ss_pred -hheeee-cccceEEEEeec----cccceeeeeccccccccccceEEEEeeccccCHHHHHHHHHhcccCceEEEEecC- Q lcl|NC_019918. 124 -GLKAAY-DVTPVVGVTVTD----NEDGTLTVASNGDWSLKVSSNLTMAAAPSTEGWPATITAVQGENDEWYALSIDSH- 196 (428) Q Consensus 124 -~l~~a~-~~~~~~~~~~tt----~~~~~~t~as~~~~~~~~s~~~~~~~~~aa~~~~~al~~~~~~~~~w~~~~~~~~- 196 (428) .+.... +........... ..........+.........................+-++.+. .+...+..... 
T Consensus 307 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~-~~~~~l~~p~~~ 385 (666) T protein:vir:80 307 IYMDDFFGRGSSQYIYATAQGWVDGFSGIISLAGGVSANEATTGGVGADPFIGAMMQGWGLFAERES-IHVNLLIAGACA 385 (666) T ss_pred hhhhhhhccccceeeeecccccccccceEEEecCCCCcccccccccccccccccchhhhhhhhhhcc-cccceEeecCcC Confidence 000000 000000000000 0000011111111100000000000000000011112222222 12333333221 Q ss_pred -----CHHHHHHHHHHHhhhCCEEEEEecCcc-----cccchhHHHHHHHHhc----------ccCceEEEec------C Q lcl|NC_019918. 197 -----ADDDIMAVATHIEGTKKVFIGATAQAN-----TKTSAENDIASRLVAA----------GFQRTALIYH------P 250 (428) Q Consensus 197 -----~~~~~~ala~~~~a~~~~~~~~~~~~~-----~~~~~~~~~~~~l~~~----------~~~~t~~~y~------~ 250 (428) ...-..++...++....++.+.-.... ......+++..+.... +..+.++.|. + T Consensus 386 ~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~vd~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~l~~p~~~~~d~ 465 (666) T protein:vir:80 386 GEGDAFSTVQKHAVSIGDERQDCLVMVSPPRSTVVNIPVTTAIDNLIAWREGSGNYNENNMNINTTYAVIDGNYKYQYDK 465 (666) T ss_pred CcccchHHHHHHHHHHHHhhcceEEEeecceeEEeecCCCCCHHHHHHHHHhcccchhhhcccCcceEEEEcCceEEecc Confidence 122234556666665545433221110 0111233333333221 1223333221 1 Q ss_pred Ccc---chhHHHHHHHHHhccCCCc---eeeeeeeecCcc-----ccCCCHHHHHHHHhCCceEEEEEcCce-eeecCEe Q lcl|NC_019918. 251 NAD---AQFPECAWVGYQLQEQPGS---NTWTHKALAAVD-----AYRLTPTESTNLKNKNVTTFERVGGVN-RTFGGAM 318 (428) Q Consensus 251 ~~~---~~~~~a~~~~~~~~~~~g~---~t~~fk~~~Gv~-----~~~~t~t~~~~l~~~~~n~y~~~~~~~-~~~~G~~ 318 (428) ... ...+.+.++|.+...+.-+ ..-..|.+.|+. 
.-.+++.|.+.|..+|+|++.++.+.+ .++.++| T Consensus 466 ~~~~~~~~p~sg~~AGl~Ar~D~~~g~~~sPan~~~~~i~g~~~~~~~~~~~~~~~Ln~~gIn~i~~~~g~G~~~wG~rT 545 (666) T protein:vir:80 466 YNDVNRWVPLAADIAGLCARTDAVSQPWMSPAGYNRGQIMNVVKLAIEPRKAHRDRLYQAAINPVIGAGGEGFILMGDKT 545 (666) T ss_pred cCCceeEechHHHHHHHHHHHhhcCCceEccCCeecceeeccccceeecChhHHHhhhhCCeeEEEEeCCCeEEEEcccc Confidence 111 1235677777766554322 222345544443 135789999999999999999998865 5689998 Q ss_pred cCCc-----hhHHHHHHHHHHHHHHHHHHHHHHhcCCCCcCHhHHHHHHHHHHHHHHHHHhcCceecCCceEEEeCchHh Q lcl|NC_019918. 319 AGGE-----WIDVMIFVDWLEARMTERLWFRMANSKKIPYDAVGATILESEIRAQLNEGIRVGGLAEAPAPKVFVPDVLS 393 (428) Q Consensus 319 ~~G~-----~iD~~~~~dwl~~~lq~~l~~ll~~~~kip~~~~G~~~i~~~i~~~~~~~~~~G~I~~g~~~~v~~~~~~~ 393 (428) +++. ||-+.+-.+|+...|+..++..+-. |.|+.=...|+..|+.-|++.+++|.|. ||.|... .++ T Consensus 546 ~~~~~s~~~~i~vRRl~~~i~~si~~~~~~~v~e----pn~~~l~~~i~~~i~~~L~~l~~~gal~---g~~V~~d-~~~ 617 (666) T protein:vir:80 546 ATTVPSPFDRINVRRLFNMLKKNIGDSSKYKLFE----NNDNFTRASFRMEVSQYLSTIRSLGGIY---DFRVQCD-TTN 617 (666) T ss_pred CCCCCcccceeehhhHHHHHHHHHHHHHHHhccC----CCCHHHHHHHHHHHHHHHHHHHhcCcee---eeEEEEc-CCC Confidence 8763 5777788889888888887765443 6788889999999999999999999997 5899987 678 Q ss_pred CCHHHHhccccCCeEEEEEECceEEEEEEEEEEec Q lcl|NC_019918. 394 MSPNMRAQRIFEGIEFEARLAGAIHFVHIRGTVTV 428 (428) Q Consensus 394 ~~~~dra~R~~~~i~~~~~~agaih~v~i~~~v~~ 428 (428) .+++|+.+++.. 
+.+.+++...+++|.++..-.= T Consensus 618 nt~~di~~G~~~-~~i~~~P~~Pae~I~~~~~~~~ 651 (666) T protein:vir:80 618 NTPDVIDRNEFV-ASMFIKPAKSINYIMLNFTAVA 651 (666) T ss_pred CCHHHhhCCeEE-EEEEEEecCCcceEEEEEEEee Confidence 899999999986 9999999999999999866443 No 50 >protein:vir:6894 Length: 660 # NCBI annotation: gp18 tail sheath protein # Family: family:all:661 # MgeID: mge:140 # MgeName: RB69 # Cross-refs: genbank:acc:NP_861870;genbank:gi:32453661;genbank:GeneID:1494296 Probab=97.99 E-value=8e-06 Score=48.53 Aligned_cols=401 Identities=12% Similarity=0.078 Sum_probs=163.1 Q ss_pred CCCCCceEEEeeeeecccccccccceEEE-EcccCCC----ccceEEeeCHHHHHhhcCCCh-----------HHH---H Q lcl|NC_019918. 1 MTVLTDVIDIQISRETAAVAQTNFNVPLF-IASHTNF----SERARVYNSLKGVAEDFGESD-----------PTY---L 61 (428) Q Consensus 1 M~~is~iV~V~i~~~~~~~~~~~f~~~li-~~~~~~~----~~~~~~y~s~~~V~~~fg~~s-----------~eY---~ 61 (428) +..-.......+.+....... +....+. ..+.... ...+..+....-+....|... +.+ . T Consensus 171 v~~~~~~~~~~~~v~~~~~d~-~~~~~~~~ta~~~~~~~~~~~~~~~~~~~~~~A~~~g~~G~~i~v~~~~~a~~~~~~~ 249 (660) T protein:vir:68 171 MSGSSSGLSAVITIDSVVMDS-GILLTEVETSEEAITSLTFQESIKKYGVPGVVALYPGELGDQLEIEIVSKADYDKGAS 249 (660) T ss_pred eecccccceeeeeeccccccc-cceeeeeccccccccccceeeeecccCccccccccccccccceEEEEecccccccccc Confidence 000000000011110000000 0000000 0000000 000000000000000000000 000 0 Q ss_pred HHHHH-HhcCCccc--EEEEEeeecccccccchheeecccccccccceee-eeeecc-cchhhhhhhhhe--ee-ecc-c Q lcl|NC_019918. 62 AAVRY-FGQALKPR--SLVIGRRQVPSATVSVSVVQEGQSYVLTVNGLPV-SYVSHQ-DDTATLIATGLK--AA-YDV-T 132 (428) Q Consensus 62 aA~~~-F~q~p~P~--~l~igr~~~~~~~~~~~~~~~~~~~~~~v~g~~~-s~~~~~-~~~a~~i~a~l~--~a-~~~-~ 132 (428) .+... ....++|. ..+++.-..++.... ......+... ...... .+.......... .. .+. . 
T Consensus 250 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---------~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 320 (660) T protein:vir:68 250 AQLKIYPDGGTRYSTAKAIFGYGPQTDDQYA---------IIVRRNDSVVQSVVLSTKRGERDIYGSNIFIDDFFAKGAS 320 (660) T ss_pred ccceeeecccccccceeeEeeccccccccee---------eeeecCCcceeeeeeecccccccccccceeeehhhccCcc Confidence 00000 00111111 111110000000000 0000000000 000000 000000000000 00 000 0 Q ss_pred ceEEEEeeccccceeeeeccccccccccceEEEEeeccccCHHHHHHHHHhccc-CceEEEEe---cCCHHHH----HHH Q lcl|NC_019918. 133 PVVGVTVTDNEDGTLTVASNGDWSLKVSSNLTMAAAPSTEGWPATITAVQGEND-EWYALSID---SHADDDI----MAV 204 (428) Q Consensus 133 ~~~~~~~tt~~~~~~t~as~~~~~~~~s~~~~~~~~~aa~~~~~al~~~~~~~~-~w~~~~~~---~~~~~~~----~al 204 (428) .............. ........+..-.......+...++..+..... ....+.+. ..+.++. .+| T Consensus 321 ~~v~~~~~~~~~~~-------~~~~~~~gg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~v~~~l 393 (660) T protein:vir:68 321 NYIFATAQGWPKGF-------SGVIKLNGGLSSNETVEAGDLMEAWDLFADRESVNAQLFIAGSCAGESLEVASTVQKHV 393 (660) T ss_pred cEEEEeecCCCccc-------cceeeeccccccccccccchhhhHHHHhhhhhccccceeeccccCCCchHHHHHHHHHH Confidence 00000000000000 000000000000001111223333333333221 12223322 1223333 344 Q ss_pred HHHHhhhCCEEEEEecCcc----cc-cchhHHHHHHHHhc----------ccCceEEEecC-------Ccc---chhHHH Q lcl|NC_019918. 205 ATHIEGTKKVFIGATAQAN----TK-TSAENDIASRLVAA----------GFQRTALIYHP-------NAD---AQFPEC 259 (428) Q Consensus 205 a~~~~a~~~~~~~~~~~~~----~~-~~~~~~~~~~l~~~----------~~~~t~~~y~~-------~~~---~~~~~a 259 (428) ...++....+|...-.... .. ....+++...-... +..+.. +|++ ... 
...+.+ T Consensus 394 ~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~-~~~p~~~~~d~~~~~~~~~p~sg 472 (660) T protein:vir:68 394 VAIGDSRQDCLVLCSPPRAAVVGIPVNRAVDNLVDWRTASGTYTDNNFNISSTYAA-IDGNYKYQYDKYNDVNRWVPLAA 472 (660) T ss_pred HHHHHhhCCeEEEEcccceeEecCCCCCCHHHHHHHHhhcccccccccccCcceEE-EEcCceEEecccCCceEEechhH Confidence 4555555545543321110 01 11223333322221 122232 3332 111 123557 Q ss_pred HHHHHHhccCCCc---eeeeeeeecCcc-----ccCCCHHHHHHHHhCCceEEEEEcCce-eeecCEecCCc-----hhH Q lcl|NC_019918. 260 AWVGYQLQEQPGS---NTWTHKALAAVD-----AYRLTPTESTNLKNKNVTTFERVGGVN-RTFGGAMAGGE-----WID 325 (428) Q Consensus 260 ~~~~~~~~~~~g~---~t~~fk~~~Gv~-----~~~~t~t~~~~l~~~~~n~y~~~~~~~-~~~~G~~~~G~-----~iD 325 (428) .++|.+...+.-+ .....|.+.||. .-.+++.|++.|..+|+|+...+.+.+ .++..+|++++ ||- T Consensus 473 ~~AGl~Ar~d~~~g~~~span~~~~~i~g~~~~~~~~~~~~~~~Ln~~gIn~i~~~~~~G~~~wG~rT~~~~~s~~~~i~ 552 (660) T protein:vir:68 473 DIAGLCARTDNISQPWMSPAGYNRGQILNVIKLAIETRQAQRDRLYQEAINPVTGTGGDGYVLYGDKTATSVPSPFDRIN 552 (660) T ss_pred HHHHHHHHHhccCCcEEccCCeeeceeeccceeeecCChhHHHHHhhCCceEEEEecCCeEEEEcceecCCCCcccceEe Confidence 7777666554322 223455555543 124789999999999999999998875 56899988762 566 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcCCCCcCHhHHHHHHHHHHHHHHHHHhcCceecCCceEEEeCchHhCCHHHHhccccC Q lcl|NC_019918. 326 VMIFVDWLEARMTERLWFRMANSKKIPYDAVGATILESEIRAQLNEGIRVGGLAEAPAPKVFVPDVLSMSPNMRAQRIFE 405 (428) Q Consensus 326 ~~~~~dwl~~~lq~~l~~ll~~~~kip~~~~G~~~i~~~i~~~~~~~~~~G~I~~g~~~~v~~~~~~~~~~~dra~R~~~ 405 (428) +.+-.+|+...|+..+...+-. |.++.=...|+..|+.-|++.+++|.|. ||.|.. +.++.+++|+.+++.. T Consensus 553 vrR~~~~i~~si~~~~~~~v~e----pn~~~~~~~i~~~i~~~L~~l~~~gal~---gf~V~~-d~~~nt~~~i~~G~~~ 624 (660) T protein:vir:68 553 VRRLFNMVKTNIGSASKYRLFE----LNNAFTRSSFRTETSQYLQGIKALGGVY---NFKVVC-DTTNNTPAVIDRNEFV 624 (660) T ss_pred hhhHHHHHHHHHHHHHHHhccC----CCCHHHHHHHHHHHHHHHHHHHhcCcee---eeEEEE-ecCCCCHHHhhCCeEE Confidence 7788888888888877764443 6688888999999999999999999997 588987 5778899999999888 Q ss_pred CeEEEEEECceEEEEEEEEEEec Q lcl|NC_019918. 
406 GIEFEARLAGAIHFVHIRGTVTV 428 (428) Q Consensus 406 ~i~~~~~~agaih~v~i~~~v~~ 428 (428) +.+.+.+...+++|.++..-.- T Consensus 625 -~~i~~~p~~pae~i~l~~~~~~ 646 (660) T protein:vir:68 625 -ATFYLQPARSINYITLNFVATA 646 (660) T ss_pred -EEEEEEecCCcceEEEEEEEee Confidence 9999999999999999876654 No 51 >protein:vir:103456 Length: 659 # NCBI annotation: tail sheath monomer # Family: family:all:661 # MgeID: mge:1542 # MgeName: RB32 # Cross-refs: genbank:acc:YP_803108;genbank:gi:116326388;genbank:GeneID:4405485 Probab=97.94 E-value=9.8e-06 Score=48.04 Aligned_cols=408 Identities=11% Similarity=0.053 Sum_probs=169.6 Q ss_pred CCC--CCc---eEEEeeeeecccccc-cccceEEEEcccCCCccceEEeeCHHHHHhhcCCChHHHHHHHHHHhcCCccc Q lcl|NC_019918. 1 MTV--LTD---VIDIQISRETAAVAQ-TNFNVPLFIASHTNFSERARVYNSLKGVAEDFGESDPTYLAAVRYFGQALKPR 74 (428) Q Consensus 1 M~~--is~---iV~V~i~~~~~~~~~-~~f~~~li~~~~~~~~~~~~~y~s~~~V~~~fg~~s~eY~aA~~~F~q~p~P~ 74 (428) .+. .+. +.+....+....... ..+...-+.... ...+........... ..-..+...+...+.+. T Consensus 153 ~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~v~~~~~a~t---~~~~~~~~~~~~~~~v~ 223 (659) T protein:vir:10 153 KAKEVGEYPTLGSNWTAEISSSSSGLAAVITLGKIITDS------GILLAEIENAEAAMT---AVDFQANLKKYGIPGVV 223 (659) T ss_pred ccccccccceeeeeeeeeeeeeccccceeeEEeeeecCC------ceeEEeecccccccc---ccccccceeeccccccc Confidence 111 110 011111111111111 111110010000 000000000000000 00000000000111111 Q ss_pred EEEEEeee---------ccccccc----c----hheeecc--cccccccceeeeeeecccchhhhhhhhheee------- Q lcl|NC_019918. 75 SLVIGRRQ---------VPSATVS----V----SVVQEGQ--SYVLTVNGLPVSYVSHQDDTATLIATGLKAA------- 128 (428) Q Consensus 75 ~l~igr~~---------~~~~~~~----~----~~~~~~~--~~~~~v~g~~~s~~~~~~~~a~~i~a~l~~a------- 128 (428) .+.-|-+. ....... . ....... .......+..-............+....... 
T Consensus 224 a~~~G~~g~~~tv~~~~~a~~~~~~~v~v~~~~~~~~~a~~~t~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~ 303 (659) T protein:vir:10 224 ALYPGELGDKIEIEIVSKADYAKGASALLPIYPGGGTRASTAKAVFGYGPQTDSQYAIIVRRNDAIVQSVVLSTKRGEKD 303 (659) T ss_pred ccccceecccceEEEechhhccccceeeeeeeeecccccccceeeeeeccccccchhhccccccceeeeeeeeccccccc Confidence 11101000 0000000 0 0000000 0000000000000000000000000000000 Q ss_pred ecccce-EEEEeeccccceeeeeccccccccccceEEEEeec------cccCHHHHHHHHHhcc-cCceEEEEecC---C Q lcl|NC_019918. 129 YDVTPV-VGVTVTDNEDGTLTVASNGDWSLKVSSNLTMAAAP------STEGWPATITAVQGEN-DEWYALSIDSH---A 197 (428) Q Consensus 129 ~~~~~~-~~~~~tt~~~~~~t~as~~~~~~~~s~~~~~~~~~------aa~~~~~al~~~~~~~-~~w~~~~~~~~---~ 197 (428) ...... .......+. +.................+.+..+. ........+..+.... .+...+.+... . T Consensus 304 ~~~~~~~~~~~~~~~~-~~~v~~~~~~~~~~~~~~~~l~gg~~~~~~~~~~~~~~~~~~~~~~~~~~~~il~~p~~~~~~ 382 (659) T protein:vir:10 304 IYDSNIYIDDFFAKGG-SEYIFATAQNWPEGFSGILTLSGGLSSNAEVTAGDLMEAWDFFADRESVDVQLFIAGSCAGES 382 (659) T ss_pred cccchhhhhhhhccCc-ccEEEEeecccCCCccceeeecccccccccccchhHHHHHHHhhhccccceeEEEecCCCCcc Confidence 000000 000000000 0000000000000000011111111 1122333444443322 12334444332 1 Q ss_pred HHH----HHHHHHHHhhhCCEEEEEecCccc-----ccchhHHHHHHHHhc----------ccCceEEEecC-------C Q lcl|NC_019918. 198 DDD----IMAVATHIEGTKKVFIGATAQANT-----KTSAENDIASRLVAA----------GFQRTALIYHP-------N 251 (428) Q Consensus 198 ~~~----~~ala~~~~a~~~~~~~~~~~~~~-----~~~~~~~~~~~l~~~----------~~~~t~~~y~~-------~ 251 (428) .++ ..+|...++....++...-..... .....+++....... +..+. .+|++ . 
T Consensus 383 ~~~~~~v~~al~~~~~~~~~~~~~~d~p~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~s~~~-~l~~p~~~~~d~~ 461 (659) T protein:vir:10 383 LETASTVQKHVVSIGDARQDCLVLCSPPRETVVGIPVTRAVDNLVNWRTAAGSYTDNNFNISSTYA-AIDGNYKYQYDKY 461 (659) T ss_pred hhhhHHHHHHHHHHHHhhCCeEEEEcCccccccCCCcccCHHHHHHHHHhcccccccccccCcceE-EEEeCcEEEeccc Confidence 122 344555666666666554322111 112233333333221 12233 33332 1 Q ss_pred cc---chhHHHHHHHHHhccCCCc---eeeeeeeecCcc-----ccCCCHHHHHHHHhCCceEEEEEcCce-eeecCEec Q lcl|NC_019918. 252 AD---AQFPECAWVGYQLQEQPGS---NTWTHKALAAVD-----AYRLTPTESTNLKNKNVTTFERVGGVN-RTFGGAMA 319 (428) Q Consensus 252 ~~---~~~~~a~~~~~~~~~~~g~---~t~~fk~~~Gv~-----~~~~t~t~~~~l~~~~~n~y~~~~~~~-~~~~G~~~ 319 (428) .+ ...|.+.++|.+...+.-+ ....+|.+.|+. ...+++.|++.|..+++|++.++.+.+ .++..+++ T Consensus 462 ~~~~~~~p~sg~~AGl~Ar~D~~~g~~~span~~~~~i~g~~~~~~~~~~~~~~~Ln~~gIn~i~~~~~~G~~~wG~rT~ 541 (659) T protein:vir:10 462 NDVNRWVPLAADIAGLCARTDNVSQTWMSPAGYNRGQILNVIKLAIETRQAQRDRLYQEAINPVTGTGGDGYVLYGDKTA 541 (659) T ss_pred CCceEEechHHHHHHHHHHHhccCCceEccCCceeeeeeccccceecCCHhHHHHHhhCCeeEEEEeCCCeEEEEccccc Confidence 11 1345677777777555422 233444444433 235789999999999999999998765 56888887 Q ss_pred CC-----chhHHHHHHHHHHHHHHHHHHHHHHhcCCCCcCHhHHHHHHHHHHHHHHHHHhcCceecCCceEEEeCchHhC Q lcl|NC_019918. 320 GG-----EWIDVMIFVDWLEARMTERLWFRMANSKKIPYDAVGATILESEIRAQLNEGIRVGGLAEAPAPKVFVPDVLSM 394 (428) Q Consensus 320 ~G-----~~iD~~~~~dwl~~~lq~~l~~ll~~~~kip~~~~G~~~i~~~i~~~~~~~~~~G~I~~g~~~~v~~~~~~~~ 394 (428) ++ .||-+.+-.+|+...|+..+...+-. |.++.=...|+..|+.-|+..++.|.|. +|.|.+.. ++. T Consensus 542 ~~~~s~~~~i~vrR~~~~i~~si~~~~~~~v~e----~n~~~l~~~i~~~i~~fL~~l~~~gal~---~~~V~~d~-~~n 613 (659) T protein:vir:10 542 TSVPSPFDRINVRRLFNMLKTNIGRSSKYRLFE----LNNAFTRSSFRTETAQYLQGIKALGGIY---EYRVVCDT-TNN 613 (659) T ss_pred CCCCcccceEehhhHHHHHHHHHHHHHHHhccC----CCCHHHHHHHHHHHHHHHHHHHhcCcee---eEEEEEcC-CCC Confidence 75 36777788888888888877664433 7788889999999999999999999996 69999874 788 Q ss_pred CHHHHhccccCCeEEEEEECceEEEEEEEEEEec Q lcl|NC_019918. 
395 SPNMRAQRIFEGIEFEARLAGAIHFVHIRGTVTV 428 (428) Q Consensus 395 ~~~dra~R~~~~i~~~~~~agaih~v~i~~~v~~ 428 (428) +++|+.+++.. +.+.+.+...+++|.++..-+- T Consensus 614 t~~~i~~G~~~-~~i~~~p~~pae~i~~~~~~~~ 646 (659) T protein:vir:10 614 TPSVIDRNEFV-ATFYIQPARSINYITLNFVATA 646 (659) T ss_pred CHHHhhCCeEE-EEEEEEecCCcceEEEEEEEEe Confidence 99999999888 9999999999999999877664 No 52 >protein:vir:98824 Length: 774 # NCBI annotation: putative phage tail sheath protein # Family: family:all:661 # MgeID: mge:1530 # MgeName: Ma-LMM01 # Cross-refs: genbank:acc:YP_851105;genbank:gi:117530262;genbank:GeneID:4484488 Probab=97.92 E-value=1.1e-05 Score=47.79 Aligned_cols=411 Identities=12% Similarity=0.035 Sum_probs=188.1 Q ss_pred CCCCCce-EE-----Eeeeeec---ccccc-cccceEEEEccc-CCCccceEEeeCHHHHHhhcC----CChHHHHHHHH Q lcl|NC_019918. 1 MTVLTDV-ID-----IQISRET---AAVAQ-TNFNVPLFIASH-TNFSERARVYNSLKGVAEDFG----ESDPTYLAAVR 65 (428) Q Consensus 1 M~~is~i-V~-----V~i~~~~---~~~~~-~~f~~~li~~~~-~~~~~~~~~y~s~~~V~~~fg----~~s~eY~aA~~ 65 (428) .-|..-| |+ |.|...+ .++.. ..-....++|.. ..|..+-...+|..|....|| --.....+-.. T Consensus 273 ~~~~~~~~~~v~~~GVYVEEVpSGvrtIeGGV~TSVAAFVG~A~rGPvn~PvlITS~aD~~~~Fg~~~GGl~GassA~r~ 352 (774) T protein:vir:98 273 VEPFGEITRNVEDNGVVIQLEPALTGSISNRFSFYVTANDNTANRGFTTSPALVTTIPDPAIHFTSFQGGLDGPRSAFRD 352 (774) T ss_pred cccccceEEEEecCceEEEEeCCCCccccccccceeeeecccccCCCCCcCEEEeehhHhhhhhccccCCccccceeeee Confidence 1121111 11 1111211 12211 233445555543 244445556667777554443 22111111011 Q ss_pred HHhcCCcccEEEE----Eeeecccccccchheeec-----------ccccccccceeeeeeecccchhhhhhhhheeeec Q lcl|NC_019918. 66 YFGQALKPRSLVI----GRRQVPSATVSVSVVQEG-----------QSYVLTVNGLPVSYVSHQDDTATLIATGLKAAYD 130 (428) Q Consensus 66 ~F~q~p~P~~l~i----gr~~~~~~~~~~~~~~~~-----------~~~~~~v~g~~~s~~~~~~~~a~~i~a~l~~a~~ 130 (428) ++...-.|.-... |.|... -.+.....+.+ ..+.....+..++...... 
.....+....+ T Consensus 353 ~~~~sG~~~L~i~A~~pGawGN~-ItV~I~~~t~~~~~l~v~~~~~s~f~~~~a~e~~tv~~~~~----~~~~~v~e~~d 427 (774) T protein:vir:98 353 FYTFNGTPLLRLQAVSEGNWGNQ-VTVSIYPVNNSEFRLNVQDLNGSAFNPPLADEVYTVKLGDT----NESGELNALLD 427 (774) T ss_pred eeeecccceEEEEEeecCcCCCc-eEEEEEecCCceeEEEEEecCCccccccccceeEEEecccc----cccceeeeeec Confidence 1111111111111 111100 00000000000 0000000000000000000 00000000000 Q ss_pred ccceEEEEee-------------cc-------ccceeeeeccc-cccccccce---EEEEeeccc-cCHHHHHHHHHh-- Q lcl|NC_019918. 131 VTPVVGVTVT-------------DN-------EDGTLTVASNG-DWSLKVSSN---LTMAAAPST-EGWPATITAVQG-- 183 (428) Q Consensus 131 ~~~~~~~~~t-------------t~-------~~~~~t~as~~-~~~~~~s~~---~~~~~~~aa-~~~~~al~~~~~-- 183 (428) .......... .. .+......... ......... ..+..+.+. ++..+.+....+ T Consensus 428 n~~i~~~~~~~~~~~in~vs~lv~~~~~~~a~~d~~~~~~~~~~~~~~~~~~~~v~v~lagG~Dg~~tt~~~igg~~~~~ 507 (774) T protein:vir:98 428 SKFIRGFFLPKSIDSINYDAALVRQSPLRLAPPDESETDVENPAHVDFYGPNVLVDVTLENGYDGPPVTNDDYVSIIRTL 507 (774) T ss_pred eeeEeecccccccccccccccccccchhcccccccccccccccccccccCCcceEEEeecCCCCcccccchheecccccc Confidence 0000000000 00 00000000000 000000000 111122211 111122211111 Q ss_pred cccCceEEEEecCCHHHHHHHHHHHhhh----CCEEEEEecCcccccchhHHHHHHHHhcccCceEEEecC------Cc- Q lcl|NC_019918. 184 ENDEWYALSIDSHADDDIMAVATHIEGT----KKVFIGATAQANTKTSAENDIASRLVAAGFQRTALIYHP------NA- 252 (428) Q Consensus 184 ~~~~w~~~~~~~~~~~~~~ala~~~~a~----~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~t~~~y~~------~~- 252 (428) ....++.+........-..++..+++.. ..++.+.-..... +.+...+..+..+..|..+.|.. .. 
T Consensus 508 ~~tgi~aLl~a~~~~~V~~aii~~~e~~~~~~~~r~avid~p~g~---t~~~Ai~~r~~f~S~~aal~~Pwvkv~D~~~g 584 (774) T protein:vir:98 508 ENQPVHILLVGTTNVGVQQALITEAERASDSDGLRIAVLAAPPRT---TPTLAASVTRGFNSTRAVMVAGWFTYAGQPNS 584 (774) T ss_pred cccceeEEEcCccchhhHHHHHHHHHHhhhcccceEEEEECCCCC---CHHHHHHHHhccCCceEEEEeCcEEEeccCCC Confidence 1234555554444444455555555532 3344433322221 12233333344444555444321 11 Q ss_pred --cchhHHHHHHHHHhccCCCceeeeeeeecCcc--------ccCCCHHHHHHHHhCCceEEE-EEcCce-eeecCEecC Q lcl|NC_019918. 253 --DAQFPECAWVGYQLQEQPGSNTWTHKALAAVD--------AYRLTPTESTNLKNKNVTTFE-RVGGVN-RTFGGAMAG 320 (428) Q Consensus 253 --~~~~~~a~~~~~~~~~~~g~~t~~fk~~~Gv~--------~~~~t~t~~~~l~~~~~n~y~-~~~~~~-~~~~G~~~~ 320 (428) ....|.+.++|.....++ .....+|.+.|+. .+..++.+.+.|..+++|..+ ...+.+ .++.+++++ T Consensus 585 ~~~~vPpSg~vAGl~ArtDv-~kSPANk~I~Givg~ai~~~l~~~~t~ae~d~Ln~~gIN~i~itt~g~G~rvWG~RTls 663 (774) T protein:vir:98 585 SRYGVPGAAVYAGKLAAIDF-FVSPAARSLVGPLFNIIESDTDNYTSRSNQDIYSAARLEVLSLDTVDRTYRFASGVTLS 663 (774) T ss_pred ceeecChhHHHHHHHHhcCc-ccccCCceeecceeccccccccccccchhhhhhcccccceeEEEEcCCcEEEEcccccC Confidence 123456788888877764 4455677777764 223567888899999999987 344555 568888887 Q ss_pred Cc----hhHHHHHHHHHHHHHHHHHHHHHHhcCCCCcCHhHHHHHHHHHHHHHHHHHhcCceecCCceEEEeCchHhCCH Q lcl|NC_019918. 321 GE----WIDVMIFVDWLEARMTERLWFRMANSKKIPYDAVGATILESEIRAQLNEGIRVGGLAEAPAPKVFVPDVLSMSP 396 (428) Q Consensus 321 G~----~iD~~~~~dwl~~~lq~~l~~ll~~~~kip~~~~G~~~i~~~i~~~~~~~~~~G~I~~g~~~~v~~~~~~~~~~ 396 (428) ++ ||-+.+-.+|++..|+..+..++ .+ |.|+.....|+..++.-|+..++.|.|. 
|++.-.-+.++.++ T Consensus 664 sDp~wr~InVRRlfd~Ie~SI~~~~~~~V---fE-PNd~~l~~~I~~sI~~fL~~L~~~GaL~---G~~~V~~D~etNt~ 736 (774) T protein:vir:98 664 TDPAWERIYLRRVHDVVRQGAHAILRNYV---AM-PNSRLVRNQIAAALNAFMGELKRNGNIV---SFRPAIIDGSNNST 736 (774) T ss_pred CCcccceEeehhhHHHHHHHHHHHHHHhc---cC-CCCHHHHHHHHHHHHHHHHHHHhCCcee---cceEEEEcCCCCCH Confidence 74 77788888999998888777654 34 7899999999999999999999999996 45532334666789 Q ss_pred HHHhccccCCeEEEEEECceEEEEEEEEEEec Q lcl|NC_019918. 397 NMRAQRIFEGIEFEARLAGAIHFVHIRGTVTV 428 (428) Q Consensus 397 ~dra~R~~~~i~~~~~~agaih~v~i~~~v~~ 428 (428) +|+.+++.. +.+.+.+...+++|.++..-.- T Consensus 737 ~dI~~G~l~-i~I~vaP~~PAEfIilri~q~t 767 (774) T protein:vir:98 737 AAYFSRELY-VSLQFQPLYSADYIYVTISRDT 767 (774) T ss_pred HHhhCCEEE-EEEEEEecCCcceEEEEEEEee Confidence 999988877 9999999999999999887777 No 53 >protein:vir:108052 Length: 660 # NCBI annotation: gp18 tail sheath protein # Family: family:all:661 # MgeID: mge:2002 # MgeName: JS98 # Cross-refs: genbank:acc:YP_001595294;genbank:gi:161622600;genbank:GeneID:5783771 Probab=97.82 E-value=1.7e-05 Score=46.73 Aligned_cols=413 Identities=11% Similarity=0.075 Sum_probs=168.2 Q ss_pred CC-----------CCCceEEE-eeeeecccccccccceEE-------------EEcccC---CCccceEEeeCHHHH--- Q lcl|NC_019918. 1 MT-----------VLTDVIDI-QISRETAAVAQTNFNVPL-------------FIASHT---NFSERARVYNSLKGV--- 49 (428) Q Consensus 1 M~-----------~is~iV~V-~i~~~~~~~~~~~f~~~l-------------i~~~~~---~~~~~~~~y~s~~~V--- 49 (428) +. .++.--.+ ...+.+.. ...++..+ +.+... .+..-...+.+..-+ T Consensus 120 ~~~~~~~~~~~v~~~~a~g~~~~~~~~ta~--~~~~a~~v~~~~~~~~~~~~~~~~~~~~~~~a~sv~~~v~~~~~~~~~ 197 (660) T protein:vir:10 120 YNQTVVESEGRVTSVDTDGKILSVFIPSAK--IIAYARSLNQYPTLGPAWTAEVTSASSGVSGTITVGKIVTDSGILLTE 197 (660) T ss_pred eccccccccccceeeccccceeeecccccc--ccccccccccccccccceeEEEecccCccccceeeeeeeccCcceEEe Confidence 00 00000000 00000000 00000000 000000 000000000000000 Q ss_pred --HhhcCCChHHHHHHHHHHhcCCcc--cEEEEEeeecccccc--cchheeeccccccccc--ceeee------------ Q lcl|NC_019918. 
50 --AEDFGESDPTYLAAVRYFGQALKP--RSLVIGRRQVPSATV--SVSVVQEGQSYVLTVN--GLPVS------------ 109 (428) Q Consensus 50 --~~~fg~~s~eY~aA~~~F~q~p~P--~~l~igr~~~~~~~~--~~~~~~~~~~~~~~v~--g~~~s------------ 109 (428) ...-+...+.+. ....+.+.| .-...|.+....... .......+.....++. +.... T Consensus 198 ~~~~~~~~~~~~~~---~~~~~~~~~~~~a~~~g~~G~~i~v~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 274 (660) T protein:vir:10 198 AENSEEAITSLEFQ---AALKKFAMPGVVALYPGEIGSTLEVEIVSKAAYEAGSSKMLDVYPGGGTRASIAKAVFNYGPQ 274 (660) T ss_pred eeccccccccccce---eeccccccceeeeecccccCcceeEEEeeccccCCcceeEEeeeeccceeeEEeeeecccccc Confidence 000000000000 000000011 011111110000000 0000000000000000 00000 Q ss_pred ------eeecccchhhh---hhhhheee-ecccceEEEEeeccccceeeeeccccccccccceEEEEeec------cccC Q lcl|NC_019918. 110 ------YVSHQDDTATL---IATGLKAA-YDVTPVVGVTVTDNEDGTLTVASNGDWSLKVSSNLTMAAAP------STEG 173 (428) Q Consensus 110 ------~~~~~~~~a~~---i~a~l~~a-~~~~~~~~~~~tt~~~~~~t~as~~~~~~~~s~~~~~~~~~------aa~~ 173 (428) ......+.... +....... ................+....+............+.+..+. ...+ T Consensus 275 ~~~~~~~~v~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~l~gg~~~~~~~t~~~ 354 (660) T protein:vir:10 275 TDDQYAIIVRRDGAIVESVVLSTKEGEKDVYGNNIYLDDYFAKGTSNYIYATSLNWPKGFSGIINLSGGISANDKVTAGD 354 (660) T ss_pred cccccccccccCCcccceeeeeccccccccccceeeeehhhcCCCccEEEEEeccCCCCcccceeeeccccCccccccch Confidence 00000000000 00000000 00000000000000000000000000000000111111111 1122 Q ss_pred HHHHHHHHHhcc-cCceEEEEecC---CHHH----HHHHHHHHhhhCCEEEEEecCcc-----cccchhHHHHHHHHhc- Q lcl|NC_019918. 174 WPATITAVQGEN-DEWYALSIDSH---ADDD----IMAVATHIEGTKKVFIGATAQAN-----TKTSAENDIASRLVAA- 239 (428) Q Consensus 174 ~~~al~~~~~~~-~~w~~~~~~~~---~~~~----~~ala~~~~a~~~~~~~~~~~~~-----~~~~~~~~~~~~l~~~- 239 (428) ....+..+.+.. ..+-.++.... .+++ ..+|...++....++.+.-.... ......+++....... 
T Consensus 355 ~~~~~~~~~~~~~~~~~~l~~p~~~~~~~~~~~~v~~al~~~~~~~~~~~aiid~p~~~~~~~~~~~~~~~~~~~r~~~~ 434 (660) T protein:vir:10 355 LMQGWDLFADREALHINLLIAGAVAGEGDEVASTVQKHVVSIADERQDCLAFISPPKGLLVNVPLTRAVDNLIDWRTGAG 434 (660) T ss_pred hhhhhhhhhhhhhcccceEEEcCcCCCchhhhHHHHHHHHHHHHhhCCEEEEEecCcccccccccccCHHHHHHHHhhcc Confidence 333444443322 12333333321 2222 33455666666556654422111 1111223333222211 Q ss_pred -------c--cCceEEEecCC-------c---cchhHHHHHHHHHhccCCCce---eeeeeeecCcc-----ccCCCHHH Q lcl|NC_019918. 240 -------G--FQRTALIYHPN-------A---DAQFPECAWVGYQLQEQPGSN---TWTHKALAAVD-----AYRLTPTE 292 (428) Q Consensus 240 -------~--~~~t~~~y~~~-------~---~~~~~~a~~~~~~~~~~~g~~---t~~fk~~~Gv~-----~~~~t~t~ 292 (428) + ..+.+ +|++- . -...+.+.++|.+...+.-+- .-.+|++.|+. .-.+++.| T Consensus 435 ~~~~~~~~~~s~~~~-~~~p~~~~~d~~~~~~~~~p~sg~~AGl~Ar~D~~~g~~~sPan~~~~~i~g~~~~~~~~~~~e 513 (660) T protein:vir:10 435 TFDANNMNISTTYAA-IDGNYKYQYDKYNDVNRWVPLAADLAGLCARTDDVSQPWMSPAGYNRGQILNVLKLAIEPRQAQ 513 (660) T ss_pred cccccccccCcceEE-EEcCceEEecccCCceeEechhHHHHHHHHHhhccCCcEEccCCeeeceeeccceeeecCChhh Confidence 1 22222 33321 1 113466777777776554322 23456555443 13589999 Q ss_pred HHHHHhCCceEEEEEcC-ce-eeecCEecCC-----chhHHHHHHHHHHHHHHHHHHHHHHhcCCCCcCHhHHHHHHHHH Q lcl|NC_019918. 293 STNLKNKNVTTFERVGG-VN-RTFGGAMAGG-----EWIDVMIFVDWLEARMTERLWFRMANSKKIPYDAVGATILESEI 365 (428) Q Consensus 293 ~~~l~~~~~n~y~~~~~-~~-~~~~G~~~~G-----~~iD~~~~~dwl~~~lq~~l~~ll~~~~kip~~~~G~~~i~~~i 365 (428) .+.|..+|+|++.++-+ .+ .++..+|+++ .||-+.+-.+|+.+.|+...+..+-. 
|.++.-...|+..| T Consensus 514 ~~~Ln~~gIn~i~~~~~~~G~~~wG~rT~~~~~s~~~~i~vrR~~~~i~~si~~~~~~~v~e----pn~~~l~~~i~~~i 589 (660) T protein:vir:10 514 RDRMYQEAINPVVGFAGGDGFVLFGDKTATKVPSPMDHINVRRLFNMLKKNIGDASKYKLFE----LNDNFTRSSFRMEV 589 (660) T ss_pred HHhHhhCCceEEEEeeCCCcEEEEcccccCCCCcccceEehhhHHHHHHHHHHHHHHHhccC----CCCHHHHHHHHHHH Confidence 99999999999998754 45 5688888766 25677788899999888888775544 77889999999999 Q ss_pred HHHHHHHHhcCceecCCceEEEeCchHhCCHHHHhccccCCeEEEEEECceEEEEEEEEEEec Q lcl|NC_019918. 366 RAQLNEGIRVGGLAEAPAPKVFVPDVLSMSPNMRAQRIFEGIEFEARLAGAIHFVHIRGTVTV 428 (428) Q Consensus 366 ~~~~~~~~~~G~I~~g~~~~v~~~~~~~~~~~dra~R~~~~i~~~~~~agaih~v~i~~~v~~ 428 (428) +.-|+..++.|.|. ||.|... .++.+++|+.+++.. +.+.+++...+++|.++..-+- T Consensus 590 ~~fL~~l~~~gal~---g~~V~~d-~~~nt~~di~~G~~~-~~i~~~P~~pae~I~~~~~~~~ 647 (660) T protein:vir:10 590 SQYLDGIKALGGIY---EGRVVCD-TTVNTPAVIDRNEFI-ANIYVKPARSINYITLNFVATS 647 (660) T ss_pred HHHHHHHHhCCcee---eeEEEEc-CCCCCHHHhhCCeEE-EEEEEEecCCccEEEEEEEEee Confidence 99999999999997 4889887 667899999999988 9999999999999999877665 No 54 >protein:vir:6594 Length: 666 # NCBI annotation: tail sheath protein # Family: family:all:661 # MgeID: mge:139 # MgeName: RB49 # Cross-refs: genbank:acc:NP_891725;genbank:gi:33620670;genbank:GeneID:1725344 Probab=97.79 E-value=1.9e-05 Score=46.42 Aligned_cols=416 Identities=11% Similarity=0.076 Sum_probs=209.3 Q ss_pred CCCCCceEEEeeeeecccccccccceEEEEcccC-CCccceEEeeCHHHHHhhcC---CChHHHHHHHHHHhcCCcccEE Q lcl|NC_019918. 1 MTVLTDVIDIQISRETAAVAQTNFNVPLFIASHT-NFSERARVYNSLKGVAEDFG---ESDPTYLAAVRYFGQALKPRSL 76 (428) Q Consensus 1 M~~is~iV~V~i~~~~~~~~~~~f~~~li~~~~~-~~~~~~~~y~s~~~V~~~fg---~~s~eY~aA~~~F~q~p~P~~l 76 (428) |.+++.=|-|.--=.+.++....-+...|+|... 
.|.+.-...+|..|....|| ..+.++.+...+|-+.- .++ T Consensus 1 ~~~~~PgVyv~e~~~~~~i~~v~ts~~~fvG~~~~Gp~~~p~~v~s~~~~~~~fG~~~~~~~~~~~~~~~f~ngg--~~~ 78 (666) T protein:vir:65 1 MTLLSPGFETKETTLSTTIVQSETGRAALVGKFQWGPAFQIIQVTNEVELVNKFGQPDNNTADYFMSGANFLQYG--NDL 78 (666) T ss_pred CceecCceEEEEecCcccccccCcccceEEecccCCCCccCEEecCHHHHHHHcCCccccchhHHHHHHHHHhcC--ceE Confidence 9999887766432234556666777788888764 56677788889999999998 45556677777775433 356 Q ss_pred EEEeeecccc---cccc------hheeecc------ccccccc-------ceeee-------------eee------ccc Q lcl|NC_019918. 77 VIGRRQVPSA---TVSV------SVVQEGQ------SYVLTVN-------GLPVS-------------YVS------HQD 115 (428) Q Consensus 77 ~igr~~~~~~---~~~~------~~~~~~~------~~~~~v~-------g~~~s-------------~~~------~~~ 115 (428) +|-|-..... ...+ .....+. ....+.. +.... ... ... T Consensus 79 ~vvrv~~~~~~~~~~~~~~~~~~~~~~~g~~~~~g~~~~V~~~~~~~~~~~~~~~~~~~~~~~g~~~~t~~~~~~~~~~g 158 (666) T protein:vir:65 79 RVVRVLNKEKAKNATALAGNVEFEITNEGSNYEVGDTIKIKHNRQDIETAGKVTKVDGDGKVKGVFIPTGKIIAHAKAIG 158 (666) T ss_pred EEEEccCcccccccccccCceeeeEeeccccccccceEEEEecccccccccccccccccccccccccccceeeccccccC Confidence 6666421110 0000 0000000 0000000 00000 000 000 Q ss_pred chhhhhhhhhee--eec-c-------------cceEEEEee-------c-----------------------cccceeee Q lcl|NC_019918. 116 DTATLIATGLKA--AYD-V-------------TPVVGVTVT-------D-----------------------NEDGTLTV 149 (428) Q Consensus 116 ~~a~~i~a~l~~--a~~-~-------------~~~~~~~~t-------t-----------------------~~~~~~t~ 149 (428) ..+..+...... ... . ......... . +....+.. T Consensus 159 ~~~~l~~~~~~~~~~~~~~~~~a~sv~~~~~~g~~~~~~~~~a~~~~~~~~~~~~~~~~~~~a~~A~~~g~~g~~i~v~i 238 (666) T protein:vir:65 159 VYPELDGGWTAEFTSSSGNGSAALSVTKIVTDSGLLLTDLETSRANITNQTFLTKLKKYDMPAVSAIYAGEIGNSLEVEI 238 (666) T ss_pred cceeEeeccceeecccCcccccceeeeecccccceeeeeecccccccccccccccccccccceeeeeeccccccceeEEe Confidence 000000000000 000 0 000000000 0 00000000 Q ss_pred eccccc---cccc----------------------cceEEEE---e---------------------------------- Q lcl|NC_019918. 
150 ASNGDW---SLKV----------------------SSNLTMA---A---------------------------------- 167 (428) Q Consensus 150 as~~~~---~~~~----------------------s~~~~~~---~---------------------------------- 167 (428) ...... ...+ ...+.+. . T Consensus 239 ~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 318 (666) T protein:vir:65 239 LARSAFKNTAPDLTMYPYGGERTAARNLIPYAPQNDNQYAFIVRRDGVVVESYVLSTLKGDKDVYGNSIYMDDFFARGSS 318 (666) T ss_pred ecccccccccccccccccccccccceeeecccccccccceeeeecCCcccceeecccCcccccccchhhhhhhhhccccc Confidence 000000 0000 0000000 0 Q ss_pred ----------------------ecc--------------ccCHHHHHHHHHhcccCce-EEEEecC------CHHHHHHH Q lcl|NC_019918. 168 ----------------------APS--------------TEGWPATITAVQGENDEWY-ALSIDSH------ADDDIMAV 204 (428) Q Consensus 168 ----------------------~~a--------------a~~~~~al~~~~~~~~~w~-~~~~~~~------~~~~~~al 204 (428) +.. .......+..+.+...... .+..... ...-..+| T Consensus 319 ~~v~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~l~~p~~~~~~~~~~~v~~~l 398 (666) T protein:vir:65 319 QYIYATAQGWVDGFSGIISLAGGVSANEATTGGVGADPFIGAMMQGWDLFAERESIHVNLLIAGACAGEGDAFSTVQKHA 398 (666) T ss_pred ceeeeecccccccccceEEccCCCCcCcccccccccccccccHHHHHHHHhhhhhccCCceeecCcCCccchhHHHHHHH Confidence 000 0001122222222111111 1111110 11112334 Q ss_pred HHHHhhhCCEEEEEecCcc----c-ccchhHHHHHHHHhc----------ccCceEEEecC-------Ccc---chhHHH Q lcl|NC_019918. 205 ATHIEGTKKVFIGATAQAN----T-KTSAENDIASRLVAA----------GFQRTALIYHP-------NAD---AQFPEC 259 (428) Q Consensus 205 a~~~~a~~~~~~~~~~~~~----~-~~~~~~~~~~~l~~~----------~~~~t~~~y~~-------~~~---~~~~~a 259 (428) ...++....++...-.... . .....+++....... +..|.. +|++ ... 
...+.+ T Consensus 399 ~~~~~~~~~~~a~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~-~~~p~~~~~d~~~~~~~~~p~sg 477 (666) T protein:vir:65 399 VSIGDERQDCLVMVSPPRSTVVNIPVTTAIDNLIAWREGSGNYNENNMNINTTYAV-IDGNYKYQYDKYNDVNRWVPLAA 477 (666) T ss_pred HHHHhhccceEEEeccccceeeecCCCCCHHHHHHHHHhcccccccccccCcceEE-EEcCceEEecccCCceeEechHH Confidence 4444444444433211100 0 011122222222211 122332 3332 111 124567 Q ss_pred HHHHHHhccCCCc---eeeeeeeecCcc-----ccCCCHHHHHHHHhCCceEEEEEcCce-eeecCEecCCc-----hhH Q lcl|NC_019918. 260 AWVGYQLQEQPGS---NTWTHKALAAVD-----AYRLTPTESTNLKNKNVTTFERVGGVN-RTFGGAMAGGE-----WID 325 (428) Q Consensus 260 ~~~~~~~~~~~g~---~t~~fk~~~Gv~-----~~~~t~t~~~~l~~~~~n~y~~~~~~~-~~~~G~~~~G~-----~iD 325 (428) .++|.+...+.-+ .....|.+.||. .-.+++.|++.|..+|+|++.++.+.+ .++.++|+++. ||- T Consensus 478 ~vAGl~Ar~D~~~g~~~span~~~~~i~g~~~~~~~~~~~~~~~Ln~~gIn~i~~~~~~G~~~wG~rT~~~~~s~~~~i~ 557 (666) T protein:vir:65 478 DIAGLCARTDAVSQPWMSPAGYNRGQIMNVVKLAIEPRKAHRDRLYQAAINPVIGAGGEGFILMGDKTATTVPSPFDRIN 557 (666) T ss_pred HHHHHHHHHhccCCcEEccCCeecceeeccccceeecChhHHHhhhhCCceEEEEeCCCeEEEEecccCCCCCcccceEe Confidence 7777766554322 233455544443 135788999999999999999998875 56899988762 667 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcCCCCcCHhHHHHHHHHHHHHHHHHHhcCceecCCceEEEeCchHhCCHHHHhccccC Q lcl|NC_019918. 326 VMIFVDWLEARMTERLWFRMANSKKIPYDAVGATILESEIRAQLNEGIRVGGLAEAPAPKVFVPDVLSMSPNMRAQRIFE 405 (428) Q Consensus 326 ~~~~~dwl~~~lq~~l~~ll~~~~kip~~~~G~~~i~~~i~~~~~~~~~~G~I~~g~~~~v~~~~~~~~~~~dra~R~~~ 405 (428) +.+-.+|+...|+..++..+-. |.|+.=...|+..|+.-|++.+++|.|. ||.|... .++.+++|+.+++.. T Consensus 558 vrR~~~~i~~si~~~~~~~v~e----pn~~~l~~~i~~~i~~~L~~l~~~gal~---g~~V~~d-~~~nt~~~i~~G~~~ 629 (666) T protein:vir:65 558 VRRLFNMLKKNIGDSSKYKLFE----NNDNFTRASFRMEVSQYLSTIRSLGGIY---DFRVQCD-TTNNTPDVIDRNEFV 629 (666) T ss_pred hhhHHHHHHHHHHHHHHHhccC----CCCHHHHHHHHHHHHHHHHHHHhCCcee---eeEEEEc-CCCCCHHHhhCCeEE Confidence 8888899988888887765543 7788889999999999999999999997 5999987 668899999999886 Q ss_pred CeEEEEEECceEEEEEEEEEEec Q lcl|NC_019918. 
406 GIEFEARLAGAIHFVHIRGTVTV 428 (428) Q Consensus 406 ~i~~~~~~agaih~v~i~~~v~~ 428 (428) +.+.+++...+++|.++..-.- T Consensus 630 -~~i~~~p~~pae~i~~~~~~~~ 651 (666) T protein:vir:65 630 -ASMFIKPAKSINYIMLNFTAVA 651 (666) T ss_pred -EEEEEEecCCcceEEEEEEEee Confidence 9999999999999999866554 No 55 >protein:vir:100829 Length: 607 # NCBI annotation: hypothetical protein # Family: family:all:2449 # MgeID: mge:1633 # MgeName: LP65 # Cross-refs: genbank:acc:YP_164738;genbank:gi:56693151;genbank:GeneID:3197462 Probab=97.64 E-value=3.5e-05 Score=45.05 Aligned_cols=388 Identities=14% Similarity=0.106 Sum_probs=166.0 Q ss_pred CC-------------------CCCceEEE-----------eeeeecccccccccceEE-EEcccCCCccceEEeeCHHHH Q lcl|NC_019918. 1 MT-------------------VLTDVIDI-----------QISRETAAVAQTNFNVPL-FIASHTNFSERARVYNSLKGV 49 (428) Q Consensus 1 M~-------------------~is~iV~V-----------~i~~~~~~~~~~~f~~~l-i~~~~~~~~~~~~~y~s~~~V 49 (428) .+ +|.++.+| ++.+-+- +... +.-+.....++....-.. ++ T Consensus 138 ~~~~~~~~~~~~~d~~~~~~~n~g~~~~i~y~g~~~~a~~~v~~~~~-------g~~~~lt~~~~~~~~~~~~V~~~-~l 209 (607) T protein:vir:10 138 VFGVPRITVNYSPDNYERTYTNIGQMFSITYSGKSASAGYTVSHDTD-------GKAILLTLGSGDSIDKLTNVATF-DL 209 (607) T ss_pred CCCccceeEEeecccceeeeeeccceeecccCcccccccceeeecCC-------CceeEEEecCCCccceeeeeecc-cc Confidence 11 01111111 1111100 1111 111111111111110000 01 Q ss_pred HhhcCCChHHHHHHHHHHhcCCcccEEEEEeeecccc-----cccch------heeec-ccccccccceeeeeeecccch Q lcl|NC_019918. 50 AEDFGESDPTYLAAVRYFGQALKPRSLVIGRRQVPSA-----TVSVS------VVQEG-QSYVLTVNGLPVSYVSHQDDT 117 (428) Q Consensus 50 ~~~fg~~s~eY~aA~~~F~q~p~P~~l~igr~~~~~~-----~~~~~------~~~~~-~~~~~~v~g~~~s~~~~~~~~ 117 (428) .. |.-+..++ +..+++..|.-..=++|......- .-..+ .+.+. ...........+-..+.... 
T Consensus 210 ~~--~~~~t~~~-l~~din~~~~~~A~~~g~~~i~tky~d~~~~~i~V~~~~~iv~a~~~D~~~~~~~~~~~~~t~~~~- 285 (607) T protein:vir:10 210 TM--SKYDTIAK-LMQAISATPNFSASVVGSPSVNTSYLDEVTSPVDVKTAPAVVTAKIGDAISKLGYDPYVVVTQTSN- 285 (607) T ss_pred cc--cccchHHH-HHHHhhcCCceEEEEecccceeeeccccccceeEEEEeeeeechhhhhhhhcccccceEEeeeccc- Confidence 00 11111111 223344444322222321110000 00000 00000 00000000000000000011 Q ss_pred hhhhhhhheeeecccceEEEEeeccccceeeeeccccccccccceEEEEeeccccCHHHHHHHHHhcccCceEEEEecCC Q lcl|NC_019918. 118 ATLIATGLKAAYDVTPVVGVTVTDNEDGTLTVASNGDWSLKVSSNLTMAAAPSTEGWPATITAVQGENDEWYALSIDSHA 197 (428) Q Consensus 118 a~~i~a~l~~a~~~~~~~~~~~tt~~~~~~t~as~~~~~~~~s~~~~~~~~~aa~~~~~al~~~~~~~~~w~~~~~~~~~ 197 (428) ...+.+.... ..........+...+.. .........+-..+....+..++++++... +|+.+.+...+ T Consensus 286 ~~~~~~~~~~---~~~~~~~~~~~~~~~~~-------a~~a~~~LtGGtdG~~~~ty~dal~aLe~~--e~~~i~~~t~d 353 (607) T protein:vir:10 286 NKPIVNGVSA---GTGSATASVTTAPESFP-------ANFDTAFLTGGSTGDVPVSWADKFNGAIGN--NVYYIIPLTSE 353 (607) T ss_pred chhhhhhhhc---cccceeeeeeccccccc-------cccceeeeeCCCCCCchhhHHHHHHHHhhc--CceEEEecCCC Confidence 0111111100 00000000000000000 000000111112222345677888888765 46666666666 Q ss_pred HHHHHHHHHHHhhh---CCEEEEEecCcccccchhHHHHHHHHhcccCceEEEecCC----------ccchhHHHHHHHH Q lcl|NC_019918. 198 DDDIMAVATHIEGT---KKVFIGATAQANTKTSAENDIASRLVAAGFQRTALIYHPN----------ADAQFPECAWVGY 264 (428) Q Consensus 198 ~~~~~ala~~~~a~---~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~t~~~y~~~----------~~~~~~~a~~~~~ 264 (428) .+-+.++.+|++.. .+.+..+..... ......+....+..+++|.+.+.... .+....+++++|. 
T Consensus 354 ~ai~~~l~a~vkr~~~~g~~~~aVlg~~~--~~t~~~~~t~a~~~N~ervv~V~~~~~~~~~G~~~~~~~~~~Aa~vAGl 431 (607) T protein:vir:10 354 ENIHAELQAFIDEQHVLGYNYHAFVGGGF--AEPLEQILSRQVNINDSRFGLVGQSGHVQEGGESVHVPAYLMAAYVGGL 431 (607) T ss_pred HHHHHHHHHHHHHHHhCCCcEEEEecCCC--CCCHHHHHHHHHhhCCCcEEEEecCeeEeeCCcceeccHHHHHHHHHHH Confidence 66677899998753 334433332221 12234455566777888876554321 1122345666777 Q ss_pred HhccCCCceeeeeeeecCccc-cCCCHHHHHHHHhCCceEEEEEcCc-----eeeecCEecCC-----ch--hHHHHHHH Q lcl|NC_019918. 265 QLQEQPGSNTWTHKALAAVDA-YRLTPTESTNLKNKNVTTFERVGGV-----NRTFGGAMAGG-----EW--IDVMIFVD 331 (428) Q Consensus 265 ~~~~~~g~~t~~fk~~~Gv~~-~~~t~t~~~~l~~~~~n~y~~~~~~-----~~~~~G~~~~G-----~~--iD~~~~~d 331 (428) .....+. ..+.||.++++.. ..++.+|++.+.++|+..+....+. -.+.+|++.-+ .| |-.++-.| T Consensus 432 ~Ag~~~~-~SlT~k~i~~~~v~~~lt~~e~e~ai~~Gv~~l~~~~~~~~~~~vrIv~~ItT~t~~~~~~~~~i~viRv~D 510 (607) T protein:vir:10 432 SSSLGVA-VPITNKKLALVDLDQNFSGDDLNTLNQNGVIGIEHLVNRNATGGYYIVQDVSTNTVSSSHVDGSLYLGELTD 510 (607) T ss_pred HhcCccc-cCcccceeccccccccCCHHHHHHHHhCCeEEEEEccCccccceEEEeeeeeeccCCCCcchheeehhhhHH Confidence 7666553 3444555554332 3599999999999999988654332 23456665422 24 66888888 Q ss_pred HHHHHHHHHHHHHHHhcCCCCcCHhHHHHHHHHHHHHHHH--HHhcCceecCCceEEEeCchHhCCHHHHhccccCCeEE Q lcl|NC_019918. 332 WLEARMTERLWFRMANSKKIPYDAVGATILESEIRAQLNE--GIRVGGLAEAPAPKVFVPDVLSMSPNMRAQRIFEGIEF 409 (428) Q Consensus 332 wl~~~lq~~l~~ll~~~~kip~~~~G~~~i~~~i~~~~~~--~~~~G~I~~g~~~~v~~~~~~~~~~~dra~R~~~~i~~ 409 (428) .+...++..+-+.++ .|++. +.....++..+...|.. -...|.|.....-.+.+. ..+| . --+.+ T Consensus 511 ~i~~dir~~~~~~yI--Gk~nn-d~~~~~vk~~i~~~L~~~~l~~~gaI~df~~edv~v~-----~~~D----~-v~v~~ 577 (607) T protein:vir:10 511 FLFDNLRFVLRDTYI--GSNIR-STSADDIKSTVASYLYSEMNNDDGLIVDFSESDIVVT-----ISGT----V-VYIQF 577 (607) T ss_pred HHHHHHHHHHhhcCC--cccCC-cchHHHHHHHHHHHHHHHHHHhcCceeCCCccccEEe-----eCCC----E-EEEEE Confidence 888888777766555 34333 45667888888888753 344678853211011111 1122 2 23789 Q ss_pred EEEECceEEEEEEEEEEec Q lcl|NC_019918. 
410 EARLAGAIHFVHIRGTVTV 428 (428) Q Consensus 410 ~~~~agaih~v~i~~~v~~ 428 (428) .+++-.+|++|.+++.+.= T Consensus 578 ~v~Pv~~iekIyvtv~v~~ 596 (607) T protein:vir:10 578 AVAPTQEIKNIVVSGTYSN 596 (607) T ss_pred EEEEcccceEEEEEEEEEE Confidence 9999999999988777765 No 56 >protein:vir:79798 Length: 717 # NCBI annotation: tail sheath subunit # Family: family:all:12069 # MgeID: mge:1874 # MgeName: 0305phi8-36 # Cross-refs: genbank:acc:YP_001429630;genbank:gi:156564120;genbank:GeneID:5525563 Probab=97.56 E-value=4.4e-05 Score=44.45 Aligned_cols=390 Identities=12% Similarity=0.107 Sum_probs=163.2 Q ss_pred CC-CCCceEEEeeeeecccccccccceEEEEcccCCCccceEEeeCHHHHHhh-----cCCChHHHHHHHHHHhcC-Ccc Q lcl|NC_019918. 1 MT-VLTDVIDIQISRETAAVAQTNFNVPLFIASHTNFSERARVYNSLKGVAED-----FGESDPTYLAAVRYFGQA-LKP 73 (428) Q Consensus 1 M~-~is~iV~V~i~~~~~~~~~~~f~~~li~~~~~~~~~~~~~y~s~~~V~~~-----fg~~s~eY~aA~~~F~q~-p~P 73 (428) .+ +|=..++=.+.|.+.- +.+-||.+.+.=++.+-...+ .++-+..=... |-.+|+.|.-=+..-.++ ..| T Consensus 287 ~~~~~~~~~~~~~~~~~~~-~g~~~n~~~~~v~~~D~~~~~-~~t~~~~~~g~~~~~pl~~ts~dy~~~~~~vdgI~~~~ 364 (717) T protein:vir:79 287 YAYNLVEVIQPVIELESIF-GGGVYNDIMRKVESKDGAVTV-TITKPESKRGMISEDPLVFKSGDYTNFKMLVDAINNHP 364 (717) T ss_pred HHhhHHHhhccceEEeecc-cCceeeeeeeEEecCCceEEE-EEecccccCcceeccccccccCceeeeeeeecccccCc Confidence 11 1111111123333322 245666665554443321111 11111110000 111121111000000011 001 Q ss_pred -cEEEEEe-eecccccccchheeecc-cccccccceeeeeeecccchhhhhhhhheeeecccceEEEEeeccccceeeee Q lcl|NC_019918. 74 -RSLVIGR-RQVPSATVSVSVVQEGQ-SYVLTVNGLPVSYVSHQDDTATLIATGLKAAYDVTPVVGVTVTDNEDGTLTVA 150 (428) Q Consensus 74 -~~l~igr-~~~~~~~~~~~~~~~~~-~~~~~v~g~~~s~~~~~~~~a~~i~a~l~~a~~~~~~~~~~~tt~~~~~~t~a 150 (428) .+|++.| +....+....+...... ++....+++...... +...+.... ...+..+.. 
T Consensus 365 ~~~V~~~g~~s~a~a~~~~g~~s~d~a~f~Gg~dgl~~~~ee--------~Y~~lGgk~------------~d~g~lt~~ 424 (717) T protein:vir:79 365 FNNVVRARTKPEFEATFTSTLQAAADAKFSGGKDELSLDKEE--------MYKRLGGEK------------NEEGFVTKQ 424 (717) T ss_pred hhheeeeecccccceeeeecccCchhhccCCCccccccchhh--------hhccccccc------------cccccccch Confidence 1222222 21122211111111111 011111111100000 000000000 000000000 Q ss_pred ccccccc--cccceEEEEee--------ccccCHHHHHH-HHHhcc-cCce-EEEE--ecCCHHHHHHHHHHHhhh---C Q lcl|NC_019918. 151 SNGDWSL--KVSSNLTMAAA--------PSTEGWPATIT-AVQGEN-DEWY-ALSI--DSHADDDIMAVATHIEGT---K 212 (428) Q Consensus 151 s~~~~~~--~~s~~~~~~~~--------~aa~~~~~al~-~~~~~~-~~w~-~~~~--~~~~~~~~~ala~~~~a~---~ 212 (428) +.... ...-.+.+..+ ........++. .+...+ .... .... ....+...-.+..|...- . T Consensus 425 --aays~LE~~dVDlVil~ga~adtt~ga~~d~va~alad~caalSal~r~ai~VI~l~sp~D~~~AtVe~~~~kLs~~A 502 (717) T protein:vir:79 425 --GAYQYLENYEVDYVIPLGVHADTKLIGKYDDFAYQLALACAVMSHYNSVTIGIIPTTTPSDISLAGVEEHVKKLENYA 502 (717) T ss_pred --hhhhhcCcceeEEEEecCccccccccchhhhHHHHHHHHHHHhhhccccceeeeccccccccchhhHHHHHHHHHhhh Confidence 00000 00000000000 00011111111 111111 0011 1111 111111111122222210 0 Q ss_pred CE-EEEEecCc--ccccchhHHHHHHHHhcccCceEEEecCCc--cchhHHHHHHHHHhccCCCceeeeeeeecCcc--c Q lcl|NC_019918. 213 KV-FIGATAQA--NTKTSAENDIASRLVAAGFQRTALIYHPNA--DAQFPECAWVGYQLQEQPGSNTWTHKALAAVD--A 285 (428) Q Consensus 213 ~~-~~~~~~~~--~~~~~~~~~~~~~l~~~~~~~t~~~y~~~~--~~~~~~a~~~~~~~~~~~g~~t~~fk~~~Gv~--~ 285 (428) .. +.+..... .......-++..++.... .+-..+.+... ....+++.+++....+.+ .....+|.+.|+. . 
T Consensus 503 aa~~~~d~~~a~a~~~~~~~idis~y~~vv~-~~~~iv~~~~~~~~~~p~AG~vAGldA~rGV-wkSPANk~I~GVvgLa 580 (717) T protein:vir:79 503 NEFYMRDRFGNIIFDADRNKIDLGQFIEVVA-GPDFIVRNTRLGQMASTPDASYIGMVSQLKT-QSAPTNKPLPSVTALR 580 (717) T ss_pred hhhhhhcchhccccccccccccccceeeeee-cceeEEEcCCCceeecCHHHHHHHHHhcCCc-ccccccceecccccCc Confidence 00 00000000 000000001111110000 01111122111 123456777877776654 3445688888776 3 Q ss_pred cCCCHHHHHHHHhCCceEEEEEcCcee-eecCEecCCc-----hhHHHHHHHHHHHHHHHHHHHHHHhcCCCCcCHhHHH Q lcl|NC_019918. 286 YRLTPTESTNLKNKNVTTFERVGGVNR-TFGGAMAGGE-----WIDVMIFVDWLEARMTERLWFRMANSKKIPYDAVGAT 359 (428) Q Consensus 286 ~~~t~t~~~~l~~~~~n~y~~~~~~~~-~~~G~~~~G~-----~iD~~~~~dwl~~~lq~~l~~ll~~~~kip~~~~G~~ 359 (428) ..++..|++.|..+|+|++..+.|.++ ++.+++++++ +|-+.+-.|++...|+..+..++ .+ |-++.+.. T Consensus 581 ~~lT~sE~d~Ln~aGIntIr~~~GrGirVWGaRTtasd~sdWryInVRRl~D~Ie~sIr~al~~yV---gE-PNd~~tr~ 656 (717) T protein:vir:79 581 YTYSANQLNRLTKARFATFKYKQDGSIGVVDAPTSAHAGSDYTRLSTARIVKEAVNAVREVADPFI---GE-PNDTGNRN 656 (717) T ss_pred ccCCHHHHHHHhhCCeEEEEEeCCceEEEEeeeecCCCCcccceeehhhhHHHHHHHHHHHHHHhc---cc-cCCHHHHH Confidence 568999999999999999998877664 6889987652 57788889999999988776543 33 77889999 Q ss_pred HHHHHHHHHHHHHHhcCceecCCceEEEeCchHhCCHHHHhccccCCeEEEEEECceEEEEEEEEEEec Q lcl|NC_019918. 360 ILESEIRAQLNEGIRVGGLAEAPAPKVFVPDVLSMSPNMRAQRIFEGIEFEARLAGAIHFVHIRGTVTV 428 (428) Q Consensus 360 ~i~~~i~~~~~~~~~~G~I~~g~~~~v~~~~~~~~~~~dra~R~~~~i~~~~~~agaih~v~i~~~v~~ 428 (428) .|++.|+.-|++..+.|.|. ||.+.. ..+++|..+.+.. 
+.+.+.+..++++|.|+.+|+= T Consensus 657 ~Ik~sI~afL~~L~r~GAI~---Gykvdv----tnT~~di~~G~l~-V~I~vaPv~PaEfI~ititITA 717 (717) T protein:vir:79 657 ALTAAVDKRLSKMIENKALL---GFDFRL----VVTPQQELLGEGS-IELSLEAPNELRRLTTIVSLSA 717 (717) T ss_pred HHHHHHHHHHHHHHhcCcee---cceeeE----ecChhHhhCCEEE-EEEEEEecCcccEEEEEEEEeC Confidence 99999999999999999997 455543 4577777766554 8899999999999999988888 No 57 >protein:vir:7206 Length: 659 # NCBI annotation: gp18 tail sheath protein # Family: family:all:661 # MgeID: mge:142 # MgeName: T4 # Cross-refs: genbank:acc:NP_049780;genbank:gi:9632592;genbank:GeneID:1258597 Probab=97.54 E-value=4.9e-05 Score=44.23 Aligned_cols=416 Identities=14% Similarity=0.132 Sum_probs=207.9 Q ss_pred CCCCCceEEEeeeeecccccccccceEEEEcccC-CCccceEEeeCHHHHHhhcC---CChHHHHHHHHHHhcCCcccEE Q lcl|NC_019918. 1 MTVLTDVIDIQISRETAAVAQTNFNVPLFIASHT-NFSERARVYNSLKGVAEDFG---ESDPTYLAAVRYFGQALKPRSL 76 (428) Q Consensus 1 M~~is~iV~V~i~~~~~~~~~~~f~~~li~~~~~-~~~~~~~~y~s~~~V~~~fg---~~s~eY~aA~~~F~q~p~P~~l 76 (428) |.+++.=|-|.---.+.......-+...|+|... .|.+.....+|..|...-|| ..+.++.+...+|-+.- +++ T Consensus 1 ~~~~~PgVyvee~~~~~~~~~~~ts~~~fvG~~~~Gp~~~p~~i~s~~~~~~~fG~~~~~~~~~~~~~~~f~ngg--~~~ 78 (659) T protein:vir:72 1 MTLLSPGIELKETTVQSTVVNNSTGTAALAGKFQWGPAFQIKQVTNEVDLVNTFGQPTAETADYFMSAMNFLQYG--NDL 78 (659) T ss_pred CceecCceEEEEecCCcccccCCCcceEEEeecCCCCCcccEEecCHHHHHHHcCCcCCCCchhHHHHHHHHhCC--ceE Confidence 9999887666422222222233556777888764 56677888999999999999 55667777777775543 245 Q ss_pred EEEeeec---c-cccccchhe-----------eeccccccccc----------------c----------eeee------ Q lcl|NC_019918. 77 VIGRRQV---P-SATVSVSVV-----------QEGQSYVLTVN----------------G----------LPVS------ 109 (428) Q Consensus 77 ~igr~~~---~-~~~~~~~~~-----------~~~~~~~~~v~----------------g----------~~~s------ 109 (428) +|-|-.. . .+......+ .........+. + ..+. 
T Consensus 79 ~vvRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~g~v~~~~~~~~~~~~~v~t~~~~a~~~~~~ 158 (659) T protein:vir:72 79 RVVRAVDRDTAKNSSPIAGNIDYTISTPGSNYAVGDKITVKYVSDDIETEGKITEVDADGKIKKINIPTGKNYAKAKEVG 158 (659) T ss_pred EEEEccCCcccccccccccccceeecccccccccceeeeeeeccccccccceEEEeeccccceeeeeccccccccccccc Confidence 6555311 0 000000000 00000000000 0 0000 Q ss_pred ------------eeecccchhhhh----------------h---hhhee-eec-------ccceE-EEEeeccccceee- Q lcl|NC_019918. 110 ------------YVSHQDDTATLI----------------A---TGLKA-AYD-------VTPVV-GVTVTDNEDGTLT- 148 (428) Q Consensus 110 ------------~~~~~~~~a~~i----------------~---a~l~~-a~~-------~~~~~-~~~~tt~~~~~~t- 148 (428) ........+..+ . ..+.. .++ ..... ....+.+...++. T Consensus 159 ~~~~~~~~~~~~~~~~~~~~a~~~~~v~v~~~~~~~~~~v~~~~~a~~~~~~~~~v~~~~~~~~~a~~~gt~g~~~tv~i 238 (659) T protein:vir:72 159 EYPTLGSNWTAEISSSSSGLAAVITLGKIITDSGILLAEIENAEAAMTAVDFQANLKKYGIPGVVALYPGELGDKIEIEI 238 (659) T ss_pred cccccccceeeEEeeccccccceEEEEEeecCcceeeeeccccchhhhcccccccccccccceeeeccccccccceeEEE Confidence 000000000000 0 00000 000 00000 0000000000000 Q ss_pred --------------------------------eecccccc--c-----------------------------------cc Q lcl|NC_019918. 149 --------------------------------VASNGDWS--L-----------------------------------KV 159 (428) Q Consensus 149 --------------------------------~as~~~~~--~-----------------------------------~~ 159 (428) +....... . .. T Consensus 239 ~~~~~~~~~~~~~v~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 318 (659) T protein:vir:72 239 VSKADYAKGASALLPIYPGGGTRASTAKAVFGYGPQTDSQYAIIVRRNDAIVQSVVLSTKRGEKDIYDSNIYIDDFFAKG 318 (659) T ss_pred ccccccccceeeeeecccccccccccceeeeeeecccccccceeeecccceeeeeeeeeccccccccchhhhhhhhhhcC Confidence 00000000 0 00 Q ss_pred cceEEEE----------------eec------cccCHHHHHHHHHhcc-cCceEEEEecC---CHHHH----HHHHHHHh Q lcl|NC_019918. 
160 SSNLTMA----------------AAP------STEGWPATITAVQGEN-DEWYALSIDSH---ADDDI----MAVATHIE 209 (428) Q Consensus 160 s~~~~~~----------------~~~------aa~~~~~al~~~~~~~-~~w~~~~~~~~---~~~~~----~ala~~~~ 209 (428) ...++.. .+. ...+...++..+.... .+...+.+... ..++. .+|...++ T Consensus 319 ~~~~v~~~~~~~~~~~~~~~~l~gg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~p~~~~~~~~~~~~v~~~l~~~~~ 398 (659) T protein:vir:72 319 GSEYIFATAQNWPEGFSGILTLSGGLSSNAEVTAGDLMEAWDFFADRESVDVQLFIAGSCAGESLETASTVQKHVVSIGD 398 (659) T ss_pred CceEEEEEecccCCcccccccccccccccccccchhHHHHHHHhhhccccceeEEEecCCCCcchhhhHHHHHHHHHHHh Confidence 0000000 000 0001122222222211 11222333221 11222 23444444 Q ss_pred hhCCEEEEEecCccc-----ccchhHHHHHHHHhc----------ccCceEEEecC-------Ccc---chhHHHHHHHH Q lcl|NC_019918. 210 GTKKVFIGATAQANT-----KTSAENDIASRLVAA----------GFQRTALIYHP-------NAD---AQFPECAWVGY 264 (428) Q Consensus 210 a~~~~~~~~~~~~~~-----~~~~~~~~~~~l~~~----------~~~~t~~~y~~-------~~~---~~~~~a~~~~~ 264 (428) ....++.+.-..... .....+++...-+.. +..|.. +|++ ... ...|.+.++|. T Consensus 399 ~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~s~~~~-~~~p~~~~~d~~~~~~~~~p~sg~vAGl 477 (659) T protein:vir:72 399 ARQDCLVLCSPPRETVVGIPVTRAVDNLVNWRTAAGSYTDNNFNISSTYAA-IDGNHKYQYDKYNDVNRWVPLAADIAGL 477 (659) T ss_pred hhCCEEEEEcCccccccCCCcccCHHHHHHHHhhccccccccccccceeEE-EEcCceeeccccCCceEEechHHHHHHH Confidence 444455443222111 111222332222211 122333 3332 111 12356777777 Q ss_pred HhccCCCc---eeeeeeeecCcc-----ccCCCHHHHHHHHhCCceEEEEEcCce-eeecCEecCCc-----hhHHHHHH Q lcl|NC_019918. 265 QLQEQPGS---NTWTHKALAAVD-----AYRLTPTESTNLKNKNVTTFERVGGVN-RTFGGAMAGGE-----WIDVMIFV 330 (428) Q Consensus 265 ~~~~~~g~---~t~~fk~~~Gv~-----~~~~t~t~~~~l~~~~~n~y~~~~~~~-~~~~G~~~~G~-----~iD~~~~~ 330 (428) +...+.-+ ..-.+|.+.||. ...+++.|.+.|..+++|++.++.+.+ .++..++++++ ||-+.+-. 
T Consensus 478 ~Ar~D~~~G~~~span~~~~~i~g~~~~~~~~~~~~~~~Ln~~gIn~i~~~~g~G~~~wG~rT~~~~~s~~~~i~vrR~~ 557 (659) T protein:vir:72 478 CARTDNVSQTWMSPAGYNRGQILNVIKLAIETRQAQRDRLYQEAINPVTGTGGDGYVLYGDKTATSVPSPFDRINVRRLF 557 (659) T ss_pred HHHhhccCCcEEccCCeeeceeeccccccccCChhHHHHHhhCCceEEEEecCCeEEEEcccccCCCCcccceEeehhHH Confidence 76555322 233455544443 235789999999999999999998765 56888887762 67778888 Q ss_pred HHHHHHHHHHHHHHHHhcCCCCcCHhHHHHHHHHHHHHHHHHHhcCceecCCceEEEeCchHhCCHHHHhccccCCeEEE Q lcl|NC_019918. 331 DWLEARMTERLWFRMANSKKIPYDAVGATILESEIRAQLNEGIRVGGLAEAPAPKVFVPDVLSMSPNMRAQRIFEGIEFE 410 (428) Q Consensus 331 dwl~~~lq~~l~~ll~~~~kip~~~~G~~~i~~~i~~~~~~~~~~G~I~~g~~~~v~~~~~~~~~~~dra~R~~~~i~~~ 410 (428) +|+...|+..+...+-. |.++.=...|+..|+.-|++.++.|.|. +|.|.+. .++.+++|+.+.+.. +.+. T Consensus 558 ~~i~~si~~~~~~~v~e----~n~~~l~~~i~~~i~~fL~~l~~~gal~---~~~V~~d-~~~nt~~~i~~G~~~-~~i~ 628 (659) T protein:vir:72 558 NMLKTNIGRSSKYRLFE----LNNAFTRSSFRTETAQYLQGNKALGGIY---EYRVVCD-TTNNTPSVIDRNEFV-ATFY 628 (659) T ss_pred HHHHHHHHHHHHHhhcC----CCCHHHHHHHHHHHHHHHHHHHhcCcee---eEEEEEc-CCCCCHHHhhCCeEE-EEEE Confidence 88888888877764433 7788889999999999999999999995 6999987 778899999999988 9999 Q ss_pred EEECceEEEEEEEEEEec Q lcl|NC_019918. 411 ARLAGAIHFVHIRGTVTV 428 (428) Q Consensus 411 ~~~agaih~v~i~~~v~~ 428 (428) +.+...+++|.++..-.- T Consensus 629 ~~p~~pae~I~~~~~~~~ 646 (659) T protein:vir:72 629 IQPARSINYITLNFVATA 646 (659) T ss_pred EEecCCccEEEEEEEEee Confidence 999999999999876544 No 58 >protein:vir:106427 Length: 679 # NCBI annotation: gp18 tail sheath protein # Family: family:all:661 # MgeID: mge:1474 # MgeName: Aeh1 # Cross-refs: genbank:acc:NP_944106;genbank:gi:38640150;genbank:GeneID:2658188 Probab=97.24 E-value=0.00012 Score=42.11 Aligned_cols=398 Identities=11% Similarity=0.081 Sum_probs=160.8 Q ss_pred CCCCCceEE---Eee---------------eeecccccccccceEEEEcccCC-Cc--cceEE--eeC------HHHHHh Q lcl|NC_019918. 
1 MTVLTDVID---IQI---------------SRETAAVAQTNFNVPLFIASHTN-FS--ERARV--YNS------LKGVAE 51 (428) Q Consensus 1 M~~is~iV~---V~i---------------~~~~~~~~~~~f~~~li~~~~~~-~~--~~~~~--y~s------~~~V~~ 51 (428) ...+..++. +.+ .-.+.......++.+.+...... .. -.+.. +.. ...+.. T Consensus 182 ~~~v~~v~~~~~~~~~~~~~a~~~i~~~~~~~~t~~~~~~~~~~~~~~A~~~g~~gn~i~v~~va~~~~~~~~~~~a~v~ 261 (679) T protein:vir:10 182 TATVVGINLDSTIFVPNDEYAMSAISERSETKRTFIDICEEMKVPAIVARYAGTYGDNIKVLMIAYKDYYKFNEAGKIVS 261 (679) T ss_pred eeeeeeeccCCceeeccccccccccccccccchhhhhhhhccccceeeeecccccCCcceEEEEeecccccccccccccc Confidence 111111110 000 00000000011111111111000 00 00000 000 000000 Q ss_pred hcCCChHHHHHHHHHHhcCCcccEEEEEeeecccccccchheeecccccccccceeeeeeeccc---chhhhhhhhheee Q lcl|NC_019918. 52 DFGESDPTYLAAVRYFGQALKPRSLVIGRRQVPSATVSVSVVQEGQSYVLTVNGLPVSYVSHQD---DTATLIATGLKAA 128 (428) Q Consensus 52 ~fg~~s~eY~aA~~~F~q~p~P~~l~igr~~~~~~~~~~~~~~~~~~~~~~v~g~~~s~~~~~~---~~a~~i~a~l~~a 128 (428) ........+......-..... -...+.-. ........ +..... ......++...... .....+...+ T Consensus 262 ~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~-~~~~~~vv-v~~~g~---~~~~~~~~~~~~~~~~~~~~~~~~~~~--- 331 (679) T protein:vir:10 262 VNTINPKVFPTGLDYGNVTPS--SYLEFGPQ-NESQFAFI-VFNNGV---AVESKILSTKPGDRDIYGTSIYINEYF--- 331 (679) T ss_pred cccccccccccccccccceee--eecccccc-cccceeeE-Eecccc---cccceeeecccccccccchhhhhhhhh--- Confidence 000000000000000000000 00000000 00000000 000000 00000000000000 0000000000 Q ss_pred ecccceEEEEeec----cccceeeeeccccccccccceEEEEeeccccCHHHHHHHHHhccc-CceEEEEecC---CH-- Q lcl|NC_019918. 129 YDVTPVVGVTVTD----NEDGTLTVASNGDWSLKVSSNLTMAAAPSTEGWPATITAVQGEND-EWYALSIDSH---AD-- 198 (428) Q Consensus 129 ~~~~~~~~~~~tt----~~~~~~t~as~~~~~~~~s~~~~~~~~~aa~~~~~al~~~~~~~~-~w~~~~~~~~---~~-- 198 (428) .++.......... .....+....+...... ............+..... .--.+++... .. 
T Consensus 332 ~~~~~~~v~~~~~~~~~~~~~~~~~~gg~~~~~~----------~~~~~~~~~~~~~~~~~~~~~~~l~~p~~~~~~~~~ 401 (679) T protein:vir:10 332 GNGYSSFVQGVAESWPVGYTGVLAFGGGQSSNTD----------ISAAEFMKGWDMFADREHTDVNLFIAGAVAGEGAQI 401 (679) T ss_pred cCcccceeeeccccccccccceeeccCCccCCCc----------cchhhhhhhhhhhhcccccccceEEecCCCCCchhh Confidence 0000000000000 00000000000000000 011111112221111111 1112222221 11 Q ss_pred --HHHHHHHHHHhhhCCEEEEEecCcccc-----cchhHHHHHHHHh-----------cc--cCceEEEecC-------C Q lcl|NC_019918. 199 --DDIMAVATHIEGTKKVFIGATAQANTK-----TSAENDIASRLVA-----------AG--FQRTALIYHP-------N 251 (428) Q Consensus 199 --~~~~ala~~~~a~~~~~~~~~~~~~~~-----~~~~~~~~~~l~~-----------~~--~~~t~~~y~~-------~ 251 (428) .-+.+|-..++....+|.+.-...... ....+++...-.. .+ ..|.+ +|++ . T Consensus 402 ~~~v~~~l~~~~~~~~~~~ai~d~p~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~s~~~~-~~~p~~~~~d~~ 480 (679) T protein:vir:10 402 ASTVQKAVVAIADERRDCLVLISPPREYMINQPAASVVRKLVDWRRGVNQAGISLDDNMNIGTTYAS-VDGNYKYQYDKY 480 (679) T ss_pred hHHHHHHHHHHHHhhCCeEEEEeccccccccccccchHHHHHHHHhhcccccchhhhhhccCcceEE-EEccceeeeccc Confidence 123445556666665665543221111 1111222111110 01 12233 2332 1 Q ss_pred cc---chhHHHHHHHHHhccCCCc---eeeeeeeecCcc-----ccCCCHHHHHHHHhCCceEEEEEcCce-eeecCEec Q lcl|NC_019918. 252 AD---AQFPECAWVGYQLQEQPGS---NTWTHKALAAVD-----AYRLTPTESTNLKNKNVTTFERVGGVN-RTFGGAMA 319 (428) Q Consensus 252 ~~---~~~~~a~~~~~~~~~~~g~---~t~~fk~~~Gv~-----~~~~t~t~~~~l~~~~~n~y~~~~~~~-~~~~G~~~ 319 (428) .. ...|.+.++|.+...+.-+ ....+|++.||. 
.-.+++.|++.|..+|+|....+.+.+ .++..+|+ T Consensus 481 ~~~~~~~p~sg~vAGl~Ar~D~~~g~~~sPan~~~~~i~g~~~~~~~~~~~~~~~Ln~~gin~i~~~~g~G~~~wG~rT~ 560 (679) T protein:vir:10 481 NDVNRWIPLAADIAGLCARTDTVGQPWQSPAGFNRGQIVNVIKLAVDTRQAHRDEMYTNGINPIVGFAGQGYILYGDKTA 560 (679) T ss_pred CCceEEechHHHHHHHHHHhhccCCcEECcCCeeeccccccccceeecChhhHHhhhhCCceEEEEecCCeEEEEccccc Confidence 11 1245677777776555322 223455555443 234789999999999999999998775 56889988 Q ss_pred CCc-----hhHHHHHHHHHHHHHHHHHHHHHHhcCCCCcCHhHHHHHHHHHHHHHHHHHhcCceecCCceEEEeCchHhC Q lcl|NC_019918. 320 GGE-----WIDVMIFVDWLEARMTERLWFRMANSKKIPYDAVGATILESEIRAQLNEGIRVGGLAEAPAPKVFVPDVLSM 394 (428) Q Consensus 320 ~G~-----~iD~~~~~dwl~~~lq~~l~~ll~~~~kip~~~~G~~~i~~~i~~~~~~~~~~G~I~~g~~~~v~~~~~~~~ 394 (428) ++. ||-+.+-.+|++..|+......+-. |.|+.=...|+..|+.-|.+..++|.|. ||.|... .++. T Consensus 561 ~~~~s~~~~i~vrR~~~~i~~si~~~~~~~v~e----pn~~~~~~~i~~~i~~fL~~l~~~gal~---gf~v~~d-~~~n 632 (679) T protein:vir:10 561 SQAPTPFDRINVRRLFNLLKKSISESAKYKLFE----LNDAFTRSSFRSEVGSYLDTIRSLGGIY---DFRVVCD-ESNN 632 (679) T ss_pred CCCCcccceEehhhHHHHHHHHHHHHHHHhccC----CCCHHHHHHHHHHHHHHHHHHHhCCcee---eeEEEEc-CCCC Confidence 763 5667788888888888877765443 6788889999999999999999999997 5999988 5788 Q ss_pred CHHHHhccccCCeEEEEEECceEEEEEEEEEEec Q lcl|NC_019918. 395 SPNMRAQRIFEGIEFEARLAGAIHFVHIRGTVTV 428 (428) Q Consensus 395 ~~~dra~R~~~~i~~~~~~agaih~v~i~~~v~~ 428 (428) +++|+.+++.. 
+.+.+.+...+++|.++..-.- T Consensus 633 t~~~i~~G~~~-~~i~~~p~~pae~i~~~~~~~~ 665 (679) T protein:vir:10 633 TPAVIDRNEFV-ATILIKPARSINYITLSFVATS 665 (679) T ss_pred CHHHhhCCeEE-EEEEEEecCCccEEEEEEEEee Confidence 99999999886 9999999999999999877655 No 59 >protein:vir:5663 Length: 671 # NCBI annotation: tail sheath protein # Family: family:all:661 # MgeID: mge:119 # MgeName: KVP40 # Cross-refs: genbank:acc:NP_899602;genbank:gi:34419589;genbank:GeneID:2545776 Probab=97.04 E-value=0.0002 Score=40.89 Aligned_cols=416 Identities=13% Similarity=0.112 Sum_probs=205.9 Q ss_pred CCCCCceEEEeeeeecccccccccceEEEEcccC-CCccceEEeeCHHHHHhhcCC---ChHHHHHHHHHHhcCCcccEE Q lcl|NC_019918. 1 MTVLTDVIDIQISRETAAVAQTNFNVPLFIASHT-NFSERARVYNSLKGVAEDFGE---SDPTYLAAVRYFGQALKPRSL 76 (428) Q Consensus 1 M~~is~iV~V~i~~~~~~~~~~~f~~~li~~~~~-~~~~~~~~y~s~~~V~~~fg~---~s~eY~aA~~~F~q~p~P~~l 76 (428) |.+++.=|-|..--.+.++....-+...|+|... .|.+.-...+|..|....||. .+.++.+...||-+.- +++ T Consensus 1 ~~~~~Pgvyv~e~~~~~~i~~v~t~~~~fvG~~~~Gp~~~p~~v~s~~~~~~~fG~~~~~~~~~~~v~~~f~ngg--~~~ 78 (671) T protein:vir:56 1 MTLLSPGIENKEINLASAIGRAATGRAAMVGKFEWGPAYSITQVTSESDLVTIFGRPNDYTAASFMTANNFLKYG--NDL 78 (671) T ss_pred CceecCceEEEeecCcccccccCcccceEEecccCCCCccCEEcCCHHHHHHHcCCcCCCcchhHHHHHHHHhcC--CeE Confidence 9999987777533256678888888888998764 566677788899999999886 6777788888887654 356 Q ss_pred EEEeeecccc---cccch-----heeecccc----cccc---------cc-----eeeeee----ecccchhh------h Q lcl|NC_019918. 77 VIGRRQVPSA---TVSVS-----VVQEGQSY----VLTV---------NG-----LPVSYV----SHQDDTAT------L 120 (428) Q Consensus 77 ~igr~~~~~~---~~~~~-----~~~~~~~~----~~~v---------~g-----~~~s~~----~~~~~~a~------~ 120 (428) +|-|-..... ..... ........ .+.+ .+ .+.... ......+. . 
T Consensus 79 ~vvrv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~v~~~~~ 158 (671) T protein:vir:56 79 RLVRICDATTAQNATPLYNAVEYTIGASNGCVVGDDITITYSGVGALTAKGKVLEVDAGNNNAASKIFLPSAEIVAAAKS 158 (671) T ss_pred EEEEecCccccccchhhccccccccccCcceeeceeeeeecCcccccccCcceeEEeeeccceeeeeeccceeEEEeeec Confidence 7766421110 00000 00000000 0000 00 000000 00000000 0 Q ss_pred hh--hhhe-eeecccc--eEEEEeeccccceee----------------------eec-----c---------cccccc- Q lcl|NC_019918. 121 IA--TGLK-AAYDVTP--VVGVTVTDNEDGTLT----------------------VAS-----N---------GDWSLK- 158 (428) Q Consensus 121 i~--a~l~-~a~~~~~--~~~~~~tt~~~~~~t----------------------~as-----~---------~~~~~~- 158 (428) .. ..+. ....... ...........+... +.. . ...+.. T Consensus 159 ~~~~~~~~~~t~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~g~~g~~~ 238 (671) T protein:vir:56 159 DGNYPSVGTITLQPTQGDIALTNIEIIDTGSVYFPNIELAFDALTAIETEGGALKYADLIEKQGFPRLSARYVGDFGDAI 238 (671) T ss_pred cccccccccccccccccceeeeeecccccceEEEeccccccccccccccccccccchhhhhcccccccccccccccCcce Confidence 00 0000 0000100 000000000000000 000 0 000000 Q ss_pred ---------------ccceEE----------------------------------E--E-eecc---------------- Q lcl|NC_019918. 159 ---------------VSSNLT----------------------------------M--A-AAPS---------------- 170 (428) Q Consensus 159 ---------------~s~~~~----------------------------------~--~-~~~a---------------- 170 (428) ...... . . .+.. T Consensus 239 ~v~v~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~~~~~~~~~~~~~~~~ 318 (671) T protein:vir:56 239 SVEIINYADYQTAFAFAAGHTLGDIELPIYPDGGTRSINLSSYFTFGPSNSNQYAVIVRVSGEVEEAFIVSTNPGDKDVN 318 (671) T ss_pred EEEEecccccccccccccceeeeeccccccccccccccccceeecccccccccceeEEeecCccceeEEEeecccccccc Confidence 000000 0 0 0000 Q ss_pred -------------------------------------------ccCHHHHHHHHHhcccCce-EEEEec--CCH----HH Q lcl|NC_019918. 
171 -------------------------------------------TEGWPATITAVQGENDEWY-ALSIDS--HAD----DD 200 (428) Q Consensus 171 -------------------------------------------a~~~~~al~~~~~~~~~w~-~~~~~~--~~~----~~ 200 (428) ..+..+++..+.+. .... -+.... ... .. T Consensus 319 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~d~~~~~~~~~~~~~~~~~~-~~~~~~~~~a~~~~~~~~~~~~ 397 (671) T protein:vir:56 319 GQSIFIDEYFENSGSAYITAIAEGWKTESGAYNFGGGSDANAGADDWMFGLDMLSDP-EVLYTNLVIAGNAAAEEVSIAS 397 (671) T ss_pred hhhhhhhhhhcccCceEEEecCcccCCccccccccCccccccchhHHHHHHHhhhhc-cccceeEEEcCCCCCccchhHH Confidence 00000011111100 0000 000000 000 00 Q ss_pred ---HHHHHHHHhhhCCEEEEEecCccc-----ccchhHHHHHHHHh--------------cccCceEEEec------CCc Q lcl|NC_019918. 201 ---IMAVATHIEGTKKVFIGATAQANT-----KTSAENDIASRLVA--------------AGFQRTALIYH------PNA 252 (428) Q Consensus 201 ---~~ala~~~~a~~~~~~~~~~~~~~-----~~~~~~~~~~~l~~--------------~~~~~t~~~y~------~~~ 252 (428) ...+....+....++.+....... ......++...... .+..+.++.|. +.. T Consensus 398 ~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~ 477 (671) T protein:vir:56 398 TVQKYAIDSVGNVRQDCVVFVSPPQSLIVNKQAGTAVANIQGWRTGIDPTNGQAVVDNLNVSTTYAVIDGNYKYQYDKYN 477 (671) T ss_pred HHHHHHHHHHHhhcCCEEEEEeccccccccccccccHHHHHHHhhhccccchhhhhhhccCCcceEEEecCceEEecccC Confidence 001111112222333332211100 00111111111111 11222322221 111 Q ss_pred c---chhHHHHHHHHHhccCCCcee---eeeeeecCcc-----ccCCCHHHHHHHHhCCceEEEEEcCce-eeecCEecC Q lcl|NC_019918. 253 D---AQFPECAWVGYQLQEQPGSNT---WTHKALAAVD-----AYRLTPTESTNLKNKNVTTFERVGGVN-RTFGGAMAG 320 (428) Q Consensus 253 ~---~~~~~a~~~~~~~~~~~g~~t---~~fk~~~Gv~-----~~~~t~t~~~~l~~~~~n~y~~~~~~~-~~~~G~~~~ 320 (428) . ...+.+.++|.+...+.-+-- ...|.+.|+. 
...+++.|.+.|..+|+|+..++.+.+ .++..++++ T Consensus 478 ~~~~~~p~s~~~AGl~Ar~D~~~g~~~span~~~~~i~g~~~~~~~~~~~~~~~Ln~~gIn~i~~~~~~G~~~wG~rT~~ 557 (671) T protein:vir:56 478 DRNRWVPLAGDIAGLCAYTDQVSQPWMSPAGFNRGQIKGVNRLAVDLRRAHRDALYQIGINPVVGFAGQGFVLYGDKTAT 557 (671) T ss_pred CceeEechHHHHHHHHHHhhccCCcEECcCCceeccccccccceeecChhHHHHHhhCCceEEEEecCCeEEEEcceecC Confidence 1 123567777777655533222 2344444332 235789999999999999999998765 568888877 Q ss_pred C-----chhHHHHHHHHHHHHHHHHHHHHHHhcCCCCcCHhHHHHHHHHHHHHHHHHHhcCceecCCceEEEeCchHhCC Q lcl|NC_019918. 321 G-----EWIDVMIFVDWLEARMTERLWFRMANSKKIPYDAVGATILESEIRAQLNEGIRVGGLAEAPAPKVFVPDVLSMS 395 (428) Q Consensus 321 G-----~~iD~~~~~dwl~~~lq~~l~~ll~~~~kip~~~~G~~~i~~~i~~~~~~~~~~G~I~~g~~~~v~~~~~~~~~ 395 (428) + .||-+.+-.+|+...|+..++..+-. |.++.=...|+..|+.-|+..++.|.|. ||.|.+. .++.+ T Consensus 558 ~~~~~~~~i~vrR~~~~i~~si~~~~~~~v~e----pn~~~~~~~i~~~i~~fL~~l~~~gal~---g~~v~~d-~~~nt 629 (671) T protein:vir:56 558 QQASAFDRINVRRLFNLLKKAISDAAKYRLFE----LNDEFTRSSFKSEIDAYLTNIQDLGGVY---DFRVVCD-ETNNP 629 (671) T ss_pred CCCcccceEehhhHHHHHHHHHHHHHHHhcCC----CCCHHHHHHHHHHHHHHHHHHHhCCcee---eeEEEEc-CCCCC Confidence 6 26778888899988888887764433 6688888999999999999999999997 5999987 67889 Q ss_pred HHHHhccccCCeEEEEEECceEEEEEEEEEEec Q lcl|NC_019918. 396 PNMRAQRIFEGIEFEARLAGAIHFVHIRGTVTV 428 (428) Q Consensus 396 ~~dra~R~~~~i~~~~~~agaih~v~i~~~v~~ 428 (428) ++|+.+++.. 
+.+.+++...+++|+++..-+- T Consensus 630 ~~~i~~G~~~-~~i~~~p~~Pae~I~~~~~~~~ 661 (671) T protein:vir:56 630 GSVIDRNEFV-ASIYVKPAKSINFITLNFVATS 661 (671) T ss_pred HHHhhCCeEE-EEEEEEecCCcceEEEEEEEee Confidence 9999999886 9999999999999999877665 No 60 >protein:vir:98263 Length: 664 # NCBI annotation: gp18 tail sheath monomer # Family: family:all:661 # MgeID: mge:1667 # MgeName: RB43 # Cross-refs: genbank:acc:YP_239196;genbank:gi:66391671;genbank:GeneID:3416365 Probab=96.94 E-value=0.00025 Score=40.33 Aligned_cols=416 Identities=10% Similarity=0.087 Sum_probs=209.6 Q ss_pred CCCCCceEEEeeeeecccccccccceEEEEcccC-CCccceEEeeCHHHHHhhcCC---ChHHHHHHHHHHhcCCcccEE Q lcl|NC_019918. 1 MTVLTDVIDIQISRETAAVAQTNFNVPLFIASHT-NFSERARVYNSLKGVAEDFGE---SDPTYLAAVRYFGQALKPRSL 76 (428) Q Consensus 1 M~~is~iV~V~i~~~~~~~~~~~f~~~li~~~~~-~~~~~~~~y~s~~~V~~~fg~---~s~eY~aA~~~F~q~p~P~~l 76 (428) |+.++.=|-|.--=.+.++....-+...|+|... .|.+.....+|..|...-||. .+.++-+...+|-+.- +++ T Consensus 1 ma~~~PgVyv~E~~~~~~i~~~~ts~~~~vG~~~~Gp~~~p~~i~s~~d~~~~fG~~~~~~~~~~~v~~~f~ngg--~~~ 78 (664) T protein:vir:98 1 MALQSPGIETKETSVQSTVVRNSTGRAAIVGKFSWGPAYQIRQISNEVELVNYFGAPDNLTADYFMSAVNFLQYG--NDL 78 (664) T ss_pred CceecCceEEEecCCCcccccccccceEEEeeccCCCCCccEEecCHHHHHHhcCCccccchhHHHHHHHHHhcC--CeE Confidence 9999987777522245677777778888888764 566777888899999998883 4556677777775432 234 Q ss_pred EEEeeecc-----cccc----c------------c-------hh------------eeecccc----------------- Q lcl|NC_019918. 77 VIGRRQVP-----SATV----S------------V-------SV------------VQEGQSY----------------- 99 (428) Q Consensus 77 ~igr~~~~-----~~~~----~------------~-------~~------------~~~~~~~----------------- 99 (428) +|-|-... +... . . .. ...+... 
T Consensus 79 ~vvRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~gn~~~v~i~~~~~~~~~~~~~ 158 (664) T protein:vir:98 79 RLVRVVDKDAAKNASAIFNQIKTTIASQGSNYNVGDVIKVKYSNTVVEENGKVTQVDSNGKILAVTIPKRKKSLLVLNRS 158 (664) T ss_pred EEEEecCccccccccccccccceeeccCCcccccccceeEeecCcccccceeeccCCCCCceeeEeeccCccceeecccc Confidence 44442100 0000 0 0 00 0000000 Q ss_pred -------------cc----------cccce--eeeeeec-ccchhhhhh--------h-----hheeeeccc-------- Q lcl|NC_019918. 100 -------------VL----------TVNGL--PVSYVSH-QDDTATLIA--------T-----GLKAAYDVT-------- 132 (428) Q Consensus 100 -------------~~----------~v~g~--~~s~~~~-~~~~a~~i~--------a-----~l~~a~~~~-------- 132 (428) .. .+.+. +.+.... .......+. . .+.....+. T Consensus 159 ~~~~~~~~~~~~s~~~~s~g~a~a~~v~~v~~d~~~~~~~~~~a~~~i~~~~~~~~~~~~~~~~~~a~~~G~~Gn~isv~ 238 (664) T protein:vir:98 159 VLTQIFLLVGTTEIVSQSSGVSASITIDGIESDSGITLLNLDIAKETIQGTSFQTLTQKYQIPSVVALYPGELGSTVQVE 238 (664) T ss_pred cccccceecccceeeeeecccceeeecccccccceeeccccceeeeccccccceeeeeccccceeeeeecccccceeeee Confidence 00 00000 0000000 000000000 0 000000000 Q ss_pred ---------ceEEEEeecc-----------------ccc-eeeee-----------c-cc---c-ccc---------ccc Q lcl|NC_019918. 133 ---------PVVGVTVTDN-----------------EDG-TLTVA-----------S-NG---D-WSL---------KVS 160 (428) Q Consensus 133 ---------~~~~~~~tt~-----------------~~~-~~t~a-----------s-~~---~-~~~---------~~s 160 (428) .......... .+. .++.. + .. . ... ... T Consensus 239 i~s~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 318 (664) T protein:vir:98 239 IISKAAYDTGAMISGYPSGISVKNSGRSVMTYGPQTDNQYAFVVRRGGIVQESFIVSTDKTDKDIYGVNIYMDDFFANGG 318 (664) T ss_pred ecccccccCcceEeeccCceecccceeeeeeccccCccceeEEEecCCceeeeEEeecccCcccceeeeeechhheeccc Confidence 0000000000 000 00000 0 00 0 000 000 Q ss_pred ceEE----------------EEeecc------ccCHHHHHHHHHhcc-cCceEEEEecC---CHHH----HHHHHHHHhh Q lcl|NC_019918. 
161 SNLT----------------MAAAPS------TEGWPATITAVQGEN-DEWYALSIDSH---ADDD----IMAVATHIEG 210 (428) Q Consensus 161 ~~~~----------------~~~~~a------a~~~~~al~~~~~~~-~~w~~~~~~~~---~~~~----~~ala~~~~a 210 (428) ..+. +..+.. .....+.+..+.+.. .+.-.+.+... ..+. ..+|...++. T Consensus 319 ~~~~~~~~~~~~~~~~~~~~~~gg~~~~~~~g~~~~~tgl~~l~~~~~~~~~ll~~p~~~~~~~~~~~~v~~al~~~a~~ 398 (664) T protein:vir:98 319 SQYVFGTSMNWPKGFSGILEFGGGLSSNDTVGADELMTGWDMFADREALHVPLLIAGGCAGESVEIASTVQKHVISIGDE 398 (664) T ss_pred ceeeeeecccCCcccceeEeccCccccccccCchhHHHHHHhhhcccccccceEEecCCCCCcHHHHHHHHHHHHHHHHh Confidence 0000 000000 111223344443321 11122333221 1222 2334444555 Q ss_pred hCCEEEEEecCccc-----ccchhHHHHHHHHh------------cc--cCceEEEecC-------Ccc---chhHHHHH Q lcl|NC_019918. 211 TKKVFIGATAQANT-----KTSAENDIASRLVA------------AG--FQRTALIYHP-------NAD---AQFPECAW 261 (428) Q Consensus 211 ~~~~~~~~~~~~~~-----~~~~~~~~~~~l~~------------~~--~~~t~~~y~~-------~~~---~~~~~a~~ 261 (428) ...+|.+.-..... .....+++....+. .+ ..+. .+|++ ... ...+.+.+ T Consensus 399 ~~~~~a~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~-~l~~p~~~~~d~~~~~~~~~p~sg~~ 477 (664) T protein:vir:98 399 RQDCTVFVSPPRSLLVNIPLATAVDNIVEWRTGYKISGGTPVDNNLNVSSSYG-FLDGNYKYQYDKYNDVNRWVPLAGDI 477 (664) T ss_pred cCCeEEEEccccceeccCCccccHHHHHHHhhhccccccchhhhhcCCccceE-EEEcCeEEEecccCCceEEechHHHH Confidence 55455443211100 01111222221111 11 2222 23332 111 12456777 Q ss_pred HHHHhccCCCc---eeeeeeeecCcc-----ccCCCHHHHHHHHhCCceEEEEEcC-ce-eeecCEecCCc-----hhHH Q lcl|NC_019918. 262 VGYQLQEQPGS---NTWTHKALAAVD-----AYRLTPTESTNLKNKNVTTFERVGG-VN-RTFGGAMAGGE-----WIDV 326 (428) Q Consensus 262 ~~~~~~~~~g~---~t~~fk~~~Gv~-----~~~~t~t~~~~l~~~~~n~y~~~~~-~~-~~~~G~~~~G~-----~iD~ 326 (428) +|.+...+.-+ .....|.+.||. ...+++.|.+.|..+|+|.+..+-+ .+ .++..+|+++. 
||-+ T Consensus 478 AGl~A~~D~~~g~~~span~~~~~i~g~~~~~~~~~~~~~~~Ln~~gIn~i~~~~~~~G~~~wG~rT~~~~~s~~~~i~v 557 (664) T protein:vir:98 478 AGLCVYTDSVANPWMSPAGYNRGQIRNCIKLAIEPRTAHRDAMYQVQINPVTGFAGGSGFVLYGDKTLTSVPSPFDRINV 557 (664) T ss_pred HHHHHHhhhcCCcEECcCCceeeeeeccccceeecChhhHHHHHhCCCeEEEEeeCCCcEEEEcccccCCCCcccceEee Confidence 77666554322 223445444443 2457889999999999999998766 45 57888887762 5677 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCCCCcCHhHHHHHHHHHHHHHHHHHhcCceecCCceEEEeCchHhCCHHHHhccccCC Q lcl|NC_019918. 327 MIFVDWLEARMTERLWFRMANSKKIPYDAVGATILESEIRAQLNEGIRVGGLAEAPAPKVFVPDVLSMSPNMRAQRIFEG 406 (428) Q Consensus 327 ~~~~dwl~~~lq~~l~~ll~~~~kip~~~~G~~~i~~~i~~~~~~~~~~G~I~~g~~~~v~~~~~~~~~~~dra~R~~~~ 406 (428) .+-.+|+...|+..++..+-. |.++.=...|+..|+.-|+..+++|.|. ||.|... .++.+++|+.+++.. T Consensus 558 rR~~~~i~~si~~~~~~~v~e----pn~~~l~~~i~~~i~~~L~~l~~~gal~---g~~V~~d-~~~nt~~~i~~G~~~- 628 (664) T protein:vir:98 558 RRLFNMIKKDIGDNAKYKLFE----NNDDFTRASFRMDTGQYMTNIRALGGCY---DYRVICD-TTNNTPDVIDRNEFV- 628 (664) T ss_pred hhHHHHHHHHHHHHHHHhhcC----CCCHHHHHHHHHHHHHHHHHHHhcCcee---eeEEEEc-CCCCCHHHhhCCeEE- Confidence 788888888888877765443 7788889999999999999999999997 5899998 778899999999986 Q ss_pred eEEEEEECceEEEEEEEEEEec Q lcl|NC_019918. 407 IEFEARLAGAIHFVHIRGTVTV 428 (428) Q Consensus 407 i~~~~~~agaih~v~i~~~v~~ 428 (428) +.+.+++...+++|.++..-+- T Consensus 629 ~~i~~~p~~pae~I~~~~~q~~ 650 (664) T protein:vir:98 629 ATVYVKPPRSINYITLNFVATS 650 (664) T ss_pred EEEEEEecCCcceEEEEEEEee Confidence 9999999999999999877665 No 61 >protein:vir:100539 Length: 663 # NCBI annotation: gp18 tail sheath monomer # Family: family:all:661 # MgeID: mge:1488 # MgeName: 25 # Cross-refs: genbank:acc:YP_656380;genbank:gi:109290131;genbank:GeneID:4156517 Probab=96.78 E-value=0.00034 Score=39.57 Aligned_cols=417 Identities=12% Similarity=0.077 Sum_probs=204.2 Q ss_pred CCCCCceEEEeeeeecccccccccceEEEEcccC-CCccceEEeeCHHHHHhhcCC---ChHHHHHHHHHHhcCCcccEE Q lcl|NC_019918. 
1 MTVLTDVIDIQISRETAAVAQTNFNVPLFIASHT-NFSERARVYNSLKGVAEDFGE---SDPTYLAAVRYFGQALKPRSL 76 (428) Q Consensus 1 M~~is~iV~V~i~~~~~~~~~~~f~~~li~~~~~-~~~~~~~~y~s~~~V~~~fg~---~s~eY~aA~~~F~q~p~P~~l 76 (428) |.+++.=|-|.--=.+.++....-+...|+|... .|.+.-...+|..|....||. .+.++-+...+|-+.- .++ T Consensus 1 ~~~~~Pgvyv~e~~~~~~~~~v~t~~~~fvG~~~~gp~~~p~~i~s~~~~~~~fG~~~~~~~~~~~v~~~f~ngg--~~~ 78 (663) T protein:vir:10 1 MALLSPGIEMKETSINSTVVRSATGRAALVGKFAWGPAYEIRQVTNEVELVDMFGSPDNVTAPYFMSAMNFLQYG--NDL 78 (663) T ss_pred CccccCceEEEEecCcccccccccccceeeeccccCCCCcCEEecCHHHHHHHcCCcccccchHHHHHHHHHhCC--CeE Confidence 9999887766422233445555566777887764 566677888899999998885 4566788888887543 355 Q ss_pred EEEeeeccc----c-cccc----h---heee---cccc----------------cccccc-------------------- Q lcl|NC_019918. 77 VIGRRQVPS----A-TVSV----S---VVQE---GQSY----------------VLTVNG-------------------- 105 (428) Q Consensus 77 ~igr~~~~~----~-~~~~----~---~~~~---~~~~----------------~~~v~g-------------------- 105 (428) +|-|-.... + .... + ..+. +... .....| T Consensus 79 ~vvRv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~v~~~~~~~~~~~~~~~~~~~g~~~~~~~~~a~~~~~a~~~~ 158 (663) T protein:vir:10 79 RLVRVIDMEQAKNASPLFNQIEVTITTEGQGYTVGDTVSIKHNTTTVTEEGKVTKVDADGKIKALFVPSSAVIAKAKQLG 158 (663) T ss_pred EEEecCCcccccccccccccceeeEeecccCccccceeeecccccccccCcceeeeccCCceeEEEeccccccccccccc Confidence 555432100 0 0000 0 0000 0000 000000 Q ss_pred -------ee---eeeeeccc--------------------chhhhhhhhhee--------------eecccc---eEEEE Q lcl|NC_019918. 106 -------LP---VSYVSHQD--------------------DTATLIATGLKA--------------AYDVTP---VVGVT 138 (428) Q Consensus 106 -------~~---~s~~~~~~--------------------~~a~~i~a~l~~--------------a~~~~~---~~~~~ 138 (428) -. +....... ..+..-...... ...+.. ..... 
T Consensus 159 ~~~~~~~a~~~~v~~~~~~~~~a~av~~i~~dg~vt~~~~~~a~~~~~~~~~~~~~~~~~~~~~~a~~~g~~G~~i~v~~ 238 (663) T protein:vir:10 159 TYPVLGDNWRAEVSGASGGSAATLTLGGIVVDSGVTFGNSEEAPDVMTSTKVLANFAKYGMPLISAVYPGEIGSTVEVEV 238 (663) T ss_pred cccccccceeeEEeeccccccccceeEeeecCCceeEEeeeccccccccceeeeeccccccceeeeecccccCcceeEee Confidence 00 00000000 000000000000 000000 00000 Q ss_pred ee-cccc-----------c----e----eeeeccc----------cc--------c--------------------cccc Q lcl|NC_019918. 139 VT-DNED-----------G----T----LTVASNG----------DW--------S--------------------LKVS 160 (428) Q Consensus 139 ~t-t~~~-----------~----~----~t~as~~----------~~--------~--------------------~~~s 160 (428) .. +... . . ....... .. . ...+ T Consensus 239 ~~~~~~~~~~~~~v~~~~g~~~~~~~~~~~~g~~~~~~~~~~~~~~g~~~e~~~ls~~~~~~~~~~~~~~~~~~~~~~~s 318 (663) T protein:vir:10 239 ISKTAFQSGAAQPIYPFGGTRASNARSVIQYGPMTDDQFAIIVRRDGIVVESTVLSTRRGDRDVYGNNIFMDDYFRNGSS 318 (663) T ss_pred cccccccccceeeecccCcccccccccccccccccchhhcccccCCCcccceeeeeccccccccchhhhhhhhhhcCccc Confidence 00 0000 0 0 0000000 00 0 0000 Q ss_pred ce---------------EEEEeeccc------cCHHHHHHHHHhcc-cCceEEEEec---CCHHHH----HHHHHHHhhh Q lcl|NC_019918. 161 SN---------------LTMAAAPST------EGWPATITAVQGEN-DEWYALSIDS---HADDDI----MAVATHIEGT 211 (428) Q Consensus 161 ~~---------------~~~~~~~aa------~~~~~al~~~~~~~-~~w~~~~~~~---~~~~~~----~ala~~~~a~ 211 (428) .. +.++.+... .+....++.+.+.. .+...+.+.. ...++. .+|...++.. T Consensus 319 ~~v~~~~~~~~~~~~~~~~l~gg~~~~~~~~~~d~~~~~~~l~~~~~~~~~~~i~~~~~~~~~~~~~~v~~~l~~~~~~~ 398 (663) T protein:vir:10 319 NFIYASSVNWPAGFTGIIQLGGGASANNAVGSDELIAGWDLFADREALHVNLMIAGACKSDGVAVASTVQKHVVALADDR 398 (663) T ss_pred ceeEeeccccCcccceeEEecccccCcccchhhhhhhHHhhhccccccCceEEEeecCCCCchhhHHHHHHHHHHHHHhh Confidence 00 000000000 01111112222111 1222332221 111222 2333444444 Q ss_pred CCEEEEEecCcccccc--h---hHHHHHHHH-------------hcccCceEEEec------CCc--c-chhHHHHHHHH Q lcl|NC_019918. 
212 KKVFIGATAQANTKTS--A---ENDIASRLV-------------AAGFQRTALIYH------PNA--D-AQFPECAWVGY 264 (428) Q Consensus 212 ~~~~~~~~~~~~~~~~--~---~~~~~~~l~-------------~~~~~~t~~~y~------~~~--~-~~~~~a~~~~~ 264 (428) ..+|.+.-........ . ..++..... ..+..|..+.|. +.. . ...|.+.++|. T Consensus 399 ~~~~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~l~~p~~~~~d~~~~~~~~~p~s~~vAGl 478 (663) T protein:vir:10 399 QDCVAFVNPPSELLVGVPTTQAVKNIVEWRNGVTTGGEVVDNNMNISSTYAFISGNYKYQYDKYNDINRWVPLSADIAGL 478 (663) T ss_pred CCEEEEEecCcccccccchhhhHHHHHHHhhhccccchhhhhhcccCcceEEEEecceeEecccCCceEEechHHHHHHH Confidence 4444443222111110 0 111111110 112223333322 111 1 12355677776 Q ss_pred HhccCCCc---eeeeeeeecCccc-----cCCCHHHHHHHHhCCceEEEEEcC-ce-eeecCEecCCc-----hhHHHHH Q lcl|NC_019918. 265 QLQEQPGS---NTWTHKALAAVDA-----YRLTPTESTNLKNKNVTTFERVGG-VN-RTFGGAMAGGE-----WIDVMIF 329 (428) Q Consensus 265 ~~~~~~g~---~t~~fk~~~Gv~~-----~~~t~t~~~~l~~~~~n~y~~~~~-~~-~~~~G~~~~G~-----~iD~~~~ 329 (428) +...+.-+ .....|.+.||.- ..+++.|.+.|..+|+|.+..+-+ .+ .++..+|++++ ||-+.+- T Consensus 479 ~Ar~D~~~g~~~span~~~~~i~g~~~~~~~~~~~~~~~Ln~~gin~i~~~~~~~G~~~wG~rT~s~~~s~~~~i~vrR~ 558 (663) T protein:vir:10 479 CAYTDQVGHPWMSPAGYRRGQLRNTIKLAIEPKQSLRDTMYQVSINPVTGFAGGDGFVLFGDKMATQVPSPFDRINVRRL 558 (663) T ss_pred HHHhhccCCcEEccCCeeecceeccccceeecCchhHHHHHhCCCcEEEEeeCCCcEEEEcccccCCCCcccceEehhhH Confidence 65444222 2234455444432 357899999999999999998765 45 47888887763 5677778 Q ss_pred HHHHHHHHHHHHHHHHHhcCCCCcCHhHHHHHHHHHHHHHHHHHhcCceecCCceEEEeCchHhCCHHHHhccccCCeEE Q lcl|NC_019918. 330 VDWLEARMTERLWFRMANSKKIPYDAVGATILESEIRAQLNEGIRVGGLAEAPAPKVFVPDVLSMSPNMRAQRIFEGIEF 409 (428) Q Consensus 330 ~dwl~~~lq~~l~~ll~~~~kip~~~~G~~~i~~~i~~~~~~~~~~G~I~~g~~~~v~~~~~~~~~~~dra~R~~~~i~~ 409 (428) .+|+...|+..+...+-. |.++.-...|+..|+.-|++.+++|.|. ||.|... .++.+++|+.+.+.. 
+.+ T Consensus 559 ~~~i~~si~~~~~~~v~e----pn~~~l~~~i~~~i~~~L~~l~~~gal~---gf~V~~d-~~~nt~~~i~~G~~~-~~i 629 (663) T protein:vir:10 559 FNMLKKNIGDTSKYELFE----NNDAFTRQSFRMEVSQYLDNIRSLGGVY---DFRVVCD-TTNNTPQVIDSNEFV-ATI 629 (663) T ss_pred HHHHHHHHHHHHHHhccC----CCCHHHHHHHHHHHHHHHHHHHhCCcee---eeEEEEc-CCCCCHHHhhCCeEE-EEE Confidence 888888888877764433 7788999999999999999999999997 5899987 667899999999886 999 Q ss_pred EEEECceEEEEEEEEEEec Q lcl|NC_019918. 410 EARLAGAIHFVHIRGTVTV 428 (428) Q Consensus 410 ~~~~agaih~v~i~~~v~~ 428 (428) .+++...+++|+++..-+= T Consensus 630 ~~~p~~pae~I~~~~~~~~ 648 (663) T protein:vir:10 630 YIKAPRSINYITLNFVATS 648 (663) T ss_pred EEEecCCcceEEEEEEEEe Confidence 9999999999999876654 No 62 >protein:vir:101804 Length: 663 # NCBI annotation: gp18 # Family: family:all:661 # MgeID: mge:1580 # MgeName: 31 # Cross-refs: genbank:acc:YP_238881;genbank:gi:66391956;genbank:GeneID:3416631 Probab=96.67 E-value=0.00043 Score=39.06 Aligned_cols=385 Identities=11% Similarity=0.113 Sum_probs=163.5 Q ss_pred CC-----CCCceE---EEeeeeeccc-cc---------ccccceEEEEcccC-CCccceE-EeeCHHHHHhhcCCChHHH Q lcl|NC_019918. 1 MT-----VLTDVI---DIQISRETAA-VA---------QTNFNVPLFIASHT-NFSERAR-VYNSLKGVAEDFGESDPTY 60 (428) Q Consensus 1 M~-----~is~iV---~V~i~~~~~~-~~---------~~~f~~~li~~~~~-~~~~~~~-~y~s~~~V~~~fg~~s~eY 60 (428) +. .+..++ .+.+.....+ .. ...++.+-+..... .....+. ...+..+.. T Consensus 176 ~~~~~a~~v~~v~~d~~~~v~~~~~a~~~~t~~~~~~~~~~~~~~~i~A~~~G~~Gn~i~V~i~~~~~~~---------- 245 (663) T protein:vir:10 176 GGSAAALALGNIVVDSGVTFGNSEDAPAVMTSPAVMEKYAKFGMPLISAVYPGEIGSTVEVEIVSKTAFN---------- 245 (663) T ss_pred CccccccccceeccccceEEeeccccccccccccccccccccccceEEeccCCcccceeeeeeccccccc---------- Confidence 11 111111 0111111100 00 00011111111110 0000000 011110000 Q ss_pred HHHHHHHhcCCcccEEEE--Eeeecccccccchheeeccc--ccccccce-----eeeeeeccc---chhhhhhhhheee Q lcl|NC_019918. 
61 LAAVRYFGQALKPRSLVI--GRRQVPSATVSVSVVQEGQS--YVLTVNGL-----PVSYVSHQD---DTATLIATGLKAA 128 (428) Q Consensus 61 ~aA~~~F~q~p~P~~l~i--gr~~~~~~~~~~~~~~~~~~--~~~~v~g~-----~~s~~~~~~---~~a~~i~a~l~~a 128 (428) .... ..+++ +-.................. .+....+. .++...... .....+...+.. T Consensus 246 --------~~~~-~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~~~~~~~~~~~~~~~- 315 (663) T protein:vir:10 246 --------SGAQ-QTIYPFGGTRTSNARGVIQYGPMTDDQFAIIVRRDGIVVESTVLSTRKGDRDVYGSNIFMDDYFRN- 315 (663) T ss_pred --------cccc-cceecccccccccccceeecccccccceeeEeecCCcceeeecccccccccccccchhhhhhhhcC- Confidence 0000 00000 00000000000000000000 00000000 000000000 000000000000 Q ss_pred ecccceEEEEeeccccceeeeeccccccccccceEEEEeec------cccCHHHHHHHHHhcccCce-EEEEec---CCH Q lcl|NC_019918. 129 YDVTPVVGVTVTDNEDGTLTVASNGDWSLKVSSNLTMAAAP------STEGWPATITAVQGENDEWY-ALSIDS---HAD 198 (428) Q Consensus 129 ~~~~~~~~~~~tt~~~~~~t~as~~~~~~~~s~~~~~~~~~------aa~~~~~al~~~~~~~~~w~-~~~~~~---~~~ 198 (428) ...... ..............+.+..+. ...+...++..+.+...-.- .+.+.. ... T Consensus 316 -~~~~~~-------------~~~~~~~~~~~~~~~~l~gg~d~~~~~~~~~~~~~~~~l~~~~~~~~~~~i~~~~~~~~~ 381 (663) T protein:vir:10 316 -GGSNFI-------------FASSEGWPAGFTGIIQLGGGTSANADVGADELIKGWDLFSDREALHVNLMIAGACGSDGA 381 (663) T ss_pred -CcceEE-------------EEeecccCccccceeEeccccCCccccchhhhHHHHHhhhcccccceeEEEeccCCCCch Confidence 000000 000000000000001111111 11223334444443322121 222211 111 Q ss_pred HH----HHHHHHHHhhhCCEEEEEecCcccc-----cchhHHHHHHHHh-------------cccCceEEEec------C Q lcl|NC_019918. 199 DD----IMAVATHIEGTKKVFIGATAQANTK-----TSAENDIASRLVA-------------AGFQRTALIYH------P 250 (428) Q Consensus 199 ~~----~~ala~~~~a~~~~~~~~~~~~~~~-----~~~~~~~~~~l~~-------------~~~~~t~~~y~------~ 250 (428) ++ ..++...++....+|...-...... .....++...... ....+..+.|. 
+ T Consensus 382 ~~~~~v~~~l~~~a~~~~~~~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~ 461 (663) T protein:vir:10 382 EIASTVQKYVVSLADDRQDCVAIVNPPAELMVGIPTSTAVKNIVEWRNGMTGSGEVVDNNMNISSTYAFIIGNYKYQYDK 461 (663) T ss_pred hhHHHHHHHHHHHHHhhCCEEEEEecCcccccccccccchHHHHHHHHhccccccchhhhcccCccceEEEcCceEEecc Confidence 22 2344455555555554432221111 1112222222111 11223333222 1 Q ss_pred Ccc---chhHHHHHHHHHhccCCCc---eeeeeeeecCcc-----ccCCCHHHHHHHHhCCceEEEEEcC-ce-eeecCE Q lcl|NC_019918. 251 NAD---AQFPECAWVGYQLQEQPGS---NTWTHKALAAVD-----AYRLTPTESTNLKNKNVTTFERVGG-VN-RTFGGA 317 (428) Q Consensus 251 ~~~---~~~~~a~~~~~~~~~~~g~---~t~~fk~~~Gv~-----~~~~t~t~~~~l~~~~~n~y~~~~~-~~-~~~~G~ 317 (428) ... ...+.+.++|.+...+.-+ ....+|.+.++. ...+++.|++.|..+|+|++..+-+ .+ .++..+ T Consensus 462 ~~~~~~~~p~sg~vAGl~Ar~D~~~g~~~sPan~~~~~i~g~~~~~~~~~~~e~~~Ln~~gin~i~~~~~~~G~~~wG~r 541 (663) T protein:vir:10 462 YNDINRWVPLAADIAGLCAYTDQVSHPWMSPAGYRRGQIRNCIKLAIEPKQSMRDTMYQVAINPVTGFAGGDGFVLFGDK 541 (663) T ss_pred cCCceEEechhHHHHHHHHHhhccCCceEccCCceeccccccccceeecChhHHHHHhhCCceEEEEEeCCCcEEEEccc Confidence 111 1245677777766555322 223444433332 2457999999999999999988765 44 468888 Q ss_pred ecCC-----chhHHHHHHHHHHHHHHHHHHHHHHhcCCCCcCHhHHHHHHHHHHHHHHHHHhcCceecCCceEEEeCchH Q lcl|NC_019918. 318 MAGG-----EWIDVMIFVDWLEARMTERLWFRMANSKKIPYDAVGATILESEIRAQLNEGIRVGGLAEAPAPKVFVPDVL 392 (428) Q Consensus 318 ~~~G-----~~iD~~~~~dwl~~~lq~~l~~ll~~~~kip~~~~G~~~i~~~i~~~~~~~~~~G~I~~g~~~~v~~~~~~ 392 (428) ++++ .||-+.+-.+|+...|++.+...+-. |.++.=...|+..|+.-|++.++.|.|. ||.|.+. .+ T Consensus 542 T~~~~~s~~~~i~vrR~~~~i~~si~~~~~~~v~e----pn~~~l~~~i~~~i~~~L~~l~~~gal~---g~~v~~d-~~ 613 (663) T protein:vir:10 542 MATQVPSPFDRINVRRLFNMLKKNIGDTSKYELFE----NNDAFTRQSFRMETSQYLDGIRSLGGCY---DFRVVCD-TT 613 (663) T ss_pred ccCCCCcccceEehhhHHHHHHHHHHHHHHHhccC----CCCHHHHHHHHHHHHHHHHHHHhCCcee---eeEEEEc-CC Confidence 8765 25778888899999988888764433 7788889999999999999999999997 5999987 77 Q ss_pred hCCHHHHhccccCCeEEEEEECceEEEEEEEEEEec Q lcl|NC_019918. 
393 SMSPNMRAQRIFEGIEFEARLAGAIHFVHIRGTVTV 428 (428) Q Consensus 393 ~~~~~dra~R~~~~i~~~~~~agaih~v~i~~~v~~ 428 (428) +.|++|+.++++. +.+.+.+...+++|.++..-+- T Consensus 614 ~nt~~~i~~G~~~-~~i~~~p~~pae~i~~~~~~~~ 648 (663) T protein:vir:10 614 NNTPNVIDRNEFV-GTIYVKPPRSINYITLNMVATS 648 (663) T ss_pred CCCHHHhhCCeEE-EEEEEEecCCcceEEEEEEEee Confidence 8899999999986 9999999999999999876554 No 63 >protein:vir:101187 Length: 663 # NCBI annotation: tail sheath protein # Family: family:all:661 # MgeID: mge:1582 # MgeName: 44RR2.8t # Cross-refs: genbank:acc:NP_932509;genbank:gi:37651635;genbank:GeneID:2610680 Probab=96.27 E-value=0.0008 Score=37.57 Aligned_cols=417 Identities=13% Similarity=0.108 Sum_probs=205.5 Q ss_pred CCCCCceEEEeeeeecccccccccceEEEEcccC-CCccceEEeeCHHHHHhhcCC---ChHHHHHHHHHHhcCCcccEE Q lcl|NC_019918. 1 MTVLTDVIDIQISRETAAVAQTNFNVPLFIASHT-NFSERARVYNSLKGVAEDFGE---SDPTYLAAVRYFGQALKPRSL 76 (428) Q Consensus 1 M~~is~iV~V~i~~~~~~~~~~~f~~~li~~~~~-~~~~~~~~y~s~~~V~~~fg~---~s~eY~aA~~~F~q~p~P~~l 76 (428) |.+++.=|-|.--=.+.++....-+...|+|... .|.+.-...+|..|....||. .+.++.+...+|-+.- .++ T Consensus 1 ~~~~~PgVyv~e~~~~~~i~~v~ts~~~fvG~~~~Gp~~~p~~i~s~~d~~~~fG~~~~~~~~~~~v~~~f~ngg--~~~ 78 (663) T protein:vir:10 1 MALLSPGIEMKETSINSTVVRSATGRAAIVGKFAWGPAYEVRQVTNEVELVDMFGSPDNVTAPYFMSAMNFLQYG--NDL 78 (663) T ss_pred CceecCceEEEEecCcccccccCccceeEEeeeccCCCCccEEecCHHHHHHHhCCcCccchhHHHHHHHHHhCC--CeE Confidence 9999887777422245566666777777888764 455666778899999999997 6666677777776543 366 Q ss_pred EEEeeecc---cccccc------hhe------eeccccccc--------------ccc--eeeee--eec-ccchhhhhh Q lcl|NC_019918. 77 VIGRRQVP---SATVSV------SVV------QEGQSYVLT--------------VNG--LPVSY--VSH-QDDTATLIA 122 (428) Q Consensus 77 ~igr~~~~---~~~~~~------~~~------~~~~~~~~~--------------v~g--~~~s~--~~~-~~~~a~~i~ 122 (428) +|-|-... .....+ ... ..+...... +.+ ..... ... ....+..+. 
T Consensus 79 ~vvRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~g~~~~v~~~~~~~~~~~~~~~~~~n~~~~~v~~~~a~~~~~~~~v~ 158 (663) T protein:vir:10 79 RLVRVIDMEKAKNASPLVNQVSVTITTEGQGYTVGDAITVKYNNATITEAGKVTAVDSDGKIKSLFVPTAEIIAKTRQLG 158 (663) T ss_pred EEEEccCCcccccccccCCcceeeeeccccCccccccccccccccccccccceeeeccCCceEEEEeccccccccccccc Confidence 66664211 000000 000 000000000 000 00000 000 000000000 Q ss_pred h--h--------hee--------------eecccceEEEEeeccccc---------------eeeeeccccccccc---- Q lcl|NC_019918. 123 T--G--------LKA--------------AYDVTPVVGVTVTDNEDG---------------TLTVASNGDWSLKV---- 159 (428) Q Consensus 123 a--~--------l~~--------------a~~~~~~~~~~~tt~~~~---------------~~t~as~~~~~~~~---- 159 (428) . . +.. ..+............... ..+....+.++... T Consensus 159 ~~~~~~~~~~~~~~~~~~~~~~~~~v~~vv~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~a~~~G~~Gn~i~v~i 238 (663) T protein:vir:10 159 TYPTLGDNWRIDVSGASGGSAAALALGNIVVDSGVTFGNSEDAPAVMTSPAVMEKYAKFGMPLVSAVYPGEIGSTVEVEI 238 (663) T ss_pred eeeeccccceeEeeeccccccccccccceecccceeeEeeccccccccccchhhhcccccceeeeeecccccccceeEEe Confidence 0 0 000 000000000000000000 00000000000000 Q ss_pred ----------------------c--------------------------------------------------------c Q lcl|NC_019918. 160 ----------------------S--------------------------------------------------------S 161 (428) Q Consensus 160 ----------------------s--------------------------------------------------------~ 161 (428) . . T Consensus 239 ~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~~~~~~~~~~~~~~~~~~ 318 (663) T protein:vir:10 239 VSKTAFNSGAQQTIYPFGGTRTSNARGVIQYGPMTDDQFAIIVRRDGIVVESTVLSTRKGDRDVYGSNIFMDDYFRNGGS 318 (663) T ss_pred cccccccccccccccccccccccccceeeeeccccccceeEEEecCCcceeeeeeeecccccccchhhhhhhhhhccCcc Confidence 0 0 Q ss_pred eE----------------EEEeecc------ccCHHHHHHHHHhccc-CceEEEEec---CCHHHH----HHHHHHHhhh Q lcl|NC_019918. 
162 NL----------------TMAAAPS------TEGWPATITAVQGEND-EWYALSIDS---HADDDI----MAVATHIEGT 211 (428) Q Consensus 162 ~~----------------~~~~~~a------a~~~~~al~~~~~~~~-~w~~~~~~~---~~~~~~----~ala~~~~a~ 211 (428) .+ .+..+.. ..+...++..+.+... +...+.+.. ...++. .+|-..++.. T Consensus 319 ~~~~~~~~~~~~~~~~~~~l~gg~d~~~~~~~~~~~~~~~~l~~~~~~~~~~~i~~~~~~~~~~~~~~v~~al~~~a~~~ 398 (663) T protein:vir:10 319 NFIFASSEGWPAGFTGIIQLGGGTSANADVGADELIKGWDLFSDREALHVNLMIAGACGSDGAEIASTVQKYVVSLADDR 398 (663) T ss_pred eEEEEeecccCccccceeEcccccCCCccccchhhHHHHHhhhcccccceeEEEeccCCCCchhhHHHHHHHHHHHHHhh Confidence 00 0000000 0001111122211110 111111111 111111 2233333333 Q ss_pred CCEEEEEecCcccc-----cchhHHHHHHHHh-----------cccCce-EEEecCC-------c---cchhHHHHHHHH Q lcl|NC_019918. 212 KKVFIGATAQANTK-----TSAENDIASRLVA-----------AGFQRT-ALIYHPN-------A---DAQFPECAWVGY 264 (428) Q Consensus 212 ~~~~~~~~~~~~~~-----~~~~~~~~~~l~~-----------~~~~~t-~~~y~~~-------~---~~~~~~a~~~~~ 264 (428) ..++.+.-...... .....++...-+. .+++.. ..+|++- . -...+.+.++|. T Consensus 399 ~~~~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~l~~P~~~~~d~~~~~~~~~p~s~~vAGl 478 (663) T protein:vir:10 399 QDCVAIVNPPAELMVGIPTSTAVKNIVEWRNGMTGSGEVVDNNMNISSTYAFIIGNYKYQYDKYNDINRWVPLAADIAGL 478 (663) T ss_pred CCEEEEEecCcccccccccccchHHHHHHHHhccccccchhhccccCcceEEEEccceEEecccCCceEEechhHHHHHH Confidence 33443332211110 0111222211111 111111 2233321 1 113466777777 Q ss_pred HhccCCCc---eeeeeeeecCcc-----ccCCCHHHHHHHHhCCceEEEEEcC-ce-eeecCEecCC-----chhHHHHH Q lcl|NC_019918. 265 QLQEQPGS---NTWTHKALAAVD-----AYRLTPTESTNLKNKNVTTFERVGG-VN-RTFGGAMAGG-----EWIDVMIF 329 (428) Q Consensus 265 ~~~~~~g~---~t~~fk~~~Gv~-----~~~~t~t~~~~l~~~~~n~y~~~~~-~~-~~~~G~~~~G-----~~iD~~~~ 329 (428) +...+.-+ .....|.+.++. 
...+++.|++.|..+|+|++..+-+ .+ .++..+|+++ .||-+.+- T Consensus 479 ~Ar~D~~~g~~~sPan~~~~~i~g~~~~~~~~~~~~~~~Ln~~gin~i~~~~~~~G~~~wG~rT~s~~~s~~~~i~vrR~ 558 (663) T protein:vir:10 479 CAYTDQVSHPWMSPAGYRRGQIRNCIKLAIEPKQSMRDTMYQVAINPVTGFAGGDGFVLFGDKMATQVPSPFDRINVRRL 558 (663) T ss_pred HHHhhccCCceEccCCceeccccccccceeccChhHHHHHhhCCceEEEEEeCCCcEEEEcccccCCCCcccceEehhhH Confidence 66555322 122344433332 2458999999999999999998765 44 4788888765 25677788 Q ss_pred HHHHHHHHHHHHHHHHHhcCCCCcCHhHHHHHHHHHHHHHHHHHhcCceecCCceEEEeCchHhCCHHHHhccccCCeEE Q lcl|NC_019918. 330 VDWLEARMTERLWFRMANSKKIPYDAVGATILESEIRAQLNEGIRVGGLAEAPAPKVFVPDVLSMSPNMRAQRIFEGIEF 409 (428) Q Consensus 330 ~dwl~~~lq~~l~~ll~~~~kip~~~~G~~~i~~~i~~~~~~~~~~G~I~~g~~~~v~~~~~~~~~~~dra~R~~~~i~~ 409 (428) .+|+.+.|+..+...+-. |.|+.-...|+..|+.-|++.+++|.|. ||.|.+. .++.+++|+.++++. +.+ T Consensus 559 ~~~i~~si~~~~~~~v~e----~n~~~l~~~i~~~i~~~L~~l~~~gal~---g~~v~~d-~~~nt~~~i~~G~~~-~~i 629 (663) T protein:vir:10 559 FNMLKKNIGDTSKYELFE----NNDAFTRQSFRMETSQYLDGIRSLGGCY---DFRVVCD-TTNNTPNVIDRNEFV-GTI 629 (663) T ss_pred HHHHHHHHHHHHHHhccC----CCCHHHHHHHHHHHHHHHHHHHhcCcee---eeEEEEc-CCCCCHHHhhCCeEE-EEE Confidence 889888888887764433 7788899999999999999999999997 5999987 667899999999988 999 Q ss_pred EEEECceEEEEEEEEEEec Q lcl|NC_019918. 410 EARLAGAIHFVHIRGTVTV 428 (428) Q Consensus 410 ~~~~agaih~v~i~~~v~~ 428 (428) .+++...+++|.++..-.- T Consensus 630 ~~~p~~pae~i~~~~~~~~ 648 (663) T protein:vir:10 630 YVKPPRSINYITLNMVATS 648 (663) T ss_pred EEEecCCcceEEEEEEEee Confidence 9999999999999866554 No 64 >protein:vir:78782 Length: 370 # NCBI annotation: putative tail sheath protein # Family: family:all:669 # MgeID: mge:1857 # MgeName: phiO18P # Cross-refs: genbank:acc:YP_001285652;genbank:gi:148727158;genbank:GeneID:5220132 Probab=95.52 E-value=0.0019 Score=35.48 Aligned_cols=337 Identities=11% Similarity=0.006 Sum_probs=158.8 Q ss_pred CCCCCceEEEeeeeecccccccccceEEEEcccCCCccceEEeeCHHHHHhhcCCChHHHHHHHHHHhcCCcccEEEEEe Q lcl|NC_019918. 
1 MTVLTDVIDIQISRETAAVAQTNFNVPLFIASHTNFSERARVYNSLKGVAEDFGESDPTYLAAVRYFGQALKPRSLVIGR 80 (428) Q Consensus 1 M~~is~iV~V~i~~~~~~~~~~~f~~~li~~~~~~~~~~~~~y~s~~~V~~~fg~~s~eY~aA~~~F~q~p~P~~l~igr 80 (428) |=|-=.|-+.+..-.+...-. .-.||+|....-.+.+......+|+.+-+|..+-+.|.=.+.+.-.. ++-+++ T Consensus 1 ~~~~v~vn~~n~~~g~~~~~e---r~~lfig~~~~~~g~~~~~~~~sdld~~l~~~ds~lk~~v~aa~~na--G~~~~~- 74 (370) T protein:vir:78 1 MWPYVQIYNLNQMQGPVTEVE---RHLLFIGSAASNTGKLLSLNAQSDFDQLLGAADSELKANLLAARDNA--GQNWSA- 74 (370) T ss_pred CCceEEEeeccccCCCcCccc---eeEEEEecccccccceEeecCccCHHHhcCCcChhHHHHHHHHHhCC--CCceEE- Confidence 877222222222222222222 34677777665567777788888888888888777765444433221 000110 Q ss_pred eecccccccchheeecccccccccceeeeeeecccchhhhhhhhheeeecccceEEEEeeccccceeeeecccccccccc Q lcl|NC_019918. 81 RQVPSATVSVSVVQEGQSYVLTVNGLPVSYVSHQDDTATLIATGLKAAYDVTPVVGVTVTDNEDGTLTVASNGDWSLKVS 160 (428) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~v~g~~~s~~~~~~~~a~~i~a~l~~a~~~~~~~~~~~tt~~~~~~t~as~~~~~~~~s 160 (428) .+.... ......+++..+ ... .+ T Consensus 75 -----------------------------~~~p~~-~~~d~~~Av~~a-~~~--------------------------~s 97 (370) T protein:vir:78 75 -----------------------------AAYVLP-TDKPWLDAARDA-QQT--------------------------QS 97 (370) T ss_pred -----------------------------EEEEec-CchhHHHHHHHH-Hhh--------------------------CC Confidence 000000 000011111110 000 00 Q ss_pred ceEEEEeecc--cc---CHHHHHHHHHhcccCceEEEEecCCHHHHHHHHHHHhhhCCEEEEEecCcccccchhHHHHHH Q lcl|NC_019918. 161 SNLTMAAAPS--TE---GWPATITAVQGENDEWYALSIDSHADDDIMAVATHIEGTKKVFIGATAQANTKTSAENDIASR 235 (428) Q Consensus 161 ~~~~~~~~~a--a~---~~~~al~~~~~~~~~w~~~~~~~~~~~~~~ala~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~ 235 (428) ...+...+++ .. ...+....+.+...-|.|+.++.+..++-...++|..+- ... 
T Consensus 98 ~E~V~v~~~~s~~a~~~a~~~~a~el~n~~~Rpv~file~~~~~~~e~w~~y~~~l---------------------~al 156 (370) T protein:vir:78 98 FEGVVVLGQEWHQAAINAAHALNQELIAKWGRWQFMLLAVPAIADEQDWATYEAEL---------------------ATL 156 (370) T ss_pred ccEEEEecCcchHHHHHHHHHHHHHHHHhcCCeEEEEEeecCCCCcCCHHHHHHHH---------------------HHh Confidence 0000011100 00 011111222222223444444433322222333333211 000 Q ss_pred HHhcccC--ceEEEecCCccchhHHHHHHHHHhcc------CCCcee-eeee---eec-CccccCCCHHHHHHHHhCCce Q lcl|NC_019918. 236 LVAAGFQ--RTALIYHPNADAQFPECAWVGYQLQE------QPGSNT-WTHK---ALA-AVDAYRLTPTESTNLKNKNVT 302 (428) Q Consensus 236 l~~~~~~--~t~~~y~~~~~~~~~~a~~~~~~~~~------~~g~~t-~~fk---~~~-Gv~~~~~t~t~~~~l~~~~~n 302 (428) -+..... ..++.+|... .+.++|+...+ .|+++. -.-+ .+| .-....++...+++|+++|+. T Consensus 157 ~~gia~~~V~vvp~~~g~~-----~G~~aGRL~naavsVadsP~Rv~tG~l~gl~~~p~d~~~~~l~~a~l~aLd~agy~ 231 (370) T protein:vir:78 157 QDGIAASSVSLIPQLWPTL-----AGAYAGRLCNRAVSIADSPCRVKTGALVGLGNKPVGKDGIPLPLATLQTLEANRYS 231 (370) T ss_pred hhccccccceEEeeecccc-----HHHHHHHHhcCeeeecccceeeeccccccccccccccCCcccCHHHHHHHHhCCCe Confidence 0111112 2344555432 46666764322 233221 1111 111 012245888999999999999 Q ss_pred EEEEEcCc-ee-eecCEecC---CchhHHHHHHHHHHHHHHHHHHHHHHhcC-CCCcCHhHHHHHHHHHHHHHHHHHhcC Q lcl|NC_019918. 303 TFERVGGV-NR-TFGGAMAG---GEWIDVMIFVDWLEARMTERLWFRMANSK-KIPYDAVGATILESEIRAQLNEGIRVG 376 (428) Q Consensus 303 ~y~~~~~~-~~-~~~G~~~~---G~~iD~~~~~dwl~~~lq~~l~~ll~~~~-kip~~~~G~~~i~~~i~~~~~~~~~~G 376 (428) +...|.|- ++ ..+|.|+. 
|+|=-.-+.+.+-|..-+..+.-+..-.+ .+==++..++..+.....+|+++.+.+ T Consensus 232 vp~~Y~gy~G~Y~~d~~tl~~~gsDYq~ie~~RVvdKa~R~vR~~ai~~i~D~~lnst~gsia~~~~~~~~~L~ema~s~ 311 (370) T protein:vir:78 232 VPMWYPDYDGIYWADGRTLDAEGGDYQVIENLRIAYKVARRMRLRAIARIGDRSFNSTPGSTAAAITYFGKDLREMAKST 311 (370) T ss_pred EEEeeCCCCceEEeCceEeccCCCChhhhhhhhHHHHHHHHHHHHHHHHhCCcccCCCCcchhHHHHHHHhhHHHHHhhh Confidence 99999884 34 45788874 45544445555555554444443222222 222244677888888999999988888 Q ss_pred ceecC--CceEEEeCchHhCCHHHHhccccCCeEEEEEECceEEEEEEEEEEec Q lcl|NC_019918. 377 GLAEA--PAPKVFVPDVLSMSPNMRAQRIFEGIEFEARLAGAIHFVHIRGTVTV 428 (428) Q Consensus 377 ~I~~g--~~~~v~~~~~~~~~~~dra~R~~~~i~~~~~~agaih~v~i~~~v~~ 428 (428) -|..- +| +|..|.=.++..+-...++. .|.+.+++=|.-..|++++-+.+ T Consensus 312 ~i~~~~fpg-eI~~p~d~Di~i~w~s~~~v-~I~~~v~P~~~pk~Itv~I~LDl 363 (370) T protein:vir:78 312 TINGQPFPG-DIASPQDGDIRIQWVAKNLV-SVFVVVRTVDCPKGITVNIMLDL 363 (370) T ss_pred hhcccccce-eEeccCCCcceEEeeccceE-EEEEEEEeccCCceEEEEEEEee Confidence 88642 33 55555433333333344444 48888888888888888777777 No 65 >protein:vir:3788 Length: 376 # NCBI annotation: tail sheath # Family: family:all:669 # MgeID: mge:328 # MgeName: HP2 # Cross-refs: genbank:acc:NP_536828;genbank:gi:17981837;genbank:GeneID:929216 Probab=92.83 E-value=0.0098 Score=31.59 Aligned_cols=331 Identities=11% Similarity=0.041 Sum_probs=154.8 Q ss_pred CCCCCceEEEeeeeecccccccccceEEEEcccCCCccceEEeeCHHHHHhhcCCChHHHHHHHHHHhcCCcccEEEEEe Q lcl|NC_019918. 1 MTVLTDVIDIQISRETAAVAQTNFNVPLFIASHTNFSERARVYNSLKGVAEDFGESDPTYLAAVRYFGQALKPRSLVIGR 80 (428) Q Consensus 1 M~~is~iV~V~i~~~~~~~~~~~f~~~li~~~~~~~~~~~~~y~s~~~V~~~fg~~s~eY~aA~~~F~q~p~P~~l~igr 80 (428) |=|-=.|-+.+..-.+...-. .-.||+|......+.+......+|+-.-+|..+-+.|.=.+.+.-.. 
T Consensus 1 ~~~~v~vn~~n~~~g~~~~~e---r~~Lfig~~~~~~~~~~~~~~~sdld~~lg~~~~~lk~~v~aa~~na--------- 68 (376) T protein:vir:37 1 MFPSVQINALNQLSGETKEIE---RHALFVGVGTTNQGKLLALTPDSDFDKVFGETDTDLKKQVRAAMLNA--------- 68 (376) T ss_pred CCCeEEEecccccCCCccccc---ceEEeeccccccccceeeecCccchHhhhCCCchHHHHHHHHHHhCC--------- Confidence 776211222221112222222 34577777665566777777777776667777666664332222110 Q ss_pred eecccccccchheeecccccccccceeeeeeecccchhhhhhhhheeeecccceEEEEeeccccceeeeecccccccccc Q lcl|NC_019918. 81 RQVPSATVSVSVVQEGQSYVLTVNGLPVSYVSHQDDTATLIATGLKAAYDVTPVVGVTVTDNEDGTLTVASNGDWSLKVS 160 (428) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~v~g~~~s~~~~~~~~a~~i~a~l~~a~~~~~~~~~~~tt~~~~~~t~as~~~~~~~~s 160 (428) ++.+...+. T Consensus 69 ---------------G~~~~~~~~-------------------------------------------------------- 77 (376) T protein:vir:37 69 ---------------GQNWFAHVY-------------------------------------------------------- 77 (376) T ss_pred ---------------CCcEEEEEE-------------------------------------------------------- Confidence 000000000 Q ss_pred ceEEEEeeccccCHHHHHHHHHhcccCceEEEEecC---CHHHHHHH---HHHHhhhCCEEEE--EecC-cc---cccch Q lcl|NC_019918. 161 SNLTMAAAPSTEGWPATITAVQGENDEWYALSIDSH---ADDDIMAV---ATHIEGTKKVFIG--ATAQ-AN---TKTSA 228 (428) Q Consensus 161 ~~~~~~~~~aa~~~~~al~~~~~~~~~w~~~~~~~~---~~~~~~al---a~~~~a~~~~~~~--~~~~-~~---~~~~~ 228 (428) ......++..+++.... ...++.+..+... +.+++.++ +.....+-.++.+ .... -+ ..... T Consensus 78 -----~~~~~~~~~~~Av~~a~-~~~s~E~V~v~~pv~t~~a~i~aa~~~a~el~~~~~Rpv~file~r~~~~~~~~~e~ 151 (376) T protein:vir:37 78 -----IAQEDGYDFVECVKKAN-QTASFEYCVNTRYLGVDKASIGKLQECYAELLAKFGRRTFFIQAVQGINHDQSDGET 151 (376) T ss_pred -----eecCCchHHHHHHHHhh-hhcCceEEEEeccccccHHHHHHHHHHHHHHHHhcCCeEEEEEeccCcCcccccccC Confidence 00011112222222221 1122222222211 12222222 2222222122211 1110 00 00111 Q ss_pred hHHHHHHHHh----cccCce--EEEecCCccchhHHHHHHHHHh--c----cCCCcee-eeee-----e-ecCccccCCC Q lcl|NC_019918. 
229 ENDIASRLVA----AGFQRT--ALIYHPNADAQFPECAWVGYQL--Q----EQPGSNT-WTHK-----A-LAAVDAYRLT 289 (428) Q Consensus 229 ~~~~~~~l~~----~~~~~t--~~~y~~~~~~~~~~a~~~~~~~--~----~~~g~~t-~~fk-----~-~~Gv~~~~~t 289 (428) ..+....+.+ -...+. ++..|.+ ..+.++|+.. . ..||++. -.-. . ........++ T Consensus 152 w~~y~~~~~al~~gia~~~V~~V~~~~gn-----~~G~~aGRl~~aaVsVadspgRV~tG~l~gl~~~~lp~d~~~~~l~ 226 (376) T protein:vir:37 152 WDQYVQKLTTLQQTIVADHVCLVPLLFGN-----ETGVLAGRLANRAVTVADSPARVQTGALVSLGSANKPLDKDRNELT 226 (376) T ss_pred HHHHHHHHHHhhcccccccceeeeeehhh-----hHHHHHHHHhhcccchhhCccceeccccccccccccccCcCcccCC Confidence 1222222221 112222 3332321 2566667642 2 3566542 1111 1 1233445789 Q ss_pred HHHHHHHHhCCceEEEEEcCc-e-eeecCEecC---CchhHHHHHHHHHHHHHHHHH--HHHHHhcCCCCcCHhHHHHHH Q lcl|NC_019918. 290 PTESTNLKNKNVTTFERVGGV-N-RTFGGAMAG---GEWIDVMIFVDWLEARMTERL--WFRMANSKKIPYDAVGATILE 362 (428) Q Consensus 290 ~t~~~~l~~~~~n~y~~~~~~-~-~~~~G~~~~---G~~iD~~~~~dwl~~~lq~~l--~~ll~~~~kip~~~~G~~~i~ 362 (428) ...+.+|+++|+.+...|.|- + +..+|.|+. |+|=-.-+.+.+-|..=+.++ ...+.. ..+=-+..+++..+ T Consensus 227 ~a~l~aLd~agy~vp~~Y~gy~G~Y~~d~~tl~~~gsDY~~ie~~RVvdKa~R~vR~~ai~~i~D-~~lnst~~sia~~~ 305 (376) T protein:vir:37 227 LAHLKSLETARYSVPMWYPDYDGYYWADGRTLDVEGGDYQVIENLRVVDKVARKVRLLAIGKIAD-RSFNSTTSSTEYHK 305 (376) T ss_pred HHHHHHHHhCCCeEEEeeCCCCceEEeCceEeccCCCChhhhhhhhHHHHHHHHHHHHHHHHhCC-cccCcchhhHHHHH Confidence 999999999999999999884 3 445788875 455444445555555433333 232222 22223556788888 Q ss_pred HHHHHHHHHHHhcCceecC--CceEEEeCchHhCCHHHHhccccCCeEEEEEECceEEEEEEEEEEec Q lcl|NC_019918. 363 SEIRAQLNEGIRVGGLAEA--PAPKVFVPDVLSMSPNMRAQRIFEGIEFEARLAGAIHFVHIRGTVTV 428 (428) Q Consensus 363 ~~i~~~~~~~~~~G~I~~g--~~~~v~~~~~~~~~~~dra~R~~~~i~~~~~~agaih~v~i~~~v~~ 428 (428) +.+..+|+++.+.+.|..- +| +|..|.=.++..+-....+.. 
|.+.+++=|.-..|++++-+.+ T Consensus 306 ~yi~~pLr~M~~s~~i~g~~fpG-eI~~p~d~Di~i~w~s~~~V~-I~~~v~P~~~pk~Itv~I~Ldl 371 (376) T protein:vir:37 306 NYFAKPLRDMSKSATINGKDFPG-ECMPPKDDAITIVWQSKTKVT-IYIKVRPYDCPKEITANIFLDL 371 (376) T ss_pred HHHHHHHHHHHhcchhccccccc-eeecCCCCCceEEeeccceEE-EEEEEEeccCCceEEEEEEeec Confidence 8899999999887776532 22 466655334443333333333 7777778888888888877777 No 66 >protein:vir:276 Length: 369 # NCBI annotation: putative tail sheath protein # Family: family:all:669 # MgeID: mge:7 # MgeName: K139 # Cross-refs: genbank:acc:NP_536656;genbank:gi:17975134;genbank:GeneID:929090 Probab=83.00 E-value=0.072 Score=26.84 Aligned_cols=323 Identities=11% Similarity=0.045 Sum_probs=141.7 Q ss_pred CcccEEEEEeeecccccccchheeecccccccccceeeeeeecccchhhhhhhhheeeecccceEEEEeeccccceeeee Q lcl|NC_019918. 71 LKPRSLVIGRRQVPSATVSVSVVQEGQSYVLTVNGLPVSYVSHQDDTATLIATGLKAAYDVTPVVGVTVTDNEDGTLTVA 150 (428) Q Consensus 71 p~P~~l~igr~~~~~~~~~~~~~~~~~~~~~~v~g~~~s~~~~~~~~a~~i~a~l~~a~~~~~~~~~~~tt~~~~~~t~a 150 (428) -.=.+|-|..-+ .++-.+..|....+ +. ..++++ .+.........++.-+.....+ T Consensus 1 m~~~~V~in~~n------------~~qg~~~~ver~~l-fi--g~g~~~---------~~~g~~~~~~~~sdld~~lg~~ 56 (369) T protein:vir:27 1 MAWPTVIIKILN------------LMNGPIADIECHFL-FV--IRGTVS---------GEVRNLIMVDSTSDLDDVLAEA 56 (369) T ss_pred CCCCceEEeccc------------ccCCCcccccceEE-EE--Eecccc---------ccccceEEecCccchHhhcCCc Confidence 000011110000 00000000100000 00 000000 0000000000000000000000 Q ss_pred --------ccccccccccceEEEEeeccccCHHHHHHHHHhcccCceEEEEecC--CHHHHHHHHHHHh---hhCCEE-E Q lcl|NC_019918. 151 --------SNGDWSLKVSSNLTMAAAPSTEGWPATITAVQGENDEWYALSIDSH--ADDDIMAVATHIE---GTKKVF-I 216 (428) Q Consensus 151 --------s~~~~~~~~s~~~~~~~~~aa~~~~~al~~~~~~~~~w~~~~~~~~--~~~~~~ala~~~~---a~~~~~-~ 216 (428) .+.-......-...+......++..+++.... ....+.+..+... 
+.+++.++....+ .+-.++ + T Consensus 57 ds~lk~~v~aa~~naG~~w~a~~~p~~~~~~~~~Av~~a~-~~~s~E~V~v~~p~t~~a~i~aaq~~a~el~~~~~R~vf 135 (369) T protein:vir:27 57 SAEGLAIVKAAQLNGKQAWTAGVMILSEEDNWQDAVKKAN-EVSSFEFVVLGFDAETKAMIEDAITLRTELKNSLGREVG 135 (369) T ss_pred ChhHHHHHHHHHhCCCCceEEEEEEeCCchhHHHHHHhhh-hhCCccEEEEecCcccHHHHHHHHHHHHHHHHhcCCeEE Confidence 00000001111223344455566777776553 3345556655543 3466665544433 222222 3 Q ss_pred EEec----Ccc-cccchhHHHHHHH----HhcccCce--EEEecCCccchhHHHHHHHHHhcc------CCCcee-e--- Q lcl|NC_019918. 217 GATA----QAN-TKTSAENDIASRL----VAAGFQRT--ALIYHPNADAQFPECAWVGYQLQE------QPGSNT-W--- 275 (428) Q Consensus 217 ~~~~----~~~-~~~~~~~~~~~~l----~~~~~~~t--~~~y~~~~~~~~~~a~~~~~~~~~------~~g~~t-~--- 275 (428) +... +.. ....+.++....+ +.....+. ++.++...+ ..+.++|+.... .||++- - T Consensus 136 fi~e~~~~~~~~~~~e~w~dy~a~l~al~~g~a~~~V~vv~~~~~~gn---~~G~~aGRl~n~aVsIadsp~RVktG~l~ 212 (369) T protein:vir:27 136 VLCQLPAINNDPTNGQTWSEWLADTVDIPKDVASEYISVVPNVHAAGD---TLGKYAGRLANKEVSIADSPARVQTGSVL 212 (369) T ss_pred EEEeccccCCCccccCCHHHHHHHHHHHhhccCcccceeeeeeccccc---hHHHHHHHHHhcccchhcCcceeeecccc Confidence 3221 111 1112223332222 22234444 344553222 246666774322 345431 1 Q ss_pred eeeeecCcccc--CCCHHHHHHHHhCCceEEEEEcCc-ee-eecCEecC---CchhHHHHHHHHHHHHHHHHHHHHH-Hh Q lcl|NC_019918. 276 THKALAAVDAY--RLTPTESTNLKNKNVTTFERVGGV-NR-TFGGAMAG---GEWIDVMIFVDWLEARMTERLWFRM-AN 347 (428) Q Consensus 276 ~fk~~~Gv~~~--~~t~t~~~~l~~~~~n~y~~~~~~-~~-~~~G~~~~---G~~iD~~~~~dwl~~~lq~~l~~ll-~~ 347 (428) -.+.+| +.++ .++.+.+.+|+++|+.+...|.|- ++ ..+|.|++ |+|=-.-+.+.+-|..=+.++..+- +. 
T Consensus 213 g~~~~p-~d~~g~~l~~a~l~aLd~agysvp~~Y~gy~G~Yw~d~~tl~~~gsDYq~iE~~RVvdKa~R~vR~~Ai~~i~ 291 (369) T protein:vir:27 213 GNTELM-KDKAGKALDLATLKALESNRIAVPMWYPDYPGQYWTTGRTLDVPGGDYQDIRHIRVAMKAARKVRIRAIARIA 291 (369) T ss_pred cccccc-cCCCCcccCHHHHHHHHhCCCeEEEeeCCCCceEEeCceEeccCCCCeehhhhhhHHHHHHHHHHHHHHHHhc Confidence 111222 1122 378899999999999999999984 34 45788875 4554445555555555554444432 33 Q ss_pred cCCCCcCHhHHHHHHHHHHHHHHHHHhcCceecCCceEEEeCchHhCCHHHHhccccCCeEEEEEECceEEEEEEEEEEe Q lcl|NC_019918. 348 SKKIPYDAVGATILESEIRAQLNEGIRVGGLAEAPAPKVFVPDVLSMSPNMRAQRIFEGIEFEARLAGAIHFVHIRGTVT 427 (428) Q Consensus 348 ~~kip~~~~G~~~i~~~i~~~~~~~~~~G~I~~g~~~~v~~~~~~~~~~~dra~R~~~~i~~~~~~agaih~v~i~~~v~ 427 (428) ...+.-+..+++..+..+..+|+++.+-+ -|| +|..|.-.++.-+ ...+.--.|.+..++=+.=..|++++-+. T Consensus 292 Dr~lnstp~sia~~~~~~~~pLr~M~ks~--fpg---ei~~P~d~dI~i~-w~~k~~V~I~~~vrP~~~pk~it~~I~ld 365 (369) T protein:vir:27 292 DRTLNSTPQSIAAAKLYFTQDLRTMALTG--VPG---EIYPPEDEDIQIK-WVNSTDVEIYMSVQPYECPVKITIAISVK 365 (369) T ss_pred CcccccChhHHHHHHHHHhhHHHHHHhhc--CCe---EEecCCCCceEEE-eeccceEEEEEEEeeccCCceEEEEEEEe Confidence 45578889999999999999999987553 233 3565543333221 11222333555566666666777777777 Q ss_pred c Q lcl|NC_019918. 428 V 428 (428) Q Consensus 428 ~ 428 (428) + T Consensus 366 l 366 (369) T protein:vir:27 366 Q 366 (369) T ss_pred c Confidence 7 No 67 >protein:vir:3751 Length: 376 # NCBI annotation: orf23 # Family: family:all:669 # MgeID: mge:79 # MgeName: HP1 # Cross-refs: genbank:acc:NP_043492;genbank:gi:9628627;genbank:GeneID:1261129 Probab=81.90 E-value=0.082 Score=26.55 Aligned_cols=328 Identities=11% Similarity=0.063 Sum_probs=157.7 Q ss_pred CCCCCceEEEe-eeeecccccccccceEEEEcccCCCccceEEeeCHHHHHhhcCCChHHHHHHHHHHhcCCcccEEEEE Q lcl|NC_019918. 
1 MTVLTDVIDIQ-ISRETAAVAQTNFNVPLFIASHTNFSERARVYNSLKGVAEDFGESDPTYLAAVRYFGQALKPRSLVIG 79 (428) Q Consensus 1 M~~is~iV~V~-i~~~~~~~~~~~f~~~li~~~~~~~~~~~~~y~s~~~V~~~fg~~s~eY~aA~~~F~q~p~P~~l~ig 79 (428) |=|- |+|+ +++.-.+...- =.-.||+|......+.+......+|+-.-||..+-+.|.=.+.+.-. T Consensus 1 ~~~~---v~vn~ln~~qg~~~~v-er~~lfig~~~~~~~~~~~~~~~sdld~~lg~~ds~lk~~v~aa~~n--------- 67 (376) T protein:vir:37 1 MFPS---VQINALNQLSGETKEI-ERHALFVGVGTTNQGKLLALTPDSDFDKVFGETDTDLKKQVRAAMLN--------- 67 (376) T ss_pred CCCe---EEEeeeeccCCCcccc-cceEEEeeccccccCceEEecCCCChHHhhCCCchhHHHHHHHHHhC--------- Confidence 7662 3332 12211111111 13467787776666777777777777777787766665322221100 Q ss_pred eeecccccccchheeecccccccccceeeeeeecccchhhhhhhhheeeecccceEEEEeeccccceeeeeccccccccc Q lcl|NC_019918. 80 RRQVPSATVSVSVVQEGQSYVLTVNGLPVSYVSHQDDTATLIATGLKAAYDVTPVVGVTVTDNEDGTLTVASNGDWSLKV 159 (428) Q Consensus 80 r~~~~~~~~~~~~~~~~~~~~~~v~g~~~s~~~~~~~~a~~i~a~l~~a~~~~~~~~~~~tt~~~~~~t~as~~~~~~~~ 159 (428) +++.+... T Consensus 68 ---------------aG~~w~a~--------------------------------------------------------- 75 (376) T protein:vir:37 68 ---------------AGQNWFAH--------------------------------------------------------- 75 (376) T ss_pred ---------------CCCceEEE--------------------------------------------------------- Confidence 00000000 Q ss_pred cceEEEEeeccccCHHHHHHHHHhcccCceEEEEecC---CHHHHHHHHHHH---hhhCCEE--EEEecC-cc---cccc Q lcl|NC_019918. 160 SSNLTMAAAPSTEGWPATITAVQGENDEWYALSIDSH---ADDDIMAVATHI---EGTKKVF--IGATAQ-AN---TKTS 227 (428) Q Consensus 160 s~~~~~~~~~aa~~~~~al~~~~~~~~~w~~~~~~~~---~~~~~~ala~~~---~a~~~~~--~~~~~~-~~---~~~~ 227 (428) ......+.++..+++.... ....+.|..+... +.+++.++.... ..+-.++ |..... -+ .... 
T Consensus 76 ----~~~p~~~~~~~~~Av~~a~-~~~s~E~V~v~~p~~t~~a~i~a~qa~a~el~~~~~R~vffile~~g~d~~~~~ge 150 (376) T protein:vir:37 76 ----VYIAQEDGYDFVECVKKAN-QTASFEYCVNTRYLGVDKASIGKLQECYAELLAKFGRRTFFIQAVQGINHDQSDGE 150 (376) T ss_pred ----EEecCCChhhHHHHHHHHH-hhCCeeEEEEecCcchhHHHHHHHHHHHHHHHHhcCCeEEEEEeccCCCCcccccC Confidence 0001112223344443332 2233444433332 233333332222 1211111 111111 00 0111 Q ss_pred hhHHHHHHHH----hcccCce--EEEecCCccchhHHHHHHHHHhcc------CCCce-eeeeeeecCcc-c-----cCC Q lcl|NC_019918. 228 AENDIASRLV----AAGFQRT--ALIYHPNADAQFPECAWVGYQLQE------QPGSN-TWTHKALAAVD-A-----YRL 288 (428) Q Consensus 228 ~~~~~~~~l~----~~~~~~t--~~~y~~~~~~~~~~a~~~~~~~~~------~~g~~-t~~fk~~~Gv~-~-----~~~ 288 (428) +..+....++ .....+. ++.+|.+ ..+.++|+.... .||++ |-.-+.+.-+. | ..+ T Consensus 151 ~w~~y~~~l~a~~~gia~~~V~vV~~~~gn-----~~G~~aGRl~naaVsVadspgRV~tGai~gl~~~~~p~d~~g~el 225 (376) T protein:vir:37 151 TWDQYVQKLTTLQQTIVADHVCLVPLLFGN-----ETGVLAGRLANRAVTVADSPARVQTGALVSLGSANKPLDKDGNEL 225 (376) T ss_pred CHHHHHHHHHHHhccccccceeeeeeeccc-----hHHHHHHHHHhCCcchhcCccceeecccccccccccccccCCccc Confidence 1222222222 1122333 3444442 356777775422 45554 22222211111 1 237 Q ss_pred CHHHHHHHHhCCceEEEEEcCc-e-eeecCEecC---Cc--hhHHHHHHHHHHHHHHHHHHHHHHhcCCCCcCHhHHHHH Q lcl|NC_019918. 289 TPTESTNLKNKNVTTFERVGGV-N-RTFGGAMAG---GE--WIDVMIFVDWLEARMTERLWFRMANSKKIPYDAVGATIL 361 (428) Q Consensus 289 t~t~~~~l~~~~~n~y~~~~~~-~-~~~~G~~~~---G~--~iD~~~~~dwl~~~lq~~l~~ll~~~~kip~~~~G~~~i 361 (428) +...+.+|+++|+.+.-.|.|- + +...|+|++ |+ +|...+=.|=...+++.....-+. ...+.-+..+++.. 
T Consensus 226 ~~a~l~aLd~arysvpr~Y~gydG~Yw~dg~tl~~~gsDYq~ie~~RVvdKa~R~vR~~Ai~~i~-Dr~lnstp~sia~~ 304 (376) T protein:vir:37 226 TLAHLKSLETARYSVPMWYPDYDGYYRADGRTLDVEGGDYQVIENLRVVDKVARKVRLLAIGKIA-DRSFNSTTSSTEYH 304 (376) T ss_pred chHHHHHHHhCCCeEEEeeCCCCceEEeCCeEeccCCCCeeeehhchHHHHHHHHHHHHHHHHhc-CccccCChhHHHHH Confidence 8899999999999999999884 3 345788874 34 566555555555555544443333 34467788899999 Q ss_pred HHHHHHHHHHHHhcCcee----cCCceEEEeCchHhCCHHHHhccccCCeEEEEEECceEEEEEEEEEEec Q lcl|NC_019918. 362 ESEIRAQLNEGIRVGGLA----EAPAPKVFVPDVLSMSPNMRAQRIFEGIEFEARLAGAIHFVHIRGTVTV 428 (428) Q Consensus 362 ~~~i~~~~~~~~~~G~I~----~g~~~~v~~~~~~~~~~~dra~R~~~~i~~~~~~agaih~v~i~~~v~~ 428 (428) +..+..+|+++.+-+=|. || .|..|+=.++.-. =..|.--.|.+..++=+.=..|++++-+.+ T Consensus 305 ~~~~~~pLr~M~ks~ei~g~~fpg---ei~~P~d~dI~i~-w~sk~~V~I~~~vrPy~cpk~i~~~I~LDl 371 (376) T protein:vir:37 305 KNYFAKPLRDMSKSATINGKDFPG---ECMPPKDDAITIV-WQSKTKVTIYIKVRPYDCPKEITANIFLDL 371 (376) T ss_pred HHHHhHHHHHHHhhhhhccccccc---eeecCCCCceEEE-eccCceEEEEEEEeeecCcceeEEEEEEec Confidence 999999999987765553 22 2444332222110 012333446666666666677777777777 Done!