Query lcl|NC_019421.1_cdsid_YP_006990491.1 [gene=D863_gp13] [protein=tail sheath] [protein_id=YP_006990491.1] [location=9671..11092] Match_columns 473 No_of_seqs 148 out of 338 Neff 9.1 Searched_HMMs 1612 Date Thu Nov 7 16:50:06 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_13 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_13_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:102957 Length: 437 100.0 1E-114 8E-118 645.0 47.5 431 1-473 1-437 (437) 2 protein:vir:78986 Length: 436 100.0 3E-111 2E-114 626.4 46.6 427 1-473 3-436 (436) 3 protein:vir:105470 Length: 451 100.0 3E-110 2E-113 621.3 47.7 441 1-473 1-451 (451) 4 protein:vir:99306 Length: 587 100.0 3.4E-93 2.1E-96 527.5 47.0 446 1-473 1-581 (587) 5 protein:vir:95741 Length: 587 100.0 4.6E-93 2.9E-96 526.8 46.6 446 1-473 1-581 (587) 6 protein:vir:63742 Length: 562 100.0 6.7E-91 4.1E-94 515.0 47.3 447 1-473 1-556 (562) 7 protein:vir:80488 Length: 562 100.0 6.4E-90 4E-93 509.6 47.4 447 1-473 1-556 (562) 8 protein:vir:80779 Length: 569 100.0 6.6E-89 4.1E-92 504.0 45.6 447 1-473 1-563 (569) 9 protein:vir:96586 Length: 587 100.0 3.1E-87 2E-90 494.8 43.8 449 1-473 1-581 (587) 10 protein:vir:100829 Length: 607 100.0 5.4E-82 3.3E-85 466.1 43.1 445 1-473 1-595 (607) 11 protein:vir:102819 Length: 648 100.0 4.7E-76 2.9E-79 433.6 39.7 451 1-473 1-644 (648) 12 protein:vir:101187 Length: 663 100.0 2.3E-70 1.4E-73 402.4 40.5 450 10-473 1-647 (663) 13 protein:vir:101804 Length: 663 100.0 2.4E-70 1.5E-73 402.3 40.4 450 10-473 1-647 (663) 14 protein:vir:106427 Length: 679 100.0 1.3E-69 7.9E-73 398.3 38.2 450 10-473 1-664 (679) 15 protein:vir:103456 Length: 659 100.0 6.6E-69 4.1E-72 394.4 39.8 450 10-473 1-645 (659) 16 protein:vir:7206 Length: 659 # 100.0 8.2E-69 5.1E-72 393.9 40.1 450 10-473 1-645 (659) 17 protein:vir:6894 Length: 660 # 100.0 5.8E-69 3.6E-72 394.7 37.5 454 10-473 1-645 (660) 18 protein:vir:108052 Length: 660 100.0 4.3E-69 2.7E-72 395.4 36.6 442 10-473 1-646 (660) 19 protein:vir:98824 Length: 774 100.0 1.2E-69 7.2E-73 398.5 33.2 438 1-473 270-766 (774) 20 protein:vir:106984 Length: 743 100.0 2E-68 1.2E-71 391.8 38.9 454 1-473 1-731 (743) 21 protein:vir:80984 Length: 666 100.0 1.6E-68 1E-71 392.2 37.6 454 10-473 1-650 (666) 22 protein:vir:6594 Length: 666 # 100.0 2.3E-68 1.4E-71 391.4 38.3 442 10-473 1-650 (666) 23 protein:vir:100539 Length: 663 100.0 2E-68 1.3E-71 391.7 37.5 450 10-473 1-647 (663) 24 protein:vir:98263 Length: 664 100.0 4.3E-68 2.7E-71 389.9 38.2 450 1-473 1-649 (664) 25 protein:vir:104858 Length: 729 100.0 1.7E-67 1.1E-70 386.6 38.9 455 9-473 1-716 (729) 26 protein:vir:102359 Length: 356 100.0 7.6E-68 4.7E-71 388.6 31.5 333 93-472 1-356 (356) 27 protein:vir:104477 Length: 749 100.0 7.1E-66 4.4E-69 377.8 41.1 455 2-473 1-738 (749) 28 protein:vir:5663 Length: 671 # 100.0 2.8E-65 1.7E-68 374.5 36.7 455 10-473 1-660 (671) 29 protein:vir:79092 Length: 477 100.0 2.5E-61 1.6E-64 352.8 36.0 439 1-473 1-466 (477) 30 protein:vir:107310 Length: 581 100.0 6.3E-61 3.9E-64 350.6 33.1 436 1-473 106-565 (581) 31 protein:vir:107865 Length: 477 100.0 9.1E-61 5.6E-64 349.8 32.8 433 1-473 1-466 (477) 32 protein:vir:7653 Length: 581 # 100.0 3.2E-60 2E-63 346.8 34.5 429 1-473 106-565 (581) 33 protein:vir:6079 Length: 396 # 100.0 6E-53 3.7E-56 306.9 30.2 358 9-473 1-382 (396) 34 protein:vir:1845 Length: 392 # 100.0 2E-52 1.3E-55 304.0 31.1 355 9-473 1-379 (392) 35 protein:vir:5711 Length: 396 # 100.0 2.2E-52 1.4E-55 303.8 29.6 358 9-473 1-382 (396) 36 protein:vir:79181 Length: 390 100.0 6.4E-52 4E-55 301.2 28.9 356 1-473 1-377 (390) 37 protein:vir:98553 Length: 395 100.0 1.8E-51 1.1E-54 298.7 31.2 358 9-473 1-382 (395) 38 protein:vir:1172 Length: 391 # 100.0 5.5E-52 3.4E-55 301.6 28.1 357 1-473 1-378 (391) 39 protein:vir:2035 Length: 396 # 100.0 9.3E-52 5.8E-55 300.4 29.2 358 9-473 1-382 (396) 40 protein:vir:78206 Length: 390 100.0 1.1E-51 7.1E-55 299.9 28.7 356 1-473 1-377 (390) 41 protein:vir:103993 Length: 390 100.0 1.1E-51 7.1E-55 299.9 28.7 356 1-473 1-377 (390) 42 protein:vir:79141 Length: 391 100.0 1.4E-51 8.6E-55 299.4 28.2 353 1-473 1-377 (391) 43 protein:vir:96740 Length: 388 100.0 1.1E-49 7E-53 288.9 31.0 353 1-473 1-376 (388) 44 protein:vir:4517 Length: 498 # 100.0 3.8E-48 2.3E-51 280.6 32.2 451 2-473 1-487 (498) 45 protein:vir:100323 Length: 393 100.0 3.4E-48 2.1E-51 280.8 31.4 354 1-473 1-379 (393) 46 protein:vir:489 Length: 498 # 100.0 6.5E-48 4E-51 279.3 32.1 451 2-473 1-487 (498) 47 protein:vir:4463 Length: 498 # 100.0 7.9E-48 4.9E-51 278.8 32.5 451 2-473 1-487 (498) 48 protein:vir:1996 Length: 495 # 100.0 3.9E-46 2.4E-49 269.6 34.9 449 1-473 1-491 (495) 49 protein:vir:10336 Length: 386 100.0 2.1E-46 1.3E-49 271.1 27.6 357 1-473 1-378 (386) 50 protein:vir:79798 Length: 717 100.0 5.6E-43 3.5E-46 252.2 36.5 459 1-473 1-716 (717) 51 protein:vir:103168 Length: 641 100.0 7.6E-43 4.7E-46 251.5 26.1 357 7-369 1-641 (641) 52 protein:vir:5833 Length: 742 # 100.0 6.5E-41 4E-44 240.9 29.6 435 1-473 264-735 (742) 53 protein:vir:101326 Length: 529 99.9 9.3E-25 5.8E-28 152.3 31.7 452 1-473 1-528 (529) 54 protein:vir:95263 Length: 450 99.6 8E-15 5E-18 97.9 32.9 414 5-473 1-448 (450) 55 protein:vir:5260 Length: 502 # 99.5 5.8E-13 3.6E-16 87.7 32.0 426 9-473 1-501 (502) 56 protein:vir:96104 Length: 504 99.3 2E-10 1.2E-13 73.8 30.9 425 12-473 1-504 (504) 57 protein:vir:101576 Length: 501 99.2 3.3E-10 2E-13 72.6 33.2 409 9-473 1-500 (501) 58 protein:vir:3751 Length: 376 # 99.2 7.1E-10 4.4E-13 70.7 30.3 330 13-473 1-370 (376) 59 protein:vir:3636 Length: 501 # 99.2 9.9E-10 6.1E-13 69.9 36.3 418 9-473 1-500 (501) 60 protein:vir:3788 Length: 376 # 99.1 1.2E-09 7.5E-13 69.5 31.9 330 13-473 1-370 (376) 61 protein:vir:94073 Length: 494 99.1 9.9E-10 6.1E-13 70.0 28.7 415 1-473 1-493 (494) 62 protein:vir:78782 Length: 370 99.1 4.4E-10 2.8E-13 71.9 26.5 328 13-473 1-362 (370) 63 protein:vir:106730 Length: 501 99.0 6.9E-09 4.3E-12 65.3 33.7 418 9-473 1-500 (501) 64 protein:vir:78611 Length: 501 99.0 7.7E-09 4.8E-12 65.1 35.6 409 9-473 1-500 (501) 65 protein:vir:99586 Length: 507 98.8 3.4E-08 2.1E-11 61.5 35.0 416 12-473 1-507 (507) 66 protein:vir:276 Length: 369 # 98.8 3.5E-08 2.2E-11 61.4 31.4 326 12-473 1-365 (369) 67 protein:vir:80052 Length: 331 98.6 2.2E-07 1.4E-10 57.0 26.9 310 89-473 1-330 (331) 68 protein:vir:107720 Length: 515 98.5 4E-07 2.5E-10 55.6 30.9 411 11-473 1-515 (515) 69 protein:vir:3165 Length: 426 # 98.0 7.9E-06 4.9E-09 48.6 25.3 392 14-473 1-425 (426) 70 protein:vir:96586 Length: 587 96.1 0.00098 6.1E-07 37.1 14.4 415 1-465 102-587 (587) 71 protein:vir:80488 Length: 562 96.1 0.00035 2.2E-07 39.5 11.1 424 1-465 63-562 (562) 72 protein:vir:63742 Length: 562 95.9 0.00016 9.8E-08 41.4 8.7 416 1-465 102-562 (562) 73 protein:vir:95741 Length: 587 95.6 0.0017 1.1E-06 35.7 14.6 410 1-465 102-587 (587) 74 protein:vir:99306 Length: 587 95.1 0.0029 1.8E-06 34.5 16.0 411 1-464 102-587 (587) No 1 >protein:vir:102957 Length: 437 # NCBI annotation: sheath tail protein # Family: family:all:632 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945292;genbank:gi:39653727;uniprot:Q708M0;genbank:GeneID:2672871 Probab=100.00 E-value=1.3e-114 Score=645.00 Aligned_cols=431 Identities=25% Similarity=0.406 Sum_probs=380.2 Q ss_pred CCccccCCCCceecCceeEEEecCCcceecccCceEEEEEEeeCCCCCCceEEeeccHHHHHHHcCCCcCcHHHHHHHHH Q lcl|NC_019421. 1 MATGTWNEKERKEIPGFYNRFKTQAEKSTNTGLKGRLAMPIRANWGDVGKVVTIKNDLRQLKNLFGDDMNYSAFKLGKLA 80 (473) Q Consensus 1 m~~g~~~~~~~~~~PGvYie~~~~~~~~i~~~~~~~~~~~g~a~~Gp~~~~v~i~s~~~~~~~~fG~~~~~~~~~~v~~~ 80 (473) |+||||+ +|||+||||||||++++++++++++||+++|+|.++|||+|+|++|+|+ +++.+.||....+..+.+++++ T Consensus 1 m~gg~~~-~~~k~~PGvYi~~~~~~~~~i~~~~~~~~a~~~~~~~Gp~~~~~~i~s~-~d~~~~fG~~~~~~~~~~~~~~ 78 (437) T protein:vir:10 1 MAGGIWK-RQNKVRPGAYINVKSKDIAMTRLGGDGVVTVPLALSFGQSKKLMKIRRG-EDLFKKLGYEQESPQLLLLNEA 78 (437) T ss_pred CCcceec-ccceecCceeEEEecCCcceeeccCCcEEEEEEEecCCCCceeEEEecH-HHHHHHcCCccchhHHHHHHHH Confidence 9999997 7999999999999999999999999999999999999999999999995 6799999988777777777776 Q ss_pred HhcCCCEEEEEecCCCcccceeeeecccccccccceEEEEecCccccceeEEE--eeccCCccceeeeeecCCceeeEEE Q lcl|NC_019421. 81 LLGNVKELLLYRLVDGNQKKGTLTLKDTTENSAKDVIKLETKYPTARNFNVTI--KSNLVDSDKKDFIFFENTKQLFSSS 158 (473) Q Consensus 81 f~~g~~~v~v~rv~~g~~~aat~~l~~~~~~~~~~~l~i~A~~~G~~~n~i~v--~~~~~~~~~~~v~v~~~~~~~~~~~ 158 (473) | +|+++||+||+++|.. |+.++. +.++++|+|||.|||.++| +.+.++++++++.++.++..++.+. T Consensus 79 ~-~g~~~~~~~R~~~g~~--a~~tl~--------~~~~~~A~~~G~~gn~i~v~v~~~~~d~~~~~v~~~~~~~~~d~~~ 147 (437) T protein:vir:10 79 F-KRVSEVLLYRLNTGEK--ANVSLS--------DNVTAQAKYSGVRGNDITVTVKTNVDDPSSFDVVTFLDTVVMDLQT 147 (437) T ss_pred h-cCCCEEEEEECCCCce--eeEeec--------cceEEEeccCCcccceeEEEEeeccCCccceEEEEecCcceeeeee Confidence 5 7999999999998754 454443 4689999999999987655 5667789999999999998888876 Q ss_pred ecccchhhhhhhhhcccccceeEeecccCCccccccceeeeccCcccccchhhHHHHHHHHhhcccceEEEEEcCCCcHH Q lcl|NC_019421. 159 IKGTIDEIVLEINSNLDNEYVIATKVADSDTILANVVNQALEGGNDGCTSITNESYLKALEEFERYSFDSFVLDGVADEA 238 (473) Q Consensus 159 ~~~~~~~~~~~~~~~~~s~~v~~~~~~~~~~~~~~~~~~~l~gG~dg~~~~t~~d~~~~l~~le~~~~~~l~~p~~~~~~ 238 (473) +....+ ...+.++.+. ..++++.++..+|+||+||. ++.+||.++|++||.++||++|+|. .+++ T Consensus 148 v~~~~~--------~~~n~~v~~~----~~~~l~~~a~~~LtGG~dg~--~t~~dy~~al~~le~~~~n~l~~~~-~d~~ 212 (437) T protein:vir:10 148 VKVLAD--------LKNNALVEFS----GTGELQPVAGAKLTGGTDGA--ISTQDYLEYFKALETVEFNYMALPV-EDAS 212 (437) T ss_pred hhhhhh--------hhhhcccccc----cccccccccceeeeccccCC--CChhHHHHHHHHhccCcceEEEecC-CChh Confidence 543222 2345666553 34456777888999999986 4678999999999999999999996 5789 Q ss_pred HHHHHHHHHHHHhhC-CCeEEEEEcCCCCccHHHHHHhhhccCCceEEEecCCcee-cCcccchHHHHHHHHHhhhcCcc Q lcl|NC_019421. 239 LQETTKAWVAKNKEL-GKDILLFLGGKTEDNIKQINDKSKSFNDENIVNVGSSAYY-ENIKYTPSEVAVYIAALSVSKGI 316 (473) Q Consensus 239 ~~~~l~~~v~~~~~~-~~~~~av~~~~~~~t~~~~~~~~~~~n~~~i~~~~~~~~~-~~~~~~~~~~a~~vAG~~a~~~~ 316 (473) +|+++.+||+++|++ ++++.+|+++. +.|+|+|+++.+.... ++..++++++|+|+||++|++++ T Consensus 213 ~~t~~~~~ik~~r~~~g~~~~~V~~~~-------------~~d~e~Iin~~n~~~~~~~~~~~~~~~~a~vAG~~Ag~~~ 279 (437) T protein:vir:10 213 IKKAAINFIKRMREDEGLGAQLVVADS-------------DADSEAVINVKNGVILSDKTVIDKTKATVWVAAASANAGV 279 (437) T ss_pred HHHHHHHHHHHHHhccCceEEEEeCCC-------------CCCCceEEEeecceeecCcceechhhHHHHHHHHhccCcc Confidence 999999999999986 77778888764 3489999999998766 45679999999999999999999 Q ss_pred ccccceeccCcc-cccccCCHHHHHHHHhCCcEEEEEcCCEEEEEecccccccCCCCCcchhhhhhhhHHHHHHHHHHHH Q lcl|NC_019421. 317 TGSICNAKTIFE-EVEPRLSQSEVKECLKSGTLVLDFDDGDVIIVDDVNTFKKYVDDKNEAMGYISNIMFINTINKDTSL 395 (473) Q Consensus 317 ~~s~t~~~~~~~-~~~~~~t~~e~~~l~~~G~~~l~~~~~~~~i~~gi~T~~~~~~~~~~~~~~i~v~R~~d~i~~~i~~ 395 (473) ++|+||++++++ ++..+|+++|+++|+++|+++|+++++.++|+||||||++++++++++|+||+++|++|+|.++|+. T Consensus 280 ~~S~t~~~~~~~~~v~~~~t~~e~~~~i~~G~~vl~~~~~~v~i~~gInTltt~~~~~~~~~~ki~vir~~D~i~~di~~ 359 (437) T protein:vir:10 280 EKSLTYEKYEDSVDVVGRLSHTETEDALLKGQFVFTARRGRAVVEQDINSHVSFTIEKNQDFRKNRILRTLDDIVNDTRY 359 (437) T ss_pred ccCccccccCCcccccccCCHHHHHHHHhCCcEEEEEeCCeEEEEEccccccccCCCCCchhhhhhHHHHHHHHHHHHHH Confidence 999999999865 7888999999999999999999999999999999999999999999999999999999999999986 Q ss_pred H-HhhcCCcccCCHHHHHHHHHHHHHHHHHHHhcCCccCccceeccccccCCCCCEEEEEEEEEEeeeeeeEEEEEEeC Q lcl|NC_019421. 396 K-RKEFVGKIFNDATGQTTVICALKKYFEELMSQGIISEFNVDIDTELQATAKADEFYWKWDAVKVDVMKKIYGTGYLG 473 (473) Q Consensus 396 ~-~~~~ig~~~N~~~~r~~i~~~i~~~l~~l~~~g~i~~~~~~~D~~~~~~~~~d~~~v~i~v~p~~~~e~i~~t~~v~ 473 (473) . .+.|+||+|||+++|++|+++|++||++|+++|+|++|... |.++.+.+++|.+++++.|+|+|+||+||++++|- T Consensus 360 ~~~~~yiGk~~N~~~~r~~~~~~i~~yl~~l~~~g~I~~~~~~-d~~v~~~~~~~~v~v~~~v~~~dame~iy~ti~v~ 437 (437) T protein:vir:10 360 AFSEYFLGKVSNNEDGRQAFKANRIRYFKDLEARGAIEDFKVE-DIEVLRGELKESVVVNVKVKPVDSMEKLYMTVTVE 437 (437) T ss_pred HHHhccccccCCCHHHHHHHHHHHHHHHHHHHhCCCccCCCce-eEEeecCCCCCEEEEEEEEEEeeeeeeEEEEEEec Confidence 4 55799999999999999999999999999999999999886 56666678899999999999999999999999999 No 2 >protein:vir:78986 Length: 436 # NCBI annotation: putative sheath tail protein # Family: family:all:632 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110731;genbank:gi:134287348;genbank:GeneID:4955160 Probab=100.00 E-value=3.1e-111 Score=626.40 Aligned_cols=427 Identities=21% Similarity=0.338 Sum_probs=385.7 Q ss_pred CCccccCCCCceecCceeEEEecCCcceecccCceEEEEEEeeCCCCCCceEEeeccH--HHHHHHcCCCcCcHHHHHHH Q lcl|NC_019421. 1 MATGTWNEKERKEIPGFYNRFKTQAEKSTNTGLKGRLAMPIRANWGDVGKVVTIKNDL--RQLKNLFGDDMNYSAFKLGK 78 (473) Q Consensus 1 m~~g~~~~~~~~~~PGvYie~~~~~~~~i~~~~~~~~~~~g~a~~Gp~~~~v~i~s~~--~~~~~~fG~~~~~~~~~~v~ 78 (473) |+||||. +|||+|||+||||++.+...++++.||++++|..++|||+|+++.|++.. ......||++..++.+..++ T Consensus 3 magg~~~-~~~K~~PG~Y~n~~~~~~~~~~~~~rGi~a~p~~~~wGp~~~v~~i~~~~~~~~~~~~~G~~~~~~~~~~l~ 81 (436) T protein:vir:78 3 LGGGTFV-TQNKVLPGSYINFVSATRATSSLSDRGIVAMPLELDWGIDEEVFQVTSDDFEKYSTKYFGYDYTHEKLKGLR 81 (436) T ss_pred ccceeec-cceeecCceEEEEEecCcceeeccCCeEEEEEEEecCCCCceeEEeecccchHHHHHHhcCccchHHHHHHH Confidence 9999996 79999999999999999999999999999999999999999999999853 24667899998888889999 Q ss_pred HHHhcCCCEEEEEecCCCcccceeeeecccccccccceEEEEecCccccceeEE--EeeccCCccceeeeeecCCceeeE Q lcl|NC_019421. 79 LALLGNVKELLLYRLVDGNQKKGTLTLKDTTENSAKDVIKLETKYPTARNFNVT--IKSNLVDSDKKDFIFFENTKQLFS 156 (473) Q Consensus 79 ~~f~~g~~~v~v~rv~~g~~~aat~~l~~~~~~~~~~~l~i~A~~~G~~~n~i~--v~~~~~~~~~~~v~v~~~~~~~~~ 156 (473) ++| .|++++|+|||++|..++++ .++|+|||.+||.++ |+.+++|+++|++.++.++.++++ T Consensus 82 ~~~-~~~~tv~~yrl~~G~~a~~~---------------v~~Aky~g~~gn~i~v~v~~~~~d~~~~dv~~~~g~~~~d~ 145 (436) T protein:vir:78 82 DLF-KNIRLGYFYKLNKGVKASCS---------------IATARCSGIRGNDLKVIVTTNIDDNAKFDVVTLLDNKKVDT 145 (436) T ss_pred HHh-cCCCEEEEEECCCcceeeee---------------eeeeecCCCCCcEEEEEecccccccCceEEEEEecchhhhh Confidence 977 68899999999998766553 268999999997665 557788999999999999998887 Q ss_pred EEecccchhhhhhhhhcccccceeEeecccCCccccccceeeeccCcccccchhhHHHHHHHHhhcccceEEEEEcCCCc Q lcl|NC_019421. 157 SSIKGTIDEIVLEINSNLDNEYVIATKVADSDTILANVVNQALEGGNDGCTSITNESYLKALEEFERYSFDSFVLDGVAD 236 (473) Q Consensus 157 ~~~~~~~~~~~~~~~~~~~s~~v~~~~~~~~~~~~~~~~~~~l~gG~dg~~~~t~~d~~~~l~~le~~~~~~l~~p~~~~ 236 (473) +.+. ....+..|+||+++. .+.++.++..+|+||+||++ +++++|.++|++||.++||+||+|+ .+ T Consensus 146 ~~~~--------~~~~l~~n~~V~~~~----~g~la~~a~~~LtGG~dG~~-~T~~dy~~al~~le~~~fn~l~~~~-~d 211 (436) T protein:vir:78 146 QIAK--------VITELQDNDYVTWKK----EATLEATAGLTFTNGTNGEA-VTGTEYQAFLDKIESYSFNALGCLA-TT 211 (436) T ss_pred hhHH--------HHhhccCCceEEEEe----cccccccceeeeeccccccc-cchHHHHHHHHHHcccceeEEEecC-CC Confidence 5532 233456799999874 45688899999999999974 6899999999999999999999997 58 Q ss_pred HHHHHHHHHHHHHHhh-CCCeEEEEEcCCCCccHHHHHHhhhccCCceEEEecCCceecCcccchHHHHHHHHHhhhcCc Q lcl|NC_019421. 237 EALQETTKAWVAKNKE-LGKDILLFLGGKTEDNIKQINDKSKSFNDENIVNVGSSAYYENIKYTPSEVAVYIAALSVSKG 315 (473) Q Consensus 237 ~~~~~~l~~~v~~~~~-~~~~~~av~~~~~~~t~~~~~~~~~~~n~~~i~~~~~~~~~~~~~~~~~~~a~~vAG~~a~~~ 315 (473) +++|+++.+|++++|+ .++++++|+++. ...|+|+|+++.+++ .+..+.++++|+|+||++|+++ T Consensus 212 ~~~~~~~~a~ikr~re~~g~~~~aV~~~~------------~~~d~EgIInv~n~v--~g~~~~~~~~~a~vAG~~Ag~~ 277 (436) T protein:vir:78 212 AEIKSLFVEFTKRMRDKVGAKFQTVLYKK------------NDADYEGVVSVENKI--KDTGLLESSLIYWTTGAIAGCD 277 (436) T ss_pred hHHHHHHHHHHHHHHhhcCCeEEEEecCC------------CCCCCceEEEeeccc--CCceechhHHHHHHHHHHhcCc Confidence 9999999999999996 489999998763 347999999999874 7778999999999999999999 Q ss_pred cccccceeccCcc-cccccCCHHHHHHHHhCCcEEEEEcCCEEEEEecccccccCCCCCcchhhhhhhhHHHHHHHHHHH Q lcl|NC_019421. 316 ITGSICNAKTIFE-EVEPRLSQSEVKECLKSGTLVLDFDDGDVIIVDDVNTFKKYVDDKNEAMGYISNIMFINTINKDTS 394 (473) Q Consensus 316 ~~~s~t~~~~~~~-~~~~~~t~~e~~~l~~~G~~~l~~~~~~~~i~~gi~T~~~~~~~~~~~~~~i~v~R~~d~i~~~i~ 394 (473) +++|+||++++++ ++..+++++|+++++++|+++|++++++++|++|||||++++.+++++|++|+++|+||+|.++|+ T Consensus 278 ~~~S~T~~~~~~~~~v~~~~t~~e~~~ai~~G~lvl~~d~~~v~I~~~VNTltt~~~~k~~~~~kI~vir~~D~i~~di~ 357 (436) T protein:vir:78 278 INKSNTNKRYDGEFDVDVNYTQIHLEEALKTGKFIFHKVGDEVHVLEDINTFVSFTDEKNDDFSSNQSVRVLDQIANDIA 357 (436) T ss_pred cccCccceecCccccccccCCHHHHHHHHhCCeEEEEEeCCeEEEEEccccceecCCCCCcchhhhhHHHHHHHHHHHHH Confidence 9999999999976 788999999999999999999999999999999999999999999999999999999999999998 Q ss_pred H-HHhhcCCcccCCHHHHHHHHHHHHHHHHHHHhcCCccCccceeccccccCCCCCEEEEEEEEEEeeeeeeEEEEEEeC Q lcl|NC_019421. 395 L-KRKEFVGKIFNDATGQTTVICALKKYFEELMSQGIISEFNVDIDTELQATAKADEFYWKWDAVKVDVMKKIYGTGYLG 473 (473) Q Consensus 395 ~-~~~~~ig~~~N~~~~r~~i~~~i~~~l~~l~~~g~i~~~~~~~D~~~~~~~~~d~~~v~i~v~p~~~~e~i~~t~~v~ 473 (473) . +.++||||+||++++|.+|+++|++||++|+++|+|++|+.. |.++.+.++++.+++++.++|+|+||+||+|++|+ T Consensus 358 ~~~~~~yiGKv~N~~dgr~~l~~~i~~yl~~L~~~g~I~~f~~~-Dv~v~~~~~~~~v~v~~~v~pvdamekiy~ti~v~ 436 (436) T protein:vir:78 358 TLFNTKYLGEVPNDKSGRISFWNDVVKHHEQLQNMRAIEDFKAD-DVSVEPGSDKKTVVVSDAVKVISAMSKLYMTVSVS 436 (436) T ss_pred HHhhhccccccCCCHHHHHHHHHHHHHHHHHHHhCCcccCCCCc-ceEEeecCCCCEEEEEEEEEEEEeeeeEEEEEEEC Confidence 5 557899999999999999999999999999999999999864 88888889999999999999999999999999999 No 3 >protein:vir:105470 Length: 451 # NCBI annotation: putative phage sheath tail protein # Family: family:all:632 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529880;genbank:gi:90592620;genbank:GeneID:3974534 Probab=100.00 E-value=2.7e-110 Score=621.30 Aligned_cols=441 Identities=19% Similarity=0.286 Sum_probs=381.8 Q ss_pred CCccccCCCCceecCceeEEEecCCcceecccCceEEEEEE-eeCCCCCCceEEeeccHHHHHHHcCCCcCcHHHHHHHH Q lcl|NC_019421. 1 MATGTWNEKERKEIPGFYNRFKTQAEKSTNTGLKGRLAMPI-RANWGDVGKVVTIKNDLRQLKNLFGDDMNYSAFKLGKL 79 (473) Q Consensus 1 m~~g~~~~~~~~~~PGvYie~~~~~~~~i~~~~~~~~~~~g-~a~~Gp~~~~v~i~s~~~~~~~~fG~~~~~~~~~~v~~ 79 (473) |+||||. +|||+||||||||++++.+++.+++++.++|++ .++||| ++|++|.|+ +++++.||...+++.++++++ T Consensus 1 magg~~~-~~~K~~PGvYi~~~~~~~~~~~~~~~~~~~~i~~~~~~g~-~~~v~i~~~-~d~~~~fG~~~~~~~~~~~~~ 77 (451) T protein:vir:10 1 MAGGTWK-AQDKRRPGTYINVVGNGQREAASSLGRVLLIRDKGLGWGK-NGVIEVEAN-SDFTKKLGTTLDDPSLTALKE 77 (451) T ss_pred CCceeec-cceeecCceEEEEeccCcceeeccCCcEEEEEeeecCCCC-cccEEeecH-HHHHHHcCCcccchhHHHHHH Confidence 9999997 799999999999999999999998877777777 678998 568999996 568889999888888888888 Q ss_pred HHhcCCCEEEEEecCCCcccceeeeecccccccccceEEEEecCccccceeEEE--eeccCCccceeeeeecCCceeeEE Q lcl|NC_019421. 80 ALLGNVKELLLYRLVDGNQKKGTLTLKDTTENSAKDVIKLETKYPTARNFNVTI--KSNLVDSDKKDFIFFENTKQLFSS 157 (473) Q Consensus 80 ~f~~g~~~v~v~rv~~g~~~aat~~l~~~~~~~~~~~l~i~A~~~G~~~n~i~v--~~~~~~~~~~~v~v~~~~~~~~~~ 157 (473) || +|+++||+||+++|..++++.. .+.++++|+|||.|||.++| +.+++|+.+|++.++.++..++.+ T Consensus 78 ~~-~g~~~v~~yrl~~g~~a~~t~~---------~~~~~~~Aky~G~~Gn~i~v~v~~~~~d~~~~~v~t~~g~~~vd~q 147 (451) T protein:vir:10 78 TL-KGASKVLVLNPNEGTAATLTKE---------GLPWTVTANYPGEKGNQITVSVEVSPADQNAATVSTIFGTKLVDEQ 147 (451) T ss_pred Hh-cCCcEEEEEEcCCCceEEEEee---------cCceEEEEeeCCcCCceEEEEEecccCCcCceEEEEEECCeEEEEE Confidence 77 5899999999999877666543 24578999999999987655 567889999999999999999998 Q ss_pred EecccchhhhhhhhhcccccceeEeecccCCccccccceeeeccCc-ccccchhhHHHHHHHHhhcccceEEEEEcCCC- Q lcl|NC_019421. 158 SIKGTIDEIVLEINSNLDNEYVIATKVADSDTILANVVNQALEGGN-DGCTSITNESYLKALEEFERYSFDSFVLDGVA- 235 (473) Q Consensus 158 ~~~~~~~~~~~~~~~~~~s~~v~~~~~~~~~~~~~~~~~~~l~gG~-dg~~~~t~~d~~~~l~~le~~~~~~l~~p~~~- 235 (473) ++.... ...+..|++++++...... +.......+++|. .|....++.+|.++|.++|.++||++++|+.+ T Consensus 148 tv~~~~------~~el~~nd~V~a~~~~~g~--~~~~~~~~l~~~~~gg~~~~~~~~~~~~l~~~e~~~~n~l~~~~~~~ 219 (451) T protein:vir:10 148 SIKFNE------LDKFKGNDYITAKVVEEGS--SKPVAFTNVSGTLTGGTTTESNKVESLLNDALENEEYAVVTTAGFEP 219 (451) T ss_pred Eeeccc------hhhccCCceEEEEeccccc--ccceeeeecccccccccccCCccchHHHHHHhccceeeEEEEccCCC Confidence 764321 2345679999988765443 3334444555553 34555678899999999999999999999764 Q ss_pred cHHHHHHHHHHHHHHhh-CCCeEEEEEcCCCCccHHHHHHhhhccCCceEEEecCCcee-cCcccchHHHHHHHHHhhhc Q lcl|NC_019421. 236 DEALQETTKAWVAKNKE-LGKDILLFLGGKTEDNIKQINDKSKSFNDENIVNVGSSAYY-ENIKYTPSEVAVYIAALSVS 313 (473) Q Consensus 236 ~~~~~~~l~~~v~~~~~-~~~~~~av~~~~~~~t~~~~~~~~~~~n~~~i~~~~~~~~~-~~~~~~~~~~a~~vAG~~a~ 313 (473) ++++|+++.+||+++|+ +|+++++|++++.. ..+|+++|+++.+++.. ++..++++++|+||||++|+ T Consensus 220 ~~~i~~~~~a~ik~~r~~~g~~~~aVl~~~~~----------~~~d~egiinv~n~~~~~dg~~~~~~~~~~~vAG~~Ag 289 (451) T protein:vir:10 220 SSNMNKLVVEAVKRLRENEGRKVRGVIPTDAD----------TTYNYEGISTVVNGYTLSDGTNVDVKDATGYFAGISAS 289 (451) T ss_pred chHHHHHHHHHHHHHHHhcCCeEEEEecCccC----------CCCCCcceEEeecceEecCceeechhhhHHHHHHHHcc Confidence 56789999999999996 59999999987643 34799999999998866 45679999999999999999 Q ss_pred CccccccceeccCc-ccccccCCHHHHHHHHhCCcEEEE-EcCCEEEEEecccccccCCCCCcchhhhhhhhHHHHHHHH Q lcl|NC_019421. 314 KGITGSICNAKTIF-EEVEPRLSQSEVKECLKSGTLVLD-FDDGDVIIVDDVNTFKKYVDDKNEAMGYISNIMFINTINK 391 (473) Q Consensus 314 ~~~~~s~t~~~~~~-~~~~~~~t~~e~~~l~~~G~~~l~-~~~~~~~i~~gi~T~~~~~~~~~~~~~~i~v~R~~d~i~~ 391 (473) +++++|+||+++++ .++..+|+++|+++++++|+++|+ ++++.++|++|||||++++.+++++|++|+++|+||+|.+ T Consensus 290 ~~~~~S~T~~~~~~~~~v~~~~t~~e~~~~i~~G~lvl~~~~g~~v~i~~~INTltt~~~~k~~~~~ki~vir~~D~i~~ 369 (451) T protein:vir:10 290 ADVATSLTYFEVEDAVSAYPKFDNEKTIKALDAGQIVFTTRPGQRVVIEQDINSLHKFTAEKPQAFSKNRVIRTLDEIAT 369 (451) T ss_pred cccccCccceecCCceeeeeeCCHHHHHHHHhCCeEEEEEEcCCeEEEEEccccceecCCCCCcchhhhhHHHHHHHHHH Confidence 99999999999985 578899999999999999999996 6778899999999999999999999999999999999999 Q ss_pred HHHHH-HhhcCCcccCCHHHHHHHHHHHHHHHHHHHhcCCccCccceeccccccCCCCCEEEEEEEEEEeeeeeeEEEEE Q lcl|NC_019421. 392 DTSLK-RKEFVGKIFNDATGQTTVICALKKYFEELMSQGIISEFNVDIDTELQATAKADEFYWKWDAVKVDVMKKIYGTG 470 (473) Q Consensus 392 ~i~~~-~~~~ig~~~N~~~~r~~i~~~i~~~l~~l~~~g~i~~~~~~~D~~~~~~~~~d~~~v~i~v~p~~~~e~i~~t~ 470 (473) +|+.. .++||||+|||+++|++|+++|++||++|+++|+|++|.. .|+++++.++++.+++++.|+|+|+||+||+++ T Consensus 370 di~~~~~~~yiGk~~N~~~gr~~~~~~i~~yl~~l~~~g~i~~~~~-~d~~v~~~~~~~~v~v~~~v~pvdame~iy~t~ 448 (451) T protein:vir:10 370 NTENTFERTYLGNVGNNAAGRDLFKADRIAYLTSLQNRNMIQSFAN-TDITVEAGNDMDSIVVNLAVTPVDAMEKLYMTM 448 (451) T ss_pred HHHHHhhhccceecCCCHHHHHHHHHHHHHHHHHHHhCCCccCCCc-cceEEeecCCCCEEEEEEEEEEEeeeeeEEEEE Confidence 99855 4679999999999999999999999999999999999984 588888899999999999999999999999999 Q ss_pred EeC Q lcl|NC_019421. 471 YLG 473 (473) Q Consensus 471 ~v~ 473 (473) +|- T Consensus 449 ~v~ 451 (451) T protein:vir:10 449 VVR 451 (451) T ss_pred EEc Confidence 999 No 4 >protein:vir:99306 Length: 587 # NCBI annotation: putative major tail sheath protein # Family: family:all:2449 # MgeID: mge:1655 # MgeName: K # Cross-refs: genbank:acc:YP_024479;genbank:gi:48696438;genbank:GeneID:2948034 Probab=100.00 E-value=3.4e-93 Score=527.54 Aligned_cols=446 Identities=17% Similarity=0.218 Sum_probs=353.1 Q ss_pred CCccccCCCCceecCceeEEEecCCcceecccCceEEEEEEeeCCCCCCceEEeeccHHHHHHHcCCCcCcHHHHHHHHH Q lcl|NC_019421. 1 MATGTWNEKERKEIPGFYNRFKTQAEKSTNTGLKGRLAMPIRANWGDVGKVVTIKNDLRQLKNLFGDDMNYSAFKLGKLA 80 (473) Q Consensus 1 m~~g~~~~~~~~~~PGvYie~~~~~~~~i~~~~~~~~~~~g~a~~Gp~~~~v~i~s~~~~~~~~fG~~~~~~~~~~v~~~ 80 (473) ||.+.|. +++.+|||||+|+.+++..+++++++++++|+|.+++||+++|++|+++ +++++.||.+ +++.+++++ T Consensus 1 ~a~~~~~-~~~~~~pgv~~~~~~~~~~~~~~~~~~~~~~~g~a~~G~~~~~~~~~~~-~~~~~~~~~g---~l~~~~~~a 75 (587) T protein:vir:99 1 MAVEPFP-RRPITRPHASIEVDTSGIGGSAGSSEKVFCLIGQAEGGEPNTVYELRNY-SQAKRLFRSG---ELLDAIELA 75 (587) T ss_pred CcccccC-CcccccCceEEEEecCCccccCCCCCceEEEEEEecCCccceeEEeccH-HHHHHHhcCc---chHHHHHHH Confidence 9999997 7999999999999999999999999999999999999999999999995 6699999875 477888888 Q ss_pred H----hcCCCEEEEEecCCCcccceeeeecccccccccceEEEEecCccccceeEEEeec---cCCccceeeeeecCC-- Q lcl|NC_019421. 81 L----LGNVKELLLYRLVDGNQKKGTLTLKDTTENSAKDVIKLETKYPTARNFNVTIKSN---LVDSDKKDFIFFENT-- 151 (473) Q Consensus 81 f----~~g~~~v~v~rv~~g~~~aat~~l~~~~~~~~~~~l~i~A~~~G~~~n~i~v~~~---~~~~~~~~v~v~~~~-- 151 (473) | .+|+++||++|+.++. +|+.++ +.|+++|++||.|||.|+|+.. ..++.++.+....+. T Consensus 76 ~~~~~~~g~~~~~~~rv~~~~--~a~~~~---------~~l~~~a~~~G~~gN~i~v~~~~~~~~~~~~~~~~~~~~~~~ 144 (587) T protein:vir:99 76 WGSNPNYTAGRILAMRIEDAK--PASAEI---------GGLKITSKIYGNVANNIQVGLEKNTLSDSLRLRVIFQDDRFN 144 (587) T ss_pred hccccCCCceEEEEEEcCCCc--eeEEEe---------cCeEEEEeeccccccceEEEEccCCCCcceeEEEEEecccce Confidence 8 5899999999996554 455554 3499999999999999988632 222333333111110 Q ss_pred -------------------------------------------ceeeEEEecccc----hhhhhhhhhcc---------- Q lcl|NC_019421. 152 -------------------------------------------KQLFSSSIKGTI----DEIVLEINSNL---------- 174 (473) Q Consensus 152 -------------------------------------------~~~~~~~~~~~~----~~~~~~~~~~~---------- 174 (473) +++....+.... ..+...+.... T Consensus 145 ~~~~~~g~v~~i~y~g~~~~a~~~v~~~~~t~~a~~~~l~~g~~~v~~yrL~~g~~~~~~~~~~~i~~~~~~tAky~~~~ 224 (587) T protein:vir:99 145 EVYDNIGNIFTIKYKGEEANATFSVEHDEETQKASRLVLKVGDQEVKSYDLTGGAYDYTNAIITDINQLPDFEAKLSPFG 224 (587) T ss_pred eeeeeccceeeEEeecccccceeeEeecCcceeeeeeeeecCCceeEEEEecCCchHHHHHHHhhhccccceeEEeeccC Confidence 000000000000 00000000000 Q ss_pred c------------------------------------ccceeEeec-----------------------ccCCccccccc Q lcl|NC_019421. 175 D------------------------------------NEYVIATKV-----------------------ADSDTILANVV 195 (473) Q Consensus 175 ~------------------------------------s~~v~~~~~-----------------------~~~~~~~~~~~ 195 (473) . ++++.++.. ....+.++... T Consensus 225 ~~~i~~~~~~~~~~~~v~~~~~~v~a~~~D~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~ 304 (587) T protein:vir:99 225 DKNLESSKLDKIENANIKDKAVYVKAVFGDLEKQTAYNGIVSFEQLNAEGEVPSNVEVEAGEESATVTATSPIKTIEPFE 304 (587) T ss_pred CceeEeecccccccceeeeeeeeeehhccceeeecccceeeeeeecccccchhhhhhhhhccccceeeeeccccceeccc Confidence 0 000000000 11112233344 Q ss_pred eeeeccCcccccchhhHHHHHHHHhhcccceEEEEEcCCCcHHHHHHHHHHHHHHhhCCCeEEEEEcCCCCccHHHHHHh Q lcl|NC_019421. 196 NQALEGGNDGCTSITNESYLKALEEFERYSFDSFVLDGVADEALQETTKAWVAKNKELGKDILLFLGGKTEDNIKQINDK 275 (473) Q Consensus 196 ~~~l~gG~dg~~~~t~~d~~~~l~~le~~~~~~l~~p~~~~~~~~~~l~~~v~~~~~~~~~~~av~~~~~~~t~~~~~~~ 275 (473) ...|+||+||..+ .+|.++|++|+.+++++|++ .++++++|+++++||++||++++++++|++++.+++++++.++ T Consensus 305 ~t~LtGG~dG~~~---~sy~~al~ale~~~~~~i~~-~t~d~~i~a~l~a~vk~~r~~g~~~~aVlg~~~~~~~~~~~~~ 380 (587) T protein:vir:99 305 LTKLKGGTNGEPP---ATWADKLDKFAHEGGYYIVP-LSSKQSVHAEVASFVKERSDAGEPMRAIVGGGFNESKEQLFGR 380 (587) T ss_pred ceeeecCCCCCcc---ccHHHHHHHHhhCCcEEEEe-cCCCHHHHHHHHHHHHHHHhCCCcEEEEecCCCCCCHHHHHHH Confidence 5669999998653 46889999999999999965 5778999999999999999999999999999999999999999 Q ss_pred hhccCCceEEEecCCceec---C--cccchHHHHHHHHHhhhcCccccccceeccCcccccccCCHHHHHHHHhCCcEEE Q lcl|NC_019421. 276 SKSFNDENIVNVGSSAYYE---N--IKYTPSEVAVYIAALSVSKGITGSICNAKTIFEEVEPRLSQSEVKECLKSGTLVL 350 (473) Q Consensus 276 ~~~~n~~~i~~~~~~~~~~---~--~~~~~~~~a~~vAG~~a~~~~~~s~t~~~~~~~~~~~~~t~~e~~~l~~~G~~~l 350 (473) +..+|++|+++|.++.... + ..++++++|+|+||++|++++++|+||++++++++.++|+++|+++|+++|+++| T Consensus 381 a~~~n~e~vi~v~~~~~~~~~dg~~~~~~~~~~aa~vAGl~Ag~~~~~SlT~~~i~~~~v~~~~t~~e~e~li~~Gvl~l 460 (587) T protein:vir:99 381 QASLSNPRVSLVANSGTFVMDDGRKNHVPAYMVAVALGGLASGLEIGESITFKPLRVSSLDQIYESIDLDELNENGIISI 460 (587) T ss_pred hhhcCCCcEEEEeccceEecCCCceeeechHHHHHHHHHHHhcCchhcCccceeeecccccccCCHHHHHHHHhCCeEEE Confidence 9999999999999886542 2 3488999999999999999999999999999999999999999999999999999 Q ss_pred EEcCC----EEEEEecccccccCCCCCcchhhhhhhhHHHHHHHHHHHHHH-hhcCCcccCCHHHHHHHHHHHHHHHHHH Q lcl|NC_019421. 351 DFDDG----DVIIVDDVNTFKKYVDDKNEAMGYISNIMFINTINKDTSLKR-KEFVGKIFNDATGQTTVICALKKYFEEL 425 (473) Q Consensus 351 ~~~~~----~~~i~~gi~T~~~~~~~~~~~~~~i~v~R~~d~i~~~i~~~~-~~~ig~~~N~~~~r~~i~~~i~~~l~~l 425 (473) ++..+ .++|+++|||++ .++++.|++|+++|++|+|.++||..+ +.|+|+ ||++.+|..|+++|.+||++| T Consensus 461 ~~~~~~~~~~vriv~~ItT~t---~~~~~~~~~i~viRv~D~i~~di~~~~~~~yiGk-~Nn~~~r~~i~~~i~~~L~~l 536 (587) T protein:vir:99 461 EFVRNRTNTFFRIVDDVTTFN---DKSDPVKAEMAVGEANDFLVSELKVQLEDQFIGT-RTINTSASIIKDFIQSYLGRK 536 (587) T ss_pred EEecCCcceEEEEeeceeecc---CCCCchhhhhhhhhhHHHHHHHHHHHHHhhCCcc-ccchHHHHHHHHHHHHHHHHH Confidence 87644 368899999875 467889999999999999999998654 679998 788899999999999999999 Q ss_pred HhcCCccCccceeccccccCCCCCEEEEEEEEEEeeeeeeEEEEEEeC Q lcl|NC_019421. 426 MSQGIISEFNVDIDTELQATAKADEFYWKWDAVKVDVMKKIYGTGYLG 473 (473) Q Consensus 426 ~~~g~i~~~~~~~D~~~~~~~~~d~~~v~i~v~p~~~~e~i~~t~~v~ 473 (473) +++|+|++|.+. |..+. .+.|.++|++.++|+++|||||+|+++. T Consensus 537 ~~~gaI~~~~~~-dv~v~--~~~d~~~v~~~v~Pv~~mekIy~tv~~~ 581 (587) T protein:vir:99 537 KRDNEIQDFPAE-DVQVI--VEGNEARISMTVYPIRSFKKISVSLVYK 581 (587) T ss_pred HhCCcccCCCcc-ceEEE--ecCCEEEEEEEEEEcccceEEEEEEEEE Confidence 999999999763 55444 4667899999999999999999999998 No 5 >protein:vir:95741 Length: 587 # NCBI annotation: ORF011 # Family: family:all:2449 # MgeID: mge:1577 # MgeName: G1 # Cross-refs: genbank:acc:YP_240910;genbank:gi:66394960;genbank:GeneID:5132682 Probab=100.00 E-value=4.6e-93 Score=526.80 Aligned_cols=446 Identities=17% Similarity=0.216 Sum_probs=352.3 Q ss_pred CCccccCCCCceecCceeEEEecCCcceecccCceEEEEEEeeCCCCCCceEEeeccHHHHHHHcCCCcCcHHHHHHHHH Q lcl|NC_019421. 1 MATGTWNEKERKEIPGFYNRFKTQAEKSTNTGLKGRLAMPIRANWGDVGKVVTIKNDLRQLKNLFGDDMNYSAFKLGKLA 80 (473) Q Consensus 1 m~~g~~~~~~~~~~PGvYie~~~~~~~~i~~~~~~~~~~~g~a~~Gp~~~~v~i~s~~~~~~~~fG~~~~~~~~~~v~~~ 80 (473) ||.+.|. +++..|||||||+.+++.++++++++++++|+|.+++||+++|++++++ +++++.||.+ +++.+++++ T Consensus 1 ~a~~~~~-~~~~~~pgv~~~~~~~~~~~~~~~~~~~~~~~g~a~~G~~~~~~~~~~~-~~~~~~~~~g---~l~~~~~~a 75 (587) T protein:vir:95 1 MAVEPFP-RRPITRPHASIEVDTSGIGGSAGSSEKVFCLIGQAEGGEPNTVYELRNY-SQAKRLFRSG---ELLDAIELA 75 (587) T ss_pred CcccccC-CcccccCceEEEEecCCccccCCCCCceEEEEEEeCCCCCceeEEeccH-HHHHHHhcCc---chHHHHHHH Confidence 9999997 7999999999999999999999999999999999999999999999995 6699999875 477778777 Q ss_pred H----hcCCCEEEEEecCCCcccceeeeecccccccccceEEEEecCccccceeEEEeec---cCCccceeeeeecCCc- Q lcl|NC_019421. 81 L----LGNVKELLLYRLVDGNQKKGTLTLKDTTENSAKDVIKLETKYPTARNFNVTIKSN---LVDSDKKDFIFFENTK- 152 (473) Q Consensus 81 f----~~g~~~v~v~rv~~g~~~aat~~l~~~~~~~~~~~l~i~A~~~G~~~n~i~v~~~---~~~~~~~~v~v~~~~~- 152 (473) | .+|+++||++|+.++. +|+.++ +.|+++|+.||.|||.|+|+.. ..++.++.+....+.. T Consensus 76 ~~~~~~~g~~~~~~~rv~~~~--~a~~~~---------~~l~~~a~~~G~~gN~i~v~~~~~~~~~~~~~~~~~~~~~~~ 144 (587) T protein:vir:95 76 WGSNPNYTAGRILAMRIEDAK--PASAEI---------GGLKITSKIYGNVANNIQVGLEKNTLSDSLRLRVIFQDDRFN 144 (587) T ss_pred hccccCCCceEEEEEEcCCCc--eeEEEe---------cCeEEEEecccccccceEEEEecCCCCCceeEEEEEecccce Confidence 7 5899999999996554 455554 3489999999999999998632 2223333332221110 Q ss_pred ee--------------------------------------------eEEEecccc----hhhhhhhhhcc---------- Q lcl|NC_019421. 153 QL--------------------------------------------FSSSIKGTI----DEIVLEINSNL---------- 174 (473) Q Consensus 153 ~~--------------------------------------------~~~~~~~~~----~~~~~~~~~~~---------- 174 (473) ++ ....+.... ..+...+.... T Consensus 145 ~~~~~~g~v~si~y~g~~~~~~~~v~~~~~t~~a~~~~l~~g~~~v~~yrL~~g~~~~~~~~~~~in~~~~~tAky~g~~ 224 (587) T protein:vir:95 145 EVYDNIGNIFTIKYKGEEANATFSVEHDEETQKASRLVLKVGDQEVKSYDLTGGAYDYTNAIITDINQLPDFEAKLSPFG 224 (587) T ss_pred eeeeeccceeeeeeeccccccceeeeecccceeeeeeeeecCCceEEEEEecCCchHHHHHHHHhhccccceEEEEeccc Confidence 00 000000000 00000000000 Q ss_pred c------------------------------------ccceeEeec-----------------------ccCCccccccc Q lcl|NC_019421. 175 D------------------------------------NEYVIATKV-----------------------ADSDTILANVV 195 (473) Q Consensus 175 ~------------------------------------s~~v~~~~~-----------------------~~~~~~~~~~~ 195 (473) . .+++.+... ....+.++... T Consensus 225 ~~~i~~~~~~~~~~~~v~~~~~~v~a~~~d~~~~~~~~~~v~~~~~~g~~~~~~~~~~~~~~~~a~~~~~~~~~~~a~~~ 304 (587) T protein:vir:95 225 DKNLESSKLDKIENANIKDKAVYVKAVFGDLEKQTAYNGIVSFEQLNAEGEVPSNVEVEAGEESATVTATSPIKTIEPFE 304 (587) T ss_pred CceeEEeecCcccccceehhhhhhhhhhcceeeeeeceeeeeeecccccceeccchhhhhcccchheeccccccceeccc Confidence 0 000110000 11112223344 Q ss_pred eeeeccCcccccchhhHHHHHHHHhhcccceEEEEEcCCCcHHHHHHHHHHHHHHhhCCCeEEEEEcCCCCccHHHHHHh Q lcl|NC_019421. 196 NQALEGGNDGCTSITNESYLKALEEFERYSFDSFVLDGVADEALQETTKAWVAKNKELGKDILLFLGGKTEDNIKQINDK 275 (473) Q Consensus 196 ~~~l~gG~dg~~~~t~~d~~~~l~~le~~~~~~l~~p~~~~~~~~~~l~~~v~~~~~~~~~~~av~~~~~~~t~~~~~~~ 275 (473) ...|+||+||..+ .+|.++|++|+.+++++|++ .++++++|+++.+||++||++++++++|++++.+++++++.++ T Consensus 305 ~t~LtGG~dG~~~---~~y~~~l~ale~~~~~~i~~-~t~d~~v~a~l~a~vk~~~~~g~~~~aVvg~~~~~~~~~~~~~ 380 (587) T protein:vir:95 305 LTKLKGGTNGEPP---ATWADKLDKFAHEGGYYIVP-LSSKQSVHAEVASFVKERSDAGEPMRAIVGGGFNESKEQLFGR 380 (587) T ss_pred eeeeecCCCCCCc---ccHHHHHHHHHhCCcEEEEe-cCCCHHHHHHHHHHHHHHHhCCCcEEEEEcCCCCCCHHHHHHH Confidence 5679999998653 46889999999999999975 5778999999999999999999999999999999999999999 Q ss_pred hhccCCceEEEecCCceec---C--cccchHHHHHHHHHhhhcCccccccceeccCcccccccCCHHHHHHHHhCCcEEE Q lcl|NC_019421. 276 SKSFNDENIVNVGSSAYYE---N--IKYTPSEVAVYIAALSVSKGITGSICNAKTIFEEVEPRLSQSEVKECLKSGTLVL 350 (473) Q Consensus 276 ~~~~n~~~i~~~~~~~~~~---~--~~~~~~~~a~~vAG~~a~~~~~~s~t~~~~~~~~~~~~~t~~e~~~l~~~G~~~l 350 (473) +..+|++|++++.++.... + ..++++++|+|+||++|++++++|+||++++++++.++|+++|+++|+++|+++| T Consensus 381 a~~~n~ervi~v~~~~~~~~~dg~~~~~~~~~~aa~vAGl~Ag~~~~~SlT~~~i~~~~v~~~~t~~e~e~ai~~Gvl~l 460 (587) T protein:vir:95 381 QESLSNPRVSLVANSGTFVMDDGRKNHVPAYMVAVALGGLASGLEIGESITFKPLRVSSLDQIYESIDLDELNENGIISI 460 (587) T ss_pred HhhcCCCcEEEecccceEecCCCceeeechHHHHHHHHHHHhcCchhcCccceeeecccccccCCHHHHHHHHhCCeEEE Confidence 9999999999999876542 2 3478999999999999999999999999999999999999999999999999999 Q ss_pred EEcCC----EEEEEecccccccCCCCCcchhhhhhhhHHHHHHHHHHHHHH-hhcCCcccCCHHHHHHHHHHHHHHHHHH Q lcl|NC_019421. 351 DFDDG----DVIIVDDVNTFKKYVDDKNEAMGYISNIMFINTINKDTSLKR-KEFVGKIFNDATGQTTVICALKKYFEEL 425 (473) Q Consensus 351 ~~~~~----~~~i~~gi~T~~~~~~~~~~~~~~i~v~R~~d~i~~~i~~~~-~~~ig~~~N~~~~r~~i~~~i~~~l~~l 425 (473) ++.++ .++|+++|||++ .++++.|++|+++|++|+|.++||..+ ++|+|+ ||++.+|..|+++|.+||++| T Consensus 461 ~~~~~~~~~~vriv~~itT~t---~~~d~~~~~i~viRv~D~i~~dir~~~~~~~iGk-~nn~~~r~~v~~~i~~~L~~l 536 (587) T protein:vir:95 461 EFVRNRTNTFFRIVDDVTTFN---DKSDPVKAEMAVGEANDFLVSELKVQLEDQFIGT-RTINTSASIIKDFIQSYLGRK 536 (587) T ss_pred EEecCCcceEEEEeecceecc---CCCCcchhhhhhhhhHHHHHHHHHHHHHhhCCcc-ccchHHHHHHHHHHHHHHHHH Confidence 87544 368899999876 467889999999999999999998655 679998 788999999999999999999 Q ss_pred HhcCCccCccceeccccccCCCCCEEEEEEEEEEeeeeeeEEEEEEeC Q lcl|NC_019421. 426 MSQGIISEFNVDIDTELQATAKADEFYWKWDAVKVDVMKKIYGTGYLG 473 (473) Q Consensus 426 ~~~g~i~~~~~~~D~~~~~~~~~d~~~v~i~v~p~~~~e~i~~t~~v~ 473 (473) +++|+|++|.++ |..+. .+.|.++|++.++|+++|||||+++++. T Consensus 537 ~~~gaI~~~~~~-dv~v~--~~~d~~~v~~~v~Pv~~mekI~vt~~~~ 581 (587) T protein:vir:95 537 KRDNEIQDFPAE-DVQVI--VEGNEARISMTVYPIRSFKKISVSLVYK 581 (587) T ss_pred HhCCcccCCCcc-ceEEE--ecCCEEEEEEEEEEcccceEEEEEEEEe Confidence 999999999773 54443 5667899999999999999999999998 No 6 >protein:vir:63742 Length: 562 # NCBI annotation: Tsh # Family: family:all:2449 # MgeID: mge:1517 # MgeName: P100 # Cross-refs: genbank:gi:82547629;genbank:GeneID:3783475 Probab=100.00 E-value=6.7e-91 Score=514.96 Aligned_cols=447 Identities=16% Similarity=0.186 Sum_probs=359.3 Q ss_pred CCccccCCCCceecCceeEEEecCCcceecccCceEEEEEEeeCCCCCCceEEeeccHHHHHHHcCCCcCcHHHHHHHHH Q lcl|NC_019421. 1 MATGTWNEKERKEIPGFYNRFKTQAEKSTNTGLKGRLAMPIRANWGDVGKVVTIKNDLRQLKNLFGDDMNYSAFKLGKLA 80 (473) Q Consensus 1 m~~g~~~~~~~~~~PGvYie~~~~~~~~i~~~~~~~~~~~g~a~~Gp~~~~v~i~s~~~~~~~~fG~~~~~~~~~~v~~~ 80 (473) |+=--|. +++-.+||||+||.+++.++++++++++++|+|.|++||+|+|++|++ ++|+++.||.+ +++.++.+| T Consensus 1 ~~~~~~~-~~~~~~Pgv~~~~~~s~~~~~~~~~~~~~~~iG~a~~G~~~~~~~~~~-~~~~~~~fg~g---~l~~~i~~a 75 (562) T protein:vir:63 1 MAIEIYP-RKPVSRPHTEISVDTSGIGGSSSGSEKILCLVGSATGGKPNAVYKVRN-YSQAKSVFRSG---ELLDAIERA 75 (562) T ss_pred CeeeeeC-CCcccCCceEEEEecCCCcccCCCCCceEEEEEEeCCCCCceeEEEcc-HHHHHHHhcCC---chHHHHHHh Confidence 8888894 688888999999999999999999999999999999999999999999 67799999876 466778777 Q ss_pred H----hcCCCEEEEEecCCCcccceeeeecccccccccceEEEEecCccccceeEEEeecc---CCccceeeee------ Q lcl|NC_019421. 81 L----LGNVKELLLYRLVDGNQKKGTLTLKDTTENSAKDVIKLETKYPTARNFNVTIKSNL---VDSDKKDFIF------ 147 (473) Q Consensus 81 f----~~g~~~v~v~rv~~g~~~aat~~l~~~~~~~~~~~l~i~A~~~G~~~n~i~v~~~~---~~~~~~~v~v------ 147 (473) | .|||.+||++|+.+ +++|+.+. +.++++|+.+|.|+|++.|+... ....+|.+.. T Consensus 76 ~~~~~~~g~~~~~~~rv~~--a~~a~~~~---------~~~~~~a~~~g~~~n~i~v~~~~~~~~~~~~~~v~~~~~~~~ 144 (562) T protein:vir:63 76 WNPGEGTGAGDILAMRVEE--AKEATFEA---------EGVKVSSTIYGADANDIQVALEDNTITGTKRLSIVFAKERVN 144 (562) T ss_pred ccccccCCceEEEEEEcCC--CccceeEe---------cceeEEEeecccCCCeEEEEEecCCCCCCcceEEEecCCCcc Confidence 7 68999999999944 45555543 45899999999999999887521 1111222110 Q ss_pred ---------------------------------------ecCCceeeEEEeccc----chhhhhhh-------------- Q lcl|NC_019421. 148 ---------------------------------------FENTKQLFSSSIKGT----IDEIVLEI-------------- 170 (473) Q Consensus 148 ---------------------------------------~~~~~~~~~~~~~~~----~~~~~~~~-------------- 170 (473) ..+...+....+... ...+...+ T Consensus 145 ev~~~~g~V~~i~y~g~~~~~~~~v~~~~~~~~a~~l~~~~g~~~v~~~~L~~g~~~~~~~l~~~in~~~~~~aky~~~~ 224 (562) T protein:vir:63 145 QVYDNLGSIFSIKYKGTEASATFTVAVDPVTFKATKLTLKAGDKTVKEYDLGSGAYAETNVLISDINNLPDFEAKFFPIG 224 (562) T ss_pred hhhhhccceeeeeeecccccceEEEEecCcceeEEEEEeecCCcceeEEEecCCccchhHHHHHhhccccceEEEeeccC Confidence 011111111000000 00000000 Q ss_pred --------------------------------hhcccccceeEeecccCCccccccceeeeccCcccccchhhHHHHHHH Q lcl|NC_019421. 171 --------------------------------NSNLDNEYVIATKVADSDTILANVVNQALEGGNDGCTSITNESYLKAL 218 (473) Q Consensus 171 --------------------------------~~~~~s~~v~~~~~~~~~~~~~~~~~~~l~gG~dg~~~~t~~d~~~~l 218 (473) ..+..+.++.+... ..+.+++.+...|+||+||+.+ .+|.++| T Consensus 225 gn~i~~~~~d~~~~~~vkt~~~~v~t~~~d~~~~~~~~~~v~~~~~--~~~~la~~~~~~LtGG~dGt~~---~~~~~al 299 (562) T protein:vir:63 225 DKNLTTDNFDAQIDVDIKTKEAYVKAVGGDIEKQTAYNGYVDFEFD--RSKEIANFPLTKLTGGDNGTIP---ESWADKF 299 (562) T ss_pred Cceeeeeccccccccchhhhhhhhhhhhhhhhhcccccceeeeeec--cccceecccceeeecCCCCCch---hhHHHHH Confidence 00112334443322 3356777788999999999754 4578888 Q ss_pred HhhcccceEEEEEcCCCcHHHHHHHHHHHHHHhhCCCeEEEEEcCCCCccHHHHHHhhhccCCceEEEecCCceecC--- Q lcl|NC_019421. 219 EEFERYSFDSFVLDGVADEALQETTKAWVAKNKELGKDILLFLGGKTEDNIKQINDKSKSFNDENIVNVGSSAYYEN--- 295 (473) Q Consensus 219 ~~le~~~~~~l~~p~~~~~~~~~~l~~~v~~~~~~~~~~~av~~~~~~~t~~~~~~~~~~~n~~~i~~~~~~~~~~~--- 295 (473) ++|+.++++++++ .++++++|+++.+||++||++++++++|++++.+++++++.+++..+|++|++++.++....+ T Consensus 300 ~ale~~~~~~i~~-~t~d~av~~~l~a~vkr~~~~g~~~~aVlg~~~~~~~~~~~~~a~~~n~ervv~v~~~~~~~~~~~ 378 (562) T protein:vir:63 300 SYFANEGGYYLVP-LTSKQAVHAEALQFVRDCSYNGNPMRVFVGGGIGESMEQLFTRAIGLQNERAGLIGFSGTVKMDDG 378 (562) T ss_pred HHHHhCCcEEEEe-cCCCHHHHHHHHHHHHHHHhCCCcEEEEecCCCCCCHHHHHHHhhhcCCCcEEEEecCeeEECCCC Confidence 8999999988864 578899999999999999999999999999999999999999999999999999999876533 Q ss_pred --cccchHHHHHHHHHhhhcCccccccceeccCcccccccCCHHHHHHHHhCCcEEEEEcCC-EEEEEecccccccCCCC Q lcl|NC_019421. 296 --IKYTPSEVAVYIAALSVSKGITGSICNAKTIFEEVEPRLSQSEVKECLKSGTLVLDFDDG-DVIIVDDVNTFKKYVDD 372 (473) Q Consensus 296 --~~~~~~~~a~~vAG~~a~~~~~~s~t~~~~~~~~~~~~~t~~e~~~l~~~G~~~l~~~~~-~~~i~~gi~T~~~~~~~ 372 (473) ..++++++|+|+||++|++++++|+||++++++++..+|+++|+++|+++|+++|+++++ .+++.+++|++++++.+ T Consensus 379 ~~~~~~~~~~aa~vAGl~A~~~~~~SlT~~~i~~~~v~~~~t~~e~~~li~~Gv~~l~~~~~~~v~~~~iv~~itT~t~~ 458 (562) T protein:vir:63 379 RSLKMPGYMFAAQVAGLTCGLEIGEAITFKNIAIETLDTIYEGSQLDQLNESGIITAEFVRNRAVTNFRIVDDVTTFNDK 458 (562) T ss_pred ceeeechhHHHHHHHHHhhcCchhcCccceeeccccccccCCHHHHHHHHhCCeEEEEEecCCcEEEEEeeccceecCCC Confidence 358899999999999999999999999999999999999999999999999999998654 56666777888777888 Q ss_pred CcchhhhhhhhHHHHHHHHHHHHHH-hhcCCcccCCHHHHHHHHHHHHHHHHHHHhcCCccCccceeccccccCCCCCEE Q lcl|NC_019421. 373 KNEAMGYISNIMFINTINKDTSLKR-KEFVGKIFNDATGQTTVICALKKYFEELMSQGIISEFNVDIDTELQATAKADEF 451 (473) Q Consensus 373 ~~~~~~~i~v~R~~d~i~~~i~~~~-~~~ig~~~N~~~~r~~i~~~i~~~l~~l~~~g~i~~~~~~~D~~~~~~~~~d~~ 451 (473) +++.|++|+++|++|+|.++||..+ ++|+|+ ||+..+|.+|+++|.+||++|+++|+|++|++. |..+ ..+.|.+ T Consensus 459 ~~~~~~ki~viRv~D~i~~dir~~~~~~yiGk-~Nn~~~r~~v~~~i~~~L~~l~~~gaI~~~~~~-dv~v--~~~~d~~ 534 (562) T protein:vir:63 459 TDPVKSEIGVGEANDFLVSELKISLDNEYIGT-KIIDTSASLVKNFVQSFLDRKKLAKEIQDYSPE-EVQV--VIEGDVA 534 (562) T ss_pred CCchhhhhhhhHHHHHHHHHHHHHHHhcCCcc-ccChHHHHHHHHHHHHHHHHHHhCCcccCCCcc-ceEE--EecCCEE Confidence 9999999999999999999998654 579999 788999999999999999999999999999763 4443 3566789 Q ss_pred EEEEEEEEeeeeeeEEEEEEeC Q lcl|NC_019421. 452 YWKWDAVKVDVMKKIYGTGYLG 473 (473) Q Consensus 452 ~v~i~v~p~~~~e~i~~t~~v~ 473 (473) +|++.++|+++|||||+|+++. T Consensus 535 ~v~~~v~pv~~mekIy~ti~~~ 556 (562) T protein:vir:63 535 RISLTVFPIRSMKKIEVSLVYR 556 (562) T ss_pred EEEEEEEEcccceEEEEEEEEe Confidence 9999999999999999999999 No 7 >protein:vir:80488 Length: 562 # NCBI annotation: Tsh # Family: family:all:2449 # MgeID: mge:1883 # MgeName: A511 # Cross-refs: genbank:acc:YP_001468473;genbank:gi:157325048;genbank:GeneID:5601446 Probab=100.00 E-value=6.4e-90 Score=509.58 Aligned_cols=447 Identities=16% Similarity=0.180 Sum_probs=357.5 Q ss_pred CCccccCCCCceecCceeEEEecCCcceecccCceEEEEEEeeCCCCCCceEEeeccHHHHHHHcCCCcCcHHHHHHHHH Q lcl|NC_019421. 1 MATGTWNEKERKEIPGFYNRFKTQAEKSTNTGLKGRLAMPIRANWGDVGKVVTIKNDLRQLKNLFGDDMNYSAFKLGKLA 80 (473) Q Consensus 1 m~~g~~~~~~~~~~PGvYie~~~~~~~~i~~~~~~~~~~~g~a~~Gp~~~~v~i~s~~~~~~~~fG~~~~~~~~~~v~~~ 80 (473) ||=--|. +++-.+||||+|+..++.++++++++++++|+|.|++||+|+|++|++ ++|+++.||.+ +++.++.+| T Consensus 1 ~~~~~~~-~~~~~~pgv~~~~~~s~~~~~~~~~~~~~~~ig~a~~G~~~~~~~~~~-~~~~~~~f~~g---~l~~~i~~a 75 (562) T protein:vir:80 1 MAIEIYP-RKPVSRPHTEISVDTSGIGGSSSGSEKILCLVGSATGGKPNAVYKVRN-YSQAKSVFRSG---ELLDAIERA 75 (562) T ss_pred CeeeeeC-CCcccCCceEEEEecCCcccCCCCCCceEEEEEEeCCCCcceeEEEcc-HHHHHHHhcCC---ChHHHHHHh Confidence 8888884 688889999999999999999999999999999999999999999999 67899999876 466677777 Q ss_pred H----hcCCCEEEEEecCCCcccceeeeecccccccccceEEEEecCccccceeEEEeeccC---Cccceeee------- Q lcl|NC_019421. 81 L----LGNVKELLLYRLVDGNQKKGTLTLKDTTENSAKDVIKLETKYPTARNFNVTIKSNLV---DSDKKDFI------- 146 (473) Q Consensus 81 f----~~g~~~v~v~rv~~g~~~aat~~l~~~~~~~~~~~l~i~A~~~G~~~n~i~v~~~~~---~~~~~~v~------- 146 (473) | .|||.+||++|+.+ +++|+.+. +.+++++..+|.|+|++.|+.-.. ....|.+. T Consensus 76 ~~~~~~~g~~~~~~~rv~~--a~~a~~~~---------~~~~~~~~~~g~~~n~i~v~~~~~~~~~~~~~~v~~~~~~~~ 144 (562) T protein:vir:80 76 WNPGEGTGAGDILAMRVEE--AKEATFEA---------EGVKVSSTIYGADANDIQVALEDNTITGTKRLSIVFAKERVN 144 (562) T ss_pred cccccccCceEEEEEEcCC--CCcceEEe---------cceEEEEeecccCCCceEEEEecCCCCCCcceEEEecCCcce Confidence 7 58999999999954 44455443 468999999999999988874211 11112110 Q ss_pred --------------------------------------eecCCceeeEEEeccc----chhhhhhh-------------- Q lcl|NC_019421. 147 --------------------------------------FFENTKQLFSSSIKGT----IDEIVLEI-------------- 170 (473) Q Consensus 147 --------------------------------------v~~~~~~~~~~~~~~~----~~~~~~~~-------------- 170 (473) +..+...+....+... ...+...+ T Consensus 145 ev~~~~g~v~~i~y~g~~~~a~~~i~~~~~~~~a~~l~~~~g~~~v~~~~l~~g~~~~~~~l~~~i~~~~~~tAky~g~~ 224 (562) T protein:vir:80 145 QVYDNLGSIFSIKYKGTEASATFTVAVDPVTFKATKLTLKAGDKTVKEYDLGSGAYAETNVLISDINNLPDFEAKFFPIG 224 (562) T ss_pred EEeeccCceeeeeeccccccceeEEEecCccceEEEEEEecCCcceeEEEeCCCccchhhhhhhhhccccceEEEecccC Confidence 0001111111111000 00000000 Q ss_pred --------------------------------hhcccccceeEeecccCCccccccceeeeccCcccccchhhHHHHHHH Q lcl|NC_019421. 171 --------------------------------NSNLDNEYVIATKVADSDTILANVVNQALEGGNDGCTSITNESYLKAL 218 (473) Q Consensus 171 --------------------------------~~~~~s~~v~~~~~~~~~~~~~~~~~~~l~gG~dg~~~~t~~d~~~~l 218 (473) ..+..+.++.+... ..+.++..+...|+||+||+.+ .+|.++| T Consensus 225 ~n~i~~~~~d~~~~~~~kt~~~~v~~~~~d~~~~~~~n~~v~~~~~--~~~~la~~~~~~LtGG~dG~~~---~~~~dal 299 (562) T protein:vir:80 225 DKNLTTDNFDAQIDVDIKTKEAYVKAVGGDIEKQTAYNGYVEFEFD--RSKEIANFPLTKLTGGDNGTIP---ESWADKF 299 (562) T ss_pred CceeeecccccchhhhcccceeeeeehhhhhhhcccccceEEEEec--cCccccccceeeeeCCCCCCcc---ccHHHHH Confidence 00112333433322 3456777788999999999754 4577888 Q ss_pred HhhcccceEEEEEcCCCcHHHHHHHHHHHHHHhhCCCeEEEEEcCCCCccHHHHHHhhhccCCceEEEecCCceecC--- Q lcl|NC_019421. 219 EEFERYSFDSFVLDGVADEALQETTKAWVAKNKELGKDILLFLGGKTEDNIKQINDKSKSFNDENIVNVGSSAYYEN--- 295 (473) Q Consensus 219 ~~le~~~~~~l~~p~~~~~~~~~~l~~~v~~~~~~~~~~~av~~~~~~~t~~~~~~~~~~~n~~~i~~~~~~~~~~~--- 295 (473) ++|+.++++++++ .++++++|+++.+||++||++++++++|++++.+++++++.+++..+|++|++++.++....+ T Consensus 300 ~~Le~~~~~~i~~-~t~d~ai~~~~~a~vkr~r~~g~~~~aVvg~~~~~~~~~~~~~a~~~n~e~vv~v~~~~~~~~~~~ 378 (562) T protein:vir:80 300 SYFANEGGYYLVP-LTSKQAVHAEALQFVRDCSYNGNPMRVFVGGGIGESMEQLFTRAIGLQNERAGLIGFSGTVKMDDG 378 (562) T ss_pred HHHHhCCcEEEEe-cCCChHHHHHHHHHHHHHHhCCCeEEEEecCCCCCCHHHHHHHhhhcCCCeEEEEecCeeEECCCC Confidence 8999999998875 467899999999999999999999999999999999999999999999999999998765422 Q ss_pred --cccchHHHHHHHHHhhhcCccccccceeccCcccccccCCHHHHHHHHhCCcEEEEEcCC-EEEEEecccccccCCCC Q lcl|NC_019421. 296 --IKYTPSEVAVYIAALSVSKGITGSICNAKTIFEEVEPRLSQSEVKECLKSGTLVLDFDDG-DVIIVDDVNTFKKYVDD 372 (473) Q Consensus 296 --~~~~~~~~a~~vAG~~a~~~~~~s~t~~~~~~~~~~~~~t~~e~~~l~~~G~~~l~~~~~-~~~i~~gi~T~~~~~~~ 372 (473) ..++++++|+|+||++|++++++|+||++++++++..+|+++|+++|+++|+++|+++++ .+++.+++|++++++.+ T Consensus 379 ~~~~~~~~~~aa~vAGl~Ag~~~~~S~T~~~i~~~~v~~~lt~~e~~~li~~G~l~l~~~~~~~v~~~riv~~itT~t~~ 458 (562) T protein:vir:80 379 RSLKMPGYMFAAQVAGLTCGLEIGEAITFKNIAIETLDTIYEGSQLDQLNESGIITAEFVRNRAVTNFRIVDDVTTFNDK 458 (562) T ss_pred ceeeechhHHHHHHHHHHhcCccccCccceeeccccccccCCHHHHHHHHhCCeEEEEEecCCcEEEEEeeccceeccCC Confidence 358899999999999999999999999999999999999999999999999999998654 46666777777777788 Q ss_pred CcchhhhhhhhHHHHHHHHHHHHHH-hhcCCcccCCHHHHHHHHHHHHHHHHHHHhcCCccCccceeccccccCCCCCEE Q lcl|NC_019421. 373 KNEAMGYISNIMFINTINKDTSLKR-KEFVGKIFNDATGQTTVICALKKYFEELMSQGIISEFNVDIDTELQATAKADEF 451 (473) Q Consensus 373 ~~~~~~~i~v~R~~d~i~~~i~~~~-~~~ig~~~N~~~~r~~i~~~i~~~l~~l~~~g~i~~~~~~~D~~~~~~~~~d~~ 451 (473) +++.|++|+++|++|+|.++||..+ +.|+|| ||++.+|.+|+++|.+||++|+++|+|++|.+. |..+. .+.|.+ T Consensus 459 ~~~~~~ki~viRv~D~i~~dir~~~~~~yIGk-~Nn~~~r~~v~~~i~~~L~~l~~~gaI~~~~~~-dv~v~--~~~d~~ 534 (562) T protein:vir:80 459 TDPVKSEIGVGEANDFLVSELKISLDNEYIGT-KIIDTSASLVKNFVQSFLDRKKLAKEIQDYSPE-EVQVV--IEGDIA 534 (562) T ss_pred CCchhhhhhhhHHHHHHHHHHHHHHHhcCCcc-ccChHHHHHHHHHHHHHHHHHHhCCcccCCCcc-ceEEE--ecCCEE Confidence 8999999999999999999998665 579999 788899999999999999999999999999753 44433 566789 Q ss_pred EEEEEEEEeeeeeeEEEEEEeC Q lcl|NC_019421. 452 YWKWDAVKVDVMKKIYGTGYLG 473 (473) Q Consensus 452 ~v~i~v~p~~~~e~i~~t~~v~ 473 (473) +|++.++|+++|||||+|+++. T Consensus 535 ~v~~~v~Pv~~mekIy~ti~~~ 556 (562) T protein:vir:80 535 RISLTVFPIRSMKKIEVSLVYR 556 (562) T ss_pred EEEEEEEEcccceEEEEEEEEE Confidence 9999999999999999999999 No 8 >protein:vir:80779 Length: 569 # NCBI annotation: putative tail sheath protein # Family: family:all:2449 # MgeID: mge:1885 # MgeName: phiEF24C # Cross-refs: genbank:acc:YP_001504132;genbank:gi:158079319;genbank:GeneID:5666428 Probab=100.00 E-value=6.6e-89 Score=504.01 Aligned_cols=447 Identities=15% Similarity=0.185 Sum_probs=359.1 Q ss_pred CCccccCCCCceecCceeEEEecCCcceecccCceEEEEEEeeCCCCCCceEEeeccHHHHHHHcCCCcCcHHHHHHHHH Q lcl|NC_019421. 1 MATGTWNEKERKEIPGFYNRFKTQAEKSTNTGLKGRLAMPIRANWGDVGKVVTIKNDLRQLKNLFGDDMNYSAFKLGKLA 80 (473) Q Consensus 1 m~~g~~~~~~~~~~PGvYie~~~~~~~~i~~~~~~~~~~~g~a~~Gp~~~~v~i~s~~~~~~~~fG~~~~~~~~~~v~~~ 80 (473) ||.--| ++++..|||||+|+..++.++++++++++++|+|.|++||+|+|++|++ |+|+++.||.+ ++..++.++ T Consensus 1 ~~~~~~-~~~~~~~Pgv~~~~~~~~~~~~~~~~~~~~~~ig~a~~G~~~~~~~~~~-~~~~~~~f~~g---~l~~a~~~a 75 (569) T protein:vir:80 1 MAVEQF-PRKKVSRPHTEITVDTSGIGGSSSSSDKTLMLVGSAKGGKPDTVYRFRN-YQQAKQVLRSG---DLLDAIELA 75 (569) T ss_pred Ceeeee-cCCccccCceEEEEecCCCcCCCCCCceeEEEEEEeCCCCCceeEEecC-HHHHHHHhcCC---chhHHHHhh Confidence 777777 5899999999999999999999999999999999999999999999998 67899999875 467777787 Q ss_pred H------hcCCCEEEEEecCCCcccceeeeecccccccccceEEEEecCccccceeEEEeecc---CCccceeeeeecCC Q lcl|NC_019421. 81 L------LGNVKELLLYRLVDGNQKKGTLTLKDTTENSAKDVIKLETKYPTARNFNVTIKSNL---VDSDKKDFIFFENT 151 (473) Q Consensus 81 f------~~g~~~v~v~rv~~g~~~aat~~l~~~~~~~~~~~l~i~A~~~G~~~n~i~v~~~~---~~~~~~~v~v~~~~ 151 (473) | .+|+.+||++|+.+. ++|+.+ .+.+++++...|.|+|++.++... .+..++.+....+. T Consensus 76 ~~~~~~~~~~~~~~~~~rv~~a--~~a~~~---------~~~~~~~a~~~g~~~n~i~v~l~~~~~~~~~~~~v~~~~~~ 144 (569) T protein:vir:80 76 WNASDVNTASAGDILAVRVEDA--KNATLT---------KGGLTFASTIYGVDANEIQVALEDNNLTHTKRLTVAFSKDG 144 (569) T ss_pred ccCccccccCceEEEEEEcCCC--eeeeee---------ccceeeeeeeccCCCceEEEEEecCcCCcceeeEEeeecCC Confidence 6 478889999999554 334433 246899999999999998876421 11111211100000 Q ss_pred c----------------------------------------------e----ee-------------------------E Q lcl|NC_019421. 152 K----------------------------------------------Q----LF-------------------------S 156 (473) Q Consensus 152 ~----------------------------------------------~----~~-------------------------~ 156 (473) . + +. + T Consensus 145 ~~~~~~~ig~v~si~ytg~~~~a~~~~~~~~~~~~a~~l~~~~g~~~~~~~~v~~~~~~~~~~~~~~~lv~~~~~~~~f~ 224 (569) T protein:vir:80 145 YKKVFDNLGKIFSIQYKGSEAQANFTIAQDSISKKATTLTLNVGSEPESTTEVMKYELGQGVYSETNVLVSAINSLPDWE 224 (569) T ss_pred CccccccccceeeEEEeeccccceEEeecCcCcceeEEEEEEecCCcceeEEEEeeccCCccchhhhhhhhhcCCccCce Confidence 0 0 00 0 Q ss_pred EEeccc---------chhh---------------hhh-hhhcccccceeEeecccCCccccccceeeeccCcccccchhh Q lcl|NC_019421. 157 SSIKGT---------IDEI---------------VLE-INSNLDNEYVIATKVADSDTILANVVNQALEGGNDGCTSITN 211 (473) Q Consensus 157 ~~~~~~---------~~~~---------------~~~-~~~~~~s~~v~~~~~~~~~~~~~~~~~~~l~gG~dg~~~~t~ 211 (473) ..+... .++. ... ......+++++++.. .+++++..+...|+||+||+. . T Consensus 225 a~~~~~~~~~~~~~~~d~~~~~~~~t~~~~~~~~~~di~~~~~~~~~v~~~~~--~~~~l~~~~~~~LtGG~dG~~---~ 299 (569) T protein:vir:80 225 AKFFPIGDKNLPTDALEAVTKVDVKTEAVFVGALAGDIAKQLEYNDYVTVAVD--ATKPVEDFELTNLTGGSDGTA---P 299 (569) T ss_pred EEEEecCCCcceehhccchhheeccccceeeehhHHHHHHhhcCCceEEEEec--CCcceeeecceeecCCCCCCc---c Confidence 000000 0000 000 001123445555432 345678888899999999864 3 Q ss_pred HHHHHHHHhhcccceEEEEEcCCCcHHHHHHHHHHHHHHhhCCCeEEEEEcCCCCccHHHHHHhhhccCCceEEEecCCc Q lcl|NC_019421. 212 ESYLKALEEFERYSFDSFVLDGVADEALQETTKAWVAKNKELGKDILLFLGGKTEDNIKQINDKSKSFNDENIVNVGSSA 291 (473) Q Consensus 212 ~d~~~~l~~le~~~~~~l~~p~~~~~~~~~~l~~~v~~~~~~~~~~~av~~~~~~~t~~~~~~~~~~~n~~~i~~~~~~~ 291 (473) .+|.++|++||.+++++++++ ++++++|+++++||++||++++++++|++++.+.+++++.+++..+|++|++++.++. T Consensus 300 ~~~~~~l~~le~~~~~~i~~~-t~d~av~~~l~a~vkr~r~~g~~~~aVvg~~~~~~~~~~~~~a~~~n~e~vv~v~~~~ 378 (569) T protein:vir:80 300 ESWANKFPLLANEGGYYLVPL-TDKQAVHSEALAFVKDRTDNGDPMRIIVGGGTNETVEESITRATNLRDPRASLVGFSG 378 (569) T ss_pred chHHHHHHHHhhCCcEEEEec-CCChHHHHHHHHHHHHHHhCCCcEEEEecCCCCCCHHHHHHHHhhcCCCeEEEEecCc Confidence 468899999999999999764 6789999999999999999999999999999999999999999999999999999886 Q ss_pred eecC-----cccchHHHHHHHHHhhhcCccccccceeccCcccccccCCHHHHHHHHhCCcEEEEEcCC-EEEEEecccc Q lcl|NC_019421. 292 YYEN-----IKYTPSEVAVYIAALSVSKGITGSICNAKTIFEEVEPRLSQSEVKECLKSGTLVLDFDDG-DVIIVDDVNT 365 (473) Q Consensus 292 ~~~~-----~~~~~~~~a~~vAG~~a~~~~~~s~t~~~~~~~~~~~~~t~~e~~~l~~~G~~~l~~~~~-~~~i~~gi~T 365 (473) ...+ ..++++++|+|+||++|++++++|+||++++++++..+|+++|+++|+++|+++|++.++ .+++.++||+ T Consensus 379 ~~~~~~g~~~~~~~~~~aa~vAG~~A~~~~~~S~T~k~i~~~~i~~~lt~~e~~~li~~G~~~l~~~~~~~~~v~~~vn~ 458 (569) T protein:vir:80 379 TRKMDDGRLLKLPGYMMASQIAGIASGLEVGEAITFKHFNVTSVDRVFESSQLDMLNESGVISIEFVRNRTLTAFRVVQD 458 (569) T ss_pred eeecCCCcceeechhhHHHHHHHHHhcCccccCccceeeccccccccCCHHHHHHHHhCCeEEEEEecCceEEEEEEecc Confidence 4422 468999999999999999999999999999999999999999999999999999988654 5777788888 Q ss_pred cccCCCCCcchhhhhhhhHHHHHHHHHHHHHH-hhcCCcccCCHHHHHHHHHHHHHHHHHHHhcCCccCccceecccccc Q lcl|NC_019421. 366 FKKYVDDKNEAMGYISNIMFINTINKDTSLKR-KEFVGKIFNDATGQTTVICALKKYFEELMSQGIISEFNVDIDTELQA 444 (473) Q Consensus 366 ~~~~~~~~~~~~~~i~v~R~~d~i~~~i~~~~-~~~ig~~~N~~~~r~~i~~~i~~~l~~l~~~g~i~~~~~~~D~~~~~ 444 (473) +++++.++++.|++|+++|++|+|.++||..+ +.|+|+ ||+..+|..|++.|++||++|+++|+|++|... |..+ T Consensus 459 itT~t~~~~~~~~~i~viRv~D~i~~dir~~~~~~yiGk-~nn~~~r~~v~~~i~~~L~~l~~~gaI~~~~~~-dv~v-- 534 (569) T protein:vir:80 459 VTTYNDKSDPVKNEMSVGEANDFLVSELKIELDNNFIGT-KVIDTSASLIKNFIQSFLDNKKRAREIQDYTPE-EVQV-- 534 (569) T ss_pred ceecCCCCCchhhhhhhhHHHHHHHHHHHHHHHhhcCcc-cCChhHHHHHHHHHHHHHHHHHhCCcccCCCcc-ceEE-- Confidence 88888899999999999999999999998654 679998 788899999999999999999999999999753 4433 Q ss_pred CCCCCEEEEEEEEEEeeeeeeEEEEEEeC Q lcl|NC_019421. 445 TAKADEFYWKWDAVKVDVMKKIYGTGYLG 473 (473) Q Consensus 445 ~~~~d~~~v~i~v~p~~~~e~i~~t~~v~ 473 (473) ..+.|.++|++.++|+++|||||+++++. T Consensus 535 ~~~~d~~~v~~~v~Pv~~~ekI~~ti~~~ 563 (569) T protein:vir:80 535 VLEGDVASISMTVMPIRSLNKITVQLVYK 563 (569) T ss_pred EecCCEEEEEEEEEEcccccEEEEEEEEe Confidence 35677899999999999999999999999 No 9 >protein:vir:96586 Length: 587 # NCBI annotation: ORF011 # Family: family:all:2449 # MgeID: mge:1623 # MgeName: Twort # Cross-refs: genbank:acc:YP_238553;genbank:gi:66391266;genbank:GeneID:5130368 Probab=100.00 E-value=3.1e-87 Score=494.82 Aligned_cols=449 Identities=15% Similarity=0.167 Sum_probs=345.7 Q ss_pred CCccccCCCCceecCceeEEEecCCcceecccCceEEEEEEeeCCCCCCceEEeeccHHHHHHHcCCCcCcHHHHHHHHH Q lcl|NC_019421. 1 MATGTWNEKERKEIPGFYNRFKTQAEKSTNTGLKGRLAMPIRANWGDVGKVVTIKNDLRQLKNLFGDDMNYSAFKLGKLA 80 (473) Q Consensus 1 m~~g~~~~~~~~~~PGvYie~~~~~~~~i~~~~~~~~~~~g~a~~Gp~~~~v~i~s~~~~~~~~fG~~~~~~~~~~v~~~ 80 (473) ||=--|. +++-.+||||+++..++..++.++++++++|+|.+++||+|+|++|++ ++++++.||.+ ++..++.++ T Consensus 1 ~~~~~~~-~~~~~~Pgv~~~~~~~~~~~~~~~~~~~~~~iG~a~~g~~~~~~~~~~-~~~~~~~~g~G---~l~~ai~~a 75 (587) T protein:vir:96 1 MAKDIFP-RRPIQRPHASIEVDSSGIGGSASNSEKILCLIGKAEGGEPNTVYQVRN-YAQAKSVFRSG---ELLDAIELA 75 (587) T ss_pred CeeeeeC-CCcccCCceEEEEecCCccCCCCCCCceEEEEEEecCCCCceeEEEcC-hHHHHHhhcCC---cHHHHHHHH Confidence 7777775 688889999999999999999999999999999999999999999998 56799999886 477888888 Q ss_pred H----hcCCCEEEEEecCCCcccceeeeecccccccccceEEEEecCccccceeEEEee--------------------- Q lcl|NC_019421. 81 L----LGNVKELLLYRLVDGNQKKGTLTLKDTTENSAKDVIKLETKYPTARNFNVTIKS--------------------- 135 (473) Q Consensus 81 f----~~g~~~v~v~rv~~g~~~aat~~l~~~~~~~~~~~l~i~A~~~G~~~n~i~v~~--------------------- 135 (473) | .+|+.+||++|+.+... |+++. ..+++++..+|.|+|.|.|+. T Consensus 76 ~~~~~~~g~~~~~a~rv~~~~~--a~~~~---------~~~~~~~~~~g~~~n~i~v~~~~~~~~~~~~~~~~~~~~~~~ 144 (587) T protein:vir:96 76 WGSNPQYTAGKILAMRVEDAKA--SQLEK---------GGLRVTSKIFGSVSNDIQVALEKNTITDSLRLRVVFQKDNYQ 144 (587) T ss_pred hccCcCCCceEEEEEecCCCcc--ceeec---------ccccccccccCCCCceEEEEEEeccCCCccceEEEEecCCce Confidence 8 58999999999965333 22221 123333333444444333321 Q ss_pred -------------------------------------------------------------------------------- Q lcl|NC_019421. 136 -------------------------------------------------------------------------------- 135 (473) Q Consensus 136 -------------------------------------------------------------------------------- 135 (473) T Consensus 145 ~~~~n~G~v~~i~y~g~~~~a~~~~~~~~~~~~A~~l~l~gg~~~v~~yrl~~g~~~~~~~~~~~~~~~~~~tAky~g~~ 224 (587) T protein:vir:96 145 EVFDNLGNIFSINYKGEGEKATFSVEKDKETQEAKRLVLKVDEKEVKAYELNGGAYSFTNEIITDINELPDFEAKLSPFG 224 (587) T ss_pred eeccccCceEEEEecccccceeEeeccCcccceeeeeEEEecCceEEEEEeCCCchhhhhhhhhhhccccceEEEeeccc Confidence Q ss_pred ----------ccCCccceeeeeecCCceeeEEEecccc---h-hhhhhhhhcccccceeE------eecccCCccccccc Q lcl|NC_019421. 136 ----------NLVDSDKKDFIFFENTKQLFSSSIKGTI---D-EIVLEINSNLDNEYVIA------TKVADSDTILANVV 195 (473) Q Consensus 136 ----------~~~~~~~~~v~v~~~~~~~~~~~~~~~~---~-~~~~~~~~~~~s~~v~~------~~~~~~~~~~~~~~ 195 (473) .+.+.+..++.++.++...+........ + ..............++. .......+.++... T Consensus 225 ~n~~~v~v~d~~~~~~~k~~~~y~~t~~~di~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~ 304 (587) T protein:vir:96 225 DKNLESRKLDEATDVDIKGKAVYVKAVFGDIENQTQYNQYVKFEQLPEQASEPSDVEVHAETESATVTATSKPKAIEPFE 304 (587) T ss_pred CceeEEEeeccccccccceEEEeehhhhhhhhhhhccccceeeccccchhhhhhcccccccccceeeeeccccccccccc Confidence 1222222222222222111000000000 0 00000000000000110 11122233455666 Q ss_pred eeeeccCcccccchhhHHHHHHHHhhcccceEEEEEcCCCcHHHHHHHHHHHHHHhhCCCeEEEEEcCCCCccHHHHHHh Q lcl|NC_019421. 196 NQALEGGNDGCTSITNESYLKALEEFERYSFDSFVLDGVADEALQETTKAWVAKNKELGKDILLFLGGKTEDNIKQINDK 275 (473) Q Consensus 196 ~~~l~gG~dg~~~~t~~d~~~~l~~le~~~~~~l~~p~~~~~~~~~~l~~~v~~~~~~~~~~~av~~~~~~~t~~~~~~~ 275 (473) ...|+||+||..+ .+|.++|++|+.+++++|+++ ++++++|+++++||++||++++++++|++++.+++++++.++ T Consensus 305 ~~aLtGG~dG~~~---~~y~~~l~ale~~~~~~i~~~-t~d~ai~~~l~a~vk~~r~~gk~~~aVlg~~~~~~~~~~~~~ 380 (587) T protein:vir:96 305 LTKLSGGTNGEPP---TSWSAKLEKFKNEGGYYIVPL-TDRQSVHSEVATFVKNRSDAGEPMRAIVGGGTSETKEKLFGR 380 (587) T ss_pred ceeeecCCCCCCc---ccHHHHHHHHhhCCcEEEEec-CCCHHHHHHHHHHHHHHHhCCCeEEEEecCCCCCCHHHHHHH Confidence 6789999999753 468888999999999999765 678899999999999999999999999999999999999999 Q ss_pred hhccCCceEEEecCCceec-C----cccchHHHHHHHHHhhhcCccccccceeccCcccccccCCHHHHHHHHhCCcEEE Q lcl|NC_019421. 276 SKSFNDENIVNVGSSAYYE-N----IKYTPSEVAVYIAALSVSKGITGSICNAKTIFEEVEPRLSQSEVKECLKSGTLVL 350 (473) Q Consensus 276 ~~~~n~~~i~~~~~~~~~~-~----~~~~~~~~a~~vAG~~a~~~~~~s~t~~~~~~~~~~~~~t~~e~~~l~~~G~~~l 350 (473) +..+|++|++++.++.... + ..++++++|+|+||++|++++++|+||++++++++.++|+++|+++|+++|+++| T Consensus 381 a~~~n~e~vi~v~~~~~~~~~~~~~~~~~~~~~aa~vAG~~Ag~~~~~S~T~~~~~~~~v~~~~t~~e~~~~i~~G~~~l 460 (587) T protein:vir:96 381 QAILNNPRVALVANSGKFVMGNGRILQAPAYMVASAVAGLVSGLDIGESITFKPLFVNSLDKVYESEELDELNENGIITI 460 (587) T ss_pred HhhcCCCcEEEEecceEEecCCCceeeechhhHHHHHHHHHhcCccccCccceeeecccccccCCHHHHHHHHhCCeEEE Confidence 9999999999998876542 2 3688999999999999999999999999999999999999999999999999999 Q ss_pred EEc-CCEEEEEecccccccCCCCCcchhhhhhhhHHHHHHHHHHHHHH-hhcCCcccCCHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_019421. 351 DFD-DGDVIIVDDVNTFKKYVDDKNEAMGYISNIMFINTINKDTSLKR-KEFVGKIFNDATGQTTVICALKKYFEELMSQ 428 (473) Q Consensus 351 ~~~-~~~~~i~~gi~T~~~~~~~~~~~~~~i~v~R~~d~i~~~i~~~~-~~~ig~~~N~~~~r~~i~~~i~~~l~~l~~~ 428 (473) ++. ++.+++.++||++++++.++++.|++|+++|++|+|.++||..+ +.|+|| ||++.+|.+|+++|.+||++|+++ T Consensus 461 ~~~~~~~~~v~~~vnsitT~t~~~~~~~~~i~virv~D~i~~di~~~~~~~yiGk-~nn~~~r~~v~~~i~~~L~~l~~~ 539 (587) T protein:vir:96 461 EFVRNRMTTMFRIVDDVTTFPDKNDPVKSEMALGEANDFLVSELKILLEEQYIGT-RTINTSASQIKDFVQSYLGRKKRD 539 (587) T ss_pred EEecCCcEEEEEeeccceecCCCCCchhhhhhhHHHHHHHHHHHHHHHHhcCCcc-ccCHHHHHHHHHHHHHHHHHHHhC Confidence 875 45677788999999999999999999999999999999998655 679999 789999999999999999999999 Q ss_pred CCccCccceeccccccCCCCCEEEEEEEEEEeeeeeeEEEEEEeC Q lcl|NC_019421. 429 GIISEFNVDIDTELQATAKADEFYWKWDAVKVDVMKKIYGTGYLG 473 (473) Q Consensus 429 g~i~~~~~~~D~~~~~~~~~d~~~v~i~v~p~~~~e~i~~t~~v~ 473 (473) |+|++|.+ .|..+. .+.|.++|++.++|+++|||||+|+++. T Consensus 540 g~I~~~~~-~dv~v~--~~~D~~~v~~~v~Pv~~mekIy~tv~~~ 581 (587) T protein:vir:96 540 NEIQDFPP-EDVQVI--IEGNEARISLTIFPIRALKKISVSLVYR 581 (587) T ss_pred CcccCCCc-cceEEE--ecCCEEEEEEEEEEcccceEEEEEEEEE Confidence 99999976 355444 4667899999999999999999999998 No 10 >protein:vir:100829 Length: 607 # NCBI annotation: hypothetical protein # Family: family:all:2449 # MgeID: mge:1633 # MgeName: LP65 # Cross-refs: genbank:acc:YP_164738;genbank:gi:56693151;genbank:GeneID:3197462 Probab=100.00 E-value=5.4e-82 Score=466.14 Aligned_cols=445 Identities=18% Similarity=0.214 Sum_probs=336.2 Q ss_pred CC-----ccccC---CCCceecCceeEEEecCCcceecccCceEEEEEEeeCCCCCCceEEeeccHHHHHHHcCCCcCcH Q lcl|NC_019421. 1 MA-----TGTWN---EKERKEIPGFYNRFKTQAEKSTNTGLKGRLAMPIRANWGDVGKVVTIKNDLRQLKNLFGDDMNYS 72 (473) Q Consensus 1 m~-----~g~~~---~~~~~~~PGvYie~~~~~~~~i~~~~~~~~~~~g~a~~Gp~~~~v~i~s~~~~~~~~fG~~~~~~ 72 (473) |. +..|+ ++++-.|||||+++..++..++.++++++++|+|.|+.||+|+|++|++ +++++..||.+ + T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~pgv~~~~~~~~~~~~~~~~~~~~~~iG~a~~G~~~~~~~~~~-~~~a~~~f~~g---~ 76 (607) T protein:vir:10 1 MTTTITSAESYKRIYPLFYDSRPHVETNFDDSRLSNTASDSAKNIFMLGSATNGDPTKVYEIRT-SQQATKIFGSG---D 76 (607) T ss_pred CcceecchhhHHHHhCCCCccCCceEEEEecCcCcCCCCCCcceEEEEEEeCCCCCceEEEEcc-hhHHHHhhcCc---c Confidence 22 22232 2466678999999999999999999999999999999999999999998 57799999876 4 Q ss_pred HHHHHHHHH------hcCCCEEEEEecCCCcccceeeeecccccccccceEEEEecCccccceeEEEee----------- Q lcl|NC_019421. 73 AFKLGKLAL------LGNVKELLLYRLVDGNQKKGTLTLKDTTENSAKDVIKLETKYPTARNFNVTIKS----------- 135 (473) Q Consensus 73 ~~~~v~~~f------~~g~~~v~v~rv~~g~~~aat~~l~~~~~~~~~~~l~i~A~~~G~~~n~i~v~~----------- 135 (473) +..++.++| .+|+..||++|+++...+.++. +.+++++..+|.|+|.++|+. T Consensus 77 l~~a~~~a~~~~~~~~~g~~~~~~~rv~~~~~a~~~~-----------~~~~~~~~~~~~~~~~i~~~l~~~~~~~~~~~ 145 (607) T protein:vir:10 77 LVDGIKLAFDPTGNSVTNGGTVYALRVDNAKQASLVK-----------DGLTFTSSIFGTNANQVSVALDNDVFGVPRIT 145 (607) T ss_pred hHHHHHHhhccccCCccCCceEEEEeCCCccccceec-----------ccccccccccccCCCceEEEEEecCCCcccee Confidence 777788888 6899999999996543322221 112233333333333332221 Q ss_pred -------------------------------------------------------------------------------- Q lcl|NC_019421. 136 -------------------------------------------------------------------------------- 135 (473) Q Consensus 136 -------------------------------------------------------------------------------- 135 (473) T Consensus 146 ~~~~~d~~~~~~~n~g~~~~i~y~g~~~~a~~~v~~~~~g~~~~lt~~~~~~~~~~~~V~~~~l~~~~~~t~~~l~~din 225 (607) T protein:vir:10 146 VNYSPDNYERTYTNIGQMFSITYSGKSASAGYTVSHDTDGKAILLTLGSGDSIDKLTNVATFDLTMSKYDTIAKLMQAIS 225 (607) T ss_pred EEeecccceeeeeeccceeecccCcccccccceeeecCCCceeEEEecCCCccceeeeeecccccccccchHHHHHHHhh Confidence Q ss_pred ---------------------------------ccCCccceeeeeecCCceeeEEEecccchhhhhhhhhcccccceeEe Q lcl|NC_019421. 136 ---------------------------------NLVDSDKKDFIFFENTKQLFSSSIKGTIDEIVLEINSNLDNEYVIAT 182 (473) Q Consensus 136 ---------------------------------~~~~~~~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~v~~~ 182 (473) ...++...++..+.+......++.......... .....+.++.+. T Consensus 226 ~~~~~~A~~~g~~~i~tky~d~~~~~i~V~~~~~iv~a~~~D~~~~~~~~~~~~~t~~~~~~~~~~--~~~~~~~~~~~~ 303 (607) T protein:vir:10 226 ATPNFSASVVGSPSVNTSYLDEVTSPVDVKTAPAVVTAKIGDAISKLGYDPYVVVTQTSNNKPIVN--GVSAGTGSATAS 303 (607) T ss_pred cCCceEEEEecccceeeeccccccceeEEEEeeeeechhhhhhhhcccccceEEeeecccchhhhh--hhhccccceeee Confidence 000000001111111110000000000000000 001123334445 Q ss_pred ecccCCccccccceeeeccCcccccchhhHHHHHHHHhhcccceEEEEEcCCCcHHHHHHHHHHHHHHhhCCCeEEEEEc Q lcl|NC_019421. 183 KVADSDTILANVVNQALEGGNDGCTSITNESYLKALEEFERYSFDSFVLDGVADEALQETTKAWVAKNKELGKDILLFLG 262 (473) Q Consensus 183 ~~~~~~~~~~~~~~~~l~gG~dg~~~~t~~d~~~~l~~le~~~~~~l~~p~~~~~~~~~~l~~~v~~~~~~~~~~~av~~ 262 (473) ......+.++.++...|+||+||+. +.+|.++|++|+.+++++++++ ++++++|+++++||++||++++++++|++ T Consensus 304 ~~~~~~~~~a~~a~~~LtGGtdG~~---~~ty~dal~aLe~~e~~~i~~~-t~d~ai~~~l~a~vkr~~~~g~~~~aVlg 379 (607) T protein:vir:10 304 VTTAPESFPANFDTAFLTGGSTGDV---PVSWADKFNGAIGNNVYYIIPL-TSEENIHAELQAFIDEQHVLGYNYHAFVG 379 (607) T ss_pred eeccccccccccceeeeeCCCCCCc---hhhHHHHHHHHhhcCceEEEec-CCCHHHHHHHHHHHHHHHhCCCcEEEEec Confidence 5555667788888899999999965 3468888889999999999765 67899999999999999999999999999 Q ss_pred CCCCccHHHHHHhhhccCCceEEEecCCcee-cC---cccchHHHHHHHHHhhhcCccccccceeccCcccccccCCHHH Q lcl|NC_019421. 263 GKTEDNIKQINDKSKSFNDENIVNVGSSAYY-EN---IKYTPSEVAVYIAALSVSKGITGSICNAKTIFEEVEPRLSQSE 338 (473) Q Consensus 263 ~~~~~t~~~~~~~~~~~n~~~i~~~~~~~~~-~~---~~~~~~~~a~~vAG~~a~~~~~~s~t~~~~~~~~~~~~~t~~e 338 (473) ++.+++++++.+++..+|++||+++.++... ++ ..++++++|+|+||++|++++++|+||++++++++.++|+++| T Consensus 380 ~~~~~t~~~~~t~a~~~N~ervv~V~~~~~~~~~G~~~~~~~~~~Aa~vAGl~Ag~~~~~SlT~k~i~~~~v~~~lt~~e 459 (607) T protein:vir:10 380 GGFAEPLEQILSRQVNINDSRFGLVGQSGHVQEGGESVHVPAYLMAAYVGGLSSSLGVAVPITNKKLALVDLDQNFSGDD 459 (607) T ss_pred CCCCCCHHHHHHHHHhhCCCcEEEEecCeeEeeCCcceeccHHHHHHHHHHHHhcCccccCcccceeccccccccCCHHH Confidence 9999999999999999999999999987644 22 3588999999999999999999999999999999999999999 Q ss_pred HHHHHhCCcEEEEEcCC-----EEEEEecccccccCCCCCcchhhhhhhhHHHHHHHHHHHHHH-hhcCCcccCCHHHHH Q lcl|NC_019421. 339 VKECLKSGTLVLDFDDG-----DVIIVDDVNTFKKYVDDKNEAMGYISNIMFINTINKDTSLKR-KEFVGKIFNDATGQT 412 (473) Q Consensus 339 ~~~l~~~G~~~l~~~~~-----~~~i~~gi~T~~~~~~~~~~~~~~i~v~R~~d~i~~~i~~~~-~~~ig~~~N~~~~r~ 412 (473) +++|+++|+++|+++++ .++|++||||++. .+++.|++|+++|++|+|.++||..+ ++||||.+| ...|. T Consensus 460 ~e~ai~~Gv~~l~~~~~~~~~~~vrIv~~ItT~t~---~~~~~~~~i~viRv~D~i~~dir~~~~~~yIGk~nn-d~~~~ 535 (607) T protein:vir:10 460 LNTLNQNGVIGIEHLVNRNATGGYYIVQDVSTNTV---SSSHVDGSLYLGELTDFLFDNLRFVLRDTYIGSNIR-STSAD 535 (607) T ss_pred HHHHHhCCeEEEEEccCccccceEEEeeeeeeccC---CCCcchheeehhhhHHHHHHHHHHHHhhcCCcccCC-cchHH Confidence 99999999999987654 4899999999764 56889999999999999999998654 679999755 46789 Q ss_pred HHHHHHHHHHHH--HHhcCCccCccceeccccccCCCCCEEEEEEEEEEeeeeeeEEEEEEeC Q lcl|NC_019421. 413 TVICALKKYFEE--LMSQGIISEFNVDIDTELQATAKADEFYWKWDAVKVDVMKKIYGTGYLG 473 (473) Q Consensus 413 ~i~~~i~~~l~~--l~~~g~i~~~~~~~D~~~~~~~~~d~~~v~i~v~p~~~~e~i~~t~~v~ 473 (473) +++..+.+||.. |+.+|+|++|.. .|..+. .+.|.++|++.++|+++|||||+|+++. T Consensus 536 ~vk~~i~~~L~~~~l~~~gaI~df~~-edv~v~--~~~D~v~v~~~v~Pv~~iekIyvtv~v~ 595 (607) T protein:vir:10 536 DIKSTVASYLYSEMNNDDGLIVDFSE-SDIVVT--ISGTVVYIQFAVAPTQEIKNIVVSGTYS 595 (607) T ss_pred HHHHHHHHHHHHHHHHhcCceeCCCc-cccEEe--eCCCEEEEEEEEEEcccceEEEEEEEEE Confidence 999999999865 455799999975 355444 4567899999999999999999999999 No 11 >protein:vir:102819 Length: 648 # NCBI annotation: tail sheath protein # Family: family:all:2449 # MgeID: mge:1610 # MgeName: YS40 # Cross-refs: genbank:acc:YP_874082;genbank:gi:118197689;genbank:GeneID:4496011 Probab=100.00 E-value=4.7e-76 Score=433.55 Aligned_cols=451 Identities=13% Similarity=0.068 Sum_probs=325.5 Q ss_pred CCccccCCCCceecCceeEEEecCCcceecccCceEEEEEEeeCCCCCCceEEeeccHHHHHHHcCCCcCcHHHHHHHHH Q lcl|NC_019421. 1 MATGTWNEKERKEIPGFYNRFKTQAEKSTNTGLKGRLAMPIRANWGDVGKVVTIKNDLRQLKNLFGDDMNYSAFKLGKLA 80 (473) Q Consensus 1 m~~g~~~~~~~~~~PGvYie~~~~~~~~i~~~~~~~~~~~g~a~~Gp~~~~v~i~s~~~~~~~~fG~~~~~~~~~~v~~~ 80 (473) ||=--|=.+++..+|||||||++++.++|+|++|++++|+|.+++||+|+|++|+| |.++++.||.. ++.+++++| T Consensus 1 ma~~~yf~~~~~~~PGVyvee~~sg~~~i~gv~tsva~fvG~a~~Gp~~~p~~v~s-~~~~~~~fggg---~l~~av~~~ 76 (648) T protein:vir:10 1 MAISVYFDGKLIKQLGAYVKTDLSAVKQINGVGTGIVALLGLAEGGETYKPYRLTS-FAEAVSIFKGG---PLLEHIKAA 76 (648) T ss_pred CeeeeeeCCCCccCCceEEEEeccccccccCCCCceEEEEEeeCCCCCceeEEecC-HHHHHHHhcCc---cHHHHHHHH Confidence 88666424788899999999999999999999999999999999999999999999 56688888854 588999999 Q ss_pred HhcCCCEEEEEecCCCcccceeeeecccccccccceEEEEecCccccceeEEEeeccCCc---cceeeeeecC------- Q lcl|NC_019421. 81 LLGNVKELLLYRLVDGNQKKGTLTLKDTTENSAKDVIKLETKYPTARNFNVTIKSNLVDS---DKKDFIFFEN------- 150 (473) Q Consensus 81 f~~g~~~v~v~rv~~g~~~aat~~l~~~~~~~~~~~l~i~A~~~G~~~n~i~v~~~~~~~---~~~~v~v~~~------- 150 (473) |.|||++||++|+.++..+++ ..+.++++|+.+|.|+|.++++....+. ..+++.+... T Consensus 77 F~nGg~~~~~vRv~~~~~a~~-----------~~~~~~~~a~~~g~~gn~i~~~v~~~~~~~~~~~~l~v~~~~~~~~~d 145 (648) T protein:vir:10 77 FIGGAGEVVAVRIGNPTTASV-----------SIPVAQNTSDTSPANLNFVSYEASTRSNQIYVSFDLDENFTSANEADD 145 (648) T ss_pred HhCCCcEEEEEEcCCCcccce-----------ecceeEEeecccCCCCCceEEEEEEcCCCcCceeEEEEEecCCCcccc Confidence 999999999999987644332 2357899999999999988755433221 1222222110 Q ss_pred ---------------CceeeEEEeccc-------ch---------hh----------hhhh----hhc------------ Q lcl|NC_019421. 151 ---------------TKQLFSSSIKGT-------ID---------EI----------VLEI----NSN------------ 173 (473) Q Consensus 151 ---------------~~~~~~~~~~~~-------~~---------~~----------~~~~----~~~------------ 173 (473) +....++..... .. .+ .... ... T Consensus 146 ~~v~~i~~~~~~y~gt~~~~t~~v~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~ 225 (648) T protein:vir:10 146 TIIFTIYQKHPDFSVTRETFTFPRKFTTPTVLVKRGSTLFFVDRSIVNAALAAGPAFQTALINLLKEQLQPTDVVQIFDA 225 (648) T ss_pred eeEEEeccCCCcccccceeccccccccccccccccccceeecCccchhhhhccCccchhhhhhchhhhhhhhhhheeccc Confidence 000001110000 00 00 0000 000 Q ss_pred ---------cc---------------------------------ccceeEeec--------------------------- Q lcl|NC_019421. 174 ---------LD---------------------------------NEYVIATKV--------------------------- 184 (473) Q Consensus 174 ---------~~---------------------------------s~~v~~~~~--------------------------- 184 (473) .. +.++..... T Consensus 226 s~~~~~d~~~~~~~~~a~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~~tp~~~~~~~~~~~~~~~~~~~~~v~~~~ 305 (648) T protein:vir:10 226 SDTNPVDIPLGLFVYEVLYGGLFGFTKSRLVKTSFGTVDDLLSNPLLFNLSATPFFDGSDYQDYTSLSDPANWFAKDAYT 305 (648) T ss_pred ccccccccccccccccccchhhhcCCcchhhhhhhccccccccccceecccccccccccceeeeeccccccceeeeeccc Confidence 00 000000000 Q ss_pred ----ccCCccc--cccceeeeccCcccccch---------hhHHHHHHHHhhcccceEEEEE-c-----------CCCcH Q lcl|NC_019421. 185 ----ADSDTIL--ANVVNQALEGGNDGCTSI---------TNESYLKALEEFERYSFDSFVL-D-----------GVADE 237 (473) Q Consensus 185 ----~~~~~~~--~~~~~~~l~gG~dg~~~~---------t~~d~~~~l~~le~~~~~~l~~-p-----------~~~~~ 237 (473) ......| .....+.|+||+||..+. +..||.++|+.|++.+..+++. + .++.. T Consensus 306 ~~~l~~~~~~p~~~~~~~t~L~GGtdG~~p~s~~~~~~~~~~~d~~d~l~~~~~~~~~~ivp~~~~~~~~~~~~~lt~~q 385 (648) T protein:vir:10 306 INHLVDTTINPHILATRIFSLSGGTNGDDGTGYYQTAVSNYINIWSQGLATLEEEEVNFVIPAYKFTNVTQLNDRLTIFK 385 (648) T ss_pred hhhcccccccCcccccccceecccccCCCcccccccccccchhhHHHHhhhccCCCceEEEeecccccccccccccCCcc Confidence 0000000 011124688999998753 5678999999999987666542 1 46678 Q ss_pred HHHHHHHHHHHHHhhC-----CCeEEEEEcCCCCccHH--HHHHhhhccCCceEEEecC-----------C-ceec-C-- Q lcl|NC_019421. 238 ALQETTKAWVAKNKEL-----GKDILLFLGGKTEDNIK--QINDKSKSFNDENIVNVGS-----------S-AYYE-N-- 295 (473) Q Consensus 238 ~~~~~l~~~v~~~~~~-----~~~~~av~~~~~~~t~~--~~~~~~~~~n~~~i~~~~~-----------~-~~~~-~-- 295 (473) .+|+++.+|++.|+.. +..+.++++++++++.. +..-....+|+++.+.+.. + .... | T Consensus 386 ~i~a~a~shv~~~s~~~~~~~r~~~~~~vg~~~~es~~~se~~~~~~~~~~~~a~~~~~d~~~~~~~~~~~~~~~~~G~~ 465 (648) T protein:vir:10 386 GIASTFLSHVQTMSQVNRRKARVGVFGLPAPSPNESVTASEYLYNRNILNTISAMFGGTDRAQAVVFPFYSNVFNDEGKV 465 (648) T ss_pred chHHHHHHHHHHhhhccccccccCeEEEeCCCCchhHHHHHHHhhhhcccccceeeeecCCceEEeecccceeECCCCcE Confidence 9999999999999743 44567888888887753 3344444455554433221 1 1222 2 Q ss_pred cccchHHHHHHHHHhhhcCccccccceeccCccccc--ccCCHHHHHHHHhCCcEEEEEcCC-----EEEEEeccccccc Q lcl|NC_019421. 296 IKYTPSEVAVYIAALSVSKGITGSICNAKTIFEEVE--PRLSQSEVKECLKSGTLVLDFDDG-----DVIIVDDVNTFKK 368 (473) Q Consensus 296 ~~~~~~~~a~~vAG~~a~~~~~~s~t~~~~~~~~~~--~~~t~~e~~~l~~~G~~~l~~~~~-----~~~i~~gi~T~~~ 368 (473) ..++++++|++|||+++++++..|+|++++.+.++. .+++++|+|+|+++|++|+++..+ .++|++||+|++. T Consensus 466 ~~~p~~~~Aa~VAGl~a~l~~~~s~T~k~i~~~~id~~~~~t~~qld~L~~~Gv~~ie~~~~~~~~~~~rvv~gITT~~~ 545 (648) T protein:vir:10 466 ELLGGEFFASYVAGMHANREPQDSITFLPISGIGAEPLYNWTYTQKDDLISNRVLFVEKVKTSFGGIVYRIHHNPTTWLG 545 (648) T ss_pred EecchhhHHHHHHhhhhccccccCcccceeeccccccccCCCHHHHHHHhcCCcEEEEEecCCcceeeEEEeccceeecC Confidence 126889999999999999999999999999887665 489999999999999999998644 4789999999875 Q ss_pred CCCCCcchhhhhhhhHHHHHHHHHHHH-HHhhcCCcccCCHHHHHHHHHHHHHHHHHHHhcCCccCccceeccccccCCC Q lcl|NC_019421. 369 YVDDKNEAMGYISNIMFINTINKDTSL-KRKEFVGKIFNDATGQTTVICALKKYFEELMSQGIISEFNVDIDTELQATAK 447 (473) Q Consensus 369 ~~~~~~~~~~~i~v~R~~d~i~~~i~~-~~~~~ig~~~N~~~~r~~i~~~i~~~l~~l~~~g~i~~~~~~~D~~~~~~~~ 447 (473) ..++.|++|+++|+.||+...||. ..+.|||+ +|+...|+++|+.|.+||.++++++.|++|.. .++....+ T Consensus 546 ---~~~~~~~eisv~ri~D~l~~~vr~~l~~~fIG~-~n~~~~~~~ik~~i~~~L~~~~~~~~I~~y~~---~~v~~~~~ 618 (648) T protein:vir:10 546 ---PVTQGFQEFVLRRIDDFLQSYVYKNLQEQFIGR-KSYGRKTENDIKVYTEALLSNLVGKQIVAYKD---VKVTSNED 618 (648) T ss_pred ---CCCcceeeeeeeehhhHHHHHHHHHHhhhcCcc-cccHHHHHHHHHHHHHHHhhHhhcCcccCccc---ceEEEEec Confidence 357889999999999999999986 55689998 68888999999999999999999999999863 23333445 Q ss_pred CCEEEEEEEEEEeeeeeeEEEEEEeC Q lcl|NC_019421. 448 ADEFYWKWDAVKVDVMKKIYGTGYLG 473 (473) Q Consensus 448 ~d~~~v~i~v~p~~~~e~i~~t~~v~ 473 (473) .+.++|++.++|+++|+|||+++.|- T Consensus 619 ~~vv~V~~~v~Pv~~i~~I~vti~it 644 (648) T protein:vir:10 619 KTVYYVEFFYQPVTEIKFILVTMKVT 644 (648) T ss_pred CCEEEEEEEEEecceeeEEEEEEEEE Confidence 68999999999999999999999999 No 12 >protein:vir:101187 Length: 663 # NCBI annotation: tail sheath protein # Family: family:all:661 # MgeID: mge:1582 # MgeName: 44RR2.8t # Cross-refs: genbank:acc:NP_932509;genbank:gi:37651635;genbank:GeneID:2610680 Probab=100.00 E-value=2.3e-70 Score=402.41 Aligned_cols=450 Identities=15% Similarity=0.119 Sum_probs=302.3 Q ss_pred CceecCceeEEEecCCcceecccCceEEEEEEeeCCCCCCceEEeeccHHHHHHHcCC-CcCcHHHHHHHHHHhcCCCEE Q lcl|NC_019421. 10 ERKEIPGFYNRFKTQAEKSTNTGLKGRLAMPIRANWGDVGKVVTIKNDLRQLKNLFGD-DMNYSAFKLGKLALLGNVKEL 88 (473) Q Consensus 10 ~~~~~PGvYie~~~~~~~~i~~~~~~~~~~~g~a~~Gp~~~~v~i~s~~~~~~~~fG~-~~~~~~~~~v~~~f~~g~~~v 88 (473) -+-..|||||||+ ++.++|.+++|+++||+|.++|||+|+|++|+|+ .+|++.||. .....+.++++.||+|||++| T Consensus 1 ~~~~~PgVyv~e~-~~~~~i~~v~ts~~~fvG~~~~Gp~~~p~~i~s~-~d~~~~fG~~~~~~~~~~~v~~~f~ngg~~~ 78 (663) T protein:vir:10 1 MALLSPGIEMKET-SINSTVVRSATGRAAIVGKFAWGPAYEVRQVTNE-VELVDMFGSPDNVTAPYFMSAMNFLQYGNDL 78 (663) T ss_pred CceecCceEEEEe-cCcccccccCccceeEEeeeccCCCCccEEecCH-HHHHHHhCCcCccchhHHHHHHHHHhCCCeE Confidence 4567799999999 5999999999999999999999999999999995 557777775 444556689999999999999 Q ss_pred EEEecCCCcccceeeeeccccc-------cc------------------------------------------------- Q lcl|NC_019421. 89 LLYRLVDGNQKKGTLTLKDTTE-------NS------------------------------------------------- 112 (473) Q Consensus 89 ~v~rv~~g~~~aat~~l~~~~~-------~~------------------------------------------------- 112 (473) ||+|+.++...++...+.+... .. T Consensus 79 ~vvRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~g~~~~v~~~~~~~~~~~~~~~~~~n~~~~~v~~~~a~~~~~~~~v~ 158 (663) T protein:vir:10 79 RLVRVIDMEKAKNASPLVNQVSVTITTEGQGYTVGDAITVKYNNATITEAGKVTAVDSDGKIKSLFVPTAEIIAKTRQLG 158 (663) T ss_pred EEEEccCCcccccccccCCcceeeeeccccCccccccccccccccccccccceeeeccCCceEEEEeccccccccccccc Confidence 9999976543322111110000 00 Q ss_pred ---------------------------------------------------------ccceEEEEecCccccceeEEEee Q lcl|NC_019421. 113 ---------------------------------------------------------AKDVIKLETKYPTARNFNVTIKS 135 (473) Q Consensus 113 ---------------------------------------------------------~~~~l~i~A~~~G~~~n~i~v~~ 135 (473) ....+.+.+.++|.|+|.+++.. T Consensus 159 ~~~~~~~~~~~~~~~~~~~~~~~~~v~~vv~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~a~~~G~~Gn~i~v~i 238 (663) T protein:vir:10 159 TYPTLGDNWRIDVSGASGGSAAALALGNIVVDSGVTFGNSEDAPAVMTSPAVMEKYAKFGMPLVSAVYPGEIGSTVEVEI 238 (663) T ss_pred eeeeccccceeEeeeccccccccccccceecccceeeEeeccccccccccchhhhcccccceeeeeecccccccceeEEe Confidence 00001233444445554444431 Q ss_pred ccCCc-------------------------------cceeeeeecCCceeeEEEecccchhh--------hhhhhhcccc Q lcl|NC_019421. 136 NLVDS-------------------------------DKKDFIFFENTKQLFSSSIKGTIDEI--------VLEINSNLDN 176 (473) Q Consensus 136 ~~~~~-------------------------------~~~~v~v~~~~~~~~~~~~~~~~~~~--------~~~~~~~~~s 176 (473) ..... ..+.+.+..++...+........... .........+ T Consensus 239 ~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~~~~~~~~~~~~~~~~~~ 318 (663) T protein:vir:10 239 VSKTAFNSGAQQTIYPFGGTRTSNARGVIQYGPMTDDQFAIIVRRDGIVVESTVLSTRKGDRDVYGSNIFMDDYFRNGGS 318 (663) T ss_pred cccccccccccccccccccccccccceeeeeccccccceeEEEecCCcceeeeeeeecccccccchhhhhhhhhhccCcc Confidence 11000 00011111111100000000000000 0000011122 Q ss_pred cceeEeecccCCccccccceeeeccCcccccchhhHHHHHHHHhhcc---cceEEEEEcCCC--c----HHHHHHHHHHH Q lcl|NC_019421. 177 EYVIATKVADSDTILANVVNQALEGGNDGCTSITNESYLKALEEFER---YSFDSFVLDGVA--D----EALQETTKAWV 247 (473) Q Consensus 177 ~~v~~~~~~~~~~~~~~~~~~~l~gG~dg~~~~t~~d~~~~l~~le~---~~~~~l~~p~~~--~----~~~~~~l~~~v 247 (473) .++.+... ...........+.+|.|+..+++..++..+++.|+. .+++++++|... + ..+++.+.+|| T Consensus 319 ~~~~~~~~---~~~~~~~~~~~l~gg~d~~~~~~~~~~~~~~~~l~~~~~~~~~~~i~~~~~~~~~~~~~~v~~al~~~a 395 (663) T protein:vir:10 319 NFIFASSE---GWPAGFTGIIQLGGGTSANADVGADELIKGWDLFSDREALHVNLMIAGACGSDGAEIASTVQKYVVSLA 395 (663) T ss_pred eEEEEeec---ccCccccceeEcccccCCCccccchhhHHHHHhhhcccccceeEEEeccCCCCchhhHHHHHHHHHHHH Confidence 23222211 111112224578999999888888899988877754 456677665421 1 44666777777 Q ss_pred HHHhhCCCeEEEEEcCCCCc--------cHHHHHHhh-------------hccCCceEEEecCCceecCc---ccchHHH Q lcl|NC_019421. 248 AKNKELGKDILLFLGGKTED--------NIKQINDKS-------------KSFNDENIVNVGSSAYYENI---KYTPSEV 303 (473) Q Consensus 248 ~~~~~~~~~~~av~~~~~~~--------t~~~~~~~~-------------~~~n~~~i~~~~~~~~~~~~---~~~~~~~ 303 (473) +++ +.++++++.+.+. +.+.+.++. ..+++.+.++++||....+. ....... T Consensus 396 ~~~----~~~~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~l~~P~~~~~d~~~~~~~~~p~ 471 (663) T protein:vir:10 396 DDR----QDCVAIVNPPAELMVGIPTSTAVKNIVEWRNGMTGSGEVVDNNMNISSTYAFIIGNYKYQYDKYNDINRWVPL 471 (663) T ss_pred Hhh----CCEEEEEecCcccccccccccchHHHHHHHHhccccccchhhccccCcceEEEEccceEEecccCCceEEech Confidence 655 3578888877532 334444433 34789999999999865332 2222234 Q ss_pred HHHHHHhhhcCccccccceeccC--------cccccccCCHHHHHHHHhCCcEEEEEc-CCEEEEEecccccccCCCCCc Q lcl|NC_019421. 304 AVYIAALSVSKGITGSICNAKTI--------FEEVEPRLSQSEVKECLKSGTLVLDFD-DGDVIIVDDVNTFKKYVDDKN 374 (473) Q Consensus 304 a~~vAG~~a~~~~~~s~t~~~~~--------~~~~~~~~t~~e~~~l~~~G~~~l~~~-~~~~~i~~gi~T~~~~~~~~~ 374 (473) ++++||++|+.|..+++|+.|.+ ..++...+++.|++.|+++|++|+++. +.+..++||.+|+.+ .+ T Consensus 472 s~~vAGl~Ar~D~~~g~~~sPan~~~~~i~g~~~~~~~~~~~~~~~Ln~~gin~i~~~~~~~G~~~wG~rT~s~----~~ 547 (663) T protein:vir:10 472 AADIAGLCAYTDQVSHPWMSPAGYRRGQIRNCIKLAIEPKQSMRDTMYQVAINPVTGFAGGDGFVLFGDKMATQ----VP 547 (663) T ss_pred hHHHHHHHHHhhccCCceEccCCceeccccccccceeccChhHHHHHhhCCceEEEEEeCCCcEEEEcccccCC----CC Confidence 79999999999988877666543 234567899999999999999999875 446777799999742 23 Q ss_pred chhhhhhhhHHHHHHHHHHHHHHhhcCCcccCCHHHHHHHHHHHHHHHHHHHhcCCccCccceeccccccCCCCC--EEE Q lcl|NC_019421. 375 EAMGYISNIMFINTINKDTSLKRKEFVGKIFNDATGQTTVICALKKYFEELMSQGIISEFNVDIDTELQATAKAD--EFY 452 (473) Q Consensus 375 ~~~~~i~v~R~~d~i~~~i~~~~~~~ig~~~N~~~~r~~i~~~i~~~l~~l~~~g~i~~~~~~~D~~~~~~~~~d--~~~ 452 (473) .+|++|++||+++||+++|+...+++++ +||++.+|..|+..|+.||++|+++|+|.+|++.||++.|++++.+ .++ T Consensus 548 s~~~~i~vrR~~~~i~~si~~~~~~~v~-e~n~~~l~~~i~~~i~~~L~~l~~~gal~g~~v~~d~~~nt~~~i~~G~~~ 626 (663) T protein:vir:10 548 SPFDRINVRRLFNMLKKNIGDTSKYELF-ENNDAFTRQSFRMETSQYLDGIRSLGGCYDFRVVCDTTNNTPNVIDRNEFV 626 (663) T ss_pred cccceEehhhHHHHHHHHHHHHHHHhcc-CCCCHHHHHHHHHHHHHHHHHHHhcCceeeeEEEEcCCCCCHHHhhCCeEE Confidence 5899999999999999999877766666 5899999999999999999999999999999999999988887544 788 Q ss_pred EEEEEEEeeeeeeEEEEEEeC Q lcl|NC_019421. 453 WKWDAVKVDVMKKIYGTGYLG 473 (473) Q Consensus 453 v~i~v~p~~~~e~i~~t~~v~ 473 (473) ++|.++|+.|+|||.++|..- T Consensus 627 ~~i~~~p~~pae~i~~~~~~~ 647 (663) T protein:vir:10 627 GTIYVKPPRSINYITLNMVAT 647 (663) T ss_pred EEEEEEecCCcceEEEEEEEe Confidence 899999999999999998865 No 13 >protein:vir:101804 Length: 663 # NCBI annotation: gp18 # Family: family:all:661 # MgeID: mge:1580 # MgeName: 31 # Cross-refs: genbank:acc:YP_238881;genbank:gi:66391956;genbank:GeneID:3416631 Probab=100.00 E-value=2.4e-70 Score=402.25 Aligned_cols=450 Identities=14% Similarity=0.117 Sum_probs=299.5 Q ss_pred CceecCceeEEEecCCcceecccCceEEEEEEeeCCCCCCceEEeeccHHHHHHHcCCC-cCcHHHHHHHHHHhcCCCEE Q lcl|NC_019421. 10 ERKEIPGFYNRFKTQAEKSTNTGLKGRLAMPIRANWGDVGKVVTIKNDLRQLKNLFGDD-MNYSAFKLGKLALLGNVKEL 88 (473) Q Consensus 10 ~~~~~PGvYie~~~~~~~~i~~~~~~~~~~~g~a~~Gp~~~~v~i~s~~~~~~~~fG~~-~~~~~~~~v~~~f~~g~~~v 88 (473) -+-..|||||||+ ++.++|++++|+++||+|.++|||+|+|++|+|+ .+|++.||.. ....+.++++.||+|||++| T Consensus 1 ~~~~~Pgvyv~e~-~~~~~i~~~~t~~~~~vG~~~~Gp~~~p~~v~~~-~~~~~~fg~~~~~~~~~~~~~~~f~ngg~~~ 78 (663) T protein:vir:10 1 MALLSPGIEMKET-SINSTVVRSATGRAAIVGKFAWGPAYEVRQVTNE-VELVDMFGSPDNVTAPYFMSAMNFLQYGNDL 78 (663) T ss_pred CceecCceEEEEe-cCCccccccCcccceeEeecccCCCCccEEecCH-HHHHHhcCCcCCcchhHHHHHHHHHhCCCeE Confidence 4557799999999 5899999999999999999999999999999995 5577777754 34456689999999999999 Q ss_pred EEEecCCCcccceeeeeccccc-------c--cc---------------------------------------------- Q lcl|NC_019421. 89 LLYRLVDGNQKKGTLTLKDTTE-------N--SA---------------------------------------------- 113 (473) Q Consensus 89 ~v~rv~~g~~~aat~~l~~~~~-------~--~~---------------------------------------------- 113 (473) ||+|+.++...++...+.+... . .. T Consensus 79 ~vvRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~g~~i~v~~~~~~~~~~~~~~~~~~~~~~~~v~~~ta~~~~~~~~v~ 158 (663) T protein:vir:10 79 RLVRVIDMEKAKNASPLVNQVSVTITTEGQGYTVGDAITVKYNNATITEAGKVTAVDSDGKIKSLFVPTAEIIAKTRQLG 158 (663) T ss_pred EEEEccCCccccccccccccceeEEeecccccccccccccccccccccccccceeeecccceEEEeeccccccccccccc Confidence 9999975443322111110000 0 00 Q ss_pred ----------------------------------------------------------cceEEEEecCccccceeEEEee Q lcl|NC_019421. 114 ----------------------------------------------------------KDVIKLETKYPTARNFNVTIKS 135 (473) Q Consensus 114 ----------------------------------------------------------~~~l~i~A~~~G~~~n~i~v~~ 135 (473) .....+.+.++|.|+|.++|.. T Consensus 159 ~~~~~~~~~~~~~s~~s~~~~~a~~v~~v~~d~~~~v~~~~~a~~~~t~~~~~~~~~~~~~~~i~A~~~G~~Gn~i~V~i 238 (663) T protein:vir:10 159 TYPTLGDNWRIDVSGASGGSAAALALGNIVVDSGVTFGNSEDAPAVMTSPAVMEKYAKFGMPLISAVYPGEIGSTVEVEI 238 (663) T ss_pred cceeeccceeeEeeeccCccccccccceeccccceEEeeccccccccccccccccccccccceEEeccCCcccceeeeee Confidence 0001223334444444444332 Q ss_pred ccCCc-------------------------------cceeeeeecCCceeeEEEecccchhh--------hhhhhhcccc Q lcl|NC_019421. 136 NLVDS-------------------------------DKKDFIFFENTKQLFSSSIKGTIDEI--------VLEINSNLDN 176 (473) Q Consensus 136 ~~~~~-------------------------------~~~~v~v~~~~~~~~~~~~~~~~~~~--------~~~~~~~~~s 176 (473) ..... ..+.+.+..++...+........+.. ......+..+ T Consensus 239 ~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~~~~~~~~~~~~~~~~~~ 318 (663) T protein:vir:10 239 VSKTAFNSGAQQTIYPFGGTRTSNARGVIQYGPMTDDQFAIIVRRDGIVVESTVLSTRKGDRDVYGSNIFMDDYFRNGGS 318 (663) T ss_pred ccccccccccccceecccccccccccceeecccccccceeeEeecCCcceeeecccccccccccccchhhhhhhhcCCcc Confidence 11000 00000000010000000000000000 0000001112 Q ss_pred cceeEeecccCCccccccceeeeccCcccccchhhHHHHHHHHhhccc---ceEEEEEcCCCc------HHHHHHHHHHH Q lcl|NC_019421. 177 EYVIATKVADSDTILANVVNQALEGGNDGCTSITNESYLKALEEFERY---SFDSFVLDGVAD------EALQETTKAWV 247 (473) Q Consensus 177 ~~v~~~~~~~~~~~~~~~~~~~l~gG~dg~~~~t~~d~~~~l~~le~~---~~~~l~~p~~~~------~~~~~~l~~~v 247 (473) .++.... ............+.+|.|+..+.+..++..+++.|+.. .++++++|.... ..+++.+.+|| T Consensus 319 ~~~~~~~---~~~~~~~~~~~~l~gg~d~~~~~~~~~~~~~~~~l~~~~~~~~~~~i~~~~~~~~~~~~~~v~~~l~~~a 395 (663) T protein:vir:10 319 NFIFASS---EGWPAGFTGIIQLGGGTSANADVGADELIKGWDLFSDREALHVNLMIAGACGSDGAEIASTVQKYVVSLA 395 (663) T ss_pred eEEEEee---cccCccccceeEeccccCCccccchhhhHHHHHhhhcccccceeEEEeccCCCCchhhHHHHHHHHHHHH Confidence 2222111 11111222335788999998888888999888877654 566776654221 34666677777 Q ss_pred HHHhhCCCeEEEEEcCCCCc--------cHHHHHHhh-------------hccCCceEEEecCCceecCc---ccchHHH Q lcl|NC_019421. 248 AKNKELGKDILLFLGGKTED--------NIKQINDKS-------------KSFNDENIVNVGSSAYYENI---KYTPSEV 303 (473) Q Consensus 248 ~~~~~~~~~~~av~~~~~~~--------t~~~~~~~~-------------~~~n~~~i~~~~~~~~~~~~---~~~~~~~ 303 (473) +++ +.++++++++.+. +.+.+..+. ..+++++.++++||.+..+. ...-... T Consensus 396 ~~~----~~~~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~ 471 (663) T protein:vir:10 396 DDR----QDCVAIVNPPAELMVGIPTSTAVKNIVEWRNGMTGSGEVVDNNMNISSTYAFIIGNYKYQYDKYNDINRWVPL 471 (663) T ss_pred Hhh----CCEEEEEecCcccccccccccchHHHHHHHHhccccccchhhhcccCccceEEEcCceEEecccCCceEEech Confidence 655 3578888877432 333343333 34689999999999865432 2222334 Q ss_pred HHHHHHhhhcCccccccceeccC--------cccccccCCHHHHHHHHhCCcEEEEEc-CCEEEEEecccccccCCCCCc Q lcl|NC_019421. 304 AVYIAALSVSKGITGSICNAKTI--------FEEVEPRLSQSEVKECLKSGTLVLDFD-DGDVIIVDDVNTFKKYVDDKN 374 (473) Q Consensus 304 a~~vAG~~a~~~~~~s~t~~~~~--------~~~~~~~~t~~e~~~l~~~G~~~l~~~-~~~~~i~~gi~T~~~~~~~~~ 374 (473) ++++||++|++|..+++|+.|.+ .+++...+++.|++.|+++|++|+++. +++..+.||-+|+.+ .+ T Consensus 472 sg~vAGl~Ar~D~~~g~~~sPan~~~~~i~g~~~~~~~~~~~e~~~Ln~~gin~i~~~~~~~G~~~wG~rT~~~----~~ 547 (663) T protein:vir:10 472 AADIAGLCAYTDQVSHPWMSPAGYRRGQIRNCIKLAIEPKQSMRDTMYQVAINPVTGFAGGDGFVLFGDKMATQ----VP 547 (663) T ss_pred hHHHHHHHHHhhccCCceEccCCceeccccccccceeecChhHHHHHhhCCceEEEEEeCCCcEEEEcccccCC----CC Confidence 79999999999988876665543 234566789999999999999999875 447888899999743 23 Q ss_pred chhhhhhhhHHHHHHHHHHHHHHhhcCCcccCCHHHHHHHHHHHHHHHHHHHhcCCccCccceeccccccCCCCC--EEE Q lcl|NC_019421. 375 EAMGYISNIMFINTINKDTSLKRKEFVGKIFNDATGQTTVICALKKYFEELMSQGIISEFNVDIDTELQATAKAD--EFY 452 (473) Q Consensus 375 ~~~~~i~v~R~~d~i~~~i~~~~~~~ig~~~N~~~~r~~i~~~i~~~l~~l~~~g~i~~~~~~~D~~~~~~~~~d--~~~ 452 (473) .+|++|++||+++||+++|+...+++++ +||++.+|..|+..|+.||++|+++|+|.+|++.||++.|++++.+ .++ T Consensus 548 s~~~~i~vrR~~~~i~~si~~~~~~~v~-epn~~~l~~~i~~~i~~~L~~l~~~gal~g~~v~~d~~~nt~~~i~~G~~~ 626 (663) T protein:vir:10 548 SPFDRINVRRLFNMLKKNIGDTSKYELF-ENNDAFTRQSFRMETSQYLDGIRSLGGCYDFRVVCDTTNNTPNVIDRNEFV 626 (663) T ss_pred cccceEehhhHHHHHHHHHHHHHHHhcc-CCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEcCCCCCHHHhhCCeEE Confidence 5899999999999999999887766666 5899999999999999999999999999999999999988887544 788 Q ss_pred EEEEEEEeeeeeeEEEEEEeC Q lcl|NC_019421. 453 WKWDAVKVDVMKKIYGTGYLG 473 (473) Q Consensus 453 v~i~v~p~~~~e~i~~t~~v~ 473 (473) +.+.++|+.|+|||.++|..- T Consensus 627 ~~i~~~p~~pae~i~~~~~~~ 647 (663) T protein:vir:10 627 GTIYVKPPRSINYITLNMVAT 647 (663) T ss_pred EEEEEEecCCcceEEEEEEEe Confidence 889999999999999998865 No 14 >protein:vir:106427 Length: 679 # NCBI annotation: gp18 tail sheath protein # Family: family:all:661 # MgeID: mge:1474 # MgeName: Aeh1 # Cross-refs: genbank:acc:NP_944106;genbank:gi:38640150;genbank:GeneID:2658188 Probab=100.00 E-value=1.3e-69 Score=398.30 Aligned_cols=450 Identities=17% Similarity=0.178 Sum_probs=294.8 Q ss_pred CceecCceeEEEecCCcceecccCceEEEEEEeeCCCCCCceEEeeccHHHHHHHcCC-CcCcHHHHHHHHHHhcCCCEE Q lcl|NC_019421. 10 ERKEIPGFYNRFKTQAEKSTNTGLKGRLAMPIRANWGDVGKVVTIKNDLRQLKNLFGD-DMNYSAFKLGKLALLGNVKEL 88 (473) Q Consensus 10 ~~~~~PGvYie~~~~~~~~i~~~~~~~~~~~g~a~~Gp~~~~v~i~s~~~~~~~~fG~-~~~~~~~~~v~~~f~~g~~~v 88 (473) -+-..|||||||+ ++.++|++++|+++||+|.++|||+|+|++|+|+ .+|.+.||. .....++++++.||+|||++| T Consensus 1 ~~~~~Pgvyv~e~-~~~~~i~~~~t~~~~~vg~~~~gp~~~p~~i~s~-~~~~~~fg~~~~~~~~~~~~~~~f~~gg~~~ 78 (679) T protein:vir:10 1 MTLLSPGVETKEI-NLQTTIARSSTGRAALVGKFNWGPAYQISQVVSE-VDLVDKFGRPDDQTADSFFSGVNFLNYGNDL 78 (679) T ss_pred CceecCceEEEee-cCCcccccCccccceeeecccCCCCccCEEecCH-HHHHHHcCCcccccchHHHHHHHHHhCCCeE Confidence 5667799999999 4889999999999999999999999999999995 557777775 444456789999999999999 Q ss_pred EEEecCCCcccceeeeeccccc----ccc-----cc--------------------------eEEE-------------- Q lcl|NC_019421. 89 LLYRLVDGNQKKGTLTLKDTTE----NSA-----KD--------------------------VIKL-------------- 119 (473) Q Consensus 89 ~v~rv~~g~~~aat~~l~~~~~----~~~-----~~--------------------------~l~i-------------- 119 (473) ||+|+.++...++...+++... ..+ .. .+.+ T Consensus 79 ~vvrv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~s~~~~~~~~~~~~~~~~~~~~v~~~~~~~~a~~~~~ 158 (679) T protein:vir:10 79 RLVRVLNETKSRNSSALYQSLSYTITSPGVDYKVGDVVNVLQGGNVIATGKVTVVNASGGIVAFYVPTAAIIDKAKSLND 158 (679) T ss_pred EEEEccCcccccccccccccccccccccccccccccceeeeeCCCcccceeEEEeeccCceeeeeecccccccccccccc Confidence 9999976664433222211000 000 00 0000 Q ss_pred ----------------------------------------------------------------------EecCccccce Q lcl|NC_019421. 120 ----------------------------------------------------------------------ETKYPTARNF 129 (473) Q Consensus 120 ----------------------------------------------------------------------~A~~~G~~~n 129 (473) .+..+|.+++ T Consensus 159 ~~~l~~a~~~~~~~~t~~~g~~~~~~v~~v~~~~~~~~~~~~~a~~~i~~~~~~~~t~~~~~~~~~~~~~~A~~~g~~gn 238 (679) T protein:vir:10 159 YPALDNAWQIQFAAGGPGAGQAATATVVGINLDSTIFVPNDEYAMSAISERSETKRTFIDICEEMKVPAIVARYAGTYGD 238 (679) T ss_pred cceecccceeeeeeccccccceeeeeeeeeccCCceeeccccccccccccccccchhhhhhhhccccceeeeecccccCC Confidence 0000011111 Q ss_pred eEEEeeccCCc------c-------------------------------------ceeeeeecCCceeeEEEecccchhh Q lcl|NC_019421. 130 NVTIKSNLVDS------D-------------------------------------KKDFIFFENTKQLFSSSIKGTIDEI 166 (473) Q Consensus 130 ~i~v~~~~~~~------~-------------------------------------~~~v~v~~~~~~~~~~~~~~~~~~~ 166 (473) .+.+....... . .+.+.+..++...+.+......... T Consensus 239 ~i~v~~va~~~~~~~~~~~a~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vvv~~~g~~~~~~~~~~~~~~~ 318 (679) T protein:vir:10 239 NIKVLMIAYKDYYKFNEAGKIVSVNTINPKVFPTGLDYGNVTPSSYLEFGPQNESQFAFIVFNNGVAVESKILSTKPGDR 318 (679) T ss_pred cceEEEEeecccccccccccccccccccccccccccccccceeeeecccccccccceeeEEecccccccceeeecccccc Confidence 10000000000 0 0000000000000000000000000 Q ss_pred h-----h---hhhhcccccceeEeecccCCccccccceeeeccCcccccchhhHHHHHHHHhh---cccceEEEEEcCCC Q lcl|NC_019421. 167 V-----L---EINSNLDNEYVIATKVADSDTILANVVNQALEGGNDGCTSITNESYLKALEEF---ERYSFDSFVLDGVA 235 (473) Q Consensus 167 ~-----~---~~~~~~~s~~v~~~~~~~~~~~~~~~~~~~l~gG~dg~~~~t~~d~~~~l~~l---e~~~~~~l~~p~~~ 235 (473) . . ....+..+.++... ....+........+.||.++....+..++.+++..+ +...++++++|+.. T Consensus 319 ~~~~~~~~~~~~~~~~~~~~v~~~---~~~~~~~~~~~~~~~gg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~p~~~ 395 (679) T protein:vir:10 319 DIYGTSIYINEYFGNGYSSFVQGV---AESWPVGYTGVLAFGGGQSSNTDISAAEFMKGWDMFADREHTDVNLFIAGAVA 395 (679) T ss_pred cccchhhhhhhhhcCcccceeeec---cccccccccceeeccCCccCCCccchhhhhhhhhhhhcccccccceEEecCCC Confidence 0 0 00000000111100 000111112245677888887777777777766544 44578999999864 Q ss_pred c------HHHHHHHHHHHHHHhhCCCeEEEEEcCCCC--------ccHHHHHHhhh-------------ccCCceEEEec Q lcl|NC_019421. 236 D------EALQETTKAWVAKNKELGKDILLFLGGKTE--------DNIKQINDKSK-------------SFNDENIVNVG 288 (473) Q Consensus 236 ~------~~~~~~l~~~v~~~~~~~~~~~av~~~~~~--------~t~~~~~~~~~-------------~~n~~~i~~~~ 288 (473) . ..+++++.+||++++ .|++++.++.. .+.+++..+.. .+++.+.++++ T Consensus 396 ~~~~~~~~~v~~~l~~~~~~~~----~~~ai~d~p~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~s~~~~~~~ 471 (679) T protein:vir:10 396 GEGAQIASTVQKAVVAIADERR----DCLVLISPPREYMINQPAASVVRKLVDWRRGVNQAGISLDDNMNIGTTYASVDG 471 (679) T ss_pred CCchhhhHHHHHHHHHHHHhhC----CeEEEEeccccccccccccchHHHHHHHHhhcccccchhhhhhccCcceEEEEc Confidence 3 356777888887764 47788876533 33455554443 46788999999 Q ss_pred CCceecCc---ccchHHHHHHHHHhhhcCccccccceeccC--------cccccccCCHHHHHHHHhCCcEEEEEcCCEE Q lcl|NC_019421. 289 SSAYYENI---KYTPSEVAVYIAALSVSKGITGSICNAKTI--------FEEVEPRLSQSEVKECLKSGTLVLDFDDGDV 357 (473) Q Consensus 289 ~~~~~~~~---~~~~~~~a~~vAG~~a~~~~~~s~t~~~~~--------~~~~~~~~t~~e~~~l~~~G~~~l~~~~~~~ 357 (473) ||....+. .......++++||++|++|..+++|+.|.+ ..++...+++.|++.|+++|++++++..++. T Consensus 472 p~~~~~d~~~~~~~~~p~sg~vAGl~Ar~D~~~g~~~sPan~~~~~i~g~~~~~~~~~~~~~~~Ln~~gin~i~~~~g~G 551 (679) T protein:vir:10 472 NYKYQYDKYNDVNRWIPLAADIAGLCARTDTVGQPWQSPAGFNRGQIVNVIKLAVDTRQAHRDEMYTNGINPIVGFAGQG 551 (679) T ss_pred cceeeecccCCceEEechHHHHHHHHHHhhccCCcEECcCCeeeccccccccceeecChhhHHhhhhCCceEEEEecCCe Confidence 98876432 222233479999999999987776666543 2345667899999999999999999987777 Q ss_pred EEEecccccccCCCCCcchhhhhhhhHHHHHHHHHHHHHHhhcCCcccCCHHHHHHHHHHHHHHHHHHHhcCCccCccce Q lcl|NC_019421. 358 IIVDDVNTFKKYVDDKNEAMGYISNIMFINTINKDTSLKRKEFVGKIFNDATGQTTVICALKKYFEELMSQGIISEFNVD 437 (473) Q Consensus 358 ~i~~gi~T~~~~~~~~~~~~~~i~v~R~~d~i~~~i~~~~~~~ig~~~N~~~~r~~i~~~i~~~l~~l~~~g~i~~~~~~ 437 (473) .++||-+|+.+ .+.+|++|++||++++|+++|+....++++ +|||+.+|.+|+..|+.||++||++|+|.+|++. T Consensus 552 ~~~wG~rT~~~----~~s~~~~i~vrR~~~~i~~si~~~~~~~v~-epn~~~~~~~i~~~i~~fL~~l~~~gal~gf~v~ 626 (679) T protein:vir:10 552 YILYGDKTASQ----APTPFDRINVRRLFNLLKKSISESAKYKLF-ELNDAFTRSSFRSEVGSYLDTIRSLGGIYDFRVV 626 (679) T ss_pred EEEEcccccCC----CCcccceEehhhHHHHHHHHHHHHHHHhcc-CCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEE Confidence 78899999743 235899999999999999999987766666 6899999999999999999999999999999999 Q ss_pred eccccccCCCCC--EEEEEEEEEEeeeeeeEEEEEEeC Q lcl|NC_019421. 438 IDTELQATAKAD--EFYWKWDAVKVDVMKKIYGTGYLG 473 (473) Q Consensus 438 ~D~~~~~~~~~d--~~~v~i~v~p~~~~e~i~~t~~v~ 473 (473) ||.+.+++++.+ .+++++.++|++|||||.++|.-- T Consensus 627 ~d~~~nt~~~i~~G~~~~~i~~~p~~pae~i~~~~~~~ 664 (679) T protein:vir:10 627 CDESNNTPAVIDRNEFVATILIKPARSINYITLSFVAT 664 (679) T ss_pred EcCCCCCHHHhhCCeEEEEEEEEecCCccEEEEEEEEe Confidence 999988877554 688889999999999999998765 No 15 >protein:vir:103456 Length: 659 # NCBI annotation: tail sheath monomer # Family: family:all:661 # MgeID: mge:1542 # MgeName: RB32 # Cross-refs: genbank:acc:YP_803108;genbank:gi:116326388;genbank:GeneID:4405485 Probab=100.00 E-value=6.6e-69 Score=394.39 Aligned_cols=450 Identities=15% Similarity=0.126 Sum_probs=300.2 Q ss_pred CceecCceeEEEecCCcceecccCceEEEEEEeeCCCCCCceEEeeccHHHHHHHcCC-CcCcHHHHHHHHHHhcCCCEE Q lcl|NC_019421. 10 ERKEIPGFYNRFKTQAEKSTNTGLKGRLAMPIRANWGDVGKVVTIKNDLRQLKNLFGD-DMNYSAFKLGKLALLGNVKEL 88 (473) Q Consensus 10 ~~~~~PGvYie~~~~~~~~i~~~~~~~~~~~g~a~~Gp~~~~v~i~s~~~~~~~~fG~-~~~~~~~~~v~~~f~~g~~~v 88 (473) =+-..|||||||++++++++.+ +|+++||+|.++|||+|+|++|+|+ .+|++.||. .....+.++++.||+|||++| T Consensus 1 ~~~~~PgVyv~e~~~~~~~~~~-~ts~~~fvG~~~~Gp~~~p~~i~s~-~d~~~~fG~~~~~~~~~~~v~~~f~ngg~~~ 78 (659) T protein:vir:10 1 MTLLSPGIELKETTVQSTVVNN-STGTAALAGKFQWGPAFQIKQVTNE-VDLVNTFGQPTAETADYFMSAMNFLQYGNDL 78 (659) T ss_pred CceecCceEEEEecCCceeccc-CccceEEEecccCCCCCccEEecCH-HHHHHHcCCcCCCcchhHHHHHHHhhCCCeE Confidence 3446799999999999987765 7999999999999999999999995 557777775 444556789999999999999 Q ss_pred EEEecCCCcccceeeeecccc-----------cccccceEEEEecCccccceeEEEeecc-------------------C Q lcl|NC_019421. 89 LLYRLVDGNQKKGTLTLKDTT-----------ENSAKDVIKLETKYPTARNFNVTIKSNL-------------------V 138 (473) Q Consensus 89 ~v~rv~~g~~~aat~~l~~~~-----------~~~~~~~l~i~A~~~G~~~n~i~v~~~~-------------------~ 138 (473) ||+|+.++.+..+...+.+.. ........+..+..+|.+++...+.... . T Consensus 79 ~vvRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~v~~vd~~~~~~~~~i~~~~~~~~~~~~g 158 (659) T protein:vir:10 79 RVVRAVDRDTAKNSSPIAGNIEYTISTPGSNYAVGDKITVKYVSDAIETEGKITEVDTDGKIKKINIPTAKIIAKAKEVG 158 (659) T ss_pred EEEEccCcccccccccccccceeeEeecccccccccceeeeecCCCccccceeeEEecccccceeeeccccccccccccc Confidence 999997665443332222110 0000001111122333333322111000 0 Q ss_pred Cccce------e-----------ee--------------eec-----------------CCcee---------eEEEec- Q lcl|NC_019421. 139 DSDKK------D-----------FI--------------FFE-----------------NTKQL---------FSSSIK- 160 (473) Q Consensus 139 ~~~~~------~-----------v~--------------v~~-----------------~~~~~---------~~~~~~- 160 (473) +...+ + +. +.. +...+ ....+. T Consensus 159 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~a~t~~~~~~~~~~~~~~~v~a~~~G~~g~~~tv~~ 238 (659) T protein:vir:10 159 EYPTLGSNWTAEISSSSSGLAAVITLGKIITDSGILLAEIENAEAAMTAVDFQANLKKYGIPGVVALYPGELGDKIEIEI 238 (659) T ss_pred ccceeeeeeeeeeeeeccccceeeEEeeeecCCceeEEeeccccccccccccccceeecccccccccccceecccceEEE Confidence 00000 0 00 000 00000 000000 Q ss_pred ------------------------cc----------chhhhhhh--------------------------------hhcc Q lcl|NC_019421. 161 ------------------------GT----------IDEIVLEI--------------------------------NSNL 174 (473) Q Consensus 161 ------------------------~~----------~~~~~~~~--------------------------------~~~~ 174 (473) .. .++..... ..+. T Consensus 239 ~~~a~~~~~~~v~v~~~~~~~~~a~~~t~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 318 (659) T protein:vir:10 239 VSKADYAKGASALLPIYPGGGTRASTAKAVFGYGPQTDSQYAIIVRRNDAIVQSVVLSTKRGEKDIYDSNIYIDDFFAKG 318 (659) T ss_pred echhhccccceeeeeeeeecccccccceeeeeeccccccchhhccccccceeeeeeeeccccccccccchhhhhhhhccC Confidence 00 00000000 0000 Q ss_pred cccceeEeecccCCccccccceeeeccCcccccchhhHHHHHHHHhh---cccceEEEEEcCCCc------HHHHHHHHH Q lcl|NC_019421. 175 DNEYVIATKVADSDTILANVVNQALEGGNDGCTSITNESYLKALEEF---ERYSFDSFVLDGVAD------EALQETTKA 245 (473) Q Consensus 175 ~s~~v~~~~~~~~~~~~~~~~~~~l~gG~dg~~~~t~~d~~~~l~~l---e~~~~~~l~~p~~~~------~~~~~~l~~ 245 (473) .+.++.... ............+.+|.++....+..++.+++.+| +..+++++++|+... ..++.++.+ T Consensus 319 ~~~~v~~~~---~~~~~~~~~~~~l~gg~~~~~~~~~~~~~~~~~~~~~~~~~~~~il~~p~~~~~~~~~~~~v~~al~~ 395 (659) T protein:vir:10 319 GSEYIFATA---QNWPEGFSGILTLSGGLSSNAEVTAGDLMEAWDFFADRESVDVQLFIAGSCAGESLETASTVQKHVVS 395 (659) T ss_pred cccEEEEee---cccCCCccceeeecccccccccccchhHHHHHHHhhhccccceeEEEecCCCCcchhhhHHHHHHHHH Confidence 011111110 00011112234678888887777888888777766 455799999998643 456777788 Q ss_pred HHHHHhhCCCeEEEEEcCC--------CCccHHHHHHhhhc----------cCCceEEEecCCceecCc---ccchHHHH Q lcl|NC_019421. 246 WVAKNKELGKDILLFLGGK--------TEDNIKQINDKSKS----------FNDENIVNVGSSAYYENI---KYTPSEVA 304 (473) Q Consensus 246 ~v~~~~~~~~~~~av~~~~--------~~~t~~~~~~~~~~----------~n~~~i~~~~~~~~~~~~---~~~~~~~a 304 (473) ||++++ .+++++..+ ...+.+.+..+... ++|.+.++++||....+. .+.....+ T Consensus 396 ~~~~~~----~~~~~~d~p~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~s~~~~l~~p~~~~~d~~~~~~~~~p~s 471 (659) T protein:vir:10 396 IGDARQ----DCLVLCSPPRETVVGIPVTRAVDNLVNWRTAAGSYTDNNFNISSTYAAIDGNYKYQYDKYNDVNRWVPLA 471 (659) T ss_pred HHHhhC----CeEEEEcCccccccCCCcccCHHHHHHHHHhcccccccccccCcceEEEEeCcEEEecccCCceEEechH Confidence 887664 466776654 34556666666543 789999999999875432 22233458 Q ss_pred HHHHHhhhcCccccccceeccC--------cccccccCCHHHHHHHHhCCcEEEEEcCCEEEEEecccccccCCCCCcch Q lcl|NC_019421. 305 VYIAALSVSKGITGSICNAKTI--------FEEVEPRLSQSEVKECLKSGTLVLDFDDGDVIIVDDVNTFKKYVDDKNEA 376 (473) Q Consensus 305 ~~vAG~~a~~~~~~s~t~~~~~--------~~~~~~~~t~~e~~~l~~~G~~~l~~~~~~~~i~~gi~T~~~~~~~~~~~ 376 (473) +++||++|++|.++++|+.|.+ ..++...+++.|++.|+++|++++++.+++..+.||-+|+.+ .+.+ T Consensus 472 g~~AGl~Ar~D~~~g~~~span~~~~~i~g~~~~~~~~~~~~~~~Ln~~gIn~i~~~~~~G~~~wG~rT~~~----~~s~ 547 (659) T protein:vir:10 472 ADIAGLCARTDNVSQTWMSPAGYNRGQILNVIKLAIETRQAQRDRLYQEAINPVTGTGGDGYVLYGDKTATS----VPSP 547 (659) T ss_pred HHHHHHHHHHhccCCceEccCCceeeeeeccccceecCCHhHHHHHhhCCeeEEEEeCCCeEEEEcccccCC----CCcc Confidence 9999999999998887776643 224455789999999999999999998887888899998742 2358 Q ss_pred hhhhhhhHHHHHHHHHHHHHHhhcCCcccCCHHHHHHHHHHHHHHHHHHHhcCCccCccceeccccccCCCCC--EEEEE Q lcl|NC_019421. 377 MGYISNIMFINTINKDTSLKRKEFVGKIFNDATGQTTVICALKKYFEELMSQGIISEFNVDIDTELQATAKAD--EFYWK 454 (473) Q Consensus 377 ~~~i~v~R~~d~i~~~i~~~~~~~ig~~~N~~~~r~~i~~~i~~~l~~l~~~g~i~~~~~~~D~~~~~~~~~d--~~~v~ 454 (473) |++|++||+++||+++|++...++++ +||++.+|..|+..|+.||++|+++|+|.+|++.||.+.|++++.+ .++++ T Consensus 548 ~~~i~vrR~~~~i~~si~~~~~~~v~-e~n~~~l~~~i~~~i~~fL~~l~~~gal~~~~V~~d~~~nt~~~i~~G~~~~~ 626 (659) T protein:vir:10 548 FDRINVRRLFNMLKTNIGRSSKYRLF-ELNNAFTRSSFRTETAQYLQGIKALGGIYEYRVVCDTTNNTPSVIDRNEFVAT 626 (659) T ss_pred cceEehhhHHHHHHHHHHHHHHHhcc-CCCCHHHHHHHHHHHHHHHHHHHhcCceeeEEEEEcCCCCCHHHhhCCeEEEE Confidence 99999999999999999987766666 6899999999999999999999999999999999999988877544 78889 Q ss_pred EEEEEeeeeeeEEEEEEeC Q lcl|NC_019421. 455 WDAVKVDVMKKIYGTGYLG 473 (473) Q Consensus 455 i~v~p~~~~e~i~~t~~v~ 473 (473) +.++|+.|+|||.++|.-- T Consensus 627 i~~~p~~pae~i~~~~~~~ 645 (659) T protein:vir:10 627 FYIQPARSINYITLNFVAT 645 (659) T ss_pred EEEEecCCcceEEEEEEEE Confidence 9999999999999998766 No 16 >protein:vir:7206 Length: 659 # NCBI annotation: gp18 tail sheath protein # Family: family:all:661 # MgeID: mge:142 # MgeName: T4 # Cross-refs: genbank:acc:NP_049780;genbank:gi:9632592;genbank:GeneID:1258597 Probab=100.00 E-value=8.2e-69 Score=393.86 Aligned_cols=450 Identities=14% Similarity=0.119 Sum_probs=300.0 Q ss_pred CceecCceeEEEecCCcceecccCceEEEEEEeeCCCCCCceEEeeccHHHHHHHcCC-CcCcHHHHHHHHHHhcCCCEE Q lcl|NC_019421. 10 ERKEIPGFYNRFKTQAEKSTNTGLKGRLAMPIRANWGDVGKVVTIKNDLRQLKNLFGD-DMNYSAFKLGKLALLGNVKEL 88 (473) Q Consensus 10 ~~~~~PGvYie~~~~~~~~i~~~~~~~~~~~g~a~~Gp~~~~v~i~s~~~~~~~~fG~-~~~~~~~~~v~~~f~~g~~~v 88 (473) =+-..|||||||++++++++ +++|+++||+|.++|||+|+|++|+|+ .+|++.||. .....+.++++.||+|||++| T Consensus 1 ~~~~~PgVyvee~~~~~~~~-~~~ts~~~fvG~~~~Gp~~~p~~i~s~-~~~~~~fG~~~~~~~~~~~~~~~f~ngg~~~ 78 (659) T protein:vir:72 1 MTLLSPGIELKETTVQSTVV-NNSTGTAALAGKFQWGPAFQIKQVTNE-VDLVNTFGQPTAETADYFMSAMNFLQYGNDL 78 (659) T ss_pred CceecCceEEEEecCCcccc-cCCCcceEEEeecCCCCCcccEEecCH-HHHHHHcCCcCCCCchhHHHHHHHHhCCceE Confidence 34567999999999999766 558999999999999999999999995 557778875 444556789999999999999 Q ss_pred EEEecCCCccc-ceeeeecccc---cc-------cccceEEEEecCccccceeEEEeecc-------------------- Q lcl|NC_019421. 89 LLYRLVDGNQK-KGTLTLKDTT---EN-------SAKDVIKLETKYPTARNFNVTIKSNL-------------------- 137 (473) Q Consensus 89 ~v~rv~~g~~~-aat~~l~~~~---~~-------~~~~~l~i~A~~~G~~~n~i~v~~~~-------------------- 137 (473) ||+|+.++... +++....... .. .........+..+|.|++...+.... T Consensus 79 ~vvRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~g~v~~~~~~~~~~~~~v~t~~~~a~~~~~~ 158 (659) T protein:vir:72 79 RVVRAVDRDTAKNSSPIAGNIDYTISTPGSNYAVGDKITVKYVSDDIETEGKITEVDADGKIKKINIPTGKNYAKAKEVG 158 (659) T ss_pred EEEEccCCcccccccccccccceeecccccccccceeeeeeeccccccccceEEEeeccccceeeeeccccccccccccc Confidence 99999764433 2222211100 00 00111222233445554322211000 Q ss_pred ----------------CC--ccceeee-eecCCcee-------------------------------------eEEEec- Q lcl|NC_019421. 138 ----------------VD--SDKKDFI-FFENTKQL-------------------------------------FSSSIK- 160 (473) Q Consensus 138 ----------------~~--~~~~~v~-v~~~~~~~-------------------------------------~~~~~~- 160 (473) .. ...+.+. +..+.... ....+. T Consensus 159 ~~~~~~~~~~~~~~~~~~~~a~~~~~v~v~~~~~~~~~~v~~~~~a~~~~~~~~~v~~~~~~~~~a~~~gt~g~~~tv~i 238 (659) T protein:vir:72 159 EYPTLGSNWTAEISSSSSGLAAVITLGKIITDSGILLAEIENAEAAMTAVDFQANLKKYGIPGVVALYPGELGDKIEIEI 238 (659) T ss_pred cccccccceeeEEeeccccccceEEEEEeecCcceeeeeccccchhhhcccccccccccccceeeeccccccccceeEEE Confidence 00 0000000 00000000 000000 Q ss_pred ----------------------------------ccchhhhh--------------------------------hhhhcc Q lcl|NC_019421. 161 ----------------------------------GTIDEIVL--------------------------------EINSNL 174 (473) Q Consensus 161 ----------------------------------~~~~~~~~--------------------------------~~~~~~ 174 (473) ...++... ...... T Consensus 239 ~~~~~~~~~~~~~v~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 318 (659) T protein:vir:72 239 VSKADYAKGASALLPIYPGGGTRASTAKAVFGYGPQTDSQYAIIVRRNDAIVQSVVLSTKRGEKDIYDSNIYIDDFFAKG 318 (659) T ss_pred ccccccccceeeeeecccccccccccceeeeeeecccccccceeeecccceeeeeeeeeccccccccchhhhhhhhhhcC Confidence 00000000 000000 Q ss_pred cccceeEeecccCCccccccceeeeccCcccccchhhHHHHHHHHhhc---ccceEEEEEcCCCc------HHHHHHHHH Q lcl|NC_019421. 175 DNEYVIATKVADSDTILANVVNQALEGGNDGCTSITNESYLKALEEFE---RYSFDSFVLDGVAD------EALQETTKA 245 (473) Q Consensus 175 ~s~~v~~~~~~~~~~~~~~~~~~~l~gG~dg~~~~t~~d~~~~l~~le---~~~~~~l~~p~~~~------~~~~~~l~~ 245 (473) .+.++..... ...........+.+|.++....+..++.+++.+|+ ..+++++++|+... ..+++++.+ T Consensus 319 ~~~~v~~~~~---~~~~~~~~~~~l~gg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~p~~~~~~~~~~~~v~~~l~~ 395 (659) T protein:vir:72 319 GSEYIFATAQ---NWPEGFSGILTLSGGLSSNAEVTAGDLMEAWDFFADRESVDVQLFIAGSCAGESLETASTVQKHVVS 395 (659) T ss_pred CceEEEEEec---ccCCcccccccccccccccccccchhHHHHHHHhhhccccceeEEEecCCCCcchhhhHHHHHHHHH Confidence 1111111110 00011122345778888877777888888877764 45799999998643 346777778 Q ss_pred HHHHHhhCCCeEEEEEcCC--------CCccHHHHHHhhh----------ccCCceEEEecCCceecCc---ccchHHHH Q lcl|NC_019421. 246 WVAKNKELGKDILLFLGGK--------TEDNIKQINDKSK----------SFNDENIVNVGSSAYYENI---KYTPSEVA 304 (473) Q Consensus 246 ~v~~~~~~~~~~~av~~~~--------~~~t~~~~~~~~~----------~~n~~~i~~~~~~~~~~~~---~~~~~~~a 304 (473) ||++++ .+++++..+ ...+.+.+.++.. .+++.++++++||....+. .......+ T Consensus 396 ~~~~~~----~~~~~~d~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s 471 (659) T protein:vir:72 396 IGDARQ----DCLVLCSPPRETVVGIPVTRAVDNLVNWRTAAGSYTDNNFNISSTYAAIDGNHKYQYDKYNDVNRWVPLA 471 (659) T ss_pred HHhhhC----CEEEEEcCccccccCCCcccCHHHHHHHHhhccccccccccccceeEEEEcCceeeccccCCceEEechH Confidence 877665 467777654 3455666666554 3689999999999865332 22223347 Q ss_pred HHHHHhhhcCccccccceeccCc--------ccccccCCHHHHHHHHhCCcEEEEEcCCEEEEEecccccccCCCCCcch Q lcl|NC_019421. 305 VYIAALSVSKGITGSICNAKTIF--------EEVEPRLSQSEVKECLKSGTLVLDFDDGDVIIVDDVNTFKKYVDDKNEA 376 (473) Q Consensus 305 ~~vAG~~a~~~~~~s~t~~~~~~--------~~~~~~~t~~e~~~l~~~G~~~l~~~~~~~~i~~gi~T~~~~~~~~~~~ 376 (473) +++||++|++|.++++|+.|.+. .++...+++.|++.|+++|++++++.+++..++||-+|+.+ .+.+ T Consensus 472 g~vAGl~Ar~D~~~G~~~span~~~~~i~g~~~~~~~~~~~~~~~Ln~~gIn~i~~~~g~G~~~wG~rT~~~----~~s~ 547 (659) T protein:vir:72 472 ADIAGLCARTDNVSQTWMSPAGYNRGQILNVIKLAIETRQAQRDRLYQEAINPVTGTGGDGYVLYGDKTATS----VPSP 547 (659) T ss_pred HHHHHHHHHhhccCCcEEccCCeeeceeeccccccccCChhHHHHHhhCCceEEEEecCCeEEEEcccccCC----CCcc Confidence 99999999999888777766431 24456789999999999999999998887888899998743 2358 Q ss_pred hhhhhhhHHHHHHHHHHHHHHhhcCCcccCCHHHHHHHHHHHHHHHHHHHhcCCccCccceeccccccCCCCC--EEEEE Q lcl|NC_019421. 377 MGYISNIMFINTINKDTSLKRKEFVGKIFNDATGQTTVICALKKYFEELMSQGIISEFNVDIDTELQATAKAD--EFYWK 454 (473) Q Consensus 377 ~~~i~v~R~~d~i~~~i~~~~~~~ig~~~N~~~~r~~i~~~i~~~l~~l~~~g~i~~~~~~~D~~~~~~~~~d--~~~v~ 454 (473) |++|++||++|+|+++|+....++++ +||++.+|..|+..|+.||++||++|+|++|++.||.+.+++++.+ .++++ T Consensus 548 ~~~i~vrR~~~~i~~si~~~~~~~v~-e~n~~~l~~~i~~~i~~fL~~l~~~gal~~~~V~~d~~~nt~~~i~~G~~~~~ 626 (659) T protein:vir:72 548 FDRINVRRLFNMLKTNIGRSSKYRLF-ELNNAFTRSSFRTETAQYLQGNKALGGIYEYRVVCDTTNNTPSVIDRNEFVAT 626 (659) T ss_pred cceEeehhHHHHHHHHHHHHHHHhhc-CCCCHHHHHHHHHHHHHHHHHHHhcCceeeEEEEEcCCCCCHHHhhCCeEEEE Confidence 99999999999999999877766665 6899999999999999999999999999999999999988877544 78889 Q ss_pred EEEEEeeeeeeEEEEEEeC Q lcl|NC_019421. 455 WDAVKVDVMKKIYGTGYLG 473 (473) Q Consensus 455 i~v~p~~~~e~i~~t~~v~ 473 (473) +.++|+.|+|||.++|.-- T Consensus 627 i~~~p~~pae~I~~~~~~~ 645 (659) T protein:vir:72 627 FYIQPARSINYITLNFVAT 645 (659) T ss_pred EEEEecCCccEEEEEEEEe Confidence 9999999999999998764 No 17 >protein:vir:6894 Length: 660 # NCBI annotation: gp18 tail sheath protein # Family: family:all:661 # MgeID: mge:140 # MgeName: RB69 # Cross-refs: genbank:acc:NP_861870;genbank:gi:32453661;genbank:GeneID:1494296 Probab=100.00 E-value=5.8e-69 Score=394.70 Aligned_cols=454 Identities=14% Similarity=0.130 Sum_probs=293.3 Q ss_pred CceecCceeEEEecCCcceecccCceEEEEEEeeCCCCCCceEEeeccHHHHHHHcCC-CcCcHHHHHHHHHHhcCCCEE Q lcl|NC_019421. 10 ERKEIPGFYNRFKTQAEKSTNTGLKGRLAMPIRANWGDVGKVVTIKNDLRQLKNLFGD-DMNYSAFKLGKLALLGNVKEL 88 (473) Q Consensus 10 ~~~~~PGvYie~~~~~~~~i~~~~~~~~~~~g~a~~Gp~~~~v~i~s~~~~~~~~fG~-~~~~~~~~~v~~~f~~g~~~v 88 (473) -+-..|||||||+ ++.++|++++|+++||+|.++|||+|+|++|+|+ .+|++.||. .....+.++++.||+|||++| T Consensus 1 ~~~~~PgVyv~e~-~~~~~i~~v~ts~~~fvG~~~~Gp~~~p~~i~s~-~~~~~~fG~~~~~~~~~~~~~~~f~~~g~~~ 78 (660) T protein:vir:68 1 MALLSPGVELKET-TVQSTVVNNSTGTAALAGKFQWGPAFQIKQITDE-VALVDMFGTPNTDTADYFMSAMNFLQYGNDL 78 (660) T ss_pred CccccCceEEEEe-cCCcccccCCCcceeEEecccCCCCccCEEecCH-HHHHHhcCCccCccchhHHHHHHHHhCCCeE Confidence 4556799999999 5899999999999999999999999999999995 567778875 444456679999999999999 Q ss_pred EEEecCCCcccceeeeeccc----ccccc---------------------cc---------------------------- Q lcl|NC_019421. 89 LLYRLVDGNQKKGTLTLKDT----TENSA---------------------KD---------------------------- 115 (473) Q Consensus 89 ~v~rv~~g~~~aat~~l~~~----~~~~~---------------------~~---------------------------- 115 (473) ||+|+.+++..++.....+. ....+ .. T Consensus 79 ~vvRv~~~~~~~~~~~~~~~~~~t~~~~g~~~~~g~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~ta~~~~~a~~~~ 158 (660) T protein:vir:68 79 RVVRAVDRDTAKNSSPVAGNINFTISSAGTNYRVGDKVVVKYSTDIIEPDGEVTSVDSDGKILNIFIPSGKIIAKAKEIG 158 (660) T ss_pred EEEEecccccccccccccccceeeeeccCcceeeeeeeeeecccccccccccceeeeecCceeeeeeccccccccceeec Confidence 99999754433211110000 00000 00 Q ss_pred ---------eEEE---------------------------------------------------EecCccccceeEEEee Q lcl|NC_019421. 116 ---------VIKL---------------------------------------------------ETKYPTARNFNVTIKS 135 (473) Q Consensus 116 ---------~l~i---------------------------------------------------~A~~~G~~~n~i~v~~ 135 (473) ...+ .|..+|.|++.+++.. T Consensus 159 ~~~~~~~~~~~~v~~~~~~~~~~~~v~~~~~d~~~~~~~~~ta~~~~~~~~~~~~~~~~~~~~~~A~~~g~~G~~i~v~~ 238 (660) T protein:vir:68 159 EYPELGSNWTAEMSGSSSGLSAVITIDSVVMDSGILLTEVETSEEAITSLTFQESIKKYGVPGVVALYPGELGDQLEIEI 238 (660) T ss_pred cccccccceeEEeecccccceeeeeeccccccccceeeeeccccccccccceeeeecccCccccccccccccccceEEEE Confidence 0000 0111112222211110 Q ss_pred ccCC---------------------------------ccceeeeeecCCceeeEEEecccchhhh--------hhhhhcc Q lcl|NC_019421. 136 NLVD---------------------------------SDKKDFIFFENTKQLFSSSIKGTIDEIV--------LEINSNL 174 (473) Q Consensus 136 ~~~~---------------------------------~~~~~v~v~~~~~~~~~~~~~~~~~~~~--------~~~~~~~ 174 (473) .... ...+.+.+..++...+.+.......... .....+. T Consensus 239 ~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 318 (660) T protein:vir:68 239 VSKADYDKGASAQLKIYPDGGTRYSTAKAIFGYGPQTDDQYAIIVRRNDSVVQSVVLSTKRGERDIYGSNIFIDDFFAKG 318 (660) T ss_pred eccccccccccccceeeecccccccceeeEeecccccccceeeeeecCCcceeeeeeecccccccccccceeeehhhccC Confidence 0000 0000000000000000000000000000 0000011 Q ss_pred cccceeEeecccCCccccccceeeeccCcccccchhhHHHHHHHHhh---cccceEEEEEcCCCc------HHHHHHHHH Q lcl|NC_019421. 175 DNEYVIATKVADSDTILANVVNQALEGGNDGCTSITNESYLKALEEF---ERYSFDSFVLDGVAD------EALQETTKA 245 (473) Q Consensus 175 ~s~~v~~~~~~~~~~~~~~~~~~~l~gG~dg~~~~t~~d~~~~l~~l---e~~~~~~l~~p~~~~------~~~~~~l~~ 245 (473) .+.++.+.. ............+.+|.++....+..++..++..+ +..+.+++.+++... ..+++.+.+ T Consensus 319 ~~~~v~~~~---~~~~~~~~~~~~~~gg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~v~~~l~~ 395 (660) T protein:vir:68 319 ASNYIFATA---QGWPKGFSGVIKLNGGLSSNETVEAGDLMEAWDLFADRESVNAQLFIAGSCAGESLEVASTVQKHVVA 395 (660) T ss_pred cccEEEEee---cCCCccccceeeeccccccccccccchhhhHHHHhhhhhccccceeeccccCCCchHHHHHHHHHHHH Confidence 112222111 11111122234678888887766776776665544 455566665554322 256778888 Q ss_pred HHHHHhhC----CCeEEEEEcCCCCccHHHHHHhhh----------ccCCceEEEecCCceecCc---ccchHHHHHHHH Q lcl|NC_019421. 246 WVAKNKEL----GKDILLFLGGKTEDNIKQINDKSK----------SFNDENIVNVGSSAYYENI---KYTPSEVAVYIA 308 (473) Q Consensus 246 ~v~~~~~~----~~~~~av~~~~~~~t~~~~~~~~~----------~~n~~~i~~~~~~~~~~~~---~~~~~~~a~~vA 308 (473) ||+++++. ...+.+++..+.+.+.+.+..+.. .+++.++++++||....+. .......++++| T Consensus 396 ~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~sg~~A 475 (660) T protein:vir:68 396 IGDSRQDCLVLCSPPRAAVVGIPVNRAVDNLVDWRTASGTYTDNNFNISSTYAAIDGNYKYQYDKYNDVNRWVPLAADIA 475 (660) T ss_pred HHHhhCCeEEEEcccceeEecCCCCCCHHHHHHHHhhcccccccccccCcceEEEEcCceEEecccCCceEEechhHHHH Confidence 88776542 112344445556677777777665 3789999999998865332 222233479999 Q ss_pred HhhhcCccccccceeccC--------cccccccCCHHHHHHHHhCCcEEEEEcCCEEEEEecccccccCCCCCcchhhhh Q lcl|NC_019421. 309 ALSVSKGITGSICNAKTI--------FEEVEPRLSQSEVKECLKSGTLVLDFDDGDVIIVDDVNTFKKYVDDKNEAMGYI 380 (473) Q Consensus 309 G~~a~~~~~~s~t~~~~~--------~~~~~~~~t~~e~~~l~~~G~~~l~~~~~~~~i~~gi~T~~~~~~~~~~~~~~i 380 (473) |++|++|.++++|+.|.+ ..++...+++.|++.|+++|++++++.+++..++||-+|+.+ .+++|++| T Consensus 476 Gl~Ar~d~~~g~~~span~~~~~i~g~~~~~~~~~~~~~~~Ln~~gIn~i~~~~~~G~~~wG~rT~~~----~~s~~~~i 551 (660) T protein:vir:68 476 GLCARTDNISQPWMSPAGYNRGQILNVIKLAIETRQAQRDRLYQEAINPVTGTGGDGYVLYGDKTATS----VPSPFDRI 551 (660) T ss_pred HHHHHHhccCCcEEccCCeeeceeeccceeeecCChhHHHHHhhCCceEEEEecCCeEEEEcceecCC----CCcccceE Confidence 999999977766665543 124555689999999999999999998888888899998743 24589999 Q ss_pred hhhHHHHHHHHHHHHHHhhcCCcccCCHHHHHHHHHHHHHHHHHHHhcCCccCccceeccccccCCCCC--EEEEEEEEE Q lcl|NC_019421. 381 SNIMFINTINKDTSLKRKEFVGKIFNDATGQTTVICALKKYFEELMSQGIISEFNVDIDTELQATAKAD--EFYWKWDAV 458 (473) Q Consensus 381 ~v~R~~d~i~~~i~~~~~~~ig~~~N~~~~r~~i~~~i~~~l~~l~~~g~i~~~~~~~D~~~~~~~~~d--~~~v~i~v~ 458 (473) ++||++++|+++|+....++++ +|||+.+|.+|+..|+.||++||++|+|.+|++.||.+.+++++.+ .+++.+.++ T Consensus 552 ~vrR~~~~i~~si~~~~~~~v~-epn~~~~~~~i~~~i~~~L~~l~~~gal~gf~V~~d~~~nt~~~i~~G~~~~~i~~~ 630 (660) T protein:vir:68 552 NVRRLFNMVKTNIGSASKYRLF-ELNNAFTRSSFRTETSQYLQGIKALGGVYNFKVVCDTTNNTPAVIDRNEFVATFYLQ 630 (660) T ss_pred ehhhHHHHHHHHHHHHHHHhcc-CCCCHHHHHHHHHHHHHHHHHHHhcCceeeeEEEEecCCCCHHHhhCCeEEEEEEEE Confidence 9999999999999977765555 6999999999999999999999999999999999999998887654 788889999 Q ss_pred EeeeeeeEEEEEEeC Q lcl|NC_019421. 459 KVDVMKKIYGTGYLG 473 (473) Q Consensus 459 p~~~~e~i~~t~~v~ 473 (473) |+.|||||.++|.-- T Consensus 631 p~~pae~i~l~~~~~ 645 (660) T protein:vir:68 631 PARSINYITLNFVAT 645 (660) T ss_pred ecCCcceEEEEEEEe Confidence 999999999998655 No 18 >protein:vir:108052 Length: 660 # NCBI annotation: gp18 tail sheath protein # Family: family:all:661 # MgeID: mge:2002 # MgeName: JS98 # Cross-refs: genbank:acc:YP_001595294;genbank:gi:161622600;genbank:GeneID:5783771 Probab=100.00 E-value=4.3e-69 Score=395.37 Aligned_cols=442 Identities=16% Similarity=0.139 Sum_probs=298.7 Q ss_pred CceecCceeEEEecCCcceecccCceEEEEEEeeCCCCCCceEEeeccHHHHHHHcCC-CcCcHHHHHHHHHHhcCCCEE Q lcl|NC_019421. 10 ERKEIPGFYNRFKTQAEKSTNTGLKGRLAMPIRANWGDVGKVVTIKNDLRQLKNLFGD-DMNYSAFKLGKLALLGNVKEL 88 (473) Q Consensus 10 ~~~~~PGvYie~~~~~~~~i~~~~~~~~~~~g~a~~Gp~~~~v~i~s~~~~~~~~fG~-~~~~~~~~~v~~~f~~g~~~v 88 (473) -+-..|||||||++ +.++|++++|+++||+|.++|||+|+|++|+|+ .++++.||. .....+.++++.||+|||++| T Consensus 1 ~~~~~Pgvyv~e~~-~~~~i~~~~t~~~~~vg~~~~gp~~~p~~v~s~-~~~~~~fg~~~~~~~~~~~~~~~f~~~g~~~ 78 (660) T protein:vir:10 1 MALLSPGIELKETS-VQSTVVRNATGRAALVGKFQWGPAFQVTQITNE-VELVDLFGGPNNEVADYFMSGMNFLQYGNDL 78 (660) T ss_pred CceecCceEEEeec-CCccccCCCcccceEEeecCCCCCccCeEcCCH-HHHHHHcCCcCCCchhHHHHHHHHHhCCceE Confidence 56677999999995 789999999999999999999999999999995 557778875 434456688999999999999 Q ss_pred EEEecCCCcccceeeeecccccccccceEEEEecCcc---ccceeEEEeeccCC-----------c-------------- Q lcl|NC_019421. 89 LLYRLVDGNQKKGTLTLKDTTENSAKDVIKLETKYPT---ARNFNVTIKSNLVD-----------S-------------- 140 (473) Q Consensus 89 ~v~rv~~g~~~aat~~l~~~~~~~~~~~l~i~A~~~G---~~~n~i~v~~~~~~-----------~-------------- 140 (473) |++|+.++...++... ....+.+++.++| .|++.+++...... + T Consensus 79 ~vvrv~~~~~~~~~~~--------~~~~~~~~~~~~~~~~~~g~~~~v~~~~~~~~~~~~v~~~~a~g~~~~~~~~ta~~ 150 (660) T protein:vir:10 79 RTVRVVSREFAKNASP--------IAGNIETTITTAGSNYAVGDKINIKYNQTVVESEGRVTSVDTDGKILSVFIPSAKI 150 (660) T ss_pred EEEEeccccccccccc--------ccccceeEEeeccccccccceeeEeeccccccccccceeeccccceeeeccccccc Confidence 9999987653322111 1122333333333 34444433210000 0 Q ss_pred --------------c--ceee-----------ee---ecCCce------------------------------------- Q lcl|NC_019421. 141 --------------D--KKDF-----------IF---FENTKQ------------------------------------- 153 (473) Q Consensus 141 --------------~--~~~v-----------~v---~~~~~~------------------------------------- 153 (473) . ...+ .+ ..++.. T Consensus 151 ~~~a~~v~~~~~~~~~~~~~~~~~~~~~~~a~sv~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~g~~ 230 (660) T protein:vir:10 151 IAYARSLNQYPTLGPAWTAEVTSASSGVSGTITVGKIVTDSGILLTEAENSEEAITSLEFQAALKKFAMPGVVALYPGEI 230 (660) T ss_pred cccccccccccccccceeEEEecccCccccceeeeeeeccCcceEEeeeccccccccccceeeccccccceeeeeccccc Confidence 0 0000 00 000000 Q ss_pred ---eeEE---------------Eec-----------c------cchhhhhhh---hh----------------------- Q lcl|NC_019421. 154 ---LFSS---------------SIK-----------G------TIDEIVLEI---NS----------------------- 172 (473) Q Consensus 154 ---~~~~---------------~~~-----------~------~~~~~~~~~---~~----------------------- 172 (473) .... ... . ..++..... .. T Consensus 231 G~~i~v~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~~~~~~~~~~~~~~~~~~~~~ 310 (660) T protein:vir:10 231 GSTLEVEIVSKAAYEAGSSKMLDVYPGGGTRASIAKAVFNYGPQTDDQYAIIVRRDGAIVESVVLSTKEGEKDVYGNNIY 310 (660) T ss_pred CcceeEEEeeccccCCcceeEEeeeeccceeeEEeeeecccccccccccccccccCCcccceeeeeccccccccccceee Confidence 0000 000 0 000000000 00 Q ss_pred ------cccccceeEeecccCCccccccceeeeccCcccccchhhHHHHHHHHhhcc---cceEEEEEcCCCc------H Q lcl|NC_019421. 173 ------NLDNEYVIATKVADSDTILANVVNQALEGGNDGCTSITNESYLKALEEFER---YSFDSFVLDGVAD------E 237 (473) Q Consensus 173 ------~~~s~~v~~~~~~~~~~~~~~~~~~~l~gG~dg~~~~t~~d~~~~l~~le~---~~~~~l~~p~~~~------~ 237 (473) ...+.++.+... ...........+.+|.++....+..++..++..|+. ..++++++|+... . T Consensus 311 ~~~~~~~~~~~~v~~~~~---~~~~~~~~~~~l~gg~~~~~~~t~~~~~~~~~~~~~~~~~~~~~l~~p~~~~~~~~~~~ 387 (660) T protein:vir:10 311 LDDYFAKGTSNYIYATSL---NWPKGFSGIINLSGGISANDKVTAGDLMQGWDLFADREALHINLLIAGAVAGEGDEVAS 387 (660) T ss_pred eehhhcCCCccEEEEEec---cCCCCcccceeeeccccCccccccchhhhhhhhhhhhhhcccceEEEcCcCCCchhhhH Confidence 000000000000 000011223567888888777788888877777654 4689999997542 3 Q ss_pred HHHHHHHHHHHHHhhCCCeEEEEEcCCCC--------ccHHHHHHhhh----------ccCCceEEEecCCceecCc--- Q lcl|NC_019421. 238 ALQETTKAWVAKNKELGKDILLFLGGKTE--------DNIKQINDKSK----------SFNDENIVNVGSSAYYENI--- 296 (473) Q Consensus 238 ~~~~~l~~~v~~~~~~~~~~~av~~~~~~--------~t~~~~~~~~~----------~~n~~~i~~~~~~~~~~~~--- 296 (473) .+++++.+||++++ .|+++++.+.+ .+.+.+..+.. .+++.+.++++||....+. T Consensus 388 ~v~~al~~~~~~~~----~~~aiid~p~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~ 463 (660) T protein:vir:10 388 TVQKHVVSIADERQ----DCLAFISPPKGLLVNVPLTRAVDNLIDWRTGAGTFDANNMNISTTYAAIDGNYKYQYDKYND 463 (660) T ss_pred HHHHHHHHHHHhhC----CEEEEEecCcccccccccccCHHHHHHHHhhcccccccccccCcceEEEEcCceEEecccCC Confidence 46777777777653 58888887643 45666666654 4789999999998865332 Q ss_pred ccchHHHHHHHHHhhhcCccccccce----eccC----cccccccCCHHHHHHHHhCCcEEEEEc-CCEEEEEecccccc Q lcl|NC_019421. 297 KYTPSEVAVYIAALSVSKGITGSICN----AKTI----FEEVEPRLSQSEVKECLKSGTLVLDFD-DGDVIIVDDVNTFK 367 (473) Q Consensus 297 ~~~~~~~a~~vAG~~a~~~~~~s~t~----~~~~----~~~~~~~~t~~e~~~l~~~G~~~l~~~-~~~~~i~~gi~T~~ 367 (473) .......++++||++|++|.++++|+ +.+. ..++...+++.|++.|+++|++++++. +++..++||.+|+. T Consensus 464 ~~~~~p~sg~~AGl~Ar~D~~~g~~~sPan~~~~~i~g~~~~~~~~~~~e~~~Ln~~gIn~i~~~~~~~G~~~wG~rT~~ 543 (660) T protein:vir:10 464 VNRWVPLAADLAGLCARTDDVSQPWMSPAGYNRGQILNVLKLAIEPRQAQRDRMYQEAINPVVGFAGGDGFVLFGDKTAT 543 (660) T ss_pred ceeEechhHHHHHHHHHhhccCCcEEccCCeeeceeeccceeeecCChhhHHhHhhCCceEEEEeeCCCcEEEEcccccC Confidence 22223347999999999997775554 4332 234556789999999999999999875 44677789999964 Q ss_pred cCCCCCcchhhhhhhhHHHHHHHHHHHHHHhhcCCcccCCHHHHHHHHHHHHHHHHHHHhcCCccCccceeccccccCCC Q lcl|NC_019421. 368 KYVDDKNEAMGYISNIMFINTINKDTSLKRKEFVGKIFNDATGQTTVICALKKYFEELMSQGIISEFNVDIDTELQATAK 447 (473) Q Consensus 368 ~~~~~~~~~~~~i~v~R~~d~i~~~i~~~~~~~ig~~~N~~~~r~~i~~~i~~~l~~l~~~g~i~~~~~~~D~~~~~~~~ 447 (473) + .+++|++|++||++++|+++|+...++++++ ||++.+|.+|+..|+.||++||++|+|.+|++.||.+.+++++ T Consensus 544 ~----~~s~~~~i~vrR~~~~i~~si~~~~~~~v~e-pn~~~l~~~i~~~i~~fL~~l~~~gal~g~~V~~d~~~nt~~d 618 (660) T protein:vir:10 544 K----VPSPMDHINVRRLFNMLKKNIGDASKYKLFE-LNDNFTRSSFRMEVSQYLDGIKALGGIYEGRVVCDTTVNTPAV 618 (660) T ss_pred C----CCcccceEehhhHHHHHHHHHHHHHHHhccC-CCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEcCCCCCHHH Confidence 2 2458999999999999999999877666664 8999999999999999999999999999999999999888876 Q ss_pred CC--EEEEEEEEEEeeeeeeEEEEEEeC Q lcl|NC_019421. 448 AD--EFYWKWDAVKVDVMKKIYGTGYLG 473 (473) Q Consensus 448 ~d--~~~v~i~v~p~~~~e~i~~t~~v~ 473 (473) .+ .+++++.++|+.|||||.++|.-- T Consensus 619 i~~G~~~~~i~~~P~~pae~I~~~~~~~ 646 (660) T protein:vir:10 619 IDRNEFIANIYVKPARSINYITLNFVAT 646 (660) T ss_pred hhCCeEEEEEEEEecCCccEEEEEEEEe Confidence 54 788999999999999999997755 No 19 >protein:vir:98824 Length: 774 # NCBI annotation: putative phage tail sheath protein # Family: family:all:661 # MgeID: mge:1530 # MgeName: Ma-LMM01 # Cross-refs: genbank:acc:YP_851105;genbank:gi:117530262;genbank:GeneID:4484488 Probab=100.00 E-value=1.2e-69 Score=398.52 Aligned_cols=438 Identities=15% Similarity=0.065 Sum_probs=314.4 Q ss_pred CCc-cccCCC-CceecCceeEEEecCCcceecc-cCceEEEEEEeeCCCCCCceEEeeccHHHHHHHcCCCcCcHHHHHH Q lcl|NC_019421. 1 MAT-GTWNEK-ERKEIPGFYNRFKTQAEKSTNT-GLKGRLAMPIRANWGDVGKVVTIKNDLRQLKNLFGDDMNYSAFKLG 77 (473) Q Consensus 1 m~~-g~~~~~-~~~~~PGvYie~~~~~~~~i~~-~~~~~~~~~g~a~~Gp~~~~v~i~s~~~~~~~~fG~~~~~~~~~~v 77 (473) .+| .-|+.. -|=..|||||||+++++++|++ ++|+++||+|.++|||+|+|++|+|+.| +...||...+. T Consensus 270 ~~~~~~~~~~~~~v~~~GVYVEEVpSGvrtIeGGV~TSVAAFVG~A~rGPvn~PvlITS~aD-~~~~Fg~~~GG------ 342 (774) T protein:vir:98 270 CAGVEPFGEITRNVEDNGVVIQLEPALTGSISNRFSFYVTANDNTANRGFTTSPALVTTIPD-PAIHFTSFQGG------ 342 (774) T ss_pred hcccccccceEEEEecCceEEEEeCCCCccccccccceeeeecccccCCCCCcCEEEeehhH-hhhhhccccCC------ Confidence 233 112211 3445699999999999999998 8999999999999999999999999654 55555533221 Q ss_pred HHHHhcCCCEEEEEecCCCcccceeeeecccccccccceEEEEecCccccceeEEEeeccCCccceeeeeecCCc----- Q lcl|NC_019421. 78 KLALLGNVKELLLYRLVDGNQKKGTLTLKDTTENSAKDVIKLETKYPTARNFNVTIKSNLVDSDKKDFIFFENTK----- 152 (473) Q Consensus 78 ~~~f~~g~~~v~v~rv~~g~~~aat~~l~~~~~~~~~~~l~i~A~~~G~~~n~i~v~~~~~~~~~~~v~v~~~~~----- 152 (473) .+|+.+++.... ...+...++++|++||.|||.++|.......+.+.+.+..... T Consensus 343 ----l~GassA~r~~~----------------~~sG~~~L~i~A~~pGawGN~ItV~I~~~t~~~~~l~v~~~~~s~f~~ 402 (774) T protein:vir:98 343 ----LDGPRSAFRDFY----------------TFNGTPLLRLQAVSEGNWGNQVTVSIYPVNNSEFRLNVQDLNGSAFNP 402 (774) T ss_pred ----ccccceeeeeee----------------eecccceEEEEEeecCcCCCceEEEEEecCCceeEEEEEecCCccccc Confidence 145555542211 1233457899999999999999888654433334333221100 Q ss_pred --eeeEEEec-----------ccchhhhh-------h-hhhcccccceeEee-----cc-cCCc----------cccccc Q lcl|NC_019421. 153 --QLFSSSIK-----------GTIDEIVL-------E-INSNLDNEYVIATK-----VA-DSDT----------ILANVV 195 (473) Q Consensus 153 --~~~~~~~~-----------~~~~~~~~-------~-~~~~~~s~~v~~~~-----~~-~~~~----------~~~~~~ 195 (473) ..+...+. ...++... . ...+..+.++.... .. .... ...... T Consensus 403 ~~a~e~~tv~~~~~~~~~~v~e~~dn~~i~~~~~~~~~~~in~vs~lv~~~~~~~a~~d~~~~~~~~~~~~~~~~~~~~v 482 (774) T protein:vir:98 403 PLADEVYTVKLGDTNESGELNALLDSKFIRGFFLPKSIDSINYDAALVRQSPLRLAPPDESETDVENPAHVDFYGPNVLV 482 (774) T ss_pred cccceeEEEecccccccceeeeeeceeeEeecccccccccccccccccccchhcccccccccccccccccccccCCcceE Confidence 00001100 00000000 0 00001111111110 00 0000 011222 Q ss_pred eeeeccCcccccchhhHHHHHHHHhhcccceEEEEEcCCCcHHHHHHHHHHHHHHhhCCCeEEEEEcCCCCccHHHHHHh Q lcl|NC_019421. 196 NQALEGGNDGCTSITNESYLKALEEFERYSFDSFVLDGVADEALQETTKAWVAKNKELGKDILLFLGGKTEDNIKQINDK 275 (473) Q Consensus 196 ~~~l~gG~dg~~~~t~~d~~~~l~~le~~~~~~l~~p~~~~~~~~~~l~~~v~~~~~~~~~~~av~~~~~~~t~~~~~~~ 275 (473) ...+.+|.||.. .+..+|..++..++..++++|+.+ ..+..++..+.+||++++..++.|+++++.+.+.+.+++.++ T Consensus 483 ~v~lagG~Dg~~-tt~~~igg~~~~~~~tgi~aLl~a-~~~~~V~~aii~~~e~~~~~~~~r~avid~p~g~t~~~Ai~~ 560 (774) T protein:vir:98 483 DVTLENGYDGPP-VTNDDYVSIIRTLENQPVHILLVG-TTNVGVQQALITEAERASDSDGLRIAVLAAPPRTTPTLAASV 560 (774) T ss_pred EEeecCCCCccc-ccchheecccccccccceeEEEcC-ccchhhHHHHHHHHHHhhhcccceEEEEECCCCCCHHHHHHH Confidence 345778888754 455678888888888899988765 567889999999999999888889999999999999999999 Q ss_pred hhccCCceEEEecCCceecCc---ccchHHHHHHHHHhhhcCccccccceeccCcc-------cccccCCHHHHHHHHhC Q lcl|NC_019421. 276 SKSFNDENIVNVGSSAYYENI---KYTPSEVAVYIAALSVSKGITGSICNAKTIFE-------EVEPRLSQSEVKECLKS 345 (473) Q Consensus 276 ~~~~n~~~i~~~~~~~~~~~~---~~~~~~~a~~vAG~~a~~~~~~s~t~~~~~~~-------~~~~~~t~~e~~~l~~~ 345 (473) +..++++++++++||....+. .......|+++||++|++|+.+|+.|+++.+. ......++.|++.|..+ T Consensus 561 r~~f~S~~aal~~Pwvkv~D~~~g~~~~vPpSg~vAGl~ArtDv~kSPANk~I~Givg~ai~~~l~~~~t~ae~d~Ln~~ 640 (774) T protein:vir:98 561 TRGFNSTRAVMVAGWFTYAGQPNSSRYGVPGAAVYAGKLAAIDFFVSPAARSLVGPLFNIIESDTDNYTSRSNQDIYSAA 640 (774) T ss_pred HhccCCceEEEEeCcEEEeccCCCceeecChhHHHHHHHHhcCcccccCCceeecceeccccccccccccchhhhhhccc Confidence 999999999999999876432 22223348999999999999999999987543 23345678899999999 Q ss_pred CcEEEE-EcCCEEEEEecccccccCCCCCcchhhhhhhhHHHHHHHHHHHHHHhhcCCcccCCHHHHHHHHHHHHHHHHH Q lcl|NC_019421. 346 GTLVLD-FDDGDVIIVDDVNTFKKYVDDKNEAMGYISNIMFINTINKDTSLKRKEFVGKIFNDATGQTTVICALKKYFEE 424 (473) Q Consensus 346 G~~~l~-~~~~~~~i~~gi~T~~~~~~~~~~~~~~i~v~R~~d~i~~~i~~~~~~~ig~~~N~~~~r~~i~~~i~~~l~~ 424 (473) |++++. ...++....||-+|+.+ |+.|++|++||++|||+++|+....+++++ ||++.+|.+|+..|+.||++ T Consensus 641 gIN~i~itt~g~G~rvWG~RTlss-----Dp~wr~InVRRlfd~Ie~SI~~~~~~~VfE-PNd~~l~~~I~~sI~~fL~~ 714 (774) T protein:vir:98 641 RLEVLSLDTVDRTYRFASGVTLST-----DPAWERIYLRRVHDVVRQGAHAILRNYVAM-PNSRLVRNQIAAALNAFMGE 714 (774) T ss_pred ccceeEEEEcCCcEEEEcccccCC-----CcccceEeehhhHHHHHHHHHHHHHHhccC-CCCHHHHHHHHHHHHHHHHH Confidence 999986 34445667788888643 789999999999999999999888777775 99999999999999999999 Q ss_pred HHhcCCccCcc-ceeccccccCCCCC--EEEEEEEEEEeeeeeeEEEEEEeC Q lcl|NC_019421. 425 LMSQGIISEFN-VDIDTELQATAKAD--EFYWKWDAVKVDVMKKIYGTGYLG 473 (473) Q Consensus 425 l~~~g~i~~~~-~~~D~~~~~~~~~d--~~~v~i~v~p~~~~e~i~~t~~v~ 473 (473) |+++|+|.+|. +.||.+.+++++.+ .++++++++|++|+|||++++.-- T Consensus 715 L~~~GaL~G~~~V~~D~etNt~~dI~~G~l~i~I~vaP~~PAEfIilri~q~ 766 (774) T protein:vir:98 715 LKRNGNIVSFRPAIIDGSNNSTAAYFSRELYVSLQFQPLYSADYIYVTISRD 766 (774) T ss_pred HHhCCceecceEEEEcCCCCCHHHhhCCEEEEEEEEEecCCcceEEEEEEEe Confidence 99999999987 78999988887544 789999999999999999998777 No 20 >protein:vir:106984 Length: 743 # NCBI annotation: contractile sheath protein gp18 # Family: family:all:661 # MgeID: mge:1459 # MgeName: S-PM2 # Cross-refs: genbank:acc:YP_195136;genbank:gi:58532913;uniprot:Q5GQN6;genbank:GeneID:3260481 Probab=100.00 E-value=2e-68 Score=391.78 Aligned_cols=454 Identities=16% Similarity=0.088 Sum_probs=306.9 Q ss_pred CCccccCCCCceecCceeEEEecCCcceecccCceEEEEEEeeCCCCCCceEEeeccHHHHHHHcCCC-cCcHHHHHHHH Q lcl|NC_019421. 1 MATGTWNEKERKEIPGFYNRFKTQAEKSTNTGLKGRLAMPIRANWGDVGKVVTIKNDLRQLKNLFGDD-MNYSAFKLGKL 79 (473) Q Consensus 1 m~~g~~~~~~~~~~PGvYie~~~~~~~~i~~~~~~~~~~~g~a~~Gp~~~~v~i~s~~~~~~~~fG~~-~~~~~~~~v~~ 79 (473) |+ ++ .-|||||||++++.++|+++.|+++||+|.++|||+|+|++|+|+ .+|++.||.. ....+.++++. T Consensus 1 m~--~~------~~Pgvyv~e~~~~~~~i~~~~t~~~~~vg~~~~Gp~~~p~~i~s~-~~~~~~fG~~~~~~~~~~~v~~ 71 (743) T protein:vir:10 1 MA--SQ------VSPGILIKERDLTNAVVTGALQIRAAHASTFAKGPIGDIVNINTQ-KELVSVFGEPKEDNAEDWMVAS 71 (743) T ss_pred Cc--cc------cCCceEEEEecCCCceeccCCcceeEEEEeccCCCCCcCEEecCH-HHHHHHcCCccCCcchHHHHHH Confidence 43 12 239999999999999999999999999999999999999999995 5577788864 34456689999 Q ss_pred HHhcCCCEEEEEecCCCcccceeeeecc-------cccccccceEEEEecCccccceeEEEeeccCCcc----------- Q lcl|NC_019421. 80 ALLGNVKELLLYRLVDGNQKKGTLTLKD-------TTENSAKDVIKLETKYPTARNFNVTIKSNLVDSD----------- 141 (473) Q Consensus 80 ~f~~g~~~v~v~rv~~g~~~aat~~l~~-------~~~~~~~~~l~i~A~~~G~~~n~i~v~~~~~~~~----------- 141 (473) ||+|||++|||+|+.++..+.++..... .........++++|++||+|+|.++|+.....+. T Consensus 72 ~f~ngg~~~~vvrv~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~G~~gN~i~V~v~~~~~d~~~~~~~~~~~ 151 (743) T protein:vir:10 72 EFLNYGGRLAVVRAETTGVLNATTGSAGVLVKNRESWDAGSGNGEVFVARTAGSWGNSLMGVLVDRGADYIVTFAATPTD 151 (743) T ss_pred HHHhCCceEEEEEccCccccccccccccccccccccccccccceeEEEEeeccccccceEEEEecCCCcceeeeeccccc Confidence 9999999999999998766655533211 1123355689999999999999888763211000 Q ss_pred ---ceeee-------------eec---C--------C-cee----------------------------eEEE----ecc Q lcl|NC_019421. 142 ---KKDFI-------------FFE---N--------T-KQL----------------------------FSSS----IKG 161 (473) Q Consensus 142 ---~~~v~-------------v~~---~--------~-~~~----------------------------~~~~----~~~ 161 (473) ...+. ... + . ... .... ... T Consensus 152 ~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~ 231 (743) T protein:vir:10 152 TAVGTQLLFSYSGTLVTGEILSYDSATNTATITASGTLTSQYLLDTPEQGLIGSFTDNSTTEVGRTPGTYSNVPASGGTG 231 (743) T ss_pred cccceeeeecccccccccceeeeeecCcceeeeeccccceeeecccccccccccccccccccccccccceeeEEeccccc Confidence 00000 000 0 0 000 0000 000 Q ss_pred cchhh-------------------------------------------hh--h----------h-----------hhccc Q lcl|NC_019421. 162 TIDEI-------------------------------------------VL--E----------I-----------NSNLD 175 (473) Q Consensus 162 ~~~~~-------------------------------------------~~--~----------~-----------~~~~~ 175 (473) ..... .. . . ..... T Consensus 232 ~~~~~tv~v~~~~~~vg~~v~~~~~~~~~~~~~~~~~~v~~~~~~~~t~~~~~~~~~~~g~~~~~a~~~~~~~~~~~~~~ 311 (743) T protein:vir:10 232 TGATFNVVVADAGGGVGGSVVVTLANPGTGYNQGETLTIASAATGDGTDILVTVATLSDGTIAITELKDWYLNTEIGSTG 311 (743) T ss_pred ccccccccccccccccccccccccccccceeeeccccccccccccccccchhheecccccceeeeecccccccchhhccc Confidence 00000 00 0 0 00000 Q ss_pred c-------ccee-----------------------------------Eeecc--cCCc----c----------------- Q lcl|NC_019421. 176 N-------EYVI-----------------------------------ATKVA--DSDT----I----------------- 190 (473) Q Consensus 176 s-------~~v~-----------------------------------~~~~~--~~~~----~----------------- 190 (473) + +..+ +.... .... . T Consensus 312 ~~~~~~~~~~~t~~~~~~~~~~~d~~~v~v~~~~~~~~~~~~~v~~~~~~~s~~~~~~~~~~~~~~~~~~~~~~s~~~~~ 391 (743) T protein:vir:10 312 IKLGDIGPRPGTSQFATDNGITDDQVHFAVIDTTGELTGTANTIVERLTYLSKLSDARSEENANIYYKNVINEQSAYLYH 391 (743) T ss_pred cccccccccceeeeccccccccccceEEEEecCcceeeeccCceeEEEeeeecccccccccCcceeecceeccccceeec Confidence 0 0000 00000 0000 0 Q ss_pred -------------------------------ccccceeeeccCcccccchhhHHHHHHHHhh---cccceEEEEEcCCC- Q lcl|NC_019421. 191 -------------------------------LANVVNQALEGGNDGCTSITNESYLKALEEF---ERYSFDSFVLDGVA- 235 (473) Q Consensus 191 -------------------------------~~~~~~~~l~gG~dg~~~~t~~d~~~~l~~l---e~~~~~~l~~p~~~- 235 (473) ........+.||.|+.. .+..++..++..| +..+++++++|+.. T Consensus 392 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gG~d~~~-~~~~~~~~~~~~~~~~~~~~~~ll~~p~~~~ 470 (743) T protein:vir:10 392 GNDAAVQIAASGEAWGQSSDQVLADAGTAFSRTTGYWVNLAGGNDDFA-YDAGEFGAAMDLFLDTEETEIDFVLMGGSMA 470 (743) T ss_pred cCcccceeeeccccCccccceeeeecccccccccceEEEeecCccccc-cchhHHHHHHHHhhhccccCcceEEecCccc Confidence 00000123455665532 2444566666655 44568999999743 Q ss_pred ----cHHHHHHHHHHHHHHhhCCCeEEEEEcCCCCc---------------cHHHHHHhhhccCCceEEEecCCceecCc Q lcl|NC_019421. 236 ----DEALQETTKAWVAKNKELGKDILLFLGGKTED---------------NIKQINDKSKSFNDENIVNVGSSAYYENI 296 (473) Q Consensus 236 ----~~~~~~~l~~~v~~~~~~~~~~~av~~~~~~~---------------t~~~~~~~~~~~n~~~i~~~~~~~~~~~~ 296 (473) ...+++++.++|+++ +.|+++++.+.+. ..+....+...+++++.++++||....+. T Consensus 471 ~~~~~~~v~~a~~~~~~~~----~~~~a~~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~ 546 (743) T protein:vir:10 471 DEADTKSKATKVIAIAASR----KDALAFVSPHKGNQIASTGNVALSSAQQKENTIAFFSDLTSTSYAVFDSGYKYVYDR 546 (743) T ss_pred CccchHHHHHHHHHHHHhh----CCeEEEEecCCCccccccccccccccccchHHHHHHHhccCCeeEEEEccceeeecc Confidence 245667777777654 4588998876532 23445556667889999999999865332 Q ss_pred ---ccchHHHHHHHHHhhhcCccccccceeccC----c----ccccccCCHHHHHHHHhCCcEEEEEcCCEEEEEecccc Q lcl|NC_019421. 297 ---KYTPSEVAVYIAALSVSKGITGSICNAKTI----F----EEVEPRLSQSEVKECLKSGTLVLDFDDGDVIIVDDVNT 365 (473) Q Consensus 297 ---~~~~~~~a~~vAG~~a~~~~~~s~t~~~~~----~----~~~~~~~t~~e~~~l~~~G~~~l~~~~~~~~i~~gi~T 365 (473) .......++++||++|++|.++++|+.|.+ + .++...+++.|++.|+++|++++++.+++..++||-+| T Consensus 547 ~~~~~~~~p~s~~~AGl~a~~D~~~g~~~span~~~~gi~g~~~~~~~~~~~~~~~Ln~~gIn~i~~~~~~G~~~wG~rT 626 (743) T protein:vir:10 547 FTDKYRYIPCNGDVAGLCVQTSNQLDDWYSPAGLNRGGILNAVKLAYNPNKADRDELYQNRINPVVSLRGQGITLFGDKT 626 (743) T ss_pred ccCceeEechhHHHHHHHHHhhccCCcEEccCCeeeeeeeccccceecCChhHHHhHhhCCceEEEEecCCeEEEEcccc Confidence 222233479999999999877765554432 2 23445688999999999999999988777778899998 Q ss_pred cccCCCCCcchhhhhhhhHHHHHHHHHHHHHHhhcCCcccCCHHHHHHHHHHHHHHHHHHHhcCCccCccceeccccccC Q lcl|NC_019421. 366 FKKYVDDKNEAMGYISNIMFINTINKDTSLKRKEFVGKIFNDATGQTTVICALKKYFEELMSQGIISEFNVDIDTELQAT 445 (473) Q Consensus 366 ~~~~~~~~~~~~~~i~v~R~~d~i~~~i~~~~~~~ig~~~N~~~~r~~i~~~i~~~l~~l~~~g~i~~~~~~~D~~~~~~ 445 (473) +. ..|+.|++|++||++|+|+++|++...++++ +||++.+|.+|+..|+.||++||++|+|++|++.||.+.+++ T Consensus 627 ~~----s~d~~~~~i~vrR~~~~i~~si~~~~~~~v~-e~n~~~~~~~i~~~i~~fL~~l~~~gal~~~~V~~d~~~nt~ 701 (743) T protein:vir:10 627 AL----AAPSAFDRINVRRLFLNLEKRARRLAEGVLF-EQNDATTRAGFSSALNSYLSEVQARRGVTDYLVICDESNNTP 701 (743) T ss_pred cC----CCCcccceEeehhhHHHHHHHHHHHHHHhcc-CCCCHHHHHHHHHHHHHHHHHHHhcCceeeeEEEEcCCCCCH Confidence 73 2468999999999999999999987766665 689999999999999999999999999999999999998887 Q ss_pred CCC--CEEEEEEEEEEeeeeeeEEEEEEeC Q lcl|NC_019421. 446 AKA--DEFYWKWDAVKVDVMKKIYGTGYLG 473 (473) Q Consensus 446 ~~~--d~~~v~i~v~p~~~~e~i~~t~~v~ 473 (473) ++. ..+++++.++|+.|||||.++|.-- T Consensus 702 ~~i~~G~~~~~i~~~p~~pae~I~~~~~~~ 731 (743) T protein:vir:10 702 DIIDRNEFVAEVYVKPTRSINFITITFTAT 731 (743) T ss_pred HHhhCCeEEEEEEEEecCCcceEEEEEEEe Confidence 654 4788999999999999999998722 No 21 >protein:vir:80984 Length: 666 # NCBI annotation: gp18 tail sheath protein # Family: family:all:661 # MgeID: mge:1888 # MgeName: Phi1 # Cross-refs: genbank:acc:YP_001469499;genbank:gi:157311456;genbank:GeneID:5602117 Probab=100.00 E-value=1.6e-68 Score=392.21 Aligned_cols=454 Identities=15% Similarity=0.118 Sum_probs=291.8 Q ss_pred CceecCceeEEEecCCcceecccCceEEEEEEeeCCCCCCceEEeeccHHHHHHHcCC-CcCcHHHHHHHHHHhcCCCEE Q lcl|NC_019421. 10 ERKEIPGFYNRFKTQAEKSTNTGLKGRLAMPIRANWGDVGKVVTIKNDLRQLKNLFGD-DMNYSAFKLGKLALLGNVKEL 88 (473) Q Consensus 10 ~~~~~PGvYie~~~~~~~~i~~~~~~~~~~~g~a~~Gp~~~~v~i~s~~~~~~~~fG~-~~~~~~~~~v~~~f~~g~~~v 88 (473) -+-..|||||||+ ++.++|.+++|+++||+|.++|||+|+|++|+|+ .+|++.||. .....+.++++.+|+|||++| T Consensus 1 ~~~~~Pgvyv~e~-~~~~~~~~~~t~~~~~vg~~~~gp~~~p~~i~~~-~~~~~~fg~~~~~~~~~~~~~~~f~~~g~~~ 78 (666) T protein:vir:80 1 MTLLSPGFETKET-TLSTTIVQSATGRAALVGKFQWGPAFQIIQVTNE-VELVNKFGQPDNNTADYFMSGANFLQYGNDL 78 (666) T ss_pred CceecCceEEEEe-cCCccccccCcccceEEeccccCCCccceEecCH-HHHHHhcCCccCccchHHHHHHHHhcCCCeE Confidence 4567799999999 5888999999999999999999999999999995 557777875 434456689999999999999 Q ss_pred EEEecCCCcccceeeeeccc------ccc-------------------cccceEEE------------------------ Q lcl|NC_019421. 89 LLYRLVDGNQKKGTLTLKDT------TEN-------------------SAKDVIKL------------------------ 119 (473) Q Consensus 89 ~v~rv~~g~~~aat~~l~~~------~~~-------------------~~~~~l~i------------------------ 119 (473) ||+|+.++.+.++.....+. ... .......+ T Consensus 79 ~v~R~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~i~~~~~~~~~~~~ta~~~~~a~~~~ 158 (666) T protein:vir:80 79 RVVRVLNKEKAKNATALAGNIEFEITNEGSNYEVGDTIKIKHNRQDIETAGKVTKVDGDGKVKGVFIPTGKIIAHAKAIG 158 (666) T ss_pred EEEEecCccccccccccccceeEEEeeccccccccccccccccCcccccCcceEEEeecceeeeeecchhhhcccccccc Confidence 99999754332211110000 000 00000000 Q ss_pred ----------------------------------------------------------------EecCccccceeEEEee Q lcl|NC_019421. 120 ----------------------------------------------------------------ETKYPTARNFNVTIKS 135 (473) Q Consensus 120 ----------------------------------------------------------------~A~~~G~~~n~i~v~~ 135 (473) .+.++|.|++.+++.. T Consensus 159 ~~~~v~~~~~~~~~~~~~~~~~a~~V~~~~~~~~~~~~~~~~a~~~~t~~~~~~~~~~~~~~a~~a~~~g~~g~~l~v~i 238 (666) T protein:vir:80 159 VYPELDGDWTAEFTSSSGNGSAALSVTKIVTDSGLLLTDLETSRANITNQTFLTKLQKYDMPAVSAIYAGEIGNSLEVEI 238 (666) T ss_pred ccceeeccceeeeccccccceeeeeeeeeecCCccceeeeccccccccccccccccccccchhhhhhcccccccceeeee Confidence 0111122222111110 Q ss_pred ccC-------------------------------CccceeeeeecCCceeeEEEecccchhhh--------hhhhhcccc Q lcl|NC_019421. 136 NLV-------------------------------DSDKKDFIFFENTKQLFSSSIKGTIDEIV--------LEINSNLDN 176 (473) Q Consensus 136 ~~~-------------------------------~~~~~~v~v~~~~~~~~~~~~~~~~~~~~--------~~~~~~~~s 176 (473) ... +...+.+.+...+..++++.+....+... ........+ T Consensus 239 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 318 (666) T protein:vir:80 239 LARSAFKNTAPDLTMYPYGGERTAARNLIPYAPQNDNQYAFIVRRDGVVVESYVLSTLKGDKDVYGNSIYMDDFFGRGSS 318 (666) T ss_pred ccccccccccccceeeeccccccccceeeeeccccccceeeEeccCCccceeeecccccccccccchhhhhhhhhccccc Confidence 000 00000011111111111111110000000 000000011 Q ss_pred cceeEeecccCCccccccceeeeccCcccccchh----hH-------HHHHHHHhhcccceEEEEEcCCC-----cHHHH Q lcl|NC_019421. 177 EYVIATKVADSDTILANVVNQALEGGNDGCTSIT----NE-------SYLKALEEFERYSFDSFVLDGVA-----DEALQ 240 (473) Q Consensus 177 ~~v~~~~~~~~~~~~~~~~~~~l~gG~dg~~~~t----~~-------d~~~~l~~le~~~~~~l~~p~~~-----~~~~~ 240 (473) .++... .....+.......+.+|.++....+ .. .....+...+.++++++++|+.. ...++ T Consensus 319 ~~~~~~---~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~l~~p~~~~~~~~~~~v~ 395 (666) T protein:vir:80 319 QYIYAT---AQGWVDGFSGIISLAGGVSANEATTGGVGADPFIGAMMQGWGLFAERESIHVNLLIAGACAGEGDAFSTVQ 395 (666) T ss_pred eeeeec---ccccccccceEEEecCCCCcccccccccccccccccchhhhhhhhhhcccccceEeecCcCCcccchHHHH Confidence 111111 1111112222334556655322111 11 12233444466789999999754 35678 Q ss_pred HHHHHHHHHHhhC----CCeEEEEEcCCCCccHHHHHHhhh----------ccCCceEEEecCCceecCc---ccchHHH Q lcl|NC_019421. 241 ETTKAWVAKNKEL----GKDILLFLGGKTEDNIKQINDKSK----------SFNDENIVNVGSSAYYENI---KYTPSEV 303 (473) Q Consensus 241 ~~l~~~v~~~~~~----~~~~~av~~~~~~~t~~~~~~~~~----------~~n~~~i~~~~~~~~~~~~---~~~~~~~ 303 (473) ..+.+||+++++. ...+.+++..+...+.+.+..+.. .+++.+.++++||....+. ....... T Consensus 396 ~~~~~~~~~~~~~~~~~~~~~~~~vd~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~l~~p~~~~~d~~~~~~~~~p~ 475 (666) T protein:vir:80 396 KHAVSIGDERQDCLVMVSPPRSTVVNIPVTTAIDNLIAWREGSGNYNENNMNINTTYAVIDGNYKYQYDKYNDVNRWVPL 475 (666) T ss_pred HHHHHHHHhhcceEEEeecceeEEeecCCCCCHHHHHHHHHhcccchhhhcccCcceEEEEcCceEEecccCCceeEech Confidence 8888888877642 223455666666778888877664 3789999999999866432 2222334 Q ss_pred HHHHHHhhhcCccccccceeccC--------cccccccCCHHHHHHHHhCCcEEEEEcCCEEEEEecccccccCCCCCcc Q lcl|NC_019421. 304 AVYIAALSVSKGITGSICNAKTI--------FEEVEPRLSQSEVKECLKSGTLVLDFDDGDVIIVDDVNTFKKYVDDKNE 375 (473) Q Consensus 304 a~~vAG~~a~~~~~~s~t~~~~~--------~~~~~~~~t~~e~~~l~~~G~~~l~~~~~~~~i~~gi~T~~~~~~~~~~ 375 (473) ++++||++|++|..+++|+.|.+ ..++...+++.|++.|+++|++++++.+++..++||-+|+.+ .++ T Consensus 476 sg~~AGl~Ar~D~~~g~~~sPan~~~~~i~g~~~~~~~~~~~~~~~Ln~~gIn~i~~~~g~G~~~wG~rT~~~----~~s 551 (666) T protein:vir:80 476 AADIAGLCARTDAVSQPWMSPAGYNRGQIMNVVKLAIEPRKAHRDRLYQAAINPVIGAGGEGFILMGDKTATT----VPS 551 (666) T ss_pred HHHHHHHHHHHhhcCCceEccCCeecceeeccccceeecChhHHHhhhhCCeeEEEEeCCCeEEEEccccCCC----CCc Confidence 89999999999988766655432 224556789999999999999999988887888899988642 245 Q ss_pred hhhhhhhhHHHHHHHHHHHHHHhhcCCcccCCHHHHHHHHHHHHHHHHHHHhcCCccCccceeccccccCCCCC--EEEE Q lcl|NC_019421. 376 AMGYISNIMFINTINKDTSLKRKEFVGKIFNDATGQTTVICALKKYFEELMSQGIISEFNVDIDTELQATAKAD--EFYW 453 (473) Q Consensus 376 ~~~~i~v~R~~d~i~~~i~~~~~~~ig~~~N~~~~r~~i~~~i~~~l~~l~~~g~i~~~~~~~D~~~~~~~~~d--~~~v 453 (473) +|++|++||++++|+++|++..+++++ +|||+.+|.+|+..|+.||++||++|+|.+|++.||.+.|++++.+ .+++ T Consensus 552 ~~~~i~vRRl~~~i~~si~~~~~~~v~-epn~~~l~~~i~~~i~~~L~~l~~~gal~g~~V~~d~~~nt~~di~~G~~~~ 630 (666) T protein:vir:80 552 PFDRINVRRLFNMLKKNIGDSSKYKLF-ENNDNFTRASFRMEVSQYLSTIRSLGGIYDFRVQCDTTNNTPDVIDRNEFVA 630 (666) T ss_pred ccceeehhhHHHHHHHHHHHHHHHhcc-CCCCHHHHHHHHHHHHHHHHHHHhcCceeeeEEEEcCCCCCHHHhhCCeEEE Confidence 899999999999999999877755555 6999999999999999999999999999999999999988877654 7888 Q ss_pred EEEEEEeeeeeeEEEEEEeC Q lcl|NC_019421. 454 KWDAVKVDVMKKIYGTGYLG 473 (473) Q Consensus 454 ~i~v~p~~~~e~i~~t~~v~ 473 (473) ++.++|+.|||||.++|.-- T Consensus 631 ~i~~~P~~Pae~I~~~~~~~ 650 (666) T protein:vir:80 631 SMFIKPAKSINYIMLNFTAV 650 (666) T ss_pred EEEEEecCCcceEEEEEEEe Confidence 89999999999999998743 No 22 >protein:vir:6594 Length: 666 # NCBI annotation: tail sheath protein # Family: family:all:661 # MgeID: mge:139 # MgeName: RB49 # Cross-refs: genbank:acc:NP_891725;genbank:gi:33620670;genbank:GeneID:1725344 Probab=100.00 E-value=2.3e-68 Score=391.42 Aligned_cols=442 Identities=16% Similarity=0.128 Sum_probs=291.6 Q ss_pred CceecCceeEEEecCCcceecccCceEEEEEEeeCCCCCCceEEeeccHHHHHHHcCC-CcCcHHHHHHHHHHhcCCCEE Q lcl|NC_019421. 10 ERKEIPGFYNRFKTQAEKSTNTGLKGRLAMPIRANWGDVGKVVTIKNDLRQLKNLFGD-DMNYSAFKLGKLALLGNVKEL 88 (473) Q Consensus 10 ~~~~~PGvYie~~~~~~~~i~~~~~~~~~~~g~a~~Gp~~~~v~i~s~~~~~~~~fG~-~~~~~~~~~v~~~f~~g~~~v 88 (473) -+-..|||||||+ ++.++|++++|+++||+|.++|||+|+|++|+|+ .+|++.||. .....+.++++.||+|||++| T Consensus 1 ~~~~~PgVyv~e~-~~~~~i~~v~ts~~~fvG~~~~Gp~~~p~~v~s~-~~~~~~fG~~~~~~~~~~~~~~~f~ngg~~~ 78 (666) T protein:vir:65 1 MTLLSPGFETKET-TLSTTIVQSETGRAALVGKFQWGPAFQIIQVTNE-VELVNKFGQPDNNTADYFMSGANFLQYGNDL 78 (666) T ss_pred CceecCceEEEEe-cCcccccccCcccceEEecccCCCCccCEEecCH-HHHHHHcCCccccchhHHHHHHHHHhcCceE Confidence 5567799999999 5888999999999999999999999999999995 557777775 444556789999999999999 Q ss_pred EEEecCCCcccceeeeecccccccccceEEEEecCc---cccceeEEEeeccC--------------------------- Q lcl|NC_019421. 89 LLYRLVDGNQKKGTLTLKDTTENSAKDVIKLETKYP---TARNFNVTIKSNLV--------------------------- 138 (473) Q Consensus 89 ~v~rv~~g~~~aat~~l~~~~~~~~~~~l~i~A~~~---G~~~n~i~v~~~~~--------------------------- 138 (473) ||+|+.++...++...+.+ .+.+++..+ +.|++.+.|+.+.. T Consensus 79 ~vvrv~~~~~~~~~~~~~~--------~~~~~~~~~g~~~~~g~~~~V~~~~~~~~~~~~~~~~~~~~~~~g~~~~t~~~ 150 (666) T protein:vir:65 79 RVVRVLNKEKAKNATALAG--------NVEFEITNEGSNYEVGDTIKIKHNRQDIETAGKVTKVDGDGKVKGVFIPTGKI 150 (666) T ss_pred EEEEccCcccccccccccC--------ceeeeEeeccccccccceEEEEeccccccccccccccccccccccccccccee Confidence 9999976544433222211 111111111 11222222210000 Q ss_pred ---------------------------------------Cc--------------------------------------- Q lcl|NC_019421. 139 ---------------------------------------DS--------------------------------------- 140 (473) Q Consensus 139 ---------------------------------------~~--------------------------------------- 140 (473) +. T Consensus 151 ~~~~~~~g~~~~l~~~~~~~~~~~~~~~~~a~sv~~~~~~g~~~~~~~~~a~~~~~~~~~~~~~~~~~~~a~~A~~~g~~ 230 (666) T protein:vir:65 151 IAHAKAIGVYPELDGGWTAEFTSSSGNGSAALSVTKIVTDSGLLLTDLETSRANITNQTFLTKLKKYDMPAVSAIYAGEI 230 (666) T ss_pred eccccccCcceeEeeccceeecccCcccccceeeeecccccceeeeeecccccccccccccccccccccceeeeeecccc Confidence 00 Q ss_pred --------------------------------------------cceeeeeecCCceeeEEEecccchhh-----h---h Q lcl|NC_019421. 141 --------------------------------------------DKKDFIFFENTKQLFSSSIKGTIDEI-----V---L 168 (473) Q Consensus 141 --------------------------------------------~~~~v~v~~~~~~~~~~~~~~~~~~~-----~---~ 168 (473) ..|.+.+...+...+++......+.. . . T Consensus 231 g~~i~v~i~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~e~~~~~~~~~~~~~~~~~~~~~ 310 (666) T protein:vir:65 231 GNSLEVEILARSAFKNTAPDLTMYPYGGERTAARNLIPYAPQNDNQYAFIVRRDGVVVESYVLSTLKGDKDVYGNSIYMD 310 (666) T ss_pred ccceeEEeecccccccccccccccccccccccceeeecccccccccceeeeecCCcccceeecccCcccccccchhhhhh Confidence 00000000000000000000000000 0 0 Q ss_pred hhhhcccccceeEeecccCCccccccceeeeccCcccccchh--------hHHHHHHHHhhc---ccceEEEEEcCCC-- Q lcl|NC_019421. 169 EINSNLDNEYVIATKVADSDTILANVVNQALEGGNDGCTSIT--------NESYLKALEEFE---RYSFDSFVLDGVA-- 235 (473) Q Consensus 169 ~~~~~~~s~~v~~~~~~~~~~~~~~~~~~~l~gG~dg~~~~t--------~~d~~~~l~~le---~~~~~~l~~p~~~-- 235 (473) .......+.++.+...... ........+.+|.++..+.+ ..++..++.+++ ...++++++|+.. T Consensus 311 ~~~~~~~~~~v~~~~~~~~---~~~~~~~~~~~g~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~l~~p~~~~~ 387 (666) T protein:vir:65 311 DFFARGSSQYIYATAQGWV---DGFSGIISLAGGVSANEATTGGVGADPFIGAMMQGWDLFAERESIHVNLLIAGACAGE 387 (666) T ss_pred hhhcccccceeeeeccccc---ccccceEEccCCCCcCcccccccccccccccHHHHHHHHhhhhhccCCceeecCcCCc Confidence 0000001111111111000 01112234566666543221 123445666554 3468999999754 Q ss_pred ---cHHHHHHHHHHHHHHhhCCCeEEEEEc--------CCCCccHHHHHHhhhc----------cCCceEEEecCCceec Q lcl|NC_019421. 236 ---DEALQETTKAWVAKNKELGKDILLFLG--------GKTEDNIKQINDKSKS----------FNDENIVNVGSSAYYE 294 (473) Q Consensus 236 ---~~~~~~~l~~~v~~~~~~~~~~~av~~--------~~~~~t~~~~~~~~~~----------~n~~~i~~~~~~~~~~ 294 (473) +..+++++.+||+++++ |+++++ .+...+.+.+.++... ++|.+.++++||.+.. T Consensus 388 ~~~~~~v~~~l~~~~~~~~~----~~a~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~ 463 (666) T protein:vir:65 388 GDAFSTVQKHAVSIGDERQD----CLVMVSPPRSTVVNIPVTTAIDNLIAWREGSGNYNENNMNINTTYAVIDGNYKYQY 463 (666) T ss_pred cchhHHHHHHHHHHHhhccc----eEEEeccccceeeecCCCCCHHHHHHHHHhcccccccccccCcceEEEEcCceEEe Confidence 35677888888877654 445444 4456777777776543 7899999999998653 Q ss_pred Cc---ccchHHHHHHHHHhhhcCccccccceeccC--------cccccccCCHHHHHHHHhCCcEEEEEcCCEEEEEecc Q lcl|NC_019421. 295 NI---KYTPSEVAVYIAALSVSKGITGSICNAKTI--------FEEVEPRLSQSEVKECLKSGTLVLDFDDGDVIIVDDV 363 (473) Q Consensus 295 ~~---~~~~~~~a~~vAG~~a~~~~~~s~t~~~~~--------~~~~~~~~t~~e~~~l~~~G~~~l~~~~~~~~i~~gi 363 (473) +. .......++++||++|++|.++++|+.|.+ ..++...+++.|++.|+++|++++++.+++..++||- T Consensus 464 d~~~~~~~~~p~sg~vAGl~Ar~D~~~g~~~span~~~~~i~g~~~~~~~~~~~~~~~Ln~~gIn~i~~~~~~G~~~wG~ 543 (666) T protein:vir:65 464 DKYNDVNRWVPLAADIAGLCARTDAVSQPWMSPAGYNRGQIMNVVKLAIEPRKAHRDRLYQAAINPVIGAGGEGFILMGD 543 (666) T ss_pred cccCCceeEechHHHHHHHHHHHhccCCcEEccCCeecceeeccccceeecChhHHHhhhhCCceEEEEeCCCeEEEEec Confidence 32 222233579999999999988776665543 2245567899999999999999999988888888999 Q ss_pred cccccCCCCCcchhhhhhhhHHHHHHHHHHHHHHhhcCCcccCCHHHHHHHHHHHHHHHHHHHhcCCccCccceeccccc Q lcl|NC_019421. 364 NTFKKYVDDKNEAMGYISNIMFINTINKDTSLKRKEFVGKIFNDATGQTTVICALKKYFEELMSQGIISEFNVDIDTELQ 443 (473) Q Consensus 364 ~T~~~~~~~~~~~~~~i~v~R~~d~i~~~i~~~~~~~ig~~~N~~~~r~~i~~~i~~~l~~l~~~g~i~~~~~~~D~~~~ 443 (473) +|+.+ .+++|++|++||++++|+++|+....++++ +||++.+|.+|+..|+.||++|+++|+|.+|++.||.+.+ T Consensus 544 rT~~~----~~s~~~~i~vrR~~~~i~~si~~~~~~~v~-epn~~~l~~~i~~~i~~~L~~l~~~gal~g~~V~~d~~~n 618 (666) T protein:vir:65 544 KTATT----VPSPFDRINVRRLFNMLKKNIGDSSKYKLF-ENNDNFTRASFRMEVSQYLSTIRSLGGIYDFRVQCDTTNN 618 (666) T ss_pred ccCCC----CCcccceEehhhHHHHHHHHHHHHHHHhcc-CCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEcCCCC Confidence 98742 345899999999999999999987766666 5899999999999999999999999999999999999988 Q ss_pred cCCCC--CEEEEEEEEEEeeeeeeEEEEEEeC Q lcl|NC_019421. 444 ATAKA--DEFYWKWDAVKVDVMKKIYGTGYLG 473 (473) Q Consensus 444 ~~~~~--d~~~v~i~v~p~~~~e~i~~t~~v~ 473 (473) ++++. ..+++++.++|+.|||||.++|.-- T Consensus 619 t~~~i~~G~~~~~i~~~p~~pae~i~~~~~~~ 650 (666) T protein:vir:65 619 TPDVIDRNEFVASMFIKPAKSINYIMLNFTAV 650 (666) T ss_pred CHHHhhCCeEEEEEEEEecCCcceEEEEEEEe Confidence 87754 4788889999999999999998765 No 23 >protein:vir:100539 Length: 663 # NCBI annotation: gp18 tail sheath monomer # Family: family:all:661 # MgeID: mge:1488 # MgeName: 25 # Cross-refs: genbank:acc:YP_656380;genbank:gi:109290131;genbank:GeneID:4156517 Probab=100.00 E-value=2e-68 Score=391.68 Aligned_cols=450 Identities=16% Similarity=0.134 Sum_probs=291.9 Q ss_pred CceecCceeEEEecCCcceecccCceEEEEEEeeCCCCCCceEEeeccHHHHHHHcCC-CcCcHHHHHHHHHHhcCCCEE Q lcl|NC_019421. 10 ERKEIPGFYNRFKTQAEKSTNTGLKGRLAMPIRANWGDVGKVVTIKNDLRQLKNLFGD-DMNYSAFKLGKLALLGNVKEL 88 (473) Q Consensus 10 ~~~~~PGvYie~~~~~~~~i~~~~~~~~~~~g~a~~Gp~~~~v~i~s~~~~~~~~fG~-~~~~~~~~~v~~~f~~g~~~v 88 (473) -+-..|||||||++ +.++|.+++|+++||+|.++|||+|+|++|+|+ .++.+.||. .....+.++++.||+|||++| T Consensus 1 ~~~~~Pgvyv~e~~-~~~~~~~v~t~~~~fvG~~~~gp~~~p~~i~s~-~~~~~~fG~~~~~~~~~~~v~~~f~ngg~~~ 78 (663) T protein:vir:10 1 MALLSPGIEMKETS-INSTVVRSATGRAALVGKFAWGPAYEIRQVTNE-VELVDMFGSPDNVTAPYFMSAMNFLQYGNDL 78 (663) T ss_pred CccccCceEEEEec-CcccccccccccceeeeccccCCCCcCEEecCH-HHHHHHcCCcccccchHHHHHHHHHhCCCeE Confidence 45677999999995 788999999999999999999999999999995 557777875 434455689999999999999 Q ss_pred EEEecCCCcccceeeeeccccc-------ccc--cce----------------EE------------------------- Q lcl|NC_019421. 89 LLYRLVDGNQKKGTLTLKDTTE-------NSA--KDV----------------IK------------------------- 118 (473) Q Consensus 89 ~v~rv~~g~~~aat~~l~~~~~-------~~~--~~~----------------l~------------------------- 118 (473) ||+|+.++.+.++...+++... ... ... +. T Consensus 79 ~vvRv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~v~~~~~~~~~~~~~~~~~~~g~~~~~~~~~a~~~~~a~~~~ 158 (663) T protein:vir:10 79 RLVRVIDMEQAKNASPLFNQIEVTITTEGQGYTVGDTVSIKHNTTTVTEEGKVTKVDADGKIKALFVPSSAVIAKAKQLG 158 (663) T ss_pred EEEecCCcccccccccccccceeeEeecccCccccceeeecccccccccCcceeeeccCCceeEEEeccccccccccccc Confidence 9999987544332222211000 000 000 00 Q ss_pred ---------------------------------------------------------------EEecCccccceeEEEee Q lcl|NC_019421. 119 ---------------------------------------------------------------LETKYPTARNFNVTIKS 135 (473) Q Consensus 119 ---------------------------------------------------------------i~A~~~G~~~n~i~v~~ 135 (473) +.+..+|.|++.+.+.. T Consensus 159 ~~~~~~~a~~~~v~~~~~~~~~a~av~~i~~dg~vt~~~~~~a~~~~~~~~~~~~~~~~~~~~~~a~~~g~~G~~i~v~~ 238 (663) T protein:vir:10 159 TYPVLGDNWRAEVSGASGGSAATLTLGGIVVDSGVTFGNSEEAPDVMTSTKVLANFAKYGMPLISAVYPGEIGSTVEVEV 238 (663) T ss_pred cccccccceeeEEeeccccccccceeEeeecCCceeEEeeeccccccccceeeeeccccccceeeeecccccCcceeEee Confidence 00011111111111110 Q ss_pred ccCC-------------------------------ccceeeeeecCCceeeEEEecccchhh--------hhhhhhcccc Q lcl|NC_019421. 136 NLVD-------------------------------SDKKDFIFFENTKQLFSSSIKGTIDEI--------VLEINSNLDN 176 (473) Q Consensus 136 ~~~~-------------------------------~~~~~v~v~~~~~~~~~~~~~~~~~~~--------~~~~~~~~~s 176 (473) ...+ ...+.+++..++...+.+......+.. ......+..+ T Consensus 239 ~~~~~~~~~~~~~v~~~~g~~~~~~~~~~~~g~~~~~~~~~~~~~~g~~~e~~~ls~~~~~~~~~~~~~~~~~~~~~~~s 318 (663) T protein:vir:10 239 ISKTAFQSGAAQPIYPFGGTRASNARSVIQYGPMTDDQFAIIVRRDGIVVESTVLSTRRGDRDVYGNNIFMDDYFRNGSS 318 (663) T ss_pred cccccccccceeeecccCcccccccccccccccccchhhcccccCCCcccceeeeeccccccccchhhhhhhhhhcCccc Confidence 0000 000000000000000000000000000 0000001122 Q ss_pred cceeEeecccCCccccccceeeeccCcccccchhhHHHHHHHHhhcc---cce-EEEEEcCCCc-----HHHHHHHHHHH Q lcl|NC_019421. 177 EYVIATKVADSDTILANVVNQALEGGNDGCTSITNESYLKALEEFER---YSF-DSFVLDGVAD-----EALQETTKAWV 247 (473) Q Consensus 177 ~~v~~~~~~~~~~~~~~~~~~~l~gG~dg~~~~t~~d~~~~l~~le~---~~~-~~l~~p~~~~-----~~~~~~l~~~v 247 (473) .++.+... ...........+.+|.++...++..++..+++.|.. .+. .+++.|...+ ..+++++.+|| T Consensus 319 ~~v~~~~~---~~~~~~~~~~~l~gg~~~~~~~~~~d~~~~~~~l~~~~~~~~~~~i~~~~~~~~~~~~~~v~~~l~~~~ 395 (663) T protein:vir:10 319 NFIYASSV---NWPAGFTGIIQLGGGASANNAVGSDELIAGWDLFADREALHVNLMIAGACKSDGVAVASTVQKHVVALA 395 (663) T ss_pred ceeEeecc---ccCcccceeEEecccccCcccchhhhhhhHHhhhccccccCceEEEeecCCCCchhhHHHHHHHHHHHH Confidence 22222211 111122233578899998887888888877766643 444 3444444332 23455555555 Q ss_pred HHHhhCCCeEEEEEcCCCCccH--------HHHHH-------------hhhccCCceEEEecCCceecCc---ccchHHH Q lcl|NC_019421. 248 AKNKELGKDILLFLGGKTEDNI--------KQIND-------------KSKSFNDENIVNVGSSAYYENI---KYTPSEV 303 (473) Q Consensus 248 ~~~~~~~~~~~av~~~~~~~t~--------~~~~~-------------~~~~~n~~~i~~~~~~~~~~~~---~~~~~~~ 303 (473) ++ .+.|+++++++...+. +.+.. ....+++.+.++++||....+. ....... T Consensus 396 ~~----~~~~~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~l~~p~~~~~d~~~~~~~~~p~ 471 (663) T protein:vir:10 396 DD----RQDCVAFVNPPSELLVGVPTTQAVKNIVEWRNGVTTGGEVVDNNMNISSTYAFISGNYKYQYDKYNDINRWVPL 471 (663) T ss_pred Hh----hCCEEEEEecCcccccccchhhhHHHHHHHhhhccccchhhhhhcccCcceEEEEecceeEecccCCceEEech Confidence 54 4568999988755332 22332 2345789999999998765432 1222234 Q ss_pred HHHHHHhhhcCccccccceeccC--------cccccccCCHHHHHHHHhCCcEEEEEc-CCEEEEEecccccccCCCCCc Q lcl|NC_019421. 304 AVYIAALSVSKGITGSICNAKTI--------FEEVEPRLSQSEVKECLKSGTLVLDFD-DGDVIIVDDVNTFKKYVDDKN 374 (473) Q Consensus 304 a~~vAG~~a~~~~~~s~t~~~~~--------~~~~~~~~t~~e~~~l~~~G~~~l~~~-~~~~~i~~gi~T~~~~~~~~~ 374 (473) ++++||++|++|.++++|+.|.+ ..++...+++.|++.|+++|+++++.. +++..++||.+|+.+ .+ T Consensus 472 s~~vAGl~Ar~D~~~g~~~span~~~~~i~g~~~~~~~~~~~~~~~Ln~~gin~i~~~~~~~G~~~wG~rT~s~----~~ 547 (663) T protein:vir:10 472 SADIAGLCAYTDQVGHPWMSPAGYRRGQLRNTIKLAIEPKQSLRDTMYQVSINPVTGFAGGDGFVLFGDKMATQ----VP 547 (663) T ss_pred HHHHHHHHHHhhccCCcEEccCCeeecceeccccceeecCchhHHHHHhCCCcEEEEeeCCCcEEEEcccccCC----CC Confidence 79999999999987777666543 234556789999999999999999875 457888899999642 34 Q ss_pred chhhhhhhhHHHHHHHHHHHHHHhhcCCcccCCHHHHHHHHHHHHHHHHHHHhcCCccCccceeccccccCCCC--CEEE Q lcl|NC_019421. 375 EAMGYISNIMFINTINKDTSLKRKEFVGKIFNDATGQTTVICALKKYFEELMSQGIISEFNVDIDTELQATAKA--DEFY 452 (473) Q Consensus 375 ~~~~~i~v~R~~d~i~~~i~~~~~~~ig~~~N~~~~r~~i~~~i~~~l~~l~~~g~i~~~~~~~D~~~~~~~~~--d~~~ 452 (473) ..|++|++||++++|+++|+...+++++ +||++.+|..|+..|+.||++||++|+|.+|++.||++.|++++. ..++ T Consensus 548 s~~~~i~vrR~~~~i~~si~~~~~~~v~-epn~~~l~~~i~~~i~~~L~~l~~~gal~gf~V~~d~~~nt~~~i~~G~~~ 626 (663) T protein:vir:10 548 SPFDRINVRRLFNMLKKNIGDTSKYELF-ENNDAFTRQSFRMEVSQYLDNIRSLGGVYDFRVVCDTTNNTPQVIDSNEFV 626 (663) T ss_pred cccceEehhhHHHHHHHHHHHHHHHhcc-CCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEcCCCCCHHHhhCCeEE Confidence 5899999999999999999987777776 489999999999999999999999999999999999998888764 4788 Q ss_pred EEEEEEEeeeeeeEEEEEEeC Q lcl|NC_019421. 453 WKWDAVKVDVMKKIYGTGYLG 473 (473) Q Consensus 453 v~i~v~p~~~~e~i~~t~~v~ 473 (473) +++.++|+.|+|||.++|..- T Consensus 627 ~~i~~~p~~pae~I~~~~~~~ 647 (663) T protein:vir:10 627 ATIYIKAPRSINYITLNFVAT 647 (663) T ss_pred EEEEEEecCCcceEEEEEEEE Confidence 999999999999999998866 No 24 >protein:vir:98263 Length: 664 # NCBI annotation: gp18 tail sheath monomer # Family: family:all:661 # MgeID: mge:1667 # MgeName: RB43 # Cross-refs: genbank:acc:YP_239196;genbank:gi:66391671;genbank:GeneID:3416365 Probab=100.00 E-value=4.3e-68 Score=389.90 Aligned_cols=450 Identities=14% Similarity=0.106 Sum_probs=295.9 Q ss_pred CCccccCCCCceecCceeEEEecCCcceecccCceEEEEEEeeCCCCCCceEEeeccHHHHHHHcCC-CcCcHHHHHHHH Q lcl|NC_019421. 1 MATGTWNEKERKEIPGFYNRFKTQAEKSTNTGLKGRLAMPIRANWGDVGKVVTIKNDLRQLKNLFGD-DMNYSAFKLGKL 79 (473) Q Consensus 1 m~~g~~~~~~~~~~PGvYie~~~~~~~~i~~~~~~~~~~~g~a~~Gp~~~~v~i~s~~~~~~~~fG~-~~~~~~~~~v~~ 79 (473) |+ +. .|||||||++ ++++|++++|+++||+|.++|||+|+|++|+|+ .+|++.||. +....+.++++. T Consensus 1 ma---~~------~PgVyv~E~~-~~~~i~~~~ts~~~~vG~~~~Gp~~~p~~i~s~-~d~~~~fG~~~~~~~~~~~v~~ 69 (664) T protein:vir:98 1 MA---LQ------SPGIETKETS-VQSTVVRNSTGRAAIVGKFSWGPAYQIRQISNE-VELVNYFGAPDNLTADYFMSAV 69 (664) T ss_pred Cc---ee------cCceEEEecC-CCcccccccccceEEEeeccCCCCCccEEecCH-HHHHHhcCCccccchhHHHHHH Confidence 44 32 5999999995 899999999999999999999999999999995 557777775 444456689999 Q ss_pred HHhcCCCEEEEEecCCCcccceeeeecc--------ccc------------ccccceEEEEecCccccceeEEEeeccCC Q lcl|NC_019421. 80 ALLGNVKELLLYRLVDGNQKKGTLTLKD--------TTE------------NSAKDVIKLETKYPTARNFNVTIKSNLVD 139 (473) Q Consensus 80 ~f~~g~~~v~v~rv~~g~~~aat~~l~~--------~~~------------~~~~~~l~i~A~~~G~~~n~i~v~~~~~~ 139 (473) ||+|||++||++|+.++.+.++...+.. ... ......-...+..+|.|+|.+++...... T Consensus 70 ~f~ngg~~~~vvRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~gn~~~v~i~~~~ 149 (664) T protein:vir:98 70 NFLQYGNDLRLVRVVDKDAAKNASAIFNQIKTTIASQGSNYNVGDVIKVKYSNTVVEENGKVTQVDSNGKILAVTIPKRK 149 (664) T ss_pred HHHhcCCeEEEEEecCccccccccccccccceeeccCCcccccccceeEeecCcccccceeeccCCCCCceeeEeeccCc Confidence 9999999999999976543322211111 000 00001112246678888887765321000 Q ss_pred cc-----------------ceee--------------------eeecC-----------Cc-----------eeeE-E-- Q lcl|NC_019421. 140 SD-----------------KKDF--------------------IFFEN-----------TK-----------QLFS-S-- 157 (473) Q Consensus 140 ~~-----------------~~~v--------------------~v~~~-----------~~-----------~~~~-~-- 157 (473) .. .-.+ .+... .. .... . T Consensus 150 ~~~~~~~~~~~~~~~~~~~~~s~~~~s~g~a~a~~v~~v~~d~~~~~~~~~~a~~~i~~~~~~~~~~~~~~~~~~a~~~G 229 (664) T protein:vir:98 150 KSLLVLNRSVLTQIFLLVGTTEIVSQSSGVSASITIDGIESDSGITLLNLDIAKETIQGTSFQTLTQKYQIPSVVALYPG 229 (664) T ss_pred cceeecccccccccceecccceeeeeecccceeeecccccccceeeccccceeeeccccccceeeeeccccceeeeeecc Confidence 00 0000 00000 00 0000 0 Q ss_pred --------Ee-c-ccc-----------------------------hhhhh---hhhh----------------------- Q lcl|NC_019421. 158 --------SI-K-GTI-----------------------------DEIVL---EINS----------------------- 172 (473) Q Consensus 158 --------~~-~-~~~-----------------------------~~~~~---~~~~----------------------- 172 (473) .. . ... ++... .... T Consensus 230 ~~Gn~isv~i~s~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~ 309 (664) T protein:vir:98 230 ELGSTVQVEIISKAAYDTGAMISGYPSGISVKNSGRSVMTYGPQTDNQYAFVVRRGGIVQESFIVSTDKTDKDIYGVNIY 309 (664) T ss_pred cccceeeeeecccccccCcceEeeccCceecccceeeeeeccccCccceeEEEecCCceeeeEEeecccCcccceeeeee Confidence 00 0 000 00000 0000 Q ss_pred ------cccccceeEeecccCCccccccceeeeccCcccccchhhHHHHHHHHhhcc---cceEEEEEcCCCcH------ Q lcl|NC_019421. 173 ------NLDNEYVIATKVADSDTILANVVNQALEGGNDGCTSITNESYLKALEEFER---YSFDSFVLDGVADE------ 237 (473) Q Consensus 173 ------~~~s~~v~~~~~~~~~~~~~~~~~~~l~gG~dg~~~~t~~d~~~~l~~le~---~~~~~l~~p~~~~~------ 237 (473) ...+.++... .............+.+|.+....++..+..++|.+|+. .+.++|++|+.... T Consensus 310 ~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~gg~~~~~~~g~~~~~tgl~~l~~~~~~~~~ll~~p~~~~~~~~~~~ 386 (664) T protein:vir:98 310 MDDFFANGGSQYVFGT---SMNWPKGFSGILEFGGGLSSNDTVGADELMTGWDMFADREALHVPLLIAGGCAGESVEIAS 386 (664) T ss_pred chhheecccceeeeee---cccCCcccceeEeccCccccccccCchhHHHHHHhhhcccccccceEEecCCCCCcHHHHH Confidence 0000000000 00000111223456677766555555667777777754 45799999985432 Q ss_pred HHHHHHHHHHHHHhhCCCeEEEEEcCC--------CCccHHHHHHhhh--------------ccCCceEEEecCCceecC Q lcl|NC_019421. 238 ALQETTKAWVAKNKELGKDILLFLGGK--------TEDNIKQINDKSK--------------SFNDENIVNVGSSAYYEN 295 (473) Q Consensus 238 ~~~~~l~~~v~~~~~~~~~~~av~~~~--------~~~t~~~~~~~~~--------------~~n~~~i~~~~~~~~~~~ 295 (473) .++.++.+||++++ .|++++..+ ...+.+.+.++.. .+++.+.++++||....+ T Consensus 387 ~v~~al~~~a~~~~----~~~a~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~l~~p~~~~~d 462 (664) T protein:vir:98 387 TVQKHVISIGDERQ----DCTVFVSPPRSLLVNIPLATAVDNIVEWRTGYKISGGTPVDNNLNVSSSYGFLDGNYKYQYD 462 (664) T ss_pred HHHHHHHHHHHhcC----CeEEEEccccceeccCCccccHHHHHHHhhhccccccchhhhhcCCccceEEEEcCeEEEec Confidence 35666667766554 456665543 4555665555433 478999999999986643 Q ss_pred c---ccchHHHHHHHHHhhhcCccccccceeccC--------cccccccCCHHHHHHHHhCCcEEEEEcCC-EEEEEecc Q lcl|NC_019421. 296 I---KYTPSEVAVYIAALSVSKGITGSICNAKTI--------FEEVEPRLSQSEVKECLKSGTLVLDFDDG-DVIIVDDV 363 (473) Q Consensus 296 ~---~~~~~~~a~~vAG~~a~~~~~~s~t~~~~~--------~~~~~~~~t~~e~~~l~~~G~~~l~~~~~-~~~i~~gi 363 (473) . .......++++||++|++|..+++|+.|.+ ..++...+++.|++.|+++|+++++...+ +..+.||- T Consensus 463 ~~~~~~~~~p~sg~~AGl~A~~D~~~g~~~span~~~~~i~g~~~~~~~~~~~~~~~Ln~~gIn~i~~~~~~~G~~~wG~ 542 (664) T protein:vir:98 463 KYNDVNRWVPLAGDIAGLCVYTDSVANPWMSPAGYNRGQIRNCIKLAIEPRTAHRDAMYQVQINPVTGFAGGSGFVLYGD 542 (664) T ss_pred ccCCceEEechHHHHHHHHHHhhhcCCcEECcCCceeeeeeccccceeecChhhHHHHHhCCCeEEEEeeCCCcEEEEcc Confidence 2 222223479999999999977766555433 23455678899999999999999987544 68888999 Q ss_pred cccccCCCCCcchhhhhhhhHHHHHHHHHHHHHHhhcCCcccCCHHHHHHHHHHHHHHHHHHHhcCCccCccceeccccc Q lcl|NC_019421. 364 NTFKKYVDDKNEAMGYISNIMFINTINKDTSLKRKEFVGKIFNDATGQTTVICALKKYFEELMSQGIISEFNVDIDTELQ 443 (473) Q Consensus 364 ~T~~~~~~~~~~~~~~i~v~R~~d~i~~~i~~~~~~~ig~~~N~~~~r~~i~~~i~~~l~~l~~~g~i~~~~~~~D~~~~ 443 (473) +|+.+ .+..|++|++||++++|+++|+....++++ +||++.+|.+|+..|+.||++||++|+|.+|++.||.+.+ T Consensus 543 rT~~~----~~s~~~~i~vrR~~~~i~~si~~~~~~~v~-epn~~~l~~~i~~~i~~~L~~l~~~gal~g~~V~~d~~~n 617 (664) T protein:vir:98 543 KTLTS----VPSPFDRINVRRLFNMIKKDIGDNAKYKLF-ENNDDFTRASFRMDTGQYMTNIRALGGCYDYRVICDTTNN 617 (664) T ss_pred cccCC----CCcccceEeehhHHHHHHHHHHHHHHHhhc-CCCCHHHHHHHHHHHHHHHHHHHhcCceeeeEEEEcCCCC Confidence 98742 345899999999999999999987766666 5899999999999999999999999999999999999988 Q ss_pred cCCCC--CEEEEEEEEEEeeeeeeEEEEEEeC Q lcl|NC_019421. 444 ATAKA--DEFYWKWDAVKVDVMKKIYGTGYLG 473 (473) Q Consensus 444 ~~~~~--d~~~v~i~v~p~~~~e~i~~t~~v~ 473 (473) ++++. ..+++++.++|+.|+|||.++|.-- T Consensus 618 t~~~i~~G~~~~~i~~~p~~pae~I~~~~~q~ 649 (664) T protein:vir:98 618 TPDVIDRNEFVATVYVKPPRSINYITLNFVAT 649 (664) T ss_pred CHHHhhCCeEEEEEEEEecCCcceEEEEEEEe Confidence 88764 4788999999999999999997755 No 25 >protein:vir:104858 Length: 729 # NCBI annotation: T4-like tail sheath protein # Family: family:all:661 # MgeID: mge:1630 # MgeName: P-SSM2 # Cross-refs: genbank:acc:YP_214361;genbank:gi:61806001;genbank:GeneID:3294378 Probab=100.00 E-value=1.7e-67 Score=386.64 Aligned_cols=455 Identities=15% Similarity=0.079 Sum_probs=297.5 Q ss_pred CC-ceecCceeEEEecCCcceecccCceEEEEEEeeCCCCCCceEEeeccHHHHHHHcCCC---cCcHHHHHHHHHHhcC Q lcl|NC_019421. 9 KE-RKEIPGFYNRFKTQAEKSTNTGLKGRLAMPIRANWGDVGKVVTIKNDLRQLKNLFGDD---MNYSAFKLGKLALLGN 84 (473) Q Consensus 9 ~~-~~~~PGvYie~~~~~~~~i~~~~~~~~~~~g~a~~Gp~~~~v~i~s~~~~~~~~fG~~---~~~~~~~~v~~~f~~g 84 (473) +. +...|||||||+++++++|++++|+++||+|.++|||+|+|++|+|+ .+|++.||.. ....+.++++.||+|| T Consensus 1 m~~~~~~PgVyv~e~~~~~~~i~~v~ts~~~fvG~~~~Gp~~~p~~i~s~-~~~~~~fG~~~~~~~~~~~~~~~~~f~ng 79 (729) T protein:vir:10 1 MPLNLASPGIVVREVDLTIGRVDPTSGSIGALVAPFAKGPVNDPQLIESE-EDLLQTFGQPYSTDKHYEYWMVASSYLAY 79 (729) T ss_pred CCccccCCceEEEEecCCCcccccccccceeEEeccccCCCccCeEcCCH-HHHHHHcCccccCCcchhHHHHHHHHHhC Confidence 33 55679999999999999999999999999999999999999999995 5588888863 2233457899999999 Q ss_pred CCEEEEEecCCCcccceeeeecccc------------------------cccccceEEEEecCccccceeEEEeeccCCc Q lcl|NC_019421. 85 VKELLLYRLVDGNQKKGTLTLKDTT------------------------ENSAKDVIKLETKYPTARNFNVTIKSNLVDS 140 (473) Q Consensus 85 ~~~v~v~rv~~g~~~aat~~l~~~~------------------------~~~~~~~l~i~A~~~G~~~n~i~v~~~~~~~ 140 (473) |++|||+|+.++.+++++..+.... .......+++.+.+||.|+|.+.+....... T Consensus 80 g~~~~vvRv~~~~~~~a~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~G~~gn~~~v~v~~~~~ 159 (729) T protein:vir:10 80 GGTMQVVRADDYNTQTGVGLKNAFVTGGVGVGATMLRITSNTHYNQLGYDENTISGVSVAAKNPGTWANGIKVAIIDGKA 159 (729) T ss_pred CceEEEEecCcccccccccccccccccccccccccccccccccccccccccCCCcceEEEEeccCccccceeeEEecccC Confidence 9999999999877666654332110 0112335788999999999987765321100 Q ss_pred cc---------------eeeee----------------------ecCCceee----------------------EEEec- Q lcl|NC_019421. 141 DK---------------KDFIF----------------------FENTKQLF----------------------SSSIK- 160 (473) Q Consensus 141 ~~---------------~~v~v----------------------~~~~~~~~----------------------~~~~~- 160 (473) .. +.... ........ ..... T Consensus 160 ~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~s~~~~~~~~~~~~~~~~~~~~~~~ 239 (729) T protein:vir:10 160 DQILTVASGNTTAVGSAVTQSISKTIGTATGTTTIDGVLKGIVTGSTDTTLEVKVISHISAAGVETAVEYQQNGTYTFDN 239 (729) T ss_pred cceeeeeccccccceeeeeeeccccccccccceeeeeeecccccccccccccceecccccccccceeccccccceeeecc Confidence 00 00000 00000000 00000 Q ss_pred -ccc---------hhhhh---------------------hhhhcc----cccc--------------------------- Q lcl|NC_019421. 161 -GTI---------DEIVL---------------------EINSNL----DNEY--------------------------- 178 (473) Q Consensus 161 -~~~---------~~~~~---------------------~~~~~~----~s~~--------------------------- 178 (473) ... +.... ...... .+.+ T Consensus 240 ~~s~~~~a~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~~d~~~~~~~d~~~~~~~~~g 319 (729) T protein:vir:10 240 SGSVNVIAAGSSGSGSAKSYTAQTDWFESQNIVLSNSTLEWDSIADAPGTSTYVSTRGGKNDEIHVLVIDDKGTITGNSG 319 (729) T ss_pred cCccceeeeccccccccccceeeeccccccccccccccccccccccccccccccccccccccccceeeeccccccccCcc Confidence 000 00000 000000 0000 Q ss_pred ------eeEeecccC---Ccc-----------------------------------------------------ccccce Q lcl|NC_019421. 179 ------VIATKVADS---DTI-----------------------------------------------------LANVVN 196 (473) Q Consensus 179 ------v~~~~~~~~---~~~-----------------------------------------------------~~~~~~ 196 (473) ......... .+. ...... T Consensus 320 ~vve~~~~~s~~~~~~~~~~~~~~~~~vi~~~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~ 399 (729) T protein:vir:10 320 TILEKHLSLSKAKDAEYSVGSSSYWRDFLATNSKYIFGGGATSGITTTGYSVSSTNTLDTDSGWDQNAEGVNFGASGVAT 399 (729) T ss_pred cceeeeeeeeeccccccccccccccceeeccccceeeecccccccccccccccccceeccccccccccccccccccceeE Confidence 000000000 000 000011 Q ss_pred eeeccCcccccc----------hhhHHHHHHHHhhccc---ceEEEEEcC-----CCcHHHHHHHHHHHHHHhhCCCeEE Q lcl|NC_019421. 197 QALEGGNDGCTS----------ITNESYLKALEEFERY---SFDSFVLDG-----VADEALQETTKAWVAKNKELGKDIL 258 (473) Q Consensus 197 ~~l~gG~dg~~~----------~t~~d~~~~l~~le~~---~~~~l~~p~-----~~~~~~~~~l~~~v~~~~~~~~~~~ 258 (473) ..+.+|.++... ....++..++.+|+.. .++.++++. .....++.++.+||++++ .++ T Consensus 400 ~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~g~~~l~~~~~~~~~~~~~~~~~~~~~~~~~v~~a~~~~~~~~~----~~~ 475 (729) T protein:vir:10 400 LTLAGGTNYGDKTDLTTSGALSSGVDDIISGYTLFENTEEIEVDFILMGAAHHPKEQSQAVAEKVTAVAEARK----DAV 475 (729) T ss_pred EEeecccccccccccccccccccchhHHHHHHHHhhcccccccceeeecCCCCCccchHHHHHHHHHHHHhcC----CeE Confidence 223344432211 1223455678777654 344444432 344567788888887664 356 Q ss_pred EEEcCCCC-----------------ccHHHHHHhhhcc-CCceEEEecCCceecC---cccchHHHHHHHHHhhhcCccc Q lcl|NC_019421. 259 LFLGGKTE-----------------DNIKQINDKSKSF-NDENIVNVGSSAYYEN---IKYTPSEVAVYIAALSVSKGIT 317 (473) Q Consensus 259 av~~~~~~-----------------~t~~~~~~~~~~~-n~~~i~~~~~~~~~~~---~~~~~~~~a~~vAG~~a~~~~~ 317 (473) +++..+.. .+.+.+..+...+ ++.++.+++||....+ ........++++||++|++|.+ T Consensus 476 a~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~d~~~~~~~~~p~s~~~aGl~a~~d~~ 555 (729) T protein:vir:10 476 AFISPYRQAFLNDTVSGTVTVSNIDQTTENVVGFYAPLSSSTYSVFDSGYKYMFDRFNNTFRYVPLNGDIAGTCARTDIE 555 (729) T ss_pred EEecccccccccccccccccccccchhhHHHHHHHhhccCCceEEEEcCeeEEecccCCceEEechhHHHHHHHHHhhcc Confidence 66654421 2223334443333 5678888888876533 2222233489999999999987 Q ss_pred cccceeccC--------cccccccCCHHHHHHHHhCCcEEEEEcCCEEEEEecccccccCCCCCcchhhhhhhhHHHHHH Q lcl|NC_019421. 318 GSICNAKTI--------FEEVEPRLSQSEVKECLKSGTLVLDFDDGDVIIVDDVNTFKKYVDDKNEAMGYISNIMFINTI 389 (473) Q Consensus 318 ~s~t~~~~~--------~~~~~~~~t~~e~~~l~~~G~~~l~~~~~~~~i~~gi~T~~~~~~~~~~~~~~i~v~R~~d~i 389 (473) +++|+.|.+ ..++...+++.|++.|+++|++++++..++..++||-+|+. ..|+.|++|++||++|+| T Consensus 556 ~g~~~span~~~~~i~g~~~~~~~~~~~~~~~Ln~~gIn~i~~~~~~G~~~wG~rT~~----~~d~~~~~i~vrR~~~~i 631 (729) T protein:vir:10 556 QFPWFSPAGTARGPILNSVKLVYNPGKKQRDILYSNRINPVILSPGAGIILFGDKTGF----GKSSAFDRINVRRLFIYL 631 (729) T ss_pred CCcEEccCCccccceecccceeeecChhhHhhhhhCCceEEEEecCCeEEEEcceecC----CCCcccceeehhhhHHHH Confidence 766555433 22345578899999999999999999888888889999873 246899999999999999 Q ss_pred HHHHHHHHhhcCCcccCCHHHHHHHHHHHHHHHHHHHhcCCccCccceeccccccCCCCC--EEEEEEEEEEeeeeeeEE Q lcl|NC_019421. 390 NKDTSLKRKEFVGKIFNDATGQTTVICALKKYFEELMSQGIISEFNVDIDTELQATAKAD--EFYWKWDAVKVDVMKKIY 467 (473) Q Consensus 390 ~~~i~~~~~~~ig~~~N~~~~r~~i~~~i~~~l~~l~~~g~i~~~~~~~D~~~~~~~~~d--~~~v~i~v~p~~~~e~i~ 467 (473) +++|++...+|++ +||++.+|.+|+..|+.||++||++|+|.+|++.||.+.+++++.+ .+++.++++|+.|+|||. T Consensus 632 ~~si~~~~~~~v~-epn~~~~~~~i~~~i~~~L~~l~~~g~l~g~~v~~d~~~nt~~~i~~G~~~~~v~~~p~~p~e~i~ 710 (729) T protein:vir:10 632 EDAISAAAKDQLF-EFNDELTRTNFVNIVEPFLRDVQAKRGIFDFVVICDETNNTAAVIDSNEFVADIFIKPARSINFIG 710 (729) T ss_pred HHHHHHHHHHhhc-CCCCHHHHHHHHHHHHHHHHHHHhccceeeeEEEEcCCCCCHHHhhCCeEEEEEEEEecCCccEEE Confidence 9999988877776 5899999999999999999999999999999999999988887654 688889999999999999 Q ss_pred EEEEeC Q lcl|NC_019421. 468 GTGYLG 473 (473) Q Consensus 468 ~t~~v~ 473 (473) ++|.-- T Consensus 711 ~~~~~~ 716 (729) T protein:vir:10 711 LTFVAT 716 (729) T ss_pred EEEEEe Confidence 998665 No 26 >protein:vir:102359 Length: 356 # NCBI annotation: XkdK protein # Family: family:all:632 # MgeID: mge:1566 # MgeName: phi CD119 # Cross-refs: genbank:acc:YP_529565;genbank:gi:90592650;genbank:GeneID:3974491 Probab=100.00 E-value=7.6e-68 Score=388.56 Aligned_cols=333 Identities=18% Similarity=0.175 Sum_probs=256.8 Q ss_pred cCCCcccceeeeecccccccccceEEEEecCccccceeEEEeeccCCccceeeeeecCCceeeEEEecccchhhhhhhhh Q lcl|NC_019421. 93 LVDGNQKKGTLTLKDTTENSAKDVIKLETKYPTARNFNVTIKSNLVDSDKKDFIFFENTKQLFSSSIKGTIDEIVLEINS 172 (473) Q Consensus 93 v~~g~~~aat~~l~~~~~~~~~~~l~i~A~~~G~~~n~i~v~~~~~~~~~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~ 172 (473) +++ - ..... ....++.+|+++|.+|+-+.+-. ++...+. .+.....+ .+... T Consensus 1 ~~g-l-p~i~i---------~f~~~a~ta~~~g~rGiv~~il~--d~~~~~~--~~~~~~~v--------~~~~~----- 52 (356) T protein:vir:10 1 MAG-L-VNINI---------EFKELATSFIQRSKAGIVAIILK--DTTKMYK--ELTSEDDI--------PISLS----- 52 (356) T ss_pred CCC-C-CceeE---------EEeecceeeccCCccceEEEEEe--cCCccee--EEeccccc--------hhHHH----- Confidence 100 0 01111 23457889999999996543322 1222111 11111110 11111 Q ss_pred cccccceeEeecccCCccccccceeeeccCcccccchhhHHHHHHHHhhcccceEEEEEcCCCcHHHHHHHHHHHHHHhh Q lcl|NC_019421. 173 NLDNEYVIATKVADSDTILANVVNQALEGGNDGCTSITNESYLKALEEFERYSFDSFVLDGVADEALQETTKAWVAKNKE 252 (473) Q Consensus 173 ~~~s~~v~~~~~~~~~~~~~~~~~~~l~gG~dg~~~~t~~d~~~~l~~le~~~~~~l~~p~~~~~~~~~~l~~~v~~~~~ 252 (473) ...++++.....++..+..+.. ++.++..+. .++++|.++|++||.++||+||+|+ .++++|+++.+|++++|+ T Consensus 53 ~~n~~~i~~~~~g~~~~~~~~~---p~~~~~~~~--~t~~~y~~aL~~le~~~fn~l~~~~-~d~~~~~~~~a~ikr~r~ 126 (356) T protein:vir:10 53 ADNKKYIKYGFVGATDNEKVLR---PSKVIISTF--TEDGKVEDILEELESVEFNYLCMPE-AIEAEKTKIVTWIKKIRE 126 (356) T ss_pred HHHHHHHHHHhhcccccccccc---ceeeeeecc--cCchhHHHHHHHhcCccceEEEecC-CChHHHHHHHHHHHHHHh Confidence 1123343333222222221211 222222211 2567999999999999999999997 578999999999999998 Q ss_pred C-CCeEEEEEcCCCCccHHHHHHhhhccCCceEEEecCCceecCcccchHHHHHHHHHhhhcCccccccceeccCccccc Q lcl|NC_019421. 253 L-GKDILLFLGGKTEDNIKQINDKSKSFNDENIVNVGSSAYYENIKYTPSEVAVYIAALSVSKGITGSICNAKTIFEEVE 331 (473) Q Consensus 253 ~-~~~~~av~~~~~~~t~~~~~~~~~~~n~~~i~~~~~~~~~~~~~~~~~~~a~~vAG~~a~~~~~~s~t~~~~~~~~~~ 331 (473) + ++++.+|+++. ..|+|+|+++.+++...+..+.++++|+|+||++|++++++|+||++++.++.. T Consensus 127 ~~~~~~~~V~~~~-------------~aD~EgIInv~n~~~~~g~~~t~~~~~~~vAG~~Ag~~~n~S~T~~~~~~~~~~ 193 (356) T protein:vir:10 127 EESTEAKAVLANI-------------KADNEAIINFTENVVVDGEEITAEKYTTRVASLIASTPNTQSITYAPLDEVESI 193 (356) T ss_pred cCCcEEEEEecCC-------------CCCCceeEEeecCeEecceeechhHHHHHHHHHHhccchhccccceecCCcccc Confidence 7 55666666543 359999999999988889999999999999999999999999999999999889 Q ss_pred ccCCHHHHHHHHhCCcEEEEEcCCEEEEEecccccccCCCCCcchhhhhhhhHHHHHHHHHHHH-HHhhcCCcccCCHHH Q lcl|NC_019421. 332 PRLSQSEVKECLKSGTLVLDFDDGDVIIVDDVNTFKKYVDDKNEAMGYISNIMFINTINKDTSL-KRKEFVGKIFNDATG 410 (473) Q Consensus 332 ~~~t~~e~~~l~~~G~~~l~~~~~~~~i~~gi~T~~~~~~~~~~~~~~i~v~R~~d~i~~~i~~-~~~~~ig~~~N~~~~ 410 (473) ++|+++|+++++++|.++|+++++.++|++|||||++++.+++++|+||+++|+||.|.++++. +.+.|+||+||++++ T Consensus 194 ~~~t~~e~~~ai~~G~lvl~~d~~~V~I~~~VNSltt~t~~k~~~f~Kirvvr~~D~i~~Di~~~f~~~yiGKv~N~~dg 273 (356) T protein:vir:10 194 VKIDKASADAKVQAGELILRRLSGKIRIARGINSLTTLTAEKGEIFQKIKLVDTKDLISKDIKNIYVEKYLRKCPNTYDN 273 (356) T ss_pred ccCCHHHHHHHHhCCeEEEEEEcCeEEEEecCccceecCCCCCcchhhhHHHHHHHHHHHHHHHHHhhccccccCCCHHH Confidence 9999999999999999999999999999999999999999999999999999999999999985 556899999999999 Q ss_pred HHHHHHHHHHHHHHHHhcCCccC-ccceeccccc--------------------cCCCCCEEEEEEEEEEeeeeeeEEEE Q lcl|NC_019421. 411 QTTVICALKKYFEELMSQGIISE-FNVDIDTELQ--------------------ATAKADEFYWKWDAVKVDVMKKIYGT 469 (473) Q Consensus 411 r~~i~~~i~~~l~~l~~~g~i~~-~~~~~D~~~~--------------------~~~~~d~~~v~i~v~p~~~~e~i~~t 469 (473) |.+|+++|++||++|+++|+|++ |.+++|++.| +.++++.+++++.++|+|+||+||++ T Consensus 274 r~~l~~ai~~y~~~L~~~~~I~~~~~~eid~e~q~~~~~~~g~d~~~~~d~~v~~~~~~~~v~~~~~v~~vdamE~iy~t 353 (356) T protein:vir:10 274 KCLFIVAVQSYLTELAKQELIDSNFTVEIDLEKQKEYLEGKKIAVSKMKENEIKEANTGSNGFYLINLKLVDAMEDINIR 353 (356) T ss_pred HHHHHHHHHHHHHHHHhCCccccCceeEecccchHHHhhhccccccccccceeecccCCcEEEEEEEEEEEeeeeeEEeE Confidence 99999999999999999999974 6677777655 34678899999999999999999999 Q ss_pred EEe Q lcl|NC_019421. 470 GYL 472 (473) Q Consensus 470 ~~v 472 (473) ++| T Consensus 354 i~v 356 (356) T protein:vir:10 354 VQM 356 (356) T ss_pred EeC Confidence 999 No 27 >protein:vir:104477 Length: 749 # NCBI annotation: gp18 # Family: family:all:661 # MgeID: mge:1548 # MgeName: P-SSM4 # Cross-refs: genbank:acc:YP_214663;genbank:gi:61806304;genbank:GeneID:3294532 Probab=100.00 E-value=7.1e-66 Score=377.76 Aligned_cols=455 Identities=15% Similarity=0.106 Sum_probs=296.9 Q ss_pred CccccCCCCceecCceeEEEecCCcceecccCceEEEEEEeeCCCCCCceEEeeccHHHHHHHcCCCc-CcHHHHHHHHH Q lcl|NC_019421. 2 ATGTWNEKERKEIPGFYNRFKTQAEKSTNTGLKGRLAMPIRANWGDVGKVVTIKNDLRQLKNLFGDDM-NYSAFKLGKLA 80 (473) Q Consensus 2 ~~g~~~~~~~~~~PGvYie~~~~~~~~i~~~~~~~~~~~g~a~~Gp~~~~v~i~s~~~~~~~~fG~~~-~~~~~~~v~~~ 80 (473) |.- +...|||||||++++ ++|.++.|+++||+|.++|||+|+|++|+|+ .+|.+.||... ...+.++++.| T Consensus 1 M~~------~~~~PgVyv~e~~~~-~~~~~~~t~~~~fvG~~~~Gp~~~p~~v~s~-~~~~~~fG~~~~~~~~~~~v~~~ 72 (749) T protein:vir:10 1 MAT------NQSSPGVVIQERDLT-TVSTIPTANVGVIAAPFTKGPVEEVIEITSE-RQLAEKFGEPNESNYEYWFSAAQ 72 (749) T ss_pred CCc------cccCCeeEEEEecCC-cccccccCceeEEEeccCCCCCccCEEcCCH-HHHHHHcCCccCCcccHHHHHHH Confidence 332 235699999999987 5689999999999999999999999999995 55777888644 34566899999 Q ss_pred HhcCCCEEEEEecCCCcccceeeeeccc----------ccccccceEEEEecCccccceeEEEeeccCCcccee------ Q lcl|NC_019421. 81 LLGNVKELLLYRLVDGNQKKGTLTLKDT----------TENSAKDVIKLETKYPTARNFNVTIKSNLVDSDKKD------ 144 (473) Q Consensus 81 f~~g~~~v~v~rv~~g~~~aat~~l~~~----------~~~~~~~~l~i~A~~~G~~~n~i~v~~~~~~~~~~~------ 144 (473) |+|||++|||+|+.++..+.++...... ....+...+++.|++||.|||.++|.+......... T Consensus 73 F~ngg~~~~vvRv~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~pG~~gn~l~v~v~~~~~~~~~~~~~~~ 152 (749) T protein:vir:10 73 FLSYGGLLKTIRVNSSSLKNAVDTGTAPLVKNLQDYETSIEDASNNFSWVARTPGDTGNSIGIFVTDAGADQVVVVPAPG 152 (749) T ss_pred HhhcCCeEEEEEccCccccccccccccccccccccccccccccccceEEEeccCCCcCCceEEEEEcCCCceeeeeecCC Confidence 9999999999999876665554322111 112344568899999999999887754211000000 Q ss_pred -----------------------------eee----------------------------ecC-CceeeEEEeccc---- Q lcl|NC_019421. 145 -----------------------------FIF----------------------------FEN-TKQLFSSSIKGT---- 162 (473) Q Consensus 145 -----------------------------v~v----------------------------~~~-~~~~~~~~~~~~---- 162 (473) +.+ ... ...++....... T Consensus 153 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~a~~~~~~~~~~~~~~~~~~s~~~~~~ 232 (749) T protein:vir:10 153 SGNEHEFVADAAVSAASGAAGKVFKYSIILTIDDVVGTFAPGSATTITIGGSAESVNVLAYDATNKKLEIGLPSGGVTGI 232 (749) T ss_pred ccceeeEEeeecccccccccccccccceeeeeccccceeecccceeeeccCcccccccccccCCcceEEEeeecccccce Confidence 000 000 000000000000 Q ss_pred ------------c-------hhhh-hh---------hh--------------------------------h--------- Q lcl|NC_019421. 163 ------------I-------DEIV-LE---------IN--------------------------------S--------- 172 (473) Q Consensus 163 ------------~-------~~~~-~~---------~~--------------------------------~--------- 172 (473) . .... .. .. . T Consensus 233 ~a~~~~v~~~~~~~~~~~~i~~~~~~~~~~~~~~~a~~~~~~~~~g~~~~it~v~~~~~~~~~~t~~~~~~~a~~~gt~~ 312 (749) T protein:vir:10 233 LADNQVITQGTNTAKINVTIERKLLVALNKSSIEFAASDVVQDTNSTNITITSVRDEYTEREYLPGVKWINVAPRPGTSL 312 (749) T ss_pred eeeeecccccccccccccccccchhhhhccccceeeccccccCCccceeEEEeeeccccccccccceeecccccccccee Confidence 0 0000 00 00 0 Q ss_pred ---------------------------------------------------------cccccceeEeeccc--------- Q lcl|NC_019421. 173 ---------------------------------------------------------NLDNEYVIATKVAD--------- 186 (473) Q Consensus 173 ---------------------------------------------------------~~~s~~v~~~~~~~--------- 186 (473) ...|.++....... T Consensus 313 ~~~~~~g~~D~~~v~v~~~~g~~~~~~g~v~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~v~~~~~~~~~~~~~~~~ 392 (749) T protein:vir:10 313 YANGVGGHRDEMHVILVDIDGGVTGTVGALLERYIDVSKASDAKTSVGETNYYAEVIKQKSEFIYWAEHESTLYAATSSA 392 (749) T ss_pred eeecccCCCCceEEEEecCCCeeeecccceeeeeeeccccccccccccccchhhhhhccCCCEEEEEecccccccccccc Confidence 00000000000000 Q ss_pred CCc-------------------c-------------ccccceeeeccCccccc-----chhhHHHHHHHHhhc---ccce Q lcl|NC_019421. 187 SDT-------------------I-------------LANVVNQALEGGNDGCT-----SITNESYLKALEEFE---RYSF 226 (473) Q Consensus 187 ~~~-------------------~-------------~~~~~~~~l~gG~dg~~-----~~t~~d~~~~l~~le---~~~~ 226 (473) ..+ . ........+.+|.|+.. ..+..++..++..|. ...+ T Consensus 393 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~d~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~ 472 (749) T protein:vir:10 393 SDGLFGQTAANRQFNLFRSAAGSVDYPAGVTTLGSKNNATYYYRLSGGVNYTVSAGQYTITNTDIGSAYELIGDPESQIV 472 (749) T ss_pred cccccccccccceeeccccccccceeccccccccccCCcEEEEEccCCcccccccccccccchhHHHHHHHhhhhhhccc Confidence 000 0 00000012233333321 223445666665554 3456 Q ss_pred EEEEEc--CCCcH---HHHHHHHHHHHHHhhCCCeEEEEEcCCCCccH----------HHHHHhhhccCCceEEEecCCc Q lcl|NC_019421. 227 DSFVLD--GVADE---ALQETTKAWVAKNKELGKDILLFLGGKTEDNI----------KQINDKSKSFNDENIVNVGSSA 291 (473) Q Consensus 227 ~~l~~p--~~~~~---~~~~~l~~~v~~~~~~~~~~~av~~~~~~~t~----------~~~~~~~~~~n~~~i~~~~~~~ 291 (473) ++++++ +.++. .++.++.++|++++ .++++++.+..... .....+....++.+.++++||. T Consensus 473 ~~li~~~~~~~~~~~~~v~~al~~~~~~~~----~~~~~~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~ 548 (749) T protein:vir:10 473 DFIISGPSGTSDANALAKITSLVNIAEERR----DCMVFVSPRRGNVIGISNTTTITTNIVDFFKKLPSSSYMVFDSGYK 548 (749) T ss_pred ceEEEecCCCCcchhHHHHHHHHHHHhhcC----CEEEEEcCCCCcccccccchhhhhHHHHHHhhccCceeEEEEccce Confidence 776654 33333 35566666766553 47888876644321 2223344456788999999988 Q ss_pred eecCc---ccchHHHHHHHHHhhhcCccccccceeccC--------cccccccCCHHHHHHHHhCCcEEEEEcCCEEEEE Q lcl|NC_019421. 292 YYENI---KYTPSEVAVYIAALSVSKGITGSICNAKTI--------FEEVEPRLSQSEVKECLKSGTLVLDFDDGDVIIV 360 (473) Q Consensus 292 ~~~~~---~~~~~~~a~~vAG~~a~~~~~~s~t~~~~~--------~~~~~~~~t~~e~~~l~~~G~~~l~~~~~~~~i~ 360 (473) +..+. .......|+++||++|++|..+++|+.|.+ ..++...+++.|++.|+++|++++++..+...++ T Consensus 549 ~~~d~~~~~~~~~p~s~~vAGl~Ar~D~~~g~~~SPan~~~~~i~g~~~~~~~~~~~e~~~Ln~~gIn~i~~~~g~G~~~ 628 (749) T protein:vir:10 549 YIYDKYNDVYRYIPCNGDTAGLCLQTNEISEPWFSPAGFQRGVLRNAIKLAYTPNKAQRDQLYANRVNPIVSFPGQGVVL 628 (749) T ss_pred eeeccccCceEEechHHHHHHHHHHhhccCCcEECcCCceeeeeeccccceeecChhHHHhhhhCCceEEEEecCCeEEE Confidence 65322 222233489999999999977765554433 2345667899999999999999999988878888 Q ss_pred ecccccccCCCCCcchhhhhhhhHHHHHHHHHHHHHHhhcCCcccCCHHHHHHHHHHHHHHHHHHHhcCCccCccceecc Q lcl|NC_019421. 361 DDVNTFKKYVDDKNEAMGYISNIMFINTINKDTSLKRKEFVGKIFNDATGQTTVICALKKYFEELMSQGIISEFNVDIDT 440 (473) Q Consensus 361 ~gi~T~~~~~~~~~~~~~~i~v~R~~d~i~~~i~~~~~~~ig~~~N~~~~r~~i~~~i~~~l~~l~~~g~i~~~~~~~D~ 440 (473) ||-+|+.+ .|+.|++|++||++|+|+++|+....+|++ +||++.+|.+|+..|+.||++||++|+|++|++.||. T Consensus 629 wG~rT~~s----~d~~~~~i~vRRl~~~ie~si~~~~~~~v~-epn~~~l~~~i~~~i~~fL~~l~~~G~i~~f~V~~d~ 703 (749) T protein:vir:10 629 YGDKTALG----FASAFDRINIRRLFLTVERVISTAAKAQLF-EQNDEAQRSLFINIVEPYLRDVQGRRGVVDFLVKCDS 703 (749) T ss_pred EcceecCC----CCcccceeehhhhHHHHHHHHHHHHHHhhc-CCCCHHHHHHHHHHHHHHHHHHHhcCCeeeeEEEEcC Confidence 99998742 357899999999999999999987776666 6899999999999999999999999999999999999 Q ss_pred ccccCCCCC--EEEEEEEEEEeeeeeeEEEEEEeC Q lcl|NC_019421. 441 ELQATAKAD--EFYWKWDAVKVDVMKKIYGTGYLG 473 (473) Q Consensus 441 ~~~~~~~~d--~~~v~i~v~p~~~~e~i~~t~~v~ 473 (473) +.|++++.+ .+++++.++|+.|+|||.++|.-- T Consensus 704 ~~Nt~~~i~~G~~~~~i~~~P~~pae~I~~~~~~~ 738 (749) T protein:vir:10 704 TNNTPEAVDRGEFYAEVFLKPTRTINYVQLTFVAT 738 (749) T ss_pred CCCCHHHhhCCEEEEEEEEEecCCccEEEEEEEEe Confidence 998877544 788999999999999999998744 No 28 >protein:vir:5663 Length: 671 # NCBI annotation: tail sheath protein # Family: family:all:661 # MgeID: mge:119 # MgeName: KVP40 # Cross-refs: genbank:acc:NP_899602;genbank:gi:34419589;genbank:GeneID:2545776 Probab=100.00 E-value=2.8e-65 Score=374.50 Aligned_cols=455 Identities=15% Similarity=0.142 Sum_probs=286.3 Q ss_pred CceecCceeEEEecCCcceecccCceEEEEEEeeCCCCCCceEEeeccHHHHHHHcCC-CcCcHHHHHHHHHHhcCCCEE Q lcl|NC_019421. 10 ERKEIPGFYNRFKTQAEKSTNTGLKGRLAMPIRANWGDVGKVVTIKNDLRQLKNLFGD-DMNYSAFKLGKLALLGNVKEL 88 (473) Q Consensus 10 ~~~~~PGvYie~~~~~~~~i~~~~~~~~~~~g~a~~Gp~~~~v~i~s~~~~~~~~fG~-~~~~~~~~~v~~~f~~g~~~v 88 (473) -+-..|||||||++ +.++|++++|+++||+|.++|||+|+|++|+|+ .+|.+.||. .....+.++++.||+|||++| T Consensus 1 ~~~~~Pgvyv~e~~-~~~~i~~v~t~~~~fvG~~~~Gp~~~p~~v~s~-~~~~~~fG~~~~~~~~~~~v~~~f~ngg~~~ 78 (671) T protein:vir:56 1 MTLLSPGIENKEIN-LASAIGRAATGRAAMVGKFEWGPAYSITQVTSE-SDLVTIFGRPNDYTAASFMTANNFLKYGNDL 78 (671) T ss_pred CceecCceEEEeec-CcccccccCcccceEEecccCCCCccCEEcCCH-HHHHHHcCCcCCCcchhHHHHHHHHhcCCeE Confidence 45677999999995 899999999999999999999999999999995 557777775 444556789999999999999 Q ss_pred EEEecCCCcccceeeeeccccc---cc-----ccceEEEEecCccccceeE----------------------------- Q lcl|NC_019421. 89 LLYRLVDGNQKKGTLTLKDTTE---NS-----AKDVIKLETKYPTARNFNV----------------------------- 131 (473) Q Consensus 89 ~v~rv~~g~~~aat~~l~~~~~---~~-----~~~~l~i~A~~~G~~~n~i----------------------------- 131 (473) ||+|+.+++..++...+++... .. ....+.+.+..++.+.... T Consensus 79 ~vvrv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~v~~~~~ 158 (671) T protein:vir:56 79 RLVRICDATTAQNATPLYNAVEYTIGASNGCVVGDDITITYSGVGALTAKGKVLEVDAGNNNAASKIFLPSAEIVAAAKS 158 (671) T ss_pred EEEEecCccccccchhhccccccccccCcceeeceeeeeecCcccccccCcceeEEeeeccceeeeeeccceeEEEeeec Confidence 9999987665544433332100 00 0001112111111111000 Q ss_pred -------------------EEee-ccCCccc-e---------------------------------eeee-ecC---Cc- Q lcl|NC_019421. 132 -------------------TIKS-NLVDSDK-K---------------------------------DFIF-FEN---TK- 152 (473) Q Consensus 132 -------------------~v~~-~~~~~~~-~---------------------------------~v~v-~~~---~~- 152 (473) .+.. ...+... . .+.. +.+ .. T Consensus 159 ~~~~~~~~~~t~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~g~~g~~~ 238 (671) T protein:vir:56 159 DGNYPSVGTITLQPTQGDIALTNIEIIDTGSVYFPNIELAFDALTAIETEGGALKYADLIEKQGFPRLSARYVGDFGDAI 238 (671) T ss_pred cccccccccccccccccceeeeeecccccceEEEeccccccccccccccccccccchhhhhcccccccccccccccCcce Confidence 0000 0000000 0 0000 000 00 Q ss_pred eeeE------EE---ec---------------ccchhhhhh----hhhcccccc---eeE----------ee-c------ Q lcl|NC_019421. 153 QLFS------SS---IK---------------GTIDEIVLE----INSNLDNEY---VIA----------TK-V------ 184 (473) Q Consensus 153 ~~~~------~~---~~---------------~~~~~~~~~----~~~~~~s~~---v~~----------~~-~------ 184 (473) .+.. .. .. ...+..... ......+.+ +.. .. . T Consensus 239 ~v~v~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~~~~~~~~~~~~~~~~ 318 (671) T protein:vir:56 239 SVEIINYADYQTAFAFAAGHTLGDIELPIYPDGGTRSINLSSYFTFGPSNSNQYAVIVRVSGEVEEAFIVSTNPGDKDVN 318 (671) T ss_pred EEEEecccccccccccccceeeeeccccccccccccccccceeecccccccccceeEEeecCccceeEEEeecccccccc Confidence 0000 00 00 000000000 000000000 000 00 0 Q ss_pred -------------------ccCCccccccceeeeccCcccccchhhHHHHHHHHhhccc---ceEEEEEcCCCcHH---H Q lcl|NC_019421. 185 -------------------ADSDTILANVVNQALEGGNDGCTSITNESYLKALEEFERY---SFDSFVLDGVADEA---L 239 (473) Q Consensus 185 -------------------~~~~~~~~~~~~~~l~gG~dg~~~~t~~d~~~~l~~le~~---~~~~l~~p~~~~~~---~ 239 (473) ......+.......+.||.++.. ...++.+++++++.. ..+++..|+..... . T Consensus 319 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~d~~~--~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~ 396 (671) T protein:vir:56 319 GQSIFIDEYFENSGSAYITAIAEGWKTESGAYNFGGGSDANA--GADDWMFGLDMLSDPEVLYTNLVIAGNAAAEEVSIA 396 (671) T ss_pred hhhhhhhhhhcccCceEEEecCcccCCccccccccCcccccc--chhHHHHHHHhhhhccccceeEEEcCCCCCccchhH Confidence 00000111222234566776643 344677788777643 46677666532211 1 Q ss_pred HHHHHHHHHHHhhCCCeEEEEEcCCC--------CccHHHHHHhh--------------hccCCceEEEecCCceecCc- Q lcl|NC_019421. 240 QETTKAWVAKNKELGKDILLFLGGKT--------EDNIKQINDKS--------------KSFNDENIVNVGSSAYYENI- 296 (473) Q Consensus 240 ~~~l~~~v~~~~~~~~~~~av~~~~~--------~~t~~~~~~~~--------------~~~n~~~i~~~~~~~~~~~~- 296 (473) .....+.+..+.+..+.+++++..+. ..+.+.+.++. ..+++.+.++++||.+..+. T Consensus 397 ~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~ 476 (671) T protein:vir:56 397 STVQKYAIDSVGNVRQDCVVFVSPPQSLIVNKQAGTAVANIQGWRTGIDPTNGQAVVDNLNVSTTYAVIDGNYKYQYDKY 476 (671) T ss_pred HHHHHHHHHHHHhhcCCEEEEEeccccccccccccccHHHHHHHhhhccccchhhhhhhccCCcceEEEecCceEEeccc Confidence 22233444445555566788877542 34555555444 34678899999999875332 Q ss_pred --ccchHHHHHHHHHhhhcCccccccceeccC--------cccccccCCHHHHHHHHhCCcEEEEEcCCEEEEEeccccc Q lcl|NC_019421. 297 --KYTPSEVAVYIAALSVSKGITGSICNAKTI--------FEEVEPRLSQSEVKECLKSGTLVLDFDDGDVIIVDDVNTF 366 (473) Q Consensus 297 --~~~~~~~a~~vAG~~a~~~~~~s~t~~~~~--------~~~~~~~~t~~e~~~l~~~G~~~l~~~~~~~~i~~gi~T~ 366 (473) .......++++||++|++|.++++|+.|.+ ...+...+++.|++.|+++|++++++..++..+.||-+|+ T Consensus 477 ~~~~~~~p~s~~~AGl~Ar~D~~~g~~~span~~~~~i~g~~~~~~~~~~~~~~~Ln~~gIn~i~~~~~~G~~~wG~rT~ 556 (671) T protein:vir:56 477 NDRNRWVPLAGDIAGLCAYTDQVSQPWMSPAGFNRGQIKGVNRLAVDLRRAHRDALYQIGINPVVGFAGQGFVLYGDKTA 556 (671) T ss_pred CCceeEechHHHHHHHHHHhhccCCcEECcCCceeccccccccceeecChhHHHHHhhCCceEEEEecCCeEEEEcceec Confidence 122222479999999999977765554433 2245667899999999999999999987777888999987 Q ss_pred ccCCCCCcchhhhhhhhHHHHHHHHHHHHHHhhcCCcccCCHHHHHHHHHHHHHHHHHHHhcCCccCccceeccccccCC Q lcl|NC_019421. 367 KKYVDDKNEAMGYISNIMFINTINKDTSLKRKEFVGKIFNDATGQTTVICALKKYFEELMSQGIISEFNVDIDTELQATA 446 (473) Q Consensus 367 ~~~~~~~~~~~~~i~v~R~~d~i~~~i~~~~~~~ig~~~N~~~~r~~i~~~i~~~l~~l~~~g~i~~~~~~~D~~~~~~~ 446 (473) .+ .++.|++|++||+++||+++|+....++++ +||++.+|..|+..|+.||+.||++|+|.+|++.||.+.|+++ T Consensus 557 ~~----~~~~~~~i~vrR~~~~i~~si~~~~~~~v~-epn~~~~~~~i~~~i~~fL~~l~~~gal~g~~v~~d~~~nt~~ 631 (671) T protein:vir:56 557 TQ----QASAFDRINVRRLFNLLKKAISDAAKYRLF-ELNDEFTRSSFKSEIDAYLTNIQDLGGVYDFRVVCDETNNPGS 631 (671) T ss_pred CC----CCcccceEehhhHHHHHHHHHHHHHHHhcC-CCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEcCCCCCHH Confidence 42 346899999999999999999887766666 5899999999999999999999999999999999999988877 Q ss_pred CCC--EEEEEEEEEEeeeeeeEEEEEEeC Q lcl|NC_019421. 447 KAD--EFYWKWDAVKVDVMKKIYGTGYLG 473 (473) Q Consensus 447 ~~d--~~~v~i~v~p~~~~e~i~~t~~v~ 473 (473) +.+ .+++++.++|+.|+|||.++|.-- T Consensus 632 ~i~~G~~~~~i~~~p~~Pae~I~~~~~~~ 660 (671) T protein:vir:56 632 VIDRNEFVASIYVKPAKSINFITLNFVAT 660 (671) T ss_pred HhhCCeEEEEEEEEecCCcceEEEEEEEe Confidence 544 788889999999999999998644 No 29 >protein:vir:79092 Length: 477 # NCBI annotation: gp13, phage tail sheath protein # Family: family:all:115 # MgeID: mge:1862 # MgeName: phiE255 # Cross-refs: genbank:acc:YP_001111213;genbank:gi:134288804;genbank:GeneID:4960766 Probab=100.00 E-value=2.5e-61 Score=352.81 Aligned_cols=439 Identities=12% Similarity=0.068 Sum_probs=286.0 Q ss_pred CCccccCCCCceecCceeEEEecCCcceecccCceEEEEEEeeCCCCCCceEEeeccHHHHHHHcCCCcCcHHHHHHHHH Q lcl|NC_019421. 1 MATGTWNEKERKEIPGFYNRFKTQAEKSTNTGLKGRLAMPIRANWGDVGKVVTIKNDLRQLKNLFGDDMNYSAFKLGKLA 80 (473) Q Consensus 1 m~~g~~~~~~~~~~PGvYie~~~~~~~~i~~~~~~~~~~~g~a~~Gp~~~~v~i~s~~~~~~~~fG~~~~~~~~~~v~~~ 80 (473) |+. +| +||||+||++++.++|.+++|++.+|+|.+++||+|+|++|+|+.+ +.++||.....++.++++.+ T Consensus 1 M~~-~~-------~pGVyv~E~~~g~~~I~~v~Tsv~~~VG~a~~~p~n~pv~its~~d-~~~~g~~~~~~tL~~Av~~~ 71 (477) T protein:vir:79 1 MAA-NY-------LHGVETIEKETGSRPVKVVKSAVIGLIGTAPIGPVNTPVQSLSDVD-AAQFGPQLAGFTIPQALDAV 71 (477) T ss_pred CcC-CC-------CCCeEEEEecCCcccccccCCceEEEEeecccCCCcccEEEccHHH-HHHhcCCCCCCcHHHHHHHH Confidence 552 22 4999999999999999999999999999999999999999999655 67788888888899999999 Q ss_pred HhcCCCEEEEEecCCCcccceeeeecccccccccceEEEEecCccccceeEEEeeccCCccceeeeeecCCceeeEEEec Q lcl|NC_019421. 81 LLGNVKELLLYRLVDGNQKKGTLTLKDTTENSAKDVIKLETKYPTARNFNVTIKSNLVDSDKKDFIFFENTKQLFSSSIK 160 (473) Q Consensus 81 f~~g~~~v~v~rv~~g~~~aat~~l~~~~~~~~~~~l~i~A~~~G~~~n~i~v~~~~~~~~~~~v~v~~~~~~~~~~~~~ 160 (473) |.|||.+||++|+.++...+++........ ..........+.....+.+....... ... .+...... ... T Consensus 72 f~ngg~~~~vvrV~~~~~~~~~~a~~~~~~----~~~~~~~~~~~~~~~~~~v~~~~~~~---~~~--~~~~~~~~-~~~ 141 (477) T protein:vir:79 72 YDYGSGTVIVINVLDPAVHKSNAASESVTF----DAATGRAKLAHPAAANLVLKNDSGGT---TYT--EGTDYAVD-LIN 141 (477) T ss_pred hhcCCceEEEEeccCCcccccccccccccc----ccccccccccccccceeEEeeccccc---ccc--cCcccccc-ccc Confidence 999999999999988765544433222110 01111112222222222222111100 000 00000000 000 Q ss_pred ccchhhhhhhhhcccccceeEeecccCCccccccceeeeccCcccccchhhHHHHHHHHhhcc---cceEEEEEcCCC-c Q lcl|NC_019421. 161 GTIDEIVLEINSNLDNEYVIATKVADSDTILANVVNQALEGGNDGCTSITNESYLKALEEFER---YSFDSFVLDGVA-D 236 (473) Q Consensus 161 ~~~~~~~~~~~~~~~s~~v~~~~~~~~~~~~~~~~~~~l~gG~dg~~~~t~~d~~~~l~~le~---~~~~~l~~p~~~-~ 236 (473) ........ .............. ....+.........+..+.... .....+|...+. ....+++.|+.. + T Consensus 142 ~~~~~~~~-~~~~~~~~~~~~~~---~~~~~~~~~~~~~~g~~~a~~~---~tg~~al~~~~~~~~~~~~iv~apg~~~~ 214 (477) T protein:vir:79 142 GVITRIKT-GTIPAAATAAKATY---DYADPTKVTAADIIGAVNAAGM---RTGMKALKDTYNLYGYFSKILIAPAYCTQ 214 (477) T ss_pred hhhhhhhc-cccccccceeecee---ccCCcccceeeeeccccccccc---chhhhhhhhhhhhcccccceeeccccccc Confidence 00000000 00000000000000 0001111111122222222111 111122222222 235678888753 4 Q ss_pred HHHHHHHHHHHHHHhhCCCeEEEEEcCCCCccHHHHHHhh-------hccCCceEEEecCCceecCcc---cchHHHHHH Q lcl|NC_019421. 237 EALQETTKAWVAKNKELGKDILLFLGGKTEDNIKQINDKS-------KSFNDENIVNVGSSAYYENIK---YTPSEVAVY 306 (473) Q Consensus 237 ~~~~~~l~~~v~~~~~~~~~~~av~~~~~~~t~~~~~~~~-------~~~n~~~i~~~~~~~~~~~~~---~~~~~~a~~ 306 (473) ..+++.+.++|+++ ++++++..+...+.+.+.+.. ..+++.++++++||....+.. ......+++ T Consensus 215 ~~v~~~l~~~~~~~-----~~~a~~d~p~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~~~~~~~~~~~p~s~~ 289 (477) T protein:vir:79 215 NSVSVELEAMAVQL-----GAIAYIDAPIGTTLAQALAGRGPAGTINFNTSSDRVRLCYPHVKVYDIATNAERLEPLSSR 289 (477) T ss_pred hhHHHHHHHHHhhc-----CeEEEEecCCCCChHHHhhhhhhccccccccccceEEEEcCeeEEecccCCceeeechHHH Confidence 56888888888654 368899888777776655543 347899999999987643221 111234789 Q ss_pred HHHhhhcCccccccc----eeccCcc-ccccc------CCHHHHHHHHhCCcEEEEEcCCEEEEEecccccccCCCCCcc Q lcl|NC_019421. 307 IAALSVSKGITGSIC----NAKTIFE-EVEPR------LSQSEVKECLKSGTLVLDFDDGDVIIVDDVNTFKKYVDDKNE 375 (473) Q Consensus 307 vAG~~a~~~~~~s~t----~~~~~~~-~~~~~------~t~~e~~~l~~~G~~~l~~~~~~~~i~~gi~T~~~~~~~~~~ 375 (473) +||++|+++..+++| |+++.++ .+... .++.|.+.|+++|++++++..++..++||-+|+.. +..++ T Consensus 290 ~ag~~a~~d~~~g~~~span~~~~gv~~~~~~~~~~~~~~~~~~~~L~~~~i~~i~~~~~~G~~~wG~rT~~~--~~~~~ 367 (477) T protein:vir:79 290 AAGLRARVDLDKGYWWSSSNQQLVGVTGVERPLSAMIDDPQSDVNMLNEQGITTVFSSYGSGLRLWGNRTAAW--PTVTH 367 (477) T ss_pred HHHHHHHhhccCCceEccCCceeecceecccccccccCCChhhHHHHhhCCceEEEEecCCcEEEEcccccCC--CCCCc Confidence 999999998766544 4444332 33222 35678999999999999988777788899988742 23467 Q ss_pred hhhhhhhhHHHHHHHHHHHHHHhhcCCcccCCHHHHHHHHHHHHHHHHHHHhcCCccCccceeccccccCCCCC--EEEE Q lcl|NC_019421. 376 AMGYISNIMFINTINKDTSLKRKEFVGKIFNDATGQTTVICALKKYFEELMSQGIISEFNVDIDTELQATAKAD--EFYW 453 (473) Q Consensus 376 ~~~~i~v~R~~d~i~~~i~~~~~~~ig~~~N~~~~r~~i~~~i~~~l~~l~~~g~i~~~~~~~D~~~~~~~~~d--~~~v 453 (473) .|++|++||++|+|+++|+...++|+++ ||++.+|..|+..|+.||++|+++|+|.+|++.||.+.+++++.+ .+++ T Consensus 368 ~~~~i~vrR~~~~i~~~~~~~~~~~v~e-~~~~~~~~~i~~~i~~~l~~l~~~g~l~g~~v~~~~~~nt~~~i~~G~~~~ 446 (477) T protein:vir:79 368 MRNFENVRRTGDVINESLRYFSQQFVDA-PIDQGLIDSLVESVNGFGRKLIGDGALLGFKAWFDPARNPKEELAAGHLLI 446 (477) T ss_pred cceeeehhhHHHHHHHHHHHHHHHhccC-CCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEecCCCCHHHhhCCeEEE Confidence 8999999999999999999888888775 899999999999999999999999999999999999888776544 6889 Q ss_pred EEEEEEeeeeeeEEEEEEeC Q lcl|NC_019421. 454 KWDAVKVDVMKKIYGTGYLG 473 (473) Q Consensus 454 ~i~v~p~~~~e~i~~t~~v~ 473 (473) ++.++|+.|+|||.+++..- T Consensus 447 ~i~~~p~~p~e~i~~~~~~~ 466 (477) T protein:vir:79 447 NYKYTVPPPLERLTYETEIT 466 (477) T ss_pred EEEEEecCCceeEEEEEEEe Confidence 99999999999999999888 No 30 >protein:vir:107310 Length: 581 # NCBI annotation: gp123 # Family: family:all:1196 # MgeID: mge:1550 # MgeName: Catera # Cross-refs: genbank:acc:YP_656131;genbank:gi:109393333;genbank:GeneID:4156791 Probab=100.00 E-value=6.3e-61 Score=350.62 Aligned_cols=436 Identities=13% Similarity=0.055 Sum_probs=286.9 Q ss_pred CCccccCCCCceecCceeEEEecC-Ccce-----ecccCceEEEEEEeeCCCCCCceEEeeccHHHHHHHcCCCcCcHHH Q lcl|NC_019421. 1 MATGTWNEKERKEIPGFYNRFKTQ-AEKS-----TNTGLKGRLAMPIRANWGDVGKVVTIKNDLRQLKNLFGDDMNYSAF 74 (473) Q Consensus 1 m~~g~~~~~~~~~~PGvYie~~~~-~~~~-----i~~~~~~~~~~~g~a~~Gp~~~~v~i~s~~~~~~~~fG~~~~~~~~ 74 (473) --.+.+...-.+ -.+.-|+|... +..+ .+++. +...-+.....|+....... + .+..-+.. ..+. T Consensus 106 i~~~~v~v~g~~-g~~~~VtF~g~~~~l~~~~~~lt~g~-~~~vtV~~~~~g~~~~~~~~-s----~~gi~~~~--~~l~ 176 (581) T protein:vir:10 106 VEDDEVTVLGDP-GGPWTVTFTKAVAALTKDVTGLTGGD-DPDLNIASEQTGVPAMNRAL-A----KKGIKTDT--IRVV 176 (581) T ss_pred CCcceEEEECCC-CceEEEEEcCCccceeeeeceecCCC-ceeEEEeccccCcccccccc-c----cccccccc--cccc Confidence 112222211110 11223344321 1111 11111 11111111122211100000 0 00000000 0000 Q ss_pred HHHHHHHhcCCCEEEEEecCCCcccceeeeecccccccccceEEEEecCcccc---ceeEEEeeccCCccceeeeeecCC Q lcl|NC_019421. 75 KLGKLALLGNVKELLLYRLVDGNQKKGTLTLKDTTENSAKDVIKLETKYPTAR---NFNVTIKSNLVDSDKKDFIFFENT 151 (473) Q Consensus 75 ~~v~~~f~~g~~~v~v~rv~~g~~~aat~~l~~~~~~~~~~~l~i~A~~~G~~---~n~i~v~~~~~~~~~~~v~v~~~~ 151 (473) ......+.--++.+-+.+...+....+. ...+..++.....|.- ++.+.++-+..|+.+++++.+.++ T Consensus 177 ~~~~~~~~~~gsd~~~~~~~~~~~~~~~---------~~~D~~t~~~~~~g~~~~~~~v~~~~~~~~d~~~~~~v~~~~~ 247 (581) T protein:vir:10 177 NPNSGQVYVLGTDYVVTRVNAGEDGEAN---------TRDDLYTIQRVVDGGHIDPGDIVQLSYRYTDPNYHEVIRFTDP 247 (581) T ss_pred ccccCcceeccccceeeecccCcccccc---------ccccceeeeeeecccccccceEEEEEEEeecCCcceeEEeecC Confidence 0000011112233444444433222221 1112233333333332 333455555677777777766666 Q ss_pred ceeeEEEecccchhhhhhhhhcccccceeEeecccCCccccccceeeeccCccccc-chhhHHHHHHHHhhcccceEEEE Q lcl|NC_019421. 152 KQLFSSSIKGTIDEIVLEINSNLDNEYVIATKVADSDTILANVVNQALEGGNDGCT-SITNESYLKALEEFERYSFDSFV 230 (473) Q Consensus 152 ~~~~~~~~~~~~~~~~~~~~~~~~s~~v~~~~~~~~~~~~~~~~~~~l~gG~dg~~-~~t~~d~~~~l~~le~~~~~~l~ 230 (473) .....+.... .....+..+.+... ....+.+.....|++|.++.. .++++||.++|++||.++++.++ T Consensus 248 ~~~~~~~~~~------~~~~g~~~~~~t~~-----~~~~~tn~~~~~l~~gvd~~g~tvt~~dy~~Al~ale~~~~~~iv 316 (581) T protein:vir:10 248 DDIQDFYGPA------FDEAGNVQSEITLC-----AQLAITNGASTILACAVDPEGDTVTMGDYQNALNKFRDEDEIAII 316 (581) T ss_pred cchhhhhhhh------hhccCccccchhhh-----heeeeecccceeEEeeccCCCCccchHHHHHHHHHHhcCCceEEE Confidence 5443332110 01111222222111 122345556678888888743 47899999999999999999999 Q ss_pred EcCCCcHHHHHHHHHHHHHHhhCCCe---EEEEEcCCCCccHHHHHHhhhccCCceEEEecCCceecCc-------ccch Q lcl|NC_019421. 231 LDGVADEALQETTKAWVAKNKELGKD---ILLFLGGKTEDNIKQINDKSKSFNDENIVNVGSSAYYENI-------KYTP 300 (473) Q Consensus 231 ~p~~~~~~~~~~l~~~v~~~~~~~~~---~~av~~~~~~~t~~~~~~~~~~~n~~~i~~~~~~~~~~~~-------~~~~ 300 (473) +|+++++++|+++++||++|+++++. +++|.++....+.+.+.+++..+|++|+++++|+....+. .+++ T Consensus 317 v~~t~~~~v~a~l~ahv~~~s~~~~~~ravigV~g~~~~~~~~~~~~~a~~~n~~Rvvlv~p~~~~~~g~~~~~~v~lp~ 396 (581) T protein:vir:10 317 VAGTGAQPIQALVQQHVSAQSNNKYERRAILGMDGSVTPVPSATRIANAQSIKDQRVALISPSSFVYYAPELNREVVLGG 396 (581) T ss_pred EeCCCCHHHHHHHHHHHHHHHhccCCcEEEEEecCCCCCccHHHHHHhhccCCCceEEEEecCceeecCcccCceeccch Confidence 99999999999999999999976544 4666667777788888999999999999999987755332 3788 Q ss_pred HHHHHHHHHhhhcCccccccceeccC-cccccccCCHHHHHHHHhCCcEEEEE-cCCEEEEEecccccccCCCCCcchhh Q lcl|NC_019421. 301 SEVAVYIAALSVSKGITGSICNAKTI-FEEVEPRLSQSEVKECLKSGTLVLDF-DDGDVIIVDDVNTFKKYVDDKNEAMG 378 (473) Q Consensus 301 ~~~a~~vAG~~a~~~~~~s~t~~~~~-~~~~~~~~t~~e~~~l~~~G~~~l~~-~~~~~~i~~gi~T~~~~~~~~~~~~~ 378 (473) +++||++||++++.++++|++++++. ...+..++++.|+++|+++|+++|++ ++++++|+|||||+++ +++|+ T Consensus 397 y~~AA~vAGl~a~~~~~~slT~~~i~gi~~l~~~~s~~e~e~ll~~Gv~~l~~~~~~~v~Iv~gItT~~s-----~~~~~ 471 (581) T protein:vir:10 397 QFMAAAVAGKSVSAIAAMPLTRKVIRGFSGPAEVQRDGEKSRESSEGLMVIEKTPRNLVHVRHGVTTDPT-----SLHTR 471 (581) T ss_pred hhHHHHHHHHhhccccccCcccccccccccccccCCHHHHHHHHhCCeEEEEEecCCeEEEEeeeecCCC-----CCcce Confidence 99999999999999999999999997 45688899999999999999999986 6778999999999876 67899 Q ss_pred hhhhhHHHHHHHHHHHHHH--hhcCCcccCCHHHHHHHHHHHHHHHHHHHhcCCccCccceeccccccCCCCCEEEEEEE Q lcl|NC_019421. 379 YISNIMFINTINKDTSLKR--KEFVGKIFNDATGQTTVICALKKYFEELMSQGIISEFNVDIDTELQATAKADEFYWKWD 456 (473) Q Consensus 379 ~i~v~R~~d~i~~~i~~~~--~~~ig~~~N~~~~r~~i~~~i~~~l~~l~~~g~i~~~~~~~D~~~~~~~~~d~~~v~i~ 456 (473) +|+++|++||+.+++|..+ ++|||+ ||++.+|++||++|.+||.+|+++|+|++|....+ .+...++|.++|++. T Consensus 472 ~i~~iR~~D~v~~~ir~~~~~~~fIG~-~n~~~~r~~ik~~i~~~L~~l~~~g~I~~~~~~~~--~~~~~~~d~v~V~i~ 548 (581) T protein:vir:10 472 EWNIIGQQDVMVYRIRDYLDADGLIGM-PIYDTTIVQVKASAEAALVWLVDNNIIRGYRNLKA--RQIERQPDVIEVRYE 548 (581) T ss_pred eeeeehhhhHHHHHHHHHhhhhcCCCc-ccCHHHHHHHHHHHHHHHHHHHhcCcccCCcccee--eeeecCCCEEEEEEE Confidence 9999999999999999765 579997 89999999999999999999999999999985443 345578889999999 Q ss_pred EEEeeeeeeEEEEEEeC Q lcl|NC_019421. 457 AVKVDVMKKIYGTGYLG 473 (473) Q Consensus 457 v~p~~~~e~i~~t~~v~ 473 (473) ++|+++|||||+++.+. T Consensus 549 v~Pv~~i~~I~vti~~~ 565 (581) T protein:vir:10 549 WRPAYPLNYIVVRYSIA 565 (581) T ss_pred EEecccceEEEEEEEEe Confidence 99999999999999999 No 31 >protein:vir:107865 Length: 477 # NCBI annotation: gp39 # Family: family:all:115 # MgeID: mge:1565 # MgeName: BcepMu # Cross-refs: genbank:acc:YP_024712;genbank:gi:48696949;genbank:GeneID:2845953 Probab=100.00 E-value=9.1e-61 Score=349.75 Aligned_cols=433 Identities=10% Similarity=0.046 Sum_probs=285.7 Q ss_pred CCccccCCCCceecCceeEEEecCCcceecccCceEEEEEEeeCCCCCCceEEeeccHHHHHHHcCCCcCcHHHHHHHHH Q lcl|NC_019421. 1 MATGTWNEKERKEIPGFYNRFKTQAEKSTNTGLKGRLAMPIRANWGDVGKVVTIKNDLRQLKNLFGDDMNYSAFKLGKLA 80 (473) Q Consensus 1 m~~g~~~~~~~~~~PGvYie~~~~~~~~i~~~~~~~~~~~g~a~~Gp~~~~v~i~s~~~~~~~~fG~~~~~~~~~~v~~~ 80 (473) |+ ++ -+|||||+|++++.++|.+++|++.+|+|.+++||+|+|++|+|+.+ ++.++|.....++.++++.+ T Consensus 1 M~-------~~-~~pGVyv~E~~~~~~~i~~v~T~v~~~VG~a~~gp~n~pv~its~~d-~~~~g~~~~~~tL~~Av~~~ 71 (477) T protein:vir:10 1 MA-------AN-YLHGVETIEKETGSRPVKVVKSAVIGLIGTAPIGPVNTPVQSLSDVD-AAQFGPQLAGFTIPQALDAV 71 (477) T ss_pred Cc-------cc-CCCCeEEEEccCCcccccccCCceeEEEecccCCCCCcCEEEccHHH-HHHhccCCCCCcHHHHHHHH Confidence 43 11 25999999999999999999999999999999999999999999655 67777777778899999999 Q ss_pred HhcCCCEEEEEecCCCcccceeeeecccccccccceEEEEecCccccceeEEEeeccCCccce---eeeeecCCceeeEE Q lcl|NC_019421. 81 LLGNVKELLLYRLVDGNQKKGTLTLKDTTENSAKDVIKLETKYPTARNFNVTIKSNLVDSDKK---DFIFFENTKQLFSS 157 (473) Q Consensus 81 f~~g~~~v~v~rv~~g~~~aat~~l~~~~~~~~~~~l~i~A~~~G~~~n~i~v~~~~~~~~~~---~v~v~~~~~~~~~~ 157 (473) |.||+.+|+++|+.++...+++......... .....+...+.+.+...+.......... +........ T Consensus 72 f~nGg~~~~vVrV~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~v~~~a~~~~~~~~~~~~~~~~~~----- 142 (477) T protein:vir:10 72 YDYGSGTVIVINVLDPAVHKSNAANEPVTFD----AATGRAKLAHPAAANLVLKNDSGGTTYAEGTDYAVDLING----- 142 (477) T ss_pred HhccceEEEEEecCccccccccccccccccc----cccceecccccccccccccccccccccccchhhhhhhccc----- Confidence 9999999999999887665554432211111 1111122223333322222111110000 000000000 Q ss_pred EecccchhhhhhhhhcccccceeEeecccCCccccccceeeeccCcccccchhhHHHHHHHHhhcc------cceEEEEE Q lcl|NC_019421. 158 SIKGTIDEIVLEINSNLDNEYVIATKVADSDTILANVVNQALEGGNDGCTSITNESYLKALEEFER------YSFDSFVL 231 (473) Q Consensus 158 ~~~~~~~~~~~~~~~~~~s~~v~~~~~~~~~~~~~~~~~~~l~gG~dg~~~~t~~d~~~~l~~le~------~~~~~l~~ 231 (473) ....... .............. ....+.........+..+... ...++++|+. ....+++. T Consensus 143 ----~~~~~~~---~~~~~~~~~~~~~~-~~~~~~~~~~~~~~g~~~~~~------~~tGl~al~~~~~~~~~~~~~l~a 208 (477) T protein:vir:10 143 ----VITRIKT---GTIPPGATAAKATY-DYADPTKVTAADIIGAVNAAG------MRTGMKALKDTYNLYGYFSKILIA 208 (477) T ss_pred ----cceeccc---ccccccceeeeecc-ccccccccccccccccccccc------hhhhhhhhhhhhhhcchhcccccc Confidence 0000000 00000000000000 001111111111222222111 1122333322 12467778 Q ss_pred cCCC-cHHHHHHHHHHHHHHhhCCCeEEEEEcCCCCccHHHHHHhhh-------ccCCceEEEecCCceecCcc---cch Q lcl|NC_019421. 232 DGVA-DEALQETTKAWVAKNKELGKDILLFLGGKTEDNIKQINDKSK-------SFNDENIVNVGSSAYYENIK---YTP 300 (473) Q Consensus 232 p~~~-~~~~~~~l~~~v~~~~~~~~~~~av~~~~~~~t~~~~~~~~~-------~~n~~~i~~~~~~~~~~~~~---~~~ 300 (473) |+.. +..+++++.++|+++ ++++++..+...+.+++..... .+++++++.++||....+.. ... T Consensus 209 pg~~~~~~v~~~l~~~~~~~-----~~~~~~d~p~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~ 283 (477) T protein:vir:10 209 PAYCTQNSVSVELEAMAVQL-----GAIAYIDAPIGTTLAQALAGRGPAGTINFNTSSDRVRLCYPHVKVYDTATNAERL 283 (477) T ss_pred cccccchhhHHHHHHHHhhC-----CEEEEEecCCCCCHHHHHhhhhhccccccccccceEEEEcCeEEEecccCCceeE Confidence 8754 456888888887654 3688888887777766655443 56899999999988653321 111 Q ss_pred HHHHHHHHHhhhcCccccc----cceeccCc-cccccc------CCHHHHHHHHhCCcEEEEEcCCEEEEEecccccccC Q lcl|NC_019421. 301 SEVAVYIAALSVSKGITGS----ICNAKTIF-EEVEPR------LSQSEVKECLKSGTLVLDFDDGDVIIVDDVNTFKKY 369 (473) Q Consensus 301 ~~~a~~vAG~~a~~~~~~s----~t~~~~~~-~~~~~~------~t~~e~~~l~~~G~~~l~~~~~~~~i~~gi~T~~~~ 369 (473) ...++++||++|++|..++ +.|+++.+ ..+... .++.|.+.|+++|++++++..++..++||-+|+.. T Consensus 284 ~p~s~~~ag~~a~~d~~~g~~~span~~~~gi~~~~~~~~~~~~~~~~~~~~L~~~gi~~i~~~~~~G~~~wG~rT~~~- 362 (477) T protein:vir:10 284 EPLSSRAAGLRARVDLDKGYWWSSSNQQLVGVTGVERPLSAMIDDPQSDVNMLNEQGITTVFSSYGSGLRLWGNRTAAW- 362 (477) T ss_pred EchHHHHHHHHHHhhhcCCceeccCCceeccccccccccccccCCChhhHHHHhhCCceEEEEecCCcEEEEcccccCC- Confidence 2247899999999986655 45555542 233222 35678999999999999988777778899988743 Q ss_pred CCCCcchhhhhhhhHHHHHHHHHHHHHHhhcCCcccCCHHHHHHHHHHHHHHHHHHHhcCCccCccceeccccccCCCCC Q lcl|NC_019421. 370 VDDKNEAMGYISNIMFINTINKDTSLKRKEFVGKIFNDATGQTTVICALKKYFEELMSQGIISEFNVDIDTELQATAKAD 449 (473) Q Consensus 370 ~~~~~~~~~~i~v~R~~d~i~~~i~~~~~~~ig~~~N~~~~r~~i~~~i~~~l~~l~~~g~i~~~~~~~D~~~~~~~~~d 449 (473) +..++.|++|++||++|+|++++++.+.+|+++ ||++.+|..++..|+.||+.|+++|+|.+|++.||.+.+++++.. T Consensus 363 -~~~~~~~~~~~vrR~~~~i~~~~~~~~~~~v~~-~~~~~~~~~i~~~i~~~l~~l~~~g~l~g~~v~~~~~~nt~~~i~ 440 (477) T protein:vir:10 363 -PTVTHMRNFENVRRTGDVINESLRYFSQQFVDA-PIDQGLIDSLVESVNGFGRKLIGDGALLGFKAWFDPARNPKEELA 440 (477) T ss_pred -CCCCcccceeehhhHHHHHHHHHHHHHHHhccC-CCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEecCCCCHHHhh Confidence 234678999999999999999999888888875 899999999999999999999999999999999999988776544 Q ss_pred --EEEEEEEEEEeeeeeeEEEEEEeC Q lcl|NC_019421. 450 --EFYWKWDAVKVDVMKKIYGTGYLG 473 (473) Q Consensus 450 --~~~v~i~v~p~~~~e~i~~t~~v~ 473 (473) .+++++.++|+.|+|+|.++++.- T Consensus 441 ~G~~~~~i~~~p~~p~e~i~~~~~~~ 466 (477) T protein:vir:10 441 AGHLLINYKYTVPPPLERLTYETEIT 466 (477) T ss_pred CCeEEEEEEEEecCCcceEEEEEEEc Confidence 788999999999999999999888 No 32 >protein:vir:7653 Length: 581 # NCBI annotation: gp124 # Family: family:all:1196 # MgeID: mge:148 # MgeName: Bxz1 # Cross-refs: genbank:acc:NP_818197;genbank:gi:29566631;genbank:GeneID:1259809 Probab=100.00 E-value=3.2e-60 Score=346.76 Aligned_cols=429 Identities=13% Similarity=0.061 Sum_probs=278.2 Q ss_pred CCccccCCCCceecCceeEEEecCCcceecccCceEEEEEEeeCCCCCCceEEeeccHHHHHHHcCCCc-CcHHH----- Q lcl|NC_019421. 1 MATGTWNEKERKEIPGFYNRFKTQAEKSTNTGLKGRLAMPIRANWGDVGKVVTIKNDLRQLKNLFGDDM-NYSAF----- 74 (473) Q Consensus 1 m~~g~~~~~~~~~~PGvYie~~~~~~~~i~~~~~~~~~~~g~a~~Gp~~~~v~i~s~~~~~~~~fG~~~-~~~~~----- 74 (473) -..+.++.+-++ ..+--|+|... ..++.. -+.+. .|..+..+.|..... |... +..+. T Consensus 106 i~~~~v~vtg~~-~~~~~V~F~g~-~~~~~~------~~~~l--tg~~~~~~~V~~~~~------G~~~~~~~l~~~g~~ 169 (581) T protein:vir:76 106 VEDDEVTVLGDP-GGPWTVTFTKA-VAALTK------DVTGL--TGGDNPDLNIASEQT------GVPAMNRALAKKGIK 169 (581) T ss_pred CCCceEEEEcCC-CceEEEEEcCC-ccceeE------eeeee--ecCCcceeEEEEEec------CcCCcCceeeecccc Confidence 112222221111 11222333311 011100 00000 111111122211000 0000 00000 Q ss_pred -HHHHHHHhcCC------CEEEEEecCCCcccceeeeecccccccccceEEEEecCcccc---ceeEEEeeccCCcccee Q lcl|NC_019421. 75 -KLGKLALLGNV------KELLLYRLVDGNQKKGTLTLKDTTENSAKDVIKLETKYPTAR---NFNVTIKSNLVDSDKKD 144 (473) Q Consensus 75 -~~v~~~f~~g~------~~v~v~rv~~g~~~aat~~l~~~~~~~~~~~l~i~A~~~G~~---~n~i~v~~~~~~~~~~~ 144 (473) ......-.+.+ ...-+-|+..+....+. + ..+..++.....|.- +..+.+.-...|+.+++ T Consensus 170 ~~~~~~~~s~~~~~~~l~~~~~~~~~~~~~~~~~~--~-------~~~~~t~~~~~~g~~~~~~~i~~~~~~~~D~~~~~ 240 (581) T protein:vir:76 170 TDTIRVVNPNSGQVYVLGTDYVVTRVNAGEDGEAN--T-------RDDLYTIQRVVDGGHIDPGDIVQLSYRYTDPNYHE 240 (581) T ss_pred ccccceeecCCcceeeecccccceeeccCccccee--e-------eeeeeeeEeecccccccceeEEEEEEEeecCCccc Confidence 00000001111 11112222332221111 1 112233333333322 12223334456666666 Q ss_pred eeeecCCceeeEEEecccchhhhhhhhhcccccceeEeecccCCccccccceeeeccCcccc-cchhhHHHHHHHHhhcc Q lcl|NC_019421. 145 FIFFENTKQLFSSSIKGTIDEIVLEINSNLDNEYVIATKVADSDTILANVVNQALEGGNDGC-TSITNESYLKALEEFER 223 (473) Q Consensus 145 v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~v~~~~~~~~~~~~~~~~~~~l~gG~dg~-~~~t~~d~~~~l~~le~ 223 (473) +..+.+......+.... .+ ...+..+.+... ......+.+...|++|.|+. ..++++||.++|++||. T Consensus 241 ~v~~~~~~~~~~~~~~~-~~-----~~g~~~~e~~~~-----~~~~~t~~~~~~l~~gvd~~g~tvt~~dy~~aL~ale~ 309 (581) T protein:vir:76 241 VIRFTDPDDIQDFYGPA-FD-----EAGNVQSEITLC-----AQLAITNGASTILACAVDPEGDTVTMGDYQNALNKFRD 309 (581) T ss_pred eEEEecccccccceeee-hh-----hcCccccchhhh-----hheeeccccceEEEeeecCCCCccchHHHHHHHHHHhc Confidence 66665554433332211 01 111222222111 11234555567888888874 35789999999999999 Q ss_pred cceEEEEEcCCCcHHHHHHHHHHHHHHhhCCCe---EEEEEcCCCCccHHHHHHhhhccCCceEEEecCCceecC----- Q lcl|NC_019421. 224 YSFDSFVLDGVADEALQETTKAWVAKNKELGKD---ILLFLGGKTEDNIKQINDKSKSFNDENIVNVGSSAYYEN----- 295 (473) Q Consensus 224 ~~~~~l~~p~~~~~~~~~~l~~~v~~~~~~~~~---~~av~~~~~~~t~~~~~~~~~~~n~~~i~~~~~~~~~~~----- 295 (473) ++++.+++|++.++++|+++++||+++++.+++ ++++++++...+.+.+.+++..+|++|+++++++..... T Consensus 310 ~~~~~ivvp~t~~~~i~a~l~ahv~~~s~~~~~~ra~igv~g~~~~~~~~~~~~~a~~~ns~Rvvlv~p~~~~~~g~~~~ 389 (581) T protein:vir:76 310 EDEIAIIVAGTGAQPIQALVQQHVSAQSNNKYERRAILGMDGSVTPVPSATRIANAQSIKDQRVALISPSSFVYYAPELN 389 (581) T ss_pred CCeEEEEEecCCChHHHHHHHHHHHHHHhccCCceEEEEeeCCCCCchHHHHHHhhcccCCCcEEEEEcCceEeccccCC Confidence 999999999999999999999999999976544 466677777778888999999999999999998765432 Q ss_pred --cccchHHHHHHHHHhhhcCccccccceeccCc-ccccccCCHHHHHHHHhCCcEEEEE-cCCEEEEEecccccccCCC Q lcl|NC_019421. 296 --IKYTPSEVAVYIAALSVSKGITGSICNAKTIF-EEVEPRLSQSEVKECLKSGTLVLDF-DDGDVIIVDDVNTFKKYVD 371 (473) Q Consensus 296 --~~~~~~~~a~~vAG~~a~~~~~~s~t~~~~~~-~~~~~~~t~~e~~~l~~~G~~~l~~-~~~~~~i~~gi~T~~~~~~ 371 (473) ..++++++++++||++++.++++|++++++.+ ..+..++++.|+++|+++|+++|++ ++++++|+|||||+++ T Consensus 390 ~~~~lp~~~~AA~vAG~~a~~~~~~slT~~~i~g~~~~~~~~s~~e~e~ll~~Gv~~l~~~~~~~v~Iv~gItT~~s--- 466 (581) T protein:vir:76 390 REVVLGGQFMAAAVAGKSVSAIAAMPLTRKVIRGFSGPAEVQRDGEKSRESSEGLMVIEKTPRNLVHVRHGVTTDPT--- 466 (581) T ss_pred cceecchhhhhhhHHhhhhccccccCcccccccccccccccCCHHHHHHHHhCCeEEEEEecCCeEEEEEeeecCCC--- Confidence 23678888999999999999999999999985 4688899999999999999999986 6778999999999976 Q ss_pred CCcchhhhhhhhHHHHHHHHHHHHHH--hhcCCcccCCHHHHHHHHHHHHHHHHHHHhcCCccCccceeccccccCCCCC Q lcl|NC_019421. 372 DKNEAMGYISNIMFINTINKDTSLKR--KEFVGKIFNDATGQTTVICALKKYFEELMSQGIISEFNVDIDTELQATAKAD 449 (473) Q Consensus 372 ~~~~~~~~i~v~R~~d~i~~~i~~~~--~~~ig~~~N~~~~r~~i~~~i~~~l~~l~~~g~i~~~~~~~D~~~~~~~~~d 449 (473) ++.|++|+++|++||+.+++|..+ ++|+|+ ||++.+|++||++|.+||.+|+++|+|++|+..++. +...++| T Consensus 467 --~~~~k~i~viR~~D~v~~~vr~~~~~~~fiG~-~n~~~~r~~ik~~i~~~L~~l~~~g~I~g~~~~~~~--~~~~~~d 541 (581) T protein:vir:76 467 --SLHTREWNIIGQQDVMVYRIRDYLDADGLIGM-PIYDTTIVQVKASAEAALVWLVDNNIIRGYRNLKAR--QIERQPD 541 (581) T ss_pred --CCccceeeehhhhHHHHHHHHHHHhhhcCCCc-ccChHHHHHHHHHHHHHHHHHHhcCcccCcccceee--EEecCCC Confidence 678999999999999999999765 469997 899999999999999999999999999999865554 3456788 Q ss_pred EEEEEEEEEEeeeeeeEEEEEEeC Q lcl|NC_019421. 450 EFYWKWDAVKVDVMKKIYGTGYLG 473 (473) Q Consensus 450 ~~~v~i~v~p~~~~e~i~~t~~v~ 473 (473) .++|++.++|+++|||||+++++. T Consensus 542 ~v~V~i~v~Pv~~ie~I~vt~~~~ 565 (581) T protein:vir:76 542 VIEVRYEWRPAYPLNYIVVRYSIA 565 (581) T ss_pred EEEEEEEEEecccceEEEEEEEEe Confidence 999999999999999999999999 No 33 >protein:vir:6079 Length: 396 # NCBI annotation: gpFI # Family: family:all:115 # MgeID: mge:126 # MgeName: WPhi # Cross-refs: genbank:acc:NP_878220;genbank:gi:33438919;genbank:GeneID:1457754 Probab=100.00 E-value=6e-53 Score=306.87 Aligned_cols=358 Identities=12% Similarity=0.047 Sum_probs=268.3 Q ss_pred CCceecCceeEEEecCCcceecccCceEEEEEEeeCCC-----CCCceEEeeccHHHHHHHcCCCcCcHHHHHHHHHHhc Q lcl|NC_019421. 9 KERKEIPGFYNRFKTQAEKSTNTGLKGRLAMPIRANWG-----DVGKVVTIKNDLRQLKNLFGDDMNYSAFKLGKLALLG 83 (473) Q Consensus 9 ~~~~~~PGvYie~~~~~~~~i~~~~~~~~~~~g~a~~G-----p~~~~v~i~s~~~~~~~~fG~~~~~~~~~~v~~~f~~ 83 (473) +.++ +|||||+|++++.+++.++++++.+|+|.++.+ |.++|++|+|. .++...||... .+.++++.+|.+ T Consensus 1 m~~~-~~Gv~v~e~~~~~~~v~~~~tav~~fvGta~~~~~~~~p~~~p~~v~s~-~~~~~~~g~~~--tl~~a~~~~~~~ 76 (396) T protein:vir:60 1 MSDY-HHGVQVLEINEGTRVISTVSTAIVGMVCTASDADAEIFPLNKPVLITNV-QSAIAKAGKKG--TLAASLQAIADQ 76 (396) T ss_pred CCCC-CCCeEEEEcCCCcccccccCceeEEEEecccccccccccCccCeEeech-HHHHHhhcCcc--hhHHHHHHHhhc Confidence 4555 499999999999999999999999999988553 88999999995 45777777543 567789999999 Q ss_pred CCCEEEEEecCCCcccceeeeecccccccccceEEEEecCccccceeEEEeeccCCccceeeeeecCCceeeEEEecccc Q lcl|NC_019421. 84 NVKELLLYRLVDGNQKKGTLTLKDTTENSAKDVIKLETKYPTARNFNVTIKSNLVDSDKKDFIFFENTKQLFSSSIKGTI 163 (473) Q Consensus 84 g~~~v~v~rv~~g~~~aat~~l~~~~~~~~~~~l~i~A~~~G~~~n~i~v~~~~~~~~~~~v~v~~~~~~~~~~~~~~~~ 163 (473) ++..++++++..+........ . . +... . T Consensus 77 gg~~~~vv~~~~~~~~~~~~~------------~----------~----~~~~-------~------------------- 104 (396) T protein:vir:60 77 SKPVTVVVRVEDGTGEDEETK------------L----------A----QTVS-------N------------------- 104 (396) T ss_pred cCceEEEEecccccccccccc------------c----------c----cccc-------c------------------- Confidence 999999999876532211000 0 0 0000 0 Q ss_pred hhhhhhhhhcccccceeEeecccCCccccccceeeeccCcccccchhhHHHHHHHHh---hcccceEEEEEcCCCcHHHH Q lcl|NC_019421. 164 DEIVLEINSNLDNEYVIATKVADSDTILANVVNQALEGGNDGCTSITNESYLKALEE---FERYSFDSFVLDGVADEALQ 240 (473) Q Consensus 164 ~~~~~~~~~~~~s~~v~~~~~~~~~~~~~~~~~~~l~gG~dg~~~~t~~d~~~~l~~---le~~~~~~l~~p~~~~~~~~ 240 (473) ..++.+.....+. ..+|.. +......++++|+..+..++ T Consensus 105 -----------------------------------~~~~~d~~~~~tg---~~al~~~~~~~~~~~~il~ap~~~~~~v~ 146 (396) T protein:vir:60 105 -----------------------------------IIGTTDENGQYTG---LKALLAAESVTGVKPRILGVPGLDTKEVA 146 (396) T ss_pred -----------------------------------ccccccccccccc---hhhhhhcccceeeeeeeccccccccHHHH Confidence 0000000000000 011111 11234567778888888888 Q ss_pred HHHHHHHHHHhhCCCeEEEEEcCCCCccHHHHHHhhhccCCceEEEecCCceecCc---ccchHHHHHHHHHhhhcCccc Q lcl|NC_019421. 241 ETTKAWVAKNKELGKDILLFLGGKTEDNIKQINDKSKSFNDENIVNVGSSAYYENI---KYTPSEVAVYIAALSVSKGIT 317 (473) Q Consensus 241 ~~l~~~v~~~~~~~~~~~av~~~~~~~t~~~~~~~~~~~n~~~i~~~~~~~~~~~~---~~~~~~~a~~vAG~~a~~~~~ 317 (473) +++.++|+++ .+++++..+...+.+.+.+....+++.+++++.||....+. .......++++||++|++|.. T Consensus 147 ~al~~~~~~~-----~~~~i~d~p~~~~~~~a~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~~AG~~a~~d~~ 221 (396) T protein:vir:60 147 VALASVCQKL-----RAFGYISAWGCKTISEVKAYRQNFSQRELMVIWPDFLAWDTVASTTATAYATARALGLRAKIDQE 221 (396) T ss_pred HHHHHHhccC-----CeEEEEeCCCCCCHHHHHHHHhhcCCceEEEEeCceeeecccCCceeEEchhHHHHHHHHHhhhc Confidence 8888887654 46888998888999999999999999999999998865332 112223478999999999977 Q ss_pred cc----cceeccCcc-ccc------ccCCHHHHHHHHhCCcEEEEEcCCEEEEEecccccccCCCCCcchhhhhhhhHHH Q lcl|NC_019421. 318 GS----ICNAKTIFE-EVE------PRLSQSEVKECLKSGTLVLDFDDGDVIIVDDVNTFKKYVDDKNEAMGYISNIMFI 386 (473) Q Consensus 318 ~s----~t~~~~~~~-~~~------~~~t~~e~~~l~~~G~~~l~~~~~~~~i~~gi~T~~~~~~~~~~~~~~i~v~R~~ 386 (473) ++ +.|+++.++ ... ...+..|.+.|+++|++++.+ +++ .++||-+|+.+ |+.|++|++||++ T Consensus 222 ~g~~~spaN~~l~gi~~~~~~~~~~~~~~~~~~~~Ln~~gI~~~~~-~~G-~~~wG~rT~~~-----d~~~~~i~~rR~~ 294 (396) T protein:vir:60 222 QGWHKTLSNVGVNGVTGISASVFWDLQESGTDADLLNESGVTTLIR-RDG-FRFWGNRTCSD-----DPLFLFENYTRTA 294 (396) T ss_pred cCcEeCcCCceecceeeceeecccccCCCcchhhhhhhcCcEEEEc-CCC-EEEEcccccCC-----CcccceeehhhHH Confidence 74 445544332 222 234667899999999999965 333 55699998753 7789999999999 Q ss_pred HHHHHHHHHHHhhcCCcccCCHHHHHHHHHHHHHHHHHHHhcCCccCccceeccccccCCCC--CEEEEEEEEEEeeeee Q lcl|NC_019421. 387 NTINKDTSLKRKEFVGKIFNDATGQTTVICALKKYFEELMSQGIISEFNVDIDTELQATAKA--DEFYWKWDAVKVDVMK 464 (473) Q Consensus 387 d~i~~~i~~~~~~~ig~~~N~~~~r~~i~~~i~~~l~~l~~~g~i~~~~~~~D~~~~~~~~~--d~~~v~i~v~p~~~~e 464 (473) |+|+++|+....+|+++ ||++.+|.+++..|+.||++|+++|+|.+|++.||++.+++++. ..+++++.++|+.|+| T Consensus 295 ~~i~~~i~~~~~~~v~e-~n~~~~~~~i~~~i~~~l~~l~~~gal~g~~~~~d~~~nt~~~i~~G~~~~~i~~~p~~pae 373 (396) T protein:vir:60 295 QVLADTMAEAHMWAVDK-PITATLIRDIVDGINAKFRELKTNGYIVDATCWFSEESNDAETLKAGKLYIDYDYTPVPPLE 373 (396) T ss_pred HHHHHHHHHHHHHhccC-CCCHHHHHHHHHHHHHHHHHHHhCCceeceEEEEecCCCCHHHhhCCEEEEEEEEEecCCcc Confidence 99999999888888885 89999999999999999999999999999999999998887654 4788889999999999 Q ss_pred eEEEEEEeC Q lcl|NC_019421. 465 KIYGTGYLG 473 (473) Q Consensus 465 ~i~~t~~v~ 473 (473) ||.++++.- T Consensus 374 ~I~~~~~~~ 382 (396) T protein:vir:60 374 NLTLRQRIT 382 (396) T ss_pred eEEEEEEEc Confidence 999999998 No 34 >protein:vir:1845 Length: 392 # NCBI annotation: tail sheath protein FI # Family: family:all:115 # MgeID: mge:324 # MgeName: 186 # Cross-refs: genbank:acc:NP_052269;genbank:gi:9634076;genbank:GeneID:1262448 Probab=100.00 E-value=2e-52 Score=303.99 Aligned_cols=355 Identities=12% Similarity=0.042 Sum_probs=269.4 Q ss_pred CCceecCceeEEEecCCcceecccCceEEEEEEeeCCC-----CCCceEEeeccHHHHHHHcCCCcCcHHHHHHHHHHhc Q lcl|NC_019421. 9 KERKEIPGFYNRFKTQAEKSTNTGLKGRLAMPIRANWG-----DVGKVVTIKNDLRQLKNLFGDDMNYSAFKLGKLALLG 83 (473) Q Consensus 9 ~~~~~~PGvYie~~~~~~~~i~~~~~~~~~~~g~a~~G-----p~~~~v~i~s~~~~~~~~fG~~~~~~~~~~v~~~f~~ 83 (473) ..+. +||||++|..++.+++.+.++++.+|+|++..+ |.++|++|+++.+ +...||.+. .+..++..+|.+ T Consensus 1 m~~~-~~Gv~v~e~~~g~~~i~~~~tav~g~vgta~~~~~~~~~~~~p~~its~~~-~~~~~g~~g--tl~~al~~~~~n 76 (392) T protein:vir:18 1 MSDF-HHGTKVIEINDGTRVISTVATAIVGMVWTASDADAETFPLNEPVLITNVQS-AIAKAGKKG--TLSASLQAIADQ 76 (392) T ss_pred CCCC-CCCeEEEEcCCCceeeeccCcceeEEEEeccCCCCcccccccceEeechHH-HHhhcCCCc--chHHHHHHhhcc Confidence 4555 699999999999999999999999999999765 8899999999655 666677542 466788999999 Q ss_pred CCCEEEEEecCCCcccceeeeecccccccccceEEEEecCccccceeEEEeeccCCccceeeeeecCCceeeEEEecccc Q lcl|NC_019421. 84 NVKELLLYRLVDGNQKKGTLTLKDTTENSAKDVIKLETKYPTARNFNVTIKSNLVDSDKKDFIFFENTKQLFSSSIKGTI 163 (473) Q Consensus 84 g~~~v~v~rv~~g~~~aat~~l~~~~~~~~~~~l~i~A~~~G~~~n~i~v~~~~~~~~~~~v~v~~~~~~~~~~~~~~~~ 163 (473) ++..++++++..+....... ++. . + T Consensus 77 gg~~~~vv~v~~~~~~~~~~-------------~t~--------~---------------d------------------- 101 (392) T protein:vir:18 77 SKPVTVVVRVAEGTGDDAEA-------------QTT--------S---------------N------------------- 101 (392) T ss_pred cCceEEEecccccccccccc-------------cch--------h---------------h------------------- Confidence 99999998876543211100 000 0 0 Q ss_pred hhhhhhhhhcccccceeEeecccCCccccccceeeeccCcccccchhhHHHHHHHHhhc---ccceEEEEEcCCCcHHHH Q lcl|NC_019421. 164 DEIVLEINSNLDNEYVIATKVADSDTILANVVNQALEGGNDGCTSITNESYLKALEEFE---RYSFDSFVLDGVADEALQ 240 (473) Q Consensus 164 ~~~~~~~~~~~~s~~v~~~~~~~~~~~~~~~~~~~l~gG~dg~~~~t~~d~~~~l~~le---~~~~~~l~~p~~~~~~~~ 240 (473) +.|+.+.....+. ..+|...+ ...++++++|+.++..++ T Consensus 102 -----------------------------------liG~~~~~~~~tg---~~al~~~~~~~~~~p~il~ap~~~~~~v~ 143 (392) T protein:vir:18 102 -----------------------------------IIGGTDENGKYTG---IKALLTAEAVTGVKPRILGVPGLDTQEVA 143 (392) T ss_pred -----------------------------------heecccccchhhh---HHHHHhhhhhhceeehhcccCccchHHHH Confidence 0000000000000 01122211 234678888998888888 Q ss_pred HHHHHHHHHHhhCCCeEEEEEcCCCCccHHHHHHhhhccCCceEEEecCCceecCcc---cchHHHHHHHHHhhhcCccc Q lcl|NC_019421. 241 ETTKAWVAKNKELGKDILLFLGGKTEDNIKQINDKSKSFNDENIVNVGSSAYYENIK---YTPSEVAVYIAALSVSKGIT 317 (473) Q Consensus 241 ~~l~~~v~~~~~~~~~~~av~~~~~~~t~~~~~~~~~~~n~~~i~~~~~~~~~~~~~---~~~~~~a~~vAG~~a~~~~~ 317 (473) +.+.++|++++ +++++.++.+.+.+++.++...+++.+.++++||....+.. ......++++||++++++.+ T Consensus 144 ~~l~~~~~~~~-----~~~~~d~~~~~~~~~a~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~~AG~~a~~d~~ 218 (392) T protein:vir:18 144 TALASVCISLR-----AFGYVSAWGCKTISEAMAYRENFSQRELMVIWPDFLAWDTTANATATAYATARALGLRAYIDQT 218 (392) T ss_pred HHHHHHHhhcC-----cEEEEecCCCCCHHHHHHHHhhccCceEEEEeCceeeecccCCceEEechHHHHHHHHHhhhcc Confidence 99998887653 57888888889999999999999999999999998754322 11223478999999999866 Q ss_pred c----ccceeccCc-ccccc------cCCHHHHHHHHhCCcEEEEEcCCEEEEEecccccccCCCCCcchhhhhhhhHHH Q lcl|NC_019421. 318 G----SICNAKTIF-EEVEP------RLSQSEVKECLKSGTLVLDFDDGDVIIVDDVNTFKKYVDDKNEAMGYISNIMFI 386 (473) Q Consensus 318 ~----s~t~~~~~~-~~~~~------~~t~~e~~~l~~~G~~~l~~~~~~~~i~~gi~T~~~~~~~~~~~~~~i~v~R~~ 386 (473) + |+.|+++.+ .++.. ..+..|.+.|+++|++++.+ +++ .++||-+|+.+ |+.|++|++||++ T Consensus 219 ~g~~~spaN~~l~gi~~~~~~~~~~~~~~~~~~~~Ln~~gI~t~~~-~~G-~~~wG~rT~~~-----d~~~~~i~~rR~~ 291 (392) T protein:vir:18 219 IGWHKTLSNVGVQGVTGISASVFWDLQASGTDADLLNEAGVTTLVR-KDG-FRFWGNRTCSD-----DPLFLFENYTRTA 291 (392) T ss_pred CCceEccCCceeeceeecceecccccCCCcchhhhhhhcCceEEEc-CCC-EEEEcccccCC-----CcccceeehhhHH Confidence 5 556666543 33322 34567899999999999964 333 56799888643 7789999999999 Q ss_pred HHHHHHHHHHHhhcCCcccCCHHHHHHHHHHHHHHHHHHHhcCCccCccceeccccccCCCC--CEEEEEEEEEEeeeee Q lcl|NC_019421. 387 NTINKDTSLKRKEFVGKIFNDATGQTTVICALKKYFEELMSQGIISEFNVDIDTELQATAKA--DEFYWKWDAVKVDVMK 464 (473) Q Consensus 387 d~i~~~i~~~~~~~ig~~~N~~~~r~~i~~~i~~~l~~l~~~g~i~~~~~~~D~~~~~~~~~--d~~~v~i~v~p~~~~e 464 (473) |+|+++|+....+|+++ ||++.+|..++..|+.||++||++|+|.+|++.||.+.+++++. ..+++++.++|+.|+| T Consensus 292 ~~i~~~i~~~~~~~v~e-~n~~~~~~~i~~~i~~~L~~l~~~gal~g~~v~~d~~~nt~~~i~~G~~~~~v~~~p~~p~e 370 (392) T protein:vir:18 292 QVLADTMAEAHMWAVDK-PITASLIRDIVDGINAKFRELKSNGYIVDGECWFDEESNDKETLKAGKLYIDYDYTPVPPLE 370 (392) T ss_pred HHHHHHHHHHHHHhccC-CCCHHHHHHHHHHHHHHHHHHHhcCcccceEEEEecCCCCHHHhhCCeEEEEEEEEecCCcc Confidence 99999999888888885 89999999999999999999999999999999999998887765 4688889999999999 Q ss_pred eEEEEEEeC Q lcl|NC_019421. 465 KIYGTGYLG 473 (473) Q Consensus 465 ~i~~t~~v~ 473 (473) ||.++++.- T Consensus 371 ~I~~~~~~~ 379 (392) T protein:vir:18 371 SLTLRQRIT 379 (392) T ss_pred eEEEEEEEc Confidence 999999998 No 35 >protein:vir:5711 Length: 396 # NCBI annotation: gpFI # Family: family:all:115 # MgeID: mge:120 # MgeName: L-413C # Cross-refs: genbank:acc:NP_839870;genbank:gi:30065725;genbank:GeneID:1260618 Probab=100.00 E-value=2.2e-52 Score=303.81 Aligned_cols=358 Identities=12% Similarity=0.047 Sum_probs=267.2 Q ss_pred CCceecCceeEEEecCCcceecccCceEEEEEEeeCCC-----CCCceEEeeccHHHHHHHcCCCcCcHHHHHHHHHHhc Q lcl|NC_019421. 9 KERKEIPGFYNRFKTQAEKSTNTGLKGRLAMPIRANWG-----DVGKVVTIKNDLRQLKNLFGDDMNYSAFKLGKLALLG 83 (473) Q Consensus 9 ~~~~~~PGvYie~~~~~~~~i~~~~~~~~~~~g~a~~G-----p~~~~v~i~s~~~~~~~~fG~~~~~~~~~~v~~~f~~ 83 (473) ..++. |||||+|++++.+++.++++++++|+|.+..+ |.++|++|+|..+ +...||.. ..+..+++.+|.+ T Consensus 1 m~~~~-~GV~v~e~~~g~~~i~~v~tav~~~vg~a~~~d~~~~~~~~pv~i~s~~~-~~~~~g~~--~tl~~al~~~~~~ 76 (396) T protein:vir:57 1 MSDYH-HGVQVLEINDGTRVISTVSTAIVGMVCTASDADAETFPLNKPVLITNVQS-AIAKAGKK--GTLAASLQAIADQ 76 (396) T ss_pred CCCCC-CceEEEEcCCCcccccccCCceEEEEEeccCCCcccccCccCeEeecchh-hhhhcccc--cchHHHHHHhhhc Confidence 44544 99999999999999999999999999998765 8899999999655 55566654 2566788999999 Q ss_pred CCCEEEEEecCCCcccceeeeecccccccccceEEEEecCccccceeEEEeeccCCccceeeeeecCCceeeEEEecccc Q lcl|NC_019421. 84 NVKELLLYRLVDGNQKKGTLTLKDTTENSAKDVIKLETKYPTARNFNVTIKSNLVDSDKKDFIFFENTKQLFSSSIKGTI 163 (473) Q Consensus 84 g~~~v~v~rv~~g~~~aat~~l~~~~~~~~~~~l~i~A~~~G~~~n~i~v~~~~~~~~~~~v~v~~~~~~~~~~~~~~~~ 163 (473) ++..++++++..+.......... . +. . ++ T Consensus 77 ~~~~~~vv~~~~~~~~~~~~~~a----------~-------t~-~---------------~i------------------ 105 (396) T protein:vir:57 77 SKPVTVVVRVEDGTGDDEETKLA----------Q-------TV-S---------------NI------------------ 105 (396) T ss_pred CCceeEeeecccccccccccccc----------c-------cc-e---------------ee------------------ Confidence 99999999987654332110000 0 00 0 00 Q ss_pred hhhhhhhhhcccccceeEeecccCCccccccceeeeccCcccccchhhHHHHHHHHhhcc---cceEEEEEcCCCcHHHH Q lcl|NC_019421. 164 DEIVLEINSNLDNEYVIATKVADSDTILANVVNQALEGGNDGCTSITNESYLKALEEFER---YSFDSFVLDGVADEALQ 240 (473) Q Consensus 164 ~~~~~~~~~~~~s~~v~~~~~~~~~~~~~~~~~~~l~gG~dg~~~~t~~d~~~~l~~le~---~~~~~l~~p~~~~~~~~ 240 (473) .|+.+.....+. ..+|...+. ....++++|+.....++ T Consensus 106 ------------------------------------iG~~~~~~~~tg---l~al~~~~~~~~~~p~i~~ap~~~~~~v~ 146 (396) T protein:vir:57 106 ------------------------------------IGTTDENGQYTG---LKALMGAESVTGVKPRILGVPGLDTKEVA 146 (396) T ss_pred ------------------------------------eeeccccccchh---hhhhhhcccceeEEeccccCcccchhHHH Confidence 000000000000 011111111 22456667777777888 Q ss_pred HHHHHHHHHHhhCCCeEEEEEcCCCCccHHHHHHhhhccCCceEEEecCCceecCc---ccchHHHHHHHHHhhhcCccc Q lcl|NC_019421. 241 ETTKAWVAKNKELGKDILLFLGGKTEDNIKQINDKSKSFNDENIVNVGSSAYYENI---KYTPSEVAVYIAALSVSKGIT 317 (473) Q Consensus 241 ~~l~~~v~~~~~~~~~~~av~~~~~~~t~~~~~~~~~~~n~~~i~~~~~~~~~~~~---~~~~~~~a~~vAG~~a~~~~~ 317 (473) +++.++|+++ .+++++..+...+.+.+.++...+++.+.++++||....+. .......++++||++|++|.. T Consensus 147 ~al~~~~~~~-----~~~~~~d~p~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~~Ag~~a~~d~~ 221 (396) T protein:vir:57 147 VALASVCQEL-----NAFGYISAWGCKTISEVKAYRQNFSQRELMVIWPDFLAWDTVTSTTATAYATARALGLRAKIDQE 221 (396) T ss_pred HHHHHHhhhC-----ceEEEEcCCCCCCHHHHHHHHhccCCceEEEEcceeeeecccCCceeEEehhHHHHHHHHHhhhc Confidence 8888888765 36888999888899999999999999999999999875332 111123479999999999966 Q ss_pred c----ccceeccCcc-cccc------cCCHHHHHHHHhCCcEEEEEcCCEEEEEecccccccCCCCCcchhhhhhhhHHH Q lcl|NC_019421. 318 G----SICNAKTIFE-EVEP------RLSQSEVKECLKSGTLVLDFDDGDVIIVDDVNTFKKYVDDKNEAMGYISNIMFI 386 (473) Q Consensus 318 ~----s~t~~~~~~~-~~~~------~~t~~e~~~l~~~G~~~l~~~~~~~~i~~gi~T~~~~~~~~~~~~~~i~v~R~~ 386 (473) + |+.|+++.++ .... ..+..|.+.|+++|++++.+. + ..+.||-+|+.+ ++.|++|++||++ T Consensus 222 ~g~~~spaN~~l~gi~~~~~~~~~~~~~~~~~~~~Ln~~gi~t~~~~-~-G~~~wG~rT~~~-----d~~~~~i~vrR~~ 294 (396) T protein:vir:57 222 QGWHKTLSNVGVNGVTGISASVFWDLQKPGTDADLLNEAGVTTLVRR-D-GFRFWGNRTCSD-----DPLFLFESYTRTA 294 (396) T ss_pred cCcEeccCCceeccccccceecccccCCcchhhhhhhhcCcEEEEcC-C-CEEEEcccccCC-----CcccceeehhhHH Confidence 6 5566665432 2322 235678999999999999653 3 356799888643 7789999999999 Q ss_pred HHHHHHHHHHHhhcCCcccCCHHHHHHHHHHHHHHHHHHHhcCCccCccceeccccccCCCC--CEEEEEEEEEEeeeee Q lcl|NC_019421. 387 NTINKDTSLKRKEFVGKIFNDATGQTTVICALKKYFEELMSQGIISEFNVDIDTELQATAKA--DEFYWKWDAVKVDVMK 464 (473) Q Consensus 387 d~i~~~i~~~~~~~ig~~~N~~~~r~~i~~~i~~~l~~l~~~g~i~~~~~~~D~~~~~~~~~--d~~~v~i~v~p~~~~e 464 (473) |+|+++|+....+|+++ ||++.+|..|+..|+.||++|+++|+|.+|++.||++.+++++. ..+++++.++|+.|+| T Consensus 295 ~~i~~~i~~~~~~~v~e-~n~~~~~~~i~~~i~~~l~~l~~~gal~g~~v~~d~~~n~~~~i~~G~~~~~v~~~p~~p~e 373 (396) T protein:vir:57 295 QVLADTMAEAHMWAIDK-PITATLIRDIIDGINAKFRELKNNGYIVDGTCWFSEESNDAETLKAGKLYIDYDYTPVPPLE 373 (396) T ss_pred HHHHHHHHHHHHHhccC-CCCHHHHHHHHHHHHHHHHHHHhCCceeceEEEEecCCCCHHHhhCCeEEEEEEEEecCCcc Confidence 99999999888888885 89999999999999999999999999999999999998887764 4788999999999999 Q ss_pred eEEEEEEeC Q lcl|NC_019421. 465 KIYGTGYLG 473 (473) Q Consensus 465 ~i~~t~~v~ 473 (473) +|.++++.- T Consensus 374 ~I~~~~~~~ 382 (396) T protein:vir:57 374 NLTLRQRIT 382 (396) T ss_pred eEEEEEEEc Confidence 999999988 No 36 >protein:vir:79181 Length: 390 # NCBI annotation: gp30, phage tail sheath protein # Family: family:all:115 # MgeID: mge:1866 # MgeName: phiE202 # Cross-refs: genbank:acc:YP_001111061;genbank:gi:134288772;genbank:GeneID:4960700 Probab=100.00 E-value=6.4e-52 Score=301.24 Aligned_cols=356 Identities=10% Similarity=0.040 Sum_probs=265.8 Q ss_pred CCccccCCCCceecCceeEEEecCCcceecccCceEEEEEEeeCCC-----CCCceEEeeccHHHHHHHcCCCcCcHHHH Q lcl|NC_019421. 1 MATGTWNEKERKEIPGFYNRFKTQAEKSTNTGLKGRLAMPIRANWG-----DVGKVVTIKNDLRQLKNLFGDDMNYSAFK 75 (473) Q Consensus 1 m~~g~~~~~~~~~~PGvYie~~~~~~~~i~~~~~~~~~~~g~a~~G-----p~~~~v~i~s~~~~~~~~fG~~~~~~~~~ 75 (473) ||. .-+|||||+|++++.+++..+++++.+|+|.+..+ |+|+|++|+|.. ++...||.+. .+.. T Consensus 1 M~~--------~~~~Gv~v~e~~~~~~~i~~~~tav~~~vg~a~dad~~~~p~n~pv~its~~-~~~~~~g~~~--tL~~ 69 (390) T protein:vir:79 1 MPQ--------DYHHGVRVIEINEGGRPIRSVSTAVLGVVCTAADADASAFPLNTPVLLTNVV-AALGKAGKKG--TLRR 69 (390) T ss_pred Ccc--------ccCCCeEEEEcCCCcccccccCCceeEEEEecCCCCccccccccceEeecHH-HHHHhcCCCc--cchh Confidence 553 34699999999999999999999999999999876 899999999854 4677787643 4567 Q ss_pred HHHHHHhcCCCEEEEEecCCCcccceeeeecccccccccceEEEEecCccccceeEEEeeccCCccceeeeeecCCceee Q lcl|NC_019421. 76 LGKLALLGNVKELLLYRLVDGNQKKGTLTLKDTTENSAKDVIKLETKYPTARNFNVTIKSNLVDSDKKDFIFFENTKQLF 155 (473) Q Consensus 76 ~v~~~f~~g~~~v~v~rv~~g~~~aat~~l~~~~~~~~~~~l~i~A~~~G~~~n~i~v~~~~~~~~~~~v~v~~~~~~~~ 155 (473) +++.+|.+++..|+++++..+....++.. .. + |. .++ T Consensus 70 al~~~~~~~~~~~~vv~v~~~~~~~~~~~----------~~--i-----g~-----------~~~--------------- 106 (390) T protein:vir:79 70 TLDAIGKQTKPLTVVVRVAEGKDADETTS----------NV--I-----GT-----------VTP--------------- 106 (390) T ss_pred hhhhhcccccceEEEEeeccccccccccc----------ee--e-----ec-----------ccc--------------- Confidence 88999999999999999876543322110 00 0 00 000 Q ss_pred EEEecccchhhhhhhhhcccccceeEeecccCCccccccceeeeccCcccccchhhHHHHHHHHhhcccceEEEEEcCCC Q lcl|NC_019421. 156 SSSIKGTIDEIVLEINSNLDNEYVIATKVADSDTILANVVNQALEGGNDGCTSITNESYLKALEEFERYSFDSFVLDGVA 235 (473) Q Consensus 156 ~~~~~~~~~~~~~~~~~~~~s~~v~~~~~~~~~~~~~~~~~~~l~gG~dg~~~~t~~d~~~~l~~le~~~~~~l~~p~~~ 235 (473) .++..|.. -+ .....+.....+++++|+.+ T Consensus 107 --------------------------------------------~~~~tgl~-----al-~~~~~~~~~~p~il~ap~~~ 136 (390) T protein:vir:79 107 --------------------------------------------DGKYTGIK-----AL-LAAQGALGVKPRILAAPGLD 136 (390) T ss_pred --------------------------------------------cccchhhh-----hh-hhhhhhhccccccccCCccc Confidence 00000000 00 00001112345677788877 Q ss_pred cHHHHHHHHHHHHHHhhCCCeEEEEEcCCCCccHHHHHHhhhccCCceEEEecCCceecCc---ccchHHHHHHHHHhhh Q lcl|NC_019421. 236 DEALQETTKAWVAKNKELGKDILLFLGGKTEDNIKQINDKSKSFNDENIVNVGSSAYYENI---KYTPSEVAVYIAALSV 312 (473) Q Consensus 236 ~~~~~~~l~~~v~~~~~~~~~~~av~~~~~~~t~~~~~~~~~~~n~~~i~~~~~~~~~~~~---~~~~~~~a~~vAG~~a 312 (473) ...+++.+..+++++ ++++++.++...+...+.++...+++.+.+.+.||....+. .......++++||++| T Consensus 137 ~~~v~~~l~~~a~~~-----~~~ai~D~p~~~t~~~a~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~~Ag~~a 211 (390) T protein:vir:79 137 TQPVAAALAATAQSL-----RAMAYVSASGCKTKEEAAAYRRQFGQREIMVIWPDWLGWDDTTNSTAVIPAPAIAAGLRA 211 (390) T ss_pred chHHHHHHHHhhhhc-----ceEEEEEccCCCCHHHHHHHhcCCCCceEEEEcCceeecccccCceeEeehHHHHHHHHH Confidence 777888888777654 46889998888889999999999999999999998865332 1122234789999999 Q ss_pred cCcccc----ccceeccC-cccccccC------CHHHHHHHHhCCcEEEEEcCCEEEEEecccccccCCCCCcchhhhhh Q lcl|NC_019421. 313 SKGITG----SICNAKTI-FEEVEPRL------SQSEVKECLKSGTLVLDFDDGDVIIVDDVNTFKKYVDDKNEAMGYIS 381 (473) Q Consensus 313 ~~~~~~----s~t~~~~~-~~~~~~~~------t~~e~~~l~~~G~~~l~~~~~~~~i~~gi~T~~~~~~~~~~~~~~i~ 381 (473) ++|..+ |+.|+++. ...+...+ +..|.+.|+++|++++.+. + ..++||-+|+.+ |+.|++|+ T Consensus 212 ~~D~~~g~~~spsN~~i~gi~~~~~~~~~~~~~~~~~a~~Ln~~gi~t~~~~-~-G~~~wG~rT~~~-----d~~~~~i~ 284 (390) T protein:vir:79 212 KIDNDIGWHKTISNVVVNGVSGISADVSWDLQDPATDAGYLNEHEVTTLVNR-N-GFRFWGERTCSD-----DPKFAFEN 284 (390) T ss_pred hhhccCCcEEccCCceeeccceeeeeccccccccchhhhhhhhcCcEEEEcC-C-CEEEEeccccCC-----Ccccceee Confidence 999655 55556553 33333332 3345678999999998653 3 355699888643 77899999 Q ss_pred hhHHHHHHHHHHHHHHhhcCCcccCCHHHHHHHHHHHHHHHHHHHhcCCccCccceeccccccCCCC--CEEEEEEEEEE Q lcl|NC_019421. 382 NIMFINTINKDTSLKRKEFVGKIFNDATGQTTVICALKKYFEELMSQGIISEFNVDIDTELQATAKA--DEFYWKWDAVK 459 (473) Q Consensus 382 v~R~~d~i~~~i~~~~~~~ig~~~N~~~~r~~i~~~i~~~l~~l~~~g~i~~~~~~~D~~~~~~~~~--d~~~v~i~v~p 459 (473) +||++|+|+++|+....+++++ ||++.+|.+|+..|+.||+.|+++|+|.+|++.||++.+++++. ..+++++.++| T Consensus 285 vrR~~~~i~~~i~~~~~~~v~e-~~~~~~~~~i~~~i~~~L~~l~~~gal~g~~v~~d~~~nt~~~i~~G~~~~~i~~~p 363 (390) T protein:vir:79 285 YTRTAQVAADSIAEAQMPVVDG-PLNPSLARDIVESINGWFRQQVANGYLIGGSAWIDPEPNTADILASGKAYIDYDYTP 363 (390) T ss_pred ehhhHHHHHHHHHHHHHHhccC-CCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEecCCCCHHHhhCCEEEEEEEEEe Confidence 9999999999999888888885 89999999999999999999999999999999999998887754 47889999999 Q ss_pred eeeeeeEEEEEEeC Q lcl|NC_019421. 460 VDVMKKIYGTGYLG 473 (473) Q Consensus 460 ~~~~e~i~~t~~v~ 473 (473) +.|+|+|.++++.- T Consensus 364 ~~p~e~i~~~~~~~ 377 (390) T protein:vir:79 364 VPPLENLVLRQRIT 377 (390) T ss_pred cCCcceEEEEEEEc Confidence 99999999999998 No 37 >protein:vir:98553 Length: 395 # NCBI annotation: gp21 # Family: family:all:115 # MgeID: mge:1533 # MgeName: PSP3 # Cross-refs: genbank:acc:NP_958076;genbank:gi:41057373;genbank:GeneID:2744224 Probab=100.00 E-value=1.8e-51 Score=298.73 Aligned_cols=358 Identities=13% Similarity=0.048 Sum_probs=269.3 Q ss_pred CCceecCceeEEEecCCcceecccCceEEEEEEeeCCC-----CCCceEEeeccHHHHHHHcCCCcCcHHHHHHHHHHhc Q lcl|NC_019421. 9 KERKEIPGFYNRFKTQAEKSTNTGLKGRLAMPIRANWG-----DVGKVVTIKNDLRQLKNLFGDDMNYSAFKLGKLALLG 83 (473) Q Consensus 9 ~~~~~~PGvYie~~~~~~~~i~~~~~~~~~~~g~a~~G-----p~~~~v~i~s~~~~~~~~fG~~~~~~~~~~v~~~f~~ 83 (473) ..+. +|||||++..++.+++.++++++.+|+|.++.+ |.++|++|+|. .++...||... .+..+++.+|.+ T Consensus 1 m~~~-~~GV~v~e~~~g~~~v~~v~tav~~~vgta~~~~~~~~p~~~pv~v~s~-~~~~~~~g~~~--tl~~al~~~~~~ 76 (395) T protein:vir:98 1 MSDF-HHGTQVIEINDGTRVISTVATAVVGMVCTASDADATLFPLNEPVLITNV-QSAIAKAGKKG--TLAASLQAIADQ 76 (395) T ss_pred CCCC-CCCeEEEEcCCCcccccccCcceEEEEeeccCCCccccccccceEeech-HHhHhhccccc--chhhHHHHHhhc Confidence 5666 699999999999999999999999999998754 78999999985 45777777543 466789999999 Q ss_pred CCCEEEEEecCCCcccceeeeecccccccccceEEEEecCccccceeEEEeeccCCccceeeeeecCCceeeEEEecccc Q lcl|NC_019421. 84 NVKELLLYRLVDGNQKKGTLTLKDTTENSAKDVIKLETKYPTARNFNVTIKSNLVDSDKKDFIFFENTKQLFSSSIKGTI 163 (473) Q Consensus 84 g~~~v~v~rv~~g~~~aat~~l~~~~~~~~~~~l~i~A~~~G~~~n~i~v~~~~~~~~~~~v~v~~~~~~~~~~~~~~~~ 163 (473) ++..|+++++..+........ + .+ .. T Consensus 77 ~~~~~~vv~~~~~~~~~~~~~------------~--------------a~----------------------------~~ 102 (395) T protein:vir:98 77 SKPVTVVVRVEDGTGDDEEAA------------L--------------AQ----------------------------TV 102 (395) T ss_pred cCceEEEeecccccccccccc------------c--------------cc----------------------------cc Confidence 999999998866433211000 0 00 00 Q ss_pred hhhhhhhhhcccccceeEeecccCCccccccceeeeccCcccccchhhHHHHHHHHh---hcccceEEEEEcCCCcHHHH Q lcl|NC_019421. 164 DEIVLEINSNLDNEYVIATKVADSDTILANVVNQALEGGNDGCTSITNESYLKALEE---FERYSFDSFVLDGVADEALQ 240 (473) Q Consensus 164 ~~~~~~~~~~~~s~~v~~~~~~~~~~~~~~~~~~~l~gG~dg~~~~t~~d~~~~l~~---le~~~~~~l~~p~~~~~~~~ 240 (473) + .+.++.+.....+. . .+|.. .......++++|+..+..++ T Consensus 103 ~---------------------------------~i~g~~~~~~~~Tg--l-~al~~~~~~~~~~p~il~ap~~~~~~v~ 146 (395) T protein:vir:98 103 S---------------------------------NIIGGTDENGKYTG--I-KALLTAQAVTGVKPRILGVPGLDTKEVA 146 (395) T ss_pred c---------------------------------ccccccccccchhH--H-HHHhhhhhhhccchhhcccccccccHHH Confidence 0 00000000000000 0 11111 11234577888988888888 Q ss_pred HHHHHHHHHHhhCCCeEEEEEcCCCCccHHHHHHhhhccCCceEEEecCCceecCcc---cchHHHHHHHHHhhhcCccc Q lcl|NC_019421. 241 ETTKAWVAKNKELGKDILLFLGGKTEDNIKQINDKSKSFNDENIVNVGSSAYYENIK---YTPSEVAVYIAALSVSKGIT 317 (473) Q Consensus 241 ~~l~~~v~~~~~~~~~~~av~~~~~~~t~~~~~~~~~~~n~~~i~~~~~~~~~~~~~---~~~~~~a~~vAG~~a~~~~~ 317 (473) +++.+++++++ +++++..+.+.+.+.+.++...+++.+.++++||....+.. ......++++||++|+.+.. T Consensus 147 ~al~~~~~~~~-----~~~~~d~p~~~t~~~a~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~~AG~~a~~d~~ 221 (395) T protein:vir:98 147 VALASAAIKLR-----AFAYVSAWGCKTISEAMEYRKNFSQRELMVIWPDFLAWDTVKNTTATAYATARALGLRAYIDQT 221 (395) T ss_pred HHHHHHhhhcC-----cEEEEEcCCCCCHHHHHHHHhccCCceEEEEecceeEecccCCceeeechHHHHHHHHHHhhcc Confidence 88888887653 57888888888999999999999999999999998754321 11222478999999999866 Q ss_pred ccc----ceeccCc-ccccc------cCCHHHHHHHHhCCcEEEEEcCCEEEEEecccccccCCCCCcchhhhhhhhHHH Q lcl|NC_019421. 318 GSI----CNAKTIF-EEVEP------RLSQSEVKECLKSGTLVLDFDDGDVIIVDDVNTFKKYVDDKNEAMGYISNIMFI 386 (473) Q Consensus 318 ~s~----t~~~~~~-~~~~~------~~t~~e~~~l~~~G~~~l~~~~~~~~i~~gi~T~~~~~~~~~~~~~~i~v~R~~ 386 (473) +++ .|+++.+ ..+.. ..+..|.+.|+++|++++.+. + ..+.||-+|+.+ |+.|++|++||++ T Consensus 222 ~g~~~spaN~~i~gi~~~~~~~~~~~~~~~~~~~~Ln~~gI~~~~~~-~-G~~~wG~rT~s~-----d~~~~~i~~rR~~ 294 (395) T protein:vir:98 222 VGWHKTLSNVGVQGVTGISASVFWDLQASGTDADLLNEAGVTTLVRK-D-GFRFWGNRTCSD-----DPLFLFENYTRTA 294 (395) T ss_pred cCcEeccCCceeecccccceecccccCCCcchHHhhhhcCcEEEEcC-C-CEEEEcccccCC-----CcccceeehhhHH Confidence 654 4454432 22222 345789999999999999653 3 466799888743 7789999999999 Q ss_pred HHHHHHHHHHHhhcCCcccCCHHHHHHHHHHHHHHHHHHHhcCCccCccceeccccccCCCC--CEEEEEEEEEEeeeee Q lcl|NC_019421. 387 NTINKDTSLKRKEFVGKIFNDATGQTTVICALKKYFEELMSQGIISEFNVDIDTELQATAKA--DEFYWKWDAVKVDVMK 464 (473) Q Consensus 387 d~i~~~i~~~~~~~ig~~~N~~~~r~~i~~~i~~~l~~l~~~g~i~~~~~~~D~~~~~~~~~--d~~~v~i~v~p~~~~e 464 (473) |+|+++|+....+|+++ ||++.+|..|+..|+.||++|+++|+|.+|++.||++.|++++. ..+++.+.++|+.|+| T Consensus 295 ~~i~~~i~~~~~~~v~e-~~~~~~~~~i~~~i~~~L~~l~~~g~l~g~~v~~d~~~nt~~~i~~G~~~~~i~~~p~~p~e 373 (395) T protein:vir:98 295 QVLADTMAEAHMWAVDK-PITATLIRDIVDGINAKFRELKSNGYIVEGKCWFDEESNDKETLKAGKLYIDYDYTPVPPLE 373 (395) T ss_pred HHHHHHHHHHHHHhccC-CCCHHHHHHHHHHHHHHHHHHHhCCceeceEEEEecCCCCHHHhhCCeEEEEEEEEecCCcc Confidence 99999999888888885 89999999999999999999999999999999999998887754 4788999999999999 Q ss_pred eEEEEEEeC Q lcl|NC_019421. 465 KIYGTGYLG 473 (473) Q Consensus 465 ~i~~t~~v~ 473 (473) +|.++++.- T Consensus 374 ~I~~~~~~~ 382 (395) T protein:vir:98 374 SLTLRQRIT 382 (395) T ss_pred eEEEEEEEc Confidence 999999988 No 38 >protein:vir:1172 Length: 391 # NCBI annotation: hypothetical protein # Family: family:all:115 # MgeID: mge:24 # MgeName: phi CTX # Cross-refs: genbank:acc:NP_490621;genbank:gi:17313241;genbank:GeneID:927300 Probab=100.00 E-value=5.5e-52 Score=301.62 Aligned_cols=357 Identities=9% Similarity=0.012 Sum_probs=267.2 Q ss_pred CCccccCCCCceecCceeEEEecCCcceecccCceEEEEEEeeC-----CCCCCceEEeeccHHHHHHHcCCCcCcHHHH Q lcl|NC_019421. 1 MATGTWNEKERKEIPGFYNRFKTQAEKSTNTGLKGRLAMPIRAN-----WGDVGKVVTIKNDLRQLKNLFGDDMNYSAFK 75 (473) Q Consensus 1 m~~g~~~~~~~~~~PGvYie~~~~~~~~i~~~~~~~~~~~g~a~-----~Gp~~~~v~i~s~~~~~~~~fG~~~~~~~~~ 75 (473) ||.=++ .|||||+|+.++.+++..+.+++.+|+|.+. .+|+++|++|+|..+ +...||.+ ..+.. T Consensus 1 M~~~~~-------~~GV~v~e~~~~~~~i~~v~tavig~vg~a~~a~~~~~~~~~p~~v~s~~~-~~~~~g~~--~tl~~ 70 (391) T protein:vir:11 1 MAADQY-------HHGVRVQEINDGTRPIRTIATAIIGLVATAEDADATAFPLDTPVLITNVQA-AIGKAGTS--GTLPA 70 (391) T ss_pred CCCCcC-------CCcEEEEECCCCcceecccCCceeEEEEecCCCCCccccccccEEEecchh-hheecCCC--ccchh Confidence 776655 4999999999999999999999999999987 569999999999655 66667754 35667 Q ss_pred HHHHHHhcCCCEEEEEecCCCcccceeeeecccccccccceEEEEecCccccceeEEEeeccCCccceeeeeecCCceee Q lcl|NC_019421. 76 LGKLALLGNVKELLLYRLVDGNQKKGTLTLKDTTENSAKDVIKLETKYPTARNFNVTIKSNLVDSDKKDFIFFENTKQLF 155 (473) Q Consensus 76 ~v~~~f~~g~~~v~v~rv~~g~~~aat~~l~~~~~~~~~~~l~i~A~~~G~~~n~i~v~~~~~~~~~~~v~v~~~~~~~~ 155 (473) +++.+|.+++..|+++++.++...+.+..-. . |.. +.. T Consensus 71 al~~~~~~~g~~~~vv~~~~~~~~~~t~~d~-------~----------g~~----------~a~--------------- 108 (391) T protein:vir:11 71 SLQAIADQANAATVVVRVKPGEDEAATNSAV-------I----------GGV----------SAD--------------- 108 (391) T ss_pred hhhhhhccccceeEEeeecccccccccchhh-------h----------ccc----------ccc--------------- Confidence 8999999999999999987765433221100 0 000 000 Q ss_pred EEEecccchhhhhhhhhcccccceeEeecccCCccccccceeeeccCcccccchhhHHHHHHHHhhcccceEEEEEcCCC Q lcl|NC_019421. 156 SSSIKGTIDEIVLEINSNLDNEYVIATKVADSDTILANVVNQALEGGNDGCTSITNESYLKALEEFERYSFDSFVLDGVA 235 (473) Q Consensus 156 ~~~~~~~~~~~~~~~~~~~~s~~v~~~~~~~~~~~~~~~~~~~l~gG~dg~~~~t~~d~~~~l~~le~~~~~~l~~p~~~ 235 (473) ........ ..++. ..-.....++..|+.+ T Consensus 109 --~~~~g~~a------------------------------------------------~~~~~-~~~~~~p~~~~ap~~~ 137 (391) T protein:vir:11 109 --GKYTGMKA------------------------------------------------LLAAK-ARLGVVPRILGVPGLD 137 (391) T ss_pred --cchhhhhh------------------------------------------------hhhhh-hhheeccccccccccc Confidence 00000000 00000 0001112344556666 Q ss_pred cHHHHHHHHHHHHHHhhCCCeEEEEEcCCCCccHHHHHHhhhccCCceEEEecCCceecCc---ccchHHHHHHHHHhhh Q lcl|NC_019421. 236 DEALQETTKAWVAKNKELGKDILLFLGGKTEDNIKQINDKSKSFNDENIVNVGSSAYYENI---KYTPSEVAVYIAALSV 312 (473) Q Consensus 236 ~~~~~~~l~~~v~~~~~~~~~~~av~~~~~~~t~~~~~~~~~~~n~~~i~~~~~~~~~~~~---~~~~~~~a~~vAG~~a 312 (473) +..+++++.++++++ ++++++..+...+.+.+.+....+++.+.+.++||....+. .......++++||++| T Consensus 138 ~~~v~~al~~~~~~~-----~~~~i~D~p~~~t~~~a~~~r~~~~s~~~~~~~p~~~~~~~~~~~~~~~p~s~~~ag~~a 212 (391) T protein:vir:11 138 TQPVATALIAIAQQL-----RAFAYVSASGCKTKEEATAYRENFAAREAMVIWPDFLTWSTVVNQTVPAPAVAQALGLRA 212 (391) T ss_pred cHHHHHHHHHhhccc-----ceEEEEEcCCCCCHHHHHHHhhhcCCceEEEEcCcceecccccCceEEechHHHHHHHHH Confidence 677888888877543 57888888888899999999999999999999999875432 2222335889999999 Q ss_pred cCcccc----ccceeccCc-cccccc------CCHHHHHHHHhCCcEEEEEcCCEEEEEecccccccCCCCCcchhhhhh Q lcl|NC_019421. 313 SKGITG----SICNAKTIF-EEVEPR------LSQSEVKECLKSGTLVLDFDDGDVIIVDDVNTFKKYVDDKNEAMGYIS 381 (473) Q Consensus 313 ~~~~~~----s~t~~~~~~-~~~~~~------~t~~e~~~l~~~G~~~l~~~~~~~~i~~gi~T~~~~~~~~~~~~~~i~ 381 (473) ++|..+ |+.|+++.+ ..+... .++.|.+.|+++|++++.. ++ ..+.||-+|+.+ |+.|++|+ T Consensus 213 ~~d~~~g~~~span~~l~gi~~~~~~~~~~~~~~~~~~~~Ln~~gi~~~~~-~~-G~~~wG~rT~~~-----d~~~~~i~ 285 (391) T protein:vir:11 213 RIDQEVGWHKTLSNVAVNGVTGISADVFWDLQSPSTDANYLNENEVTTLVQ-EG-GFRFWGSRTCSD-----DPLFAFEN 285 (391) T ss_pred HhhccCCcEEccCCceeeceeecccccccccCCCcchhhhhhhcCcEEEEc-CC-CEEEEcccccCC-----Ccccceee Confidence 999555 555555543 233322 3467889999999999854 33 366899988743 67899999 Q ss_pred hhHHHHHHHHHHHHHHhhcCCcccCCHHHHHHHHHHHHHHHHHHHhcCCccCccceeccccccCCCC--CEEEEEEEEEE Q lcl|NC_019421. 382 NIMFINTINKDTSLKRKEFVGKIFNDATGQTTVICALKKYFEELMSQGIISEFNVDIDTELQATAKA--DEFYWKWDAVK 459 (473) Q Consensus 382 v~R~~d~i~~~i~~~~~~~ig~~~N~~~~r~~i~~~i~~~l~~l~~~g~i~~~~~~~D~~~~~~~~~--d~~~v~i~v~p 459 (473) +||++|+|+++|+....+++++ ||++.+|..|+..|+.||++|+++|+|.+|++.||.+.+++++. ..+++++.++| T Consensus 286 vrR~~~~i~~~~~~~~~~~v~e-~n~~~~~~~i~~~i~~~l~~l~~~g~l~g~~~~~~~~~n~~~~i~~G~~~~~i~~~p 364 (391) T protein:vir:11 286 YTRTAQVLADTIAEAHMWAVDK-PMHPSLVRDILEGVNAKFRELKGLGLIIDAQAWYDPNVNDKDTLKAGKLRITYDYTP 364 (391) T ss_pred hhhHHHHHHHHHHHHHHHhccC-CCCHHHHHHHHHHHHHHHHHHHhccceeceEEEEecCCCCHHHhhCCeEEEEEEEEe Confidence 9999999999999888888885 89999999999999999999999999999999999998887764 47889999999 Q ss_pred eeeeeeEEEEEEeC Q lcl|NC_019421. 460 VDVMKKIYGTGYLG 473 (473) Q Consensus 460 ~~~~e~i~~t~~v~ 473 (473) +.|+|+|.++++.- T Consensus 365 ~~p~e~i~~~~~~~ 378 (391) T protein:vir:11 365 VPPLEDLTFFQKIT 378 (391) T ss_pred cCCcceEEEEEEEc Confidence 99999999999988 No 39 >protein:vir:2035 Length: 396 # NCBI annotation: gpFI # Family: family:all:115 # MgeID: mge:315 # MgeName: P2 # Cross-refs: genbank:acc:NP_046778;genbank:gi:9630349;genbank:GeneID:1261516 Probab=100.00 E-value=9.3e-52 Score=300.36 Aligned_cols=358 Identities=13% Similarity=0.064 Sum_probs=264.0 Q ss_pred CCceecCceeEEEecCCcceecccCceEEEEEEeeCCC-----CCCceEEeeccHHHHHHHcCCCcCcHHHHHHHHHHhc Q lcl|NC_019421. 9 KERKEIPGFYNRFKTQAEKSTNTGLKGRLAMPIRANWG-----DVGKVVTIKNDLRQLKNLFGDDMNYSAFKLGKLALLG 83 (473) Q Consensus 9 ~~~~~~PGvYie~~~~~~~~i~~~~~~~~~~~g~a~~G-----p~~~~v~i~s~~~~~~~~fG~~~~~~~~~~v~~~f~~ 83 (473) .-++ +|||||+|.+++.+++..+.+++.+|+|.++.+ |.++|++|+|..+ +...||... .+..+++.+|.+ T Consensus 1 m~~~-~~GV~v~e~~~g~~~i~~v~tav~~~vg~a~~a~~~~~~l~~pvlvts~~~-~~~~~g~~~--tL~~al~~~~~n 76 (396) T protein:vir:20 1 MSDY-HHGVQVLEINEGTRVISTVSTAIVGMVCTASDADAETFPLNKPVLITNVQS-AISKAGKKG--TLAASLQAIADQ 76 (396) T ss_pred CCCC-CCCeEEEEcCCCcceeeecCCceeEEEeeeccCCCccccCccCEEeechHH-HHhhccccc--chhhhhhhhhcc Confidence 4455 499999999999999999999999999998654 7899999999654 666777543 466788899999 Q ss_pred CCCEEEEEecCCCcccceeeeecccccccccceEEEEecCccccceeEEEeeccCCccceeeeeecCCceeeEEEecccc Q lcl|NC_019421. 84 NVKELLLYRLVDGNQKKGTLTLKDTTENSAKDVIKLETKYPTARNFNVTIKSNLVDSDKKDFIFFENTKQLFSSSIKGTI 163 (473) Q Consensus 84 g~~~v~v~rv~~g~~~aat~~l~~~~~~~~~~~l~i~A~~~G~~~n~i~v~~~~~~~~~~~v~v~~~~~~~~~~~~~~~~ 163 (473) ++..++++++..+......... ..... .+. ... T Consensus 77 gg~~~~v~~~~~~~~~~~~~~~--------------------------a~t~~-------~~~--------------~~~ 109 (396) T protein:vir:20 77 SKPVTVVMRVEDGTGDDEETKL--------------------------AQTVS-------NII--------------GTT 109 (396) T ss_pred CceeEEEEeccccccccccccc--------------------------ccccc-------ccc--------------ccc Confidence 9999999998765432111000 00000 000 000 Q ss_pred hhhhhhhhhcccccceeEeecccCCccccccceeeeccCcccccchhhHHHHHHHHhh---cccceEEEEEcCCCcHHHH Q lcl|NC_019421. 164 DEIVLEINSNLDNEYVIATKVADSDTILANVVNQALEGGNDGCTSITNESYLKALEEF---ERYSFDSFVLDGVADEALQ 240 (473) Q Consensus 164 ~~~~~~~~~~~~s~~v~~~~~~~~~~~~~~~~~~~l~gG~dg~~~~t~~d~~~~l~~l---e~~~~~~l~~p~~~~~~~~ 240 (473) + .. +...| ..+|... ......++..|+..+..++ T Consensus 110 ~----------------------~~------------~~~tg---------~~al~~~~~~~~~~p~i~~ap~~~~~~v~ 146 (396) T protein:vir:20 110 D----------------------EN------------GQYTG---------LKAMLAAESVTGVKPRILGVPGLDTKEVA 146 (396) T ss_pred c----------------------cc------------cccch---------hhhhhhhccccccchhhhhhhhhccHHHH Confidence 0 00 00000 0000000 0112234445666677788 Q ss_pred HHHHHHHHHHhhCCCeEEEEEcCCCCccHHHHHHhhhccCCceEEEecCCceecCc---ccchHHHHHHHHHhhhcCccc Q lcl|NC_019421. 241 ETTKAWVAKNKELGKDILLFLGGKTEDNIKQINDKSKSFNDENIVNVGSSAYYENI---KYTPSEVAVYIAALSVSKGIT 317 (473) Q Consensus 241 ~~l~~~v~~~~~~~~~~~av~~~~~~~t~~~~~~~~~~~n~~~i~~~~~~~~~~~~---~~~~~~~a~~vAG~~a~~~~~ 317 (473) +++.++|+++ ++++++..+...+.+++.++...+++.+.+++.||....+. .......++++||++|++|.. T Consensus 147 ~al~~~~~~~-----~~~~~iD~p~~~~~~~a~~~r~~~~s~~~~~~~P~~~~~d~~~~~~~~~p~s~~~Ag~~a~~d~~ 221 (396) T protein:vir:20 147 VALASVCQKL-----RAFGYISAWGCKTISEVKAYRQNFSQRELMVIWPDFLAWDTVTSTTATAYATARALGLRAKIDQE 221 (396) T ss_pred HHHHHHHhcC-----CcEEEEecCCCCCHHHHHHHhhCCCCceEEEEcCccccccCcCCcceeechhHHHHHHHHHhhhh Confidence 8888887654 45788898888899999999999999999999998865332 222233478999999999966 Q ss_pred c----ccceeccCcc-cccc------cCCHHHHHHHHhCCcEEEEEcCCEEEEEecccccccCCCCCcchhhhhhhhHHH Q lcl|NC_019421. 318 G----SICNAKTIFE-EVEP------RLSQSEVKECLKSGTLVLDFDDGDVIIVDDVNTFKKYVDDKNEAMGYISNIMFI 386 (473) Q Consensus 318 ~----s~t~~~~~~~-~~~~------~~t~~e~~~l~~~G~~~l~~~~~~~~i~~gi~T~~~~~~~~~~~~~~i~v~R~~ 386 (473) + |+.|+++.++ .... .++..|.+.|+++|++++.+. + ....||-+|+.+ |+.|++|++||++ T Consensus 222 ~g~~~spaN~~l~gi~~~~~~~~~~~~~~~~~~~~Ln~~gi~~~~~~-~-G~~~wG~rT~s~-----d~~~~~i~~rR~~ 294 (396) T protein:vir:20 222 QGWHKTLSNVGVNGVTGISASVFWDLQESGTDADLLNESGVTTLIRR-D-GFRFWGNRTCSD-----DPLFLFENYTRTA 294 (396) T ss_pred cCcEeccCCceeccceecceecccccCCCcchhhhhhhcCcEEEEcC-C-CEEEEcccccCC-----CcccceeehhhHH Confidence 5 5555555433 2222 245678999999999999653 3 367799988743 7789999999999 Q ss_pred HHHHHHHHHHHhhcCCcccCCHHHHHHHHHHHHHHHHHHHhcCCccCccceeccccccCCCC--CEEEEEEEEEEeeeee Q lcl|NC_019421. 387 NTINKDTSLKRKEFVGKIFNDATGQTTVICALKKYFEELMSQGIISEFNVDIDTELQATAKA--DEFYWKWDAVKVDVMK 464 (473) Q Consensus 387 d~i~~~i~~~~~~~ig~~~N~~~~r~~i~~~i~~~l~~l~~~g~i~~~~~~~D~~~~~~~~~--d~~~v~i~v~p~~~~e 464 (473) |+|+++|+....+++++ ||++.+|..++..|+.||++|+++|+|.+|++.||.+.+++++. ..+++++.++|+.|+| T Consensus 295 ~~i~~~~~~~~~~~v~e-~~~~~~~~~i~~~i~~~L~~l~~~G~l~g~~v~~d~~~nt~~~i~~G~~~~~i~~~p~~p~e 373 (396) T protein:vir:20 295 QVVADTMAEAHMWAVDK-PITATLIRDIVDGINAKFRELKTNGYIVDATCWFSEESNDAETLKAGKLYIDYDYTPVPPLE 373 (396) T ss_pred HHHHHHHHHHHHHhccC-CCCHHHHHHHHHHHHHHHHHHHhCcceeceEEEEecCCCCHHHhhCCEEEEEEEEEecCCcc Confidence 99999999888888875 89999999999999999999999999999999999998887764 4788889999999999 Q ss_pred eEEEEEEeC Q lcl|NC_019421. 465 KIYGTGYLG 473 (473) Q Consensus 465 ~i~~t~~v~ 473 (473) +|.++++.- T Consensus 374 ~i~~~~~~~ 382 (396) T protein:vir:20 374 NLTLRQRIT 382 (396) T ss_pred eEEEEEEEc Confidence 999999988 No 40 >protein:vir:78206 Length: 390 # NCBI annotation: gp33, phage tail sheath protein # Family: family:all:115 # MgeID: mge:1848 # MgeName: phiE12-2 # Cross-refs: genbank:acc:YP_001111183;genbank:gi:134288694;genbank:GeneID:4960666 Probab=100.00 E-value=1.1e-51 Score=299.86 Aligned_cols=356 Identities=10% Similarity=0.020 Sum_probs=265.8 Q ss_pred CCccccCCCCceecCceeEEEecCCcceecccCceEEEEEEeeCCC-----CCCceEEeeccHHHHHHHcCCCcCcHHHH Q lcl|NC_019421. 1 MATGTWNEKERKEIPGFYNRFKTQAEKSTNTGLKGRLAMPIRANWG-----DVGKVVTIKNDLRQLKNLFGDDMNYSAFK 75 (473) Q Consensus 1 m~~g~~~~~~~~~~PGvYie~~~~~~~~i~~~~~~~~~~~g~a~~G-----p~~~~v~i~s~~~~~~~~fG~~~~~~~~~ 75 (473) ||. ...|||||+|++.+.+++..+.+++.+|+|.++.+ |.|+|++|+|. .++...||.+ ..+.. T Consensus 1 M~~--------~~~~Gv~v~e~~~g~~~i~~~~tav~g~vg~a~~ad~~~~pln~pv~i~s~-~~~~~~~g~~--gtL~~ 69 (390) T protein:vir:78 1 MPQ--------DYHHGVRVIEINEGGRPIRSVSTAVLGVVCTAADADASAFPLNTPVLLTNV-VAALGKAGKK--GTLRR 69 (390) T ss_pred Ccc--------cccCCeEEEEcCCCcccccccCcceeEEEEcccCcCccccccccceEeccH-HHHHhhcCCC--ceehh Confidence 553 24699999999999999999999999999998765 99999999985 4566678864 35667 Q ss_pred HHHHHHhcCCCEEEEEecCCCcccceeeeecccccccccceEEEEecCccccceeEEEeeccCCccceeeeeecCCceee Q lcl|NC_019421. 76 LGKLALLGNVKELLLYRLVDGNQKKGTLTLKDTTENSAKDVIKLETKYPTARNFNVTIKSNLVDSDKKDFIFFENTKQLF 155 (473) Q Consensus 76 ~v~~~f~~g~~~v~v~rv~~g~~~aat~~l~~~~~~~~~~~l~i~A~~~G~~~n~i~v~~~~~~~~~~~v~v~~~~~~~~ 155 (473) ++..+|.+++..|+++++.++...+++... + -|. ...... T Consensus 70 al~~~~~~gg~~~~vv~v~~~~~~~~~~~~-------------~----ig~----------~~~~~~------------- 109 (390) T protein:vir:78 70 TLDAIGKQTKPLTVVVRVAEGKDADETTSN-------------V----IGT----------VTPDGK------------- 109 (390) T ss_pred hhhhhccccCceEEEEEecccccccccccc-------------c----ccc----------cccccc------------- Confidence 899999999999999999776544322110 0 000 000000 Q ss_pred EEEecccchhhhhhhhhcccccceeEeecccCCccccccceeeeccCcccccchhhHHHHHHHHhhcccceEEEEEcCCC Q lcl|NC_019421. 156 SSSIKGTIDEIVLEINSNLDNEYVIATKVADSDTILANVVNQALEGGNDGCTSITNESYLKALEEFERYSFDSFVLDGVA 235 (473) Q Consensus 156 ~~~~~~~~~~~~~~~~~~~~s~~v~~~~~~~~~~~~~~~~~~~l~gG~dg~~~~t~~d~~~~l~~le~~~~~~l~~p~~~ 235 (473) .+.... +... ...-.....++++|+.+ T Consensus 110 -~tg~~a---------------------------------------------------l~~~-~~~~~~~p~il~ap~~~ 136 (390) T protein:vir:78 110 -YTGIKA---------------------------------------------------LLAA-QGALGVKPRILAAPGLD 136 (390) T ss_pred -cchhhh---------------------------------------------------hhhh-hhhhcceehhhcccccc Confidence 000000 0000 00012233456677777 Q ss_pred cHHHHHHHHHHHHHHhhCCCeEEEEEcCCCCccHHHHHHhhhccCCceEEEecCCceecCcc---cchHHHHHHHHHhhh Q lcl|NC_019421. 236 DEALQETTKAWVAKNKELGKDILLFLGGKTEDNIKQINDKSKSFNDENIVNVGSSAYYENIK---YTPSEVAVYIAALSV 312 (473) Q Consensus 236 ~~~~~~~l~~~v~~~~~~~~~~~av~~~~~~~t~~~~~~~~~~~n~~~i~~~~~~~~~~~~~---~~~~~~a~~vAG~~a 312 (473) ...+++.+..+++++ ++++++..+...+.+.+.++...+++.+.+++.||....+.. ..-...++++||++| T Consensus 137 ~~~v~~~l~~~a~~~-----~~~aivD~p~~~t~~~a~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~~Agl~a 211 (390) T protein:vir:78 137 TQPVAAALAATAQSL-----RAMAYVSASGCKTKEEAAAYRKQFGQREIMVIWPDWLGWDDTTNSTAVIPAPAIAAGLRA 211 (390) T ss_pred hHHHHHHHHHhhccc-----ceEEEEecCCCCCHHHHHHHhhccCCceEEEEcCceEeecccCCcccccchHHHHHHHHH Confidence 777888888887654 357889998889999999999999999999999988653321 112234789999999 Q ss_pred cCcccc----ccceeccC-cccccccCC------HHHHHHHHhCCcEEEEEcCCEEEEEecccccccCCCCCcchhhhhh Q lcl|NC_019421. 313 SKGITG----SICNAKTI-FEEVEPRLS------QSEVKECLKSGTLVLDFDDGDVIIVDDVNTFKKYVDDKNEAMGYIS 381 (473) Q Consensus 313 ~~~~~~----s~t~~~~~-~~~~~~~~t------~~e~~~l~~~G~~~l~~~~~~~~i~~gi~T~~~~~~~~~~~~~~i~ 381 (473) ++|..+ |+.|+++. ....+..++ ..|.+.|+.+|++++.+.+ + ..+||-+|+.+ |+.|++|+ T Consensus 212 ~~D~~~g~~~spaN~~l~gi~~~~~~~~~~~~~~~~~~~~ln~~gi~t~~~~~-G-~~~wG~rT~s~-----d~~~~~i~ 284 (390) T protein:vir:78 212 KIDNDIGWHKTISNVVVNGVSGISADVSWDLQDPATDAGYLNEHEVTTLVNRN-G-FRFWGERTCSD-----DPKFAFEN 284 (390) T ss_pred HhhcCCCcEECcCCceeeceeecceecccccccccchhhhhhhcCcEEEEcCC-C-EEEEcccccCC-----Ccccceee Confidence 999665 55555554 333443333 3455789999999986543 3 45699998643 67899999 Q ss_pred hhHHHHHHHHHHHHHHhhcCCcccCCHHHHHHHHHHHHHHHHHHHhcCCccCccceeccccccCCCC--CEEEEEEEEEE Q lcl|NC_019421. 382 NIMFINTINKDTSLKRKEFVGKIFNDATGQTTVICALKKYFEELMSQGIISEFNVDIDTELQATAKA--DEFYWKWDAVK 459 (473) Q Consensus 382 v~R~~d~i~~~i~~~~~~~ig~~~N~~~~r~~i~~~i~~~l~~l~~~g~i~~~~~~~D~~~~~~~~~--d~~~v~i~v~p 459 (473) +||++|+|+++|+....+++++ ||++.+|..++..|+.||++|+++|+|.+|++.||++.+++++. ..+++.+.++| T Consensus 285 ~rR~~~~i~~~i~~~~~~~v~e-~n~~~~~~~i~~~i~~~L~~l~~~g~l~g~~v~~d~~~nt~~~i~~G~~~~~v~~~p 363 (390) T protein:vir:78 285 YTRTAQVAGDSIAEAQMPVVDG-PLNPSLARDIVESINGWFRQQVANGYLIGGSAWIDPEPNTADILASGKAYIDYDYTP 363 (390) T ss_pred hhhHHHHHHHHHHHHHHHhccC-CCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEccCCCCHHHhhCCeEEEEEEEEe Confidence 9999999999999888888885 89999999999999999999999999999999999988887765 47888999999 Q ss_pred eeeeeeEEEEEEeC Q lcl|NC_019421. 460 VDVMKKIYGTGYLG 473 (473) Q Consensus 460 ~~~~e~i~~t~~v~ 473 (473) +.|+|+|.++++.- T Consensus 364 ~~pae~I~~~~~~~ 377 (390) T protein:vir:78 364 VPPLENLVLRQRIT 377 (390) T ss_pred cCCcceEEEEEEEc Confidence 99999999999998 No 41 >protein:vir:103993 Length: 390 # NCBI annotation: phage tail sheath protein # Family: family:all:115 # MgeID: mge:1665 # MgeName: phi52237 # Cross-refs: genbank:acc:YP_293729;genbank:gi:72537699;genbank:GeneID:3608123 Probab=100.00 E-value=1.1e-51 Score=299.86 Aligned_cols=356 Identities=10% Similarity=0.020 Sum_probs=265.8 Q ss_pred CCccccCCCCceecCceeEEEecCCcceecccCceEEEEEEeeCCC-----CCCceEEeeccHHHHHHHcCCCcCcHHHH Q lcl|NC_019421. 1 MATGTWNEKERKEIPGFYNRFKTQAEKSTNTGLKGRLAMPIRANWG-----DVGKVVTIKNDLRQLKNLFGDDMNYSAFK 75 (473) Q Consensus 1 m~~g~~~~~~~~~~PGvYie~~~~~~~~i~~~~~~~~~~~g~a~~G-----p~~~~v~i~s~~~~~~~~fG~~~~~~~~~ 75 (473) ||. ...|||||+|++.+.+++..+.+++.+|+|.++.+ |.|+|++|+|. .++...||.+ ..+.. T Consensus 1 M~~--------~~~~Gv~v~e~~~g~~~i~~~~tav~g~vg~a~~ad~~~~pln~pv~i~s~-~~~~~~~g~~--gtL~~ 69 (390) T protein:vir:10 1 MPQ--------DYHHGVRVIEINEGGRPIRSVSTAVLGVVCTAADADASAFPLNTPVLLTNV-VAALGKAGKK--GTLRR 69 (390) T ss_pred Ccc--------cccCCeEEEEcCCCcccccccCcceeEEEEcccCcCccccccccceEeccH-HHHHhhcCCC--ceehh Confidence 553 24699999999999999999999999999998765 99999999985 4566678864 35667 Q ss_pred HHHHHHhcCCCEEEEEecCCCcccceeeeecccccccccceEEEEecCccccceeEEEeeccCCccceeeeeecCCceee Q lcl|NC_019421. 76 LGKLALLGNVKELLLYRLVDGNQKKGTLTLKDTTENSAKDVIKLETKYPTARNFNVTIKSNLVDSDKKDFIFFENTKQLF 155 (473) Q Consensus 76 ~v~~~f~~g~~~v~v~rv~~g~~~aat~~l~~~~~~~~~~~l~i~A~~~G~~~n~i~v~~~~~~~~~~~v~v~~~~~~~~ 155 (473) ++..+|.+++..|+++++.++...+++... + -|. ...... T Consensus 70 al~~~~~~gg~~~~vv~v~~~~~~~~~~~~-------------~----ig~----------~~~~~~------------- 109 (390) T protein:vir:10 70 TLDAIGKQTKPLTVVVRVAEGKDADETTSN-------------V----IGT----------VTPDGK------------- 109 (390) T ss_pred hhhhhccccCceEEEEEecccccccccccc-------------c----ccc----------cccccc------------- Confidence 899999999999999999776544322110 0 000 000000 Q ss_pred EEEecccchhhhhhhhhcccccceeEeecccCCccccccceeeeccCcccccchhhHHHHHHHHhhcccceEEEEEcCCC Q lcl|NC_019421. 156 SSSIKGTIDEIVLEINSNLDNEYVIATKVADSDTILANVVNQALEGGNDGCTSITNESYLKALEEFERYSFDSFVLDGVA 235 (473) Q Consensus 156 ~~~~~~~~~~~~~~~~~~~~s~~v~~~~~~~~~~~~~~~~~~~l~gG~dg~~~~t~~d~~~~l~~le~~~~~~l~~p~~~ 235 (473) .+.... +... ...-.....++++|+.+ T Consensus 110 -~tg~~a---------------------------------------------------l~~~-~~~~~~~p~il~ap~~~ 136 (390) T protein:vir:10 110 -YTGIKA---------------------------------------------------LLAA-QGALGVKPRILAAPGLD 136 (390) T ss_pred -cchhhh---------------------------------------------------hhhh-hhhhcceehhhcccccc Confidence 000000 0000 00012233456677777 Q ss_pred cHHHHHHHHHHHHHHhhCCCeEEEEEcCCCCccHHHHHHhhhccCCceEEEecCCceecCcc---cchHHHHHHHHHhhh Q lcl|NC_019421. 236 DEALQETTKAWVAKNKELGKDILLFLGGKTEDNIKQINDKSKSFNDENIVNVGSSAYYENIK---YTPSEVAVYIAALSV 312 (473) Q Consensus 236 ~~~~~~~l~~~v~~~~~~~~~~~av~~~~~~~t~~~~~~~~~~~n~~~i~~~~~~~~~~~~~---~~~~~~a~~vAG~~a 312 (473) ...+++.+..+++++ ++++++..+...+.+.+.++...+++.+.+++.||....+.. ..-...++++||++| T Consensus 137 ~~~v~~~l~~~a~~~-----~~~aivD~p~~~t~~~a~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~~Agl~a 211 (390) T protein:vir:10 137 TQPVAAALAATAQSL-----RAMAYVSASGCKTKEEAAAYRKQFGQREIMVIWPDWLGWDDTTNSTAVIPAPAIAAGLRA 211 (390) T ss_pred hHHHHHHHHHhhccc-----ceEEEEecCCCCCHHHHHHHhhccCCceEEEEcCceEeecccCCcccccchHHHHHHHHH Confidence 777888888887654 357889998889999999999999999999999988653321 112234789999999 Q ss_pred cCcccc----ccceeccC-cccccccCC------HHHHHHHHhCCcEEEEEcCCEEEEEecccccccCCCCCcchhhhhh Q lcl|NC_019421. 313 SKGITG----SICNAKTI-FEEVEPRLS------QSEVKECLKSGTLVLDFDDGDVIIVDDVNTFKKYVDDKNEAMGYIS 381 (473) Q Consensus 313 ~~~~~~----s~t~~~~~-~~~~~~~~t------~~e~~~l~~~G~~~l~~~~~~~~i~~gi~T~~~~~~~~~~~~~~i~ 381 (473) ++|..+ |+.|+++. ....+..++ ..|.+.|+.+|++++.+.+ + ..+||-+|+.+ |+.|++|+ T Consensus 212 ~~D~~~g~~~spaN~~l~gi~~~~~~~~~~~~~~~~~~~~ln~~gi~t~~~~~-G-~~~wG~rT~s~-----d~~~~~i~ 284 (390) T protein:vir:10 212 KIDNDIGWHKTISNVVVNGVSGISADVSWDLQDPATDAGYLNEHEVTTLVNRN-G-FRFWGERTCSD-----DPKFAFEN 284 (390) T ss_pred HhhcCCCcEECcCCceeeceeecceecccccccccchhhhhhhcCcEEEEcCC-C-EEEEcccccCC-----Ccccceee Confidence 999665 55555554 333443333 3455789999999986543 3 45699998643 67899999 Q ss_pred hhHHHHHHHHHHHHHHhhcCCcccCCHHHHHHHHHHHHHHHHHHHhcCCccCccceeccccccCCCC--CEEEEEEEEEE Q lcl|NC_019421. 382 NIMFINTINKDTSLKRKEFVGKIFNDATGQTTVICALKKYFEELMSQGIISEFNVDIDTELQATAKA--DEFYWKWDAVK 459 (473) Q Consensus 382 v~R~~d~i~~~i~~~~~~~ig~~~N~~~~r~~i~~~i~~~l~~l~~~g~i~~~~~~~D~~~~~~~~~--d~~~v~i~v~p 459 (473) +||++|+|+++|+....+++++ ||++.+|..++..|+.||++|+++|+|.+|++.||++.+++++. ..+++.+.++| T Consensus 285 ~rR~~~~i~~~i~~~~~~~v~e-~n~~~~~~~i~~~i~~~L~~l~~~g~l~g~~v~~d~~~nt~~~i~~G~~~~~v~~~p 363 (390) T protein:vir:10 285 YTRTAQVAGDSIAEAQMPVVDG-PLNPSLARDIVESINGWFRQQVANGYLIGGSAWIDPEPNTADILASGKAYIDYDYTP 363 (390) T ss_pred hhhHHHHHHHHHHHHHHHhccC-CCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEccCCCCHHHhhCCeEEEEEEEEe Confidence 9999999999999888888885 89999999999999999999999999999999999988887765 47888999999 Q ss_pred eeeeeeEEEEEEeC Q lcl|NC_019421. 460 VDVMKKIYGTGYLG 473 (473) Q Consensus 460 ~~~~e~i~~t~~v~ 473 (473) +.|+|+|.++++.- T Consensus 364 ~~pae~I~~~~~~~ 377 (390) T protein:vir:10 364 VPPLENLVLRQRIT 377 (390) T ss_pred cCCcceEEEEEEEc Confidence 99999999999998 No 42 >protein:vir:79141 Length: 391 # NCBI annotation: major tail seath protein gpFI # Family: family:all:115 # MgeID: mge:1863 # MgeName: RSA1 # Cross-refs: genbank:acc:YP_001165276;genbank:gi:145708101;genbank:GeneID:5247152 Probab=100.00 E-value=1.4e-51 Score=299.39 Aligned_cols=353 Identities=9% Similarity=-0.023 Sum_probs=262.1 Q ss_pred CCccccCCCCceecCceeEEEecCCcceecccCceEEEEEEeeC-----CCCCCceEEeeccHHHHHHHcCCCcCcHHHH Q lcl|NC_019421. 1 MATGTWNEKERKEIPGFYNRFKTQAEKSTNTGLKGRLAMPIRAN-----WGDVGKVVTIKNDLRQLKNLFGDDMNYSAFK 75 (473) Q Consensus 1 m~~g~~~~~~~~~~PGvYie~~~~~~~~i~~~~~~~~~~~g~a~-----~Gp~~~~v~i~s~~~~~~~~fG~~~~~~~~~ 75 (473) ||. ++ .|||||+|..++.+++.++++.+.+|+|.++ .+|+|+|++|+|+. ++...||.+. .+.+ T Consensus 1 M~~-------~~-~pGv~v~e~~~~~~~i~~~~tav~~~vg~a~~a~~~~~p~n~pv~iss~~-~~~~~~g~~g--tl~~ 69 (391) T protein:vir:79 1 MPT-------DY-HHGVRVVELNDGTRPIRTIETAVAGIVCTADDADAATFPLDTPVLLTNPQ-AYIGKAGDKG--TLAH 69 (391) T ss_pred CCC-------CC-CCCeEEEECCCCcccccccCCceEEEEeecccccccccccccCEEeccHH-HHHHhcCCcc--ccch Confidence 662 33 6999999999999999999999999999875 68999999999954 5676777542 4567 Q ss_pred HHHHHHhcCCCEEEEEecCCCcccceeeeecccccccccceEEEEecCccccceeEEEeeccCCccceeeeeecCCceee Q lcl|NC_019421. 76 LGKLALLGNVKELLLYRLVDGNQKKGTLTLKDTTENSAKDVIKLETKYPTARNFNVTIKSNLVDSDKKDFIFFENTKQLF 155 (473) Q Consensus 76 ~v~~~f~~g~~~v~v~rv~~g~~~aat~~l~~~~~~~~~~~l~i~A~~~G~~~n~i~v~~~~~~~~~~~v~v~~~~~~~~ 155 (473) ++..+|.+++..++++++........+.. .. T Consensus 70 al~~~~~~gg~~~~vv~~~~~~~~~~~~~----------~~--------------------------------------- 100 (391) T protein:vir:79 70 TLDAITDQTNPLTVVVRVAGGASEAETTS----------NL--------------------------------------- 100 (391) T ss_pred hhhhhhcccccceeeeccccccccccccc----------cc--------------------------------------- Confidence 88999999999999998765432211100 00 Q ss_pred EEEecccchhhhhhhhhcccccceeEeecccCCccccccceeeeccCcccccchhhHHHHHHHHh---hcccceEEEEEc Q lcl|NC_019421. 156 SSSIKGTIDEIVLEINSNLDNEYVIATKVADSDTILANVVNQALEGGNDGCTSITNESYLKALEE---FERYSFDSFVLD 232 (473) Q Consensus 156 ~~~~~~~~~~~~~~~~~~~~s~~v~~~~~~~~~~~~~~~~~~~l~gG~dg~~~~t~~d~~~~l~~---le~~~~~~l~~p 232 (473) .++.+.....+. + .+|.. .......++++| T Consensus 101 --------------------------------------------~g~~~~~~~~tG--l-~~l~~~~~~~~~~p~~l~~p 133 (391) T protein:vir:79 101 --------------------------------------------IGTTNAAGRYTG--M-KALLTARNRFGVAPRILAVP 133 (391) T ss_pred --------------------------------------------cccccchhhhHH--H-hhhhhhhhhhcccchhhcCC Confidence 000000000000 0 00000 011223445567 Q ss_pred CCCcHHHHHHHHHHHHHHhhCCCeEEEEEcCCCCccHHHHHHhhhccCCceEEEecCCceecCc---ccchHHHHHHHHH Q lcl|NC_019421. 233 GVADEALQETTKAWVAKNKELGKDILLFLGGKTEDNIKQINDKSKSFNDENIVNVGSSAYYENI---KYTPSEVAVYIAA 309 (473) Q Consensus 233 ~~~~~~~~~~l~~~v~~~~~~~~~~~av~~~~~~~t~~~~~~~~~~~n~~~i~~~~~~~~~~~~---~~~~~~~a~~vAG 309 (473) +.+...+++++.++|++++ +++++..+...+.+.+......+++.+++.+.||....+. .......++++|| T Consensus 134 ~~~~~~v~~al~~~~~~~~-----~~ai~d~p~~~t~~~a~~~~~~~~s~~~a~~~P~~~~~d~~~~~~~~~p~s~~~AG 208 (391) T protein:vir:79 134 GLDSLPVGTELVTIAQKLR-----AFAYLSAYGCQTKEEAVAYRSNFGQREAMVMWPDFVGWDTAANAETTLWATARAVG 208 (391) T ss_pred ccchhHHHHHHHHHHhhcC-----cEEEEECCCCCCHHHHHHHHhccCCceeEEecceeeeecCcCCceeeechHHHHHH Confidence 7777778888888876543 5688888888899999999999999999999998865332 1222234789999 Q ss_pred hhhcCccccccce----eccC-cccccccCC------HHHHHHHHhCCcEEEEEcCCEEEEEecccccccCCCCCcchhh Q lcl|NC_019421. 310 LSVSKGITGSICN----AKTI-FEEVEPRLS------QSEVKECLKSGTLVLDFDDGDVIIVDDVNTFKKYVDDKNEAMG 378 (473) Q Consensus 310 ~~a~~~~~~s~t~----~~~~-~~~~~~~~t------~~e~~~l~~~G~~~l~~~~~~~~i~~gi~T~~~~~~~~~~~~~ 378 (473) ++|++|..+++|. +.+. +.++...++ ..|.+.|+.+|++++.+. +..++||-+|+++ |+.|+ T Consensus 209 ~~a~~D~~~g~~~spaN~~l~gi~~~~~~~~~~~~~~~~~~~~Ln~~~I~t~~~~--~G~~~wG~rT~~~-----d~~~~ 281 (391) T protein:vir:79 209 LRAKIDNDTGWHKTLSNVAVGGVTGLSRDVFWDLQDPATDAGYLNANEVTTLVHR--DGYRFWGSRTCSA-----DPLFA 281 (391) T ss_pred HHHHhhhcccceeccCCceehhhhccccccccccccccchhhhhhhcCceEEECC--CcEEEEcccccCC-----Ccccc Confidence 9999996655544 4443 333443333 345678999999998543 3467799888643 78899 Q ss_pred hhhhhHHHHHHHHHHHHHHhhcCCcccCCHHHHHHHHHHHHHHHHHHHhcCCccCccceeccccccCCCC--CEEEEEEE Q lcl|NC_019421. 379 YISNIMFINTINKDTSLKRKEFVGKIFNDATGQTTVICALKKYFEELMSQGIISEFNVDIDTELQATAKA--DEFYWKWD 456 (473) Q Consensus 379 ~i~v~R~~d~i~~~i~~~~~~~ig~~~N~~~~r~~i~~~i~~~l~~l~~~g~i~~~~~~~D~~~~~~~~~--d~~~v~i~ 456 (473) +|++||++|+|+++|+....+++++ ||++.+|.+++..|+.||++|+++|+|.+|++.||.+.+++++. ..++++++ T Consensus 282 ~i~~rR~~~~i~~~i~~~~~~~v~e-pn~~~~~~~i~~~i~~~l~~l~~~g~l~g~~v~~~~~~nt~~~i~~G~~~~~i~ 360 (391) T protein:vir:79 282 FENYTRTAQVLADTMAEAHMWANDL-PMTPTLVRDLLEGINAKLRMLTRNGYLLGGAAWFDADANSKDTLKAGQLAIDYD 360 (391) T ss_pred eeehhhHHHHHHHHHHHHHHHhccC-CCCHHHHHHHHHHHHHHHHHHHhCCceeceEEEEecCCCCHHHhhCCEEEEEEE Confidence 9999999999999999888888885 89999999999999999999999999999999999998887764 47889999 Q ss_pred EEEeeeeeeEEEEEEeC Q lcl|NC_019421. 457 AVKVDVMKKIYGTGYLG 473 (473) Q Consensus 457 v~p~~~~e~i~~t~~v~ 473 (473) ++|+.|+|||.++++.- T Consensus 361 ~~p~~p~e~i~~~~~~~ 377 (391) T protein:vir:79 361 YTPVPPLENLTFRQRIT 377 (391) T ss_pred EEecCCcceEEEEEEEc Confidence 99999999999999988 No 43 >protein:vir:96740 Length: 388 # NCBI annotation: phage tail sheath protein FI-like # Family: family:all:115 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039839;genbank:gi:126010913;genbank:GeneID:5076250 Probab=100.00 E-value=1.1e-49 Score=288.94 Aligned_cols=353 Identities=10% Similarity=0.021 Sum_probs=267.4 Q ss_pred CCccccCCCCceecCceeEEEecCCcceecccCceEEEEEEeeCCC----CCCceEEeeccHHHHHHHcCC-CcCcHHHH Q lcl|NC_019421. 1 MATGTWNEKERKEIPGFYNRFKTQAEKSTNTGLKGRLAMPIRANWG----DVGKVVTIKNDLRQLKNLFGD-DMNYSAFK 75 (473) Q Consensus 1 m~~g~~~~~~~~~~PGvYie~~~~~~~~i~~~~~~~~~~~g~a~~G----p~~~~v~i~s~~~~~~~~fG~-~~~~~~~~ 75 (473) |..= ..=+||||++|+++++++|.++++++++|+|.++-+ |.++|+.|.+. .++..++|. .....+.. T Consensus 1 m~~~------~~~~hGv~v~ev~~g~~~i~~~~tavi~~Vgta~~ad~~~p~~~~~~i~~~-~d~~~~~~~~~~~gtl~~ 73 (388) T protein:vir:96 1 MPVI------DQFEHNGISIETHEPPPPMGPPGDNVVAWVVTAPDKHADVAFSVPFRVANT-ADAQYLDSTGNELGTGWH 73 (388) T ss_pred CCCC------CCCCCceEEEEcCCCcccccccCcceeEEEEecCCCccccccccceeeecc-hhhhhhhccccccccchh Confidence 4321 112489999999999999999999999999998664 88999999885 446666664 33455667 Q ss_pred HHHHHHhcCCCEEEEEecCCCcccceeeeecccccccccceEEEEecCccccceeEEEeeccCCccceeeeeecCCceee Q lcl|NC_019421. 76 LGKLALLGNVKELLLYRLVDGNQKKGTLTLKDTTENSAKDVIKLETKYPTARNFNVTIKSNLVDSDKKDFIFFENTKQLF 155 (473) Q Consensus 76 ~v~~~f~~g~~~v~v~rv~~g~~~aat~~l~~~~~~~~~~~l~i~A~~~G~~~n~i~v~~~~~~~~~~~v~v~~~~~~~~ 155 (473) ++..+|.+++..++++|+..|...+++.+ + T Consensus 74 al~~~~~~~~~~~~vv~v~~g~~~~at~a---------------------------------------~----------- 103 (388) T protein:vir:96 74 AASETLKKTSVPQYFIVVPEGADDAATMA---------------------------------------N----------- 103 (388) T ss_pred hhHhhhccCCceEEEEEeccccccccccc---------------------------------------e----------- Confidence 89999999998999999876644322110 0 Q ss_pred EEEecccchhhhhhhhhcccccceeEeecccCCccccccceeeeccCcccccchhhHHHHHHHHhhccc--ceEEEEEcC Q lcl|NC_019421. 156 SSSIKGTIDEIVLEINSNLDNEYVIATKVADSDTILANVVNQALEGGNDGCTSITNESYLKALEEFERY--SFDSFVLDG 233 (473) Q Consensus 156 ~~~~~~~~~~~~~~~~~~~~s~~v~~~~~~~~~~~~~~~~~~~l~gG~dg~~~~t~~d~~~~l~~le~~--~~~~l~~p~ 233 (473) +.||.+..+ . ..+++.+++.. ..++|++|+ T Consensus 104 -------------------------------------------iig~~~~~t---g--~~~gl~al~~~~~~p~il~aPg 135 (388) T protein:vir:96 104 -------------------------------------------IIGGIDPTT---G--RRTGIAALTECTERPTLIGAPG 135 (388) T ss_pred -------------------------------------------eeeeccccc---c--hhhHHHHhhhcccceeEEEeec Confidence 000110000 0 11234444433 358999998 Q ss_pred CC-cHHHHHHHHHHHHHHhhCCCeEEEEEcCCCCccHHHHHH----hhhccCCceEEEecCCceecCc---ccchHHHHH Q lcl|NC_019421. 234 VA-DEALQETTKAWVAKNKELGKDILLFLGGKTEDNIKQIND----KSKSFNDENIVNVGSSAYYENI---KYTPSEVAV 305 (473) Q Consensus 234 ~~-~~~~~~~l~~~v~~~~~~~~~~~av~~~~~~~t~~~~~~----~~~~~n~~~i~~~~~~~~~~~~---~~~~~~~a~ 305 (473) .+ ...+++++.++|++++ +++++.++.+.+.+.... ....++|.+.+.+.||....+. .......++ T Consensus 136 ~s~~~~v~~al~~~~~~~~-----~~~i~D~p~~~~~~~~~~~~~~~~~~~~s~~~~~~~P~~~~~d~~~~~~~~~p~s~ 210 (388) T protein:vir:96 136 FSQNKAVIDALASMAKRLK-----CRAVIDGPSGSTQDAIDLSGLLGGEGTGHDRVYMVDPMPAIYSRKAQGNIYVPPST 210 (388) T ss_pred cccchHHHHHHHHHHhhcC-----cEEEEeccCCchhHHHHHHhhhhccCcCcceEEEEeCceeeecccCCceeeechHH Confidence 54 4578889999987653 578888776544332221 2335789999999998765332 222234579 Q ss_pred HHHHhhhcCccccccceeccCccccc------ccCCHHHHHHHHhCCcEEEEEcCCEEEEEecccccccCCCCCcchhhh Q lcl|NC_019421. 306 YIAALSVSKGITGSICNAKTIFEEVE------PRLSQSEVKECLKSGTLVLDFDDGDVIIVDDVNTFKKYVDDKNEAMGY 379 (473) Q Consensus 306 ~vAG~~a~~~~~~s~t~~~~~~~~~~------~~~t~~e~~~l~~~G~~~l~~~~~~~~i~~gi~T~~~~~~~~~~~~~~ 379 (473) ++||++|++|+.+|+.|+++....+. ...++.|.+.|+++|++++.+..++..+.||-+|+ .|++ T Consensus 211 ~~AG~~a~~D~~~spaN~~i~i~g~~~~~~~~~~~~~~~~~~Ln~~gI~~i~~~~~~G~~~wG~rT~---------~~~~ 281 (388) T protein:vir:96 211 IAMGAVAAVKPWESPGNQGVLIQDVARVIDYNILDKSTEGDLLNRNGVSYFARTSMGGFSLIGNRTV---------TGKF 281 (388) T ss_pred HHHHHHHhhcCcccccCeeEEeeeecccccccccCChhhHHhhhhcCceEEEEecCCcEEEEccccc---------CCcc Confidence 99999999999999999987544332 23467889999999999999887777777998875 3999 Q ss_pred hhhhHHHHHHHHHHHHHHhhcCCcccCCHHHHHHHHHHHHHHHHHHHhcCCccCccceeccccccCCCCC--EEEEEEEE Q lcl|NC_019421. 380 ISNIMFINTINKDTSLKRKEFVGKIFNDATGQTTVICALKKYFEELMSQGIISEFNVDIDTELQATAKAD--EFYWKWDA 457 (473) Q Consensus 380 i~v~R~~d~i~~~i~~~~~~~ig~~~N~~~~r~~i~~~i~~~l~~l~~~g~i~~~~~~~D~~~~~~~~~d--~~~v~i~v 457 (473) |++||++|+|+++|+....+++++ ||++.+|.+++..|+.||++|+++|+|.+|++.||.+.|++++.+ .+++++++ T Consensus 282 i~vrR~~~~i~~si~~~~~~~v~e-pn~~~~~~~i~~~i~~fL~~l~~~Gal~g~~~~~d~~~nt~~~i~~G~~~~~i~~ 360 (388) T protein:vir:96 282 ISFVGLEDAIARKLEAASQRAMSK-QLTKSFMEQEIKKINLFMQDLVAAEIIPGGEVYLHPTLNTVERYKNGSWYIVIDY 360 (388) T ss_pred eeehhhHHHHHHHHHHHHHHhccC-CCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEecCCCCHHHhhCCEEEEEEEE Confidence 999999999999999888777775 999999999999999999999999999999999999988877654 78899999 Q ss_pred EEeeeeeeEEEEEEeC Q lcl|NC_019421. 458 VKVDVMKKIYGTGYLG 473 (473) Q Consensus 458 ~p~~~~e~i~~t~~v~ 473 (473) +|+.|+|||.+++++- T Consensus 361 ~p~~pae~I~~~~~~~ 376 (388) T protein:vir:96 361 GRYSPNEHMIFHLNAV 376 (388) T ss_pred EecCCcceEEEEEEEc Confidence 9999999999999998 No 44 >protein:vir:4517 Length: 498 # NCBI annotation: tail sheath protein # Family: family:all:369 # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599043;genbank:gi:19549001;genbank:GeneID:935236 Probab=100.00 E-value=3.8e-48 Score=280.59 Aligned_cols=451 Identities=13% Similarity=0.093 Sum_probs=305.8 Q ss_pred CccccCCCCceec-CceeEEEecCCcceecccCceEEEEEEeeC---CCCCCceEEeeccHHHHHHHcCCCcCcHHHHHH Q lcl|NC_019421. 2 ATGTWNEKERKEI-PGFYNRFKTQAEKSTNTGLKGRLAMPIRAN---WGDVGKVVTIKNDLRQLKNLFGDDMNYSAFKLG 77 (473) Q Consensus 2 ~~g~~~~~~~~~~-PGvYie~~~~~~~~i~~~~~~~~~~~g~a~---~Gp~~~~v~i~s~~~~~~~~fG~~~~~~~~~~v 77 (473) |.-.|+..++.+| ||+|+||.++.. ..+.....+.++|... ..+.++|++|+| .++.+.+||.++ -+..++ T Consensus 1 M~IsF~~IP~~iRvP~~y~E~dns~A--~~~~~~q~vLiiGq~la~gs~~~~~~v~v~s-~~~a~~lfG~GS--ml~~M~ 75 (498) T protein:vir:45 1 MTISFNTIPSNTLVPLFYAEMDNQAA--NTAQDSGASLLIGHANNGAEIVANSLVLMPS-ADYARQICGAGS--QLARMV 75 (498) T ss_pred CCCchhhcCcccccCeEEEEEeCCCC--CCCCCCcceEEEEecCCccccccceeEEecC-HHHHHHhcCcCc--HHHHHH Confidence 9999999999999 999999998876 3455667888888753 348899999999 567999999883 233445 Q ss_pred HHHHhcCCC-EEEEEecCCCcccceeeeecccccccccceEEEEecCccccceeEEEeeccCCcc-----ceeeeeecCC Q lcl|NC_019421. 78 KLALLGNVK-ELLLYRLVDGNQKKGTLTLKDTTENSAKDVIKLETKYPTARNFNVTIKSNLVDSD-----KKDFIFFENT 151 (473) Q Consensus 78 ~~~f~~g~~-~v~v~rv~~g~~~aat~~l~~~~~~~~~~~l~i~A~~~G~~~n~i~v~~~~~~~~-----~~~v~v~~~~ 151 (473) +.+.+++.. ++|++-+.+.+..+|+.++.-+......+.+.+ +-|.. .|.|.+...|.. .....+.-+. T Consensus 76 ~a~~~~n~~~~l~~i~~~d~aG~aA~g~it~tg~at~~G~l~l---~Igg~--~v~v~V~~gdTaa~vA~al~aaina~~ 150 (498) T protein:vir:45 76 EAYRQTDPFGELYVIAVPEATGAAATVTLTVTGEATESGTVNV---YVGRT--RVQAPVTNGDNVTTIASSIQDAINAVP 150 (498) T ss_pred HHHHHhCCcceEEEEeeCCcccceeEEEEEeecccCCCcEEEE---EECCE--EEEEEecCCCCHHHHHHHHHHHHhCCC Confidence 555554443 688888876444444444433333333444433 11332 344443333211 0000111112 Q ss_pred ceeeEEEecccchhhhhhhhhcccccc-eeEeecc--cCCcccccc--ceeeeccCcccccchhhHHHHHHHHhhcccce Q lcl|NC_019421. 152 KQLFSSSIKGTIDEIVLEINSNLDNEY-VIATKVA--DSDTILANV--VNQALEGGNDGCTSITNESYLKALEEFERYSF 226 (473) Q Consensus 152 ~~~~~~~~~~~~~~~~~~~~~~~~s~~-v~~~~~~--~~~~~~~~~--~~~~l~gG~dg~~~~t~~d~~~~l~~le~~~~ 226 (473) ....+.......-.+.++.+....|++ ++..... ..+.+|+.. ....++||+. +.|+.++|+++...+| T Consensus 151 ~lPVTA~~~~~~VtlTAr~kG~~GN~I~l~~~~~~~~~ge~~p~Glt~~itamagGag------~PD~a~alaal~~~~~ 224 (498) T protein:vir:45 151 TLPFTASSSAGVVTLTARHKGLCGNEIPVSLNYYGFGGGEVLPAGVQIAVATGTAGTG------APVLTGAVAAMADEPF 224 (498) T ss_pred CCceEEEecCceEEEEeeccCccccceeEEEeeccccccccccceeeEEEEccCCCcc------CchhHHHHHHhccCCc Confidence 222233333333333444444445553 2222211 122333333 2345556552 4488999999999999 Q ss_pred EEEEEcCCCcHHHHHHHHHHHHHHhh---CCCeEEEEEcCCCCccHHHHHHhhhccCCceEEEecCCceecCcccchHHH Q lcl|NC_019421. 227 DSFVLDGVADEALQETTKAWVAKNKE---LGKDILLFLGGKTEDNIKQINDKSKSFNDENIVNVGSSAYYENIKYTPSEV 303 (473) Q Consensus 227 ~~l~~p~~~~~~~~~~l~~~v~~~~~---~~~~~~av~~~~~~~t~~~~~~~~~~~n~~~i~~~~~~~~~~~~~~~~~~~ 303 (473) ++|++|.++..++ .++.+|+...+. .-+++-++.......|+++..++....|++++.+++.. .+...+++.. T Consensus 225 ~~I~~p~~D~asL-~al~~~L~~~sgRw~~~~q~~g~~~~a~~gT~~~l~t~g~~~N~~~it~~~~~---~~~~sp~~~~ 300 (498) T protein:vir:45 225 DYIGLPFNDTASV-NTLVTEMNDTSGRWSYARQLYGHVYTAKTGTLSELVNAGDQFNQQHITLAGYE---KETQTPADEL 300 (498) T ss_pred cEEEEeeCCHHHH-HHHHHHHhhhhhhhhHHhhcCeEEEEeccCCHHHHHHhhhccCCceEEEEecC---CCCCChHHHH Confidence 9999998665555 688899986543 23344455555677799999999999999999876421 1223455677 Q ss_pred HHHHHHhhh---cCccccccceeccCcc---cccccCCHHHHHHHHhCCcEEEEEcCCEEEEEecccccccCC-CCCcch Q lcl|NC_019421. 304 AVYIAALSV---SKGITGSICNAKTIFE---EVEPRLSQSEVKECLKSGTLVLDFDDGDVIIVDDVNTFKKYV-DDKNEA 376 (473) Q Consensus 304 a~~vAG~~a---~~~~~~s~t~~~~~~~---~~~~~~t~~e~~~l~~~G~~~l~~~~~~~~i~~gi~T~~~~~-~~~~~~ 376 (473) ++.+||.+| ..||.+++....++++ ....+|+..|++.|+.+|+.+++..+|.++|+|.|+||++.. ...|+. T Consensus 301 AAa~aa~~A~~l~~DPArPL~tl~L~Gi~~p~~~~r~~~~ern~LL~~Gist~~V~~G~V~I~R~ITTY~~n~~G~~D~s 380 (498) T protein:vir:45 301 AASRTARAAVFIRNDPARPTQTGELVGMLPAPKGKRFTMTEQQTLLSHGVATAYVESGVLRIQRDVTTYRKNAYGVADNS 380 (498) T ss_pred HHHHHHHHHHHhhcccccccCceeecceecCCchhcCChHHHHHHHhCCcceEEEcCCeEEEEeeeeeeeecCCCCcchh Confidence 777777777 7999999998888765 356789999999999999999998888999999999998554 577899 Q ss_pred hhhhhhhHHHHHHHHHHHHHHhhcCCcccCCHH-----------HHHHHHHHHHHHHHHHHhcCCccCccceeccccccC Q lcl|NC_019421. 377 MGYISNIMFINTINKDTSLKRKEFVGKIFNDAT-----------GQTTVICALKKYFEELMSQGIISEFNVDIDTELQAT 445 (473) Q Consensus 377 ~~~i~v~R~~d~i~~~i~~~~~~~ig~~~N~~~-----------~r~~i~~~i~~~l~~l~~~g~i~~~~~~~D~~~~~~ 445 (473) |.+|++.|+++|+.+.+|..+....++.+...+ ....+|+++.+.+++|+.+|++|+++..+...+... T Consensus 381 yLDi~T~~tl~yvr~~~r~~i~~kfpR~KLa~dg~~~~~gq~IvTp~~ir~ell~~y~~le~~givEn~~~~~~~LiVer 460 (498) T protein:vir:45 381 YLDSETLHTSAYVLRKLKSVITSKYGRHKLASDGTRFGPGQAIVTPAVIKGELLATYRQLERAGIVENYELFKQYLVVER 460 (498) T ss_pred hhhhhhHHHHHHHHHHHHHHhhhhcCCeeecccCcccCCCCcccchHHHHHHHHHHHHhhhhhccccChhhhcceeEEEE Confidence 999999999999999999877766676665554 457799999999999999999999987665544445 Q ss_pred CCCCEEEEEEEEEEeeeeeeEEEEEEeC Q lcl|NC_019421. 446 AKADEFYWKWDAVKVDVMKKIYGTGYLG 473 (473) Q Consensus 446 ~~~d~~~v~i~v~p~~~~e~i~~t~~v~ 473 (473) +..|.-++++.+ |.+-++.+++--..- T Consensus 461 d~~dpnRln~~~-p~d~vn~L~V~A~~~ 487 (498) T protein:vir:45 461 DASVPNRLNTLF-PPDYVNQLRVFAVVN 487 (498) T ss_pred CCCCCcEEEEEe-cccccCchhhhhhhh Confidence 555566777744 888888776543222 No 45 >protein:vir:100323 Length: 393 # NCBI annotation: tail sheath protein FI # Family: family:all:115 # MgeID: mge:1484 # MgeName: phi-MhaA1-PHL101 # Cross-refs: genbank:acc:YP_655492;genbank:gi:109289960;genbank:GeneID:4157366 Probab=100.00 E-value=3.4e-48 Score=280.85 Aligned_cols=354 Identities=12% Similarity=0.020 Sum_probs=259.7 Q ss_pred CCccccCCCCceecCceeEEEecCCcceecccCceEEEEEEeeCCC-----CCCceEEeeccHHHHHHHcCCCcCcHHHH Q lcl|NC_019421. 1 MATGTWNEKERKEIPGFYNRFKTQAEKSTNTGLKGRLAMPIRANWG-----DVGKVVTIKNDLRQLKNLFGDDMNYSAFK 75 (473) Q Consensus 1 m~~g~~~~~~~~~~PGvYie~~~~~~~~i~~~~~~~~~~~g~a~~G-----p~~~~v~i~s~~~~~~~~fG~~~~~~~~~ 75 (473) |+ .....+||||++|.+++.+++.++++++.+|+|.+..+ |+|+|++|+|+.+ +...||.. ..+.+ T Consensus 1 m~------m~~~~~~GV~v~e~~~g~~~i~~~~tav~~~vgta~~~~~~~~pln~pv~i~s~~~-~~~~~g~~--g~L~~ 71 (393) T protein:vir:10 1 MS------ILDTYLHGVEVVEVNAGGVTISTAATSVIGVVCTGDQADAETFPLNTPVLITNPLN-YLEKAGST--GTLRR 71 (393) T ss_pred CC------CCCccCCCeEEEEcCCCcceecccCcceeEEEeeccCcCcccccCccceEecchHH-HHHhhCCc--cchhh Confidence 32 22234599999999999999999999999999999877 9999999998554 66677753 25667 Q ss_pred HHHHHHhcCCCEEEEEecCCCcccceeeeecccccccccceEEEEecCccccceeEEEeeccCCccceeeeeecCCceee Q lcl|NC_019421. 76 LGKLALLGNVKELLLYRLVDGNQKKGTLTLKDTTENSAKDVIKLETKYPTARNFNVTIKSNLVDSDKKDFIFFENTKQLF 155 (473) Q Consensus 76 ~v~~~f~~g~~~v~v~rv~~g~~~aat~~l~~~~~~~~~~~l~i~A~~~G~~~n~i~v~~~~~~~~~~~v~v~~~~~~~~ 155 (473) ++..+|.+++..|+++++..+...+.+... . + | T Consensus 72 al~~~~~~~~~~~~vv~v~~~~~~~~t~~~----------i--i-----g------------------------------ 104 (393) T protein:vir:10 72 TLNSIGSIVKTPTVIVRVAESDDSDTLTAN----------I--V-----G------------------------------ 104 (393) T ss_pred hhhhhhcccCceEEEeecccCccccccccc----------c--c-----c------------------------------ Confidence 899999999999999998765433211100 0 0 0 Q ss_pred EEEecccchhhhhhhhhcccccceeEeecccCCccccccceeeeccCcccccchhhHHHHHHHHh---hcccceEEEEEc Q lcl|NC_019421. 156 SSSIKGTIDEIVLEINSNLDNEYVIATKVADSDTILANVVNQALEGGNDGCTSITNESYLKALEE---FERYSFDSFVLD 232 (473) Q Consensus 156 ~~~~~~~~~~~~~~~~~~~~s~~v~~~~~~~~~~~~~~~~~~~l~gG~dg~~~~t~~d~~~~l~~---le~~~~~~l~~p 232 (473) +.+. ...+ ...+|.. ......++++.| T Consensus 105 ----------------------------------------------~~~~-~~~t---gl~al~~~~~~~~~~p~li~ap 134 (393) T protein:vir:10 105 ----------------------------------------------TQEN-GKFT---GIKALLTAQSTVFVKPKLLCVP 134 (393) T ss_pred ----------------------------------------------cccc-chhh---HHHHHHhhhhhcceeeeeeeec Confidence 0000 0000 0011111 112335778889 Q ss_pred CCCcHHHHHHHHHHHHHHhhCCCeEEEEEcCCCCccHHHHHHhhhccCCceEEEecCCceecC---cccchHHHHHHHHH Q lcl|NC_019421. 233 GVADEALQETTKAWVAKNKELGKDILLFLGGKTEDNIKQINDKSKSFNDENIVNVGSSAYYEN---IKYTPSEVAVYIAA 309 (473) Q Consensus 233 ~~~~~~~~~~l~~~v~~~~~~~~~~~av~~~~~~~t~~~~~~~~~~~n~~~i~~~~~~~~~~~---~~~~~~~~a~~vAG 309 (473) +.++..+++++.++|++++. ..++..++..+.+.+......+++.+.+.+.||....+ ....-...++++|| T Consensus 135 g~~~~~~~~al~~~~~~~~~-----~~~v~d~~~~t~~~ai~~~~~~~s~~~~~~~P~~~~~d~~~~~~~~~p~s~~~Ag 209 (393) T protein:vir:10 135 QHDNQAVATELLSVAKKLNA-----FAFISDNGATTKEQAYTYRQNFSQREGMMIFGDWKSYNTDKKAYDTDYAVARACA 209 (393) T ss_pred cccchHHHHHHHHHhhccCc-----EEEEEcCCCCCHHHHHHHhhhcCCceEEEEecccccccccCCceeEeehhHHHHH Confidence 98888888888888876542 33344455667888888888899999999999876432 12222334789999 Q ss_pred hhhcCccccccc----eeccC-cccccc------cCCHHHHHHHHhCCcEEEEEcCCEEEEEecccccccCCCCCcchhh Q lcl|NC_019421. 310 LSVSKGITGSIC----NAKTI-FEEVEP------RLSQSEVKECLKSGTLVLDFDDGDVIIVDDVNTFKKYVDDKNEAMG 378 (473) Q Consensus 310 ~~a~~~~~~s~t----~~~~~-~~~~~~------~~t~~e~~~l~~~G~~~l~~~~~~~~i~~gi~T~~~~~~~~~~~~~ 378 (473) ++|++|..++++ |+++. +..+.. ..++.|.+.|+++|++++.+ ++ ....||-+|+.+ |+.|+ T Consensus 210 ~~a~~d~~~G~~~spaN~~l~gi~~~~~~~~~~~~~~~~~~~~Ln~~gI~t~~~-~~-G~~~wG~rT~s~-----d~~~~ 282 (393) T protein:vir:10 210 LQAYIDKTVGWHKNISNVELDGVTGITKAVEFDINESSTEANYLNEKGITICLN-HN-GFRYWGSRTLAT-----DTRWA 282 (393) T ss_pred HHHHhhcCCCcEEccCCceeeceeecceecccccCCCcchhHhHhhcCceEEEc-CC-CEEEEcccccCC-----Ccccc Confidence 999998766554 44443 223322 24577899999999999854 33 356799988753 67899 Q ss_pred hhhhhHHHHHHHHHHHHHHhhcCCcccCCHHHHHHHHHHHHHHHHHHHhcC--CccCccceeccccccCC-CCCEEEEEE Q lcl|NC_019421. 379 YISNIMFINTINKDTSLKRKEFVGKIFNDATGQTTVICALKKYFEELMSQG--IISEFNVDIDTELQATA-KADEFYWKW 455 (473) Q Consensus 379 ~i~v~R~~d~i~~~i~~~~~~~ig~~~N~~~~r~~i~~~i~~~l~~l~~~g--~i~~~~~~~D~~~~~~~-~~d~~~v~i 455 (473) +|++||++|+|+++|+....+++++ ||++.+|..++..|+.||++|++.| +|.+|++.||+++++.+ ....+++++ T Consensus 283 ~i~vrR~~~~i~~~i~~~~~~~v~e-~~~~~~~~~i~~~i~~~L~~l~~~g~~al~g~~v~~~~~nt~~~i~~G~~~~~i 361 (393) T protein:vir:10 283 FQQSVRTAQIIKETIGAGLAWAVDM-PLTPLRVKTMLEAINNKLRSWASGDDPRILGARVWVAEEITADIIKSGKFVIKY 361 (393) T ss_pred eeehhhHHHHHHHHHHHHHHHhccC-CCCHHHHHHHHHHHHHHHHHHHhccccccccceEEecCCCCHHHhhCCEEEEEE Confidence 9999999999999999888888885 8999999999999999999999855 89999999998855432 344789999 Q ss_pred EEEEeeeeeeEEEEEEeC Q lcl|NC_019421. 456 DAVKVDVMKKIYGTGYLG 473 (473) Q Consensus 456 ~v~p~~~~e~i~~t~~v~ 473 (473) .++|+.|+|+|.++++.- T Consensus 362 ~~~p~~p~e~I~~~~~~~ 379 (393) T protein:vir:10 362 DYHWIPSLESLGLEQRVN 379 (393) T ss_pred EEEecCCcceEEEEEEEc Confidence 999999999999999998 No 46 >protein:vir:489 Length: 498 # NCBI annotation: putative sheath protein # Family: family:all:369 # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543096;swissprot:trembl:q8w623;genbank:gi:18249908;uniprot:Q8W623;genbank:GeneID:929698 Probab=100.00 E-value=6.5e-48 Score=279.28 Aligned_cols=451 Identities=12% Similarity=0.088 Sum_probs=303.9 Q ss_pred CccccCCCCceec-CceeEEEecCCcceecccCceEEEEEEeeC---CCCCCceEEeeccHHHHHHHcCCCcCcHHHHHH Q lcl|NC_019421. 2 ATGTWNEKERKEI-PGFYNRFKTQAEKSTNTGLKGRLAMPIRAN---WGDVGKVVTIKNDLRQLKNLFGDDMNYSAFKLG 77 (473) Q Consensus 2 ~~g~~~~~~~~~~-PGvYie~~~~~~~~i~~~~~~~~~~~g~a~---~Gp~~~~v~i~s~~~~~~~~fG~~~~~~~~~~v 77 (473) |.-.|+..++.+| ||+|+||.++...+..+. ..+.++|... ..|.++|++|+| .++...+||.++ -+..++ T Consensus 1 M~IsF~~IP~~iRvP~~y~E~dns~A~~~~~~--qrvLiiGq~la~gt~~~~~~v~v~s-~~~a~~~fG~GS--~l~~M~ 75 (498) T protein:vir:48 1 MTISFSAVPSDTLVPLFYAEMDNSAANTAVTS--APALLIGHASNDAAIEVNSLVLMPS-ADYARQICGAGS--QLARMV 75 (498) T ss_pred CCccccccCcccccceEEEEEecCCCccccCC--cceEEEeecCccccccccceEEecC-HHHHHHhcCccc--HHHHHH Confidence 9999999999999 999999999887765554 4688888643 348899999999 567999999873 233445 Q ss_pred HHHHhcCCC-EEEEEecCCCcccceeeeecccccccccceEEEEecCccccceeEEEeeccCCcc-----ceeeeeecCC Q lcl|NC_019421. 78 KLALLGNVK-ELLLYRLVDGNQKKGTLTLKDTTENSAKDVIKLETKYPTARNFNVTIKSNLVDSD-----KKDFIFFENT 151 (473) Q Consensus 78 ~~~f~~g~~-~v~v~rv~~g~~~aat~~l~~~~~~~~~~~l~i~A~~~G~~~n~i~v~~~~~~~~-----~~~v~v~~~~ 151 (473) +.+.+++.. ++|++-+.+.+..+|+.++.-+......+.+.+ +-|.. .|.|.+...|.. .....+..+. T Consensus 76 ~a~~~~n~~~~l~~i~~~D~ag~aA~g~it~tg~at~~G~l~l---~Igg~--~v~v~V~~gdTaa~vA~al~aai~a~~ 150 (498) T protein:vir:48 76 DVYRQTDPFGELYVIAVPEARGAAATVRVTVTGEAEESGTLSL---YVGRS--SVQVPVVNGDDATAVATAIKEAVNGVI 150 (498) T ss_pred HHHHHhCCCceeEEEeeCCcccceeEEEEEecccccCCceEEE---EECCE--EEEEeecCCCCHHHHHHHHHHHHhCCC Confidence 555554443 688888875444444444333322233333333 11332 344433332211 0000111112 Q ss_pred ceeeEEEecccchhhhhhhhhcccccc-eeEeecc--cCCcccccc--ceeeeccCcccccchhhHHHHHHHHhhcccce Q lcl|NC_019421. 152 KQLFSSSIKGTIDEIVLEINSNLDNEY-VIATKVA--DSDTILANV--VNQALEGGNDGCTSITNESYLKALEEFERYSF 226 (473) Q Consensus 152 ~~~~~~~~~~~~~~~~~~~~~~~~s~~-v~~~~~~--~~~~~~~~~--~~~~l~gG~dg~~~~t~~d~~~~l~~le~~~~ 226 (473) ....+.......-.+.++.+....|+. ++..... ..+.+|+.. .-..++||+. +.|+.++|+++...+| T Consensus 151 ~lPVTA~~~~~~VtlTAr~kG~~GN~I~l~~~~~~~~~ge~~p~Glt~~itamsgGag------~PDia~aLaal~~~~~ 224 (498) T protein:vir:48 151 TLPFAASSDAGVVTLTARHKGLYGNELPVCLNYYGSGGGEILPAGLQVVTEAGTAGSG------APDLTAAVAAMGDEAF 224 (498) T ss_pred CcceEEEecCcEEEEEeeecccccccceeeeeeccCcccccccceeeEEEEcccCCcc------CcchHHHHHhhccCCc Confidence 222333333322233344444444443 2222111 123344433 3345666653 4488999999999999 Q ss_pred EEEEEcCCCcHHHHHHHHHHHHHHhh---CCCeEEEEEcCCCCccHHHHHHhhhccCCceEEEecCCceecCcccchHHH Q lcl|NC_019421. 227 DSFVLDGVADEALQETTKAWVAKNKE---LGKDILLFLGGKTEDNIKQINDKSKSFNDENIVNVGSSAYYENIKYTPSEV 303 (473) Q Consensus 227 ~~l~~p~~~~~~~~~~l~~~v~~~~~---~~~~~~av~~~~~~~t~~~~~~~~~~~n~~~i~~~~~~~~~~~~~~~~~~~ 303 (473) ++|++|.++..++ .++.+|+...+. .-+++-++.......|+++..++....|++++.+++.. .....+++.. T Consensus 225 ~~I~~p~~D~asl-~al~~~L~~~sgRw~~~~q~~g~~~~a~~gT~~~l~t~g~~~N~~~it~~~~~---~~~~~p~~~~ 300 (498) T protein:vir:48 225 DFIGLPFNDAASI-NMMMTEMNDSSGRWSYARQLYGHVYTAKLGTLSELVNAGDMHNQQHITLAGYE---KETQSPVDEL 300 (498) T ss_pred cEEEEeecCHHHH-HHHHHHHhhhhhhhhHHhhcCeEEEEeccCCHHHHHHhhhccCCceEEEEecC---CCCCChHHHH Confidence 9999998765555 678999986543 23344455555677799999999999999999876522 1223455566 Q ss_pred HHHHHHhhh---cCccccccceeccCcc---cccccCCHHHHHHHHhCCcEEEEEcCCEEEEEecccccccCC-CCCcch Q lcl|NC_019421. 304 AVYIAALSV---SKGITGSICNAKTIFE---EVEPRLSQSEVKECLKSGTLVLDFDDGDVIIVDDVNTFKKYV-DDKNEA 376 (473) Q Consensus 304 a~~vAG~~a---~~~~~~s~t~~~~~~~---~~~~~~t~~e~~~l~~~G~~~l~~~~~~~~i~~gi~T~~~~~-~~~~~~ 376 (473) ++..|++.| ..||.+++....++++ ....+|+..|++.|+.+|+.+++.+++.++|+|.|+||++.. ...|+. T Consensus 301 AAa~a~~aA~~l~~DPArPLqtl~L~Gi~~p~~~~r~~~~ern~LL~~Gist~~V~~G~V~I~R~ITTY~~n~~G~~D~s 380 (498) T protein:vir:48 301 VASRLAREAVFIRNDPARPTQTGELVGMLPAPKGKRFIMTEQQTLLSHGVATAYVEGGTLRIQRSVTTYKKNAYGVADNS 380 (498) T ss_pred HHHHHHHHHHhhhccccccccceeeeccccCCchhcCChHHHHHHHhcCcceEEEcCCeEEEEeeeeeeeecCCCCcchh Confidence 666777666 7999999988877755 456789999999999999999998999999999999998554 577899 Q ss_pred hhhhhhhHHHHHHHHHHHHHHhhcCCcccCCHH-----------HHHHHHHHHHHHHHHHHhcCCccCccceeccccccC Q lcl|NC_019421. 377 MGYISNIMFINTINKDTSLKRKEFVGKIFNDAT-----------GQTTVICALKKYFEELMSQGIISEFNVDIDTELQAT 445 (473) Q Consensus 377 ~~~i~v~R~~d~i~~~i~~~~~~~ig~~~N~~~-----------~r~~i~~~i~~~l~~l~~~g~i~~~~~~~D~~~~~~ 445 (473) |.+|++.|+++|+.+.+|..+....++.+...+ ....+|+++.+.+++|+.+|++|+++..+...+... T Consensus 381 yLDi~T~~tl~yvr~~~r~~i~~kfpR~KLa~dg~~~~~gq~IvTp~~ir~eli~~y~~le~~given~~~~~~~LiVer 460 (498) T protein:vir:48 381 YLDSETLHTSAYVLRKLKSVITSKYGRHKLANDGTRFGPGQAIVTPAVIKGELLATYRQMERAGIVENYDLFKQYLIVER 460 (498) T ss_pred hhhhhhHHHHHHHHHHHHHHhhhhcCCceecccCcccCCCCcccchHHHHHHHHHHHHhhhhhccccChhhhcceeEEEE Confidence 999999999999999999877665566655554 457799999999999999999999987665444445 Q ss_pred CCCCEEEEEEEEEEeeeeeeEEEEEEeC Q lcl|NC_019421. 446 AKADEFYWKWDAVKVDVMKKIYGTGYLG 473 (473) Q Consensus 446 ~~~d~~~v~i~v~p~~~~e~i~~t~~v~ 473 (473) +..|..++++.+ |.+-++.+++--..- T Consensus 461 d~~dpnRln~~~-p~d~vn~L~V~A~~~ 487 (498) T protein:vir:48 461 DADNPNRLNTLF-PPDYVNQLRVFAVVN 487 (498) T ss_pred CCCCCcEEEEEe-cccccCchhhhhhhh Confidence 555566777744 888888776542222 No 47 >protein:vir:4463 Length: 498 # NCBI annotation: Tail Sheath protein # Family: family:all:369 # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700386;genbank:gi:23505458;genbank:GeneID:955665 Probab=100.00 E-value=7.9e-48 Score=278.82 Aligned_cols=451 Identities=12% Similarity=0.102 Sum_probs=304.9 Q ss_pred CccccCCCCceec-CceeEEEecCCcceecccCceEEEEEEeeC---CCCCCceEEeeccHHHHHHHcCCCcCcHHHHHH Q lcl|NC_019421. 2 ATGTWNEKERKEI-PGFYNRFKTQAEKSTNTGLKGRLAMPIRAN---WGDVGKVVTIKNDLRQLKNLFGDDMNYSAFKLG 77 (473) Q Consensus 2 ~~g~~~~~~~~~~-PGvYie~~~~~~~~i~~~~~~~~~~~g~a~---~Gp~~~~v~i~s~~~~~~~~fG~~~~~~~~~~v 77 (473) |+-.|+..++.+| ||+|+||.++.. ..+.....+.++|... ..|.++|++|+| .++.+.+||.++ -+..++ T Consensus 1 M~IsF~~IP~~iRvP~~y~E~dns~A--~~~~~~q~vLiiGq~la~gs~~~~~~v~v~s-~~~a~~~fG~GS--ml~~M~ 75 (498) T protein:vir:44 1 MAISFNSIPSDTRVPLFYAEMDNSAA--NTARDSGASLLIGHASNDASIAVNSLVLVSS-VDYARQICGAGS--QLARMV 75 (498) T ss_pred CCCchhhcCcccccCeEEEEEeCCCC--CCCcCCcceEEEEecCcccccccceeEeecC-HHHHHHhcCccc--HHHHHH Confidence 9999999999999 999999988766 3455666788888753 338899999999 567999999883 233445 Q ss_pred HHHHhcCCC-EEEEEecCCCcccceeeeecccccccccceEEEEecCccccceeEEEeeccCCcc-----ceeeeeecCC Q lcl|NC_019421. 78 KLALLGNVK-ELLLYRLVDGNQKKGTLTLKDTTENSAKDVIKLETKYPTARNFNVTIKSNLVDSD-----KKDFIFFENT 151 (473) Q Consensus 78 ~~~f~~g~~-~v~v~rv~~g~~~aat~~l~~~~~~~~~~~l~i~A~~~G~~~n~i~v~~~~~~~~-----~~~v~v~~~~ 151 (473) +.+.+++.. ++|++-+.+.+..+|+.++.-+.....++.+.+ +-|.. .|.|.+...|.. .....+.-+. T Consensus 76 ~a~~~~n~~~~l~~i~~~D~aG~aAtg~it~tg~at~~G~l~l---~Igg~--~v~v~V~~gdTaa~vA~al~aaina~~ 150 (498) T protein:vir:44 76 GAYRKTDPFGELYVIAVPESTGAAATVALTVTGEATETGTVNV---YTGRT--RVQAPVTSGDDAAAVAVSIKDAVNANP 150 (498) T ss_pred HHHHHhCCCceeEEEecCCcccceeEEEEEeecccCCCcEEEE---EECCE--EEEEEecCCCCHHHHHHHHHHHHhCCC Confidence 555554443 688888876444555444443333334444433 11332 344443333211 0000111111 Q ss_pred ceeeEEEecccchhhhhhhhhcccccc-eeEeecc--cCCcccccc--ceeeeccCcccccchhhHHHHHHHHhhcccce Q lcl|NC_019421. 152 KQLFSSSIKGTIDEIVLEINSNLDNEY-VIATKVA--DSDTILANV--VNQALEGGNDGCTSITNESYLKALEEFERYSF 226 (473) Q Consensus 152 ~~~~~~~~~~~~~~~~~~~~~~~~s~~-v~~~~~~--~~~~~~~~~--~~~~l~gG~dg~~~~t~~d~~~~l~~le~~~~ 226 (473) ....+.......-.+.++.+....|+. ++..... ..+.+|+.. .-..++||+ .+.|+.++|+++...+| T Consensus 151 ~lPVTA~~~~~~vtlTAr~kG~~GN~I~l~~~~~~~~~ge~~p~Glt~titamsgGa------g~PDia~alaal~~~~~ 224 (498) T protein:vir:44 151 DLPFTATSEAGVVTLTARHKGLYGNEIPVTLNYYGFGGGEVLPAGVNITVASGVKGA------GAPALNDAVAAMGDEPF 224 (498) T ss_pred CCceEEeeccceEEEEEeccCcccCcceEEEeeccCccccccccceeEEEEcccCCc------cCchhHHHHHhhccCCc Confidence 222233332222233444444445543 2222111 123333332 234555554 24589999999999999 Q ss_pred EEEEEcCCCcHHHHHHHHHHHHHHhh---CCCeEEEEEcCCCCccHHHHHHhhhccCCceEEEecCCceecCcccchHHH Q lcl|NC_019421. 227 DSFVLDGVADEALQETTKAWVAKNKE---LGKDILLFLGGKTEDNIKQINDKSKSFNDENIVNVGSSAYYENIKYTPSEV 303 (473) Q Consensus 227 ~~l~~p~~~~~~~~~~l~~~v~~~~~---~~~~~~av~~~~~~~t~~~~~~~~~~~n~~~i~~~~~~~~~~~~~~~~~~~ 303 (473) ++|++|.++..++ .++.+|+...+. ..+++.++.......|+++..++....|++++.+.+.. .+...+++.. T Consensus 225 ~~i~~p~~D~asl-~al~~~L~~~sgRw~~~~q~~g~~~~a~~gT~a~l~t~g~~~N~~~it~~~~~---~~~~sp~~~~ 300 (498) T protein:vir:44 225 DYIGLPFNDTASV-NSMATEMNDSSGRWSYVRQLYGHVYTAKTGTLSELVAAGDQFNLQHITLAGYE---KDTQTPADEL 300 (498) T ss_pred cEEEEeecCHHHH-HHHHHHHhhhhcchHHHhhcCeEEEEeccCCHHHHHHhhhccCCceEEEEecC---CCCCCHHHHH Confidence 9999998665555 678999976543 23444555555678899999999999999999876421 1223355677 Q ss_pred HHHHHHhhh---cCccccccceeccCcc---cccccCCHHHHHHHHhCCcEEEEEcCCEEEEEecccccccCC-CCCcch Q lcl|NC_019421. 304 AVYIAALSV---SKGITGSICNAKTIFE---EVEPRLSQSEVKECLKSGTLVLDFDDGDVIIVDDVNTFKKYV-DDKNEA 376 (473) Q Consensus 304 a~~vAG~~a---~~~~~~s~t~~~~~~~---~~~~~~t~~e~~~l~~~G~~~l~~~~~~~~i~~gi~T~~~~~-~~~~~~ 376 (473) ++.+|++.| ..||.+++....++++ ....+|+..|++.|+.+|+.+++..+|.++|+|.|+||++.. ...|+. T Consensus 301 AAa~a~~aA~~l~~DPArPL~tl~L~Gi~~p~~~~r~~~~ern~LL~~Gist~~V~~G~V~I~R~ITTY~~n~~G~~D~s 380 (498) T protein:vir:44 301 AASRTARAAVFIRNDPARPTQTGELVDMLPAPKGKRFTTTEQQTLLSHGVATAYVESGVLRIQRDITTYRKNAYGVADNS 380 (498) T ss_pred HHHHHHHHHHHhhcccccccCceeecccccCCchhcCChHHHHHHHhcCcceEEEcCCeEEEEeeeeeeeecCCCCcchh Confidence 777777777 7999999998888755 356789999999999999999998888999999999998554 577899 Q ss_pred hhhhhhhHHHHHHHHHHHHHHhhcCCcccCCHH-----------HHHHHHHHHHHHHHHHHhcCCccCccceeccccccC Q lcl|NC_019421. 377 MGYISNIMFINTINKDTSLKRKEFVGKIFNDAT-----------GQTTVICALKKYFEELMSQGIISEFNVDIDTELQAT 445 (473) Q Consensus 377 ~~~i~v~R~~d~i~~~i~~~~~~~ig~~~N~~~-----------~r~~i~~~i~~~l~~l~~~g~i~~~~~~~D~~~~~~ 445 (473) |.+|++.|+++|+.+.+|..+....++.+...+ ....+|+++.+.+++|+.+|++|+++..+...+... T Consensus 381 yLDi~T~~tl~yvr~~~r~~i~~kfpR~KLa~d~~~~~~gq~IvTp~~ir~eli~~y~~le~~givEn~~~~~~~LiVer 460 (498) T protein:vir:44 381 YLDSETLHTSAYVLRRLKSVITSKYGRHKLANDGTRFGSGQAIVTPAVIRGELGSTYRQMEREGIVENFDLFQQHLIVER 460 (498) T ss_pred hhhhhhHHHHHHHHHHHHHHhhhhcCCcccccCCcccCCCcccccHHHHHHHHHHHHHhhhhhccccChhhhcceeEEEE Confidence 999999999999999999877665565543332 456799999999999999999999987665444445 Q ss_pred CCCCEEEEEEEEEEeeeeeeEEEEEEeC Q lcl|NC_019421. 446 AKADEFYWKWDAVKVDVMKKIYGTGYLG 473 (473) Q Consensus 446 ~~~d~~~v~i~v~p~~~~e~i~~t~~v~ 473 (473) +..|.-++++.+ |.+-++.+++--..- T Consensus 461 d~~dpnRln~~~-p~d~vn~L~V~A~~~ 487 (498) T protein:vir:44 461 NANDSNRLDVLF-PPDYVNQLRVFAVLN 487 (498) T ss_pred CCCCCcEEEEEe-cccccCchhhhhhhh Confidence 555566777744 888888876543222 No 48 >protein:vir:1996 Length: 495 # NCBI annotation: major tail subunit # Family: family:all:369 # MgeID: mge:320 # MgeName: Mu # Cross-refs: genbank:acc:NP_050643;genbank:gi:9633530;genbank:GeneID:2636269 Probab=100.00 E-value=3.9e-46 Score=269.55 Aligned_cols=449 Identities=13% Similarity=0.082 Sum_probs=303.6 Q ss_pred CCccccCCCCceec-CceeEEEecCCcceecccCceEEEEEEee---CCCCCCceEEeeccHHHHHHHcCCCcCcHHHHH Q lcl|NC_019421. 1 MATGTWNEKERKEI-PGFYNRFKTQAEKSTNTGLKGRLAMPIRA---NWGDVGKVVTIKNDLRQLKNLFGDDMNYSAFKL 76 (473) Q Consensus 1 m~~g~~~~~~~~~~-PGvYie~~~~~~~~i~~~~~~~~~~~g~a---~~Gp~~~~v~i~s~~~~~~~~fG~~~~~~~~~~ 76 (473) |..-.|+..++.+| ||+|+||.++...+-.......+.++|.. ...|.++|++|+| .++...+||.++ -+..+ T Consensus 1 m~~i~F~~IP~~iRvP~~y~E~dns~A~~g~~~~~q~vLiiGq~la~gs~~~~~pv~v~s-~~~a~~~fG~GS--~la~M 77 (495) T protein:vir:19 1 MSDISFNAIPSDVRVPLTYIEFDNSNAVSGTPAPRQRVLMFGQSGSKASAAPNVPVRIRS-GSQASAAFGQGS--MLALM 77 (495) T ss_pred CCCCchhhCCcccccCeEEEEEccCCCCcCCcCCCceEEEEEecCcccccccceeEEecC-HHHHHHhcCcCc--HHHHH Confidence 99999999999999 99999999998877677777788888874 3348899999999 467999999883 23344 Q ss_pred HHHHHhcCCC-EEEEEecCCCcccce--eeeecccccccccceEEEEecCccccceeEEEeeccCCccc-----eeeeee Q lcl|NC_019421. 77 GKLALLGNVK-ELLLYRLVDGNQKKG--TLTLKDTTENSAKDVIKLETKYPTARNFNVTIKSNLVDSDK-----KDFIFF 148 (473) Q Consensus 77 v~~~f~~g~~-~v~v~rv~~g~~~aa--t~~l~~~~~~~~~~~l~i~A~~~G~~~n~i~v~~~~~~~~~-----~~v~v~ 148 (473) ++.+.+++.. ++|++-+.+.+..+| +.++.++.. .++.+.+ +-|.. +|.|.+...|... ....+. T Consensus 78 ~~a~~~~n~~~~l~~i~~~D~aG~aA~g~it~tg~at--~~G~l~l---~I~g~--~v~v~V~~gdTaa~vA~al~aain 150 (495) T protein:vir:19 78 ADAFLNANRVAELWCIPQGNGTGNAAVGEISLSGTAG--ENGSLVT---YIAGQ--RLAVSVAAGATGAALADLLVARIK 150 (495) T ss_pred HHHHHHhCCcceEEEEeeCChhhceeEEEEEEeecCC--CCcEEEE---EECCE--EEEEEecCCCCHHHHHHHHHHHhc Confidence 5555544433 689888876444444 444444433 3444433 11333 3444433322110 000000 Q ss_pred cCCceeeEEEecc--------cchhhhhhhhhccccc-ceeEeecccCCcccccc--ceeeeccCcccccchhhHHHHHH Q lcl|NC_019421. 149 ENTKQLFSSSIKG--------TIDEIVLEINSNLDNE-YVIATKVADSDTILANV--VNQALEGGNDGCTSITNESYLKA 217 (473) Q Consensus 149 ~~~~~~~~~~~~~--------~~~~~~~~~~~~~~s~-~v~~~~~~~~~~~~~~~--~~~~l~gG~dg~~~~t~~d~~~~ 217 (473) -......+..... ..-.+.++.+.. .|. -++..... .+.+|... +-..++||+. +.|+.++ T Consensus 151 a~~~lPvTA~~~~~~~~~~a~~~VtlTAr~kG~-~n~idi~~~~~~-ge~~p~Glt~titamsgGag------~PDia~a 222 (495) T protein:vir:19 151 GQPDLPVTAEVRADSGDDDTHADVVLSAKFTGA-LSAVDVRWNYYA-GETTPYGIITAFKAASGKNG------NPDISAS 222 (495) T ss_pred CCccCceEEEeeccCCCCcCceeEEEEEeeccc-cccceeEEEeec-ccccccceeEEEEecCCCCC------CcchHHH Confidence 1111111111100 001112222221 122 12222222 23333332 3356667663 4479999 Q ss_pred HHhhcccceEEEEEcCCCcHHHHHHHHHHHHHHhhCCCeEEEEEcCCCCccHHHHHHhhhccCCceEEEecCCceecCcc Q lcl|NC_019421. 218 LEEFERYSFDSFVLDGVADEALQETTKAWVAKNKELGKDILLFLGGKTEDNIKQINDKSKSFNDENIVNVGSSAYYENIK 297 (473) Q Consensus 218 l~~le~~~~~~l~~p~~~~~~~~~~l~~~v~~~~~~~~~~~av~~~~~~~t~~~~~~~~~~~n~~~i~~~~~~~~~~~~~ 297 (473) |+++...+|++|++|.++..++ .++.+|++.+....+++-++.......|.++..++....|++++.+++- .+.. T Consensus 223 laal~~~~~~~I~~P~tD~asL-~al~~~l~~rw~~~~q~~g~~~~a~~gT~~~l~t~g~~~N~~~it~~~~----~gsp 297 (495) T protein:vir:19 223 IAGMGDLQYKYIVMPYTDEPNL-NLLRTELQERWGPVNQADGFAVTVLSGTYGDISTFGVSRNDHLISCMGI----AGAP 297 (495) T ss_pred HHHhccCCCcEEEEecCcHHHH-HHHHHHHHHhhhHHHhcCeEEEEeecCCHHHHHHhhhccCCceEEEEec----CCCC Confidence 9999999999999998766565 7899999988776666666655667789999999999999999988743 3444 Q ss_pred cchHHHHHHHHHhhh---cCccccccceeccCcc---cccccCCHHHHHHHHhCCcEEEEE-cCCEEEEEecccccccCC Q lcl|NC_019421. 298 YTPSEVAVYIAALSV---SKGITGSICNAKTIFE---EVEPRLSQSEVKECLKSGTLVLDF-DDGDVIIVDDVNTFKKYV 370 (473) Q Consensus 298 ~~~~~~a~~vAG~~a---~~~~~~s~t~~~~~~~---~~~~~~t~~e~~~l~~~G~~~l~~-~~~~~~i~~gi~T~~~~~ 370 (473) -++...++.+|+.+| ..||.+++....++++ ....+|+..|++.|+.+|+.+++. .++.++|+|.|+||++.. T Consensus 298 ~~~~~~AAA~aa~~A~~l~~DPArPL~tl~L~Gi~~p~~~~r~~~~ern~LL~~Gist~~V~~~G~V~I~R~ITTY~~n~ 377 (495) T protein:vir:19 298 EPSYLYAATLCAVASQALSIDPARPLQTLTLPGRMPPAVGDRFTWSERNALLFDGISTFNVNDGGEMQIERMITMYRTNK 377 (495) T ss_pred CcHHHHHHHHHHHHHHHhhcccccccCceeecceecCCccccCChHHHHHHHhCCcceEEECCCCeEEEEeeeeeeeecC Confidence 556666666666654 7899999998888765 456789999999999999999986 578999999999997654 Q ss_pred -CCCcchhhhhhhhHHHHHHHHHHHHHHhhcCCcccCCHH-----------HHHHHHHHHHHHHHHHHhcCCccCcccee Q lcl|NC_019421. 371 -DDKNEAMGYISNIMFINTINKDTSLKRKEFVGKIFNDAT-----------GQTTVICALKKYFEELMSQGIISEFNVDI 438 (473) Q Consensus 371 -~~~~~~~~~i~v~R~~d~i~~~i~~~~~~~ig~~~N~~~-----------~r~~i~~~i~~~l~~l~~~g~i~~~~~~~ 438 (473) ...|+.|.+|++.|+++|+++.+|..+....++.+...+ .-..+|+++.+.+++|+.+|++|+++..+ T Consensus 378 ~G~~D~syLDi~T~~tl~yvr~~~r~~i~~kfpR~KLa~d~~~~~~gq~IvTp~~ir~ell~~~~~le~~given~~~~~ 457 (495) T protein:vir:19 378 YGDSDPSYLNVNTIATLSYLRYSLRTRITQKFPNYKLASDGTRFATGQAVVTPSVIKTELLALFEEWENAGLVEDFDTFK 457 (495) T ss_pred CCCcchhhhhhHHHHHHHHHHHHHHHHHhhhcCCcccccCCCCCCCcccccChHHHHHHHHHHHHhhhhhccccChhhhc Confidence 577999999999999999999999877766666655544 34679999999999999999999998766 Q ss_pred ccccccCCCCCEEEEEEEEEEeeeeeeEEEEEEeC Q lcl|NC_019421. 439 DTELQATAKADEFYWKWDAVKVDVMKKIYGTGYLG 473 (473) Q Consensus 439 D~~~~~~~~~d~~~v~i~v~p~~~~e~i~~t~~v~ 473 (473) ...+...+..|.-++++.+ |.+-++.+++--..- T Consensus 458 ~~LiVerd~~dpnRln~~~-p~d~vn~L~V~A~~i 491 (495) T protein:vir:19 458 EELYVARNKDDKDRLDVLC-GPNLINQFRIFAAQV 491 (495) T ss_pred ceeEEEECCCCCcEEEEEe-cceeeCceeeeeeee Confidence 5444444444556666644 777777665432222 No 49 >protein:vir:10336 Length: 386 # NCBI annotation: ORF39 # Family: family:all:115 # MgeID: mge:182 # MgeName: VHML # Cross-refs: genbank:acc:NP_758931;genbank:gi:27311206;genbank:GeneID:956116 Probab=100.00 E-value=2.1e-46 Score=271.05 Aligned_cols=357 Identities=12% Similarity=0.035 Sum_probs=245.3 Q ss_pred CCccccCCCCceecCceeEEEecCCcceecccCceEEEEEEeeCCC-----CCCceEEeeccHHHHHHHcCCCcCcHHHH Q lcl|NC_019421. 1 MATGTWNEKERKEIPGFYNRFKTQAEKSTNTGLKGRLAMPIRANWG-----DVGKVVTIKNDLRQLKNLFGDDMNYSAFK 75 (473) Q Consensus 1 m~~g~~~~~~~~~~PGvYie~~~~~~~~i~~~~~~~~~~~g~a~~G-----p~~~~v~i~s~~~~~~~~fG~~~~~~~~~ 75 (473) |+ ....|||||+|+.++.+++..+++++.+|+|.+..+ |.++|++++|..+ +...||.+ ..+.. T Consensus 1 M~--------~~~~~Gv~v~ev~~~~~~i~~v~tav~~~vg~a~~a~~~~~~~~~pv~i~s~~~-~~~~~g~~--~tl~~ 69 (386) T protein:vir:10 1 MA--------EQYLHGAEVVEIDNGARPIRTAQSGVIGLVGTAPDADATAFPLNTPVLIAGSRR-EAAKLGAG--GTLPQ 69 (386) T ss_pred Cc--------cccCCCeEEEEcCCCcccccccCcceeEEEEecCCCCCcccccccceEecchHH-HHhhcCCC--cchhH Confidence 44 124599999999999999999999999999998754 8999999999654 66666644 25667 Q ss_pred HHHHHHhcCCCEEEEEecCCCcccceeeeecccccccccceEEEEecCccccceeEEEeeccCCccceeeeeecCCceee Q lcl|NC_019421. 76 LGKLALLGNVKELLLYRLVDGNQKKGTLTLKDTTENSAKDVIKLETKYPTARNFNVTIKSNLVDSDKKDFIFFENTKQLF 155 (473) Q Consensus 76 ~v~~~f~~g~~~v~v~rv~~g~~~aat~~l~~~~~~~~~~~l~i~A~~~G~~~n~i~v~~~~~~~~~~~v~v~~~~~~~~ 155 (473) ++..+|.+++..|+++++.++.....+.. ... |. ..+... T Consensus 70 a~~~~~~~gg~~~~vv~~~~~~~~~~t~~----------~~i-------g~----------~~~~t~------------- 109 (386) T protein:vir:10 70 AIDGIFDQTGAVVVVIRVDEGVDSAATQS----------NVI-------GK----------VDADTE------------- 109 (386) T ss_pred HHHHHhccCceeEEEeeccccccccccch----------hhh-------cc----------cccccc------------- Confidence 89999999999999999876543322100 000 00 000000 Q ss_pred EEEecccchhhhhhhhhcccccceeEeecccCCccccccceeeeccCcccccchhhHHHHHHHHhhcccceEEEEEcCCC Q lcl|NC_019421. 156 SSSIKGTIDEIVLEINSNLDNEYVIATKVADSDTILANVVNQALEGGNDGCTSITNESYLKALEEFERYSFDSFVLDGVA 235 (473) Q Consensus 156 ~~~~~~~~~~~~~~~~~~~~s~~v~~~~~~~~~~~~~~~~~~~l~gG~dg~~~~t~~d~~~~l~~le~~~~~~l~~p~~~ 235 (473) .......+. .....+ ....+++..|+.. T Consensus 110 ---~~tgl~~l~-----~~~~~~--------------------------------------------~~~p~i~~ap~~~ 137 (386) T protein:vir:10 110 ---QYTGILALL-----SAENTV--------------------------------------------KVQPRILIAPGFS 137 (386) T ss_pred ---hhhhhHHhh-----hhcccc--------------------------------------------ccccccccccccc Confidence 000000000 000000 0001122223222 Q ss_pred cHHHHHHHHHHHHHHhhCCCeEEEEEcCCCCccHHHHHHhhhccCCceEEEecCCceecCc---ccchHHHHHHHHHhhh Q lcl|NC_019421. 236 DEALQETTKAWVAKNKELGKDILLFLGGKTEDNIKQINDKSKSFNDENIVNVGSSAYYENI---KYTPSEVAVYIAALSV 312 (473) Q Consensus 236 ~~~~~~~l~~~v~~~~~~~~~~~av~~~~~~~t~~~~~~~~~~~n~~~i~~~~~~~~~~~~---~~~~~~~a~~vAG~~a 312 (473) + ...+.+.+..+.+. ...+.+. .+...+.+.+......+++.+.+.++||....+. ...-...++++||++| T Consensus 138 ~---~~~v~~~l~~~~~~-~~~~~~~-~~~~~~~~~a~~~~~~~~s~~~~~~~p~~~v~~~~~~~~~~~p~s~~~ag~~a 212 (386) T protein:vir:10 138 N---QKAVADQLVSVADT-AAWLCHS-GWSNTTDAAAITYRELFGSRRCEVVDPWYKVWDVETSAHIIQPPSARHAGVMA 212 (386) T ss_pred c---hhHHHHHHHHhhcc-eEEEEEe-CCCCCchHHHHHhhhcccccceEEecCceeeeccccccceeechHHHHHHHHH Confidence 1 11223333333221 1223333 3446677778888888999999999998765322 1111234789999999 Q ss_pred cCcccc----ccceeccC-cccccc------cCCHHHHHHHHhCCcEEEEEcCCEEEEEecccccccCCCCCcchhhhhh Q lcl|NC_019421. 313 SKGITG----SICNAKTI-FEEVEP------RLSQSEVKECLKSGTLVLDFDDGDVIIVDDVNTFKKYVDDKNEAMGYIS 381 (473) Q Consensus 313 ~~~~~~----s~t~~~~~-~~~~~~------~~t~~e~~~l~~~G~~~l~~~~~~~~i~~gi~T~~~~~~~~~~~~~~i~ 381 (473) ++|..+ |+.|+++. +..+.. ..+..|.+.|+++|++++.. + +..++||-+|+.+ |+.|++|+ T Consensus 213 ~~D~~~G~~~spaN~~l~gv~~~~~~~~~~~~~~~~~~~~l~~~gi~~~~~-~-~G~~~wG~rT~~~-----d~~~~~i~ 285 (386) T protein:vir:10 213 KVHNTLGFWWSNSNQEILGIDGLCRPVDFKLDDPTCRANLLNAKEVTTTIQ-Q-NGFRVWGDRTCSA-----DSKWAFKN 285 (386) T ss_pred HhhhcCCcEEccCCceeecccccceecccccccCcchhhhhhhcCcEEEEc-C-CCEEEEcccccCC-----Ccccceee Confidence 998665 55555543 223322 24577899999999998854 3 3467799998743 67899999 Q ss_pred hhHHHHHHHHHHHHHHhhcCCcccCCHHHHHHHHHHHHHHHHHHHhcCCccCccceeccccccCCCC--CEEEEEEEEEE Q lcl|NC_019421. 382 NIMFINTINKDTSLKRKEFVGKIFNDATGQTTVICALKKYFEELMSQGIISEFNVDIDTELQATAKA--DEFYWKWDAVK 459 (473) Q Consensus 382 v~R~~d~i~~~i~~~~~~~ig~~~N~~~~r~~i~~~i~~~l~~l~~~g~i~~~~~~~D~~~~~~~~~--d~~~v~i~v~p 459 (473) +||++++|+++|+..+.+|+++ ||++.+|.+|++.|+.||++||++|+|.+|++.||.+.+++++. ..+++.+.++| T Consensus 286 vrR~~~~i~~~~~~~~~~~v~e-~~~~~~~~~i~~~i~~~L~~l~~~g~l~g~~v~~d~~~nt~~~~~~G~~~~~i~~~p 364 (386) T protein:vir:10 286 VVITNDMIADSLVRNHLWAVDR-NITKTYVEDVTEGVNNYLRHLKNIGAIAGGECWVDPELNSPDQIQQGKVYFDYDFSA 364 (386) T ss_pred hhhHHHHHHHHHHHHHHHhccC-CCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEcccCCCHHHhhCCeEEEEEEEEe Confidence 9999999999999888888885 89999999999999999999999999999999999998887764 47888999999 Q ss_pred eeeeeeEEEEEEeC Q lcl|NC_019421. 460 VDVMKKIYGTGYLG 473 (473) Q Consensus 460 ~~~~e~i~~t~~v~ 473 (473) +.|+|||.++++.- T Consensus 365 ~~p~e~i~~~~~~~ 378 (386) T protein:vir:10 365 YAPAEHITFRSHMV 378 (386) T ss_pred cCCceeEEEEEEEe Confidence 99999999999888 No 50 >protein:vir:79798 Length: 717 # NCBI annotation: tail sheath subunit # Family: family:all:12069 # MgeID: mge:1874 # MgeName: 0305phi8-36 # Cross-refs: genbank:acc:YP_001429630;genbank:gi:156564120;genbank:GeneID:5525563 Probab=100.00 E-value=5.6e-43 Score=252.21 Aligned_cols=459 Identities=15% Similarity=0.159 Sum_probs=268.4 Q ss_pred CCccccCCCCceecCceeEEEecCC---cceecccCceEEEEEEeeCCCCCCceEEeeccHHHHHHHcCC-------CcC Q lcl|NC_019421. 1 MATGTWNEKERKEIPGFYNRFKTQA---EKSTNTGLKGRLAMPIRANWGDVGKVVTIKNDLRQLKNLFGD-------DMN 70 (473) Q Consensus 1 m~~g~~~~~~~~~~PGvYie~~~~~---~~~i~~~~~~~~~~~g~a~~Gp~~~~v~i~s~~~~~~~~fG~-------~~~ 70 (473) |++ |+ |---+||+-..+.... ........|.-+.++|++--||+++||+|+.+ .+..+||. .-+ T Consensus 1 ~~~--~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~ 74 (717) T protein:vir:79 1 MAG--FD--QYQAIPGHNARFKDGNLNLKSDPNPRETESVVLLGTATDGPVMQPVRVTPE--TAYNIFGKVAHENGVYNG 74 (717) T ss_pred CCc--hh--hhhcCCCceeeeecCceecCCCCCccccceEEEEeeccCCcccCceeeChh--HHHhhhhhhhhhcccccc Confidence 543 43 5556899999987543 23344566677888999999999999999863 36778884 223 Q ss_pred cHHHHHHHHHHhcCCCEEEEEecCCCc-----------------------------ccceeeeeccccccccc------- Q lcl|NC_019421. 71 YSAFKLGKLALLGNVKELLLYRLVDGN-----------------------------QKKGTLTLKDTTENSAK------- 114 (473) Q Consensus 71 ~~~~~~v~~~f~~g~~~v~v~rv~~g~-----------------------------~~aat~~l~~~~~~~~~------- 114 (473) ..++.++.++...|..+..+.|..+-. ..++|..|.......+. T Consensus 75 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 154 (717) T protein:vir:79 75 ATLLPKFEELWAAGNRDIRLMRTTGVNAVSSLLGTSYSKNSKEVAEDKLGGAQARGNVAATFTLPNGGIVEATFLLKARG 154 (717) T ss_pred hhhhHHHHHHHhcCCcceEEEEecchhHHHHHhhcccccchhhHHHHhhcccccccceEEEEEcCCCceeeeeeeeeecc Confidence 466777888887787788887764211 11222222210000000 Q ss_pred ---------------------------------------ceEE----EEecCc-cc-------------cce-------e Q lcl|NC_019421. 115 ---------------------------------------DVIK----LETKYP-TA-------------RNF-------N 130 (473) Q Consensus 115 ---------------------------------------~~l~----i~A~~~-G~-------------~~n-------~ 130 (473) ..++ ++.+.. |+ .|. . T Consensus 155 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 234 (717) T protein:vir:79 155 VIIPPNNYTLDVGTEEDMKAGTQPTFAQVLLNENVADMESEITVSYEFTYKDAQGETKTSEVLDNNTDKDGKPMIAKGAD 234 (717) T ss_pred eEeCCCcceEeccChhhhhcCCCchhhhhhhccchhhccceeEEEEEEEeecccCcchhhhhhcCCCCCCCceeEEeccc Confidence 0011 111100 00 000 0 Q ss_pred EEEee-------------------cc------------CCcc---------------------ceeeeeecCCc------ Q lcl|NC_019421. 131 VTIKS-------------------NL------------VDSD---------------------KKDFIFFENTK------ 152 (473) Q Consensus 131 i~v~~-------------------~~------------~~~~---------------------~~~v~v~~~~~------ 152 (473) ++|+- .. ..+. -|.+.-..++. T Consensus 235 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~n~~~ 314 (717) T protein:vir:79 235 VTIKLEHVALAGLKLYADGIEVVDAKAFTVAGDQLTIHSNSKMKLGASLEAQYAYNLVEVIQPVIELESIFGGGVYNDIM 314 (717) T ss_pred ceeehhhhhhhhhHHhhcchhhhhhhheeeecceEEEEecCCcccchhhHHHHHhhHHHhhccceEEeecccCceeeeee Confidence 11110 00 0000 01111111111 Q ss_pred ---------eeeEEEecccchhhhhh-------------------hhhcccccceeEeecccCCccc----cccceeeec Q lcl|NC_019421. 153 ---------QLFSSSIKGTIDEIVLE-------------------INSNLDNEYVIATKVADSDTIL----ANVVNQALE 200 (473) Q Consensus 153 ---------~~~~~~~~~~~~~~~~~-------------------~~~~~~s~~v~~~~~~~~~~~~----~~~~~~~l~ 200 (473) ...+++......+.... +.....++.............. .......+. T Consensus 315 ~~v~~~D~~~~~~~t~~~~~~g~~~~~pl~~ts~dy~~~~~~vdgI~~~~~~~V~~~g~~s~a~a~~~~g~~s~d~a~f~ 394 (717) T protein:vir:79 315 RKVESKDGAVTVTITKPESKRGMISEDPLVFKSGDYTNFKMLVDAINNHPFNNVVRARTKPEFEATFTSTLQAAADAKFS 394 (717) T ss_pred eEEecCCceEEEEEecccccCcceeccccccccCceeeeeeeecccccCchhheeeeecccccceeeeecccCchhhccC Confidence 11111100000000000 0000011111111111111111 112234567 Q ss_pred cCcccccchhhHHH---------------HHHHHhhcccceEEEEEcCCCc--------HHHHHHHHHHHHHHhhCCCeE Q lcl|NC_019421. 201 GGNDGCTSITNESY---------------LKALEEFERYSFDSFVLDGVAD--------EALQETTKAWVAKNKELGKDI 257 (473) Q Consensus 201 gG~dg~~~~t~~d~---------------~~~l~~le~~~~~~l~~p~~~~--------~~~~~~l~~~v~~~~~~~~~~ 257 (473) ||.|+......+-| ..++..|+.++++++++|+... ..++..+++||..++...+.+ T Consensus 395 Gg~dgl~~~~ee~Y~~lGgk~~d~g~lt~~aays~LE~~dVDlVil~ga~adtt~ga~~d~va~alad~caalSal~r~a 474 (717) T protein:vir:79 395 GGKDELSLDKEEMYKRLGGEKNEEGFVTKQGAYQYLENYEVDYVIPLGVHADTKLIGKYDDFAYQLALACAVMSHYNSVT 474 (717) T ss_pred CCccccccchhhhhccccccccccccccchhhhhhcCcceeEEEEecCccccccccchhhhHHHHHHHHHHHhhhccccc Confidence 77777544333323 1467888999999999987532 245778899998887655566 Q ss_pred EEEEcC--CCCccHHHHHHhhhcc------------------CC--------ceE-EEecCCceecCc--ccchHHHHHH Q lcl|NC_019421. 258 LLFLGG--KTEDNIKQINDKSKSF------------------ND--------ENI-VNVGSSAYYENI--KYTPSEVAVY 306 (473) Q Consensus 258 ~av~~~--~~~~t~~~~~~~~~~~------------------n~--------~~i-~~~~~~~~~~~~--~~~~~~~a~~ 306 (473) +.+++. +.....+....+...+ +. .+. ++.+++....+. .......||+ T Consensus 475 i~VI~l~sp~D~~~AtVe~~~~kLs~~Aaa~~~~d~~~a~a~~~~~~~idis~y~~vv~~~~~iv~~~~~~~~~~p~AG~ 554 (717) T protein:vir:79 475 IGIIPTTTPSDISLAGVEEHVKKLENYANEFYMRDRFGNIIFDADRNKIDLGQFIEVVAGPDFIVRNTRLGQMASTPDAS 554 (717) T ss_pred eeeeccccccccchhhHHHHHHHHHhhhhhhhhhcchhccccccccccccccceeeeeecceeEEEcCCCceeecCHHHH Confidence 666553 2222222221111111 00 011 122233222211 1122344799 Q ss_pred HHHhhhcCccccccceeccC-cccccccCCHHHHHHHHhCCcEEEEEcC-CEEEEEecccccccCCCCCcchhhhhhhhH Q lcl|NC_019421. 307 IAALSVSKGITGSICNAKTI-FEEVEPRLSQSEVKECLKSGTLVLDFDD-GDVIIVDDVNTFKKYVDDKNEAMGYISNIM 384 (473) Q Consensus 307 vAG~~a~~~~~~s~t~~~~~-~~~~~~~~t~~e~~~l~~~G~~~l~~~~-~~~~i~~gi~T~~~~~~~~~~~~~~i~v~R 384 (473) +||+.++.++++|++|+++. ..++...+++.|++.|+++|++|+++.. .++++..++|+. ..+++|++|++|| T Consensus 555 vAGldA~rGVwkSPANk~I~GVvgLa~~lT~sE~d~Ln~aGIntIr~~~GrGirVWGaRTta-----sd~sdWryInVRR 629 (717) T protein:vir:79 555 YIGMVSQLKTQSAPTNKPLPSVTALRYTYSANQLNRLTKARFATFKYKQDGSIGVVDAPTSA-----HAGSDYTRLSTAR 629 (717) T ss_pred HHHHHhcCCcccccccceecccccCcccCCHHHHHHHhhCCeEEEEEeCCceEEEEeeeecC-----CCCcccceeehhh Confidence 99999999999999999985 5678889999999999999999998764 456665555442 2245799999999 Q ss_pred HHHHHHHHHHHHHhhcCCcccCCHHHHHHHHHHHHHHHHHHHhcCCccCccceeccccccC-CCCCEEEEEEEEEEeeee Q lcl|NC_019421. 385 FINTINKDTSLKRKEFVGKIFNDATGQTTVICALKKYFEELMSQGIISEFNVDIDTELQAT-AKADEFYWKWDAVKVDVM 463 (473) Q Consensus 385 ~~d~i~~~i~~~~~~~ig~~~N~~~~r~~i~~~i~~~l~~l~~~g~i~~~~~~~D~~~~~~-~~~d~~~v~i~v~p~~~~ 463 (473) ++|+|+++|+...++||++ ||++.+|..|++.|++||++||++|+|.+|++++ ..++. .....++|++.++|++|| T Consensus 630 l~D~Ie~sIr~al~~yVgE-PNd~~tr~~Ik~sI~afL~~L~r~GAI~Gykvdv--tnT~~di~~G~l~V~I~vaPv~Pa 706 (717) T protein:vir:79 630 IVKEAVNAVREVADPFIGE-PNDTGNRNALTAAVDKRLSKMIENKALLGFDFRL--VVTPQQELLGEGSIELSLEAPNEL 706 (717) T ss_pred hHHHHHHHHHHHHHHhccc-cCCHHHHHHHHHHHHHHHHHHHhcCceecceeeE--ecChhHhhCCEEEEEEEEEecCcc Confidence 9999999999888888886 8999999999999999999999999999998654 33222 334479999999999999 Q ss_pred eeEEEEEEeC Q lcl|NC_019421. 464 KKIYGTGYLG 473 (473) Q Consensus 464 e~i~~t~~v~ 473 (473) |||+++++|. T Consensus 707 EfI~ititIT 716 (717) T protein:vir:79 707 RRLTTIVSLS 716 (717) T ss_pred cEEEEEEEEe Confidence 9999999999 No 51 >protein:vir:103168 Length: 641 # NCBI annotation: gp122 # Family: family:all:661 # MgeID: mge:1583 # MgeName: Syn9 # Cross-refs: genbank:acc:YP_717789;genbank:gi:113200626;genbank:GeneID:4239177 Probab=100.00 E-value=7.6e-43 Score=251.50 Aligned_cols=357 Identities=15% Similarity=0.092 Sum_probs=216.2 Q ss_pred CCCCceecCceeEEEecCCcceecccCceEEEEEEeeCCCCCCceEEeeccHHHHHHHcCC-CcCcHHHHHHHHHHhcCC Q lcl|NC_019421. 7 NEKERKEIPGFYNRFKTQAEKSTNTGLKGRLAMPIRANWGDVGKVVTIKNDLRQLKNLFGD-DMNYSAFKLGKLALLGNV 85 (473) Q Consensus 7 ~~~~~~~~PGvYie~~~~~~~~i~~~~~~~~~~~g~a~~Gp~~~~v~i~s~~~~~~~~fG~-~~~~~~~~~v~~~f~~g~ 85 (473) -+..+-.-|||||||++++ ++|++++|++++|+|.++|||+|+|++|+|+ .+|++.||. .....+.++++.||+||| T Consensus 1 ~~m~~~~sPGVyv~E~~~~-~~i~~v~tsvaafvG~~~~GP~~~p~~v~s~-~d~~~~FG~~~~~~~l~~av~~fF~ngG 78 (641) T protein:vir:10 1 MSVSNQLSPGVVIQERDLT-AVTTPIGLNVGVLAAPFTKGPVEEIFEVSTE-RDLASVFGEPNDYNYEYWFTASQFLSYG 78 (641) T ss_pred CCCccccCCceEEEEecCC-CcccccCCccceEEecccCCCCCccEEecCH-HHHHHHcCCcCCCcchHHHHHHHHHhcC Confidence 1223334599999999987 5899999999999999999999999999995 557777775 455567789999999999 Q ss_pred CEEEEEecCCCcccceeeeecc-----------cccccccceEEEEecCccccceeEEEeeccCCccc------------ Q lcl|NC_019421. 86 KELLLYRLVDGNQKKGTLTLKD-----------TTENSAKDVIKLETKYPTARNFNVTIKSNLVDSDK------------ 142 (473) Q Consensus 86 ~~v~v~rv~~g~~~aat~~l~~-----------~~~~~~~~~l~i~A~~~G~~~n~i~v~~~~~~~~~------------ 142 (473) ++|||+|+.++....++..... .........+++.|++||.|||.++|.+....+.. T Consensus 79 ~~~~vvRv~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~A~~pG~~gn~l~v~i~~~~~~~~~~~~~~~~~~~ 158 (641) T protein:vir:10 79 GVLKAIRLNAASLKNSVDSGTAPLIKNLQEYETTYESSNSNTFKFASRDAGALGNSVGIFITDAGPDQIAVLPAPGTGNE 158 (641) T ss_pred CEEEEEEecCccccccccccchhhccccccccccccCcCccccEEEeccCCCcCCceEEEEEcCCCcceeeeeccccccc Confidence 9999999987665555433221 12234556789999999999998877542110000 Q ss_pred ---------------------------------------------------------------eeeeeecCCc---eeeE Q lcl|NC_019421. 143 ---------------------------------------------------------------KDFIFFENTK---QLFS 156 (473) Q Consensus 143 ---------------------------------------------------------------~~v~v~~~~~---~~~~ 156 (473) ..+....++. .... T Consensus 159 ~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~~~~~~~~~~a~~~~~~i~~~~~g~~g~~~~~ 238 (641) T protein:vir:10 159 WEFVADEAVTAASGASAKVYRYSIRLTLTNVVGTFVPGGATTISISGSDESVDVLAWDAGNKYLEIALPAGGVTGIFADA 238 (641) T ss_pred ceeccceeeeeccCcccccccccccccccceeeecccCCcceeEecccccccccccccCCcceeeeeecCCcceeeeeee Confidence 0000000000 0000 Q ss_pred EEecc----------cch-hh-----------------------------------------------hh------hhh- Q lcl|NC_019421. 157 SSIKG----------TID-EI-----------------------------------------------VL------EIN- 171 (473) Q Consensus 157 ~~~~~----------~~~-~~-----------------------------------------------~~------~~~- 171 (473) ..... ..+ .. .. ... T Consensus 239 ~~~t~gt~~~t~a~~g~~~~~~~~~~~~~ia~aat~ag~~g~~~~v~~~~v~d~~~a~~~~~g~~~~~va~~~gts~~a~ 318 (641) T protein:vir:10 239 QVVTQGTNTAAIASSGIERRLYIGKDSGSINFAATDAVVDTNATSATISSVRNEYAEREYLPGSKWVNVAARPGTSLYAN 318 (641) T ss_pred eeccCCccceeeecccchhhhhhccccccceeeeecccccccceeeEeeeeeeeecccccccccccccccccchhhhhhh Confidence 00000 000 00 00 000 Q ss_pred ---h--------------------------------------------------cccccceeEeeccc----------CC Q lcl|NC_019421. 172 ---S--------------------------------------------------NLDNEYVIATKVAD----------SD 188 (473) Q Consensus 172 ---~--------------------------------------------------~~~s~~v~~~~~~~----------~~ 188 (473) . +..|++|....... .. T Consensus 319 ~~g~~~D~~~~lv~d~~~~~~g~~g~v~e~~~~~s~~~~~~~~~~~~~~~~~~~~~~s~~v~~~~~~~~~~~~~~~~~~~ 398 (641) T protein:vir:10 319 SVGGVNDELHVLVIDVDGKITGNPGSVLERFIGVSKASDAKTSIGEVNYYKEVIKQQSAYVYWGSHETAPFLGTAANAAA 398 (641) T ss_pred hcCCcccceEEEEEeecceeeccccceeeeeecccccCCcccccccceeeeeeeccccceEEEecccccccccccccccc Confidence 0 00001110000000 00 Q ss_pred -----cc---------------------------ccccceeeeccCcccccch-----hhHHHHHHHHhhcc---cceEE Q lcl|NC_019421. 189 -----TI---------------------------LANVVNQALEGGNDGCTSI-----TNESYLKALEEFER---YSFDS 228 (473) Q Consensus 189 -----~~---------------------------~~~~~~~~l~gG~dg~~~~-----t~~d~~~~l~~le~---~~~~~ 228 (473) .. ........|.+|.|+.... ...+...++++|+. .++++ T Consensus 399 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~gG~d~~~~~~~~~~~~~~~~tg~~~~~~~e~~~i~~ 478 (641) T protein:vir:10 399 GDWGASALNRRYNLLRSTAGTTSFPAGAVTVGSKNNATHYYRLANGADYSASGALYNLSNVDIATAYELIEDPESQVIDY 478 (641) T ss_pred cccccccccccccccccccccccccccccccCCCCcceeEEEeecCcccccccccccccchhHHHHHHHhhhhhhhccce Confidence 00 0000124577777764322 22344566666644 45788 Q ss_pred EEEcCC-----CcHHHHHHHHHHHHHHhhCCCeEEEEEcCCCCcc---------HHHHHHhhh-ccCCceEEEecCCcee Q lcl|NC_019421. 229 FVLDGV-----ADEALQETTKAWVAKNKELGKDILLFLGGKTEDN---------IKQINDKSK-SFNDENIVNVGSSAYY 293 (473) Q Consensus 229 l~~p~~-----~~~~~~~~l~~~v~~~~~~~~~~~av~~~~~~~t---------~~~~~~~~~-~~n~~~i~~~~~~~~~ 293 (473) +++|.. +..+++..+.+||+++ +.|+++++.+.+.+ .+.+..+.. ..++.+..+++||.+. T Consensus 479 l~~~~~~~~~~~~~~v~~~~i~~ce~~----~d~~ailD~p~~~~~~~~~~~~~~~~~~~~~~~~~~s~yaa~y~P~~~v 554 (641) T protein:vir:10 479 VLSGPAGADEAAAIAKATTITTIVESR----KDCMAFLSPLRSDVIGVSNTTTVTENLVNYFNQLPSSNYVVFDSGYKYI 554 (641) T ss_pred eeecCCCCCcchhHHHHHHHHHHHHhc----CCEEEEEcCCcccccCCCchhhHHHHHHHHHhhcCCCceEEEEeceeEe Confidence 887653 2245667777777655 45899999875432 234444433 3577888888888765 Q ss_pred cC---cccchHHHHHHHHHhhhcCccccccceeccCc--------ccccccCCHHHHHHHHhCCcEEEEEcCCEEEEEec Q lcl|NC_019421. 294 EN---IKYTPSEVAVYIAALSVSKGITGSICNAKTIF--------EEVEPRLSQSEVKECLKSGTLVLDFDDGDVIIVDD 362 (473) Q Consensus 294 ~~---~~~~~~~~a~~vAG~~a~~~~~~s~t~~~~~~--------~~~~~~~t~~e~~~l~~~G~~~l~~~~~~~~i~~g 362 (473) .+ ....-...|+++||++|++|..|++|++|.+. +++...+++.|++.|+++||+||+..+++..|-.- T Consensus 555 ~dp~~~~~~~vPpsG~iAGv~ArtD~~rGvwkAPAn~~~~~i~gv~~l~~~~~~~e~~~Lnp~gIN~ir~fpg~G~v~~~ 634 (641) T protein:vir:10 555 YDKYNDVYRYVPCNGDIAGLILETGLEEEPWFSPAGFQRGVLRNAVKLAYSPNKTQRDRLYANRINPVVSFPGHAMINNN 634 (641) T ss_pred ecccCCceeEecCCHHHHHHHHhhhccCCceECcCCcccceeeeeeeeeEecChhHHhhhhhcccceEEecCCceeecce Confidence 32 12222233899999999999999999998763 35677899999999999999999998887776444 Q ss_pred ccccccC Q lcl|NC_019421. 363 VNTFKKY 369 (473) Q Consensus 363 i~T~~~~ 369 (473) |---++. T Consensus 635 ~~~~~~~ 641 (641) T protein:vir:10 635 IAFHTKL 641 (641) T ss_pred eeeeecC Confidence 4322221 No 52 >protein:vir:5833 Length: 742 # NCBI annotation: similar to tail sheath protein # Family: family:all:661 # MgeID: mge:123 # MgeName: RM 378 # Cross-refs: genbank:acc:NP_835619;genbank:gi:30044022 Probab=100.00 E-value=6.5e-41 Score=240.91 Aligned_cols=435 Identities=10% Similarity=0.071 Sum_probs=230.3 Q ss_pred CCccc-cCCCCceecCceeEEEecCCcceecccCceEEEEEEeeCC---CC--CCc-eEEeeccHHHH-HHHcCCCcCcH Q lcl|NC_019421. 1 MATGT-WNEKERKEIPGFYNRFKTQAEKSTNTGLKGRLAMPIRANW---GD--VGK-VVTIKNDLRQL-KNLFGDDMNYS 72 (473) Q Consensus 1 m~~g~-~~~~~~~~~PGvYie~~~~~~~~i~~~~~~~~~~~g~a~~---Gp--~~~-~v~i~s~~~~~-~~~fG~~~~~~ 72 (473) ...|| +...-| +|..+...+-++.+. +..-|-...+| |- +|+ |.+..+-.... +...|...-.+ T Consensus 264 ~~~~~~~~~~~~-------~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~g~~~~n~~~~~~~~~~~~~~~~~~~~s~~~~ 335 (742) T protein:vir:58 264 VSANTEYIRFRQ-------VNLNPESPNYIERVI-GNMTFEFDGERIVTGGEYPNQVPFLRVVVSQDIKQNVAGVEKWVP 335 (742) T ss_pred CCCCccceeeee-------eecCCCCcceeeecc-cceeeeeccceeeecccccccccceeeEeccccCcCccceeEEEe Confidence 22222 110000 011122222222222 12223333332 21 133 33333311100 11111110000 Q ss_pred HHHHHHHHHhcCCCEEEEEecCCCcccceeeeecccccccccceEEEEecCccccceeEEEeeccCCccceeeee----- Q lcl|NC_019421. 73 AFKLGKLALLGNVKELLLYRLVDGNQKKGTLTLKDTTENSAKDVIKLETKYPTARNFNVTIKSNLVDSDKKDFIF----- 147 (473) Q Consensus 73 ~~~~v~~~f~~g~~~v~v~rv~~g~~~aat~~l~~~~~~~~~~~l~i~A~~~G~~~n~i~v~~~~~~~~~~~v~v----- 147 (473) .-.+.++.-+.-+.++...... +.-+.+.+.........+++. +...|....+..+. ...|.+.. T Consensus 336 ---~~~~~~~~v~d~~~~~~~~~~v----~~~~t~~~~~pp~~~~~~e~v-~~ngG~~f~v~s~~--~~g~~i~~~~as~ 405 (742) T protein:vir:58 336 ---VGFEGIYSVGDFTVIVNELTNV----SIPVTDSAIIPPMRFTRIEQI-TLSGGASFSVISNQ--PYGFNIQDSRHSY 405 (742) T ss_pred ---ccccccccccceeeeccccccc----eeeccccccCCccccccccee-ecccCcceEEEEec--ccCcceeccCcce Confidence 0011111222112221111000 000000000000011111111 11111112221110 01111111 Q ss_pred ---ecCCceeeEEEecccchhhhhhhhhcccccceeEeecccCCccccccceeeeccCccccc----------------c Q lcl|NC_019421. 148 ---FENTKQLFSSSIKGTIDEIVLEINSNLDNEYVIATKVADSDTILANVVNQALEGGNDGCT----------------S 208 (473) Q Consensus 148 ---~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~v~~~~~~~~~~~~~~~~~~~l~gG~dg~~----------------~ 208 (473) ..+.......... ..+.. .....+ .........+.......+.||.++.. + T Consensus 406 ~~s~ln~~~~V~Gt~a-a~~~~------d~~t~~---~v~s~~~alp~~a~sv~laGG~dg~v~v~~~~~D~iG~~~~~d 475 (742) T protein:vir:58 406 WLSPFKDDELIIGTEL-VLPAL------DVSTEF---GVSSWEEALPEFSFLMPFQGGSDGYIRVDENEPDTIGRVKITP 475 (742) T ss_pred EEeccCCceEEEeehh-hcccc------ccchhe---eccccccccceeeEEEeecCCccccccccCCCccccccccccc Confidence 1111111000000 00000 000000 00011111112222333445554421 1 Q ss_pred hhhHHHHHHHHhh-cccceEEEEEcCCCcHHHHHHHHHHHHHHhhCCCeEEEEEcCCCC-ccHHHHHHhhhccCCceEEE Q lcl|NC_019421. 209 ITNESYLKALEEF-ERYSFDSFVLDGVADEALQETTKAWVAKNKELGKDILLFLGGKTE-DNIKQINDKSKSFNDENIVN 286 (473) Q Consensus 209 ~t~~d~~~~l~~l-e~~~~~~l~~p~~~~~~~~~~l~~~v~~~~~~~~~~~av~~~~~~-~t~~~~~~~~~~~n~~~i~~ 286 (473) ...++++ +|.+| +..++++|++|+.++...++.+.++|+.++++ ..+++..+.. .+.+.+.++...+++.+.++ T Consensus 476 ~~~adrT-GL~ALlev~eVtILiAPG~t~~~v~aav~A~la~a~~R---l~vL~D~P~~~tt~~~A~a~r~~~nSsraal 551 (742) T protein:vir:58 476 ALLANYE-RLLPLLTEDQFDLVLTPYLTFADHAGTVNAFINRAENR---FLYLFDIAGDDDTENLAISLAGYINSSFATT 551 (742) T ss_pred ccccchh-HHHHhhhcCCCcEEEEcCCCchHHHHHHHHHHHhhcCC---eEEEEecCCCCchHHHHHHHHhccCCceEEE Confidence 1122333 45555 44568999999999888899999999877542 2334444433 44577788888999999999 Q ss_pred ecCCceecC-cccchHHHHHHHHHhhhcCccccccceeccCc-ccccccCCHHHHHHHHhCCcEEEEEcCCEEEEEeccc Q lcl|NC_019421. 287 VGSSAYYEN-IKYTPSEVAVYIAALSVSKGITGSICNAKTIF-EEVEPRLSQSEVKECLKSGTLVLDFDDGDVIIVDDVN 364 (473) Q Consensus 287 ~~~~~~~~~-~~~~~~~~a~~vAG~~a~~~~~~s~t~~~~~~-~~~~~~~t~~e~~~l~~~G~~~l~~~~~~~~i~~gi~ 364 (473) +.||..... ........++++||++|++|..+++|..+.+- .......++.|++.|+++|++++++.++ ..++||-+ T Consensus 552 y~PwVkv~d~~~~r~vPpSgaIAGL~ARtD~erGvw~SPANrgii~~~~~s~se~d~LN~~GINtIrsfG~-G~rlWGnR 630 (742) T protein:vir:58 552 FFPWVRRLTNKGMRTVPASLAAYRSIRTTDPETGLAPVGARRGVVTGEPVRQVDWEDLYNNRINPIVRVGN-DVLLFGQK 630 (742) T ss_pred EeceeeeccCCcceeechHHHHHHHHHHhccCCceEecCCcceeeeccccchhhHHHHhhCCceEEEECCC-cEEEEcce Confidence 999876533 22222234799999999999988888777652 2223356789999999999999988755 46679999 Q ss_pred ccccCCCCCcchhhhhhhhHHHHHHHHHHHHHHhhcCCcccCCHHHHHHHHHHHHHHHHHHHhcCCccCccceecccccc Q lcl|NC_019421. 365 TFKKYVDDKNEAMGYISNIMFINTINKDTSLKRKEFVGKIFNDATGQTTVICALKKYFEELMSQGIISEFNVDIDTELQA 444 (473) Q Consensus 365 T~~~~~~~~~~~~~~i~v~R~~d~i~~~i~~~~~~~ig~~~N~~~~r~~i~~~i~~~l~~l~~~g~i~~~~~~~D~~~~~ 444 (473) |+. ..|+.|++|++||++|+|+++|+....+++++ |||+.+|.+|+..|++||+.||++|+|.+|++.||+++++ T Consensus 631 Tla----ssDs~wryInVRRlfd~Ie~SI~~a~q~~VfE-PNd~~L~~sIk~sInafL~~L~aqGALlGfrV~lDetNTp 705 (742) T protein:vir:58 631 TML----NVNSALNRINVRRLLIVMRNRISQILSSYLFE-NNTSENRLRAEALVRQYLESLRLRGAVTDYEVAIDSVTTP 705 (742) T ss_pred ecC----CCCcccceEeehhhHHHHHHHHHHHHHHhccC-CCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEcCCCCH Confidence 874 24678999999999999999999877666665 8999999999999999999999999999999999976443 Q ss_pred CC-CCCEEEEEEEEEEeeeeeeEEEEEEeC Q lcl|NC_019421. 445 TA-KADEFYWKWDAVKVDVMKKIYGTGYLG 473 (473) Q Consensus 445 ~~-~~d~~~v~i~v~p~~~~e~i~~t~~v~ 473 (473) .+ +...+++++.++|+.|||||.+++.+. T Consensus 706 eDI~~Gklvv~I~vAP~~PAEfI~lrf~it 735 (742) T protein:vir:58 706 TDIDNNTLRARVTVQPARSIEYIDITFVIT 735 (742) T ss_pred HHhhCCEEEEEEEEEccCCcceEEEEEEEE Confidence 22 344789999999999999999999887 No 53 >protein:vir:101326 Length: 529 # NCBI annotation: gp22 # Family: family:all:9453 # MgeID: mge:1512 # MgeName: P1 # Cross-refs: genbank:acc:YP_006525;genbank:gi:46401679;genbank:GeneID:2777386 Probab=99.92 E-value=9.3e-25 Score=152.33 Aligned_cols=452 Identities=13% Similarity=0.114 Sum_probs=269.2 Q ss_pred CCccccCCCCcee--cCceeEEEecCCccee--cccCceEEEEEEeeCCCCCCceEEeeccHHHHHHHcCCCc------C Q lcl|NC_019421. 1 MATGTWNEKERKE--IPGFYNRFKTQAEKST--NTGLKGRLAMPIRANWGDVGKVVTIKNDLRQLKNLFGDDM------N 70 (473) Q Consensus 1 m~~g~~~~~~~~~--~PGvYie~~~~~~~~i--~~~~~~~~~~~g~a~~Gp~~~~v~i~s~~~~~~~~fG~~~------~ 70 (473) |- .|. .++-+ -.||=++.++.....- .+...++.+++|.+.||++++|++|++ +..++++|... . T Consensus 1 ~~--~ys-i~q~ig~aSGvav~pi~~d~t~~~~~g~g~~v~a~Vgif~RG~i~k~~~Vt~--~n~~~~LGep~~~~~ga~ 75 (529) T protein:vir:10 1 MS--QYS-IQQSLGNASGVAVSPINADATLSTGVALNSSLWAGIGVFARGKPFTVLAVTE--SNYEDVLGEPLKPSSGSQ 75 (529) T ss_pred CC--cee-hhhhhhhhcccccCCcCcccccchheecCceEEEEEEEeecCCCcceEEEch--hHHHHHhccccCCCcchh Confidence 32 232 12211 1788887665432221 234778999999999999999999986 23667777422 2 Q ss_pred cHHHHHHHHHHhcCCCEEEEEecCCCcccceeeeecccccccccceEEEEecCccccceeEEEeeccCCc---cceeeee Q lcl|NC_019421. 71 YSAFKLGKLALLGNVKELLLYRLVDGNQKKGTLTLKDTTENSAKDVIKLETKYPTARNFNVTIKSNLVDS---DKKDFIF 147 (473) Q Consensus 71 ~~~~~~v~~~f~~g~~~v~v~rv~~g~~~aat~~l~~~~~~~~~~~l~i~A~~~G~~~n~i~v~~~~~~~---~~~~v~v 147 (473) ...+.++.+++.++ .|||+|+....++.-...++ .....+...+-+.+...+.||+.+++-....|+ ..+.+.+ T Consensus 76 ~E~~~h~~eA~~~~--s~yVVRvv~~dak~p~i~~~-~~~~~~~s~~~~s~~~~l~~G~~~~iy~~Dgd~~~s~~~~l~i 152 (529) T protein:vir:10 76 FEPIRHVYEAIQQT--SGYVVRAVPDDAKFPIIMFD-ESGEPAYSALPYGSEIELDSGEAFAIYVDDGDPCISPTRELTI 152 (529) T ss_pred hhhHhhhhhhhcCC--ceEEEEEcccccCCceEEec-CCccchhhcccccccccccccceEEEEEecCcCccCCceEEEE Confidence 23334566666444 59999998766665555554 333445555656666556677666665444332 2333333 Q ss_pred ec----------------------CCceeeEEEecccchhhhh-----hhhh--cccccceeEeecccCC--ccccccce Q lcl|NC_019421. 148 FE----------------------NTKQLFSSSIKGTIDEIVL-----EINS--NLDNEYVIATKVADSD--TILANVVN 196 (473) Q Consensus 148 ~~----------------------~~~~~~~~~~~~~~~~~~~-----~~~~--~~~s~~v~~~~~~~~~--~~~~~~~~ 196 (473) .. ....++++++....+...- .+.. ...|.+.++....+-. ..+..... T Consensus 153 ~~~~ads~g~e~~~l~~~~~~~~g~~~~let~~~sl~~~a~dd~G~~~yl~svle~~s~~l~ai~~~e~~~t~~~~t~~d 232 (529) T protein:vir:10 153 ETATADSAGNERFLLKLTQTTSLGVVTTLETHTVSLAEEAKDDMGRLCYLPTALEARSKYLRAVVNEELISTAKVTNKKS 232 (529) T ss_pred EeeccccCCCccceeeEEEEeecCCceEEEEEEeeeeechhhhcCCccchhHHHhhccCceeeeeeeccccccchhhhhh Confidence 21 1223344444332221110 0000 1123333332211111 12222233 Q ss_pred eeeccCccccc-chhhHHHHHHHHhhccc--ceEEEEEcCCCcHHHHHHHHHHHHHHhhCCCeEEEEEcCCCCccHHHHH Q lcl|NC_019421. 197 QALEGGNDGCT-SITNESYLKALEEFERY--SFDSFVLDGVADEALQETTKAWVAKNKELGKDILLFLGGKTEDNIKQIN 273 (473) Q Consensus 197 ~~l~gG~dg~~-~~t~~d~~~~l~~le~~--~~~~l~~p~~~~~~~~~~l~~~v~~~~~~~~~~~av~~~~~~~t~~~~~ 273 (473) ..+++|+||.. .+...+|..|+.+|++. +|.++.-.+.-+.++.++|...|.+. .+..+...+++.|+.++. T Consensus 233 ~~f~~GtdG~~~~i~s~~y~~A~~~L~n~p~d~~~il~~g~y~~a~I~~L~~ic~~~-----~~d~f~DV~~~LT~~aA~ 307 (529) T protein:vir:10 233 LAFTGGTNGDQSKISTAAYLRAVKVLNNAPYMYTAVLGLGCYDNAAITALGKICADR-----LIDGFFDVKPTLTYAEAL 307 (529) T ss_pred hhccCCccccccccchHHHHHHHHHhcCCcceeeeeeccCCccHHHHHHHHHHHhhh-----hhcEEEcCCCCcCHHHHH Confidence 57899999864 34677899999999664 55555544555666666666666433 345666889999999999 Q ss_pred HhhhccCC---ceE---EEecCCceecC---cc--cchHHHHHHHHHhh--hcCccccccceeccCc-------cccccc Q lcl|NC_019421. 274 DKSKSFND---ENI---VNVGSSAYYEN---IK--YTPSEVAVYIAALS--VSKGITGSICNAKTIF-------EEVEPR 333 (473) Q Consensus 274 ~~~~~~n~---~~i---~~~~~~~~~~~---~~--~~~~~~a~~vAG~~--a~~~~~~s~t~~~~~~-------~~~~~~ 333 (473) .+...++- +.+ ++..|+...+. .. +.... .|++|+.- +....-.+.++.+.+- ..+.+- T Consensus 308 ~~~e~~gl~~~~~~~~s~y~~P~~~~D~~tg~k~~~GlsG-~A~~akargv~~na~v~g~hY~pAGe~r~~inr~~I~~l 386 (529) T protein:vir:10 308 PAVEDTGLLGTDYVSCSVYHYPFSCKDKWTQSRVVFGLSG-VAYAAKARGVKKNSDVGGWHYSPAGEERAVIARASIQPL 386 (529) T ss_pred HHHHhcCccccCceeeEEEEcceeeccccccCceeeCCCc-ceeeccccceeecccccccccccCCCccceeecccceec Confidence 88776654 333 23344432221 11 11110 12333221 1112222334433331 134444 Q ss_pred CCH--HHHHHHHhCCcEEEE-EcCCEEEEEecccccccCCCCCcchhhhhhhhHHHHHHHHHHHHHHhhcCCcccCCHHH Q lcl|NC_019421. 334 LSQ--SEVKECLKSGTLVLD-FDDGDVIIVDDVNTFKKYVDDKNEAMGYISNIMFINTINKDTSLKRKEFVGKIFNDATG 410 (473) Q Consensus 334 ~t~--~e~~~l~~~G~~~l~-~~~~~~~i~~gi~T~~~~~~~~~~~~~~i~v~R~~d~i~~~i~~~~~~~ig~~~N~~~~ 410 (473) ++. -|.+.|.++.++++- ..++...|..-++++++ |..|+.++++++|.+|.+.+-. ..+++-++|+.... T Consensus 387 y~~d~~e~~~lv~~riNPV~~~~~g~~~idDsLt~~~k-----nny~R~~hv~~lmn~I~~~~~k-~a~~~~~~Pd~it~ 460 (529) T protein:vir:10 387 YPEDTPDEEAMVKGRLNKVSVGTSGQMIIDDALTCCTQ-----DNYLHFQHVPSLMNAISRFFVQ-LARQMKHSPDGITA 460 (529) T ss_pred cCCCccCHHHHHhhccCeeeeeccCcceeeeeeceeee-----CCchhhhhHHHHHHHHHHHHHH-HHHHHhhCCChHHH Confidence 444 456679999999885 45666677777777664 8899999999999999988854 44677788999998 Q ss_pred HHHHHHHHHHHHHHHHhcCCccCcc-ceeccc-----cccCCCCCEEEEEEEEEEeeeeeeEEEEEEeC Q lcl|NC_019421. 411 QTTVICALKKYFEELMSQGIISEFN-VDIDTE-----LQATAKADEFYWKWDAVKVDVMKKIYGTGYLG 473 (473) Q Consensus 411 r~~i~~~i~~~l~~l~~~g~i~~~~-~~~D~~-----~~~~~~~d~~~v~i~v~p~~~~e~i~~t~~v~ 473 (473) |. ++.-+..+|+.+++.|+|.... +..|-+ ...+.+.|...+++.+.|.-....|++.=.|= T Consensus 461 ~g-l~~~l~~~L~r~~asgalv~prdp~~~G~epy~~~V~q~d~D~~~v~~~~~ptGv~Rri~~~p~l~ 528 (529) T protein:vir:10 461 AG-LTKGMTKLLDRFVASGALVAPRDPDADGTEPYVLKVTQAEFDKWEVVWACCPTGVARRIQGVPLLI 528 (529) T ss_pred HH-HHHhHHHHHHHHHhcCceecccCccCCCCCceEEEEeecccCeEEEEEEeecCCceeeEEeeeeec Confidence 88 8999999999999999995322 111110 01356778899999999999999998887777 No 54 >protein:vir:95263 Length: 450 # NCBI annotation: Phage conserved protein # Family: family:all:396 # MgeID: mge:1561 # MgeName: Felix 01 # Cross-refs: genbank:acc:NP_944896;genbank:gi:38707836;genbank:GeneID:2744049 Probab=99.64 E-value=8e-15 Score=97.86 Aligned_cols=414 Identities=11% Similarity=0.048 Sum_probs=212.0 Q ss_pred ccCCCCceecCceeEEEecC-CcceecccCceEEEEEEeeCCCCCCceEEeeccHHHHHHHcCCCcCcHHHHHHHHHHhc Q lcl|NC_019421. 5 TWNEKERKEIPGFYNRFKTQ-AEKSTNTGLKGRLAMPIRANWGDVGKVVTIKNDLRQLKNLFGDDMNYSAFKLGKLALLG 83 (473) Q Consensus 5 ~~~~~~~~~~PGvYie~~~~-~~~~i~~~~~~~~~~~g~a~~Gp~~~~v~i~s~~~~~~~~fG~~~~~~~~~~v~~~f~~ 83 (473) -|+.. |+..++ ...++....=+..+|+|.. .-|.+ .++..+..++....||.+ ++.+++++.+|.+ T Consensus 1 ~~s~i---------VnV~i~~~~~a~~~~~f~~~l~~~~~-~~~~~-r~~~yss~~~V~~~FG~~--S~ey~aA~~yF~q 67 (450) T protein:vir:95 1 MWNPI---------VNVDITLNTAGTTREGFGLPLFLAST-DNFEE-RVRGYTSLTEVAEDFDEN--TAAYKAAKQLWSQ 67 (450) T ss_pred CCCce---------EEEeecccccccccccceeEEEEcCC-CCCcc-ceeeecCHHHHHHhcCCC--cHHHHHHHHHHhC Confidence 46542 333322 2223333333445555543 34554 456666677788899966 4788899999976 Q ss_pred CC--CEEEEEecCCCcccceeeeecccccccccceEEEEecCc--cccceeEEEeeccCCc-cceeeeeecCCceeeEEE Q lcl|NC_019421. 84 NV--KELLLYRLVDGNQKKGTLTLKDTTENSAKDVIKLETKYP--TARNFNVTIKSNLVDS-DKKDFIFFENTKQLFSSS 158 (473) Q Consensus 84 g~--~~v~v~rv~~g~~~aat~~l~~~~~~~~~~~l~i~A~~~--G~~~n~i~v~~~~~~~-~~~~v~v~~~~~~~~~~~ 158 (473) .. +++|+.|-........... ......+.++++-.+. ...+..++...+..+. ..+.-.+.........+. T Consensus 68 ~p~p~~l~igr~~~~~t~~~~~~----~~~~~~g~lt~tv~G~~~~~~~i~~s~a~s~~~va~~~~tai~~~~~~~~~~~ 143 (450) T protein:vir:95 68 TPKVTQLYIGRRAMQYTVSIPDA----VTESTDYSITVAAGGGISQPYQYTAQSSDTAENVLQQFKTQIEADPTIKDKVS 143 (450) T ss_pred CCcccEEEEEeeccchhhhhhhh----hccccceeEEEEecceeeeeeEEEEEecCChhhHHHHhhhhhcccceeeeeee Confidence 43 3688888654322211111 1112233343322111 1111111111000000 000000000000000000 Q ss_pred ecccchhhhhhhhhcccccceeEeecccCCc---cccccceeeeccCcccccchhhHHHHHHHHhhcc--cceEEEEEcC Q lcl|NC_019421. 159 IKGTIDEIVLEINSNLDNEYVIATKVADSDT---ILANVVNQALEGGNDGCTSITNESYLKALEEFER--YSFDSFVLDG 233 (473) Q Consensus 159 ~~~~~~~~~~~~~~~~~s~~v~~~~~~~~~~---~~~~~~~~~l~gG~dg~~~~t~~d~~~~l~~le~--~~~~~l~~p~ 233 (473) ...... ...++......... .+..........|.+. +...++++++.. .+|..++++. T Consensus 144 ~~s~g~-----------~~~~t~~~~~~~~~~~~~l~~~~~~~~~~g~~a------et~~~a~~a~~~~~~~w~~~~~~~ 206 (450) T protein:vir:95 144 VNVTGS-----------NGSATMIIAKAGDNDFVKVTTTAQTVYIASTTA------DTASTALAAIEAYSTDWYFIAAED 206 (450) T ss_pred eeeecc-----------cceeeeeeeccccchhhccccccceeEeccccc------ccHHHHHHHHHHhhCCeEEEEecC Confidence 000000 00000000000000 0111122222233322 223455555543 3455667776 Q ss_pred CCcHHHHHHHHHHHHHHhhCCCeEEEEEcCCCCccH----HH---HHHhhhccCCceEEEecCCceecCcccchHHHHHH Q lcl|NC_019421. 234 VADEALQETTKAWVAKNKELGKDILLFLGGKTEDNI----KQ---INDKSKSFNDENIVNVGSSAYYENIKYTPSEVAVY 306 (473) Q Consensus 234 ~~~~~~~~~l~~~v~~~~~~~~~~~av~~~~~~~t~----~~---~~~~~~~~n~~~i~~~~~~~~~~~~~~~~~~~a~~ 306 (473) .++ +.+.++..|++.. . ++.+.......... .. +.+.-+..++.|.+.+.... ......+++ T Consensus 207 ~~~-~~i~a~a~w~~a~---~-~~f~~~~~~~~~~~~~~~~~~~~i~~~l~~~~~~~t~~~y~~~------~~~~~~~aa 275 (450) T protein:vir:95 207 RTQ-QFVLAMASEIQAR---K-KIFFTANSDVTALQGTELASANDVPAQLAKNMYTRTVCLWHHA------AAEDYPEMA 275 (450) T ss_pred CCH-HHHHHHHHHHhhc---C-cEEEEEcCCchhhhhhhhhcccchHHHHHhccCCeeEEEeeCC------CchhHHHHH Confidence 544 4446788998753 2 33333222211100 01 11122233445554433211 111223455 Q ss_pred HHHhhhcCcccc-ccceeccCccccc------ccCCHHHHHHHHhCCcEEEEEcCCEEEEEecccccccCCCCCcchhhh Q lcl|NC_019421. 307 IAALSVSKGITG-SICNAKTIFEEVE------PRLSQSEVKECLKSGTLVLDFDDGDVIIVDDVNTFKKYVDDKNEAMGY 379 (473) Q Consensus 307 vAG~~a~~~~~~-s~t~~~~~~~~~~------~~~t~~e~~~l~~~G~~~l~~~~~~~~i~~gi~T~~~~~~~~~~~~~~ 379 (473) ++|......+.+ .+-++.++++... +.++..|.+.|.++|.+.+.+.++...+-+|+++ +-+| T Consensus 276 ~~g~~~~~~~g~~T~~fk~l~Gv~~~v~~~~~~~lt~~~~~al~~~~~n~y~~~~~~~~~~~G~~~--------~G~~-- 345 (450) T protein:vir:95 276 YIAYGAPYDAGSIAWGNAQLTGVAASLQPSNQRPLTSIQKSALDVRHCNFIDLDGGVPVVRRGITS--------GGEW-- 345 (450) T ss_pred HHHHhhhcccceeeeccccccceeeeccCccccccchHHHHHHHhCCcEEEEEecCceeeeCCeee--------Ccch-- Confidence 666655544432 3445667766432 3588999999999999988877777777778755 2234 Q ss_pred hhhhHHHHHHHHHHHHHHhhc-----CCcccCCHHHHHHHHHHHHHHHHHHHhcCCccCccceecc-ccccCCCCC---E Q lcl|NC_019421. 380 ISNIMFINTINKDTSLKRKEF-----VGKIFNDATGQTTVICALKKYFEELMSQGIISEFNVDIDT-ELQATAKAD---E 450 (473) Q Consensus 380 i~v~R~~d~i~~~i~~~~~~~-----ig~~~N~~~~r~~i~~~i~~~l~~l~~~g~i~~~~~~~D~-~~~~~~~~d---~ 450 (473) |-+++..|++...++..+..+ .+|+|-+..+..+|++.|+..|++..+.|+|-.|.+...+ +.++++++. . T Consensus 346 iD~~~~~~wl~~~iq~~l~~ll~~~~~~KiPy~~~G~~~i~a~i~~~l~~a~~~G~Ia~~~V~~~~~~~~~~~dr~~R~~ 425 (450) T protein:vir:95 346 IDIIRGVDWLESDLKTSLRDLLINQKGGKITYDDTGITRIRQVIETSLQRAVNRNFLSSYTVNVPKASQVALADKKARIL 425 (450) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHhcCCCCCccChhhHHHHHHHHHHHHHHHHhcCcccceeEecCChHhcCHHHHhccCC Confidence 567889999998887544332 3689999999999999999999999999999988876533 223333333 2 Q ss_pred EEEEEEEEEeeeeeeEEEEEEeC Q lcl|NC_019421. 451 FYWKWDAVKVDVMKKIYGTGYLG 473 (473) Q Consensus 451 ~~v~i~v~p~~~~e~i~~t~~v~ 473 (473) --+.+.++...++..+.|+.+|. T Consensus 426 ~~i~~~~~laGAIh~~~i~~~v~ 448 (450) T protein:vir:95 426 KDVTFAGILAGAILDVDLKGTVA 448 (450) T ss_pred CCeeEEEEEccceEEEEEEEEEE Confidence 34788999999999999999999 No 55 >protein:vir:5260 Length: 502 # NCBI annotation: hypothetical protein # Family: family:all:396 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852765;genbank:gi:31544040;uniprot:Q7Y5T5;genbank:GeneID:2753559 Probab=99.52 E-value=5.8e-13 Score=87.67 Aligned_cols=426 Identities=12% Similarity=0.080 Sum_probs=207.5 Q ss_pred CCceecCceeEEEecC-CcceecccCceEEEEEEee-CCCC--CCceEEeeccHHHHHHHcCCCcCcHHHHHHHHHHhc- Q lcl|NC_019421. 9 KERKEIPGFYNRFKTQ-AEKSTNTGLKGRLAMPIRA-NWGD--VGKVVTIKNDLRQLKNLFGDDMNYSAFKLGKLALLG- 83 (473) Q Consensus 9 ~~~~~~PGvYie~~~~-~~~~i~~~~~~~~~~~g~a-~~Gp--~~~~v~i~s~~~~~~~~fG~~~~~~~~~~v~~~f~~- 83 (473) +.-|+- -||+...+ ....+.+.+=+..+|+|.. ..-| ..+.++..+..++....||..+ +.+++++.+|.+ T Consensus 1 msip~s--~ivnV~i~~~~~a~~~~~f~~~l~l~~~~~~~~~~~~~r~~~~~s~~~V~~~FG~~s--~ey~aA~~yF~q~ 76 (502) T protein:vir:52 1 MALSIS--HIVNVQLNTVPKSAARKSFGIVALFTPEAGQAFADEKTRYVYVENQRDVEQLFGTNS--ETAKAAQPFFAQS 76 (502) T ss_pred CCCCcc--ceeEEeeccccccccccccCceEEEeeccCccccCCccceEEecCHHHHHHhcCCCh--HHHHHHHHHhcCC Confidence 222222 24454322 2333444444567777753 3322 3345566665677888999664 788889998864 Q ss_pred -CCCEEEEEecCCCccc-cee-eeecccccc--------cccceEEEEecCcc--ccceeEEEeeccCCcc-----ce-- Q lcl|NC_019421. 84 -NVKELLLYRLVDGNQK-KGT-LTLKDTTEN--------SAKDVIKLETKYPT--ARNFNVTIKSNLVDSD-----KK-- 143 (473) Q Consensus 84 -g~~~v~v~rv~~g~~~-aat-~~l~~~~~~--------~~~~~l~i~A~~~G--~~~n~i~v~~~~~~~~-----~~-- 143 (473) ..+++++.|-...... ..+ ..+++.... -..+.++++--+-- ..+..++-..+..+.. .+ T Consensus 77 p~P~~l~igR~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~G~l~i~i~g~~~t~~~i~lS~~ts~~~vA~~i~~~l~~ 156 (502) T protein:vir:52 77 PRAKQLIVARWQKSASTIEATKNTLSGATLSDDLERFKSVVNGRFSLTIGGDVKKVDGLSFARLADFNAVATKIQEKLTT 156 (502) T ss_pred CccceEEEEeccccccceeechhhhhhhhhHHhHHHhhhhcCceeEEEecceeeeeeccccccccchhHHHHHHHhhhcc Confidence 3447899886532111 110 011110000 01122222110000 0000000000000000 00 Q ss_pred ---eeeeecCCceeeEEEecccchhhhhhhhhcccccceeEeeccc--CCccccccceeeeccC-------cccccchhh Q lcl|NC_019421. 144 ---DFIFFENTKQLFSSSIKGTIDEIVLEINSNLDNEYVIATKVAD--SDTILANVVNQALEGG-------NDGCTSITN 211 (473) Q Consensus 144 ---~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~v~~~~~~~--~~~~~~~~~~~~l~gG-------~dg~~~~t~ 211 (473) .+.+..+... ..+.+...... ....+.+..... .+++.. .....++.+ .+. ...+. T Consensus 157 ~~~~~tv~~d~~~-~~F~i~s~ttg---------~~~~~~~~~a~~~~~~gt~~-a~~l~l~~~~~av~v~~~~-~g~~a 224 (502) T protein:vir:52 157 LSVAVSIAYDETG-NRFIVSANVAG---------EDKKTEIDYAIDEGGEGEYI-GALLKLENGQASRKVGKNS-VSLKK 224 (502) T ss_pred cccceEEEEecCC-ceEEEEeccCC---------CcceeEEEEeecCCcchhHH-HHHhccccccceeeeeeec-ccccc Confidence 0000000000 00000000000 000000000000 000000 000001110 000 11223 Q ss_pred HHHHHHHHhhccc--ceEEEEEcCCCcHHHHHHHHHHHHHHhhCCCeEEEEEcCCCC-c--cHHHHHHhhhccCCceEEE Q lcl|NC_019421. 212 ESYLKALEEFERY--SFDSFVLDGVADEALQETTKAWVAKNKELGKDILLFLGGKTE-D--NIKQINDKSKSFNDENIVN 286 (473) Q Consensus 212 ~d~~~~l~~le~~--~~~~l~~p~~~~~~~~~~l~~~v~~~~~~~~~~~av~~~~~~-~--t~~~~~~~~~~~n~~~i~~ 286 (473) +...++|+++... +|..+.++...+++.+.++.+|++.. + ++.++...... . ....+.+..++.++.|.+. T Consensus 225 et~~~al~a~~~~~~~w~~~~~a~~~~~~~~la~a~~iea~---~-~~f~~~~~d~~~~~~~~~~i~~~l~a~~~~~t~~ 300 (502) T protein:vir:52 225 ETLGEALFNVAEVNNTWYGFTVAAQLTDSEVEAAAKYAQAN---T-KLFGANVIRAEQIEWSADNIYKKLYDAGLDHTLA 300 (502) T ss_pred cCHHHHHHHHHhccCceEEEEEeecCChhHHHHHHHHHhhc---C-cEEEEEecCcceeccccchHHHHHHhccCceeEE Confidence 4466777777543 45555555443455567889998752 2 23322221110 0 0111222234456666665 Q ss_pred ecCCceecCcccchHHHHHHHHHhhhcCcccc-----ccceeccCcccccccCCHHHHHHHHhCCcEEEEEcCCEEEEEe Q lcl|NC_019421. 287 VGSSAYYENIKYTPSEVAVYIAALSVSKGITG-----SICNAKTIFEEVEPRLSQSEVKECLKSGTLVLDFDDGDVIIVD 361 (473) Q Consensus 287 ~~~~~~~~~~~~~~~~~a~~vAG~~a~~~~~~-----s~t~~~~~~~~~~~~~t~~e~~~l~~~G~~~l~~~~~~~~i~~ 361 (473) +... ...++. +.+.|.+++.++.+ .+-++.++++... .++..|++.|.++|.+.+.+.++...+-+ T Consensus 301 ~y~~----~~~~~~----aa~~g~~as~~f~~~~g~iT~~fk~l~GV~~~-~lt~t~~~al~~~~~N~y~~~~~~~~~~~ 371 (502) T protein:vir:52 301 MFDK----NDMYPV----SSALARLLSTNFAANNSTLTLKFKQQPTITAD-EITATEFAKAKRLGINVYTYFDDVAMIAE 371 (502) T ss_pred EecC----CcchhH----HHHHHHHHhcCCCcCcceeeecccccCCcccC-cCCHHHHHHHHhcCceEEEEecCeeEEec Confidence 5431 122222 23445556655432 2344566666533 58999999999999999987777777777 Q ss_pred cccccccCCCCCcchhhhhhhhHHHHHHHHHHHHHH-hh-c--CCcccCCHHHHHHHHHHHHHHHHHHHhcCCccC---- Q lcl|NC_019421. 362 DVNTFKKYVDDKNEAMGYISNIMFINTINKDTSLKR-KE-F--VGKIFNDATGQTTVICALKKYFEELMSQGIISE---- 433 (473) Q Consensus 362 gi~T~~~~~~~~~~~~~~i~v~R~~d~i~~~i~~~~-~~-~--ig~~~N~~~~r~~i~~~i~~~l~~l~~~g~i~~---- 433 (473) |..+ +-+| |-+++-.|++...++..+ +. + -+|+|-+..+..+|++.|+..|++..+.|+|.. T Consensus 372 G~~~--------~G~~--iD~~~~~~Wl~~~lq~~l~~~L~~s~~kIPy~~~G~~~l~a~i~~~l~~a~~~G~I~~G~~~ 441 (502) T protein:vir:52 372 GTVI--------GGKF--ADEIVILDWFVDAVQKEVFARLYKSPTKIPLTDKGQAILIAAVEKVCLEGINNGAFAPGKWT 441 (502) T ss_pred Ceee--------CCch--hhHHHHHHHHHHHHHHHHHHHHHhcCCCcccChhHHHHHHHHHHHHHHHHHhcCcccccccc Confidence 7655 2234 556788889988886544 22 2 268999999999999999999999999999943 Q ss_pred ----------------ccceecc-ccccCCC---CCEEEEEEEEEEeeeeeeEEEEEEeC Q lcl|NC_019421. 434 ----------------FNVDIDT-ELQATAK---ADEFYWKWDAVKVDVMKKIYGTGYLG 473 (473) Q Consensus 434 ----------------~~~~~D~-~~~~~~~---~d~~~v~i~v~p~~~~e~i~~t~~v~ 473 (473) |.+...+ +.+.+++ +..--|.+.+++..++++|.|.+.|- T Consensus 442 ~~~~g~~~~~d~~~~gy~v~~~~~~~~s~~dr~~R~~~~~~~~~~~aGaIh~v~i~~nv~ 501 (502) T protein:vir:52 442 GAGFGNLSTGDYLDKGFYVWAAPMDTLSDSDRQARRATPIQTAVKLAGAIHSSDVIVNYN 501 (502) T ss_pred CcccceeeecccccCceEEEeCchhhCCHHHHHcccCCCeEEEEEECceEEEEEEEEEEe Confidence 4444322 2223332 33345888999999999999999999 No 56 >protein:vir:96104 Length: 504 # NCBI annotation: hypothetical protein ORF029 # Family: family:all:396 # MgeID: mge:1597 # MgeName: F8 # Cross-refs: genbank:acc:YP_001294446;genbank:gi:149408343;genbank:GeneID:5237223 Probab=99.28 E-value=2e-10 Score=73.76 Aligned_cols=425 Identities=11% Similarity=0.068 Sum_probs=191.3 Q ss_pred eecCceeEEEecCC-cceecccCceEEEEEEeeCCCCCCceEEeeccHHHHHHHcCCCcCcHHHHHHHHHHhc------C Q lcl|NC_019421. 12 KEIPGFYNRFKTQA-EKSTNTGLKGRLAMPIRANWGDVGKVVTIKNDLRQLKNLFGDDMNYSAFKLGKLALLG------N 84 (473) Q Consensus 12 ~~~PGvYie~~~~~-~~~i~~~~~~~~~~~g~a~~Gp~~~~v~i~s~~~~~~~~fG~~~~~~~~~~v~~~f~~------g 84 (473) -+-=-.||+..++- ........-+...|++.-..=|+++. +..+..++....||..+ +.+++++.+|.+ - T Consensus 1 mip~s~iV~V~~~v~~~~~~~~~~~~~l~l~~~~~~~~~r~-~~y~s~~~V~~~FG~~S--~ey~aA~~yF~~~~~~~~~ 77 (504) T protein:vir:96 1 MISQSRYIRIISGVGAGAPVAGRKLILRVMTTNNVIPPGIV-IEFDNANAVLSYFGAQS--EEYQRAAAYFKFISKSVNS 77 (504) T ss_pred CCCccceeEeeecccccccccccccceeEeecccCCCccce-EEecCHHHHHHhcCCCh--HHHHHHHHHhhcCCCCCcc Confidence 11123456665442 22222222345566655555577765 55555777888999774 778888998865 4 Q ss_pred CCEEEEEecCCCcccceeeeecccccccccceEEEEecCccc-----cceeEEEe-eccCCcccee-eeeecCC---cee Q lcl|NC_019421. 85 VKELLLYRLVDGNQKKGTLTLKDTTENSAKDVIKLETKYPTA-----RNFNVTIK-SNLVDSDKKD-FIFFENT---KQL 154 (473) Q Consensus 85 ~~~v~v~rv~~g~~~aat~~l~~~~~~~~~~~l~i~A~~~G~-----~~n~i~v~-~~~~~~~~~~-v~v~~~~---~~~ 154 (473) .+++++.|-..... ++. +++.... .....+.+...|. .|...++. .+....+.|. +...... ... T Consensus 78 P~~l~igR~~~~a~-~~~--l~g~~~~--~~~~~~~~i~~G~lsitv~G~~~~~~~i~~S~~ts~~~vA~~i~~al~~~~ 152 (504) T protein:vir:96 78 PSSISFARWVNTAI-APM--VVGDNLP--KTIADFAGFSAGVLTIMVGAAEKNITAIDTSAATSMDNVASIIQTEIRKNT 152 (504) T ss_pred ccEEEEEeecCcCc-cce--EEechhH--HHHHHHhhhhceEEEEEEcceeeeecccccccccchHHHHHHHHhhhhccc Confidence 67899999654321 121 1110000 0001111111111 00000000 0000000000 0000000 000 Q ss_pred eEE--EecccchhhhhhhhhcccccceeEeecccCC----------cccccc-----ceeeeccCcccccchhhHHHHHH Q lcl|NC_019421. 155 FSS--SIKGTIDEIVLEINSNLDNEYVIATKVADSD----------TILANV-----VNQALEGGNDGCTSITNESYLKA 217 (473) Q Consensus 155 ~~~--~~~~~~~~~~~~~~~~~~s~~v~~~~~~~~~----------~~~~~~-----~~~~l~gG~dg~~~~t~~d~~~~ 217 (473) ..+ ...... ....+.++......+.. ..+... .......|.+. +...++ T Consensus 153 ~~~~~~~tv~~--------d~~~~~f~its~~tg~~~~~~~~~a~~~~~~~~lgl~~~~~~~v~g~~a------et~~~a 218 (504) T protein:vir:96 153 DPQLAQATVTW--------NPNTNQFTLVGATIGTGVLAVAKSADPQDMSTALGWSTSNVVNVAGQAA------DLPDAA 218 (504) T ss_pred ccccccceEEE--------eccCCeEEEEeeccccceeEEEeeccccchhhhhhcccccceEEeeccc------ccHHHH Confidence 000 000000 00011111100000000 000000 00011112221 123455 Q ss_pred HHhhccc--ceEEEEEcCC-CcHHHHHHHHHHHHHHhhCCCeEEEEEcCCCCccHHHHHHhhhccCCceEEEecCCceec Q lcl|NC_019421. 218 LEEFERY--SFDSFVLDGV-ADEALQETTKAWVAKNKELGKDILLFLGGKTEDNIKQINDKSKSFNDENIVNVGSSAYYE 294 (473) Q Consensus 218 l~~le~~--~~~~l~~p~~-~~~~~~~~l~~~v~~~~~~~~~~~av~~~~~~~t~~~~~~~~~~~n~~~i~~~~~~~~~~ 294 (473) +.++... +|..+++... .+++.+.++..|++.. .++++.++......+ ..........+......+... . T Consensus 219 l~al~~~~~~Wy~f~~a~~~~~dd~ilalA~w~ea~---~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~---~ 291 (504) T protein:vir:96 219 VAKSTNVSNNFGSFLFAGATLDNDQIKAVSAWNAAQ---NNQFIYTVATSLANL-GALFDLVKGNSGTALNVLSAT---A 291 (504) T ss_pred HHHHHhhcCCeEEEEEEeccCCHHHHHHHHHHHhhc---CceEEEEEeecccch-hhHHHhhhhcceeEEEEeecC---c Confidence 6666443 3444444332 2334445778888743 233333332222111 122222223333333222211 1 Q ss_pred CcccchHHHHHHHHHhh-hcCccccccceeccCcccccccCCHHHHHHHHhCCcEEEEEc---CCEEEE-EecccccccC Q lcl|NC_019421. 295 NIKYTPSEVAVYIAALS-VSKGITGSICNAKTIFEEVEPRLSQSEVKECLKSGTLVLDFD---DGDVII-VDDVNTFKKY 369 (473) Q Consensus 295 ~~~~~~~~~a~~vAG~~-a~~~~~~s~t~~~~~~~~~~~~~t~~e~~~l~~~G~~~l~~~---~~~~~i-~~gi~T~~~~ 369 (473) ...+..+..++++|+.. .+..-..++-++.++++.. ..++..|.+.|..+|.+.+... +....+ -+|+.+ T Consensus 292 ~~~~~~~~~~~~~as~~f~~~ng~~T~~fk~l~GVta-~~lt~t~~~aL~~~~~N~y~~~a~~~~~~~~~~~G~~~---- 366 (504) T protein:vir:96 292 SNDFVEQCPSEILAATNYDEPGASQNYMYYQFPGRNI-TVSDDTAANTVDKSRGNYIGVTQANGQQLAFYQRGILC---- 366 (504) T ss_pred cchhHHHHHHHHHHhcCcCcccccccccccccCCcCc-ccCCHHHHHHHHhcCCeEEEEeecccceeeEEecCeee---- Confidence 12233333334444332 1122223455567777753 3689999999999999988533 233443 456554 Q ss_pred CCCCcc-hhhhhhhhHHHHHHHHHHHHHH-hh--cCCcccCCHHHHHHHHHHHHHHHHHHHhcCCccC------------ Q lcl|NC_019421. 370 VDDKNE-AMGYISNIMFINTINKDTSLKR-KE--FVGKIFNDATGQTTVICALKKYFEELMSQGIISE------------ 433 (473) Q Consensus 370 ~~~~~~-~~~~i~v~R~~d~i~~~i~~~~-~~--~ig~~~N~~~~r~~i~~~i~~~l~~l~~~g~i~~------------ 433 (473) . ++ +|..|.+.+-.++|...++..+ +- =.+|+|-+..+..+|++.|+..|++-.+.|+|.. T Consensus 367 --g-G~~~~~wiDv~~~~~WL~~~lq~~l~~l~~~~~kIPyt~~Gi~~l~a~i~~vl~~av~~G~I~~Gv~~~~~q~~~I 443 (504) T protein:vir:96 367 --G-GPTDAVDMNVYANEIWLKSAIAQALLDLFLNVNAVPASMVGEAMTLAVLQPVLDKATSNGTFTYGKDISAVQQQYI 443 (504) T ss_pred --C-CccccchhhhhhhHHHHHHHHHHHHHHHHhcCCCcccCHhhHHHHHHHHHHHHHHHHhcceeccCccCCccchhee Confidence 1 33 5677888888788877775332 11 2479999999999999999999999999999843 Q ss_pred -----------------ccceecc-ccccCC---CCCEEEEEEEEEEeeeeeeEEEEEEeC Q lcl|NC_019421. 434 -----------------FNVDIDT-ELQATA---KADEFYWKWDAVKVDVMKKIYGTGYLG 473 (473) Q Consensus 434 -----------------~~~~~D~-~~~~~~---~~d~~~v~i~v~p~~~~e~i~~t~~v~ 473 (473) |.+.+++ +.+.++ .+-..-+.+.++--.++.+|.++=++= T Consensus 444 ~~~~~~d~~~~~~~~~GYyv~~~~~s~~s~~~r~~R~~~~~~~~y~~~gaI~~v~~~~~~v 504 (504) T protein:vir:96 444 TQITGDRRAWRQVQTLGYWINITFSSYTNSNTGLTEWKANYTLIYSKGDAIRFVEGSDVMI 504 (504) T ss_pred cccccccccccceeccceEEEecChhccChhHhhhccccceEEEEEECCeEEEEEeccccC Confidence 2233221 111212 222345666666667777765554433 No 57 >protein:vir:101576 Length: 501 # NCBI annotation: gp04 # Family: family:all:396 # MgeID: mge:1477 # MgeName: Bcep43 # Cross-refs: genbank:acc:NP_958108;genbank:gi:41057654;genbank:GeneID:2716834 Probab=99.25 E-value=3.3e-10 Score=72.56 Aligned_cols=409 Identities=10% Similarity=0.042 Sum_probs=188.7 Q ss_pred CCceecC-ceeEEEecCCcceecccCceEEE-EEEeeCCCCCCceEEeeccHHHHHHHcCCCcCcHHHHHHHHHHh---- Q lcl|NC_019421. 9 KERKEIP-GFYNRFKTQAEKSTNTGLKGRLA-MPIRANWGDVGKVVTIKNDLRQLKNLFGDDMNYSAFKLGKLALL---- 82 (473) Q Consensus 9 ~~~~~~P-GvYie~~~~~~~~i~~~~~~~~~-~~g~a~~Gp~~~~v~i~s~~~~~~~~fG~~~~~~~~~~v~~~f~---- 82 (473) +++.-+| -.|++..++-. +-.+..++..+ |++....=|+++.....| .++....||..+ +.+++++.+|. T Consensus 1 m~~~~ip~s~iV~V~~~v~-~~~~~~~~~~~l~l~~~~~~~~~~~~~~~s-~~~V~~~FG~~S--~ey~aA~~yFsg~~~ 76 (501) T protein:vir:10 1 MPTTTIPIDQIVQMLPGVI-GAGGAPGRLTGLVLTQDTSVQPGQLADFFQ-ETDVENWFGALS--NEAKIADAYFPGIVN 76 (501) T ss_pred CCCCCcccceEEEEeeecc-cCCCccccceeEEEeccCCCCccceEEecC-HHHHHHhcCCCh--HHHHHHHHHhhhhcC Confidence 3432333 45666654422 22233333333 444445558898888665 677889999764 67788888885 Q ss_pred c--CCCEEEEEecCCCcccceeeeeccccc--------ccccceEEEEecCccccceeEEEeeccCCccceeeeeecCCc Q lcl|NC_019421. 83 G--NVKELLLYRLVDGNQKKGTLTLKDTTE--------NSAKDVIKLETKYPTARNFNVTIKSNLVDSDKKDFIFFENTK 152 (473) Q Consensus 83 ~--g~~~v~v~rv~~g~~~aat~~l~~~~~--------~~~~~~l~i~A~~~G~~~n~i~v~~~~~~~~~~~v~v~~~~~ 152 (473) . -.+++++.|-...... +. ++...- ..-.+.++++-....+ ....+.... T Consensus 77 q~p~P~~l~igR~~~~~~~-~~--l~g~~l~~~~la~~~~~sg~l~vti~g~~~-----~~~i~ls~a------------ 136 (501) T protein:vir:10 77 GGQLPYDLKFARYVAADAP-AS--VYGIPLTGVTLAQLQGYSGTLTVTTAAQHV-----SANISLAAA------------ 136 (501) T ss_pred CCccccEEEEEeecCCCcc-ce--EeccchhhhhhhhcceeeeEEEEeecccee-----ecccccccc------------ Confidence 2 3558999996542211 11 111000 0001222221110000 000000000 Q ss_pred eeeEEEecccchhhhhhhhhc------------ccccceeEeecccCCcccccc-------ceeeeccCcccc---cchh Q lcl|NC_019421. 153 QLFSSSIKGTIDEIVLEINSN------------LDNEYVIATKVADSDTILANV-------VNQALEGGNDGC---TSIT 210 (473) Q Consensus 153 ~~~~~~~~~~~~~~~~~~~~~------------~~s~~v~~~~~~~~~~~~~~~-------~~~~l~gG~dg~---~~~t 210 (473) .+..+....+... ..+.++..+...+...+.... ....|+.++... .+.. T Consensus 137 --------ts~~~vAs~i~~al~~~~~tv~~d~~~~~f~its~ttG~~~~i~~~~~~~~la~~l~Lt~~~~a~v~~~g~~ 208 (501) T protein:vir:10 137 --------TSFANAATLIEAAFTSPDFVVAYDALRNRFTVVTNATGTAAAISAVTGTNNLADELGLSAAAGATLQAAGVA 208 (501) T ss_pred --------cCHHHHHHHHhhhccCCceEEEEcccCceEEEEeeccCCceeEEEeeCchhhhhhcCccccccceEEecCcc Confidence 0000000000000 001111111111111110000 001111111000 0111 Q ss_pred hHHHHHHHHhhcccc--eEEEEEcCCCcHHHHHHHHHHHHHHhhCCCeEEEEEcCCCCccH-----HHHHHhhhccCCce Q lcl|NC_019421. 211 NESYLKALEEFERYS--FDSFVLDGVADEALQETTKAWVAKNKELGKDILLFLGGKTEDNI-----KQINDKSKSFNDEN 283 (473) Q Consensus 211 ~~d~~~~l~~le~~~--~~~l~~p~~~~~~~~~~l~~~v~~~~~~~~~~~av~~~~~~~t~-----~~~~~~~~~~n~~~ 283 (473) .+...+++.++.... |-.+......+++.+.++.+|++.. .++++.+......... ..+.+.-...++.| T Consensus 209 aet~~~a~~a~~~~~~~Wy~f~~a~~~~~~~~la~A~wiea~---~~~f~~~~~~~~~~~~~~~~~~~i~~~l~~~~y~~ 285 (501) T protein:vir:10 209 ADTPASAMNRAVGLSRNWATFTTAWTAVIADRLAFAAWNSGQ---AYKYMYVAPDLEAASIVTNNAASFGAQVFAAPYQG 285 (501) T ss_pred cccHHHHHHHHHhccCceEEEEEecCCChHHHHHHHHHHHhc---CceEEEEEecCchhhhhhhhhhhHHHHHHhcCCCc Confidence 222345666665443 3333332333455556788888743 2333332221111111 11222233346666 Q ss_pred EEEecCCceecCcccchHHHHHHHHHhhhcCcccc---ccc--eeccC-cccccccCCHHHHHHHHhCCcEEEEEc---C Q lcl|NC_019421. 284 IVNVGSSAYYENIKYTPSEVAVYIAALSVSKGITG---SIC--NAKTI-FEEVEPRLSQSEVKECLKSGTLVLDFD---D 354 (473) Q Consensus 284 i~~~~~~~~~~~~~~~~~~~a~~vAG~~a~~~~~~---s~t--~~~~~-~~~~~~~~t~~e~~~l~~~G~~~l~~~---~ 354 (473) .+.++. ..+.. +.+.|..++.+.++ +.| ++.++ ++. ...++..|.+.|..+|.+++... + T Consensus 286 t~~~y~------~~~~~----aa~~g~~as~nf~~~~g~~T~~fkq~~~Gi~-a~~lt~t~a~al~~~~~N~y~~~~~~~ 354 (501) T protein:vir:10 286 TLPLYG------DQATA----GAVMGYAASINFQLRNGRTVLAFRQFNAGVP-ATAHDLPTANALRSNNYTYIGAYANAA 354 (501) T ss_pred eEEECC------CCcHH----HHHHHHHHhhCcccCccceeeeccccCCCcC-cccCCHHHHHHHHhcCCeEEEEecccc Confidence 665542 12222 33445555555433 233 34443 332 34588999999999999998654 3 Q ss_pred CEEEEE-ecccccccCCCCCcchhhhhhhhHHHHHHHHHHHH----HHhhcCCcccCCHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_019421. 355 GDVIIV-DDVNTFKKYVDDKNEAMGYISNIMFINTINKDTSL----KRKEFVGKIFNDATGQTTVICALKKYFEELMSQG 429 (473) Q Consensus 355 ~~~~i~-~gi~T~~~~~~~~~~~~~~i~v~R~~d~i~~~i~~----~~~~~ig~~~N~~~~r~~i~~~i~~~l~~l~~~g 429 (473) +...+. +|+-+ -+|..|.+.+-.|++...++. .+.. .+|+|-+..+..+|++.|+..|++-.+.| T Consensus 355 ~~~~~~~~G~~s---------G~~~wiD~~~~~~Wl~~~iq~~l~~ll~~-~~kIPyt~~G~~~l~a~v~~~l~~av~nG 424 (501) T protein:vir:10 355 NNYTIAYDGKLS---------GKFLWVDTYLDQIYLNAELQRAEFEAMLA-YNSLPYNEDGYTALYRAGVDVIDAAVTSG 424 (501) T ss_pred ceeeEEecCeee---------ccceeehhhhhHHHHHHHHHHHHHHHHHh-cCCcccCHHHHHHHHHHHHHHHHHHHhCc Confidence 445443 45322 134556666655666555532 2222 47999999999999999999999999999 Q ss_pred CccC-----------------------------ccceecccccc---CCCCCEEEEEEEEEEeeeeeeEEEEEEeC Q lcl|NC_019421. 430 IISE-----------------------------FNVDIDTELQA---TAKADEFYWKWDAVKVDVMKKIYGTGYLG 473 (473) Q Consensus 430 ~i~~-----------------------------~~~~~D~~~~~---~~~~d~~~v~i~v~p~~~~e~i~~t~~v~ 473 (473) .|.. |.+.+++..++ ...+...-+.+.++--.++.+|.|.-+.= T Consensus 425 ~I~~Gv~l~~~q~~~i~~~~g~~~~~~~v~~~Gyy~~~~~~~~~~~~R~~R~~p~~~~~y~~~gaIh~v~i~s~~v 500 (501) T protein:vir:10 425 IIRAGVTLTNSQLQQIDAAAGVAGAGQLVQTRGWYFLIGDPANPGQARQNRTTPACTLWYSDGGSIQQLTIGSNAV 500 (501) T ss_pred eeecCCCCCcccceeeccccCccccccceeccceeEeeccccCChhhhhhccccceEEEEEeCCceeEEEeeeeec Confidence 9943 22222211111 12233455666777777777775432222 No 58 >protein:vir:3751 Length: 376 # NCBI annotation: orf23 # Family: family:all:669 # MgeID: mge:79 # MgeName: HP1 # Cross-refs: genbank:acc:NP_043492;genbank:gi:9628627;genbank:GeneID:1261129 Probab=99.19 E-value=7.1e-10 Score=70.73 Aligned_cols=330 Identities=11% Similarity=0.074 Sum_probs=190.1 Q ss_pred ecCceeEEEecCCcceecccCceEEEEEEeeCCCCCCceEEeeccHHHHHHHcCCCcCcHHHHH-HHHHHhcCCCEEEEE Q lcl|NC_019421. 13 EIPGFYNRFKTQAEKSTNTGLKGRLAMPIRANWGDVGKVVTIKNDLRQLKNLFGDDMNYSAFKL-GKLALLGNVKELLLY 91 (473) Q Consensus 13 ~~PGvYie~~~~~~~~i~~~~~~~~~~~g~a~~Gp~~~~v~i~s~~~~~~~~fG~~~~~~~~~~-v~~~f~~g~~~v~v~ 91 (473) ..|=|=|+-++..+-++..+.+ ...|+|.+... .++...|.+. +++-.+||.. ++.++. +..+..|+|.. |-. T Consensus 1 ~~~~v~vn~ln~~qg~~~~ver-~~lfig~~~~~-~~~~~~~~~~-sdld~~lg~~--ds~lk~~v~aa~~naG~~-w~a 74 (376) T protein:vir:37 1 MFPSVQINALNQLSGETKEIER-HALFVGVGTTN-QGKLLALTPD-SDFDKVFGET--DTDLKKQVRAAMLNAGQN-WFA 74 (376) T ss_pred CCCeEEEeeeeccCCCcccccc-eEEEeeccccc-cCceEEecCC-CChHHhhCCC--chhHHHHHHHHHhCCCCc-eEE Confidence 7788999999998888888888 78999987654 3566677664 3477788863 344544 55544454321 111 Q ss_pred ecCCCcccceeeeecccccccccceEEEEecCccccceeEEEeeccCCccceeeeeecCCceeeEEEecccchhhhhhhh Q lcl|NC_019421. 92 RLVDGNQKKGTLTLKDTTENSAKDVIKLETKYPTARNFNVTIKSNLVDSDKKDFIFFENTKQLFSSSIKGTIDEIVLEIN 171 (473) Q Consensus 92 rv~~g~~~aat~~l~~~~~~~~~~~l~i~A~~~G~~~n~i~v~~~~~~~~~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~ 171 (473) .+ ++ |+ T Consensus 75 ~~--------------------------~~--p~---------------------------------------------- 80 (376) T protein:vir:37 75 HV--------------------------YI--AQ---------------------------------------------- 80 (376) T ss_pred EE--------------------------Ee--cC---------------------------------------------- Confidence 00 00 00 Q ss_pred hcccccceeEeecccCCccccccceeeeccCcccccchhhHHHHHHHHhh-cccc--eEEEEEcCCCcHHHHHHHHHHHH Q lcl|NC_019421. 172 SNLDNEYVIATKVADSDTILANVVNQALEGGNDGCTSITNESYLKALEEF-ERYS--FDSFVLDGVADEALQETTKAWVA 248 (473) Q Consensus 172 ~~~~s~~v~~~~~~~~~~~~~~~~~~~l~gG~dg~~~~t~~d~~~~l~~l-e~~~--~~~l~~p~~~~~~~~~~l~~~v~ 248 (473) .+..++.+|+... +.+. +-.++-|..++.+...++.+... T Consensus 81 -------------------------------------~~~~~~~~Av~~a~~~~s~E~V~v~~p~~t~~a~i~a~qa~a~ 123 (376) T protein:vir:37 81 -------------------------------------EDGYDFVECVKKANQTASFEYCVNTRYLGVDKASIGKLQECYA 123 (376) T ss_pred -------------------------------------CChhhHHHHHHHHHhhCCeeEEEEecCcchhHHHHHHHHHHHH Confidence 0011222222222 1222 22222232223333344444443 Q ss_pred HHhhC-CCeEEEEEcCC-------CCccH----HHHHHhhhccCCceEEEecCCceecCcccchHHHHHHHHHhhh--cC Q lcl|NC_019421. 249 KNKEL-GKDILLFLGGK-------TEDNI----KQINDKSKSFNDENIVNVGSSAYYENIKYTPSEVAVYIAALSV--SK 314 (473) Q Consensus 249 ~~~~~-~~~~~av~~~~-------~~~t~----~~~~~~~~~~n~~~i~~~~~~~~~~~~~~~~~~~a~~vAG~~a--~~ 314 (473) ..... +|.+..++... .++++ ....+....+.++++..|..-. + ...+.+||.++ +. T Consensus 124 el~~~~~R~vffile~~g~d~~~~~ge~w~~y~~~l~a~~~gia~~~V~vV~~~~---g------n~~G~~aGRl~naaV 194 (376) T protein:vir:37 124 ELLAKFGRRTFFIQAVQGINHDQSDGETWDQYVQKLTTLQQTIVADHVCLVPLLF---G------NETGVLAGRLANRAV 194 (376) T ss_pred HHHHhcCCeEEEEEeccCCCCcccccCCHHHHHHHHHHHhccccccceeeeeeec---c------chHHHHHHHHHhCCc Confidence 34333 45555444432 12233 2334445667778887765311 1 12456677764 44 Q ss_pred cccccccee----ccCccc-------ccccCCHHHHHHHHhCCcEEEEE-cC-CEEEEEecccccccCCCCCcchhhhhh Q lcl|NC_019421. 315 GITGSICNA----KTIFEE-------VEPRLSQSEVKECLKSGTLVLDF-DD-GDVIIVDDVNTFKKYVDDKNEAMGYIS 381 (473) Q Consensus 315 ~~~~s~t~~----~~~~~~-------~~~~~t~~e~~~l~~~G~~~l~~-~~-~~~~i~~gi~T~~~~~~~~~~~~~~i~ 381 (473) ++.+|+-.- .++... ....++...+..|-++|..+++. ++ .++.+-+| +|+ +..+.+|++|. T Consensus 195 sVadspgRV~tGai~gl~~~~~p~d~~g~el~~a~l~aLd~arysvpr~Y~gydG~Yw~dg-~tl----~~~gsDYq~ie 269 (376) T protein:vir:37 195 TVADSPARVQTGALVSLGSANKPLDKDGNELTLAHLKSLETARYSVPMWYPDYDGYYRADG-RTL----DVEGGDYQVIE 269 (376) T ss_pred chhcCccceeecccccccccccccccCCcccchHHHHHHHhCCCeEEEeeCCCCceEEeCC-eEe----ccCCCCeeeeh Confidence 556665332 222111 11235677888999999998864 43 34555444 443 34577899999 Q ss_pred hhHHHHHHHHHHHHHHhhcCC--cccCCHHHHHHHHHHHHHHHHHHHhcCCccCcc--ceecc----ccc-cCCCCCEEE Q lcl|NC_019421. 382 NIMFINTINKDTSLKRKEFVG--KIFNDATGQTTVICALKKYFEELMSQGIISEFN--VDIDT----ELQ-ATAKADEFY 452 (473) Q Consensus 382 v~R~~d~i~~~i~~~~~~~ig--~~~N~~~~r~~i~~~i~~~l~~l~~~g~i~~~~--~~~D~----~~~-~~~~~d~~~ 452 (473) .+|++|-+.+.+|...=++|+ .++.++...+..+.-+..-|++|.+.+-|-.+. -+|.+ +++ .-.++..+. T Consensus 270 ~~RVvdKa~R~vR~~Ai~~i~Dr~lnstp~sia~~~~~~~~pLr~M~ks~ei~g~~fpgei~~P~d~dI~i~w~sk~~V~ 349 (376) T protein:vir:37 270 NLRVVDKVARKVRLLAIGKIADRSFNSTTSSTEYHKNYFAKPLRDMSKSATINGKDFPGECMPPKDDAITIVWQSKTKVT 349 (376) T ss_pred hchHHHHHHHHHHHHHHHHhcCccccCChhHHHHHHHHHhHHHHHHHhhhhhccccccceeecCCCCceEEEeccCceEE Confidence 999999999999854445665 234445556677888888999999988876532 12221 111 112345688 Q ss_pred EEEEEEEeeeeeeEEEEEEeC Q lcl|NC_019421. 453 WKWDAVKVDVMKKIYGTGYLG 473 (473) Q Consensus 453 v~i~v~p~~~~e~i~~t~~v~ 473 (473) |.+-++|.++-.+|...|.+- T Consensus 350 I~~~vrPy~cpk~i~~~I~LD 370 (376) T protein:vir:37 350 IYIKVRPYDCPKEITANIFLD 370 (376) T ss_pred EEEEEeeecCcceeEEEEEEe Confidence 888999999999999999888 No 59 >protein:vir:3636 Length: 501 # NCBI annotation: gp04 # Family: family:all:396 # MgeID: mge:75 # MgeName: Bcep781 # Cross-refs: genbank:acc:NP_705631;genbank:gi:23752316;genbank:GeneID:955753 Probab=99.16 E-value=9.9e-10 Score=69.95 Aligned_cols=418 Identities=9% Similarity=0.028 Sum_probs=191.4 Q ss_pred CCceecC-ceeEEEecCCcceecccCceEEEEEEeeCCC-CCCceEEeeccHHHHHHHcCCCcCcHHHHHHHHHHh---- Q lcl|NC_019421. 9 KERKEIP-GFYNRFKTQAEKSTNTGLKGRLAMPIRANWG-DVGKVVTIKNDLRQLKNLFGDDMNYSAFKLGKLALL---- 82 (473) Q Consensus 9 ~~~~~~P-GvYie~~~~~~~~i~~~~~~~~~~~g~a~~G-p~~~~v~i~s~~~~~~~~fG~~~~~~~~~~v~~~f~---- 82 (473) .++..+| -.|++..++ +.+-.+..+...+++...+-. |+++ ++..+..++....||..+ +.+++++.+|. T Consensus 1 m~~~~ip~s~iV~V~~~-v~~~~~~~~~~~~lllt~~~~~~~~r-~~~y~s~~~V~~~FG~~S--~ey~aA~~yFs~~~~ 76 (501) T protein:vir:36 1 MPTTTIPIDQIVQMLPG-VIGAGGAPGRLTGLVLTQDTSVQPGQ-LADFFQETDVENWFGALS--NEAKIADAYFPGIVN 76 (501) T ss_pred CCcCCcccceEEEEeee-eccCCCcceeeeeEEEeccCCCCCcc-eeeecCHHHHHHhcCCCh--HHHHHHHHHhhcccC Confidence 3433343 356666553 333334444444444444444 6764 566666777899999764 67888888884 Q ss_pred c--CCCEEEEEecCCCcccceeee--eccc--cc-ccccceEEEEecCccccceeEEEeeccCCccc------------- Q lcl|NC_019421. 83 G--NVKELLLYRLVDGNQKKGTLT--LKDT--TE-NSAKDVIKLETKYPTARNFNVTIKSNLVDSDK------------- 142 (473) Q Consensus 83 ~--g~~~v~v~rv~~g~~~aat~~--l~~~--~~-~~~~~~l~i~A~~~G~~~n~i~v~~~~~~~~~------------- 142 (473) . -.+++++.|-......+.... +... +. ....+.|+++.... ......+...... T Consensus 77 q~~~P~~l~igR~~~~a~~~~l~g~~l~~~~~a~~~~~sg~l~vti~g~-----~~~~~i~lS~~ts~~~vA~~i~~al~ 151 (501) T protein:vir:36 77 GGQLPYDLKFARYVAADAPASVYGIPLTGVTLAQLQGYSGTLTVTTAAQ-----HVSANISLAAATSFANAATLIEAAFT 151 (501) T ss_pred CCccccEEEEEeecCcCcceeEeccchhhhhhhhccceeEEEEEEecce-----eeeeecccccccCHHHHHHHHhhhhc Confidence 2 355799999764322211100 0000 00 00112333322110 0000000000000 Q ss_pred -eeeeeecCCceeeEEEecccchhhhhhhhhcccccceeEeecccCCccccccceeeeccCcccc---cchhhHHHHHHH Q lcl|NC_019421. 143 -KDFIFFENTKQLFSSSIKGTIDEIVLEINSNLDNEYVIATKVADSDTILANVVNQALEGGNDGC---TSITNESYLKAL 218 (473) Q Consensus 143 -~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~v~~~~~~~~~~~~~~~~~~~l~gG~dg~---~~~t~~d~~~~l 218 (473) ..+.+..+... ..+.+...... ....+..... ...++. ...|+.++... .....+...+++ T Consensus 152 ~~~~tv~~d~~~-~~f~i~s~t~G---------~~~~i~~~t~---~~~ia~--~l~Lt~~~~a~v~~~g~~~et~~~al 216 (501) T protein:vir:36 152 SPDFVVAYDALR-NRFTVVTNATG---------TAAAISAVTG---TNNFAD--EIGLSAAAGATLQAAGVAADTPASAM 216 (501) T ss_pred CcceEEEEcCcc-eeEEEEeccCC---------cceeeEeeec---ccchhh--hhcccccCcceEEecccccccHHHHH Confidence 00111100000 00000000000 0000111000 000000 01111111000 000111234556 Q ss_pred Hhhcccc--eEEEEEcCCCcHHHHHHHHHHHHHHhhCCCeEEEEEcCCCCcc-----HHHHHHhhhccCCceEEEecCCc Q lcl|NC_019421. 219 EEFERYS--FDSFVLDGVADEALQETTKAWVAKNKELGKDILLFLGGKTEDN-----IKQINDKSKSFNDENIVNVGSSA 291 (473) Q Consensus 219 ~~le~~~--~~~l~~p~~~~~~~~~~l~~~v~~~~~~~~~~~av~~~~~~~t-----~~~~~~~~~~~n~~~i~~~~~~~ 291 (473) .++.... |-.+.+....+++.+.++.+|++.. .++++.+........ -..+.+.-++.++.|.+.++.. T Consensus 217 ~a~~~~s~~Wy~f~~a~~~~~~~~la~A~wiea~---~~~f~~~~~~~~~~~~~~~~~~~i~~~l~~~~y~~t~~~y~~- 292 (501) T protein:vir:36 217 NRAVGLSRNWATFTTAWTAVIADRLAFASWNSGQ---AYKYMYVAPDLEAASIVSNNAASFGAQVFAAPYQGTLPLYGD- 292 (501) T ss_pred HHHHhccCceEEEEEecCCChHHHHHHHHHHhhc---CceEEEEEecCchhhhhccchhhHHHHHHhcCCCcEEEEcCC- Confidence 6665443 3223332333445556888898743 233332221111000 1112233344567777665421 Q ss_pred eecCcccchHHHHHHHHHhhhcCcccc---ccc--eecc-CcccccccCCHHHHHHHHhCCcEEEEE---cCCEEEEE-e Q lcl|NC_019421. 292 YYENIKYTPSEVAVYIAALSVSKGITG---SIC--NAKT-IFEEVEPRLSQSEVKECLKSGTLVLDF---DDGDVIIV-D 361 (473) Q Consensus 292 ~~~~~~~~~~~~a~~vAG~~a~~~~~~---s~t--~~~~-~~~~~~~~~t~~e~~~l~~~G~~~l~~---~~~~~~i~-~ 361 (473) ..+. +.+-|..++.+.++ +.| ++.+ +++. ...++..|.+.|..+|.+.+.. .++.+.+. + T Consensus 293 -----~~~~----aa~~g~~as~nf~~~~g~~T~~fkq~~~Gi~-a~~l~~t~a~al~~~~~N~y~~~~~~~~~~~~~~~ 362 (501) T protein:vir:36 293 -----QATA----GAVMGYAASINFQLRNGRTVLAFRQFNAGVP-ATVHDLPTANALRSNNYTYIGAYANAANNYTIAYD 362 (501) T ss_pred -----CCHH----HHHHHHHHhcCcccCcceeeeeccccCCCcC-cCcCCHHHHHHHHhcCCcEEEEEecccceeeEEEc Confidence 1122 23455555555443 333 3443 3333 2358889999999999997642 34555553 4 Q ss_pred cccccccCCCCCcchhhhhhhhHHHHHHHHHHHHHHhh---cCCcccCCHHHHHHHHHHHHHHHHHHHhcCCccCc---- Q lcl|NC_019421. 362 DVNTFKKYVDDKNEAMGYISNIMFINTINKDTSLKRKE---FVGKIFNDATGQTTVICALKKYFEELMSQGIISEF---- 434 (473) Q Consensus 362 gi~T~~~~~~~~~~~~~~i~v~R~~d~i~~~i~~~~~~---~ig~~~N~~~~r~~i~~~i~~~l~~l~~~g~i~~~---- 434 (473) |.-+ -+|..|.+.+-.|++...++..+-. =.+|+|-+..+..+|++.|+..|++-.+.|+|..- T Consensus 363 G~~s---------G~~~wiD~~~g~dWL~~~iq~~l~~ll~~~~KIPytd~G~~~l~a~i~~~l~~av~nG~I~~Gv~~~ 433 (501) T protein:vir:36 363 GKLS---------GKFLWVDTYLDQIYLNAELQRAEFEAMLAYNSLPYNEDGYTGLYRAGVDVIDAAVTSGIIRAGVTLT 433 (501) T ss_pred Ceee---------ccchhhhHHHhHHHHHHHHHHHHHHHHhcCCCCccChhhHHHHHHHHHHHHHHHHhCceeecCCCCC Confidence 5321 1355677788888887777543322 24799999999999999999999999999999431 Q ss_pred -------------------------cceeccccccC---CCCCEEEEEEEEEEeeeeeeEEEEEEeC Q lcl|NC_019421. 435 -------------------------NVDIDTELQAT---AKADEFYWKWDAVKVDVMKKIYGTGYLG 473 (473) Q Consensus 435 -------------------------~~~~D~~~~~~---~~~d~~~v~i~v~p~~~~e~i~~t~~v~ 473 (473) .+.+++..++. ..+...-+.+.++--.++.+|.|.-+.= T Consensus 434 ~~q~~~I~~~~g~~~~~~~v~~~Gyy~~~~~~~~~~~~R~~R~~p~~~~~y~~~gaIh~v~i~s~~v 500 (501) T protein:vir:36 434 NSQLQQIDKAARVAGAGQLVQTRGWYFLIGDPANPGQARQNRTTPACTLWYSDGGSIQSLTIGSNAV 500 (501) T ss_pred cccceeecccccccccccceeccceEEeeCcccCChhhhhhcccCcEEEEEEeCCceeEEEeeeeee Confidence 11222111111 2233355666677777777775422222 No 60 >protein:vir:3788 Length: 376 # NCBI annotation: tail sheath # Family: family:all:669 # MgeID: mge:328 # MgeName: HP2 # Cross-refs: genbank:acc:NP_536828;genbank:gi:17981837;genbank:GeneID:929216 Probab=99.15 E-value=1.2e-09 Score=69.47 Aligned_cols=330 Identities=12% Similarity=0.090 Sum_probs=184.6 Q ss_pred ecCceeEEEecCCcceecccCceEEEEEEeeCCCCCCceEEeeccHHHHHHHcCCCcCcHHHHH-HHHHHhcCCCEEEEE Q lcl|NC_019421. 13 EIPGFYNRFKTQAEKSTNTGLKGRLAMPIRANWGDVGKVVTIKNDLRQLKNLFGDDMNYSAFKL-GKLALLGNVKELLLY 91 (473) Q Consensus 13 ~~PGvYie~~~~~~~~i~~~~~~~~~~~g~a~~Gp~~~~v~i~s~~~~~~~~fG~~~~~~~~~~-v~~~f~~g~~~v~v~ 91 (473) ..|=|=|+-++.++.++..+.| .+.|+|.+... .++...+.... ++-.++|... +.++. +..|..|+|.. |-. T Consensus 1 ~~~~v~vn~~n~~~g~~~~~er-~~Lfig~~~~~-~~~~~~~~~~s-dld~~lg~~~--~~lk~~v~aa~~naG~~-~~~ 74 (376) T protein:vir:37 1 MFPSVQINALNQLSGETKEIER-HALFVGVGTTN-QGKLLALTPDS-DFDKVFGETD--TDLKKQVRAAMLNAGQN-WFA 74 (376) T ss_pred CCCeEEEecccccCCCcccccc-eEEeecccccc-ccceeeecCcc-chHhhhCCCc--hHHHHHHHHHHhCCCCc-EEE Confidence 7788999999999999988888 78999987765 35666676643 4677888743 44454 44444455532 211 Q ss_pred ecCCCcccceeeeecccccccccceEEEEecCccccceeEEEeeccCCccceeeeeecCCceeeEEEecccchhhhhhhh Q lcl|NC_019421. 92 RLVDGNQKKGTLTLKDTTENSAKDVIKLETKYPTARNFNVTIKSNLVDSDKKDFIFFENTKQLFSSSIKGTIDEIVLEIN 171 (473) Q Consensus 92 rv~~g~~~aat~~l~~~~~~~~~~~l~i~A~~~G~~~n~i~v~~~~~~~~~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~ 171 (473) .+.... T Consensus 75 ~~~~~~-------------------------------------------------------------------------- 80 (376) T protein:vir:37 75 HVYIAQ-------------------------------------------------------------------------- 80 (376) T ss_pred EEEeec-------------------------------------------------------------------------- Confidence 110000 Q ss_pred hcccccceeEeecccCCccccccceeeeccCcccccchhhHHHHHHHHh-hcccceE--EEEEcCCCcHHHHHHHHHHHH Q lcl|NC_019421. 172 SNLDNEYVIATKVADSDTILANVVNQALEGGNDGCTSITNESYLKALEE-FERYSFD--SFVLDGVADEALQETTKAWVA 248 (473) Q Consensus 172 ~~~~s~~v~~~~~~~~~~~~~~~~~~~l~gG~dg~~~~t~~d~~~~l~~-le~~~~~--~l~~p~~~~~~~~~~l~~~v~ 248 (473) .++.++.+++.. .+...|. .++-|..++.+...++.+..+ T Consensus 81 -------------------------------------~~~~~~~~Av~~a~~~~s~E~V~v~~pv~t~~a~i~aa~~~a~ 123 (376) T protein:vir:37 81 -------------------------------------EDGYDFVECVKKANQTASFEYCVNTRYLGVDKASIGKLQECYA 123 (376) T ss_pred -------------------------------------CCchHHHHHHHHhhhhcCceEEEEeccccccHHHHHHHHHHHH Confidence 000111222111 1122222 222221122333333344444 Q ss_pred HHhhC-CCeEEEEEcCC-------CCccHH----HHHHhhhccCCceEEEecCCceecCcccchHHHHHHHHHhh--hcC Q lcl|NC_019421. 249 KNKEL-GKDILLFLGGK-------TEDNIK----QINDKSKSFNDENIVNVGSSAYYENIKYTPSEVAVYIAALS--VSK 314 (473) Q Consensus 249 ~~~~~-~~~~~av~~~~-------~~~t~~----~~~~~~~~~n~~~i~~~~~~~~~~~~~~~~~~~a~~vAG~~--a~~ 314 (473) .+... +|.+..++... .++++. ...+....+.++++..|.... | ...|.+||.+ ++. T Consensus 124 el~~~~~Rpv~file~r~~~~~~~~~e~w~~y~~~~~al~~gia~~~V~~V~~~~---g------n~~G~~aGRl~~aaV 194 (376) T protein:vir:37 124 ELLAKFGRRTFFIQAVQGINHDQSDGETWDQYVQKLTTLQQTIVADHVCLVPLLF---G------NETGVLAGRLANRAV 194 (376) T ss_pred HHHHhcCCeEEEEEeccCcCcccccccCHHHHHHHHHHhhcccccccceeeeeeh---h------hhHHHHHHHHhhccc Confidence 44333 45554444432 112322 333445556777776554210 0 2256778876 455 Q ss_pred ccccccce----eccCc-------ccccccCCHHHHHHHHhCCcEEEEE-cC-CEEEEEecccccccCCCCCcchhhhhh Q lcl|NC_019421. 315 GITGSICN----AKTIF-------EEVEPRLSQSEVKECLKSGTLVLDF-DD-GDVIIVDDVNTFKKYVDDKNEAMGYIS 381 (473) Q Consensus 315 ~~~~s~t~----~~~~~-------~~~~~~~t~~e~~~l~~~G~~~l~~-~~-~~~~i~~gi~T~~~~~~~~~~~~~~i~ 381 (473) ++++|+-. ...+. ......++...++.|-++|..+++. .+ .++.+-.| +|+ +..+.+|++|. T Consensus 195 sVadspgRV~tG~l~gl~~~~lp~d~~~~~l~~a~l~aLd~agy~vp~~Y~gy~G~Y~~d~-~tl----~~~gsDY~~ie 269 (376) T protein:vir:37 195 TVADSPARVQTGALVSLGSANKPLDKDRNELTLAHLKSLETARYSVPMWYPDYDGYYWADG-RTL----DVEGGDYQVIE 269 (376) T ss_pred chhhCccceeccccccccccccccCcCcccCCHHHHHHHHhCCCeEEEeeCCCCceEEeCc-eEe----ccCCCChhhhh Confidence 55666532 11111 1122357888999999999998864 43 34554444 443 34567899999 Q ss_pred hhHHHHHHHHHHHHHHhhcCCc-ccCCH-HHHHHHHHHHHHHHHHHHhcCCccCcc--ceeccccc-----cCCCCCEEE Q lcl|NC_019421. 382 NIMFINTINKDTSLKRKEFVGK-IFNDA-TGQTTVICALKKYFEELMSQGIISEFN--VDIDTELQ-----ATAKADEFY 452 (473) Q Consensus 382 v~R~~d~i~~~i~~~~~~~ig~-~~N~~-~~r~~i~~~i~~~l~~l~~~g~i~~~~--~~~D~~~~-----~~~~~d~~~ 452 (473) .+|++|-..+.+|...-+++.. .-|+. ..-+..+.-+..-|++|.+...|.... -++.+... .-.+...+. T Consensus 270 ~~RVvdKa~R~vR~~ai~~i~D~~lnst~~sia~~~~yi~~pLr~M~~s~~i~g~~fpGeI~~p~d~Di~i~w~s~~~V~ 349 (376) T protein:vir:37 270 NLRVVDKVARKVRLLAIGKIADRSFNSTTSSTEYHKNYFAKPLRDMSKSATINGKDFPGECMPPKDDAITIVWQSKTKVT 349 (376) T ss_pred hhhHHHHHHHHHHHHHHHHhCCcccCcchhhHHHHHHHHHHHHHHHHhcchhccccccceeecCCCCCceEEeeccceEE Confidence 9999999999998655455553 22333 333445555777788888877764421 12211101 112344578 Q ss_pred EEEEEEEeeeeeeEEEEEEeC Q lcl|NC_019421. 453 WKWDAVKVDVMKKIYGTGYLG 473 (473) Q Consensus 453 v~i~v~p~~~~e~i~~t~~v~ 473 (473) |.+.++|..+..+|...|-+. T Consensus 350 I~~~v~P~~~pk~Itv~I~Ld 370 (376) T protein:vir:37 350 IYIKVRPYDCPKEITANIFLD 370 (376) T ss_pred EEEEEEeccCCceEEEEEEee Confidence 888999999999999888888 No 61 >protein:vir:94073 Length: 494 # NCBI annotation: hypothetical protein # Family: family:all:396 # MgeID: mge:1493 # MgeName: OP2 # Cross-refs: genbank:acc:YP_453619;genbank:gi:84662655;genbank:GeneID:5142583 Probab=99.15 E-value=9.9e-10 Score=69.96 Aligned_cols=415 Identities=10% Similarity=0.080 Sum_probs=180.1 Q ss_pred CCccccCCCCceecCceeEEEecCCcceecccCceEEEEEEe-eCCCCCCceEEeeccHHHHHHHcCCCcCcHHHHHHHH Q lcl|NC_019421. 1 MATGTWNEKERKEIPGFYNRFKTQAEKSTNTGLKGRLAMPIR-ANWGDVGKVVTIKNDLRQLKNLFGDDMNYSAFKLGKL 79 (473) Q Consensus 1 m~~g~~~~~~~~~~PGvYie~~~~~~~~i~~~~~~~~~~~g~-a~~Gp~~~~v~i~s~~~~~~~~fG~~~~~~~~~~v~~ 79 (473) |.+---+ -||+..++- .+-.+..++--+.+.. ...=|+++ ++..+..++....||.++ +.+++++. T Consensus 1 m~~ip~s---------~iV~V~~~v-~~~~~~~~~f~~~l~~~~~~~~~~r-~~~y~s~~~V~~~FG~~S--~ey~aA~~ 67 (494) T protein:vir:94 1 MPNIPIS---------QIVSINPQV-VSAGGTQGTLDGLLLTQATGFPVTQ-PQVYFSAADVGTAFGLTS--DEYNAALV 67 (494) T ss_pred CCCCCcc---------cEEEeeeec-cccCCcccccceeEeecCccCCccc-eeeecCHHHHHHhcCCCh--HHHHHHHH Confidence 3332222 255654432 2222333333333333 33446654 555555777888999764 67788888 Q ss_pred HHh----c--CCCEEEEEecCCCcccce------eeeecccccccccceEEEEecCccccceeEEEeeccCCccceeeee Q lcl|NC_019421. 80 ALL----G--NVKELLLYRLVDGNQKKG------TLTLKDTTENSAKDVIKLETKYPTARNFNVTIKSNLVDSDKKDFIF 147 (473) Q Consensus 80 ~f~----~--g~~~v~v~rv~~g~~~aa------t~~l~~~~~~~~~~~l~i~A~~~G~~~n~i~v~~~~~~~~~~~v~v 147 (473) +|. . -.+++++.|-......+. +.++..... -.+.++++-. |. +.....+....+.|. T Consensus 68 yFs~~~~q~p~P~~l~igR~~~~a~~~~l~g~~~~~tl~~~~~--~~g~l~iti~--g~---~~~~~i~lS~~ts~~--- 137 (494) T protein:vir:94 68 YFAGILGGGQQPASLTIGRYASAATSAAVFGAPLTLSLAQLQT--LSGTLIVTTD--TQ---RTSAAINLSGATSFA--- 137 (494) T ss_pred HhhhccCCCccccEEEEEeecCccccceeeccchhhhHHhhhh--cceEEEEEEc--ce---EEEeeecccccCChh--- Confidence 885 2 355799999654321111 011111100 1122222111 00 000000000000000 Q ss_pred ecCCceeeEEEecccchhhhhhhh--------hcccccceeEeecccCCcccccc-----ceeeeccCccc---ccchhh Q lcl|NC_019421. 148 FENTKQLFSSSIKGTIDEIVLEIN--------SNLDNEYVIATKVADSDTILANV-----VNQALEGGNDG---CTSITN 211 (473) Q Consensus 148 ~~~~~~~~~~~~~~~~~~~~~~~~--------~~~~s~~v~~~~~~~~~~~~~~~-----~~~~l~gG~dg---~~~~t~ 211 (473) .....+...+. ....+.++......+........ ....|+.++.. ..+... T Consensus 138 -------------~vA~~i~~ai~~a~~~v~~d~~~~~f~v~s~ttG~~s~is~~t~~~a~~l~lt~~~~a~v~~~g~~a 204 (494) T protein:vir:94 138 -------------NAASLMTSGFTTPNFAITYDAQRRRFVLSTTATGTTASVSAVTGTLADGVGLSTASGAYVEGSGLAA 204 (494) T ss_pred -------------hHHHHHhhhhccccceEEEcccCcEEEEEEccCCceeEEEEeccchhhhhhhhccccceEeecCccc Confidence 00000000000 00001111100000000000000 00111111100 001112 Q ss_pred HHHHHHHHhhccc--ceEEEEEcCCCcHHHHHHHHHHHHHHhhCCCeEEEEEcCCCC----ccHHHHHHhhhccCCceEE Q lcl|NC_019421. 212 ESYLKALEEFERY--SFDSFVLDGVADEALQETTKAWVAKNKELGKDILLFLGGKTE----DNIKQINDKSKSFNDENIV 285 (473) Q Consensus 212 ~d~~~~l~~le~~--~~~~l~~p~~~~~~~~~~l~~~v~~~~~~~~~~~av~~~~~~----~t~~~~~~~~~~~n~~~i~ 285 (473) +...+++.++... +|-.+.+....+.+.+.++.+|++.... ++..+.-..... ..-..+.+.-+..++.|.+ T Consensus 205 et~~~a~~a~~~~~~~Wy~f~~~~~~~~~~ilalA~wiea~~~--~~~~~~~~~d~~~~~~~~~~~i~~~l~~~~y~~t~ 282 (494) T protein:vir:94 205 DTAASALDRLAASSSTWAIFTTAWAASLSDRTALAQWTSDQVF--RRIYAAWDQDAAGLSVNNVSSFGNIVKTTPFSNTI 282 (494) T ss_pred ccHHHHHHHHHhccCceEEEEEecCCCHHHHHHHHHHHhhcCc--cEEEEEecCCcceeecccchhHHHHHHhhcCCceE Confidence 2344566666443 3433333333344555688999875322 222222111100 0111222333445677776 Q ss_pred EecCCceecCcccchHHHHHHHHHhhhcCccc-----ccccee-ccCcccccccCCHHHHHHHHhCCcEEEEEc---CCE Q lcl|NC_019421. 286 NVGSSAYYENIKYTPSEVAVYIAALSVSKGIT-----GSICNA-KTIFEEVEPRLSQSEVKECLKSGTLVLDFD---DGD 356 (473) Q Consensus 286 ~~~~~~~~~~~~~~~~~~a~~vAG~~a~~~~~-----~s~t~~-~~~~~~~~~~~t~~e~~~l~~~G~~~l~~~---~~~ 356 (473) .++.. ..+.+ .+.|..++.+.+ ..++++ .++++.. ..++..|.+.|..+|.+++... ++. T Consensus 283 ~~y~~------~~~~a----a~~g~~aa~~~~~~~g~~T~~~k~q~~gi~~-~~l~~t~a~al~~~~~N~y~~~~~~~~~ 351 (494) T protein:vir:94 283 PVYGL------LANAM----IVLAWGASTNLQIAEGRTTLALRSPVSSAGV-RVDNLANANALLSNGYTYLGKYASATNT 351 (494) T ss_pred EEcCC------CChHH----HHHHHHHhccccccCcceeEEeeccCCCCCC-ccCCHHHHHHHHhcCCeEEEEecccCce Confidence 65532 11222 334445555542 234444 3444332 3477889999999999999654 345 Q ss_pred EEEEecccccccCCCCCcchhhhhhhhH----HHHHHHHHHHHHHhhcCCcccCCHHHHHHHHHHHHHHHHHHHhcCCcc Q lcl|NC_019421. 357 VIIVDDVNTFKKYVDDKNEAMGYISNIM----FINTINKDTSLKRKEFVGKIFNDATGQTTVICALKKYFEELMSQGIIS 432 (473) Q Consensus 357 ~~i~~gi~T~~~~~~~~~~~~~~i~v~R----~~d~i~~~i~~~~~~~ig~~~N~~~~r~~i~~~i~~~l~~l~~~g~i~ 432 (473) ..+..|... .. +|..|-..+ +-++++.++...+.. .+|+|-+..+..+|++.|+..|++-.+.|+|. T Consensus 352 ~~~~~gg~~-sG-------~~~~id~~~~~~WL~~~iq~~l~~ll~~-~~KIPytd~G~~~l~a~i~~~l~~av~nG~I~ 422 (494) T protein:vir:94 352 YTVTYNGAI-GG-------QFLWADTALGWIALRRNLQQALFETLLA-YRSLPYNADGYNALYQGAQDVVSQFVAAGVIR 422 (494) T ss_pred EEEecCcee-cc-------ccceeeeeccHHHHHHHHHHHHHHHHHh-CCCcccChhhHHHHHHHHHHHHHHHHhCceee Confidence 555555432 11 122222222 223444444432322 37999999999999999999999999999995 Q ss_pred Cc----------------------------ccee-c-cccccCCCCCEEEEEEEEEEeeeeeeEEEEEEeC Q lcl|NC_019421. 433 EF----------------------------NVDI-D-TELQATAKADEFYWKWDAVKVDVMKKIYGTGYLG 473 (473) Q Consensus 433 ~~----------------------------~~~~-D-~~~~~~~~~d~~~v~i~v~p~~~~e~i~~t~~v~ 473 (473) .. .+.. | .+.|...++..--+.+-..--.++.+|.|+.++= T Consensus 423 ~Gv~~~~~q~~~i~~~~G~~~~~~~~~kGyy~~~~~~~s~~~ra~R~~~~~~~~y~~~GAIh~v~i~~~~v 493 (494) T protein:vir:94 423 AGVALSASQRAQIDQAAGVPISGDVVDKGWYLQVIDPITTTVRTDRGSPTVNFWYCDGGSIQRVVVSATTV 493 (494) T ss_pred cccccCcchhhhhhhhhcCccccceeccceeeeccCCCChhhhhccccCCceEEEEecCcEEEEEEeeEEe Confidence 31 1111 2 1222223333322334444567777776665544 No 62 >protein:vir:78782 Length: 370 # NCBI annotation: putative tail sheath protein # Family: family:all:669 # MgeID: mge:1857 # MgeName: phiO18P # Cross-refs: genbank:acc:YP_001285652;genbank:gi:148727158;genbank:GeneID:5220132 Probab=99.14 E-value=4.4e-10 Score=71.86 Aligned_cols=328 Identities=15% Similarity=0.069 Sum_probs=179.7 Q ss_pred ecCceeEEEecCCcceecccCceEEEEEEeeCCCCCCceEEeeccHHHHHHHcCCCcCcHHHHHHHHHHh-cCCCEEEEE Q lcl|NC_019421. 13 EIPGFYNRFKTQAEKSTNTGLKGRLAMPIRANWGDVGKVVTIKNDLRQLKNLFGDDMNYSAFKLGKLALL-GNVKELLLY 91 (473) Q Consensus 13 ~~PGvYie~~~~~~~~i~~~~~~~~~~~g~a~~Gp~~~~v~i~s~~~~~~~~fG~~~~~~~~~~v~~~f~-~g~~~v~v~ 91 (473) ..|=|=|+-++.++.++..+.| ...|+|.+.. -.++...+..- +++-.++|.. ++.++.-..+++ |+|..-+.+ T Consensus 1 ~~~~v~vn~~n~~~g~~~~~er-~~lfig~~~~-~~g~~~~~~~~-sdld~~l~~~--ds~lk~~v~aa~~naG~~~~~~ 75 (370) T protein:vir:78 1 MWPYVQIYNLNQMQGPVTEVER-HLLFIGSAAS-NTGKLLSLNAQ-SDFDQLLGAA--DSELKANLLAARDNAGQNWSAA 75 (370) T ss_pred CCceEEEeeccccCCCcCccce-eEEEEecccc-cccceEeecCc-cCHHHhcCCc--ChhHHHHHHHHHhCCCCceEEE Confidence 6688999999999999988888 7899998774 44666667653 4477788754 244454444444 443321110 Q ss_pred ecCCCcccceeeeecccccccccceEEEEecCccccceeEEEeeccCCccceeeeeecCCceeeEEEecccchhhhhhhh Q lcl|NC_019421. 92 RLVDGNQKKGTLTLKDTTENSAKDVIKLETKYPTARNFNVTIKSNLVDSDKKDFIFFENTKQLFSSSIKGTIDEIVLEIN 171 (473) Q Consensus 92 rv~~g~~~aat~~l~~~~~~~~~~~l~i~A~~~G~~~n~i~v~~~~~~~~~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~ 171 (473) - .+.. T Consensus 76 ~-----------------------------------------~p~~---------------------------------- 80 (370) T protein:vir:78 76 A-----------------------------------------YVLP---------------------------------- 80 (370) T ss_pred E-----------------------------------------EEec---------------------------------- Confidence 0 0000 Q ss_pred hcccccceeEeecccCCccccccceeeeccCcccccchhhHHHHHHHHhh-ccc--ceEEEEEcCCCcHHHHHHHHHHHH Q lcl|NC_019421. 172 SNLDNEYVIATKVADSDTILANVVNQALEGGNDGCTSITNESYLKALEEF-ERY--SFDSFVLDGVADEALQETTKAWVA 248 (473) Q Consensus 172 ~~~~s~~v~~~~~~~~~~~~~~~~~~~l~gG~dg~~~~t~~d~~~~l~~l-e~~--~~~~l~~p~~~~~~~~~~l~~~v~ 248 (473) +..++.+|+... +.. ++-.++-| +++.+..+++.+... T Consensus 81 --------------------------------------~~~d~~~Av~~a~~~~s~E~V~v~~~-~s~~a~~~a~~~~a~ 121 (370) T protein:vir:78 81 --------------------------------------TDKPWLDAARDAQQTQSFEGVVVLGQ-EWHQAAINAAHALNQ 121 (370) T ss_pred --------------------------------------CchhHHHHHHHHHhhCCccEEEEecC-cchHHHHHHHHHHHH Confidence 000122222211 122 22223322 234444455555555 Q ss_pred HHhhC-CCeEEEEEcCC---CCccH----HHHHHhhhccCCceEEEecCCceecCcccchHHHHHHHHHhhh--cCcccc Q lcl|NC_019421. 249 KNKEL-GKDILLFLGGK---TEDNI----KQINDKSKSFNDENIVNVGSSAYYENIKYTPSEVAVYIAALSV--SKGITG 318 (473) Q Consensus 249 ~~~~~-~~~~~av~~~~---~~~t~----~~~~~~~~~~n~~~i~~~~~~~~~~~~~~~~~~~a~~vAG~~a--~~~~~~ 318 (473) .+.+. +|.+..++... .++++ ....+....+.++++..|..+.... .+.+||.++ +..+.. T Consensus 122 el~n~~~Rpv~file~~~~~~~e~w~~y~~~l~al~~gia~~~V~vvp~~~g~~---------~G~~aGRL~naavsVad 192 (370) T protein:vir:78 122 ELIAKWGRWQFMLLAVPAIADEQDWATYEAELATLQDGIAASSVSLIPQLWPTL---------AGAYAGRLCNRAVSIAD 192 (370) T ss_pred HHHHhcCCeEEEEEeecCCCCcCCHHHHHHHHHHhhhccccccceEEeeecccc---------HHHHHHHHhcCeeeecc Confidence 44443 45554444322 22332 2344455667778887775542111 234455431 222222 Q ss_pred cc----ceec-----cCcccccccCCHHHHHHHHhCCcEEEEE-cC-CEEEEEecccccccCCCCCcchhhhhhhhHHHH Q lcl|NC_019421. 319 SI----CNAK-----TIFEEVEPRLSQSEVKECLKSGTLVLDF-DD-GDVIIVDDVNTFKKYVDDKNEAMGYISNIMFIN 387 (473) Q Consensus 319 s~----t~~~-----~~~~~~~~~~t~~e~~~l~~~G~~~l~~-~~-~~~~i~~gi~T~~~~~~~~~~~~~~i~v~R~~d 387 (473) ++ +... ++..+....++...++.|-.+|..+++. .+ .++.+-.| +|+ +..+.+|++|..+|++| T Consensus 193 sP~Rv~tG~l~gl~~~p~d~~~~~l~~a~l~aLd~agy~vp~~Y~gy~G~Y~~d~-~tl----~~~gsDYq~ie~~RVvd 267 (370) T protein:vir:78 193 SPCRVKTGALVGLGNKPVGKDGIPLPLATLQTLEANRYSVPMWYPDYDGIYWADG-RTL----DAEGGDYQVIENLRIAY 267 (370) T ss_pred cceeeeccccccccccccccCCcccCHHHHHHHHhCCCeEEEeeCCCCceEEeCc-eEe----ccCCCChhhhhhhhHHH Confidence 22 1111 2222233457888999999999998864 43 34555444 443 34567899999999999 Q ss_pred HHHHHHHH-HHhhcCCcccCCHHH-HHHHHHHHHHHHHHHHhcCCccC--ccceeccccc-----cCCCCCEEEEEEEEE Q lcl|NC_019421. 388 TINKDTSL-KRKEFVGKIFNDATG-QTTVICALKKYFEELMSQGIISE--FNVDIDTELQ-----ATAKADEFYWKWDAV 458 (473) Q Consensus 388 ~i~~~i~~-~~~~~ig~~~N~~~~-r~~i~~~i~~~l~~l~~~g~i~~--~~~~~D~~~~-----~~~~~d~~~v~i~v~ 458 (473) -..+.+|. .+.+...+.-|+.++ .+..+.....-|++|.+.+.|-. |...+.+... .-..+..+.|.+.++ T Consensus 268 Ka~R~vR~~ai~~i~D~~lnst~gsia~~~~~~~~~L~ema~s~~i~~~~fpgeI~~p~d~Di~i~w~s~~~v~I~~~v~ 347 (370) T protein:vir:78 268 KVARRMRLRAIARIGDRSFNSTPGSTAAAITYFGKDLREMAKSTTINGQPFPGDIASPQDGDIRIQWVAKNLVSVFVVVR 347 (370) T ss_pred HHHHHHHHHHHHHhCCcccCCCCcchhHHHHHHHhhHHHHHhhhhhcccccceeEeccCCCcceEEeeccceEEEEEEEE Confidence 99999984 444333333344443 23333444445666666777643 4333322111 112344578888999 Q ss_pred EeeeeeeEEEEEEeC Q lcl|NC_019421. 459 KVDVMKKIYGTGYLG 473 (473) Q Consensus 459 p~~~~e~i~~t~~v~ 473 (473) |..+..+|...|.+- T Consensus 348 P~~~pk~Itv~I~LD 362 (370) T protein:vir:78 348 TVDCPKGITVNIMLD 362 (370) T ss_pred eccCCceEEEEEEEe Confidence 999999998888887 No 63 >protein:vir:106730 Length: 501 # NCBI annotation: gp04 # Family: family:all:396 # MgeID: mge:1599 # MgeName: Bcep1 # Cross-refs: genbank:acc:NP_944312;genbank:gi:38638611;genbank:GeneID:2657359 Probab=98.99 E-value=6.9e-09 Score=65.33 Aligned_cols=418 Identities=11% Similarity=0.024 Sum_probs=189.0 Q ss_pred CCceecC-ceeEEEecCCcceecccCceEEE-EEEeeCCCCCCceEEeeccHHHHHHHcCCCcCcHHHHHHHHHHh---- Q lcl|NC_019421. 9 KERKEIP-GFYNRFKTQAEKSTNTGLKGRLA-MPIRANWGDVGKVVTIKNDLRQLKNLFGDDMNYSAFKLGKLALL---- 82 (473) Q Consensus 9 ~~~~~~P-GvYie~~~~~~~~i~~~~~~~~~-~~g~a~~Gp~~~~v~i~s~~~~~~~~fG~~~~~~~~~~v~~~f~---- 82 (473) .++..+| -.||+..++- .+-.+..++--+ |++....=|++++ +..+..++....||..+ +.+++++.+|. T Consensus 1 m~~~~ip~s~iV~V~~~v-~~~~~~~~~f~~lll~~~~~~~~~r~-~~y~s~~~V~~~FG~~S--~ey~aA~~yFsg~~~ 76 (501) T protein:vir:10 1 MPTTTIPIDQIVQMLPGV-IGAGGAPGRLTGLVLTQDTSVQPGQL-ADFFQKTDVENWFGALS--NEAKIADAYFPGIVN 76 (501) T ss_pred CCcCccccceEEEEeeec-ccCCCcccccceEEEecccCCCccce-eeecCHHHHHHhcCCCh--HHHHHHHHHhhhhcC Confidence 3433343 3566665442 222233333333 3333444588765 44455677899999764 67788888884 Q ss_pred c--CCCEEEEEecCCCcccceee--eeccc---ccccccceEEEEecCccccceeEEEeeccCCccce------------ Q lcl|NC_019421. 83 G--NVKELLLYRLVDGNQKKGTL--TLKDT---TENSAKDVIKLETKYPTARNFNVTIKSNLVDSDKK------------ 143 (473) Q Consensus 83 ~--g~~~v~v~rv~~g~~~aat~--~l~~~---~~~~~~~~l~i~A~~~G~~~n~i~v~~~~~~~~~~------------ 143 (473) . -.+++|+.|-......+... .+... .-....+.|+++-... +.....+....+.| T Consensus 77 q~p~P~~l~igR~~~~~~~~~l~g~~l~~~~la~~~~~~g~l~i~i~g~-----~~~~~i~~s~ats~~~vA~~i~~al~ 151 (501) T protein:vir:10 77 GGQLPYDLKFARYVAADAPASVYGIPLTGITLAQLQGYSGTLTVTTAAQ-----HVSANISLAAATSFANAATLIEAAFT 151 (501) T ss_pred CCccccEEEEEeecccCccceeeeceehhhhhhhhhheeeEEEEeeccc-----eeeeccccccccCHHHHHHHHHHhhc Confidence 1 35689999965422111100 00000 0000112333321110 00000000000000 Q ss_pred --eeeeecCCceeeEEEecccchhhhhhhhhcccccceeEeecccCCccccccceeeeccCcccc---cchhhHHHHHHH Q lcl|NC_019421. 144 --DFIFFENTKQLFSSSIKGTIDEIVLEINSNLDNEYVIATKVADSDTILANVVNQALEGGNDGC---TSITNESYLKAL 218 (473) Q Consensus 144 --~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~v~~~~~~~~~~~~~~~~~~~l~gG~dg~---~~~t~~d~~~~l 218 (473) .+.+..+... ..+.+...... ..+ .+... .++ +.++ ....|+.++-.. .+...+...+++ T Consensus 152 ~~~~tv~~d~~~-~~f~i~~~t~G--------~~~-~i~~~--t~~-~d~a--~~l~Lt~~~~a~v~~~g~~aet~~~Al 216 (501) T protein:vir:10 152 SPDFVVAYDALR-NRFTVVTNTTG--------TAA-AISAV--TGT-NNLA--DELGLSAAAGATLQAAGVAADTPASAM 216 (501) T ss_pred CCceEEEEeccc-ceEEEEecccC--------cce-eEEEe--ecc-ccch--hhhcccccCceeEEecCcccccHHHHH Confidence 0000000000 00000000000 000 00000 000 0000 001122111000 001112234566 Q ss_pred Hhhcccc--eEEEEEcCCCcHHHHHHHHHHHHHHhhCCCeEEEEEcCCCCc-----cHHHHHHhhhccCCceEEEecCCc Q lcl|NC_019421. 219 EEFERYS--FDSFVLDGVADEALQETTKAWVAKNKELGKDILLFLGGKTED-----NIKQINDKSKSFNDENIVNVGSSA 291 (473) Q Consensus 219 ~~le~~~--~~~l~~p~~~~~~~~~~l~~~v~~~~~~~~~~~av~~~~~~~-----t~~~~~~~~~~~n~~~i~~~~~~~ 291 (473) .++.... |-.+......+++.+.++.+|++... ++++.+....... .-..+.+.-...++.|.+.++.. T Consensus 217 ~a~~~~~~~Wy~f~~a~~~~~~~~la~A~wi~a~~---~~f~~~~~~~~~~~~~~~~~~~i~~~l~~~~y~~t~~~y~~- 292 (501) T protein:vir:10 217 NRAVGLSRNWATFTTAWTAVIADRLAFAAWNSGQA---YKYMYVAPDLEAASIVTNNAASFGAQVFAAPYQGTLPLYGD- 292 (501) T ss_pred HHHHhcccceEEEEEEecCChHHHHHHHHHHHhcC---ceEEEEEecCcceeeecccchhHHHHHHhcCCCceEEECCC- Confidence 6665543 33233323344555568888887432 3332222111100 00112222333466776655421 Q ss_pred eecCcccchHHHHHHHHHhhhcCcccc---ccc--eecc-CcccccccCCHHHHHHHHhCCcEEEEEc---CCEEEEE-e Q lcl|NC_019421. 292 YYENIKYTPSEVAVYIAALSVSKGITG---SIC--NAKT-IFEEVEPRLSQSEVKECLKSGTLVLDFD---DGDVIIV-D 361 (473) Q Consensus 292 ~~~~~~~~~~~~a~~vAG~~a~~~~~~---s~t--~~~~-~~~~~~~~~t~~e~~~l~~~G~~~l~~~---~~~~~i~-~ 361 (473) ..+ ++.+-|..++.++++ +.| ++.+ +++. ...++..|.+.|..+|.+++... ++...+. + T Consensus 293 -----~~~----~aa~~g~~as~nf~~~~g~~T~~fkql~~Gv~-a~~l~~t~a~al~~~~~N~y~~~~~~~~~~~~~~~ 362 (501) T protein:vir:10 293 -----QAT----AGAVMGYAASINFQLRNGRTVLAFRQFNAGVP-ATAHDLPTANALRSNNYTYIGAYANAANNYTIAYD 362 (501) T ss_pred -----CCH----HHHHHHHHHhcCcccCcceeeeeecccCCCcC-cccCCHHHHHHHHhcCCeEEEEEecccceeeEEEc Confidence 112 234455555655543 233 3443 3333 23588899999999999977432 3445543 4 Q ss_pred cccccccCCCCCcchhhhhhhhHHHHHHHHHHHHHH-h--hcCCcccCCHHHHHHHHHHHHHHHHHHHhcCCccCc---- Q lcl|NC_019421. 362 DVNTFKKYVDDKNEAMGYISNIMFINTINKDTSLKR-K--EFVGKIFNDATGQTTVICALKKYFEELMSQGIISEF---- 434 (473) Q Consensus 362 gi~T~~~~~~~~~~~~~~i~v~R~~d~i~~~i~~~~-~--~~ig~~~N~~~~r~~i~~~i~~~l~~l~~~g~i~~~---- 434 (473) |+-+ -+|..|.+.+-.|.+...++..+ + .=.+|+|-+..+..+|++.|+..|++-.+.|.|..- T Consensus 363 G~~s---------G~~~wiD~~~g~dWl~~~iq~~l~~ll~~~~kIPyt~~G~~~l~a~i~~~l~~av~nG~Ia~Gv~l~ 433 (501) T protein:vir:10 363 GKLS---------GKFLWVDTYLDQIYLNAELQRAEFEAMLAYNSLPYNEDGYTANYRAGVDVIDAAVTSGIIRAGVTLT 433 (501) T ss_pred ceee---------ccceehhhHhhHHHHHHHHHHHHHHHHhcCCCcccCHHHHHHHHHHHHHHHHHHHhCcceecCcccC Confidence 5322 13555777777777777765322 2 224799999999999999999999999999999532 Q ss_pred -------------------------cceecccc---ccCCCCCEEEEEEEEEEeeeeeeEEEEEEeC Q lcl|NC_019421. 435 -------------------------NVDIDTEL---QATAKADEFYWKWDAVKVDVMKKIYGTGYLG 473 (473) Q Consensus 435 -------------------------~~~~D~~~---~~~~~~d~~~v~i~v~p~~~~e~i~~t~~v~ 473 (473) .+.+++.. +....+...-+.+.++--.++.++.|.-+.= T Consensus 434 ~~q~~~i~~~~g~~~~~~~v~~~Gyy~~~~~~~~~~~~R~~R~~p~~~~~y~~~gaIh~v~i~s~~v 500 (501) T protein:vir:10 434 NSQLQQIDAVAGVAGAGALVQTRGWYFLIGNPANPGQARQNRTSPACTLWYSDGGSIQELTIGSNAV 500 (501) T ss_pred cccceeecccccccccccceeccceEEeeCcccCChhhhhhcccCceEEEEEeCCceeEEEeeeeec Confidence 12222111 1112233455666777777777775432222 No 64 >protein:vir:78611 Length: 501 # NCBI annotation: BcepNY3gp03 # Family: family:all:396 # MgeID: mge:1854 # MgeName: BcepNY3 # Cross-refs: genbank:acc:YP_001294840;genbank:gi:149882903;genbank:GeneID:5291082 Probab=98.98 E-value=7.7e-09 Score=65.07 Aligned_cols=409 Identities=10% Similarity=0.044 Sum_probs=185.0 Q ss_pred CCceecC-ceeEEEecCCcceecccCceEEEEEEeeC-CCCCCceEEeeccHHHHHHHcCCCcCcHHHHHHHHHHh---- Q lcl|NC_019421. 9 KERKEIP-GFYNRFKTQAEKSTNTGLKGRLAMPIRAN-WGDVGKVVTIKNDLRQLKNLFGDDMNYSAFKLGKLALL---- 82 (473) Q Consensus 9 ~~~~~~P-GvYie~~~~~~~~i~~~~~~~~~~~g~a~-~Gp~~~~v~i~s~~~~~~~~fG~~~~~~~~~~v~~~f~---- 82 (473) .++..+| -.||+..++- .+-.+..+...+++...+ .=|+++ ++..+..++....||.++ +.+++++.+|. T Consensus 1 m~~~~ip~s~iV~V~~~v-~~~~~~~~~~~~lll~~~~~~~~~r-~~~y~s~~~V~~~FG~~S--~ey~aA~~yFs~~~~ 76 (501) T protein:vir:78 1 MPTTTIPIDQIVQMLPGV-IGAGGAPGRLTGLVLTQDTSIQPGQ-LADFFQKTDVENWFGGLS--NEAVIADAYFPGIVN 76 (501) T ss_pred CCcCccccceEEEEeeec-ccCCCcceeeeeEEEecCCCCCccc-eeeecCHHHHHHhcCCCh--HHHHHHHHHhhcCCC Confidence 3433343 3566665442 222333333334444433 347765 455555777889999764 67888888885 Q ss_pred c--CCCEEEEEecCCCcccceeee-------ecccccccccceEEEEecCccccceeEEEeeccCCccc----------- Q lcl|NC_019421. 83 G--NVKELLLYRLVDGNQKKGTLT-------LKDTTENSAKDVIKLETKYPTARNFNVTIKSNLVDSDK----------- 142 (473) Q Consensus 83 ~--g~~~v~v~rv~~g~~~aat~~-------l~~~~~~~~~~~l~i~A~~~G~~~n~i~v~~~~~~~~~----------- 142 (473) . -.+++++.|-......+.... +... ....+.|+++-.. + ......+...... T Consensus 77 q~~~P~~l~igR~~~~a~~~~l~g~~l~~~~la~~--~~~~G~l~iti~g--~---~~~~~i~~S~~ts~~~vA~~i~~a 149 (501) T protein:vir:78 77 GGQLPYDLKFARYVAADAPASVYGIPLTGVTLTQL--QGYSGTLTVTTAA--Q---HVSSNISLAAATSFANAATLIEAA 149 (501) T ss_pred CCcccceEEEEeecccCcceeEeccceeccchhhh--ceeeeEEEEEecc--c---eeeeccccccccCHHHHHHHHHhh Confidence 2 245789999654322111100 0000 0011233332111 0 0000000000000 Q ss_pred ---eeeeeecCCceeeEEEecccchhhhhhhhhcccccceeEeecccCCcccccc-------ceeeeccCccc---ccch Q lcl|NC_019421. 143 ---KDFIFFENTKQLFSSSIKGTIDEIVLEINSNLDNEYVIATKVADSDTILANV-------VNQALEGGNDG---CTSI 209 (473) Q Consensus 143 ---~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~v~~~~~~~~~~~~~~~-------~~~~l~gG~dg---~~~~ 209 (473) ..+.+..+. ..+.++......+....+... ....|+.++.. .... T Consensus 150 l~a~~~tv~~ds----------------------~~~~f~its~t~G~~~~i~~~t~~~~~a~~l~Lt~~~~a~v~~~g~ 207 (501) T protein:vir:78 150 FTSPDFVVSYDA----------------------LRNRFVVNTNATGTAAAISAVTGTNNLADELGLSAAAGASLQAAGV 207 (501) T ss_pred hcCcceEEEEcc----------------------ccceEEEEeeecCCceeEEEEecccchhhhhcccccCceeeEeccc Confidence 000000000 001111100001100000000 00111111100 0001 Q ss_pred hhHHHHHHHHhhcccc--eEEEEEcCCCcHHHHHHHHHHHHHHhhCCCeEEEEEc-CCCCcc----HHHHHHhhhccCCc Q lcl|NC_019421. 210 TNESYLKALEEFERYS--FDSFVLDGVADEALQETTKAWVAKNKELGKDILLFLG-GKTEDN----IKQINDKSKSFNDE 282 (473) Q Consensus 210 t~~d~~~~l~~le~~~--~~~l~~p~~~~~~~~~~l~~~v~~~~~~~~~~~av~~-~~~~~t----~~~~~~~~~~~n~~ 282 (473) ..+...+++.++.... |-.+......+++.+.++.+|++.. .++++.+.. ...... -..+.+.-.+.++. T Consensus 208 ~aet~~~a~~a~~~~~~~Wy~f~~a~~~~~~~~lalA~wiea~---~~~f~~~~~~~~~~~~~~~~~~~i~~~l~a~~y~ 284 (501) T protein:vir:78 208 AADTPASAMNRAVGLSRNWATFTTAWTAVIADRLALASWNSGQ---AYKYMYVAPDLEPASIVTNNSASFGAQVFAAPYQ 284 (501) T ss_pred cccCHHHHHHHHHhccCceEEEEEecCCCHHHHHHHHHHHHhc---CceEEEEEecCCcceeecccchhHHHHHhhcCCC Confidence 1122345666665443 3323322233455556888898743 223322221 110000 01122222334666 Q ss_pred eEEEecCCceecCcccchHHHHHHHHHhhhcCcccc---ccc--eecc-CcccccccCCHHHHHHHHhCCcEEEEEc--- Q lcl|NC_019421. 283 NIVNVGSSAYYENIKYTPSEVAVYIAALSVSKGITG---SIC--NAKT-IFEEVEPRLSQSEVKECLKSGTLVLDFD--- 353 (473) Q Consensus 283 ~i~~~~~~~~~~~~~~~~~~~a~~vAG~~a~~~~~~---s~t--~~~~-~~~~~~~~~t~~e~~~l~~~G~~~l~~~--- 353 (473) |.+.++. ..+.+ +.+.|..++.+.++ +.| ++.+ +++.. ..++..|.+.|..+|.+.+... T Consensus 285 ~t~~~y~------~~~~~----aa~~g~~as~nf~~~~g~~T~~fkq~~~Gv~a-~~l~~t~a~al~~~~~N~y~~~~~~ 353 (501) T protein:vir:78 285 GTLPLYG------DQATA----GAVMGYAASINFQLRNGRTVLAFRQFNAGVPA-TAHDLGTANALRSNNYTYIGAYANA 353 (501) T ss_pred ceEEEcC------CcchH----HHHHHHHHhcCcccCcceeeeeccccCCCcCc-ccCCHHHHHHHHhcCCeEEEEEecc Confidence 6665542 12222 33445555555443 223 3343 33332 3588899999999999987532 Q ss_pred CCEEEEE-ecccccccCCCCCcchhhhhhhhHHHHHHHHHHHHHH-h--hcCCcccCCHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_019421. 354 DGDVIIV-DDVNTFKKYVDDKNEAMGYISNIMFINTINKDTSLKR-K--EFVGKIFNDATGQTTVICALKKYFEELMSQG 429 (473) Q Consensus 354 ~~~~~i~-~gi~T~~~~~~~~~~~~~~i~v~R~~d~i~~~i~~~~-~--~~ig~~~N~~~~r~~i~~~i~~~l~~l~~~g 429 (473) ++.+.+. +|.-+ -+|..|.+.+-.|++...++..+ + .=.+|+|-+..+..+|++.|+..|++-.+.| T Consensus 354 ~~~~~~~~~G~~s---------G~~~wiD~~~~~~Wl~~~iq~~l~~ll~~~~kIPyt~~G~~~l~a~v~~~l~~av~nG 424 (501) T protein:vir:78 354 ANNYTIAYDGKLS---------GKFLWVDTYLDQIYLNAELQRAEFEAMLAYNSLPYNEDGYTALYRAGVDVIDAAVTSG 424 (501) T ss_pred cceeeEEEcCeee---------ccceeehhhhhHHHHHHHHHHHHHHHHHhCCCcccCHHHHHHHHHHHHHHHHHHHhCc Confidence 3445543 45322 13555666666666665554222 1 2247999999999999999999999999999 Q ss_pred CccCc-----------------------------cceecccc---ccCCCCCEEEEEEEEEEeeeeeeEEEEEEeC Q lcl|NC_019421. 430 IISEF-----------------------------NVDIDTEL---QATAKADEFYWKWDAVKVDVMKKIYGTGYLG 473 (473) Q Consensus 430 ~i~~~-----------------------------~~~~D~~~---~~~~~~d~~~v~i~v~p~~~~e~i~~t~~v~ 473 (473) .|..- .+.+++.. +....+...-+.+.++--.++.+|.|.-+.= T Consensus 425 ~I~~Gv~l~~~q~~~I~~~~g~~~~~~~~~~~Gyy~~~~~~~~~~~~R~~R~~p~~~~~y~~~gaIh~v~i~s~~v 500 (501) T protein:vir:78 425 IIRAGVTLTNSQLQQIDAAAGVAGAGQLVQMRGWYFLIGDPANPGQARQNRTTPTCTLWYSDGGSIQELTIGSNAV 500 (501) T ss_pred eeecCCCCCCccceeeccccCccccccceeccceEEeeccccCChhhhhhcccCcEEEEEEeCCceeEEEeeeeec Confidence 99531 11221111 1112233355666677777777775432222 No 65 >protein:vir:99586 Length: 507 # NCBI annotation: hypothetical protein # Family: family:all:396 # MgeID: mge:1544 # MgeName: BcepF1 # Cross-refs: genbank:acc:YP_001039795;genbank:gi:126011045;genbank:GeneID:4818281 Probab=98.83 E-value=3.4e-08 Score=61.51 Aligned_cols=416 Identities=11% Similarity=0.022 Sum_probs=190.5 Q ss_pred eecCceeEEEecCCcce-ecccCceEEEEEEeeCCCCCCceEEeeccHHHHHHHcCCCcCcHHHHHHHHHHhcC------ Q lcl|NC_019421. 12 KEIPGFYNRFKTQAEKS-TNTGLKGRLAMPIRANWGDVGKVVTIKNDLRQLKNLFGDDMNYSAFKLGKLALLGN------ 84 (473) Q Consensus 12 ~~~PGvYie~~~~~~~~-i~~~~~~~~~~~g~a~~Gp~~~~v~i~s~~~~~~~~fG~~~~~~~~~~v~~~f~~g------ 84 (473) -+-=--||+..++-... .....-+...|++.-..=|+++. +..+..++....||..+ +.+++++.+|.+- T Consensus 1 mip~s~iVnV~~~v~~~a~~~~~~~~~lilt~~~~~~~~r~-~~y~s~~~V~~~FG~~S--~ey~aA~~yFsq~p~~~~~ 77 (507) T protein:vir:99 1 MISQSRYVRIVSGVGAGAPVAQRRLIMRVMTTNAVLPPGVV-FESSSADAVGAYFGMAS--EEYKRAKAYMSFISKSINS 77 (507) T ss_pred CCCccceeEEeeeccccCcccccccceeeeccccCCCccce-EeecCHHHHHHhcCCCh--HHHHHHHHHhccCCCCCcc Confidence 11123466665543222 12223356677766555577765 44455777888999764 6788888888642 Q ss_pred CCEEEEEecCCCcccceeeeeccccc--------ccccceEEEEecCccccceeEEEe-eccCCc--------------- Q lcl|NC_019421. 85 VKELLLYRLVDGNQKKGTLTLKDTTE--------NSAKDVIKLETKYPTARNFNVTIK-SNLVDS--------------- 140 (473) Q Consensus 85 ~~~v~v~rv~~g~~~aat~~l~~~~~--------~~~~~~l~i~A~~~G~~~n~i~v~-~~~~~~--------------- 140 (473) .+++++.|-..... ++. +.+... .-..+.|+++-. |...++. .+.... T Consensus 78 P~~L~igR~~~~~~-~a~--l~g~~~~~~l~~~~~~~~G~lti~v~-----G~~~t~~~i~lS~~ts~~~vAs~i~~~l~ 149 (507) T protein:vir:99 78 PSYISFARWVNAAI-ASM--IVGDSLVKNLPALKAVATPTLSLSIG-----GTVVPIAGIDLTAALTLTDVAATLQTKIR 149 (507) T ss_pred cceEEEEeecCccc-cce--eecchhhhhHHHHhhhcceeEEEEEc-----CceeEeccccccccCCHHHHHHHHHHhhh Confidence 55899999754221 111 111000 001122322211 0011110 000000 Q ss_pred -----cceeeeeecCCceeeEEEecccchhhhhhhhhcccccceeEeecccCCcccc-----ccceeeeccCcccccchh Q lcl|NC_019421. 141 -----DKKDFIFFENTKQLFSSSIKGTIDEIVLEINSNLDNEYVIATKVADSDTILA-----NVVNQALEGGNDGCTSIT 210 (473) Q Consensus 141 -----~~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~v~~~~~~~~~~~~~-----~~~~~~l~gG~dg~~~~t 210 (473) ..+.+.+..+... -.+.+...... ...-+............. .........|.+. T Consensus 150 a~~~~~~~~~tv~~d~~~-~~F~v~s~~tG---------~~s~i~~at~~~~gt~~s~l~~~~~~~a~~~~g~~a----- 214 (507) T protein:vir:99 150 ASANAELATATVTFNTTT-NQFVLNGTTTG---------ALAPTITAVRTDPATDISSLLGWTNTGTVFVKGQAA----- 214 (507) T ss_pred ccccccccceEEEEecCC-ceEEEEeeecc---------ccceeEEEEcCCchhhHHHHhccccccceEeecccc----- Confidence 0001111111000 00000000000 000000000000000000 0001111222221 Q ss_pred hHHHHHHHHhhccc--ceEEEEEcCC--CcHHHHHHHHHHHHHHhhCCCeEEEEEcCCCCccHHHHHHhhhccCCceEEE Q lcl|NC_019421. 211 NESYLKALEEFERY--SFDSFVLDGV--ADEALQETTKAWVAKNKELGKDILLFLGGKTEDNIKQINDKSKSFNDENIVN 286 (473) Q Consensus 211 ~~d~~~~l~~le~~--~~~~l~~p~~--~~~~~~~~l~~~v~~~~~~~~~~~av~~~~~~~t~~~~~~~~~~~n~~~i~~ 286 (473) +...++++++... +|-.+++... -.++.+.++.+|++.. .++++-+...... ......... ...-.... T Consensus 215 -et~~~a~~a~~~~~~nW~~~~~a~~~~~td~~~lalA~wiea~---~~~f~~~~~~~~a-~~~~~~~~~--~~~~~~~~ 287 (507) T protein:vir:99 215 -ETPDTSISKSAAISTNFGSFIYTSTPALTNDQITAVASWNASQ---NNMYMYSVPTTIA-NIGTLYAAV--KGFSGCAL 287 (507) T ss_pred -cCHHHHHHHHHhhcCCeEEEEEEeccccChHHHHHHHHHHhhc---CcEEEEEEecCch-hhhhhhhhh--hhcceeEE Confidence 2234566666443 3333332211 1234456788888743 2333322222111 111111111 11111111 Q ss_pred ecCCceecCcccchHHHHHHHHHhhhcCcccc-----ccceeccCcccccccCCHHHHHHHHhCCcEEEEEc---CCEEE Q lcl|NC_019421. 287 VGSSAYYENIKYTPSEVAVYIAALSVSKGITG-----SICNAKTIFEEVEPRLSQSEVKECLKSGTLVLDFD---DGDVI 358 (473) Q Consensus 287 ~~~~~~~~~~~~~~~~~a~~vAG~~a~~~~~~-----s~t~~~~~~~~~~~~~t~~e~~~l~~~G~~~l~~~---~~~~~ 358 (473) .. .....+....++.+.|.+++.+.++ ++-++.++++... .++..|.+.|.++|.+++..- ++... T Consensus 288 ~~-----~~~~~~~~~~~aa~~g~~as~nf~~~ng~~T~~fk~l~GV~a~-~lt~t~a~al~~~n~N~y~~~a~~~~~~~ 361 (507) T protein:vir:99 288 NI-----TSDSLPVDYIEQSPCEILAATDYTRVNATQNYMYYQFPSRNIT-VSDDTTANLVDANRGNYIGQTQSAGQSLA 361 (507) T ss_pred Ee-----ecccccchhHHHHHHHHHHhhccCcCccceeecccccCCcccc-cCCHHHHHHHHhcCCeEEEEeccccceee Confidence 11 0111222223455566666665433 3344566666544 589999999999999998644 23444 Q ss_pred E-EecccccccCCCCCcc-hhhhhhhhHHHHHHHHHHHHHH-hh--cCCcccCCHHHHHHHHHHHHHHHHHHHhcCCccC Q lcl|NC_019421. 359 I-VDDVNTFKKYVDDKNE-AMGYISNIMFINTINKDTSLKR-KE--FVGKIFNDATGQTTVICALKKYFEELMSQGIISE 433 (473) Q Consensus 359 i-~~gi~T~~~~~~~~~~-~~~~i~v~R~~d~i~~~i~~~~-~~--~ig~~~N~~~~r~~i~~~i~~~l~~l~~~g~i~~ 433 (473) + -+|+.+- ++ +|..+.+.+-.++|...++..+ +- =.+|+|-+..+..+|++.|+..|++-.+.|.|.. T Consensus 362 ~~~~G~~~g-------G~~~fid~d~~~~~~WL~~~iq~~l~~l~~~~~kIPyt~~G~~~l~a~i~~~l~~av~nG~I~~ 434 (507) T protein:vir:99 362 FYQRGILCG-------GPNDAVDMNIYANEIWLKSAISAQILSLFLNVPRVPANETGESMLLSVIQSVVNTAKNNGTISA 434 (507) T ss_pred EEecCeeeC-------CcccceeeeeecchHHHHHHHHHHHHHHHhcCCCCccChhhHHHHHHHHHHHHHHHHhcccccc Confidence 4 4565441 22 4666655555555555554222 11 2479999999999999999999999999999953 Q ss_pred c-----------------------------cceecc-ccccCC---CCCEEEEEEEEEEeeeeeeEEEEEEeC Q lcl|NC_019421. 434 F-----------------------------NVDIDT-ELQATA---KADEFYWKWDAVKVDVMKKIYGTGYLG 473 (473) Q Consensus 434 ~-----------------------------~~~~D~-~~~~~~---~~d~~~v~i~v~p~~~~e~i~~t~~v~ 473 (473) - .+.+++ +.+.+. .+....+.+-++--.++.+|-++-++= T Consensus 435 Gvtl~~~q~~~in~~~~~~~~~~~~~~~Gyy~~~~~~s~~~~~~r~~r~~~~~~~~y~~~gaI~~v~~~~~~v 507 (507) T protein:vir:99 435 GKNLNVIQQQYITQISGDANAWRQVANIGYWLNITFSSYTNPNTQLTEWKASYQLIYSKDDAIRFVEGTDTLI 507 (507) T ss_pred CCcccccchheecccccccccccceeccceEEEeCChHhcChhhhhccccceEEEEEEeCCeEEEEEeeeecC Confidence 2 222221 122222 333567777788888888887766555 No 66 >protein:vir:276 Length: 369 # NCBI annotation: putative tail sheath protein # Family: family:all:669 # MgeID: mge:7 # MgeName: K139 # Cross-refs: genbank:acc:NP_536656;genbank:gi:17975134;genbank:GeneID:929090 Probab=98.82 E-value=3.5e-08 Score=61.43 Aligned_cols=326 Identities=12% Similarity=0.085 Sum_probs=178.7 Q ss_pred eecCceeEEEecCCcceecccCceEEEEEEeeC-CCCCCceEEeeccHHHHHHHcCCCcCcHHHHH-HHHHHhcCCCEEE Q lcl|NC_019421. 12 KEIPGFYNRFKTQAEKSTNTGLKGRLAMPIRAN-WGDVGKVVTIKNDLRQLKNLFGDDMNYSAFKL-GKLALLGNVKELL 89 (473) Q Consensus 12 ~~~PGvYie~~~~~~~~i~~~~~~~~~~~g~a~-~Gp~~~~v~i~s~~~~~~~~fG~~~~~~~~~~-v~~~f~~g~~~v~ 89 (473) -..|=|=|+-.+.++-++..+.+ .+.|+|... ..-.|+...+..- +++..++|.. ++.++. +..|..|+|..-. T Consensus 1 m~~~~V~in~~n~~qg~~~~ver-~~lfig~g~~~~~~g~~~~~~~~-sdld~~lg~~--ds~lk~~v~aa~~naG~~w~ 76 (369) T protein:vir:27 1 MAWPTVIIKILNLMNGPIADIEC-HFLFVIRGTVSGEVRNLIMVDST-SDLDDVLAEA--SAEGLAIVKAAQLNGKQAWT 76 (369) T ss_pred CCCCceEEecccccCCCcccccc-eEEEEEeccccccccceEEecCc-cchHhhcCCc--ChhHHHHHHHHHhCCCCceE Confidence 34566778888888888877787 889996542 2345566777653 4578888864 233454 4444444443211 Q ss_pred E--EecCCCcccceeeeecccccccccceEEEEecCccccceeEEEeeccCCccceeeeeecCCceeeEEEecccchhhh Q lcl|NC_019421. 90 L--YRLVDGNQKKGTLTLKDTTENSAKDVIKLETKYPTARNFNVTIKSNLVDSDKKDFIFFENTKQLFSSSIKGTIDEIV 167 (473) Q Consensus 90 v--~rv~~g~~~aat~~l~~~~~~~~~~~l~i~A~~~G~~~n~i~v~~~~~~~~~~~v~v~~~~~~~~~~~~~~~~~~~~ 167 (473) . ..+.. .+ T Consensus 77 a~~~p~~~----------------------------~~------------------------------------------ 86 (369) T protein:vir:27 77 AGVMILSE----------------------------ED------------------------------------------ 86 (369) T ss_pred EEEEEeCC----------------------------ch------------------------------------------ Confidence 0 00000 00 Q ss_pred hhhhhcccccceeEeecccCCccccccceeeeccCcccccchhhHHHHHHHH---hhcccceEEEEEcCCCcHHHHHHHH Q lcl|NC_019421. 168 LEINSNLDNEYVIATKVADSDTILANVVNQALEGGNDGCTSITNESYLKALE---EFERYSFDSFVLDGVADEALQETTK 244 (473) Q Consensus 168 ~~~~~~~~s~~v~~~~~~~~~~~~~~~~~~~l~gG~dg~~~~t~~d~~~~l~---~le~~~~~~l~~p~~~~~~~~~~l~ 244 (473) ++.+|+. ..-...+-.++-| +++.+...+++ T Consensus 87 ---------------------------------------------~~~~Av~~a~~~~s~E~V~v~~p-~t~~a~i~aaq 120 (369) T protein:vir:27 87 ---------------------------------------------NWQDAVKKANEVSSFEFVVLGFD-AETKAMIEDAI 120 (369) T ss_pred ---------------------------------------------hHHHHHHhhhhhCCccEEEEecC-cccHHHHHHHH Confidence 0111111 1112223333333 23323333333 Q ss_pred HHHHHHhh-CCCeEEEEEcC-------CCCccH----HHHHHhhhccCCceEEEecCCceecCcccchHHHHHHHHHhhh Q lcl|NC_019421. 245 AWVAKNKE-LGKDILLFLGG-------KTEDNI----KQINDKSKSFNDENIVNVGSSAYYENIKYTPSEVAVYIAALSV 312 (473) Q Consensus 245 ~~v~~~~~-~~~~~~av~~~-------~~~~t~----~~~~~~~~~~n~~~i~~~~~~~~~~~~~~~~~~~a~~vAG~~a 312 (473) +....+.. -+|.+..++.. ..++++ ....+....+.++++..|..... .+ ...|-+||.++ T Consensus 121 ~~a~el~~~~~R~vffi~e~~~~~~~~~~~e~w~dy~a~l~al~~g~a~~~V~vv~~~~~-~g------n~~G~~aGRl~ 193 (369) T protein:vir:27 121 TLRTELKNSLGREVGVLCQLPAINNDPTNGQTWSEWLADTVDIPKDVASEYISVVPNVHA-AG------DTLGKYAGRLA 193 (369) T ss_pred HHHHHHHHhcCCeEEEEEeccccCCCccccCCHHHHHHHHHHHhhccCcccceeeeeecc-cc------chHHHHHHHHH Confidence 33333333 34555544431 112233 33444556778888877643221 11 12344555553 Q ss_pred --cCccccccce----eccCc-----ccccccCCHHHHHHHHhCCcEEEEE-cC-CEEEEEecccccccCCCCCcchhhh Q lcl|NC_019421. 313 --SKGITGSICN----AKTIF-----EEVEPRLSQSEVKECLKSGTLVLDF-DD-GDVIIVDDVNTFKKYVDDKNEAMGY 379 (473) Q Consensus 313 --~~~~~~s~t~----~~~~~-----~~~~~~~t~~e~~~l~~~G~~~l~~-~~-~~~~i~~gi~T~~~~~~~~~~~~~~ 379 (473) +.++.+|+-. ..++. .+-...++...+..|-++|..+++. ++ .++.+-.| +|+ +..+.+|++ T Consensus 194 n~aVsIadsp~RVktG~l~g~~~~p~d~~g~~l~~a~l~aLd~agysvp~~Y~gy~G~Yw~d~-~tl----~~~gsDYq~ 268 (369) T protein:vir:27 194 NKEVSIADSPARVQTGSVLGNTELMKDKAGKALDLATLKALESNRIAVPMWYPDYPGQYWTTG-RTL----DVPGGDYQD 268 (369) T ss_pred hcccchhcCcceeeecccccccccccCCCCcccCHHHHHHHHhCCCeEEEeeCCCCceEEeCc-eEe----ccCCCCeeh Confidence 3444555422 22221 1122347788999999999998864 43 34554444 443 345778999 Q ss_pred hhhhHHHHHHHHHHHHHHhhcCC--cccCCHHHHHHHHHHHHHHHHHHHhcCCccCccceecc----ccc-cCCCCCEEE Q lcl|NC_019421. 380 ISNIMFINTINKDTSLKRKEFVG--KIFNDATGQTTVICALKKYFEELMSQGIISEFNVDIDT----ELQ-ATAKADEFY 452 (473) Q Consensus 380 i~v~R~~d~i~~~i~~~~~~~ig--~~~N~~~~r~~i~~~i~~~l~~l~~~g~i~~~~~~~D~----~~~-~~~~~d~~~ 452 (473) |..+|++|-+.+.+|...=++++ .++.++...+..+.-+..-|++|.+.. |..++.+ +++ .-.++..+. T Consensus 269 iE~~RVvdKa~R~vR~~Ai~~i~Dr~lnstp~sia~~~~~~~~pLr~M~ks~----fpgei~~P~d~dI~i~w~~k~~V~ 344 (369) T protein:vir:27 269 IRHIRVAMKAARKVRIRAIARIADRTLNSTPQSIAAAKLYFTQDLRTMALTG----VPGEIYPPEDEDIQIKWVNSTDVE 344 (369) T ss_pred hhhhhHHHHHHHHHHHHHHHHhcCcccccChhHHHHHHHHHhhHHHHHHhhc----CCeEEecCCCCceEEEeeccceEE Confidence 99999999999999855445555 233444555667777788888998763 3222211 110 112344677 Q ss_pred EEEEEEEeeeeeeEEEEEEeC Q lcl|NC_019421. 453 WKWDAVKVDVMKKIYGTGYLG 473 (473) Q Consensus 453 v~i~v~p~~~~e~i~~t~~v~ 473 (473) |.+-++|..+-.+|..+|.+- T Consensus 345 I~~~vrP~~~pk~it~~I~ld 365 (369) T protein:vir:27 345 IYMSVQPYECPVKITIAISVK 365 (369) T ss_pred EEEEEeeccCCceEEEEEEEe Confidence 888899999999999999888 No 67 >protein:vir:80052 Length: 331 # NCBI annotation: gp14 # Family: family:all:396 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468718;genbank:gi:157325298;genbank:GeneID:5601743 Probab=98.59 E-value=2.2e-07 Score=57.03 Aligned_cols=310 Identities=12% Similarity=0.059 Sum_probs=149.9 Q ss_pred EEEecCCCcccceeeeecccccccccceEEEEecCcc---ccceeEEEeeccCCccceeeeeecCCceeeEEEecccchh Q lcl|NC_019421. 89 LLYRLVDGNQKKGTLTLKDTTENSAKDVIKLETKYPT---ARNFNVTIKSNLVDSDKKDFIFFENTKQLFSSSIKGTIDE 165 (473) Q Consensus 89 ~v~rv~~g~~~aat~~l~~~~~~~~~~~l~i~A~~~G---~~~n~i~v~~~~~~~~~~~v~v~~~~~~~~~~~~~~~~~~ 165 (473) .|-|+.+ ..+++....|. .+++...+.... + -+++.+.....++.-. ...+. T Consensus 1 ~~~~iv~-------------------V~v~~~~~~~~~~~~~~~~~~~~~~t--~--~~~~~y~s~~~v~~d~--~~~~~ 55 (331) T protein:vir:80 1 MVETITD-------------------VRVHISVLYPSPRIGLGRPAIFVKGT--A--MGYKEYTTLEELKDTF--ADNTE 55 (331) T ss_pred Cccceec-------------------ceeeecccccccccccCcceeEEecc--c--cceEEEechhhhccCC--CCCcH Confidence 1111110 01111111111 112211111100 0 0111111111111100 00000 Q ss_pred hhhhhhhcccccceeEeecccCCccccccceeeeccCcccccchhhHHHHHHHHhhcccceEEEEEcCCCcHHHHHHHHH Q lcl|NC_019421. 166 IVLEINSNLDNEYVIATKVADSDTILANVVNQALEGGNDGCTSITNESYLKALEEFERYSFDSFVLDGVADEALQETTKA 245 (473) Q Consensus 166 ~~~~~~~~~~s~~v~~~~~~~~~~~~~~~~~~~l~gG~dg~~~~t~~d~~~~l~~le~~~~~~l~~p~~~~~~~~~~l~~ 245 (473) .+.. ....-++ ...+. .+..+... ....+....+.+. .++-.+++...+++. +.++.. T Consensus 56 ~Yka-a~~~f~Q----------~~~~~-----~i~v~~~~----~~~~~~a~~a~~~-~~w~~~~~~~~~~~~-~~a~a~ 113 (331) T protein:vir:80 56 VYAK-AKAVFLQ----------KDRPD-----TVAVITYE----DTKLLEAAEAYFL-KSWHFALLAEFKAAD-ALALSN 113 (331) T ss_pred HHHH-HHHHHhc----------cCccc-----eEEEeccc----hHHHHHHHHHhcc-CceeEEEeecCCHHH-HHHHHH Confidence 0000 0000000 00000 01111110 0111222233333 334455555544444 457788 Q ss_pred HHHHHhhCCCeEEEEEcCCCCccHHHHHHhhhccCCceEEEecCCceecCcccchHHHHHHHHHhhhcCccccc-ccee- Q lcl|NC_019421. 246 WVAKNKELGKDILLFLGGKTEDNIKQINDKSKSFNDENIVNVGSSAYYENIKYTPSEVAVYIAALSVSKGITGS-ICNA- 323 (473) Q Consensus 246 ~v~~~~~~~~~~~av~~~~~~~t~~~~~~~~~~~n~~~i~~~~~~~~~~~~~~~~~~~a~~vAG~~a~~~~~~s-~t~~- 323 (473) |++. .+++ ..++.. ++...+....+ ++..+..+.+. .. + ..++.+.|..+..++.+- +-++ T Consensus 114 ~~~a---~~~~-f~~~~~---~~~~~~~~~~~--~~~t~~~~~~~----~~---~-~~~aa~~g~~~~~~~g~~t~~fk~ 176 (331) T protein:vir:80 114 LIEE---QKFK-FAVFQV---TAVADITPLAK--NTRTIAIVHSK----TG---E-KLDAALIGNVASLPVGSATWKGRH 176 (331) T ss_pred HHhh---CCcE-EEEEec---CchHHHHHhhc--cccEEEEEcCC----cc---c-hhHHHHHHHHHhcCccceeeeeec Confidence 8863 2233 333322 12222222222 33333333221 11 1 223445666677777543 2244 Q ss_pred ccCcccccccCCHHHHHHHHhCCcEEEEEcCCEEEEEecccccccCCCCCcchhhhhhhhHHHHHHHHHHHHHHhh---c Q lcl|NC_019421. 324 KTIFEEVEPRLSQSEVKECLKSGTLVLDFDDGDVIIVDDVNTFKKYVDDKNEAMGYISNIMFINTINKDTSLKRKE---F 400 (473) Q Consensus 324 ~~~~~~~~~~~t~~e~~~l~~~G~~~l~~~~~~~~i~~gi~T~~~~~~~~~~~~~~i~v~R~~d~i~~~i~~~~~~---~ 400 (473) .++++.. ..++..|++.|..+|.+++.+.++...+-+|..+ +-+ .|-.++-.|++...++..+.. = T Consensus 177 ~l~GV~~-~~lt~t~~~al~~~~~N~y~~~~~~~~~~~G~~~--------~G~--~iD~~~~~dWl~~~lq~~l~~ll~~ 245 (331) T protein:vir:80 177 GLAGITS-EELKVSEIDAIQKAGGMCYIEKAGIAQTSEGKTV--------SGE--FIDSIHGDDWIKATIETRLQKLLTE 245 (331) T ss_pred ccCCCCC-CCCCHHHHHHHHhcCceEEEEecCeeEEecceEe--------Cch--hHHHHHHHHHHHHHHHHHHHHHHHh Confidence 3666654 3689999999999999999887777777778644 123 477788888888877543322 2 Q ss_pred CCcccCCHHHHHHHHHHHHHHHHHHHhcCCcc--------Cccceecc-ccccCCCCC---EEEEEEEEEEeeeeeeEEE Q lcl|NC_019421. 401 VGKIFNDATGQTTVICALKKYFEELMSQGIIS--------EFNVDIDT-ELQATAKAD---EFYWKWDAVKVDVMKKIYG 468 (473) Q Consensus 401 ig~~~N~~~~r~~i~~~i~~~l~~l~~~g~i~--------~~~~~~D~-~~~~~~~~d---~~~v~i~v~p~~~~e~i~~ 468 (473) .+|.|-+..+...|++.++..|++..+.|+|. .|.+...+ +.++++++. .--+.+.+++..++.++.| T Consensus 246 ~~kiPy~~~G~~~l~a~i~~~~~~av~~G~I~~g~~~~~~~~~v~~~~~~~~s~~dr~~R~~~~~~~~~~~~gaI~~v~i 325 (331) T protein:vir:80 246 TDKLTFDARGIALLQSELTTVLNEGFANGIIDSNDETGEPNFSITALQRSDLNDDDIAKRNYKGLSFRYKRSGAIHSVDV 325 (331) T ss_pred CCCCccChhhHHHHHHHHHHHHHHHHhCCceecCccCCCcceEEEeCchhcCCHHHHhccCCCCeEEEEEEcceEEEEEE Confidence 47899999999999999999999999999995 34444321 223333332 2447788899999999999 Q ss_pred EEEeC Q lcl|NC_019421. 469 TGYLG 473 (473) Q Consensus 469 t~~v~ 473 (473) +.+|- T Consensus 326 ~~~v~ 330 (331) T protein:vir:80 326 YGEVE 330 (331) T ss_pred EEEEe Confidence 88888 No 68 >protein:vir:107720 Length: 515 # NCBI annotation: gp17 # Family: family:all:396 # MgeID: mge:1520 # MgeName: BcepB1A # Cross-refs: genbank:acc:YP_024865;genbank:gi:48697507;genbank:GeneID:2948336 Probab=98.51 E-value=4e-07 Score=55.64 Aligned_cols=411 Identities=9% Similarity=-0.033 Sum_probs=183.6 Q ss_pred ceecCceeEEEecCCcceecccCce-EEEEEEee-CCCCCCceEEeeccHHHHHHHcCCCcCcHHHHHHHHHHh----c- Q lcl|NC_019421. 11 RKEIPGFYNRFKTQAEKSTNTGLKG-RLAMPIRA-NWGDVGKVVTIKNDLRQLKNLFGDDMNYSAFKLGKLALL----G- 83 (473) Q Consensus 11 ~~~~PGvYie~~~~~~~~i~~~~~~-~~~~~g~a-~~Gp~~~~v~i~s~~~~~~~~fG~~~~~~~~~~v~~~f~----~- 83 (473) -||-.=+|+...++-. .-.++++. -.+++... ..=|+++ ++..+..++....||..+ +.+++.+.+|. . T Consensus 1 m~I~~~~~V~i~~~v~-aa~~~~~~~f~~li~t~~~~~p~~r-~~~y~s~~~V~~~FG~~S--~ey~aA~~yFsg~~~q~ 76 (515) T protein:vir:10 1 MPISFDKYVAITSGVA-AQQQIAARSFAIRVYTPNPMVSVDR-LITATSAADVGAYFGTAS--EEYKRAVKNFGFISKKT 76 (515) T ss_pred CCCCceeEEEeecccc-cCCccccccceeeeeecccCCCccc-eeeecCHHHHHHhcCCCh--HHHHHHHHHhhhccCCc Confidence 4566678888775432 22333332 22333333 3347765 455555777899999764 67788888884 1 Q ss_pred -CCCEEEEEecCCCcccceeeeeccccccc---------ccceEEEEecCccccceeE-EEe-eccCCccc--------- Q lcl|NC_019421. 84 -NVKELLLYRLVDGNQKKGTLTLKDTTENS---------AKDVIKLETKYPTARNFNV-TIK-SNLVDSDK--------- 142 (473) Q Consensus 84 -g~~~v~v~rv~~g~~~aat~~l~~~~~~~---------~~~~l~i~A~~~G~~~n~i-~v~-~~~~~~~~--------- 142 (473) -.+++++.|-..... ++ .+....-.. ..+.++++-. |..+ ++. .+....+. T Consensus 77 p~P~~L~igR~~~~a~-~~--~l~g~~~~~~~l~~~~~is~G~ltitid-----G~~~~t~s~i~~S~ats~~~vAs~i~ 148 (515) T protein:vir:10 77 RRPTSIQFARWQREAG-PV--AIYGGAKKAAALATLQAVTAGAISFLFG-----GATTVTVSGISFSAATSLADVASELQ 148 (515) T ss_pred ccccEEEEEeccCccc-ce--EEEeccchhhhHHhhhcccceeEEEEEc-----ceEEEEeeccccccccCHHHHHHHHH Confidence 356899999544211 11 111110000 0112222110 0000 000 00000000 Q ss_pred -----------eeeeeecCCceeeEEEecccchhhhhhhhhcccccceeEeecccCCcccccc------------ceeee Q lcl|NC_019421. 143 -----------KDFIFFENTKQLFSSSIKGTIDEIVLEINSNLDNEYVIATKVADSDTILANV------------VNQAL 199 (473) Q Consensus 143 -----------~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~v~~~~~~~~~~~~~~~------------~~~~l 199 (473) ..+.+..+ ...+.++......+........ ....| T Consensus 149 tal~~~~~~~~~~~tv~~d----------------------~~~~~F~v~s~~tG~~~~is~~~~t~~~~~t~~a~~lgl 206 (515) T protein:vir:10 149 TALRANADANLATCTVSYD----------------------PVGARFNFAGSPSDDTVQESISIVPQSNPAIDVAQLLGW 206 (515) T ss_pred hhhccccccccceeEEEEe----------------------cCCCeEEEEEeecCCceeEEEEEecCCCchhhHHHHhcc Confidence 00111000 0111111111111111000000 00111 Q ss_pred ccCccc--ccchhhHHHHHHHHhhccc--ceEEEEEcCCC----cHHHHHHHHHHHHHHhhCCCeEEEEEcCCCCccHHH Q lcl|NC_019421. 200 EGGNDG--CTSITNESYLKALEEFERY--SFDSFVLDGVA----DEALQETTKAWVAKNKELGKDILLFLGGKTEDNIKQ 271 (473) Q Consensus 200 ~gG~dg--~~~~t~~d~~~~l~~le~~--~~~~l~~p~~~----~~~~~~~l~~~v~~~~~~~~~~~av~~~~~~~t~~~ 271 (473) +.+... ......+...+++.++... +|-.+++.... ..+....+.+|+++.. ..+..++ .......... T Consensus 207 t~~~~av~~~g~aaet~~~a~~a~~~~s~nWy~f~~a~~~~~~~~~a~~~a~a~~~e~~~--~~~~~~~-~~~~~~~~~~ 283 (515) T protein:vir:10 207 NSAQGASYIAASPVVSPVDTLIASVAGNNNFGSILFTKNGGTGITLSDAEAIALQNQSYN--VAYKFQV-GVDDTTYSSW 283 (515) T ss_pred ccccceEEecccccccHHHHHHHHHhccCCeEEEEEeecCccccchhHHHHHHHHHhhcC--ceEEEEe-ccCccceech Confidence 111100 0001112244566666543 44444443211 1334456666765321 1222222 1111111111 Q ss_pred HHHhhhccCCceEEEecCCceecCcccchHHHHHHHHHhhhcCcccc-----ccceeccCcccccccCCHHHHHHHHhCC Q lcl|NC_019421. 272 INDKSKSFNDENIVNVGSSAYYENIKYTPSEVAVYIAALSVSKGITG-----SICNAKTIFEEVEPRLSQSEVKECLKSG 346 (473) Q Consensus 272 ~~~~~~~~n~~~i~~~~~~~~~~~~~~~~~~~a~~vAG~~a~~~~~~-----s~t~~~~~~~~~~~~~t~~e~~~l~~~G 346 (473) -........+.+...+.+ ....+.. +...|..++++.++ .+-++.++++..+ .++..|.+.|.++| T Consensus 284 ~a~~~~~~~~~~~~~~~~----~~~~~~~----a~~~g~~asvnf~~~ng~iT~kfKq~~Gita~-~lt~t~a~al~~~~ 354 (515) T protein:vir:10 284 QAALAAIGGVNMIYSPVA----LAAEYHD----MQDGIIEAATDFTQQGGATGYMYVQFNNQTPA-VNDDTLSGILDDLN 354 (515) T ss_pred hhhhhhhhhcCceEEEEe----ccCcchH----HHHHHHHHhcCCCccchhheeccccCCCCccc-cCCHHHHHHHHhcC Confidence 111111111222222211 1122222 23455566665433 2344566655443 48899999999999 Q ss_pred cEEEEEc---CCEEEEE-ecccccccCCCCCcchhhhhhhhHHHHHHHHHHHHHH-hhc--CCcccCCHHHHHHHHHHH- Q lcl|NC_019421. 347 TLVLDFD---DGDVIIV-DDVNTFKKYVDDKNEAMGYISNIMFINTINKDTSLKR-KEF--VGKIFNDATGQTTVICAL- 418 (473) Q Consensus 347 ~~~l~~~---~~~~~i~-~gi~T~~~~~~~~~~~~~~i~v~R~~d~i~~~i~~~~-~~~--ig~~~N~~~~r~~i~~~i- 418 (473) .+.+..- ++.+.+. +|+-+ ...-+|+.|-+++-.|.++..++..+ +-+ .+|+|-+..+..+|++.| T Consensus 355 ~N~Y~~~~~~~~~~~~~~~G~~~------gG~~~~~WiD~~~g~~WL~~~iq~~l~~L~~s~~KIPytd~G~a~i~a~v~ 428 (515) T protein:vir:10 355 INYYGQTQVNGTNLSFYQDGVMM------GGPTDPRDSNVYANEQWLKSYAGASFMSLQLAQGKIPANIEGRGLLLGKMT 428 (515) T ss_pred CeEEEEEeccCceEEEEeCCeee------CCccchhHHHHHhhHHHHHHHHHHHHHHHHhcCCCCccChhhHHHHHHHHH Confidence 9998533 4456554 46544 11125667888888888888875432 222 569999999999999987 Q ss_pred HHHHHHHHhcCCccCcc-------------ceecc-----------------ccccCCCCC--EEEEEEEEEEeeeeeeE Q lcl|NC_019421. 419 KKYFEELMSQGIISEFN-------------VDIDT-----------------ELQATAKAD--EFYWKWDAVKVDVMKKI 466 (473) Q Consensus 419 ~~~l~~l~~~g~i~~~~-------------~~~D~-----------------~~~~~~~~d--~~~v~i~v~p~~~~e~i 466 (473) ++.|++-.+.|.|.+.. +..|. +.+...++. .+.+.+-...-+++.+| T Consensus 429 q~vl~~av~nG~I~~Gv~ls~~Q~~~i~~~~g~d~~~~~~~~~Gyy~~~~~~~~~~~~~r~~~~~~~~~~y~~g~~i~~i 508 (515) T protein:vir:10 429 KDIIPAAKLNGTFSIGKTLTVDQQLFVTELTGDDTAWQKVQNLGYWYDVQISSFVDTGGTTKYQAVYSLVYSKDDLIRKV 508 (515) T ss_pred HHHHHHHHhCCeeecCcccchhHHHHHHhhhcCcccccchhhcceeEecCcCCCCCcccccccCceeEEEEEcCceEEEE Confidence 47999999999996421 00110 011111222 22223333445677777 Q ss_pred EEEEEeC Q lcl|NC_019421. 467 YGTGYLG 473 (473) Q Consensus 467 ~~t~~v~ 473 (473) .++-++= T Consensus 509 ~~~~~~v 515 (515) T protein:vir:10 509 VGTHTLI 515 (515) T ss_pred EeeeecC Confidence 7766655 No 69 >protein:vir:3165 Length: 426 # NCBI annotation: capsid protein CP67 # Family: family:all:28419 # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665936;genbank:gi:22091122;genbank:GeneID:951267 Probab=97.99 E-value=7.9e-06 Score=48.57 Aligned_cols=392 Identities=12% Similarity=0.016 Sum_probs=170.0 Q ss_pred cCceeEEE-ecCCcceecccCceEEEEEEeeCCCCC----CceEEeeccHHHHHHHcCCCcCcHHHHHHHHHHhcCCCEE Q lcl|NC_019421. 14 IPGFYNRF-KTQAEKSTNTGLKGRLAMPIRANWGDV----GKVVTIKNDLRQLKNLFGDDMNYSAFKLGKLALLGNVKEL 88 (473) Q Consensus 14 ~PGvYie~-~~~~~~~i~~~~~~~~~~~g~a~~Gp~----~~~v~i~s~~~~~~~~fG~~~~~~~~~~v~~~f~~g~~~v 88 (473) -|---|++ ++-....+..-.=|..+|+|...-=|+ ++..+.+| .+....=|| .+++.+++++.+|.++.+ T Consensus 1 m~~~iVnV~Is~~t~A~~~~~Fg~~liigs~~~~~p~~~f~~~~~Yss-~~~V~~Dfg--~~s~~Y~AA~~~f~Q~~~-- 75 (426) T protein:vir:31 1 MPKQIVEIELTAEIADRPQETFTDAAIVGTAEEEPPDAEFGEVNQYST-STSVGDDYG--EDSDVYTASEAIEEMGAE-- 75 (426) T ss_pred CCcceEEEEeecccccccccccceeeeeeeccccccccccchhhhhhh-HHHHHhcCC--CChHHHHHHHHHHhCCce-- Confidence 12122333 222334444455567889988765443 33444333 444555455 556889999999987744 Q ss_pred EEEecCCCcccceeeeecccccccccceEEEEecCccccceeEEEeeccCCccceeeeeecCCceeeEEEecccchhhhh Q lcl|NC_019421. 89 LLYRLVDGNQKKGTLTLKDTTENSAKDVIKLETKYPTARNFNVTIKSNLVDSDKKDFIFFENTKQLFSSSIKGTIDEIVL 168 (473) Q Consensus 89 ~v~rv~~g~~~aat~~l~~~~~~~~~~~l~i~A~~~G~~~n~i~v~~~~~~~~~~~v~v~~~~~~~~~~~~~~~~~~~~~ 168 (473) +.|.--.. ++.... .......++ +++-+.... ........... T Consensus 76 -~~r~~v~~---at~~~~----~~~t~~~tv---------~g~~~s~~a--------------------~~~~~a~~i~~ 118 (426) T protein:vir:31 76 -QWRVMVLE---ATEVTE----EELSDGDTI---------DKVPILGNH--------------------EVESPDGDIEF 118 (426) T ss_pred -eEEeeccc---cceeee----ccCCcceee---------cceeeeecc--------------------cCcchHHHHHH Confidence 43431111 111000 001111111 011100000 00000000000 Q ss_pred hhhhc---ccccceeEeecccCCccccccceeeeccCccc-ccchhhHHHHHHHHhhcccceEEEEEcCCCc-HHHHHHH Q lcl|NC_019421. 169 EINSN---LDNEYVIATKVADSDTILANVVNQALEGGNDG-CTSITNESYLKALEEFERYSFDSFVLDGVAD-EALQETT 243 (473) Q Consensus 169 ~~~~~---~~s~~v~~~~~~~~~~~~~~~~~~~l~gG~dg-~~~~t~~d~~~~l~~le~~~~~~l~~p~~~~-~~~~~~l 243 (473) ..... .......+.. ..++. ++.+... .......|+. +|..+.. +.+...++.... -..+..+ T Consensus 119 ~~~~~~~~~~~~~~~~~~--t~~g~--------~t~~~~~~~~~~s~~dw~-~~~~~~s-~~~~~~ia~~~~~~~~~~~~ 186 (426) T protein:vir:31 119 TTDDDPDVEDFDAEIVIN--SATGD--------VATSEDSIELTYFHADWS-QLDEFPS-DVNNFAVADRRFDLKGVGVL 186 (426) T ss_pred hhccccccccceeeeEec--cccce--------eeccccceeeeeccCcch-hhhcccc-cchhhhhhccccchhhhhhh Confidence 00000 0001111110 11111 1111000 0011112221 2222221 222222332111 1112222 Q ss_pred HHHHHHHhhCCCeEEEEEcCCCCccH---HHHHHhhhccCCceEEEecCCceecCccc-chHHHHHHHHHhhhcCccccc Q lcl|NC_019421. 244 KAWVAKNKELGKDILLFLGGKTEDNI---KQINDKSKSFNDENIVNVGSSAYYENIKY-TPSEVAVYIAALSVSKGITGS 319 (473) Q Consensus 244 ~~~v~~~~~~~~~~~av~~~~~~~t~---~~~~~~~~~~n~~~i~~~~~~~~~~~~~~-~~~~~a~~vAG~~a~~~~~~s 319 (473) ..|..-..+.. .+.|...-...+. +.....+ ..... ++ |...+.+... ......+|+++.++..++.+. T Consensus 187 ~~~~~wa~~~~--i~~va~~~e~~~~~~~~~~~a~~--~~~~~--y~-p~~~~~~~~~~~~~~~~~~~~~~~aa~~~~~~ 259 (426) T protein:vir:31 187 DETHSWASDED--MGMIANGVNVDDYDSVDEAMDVA--HEVAG--YV-PSGDLMMIVDASDDDLAAYQLGKFAVSEPWYN 259 (426) T ss_pred Hhhhhhhhhcc--eeeeeeccchhhhcchhhhhhhh--hcccc--cc-cchhheeehhccccchhhHHhhhhhhhccccc Confidence 22222222222 2222211111111 1111100 00000 11 1111111100 111236789999999888776 Q ss_pred cceeccCcccc----------cccCCHHHHHHHHhCCcEEEEEcCCEEEEEecccccccCCCCCcchhhhhhhhHHHHHH Q lcl|NC_019421. 320 ICNAKTIFEEV----------EPRLSQSEVKECLKSGTLVLDFDDGDVIIVDDVNTFKKYVDDKNEAMGYISNIMFINTI 389 (473) Q Consensus 320 ~t~~~~~~~~~----------~~~~t~~e~~~l~~~G~~~l~~~~~~~~i~~gi~T~~~~~~~~~~~~~~i~v~R~~d~i 389 (473) ++...++.... ..-+..+++ ...++-.+.|+...+...|.+++++-. +..+-.+|=++|..|++ T Consensus 260 ~~~~~~~~~~~~~~~~~~~gv~~t~~~~~~-A~~~~~~n~~~~~~~~~~i~~~~~~~G-----~~~~G~~iD~~~g~dwl 333 (426) T protein:vir:31 260 PLWNELPAGETVSKNVGDPEEQGTFEGGDE-AEGEGPVNVLIDVSDANRVSNAVTTAG-----ADSDTSFFDIRRTKVYT 333 (426) T ss_pred hhhhhccccccceeeccccccccccchhhh-hhhcCCceEEEEecCceeeecceeecc-----cccchhhhhhHHHHHHH Confidence 65443332211 111222222 334555678888778788888876532 22233467889999999 Q ss_pred HHHHHHHHhh-c--CCcccCCHHHHHHHHHHHHHHHHHHHhcCC--ccCccceecccc-ccCCCCC-E--EEEEEEEEEe Q lcl|NC_019421. 390 NKDTSLKRKE-F--VGKIFNDATGQTTVICALKKYFEELMSQGI--ISEFNVDIDTEL-QATAKAD-E--FYWKWDAVKV 460 (473) Q Consensus 390 ~~~i~~~~~~-~--ig~~~N~~~~r~~i~~~i~~~l~~l~~~g~--i~~~~~~~D~~~-~~~~~~d-~--~~v~i~v~p~ 460 (473) ...++..+.. . -.|+|-+..+...|++.|..-|++..+.|. +..|.+.. |.. +...++. . --+.+.++.. T Consensus 334 ~~~iq~~l~~ll~~~~KIpyt~~Gi~~I~~~i~~~L~~~v~~~g~~~~~y~v~~-P~~~~~~~dra~R~~~~i~~~~~la 412 (426) T protein:vir:31 334 AEMLELDLESLQVSDDDVPFTEDGQAMIEDAIKGTMSGLTGSVGQPLAEYEVDV-PEWDDDDVDRVNRNWGGIDLDARLA 412 (426) T ss_pred HHHHHHHHHHHhhcCCCCccchhHHHHHHHHHHHHHHHHhcCCCccccceeecC-CCccccchhhhhhccCCceEEEEEe Confidence 9998755433 2 248899999999999999999998888532 44565542 111 1111222 1 2277888999 Q ss_pred eeeeeEEEEEEeC Q lcl|NC_019421. 461 DVMKKIYGTGYLG 473 (473) Q Consensus 461 ~~~e~i~~t~~v~ 473 (473) -++..+.|..+|- T Consensus 413 GAIh~v~I~g~v~ 425 (426) T protein:vir:31 413 QRAHTFSLGLNVS 425 (426) T ss_pred CcEEEEEEEEEEe Confidence 9999988888877 No 70 >protein:vir:96586 Length: 587 # NCBI annotation: ORF011 # Family: family:all:2449 # MgeID: mge:1623 # MgeName: Twort # Cross-refs: genbank:acc:YP_238553;genbank:gi:66391266;genbank:GeneID:5130368 Probab=96.12 E-value=0.00098 Score=37.07 Aligned_cols=415 Identities=12% Similarity=0.026 Sum_probs=173.6 Q ss_pred CCccccCCCCceecCceeEEEecCCc--ceecccCceEEEEE--E----eeCCCCCCceEEeeccHHH---HHHHcCCCc Q lcl|NC_019421. 1 MATGTWNEKERKEIPGFYNRFKTQAE--KSTNTGLKGRLAMP--I----RANWGDVGKVVTIKNDLRQ---LKNLFGDDM 69 (473) Q Consensus 1 m~~g~~~~~~~~~~PGvYie~~~~~~--~~i~~~~~~~~~~~--g----~a~~Gp~~~~v~i~s~~~~---~~~~fG~~~ 69 (473) -+|-++.. ..||+-=+.+.... .++.+..+-.+.+- + --+|||+ +.|.+..+. ....||... T Consensus 102 ~~~~~~~~----~~~g~~~n~i~v~~~~~~~~~~~~~~~~~~~~~~~~~~~n~G~v---~~i~y~g~~~~a~~~~~~~~~ 174 (587) T protein:vir:96 102 KGGLRVTS----KIFGSVSNDIQVALEKNTITDSLRLRVVFQKDNYQEVFDNLGNI---FSINYKGEGEKATFSVEKDKE 174 (587) T ss_pred cccccccc----cccCCCCceEEEEEEeccCCCccceEEEEecCCceeeccccCce---EEEEecccccceeEeeccCcc Confidence 11111100 00111111110000 01111121111111 1 1246765 344332221 223455443 Q ss_pred CcHHHHHHHHHHhcCCCEEEEEecCCCcccceeeeecccccccccceEEEEecCccccceeE--E----Ee-eccCCccc Q lcl|NC_019421. 70 NYSAFKLGKLALLGNVKELLLYRLVDGNQKKGTLTLKDTTENSAKDVIKLETKYPTARNFNV--T----IK-SNLVDSDK 142 (473) Q Consensus 70 ~~~~~~~v~~~f~~g~~~v~v~rv~~g~~~aat~~l~~~~~~~~~~~l~i~A~~~G~~~n~i--~----v~-~~~~~~~~ 142 (473) .. .+.++++..|+++++.|||++|..+++...+.+. .....++|++||.++|.+ + +. ....+... T Consensus 175 ~~---~A~~l~l~gg~~~v~~yrl~~g~~~~~~~~~~~~-----~~~~~~tAky~g~~~n~~~v~v~d~~~~~~~k~~~~ 246 (587) T protein:vir:96 175 TQ---EAKRLVLKVDEKEVKAYELNGGAYSFTNEIITDI-----NELPDFEAKLSPFGDKNLESRKLDEATDVDIKGKAV 246 (587) T ss_pred cc---eeeeeEEEecCceEEEEEeCCCchhhhhhhhhhh-----ccccceEEEeecccCceeEEEeeccccccccceEEE Confidence 33 3467788889999999999998887766554432 223478999999887643 1 21 11111000 Q ss_pred ------eeeeeecCCceeeEEE-ecccc----------hhhhh------hhhhcccccceeEeecccCCcccccc----- Q lcl|NC_019421. 143 ------KDFIFFENTKQLFSSS-IKGTI----------DEIVL------EINSNLDNEYVIATKVADSDTILANV----- 194 (473) Q Consensus 143 ------~~v~v~~~~~~~~~~~-~~~~~----------~~~~~------~~~~~~~s~~v~~~~~~~~~~~~~~~----- 194 (473) .++....+........ ..... +.... .......-. .....++.+++.+.. T Consensus 247 y~~t~~~di~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~--~~aLtGG~dG~~~~~y~~~l 324 (587) T protein:vir:96 247 YVKAVFGDIENQTQYNQYVKFEQLPEQASEPSDVEVHAETESATVTATSKPKAIEPFE--LTKLSGGTNGEPPTSWSAKL 324 (587) T ss_pred eehhhhhhhhhhhccccceeeccccchhhhhhcccccccccceeeeeccccccccccc--ceeeecCCCCCCcccHHHHH Confidence 0100000000000000 00000 00000 000000000 001122222222110 Q ss_pred ------ceeeeccCcccccchhhHHHHHHHHhhcccceE-EEEEcCCCcHHHHHHHHHHHHHHhhCCCeEEEEEcCCCCc Q lcl|NC_019421. 195 ------VNQALEGGNDGCTSITNESYLKALEEFERYSFD-SFVLDGVADEALQETTKAWVAKNKELGKDILLFLGGKTED 267 (473) Q Consensus 195 ------~~~~l~gG~dg~~~~t~~d~~~~l~~le~~~~~-~l~~p~~~~~~~~~~l~~~v~~~~~~~~~~~av~~~~~~~ 267 (473) ....+...++.. .....+.+.++.+....-. ..++++...+... ...++.+... ..++..+.. T Consensus 325 ~ale~~~~~~i~~~t~d~--ai~~~l~a~vk~~r~~gk~~~aVlg~~~~~~~~-~~~~~a~~~n---~e~vi~v~~---- 394 (587) T protein:vir:96 325 EKFKNEGGYYIVPLTDRQ--SVHSEVATFVKNRSDAGEPMRAIVGGGTSETKE-KLFGRQAILN---NPRVALVAN---- 394 (587) T ss_pred HHHhhCCcEEEEecCCCH--HHHHHHHHHHHHHHhCCCeEEEEecCCCCCCHH-HHHHHHhhcC---CCcEEEEec---- Confidence 011122222211 1112233344444322211 2334443333332 2333433332 233444432 Q ss_pred cHHHHHHhhhccCCceEEEecCCc-eecCcccchHHHHHHHHHhhhcCccccccceeccCcccccccCCHHHHHHH---- Q lcl|NC_019421. 268 NIKQINDKSKSFNDENIVNVGSSA-YYENIKYTPSEVAVYIAALSVSKGITGSICNAKTIFEEVEPRLSQSEVKEC---- 342 (473) Q Consensus 268 t~~~~~~~~~~~n~~~i~~~~~~~-~~~~~~~~~~~~a~~vAG~~a~~~~~~s~t~~~~~~~~~~~~~t~~e~~~l---- 342 (473) ..++..+.+. ...+....++..||.+||+-...++..-.... .+......+--.+++.+. T Consensus 395 --------------~~~~~~~~~~~~~~~~~~~aa~vAG~~Ag~~~~~S~T~~~~~~-~~v~~~~t~~e~~~~i~~G~~~ 459 (587) T protein:vir:96 395 --------------SGKFVMGNGRILQAPAYMVASAVAGLVSGLDIGESITFKPLFV-NSLDKVYESEELDELNENGIIT 459 (587) T ss_pred --------------ceEEecCCCceeeechhhHHHHHHHHHhcCccccCccceeeec-ccccccCCHHHHHHHHhCCeEE Confidence 2233333322 22334455777889999998877766533321 121111111111122111 Q ss_pred ---HhCCcEEEEEcCCEEEEEecccccccCCCCCcchhhhhhhhHHHHHHHHH-HH-HHHhhcCCcccCCHHHHHHHHHH Q lcl|NC_019421. 343 ---LKSGTLVLDFDDGDVIIVDDVNTFKKYVDDKNEAMGYISNIMFINTINKD-TS-LKRKEFVGKIFNDATGQTTVICA 417 (473) Q Consensus 343 ---~~~G~~~l~~~~~~~~i~~gi~T~~~~~~~~~~~~~~i~v~R~~d~i~~~-i~-~~~~~~ig~~~N~~~~r~~i~~~ 417 (473) .+++.. ...+++++++|++......-..++-|++...+..-.+. ++ .+..+ .-...+-...|..+..- T Consensus 460 l~~~~~~~~------~v~~~vnsitT~t~~~~~~~~~i~virv~D~i~~di~~~~~~~yiGk-~nn~~~r~~v~~~i~~~ 532 (587) T protein:vir:96 460 IEFVRNRMT------TMFRIVDDVTTFPDKNDPVKSEMALGEANDFLVSELKILLEEQYIGT-RTINTSASQIKDFVQSY 532 (587) T ss_pred EEEecCCcE------EEEEeeccceecCCCCCchhhhhhhHHHHHHHHHHHHHHHHhcCCcc-ccCHHHHHHHHHHHHHH Confidence 122222 23578999999998888878889999988877666544 33 23333 23333445667777776 Q ss_pred HHHHHHHH-HhcCCccCc-------cceeccccccCCCCCEEEEEEEEEEeeeeee Q lcl|NC_019421. 418 LKKYFEEL-MSQGIISEF-------NVDIDTELQATAKADEFYWKWDAVKVDVMKK 465 (473) Q Consensus 418 i~~~l~~l-~~~g~i~~~-------~~~~D~~~~~~~~~d~~~v~i~v~p~~~~e~ 465 (473) +...-+.= ....-.+++ .+.++..+++-.-.+.+++.+.++|-- ++- T Consensus 533 L~~l~~~g~I~~~~~~dv~v~~~~D~~~v~~~v~Pv~~mekIy~tv~~~~~~-~~~ 587 (587) T protein:vir:96 533 LGRKKRDNEIQDFPPEDVQVIIEGNEARISLTIFPIRALKKISVSLVYRQQT-LQA 587 (587) T ss_pred HHHHHhCCcccCCCccceEEEecCCEEEEEEEEEEcccceEEEEEEEEEeee-ecC Confidence 66654321 111111222 233455667777778899998886632 222 No 71 >protein:vir:80488 Length: 562 # NCBI annotation: Tsh # Family: family:all:2449 # MgeID: mge:1883 # MgeName: A511 # Cross-refs: genbank:acc:YP_001468473;genbank:gi:157325048;genbank:GeneID:5601446 Probab=96.06 E-value=0.00035 Score=39.50 Aligned_cols=424 Identities=12% Similarity=0.040 Sum_probs=163.1 Q ss_pred CCcc--------ccCCCCceecCc--eeEEEecCC---c--------------------------ceecccCceEEEEEE Q lcl|NC_019421. 1 MATG--------TWNEKERKEIPG--FYNRFKTQA---E--------------------------KSTNTGLKGRLAMPI 41 (473) Q Consensus 1 m~~g--------~~~~~~~~~~PG--vYie~~~~~---~--------------------------~~i~~~~~~~~~~~g 41 (473) ..|| -|++ +..-=| +|.=-+..+ . .++.+..+-.+-+-. T Consensus 63 f~~g~l~~~i~~a~~~--~~~~g~~~~~~~rv~~a~~a~~~~~~~~~~~~~~g~~~n~i~v~~~~~~~~~~~~~~v~~~~ 140 (562) T protein:vir:80 63 FRSGELLDAIERAWNP--GEGTGAGDILAMRVEEAKEATFEAEGVKVSSTIYGADANDIQVALEDNTITGTKRLSIVFAK 140 (562) T ss_pred hcCCChHHHHHHhccc--ccccCceEEEEEEcCCCCcceEEecceEEEEeecccCCCceEEEEecCCCCCCcceEEEecC Confidence 0000 0110 000000 221111100 0 011111111111100 Q ss_pred eeCCC----CCCceEEeeccHH---HHHHHcCCCcCcHHHHHHHHHHhcCCCEEEEEecCCCcccceeeeeccccccccc Q lcl|NC_019421. 42 RANWG----DVGKVVTIKNDLR---QLKNLFGDDMNYSAFKLGKLALLGNVKELLLYRLVDGNQKKGTLTLKDTTENSAK 114 (473) Q Consensus 42 ~a~~G----p~~~~v~i~s~~~---~~~~~fG~~~~~~~~~~v~~~f~~g~~~v~v~rv~~g~~~aat~~l~~~~~~~~~ 114 (473) ..++ .+|....|..... ....+||.... ..+.++.+..|.++++.||++.|...++...... -. T Consensus 141 -~~~~ev~~~~g~v~~i~y~g~~~~a~~~i~~~~~~---~~a~~l~~~~g~~~v~~~~l~~g~~~~~~~l~~~-----i~ 211 (562) T protein:vir:80 141 -ERVNQVYDNLGSIFSIKYKGTEASATFTVAVDPVT---FKATKLTLKAGDKTVKEYDLGSGAYAETNVLISD-----IN 211 (562) T ss_pred -CcceEEeeccCceeeeeeccccccceeEEEecCcc---ceEEEEEEecCCcceeEEEeCCCccchhhhhhhh-----hc Confidence 0111 1123334432111 11223443322 2345566667778999999998765444322211 11 Q ss_pred ceEEEEecCccccceeEEEeeccCCccceeee---eecCCceeeEEEecccchhhhhh---hhhcccccceeEeecccCC Q lcl|NC_019421. 115 DVIKLETKYPTARNFNVTIKSNLVDSDKKDFI---FFENTKQLFSSSIKGTIDEIVLE---INSNLDNEYVIATKVADSD 188 (473) Q Consensus 115 ~~l~i~A~~~G~~~n~i~v~~~~~~~~~~~v~---v~~~~~~~~~~~~~~~~~~~~~~---~~~~~~s~~v~~~~~~~~~ 188 (473) ....++|+++|.++|.+.+... +....++.. .+......+........+-.... ...........+ .++.+ T Consensus 212 ~~~~~tAky~g~~~n~i~~~~~-d~~~~~~~kt~~~~v~~~~~d~~~~~~~n~~v~~~~~~~~~la~~~~~~L--tGG~d 288 (562) T protein:vir:80 212 NLPDFEAKFFPIGDKNLTTDNF-DAQIDVDIKTKEAYVKAVGGDIEKQTAYNGYVEFEFDRSKEIANFPLTKL--TGGDN 288 (562) T ss_pred cccceEEEecccCCceeeeccc-ccchhhhcccceeeeeehhhhhhhcccccceEEEEeccCccccccceeee--eCCCC Confidence 2345799999999998876421 111111111 11110000000000000000000 000011111111 22222 Q ss_pred cccccc-----------ceeeeccCcccccchhhHHHHHHHHhhcccce-EEEEEcCCCcHHHHHHHHHHHHHHhhCCCe Q lcl|NC_019421. 189 TILANV-----------VNQALEGGNDGCTSITNESYLKALEEFERYSF-DSFVLDGVADEALQETTKAWVAKNKELGKD 256 (473) Q Consensus 189 ~~~~~~-----------~~~~l~gG~dg~~~~t~~d~~~~l~~le~~~~-~~l~~p~~~~~~~~~~l~~~v~~~~~~~~~ 256 (473) ++.... ....+...++. ..-...+.+..+.+....- -..++++....... .+.++.+.. ... T Consensus 289 G~~~~~~~dal~~Le~~~~~~i~~~t~d--~ai~~~~~a~vkr~r~~g~~~~aVvg~~~~~~~~-~~~~~a~~~---n~e 362 (562) T protein:vir:80 289 GTIPESWADKFSYFANEGGYYLVPLTSK--QAVHAEALQFVRDCSYNGNPMRVFVGGGIGESME-QLFTRAIGL---QNE 362 (562) T ss_pred CCccccHHHHHHHHHhCCcEEEEecCCC--hHHHHHHHHHHHHHHhCCCeEEEEecCCCCCCHH-HHHHHhhhc---CCC Confidence 222110 01122222221 1111223334444433221 23334443333332 223333332 233 Q ss_pred EEEEEcCCCCccHHHHHHhhhccCCceEEEecCCc-eecCcccchHHHHHHHHHhhhcCccccccceeccCcccccccCC Q lcl|NC_019421. 257 ILLFLGGKTEDNIKQINDKSKSFNDENIVNVGSSA-YYENIKYTPSEVAVYIAALSVSKGITGSICNAKTIFEEVEPRLS 335 (473) Q Consensus 257 ~~av~~~~~~~t~~~~~~~~~~~n~~~i~~~~~~~-~~~~~~~~~~~~a~~vAG~~a~~~~~~s~t~~~~~~~~~~~~~t 335 (473) ++..+.++ .+.....+. ........++..||.+||+-...++..-.... .+......+-- T Consensus 363 ~vv~v~~~------------------~~~~~~~~~~~~~~~~~~aa~vAGl~Ag~~~~~S~T~~~i~~-~~v~~~lt~~e 423 (562) T protein:vir:80 363 RAGLIGFS------------------GTVKMDDGRSLKMPGYMFAAQVAGLTCGLEIGEAITFKNIAI-ETLDTIYEGSQ 423 (562) T ss_pred eEEEEecC------------------eeEECCCCceeeechhHHHHHHHHHHhcCccccCccceeecc-ccccccCCHHH Confidence 44444322 222222221 22233445677889999997776655433221 11111111111 Q ss_pred HHHHHHHHhCCc-EEEEEcCCEEEEEecccccccCCCCCcchhhhhhhhHHHHHHHH-HHH-HHHhhcCCcccCCHHHHH Q lcl|NC_019421. 336 QSEVKECLKSGT-LVLDFDDGDVIIVDDVNTFKKYVDDKNEAMGYISNIMFINTINK-DTS-LKRKEFVGKIFNDATGQT 412 (473) Q Consensus 336 ~~e~~~l~~~G~-~~l~~~~~~~~i~~gi~T~~~~~~~~~~~~~~i~v~R~~d~i~~-~i~-~~~~~~ig~~~N~~~~r~ 412 (473) .+++.+.=..=. ..-.......++++++||++...+..-..++-|++...+..=.+ .++ .+..+.-- ...-...|. T Consensus 424 ~~~li~~G~l~l~~~~~~~v~~~riv~~itT~t~~~~~~~~ki~viRv~D~i~~dir~~~~~~yIGk~Nn-~~~r~~v~~ 502 (562) T protein:vir:80 424 LDQLNESGIITAEFVRNRAVTNFRIVDDVTTFNDKTDPVKSEIGVGEANDFLVSELKISLDNEYIGTKII-DTSASLVKN 502 (562) T ss_pred HHHHHhCCeEEEEEecCCcEEEEEeeccceeccCCCCchhhhhhhhHHHHHHHHHHHHHHHhcCCccccC-hHHHHHHHH Confidence 112221100000 00011223468899999999888877778888888776654443 333 22322222 223334566 Q ss_pred HHHHHHHHHHHHHHhcC----Cc----cCccceeccccccCCCCCEEEEEEEEEEeeeeee Q lcl|NC_019421. 413 TVICALKKYFEELMSQG----II----SEFNVDIDTELQATAKADEFYWKWDAVKVDVMKK 465 (473) Q Consensus 413 ~i~~~i~~~l~~l~~~g----~i----~~~~~~~D~~~~~~~~~d~~~v~i~v~p~~~~e~ 465 (473) .+..-+...-+.=.-++ -+ ++-.+.++..+++-.--+.+++.+.++|-- ++- T Consensus 503 ~i~~~L~~l~~~gaI~~~~~~dv~v~~~~d~~~v~~~v~Pv~~mekIy~ti~~~~~~-~~~ 562 (562) T protein:vir:80 503 FVQSFLDRKKLAKEIQDYSPEEVQVVIEGDIARISLTVFPIRSMKKIEVSLVYRQQI-LTA 562 (562) T ss_pred HHHHHHHHHHhCCcccCCCccceEEEecCCEEEEEEEEEEcccceEEEEEEEEEeee-ecC Confidence 55555554332211111 11 122234566667777778899998887632 222 No 72 >protein:vir:63742 Length: 562 # NCBI annotation: Tsh # Family: family:all:2449 # MgeID: mge:1517 # MgeName: P100 # Cross-refs: genbank:gi:82547629;genbank:GeneID:3783475 Probab=95.93 E-value=0.00016 Score=41.42 Aligned_cols=416 Identities=12% Similarity=0.021 Sum_probs=161.0 Q ss_pred CCccccCCCCceecCceeEEEecCCc--ceecccCceEEEEEEeeCCCCC----CceEEeeccHHH---HHHHcCCCcCc Q lcl|NC_019421. 1 MATGTWNEKERKEIPGFYNRFKTQAE--KSTNTGLKGRLAMPIRANWGDV----GKVVTIKNDLRQ---LKNLFGDDMNY 71 (473) Q Consensus 1 m~~g~~~~~~~~~~PGvYie~~~~~~--~~i~~~~~~~~~~~g~a~~Gp~----~~~v~i~s~~~~---~~~~fG~~~~~ 71 (473) -.|.+|.. ..||+-=|.+.... .++.+..+-.+.+-+ ..++.+ |+...|+...+. ....||...+ T Consensus 102 ~~~~~~~a----~~~g~~~n~i~v~~~~~~~~~~~~~~v~~~~-~~~~ev~~~~g~V~~i~y~g~~~~~~~~v~~~~~~- 175 (562) T protein:vir:63 102 AEGVKVSS----TIYGADANDIQVALEDNTITGTKRLSIVFAK-ERVNQVYDNLGSIFSIKYKGTEASATFTVAVDPVT- 175 (562) T ss_pred ecceeEEE----eecccCCCeEEEEEecCCCCCCcceEEEecC-CCcchhhhhccceeeeeeecccccceEEEEecCcc- Confidence 11111110 11111111110000 011122221111110 111111 223333211111 1123333222 Q ss_pred HHHHHHHHHHhcCCCEEEEEecCCCcccceeeeecccccccccceEEEEecCccccceeEEEeeccCCccceeee---ee Q lcl|NC_019421. 72 SAFKLGKLALLGNVKELLLYRLVDGNQKKGTLTLKDTTENSAKDVIKLETKYPTARNFNVTIKSNLVDSDKKDFI---FF 148 (473) Q Consensus 72 ~~~~~v~~~f~~g~~~v~v~rv~~g~~~aat~~l~~~~~~~~~~~l~i~A~~~G~~~n~i~v~~~~~~~~~~~v~---v~ 148 (473) ..+.+..+..|.++++.|||+.|...++.. +.+ .-.....++|+++|.++|.+.+... +....+++. .+ T Consensus 176 --~~a~~l~~~~g~~~v~~~~L~~g~~~~~~~-l~~----~in~~~~~~aky~~~~gn~i~~~~~-d~~~~~~vkt~~~~ 247 (562) T protein:vir:63 176 --FKATKLTLKAGDKTVKEYDLGSGAYAETNV-LIS----DINNLPDFEAKFFPIGDKNLTTDNF-DAQIDVDIKTKEAY 247 (562) T ss_pred --eeEEEEEeecCCcceeEEEecCCccchhHH-HHH----hhccccceEEEeeccCCceeeeecc-ccccccchhhhhhh Confidence 234456677778899999999876544322 221 1122455799999999998876432 112222221 11 Q ss_pred cCCceeeEEEecccchhhhh---hhhhcccccceeEeecccCCcccccc-----------ceeeeccCcccccchhhHHH Q lcl|NC_019421. 149 ENTKQLFSSSIKGTIDEIVL---EINSNLDNEYVIATKVADSDTILANV-----------VNQALEGGNDGCTSITNESY 214 (473) Q Consensus 149 ~~~~~~~~~~~~~~~~~~~~---~~~~~~~s~~v~~~~~~~~~~~~~~~-----------~~~~l~gG~dg~~~~t~~d~ 214 (473) ..+...+........+-... ............++ ++.+++.+.. ....+...++.. .....+ T Consensus 248 v~t~~~d~~~~~~~~~~v~~~~~~~~~la~~~~~~Lt--GG~dGt~~~~~~~al~ale~~~~~~i~~~t~d~--av~~~l 323 (562) T protein:vir:63 248 VKAVGGDIEKQTAYNGYVDFEFDRSKEIANFPLTKLT--GGDNGTIPESWADKFSYFANEGGYYLVPLTSKQ--AVHAEA 323 (562) T ss_pred hhhhhhhhhhcccccceeeeeeccccceecccceeee--cCCCCCchhhHHHHHHHHHhCCcEEEEecCCCH--HHHHHH Confidence 11100000000000000000 00001111111111 2222221110 011122222211 111223 Q ss_pred HHHHHhhcccceE-EEEEcCCCcHHHHHHHHHHHHHHhhCCCeEEEEEcCCCCccHHHHHHhhhccCCceEEEecC-Cce Q lcl|NC_019421. 215 LKALEEFERYSFD-SFVLDGVADEALQETTKAWVAKNKELGKDILLFLGGKTEDNIKQINDKSKSFNDENIVNVGS-SAY 292 (473) Q Consensus 215 ~~~l~~le~~~~~-~l~~p~~~~~~~~~~l~~~v~~~~~~~~~~~av~~~~~~~t~~~~~~~~~~~n~~~i~~~~~-~~~ 292 (473) .+.++.+....-. ..++++...+... .+.++.+.. ...++..+.++ ....... ... T Consensus 324 ~a~vkr~~~~g~~~~aVlg~~~~~~~~-~~~~~a~~~---n~ervv~v~~~------------------~~~~~~~~~~~ 381 (562) T protein:vir:63 324 LQFVRDCSYNGNPMRVFVGGGIGESME-QLFTRAIGL---QNERAGLIGFS------------------GTVKMDDGRSL 381 (562) T ss_pred HHHHHHHHhCCCcEEEEecCCCCCCHH-HHHHHhhhc---CCCcEEEEecC------------------eeEECCCCcee Confidence 3444444332222 3334433333332 233343332 23344444322 1111111 112 Q ss_pred ecCcccchHHHHHHHHHhhhcCccccccceeccCcccccccCCHHHHHHH-------HhCCcEEEEEcCCEEEEEecccc Q lcl|NC_019421. 293 YENIKYTPSEVAVYIAALSVSKGITGSICNAKTIFEEVEPRLSQSEVKEC-------LKSGTLVLDFDDGDVIIVDDVNT 365 (473) Q Consensus 293 ~~~~~~~~~~~a~~vAG~~a~~~~~~s~t~~~~~~~~~~~~~t~~e~~~l-------~~~G~~~l~~~~~~~~i~~gi~T 365 (473) ..+....++..||.+||.-...++..-.... .+...-..+--.+++... ..++. ....+++++||| T Consensus 382 ~~~~~~~aa~vAGl~A~~~~~~SlT~~~i~~-~~v~~~~t~~e~~~li~~Gv~~l~~~~~~~------v~~~~iv~~itT 454 (562) T protein:vir:63 382 KMPGYMFAAQVAGLTCGLEIGEAITFKNIAI-ETLDTIYEGSQLDQLNESGIITAEFVRNRA------VTNFRIVDDVTT 454 (562) T ss_pred eechhHHHHHHHHHhhcCchhcCccceeecc-ccccccCCHHHHHHHHhCCeEEEEEecCCc------EEEEEeecccee Confidence 2234445677889999997777665433221 121111111111122211 12222 224688899999 Q ss_pred cccCCCCCcchhhhhhhhHHHHHHHH-HHH-HHHhhcCCcccCCHHHHHHHHHHHHHHHHHHHhcC----Cc----cCcc Q lcl|NC_019421. 366 FKKYVDDKNEAMGYISNIMFINTINK-DTS-LKRKEFVGKIFNDATGQTTVICALKKYFEELMSQG----II----SEFN 435 (473) Q Consensus 366 ~~~~~~~~~~~~~~i~v~R~~d~i~~-~i~-~~~~~~ig~~~N~~~~r~~i~~~i~~~l~~l~~~g----~i----~~~~ 435 (473) ++...+..-..++-|++...+..=.+ .++ .+..+.-- ...-...|..+..-+...-+.=.-++ -+ ++-. T Consensus 455 ~t~~~~~~~~ki~viRv~D~i~~dir~~~~~~yiGk~Nn-~~~r~~v~~~i~~~L~~l~~~gaI~~~~~~dv~v~~~~d~ 533 (562) T protein:vir:63 455 FNDKTDPVKSEIGVGEANDFLVSELKISLDNEYIGTKII-DTSASLVKNFVQSFLDRKKLAKEIQDYSPEEVQVVIEGDV 533 (562) T ss_pred cCCCCCchhhhhhhhHHHHHHHHHHHHHHHhcCCccccC-hHHHHHHHHHHHHHHHHHHhCCcccCCCccceEEEecCCE Confidence 98777776677888887776554433 232 22222211 12233455555555444322211111 11 2223 Q ss_pred ceeccccccCCCCCEEEEEEEEEEeeeeee Q lcl|NC_019421. 436 VDIDTELQATAKADEFYWKWDAVKVDVMKK 465 (473) Q Consensus 436 ~~~D~~~~~~~~~d~~~v~i~v~p~~~~e~ 465 (473) +.++..+++-.--+.+++.+.++|-- ++- T Consensus 534 ~~v~~~v~pv~~mekIy~ti~~~~~~-~~~ 562 (562) T protein:vir:63 534 ARISLTVFPIRSMKKIEVSLVYRQQI-LTA 562 (562) T ss_pred EEEEEEEEEcccceEEEEEEEEeeee-ecC Confidence 44566667777777899998887632 222 No 73 >protein:vir:95741 Length: 587 # NCBI annotation: ORF011 # Family: family:all:2449 # MgeID: mge:1577 # MgeName: G1 # Cross-refs: genbank:acc:YP_240910;genbank:gi:66394960;genbank:GeneID:5132682 Probab=95.62 E-value=0.0017 Score=35.73 Aligned_cols=410 Identities=8% Similarity=0.011 Sum_probs=151.8 Q ss_pred CCccccCCCCceecCceeEEEecCCcc--eecccCceEEEEE--EeeCC-CCCCceEEeeccHHHHHHHcCCCcCcHHHH Q lcl|NC_019421. 1 MATGTWNEKERKEIPGFYNRFKTQAEK--STNTGLKGRLAMP--IRANW-GDVGKVVTIKNDLRQLKNLFGDDMNYSAFK 75 (473) Q Consensus 1 m~~g~~~~~~~~~~PGvYie~~~~~~~--~i~~~~~~~~~~~--g~a~~-Gp~~~~v~i~s~~~~~~~~fG~~~~~~~~~ 75 (473) ..+.+|.. .-||.+-|.+.-... ++.+..+-++.+- +..+. =.+|....|.+..+.+..++....+..... T Consensus 102 ~~~l~~~a----~~~G~~gN~i~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~v~si~y~g~~~~~~~~v~~~~~t~~ 177 (587) T protein:vir:95 102 IGGLKITS----KIYGNVANNIQVGLEKNTLSDSLRLRVIFQDDRFNEVYDNIGNIFTIKYKGEEANATFSVEHDEETQK 177 (587) T ss_pred ecCeEEEE----ecccccccceEEEEecCCCCCceeEEEEEecccceeeeeeccceeeeeeeccccccceeeeeccccee Confidence 56666654 668888887644322 2322222222221 11111 122444556544444444433222222345 Q ss_pred HHHHHHhcCCCEEEEEecCCCcccceeeeecccccccccceEEEEecCccccceeEEEee--ccCC-----------ccc Q lcl|NC_019421. 76 LGKLALLGNVKELLLYRLVDGNQKKGTLTLKDTTENSAKDVIKLETKYPTARNFNVTIKS--NLVD-----------SDK 142 (473) Q Consensus 76 ~v~~~f~~g~~~v~v~rv~~g~~~aat~~l~~~~~~~~~~~l~i~A~~~G~~~n~i~v~~--~~~~-----------~~~ 142 (473) +.+..|+.|+++++.|||++|..+.+.....+ ......++|+++|..+|.+.++. ...+ ... T Consensus 178 a~~~~l~~g~~~v~~yrL~~g~~~~~~~~~~~-----in~~~~~tAky~g~~~~~i~~~~~~~~~~~~v~~~~~~v~a~~ 252 (587) T protein:vir:95 178 ASRLVLKVGDQEVKSYDLTGGAYDYTNAIITD-----INQLPDFEAKLSPFGDKNLESSKLDKIENANIKDKAVYVKAVF 252 (587) T ss_pred eeeeeeecCCceEEEEEecCCchHHHHHHHHh-----hccccceEEEEecccCceeEEeecCcccccceehhhhhhhhhh Confidence 67888999999999999998876654433322 22345679999999988877652 1111 111 Q ss_pred eeeeeecCCceeeEEEeccc-chhhhh----------hhhhcccccc---eeEe-ecccCCcccccc-----------ce Q lcl|NC_019421. 143 KDFIFFENTKQLFSSSIKGT-IDEIVL----------EINSNLDNEY---VIAT-KVADSDTILANV-----------VN 196 (473) Q Consensus 143 ~~v~v~~~~~~~~~~~~~~~-~~~~~~----------~~~~~~~s~~---v~~~-~~~~~~~~~~~~-----------~~ 196 (473) .++..+.............. .+.... .......+.. +..+ ..++.+++.+.. .. T Consensus 253 ~d~~~~~~~~~~v~~~~~~g~~~~~~~~~~~~~~~~a~~~~~~~~~~~a~~~~t~LtGG~dG~~~~~y~~~l~ale~~~~ 332 (587) T protein:vir:95 253 GDLEKQTAYNGIVSFEQLNAEGEVPSNVEVEAGEESATVTATSPIKTIEPFELTKLKGGTNGEPPATWADKLDKFAHEGG 332 (587) T ss_pred cceeeeeeceeeeeeecccccceeccchhhhhcccchheeccccccceeccceeeeecCCCCCCcccHHHHHHHHHhCCc Confidence 12222222222222111110 000000 0000001100 0001 122233322110 01 Q ss_pred eeeccCcccccchhhHHHHHHHHhhcccce-EEEEEcCCCcHHHHHHHHHHHHHHhhCCCeEEEEEcCCCCccHHHHHHh Q lcl|NC_019421. 197 QALEGGNDGCTSITNESYLKALEEFERYSF-DSFVLDGVADEALQETTKAWVAKNKELGKDILLFLGGKTEDNIKQINDK 275 (473) Q Consensus 197 ~~l~gG~dg~~~~t~~d~~~~l~~le~~~~-~~l~~p~~~~~~~~~~l~~~v~~~~~~~~~~~av~~~~~~~t~~~~~~~ 275 (473) ..+...++.. .....+.+.++.+....- -..++++...++.. .+.++.+... ..++..+.++.. T Consensus 333 ~~i~~~t~d~--~v~a~l~a~vk~~~~~g~~~~aVvg~~~~~~~~-~~~~~a~~~n---~ervi~v~~~~~--------- 397 (587) T protein:vir:95 333 YYIVPLSSKQ--SVHAEVASFVKERSDAGEPMRAIVGGGFNESKE-QLFGRQESLS---NPRVSLVANSGT--------- 397 (587) T ss_pred EEEEecCCCH--HHHHHHHHHHHHHHhCCCcEEEEEcCCCCCCHH-HHHHHHhhcC---CCcEEEecccce--------- Confidence 1222222211 111223344444433221 22334433333332 2333433332 233443332210 Q ss_pred hhccCCceEEEecCCceecCcccchHHHHHHHHHhhhcCccccccceeccCcc------cccccCCHH---------HHH Q lcl|NC_019421. 276 SKSFNDENIVNVGSSAYYENIKYTPSEVAVYIAALSVSKGITGSICNAKTIFE------EVEPRLSQS---------EVK 340 (473) Q Consensus 276 ~~~~n~~~i~~~~~~~~~~~~~~~~~~~a~~vAG~~a~~~~~~s~t~~~~~~~------~~~~~~t~~---------e~~ 340 (473) ...++.+. ...+....++..|+.+||+....++..-+... .+.. +...-+.+. +.. T Consensus 398 -~~~~dg~~-------~~~~~~~~aa~vAGl~Ag~~~~~SlT~~~i~~-~~v~~~~t~~e~e~ai~~Gvl~l~~~~~~~~ 468 (587) T protein:vir:95 398 -FVMDDGRK-------NHVPAYMVAVALGGLASGLEIGESITFKPLRV-SSLDQIYESIDLDELNENGIISIEFVRNRTN 468 (587) T ss_pred -EecCCCce-------eeechHHHHHHHHHHHhcCchhcCccceeeec-ccccccCCHHHHHHHHhCCeEEEEEecCCcc Confidence 00011111 12223445667778888887766654322111 1110 000000000 000 Q ss_pred H--HHhCCcEEEEEcCCE----EEE-------Eeccc-ccc-cCCCCCcchhhhhhhhHHHHHHHHHHHHHHhhcCCccc Q lcl|NC_019421. 341 E--CLKSGTLVLDFDDGD----VII-------VDDVN-TFK-KYVDDKNEAMGYISNIMFINTINKDTSLKRKEFVGKIF 405 (473) Q Consensus 341 ~--l~~~G~~~l~~~~~~----~~i-------~~gi~-T~~-~~~~~~~~~~~~i~v~R~~d~i~~~i~~~~~~~ig~~~ 405 (473) . -+.++++.+..+.+. +++ .++|+ ++. .+-. +.+--+....|...|..+++++.. T Consensus 469 ~~vriv~~itT~t~~~d~~~~~i~viRv~D~i~~dir~~~~~~~iG-------k~nn~~~r~~v~~~i~~~L~~l~~--- 538 (587) T protein:vir:95 469 TFFRIVDDVTTFNDKSDPVKAEMAVGEANDFLVSELKVQLEDQFIG-------TRTINTSASIIKDFIQSYLGRKKR--- 538 (587) T ss_pred eEEEEeecceeccCCCCcchhhhhhhhhHHHHHHHHHHHHHhhCCc-------cccchHHHHHHHHHHHHHHHHHHh--- Confidence 0 012344444322221 111 11111 000 0000 001111222222222222211100 Q ss_pred CCHHHHHHHHHHHHHHHHHHHhcC-CccCccceeccccccCCCCCEEEEEEEEEEeeeeee Q lcl|NC_019421. 406 NDATGQTTVICALKKYFEELMSQG-IISEFNVDIDTELQATAKADEFYWKWDAVKVDVMKK 465 (473) Q Consensus 406 N~~~~r~~i~~~i~~~l~~l~~~g-~i~~~~~~~D~~~~~~~~~d~~~v~i~v~p~~~~e~ 465 (473) ...|..|=. +.-. -+++-.+.++..+++..--+.+++.+.++|--. +- T Consensus 539 ---------~gaI~~~~~--~dv~v~~~~d~~~v~~~v~Pv~~mekI~vt~~~~~~~~-~~ 587 (587) T protein:vir:95 539 ---------DNEIQDFPA--EDVQVIVEGNEARISMTVYPIRSFKKISVSLVYKQQTL-QA 587 (587) T ss_pred ---------CCcccCCCc--cceEEEecCCEEEEEEEEEEcccceEEEEEEEEeeeee-cC Confidence 000111100 0000 012223445556677777788999988866332 11 No 74 >protein:vir:99306 Length: 587 # NCBI annotation: putative major tail sheath protein # Family: family:all:2449 # MgeID: mge:1655 # MgeName: K # Cross-refs: genbank:acc:YP_024479;genbank:gi:48696438;genbank:GeneID:2948034 Probab=95.06 E-value=0.0029 Score=34.53 Aligned_cols=411 Identities=9% Similarity=0.016 Sum_probs=152.6 Q ss_pred CCccccCCCCceecCceeEEEecCCcc--eecccCceEEEEEE--eeC-CCCCCceEEeeccHHHHHHHcCCCcCcHHHH Q lcl|NC_019421. 1 MATGTWNEKERKEIPGFYNRFKTQAEK--STNTGLKGRLAMPI--RAN-WGDVGKVVTIKNDLRQLKNLFGDDMNYSAFK 75 (473) Q Consensus 1 m~~g~~~~~~~~~~PGvYie~~~~~~~--~i~~~~~~~~~~~g--~a~-~Gp~~~~v~i~s~~~~~~~~fG~~~~~~~~~ 75 (473) ..+.+|.. ..||.+-|.+.-... ++.+..+-.+-|-- ..+ .=.+|....|.+..+.+...+....+....+ T Consensus 102 ~~~l~~~a----~~~G~~gN~i~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~v~~i~y~g~~~~a~~~v~~~~~t~~ 177 (587) T protein:vir:99 102 IGGLKITS----KIYGNVANNIQVGLEKNTLSDSLRLRVIFQDDRFNEVYDNIGNIFTIKYKGEEANATFSVEHDEETQK 177 (587) T ss_pred ecCeEEEE----eeccccccceEEEEccCCCCcceeEEEEEecccceeeeeeccceeeEEeecccccceeeEeecCccee Confidence 66777764 678998886644222 22222221111110 001 1122444567654454554443322223345 Q ss_pred HHHHHHhcCCCEEEEEecCCCcccceeeeecccccccccceEEEEecCccccceeEEEeec-------------cCCccc Q lcl|NC_019421. 76 LGKLALLGNVKELLLYRLVDGNQKKGTLTLKDTTENSAKDVIKLETKYPTARNFNVTIKSN-------------LVDSDK 142 (473) Q Consensus 76 ~v~~~f~~g~~~v~v~rv~~g~~~aat~~l~~~~~~~~~~~l~i~A~~~G~~~n~i~v~~~-------------~~~~~~ 142 (473) +.+..|+.|+++++.|||++|..+.+.....+ ......++|+++|..++.+.++.. ..++.. T Consensus 178 a~~~~l~~g~~~v~~yrL~~g~~~~~~~~~~~-----i~~~~~~tAky~~~~~~~i~~~~~~~~~~~~v~~~~~~v~a~~ 252 (587) T protein:vir:99 178 ASRLVLKVGDQEVKSYDLTGGAYDYTNAIITD-----INQLPDFEAKLSPFGDKNLESSKLDKIENANIKDKAVYVKAVF 252 (587) T ss_pred eeeeeeecCCceeEEEEecCCchHHHHHHHhh-----hccccceeEEeeccCCceeEeecccccccceeeeeeeeeehhc Confidence 67888999999999999998876654433322 123445789999988887765321 111122 Q ss_pred eeeeeecCCceeeEEEecccchhhhhhhhhc--ccccceeE------------e-ecccCCcccccc-----------ce Q lcl|NC_019421. 143 KDFIFFENTKQLFSSSIKGTIDEIVLEINSN--LDNEYVIA------------T-KVADSDTILANV-----------VN 196 (473) Q Consensus 143 ~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~--~~s~~v~~------------~-~~~~~~~~~~~~-----------~~ 196 (473) .++..+.........+......+........ .....+.. + ..++.+++.+.. .. T Consensus 253 ~D~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~t~LtGG~dG~~~~sy~~al~ale~~~~ 332 (587) T protein:vir:99 253 GDLEKQTAYNGIVSFEQLNAEGEVPSNVEVEAGEESATVTATSPIKTIEPFELTKLKGGTNGEPPATWADKLDKFAHEGG 332 (587) T ss_pred cceeeecccceeeeeeecccccchhhhhhhhhccccceeeeeccccceecccceeeecCCCCCccccHHHHHHHHhhCCc Confidence 2222222222222221111111110000000 00000110 1 122223322110 01 Q ss_pred eeeccCcccccchhhHHHHHHHHhhcccce-EEEEEcCCCcHHHHHHHHHHHHHHhhCCCeEEEEEcCCCCccHHHHHHh Q lcl|NC_019421. 197 QALEGGNDGCTSITNESYLKALEEFERYSF-DSFVLDGVADEALQETTKAWVAKNKELGKDILLFLGGKTEDNIKQINDK 275 (473) Q Consensus 197 ~~l~gG~dg~~~~t~~d~~~~l~~le~~~~-~~l~~p~~~~~~~~~~l~~~v~~~~~~~~~~~av~~~~~~~t~~~~~~~ 275 (473) ..+...++. ......+.+.++.+....- -..++++..+.+.. .+.++.+... ..++..+..+. T Consensus 333 ~~i~~~t~d--~~i~a~l~a~vk~~r~~g~~~~aVlg~~~~~~~~-~~~~~a~~~n---~e~vi~v~~~~---------- 396 (587) T protein:vir:99 333 YYIVPLSSK--QSVHAEVASFVKERSDAGEPMRAIVGGGFNESKE-QLFGRQASLS---NPRVSLVANSG---------- 396 (587) T ss_pred EEEEecCCC--HHHHHHHHHHHHHHHhCCCcEEEEecCCCCCCHH-HHHHHhhhcC---CCcEEEEeccc---------- Confidence 112222221 1111223344444433221 22334433333332 2333433332 23333332210 Q ss_pred hhccCCceEEEecCCceecCcccchHHHHHHHHHhhhcCcccccccee-----ccCcccccccCCHH---------HHH- Q lcl|NC_019421. 276 SKSFNDENIVNVGSSAYYENIKYTPSEVAVYIAALSVSKGITGSICNA-----KTIFEEVEPRLSQS---------EVK- 340 (473) Q Consensus 276 ~~~~n~~~i~~~~~~~~~~~~~~~~~~~a~~vAG~~a~~~~~~s~t~~-----~~~~~~~~~~~t~~---------e~~- 340 (473) ..+.-.......+....++..|+.+||+....++..-.... .+.-.+....+.+. ... T Consensus 397 -------~~~~~dg~~~~~~~~~~aa~vAGl~Ag~~~~~SlT~~~i~~~~v~~~~t~~e~e~li~~Gvl~l~~~~~~~~~ 469 (587) T protein:vir:99 397 -------TFVMDDGRKNHVPAYMVAVALGGLASGLEIGESITFKPLRVSSLDQIYESIDLDELNENGIISIEFVRNRTNT 469 (587) T ss_pred -------eEecCCCceeeechHHHHHHHHHHHhcCchhcCccceeeecccccccCCHHHHHHHHhCCeEEEEEecCCcce Confidence 00100111122233445667778888887666654322111 00000000000000 000 Q ss_pred -HHHhCCcEEEEEcCC----EEEEE-------ecccc-cc-cCCCCCcchhhhhhhhHHHHHHHHHHHHHHhhcCCcccC Q lcl|NC_019421. 341 -ECLKSGTLVLDFDDG----DVIIV-------DDVNT-FK-KYVDDKNEAMGYISNIMFINTINKDTSLKRKEFVGKIFN 406 (473) Q Consensus 341 -~l~~~G~~~l~~~~~----~~~i~-------~gi~T-~~-~~~~~~~~~~~~i~v~R~~d~i~~~i~~~~~~~ig~~~N 406 (473) .-+-++++.+..+.+ +++++ ++|+. +. .+-. +.+--+....|...|..+++++.. T Consensus 470 ~vriv~~ItT~t~~~~~~~~~i~viRv~D~i~~di~~~~~~~yiG-------k~Nn~~~r~~i~~~i~~~L~~l~~---- 538 (587) T protein:vir:99 470 FFRIVDDVTTFNDKSDPVKAEMAVGEANDFLVSELKVQLEDQFIG-------TRTINTSASIIKDFIQSYLGRKKR---- 538 (587) T ss_pred EEEEeeceeeccCCCCchhhhhhhhhhHHHHHHHHHHHHHhhCCc-------cccchHHHHHHHHHHHHHHHHHHh---- Confidence 011234444432222 11111 11110 00 0000 001112222222222222211100 Q ss_pred CHHHHHHHHHHHHHHHHHHHhcC-CccCccceeccccccCCCCCEEEEEEEEEEeeeee Q lcl|NC_019421. 407 DATGQTTVICALKKYFEELMSQG-IISEFNVDIDTELQATAKADEFYWKWDAVKVDVMK 464 (473) Q Consensus 407 ~~~~r~~i~~~i~~~l~~l~~~g-~i~~~~~~~D~~~~~~~~~d~~~v~i~v~p~~~~e 464 (473) ...|..| +.+.-. -.++=.+.++..+++-.--+.+++.+-++|--.-- T Consensus 539 --------~gaI~~~--~~~dv~v~~~~d~~~v~~~v~Pv~~mekIy~tv~~~~~~~~~ 587 (587) T protein:vir:99 539 --------DNEIQDF--PAEDVQVIVEGNEARISMTVYPIRSFKKISVSLVYKQQTLQA 587 (587) T ss_pred --------CCcccCC--CccceEEEecCCEEEEEEEEEEcccceEEEEEEEEEeeeecC Confidence 0001111 000000 01222234455677777788899998886643211 Done!