Query lcl|NC_015266.1_cdsid_YP_004306433.1 [gene=21] [protein=gp21] [protein_id=YP_004306433.1] [location=complement(14103..15275)]
Match_columns 390
No_of_seqs 167 out of 766
Neff 9.3
Searched_HMMs 1612
Date Thu Nov 7 12:58:32 2013
Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_21 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_21_vs_rec_db.hhr

 No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
  1 protein:vir:103993 Length: 390 100.0 3E-116 2E-119 654.0 36.3 390 1-390 1-390 (390)
  2 protein:vir:78206 Length: 390 100.0 3E-116 2E-119 654.0 36.3 390 1-390 1-390 (390)
  3 protein:vir:79181 Length: 390 100.0 5E-115 3E-118 647.1 36.5 390 1-390 1-390 (390)
  4 protein:vir:79141 Length: 391 100.0 2E-114 2E-117 643.4 36.1 390 1-390 1-390 (391)
  5 protein:vir:1172 Length: 391 # 100.0 1E-112 6E-116 634.7 35.2 390 1-390 2-391 (391)
  6 protein:vir:100323 Length: 393 100.0 2E-112 1E-115 632.9 34.9 388 1-390 3-392 (393)
  7 protein:vir:98553 Length: 395 100.0 7E-111 4E-114 624.5 37.2 389 1-390 1-395 (395)
  8 protein:vir:1845 Length: 392 # 100.0 1E-110 8E-114 623.1 36.9 389 1-390 1-392 (392)
  9 protein:vir:2035 Length: 396 # 100.0 2E-110 1E-113 621.7 35.8 389 1-390 1-395 (396)
 10 protein:vir:5711 Length: 396 # 100.0 4E-110 3E-113 620.2 36.2 389 1-390 1-395 (396)
 11 protein:vir:6079 Length: 396 # 100.0 8E-110 5E-113 618.6 36.3 389 1-390 1-395 (396)
 12 protein:vir:10336 Length: 386 100.0 5E-107 3E-110 603.3 34.7 384 1-385 1-386 (386)
 13 protein:vir:107865 Length: 477 100.0 2E-102 1E-105 578.5 34.5 383 1-388 1-477 (477)
 14 protein:vir:79092 Length: 477 100.0 1E-101 6E-105 574.2 34.3 383 1-388 1-477 (477)
 15 protein:vir:96740 Length: 388 100.0 1.3E-99 8E-103 562.7 33.8 374 1-389 1-388 (388)
 16 protein:vir:80984 Length: 666 100.0 1.9E-86 1.2E-89 490.5 35.1 380 2-390 1-665 (666)
 17 protein:vir:6594 Length: 666 # 100.0 7.3E-86 4.5E-89 487.3 35.0 380 2-390 1-665 (666)
 18 protein:vir:98263 Length: 664 100.0 7.6E-86 4.7E-89 487.2 33.5 375 1-390 1-660 (664)
 19 protein:vir:6894 Length: 660 # 100.0 1.6E-85 9.6E-89 485.5 34.4 380 2-390 1-660 (660)
 20 protein:vir:106984 Length: 743 100.0 2E-85 1.2E-88 484.9 34.3 379 1-387 1-743 (743)
 21 protein:vir:103456 Length: 659 100.0 9E-85 5.6E-88 481.4 33.6 378 2-390 1-656 (659)
 22 protein:vir:108052 Length: 660 100.0 1.3E-84 8E-88 480.5 34.4 379 2-389 1-660 (660)
 23 protein:vir:106427 Length: 679 100.0 2.1E-84 1.3E-87 479.3 34.8 380 2-390 1-679 (679)
 24 protein:vir:7206 Length: 659 # 100.0 3E-84 1.9E-87 478.5 33.6 379 2-389 1-659 (659)
 25 protein:vir:101187 Length: 663 100.0 3.3E-84 2E-87 478.3 33.6 377 2-390 1-662 (663)
 26 protein:vir:104858 Length: 729 100.0 5.7E-84 3.5E-87 477.0 34.2 379 1-388 1-729 (729)
 27 protein:vir:101804 Length: 663 100.0 4.4E-83 2.7E-86 472.1 33.0 380 2-390 1-662 (663)
 28 protein:vir:5663 Length: 671 # 100.0 1.3E-82 7.8E-86 469.6 32.4 375 2-390 1-671 (671)
 29 protein:vir:100539 Length: 663 100.0 7.4E-82 4.6E-85 465.4 32.4 380 2-390 1-662 (663)
 30 protein:vir:104477 Length: 749 100.0 6.5E-81 4.1E-84 460.2 33.1 374 1-386 1-749 (749)
 31 protein:vir:98824 Length: 774 100.0 3.4E-80 2.1E-83 456.3 29.0 374 1-387 279-774 (774)
 32 protein:vir:5833 Length: 742 # 100.0 9.8E-75 6.1E-78 426.3 26.8 363 1-386 343-742 (742)
 33 protein:vir:79798 Length: 717 100.0 3.7E-50 2.3E-53 291.6 27.9 324 1-378 330-717 (717)
 34 protein:vir:63742 Length: 562 100.0 7.2E-40 4.5E-43 235.2 27.2 360 1-383 8-562 (562)
 35 protein:vir:80779 Length: 569 100.0 9.3E-39 5.8E-42 229.1 25.7 360 1-383 1-569 (569)
 36 protein:vir:80488 Length: 562 100.0 2E-38 1.2E-41 227.3 26.8 360 1-383 1-562 (562)
 37 protein:vir:103168 Length: 641 100.0 2.5E-36 1.5E-39 215.8 19.8 267 1-280 3-641 (641)
 38 protein:vir:95741 Length: 587 100.0 1E-34 6.4E-38 206.9 26.7 360 1-383 1-587 (587)
 39 protein:vir:99306 Length: 587 100.0 4E-34 2.5E-37 203.7 25.8 360 1-383 1-587 (587)
 40 protein:vir:96586 Length: 587 100.0 8.7E-34 5.4E-37 201.8 26.8 360 1-383 1-587 (587)
 41 protein:vir:107310 Length: 581 100.0 8.6E-34 5.3E-37 201.9 23.6 360 1-390 177-580 (581)
 42 protein:vir:102819 Length: 648 100.0 1.1E-32 7E-36 195.7 28.6 357 1-381 1-648 (648)
 43 protein:vir:7653 Length: 581 # 100.0 1.4E-33 8.8E-37 200.7 22.8 364 1-390 159-580 (581)
 44 protein:vir:100829 Length: 607 99.9 9.4E-29 5.8E-32 174.2 25.1 365 1-389 17-607 (607)
 45 protein:vir:102957 Length: 437 99.9 3.3E-28 2.1E-31 171.2 26.1 356 1-377 1-437 (437)
 46 protein:vir:105470 Length: 451 99.9 1.2E-22 7.4E-26 140.8 23.7 356 1-377 1-451 (451)
 47 protein:vir:101326 Length: 529 99.8 3.9E-22 2.4E-25 137.9 20.1 351 1-378 112-529 (529)
 48 protein:vir:78986 Length: 436 99.6 1.1E-16 6.6E-20 108.2 22.3 354 1-377 3-436 (436)
 49 protein:vir:102359 Length: 356 99.1 7.7E-12 4.8E-15 81.5 15.0 323 1-376 1-356 (356)
 50 protein:vir:3751 Length: 376 # 98.6 5.9E-08 3.7E-11 60.2 20.6 336 5-383 1-376 (376)
 51 protein:vir:5260 Length: 502 # 98.6 2.1E-07 1.3E-10 57.2 22.4 358 1-378 68-502 (502)
 52 protein:vir:80052 Length: 331 98.5 5.2E-07 3.2E-10 55.0 24.5 312 1-378 1-331 (331)
 53 protein:vir:95263 Length: 450 98.5 5.3E-07 3.3E-10 55.0 25.1 358 1-379 1-450 (450)
 54 protein:vir:3788 Length: 376 # 98.5 2E-07 1.2E-10 57.3 19.8 342 5-383 1-376 (376)
 55 protein:vir:3165 Length: 426 # 98.3 1.5E-06 9.3E-10 52.5 20.8 356 1-378 1-426 (426)
 56 protein:vir:78782 Length: 370 98.2 1E-06 6.4E-10 53.4 19.0 341 5-385 1-370 (370)
 57 protein:vir:4517 Length: 498 # 98.2 3.4E-07 2.1E-10 56.0 15.3 357 1-390 1-498 (498)
 58 protein:vir:276 Length: 369 # 98.2 3.4E-06 2.1E-09 50.6 21.0 334 1-381 1-369 (369)
 59 protein:vir:4463 Length: 498 # 98.1 3.8E-07 2.4E-10 55.8 14.9 360 1-390 1-498 (498)
 60 protein:vir:489 Length: 498 # 98.0 8.7E-07 5.4E-10 53.8 14.9 354 1-390 1-498 (498)
 61 protein:vir:1996 Length: 495 # 97.3 0.0001 6.2E-08 42.5 20.8 353 1-378 1-495 (495)
 62 protein:vir:106730 Length: 501 96.7 0.00037 2.3E-07 39.4 18.6 351 1-378 66-501 (501)
 63 protein:vir:78611 Length: 501 95.6 0.0018 1.1E-06 35.6 19.5 350
1-378 66-501 (501) 64 protein:vir:99586 Length: 507 92.0 0.013 8.4E-06 30.8 19.3 355 1-377 63-507 (507) 65 protein:vir:96104 Length: 504 91.9 0.014 8.6E-06 30.8 20.0 354 1-377 63-504 (504) 66 protein:vir:3636 Length: 501 # 90.7 0.02 1.2E-05 29.9 23.1 364 1-378 1-501 (501) 67 protein:vir:101576 Length: 501 89.4 0.026 1.6E-05 29.2 24.3 364 1-378 1-501 (501) 68 protein:vir:94073 Length: 494 88.1 0.034 2.1E-05 28.6 22.2 359 1-378 1-494 (494) No 1 >protein:vir:103993 Length: 390 # NCBI annotation: phage tail sheath protein # Family: family:all:115 # MgeID: mge:1665 # MgeName: phi52237 # Cross-refs: genbank:acc:YP_293729;genbank:gi:72537699;genbank:GeneID:3608123 Probab=100.00 E-value=2.9e-116 Score=654.02 Aligned_cols=390 Identities=83% Similarity=1.299 Sum_probs=383.8 Q ss_pred CCCccCCCEEEEECCCCCccccccccccceeeeccccccccceeccceEEEechhHHHHhhccccchHHHHHhhhcccCc Q lcl|NC_015266. 1 MPQDYHHGVRVIEINEGGRPIRTVSTAVLGIVCTGADADPATFPLDTPVLLTNVIAALGKAGTKGTLRRTLDAIGKQTKP 80 (390) Q Consensus 1 Ma~~~~hGV~v~ev~~~~~~i~~v~tavig~vgta~~~~~~~~~~~~~~~i~~~~~~~~~~~~~gtl~~al~~~~~~~~~ 80 (390) |+++|+||||++|++++++++.++++++++++|+++++++..+|+++|+++++..++...+++.|+|.++++.++++++. T Consensus 1 M~~~~~~Gv~v~e~~~g~~~i~~~~tav~g~vg~a~~ad~~~~pln~pv~i~s~~~~~~~~g~~gtL~~al~~~~~~gg~ 80 (390) T protein:vir:10 1 MPQDYHHGVRVIEINEGGRPIRSVSTAVLGVVCTAADADASAFPLNTPVLLTNVVAALGKAGKKGTLRRTLDAIGKQTKP 80 (390) T ss_pred CcccccCCeEEEEcCCCcccccccCcceeEEEEcccCcCccccccccceEeccHHHHHhhcCCCceehhhhhhhccccCc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eEEEEeeccccccccccccchhhhccchhhhhhhhhhhhhhhhhhhhhhhhhhhhcchHHHHHHHHhhhhcceEEeeccc Q lcl|NC_015266. 
81 VTVVVRVAEGKDEAETTANVIGTVTPDGKYTGMKALLAAQGKLAVKPRILVAPGLDTQPVAAAFATIAQSLRAMVYVAAH 160 (390) Q Consensus 81 ~~~vv~v~~~~~~~~~~~~~~~~~~~~~~~tgl~~~~~~~~~~~~~p~~~~apg~~~~~v~~al~~~~~~~~~~~~~d~~ 160 (390) .++++++.+..+...+..+++++.+..+..+|++++...++.++..|.++.+|++++++|+++|..+|+++++++++|.+ T Consensus 81 ~~~vv~v~~~~~~~~~~~~~ig~~~~~~~~tg~~al~~~~~~~~~~p~il~ap~~~~~~v~~~l~~~a~~~~~~aivD~p 160 (390) T protein:vir:10 81 LTVVVRVAEGKDADETTSNVIGTVTPDGKYTGIKALLAAQGALGVKPRILAAPGLDTQPVAAALAATAQSLRAMAYVSAS 160 (390) T ss_pred eEEEEEecccccccccccccccccccccccchhhhhhhhhhhhcceehhhcccccchHHHHHHHHHhhcccceEEEEecC Confidence 99999999999999999999999988889999999999999999999999999999999999999999999999999999 Q ss_pred ccCchHHHHHHhhhhccceEEEEeeeeEEEeeccCceeEecHHHHHHHHHhhhhhccceeecccCceeecccccccccch Q lcl|NC_015266. 161 GCKTKEEAVAYRKQFGQREIMVIWPDWLGWDDITNSTVAIPAPAIAAGLRAKIDNDIGWHKTLSNVVVNGVTGISADVSW 240 (390) Q Consensus 161 ~~~~~~~a~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~p~s~~vAg~~a~~d~~~g~~~span~~l~gv~~~~~~~~~ 240 (390) ++.+.+++++|+++++++++++||||++++++.++...++|||+++||+++++|+++|||+||||++|.|+.++++++++ T Consensus 161 ~~~t~~~a~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~~Agl~a~~D~~~g~~~spaN~~l~gi~~~~~~~~~ 240 (390) T protein:vir:10 161 GCKTKEEAAAYRKQFGQREIMVIWPDWLGWDDTTNSTAVIPAPAIAAGLRAKIDNDIGWHKTISNVVVNGVSGISADVSW 240 (390) T ss_pred CCCCHHHHHHHhhccCCceEEEEcCceEeecccCCcccccchHHHHHHHHHHhhcCCCcEECcCCceeeceeecceeccc Confidence 89999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhhccccccccccccceeEEEcCCCEEEEccccCCCCcccceeeehhhHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHH Q lcl|NC_015266. 
241 DLQDPATDAGYLNENQVTTLVNRNGFRFWGSRTCDADGKFFFENYTRSAQVIADTIAEEQMGVVDGPLNPSRARDIIENI 320 (390) Q Consensus 241 ~~~~~~~~~~~l~~~gI~~~~~~~G~~~wG~rT~~~d~~~~~i~vrR~~~~i~~~l~~~~~~~v~e~n~~~~~~~i~~~i 320 (390) .+++.+++++.||.+||+++++++||++||+||+++|++|+||++||++++|+++|++.++|++||||++.+|++|++++ T Consensus 241 ~~~~~~~~~~~ln~~gi~t~~~~~G~~~wG~rT~s~d~~~~~i~~rR~~~~i~~~i~~~~~~~v~e~n~~~~~~~i~~~i 320 (390) T protein:vir:10 241 DLQDPATDAGYLNEHEVTTLVNRNGFRFWGERTCSDDPKFAFENYTRTAQVAGDSIAEAQMPVVDGPLNPSLARDIVESI 320 (390) T ss_pred ccccccchhhhhhhcCcEEEEcCCCEEEEcccccCCCcccceeehhhHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHhCCceeeeEEEEecCCCCHHHHhCCeEEEEEEEEecccceEEEEEEEEcchHHHHHHHHhcC Q lcl|NC_015266. 321 NAWFRREVSVGELIGGGAWYDPEPNTTDELTSGGTWIDYDYTPVPPLENLKLRQRITDRYLADFASRVSA 390 (390) Q Consensus 321 ~~~L~~l~~~g~l~g~~v~~d~~~nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~l~~~~~a 390 (390) ++||++||++|+|+||+|+||+++||+++|++|+|+++|+++|++|+|||+|+++++++|+++|+++|+| T Consensus 321 ~~~L~~l~~~g~l~g~~v~~d~~~nt~~~i~~G~~~~~v~~~p~~pae~I~~~~~~~~~~~~~~~~~~~~ 390 (390) T protein:vir:10 321 NGWFRQQVANGYLIGGSAWIDPEPNTADILASGKAYIDYDYTPVPPLENLVLRQRITDRFLADFPARVAG 390 (390) T ss_pred HHHHHHHHhCCceeeeEEEEccCCCCHHHhhCCeEEEEEEEEecCCcceEEEEEEEchHHHHHHHHHhcC Confidence 9999999999999999999999999999999999999999999999999999999999999999999999 No 2 >protein:vir:78206 Length: 390 # NCBI annotation: gp33, phage tail sheath protein # Family: family:all:115 # MgeID: mge:1848 # MgeName: phiE12-2 # Cross-refs: genbank:acc:YP_001111183;genbank:gi:134288694;genbank:GeneID:4960666 Probab=100.00 E-value=2.9e-116 Score=654.02 Aligned_cols=390 Identities=83% Similarity=1.299 Sum_probs=383.8 Q ss_pred CCCccCCCEEEEECCCCCccccccccccceeeeccccccccceeccceEEEechhHHHHhhccccchHHHHHhhhcccCc Q lcl|NC_015266. 
1 MPQDYHHGVRVIEINEGGRPIRTVSTAVLGIVCTGADADPATFPLDTPVLLTNVIAALGKAGTKGTLRRTLDAIGKQTKP 80 (390) Q Consensus 1 Ma~~~~hGV~v~ev~~~~~~i~~v~tavig~vgta~~~~~~~~~~~~~~~i~~~~~~~~~~~~~gtl~~al~~~~~~~~~ 80 (390) |+++|+||||++|++++++++.++++++++++|+++++++..+|+++|+++++..++...+++.|+|.++++.++++++. T Consensus 1 M~~~~~~Gv~v~e~~~g~~~i~~~~tav~g~vg~a~~ad~~~~pln~pv~i~s~~~~~~~~g~~gtL~~al~~~~~~gg~ 80 (390) T protein:vir:78 1 MPQDYHHGVRVIEINEGGRPIRSVSTAVLGVVCTAADADASAFPLNTPVLLTNVVAALGKAGKKGTLRRTLDAIGKQTKP 80 (390) T ss_pred CcccccCCeEEEEcCCCcccccccCcceeEEEEcccCcCccccccccceEeccHHHHHhhcCCCceehhhhhhhccccCc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eEEEEeeccccccccccccchhhhccchhhhhhhhhhhhhhhhhhhhhhhhhhhhcchHHHHHHHHhhhhcceEEeeccc Q lcl|NC_015266. 81 VTVVVRVAEGKDEAETTANVIGTVTPDGKYTGMKALLAAQGKLAVKPRILVAPGLDTQPVAAAFATIAQSLRAMVYVAAH 160 (390) Q Consensus 81 ~~~vv~v~~~~~~~~~~~~~~~~~~~~~~~tgl~~~~~~~~~~~~~p~~~~apg~~~~~v~~al~~~~~~~~~~~~~d~~ 160 (390) .++++++.+..+...+..+++++.+..+..+|++++...++.++..|.++.+|++++++|+++|..+|+++++++++|.+ T Consensus 81 ~~~vv~v~~~~~~~~~~~~~ig~~~~~~~~tg~~al~~~~~~~~~~p~il~ap~~~~~~v~~~l~~~a~~~~~~aivD~p 160 (390) T protein:vir:78 81 LTVVVRVAEGKDADETTSNVIGTVTPDGKYTGIKALLAAQGALGVKPRILAAPGLDTQPVAAALAATAQSLRAMAYVSAS 160 (390) T ss_pred eEEEEEecccccccccccccccccccccccchhhhhhhhhhhhcceehhhcccccchHHHHHHHHHhhcccceEEEEecC Confidence 99999999999999999999999988889999999999999999999999999999999999999999999999999999 Q ss_pred ccCchHHHHHHhhhhccceEEEEeeeeEEEeeccCceeEecHHHHHHHHHhhhhhccceeecccCceeecccccccccch Q lcl|NC_015266. 
161 GCKTKEEAVAYRKQFGQREIMVIWPDWLGWDDITNSTVAIPAPAIAAGLRAKIDNDIGWHKTLSNVVVNGVTGISADVSW 240 (390) Q Consensus 161 ~~~~~~~a~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~p~s~~vAg~~a~~d~~~g~~~span~~l~gv~~~~~~~~~ 240 (390) ++.+.+++++|+++++++++++||||++++++.++...++|||+++||+++++|+++|||+||||++|.|+.++++++++ T Consensus 161 ~~~t~~~a~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~~Agl~a~~D~~~g~~~spaN~~l~gi~~~~~~~~~ 240 (390) T protein:vir:78 161 GCKTKEEAAAYRKQFGQREIMVIWPDWLGWDDTTNSTAVIPAPAIAAGLRAKIDNDIGWHKTISNVVVNGVSGISADVSW 240 (390) T ss_pred CCCCHHHHHHHhhccCCceEEEEcCceEeecccCCcccccchHHHHHHHHHHhhcCCCcEECcCCceeeceeecceeccc Confidence 89999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhhccccccccccccceeEEEcCCCEEEEccccCCCCcccceeeehhhHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHH Q lcl|NC_015266. 241 DLQDPATDAGYLNENQVTTLVNRNGFRFWGSRTCDADGKFFFENYTRSAQVIADTIAEEQMGVVDGPLNPSRARDIIENI 320 (390) Q Consensus 241 ~~~~~~~~~~~l~~~gI~~~~~~~G~~~wG~rT~~~d~~~~~i~vrR~~~~i~~~l~~~~~~~v~e~n~~~~~~~i~~~i 320 (390) .+++.+++++.||.+||+++++++||++||+||+++|++|+||++||++++|+++|++.++|++||||++.+|++|++++ T Consensus 241 ~~~~~~~~~~~ln~~gi~t~~~~~G~~~wG~rT~s~d~~~~~i~~rR~~~~i~~~i~~~~~~~v~e~n~~~~~~~i~~~i 320 (390) T protein:vir:78 241 DLQDPATDAGYLNEHEVTTLVNRNGFRFWGERTCSDDPKFAFENYTRTAQVAGDSIAEAQMPVVDGPLNPSLARDIVESI 320 (390) T ss_pred ccccccchhhhhhhcCcEEEEcCCCEEEEcccccCCCcccceeehhhHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHhCCceeeeEEEEecCCCCHHHHhCCeEEEEEEEEecccceEEEEEEEEcchHHHHHHHHhcC Q lcl|NC_015266. 
321 NAWFRREVSVGELIGGGAWYDPEPNTTDELTSGGTWIDYDYTPVPPLENLKLRQRITDRYLADFASRVSA 390 (390) Q Consensus 321 ~~~L~~l~~~g~l~g~~v~~d~~~nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~l~~~~~a 390 (390) ++||++||++|+|+||+|+||+++||+++|++|+|+++|+++|++|+|||+|+++++++|+++|+++|+| T Consensus 321 ~~~L~~l~~~g~l~g~~v~~d~~~nt~~~i~~G~~~~~v~~~p~~pae~I~~~~~~~~~~~~~~~~~~~~ 390 (390) T protein:vir:78 321 NGWFRQQVANGYLIGGSAWIDPEPNTADILASGKAYIDYDYTPVPPLENLVLRQRITDRFLADFPARVAG 390 (390) T ss_pred HHHHHHHHhCCceeeeEEEEccCCCCHHHhhCCeEEEEEEEEecCCcceEEEEEEEchHHHHHHHHHhcC Confidence 9999999999999999999999999999999999999999999999999999999999999999999999 No 3 >protein:vir:79181 Length: 390 # NCBI annotation: gp30, phage tail sheath protein # Family: family:all:115 # MgeID: mge:1866 # MgeName: phiE202 # Cross-refs: genbank:acc:YP_001111061;genbank:gi:134288772;genbank:GeneID:4960700 Probab=100.00 E-value=5.4e-115 Score=647.05 Aligned_cols=390 Identities=83% Similarity=1.300 Sum_probs=382.9 Q ss_pred CCCccCCCEEEEECCCCCccccccccccceeeeccccccccceeccceEEEechhHHHHhhccccchHHHHHhhhcccCc Q lcl|NC_015266. 1 MPQDYHHGVRVIEINEGGRPIRTVSTAVLGIVCTGADADPATFPLDTPVLLTNVIAALGKAGTKGTLRRTLDAIGKQTKP 80 (390) Q Consensus 1 Ma~~~~hGV~v~ev~~~~~~i~~v~tavig~vgta~~~~~~~~~~~~~~~i~~~~~~~~~~~~~gtl~~al~~~~~~~~~ 80 (390) |+++|+||||++|++++++++..+++++++|+|++++++...+|+++|+++++..++...+++.++|.++++.++.+++. T Consensus 1 M~~~~~~Gv~v~e~~~~~~~i~~~~tav~~~vg~a~dad~~~~p~n~pv~its~~~~~~~~g~~~tL~~al~~~~~~~~~ 80 (390) T protein:vir:79 1 MPQDYHHGVRVIEINEGGRPIRSVSTAVLGVVCTAADADASAFPLNTPVLLTNVVAALGKAGKKGTLRRTLDAIGKQTKP 80 (390) T ss_pred CccccCCCeEEEEcCCCcccccccCCceeEEEEecCCCCccccccccceEeecHHHHHHhcCCCccchhhhhhhcccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eEEEEeeccccccccccccchhhhccchhhhhhhhhhhhhhhhhhhhhhhhhhhhcchHHHHHHHHhhhhcceEEeeccc Q lcl|NC_015266. 
81 VTVVVRVAEGKDEAETTANVIGTVTPDGKYTGMKALLAAQGKLAVKPRILVAPGLDTQPVAAAFATIAQSLRAMVYVAAH 160 (390) Q Consensus 81 ~~~vv~v~~~~~~~~~~~~~~~~~~~~~~~tgl~~~~~~~~~~~~~p~~~~apg~~~~~v~~al~~~~~~~~~~~~~d~~ 160 (390) .++++++.+..+...+..+.+++.+..+..+|+++++..++..+..|.++.+|++++++|++++..+|+++++++++|.+ T Consensus 81 ~~~vv~v~~~~~~~~~~~~~ig~~~~~~~~tgl~al~~~~~~~~~~p~il~ap~~~~~~v~~~l~~~a~~~~~~ai~D~p 160 (390) T protein:vir:79 81 LTVVVRVAEGKDADETTSNVIGTVTPDGKYTGIKALLAAQGALGVKPRILAAPGLDTQPVAAALAATAQSLRAMAYVSAS 160 (390) T ss_pred eEEEEeeccccccccccceeeecccccccchhhhhhhhhhhhhccccccccCCcccchHHHHHHHHhhhhcceEEEEEcc Confidence 99999999999998888888888888889999999999999999999999999999999999999999999999999999 Q ss_pred ccCchHHHHHHhhhhccceEEEEeeeeEEEeeccCceeEecHHHHHHHHHhhhhhccceeecccCceeecccccccccch Q lcl|NC_015266. 161 GCKTKEEAVAYRKQFGQREIMVIWPDWLGWDDITNSTVAIPAPAIAAGLRAKIDNDIGWHKTLSNVVVNGVTGISADVSW 240 (390) Q Consensus 161 ~~~~~~~a~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~p~s~~vAg~~a~~d~~~g~~~span~~l~gv~~~~~~~~~ 240 (390) ++.+.+++.+|+++++|.++++||||++++++..+..+++|||+++||++|++|+++|||+||||++|.|+.++++++++ T Consensus 161 ~~~t~~~a~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~~Ag~~a~~D~~~g~~~spsN~~i~gi~~~~~~~~~ 240 (390) T protein:vir:79 161 GCKTKEEAAAYRRQFGQREIMVIWPDWLGWDDTTNSTAVIPAPAIAAGLRAKIDNDIGWHKTISNVVVNGVSGISADVSW 240 (390) T ss_pred CCCCHHHHHHHhcCCCCceEEEEcCceeecccccCceeEeehHHHHHHHHHhhhccCCcEEccCCceeeccceeeeeccc Confidence 89899999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhhccccccccccccceeEEEcCCCEEEEccccCCCCcccceeeehhhHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHH Q lcl|NC_015266. 
241 DLQDPATDAGYLNENQVTTLVNRNGFRFWGSRTCDADGKFFFENYTRSAQVIADTIAEEQMGVVDGPLNPSRARDIIENI 320 (390) Q Consensus 241 ~~~~~~~~~~~l~~~gI~~~~~~~G~~~wG~rT~~~d~~~~~i~vrR~~~~i~~~l~~~~~~~v~e~n~~~~~~~i~~~i 320 (390) .+++.+++++.||.+||+++++++||++||+||+++|++|+||++||+++||+++|++.++|++||||++.+|++|++++ T Consensus 241 ~~~~~~~~a~~Ln~~gi~t~~~~~G~~~wG~rT~~~d~~~~~i~vrR~~~~i~~~i~~~~~~~v~e~~~~~~~~~i~~~i 320 (390) T protein:vir:79 241 DLQDPATDAGYLNEHEVTTLVNRNGFRFWGERTCSDDPKFAFENYTRTAQVAADSIAEAQMPVVDGPLNPSLARDIVESI 320 (390) T ss_pred cccccchhhhhhhhcCcEEEEcCCCEEEEeccccCCCcccceeeehhhHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHhCCceeeeEEEEecCCCCHHHHhCCeEEEEEEEEecccceEEEEEEEEcchHHHHHHHHhcC Q lcl|NC_015266. 321 NAWFRREVSVGELIGGGAWYDPEPNTTDELTSGGTWIDYDYTPVPPLENLKLRQRITDRYLADFASRVSA 390 (390) Q Consensus 321 ~~~L~~l~~~g~l~g~~v~~d~~~nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~l~~~~~a 390 (390) +.||++||++|+|+||+|+||+++||+++|++|+|+++|+++|++|+|||+++++++++|+++|+++|+| T Consensus 321 ~~~L~~l~~~gal~g~~v~~d~~~nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~~~~~v~~ 390 (390) T protein:vir:79 321 NGWFRQQVANGYLIGGSAWIDPEPNTADILASGKAYIDYDYTPVPPLENLVLRQRITDRFLADFPARVAG 390 (390) T ss_pred HHHHHHHHhCCceeeeEEEEecCCCCHHHhhCCEEEEEEEEEecCCcceEEEEEEEchHHHHHHHHHhcC Confidence 9999999999999999999999999999999999999999999999999999999999999999999999 No 4 >protein:vir:79141 Length: 391 # NCBI annotation: major tail seath protein gpFI # Family: family:all:115 # MgeID: mge:1863 # MgeName: RSA1 # Cross-refs: genbank:acc:YP_001165276;genbank:gi:145708101;genbank:GeneID:5247152 Probab=100.00 E-value=2.5e-114 Score=643.40 Aligned_cols=390 Identities=66% Similarity=1.055 Sum_probs=383.6 Q ss_pred CCCccCCCEEEEECCCCCccccccccccceeeeccccccccceeccceEEEechhHHHHhhccccchHHHHHhhhcccCc Q lcl|NC_015266. 
1 MPQDYHHGVRVIEINEGGRPIRTVSTAVLGIVCTGADADPATFPLDTPVLLTNVIAALGKAGTKGTLRRTLDAIGKQTKP 80 (390) Q Consensus 1 Ma~~~~hGV~v~ev~~~~~~i~~v~tavig~vgta~~~~~~~~~~~~~~~i~~~~~~~~~~~~~gtl~~al~~~~~~~~~ 80 (390) |+++|+||||++|++++++++..+++++++|+|+++.++...+|+++|+++++..++...+++.|++.++++.++++++. T Consensus 1 M~~~~~pGv~v~e~~~~~~~i~~~~tav~~~vg~a~~a~~~~~p~n~pv~iss~~~~~~~~g~~gtl~~al~~~~~~gg~ 80 (391) T protein:vir:79 1 MPTDYHHGVRVVELNDGTRPIRTIETAVAGIVCTADDADAATFPLDTPVLLTNPQAYIGKAGDKGTLAHTLDAITDQTNP 80 (391) T ss_pred CCCCCCCCeEEEECCCCcccccccCCceEEEEeecccccccccccccCEEeccHHHHHHhcCCccccchhhhhhhccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eEEEEeeccccccccccccchhhhccchhhhhhhhhhhhhhhhhhhhhhhhhhhhcchHHHHHHHHhhhhcceEEeeccc Q lcl|NC_015266. 81 VTVVVRVAEGKDEAETTANVIGTVTPDGKYTGMKALLAAQGKLAVKPRILVAPGLDTQPVAAAFATIAQSLRAMVYVAAH 160 (390) Q Consensus 81 ~~~vv~v~~~~~~~~~~~~~~~~~~~~~~~tgl~~~~~~~~~~~~~p~~~~apg~~~~~v~~al~~~~~~~~~~~~~d~~ 160 (390) .++++++....+...+..++.++.+.++..+|++++.+.++..+..|.++.+|++++.++++++..+|+++++++++|.+ T Consensus 81 ~~~vv~~~~~~~~~~~~~~~~g~~~~~~~~tGl~~l~~~~~~~~~~p~~l~~p~~~~~~v~~al~~~~~~~~~~ai~d~p 160 (391) T protein:vir:79 81 LTVVVRVAGGASEAETTSNLIGTTNAAGRYTGMKALLTARNRFGVAPRILAVPGLDSLPVGTELVTIAQKLRAFAYLSAY 160 (391) T ss_pred ceeeeccccccccccccccccccccchhhhHHHhhhhhhhhhhcccchhhcCCccchhHHHHHHHHHHhhcCcEEEEECC Confidence 99999999999999999999999988999999999999999999999999999999999999999999999999999999 Q ss_pred ccCchHHHHHHhhhhccceEEEEeeeeEEEeeccCceeEecHHHHHHHHHhhhhhccceeecccCceeecccccccccch Q lcl|NC_015266. 
161 GCKTKEEAVAYRKQFGQREIMVIWPDWLGWDDITNSTVAIPAPAIAAGLRAKIDNDIGWHKTLSNVVVNGVTGISADVSW 240 (390) Q Consensus 161 ~~~~~~~a~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~p~s~~vAg~~a~~d~~~g~~~span~~l~gv~~~~~~~~~ 240 (390) ++.+.+++++|+++++++++++||||++++++.++..+++|||+++||+++++|+++|||+||||++|.|+.++++++++ T Consensus 161 ~~~t~~~a~~~~~~~~s~~~a~~~P~~~~~d~~~~~~~~~p~s~~~AG~~a~~D~~~g~~~spaN~~l~gi~~~~~~~~~ 240 (391) T protein:vir:79 161 GCQTKEEAVAYRSNFGQREAMVMWPDFVGWDTAANAETTLWATARAVGLRAKIDNDTGWHKTLSNVAVGGVTGLSRDVFW 240 (391) T ss_pred CCCCHHHHHHHHhccCCceeEEecceeeeecCcCCceeeechHHHHHHHHHHhhhcccceeccCCceehhhhcccccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhhccccccccccccceeEEEcCCCEEEEccccCCCCcccceeeehhhHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHH Q lcl|NC_015266. 241 DLQDPATDAGYLNENQVTTLVNRNGFRFWGSRTCDADGKFFFENYTRSAQVIADTIAEEQMGVVDGPLNPSRARDIIENI 320 (390) Q Consensus 241 ~~~~~~~~~~~l~~~gI~~~~~~~G~~~wG~rT~~~d~~~~~i~vrR~~~~i~~~l~~~~~~~v~e~n~~~~~~~i~~~i 320 (390) .+++.+++++.||.+||+++++++||++||+||+++|++|+||++||++++|+++|++.++|++||||++.+|++|++++ T Consensus 241 ~~~~~~~~~~~Ln~~~I~t~~~~~G~~~wG~rT~~~d~~~~~i~~rR~~~~i~~~i~~~~~~~v~epn~~~~~~~i~~~i 320 (391) T protein:vir:79 241 DLQDPATDAGYLNANEVTTLVHRDGYRFWGSRTCSADPLFAFENYTRTAQVLADTMAEAHMWANDLPMTPTLVRDLLEGI 320 (391) T ss_pred ccccccchhhhhhhcCceEEECCCcEEEEcccccCCCcccceeehhhHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHhCCceeeeEEEEecCCCCHHHHhCCeEEEEEEEEecccceEEEEEEEEcchHHHHHHHHhcC Q lcl|NC_015266. 
321 NAWFRREVSVGELIGGGAWYDPEPNTTDELTSGGTWIDYDYTPVPPLENLKLRQRITDRYLADFASRVSA 390 (390) Q Consensus 321 ~~~L~~l~~~g~l~g~~v~~d~~~nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~l~~~~~a 390 (390) ++||++||++|+|+||+++||+++||+++|++|+|+++|+++|++|+|||+++++++++|+++|+++|+| T Consensus 321 ~~~l~~l~~~g~l~g~~v~~~~~~nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~~~~~v~~ 390 (391) T protein:vir:79 321 NAKLRMLTRNGYLLGGAAWFDADANSKDTLKAGQLAIDYDYTPVPPLENLTFRQRITDRYLMQFAEAVKA 390 (391) T ss_pred HHHHHHHHhCCceeceEEEEecCCCCHHHhhCCEEEEEEEEEecCCcceEEEEEEEchHHHHHHHHHhhc Confidence 9999999999999999999999999999999999999999999999999999999999999999999999 No 5 >protein:vir:1172 Length: 391 # NCBI annotation: hypothetical protein # Family: family:all:115 # MgeID: mge:24 # MgeName: phi CTX # Cross-refs: genbank:acc:NP_490621;genbank:gi:17313241;genbank:GeneID:927300 Probab=100.00 E-value=9.7e-113 Score=634.68 Aligned_cols=390 Identities=65% Similarity=1.026 Sum_probs=382.2 Q ss_pred CCCccCCCEEEEECCCCCccccccccccceeeeccccccccceeccceEEEechhHHHHhhccccchHHHHHhhhcccCc Q lcl|NC_015266. 1 MPQDYHHGVRVIEINEGGRPIRTVSTAVLGIVCTGADADPATFPLDTPVLLTNVIAALGKAGTKGTLRRTLDAIGKQTKP 80 (390) Q Consensus 1 Ma~~~~hGV~v~ev~~~~~~i~~v~tavig~vgta~~~~~~~~~~~~~~~i~~~~~~~~~~~~~gtl~~al~~~~~~~~~ 80 (390) |+++++||||++|++++++++..+++++++++|+++..+...+++++|+++++..++...+++.+++.++++.++++++. T Consensus 2 ~~~~~~~GV~v~e~~~~~~~i~~v~tavig~vg~a~~a~~~~~~~~~p~~v~s~~~~~~~~g~~~tl~~al~~~~~~~g~ 81 (391) T protein:vir:11 2 AADQYHHGVRVQEINDGTRPIRTIATAIIGLVATAEDADATAFPLDTPVLITNVQAAIGKAGTSGTLPASLQAIADQANA 81 (391) T ss_pred CCCcCCCcEEEEECCCCcceecccCCceeEEEEecCCCCCccccccccEEEecchhhheecCCCccchhhhhhhhccccc Confidence 66788999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eEEEEeeccccccccccccchhhhccchhhhhhhhhhhhhhhhhhhhhhhhhhhhcchHHHHHHHHhhhhcceEEeeccc Q lcl|NC_015266. 
81 VTVVVRVAEGKDEAETTANVIGTVTPDGKYTGMKALLAAQGKLAVKPRILVAPGLDTQPVAAAFATIAQSLRAMVYVAAH 160 (390) Q Consensus 81 ~~~vv~v~~~~~~~~~~~~~~~~~~~~~~~tgl~~~~~~~~~~~~~p~~~~apg~~~~~v~~al~~~~~~~~~~~~~d~~ 160 (390) .++++++.+..+...+..++.++.+..+..+|++++++.++..+..|.++.+|++++++++++|.++|++++++.++|.+ T Consensus 82 ~~~vv~~~~~~~~~~t~~d~~g~~~a~~~~~g~~a~~~~~~~~~~~p~~~~ap~~~~~~v~~al~~~~~~~~~~~i~D~p 161 (391) T protein:vir:11 82 ATVVVRVKPGEDEAATNSAVIGGVSADGKYTGMKALLAAKARLGVVPRILGVPGLDTQPVATALIAIAQQLRAFAYVSAS 161 (391) T ss_pred eeEEeeecccccccccchhhhcccccccchhhhhhhhhhhhhheeccccccccccccHHHHHHHHHhhcccceEEEEEcC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccCchHHHHHHhhhhccceEEEEeeeeEEEeeccCceeEecHHHHHHHHHhhhhhccceeecccCceeecccccccccch Q lcl|NC_015266. 161 GCKTKEEAVAYRKQFGQREIMVIWPDWLGWDDITNSTVAIPAPAIAAGLRAKIDNDIGWHKTLSNVVVNGVTGISADVSW 240 (390) Q Consensus 161 ~~~~~~~a~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~p~s~~vAg~~a~~d~~~g~~~span~~l~gv~~~~~~~~~ 240 (390) ++.+.+++++|+++++|+++++||||++++++.++..+++|||+++||+++++|.++|||+||||++|.|+.+++.++++ T Consensus 162 ~~~t~~~a~~~r~~~~s~~~~~~~p~~~~~~~~~~~~~~~p~s~~~ag~~a~~d~~~g~~~span~~l~gi~~~~~~~~~ 241 (391) T protein:vir:11 162 GCKTKEEATAYRENFAAREAMVIWPDFLTWSTVVNQTVPAPAVAQALGLRARIDQEVGWHKTLSNVAVNGVTGISADVFW 241 (391) T ss_pred CCCCHHHHHHHhhhcCCceEEEEcCcceecccccCceEEechHHHHHHHHHHhhccCCcEEccCCceeeceeeccccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhhccccccccccccceeEEEcCCCEEEEccccCCCCcccceeeehhhHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHH Q lcl|NC_015266. 
241 DLQDPATDAGYLNENQVTTLVNRNGFRFWGSRTCDADGKFFFENYTRSAQVIADTIAEEQMGVVDGPLNPSRARDIIENI 320 (390) Q Consensus 241 ~~~~~~~~~~~l~~~gI~~~~~~~G~~~wG~rT~~~d~~~~~i~vrR~~~~i~~~l~~~~~~~v~e~n~~~~~~~i~~~i 320 (390) .+++..++++.||.+||+++++++||++||+||+++|++|+||++||++++|+++|++.++|++||||++.+|++|++++ T Consensus 242 ~~~~~~~~~~~Ln~~gi~~~~~~~G~~~wG~rT~~~d~~~~~i~vrR~~~~i~~~~~~~~~~~v~e~n~~~~~~~i~~~i 321 (391) T protein:vir:11 242 DLQSPSTDANYLNENEVTTLVQEGGFRFWGSRTCSDDPLFAFENYTRTAQVLADTIAEAHMWAVDKPMHPSLVRDILEGV 321 (391) T ss_pred ccCCCcchhhhhhhcCcEEEEcCCCEEEEcccccCCCcccceeehhhHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHhCCceeeeEEEEecCCCCHHHHhCCeEEEEEEEEecccceEEEEEEEEcchHHHHHHHHhcC Q lcl|NC_015266. 321 NAWFRREVSVGELIGGGAWYDPEPNTTDELTSGGTWIDYDYTPVPPLENLKLRQRITDRYLADFASRVSA 390 (390) Q Consensus 321 ~~~L~~l~~~g~l~g~~v~~d~~~nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~l~~~~~a 390 (390) +.||++||++|+|+||+++||+++||+++|++|+|+++|+++|++|+|||+++++++++||++|+++|+| T Consensus 322 ~~~l~~l~~~g~l~g~~~~~~~~~n~~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~~~~~~~a 391 (391) T protein:vir:11 322 NAKFRELKGLGLIIDAQAWYDPNVNDKDTLKAGKLRITYDYTPVPPLEDLTFFQKITDSYLVDFASRVNA 391 (391) T ss_pred HHHHHHHHhccceeceEEEEecCCCCHHHhhCCeEEEEEEEEecCCcceEEEEEEEchHHHHHHHHHhcC Confidence 9999999999999999999999999999999999999999999999999999999999999999999999 No 6 >protein:vir:100323 Length: 393 # NCBI annotation: tail sheath protein FI # Family: family:all:115 # MgeID: mge:1484 # MgeName: phi-MhaA1-PHL101 # Cross-refs: genbank:acc:YP_655492;genbank:gi:109289960;genbank:GeneID:4157366 Probab=100.00 E-value=2.1e-112 Score=632.86 Aligned_cols=388 Identities=45% Similarity=0.743 Sum_probs=375.0 Q ss_pred CCCccCCCEEEEECCCCCccccccccccceeeeccccccccceeccceEEEechhHHHHhhccccchHHHHHhhhcccCc Q lcl|NC_015266. 
1 MPQDYHHGVRVIEINEGGRPIRTVSTAVLGIVCTGADADPATFPLDTPVLLTNVIAALGKAGTKGTLRRTLDAIGKQTKP 80 (390) Q Consensus 1 Ma~~~~hGV~v~ev~~~~~~i~~v~tavig~vgta~~~~~~~~~~~~~~~i~~~~~~~~~~~~~gtl~~al~~~~~~~~~ 80 (390) |+++|+||||++|++++++++.+++|++++|+|+++..+...+|+++|+++++..++...+++.++|.+++++++++++. T Consensus 3 m~~~~~~GV~v~e~~~g~~~i~~~~tav~~~vgta~~~~~~~~pln~pv~i~s~~~~~~~~g~~g~L~~al~~~~~~~~~ 82 (393) T protein:vir:10 3 ILDTYLHGVEVVEVNAGGVTISTAATSVIGVVCTGDQADAETFPLNTPVLITNPLNYLEKAGSTGTLRRTLNSIGSIVKT 82 (393) T ss_pred CCCccCCCeEEEEcCCCcceecccCcceeEEEeeccCcCcccccCccceEecchHHHHHhhCCccchhhhhhhhhcccCc Confidence 88899999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eEEEEeeccccccccccccchhhhccchhhhhhhhhhhhhhhhhhhhhhhhhhhhcchHHHHHHHHhhhhcceEEeeccc Q lcl|NC_015266. 81 VTVVVRVAEGKDEAETTANVIGTVTPDGKYTGMKALLAAQGKLAVKPRILVAPGLDTQPVAAAFATIAQSLRAMVYVAAH 160 (390) Q Consensus 81 ~~~vv~v~~~~~~~~~~~~~~~~~~~~~~~tgl~~~~~~~~~~~~~p~~~~apg~~~~~v~~al~~~~~~~~~~~~~d~~ 160 (390) .++++++.+..++..+..+++++.+ ++.++|+++++++++.++..|+++.+||++++++++++..+|++++++++++++ T Consensus 83 ~~~vv~v~~~~~~~~t~~~iig~~~-~~~~tgl~al~~~~~~~~~~p~li~apg~~~~~~~~al~~~~~~~~~~~~v~d~ 161 (393) T protein:vir:10 83 PTVIVRVAESDDSDTLTANIVGTQE-NGKFTGIKALLTAQSTVFVKPKLLCVPQHDNQAVATELLSVAKKLNAFAFISDN 161 (393) T ss_pred eEEEeecccCccccccccccccccc-cchhhHHHHHHhhhhhcceeeeeeeeccccchHHHHHHHHHhhccCcEEEEEcC Confidence 9999999999998888888877544 567899999999999999999999999999999999999999999999999989 Q ss_pred ccCchHHHHHHhhhhccceEEEEeeeeEEEeeccCceeEecHHHHHHHHHhhhhhccceeecccCceeecccccccccch Q lcl|NC_015266. 
161 GCKTKEEAVAYRKQFGQREIMVIWPDWLGWDDITNSTVAIPAPAIAAGLRAKIDNDIGWHKTLSNVVVNGVTGISADVSW 240 (390) Q Consensus 161 ~~~~~~~a~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~p~s~~vAg~~a~~d~~~g~~~span~~l~gv~~~~~~~~~ 240 (390) +..+.++++.|+++++|.++++||||++++++.++..+++|||+++||+++++|+++|||+||||++|.|+.++++++++ T Consensus 162 ~~~t~~~ai~~~~~~~s~~~~~~~P~~~~~d~~~~~~~~~p~s~~~Ag~~a~~d~~~G~~~spaN~~l~gi~~~~~~~~~ 241 (393) T protein:vir:10 162 GATTKEQAYTYRQNFSQREGMMIFGDWKSYNTDKKAYDTDYAVARACALQAYIDKTVGWHKNISNVELDGVTGITKAVEF 241 (393) T ss_pred CCCCHHHHHHHhhhcCCceEEEEecccccccccCCceeEeehhHHHHHHHHHhhcCCCcEEccCCceeeceeecceeccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhhccccccccccccceeEEEcCCCEEEEccccCCCCcccceeeehhhHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHH Q lcl|NC_015266. 241 DLQDPATDAGYLNENQVTTLVNRNGFRFWGSRTCDADGKFFFENYTRSAQVIADTIAEEQMGVVDGPLNPSRARDIIENI 320 (390) Q Consensus 241 ~~~~~~~~~~~l~~~gI~~~~~~~G~~~wG~rT~~~d~~~~~i~vrR~~~~i~~~l~~~~~~~v~e~n~~~~~~~i~~~i 320 (390) .+++..+|++.||.+||+++++++||++||+||+++|++|+||++||++++|+++|++.++|++||||++.+|++|++++ T Consensus 242 ~~~~~~~~~~~Ln~~gI~t~~~~~G~~~wG~rT~s~d~~~~~i~vrR~~~~i~~~i~~~~~~~v~e~~~~~~~~~i~~~i 321 (393) T protein:vir:10 242 DINESSTEANYLNEKGITICLNHNGFRYWGSRTLATDTRWAFQQSVRTAQIIKETIGAGLAWAVDMPLTPLRVKTMLEAI 321 (393) T ss_pred ccCCCcchhHhHhhcCceEEEcCCCEEEEcccccCCCcccceeehhhHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHhCC--ceeeeEEEEecCCCCHHHHhCCeEEEEEEEEecccceEEEEEEEEcchHHHHHHHHhcC Q lcl|NC_015266. 
321 NAWFRREVSVG--ELIGGGAWYDPEPNTTDELTSGGTWIDYDYTPVPPLENLKLRQRITDRYLADFASRVSA 390 (390) Q Consensus 321 ~~~L~~l~~~g--~l~g~~v~~d~~~nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~l~~~~~a 390 (390) +.||++||+.| +|+||+++||++ ||+++|++|+|+++|+++|++|+|||+|+++++++|+++|+++|+| T Consensus 322 ~~~L~~l~~~g~~al~g~~v~~~~~-nt~~~i~~G~~~~~i~~~p~~p~e~I~~~~~~~~~~~~~l~~~v~a 392 (393) T protein:vir:10 322 NNKLRSWASGDDPRILGARVWVAEE-ITADIIKSGKFVIKYDYHWIPSLESLGLEQRVNDEYVVDLVNTLKA 392 (393) T ss_pred HHHHHHHHhccccccccceEEecCC-CCHHHhhCCEEEEEEEEEecCCcceEEEEEEEchHHHHHHHHHHhc Confidence 99999999855 899999999876 8889999999999999999999999999999999999999999999 No 7 >protein:vir:98553 Length: 395 # NCBI annotation: gp21 # Family: family:all:115 # MgeID: mge:1533 # MgeName: PSP3 # Cross-refs: genbank:acc:NP_958076;genbank:gi:41057373;genbank:GeneID:2744224 Probab=100.00 E-value=6.9e-111 Score=624.52 Aligned_cols=389 Identities=59% Similarity=0.950 Sum_probs=373.4 Q ss_pred CCCccCCCEEEEECCCCCccccccccccceeeeccccccccceeccceEEEechhHHHHhhccccchHHHHHhhhcccCc Q lcl|NC_015266. 1 MPQDYHHGVRVIEINEGGRPIRTVSTAVLGIVCTGADADPATFPLDTPVLLTNVIAALGKAGTKGTLRRTLDAIGKQTKP 80 (390) Q Consensus 1 Ma~~~~hGV~v~ev~~~~~~i~~v~tavig~vgta~~~~~~~~~~~~~~~i~~~~~~~~~~~~~gtl~~al~~~~~~~~~ 80 (390) |+++ +|||||+|++++++++.++++++++|+|++++.++..+|+++|+++++..++...+++.++|..++++++++++. T Consensus 1 m~~~-~~GV~v~e~~~g~~~v~~v~tav~~~vgta~~~~~~~~p~~~pv~v~s~~~~~~~~g~~~tl~~al~~~~~~~~~ 79 (395) T protein:vir:98 1 MSDF-HHGTQVIEINDGTRVISTVATAVVGMVCTASDADATLFPLNEPVLITNVQSAIAKAGKKGTLAASLQAIADQSKP 79 (395) T ss_pred CCCC-CCCeEEEEcCCCcccccccCcceEEEEeeccCCCccccccccceEeechHHhHhhcccccchhhHHHHHhhccCc Confidence 8876 689999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eEEEEeecccccc------ccccccchhhhccchhhhhhhhhhhhhhhhhhhhhhhhhhhhcchHHHHHHHHhhhhcceE Q lcl|NC_015266. 
81 VTVVVRVAEGKDE------AETTANVIGTVTPDGKYTGMKALLAAQGKLAVKPRILVAPGLDTQPVAAAFATIAQSLRAM 154 (390) Q Consensus 81 ~~~vv~v~~~~~~------~~~~~~~~~~~~~~~~~tgl~~~~~~~~~~~~~p~~~~apg~~~~~v~~al~~~~~~~~~~ 154 (390) .++++++...... ..+..++.++.+..+.+||++++.+.++..++.|.++.+||+++++++++|..+|++++++ T Consensus 80 ~~~vv~~~~~~~~~~~~~~a~~~~~i~g~~~~~~~~Tgl~al~~~~~~~~~~p~il~ap~~~~~~v~~al~~~~~~~~~~ 159 (395) T protein:vir:98 80 VTVVVRVEDGTGDDEEAALAQTVSNIIGGTDENGKYTGIKALLTAQAVTGVKPRILGVPGLDTKEVAVALASAAIKLRAF 159 (395) T ss_pred eEEEeeccccccccccccccccccccccccccccchhHHHHHhhhhhhhccchhhcccccccccHHHHHHHHHhhhcCcE Confidence 9999987654433 2344556666777888999999999999999999999999999999999999999999999 Q ss_pred EeecccccCchHHHHHHhhhhccceEEEEeeeeEEEeeccCceeEecHHHHHHHHHhhhhhccceeecccCceeeccccc Q lcl|NC_015266. 155 VYVAAHGCKTKEEAVAYRKQFGQREIMVIWPDWLGWDDITNSTVAIPAPAIAAGLRAKIDNDIGWHKTLSNVVVNGVTGI 234 (390) Q Consensus 155 ~~~d~~~~~~~~~a~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~p~s~~vAg~~a~~d~~~g~~~span~~l~gv~~~ 234 (390) +++|.|.+.+.+++++|+++++|+++++||||++++++.++..+++|||+++||+++++|.++|||+||||++|+|+.++ T Consensus 160 ~~~d~p~~~t~~~a~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~~AG~~a~~d~~~g~~~spaN~~i~gi~~~ 239 (395) T protein:vir:98 160 AYVSAWGCKTISEAMEYRKNFSQRELMVIWPDFLAWDTVKNTTATAYATARALGLRAYIDQTVGWHKTLSNVGVQGVTGI 239 (395) T ss_pred EEEEcCCCCCHHHHHHHHhccCCceEEEEecceeEecccCCceeeechHHHHHHHHHHhhcccCcEeccCCceeeccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccchhhhccccccccccccceeEEEcCCCEEEEccccCCCCcccceeeehhhHHHHHHHHHHHHHHHhcCCCCHHHHH Q lcl|NC_015266. 
235 SADVSWDLQDPATDAGYLNENQVTTLVNRNGFRFWGSRTCDADGKFFFENYTRSAQVIADTIAEEQMGVVDGPLNPSRAR 314 (390) Q Consensus 235 ~~~~~~~~~~~~~~~~~l~~~gI~~~~~~~G~~~wG~rT~~~d~~~~~i~vrR~~~~i~~~l~~~~~~~v~e~n~~~~~~ 314 (390) +.++++.+++..+|++.||.+||+++++++|+++||+||+++|++|+||++||++++|+++|++.++|++||||++.+|+ T Consensus 240 ~~~~~~~~~~~~~~~~~Ln~~gI~~~~~~~G~~~wG~rT~s~d~~~~~i~~rR~~~~i~~~i~~~~~~~v~e~~~~~~~~ 319 (395) T protein:vir:98 240 SASVFWDLQASGTDADLLNEAGVTTLVRKDGFRFWGNRTCSDDPLFLFENYTRTAQVLADTMAEAHMWAVDKPITATLIR 319 (395) T ss_pred ceecccccCCCcchHHhhhhcCcEEEEcCCCEEEEcccccCCCcccceeehhhHHHHHHHHHHHHHHHhccCCCCHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHhCCceeeeEEEEecCCCCHHHHhCCeEEEEEEEEecccceEEEEEEEEcchHHHHHHHHhcC Q lcl|NC_015266. 315 DIIENINAWFRREVSVGELIGGGAWYDPEPNTTDELTSGGTWIDYDYTPVPPLENLKLRQRITDRYLADFASRVSA 390 (390) Q Consensus 315 ~i~~~i~~~L~~l~~~g~l~g~~v~~d~~~nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~l~~~~~a 390 (390) +|+++++.||++||++|+|+||+|+||+++||+++|++|+|+++|+++|++|+|||+++++++++|+++|+++|+| T Consensus 320 ~i~~~i~~~L~~l~~~g~l~g~~v~~d~~~nt~~~i~~G~~~~~i~~~p~~p~e~I~~~~~~~~~~~~~~~~~~~~ 395 (395) T protein:vir:98 320 DIVDGINAKFRELKSNGYIVEGKCWFDEESNDKETLKAGKLYIDYDYTPVPPLESLTLRQRITDKYLVNLAESVNS 395 (395) T ss_pred HHHHHHHHHHHHHHhCCceeceEEEEecCCCCHHHhhCCeEEEEEEEEecCCcceEEEEEEEchHHHHHHHHHhcC Confidence 9999999999999999999999999999999999999999999999999999999999999999999999999999 No 8 >protein:vir:1845 Length: 392 # NCBI annotation: tail sheath protein FI # Family: family:all:115 # MgeID: mge:324 # MgeName: 186 # Cross-refs: genbank:acc:NP_052269;genbank:gi:9634076;genbank:GeneID:1262448 Probab=100.00 E-value=1.2e-110 Score=623.12 Aligned_cols=389 Identities=60% Similarity=0.950 Sum_probs=375.8 Q ss_pred CCCccCCCEEEEECCCCCccccccccccceeeeccccccccceeccceEEEechhHHHHhhccccchHHHHHhhhcccCc Q lcl|NC_015266. 
1 MPQDYHHGVRVIEINEGGRPIRTVSTAVLGIVCTGADADPATFPLDTPVLLTNVIAALGKAGTKGTLRRTLDAIGKQTKP 80 (390) Q Consensus 1 Ma~~~~hGV~v~ev~~~~~~i~~v~tavig~vgta~~~~~~~~~~~~~~~i~~~~~~~~~~~~~gtl~~al~~~~~~~~~ 80 (390) |+++ +||||++|++++++++.++++++++++|+++..+...+++++|+++++..++...+++.+++..+++.++++++. T Consensus 1 m~~~-~~Gv~v~e~~~g~~~i~~~~tav~g~vgta~~~~~~~~~~~~p~~its~~~~~~~~g~~gtl~~al~~~~~ngg~ 79 (392) T protein:vir:18 1 MSDF-HHGTKVIEINDGTRVISTVATAIVGMVWTASDADAETFPLNEPVLITNVQSAIAKAGKKGTLSASLQAIADQSKP 79 (392) T ss_pred CCCC-CCCeEEEEcCCCceeeeccCcceeEEEEeccCCCCcccccccceEeechHHHHhhcCCCcchHHHHHHhhcccCc Confidence 8885 699999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eEEEEeeccc---cccccccccchhhhccchhhhhhhhhhhhhhhhhhhhhhhhhhhhcchHHHHHHHHhhhhcceEEee Q lcl|NC_015266. 81 VTVVVRVAEG---KDEAETTANVIGTVTPDGKYTGMKALLAAQGKLAVKPRILVAPGLDTQPVAAAFATIAQSLRAMVYV 157 (390) Q Consensus 81 ~~~vv~v~~~---~~~~~~~~~~~~~~~~~~~~tgl~~~~~~~~~~~~~p~~~~apg~~~~~v~~al~~~~~~~~~~~~~ 157 (390) .++++.+... .+...+..+++++.+.++..+|++++.+.+...+..|.++.+||+++++|+++|.++|+++++++++ T Consensus 80 ~~~vv~v~~~~~~~~~~~t~~dliG~~~~~~~~tg~~al~~~~~~~~~~p~il~ap~~~~~~v~~~l~~~~~~~~~~~~~ 159 (392) T protein:vir:18 80 VTVVVRVAEGTGDDAEAQTTSNIIGGTDENGKYTGIKALLTAEAVTGVKPRILGVPGLDTQEVATALASVCISLRAFGYV 159 (392) T ss_pred eEEEecccccccccccccchhhheecccccchhhhHHHHHhhhhhhceeehhcccCccchHHHHHHHHHHHhhcCcEEEE Confidence 9998876543 3456677788888888899999999999999999999999999999999999999999999999999 Q ss_pred cccccCchHHHHHHhhhhccceEEEEeeeeEEEeeccCceeEecHHHHHHHHHhhhhhccceeecccCceeecccccccc Q lcl|NC_015266. 
158 AAHGCKTKEEAVAYRKQFGQREIMVIWPDWLGWDDITNSTVAIPAPAIAAGLRAKIDNDIGWHKTLSNVVVNGVTGISAD 237 (390) Q Consensus 158 d~~~~~~~~~a~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~p~s~~vAg~~a~~d~~~g~~~span~~l~gv~~~~~~ 237 (390) |++++.+.+++.+|+++++|+++++||||++++++.++..+++|||+++||+++++|.++|||+||||++|.||.+++.+ T Consensus 160 d~~~~~~~~~a~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~~AG~~a~~d~~~g~~~spaN~~l~gi~~~~~~ 239 (392) T protein:vir:18 160 SAWGCKTISEAMAYRENFSQRELMVIWPDFLAWDTTANATATAYATARALGLRAYIDQTIGWHKTLSNVGVQGVTGISAS 239 (392) T ss_pred ecCCCCCHHHHHHHHhhccCceEEEEeCceeeecccCCceEEechHHHHHHHHHhhhccCCceEccCCceeeceeeccee Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cchhhhccccccccccccceeEEEcCCCEEEEccccCCCCcccceeeehhhHHHHHHHHHHHHHHHhcCCCCHHHHHHHH Q lcl|NC_015266. 238 VSWDLQDPATDAGYLNENQVTTLVNRNGFRFWGSRTCDADGKFFFENYTRSAQVIADTIAEEQMGVVDGPLNPSRARDII 317 (390) Q Consensus 238 ~~~~~~~~~~~~~~l~~~gI~~~~~~~G~~~wG~rT~~~d~~~~~i~vrR~~~~i~~~l~~~~~~~v~e~n~~~~~~~i~ 317 (390) +++.+++..++++.||++||+++++++|+++||+||+++|++|+||++||++++|+++|++.++|++||||++.+|++|+ T Consensus 240 ~~~~~~~~~~~~~~Ln~~gI~t~~~~~G~~~wG~rT~~~d~~~~~i~~rR~~~~i~~~i~~~~~~~v~e~n~~~~~~~i~ 319 (392) T protein:vir:18 240 VFWDLQASGTDADLLNEAGVTTLVRKDGFRFWGNRTCSDDPLFLFENYTRTAQVLADTMAEAHMWAVDKPITASLIRDIV 319 (392) T ss_pred cccccCCCcchhhhhhhcCceEEEcCCCEEEEcccccCCCcccceeehhhHHHHHHHHHHHHHHHhccCCCCHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHhCCceeeeEEEEecCCCCHHHHhCCeEEEEEEEEecccceEEEEEEEEcchHHHHHHHHhcC Q lcl|NC_015266. 
318 ENINAWFRREVSVGELIGGGAWYDPEPNTTDELTSGGTWIDYDYTPVPPLENLKLRQRITDRYLADFASRVSA 390 (390) Q Consensus 318 ~~i~~~L~~l~~~g~l~g~~v~~d~~~nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~l~~~~~a 390 (390) +++++||++||++|+|+||+++||+++||+++|++|+|+++|+++|++|+|||+|+++++++|+++|+++|.| T Consensus 320 ~~i~~~L~~l~~~gal~g~~v~~d~~~nt~~~i~~G~~~~~v~~~p~~p~e~I~~~~~~~~~~~~~~~~~~~~ 392 (392) T protein:vir:18 320 DGINAKFRELKSNGYIVDGECWFDEESNDKETLKAGKLYIDYDYTPVPPLESLTLRQRITDKYLVNLAESVNS 392 (392) T ss_pred HHHHHHHHHHHhcCcccceEEEEecCCCCHHHhhCCeEEEEEEEEecCCcceEEEEEEEchHHHHHHHHHhcC Confidence 9999999999999999999999999999999999999999999999999999999999999999999999999 No 9 >protein:vir:2035 Length: 396 # NCBI annotation: gpFI # Family: family:all:115 # MgeID: mge:315 # MgeName: P2 # Cross-refs: genbank:acc:NP_046778;genbank:gi:9630349;genbank:GeneID:1261516 Probab=100.00 E-value=2.2e-110 Score=621.72 Aligned_cols=389 Identities=58% Similarity=0.962 Sum_probs=372.8 Q ss_pred CCCccCCCEEEEECCCCCccccccccccceeeeccccccccceeccceEEEechhHHHHhhccccchHHHHHhhhcccCc Q lcl|NC_015266. 1 MPQDYHHGVRVIEINEGGRPIRTVSTAVLGIVCTGADADPATFPLDTPVLLTNVIAALGKAGTKGTLRRTLDAIGKQTKP 80 (390) Q Consensus 1 Ma~~~~hGV~v~ev~~~~~~i~~v~tavig~vgta~~~~~~~~~~~~~~~i~~~~~~~~~~~~~gtl~~al~~~~~~~~~ 80 (390) |+++ +|||||+|++++++++..+.+++++++|++++++...+++++|+++++..++...+++.++|..++++++++++. T Consensus 1 m~~~-~~GV~v~e~~~g~~~i~~v~tav~~~vg~a~~a~~~~~~l~~pvlvts~~~~~~~~g~~~tL~~al~~~~~ngg~ 79 (396) T protein:vir:20 1 MSDY-HHGVQVLEINEGTRVISTVSTAIVGMVCTASDADAETFPLNKPVLITNVQSAISKAGKKGTLAASLQAIADQSKP 79 (396) T ss_pred CCCC-CCCeEEEEcCCCcceeeecCCceeEEEeeeccCCCccccCccCEEeechHHHHhhcccccchhhhhhhhhccCce Confidence 9885 699999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eEEEEeecccccc------ccccccchhhhccchhhhhhhhhhhhhhhhhhhhhhhhhhhhcchHHHHHHHHhhhhcceE Q lcl|NC_015266. 
81 VTVVVRVAEGKDE------AETTANVIGTVTPDGKYTGMKALLAAQGKLAVKPRILVAPGLDTQPVAAAFATIAQSLRAM 154 (390) Q Consensus 81 ~~~vv~v~~~~~~------~~~~~~~~~~~~~~~~~tgl~~~~~~~~~~~~~p~~~~apg~~~~~v~~al~~~~~~~~~~ 154 (390) .++++++...... ..+...+.++.+..+..+|++++.+.++..+..|.++.+|++++++|+++|.++|++++++ T Consensus 80 ~~~v~~~~~~~~~~~~~~~a~t~~~~~~~~~~~~~~tg~~al~~~~~~~~~~p~i~~ap~~~~~~v~~al~~~~~~~~~~ 159 (396) T protein:vir:20 80 VTVVMRVEDGTGDDEETKLAQTVSNIIGTTDENGQYTGLKAMLAAESVTGVKPRILGVPGLDTKEVAVALASVCQKLRAF 159 (396) T ss_pred eEEEEeccccccccccccccccccccccccccccccchhhhhhhhccccccchhhhhhhhhccHHHHHHHHHHHhcCCcE Confidence 9999887544332 2344556666667788999999999999999999999999999999999999999999999 Q ss_pred EeecccccCchHHHHHHhhhhccceEEEEeeeeEEEeeccCceeEecHHHHHHHHHhhhhhccceeecccCceeeccccc Q lcl|NC_015266. 155 VYVAAHGCKTKEEAVAYRKQFGQREIMVIWPDWLGWDDITNSTVAIPAPAIAAGLRAKIDNDIGWHKTLSNVVVNGVTGI 234 (390) Q Consensus 155 ~~~d~~~~~~~~~a~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~p~s~~vAg~~a~~d~~~g~~~span~~l~gv~~~ 234 (390) +++|.|...+.+++++|+++++|.++++||||++++|+.++..+++|||+++||+++++|.++|+|+||||++|.||.++ T Consensus 160 ~~iD~p~~~~~~~a~~~r~~~~s~~~~~~~P~~~~~d~~~~~~~~~p~s~~~Ag~~a~~d~~~g~~~spaN~~l~gi~~~ 239 (396) T protein:vir:20 160 GYISAWGCKTISEVKAYRQNFSQRELMVIWPDFLAWDTVTSTTATAYATARALGLRAKIDQEQGWHKTLSNVGVNGVTGI 239 (396) T ss_pred EEEecCCCCCHHHHHHHhhCCCCceEEEEcCccccccCcCCcceeechhHHHHHHHHHhhhhcCcEeccCCceeccceec Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccchhhhccccccccccccceeEEEcCCCEEEEccccCCCCcccceeeehhhHHHHHHHHHHHHHHHhcCCCCHHHHH Q lcl|NC_015266. 
235 SADVSWDLQDPATDAGYLNENQVTTLVNRNGFRFWGSRTCDADGKFFFENYTRSAQVIADTIAEEQMGVVDGPLNPSRAR 314 (390) Q Consensus 235 ~~~~~~~~~~~~~~~~~l~~~gI~~~~~~~G~~~wG~rT~~~d~~~~~i~vrR~~~~i~~~l~~~~~~~v~e~n~~~~~~ 314 (390) ++.+.+.+++..+|++.||++||+++++++||++||+||+++|++|+||++||+++||+++|++.++|++||||++.+|+ T Consensus 240 ~~~~~~~~~~~~~~~~~Ln~~gi~~~~~~~G~~~wG~rT~s~d~~~~~i~~rR~~~~i~~~~~~~~~~~v~e~~~~~~~~ 319 (396) T protein:vir:20 240 SASVFWDLQESGTDADLLNESGVTTLIRRDGFRFWGNRTCSDDPLFLFENYTRTAQVVADTMAEAHMWAVDKPITATLIR 319 (396) T ss_pred ceecccccCCCcchhhhhhhcCcEEEEcCCCEEEEcccccCCCcccceeehhhHHHHHHHHHHHHHHHhccCCCCHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHhCCceeeeEEEEecCCCCHHHHhCCeEEEEEEEEecccceEEEEEEEEcchHHHHHHHHhcC Q lcl|NC_015266. 315 DIIENINAWFRREVSVGELIGGGAWYDPEPNTTDELTSGGTWIDYDYTPVPPLENLKLRQRITDRYLADFASRVSA 390 (390) Q Consensus 315 ~i~~~i~~~L~~l~~~g~l~g~~v~~d~~~nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~l~~~~~a 390 (390) +|+++++.||++||++|+|+||+|+||+++||+++|++|+|+++|+++|++|+|||+|+++++++||++|+++|+| T Consensus 320 ~i~~~i~~~L~~l~~~G~l~g~~v~~d~~~nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~~~~~~~~ 395 (396) T protein:vir:20 320 DIVDGINAKFRELKTNGYIVDATCWFSEESNDAETLKAGKLYIDYDYTPVPPLENLTLRQRITDKYLANLVTSVNS 395 (396) T ss_pred HHHHHHHHHHHHHHhCcceeceEEEEecCCCCHHHhhCCEEEEEEEEEecCCcceEEEEEEEchHHHHHHHHHhhc Confidence 9999999999999999999999999999999999999999999999999999999999999999999999999999 No 10 >protein:vir:5711 Length: 396 # NCBI annotation: gpFI # Family: family:all:115 # MgeID: mge:120 # MgeName: L-413C # Cross-refs: genbank:acc:NP_839870;genbank:gi:30065725;genbank:GeneID:1260618 Probab=100.00 E-value=4.3e-110 Score=620.19 Aligned_cols=389 Identities=59% Similarity=0.964 Sum_probs=373.9 Q ss_pred CCCccCCCEEEEECCCCCccccccccccceeeeccccccccceeccceEEEechhHHHHhhccccchHHHHHhhhcccCc Q lcl|NC_015266. 
1 MPQDYHHGVRVIEINEGGRPIRTVSTAVLGIVCTGADADPATFPLDTPVLLTNVIAALGKAGTKGTLRRTLDAIGKQTKP 80 (390) Q Consensus 1 Ma~~~~hGV~v~ev~~~~~~i~~v~tavig~vgta~~~~~~~~~~~~~~~i~~~~~~~~~~~~~gtl~~al~~~~~~~~~ 80 (390) |+. |+|||||+|++++++++..+++++++++|+++..+...+++++|+++++..++...+++.+++..++++++++++. T Consensus 1 m~~-~~~GV~v~e~~~g~~~i~~v~tav~~~vg~a~~~d~~~~~~~~pv~i~s~~~~~~~~g~~~tl~~al~~~~~~~~~ 79 (396) T protein:vir:57 1 MSD-YHHGVQVLEINDGTRVISTVSTAIVGMVCTASDADAETFPLNKPVLITNVQSAIAKAGKKGTLAASLQAIADQSKP 79 (396) T ss_pred CCC-CCCceEEEEcCCCcccccccCCceEEEEEeccCCCcccccCccCeEeecchhhhhhcccccchHHHHHHhhhcCCc Confidence 887 6689999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eEEEEeeccccc------cccccccchhhhccchhhhhhhhhhhhhhhhhhhhhhhhhhhhcchHHHHHHHHhhhhcceE Q lcl|NC_015266. 81 VTVVVRVAEGKD------EAETTANVIGTVTPDGKYTGMKALLAAQGKLAVKPRILVAPGLDTQPVAAAFATIAQSLRAM 154 (390) Q Consensus 81 ~~~vv~v~~~~~------~~~~~~~~~~~~~~~~~~tgl~~~~~~~~~~~~~p~~~~apg~~~~~v~~al~~~~~~~~~~ 154 (390) .++++++..... ...+..+++++++.++..+|++++.++++..+..|.++.+|++++++++++|..+|++++++ T Consensus 80 ~~~vv~~~~~~~~~~~~~~a~t~~~iiG~~~~~~~~tgl~al~~~~~~~~~~p~i~~ap~~~~~~v~~al~~~~~~~~~~ 159 (396) T protein:vir:57 80 VTVVVRVEDGTGDDEETKLAQTVSNIIGTTDENGQYTGLKALMGAESVTGVKPRILGVPGLDTKEVAVALASVCQELNAF 159 (396) T ss_pred eeEeeeccccccccccccccccceeeeeeccccccchhhhhhhhcccceeEEeccccCcccchhHHHHHHHHHhhhCceE Confidence 999988754433 34455667777777889999999999999999999999999999999999999999999999 Q ss_pred EeecccccCchHHHHHHhhhhccceEEEEeeeeEEEeeccCceeEecHHHHHHHHHhhhhhccceeecccCceeeccccc Q lcl|NC_015266. 
155 VYVAAHGCKTKEEAVAYRKQFGQREIMVIWPDWLGWDDITNSTVAIPAPAIAAGLRAKIDNDIGWHKTLSNVVVNGVTGI 234 (390) Q Consensus 155 ~~~d~~~~~~~~~a~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~p~s~~vAg~~a~~d~~~g~~~span~~l~gv~~~ 234 (390) .++|.+++.+.+++++|+++++|.++++||||++++++.++..+++|||+++||++|++|.++|+|+||||++|.|+.++ T Consensus 160 ~~~d~p~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~~Ag~~a~~d~~~g~~~spaN~~l~gi~~~ 239 (396) T protein:vir:57 160 GYISAWGCKTISEVKAYRQNFSQRELMVIWPDFLAWDTVTSTTATAYATARALGLRAKIDQEQGWHKTLSNVGVNGVTGI 239 (396) T ss_pred EEEcCCCCCCHHHHHHHHhccCCceEEEEcceeeeecccCCceeEEehhHHHHHHHHHhhhccCcEeccCCceecccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccchhhhccccccccccccceeEEEcCCCEEEEccccCCCCcccceeeehhhHHHHHHHHHHHHHHHhcCCCCHHHHH Q lcl|NC_015266. 235 SADVSWDLQDPATDAGYLNENQVTTLVNRNGFRFWGSRTCDADGKFFFENYTRSAQVIADTIAEEQMGVVDGPLNPSRAR 314 (390) Q Consensus 235 ~~~~~~~~~~~~~~~~~l~~~gI~~~~~~~G~~~wG~rT~~~d~~~~~i~vrR~~~~i~~~l~~~~~~~v~e~n~~~~~~ 314 (390) ++.+++.+++..++++.||++||+++++++|+++||+||+++|++|+||++||++++|+++|++.++|++||||++.+|+ T Consensus 240 ~~~~~~~~~~~~~~~~~Ln~~gi~t~~~~~G~~~wG~rT~~~d~~~~~i~vrR~~~~i~~~i~~~~~~~v~e~n~~~~~~ 319 (396) T protein:vir:57 240 SASVFWDLQKPGTDADLLNEAGVTTLVRRDGFRFWGNRTCSDDPLFLFESYTRTAQVLADTMAEAHMWAIDKPITATLIR 319 (396) T ss_pred ceecccccCCcchhhhhhhhcCcEEEEcCCCEEEEcccccCCCcccceeehhhHHHHHHHHHHHHHHHhccCCCCHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHhCCceeeeEEEEecCCCCHHHHhCCeEEEEEEEEecccceEEEEEEEEcchHHHHHHHHhcC Q lcl|NC_015266. 
315 DIIENINAWFRREVSVGELIGGGAWYDPEPNTTDELTSGGTWIDYDYTPVPPLENLKLRQRITDRYLADFASRVSA 390 (390) Q Consensus 315 ~i~~~i~~~L~~l~~~g~l~g~~v~~d~~~nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~l~~~~~a 390 (390) +|+++++.||++||++|+|+||+++||+++||+++|++|+|+++|+++|++|+|||+|+++++++|+++|+++|.| T Consensus 320 ~i~~~i~~~l~~l~~~gal~g~~v~~d~~~n~~~~i~~G~~~~~v~~~p~~p~e~I~~~~~~~~~~~~~~~~~~~~ 395 (396) T protein:vir:57 320 DIIDGINAKFRELKNNGYIVDGTCWFSEESNDAETLKAGKLYIDYDYTPVPPLENLTLRQRITSRYLASLVTSVNS 395 (396) T ss_pred HHHHHHHHHHHHHHhCCceeceEEEEecCCCCHHHhhCCeEEEEEEEEecCCcceEEEEEEEchHHHHHHHHHhhc Confidence 9999999999999999999999999999999999999999999999999999999999999999999999999999 No 11 >protein:vir:6079 Length: 396 # NCBI annotation: gpFI # Family: family:all:115 # MgeID: mge:126 # MgeName: WPhi # Cross-refs: genbank:acc:NP_878220;genbank:gi:33438919;genbank:GeneID:1457754 Probab=100.00 E-value=8.3e-110 Score=618.60 Aligned_cols=389 Identities=58% Similarity=0.958 Sum_probs=373.9 Q ss_pred CCCccCCCEEEEECCCCCccccccccccceeeeccccccccceeccceEEEechhHHHHhhccccchHHHHHhhhcccCc Q lcl|NC_015266. 1 MPQDYHHGVRVIEINEGGRPIRTVSTAVLGIVCTGADADPATFPLDTPVLLTNVIAALGKAGTKGTLRRTLDAIGKQTKP 80 (390) Q Consensus 1 Ma~~~~hGV~v~ev~~~~~~i~~v~tavig~vgta~~~~~~~~~~~~~~~i~~~~~~~~~~~~~gtl~~al~~~~~~~~~ 80 (390) |+.+ +||||++|++++++++..+++++++|+|+++..+...+++++|+++++..++...+++.++|.+++++++++++. T Consensus 1 m~~~-~~Gv~v~e~~~~~~~v~~~~tav~~fvGta~~~~~~~~p~~~p~~v~s~~~~~~~~g~~~tl~~a~~~~~~~gg~ 79 (396) T protein:vir:60 1 MSDY-HHGVQVLEINEGTRVISTVSTAIVGMVCTASDADAEIFPLNKPVLITNVQSAIAKAGKKGTLAASLQAIADQSKP 79 (396) T ss_pred CCCC-CCCeEEEEcCCCcccccccCceeEEEEecccccccccccCccCeEeechHHHHHhhcCcchhHHHHHHHhhccCc Confidence 8885 599999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eEEEEeeccccccc------cccccchhhhccchhhhhhhhhhhhhhhhhhhhhhhhhhhhcchHHHHHHHHhhhhcceE Q lcl|NC_015266. 
81 VTVVVRVAEGKDEA------ETTANVIGTVTPDGKYTGMKALLAAQGKLAVKPRILVAPGLDTQPVAAAFATIAQSLRAM 154 (390) Q Consensus 81 ~~~vv~v~~~~~~~------~~~~~~~~~~~~~~~~tgl~~~~~~~~~~~~~p~~~~apg~~~~~v~~al~~~~~~~~~~ 154 (390) .++++++....+.. .+...+.++.+.++..+|++++.+.++..+..|.++.+||+++..|++++.++|++++++ T Consensus 80 ~~~vv~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~tg~~al~~~~~~~~~~~~il~ap~~~~~~v~~al~~~~~~~~~~ 159 (396) T protein:vir:60 80 VTVVVRVEDGTGEDEETKLAQTVSNIIGTTDENGQYTGLKALLAAESVTGVKPRILGVPGLDTKEVAVALASVCQKLRAF 159 (396) T ss_pred eEEEEecccccccccccccccccccccccccccccccchhhhhhcccceeeeeeeccccccccHHHHHHHHHHhccCCeE Confidence 99999986544332 344566777778888999999999999999999999999999999999999999999999 Q ss_pred EeecccccCchHHHHHHhhhhccceEEEEeeeeEEEeeccCceeEecHHHHHHHHHhhhhhccceeecccCceeeccccc Q lcl|NC_015266. 155 VYVAAHGCKTKEEAVAYRKQFGQREIMVIWPDWLGWDDITNSTVAIPAPAIAAGLRAKIDNDIGWHKTLSNVVVNGVTGI 234 (390) Q Consensus 155 ~~~d~~~~~~~~~a~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~p~s~~vAg~~a~~d~~~g~~~span~~l~gv~~~ 234 (390) +++|.|.+.+.+++.+|+++++|.++++||||++++++.++..+++|||+++||+++++|.++|+|+||||++|.|+.++ T Consensus 160 ~i~d~p~~~~~~~a~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~~AG~~a~~d~~~g~~~spaN~~l~gi~~~ 239 (396) T protein:vir:60 160 GYISAWGCKTISEVKAYRQNFSQRELMVIWPDFLAWDTVASTTATAYATARALGLRAKIDQEQGWHKTLSNVGVNGVTGI 239 (396) T ss_pred EEEeCCCCCCHHHHHHHHhhcCCceEEEEeCceeeecccCCceeEEchhHHHHHHHHHhhhccCcEeCcCCceecceeec Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccchhhhccccccccccccceeEEEcCCCEEEEccccCCCCcccceeeehhhHHHHHHHHHHHHHHHhcCCCCHHHHH Q lcl|NC_015266. 
235 SADVSWDLQDPATDAGYLNENQVTTLVNRNGFRFWGSRTCDADGKFFFENYTRSAQVIADTIAEEQMGVVDGPLNPSRAR 314 (390) Q Consensus 235 ~~~~~~~~~~~~~~~~~l~~~gI~~~~~~~G~~~wG~rT~~~d~~~~~i~vrR~~~~i~~~l~~~~~~~v~e~n~~~~~~ 314 (390) +.++++.+++..+|++.||++||+++++++|+++||+||+++|++|+||++||++++|+++|++.+++++||||++.+|+ T Consensus 240 ~~~~~~~~~~~~~~~~~Ln~~gI~~~~~~~G~~~wG~rT~~~d~~~~~i~~rR~~~~i~~~i~~~~~~~v~e~n~~~~~~ 319 (396) T protein:vir:60 240 SASVFWDLQESGTDADLLNESGVTTLIRRDGFRFWGNRTCSDDPLFLFENYTRTAQVLADTMAEAHMWAVDKPITATLIR 319 (396) T ss_pred eeecccccCCCcchhhhhhhcCcEEEEcCCCEEEEcccccCCCcccceeehhhHHHHHHHHHHHHHHHhccCCCCHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHhCCceeeeEEEEecCCCCHHHHhCCeEEEEEEEEecccceEEEEEEEEcchHHHHHHHHhcC Q lcl|NC_015266. 315 DIIENINAWFRREVSVGELIGGGAWYDPEPNTTDELTSGGTWIDYDYTPVPPLENLKLRQRITDRYLADFASRVSA 390 (390) Q Consensus 315 ~i~~~i~~~L~~l~~~g~l~g~~v~~d~~~nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~l~~~~~a 390 (390) +++++|++||++||++|+|+||+++||+++||+++|++|+|+++|+++|++|+|||+++++++++||++|+++|+| T Consensus 320 ~i~~~i~~~l~~l~~~gal~g~~~~~d~~~nt~~~i~~G~~~~~i~~~p~~pae~I~~~~~~~~~~~~~~~~~~~~ 395 (396) T protein:vir:60 320 DIVDGINAKFRELKTNGYIVDATCWFSEESNDAETLKAGKLYIDYDYTPVPPLENLTLRQRITDKYLANLVTSVNS 395 (396) T ss_pred HHHHHHHHHHHHHHhCCceeceEEEEecCCCCHHHhhCCEEEEEEEEEecCCcceEEEEEEEchHHHHHHHHHhhc Confidence 9999999999999999999999999999999999999999999999999999999999999999999999999999 No 12 >protein:vir:10336 Length: 386 # NCBI annotation: ORF39 # Family: family:all:115 # MgeID: mge:182 # MgeName: VHML # Cross-refs: genbank:acc:NP_758931;genbank:gi:27311206;genbank:GeneID:956116 Probab=100.00 E-value=5.1e-107 Score=603.32 Aligned_cols=384 Identities=38% Similarity=0.679 Sum_probs=370.0 Q ss_pred CCCccCCCEEEEECCCCCccccccccccceeeeccccccccceeccceEEEechhHHHHhhccccchHHHHHhhhcccCc Q lcl|NC_015266. 
1 MPQDYHHGVRVIEINEGGRPIRTVSTAVLGIVCTGADADPATFPLDTPVLLTNVIAALGKAGTKGTLRRTLDAIGKQTKP 80 (390) Q Consensus 1 Ma~~~~hGV~v~ev~~~~~~i~~v~tavig~vgta~~~~~~~~~~~~~~~i~~~~~~~~~~~~~gtl~~al~~~~~~~~~ 80 (390) ||++|+|||||+|+.++++|+.++++++++|+|+++.+++..+++++|+++++..++...+++.+++..++.+++.+++. T Consensus 1 M~~~~~~Gv~v~ev~~~~~~i~~v~tav~~~vg~a~~a~~~~~~~~~pv~i~s~~~~~~~~g~~~tl~~a~~~~~~~gg~ 80 (386) T protein:vir:10 1 MAEQYLHGAEVVEIDNGARPIRTAQSGVIGLVGTAPDADATAFPLNTPVLIAGSRREAAKLGAGGTLPQAIDGIFDQTGA 80 (386) T ss_pred CccccCCCeEEEEcCCCcccccccCcceeEEEEecCCCCCcccccccceEecchHHHHhhcCCCcchhHHHHHHhccCce Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eEEEEeeccccccccccccchhhhcc-chhhhhhhhhhhhhhhhhhhhhhhhhhhhcc-hHHHHHHHHhhhhcceEEeec Q lcl|NC_015266. 81 VTVVVRVAEGKDEAETTANVIGTVTP-DGKYTGMKALLAAQGKLAVKPRILVAPGLDT-QPVAAAFATIAQSLRAMVYVA 158 (390) Q Consensus 81 ~~~vv~v~~~~~~~~~~~~~~~~~~~-~~~~tgl~~~~~~~~~~~~~p~~~~apg~~~-~~v~~al~~~~~~~~~~~~~d 158 (390) .|+++++.+..+...+..+.+++.+. +...+|+.++.+.+...+..|.+..+|++++ .+|.+++..++++++.+.+.+ T Consensus 81 ~~~vv~~~~~~~~~~t~~~~ig~~~~~t~~~tgl~~l~~~~~~~~~~p~i~~ap~~~~~~~v~~~l~~~~~~~~~~~~~~ 160 (386) T protein:vir:10 81 VVVVIRVDEGVDSAATQSNVIGKVDADTEQYTGILALLSAENTVKVQPRILIAPGFSNQKAVADQLVSVADTAAWLCHSG 160 (386) T ss_pred eEEEeeccccccccccchhhhcccccccchhhhhHHhhhhcccccccccccccccccchhHHHHHHHHhhcceEEEEEeC Confidence 99999999999988888888887765 6778999999999999999999999999976 568899999999999888777 Q ss_pred ccccCchHHHHHHhhhhccceEEEEeeeeEEEeeccCceeEecHHHHHHHHHhhhhhccceeecccCceeeccccccccc Q lcl|NC_015266. 
159 AHGCKTKEEAVAYRKQFGQREIMVIWPDWLGWDDITNSTVAIPAPAIAAGLRAKIDNDIGWHKTLSNVVVNGVTGISADV 238 (390) Q Consensus 159 ~~~~~~~~~a~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~p~s~~vAg~~a~~d~~~g~~~span~~l~gv~~~~~~~ 238 (390) ++..+.+++.++++.+++.++++||||++++++.++...++|||+++||+++++|.++|||+||||++|.||.++++++ T Consensus 161 -~~~~~~~~a~~~~~~~~s~~~~~~~p~~~v~~~~~~~~~~~p~s~~~ag~~a~~D~~~G~~~spaN~~l~gv~~~~~~~ 239 (386) T protein:vir:10 161 -WSNTTDAAAITYRELFGSRRCEVVDPWYKVWDVETSAHIIQPPSARHAGVMAKVHNTLGFWWSNSNQEILGIDGLCRPV 239 (386) T ss_pred -CCCCchHHHHHhhhcccccceEEecCceeeeccccccceeechHHHHHHHHHHhhhcCCcEEccCCceeecccccceec Confidence 4677788999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred chhhhccccccccccccceeEEEcCCCEEEEccccCCCCcccceeeehhhHHHHHHHHHHHHHHHhcCCCCHHHHHHHHH Q lcl|NC_015266. 239 SWDLQDPATDAGYLNENQVTTLVNRNGFRFWGSRTCDADGKFFFENYTRSAQVIADTIAEEQMGVVDGPLNPSRARDIIE 318 (390) Q Consensus 239 ~~~~~~~~~~~~~l~~~gI~~~~~~~G~~~wG~rT~~~d~~~~~i~vrR~~~~i~~~l~~~~~~~v~e~n~~~~~~~i~~ 318 (390) ++.+++.+++++.||.+||+++++++|+++||+||+++|++|+||++||++++|+++|+++++|++||||++.+|++|++ T Consensus 240 ~~~~~~~~~~~~~l~~~gi~~~~~~~G~~~wG~rT~~~d~~~~~i~vrR~~~~i~~~~~~~~~~~v~e~~~~~~~~~i~~ 319 (386) T protein:vir:10 240 DFKLDDPTCRANLLNAKEVTTTIQQNGFRVWGDRTCSADSKWAFKNVVITNDMIADSLVRNHLWAVDRNITKTYVEDVTE 319 (386) T ss_pred ccccccCcchhhhhhhcCcEEEEcCCCEEEEcccccCCCcccceeehhhHHHHHHHHHHHHHHHhccCCCCHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHhCCceeeeEEEEecCCCCHHHHhCCeEEEEEEEEecccceEEEEEEEEcchHHHHHH Q lcl|NC_015266. 
319 NINAWFRREVSVGELIGGGAWYDPEPNTTDELTSGGTWIDYDYTPVPPLENLKLRQRITDRYLADFA 385 (390) Q Consensus 319 ~i~~~L~~l~~~g~l~g~~v~~d~~~nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~l~ 385 (390) ++++||++||++|+|+||+|+||+++||++++++|+|+++|+++|++|+|||+|+++++++||++|+ T Consensus 320 ~i~~~L~~l~~~g~l~g~~v~~d~~~nt~~~~~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~~~ 386 (386) T protein:vir:10 320 GVNNYLRHLKNIGAIAGGECWVDPELNSPDQIQQGKVYFDYDFSAYAPAEHITFRSHMVNGYLTEVV 386 (386) T ss_pred HHHHHHHHHHhCCceeeeEEEEcccCCCHHHhhCCeEEEEEEEEecCCceeEEEEEEEehhHHHhhC Confidence 9999999999999999999999999999999999999999999999999999999999999999999 No 13 >protein:vir:107865 Length: 477 # NCBI annotation: gp39 # Family: family:all:115 # MgeID: mge:1565 # MgeName: BcepMu # Cross-refs: genbank:acc:YP_024712;genbank:gi:48696949;genbank:GeneID:2845953 Probab=100.00 E-value=1.7e-102 Score=578.50 Aligned_cols=383 Identities=32% Similarity=0.503 Sum_probs=348.3 Q ss_pred CCCccCCCEEEEECCCCCccccccccccceeeeccccccccceeccceEEEechhHHHH--hhccccchHHHHHhhhccc Q lcl|NC_015266. 1 MPQDYHHGVRVIEINEGGRPIRTVSTAVLGIVCTGADADPATFPLDTPVLLTNVIAALG--KAGTKGTLRRTLDAIGKQT 78 (390) Q Consensus 1 Ma~~~~hGV~v~ev~~~~~~i~~v~tavig~vgta~~~~~~~~~~~~~~~i~~~~~~~~--~~~~~gtl~~al~~~~~~~ 78 (390) ||++|+||||++|++++++++..++|++++|+|+++.+ |+|+|++++++.++.. .....++|.+++.++|.++ T Consensus 1 M~~~~~pGVyv~E~~~~~~~i~~v~T~v~~~VG~a~~g-----p~n~pv~its~~d~~~~g~~~~~~tL~~Av~~~f~nG 75 (477) T protein:vir:10 1 MAANYLHGVETIEKETGSRPVKVVKSAVIGLIGTAPIG-----PVNTPVQSLSDVDAAQFGPQLAGFTIPQALDAVYDYG 75 (477) T ss_pred CcccCCCCeEEEEccCCcccccccCCceeEEEecccCC-----CCCcCEEEccHHHHHHhccCCCCCcHHHHHHHHHhcc Confidence 99999999999999999999999999999999999865 7889999999988754 2346789999999999999 Q ss_pred CceEEEEeeccccccccc-------------------------------------------------------------- Q lcl|NC_015266. 
79 KPVTVVVRVAEGKDEAET-------------------------------------------------------------- 96 (390) Q Consensus 79 ~~~~~vv~v~~~~~~~~~-------------------------------------------------------------- 96 (390) +..++++++.+......+ T Consensus 76 g~~~~vVrV~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 155 (477) T protein:vir:10 76 SGTVIVINVLDPAVHKSNAANEPVTFDAATGRAKLAHPAAANLVLKNDSGGTTYAEGTDYAVDLINGVITRIKTGTIPPG 155 (477) T ss_pred ceEEEEEecCccccccccccccccccccccceecccccccccccccccccccccccchhhhhhhccccceeccccccccc Confidence 999999998654321100 Q ss_pred -----------------cccchhhhccchhhhhhhhhhhhhhhhhhhhhhhhhhhhcch-HHHHHHHHhhhhcceEEeec Q lcl|NC_015266. 97 -----------------TANVIGTVTPDGKYTGMKALLAAQGKLAVKPRILVAPGLDTQ-PVAAAFATIAQSLRAMVYVA 158 (390) Q Consensus 97 -----------------~~~~~~~~~~~~~~tgl~~~~~~~~~~~~~p~~~~apg~~~~-~v~~al~~~~~~~~~~~~~d 158 (390) ..++.+.++.++..+|+++++..++.++..|.++.+||++++ +|.++|.++|++++++.++| T Consensus 156 ~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~tGl~al~~~~~~~~~~~~~l~apg~~~~~~v~~~l~~~~~~~~~~~~~d 235 (477) T protein:vir:10 156 ATAAKATYDYADPTKVTAADIIGAVNAAGMRTGMKALKDTYNLYGYFSKILIAPAYCTQNSVSVELEAMAVQLGAIAYID 235 (477) T ss_pred ceeeeeccccccccccccccccccccccchhhhhhhhhhhhhhcchhcccccccccccchhhHHHHHHHHhhCCEEEEEe Confidence 001122234455678999999999999999999999999865 59999999999999999999 Q ss_pred ccccCchHHHHHHhh-------hhccceEEEEeeeeEEEeeccCceeEecHHHHHHHHHhhhhhccceeecccCceeecc Q lcl|NC_015266. 
159 AHGCKTKEEAVAYRK-------QFGQREIMVIWPDWLGWDDITNSTVAIPAPAIAAGLRAKIDNDIGWHKTLSNVVVNGV 231 (390) Q Consensus 159 ~~~~~~~~~a~~~~~-------~~~~~~~~~~~p~~~~~~~~~~~~~~~p~s~~vAg~~a~~d~~~g~~~span~~l~gv 231 (390) .|...+.+++.+|++ +++|++++++|||++++++.++..+++|||+++||++|++|+++|||+||+|++|.|+ T Consensus 236 ~p~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~~ag~~a~~d~~~g~~~span~~~~gi 315 (477) T protein:vir:10 236 APIGTTLAQALAGRGPAGTINFNTSSDRVRLCYPHVKVYDTATNAERLEPLSSRAAGLRARVDLDKGYWWSSSNQQLVGV 315 (477) T ss_pred cCCCCCHHHHHhhhhhccccccccccceEEEEcCeEEEecccCCceeEEchHHHHHHHHHHhhhcCCceeccCCceeccc Confidence 998888899999887 4678899999999999999999999999999999999999999999999999999999 Q ss_pred cccccccchhhhccccccccccccceeEEEcC--CCEEEEccccCC---CCcccceeeehhhHHHHHHHHHHHHHHHhcC Q lcl|NC_015266. 232 TGISADVSWDLQDPATDAGYLNENQVTTLVNR--NGFRFWGSRTCD---ADGKFFFENYTRSAQVIADTIAEEQMGVVDG 306 (390) Q Consensus 232 ~~~~~~~~~~~~~~~~~~~~l~~~gI~~~~~~--~G~~~wG~rT~~---~d~~~~~i~vrR~~~~i~~~l~~~~~~~v~e 306 (390) .++++++++.+++..+|++.||.+||++++++ +|+++||+||++ .|+.|+|+++||++++|+++|++.+++++|| T Consensus 316 ~~~~~~~~~~~~~~~~~~~~L~~~gi~~i~~~~~~G~~~wG~rT~~~~~~~~~~~~~~vrR~~~~i~~~~~~~~~~~v~~ 395 (477) T protein:vir:10 316 TGVERPLSAMIDDPQSDVNMLNEQGITTVFSSYGSGLRLWGNRTAAWPTVTHMRNFENVRRTGDVINESLRYFSQQFVDA 395 (477) T ss_pred cccccccccccCCChhhHHHHhhCCceEEEEecCCcEEEEcccccCCCCCCcccceeehhhHHHHHHHHHHHHHHHhccC Confidence 99999999999999999999999999999875 799999999994 4678999999999999999999999999999 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEecCCCCHHHHhCCeEEEEEEEEecccceEEEEEEEEcchHHHHHHH Q lcl|NC_015266. 307 PLNPSRARDIIENINAWFRREVSVGELIGGGAWYDPEPNTTDELTSGGTWIDYDYTPVPPLENLKLRQRITDRYLADFAS 386 (390) Q Consensus 307 ~n~~~~~~~i~~~i~~~L~~l~~~g~l~g~~v~~d~~~nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~l~~ 386 (390) ||++.+|++|+++|+.||++||++|+|+||+|+||+++||++||++|+|+++|+++|++|+|||+|+++++++||++|+. 
T Consensus 396 ~~~~~~~~~i~~~i~~~l~~l~~~g~l~g~~v~~~~~~nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~~~~ 475 (477) T protein:vir:10 396 PIDQGLIDSLVESVNGFGRKLIGDGALLGFKAWFDPARNPKEELAAGHLLINYKYTVPPPLERLTYETEITSEYLLTLKG 475 (477) T ss_pred CCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEecCCCCHHHhhCCeEEEEEEEEecCCcceEEEEEEEcchHHhhhhc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999998887 Q ss_pred Hh Q lcl|NC_015266. 387 RV 388 (390) Q Consensus 387 ~~ 388 (390) -- T Consensus 476 g~ 477 (477) T protein:vir:10 476 GN 477 (477) T ss_pred CC Confidence 66 No 14 >protein:vir:79092 Length: 477 # NCBI annotation: gp13, phage tail sheath protein # Family: family:all:115 # MgeID: mge:1862 # MgeName: phiE255 # Cross-refs: genbank:acc:YP_001111213;genbank:gi:134288804;genbank:GeneID:4960766 Probab=100.00 E-value=1e-101 Score=574.21 Aligned_cols=383 Identities=32% Similarity=0.496 Sum_probs=346.2 Q ss_pred CCCccCCCEEEEECCCCCccccccccccceeeeccccccccceeccceEEEechhHHHHh--hccccchHHHHHhhhccc Q lcl|NC_015266. 1 MPQDYHHGVRVIEINEGGRPIRTVSTAVLGIVCTGADADPATFPLDTPVLLTNVIAALGK--AGTKGTLRRTLDAIGKQT 78 (390) Q Consensus 1 Ma~~~~hGV~v~ev~~~~~~i~~v~tavig~vgta~~~~~~~~~~~~~~~i~~~~~~~~~--~~~~gtl~~al~~~~~~~ 78 (390) ||++|+||||++|+++++++|..++|++++|+|+++.+ |+|+|++++++.++... .+..++|..++.++|.++ T Consensus 1 M~~~~~pGVyv~E~~~g~~~I~~v~Tsv~~~VG~a~~~-----p~n~pv~its~~d~~~~g~~~~~~tL~~Av~~~f~ng 75 (477) T protein:vir:79 1 MAANYLHGVETIEKETGSRPVKVVKSAVIGLIGTAPIG-----PVNTPVQSLSDVDAAQFGPQLAGFTIPQALDAVYDYG 75 (477) T ss_pred CcCCCCCCeEEEEecCCcccccccCCceEEEEeecccC-----CCcccEEEccHHHHHHhcCCCCCCcHHHHHHHHhhcC Confidence 99999999999999999999999999999999999866 78999999999887643 346789999999999999 Q ss_pred CceEEEEeecccccccccc------------------------------------------------------------- Q lcl|NC_015266. 
79 KPVTVVVRVAEGKDEAETT------------------------------------------------------------- 97 (390) Q Consensus 79 ~~~~~vv~v~~~~~~~~~~------------------------------------------------------------- 97 (390) +..|+++++.++....... T Consensus 76 g~~~~vvrV~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 155 (477) T protein:vir:79 76 SGTVIVINVLDPAVHKSNAASESVTFDAATGRAKLAHPAAANLVLKNDSGGTTYTEGTDYAVDLINGVITRIKTGTIPAA 155 (477) T ss_pred CceEEEEeccCCccccccccccccccccccccccccccccceeEEeecccccccccCccccccccchhhhhhhccccccc Confidence 9999999986544321100 Q ss_pred ------------------ccchhhhccchhhhhhhhhhhhhhhhhhhhhhhhhhhhcch-HHHHHHHHhhhhcceEEeec Q lcl|NC_015266. 98 ------------------ANVIGTVTPDGKYTGMKALLAAQGKLAVKPRILVAPGLDTQ-PVAAAFATIAQSLRAMVYVA 158 (390) Q Consensus 98 ------------------~~~~~~~~~~~~~tgl~~~~~~~~~~~~~p~~~~apg~~~~-~v~~al~~~~~~~~~~~~~d 158 (390) .+..+.++..+..+|++++...+...+..|.++.+||+++. +|.++|.++|+++++++++| T Consensus 156 ~~~~~~~~~~~~~~~~~~~~~~g~~~a~~~~tg~~al~~~~~~~~~~~~iv~apg~~~~~~v~~~l~~~~~~~~~~a~~d 235 (477) T protein:vir:79 156 ATAAKATYDYADPTKVTAADIIGAVNAAGMRTGMKALKDTYNLYGYFSKILIAPAYCTQNSVSVELEAMAVQLGAIAYID 235 (477) T ss_pred cceeeceeccCCcccceeeeecccccccccchhhhhhhhhhhhcccccceeeccccccchhHHHHHHHHHhhcCeEEEEe Confidence 00112222345578888999999999999999999999754 59999999999999999999 Q ss_pred ccccCchHHHHHHhhh-------hccceEEEEeeeeEEEeeccCceeEecHHHHHHHHHhhhhhccceeecccCceeecc Q lcl|NC_015266. 159 AHGCKTKEEAVAYRKQ-------FGQREIMVIWPDWLGWDDITNSTVAIPAPAIAAGLRAKIDNDIGWHKTLSNVVVNGV 231 (390) Q Consensus 159 ~~~~~~~~~a~~~~~~-------~~~~~~~~~~p~~~~~~~~~~~~~~~p~s~~vAg~~a~~d~~~g~~~span~~l~gv 231 (390) .+.+.+.+++.+|++. 
++|.+++++|||++++++.++..+++|||+++||+++++|+++|||+||+|++|.|+ T Consensus 236 ~p~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~~~~~~~~~~~p~s~~~ag~~a~~d~~~g~~~span~~~~gv 315 (477) T protein:vir:79 236 APIGTTLAQALAGRGPAGTINFNTSSDRVRLCYPHVKVYDIATNAERLEPLSSRAAGLRARVDLDKGYWWSSSNQQLVGV 315 (477) T ss_pred cCCCCChHHHhhhhhhccccccccccceEEEEcCeeEEecccCCceeeechHHHHHHHHHHhhccCCceEccCCceeecc Confidence 9988888888888864 678999999999999999999999999999999999999999999999999999999 Q ss_pred cccccccchhhhccccccccccccceeEEEcC--CCEEEEccccCC---CCcccceeeehhhHHHHHHHHHHHHHHHhcC Q lcl|NC_015266. 232 TGISADVSWDLQDPATDAGYLNENQVTTLVNR--NGFRFWGSRTCD---ADGKFFFENYTRSAQVIADTIAEEQMGVVDG 306 (390) Q Consensus 232 ~~~~~~~~~~~~~~~~~~~~l~~~gI~~~~~~--~G~~~wG~rT~~---~d~~~~~i~vrR~~~~i~~~l~~~~~~~v~e 306 (390) .++++++++.+++..+|++.||.+||++++++ +|+++||+||++ .++.|+||++||++++|+++|++.++|++|| T Consensus 316 ~~~~~~~~~~~~~~~~~~~~L~~~~i~~i~~~~~~G~~~wG~rT~~~~~~~~~~~~i~vrR~~~~i~~~~~~~~~~~v~e 395 (477) T protein:vir:79 316 TGVERPLSAMIDDPQSDVNMLNEQGITTVFSSYGSGLRLWGNRTAAWPTVTHMRNFENVRRTGDVINESLRYFSQQFVDA 395 (477) T ss_pred eecccccccccCCChhhHHHHhhCCceEEEEecCCcEEEEcccccCCCCCCccceeeehhhHHHHHHHHHHHHHHHhccC Confidence 99999999999999999999999999999875 799999999994 4678999999999999999999999999999 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEecCCCCHHHHhCCeEEEEEEEEecccceEEEEEEEEcchHHHHHHH Q lcl|NC_015266. 307 PLNPSRARDIIENINAWFRREVSVGELIGGGAWYDPEPNTTDELTSGGTWIDYDYTPVPPLENLKLRQRITDRYLADFAS 386 (390) Q Consensus 307 ~n~~~~~~~i~~~i~~~L~~l~~~g~l~g~~v~~d~~~nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~l~~ 386 (390) ||++.+|++|+++|+.||++||++|+|+||+|+||+++||+++|++|+|+++|+++|++|+|||+|+++++++||++|+. 
T Consensus 396 ~~~~~~~~~i~~~i~~~l~~l~~~g~l~g~~v~~~~~~nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~~~~ 475 (477) T protein:vir:79 396 PIDQGLIDSLVESVNGFGRKLIGDGALLGFKAWFDPARNPKEELAAGHLLINYKYTVPPPLERLTYETEITSEYLLTLKG 475 (477) T ss_pred CCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEecCCCCHHHhhCCeEEEEEEEEecCCceeEEEEEEEechHHhhhcc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999998776 Q ss_pred Hh Q lcl|NC_015266. 387 RV 388 (390) Q Consensus 387 ~~ 388 (390) -- T Consensus 476 ~~ 477 (477) T protein:vir:79 476 GN 477 (477) T ss_pred CC Confidence 65 No 15 >protein:vir:96740 Length: 388 # NCBI annotation: phage tail sheath protein FI-like # Family: family:all:115 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039839;genbank:gi:126010913;genbank:GeneID:5076250 Probab=100.00 E-value=1.3e-99 Score=562.74 Aligned_cols=374 Identities=24% Similarity=0.361 Sum_probs=342.2 Q ss_pred CCC--ccCCCEEEEECCCCCccccccccccceeeeccccccccceeccceEEEechhHHHHh---hccccchHHHHHhhh Q lcl|NC_015266. 1 MPQ--DYHHGVRVIEINEGGRPIRTVSTAVLGIVCTGADADPATFPLDTPVLLTNVIAALGK---AGTKGTLRRTLDAIG 75 (390) Q Consensus 1 Ma~--~~~hGV~v~ev~~~~~~i~~v~tavig~vgta~~~~~~~~~~~~~~~i~~~~~~~~~---~~~~gtl~~al~~~~ 75 (390) |+. +|+||||++|++++++++.++++++++++|+++++++. +++++|+++.+..+.... ....+++..++..++ T Consensus 1 m~~~~~~~hGv~v~ev~~g~~~i~~~~tavi~~Vgta~~ad~~-~p~~~~~~i~~~~d~~~~~~~~~~~gtl~~al~~~~ 79 (388) T protein:vir:96 1 MPVIDQFEHNGISIETHEPPPPMGPPGDNVVAWVVTAPDKHAD-VAFSVPFRVANTADAQYLDSTGNELGTGWHAASETL 79 (388) T ss_pred CCCCCCCCCceEEEEcCCCcccccccCcceeEEEEecCCCccc-cccccceeeecchhhhhhhccccccccchhhhHhhh Confidence 994 79999999999999999999999999999999998875 789999998877665443 345789999999999 Q ss_pred cccCceEEEEeeccccccccccccchhhhcc-chhhhhhhhhhhhhhhhhhhhhhhhhhhhcc-hHHHHHHHHhhhhcce Q lcl|NC_015266. 
76 KQTKPVTVVVRVAEGKDEAETTANVIGTVTP-DGKYTGMKALLAAQGKLAVKPRILVAPGLDT-QPVAAAFATIAQSLRA 153 (390) Q Consensus 76 ~~~~~~~~vv~v~~~~~~~~~~~~~~~~~~~-~~~~tgl~~~~~~~~~~~~~p~~~~apg~~~-~~v~~al~~~~~~~~~ 153 (390) ++++..++++++....+...+..+++++.+. ++.++|++++.+.+ ..|+++.+||+++ ++|+++|.++|+++++ T Consensus 80 ~~~~~~~~vv~v~~g~~~~at~a~iig~~~~~tg~~~gl~al~~~~----~~p~il~aPg~s~~~~v~~al~~~~~~~~~ 155 (388) T protein:vir:96 80 KKTSVPQYFIVVPEGADDAATMANIIGGIDPTTGRRTGIAALTECT----ERPTLIGAPGFSQNKAVIDALASMAKRLKC 155 (388) T ss_pred ccCCceEEEEEeccccccccccceeeeecccccchhhHHHHhhhcc----cceeEEEeeccccchHHHHHHHHHHhhcCc Confidence 9999999999999999999999999888764 56777887776654 4689999999976 5799999999999999 Q ss_pred EEeecccccCchHHHHHHhh-----hhccceEEEEeeeeEEEeeccCceeEecHHHHHHHHHhhhhhccceeecccCcee Q lcl|NC_015266. 154 MVYVAAHGCKTKEEAVAYRK-----QFGQREIMVIWPDWLGWDDITNSTVAIPAPAIAAGLRAKIDNDIGWHKTLSNVVV 228 (390) Q Consensus 154 ~~~~d~~~~~~~~~a~~~~~-----~~~~~~~~~~~p~~~~~~~~~~~~~~~p~s~~vAg~~a~~d~~~g~~~span~~l 228 (390) +.++|.|.+ +.+++.+++. +++|.++++||||++++|+.++..+++|||+++||++|++| +|+||||+++ T Consensus 156 ~~i~D~p~~-~~~~~~~~~~~~~~~~~~s~~~~~~~P~~~~~d~~~~~~~~~p~s~~~AG~~a~~D----~~~spaN~~i 230 (388) T protein:vir:96 156 RAVIDGPSG-STQDAIDLSGLLGGEGTGHDRVYMVDPMPAIYSRKAQGNIYVPPSTIAMGAVAAVK----PWESPGNQGV 230 (388) T ss_pred EEEEeccCC-chhHHHHHHhhhhccCcCcceEEEEeCceeeecccCCceeeechHHHHHHHHHhhc----CcccccCeeE Confidence 999998754 4455655543 57889999999999999999999999999999999999999 5999999998 Q ss_pred ecccccccccchhhhccccccccccccceeEEEcC--CCEEEEccccCCCCcccceeeehhhHHHHHHHHHHHHHHHhcC Q lcl|NC_015266. 
229 NGVTGISADVSWDLQDPATDAGYLNENQVTTLVNR--NGFRFWGSRTCDADGKFFFENYTRSAQVIADTIAEEQMGVVDG 306 (390) Q Consensus 229 ~gv~~~~~~~~~~~~~~~~~~~~l~~~gI~~~~~~--~G~~~wG~rT~~~d~~~~~i~vrR~~~~i~~~l~~~~~~~v~e 306 (390) ++.|+++++++..++..+|++.||++||++++++ +|+++||+||++ |+||++||+++||+++|++.++|++|| T Consensus 231 -~i~g~~~~~~~~~~~~~~~~~~Ln~~gI~~i~~~~~~G~~~wG~rT~~----~~~i~vrR~~~~i~~si~~~~~~~v~e 305 (388) T protein:vir:96 231 -LIQDVARVIDYNILDKSTEGDLLNRNGVSYFARTSMGGFSLIGNRTVT----GKFISFVGLEDAIARKLEAASQRAMSK 305 (388) T ss_pred -EeeeecccccccccCChhhHHhhhhcCceEEEEecCCcEEEEcccccC----CcceeehhhHHHHHHHHHHHHHHhccC Confidence 5999999999999999999999999999999874 799999999996 999999999999999999999999999 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEecCCCCHHHHhCCeEEEEEEEEecccceEEEEEEEEcchHHHHHHH Q lcl|NC_015266. 307 PLNPSRARDIIENINAWFRREVSVGELIGGGAWYDPEPNTTDELTSGGTWIDYDYTPVPPLENLKLRQRITDRYLADFAS 386 (390) Q Consensus 307 ~n~~~~~~~i~~~i~~~L~~l~~~g~l~g~~v~~d~~~nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~l~~ 386 (390) ||++.+|++|+++|+.||++||++|+|+||+++||+++||+++|++|+|+++|+++|++|+|||+|+++++++|+++|++ T Consensus 306 pn~~~~~~~i~~~i~~fL~~l~~~Gal~g~~~~~d~~~nt~~~i~~G~~~~~i~~~p~~pae~I~~~~~~~~~~~~~~~~ 385 (388) T protein:vir:96 306 QLTKSFMEQEIKKINLFMQDLVAAEIIPGGEVYLHPTLNTVERYKNGSWYIVIDYGRYSPNEHMIFHLNAVDRIVEEFIE 385 (388) T ss_pred CCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEecCCCCHHHhhCCEEEEEEEEEecCCcceEEEEEEEchHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred Hhc Q lcl|NC_015266. 387 RVS 389 (390) Q Consensus 387 ~~~ 389 (390) +|. 
T Consensus 386 ~~~ 388 (388) T protein:vir:96 386 EVL 388 (388) T ss_pred HhC Confidence 999 No 16 >protein:vir:80984 Length: 666 # NCBI annotation: gp18 tail sheath protein # Family: family:all:661 # MgeID: mge:1888 # MgeName: Phi1 # Cross-refs: genbank:acc:YP_001469499;genbank:gi:157311456;genbank:GeneID:5602117 Probab=100.00 E-value=1.9e-86 Score=490.48 Aligned_cols=380 Identities=14% Similarity=0.092 Sum_probs=300.2 Q ss_pred CCccCCCEEEEECCCCCccccccccccceeeeccccccccceeccceEEEechhHHHHhhcc---ccchHHHHHhhhccc Q lcl|NC_015266. 2 PQDYHHGVRVIEINEGGRPIRTVSTAVLGIVCTGADADPATFPLDTPVLLTNVIAALGKAGT---KGTLRRTLDAIGKQT 78 (390) Q Consensus 2 a~~~~hGV~v~ev~~~~~~i~~v~tavig~vgta~~~~~~~~~~~~~~~i~~~~~~~~~~~~---~gtl~~al~~~~~~~ 78 (390) -+++.|||||+|+ ++++++..+.|++.+|+|.++.+ |+++|++++++.++...||. ...+..++..+|.++ T Consensus 1 ~~~~~Pgvyv~e~-~~~~~~~~~~t~~~~~vg~~~~g-----p~~~p~~i~~~~~~~~~fg~~~~~~~~~~~~~~~f~~~ 74 (666) T protein:vir:80 1 MTLLSPGFETKET-TLSTTIVQSATGRAALVGKFQWG-----PAFQIIQVTNEVELVNKFGQPDNNTADYFMSGANFLQY 74 (666) T ss_pred CceecCceEEEEe-cCCccccccCcccceEEeccccC-----CCccceEecCHHHHHHhcCCccCccchHHHHHHHHhcC Confidence 4556799999999 68999999999999999998755 67899999999999887763 456778899999999 Q ss_pred CceEEEEeeccccccccc----------------------------------------c---c----------------- Q lcl|NC_015266. 79 KPVTVVVRVAEGKDEAET----------------------------------------T---A----------------- 98 (390) Q Consensus 79 ~~~~~vv~v~~~~~~~~~----------------------------------------~---~----------------- 98 (390) |..++++++......+.. . . T Consensus 75 g~~~~v~R~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~i~~~~~~~~~~~~ta~~~~~a 154 (666) T protein:vir:80 75 GNDLRVVRVLNKEKAKNATALAGNIEFEITNEGSNYEVGDTIKIKHNRQDIETAGKVTKVDGDGKVKGVFIPTGKIIAHA 154 (666) T ss_pred CCeEEEEEecCccccccccccccceeEEEeeccccccccccccccccCcccccCcceEEEeecceeeeeecchhhhcccc Confidence 999999987432110000 0 0 Q ss_pred ---------------cc-------------hhhhc-cc----------h--------------------------hh--- Q lcl|NC_015266. 
99 ---------------NV-------------IGTVT-PD----------G--------------------------KY--- 110 (390) Q Consensus 99 ---------------~~-------------~~~~~-~~----------~--------------------------~~--- 110 (390) .+ .+... .. . .. T Consensus 155 ~~~~~~~~v~~~~~~~~~~~~~~~~~a~~V~~~~~~~~~~~~~~~~a~~~~t~~~~~~~~~~~~~~a~~a~~~g~~g~~l 234 (666) T protein:vir:80 155 KAIGVYPELDGDWTAEFTSSSGNGSAALSVTKIVTDSGLLLTDLETSRANITNQTFLTKLQKYDMPAVSAIYAGEIGNSL 234 (666) T ss_pred ccccccceeeccceeeeccccccceeeeeeeeeecCCccceeeeccccccccccccccccccccchhhhhhcccccccce Confidence 00 00000 00 0 00 Q ss_pred -------------------------------------------------hhh--h------------------hhh---- Q lcl|NC_015266. 111 -------------------------------------------------TGM--K------------------ALL---- 117 (390) Q Consensus 111 -------------------------------------------------tgl--~------------------~~~---- 117 (390) .|. + ... T Consensus 235 ~v~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~e~~~~~~~~~~~~~~~~~~~~~~~~~ 314 (666) T protein:vir:80 235 EVEILARSAFKNTAPDLTMYPYGGERTAARNLIPYAPQNDNQYAFIVRRDGVVVESYVLSTLKGDKDVYGNSIYMDDFFG 314 (666) T ss_pred eeeeccccccccccccceeeeccccccccceeeeeccccccceeeEeccCCccceeeecccccccccccchhhhhhhhhc Confidence 000 0 000 Q ss_pred hhh-------------h--------------------------------hhh-------hhhhhhhhhhhc-----chHH Q lcl|NC_015266. 118 AAQ-------------G--------------------------------KLA-------VKPRILVAPGLD-----TQPV 140 (390) Q Consensus 118 ~~~-------------~--------------------------------~~~-------~~p~~~~apg~~-----~~~v 140 (390) +.. . 
..+ ....++.+|+++ .+++ T Consensus 315 ~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~l~~p~~~~~~~~~~~v 394 (666) T protein:vir:80 315 RGSSQYIYATAQGWVDGFSGIISLAGGVSANEATTGGVGADPFIGAMMQGWGLFAERESIHVNLLIAGACAGEGDAFSTV 394 (666) T ss_pred cccceeeeecccccccccceEEEecCCCCcccccccccccccccccchhhhhhhhhhcccccceEeecCcCCcccchHHH Confidence 000 0 000 001233344443 3468 Q ss_pred HHHHHHhhhhcceE---------EeecccccCchHHHHHHhhh----------hccceEEEEeeeeEEEeeccCceeEec Q lcl|NC_015266. 141 AAAFATIAQSLRAM---------VYVAAHGCKTKEEAVAYRKQ----------FGQREIMVIWPDWLGWDDITNSTVAIP 201 (390) Q Consensus 141 ~~al~~~~~~~~~~---------~~~d~~~~~~~~~a~~~~~~----------~~~~~~~~~~p~~~~~~~~~~~~~~~p 201 (390) +.++.++|++++++ .++|.++..+.+++.+|++. ++|.|+++||||++++|+.++..+++| T Consensus 395 ~~~~~~~~~~~~~~~~~~~~~~~~~vd~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~l~~p~~~~~d~~~~~~~~~p 474 (666) T protein:vir:80 395 QKHAVSIGDERQDCLVMVSPPRSTVVNIPVTTAIDNLIAWREGSGNYNENNMNINTTYAVIDGNYKYQYDKYNDVNRWVP 474 (666) T ss_pred HHHHHHHHHhhcceEEEeecceeEEeecCCCCCHHHHHHHHHhcccchhhhcccCcceEEEEcCceEEecccCCceeEec Confidence 88999999998743 35566677889999999975 678999999999999999999999999 Q ss_pred HHHHHHHHHhhhhhccceeecccCceeecccccccccchhhhccccccccccccceeEEEcC--CCEEEEccccCCCC-c Q lcl|NC_015266. 202 APAIAAGLRAKIDNDIGWHKTLSNVVVNGVTGISADVSWDLQDPATDAGYLNENQVTTLVNR--NGFRFWGSRTCDAD-G 278 (390) Q Consensus 202 ~s~~vAg~~a~~d~~~g~~~span~~l~gv~~~~~~~~~~~~~~~~~~~~l~~~gI~~~~~~--~G~~~wG~rT~~~d-~ 278 (390) ||+++||+++|+|.++|||+||||+++.++.+... 
........|++.||.+|||+++++ +|+++||+||++.+ + T Consensus 475 ~sg~~AGl~Ar~D~~~g~~~sPan~~~~~i~g~~~---~~~~~~~~~~~~Ln~~gIn~i~~~~g~G~~~wG~rT~~~~~s 551 (666) T protein:vir:80 475 LAADIAGLCARTDAVSQPWMSPAGYNRGQIMNVVK---LAIEPRKAHRDRLYQAAINPVIGAGGEGFILMGDKTATTVPS 551 (666) T ss_pred hHHHHHHHHHHHhhcCCceEccCCeecceeecccc---ceeecChhHHHhhhhCCeeEEEEeCCCeEEEEccccCCCCCc Confidence 99999999999999999999999998665554321 223334567888999999999864 68999999999866 5 Q ss_pred ccceeeehhhHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEecCCCCHHHHhCCeEEEE Q lcl|NC_015266. 279 KFFFENYTRSAQVIADTIAEEQMGVVDGPLNPSRARDIIENINAWFRREVSVGELIGGGAWYDPEPNTTDELTSGGTWID 358 (390) Q Consensus 279 ~~~~i~vrR~~~~i~~~l~~~~~~~v~e~n~~~~~~~i~~~i~~~L~~l~~~g~l~g~~v~~d~~~nt~~~i~~G~~~~~ 358 (390) +|+||+|||||+||+++|++.++|+|||||++.+|.+|+++|++||++||++|+|.||+|+||+++||+++|++|+|+++ T Consensus 552 ~~~~i~vRRl~~~i~~si~~~~~~~v~epn~~~l~~~i~~~i~~~L~~l~~~gal~g~~V~~d~~~nt~~di~~G~~~~~ 631 (666) T protein:vir:80 552 PFDRINVRRLFNMLKKNIGDSSKYKLFENNDNFTRASFRMEVSQYLSTIRSLGGIYDFRVQCDTTNNTPDVIDRNEFVAS 631 (666) T ss_pred ccceeehhhHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhcCceeeeEEEEcCCCCCHHHhhCCeEEEE Confidence 89999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEecccceEEEEEEEEcch--HHHHHHHHhcC Q lcl|NC_015266. 359 YDYTPVPPLENLKLRQRITDR--YLADFASRVSA 390 (390) Q Consensus 359 i~~~p~~p~e~i~~~~~~~~~--~~~~l~~~~~a 390 (390) |+++|++|+|||+|++..... 
.|++++++|++ T Consensus 632 i~~~P~~Pae~I~~~~~~~~~~~~~~e~~~~~~~ 665 (666) T protein:vir:80 632 MFIKPAKSINYIMLNFTAVATGSDFDEIIGPVNQ 665 (666) T ss_pred EEEEecCCcceEEEEEEEeecCccHHHHHHHHhc Confidence 999999999999999987655 79999999999 No 17 >protein:vir:6594 Length: 666 # NCBI annotation: tail sheath protein # Family: family:all:661 # MgeID: mge:139 # MgeName: RB49 # Cross-refs: genbank:acc:NP_891725;genbank:gi:33620670;genbank:GeneID:1725344 Probab=100.00 E-value=7.3e-86 Score=487.34 Aligned_cols=380 Identities=13% Similarity=0.097 Sum_probs=301.9 Q ss_pred CCccCCCEEEEECCCCCccccccccccceeeeccccccccceeccceEEEechhHHHHhhcc---ccchHHHHHhhhccc Q lcl|NC_015266. 2 PQDYHHGVRVIEINEGGRPIRTVSTAVLGIVCTGADADPATFPLDTPVLLTNVIAALGKAGT---KGTLRRTLDAIGKQT 78 (390) Q Consensus 2 a~~~~hGV~v~ev~~~~~~i~~v~tavig~vgta~~~~~~~~~~~~~~~i~~~~~~~~~~~~---~gtl~~al~~~~~~~ 78 (390) -+++.|||||+|+ ++++++..+.|++.+|+|.++.+ |+++|++++++.++...||. ...+.+++..+|.++ T Consensus 1 ~~~~~PgVyv~e~-~~~~~i~~v~ts~~~fvG~~~~G-----p~~~p~~v~s~~~~~~~fG~~~~~~~~~~~~~~~f~ng 74 (666) T protein:vir:65 1 MTLLSPGFETKET-TLSTTIVQSETGRAALVGKFQWG-----PAFQIIQVTNEVELVNKFGQPDNNTADYFMSGANFLQY 74 (666) T ss_pred CceecCceEEEEe-cCcccccccCcccceEEecccCC-----CCccCEEecCHHHHHHHcCCccccchhHHHHHHHHHhc Confidence 4556799999999 68899999999999999998765 77899999999999888773 456788999999999 Q ss_pred CceEEEEeecccccccc----------------------------------------cc---c-c-----------c--- Q lcl|NC_015266. 79 KPVTVVVRVAEGKDEAE----------------------------------------TT---A-N-----------V--- 100 (390) Q Consensus 79 ~~~~~vv~v~~~~~~~~----------------------------------------~~---~-~-----------~--- 100 (390) +..|+++++........ .. . . 
+ T Consensus 75 g~~~~vvrv~~~~~~~~~~~~~~~~~~~~~~~g~~~~~g~~~~V~~~~~~~~~~~~~~~~~~~~~~~g~~~~t~~~~~~~ 154 (666) T protein:vir:65 75 GNDLRVVRVLNKEKAKNATALAGNVEFEITNEGSNYEVGDTIKIKHNRQDIETAGKVTKVDGDGKVKGVFIPTGKIIAHA 154 (666) T ss_pred CceEEEEEccCcccccccccccCceeeeEeeccccccccceEEEEecccccccccccccccccccccccccccceeeccc Confidence 99999988732210000 00 0 0 0 Q ss_pred ----------------------------------h-hhh----------ccc------------------------hh-- Q lcl|NC_015266. 101 ----------------------------------I-GTV----------TPD------------------------GK-- 109 (390) Q Consensus 101 ----------------------------------~-~~~----------~~~------------------------~~-- 109 (390) . ++. ... +. T Consensus 155 ~~~g~~~~l~~~~~~~~~~~~~~~~~a~sv~~~~~~g~~~~~~~~~a~~~~~~~~~~~~~~~~~~~a~~A~~~g~~g~~i 234 (666) T protein:vir:65 155 KAIGVYPELDGGWTAEFTSSSGNGSAALSVTKIVTDSGLLLTDLETSRANITNQTFLTKLKKYDMPAVSAIYAGEIGNSL 234 (666) T ss_pred cccCcceeEeeccceeecccCcccccceeeeecccccceeeeeecccccccccccccccccccccceeeeeeccccccce Confidence 0 000 000 00 Q ss_pred ------------------------------------------------hhh--hh----------------------hhh Q lcl|NC_015266. 110 ------------------------------------------------YTG--MK----------------------ALL 117 (390) Q Consensus 110 ------------------------------------------------~tg--l~----------------------~~~ 117 (390) ..| ++ .+. T Consensus 235 ~v~i~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~e~~~~~~~~~~~~~~~~~~~~~~~~~ 314 (666) T protein:vir:65 235 EVEILARSAFKNTAPDLTMYPYGGERTAARNLIPYAPQNDNQYAFIVRRDGVVVESYVLSTLKGDKDVYGNSIYMDDFFA 314 (666) T ss_pred eEEeecccccccccccccccccccccccceeeecccccccccceeeeecCCcccceeecccCcccccccchhhhhhhhhc Confidence 000 00 000 Q ss_pred h------------------------------------------------h----hhhhhhhhhhhhhhhhcc-----hHH Q lcl|NC_015266. 118 A------------------------------------------------A----QGKLAVKPRILVAPGLDT-----QPV 140 (390) Q Consensus 118 ~------------------------------------------------~----~~~~~~~p~~~~apg~~~-----~~v 140 (390) . . .......++++++|+++. 
.+| T Consensus 315 ~~~~~~v~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~l~~p~~~~~~~~~~~v 394 (666) T protein:vir:65 315 RGSSQYIYATAQGWVDGFSGIISLAGGVSANEATTGGVGADPFIGAMMQGWDLFAERESIHVNLLIAGACAGEGDAFSTV 394 (666) T ss_pred ccccceeeeecccccccccceEEccCCCCcCcccccccccccccccHHHHHHHHhhhhhccCCceeecCcCCccchhHHH Confidence 0 0 000001234455565543 578 Q ss_pred HHHHHHhhhhcceEEee---------cccccCchHHHHHHhhh----------hccceEEEEeeeeEEEeeccCceeEec Q lcl|NC_015266. 141 AAAFATIAQSLRAMVYV---------AAHGCKTKEEAVAYRKQ----------FGQREIMVIWPDWLGWDDITNSTVAIP 201 (390) Q Consensus 141 ~~al~~~~~~~~~~~~~---------d~~~~~~~~~a~~~~~~----------~~~~~~~~~~p~~~~~~~~~~~~~~~p 201 (390) +.+|..+|+++++++.+ |.++..+.+++++|++. ++|.|+++||||++++|+.++..+++| T Consensus 395 ~~~l~~~~~~~~~~~a~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p 474 (666) T protein:vir:65 395 QKHAVSIGDERQDCLVMVSPPRSTVVNIPVTTAIDNLIAWREGSGNYNENNMNINTTYAVIDGNYKYQYDKYNDVNRWVP 474 (666) T ss_pred HHHHHHHHhhccceEEEeccccceeeecCCCCCHHHHHHHHHhcccccccccccCcceEEEEcCceEEecccCCceeEec Confidence 89999999999876433 44557789999999975 668999999999999999999999999 Q ss_pred HHHHHHHHHhhhhhccceeecccCceeecccccccccchhhhccccccccccccceeEEEcC--CCEEEEccccCCCC-c Q lcl|NC_015266. 202 APAIAAGLRAKIDNDIGWHKTLSNVVVNGVTGISADVSWDLQDPATDAGYLNENQVTTLVNR--NGFRFWGSRTCDAD-G 278 (390) Q Consensus 202 ~s~~vAg~~a~~d~~~g~~~span~~l~gv~~~~~~~~~~~~~~~~~~~~l~~~gI~~~~~~--~G~~~wG~rT~~~d-~ 278 (390) ||+++||++||+|.++|||+||+|+++.++.+... 
+ .......|.+.||.+|||+++++ +|+++||+||++++ + T Consensus 475 ~sg~vAGl~Ar~D~~~g~~~span~~~~~i~g~~~-~--~~~~~~~~~~~Ln~~gIn~i~~~~~~G~~~wG~rT~~~~~s 551 (666) T protein:vir:65 475 LAADIAGLCARTDAVSQPWMSPAGYNRGQIMNVVK-L--AIEPRKAHRDRLYQAAINPVIGAGGEGFILMGDKTATTVPS 551 (666) T ss_pred hHHHHHHHHHHHhccCCcEEccCCeecceeecccc-c--eeecChhHHHhhhhCCceEEEEeCCCeEEEEecccCCCCCc Confidence 99999999999999999999999998766655422 1 22233457788999999999864 68999999999865 5 Q ss_pred ccceeeehhhHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEecCCCCHHHHhCCeEEEE Q lcl|NC_015266. 279 KFFFENYTRSAQVIADTIAEEQMGVVDGPLNPSRARDIIENINAWFRREVSVGELIGGGAWYDPEPNTTDELTSGGTWID 358 (390) Q Consensus 279 ~~~~i~vrR~~~~i~~~l~~~~~~~v~e~n~~~~~~~i~~~i~~~L~~l~~~g~l~g~~v~~d~~~nt~~~i~~G~~~~~ 358 (390) +|+||+|||||+||+++|++.++|++||||++.+|.+|+++|+.||++||++|+|.||+|+||+++||+++|++|+|+++ T Consensus 552 ~~~~i~vrR~~~~i~~si~~~~~~~v~epn~~~l~~~i~~~i~~~L~~l~~~gal~g~~V~~d~~~nt~~~i~~G~~~~~ 631 (666) T protein:vir:65 552 PFDRINVRRLFNMLKKNIGDSSKYKLFENNDNFTRASFRMEVSQYLSTIRSLGGIYDFRVQCDTTNNTPDVIDRNEFVAS 631 (666) T ss_pred ccceEehhhHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEcCCCCCHHHhhCCeEEEE Confidence 89999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEecccceEEEEEEEEcch--HHHHHHHHhcC Q lcl|NC_015266. 359 YDYTPVPPLENLKLRQRITDR--YLADFASRVSA 390 (390) Q Consensus 359 i~~~p~~p~e~i~~~~~~~~~--~~~~l~~~~~a 390 (390) |+++|++|+|||+|++..... 
.|+|+++++++ T Consensus 632 i~~~p~~pae~i~~~~~~~~~~~~~~e~~~~~~~ 665 (666) T protein:vir:65 632 MFIKPAKSINYIMLNFTAVATGSDFDEIIGPANQ 665 (666) T ss_pred EEEEecCCcceEEEEEEEeecCccHHHHHHHHhc Confidence 999999999999999988655 79999999999 No 18 >protein:vir:98263 Length: 664 # NCBI annotation: gp18 tail sheath monomer # Family: family:all:661 # MgeID: mge:1667 # MgeName: RB43 # Cross-refs: genbank:acc:YP_239196;genbank:gi:66391671;genbank:GeneID:3416365 Probab=100.00 E-value=7.6e-86 Score=487.24 Aligned_cols=375 Identities=14% Similarity=0.090 Sum_probs=300.6 Q ss_pred CCCccCCCEEEEECCCCCccccccccccceeeeccccccccceeccceEEEechhHHHHhhc---cccchHHHHHhhhcc Q lcl|NC_015266. 1 MPQDYHHGVRVIEINEGGRPIRTVSTAVLGIVCTGADADPATFPLDTPVLLTNVIAALGKAG---TKGTLRRTLDAIGKQ 77 (390) Q Consensus 1 Ma~~~~hGV~v~ev~~~~~~i~~v~tavig~vgta~~~~~~~~~~~~~~~i~~~~~~~~~~~---~~gtl~~al~~~~~~ 77 (390) |+ +.+|||||+|+ +++++|..+.|++.+|+|.++.+ |.++|++++++.++...|| +..++.+++.++|.+ T Consensus 1 ma-~~~PgVyv~E~-~~~~~i~~~~ts~~~~vG~~~~G-----p~~~p~~i~s~~d~~~~fG~~~~~~~~~~~v~~~f~n 73 (664) T protein:vir:98 1 MA-LQSPGIETKET-SVQSTVVRNSTGRAAIVGKFSWG-----PAYQIRQISNEVELVNYFGAPDNLTADYFMSAVNFLQ 73 (664) T ss_pred Cc-eecCceEEEec-CCCcccccccccceEEEeeccCC-----CCCccEEecCHHHHHHhcCCccccchhHHHHHHHHHh Confidence 99 66899999999 58999999999999999998755 7789999999999988776 445789999999999 Q ss_pred cCceEEEEeeccccccccc--------------------------------------------------------cccc- Q lcl|NC_015266. 78 TKPVTVVVRVAEGKDEAET--------------------------------------------------------TANV- 100 (390) Q Consensus 78 ~~~~~~vv~v~~~~~~~~~--------------------------------------------------------~~~~- 100 (390) +|..++++++......... .... 
T Consensus 74 gg~~~~vvRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~gn~~~v~i~~~~~~~~ 153 (664) T protein:vir:98 74 YGNDLRLVRVVDKDAAKNASAIFNQIKTTIASQGSNYNVGDVIKVKYSNTVVEENGKVTQVDSNGKILAVTIPKRKKSLL 153 (664) T ss_pred cCCeEEEEEecCccccccccccccccceeeccCCcccccccceeEeecCcccccceeeccCCCCCceeeEeeccCcccee Confidence 9999999997421100000 0000 Q ss_pred -------h-------------------------h-----h--------------------------h------------- Q lcl|NC_015266. 101 -------I-------------------------G-----T--------------------------V------------- 104 (390) Q Consensus 101 -------~-------------------------~-----~--------------------------~------------- 104 (390) . . + . T Consensus 154 ~~~~~~~~~~~~~~~~~s~~~~s~g~a~a~~v~~v~~d~~~~~~~~~~a~~~i~~~~~~~~~~~~~~~~~~a~~~G~~Gn 233 (664) T protein:vir:98 154 VLNRSVLTQIFLLVGTTEIVSQSSGVSASITIDGIESDSGITLLNLDIAKETIQGTSFQTLTQKYQIPSVVALYPGELGS 233 (664) T ss_pred ecccccccccceecccceeeeeecccceeeecccccccceeeccccceeeeccccccceeeeeccccceeeeeecccccc Confidence 0 0 0 0 Q ss_pred ------------------------------------------c------------------------------------- Q lcl|NC_015266. 105 ------------------------------------------T------------------------------------- 105 (390) Q Consensus 105 ------------------------------------------~------------------------------------- 105 (390) + T Consensus 234 ~isv~i~s~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~ 313 (664) T protein:vir:98 234 TVQVEIISKAAYDTGAMISGYPSGISVKNSGRSVMTYGPQTDNQYAFVVRRGGIVQESFIVSTDKTDKDIYGVNIYMDDF 313 (664) T ss_pred eeeeeecccccccCcceEeeccCceecccceeeeeeccccCccceeEEEecCCceeeeEEeecccCcccceeeeeechhh Confidence 0 Q ss_pred -------------------------------------cchhhhhhhhhhhhhhhhhhhhhhhhhhhhcc------hHHHH Q lcl|NC_015266. 106 -------------------------------------PDGKYTGMKALLAAQGKLAVKPRILVAPGLDT------QPVAA 142 (390) Q Consensus 106 -------------------------------------~~~~~tgl~~~~~~~~~~~~~p~~~~apg~~~------~~v~~ 142 (390) ..+.++|++++.+ .....|+++.+||+++ ++|+. 
T Consensus 314 ~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~~~~~~g~~~~~tgl~~l~~---~~~~~~~ll~~p~~~~~~~~~~~~v~~ 390 (664) T protein:vir:98 314 FANGGSQYVFGTSMNWPKGFSGILEFGGGLSSNDTVGADELMTGWDMFAD---REALHVPLLIAGGCAGESVEIASTVQK 390 (664) T ss_pred eecccceeeeeecccCCcccceeEeccCccccccccCchhHHHHHHhhhc---ccccccceEEecCCCCCcHHHHHHHHH Confidence 0000011111111 1123467777888764 35889 Q ss_pred HHHHhhhhcceE-Eeecc--------cccCchHHHHHHhhh--------------hccceEEEEeeeeEEEeeccCceeE Q lcl|NC_015266. 143 AFATIAQSLRAM-VYVAA--------HGCKTKEEAVAYRKQ--------------FGQREIMVIWPDWLGWDDITNSTVA 199 (390) Q Consensus 143 al~~~~~~~~~~-~~~d~--------~~~~~~~~a~~~~~~--------------~~~~~~~~~~p~~~~~~~~~~~~~~ 199 (390) +|.++|++++.+ .++|. ++..+.+++.+|++. ++|+++++||||++++|+.++..++ T Consensus 391 al~~~a~~~~~~~a~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~l~~p~~~~~d~~~~~~~~ 470 (664) T protein:vir:98 391 HVISIGDERQDCTVFVSPPRSLLVNIPLATAVDNIVEWRTGYKISGGTPVDNNLNVSSSYGFLDGNYKYQYDKYNDVNRW 470 (664) T ss_pred HHHHHHHhcCCeEEEEccccceeccCCccccHHHHHHHhhhccccccchhhhhcCCccceEEEEcCeEEEecccCCceEE Confidence 999999999854 34443 346678889998863 6789999999999999999999999 Q ss_pred ecHHHHHHHHHhhhhhccceeecccCceeecccccccccchhhhccccccccccccceeEEEc---CCCEEEEccccCCC Q lcl|NC_015266. 200 IPAPAIAAGLRAKIDNDIGWHKTLSNVVVNGVTGISADVSWDLQDPATDAGYLNENQVTTLVN---RNGFRFWGSRTCDA 276 (390) Q Consensus 200 ~p~s~~vAg~~a~~d~~~g~~~span~~l~gv~~~~~~~~~~~~~~~~~~~~l~~~gI~~~~~---~~G~~~wG~rT~~~ 276 (390) +|||+++||++||+|.++|||+||+|+++.++.+... + .......+.+.||.+|||+++. 
++|+++||+||+++ T Consensus 471 ~p~sg~~AGl~A~~D~~~g~~~span~~~~~i~g~~~-~--~~~~~~~~~~~Ln~~gIn~i~~~~~~~G~~~wG~rT~~~ 547 (664) T protein:vir:98 471 VPLAGDIAGLCVYTDSVANPWMSPAGYNRGQIRNCIK-L--AIEPRTAHRDAMYQVQINPVTGFAGGSGFVLYGDKTLTS 547 (664) T ss_pred echHHHHHHHHHHhhhcCCcEECcCCceeeeeecccc-c--eeecChhhHHHHHhCCCeEEEEeeCCCcEEEEcccccCC Confidence 9999999999999999999999999998776665432 2 2223345778889999998864 36999999999986 Q ss_pred C-cccceeeehhhHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEecCCCCHHHHhCCeE Q lcl|NC_015266. 277 D-GKFFFENYTRSAQVIADTIAEEQMGVVDGPLNPSRARDIIENINAWFRREVSVGELIGGGAWYDPEPNTTDELTSGGT 355 (390) Q Consensus 277 d-~~~~~i~vrR~~~~i~~~l~~~~~~~v~e~n~~~~~~~i~~~i~~~L~~l~~~g~l~g~~v~~d~~~nt~~~i~~G~~ 355 (390) + ++|+||++||||+||+++|++.++|+|||||++.+|++|+++|+.||++||++|+|+||+|+||+++||+++|++|+| T Consensus 548 ~~s~~~~i~vrR~~~~i~~si~~~~~~~v~epn~~~l~~~i~~~i~~~L~~l~~~gal~g~~V~~d~~~nt~~~i~~G~~ 627 (664) T protein:vir:98 548 VPSPFDRINVRRLFNMIKKDIGDNAKYKLFENNDDFTRASFRMDTGQYMTNIRALGGCYDYRVICDTTNNTPDVIDRNEF 627 (664) T ss_pred CCcccceEeehhHHHHHHHHHHHHHHHhhcCCCCHHHHHHHHHHHHHHHHHHHhcCceeeeEEEEcCCCCCHHHhhCCeE Confidence 5 589999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEEEEecccceEEEEEEEEcchHHHHHHHHhcC Q lcl|NC_015266. 356 WIDYDYTPVPPLENLKLRQRITDRYLADFASRVSA 390 (390) Q Consensus 356 ~~~i~~~p~~p~e~i~~~~~~~~~~~~~l~~~~~a 390 (390) +++|+++|++|+|||+|++.......+ |+++.. 
T Consensus 628 ~~~i~~~p~~pae~I~~~~~q~~~~~~--~~e~~~ 660 (664) T protein:vir:98 628 VATVYVKPPRSINYITLNFVATSTGAD--FDELVG 660 (664) T ss_pred EEEEEEEecCCcceEEEEEEEeecCcc--hhHhcc Confidence 999999999999999999998777644 555554 No 19 >protein:vir:6894 Length: 660 # NCBI annotation: gp18 tail sheath protein # Family: family:all:661 # MgeID: mge:140 # MgeName: RB69 # Cross-refs: genbank:acc:NP_861870;genbank:gi:32453661;genbank:GeneID:1494296 Probab=100.00 E-value=1.6e-85 Score=485.53 Aligned_cols=380 Identities=14% Similarity=0.100 Sum_probs=297.9 Q ss_pred CCccCCCEEEEECCCCCccccccccccceeeeccccccccceeccceEEEechhHHHHhhc---cccchHHHHHhhhccc Q lcl|NC_015266. 2 PQDYHHGVRVIEINEGGRPIRTVSTAVLGIVCTGADADPATFPLDTPVLLTNVIAALGKAG---TKGTLRRTLDAIGKQT 78 (390) Q Consensus 2 a~~~~hGV~v~ev~~~~~~i~~v~tavig~vgta~~~~~~~~~~~~~~~i~~~~~~~~~~~---~~gtl~~al~~~~~~~ 78 (390) -+++.|||||+|+ ++++++..+.|++.+|+|.++.+ |+++|++++++.++...|| +...+.+++..+|.++ T Consensus 1 ~~~~~PgVyv~e~-~~~~~i~~v~ts~~~fvG~~~~G-----p~~~p~~i~s~~~~~~~fG~~~~~~~~~~~~~~~f~~~ 74 (660) T protein:vir:68 1 MALLSPGVELKET-TVQSTVVNNSTGTAALAGKFQWG-----PAFQIKQITDEVALVDMFGTPNTDTADYFMSAMNFLQY 74 (660) T ss_pred CccccCceEEEEe-cCCcccccCCCcceeEEecccCC-----CCccCEEecCHHHHHHhcCCccCccchhHHHHHHHHhC Confidence 4456799999999 69999999999999999998765 7799999999999988877 4456888999999999 Q ss_pred CceEEEEeecccccccc----------------------------------------cc--------------------- Q lcl|NC_015266. 79 KPVTVVVRVAEGKDEAE----------------------------------------TT--------------------- 97 (390) Q Consensus 79 ~~~~~vv~v~~~~~~~~----------------------------------------~~--------------------- 97 (390) |..++++++......+. +. 
T Consensus 75 g~~~~vvRv~~~~~~~~~~~~~~~~~~t~~~~g~~~~~g~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~ta~~~~~a 154 (660) T protein:vir:68 75 GNDLRVVRAVDRDTAKNSSPVAGNINFTISSAGTNYRVGDKVVVKYSTDIIEPDGEVTSVDSDGKILNIFIPSGKIIAKA 154 (660) T ss_pred CCeEEEEEecccccccccccccccceeeeeccCcceeeeeeeeeecccccccccccceeeeecCceeeeeeccccccccc Confidence 99999998743211000 00 Q ss_pred --------------ccch-------------hhhccch-------------------------h-------hhh-----h Q lcl|NC_015266. 98 --------------ANVI-------------GTVTPDG-------------------------K-------YTG-----M 113 (390) Q Consensus 98 --------------~~~~-------------~~~~~~~-------------------------~-------~tg-----l 113 (390) ..+. +.....+ . ..| + T Consensus 155 ~~~~~~~~~~~~~~~~v~~~~~~~~~~~~v~~~~~d~~~~~~~~~ta~~~~~~~~~~~~~~~~~~~~~~A~~~g~~G~~i 234 (660) T protein:vir:68 155 KEIGEYPELGSNWTAEMSGSSSGLSAVITIDSVVMDSGILLTEVETSEEAITSLTFQESIKKYGVPGVVALYPGELGDQL 234 (660) T ss_pred eeeccccccccceeEEeecccccceeeeeeccccccccceeeeeccccccccccceeeeecccCccccccccccccccce Confidence 0000 0000000 0 000 0 Q ss_pred h--------------------------hhh------------hh--------------hh-------------------h Q lcl|NC_015266. 114 K--------------------------ALL------------AA--------------QG-------------------K 122 (390) Q Consensus 114 ~--------------------------~~~------------~~--------------~~-------------------~ 122 (390) . ... .. +. . T Consensus 235 ~v~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 314 (660) T protein:vir:68 235 EIEIVSKADYDKGASAQLKIYPDGGTRYSTAKAIFGYGPQTDDQYAIIVRRNDSVVQSVVLSTKRGERDIYGSNIFIDDF 314 (660) T ss_pred EEEEeccccccccccccceeeecccccccceeeEeecccccccceeeeeecCCcceeeeeeecccccccccccceeeehh Confidence 0 000 00 00 0 Q ss_pred --hh---h----------------------------------------------hhhhhhhhhhc------chHHHHHHH Q lcl|NC_015266. 123 --LA---V----------------------------------------------KPRILVAPGLD------TQPVAAAFA 145 (390) Q Consensus 123 --~~---~----------------------------------------------~p~~~~apg~~------~~~v~~al~ 145 (390) .+ . .+.++..++.. 
..+++.+|. T Consensus 315 ~~~~~~~~v~~~~~~~~~~~~~~~~~~gg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~v~~~l~ 394 (660) T protein:vir:68 315 FAKGASNYIFATAQGWPKGFSGVIKLNGGLSSNETVEAGDLMEAWDLFADRESVNAQLFIAGSCAGESLEVASTVQKHVV 394 (660) T ss_pred hccCcccEEEEeecCCCccccceeeeccccccccccccchhhhHHHHhhhhhccccceeeccccCCCchHHHHHHHHHHH Confidence 00 0 00000000000 135788999 Q ss_pred HhhhhcceEE-e--------ecccccCchHHHHHHhhh----------hccceEEEEeeeeEEEeeccCceeEecHHHHH Q lcl|NC_015266. 146 TIAQSLRAMV-Y--------VAAHGCKTKEEAVAYRKQ----------FGQREIMVIWPDWLGWDDITNSTVAIPAPAIA 206 (390) Q Consensus 146 ~~~~~~~~~~-~--------~d~~~~~~~~~a~~~~~~----------~~~~~~~~~~p~~~~~~~~~~~~~~~p~s~~v 206 (390) .+|+++++++ + ++.+++.+.+++.+|++. ++|.|+++||||++++|+.++..+++|||+++ T Consensus 395 ~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~sg~~ 474 (660) T protein:vir:68 395 AIGDSRQDCLVLCSPPRAAVVGIPVNRAVDNLVDWRTASGTYTDNNFNISSTYAAIDGNYKYQYDKYNDVNRWVPLAADI 474 (660) T ss_pred HHHHhhCCeEEEEcccceeEecCCCCCCHHHHHHHHhhcccccccccccCcceEEEEcCceEEecccCCceEEechhHHH Confidence 9999987543 3 344556788999999974 67899999999999999999999999999999 Q ss_pred HHHHhhhhhccceeecccCceeecccccccccchhhhccccccccccccceeEEEc--CCCEEEEccccCCCCc-cccee Q lcl|NC_015266. 207 AGLRAKIDNDIGWHKTLSNVVVNGVTGISADVSWDLQDPATDAGYLNENQVTTLVN--RNGFRFWGSRTCDADG-KFFFE 283 (390) Q Consensus 207 Ag~~a~~d~~~g~~~span~~l~gv~~~~~~~~~~~~~~~~~~~~l~~~gI~~~~~--~~G~~~wG~rT~~~d~-~~~~i 283 (390) ||+|||+|.++|||+||+|+++.++.++. 
.........|.+.||.+|||++++ ++|+++||+||+++|+ +|+|| T Consensus 475 AGl~Ar~d~~~g~~~span~~~~~i~g~~---~~~~~~~~~~~~~Ln~~gIn~i~~~~~~G~~~wG~rT~~~~~s~~~~i 551 (660) T protein:vir:68 475 AGLCARTDNISQPWMSPAGYNRGQILNVI---KLAIETRQAQRDRLYQEAINPVTGTGGDGYVLYGDKTATSVPSPFDRI 551 (660) T ss_pred HHHHHHHhccCCcEEccCCeeeceeeccc---eeeecCChhHHHHHhhCCceEEEEecCCeEEEEcceecCCCCcccceE Confidence 99999999999999999999877766542 233334566788999999999975 4689999999998775 89999 Q ss_pred eehhhHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEecCCCCHHHHhCCeEEEEEEEEe Q lcl|NC_015266. 284 NYTRSAQVIADTIAEEQMGVVDGPLNPSRARDIIENINAWFRREVSVGELIGGGAWYDPEPNTTDELTSGGTWIDYDYTP 363 (390) Q Consensus 284 ~vrR~~~~i~~~l~~~~~~~v~e~n~~~~~~~i~~~i~~~L~~l~~~g~l~g~~v~~d~~~nt~~~i~~G~~~~~i~~~p 363 (390) +|||||+||+++|++.++|++||||++.+|.+|+++|++||++||++|+|+||+|+||+++||+++|++|+|+++|+++| T Consensus 552 ~vrR~~~~i~~si~~~~~~~v~epn~~~~~~~i~~~i~~~L~~l~~~gal~gf~V~~d~~~nt~~~i~~G~~~~~i~~~p 631 (660) T protein:vir:68 552 NVRRLFNMVKTNIGSASKYRLFELNNAFTRSSFRTETSQYLQGIKALGGVYNFKVVCDTTNNTPAVIDRNEFVATFYLQP 631 (660) T ss_pred ehhhHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhcCceeeeEEEEecCCCCHHHhhCCeEEEEEEEEe Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccceEEEEEEEEc--chHHHHHHHHhcC Q lcl|NC_015266. 364 VPPLENLKLRQRIT--DRYLADFASRVSA 390 (390) Q Consensus 364 ~~p~e~i~~~~~~~--~~~~~~l~~~~~a 390 (390) ++|+|||+|++... 
..+|+|++++|.| T Consensus 632 ~~pae~i~l~~~~~~~~~~~~e~~~~v~~ 660 (660) T protein:vir:68 632 ARSINYITLNFVATATGADFDELIGAVGG 660 (660) T ss_pred cCCcceEEEEEEEeecCccHHHHHHhhcC Confidence 99999999998876 4599999999999 No 20 >protein:vir:106984 Length: 743 # NCBI annotation: contractile sheath protein gp18 # Family: family:all:661 # MgeID: mge:1459 # MgeName: S-PM2 # Cross-refs: genbank:acc:YP_195136;genbank:gi:58532913;uniprot:Q5GQN6;genbank:GeneID:3260481 Probab=100.00 E-value=2e-85 Score=484.93 Aligned_cols=379 Identities=16% Similarity=0.128 Sum_probs=305.2 Q ss_pred CCCccCCCEEEEECCCCCccccccccccceeeeccccccccceeccceEEEechhHHHHhhc---cccchHHHHHhhhcc Q lcl|NC_015266. 1 MPQDYHHGVRVIEINEGGRPIRTVSTAVLGIVCTGADADPATFPLDTPVLLTNVIAALGKAG---TKGTLRRTLDAIGKQ 77 (390) Q Consensus 1 Ma~~~~hGV~v~ev~~~~~~i~~v~tavig~vgta~~~~~~~~~~~~~~~i~~~~~~~~~~~---~~gtl~~al~~~~~~ 77 (390) ||++++|||||+|++.+++++..+.|++.+|+|.++.+ |.++|++++++.++...|| +..++.+++.++|.+ T Consensus 1 m~~~~~Pgvyv~e~~~~~~~i~~~~t~~~~~vg~~~~G-----p~~~p~~i~s~~~~~~~fG~~~~~~~~~~~v~~~f~n 75 (743) T protein:vir:10 1 MASQVSPGILIKERDLTNAVVTGALQIRAAHASTFAKG-----PIGDIVNINTQKELVSVFGEPKEDNAEDWMVASEFLN 75 (743) T ss_pred CccccCCceEEEEecCCCceeccCCcceeEEEEeccCC-----CCCcCEEecCHHHHHHHcCCccCCcchHHHHHHHHHh Confidence 99999999999999999999999999999999998765 6789999999999888776 346789999999999 Q ss_pred cCceEEEEeeccccccccc------------------------------------------------------------- Q lcl|NC_015266. 
78 TKPVTVVVRVAEGKDEAET------------------------------------------------------------- 96 (390) Q Consensus 78 ~~~~~~vv~v~~~~~~~~~------------------------------------------------------------- 96 (390) ++..|+++++........+ T Consensus 76 gg~~~~vvrv~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~G~~gN~i~V~v~~~~~d~~~~~~~~~~~~~~~ 155 (743) T protein:vir:10 76 YGGRLAVVRAETTGVLNATTGSAGVLVKNRESWDAGSGNGEVFVARTAGSWGNSLMGVLVDRGADYIVTFAATPTDTAVG 155 (743) T ss_pred CCceEEEEEccCccccccccccccccccccccccccccceeEEEEeeccccccceEEEEecCCCcceeeeeccccccccc Confidence 9999999998532100000 Q ss_pred -----------------------------------ccc--------c------------------------hhhhcc--- Q lcl|NC_015266. 97 -----------------------------------TAN--------V------------------------IGTVTP--- 106 (390) Q Consensus 97 -----------------------------------~~~--------~------------------------~~~~~~--- 106 (390) ... . .++... T Consensus 156 ~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 235 (743) T protein:vir:10 156 TQLLFSYSGTLVTGEILSYDSATNTATITASGTLTSQYLLDTPEQGLIGSFTDNSTTEVGRTPGTYSNVPASGGTGTGAT 235 (743) T ss_pred eeeeecccccccccceeeeeecCcceeeeeccccceeeecccccccccccccccccccccccccceeeEEeccccccccc Confidence 000 0 000000 Q ss_pred ---------------------------------------chh-------------------------------hhhh--- Q lcl|NC_015266. 107 ---------------------------------------DGK-------------------------------YTGM--- 113 (390) Q Consensus 107 ---------------------------------------~~~-------------------------------~tgl--- 113 (390) +.. .++. T Consensus 236 ~tv~v~~~~~~vg~~v~~~~~~~~~~~~~~~~~~v~~~~~~~~t~~~~~~~~~~~g~~~~~a~~~~~~~~~~~~~~~~~~ 315 (743) T protein:vir:10 236 FNVVVADAGGGVGGSVVVTLANPGTGYNQGETLTIASAATGDGTDILVTVATLSDGTIAITELKDWYLNTEIGSTGIKLG 315 (743) T ss_pred ccccccccccccccccccccccccceeeeccccccccccccccccchhheecccccceeeeecccccccchhhccccccc Confidence 000 0000 Q ss_pred -----------------------------------------h---hhh-------------------------------- Q lcl|NC_015266. 
114 -----------------------------------------K---ALL-------------------------------- 117 (390) Q Consensus 114 -----------------------------------------~---~~~-------------------------------- 117 (390) + .+. T Consensus 316 ~~~~~~~t~~~~~~~~~~~d~~~v~v~~~~~~~~~~~~~v~~~~~~~s~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~~~ 395 (743) T protein:vir:10 316 DIGPRPGTSQFATDNGITDDQVHFAVIDTTGELTGTANTIVERLTYLSKLSDARSEENANIYYKNVINEQSAYLYHGNDA 395 (743) T ss_pred cccccceeeeccccccccccceEEEEecCcceeeeccCceeEEEeeeecccccccccCcceeecceeccccceeeccCcc Confidence 0 000 Q ss_pred ------------------------------------------------------hhhh-hhhhhhhhhhhhhhc-----c Q lcl|NC_015266. 118 ------------------------------------------------------AAQG-KLAVKPRILVAPGLD-----T 137 (390) Q Consensus 118 ------------------------------------------------------~~~~-~~~~~p~~~~apg~~-----~ 137 (390) .... .....+.++.+||+. . T Consensus 396 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gG~d~~~~~~~~~~~~~~~~~~~~~~~~~ll~~p~~~~~~~~~ 475 (743) T protein:vir:10 396 AVQIAASGEAWGQSSDQVLADAGTAFSRTTGYWVNLAGGNDDFAYDAGEFGAAMDLFLDTEETEIDFVLMGGSMADEADT 475 (743) T ss_pred cceeeeccccCccccceeeeecccccccccceEEEeecCccccccchhHHHHHHHHhhhccccCcceEEecCcccCccch Confidence 0000 000012455667653 3 Q ss_pred hHHHHHHHHhhhhcc-eEEeecccccC--------------chHHHHHHhh-hhccceEEEEeeeeEEEeeccCceeEec Q lcl|NC_015266. 138 QPVAAAFATIAQSLR-AMVYVAAHGCK--------------TKEEAVAYRK-QFGQREIMVIWPDWLGWDDITNSTVAIP 201 (390) Q Consensus 138 ~~v~~al~~~~~~~~-~~~~~d~~~~~--------------~~~~a~~~~~-~~~~~~~~~~~p~~~~~~~~~~~~~~~p 201 (390) .+++.++..+|++++ ++.++|.|++. 
+.+++..+++ .++++++++||||++++|+.++..+++| T Consensus 476 ~~v~~a~~~~~~~~~~~~a~~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p 555 (743) T protein:vir:10 476 KSKATKVIAIAASRKDALAFVSPHKGNQIASTGNVALSSAQQKENTIAFFSDLTSTSYAVFDSGYKYVYDRFTDKYRYIP 555 (743) T ss_pred HHHHHHHHHHHHhhCCeEEEEecCCCccccccccccccccccchHHHHHHHhccCCeeEEEEccceeeeccccCceeEec Confidence 568999999999987 67788877542 2345555554 5678899999999999999999999999 Q ss_pred HHHHHHHHHhhhhhccceeecccCceeecccccccccchhhhccccccccccccceeEEEc--CCCEEEEccccC-CCCc Q lcl|NC_015266. 202 APAIAAGLRAKIDNDIGWHKTLSNVVVNGVTGISADVSWDLQDPATDAGYLNENQVTTLVN--RNGFRFWGSRTC-DADG 278 (390) Q Consensus 202 ~s~~vAg~~a~~d~~~g~~~span~~l~gv~~~~~~~~~~~~~~~~~~~~l~~~gI~~~~~--~~G~~~wG~rT~-~~d~ 278 (390) ||+++||++|++|.++|||+||+|+++.|+.++.. ....+...|++.||.+|||++++ ++|+++||+||+ +.|+ T Consensus 556 ~s~~~AGl~a~~D~~~g~~~span~~~~gi~g~~~---~~~~~~~~~~~~Ln~~gIn~i~~~~~~G~~~wG~rT~~s~d~ 632 (743) T protein:vir:10 556 CNGDVAGLCVQTSNQLDDWYSPAGLNRGGILNAVK---LAYNPNKADRDELYQNRINPVVSLRGQGITLFGDKTALAAPS 632 (743) T ss_pred hhHHHHHHHHHhhccCCcEEccCCeeeeeeecccc---ceecCChhHHHhHhhCCceEEEEecCCeEEEEcccccCCCCc Confidence 99999999999999999999999999888877532 23344566889999999999986 468999999998 4689 Q ss_pred ccceeeehhhHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEecCCCCHHHHhCCeEEEE Q lcl|NC_015266. 
279 KFFFENYTRSAQVIADTIAEEQMGVVDGPLNPSRARDIIENINAWFRREVSVGELIGGGAWYDPEPNTTDELTSGGTWID 358 (390) Q Consensus 279 ~~~~i~vrR~~~~i~~~l~~~~~~~v~e~n~~~~~~~i~~~i~~~L~~l~~~g~l~g~~v~~d~~~nt~~~i~~G~~~~~ 358 (390) +|+||++||||+||+++|++.++|+|||||++.+|++|+++|++||++||++|+|.||+|+||+++||+++|++|+|+++ T Consensus 633 ~~~~i~vrR~~~~i~~si~~~~~~~v~e~n~~~~~~~i~~~i~~fL~~l~~~gal~~~~V~~d~~~nt~~~i~~G~~~~~ 712 (743) T protein:vir:10 633 AFDRINVRRLFLNLEKRARRLAEGVLFEQNDATTRAGFSSALNSYLSEVQARRGVTDYLVICDESNNTPDIIDRNEFVAE 712 (743) T ss_pred ccceEeehhhHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhcCceeeeEEEEcCCCCCHHHhhCCeEEEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEecccceEEEEEEEE--cchHHHHHHHH Q lcl|NC_015266. 359 YDYTPVPPLENLKLRQRI--TDRYLADFASR 387 (390) Q Consensus 359 i~~~p~~p~e~i~~~~~~--~~~~~~~l~~~ 387 (390) |+++|++|+|||+|++.. +..+|+|++++ T Consensus 713 i~~~p~~pae~I~~~~~~~~~~~~~~e~~~~ 743 (743) T protein:vir:10 713 VYVKPTRSINFITITFTATKTGVTFSEVVGR 743 (743) T ss_pred EEEEecCCcceEEEEEEEeecCcchHhhhcC Confidence 999999999999999884 66689999999 No 21 >protein:vir:103456 Length: 659 # NCBI annotation: tail sheath monomer # Family: family:all:661 # MgeID: mge:1542 # MgeName: RB32 # Cross-refs: genbank:acc:YP_803108;genbank:gi:116326388;genbank:GeneID:4405485 Probab=100.00 E-value=9e-85 Score=481.35 Aligned_cols=378 Identities=12% Similarity=0.087 Sum_probs=298.2 Q ss_pred CCccCCCEEEEECCCCCccccccccccceeeeccccccccceeccceEEEechhHHHHhhc---cccchHHHHHhhhccc Q lcl|NC_015266. 2 PQDYHHGVRVIEINEGGRPIRTVSTAVLGIVCTGADADPATFPLDTPVLLTNVIAALGKAG---TKGTLRRTLDAIGKQT 78 (390) Q Consensus 2 a~~~~hGV~v~ev~~~~~~i~~v~tavig~vgta~~~~~~~~~~~~~~~i~~~~~~~~~~~---~~gtl~~al~~~~~~~ 78 (390) -++..|||||+|++.+.+++.. 
.|++.+|+|.++.+ |+++|++++++.++...|| ....+.+++..+|.|+ T Consensus 1 ~~~~~PgVyv~e~~~~~~~~~~-~ts~~~fvG~~~~G-----p~~~p~~i~s~~d~~~~fG~~~~~~~~~~~v~~~f~ng 74 (659) T protein:vir:10 1 MTLLSPGIELKETTVQSTVVNN-STGTAALAGKFQWG-----PAFQIKQVTNEVDLVNTFGQPTAETADYFMSAMNFLQY 74 (659) T ss_pred CceecCceEEEEecCCceeccc-CccceEEEecccCC-----CCCccEEecCHHHHHHHcCCcCCCcchhHHHHHHHhhC Confidence 3455799999999999988875 79999999998755 6789999999999887776 4567899999999999 Q ss_pred CceEEEEeeccccccc----------------------------------------ccc--------------------- Q lcl|NC_015266. 79 KPVTVVVRVAEGKDEA----------------------------------------ETT--------------------- 97 (390) Q Consensus 79 ~~~~~vv~v~~~~~~~----------------------------------------~~~--------------------- 97 (390) +..++++++....... ... T Consensus 75 g~~~~vvRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~v~~vd~~~~~~~~~i~~~~~~~~~ 154 (659) T protein:vir:10 75 GNDLRVVRAVDRDTAKNSSPIAGNIEYTISTPGSNYAVGDKITVKYVSDAIETEGKITEVDTDGKIKKINIPTAKIIAKA 154 (659) T ss_pred CCeEEEEEccCcccccccccccccceeeEeecccccccccceeeeecCCCccccceeeEEecccccceeeeccccccccc Confidence 9999999863211000 000 Q ss_pred ------ccch-----------h---------hh------------------------------cc---ch---------- Q lcl|NC_015266. 98 ------ANVI-----------G---------TV------------------------------TP---DG---------- 108 (390) Q Consensus 98 ------~~~~-----------~---------~~------------------------------~~---~~---------- 108 (390) .... . .. +. .. T Consensus 155 ~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~a~t~~~~~~~~~~~~~~~v~a~~~G~~g~~~ 234 (659) T protein:vir:10 155 KEVGEYPTLGSNWTAEISSSSSGLAAVITLGKIITDSGILLAEIENAEAAMTAVDFQANLKKYGIPGVVALYPGELGDKI 234 (659) T ss_pred ccccccceeeeeeeeeeeeeccccceeeEEeeeecCCceeEEeeccccccccccccccceeecccccccccccceecccc Confidence 0000 0 00 00 00 Q ss_pred --------h-----------------------------------------hhh-----------------------hhhh Q lcl|NC_015266. 
109 --------K-----------------------------------------YTG-----------------------MKAL 116 (390) Q Consensus 109 --------~-----------------------------------------~tg-----------------------l~~~ 116 (390) . ..+ +... T Consensus 235 tv~~~~~a~~~~~~~v~v~~~~~~~~~a~~~t~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 314 (659) T protein:vir:10 235 EIEIVSKADYAKGASALLPIYPGGGTRASTAKAVFGYGPQTDSQYAIIVRRNDAIVQSVVLSTKRGEKDIYDSNIYIDDF 314 (659) T ss_pred eEEEechhhccccceeeeeeeeecccccccceeeeeeccccccchhhccccccceeeeeeeeccccccccccchhhhhhh Confidence 0 000 0000 Q ss_pred -----------------------------------------hhh----hhhhhhhhhhhhhhhhcc------hHHHHHHH Q lcl|NC_015266. 117 -----------------------------------------LAA----QGKLAVKPRILVAPGLDT------QPVAAAFA 145 (390) Q Consensus 117 -----------------------------------------~~~----~~~~~~~p~~~~apg~~~------~~v~~al~ 145 (390) ... .......++++.+||++. ++|+.+|. T Consensus 315 ~~~~~~~~v~~~~~~~~~~~~~~~~l~gg~~~~~~~~~~~~~~~~~~~~~~~~~~~~il~~p~~~~~~~~~~~~v~~al~ 394 (659) T protein:vir:10 315 FAKGGSEYIFATAQNWPEGFSGILTLSGGLSSNAEVTAGDLMEAWDFFADRESVDVQLFIAGSCAGESLETASTVQKHVV 394 (659) T ss_pred hccCcccEEEEeecccCCCccceeeecccccccccccchhHHHHHHHhhhccccceeEEEecCCCCcchhhhHHHHHHHH Confidence 000 000001345566677643 46889999 Q ss_pred HhhhhcceEE-eeccc--------ccCchHHHHHHhhh----------hccceEEEEeeeeEEEeeccCceeEecHHHHH Q lcl|NC_015266. 146 TIAQSLRAMV-YVAAH--------GCKTKEEAVAYRKQ----------FGQREIMVIWPDWLGWDDITNSTVAIPAPAIA 206 (390) Q Consensus 146 ~~~~~~~~~~-~~d~~--------~~~~~~~a~~~~~~----------~~~~~~~~~~p~~~~~~~~~~~~~~~p~s~~v 206 (390) .+|+++++++ ++|.+ ...+.+++.+||+. 
++|+++++||||++++|+.++..+++|||+++ T Consensus 395 ~~~~~~~~~~~~~d~p~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~s~~~~l~~p~~~~~d~~~~~~~~~p~sg~~ 474 (659) T protein:vir:10 395 SIGDARQDCLVLCSPPRETVVGIPVTRAVDNLVNWRTAAGSYTDNNFNISSTYAAIDGNYKYQYDKYNDVNRWVPLAADI 474 (659) T ss_pred HHHHhhCCeEEEEcCccccccCCCcccCHHHHHHHHHhcccccccccccCcceEEEEeCcEEEecccCCceEEechHHHH Confidence 9999998554 44433 34677899999975 67999999999999999999999999999999 Q ss_pred HHHHhhhhhccceeecccCceeecccccccccchhhhccccccccccccceeEEEcC--CCEEEEccccCCCC-ccccee Q lcl|NC_015266. 207 AGLRAKIDNDIGWHKTLSNVVVNGVTGISADVSWDLQDPATDAGYLNENQVTTLVNR--NGFRFWGSRTCDAD-GKFFFE 283 (390) Q Consensus 207 Ag~~a~~d~~~g~~~span~~l~gv~~~~~~~~~~~~~~~~~~~~l~~~gI~~~~~~--~G~~~wG~rT~~~d-~~~~~i 283 (390) ||++||+|.++|||+||||+++.++.++.. ........|.+.||.+|||+++++ +|+++||+||++.+ ++|+|| T Consensus 475 AGl~Ar~D~~~g~~~span~~~~~i~g~~~---~~~~~~~~~~~~Ln~~gIn~i~~~~~~G~~~wG~rT~~~~~s~~~~i 551 (659) T protein:vir:10 475 AGLCARTDNVSQTWMSPAGYNRGQILNVIK---LAIETRQAQRDRLYQEAINPVTGTGGDGYVLYGDKTATSVPSPFDRI 551 (659) T ss_pred HHHHHHHhccCCceEccCCceeeeeecccc---ceecCCHhHHHHHhhCCeeEEEEeCCCeEEEEcccccCCCCcccceE Confidence 999999999999999999998766665532 122334557888999999999864 68999999999866 489999 Q ss_pred eehhhHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEecCCCCHHHHhCCeEEEEEEEEe Q lcl|NC_015266. 
284 NYTRSAQVIADTIAEEQMGVVDGPLNPSRARDIIENINAWFRREVSVGELIGGGAWYDPEPNTTDELTSGGTWIDYDYTP 363 (390) Q Consensus 284 ~vrR~~~~i~~~l~~~~~~~v~e~n~~~~~~~i~~~i~~~L~~l~~~g~l~g~~v~~d~~~nt~~~i~~G~~~~~i~~~p 363 (390) +|||+++||+++|++.++|++||||++.+|++|+++|+.||++||++|+|+||+|+||+++||+++|++|+|+++|+++| T Consensus 552 ~vrR~~~~i~~si~~~~~~~v~e~n~~~l~~~i~~~i~~fL~~l~~~gal~~~~V~~d~~~nt~~~i~~G~~~~~i~~~p 631 (659) T protein:vir:10 552 NVRRLFNMLKTNIGRSSKYRLFELNNAFTRSSFRTETAQYLQGIKALGGIYEYRVVCDTTNNTPSVIDRNEFVATFYIQP 631 (659) T ss_pred ehhhHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhcCceeeEEEEEcCCCCCHHHhhCCeEEEEEEEEe Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccceEEEEEEEEcchHHHHHHHHhcC Q lcl|NC_015266. 364 VPPLENLKLRQRITDRYLADFASRVSA 390 (390) Q Consensus 364 ~~p~e~i~~~~~~~~~~~~~l~~~~~a 390 (390) ++|+|||+|++.......+ |++|.+ T Consensus 632 ~~pae~i~~~~~~~~~~~~--~~e~~~ 656 (659) T protein:vir:10 632 ARSINYITLNFVATATGAD--FDELTG 656 (659) T ss_pred cCCcceEEEEEEEEecCcc--hHHhhc Confidence 9999999999999877666 667776 No 22 >protein:vir:108052 Length: 660 # NCBI annotation: gp18 tail sheath protein # Family: family:all:661 # MgeID: mge:2002 # MgeName: JS98 # Cross-refs: genbank:acc:YP_001595294;genbank:gi:161622600;genbank:GeneID:5783771 Probab=100.00 E-value=1.3e-84 Score=480.51 Aligned_cols=379 Identities=13% Similarity=0.097 Sum_probs=299.0 Q ss_pred CCccCCCEEEEECCCCCccccccccccceeeeccccccccceeccceEEEechhHHHHhhc---cccchHHHHHhhhccc Q lcl|NC_015266. 
2 PQDYHHGVRVIEINEGGRPIRTVSTAVLGIVCTGADADPATFPLDTPVLLTNVIAALGKAG---TKGTLRRTLDAIGKQT 78 (390) Q Consensus 2 a~~~~hGV~v~ev~~~~~~i~~v~tavig~vgta~~~~~~~~~~~~~~~i~~~~~~~~~~~---~~gtl~~al~~~~~~~ 78 (390) -++++|||||+|+ +++++|..+.|++.+|+|.++.+ |+++|++++++.++...|| +...+..++..+|.++ T Consensus 1 ~~~~~Pgvyv~e~-~~~~~i~~~~t~~~~~vg~~~~g-----p~~~p~~v~s~~~~~~~fg~~~~~~~~~~~~~~~f~~~ 74 (660) T protein:vir:10 1 MALLSPGIELKET-SVQSTVVRNATGRAALVGKFQWG-----PAFQVTQITNEVELVDLFGGPNNEVADYFMSGMNFLQY 74 (660) T ss_pred CceecCceEEEee-cCCccccCCCcccceEEeecCCC-----CCccCeEcCCHHHHHHHcCCcCCCchhHHHHHHHHHhC Confidence 4556799999999 58999999999999999998755 7889999999999888776 3467888999999999 Q ss_pred CceEEEEeeccccccccc----------------------------------------------ccc------------- Q lcl|NC_015266. 79 KPVTVVVRVAEGKDEAET----------------------------------------------TAN------------- 99 (390) Q Consensus 79 ~~~~~vv~v~~~~~~~~~----------------------------------------------~~~------------- 99 (390) +..|+++++......... ... T Consensus 75 g~~~~vvrv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~v~~~~~~~~~~~~v~~~~a~g~~~~~~~~ta~~~~~a 154 (660) T protein:vir:10 75 GNDLRTVRVVSREFAKNASPIAGNIETTITTAGSNYAVGDKINIKYNQTVVESEGRVTSVDTDGKILSVFIPSAKIIAYA 154 (660) T ss_pred CceEEEEEecccccccccccccccceeEEeeccccccccceeeEeeccccccccccceeeccccceeeeccccccccccc Confidence 999999987433210000 000 Q ss_pred ----------------chh---h----------hccch-------------------------------------hhh-- Q lcl|NC_015266. 100 ----------------VIG---T----------VTPDG-------------------------------------KYT-- 111 (390) Q Consensus 100 ----------------~~~---~----------~~~~~-------------------------------------~~t-- 111 (390) +.. 
+ +...+ ..+ T Consensus 155 ~~v~~~~~~~~~~~~~~~~~~~~~~~a~sv~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~g~~G~~i 234 (660) T protein:vir:10 155 RSLNQYPTLGPAWTAEVTSASSGVSGTITVGKIVTDSGILLTEAENSEEAITSLEFQAALKKFAMPGVVALYPGEIGSTL 234 (660) T ss_pred cccccccccccceeEEEecccCccccceeeeeeeccCcceEEeeeccccccccccceeeccccccceeeeecccccCcce Confidence 000 0 00000 000 Q ss_pred ----------------------------------------------------hh--h----------------------h Q lcl|NC_015266. 112 ----------------------------------------------------GM--K----------------------A 115 (390) Q Consensus 112 ----------------------------------------------------gl--~----------------------~ 115 (390) |. + . T Consensus 235 ~v~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~ 314 (660) T protein:vir:10 235 EVEIVSKAAYEAGSSKMLDVYPGGGTRASIAKAVFNYGPQTDDQYAIIVRRDGAIVESVVLSTKEGEKDVYGNNIYLDDY 314 (660) T ss_pred eEEEeeccccCCcceeEEeeeeccceeeEEeeeecccccccccccccccccCCcccceeeeeccccccccccceeeeehh Confidence 00 0 0 Q ss_pred hhh----------------------------------------hhhh----hhhhhhhhhhhhhc------chHHHHHHH Q lcl|NC_015266. 116 LLA----------------------------------------AQGK----LAVKPRILVAPGLD------TQPVAAAFA 145 (390) Q Consensus 116 ~~~----------------------------------------~~~~----~~~~p~~~~apg~~------~~~v~~al~ 145 (390) ... .... ....++++.+|++. .++|+++|. T Consensus 315 ~~~~~~~~v~~~~~~~~~~~~~~~~l~gg~~~~~~~t~~~~~~~~~~~~~~~~~~~~~l~~p~~~~~~~~~~~~v~~al~ 394 (660) T protein:vir:10 315 FAKGTSNYIYATSLNWPKGFSGIINLSGGISANDKVTAGDLMQGWDLFADREALHINLLIAGAVAGEGDEVASTVQKHVV 394 (660) T ss_pred hcCCCccEEEEEeccCCCCcccceeeeccccCccccccchhhhhhhhhhhhhhcccceEEEcCcCCCchhhhHHHHHHHH Confidence 000 0000 00113333444443 245889999 Q ss_pred Hhhhhcc-eEEeeccccc--------CchHHHHHHhhh----------hccceEEEEeeeeEEEeeccCceeEecHHHHH Q lcl|NC_015266. 
146 TIAQSLR-AMVYVAAHGC--------KTKEEAVAYRKQ----------FGQREIMVIWPDWLGWDDITNSTVAIPAPAIA 206 (390) Q Consensus 146 ~~~~~~~-~~~~~d~~~~--------~~~~~a~~~~~~----------~~~~~~~~~~p~~~~~~~~~~~~~~~p~s~~v 206 (390) ++|++++ ++.++|.|.. .+.+++.+||+. ++|.++++||||.+++|+.++..+++|||+++ T Consensus 395 ~~~~~~~~~~aiid~p~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~sg~~ 474 (660) T protein:vir:10 395 SIADERQDCLAFISPPKGLLVNVPLTRAVDNLIDWRTGAGTFDANNMNISTTYAAIDGNYKYQYDKYNDVNRWVPLAADL 474 (660) T ss_pred HHHHhhCCEEEEEecCcccccccccccCHHHHHHHHhhcccccccccccCcceEEEEcCceEEecccCCceeEechhHHH Confidence 9999987 6677787643 477899999874 66899999999999999999999999999999 Q ss_pred HHHHhhhhhccceeecccCceeecccccccccchhhhccccccccccccceeEEEc---CCCEEEEccccCCCCc-ccce Q lcl|NC_015266. 207 AGLRAKIDNDIGWHKTLSNVVVNGVTGISADVSWDLQDPATDAGYLNENQVTTLVN---RNGFRFWGSRTCDADG-KFFF 282 (390) Q Consensus 207 Ag~~a~~d~~~g~~~span~~l~gv~~~~~~~~~~~~~~~~~~~~l~~~gI~~~~~---~~G~~~wG~rT~~~d~-~~~~ 282 (390) ||++||+|.++|||+||||+++.++.+... ........|.+.||.+|||++++ ++||++||+||++.|+ +|+| T Consensus 475 AGl~Ar~D~~~g~~~sPan~~~~~i~g~~~---~~~~~~~~e~~~Ln~~gIn~i~~~~~~~G~~~wG~rT~~~~~s~~~~ 551 (660) T protein:vir:10 475 AGLCARTDDVSQPWMSPAGYNRGQILNVLK---LAIEPRQAQRDRMYQEAINPVVGFAGGDGFVLFGDKTATKVPSPMDH 551 (660) T ss_pred HHHHHHhhccCCcEEccCCeeeceeeccce---eeecCChhhHHhHhhCCceEEEEeeCCCcEEEEcccccCCCCcccce Confidence 999999999999999999998765554432 12234455778899999998865 3699999999998875 8999 Q ss_pred eeehhhHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEecCCCCHHHHhCCeEEEEEEEE Q lcl|NC_015266. 
283 ENYTRSAQVIADTIAEEQMGVVDGPLNPSRARDIIENINAWFRREVSVGELIGGGAWYDPEPNTTDELTSGGTWIDYDYT 362 (390) Q Consensus 283 i~vrR~~~~i~~~l~~~~~~~v~e~n~~~~~~~i~~~i~~~L~~l~~~g~l~g~~v~~d~~~nt~~~i~~G~~~~~i~~~ 362 (390) |||||||+||+++|++.++|+|||||++.+|.+|+++|+.||++||++|+|.||+|+||+++||+++|++|+|+++|+++ T Consensus 552 i~vrR~~~~i~~si~~~~~~~v~epn~~~l~~~i~~~i~~fL~~l~~~gal~g~~V~~d~~~nt~~di~~G~~~~~i~~~ 631 (660) T protein:vir:10 552 INVRRLFNMLKKNIGDASKYKLFELNDNFTRSSFRMEVSQYLDGIKALGGIYEGRVVCDTTVNTPAVIDRNEFIANIYVK 631 (660) T ss_pred EehhhHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEcCCCCCHHHhhCCeEEEEEEEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ecccceEEEEEEEEcch--HHHHHHHHhc Q lcl|NC_015266. 363 PVPPLENLKLRQRITDR--YLADFASRVS 389 (390) Q Consensus 363 p~~p~e~i~~~~~~~~~--~~~~l~~~~~ 389 (390) |++|+|||+|++..... .|+|+++++. T Consensus 632 P~~pae~I~~~~~~~~~~~~~~e~~~~~~ 660 (660) T protein:vir:10 632 PARSINYITLNFVATSTGADFDELIGPLV 660 (660) T ss_pred ecCCccEEEEEEEEeecCccHHHHhhhcC Confidence 99999999999888655 5888888888 No 23 >protein:vir:106427 Length: 679 # NCBI annotation: gp18 tail sheath protein # Family: family:all:661 # MgeID: mge:1474 # MgeName: Aeh1 # Cross-refs: genbank:acc:NP_944106;genbank:gi:38640150;genbank:GeneID:2658188 Probab=100.00 E-value=2.1e-84 Score=479.35 Aligned_cols=380 Identities=15% Similarity=0.118 Sum_probs=299.6 Q ss_pred CCccCCCEEEEECCCCCccccccccccceeeeccccccccceeccceEEEechhHHHHhhc---cccchHHHHHhhhccc Q lcl|NC_015266. 
2 PQDYHHGVRVIEINEGGRPIRTVSTAVLGIVCTGADADPATFPLDTPVLLTNVIAALGKAG---TKGTLRRTLDAIGKQT 78 (390) Q Consensus 2 a~~~~hGV~v~ev~~~~~~i~~v~tavig~vgta~~~~~~~~~~~~~~~i~~~~~~~~~~~---~~gtl~~al~~~~~~~ 78 (390) -+++.|||||+|+ +++++|..+.|++.+|+|.++.+ |+++|++++++.++...|| +..++.+++.++|.|+ T Consensus 1 ~~~~~Pgvyv~e~-~~~~~i~~~~t~~~~~vg~~~~g-----p~~~p~~i~s~~~~~~~fg~~~~~~~~~~~~~~~f~~g 74 (679) T protein:vir:10 1 MTLLSPGVETKEI-NLQTTIARSSTGRAALVGKFNWG-----PAYQISQVVSEVDLVDKFGRPDDQTADSFFSGVNFLNY 74 (679) T ss_pred CceecCceEEEee-cCCcccccCccccceeeecccCC-----CCccCEEecCHHHHHHHcCCcccccchHHHHHHHHHhC Confidence 4455799999999 59999999999999999998755 7899999999999877776 4567899999999999 Q ss_pred CceEEEEeecccccccc---------------------------------------ccc--------------------c Q lcl|NC_015266. 79 KPVTVVVRVAEGKDEAE---------------------------------------TTA--------------------N 99 (390) Q Consensus 79 ~~~~~vv~v~~~~~~~~---------------------------------------~~~--------------------~ 99 (390) |..|+++++........ +.. . T Consensus 75 g~~~~vvrv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~s~~~~~~~~~~~~~~~~~~~~v~~~~~~~~a~ 154 (679) T protein:vir:10 75 GNDLRLVRVLNETKSRNSSALYQSLSYTITSPGVDYKVGDVVNVLQGGNVIATGKVTVVNASGGIVAFYVPTAAIIDKAK 154 (679) T ss_pred CCeEEEEEccCcccccccccccccccccccccccccccccceeeeeCCCcccceeEEEeeccCceeeeeecccccccccc Confidence 99999998732221000 000 0 Q ss_pred ---c--------------------------hhhhcc-ch-------------------------------------hhhh Q lcl|NC_015266. 100 ---V--------------------------IGTVTP-DG-------------------------------------KYTG 112 (390) Q Consensus 100 ---~--------------------------~~~~~~-~~-------------------------------------~~tg 112 (390) . +..... 
.+ ...| T Consensus 155 ~~~~~~~l~~a~~~~~~~~t~~~g~~~~~~v~~v~~~~~~~~~~~~~a~~~i~~~~~~~~t~~~~~~~~~~~~~~A~~~g 234 (679) T protein:vir:10 155 SLNDYPALDNAWQIQFAAGGPGAGQAATATVVGINLDSTIFVPNDEYAMSAISERSETKRTFIDICEEMKVPAIVARYAG 234 (679) T ss_pred cccccceecccceeeeeeccccccceeeeeeeeeccCCceeeccccccccccccccccchhhhhhhhccccceeeeeccc Confidence 0 000000 00 0000 Q ss_pred ---------------------------------------h--------------------------------h------- Q lcl|NC_015266. 113 ---------------------------------------M--------------------------------K------- 114 (390) Q Consensus 113 ---------------------------------------l--------------------------------~------- 114 (390) . + T Consensus 235 ~~gn~i~v~~va~~~~~~~~~~~a~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vvv~~~g~~~~~~~~~~~ 314 (679) T protein:vir:10 235 TYGDNIKVLMIAYKDYYKFNEAGKIVSVNTINPKVFPTGLDYGNVTPSSYLEFGPQNESQFAFIVFNNGVAVESKILSTK 314 (679) T ss_pred ccCCcceEEEEeecccccccccccccccccccccccccccccccceeeeecccccccccceeeEEecccccccceeeecc Confidence 0 0 Q ss_pred ---------------hhhhh----------------------------------------hhhhh----hhhhhhhhhhh Q lcl|NC_015266. 115 ---------------ALLAA----------------------------------------QGKLA----VKPRILVAPGL 135 (390) Q Consensus 115 ---------------~~~~~----------------------------------------~~~~~----~~p~~~~apg~ 135 (390) ..... ...+. ..+.++++|++ T Consensus 315 ~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~gg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~p~~ 394 (679) T protein:vir:10 315 PGDRDIYGTSIYINEYFGNGYSSFVQGVAESWPVGYTGVLAFGGGQSSNTDISAAEFMKGWDMFADREHTDVNLFIAGAV 394 (679) T ss_pred cccccccchhhhhhhhhcCcccceeeeccccccccccceeeccCCccCCCccchhhhhhhhhhhhcccccccceEEecCC Confidence 00000 00000 01234455665 Q ss_pred c------chHHHHHHHHhhhhcc-eEEeeccccc--------CchHHHHHHhhh-------------hccceEEEEeeee Q lcl|NC_015266. 
136 D------TQPVAAAFATIAQSLR-AMVYVAAHGC--------KTKEEAVAYRKQ-------------FGQREIMVIWPDW 187 (390) Q Consensus 136 ~------~~~v~~al~~~~~~~~-~~~~~d~~~~--------~~~~~a~~~~~~-------------~~~~~~~~~~p~~ 187 (390) + .++|+.+|..+|++++ ++.++|.+.. .+.+++..||+. ++|.|+++||||+ T Consensus 395 ~~~~~~~~~~v~~~l~~~~~~~~~~~ai~d~p~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~s~~~~~~~p~~ 474 (679) T protein:vir:10 395 AGEGAQIASTVQKAVVAIADERRDCLVLISPPREYMINQPAASVVRKLVDWRRGVNQAGISLDDNMNIGTTYASVDGNYK 474 (679) T ss_pred CCCchhhhHHHHHHHHHHHHhhCCeEEEEeccccccccccccchHHHHHHHHhhcccccchhhhhhccCcceEEEEccce Confidence 4 2468899999999998 5566665532 455778888863 5689999999999 Q ss_pred EEEeeccCceeEecHHHHHHHHHhhhhhccceeecccCceeecccccccccchhhhccccccccccccceeEEEcC--CC Q lcl|NC_015266. 188 LGWDDITNSTVAIPAPAIAAGLRAKIDNDIGWHKTLSNVVVNGVTGISADVSWDLQDPATDAGYLNENQVTTLVNR--NG 265 (390) Q Consensus 188 ~~~~~~~~~~~~~p~s~~vAg~~a~~d~~~g~~~span~~l~gv~~~~~~~~~~~~~~~~~~~~l~~~gI~~~~~~--~G 265 (390) +++|+.++..+++|||+++||++||+|.++|||+||+|+++.++.++.. . .......|.+.||.+|||+++++ +| T Consensus 475 ~~~d~~~~~~~~~p~sg~vAGl~Ar~D~~~g~~~sPan~~~~~i~g~~~-~--~~~~~~~~~~~Ln~~gin~i~~~~g~G 551 (679) T protein:vir:10 475 YQYDKYNDVNRWIPLAADIAGLCARTDTVGQPWQSPAGFNRGQIVNVIK-L--AVDTRQAHRDEMYTNGINPIVGFAGQG 551 (679) T ss_pred eeecccCCceEEechHHHHHHHHHHhhccCCcEECcCCeeecccccccc-c--eeecChhhHHhhhhCCceEEEEecCCe Confidence 9999999999999999999999999999999999999998776665432 1 22234457889999999999864 78 Q ss_pred EEEEccccCCCC-cccceeeehhhHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEecCC Q lcl|NC_015266. 
266 FRFWGSRTCDAD-GKFFFENYTRSAQVIADTIAEEQMGVVDGPLNPSRARDIIENINAWFRREVSVGELIGGGAWYDPEP 344 (390) Q Consensus 266 ~~~wG~rT~~~d-~~~~~i~vrR~~~~i~~~l~~~~~~~v~e~n~~~~~~~i~~~i~~~L~~l~~~g~l~g~~v~~d~~~ 344 (390) +++||+||++.+ ++|+||+|||||+||+++|++.++|+|||||++.+|.+|+++|++||++||++|+|.||+|+||+++ T Consensus 552 ~~~wG~rT~~~~~s~~~~i~vrR~~~~i~~si~~~~~~~v~epn~~~~~~~i~~~i~~fL~~l~~~gal~gf~v~~d~~~ 631 (679) T protein:vir:10 552 YILYGDKTASQAPTPFDRINVRRLFNLLKKSISESAKYKLFELNDAFTRSSFRSEVGSYLDTIRSLGGIYDFRVVCDESN 631 (679) T ss_pred EEEEcccccCCCCcccceEehhhHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEcCCC Confidence 999999999876 4899999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCHHHHhCCeEEEEEEEEecccceEEEEEEEEcc--hHHHHHHHHhcC Q lcl|NC_015266. 345 NTTDELTSGGTWIDYDYTPVPPLENLKLRQRITD--RYLADFASRVSA 390 (390) Q Consensus 345 nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~--~~~~~l~~~~~a 390 (390) ||+++|++|+|+++|+++|++|+|||+|++.... .+|+|++++|+- T Consensus 632 nt~~~i~~G~~~~~i~~~p~~pae~i~~~~~~~~~~~~~~e~~~~~~~ 679 (679) T protein:vir:10 632 NTPAVIDRNEFVATILIKPARSINYITLSFVATSTGADFDELVGSFQQ 679 (679) T ss_pred CCHHHhhCCeEEEEEEEEecCCccEEEEEEEEeecCccHHHHHHHhcC Confidence 9999999999999999999999999999988754 479999999988 No 24 >protein:vir:7206 Length: 659 # NCBI annotation: gp18 tail sheath protein # Family: family:all:661 # MgeID: mge:142 # MgeName: T4 # Cross-refs: genbank:acc:NP_049780;genbank:gi:9632592;genbank:GeneID:1258597 Probab=100.00 E-value=3e-84 Score=478.47 Aligned_cols=379 Identities=12% Similarity=0.098 Sum_probs=296.6 Q ss_pred CCccCCCEEEEECCCCCccccccccccceeeeccccccccceeccceEEEechhHHHHhhc---cccchHHHHHhhhccc Q lcl|NC_015266. 2 PQDYHHGVRVIEINEGGRPIRTVSTAVLGIVCTGADADPATFPLDTPVLLTNVIAALGKAG---TKGTLRRTLDAIGKQT 78 (390) Q Consensus 2 a~~~~hGV~v~ev~~~~~~i~~v~tavig~vgta~~~~~~~~~~~~~~~i~~~~~~~~~~~---~~gtl~~al~~~~~~~ 78 (390) -++.+|||||+|++.+++++.. 
.|++.+|+|.++.+ |+++|++++++.++...|| ....+.+++..+|.++ T Consensus 1 ~~~~~PgVyvee~~~~~~~~~~-~ts~~~fvG~~~~G-----p~~~p~~i~s~~~~~~~fG~~~~~~~~~~~~~~~f~ng 74 (659) T protein:vir:72 1 MTLLSPGIELKETTVQSTVVNN-STGTAALAGKFQWG-----PAFQIKQVTNEVDLVNTFGQPTAETADYFMSAMNFLQY 74 (659) T ss_pred CceecCceEEEEecCCcccccC-CCcceEEEeecCCC-----CCcccEEecCHHHHHHHcCCcCCCCchhHHHHHHHHhC Confidence 3455799999999999977655 89999999998755 6789999999999988887 4567889999999999 Q ss_pred CceEEEEeeccccccccc---------------------------ccc----------------------c--------- Q lcl|NC_015266. 79 KPVTVVVRVAEGKDEAET---------------------------TAN----------------------V--------- 100 (390) Q Consensus 79 ~~~~~vv~v~~~~~~~~~---------------------------~~~----------------------~--------- 100 (390) |..|+++++......... ... + T Consensus 75 g~~~~vvRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~g~v~~~~~~~~~~~~~v~t~~~~a~~ 154 (659) T protein:vir:72 75 GNDLRVVRAVDRDTAKNSSPIAGNIDYTISTPGSNYAVGDKITVKYVSDDIETEGKITEVDADGKIKKINIPTGKNYAKA 154 (659) T ss_pred CceEEEEEccCCcccccccccccccceeecccccccccceeeeeeeccccccccceEEEeeccccceeeeeccccccccc Confidence 999999987321100000 000 0 Q ss_pred --------------------hh---h------h---------ccc-------h--------------------------- Q lcl|NC_015266. 101 --------------------IG---T------V---------TPD-------G--------------------------- 108 (390) Q Consensus 101 --------------------~~---~------~---------~~~-------~--------------------------- 108 (390) .. . . +.. . T Consensus 155 ~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~v~v~~~~~~~~~~v~~~~~a~~~~~~~~~v~~~~~~~~~a~~~gt~g~~~ 234 (659) T protein:vir:72 155 KEVGEYPTLGSNWTAEISSSSSGLAAVITLGKIITDSGILLAEIENAEAAMTAVDFQANLKKYGIPGVVALYPGELGDKI 234 (659) T ss_pred cccccccccccceeeEEeeccccccceEEEEEeecCcceeeeeccccchhhhcccccccccccccceeeeccccccccce Confidence 00 0 0 000 0 Q ss_pred -------------------------hhhhh-----------------------------------------------hh- Q lcl|NC_015266. 
109 -------------------------KYTGM-----------------------------------------------KA- 115 (390) Q Consensus 109 -------------------------~~tgl-----------------------------------------------~~- 115 (390) ...+. .. T Consensus 235 tv~i~~~~~~~~~~~~~v~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 314 (659) T protein:vir:72 235 EIEIVSKADYAKGASALLPIYPGGGTRASTAKAVFGYGPQTDSQYAIIVRRNDAIVQSVVLSTKRGEKDIYDSNIYIDDF 314 (659) T ss_pred eEEEccccccccceeeeeecccccccccccceeeeeeecccccccceeeecccceeeeeeeeeccccccccchhhhhhhh Confidence 00000 00 Q ss_pred hh----------------------------------------hhhh----hhhhhhhhhhhhhhcc------hHHHHHHH Q lcl|NC_015266. 116 LL----------------------------------------AAQG----KLAVKPRILVAPGLDT------QPVAAAFA 145 (390) Q Consensus 116 ~~----------------------------------------~~~~----~~~~~p~~~~apg~~~------~~v~~al~ 145 (390) +. .... .....+.++.+||+.+ .+++.+|. T Consensus 315 ~~~~~~~~v~~~~~~~~~~~~~~~~l~gg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~p~~~~~~~~~~~~v~~~l~ 394 (659) T protein:vir:72 315 FAKGGSEYIFATAQNWPEGFSGILTLSGGLSSNAEVTAGDLMEAWDFFADRESVDVQLFIAGSCAGESLETASTVQKHVV 394 (659) T ss_pred hhcCCceEEEEEecccCCcccccccccccccccccccchhHHHHHHHhhhccccceeEEEecCCCCcchhhhHHHHHHHH Confidence 00 0000 0001245566677643 45889999 Q ss_pred HhhhhcceEE-eeccc--------ccCchHHHHHHhhh----------hccceEEEEeeeeEEEeeccCceeEecHHHHH Q lcl|NC_015266. 146 TIAQSLRAMV-YVAAH--------GCKTKEEAVAYRKQ----------FGQREIMVIWPDWLGWDDITNSTVAIPAPAIA 206 (390) Q Consensus 146 ~~~~~~~~~~-~~d~~--------~~~~~~~a~~~~~~----------~~~~~~~~~~p~~~~~~~~~~~~~~~p~s~~v 206 (390) ++|+++++++ ++|.+ ...+.+++.+||+. 
++|+++++||||++++|+.++..+++|||+++ T Consensus 395 ~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~sg~v 474 (659) T protein:vir:72 395 SIGDARQDCLVLCSPPRETVVGIPVTRAVDNLVNWRTAAGSYTDNNFNISSTYAAIDGNHKYQYDKYNDVNRWVPLAADI 474 (659) T ss_pred HHHhhhCCEEEEEcCccccccCCCcccCHHHHHHHHhhccccccccccccceeEEEEcCceeeccccCCceEEechHHHH Confidence 9999998654 44443 34567889999975 57889999999999999999999999999999 Q ss_pred HHHHhhhhhccceeecccCceeecccccccccchhhhccccccccccccceeEEEcC--CCEEEEccccCCCCc-cccee Q lcl|NC_015266. 207 AGLRAKIDNDIGWHKTLSNVVVNGVTGISADVSWDLQDPATDAGYLNENQVTTLVNR--NGFRFWGSRTCDADG-KFFFE 283 (390) Q Consensus 207 Ag~~a~~d~~~g~~~span~~l~gv~~~~~~~~~~~~~~~~~~~~l~~~gI~~~~~~--~G~~~wG~rT~~~d~-~~~~i 283 (390) ||+++|+|.++|||+||||+++.++.++.. ........|.+.||.+|||+++++ +|+++||+||+++|+ +|+|| T Consensus 475 AGl~Ar~D~~~G~~~span~~~~~i~g~~~---~~~~~~~~~~~~Ln~~gIn~i~~~~g~G~~~wG~rT~~~~~s~~~~i 551 (659) T protein:vir:72 475 AGLCARTDNVSQTWMSPAGYNRGQILNVIK---LAIETRQAQRDRLYQEAINPVTGTGGDGYVLYGDKTATSVPSPFDRI 551 (659) T ss_pred HHHHHHhhccCCcEEccCCeeeceeecccc---ccccCChhHHHHHhhCCceEEEEecCCeEEEEcccccCCCCcccceE Confidence 999999999999999999998776666432 223344567889999999999865 689999999998775 89999 Q ss_pred eehhhHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEecCCCCHHHHhCCeEEEEEEEEe Q lcl|NC_015266. 
284 NYTRSAQVIADTIAEEQMGVVDGPLNPSRARDIIENINAWFRREVSVGELIGGGAWYDPEPNTTDELTSGGTWIDYDYTP 363 (390) Q Consensus 284 ~vrR~~~~i~~~l~~~~~~~v~e~n~~~~~~~i~~~i~~~L~~l~~~g~l~g~~v~~d~~~nt~~~i~~G~~~~~i~~~p 363 (390) +|||+|+||+++|++.++|+|||||++.+|++|+++|++||++||++|+|.||+|+||+++||+++|++|+|+++|+++| T Consensus 552 ~vrR~~~~i~~si~~~~~~~v~e~n~~~l~~~i~~~i~~fL~~l~~~gal~~~~V~~d~~~nt~~~i~~G~~~~~i~~~p 631 (659) T protein:vir:72 552 NVRRLFNMLKTNIGRSSKYRLFELNNAFTRSSFRTETAQYLQGNKALGGIYEYRVVCDTTNNTPSVIDRNEFVATFYIQP 631 (659) T ss_pred eehhHHHHHHHHHHHHHHHhhcCCCCHHHHHHHHHHHHHHHHHHHhcCceeeEEEEEcCCCCCHHHhhCCeEEEEEEEEe Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccceEEEEEEEEcch--HHHHHHHHhc Q lcl|NC_015266. 364 VPPLENLKLRQRITDR--YLADFASRVS 389 (390) Q Consensus 364 ~~p~e~i~~~~~~~~~--~~~~l~~~~~ 389 (390) ++|+|||+|++..... +|+|+.-..- T Consensus 632 ~~pae~I~~~~~~~~~~~~~~e~~~~~~ 659 (659) T protein:vir:72 632 ARSINYITLNFVATATGADFDELTGLAG 659 (659) T ss_pred cCCccEEEEEEEEeecCcchHHhcccCC Confidence 9999999999888554 4555554433 No 25 >protein:vir:101187 Length: 663 # NCBI annotation: tail sheath protein # Family: family:all:661 # MgeID: mge:1582 # MgeName: 44RR2.8t # Cross-refs: genbank:acc:NP_932509;genbank:gi:37651635;genbank:GeneID:2610680 Probab=100.00 E-value=3.3e-84 Score=478.28 Aligned_cols=377 Identities=14% Similarity=0.094 Sum_probs=294.7 Q ss_pred CCccCCCEEEEECCCCCccccccccccceeeeccccccccceeccceEEEechhHHHHhhcc---ccchHHHHHhhhccc Q lcl|NC_015266. 2 PQDYHHGVRVIEINEGGRPIRTVSTAVLGIVCTGADADPATFPLDTPVLLTNVIAALGKAGT---KGTLRRTLDAIGKQT 78 (390) Q Consensus 2 a~~~~hGV~v~ev~~~~~~i~~v~tavig~vgta~~~~~~~~~~~~~~~i~~~~~~~~~~~~---~gtl~~al~~~~~~~ 78 (390) -+++.|||||+|+ ++++++..+.|++.+|+|.++.+ |+++|++++++.++...||. 
...+.+++..+|.|+ T Consensus 1 ~~~~~PgVyv~e~-~~~~~i~~v~ts~~~fvG~~~~G-----p~~~p~~i~s~~d~~~~fG~~~~~~~~~~~v~~~f~ng 74 (663) T protein:vir:10 1 MALLSPGIEMKET-SINSTVVRSATGRAAIVGKFAWG-----PAYEVRQVTNEVELVDMFGSPDNVTAPYFMSAMNFLQY 74 (663) T ss_pred CceecCceEEEEe-cCcccccccCccceeEEeeeccC-----CCCccEEecCHHHHHHHhCCcCccchhHHHHHHHHHhC Confidence 4456799999999 69999999999999999998755 67899999999999888875 567889999999999 Q ss_pred CceEEEEeecccccccccc-----------------------------c------------------c------------ Q lcl|NC_015266. 79 KPVTVVVRVAEGKDEAETT-----------------------------A------------------N------------ 99 (390) Q Consensus 79 ~~~~~vv~v~~~~~~~~~~-----------------------------~------------------~------------ 99 (390) |..++++++....+..... . . T Consensus 75 g~~~~vvRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~g~~~~v~~~~~~~~~~~~~~~~~~n~~~~~v~~~~a~~~~~~ 154 (663) T protein:vir:10 75 GNDLRLVRVIDMEKAKNASPLVNQVSVTITTEGQGYTVGDAITVKYNNATITEAGKVTAVDSDGKIKSLFVPTAEIIAKT 154 (663) T ss_pred CCeEEEEEccCCcccccccccCCcceeeeeccccCccccccccccccccccccccceeeeccCCceEEEEeccccccccc Confidence 9999999874321100000 0 0 Q ss_pred ----------------c----------------h-h-hh----------ccch-----------------hhhh------ Q lcl|NC_015266. 100 ----------------V----------------I-G-TV----------TPDG-----------------KYTG------ 112 (390) Q Consensus 100 ----------------~----------------~-~-~~----------~~~~-----------------~~tg------ 112 (390) + . . +. +.+. ...| T Consensus 155 ~~v~~~~~~~~~~~~~~~~~~~~~~~~~~v~~vv~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~a~~~G~~Gn~i 234 (663) T protein:vir:10 155 RQLGTYPTLGDNWRIDVSGASGGSAAALALGNIVVDSGVTFGNSEDAPAVMTSPAVMEKYAKFGMPLVSAVYPGEIGSTV 234 (663) T ss_pred cccceeeeccccceeEeeeccccccccccccceecccceeeEeeccccccccccchhhhcccccceeeeeecccccccce Confidence 0 0 0 00 0000 0000 Q ss_pred ----------------------------------------------------------h------------h-hhhhhhh Q lcl|NC_015266. 
113 ----------------------------------------------------------M------------K-ALLAAQG 121 (390) Q Consensus 113 ----------------------------------------------------------l------------~-~~~~~~~ 121 (390) + . ....... T Consensus 235 ~v~i~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~~~~~~~~~~~~~~ 314 (663) T protein:vir:10 235 EVEIVSKTAFNSGAQQTIYPFGGTRTSNARGVIQYGPMTDDQFAIIVRRDGIVVESTVLSTRKGDRDVYGSNIFMDDYFR 314 (663) T ss_pred eEEecccccccccccccccccccccccccceeeeeccccccceeEEEecCCcceeeeeeeecccccccchhhhhhhhhhc Confidence 0 0 0000000 Q ss_pred hh------------------------------------------------hhhhhhhhh--hhhc----chHHHHHHHHh Q lcl|NC_015266. 122 KL------------------------------------------------AVKPRILVA--PGLD----TQPVAAAFATI 147 (390) Q Consensus 122 ~~------------------------------------------------~~~p~~~~a--pg~~----~~~v~~al~~~ 147 (390) .. .+.+.++.. |+.. ..+|+.+|..+ T Consensus 315 ~~~~~~~~~~~~~~~~~~~~~~~l~gg~d~~~~~~~~~~~~~~~~l~~~~~~~~~~~i~~~~~~~~~~~~~~v~~al~~~ 394 (663) T protein:vir:10 315 NGGSNFIFASSEGWPAGFTGIIQLGGGTSANADVGADELIKGWDLFSDREALHVNLMIAGACGSDGAEIASTVQKYVVSL 394 (663) T ss_pred cCcceEEEEeecccCccccceeEcccccCCCccccchhhHHHHHhhhcccccceeEEEeccCCCCchhhHHHHHHHHHHH Confidence 00 000001111 1111 24588899999 Q ss_pred hhhcc-eEEeeccccc--------CchHHHHHHhh-------------hhccceEEEEeeeeEEEeeccCceeEecHHHH Q lcl|NC_015266. 148 AQSLR-AMVYVAAHGC--------KTKEEAVAYRK-------------QFGQREIMVIWPDWLGWDDITNSTVAIPAPAI 205 (390) Q Consensus 148 ~~~~~-~~~~~d~~~~--------~~~~~a~~~~~-------------~~~~~~~~~~~p~~~~~~~~~~~~~~~p~s~~ 205 (390) |++++ ++.++|.|.. 
.+.+++.+|++ +++|+|+++||||++++|+.++..+++|||++ T Consensus 395 a~~~~~~~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~l~~P~~~~~d~~~~~~~~~p~s~~ 474 (663) T protein:vir:10 395 ADDRQDCVAIVNPPAELMVGIPTSTAVKNIVEWRNGMTGSGEVVDNNMNISSTYAFIIGNYKYQYDKYNDINRWVPLAAD 474 (663) T ss_pred HHhhCCEEEEEecCcccccccccccchHHHHHHHHhccccccchhhccccCcceEEEEccceEEecccCCceEEechhHH Confidence 99988 5677787643 35677788775 36789999999999999999999999999999 Q ss_pred HHHHHhhhhhccceeecccCceeecccc---cccccchhhhccccccccccccceeEEEc---CCCEEEEccccCCCCc- Q lcl|NC_015266. 206 AAGLRAKIDNDIGWHKTLSNVVVNGVTG---ISADVSWDLQDPATDAGYLNENQVTTLVN---RNGFRFWGSRTCDADG- 278 (390) Q Consensus 206 vAg~~a~~d~~~g~~~span~~l~gv~~---~~~~~~~~~~~~~~~~~~l~~~gI~~~~~---~~G~~~wG~rT~~~d~- 278 (390) +||++||+|.++|||+||||+++.++.+ ++..+ ...|.+.||.+|||+++. ++|+++||+||++.++ T Consensus 475 vAGl~Ar~D~~~g~~~sPan~~~~~i~g~~~~~~~~------~~~~~~~Ln~~gin~i~~~~~~~G~~~wG~rT~s~~~s 548 (663) T protein:vir:10 475 IAGLCAYTDQVSHPWMSPAGYRRGQIRNCIKLAIEP------KQSMRDTMYQVAINPVTGFAGGDGFVLFGDKMATQVPS 548 (663) T ss_pred HHHHHHHhhccCCceEccCCceeccccccccceecc------ChhHHHHHhhCCceEEEEEeCCCcEEEEcccccCCCCc Confidence 9999999999999999999998654444 44333 334677888899998864 3699999999998764 Q ss_pred ccceeeehhhHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEecCCCCHHHHhCCeEEEE Q lcl|NC_015266. 
279 KFFFENYTRSAQVIADTIAEEQMGVVDGPLNPSRARDIIENINAWFRREVSVGELIGGGAWYDPEPNTTDELTSGGTWID 358 (390) Q Consensus 279 ~~~~i~vrR~~~~i~~~l~~~~~~~v~e~n~~~~~~~i~~~i~~~L~~l~~~g~l~g~~v~~d~~~nt~~~i~~G~~~~~ 358 (390) +|+||+|||||+||+++|++.++|+|||||++.+|++|+++|+.||++||++|+|+||+|+||+++||+++|++|+|+++ T Consensus 549 ~~~~i~vrR~~~~i~~si~~~~~~~v~e~n~~~l~~~i~~~i~~~L~~l~~~gal~g~~v~~d~~~nt~~~i~~G~~~~~ 628 (663) T protein:vir:10 549 PFDRINVRRLFNMLKKNIGDTSKYELFENNDAFTRQSFRMETSQYLDGIRSLGGCYDFRVVCDTTNNTPNVIDRNEFVGT 628 (663) T ss_pred ccceEehhhHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhcCceeeeEEEEcCCCCCHHHhhCCeEEEE Confidence 89999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEecccceEEEEEEEEcch--HHHHHHHHhcC Q lcl|NC_015266. 359 YDYTPVPPLENLKLRQRITDR--YLADFASRVSA 390 (390) Q Consensus 359 i~~~p~~p~e~i~~~~~~~~~--~~~~l~~~~~a 390 (390) |+++|++|+|||+|++..... .|+|++++|++ T Consensus 629 i~~~p~~pae~i~~~~~~~~~~~~~~e~~~~~~~ 662 (663) T protein:vir:10 629 IYVKPPRSINYITLNMVATSTGANFDELIGPMQL 662 (663) T ss_pred EEEEecCCcceEEEEEEEeecCccHHHHHHHHhc Confidence 999999999999999988654 69999999999 No 26 >protein:vir:104858 Length: 729 # NCBI annotation: T4-like tail sheath protein # Family: family:all:661 # MgeID: mge:1630 # MgeName: P-SSM2 # Cross-refs: genbank:acc:YP_214361;genbank:gi:61806001;genbank:GeneID:3294378 Probab=100.00 E-value=5.7e-84 Score=476.97 Aligned_cols=379 Identities=15% Similarity=0.120 Sum_probs=295.7 Q ss_pred CCC-ccCCCEEEEECCCCCccccccccccceeeeccccccccceeccceEEEechhHHHHhhccc-----cchHHHHHhh Q lcl|NC_015266. 1 MPQ-DYHHGVRVIEINEGGRPIRTVSTAVLGIVCTGADADPATFPLDTPVLLTNVIAALGKAGTK-----GTLRRTLDAI 74 (390) Q Consensus 1 Ma~-~~~hGV~v~ev~~~~~~i~~v~tavig~vgta~~~~~~~~~~~~~~~i~~~~~~~~~~~~~-----gtl~~al~~~ 74 (390) |++ +.+|||||+|++.++++|..|.|++.+|+|.++.+ |+++|++++++.++...||.. 
..+.+++..+ T Consensus 1 m~~~~~~PgVyv~e~~~~~~~i~~v~ts~~~fvG~~~~G-----p~~~p~~i~s~~~~~~~fG~~~~~~~~~~~~~~~~~ 75 (729) T protein:vir:10 1 MPLNLASPGIVVREVDLTIGRVDPTSGSIGALVAPFAKG-----PVNDPQLIESEEDLLQTFGQPYSTDKHYEYWMVASS 75 (729) T ss_pred CCccccCCceEEEEecCCCcccccccccceeEEeccccC-----CCccCeEcCCHHHHHHHcCccccCCcchhHHHHHHH Confidence 996 55689999999999999999999999999998755 678999999999998888852 3467899999 Q ss_pred hcccCceEEEEeecccccccc---------------------------------------c------------------- Q lcl|NC_015266. 75 GKQTKPVTVVVRVAEGKDEAE---------------------------------------T------------------- 96 (390) Q Consensus 75 ~~~~~~~~~vv~v~~~~~~~~---------------------------------------~------------------- 96 (390) |.|+|..|+++++........ . T Consensus 76 f~ngg~~~~vvRv~~~~~~~a~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~G~~gn~~~v~v~ 155 (729) T protein:vir:10 76 YLAYGGTMQVVRADDYNTQTGVGLKNAFVTGGVGVGATMLRITSNTHYNQLGYDENTISGVSVAAKNPGTWANGIKVAII 155 (729) T ss_pred HHhCCceEEEEecCcccccccccccccccccccccccccccccccccccccccccCCCcceEEEEeccCccccceeeEEe Confidence 999999999998743110000 0 Q ss_pred -----c-------------------------------------ccchhhhc-cc------------------hh------ Q lcl|NC_015266. 97 -----T-------------------------------------ANVIGTVT-PD------------------GK------ 109 (390) Q Consensus 97 -----~-------------------------------------~~~~~~~~-~~------------------~~------ 109 (390) . .......+ .. .. T Consensus 156 ~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~s~~~~~~~~~~~~~~~~~~~ 235 (729) T protein:vir:10 156 DGKADQILTVASGNTTAVGSAVTQSISKTIGTATGTTTIDGVLKGIVTGSTDTTLEVKVISHISAAGVETAVEYQQNGTY 235 (729) T ss_pred cccCcceeeeeccccccceeeeeeeccccccccccceeeeeeecccccccccccccceecccccccccceecccccccee Confidence 0 00000000 00 00 Q ss_pred ---------------------------------------hh--------------------------------------- Q lcl|NC_015266. 
110 ---------------------------------------YT--------------------------------------- 111 (390) Q Consensus 110 ---------------------------------------~t--------------------------------------- 111 (390) .+ T Consensus 236 ~~~~~~s~~~~a~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~~d~~~~~~~d~~~~~~ 315 (729) T protein:vir:10 236 TFDNSGSVNVIAAGSSGSGSAKSYTAQTDWFESQNIVLSNSTLEWDSIADAPGTSTYVSTRGGKNDEIHVLVIDDKGTIT 315 (729) T ss_pred eecccCccceeeeccccccccccceeeeccccccccccccccccccccccccccccccccccccccccceeeeccccccc Confidence 00 Q ss_pred h-----h------------------------------------------------------------------------- Q lcl|NC_015266. 112 G-----M------------------------------------------------------------------------- 113 (390) Q Consensus 112 g-----l------------------------------------------------------------------------- 113 (390) + + T Consensus 316 ~~~g~vve~~~~~s~~~~~~~~~~~~~~~~~vi~~~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~ 395 (729) T protein:vir:10 316 GNSGTILEKHLSLSKAKDAEYSVGSSSYWRDFLATNSKYIFGGGATSGITTTGYSVSSTNTLDTDSGWDQNAEGVNFGAS 395 (729) T ss_pred cCcccceeeeeeeeeccccccccccccccceeeccccceeeecccccccccccccccccceecccccccccccccccccc Confidence 0 0 Q ss_pred ------------------------------------hhhhhhhhhhhhhhhhhhh---hhhcchHHHHHHHHhhhhcceE Q lcl|NC_015266. 114 ------------------------------------KALLAAQGKLAVKPRILVA---PGLDTQPVAAAFATIAQSLRAM 154 (390) Q Consensus 114 ------------------------------------~~~~~~~~~~~~~p~~~~a---pg~~~~~v~~al~~~~~~~~~~ 154 (390) +++.+. +.....+.++.. ++.....++.++..+|++++.+ T Consensus 396 ~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~g~~~l~~~-~~~~~~~~~~~~~~~~~~~~~~v~~a~~~~~~~~~~~ 474 (729) T protein:vir:10 396 GVATLTLAGGTNYGDKTDLTTSGALSSGVDDIISGYTLFENT-EEIEVDFILMGAAHHPKEQSQAVAEKVTAVAEARKDA 474 (729) T ss_pred ceeEEEeecccccccccccccccccccchhHHHHHHHHhhcc-cccccceeeecCCCCCccchHHHHHHHHHHHHhcCCe Confidence 000000 000000000000 1112345778899999988754 Q ss_pred -Eeecccc-----------------cCchHHHHHHhhhhc-cceEEEEeeeeEEEeeccCceeEecHHHHHHHHHhhhhh Q lcl|NC_015266. 
155 -VYVAAHG-----------------CKTKEEAVAYRKQFG-QREIMVIWPDWLGWDDITNSTVAIPAPAIAAGLRAKIDN 215 (390) Q Consensus 155 -~~~d~~~-----------------~~~~~~a~~~~~~~~-~~~~~~~~p~~~~~~~~~~~~~~~p~s~~vAg~~a~~d~ 215 (390) .++|.+. ....+++..+++.+. ++++++||||++++|+.++..+++|||+++||++||+|. T Consensus 475 ~a~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~d~~~~~~~~~p~s~~~aGl~a~~d~ 554 (729) T protein:vir:10 475 VAFISPYRQAFLNDTVSGTVTVSNIDQTTENVVGFYAPLSSSTYSVFDSGYKYMFDRFNNTFRYVPLNGDIAGTCARTDI 554 (729) T ss_pred EEEecccccccccccccccccccccchhhHHHHHHHhhccCCceEEEEcCeeEEecccCCceEEechhHHHHHHHHHhhc Confidence 4455331 123466777887764 678899999999999999999999999999999999999 Q ss_pred ccceeecccCceeecccccccccchhhhccccccccccccceeEEEcC--CCEEEEccccC-CCCcccceeeehhhHHHH Q lcl|NC_015266. 216 DIGWHKTLSNVVVNGVTGISADVSWDLQDPATDAGYLNENQVTTLVNR--NGFRFWGSRTC-DADGKFFFENYTRSAQVI 292 (390) Q Consensus 216 ~~g~~~span~~l~gv~~~~~~~~~~~~~~~~~~~~l~~~gI~~~~~~--~G~~~wG~rT~-~~d~~~~~i~vrR~~~~i 292 (390) ++|||+||+|+++.++.++.. ........+++.||.+|||+++++ +|+++||+||+ +.|++|+||++||+++|| T Consensus 555 ~~g~~~span~~~~~i~g~~~---~~~~~~~~~~~~Ln~~gIn~i~~~~~~G~~~wG~rT~~~~d~~~~~i~vrR~~~~i 631 (729) T protein:vir:10 555 EQFPWFSPAGTARGPILNSVK---LVYNPGKKQRDILYSNRINPVILSPGAGIILFGDKTGFGKSSAFDRINVRRLFIYL 631 (729) T ss_pred cCCcEEccCCccccceecccc---eeeecChhhHhhhhhCCceEEEEecCCeEEEEcceecCCCCcccceeehhhhHHHH Confidence 999999999999877766543 223345567899999999999875 69999999998 679999999999999999 Q ss_pred HHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEecCCCCHHHHhCCeEEEEEEEEecccceEEEE Q lcl|NC_015266. 
293 ADTIAEEQMGVVDGPLNPSRARDIIENINAWFRREVSVGELIGGGAWYDPEPNTTDELTSGGTWIDYDYTPVPPLENLKL 372 (390) Q Consensus 293 ~~~l~~~~~~~v~e~n~~~~~~~i~~~i~~~L~~l~~~g~l~g~~v~~d~~~nt~~~i~~G~~~~~i~~~p~~p~e~i~~ 372 (390) +++|++.++|+|||||++.+|++|+++|++||++||++|+|+||+|+||+++||+++|++|+|+++|+++|++|+|||+| T Consensus 632 ~~si~~~~~~~v~epn~~~~~~~i~~~i~~~L~~l~~~g~l~g~~v~~d~~~nt~~~i~~G~~~~~v~~~p~~p~e~i~~ 711 (729) T protein:vir:10 632 EDAISAAAKDQLFEFNDELTRTNFVNIVEPFLRDVQAKRGIFDFVVICDETNNTAAVIDSNEFVADIFIKPARSINFIGL 711 (729) T ss_pred HHHHHHHHHHhhcCCCCHHHHHHHHHHHHHHHHHHHhccceeeeEEEEcCCCCCHHHhhCCeEEEEEEEEecCCccEEEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEcch--HHHHHHHHh Q lcl|NC_015266. 373 RQRITDR--YLADFASRV 388 (390) Q Consensus 373 ~~~~~~~--~~~~l~~~~ 388 (390) +++.+.. +|+|++++| T Consensus 712 ~~~~~~~~~~~~e~~~~~ 729 (729) T protein:vir:10 712 TFVATRTGVAFEEVIGSV 729 (729) T ss_pred EEEEeecCccHHHHHhcC Confidence 9988765 789999999 No 27 >protein:vir:101804 Length: 663 # NCBI annotation: gp18 # Family: family:all:661 # MgeID: mge:1580 # MgeName: 31 # Cross-refs: genbank:acc:YP_238881;genbank:gi:66391956;genbank:GeneID:3416631 Probab=100.00 E-value=4.4e-83 Score=472.08 Aligned_cols=380 Identities=13% Similarity=0.075 Sum_probs=292.1 Q ss_pred CCccCCCEEEEECCCCCccccccccccceeeeccccccccceeccceEEEechhHHHHhhcc---ccchHHHHHhhhccc Q lcl|NC_015266. 2 PQDYHHGVRVIEINEGGRPIRTVSTAVLGIVCTGADADPATFPLDTPVLLTNVIAALGKAGT---KGTLRRTLDAIGKQT 78 (390) Q Consensus 2 a~~~~hGV~v~ev~~~~~~i~~v~tavig~vgta~~~~~~~~~~~~~~~i~~~~~~~~~~~~---~gtl~~al~~~~~~~ 78 (390) -+++.|||||+|+ +++++|..+.|++.+|+|.++.+ |+++|++++++.++...|+. 
...+.+++..+|.++ T Consensus 1 ~~~~~Pgvyv~e~-~~~~~i~~~~t~~~~~vG~~~~G-----p~~~p~~v~~~~~~~~~fg~~~~~~~~~~~~~~~f~ng 74 (663) T protein:vir:10 1 MALLSPGIEMKET-SINSTVVRSATGRAAIVGKFAWG-----PAYEVRQVTNEVELVDMFGSPDNVTAPYFMSAMNFLQY 74 (663) T ss_pred CceecCceEEEEe-cCCccccccCcccceeEeecccC-----CCCccEEecCHHHHHHhcCCcCCcchhHHHHHHHHHhC Confidence 4556799999999 59999999999999999998755 67899999999998877764 456889999999999 Q ss_pred CceEEEEeeccccccccc-----------------------------cc--------------c---------------- Q lcl|NC_015266. 79 KPVTVVVRVAEGKDEAET-----------------------------TA--------------N---------------- 99 (390) Q Consensus 79 ~~~~~vv~v~~~~~~~~~-----------------------------~~--------------~---------------- 99 (390) |..|+++++......... .. + T Consensus 75 g~~~~vvRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~g~~i~v~~~~~~~~~~~~~~~~~~~~~~~~v~~~ta~~~~~~ 154 (663) T protein:vir:10 75 GNDLRLVRVIDMEKAKNASPLVNQVSVTITTEGQGYTVGDAITVKYNNATITEAGKVTAVDSDGKIKSLFVPTAEIIAKT 154 (663) T ss_pred CCeEEEEEccCCccccccccccccceeEEeecccccccccccccccccccccccccceeeecccceEEEeeccccccccc Confidence 999999987421110000 00 0 Q ss_pred ----------------c---h----------hhhccch--------hh------------------------hhh----- Q lcl|NC_015266. 100 ----------------V---I----------GTVTPDG--------KY------------------------TGM----- 113 (390) Q Consensus 100 ----------------~---~----------~~~~~~~--------~~------------------------tgl----- 113 (390) + . ......+ .. .|. T Consensus 155 ~~v~~~~~~~~~~~~~~s~~s~~~~~a~~v~~v~~d~~~~v~~~~~a~~~~t~~~~~~~~~~~~~~~i~A~~~G~~Gn~i 234 (663) T protein:vir:10 155 RQLGTYPTLGDNWRIDVSGASGGSAAALALGNIVVDSGVTFGNSEDAPAVMTSPAVMEKYAKFGMPLISAVYPGEIGSTV 234 (663) T ss_pred cccccceeeccceeeEeeeccCccccccccceeccccceEEeeccccccccccccccccccccccceEEeccCCccccee Confidence 0 0 0000000 00 000 Q ss_pred ----hh--------------------------------------------------------------------hhhhhh Q lcl|NC_015266. 
114 ----KA--------------------------------------------------------------------LLAAQG 121 (390) Q Consensus 114 ----~~--------------------------------------------------------------------~~~~~~ 121 (390) .. ...... T Consensus 235 ~V~i~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~~~~~~~~~~~~~~ 314 (663) T protein:vir:10 235 EVEIVSKTAFNSGAQQTIYPFGGTRTSNARGVIQYGPMTDDQFAIIVRRDGIVVESTVLSTRKGDRDVYGSNIFMDDYFR 314 (663) T ss_pred eeeeccccccccccccceecccccccccccceeecccccccceeeEeecCCcceeeecccccccccccccchhhhhhhhc Confidence 00 000000 Q ss_pred h------------------------------------------------hhhhhhhhhh--hhh----cchHHHHHHHHh Q lcl|NC_015266. 122 K------------------------------------------------LAVKPRILVA--PGL----DTQPVAAAFATI 147 (390) Q Consensus 122 ~------------------------------------------------~~~~p~~~~a--pg~----~~~~v~~al~~~ 147 (390) . ..+.+.++.+ ++. ..++|+.+|.++ T Consensus 315 ~~~~~~~~~~~~~~~~~~~~~~~l~gg~d~~~~~~~~~~~~~~~~l~~~~~~~~~~~i~~~~~~~~~~~~~~v~~~l~~~ 394 (663) T protein:vir:10 315 NGGSNFIFASSEGWPAGFTGIIQLGGGTSANADVGADELIKGWDLFSDREALHVNLMIAGACGSDGAEIASTVQKYVVSL 394 (663) T ss_pred CCcceEEEEeecccCccccceeEeccccCCccccchhhhHHHHHhhhcccccceeEEEeccCCCCchhhHHHHHHHHHHH Confidence 0 0000001111 111 124588899999 Q ss_pred hhhcce-EEeeccccc--------CchHHHHHHhh-------------hhccceEEEEeeeeEEEeeccCceeEecHHHH Q lcl|NC_015266. 148 AQSLRA-MVYVAAHGC--------KTKEEAVAYRK-------------QFGQREIMVIWPDWLGWDDITNSTVAIPAPAI 205 (390) Q Consensus 148 ~~~~~~-~~~~d~~~~--------~~~~~a~~~~~-------------~~~~~~~~~~~p~~~~~~~~~~~~~~~p~s~~ 205 (390) |++++. +.++|.|.. 
.+.+++.+|++ +++|+++++||||++++|+.++..+++|||++ T Consensus 395 a~~~~~~~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~sg~ 474 (663) T protein:vir:10 395 ADDRQDCVAIVNPPAELMVGIPTSTAVKNIVEWRNGMTGSGEVVDNNMNISSTYAFIIGNYKYQYDKYNDINRWVPLAAD 474 (663) T ss_pred HHhhCCEEEEEecCcccccccccccchHHHHHHHHhccccccchhhhcccCccceEEEcCceEEecccCCceEEechhHH Confidence 999884 566776643 34566777765 46789999999999999999999999999999 Q ss_pred HHHHHhhhhhccceeecccCceeecccccccccchhhhccccccccccccceeEEEc---CCCEEEEccccCCCC-cccc Q lcl|NC_015266. 206 AAGLRAKIDNDIGWHKTLSNVVVNGVTGISADVSWDLQDPATDAGYLNENQVTTLVN---RNGFRFWGSRTCDAD-GKFF 281 (390) Q Consensus 206 vAg~~a~~d~~~g~~~span~~l~gv~~~~~~~~~~~~~~~~~~~~l~~~gI~~~~~---~~G~~~wG~rT~~~d-~~~~ 281 (390) +||+|||+|.++|||+||||+++.++.++.. ........|.+.||.+|||+++. ++|+++||+||++.+ ++|+ T Consensus 475 vAGl~Ar~D~~~g~~~sPan~~~~~i~g~~~---~~~~~~~~e~~~Ln~~gin~i~~~~~~~G~~~wG~rT~~~~~s~~~ 551 (663) T protein:vir:10 475 IAGLCAYTDQVSHPWMSPAGYRRGQIRNCIK---LAIEPKQSMRDTMYQVAINPVTGFAGGDGFVLFGDKMATQVPSPFD 551 (663) T ss_pred HHHHHHHhhccCCceEccCCceecccccccc---ceeecChhHHHHHhhCCceEEEEEeCCCcEEEEcccccCCCCcccc Confidence 9999999999999999999998654544321 12222344677888899998864 369999999999876 4899 Q ss_pred eeeehhhHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEecCCCCHHHHhCCeEEEEEEE Q lcl|NC_015266. 
282 FENYTRSAQVIADTIAEEQMGVVDGPLNPSRARDIIENINAWFRREVSVGELIGGGAWYDPEPNTTDELTSGGTWIDYDY 361 (390) Q Consensus 282 ~i~vrR~~~~i~~~l~~~~~~~v~e~n~~~~~~~i~~~i~~~L~~l~~~g~l~g~~v~~d~~~nt~~~i~~G~~~~~i~~ 361 (390) ||||||||+||+++|++.++|+|||||++.+|.+|+++|+.||++||++|+|.||+|+||+++||+++|++|+|+++|++ T Consensus 552 ~i~vrR~~~~i~~si~~~~~~~v~epn~~~l~~~i~~~i~~~L~~l~~~gal~g~~v~~d~~~nt~~~i~~G~~~~~i~~ 631 (663) T protein:vir:10 552 RINVRRLFNMLKKNIGDTSKYELFENNDAFTRQSFRMETSQYLDGIRSLGGCYDFRVVCDTTNNTPNVIDRNEFVGTIYV 631 (663) T ss_pred eEehhhHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEcCCCCCHHHhhCCeEEEEEEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EecccceEEEEEEEEcc--hHHHHHHHHhcC Q lcl|NC_015266. 362 TPVPPLENLKLRQRITD--RYLADFASRVSA 390 (390) Q Consensus 362 ~p~~p~e~i~~~~~~~~--~~~~~l~~~~~a 390 (390) +|++|+|||+|++.... ..|+|++.+|++ T Consensus 632 ~p~~pae~i~~~~~~~~~~~~~~e~~~~~~~ 662 (663) T protein:vir:10 632 KPPRSINYITLNMVATSTGANFDELIGPMQL 662 (663) T ss_pred EecCCcceEEEEEEEeecCccHHHHHHHHhc Confidence 99999999999999865 469999999999 No 28 >protein:vir:5663 Length: 671 # NCBI annotation: tail sheath protein # Family: family:all:661 # MgeID: mge:119 # MgeName: KVP40 # Cross-refs: genbank:acc:NP_899602;genbank:gi:34419589;genbank:GeneID:2545776 Probab=100.00 E-value=1.3e-82 Score=469.59 Aligned_cols=375 Identities=12% Similarity=0.082 Sum_probs=283.9 Q ss_pred CCccCCCEEEEECCCCCccccccccccceeeeccccccccceeccceEEEechhHHHHhhcc---ccchHHHHHhhhccc Q lcl|NC_015266. 2 PQDYHHGVRVIEINEGGRPIRTVSTAVLGIVCTGADADPATFPLDTPVLLTNVIAALGKAGT---KGTLRRTLDAIGKQT 78 (390) Q Consensus 2 a~~~~hGV~v~ev~~~~~~i~~v~tavig~vgta~~~~~~~~~~~~~~~i~~~~~~~~~~~~---~gtl~~al~~~~~~~ 78 (390) -++..|||||+|+ +++++|..+.|++.+|+|.++.+ |+++|++++++.++...||. 
...+.+++..+|.|+ T Consensus 1 ~~~~~Pgvyv~e~-~~~~~i~~v~t~~~~fvG~~~~G-----p~~~p~~v~s~~~~~~~fG~~~~~~~~~~~v~~~f~ng 74 (671) T protein:vir:56 1 MTLLSPGIENKEI-NLASAIGRAATGRAAMVGKFEWG-----PAYSITQVTSESDLVTIFGRPNDYTAASFMTANNFLKY 74 (671) T ss_pred CceecCceEEEee-cCcccccccCcccceEEecccCC-----CCccCEEcCCHHHHHHHcCCcCCCcchhHHHHHHHHhc Confidence 4456799999999 59999999999999999998765 67999999999999888774 567889999999999 Q ss_pred CceEEEEeeccccccccc----------------------------------------ccc------------------- Q lcl|NC_015266. 79 KPVTVVVRVAEGKDEAET----------------------------------------TAN------------------- 99 (390) Q Consensus 79 ~~~~~vv~v~~~~~~~~~----------------------------------------~~~------------------- 99 (390) |..++++++......+.. ..+ T Consensus 75 g~~~~vvrv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~v~ 154 (671) T protein:vir:56 75 GNDLRLVRICDATTAQNATPLYNAVEYTIGASNGCVVGDDITITYSGVGALTAKGKVLEVDAGNNNAASKIFLPSAEIVA 154 (671) T ss_pred CCeEEEEEecCccccccchhhccccccccccCcceeeceeeeeecCcccccccCcceeEEeeeccceeeeeeccceeEEE Confidence 999999997442210000 000 Q ss_pred ---------chhh--------------hccch-------------------hh--------------------------- Q lcl|NC_015266. 100 ---------VIGT--------------VTPDG-------------------KY--------------------------- 110 (390) Q Consensus 100 ---------~~~~--------------~~~~~-------------------~~--------------------------- 110 (390) .... ..... .. T Consensus 155 ~~~~~~~~~~~~~~t~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~g~~ 234 (671) T protein:vir:56 155 AAKSDGNYPSVGTITLQPTQGDIALTNIEIIDTGSVYFPNIELAFDALTAIETEGGALKYADLIEKQGFPRLSARYVGDF 234 (671) T ss_pred eeeccccccccccccccccccceeeeeecccccceEEEeccccccccccccccccccccchhhhhccccccccccccccc Confidence 0000 00000 00 Q ss_pred --------------hhh--------------------------------------------------------------- Q lcl|NC_015266. 
111 --------------TGM--------------------------------------------------------------- 113 (390) Q Consensus 111 --------------tgl--------------------------------------------------------------- 113 (390) ..+ T Consensus 235 g~~~~v~v~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~~~~~~~~~~~~ 314 (671) T protein:vir:56 235 GDAISVEIINYADYQTAFAFAAGHTLGDIELPIYPDGGTRSINLSSYFTFGPSNSNQYAVIVRVSGEVEEAFIVSTNPGD 314 (671) T ss_pred CcceEEEEecccccccccccccceeeeeccccccccccccccccceeecccccccccceeEEeecCccceeEEEeecccc Confidence 000 Q ss_pred ----------------------------------------------------hhhhhhhhhhhhhhhhhhhhhhcch--- Q lcl|NC_015266. 114 ----------------------------------------------------KALLAAQGKLAVKPRILVAPGLDTQ--- 138 (390) Q Consensus 114 ----------------------------------------------------~~~~~~~~~~~~~p~~~~apg~~~~--- 138 (390) .++........+.|.++.+|+++.. T Consensus 315 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~d~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~ 394 (671) T protein:vir:56 315 KDVNGQSIFIDEYFENSGSAYITAIAEGWKTESGAYNFGGGSDANAGADDWMFGLDMLSDPEVLYTNLVIAGNAAAEEVS 394 (671) T ss_pred cccchhhhhhhhhhcccCceEEEecCcccCCccccccccCccccccchhHHHHHHHhhhhccccceeEEEcCCCCCccch Confidence 0000000000011222222222221 Q ss_pred ----HHHHHHHHhhhhcc-eEEeecccc--------cCchHHHHHHhh--------------hhccceEEEEeeeeEEEe Q lcl|NC_015266. 139 ----PVAAAFATIAQSLR-AMVYVAAHG--------CKTKEEAVAYRK--------------QFGQREIMVIWPDWLGWD 191 (390) Q Consensus 139 ----~v~~al~~~~~~~~-~~~~~d~~~--------~~~~~~a~~~~~--------------~~~~~~~~~~~p~~~~~~ 191 (390) ...+++..+++.++ .+.++|.+. ..+.+++.+|+. 
+++|.++++||||.+++| T Consensus 395 ~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d 474 (671) T protein:vir:56 395 IASTVQKYAIDSVGNVRQDCVVFVSPPQSLIVNKQAGTAVANIQGWRTGIDPTNGQAVVDNLNVSTTYAVIDGNYKYQYD 474 (671) T ss_pred hHHHHHHHHHHHHHhhcCCEEEEEeccccccccccccccHHHHHHHhhhccccchhhhhhhccCCcceEEEecCceEEec Confidence 12233455555443 455556442 456777888875 356889999999999999 Q ss_pred eccCceeEecHHHHHHHHHhhhhhccceeecccCceeecccc---cccccchhhhccccccccccccceeEEEcC--CCE Q lcl|NC_015266. 192 DITNSTVAIPAPAIAAGLRAKIDNDIGWHKTLSNVVVNGVTG---ISADVSWDLQDPATDAGYLNENQVTTLVNR--NGF 266 (390) Q Consensus 192 ~~~~~~~~~p~s~~vAg~~a~~d~~~g~~~span~~l~gv~~---~~~~~~~~~~~~~~~~~~l~~~gI~~~~~~--~G~ 266 (390) +.++..+++|||+++||+|||+|.++|||+||||+++.++.+ +...++ ..|.+.||.+|||+++++ +|+ T Consensus 475 ~~~~~~~~~p~s~~~AGl~Ar~D~~~g~~~span~~~~~i~g~~~~~~~~~------~~~~~~Ln~~gIn~i~~~~~~G~ 548 (671) T protein:vir:56 475 KYNDRNRWVPLAGDIAGLCAYTDQVSQPWMSPAGFNRGQIKGVNRLAVDLR------RAHRDALYQIGINPVVGFAGQGF 548 (671) T ss_pred ccCCceeEechHHHHHHHHHHhhccCCcEECcCCceeccccccccceeecC------hhHHHHHhhCCceEEEEecCCeE Confidence 999999999999999999999999999999999997655544 443332 346778899999999865 799 Q ss_pred EEEccccCCCC-cccceeeehhhHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEecCCC Q lcl|NC_015266. 
267 RFWGSRTCDAD-GKFFFENYTRSAQVIADTIAEEQMGVVDGPLNPSRARDIIENINAWFRREVSVGELIGGGAWYDPEPN 345 (390) Q Consensus 267 ~~wG~rT~~~d-~~~~~i~vrR~~~~i~~~l~~~~~~~v~e~n~~~~~~~i~~~i~~~L~~l~~~g~l~g~~v~~d~~~n 345 (390) ++||+||++.+ ++|+||+|||||+||+++|++.++|+|||||++.+|.+|+++|++||++||++|+|+||+|+||+++| T Consensus 549 ~~wG~rT~~~~~~~~~~i~vrR~~~~i~~si~~~~~~~v~epn~~~~~~~i~~~i~~fL~~l~~~gal~g~~v~~d~~~n 628 (671) T protein:vir:56 549 VLYGDKTATQQASAFDRINVRRLFNLLKKAISDAAKYRLFELNDEFTRSSFKSEIDAYLTNIQDLGGVYDFRVVCDETNN 628 (671) T ss_pred EEEcceecCCCCcccceEehhhHHHHHHHHHHHHHHHhcCCCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEcCCCC Confidence 99999999865 69999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CHHHHhCCeEEEEEEEEecccceEEEEEEEEcchHHHHHHHHhcC Q lcl|NC_015266. 346 TTDELTSGGTWIDYDYTPVPPLENLKLRQRITDRYLADFASRVSA 390 (390) Q Consensus 346 t~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~l~~~~~a 390 (390) |+++|++|+|+++|+++|++|+|||+|++.......+ |++|.- T Consensus 629 t~~~i~~G~~~~~i~~~p~~Pae~I~~~~~~~~~~~~--f~e~~~ 671 (671) T protein:vir:56 629 PGSVIDRNEFVASIYVKPAKSINFITLNFVATSTDAD--FAEIIG 671 (671) T ss_pred CHHHhhCCeEEEEEEEEecCCcceEEEEEEEeecCcc--hhhhcC Confidence 9999999999999999999999999999998776644 444444 No 29 >protein:vir:100539 Length: 663 # NCBI annotation: gp18 tail sheath monomer # Family: family:all:661 # MgeID: mge:1488 # MgeName: 25 # Cross-refs: genbank:acc:YP_656380;genbank:gi:109290131;genbank:GeneID:4156517 Probab=100.00 E-value=7.4e-82 Score=465.39 Aligned_cols=380 Identities=12% Similarity=0.073 Sum_probs=291.2 Q ss_pred CCccCCCEEEEECCCCCccccccccccceeeeccccccccceeccceEEEechhHHHHhhcc---ccchHHHHHhhhccc Q lcl|NC_015266. 2 PQDYHHGVRVIEINEGGRPIRTVSTAVLGIVCTGADADPATFPLDTPVLLTNVIAALGKAGT---KGTLRRTLDAIGKQT 78 (390) Q Consensus 2 a~~~~hGV~v~ev~~~~~~i~~v~tavig~vgta~~~~~~~~~~~~~~~i~~~~~~~~~~~~---~gtl~~al~~~~~~~ 78 (390) -+++.|||||+|+ ++++++..+.|++.+|+|.++-+ |+++|++++++.++...||. 
...+.+++..+|.++ T Consensus 1 ~~~~~Pgvyv~e~-~~~~~~~~v~t~~~~fvG~~~~g-----p~~~p~~i~s~~~~~~~fG~~~~~~~~~~~v~~~f~ng 74 (663) T protein:vir:10 1 MALLSPGIEMKET-SINSTVVRSATGRAALVGKFAWG-----PAYEIRQVTNEVELVDMFGSPDNVTAPYFMSAMNFLQY 74 (663) T ss_pred CccccCceEEEEe-cCcccccccccccceeeeccccC-----CCCcCEEecCHHHHHHHcCCcccccchHHHHHHHHHhC Confidence 4455799999999 58999999999999999998755 77999999999998887764 457899999999999 Q ss_pred CceEEEEeecccccccccc------------------------------------------------------------- Q lcl|NC_015266. 79 KPVTVVVRVAEGKDEAETT------------------------------------------------------------- 97 (390) Q Consensus 79 ~~~~~vv~v~~~~~~~~~~------------------------------------------------------------- 97 (390) |..|+++++....+..... T Consensus 75 g~~~~vvRv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~v~~~~~~~~~~~~~~~~~~~g~~~~~~~~~a~~~~~a 154 (663) T protein:vir:10 75 GNDLRLVRVIDMEQAKNASPLFNQIEVTITTEGQGYTVGDTVSIKHNTTTVTEEGKVTKVDADGKIKALFVPSSAVIAKA 154 (663) T ss_pred CCeEEEEecCCcccccccccccccceeeEeecccCccccceeeecccccccccCcceeeeccCCceeEEEeccccccccc Confidence 9999999985421110000 Q ss_pred --c--------c----c---h---------hh----------------hccchh-------------------------- Q lcl|NC_015266. 98 --A--------N----V---I---------GT----------------VTPDGK-------------------------- 109 (390) Q Consensus 98 --~--------~----~---~---------~~----------------~~~~~~-------------------------- 109 (390) . . + . .+ .+.... T Consensus 155 ~~~~~~~~~~~a~~~~v~~~~~~~~~a~av~~i~~dg~vt~~~~~~a~~~~~~~~~~~~~~~~~~~~~~a~~~g~~G~~i 234 (663) T protein:vir:10 155 KQLGTYPVLGDNWRAEVSGASGGSAATLTLGGIVVDSGVTFGNSEEAPDVMTSTKVLANFAKYGMPLISAVYPGEIGSTV 234 (663) T ss_pred cccccccccccceeeEEeeccccccccceeEeeecCCceeEEeeeccccccccceeeeeccccccceeeeecccccCcce Confidence 0 0 0 0 00 000000 Q ss_pred ------hh---------------------------h----------------------h-----------------hhhh Q lcl|NC_015266. 
110 ------YT---------------------------G----------------------M-----------------KALL 117 (390) Q Consensus 110 ------~t---------------------------g----------------------l-----------------~~~~ 117 (390) .+ | + ..+. T Consensus 235 ~v~~~~~~~~~~~~~~~v~~~~g~~~~~~~~~~~~g~~~~~~~~~~~~~~g~~~e~~~ls~~~~~~~~~~~~~~~~~~~~ 314 (663) T protein:vir:10 235 EVEVISKTAFQSGAAQPIYPFGGTRASNARSVIQYGPMTDDQFAIIVRRDGIVVESTVLSTRRGDRDVYGNNIFMDDYFR 314 (663) T ss_pred eEeecccccccccceeeecccCcccccccccccccccccchhhcccccCCCcccceeeeeccccccccchhhhhhhhhhc Confidence 00 0 0 0000 Q ss_pred hhhhhh--------------------hh-hh----------------------------hhhhhhhhc-chHHHHHHHHh Q lcl|NC_015266. 118 AAQGKL--------------------AV-KP----------------------------RILVAPGLD-TQPVAAAFATI 147 (390) Q Consensus 118 ~~~~~~--------------------~~-~p----------------------------~~~~apg~~-~~~v~~al~~~ 147 (390) .....+ +. .+ .....++++ ..+|+++|..+ T Consensus 315 ~~~s~~v~~~~~~~~~~~~~~~~l~gg~~~~~~~~~~d~~~~~~~l~~~~~~~~~~~i~~~~~~~~~~~~~~v~~~l~~~ 394 (663) T protein:vir:10 315 NGSSNFIYASSVNWPAGFTGIIQLGGGASANNAVGSDELIAGWDLFADREALHVNLMIAGACKSDGVAVASTVQKHVVAL 394 (663) T ss_pred CcccceeEeeccccCcccceeEEecccccCcccchhhhhhhHHhhhccccccCceEEEeecCCCCchhhHHHHHHHHHHH Confidence 000000 00 00 000001111 14588999999 Q ss_pred hhhcc-eEEeecccccC--------chHHHHHHh-------------hhhccceEEEEeeeeEEEeeccCceeEecHHHH Q lcl|NC_015266. 148 AQSLR-AMVYVAAHGCK--------TKEEAVAYR-------------KQFGQREIMVIWPDWLGWDDITNSTVAIPAPAI 205 (390) Q Consensus 148 ~~~~~-~~~~~d~~~~~--------~~~~a~~~~-------------~~~~~~~~~~~~p~~~~~~~~~~~~~~~p~s~~ 205 (390) |++++ ++.++|.|... 
..+++..|+ .+++|+++++||||++++|+.++..+++|||++ T Consensus 395 ~~~~~~~~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~l~~p~~~~~d~~~~~~~~~p~s~~ 474 (663) T protein:vir:10 395 ADDRQDCVAFVNPPSELLVGVPTTQAVKNIVEWRNGVTTGGEVVDNNMNISSTYAFISGNYKYQYDKYNDINRWVPLSAD 474 (663) T ss_pred HHhhCCEEEEEecCcccccccchhhhHHHHHHHhhhccccchhhhhhcccCcceEEEEecceeEecccCCceEEechHHH Confidence 99987 66777876543 234455555 467899999999999999999999999999999 Q ss_pred HHHHHhhhhhccceeecccCceeecccccccccchhhhccccccccccccceeEEEc---CCCEEEEccccCCCC-cccc Q lcl|NC_015266. 206 AAGLRAKIDNDIGWHKTLSNVVVNGVTGISADVSWDLQDPATDAGYLNENQVTTLVN---RNGFRFWGSRTCDAD-GKFF 281 (390) Q Consensus 206 vAg~~a~~d~~~g~~~span~~l~gv~~~~~~~~~~~~~~~~~~~~l~~~gI~~~~~---~~G~~~wG~rT~~~d-~~~~ 281 (390) +||++||+|.++|||+||||+++.++.++.. ........|.+.||.+|||+++. ++||++||+||++.+ ++|+ T Consensus 475 vAGl~Ar~D~~~g~~~span~~~~~i~g~~~---~~~~~~~~~~~~Ln~~gin~i~~~~~~~G~~~wG~rT~s~~~s~~~ 551 (663) T protein:vir:10 475 IAGLCAYTDQVGHPWMSPAGYRRGQLRNTIK---LAIEPKQSLRDTMYQVSINPVTGFAGGDGFVLFGDKMATQVPSPFD 551 (663) T ss_pred HHHHHHHhhccCCcEEccCCeeecceecccc---ceeecCchhHHHHHhCCCcEEEEeeCCCcEEEEcccccCCCCcccc Confidence 9999999999999999999998766655432 12223344667888899988754 469999999999876 5899 Q ss_pred eeeehhhHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEecCCCCHHHHhCCeEEEEEEE Q lcl|NC_015266. 
282 FENYTRSAQVIADTIAEEQMGVVDGPLNPSRARDIIENINAWFRREVSVGELIGGGAWYDPEPNTTDELTSGGTWIDYDY 361 (390) Q Consensus 282 ~i~vrR~~~~i~~~l~~~~~~~v~e~n~~~~~~~i~~~i~~~L~~l~~~g~l~g~~v~~d~~~nt~~~i~~G~~~~~i~~ 361 (390) ||++||+|+||+++|++.++|++||||++.+|++|+++|++||++||++|+|+||+|+||+++||+++|++|+|+++|++ T Consensus 552 ~i~vrR~~~~i~~si~~~~~~~v~epn~~~l~~~i~~~i~~~L~~l~~~gal~gf~V~~d~~~nt~~~i~~G~~~~~i~~ 631 (663) T protein:vir:10 552 RINVRRLFNMLKKNIGDTSKYELFENNDAFTRQSFRMEVSQYLDNIRSLGGVYDFRVVCDTTNNTPQVIDSNEFVATIYI 631 (663) T ss_pred eEehhhHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEcCCCCCHHHhhCCeEEEEEEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EecccceEEEEEEEEcch--HHHHHHHHhcC Q lcl|NC_015266. 362 TPVPPLENLKLRQRITDR--YLADFASRVSA 390 (390) Q Consensus 362 ~p~~p~e~i~~~~~~~~~--~~~~l~~~~~a 390 (390) +|++|+|||+|++..... .|+|+++.++- T Consensus 632 ~p~~pae~I~~~~~~~~~~~~f~e~~~~~~~ 662 (663) T protein:vir:10 632 KAPRSINYITLNFVATSTGANFDELIGPAQL 662 (663) T ss_pred EecCCcceEEEEEEEEecCccHHHHHHHHhc Confidence 999999999999998755 47777777776 No 30 >protein:vir:104477 Length: 749 # NCBI annotation: gp18 # Family: family:all:661 # MgeID: mge:1548 # MgeName: P-SSM4 # Cross-refs: genbank:acc:YP_214663;genbank:gi:61806304;genbank:GeneID:3294532 Probab=100.00 E-value=6.5e-81 Score=460.19 Aligned_cols=374 Identities=15% Similarity=0.103 Sum_probs=287.7 Q ss_pred CCCcc-CCCEEEEECCCCCccccccccccceeeeccccccccceeccceEEEechhHHHHhhcc---ccchHHHHHhhhc Q lcl|NC_015266. 1 MPQDY-HHGVRVIEINEGGRPIRTVSTAVLGIVCTGADADPATFPLDTPVLLTNVIAALGKAGT---KGTLRRTLDAIGK 76 (390) Q Consensus 1 Ma~~~-~hGV~v~ev~~~~~~i~~v~tavig~vgta~~~~~~~~~~~~~~~i~~~~~~~~~~~~---~gtl~~al~~~~~ 76 (390) ||.+| .|||||+|++.+ +.+..+.|++.+|+|.++.+ |+++|++++++.++...||. ...+.+++..+|. 
T Consensus 1 M~~~~~~PgVyv~e~~~~-~~~~~~~t~~~~fvG~~~~G-----p~~~p~~v~s~~~~~~~fG~~~~~~~~~~~v~~~F~ 74 (749) T protein:vir:10 1 MATNQSSPGVVIQERDLT-TVSTIPTANVGVIAAPFTKG-----PVEEVIEITSERQLAEKFGEPNESNYEYWFSAAQFL 74 (749) T ss_pred CCccccCCeeEEEEecCC-cccccccCceeEEEeccCCC-----CCccCEEcCCHHHHHHHcCCccCCcccHHHHHHHHh Confidence 99855 599999999776 56888999999999998755 77899999999998887764 4568999999999 Q ss_pred ccCceEEEEeecccccccc-------------------------------------------------c----------- Q lcl|NC_015266. 77 QTKPVTVVVRVAEGKDEAE-------------------------------------------------T----------- 96 (390) Q Consensus 77 ~~~~~~~vv~v~~~~~~~~-------------------------------------------------~----------- 96 (390) |+|..|+++++........ . T Consensus 75 ngg~~~~vvRv~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~pG~~gn~l~v~v~~~~~~~~~~~~~~~~~ 154 (749) T protein:vir:10 75 SYGGLLKTIRVNSSSLKNAVDTGTAPLVKNLQDYETSIEDASNNFSWVARTPGDTGNSIGIFVTDAGADQVVVVPAPGSG 154 (749) T ss_pred hcCCeEEEEEccCccccccccccccccccccccccccccccccceEEEeccCCCcCCceEEEEEcCCCceeeeeecCCcc Confidence 9999999998732110000 0 Q ss_pred -------------------c----------c-----------------------cc-----------h----hhh--ccc Q lcl|NC_015266. 97 -------------------T----------A-----------------------NV-----------I----GTV--TPD 107 (390) Q Consensus 97 -------------------~----------~-----------------------~~-----------~----~~~--~~~ 107 (390) . . +. + ++. ... T Consensus 155 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~a~~~~~~~~~~~~~~~~~~s~~~~~~~a 234 (749) T protein:vir:10 155 NEHEFVADAAVSAASGAAGKVFKYSIILTIDDVVGTFAPGSATTITIGGSAESVNVLAYDATNKKLEIGLPSGGVTGILA 234 (749) T ss_pred ceeeEEeeecccccccccccccccceeeeeccccceeecccceeeeccCcccccccccccCCcceEEEeeecccccceee Confidence 0 0 00 0 000 000 Q ss_pred h-----------------------------------------------------------hhhhh--------------- Q lcl|NC_015266. 
108 G-----------------------------------------------------------KYTGM--------------- 113 (390) Q Consensus 108 ~-----------------------------------------------------------~~tgl--------------- 113 (390) . ..++. T Consensus 235 ~~~~v~~~~~~~~~~~~i~~~~~~~~~~~~~~~a~~~~~~~~~g~~~~it~v~~~~~~~~~~t~~~~~~~a~~~gt~~~~ 314 (749) T protein:vir:10 235 DNQVITQGTNTAKINVTIERKLLVALNKSSIEFAASDVVQDTNSTNITITSVRDEYTEREYLPGVKWINVAPRPGTSLYA 314 (749) T ss_pred eeecccccccccccccccccchhhhhccccceeeccccccCCccceeEEEeeeccccccccccceeeccccccccceeee Confidence 0 00000 Q ss_pred -----------------------------hh---hh---hhh-h---------hh------------------------- Q lcl|NC_015266. 114 -----------------------------KA---LL---AAQ-G---------KL------------------------- 123 (390) Q Consensus 114 -----------------------------~~---~~---~~~-~---------~~------------------------- 123 (390) +. +. ... . .. T Consensus 315 ~~~~g~~D~~~v~v~~~~g~~~~~~g~v~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~v~~~~~~~~~~~~~~~~~~ 394 (749) T protein:vir:10 315 NGVGGHRDEMHVILVDIDGGVTGTVGALLERYIDVSKASDAKTSVGETNYYAEVIKQKSEFIYWAEHESTLYAATSSASD 394 (749) T ss_pred ecccCCCCceEEEEecCCCeeeecccceeeeeeeccccccccccccccchhhhhhccCCCEEEEEecccccccccccccc Confidence 00 00 000 0 00 Q ss_pred ----------------------------------------------------------------------------hhhh Q lcl|NC_015266. 124 ----------------------------------------------------------------------------AVKP 127 (390) Q Consensus 124 ----------------------------------------------------------------------------~~~p 127 (390) .+.+ T Consensus 395 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~d~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~ 474 (749) T protein:vir:10 395 GLFGQTAANRQFNLFRSAAGSVDYPAGVTTLGSKNNATYYYRLSGGVNYTVSAGQYTITNTDIGSAYELIGDPESQIVDF 474 (749) T ss_pred cccccccccceeeccccccccceeccccccccccCCcEEEEEccCCcccccccccccccchhHHHHHHHhhhhhhcccce Confidence 0000 Q ss_pred hhhhhhhhc---chHHHHHHHHhhhhcceEEeecccccC----------chHHHHHHhhh-hccceEEEEeeeeEEEeec Q lcl|NC_015266. 
128 RILVAPGLD---TQPVAAAFATIAQSLRAMVYVAAHGCK----------TKEEAVAYRKQ-FGQREIMVIWPDWLGWDDI 193 (390) Q Consensus 128 ~~~~apg~~---~~~v~~al~~~~~~~~~~~~~d~~~~~----------~~~~a~~~~~~-~~~~~~~~~~p~~~~~~~~ 193 (390) .++..|+++ ..+++.+|.++|+++++++.++.++.. ...++..++.+ .++.++++||||++++|+. T Consensus 475 li~~~~~~~~~~~~~v~~al~~~~~~~~~~~~~~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~ 554 (749) T protein:vir:10 475 IISGPSGTSDANALAKITSLVNIAEERRDCMVFVSPRRGNVIGISNTTTITTNIVDFFKKLPSSSYMVFDSGYKYIYDKY 554 (749) T ss_pred EEEecCCCCcchhHHHHHHHHHHHhhcCCEEEEEcCCCCcccccccchhhhhHHHHHHhhccCceeEEEEccceeeeccc Confidence 000111211 235788999999999998776655432 23455566654 5678999999999999999 Q ss_pred cCceeEecHHHHHHHHHhhhhhccceeecccCceee---cccccccccchhhhccccccccccccceeEEEcC--CCEEE Q lcl|NC_015266. 194 TNSTVAIPAPAIAAGLRAKIDNDIGWHKTLSNVVVN---GVTGISADVSWDLQDPATDAGYLNENQVTTLVNR--NGFRF 268 (390) Q Consensus 194 ~~~~~~~p~s~~vAg~~a~~d~~~g~~~span~~l~---gv~~~~~~~~~~~~~~~~~~~~l~~~gI~~~~~~--~G~~~ 268 (390) ++..+++|||+++||+++|+|.++|||+||||+++. |+.+++..+ ...|.+.||.+|||+++++ +|+++ T Consensus 555 ~~~~~~~p~s~~vAGl~Ar~D~~~g~~~SPan~~~~~i~g~~~~~~~~------~~~e~~~Ln~~gIn~i~~~~g~G~~~ 628 (749) T protein:vir:10 555 NDVYRYIPCNGDTAGLCLQTNEISEPWFSPAGFQRGVLRNAIKLAYTP------NKAQRDQLYANRVNPIVSFPGQGVVL 628 (749) T ss_pred cCceEEechHHHHHHHHHHhhccCCcEECcCCceeeeeeccccceeec------ChhHHHhhhhCCceEEEEecCCeEEE Confidence 999999999999999999999999999999999755 454444433 3446788999999999865 69999 Q ss_pred EccccC-CCCcccceeeehhhHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEecCCCCH Q lcl|NC_015266. 
269 WGSRTC-DADGKFFFENYTRSAQVIADTIAEEQMGVVDGPLNPSRARDIIENINAWFRREVSVGELIGGGAWYDPEPNTT 347 (390) Q Consensus 269 wG~rT~-~~d~~~~~i~vrR~~~~i~~~l~~~~~~~v~e~n~~~~~~~i~~~i~~~L~~l~~~g~l~g~~v~~d~~~nt~ 347 (390) ||+||+ +.|++|+||||||+|+||+++|++.++|++||||++.+|++|+++|+.||++||++|+|.||+|+||+++||+ T Consensus 629 wG~rT~~s~d~~~~~i~vRRl~~~ie~si~~~~~~~v~epn~~~l~~~i~~~i~~fL~~l~~~G~i~~f~V~~d~~~Nt~ 708 (749) T protein:vir:10 629 YGDKTALGFASAFDRINIRRLFLTVERVISTAAKAQLFEQNDEAQRSLFINIVEPYLRDVQGRRGVVDFLVKCDSTNNTP 708 (749) T ss_pred EcceecCCCCcccceeehhhhHHHHHHHHHHHHHHhhcCCCCHHHHHHHHHHHHHHHHHHHhcCCeeeeEEEEcCCCCCH Confidence 999998 6789999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHhCCeEEEEEEEEecccceEEEEEEEEcch--HHHHHHH Q lcl|NC_015266. 348 DELTSGGTWIDYDYTPVPPLENLKLRQRITDR--YLADFAS 386 (390) Q Consensus 348 ~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~--~~~~l~~ 386 (390) ++|++|+|+++|+++|++|+|||+|+++.... +|+|+++ T Consensus 709 ~~i~~G~~~~~i~~~P~~pae~I~~~~~~~~~~~~~~e~~s 749 (749) T protein:vir:10 709 EAVDRGEFYAEVFLKPTRTINYVQLTFVATRTGVSFAEVAS 749 (749) T ss_pred HHhhCCEEEEEEEEEecCCccEEEEEEEEeecCcchHHHhC Confidence 99999999999999999999999999987654 6778777 No 31 >protein:vir:98824 Length: 774 # NCBI annotation: putative phage tail sheath protein # Family: family:all:661 # MgeID: mge:1530 # MgeName: Ma-LMM01 # Cross-refs: genbank:acc:YP_851105;genbank:gi:117530262;genbank:GeneID:4484488 Probab=100.00 E-value=3.4e-80 Score=456.29 Aligned_cols=374 Identities=15% Similarity=0.099 Sum_probs=278.6 Q ss_pred CCCcc-CCCEEEEECCCCCccccc-cccccceeeeccccccccceeccceEEEechhHHHHhhccc-cchH---HHHHhh Q lcl|NC_015266. 1 MPQDY-HHGVRVIEINEGGRPIRT-VSTAVLGIVCTGADADPATFPLDTPVLLTNVIAALGKAGTK-GTLR---RTLDAI 74 (390) Q Consensus 1 Ma~~~-~hGV~v~ev~~~~~~i~~-v~tavig~vgta~~~~~~~~~~~~~~~i~~~~~~~~~~~~~-gtl~---~al~~~ 74 (390) |.-++ .|||||+|+.++++++.. |.|++.+|+|.++.+ |.++|++++++.++...++.. |.+. 
.++..+ T Consensus 279 ~~~~v~~~GVYVEEVpSGvrtIeGGV~TSVAAFVG~A~rG-----Pvn~PvlITS~aD~~~~Fg~~~GGl~GassA~r~~ 353 (774) T protein:vir:98 279 ITRNVEDNGVVIQLEPALTGSISNRFSFYVTANDNTANRG-----FTTSPALVTTIPDPAIHFTSFQGGLDGPRSAFRDF 353 (774) T ss_pred eEEEEecCceEEEEeCCCCccccccccceeeeecccccCC-----CCCcCEEEeehhHhhhhhccccCCccccceeeeee Confidence 77777 489999999999999987 999999999998755 688999999999965444210 0000 000000 Q ss_pred h----------------cccCceEEEE------------eecc------------------------------------- Q lcl|NC_015266. 75 G----------------KQTKPVTVVV------------RVAE------------------------------------- 89 (390) Q Consensus 75 ~----------------~~~~~~~~vv------------~v~~------------------------------------- 89 (390) + ..+....+.+ .+.+ T Consensus 354 ~~~sG~~~L~i~A~~pGawGN~ItV~I~~~t~~~~~l~v~~~~~s~f~~~~a~e~~tv~~~~~~~~~~v~e~~dn~~i~~ 433 (774) T protein:vir:98 354 YTFNGTPLLRLQAVSEGNWGNQVTVSIYPVNNSEFRLNVQDLNGSAFNPPLADEVYTVKLGDTNESGELNALLDSKFIRG 433 (774) T ss_pred eeecccceEEEEEeecCcCCCceEEEEEecCCceeEEEEEecCCccccccccceeEEEecccccccceeeeeeceeeEee Confidence 0 0000000000 0000 Q ss_pred -ccccccc------------------ccc-----------------------chhhhccchhhhhhhhhhhhhhhhhhhh Q lcl|NC_015266. 90 -GKDEAET------------------TAN-----------------------VIGTVTPDGKYTGMKALLAAQGKLAVKP 127 (390) Q Consensus 90 -~~~~~~~------------------~~~-----------------------~~~~~~~~~~~tgl~~~~~~~~~~~~~p 127 (390) ..+.... ... +.++.+ +..+................ T Consensus 434 ~~~~~~~~~in~vs~lv~~~~~~~a~~d~~~~~~~~~~~~~~~~~~~~v~v~lagG~D--g~~tt~~~igg~~~~~~~tg 511 (774) T protein:vir:98 434 FFLPKSIDSINYDAALVRQSPLRLAPPDESETDVENPAHVDFYGPNVLVDVTLENGYD--GPPVTNDDYVSIIRTLENQP 511 (774) T ss_pred cccccccccccccccccccchhcccccccccccccccccccccCCcceEEEeecCCCC--cccccchheecccccccccc Confidence 0000000 000 000000 00011111111111111111 Q ss_pred hhhhhhhhcchHHHHHHHHhhhhc-----ceEEeecccccCchHHHHHHhhhhccceEEEEeeeeEEEeeccCceeEecH Q lcl|NC_015266. 
128 RILVAPGLDTQPVAAAFATIAQSL-----RAMVYVAAHGCKTKEEAVAYRKQFGQREIMVIWPDWLGWDDITNSTVAIPA 202 (390) Q Consensus 128 ~~~~apg~~~~~v~~al~~~~~~~-----~~~~~~d~~~~~~~~~a~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~p~ 202 (390) -..+..+....+++.++..+|+.+ ..+.++|++++.+.+++++|+++++|+|+++||||++++|+.++...++|| T Consensus 512 i~aLl~a~~~~~V~~aii~~~e~~~~~~~~r~avid~p~g~t~~~Ai~~r~~f~S~~aal~~Pwvkv~D~~~g~~~~vPp 591 (774) T protein:vir:98 512 VHILLVGTTNVGVQQALITEAERASDSDGLRIAVLAAPPRTTPTLAASVTRGFNSTRAVMVAGWFTYAGQPNSSRYGVPG 591 (774) T ss_pred eeEEEcCccchhhHHHHHHHHHHhhhcccceEEEEECCCCCCHHHHHHHHhccCCceEEEEeCcEEEeccCCCceeecCh Confidence 111223444566777777777654 356788888999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHhhhhhccceeecccCceeecccccccccchhhhccccccccccccceeEEE---cCCCEEEEccccCCCCcc Q lcl|NC_015266. 203 PAIAAGLRAKIDNDIGWHKTLSNVVVNGVTGISADVSWDLQDPATDAGYLNENQVTTLV---NRNGFRFWGSRTCDADGK 279 (390) Q Consensus 203 s~~vAg~~a~~d~~~g~~~span~~l~gv~~~~~~~~~~~~~~~~~~~~l~~~gI~~~~---~~~G~~~wG~rT~~~d~~ 279 (390) |+++||++|++| ||+||+|++|.|++++..++....+....+.+.++.++||++. .++|+++||+||+++|++ T Consensus 592 Sg~vAGl~ArtD----v~kSPANk~I~Givg~ai~~~l~~~~t~ae~d~Ln~~gIN~i~itt~g~G~rvWG~RTlssDp~ 667 (774) T protein:vir:98 592 AAVYAGKLAAID----FFVSPAARSLVGPLFNIIESDTDNYTSRSNQDIYSAARLEVLSLDTVDRTYRFASGVTLSTDPA 667 (774) T ss_pred hHHHHHHHHhcC----cccccCCceeecceeccccccccccccchhhhhhcccccceeEEEEcCCcEEEEcccccCCCcc Confidence 999999999999 8999999999999999877766666666677788888888763 368999999999999999 Q ss_pred cceeeehhhHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhCCceeeeE-EEEecCCCCHHHHhCCeEEEE Q lcl|NC_015266. 
280 FFFENYTRSAQVIADTIAEEQMGVVDGPLNPSRARDIIENINAWFRREVSVGELIGGG-AWYDPEPNTTDELTSGGTWID 358 (390) Q Consensus 280 ~~~i~vrR~~~~i~~~l~~~~~~~v~e~n~~~~~~~i~~~i~~~L~~l~~~g~l~g~~-v~~d~~~nt~~~i~~G~~~~~ 358 (390) |+||++||+++||+++|.+.++|++||||++.+|.+|+++++.||++||++|+|+|++ |+||+++||+++|++|+|+++ T Consensus 668 wr~InVRRlfd~Ie~SI~~~~~~~VfEPNd~~l~~~I~~sI~~fL~~L~~~GaL~G~~~V~~D~etNt~~dI~~G~l~i~ 747 (774) T protein:vir:98 668 WERIYLRRVHDVVRQGAHAILRNYVAMPNSRLVRNQIAAALNAFMGELKRNGNIVSFRPAIIDGSNNSTAAYFSRELYVS 747 (774) T ss_pred cceEeehhhHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhCCceecceEEEEcCCCCCHHHhhCCEEEEE Confidence 9999999999999999999999999999999999999999999999999999999997 899999999999999999999 Q ss_pred EEEEecccceEEEEEEEEcchHHHHHHHH Q lcl|NC_015266. 359 YDYTPVPPLENLKLRQRITDRYLADFASR 387 (390) Q Consensus 359 i~~~p~~p~e~i~~~~~~~~~~~~~l~~~ 387 (390) |+++|++|+|||+|+++++.++-. |++ T Consensus 748 I~vaP~~PAEfIilri~q~t~~~~--l~E 774 (774) T protein:vir:98 748 LQFQPLYSADYIYVTISRDTETSP--LGE 774 (774) T ss_pred EEEEecCCcceEEEEEEEeeccee--ccC Confidence 999999999999999999888633 222 No 32 >protein:vir:5833 Length: 742 # NCBI annotation: similar to tail sheath protein # Family: family:all:661 # MgeID: mge:123 # MgeName: RM 378 # Cross-refs: genbank:acc:NP_835619;genbank:gi:30044022 Probab=100.00 E-value=9.8e-75 Score=426.33 Aligned_cols=363 Identities=13% Similarity=0.071 Sum_probs=248.5 Q ss_pred CCCcc------CCCEEEEECCCCCcc-----------------ccccccccceeeeccccccccceeccceEEEechhHH Q lcl|NC_015266. 1 MPQDY------HHGVRVIEINEGGRP-----------------IRTVSTAVLGIVCTGADADPATFPLDTPVLLTNVIAA 57 (390) Q Consensus 1 Ma~~~------~hGV~v~ev~~~~~~-----------------i~~v~tavig~vgta~~~~~~~~~~~~~~~i~~~~~~ 57 (390) |..++ ...|.+-..+..++| ...+...+.++............+++.++++...... 
T Consensus 343 ~v~d~~~~~~~~~~v~~~~t~~~~~pp~~~~~~e~v~~ngG~~f~v~s~~~~g~~i~~~~as~~~s~ln~~~~V~Gt~aa 422 (742) T protein:vir:58 343 SVGDFTVIVNELTNVSIPVTDSAIIPPMRFTRIEQITLSGGASFSVISNQPYGFNIQDSRHSYWLSPFKDDELIIGTELV 422 (742) T ss_pred cccceeeeccccccceeeccccccCCcccccccceeecccCcceEEEEecccCcceeccCcceEEeccCCceEEEeehhh Confidence 11111 012333222222222 1122222333322222222334466667666443322 Q ss_pred HHhhc---------cccchHHHHHhhhcccCceEEEEeeccccccccccccchhhhccchhhhhhhhhhhhhhhhhhhhh Q lcl|NC_015266. 58 LGKAG---------TKGTLRRTLDAIGKQTKPVTVVVRVAEGKDEAETTANVIGTVTPDGKYTGMKALLAAQGKLAVKPR 128 (390) Q Consensus 58 ~~~~~---------~~gtl~~al~~~~~~~~~~~~vv~v~~~~~~~~~~~~~~~~~~~~~~~tgl~~~~~~~~~~~~~p~ 128 (390) ..... ..+.+.........+++..+.+...... ....... .......+.++|++++++.. .+. T Consensus 423 ~~~~d~~t~~~v~s~~~alp~~a~sv~laGG~dg~v~v~~~~-~D~iG~~--~~~d~~~adrTGL~ALlev~-----eVt 494 (742) T protein:vir:58 423 LPALDVSTEFGVSSWEEALPEFSFLMPFQGGSDGYIRVDENE-PDTIGRV--KITPALLANYERLLPLLTED-----QFD 494 (742) T ss_pred ccccccchheeccccccccceeeEEEeecCCccccccccCCC-ccccccc--ccccccccchhHHHHhhhcC-----CCc Confidence 11110 0111111111111122222211111111 1110001 11111235678999988764 367 Q ss_pred hhhhhhhcchHHHHHHHHhhhhc--ceEEeecccccC-chHHHHHHhhhhccceEEEEeeeeEEEeeccCceeEecHHHH Q lcl|NC_015266. 129 ILVAPGLDTQPVAAAFATIAQSL--RAMVYVAAHGCK-TKEEAVAYRKQFGQREIMVIWPDWLGWDDITNSTVAIPAPAI 205 (390) Q Consensus 129 ~~~apg~~~~~v~~al~~~~~~~--~~~~~~d~~~~~-~~~~a~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~p~s~~ 205 (390) ++.+||+++..++.++.+.|+.. +..++.|.+... 
+.+++.++++.+++.++++||||+++.+ ++..+++|||++ T Consensus 495 ILiAPG~t~~~v~aav~A~la~a~~Rl~vL~D~P~~~tt~~~A~a~r~~~nSsraaly~PwVkv~d--~~~~r~vPpSga 572 (742) T protein:vir:58 495 LVLTPYLTFADHAGTVNAFINRAENRFLYLFDIAGDDDTENLAISLAGYINSSFATTFFPWVRRLT--NKGMRTVPASLA 572 (742) T ss_pred EEEEcCCCchHHHHHHHHHHHhhcCCeEEEEecCCCCchHHHHHHHHhccCCceEEEEeceeeecc--CCcceeechHHH Confidence 89999999888887777777654 334455655443 4577889999999999999999998775 467789999999 Q ss_pred HHHHHhhhhhccceeecccCceeecccccccccchhhhccccccccccccceeEEEcC-CCEEEEccccC-CCCccccee Q lcl|NC_015266. 206 AAGLRAKIDNDIGWHKTLSNVVVNGVTGISADVSWDLQDPATDAGYLNENQVTTLVNR-NGFRFWGSRTC-DADGKFFFE 283 (390) Q Consensus 206 vAg~~a~~d~~~g~~~span~~l~gv~~~~~~~~~~~~~~~~~~~~l~~~gI~~~~~~-~G~~~wG~rT~-~~d~~~~~i 283 (390) +||++|++|.++|+|+||+|+.+.+... . ...+.+.||.+|||+++++ +|+++||+||+ +.|++|+|| T Consensus 573 IAGL~ARtD~erGvw~SPANrgii~~~~----~------s~se~d~LN~~GINtIrsfG~G~rlWGnRTlassDs~wryI 642 (742) T protein:vir:58 573 AYRSIRTTDPETGLAPVGARRGVVTGEP----V------RQVDWEDLYNNRINPIVRVGNDVLLFGQKTMLNVNSALNRI 642 (742) T ss_pred HHHHHHHhccCCceEecCCcceeeeccc----c------chhhHHHHhhCCceEEEECCCcEEEEcceecCCCCcccceE Confidence 9999999999999999999986543321 1 2346778899999999886 69999999998 679999999 Q ss_pred eehhhHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEecCCCCHHHHhCCeEEEEEEEEe Q lcl|NC_015266. 
284 NYTRSAQVIADTIAEEQMGVVDGPLNPSRARDIIENINAWFRREVSVGELIGGGAWYDPEPNTTDELTSGGTWIDYDYTP 363 (390) Q Consensus 284 ~vrR~~~~i~~~l~~~~~~~v~e~n~~~~~~~i~~~i~~~L~~l~~~g~l~g~~v~~d~~~nt~~~i~~G~~~~~i~~~p 363 (390) +|||+++||+++|+++++|++||||++.+|++|++++++||++||++|+|+||+|+||+ +||+++|++|+|+++|+++| T Consensus 643 nVRRlfd~Ie~SI~~a~q~~VfEPNd~~L~~sIk~sInafL~~L~aqGALlGfrV~lDe-tNTpeDI~~Gklvv~I~vAP 721 (742) T protein:vir:58 643 NVRRLLIVMRNRISQILSSYLFENNTSENRLRAEALVRQYLESLRLRGAVTDYEVAIDS-VTTPTDIDNNTLRARVTVQP 721 (742) T ss_pred eehhhHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEcC-CCCHHHhhCCEEEEEEEEEc Confidence 99999999999999999999999999999999999999999999999999999999995 68899999999999999999 Q ss_pred cccceEEEEEEEEcchHHHHHHH Q lcl|NC_015266. 364 VPPLENLKLRQRITDRYLADFAS 386 (390) Q Consensus 364 ~~p~e~i~~~~~~~~~~~~~l~~ 386 (390) ++|||||+|+++.+....+ |+ T Consensus 722 ~~PAEfI~lrf~it~tga~--Fs 742 (742) T protein:vir:58 722 ARSIEYIDITFVITPTGVE--IT 742 (742) T ss_pred cCCcceEEEEEEEEecccc--cC Confidence 9999999999888766544 22 No 33 >protein:vir:79798 Length: 717 # NCBI annotation: tail sheath subunit # Family: family:all:12069 # MgeID: mge:1874 # MgeName: 0305phi8-36 # Cross-refs: genbank:acc:YP_001429630;genbank:gi:156564120;genbank:GeneID:5525563 Probab=100.00 E-value=3.7e-50 Score=291.56 Aligned_cols=324 Identities=13% Similarity=0.074 Sum_probs=184.5 Q ss_pred CCCccCCCEEEEE------------------CCCCCcc----ccccccccceeeeccccccccceeccceEEEechhHHH Q lcl|NC_015266. 1 MPQDYHHGVRVIE------------------INEGGRP----IRTVSTAVLGIVCTGADADPATFPLDTPVLLTNVIAAL 58 (390) Q Consensus 1 Ma~~~~hGV~v~e------------------v~~~~~~----i~~v~tavig~vgta~~~~~~~~~~~~~~~i~~~~~~~ 58 (390) |++. .+|+..+. 
|++.+.+ ....+++.+.+.-+....+...++-....+.....+.+ T Consensus 330 ~~~~-~~g~~~~~pl~~ts~dy~~~~~~vdgI~~~~~~~V~~~g~~s~a~a~~~~g~~s~d~a~f~Gg~dgl~~~~ee~Y 408 (717) T protein:vir:79 330 KPES-KRGMISEDPLVFKSGDYTNFKMLVDAINNHPFNNVVRARTKPEFEATFTSTLQAAADAKFSGGKDELSLDKEEMY 408 (717) T ss_pred cccc-cCcceeccccccccCceeeeeeeecccccCchhheeeeecccccceeeeecccCchhhccCCCccccccchhhhh Confidence 2221 12332211 1111222 11111122222111111111111111000000001100 Q ss_pred Hhh----ccccchH-HHHHhhhcccCceEEEEeeccccccccccccchhhhccchhhhhhhhhhhhhhhhhhhhhhhhhh Q lcl|NC_015266. 59 GKA----GTKGTLR-RTLDAIGKQTKPVTVVVRVAEGKDEAETTANVIGTVTPDGKYTGMKALLAAQGKLAVKPRILVAP 133 (390) Q Consensus 59 ~~~----~~~gtl~-~al~~~~~~~~~~~~vv~v~~~~~~~~~~~~~~~~~~~~~~~tgl~~~~~~~~~~~~~p~~~~ap 133 (390) ... .+.|.+. .+......+. ...+++-.....+.. . T Consensus 409 ~~lGgk~~d~g~lt~~aays~LE~~-dVDlVil~ga~adtt-----~--------------------------------- 449 (717) T protein:vir:79 409 KRLGGEKNEEGFVTKQGAYQYLENY-EVDYVIPLGVHADTK-----L--------------------------------- 449 (717) T ss_pred ccccccccccccccchhhhhhcCcc-eeEEEEecCcccccc-----c--------------------------------- Confidence 000 0111110 0111111110 011111100000000 0 Q ss_pred hhcchHHHHHHHHhhhhcc-----eEEeec--ccccCchHHHHHHhh---------------------------hhccce Q lcl|NC_015266. 134 GLDTQPVAAAFATIAQSLR-----AMVYVA--AHGCKTKEEAVAYRK---------------------------QFGQRE 179 (390) Q Consensus 134 g~~~~~v~~al~~~~~~~~-----~~~~~d--~~~~~~~~~a~~~~~---------------------------~~~~~~ 179 (390) +-..+.+..++..+|.... ++.+++ .+.....+...+++. +++ .+ T Consensus 450 ga~~d~va~alad~caalSal~r~ai~VI~l~sp~D~~~AtVe~~~~kLs~~Aaa~~~~d~~~a~a~~~~~~~idis-~y 528 (717) T protein:vir:79 450 IGKYDDFAYQLALACAVMSHYNSVTIGIIPTTTPSDISLAGVEEHVKKLENYANEFYMRDRFGNIIFDADRNKIDLG-QF 528 (717) T ss_pred cchhhhHHHHHHHHHHHhhhccccceeeeccccccccchhhHHHHHHHHHhhhhhhhhhcchhcccccccccccccc-ce Confidence 0001122333333332211 111111 111111111111111 111 23 Q ss_pred EEEEeeeeEEEeeccCceeEecHHHHHHHHHhhhhhccceeecccCceeecccccccccchhhhccccccccccccceeE Q lcl|NC_015266. 
180 IMVIWPDWLGWDDITNSTVAIPAPAIAAGLRAKIDNDIGWHKTLSNVVVNGVTGISADVSWDLQDPATDAGYLNENQVTT 259 (390) Q Consensus 180 ~~~~~p~~~~~~~~~~~~~~~p~s~~vAg~~a~~d~~~g~~~span~~l~gv~~~~~~~~~~~~~~~~~~~~l~~~gI~~ 259 (390) ...+.++..+..+..+.....||++++||+ |.++|+|+||+|++|.|+.++.++++..+++ .|+.+||++ T Consensus 529 ~~vv~~~~~iv~~~~~~~~~~p~AG~vAGl----dA~rGVwkSPANk~I~GVvgLa~~lT~sE~d------~Ln~aGInt 598 (717) T protein:vir:79 529 IEVVAGPDFIVRNTRLGQMASTPDASYIGM----VSQLKTQSAPTNKPLPSVTALRYTYSANQLN------RLTKARFAT 598 (717) T ss_pred eeeeecceeEEEcCCCceeecCHHHHHHHH----HhcCCcccccccceecccccCcccCCHHHHH------HHhhCCeEE Confidence 334444444445556667778887777666 4557899999999999999999998876654 456689999 Q ss_pred EEc--CCCEEEEccccCCCCc-ccceeeehhhHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhCCceeee Q lcl|NC_015266. 260 LVN--RNGFRFWGSRTCDADG-KFFFENYTRSAQVIADTIAEEQMGVVDGPLNPSRARDIIENINAWFRREVSVGELIGG 336 (390) Q Consensus 260 ~~~--~~G~~~wG~rT~~~d~-~~~~i~vrR~~~~i~~~l~~~~~~~v~e~n~~~~~~~i~~~i~~~L~~l~~~g~l~g~ 336 (390) |+. ++|+++||+||++.++ .|+||++||++++|+++|++.++|++||||++.+|.+++.+|++||++||++|+|.|| T Consensus 599 Ir~~~GrGirVWGaRTtasd~sdWryInVRRl~D~Ie~sIr~al~~yVgEPNd~~tr~~Ik~sI~afL~~L~r~GAI~Gy 678 (717) T protein:vir:79 599 FKYKQDGSIGVVDAPTSAHAGSDYTRLSTARIVKEAVNAVREVADPFIGEPNDTGNRNALTAAVDKRLSKMIENKALLGF 678 (717) T ss_pred EEEeCCceEEEEeeeecCCCCcccceeehhhhHHHHHHHHHHHHHHhccccCCHHHHHHHHHHHHHHHHHHHhcCceecc Confidence 974 5699999999998765 7999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEecCCCCHHHHhCCeEEEEEEEEecccceEEEEEEEEcc Q lcl|NC_015266. 337 GAWYDPEPNTTDELTSGGTWIDYDYTPVPPLENLKLRQRITD 378 (390) Q Consensus 337 ~v~~d~~~nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~ 378 (390) ++.+ +||++++++|+++++|+++|++|+|||+++++.+. 
T Consensus 679 kvdv---tnT~~di~~G~l~V~I~vaPv~PaEfI~ititITA 717 (717) T protein:vir:79 679 DFRL---VVTPQQELLGEGSIELSLEAPNELRRLTTIVSLSA 717 (717) T ss_pred eeeE---ecChhHhhCCEEEEEEEEEecCcccEEEEEEEEeC Confidence 9865 89999999999999999999999999999999988 No 34 >protein:vir:63742 Length: 562 # NCBI annotation: Tsh # Family: family:all:2449 # MgeID: mge:1517 # MgeName: P100 # Cross-refs: genbank:gi:82547629;genbank:GeneID:3783475 Probab=100.00 E-value=7.2e-40 Score=235.19 Aligned_cols=360 Identities=12% Similarity=0.069 Sum_probs=257.5 Q ss_pred CCCccCCCEEEEECCCCCccccccccccceeeeccccccccceeccceEEEechhHHHHhhccccchHHHHHhhh----c Q lcl|NC_015266. 1 MPQDYHHGVRVIEINEGGRPIRTVSTAVLGIVCTGADADPATFPLDTPVLLTNVIAALGKAGTKGTLRRTLDAIG----K 76 (390) Q Consensus 1 Ma~~~~hGV~v~ev~~~~~~i~~v~tavig~vgta~~~~~~~~~~~~~~~i~~~~~~~~~~~~~gtl~~al~~~~----~ 76 (390) |-...+||||+++.+++++++..+++++++|+|.+..+ |.+.|++++++.++...|+. |.|.+++...+ . T Consensus 8 ~~~~~~Pgv~~~~~~s~~~~~~~~~~~~~~~iG~a~~G-----~~~~~~~~~~~~~~~~~fg~-g~l~~~i~~a~~~~~~ 81 (562) T protein:vir:63 8 RKPVSRPHTEISVDTSGIGGSSSGSEKILCLVGSATGG-----KPNAVYKVRNYSQAKSVFRS-GELLDAIERAWNPGEG 81 (562) T ss_pred CCcccCCceEEEEecCCCcccCCCCCceEEEEEEeCCC-----CCceeEEEccHHHHHHHhcC-CchHHHHHHhcccccc Confidence 44456789999999999999999999999999999877 56889999999999888877 67888887666 6 Q ss_pred ccCceEEEEeecccccccccccc-----------------------------------------c---hhhh-------- Q lcl|NC_015266. 77 QTKPVTVVVRVAEGKDEAETTAN-----------------------------------------V---IGTV-------- 104 (390) Q Consensus 77 ~~~~~~~vv~v~~~~~~~~~~~~-----------------------------------------~---~~~~-------- 104 (390) +++..+++++|........+... 
+ .+.+ T Consensus 82 ~g~~~~~~~rv~~a~~a~~~~~~~~~~a~~~g~~~n~i~v~~~~~~~~~~~~~~v~~~~~~~~ev~~~~g~V~~i~y~g~ 161 (562) T protein:vir:63 82 TGAGDILAMRVEEAKEATFEAEGVKVSSTIYGADANDIQVALEDNTITGTKRLSIVFAKERVNQVYDNLGSIFSIKYKGT 161 (562) T ss_pred CCceEEEEEEcCCCccceeEecceeEEEeecccCCCeEEEEEecCCCCCCcceEEEecCCCcchhhhhccceeeeeeecc Confidence 88888898887442221111000 0 0000 Q ss_pred -------------ccc--------hhh-------------hhhhhhhhh-----------------------------h- Q lcl|NC_015266. 105 -------------TPD--------GKY-------------TGMKALLAA-----------------------------Q- 120 (390) Q Consensus 105 -------------~~~--------~~~-------------tgl~~~~~~-----------------------------~- 120 (390) +.. +.. +...+.... . T Consensus 162 ~~~~~~~v~~~~~~~~a~~l~~~~g~~~v~~~~L~~g~~~~~~~l~~~in~~~~~~aky~~~~gn~i~~~~~d~~~~~~v 241 (562) T protein:vir:63 162 EASATFTVAVDPVTFKATKLTLKAGDKTVKEYDLGSGAYAETNVLISDINNLPDFEAKFFPIGDKNLTTDNFDAQIDVDI 241 (562) T ss_pred cccceEEEEecCcceeEEEEEeecCCcceeEEEecCCccchhHHHHHhhccccceEEEeeccCCceeeeeccccccccch Confidence 000 000 000000000 0 Q ss_pred -hh-----------------h----------hhh---hh---------------------------hhhhhhhcchHHHH Q lcl|NC_015266. 121 -GK-----------------L----------AVK---PR---------------------------ILVAPGLDTQPVAA 142 (390) Q Consensus 121 -~~-----------------~----------~~~---p~---------------------------~~~apg~~~~~v~~ 142 (390) .. . +.. +. ....|..+..++++ T Consensus 242 kt~~~~v~t~~~d~~~~~~~~~~v~~~~~~~~~la~~~~~~LtGG~dGt~~~~~~~al~ale~~~~~~i~~~t~d~av~~ 321 (562) T protein:vir:63 242 KTKEAYVKAVGGDIEKQTAYNGYVDFEFDRSKEIANFPLTKLTGGDNGTIPESWADKFSYFANEGGYYLVPLTSKQAVHA 321 (562) T ss_pred hhhhhhhhhhhhhhhhcccccceeeeeeccccceecccceeeecCCCCCchhhHHHHHHHHHhCCcEEEEecCCCHHHHH Confidence 00 0 000 00 00000111245666 Q ss_pred HHHHhhhhcce-----EEeecccccCchHHHHHHhhhhccceEEEEeeeeEEEeeccCceeEecH---HHHHHHHHhhhh Q lcl|NC_015266. 
143 AFATIAQSLRA-----MVYVAAHGCKTKEEAVAYRKQFGQREIMVIWPDWLGWDDITNSTVAIPA---PAIAAGLRAKID 214 (390) Q Consensus 143 al~~~~~~~~~-----~~~~d~~~~~~~~~a~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~p~---s~~vAg~~a~~d 214 (390) ++.+++.+++. +.+++.+++.+.+++......+++.+.+.++|+....+. .+....+|+ ++++||+++..| T Consensus 322 ~l~a~vkr~~~~g~~~~aVlg~~~~~~~~~~~~~a~~~n~ervv~v~~~~~~~~~-~~~~~~~~~~~~aa~vAGl~A~~~ 400 (562) T protein:vir:63 322 EALQFVRDCSYNGNPMRVFVGGGIGESMEQLFTRAIGLQNERAGLIGFSGTVKMD-DGRSLKMPGYMFAAQVAGLTCGLE 400 (562) T ss_pred HHHHHHHHHHhCCCcEEEEecCCCCCCHHHHHHHhhhcCCCcEEEEecCeeEECC-CCceeeechhHHHHHHHHHhhcCc Confidence 77777766543 666777777888888888889999999999998665443 456666777 889999999988 Q ss_pred hccceeecccCceeecccccccccchhhhccccccccccccceeEEEc--CCCEEEEcc-cc-----CCCCcccceeeeh Q lcl|NC_015266. 215 NDIGWHKTLSNVVVNGVTGISADVSWDLQDPATDAGYLNENQVTTLVN--RNGFRFWGS-RT-----CDADGKFFFENYT 286 (390) Q Consensus 215 ~~~g~~~span~~l~gv~~~~~~~~~~~~~~~~~~~~l~~~gI~~~~~--~~G~~~wG~-rT-----~~~d~~~~~i~vr 286 (390) +++||.|+++. ..++....+..+++ .+..+|+.++.. +++.++|.. ++ ...|+.|++|+++ T Consensus 401 ----~~~SlT~~~i~-~~~v~~~~t~~e~~------~li~~Gv~~l~~~~~~~v~~~~iv~~itT~t~~~~~~~~ki~vi 469 (562) T protein:vir:63 401 ----IGEAITFKNIA-IETLDTIYEGSQLD------QLNESGIITAEFVRNRAVTNFRIVDDVTTFNDKTDPVKSEIGVG 469 (562) T ss_pred ----hhcCccceeec-cccccccCCHHHHH------HHHhCCeEEEEEecCCcEEEEEeeccceecCCCCCchhhhhhhh Confidence 78999999987 56777666655554 445578888854 344556644 33 3457889999999 Q ss_pred hhHHHHHHHHHHHHH-HHhcCCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEecCCCCHHHHhCCeEEEEEEEEecc Q lcl|NC_015266. 287 RSAQVIADTIAEEQM-GVVDGPLNPSRARDIIENINAWFRREVSVGELIGGGAWYDPEPNTTDELTSGGTWIDYDYTPVP 365 (390) Q Consensus 287 R~~~~i~~~l~~~~~-~~v~e~n~~~~~~~i~~~i~~~L~~l~~~g~l~g~~v~~d~~~nt~~~i~~G~~~~~i~~~p~~ 365 (390) |++|+|.+++++.+. +|+++||+...|.+++..+..||.+|++.|+|.||+.. +-+..+.++++++++.++|+. 
T Consensus 470 Rv~D~i~~dir~~~~~~yiGk~Nn~~~r~~v~~~i~~~L~~l~~~gaI~~~~~~-----dv~v~~~~d~~~v~~~v~pv~ 544 (562) T protein:vir:63 470 EANDFLVSELKISLDNEYIGTKIIDTSASLVKNFVQSFLDRKKLAKEIQDYSPE-----EVQVVIEGDVARISLTVFPIR 544 (562) T ss_pred HHHHHHHHHHHHHHHhcCCccccChHHHHHHHHHHHHHHHHHHhCCcccCCCcc-----ceEEEecCCEEEEEEEEEEcc Confidence 999999999988865 89999999999999999999999999999999998532 123345678899999999999 Q ss_pred cceEEEEEEEEcchHHHH Q lcl|NC_015266. 366 PLENLKLRQRITDRYLAD 383 (390) Q Consensus 366 p~e~i~~~~~~~~~~~~~ 383 (390) |+|+|.+++.+.++-++. T Consensus 545 ~mekIy~ti~~~~~~~~~ 562 (562) T protein:vir:63 545 SMKKIEVSLVYRQQILTA 562 (562) T ss_pred cceEEEEEEEEeeeeecC Confidence 999999999999998887 No 35 >protein:vir:80779 Length: 569 # NCBI annotation: putative tail sheath protein # Family: family:all:2449 # MgeID: mge:1885 # MgeName: phiEF24C # Cross-refs: genbank:acc:YP_001504132;genbank:gi:158079319;genbank:GeneID:5666428 Probab=100.00 E-value=9.3e-39 Score=229.08 Aligned_cols=360 Identities=15% Similarity=0.110 Sum_probs=255.7 Q ss_pred CCCc-------cCCCEEEEECCCCCccccccccccceeeeccccccccceeccceEEEechhHHHHhhccccchHHHHHh Q lcl|NC_015266. 1 MPQD-------YHHGVRVIEINEGGRPIRTVSTAVLGIVCTGADADPATFPLDTPVLLTNVIAALGKAGTKGTLRRTLDA 73 (390) Q Consensus 1 Ma~~-------~~hGV~v~ev~~~~~~i~~v~tavig~vgta~~~~~~~~~~~~~~~i~~~~~~~~~~~~~gtl~~al~~ 73 (390) ||-. -+||||+++.+++++++..+++.+.+|+|.++.+ +.++|++++++.++...|+. |.|.+++.. T Consensus 1 ~~~~~~~~~~~~~Pgv~~~~~~~~~~~~~~~~~~~~~~ig~a~~G-----~~~~~~~~~~~~~~~~~f~~-g~l~~a~~~ 74 (569) T protein:vir:80 1 MAVEQFPRKKVSRPHTEITVDTSGIGGSSSSSDKTLMLVGSAKGG-----KPDTVYRFRNYQQAKQVLRS-GDLLDAIEL 74 (569) T ss_pred CeeeeecCCccccCceEEEEecCCCcCCCCCCceeEEEEEEeCCC-----CCceeEEecCHHHHHHHhcC-CchhHHHHh Confidence 7643 3489999999999999999999999999999877 55889999999999888876 668888876 Q ss_pred hh------cccCceEEEEeeccccccccc-----------------------------------------cccc---hhh Q lcl|NC_015266. 
74 IG------KQTKPVTVVVRVAEGKDEAET-----------------------------------------TANV---IGT 103 (390) Q Consensus 74 ~~------~~~~~~~~vv~v~~~~~~~~~-----------------------------------------~~~~---~~~ 103 (390) .| .+++..|+++++........+ ...+ ++. T Consensus 75 a~~~~~~~~~~~~~~~~~rv~~a~~a~~~~~~~~~~a~~~g~~~n~i~v~l~~~~~~~~~~~~v~~~~~~~~~~~~~ig~ 154 (569) T protein:vir:80 75 AWNASDVNTASAGDILAVRVEDAKNATLTKGGLTFASTIYGVDANEIQVALEDNNLTHTKRLTVAFSKDGYKKVFDNLGK 154 (569) T ss_pred hccCccccccCceEEEEEEcCCCeeeeeeccceeeeeeeccCCCceEEEEEecCcCCcceeeEEeeecCCCccccccccc Confidence 65 466777887776331110000 0000 000 Q ss_pred --------------hcc-----chh------h----------------------h------------hhhhh-------- Q lcl|NC_015266. 104 --------------VTP-----DGK------Y----------------------T------------GMKAL-------- 116 (390) Q Consensus 104 --------------~~~-----~~~------~----------------------t------------gl~~~-------- 116 (390) +.. +.. + + +..+- T Consensus 155 v~si~ytg~~~~a~~~~~~~~~~~~a~~l~~~~g~~~~~~~~v~~~~~~~~~~~~~~~lv~~~~~~~~f~a~~~~~~~~~ 234 (569) T protein:vir:80 155 IFSIQYKGSEAQANFTIAQDSISKKATTLTLNVGSEPESTTEVMKYELGQGVYSETNVLVSAINSLPDWEAKFFPIGDKN 234 (569) T ss_pred eeeEEEeeccccceEEeecCcCcceeEEEEEEecCCcceeEEEEeeccCCccchhhhhhhhhcCCccCceEEEEecCCCc Confidence 000 000 0 0 00000 Q ss_pred ------hhh-------------------hh--------hh-----h------------------------------hhhh Q lcl|NC_015266. 117 ------LAA-------------------QG--------KL-----A------------------------------VKPR 128 (390) Q Consensus 117 ------~~~-------------------~~--------~~-----~------------------------------~~p~ 128 (390) ... .. .. + .... 
T Consensus 235 ~~~~~~d~~~~~~~~t~~~~~~~~~~di~~~~~~~~~v~~~~~~~~~l~~~~~~~LtGG~dG~~~~~~~~~l~~le~~~~ 314 (569) T protein:vir:80 235 LPTDALEAVTKVDVKTEAVFVGALAGDIAKQLEYNDYVTVAVDATKPVEDFELTNLTGGSDGTAPESWANKFPLLANEGG 314 (569) T ss_pred ceehhccchhheeccccceeeehhHHHHHHhhcCCceEEEEecCCcceeeecceeecCCCCCCccchHHHHHHHHhhCCc Confidence 000 00 00 0 0000 Q ss_pred hhhhhhhcchHHHHHHHHhhhhcce-----EEeecccccCchHHHHHHhhhhccceEEEEeeeeEEEeeccCceeEecH- Q lcl|NC_015266. 129 ILVAPGLDTQPVAAAFATIAQSLRA-----MVYVAAHGCKTKEEAVAYRKQFGQREIMVIWPDWLGWDDITNSTVAIPA- 202 (390) Q Consensus 129 ~~~apg~~~~~v~~al~~~~~~~~~-----~~~~d~~~~~~~~~a~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~p~- 202 (390) ..+.+....+++++++.++|++++. +++++.+++.+.+++...+..+++.+.++++|+..+.+. ++....+|+ T Consensus 315 ~~i~~~t~d~av~~~l~a~vkr~r~~g~~~~aVvg~~~~~~~~~~~~~a~~~n~e~vv~v~~~~~~~~~-~g~~~~~~~~ 393 (569) T protein:vir:80 315 YYLVPLTDKQAVHSEALAFVKDRTDNGDPMRIIVGGGTNETVEESITRATNLRDPRASLVGFSGTRKMD-DGRLLKLPGY 393 (569) T ss_pred EEEEecCCChHHHHHHHHHHHHHHhCCCcEEEEecCCCCCCHHHHHHHHhhcCCCeEEEEecCceeecC-CCcceeechh Confidence 0000111234677888888887743 667777788889999999999999999999999877653 445556665 Q ss_pred --HHHHHHHHhhhhhccceeecccCceeecccccccccchhhhccccccccccccceeEEEcCC--CEEEEcc-c----- Q lcl|NC_015266. 203 --PAIAAGLRAKIDNDIGWHKTLSNVVVNGVTGISADVSWDLQDPATDAGYLNENQVTTLVNRN--GFRFWGS-R----- 272 (390) Q Consensus 203 --s~~vAg~~a~~d~~~g~~~span~~l~gv~~~~~~~~~~~~~~~~~~~~l~~~gI~~~~~~~--G~~~wG~-r----- 272 (390) ++++||+++..+ +++||.|+.+. +.++...++..+++ .+..+|+.++...+ +.++|.. + T Consensus 394 ~~aa~vAG~~A~~~----~~~S~T~k~i~-~~~i~~~lt~~e~~------~li~~G~~~l~~~~~~~~~v~~~vn~itT~ 462 (569) T protein:vir:80 394 MMASQIAGIASGLE----VGEAITFKHFN-VTSVDRVFESSQLD------MLNESGVISIEFVRNRTLTAFRVVQDVTTY 462 (569) T ss_pred hHHHHHHHHHhcCc----cccCccceeec-cccccccCCHHHHH------HHHhCCeEEEEEecCceEEEEEEeccceec Confidence 778888888776 88999999997 56777776655544 44567888886543 3445533 2 Q ss_pred cCCCCcccceeeehhhHHHHHHHHHHHH-HHHhcCCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEecCCCCHHHHh Q lcl|NC_015266. 
273 TCDADGKFFFENYTRSAQVIADTIAEEQ-MGVVDGPLNPSRARDIIENINAWFRREVSVGELIGGGAWYDPEPNTTDELT 351 (390) Q Consensus 273 T~~~d~~~~~i~vrR~~~~i~~~l~~~~-~~~v~e~n~~~~~~~i~~~i~~~L~~l~~~g~l~g~~v~~d~~~nt~~~i~ 351 (390) |...|+.|++++++|++|+|.+.+++.+ .+|+++||+...|..++..+..||.+||++|+|.||... +-+.++. T Consensus 463 t~~~~~~~~~i~viRv~D~i~~dir~~~~~~yiGk~nn~~~r~~v~~~i~~~L~~l~~~gaI~~~~~~-----dv~v~~~ 537 (569) T protein:vir:80 463 NDKSDPVKNEMSVGEANDFLVSELKIELDNNFIGTKVIDTSASLIKNFIQSFLDNKKRAREIQDYTPE-----EVQVVLE 537 (569) T ss_pred CCCCCchhhhhhhhHHHHHHHHHHHHHHHhhcCcccCChhHHHHHHHHHHHHHHHHHhCCcccCCCcc-----ceEEEec Confidence 2245778999999999999999999876 589999999999999999999999999999999998532 1233456 Q ss_pred CCeEEEEEEEEecccceEEEEEEEEcchHHHH Q lcl|NC_015266. 352 SGGTWIDYDYTPVPPLENLKLRQRITDRYLAD 383 (390) Q Consensus 352 ~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~ 383 (390) ++++++++.++|+.|+|+|.+++.+.++-++. T Consensus 538 ~d~~~v~~~v~Pv~~~ekI~~ti~~~~~~~~~ 569 (569) T protein:vir:80 538 GDVASISMTVMPIRSLNKITVQLVYKQQILTA 569 (569) T ss_pred CCEEEEEEEEEEcccccEEEEEEEEeeeeecC Confidence 78999999999999999999999999998887 No 36 >protein:vir:80488 Length: 562 # NCBI annotation: Tsh # Family: family:all:2449 # MgeID: mge:1883 # MgeName: A511 # Cross-refs: genbank:acc:YP_001468473;genbank:gi:157325048;genbank:GeneID:5601446 Probab=100.00 E-value=2e-38 Score=227.30 Aligned_cols=360 Identities=11% Similarity=0.044 Sum_probs=255.7 Q ss_pred CCCc-------cCCCEEEEECCCCCccccccccccceeeeccccccccceeccceEEEechhHHHHhhccccchHHHHHh Q lcl|NC_015266. 1 MPQD-------YHHGVRVIEINEGGRPIRTVSTAVLGIVCTGADADPATFPLDTPVLLTNVIAALGKAGTKGTLRRTLDA 73 (390) Q Consensus 1 Ma~~-------~~hGV~v~ev~~~~~~i~~v~tavig~vgta~~~~~~~~~~~~~~~i~~~~~~~~~~~~~gtl~~al~~ 73 (390) ||-. .+||||+++.+++.+++..+++++++|+|.+..+ |.++|++++++.++...|+. |+|.+++.. 
T Consensus 1 ~~~~~~~~~~~~~pgv~~~~~~s~~~~~~~~~~~~~~~ig~a~~G-----~~~~~~~~~~~~~~~~~f~~-g~l~~~i~~ 74 (562) T protein:vir:80 1 MAIEIYPRKPVSRPHTEISVDTSGIGGSSSGSEKILCLVGSATGG-----KPNAVYKVRNYSQAKSVFRS-GELLDAIER 74 (562) T ss_pred CeeeeeCCCcccCCceEEEEecCCcccCCCCCCceEEEEEEeCCC-----CcceeEEEccHHHHHHHhcC-CChHHHHHH Confidence 6643 3589999999999999999999999999999877 55889999999999888877 678877776 Q ss_pred hh----cccCceEEEEeeccccccccccc---------------------------------------------cc---- Q lcl|NC_015266. 74 IG----KQTKPVTVVVRVAEGKDEAETTA---------------------------------------------NV---- 100 (390) Q Consensus 74 ~~----~~~~~~~~vv~v~~~~~~~~~~~---------------------------------------------~~---- 100 (390) .+ .+++..++.++|........+.. ++ T Consensus 75 a~~~~~~~g~~~~~~~rv~~a~~a~~~~~~~~~~~~~~g~~~n~i~v~~~~~~~~~~~~~~v~~~~~~~~ev~~~~g~v~ 154 (562) T protein:vir:80 75 AWNPGEGTGAGDILAMRVEEAKEATFEAEGVKVSSTIYGADANDIQVALEDNTITGTKRLSIVFAKERVNQVYDNLGSIF 154 (562) T ss_pred hcccccccCceEEEEEEcCCCCcceEEecceEEEEeecccCCCceEEEEecCCCCCCcceEEEecCCcceEEeeccCcee Confidence 66 58888888888744322111100 00 Q ss_pred -----------------------------hhhhc--------cchhhhhhhhhhhh------------------------ Q lcl|NC_015266. 101 -----------------------------IGTVT--------PDGKYTGMKALLAA------------------------ 119 (390) Q Consensus 101 -----------------------------~~~~~--------~~~~~tgl~~~~~~------------------------ 119 (390) .++.. ..+..+...+.... T Consensus 155 ~i~y~g~~~~a~~~i~~~~~~~~a~~l~~~~g~~~v~~~~l~~g~~~~~~~l~~~i~~~~~~tAky~g~~~n~i~~~~~d 234 (562) T protein:vir:80 155 SIKYKGTEASATFTVAVDPVTFKATKLTLKAGDKTVKEYDLGSGAYAETNVLISDINNLPDFEAKFFPIGDKNLTTDNFD 234 (562) T ss_pred eeeeccccccceeEEEecCccceEEEEEEecCCcceeEEEeCCCccchhhhhhhhhccccceEEEecccCCceeeecccc Confidence 00000 00000000000000 Q ss_pred -----h----------------hhh---h-----hh-hh-------hh---------------------------hhhhh Q lcl|NC_015266. 
120 -----Q----------------GKL---A-----VK-PR-------IL---------------------------VAPGL 135 (390) Q Consensus 120 -----~----------------~~~---~-----~~-p~-------~~---------------------------~apg~ 135 (390) . ... . .. .. .. ..+.. T Consensus 235 ~~~~~~~kt~~~~v~~~~~d~~~~~~~n~~v~~~~~~~~~la~~~~~~LtGG~dG~~~~~~~dal~~Le~~~~~~i~~~t 314 (562) T protein:vir:80 235 AQIDVDIKTKEAYVKAVGGDIEKQTAYNGYVEFEFDRSKEIANFPLTKLTGGDNGTIPESWADKFSYFANEGGYYLVPLT 314 (562) T ss_pred cchhhhcccceeeeeehhhhhhhcccccceEEEEeccCccccccceeeeeCCCCCCccccHHHHHHHHHhCCcEEEEecC Confidence 0 000 0 00 00 00 00000 Q ss_pred cchHHHHHHHHhhhhcc-----eEEeecccccCchHHHHHHhhhhccceEEEEeeeeEEEeeccCceeEecH---HHHHH Q lcl|NC_015266. 136 DTQPVAAAFATIAQSLR-----AMVYVAAHGCKTKEEAVAYRKQFGQREIMVIWPDWLGWDDITNSTVAIPA---PAIAA 207 (390) Q Consensus 136 ~~~~v~~al~~~~~~~~-----~~~~~d~~~~~~~~~a~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~p~---s~~vA 207 (390) ..+++++.+.+++.+++ .+.++..+++.+.+++......+++.+.+.+.|+..+.+. .+....+|+ ++++| T Consensus 315 ~d~ai~~~~~a~vkr~r~~g~~~~aVvg~~~~~~~~~~~~~a~~~n~e~vv~v~~~~~~~~~-~~~~~~~~~~~~aa~vA 393 (562) T protein:vir:80 315 SKQAVHAEALQFVRDCSYNGNPMRVFVGGGIGESMEQLFTRAIGLQNERAGLIGFSGTVKMD-DGRSLKMPGYMFAAQVA 393 (562) T ss_pred CChHHHHHHHHHHHHHHhCCCeEEEEecCCCCCCHHHHHHHhhhcCCCeEEEEecCeeEECC-CCceeeechhHHHHHHH Confidence 12456667777776653 3566777778888999998999999999999998765543 455566666 88999 Q ss_pred HHHhhhhhccceeecccCceeecccccccccchhhhccccccccccccceeEEEcC--CCEEEE-cccc---C--CCCcc Q lcl|NC_015266. 208 GLRAKIDNDIGWHKTLSNVVVNGVTGISADVSWDLQDPATDAGYLNENQVTTLVNR--NGFRFW-GSRT---C--DADGK 279 (390) Q Consensus 208 g~~a~~d~~~g~~~span~~l~gv~~~~~~~~~~~~~~~~~~~~l~~~gI~~~~~~--~G~~~w-G~rT---~--~~d~~ 279 (390) |+++..+ +++||.|+++.+ .++...++..++ +.+..+|+.++... ++.++| +-++ . ..|+. 
T Consensus 394 Gl~Ag~~----~~~S~T~~~i~~-~~v~~~lt~~e~------~~li~~G~l~l~~~~~~~v~~~riv~~itT~t~~~~~~ 462 (562) T protein:vir:80 394 GLTCGLE----IGEAITFKNIAI-ETLDTIYEGSQL------DQLNESGIITAEFVRNRAVTNFRIVDDVTTFNDKTDPV 462 (562) T ss_pred HHHhcCc----cccCccceeecc-ccccccCCHHHH------HHHHhCCeEEEEEecCCcEEEEEeeccceeccCCCCch Confidence 9999988 778999999985 466666555444 45556788888653 334455 2222 2 34789 Q ss_pred cceeeehhhHHHHHHHHHHHH-HHHhcCCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEecCCCCHHHHhCCeEEEE Q lcl|NC_015266. 280 FFFENYTRSAQVIADTIAEEQ-MGVVDGPLNPSRARDIIENINAWFRREVSVGELIGGGAWYDPEPNTTDELTSGGTWID 358 (390) Q Consensus 280 ~~~i~vrR~~~~i~~~l~~~~-~~~v~e~n~~~~~~~i~~~i~~~L~~l~~~g~l~g~~v~~d~~~nt~~~i~~G~~~~~ 358 (390) |++|+++|++|+|.+.+++.+ .+|+++||+...|..++..+..||.+|++.|+|.+|..+ +-+.+..+++++++ T Consensus 463 ~~ki~viRv~D~i~~dir~~~~~~yIGk~Nn~~~r~~v~~~i~~~L~~l~~~gaI~~~~~~-----dv~v~~~~d~~~v~ 537 (562) T protein:vir:80 463 KSEIGVGEANDFLVSELKISLDNEYIGTKIIDTSASLVKNFVQSFLDRKKLAKEIQDYSPE-----EVQVVIEGDIARIS 537 (562) T ss_pred hhhhhhhHHHHHHHHHHHHHHHhcCCccccChHHHHHHHHHHHHHHHHHHhCCcccCCCcc-----ceEEEecCCEEEEE Confidence 999999999999999999887 589999999999999999999999999999999998642 12234567889999 Q ss_pred EEEEecccceEEEEEEEEcchHHHH Q lcl|NC_015266. 359 YDYTPVPPLENLKLRQRITDRYLAD 383 (390) Q Consensus 359 i~~~p~~p~e~i~~~~~~~~~~~~~ 383 (390) +.++|+.|+|+|.+++.+.++-++. T Consensus 538 ~~v~Pv~~mekIy~ti~~~~~~~~~ 562 (562) T protein:vir:80 538 LTVFPIRSMKKIEVSLVYRQQILTA 562 (562) T ss_pred EEEEEcccceEEEEEEEEEeeeecC Confidence 9999999999999999999998887 No 37 >protein:vir:103168 Length: 641 # NCBI annotation: gp122 # Family: family:all:661 # MgeID: mge:1583 # MgeName: Syn9 # Cross-refs: genbank:acc:YP_717789;genbank:gi:113200626;genbank:GeneID:4239177 Probab=100.00 E-value=2.5e-36 Score=215.81 Aligned_cols=267 Identities=13% Similarity=0.093 Sum_probs=182.5 Q ss_pred CCCccCCCEEEEECCCCCccccccccccceeeeccccccccceeccceEEEechhHHHHhhc---cccchHHHHHhhhcc Q lcl|NC_015266. 
1 MPQDYHHGVRVIEINEGGRPIRTVSTAVLGIVCTGADADPATFPLDTPVLLTNVIAALGKAG---TKGTLRRTLDAIGKQ 77 (390) Q Consensus 1 Ma~~~~hGV~v~ev~~~~~~i~~v~tavig~vgta~~~~~~~~~~~~~~~i~~~~~~~~~~~---~~gtl~~al~~~~~~ 77 (390) |+++..|||||+|++.+ ++|..+.|++.+|+|.++.+ |+++|++++++.++...|+ +...+.+++..+|.| T Consensus 3 m~~~~sPGVyv~E~~~~-~~i~~v~tsvaafvG~~~~G-----P~~~p~~v~s~~d~~~~FG~~~~~~~l~~av~~fF~n 76 (641) T protein:vir:10 3 VSNQLSPGVVIQERDLT-AVTTPIGLNVGVLAAPFTKG-----PVEEIFEVSTERDLASVFGEPNDYNYEYWFTASQFLS 76 (641) T ss_pred CccccCCceEEEEecCC-CcccccCCccceEEecccCC-----CCCccEEecCHHHHHHHcCCcCCCcchHHHHHHHHHh Confidence 88888899999999875 68999999999999998754 7899999999999888776 456799999999999 Q ss_pred cCceEEEEeecccccccc-----------------------------------------------cccc----------- Q lcl|NC_015266. 78 TKPVTVVVRVAEGKDEAE-----------------------------------------------TTAN----------- 99 (390) Q Consensus 78 ~~~~~~vv~v~~~~~~~~-----------------------------------------------~~~~----------- 99 (390) +|..|+++++........ ...+ T Consensus 77 gG~~~~vvRv~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~A~~pG~~gn~l~v~i~~~~~~~~~~~~~~~~~ 156 (641) T protein:vir:10 77 YGGVLKAIRLNAASLKNSVDSGTAPLIKNLQEYETTYESSNSNTFKFASRDAGALGNSVGIFITDAGPDQIAVLPAPGTG 156 (641) T ss_pred cCCEEEEEEecCccccccccccchhhccccccccccccCcCccccEEEeccCCCcCCceEEEEEcCCCcceeeeeccccc Confidence 999999998842110000 0000 Q ss_pred ----------------------------------------------------------------------chhh------ Q lcl|NC_015266. 
100 ----------------------------------------------------------------------VIGT------ 103 (390) Q Consensus 100 ----------------------------------------------------------------------~~~~------ 103 (390) ..++ T Consensus 157 ~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~~~~~~~~~~a~~~~~~i~~~~~g~~g~~~ 236 (641) T protein:vir:10 157 NEWEFVADEAVTAASGASAKVYRYSIRLTLTNVVGTFVPGGATTISISGSDESVDVLAWDAGNKYLEIALPAGGVTGIFA 236 (641) T ss_pred ccceeccceeeeeccCcccccccccccccccceeeecccCCcceeEecccccccccccccCCcceeeeeecCCcceeeee Confidence 0000 Q ss_pred ----hc-cc--------hhh-------------------------------------------hhh-----------hh- Q lcl|NC_015266. 104 ----VT-PD--------GKY-------------------------------------------TGM-----------KA- 115 (390) Q Consensus 104 ----~~-~~--------~~~-------------------------------------------tgl-----------~~- 115 (390) .+ .+ +.. +|+ .. T Consensus 237 ~~~~~t~gt~~~t~a~~g~~~~~~~~~~~~~ia~aat~ag~~g~~~~v~~~~v~d~~~a~~~~~g~~~~~va~~~gts~~ 316 (641) T protein:vir:10 237 DAQVVTQGTNTAAIASSGIERRLYIGKDSGSINFAATDAVVDTNATSATISSVRNEYAEREYLPGSKWVNVAARPGTSLY 316 (641) T ss_pred eeeeccCCccceeeecccchhhhhhccccccceeeeecccccccceeeEeeeeeeeecccccccccccccccccchhhhh Confidence 00 00 000 000 00 Q ss_pred ----------------------------hhhhhhhhh------------------------------------------- Q lcl|NC_015266. 116 ----------------------------LLAAQGKLA------------------------------------------- 124 (390) Q Consensus 116 ----------------------------~~~~~~~~~------------------------------------------- 124 (390) ..+.+.... T Consensus 317 a~~~g~~~D~~~~lv~d~~~~~~g~~g~v~e~~~~~s~~~~~~~~~~~~~~~~~~~~~~s~~v~~~~~~~~~~~~~~~~~ 396 (641) T protein:vir:10 317 ANSVGGVNDELHVLVIDVDGKITGNPGSVLERFIGVSKASDAKTSIGEVNYYKEVIKQQSAYVYWGSHETAPFLGTAANA 396 (641) T ss_pred hhhcCCcccceEEEEEeecceeeccccceeeeeecccccCCcccccccceeeeeeeccccceEEEecccccccccccccc Confidence 000000000 Q ss_pred --------------------------------------------h---------------------------------hh Q lcl|NC_015266. 
125 --------------------------------------------V---------------------------------KP 127 (390) Q Consensus 125 --------------------------------------------~---------------------------------~p 127 (390) + .. T Consensus 397 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~gG~d~~~~~~~~~~~~~~~~tg~~~~~~~e~~~i 476 (641) T protein:vir:10 397 AAGDWGASALNRRYNLLRSTAGTTSFPAGAVTVGSKNNATHYYRLANGADYSASGALYNLSNVDIATAYELIEDPESQVI 476 (641) T ss_pred cccccccccccccccccccccccccccccccccCCCCcceeEEEeecCcccccccccccccchhHHHHHHHhhhhhhhcc Confidence 0 00 Q ss_pred hhhhh-hh----hcchHHHHHHHHhhhhcc-eEEeecccccC---------chHHHHHHhhh-hccceEEEEeeeeEEEe Q lcl|NC_015266. 128 RILVA-PG----LDTQPVAAAFATIAQSLR-AMVYVAAHGCK---------TKEEAVAYRKQ-FGQREIMVIWPDWLGWD 191 (390) Q Consensus 128 ~~~~a-pg----~~~~~v~~al~~~~~~~~-~~~~~d~~~~~---------~~~~a~~~~~~-~~~~~~~~~~p~~~~~~ 191 (390) .++++ |+ ....+++.++.++|++++ ++.++|.|... ..+.+.+|++. .+|+|+++||||++++| T Consensus 477 ~~l~~~~~~~~~~~~~~v~~~~i~~ce~~~d~~ailD~p~~~~~~~~~~~~~~~~~~~~~~~~~~s~yaa~y~P~~~v~d 556 (641) T protein:vir:10 477 DYVLSGPAGADEAAAIAKATTITTIVESRKDCMAFLSPLRSDVIGVSNTTTVTENLVNYFNQLPSSNYVVFDSGYKYIYD 556 (641) T ss_pred ceeeecCCCCCcchhHHHHHHHHHHHHhcCCEEEEEcCCcccccCCCchhhHHHHHHHHHhhcCCCceEEEEeceeEeec Confidence 00000 00 011347788999999998 56667765432 24667788865 47889999999999999 Q ss_pred eccCceeEecHHHHHHHHHhhhhhccceeecccCc---eeecccccccccchhhhccccccccccccceeEEEcCCCEEE Q lcl|NC_015266. 192 DITNSTVAIPAPAIAAGLRAKIDNDIGWHKTLSNV---VVNGVTGISADVSWDLQDPATDAGYLNENQVTTLVNRNGFRF 268 (390) Q Consensus 192 ~~~~~~~~~p~s~~vAg~~a~~d~~~g~~~span~---~l~gv~~~~~~~~~~~~~~~~~~~~l~~~gI~~~~~~~G~~~ 268 (390) +.+++.+++|||+++||+|||+|.++||||||||. .|.|+++++.+++..+++. 
||.+|||+|+.+.|.=+ T Consensus 557 p~~~~~~~vPpsG~iAGv~ArtD~~rGvwkAPAn~~~~~i~gv~~l~~~~~~~e~~~------Lnp~gIN~ir~fpg~G~ 630 (641) T protein:vir:10 557 KYNDVYRYVPCNGDIAGLILETGLEEEPWFSPAGFQRGVLRNAVKLAYSPNKTQRDR------LYANRINPVVSFPGHAM 630 (641) T ss_pred ccCCceeEecCCHHHHHHHHhhhccCCceECcCCcccceeeeeeeeeEecChhHHhh------hhhcccceEEecCCcee Confidence 99999999999999999999999999999999998 5899999999988777655 45689999987755433 Q ss_pred EccccCCCCccc Q lcl|NC_015266. 269 WGSRTCDADGKF 280 (390) Q Consensus 269 wG~rT~~~d~~~ 280 (390) -+|.-. ...+. T Consensus 631 v~~~~~-~~~~~ 641 (641) T protein:vir:10 631 INNNIA-FHTKL 641 (641) T ss_pred ecceee-eeecC Confidence 333221 11122 No 38 >protein:vir:95741 Length: 587 # NCBI annotation: ORF011 # Family: family:all:2449 # MgeID: mge:1577 # MgeName: G1 # Cross-refs: genbank:acc:YP_240910;genbank:gi:66394960;genbank:GeneID:5132682 Probab=100.00 E-value=1e-34 Score=206.91 Aligned_cols=360 Identities=13% Similarity=0.089 Sum_probs=254.2 Q ss_pred CCCc-------cCCCEEEEECCCCCccccccccccceeeeccccccccceeccceEEEechhHHHHhhccccchHHHHHh Q lcl|NC_015266. 1 MPQD-------YHHGVRVIEINEGGRPIRTVSTAVLGIVCTGADADPATFPLDTPVLLTNVIAALGKAGTKGTLRRTLDA 73 (390) Q Consensus 1 Ma~~-------~~hGV~v~ev~~~~~~i~~v~tavig~vgta~~~~~~~~~~~~~~~i~~~~~~~~~~~~~gtl~~al~~ 73 (390) ||-. .+||||+++.+++.++...+++.+++|+|.+..+ +.++|++++++.++...|++ |.|.++++. T Consensus 1 ~a~~~~~~~~~~~pgv~~~~~~~~~~~~~~~~~~~~~~~g~a~~G-----~~~~~~~~~~~~~~~~~~~~-g~l~~~~~~ 74 (587) T protein:vir:95 1 MAVEPFPRRPITRPHASIEVDTSGIGGSAGSSEKVFCLIGQAEGG-----EPNTVYELRNYSQAKRLFRS-GELLDAIEL 74 (587) T ss_pred CcccccCCcccccCceEEEEecCCccccCCCCCceEEEEEEeCCC-----CCceeEEeccHHHHHHHhcC-cchHHHHHH Confidence 7743 3589999999999999999999999999999877 55788999999999888876 668788776 Q ss_pred hh----cccCceEEEEeeccccccccccccc--------------------------------------------hh--- Q lcl|NC_015266. 
74 IG----KQTKPVTVVVRVAEGKDEAETTANV--------------------------------------------IG--- 102 (390) Q Consensus 74 ~~----~~~~~~~~vv~v~~~~~~~~~~~~~--------------------------------------------~~--- 102 (390) .| .+++..++.+++........+..++ ++ T Consensus 75 a~~~~~~~g~~~~~~~rv~~~~~a~~~~~~l~~~a~~~G~~gN~i~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~v~ 154 (587) T protein:vir:95 75 AWGSNPNYTAGRILAMRIEDAKPASAEIGGLKITSKIYGNVANNIQVGLEKNTLSDSLRLRVIFQDDRFNEVYDNIGNIF 154 (587) T ss_pred HhccccCCCceEEEEEEcCCCceeEEEecCeEEEEecccccccceEEEEecCCCCCceeEEEEEecccceeeeeecccee Confidence 66 5788888888764432111000000 00 Q ss_pred -----hh------c----c-c----------h--------hhhh-----hhhhhhhh----------------------h Q lcl|NC_015266. 103 -----TV------T----P-D----------G--------KYTG-----MKALLAAQ----------------------G 121 (390) Q Consensus 103 -----~~------~----~-~----------~--------~~tg-----l~~~~~~~----------------------~ 121 (390) +. + . + + ...| ..+..+.. + T Consensus 155 si~y~g~~~~~~~~v~~~~~t~~a~~~~l~~g~~~v~~yrL~~g~~~~~~~~~~~in~~~~~tAky~g~~~~~i~~~~~~ 234 (587) T protein:vir:95 155 TIKYKGEEANATFSVEHDEETQKASRLVLKVGDQEVKSYDLTGGAYDYTNAIITDINQLPDFEAKLSPFGDKNLESSKLD 234 (587) T ss_pred eeeeeccccccceeeeecccceeeeeeeeecCCceEEEEEecCCchHHHHHHHHhhccccceEEEEecccCceeEEeecC Confidence 00 0 0 0 0 0000 00000000 0 Q ss_pred -------------------------------------------------------------------------hhh-hhh Q lcl|NC_015266. 
122 -------------------------------------------------------------------------KLA-VKP 127 (390) Q Consensus 122 -------------------------------------------------------------------------~~~-~~p 127 (390) ..+ ..+ T Consensus 235 ~~~~~~v~~~~~~v~a~~~d~~~~~~~~~~v~~~~~~g~~~~~~~~~~~~~~~~a~~~~~~~~~~~a~~~~t~LtGG~dG 314 (587) T protein:vir:95 235 KIENANIKDKAVYVKAVFGDLEKQTAYNGIVSFEQLNAEGEVPSNVEVEAGEESATVTATSPIKTIEPFELTKLKGGTNG 314 (587) T ss_pred cccccceehhhhhhhhhhcceeeeeeceeeeeeecccccceeccchhhhhcccchheeccccccceeccceeeeecCCCC Confidence 000 000 Q ss_pred --h---------------hhhhhhhcchHHHHHHHHhhhhcc-----eEEeecccccCchHHHHHHhhhhccceEEEEee Q lcl|NC_015266. 128 --R---------------ILVAPGLDTQPVAAAFATIAQSLR-----AMVYVAAHGCKTKEEAVAYRKQFGQREIMVIWP 185 (390) Q Consensus 128 --~---------------~~~apg~~~~~v~~al~~~~~~~~-----~~~~~d~~~~~~~~~a~~~~~~~~~~~~~~~~p 185 (390) . ..+.+..+.+++++++.+++++++ .++++..+++.+.+++...+..+++.+.+++++ T Consensus 315 ~~~~~y~~~l~ale~~~~~~i~~~t~d~~v~a~l~a~vk~~~~~g~~~~aVvg~~~~~~~~~~~~~a~~~n~ervi~v~~ 394 (587) T protein:vir:95 315 EPPATWADKLDKFAHEGGYYIVPLSSKQSVHAEVASFVKERSDAGEPMRAIVGGGFNESKEQLFGRQESLSNPRVSLVAN 394 (587) T ss_pred CCcccHHHHHHHHHhCCcEEEEecCCCHHHHHHHHHHHHHHHhCCCcEEEEEcCCCCCCHHHHHHHHhhcCCCcEEEecc Confidence 0 000111123467777888877663 356667777788899999999999999999988 Q ss_pred eeEEEeeccCceeEecH---HHHHHHHHhhhhhccceeecccCceeecccccccccchhhhccccccccccccceeEEEc Q lcl|NC_015266. 186 DWLGWDDITNSTVAIPA---PAIAAGLRAKIDNDIGWHKTLSNVVVNGVTGISADVSWDLQDPATDAGYLNENQVTTLVN 262 (390) Q Consensus 186 ~~~~~~~~~~~~~~~p~---s~~vAg~~a~~d~~~g~~~span~~l~gv~~~~~~~~~~~~~~~~~~~~l~~~gI~~~~~ 262 (390) +..+. ..++....+|| ++++||+++..| +.+||.|+++. ..++....+..+++ .+..+|+.++.. 
T Consensus 395 ~~~~~-~~dg~~~~~~~~~~aa~vAGl~Ag~~----~~~SlT~~~i~-~~~v~~~~t~~e~e------~ai~~Gvl~l~~ 462 (587) T protein:vir:95 395 SGTFV-MDDGRKNHVPAYMVAVALGGLASGLE----IGESITFKPLR-VSSLDQIYESIDLD------ELNENGIISIEF 462 (587) T ss_pred cceEe-cCCCceeeechHHHHHHHHHHHhcCc----hhcCccceeee-cccccccCCHHHHH------HHHhCCeEEEEE Confidence 75543 23566677887 789999999988 67899999987 45666666555544 445578877743 Q ss_pred --CCC---EE-EEccccC--CCCcccceeeehhhHHHHHHHHHHHH-HHHhcCCCCHHHHHHHHHHHHHHHHHHHhCCce Q lcl|NC_015266. 263 --RNG---FR-FWGSRTC--DADGKFFFENYTRSAQVIADTIAEEQ-MGVVDGPLNPSRARDIIENINAWFRREVSVGEL 333 (390) Q Consensus 263 --~~G---~~-~wG~rT~--~~d~~~~~i~vrR~~~~i~~~l~~~~-~~~v~e~n~~~~~~~i~~~i~~~L~~l~~~g~l 333 (390) +++ ++ +.|-.|. ..|+.|++++++|++|+|.+.+++.+ .+|++|||+...|..++..+..||.+|++.|+| T Consensus 463 ~~~~~~~~vriv~~itT~t~~~d~~~~~i~viRv~D~i~~dir~~~~~~~iGk~nn~~~r~~v~~~i~~~L~~l~~~gaI 542 (587) T protein:vir:95 463 VRNRTNTFFRIVDDVTTFNDKSDPVKAEMAVGEANDFLVSELKVQLEDQFIGTRTINTSASIIKDFIQSYLGRKKRDNEI 542 (587) T ss_pred ecCCcceEEEEeecceeccCCCCcchhhhhhhhhHHHHHHHHHHHHHhhCCccccchHHHHHHHHHHHHHHHHHHhCCcc Confidence 222 33 2454554 45778999999999999999999887 589999999999999999999999999999999 Q ss_pred eeeEEEEecCCCCHHHHhCCeEEEEEEEEecccceEEEEEEEEcchHHHH Q lcl|NC_015266. 334 IGGGAWYDPEPNTTDELTSGGTWIDYDYTPVPPLENLKLRQRITDRYLAD 383 (390) Q Consensus 334 ~g~~v~~d~~~nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~ 383 (390) .+|..+ +...++...++++++.+.|+.|+|+|.+++.+.++-++. 
T Consensus 543 ~~~~~~-----dv~v~~~~d~~~v~~~v~Pv~~mekI~vt~~~~~~~~~~ 587 (587) T protein:vir:95 543 QDFPAE-----DVQVIVEGNEARISMTVYPIRSFKKISVSLVYKQQTLQA 587 (587) T ss_pred cCCCcc-----ceEEEecCCEEEEEEEEEEcccceEEEEEEEEeeeeecC Confidence 998552 222334567899999999999999999999999988876 No 39 >protein:vir:99306 Length: 587 # NCBI annotation: putative major tail sheath protein # Family: family:all:2449 # MgeID: mge:1655 # MgeName: K # Cross-refs: genbank:acc:YP_024479;genbank:gi:48696438;genbank:GeneID:2948034 Probab=100.00 E-value=4e-34 Score=203.67 Aligned_cols=360 Identities=13% Similarity=0.083 Sum_probs=253.4 Q ss_pred CCCc-------cCCCEEEEECCCCCccccccccccceeeeccccccccceeccceEEEechhHHHHhhccccchHHHHHh Q lcl|NC_015266. 1 MPQD-------YHHGVRVIEINEGGRPIRTVSTAVLGIVCTGADADPATFPLDTPVLLTNVIAALGKAGTKGTLRRTLDA 73 (390) Q Consensus 1 Ma~~-------~~hGV~v~ev~~~~~~i~~v~tavig~vgta~~~~~~~~~~~~~~~i~~~~~~~~~~~~~gtl~~al~~ 73 (390) ||-. .+||||+++.+++.++...+++.+++|+|.+..+ +.++|.+++++.++...|++ |.|.++++. T Consensus 1 ~a~~~~~~~~~~~pgv~~~~~~~~~~~~~~~~~~~~~~~g~a~~G-----~~~~~~~~~~~~~~~~~~~~-g~l~~~~~~ 74 (587) T protein:vir:99 1 MAVEPFPRRPITRPHASIEVDTSGIGGSAGSSEKVFCLIGQAEGG-----EPNTVYELRNYSQAKRLFRS-GELLDAIEL 74 (587) T ss_pred CcccccCCcccccCceEEEEecCCccccCCCCCceEEEEEEecCC-----ccceeEEeccHHHHHHHhcC-cchHHHHHH Confidence 7743 3589999999999999999999999999999877 45788999999999888876 668888877 Q ss_pred hh----cccCceEEEEeeccccccccccccc--------------------------------------------hh--- Q lcl|NC_015266. 
74 IG----KQTKPVTVVVRVAEGKDEAETTANV--------------------------------------------IG--- 102 (390) Q Consensus 74 ~~----~~~~~~~~vv~v~~~~~~~~~~~~~--------------------------------------------~~--- 102 (390) .| .+++..++++++........+..++ ++ T Consensus 75 a~~~~~~~g~~~~~~~rv~~~~~a~~~~~~l~~~a~~~G~~gN~i~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~v~ 154 (587) T protein:vir:99 75 AWGSNPNYTAGRILAMRIEDAKPASAEIGGLKITSKIYGNVANNIQVGLEKNTLSDSLRLRVIFQDDRFNEVYDNIGNIF 154 (587) T ss_pred HhccccCCCceEEEEEEcCCCceeEEEecCeEEEEeeccccccceEEEEccCCCCcceeEEEEEecccceeeeeecccee Confidence 66 5788888888774432211000000 00 Q ss_pred -----hh------c----c-c----------h--------hhhh-----hhhhhhhhh---------------------- Q lcl|NC_015266. 103 -----TV------T----P-D----------G--------KYTG-----MKALLAAQG---------------------- 121 (390) Q Consensus 103 -----~~------~----~-~----------~--------~~tg-----l~~~~~~~~---------------------- 121 (390) +. + . + + ...| ..+...... T Consensus 155 ~i~y~g~~~~a~~~v~~~~~t~~a~~~~l~~g~~~v~~yrL~~g~~~~~~~~~~~i~~~~~~tAky~~~~~~~i~~~~~~ 234 (587) T protein:vir:99 155 TIKYKGEEANATFSVEHDEETQKASRLVLKVGDQEVKSYDLTGGAYDYTNAIITDINQLPDFEAKLSPFGDKNLESSKLD 234 (587) T ss_pred eEEeecccccceeeEeecCcceeeeeeeeecCCceeEEEEecCCchHHHHHHHhhhccccceeEEeeccCCceeEeeccc Confidence 00 0 0 0 0 0000 000000000 Q ss_pred -------------------------------------------------------------------------hhh-hhh Q lcl|NC_015266. 
122 -------------------------------------------------------------------------KLA-VKP 127 (390) Q Consensus 122 -------------------------------------------------------------------------~~~-~~p 127 (390) ..+ ..+ T Consensus 235 ~~~~~~v~~~~~~v~a~~~D~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~t~LtGG~dG 314 (587) T protein:vir:99 235 KIENANIKDKAVYVKAVFGDLEKQTAYNGIVSFEQLNAEGEVPSNVEVEAGEESATVTATSPIKTIEPFELTKLKGGTNG 314 (587) T ss_pred ccccceeeeeeeeeehhccceeeecccceeeeeeecccccchhhhhhhhhccccceeeeeccccceecccceeeecCCCC Confidence 000 000 Q ss_pred --h---------------hhhhhhhcchHHHHHHHHhhhhcc-----eEEeecccccCchHHHHHHhhhhccceEEEEee Q lcl|NC_015266. 128 --R---------------ILVAPGLDTQPVAAAFATIAQSLR-----AMVYVAAHGCKTKEEAVAYRKQFGQREIMVIWP 185 (390) Q Consensus 128 --~---------------~~~apg~~~~~v~~al~~~~~~~~-----~~~~~d~~~~~~~~~a~~~~~~~~~~~~~~~~p 185 (390) . ..+.+....+.+++++.+++++++ .++++..+++.+.+++...+..+++.+.+.+++ T Consensus 315 ~~~~sy~~al~ale~~~~~~i~~~t~d~~i~a~l~a~vk~~r~~g~~~~aVlg~~~~~~~~~~~~~a~~~n~e~vi~v~~ 394 (587) T protein:vir:99 315 EPPATWADKLDKFAHEGGYYIVPLSSKQSVHAEVASFVKERSDAGEPMRAIVGGGFNESKEQLFGRQASLSNPRVSLVAN 394 (587) T ss_pred CccccHHHHHHHHhhCCcEEEEecCCCHHHHHHHHHHHHHHHhCCCcEEEEecCCCCCCHHHHHHHhhhcCCCcEEEEec Confidence 0 000111123456777788877653 356666677788899999999999999999988 Q ss_pred eeEEEeeccCceeEecH---HHHHHHHHhhhhhccceeecccCceeecccccccccchhhhccccccccccccceeEEEc Q lcl|NC_015266. 186 DWLGWDDITNSTVAIPA---PAIAAGLRAKIDNDIGWHKTLSNVVVNGVTGISADVSWDLQDPATDAGYLNENQVTTLVN 262 (390) Q Consensus 186 ~~~~~~~~~~~~~~~p~---s~~vAg~~a~~d~~~g~~~span~~l~gv~~~~~~~~~~~~~~~~~~~~l~~~gI~~~~~ 262 (390) +..+. ..++....+|+ ++++||+++..| +.+||.|+++. ..++....+..+++ .+..+|+.++.. 
T Consensus 395 ~~~~~-~~dg~~~~~~~~~~aa~vAGl~Ag~~----~~~SlT~~~i~-~~~v~~~~t~~e~e------~li~~Gvl~l~~ 462 (587) T protein:vir:99 395 SGTFV-MDDGRKNHVPAYMVAVALGGLASGLE----IGESITFKPLR-VSSLDQIYESIDLD------ELNENGIISIEF 462 (587) T ss_pred cceEe-cCCCceeeechHHHHHHHHHHHhcCc----hhcCccceeee-cccccccCCHHHHH------HHHhCCeEEEEE Confidence 75543 23456677787 789999999887 77899999987 55776666555544 445578887743 Q ss_pred --CC---CEEE-EccccC--CCCcccceeeehhhHHHHHHHHHHHH-HHHhcCCCCHHHHHHHHHHHHHHHHHHHhCCce Q lcl|NC_015266. 263 --RN---GFRF-WGSRTC--DADGKFFFENYTRSAQVIADTIAEEQ-MGVVDGPLNPSRARDIIENINAWFRREVSVGEL 333 (390) Q Consensus 263 --~~---G~~~-wG~rT~--~~d~~~~~i~vrR~~~~i~~~l~~~~-~~~v~e~n~~~~~~~i~~~i~~~L~~l~~~g~l 333 (390) ++ ++++ .|-.|. ..|+.|++++++|++|+|.+.+++.+ .+|+++||+...|..++..+..||.+|++.|+| T Consensus 463 ~~~~~~~~vriv~~ItT~t~~~~~~~~~i~viRv~D~i~~di~~~~~~~yiGk~Nn~~~r~~i~~~i~~~L~~l~~~gaI 542 (587) T protein:vir:99 463 VRNRTNTFFRIVDDVTTFNDKSDPVKAEMAVGEANDFLVSELKVQLEDQFIGTRTINTSASIIKDFIQSYLGRKKRDNEI 542 (587) T ss_pred ecCCcceEEEEeeceeeccCCCCchhhhhhhhhhHHHHHHHHHHHHHhhCCccccchHHHHHHHHHHHHHHHHHHhCCcc Confidence 22 2432 444454 45778999999999999999999887 589999999999999999999999999999999 Q ss_pred eeeEEEEecCCCCHHHHhCCeEEEEEEEEecccceEEEEEEEEcchHHHH Q lcl|NC_015266. 334 IGGGAWYDPEPNTTDELTSGGTWIDYDYTPVPPLENLKLRQRITDRYLAD 383 (390) Q Consensus 334 ~g~~v~~d~~~nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~ 383 (390) .+|..+ ...-+....++++++.+.|+.|+|+|.+++.+.++-++. 
T Consensus 543 ~~~~~~-----dv~v~~~~d~~~v~~~v~Pv~~mekIy~tv~~~~~~~~~ 587 (587) T protein:vir:99 543 QDFPAE-----DVQVIVEGNEARISMTVYPIRSFKKISVSLVYKQQTLQA 587 (587) T ss_pred cCCCcc-----ceEEEecCCEEEEEEEEEEcccceEEEEEEEEEeeeecC Confidence 998652 112233556899999999999999999999999988876 No 40 >protein:vir:96586 Length: 587 # NCBI annotation: ORF011 # Family: family:all:2449 # MgeID: mge:1623 # MgeName: Twort # Cross-refs: genbank:acc:YP_238553;genbank:gi:66391266;genbank:GeneID:5130368 Probab=100.00 E-value=8.7e-34 Score=201.84 Aligned_cols=360 Identities=14% Similarity=0.082 Sum_probs=250.2 Q ss_pred CCCc-------cCCCEEEEECCCCCccccccccccceeeeccccccccceeccceEEEechhHHHHhhccccchHHHHHh Q lcl|NC_015266. 1 MPQD-------YHHGVRVIEINEGGRPIRTVSTAVLGIVCTGADADPATFPLDTPVLLTNVIAALGKAGTKGTLRRTLDA 73 (390) Q Consensus 1 Ma~~-------~~hGV~v~ev~~~~~~i~~v~tavig~vgta~~~~~~~~~~~~~~~i~~~~~~~~~~~~~gtl~~al~~ 73 (390) ||-. .+||||+++.+++..+....++.+++++|.+... |.++|++++++.++...|+. |.|.+++.. T Consensus 1 ~~~~~~~~~~~~~Pgv~~~~~~~~~~~~~~~~~~~~~~iG~a~~g-----~~~~~~~~~~~~~~~~~~g~-G~l~~ai~~ 74 (587) T protein:vir:96 1 MAKDIFPRRPIQRPHASIEVDSSGIGGSASNSEKILCLIGKAEGG-----EPNTVYQVRNYAQAKSVFRS-GELLDAIEL 74 (587) T ss_pred CeeeeeCCCcccCCceEEEEecCCccCCCCCCCceEEEEEEecCC-----CCceeEEEcChHHHHHhhcC-CcHHHHHHH Confidence 6643 4689999999999999999999999999999877 45788999999999888876 568888876 Q ss_pred hh----cccCceEEEEeeccccccccccc--------------------------------------c---c---hh--- Q lcl|NC_015266. 74 IG----KQTKPVTVVVRVAEGKDEAETTA--------------------------------------N---V---IG--- 102 (390) Q Consensus 74 ~~----~~~~~~~~vv~v~~~~~~~~~~~--------------------------------------~---~---~~--- 102 (390) .+ .+++..++.+++.+......+.. 
+ + .+ T Consensus 75 a~~~~~~~g~~~~~a~rv~~~~~a~~~~~~~~~~~~~~g~~~n~i~v~~~~~~~~~~~~~~~~~~~~~~~~~~~n~G~v~ 154 (587) T protein:vir:96 75 AWGSNPQYTAGKILAMRVEDAKASQLEKGGLRVTSKIFGSVSNDIQVALEKNTITDSLRLRVVFQKDNYQEVFDNLGNIF 154 (587) T ss_pred HhccCcCCCceEEEEEecCCCccceeecccccccccccCCCCceEEEEEEeccCCCccceEEEEecCCceeeccccCceE Confidence 66 57778888777643222100000 0 0 00 Q ss_pred -----hhc---------c--c---------hh--------------hhhhhhhh---------hhhhh-hh--------- Q lcl|NC_015266. 103 -----TVT---------P--D---------GK--------------YTGMKALL---------AAQGK-LA--------- 124 (390) Q Consensus 103 -----~~~---------~--~---------~~--------------~tgl~~~~---------~~~~~-~~--------- 124 (390) +.. . + +. .+...... .-++. .+ T Consensus 155 ~i~y~g~~~~a~~~~~~~~~~~~A~~l~l~gg~~~v~~yrl~~g~~~~~~~~~~~~~~~~~~tAky~g~~~n~~~v~v~d 234 (587) T protein:vir:96 155 SINYKGEGEKATFSVEKDKETQEAKRLVLKVDEKEVKAYELNGGAYSFTNEIITDINELPDFEAKLSPFGDKNLESRKLD 234 (587) T ss_pred EEEecccccceeEeeccCcccceeeeeEEEecCceEEEEEeCCCchhhhhhhhhhhccccceEEEeecccCceeEEEeec Confidence 000 0 0 00 00000000 00000 00 Q ss_pred ------------hhh----h--------------------------------------h-------------hh------ Q lcl|NC_015266. 125 ------------VKP----R--------------------------------------I-------------LV------ 131 (390) Q Consensus 125 ------------~~p----~--------------------------------------~-------------~~------ 131 (390) +.. . . .. T Consensus 235 ~~~~~~~k~~~~y~~t~~~di~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~aLtGG~dG 314 (587) T protein:vir:96 235 EATDVDIKGKAVYVKAVFGDIENQTQYNQYVKFEQLPEQASEPSDVEVHAETESATVTATSKPKAIEPFELTKLSGGTNG 314 (587) T ss_pred cccccccceEEEeehhhhhhhhhhhccccceeeccccchhhhhhcccccccccceeeeecccccccccccceeeecCCCC Confidence 000 0 0 00 Q ss_pred ---------------------hhhhcchHHHHHHHHhhhhcc-----eEEeecccccCchHHHHHHhhhhccceEEEEee Q lcl|NC_015266. 
132 ---------------------APGLDTQPVAAAFATIAQSLR-----AMVYVAAHGCKTKEEAVAYRKQFGQREIMVIWP 185 (390) Q Consensus 132 ---------------------apg~~~~~v~~al~~~~~~~~-----~~~~~d~~~~~~~~~a~~~~~~~~~~~~~~~~p 185 (390) .+....+++++.+.+++.+++ .++++..+++.+.+++...+..+++.+.+++++ T Consensus 315 ~~~~~y~~~l~ale~~~~~~i~~~t~d~ai~~~l~a~vk~~r~~gk~~~aVlg~~~~~~~~~~~~~a~~~n~e~vi~v~~ 394 (587) T protein:vir:96 315 EPPTSWSAKLEKFKNEGGYYIVPLTDRQSVHSEVATFVKNRSDAGEPMRAIVGGGTSETKEKLFGRQAILNNPRVALVAN 394 (587) T ss_pred CCcccHHHHHHHHhhCCcEEEEecCCCHHHHHHHHHHHHHHHhCCCeEEEEecCCCCCCHHHHHHHHhhcCCCcEEEEec Confidence 001112456677777776653 355666667778888888899999999999988 Q ss_pred eeEEEeeccCceeEecH---HHHHHHHHhhhhhccceeecccCceeecccccccccchhhhccccccccccccceeEEEc Q lcl|NC_015266. 186 DWLGWDDITNSTVAIPA---PAIAAGLRAKIDNDIGWHKTLSNVVVNGVTGISADVSWDLQDPATDAGYLNENQVTTLVN 262 (390) Q Consensus 186 ~~~~~~~~~~~~~~~p~---s~~vAg~~a~~d~~~g~~~span~~l~gv~~~~~~~~~~~~~~~~~~~~l~~~gI~~~~~ 262 (390) +..+.+. .+....+|+ ++++||+++..+ +.+||.|+++.+ .++....+..+++ .+..+|+.++.. T Consensus 395 ~~~~~~~-~~~~~~~~~~~~aa~vAG~~Ag~~----~~~S~T~~~~~~-~~v~~~~t~~e~~------~~i~~G~~~l~~ 462 (587) T protein:vir:96 395 SGKFVMG-NGRILQAPAYMVASAVAGLVSGLD----IGESITFKPLFV-NSLDKVYESEELD------ELNENGIITIEF 462 (587) T ss_pred ceEEecC-CCceeeechhhHHHHHHHHHhcCc----cccCccceeeec-ccccccCCHHHHH------HHHhCCeEEEEE Confidence 8776554 344445553 688999999877 778999999985 5676666555544 445678888754 Q ss_pred --CCCEEEEcc-ccC-----CCCcccceeeehhhHHHHHHHHHHHH-HHHhcCCCCHHHHHHHHHHHHHHHHHHHhCCce Q lcl|NC_015266. 263 --RNGFRFWGS-RTC-----DADGKFFFENYTRSAQVIADTIAEEQ-MGVVDGPLNPSRARDIIENINAWFRREVSVGEL 333 (390) Q Consensus 263 --~~G~~~wG~-rT~-----~~d~~~~~i~vrR~~~~i~~~l~~~~-~~~v~e~n~~~~~~~i~~~i~~~L~~l~~~g~l 333 (390) +++.++|.. 
+++ ..++.|++|+++|++|+|.+.+++.+ .+|++|||+...|..++..+..||.+|++.|+| T Consensus 463 ~~~~~~~v~~~vnsitT~t~~~~~~~~~i~virv~D~i~~di~~~~~~~yiGk~nn~~~r~~v~~~i~~~L~~l~~~g~I 542 (587) T protein:vir:96 463 VRNRMTTMFRIVDDVTTFPDKNDPVKSEMALGEANDFLVSELKILLEEQYIGTRTINTSASQIKDFVQSYLGRKKRDNEI 542 (587) T ss_pred ecCCcEEEEEeeccceecCCCCCchhhhhhhHHHHHHHHHHHHHHHHhcCCccccCHHHHHHHHHHHHHHHHHHHhCCcc Confidence 344556633 433 33667999999999999999999987 589999999999999999999999999999999 Q ss_pred eeeEEEEecCCCCHHHHhCCeEEEEEEEEecccceEEEEEEEEcchHHHH Q lcl|NC_015266. 334 IGGGAWYDPEPNTTDELTSGGTWIDYDYTPVPPLENLKLRQRITDRYLAD 383 (390) Q Consensus 334 ~g~~v~~d~~~nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~ 383 (390) .+|+.+ +..-++...++++++.++|+.|+|+|.+++.+.++-++. T Consensus 543 ~~~~~~-----dv~v~~~~D~~~v~~~v~Pv~~mekIy~tv~~~~~~~~~ 587 (587) T protein:vir:96 543 QDFPPE-----DVQVIIEGNEARISLTIFPIRALKKISVSLVYRQQTLQA 587 (587) T ss_pred cCCCcc-----ceEEEecCCEEEEEEEEEEcccceEEEEEEEEEeeeecC Confidence 998542 122234556899999999999999999999999888776 No 41 >protein:vir:107310 Length: 581 # NCBI annotation: gp123 # Family: family:all:1196 # MgeID: mge:1550 # MgeName: Catera # Cross-refs: genbank:acc:YP_656131;genbank:gi:109393333;genbank:GeneID:4156791 Probab=100.00 E-value=8.6e-34 Score=201.87 Aligned_cols=360 Identities=14% Similarity=0.117 Sum_probs=218.2 Q ss_pred CCC---ccC----CCEEEEECCCCCcccccccc-ccceeeeccccccccceecc---------ceEEEechhHH---HHh Q lcl|NC_015266. 1 MPQ---DYH----HGVRVIEINEGGRPIRTVST-AVLGIVCTGADADPATFPLD---------TPVLLTNVIAA---LGK 60 (390) Q Consensus 1 Ma~---~~~----hGV~v~ev~~~~~~i~~v~t-avig~vgta~~~~~~~~~~~---------~~~~i~~~~~~---~~~ 60 (390) |+. .+. -|+...+.+.....-...+. +++..++.........+.+. ..+.....++. ... 
T Consensus 177 ~~~~~~~~~~gsd~~~~~~~~~~~~~~~~~~D~~t~~~~~~g~~~~~~~v~~~~~~~~d~~~~~~v~~~~~~~~~~~~~~ 256 (581) T protein:vir:10 177 NPNSGQVYVLGTDYVVTRVNAGEDGEANTRDDLYTIQRVVDGGHIDPGDIVQLSYRYTDPNYHEVIRFTDPDDIQDFYGP 256 (581) T ss_pred ccccCcceeccccceeeecccCccccccccccceeeeeeecccccccceEEEEEEEeecCCcceeEEeecCcchhhhhhh Confidence 221 110 13333333322221111110 11111111100001111111 12222222221 101 Q ss_pred h-----ccccchHHHHHhhhcccCceEEEEeeccccccccccccchhhhccchhhhhhhhhhhhhhhhhhhhhhhhhhhh Q lcl|NC_015266. 61 A-----GTKGTLRRTLDAIGKQTKPVTVVVRVAEGKDEAETTANVIGTVTPDGKYTGMKALLAAQGKLAVKPRILVAPGL 135 (390) Q Consensus 61 ~-----~~~gtl~~al~~~~~~~~~~~~vv~v~~~~~~~~~~~~~~~~~~~~~~~tgl~~~~~~~~~~~~~p~~~~apg~ 135 (390) + +..+.+....+..+.++....+...+.... . .++......+|.++...+... +..|+. T Consensus 257 ~~~~~g~~~~~~t~~~~~~~tn~~~~~l~~gvd~~g-~---------tvt~~dy~~Al~ale~~~~~~------ivv~~t 320 (581) T protein:vir:10 257 AFDEAGNVQSEITLCAQLAITNGASTILACAVDPEG-D---------TVTMGDYQNALNKFRDEDEIA------IIVAGT 320 (581) T ss_pred hhhccCccccchhhhheeeeecccceeEEeeccCCC-C---------ccchHHHHHHHHHHhcCCceE------EEEeCC Confidence 0 122234444444444555444443322110 0 122223445565555443222 235566 Q ss_pred cchHHHHHHHHhhhhc-------ceEEeeccc-ccCchHHHHHHhhhhccceEEEEeeeeEEEeeccC-ceeEecH---H Q lcl|NC_015266. 136 DTQPVAAAFATIAQSL-------RAMVYVAAH-GCKTKEEAVAYRKQFGQREIMVIWPDWLGWDDITN-STVAIPA---P 203 (390) Q Consensus 136 ~~~~v~~al~~~~~~~-------~~~~~~d~~-~~~~~~~a~~~~~~~~~~~~~~~~p~~~~~~~~~~-~~~~~p~---s 203 (390) ...++++++.++++++ +++..+... ...+.+.++.....+++.|..+++|+.++.+...+ ....+|+ . 
T Consensus 321 ~~~~v~a~l~ahv~~~s~~~~~~ravigV~g~~~~~~~~~~~~~a~~~n~~Rvvlv~p~~~~~~g~~~~~~v~lp~y~~A 400 (581) T protein:vir:10 321 GAQPIQALVQQHVSAQSNNKYERRAILGMDGSVTPVPSATRIANAQSIKDQRVALISPSSFVYYAPELNREVVLGGQFMA 400 (581) T ss_pred CCHHHHHHHHHHHHHHHhccCCcEEEEEecCCCCCccHHHHHHhhccCCCceEEEEecCceeecCcccCceeccchhhHH Confidence 6777888888877664 233333323 33455566777788999999999999888766543 3444555 4 Q ss_pred HHHHHHHhhhhhccceeecccCceeecccccccccchhhhccccccccccccceeEEEc--CCCEEE-EccccCCCCccc Q lcl|NC_015266. 204 AIAAGLRAKIDNDIGWHKTLSNVVVNGVTGISADVSWDLQDPATDAGYLNENQVTTLVN--RNGFRF-WGSRTCDADGKF 280 (390) Q Consensus 204 ~~vAg~~a~~d~~~g~~~span~~l~gv~~~~~~~~~~~~~~~~~~~~l~~~gI~~~~~--~~G~~~-wG~rT~~~d~~~ 280 (390) +++||+++..| +..||.|+++.|+.++....+..+++ .++.+|+.++.. ++|+++ ||-.|+..|+.| T Consensus 401 A~vAGl~a~~~----~~~slT~~~i~gi~~l~~~~s~~e~e------~ll~~Gv~~l~~~~~~~v~Iv~gItT~~s~~~~ 470 (581) T protein:vir:10 401 AAVAGKSVSAI----AAMPLTRKVIRGFSGPAEVQRDGEKS------RESSEGLMVIEKTPRNLVHVRHGVTTDPTSLHT 470 (581) T ss_pred HHHHHHhhccc----cccCcccccccccccccccCCHHHHH------HHHhCCeEEEEEecCCeEEEEeeeecCCCCCcc Confidence 55556665554 78899999999999888777766555 445679988853 456765 777888899999 Q ss_pred ceeeehhhHHHHHHHHHHHHH--HHhcCCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEecCCCCHHHHhCCeEEEE Q lcl|NC_015266. 281 FFENYTRSAQVIADTIAEEQM--GVVDGPLNPSRARDIIENINAWFRREVSVGELIGGGAWYDPEPNTTDELTSGGTWID 358 (390) Q Consensus 281 ~~i~vrR~~~~i~~~l~~~~~--~~v~e~n~~~~~~~i~~~i~~~L~~l~~~g~l~g~~v~~d~~~nt~~~i~~G~~~~~ 358 (390) ++|++||++|++.+.+++.++ +|++|||+..+|.+|+..+..||..||+.|+|.||+.. 
..++.+.+.+.++++ T Consensus 471 ~~i~~iR~~D~v~~~ir~~~~~~~fIG~~n~~~~r~~ik~~i~~~L~~l~~~g~I~~~~~~----~~~~~~~~~d~v~V~ 546 (581) T protein:vir:10 471 REWNIIGQQDVMVYRIRDYLDADGLIGMPIYDTTIVQVKASAEAALVWLVDNNIIRGYRNL----KARQIERQPDVIEVR 546 (581) T ss_pred eeeeeehhhhHHHHHHHHHhhhhcCCCcccCHHHHHHHHHHHHHHHHHHHhcCcccCCccc----eeeeeecCCCEEEEE Confidence 999999999999999999985 58889999999999999999999999999999998643 234556788999999 Q ss_pred EEEEecccceEEEEEEEEcchH--HHHHHHHhcC Q lcl|NC_015266. 359 YDYTPVPPLENLKLRQRITDRY--LADFASRVSA 390 (390) Q Consensus 359 i~~~p~~p~e~i~~~~~~~~~~--~~~l~~~~~a 390 (390) +.++|++|+|||.+++++.++. +..-++-... T Consensus 547 i~v~Pv~~i~~I~vti~~~p~~~~~~~~~~~~~~ 580 (581) T protein:vir:10 547 YEWRPAYPLNYIVVRYSIAPETGDITSTIEGTTS 580 (581) T ss_pred EEEEecccceEEEEEEEEecCCCceEEEEecccc Confidence 9999999999999999999874 1111111111 No 42 >protein:vir:102819 Length: 648 # NCBI annotation: tail sheath protein # Family: family:all:2449 # MgeID: mge:1610 # MgeName: YS40 # Cross-refs: genbank:acc:YP_874082;genbank:gi:118197689;genbank:GeneID:4496011 Probab=100.00 E-value=1.1e-32 Score=195.73 Aligned_cols=357 Identities=13% Similarity=0.134 Sum_probs=227.0 Q ss_pred CCC-c-cC------CCEEEEECCCCCccccccccccceeeeccccccccceeccceEEEechhHHHHhhccccchHHHHH Q lcl|NC_015266. 1 MPQ-D-YH------HGVRVIEINEGGRPIRTVSTAVLGIVCTGADADPATFPLDTPVLLTNVIAALGKAGTKGTLRRTLD 72 (390) Q Consensus 1 Ma~-~-~~------hGV~v~ev~~~~~~i~~v~tavig~vgta~~~~~~~~~~~~~~~i~~~~~~~~~~~~~gtl~~al~ 72 (390) ||- . |. 
||||++|++++.+++..+.|++.+|+|.+..+ |.++|++++++.++...|++ +.|.++++ T Consensus 1 ma~~~yf~~~~~~~PGVyvee~~sg~~~i~gv~tsva~fvG~a~~G-----p~~~p~~v~s~~~~~~~fgg-g~l~~av~ 74 (648) T protein:vir:10 1 MAISVYFDGKLIKQLGAYVKTDLSAVKQINGVGTGIVALLGLAEGG-----ETYKPYRLTSFAEAVSIFKG-GPLLEHIK 74 (648) T ss_pred CeeeeeeCCCCccCCceEEEEeccccccccCCCCceEEEEEeeCCC-----CCceeEEecCHHHHHHHhcC-ccHHHHHH Confidence 775 3 23 89999999999999999999999999999866 67899999999999988875 68999999 Q ss_pred hhhcccCceEEEEeecccccccccc---------------------------------------------ccchhhh--- Q lcl|NC_015266. 73 AIGKQTKPVTVVVRVAEGKDEAETT---------------------------------------------ANVIGTV--- 104 (390) Q Consensus 73 ~~~~~~~~~~~vv~v~~~~~~~~~~---------------------------------------------~~~~~~~--- 104 (390) .+|.+++..++++++........+. .+++... T Consensus 75 ~~F~nGg~~~~~vRv~~~~~a~~~~~~~~~~a~~~g~~gn~i~~~v~~~~~~~~~~~~l~v~~~~~~~~~d~~v~~i~~~ 154 (648) T protein:vir:10 75 AAFIGGAGEVVAVRIGNPTTASVSIPVAQNTSDTSPANLNFVSYEASTRSNQIYVSFDLDENFTSANEADDTIIFTIYQK 154 (648) T ss_pred HHHhCCCcEEEEEEcCCCcccceecceeEEeecccCCCCCceEEEEEEcCCCcCceeEEEEEecCCCcccceeEEEeccC Confidence 9999999999999975432211000 0000000 Q ss_pred -------------cc--ch-----h---------hhhhh------------------hhhh------h------------ Q lcl|NC_015266. 105 -------------TP--DG-----K---------YTGMK------------------ALLA------A------------ 119 (390) Q Consensus 105 -------------~~--~~-----~---------~tgl~------------------~~~~------~------------ 119 (390) +. +. . ...+. .... . T Consensus 155 ~~~y~gt~~~~t~~v~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~s~~~~~d~~ 234 (648) T protein:vir:10 155 HPDFSVTRETFTFPRKFTTPTVLVKRGSTLFFVDRSIVNAALAAGPAFQTALINLLKEQLQPTDVVQIFDASDTNPVDIP 234 (648) T ss_pred CCcccccceeccccccccccccccccccceeecCccchhhhhccCccchhhhhhchhhhhhhhhhheecccccccccccc Confidence 00 00 0 00000 0000 0 Q ss_pred ----------------------hhhhh----------------------------------------------------h Q lcl|NC_015266. 
120 ----------------------QGKLA----------------------------------------------------V 125 (390) Q Consensus 120 ----------------------~~~~~----------------------------------------------------~ 125 (390) ....+ . T Consensus 235 ~~~~~~~a~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~~tp~~~~~~~~~~~~~~~~~~~~~v~~~~~~~l~~~~~ 314 (648) T protein:vir:10 235 LGLFVYEVLYGGLFGFTKSRLVKTSFGTVDDLLSNPLLFNLSATPFFDGSDYQDYTSLSDPANWFAKDAYTINHLVDTTI 314 (648) T ss_pred cccccccccchhhhcCCcchhhhhhhccccccccccceecccccccccccceeeeeccccccceeeeeccchhhcccccc Confidence 00000 0 Q ss_pred hhhhhh-------------hh-------------hh--------------------------------cchHHHHHHHHh Q lcl|NC_015266. 126 KPRILV-------------AP-------------GL--------------------------------DTQPVAAAFATI 147 (390) Q Consensus 126 ~p~~~~-------------ap-------------g~--------------------------------~~~~v~~al~~~ 147 (390) .|.+.. .| .| ..+++++.+.++ T Consensus 315 ~p~~~~~~~t~L~GGtdG~~p~s~~~~~~~~~~~d~~d~l~~~~~~~~~~ivp~~~~~~~~~~~~~lt~~q~i~a~a~sh 394 (648) T protein:vir:10 315 NPHILATRIFSLSGGTNGDDGTGYYQTAVSNYINIWSQGLATLEEEEVNFVIPAYKFTNVTQLNDRLTIFKGIASTFLSH 394 (648) T ss_pred cCcccccccceecccccCCCcccccccccccchhhHHHHhhhccCCCceEEEeecccccccccccccCCccchHHHHHHH Confidence 000000 00 00 113455555555 Q ss_pred hhhcc----------eEEeecccccCchHHH--HHHhhhhccceEEE---------EeeeeEEEeeccCceeEecH---H Q lcl|NC_015266. 148 AQSLR----------AMVYVAAHGCKTKEEA--VAYRKQFGQREIMV---------IWPDWLGWDDITNSTVAIPA---P 203 (390) Q Consensus 148 ~~~~~----------~~~~~d~~~~~~~~~a--~~~~~~~~~~~~~~---------~~p~~~~~~~~~~~~~~~p~---s 203 (390) +..+. .+.++..++..+..+. ...+..++..++.. -.|+.-.....++....+|| . 
T Consensus 395 v~~~s~~~~~~~r~~~~~~vg~~~~es~~~se~~~~~~~~~~~~a~~~~~d~~~~~~~~~~~~~~~~~G~~~~~p~~~~A 474 (648) T protein:vir:10 395 VQTMSQVNRRKARVGVFGLPAPSPNESVTASEYLYNRNILNTISAMFGGTDRAQAVVFPFYSNVFNDEGKVELLGGEFFA 474 (648) T ss_pred HHHhhhccccccccCeEEEeCCCCchhHHHHHHHhhhhcccccceeeeecCCceEEeecccceeECCCCcEEecchhhHH Confidence 54331 2333343444443322 22223333322211 12222222233566677888 7 Q ss_pred HHHHHHHhhhhhccceeecccCceeecc-cccccccchhhhccccccccccccceeEEEcC--C----CEEEEccccC-- Q lcl|NC_015266. 204 AIAAGLRAKIDNDIGWHKTLSNVVVNGV-TGISADVSWDLQDPATDAGYLNENQVTTLVNR--N----GFRFWGSRTC-- 274 (390) Q Consensus 204 ~~vAg~~a~~d~~~g~~~span~~l~gv-~~~~~~~~~~~~~~~~~~~~l~~~gI~~~~~~--~----G~~~wG~rT~-- 274 (390) +++||++++.. ++.||.|+++.++ +.+..+.+..+++ .|+++||.++... + ++++--+-|. T Consensus 475 a~VAGl~a~l~----~~~s~T~k~i~~~~id~~~~~t~~qld------~L~~~Gv~~ie~~~~~~~~~~~rvv~gITT~~ 544 (648) T protein:vir:10 475 SYVAGMHANRE----PQDSITFLPISGIGAEPLYNWTYTQKD------DLISNRVLFVEKVKTSFGGIVYRIHHNPTTWL 544 (648) T ss_pred HHHHhhhhccc----cccCcccceeeccccccccCCCHHHHH------HHhcCCcEEEEEecCCcceeeEEEeccceeec Confidence 78899988755 8899999999855 2332355555554 4455677776431 1 2333222222 Q ss_pred -CCCcccceeeehhhHHHHHHHHHH-HHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhCCceeeeE---EEEecCCCCHHH Q lcl|NC_015266. 275 -DADGKFFFENYTRSAQVIADTIAE-EQMGVVDGPLNPSRARDIIENINAWFRREVSVGELIGGG---AWYDPEPNTTDE 349 (390) Q Consensus 275 -~~d~~~~~i~vrR~~~~i~~~l~~-~~~~~v~e~n~~~~~~~i~~~i~~~L~~l~~~g~l~g~~---v~~d~~~nt~~~ 349 (390) +.++.|+.|+++|+.|++.+.+++ ...+|+++||+...|.++++.|.+||.++++.++|.+|. +.+++ T Consensus 545 ~~~~~~~~eisv~ri~D~l~~~vr~~l~~~fIG~~n~~~~~~~ik~~i~~~L~~~~~~~~I~~y~~~~v~~~~------- 617 (648) T protein:vir:10 545 GPVTQGFQEFVLRRIDDFLQSYVYKNLQEQFIGRKSYGRKTENDIKVYTEALLSNLVGKQIVAYKDVKVTSNE------- 617 (648) T ss_pred CCCCcceeeeeeeehhhHHHHHHHHHHhhhcCcccccHHHHHHHHHHHHHHHhhHhhcCcccCcccceEEEEe------- Confidence 357889999999999999999987 455899999999999999999999999999999999974 45432 Q ss_pred HhCCeEEEEEEEEecccceEEEEEEEEcchHH Q lcl|NC_015266. 
350 LTSGGTWIDYDYTPVPPLENLKLRQRITDRYL 381 (390) Q Consensus 350 i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~ 381 (390) +++++++++.+.|++|++||.++++++.+.- T Consensus 618 -~~~vv~V~~~v~Pv~~i~~I~vti~it~~~~ 648 (648) T protein:vir:10 618 -DKTVYYVEFFYQPVTEIKFILVTMKVTFDLE 648 (648) T ss_pred -cCCEEEEEEEEEecceeeEEEEEEEEEeccC Confidence 4599999999999999999999888877644 No 43 >protein:vir:7653 Length: 581 # NCBI annotation: gp124 # Family: family:all:1196 # MgeID: mge:148 # MgeName: Bxz1 # Cross-refs: genbank:acc:NP_818197;genbank:gi:29566631;genbank:GeneID:1259809 Probab=100.00 E-value=1.4e-33 Score=200.68 Aligned_cols=364 Identities=13% Similarity=0.090 Sum_probs=216.0 Q ss_pred CCCcc-CCCEE-----EEECCCC-----------------Ccccccccc---ccceeeeccccccccce---------ec Q lcl|NC_015266. 1 MPQDY-HHGVR-----VIEINEG-----------------GRPIRTVST---AVLGIVCTGADADPATF---------PL 45 (390) Q Consensus 1 Ma~~~-~hGV~-----v~ev~~~-----------------~~~i~~v~t---avig~vgta~~~~~~~~---------~~ 45 (390) |.-.+ ..|++ .+..+++ ...-...+. .+...++-........+ .. T Consensus 159 ~~~~l~~~g~~~~~~~~~~s~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~g~~~~~~~i~~~~~~~~D~~~ 238 (581) T protein:vir:76 159 MNRALAKKGIKTDTIRVVNPNSGQVYVLGTDYVVTRVNAGEDGEANTRDDLYTIQRVVDGGHIDPGDIVQLSYRYTDPNY 238 (581) T ss_pred cCceeeeccccccccceeecCCcceeeecccccceeeccCcccceeeeeeeeeeEeecccccccceeEEEEEEEeecCCc Confidence 11000 01222 0001111 000000000 01111110000000000 01 Q ss_pred cceEEEechhHHHHhh--------ccccchHHHHHhhhcccCceEEEEeeccccccccccccchhhhccchhhhhhhhhh Q lcl|NC_015266. 46 DTPVLLTNVIAALGKA--------GTKGTLRRTLDAIGKQTKPVTVVVRVAEGKDEAETTANVIGTVTPDGKYTGMKALL 117 (390) Q Consensus 46 ~~~~~i~~~~~~~~~~--------~~~gtl~~al~~~~~~~~~~~~vv~v~~~~~~~~~~~~~~~~~~~~~~~tgl~~~~ 117 (390) ...+.....++....+ +..+.+.......+.++....+...+.... . .++.....++|.++. 
T Consensus 239 ~~~v~~~~~~~~~~~~~~~~~~~g~~~~e~~~~~~~~~t~~~~~~l~~gvd~~g-~---------tvt~~dy~~aL~ale 308 (581) T protein:vir:76 239 HEVIRFTDPDDIQDFYGPAFDEAGNVQSEITLCAQLAITNGASTILACAVDPEG-D---------TVTMGDYQNALNKFR 308 (581) T ss_pred cceEEEecccccccceeeehhhcCccccchhhhhheeeccccceEEEeeecCCC-C---------ccchHHHHHHHHHHh Confidence 1222222222211110 111233333333444444444433332110 0 122233445565555 Q ss_pred hhhhhhhhhhhhhhhhhhcchHHHHHHHHhhhhc-------ceEEeecccc-cCchHHHHHHhhhhccceEEEEeeeeEE Q lcl|NC_015266. 118 AAQGKLAVKPRILVAPGLDTQPVAAAFATIAQSL-------RAMVYVAAHG-CKTKEEAVAYRKQFGQREIMVIWPDWLG 189 (390) Q Consensus 118 ~~~~~~~~~p~~~~apg~~~~~v~~al~~~~~~~-------~~~~~~d~~~-~~~~~~a~~~~~~~~~~~~~~~~p~~~~ 189 (390) ..+... +..|+....++++.+.++++++ +++..+...+ ..+.+.++.....+++.|..+++|+.++ T Consensus 309 ~~~~~~------ivvp~t~~~~i~a~l~ahv~~~s~~~~~~ra~igv~g~~~~~~~~~~~~~a~~~ns~Rvvlv~p~~~~ 382 (581) T protein:vir:76 309 DEDEIA------IIVAGTGAQPIQALVQQHVSAQSNNKYERRAILGMDGSVTPVPSATRIANAQSIKDQRVALISPSSFV 382 (581) T ss_pred cCCeEE------EEEecCCChHHHHHHHHHHHHHHhccCCceEEEEeeCCCCCchHHHHHHhhcccCCCcEEEEEcCceE Confidence 443222 2345666667777777776554 2333333223 3455666777788999999999999888 Q ss_pred EeeccCceeEecHHHHHHHHHhhhhhccceeecccCceeecccccccccchhhhccccccccccccceeEEEc--CCCEE Q lcl|NC_015266. 190 WDDITNSTVAIPAPAIAAGLRAKIDNDIGWHKTLSNVVVNGVTGISADVSWDLQDPATDAGYLNENQVTTLVN--RNGFR 267 (390) Q Consensus 190 ~~~~~~~~~~~p~s~~vAg~~a~~d~~~g~~~span~~l~gv~~~~~~~~~~~~~~~~~~~~l~~~gI~~~~~--~~G~~ 267 (390) .+...+......|...+|+.+|....+..+.+||.|+++.|+.++....+..+++ .++.+|+.++.. 
+++++ T Consensus 383 ~~g~~~~~~~~lp~~~~AA~vAG~~a~~~~~~slT~~~i~g~~~~~~~~s~~e~e------~ll~~Gv~~l~~~~~~~v~ 456 (581) T protein:vir:76 383 YYAPELNREVVLGGQFMAAAVAGKSVSAIAAMPLTRKVIRGFSGPAEVQRDGEKS------RESSEGLMVIEKTPRNLVH 456 (581) T ss_pred eccccCCcceecchhhhhhhHHhhhhccccccCcccccccccccccccCCHHHHH------HHHhCCeEEEEEecCCeEE Confidence 7765544444444545555556666666789999999999999888777665554 445679988853 45676 Q ss_pred -EEccccCCCCcccceeeehhhHHHHHHHHHHHHH--HHhcCCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEecCC Q lcl|NC_015266. 268 -FWGSRTCDADGKFFFENYTRSAQVIADTIAEEQM--GVVDGPLNPSRARDIIENINAWFRREVSVGELIGGGAWYDPEP 344 (390) Q Consensus 268 -~wG~rT~~~d~~~~~i~vrR~~~~i~~~l~~~~~--~~v~e~n~~~~~~~i~~~i~~~L~~l~~~g~l~g~~v~~d~~~ 344 (390) +||-.|+.++++|+++++||++|++.+.+++.++ +|++|||+..+|.+|+..+..||..||+.|+|.||+.. . T Consensus 457 Iv~gItT~~s~~~~k~i~viR~~D~v~~~vr~~~~~~~fiG~~n~~~~r~~ik~~i~~~L~~l~~~g~I~g~~~~----~ 532 (581) T protein:vir:76 457 VRHGVTTDPTSLHTREWNIIGQQDVMVYRIRDYLDADGLIGMPIYDTTIVQVKASAEAALVWLVDNNIIRGYRNL----K 532 (581) T ss_pred EEEeeecCCCCCccceeeehhhhHHHHHHHHHHHhhhcCCCcccChHHHHHHHHHHHHHHHHHHhcCcccCcccc----e Confidence 5898999999999999999999999999999986 57889999999999999999999999999999998632 3 Q ss_pred CCHHHHhCCeEEEEEEEEecccceEEEEEEEEcchH--HHHHHHHhcC Q lcl|NC_015266. 345 NTTDELTSGGTWIDYDYTPVPPLENLKLRQRITDRY--LADFASRVSA 390 (390) Q Consensus 345 nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~--~~~l~~~~~a 390 (390) .+..+.+.+.+++++.++|++|+|||.+++++.|+. +..-++-... 
T Consensus 533 ~~~~~~~~d~v~V~i~v~Pv~~ie~I~vt~~~~p~~~~~~~~~~~~~~ 580 (581) T protein:vir:76 533 ARQIERQPDVIEVRYEWRPAYPLNYIVVRYSIAPETGDITSTIEGTTS 580 (581) T ss_pred eeEEecCCCEEEEEEEEEecccceEEEEEEEEeeCCCceEEEEecccc Confidence 455667889999999999999999999999998873 1111111111 No 44 >protein:vir:100829 Length: 607 # NCBI annotation: hypothetical protein # Family: family:all:2449 # MgeID: mge:1633 # MgeName: LP65 # Cross-refs: genbank:acc:YP_164738;genbank:gi:56693151;genbank:GeneID:3197462 Probab=99.95 E-value=9.4e-29 Score=174.24 Aligned_cols=365 Identities=13% Similarity=0.117 Sum_probs=248.1 Q ss_pred CCCccCCCEEEEECCCCCccccccccccceeeeccccccccceeccceEEEechhHHHHhhccccchHHHHHhhh----- Q lcl|NC_015266. 1 MPQDYHHGVRVIEINEGGRPIRTVSTAVLGIVCTGADADPATFPLDTPVLLTNVIAALGKAGTKGTLRRTLDAIG----- 75 (390) Q Consensus 1 Ma~~~~hGV~v~ev~~~~~~i~~v~tavig~vgta~~~~~~~~~~~~~~~i~~~~~~~~~~~~~gtl~~al~~~~----- 75 (390) |=..-+||||+++.+++..+.....+.+.+++|.+..+ |.+.|.+++++.++...|+. |.|.+++...| T Consensus 17 ~~~~~~pgv~~~~~~~~~~~~~~~~~~~~~~iG~a~~G-----~~~~~~~~~~~~~a~~~f~~-g~l~~a~~~a~~~~~~ 90 (607) T protein:vir:10 17 LFYDSRPHVETNFDDSRLSNTASDSAKNIFMLGSATNG-----DPTKVYEIRTSQQATKIFGS-GDLVDGIKLAFDPTGN 90 (607) T ss_pred CCCccCCceEEEEecCcCcCCCCCCcceEEEEEEeCCC-----CCceEEEEcchhHHHHhhcC-cchHHHHHHhhccccC Confidence 33334689999999999999999999999999999877 55788999999999887766 66777777666 Q ss_pred -cccCceEEEEeecccccccc---------------------------------c----ccc------------------ Q lcl|NC_015266. 76 -KQTKPVTVVVRVAEGKDEAE---------------------------------T----TAN------------------ 99 (390) Q Consensus 76 -~~~~~~~~vv~v~~~~~~~~---------------------------------~----~~~------------------ 99 (390) .+++..++.+++........ 
+ +.+ T Consensus 91 ~~~g~~~~~~~rv~~~~~a~~~~~~~~~~~~~~~~~~~~i~~~l~~~~~~~~~~~~~~~~d~~~~~~~n~g~~~~i~y~g 170 (607) T protein:vir:10 91 SVTNGGTVYALRVDNAKQASLVKDGLTFTSSIFGTNANQVSVALDNDVFGVPRITVNYSPDNYERTYTNIGQMFSITYSG 170 (607) T ss_pred CccCCceEEEEeCCCccccceecccccccccccccCCCceEEEEEecCCCccceeEEeecccceeeeeeccceeecccCc Confidence 68888888888643111000 0 000 Q ss_pred --------ch----hh---------hccc-------------hhhhhhhhhh---------------------------- Q lcl|NC_015266. 100 --------VI----GT---------VTPD-------------GKYTGMKALL---------------------------- 117 (390) Q Consensus 100 --------~~----~~---------~~~~-------------~~~tgl~~~~---------------------------- 117 (390) +. +. .+.. ...+..++.. T Consensus 171 ~~~~a~~~v~~~~~g~~~~lt~~~~~~~~~~~~V~~~~l~~~~~~t~~~l~~din~~~~~~A~~~g~~~i~tky~d~~~~ 250 (607) T protein:vir:10 171 KSASAGYTVSHDTDGKAILLTLGSGDSIDKLTNVATFDLTMSKYDTIAKLMQAISATPNFSASVVGSPSVNTSYLDEVTS 250 (607) T ss_pred ccccccceeeecCCCceeEEEecCCCccceeeeeecccccccccchHHHHHHHhhcCCceEEEEecccceeeeccccccc Confidence 00 00 0000 0000000000 Q ss_pred ----------------hhhhhhhhh------------------------------------h------------------ Q lcl|NC_015266. 118 ----------------AAQGKLAVK------------------------------------P------------------ 127 (390) Q Consensus 118 ----------------~~~~~~~~~------------------------------------p------------------ 127 (390) +........ + T Consensus 251 ~i~V~~~~~iv~a~~~D~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~a~~~LtGGtdG~~~~ 330 (607) T protein:vir:10 251 PVDVKTAPAVVTAKIGDAISKLGYDPYVVVTQTSNNKPIVNGVSAGTGSATASVTTAPESFPANFDTAFLTGGSTGDVPV 330 (607) T ss_pred eeEEEEeeeeechhhhhhhhcccccceEEeeecccchhhhhhhhccccceeeeeeccccccccccceeeeeCCCCCCchh Confidence 000000000 0 Q ss_pred -------------hhhhhhhhcchHHHHHHHHhhhhcc-----eEEeecccccCchHHHHHHhhhhccceEEEEeeeeEE Q lcl|NC_015266. 
128 -------------RILVAPGLDTQPVAAAFATIAQSLR-----AMVYVAAHGCKTKEEAVAYRKQFGQREIMVIWPDWLG 189 (390) Q Consensus 128 -------------~~~~apg~~~~~v~~al~~~~~~~~-----~~~~~d~~~~~~~~~a~~~~~~~~~~~~~~~~p~~~~ 189 (390) .....+....+++++++.+++.+++ .+.++..+++.+.+++......+++.+.+.+.|+..+ T Consensus 331 ty~dal~aLe~~e~~~i~~~t~d~ai~~~l~a~vkr~~~~g~~~~aVlg~~~~~t~~~~~t~a~~~N~ervv~V~~~~~~ 410 (607) T protein:vir:10 331 SWADKFNGAIGNNVYYIIPLTSEENIHAELQAFIDEQHVLGYNYHAFVGGGFAEPLEQILSRQVNINDSRFGLVGQSGHV 410 (607) T ss_pred hHHHHHHHHhhcCceEEEecCCCHHHHHHHHHHHHHHHhCCCcEEEEecCCCCCCHHHHHHHHHhhCCCcEEEEecCeeE Confidence 0000000112456677777776653 3555666777888999999999999999999998766 Q ss_pred EeeccCceeEecH---HHHHHHHHhhhhhccceeecccCceeecccccccccchhhhccccccccccccceeEEEc---- Q lcl|NC_015266. 190 WDDITNSTVAIPA---PAIAAGLRAKIDNDIGWHKTLSNVVVNGVTGISADVSWDLQDPATDAGYLNENQVTTLVN---- 262 (390) Q Consensus 190 ~~~~~~~~~~~p~---s~~vAg~~a~~d~~~g~~~span~~l~gv~~~~~~~~~~~~~~~~~~~~l~~~gI~~~~~---- 262 (390) .+ .+....+|+ ++++||++|..+ +.+||.|+.+. ..++....+..+++. +..+|+.++.. T Consensus 411 ~~--~G~~~~~~~~~~Aa~vAGl~Ag~~----~~~SlT~k~i~-~~~v~~~lt~~e~e~------ai~~Gv~~l~~~~~~ 477 (607) T protein:vir:10 411 QE--GGESVHVPAYLMAAYVGGLSSSLG----VAVPITNKKLA-LVDLDQNFSGDDLNT------LNQNGVIGIEHLVNR 477 (607) T ss_pred ee--CCcceeccHHHHHHHHHHHHhcCc----cccCcccceec-cccccccCCHHHHHH------HHhCCeEEEEEccCc Confidence 54 345556664 788999999877 67899999986 557777766655544 44578777743 Q ss_pred --CCCEEEEccccC---CCCcccceeeehhhHHHHHHHHHHHHH-HHhcCCCCHHHHHHHHHHHHHHHHHHHh--CCcee Q lcl|NC_015266. 263 --RNGFRFWGSRTC---DADGKFFFENYTRSAQVIADTIAEEQM-GVVDGPLNPSRARDIIENINAWFRREVS--VGELI 334 (390) Q Consensus 263 --~~G~~~wG~rT~---~~d~~~~~i~vrR~~~~i~~~l~~~~~-~~v~e~n~~~~~~~i~~~i~~~L~~l~~--~g~l~ 334 (390) .++++++.+-|+ ..++.|++++++|++|+|.+.+++.+. +|++++|+...|.+++..+..+|..+|. .|+|. 
T Consensus 478 ~~~~~vrIv~~ItT~t~~~~~~~~~i~viRv~D~i~~dir~~~~~~yIGk~nnd~~~~~vk~~i~~~L~~~~l~~~gaI~ 557 (607) T protein:vir:10 478 NATGGYYIVQDVSTNTVSSSHVDGSLYLGELTDFLFDNLRFVLRDTYIGSNIRSTSADDIKSTVASYLYSEMNNDDGLIV 557 (607) T ss_pred cccceEEEeeeeeeccCCCCcchheeehhhhHHHHHHHHHHHHhhcCCcccCCcchHHHHHHHHHHHHHHHHHHhcCcee Confidence 134777666544 346789999999999999999998875 7999999999999999999999976655 68898 Q ss_pred eeEEEEecCCCCHHHHhCCeEEEEEEEEecccceEEEEEEEEcchHHHHHHHHhc Q lcl|NC_015266. 335 GGGAWYDPEPNTTDELTSGGTWIDYDYTPVPPLENLKLRQRITDRYLADFASRVS 389 (390) Q Consensus 335 g~~v~~d~~~nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~l~~~~~ 389 (390) +|..+ +-+-...+.++++++.+.|+.++|+|.+++++.++-|+.-=+..- T Consensus 558 df~~e-----dv~v~~~~D~v~v~~~v~Pv~~iekIyvtv~v~~~~~~~~~~~~~ 607 (607) T protein:vir:10 558 DFSES-----DIVVTISGTVVYIQFAVAPTQEIKNIVVSGTYSNYSATSEDNTTK 607 (607) T ss_pred CCCcc-----ccEEeeCCCEEEEEEEEEEcccceEEEEEEEEEEEEEeeccCCCC Confidence 87421 112234557899999999999999999999999886652222111 No 45 >protein:vir:102957 Length: 437 # NCBI annotation: sheath tail protein # Family: family:all:632 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945292;genbank:gi:39653727;uniprot:Q708M0;genbank:GeneID:2672871 Probab=99.94 E-value=3.3e-28 Score=171.22 Aligned_cols=356 Identities=13% Similarity=0.070 Sum_probs=223.9 Q ss_pred CCC-------ccCCCEEEEECCCCCccccccccccceeeeccccccccceeccceEEEechhHHHHhhccccch-HHHHH Q lcl|NC_015266. 1 MPQ-------DYHHGVRVIEINEGGRPIRTVSTAVLGIVCTGADADPATFPLDTPVLLTNVIAALGKAGTKGTL-RRTLD 72 (390) Q Consensus 1 Ma~-------~~~hGV~v~ev~~~~~~i~~v~tavig~vgta~~~~~~~~~~~~~~~i~~~~~~~~~~~~~gtl-~~al~ 72 (390) |+- ...||||++++..+.+++..+.+++.+|+|.+.-+ |.++|+.++++.++...||...+- ...+. 
T Consensus 1 m~gg~~~~~~k~~PGvYi~~~~~~~~~i~~~~~~~~a~~~~~~~G-----p~~~~~~i~s~~d~~~~fG~~~~~~~~~~~ 75 (437) T protein:vir:10 1 MAGGIWKRQNKVRPGAYINVKSKDIAMTRLGGDGVVTVPLALSFG-----QSKKLMKIRRGEDLFKKLGYEQESPQLLLL 75 (437) T ss_pred CCcceecccceecCceeEEEecCCcceeeccCCcEEEEEEEecCC-----CCceeEEEecHHHHHHHcCCccchhHHHHH Confidence 774 23599999999999999999999999999988544 778999999999998888865432 22222 Q ss_pred hhhcccCceEEEEeeccccccccccccchhhhccchhhhhhh-----------------------------------hhh Q lcl|NC_015266. 73 AIGKQTKPVTVVVRVAEGKDEAETTANVIGTVTPDGKYTGMK-----------------------------------ALL 117 (390) Q Consensus 73 ~~~~~~~~~~~vv~v~~~~~~~~~~~~~~~~~~~~~~~tgl~-----------------------------------~~~ 117 (390) ..+.+++..++++++.+......+..+. ...+....|.. ... T Consensus 76 ~~~~~g~~~~~~~R~~~g~~a~~tl~~~---~~~~A~~~G~~gn~i~v~v~~~~~d~~~~~v~~~~~~~~~d~~~v~~~~ 152 (437) T protein:vir:10 76 NEAFKRVSEVLLYRLNTGEKANVSLSDN---VTAQAKYSGVRGNDITVTVKTNVDDPSSFDVVTFLDTVVMDLQTVKVLA 152 (437) T ss_pred HHHhcCCCEEEEEECCCCceeeEeeccc---eEEEeccCCcccceeEEEEeeccCCccceEEEEecCcceeeeeehhhhh Confidence 2334677889999987543322211110 00000000000 000 Q ss_pred ----hhhhhh------hhhhhhhhhhhhc----chHHHHHHHHhhhhcceEEeecccccCchHHHHHHhhhh---ccceE Q lcl|NC_015266. 118 ----AAQGKL------AVKPRILVAPGLD----TQPVAAAFATIAQSLRAMVYVAAHGCKTKEEAVAYRKQF---GQREI 180 (390) Q Consensus 118 ----~~~~~~------~~~p~~~~apg~~----~~~v~~al~~~~~~~~~~~~~d~~~~~~~~~a~~~~~~~---~~~~~ 180 (390) ...... ...+......|.+ ......+|..+.......+.++.........+.+|-+.. ...+. T Consensus 153 ~~~~n~~v~~~~~~~l~~~a~~~LtGG~dg~~t~~dy~~al~~le~~~~n~l~~~~~d~~~~t~~~~~ik~~r~~~g~~~ 232 (437) T protein:vir:10 153 DLKNNALVEFSGTGELQPVAGAKLTGGTDGAISTQDYLEYFKALETVEFNYMALPVEDASIKKAAINFIKRMREDEGLGA 232 (437) T ss_pred hhhhhcccccccccccccccceeeeccccCCCChhHHHHHHHHhccCcceEEEecCCChhHHHHHHHHHHHHHhccCceE Confidence 000000 0000001111111 123455555554322222233322222334555564332 12222 Q ss_pred EEEeeee-----EEEeecc----CceeEec---HHHHHHHHHhhhhhccceeecccCceeecccccccccchhhhccccc Q lcl|NC_015266. 
181 MVIWPDW-----LGWDDIT----NSTVAIP---APAIAAGLRAKIDNDIGWHKTLSNVVVNGVTGISADVSWDLQDPATD 248 (390) Q Consensus 181 ~~~~p~~-----~~~~~~~----~~~~~~p---~s~~vAg~~a~~d~~~g~~~span~~l~gv~~~~~~~~~~~~~~~~~ 248 (390) .++-+.. .+.+-.+ .....++ .++.+||++|..+ +.+|+.|+.+.|+.++....+..+.+. T Consensus 233 ~~V~~~~~~d~e~Iin~~n~~~~~~~~~~~~~~~~a~vAG~~Ag~~----~~~S~t~~~~~~~~~v~~~~t~~e~~~--- 305 (437) T protein:vir:10 233 QLVVADSDADSEAVINVKNGVILSDKTVIDKTKATVWVAAASANAG----VEKSLTYEKYEDSVDVVGRLSHTETED--- 305 (437) T ss_pred EEEeCCCCCCCceEEEeecceeecCcceechhhHHHHHHHHhccCc----cccCccccccCCcccccccCCHHHHHH--- Confidence 2221110 0111001 0111222 4678889998875 778999999999888776666555444 Q ss_pred cccccccceeEEEcCCC--EEEEccccCCC-----CcccceeeehhhHHHHHHHHHHHHHH-HhcC-CCCHHHHHHHHHH Q lcl|NC_015266. 249 AGYLNENQVTTLVNRNG--FRFWGSRTCDA-----DGKFFFENYTRSAQVIADTIAEEQMG-VVDG-PLNPSRARDIIEN 319 (390) Q Consensus 249 ~~~l~~~gI~~~~~~~G--~~~wG~rT~~~-----d~~~~~i~vrR~~~~i~~~l~~~~~~-~v~e-~n~~~~~~~i~~~ 319 (390) +..+|+..+.+.+| +.++|-.|+.+ ++.|++|.++|++|+|.+.+++.+.. |+++ ||+...|..++.. T Consensus 306 ---~i~~G~~vl~~~~~~v~i~~gInTltt~~~~~~~~~~ki~vir~~D~i~~di~~~~~~~yiGk~~N~~~~r~~~~~~ 382 (437) T protein:vir:10 306 ---ALLKGQFVFTARRGRAVVEQDINSHVSFTIEKNQDFRKNRILRTLDDIVNDTRYAFSEYFLGKVSNNEDGRQAFKAN 382 (437) T ss_pred ---HHhCCcEEEEEeCCeEEEEEccccccccCCCCCchhhhhhHHHHHHHHHHHHHHHHHhccccccCCCHHHHHHHHHH Confidence 44567777766544 44588777643 56899999999999999999998874 9998 7999999999999 Q ss_pred HHHHHHHHHhCCceeeeEEEEecCCCCHHHHhCCeEEEEEEEEecccceEEEEEEEEc Q lcl|NC_015266. 320 INAWFRREVSVGELIGGGAWYDPEPNTTDELTSGGTWIDYDYTPVPPLENLKLRQRIT 377 (390) Q Consensus 320 i~~~L~~l~~~g~l~g~~v~~d~~~nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~ 377 (390) |..||.+|+++|+|.+|.++..+..+.. ....+++++.++|+.++|+|.+++... 
T Consensus 383 i~~yl~~l~~~g~I~~~~~~d~~v~~~~---~~~~v~v~~~v~~~dame~iy~ti~v~ 437 (437) T protein:vir:10 383 RIRYFKDLEARGAIEDFKVEDIEVLRGE---LKESVVVNVKVKPVDSMEKLYMTVTVE 437 (437) T ss_pred HHHHHHHHHhCCCccCCCceeEEeecCC---CCCEEEEEEEEEEeeeeeeEEEEEEec Confidence 9999999999999999988766544322 347899999999999999999999988 No 46 >protein:vir:105470 Length: 451 # NCBI annotation: putative phage sheath tail protein # Family: family:all:632 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529880;genbank:gi:90592620;genbank:GeneID:3974534 Probab=99.86 E-value=1.2e-22 Score=140.79 Aligned_cols=356 Identities=13% Similarity=0.023 Sum_probs=210.4 Q ss_pred CCCc-------cCCCEEEEECCCCCccccccccccceeeeccccccccceeccceEEEechhHHHHhhccccch--HHHH Q lcl|NC_015266. 1 MPQD-------YHHGVRVIEINEGGRPIRTVSTAVLGIVCTGADADPATFPLDTPVLLTNVIAALGKAGTKGTL--RRTL 71 (390) Q Consensus 1 Ma~~-------~~hGV~v~ev~~~~~~i~~v~tavig~vgta~~~~~~~~~~~~~~~i~~~~~~~~~~~~~gtl--~~al 71 (390) |+-- ..||||+.++..+.+++..+.+..+++++.... ++.+.|+.+.+..++...||...+- ..++ T Consensus 1 magg~~~~~~K~~PGvYi~~~~~~~~~~~~~~~~~~~~i~~~~~-----~g~~~~v~i~~~~d~~~~fG~~~~~~~~~~~ 75 (451) T protein:vir:10 1 MAGGTWKAQDKRRPGTYINVVGNGQREAASSLGRVLLIRDKGLG-----WGKNGVIEVEANSDFTKKLGTTLDDPSLTAL 75 (451) T ss_pred CCceeeccceeecCceEEEEeccCcceeeccCCcEEEEEeeecC-----CCCcccEEeecHHHHHHHcCCcccchhHHHH Confidence 7752 349999999999999999999999999986432 3346688999999988888754432 2345 Q ss_pred HhhhcccCceEEEEeeccccccccccccchhhhccchhhhhhh------------------------------------- Q lcl|NC_015266. 72 DAIGKQTKPVTVVVRVAEGKDEAETTANVIGTVTPDGKYTGMK------------------------------------- 114 (390) Q Consensus 72 ~~~~~~~~~~~~vv~v~~~~~~~~~~~~~~~~~~~~~~~tgl~------------------------------------- 114 (390) +..+ +++..++++++........+... .....+....|.. 
T Consensus 76 ~~~~-~g~~~v~~yrl~~g~~a~~t~~~--~~~~~~Aky~G~~Gn~i~v~v~~~~~d~~~~~v~t~~g~~~vd~qtv~~~ 152 (451) T protein:vir:10 76 KETL-KGASKVLVLNPNEGTAATLTKEG--LPWTVTANYPGEKGNQITVSVEVSPADQNAATVSTIFGTKLVDEQSIKFN 152 (451) T ss_pred HHHh-cCCcEEEEEEcCCCceEEEEeec--CceEEEEeeCCcCCceEEEEEecccCCcCceEEEEEECCeEEEEEEeecc Confidence 4455 46677888887554322211100 0000000000000 Q ss_pred -hhhhhhhhhh-------hhhh-----hhhhh------hhcchHHHHHHHHhhhhcceEEeecccccC--chHHHHHHhh Q lcl|NC_015266. 115 -ALLAAQGKLA-------VKPR-----ILVAP------GLDTQPVAAAFATIAQSLRAMVYVAAHGCK--TKEEAVAYRK 173 (390) Q Consensus 115 -~~~~~~~~~~-------~~p~-----~~~ap------g~~~~~v~~al~~~~~~~~~~~~~d~~~~~--~~~~a~~~~~ 173 (390) ........+. ..+. -+..+ +-+......+|......-...+.++..... -...+.+|.+ T Consensus 153 ~~~el~~nd~V~a~~~~~g~~~~~~~~~l~~~~~gg~~~~~~~~~~~~l~~~e~~~~n~l~~~~~~~~~~i~~~~~a~ik 232 (451) T protein:vir:10 153 ELDKFKGNDYITAKVVEEGSSKPVAFTNVSGTLTGGTTTESNKVESLLNDALENEEYAVVTTAGFEPSSNMNKLVVEAVK 232 (451) T ss_pred chhhccCCceEEEEecccccccceeeeecccccccccccCCccchHHHHHHhccceeeEEEEccCCCchHHHHHHHHHHH Confidence 0000000000 0000 00000 001112233333322221122222211111 1233455544 Q ss_pred hh----ccceEE-EEee------eeEEEeeccC----ceeEecH---HHHHHHHHhhhhhccceeecccCceeecccccc Q lcl|NC_015266. 174 QF----GQREIM-VIWP------DWLGWDDITN----STVAIPA---PAIAAGLRAKIDNDIGWHKTLSNVVVNGVTGIS 235 (390) Q Consensus 174 ~~----~~~~~~-~~~p------~~~~~~~~~~----~~~~~p~---s~~vAg~~a~~d~~~g~~~span~~l~gv~~~~ 235 (390) .. +-...+ ++.+ +..+.+-.++ ....+++ ++.+||++|..+ +.+|+.|+.+.|+..+. T Consensus 233 ~~r~~~g~~~~aVl~~~~~~~~d~egiinv~n~~~~~dg~~~~~~~~~~~vAG~~Ag~~----~~~S~T~~~~~~~~~v~ 308 (451) T protein:vir:10 233 RLRENEGRKVRGVIPTDADTTYNYEGISTVVNGYTLSDGTNVDVKDATGYFAGISASAD----VATSLTYFEVEDAVSAY 308 (451) T ss_pred HHHHhcCCeEEEEecCccCCCCCCcceEEeecceEecCceeechhhhHHHHHHHHcccc----cccCccceecCCceeee Confidence 32 222222 2211 1111111111 1123443 578889998865 67799999999988887 Q ss_pred cccchhhhccccccccccccceeEEE-c-CCCEE-EEccccCCC-----CcccceeeehhhHHHHHHHHHHHHHH-HhcC Q lcl|NC_015266. 
236 ADVSWDLQDPATDAGYLNENQVTTLV-N-RNGFR-FWGSRTCDA-----DGKFFFENYTRSAQVIADTIAEEQMG-VVDG 306 (390) Q Consensus 236 ~~~~~~~~~~~~~~~~l~~~gI~~~~-~-~~G~~-~wG~rT~~~-----d~~~~~i~vrR~~~~i~~~l~~~~~~-~v~e 306 (390) ..++..+.+.. ..+|...+. + +++++ .+|-.|+.+ +..|+.|.++|++|.|.+.+++.+.. |+++ T Consensus 309 ~~~t~~e~~~~------i~~G~lvl~~~~g~~v~i~~~INTltt~~~~k~~~~~ki~vir~~D~i~~di~~~~~~~yiGk 382 (451) T protein:vir:10 309 PKFDNEKTIKA------LDAGQIVFTTRPGQRVVIEQDINSLHKFTAEKPQAFSKNRVIRTLDEIATNTENTFERTYLGN 382 (451) T ss_pred eeCCHHHHHHH------HhCCeEEEEEEcCCeEEEEEccccceecCCCCCcchhhhhHHHHHHHHHHHHHHHhhhcccee Confidence 76665555444 345765553 3 33454 588878743 56899999999999999999999875 9996 Q ss_pred -CCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEecCCCCHHHHhCCeEEEEEEEEecccceEEEEEEEEc Q lcl|NC_015266. 307 -PLNPSRARDIIENINAWFRREVSVGELIGGGAWYDPEPNTTDELTSGGTWIDYDYTPVPPLENLKLRQRIT 377 (390) Q Consensus 307 -~n~~~~~~~i~~~i~~~L~~l~~~g~l~g~~v~~d~~~nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~ 377 (390) ||+..-|..++..|..||.+|+++|+|.+|... |.+-. ..-....+++++.++|+..||+|.+.+++. T Consensus 383 ~~N~~~gr~~~~~~i~~yl~~l~~~g~i~~~~~~-d~~v~--~~~~~~~v~v~~~v~pvdame~iy~t~~v~ 451 (451) T protein:vir:10 383 VGNNAAGRDLFKADRIAYLTSLQNRNMIQSFANT-DITVE--AGNDMDSIVVNLAVTPVDAMEKLYMTMVVR 451 (451) T ss_pred cCCCHHHHHHHHHHHHHHHHHHHhCCCccCCCcc-ceEEe--ecCCCCEEEEEEEEEEEeeeeeEEEEEEEc Confidence 799999999999999999999999999987632 21110 111357899999999999999999999988 No 47 >protein:vir:101326 Length: 529 # NCBI annotation: gp22 # Family: family:all:9453 # MgeID: mge:1512 # MgeName: P1 # Cross-refs: genbank:acc:YP_006525;genbank:gi:46401679;genbank:GeneID:2777386 Probab=99.83 E-value=3.9e-22 Score=137.94 Aligned_cols=351 Identities=15% Similarity=0.084 Sum_probs=197.3 Q ss_pred CCC---ccCCCEEEEECCCCCccccccccccc--------eeeeccccc-cccceec---------c----ceEEEechh Q lcl|NC_015266. 
1 MPQ---DYHHGVRVIEINEGGRPIRTVSTAVL--------GIVCTGADA-DPATFPL---------D----TPVLLTNVI 55 (390) Q Consensus 1 Ma~---~~~hGV~v~ev~~~~~~i~~v~tavi--------g~vgta~~~-~~~~~~~---------~----~~~~i~~~~ 55 (390) |.. .|..|.+++ .+.|....--++...+ .+-.+..+. ....+.+ . .....+-.. T Consensus 112 ~~~~~s~~~~s~~~~-l~~G~~~~iy~~Dgd~~~s~~~~l~i~~~~ads~g~e~~~l~~~~~~~~g~~~~let~~~sl~~ 190 (529) T protein:vir:10 112 GEPAYSALPYGSEIE-LDSGEAFAIYVDDGDPCISPTRELTIETATADSAGNERFLLKLTQTTSLGVVTTLETHTVSLAE 190 (529) T ss_pred ccchhhccccccccc-ccccceEEEEEecCcCccCCceEEEEEeeccccCCCccceeeEEEEeecCCceEEEEEEeeeee Confidence 332 222233321 1121111111111110 000000000 0000000 0 001111122 Q ss_pred HHHHhhccccchHHHHHhhhcccCceEEEEeecccccccc-cc--ccchhhhccch-------hhhhhhhhhhhhhhhhh Q lcl|NC_015266. 56 AALGKAGTKGTLRRTLDAIGKQTKPVTVVVRVAEGKDEAE-TT--ANVIGTVTPDG-------KYTGMKALLAAQGKLAV 125 (390) Q Consensus 56 ~~~~~~~~~gtl~~al~~~~~~~~~~~~vv~v~~~~~~~~-~~--~~~~~~~~~~~-------~~tgl~~~~~~~~~~~~ 125 (390) ++....+....+..+++.....- .-++.++....... +. ..+.+|++... ...++.++.... + T Consensus 191 ~a~dd~G~~~yl~svle~~s~~l---~ai~~~e~~~t~~~~t~~d~~f~~GtdG~~~~i~s~~y~~A~~~L~n~p----~ 263 (529) T protein:vir:10 191 EAKDDMGRLCYLPTALEARSKYL---RAVVNEELISTAKVTNKKSLAFTGGTNGDQSKISTAAYLRAVKVLNNAP----Y 263 (529) T ss_pred chhhhcCCccchhHHHhhccCce---eeeeeeccccccchhhhhhhhccCCccccccccchHHHHHHHHHhcCCc----c Confidence 33444555666666655432211 11111111111100 11 12333333211 112222222221 2 Q ss_pred hhhhhhhhhhcchHHHHHHHHhhhhcceEEeecccccCchHHHHHHhhhhcc---c---eEEEEeeeeEEEeeccCceeE Q lcl|NC_015266. 126 KPRILVAPGLDTQPVAAAFATIAQSLRAMVYVAAHGCKTKEEAVAYRKQFGQ---R---EIMVIWPDWLGWDDITNSTVA 199 (390) Q Consensus 126 ~p~~~~apg~~~~~v~~al~~~~~~~~~~~~~d~~~~~~~~~a~~~~~~~~~---~---~~~~~~p~~~~~~~~~~~~~~ 199 (390) .-..++.-|....++..+|..+|++.+..+..|.++..|+++|.+|.++.+- . ....+|||. ..|+.+++... 
T Consensus 264 d~~~il~~g~y~~a~I~~L~~ic~~~~~d~f~DV~~~LT~~aA~~~~e~~gl~~~~~~~~s~y~~P~~-~~D~~tg~k~~ 342 (529) T protein:vir:10 264 MYTAVLGLGCYDNAAITALGKICADRLIDGFFDVKPTLTYAEALPAVEDTGLLGTDYVSCSVYHYPFS-CKDKWTQSRVV 342 (529) T ss_pred eeeeeeccCCccHHHHHHHHHHHhhhhhcEEEcCCCCcCHHHHHHHHHhcCccccCceeeEEEEccee-eccccccCcee Confidence 2233445555567788999999988888888899999999999999987653 2 245667775 78888899899 Q ss_pred ecHHHH--HHHHHh--hhhhccceeecccCceee-----cccccccccchhhhccccccccccccceeEEEcC--C---- Q lcl|NC_015266. 200 IPAPAI--AAGLRA--KIDNDIGWHKTLSNVVVN-----GVTGISADVSWDLQDPATDAGYLNENQVTTLVNR--N---- 264 (390) Q Consensus 200 ~p~s~~--vAg~~a--~~d~~~g~~~span~~l~-----gv~~~~~~~~~~~~~~~~~~~~l~~~gI~~~~~~--~---- 264 (390) +++|+. +|+..+ +.-.-.|.+.+||++.-. ||..+-. .... |.-.|-...||++.-+ + T Consensus 343 ~GlsG~A~~akargv~~na~v~g~hY~pAGe~r~~inr~~I~~ly~---~d~~----e~~~lv~~riNPV~~~~~g~~~i 415 (529) T protein:vir:10 343 FGLSGVAYAAKARGVKKNSDVGGWHYSPAGEERAVIARASIQPLYP---EDTP----DEEAMVKGRLNKVSVGTSGQMII 415 (529) T ss_pred eCCCcceeeccccceeecccccccccccCCCccceeecccceeccC---CCcc----CHHHHHhhccCeeeeeccCccee Confidence 999994 333222 122223459999998522 3322211 1111 2222333455555322 1 Q ss_pred CEEEEccccCCCCcccceeeehhhHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhCCceee--------- Q lcl|NC_015266. 265 GFRFWGSRTCDADGKFFFENYTRSAQVIADTIAEEQMGVVDGPLNPSRARDIIENINAWFRREVSVGELIG--------- 335 (390) Q Consensus 265 G~~~wG~rT~~~d~~~~~i~vrR~~~~i~~~l~~~~~~~v~e~n~~~~~~~i~~~i~~~L~~l~~~g~l~g--------- 335 (390) +-.+||+|+ |+.|||+|+++|+++|++.+-+..++.+|||++..+|. 
+++-++.+|..+|+.|+|++ T Consensus 416 dDsLt~~~k---nny~R~~hv~~lmn~I~~~~~k~a~~~~~~Pd~it~~g-l~~~l~~~L~r~~asgalv~prdp~~~G~ 491 (529) T protein:vir:10 416 DDALTCCTQ---DNYLHFQHVPSLMNAISRFFVQLARQMKHSPDGITAAG-LTKGMTKLLDRFVASGALVAPRDPDADGT 491 (529) T ss_pred eeeeceeee---CCchhhhhHHHHHHHHHHHHHHHHHHHhhCCChHHHHH-HHHhHHHHHHHHHhcCceecccCccCCCC Confidence 235677774 78899999999999999999999999999999999988 99999999999999999976 Q ss_pred --eEEEEecCCCCHHHHhCCeEEEEEEEEecccceEEEEEEEEcc Q lcl|NC_015266. 336 --GGAWYDPEPNTTDELTSGGTWIDYDYTPVPPLENLKLRQRITD 378 (390) Q Consensus 336 --~~v~~d~~~nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~ 378 (390) |.+.. + +.+.++|.+++.++|.-.+.+|...-..-. T Consensus 492 epy~~~V-----~--q~d~D~~~v~~~~~ptGv~Rri~~~p~l~~ 529 (529) T protein:vir:10 492 EPYVLKV-----T--QAEFDKWEVVWACCPTGVARRIQGVPLLIK 529 (529) T ss_pred CceEEEE-----e--ecccCeEEEEEEeecCCceeeEEeeeeecC Confidence 33333 2 344599999999999999999887655544 No 48 >protein:vir:78986 Length: 436 # NCBI annotation: putative sheath tail protein # Family: family:all:632 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110731;genbank:gi:134287348;genbank:GeneID:4955160 Probab=99.64 E-value=1.1e-16 Score=108.16 Aligned_cols=354 Identities=11% Similarity=0.041 Sum_probs=200.4 Q ss_pred CCC-c------cCCCEEEEECCCCCccccccccccceeeeccccccccceeccceEEEechh---HHHHhhccccchHH- Q lcl|NC_015266. 1 MPQ-D------YHHGVRVIEINEGGRPIRTVSTAVLGIVCTGADADPATFPLDTPVLLTNVI---AALGKAGTKGTLRR- 69 (390) Q Consensus 1 Ma~-~------~~hGV~v~ev~~~~~~i~~v~tavig~vgta~~~~~~~~~~~~~~~i~~~~---~~~~~~~~~gtl~~- 69 (390) |+- . ..||+|+.-+......+......++++...+. .-|.++++.+++.+ +....+|...+... 
T Consensus 3 magg~~~~~~K~~PG~Y~n~~~~~~~~~~~~~rGi~a~p~~~~-----wGp~~~v~~i~~~~~~~~~~~~~G~~~~~~~~ 77 (436) T protein:vir:78 3 LGGGTFVTQNKVLPGSYINFVSATRATSSLSDRGIVAMPLELD-----WGIDEEVFQVTSDDFEKYSTKYFGYDYTHEKL 77 (436) T ss_pred ccceeeccceeecCceEEEEEecCcceeeccCCeEEEEEEEec-----CCCCceeEEeecccchHHHHHHhcCccchHHH Confidence 443 1 25999998776666667777777777776653 44677788887643 44455666655332 Q ss_pred -HHHhhhcccCceEEEEeeccccccccccc--c--------ch----hhhccchh-----hhhh--------hhhhhh-h Q lcl|NC_015266. 70 -TLDAIGKQTKPVTVVVRVAEGKDEAETTA--N--------VI----GTVTPDGK-----YTGM--------KALLAA-Q 120 (390) Q Consensus 70 -al~~~~~~~~~~~~vv~v~~~~~~~~~~~--~--------~~----~~~~~~~~-----~tgl--------~~~~~~-~ 120 (390) .++..+ .+....+.+++.++.....+.. . +. ...+.... ..|- ....+. . T Consensus 78 ~~l~~~~-~~~~tv~~yrl~~G~~a~~~v~~Aky~g~~gn~i~v~v~~~~~d~~~~dv~~~~g~~~~d~~~~~~~~~l~~ 156 (436) T protein:vir:78 78 KGLRDLF-KNIRLGYFYKLNKGVKASCSIATARCSGIRGNDLKVIVTTNIDDNAKFDVVTLLDNKKVDTQIAKVITELQD 156 (436) T ss_pred HHHHHHh-cCCCEEEEEECCCcceeeeeeeeeecCCCCCcEEEEEecccccccCceEEEEEecchhhhhhhHHHHhhccC Confidence 233333 3344466666654322211110 0 00 00000000 0000 000000 0 Q ss_pred hhh------h---hhhhhhhhhhhc-----chHHHHHHHHhhhhcceEEeecccccCchHHHHHHhhhh----ccceEEE Q lcl|NC_015266. 121 GKL------A---VKPRILVAPGLD-----TQPVAAAFATIAQSLRAMVYVAAHGCKTKEEAVAYRKQF----GQREIMV 182 (390) Q Consensus 121 ~~~------~---~~p~~~~apg~~-----~~~v~~al~~~~~~~~~~~~~d~~~~~~~~~a~~~~~~~----~~~~~~~ 182 (390) ..+ + .........|.+ ......+|..+...-...+.++.........+.+|-+.. +-+..++ T Consensus 157 n~~V~~~~~g~la~~a~~~LtGG~dG~~~T~~dy~~al~~le~~~fn~l~~~~~d~~~~~~~~a~ikr~re~~g~~~~aV 236 (436) T protein:vir:78 157 NDYVTWKKEATLEATAGLTFTNGTNGEAVTGTEYQAFLDKIESYSFNALGCLATTAEIKSLFVEFTKRMRDKVGAKFQTV 236 (436) T ss_pred CceEEEEecccccccceeeeeccccccccchHHHHHHHHHHcccceeEEEecCCChHHHHHHHHHHHHHHhhcCCeEEEE Confidence 000 0 001111122222 233455665544332222233322222234455554322 2222222 Q ss_pred Eeee--------eEEEeeccCceeEe--cHHHHHHHHHhhhhhccceeecccCceeecccccccccchhhhccccccccc Q lcl|NC_015266. 
183 IWPD--------WLGWDDITNSTVAI--PAPAIAAGLRAKIDNDIGWHKTLSNVVVNGVTGISADVSWDLQDPATDAGYL 252 (390) Q Consensus 183 ~~p~--------~~~~~~~~~~~~~~--p~s~~vAg~~a~~d~~~g~~~span~~l~gv~~~~~~~~~~~~~~~~~~~~l 252 (390) ..+. ..+....++. .+- -.++.+||++|..+ +-.|+.|+.+.++.++....+..+.+.. T Consensus 237 ~~~~~~~d~EgIInv~n~v~g~-~~~~~~~~a~vAG~~Ag~~----~~~S~T~~~~~~~~~v~~~~t~~e~~~a------ 305 (436) T protein:vir:78 237 LYKKNDADYEGVVSVENKIKDT-GLLESSLIYWTTGAIAGCD----INKSNTNKRYDGEFDVDVNYTQIHLEEA------ 305 (436) T ss_pred ecCCCCCCCceEEEeecccCCc-eechhHHHHHHHHHHhcCc----cccCccceecCccccccccCCHHHHHHH------ Confidence 2221 0011111111 122 25778889988876 5669999999988877766655544433 Q ss_pred cccceeEEEcC-CCEEE-EccccCC-----CCcccceeeehhhHHHHHHHHHHHHH-HHhcC-CCCHHHHHHHHHHHHHH Q lcl|NC_015266. 253 NENQVTTLVNR-NGFRF-WGSRTCD-----ADGKFFFENYTRSAQVIADTIAEEQM-GVVDG-PLNPSRARDIIENINAW 323 (390) Q Consensus 253 ~~~gI~~~~~~-~G~~~-wG~rT~~-----~d~~~~~i~vrR~~~~i~~~l~~~~~-~~v~e-~n~~~~~~~i~~~i~~~ 323 (390) ..+|.-.+.+. +++++ -|--|+. .+..|+.|.++|++|.|.+.+++.+. .|+++ ||+..-|..++..+..| T Consensus 306 i~~G~lvl~~d~~~v~I~~~VNTltt~~~~k~~~~~kI~vir~~D~i~~di~~~~~~~yiGKv~N~~dgr~~l~~~i~~y 385 (436) T protein:vir:78 306 LKTGKFIFHKVGDEVHVLEDINTFVSFTDEKNDDFSSNQSVRVLDQIANDIATLFNTKYLGEVPNDKSGRISFWNDVVKH 385 (436) T ss_pred HhCCeEEEEEeCCeEEEEEccccceecCCCCCcchhhhhHHHHHHHHHHHHHHHhhhccccccCCCHHHHHHHHHHHHHH Confidence 34566666554 34443 4555542 25689999999999999999999986 59997 79999999999999999 Q ss_pred HHHHHhCCceeeeEE---EEecCCCCHHHHhCCeEEEEEEEEecccceEEEEEEEEc Q lcl|NC_015266. 324 FRREVSVGELIGGGA---WYDPEPNTTDELTSGGTWIDYDYTPVPPLENLKLRQRIT 377 (390) Q Consensus 324 L~~l~~~g~l~g~~v---~~d~~~nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~ 377 (390) |.+|.++|+|..|.. ..++. -....+++++.+.|+..+|+|.+.++.. 
T Consensus 386 l~~L~~~g~I~~f~~~Dv~v~~~------~~~~~v~v~~~v~pvdamekiy~ti~v~ 436 (436) T protein:vir:78 386 HEQLQNMRAIEDFKADDVSVEPG------SDKKTVVVSDAVKVISAMSKLYMTVSVS 436 (436) T ss_pred HHHHHhCCcccCCCCcceEEeec------CCCCEEEEEEEEEEEEeeeeEEEEEEEC Confidence 999999999998763 33221 1356788999999999999999999988 No 49 >protein:vir:102359 Length: 356 # NCBI annotation: XkdK protein # Family: family:all:632 # MgeID: mge:1566 # MgeName: phi CD119 # Cross-refs: genbank:acc:YP_529565;genbank:gi:90592650;genbank:GeneID:3974491 Probab=99.09 E-value=7.7e-12 Score=81.51 Aligned_cols=323 Identities=12% Similarity=0.048 Sum_probs=173.2 Q ss_pred CCCccCCCEEEEECCCCCccccccccccceeeeccccccccceeccceEEEechhHHHHhhccccchHHHHHhhhcccCc Q lcl|NC_015266. 1 MPQDYHHGVRVIEINEGGRPIRTVSTAVLGIVCTGADADPATFPLDTPVLLTNVIAALGKAGTKGTLRRTLDAIGKQTKP 80 (390) Q Consensus 1 Ma~~~~hGV~v~ev~~~~~~i~~v~tavig~vgta~~~~~~~~~~~~~~~i~~~~~~~~~~~~~gtl~~al~~~~~~~~~ 80 (390) ||= +|++.++-..-....+....-.++.++-.-. .. ....++...+-..... ......+...+..+.. T Consensus 1 ~~g--lp~i~i~f~~~a~ta~~~g~rGiv~~il~d~---~~-----~~~~~~~~~~v~~~~~--~~n~~~i~~~~~g~~~ 68 (356) T protein:vir:10 1 MAG--LVNINIEFKELATSFIQRSKAGIVAIILKDT---TK-----MYKELTSEDDIPISLS--ADNKKYIKYGFVGATD 68 (356) T ss_pred CCC--CCceeEEEeecceeeccCCccceEEEEEecC---Cc-----ceeEEeccccchhHHH--HHHHHHHHHHhhcccc Confidence 654 4788887766666655544443333333211 00 0111111111100110 1112222222322211 Q ss_pred eEEEEeeccccccccccccchhhhccchhhhhhhhhhhhhhhhhhhhhhhhhhhhcchHHHHHHHHhhhhcce-----EE Q lcl|NC_015266. 81 VTVVVRVAEGKDEAETTANVIGTVTPDGKYTGMKALLAAQGKLAVKPRILVAPGLDTQPVAAAFATIAQSLRA-----MV 155 (390) Q Consensus 81 ~~~vv~v~~~~~~~~~~~~~~~~~~~~~~~tgl~~~~~~~~~~~~~p~~~~apg~~~~~v~~al~~~~~~~~~-----~~ 155 (390) ........ . ....+..+.......|..+... ....+..|+. ..++.+.+.++..+++. +. 
T Consensus 69 ~~~~~~p~-----~---~~~~~~~t~~~y~~aL~~le~~------~fn~l~~~~~-d~~~~~~~~a~ikr~r~~~~~~~~ 133 (356) T protein:vir:10 69 NEKVLRPS-----K---VIISTFTEDGKVEDILEELESV------EFNYLCMPEA-IEAEKTKIVTWIKKIREEESTEAK 133 (356) T ss_pred ccccccce-----e---eeeecccCchhHHHHHHHhcCc------cceEEEecCC-ChHHHHHHHHHHHHHHhcCCcEEE Confidence 11110000 0 0000001112223334443322 2223445553 34566666666666542 22 Q ss_pred eecccccCchHHHHHHhhhhccceEEEEeeeeEEEeeccCceeEecHHHHHHHHHhhhhhccceeecccCceeecccccc Q lcl|NC_015266. 156 YVAAHGCKTKEEAVAYRKQFGQREIMVIWPDWLGWDDITNSTVAIPAPAIAAGLRAKIDNDIGWHKTLSNVVVNGVTGIS 235 (390) Q Consensus 156 ~~d~~~~~~~~~a~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~p~s~~vAg~~a~~d~~~g~~~span~~l~gv~~~~ 235 (390) .+..... .++...+-+... .+.+. ......-.++.+||++|.... -+|+.|..+.++.... T Consensus 134 ~V~~~~~------------aD~EgIInv~n~-~~~~g--~~~t~~~~~~~vAG~~Ag~~~----n~S~T~~~~~~~~~~~ 194 (356) T protein:vir:10 134 AVLANIK------------ADNEAIINFTEN-VVVDG--EEITAEKYTTRVASLIASTPN----TQSITYAPLDEVESIV 194 (356) T ss_pred EEecCCC------------CCCceeEEeecC-eEecc--eeechhHHHHHHHHHHhccch----hccccceecCCccccc Confidence 2221111 122222222221 11111 111112357799999998874 5599999888765443 Q ss_pred cccchhhhccccccccccccceeEEEcCCC-E-EEEccccCC-----CCcccceeeehhhHHHHHHHHHHHHH-HHhcC- Q lcl|NC_015266. 236 ADVSWDLQDPATDAGYLNENQVTTLVNRNG-F-RFWGSRTCD-----ADGKFFFENYTRSAQVIADTIAEEQM-GVVDG- 306 (390) Q Consensus 236 ~~~~~~~~~~~~~~~~l~~~gI~~~~~~~G-~-~~wG~rT~~-----~d~~~~~i~vrR~~~~i~~~l~~~~~-~~v~e- 306 (390) ..+..+.+. .-.+|--.+.+.+| . ..-|-.|+. .+..|+.|.+.|++|.|.+.+++.+. 
.|+++ T Consensus 195 -~~t~~e~~~------ai~~G~lvl~~d~~~V~I~~~VNSltt~t~~k~~~f~Kirvvr~~D~i~~Di~~~f~~~yiGKv 267 (356) T protein:vir:10 195 -KIDKASADA------KVQAGELILRRLSGKIRIARGINSLTTLTAEKGEIFQKIKLVDTKDLISKDIKNIYVEKYLRKC 267 (356) T ss_pred -cCCHHHHHH------HHhCCeEEEEEEcCeEEEEecCccceecCCCCCcchhhhHHHHHHHHHHHHHHHHHhhcccccc Confidence 333333322 23456555555444 3 345655652 24579999999999999999999987 69998 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHhCCcee-eeEEEEecCC--------------CCHHHHhC----CeEEEEEEEEecccc Q lcl|NC_015266. 307 PLNPSRARDIIENINAWFRREVSVGELI-GGGAWYDPEP--------------NTTDELTS----GGTWIDYDYTPVPPL 367 (390) Q Consensus 307 ~n~~~~~~~i~~~i~~~L~~l~~~g~l~-g~~v~~d~~~--------------nt~~~i~~----G~~~~~i~~~p~~p~ 367 (390) ||+..-|..+...++.||.+|.+.|+|. ++.++.|.+. ++...+.+ -.+.+.+.+.|+-.+ T Consensus 268 ~N~~dgr~~l~~ai~~y~~~L~~~~~I~~~~~~eid~e~q~~~~~~~g~d~~~~~d~~v~~~~~~~~v~~~~~v~~vdam 347 (356) T protein:vir:10 268 PNTYDNKCLFIVAVQSYLTELAKQELIDSNFTVEIDLEKQKEYLEGKKIAVSKMKENEIKEANTGSNGFYLINLKLVDAM 347 (356) T ss_pred CCCHHHHHHHHHHHHHHHHHHHhCCccccCceeEecccchHHHhhhccccccccccceeecccCCcEEEEEEEEEEEeee Confidence 6999999999999999999999999995 6777776643 22223332 347899999999999 Q ss_pred eEEEEEEEE Q lcl|NC_015266. 368 ENLKLRQRI 376 (390) Q Consensus 368 e~i~~~~~~ 376 (390) |.|.+.++. T Consensus 348 E~iy~ti~v 356 (356) T protein:vir:10 348 EDINIRVQM 356 (356) T ss_pred eeEEeEEeC Confidence 999999998 No 50 >protein:vir:3751 Length: 376 # NCBI annotation: orf23 # Family: family:all:669 # MgeID: mge:79 # MgeName: HP1 # Cross-refs: genbank:acc:NP_043492;genbank:gi:9628627;genbank:GeneID:1261129 Probab=98.61 E-value=5.9e-08 Score=60.20 Aligned_cols=336 Identities=11% Similarity=0.077 Sum_probs=190.6 Q ss_pred cCCCEEEEECCCCCccccccccccceeeeccccccccceeccceEEEechhHHHHhhc-cccchHHHHHhhhcccCceEE Q lcl|NC_015266. 
5 YHHGVRVIEINEGGRPIRTVSTAVLGIVCTGADADPATFPLDTPVLLTNVIAALGKAG-TKGTLRRTLDAIGKQTKPVTV 83 (390) Q Consensus 5 ~~hGV~v~ev~~~~~~i~~v~tavig~vgta~~~~~~~~~~~~~~~i~~~~~~~~~~~-~~gtl~~al~~~~~~~~~~~~ 83 (390) ..|-|.|.+.+-+..++..+.-.. -|+|.+.......++++ ...+.-..++ .+..|..-+.+...|.|.. + T Consensus 1 ~~~~v~vn~ln~~qg~~~~ver~~-lfig~~~~~~~~~~~~~------~~sdld~~lg~~ds~lk~~v~aa~~naG~~-w 72 (376) T protein:vir:37 1 MFPSVQINALNQLSGETKEIERHA-LFVGVGTTNQGKLLALT------PDSDFDKVFGETDTDLKKQVRAAMLNAGQN-W 72 (376) T ss_pred CCCeEEEeeeeccCCCcccccceE-EEeeccccccCceEEec------CCCChHHhhCCCchhHHHHHHHHHhCCCCc-e Confidence 566788888888888888887554 48888775544333333 2333323333 3456778888887777664 2 Q ss_pred EEeeccccccccccccchhhhccchhhhhhhhhhhhhhhhhhhhhhhhhhhh--cchHHHHHH----HHhhhhc-ce-EE Q lcl|NC_015266. 84 VVRVAEGKDEAETTANVIGTVTPDGKYTGMKALLAAQGKLAVKPRILVAPGL--DTQPVAAAF----ATIAQSL-RA-MV 155 (390) Q Consensus 84 vv~v~~~~~~~~~~~~~~~~~~~~~~~tgl~~~~~~~~~~~~~p~~~~apg~--~~~~v~~al----~~~~~~~-~~-~~ 155 (390) ...+..+..+. ... +.++..+.... .+..+...|- ..++...++ .....++ +- ++ T Consensus 73 ~a~~~~p~~~~------------~~~---~~Av~~a~~~~--s~E~V~v~~p~~t~~a~i~a~qa~a~el~~~~~R~vff 135 (376) T protein:vir:37 73 FAHVYIAQEDG------------YDF---VECVKKANQTA--SFEYCVNTRYLGVDKASIGKLQECYAELLAKFGRRTFF 135 (376) T ss_pred EEEEEecCCCh------------hhH---HHHHHHHHhhC--CeeEEEEecCcchhHHHHHHHHHHHHHHHHhcCCeEEE Confidence 22222111110 111 12222222221 1112222221 122222233 2333333 23 33 Q ss_pred eecccc-------cCch----HHHHHHhhhhccceEEEEeeeeEEEeeccCceeEecHHHHHHHHHhhhhhccceeeccc Q lcl|NC_015266. 156 YVAAHG-------CKTK----EEAVAYRKQFGQREIMVIWPDWLGWDDITNSTVAIPAPAIAAGLRAKIDNDIGWHKTLS 224 (390) Q Consensus 156 ~~d~~~-------~~~~----~~a~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~p~s~~vAg~~a~~d~~~g~~~spa 224 (390) ++.... +.+. ....+-++++.+.+..++-.. +. -..|.+||.+|+. ..-+..||. 
T Consensus 136 ile~~g~d~~~~~ge~w~~y~~~l~a~~~gia~~~V~vV~~~---~g---------n~~G~~aGRl~na--aVsVadspg 201 (376) T protein:vir:37 136 IQAVQGINHDQSDGETWDQYVQKLTTLQQTIVADHVCLVPLL---FG---------NETGVLAGRLANR--AVTVADSPA 201 (376) T ss_pred EEeccCCCCcccccCCHHHHHHHHHHHhccccccceeeeeee---cc---------chHHHHHHHHHhC--CcchhcCcc Confidence 333321 1122 233334556777776665321 11 2467888888762 334688998 Q ss_pred Cce---eecccccccccchh-hhccccccccccccceeEEEcC---CCEEEEccccCCC-CcccceeeehhhHHHHHHHH Q lcl|NC_015266. 225 NVV---VNGVTGISADVSWD-LQDPATDAGYLNENQVTTLVNR---NGFRFWGSRTCDA-DGKFFFENYTRSAQVIADTI 296 (390) Q Consensus 225 n~~---l~gv~~~~~~~~~~-~~~~~~~~~~l~~~gI~~~~~~---~G~~~wG~rT~~~-d~~~~~i~vrR~~~~i~~~l 296 (390) ... |.|+..+..+.... ....+.....|...|..+.+.. .|+-+-.+||+.. .+++++|..+|..|-+.|.+ T Consensus 202 RV~tGai~gl~~~~~p~d~~g~el~~a~l~aLd~arysvpr~Y~gydG~Yw~dg~tl~~~gsDYq~ie~~RVvdKa~R~v 281 (376) T protein:vir:37 202 RVQTGALVSLGSANKPLDKDGNELTLAHLKSLETARYSVPMWYPDYDGYYRADGRTLDVEGGDYQVIENLRVVDKVARKV 281 (376) T ss_pred ceeecccccccccccccccCCcccchHHHHHHHhCCCeEEEeeCCCCceEEeCCeEeccCCCCeeeehhchHHHHHHHHH Confidence 764 33333333222211 1123334556777787777543 4777677888865 46899999999999999988 Q ss_pred HHHHHHHhcC---CCCHHHHHHHHHHHHHHHHHHHhCCceeeeE----EEEecCCCCHHHH-----hCCeEEEEEEEEec Q lcl|NC_015266. 297 AEEQMGVVDG---PLNPSRARDIIENINAWFRREVSVGELIGGG----AWYDPEPNTTDEL-----TSGGTWIDYDYTPV 364 (390) Q Consensus 297 ~~~~~~~v~e---~n~~~~~~~i~~~i~~~L~~l~~~g~l~g~~----v~~d~~~nt~~~i-----~~G~~~~~i~~~p~ 364 (390) +...-+.+.. +.++.-....+.-+..=|+++.+.+.|.|.. |.-.++ +|| ...++.+-+.+.|. T Consensus 282 R~~Ai~~i~Dr~lnstp~sia~~~~~~~~pLr~M~ks~ei~g~~fpgei~~P~d----~dI~i~w~sk~~V~I~~~vrPy 357 (376) T protein:vir:37 282 RLLAIGKIADRSFNSTTSSTEYHKNYFAKPLRDMSKSATINGKDFPGECMPPKD----DAITIVWQSKTKVTIYIKVRPY 357 (376) T ss_pred HHHHHHHhcCccccCChhHHHHHHHHHhHHHHHHHhhhhhccccccceeecCCC----CceEEEeccCceEEEEEEEeee Confidence 8766655543 4566678888899999999999999998843 432111 122 24567777777888 Q ss_pred ccceEEEEEEEEcchHHHH Q lcl|NC_015266. 
365 PPLENLKLRQRITDRYLAD 383 (390) Q Consensus 365 ~p~e~i~~~~~~~~~~~~~ 383 (390) --.+.|+..+..|-+-+.+ T Consensus 358 ~cpk~i~~~I~LDls~~~~ 376 (376) T protein:vir:37 358 DCPKEITANIFLDLDSLGE 376 (376) T ss_pred cCcceeEEEEEEecCCCCC Confidence 7788999988887764444 No 51 >protein:vir:5260 Length: 502 # NCBI annotation: hypothetical protein # Family: family:all:396 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852765;genbank:gi:31544040;uniprot:Q7Y5T5;genbank:GeneID:2753559 Probab=98.56 E-value=2.1e-07 Score=57.19 Aligned_cols=358 Identities=11% Similarity=0.044 Sum_probs=169.3 Q ss_pred CCCccC-----CC-EEEEECCCCCccccccccccceeeeccccc-------------------cccceeccceEEEechh Q lcl|NC_015266. 1 MPQDYH-----HG-VRVIEINEGGRPIRTVSTAVLGIVCTGADA-------------------DPATFPLDTPVLLTNVI 55 (390) Q Consensus 1 Ma~~~~-----hG-V~v~ev~~~~~~i~~v~tavig~vgta~~~-------------------~~~~~~~~~~~~i~~~~ 55 (390) ||..|+ |+ +++-+-.....+......+..+...+.... .....+++..+.++... T Consensus 68 aA~~yF~q~p~P~~l~igR~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~G~l~i~i~g~~~t~~~i~lS~~ts~~~vA 147 (502) T protein:vir:52 68 AAQPFFAQSPRAKQLIVARWQKSASTIEATKNTLSGATLSDDLERFKSVVNGRFSLTIGGDVKKVDGLSFARLADFNAVA 147 (502) T ss_pred HHHHHhcCCCccceEEEEeccccccceeechhhhhhhhhHHhHHHhhhhcCceeEEEecceeeeeeccccccccchhHHH Confidence 444333 21 333333222222111111111110000000 00001111111111111 Q ss_pred HHHHhh-ccccchHHHHHhhhcccCceEEEEeeccccccccccccchhhhccchhhhhhhhhhhhhhhhhhhhhhhhhhh Q lcl|NC_015266. 56 AALGKA-GTKGTLRRTLDAIGKQTKPVTVVVRVAEGKDEAETTANVIGTVTPDGKYTGMKALLAAQGKLAVKPRILVAPG 134 (390) Q Consensus 56 ~~~~~~-~~~gtl~~al~~~~~~~~~~~~vv~v~~~~~~~~~~~~~~~~~~~~~~~tgl~~~~~~~~~~~~~p~~~~apg 134 (390) ...... +..+. .+..-++..+.+..+..-..+.+.. 
..+..+.......+.+..++.........+......| T Consensus 148 ~~i~~~l~~~~~---~~tv~~d~~~~~F~i~s~ttg~~~~---~~~~~a~~~~~~gt~~a~~l~l~~~~~av~v~~~~~g 221 (502) T protein:vir:52 148 TKIQEKLTTLSV---AVSIAYDETGNRFIVSANVAGEDKK---TEIDYAIDEGGEGEYIGALLKLENGQASRKVGKNSVS 221 (502) T ss_pred HHHHhhhccccc---ceEEEEecCCceEEEEeccCCCcce---eEEEEeecCCcchhHHHHHhccccccceeeeeeeccc Confidence 111100 00000 0000111112222221111111111 0111111111122223333322222222222222334 Q ss_pred hcchHHHHHHHHhhhh---cceEEeecccccCchHHHHHHhhhhccceEEEEeeeeE-EEeec----------cC---ce Q lcl|NC_015266. 135 LDTQPVAAAFATIAQS---LRAMVYVAAHGCKTKEEAVAYRKQFGQREIMVIWPDWL-GWDDI----------TN---ST 197 (390) Q Consensus 135 ~~~~~v~~al~~~~~~---~~~~~~~d~~~~~~~~~a~~~~~~~~~~~~~~~~p~~~-~~~~~----------~~---~~ 197 (390) ........+|.++... +-.+...+........++.+|.+..+. ...+..+-. +.+.. .+ .. T Consensus 222 ~~aet~~~al~a~~~~~~~w~~~~~a~~~~~~~~la~a~~iea~~~--~f~~~~~d~~~~~~~~~~i~~~l~a~~~~~t~ 299 (502) T protein:vir:52 222 LKKETLGEALFNVAEVNNTWYGFTVAAQLTDSEVEAAAKYAQANTK--LFGANVIRAEQIEWSADNIYKKLYDAGLDHTL 299 (502) T ss_pred ccccCHHHHHHHHHhccCceEEEEEeecCChhHHHHHHHHHhhcCc--EEEEEecCcceeccccchHHHHHHhccCceeE Confidence 3334444444444333 222223332222223334444443222 122211100 00000 00 00 Q ss_pred eE-----ecHHHHHHHHHhhhhhccc-eeecccCceeecccccccccchhhhccccccccccccceeEEEcCCCE-EEEc Q lcl|NC_015266. 198 VA-----IPAPAIAAGLRAKIDNDIG-WHKTLSNVVVNGVTGISADVSWDLQDPATDAGYLNENQVTTLVNRNGF-RFWG 270 (390) Q Consensus 198 ~~-----~p~s~~vAg~~a~~d~~~g-~~~span~~l~gv~~~~~~~~~~~~~~~~~~~~l~~~gI~~~~~~~G~-~~wG 270 (390) .. -.|.+++.|.++..|-..- -...-.+|.+.||.... ...++++.|..+++|.+.+.+|. .+.. 
T Consensus 300 ~~y~~~~~~~~aa~~g~~as~~f~~~~g~iT~~fk~l~GV~~~~--------lt~t~~~al~~~~~N~y~~~~~~~~~~~ 371 (502) T protein:vir:52 300 AMFDKNDMYPVSSALARLLSTNFAANNSTLTLKFKQQPTITADE--------ITATEFAKAKRLGINVYTYFDDVAMIAE 371 (502) T ss_pred EEecCCcchhHHHHHHHHHhcCCCcCcceeeecccccCCcccCc--------CCHHHHHHHHhcCceEEEEecCeeEEec Confidence 01 1256667788888774331 23344566777775322 23456777888999999776664 4567 Q ss_pred cccCCCCcccceeeehhhHHHHHHHHHHHHHHHhc----C-CCCHHHHHHHHHHHHHHHHHHHhCCcee----------- Q lcl|NC_015266. 271 SRTCDADGKFFFENYTRSAQVIADTIAEEQMGVVD----G-PLNPSRARDIIENINAWFRREVSVGELI----------- 334 (390) Q Consensus 271 ~rT~~~d~~~~~i~vrR~~~~i~~~l~~~~~~~v~----e-~n~~~~~~~i~~~i~~~L~~l~~~g~l~----------- 334 (390) +++++++ ||-+.+-.+|+...|+..+...++ + |-|..=...|+..|+.-|++-++.|.|. T Consensus 372 G~~~~G~----~iD~~~~~~Wl~~~lq~~l~~~L~~s~~kIPy~~~G~~~l~a~i~~~l~~a~~~G~I~~G~~~~~~~g~ 447 (502) T protein:vir:52 372 GTVIGGK----FADEIVILDWFVDAVQKEVFARLYKSPTKIPLTDKGQAILIAAVEKVCLEGINNGAFAPGKWTGAGFGN 447 (502) T ss_pred CeeeCCc----hhhHHHHHHHHHHHHHHHHHHHHHhcCCCcccChhHHHHHHHHHHHHHHHHHhcCccccccccCcccce Confidence 7777763 777888999999999988876654 2 6677778999999999999999999884 Q ss_pred ---------eeEEEEe-cCCCCHHHHhCCeE-EEEEEEEecccceEEEEEEEEcc Q lcl|NC_015266. 335 ---------GGGAWYD-PEPNTTDELTSGGT-WIDYDYTPVPPLENLKLRQRITD 378 (390) Q Consensus 335 ---------g~~v~~d-~~~nt~~~i~~G~~-~~~i~~~p~~p~e~i~~~~~~~~ 378 (390) ||.+... .++.++.|..+.+. 
-+.+.+.+..-+++|++.+..++ T Consensus 448 ~~~~d~~~~gy~v~~~~~~~~s~~dr~~R~~~~~~~~~~~aGaIh~v~i~~nv~~ 502 (502) T protein:vir:52 448 LSTGDYLDKGFYVWAAPMDTLSDSDRQARRATPIQTAVKLAGAIHSSDVIVNYNR 502 (502) T ss_pred eeecccccCceEEEeCchhhCCHHHHHcccCCCeEEEEEECceEEEEEEEEEEeC Confidence 5777765 56889999999988 89999999999999999999988 No 52 >protein:vir:80052 Length: 331 # NCBI annotation: gp14 # Family: family:all:396 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468718;genbank:gi:157325298;genbank:GeneID:5601743 Probab=98.47 E-value=5.2e-07 Score=55.03 Aligned_cols=312 Identities=11% Similarity=0.047 Sum_probs=173.0 Q ss_pred CCCccCCCEEEEECCCCCcccccccccccee--eeccccccccceeccceEEEechhHHHHhhccccchHHHHHhhhccc Q lcl|NC_015266. 1 MPQDYHHGVRVIEINEGGRPIRTVSTAVLGI--VCTGADADPATFPLDTPVLLTNVIAALGKAGTKGTLRRTLDAIGKQT 78 (390) Q Consensus 1 Ma~~~~hGV~v~ev~~~~~~i~~v~tavig~--vgta~~~~~~~~~~~~~~~i~~~~~~~~~~~~~gtl~~al~~~~~~~ 78 (390) |-+..- +|.+.--..-+ .....-....+ .|++. .....++..+-...++....++.+....|.++ T Consensus 1 ~~~~iv-~V~v~~~~~~~--~~~~~~~~~~~~~~~t~~----------~~~~y~s~~~v~~d~~~~~~~Ykaa~~~f~Q~ 67 (331) T protein:vir:80 1 MVETIT-DVRVHISVLYP--SPRIGLGRPAIFVKGTAM----------GYKEYTTLEELKDTFADNTEVYAKAKAVFLQK 67 (331) T ss_pred Ccccee-cceeeeccccc--ccccccCcceeEEecccc----------ceEEEechhhhccCCCCCcHHHHHHHHHHhcc Confidence 555442 33221110111 11111111211 22221 12344444444445677778888899999988 Q ss_pred CceEEEEeeccccccccccccchhhhccchhhhhhhhhhhhhhhhhhhhhhhhhhhhcchHHHHHHHHhhhhcceEE-ee Q lcl|NC_015266. 79 KPVTVVVRVAEGKDEAETTANVIGTVTPDGKYTGMKALLAAQGKLAVKPRILVAPGLDTQPVAAAFATIAQSLRAMV-YV 157 (390) Q Consensus 79 ~~~~~vv~v~~~~~~~~~~~~~~~~~~~~~~~tgl~~~~~~~~~~~~~p~~~~apg~~~~~v~~al~~~~~~~~~~~-~~ 157 (390) ..+.-+....... . +-+.++...... ... .+..... 
..+-..++....+..+-++ .+ T Consensus 68 ~~~~~i~v~~~~~-~-----------------~~~~a~~a~~~~-~w~--~~~~~~~-~~~~~~a~a~~~~a~~~~f~~~ 125 (331) T protein:vir:80 68 DRPDTVAVITYED-T-----------------KLLEAAEAYFLK-SWH--FALLAEF-KAADALALSNLIEEQKFKFAVF 125 (331) T ss_pred CccceEEEeccch-H-----------------HHHHHHHHhccC-cee--EEEeecC-CHHHHHHHHHHHhhCCcEEEEE Confidence 7654433221110 0 011111111000 000 0011111 1222223444444433333 33 Q ss_pred cccccCchHHHHHHhhhhccceEEEEeeeeEEEeeccCceeEecHHHHHHHHHhhhhhccceeecccCc-eeeccccccc Q lcl|NC_015266. 158 AAHGCKTKEEAVAYRKQFGQREIMVIWPDWLGWDDITNSTVAIPAPAIAAGLRAKIDNDIGWHKTLSNV-VVNGVTGISA 236 (390) Q Consensus 158 d~~~~~~~~~a~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~p~s~~vAg~~a~~d~~~g~~~span~-~l~gv~~~~~ 236 (390) .. . +........+ .+....+++. ..+. -+.+++.|.++..+..+--| .++ +|.||.... T Consensus 126 ~~--~-~~~~~~~~~~--~~~t~~~~~~-------~~~~----~~~aa~~g~~~~~~~g~~t~---~fk~~l~GV~~~~- 185 (331) T protein:vir:80 126 QV--T-AVADITPLAK--NTRTIAIVHS-------KTGE----KLDAALIGNVASLPVGSATW---KGRHGLAGITSEE- 185 (331) T ss_pred ec--C-chHHHHHhhc--cccEEEEEcC-------Cccc----hhHHHHHHHHHhcCccceee---eeecccCCCCCCC- Confidence 22 1 1122221111 2222223222 1111 24566677777666533222 455 366665321 Q ss_pred ccchhhhccccccccccccceeEEEcCCCE-EEEccccCCCCcccceeeehhhHHHHHHHHHHHHHHHhcC----CCCHH Q lcl|NC_015266. 237 DVSWDLQDPATDAGYLNENQVTTLVNRNGF-RFWGSRTCDADGKFFFENYTRSAQVIADTIAEEQMGVVDG----PLNPS 311 (390) Q Consensus 237 ~~~~~~~~~~~~~~~l~~~gI~~~~~~~G~-~~wG~rT~~~d~~~~~i~vrR~~~~i~~~l~~~~~~~v~e----~n~~~ 311 (390) ....+.+.|..+++|.+.+..|. .++...|++++ ||-+.+-.+|+...|+..+...+-. |-+.. 
T Consensus 186 -------lt~t~~~al~~~~~N~y~~~~~~~~~~~G~~~~G~----~iD~~~~~dWl~~~lq~~l~~ll~~~~kiPy~~~ 254 (331) T protein:vir:80 186 -------LKVSEIDAIQKAGGMCYIEKAGIAQTSEGKTVSGE----FIDSIHGDDWIKATIETRLQKLLTETDKLTFDAR 254 (331) T ss_pred -------CCHHHHHHHHhcCceEEEEecCeeEEecceEeCch----hHHHHHHHHHHHHHHHHHHHHHHHhCCCCccChh Confidence 23456777888999999877664 56777777763 8999999999999999988876543 56666 Q ss_pred HHHHHHHHHHHHHHHHHhCCcee--------eeEEEEe-cCCCCHHHHhCCeEE-EEEEEEecccceEEEEEEEEcc Q lcl|NC_015266. 312 RARDIIENINAWFRREVSVGELI--------GGGAWYD-PEPNTTDELTSGGTW-IDYDYTPVPPLENLKLRQRITD 378 (390) Q Consensus 312 ~~~~i~~~i~~~L~~l~~~g~l~--------g~~v~~d-~~~nt~~~i~~G~~~-~~i~~~p~~p~e~i~~~~~~~~ 378 (390) =...|+..++.-|++-++.|.|. ||.|... .++.+++|+.+++.. +.+.+.+..-+++|++....+. T Consensus 255 G~~~l~a~i~~~~~~av~~G~I~~g~~~~~~~~~v~~~~~~~~s~~dr~~R~~~~~~~~~~~~gaI~~v~i~~~v~~ 331 (331) T protein:vir:80 255 GIALLQSELTTVLNEGFANGIIDSNDETGEPNFSITALQRSDLNDDDIAKRNYKGLSFRYKRSGAIHSVDVYGEVEV 331 (331) T ss_pred hHHHHHHHHHHHHHHHHhCCceecCccCCCcceEEEeCchhcCCHHHHhccCCCCeEEEEEEcceEEEEEEEEEEeC Confidence 77889999999999999999995 6788775 467899999998876 8888999999999999999888 No 53 >protein:vir:95263 Length: 450 # NCBI annotation: Phage conserved protein # Family: family:all:396 # MgeID: mge:1561 # MgeName: Felix 01 # Cross-refs: genbank:acc:NP_944896;genbank:gi:38707836;genbank:GeneID:2744049 Probab=98.47 E-value=5.3e-07 Score=54.98 Aligned_cols=358 Identities=11% Similarity=0.024 Sum_probs=187.0 Q ss_pred CCCccCCCEEEEECCCCCccccccccccceeeeccccccccceeccceEEEechhHHHHhhccccchHHHHHhhhcccCc Q lcl|NC_015266. 1 MPQDYHHGVRVIEINEGGRPIRTVSTAVLGIVCTGADADPATFPLDTPVLLTNVIAALGKAGTKGTLRRTLDAIGKQTKP 80 (390) Q Consensus 1 Ma~~~~hGV~v~ev~~~~~~i~~v~tavig~vgta~~~~~~~~~~~~~~~i~~~~~~~~~~~~~gtl~~al~~~~~~~~~ 80 (390) |-.+. |.| .+.-.+.++...+-..+.++|.... +.......++..+-...|+.....+.+...+|.+... 
T Consensus 1 ~~s~i---VnV-~i~~~~~a~~~~~f~~~l~~~~~~~------~~~r~~~yss~~~V~~~FG~~S~ey~aA~~yF~q~p~ 70 (450) T protein:vir:95 1 MWNPI---VNV-DITLNTAGTTREGFGLPLFLASTDN------FEERVRGYTSLTEVAEDFDENTAAYKAAKQLWSQTPK 70 (450) T ss_pred CCCce---EEE-eecccccccccccceeEEEEcCCCC------CccceeeecCHHHHHHhcCCCcHHHHHHHHHHhCCCc Confidence 65533 222 2233344444445455556665421 2222233445555555778888888888888887554 Q ss_pred eEEEEe--eccccccc-----------------------cccccchhhhccchhhhhhhhhhhhhhhh------------ Q lcl|NC_015266. 81 VTVVVR--VAEGKDEA-----------------------ETTANVIGTVTPDGKYTGMKALLAAQGKL------------ 123 (390) Q Consensus 81 ~~~vv~--v~~~~~~~-----------------------~~~~~~~~~~~~~~~~tgl~~~~~~~~~~------------ 123 (390) +..+.- ........ ....++.......+..+.+.......... T Consensus 71 p~~l~igr~~~~~t~~~~~~~~~~~~g~lt~tv~G~~~~~~~i~~s~a~s~~~va~~~~tai~~~~~~~~~~~~~s~g~~ 150 (450) T protein:vir:95 71 VTQLYIGRRAMQYTVSIPDAVTESTDYSITVAAGGGISQPYQYTAQSSDTAENVLQQFKTQIEADPTIKDKVSVNVTGSN 150 (450) T ss_pred ccEEEEEeeccchhhhhhhhhccccceeEEEEecceeeeeeEEEEEecCChhhHHHHhhhhhcccceeeeeeeeeeeccc Confidence 432211 11100000 00000000000000011111111100000 Q ss_pred ------------------hhhhhhhhhhhhcchHHHHHHHHhhhhcceEEeecccccCchHHH---HHHhhhhccceEEE Q lcl|NC_015266. 124 ------------------AVKPRILVAPGLDTQPVAAAFATIAQSLRAMVYVAAHGCKTKEEA---VAYRKQFGQREIMV 182 (390) Q Consensus 124 ------------------~~~p~~~~apg~~~~~v~~al~~~~~~~~~~~~~d~~~~~~~~~a---~~~~~~~~~~~~~~ 182 (390) ..........|...+.+..++..+.........+-. +..+.++. .+|.+..+ +... T Consensus 151 ~~~t~~~~~~~~~~~~~l~~~~~~~~~~g~~aet~~~a~~a~~~~~~~w~~~~~-~~~~~~~i~a~a~w~~a~~--~~f~ 227 (450) T protein:vir:95 151 GSATMIIAKAGDNDFVKVTTTAQTVYIASTTADTASTALAAIEAYSTDWYFIAA-EDRTQQFVLAMASEIQARK--KIFF 227 (450) T ss_pred ceeeeeeeccccchhhccccccceeEecccccccHHHHHHHHHHhhCCeEEEEe-cCCCHHHHHHHHHHHhhcC--cEEE Confidence 000111111122223345555554443332222221 22233333 33444322 2222 Q ss_pred EeeeeE-EEeec--------------c--Ccee-E-------ecHHHHHHHHHhhhhhccceeecccCceeecccccccc Q lcl|NC_015266. 
183 IWPDWL-GWDDI--------------T--NSTV-A-------IPAPAIAAGLRAKIDNDIGWHKTLSNVVVNGVTGISAD 237 (390) Q Consensus 183 ~~p~~~-~~~~~--------------~--~~~~-~-------~p~s~~vAg~~a~~d~~~g~~~span~~l~gv~~~~~~ 237 (390) +..+-. +.+.. . .... + -.+.++++|.....++.+ ....+|.+.||..-..+ T Consensus 228 ~~~~~~~~~~~~~~~~~~~i~~~l~~~~~~~t~~~y~~~~~~~~~~aa~~g~~~~~~~g~---~T~~fk~l~Gv~~~v~~ 304 (450) T protein:vir:95 228 TANSDVTALQGTELASANDVPAQLAKNMYTRTVCLWHHAAAEDYPEMAYIAYGAPYDAGS---IAWGNAQLTGVAASLQP 304 (450) T ss_pred EEcCCchhhhhhhhhcccchHHHHHhccCCeeEEEeeCCCchhHHHHHHHHHhhhcccce---eeeccccccceeeeccC Confidence 222211 10000 0 0011 1 124555555544433322 23346777776532221 Q ss_pred cchhhhccccccccccccceeEEEcCCCE-EEEccccCCCCcccceeeehhhHHHHHHHHHHHHHHHhc-----C-CCCH Q lcl|NC_015266. 238 VSWDLQDPATDAGYLNENQVTTLVNRNGF-RFWGSRTCDADGKFFFENYTRSAQVIADTIAEEQMGVVD-----G-PLNP 310 (390) Q Consensus 238 ~~~~~~~~~~~~~~l~~~gI~~~~~~~G~-~~wG~rT~~~d~~~~~i~vrR~~~~i~~~l~~~~~~~v~-----e-~n~~ 310 (390) ...+.....+++.|..+++|.+...+|. .++.++|++. .||-++|-.+|+...|+..+...+- + |-+. T Consensus 305 -~~~~~lt~~~~~al~~~~~n~y~~~~~~~~~~~G~~~~G----~~iD~~~~~~wl~~~iq~~l~~ll~~~~~~KiPy~~ 379 (450) T protein:vir:95 305 -SNQRPLTSIQKSALDVRHCNFIDLDGGVPVVRRGITSGG----EWIDIIRGVDWLESDLKTSLRDLLINQKGGKITYDD 379 (450) T ss_pred -ccccccchHHHHHHHhCCcEEEEEecCceeeeCCeeeCc----chhHHHHHHHHHHHHHHHHHHHHHHhcCCCCCccCh Confidence 1112334567888889999988766663 5688888886 3788999999999999999987662 2 7778 Q ss_pred HHHHHHHHHHHHHHHHHHhCCceeeeEEEEec-CCCCHHHHhCCeEE-EEEEEEecccceEEEEEEEEcch Q lcl|NC_015266. 311 SRARDIIENINAWFRREVSVGELIGGGAWYDP-EPNTTDELTSGGTW-IDYDYTPVPPLENLKLRQRITDR 379 (390) Q Consensus 311 ~~~~~i~~~i~~~L~~l~~~g~l~g~~v~~d~-~~nt~~~i~~G~~~-~~i~~~p~~p~e~i~~~~~~~~~ 379 (390) .-...|+..|+.-|++..++|.|.||+|...+ +..++.|+.+.++. 
+++.+.....++.+.++....=+ T Consensus 380 ~G~~~i~a~i~~~l~~a~~~G~Ia~~~V~~~~~~~~~~~dr~~R~~~~i~~~~~laGAIh~~~i~~~v~~~ 450 (450) T protein:vir:95 380 TGITRIRQVIETSLQRAVNRNFLSSYTVNVPKASQVALADKKARILKDVTFAGILAGAILDVDLKGTVAYE 450 (450) T ss_pred hhHHHHHHHHHHHHHHHHhcCcccceeEecCChHhcCHHHHhccCCCCeeEEEEEccceEEEEEEEEEEeC Confidence 88888999999999999999999999998764 78889999988865 88888888999988887776555 No 54 >protein:vir:3788 Length: 376 # NCBI annotation: tail sheath # Family: family:all:669 # MgeID: mge:328 # MgeName: HP2 # Cross-refs: genbank:acc:NP_536828;genbank:gi:17981837;genbank:GeneID:929216 Probab=98.46 E-value=2e-07 Score=57.34 Aligned_cols=342 Identities=11% Similarity=0.070 Sum_probs=182.1 Q ss_pred cCCCEEEEECCCCCccccccccccceeeeccccccccceeccceEEEechhHHHHhhc-cccchHHHHHhhhcccCceEE Q lcl|NC_015266. 5 YHHGVRVIEINEGGRPIRTVSTAVLGIVCTGADADPATFPLDTPVLLTNVIAALGKAG-TKGTLRRTLDAIGKQTKPVTV 83 (390) Q Consensus 5 ~~hGV~v~ev~~~~~~i~~v~tavig~vgta~~~~~~~~~~~~~~~i~~~~~~~~~~~-~~gtl~~al~~~~~~~~~~~~ 83 (390) ..|-|.|...+-+..++..+.-.. -|+|.+.......+++|+. .+.-..++ .+..|..-+.+.-.|+|..- T Consensus 1 ~~~~v~vn~~n~~~g~~~~~er~~-Lfig~~~~~~~~~~~~~~~------sdld~~lg~~~~~lk~~v~aa~~naG~~~- 72 (376) T protein:vir:37 1 MFPSVQINALNQLSGETKEIERHA-LFVGVGTTNQGKLLALTPD------SDFDKVFGETDTDLKKQVRAAMLNAGQNW- 72 (376) T ss_pred CCCeEEEecccccCCCcccccceE-EeeccccccccceeeecCc------cchHhhhCCCchHHHHHHHHHHhCCCCcE- Confidence 566788888888888888887544 4888776544433333332 22222222 33678888888888877643 Q ss_pred EEeeccccccccccccchhhhccchhhhhhhhhhhhhhhhhhhhhhhhhhhhcchHHHHHHH----Hhhhhc-ceEEe-e Q lcl|NC_015266. 84 VVRVAEGKDEAETTANVIGTVTPDGKYTGMKALLAAQGKLAVKPRILVAPGLDTQPVAAAFA----TIAQSL-RAMVY-V 157 (390) Q Consensus 84 vv~v~~~~~~~~~~~~~~~~~~~~~~~tgl~~~~~~~~~~~~~p~~~~apg~~~~~v~~al~----~~~~~~-~~~~~-~ 157 (390) .+.+..+..+. .+. +.++..+.....+.--.+..|--+.++-..++. 
....++ +-.++ + T Consensus 73 ~~~~~~~~~~~---~~~------------~~Av~~a~~~~s~E~V~v~~pv~t~~a~i~aa~~~a~el~~~~~Rpv~fil 137 (376) T protein:vir:37 73 FAHVYIAQEDG---YDF------------VECVKKANQTASFEYCVNTRYLGVDKASIGKLQECYAELLAKFGRRTFFIQ 137 (376) T ss_pred EEEEEeecCCc---hHH------------HHHHHHhhhhcCceEEEEeccccccHHHHHHHHHHHHHHHHhcCCeEEEEE Confidence 22222111111 111 111112211111111111112001122222222 333332 33333 3 Q ss_pred cccc-------cCchH----HHHHHhhhhccceEEEEeeeeEEEeeccCceeEecHHHHHHHHHhhhhhccceeecccCc Q lcl|NC_015266. 158 AAHG-------CKTKE----EAVAYRKQFGQREIMVIWPDWLGWDDITNSTVAIPAPAIAAGLRAKIDNDIGWHKTLSNV 226 (390) Q Consensus 158 d~~~-------~~~~~----~a~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~p~s~~vAg~~a~~d~~~g~~~span~ 226 (390) .... +.+.+ ...+-++++.+.+..++ |. .| + -..|.+||.+|+. ..-+..||... T Consensus 138 e~r~~~~~~~~~e~w~~y~~~~~al~~gia~~~V~~V-~~--~~----g-----n~~G~~aGRl~~a--aVsVadspgRV 203 (376) T protein:vir:37 138 AVQGINHDQSDGETWDQYVQKLTTLQQTIVADHVCLV-PL--LF----G-----NETGVLAGRLANR--AVTVADSPARV 203 (376) T ss_pred eccCcCcccccccCHHHHHHHHHHhhcccccccceee-ee--eh----h-----hhHHHHHHHHhhc--ccchhhCccce Confidence 3221 11222 22223344555544332 11 11 1 2367788887654 23357788765 Q ss_pred e---eecccccccccch-hhhccccccccccccceeEEEcC---CCEEEEccccCCC-CcccceeeehhhHHHHHHHHHH Q lcl|NC_015266. 227 V---VNGVTGISADVSW-DLQDPATDAGYLNENQVTTLVNR---NGFRFWGSRTCDA-DGKFFFENYTRSAQVIADTIAE 298 (390) Q Consensus 227 ~---l~gv~~~~~~~~~-~~~~~~~~~~~l~~~gI~~~~~~---~G~~~wG~rT~~~-d~~~~~i~vrR~~~~i~~~l~~ 298 (390) . |.|......+... .....+...+.|..+|..+.+.. .|+-+-.+||+.. .+++++|..+|+.|-+.|.++. 
T Consensus 204 ~tG~l~gl~~~~lp~d~~~~~l~~a~l~aLd~agy~vp~~Y~gy~G~Y~~d~~tl~~~gsDY~~ie~~RVvdKa~R~vR~ 283 (376) T protein:vir:37 204 QTGALVSLGSANKPLDKDRNELTLAHLKSLETARYSVPMWYPDYDGYYWADGRTLDVEGGDYQVIENLRVVDKVARKVRL 283 (376) T ss_pred eccccccccccccccCcCcccCCHHHHHHHHhCCCeEEEeeCCCCceEEeCceEeccCCCChhhhhhhhHHHHHHHHHHH Confidence 3 4444333332211 11222334456777788777653 4777667888865 4689999999999999998888 Q ss_pred HHHHHhcCC---CCHHHHHHHHHHHHHHHHHHHhCCceeee----EEEEecCCC-CHHHHhCCeEEEEEEEEecccceEE Q lcl|NC_015266. 299 EQMGVVDGP---LNPSRARDIIENINAWFRREVSVGELIGG----GAWYDPEPN-TTDELTSGGTWIDYDYTPVPPLENL 370 (390) Q Consensus 299 ~~~~~v~e~---n~~~~~~~i~~~i~~~L~~l~~~g~l~g~----~v~~d~~~n-t~~~i~~G~~~~~i~~~p~~p~e~i 370 (390) .+-+++... .++.-.+..+.-+..=|+++.+...+.|. +|.-..+.+ +..-+...++.+.+.+.|.--.+.| T Consensus 284 ~ai~~i~D~~lnst~~sia~~~~yi~~pLr~M~~s~~i~g~~fpGeI~~p~d~Di~i~w~s~~~V~I~~~v~P~~~pk~I 363 (376) T protein:vir:37 284 LAIGKIADRSFNSTTSSTEYHKNYFAKPLRDMSKSATINGKDFPGECMPPKDDAITIVWQSKTKVTIYIKVRPYDCPKEI 363 (376) T ss_pred HHHHHhCCcccCcchhhHHHHHHHHHHHHHHHHhcchhccccccceeecCCCCCceEEeeccceEEEEEEEEeccCCceE Confidence 777765532 24444566666677778988888777773 344322211 1222467888899999999999999 Q ss_pred EEEEEEcchHHHH Q lcl|NC_015266. 371 KLRQRITDRYLAD 383 (390) Q Consensus 371 ~~~~~~~~~~~~~ 383 (390) +..+..|-.=..+ T Consensus 364 tv~I~Ldlsn~~~ 376 (376) T protein:vir:37 364 TANIFLDLDSLGE 376 (376) T ss_pred EEEEEeecCCCCC Confidence 8777765442222 No 55 >protein:vir:3165 Length: 426 # NCBI annotation: capsid protein CP67 # Family: family:all:28419 # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665936;genbank:gi:22091122;genbank:GeneID:951267 Probab=98.28 E-value=1.5e-06 Score=52.51 Aligned_cols=356 Identities=13% Similarity=0.008 Sum_probs=178.5 Q ss_pred CCCccCCCEEEEECCCCCccccccccccceeeeccccccccceeccceEEEechhHHHHhhccccchHHHHHhhhcccCc Q lcl|NC_015266. 
1 MPQDYHHGVRVIEINEGGRPIRTVSTAVLGIVCTGADADPATFPLDTPVLLTNVIAALGKAGTKGTLRRTLDAIGKQTKP 80 (390) Q Consensus 1 Ma~~~~hGV~v~ev~~~~~~i~~v~tavig~vgta~~~~~~~~~~~~~~~i~~~~~~~~~~~~~gtl~~al~~~~~~~~~ 80 (390) |+. -|.=+++.-.+.++..-.-..+.++|+....++.. .++...+.++..+-...|+.....+.+...+|.|+-. T Consensus 1 m~~----~iVnV~Is~~t~A~~~~~Fg~~liigs~~~~~p~~-~f~~~~~Yss~~~V~~Dfg~~s~~Y~AA~~~f~Q~~~ 75 (426) T protein:vir:31 1 MPK----QIVEIELTAEIADRPQETFTDAAIVGTAEEEPPDA-EFGEVNQYSTSTSVGDDYGEDSDVYTASEAIEEMGAE 75 (426) T ss_pred CCc----ceEEEEeecccccccccccceeeeeeecccccccc-ccchhhhhhhHHHHHhcCCCChHHHHHHHHHHhCCce Confidence 994 44445667777777777888888999875444322 1344455666666667888888999999999988743 Q ss_pred eEEEEeeccc-------ccccccccc--chh--hhc--cchhhhhhhhhhhhhhhhhh---------------------- Q lcl|NC_015266. 81 VTVVVRVAEG-------KDEAETTAN--VIG--TVT--PDGKYTGMKALLAAQGKLAV---------------------- 125 (390) Q Consensus 81 ~~~vv~v~~~-------~~~~~~~~~--~~~--~~~--~~~~~tgl~~~~~~~~~~~~---------------------- 125 (390) .-... +.+. .+...+... +.+ +.. .....+++.+..+....... T Consensus 76 ~~r~~-v~~at~~~~~~~t~~~tv~g~~~s~~a~~~~~a~~i~~~~~~~~~~~~~~~~~~~~t~~g~~t~~~~~~~~~~s 154 (426) T protein:vir:31 76 QWRVM-VLEATEVTEEELSDGDTIDKVPILGNHEVESPDGDIEFTTDDDPDVEDFDAEIVINSATGDVATSEDSIELTYF 154 (426) T ss_pred eEEee-ccccceeeeccCCcceeecceeeeecccCcchHHHHHHhhccccccccceeeeEeccccceeeccccceeeeec Confidence 22221 1111 000000000 000 000 00001111111111111000 Q ss_pred ----------------hhhhhhhhhhcchHHHHHHHHhhhhcceEEeecccccC---chHHHHHHhhhhccceEEEEeee Q lcl|NC_015266. 126 ----------------KPRILVAPGLDTQPVAAAFATIAQSLRAMVYVAAHGCK---TKEEAVAYRKQFGQREIMVIWPD 186 (390) Q Consensus 126 ----------------~p~~~~apg~~~~~v~~al~~~~~~~~~~~~~d~~~~~---~~~~a~~~~~~~~~~~~~~~~p~ 186 (390) ....-.+.++....+.+.+....+.-+-+.+....... ..+.+..++.. ..-|.|. 
T Consensus 155 ~~dw~~~~~~~s~~~~~~ia~~~~~~~~~~~~~~~~~wa~~~~i~~va~~~e~~~~~~~~~~~a~~~~-----~~~y~p~ 229 (426) T protein:vir:31 155 HADWSQLDEFPSDVNNFAVADRRFDLKGVGVLDETHSWASDEDMGMIANGVNVDDYDSVDEAMDVAHE-----VAGYVPS 229 (426) T ss_pred cCcchhhhcccccchhhhhhccccchhhhhhhHhhhhhhhhcceeeeeeccchhhhcchhhhhhhhhc-----ccccccc Confidence 00000011111111111111111111111111111111 11122222221 1122232 Q ss_pred eEEEeeccCceeEecHHHHHHHHHhhhhhccceeecccCceeecccc-------cccccchhhhccccccccccccceeE Q lcl|NC_015266. 187 WLGWDDITNSTVAIPAPAIAAGLRAKIDNDIGWHKTLSNVVVNGVTG-------ISADVSWDLQDPATDAGYLNENQVTT 259 (390) Q Consensus 187 ~~~~~~~~~~~~~~p~s~~vAg~~a~~d~~~g~~~span~~l~gv~~-------~~~~~~~~~~~~~~~~~~l~~~gI~~ 259 (390) .......... .--..+.++|.++..+ ||..|.=..+.+... +..+..+..+ .++ .+ ++..|. T Consensus 230 ~~~~~~~~~~--~~~~~~~~~~~~aa~~----~~~~~~~~~~~~~~~~~~~~~~~gv~~t~~~~---~~A-~~-~~~~n~ 298 (426) T protein:vir:31 230 GDLMMIVDAS--DDDLAAYQLGKFAVSE----PWYNPLWNELPAGETVSKNVGDPEEQGTFEGG---DEA-EG-EGPVNV 298 (426) T ss_pred hhheeehhcc--ccchhhHHhhhhhhhc----cccchhhhhccccccceeeccccccccccchh---hhh-hh-cCCceE Confidence 1111100000 0012567888888877 466653222211111 1111111111 111 12 244567 Q ss_pred EEcCC-CEEEEccccC-CCCcccceeeehhhHHHHHHHHHHHHHHHhc---C-CCCHHHHHHHHHHHHHHHHHHHhCCc- Q lcl|NC_015266. 260 LVNRN-GFRFWGSRTC-DADGKFFFENYTRSAQVIADTIAEEQMGVVD---G-PLNPSRARDIIENINAWFRREVSVGE- 332 (390) Q Consensus 260 ~~~~~-G~~~wG~rT~-~~d~~~~~i~vrR~~~~i~~~l~~~~~~~v~---e-~n~~~~~~~i~~~i~~~L~~l~~~g~- 332 (390) +.... +..+|-.-|. .....-.||=++|..+|++..++..++..+= + |.+..-+..|+..|+.-|++.++.|. 
T Consensus 299 ~~~~~~~~~i~~~~~~~G~~~~G~~iD~~~g~dwl~~~iq~~l~~ll~~~~KIpyt~~Gi~~I~~~i~~~L~~~v~~~g~ 378 (426) T protein:vir:31 299 LIDVSDANRVSNAVTTAGADSDTSFFDIRRTKVYTAEMLELDLESLQVSDDDVPFTEDGQAMIEDAIKGTMSGLTGSVGQ 378 (426) T ss_pred EEEecCceeeecceeecccccchhhhhhHHHHHHHHHHHHHHHHHHhhcCCCCccchhHHHHHHHHHHHHHHHHhcCCCc Confidence 76543 4555544344 3345667999999999999999999987653 3 78888889999999999999998654 Q ss_pred -eeeeEEEEecCCCCHHHHhCCeEE-EEEEEEecccceEEEEEEEEcc Q lcl|NC_015266. 333 -LIGGGAWYDPEPNTTDELTSGGTW-IDYDYTPVPPLENLKLRQRITD 378 (390) Q Consensus 333 -l~g~~v~~d~~~nt~~~i~~G~~~-~~i~~~p~~p~e~i~~~~~~~~ 378 (390) +.+|.|..-....++.|..+-++. +++.....-.+.++.|+..... T Consensus 379 ~~~~y~v~~P~~~~~~~dra~R~~~~i~~~~~laGAIh~v~I~g~v~v 426 (426) T protein:vir:31 379 PLAEYEVDVPEWDDDDVDRVNRNWGGIDLDARLAQRAHTFSLGLNVSV 426 (426) T ss_pred cccceeecCCCccccchhhhhhccCCceEEEEEeCcEEEEEEEEEEeC Confidence 457888755444455566666665 7888888899999999888887 No 56 >protein:vir:78782 Length: 370 # NCBI annotation: putative tail sheath protein # Family: family:all:669 # MgeID: mge:1857 # MgeName: phiO18P # Cross-refs: genbank:acc:YP_001285652;genbank:gi:148727158;genbank:GeneID:5220132 Probab=98.24 E-value=1e-06 Score=53.42 Aligned_cols=341 Identities=14% Similarity=0.072 Sum_probs=179.5 Q ss_pred cCCCEEEEECCCCCccccccccccceeeeccccccccceeccceEEEechhHHHHhhc-cccchHHHHHhhhcccCceEE Q lcl|NC_015266. 5 YHHGVRVIEINEGGRPIRTVSTAVLGIVCTGADADPATFPLDTPVLLTNVIAALGKAG-TKGTLRRTLDAIGKQTKPVTV 83 (390) Q Consensus 5 ~~hGV~v~ev~~~~~~i~~v~tavig~vgta~~~~~~~~~~~~~~~i~~~~~~~~~~~-~~gtl~~al~~~~~~~~~~~~ 83 (390) ..|-|.|...+-+..++..+.-. +-|+|++.......+++|+ ..+.-..++ .+..|..-+.+.-.++|..-. 
T Consensus 1 ~~~~v~vn~~n~~~g~~~~~er~-~lfig~~~~~~g~~~~~~~------~sdld~~l~~~ds~lk~~v~aa~~naG~~~~ 73 (370) T protein:vir:78 1 MWPYVQIYNLNQMQGPVTEVERH-LLFIGSAASNTGKLLSLNA------QSDFDQLLGAADSELKANLLAARDNAGQNWS 73 (370) T ss_pred CCceEEEeeccccCCCcCcccee-EEEEecccccccceEeecC------ccCHHHhcCCcChhHHHHHHHHHhCCCCceE Confidence 55778888888888888888754 4588887754444333332 233223333 335677777777777766433 Q ss_pred EEeeccccccccccccchhhhccchhhhhhhhhhhhhhhhhhhhhhhhhhhh-cchHHHHHHHH----hhhhc-ceEEee Q lcl|NC_015266. 84 VVRVAEGKDEAETTANVIGTVTPDGKYTGMKALLAAQGKLAVKPRILVAPGL-DTQPVAAAFAT----IAQSL-RAMVYV 157 (390) Q Consensus 84 vv~v~~~~~~~~~~~~~~~~~~~~~~~tgl~~~~~~~~~~~~~p~~~~apg~-~~~~v~~al~~----~~~~~-~~~~~~ 157 (390) .....- . +..+ -+.++..+..... +..+..-|- +.++...++.+ ...++ +.++++ T Consensus 74 ~~~~p~--~---~~~d------------~~~Av~~a~~~~s--~E~V~v~~~~s~~a~~~a~~~~a~el~n~~~Rpv~fi 134 (370) T protein:vir:78 74 AAAYVL--P---TDKP------------WLDAARDAQQTQS--FEGVVVLGQEWHQAAINAAHALNQELIAKWGRWQFML 134 (370) T ss_pred EEEEEe--c---Cchh------------HHHHHHHHHhhCC--ccEEEEecCcchHHHHHHHHHHHHHHHHhcCCeEEEE Confidence 222110 0 0011 1223322222221 112222222 22333333333 33333 333333 Q ss_pred ccccc----Cc----hHHHHHHhhhhccceEEEEeeeeEEEeeccCceeEecHHHHHHHHHhhhhhccceeecccCceee Q lcl|NC_015266. 158 AAHGC----KT----KEEAVAYRKQFGQREIMVIWPDWLGWDDITNSTVAIPAPAIAAGLRAKIDNDIGWHKTLSNVVVN 229 (390) Q Consensus 158 d~~~~----~~----~~~a~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~p~s~~vAg~~a~~d~~~g~~~span~~l~ 229 (390) ....+ .+ ..+..+-++++.+.+..++--++. -.-|.+||.+|.. ..-+..+|.-.... 
T Consensus 135 le~~~~~~~e~w~~y~~~l~al~~gia~~~V~vvp~~~g------------~~~G~~aGRL~na--avsVadsP~Rv~tG 200 (370) T protein:vir:78 135 LAVPAIADEQDWATYEAELATLQDGIAASSVSLIPQLWP------------TLAGAYAGRLCNR--AVSIADSPCRVKTG 200 (370) T ss_pred EeecCCCCcCCHHHHHHHHHHhhhccccccceEEeeecc------------ccHHHHHHHHhcC--eeeecccceeeecc Confidence 22222 22 123344456677777666544321 1136778876552 22367788754322 Q ss_pred cccccc-ccc-chhhhccccccccccccceeEEEcC---CCEEEEccccCCC-CcccceeeehhhHHHHHHHHHHH-HHH Q lcl|NC_015266. 230 GVTGIS-ADV-SWDLQDPATDAGYLNENQVTTLVNR---NGFRFWGSRTCDA-DGKFFFENYTRSAQVIADTIAEE-QMG 302 (390) Q Consensus 230 gv~~~~-~~~-~~~~~~~~~~~~~l~~~gI~~~~~~---~G~~~wG~rT~~~-d~~~~~i~vrR~~~~i~~~l~~~-~~~ 302 (390) -+.++. .++ ...........+.|..+|-.+.+.. .|+-+-.+|||.. .+++++|..+|+.+-+.|.++.. +.. T Consensus 201 ~l~gl~~~p~d~~~~~l~~a~l~aLd~agy~vp~~Y~gy~G~Y~~d~~tl~~~gsDYq~ie~~RVvdKa~R~vR~~ai~~ 280 (370) T protein:vir:78 201 ALVGLGNKPVGKDGIPLPLATLQTLEANRYSVPMWYPDYDGIYWADGRTLDAEGGDYQVIENLRIAYKVARRMRLRAIAR 280 (370) T ss_pred ccccccccccccCCcccCHHHHHHHHhCCCeEEEeeCCCCceEEeCceEeccCCCChhhhhhhhHHHHHHHHHHHHHHHH Confidence 222221 111 1112223334556778888777653 4766667788865 46899999999999999999944 444 Q ss_pred HhcCCCCHH--HHHHHHHHHHHHHHHHHhCCceee--eEEEEecCCC---CHHHHhCCeEEEEEEEEecccceEEEEEEE Q lcl|NC_015266. 303 VVDGPLNPS--RARDIIENINAWFRREVSVGELIG--GGAWYDPEPN---TTDELTSGGTWIDYDYTPVPPLENLKLRQR 375 (390) Q Consensus 303 ~v~e~n~~~--~~~~i~~~i~~~L~~l~~~g~l~g--~~v~~d~~~n---t~~~i~~G~~~~~i~~~p~~p~e~i~~~~~ 375 (390) ..++-.++. .....+.-...=|+++...+.+.| |.-++....+ +..-+..+++.+.+.+.|.--.+.|+..+. T Consensus 281 i~D~~lnst~gsia~~~~~~~~~L~ema~s~~i~~~~fpgeI~~p~d~Di~i~w~s~~~v~I~~~v~P~~~pk~Itv~I~ 360 (370) T protein:vir:78 281 IGDRSFNSTPGSTAAAITYFGKDLREMAKSTTINGQPFPGDIASPQDGDIRIQWVAKNLVSVFVVVRTVDCPKGITVNIM 360 (370) T ss_pred hCCcccCCCCcchhHHHHHHHhhHHHHHhhhhhcccccceeEeccCCCcceEEeeccceEEEEEEEEeccCCceEEEEEE Confidence 445433322 223333334444555555777666 3334432221 112236678899999999988999998887 Q ss_pred EcchHHHHHH Q lcl|NC_015266. 
376 ITDRYLADFA 385 (390) Q Consensus 376 ~~~~~~~~l~ 385 (390) +|-..=++-- T Consensus 361 LDls~e~~~~ 370 (370) T protein:vir:78 361 LDLSLNNGEG 370 (370) T ss_pred EeeccccCCC Confidence 7654322211 No 57 >protein:vir:4517 Length: 498 # NCBI annotation: tail sheath protein # Family: family:all:369 # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599043;genbank:gi:19549001;genbank:GeneID:935236 Probab=98.18 E-value=3.4e-07 Score=56.01 Aligned_cols=357 Identities=17% Similarity=0.109 Sum_probs=170.8 Q ss_pred CCCcc--------CCCEEEEECCCCCccccccccccceeeeccccccccceeccceEEEechhHHHHhhccccchHHHHH Q lcl|NC_015266. 1 MPQDY--------HHGVRVIEINEGGRPIRTVSTAVLGIVCTGADADPATFPLDTPVLLTNVIAALGKAGTKGTLRRTLD 72 (390) Q Consensus 1 Ma~~~--------~hGV~v~ev~~~~~~i~~v~tavig~vgta~~~~~~~~~~~~~~~i~~~~~~~~~~~~~gtl~~al~ 72 (390) |.-.| .||+|++-.++.+.... ....+-++|-... ....+.++|+++.+..+....+|....+..-++ T Consensus 1 M~IsF~~IP~~iRvP~~y~E~dns~A~~~~--~~q~vLiiGq~la--~gs~~~~~~v~v~s~~~a~~lfG~GSml~~M~~ 76 (498) T protein:vir:45 1 MTISFNTIPSNTLVPLFYAEMDNQAANTAQ--DSGASLLIGHANN--GAEIVANSLVLMPSADYARQICGAGSQLARMVE 76 (498) T ss_pred CCCchhhcCcccccCeEEEEEeCCCCCCCC--CCcceEEEEecCC--ccccccceeEEecCHHHHHHhcCcCcHHHHHHH Confidence 66544 38999976555553222 2345566775432 344566889999988887777776555444333 Q ss_pred hhhcccCc-eEEEEee-------------------------------------ccccccc-----------------ccc Q lcl|NC_015266. 73 AIGKQTKP-VTVVVRV-------------------------------------AEGKDEA-----------------ETT 97 (390) Q Consensus 73 ~~~~~~~~-~~~vv~v-------------------------------------~~~~~~~-----------------~~~ 97 (390) .+.++... ..+++.+ ...++.. .+. 
T Consensus 77 a~~~~n~~~~l~~i~~~d~aG~aA~g~it~tg~at~~G~l~l~Igg~~v~v~V~~gdTaa~vA~al~aaina~~~lPVTA 156 (498) T protein:vir:45 77 AYRQTDPFGELYVIAVPEATGAAATVTLTVTGEATESGTVNVYVGRTRVQAPVTNGDNVTTIASSIQDAINAVPTLPFTA 156 (498) T ss_pred HHHHhCCcceEEEEeeCCcccceeEEEEEeecccCCCcEEEEEECCEEEEEEecCCCCHHHHHHHHHHHHhCCCCCceEE Confidence 33222111 1111111 1111100 000 Q ss_pred ccchhhhccchhhhhhh-----hhhhhhh-------hhhh------------hhh--------------hhhhhhhcchH Q lcl|NC_015266. 98 ANVIGTVTPDGKYTGMK-----ALLAAQG-------KLAV------------KPR--------------ILVAPGLDTQP 139 (390) Q Consensus 98 ~~~~~~~~~~~~~tgl~-----~~~~~~~-------~~~~------------~p~--------------~~~apg~~~~~ 139 (390) ....+-++.+....|.. .....+. -.++ .|. ++..| |++.. T Consensus 157 ~~~~~~VtlTAr~kG~~GN~I~l~~~~~~~~~ge~~p~Glt~~itamagGag~PD~a~alaal~~~~~~~I~~p-~~D~a 235 (498) T protein:vir:45 157 SSSAGVVTLTARHKGLCGNEIPVSLNYYGFGGGEVLPAGVQIAVATGTAGTGAPVLTGAVAAMADEPFDYIGLP-FNDTA 235 (498) T ss_pred EecCceEEEEeeccCccccceeEEEeeccccccccccceeeEEEEccCCCccCchhHHHHHHhccCCccEEEEe-eCCHH Confidence 00011122222111110 0000000 0000 000 01111 12222 Q ss_pred HHHHHHHhhh----------hcceEEeecccccCchHHHHHHhhhhccceEEEEeeeeEEEeeccCceeEecHH---HHH Q lcl|NC_015266. 140 VAAAFATIAQ----------SLRAMVYVAAHGCKTKEEAVAYRKQFGQREIMVIWPDWLGWDDITNSTVAIPAP---AIA 206 (390) Q Consensus 140 v~~al~~~~~----------~~~~~~~~d~~~~~~~~~a~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~p~s---~~v 206 (390) -..++..+.+ ++.++.+. ....+..+...+....++.+..+.+... ...-||- +++ T Consensus 236 sL~al~~~L~~~sgRw~~~~q~~g~~~~--a~~gT~~~l~t~g~~~N~~~it~~~~~~---------~~~sp~~~~AAa~ 304 (498) T protein:vir:45 236 SVNTLVTEMNDTSGRWSYARQLYGHVYT--AKTGTLSELVNAGDQFNQQHITLAGYEK---------ETQTPADELAASR 304 (498) T ss_pred HHHHHHHHHhhhhhhhhHHhhcCeEEEE--eccCCHHHHHHhhhccCCceEEEEecCC---------CCCChHHHHHHHH Confidence 2223333222 12222222 2333567777777777777765543210 1111332 333 Q ss_pred HHHHhhhhhccceeecccCceeecccccccccchhhhccccccccccccceeEEEcCCCE-EEEccccC-------CCCc Q lcl|NC_015266. 
207 AGLRAKIDNDIGWHKTLSNVVVNGVTGISADVSWDLQDPATDAGYLNENQVTTLVNRNGF-RFWGSRTC-------DADG 278 (390) Q Consensus 207 Ag~~a~~d~~~g~~~span~~l~gv~~~~~~~~~~~~~~~~~~~~l~~~gI~~~~~~~G~-~~wG~rT~-------~~d~ 278 (390) ||..+.. .+..|-..--...|.|+..+.. ..+...+|.|.|..+||.++.-+.|. .+-=..|. ..|+ T Consensus 305 aa~~A~~-l~~DPArPL~tl~L~Gi~~p~~----~~r~~~~ern~LL~~Gist~~V~~G~V~I~R~ITTY~~n~~G~~D~ 379 (498) T protein:vir:45 305 TARAAVF-IRNDPARPTQTGELVGMLPAPK----GKRFTMTEQQTLLSHGVATAYVESGVLRIQRDVTTYRKNAYGVADN 379 (498) T ss_pred HHHHHHH-hhcccccccCceeecceecCCc----hhcCChHHHHHHHhCCcceEEEcCCeEEEEeeeeeeeecCCCCcch Confidence 3333311 1222433444456777775543 33345667888889999999666773 22222222 2478 Q ss_pred ccceeeehhhHHHHHHHHHHHHHHH-hcCCCCHH-----------HHHHHHHHHHHHHHHHHhCCceeee---E----EE Q lcl|NC_015266. 279 KFFFENYTRSAQVIADTIAEEQMGV-VDGPLNPS-----------RARDIIENINAWFRREVSVGELIGG---G----AW 339 (390) Q Consensus 279 ~~~~i~vrR~~~~i~~~l~~~~~~~-v~e~n~~~-----------~~~~i~~~i~~~L~~l~~~g~l~g~---~----v~ 339 (390) .|..|+..|+.+|+.+.++..+... --+..... |-..|+..+-.-+++|..+|-+..+ + |+ T Consensus 380 syLDi~T~~tl~yvr~~~r~~i~~kfpR~KLa~dg~~~~~gq~IvTp~~ir~ell~~y~~le~~givEn~~~~~~~LiVe 459 (498) T protein:vir:45 380 SYLDSETLHTSAYVLRKLKSVITSKYGRHKLASDGTRFGPGQAIVTPAVIKGELLATYRQLERAGIVENYELFKQYLVVE 459 (498) T ss_pred hhhhhhhHHHHHHHHHHHHHHhhhhcCCeeecccCcccCCCCcccchHHHHHHHHHHHHhhhhhccccChhhhcceeEEE Confidence 8999999999999999999888743 22232222 6778899999999999998887653 2 33 Q ss_pred EecCCCCHHHHhCCeEEEEEEEEecccceEEEEEEEEcchHHHHHHHHhcC Q lcl|NC_015266. 
340 YDPEPNTTDELTSGGTWIDYDYTPVPPLENLKLRQRITDRYLADFASRVSA 390 (390) Q Consensus 340 ~d~~~nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~l~~~~~a 390 (390) -|.++ .+++.+.+-...+-+..-+-.++++.-+|- +-+| T Consensus 460 rd~~d-------pnRln~~~p~d~vn~L~V~A~~~~f~lq~~-----~~~~ 498 (498) T protein:vir:45 460 RDASV-------PNRLNTLFPPDYVNQLRVFAVVNQFRLQYS-----EESA 498 (498) T ss_pred ECCCC-------CcEEEEEecccccCchhhhhhhhhhheehh-----hcCC Confidence 33222 245555544444444443222222222222 2222 No 58 >protein:vir:276 Length: 369 # NCBI annotation: putative tail sheath protein # Family: family:all:669 # MgeID: mge:7 # MgeName: K139 # Cross-refs: genbank:acc:NP_536656;genbank:gi:17975134;genbank:GeneID:929090 Probab=98.16 E-value=3.4e-06 Score=50.55 Aligned_cols=334 Identities=14% Similarity=0.048 Sum_probs=178.2 Q ss_pred CCCccCCCEEEEECCCCCccccccccccceeeeccc--cccccceeccceEEEechhHHHHhhc-cccchHHHHHhhhcc Q lcl|NC_015266. 1 MPQDYHHGVRVIEINEGGRPIRTVSTAVLGIVCTGA--DADPATFPLDTPVLLTNVIAALGKAG-TKGTLRRTLDAIGKQ 77 (390) Q Consensus 1 Ma~~~~hGV~v~ev~~~~~~i~~v~tavig~vgta~--~~~~~~~~~~~~~~i~~~~~~~~~~~-~~gtl~~al~~~~~~ 77 (390) |+- |-|.|.+.+-+..++..+. ..+.|+|+.. ......+++ ....+.-..++ .+..|..-+.+...+ T Consensus 1 m~~---~~V~in~~n~~qg~~~~ve-r~~lfig~g~~~~~~g~~~~~------~~~sdld~~lg~~ds~lk~~v~aa~~n 70 (369) T protein:vir:27 1 MAW---PTVIIKILNLMNGPIADIE-CHFLFVIRGTVSGEVRNLIMV------DSTSDLDDVLAEASAEGLAIVKAAQLN 70 (369) T ss_pred CCC---CceEEecccccCCCccccc-ceEEEEEeccccccccceEEe------cCccchHhhcCCcChhHHHHHHHHHhC Confidence 775 5588888888888777776 4556785443 333333333 33333222333 334578888888888 Q ss_pred cCceEEEEeeccccccccccccchhhhccchhhhhhhhhhhhhhhhhhhhhhhhhhhh-cchH----HHHHHHHhhhhc- Q lcl|NC_015266. 78 TKPVTVVVRVAEGKDEAETTANVIGTVTPDGKYTGMKALLAAQGKLAVKPRILVAPGL-DTQP----VAAAFATIAQSL- 151 (390) Q Consensus 78 ~~~~~~vv~v~~~~~~~~~~~~~~~~~~~~~~~tgl~~~~~~~~~~~~~p~~~~apg~-~~~~----v~~al~~~~~~~- 151 (390) +|..-.. .+. +... ..+. ..++..+..... 
+..+..-+- +.++ .++.......++ T Consensus 71 aG~~w~a-~~~-p~~~---~~~~------------~~Av~~a~~~~s--~E~V~v~~p~t~~a~i~aaq~~a~el~~~~~ 131 (369) T protein:vir:27 71 GKQAWTA-GVM-ILSE---EDNW------------QDAVKKANEVSS--FEFVVLGFDAETKAMIEDAITLRTELKNSLG 131 (369) T ss_pred CCCceEE-EEE-EeCC---chhH------------HHHHHhhhhhCC--ccEEEEecCcccHHHHHHHHHHHHHHHHhcC Confidence 7764322 211 1111 1111 112222221111 111222222 1222 222233333333 Q ss_pred ceEEee-ccc-----c--cCc----hHHHHHHhhhhccceEEEEeeeeEEEeeccCceeEecHHHHHHHHHhhhhhccce Q lcl|NC_015266. 152 RAMVYV-AAH-----G--CKT----KEEAVAYRKQFGQREIMVIWPDWLGWDDITNSTVAIPAPAIAAGLRAKIDNDIGW 219 (390) Q Consensus 152 ~~~~~~-d~~-----~--~~~----~~~a~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~p~s~~vAg~~a~~d~~~g~ 219 (390) +.++++ ..+ + +.+ .....+-++++.+.+..++.-++... .-.|.+||.+|.. ..-+ T Consensus 132 R~vffi~e~~~~~~~~~~~e~w~dy~a~l~al~~g~a~~~V~vv~~~~~~g----------n~~G~~aGRl~n~--aVsI 199 (369) T protein:vir:27 132 REVGVLCQLPAINNDPTNGQTWSEWLADTVDIPKDVASEYISVVPNVHAAG----------DTLGKYAGRLANK--EVSI 199 (369) T ss_pred CeEEEEEeccccCCCccccCCHHHHHHHHHHHhhccCcccceeeeeecccc----------chHHHHHHHHHhc--ccch Confidence 333333 211 1 111 13344456677888877763332211 2467788888763 2336 Q ss_pred eecccCceeeccccccc-cc-chhhhccccccccccccceeEEEcC---CCEEEEccccCCC-CcccceeeehhhHHHHH Q lcl|NC_015266. 220 HKTLSNVVVNGVTGISA-DV-SWDLQDPATDAGYLNENQVTTLVNR---NGFRFWGSRTCDA-DGKFFFENYTRSAQVIA 293 (390) Q Consensus 220 ~~span~~l~gv~~~~~-~~-~~~~~~~~~~~~~l~~~gI~~~~~~---~G~~~wG~rT~~~-d~~~~~i~vrR~~~~i~ 293 (390) ..||....-..+.++.. +. .......+.....|...|..+.+.. .|+-+-.+||+.. .+++++|..+|..|-+. 
T Consensus 200 adsp~RVktG~l~g~~~~p~d~~g~~l~~a~l~aLd~agysvp~~Y~gy~G~Yw~d~~tl~~~gsDYq~iE~~RVvdKa~ 279 (369) T protein:vir:27 200 ADSPARVQTGSVLGNTELMKDKAGKALDLATLKALESNRIAVPMWYPDYPGQYWTTGRTLDVPGGDYQDIRHIRVAMKAA 279 (369) T ss_pred hcCcceeeecccccccccccCCCCcccCHHHHHHHHhCCCeEEEeeCCCCceEEeCceEeccCCCCeehhhhhhHHHHHH Confidence 77887764333333321 11 1111122234445777888777653 4766667788865 46899999999999999 Q ss_pred HHHHHHHHHHhcC---CCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEecCCCCHHHH-----hCCeEEEEEEEEecc Q lcl|NC_015266. 294 DTIAEEQMGVVDG---PLNPSRARDIIENINAWFRREVSVGELIGGGAWYDPEPNTTDEL-----TSGGTWIDYDYTPVP 365 (390) Q Consensus 294 ~~l~~~~~~~v~e---~n~~~~~~~i~~~i~~~L~~l~~~g~l~g~~v~~d~~~nt~~~i-----~~G~~~~~i~~~p~~ 365 (390) |.++...-+.+.. +.++.-....+.-+..=|+++.+.+ ..+++.-.++ +|| ...++.+-+.+.|.- T Consensus 280 R~vR~~Ai~~i~Dr~lnstp~sia~~~~~~~~pLr~M~ks~--fpgei~~P~d----~dI~i~w~~k~~V~I~~~vrP~~ 353 (369) T protein:vir:27 280 RKVRIRAIARIADRTLNSTPQSIAAAKLYFTQDLRTMALTG--VPGEIYPPED----EDIQIKWVNSTDVEIYMSVQPYE 353 (369) T ss_pred HHHHHHHHHHhcCcccccChhHHHHHHHHHhhHHHHHHhhc--CCeEEecCCC----CceEEEeeccceEEEEEEEeecc Confidence 9888777665553 3445556667777777788886553 2223332111 123 445677777788888 Q ss_pred cceEEEEEEEEcchHH Q lcl|NC_015266. 366 PLENLKLRQRITDRYL 381 (390) Q Consensus 366 p~e~i~~~~~~~~~~~ 381 (390) -.+.|+..+..|-.-+ T Consensus 354 ~pk~it~~I~ldl~~~ 369 (369) T protein:vir:27 354 CPVKITIAISVKQGDY 369 (369) T ss_pred CCceEEEEEEEeccCC Confidence 8889999999865433 No 59 >protein:vir:4463 Length: 498 # NCBI annotation: Tail Sheath protein # Family: family:all:369 # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700386;genbank:gi:23505458;genbank:GeneID:955665 Probab=98.15 E-value=3.8e-07 Score=55.77 Aligned_cols=360 Identities=16% Similarity=0.101 Sum_probs=170.3 Q ss_pred CCCcc--------CCCEEEEECCCCCccccccccccceeeeccccccccceeccceEEEechhHHHHhhccccchHHHHH Q lcl|NC_015266. 
1 MPQDY--------HHGVRVIEINEGGRPIRTVSTAVLGIVCTGADADPATFPLDTPVLLTNVIAALGKAGTKGTLRRTLD 72 (390) Q Consensus 1 Ma~~~--------~hGV~v~ev~~~~~~i~~v~tavig~vgta~~~~~~~~~~~~~~~i~~~~~~~~~~~~~gtl~~al~ 72 (390) |.-.| .||+|++-.++.+.. ......+-++|.... ....+.++|+++.+..+....+|....+..-++ T Consensus 1 M~IsF~~IP~~iRvP~~y~E~dns~A~~--~~~~q~vLiiGq~la--~gs~~~~~~v~v~s~~~a~~~fG~GSml~~M~~ 76 (498) T protein:vir:44 1 MAISFNSIPSDTRVPLFYAEMDNSAANT--ARDSGASLLIGHASN--DASIAVNSLVLVSSVDYARQICGAGSQLARMVG 76 (498) T ss_pred CCCchhhcCcccccCeEEEEEeCCCCCC--CcCCcceEEEEecCc--ccccccceeEeecCHHHHHHhcCcccHHHHHHH Confidence 66554 389999755545532 223344556775432 245667899999998888777776555544443 Q ss_pred hhhcccC-ceEEEEeec-------------------------------------ccccccc-----------------cc Q lcl|NC_015266. 73 AIGKQTK-PVTVVVRVA-------------------------------------EGKDEAE-----------------TT 97 (390) Q Consensus 73 ~~~~~~~-~~~~vv~v~-------------------------------------~~~~~~~-----------------~~ 97 (390) .+.++.. ...+++.+. ..++... +. T Consensus 77 a~~~~n~~~~l~~i~~~D~aG~aAtg~it~tg~at~~G~l~l~Igg~~v~v~V~~gdTaa~vA~al~aaina~~~lPVTA 156 (498) T protein:vir:44 77 AYRKTDPFGELYVIAVPESTGAAATVALTVTGEATETGTVNVYTGRTRVQAPVTSGDDAAAVAVSIKDAVNANPDLPFTA 156 (498) T ss_pred HHHHhCCCceeEEEecCCcccceeEEEEEeecccCCCcEEEEEECCEEEEEEecCCCCHHHHHHHHHHHHhCCCCCceEE Confidence 3332211 111111111 1111000 00 Q ss_pred ccchhhhccchhhhhh-----hhhhhhhhh--hhhhhhhh------hhhhhcchHHHHHHHHhhhhcceEEeec------ Q lcl|NC_015266. 98 ANVIGTVTPDGKYTGM-----KALLAAQGK--LAVKPRIL------VAPGLDTQPVAAAFATIAQSLRAMVYVA------ 158 (390) Q Consensus 98 ~~~~~~~~~~~~~tgl-----~~~~~~~~~--~~~~p~~~------~apg~~~~~v~~al~~~~~~~~~~~~~d------ 158 (390) .-..+.++.+....|. ......+.. -...|.-+ ++-|--...+..+|..+.+....+.+.. 
T Consensus 157 ~~~~~~vtlTAr~kG~~GN~I~l~~~~~~~~~ge~~p~Glt~titamsgGag~PDia~alaal~~~~~~~i~~p~~D~as 236 (498) T protein:vir:44 157 TSEAGVVTLTARHKGLYGNEIPVTLNYYGFGGGEVLPAGVNITVASGVKGAGAPALNDAVAAMGDEPFDYIGLPFNDTAS 236 (498) T ss_pred eeccceEEEEEeccCcccCcceEEEeeccCccccccccceeEEEEcccCCccCchhHHHHHhhccCCccEEEEeecCHHH Confidence 0001112222221221 111110000 00000000 0011111112222222222222221111 Q ss_pred --------------------------ccccCchHHHHHHhhhhccceEEEEeeeeEEEeeccCceeEecHHH---HHHHH Q lcl|NC_015266. 159 --------------------------AHGCKTKEEAVAYRKQFGQREIMVIWPDWLGWDDITNSTVAIPAPA---IAAGL 209 (390) Q Consensus 159 --------------------------~~~~~~~~~a~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~p~s~---~vAg~ 209 (390) .....+..++..+....++.+..+.+.. .+ ..-|+-. ++||. T Consensus 237 l~al~~~L~~~sgRw~~~~q~~g~~~~a~~gT~a~l~t~g~~~N~~~it~~~~~-------~~--~~sp~~~~AAa~a~~ 307 (498) T protein:vir:44 237 VNSMATEMNDSSGRWSYVRQLYGHVYTAKTGTLSELVAAGDQFNLQHITLAGYE-------KD--TQTPADELAASRTAR 307 (498) T ss_pred HHHHHHHHhhhhcchHHHhhcCeEEEEeccCCHHHHHHhhhccCCceEEEEecC-------CC--CCCHHHHHHHHHHHH Confidence 1223345666666666666665543221 01 0113322 33333 Q ss_pred HhhhhhccceeecccCceeecccccccccchhhhccccccccccccceeEEEcCCCE-EEEccccC-------CCCcccc Q lcl|NC_015266. 210 RAKIDNDIGWHKTLSNVVVNGVTGISADVSWDLQDPATDAGYLNENQVTTLVNRNGF-RFWGSRTC-------DADGKFF 281 (390) Q Consensus 210 ~a~~d~~~g~~~span~~l~gv~~~~~~~~~~~~~~~~~~~~l~~~gI~~~~~~~G~-~~wG~rT~-------~~d~~~~ 281 (390) .+.. .+..|-..--...|.|+..+.. ..+...+|.|.|..+||.++.-+.|. .+-=..|. ..|+.|. T Consensus 308 aA~~-l~~DPArPL~tl~L~Gi~~p~~----~~r~~~~ern~LL~~Gist~~V~~G~V~I~R~ITTY~~n~~G~~D~syL 382 (498) T protein:vir:44 308 AAVF-IRNDPARPTQTGELVDMLPAPK----GKRFTTTEQQTLLSHGVATAYVESGVLRIQRDITTYRKNAYGVADNSYL 382 (498) T ss_pred HHHH-hhcccccccCceeecccccCCc----hhcCChHHHHHHHhcCcceEEEcCCeEEEEeeeeeeeecCCCCcchhhh Confidence 3311 1222433444456777775543 33345667888888999999666773 22222222 2478899 Q ss_pred eeeehhhHHHHHHHHHHHHHH-HhcCCCCH-----------HHHHHHHHHHHHHHHHHHhCCceeee---E----EEEec Q lcl|NC_015266. 
282 FENYTRSAQVIADTIAEEQMG-VVDGPLNP-----------SRARDIIENINAWFRREVSVGELIGG---G----AWYDP 342 (390) Q Consensus 282 ~i~vrR~~~~i~~~l~~~~~~-~v~e~n~~-----------~~~~~i~~~i~~~L~~l~~~g~l~g~---~----v~~d~ 342 (390) .|+..|+.+|+.+.++..+.. |--+.... .|-..|+..+-.-+++|..+|-+..+ + |+-|. T Consensus 383 Di~T~~tl~yvr~~~r~~i~~kfpR~KLa~d~~~~~~gq~IvTp~~ir~eli~~y~~le~~givEn~~~~~~~LiVerd~ 462 (498) T protein:vir:44 383 DSETLHTSAYVLRRLKSVITSKYGRHKLANDGTRFGSGQAIVTPAVIRGELGSTYRQMEREGIVENFDLFQQHLIVERNA 462 (498) T ss_pred hhhhHHHHHHHHHHHHHHhhhhcCCcccccCCcccCCCcccccHHHHHHHHHHHHHhhhhhccccChhhhcceeEEEECC Confidence 999999999999999988763 22223222 26788999999999999998887653 2 33332 Q ss_pred CCCCHHHHhCCeEEEEEEEEecccceEEEEEEEEcchHHHHHHHHhcC Q lcl|NC_015266. 343 EPNTTDELTSGGTWIDYDYTPVPPLENLKLRQRITDRYLADFASRVSA 390 (390) Q Consensus 343 ~~nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~l~~~~~a 390 (390) + +.+++.+.+-...+-+..-+-.++++.-+|-+ -+| T Consensus 463 ~-------dpnRln~~~p~d~vn~L~V~A~~~~f~lq~~~-----~~~ 498 (498) T protein:vir:44 463 N-------DSNRLDVLFPPDYVNQLRVFAVLNQFRLQYSE-----EAA 498 (498) T ss_pred C-------CCcEEEEEecccccCchhhhhhhhhhhhhhhh-----hcC Confidence 2 12555555555545554443333333333322 222 No 60 >protein:vir:489 Length: 498 # NCBI annotation: putative sheath protein # Family: family:all:369 # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543096;swissprot:trembl:q8w623;genbank:gi:18249908;uniprot:Q8W623;genbank:GeneID:929698 Probab=98.04 E-value=8.7e-07 Score=53.82 Aligned_cols=354 Identities=17% Similarity=0.122 Sum_probs=171.6 Q ss_pred CCCcc--------CCCEEEEECCCCCccccccccccceeeeccccccccceeccceEEEechhHHHHhhccccchHHHHH Q lcl|NC_015266. 1 MPQDY--------HHGVRVIEINEGGRPIRTVSTAVLGIVCTGADADPATFPLDTPVLLTNVIAALGKAGTKGTLRRTLD 72 (390) Q Consensus 1 Ma~~~--------~hGV~v~ev~~~~~~i~~v~tavig~vgta~~~~~~~~~~~~~~~i~~~~~~~~~~~~~gtl~~al~ 72 (390) |.-.| .||+|++-.++.+..... ...+-++|.... 
....+.++|+++.+..+....+|....+..-++ T Consensus 1 M~IsF~~IP~~iRvP~~y~E~dns~A~~~~~--~qrvLiiGq~la--~gt~~~~~~v~v~s~~~a~~~fG~GS~l~~M~~ 76 (498) T protein:vir:48 1 MTISFSAVPSDTLVPLFYAEMDNSAANTAVT--SAPALLIGHASN--DAAIEVNSLVLMPSADYARQICGAGSQLARMVD 76 (498) T ss_pred CCccccccCcccccceEEEEEecCCCccccC--CcceEEEeecCc--cccccccceEEecCHHHHHHhcCcccHHHHHHH Confidence 66544 379999766666654333 245667775432 245567889999988887777776555444333 Q ss_pred hhhcccC-ceEEEEeecc-------------------------------------ccccc-----------------ccc Q lcl|NC_015266. 73 AIGKQTK-PVTVVVRVAE-------------------------------------GKDEA-----------------ETT 97 (390) Q Consensus 73 ~~~~~~~-~~~~vv~v~~-------------------------------------~~~~~-----------------~~~ 97 (390) .+.++.. ...+++.+.+ .++.. .+. T Consensus 77 a~~~~n~~~~l~~i~~~D~ag~aA~g~it~tg~at~~G~l~l~Igg~~v~v~V~~gdTaa~vA~al~aai~a~~~lPVTA 156 (498) T protein:vir:48 77 VYRQTDPFGELYVIAVPEARGAAATVRVTVTGEAEESGTLSLYVGRSSVQVPVVNGDDATAVATAIKEAVNGVITLPFAA 156 (498) T ss_pred HHHHhCCCceeEEEeeCCcccceeEEEEEecccccCCceEEEEECCEEEEEeecCCCCHHHHHHHHHHHHhCCCCcceEE Confidence 3322211 1111112111 11100 000 Q ss_pred ccchhhhccchhhhh-----hhhhhhhhhh-------hhh------------hhh--------------hhhhhhhcchH Q lcl|NC_015266. 98 ANVIGTVTPDGKYTG-----MKALLAAQGK-------LAV------------KPR--------------ILVAPGLDTQP 139 (390) Q Consensus 98 ~~~~~~~~~~~~~tg-----l~~~~~~~~~-------~~~------------~p~--------------~~~apg~~~~~ 139 (390) .-..+.++.+....| +......+.. .++ .|. ++..| |++.. T Consensus 157 ~~~~~~VtlTAr~kG~~GN~I~l~~~~~~~~~ge~~p~Glt~~itamsgGag~PDia~aLaal~~~~~~~I~~p-~~D~a 235 (498) T protein:vir:48 157 SSDAGVVTLTARHKGLYGNELPVCLNYYGSGGGEILPAGLQVVTEAGTAGSGAPDLTAAVAAMGDEAFDFIGLP-FNDAA 235 (498) T ss_pred EecCcEEEEEeeecccccccceeeeeeccCcccccccceeeEEEEcccCCccCcchHHHHHhhccCCccEEEEe-ecCHH Confidence 000111222222111 1111111000 000 000 01111 11222 Q ss_pred HHHHHHHhhh----------hcceEEeecccccCchHHHHHHhhhhccceEEEEeeeeEEEeeccCceeEecHHH---HH Q lcl|NC_015266. 
140 VAAAFATIAQ----------SLRAMVYVAAHGCKTKEEAVAYRKQFGQREIMVIWPDWLGWDDITNSTVAIPAPA---IA 206 (390) Q Consensus 140 v~~al~~~~~----------~~~~~~~~d~~~~~~~~~a~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~p~s~---~v 206 (390) -..++..+++ ++.++.+. ....+..+...+....++.+..+.+.. + ...-|+.. +. T Consensus 236 sl~al~~~L~~~sgRw~~~~q~~g~~~~--a~~gT~~~l~t~g~~~N~~~it~~~~~--------~-~~~~p~~~~AAa~ 304 (498) T protein:vir:48 236 SINMMMTEMNDSSGRWSYARQLYGHVYT--AKLGTLSELVNAGDMHNQQHITLAGYE--------K-ETQSPVDELVASR 304 (498) T ss_pred HHHHHHHHHhhhhhhhhHHhhcCeEEEE--eccCCHHHHHHhhhccCCceEEEEecC--------C-CCCChHHHHHHHH Confidence 2223333221 11222222 233356677777777777766544311 1 11123322 33 Q ss_pred HHHHh---hhhhccceeecccCceeecccccccccchhhhccccccccccccceeEEEcCCCE-EEEccccC-------C Q lcl|NC_015266. 207 AGLRA---KIDNDIGWHKTLSNVVVNGVTGISADVSWDLQDPATDAGYLNENQVTTLVNRNGF-RFWGSRTC-------D 275 (390) Q Consensus 207 Ag~~a---~~d~~~g~~~span~~l~gv~~~~~~~~~~~~~~~~~~~~l~~~gI~~~~~~~G~-~~wG~rT~-------~ 275 (390) |++.+ ..| |-..--...|.|+..+.. ..+...+|.|.|..+||.++.-.+|. .+-=..|. . T Consensus 305 a~~aA~~l~~D----PArPLqtl~L~Gi~~p~~----~~r~~~~ern~LL~~Gist~~V~~G~V~I~R~ITTY~~n~~G~ 376 (498) T protein:vir:48 305 LAREAVFIRND----PARPTQTGELVGMLPAPK----GKRFIMTEQQTLLSHGVATAYVEGGTLRIQRSVTTYKKNAYGV 376 (498) T ss_pred HHHHHHhhhcc----ccccccceeeeccccCCc----hhcCChHHHHHHHhcCcceEEEcCCeEEEEeeeeeeeecCCCC Confidence 33332 333 433333456777775543 33445667888889999998555563 22222222 2 Q ss_pred CCcccceeeehhhHHHHHHHHHHHHHH-HhcCCCCHH-----------HHHHHHHHHHHHHHHHHhCCceeee---E--- Q lcl|NC_015266. 276 ADGKFFFENYTRSAQVIADTIAEEQMG-VVDGPLNPS-----------RARDIIENINAWFRREVSVGELIGG---G--- 337 (390) Q Consensus 276 ~d~~~~~i~vrR~~~~i~~~l~~~~~~-~v~e~n~~~-----------~~~~i~~~i~~~L~~l~~~g~l~g~---~--- 337 (390) .|+.|..|+..|+.+|+.+.++..+.. 
|--+....+ |-..|+..+-.-+++|..+|-+..+ + T Consensus 377 ~D~syLDi~T~~tl~yvr~~~r~~i~~kfpR~KLa~dg~~~~~gq~IvTp~~ir~eli~~y~~le~~given~~~~~~~L 456 (498) T protein:vir:48 377 ADNSYLDSETLHTSAYVLRKLKSVITSKYGRHKLANDGTRFGPGQAIVTPAVIKGELLATYRQMERAGIVENYDLFKQYL 456 (498) T ss_pred cchhhhhhhhHHHHHHHHHHHHHHhhhhcCCceecccCcccCCCCcccchHHHHHHHHHHHHhhhhhccccChhhhccee Confidence 478899999999999999999988764 322333332 6778999999999999998887653 2 Q ss_pred -EEEecCCCCHHHHhCCeEEEEEEEEecccceEEEEEEEEcchHHHHHHHHhcC Q lcl|NC_015266. 338 -AWYDPEPNTTDELTSGGTWIDYDYTPVPPLENLKLRQRITDRYLADFASRVSA 390 (390) Q Consensus 338 -v~~d~~~nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~l~~~~~a 390 (390) |+-|.++ .+++.+.+-...+-+..-+-.++++.-+| ++-+| T Consensus 457 iVerd~~d-------pnRln~~~p~d~vn~L~V~A~~~~f~lq~-----~~~~~ 498 (498) T protein:vir:48 457 IVERDADN-------PNRLNTLFPPDYVNQLRVFAVVNQFRLQY-----SEESA 498 (498) T ss_pred EEEECCCC-------CcEEEEEecccccCchhhhhhhhhhhhhh-----hhcCC Confidence 3333222 24555555444444444332233332222 22233 No 61 >protein:vir:1996 Length: 495 # NCBI annotation: major tail subunit # Family: family:all:369 # MgeID: mge:320 # MgeName: Mu # Cross-refs: genbank:acc:NP_050643;genbank:gi:9633530;genbank:GeneID:2636269 Probab=97.31 E-value=0.0001 Score=42.52 Aligned_cols=353 Identities=13% Similarity=0.040 Sum_probs=173.4 Q ss_pred CCC--------cc-CCCEEEEECCCCCccccccccccceeeeccccccccceeccceEEEechhHHHHhhccccchHHHH Q lcl|NC_015266. 1 MPQ--------DY-HHGVRVIEINEGGRPIRTVSTAVLGIVCTGADADPATFPLDTPVLLTNVIAALGKAGTKGTLRRTL 71 (390) Q Consensus 1 Ma~--------~~-~hGV~v~ev~~~~~~i~~v~tavig~vgta~~~~~~~~~~~~~~~i~~~~~~~~~~~~~gtl~~al 71 (390) |+. .+ .||+|++--++.+..-.......+-++|.... 
....+.++|+++.+..+....+|....+..-+ T Consensus 1 m~~i~F~~IP~~iRvP~~y~E~dns~A~~g~~~~~q~vLiiGq~la--~gs~~~~~pv~v~s~~~a~~~fG~GS~la~M~ 78 (495) T protein:vir:19 1 MSDISFNAIPSDVRVPLTYIEFDNSNAVSGTPAPRQRVLMFGQSGS--KASAAPNVPVRIRSGSQASAAFGQGSMLALMA 78 (495) T ss_pred CCCCchhhCCcccccCeEEEEEccCCCCcCCcCCCceEEEEEecCc--ccccccceeEEecCHHHHHHhcCcCcHHHHHH Confidence 665 22 38999976666554333333455566775322 24556789999999888877777766555544 Q ss_pred Hhhhcc-cCceEEEEeeccccccccccc-cc-------------h----------------------------------- Q lcl|NC_015266. 72 DAIGKQ-TKPVTVVVRVAEGKDEAETTA-NV-------------I----------------------------------- 101 (390) Q Consensus 72 ~~~~~~-~~~~~~vv~v~~~~~~~~~~~-~~-------------~----------------------------------- 101 (390) +.+.+. .-...+++.+.+....+.+.. .+ + T Consensus 79 ~a~~~~n~~~~l~~i~~~D~aG~aA~g~it~tg~at~~G~l~l~I~g~~v~v~V~~gdTaa~vA~al~aaina~~~lPvT 158 (495) T protein:vir:19 79 DAFLNANRVAELWCIPQGNGTGNAAVGEISLSGTAGENGSLVTYIAGQRLAVSVAAGATGAALADLLVARIKGQPDLPVT 158 (495) T ss_pred HHHHHhCCcceEEEEeeCChhhceeEEEEEEeecCCCCcEEEEEECCEEEEEEecCCCCHHHHHHHHHHHhcCCccCceE Confidence 443332 222233333332211110000 00 0 Q ss_pred -------------hhhccchhhhhhhhhhhhhhhh-----hhhhhhh------hhhhhcchHHHHHHHHhhhhcceEEee Q lcl|NC_015266. 102 -------------GTVTPDGKYTGMKALLAAQGKL-----AVKPRIL------VAPGLDTQPVAAAFATIAQSLRAMVYV 157 (390) Q Consensus 102 -------------~~~~~~~~~tgl~~~~~~~~~~-----~~~p~~~------~apg~~~~~v~~al~~~~~~~~~~~~~ 157 (390) +-++.+...+| +. .+..... ...|.-+ +.-|--...+..+|..+.+....+.+. T Consensus 159 A~~~~~~~~~~a~~~VtlTAr~kG-~~-n~idi~~~~~~ge~~p~Glt~titamsgGag~PDia~alaal~~~~~~~I~~ 236 (495) T protein:vir:19 159 AEVRADSGDDDTHADVVLSAKFTG-AL-SAVDVRWNYYAGETTPYGIITAFKAASGKNGNPDISASIAGMGDLQYKYIVM 236 (495) T ss_pred EEeeccCCCCcCceeEEEEEeecc-cc-ccceeEEEeecccccccceeEEEEecCCCCCCcchHHHHHHhccCCCcEEEE Confidence 00011111111 00 0000000 0011000 001111223555555555555444333 Q ss_pred ccccc-----------------------------CchHHHHHHhhhhccceEEEEeeeeEEEeeccCceeEecHHHHHHH Q lcl|NC_015266. 
158 AAHGC-----------------------------KTKEEAVAYRKQFGQREIMVIWPDWLGWDDITNSTVAIPAPAIAAG 208 (390) Q Consensus 158 d~~~~-----------------------------~~~~~a~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~p~s~~vAg 208 (390) ...+. .+..+...+....++.+..+.+- ++ ..-||....|+ T Consensus 237 P~tD~asL~al~~~l~~rw~~~~q~~g~~~~a~~gT~~~l~t~g~~~N~~~it~~~~--------~g--sp~~~~~~AAA 306 (495) T protein:vir:19 237 PYTDEPNLNLLRTELQERWGPVNQADGFAVTVLSGTYGDISTFGVSRNDHLISCMGI--------AG--APEPSYLYAAT 306 (495) T ss_pred ecCcHHHHHHHHHHHHHhhhHHHhcCeEEEEeecCCHHHHHHhhhccCCceEEEEec--------CC--CCCcHHHHHHH Confidence 22111 12333333344444444333211 11 11244333333 Q ss_pred HHhhhh--hccceeecccCceeecccccccccchhhhccccccccccccceeEEE-cCCCEE-EEccccC-------CCC Q lcl|NC_015266. 209 LRAKID--NDIGWHKTLSNVVVNGVTGISADVSWDLQDPATDAGYLNENQVTTLV-NRNGFR-FWGSRTC-------DAD 277 (390) Q Consensus 209 ~~a~~d--~~~g~~~span~~l~gv~~~~~~~~~~~~~~~~~~~~l~~~gI~~~~-~~~G~~-~wG~rT~-------~~d 277 (390) +.+..- .+..|-..--...|.|+..+.. ..+...+|.|.|..+||.++. ..+|.+ +-=..|. ..| T Consensus 307 ~aa~~A~~l~~DPArPL~tl~L~Gi~~p~~----~~r~~~~ern~LL~~Gist~~V~~~G~V~I~R~ITTY~~n~~G~~D 382 (495) T protein:vir:19 307 LCAVASQALSIDPARPLQTLTLPGRMPPAV----GDRFTWSERNALLFDGISTFNVNDGGEMQIERMITMYRTNKYGDSD 382 (495) T ss_pred HHHHHHHHhhcccccccCceeecceecCCc----cccCChHHHHHHHhCCcceEEECCCCeEEEEeeeeeeeecCCCCcc Confidence 322221 1223544444557788875543 333456678888899999986 445632 2222222 237 Q ss_pred cccceeeehhhHHHHHHHHHHHHHHHhc-CCCCHH-----------HHHHHHHHHHHHHHHHHhCCceeee---E----E Q lcl|NC_015266. 278 GKFFFENYTRSAQVIADTIAEEQMGVVD-GPLNPS-----------RARDIIENINAWFRREVSVGELIGG---G----A 338 (390) Q Consensus 278 ~~~~~i~vrR~~~~i~~~l~~~~~~~v~-e~n~~~-----------~~~~i~~~i~~~L~~l~~~g~l~g~---~----v 338 (390) +.|..|++-|+.+|+.+.++..+...-. +..... 
|-..|+..+-.-+++|..+|-+..+ + | T Consensus 383 ~syLDi~T~~tl~yvr~~~r~~i~~kfpR~KLa~d~~~~~~gq~IvTp~~ir~ell~~~~~le~~given~~~~~~~LiV 462 (495) T protein:vir:19 383 PSYLNVNTIATLSYLRYSLRTRITQKFPNYKLASDGTRFATGQAVVTPSVIKTELLALFEEWENAGLVEDFDTFKEELYV 462 (495) T ss_pred hhhhhhHHHHHHHHHHHHHHHHHhhhcCCcccccCCCCCCCcccccChHHHHHHHHHHHHhhhhhccccChhhhcceeEE Confidence 7899999999999999999987764332 333332 6678899999999999998887653 2 3 Q ss_pred EEecCCCCHHHHhCCeEEEEEEEEecccceEEEEEEEEcc Q lcl|NC_015266. 339 WYDPEPNTTDELTSGGTWIDYDYTPVPPLENLKLRQRITD 378 (390) Q Consensus 339 ~~d~~~nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~ 378 (390) +-|.++ .+++.+.+-...+-...-+-.++++-- T Consensus 463 erd~~d-------pnRln~~~p~d~vn~L~V~A~~i~f~L 495 (495) T protein:vir:19 463 ARNKDD-------KDRLDVLCGPNLINQFRIFAAQVQFIL 495 (495) T ss_pred EECCCC-------CcEEEEEecceeeCceeeeeeeeeeeC Confidence 333222 256666655555555554333333322 No 62 >protein:vir:106730 Length: 501 # NCBI annotation: gp04 # Family: family:all:396 # MgeID: mge:1599 # MgeName: Bcep1 # Cross-refs: genbank:acc:NP_944312;genbank:gi:38638611;genbank:GeneID:2657359 Probab=96.75 E-value=0.00037 Score=39.43 Aligned_cols=351 Identities=12% Similarity=0.073 Sum_probs=143.8 Q ss_pred CCCccCCC----------EEEEECCCCCcccccccccccee-eeccccccccceeccceEEEechhH-HHHhhccccchH Q lcl|NC_015266. 1 MPQDYHHG----------VRVIEINEGGRPIRTVSTAVLGI-VCTGADADPATFPLDTPVLLTNVIA-ALGKAGTKGTLR 68 (390) Q Consensus 1 Ma~~~~hG----------V~v~ev~~~~~~i~~v~tavig~-vgta~~~~~~~~~~~~~~~i~~~~~-~~~~~~~~gtl~ 68 (390) ||..|+.| +++-+-...+.+.......+.+. +......+ ..+.++ +.+... ....+...-.+. 
T Consensus 66 aA~~yFsg~~~q~p~P~~l~igR~~~~~~~~~l~g~~l~~~~la~~~~~~-g~l~i~----i~g~~~~~~i~~s~ats~~ 140 (501) T protein:vir:10 66 IADAYFPGIVNGGQLPYDLKFARYVAADAPASVYGIPLTGITLAQLQGYS-GTLTVT----TAAQHVSANISLAAATSFA 140 (501) T ss_pred HHHHHhhhhcCCCccccEEEEEeecccCccceeeeceehhhhhhhhhhee-eEEEEe----eccceeeeccccccccCHH Confidence 55544422 34444332222211111000000 00000000 011110 000000 000111111111 Q ss_pred HHHHhhhcc------------cCceEEEEeeccccccccccccchhhhccchhhhhhhhhhhhhhhhhhhhhhhhhhhhc Q lcl|NC_015266. 69 RTLDAIGKQ------------TKPVTVVVRVAEGKDEAETTANVIGTVTPDGKYTGMKALLAAQGKLAVKPRILVAPGLD 136 (390) Q Consensus 69 ~al~~~~~~------------~~~~~~vv~v~~~~~~~~~~~~~~~~~~~~~~~tgl~~~~~~~~~~~~~p~~~~apg~~ 136 (390) .+-..+... ......+.....+.....+. ...+ +.+...+.... -.+..+...|.. T Consensus 141 ~vA~~i~~al~~~~~tv~~d~~~~~f~i~~~t~G~~~~i~~------~t~~---~d~a~~l~Lt~---~~~a~v~~~g~~ 208 (501) T protein:vir:10 141 NAATLIEAAFTSPDFVVAYDALRNRFTVVTNTTGTAAAISA------VTGT---NNLADELGLSA---AAGATLQAAGVA 208 (501) T ss_pred HHHHHHHHhhcCCceEEEEecccceEEEEecccCcceeEEE------eecc---ccchhhhcccc---cCceeEEecCcc Confidence 111111111 11111111111111111000 0000 11111111111 011112223333 Q ss_pred chHHHHHHHHhh---hhcceEEeecccccCchHHHHHHhhhhccceEEEEeee---eEEEee---------ccCceeE-- Q lcl|NC_015266. 137 TQPVAAAFATIA---QSLRAMVYVAAHGCKTKEEAVAYRKQFGQREIMVIWPD---WLGWDD---------ITNSTVA-- 199 (390) Q Consensus 137 ~~~v~~al~~~~---~~~~~~~~~d~~~~~~~~~a~~~~~~~~~~~~~~~~p~---~~~~~~---------~~~~~~~-- 199 (390) ......+|..+. ..+-.+...+.+.....-++.+|.+..+..+....+.. ..+... ..+-.+. 
T Consensus 209 aet~~~Al~a~~~~~~~Wy~f~~a~~~~~~~~la~A~wi~a~~~~f~~~~~~~~~~~~~~~~~~~i~~~l~~~~y~~t~~ 288 (501) T protein:vir:10 209 ADTPASAMNRAVGLSRNWATFTTAWTAVIADRLAFAAWNSGQAYKYMYVAPDLEAASIVTNNAASFGAQVFAAPYQGTLP 288 (501) T ss_pred cccHHHHHHHHHhcccceEEEEEEecCChHHHHHHHHHHHhcCceEEEEEecCcceeeecccchhHHHHHHhcCCCceEE Confidence 333333444333 33333333443333333445555555554443333221 110000 0010111 Q ss_pred ----ecHHHHHHHHHhhhhhccce-eecccCcee-ecccccccccchhhhccccccccccccceeEEEcC----CCEEEE Q lcl|NC_015266. 200 ----IPAPAIAAGLRAKIDNDIGW-HKTLSNVVV-NGVTGISADVSWDLQDPATDAGYLNENQVTTLVNR----NGFRFW 269 (390) Q Consensus 200 ----~p~s~~vAg~~a~~d~~~g~-~~span~~l-~gv~~~~~~~~~~~~~~~~~~~~l~~~gI~~~~~~----~G~~~w 269 (390) -+|.+++.|..+..|-++-+ -....+|.+ .|+. . +.....+++.|..+|.|.+..+ ..+.+| T Consensus 289 ~y~~~~~~aa~~g~~as~nf~~~~g~~T~~fkql~~Gv~---a-----~~l~~t~a~al~~~~~N~y~~~~~~~~~~~~~ 360 (501) T protein:vir:10 289 LYGDQATAGAVMGYAASINFQLRNGRTVLAFRQFNAGVP---A-----TAHDLPTANALRSNNYTYIGAYANAANNYTIA 360 (501) T ss_pred ECCCCCHHHHHHHHHHhcCcccCcceeeeeecccCCCcC---c-----ccCCHHHHHHHHhcCCeEEEEEecccceeeEE Confidence 24677788888887743311 112223333 2222 1 1234457788888999987543 237788 Q ss_pred ccccCCCCcccceeeehhhHHHHHHHHHHHHHHHhc---C-CCCHHHHHHHHHHHHHHHHHHHhCCceee---------- Q lcl|NC_015266. 
270 GSRTCDADGKFFFENYTRSAQVIADTIAEEQMGVVD---G-PLNPSRARDIIENINAWFRREVSVGELIG---------- 335 (390) Q Consensus 270 G~rT~~~d~~~~~i~vrR~~~~i~~~l~~~~~~~v~---e-~n~~~~~~~i~~~i~~~L~~l~~~g~l~g---------- 335 (390) -.-+++++ |.+|-+.+-.+|++..|+..+...+- + |-|..=...|+..|+.-|++-+++|.|.- T Consensus 361 ~~G~~sG~--~~wiD~~~g~dWl~~~iq~~l~~ll~~~~kIPyt~~G~~~l~a~i~~~l~~av~nG~Ia~Gv~l~~~q~~ 438 (501) T protein:vir:10 361 YDGKLSGK--FLWVDTYLDQIYLNAELQRAEFEAMLAYNSLPYNEDGYTANYRAGVDVIDAAVTSGIIRAGVTLTNSQLQ 438 (501) T ss_pred Ecceeecc--ceehhhHhhHHHHHHHHHHHHHHHHhcCCCcccCHHHHHHHHHHHHHHHHHHHhCcceecCcccCcccce Confidence 55455554 56677788888888888887776442 2 66677888899999999999999998843 Q ss_pred -------------------eEEEEecCCCC-HHHHhCCeEEEEEEEEecccceEEEEEEEEcc Q lcl|NC_015266. 336 -------------------GGAWYDPEPNT-TDELTSGGTWIDYDYTPVPPLENLKLRQRITD 378 (390) Q Consensus 336 -------------------~~v~~d~~~nt-~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~ 378 (390) |.+..+....+ ++.-..+...+.+.+.---.+++|++-..--. T Consensus 439 ~i~~~~g~~~~~~~v~~~Gyy~~~~~~~~~~~~R~~R~~p~~~~~y~~~gaIh~v~i~s~~v~ 501 (501) T protein:vir:10 439 QIDAVAGVAGAGALVQTRGWYFLIGNPANPGQARQNRTSPACTLWYSDGGSIQELTIGSNAVI 501 (501) T ss_pred eecccccccccccceeccceEEeeCcccCChhhhhhcccCceEEEEEeCCceeEEEeeeeecC Confidence 33334332222 33333344556666666666676665333222 No 63 >protein:vir:78611 Length: 501 # NCBI annotation: BcepNY3gp03 # Family: family:all:396 # MgeID: mge:1854 # MgeName: BcepNY3 # Cross-refs: genbank:acc:YP_001294840;genbank:gi:149882903;genbank:GeneID:5291082 Probab=95.57 E-value=0.0018 Score=35.59 Aligned_cols=350 Identities=11% Similarity=0.077 Sum_probs=140.9 Q ss_pred CCCccCCC----------EEEEECCCCCccccccccccceeeeccccccccceeccceEEEechh-HH--HHhhccccch Q lcl|NC_015266. 1 MPQDYHHG----------VRVIEINEGGRPIRTVSTAVLGIVCTGADADPATFPLDTPVLLTNVI-AA--LGKAGTKGTL 67 (390) Q Consensus 1 Ma~~~~hG----------V~v~ev~~~~~~i~~v~tavig~vgta~~~~~~~~~~~~~~~i~~~~-~~--~~~~~~~gtl 67 (390) ||..|+.| +++-+-...+.+......... +.. .. ....++....++-.. .. 
-..+...-++ T Consensus 66 aA~~yFs~~~~q~~~P~~l~igR~~~~a~~~~l~g~~l~---~~~-la--~~~~~~G~l~iti~g~~~~~~i~~S~~ts~ 139 (501) T protein:vir:78 66 IADAYFPGIVNGGQLPYDLKFARYVAADAPASVYGIPLT---GVT-LT--QLQGYSGTLTVTTAAQHVSSNISLAAATSF 139 (501) T ss_pred HHHHHhhcCCCCCcccceEEEEeecccCcceeEecccee---ccc-hh--hhceeeeEEEEEeccceeeeccccccccCH Confidence 55555422 344443332222211111010 000 00 000001011111000 00 0111111111 Q ss_pred HHHHHhhhcccC------------ceEEEEeeccccccccccccchhhhccchhhhhhhhhhhhhhhhhhhhhhhhhhhh Q lcl|NC_015266. 68 RRTLDAIGKQTK------------PVTVVVRVAEGKDEAETTANVIGTVTPDGKYTGMKALLAAQGKLAVKPRILVAPGL 135 (390) Q Consensus 68 ~~al~~~~~~~~------------~~~~vv~v~~~~~~~~~~~~~~~~~~~~~~~tgl~~~~~~~~~~~~~p~~~~apg~ 135 (390) ..+...+....+ .+..+.....+..+..+ ... ..+.+...+..... .+..+...|. T Consensus 140 ~~vA~~i~~al~a~~~tv~~ds~~~~f~its~t~G~~~~i~------~~t---~~~~~a~~l~Lt~~---~~a~v~~~g~ 207 (501) T protein:vir:78 140 ANAATLIEAAFTSPDFVVSYDALRNRFVVNTNATGTAAAIS------AVT---GTNNLADELGLSAA---AGASLQAAGV 207 (501) T ss_pred HHHHHHHHhhhcCcceEEEEccccceEEEEeeecCCceeEE------EEe---cccchhhhhccccc---CceeeEeccc Confidence 111111111111 11111111111111110 000 01111111111111 1111222333 Q ss_pred cchHHHHHHHH---hhhhcceEEeecccccCchHHHHHHhhhhccceEEEEee---eeEEEee---------ccCceeEe Q lcl|NC_015266. 136 DTQPVAAAFAT---IAQSLRAMVYVAAHGCKTKEEAVAYRKQFGQREIMVIWP---DWLGWDD---------ITNSTVAI 200 (390) Q Consensus 136 ~~~~v~~al~~---~~~~~~~~~~~d~~~~~~~~~a~~~~~~~~~~~~~~~~p---~~~~~~~---------~~~~~~~~ 200 (390) ..+....++.. ....+-.+...+.+......++.+|.+..+..+....+. ...+... 
..+-.+.+ T Consensus 208 ~aet~~~a~~a~~~~~~~Wy~f~~a~~~~~~~~lalA~wiea~~~~f~~~~~~~~~~~~~~~~~~~i~~~l~a~~y~~t~ 287 (501) T protein:vir:78 208 AADTPASAMNRAVGLSRNWATFTTAWTAVIADRLALASWNSGQAYKYMYVAPDLEPASIVTNNSASFGAQVFAAPYQGTL 287 (501) T ss_pred cccCHHHHHHHHHhccCceEEEEEecCCCHHHHHHHHHHHHhcCceEEEEEecCCcceeecccchhHHHHHhhcCCCceE Confidence 33333333333 333333343444333333344455555544444333221 1111000 00111112 Q ss_pred c------HHHHHHHHHhhhhhccce-eecccCcee-ecccccccccchhhhccccccccccccceeEEEcC----CCEEE Q lcl|NC_015266. 201 P------APAIAAGLRAKIDNDIGW-HKTLSNVVV-NGVTGISADVSWDLQDPATDAGYLNENQVTTLVNR----NGFRF 268 (390) Q Consensus 201 p------~s~~vAg~~a~~d~~~g~-~~span~~l-~gv~~~~~~~~~~~~~~~~~~~~l~~~gI~~~~~~----~G~~~ 268 (390) | +.+++.|..+.+|-++-+ -....+|.+ .|+. .+ .....+++.|..+|.|++..+ ..+.+ T Consensus 288 ~~y~~~~~~aa~~g~~as~nf~~~~g~~T~~fkq~~~Gv~---a~-----~l~~t~a~al~~~~~N~y~~~~~~~~~~~~ 359 (501) T protein:vir:78 288 PLYGDQATAGAVMGYAASINFQLRNGRTVLAFRQFNAGVP---AT-----AHDLGTANALRSNNYTYIGAYANAANNYTI 359 (501) T ss_pred EEcCCcchHHHHHHHHHhcCcccCcceeeeeccccCCCcC---cc-----cCCHHHHHHHHhcCCeEEEEEecccceeeE Confidence 2 456677777777643311 112223333 2222 11 223457788888999987543 23778 Q ss_pred EccccCCCCcccceeeehhhHHHHHHHHHHHHHHHh---cC-CCCHHHHHHHHHHHHHHHHHHHhCCceee--------- Q lcl|NC_015266. 
269 WGSRTCDADGKFFFENYTRSAQVIADTIAEEQMGVV---DG-PLNPSRARDIIENINAWFRREVSVGELIG--------- 335 (390) Q Consensus 269 wG~rT~~~d~~~~~i~vrR~~~~i~~~l~~~~~~~v---~e-~n~~~~~~~i~~~i~~~L~~l~~~g~l~g--------- 335 (390) |-.-+++++ |.+|-+-+-.+|++..++..+...+ .+ |.+..=...|+..|+.-|++-+++|.|.- T Consensus 360 ~~~G~~sG~--~~wiD~~~~~~Wl~~~iq~~l~~ll~~~~kIPyt~~G~~~l~a~v~~~l~~av~nG~I~~Gv~l~~~q~ 437 (501) T protein:vir:78 360 AYDGKLSGK--FLWVDTYLDQIYLNAELQRAEFEAMLAYNSLPYNEDGYTALYRAGVDVIDAAVTSGIIRAGVTLTNSQL 437 (501) T ss_pred EEcCeeecc--ceeehhhhhHHHHHHHHHHHHHHHHHhCCCcccCHHHHHHHHHHHHHHHHHHHhCceeecCCCCCCccc Confidence 855556654 4556666666777666666665433 23 77888888899999999999999998832 Q ss_pred --------------------eEEEEecCCCC-HHHHhCCeEEEEEEEEecccceEEEEEEEEcc Q lcl|NC_015266. 336 --------------------GGAWYDPEPNT-TDELTSGGTWIDYDYTPVPPLENLKLRQRITD 378 (390) Q Consensus 336 --------------------~~v~~d~~~nt-~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~ 378 (390) |.+..+....+ ++.-..+...+.+.+.---.+++|++-..--. T Consensus 438 ~~I~~~~g~~~~~~~~~~~Gyy~~~~~~~~~~~~R~~R~~p~~~~~y~~~gaIh~v~i~s~~v~ 501 (501) T protein:vir:78 438 QQIDAAAGVAGAGQLVQMRGWYFLIGDPANPGQARQNRTTPTCTLWYSDGGSIQELTIGSNAVI 501 (501) T ss_pred eeeccccCccccccceeccceEEeeccccCChhhhhhcccCcEEEEEEeCCceeEEEeeeeecC Confidence 33334332222 33333344556666666666666665333222 No 64 >protein:vir:99586 Length: 507 # NCBI annotation: hypothetical protein # Family: family:all:396 # MgeID: mge:1544 # MgeName: BcepF1 # Cross-refs: genbank:acc:YP_001039795;genbank:gi:126011045;genbank:GeneID:4818281 Probab=91.95 E-value=0.013 Score=30.84 Aligned_cols=355 Identities=10% Similarity=0.054 Sum_probs=148.8 Q ss_pred CCCccCCC----------EEEEECCCCCccccccccccceeeeccccccccceecc--------ceEEEechhHHHHhhc Q lcl|NC_015266. 1 MPQDYHHG----------VRVIEINEGGRPIRTVSTAVLGIVCTGADADPATFPLD--------TPVLLTNVIAALGKAG 62 (390) Q Consensus 1 Ma~~~~hG----------V~v~ev~~~~~~i~~v~tavig~vgta~~~~~~~~~~~--------~~~~i~~~~~~~~~~~ 62 (390) ||..|+.| +++-+-.....+............-.........+.++ .++-++....+.... 
T Consensus 63 aA~~yFsq~p~~~~~P~~L~igR~~~~~~~a~l~g~~~~~~l~~~~~~~~G~lti~v~G~~~t~~~i~lS~~ts~~~vA- 141 (507) T protein:vir:99 63 RAKAYMSFISKSINSPSYISFARWVNAAIASMIVGDSLVKNLPALKAVATPTLSLSIGGTVVPIAGIDLTAALTLTDVA- 141 (507) T ss_pred HHHHHhccCCCCCcccceEEEEeecCccccceeecchhhhhHHHHhhhcceeEEEEEcCceeEeccccccccCCHHHHH- Confidence 44444322 23433322221110000000000000000000001000 000000000000000 Q ss_pred cccchHHHHHh-----------hhcccCceEEEEeeccccccccccccchhhhccchhhhhhhhhhhhhhhhhhhhhhhh Q lcl|NC_015266. 63 TKGTLRRTLDA-----------IGKQTKPVTVVVRVAEGKDEAETTANVIGTVTPDGKYTGMKALLAAQGKLAVKPRILV 131 (390) Q Consensus 63 ~~gtl~~al~~-----------~~~~~~~~~~vv~v~~~~~~~~~~~~~~~~~~~~~~~tgl~~~~~~~~~~~~~p~~~~ 131 (390) ..+..++.. .++..+.+..+.....+..+.. ......+..+.+..++.... .+. .. T Consensus 142 --s~i~~~l~a~~~~~~~~~tv~~d~~~~~F~v~s~~tG~~s~i------~~at~~~~gt~~s~l~~~~~-~~a----~~ 208 (507) T protein:vir:99 142 --ATLQTKIRASANAELATATVTFNTTTNQFVLNGTTTGALAPT------ITAVRTDPATDISSLLGWTN-TGT----VF 208 (507) T ss_pred --HHHHHhhhccccccccceEEEEecCCceEEEEeeecccccee------EEEEcCCchhhHHHHhcccc-ccc----eE Confidence 001111111 1111122222222111111111 11111122223332222211 111 11 Q ss_pred hhhhcchHHHHHHHHh---hhhcceEEeeccccc--CchHHHHHHhhhhccceEEEEeeee---------------EEEe Q lcl|NC_015266. 132 APGLDTQPVAAAFATI---AQSLRAMVYVAAHGC--KTKEEAVAYRKQFGQREIMVIWPDW---------------LGWD 191 (390) Q Consensus 132 apg~~~~~v~~al~~~---~~~~~~~~~~d~~~~--~~~~~a~~~~~~~~~~~~~~~~p~~---------------~~~~ 191 (390) ..|........++..+ ...+-.+...+.+.. ....+..+|.+..+..+.+..+.-. .... 
T Consensus 209 ~~g~~aet~~~a~~a~~~~~~nW~~~~~a~~~~~td~~~lalA~wiea~~~~f~~~~~~~~a~~~~~~~~~~~~~~~~~~ 288 (507) T protein:vir:99 209 VKGQAAETPDTSISKSAAISTNFGSFIYTSTPALTNDQITAVASWNASQNNMYMYSVPTTIANIGTLYAAVKGFSGCALN 288 (507) T ss_pred eecccccCHHHHHHHHHhhcCCeEEEEEEeccccChHHHHHHHHHHhhcCcEEEEEEecCchhhhhhhhhhhhcceeEEE Confidence 2233333333333333 333433333443221 1123444555554444433222100 0000 Q ss_pred eccCceeEecHHHHHHHHHhhhhhccc-eeecccCceeecccccccccchhhhccccccccccccceeEEEcCCC----E Q lcl|NC_015266. 192 DITNSTVAIPAPAIAAGLRAKIDNDIG-WHKTLSNVVVNGVTGISADVSWDLQDPATDAGYLNENQVTTLVNRNG----F 266 (390) Q Consensus 192 ~~~~~~~~~p~s~~vAg~~a~~d~~~g-~~~span~~l~gv~~~~~~~~~~~~~~~~~~~~l~~~gI~~~~~~~G----~ 266 (390) ...+......+.+++.|.++.+|-++- =-.....|.+.||.. + .....+++.|..+++|.+....| + T Consensus 289 ~~~~~~~~~~~~aa~~g~~as~nf~~~ng~~T~~fk~l~GV~a---~-----~lt~t~a~al~~~n~N~y~~~a~~~~~~ 360 (507) T protein:vir:99 289 ITSDSLPVDYIEQSPCEILAATDYTRVNATQNYMYYQFPSRNI---T-----VSDDTTANLVDANRGNYIGQTQSAGQSL 360 (507) T ss_pred eecccccchhHHHHHHHHHHhhccCcCccceeecccccCCccc---c-----cCCHHHHHHHHhcCCeEEEEecccccee Confidence 011111112356677777777763221 111223444555542 2 13455778888899999865433 6 Q ss_pred EEEcc-ccCCCCcccceeeehhhHHHHHHHHHHHHHHHhc---C-CCCHHHHHHHHHHHHHHHHHHHhCCceee------ Q lcl|NC_015266. 267 RFWGS-RTCDADGKFFFENYTRSAQVIADTIAEEQMGVVD---G-PLNPSRARDIIENINAWFRREVSVGELIG------ 335 (390) Q Consensus 267 ~~wG~-rT~~~d~~~~~i~vrR~~~~i~~~l~~~~~~~v~---e-~n~~~~~~~i~~~i~~~L~~l~~~g~l~g------ 335 (390) .+|-. .++++..+|.++-+-+=.+||+..++..+...+- + |-+..=...|+..++.-|++-+++|.|.. 
T Consensus 361 ~~~~~G~~~gG~~~fid~d~~~~~~WL~~~iq~~l~~l~~~~~kIPyt~~G~~~l~a~i~~~l~~av~nG~I~~Gvtl~~ 440 (507) T protein:vir:99 361 AFYQRGILCGGPNDAVDMNIYANEIWLKSAISAQILSLFLNVPRVPANETGESMLLSVIQSVVNTAKNNGTISAGKNLNV 440 (507) T ss_pred eEEecCeeeCCcccceeeeeecchHHHHHHHHHHHHHHHhcCCCCccChhhHHHHHHHHHHHHHHHHhccccccCCcccc Confidence 66644 4444444777777777777888887777775332 2 67778888899999999999999988843 Q ss_pred -----------------------eEEEEe-cCCCCH-HHHhCCeEEEEEEEEecccceEEEEEEEEc Q lcl|NC_015266. 336 -----------------------GGAWYD-PEPNTT-DELTSGGTWIDYDYTPVPPLENLKLRQRIT 377 (390) Q Consensus 336 -----------------------~~v~~d-~~~nt~-~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~ 377 (390) |.+... .+..++ +....+...+.+.+.---.+++|++....- T Consensus 441 ~q~~~in~~~~~~~~~~~~~~~Gyy~~~~~~s~~~~~~r~~r~~~~~~~~y~~~gaI~~v~~~~~~v 507 (507) T protein:vir:99 441 IQQQYITQISGDANAWRQVANIGYWLNITFSSYTNPNTQLTEWKASYQLIYSKDDAIRFVEGTDTLI 507 (507) T ss_pred cchheecccccccccccceeccceEEEeCChHhcChhhhhccccceEEEEEEeCCeEEEEEeeeecC Confidence 334443 233343 334456677777777777888877765555 No 65 >protein:vir:96104 Length: 504 # NCBI annotation: hypothetical protein ORF029 # Family: family:all:396 # MgeID: mge:1597 # MgeName: F8 # Cross-refs: genbank:acc:YP_001294446;genbank:gi:149408343;genbank:GeneID:5237223 Probab=91.87 E-value=0.014 Score=30.78 Aligned_cols=354 Identities=10% Similarity=0.023 Sum_probs=144.2 Q ss_pred CCCccCCC----------EEEEECCCCCccccccccccce--------eeeccccc-ccc-----ceeccceEEEechhH Q lcl|NC_015266. 1 MPQDYHHG----------VRVIEINEGGRPIRTVSTAVLG--------IVCTGADA-DPA-----TFPLDTPVLLTNVIA 56 (390) Q Consensus 1 Ma~~~~hG----------V~v~ev~~~~~~i~~v~tavig--------~vgta~~~-~~~-----~~~~~~~~~i~~~~~ 56 (390) ||..|+.| +++-+-...+.+.......... ..|+.... +.. ...++..+.++.-.. 
T Consensus 63 aA~~yF~~~~~~~~~P~~l~igR~~~~a~~~~l~g~~~~~~~~~~~~i~~G~lsitv~G~~~~~~~i~~S~~ts~~~vA~ 142 (504) T protein:vir:96 63 RAAAYFKFISKSVNSPSSISFARWVNTAIAPMVVGDNLPKTIADFAGFSAGVLTIMVGAAEKNITAIDTSAATSMDNVAS 142 (504) T ss_pred HHHHHhhcCCCCCccccEEEEEeecCcCccceEEechhHHHHHHHhhhhceEEEEEEcceeeeecccccccccchHHHHH Confidence 44444322 3444432222211100000000 00000000 000 000000000000000 Q ss_pred HHHhhccccchHH--HHHhhhcccCceEEEEeeccccccccccccchhhhccchhhhhhhhhhhhhhhhhhhhhhhhhhh Q lcl|NC_015266. 57 ALGKAGTKGTLRR--TLDAIGKQTKPVTVVVRVAEGKDEAETTANVIGTVTPDGKYTGMKALLAAQGKLAVKPRILVAPG 134 (390) Q Consensus 57 ~~~~~~~~gtl~~--al~~~~~~~~~~~~vv~v~~~~~~~~~~~~~~~~~~~~~~~tgl~~~~~~~~~~~~~p~~~~apg 134 (390) ..........-.. .....++..+.+..+..-...... ... ......+.+.++.... .+......| T Consensus 143 ~i~~al~~~~~~~~~~~tv~~d~~~~~f~its~~tg~~~------~~~--~~~a~~~~~~~~lgl~-----~~~~~~v~g 209 (504) T protein:vir:96 143 IIQTEIRKNTDPQLAQATVTWNPNTNQFTLVGATIGTGV------LAV--AKSADPQDMSTALGWS-----TSNVVNVAG 209 (504) T ss_pred HHHhhhhcccccccccceEEEeccCCeEEEEeeccccce------eEE--Eeeccccchhhhhhcc-----cccceEEee Confidence 0000000000000 000111111111111111000000 000 0000000111111110 011111223 Q ss_pred hcchHHHH---HHHHhhhhcceEEeecccccCc-hHHHHHHhhhhccceEEEEeeeeEEEee-------ccCcee---E- Q lcl|NC_015266. 135 LDTQPVAA---AFATIAQSLRAMVYVAAHGCKT-KEEAVAYRKQFGQREIMVIWPDWLGWDD-------ITNSTV---A- 199 (390) Q Consensus 135 ~~~~~v~~---al~~~~~~~~~~~~~d~~~~~~-~~~a~~~~~~~~~~~~~~~~p~~~~~~~-------~~~~~~---~- 199 (390) ........ ++......+-.+...+.+...+ ..+..+|.+..+..+....+- ...+. .....+ . 
T Consensus 210 ~~aet~~~al~al~~~~~~Wy~f~~a~~~~~dd~ilalA~w~ea~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~ 287 (504) T protein:vir:96 210 QAADLPDAAVAKSTNVSNNFGSFLFAGATLDNDQIKAVSAWNAAQNNQFIYTVAT--SLANLGALFDLVKGNSGTALNVL 287 (504) T ss_pred cccccHHHHHHHHHhhcCCeEEEEEEeccCCHHHHHHHHHHHhhcCceEEEEEee--cccchhhHHHhhhhcceeEEEEe Confidence 22222223 3333333344444343322211 123344555444443322221 00000 000000 0 Q ss_pred ------ecHHHHHHHHHhhhhhcc-ceeecccCceeecccccccccchhhhccccccccccccceeEEEcCC----CEEE Q lcl|NC_015266. 200 ------IPAPAIAAGLRAKIDNDI-GWHKTLSNVVVNGVTGISADVSWDLQDPATDAGYLNENQVTTLVNRN----GFRF 268 (390) Q Consensus 200 ------~p~s~~vAg~~a~~d~~~-g~~~span~~l~gv~~~~~~~~~~~~~~~~~~~~l~~~gI~~~~~~~----G~~~ 268 (390) --+..+.+|.++.+|-.+ .=-..-.+|.+.||... .....+++.|..+++|.+..+. .+.+ T Consensus 288 ~~~~~~~~~~~~~~~~~as~~f~~~ng~~T~~fk~l~GVta~--------~lt~t~~~aL~~~~~N~y~~~a~~~~~~~~ 359 (504) T protein:vir:96 288 SATASNDFVEQCPSEILAATNYDEPGASQNYMYYQFPGRNIT--------VSDDTAANTVDKSRGNYIGVTQANGQQLAF 359 (504) T ss_pred ecCccchhHHHHHHHHHHhcCcCcccccccccccccCCcCcc--------cCCHHHHHHHHhcCCeEEEEeecccceeeE Confidence 013455566666666322 11223345566666422 1245577888889999885432 2555 Q ss_pred E-ccccCCCCcccceeeehhhHHHHHHHHHHHHHHHhcC----CCCHHHHHHHHHHHHHHHHHHHhCCcee--------- Q lcl|NC_015266. 269 W-GSRTCDADGKFFFENYTRSAQVIADTIAEEQMGVVDG----PLNPSRARDIIENINAWFRREVSVGELI--------- 334 (390) Q Consensus 269 w-G~rT~~~d~~~~~i~vrR~~~~i~~~l~~~~~~~v~e----~n~~~~~~~i~~~i~~~L~~l~~~g~l~--------- 334 (390) | .+.++++.-+|.+|.+-+-.+|++..|+..+....-. |.|..=...|+..++.-|++-++.|.|. 
T Consensus 360 ~~~G~~~gG~~~~~wiDv~~~~~WL~~~lq~~l~~l~~~~~kIPyt~~Gi~~l~a~i~~vl~~av~~G~I~~Gv~~~~~q 439 (504) T protein:vir:96 360 YQRGILCGGPTDAVDMNVYANEIWLKSAIAQALLDLFLNVNAVPASMVGEAMTLAVLQPVLDKATSNGTFTYGKDISAVQ 439 (504) T ss_pred EecCeeeCCccccchhhhhhhHHHHHHHHHHHHHHHHhcCCCcccCHhhHHHHHHHHHHHHHHHHhcceeccCccCCccc Confidence 5 5555665557888899999999999999888764332 6677788899999999999999999772 Q ss_pred --------------------eeEEEEec-CCCCHH-HHhCCeEEEEEEEEecccceEEEEEEEEc Q lcl|NC_015266. 335 --------------------GGGAWYDP-EPNTTD-ELTSGGTWIDYDYTPVPPLENLKLRQRIT 377 (390) Q Consensus 335 --------------------g~~v~~d~-~~nt~~-~i~~G~~~~~i~~~p~~p~e~i~~~~~~~ 377 (390) ||.+.... ++-+++ .-..+...+.+.+.---.+++|++....- T Consensus 440 ~~~I~~~~~~d~~~~~~~~~GYyv~~~~~s~~s~~~r~~R~~~~~~~~y~~~gaI~~v~~~~~~v 504 (504) T protein:vir:96 440 QQYITQITGDRRAWRQVQTLGYWINITFSSYTNSNTGLTEWKANYTLIYSKGDAIRFVEGSDVMI 504 (504) T ss_pred hheecccccccccccceeccceEEEecChhccChhHhhhccccceEEEEEECCeEEEEEeccccC Confidence 35566542 333433 33445566677777777778777754444 No 66 >protein:vir:3636 Length: 501 # NCBI annotation: gp04 # Family: family:all:396 # MgeID: mge:75 # MgeName: Bcep781 # Cross-refs: genbank:acc:NP_705631;genbank:gi:23752316;genbank:GeneID:955753 Probab=90.68 E-value=0.02 Score=29.95 Aligned_cols=364 Identities=11% Similarity=0.035 Sum_probs=170.8 Q ss_pred CCCccCCCEEEEECCCCCccccccccccceeeeccccccccceeccceEEEechhHHHHhhccccchHHHHHhhhc---- Q lcl|NC_015266. 1 MPQDYHHGVRVIEINEGGRPIRTVSTAVLGIVCTGADADPATFPLDTPVLLTNVIAALGKAGTKGTLRRTLDAIGK---- 76 (390) Q Consensus 1 Ma~~~~hGV~v~ev~~~~~~i~~v~tavig~vgta~~~~~~~~~~~~~~~i~~~~~~~~~~~~~gtl~~al~~~~~---- 76 (390) |+-+=.|=-.+++|.-+.-+.....-...+++-+. ....|.......++..+-...|+.....+.+...+|. 
T Consensus 1 m~~~~ip~s~iV~V~~~v~~~~~~~~~~~~lllt~----~~~~~~~r~~~y~s~~~V~~~FG~~S~ey~aA~~yFs~~~~ 76 (501) T protein:vir:36 1 MPTTTIPIDQIVQMLPGVIGAGGAPGRLTGLVLTQ----DTSVQPGQLADFFQETDVENWFGALSNEAKIADAYFPGIVN 76 (501) T ss_pred CCcCCcccceEEEEeeeeccCCCcceeeeeEEEec----cCCCCCcceeeecCHHHHHHhcCCChHHHHHHHHHhhcccC Confidence 88665554466666555444333332222332221 1123444445555666655667877777777777775 Q ss_pred ccCce--EEEEeecccccccc---------cccc-----------c-----hhhhcc------chhhhhhhhhhhhhhh- Q lcl|NC_015266. 77 QTKPV--TVVVRVAEGKDEAE---------TTAN-----------V-----IGTVTP------DGKYTGMKALLAAQGK- 122 (390) Q Consensus 77 ~~~~~--~~vv~v~~~~~~~~---------~~~~-----------~-----~~~~~~------~~~~tgl~~~~~~~~~- 122 (390) +...+ .++-+......+.. +..+ + ..+++. .+..+.+......... T Consensus 77 q~~~P~~l~igR~~~~a~~~~l~g~~l~~~~~a~~~~~sg~l~vti~g~~~~~~i~lS~~ts~~~vA~~i~~al~~~~~t 156 (501) T protein:vir:36 77 GGQLPYDLKFARYVAADAPASVYGIPLTGVTLAQLQGYSGTLTVTTAAQHVSANISLAAATSFANAATLIEAAFTSPDFV 156 (501) T ss_pred CCccccEEEEEeecCcCcceeEeccchhhhhhhhccceeEEEEEEecceeeeeecccccccCHHHHHHHHhhhhcCcceE Confidence 22221 12222211000000 0000 0 000000 0001111111111000 Q ss_pred -----------------------------------hhh---hhhhhhhhhhcchHHHHHHHHhhhh---cceEEeecccc Q lcl|NC_015266. 123 -----------------------------------LAV---KPRILVAPGLDTQPVAAAFATIAQS---LRAMVYVAAHG 161 (390) Q Consensus 123 -----------------------------------~~~---~p~~~~apg~~~~~v~~al~~~~~~---~~~~~~~d~~~ 161 (390) .++ .+..+...|...+....+|..+... +-.+...+.+. T Consensus 157 v~~d~~~~~f~i~s~t~G~~~~i~~~t~~~~ia~~l~Lt~~~~a~v~~~g~~~et~~~al~a~~~~s~~Wy~f~~a~~~~ 236 (501) T protein:vir:36 157 VAYDALRNRFTVVTNATGTAAAISAVTGTNNFADEIGLSAAAGATLQAAGVAADTPASAMNRAVGLSRNWATFTTAWTAV 236 (501) T ss_pred EEEcCcceeEEEEeccCCcceeeEeeecccchhhhhcccccCcceEEecccccccHHHHHHHHHhccCceEEEEEecCCC Confidence 000 0000111122222233444443333 33333444333 Q ss_pred cCchHHHHHHhhhhccceEEEEeee---eEEEee---------ccCceeE------ecHHHHHHHHHhhhhhccc-eeec Q lcl|NC_015266. 
162 CKTKEEAVAYRKQFGQREIMVIWPD---WLGWDD---------ITNSTVA------IPAPAIAAGLRAKIDNDIG-WHKT 222 (390) Q Consensus 162 ~~~~~~a~~~~~~~~~~~~~~~~p~---~~~~~~---------~~~~~~~------~p~s~~vAg~~a~~d~~~g-~~~s 222 (390) ....-++.+|.+..+..+....+.- ..+... ..+-.+. ..+.+++.|..+..|-++- =-.. T Consensus 237 ~~~~la~A~wiea~~~~f~~~~~~~~~~~~~~~~~~~i~~~l~~~~y~~t~~~y~~~~~~aa~~g~~as~nf~~~~g~~T 316 (501) T protein:vir:36 237 IADRLAFASWNSGQAYKYMYVAPDLEAASIVSNNAASFGAQVFAAPYQGTLPLYGDQATAGAVMGYAASINFQLRNGRTV 316 (501) T ss_pred hHHHHHHHHHHhhcCceEEEEEecCchhhhhccchhhHHHHHHhcCCCcEEEEcCCCCHHHHHHHHHHhcCcccCcceee Confidence 3333445556665555444333211 110000 0010111 1356677777777764330 0112 Q ss_pred ccCcee-ecccccccccchhhhccccccccccccceeEEEcC----CCEEEEccccCCCCcccceeeehhhHHHHHHHHH Q lcl|NC_015266. 223 LSNVVV-NGVTGISADVSWDLQDPATDAGYLNENQVTTLVNR----NGFRFWGSRTCDADGKFFFENYTRSAQVIADTIA 297 (390) Q Consensus 223 pan~~l-~gv~~~~~~~~~~~~~~~~~~~~l~~~gI~~~~~~----~G~~~wG~rT~~~d~~~~~i~vrR~~~~i~~~l~ 297 (390) ..+|.+ .|+. . +.....+++.|..+|.|.+..+ ..+.+|-.-+++++ +.+|-+.+-.+|++..|+ T Consensus 317 ~~fkq~~~Gi~---a-----~~l~~t~a~al~~~~~N~y~~~~~~~~~~~~~~~G~~sG~--~~wiD~~~g~dWL~~~iq 386 (501) T protein:vir:36 317 LAFRQFNAGVP---A-----TVHDLPTANALRSNNYTYIGAYANAANNYTIAYDGKLSGK--FLWVDTYLDQIYLNAELQ 386 (501) T ss_pred eeccccCCCcC---c-----CcCCHHHHHHHHhcCCcEEEEEecccceeeEEEcCeeecc--chhhhHHHhHHHHHHHHH Confidence 223333 2222 1 1233457788888999976432 34777755566654 566788888888888888 Q ss_pred HHHHHHhcC----CCCHHHHHHHHHHHHHHHHHHHhCCceee-----------------------------eEEEEecCC Q lcl|NC_015266. 298 EEQMGVVDG----PLNPSRARDIIENINAWFRREVSVGELIG-----------------------------GGAWYDPEP 344 (390) Q Consensus 298 ~~~~~~v~e----~n~~~~~~~i~~~i~~~L~~l~~~g~l~g-----------------------------~~v~~d~~~ 344 (390) ..+...+-. |.|..=...|+..|+.-|++-+++|.|.- |.+..+... 
T Consensus 387 ~~l~~ll~~~~KIPytd~G~~~l~a~i~~~l~~av~nG~I~~Gv~~~~~q~~~I~~~~g~~~~~~~v~~~Gyy~~~~~~~ 466 (501) T protein:vir:36 387 RAEFEAMLAYNSLPYNEDGYTGLYRAGVDVIDAAVTSGIIRAGVTLTNSQLQQIDKAARVAGAGQLVQTRGWYFLIGDPA 466 (501) T ss_pred HHHHHHHhcCCCCccChhhHHHHHHHHHHHHHHHHhCceeecCCCCCcccceeecccccccccccceeccceEEeeCccc Confidence 888765432 67777788899999999999999998832 334444333 Q ss_pred CCHHH-HhCCeEEEEEEEEecccceEEEEEEEEcc Q lcl|NC_015266. 345 NTTDE-LTSGGTWIDYDYTPVPPLENLKLRQRITD 378 (390) Q Consensus 345 nt~~~-i~~G~~~~~i~~~p~~p~e~i~~~~~~~~ 378 (390) .++++ -..+...+.+.+.---.+++|++-..--. T Consensus 467 ~~~~~R~~R~~p~~~~~y~~~gaIh~v~i~s~~v~ 501 (501) T protein:vir:36 467 NPGQARQNRTTPACTLWYSDGGSIQSLTIGSNAVI 501 (501) T ss_pred CChhhhhhcccCcEEEEEEeCCceeEEEeeeeeeC Confidence 33333 33344566666666667777765333322 No 67 >protein:vir:101576 Length: 501 # NCBI annotation: gp04 # Family: family:all:396 # MgeID: mge:1477 # MgeName: Bcep43 # Cross-refs: genbank:acc:NP_958108;genbank:gi:41057654;genbank:GeneID:2716834 Probab=89.43 E-value=0.026 Score=29.24 Aligned_cols=364 Identities=12% Similarity=0.049 Sum_probs=169.8 Q ss_pred CCCccCCCEEEEECCCCCccccccccccceeeeccccccccceeccceEEEechhHHHHhhccccchHHHHHhhhc---- Q lcl|NC_015266. 1 MPQDYHHGVRVIEINEGGRPIRTVSTAVLGIVCTGADADPATFPLDTPVLLTNVIAALGKAGTKGTLRRTLDAIGK---- 76 (390) Q Consensus 1 Ma~~~~hGV~v~ev~~~~~~i~~v~tavig~vgta~~~~~~~~~~~~~~~i~~~~~~~~~~~~~gtl~~al~~~~~---- 76 (390) ||-+=.|=-.+++|.-+.-+.........+++-+ ....+|.......++..+-...|+.....+.+...+|. T Consensus 1 m~~~~ip~s~iV~V~~~v~~~~~~~~~~~~l~l~----~~~~~~~~~~~~~~s~~~V~~~FG~~S~ey~aA~~yFsg~~~ 76 (501) T protein:vir:10 1 MPTTTIPIDQIVQMLPGVIGAGGAPGRLTGLVLT----QDTSVQPGQLADFFQETDVENWFGALSNEAKIADAYFPGIVN 76 (501) T ss_pred CCCCCcccceEEEEeeecccCCCccccceeEEEe----ccCCCCccceEEecCHHHHHHhcCCChHHHHHHHHHhhhhcC Confidence 8865455445566655544333332222222211 12234555556677777766678888777888888886 Q ss_pred ccCce--EEEEeecccccc---------ccccccc---h-------hh------hc------cchhhhhhhhhhhhhhh- Q lcl|NC_015266. 
77 QTKPV--TVVVRVAEGKDE---------AETTANV---I-------GT------VT------PDGKYTGMKALLAAQGK- 122 (390) Q Consensus 77 ~~~~~--~~vv~v~~~~~~---------~~~~~~~---~-------~~------~~------~~~~~tgl~~~~~~~~~- 122 (390) +...+ .++-+......+ ..+..++ . ++ .+ -.+..+.+......... T Consensus 77 q~p~P~~l~igR~~~~~~~~~l~g~~l~~~~la~~~~~sg~l~vti~g~~~~~~i~ls~ats~~~vAs~i~~al~~~~~t 156 (501) T protein:vir:10 77 GGQLPYDLKFARYVAADAPASVYGIPLTGVTLAQLQGYSGTLTVTTAAQHVSANISLAAATSFANAATLIEAAFTSPDFV 156 (501) T ss_pred CCccccEEEEEeecCCCccceEeccchhhhhhhhcceeeeEEEEeeccceeecccccccccCHHHHHHHHhhhccCCceE Confidence 33222 222222111100 0000000 0 00 00 00001111111111000 Q ss_pred ----------------hhhhhhh----------------------hhhhhhcchHHHHHHHHhhh---hcceEEeecccc Q lcl|NC_015266. 123 ----------------LAVKPRI----------------------LVAPGLDTQPVAAAFATIAQ---SLRAMVYVAAHG 161 (390) Q Consensus 123 ----------------~~~~p~~----------------------~~apg~~~~~v~~al~~~~~---~~~~~~~~d~~~ 161 (390) .+....+ +...|........++..+.. .+-.+...+.+. T Consensus 157 v~~d~~~~~f~its~ttG~~~~i~~~~~~~~la~~l~Lt~~~~a~v~~~g~~aet~~~a~~a~~~~~~~Wy~f~~a~~~~ 236 (501) T protein:vir:10 157 VAYDALRNRFTVVTNATGTAAAISAVTGTNNLADELGLSAAAGATLQAAGVAADTPASAMNRAVGLSRNWATFTTAWTAV 236 (501) T ss_pred EEEcccCceEEEEeeccCCceeEEEeeCchhhhhhcCccccccceEEecCcccccHHHHHHHHHhccCceEEEEEecCCC Confidence 0000000 11112222223344444333 333333444433 Q ss_pred cCchHHHHHHhhhhccceEEEEeee---eEEEee---------ccCceeEe------cHHHHHHHHHhhhhhccce-eec Q lcl|NC_015266. 162 CKTKEEAVAYRKQFGQREIMVIWPD---WLGWDD---------ITNSTVAI------PAPAIAAGLRAKIDNDIGW-HKT 222 (390) Q Consensus 162 ~~~~~~a~~~~~~~~~~~~~~~~p~---~~~~~~---------~~~~~~~~------p~s~~vAg~~a~~d~~~g~-~~s 222 (390) .....++.+|.+..+..+....+.- ..+... ..+-.+.+ .+.+++.|..+.+|-++-+ -.. 
T Consensus 237 ~~~~la~A~wiea~~~~f~~~~~~~~~~~~~~~~~~~i~~~l~~~~y~~t~~~y~~~~~~aa~~g~~as~nf~~~~g~~T 316 (501) T protein:vir:10 237 IADRLAFAAWNSGQAYKYMYVAPDLEAASIVTNNAASFGAQVFAAPYQGTLPLYGDQATAGAVMGYAASINFQLRNGRTV 316 (501) T ss_pred hHHHHHHHHHHHhcCceEEEEEecCchhhhhhhhhhhHHHHHHhcCCCceEEECCCCcHHHHHHHHHHhhCcccCcccee Confidence 3334445556665554443333211 111000 01111222 2566777888877753311 122 Q ss_pred ccCceee-cccccccccchhhhccccccccccccceeEEEcCC----CEEEEccccCCCCcccceeeehhhHHHHHHHHH Q lcl|NC_015266. 223 LSNVVVN-GVTGISADVSWDLQDPATDAGYLNENQVTTLVNRN----GFRFWGSRTCDADGKFFFENYTRSAQVIADTIA 297 (390) Q Consensus 223 pan~~l~-gv~~~~~~~~~~~~~~~~~~~~l~~~gI~~~~~~~----G~~~wG~rT~~~d~~~~~i~vrR~~~~i~~~l~ 297 (390) ..+|.+. |+. . +.....+++.|..++.|.+.... -+.+|-.-+++++ |.+|-+-+-.+|++..++ T Consensus 317 ~~fkq~~~Gi~---a-----~~lt~t~a~al~~~~~N~y~~~~~~~~~~~~~~~G~~sG~--~~wiD~~~~~~Wl~~~iq 386 (501) T protein:vir:10 317 LAFRQFNAGVP---A-----TAHDLPTANALRSNNYTYIGAYANAANNYTIAYDGKLSGK--FLWVDTYLDQIYLNAELQ 386 (501) T ss_pred eeccccCCCcC---c-----ccCCHHHHHHHHhcCCeEEEEeccccceeeEEecCeeecc--ceeehhhhhHHHHHHHHH Confidence 2233332 222 1 12234577888889999986542 3678855556654 455666666666666666 Q ss_pred HHHHHHh---cC-CCCHHHHHHHHHHHHHHHHHHHhCCceee-----------------------------eEEEEecCC Q lcl|NC_015266. 298 EEQMGVV---DG-PLNPSRARDIIENINAWFRREVSVGELIG-----------------------------GGAWYDPEP 344 (390) Q Consensus 298 ~~~~~~v---~e-~n~~~~~~~i~~~i~~~L~~l~~~g~l~g-----------------------------~~v~~d~~~ 344 (390) ..+...+ .+ |.+..=...|+..|+.-|++-+++|.|.- |.+..+... T Consensus 387 ~~l~~ll~~~~kIPyt~~G~~~l~a~v~~~l~~av~nG~I~~Gv~l~~~q~~~i~~~~g~~~~~~~v~~~Gyy~~~~~~~ 466 (501) T protein:vir:10 387 RAEFEAMLAYNSLPYNEDGYTALYRAGVDVIDAAVTSGIIRAGVTLTNSQLQQIDAAAGVAGAGQLVQTRGWYFLIGDPA 466 (501) T ss_pred HHHHHHHHhcCCcccCHHHHHHHHHHHHHHHHHHHhCceeecCCCCCcccceeeccccCccccccceeccceeEeecccc Confidence 6665433 23 77888888999999999999999998832 333343322 Q ss_pred CC-HHHHhCCeEEEEEEEEecccceEEEEEEEEcc Q lcl|NC_015266. 
345 NT-TDELTSGGTWIDYDYTPVPPLENLKLRQRITD 378 (390) Q Consensus 345 nt-~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~ 378 (390) .+ ++.-..+...+.+.+.---.+++|++-..--. T Consensus 467 ~~~~~R~~R~~p~~~~~y~~~gaIh~v~i~s~~v~ 501 (501) T protein:vir:10 467 NPGQARQNRTTPACTLWYSDGGSIQQLTIGSNAVI 501 (501) T ss_pred CChhhhhhccccceEEEEEeCCceeEEEeeeeecC Confidence 23 23333344556666666666666665333222 No 68 >protein:vir:94073 Length: 494 # NCBI annotation: hypothetical protein # Family: family:all:396 # MgeID: mge:1493 # MgeName: OP2 # Cross-refs: genbank:acc:YP_453619;genbank:gi:84662655;genbank:GeneID:5142583 Probab=88.13 E-value=0.034 Score=28.62 Aligned_cols=359 Identities=12% Similarity=0.069 Sum_probs=154.9 Q ss_pred CCC-ccCCCEEEEECCCCCccccccccccceeeeccccccccceeccceEEEechhHHHHhhccccchHHHHHhhhc--- Q lcl|NC_015266. 1 MPQ-DYHHGVRVIEINEGGRPIRTVSTAVLGIVCTGADADPATFPLDTPVLLTNVIAALGKAGTKGTLRRTLDAIGK--- 76 (390) Q Consensus 1 Ma~-~~~hGV~v~ev~~~~~~i~~v~tavig~vgta~~~~~~~~~~~~~~~i~~~~~~~~~~~~~gtl~~al~~~~~--- 76 (390) |+. ... ++++|.-+.-+... +...|.+..-.. ....|.......++..+-...|+.....+.+...+|. T Consensus 1 m~~ip~s---~iV~V~~~v~~~~~---~~~~f~~~l~~~-~~~~~~~r~~~y~s~~~V~~~FG~~S~ey~aA~~yFs~~~ 73 (494) T protein:vir:94 1 MPNIPIS---QIVSINPQVVSAGG---TQGTLDGLLLTQ-ATGFPVTQPQVYFSAADVGTAFGLTSDEYNAALVYFAGIL 73 (494) T ss_pred CCCCCcc---cEEEeeeeccccCC---cccccceeEeec-CccCCccceeeecCHHHHHHhcCCChHHHHHHHHHhhhcc Confidence 652 222 33344333322211 112222221111 1122333334445555555567777777777777776 Q ss_pred -ccCce--EEEEeecccc--------ccccccc----------------------cchhhhccchhhhhhhhhhhhhh-- Q lcl|NC_015266. 77 -QTKPV--TVVVRVAEGK--------DEAETTA----------------------NVIGTVTPDGKYTGMKALLAAQG-- 121 (390) Q Consensus 77 -~~~~~--~~vv~v~~~~--------~~~~~~~----------------------~~~~~~~~~~~~tgl~~~~~~~~-- 121 (390) +...+ .++-+..... ....+.. ++.......+..+.+........ 
T Consensus 74 ~q~p~P~~l~igR~~~~a~~~~l~g~~~~~tl~~~~~~~g~l~iti~g~~~~~~i~lS~~ts~~~vA~~i~~ai~~a~~~ 153 (494) T protein:vir:94 74 GGGQQPASLTIGRYASAATSAAVFGAPLTLSLAQLQTLSGTLIVTTDTQRTSAAINLSGATSFANAASLMTSGFTTPNFA 153 (494) T ss_pred CCCccccEEEEEeecCccccceeeccchhhhHHhhhhcceEEEEEEcceEEEeeecccccCChhhHHHHHhhhhccccce Confidence 33222 2222221110 0000000 00000000000000110000000 Q ss_pred ---------------hhhhhhhh--------------------hhhhhhcchHHHHHHHHhhhh---cceEEeecccccC Q lcl|NC_015266. 122 ---------------KLAVKPRI--------------------LVAPGLDTQPVAAAFATIAQS---LRAMVYVAAHGCK 163 (390) Q Consensus 122 ---------------~~~~~p~~--------------------~~apg~~~~~v~~al~~~~~~---~~~~~~~d~~~~~ 163 (390) ..+..+.+ +...|...+....++..+... +-.+.+.+.+... T Consensus 154 v~~d~~~~~f~v~s~ttG~~s~is~~t~~~a~~l~lt~~~~a~v~~~g~~aet~~~a~~a~~~~~~~Wy~f~~~~~~~~~ 233 (494) T protein:vir:94 154 ITYDAQRRRFVLSTTATGTTASVSAVTGTLADGVGLSTASGAYVEGSGLAADTAASALDRLAASSSTWAIFTTAWAASLS 233 (494) T ss_pred EEEcccCcEEEEEEccCCceeEEEEeccchhhhhhhhccccceEeecCcccccHHHHHHHHHhccCceEEEEEecCCCHH Confidence 00000000 011122222233344443332 3333333433333 Q ss_pred chHHHHHHhhhhccceEEEEee---eeEEEeecc------------CceeE---ecHHHHHHHHHhhhhhccceeecccC Q lcl|NC_015266. 164 TKEEAVAYRKQFGQREIMVIWP---DWLGWDDIT------------NSTVA---IPAPAIAAGLRAKIDNDIGWHKTLSN 225 (390) Q Consensus 164 ~~~~a~~~~~~~~~~~~~~~~p---~~~~~~~~~------------~~~~~---~p~s~~vAg~~a~~d~~~g~~~span 225 (390) ...++.+|.+..+..+....+. ........+ +.... ..|.+++.|..+..|- -..+.+ T Consensus 234 ~ilalA~wiea~~~~~~~~~~~~d~~~~~~~~~~~i~~~l~~~~y~~t~~~y~~~~~~aa~~g~~aa~~~----~~~~g~ 309 (494) T protein:vir:94 234 DRTALAQWTSDQVFRRIYAAWDQDAAGLSVNNVSSFGNIVKTTPFSNTIPVYGLLANAMIVLAWGASTNL----QIAEGR 309 (494) T ss_pred HHHHHHHHHhhcCccEEEEEecCCcceeecccchhHHHHHHhhcCCceEEEcCCCChHHHHHHHHHhccc----cccCcc Confidence 3344555666555444433221 111110000 01111 1255677777776663 334444 Q ss_pred ceeec---ccccccccchhhhccccccccccccceeEEEcCCC----EEEEccccCCCCcccc--eeeehhhHHHHHHHH Q lcl|NC_015266. 
226 VVVNG---VTGISADVSWDLQDPATDAGYLNENQVTTLVNRNG----FRFWGSRTCDADGKFF--FENYTRSAQVIADTI 296 (390) Q Consensus 226 ~~l~g---v~~~~~~~~~~~~~~~~~~~~l~~~gI~~~~~~~G----~~~wG~rT~~~d~~~~--~i~vrR~~~~i~~~l 296 (390) ..+.. ..++.. +.....+++.+..+|+|.+....| +.+|.+-+++++-.|- +++.--+-+.+++++ T Consensus 310 ~T~~~k~q~~gi~~-----~~l~~t~a~al~~~~~N~y~~~~~~~~~~~~~~gg~~sG~~~~id~~~~~~WL~~~iq~~l 384 (494) T protein:vir:94 310 TTLALRSPVSSAGV-----RVDNLANANALLSNGYTYLGKYASATNTYTVTYNGAIGGQFLWADTALGWIALRRNLQQAL 384 (494) T ss_pred eeEEeeccCCCCCC-----ccCCHHHHHHHHhcCCeEEEEecccCceEEEecCceeccccceeeeeccHHHHHHHHHHHH Confidence 44432 112222 223445778888999999876543 6888777887665443 333333444444444 Q ss_pred HHHHHHHhcC-CCCHHHHHHHHHHHHHHHHHHHhCCceee----------------------------eEEEE-e-cCCC Q lcl|NC_015266. 297 AEEQMGVVDG-PLNPSRARDIIENINAWFRREVSVGELIG----------------------------GGAWY-D-PEPN 345 (390) Q Consensus 297 ~~~~~~~v~e-~n~~~~~~~i~~~i~~~L~~l~~~g~l~g----------------------------~~v~~-d-~~~n 345 (390) ...+.. ..+ |.|..=...|+..|+.-|++-+++|.|.- |.+.. + .+.+ T Consensus 385 ~~ll~~-~~KIPytd~G~~~l~a~i~~~l~~av~nG~I~~Gv~~~~~q~~~i~~~~G~~~~~~~~~kGyy~~~~~~~s~~ 463 (494) T protein:vir:94 385 FETLLA-YRSLPYNADGYNALYQGAQDVVSQFVAAGVIRAGVALSASQRAQIDQAAGVPISGDVVDKGWYLQVIDPITTT 463 (494) T ss_pred HHHHHh-CCCcccChhhHHHHHHHHHHHHHHHHhCceeecccccCcchhhhhhhhhcCccccceeccceeeeccCCCChh Confidence 443332 233 88888889999999999999999999842 22222 2 2334 Q ss_pred CHHHHhCCeEEEEEEEEecccceEEEEEEEEcc Q lcl|NC_015266. 346 TTDELTSGGTWIDYDYTPVPPLENLKLRQRITD 378 (390) Q Consensus 346 t~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~ 378 (390) +..+...-++.+.+. ---.+++|++.....- T Consensus 464 ~ra~R~~~~~~~~y~--~~GAIh~v~i~~~~v~ 494 (494) T protein:vir:94 464 VRTDRGSPTVNFWYC--DGGSIQRVVVSATTVI 494 (494) T ss_pred hhhccccCCceEEEE--ecCcEEEEEEeeEEeC Confidence 444444444433333 3666777776555444 Done!