Query lcl|NC_019932.1_cdsid_YP_007238064.1 [gene=G184_gp57] [protein=tail sheath protein] [protein_id=YP_007238064.1] [location=complement(25811..26980)]
Match_columns 389
No_of_seqs 162 out of 744
Neff 9.5
Searched_HMMs 1612
Date Thu Nov 7 18:35:14 2013
Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_54 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_54_vs_rec_db.hhr

No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 protein:vir:103993 Length: 390 100.0 3E-113 2E-116 637.8 36.7 389 1-389 1-390 (390)
2 protein:vir:78206 Length: 390 100.0 3E-113 2E-116 637.8 36.7 389 1-389 1-390 (390)
3 protein:vir:79181 Length: 390 100.0 9E-113 6E-116 634.8 37.2 389 1-389 1-390 (390)
4 protein:vir:98553 Length: 395 100.0 2E-112 1E-115 632.6 38.7 389 1-389 1-395 (395)
5 protein:vir:1845 Length: 392 # 100.0 5E-112 3E-115 630.8 38.2 389 1-389 1-392 (392)
6 protein:vir:6079 Length: 396 # 100.0 1E-111 9E-115 628.3 37.2 389 1-389 1-395 (396)
7 protein:vir:79141 Length: 391 100.0 1E-111 7E-115 628.8 36.7 389 1-389 1-390 (391)
8 protein:vir:5711 Length: 396 # 100.0 3E-111 2E-114 626.7 37.4 389 1-389 1-395 (396)
9 protein:vir:1172 Length: 391 # 100.0 2E-111 1E-114 627.8 36.1 389 1-389 3-391 (391)
10 protein:vir:2035 Length: 396 # 100.0 3E-111 2E-114 626.9 36.7 389 1-389 1-395 (396)
11 protein:vir:100323 Length: 393 100.0 2E-110 1E-113 621.6 36.5 387 1-389 3-392 (393)
12 protein:vir:10336 Length: 386 100.0 3E-105 2E-108 593.6 35.0 383 1-384 1-386 (386)
13 protein:vir:107865 Length: 477 100.0 8E-100 5E-103 563.8 36.1 382 1-387 1-477 (477)
14 protein:vir:96740 Length: 388 100.0 9E-100 6E-103 563.5 35.4 373 1-388 4-388 (388)
15 protein:vir:79092 Length: 477 100.0 2.7E-99 2E-102 561.0 36.3 382 1-387 1-477 (477)
16 protein:vir:80984 Length: 666 100.0 2.4E-87 1.5E-90 495.4 35.6 375 1-389 1-665 (666)
17 protein:vir:98263 Length: 664 100.0 5.7E-87 3.5E-90 493.4 34.5 375 1-389 1-660 (664)
18 protein:vir:6894 Length: 660 # 100.0 2.1E-86 1.3E-89 490.2 35.4 380 1-389 1-660 (660)
19 protein:vir:6594 Length: 666 # 100.0 4.9E-86 3E-89 488.3 36.9 377 1-389 1-665 (666)
20 protein:vir:106427 Length: 679 100.0 3.3E-86 2.1E-89 489.2 35.0 380 1-389 1-679 (679)
21 protein:vir:103456 Length: 659 100.0 1.8E-85 1.1E-88 485.2 35.5 378 1-389 1-656 (659)
22 protein:vir:108052 Length: 660 100.0 2E-85 1.2E-88 485.0 35.1 376 1-388 1-660 (660)
23 protein:vir:7206 Length: 659 # 100.0 2.9E-85 1.8E-88 484.0 35.3 379 1-388 1-659 (659)
24 protein:vir:101187 Length: 663 100.0 1.6E-85 9.6E-89 485.5 33.8 380 1-389 1-662 (663)
25 protein:vir:101804 Length: 663 100.0 8.3E-85 5.1E-88 481.5 33.7 380 1-389 1-662 (663)
26 protein:vir:104858 Length: 729 100.0 1.8E-84 1.1E-87 479.7 35.0 376 1-387 1-729 (729)
27 protein:vir:5663 Length: 671 # 100.0 2.6E-83 1.6E-86 473.4 34.3 378 1-389 1-671 (671)
28 protein:vir:100539 Length: 663 100.0 1.3E-83 8.1E-87 475.0 32.6 380 1-389 1-662 (663)
29 protein:vir:106984 Length: 743 100.0 2E-81 1.2E-84 463.0 29.9 379 1-386 299-743 (743)
30 protein:vir:98824 Length: 774 100.0 1.6E-80 1E-83 458.0 28.3 373 1-386 279-774 (774)
31 protein:vir:104477 Length: 749 100.0 1.6E-76 1E-79 436.1 29.0 378 1-385 320-749 (749)
32 protein:vir:5833 Length: 742 # 100.0 3.5E-75 2.2E-78 428.8 29.1 363 1-385 350-742 (742)
33 protein:vir:79798 Length: 717 100.0 2.8E-50 1.7E-53 292.3 24.8 353 1-377 330-717 (717)
34 protein:vir:63742 Length: 562 100.0 6.5E-39 4E-42 229.9 28.8 359 1-382 8-562 (562)
35 protein:vir:80779 Length: 569 100.0 2E-37 1.2E-40 221.8 27.7 359 1-382 1-569 (569)
36 protein:vir:80488 Length: 562 100.0 3.3E-37 2E-40 220.6 28.2 359 1-382 8-562 (562)
37 protein:vir:103168 Length: 641 100.0 7.6E-37 4.7E-40 218.6 20.0 264 1-279 3-641 (641)
38 protein:vir:107310 Length: 581 100.0 3.3E-35 2.1E-38 209.6 22.9 367 1-389 149-580 (581)
39 protein:vir:7653 Length: 581 # 100.0 3.2E-35 2E-38 209.7 20.6 363 1-389 158-580 (581)
40 protein:vir:95741 Length: 587 100.0 1.9E-33 1.2E-36 200.0 27.2 359 1-382 1-587 (587)
41 protein:vir:96586 Length: 587 100.0 3.1E-33 1.9E-36 198.8 27.5 359 1-382 8-587 (587)
42 protein:vir:99306 Length: 587 100.0 6.1E-33 3.8E-36 197.2 27.2 359 1-382 1-587 (587)
43 protein:vir:102819 Length: 648 100.0 3.1E-31 1.9E-34 187.8 28.4 357 1-380 1-648 (648)
44 protein:vir:102957 Length: 437 99.9 6.5E-29 4E-32 175.1 23.8 357 1-376 1-437 (437)
45 protein:vir:100829 Length: 607 99.9 5.9E-28 3.7E-31 169.9 27.1 364 1-388 17-607 (607)
46 protein:vir:101326 Length: 529 99.9 4.4E-25 2.8E-28 154.1 23.1 355 1-377 112-529 (529)
47 protein:vir:105470 Length: 451 99.9 1.5E-22 9.2E-26 140.3 25.6 355 1-376 1-451 (451)
48 protein:vir:78986 Length: 436 99.6 2.3E-16 1.4E-19 106.3 23.8 352 1-376 3-436 (436)
49 protein:vir:102359 Length: 356 99.2 1.3E-11 8.3E-15 80.2 19.8 320 1-375 1-356 (356)
50 protein:vir:95263 Length: 450 98.8 3.5E-08 2.2E-11 61.5 27.0 357 1-378 1-450 (450)
51 protein:vir:3751 Length: 376 # 98.8 2.7E-08 1.7E-11 62.0 23.2 340 4-382 1-376 (376)
52 protein:vir:5260 Length: 502 # 98.7 8.7E-08 5.4E-11 59.3 28.3 361 1-377 1-502 (502)
53 protein:vir:3788 Length: 376 # 98.7 4.7E-08 2.9E-11 60.8 22.4 342 4-382 1-376 (376)
54 protein:vir:80052 Length: 331 98.7 1.4E-07 8.7E-11 58.1 25.8 314 1-377 1-331 (331)
55 protein:vir:78782 Length: 370 98.6 1E-07 6.3E-11 58.9 22.3 339 4-384 1-370 (370)
56 protein:vir:3165 Length: 426 # 98.3 1.4E-06 8.9E-10 52.6 21.4 363 1-377 1-426 (426)
57 protein:vir:106984 Length: 743 98.3 1.8E-09 1.1E-12 68.6 4.6 352 1-389 1-404 (743)
58 protein:vir:276 Length: 369 # 98.0 7.7E-06 4.8E-09 48.6 26.8 339 1-380 1-369 (369)
59 protein:vir:4463 Length: 498 # 98.0 2.7E-06 1.7E-09 51.1 16.7 353 1-389 74-498 (498)
60 protein:vir:4517 Length: 498 # 98.0 3E-06 1.8E-09 50.9 16.5 354 1-380 74-498 (498)
61 protein:vir:489 Length: 498 # 97.6 4.1E-05 2.5E-08 44.7 18.0 350 1-389 74-498 (498)
62 protein:vir:104477 Length: 749 97.4 3E-06 1.8E-09 50.9 8.6 191 1-389 1-197 (749)
63 protein:vir:1996 Length: 495 # 96.6
0.00048 3E-07 38.8 21.7 346 1-377 77-495 (495) 64 protein:vir:96104 Length: 504 95.3 0.0023 1.4E-06 35.1 28.1 356 1-376 1-504 (504) 65 protein:vir:3636 Length: 501 # 94.7 0.0038 2.3E-06 33.9 25.2 363 1-377 1-501 (501) 66 protein:vir:106730 Length: 501 93.3 0.0082 5.1E-06 32.0 25.6 362 1-377 1-501 (501) 67 protein:vir:101576 Length: 501 92.3 0.012 7.5E-06 31.1 25.2 362 1-377 1-501 (501) 68 protein:vir:99586 Length: 507 89.8 0.024 1.5E-05 29.4 21.6 351 1-376 68-507 (507) 69 protein:vir:78611 Length: 501 89.0 0.029 1.8E-05 29.0 25.3 362 1-377 1-501 (501) 70 protein:vir:94073 Length: 494 65.6 0.28 0.00017 23.6 25.9 352 1-377 1-494 (494) No 1 >protein:vir:103993 Length: 390 # NCBI annotation: phage tail sheath protein # Family: family:all:115 # MgeID: mge:1665 # MgeName: phi52237 # Cross-refs: genbank:acc:YP_293729;genbank:gi:72537699;genbank:GeneID:3608123 Probab=100.00 E-value=2.6e-113 Score=637.83 Aligned_cols=389 Identities=65% Similarity=1.031 Sum_probs=381.1 Q ss_pred CCC-CCCCEEEEECCCCCcccccccccceeeeecccccccccccccccEEEecchhhhhhhcccchhHHHHHhhhcccCc Q lcl|NC_019932. 1 MSD-YHHGVRVVEINDGTRTISTVSTAIVGMVCTADDADAAAFPLNEPVLLTNVLSAIGKAGKKGTLAAALQAIADQAKP 79 (389) Q Consensus 1 M~~-~~~GV~v~~v~~~~~~~~~v~t~v~~~~g~a~~~~~~~~~~~~~vli~~~~~~~~~~~~~gtl~~~v~~~~~~~~~ 79 (389) |++ |+|||||+|+.++++++.++++++++|+|++++++...+|+++|+++++..++...++++++|..++..++++++. T Consensus 1 M~~~~~~Gv~v~e~~~g~~~i~~~~tav~g~vg~a~~ad~~~~pln~pv~i~s~~~~~~~~g~~gtL~~al~~~~~~gg~ 80 (390) T protein:vir:10 1 MPQDYHHGVRVIEINEGGRPIRSVSTAVLGVVCTAADADASAFPLNTPVLLTNVVAALGKAGKKGTLRRTLDAIGKQTKP 80 (390) T ss_pred CcccccCCeEEEEcCCCcccccccCcceeEEEEcccCcCccccccccceEeccHHHHHhhcCCCceehhhhhhhccccCc Confidence 777 5799999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eEEEEEeccccccccccccccccccccccchhhHHHHHHhhhhhhhhhhhccccccchHHHHHHHHhhhhcCceeeeccC Q lcl|NC_019932. 
80 VTVVVRVAEGATPAETTSNIIGTTDENGRYTGMKALLSAQTQLGVKPRILGVPGLDALEVSTALASIAQQLRAFAYVSAW 159 (389) Q Consensus 80 ~~~v~~~~~~~~~~~t~~~~~~~~d~~~~~tGl~a~~~~~~~~~~~p~~~~apg~~~~~v~~al~~~~~~~~~~~i~d~~ 159 (389) .++++++.++.+...+..+++++.+..+..+|+++++.+++..+..|.++++|++++.+|+++|..+|+++++++++|+| T Consensus 81 ~~~vv~v~~~~~~~~~~~~~ig~~~~~~~~tg~~al~~~~~~~~~~p~il~ap~~~~~~v~~~l~~~a~~~~~~aivD~p 160 (390) T protein:vir:10 81 LTVVVRVAEGKDADETTSNVIGTVTPDGKYTGIKALLAAQGALGVKPRILAAPGLDTQPVAAALAATAQSLRAMAYVSAS 160 (390) T ss_pred eEEEEEecccccccccccccccccccccccchhhhhhhhhhhhcceehhhcccccchHHHHHHHHHhhcccceEEEEecC Confidence 99999999999998899999999888888999999999999999999999999999999999999999999999999999 Q ss_pred CCccHHHHHHhhhcccCceeEEeeeeEEEEeecCCCceEEehhHHHHHHHHhhhccccceeccCCceeccceeccccccc Q lcl|NC_019932. 160 GCKTLSEAMAYRENFSQRELMVIWPDFISWNTTANQSETAYATARALGLRAKIDTDTGWHKTLSNVGVNGVTGISASVFW 239 (389) Q Consensus 160 ~~~t~~~a~~~~~~~~s~~~~~~~p~~~~~~~~~~~~~~~p~s~~~Ag~~a~~d~~~g~~~span~~l~gv~~~~~~~~~ 239 (389) .+.+.+++.+++++++|.+.++||||++++++..+..+++|||+++||++|++|.++|||+||||+.|+|+.+++..+++ T Consensus 161 ~~~t~~~a~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~~Agl~a~~D~~~g~~~spaN~~l~gi~~~~~~~~~ 240 (390) T protein:vir:10 161 GCKTKEEAAAYRKQFGQREIMVIWPDWLGWDDTTNSTAVIPAPAIAAGLRAKIDNDIGWHKTISNVVVNGVSGISADVSW 240 (390) T ss_pred CCCCHHHHHHHhhccCCceEEEEcCceEeecccCCcccccchHHHHHHHHHHhhcCCCcEECcCCceeeceeecceeccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccCCcchhhhhcccceEEEEcCCCEEEEcCccCCCCcccceeehhhHHHHHHHHHHHHHHHHhhcCCCHHHHHHHHHHH Q lcl|NC_019932. 
240 DLQQTGTDADLLNEACVTTLIRKDGFRFWGNRTCSDDPLFAFENYTRTAQVIADTMAEAHMWANDKPLTPVLVRDIIAGI 319 (389) Q Consensus 240 ~~~~~~~~~~~l~~~~i~~~~~~~G~~~wG~rT~~~d~~~~~i~vrR~~~~i~~~~~~~~~~~v~e~n~~~~~~~i~~~i 319 (389) ..++..+|+++||++||+++++++||++||+||+++||+|+||++||+++||+++|+++++|++||||++.+|++|++++ T Consensus 241 ~~~~~~~~~~~ln~~gi~t~~~~~G~~~wG~rT~s~d~~~~~i~~rR~~~~i~~~i~~~~~~~v~e~n~~~~~~~i~~~i 320 (390) T protein:vir:10 241 DLQDPATDAGYLNEHEVTTLVNRNGFRFWGERTCSDDPKFAFENYTRTAQVAGDSIAEAQMPVVDGPLNPSLARDIVESI 320 (390) T ss_pred ccccccchhhhhhhcCcEEEEcCCCEEEEcccccCCCcccceeehhhHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHhCCceeeeEEEEecCCCCHHHhhCCEEEEEEEEEecccceEEEEEEEEcchHHHHHHHHhcC Q lcl|NC_019932. 320 NAKFRELVSAGYLLGASCWYDDTANDKDTLKAGKLFIDYDYTPVPPLEDLTLRQRITDSYLANFAASVNS 389 (389) Q Consensus 320 ~~~l~~l~~~gal~g~~v~~d~~~n~~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~~~~~~~~ 389 (389) +.||++||++|+|+||+|+||+++||+++|++|+|+++|+++|++|+|||+|+++++++|+++|+++|+| T Consensus 321 ~~~L~~l~~~g~l~g~~v~~d~~~nt~~~i~~G~~~~~v~~~p~~pae~I~~~~~~~~~~~~~~~~~~~~ 390 (390) T protein:vir:10 321 NGWFRQQVANGYLIGGSAWIDPEPNTADILASGKAYIDYDYTPVPPLENLVLRQRITDRFLADFPARVAG 390 (390) T ss_pred HHHHHHHHhCCceeeeEEEEccCCCCHHHhhCCeEEEEEEEEecCCcceEEEEEEEchHHHHHHHHHhcC Confidence 9999999999999999999999999999999999999999999999999999999999999999999999 No 2 >protein:vir:78206 Length: 390 # NCBI annotation: gp33, phage tail sheath protein # Family: family:all:115 # MgeID: mge:1848 # MgeName: phiE12-2 # Cross-refs: genbank:acc:YP_001111183;genbank:gi:134288694;genbank:GeneID:4960666 Probab=100.00 E-value=2.6e-113 Score=637.83 Aligned_cols=389 Identities=65% Similarity=1.031 Sum_probs=381.1 Q ss_pred CCC-CCCCEEEEECCCCCcccccccccceeeeecccccccccccccccEEEecchhhhhhhcccchhHHHHHhhhcccCc Q lcl|NC_019932. 
1 MSD-YHHGVRVVEINDGTRTISTVSTAIVGMVCTADDADAAAFPLNEPVLLTNVLSAIGKAGKKGTLAAALQAIADQAKP 79 (389) Q Consensus 1 M~~-~~~GV~v~~v~~~~~~~~~v~t~v~~~~g~a~~~~~~~~~~~~~vli~~~~~~~~~~~~~gtl~~~v~~~~~~~~~ 79 (389) |++ |+|||||+|+.++++++.++++++++|+|++++++...+|+++|+++++..++...++++++|..++..++++++. T Consensus 1 M~~~~~~Gv~v~e~~~g~~~i~~~~tav~g~vg~a~~ad~~~~pln~pv~i~s~~~~~~~~g~~gtL~~al~~~~~~gg~ 80 (390) T protein:vir:78 1 MPQDYHHGVRVIEINEGGRPIRSVSTAVLGVVCTAADADASAFPLNTPVLLTNVVAALGKAGKKGTLRRTLDAIGKQTKP 80 (390) T ss_pred CcccccCCeEEEEcCCCcccccccCcceeEEEEcccCcCccccccccceEeccHHHHHhhcCCCceehhhhhhhccccCc Confidence 777 5799999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eEEEEEeccccccccccccccccccccccchhhHHHHHHhhhhhhhhhhhccccccchHHHHHHHHhhhhcCceeeeccC Q lcl|NC_019932. 80 VTVVVRVAEGATPAETTSNIIGTTDENGRYTGMKALLSAQTQLGVKPRILGVPGLDALEVSTALASIAQQLRAFAYVSAW 159 (389) Q Consensus 80 ~~~v~~~~~~~~~~~t~~~~~~~~d~~~~~tGl~a~~~~~~~~~~~p~~~~apg~~~~~v~~al~~~~~~~~~~~i~d~~ 159 (389) .++++++.++.+...+..+++++.+..+..+|+++++.+++..+..|.++++|++++.+|+++|..+|+++++++++|+| T Consensus 81 ~~~vv~v~~~~~~~~~~~~~ig~~~~~~~~tg~~al~~~~~~~~~~p~il~ap~~~~~~v~~~l~~~a~~~~~~aivD~p 160 (390) T protein:vir:78 81 LTVVVRVAEGKDADETTSNVIGTVTPDGKYTGIKALLAAQGALGVKPRILAAPGLDTQPVAAALAATAQSLRAMAYVSAS 160 (390) T ss_pred eEEEEEecccccccccccccccccccccccchhhhhhhhhhhhcceehhhcccccchHHHHHHHHHhhcccceEEEEecC Confidence 99999999999998899999999888888999999999999999999999999999999999999999999999999999 Q ss_pred CCccHHHHHHhhhcccCceeEEeeeeEEEEeecCCCceEEehhHHHHHHHHhhhccccceeccCCceeccceeccccccc Q lcl|NC_019932. 
160 GCKTLSEAMAYRENFSQRELMVIWPDFISWNTTANQSETAYATARALGLRAKIDTDTGWHKTLSNVGVNGVTGISASVFW 239 (389) Q Consensus 160 ~~~t~~~a~~~~~~~~s~~~~~~~p~~~~~~~~~~~~~~~p~s~~~Ag~~a~~d~~~g~~~span~~l~gv~~~~~~~~~ 239 (389) .+.+.+++.+++++++|.+.++||||++++++..+..+++|||+++||++|++|.++|||+||||+.|+|+.+++..+++ T Consensus 161 ~~~t~~~a~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~~Agl~a~~D~~~g~~~spaN~~l~gi~~~~~~~~~ 240 (390) T protein:vir:78 161 GCKTKEEAAAYRKQFGQREIMVIWPDWLGWDDTTNSTAVIPAPAIAAGLRAKIDNDIGWHKTISNVVVNGVSGISADVSW 240 (390) T ss_pred CCCCHHHHHHHhhccCCceEEEEcCceEeecccCCcccccchHHHHHHHHHHhhcCCCcEECcCCceeeceeecceeccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccCCcchhhhhcccceEEEEcCCCEEEEcCccCCCCcccceeehhhHHHHHHHHHHHHHHHHhhcCCCHHHHHHHHHHH Q lcl|NC_019932. 240 DLQQTGTDADLLNEACVTTLIRKDGFRFWGNRTCSDDPLFAFENYTRTAQVIADTMAEAHMWANDKPLTPVLVRDIIAGI 319 (389) Q Consensus 240 ~~~~~~~~~~~l~~~~i~~~~~~~G~~~wG~rT~~~d~~~~~i~vrR~~~~i~~~~~~~~~~~v~e~n~~~~~~~i~~~i 319 (389) ..++..+|+++||++||+++++++||++||+||+++||+|+||++||+++||+++|+++++|++||||++.+|++|++++ T Consensus 241 ~~~~~~~~~~~ln~~gi~t~~~~~G~~~wG~rT~s~d~~~~~i~~rR~~~~i~~~i~~~~~~~v~e~n~~~~~~~i~~~i 320 (390) T protein:vir:78 241 DLQDPATDAGYLNEHEVTTLVNRNGFRFWGERTCSDDPKFAFENYTRTAQVAGDSIAEAQMPVVDGPLNPSLARDIVESI 320 (390) T ss_pred ccccccchhhhhhhcCcEEEEcCCCEEEEcccccCCCcccceeehhhHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHhCCceeeeEEEEecCCCCHHHhhCCEEEEEEEEEecccceEEEEEEEEcchHHHHHHHHhcC Q lcl|NC_019932. 
320 NAKFRELVSAGYLLGASCWYDDTANDKDTLKAGKLFIDYDYTPVPPLEDLTLRQRITDSYLANFAASVNS 389 (389) Q Consensus 320 ~~~l~~l~~~gal~g~~v~~d~~~n~~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~~~~~~~~ 389 (389) +.||++||++|+|+||+|+||+++||+++|++|+|+++|+++|++|+|||+|+++++++|+++|+++|+| T Consensus 321 ~~~L~~l~~~g~l~g~~v~~d~~~nt~~~i~~G~~~~~v~~~p~~pae~I~~~~~~~~~~~~~~~~~~~~ 390 (390) T protein:vir:78 321 NGWFRQQVANGYLIGGSAWIDPEPNTADILASGKAYIDYDYTPVPPLENLVLRQRITDRFLADFPARVAG 390 (390) T ss_pred HHHHHHHHhCCceeeeEEEEccCCCCHHHhhCCeEEEEEEEEecCCcceEEEEEEEchHHHHHHHHHhcC Confidence 9999999999999999999999999999999999999999999999999999999999999999999999 No 3 >protein:vir:79181 Length: 390 # NCBI annotation: gp30, phage tail sheath protein # Family: family:all:115 # MgeID: mge:1866 # MgeName: phiE202 # Cross-refs: genbank:acc:YP_001111061;genbank:gi:134288772;genbank:GeneID:4960700 Probab=100.00 E-value=9.1e-113 Score=634.82 Aligned_cols=389 Identities=65% Similarity=1.032 Sum_probs=380.1 Q ss_pred CCC-CCCCEEEEECCCCCcccccccccceeeeecccccccccccccccEEEecchhhhhhhcccchhHHHHHhhhcccCc Q lcl|NC_019932. 1 MSD-YHHGVRVVEINDGTRTISTVSTAIVGMVCTADDADAAAFPLNEPVLLTNVLSAIGKAGKKGTLAAALQAIADQAKP 79 (389) Q Consensus 1 M~~-~~~GV~v~~v~~~~~~~~~v~t~v~~~~g~a~~~~~~~~~~~~~vli~~~~~~~~~~~~~gtl~~~v~~~~~~~~~ 79 (389) |++ |+|||||+|+.++++++.++++++++|+|++++++...+|+++|+++++..++...++++++|..+++.++.+++. T Consensus 1 M~~~~~~Gv~v~e~~~~~~~i~~~~tav~~~vg~a~dad~~~~p~n~pv~its~~~~~~~~g~~~tL~~al~~~~~~~~~ 80 (390) T protein:vir:79 1 MPQDYHHGVRVIEINEGGRPIRSVSTAVLGVVCTAADADASAFPLNTPVLLTNVVAALGKAGKKGTLRRTLDAIGKQTKP 80 (390) T ss_pred CccccCCCeEEEEcCCCcccccccCCceeEEEEecCCCCccccccccceEeecHHHHHHhcCCCccchhhhhhhcccccc Confidence 666 5799999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eEEEEEeccccccccccccccccccccccchhhHHHHHHhhhhhhhhhhhccccccchHHHHHHHHhhhhcCceeeeccC Q lcl|NC_019932. 
80 VTVVVRVAEGATPAETTSNIIGTTDENGRYTGMKALLSAQTQLGVKPRILGVPGLDALEVSTALASIAQQLRAFAYVSAW 159 (389) Q Consensus 80 ~~~v~~~~~~~~~~~t~~~~~~~~d~~~~~tGl~a~~~~~~~~~~~p~~~~apg~~~~~v~~al~~~~~~~~~~~i~d~~ 159 (389) .++++++..+.+...+..+.+++.+..+..+|++++++.++..+..|.++++|++++.+++++|..+|+++++++++|+| T Consensus 81 ~~~vv~v~~~~~~~~~~~~~ig~~~~~~~~tgl~al~~~~~~~~~~p~il~ap~~~~~~v~~~l~~~a~~~~~~ai~D~p 160 (390) T protein:vir:79 81 LTVVVRVAEGKDADETTSNVIGTVTPDGKYTGIKALLAAQGALGVKPRILAAPGLDTQPVAAALAATAQSLRAMAYVSAS 160 (390) T ss_pred eEEEEeeccccccccccceeeecccccccchhhhhhhhhhhhhccccccccCCcccchHHHHHHHHhhhhcceEEEEEcc Confidence 99999999999888888888888888889999999999999999999999999999999999999999999999999999 Q ss_pred CCccHHHHHHhhhcccCceeEEeeeeEEEEeecCCCceEEehhHHHHHHHHhhhccccceeccCCceeccceeccccccc Q lcl|NC_019932. 160 GCKTLSEAMAYRENFSQRELMVIWPDFISWNTTANQSETAYATARALGLRAKIDTDTGWHKTLSNVGVNGVTGISASVFW 239 (389) Q Consensus 160 ~~~t~~~a~~~~~~~~s~~~~~~~p~~~~~~~~~~~~~~~p~s~~~Ag~~a~~d~~~g~~~span~~l~gv~~~~~~~~~ 239 (389) .+.+.+++.+++++++|.+.++||||+++|++..+..+++|||+++||++||+|.++|||+||||+.|+|+.+++..+.+ T Consensus 161 ~~~t~~~a~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~~Ag~~a~~D~~~g~~~spsN~~i~gi~~~~~~~~~ 240 (390) T protein:vir:79 161 GCKTKEEAAAYRRQFGQREIMVIWPDWLGWDDTTNSTAVIPAPAIAAGLRAKIDNDIGWHKTISNVVVNGVSGISADVSW 240 (390) T ss_pred CCCCHHHHHHHhcCCCCceEEEEcCceeecccccCceeEeehHHHHHHHHHhhhccCCcEEccCCceeeccceeeeeccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccCCcchhhhhcccceEEEEcCCCEEEEcCccCCCCcccceeehhhHHHHHHHHHHHHHHHHhhcCCCHHHHHHHHHHH Q lcl|NC_019932. 
240 DLQQTGTDADLLNEACVTTLIRKDGFRFWGNRTCSDDPLFAFENYTRTAQVIADTMAEAHMWANDKPLTPVLVRDIIAGI 319 (389) Q Consensus 240 ~~~~~~~~~~~l~~~~i~~~~~~~G~~~wG~rT~~~d~~~~~i~vrR~~~~i~~~~~~~~~~~v~e~n~~~~~~~i~~~i 319 (389) .+++.++|+++||++||+++++++||++||+||+++||+|+||++||+++||+++|+++++|++||||++.+|++|++++ T Consensus 241 ~~~~~~~~a~~Ln~~gi~t~~~~~G~~~wG~rT~~~d~~~~~i~vrR~~~~i~~~i~~~~~~~v~e~~~~~~~~~i~~~i 320 (390) T protein:vir:79 241 DLQDPATDAGYLNEHEVTTLVNRNGFRFWGERTCSDDPKFAFENYTRTAQVAADSIAEAQMPVVDGPLNPSLARDIVESI 320 (390) T ss_pred cccccchhhhhhhhcCcEEEEcCCCEEEEeccccCCCcccceeeehhhHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHhCCceeeeEEEEecCCCCHHHhhCCEEEEEEEEEecccceEEEEEEEEcchHHHHHHHHhcC Q lcl|NC_019932. 320 NAKFRELVSAGYLLGASCWYDDTANDKDTLKAGKLFIDYDYTPVPPLEDLTLRQRITDSYLANFAASVNS 389 (389) Q Consensus 320 ~~~l~~l~~~gal~g~~v~~d~~~n~~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~~~~~~~~ 389 (389) +.||++||++|+|+||+|+||+++||+++|++|+|+++|+++|++|+|||+|+++++++|+++|+++|+| T Consensus 321 ~~~L~~l~~~gal~g~~v~~d~~~nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~~~~~v~~ 390 (390) T protein:vir:79 321 NGWFRQQVANGYLIGGSAWIDPEPNTADILASGKAYIDYDYTPVPPLENLVLRQRITDRFLADFPARVAG 390 (390) T ss_pred HHHHHHHHhCCceeeeEEEEecCCCCHHHhhCCEEEEEEEEEecCCcceEEEEEEEchHHHHHHHHHhcC Confidence 9999999999999999999999999999999999999999999999999999999999999999999999 No 4 >protein:vir:98553 Length: 395 # NCBI annotation: gp21 # Family: family:all:115 # MgeID: mge:1533 # MgeName: PSP3 # Cross-refs: genbank:acc:NP_958076;genbank:gi:41057373;genbank:GeneID:2744224 Probab=100.00 E-value=2.4e-112 Score=632.56 Aligned_cols=389 Identities=79% Similarity=1.228 Sum_probs=374.4 Q ss_pred CCCCCCCEEEEECCCCCcccccccccceeeeecccccccccccccccEEEecchhhhhhhcccchhHHHHHhhhcccCce Q lcl|NC_019932. 
1 MSDYHHGVRVVEINDGTRTISTVSTAIVGMVCTADDADAAAFPLNEPVLLTNVLSAIGKAGKKGTLAAALQAIADQAKPV 80 (389) Q Consensus 1 M~~~~~GV~v~~v~~~~~~~~~v~t~v~~~~g~a~~~~~~~~~~~~~vli~~~~~~~~~~~~~gtl~~~v~~~~~~~~~~ 80 (389) |++|+|||||+|+.++++++.++++++++|+|++++++...+|+++|+++++..++...+|+.+++..++..++++++.. T Consensus 1 m~~~~~GV~v~e~~~g~~~v~~v~tav~~~vgta~~~~~~~~p~~~pv~v~s~~~~~~~~g~~~tl~~al~~~~~~~~~~ 80 (395) T protein:vir:98 1 MSDFHHGTQVIEINDGTRVISTVATAVVGMVCTASDADATLFPLNEPVLITNVQSAIAKAGKKGTLAASLQAIADQSKPV 80 (395) T ss_pred CCCCCCCeEEEEcCCCcccccccCcceEEEEeeccCCCccccccccceEeechHHhHhhcccccchhhHHHHHhhccCce Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEEecccccc------ccccccccccccccccchhhHHHHHHhhhhhhhhhhhccccccchHHHHHHHHhhhhcCcee Q lcl|NC_019932. 81 TVVVRVAEGATP------AETTSNIIGTTDENGRYTGMKALLSAQTQLGVKPRILGVPGLDALEVSTALASIAQQLRAFA 154 (389) Q Consensus 81 ~~v~~~~~~~~~------~~t~~~~~~~~d~~~~~tGl~a~~~~~~~~~~~p~~~~apg~~~~~v~~al~~~~~~~~~~~ 154 (389) +++++...+... ..+...+.+..+..+.+||+++++++++..+..|.++++||+++++++++|.++|+++++++ T Consensus 81 ~~vv~~~~~~~~~~~~~~a~~~~~i~g~~~~~~~~Tgl~al~~~~~~~~~~p~il~ap~~~~~~v~~al~~~~~~~~~~~ 160 (395) T protein:vir:98 81 TVVVRVEDGTGDDEEAALAQTVSNIIGGTDENGKYTGIKALLTAQAVTGVKPRILGVPGLDTKEVAVALASAAIKLRAFA 160 (395) T ss_pred EEEeeccccccccccccccccccccccccccccchhHHHHHhhhhhhhccchhhcccccccccHHHHHHHHHhhhcCcEE Confidence 999887655433 23444566666667889999999999999999999999999999999999999999999999 Q ss_pred eeccCCCccHHHHHHhhhcccCceeEEeeeeEEEEeecCCCceEEehhHHHHHHHHhhhccccceeccCCceeccceecc Q lcl|NC_019932. 
155 YVSAWGCKTLSEAMAYRENFSQRELMVIWPDFISWNTTANQSETAYATARALGLRAKIDTDTGWHKTLSNVGVNGVTGIS 234 (389) Q Consensus 155 i~d~~~~~t~~~a~~~~~~~~s~~~~~~~p~~~~~~~~~~~~~~~p~s~~~Ag~~a~~d~~~g~~~span~~l~gv~~~~ 234 (389) ++|+|.+.+.+++++++++++|++.+++|||++++++.++..+++|||+++||++||+|.++|||+||+|+.++||.+++ T Consensus 161 ~~d~p~~~t~~~a~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~~AG~~a~~d~~~g~~~spaN~~i~gi~~~~ 240 (395) T protein:vir:98 161 YVSAWGCKTISEAMEYRKNFSQRELMVIWPDFLAWDTVKNTTATAYATARALGLRAYIDQTVGWHKTLSNVGVQGVTGIS 240 (395) T ss_pred EEEcCCCCCHHHHHHHHhccCCceEEEEecceeEecccCCceeeechHHHHHHHHHHhhcccCcEeccCCceeecccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccccccCCcchhhhhcccceEEEEcCCCEEEEcCccCCCCcccceeehhhHHHHHHHHHHHHHHHHhhcCCCHHHHHH Q lcl|NC_019932. 235 ASVFWDLQQTGTDADLLNEACVTTLIRKDGFRFWGNRTCSDDPLFAFENYTRTAQVIADTMAEAHMWANDKPLTPVLVRD 314 (389) Q Consensus 235 ~~~~~~~~~~~~~~~~l~~~~i~~~~~~~G~~~wG~rT~~~d~~~~~i~vrR~~~~i~~~~~~~~~~~v~e~n~~~~~~~ 314 (389) .++++.+++..+|+++||++||+++++++||++||+||+++||+|+||++||++++|+++|++.++|++||||++.+|++ T Consensus 241 ~~~~~~~~~~~~~~~~Ln~~gI~~~~~~~G~~~wG~rT~s~d~~~~~i~~rR~~~~i~~~i~~~~~~~v~e~~~~~~~~~ 320 (395) T protein:vir:98 241 ASVFWDLQASGTDADLLNEAGVTTLVRKDGFRFWGNRTCSDDPLFLFENYTRTAQVLADTMAEAHMWAVDKPITATLIRD 320 (395) T ss_pred eecccccCCCcchHHhhhhcCcEEEEcCCCEEEEcccccCCCcccceeehhhHHHHHHHHHHHHHHHhccCCCCHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHhCCceeeeEEEEecCCCCHHHhhCCEEEEEEEEEecccceEEEEEEEEcchHHHHHHHHhcC Q lcl|NC_019932. 
315 IIAGINAKFRELVSAGYLLGASCWYDDTANDKDTLKAGKLFIDYDYTPVPPLEDLTLRQRITDSYLANFAASVNS 389 (389) Q Consensus 315 i~~~i~~~l~~l~~~gal~g~~v~~d~~~n~~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~~~~~~~~ 389 (389) |+++++.||++||++|+|+||+++||+++||+++|++|+|+++|+++|++|+|||+|+++++++|+++||++|+| T Consensus 321 i~~~i~~~L~~l~~~g~l~g~~v~~d~~~nt~~~i~~G~~~~~i~~~p~~p~e~I~~~~~~~~~~~~~~~~~~~~ 395 (395) T protein:vir:98 321 IVDGINAKFRELKSNGYIVEGKCWFDEESNDKETLKAGKLYIDYDYTPVPPLESLTLRQRITDKYLVNLAESVNS 395 (395) T ss_pred HHHHHHHHHHHHHhCCceeceEEEEecCCCCHHHhhCCeEEEEEEEEecCCcceEEEEEEEchHHHHHHHHHhcC Confidence 999999999999999999999999999999999999999999999999999999999999999999999999999 No 5 >protein:vir:1845 Length: 392 # NCBI annotation: tail sheath protein FI # Family: family:all:115 # MgeID: mge:324 # MgeName: 186 # Cross-refs: genbank:acc:NP_052269;genbank:gi:9634076;genbank:GeneID:1262448 Probab=100.00 E-value=5e-112 Score=630.79 Aligned_cols=389 Identities=80% Similarity=1.234 Sum_probs=376.9 Q ss_pred CCCCCCCEEEEECCCCCcccccccccceeeeecccccccccccccccEEEecchhhhhhhcccchhHHHHHhhhcccCce Q lcl|NC_019932. 1 MSDYHHGVRVVEINDGTRTISTVSTAIVGMVCTADDADAAAFPLNEPVLLTNVLSAIGKAGKKGTLAAALQAIADQAKPV 80 (389) Q Consensus 1 M~~~~~GV~v~~v~~~~~~~~~v~t~v~~~~g~a~~~~~~~~~~~~~vli~~~~~~~~~~~~~gtl~~~v~~~~~~~~~~ 80 (389) |++|||||||+|+.+|++++.++++++++|+|+++..+...+|.++|+++++..++...+++++++..+++.++++++.. T Consensus 1 m~~~~~Gv~v~e~~~g~~~i~~~~tav~g~vgta~~~~~~~~~~~~p~~its~~~~~~~~g~~gtl~~al~~~~~ngg~~ 80 (392) T protein:vir:18 1 MSDFHHGTKVIEINDGTRVISTVATAIVGMVWTASDADAETFPLNEPVLITNVQSAIAKAGKKGTLSASLQAIADQSKPV 80 (392) T ss_pred CCCCCCCeEEEEcCCCceeeeccCcceeEEEEeccCCCCcccccccceEeechHHHHhhcCCCcchHHHHHHhhcccCce Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEEecccc---ccccccccccccccccccchhhHHHHHHhhhhhhhhhhhccccccchHHHHHHHHhhhhcCceeeec Q lcl|NC_019932. 
81 TVVVRVAEGA---TPAETTSNIIGTTDENGRYTGMKALLSAQTQLGVKPRILGVPGLDALEVSTALASIAQQLRAFAYVS 157 (389) Q Consensus 81 ~~v~~~~~~~---~~~~t~~~~~~~~d~~~~~tGl~a~~~~~~~~~~~p~~~~apg~~~~~v~~al~~~~~~~~~~~i~d 157 (389) +++++...+. +...+..+++++.+.++..+|++++.+++...+..|.++++||+++++|+++|.++|+++++++++| T Consensus 81 ~~vv~v~~~~~~~~~~~t~~dliG~~~~~~~~tg~~al~~~~~~~~~~p~il~ap~~~~~~v~~~l~~~~~~~~~~~~~d 160 (392) T protein:vir:18 81 TVVVRVAEGTGDDAEAQTTSNIIGGTDENGKYTGIKALLTAEAVTGVKPRILGVPGLDTQEVATALASVCISLRAFGYVS 160 (392) T ss_pred EEEecccccccccccccchhhheecccccchhhhHHHHHhhhhhhceeehhcccCccchHHHHHHHHHHHhhcCcEEEEe Confidence 9988765543 3456677888888888899999999999999999999999999999999999999999999999999 Q ss_pred cCCCccHHHHHHhhhcccCceeEEeeeeEEEEeecCCCceEEehhHHHHHHHHhhhccccceeccCCceeccceeccccc Q lcl|NC_019932. 158 AWGCKTLSEAMAYRENFSQRELMVIWPDFISWNTTANQSETAYATARALGLRAKIDTDTGWHKTLSNVGVNGVTGISASV 237 (389) Q Consensus 158 ~~~~~t~~~a~~~~~~~~s~~~~~~~p~~~~~~~~~~~~~~~p~s~~~Ag~~a~~d~~~g~~~span~~l~gv~~~~~~~ 237 (389) +|++.+.+++.+++++++|.+.+++|||++++++..+..+++|||+++||++|++|.++|||+||+|++|+||.++++++ T Consensus 161 ~~~~~~~~~a~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~~AG~~a~~d~~~g~~~spaN~~l~gi~~~~~~~ 240 (392) T protein:vir:18 161 AWGCKTISEAMAYRENFSQRELMVIWPDFLAWDTTANATATAYATARALGLRAYIDQTIGWHKTLSNVGVQGVTGISASV 240 (392) T ss_pred cCCCCCHHHHHHHHhhccCceEEEEeCceeeecccCCceEEechHHHHHHHHHhhhccCCceEccCCceeeceeecceec Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccccCCcchhhhhcccceEEEEcCCCEEEEcCccCCCCcccceeehhhHHHHHHHHHHHHHHHHhhcCCCHHHHHHHHH Q lcl|NC_019932. 
238 FWDLQQTGTDADLLNEACVTTLIRKDGFRFWGNRTCSDDPLFAFENYTRTAQVIADTMAEAHMWANDKPLTPVLVRDIIA 317 (389) Q Consensus 238 ~~~~~~~~~~~~~l~~~~i~~~~~~~G~~~wG~rT~~~d~~~~~i~vrR~~~~i~~~~~~~~~~~v~e~n~~~~~~~i~~ 317 (389) +++.++..+|+++||++||+++++++||++||+||+++||+|+||++||++++|+++|+++++|++||||++.+|++|++ T Consensus 241 ~~~~~~~~~~~~~Ln~~gI~t~~~~~G~~~wG~rT~~~d~~~~~i~~rR~~~~i~~~i~~~~~~~v~e~n~~~~~~~i~~ 320 (392) T protein:vir:18 241 FWDLQASGTDADLLNEAGVTTLVRKDGFRFWGNRTCSDDPLFLFENYTRTAQVLADTMAEAHMWAVDKPITASLIRDIVD 320 (392) T ss_pred ccccCCCcchhhhhhhcCceEEEcCCCEEEEcccccCCCcccceeehhhHHHHHHHHHHHHHHHhccCCCCHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHhCCceeeeEEEEecCCCCHHHhhCCEEEEEEEEEecccceEEEEEEEEcchHHHHHHHHhcC Q lcl|NC_019932. 318 GINAKFRELVSAGYLLGASCWYDDTANDKDTLKAGKLFIDYDYTPVPPLEDLTLRQRITDSYLANFAASVNS 389 (389) Q Consensus 318 ~i~~~l~~l~~~gal~g~~v~~d~~~n~~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~~~~~~~~ 389 (389) +++.||++||++|+|+||+++||+++||+++|++|+|+++|+++|++|+|||+|+++++++|+++|+++|+| T Consensus 321 ~i~~~L~~l~~~gal~g~~v~~d~~~nt~~~i~~G~~~~~v~~~p~~p~e~I~~~~~~~~~~~~~~~~~~~~ 392 (392) T protein:vir:18 321 GINAKFRELKSNGYIVDGECWFDEESNDKETLKAGKLYIDYDYTPVPPLESLTLRQRITDKYLVNLAESVNS 392 (392) T ss_pred HHHHHHHHHHhcCcccceEEEEecCCCCHHHhhCCeEEEEEEEEecCCcceEEEEEEEchHHHHHHHHHhcC Confidence 999999999999999999999999999999999999999999999999999999999999999999999999 No 6 >protein:vir:6079 Length: 396 # NCBI annotation: gpFI # Family: family:all:115 # MgeID: mge:126 # MgeName: WPhi # Cross-refs: genbank:acc:NP_878220;genbank:gi:33438919;genbank:GeneID:1457754 Probab=100.00 E-value=1.4e-111 Score=628.35 Aligned_cols=389 Identities=78% Similarity=1.238 Sum_probs=376.0 Q ss_pred CCCCCCCEEEEECCCCCcccccccccceeeeecccccccccccccccEEEecchhhhhhhcccchhHHHHHhhhcccCce Q lcl|NC_019932. 
1 MSDYHHGVRVVEINDGTRTISTVSTAIVGMVCTADDADAAAFPLNEPVLLTNVLSAIGKAGKKGTLAAALQAIADQAKPV 80 (389) Q Consensus 1 M~~~~~GV~v~~v~~~~~~~~~v~t~v~~~~g~a~~~~~~~~~~~~~vli~~~~~~~~~~~~~gtl~~~v~~~~~~~~~~ 80 (389) |++|+|||||+|++++++++..+++++++|+|+++.++...+|.++|+++++..++...+++++++..+++.++++++.. T Consensus 1 m~~~~~Gv~v~e~~~~~~~v~~~~tav~~fvGta~~~~~~~~p~~~p~~v~s~~~~~~~~g~~~tl~~a~~~~~~~gg~~ 80 (396) T protein:vir:60 1 MSDYHHGVQVLEINEGTRVISTVSTAIVGMVCTASDADAEIFPLNKPVLITNVQSAIAKAGKKGTLAASLQAIADQSKPV 80 (396) T ss_pred CCCCCCCeEEEEcCCCcccccccCceeEEEEecccccccccccCccCeEeechHHHHHhhcCcchhHHHHHHHhhccCce Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEEeccccccc------cccccccccccccccchhhHHHHHHhhhhhhhhhhhccccccchHHHHHHHHhhhhcCcee Q lcl|NC_019932. 81 TVVVRVAEGATPA------ETTSNIIGTTDENGRYTGMKALLSAQTQLGVKPRILGVPGLDALEVSTALASIAQQLRAFA 154 (389) Q Consensus 81 ~~v~~~~~~~~~~------~t~~~~~~~~d~~~~~tGl~a~~~~~~~~~~~p~~~~apg~~~~~v~~al~~~~~~~~~~~ 154 (389) +++++...+.+.. .+.....++.+.++..+|++++++.++..+..|.++.+||+++.+|++++.++|+++++++ T Consensus 81 ~~vv~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~tg~~al~~~~~~~~~~~~il~ap~~~~~~v~~al~~~~~~~~~~~ 160 (396) T protein:vir:60 81 TVVVRVEDGTGEDEETKLAQTVSNIIGTTDENGQYTGLKALLAAESVTGVKPRILGVPGLDTKEVAVALASVCQKLRAFG 160 (396) T ss_pred EEEEecccccccccccccccccccccccccccccccchhhhhhcccceeeeeeeccccccccHHHHHHHHHHhccCCeEE Confidence 9999886654432 3445667777778889999999999999999999999999999999999999999999999 Q ss_pred eeccCCCccHHHHHHhhhcccCceeEEeeeeEEEEeecCCCceEEehhHHHHHHHHhhhccccceeccCCceeccceecc Q lcl|NC_019932. 
155 YVSAWGCKTLSEAMAYRENFSQRELMVIWPDFISWNTTANQSETAYATARALGLRAKIDTDTGWHKTLSNVGVNGVTGIS 234 (389) Q Consensus 155 i~d~~~~~t~~~a~~~~~~~~s~~~~~~~p~~~~~~~~~~~~~~~p~s~~~Ag~~a~~d~~~g~~~span~~l~gv~~~~ 234 (389) ++|+|.+.+.+++++++++++|.+.++||||++++|+.++..+++|||+++||++|++|.++|+|+||||+.|+||.+++ T Consensus 161 i~d~p~~~~~~~a~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~~AG~~a~~d~~~g~~~spaN~~l~gi~~~~ 240 (396) T protein:vir:60 161 YISAWGCKTISEVKAYRQNFSQRELMVIWPDFLAWDTVASTTATAYATARALGLRAKIDQEQGWHKTLSNVGVNGVTGIS 240 (396) T ss_pred EEeCCCCCCHHHHHHHHhhcCCceEEEEeCceeeecccCCceeEEchhHHHHHHHHHhhhccCcEeCcCCceecceeece Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccccccCCcchhhhhcccceEEEEcCCCEEEEcCccCCCCcccceeehhhHHHHHHHHHHHHHHHHhhcCCCHHHHHH Q lcl|NC_019932. 235 ASVFWDLQQTGTDADLLNEACVTTLIRKDGFRFWGNRTCSDDPLFAFENYTRTAQVIADTMAEAHMWANDKPLTPVLVRD 314 (389) Q Consensus 235 ~~~~~~~~~~~~~~~~l~~~~i~~~~~~~G~~~wG~rT~~~d~~~~~i~vrR~~~~i~~~~~~~~~~~v~e~n~~~~~~~ 314 (389) .++++.+++.++|+++||++||+++|+++|+++||+||+++||+|+||++||+++||+++|++.+++++||||++.+|++ T Consensus 241 ~~~~~~~~~~~~~~~~Ln~~gI~~~~~~~G~~~wG~rT~~~d~~~~~i~~rR~~~~i~~~i~~~~~~~v~e~n~~~~~~~ 320 (396) T protein:vir:60 241 ASVFWDLQESGTDADLLNESGVTTLIRRDGFRFWGNRTCSDDPLFLFENYTRTAQVLADTMAEAHMWAVDKPITATLIRD 320 (396) T ss_pred eecccccCCCcchhhhhhhcCcEEEEcCCCEEEEcccccCCCcccceeehhhHHHHHHHHHHHHHHHhccCCCCHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHhCCceeeeEEEEecCCCCHHHhhCCEEEEEEEEEecccceEEEEEEEEcchHHHHHHHHhcC Q lcl|NC_019932. 
315 IIAGINAKFRELVSAGYLLGASCWYDDTANDKDTLKAGKLFIDYDYTPVPPLEDLTLRQRITDSYLANFAASVNS 389 (389) Q Consensus 315 i~~~i~~~l~~l~~~gal~g~~v~~d~~~n~~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~~~~~~~~ 389 (389) |+++++.||++||++|+|+||+++||+++||+++|++|+|+++|+++|++|+|||+++++++++||++||++|+| T Consensus 321 i~~~i~~~l~~l~~~gal~g~~~~~d~~~nt~~~i~~G~~~~~i~~~p~~pae~I~~~~~~~~~~~~~~~~~~~~ 395 (396) T protein:vir:60 321 IVDGINAKFRELKTNGYIVDATCWFSEESNDAETLKAGKLYIDYDYTPVPPLENLTLRQRITDKYLANLVTSVNS 395 (396) T ss_pred HHHHHHHHHHHHHhCCceeceEEEEecCCCCHHHhhCCEEEEEEEEEecCCcceEEEEEEEchHHHHHHHHHhhc Confidence 999999999999999999999999999999999999999999999999999999999999999999999999999 No 7 >protein:vir:79141 Length: 391 # NCBI annotation: major tail seath protein gpFI # Family: family:all:115 # MgeID: mge:1863 # MgeName: RSA1 # Cross-refs: genbank:acc:YP_001165276;genbank:gi:145708101;genbank:GeneID:5247152 Probab=100.00 E-value=1.1e-111 Score=628.82 Aligned_cols=389 Identities=70% Similarity=1.137 Sum_probs=380.3 Q ss_pred CCC-CCCCEEEEECCCCCcccccccccceeeeecccccccccccccccEEEecchhhhhhhcccchhHHHHHhhhcccCc Q lcl|NC_019932. 1 MSD-YHHGVRVVEINDGTRTISTVSTAIVGMVCTADDADAAAFPLNEPVLLTNVLSAIGKAGKKGTLAAALQAIADQAKP 79 (389) Q Consensus 1 M~~-~~~GV~v~~v~~~~~~~~~v~t~v~~~~g~a~~~~~~~~~~~~~vli~~~~~~~~~~~~~gtl~~~v~~~~~~~~~ 79 (389) |++ |+|||||+|+.++++++.++++++++|+|+++.++...+|+++|+++++..++...+++++++..+++.++++++. T Consensus 1 M~~~~~pGv~v~e~~~~~~~i~~~~tav~~~vg~a~~a~~~~~p~n~pv~iss~~~~~~~~g~~gtl~~al~~~~~~gg~ 80 (391) T protein:vir:79 1 MPTDYHHGVRVVELNDGTRPIRTIETAVAGIVCTADDADAATFPLDTPVLLTNPQAYIGKAGDKGTLAHTLDAITDQTNP 80 (391) T ss_pred CCCCCCCCeEEEECCCCcccccccCCceEEEEeecccccccccccccCEEeccHHHHHHhcCCccccchhhhhhhccccc Confidence 555 6899999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eEEEEEeccccccccccccccccccccccchhhHHHHHHhhhhhhhhhhhccccccchHHHHHHHHhhhhcCceeeeccC Q lcl|NC_019932. 
80 VTVVVRVAEGATPAETTSNIIGTTDENGRYTGMKALLSAQTQLGVKPRILGVPGLDALEVSTALASIAQQLRAFAYVSAW 159 (389) Q Consensus 80 ~~~v~~~~~~~~~~~t~~~~~~~~d~~~~~tGl~a~~~~~~~~~~~p~~~~apg~~~~~v~~al~~~~~~~~~~~i~d~~ 159 (389) .++++++....+...+..+..++.+.++..+|+++++++++..+..|.++++|++++.++++++.++|+++++++++|+| T Consensus 81 ~~~vv~~~~~~~~~~~~~~~~g~~~~~~~~tGl~~l~~~~~~~~~~p~~l~~p~~~~~~v~~al~~~~~~~~~~ai~d~p 160 (391) T protein:vir:79 81 LTVVVRVAGGASEAETTSNLIGTTNAAGRYTGMKALLTARNRFGVAPRILAVPGLDSLPVGTELVTIAQKLRAFAYLSAY 160 (391) T ss_pred ceeeeccccccccccccccccccccchhhhHHHhhhhhhhhhhcccchhhcCCccchhHHHHHHHHHHhhcCcEEEEECC Confidence 99999999999888888888888888899999999999999999999999999999999999999999999999999999 Q ss_pred CCccHHHHHHhhhcccCceeEEeeeeEEEEeecCCCceEEehhHHHHHHHHhhhccccceeccCCceeccceeccccccc Q lcl|NC_019932. 160 GCKTLSEAMAYRENFSQRELMVIWPDFISWNTTANQSETAYATARALGLRAKIDTDTGWHKTLSNVGVNGVTGISASVFW 239 (389) Q Consensus 160 ~~~t~~~a~~~~~~~~s~~~~~~~p~~~~~~~~~~~~~~~p~s~~~Ag~~a~~d~~~g~~~span~~l~gv~~~~~~~~~ 239 (389) .+.+.+++.++++.++|+++++||||++++++.++..+++|||+++||++||+|.++|||+||+|+.|+||.++++++++ T Consensus 161 ~~~t~~~a~~~~~~~~s~~~a~~~P~~~~~d~~~~~~~~~p~s~~~AG~~a~~D~~~g~~~spaN~~l~gi~~~~~~~~~ 240 (391) T protein:vir:79 161 GCQTKEEAVAYRSNFGQREAMVMWPDFVGWDTAANAETTLWATARAVGLRAKIDNDTGWHKTLSNVAVGGVTGLSRDVFW 240 (391) T ss_pred CCCCHHHHHHHHhccCCceeEEecceeeeecCcCCceeeechHHHHHHHHHHhhhcccceeccCCceehhhhcccccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccCCcchhhhhcccceEEEEcCCCEEEEcCccCCCCcccceeehhhHHHHHHHHHHHHHHHHhhcCCCHHHHHHHHHHH Q lcl|NC_019932. 
240 DLQQTGTDADLLNEACVTTLIRKDGFRFWGNRTCSDDPLFAFENYTRTAQVIADTMAEAHMWANDKPLTPVLVRDIIAGI 319 (389) Q Consensus 240 ~~~~~~~~~~~l~~~~i~~~~~~~G~~~wG~rT~~~d~~~~~i~vrR~~~~i~~~~~~~~~~~v~e~n~~~~~~~i~~~i 319 (389) ..++..+++++||++||+++++++||++||+||+++||+|+||++||++++|+++|+++++|++||||++.+|++|++++ T Consensus 241 ~~~~~~~~~~~Ln~~~I~t~~~~~G~~~wG~rT~~~d~~~~~i~~rR~~~~i~~~i~~~~~~~v~epn~~~~~~~i~~~i 320 (391) T protein:vir:79 241 DLQDPATDAGYLNANEVTTLVHRDGYRFWGSRTCSADPLFAFENYTRTAQVLADTMAEAHMWANDLPMTPTLVRDLLEGI 320 (391) T ss_pred ccccccchhhhhhhcCceEEECCCcEEEEcccccCCCcccceeehhhHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHhCCceeeeEEEEecCCCCHHHhhCCEEEEEEEEEecccceEEEEEEEEcchHHHHHHHHhcC Q lcl|NC_019932. 320 NAKFRELVSAGYLLGASCWYDDTANDKDTLKAGKLFIDYDYTPVPPLEDLTLRQRITDSYLANFAASVNS 389 (389) Q Consensus 320 ~~~l~~l~~~gal~g~~v~~d~~~n~~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~~~~~~~~ 389 (389) +.||++||++|+|+||+++||+++||+++|++|+|+++|+++|++|+|||+|+++++++|+++|+++|+| T Consensus 321 ~~~l~~l~~~g~l~g~~v~~~~~~nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~~~~~v~~ 390 (391) T protein:vir:79 321 NAKLRMLTRNGYLLGGAAWFDADANSKDTLKAGQLAIDYDYTPVPPLENLTFRQRITDRYLMQFAEAVKA 390 (391) T ss_pred HHHHHHHHhCCceeceEEEEecCCCCHHHhhCCEEEEEEEEEecCCcceEEEEEEEchHHHHHHHHHhhc Confidence 9999999999999999999999999999999999999999999999999999999999999999999999 No 8 >protein:vir:5711 Length: 396 # NCBI annotation: gpFI # Family: family:all:115 # MgeID: mge:120 # MgeName: L-413C # Cross-refs: genbank:acc:NP_839870;genbank:gi:30065725;genbank:GeneID:1260618 Probab=100.00 E-value=2.8e-111 Score=626.69 Aligned_cols=389 Identities=77% Similarity=1.224 Sum_probs=374.7 Q ss_pred CCCCCCCEEEEECCCCCcccccccccceeeeecccccccccccccccEEEecchhhhhhhcccchhHHHHHhhhcccCce Q lcl|NC_019932. 
1 MSDYHHGVRVVEINDGTRTISTVSTAIVGMVCTADDADAAAFPLNEPVLLTNVLSAIGKAGKKGTLAAALQAIADQAKPV 80 (389) Q Consensus 1 M~~~~~GV~v~~v~~~~~~~~~v~t~v~~~~g~a~~~~~~~~~~~~~vli~~~~~~~~~~~~~gtl~~~v~~~~~~~~~~ 80 (389) |++|+|||||+|+.++++++.++++++++|+|+++..+...+|.++|+++++..++...++..+++..+++.++++++.. T Consensus 1 m~~~~~GV~v~e~~~g~~~i~~v~tav~~~vg~a~~~d~~~~~~~~pv~i~s~~~~~~~~g~~~tl~~al~~~~~~~~~~ 80 (396) T protein:vir:57 1 MSDYHHGVQVLEINDGTRVISTVSTAIVGMVCTASDADAETFPLNKPVLITNVQSAIAKAGKKGTLAASLQAIADQSKPV 80 (396) T ss_pred CCCCCCceEEEEcCCCcccccccCCceEEEEEeccCCCcccccCccCeEeecchhhhhhcccccchHHHHHHhhhcCCce Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEEecccccc------ccccccccccccccccchhhHHHHHHhhhhhhhhhhhccccccchHHHHHHHHhhhhcCcee Q lcl|NC_019932. 81 TVVVRVAEGATP------AETTSNIIGTTDENGRYTGMKALLSAQTQLGVKPRILGVPGLDALEVSTALASIAQQLRAFA 154 (389) Q Consensus 81 ~~v~~~~~~~~~------~~t~~~~~~~~d~~~~~tGl~a~~~~~~~~~~~p~~~~apg~~~~~v~~al~~~~~~~~~~~ 154 (389) +++++...+... ..+..++++..+.++..+|++++.++++..+..|.++++|++++++++++|.++|+++++++ T Consensus 81 ~~vv~~~~~~~~~~~~~~a~t~~~iiG~~~~~~~~tgl~al~~~~~~~~~~p~i~~ap~~~~~~v~~al~~~~~~~~~~~ 160 (396) T protein:vir:57 81 TVVVRVEDGTGDDEETKLAQTVSNIIGTTDENGQYTGLKALMGAESVTGVKPRILGVPGLDTKEVAVALASVCQELNAFG 160 (396) T ss_pred eEeeeccccccccccccccccceeeeeeccccccchhhhhhhhcccceeEEeccccCcccchhHHHHHHHHHhhhCceEE Confidence 999887655432 33445666666678889999999999999999999999999999999999999999999999 Q ss_pred eeccCCCccHHHHHHhhhcccCceeEEeeeeEEEEeecCCCceEEehhHHHHHHHHhhhccccceeccCCceeccceecc Q lcl|NC_019932. 
155 YVSAWGCKTLSEAMAYRENFSQRELMVIWPDFISWNTTANQSETAYATARALGLRAKIDTDTGWHKTLSNVGVNGVTGIS 234 (389) Q Consensus 155 i~d~~~~~t~~~a~~~~~~~~s~~~~~~~p~~~~~~~~~~~~~~~p~s~~~Ag~~a~~d~~~g~~~span~~l~gv~~~~ 234 (389) ++|+|++.+.+++++++++++|.+.++|+||++++++.++..+++|||+++||++||+|.++|+|+||+|++|+||.+++ T Consensus 161 ~~d~p~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~~Ag~~a~~d~~~g~~~spaN~~l~gi~~~~ 240 (396) T protein:vir:57 161 YISAWGCKTISEVKAYRQNFSQRELMVIWPDFLAWDTVTSTTATAYATARALGLRAKIDQEQGWHKTLSNVGVNGVTGIS 240 (396) T ss_pred EEcCCCCCCHHHHHHHHhccCCceEEEEcceeeeecccCCceeEEehhHHHHHHHHHhhhccCcEeccCCceeccccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccccccCCcchhhhhcccceEEEEcCCCEEEEcCccCCCCcccceeehhhHHHHHHHHHHHHHHHHhhcCCCHHHHHH Q lcl|NC_019932. 235 ASVFWDLQQTGTDADLLNEACVTTLIRKDGFRFWGNRTCSDDPLFAFENYTRTAQVIADTMAEAHMWANDKPLTPVLVRD 314 (389) Q Consensus 235 ~~~~~~~~~~~~~~~~l~~~~i~~~~~~~G~~~wG~rT~~~d~~~~~i~vrR~~~~i~~~~~~~~~~~v~e~n~~~~~~~ 314 (389) +.++++.++.++|+++||++||+++++++||++||+||+++||+|+||++||++++|+++|+++++|++||||++.+|++ T Consensus 241 ~~~~~~~~~~~~~~~~Ln~~gi~t~~~~~G~~~wG~rT~~~d~~~~~i~vrR~~~~i~~~i~~~~~~~v~e~n~~~~~~~ 320 (396) T protein:vir:57 241 ASVFWDLQKPGTDADLLNEAGVTTLVRRDGFRFWGNRTCSDDPLFLFESYTRTAQVLADTMAEAHMWAIDKPITATLIRD 320 (396) T ss_pred eecccccCCcchhhhhhhhcCcEEEEcCCCEEEEcccccCCCcccceeehhhHHHHHHHHHHHHHHHhccCCCCHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHhCCceeeeEEEEecCCCCHHHhhCCEEEEEEEEEecccceEEEEEEEEcchHHHHHHHHhcC Q lcl|NC_019932. 
315 IIAGINAKFRELVSAGYLLGASCWYDDTANDKDTLKAGKLFIDYDYTPVPPLEDLTLRQRITDSYLANFAASVNS 389 (389) Q Consensus 315 i~~~i~~~l~~l~~~gal~g~~v~~d~~~n~~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~~~~~~~~ 389 (389) |+++++.||++||++|+|+||+++||+++||+++|++|+|+++|+++|++|+|||+|+++++++|+++|+++|+| T Consensus 321 i~~~i~~~l~~l~~~gal~g~~v~~d~~~n~~~~i~~G~~~~~v~~~p~~p~e~I~~~~~~~~~~~~~~~~~~~~ 395 (396) T protein:vir:57 321 IIDGINAKFRELKNNGYIVDGTCWFSEESNDAETLKAGKLYIDYDYTPVPPLENLTLRQRITSRYLASLVTSVNS 395 (396) T ss_pred HHHHHHHHHHHHHhCCceeceEEEEecCCCCHHHhhCCeEEEEEEEEecCCcceEEEEEEEchHHHHHHHHHhhc Confidence 999999999999999999999999999999999999999999999999999999999999999999999999999 No 9 >protein:vir:1172 Length: 391 # NCBI annotation: hypothetical protein # Family: family:all:115 # MgeID: mge:24 # MgeName: phi CTX # Cross-refs: genbank:acc:NP_490621;genbank:gi:17313241;genbank:GeneID:927300 Probab=100.00 E-value=1.7e-111 Score=627.83 Aligned_cols=389 Identities=70% Similarity=1.095 Sum_probs=381.1 Q ss_pred CCCCCCCEEEEECCCCCcccccccccceeeeecccccccccccccccEEEecchhhhhhhcccchhHHHHHhhhcccCce Q lcl|NC_019932. 1 MSDYHHGVRVVEINDGTRTISTVSTAIVGMVCTADDADAAAFPLNEPVLLTNVLSAIGKAGKKGTLAAALQAIADQAKPV 80 (389) Q Consensus 1 M~~~~~GV~v~~v~~~~~~~~~v~t~v~~~~g~a~~~~~~~~~~~~~vli~~~~~~~~~~~~~gtl~~~v~~~~~~~~~~ 80 (389) |+.|+|||||+|+.++++++..+++++++|+|++++++...+|.++|+++++..++...+++++++..+++.++++++.. T Consensus 3 ~~~~~~GV~v~e~~~~~~~i~~v~tavig~vg~a~~a~~~~~~~~~p~~v~s~~~~~~~~g~~~tl~~al~~~~~~~g~~ 82 (391) T protein:vir:11 3 ADQYHHGVRVQEINDGTRPIRTIATAIIGLVATAEDADATAFPLDTPVLITNVQAAIGKAGTSGTLPASLQAIADQANAA 82 (391) T ss_pred CCcCCCcEEEEECCCCcceecccCCceeEEEEecCCCCCccccccccEEEecchhhheecCCCccchhhhhhhhccccce Confidence 44578999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEEeccccccccccccccccccccccchhhHHHHHHhhhhhhhhhhhccccccchHHHHHHHHhhhhcCceeeeccCC Q lcl|NC_019932. 
81 TVVVRVAEGATPAETTSNIIGTTDENGRYTGMKALLSAQTQLGVKPRILGVPGLDALEVSTALASIAQQLRAFAYVSAWG 160 (389) Q Consensus 81 ~~v~~~~~~~~~~~t~~~~~~~~d~~~~~tGl~a~~~~~~~~~~~p~~~~apg~~~~~v~~al~~~~~~~~~~~i~d~~~ 160 (389) ++++++.++.+...+..+..++.+.....+|+++++++++..+..|.++.+|++++++++++|.++|+++++++++|.|+ T Consensus 83 ~~vv~~~~~~~~~~t~~d~~g~~~a~~~~~g~~a~~~~~~~~~~~p~~~~ap~~~~~~v~~al~~~~~~~~~~~i~D~p~ 162 (391) T protein:vir:11 83 TVVVRVKPGEDEAATNSAVIGGVSADGKYTGMKALLAAKARLGVVPRILGVPGLDTQPVATALIAIAQQLRAFAYVSASG 162 (391) T ss_pred eEEeeecccccccccchhhhcccccccchhhhhhhhhhhhhheeccccccccccccHHHHHHHHHhhcccceEEEEEcCC Confidence 99999999999999999999999998999999999999999999999999999999999999999999999999999999 Q ss_pred CccHHHHHHhhhcccCceeEEeeeeEEEEeecCCCceEEehhHHHHHHHHhhhccccceeccCCceeccceecccccccc Q lcl|NC_019932. 161 CKTLSEAMAYRENFSQRELMVIWPDFISWNTTANQSETAYATARALGLRAKIDTDTGWHKTLSNVGVNGVTGISASVFWD 240 (389) Q Consensus 161 ~~t~~~a~~~~~~~~s~~~~~~~p~~~~~~~~~~~~~~~p~s~~~Ag~~a~~d~~~g~~~span~~l~gv~~~~~~~~~~ 240 (389) +.+.+++++++++++|++.++||||++++++..+..+++|||+++||++||+|.++|||+||||+.|+||.+++.++.++ T Consensus 163 ~~t~~~a~~~r~~~~s~~~~~~~p~~~~~~~~~~~~~~~p~s~~~ag~~a~~d~~~g~~~span~~l~gi~~~~~~~~~~ 242 (391) T protein:vir:11 163 CKTKEEATAYRENFAAREAMVIWPDFLTWSTVVNQTVPAPAVAQALGLRARIDQEVGWHKTLSNVAVNGVTGISADVFWD 242 (391) T ss_pred CCCHHHHHHHhhhcCCceEEEEcCcceecccccCceEEechHHHHHHHHHHhhccCCcEEccCCceeeceeecccccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccCCcchhhhhcccceEEEEcCCCEEEEcCccCCCCcccceeehhhHHHHHHHHHHHHHHHHhhcCCCHHHHHHHHHHHH Q lcl|NC_019932. 
241 LQQTGTDADLLNEACVTTLIRKDGFRFWGNRTCSDDPLFAFENYTRTAQVIADTMAEAHMWANDKPLTPVLVRDIIAGIN 320 (389) Q Consensus 241 ~~~~~~~~~~l~~~~i~~~~~~~G~~~wG~rT~~~d~~~~~i~vrR~~~~i~~~~~~~~~~~v~e~n~~~~~~~i~~~i~ 320 (389) .++.++|+++||++||+++++++||++||+||+++||+|+||++||+|++|+++|++.++|++||||++.+|++|+++++ T Consensus 243 ~~~~~~~~~~Ln~~gi~~~~~~~G~~~wG~rT~~~d~~~~~i~vrR~~~~i~~~~~~~~~~~v~e~n~~~~~~~i~~~i~ 322 (391) T protein:vir:11 243 LQSPSTDANYLNENEVTTLVQEGGFRFWGSRTCSDDPLFAFENYTRTAQVLADTIAEAHMWAVDKPMHPSLVRDILEGVN 322 (391) T ss_pred cCCCcchhhhhhhcCcEEEEcCCCEEEEcccccCCCcccceeehhhHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHhCCceeeeEEEEecCCCCHHHhhCCEEEEEEEEEecccceEEEEEEEEcchHHHHHHHHhcC Q lcl|NC_019932. 321 AKFRELVSAGYLLGASCWYDDTANDKDTLKAGKLFIDYDYTPVPPLEDLTLRQRITDSYLANFAASVNS 389 (389) Q Consensus 321 ~~l~~l~~~gal~g~~v~~d~~~n~~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~~~~~~~~ 389 (389) .||++||++|+|+||+++||+++||+++|++|+|+++|+++|++|+|||+++++++++||++|+++|+| T Consensus 323 ~~l~~l~~~g~l~g~~~~~~~~~n~~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~~~~~~~a 391 (391) T protein:vir:11 323 AKFRELKGLGLIIDAQAWYDPNVNDKDTLKAGKLRITYDYTPVPPLEDLTFFQKITDSYLVDFASRVNA 391 (391) T ss_pred HHHHHHHhccceeceEEEEecCCCCHHHhhCCeEEEEEEEEecCCcceEEEEEEEchHHHHHHHHHhcC Confidence 999999999999999999999999999999999999999999999999999999999999999999999 No 10 >protein:vir:2035 Length: 396 # NCBI annotation: gpFI # Family: family:all:115 # MgeID: mge:315 # MgeName: P2 # Cross-refs: genbank:acc:NP_046778;genbank:gi:9630349;genbank:GeneID:1261516 Probab=100.00 E-value=2.5e-111 Score=626.93 Aligned_cols=389 Identities=77% Similarity=1.235 Sum_probs=374.2 Q ss_pred CCCCCCCEEEEECCCCCcccccccccceeeeecccccccccccccccEEEecchhhhhhhcccchhHHHHHhhhcccCce Q lcl|NC_019932. 
1 MSDYHHGVRVVEINDGTRTISTVSTAIVGMVCTADDADAAAFPLNEPVLLTNVLSAIGKAGKKGTLAAALQAIADQAKPV 80 (389) Q Consensus 1 M~~~~~GV~v~~v~~~~~~~~~v~t~v~~~~g~a~~~~~~~~~~~~~vli~~~~~~~~~~~~~gtl~~~v~~~~~~~~~~ 80 (389) |++|+|||||+|+.++++++..+++++++|+|++++++...+|+++|+++++..++...++..++|...++.++++++.. T Consensus 1 m~~~~~GV~v~e~~~g~~~i~~v~tav~~~vg~a~~a~~~~~~l~~pvlvts~~~~~~~~g~~~tL~~al~~~~~ngg~~ 80 (396) T protein:vir:20 1 MSDYHHGVQVLEINEGTRVISTVSTAIVGMVCTASDADAETFPLNKPVLITNVQSAISKAGKKGTLAASLQAIADQSKPV 80 (396) T ss_pred CCCCCCCeEEEEcCCCcceeeecCCceeEEEeeeccCCCccccCccCEEeechHHHHhhcccccchhhhhhhhhccCcee Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEEeccccccc------cccccccccccccccchhhHHHHHHhhhhhhhhhhhccccccchHHHHHHHHhhhhcCcee Q lcl|NC_019932. 81 TVVVRVAEGATPA------ETTSNIIGTTDENGRYTGMKALLSAQTQLGVKPRILGVPGLDALEVSTALASIAQQLRAFA 154 (389) Q Consensus 81 ~~v~~~~~~~~~~------~t~~~~~~~~d~~~~~tGl~a~~~~~~~~~~~p~~~~apg~~~~~v~~al~~~~~~~~~~~ 154 (389) +++++...+.... .+.....+..+..+..+|++++.++++..+..|.++++|++++++|+++|.++|+++++++ T Consensus 81 ~~v~~~~~~~~~~~~~~~a~t~~~~~~~~~~~~~~tg~~al~~~~~~~~~~p~i~~ap~~~~~~v~~al~~~~~~~~~~~ 160 (396) T protein:vir:20 81 TVVMRVEDGTGDDEETKLAQTVSNIIGTTDENGQYTGLKAMLAAESVTGVKPRILGVPGLDTKEVAVALASVCQKLRAFG 160 (396) T ss_pred EEEEeccccccccccccccccccccccccccccccchhhhhhhhccccccchhhhhhhhhccHHHHHHHHHHHhcCCcEE Confidence 9998876554432 3444566666667888999999999999999999999999999999999999999999999 Q ss_pred eeccCCCccHHHHHHhhhcccCceeEEeeeeEEEEeecCCCceEEehhHHHHHHHHhhhccccceeccCCceeccceecc Q lcl|NC_019932. 
155 YVSAWGCKTLSEAMAYRENFSQRELMVIWPDFISWNTTANQSETAYATARALGLRAKIDTDTGWHKTLSNVGVNGVTGIS 234 (389) Q Consensus 155 i~d~~~~~t~~~a~~~~~~~~s~~~~~~~p~~~~~~~~~~~~~~~p~s~~~Ag~~a~~d~~~g~~~span~~l~gv~~~~ 234 (389) ++|+|.+.+.+++++++++++|.+.++||||++++|+.++..+++|||+++||++||+|.++|+|+||||++|+||.+++ T Consensus 161 ~iD~p~~~~~~~a~~~r~~~~s~~~~~~~P~~~~~d~~~~~~~~~p~s~~~Ag~~a~~d~~~g~~~spaN~~l~gi~~~~ 240 (396) T protein:vir:20 161 YISAWGCKTISEVKAYRQNFSQRELMVIWPDFLAWDTVTSTTATAYATARALGLRAKIDQEQGWHKTLSNVGVNGVTGIS 240 (396) T ss_pred EEecCCCCCHHHHHHHhhCCCCceEEEEcCccccccCcCCcceeechhHHHHHHHHHhhhhcCcEeccCCceeccceecc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccccccCCcchhhhhcccceEEEEcCCCEEEEcCccCCCCcccceeehhhHHHHHHHHHHHHHHHHhhcCCCHHHHHH Q lcl|NC_019932. 235 ASVFWDLQQTGTDADLLNEACVTTLIRKDGFRFWGNRTCSDDPLFAFENYTRTAQVIADTMAEAHMWANDKPLTPVLVRD 314 (389) Q Consensus 235 ~~~~~~~~~~~~~~~~l~~~~i~~~~~~~G~~~wG~rT~~~d~~~~~i~vrR~~~~i~~~~~~~~~~~v~e~n~~~~~~~ 314 (389) +++.+.+++.++|+++||++||+++++++||++||+||+++||+|+||++||+++||+++|++.++|++||||++.+|++ T Consensus 241 ~~~~~~~~~~~~~~~~Ln~~gi~~~~~~~G~~~wG~rT~s~d~~~~~i~~rR~~~~i~~~~~~~~~~~v~e~~~~~~~~~ 320 (396) T protein:vir:20 241 ASVFWDLQESGTDADLLNESGVTTLIRRDGFRFWGNRTCSDDPLFLFENYTRTAQVVADTMAEAHMWAVDKPITATLIRD 320 (396) T ss_pred eecccccCCCcchhhhhhhcCcEEEEcCCCEEEEcccccCCCcccceeehhhHHHHHHHHHHHHHHHhccCCCCHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHhCCceeeeEEEEecCCCCHHHhhCCEEEEEEEEEecccceEEEEEEEEcchHHHHHHHHhcC Q lcl|NC_019932. 
315 IIAGINAKFRELVSAGYLLGASCWYDDTANDKDTLKAGKLFIDYDYTPVPPLEDLTLRQRITDSYLANFAASVNS 389 (389) Q Consensus 315 i~~~i~~~l~~l~~~gal~g~~v~~d~~~n~~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~~~~~~~~ 389 (389) |+++++.||++||++|+|+||+++||+++||+++|++|+|+++|+++|++|+|||+|+++++++||++|+++|+| T Consensus 321 i~~~i~~~L~~l~~~G~l~g~~v~~d~~~nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~~~~~~~~ 395 (396) T protein:vir:20 321 IVDGINAKFRELKTNGYIVDATCWFSEESNDAETLKAGKLYIDYDYTPVPPLENLTLRQRITDKYLANLVTSVNS 395 (396) T ss_pred HHHHHHHHHHHHHhCcceeceEEEEecCCCCHHHhhCCEEEEEEEEEecCCcceEEEEEEEchHHHHHHHHHhhc Confidence 999999999999999999999999999999999999999999999999999999999999999999999999999 No 11 >protein:vir:100323 Length: 393 # NCBI annotation: tail sheath protein FI # Family: family:all:115 # MgeID: mge:1484 # MgeName: phi-MhaA1-PHL101 # Cross-refs: genbank:acc:YP_655492;genbank:gi:109289960;genbank:GeneID:4157366 Probab=100.00 E-value=2.4e-110 Score=621.60 Aligned_cols=387 Identities=46% Similarity=0.769 Sum_probs=371.6 Q ss_pred CCC-CCCCEEEEECCCCCcccccccccceeeeecccccccccccccccEEEecchhhhhhhcccchhHHHHHhhhcccCc Q lcl|NC_019932. 1 MSD-YHHGVRVVEINDGTRTISTVSTAIVGMVCTADDADAAAFPLNEPVLLTNVLSAIGKAGKKGTLAAALQAIADQAKP 79 (389) Q Consensus 1 M~~-~~~GV~v~~v~~~~~~~~~v~t~v~~~~g~a~~~~~~~~~~~~~vli~~~~~~~~~~~~~gtl~~~v~~~~~~~~~ 79 (389) |++ |+|||||+|+.++++++.++++++++|+|++++++...+|+++|+++++..++...+++.+++..++..++++++. T Consensus 3 m~~~~~~GV~v~e~~~g~~~i~~~~tav~~~vgta~~~~~~~~pln~pv~i~s~~~~~~~~g~~g~L~~al~~~~~~~~~ 82 (393) T protein:vir:10 3 ILDTYLHGVEVVEVNAGGVTISTAATSVIGVVCTGDQADAETFPLNTPVLITNPLNYLEKAGSTGTLRRTLNSIGSIVKT 82 (393) T ss_pred CCCccCCCeEEEEcCCCcceecccCcceeEEEeeccCcCcccccCccceEecchHHHHHhhCCccchhhhhhhhhcccCc Confidence 555 6899999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eEEEEEeccccccccccccccccccccccchhhHHHHHHhhhhhhhhhhhccccccchHHHHHHHHhhhhcCceeeeccC Q lcl|NC_019932. 
80 VTVVVRVAEGATPAETTSNIIGTTDENGRYTGMKALLSAQTQLGVKPRILGVPGLDALEVSTALASIAQQLRAFAYVSAW 159 (389) Q Consensus 80 ~~~v~~~~~~~~~~~t~~~~~~~~d~~~~~tGl~a~~~~~~~~~~~p~~~~apg~~~~~v~~al~~~~~~~~~~~i~d~~ 159 (389) .++++++.+..+...+..++++..+ ++..+|+++++++++..+..|+++++||+++.+++++|.++|++++++++++.+ T Consensus 83 ~~~vv~v~~~~~~~~t~~~iig~~~-~~~~tgl~al~~~~~~~~~~p~li~apg~~~~~~~~al~~~~~~~~~~~~v~d~ 161 (393) T protein:vir:10 83 PTVIVRVAESDDSDTLTANIVGTQE-NGKFTGIKALLTAQSTVFVKPKLLCVPQHDNQAVATELLSVAKKLNAFAFISDN 161 (393) T ss_pred eEEEeecccCccccccccccccccc-cchhhHHHHHHhhhhhcceeeeeeeeccccchHHHHHHHHHhhccCcEEEEEcC Confidence 9999999999888888888877544 567899999999999999999999999999999999999999999999888888 Q ss_pred CCccHHHHHHhhhcccCceeEEeeeeEEEEeecCCCceEEehhHHHHHHHHhhhccccceeccCCceeccceeccccccc Q lcl|NC_019932. 160 GCKTLSEAMAYRENFSQRELMVIWPDFISWNTTANQSETAYATARALGLRAKIDTDTGWHKTLSNVGVNGVTGISASVFW 239 (389) Q Consensus 160 ~~~t~~~a~~~~~~~~s~~~~~~~p~~~~~~~~~~~~~~~p~s~~~Ag~~a~~d~~~g~~~span~~l~gv~~~~~~~~~ 239 (389) +..+.++++.+++.++|.+.++||||+++|++.++..+++|||+++||++|++|.++|||+||||+.|.||.++++.+++ T Consensus 162 ~~~t~~~ai~~~~~~~s~~~~~~~P~~~~~d~~~~~~~~~p~s~~~Ag~~a~~d~~~G~~~spaN~~l~gi~~~~~~~~~ 241 (393) T protein:vir:10 162 GATTKEQAYTYRQNFSQREGMMIFGDWKSYNTDKKAYDTDYAVARACALQAYIDKTVGWHKNISNVELDGVTGITKAVEF 241 (393) T ss_pred CCCCHHHHHHHhhhcCCceEEEEecccccccccCCceeEeehhHHHHHHHHHhhcCCCcEEccCCceeeceeecceeccc Confidence 88899999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccCCcchhhhhcccceEEEEcCCCEEEEcCccCCCCcccceeehhhHHHHHHHHHHHHHHHHhhcCCCHHHHHHHHHHH Q lcl|NC_019932. 
240 DLQQTGTDADLLNEACVTTLIRKDGFRFWGNRTCSDDPLFAFENYTRTAQVIADTMAEAHMWANDKPLTPVLVRDIIAGI 319 (389) Q Consensus 240 ~~~~~~~~~~~l~~~~i~~~~~~~G~~~wG~rT~~~d~~~~~i~vrR~~~~i~~~~~~~~~~~v~e~n~~~~~~~i~~~i 319 (389) .+++.++|+++||++||+++++++||++||+||+++||+|+||++|||+++|+++|++.++|++||||++.+|++|++++ T Consensus 242 ~~~~~~~~~~~Ln~~gI~t~~~~~G~~~wG~rT~s~d~~~~~i~vrR~~~~i~~~i~~~~~~~v~e~~~~~~~~~i~~~i 321 (393) T protein:vir:10 242 DINESSTEANYLNEKGITICLNHNGFRYWGSRTLATDTRWAFQQSVRTAQIIKETIGAGLAWAVDMPLTPLRVKTMLEAI 321 (393) T ss_pred ccCCCcchhHhHhhcCceEEEcCCCEEEEcccccCCCcccceeehhhHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHhCC--ceeeeEEEEecCCCCHHHhhCCEEEEEEEEEecccceEEEEEEEEcchHHHHHHHHhcC Q lcl|NC_019932. 320 NAKFRELVSAG--YLLGASCWYDDTANDKDTLKAGKLFIDYDYTPVPPLEDLTLRQRITDSYLANFAASVNS 389 (389) Q Consensus 320 ~~~l~~l~~~g--al~g~~v~~d~~~n~~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~~~~~~~~ 389 (389) +.||++||++| +|.|++++||++ ||+++|++|+|+++|+++|++|+|||+|+++++++|+++|+++|+| T Consensus 322 ~~~L~~l~~~g~~al~g~~v~~~~~-nt~~~i~~G~~~~~i~~~p~~p~e~I~~~~~~~~~~~~~l~~~v~a 392 (393) T protein:vir:10 322 NNKLRSWASGDDPRILGARVWVAEE-ITADIIKSGKFVIKYDYHWIPSLESLGLEQRVNDEYVVDLVNTLKA 392 (393) T ss_pred HHHHHHHHhccccccccceEEecCC-CCHHHhhCCEEEEEEEEEecCCcceEEEEEEEchHHHHHHHHHHhc Confidence 99999999865 899999999875 8889999999999999999999999999999999999999999999 No 12 >protein:vir:10336 Length: 386 # NCBI annotation: ORF39 # Family: family:all:115 # MgeID: mge:182 # MgeName: VHML # Cross-refs: genbank:acc:NP_758931;genbank:gi:27311206;genbank:GeneID:956116 Probab=100.00 E-value=3e-105 Score=593.60 Aligned_cols=383 Identities=38% Similarity=0.649 Sum_probs=367.8 Q ss_pred CCC-CCCCEEEEECCCCCcccccccccceeeeecccccccccccccccEEEecchhhhhhhcccchhHHHHHhhhcccCc Q lcl|NC_019932. 
1 MSD-YHHGVRVVEINDGTRTISTVSTAIVGMVCTADDADAAAFPLNEPVLLTNVLSAIGKAGKKGTLAAALQAIADQAKP 79 (389) Q Consensus 1 M~~-~~~GV~v~~v~~~~~~~~~v~t~v~~~~g~a~~~~~~~~~~~~~vli~~~~~~~~~~~~~gtl~~~v~~~~~~~~~ 79 (389) |++ |+|||||+|+.++++++.++++++++|+|+++.++...+|.++|+++++..++...+++.+++..++..++.+++. T Consensus 1 M~~~~~~Gv~v~ev~~~~~~i~~v~tav~~~vg~a~~a~~~~~~~~~pv~i~s~~~~~~~~g~~~tl~~a~~~~~~~gg~ 80 (386) T protein:vir:10 1 MAEQYLHGAEVVEIDNGARPIRTAQSGVIGLVGTAPDADATAFPLNTPVLIAGSRREAAKLGAGGTLPQAIDGIFDQTGA 80 (386) T ss_pred CccccCCCeEEEEcCCCcccccccCcceeEEEEecCCCCCcccccccceEecchHHHHhhcCCCcchhHHHHHHhccCce Confidence 997 5799999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eEEEEEeccccccccccccccccccc-cccchhhHHHHHHhhhhhhhhhhhccccccch-HHHHHHHHhhhhcCceeeec Q lcl|NC_019932. 80 VTVVVRVAEGATPAETTSNIIGTTDE-NGRYTGMKALLSAQTQLGVKPRILGVPGLDAL-EVSTALASIAQQLRAFAYVS 157 (389) Q Consensus 80 ~~~v~~~~~~~~~~~t~~~~~~~~d~-~~~~tGl~a~~~~~~~~~~~p~~~~apg~~~~-~v~~al~~~~~~~~~~~i~d 157 (389) .++++++.++.+...+..+.+++.+. +...+|++++.+.+...+..|.++.+|++++. ++.+++.+++.++..+.+.+ T Consensus 81 ~~~vv~~~~~~~~~~t~~~~ig~~~~~t~~~tgl~~l~~~~~~~~~~p~i~~ap~~~~~~~v~~~l~~~~~~~~~~~~~~ 160 (386) T protein:vir:10 81 VVVVIRVDEGVDSAATQSNVIGKVDADTEQYTGILALLSAENTVKVQPRILIAPGFSNQKAVADQLVSVADTAAWLCHSG 160 (386) T ss_pred eEEEeeccccccccccchhhhcccccccchhhhhHHhhhhcccccccccccccccccchhHHHHHHHHhhcceEEEEEeC Confidence 99999999999988888888777774 77889999999999999999999999999875 58999999999999888888 Q ss_pred cCCCccHHHHHHhhhcccCceeEEeeeeEEEEeecCCCceEEehhHHHHHHHHhhhccccceeccCCceeccceeccccc Q lcl|NC_019932. 
158 AWGCKTLSEAMAYRENFSQRELMVIWPDFISWNTTANQSETAYATARALGLRAKIDTDTGWHKTLSNVGVNGVTGISASV 237 (389) Q Consensus 158 ~~~~~t~~~a~~~~~~~~s~~~~~~~p~~~~~~~~~~~~~~~p~s~~~Ag~~a~~d~~~g~~~span~~l~gv~~~~~~~ 237 (389) ++ ..+.+++.++++.++|.+.++||||+++|++.++..+++|||+++||++||+|.++|||+||+|++|.||.++++++ T Consensus 161 ~~-~~~~~~a~~~~~~~~s~~~~~~~p~~~v~~~~~~~~~~~p~s~~~ag~~a~~D~~~G~~~spaN~~l~gv~~~~~~~ 239 (386) T protein:vir:10 161 WS-NTTDAAAITYRELFGSRRCEVVDPWYKVWDVETSAHIIQPPSARHAGVMAKVHNTLGFWWSNSNQEILGIDGLCRPV 239 (386) T ss_pred CC-CCchHHHHHhhhcccccceEEecCceeeeccccccceeechHHHHHHHHHHhhhcCCcEEccCCceeecccccceec Confidence 75 55778899999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccccCCcchhhhhcccceEEEEcCCCEEEEcCccCCCCcccceeehhhHHHHHHHHHHHHHHHHhhcCCCHHHHHHHHH Q lcl|NC_019932. 238 FWDLQQTGTDADLLNEACVTTLIRKDGFRFWGNRTCSDDPLFAFENYTRTAQVIADTMAEAHMWANDKPLTPVLVRDIIA 317 (389) Q Consensus 238 ~~~~~~~~~~~~~l~~~~i~~~~~~~G~~~wG~rT~~~d~~~~~i~vrR~~~~i~~~~~~~~~~~v~e~n~~~~~~~i~~ 317 (389) .++.+++++|+++||++||+++|+++|+++||+||+++||+|+||++||++++|+++|+++++|++||||++.+|++|++ T Consensus 240 ~~~~~~~~~~~~~l~~~gi~~~~~~~G~~~wG~rT~~~d~~~~~i~vrR~~~~i~~~~~~~~~~~v~e~~~~~~~~~i~~ 319 (386) T protein:vir:10 240 DFKLDDPTCRANLLNAKEVTTTIQQNGFRVWGDRTCSADSKWAFKNVVITNDMIADSLVRNHLWAVDRNITKTYVEDVTE 319 (386) T ss_pred ccccccCcchhhhhhhcCcEEEEcCCCEEEEcccccCCCcccceeehhhHHHHHHHHHHHHHHHhccCCCCHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHhCCceeeeEEEEecCCCCHHHhhCCEEEEEEEEEecccceEEEEEEEEcchHHHHHH Q lcl|NC_019932. 
318 GINAKFRELVSAGYLLGASCWYDDTANDKDTLKAGKLFIDYDYTPVPPLEDLTLRQRITDSYLANFA 384 (389) Q Consensus 318 ~i~~~l~~l~~~gal~g~~v~~d~~~n~~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~~~ 384 (389) +++.||++||++|+|+||+|+||+++||++++++|+|+++|+++|++|+|||+|+++++++||++|+ T Consensus 320 ~i~~~L~~l~~~g~l~g~~v~~d~~~nt~~~~~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~~~ 386 (386) T protein:vir:10 320 GVNNYLRHLKNIGAIAGGECWVDPELNSPDQIQQGKVYFDYDFSAYAPAEHITFRSHMVNGYLTEVV 386 (386) T ss_pred HHHHHHHHHHhCCceeeeEEEEcccCCCHHHhhCCeEEEEEEEEecCCceeEEEEEEEehhHHHhhC Confidence 9999999999999999999999999999999999999999999999999999999999999999999 No 13 >protein:vir:107865 Length: 477 # NCBI annotation: gp39 # Family: family:all:115 # MgeID: mge:1565 # MgeName: BcepMu # Cross-refs: genbank:acc:YP_024712;genbank:gi:48696949;genbank:GeneID:2845953 Probab=100.00 E-value=8.1e-100 Score=563.85 Aligned_cols=382 Identities=34% Similarity=0.508 Sum_probs=345.5 Q ss_pred CCC-CCCCEEEEECCCCCcccccccccceeeeecccccccccccccccEEEecchhhhhhhc--ccchhHHHHHhhhccc Q lcl|NC_019932. 1 MSD-YHHGVRVVEINDGTRTISTVSTAIVGMVCTADDADAAAFPLNEPVLLTNVLSAIGKAG--KKGTLAAALQAIADQA 77 (389) Q Consensus 1 M~~-~~~GV~v~~v~~~~~~~~~v~t~v~~~~g~a~~~~~~~~~~~~~vli~~~~~~~~~~~--~~gtl~~~v~~~~~~~ 77 (389) |++ |+||||++|+.++++++..++|++++|+|+++.+ |.|+|+++++..++....+ .+++|..++..+|+++ T Consensus 1 M~~~~~pGVyv~E~~~~~~~i~~v~T~v~~~VG~a~~g-----p~n~pv~its~~d~~~~g~~~~~~tL~~Av~~~f~nG 75 (477) T protein:vir:10 1 MAANYLHGVETIEKETGSRPVKVVKSAVIGLIGTAPIG-----PVNTPVQSLSDVDAAQFGPQLAGFTIPQALDAVYDYG 75 (477) T ss_pred CcccCCCCeEEEEccCCcccccccCCceeEEEecccCC-----CCCcCEEEccHHHHHHhccCCCCCcHHHHHHHHHhcc Confidence 998 5799999999999999999999999999999754 7899999999998865333 4689999999999999 Q ss_pred CceEEEEEecccccccc--------------------------------------------------------------- Q lcl|NC_019932. 78 KPVTVVVRVAEGATPAE--------------------------------------------------------------- 94 (389) Q Consensus 78 ~~~~~v~~~~~~~~~~~--------------------------------------------------------------- 94 (389) +..++++++........ 
T Consensus 76 g~~~~vVrV~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 155 (477) T protein:vir:10 76 SGTVIVINVLDPAVHKSNAANEPVTFDAATGRAKLAHPAAANLVLKNDSGGTTYAEGTDYAVDLINGVITRIKTGTIPPG 155 (477) T ss_pred ceEEEEEecCccccccccccccccccccccceecccccccccccccccccccccccchhhhhhhccccceeccccccccc Confidence 99999888753321100 Q ss_pred ----------------ccccccccccccccchhhHHHHHHhhhhhhhhhhhccccccchH-HHHHHHHhhhhcCceeeec Q lcl|NC_019932. 95 ----------------TTSNIIGTTDENGRYTGMKALLSAQTQLGVKPRILGVPGLDALE-VSTALASIAQQLRAFAYVS 157 (389) Q Consensus 95 ----------------t~~~~~~~~d~~~~~tGl~a~~~~~~~~~~~p~~~~apg~~~~~-v~~al~~~~~~~~~~~i~d 157 (389) ...++.+..+.++..+|+++++.+++.++..|.++.+||+++.+ |.++|.++|+++++++++| T Consensus 156 ~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~tGl~al~~~~~~~~~~~~~l~apg~~~~~~v~~~l~~~~~~~~~~~~~d 235 (477) T protein:vir:10 156 ATAAKATYDYADPTKVTAADIIGAVNAAGMRTGMKALKDTYNLYGYFSKILIAPAYCTQNSVSVELEAMAVQLGAIAYID 235 (477) T ss_pred ceeeeeccccccccccccccccccccccchhhhhhhhhhhhhhcchhcccccccccccchhhHHHHHHHHhhCCEEEEEe Confidence 00112222234556789999999999999999999999998765 9999999999999999999 Q ss_pred cCCCccHHHHHHhhh-------cccCceeEEeeeeEEEEeecCCCceEEehhHHHHHHHHhhhccccceeccCCceeccc Q lcl|NC_019932. 158 AWGCKTLSEAMAYRE-------NFSQRELMVIWPDFISWNTTANQSETAYATARALGLRAKIDTDTGWHKTLSNVGVNGV 230 (389) Q Consensus 158 ~~~~~t~~~a~~~~~-------~~~s~~~~~~~p~~~~~~~~~~~~~~~p~s~~~Ag~~a~~d~~~g~~~span~~l~gv 230 (389) +|.+.+.+++.++++ +++|.+.+++|||++++|+.++..+++|||+++||++||+|.++|||+||+|++|.|| T Consensus 236 ~p~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~~ag~~a~~d~~~g~~~span~~~~gi 315 (477) T protein:vir:10 236 APIGTTLAQALAGRGPAGTINFNTSSDRVRLCYPHVKVYDTATNAERLEPLSSRAAGLRARVDLDKGYWWSSSNQQLVGV 315 (477) T ss_pred cCCCCCHHHHHhhhhhccccccccccceEEEEcCeEEEecccCCceeEEchHHHHHHHHHHhhhcCCceeccCCceeccc Confidence 999988888888876 4678999999999999999999999999999999999999999999999999999999 Q ss_pred eeccccccccccCCcchhhhhcccceEEEEc--CCCEEEEcCccCC---CCcccceeehhhHHHHHHHHHHHHHHHHhhc Q lcl|NC_019932. 
231 TGISASVFWDLQQTGTDADLLNEACVTTLIR--KDGFRFWGNRTCS---DDPLFAFENYTRTAQVIADTMAEAHMWANDK 305 (389) Q Consensus 231 ~~~~~~~~~~~~~~~~~~~~l~~~~i~~~~~--~~G~~~wG~rT~~---~d~~~~~i~vrR~~~~i~~~~~~~~~~~v~e 305 (389) .++++++.+.++++++|+++||++||+++++ ++|+++||+||++ .|+.|+|+++||++++|+++|++.+++++|| T Consensus 316 ~~~~~~~~~~~~~~~~~~~~L~~~gi~~i~~~~~~G~~~wG~rT~~~~~~~~~~~~~~vrR~~~~i~~~~~~~~~~~v~~ 395 (477) T protein:vir:10 316 TGVERPLSAMIDDPQSDVNMLNEQGITTVFSSYGSGLRLWGNRTAAWPTVTHMRNFENVRRTGDVINESLRYFSQQFVDA 395 (477) T ss_pred cccccccccccCCChhhHHHHhhCCceEEEEecCCcEEEEcccccCCCCCCcccceeehhhHHHHHHHHHHHHHHHhccC Confidence 9999999999999999999999999999965 5899999999994 4678999999999999999999999999999 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEecCCCCHHHhhCCEEEEEEEEEecccceEEEEEEEEcchHHHHHHH Q lcl|NC_019932. 306 PLTPVLVRDIIAGINAKFRELVSAGYLLGASCWYDDTANDKDTLKAGKLFIDYDYTPVPPLEDLTLRQRITDSYLANFAA 385 (389) Q Consensus 306 ~n~~~~~~~i~~~i~~~l~~l~~~gal~g~~v~~d~~~n~~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~~~~ 385 (389) ||++.+|++|+++++.||++||++|+|+||+|+||+++||+++|++|+|+++|+++|++|+|||+|++++++++|++|++ T Consensus 396 ~~~~~~~~~i~~~i~~~l~~l~~~g~l~g~~v~~~~~~nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~~~~ 475 (477) T protein:vir:10 396 PIDQGLIDSLVESVNGFGRKLIGDGALLGFKAWFDPARNPKEELAAGHLLINYKYTVPPPLERLTYETEITSEYLLTLKG 475 (477) T ss_pred CCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEecCCCCHHHhhCCeEEEEEEEEecCCcceEEEEEEEcchHHhhhhc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999998 Q ss_pred Hh Q lcl|NC_019932. 
386 SV 387 (389) Q Consensus 386 ~~ 387 (389) .- T Consensus 476 g~ 477 (477) T protein:vir:10 476 GN 477 (477) T ss_pred CC Confidence 87 No 14 >protein:vir:96740 Length: 388 # NCBI annotation: phage tail sheath protein FI-like # Family: family:all:115 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039839;genbank:gi:126010913;genbank:GeneID:5076250 Probab=100.00 E-value=9.2e-100 Score=563.52 Aligned_cols=373 Identities=25% Similarity=0.346 Sum_probs=343.2 Q ss_pred CCCCCCCEEEEECCCCCcccccccccceeeeecccccccccccccccEEEecchhhhhhhc---ccchhHHHHHhhhccc Q lcl|NC_019932. 1 MSDYHHGVRVVEINDGTRTISTVSTAIVGMVCTADDADAAAFPLNEPVLLTNVLSAIGKAG---KKGTLAAALQAIADQA 77 (389) Q Consensus 1 M~~~~~GV~v~~v~~~~~~~~~v~t~v~~~~g~a~~~~~~~~~~~~~vli~~~~~~~~~~~---~~gtl~~~v~~~~~~~ 77 (389) |++|||||||+|+.++++++.++++++++++|++++++.. +|.++++++.+..++...++ ..+++..++..+++++ T Consensus 4 ~~~~~hGv~v~ev~~g~~~i~~~~tavi~~Vgta~~ad~~-~p~~~~~~i~~~~d~~~~~~~~~~~gtl~~al~~~~~~~ 82 (388) T protein:vir:96 4 IDQFEHNGISIETHEPPPPMGPPGDNVVAWVVTAPDKHAD-VAFSVPFRVANTADAQYLDSTGNELGTGWHAASETLKKT 82 (388) T ss_pred CCCCCCceEEEEcCCCcccccccCcceeEEEEecCCCccc-cccccceeeecchhhhhhhccccccccchhhhHhhhccC Confidence 8899999999999999999999999999999999988864 78899999988877666543 4689999999999999 Q ss_pred CceEEEEEeccccccccccccccccccc-cccchhhHHHHHHhhhhhhhhhhhccccccch-HHHHHHHHhhhhcCceee Q lcl|NC_019932. 78 KPVTVVVRVAEGATPAETTSNIIGTTDE-NGRYTGMKALLSAQTQLGVKPRILGVPGLDAL-EVSTALASIAQQLRAFAY 155 (389) Q Consensus 78 ~~~~~v~~~~~~~~~~~t~~~~~~~~d~-~~~~tGl~a~~~~~~~~~~~p~~~~apg~~~~-~v~~al~~~~~~~~~~~i 155 (389) +..++++++..+.+...+..+++++.+. ++..+|++++++.+. .|+++++||+++. 
+|+++|.++|++++++++ T Consensus 83 ~~~~~vv~v~~g~~~~at~a~iig~~~~~tg~~~gl~al~~~~~----~p~il~aPg~s~~~~v~~al~~~~~~~~~~~i 158 (388) T protein:vir:96 83 SVPQYFIVVPEGADDAATMANIIGGIDPTTGRRTGIAALTECTE----RPTLIGAPGFSQNKAVIDALASMAKRLKCRAV 158 (388) T ss_pred CceEEEEEeccccccccccceeeeecccccchhhHHHHhhhccc----ceeEEEeeccccchHHHHHHHHHHhhcCcEEE Confidence 9999999999999988888988888775 677788888877544 6899999999875 699999999999999999 Q ss_pred eccCCCccHHHHHHhh-----hcccCceeEEeeeeEEEEeecCCCceEEehhHHHHHHHHhhhccccceeccCCceeccc Q lcl|NC_019932. 156 VSAWGCKTLSEAMAYR-----ENFSQRELMVIWPDFISWNTTANQSETAYATARALGLRAKIDTDTGWHKTLSNVGVNGV 230 (389) Q Consensus 156 ~d~~~~~t~~~a~~~~-----~~~~s~~~~~~~p~~~~~~~~~~~~~~~p~s~~~Ag~~a~~d~~~g~~~span~~l~gv 230 (389) +|+|.+.+ +++.+++ .+++|.+.++||||++++|+.++..+++|||+++||++|++| +|+||||+.+ ++ T Consensus 159 ~D~p~~~~-~~~~~~~~~~~~~~~~s~~~~~~~P~~~~~d~~~~~~~~~p~s~~~AG~~a~~D----~~~spaN~~i-~i 232 (388) T protein:vir:96 159 IDGPSGST-QDAIDLSGLLGGEGTGHDRVYMVDPMPAIYSRKAQGNIYVPPSTIAMGAVAAVK----PWESPGNQGV-LI 232 (388) T ss_pred EeccCCch-hHHHHHHhhhhccCcCcceEEEEeCceeeecccCCceeeechHHHHHHHHHhhc----CcccccCeeE-Ee Confidence 99987643 4444433 357899999999999999999999999999999999999999 5999999998 59 Q ss_pred eeccccccccccCCcchhhhhcccceEEEEc--CCCEEEEcCccCCCCcccceeehhhHHHHHHHHHHHHHHHHhhcCCC Q lcl|NC_019932. 
231 TGISASVFWDLQQTGTDADLLNEACVTTLIR--KDGFRFWGNRTCSDDPLFAFENYTRTAQVIADTMAEAHMWANDKPLT 308 (389) Q Consensus 231 ~~~~~~~~~~~~~~~~~~~~l~~~~i~~~~~--~~G~~~wG~rT~~~d~~~~~i~vrR~~~~i~~~~~~~~~~~v~e~n~ 308 (389) .|+++.+.+..++..+|+++||++||+++++ ++|+++||+||++ |+||++||+++||+++|++.++|++||||+ T Consensus 233 ~g~~~~~~~~~~~~~~~~~~Ln~~gI~~i~~~~~~G~~~wG~rT~~----~~~i~vrR~~~~i~~si~~~~~~~v~epn~ 308 (388) T protein:vir:96 233 QDVARVIDYNILDKSTEGDLLNRNGVSYFARTSMGGFSLIGNRTVT----GKFISFVGLEDAIARKLEAASQRAMSKQLT 308 (388) T ss_pred eeecccccccccCChhhHHhhhhcCceEEEEecCCcEEEEcccccC----CcceeehhhHHHHHHHHHHHHHHhccCCCC Confidence 9999999999999999999999999999965 6899999999986 999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHhCCceeeeEEEEecCCCCHHHhhCCEEEEEEEEEecccceEEEEEEEEcchHHHHHHHHhc Q lcl|NC_019932. 309 PVLVRDIIAGINAKFRELVSAGYLLGASCWYDDTANDKDTLKAGKLFIDYDYTPVPPLEDLTLRQRITDSYLANFAASVN 388 (389) Q Consensus 309 ~~~~~~i~~~i~~~l~~l~~~gal~g~~v~~d~~~n~~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~~~~~~~ 388 (389) +.+|++|+++++.||++||++|+|+||+++||+++||+++|++|+|+++|+++|++|+|||+|+++++++|+++||++|. T Consensus 309 ~~~~~~i~~~i~~fL~~l~~~Gal~g~~~~~d~~~nt~~~i~~G~~~~~i~~~p~~pae~I~~~~~~~~~~~~~~~~~~~ 388 (388) T protein:vir:96 309 KSFMEQEIKKINLFMQDLVAAEIIPGGEVYLHPTLNTVERYKNGSWYIVIDYGRYSPNEHMIFHLNAVDRIVEEFIEEVL 388 (388) T ss_pred HHHHHHHHHHHHHHHHHHHhCCceeeeEEEEecCCCCHHHhhCCEEEEEEEEEecCCcceEEEEEEEchHHHHHHHHHhC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 No 15 >protein:vir:79092 Length: 477 # NCBI annotation: gp13, phage tail sheath protein # Family: family:all:115 # MgeID: mge:1862 # MgeName: phiE255 # Cross-refs: genbank:acc:YP_001111213;genbank:gi:134288804;genbank:GeneID:4960766 Probab=100.00 E-value=2.7e-99 Score=560.95 Aligned_cols=382 Identities=33% Similarity=0.503 Sum_probs=344.4 Q ss_pred CCC-CCCCEEEEECCCCCcccccccccceeeeecccccccccccccccEEEecchhhhhhhc--ccchhHHHHHhhhccc Q lcl|NC_019932. 
1 MSD-YHHGVRVVEINDGTRTISTVSTAIVGMVCTADDADAAAFPLNEPVLLTNVLSAIGKAG--KKGTLAAALQAIADQA 77 (389) Q Consensus 1 M~~-~~~GV~v~~v~~~~~~~~~v~t~v~~~~g~a~~~~~~~~~~~~~vli~~~~~~~~~~~--~~gtl~~~v~~~~~~~ 77 (389) |++ |+|||||+|+.++++++.+++|++++|+|+++.+ |.|+|++++++.++...++ ..++|..++..+|.++ T Consensus 1 M~~~~~pGVyv~E~~~g~~~I~~v~Tsv~~~VG~a~~~-----p~n~pv~its~~d~~~~g~~~~~~tL~~Av~~~f~ng 75 (477) T protein:vir:79 1 MAANYLHGVETIEKETGSRPVKVVKSAVIGLIGTAPIG-----PVNTPVQSLSDVDAAQFGPQLAGFTIPQALDAVYDYG 75 (477) T ss_pred CcCCCCCCeEEEEecCCcccccccCCceEEEEeecccC-----CCcccEEEccHHHHHHhcCCCCCCcHHHHHHHHhhcC Confidence 997 5799999999999999999999999999999754 7899999999999886444 4689999999999999 Q ss_pred CceEEEEEeccccccccc-------------------------------------------------------------- Q lcl|NC_019932. 78 KPVTVVVRVAEGATPAET-------------------------------------------------------------- 95 (389) Q Consensus 78 ~~~~~v~~~~~~~~~~~t-------------------------------------------------------------- 95 (389) +..++++++.+....... T Consensus 76 g~~~~vvrV~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 155 (477) T protein:vir:79 76 SGTVIVINVLDPAVHKSNAASESVTFDAATGRAKLAHPAAANLVLKNDSGGTTYTEGTDYAVDLINGVITRIKTGTIPAA 155 (477) T ss_pred CceEEEEeccCCccccccccccccccccccccccccccccceeEEeecccccccccCccccccccchhhhhhhccccccc Confidence 999999887543311100 Q ss_pred -----------------cccccccccccccchhhHHHHHHhhhhhhhhhhhccccccchH-HHHHHHHhhhhcCceeeec Q lcl|NC_019932. 
96 -----------------TSNIIGTTDENGRYTGMKALLSAQTQLGVKPRILGVPGLDALE-VSTALASIAQQLRAFAYVS 157 (389) Q Consensus 96 -----------------~~~~~~~~d~~~~~tGl~a~~~~~~~~~~~p~~~~apg~~~~~-v~~al~~~~~~~~~~~i~d 157 (389) ..+..+..+..+..+|+++++.++...++.|.++.+||+++.+ +.++|.++|+++++++++| T Consensus 156 ~~~~~~~~~~~~~~~~~~~~~~g~~~a~~~~tg~~al~~~~~~~~~~~~iv~apg~~~~~~v~~~l~~~~~~~~~~a~~d 235 (477) T protein:vir:79 156 ATAAKATYDYADPTKVTAADIIGAVNAAGMRTGMKALKDTYNLYGYFSKILIAPAYCTQNSVSVELEAMAVQLGAIAYID 235 (477) T ss_pred cceeeceeccCCcccceeeeecccccccccchhhhhhhhhhhhcccccceeeccccccchhHHHHHHHHHhhcCeEEEEe Confidence 0011112223445688999999999999999999999997654 9999999999999999999 Q ss_pred cCCCccHHHHHHhhh-------cccCceeEEeeeeEEEEeecCCCceEEehhHHHHHHHHhhhccccceeccCCceeccc Q lcl|NC_019932. 158 AWGCKTLSEAMAYRE-------NFSQRELMVIWPDFISWNTTANQSETAYATARALGLRAKIDTDTGWHKTLSNVGVNGV 230 (389) Q Consensus 158 ~~~~~t~~~a~~~~~-------~~~s~~~~~~~p~~~~~~~~~~~~~~~p~s~~~Ag~~a~~d~~~g~~~span~~l~gv 230 (389) +|.+.+.+++.++++ +++|.+.+++|||++++++.++..+++|||+++||++||+|.++|||+||+|+++.|| T Consensus 236 ~p~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~~~~~~~~~~~p~s~~~ag~~a~~d~~~g~~~span~~~~gv 315 (477) T protein:vir:79 236 APIGTTLAQALAGRGPAGTINFNTSSDRVRLCYPHVKVYDIATNAERLEPLSSRAAGLRARVDLDKGYWWSSSNQQLVGV 315 (477) T ss_pred cCCCCChHHHhhhhhhccccccccccceEEEEcCeeEEecccCCceeeechHHHHHHHHHHhhccCCceEccCCceeecc Confidence 999988888888776 3679999999999999999999999999999999999999999999999999999999 Q ss_pred eeccccccccccCCcchhhhhcccceEEEEc--CCCEEEEcCccCC---CCcccceeehhhHHHHHHHHHHHHHHHHhhc Q lcl|NC_019932. 
231 TGISASVFWDLQQTGTDADLLNEACVTTLIR--KDGFRFWGNRTCS---DDPLFAFENYTRTAQVIADTMAEAHMWANDK 305 (389) Q Consensus 231 ~~~~~~~~~~~~~~~~~~~~l~~~~i~~~~~--~~G~~~wG~rT~~---~d~~~~~i~vrR~~~~i~~~~~~~~~~~v~e 305 (389) .++++++.+..+++++|+++||++||+++++ ++|+++||+||++ .++.|+||++||+|++|+++|++.++|++|| T Consensus 316 ~~~~~~~~~~~~~~~~~~~~L~~~~i~~i~~~~~~G~~~wG~rT~~~~~~~~~~~~i~vrR~~~~i~~~~~~~~~~~v~e 395 (477) T protein:vir:79 316 TGVERPLSAMIDDPQSDVNMLNEQGITTVFSSYGSGLRLWGNRTAAWPTVTHMRNFENVRRTGDVINESLRYFSQQFVDA 395 (477) T ss_pred eecccccccccCCChhhHHHHhhCCceEEEEecCCcEEEEcccccCCCCCCccceeeehhhHHHHHHHHHHHHHHHhccC Confidence 9999999999999999999999999999964 5899999999994 4678999999999999999999999999999 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEecCCCCHHHhhCCEEEEEEEEEecccceEEEEEEEEcchHHHHHHH Q lcl|NC_019932. 306 PLTPVLVRDIIAGINAKFRELVSAGYLLGASCWYDDTANDKDTLKAGKLFIDYDYTPVPPLEDLTLRQRITDSYLANFAA 385 (389) Q Consensus 306 ~n~~~~~~~i~~~i~~~l~~l~~~gal~g~~v~~d~~~n~~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~~~~ 385 (389) ||++.+|++|+++++.||++||++|+|+||+|+||+++||+++|++|+|+++|+++|++|+|||+|++++++++|++|++ T Consensus 396 ~~~~~~~~~i~~~i~~~l~~l~~~g~l~g~~v~~~~~~nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~~~~ 475 (477) T protein:vir:79 396 PIDQGLIDSLVESVNGFGRKLIGDGALLGFKAWFDPARNPKEELAAGHLLINYKYTVPPPLERLTYETEITSEYLLTLKG 475 (477) T ss_pred CCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEecCCCCHHHhhCCeEEEEEEEEecCCceeEEEEEEEechHHhhhcc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999988 Q ss_pred Hh Q lcl|NC_019932. 
386 SV 387 (389) Q Consensus 386 ~~ 387 (389) .- T Consensus 476 ~~ 477 (477) T protein:vir:79 476 GN 477 (477) T ss_pred CC Confidence 87 No 16 >protein:vir:80984 Length: 666 # NCBI annotation: gp18 tail sheath protein # Family: family:all:661 # MgeID: mge:1888 # MgeName: Phi1 # Cross-refs: genbank:acc:YP_001469499;genbank:gi:157311456;genbank:GeneID:5602117 Probab=100.00 E-value=2.4e-87 Score=495.45 Aligned_cols=375 Identities=15% Similarity=0.132 Sum_probs=303.6 Q ss_pred CCCCCCCEEEEECCCCCcccccccccceeeeecccccccccccccccEEEecchhhhhhhcc---cchhHHHHHhhhccc Q lcl|NC_019932. 1 MSDYHHGVRVVEINDGTRTISTVSTAIVGMVCTADDADAAAFPLNEPVLLTNVLSAIGKAGK---KGTLAAALQAIADQA 77 (389) Q Consensus 1 M~~~~~GV~v~~v~~~~~~~~~v~t~v~~~~g~a~~~~~~~~~~~~~vli~~~~~~~~~~~~---~gtl~~~v~~~~~~~ 77 (389) |+-..|||||+|+ ++++++..++|++.+|+|.+. ..|.++|++++++.+|...||. ...+.+.+...|.++ T Consensus 1 ~~~~~Pgvyv~e~-~~~~~~~~~~t~~~~~vg~~~-----~gp~~~p~~i~~~~~~~~~fg~~~~~~~~~~~~~~~f~~~ 74 (666) T protein:vir:80 1 MTLLSPGFETKET-TLSTTIVQSATGRAALVGKFQ-----WGPAFQIIQVTNEVELVNKFGQPDNNTADYFMSGANFLQY 74 (666) T ss_pred CceecCceEEEEe-cCCccccccCcccceEEeccc-----cCCCccceEecCHHHHHHhcCCccCccchHHHHHHHHhcC Confidence 8866799999999 688999999999999999985 4578999999999999999994 344567777788888 Q ss_pred CceEEEEEeccccccccc-------------------------------------------ccc---------------- Q lcl|NC_019932. 78 KPVTVVVRVAEGATPAET-------------------------------------------TSN---------------- 98 (389) Q Consensus 78 ~~~~~v~~~~~~~~~~~t-------------------------------------------~~~---------------- 98 (389) +..++++|.......... ... 
T Consensus 75 g~~~~v~R~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~i~~~~~~~~~~~~ta~~~~~a 154 (666) T protein:vir:80 75 GNDLRVVRVLNKEKAKNATALAGNIEFEITNEGSNYEVGDTIKIKHNRQDIETAGKVTKVDGDGKVKGVFIPTGKIIAHA 154 (666) T ss_pred CCeEEEEEecCccccccccccccceeEEEeeccccccccccccccccCcccccCcceEEEeecceeeeeecchhhhcccc Confidence 888887776422110000 000 Q ss_pred ----------------c------------c-cccc-cc--------------------------------------c--- Q lcl|NC_019932. 99 ----------------I------------I-GTTD-EN--------------------------------------G--- 107 (389) Q Consensus 99 ----------------~------------~-~~~d-~~--------------------------------------~--- 107 (389) + . +... .. + T Consensus 155 ~~~~~~~~v~~~~~~~~~~~~~~~~~a~~V~~~~~~~~~~~~~~~~a~~~~t~~~~~~~~~~~~~~a~~a~~~g~~g~~l 234 (666) T protein:vir:80 155 KAIGVYPELDGDWTAEFTSSSGNGSAALSVTKIVTDSGLLLTDLETSRANITNQTFLTKLQKYDMPAVSAIYAGEIGNSL 234 (666) T ss_pred ccccccceeeccceeeeccccccceeeeeeeeeecCCccceeeeccccccccccccccccccccchhhhhhcccccccce Confidence 0 0 0000 00 0 Q ss_pred -------------------------------------------------------------------------------- Q lcl|NC_019932. 108 -------------------------------------------------------------------------------- 107 (389) Q Consensus 108 -------------------------------------------------------------------------------- 107 (389) T Consensus 235 ~v~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~e~~~~~~~~~~~~~~~~~~~~~~~~~ 314 (666) T protein:vir:80 235 EVEILARSAFKNTAPDLTMYPYGGERTAARNLIPYAPQNDNQYAFIVRRDGVVVESYVLSTLKGDKDVYGNSIYMDDFFG 314 (666) T ss_pred eeeeccccccccccccceeeeccccccccceeeeeccccccceeeEeccCCccceeeecccccccccccchhhhhhhhhc Confidence Q ss_pred ------------------------------------------------cchhhHHHHHHhhhhhhhhhhhcccccc---- Q lcl|NC_019932. 
108 ------------------------------------------------RYTGMKALLSAQTQLGVKPRILGVPGLD---- 135 (389) Q Consensus 108 ------------------------------------------------~~tGl~a~~~~~~~~~~~p~~~~apg~~---- 135 (389) ..+|++++++ ...+.++++|+++ T Consensus 315 ~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~-----~~~~~~l~~p~~~~~~~ 389 (666) T protein:vir:80 315 RGSSQYIYATAQGWVDGFSGIISLAGGVSANEATTGGVGADPFIGAMMQGWGLFAERE-----SIHVNLLIAGACAGEGD 389 (666) T ss_pred cccceeeeecccccccccceEEEecCCCCcccccccccccccccccchhhhhhhhhhc-----ccccceEeecCcCCccc Confidence 0000000000 0012344555554 Q ss_pred -chHHHHHHHHhhhhcC-cee--------eeccCCCccHHHHHHhhhc----------ccCceeEEeeeeEEEEeecCCC Q lcl|NC_019932. 136 -ALEVSTALASIAQQLR-AFA--------YVSAWGCKTLSEAMAYREN----------FSQRELMVIWPDFISWNTTANQ 195 (389) Q Consensus 136 -~~~v~~al~~~~~~~~-~~~--------i~d~~~~~t~~~a~~~~~~----------~~s~~~~~~~p~~~~~~~~~~~ 195 (389) ..+++.++.++|++++ +++ ++|.++..+.+++++|++. ++|.|.++||||++++|+.+++ T Consensus 390 ~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~vd~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~l~~p~~~~~d~~~~~ 469 (666) T protein:vir:80 390 AFSTVQKHAVSIGDERQDCLVMVSPPRSTVVNIPVTTAIDNLIAWREGSGNYNENNMNINTTYAVIDGNYKYQYDKYNDV 469 (666) T ss_pred chHHHHHHHHHHHHhhcceEEEeecceeEEeecCCCCCHHHHHHHHHhcccchhhhcccCcceEEEEcCceEEecccCCc Confidence 3468889999999875 444 4455667888999999864 7799999999999999999999 Q ss_pred ceEEehhHHHHHHHHhhhccccceeccCCceeccceeccccccccccCCcchhhhhcccceEEEE--cCCCEEEEcCccC Q lcl|NC_019932. 196 SETAYATARALGLRAKIDTDTGWHKTLSNVGVNGVTGISASVFWDLQQTGTDADLLNEACVTTLI--RKDGFRFWGNRTC 273 (389) Q Consensus 196 ~~~~p~s~~~Ag~~a~~d~~~g~~~span~~l~gv~~~~~~~~~~~~~~~~~~~~l~~~~i~~~~--~~~G~~~wG~rT~ 273 (389) .+++|||+++||++||+|.++|||+||+|+++.++.+. 
+..+..+++.|++.||++|||+++ +++|+++||+||+ T Consensus 470 ~~~~p~sg~~AGl~Ar~D~~~g~~~sPan~~~~~i~g~---~~~~~~~~~~~~~~Ln~~gIn~i~~~~g~G~~~wG~rT~ 546 (666) T protein:vir:80 470 NRWVPLAADIAGLCARTDAVSQPWMSPAGYNRGQIMNV---VKLAIEPRKAHRDRLYQAAINPVIGAGGEGFILMGDKTA 546 (666) T ss_pred eeEechHHHHHHHHHHHhhcCCceEccCCeecceeecc---ccceeecChhHHHhhhhCCeeEEEEeCCCeEEEEccccC Confidence 99999999999999999999999999999998777775 334566778999999999999985 5679999999999 Q ss_pred CCCc-ccceeehhhHHHHHHHHHHHHHHHHhhcCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEecCCCCHHHhhCC Q lcl|NC_019932. 274 SDDP-LFAFENYTRTAQVIADTMAEAHMWANDKPLTPVLVRDIIAGINAKFRELVSAGYLLGASCWYDDTANDKDTLKAG 352 (389) Q Consensus 274 ~~d~-~~~~i~vrR~~~~i~~~~~~~~~~~v~e~n~~~~~~~i~~~i~~~l~~l~~~gal~g~~v~~d~~~n~~~~i~~G 352 (389) ++++ +|+||+|||||+||+++|++.++|+||||||+.+|++|+++++.||++||++|+|.||+|+||+++||+++|++| T Consensus 547 ~~~~s~~~~i~vRRl~~~i~~si~~~~~~~v~epn~~~l~~~i~~~i~~~L~~l~~~gal~g~~V~~d~~~nt~~di~~G 626 (666) T protein:vir:80 547 TTVPSPFDRINVRRLFNMLKKNIGDSSKYKLFENNDNFTRASFRMEVSQYLSTIRSLGGIYDFRVQCDTTNNTPDVIDRN 626 (666) T ss_pred CCCCcccceeehhhHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhcCceeeeEEEEcCCCCCHHHhhCC Confidence 8765 899999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEEEEEEecccceEEEEEEEEcch--HHHHHHHHhcC Q lcl|NC_019932. 353 KLFIDYDYTPVPPLEDLTLRQRITDS--YLANFAASVNS 389 (389) Q Consensus 353 ~~~~~i~~~p~~p~e~i~~~~~~~~~--~~~~~~~~~~~ 389 (389) +|+++|+++|++|+|||+|++++.+. 
.|+++.++|++ T Consensus 627 ~~~~~i~~~P~~Pae~I~~~~~~~~~~~~~~e~~~~~~~ 665 (666) T protein:vir:80 627 EFVASMFIKPAKSINYIMLNFTAVATGSDFDEIIGPVNQ 665 (666) T ss_pred eEEEEEEEEecCCcceEEEEEEEeecCccHHHHHHHHhc Confidence 99999999999999999999998766 69999999999 No 17 >protein:vir:98263 Length: 664 # NCBI annotation: gp18 tail sheath monomer # Family: family:all:661 # MgeID: mge:1667 # MgeName: RB43 # Cross-refs: genbank:acc:YP_239196;genbank:gi:66391671;genbank:GeneID:3416365 Probab=100.00 E-value=5.7e-87 Score=493.42 Aligned_cols=375 Identities=13% Similarity=0.085 Sum_probs=308.0 Q ss_pred CCCCCCCEEEEECCCCCcccccccccceeeeecccccccccccccccEEEecchhhhhhhcc---cchhHHHHHhhhccc Q lcl|NC_019932. 1 MSDYHHGVRVVEINDGTRTISTVSTAIVGMVCTADDADAAAFPLNEPVLLTNVLSAIGKAGK---KGTLAAALQAIADQA 77 (389) Q Consensus 1 M~~~~~GV~v~~v~~~~~~~~~v~t~v~~~~g~a~~~~~~~~~~~~~vli~~~~~~~~~~~~---~gtl~~~v~~~~~~~ 77 (389) |+.-.|||||+|+ ++++++..+++++.+|+|.+. ..|.++|++++++.++...||. ...+.+.+..+|.++ T Consensus 1 ma~~~PgVyv~E~-~~~~~i~~~~ts~~~~vG~~~-----~Gp~~~p~~i~s~~d~~~~fG~~~~~~~~~~~v~~~f~ng 74 (664) T protein:vir:98 1 MALQSPGIETKET-SVQSTVVRNSTGRAAIVGKFS-----WGPAYQIRQISNEVELVNYFGAPDNLTADYFMSAVNFLQY 74 (664) T ss_pred CceecCceEEEec-CCCcccccccccceEEEeecc-----CCCCCccEEecCHHHHHHhcCCccccchhHHHHHHHHHhc Confidence 9933599999999 689999999999999999985 4578999999999999999983 456788888888888 Q ss_pred CceEEEEEeccccccc----------------------------------------c----ccc---------------- Q lcl|NC_019932. 78 KPVTVVVRVAEGATPA----------------------------------------E----TTS---------------- 97 (389) Q Consensus 78 ~~~~~v~~~~~~~~~~----------------------------------------~----t~~---------------- 97 (389) +..++++|........ . ... 
T Consensus 75 g~~~~vvRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~gn~~~v~i~~~~~~~~~ 154 (664) T protein:vir:98 75 GNDLRLVRVVDKDAAKNASAIFNQIKTTIASQGSNYNVGDVIKVKYSNTVVEENGKVTQVDSNGKILAVTIPKRKKSLLV 154 (664) T ss_pred CCeEEEEEecCccccccccccccccceeeccCCcccccccceeEeecCcccccceeeccCCCCCceeeEeeccCccceee Confidence 8888888863211000 0 000 Q ss_pred --------------------------------cc----c----------------------------------------- Q lcl|NC_019932. 98 --------------------------------NI----I----------------------------------------- 100 (389) Q Consensus 98 --------------------------------~~----~----------------------------------------- 100 (389) .. . T Consensus 155 ~~~~~~~~~~~~~~~~s~~~~s~g~a~a~~v~~v~~d~~~~~~~~~~a~~~i~~~~~~~~~~~~~~~~~~a~~~G~~Gn~ 234 (664) T protein:vir:98 155 LNRSVLTQIFLLVGTTEIVSQSSGVSASITIDGIESDSGITLLNLDIAKETIQGTSFQTLTQKYQIPSVVALYPGELGST 234 (664) T ss_pred cccccccccceecccceeeeeecccceeeecccccccceeeccccceeeeccccccceeeeeccccceeeeeecccccce Confidence 00 0 Q ss_pred ----------------------c--------------------------------------------------------- Q lcl|NC_019932. 101 ----------------------G--------------------------------------------------------- 101 (389) Q Consensus 101 ----------------------~--------------------------------------------------------- 101 (389) + T Consensus 235 isv~i~s~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~ 314 (664) T protein:vir:98 235 VQVEIISKAAYDTGAMISGYPSGISVKNSGRSVMTYGPQTDNQYAFVVRRGGIVQESFIVSTDKTDKDIYGVNIYMDDFF 314 (664) T ss_pred eeeeecccccccCcceEeeccCceecccceeeeeeccccCccceeEEEecCCceeeeEEeecccCcccceeeeeechhhe Confidence 0 Q ss_pred ---------------------------ccc------ccccchhhHHHHHHhhhhhhhhhhhccccccch------HHHHH Q lcl|NC_019932. 102 ---------------------------TTD------ENGRYTGMKALLSAQTQLGVKPRILGVPGLDAL------EVSTA 142 (389) Q Consensus 102 ---------------------------~~d------~~~~~tGl~a~~~~~~~~~~~p~~~~apg~~~~------~v~~a 142 (389) +.+ ....++|++++++ ...+.|+++++|++++. 
+++.+ T Consensus 315 ~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~~~~~~g~~~~~tgl~~l~~---~~~~~~~ll~~p~~~~~~~~~~~~v~~a 391 (664) T protein:vir:98 315 ANGGSQYVFGTSMNWPKGFSGILEFGGGLSSNDTVGADELMTGWDMFAD---REALHVPLLIAGGCAGESVEIASTVQKH 391 (664) T ss_pred ecccceeeeeecccCCcccceeEeccCccccccccCchhHHHHHHhhhc---ccccccceEEecCCCCCcHHHHHHHHHH Confidence 000 0000122222222 22345788889998754 58899 Q ss_pred HHHhhhhcC-ceeeeccC--------CCccHHHHHHhhh--------------cccCceeEEeeeeEEEEeecCCCceEE Q lcl|NC_019932. 143 LASIAQQLR-AFAYVSAW--------GCKTLSEAMAYRE--------------NFSQRELMVIWPDFISWNTTANQSETA 199 (389) Q Consensus 143 l~~~~~~~~-~~~i~d~~--------~~~t~~~a~~~~~--------------~~~s~~~~~~~p~~~~~~~~~~~~~~~ 199 (389) |.++|++++ +++++|.| +..+.+++++|++ +++|.++++||||++++|+.++..+++ T Consensus 392 l~~~a~~~~~~~a~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~l~~p~~~~~d~~~~~~~~~ 471 (664) T protein:vir:98 392 VISIGDERQDCTVFVSPPRSLLVNIPLATAVDNIVEWRTGYKISGGTPVDNNLNVSSSYGFLDGNYKYQYDKYNDVNRWV 471 (664) T ss_pred HHHHHHhcCCeEEEEccccceeccCCccccHHHHHHHhhhccccccchhhhhcCCccceEEEEcCeEEEecccCCceEEe Confidence 999999985 77888765 3567778888776 478999999999999999999999999 Q ss_pred ehhHHHHHHHHhhhccccceeccCCceeccceeccccccccccCCcchhhhhcccceEEEE--cC-CCEEEEcCccCCCC Q lcl|NC_019932. 200 YATARALGLRAKIDTDTGWHKTLSNVGVNGVTGISASVFWDLQQTGTDADLLNEACVTTLI--RK-DGFRFWGNRTCSDD 276 (389) Q Consensus 200 p~s~~~Ag~~a~~d~~~g~~~span~~l~gv~~~~~~~~~~~~~~~~~~~~l~~~~i~~~~--~~-~G~~~wG~rT~~~d 276 (389) |||+++||++||+|.++|||+||+|+++.++.+.. .....+++.|++.||++|||++. 
++ +||++||+||++++ T Consensus 472 p~sg~~AGl~A~~D~~~g~~~span~~~~~i~g~~---~~~~~~~~~~~~~Ln~~gIn~i~~~~~~~G~~~wG~rT~~~~ 548 (664) T protein:vir:98 472 PLAGDIAGLCVYTDSVANPWMSPAGYNRGQIRNCI---KLAIEPRTAHRDAMYQVQINPVTGFAGGSGFVLYGDKTLTSV 548 (664) T ss_pred chHHHHHHHHHHhhhcCCcEECcCCceeeeeeccc---cceeecChhhHHHHHhCCCeEEEEeeCCCcEEEEcccccCCC Confidence 99999999999999999999999999988887753 34555677899999999999984 44 79999999999876 Q ss_pred c-ccceeehhhHHHHHHHHHHHHHHHHhhcCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEecCCCCHHHhhCCEEE Q lcl|NC_019932. 277 P-LFAFENYTRTAQVIADTMAEAHMWANDKPLTPVLVRDIIAGINAKFRELVSAGYLLGASCWYDDTANDKDTLKAGKLF 355 (389) Q Consensus 277 ~-~~~~i~vrR~~~~i~~~~~~~~~~~v~e~n~~~~~~~i~~~i~~~l~~l~~~gal~g~~v~~d~~~n~~~~i~~G~~~ 355 (389) + +|+||++||||+||+++|++.++|++||||++.+|++|+++|+.||++||++|+|+||+|+||+++||+++|++|+|+ T Consensus 549 ~s~~~~i~vrR~~~~i~~si~~~~~~~v~epn~~~l~~~i~~~i~~~L~~l~~~gal~g~~V~~d~~~nt~~~i~~G~~~ 628 (664) T protein:vir:98 549 PSPFDRINVRRLFNMIKKDIGDNAKYKLFENNDDFTRASFRMDTGQYMTNIRALGGCYDYRVICDTTNNTPDVIDRNEFV 628 (664) T ss_pred CcccceEeehhHHHHHHHHHHHHHHHhhcCCCCHHHHHHHHHHHHHHHHHHHhcCceeeeEEEEcCCCCCHHHhhCCeEE Confidence 5 899999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEEEecccceEEEEEEEEcchHHHHHHHHhcC Q lcl|NC_019932. 356 IDYDYTPVPPLEDLTLRQRITDSYLANFAASVNS 389 (389) Q Consensus 356 ~~i~~~p~~p~e~i~~~~~~~~~~~~~~~~~~~~ 389 (389) ++|+++|++|+|||+|++++.....+ |+++.. 
T Consensus 629 ~~i~~~p~~pae~I~~~~~q~~~~~~--~~e~~~ 660 (664) T protein:vir:98 629 ATVYVKPPRSINYITLNFVATSTGAD--FDELVG 660 (664) T ss_pred EEEEEEecCCcceEEEEEEEeecCcc--hhHhcc Confidence 99999999999999999999887644 666666 No 18 >protein:vir:6894 Length: 660 # NCBI annotation: gp18 tail sheath protein # Family: family:all:661 # MgeID: mge:140 # MgeName: RB69 # Cross-refs: genbank:acc:NP_861870;genbank:gi:32453661;genbank:GeneID:1494296 Probab=100.00 E-value=2.1e-86 Score=490.24 Aligned_cols=380 Identities=13% Similarity=0.122 Sum_probs=302.7 Q ss_pred CCCCCCCEEEEECCCCCcccccccccceeeeecccccccccccccccEEEecchhhhhhhcc---cchhHHHHHhhhccc Q lcl|NC_019932. 1 MSDYHHGVRVVEINDGTRTISTVSTAIVGMVCTADDADAAAFPLNEPVLLTNVLSAIGKAGK---KGTLAAALQAIADQA 77 (389) Q Consensus 1 M~~~~~GV~v~~v~~~~~~~~~v~t~v~~~~g~a~~~~~~~~~~~~~vli~~~~~~~~~~~~---~gtl~~~v~~~~~~~ 77 (389) |+--.|||||+|+ ++++++..++|++.+|+|.+. ..|+++|++++++.+|...||. ...+.+.+..+|.++ T Consensus 1 ~~~~~PgVyv~e~-~~~~~i~~v~ts~~~fvG~~~-----~Gp~~~p~~i~s~~~~~~~fG~~~~~~~~~~~~~~~f~~~ 74 (660) T protein:vir:68 1 MALLSPGVELKET-TVQSTVVNNSTGTAALAGKFQ-----WGPAFQIKQITDEVALVDMFGTPNTDTADYFMSAMNFLQY 74 (660) T ss_pred CccccCceEEEEe-cCCcccccCCCcceeEEeccc-----CCCCccCEEecCHHHHHHhcCCccCccchhHHHHHHHHhC Confidence 8866799999999 699999999999999999985 4578999999999999999993 455778888889998 Q ss_pred CceEEEEEecccccccccc-----------------------------------cc------------------------ Q lcl|NC_019932. 78 KPVTVVVRVAEGATPAETT-----------------------------------SN------------------------ 98 (389) Q Consensus 78 ~~~~~v~~~~~~~~~~~t~-----------------------------------~~------------------------ 98 (389) +..++++|........... .. 
T Consensus 75 g~~~~vvRv~~~~~~~~~~~~~~~~~~t~~~~g~~~~~g~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~ta~~~~~a 154 (660) T protein:vir:68 75 GNDLRVVRAVDRDTAKNSSPVAGNINFTISSAGTNYRVGDKVVVKYSTDIIEPDGEVTSVDSDGKILNIFIPSGKIIAKA 154 (660) T ss_pred CCeEEEEEecccccccccccccccceeeeeccCcceeeeeeeeeecccccccccccceeeeecCceeeeeeccccccccc Confidence 8888888764221100000 00 Q ss_pred ------------ccccc------------------cc--------cccc----------------hhhHHH--------- Q lcl|NC_019932. 99 ------------IIGTT------------------DE--------NGRY----------------TGMKAL--------- 115 (389) Q Consensus 99 ------------~~~~~------------------d~--------~~~~----------------tGl~a~--------- 115 (389) ..... +. +... .++.+. T Consensus 155 ~~~~~~~~~~~~~~~~v~~~~~~~~~~~~v~~~~~d~~~~~~~~~ta~~~~~~~~~~~~~~~~~~~~~~A~~~g~~G~~i 234 (660) T protein:vir:68 155 KEIGEYPELGSNWTAEMSGSSSGLSAVITIDSVVMDSGILLTEVETSEEAITSLTFQESIKKYGVPGVVALYPGELGDQL 234 (660) T ss_pred eeeccccccccceeEEeecccccceeeeeeccccccccceeeeeccccccccccceeeeecccCccccccccccccccce Confidence 00000 00 0000 000000 Q ss_pred -----------------------------H------------------------------------------------HH Q lcl|NC_019932. 116 -----------------------------L------------------------------------------------SA 118 (389) Q Consensus 116 -----------------------------~------------------------------------------------~~ 118 (389) . .. T Consensus 235 ~v~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 314 (660) T protein:vir:68 235 EIEIVSKADYDKGASAQLKIYPDGGTRYSTAKAIFGYGPQTDDQYAIIVRRNDSVVQSVVLSTKRGERDIYGSNIFIDDF 314 (660) T ss_pred EEEEeccccccccccccceeeecccccccceeeEeecccccccceeeeeecCCcceeeeeeecccccccccccceeeehh Confidence 0 00 Q ss_pred h-h-----------------------------------------------hhhhhhhhhcccccc------chHHHHHHH Q lcl|NC_019932. 119 Q-T-----------------------------------------------QLGVKPRILGVPGLD------ALEVSTALA 144 (389) Q Consensus 119 ~-~-----------------------------------------------~~~~~p~~~~apg~~------~~~v~~al~ 144 (389) . . .....+.+++.++.. ..+++.+|. 
T Consensus 315 ~~~~~~~~v~~~~~~~~~~~~~~~~~~gg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~v~~~l~ 394 (660) T protein:vir:68 315 FAKGASNYIFATAQGWPKGFSGVIKLNGGLSSNETVEAGDLMEAWDLFADRESVNAQLFIAGSCAGESLEVASTVQKHVV 394 (660) T ss_pred hccCcccEEEEeecCCCccccceeeeccccccccccccchhhhHHHHhhhhhccccceeeccccCCCchHHHHHHHHHHH Confidence 0 0 000000001111111 135788899 Q ss_pred HhhhhcC-ceeeec--------cCCCccHHHHHHhhhc----------ccCceeEEeeeeEEEEeecCCCceEEehhHHH Q lcl|NC_019932. 145 SIAQQLR-AFAYVS--------AWGCKTLSEAMAYREN----------FSQRELMVIWPDFISWNTTANQSETAYATARA 205 (389) Q Consensus 145 ~~~~~~~-~~~i~d--------~~~~~t~~~a~~~~~~----------~~s~~~~~~~p~~~~~~~~~~~~~~~p~s~~~ 205 (389) ++|++++ +++++| .+.+.+.+++.+++.. ++|.++++||||++++|+.++..+++|||+++ T Consensus 395 ~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~sg~~ 474 (660) T protein:vir:68 395 AIGDSRQDCLVLCSPPRAAVVGIPVNRAVDNLVDWRTASGTYTDNNFNISSTYAAIDGNYKYQYDKYNDVNRWVPLAADI 474 (660) T ss_pred HHHHhhCCeEEEEcccceeEecCCCCCCHHHHHHHHhhcccccccccccCcceEEEEcCceEEecccCCceEEechhHHH Confidence 9999874 556555 4556788889988873 67999999999999999999999999999999 Q ss_pred HHHHHhhhccccceeccCCceeccceeccccccccccCCcchhhhhcccceEEE--EcCCCEEEEcCccCCCCc-cccee Q lcl|NC_019932. 206 LGLRAKIDTDTGWHKTLSNVGVNGVTGISASVFWDLQQTGTDADLLNEACVTTL--IRKDGFRFWGNRTCSDDP-LFAFE 282 (389) Q Consensus 206 Ag~~a~~d~~~g~~~span~~l~gv~~~~~~~~~~~~~~~~~~~~l~~~~i~~~--~~~~G~~~wG~rT~~~d~-~~~~i 282 (389) ||++||+|.++|||+||+|+++.+|.+. 
+.....+++.|++.||++|||++ ++++|+++||+||+++|+ .|+|| T Consensus 475 AGl~Ar~d~~~g~~~span~~~~~i~g~---~~~~~~~~~~~~~~Ln~~gIn~i~~~~~~G~~~wG~rT~~~~~s~~~~i 551 (660) T protein:vir:68 475 AGLCARTDNISQPWMSPAGYNRGQILNV---IKLAIETRQAQRDRLYQEAINPVTGTGGDGYVLYGDKTATSVPSPFDRI 551 (660) T ss_pred HHHHHHHhccCCcEEccCCeeeceeecc---ceeeecCChhHHHHHhhCCceEEEEecCCeEEEEcceecCCCCcccceE Confidence 9999999999999999999998888776 34556678899999999999998 557899999999998876 79999 Q ss_pred ehhhHHHHHHHHHHHHHHHHhhcCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEecCCCCHHHhhCCEEEEEEEEEe Q lcl|NC_019932. 283 NYTRTAQVIADTMAEAHMWANDKPLTPVLVRDIIAGINAKFRELVSAGYLLGASCWYDDTANDKDTLKAGKLFIDYDYTP 362 (389) Q Consensus 283 ~vrR~~~~i~~~~~~~~~~~v~e~n~~~~~~~i~~~i~~~l~~l~~~gal~g~~v~~d~~~n~~~~i~~G~~~~~i~~~p 362 (389) +|||||+||+++|++.++|++||||++.+|++|+++|+.||++||++|+|.||+|+||+++||+++|++|+|+++|+++| T Consensus 552 ~vrR~~~~i~~si~~~~~~~v~epn~~~~~~~i~~~i~~~L~~l~~~gal~gf~V~~d~~~nt~~~i~~G~~~~~i~~~p 631 (660) T protein:vir:68 552 NVRRLFNMVKTNIGSASKYRLFELNNAFTRSSFRTETSQYLQGIKALGGVYNFKVVCDTTNNTPAVIDRNEFVATFYLQP 631 (660) T ss_pred ehhhHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhcCceeeeEEEEecCCCCHHHhhCCeEEEEEEEEe Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccceEEEEEEEEcch--HHHHHHHHhcC Q lcl|NC_019932. 363 VPPLEDLTLRQRITDS--YLANFAASVNS 389 (389) Q Consensus 363 ~~p~e~i~~~~~~~~~--~~~~~~~~~~~ 389 (389) ++|+|||+|++++... 
+|++++++|.+ T Consensus 632 ~~pae~i~l~~~~~~~~~~~~e~~~~v~~ 660 (660) T protein:vir:68 632 ARSINYITLNFVATATGADFDELIGAVGG 660 (660) T ss_pred cCCcceEEEEEEEeecCccHHHHHHhhcC Confidence 9999999999988754 89999999999 No 19 >protein:vir:6594 Length: 666 # NCBI annotation: tail sheath protein # Family: family:all:661 # MgeID: mge:139 # MgeName: RB49 # Cross-refs: genbank:acc:NP_891725;genbank:gi:33620670;genbank:GeneID:1725344 Probab=100.00 E-value=4.9e-86 Score=488.30 Aligned_cols=377 Identities=15% Similarity=0.132 Sum_probs=305.2 Q ss_pred CCCCCCCEEEEECCCCCcccccccccceeeeecccccccccccccccEEEecchhhhhhhcc---cchhHHHHHhhhccc Q lcl|NC_019932. 1 MSDYHHGVRVVEINDGTRTISTVSTAIVGMVCTADDADAAAFPLNEPVLLTNVLSAIGKAGK---KGTLAAALQAIADQA 77 (389) Q Consensus 1 M~~~~~GV~v~~v~~~~~~~~~v~t~v~~~~g~a~~~~~~~~~~~~~vli~~~~~~~~~~~~---~gtl~~~v~~~~~~~ 77 (389) |+--.|||||+|+ ++++++..++|++.+|+|.+.. .|.++|++++++.+|...||. ...+.+.+..+|.++ T Consensus 1 ~~~~~PgVyv~e~-~~~~~i~~v~ts~~~fvG~~~~-----Gp~~~p~~v~s~~~~~~~fG~~~~~~~~~~~~~~~f~ng 74 (666) T protein:vir:65 1 MTLLSPGFETKET-TLSTTIVQSETGRAALVGKFQW-----GPAFQIIQVTNEVELVNKFGQPDNNTADYFMSGANFLQY 74 (666) T ss_pred CceecCceEEEEe-cCcccccccCcccceEEecccC-----CCCccCEEecCHHHHHHHcCCccccchhHHHHHHHHHhc Confidence 8866799999999 6888999999999999999854 478999999999999999984 344667777777777 Q ss_pred CceEEEEEeccccccc----------------------------------------ccc---c-cc-------------- Q lcl|NC_019932. 78 KPVTVVVRVAEGATPA----------------------------------------ETT---S-NI-------------- 99 (389) Q Consensus 78 ~~~~~v~~~~~~~~~~----------------------------------------~t~---~-~~-------------- 99 (389) +..++++|........ ... . .. 
T Consensus 75 g~~~~vvrv~~~~~~~~~~~~~~~~~~~~~~~g~~~~~g~~~~V~~~~~~~~~~~~~~~~~~~~~~~g~~~~t~~~~~~~ 154 (666) T protein:vir:65 75 GNDLRVVRVLNKEKAKNATALAGNVEFEITNEGSNYEVGDTIKIKHNRQDIETAGKVTKVDGDGKVKGVFIPTGKIIAHA 154 (666) T ss_pred CceEEEEEccCcccccccccccCceeeeEeeccccccccceEEEEecccccccccccccccccccccccccccceeeccc Confidence 7777766642211000 000 0 00 Q ss_pred ---------------------------------c--ccc-------------c-----------------------cc-- Q lcl|NC_019932. 100 ---------------------------------I--GTT-------------D-----------------------EN-- 106 (389) Q Consensus 100 ---------------------------------~--~~~-------------d-----------------------~~-- 106 (389) . +.. + .. T Consensus 155 ~~~g~~~~l~~~~~~~~~~~~~~~~~a~sv~~~~~~g~~~~~~~~~a~~~~~~~~~~~~~~~~~~~a~~A~~~g~~g~~i 234 (666) T protein:vir:65 155 KAIGVYPELDGGWTAEFTSSSGNGSAALSVTKIVTDSGLLLTDLETSRANITNQTFLTKLKKYDMPAVSAIYAGEIGNSL 234 (666) T ss_pred cccCcceeEeeccceeecccCcccccceeeeecccccceeeeeecccccccccccccccccccccceeeeeeccccccce Confidence 0 000 0 00 Q ss_pred -------------------------------------------------------------------------------- Q lcl|NC_019932. 107 -------------------------------------------------------------------------------- 106 (389) Q Consensus 107 -------------------------------------------------------------------------------- 106 (389) T Consensus 235 ~v~i~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~e~~~~~~~~~~~~~~~~~~~~~~~~~ 314 (666) T protein:vir:65 235 EVEILARSAFKNTAPDLTMYPYGGERTAARNLIPYAPQNDNQYAFIVRRDGVVVESYVLSTLKGDKDVYGNSIYMDDFFA 314 (666) T ss_pred eEEeecccccccccccccccccccccccceeeecccccccccceeeeecCCcccceeecccCcccccccchhhhhhhhhc Confidence Q ss_pred ---------------------------------------------ccchhhHHHHHHhhhhhhhhhhhccccccc----- Q lcl|NC_019932. 
107 ---------------------------------------------GRYTGMKALLSAQTQLGVKPRILGVPGLDA----- 136 (389) Q Consensus 107 ---------------------------------------------~~~tGl~a~~~~~~~~~~~p~~~~apg~~~----- 136 (389) ...+|+++++ ......++++++|++++ T Consensus 315 ~~~~~~v~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~g~~~~~~~~~~~~~~~~---~~~~~~~~~l~~p~~~~~~~~~ 391 (666) T protein:vir:65 315 RGSSQYIYATAQGWVDGFSGIISLAGGVSANEATTGGVGADPFIGAMMQGWDLFA---ERESIHVNLLIAGACAGEGDAF 391 (666) T ss_pred ccccceeeeecccccccccceEEccCCCCcCcccccccccccccccHHHHHHHHh---hhhhccCCceeecCcCCccchh Confidence 0000000000 00112355667777654 Q ss_pred hHHHHHHHHhhhhcC-ceeeecc--------CCCccHHHHHHhhhc----------ccCceeEEeeeeEEEEeecCCCce Q lcl|NC_019932. 137 LEVSTALASIAQQLR-AFAYVSA--------WGCKTLSEAMAYREN----------FSQRELMVIWPDFISWNTTANQSE 197 (389) Q Consensus 137 ~~v~~al~~~~~~~~-~~~i~d~--------~~~~t~~~a~~~~~~----------~~s~~~~~~~p~~~~~~~~~~~~~ 197 (389) .+++.+|.++|++++ +++++|. ++..+.+++.++++. ++|.|.++||||++++|+.++..+ T Consensus 392 ~~v~~~l~~~~~~~~~~~a~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~ 471 (666) T protein:vir:65 392 STVQKHAVSIGDERQDCLVMVSPPRSTVVNIPVTTAIDNLIAWREGSGNYNENNMNINTTYAVIDGNYKYQYDKYNDVNR 471 (666) T ss_pred HHHHHHHHHHHhhccceEEEeccccceeeecCCCCCHHHHHHHHHhcccccccccccCcceEEEEcCceEEecccCCcee Confidence 468899999999985 5555554 456788899998874 678999999999999999999999 Q ss_pred EEehhHHHHHHHHhhhccccceeccCCceeccceeccccccccccCCcchhhhhcccceEEEEc--CCCEEEEcCccCCC Q lcl|NC_019932. 198 TAYATARALGLRAKIDTDTGWHKTLSNVGVNGVTGISASVFWDLQQTGTDADLLNEACVTTLIR--KDGFRFWGNRTCSD 275 (389) Q Consensus 198 ~~p~s~~~Ag~~a~~d~~~g~~~span~~l~gv~~~~~~~~~~~~~~~~~~~~l~~~~i~~~~~--~~G~~~wG~rT~~~ 275 (389) ++|||+++||++||+|.++|||+||+|+++.+|.+.. 
..+..+++.|++.||++|||++++ ++|+++||+||+++ T Consensus 472 ~~p~sg~vAGl~Ar~D~~~g~~~span~~~~~i~g~~---~~~~~~~~~~~~~Ln~~gIn~i~~~~~~G~~~wG~rT~~~ 548 (666) T protein:vir:65 472 WVPLAADIAGLCARTDAVSQPWMSPAGYNRGQIMNVV---KLAIEPRKAHRDRLYQAAINPVIGAGGEGFILMGDKTATT 548 (666) T ss_pred EechHHHHHHHHHHHhccCCcEEccCCeecceeeccc---cceeecChhHHHhhhhCCceEEEEeCCCeEEEEecccCCC Confidence 9999999999999999999999999999988777763 345556788999999999999964 67999999999987 Q ss_pred Cc-ccceeehhhHHHHHHHHHHHHHHHHhhcCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEecCCCCHHHhhCCEE Q lcl|NC_019932. 276 DP-LFAFENYTRTAQVIADTMAEAHMWANDKPLTPVLVRDIIAGINAKFRELVSAGYLLGASCWYDDTANDKDTLKAGKL 354 (389) Q Consensus 276 d~-~~~~i~vrR~~~~i~~~~~~~~~~~v~e~n~~~~~~~i~~~i~~~l~~l~~~gal~g~~v~~d~~~n~~~~i~~G~~ 354 (389) ++ +|+||+|||||+||+++|++.++|++||||++.+|++|+++|+.||++||++|+|.||+|+||+++||+++|++|+| T Consensus 549 ~~s~~~~i~vrR~~~~i~~si~~~~~~~v~epn~~~l~~~i~~~i~~~L~~l~~~gal~g~~V~~d~~~nt~~~i~~G~~ 628 (666) T protein:vir:65 549 VPSPFDRINVRRLFNMLKKNIGDSSKYKLFENNDNFTRASFRMEVSQYLSTIRSLGGIYDFRVQCDTTNNTPDVIDRNEF 628 (666) T ss_pred CCcccceEehhhHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEcCCCCCHHHhhCCeE Confidence 64 89999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEEEEecccceEEEEEEEEcch--HHHHHHHHhcC Q lcl|NC_019932. 355 FIDYDYTPVPPLEDLTLRQRITDS--YLANFAASVNS 389 (389) Q Consensus 355 ~~~i~~~p~~p~e~i~~~~~~~~~--~~~~~~~~~~~ 389 (389) +++|+++|++|+|||+|++++.+. .|++++++++. 
T Consensus 629 ~~~i~~~p~~pae~i~~~~~~~~~~~~~~e~~~~~~~ 665 (666) T protein:vir:65 629 VASMFIKPAKSINYIMLNFTAVATGSDFDEIIGPANQ 665 (666) T ss_pred EEEEEEEecCCcceEEEEEEEeecCccHHHHHHHHhc Confidence 999999999999999999988766 69999999999 No 20 >protein:vir:106427 Length: 679 # NCBI annotation: gp18 tail sheath protein # Family: family:all:661 # MgeID: mge:1474 # MgeName: Aeh1 # Cross-refs: genbank:acc:NP_944106;genbank:gi:38640150;genbank:GeneID:2658188 Probab=100.00 E-value=3.3e-86 Score=489.22 Aligned_cols=380 Identities=14% Similarity=0.136 Sum_probs=306.5 Q ss_pred CCCCCCCEEEEECCCCCcccccccccceeeeecccccccccccccccEEEecchhhhhhhcc---cchhHHHHHhhhccc Q lcl|NC_019932. 1 MSDYHHGVRVVEINDGTRTISTVSTAIVGMVCTADDADAAAFPLNEPVLLTNVLSAIGKAGK---KGTLAAALQAIADQA 77 (389) Q Consensus 1 M~~~~~GV~v~~v~~~~~~~~~v~t~v~~~~g~a~~~~~~~~~~~~~vli~~~~~~~~~~~~---~gtl~~~v~~~~~~~ 77 (389) |+-..|||||+|+ ++++++..++|++.+|+|.+. ..|+++|++++++.+|...||. ...+...+..+|.++ T Consensus 1 ~~~~~Pgvyv~e~-~~~~~i~~~~t~~~~~vg~~~-----~gp~~~p~~i~s~~~~~~~fg~~~~~~~~~~~~~~~f~~g 74 (679) T protein:vir:10 1 MTLLSPGVETKEI-NLQTTIARSSTGRAALVGKFN-----WGPAYQISQVVSEVDLVDKFGRPDDQTADSFFSGVNFLNY 74 (679) T ss_pred CceecCceEEEee-cCCcccccCccccceeeeccc-----CCCCccCEEecCHHHHHHHcCCcccccchHHHHHHHHHhC Confidence 8866799999999 699999999999999999985 4588999999999999999984 456788888888888 Q ss_pred CceEEEEEecccccccc---------------------------------------cc---------------------- Q lcl|NC_019932. 78 KPVTVVVRVAEGATPAE---------------------------------------TT---------------------- 96 (389) Q Consensus 78 ~~~~~v~~~~~~~~~~~---------------------------------------t~---------------------- 96 (389) +..++++|+........ +. 
T Consensus 75 g~~~~vvrv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~s~~~~~~~~~~~~~~~~~~~~v~~~~~~~~a~ 154 (679) T protein:vir:10 75 GNDLRLVRVLNETKSRNSSALYQSLSYTITSPGVDYKVGDVVNVLQGGNVIATGKVTVVNASGGIVAFYVPTAAIIDKAK 154 (679) T ss_pred CCeEEEEEccCcccccccccccccccccccccccccccccceeeeeCCCcccceeEEEeeccCceeeeeecccccccccc Confidence 88888887532211000 00 Q ss_pred -------------------------------ccccccc-------c-c------------------------------cc Q lcl|NC_019932. 97 -------------------------------SNIIGTT-------D-E------------------------------NG 107 (389) Q Consensus 97 -------------------------------~~~~~~~-------d-~------------------------------~~ 107 (389) ....... + . .+ T Consensus 155 ~~~~~~~l~~a~~~~~~~~t~~~g~~~~~~v~~v~~~~~~~~~~~~~a~~~i~~~~~~~~t~~~~~~~~~~~~~~A~~~g 234 (679) T protein:vir:10 155 SLNDYPALDNAWQIQFAAGGPGAGQAATATVVGINLDSTIFVPNDEYAMSAISERSETKRTFIDICEEMKVPAIVARYAG 234 (679) T ss_pred cccccceecccceeeeeeccccccceeeeeeeeeccCCceeeccccccccccccccccchhhhhhhhccccceeeeeccc Confidence 0000000 0 0 00 Q ss_pred -----------------------c------------------------------------------chh----------- Q lcl|NC_019932. 108 -----------------------R------------------------------------------YTG----------- 111 (389) Q Consensus 108 -----------------------~------------------------------------------~tG----------- 111 (389) . ..| T Consensus 235 ~~gn~i~v~~va~~~~~~~~~~~a~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vvv~~~g~~~~~~~~~~~ 314 (679) T protein:vir:10 235 TYGDNIKVLMIAYKDYYKFNEAGKIVSVNTINPKVFPTGLDYGNVTPSSYLEFGPQNESQFAFIVFNNGVAVESKILSTK 314 (679) T ss_pred ccCCcceEEEEeecccccccccccccccccccccccccccccccceeeeecccccccccceeeEEecccccccceeeecc Confidence 0 000 Q ss_pred ------------hH-HHHH----------------------------------------Hhhh----hhhhhhhhccccc Q lcl|NC_019932. 112 ------------MK-ALLS----------------------------------------AQTQ----LGVKPRILGVPGL 134 (389) Q Consensus 112 ------------l~-a~~~----------------------------------------~~~~----~~~~p~~~~apg~ 134 (389) +. .+.+ .... 
....++++++|++ T Consensus 315 ~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~gg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~p~~ 394 (679) T protein:vir:10 315 PGDRDIYGTSIYINEYFGNGYSSFVQGVAESWPVGYTGVLAFGGGQSSNTDISAAEFMKGWDMFADREHTDVNLFIAGAV 394 (679) T ss_pred cccccccchhhhhhhhhcCcccceeeeccccccccccceeeccCCccCCCccchhhhhhhhhhhhcccccccceEEecCC Confidence 00 0000 0000 0012345667776 Q ss_pred cc------hHHHHHHHHhhhhcC-ceeeeccCCC--------ccHHHHHHhhh-------------cccCceeEEeeeeE Q lcl|NC_019932. 135 DA------LEVSTALASIAQQLR-AFAYVSAWGC--------KTLSEAMAYRE-------------NFSQRELMVIWPDF 186 (389) Q Consensus 135 ~~------~~v~~al~~~~~~~~-~~~i~d~~~~--------~t~~~a~~~~~-------------~~~s~~~~~~~p~~ 186 (389) +. .+|+.+|..+|++++ +++++|.|.. .+.+++..++. +++|.|+++||||+ T Consensus 395 ~~~~~~~~~~v~~~l~~~~~~~~~~~ai~d~p~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~s~~~~~~~p~~ 474 (679) T protein:vir:10 395 AGEGAQIASTVQKAVVAIADERRDCLVLISPPREYMINQPAASVVRKLVDWRRGVNQAGISLDDNMNIGTTYASVDGNYK 474 (679) T ss_pred CCCchhhhHHHHHHHHHHHHhhCCeEEEEeccccccccccccchHHHHHHHHhhcccccchhhhhhccCcceEEEEccce Confidence 53 468899999999985 8899998754 34466777765 46799999999999 Q ss_pred EEEeecCCCceEEehhHHHHHHHHhhhccccceeccCCceeccceeccccccccccCCcchhhhhcccceEEEE--cCCC Q lcl|NC_019932. 187 ISWNTTANQSETAYATARALGLRAKIDTDTGWHKTLSNVGVNGVTGISASVFWDLQQTGTDADLLNEACVTTLI--RKDG 264 (389) Q Consensus 187 ~~~~~~~~~~~~~p~s~~~Ag~~a~~d~~~g~~~span~~l~gv~~~~~~~~~~~~~~~~~~~~l~~~~i~~~~--~~~G 264 (389) +++|+.++..+++|||+++||++||+|.++|||+||+|+.+.+|.+.. 
.....+++.|++.||++|||+++ +++| T Consensus 475 ~~~d~~~~~~~~~p~sg~vAGl~Ar~D~~~g~~~sPan~~~~~i~g~~---~~~~~~~~~~~~~Ln~~gin~i~~~~g~G 551 (679) T protein:vir:10 475 YQYDKYNDVNRWIPLAADIAGLCARTDTVGQPWQSPAGFNRGQIVNVI---KLAVDTRQAHRDEMYTNGINPIVGFAGQG 551 (679) T ss_pred eeecccCCceEEechHHHHHHHHHHhhccCCcEECcCCeeeccccccc---cceeecChhhHHhhhhCCceEEEEecCCe Confidence 999999999999999999999999999999999999999988887763 23455678899999999999985 5689 Q ss_pred EEEEcCccCCCCc-ccceeehhhHHHHHHHHHHHHHHHHhhcCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEecCC Q lcl|NC_019932. 265 FRFWGNRTCSDDP-LFAFENYTRTAQVIADTMAEAHMWANDKPLTPVLVRDIIAGINAKFRELVSAGYLLGASCWYDDTA 343 (389) Q Consensus 265 ~~~wG~rT~~~d~-~~~~i~vrR~~~~i~~~~~~~~~~~v~e~n~~~~~~~i~~~i~~~l~~l~~~gal~g~~v~~d~~~ 343 (389) +++||+||+++++ +|+||+|||||+||+++|++.++|++|||||+.+|++|+++|+.||++||++|+|.||+|+||+++ T Consensus 552 ~~~wG~rT~~~~~s~~~~i~vrR~~~~i~~si~~~~~~~v~epn~~~~~~~i~~~i~~fL~~l~~~gal~gf~v~~d~~~ 631 (679) T protein:vir:10 552 YILYGDKTASQAPTPFDRINVRRLFNLLKKSISESAKYKLFELNDAFTRSSFRSEVGSYLDTIRSLGGIYDFRVVCDESN 631 (679) T ss_pred EEEEcccccCCCCcccceEehhhHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEcCCC Confidence 9999999998775 799999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCHHHhhCCEEEEEEEEEecccceEEEEEEEEcch--HHHHHHHHhcC Q lcl|NC_019932. 344 NDKDTLKAGKLFIDYDYTPVPPLEDLTLRQRITDS--YLANFAASVNS 389 (389) Q Consensus 344 n~~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~--~~~~~~~~~~~ 389 (389) ||+++|++|+|+++|+++|++|+|||+|++++... 
+|++++++++- T Consensus 632 nt~~~i~~G~~~~~i~~~p~~pae~i~~~~~~~~~~~~~~e~~~~~~~ 679 (679) T protein:vir:10 632 NTPAVIDRNEFVATILIKPARSINYITLSFVATSTGADFDELVGSFQQ 679 (679) T ss_pred CCHHHhhCCeEEEEEEEEecCCccEEEEEEEEeecCccHHHHHHHhcC Confidence 99999999999999999999999999999988555 79999999999 No 21 >protein:vir:103456 Length: 659 # NCBI annotation: tail sheath monomer # Family: family:all:661 # MgeID: mge:1542 # MgeName: RB32 # Cross-refs: genbank:acc:YP_803108;genbank:gi:116326388;genbank:GeneID:4405485 Probab=100.00 E-value=1.8e-85 Score=485.20 Aligned_cols=378 Identities=12% Similarity=0.094 Sum_probs=306.7 Q ss_pred CCCCCCCEEEEECCCCCcccccccccceeeeecccccccccccccccEEEecchhhhhhhcc---cchhHHHHHhhhccc Q lcl|NC_019932. 1 MSDYHHGVRVVEINDGTRTISTVSTAIVGMVCTADDADAAAFPLNEPVLLTNVLSAIGKAGK---KGTLAAALQAIADQA 77 (389) Q Consensus 1 M~~~~~GV~v~~v~~~~~~~~~v~t~v~~~~g~a~~~~~~~~~~~~~vli~~~~~~~~~~~~---~gtl~~~v~~~~~~~ 77 (389) |+--.|||||+|+..+++++.. +|++.+|+|+++ ..|.++|++++++.+|...||. ...+.+.+..+|.++ T Consensus 1 ~~~~~PgVyv~e~~~~~~~~~~-~ts~~~fvG~~~-----~Gp~~~p~~i~s~~d~~~~fG~~~~~~~~~~~v~~~f~ng 74 (659) T protein:vir:10 1 MTLLSPGIELKETTVQSTVVNN-STGTAALAGKFQ-----WGPAFQIKQVTNEVDLVNTFGQPTAETADYFMSAMNFLQY 74 (659) T ss_pred CceecCceEEEEecCCceeccc-CccceEEEeccc-----CCCCCccEEecCHHHHHHHcCCcCCCcchhHHHHHHHhhC Confidence 9866799999999999987765 799999999985 4578999999999999999984 467888899999999 Q ss_pred CceEEEEEeccccccc----------------------------------------ccc-------c------------- Q lcl|NC_019932. 78 KPVTVVVRVAEGATPA----------------------------------------ETT-------S------------- 97 (389) Q Consensus 78 ~~~~~v~~~~~~~~~~----------------------------------------~t~-------~------------- 97 (389) +..++++|.....+.. ... . 
T Consensus 75 g~~~~vvRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~v~~vd~~~~~~~~~i~~~~~~~~~ 154 (659) T protein:vir:10 75 GNDLRVVRAVDRDTAKNSSPIAGNIEYTISTPGSNYAVGDKITVKYVSDAIETEGKITEVDTDGKIKKINIPTAKIIAKA 154 (659) T ss_pred CCeEEEEEccCcccccccccccccceeeEeecccccccccceeeeecCCCccccceeeEEecccccceeeeccccccccc Confidence 9988888753211000 000 0 Q ss_pred ----cccc-----------------------cc------------------------------cc---cc---------- Q lcl|NC_019932. 98 ----NIIG-----------------------TT------------------------------DE---NG---------- 107 (389) Q Consensus 98 ----~~~~-----------------------~~------------------------------d~---~~---------- 107 (389) .... .. +. .. T Consensus 155 ~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~a~t~~~~~~~~~~~~~~~v~a~~~G~~g~~~ 234 (659) T protein:vir:10 155 KEVGEYPTLGSNWTAEISSSSSGLAAVITLGKIITDSGILLAEIENAEAAMTAVDFQANLKKYGIPGVVALYPGELGDKI 234 (659) T ss_pred ccccccceeeeeeeeeeeeeccccceeeEEeeeecCCceeEEeeccccccccccccccceeecccccccccccceecccc Confidence 0000 00 00 00 Q ss_pred -------------------------------------------------cchh-----------------------hHH- Q lcl|NC_019932. 108 -------------------------------------------------RYTG-----------------------MKA- 114 (389) Q Consensus 108 -------------------------------------------------~~tG-----------------------l~a- 114 (389) ...| +.. T Consensus 235 tv~~~~~a~~~~~~~v~v~~~~~~~~~a~~~t~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 314 (659) T protein:vir:10 235 EIEIVSKADYAKGASALLPIYPGGGTRASTAKAVFGYGPQTDSQYAIIVRRNDAIVQSVVLSTKRGEKDIYDSNIYIDDF 314 (659) T ss_pred eEEEechhhccccceeeeeeeeecccccccceeeeeeccccccchhhccccccceeeeeeeeccccccccccchhhhhhh Confidence 0000 000 Q ss_pred --------------------------------------------HHHHhhhhhhhhhhhccccccc------hHHHHHHH Q lcl|NC_019932. 115 --------------------------------------------LLSAQTQLGVKPRILGVPGLDA------LEVSTALA 144 (389) Q Consensus 115 --------------------------------------------~~~~~~~~~~~p~~~~apg~~~------~~v~~al~ 144 (389) +..........++++++|++++ .+|+.+|. 
T Consensus 315 ~~~~~~~~v~~~~~~~~~~~~~~~~l~gg~~~~~~~~~~~~~~~~~~~~~~~~~~~~il~~p~~~~~~~~~~~~v~~al~ 394 (659) T protein:vir:10 315 FAKGGSEYIFATAQNWPEGFSGILTLSGGLSSNAEVTAGDLMEAWDFFADRESVDVQLFIAGSCAGESLETASTVQKHVV 394 (659) T ss_pred hccCcccEEEEeecccCCCccceeeecccccccccccchhHHHHHHHhhhccccceeEEEecCCCCcchhhhHHHHHHHH Confidence 0000000112356777888754 45889999 Q ss_pred HhhhhcC-ceeeeccCC--------CccHHHHHHhhhc----------ccCceeEEeeeeEEEEeecCCCceEEehhHHH Q lcl|NC_019932. 145 SIAQQLR-AFAYVSAWG--------CKTLSEAMAYREN----------FSQRELMVIWPDFISWNTTANQSETAYATARA 205 (389) Q Consensus 145 ~~~~~~~-~~~i~d~~~--------~~t~~~a~~~~~~----------~~s~~~~~~~p~~~~~~~~~~~~~~~p~s~~~ 205 (389) ++|++++ +++++|.|. +.+.+++.+|++. ++|+++++||||++++|+.++..+++|||+++ T Consensus 395 ~~~~~~~~~~~~~d~p~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~s~~~~l~~p~~~~~d~~~~~~~~~p~sg~~ 474 (659) T protein:vir:10 395 SIGDARQDCLVLCSPPRETVVGIPVTRAVDNLVNWRTAAGSYTDNNFNISSTYAAIDGNYKYQYDKYNDVNRWVPLAADI 474 (659) T ss_pred HHHHhhCCeEEEEcCccccccCCCcccCHHHHHHHHHhcccccccccccCcceEEEEeCcEEEecccCCceEEechHHHH Confidence 9999985 778888653 4677888888874 78999999999999999999999999999999 Q ss_pred HHHHHhhhccccceeccCCceeccceeccccccccccCCcchhhhhcccceEEEEc--CCCEEEEcCccCCCCc-cccee Q lcl|NC_019932. 206 LGLRAKIDTDTGWHKTLSNVGVNGVTGISASVFWDLQQTGTDADLLNEACVTTLIR--KDGFRFWGNRTCSDDP-LFAFE 282 (389) Q Consensus 206 Ag~~a~~d~~~g~~~span~~l~gv~~~~~~~~~~~~~~~~~~~~l~~~~i~~~~~--~~G~~~wG~rT~~~d~-~~~~i 282 (389) ||++||+|.++|||+||+|+++.++.+... 
.+..+++.|++.||++|||++++ ++|+++||+||+++|+ .|+|| T Consensus 475 AGl~Ar~D~~~g~~~span~~~~~i~g~~~---~~~~~~~~~~~~Ln~~gIn~i~~~~~~G~~~wG~rT~~~~~s~~~~i 551 (659) T protein:vir:10 475 AGLCARTDNVSQTWMSPAGYNRGQILNVIK---LAIETRQAQRDRLYQEAINPVTGTGGDGYVLYGDKTATSVPSPFDRI 551 (659) T ss_pred HHHHHHHhccCCceEccCCceeeeeecccc---ceecCCHhHHHHHhhCCeeEEEEeCCCeEEEEcccccCCCCcccceE Confidence 999999999999999999999887777643 35566788999999999999864 6799999999998775 79999 Q ss_pred ehhhHHHHHHHHHHHHHHHHhhcCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEecCCCCHHHhhCCEEEEEEEEEe Q lcl|NC_019932. 283 NYTRTAQVIADTMAEAHMWANDKPLTPVLVRDIIAGINAKFRELVSAGYLLGASCWYDDTANDKDTLKAGKLFIDYDYTP 362 (389) Q Consensus 283 ~vrR~~~~i~~~~~~~~~~~v~e~n~~~~~~~i~~~i~~~l~~l~~~gal~g~~v~~d~~~n~~~~i~~G~~~~~i~~~p 362 (389) +|||||+||+++|++.++|++||||++.+|++|+++|+.||++||++|+|+||+|+||+++||+++|++|+|+++|+++| T Consensus 552 ~vrR~~~~i~~si~~~~~~~v~e~n~~~l~~~i~~~i~~fL~~l~~~gal~~~~V~~d~~~nt~~~i~~G~~~~~i~~~p 631 (659) T protein:vir:10 552 NVRRLFNMLKTNIGRSSKYRLFELNNAFTRSSFRTETAQYLQGIKALGGIYEYRVVCDTTNNTPSVIDRNEFVATFYIQP 631 (659) T ss_pred ehhhHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhcCceeeEEEEEcCCCCCHHHhhCCeEEEEEEEEe Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccceEEEEEEEEcchHHHHHHHHhcC Q lcl|NC_019932. 
363 VPPLEDLTLRQRITDSYLANFAASVNS 389 (389) Q Consensus 363 ~~p~e~i~~~~~~~~~~~~~~~~~~~~ 389 (389) ++|+|||+|++++.....+ |++|.+ T Consensus 632 ~~pae~i~~~~~~~~~~~~--~~e~~~ 656 (659) T protein:vir:10 632 ARSINYITLNFVATATGAD--FDELTG 656 (659) T ss_pred cCCcceEEEEEEEEecCcc--hHHhhc Confidence 9999999999999987655 777777 No 22 >protein:vir:108052 Length: 660 # NCBI annotation: gp18 tail sheath protein # Family: family:all:661 # MgeID: mge:2002 # MgeName: JS98 # Cross-refs: genbank:acc:YP_001595294;genbank:gi:161622600;genbank:GeneID:5783771 Probab=100.00 E-value=2e-85 Score=484.99 Aligned_cols=376 Identities=14% Similarity=0.127 Sum_probs=304.4 Q ss_pred CCCCCCCEEEEECCCCCcccccccccceeeeecccccccccccccccEEEecchhhhhhhcc---cchhHHHHHhhhccc Q lcl|NC_019932. 1 MSDYHHGVRVVEINDGTRTISTVSTAIVGMVCTADDADAAAFPLNEPVLLTNVLSAIGKAGK---KGTLAAALQAIADQA 77 (389) Q Consensus 1 M~~~~~GV~v~~v~~~~~~~~~v~t~v~~~~g~a~~~~~~~~~~~~~vli~~~~~~~~~~~~---~gtl~~~v~~~~~~~ 77 (389) |+--.|||||+|+ ++++++..+++++.+|+|.++ ..|.++|++++++.++...||. ...+...+..+|.++ T Consensus 1 ~~~~~Pgvyv~e~-~~~~~i~~~~t~~~~~vg~~~-----~gp~~~p~~v~s~~~~~~~fg~~~~~~~~~~~~~~~f~~~ 74 (660) T protein:vir:10 1 MALLSPGIELKET-SVQSTVVRNATGRAALVGKFQ-----WGPAFQVTQITNEVELVDLFGGPNNEVADYFMSGMNFLQY 74 (660) T ss_pred CceecCceEEEee-cCCccccCCCcccceEEeecC-----CCCCccCeEcCCHHHHHHHcCCcCCCchhHHHHHHHHHhC Confidence 8866799999999 689999999999999999985 4578999999999999999983 355677777888888 Q ss_pred CceEEEEEecccccccc----------c---------c-----------------------c--ccc------------- Q lcl|NC_019932. 78 KPVTVVVRVAEGATPAE----------T---------T-----------------------S--NII------------- 100 (389) Q Consensus 78 ~~~~~v~~~~~~~~~~~----------t---------~-----------------------~--~~~------------- 100 (389) +..|+++|......... + . . ... 
T Consensus 75 g~~~~vvrv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~v~~~~~~~~~~~~v~~~~a~g~~~~~~~~ta~~~~~a 154 (660) T protein:vir:10 75 GNDLRTVRVVSREFAKNASPIAGNIETTITTAGSNYAVGDKINIKYNQTVVESEGRVTSVDTDGKILSVFIPSAKIIAYA 154 (660) T ss_pred CceEEEEEecccccccccccccccceeEEeeccccccccceeeEeeccccccccccceeeccccceeeeccccccccccc Confidence 87777776532221000 0 0 0 000 Q ss_pred ------------------c---cc-----------cc-------------------------------c----------- Q lcl|NC_019932. 101 ------------------G---TT-----------DE-------------------------------N----------- 106 (389) Q Consensus 101 ------------------~---~~-----------d~-------------------------------~----------- 106 (389) . .. +. . T Consensus 155 ~~v~~~~~~~~~~~~~~~~~~~~~~~a~sv~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~g~~G~~i 234 (660) T protein:vir:10 155 RSLNQYPTLGPAWTAEVTSASSGVSGTITVGKIVTDSGILLTEAENSEEAITSLEFQAALKKFAMPGVVALYPGEIGSTL 234 (660) T ss_pred cccccccccccceeEEEecccCccccceeeeeeeccCcceEEeeeccccccccccceeeccccccceeeeecccccCcce Confidence 0 00 00 0 Q ss_pred -------------------c------------------------------------------------------------ Q lcl|NC_019932. 107 -------------------G------------------------------------------------------------ 107 (389) Q Consensus 107 -------------------~------------------------------------------------------------ 107 (389) . T Consensus 235 ~v~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~ 314 (660) T protein:vir:10 235 EVEIVSKAAYEAGSSKMLDVYPGGGTRASIAKAVFNYGPQTDDQYAIIVRRDGAIVESVVLSTKEGEKDVYGNNIYLDDY 314 (660) T ss_pred eEEEeeccccCCcceeEEeeeeccceeeEEeeeecccccccccccccccccCCcccceeeeeccccccccccceeeeehh Confidence 0 Q ss_pred ----------------------------------------cchhhHHHHHHhhhhhhhhhhhccccccc------hHHHH Q lcl|NC_019932. 
108 ----------------------------------------RYTGMKALLSAQTQLGVKPRILGVPGLDA------LEVST 141 (389) Q Consensus 108 ----------------------------------------~~tGl~a~~~~~~~~~~~p~~~~apg~~~------~~v~~ 141 (389) ..+|+.+++ ......++++++|++.+ ++|++ T Consensus 315 ~~~~~~~~v~~~~~~~~~~~~~~~~l~gg~~~~~~~t~~~~~~~~~~~~---~~~~~~~~~l~~p~~~~~~~~~~~~v~~ 391 (660) T protein:vir:10 315 FAKGTSNYIYATSLNWPKGFSGIINLSGGISANDKVTAGDLMQGWDLFA---DREALHINLLIAGAVAGEGDEVASTVQK 391 (660) T ss_pred hcCCCccEEEEEeccCCCCcccceeeeccccCccccccchhhhhhhhhh---hhhhcccceEEEcCcCCCchhhhHHHHH Confidence 000000000 00012344556666543 45889 Q ss_pred HHHHhhhhcC-ceeeeccCCC--------ccHHHHHHhhh----------cccCceeEEeeeeEEEEeecCCCceEEehh Q lcl|NC_019932. 142 ALASIAQQLR-AFAYVSAWGC--------KTLSEAMAYRE----------NFSQRELMVIWPDFISWNTTANQSETAYAT 202 (389) Q Consensus 142 al~~~~~~~~-~~~i~d~~~~--------~t~~~a~~~~~----------~~~s~~~~~~~p~~~~~~~~~~~~~~~p~s 202 (389) +|.++|++++ +++++|+|.+ .+.+++.+|++ +++|.+.++||||++++|+.+++.+++||| T Consensus 392 al~~~~~~~~~~~aiid~p~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s 471 (660) T protein:vir:10 392 HVVSIADERQDCLAFISPPKGLLVNVPLTRAVDNLIDWRTGAGTFDANNMNISTTYAAIDGNYKYQYDKYNDVNRWVPLA 471 (660) T ss_pred HHHHHHHhhCCEEEEEecCcccccccccccCHHHHHHHHhhcccccccccccCcceEEEEcCceEEecccCCceeEechh Confidence 9999999885 8999999854 36788888886 367999999999999999999999999999 Q ss_pred HHHHHHHHhhhccccceeccCCceeccceeccccccccccCCcchhhhhcccceEEEE--cC-CCEEEEcCccCCCCc-c Q lcl|NC_019932. 203 ARALGLRAKIDTDTGWHKTLSNVGVNGVTGISASVFWDLQQTGTDADLLNEACVTTLI--RK-DGFRFWGNRTCSDDP-L 278 (389) Q Consensus 203 ~~~Ag~~a~~d~~~g~~~span~~l~gv~~~~~~~~~~~~~~~~~~~~l~~~~i~~~~--~~-~G~~~wG~rT~~~d~-~ 278 (389) +++||++||+|.++|||+||||+.+.++.+.. ..+..+++.|++.||++|||+++ ++ +||++||+||+++|+ . 
T Consensus 472 g~~AGl~Ar~D~~~g~~~sPan~~~~~i~g~~---~~~~~~~~~e~~~Ln~~gIn~i~~~~~~~G~~~wG~rT~~~~~s~ 548 (660) T protein:vir:10 472 ADLAGLCARTDDVSQPWMSPAGYNRGQILNVL---KLAIEPRQAQRDRMYQEAINPVVGFAGGDGFVLFGDKTATKVPSP 548 (660) T ss_pred HHHHHHHHHhhccCCcEEccCCeeeceeeccc---eeeecCChhhHHhHhhCCceEEEEeeCCCcEEEEcccccCCCCcc Confidence 99999999999999999999999987777753 33556788899999999999984 34 799999999998886 7 Q ss_pred cceeehhhHHHHHHHHHHHHHHHHhhcCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEecCCCCHHHhhCCEEEEEE Q lcl|NC_019932. 279 FAFENYTRTAQVIADTMAEAHMWANDKPLTPVLVRDIIAGINAKFRELVSAGYLLGASCWYDDTANDKDTLKAGKLFIDY 358 (389) Q Consensus 279 ~~~i~vrR~~~~i~~~~~~~~~~~v~e~n~~~~~~~i~~~i~~~l~~l~~~gal~g~~v~~d~~~n~~~~i~~G~~~~~i 358 (389) |+||||||||+||+++|++.++|++||||++.+|++|+++++.||++||++|+|.||+|+||+++||+++|++|+|+++| T Consensus 549 ~~~i~vrR~~~~i~~si~~~~~~~v~epn~~~l~~~i~~~i~~fL~~l~~~gal~g~~V~~d~~~nt~~di~~G~~~~~i 628 (660) T protein:vir:10 549 MDHINVRRLFNMLKKNIGDASKYKLFELNDNFTRSSFRMEVSQYLDGIKALGGIYEGRVVCDTTVNTPAVIDRNEFIANI 628 (660) T ss_pred cceEehhhHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEcCCCCCHHHhhCCeEEEEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEecccceEEEEEEEEcchH--HHHHHHHhc Q lcl|NC_019932. 359 DYTPVPPLEDLTLRQRITDSY--LANFAASVN 388 (389) Q Consensus 359 ~~~p~~p~e~i~~~~~~~~~~--~~~~~~~~~ 388 (389) +++|++|+|||+|++++.+.+ |+|+++++. 
T Consensus 629 ~~~P~~pae~I~~~~~~~~~~~~~~e~~~~~~ 660 (660) T protein:vir:10 629 YVKPARSINYITLNFVATSTGADFDELIGPLV 660 (660) T ss_pred EEEecCCccEEEEEEEEeecCccHHHHhhhcC Confidence 999999999999999887664 788888888 No 23 >protein:vir:7206 Length: 659 # NCBI annotation: gp18 tail sheath protein # Family: family:all:661 # MgeID: mge:142 # MgeName: T4 # Cross-refs: genbank:acc:NP_049780;genbank:gi:9632592;genbank:GeneID:1258597 Probab=100.00 E-value=2.9e-85 Score=484.03 Aligned_cols=379 Identities=13% Similarity=0.109 Sum_probs=304.3 Q ss_pred CCCCCCCEEEEECCCCCcccccccccceeeeecccccccccccccccEEEecchhhhhhhc---ccchhHHHHHhhhccc Q lcl|NC_019932. 1 MSDYHHGVRVVEINDGTRTISTVSTAIVGMVCTADDADAAAFPLNEPVLLTNVLSAIGKAG---KKGTLAAALQAIADQA 77 (389) Q Consensus 1 M~~~~~GV~v~~v~~~~~~~~~v~t~v~~~~g~a~~~~~~~~~~~~~vli~~~~~~~~~~~---~~gtl~~~v~~~~~~~ 77 (389) |+--.|||||+|+..+++++.. +|++.+|+|+++ ..|.++|++++++.+|...|| +...+...+..+|.++ T Consensus 1 ~~~~~PgVyvee~~~~~~~~~~-~ts~~~fvG~~~-----~Gp~~~p~~i~s~~~~~~~fG~~~~~~~~~~~~~~~f~ng 74 (659) T protein:vir:72 1 MTLLSPGIELKETTVQSTVVNN-STGTAALAGKFQ-----WGPAFQIKQVTNEVDLVNTFGQPTAETADYFMSAMNFLQY 74 (659) T ss_pred CceecCceEEEEecCCcccccC-CCcceEEEeecC-----CCCCcccEEecCHHHHHHHcCCcCCCCchhHHHHHHHHhC Confidence 8866799999999999976655 899999999985 457899999999999999999 4466788888899999 Q ss_pred CceEEEEEeccccccccc---------------------------ccc----------------------c--------- Q lcl|NC_019932. 78 KPVTVVVRVAEGATPAET---------------------------TSN----------------------I--------- 99 (389) Q Consensus 78 ~~~~~v~~~~~~~~~~~t---------------------------~~~----------------------~--------- 99 (389) +..|+++|.......... ... . 
T Consensus 75 g~~~~vvRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~g~v~~~~~~~~~~~~~v~t~~~~a~~ 154 (659) T protein:vir:72 75 GNDLRVVRAVDRDTAKNSSPIAGNIDYTISTPGSNYAVGDKITVKYVSDDIETEGKITEVDADGKIKKINIPTGKNYAKA 154 (659) T ss_pred CceEEEEEccCCcccccccccccccceeecccccccccceeeeeeeccccccccceEEEeeccccceeeeeccccccccc Confidence 988888876321100000 000 0 Q ss_pred --------------------cc---c-----------------c---------------------------------cc- Q lcl|NC_019932. 100 --------------------IG---T-----------------T---------------------------------DE- 105 (389) Q Consensus 100 --------------------~~---~-----------------~---------------------------------d~- 105 (389) .. . . +. T Consensus 155 ~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~v~v~~~~~~~~~~v~~~~~a~~~~~~~~~v~~~~~~~~~a~~~gt~g~~~ 234 (659) T protein:vir:72 155 KEVGEYPTLGSNWTAEISSSSSGLAAVITLGKIITDSGILLAEIENAEAAMTAVDFQANLKKYGIPGVVALYPGELGDKI 234 (659) T ss_pred cccccccccccceeeEEeeccccccceEEEEEeecCcceeeeeccccchhhhcccccccccccccceeeeccccccccce Confidence 00 0 0 00 Q ss_pred ---------------------------ccc-----------------------------------c-------------- Q lcl|NC_019932. 106 ---------------------------NGR-----------------------------------Y-------------- 109 (389) Q Consensus 106 ---------------------------~~~-----------------------------------~-------------- 109 (389) ... . T Consensus 235 tv~i~~~~~~~~~~~~~v~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 314 (659) T protein:vir:72 235 EIEIVSKADYAKGASALLPIYPGGGTRASTAKAVFGYGPQTDSQYAIIVRRNDAIVQSVVLSTKRGEKDIYDSNIYIDDF 314 (659) T ss_pred eEEEccccccccceeeeeecccccccccccceeeeeeecccccccceeeecccceeeeeeeeeccccccccchhhhhhhh Confidence 000 0 Q ss_pred --hh-------------------------------h------HHHHHHhhhhhhhhhhhccccccc------hHHHHHHH Q lcl|NC_019932. 110 --TG-------------------------------M------KALLSAQTQLGVKPRILGVPGLDA------LEVSTALA 144 (389) Q Consensus 110 --tG-------------------------------l------~a~~~~~~~~~~~p~~~~apg~~~------~~v~~al~ 144 (389) ++ + .++..........+.++++|++.+ .+++.+|. 
T Consensus 315 ~~~~~~~~v~~~~~~~~~~~~~~~~l~gg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~p~~~~~~~~~~~~v~~~l~ 394 (659) T protein:vir:72 315 FAKGGSEYIFATAQNWPEGFSGILTLSGGLSSNAEVTAGDLMEAWDFFADRESVDVQLFIAGSCAGESLETASTVQKHVV 394 (659) T ss_pred hhcCCceEEEEEecccCCcccccccccccccccccccchhHHHHHHHhhhccccceeEEEecCCCCcchhhhHHHHHHHH Confidence 00 0 000000000112356777888754 45889999 Q ss_pred HhhhhcC-ceeeeccCC--------CccHHHHHHhhhc----------ccCceeEEeeeeEEEEeecCCCceEEehhHHH Q lcl|NC_019932. 145 SIAQQLR-AFAYVSAWG--------CKTLSEAMAYREN----------FSQRELMVIWPDFISWNTTANQSETAYATARA 205 (389) Q Consensus 145 ~~~~~~~-~~~i~d~~~--------~~t~~~a~~~~~~----------~~s~~~~~~~p~~~~~~~~~~~~~~~p~s~~~ 205 (389) ++|++++ +++++|.|. +.+.+++.+||+. ++|.+.++||||++++|+.++..+++|||+++ T Consensus 395 ~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~sg~v 474 (659) T protein:vir:72 395 SIGDARQDCLVLCSPPRETVVGIPVTRAVDNLVNWRTAAGSYTDNNFNISSTYAAIDGNHKYQYDKYNDVNRWVPLAADI 474 (659) T ss_pred HHHhhhCCEEEEEcCccccccCCCcccCHHHHHHHHhhccccccccccccceeEEEEcCceeeccccCCceEEechHHHH Confidence 9999985 788888763 4567888888874 67999999999999999999999999999999 Q ss_pred HHHHHhhhccccceeccCCceeccceeccccccccccCCcchhhhhcccceEEEE--cCCCEEEEcCccCCCCc-cccee Q lcl|NC_019932. 206 LGLRAKIDTDTGWHKTLSNVGVNGVTGISASVFWDLQQTGTDADLLNEACVTTLI--RKDGFRFWGNRTCSDDP-LFAFE 282 (389) Q Consensus 206 Ag~~a~~d~~~g~~~span~~l~gv~~~~~~~~~~~~~~~~~~~~l~~~~i~~~~--~~~G~~~wG~rT~~~d~-~~~~i 282 (389) ||++||+|.++|||+||+|+.+.++.++. 
.....+++.|++.||++|||+++ +++|+++||+||+++|+ .|+|| T Consensus 475 AGl~Ar~D~~~G~~~span~~~~~i~g~~---~~~~~~~~~~~~~Ln~~gIn~i~~~~g~G~~~wG~rT~~~~~s~~~~i 551 (659) T protein:vir:72 475 AGLCARTDNVSQTWMSPAGYNRGQILNVI---KLAIETRQAQRDRLYQEAINPVTGTGGDGYVLYGDKTATSVPSPFDRI 551 (659) T ss_pred HHHHHHhhccCCcEEccCCeeeceeeccc---cccccCChhHHHHHhhCCceEEEEecCCeEEEEcccccCCCCcccceE Confidence 99999999999999999999988888763 34566778899999999999996 56899999999998776 79999 Q ss_pred ehhhHHHHHHHHHHHHHHHHhhcCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEecCCCCHHHhhCCEEEEEEEEEe Q lcl|NC_019932. 283 NYTRTAQVIADTMAEAHMWANDKPLTPVLVRDIIAGINAKFRELVSAGYLLGASCWYDDTANDKDTLKAGKLFIDYDYTP 362 (389) Q Consensus 283 ~vrR~~~~i~~~~~~~~~~~v~e~n~~~~~~~i~~~i~~~l~~l~~~gal~g~~v~~d~~~n~~~~i~~G~~~~~i~~~p 362 (389) +|||+|+||+++|++.++|+|||||++.+|++|+++|+.||++||++|+|.||+|+||+++||+++|++|+|+++|+++| T Consensus 552 ~vrR~~~~i~~si~~~~~~~v~e~n~~~l~~~i~~~i~~fL~~l~~~gal~~~~V~~d~~~nt~~~i~~G~~~~~i~~~p 631 (659) T protein:vir:72 552 NVRRLFNMLKTNIGRSSKYRLFELNNAFTRSSFRTETAQYLQGNKALGGIYEYRVVCDTTNNTPSVIDRNEFVATFYIQP 631 (659) T ss_pred eehhHHHHHHHHHHHHHHHhhcCCCCHHHHHHHHHHHHHHHHHHHhcCceeeEEEEEcCCCCCHHHhhCCeEEEEEEEEe Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccceEEEEEEEEcchH--HHHHHHHhc Q lcl|NC_019932. 
363 VPPLEDLTLRQRITDSY--LANFAASVN 388 (389) Q Consensus 363 ~~p~e~i~~~~~~~~~~--~~~~~~~~~ 388 (389) ++|+|||+|+|++...+ |+|+.++-- T Consensus 632 ~~pae~I~~~~~~~~~~~~~~e~~~~~~ 659 (659) T protein:vir:72 632 ARSINYITLNFVATATGADFDELTGLAG 659 (659) T ss_pred cCCccEEEEEEEEeecCcchHHhcccCC Confidence 99999999999886654 555555443 No 24 >protein:vir:101187 Length: 663 # NCBI annotation: tail sheath protein # Family: family:all:661 # MgeID: mge:1582 # MgeName: 44RR2.8t # Cross-refs: genbank:acc:NP_932509;genbank:gi:37651635;genbank:GeneID:2610680 Probab=100.00 E-value=1.6e-85 Score=485.54 Aligned_cols=380 Identities=14% Similarity=0.109 Sum_probs=303.8 Q ss_pred CCCCCCCEEEEECCCCCcccccccccceeeeecccccccccccccccEEEecchhhhhhhcc---cchhHHHHHhhhccc Q lcl|NC_019932. 1 MSDYHHGVRVVEINDGTRTISTVSTAIVGMVCTADDADAAAFPLNEPVLLTNVLSAIGKAGK---KGTLAAALQAIADQA 77 (389) Q Consensus 1 M~~~~~GV~v~~v~~~~~~~~~v~t~v~~~~g~a~~~~~~~~~~~~~vli~~~~~~~~~~~~---~gtl~~~v~~~~~~~ 77 (389) |+--.|||||+|+ ++++++..++|++.+|+|.+. ..|.++|++++++.+|...||. ...+.+.+..+|.++ T Consensus 1 ~~~~~PgVyv~e~-~~~~~i~~v~ts~~~fvG~~~-----~Gp~~~p~~i~s~~d~~~~fG~~~~~~~~~~~v~~~f~ng 74 (663) T protein:vir:10 1 MALLSPGIEMKET-SINSTVVRSATGRAAIVGKFA-----WGPAYEVRQVTNEVELVDMFGSPDNVTAPYFMSAMNFLQY 74 (663) T ss_pred CceecCceEEEEe-cCcccccccCccceeEEeeec-----cCCCCccEEecCHHHHHHHhCCcCccchhHHHHHHHHHhC Confidence 8866799999999 699999999999999999985 4578999999999999999995 567788999999999 Q ss_pred CceEEEEEecccccccccc-----------------------------c------------------cc----------- Q lcl|NC_019932. 78 KPVTVVVRVAEGATPAETT-----------------------------S------------------NI----------- 99 (389) Q Consensus 78 ~~~~~v~~~~~~~~~~~t~-----------------------------~------------------~~----------- 99 (389) +..++++|...+....... . .. 
T Consensus 75 g~~~~vvRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~g~~~~v~~~~~~~~~~~~~~~~~~n~~~~~v~~~~a~~~~~~ 154 (663) T protein:vir:10 75 GNDLRLVRVIDMEKAKNASPLVNQVSVTITTEGQGYTVGDAITVKYNNATITEAGKVTAVDSDGKIKSLFVPTAEIIAKT 154 (663) T ss_pred CCeEEEEEccCCcccccccccCCcceeeeeccccCccccccccccccccccccccceeeeccCCceEEEEeccccccccc Confidence 9999999875332110000 0 00 Q ss_pred ----------------c----ccc-----------ccc-c------cchhhH---------------------------- Q lcl|NC_019932. 100 ----------------I----GTT-----------DEN-G------RYTGMK---------------------------- 113 (389) Q Consensus 100 ----------------~----~~~-----------d~~-~------~~tGl~---------------------------- 113 (389) . ... +.. . ...++. T Consensus 155 ~~v~~~~~~~~~~~~~~~~~~~~~~~~~~v~~vv~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~a~~~G~~Gn~i 234 (663) T protein:vir:10 155 RQLGTYPTLGDNWRIDVSGASGGSAAALALGNIVVDSGVTFGNSEDAPAVMTSPAVMEKYAKFGMPLVSAVYPGEIGSTV 234 (663) T ss_pred cccceeeeccccceeEeeeccccccccccccceecccceeeEeeccccccccccchhhhcccccceeeeeecccccccce Confidence 0 000 000 0 000000 Q ss_pred -------------------------------------------------------------------------HHHHH-- Q lcl|NC_019932. 114 -------------------------------------------------------------------------ALLSA-- 118 (389) Q Consensus 114 -------------------------------------------------------------------------a~~~~-- 118 (389) ..... T Consensus 235 ~v~i~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~~~~~~~~~~~~~~ 314 (663) T protein:vir:10 235 EVEIVSKTAFNSGAQQTIYPFGGTRTSNARGVIQYGPMTDDQFAIIVRRDGIVVESTVLSTRKGDRDVYGSNIFMDDYFR 314 (663) T ss_pred eEEecccccccccccccccccccccccccceeeeeccccccceeEEEecCCcceeeeeeeecccccccchhhhhhhhhhc Confidence 00000 Q ss_pred ----------------------------------------------hhhhhhhhhhhccc--cc----cchHHHHHHHHh Q lcl|NC_019932. 119 ----------------------------------------------QTQLGVKPRILGVP--GL----DALEVSTALASI 146 (389) Q Consensus 119 ----------------------------------------------~~~~~~~p~~~~ap--g~----~~~~v~~al~~~ 146 (389) .....+.+.++++| +. 
...+|+.+|.++ T Consensus 315 ~~~~~~~~~~~~~~~~~~~~~~~l~gg~d~~~~~~~~~~~~~~~~l~~~~~~~~~~~i~~~~~~~~~~~~~~v~~al~~~ 394 (663) T protein:vir:10 315 NGGSNFIFASSEGWPAGFTGIIQLGGGTSANADVGADELIKGWDLFSDREALHVNLMIAGACGSDGAEIASTVQKYVVSL 394 (663) T ss_pred cCcceEEEEeecccCccccceeEcccccCCCccccchhhHHHHHhhhcccccceeEEEeccCCCCchhhHHHHHHHHHHH Confidence 00000011111111 11 124578899999 Q ss_pred hhhcC-ceeeeccCCC--------ccHHHHHHhhh-------------cccCceeEEeeeeEEEEeecCCCceEEehhHH Q lcl|NC_019932. 147 AQQLR-AFAYVSAWGC--------KTLSEAMAYRE-------------NFSQRELMVIWPDFISWNTTANQSETAYATAR 204 (389) Q Consensus 147 ~~~~~-~~~i~d~~~~--------~t~~~a~~~~~-------------~~~s~~~~~~~p~~~~~~~~~~~~~~~p~s~~ 204 (389) |++++ +++++|.|.+ .+.+++.+++. +++|++.++||||++++|+.++..+++|||++ T Consensus 395 a~~~~~~~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~l~~P~~~~~d~~~~~~~~~p~s~~ 474 (663) T protein:vir:10 395 ADDRQDCVAIVNPPAELMVGIPTSTAVKNIVEWRNGMTGSGEVVDNNMNISSTYAFIIGNYKYQYDKYNDINRWVPLAAD 474 (663) T ss_pred HHhhCCEEEEEecCcccccccccccchHHHHHHHHhccccccchhhccccCcceEEEEccceEEecccCCceEEechhHH Confidence 99885 8899999864 34566666664 47899999999999999999999999999999 Q ss_pred HHHHHHhhhccccceeccCCceeccceeccccccccccCCcchhhhhcccceEEEE--cC-CCEEEEcCccCCCCc-ccc Q lcl|NC_019932. 205 ALGLRAKIDTDTGWHKTLSNVGVNGVTGISASVFWDLQQTGTDADLLNEACVTTLI--RK-DGFRFWGNRTCSDDP-LFA 280 (389) Q Consensus 205 ~Ag~~a~~d~~~g~~~span~~l~gv~~~~~~~~~~~~~~~~~~~~l~~~~i~~~~--~~-~G~~~wG~rT~~~d~-~~~ 280 (389) +||++||+|.++|||+||+|+.+.++.+.. 
..+..+++.|++.||++|||+++ ++ +|+++||+||+++++ .|+ T Consensus 475 vAGl~Ar~D~~~g~~~sPan~~~~~i~g~~---~~~~~~~~~~~~~Ln~~gin~i~~~~~~~G~~~wG~rT~s~~~s~~~ 551 (663) T protein:vir:10 475 IAGLCAYTDQVSHPWMSPAGYRRGQIRNCI---KLAIEPKQSMRDTMYQVAINPVTGFAGGDGFVLFGDKMATQVPSPFD 551 (663) T ss_pred HHHHHHHhhccCCceEccCCceeccccccc---cceeccChhHHHHHhhCCceEEEEEeCCCcEEEEcccccCCCCcccc Confidence 999999999999999999999877776653 34566778899999999999984 44 799999999998775 799 Q ss_pred eeehhhHHHHHHHHHHHHHHHHhhcCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEecCCCCHHHhhCCEEEEEEEE Q lcl|NC_019932. 281 FENYTRTAQVIADTMAEAHMWANDKPLTPVLVRDIIAGINAKFRELVSAGYLLGASCWYDDTANDKDTLKAGKLFIDYDY 360 (389) Q Consensus 281 ~i~vrR~~~~i~~~~~~~~~~~v~e~n~~~~~~~i~~~i~~~l~~l~~~gal~g~~v~~d~~~n~~~~i~~G~~~~~i~~ 360 (389) ||++||||+||+++|++.++|++||||++.+|++|+++|+.||++||++|+|.||+|+||+++||+++|++|+|+++|++ T Consensus 552 ~i~vrR~~~~i~~si~~~~~~~v~e~n~~~l~~~i~~~i~~~L~~l~~~gal~g~~v~~d~~~nt~~~i~~G~~~~~i~~ 631 (663) T protein:vir:10 552 RINVRRLFNMLKKNIGDTSKYELFENNDAFTRQSFRMETSQYLDGIRSLGGCYDFRVVCDTTNNTPNVIDRNEFVGTIYV 631 (663) T ss_pred eEehhhHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhcCceeeeEEEEcCCCCCHHHhhCCeEEEEEEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EecccceEEEEEEEEcch--HHHHHHHHhcC Q lcl|NC_019932. 361 TPVPPLEDLTLRQRITDS--YLANFAASVNS 389 (389) Q Consensus 361 ~p~~p~e~i~~~~~~~~~--~~~~~~~~~~~ 389 (389) +|++|+|||+|++++... 
.|+|++++|++ T Consensus 632 ~p~~pae~i~~~~~~~~~~~~~~e~~~~~~~ 662 (663) T protein:vir:10 632 KPPRSINYITLNMVATSTGANFDELIGPMQL 662 (663) T ss_pred EecCCcceEEEEEEEeecCccHHHHHHHHhc Confidence 999999999999998765 59999999999 No 25 >protein:vir:101804 Length: 663 # NCBI annotation: gp18 # Family: family:all:661 # MgeID: mge:1580 # MgeName: 31 # Cross-refs: genbank:acc:YP_238881;genbank:gi:66391956;genbank:GeneID:3416631 Probab=100.00 E-value=8.3e-85 Score=481.55 Aligned_cols=380 Identities=14% Similarity=0.113 Sum_probs=301.3 Q ss_pred CCCCCCCEEEEECCCCCcccccccccceeeeecccccccccccccccEEEecchhhhhhhcc---cchhHHHHHhhhccc Q lcl|NC_019932. 1 MSDYHHGVRVVEINDGTRTISTVSTAIVGMVCTADDADAAAFPLNEPVLLTNVLSAIGKAGK---KGTLAAALQAIADQA 77 (389) Q Consensus 1 M~~~~~GV~v~~v~~~~~~~~~v~t~v~~~~g~a~~~~~~~~~~~~~vli~~~~~~~~~~~~---~gtl~~~v~~~~~~~ 77 (389) |+-..|||||+|+ ++++++..++|++.+|+|.+. ..|.++|++++++.+|...||. ...+.+.+..+|.++ T Consensus 1 ~~~~~Pgvyv~e~-~~~~~i~~~~t~~~~~vG~~~-----~Gp~~~p~~v~~~~~~~~~fg~~~~~~~~~~~~~~~f~ng 74 (663) T protein:vir:10 1 MALLSPGIEMKET-SINSTVVRSATGRAAIVGKFA-----WGPAYEVRQVTNEVELVDMFGSPDNVTAPYFMSAMNFLQY 74 (663) T ss_pred CceecCceEEEEe-cCCccccccCcccceeEeecc-----cCCCCccEEecCHHHHHHhcCCcCCcchhHHHHHHHHHhC Confidence 8866799999999 699999999999999999985 4578999999999999999984 346778889999999 Q ss_pred CceEEEEEeccccccccc-----------------------------cc-------------c----------------- Q lcl|NC_019932. 78 KPVTVVVRVAEGATPAET-----------------------------TS-------------N----------------- 98 (389) Q Consensus 78 ~~~~~v~~~~~~~~~~~t-----------------------------~~-------------~----------------- 98 (389) +..++++|...+...... .. 
+ T Consensus 75 g~~~~vvRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~g~~i~v~~~~~~~~~~~~~~~~~~~~~~~~v~~~ta~~~~~~ 154 (663) T protein:vir:10 75 GNDLRLVRVIDMEKAKNASPLVNQVSVTITTEGQGYTVGDAITVKYNNATITEAGKVTAVDSDGKIKSLFVPTAEIIAKT 154 (663) T ss_pred CCeEEEEEccCCccccccccccccceeEEeecccccccccccccccccccccccccceeeecccceEEEeeccccccccc Confidence 999988886422110000 00 0 Q ss_pred -------------------cccc----------c-ccc--------cc-----------------------chh------ Q lcl|NC_019932. 99 -------------------IIGT----------T-DEN--------GR-----------------------YTG------ 111 (389) Q Consensus 99 -------------------~~~~----------~-d~~--------~~-----------------------~tG------ 111 (389) .... . +.. .. ..| T Consensus 155 ~~v~~~~~~~~~~~~~~s~~s~~~~~a~~v~~v~~d~~~~v~~~~~a~~~~t~~~~~~~~~~~~~~~i~A~~~G~~Gn~i 234 (663) T protein:vir:10 155 RQLGTYPTLGDNWRIDVSGASGGSAAALALGNIVVDSGVTFGNSEDAPAVMTSPAVMEKYAKFGMPLISAVYPGEIGSTV 234 (663) T ss_pred cccccceeeccceeeEeeeccCccccccccceeccccceEEeeccccccccccccccccccccccceEEeccCCccccee Confidence 0000 0 000 00 000 Q ss_pred ----------------------------------------------------------h--------------------- Q lcl|NC_019932. 112 ----------------------------------------------------------M--------------------- 112 (389) Q Consensus 112 ----------------------------------------------------------l--------------------- 112 (389) + T Consensus 235 ~V~i~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~~~~~~~~~~~~~~ 314 (663) T protein:vir:10 235 EVEIVSKTAFNSGAQQTIYPFGGTRTSNARGVIQYGPMTDDQFAIIVRRDGIVVESTVLSTRKGDRDVYGSNIFMDDYFR 314 (663) T ss_pred eeeeccccccccccccceecccccccccccceeecccccccceeeEeecCCcceeeecccccccccccccchhhhhhhhc Confidence 0 Q ss_pred ----------------------------------------HHHHHHhhhhhhhhhhhcccc--c----cchHHHHHHHHh Q lcl|NC_019932. 113 ----------------------------------------KALLSAQTQLGVKPRILGVPG--L----DALEVSTALASI 146 (389) Q Consensus 113 ----------------------------------------~a~~~~~~~~~~~p~~~~apg--~----~~~~v~~al~~~ 146 (389) .++........+.+.++++|. . 
..++++.+|.++ T Consensus 315 ~~~~~~~~~~~~~~~~~~~~~~~l~gg~d~~~~~~~~~~~~~~~~l~~~~~~~~~~~i~~~~~~~~~~~~~~v~~~l~~~ 394 (663) T protein:vir:10 315 NGGSNFIFASSEGWPAGFTGIIQLGGGTSANADVGADELIKGWDLFSDREALHVNLMIAGACGSDGAEIASTVQKYVVSL 394 (663) T ss_pred CCcceEEEEeecccCccccceeEeccccCCccccchhhhHHHHHhhhcccccceeEEEeccCCCCchhhHHHHHHHHHHH Confidence 000000000000111111111 1 124588899999 Q ss_pred hhhcC-ceeeeccCCC--------ccHHHHHHhhh-------------cccCceeEEeeeeEEEEeecCCCceEEehhHH Q lcl|NC_019932. 147 AQQLR-AFAYVSAWGC--------KTLSEAMAYRE-------------NFSQRELMVIWPDFISWNTTANQSETAYATAR 204 (389) Q Consensus 147 ~~~~~-~~~i~d~~~~--------~t~~~a~~~~~-------------~~~s~~~~~~~p~~~~~~~~~~~~~~~p~s~~ 204 (389) |++++ +++++|.|.+ .+.+++.+|++ +++|.+.++||||++++|+.++..+++|||++ T Consensus 395 a~~~~~~~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~sg~ 474 (663) T protein:vir:10 395 ADDRQDCVAIVNPPAELMVGIPTSTAVKNIVEWRNGMTGSGEVVDNNMNISSTYAFIIGNYKYQYDKYNDINRWVPLAAD 474 (663) T ss_pred HHhhCCEEEEEecCcccccccccccchHHHHHHHHhccccccchhhhcccCccceEEEcCceEEecccCCceEEechhHH Confidence 99985 7899999864 24455666654 47899999999999999999999999999999 Q ss_pred HHHHHHhhhccccceeccCCceeccceeccccccccccCCcchhhhhcccceEEEE--cC-CCEEEEcCccCCCCc-ccc Q lcl|NC_019932. 205 ALGLRAKIDTDTGWHKTLSNVGVNGVTGISASVFWDLQQTGTDADLLNEACVTTLI--RK-DGFRFWGNRTCSDDP-LFA 280 (389) Q Consensus 205 ~Ag~~a~~d~~~g~~~span~~l~gv~~~~~~~~~~~~~~~~~~~~l~~~~i~~~~--~~-~G~~~wG~rT~~~d~-~~~ 280 (389) +||++||+|.++|||+||+|+.+.++.+.. ..+..+++.|++.||++|||++. 
++ +||++||+||+++++ +|+ T Consensus 475 vAGl~Ar~D~~~g~~~sPan~~~~~i~g~~---~~~~~~~~~e~~~Ln~~gin~i~~~~~~~G~~~wG~rT~~~~~s~~~ 551 (663) T protein:vir:10 475 IAGLCAYTDQVSHPWMSPAGYRRGQIRNCI---KLAIEPKQSMRDTMYQVAINPVTGFAGGDGFVLFGDKMATQVPSPFD 551 (663) T ss_pred HHHHHHHhhccCCceEccCCceeccccccc---cceeecChhHHHHHhhCCceEEEEEeCCCcEEEEcccccCCCCcccc Confidence 999999999999999999999887676653 34566678899999999999884 44 799999999998775 899 Q ss_pred eeehhhHHHHHHHHHHHHHHHHhhcCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEecCCCCHHHhhCCEEEEEEEE Q lcl|NC_019932. 281 FENYTRTAQVIADTMAEAHMWANDKPLTPVLVRDIIAGINAKFRELVSAGYLLGASCWYDDTANDKDTLKAGKLFIDYDY 360 (389) Q Consensus 281 ~i~vrR~~~~i~~~~~~~~~~~v~e~n~~~~~~~i~~~i~~~l~~l~~~gal~g~~v~~d~~~n~~~~i~~G~~~~~i~~ 360 (389) ||+|||||+||+++|++.++|+|||||++.+|++|+++|+.||++||++|+|.||+|+||+++||+++|++|+|+++|++ T Consensus 552 ~i~vrR~~~~i~~si~~~~~~~v~epn~~~l~~~i~~~i~~~L~~l~~~gal~g~~v~~d~~~nt~~~i~~G~~~~~i~~ 631 (663) T protein:vir:10 552 RINVRRLFNMLKKNIGDTSKYELFENNDAFTRQSFRMETSQYLDGIRSLGGCYDFRVVCDTTNNTPNVIDRNEFVGTIYV 631 (663) T ss_pred eEehhhHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEcCCCCCHHHhhCCeEEEEEEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EecccceEEEEEEEEcch--HHHHHHHHhcC Q lcl|NC_019932. 361 TPVPPLEDLTLRQRITDS--YLANFAASVNS 389 (389) Q Consensus 361 ~p~~p~e~i~~~~~~~~~--~~~~~~~~~~~ 389 (389) +|++|+|||+|++++.+. 
.|+|+++++++ T Consensus 632 ~p~~pae~i~~~~~~~~~~~~~~e~~~~~~~ 662 (663) T protein:vir:10 632 KPPRSINYITLNMVATSTGANFDELIGPMQL 662 (663) T ss_pred EecCCcceEEEEEEEeecCccHHHHHHHHhc Confidence 999999999999998664 59999999999 No 26 >protein:vir:104858 Length: 729 # NCBI annotation: T4-like tail sheath protein # Family: family:all:661 # MgeID: mge:1630 # MgeName: P-SSM2 # Cross-refs: genbank:acc:YP_214361;genbank:gi:61806001;genbank:GeneID:3294378 Probab=100.00 E-value=1.8e-84 Score=479.67 Aligned_cols=376 Identities=14% Similarity=0.130 Sum_probs=298.4 Q ss_pred CC-CCC-CCEEEEECCCCCcccccccccceeeeecccccccccccccccEEEecchhhhhhhcc----c-chhHHHHHhh Q lcl|NC_019932. 1 MS-DYH-HGVRVVEINDGTRTISTVSTAIVGMVCTADDADAAAFPLNEPVLLTNVLSAIGKAGK----K-GTLAAALQAI 73 (389) Q Consensus 1 M~-~~~-~GV~v~~v~~~~~~~~~v~t~v~~~~g~a~~~~~~~~~~~~~vli~~~~~~~~~~~~----~-gtl~~~v~~~ 73 (389) |+ +|+ |||||+|+..++++++.++|++.+|+|++. ..|.++|++++++.+|...||. + ..+.+.+..+ T Consensus 1 m~~~~~~PgVyv~e~~~~~~~i~~v~ts~~~fvG~~~-----~Gp~~~p~~i~s~~~~~~~fG~~~~~~~~~~~~~~~~~ 75 (729) T protein:vir:10 1 MPLNLASPGIVVREVDLTIGRVDPTSGSIGALVAPFA-----KGPVNDPQLIESEEDLLQTFGQPYSTDKHYEYWMVASS 75 (729) T ss_pred CCccccCCceEEEEecCCCcccccccccceeEEeccc-----cCCCccCeEcCCHHHHHHHcCccccCCcchhHHHHHHH Confidence 99 786 899999999999999999999999999985 4578999999999999999995 2 3356788999 Q ss_pred hcccCceEEEEEeccccccccc---------------------------------------------------------- Q lcl|NC_019932. 
74 ADQAKPVTVVVRVAEGATPAET---------------------------------------------------------- 95 (389) Q Consensus 74 ~~~~~~~~~v~~~~~~~~~~~t---------------------------------------------------------- 95 (389) |.+++..|+++|.........+ T Consensus 76 f~ngg~~~~vvRv~~~~~~~a~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~G~~gn~~~v~v~ 155 (729) T protein:vir:10 76 YLAYGGTMQVVRADDYNTQTGVGLKNAFVTGGVGVGATMLRITSNTHYNQLGYDENTISGVSVAAKNPGTWANGIKVAII 155 (729) T ss_pred HHhCCceEEEEecCcccccccccccccccccccccccccccccccccccccccccCCCcceEEEEeccCccccceeeEEe Confidence 9999999999986431100000 Q ss_pred -------------------------------c-----------cccccccccc-------------------c---c--- Q lcl|NC_019932. 96 -------------------------------T-----------SNIIGTTDEN-------------------G---R--- 108 (389) Q Consensus 96 -------------------------------~-----------~~~~~~~d~~-------------------~---~--- 108 (389) . .......+.. . . T Consensus 156 ~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~s~~~~~~~~~~~~~~~~~~~ 235 (729) T protein:vir:10 156 DGKADQILTVASGNTTAVGSAVTQSISKTIGTATGTTTIDGVLKGIVTGSTDTTLEVKVISHISAAGVETAVEYQQNGTY 235 (729) T ss_pred cccCcceeeeeccccccceeeeeeeccccccccccceeeeeeecccccccccccccceecccccccccceecccccccee Confidence 0 0000000000 0 0 Q ss_pred -------------------------------------------------------------------------------- Q lcl|NC_019932. 109 -------------------------------------------------------------------------------- 108 (389) Q Consensus 109 -------------------------------------------------------------------------------- 108 (389) T Consensus 236 ~~~~~~s~~~~a~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~~d~~~~~~~d~~~~~~ 315 (729) T protein:vir:10 236 TFDNSGSVNVIAAGSSGSGSAKSYTAQTDWFESQNIVLSNSTLEWDSIADAPGTSTYVSTRGGKNDEIHVLVIDDKGTIT 315 (729) T ss_pred eecccCccceeeeccccccccccceeeeccccccccccccccccccccccccccccccccccccccccceeeeccccccc Confidence Q ss_pred -------------------------------------------------------------------------------- Q lcl|NC_019932. 
109 -------------------------------------------------------------------------------- 108 (389) Q Consensus 109 -------------------------------------------------------------------------------- 108 (389) T Consensus 316 ~~~g~vve~~~~~s~~~~~~~~~~~~~~~~~vi~~~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~ 395 (729) T protein:vir:10 316 GNSGTILEKHLSLSKAKDAEYSVGSSSYWRDFLATNSKYIFGGGATSGITTTGYSVSSTNTLDTDSGWDQNAEGVNFGAS 395 (729) T ss_pred cCcccceeeeeeeeeccccccccccccccceeeccccceeeecccccccccccccccccceecccccccccccccccccc Confidence Q ss_pred --------------------------------chhhHHHHHHhhhhhhhhhhhc--c---ccccchHHHHHHHHhhhhcC Q lcl|NC_019932. 109 --------------------------------YTGMKALLSAQTQLGVKPRILG--V---PGLDALEVSTALASIAQQLR 151 (389) Q Consensus 109 --------------------------------~tGl~a~~~~~~~~~~~p~~~~--a---pg~~~~~v~~al~~~~~~~~ 151 (389) .+|++++++. ..+....++ + |+.....++.+|.++|++++ T Consensus 396 ~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~g~~~l~~~---~~~~~~~~~~~~~~~~~~~~~~v~~a~~~~~~~~~ 472 (729) T protein:vir:10 396 GVATLTLAGGTNYGDKTDLTTSGALSSGVDDIISGYTLFENT---EEIEVDFILMGAAHHPKEQSQAVAEKVTAVAEARK 472 (729) T ss_pred ceeEEEeecccccccccccccccccccchhHHHHHHHHhhcc---cccccceeeecCCCCCccchHHHHHHHHHHHHhcC Confidence 0000000000 000000000 0 11122346778889898874 Q ss_pred -ceeeeccCCC-----------------ccHHHHHHhhhcc-cCceeEEeeeeEEEEeecCCCceEEehhHHHHHHHHhh Q lcl|NC_019932. 152 -AFAYVSAWGC-----------------KTLSEAMAYRENF-SQRELMVIWPDFISWNTTANQSETAYATARALGLRAKI 212 (389) Q Consensus 152 -~~~i~d~~~~-----------------~t~~~a~~~~~~~-~s~~~~~~~p~~~~~~~~~~~~~~~p~s~~~Ag~~a~~ 212 (389) +++++|.|.. 
.+.+++..++..+ ++.+.++|+||++++|+.++..+++|||+++||++||+ T Consensus 473 ~~~a~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~d~~~~~~~~~p~s~~~aGl~a~~ 552 (729) T protein:vir:10 473 DAVAFISPYRQAFLNDTVSGTVTVSNIDQTTENVVGFYAPLSSSTYSVFDSGYKYMFDRFNNTFRYVPLNGDIAGTCART 552 (729) T ss_pred CeEEEecccccccccccccccccccccchhhHHHHHHHhhccCCceEEEEcCeeEEecccCCceEEechhHHHHHHHHHh Confidence 6788887632 2334556666665 46788999999999999999999999999999999999 Q ss_pred hccccceeccCCceeccceeccccccccccCCcchhhhhcccceEEEEc--CCCEEEEcCccC-CCCcccceeehhhHHH Q lcl|NC_019932. 213 DTDTGWHKTLSNVGVNGVTGISASVFWDLQQTGTDADLLNEACVTTLIR--KDGFRFWGNRTC-SDDPLFAFENYTRTAQ 289 (389) Q Consensus 213 d~~~g~~~span~~l~gv~~~~~~~~~~~~~~~~~~~~l~~~~i~~~~~--~~G~~~wG~rT~-~~d~~~~~i~vrR~~~ 289 (389) |.++|||+||+|+++.||.++. .....+++.|+++||++||+++++ ++|+++||+||+ +.|++|+||++|||++ T Consensus 553 d~~~g~~~span~~~~~i~g~~---~~~~~~~~~~~~~Ln~~gIn~i~~~~~~G~~~wG~rT~~~~d~~~~~i~vrR~~~ 629 (729) T protein:vir:10 553 DIEQFPWFSPAGTARGPILNSV---KLVYNPGKKQRDILYSNRINPVILSPGAGIILFGDKTGFGKSSAFDRINVRRLFI 629 (729) T ss_pred hccCCcEEccCCccccceeccc---ceeeecChhhHhhhhhCCceEEEEecCCeEEEEcceecCCCCcccceeehhhhHH Confidence 9999999999999988888864 345567788999999999999965 689999999997 6799999999999999 Q ss_pred HHHHHHHHHHHHHhhcCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEecCCCCHHHhhCCEEEEEEEEEecccceEE Q lcl|NC_019932. 
290 VIADTMAEAHMWANDKPLTPVLVRDIIAGINAKFRELVSAGYLLGASCWYDDTANDKDTLKAGKLFIDYDYTPVPPLEDL 369 (389) Q Consensus 290 ~i~~~~~~~~~~~v~e~n~~~~~~~i~~~i~~~l~~l~~~gal~g~~v~~d~~~n~~~~i~~G~~~~~i~~~p~~p~e~i 369 (389) ||+++|++.++|+|||||++.+|++|+++|+.||++||++|+|.||+|+||+++||+++|++|+|+++|+++|++|+||| T Consensus 630 ~i~~si~~~~~~~v~epn~~~~~~~i~~~i~~~L~~l~~~g~l~g~~v~~d~~~nt~~~i~~G~~~~~v~~~p~~p~e~i 709 (729) T protein:vir:10 630 YLEDAISAAAKDQLFEFNDELTRTNFVNIVEPFLRDVQAKRGIFDFVVICDETNNTAAVIDSNEFVADIFIKPARSINFI 709 (729) T ss_pred HHHHHHHHHHHHhhcCCCCHHHHHHHHHHHHHHHHHHHhccceeeeEEEEcCCCCCHHHhhCCeEEEEEEEEecCCccEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEEEcch--HHHHHHHHh Q lcl|NC_019932. 370 TLRQRITDS--YLANFAASV 387 (389) Q Consensus 370 ~~~~~~~~~--~~~~~~~~~ 387 (389) +|++++++. +|++++++| T Consensus 710 ~~~~~~~~~~~~~~e~~~~~ 729 (729) T protein:vir:10 710 GLTFVATRTGVAFEEVIGSV 729 (729) T ss_pred EEEEEEeecCccHHHHHhcC Confidence 999988776 689999999 No 27 >protein:vir:5663 Length: 671 # NCBI annotation: tail sheath protein # Family: family:all:661 # MgeID: mge:119 # MgeName: KVP40 # Cross-refs: genbank:acc:NP_899602;genbank:gi:34419589;genbank:GeneID:2545776 Probab=100.00 E-value=2.6e-83 Score=473.37 Aligned_cols=378 Identities=15% Similarity=0.134 Sum_probs=292.5 Q ss_pred CCCCCCCEEEEECCCCCcccccccccceeeeecccccccccccccccEEEecchhhhhhhcc---cchhHHHHHhhhccc Q lcl|NC_019932. 1 MSDYHHGVRVVEINDGTRTISTVSTAIVGMVCTADDADAAAFPLNEPVLLTNVLSAIGKAGK---KGTLAAALQAIADQA 77 (389) Q Consensus 1 M~~~~~GV~v~~v~~~~~~~~~v~t~v~~~~g~a~~~~~~~~~~~~~vli~~~~~~~~~~~~---~gtl~~~v~~~~~~~ 77 (389) |+--.|||||+|+ ++++++..++|++.+|+|.++. .|.++|++++++.+|...||. 
...+.+.+..+|.++ T Consensus 1 ~~~~~Pgvyv~e~-~~~~~i~~v~t~~~~fvG~~~~-----Gp~~~p~~v~s~~~~~~~fG~~~~~~~~~~~v~~~f~ng 74 (671) T protein:vir:56 1 MTLLSPGIENKEI-NLASAIGRAATGRAAMVGKFEW-----GPAYSITQVTSESDLVTIFGRPNDYTAASFMTANNFLKY 74 (671) T ss_pred CceecCceEEEee-cCcccccccCcccceEEecccC-----CCCccCEEcCCHHHHHHHcCCcCCCcchhHHHHHHHHhc Confidence 8866799999999 6999999999999999999854 578999999999999999995 456888899999999 Q ss_pred CceEEEEEecccccccccc---------------------------------c-------c------------------- Q lcl|NC_019932. 78 KPVTVVVRVAEGATPAETT---------------------------------S-------N------------------- 98 (389) Q Consensus 78 ~~~~~v~~~~~~~~~~~t~---------------------------------~-------~------------------- 98 (389) +..++++|........... . + T Consensus 75 g~~~~vvrv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~v~ 154 (671) T protein:vir:56 75 GNDLRLVRICDATTAQNATPLYNAVEYTIGASNGCVVGDDITITYSGVGALTAKGKVLEVDAGNNNAASKIFLPSAEIVA 154 (671) T ss_pred CCeEEEEEecCccccccchhhccccccccccCcceeeceeeeeecCcccccccCcceeEEeeeccceeeeeeccceeEEE Confidence 9999988874432100000 0 0 Q ss_pred ---------cccc--------------cc--cc----------------------------------------------- Q lcl|NC_019932. 99 ---------IIGT--------------TD--EN----------------------------------------------- 106 (389) Q Consensus 99 ---------~~~~--------------~d--~~----------------------------------------------- 106 (389) .... .+ .. T Consensus 155 ~~~~~~~~~~~~~~t~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~g~~ 234 (671) T protein:vir:56 155 AAKSDGNYPSVGTITLQPTQGDIALTNIEIIDTGSVYFPNIELAFDALTAIETEGGALKYADLIEKQGFPRLSARYVGDF 234 (671) T ss_pred eeeccccccccccccccccccceeeeeecccccceEEEeccccccccccccccccccccchhhhhccccccccccccccc Confidence 0000 00 00 Q ss_pred -------------------------------------------------------------------------------- Q lcl|NC_019932. 
107 -------------------------------------------------------------------------------- 106 (389) Q Consensus 107 -------------------------------------------------------------------------------- 106 (389) T Consensus 235 g~~~~v~v~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~~~~~~~~~~~~ 314 (671) T protein:vir:56 235 GDAISVEIINYADYQTAFAFAAGHTLGDIELPIYPDGGTRSINLSSYFTFGPSNSNQYAVIVRVSGEVEEAFIVSTNPGD 314 (671) T ss_pred CcceEEEEecccccccccccccceeeeeccccccccccccccccceeecccccccccceeEEeecCccceeEEEeecccc Confidence Q ss_pred ----------------------------------------------ccchhhHHHHHHhhhhhhhhhhhccccccchH-- Q lcl|NC_019932. 107 ----------------------------------------------GRYTGMKALLSAQTQLGVKPRILGVPGLDALE-- 138 (389) Q Consensus 107 ----------------------------------------------~~~tGl~a~~~~~~~~~~~p~~~~apg~~~~~-- 138 (389) +..++..+++.......+.|.++.+|++++.+ T Consensus 315 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~d~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~ 394 (671) T protein:vir:56 315 KDVNGQSIFIDEYFENSGSAYITAIAEGWKTESGAYNFGGGSDANAGADDWMFGLDMLSDPEVLYTNLVIAGNAAAEEVS 394 (671) T ss_pred cccchhhhhhhhhhcccCceEEEecCcccCCccccccccCccccccchhHHHHHHHhhhhccccceeEEEcCCCCCccch Confidence 00000000000011111223333344333221 Q ss_pred -----HHHHHHHhhhh-cCceeeeccCC--------CccHHHHHHhhh--------------cccCceeEEeeeeEEEEe Q lcl|NC_019932. 139 -----VSTALASIAQQ-LRAFAYVSAWG--------CKTLSEAMAYRE--------------NFSQRELMVIWPDFISWN 190 (389) Q Consensus 139 -----v~~al~~~~~~-~~~~~i~d~~~--------~~t~~~a~~~~~--------------~~~s~~~~~~~p~~~~~~ 190 (389) ...++.++++. .++++++|.|. ..+.+++.+++. 
.++|.+.++||||++++| T Consensus 395 ~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d 474 (671) T protein:vir:56 395 IASTVQKYAIDSVGNVRQDCVVFVSPPQSLIVNKQAGTAVANIQGWRTGIDPTNGQAVVDNLNVSTTYAVIDGNYKYQYD 474 (671) T ss_pred hHHHHHHHHHHHHHhhcCCEEEEEeccccccccccccccHHHHHHHhhhccccchhhhhhhccCCcceEEEecCceEEec Confidence 12335555544 46788888764 346666766664 467899999999999999 Q ss_pred ecCCCceEEehhHHHHHHHHhhhccccceeccCCceeccceeccccccccccCCcchhhhhcccceEEEE--cCCCEEEE Q lcl|NC_019932. 191 TTANQSETAYATARALGLRAKIDTDTGWHKTLSNVGVNGVTGISASVFWDLQQTGTDADLLNEACVTTLI--RKDGFRFW 268 (389) Q Consensus 191 ~~~~~~~~~p~s~~~Ag~~a~~d~~~g~~~span~~l~gv~~~~~~~~~~~~~~~~~~~~l~~~~i~~~~--~~~G~~~w 268 (389) +.++..+++|||+++||++||+|.++|||+||+|+.+.++.+... + ...+++.|++.||++|||+++ +++|+++| T Consensus 475 ~~~~~~~~~p~s~~~AGl~Ar~D~~~g~~~span~~~~~i~g~~~-~--~~~~~~~~~~~Ln~~gIn~i~~~~~~G~~~w 551 (671) T protein:vir:56 475 KYNDRNRWVPLAGDIAGLCAYTDQVSQPWMSPAGFNRGQIKGVNR-L--AVDLRRAHRDALYQIGINPVVGFAGQGFVLY 551 (671) T ss_pred ccCCceeEechHHHHHHHHHHhhccCCcEECcCCceecccccccc-c--eeecChhHHHHHhhCCceEEEEecCCeEEEE Confidence 999999999999999999999999999999999998877766532 2 334567789999999999996 46899999 Q ss_pred cCccCCCC-cccceeehhhHHHHHHHHHHHHHHHHhhcCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEecCCCCHH Q lcl|NC_019932. 
269 GNRTCSDD-PLFAFENYTRTAQVIADTMAEAHMWANDKPLTPVLVRDIIAGINAKFRELVSAGYLLGASCWYDDTANDKD 347 (389) Q Consensus 269 G~rT~~~d-~~~~~i~vrR~~~~i~~~~~~~~~~~v~e~n~~~~~~~i~~~i~~~l~~l~~~gal~g~~v~~d~~~n~~~ 347 (389) |+||++.+ ++|+||++||||+||+++|++.++|++||||++.+|++|+++|+.||++||++|+|+||+|+||+++||++ T Consensus 552 G~rT~~~~~~~~~~i~vrR~~~~i~~si~~~~~~~v~epn~~~~~~~i~~~i~~fL~~l~~~gal~g~~v~~d~~~nt~~ 631 (671) T protein:vir:56 552 GDKTATQQASAFDRINVRRLFNLLKKAISDAAKYRLFELNDEFTRSSFKSEIDAYLTNIQDLGGVYDFRVVCDETNNPGS 631 (671) T ss_pred cceecCCCCcccceEehhhHHHHHHHHHHHHHHHhcCCCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEcCCCCCHH Confidence 99999876 58999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HhhCCEEEEEEEEEecccceEEEEEEEEcchHHHHHHHHhcC Q lcl|NC_019932. 348 TLKAGKLFIDYDYTPVPPLEDLTLRQRITDSYLANFAASVNS 389 (389) Q Consensus 348 ~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~~~~~~~~ 389 (389) +|++|+|+++|+++|++|+|||+|++++.....+ |++|.- T Consensus 632 ~i~~G~~~~~i~~~p~~Pae~I~~~~~~~~~~~~--f~e~~~ 671 (671) T protein:vir:56 632 VIDRNEFVASIYVKPAKSINFITLNFVATSTDAD--FAEIIG 671 (671) T ss_pred HhhCCeEEEEEEEEecCCcceEEEEEEEeecCcc--hhhhcC Confidence 9999999999999999999999999999777633 444444 No 28 >protein:vir:100539 Length: 663 # NCBI annotation: gp18 tail sheath monomer # Family: family:all:661 # MgeID: mge:1488 # MgeName: 25 # Cross-refs: genbank:acc:YP_656380;genbank:gi:109290131;genbank:GeneID:4156517 Probab=100.00 E-value=1.3e-83 Score=474.99 Aligned_cols=380 Identities=13% Similarity=0.091 Sum_probs=299.1 Q ss_pred CCCCCCCEEEEECCCCCcccccccccceeeeecccccccccccccccEEEecchhhhhhhcc---cchhHHHHHhhhccc Q lcl|NC_019932. 1 MSDYHHGVRVVEINDGTRTISTVSTAIVGMVCTADDADAAAFPLNEPVLLTNVLSAIGKAGK---KGTLAAALQAIADQA 77 (389) Q Consensus 1 M~~~~~GV~v~~v~~~~~~~~~v~t~v~~~~g~a~~~~~~~~~~~~~vli~~~~~~~~~~~~---~gtl~~~v~~~~~~~ 77 (389) |+-..|||||+|+ ++++++..++|++.+|+|.+. ..|.++|++++++.++...||. 
...+.+.+..+|.++ T Consensus 1 ~~~~~Pgvyv~e~-~~~~~~~~v~t~~~~fvG~~~-----~gp~~~p~~i~s~~~~~~~fG~~~~~~~~~~~v~~~f~ng 74 (663) T protein:vir:10 1 MALLSPGIEMKET-SINSTVVRSATGRAALVGKFA-----WGPAYEIRQVTNEVELVDMFGSPDNVTAPYFMSAMNFLQY 74 (663) T ss_pred CccccCceEEEEe-cCcccccccccccceeeeccc-----cCCCCcCEEecCHHHHHHHcCCcccccchHHHHHHHHHhC Confidence 8866799999999 688899999999999999985 4478999999999999999984 456788899999999 Q ss_pred CceEEEEEecccccccccc-------------------------------------------c----------------- Q lcl|NC_019932. 78 KPVTVVVRVAEGATPAETT-------------------------------------------S----------------- 97 (389) Q Consensus 78 ~~~~~v~~~~~~~~~~~t~-------------------------------------------~----------------- 97 (389) +..|+++|+.......... . T Consensus 75 g~~~~vvRv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~v~~~~~~~~~~~~~~~~~~~g~~~~~~~~~a~~~~~a 154 (663) T protein:vir:10 75 GNDLRLVRVIDMEQAKNASPLFNQIEVTITTEGQGYTVGDTVSIKHNTTTVTEEGKVTKVDADGKIKALFVPSSAVIAKA 154 (663) T ss_pred CCeEEEEecCCcccccccccccccceeeEeecccCccccceeeecccccccccCcceeeeccCCceeEEEeccccccccc Confidence 9999998875421100000 0 Q ss_pred -----------c----------------ccccc----------------cccc-------c-------------chh--- Q lcl|NC_019932. 98 -----------N----------------IIGTT----------------DENG-------R-------------YTG--- 111 (389) Q Consensus 98 -----------~----------------~~~~~----------------d~~~-------~-------------~tG--- 111 (389) . ...+. +... . ..| T Consensus 155 ~~~~~~~~~~~a~~~~v~~~~~~~~~a~av~~i~~dg~vt~~~~~~a~~~~~~~~~~~~~~~~~~~~~~a~~~g~~G~~i 234 (663) T protein:vir:10 155 KQLGTYPVLGDNWRAEVSGASGGSAATLTLGGIVVDSGVTFGNSEEAPDVMTSTKVLANFAKYGMPLISAVYPGEIGSTV 234 (663) T ss_pred cccccccccccceeeEEeeccccccccceeEeeecCCceeEEeeeccccccccceeeeeccccccceeeeecccccCcce Confidence 0 00000 0000 0 000 Q ss_pred -hHH----------------------------------------------------------------------HHHHh- Q lcl|NC_019932. 
112 -MKA----------------------------------------------------------------------LLSAQ- 119 (389) Q Consensus 112 -l~a----------------------------------------------------------------------~~~~~- 119 (389) +.. +.+.. T Consensus 235 ~v~~~~~~~~~~~~~~~v~~~~g~~~~~~~~~~~~g~~~~~~~~~~~~~~g~~~e~~~ls~~~~~~~~~~~~~~~~~~~~ 314 (663) T protein:vir:10 235 EVEVISKTAFQSGAAQPIYPFGGTRASNARSVIQYGPMTDDQFAIIVRRDGIVVESTVLSTRRGDRDVYGNNIFMDDYFR 314 (663) T ss_pred eEeecccccccccceeeecccCcccccccccccccccccchhhcccccCCCcccceeeeeccccccccchhhhhhhhhhc Confidence 000 00000 Q ss_pred h-h----------------------------------------hhh-------hhhhh----cccccc-chHHHHHHHHh Q lcl|NC_019932. 120 T-Q----------------------------------------LGV-------KPRIL----GVPGLD-ALEVSTALASI 146 (389) Q Consensus 120 ~-~----------------------------------------~~~-------~p~~~----~apg~~-~~~v~~al~~~ 146 (389) . . ..+ ...++ .+++++ .++|+++|.++ T Consensus 315 ~~~s~~v~~~~~~~~~~~~~~~~l~gg~~~~~~~~~~d~~~~~~~l~~~~~~~~~~~i~~~~~~~~~~~~~~v~~~l~~~ 394 (663) T protein:vir:10 315 NGSSNFIYASSVNWPAGFTGIIQLGGGASANNAVGSDELIAGWDLFADREALHVNLMIAGACKSDGVAVASTVQKHVVAL 394 (663) T ss_pred CcccceeEeeccccCcccceeEEecccccCcccchhhhhhhHHhhhccccccCceEEEeecCCCCchhhHHHHHHHHHHH Confidence 0 0 000 00000 011111 13578999999 Q ss_pred hhhcC-ceeeeccCCCcc--------HHHHHHh-------------hhcccCceeEEeeeeEEEEeecCCCceEEehhHH Q lcl|NC_019932. 147 AQQLR-AFAYVSAWGCKT--------LSEAMAY-------------RENFSQRELMVIWPDFISWNTTANQSETAYATAR 204 (389) Q Consensus 147 ~~~~~-~~~i~d~~~~~t--------~~~a~~~-------------~~~~~s~~~~~~~p~~~~~~~~~~~~~~~p~s~~ 204 (389) |++++ +++++|+|.+.. 
.+++..| +.+++|.+.++||||++++|+.++..+++|||++ T Consensus 395 ~~~~~~~~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~l~~p~~~~~d~~~~~~~~~p~s~~ 474 (663) T protein:vir:10 395 ADDRQDCVAFVNPPSELLVGVPTTQAVKNIVEWRNGVTTGGEVVDNNMNISSTYAFISGNYKYQYDKYNDINRWVPLSAD 474 (663) T ss_pred HHhhCCEEEEEecCcccccccchhhhHHHHHHHhhhccccchhhhhhcccCcceEEEEecceeEecccCCceEEechHHH Confidence 99985 899999987632 2334444 4568999999999999999999999999999999 Q ss_pred HHHHHHhhhccccceeccCCceeccceeccccccccccCCcchhhhhcccceEEE--EcC-CCEEEEcCccCCCCc-ccc Q lcl|NC_019932. 205 ALGLRAKIDTDTGWHKTLSNVGVNGVTGISASVFWDLQQTGTDADLLNEACVTTL--IRK-DGFRFWGNRTCSDDP-LFA 280 (389) Q Consensus 205 ~Ag~~a~~d~~~g~~~span~~l~gv~~~~~~~~~~~~~~~~~~~~l~~~~i~~~--~~~-~G~~~wG~rT~~~d~-~~~ 280 (389) +||++||+|.++|||+||+|+.+.++.++. .....+.+.|++.||++|||++ +++ +||++||+||++.++ +|+ T Consensus 475 vAGl~Ar~D~~~g~~~span~~~~~i~g~~---~~~~~~~~~~~~~Ln~~gin~i~~~~~~~G~~~wG~rT~s~~~s~~~ 551 (663) T protein:vir:10 475 IAGLCAYTDQVGHPWMSPAGYRRGQLRNTI---KLAIEPKQSLRDTMYQVSINPVTGFAGGDGFVLFGDKMATQVPSPFD 551 (663) T ss_pred HHHHHHHhhccCCcEEccCCeeecceeccc---cceeecCchhHHHHHhCCCcEEEEeeCCCcEEEEcccccCCCCcccc Confidence 999999999999999999999988777763 3345567788999999999987 444 799999999998765 899 Q ss_pred eeehhhHHHHHHHHHHHHHHHHhhcCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEecCCCCHHHhhCCEEEEEEEE Q lcl|NC_019932. 
281 FENYTRTAQVIADTMAEAHMWANDKPLTPVLVRDIIAGINAKFRELVSAGYLLGASCWYDDTANDKDTLKAGKLFIDYDY 360 (389) Q Consensus 281 ~i~vrR~~~~i~~~~~~~~~~~v~e~n~~~~~~~i~~~i~~~l~~l~~~gal~g~~v~~d~~~n~~~~i~~G~~~~~i~~ 360 (389) ||++||||+||+++|++.++|++||||++.+|++|+++|+.||++||++|+|+||+|+||+++||+++|++|+|+++|++ T Consensus 552 ~i~vrR~~~~i~~si~~~~~~~v~epn~~~l~~~i~~~i~~~L~~l~~~gal~gf~V~~d~~~nt~~~i~~G~~~~~i~~ 631 (663) T protein:vir:10 552 RINVRRLFNMLKKNIGDTSKYELFENNDAFTRQSFRMEVSQYLDNIRSLGGVYDFRVVCDTTNNTPQVIDSNEFVATIYI 631 (663) T ss_pred eEehhhHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEcCCCCCHHHhhCCeEEEEEEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EecccceEEEEEEEEcchH--HHHHHHHhcC Q lcl|NC_019932. 361 TPVPPLEDLTLRQRITDSY--LANFAASVNS 389 (389) Q Consensus 361 ~p~~p~e~i~~~~~~~~~~--~~~~~~~~~~ 389 (389) +|++|+|||+|++++.+.+ |++++++++- T Consensus 632 ~p~~pae~I~~~~~~~~~~~~f~e~~~~~~~ 662 (663) T protein:vir:10 632 KAPRSINYITLNFVATSTGANFDELIGPAQL 662 (663) T ss_pred EecCCcceEEEEEEEEecCccHHHHHHHHhc Confidence 9999999999999987664 7888888877 No 29 >protein:vir:106984 Length: 743 # NCBI annotation: contractile sheath protein gp18 # Family: family:all:661 # MgeID: mge:1459 # MgeName: S-PM2 # Cross-refs: genbank:acc:YP_195136;genbank:gi:58532913;uniprot:Q5GQN6;genbank:GeneID:3260481 Probab=100.00 E-value=2e-81 Score=463.03 Aligned_cols=379 Identities=12% Similarity=0.096 Sum_probs=263.2 Q ss_pred CC---C---CCCCEEEEECCC-------------------------CCcccccccccceeeeecccccccccccccccE- Q lcl|NC_019932. 1 MS---D---YHHGVRVVEIND-------------------------GTRTISTVSTAIVGMVCTADDADAAAFPLNEPV- 48 (389) Q Consensus 1 M~---~---~~~GV~v~~v~~-------------------------~~~~~~~v~t~v~~~~g~a~~~~~~~~~~~~~v- 48 (389) +. + .+.|.....+.. .........+.+..+..... ....+...+.+. 
T Consensus 299 ~~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~~d~~~v~v~~~~~~~~~~~~~v~~~~~~~s~-~~~~~~~~~~~~~ 377 (743) T protein:vir:10 299 KDWYLNTEIGSTGIKLGDIGPRPGTSQFATDNGITDDQVHFAVIDTTGELTGTANTIVERLTYLSK-LSDARSEENANIY 377 (743) T ss_pred ccccccchhhccccccccccccceeeeccccccccccceEEEEecCcceeeeccCceeEEEeeeec-ccccccccCccee Confidence 11 1 112222221111 00000000000000111000 000000111111 Q ss_pred ---EEecchhhhhhhcccchhHHHHHhhhcccCceEEEEEecccccccccccccccccccc-cc----chhhHHHHHHhh Q lcl|NC_019932. 49 ---LLTNVLSAIGKAGKKGTLAAALQAIADQAKPVTVVVRVAEGATPAETTSNIIGTTDEN-GR----YTGMKALLSAQT 120 (389) Q Consensus 49 ---li~~~~~~~~~~~~~gtl~~~v~~~~~~~~~~~~v~~~~~~~~~~~t~~~~~~~~d~~-~~----~tGl~a~~~~~~ 120 (389) .+.....+....+..+++.......+.......+............+..++.++.|.. .. ..+++++... T Consensus 378 ~~~~~~~~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gG~d~~~~~~~~~~~~~~~~~~~-- 455 (743) T protein:vir:10 378 YKNVINEQSAYLYHGNDAAVQIAASGEAWGQSSDQVLADAGTAFSRTTGYWVNLAGGNDDFAYDAGEFGAAMDLFLDT-- 455 (743) T ss_pred ecceeccccceeeccCcccceeeeccccCccccceeeeecccccccccceEEEeecCccccccchhHHHHHHHHhhhc-- Confidence 1111111111122223333333333333333333333333333444445566665532 12 2233333322 Q ss_pred hhhhhhhhhccccccc-----hHHHHHHHHhhhhcC-ceeeeccCCCc--------------cHHHHHHh-hhcccCcee Q lcl|NC_019932. 121 QLGVKPRILGVPGLDA-----LEVSTALASIAQQLR-AFAYVSAWGCK--------------TLSEAMAY-RENFSQREL 179 (389) Q Consensus 121 ~~~~~p~~~~apg~~~-----~~v~~al~~~~~~~~-~~~i~d~~~~~--------------t~~~a~~~-~~~~~s~~~ 179 (389) ....+.++++|++.+ .+++.++.++|++++ +++++|+|.+. +..++..+ +..++|++. 
T Consensus 456 -~~~~~~ll~~p~~~~~~~~~~~v~~a~~~~~~~~~~~~a~~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~ 534 (743) T protein:vir:10 456 -EETEIDFVLMGGSMADEADTKSKATKVIAIAASRKDALAFVSPHKGNQIASTGNVALSSAQQKENTIAFFSDLTSTSYA 534 (743) T ss_pred -cccCcceEEecCcccCccchHHHHHHHHHHHHhhCCeEEEEecCCCccccccccccccccccchHHHHHHHhccCCeeE Confidence 122357889998753 568999999999875 89999998753 22344444 445789999 Q ss_pred EEeeeeEEEEeecCCCceEEehhHHHHHHHHhhhccccceeccCCceeccceeccccccccccCCcchhhhhcccceEEE Q lcl|NC_019932. 180 MVIWPDFISWNTTANQSETAYATARALGLRAKIDTDTGWHKTLSNVGVNGVTGISASVFWDLQQTGTDADLLNEACVTTL 259 (389) Q Consensus 180 ~~~~p~~~~~~~~~~~~~~~p~s~~~Ag~~a~~d~~~g~~~span~~l~gv~~~~~~~~~~~~~~~~~~~~l~~~~i~~~ 259 (389) ++||||++++|+.++..+++|||+++||++||+|.++|||+||+|+.+.||.++. .....+++.|++.||++|||++ T Consensus 535 ~~~~p~~~~~d~~~~~~~~~p~s~~~AGl~a~~D~~~g~~~span~~~~gi~g~~---~~~~~~~~~~~~~Ln~~gIn~i 611 (743) T protein:vir:10 535 VFDSGYKYVYDRFTDKYRYIPCNGDVAGLCVQTSNQLDDWYSPAGLNRGGILNAV---KLAYNPNKADRDELYQNRINPV 611 (743) T ss_pred EEEccceeeeccccCceeEechhHHHHHHHHHhhccCCcEEccCCeeeeeeeccc---cceecCChhHHHhHhhCCceEE Confidence 9999999999999999999999999999999999999999999999998888863 3456677889999999999999 Q ss_pred E--cCCCEEEEcCccC-CCCcccceeehhhHHHHHHHHHHHHHHHHhhcCCCHHHHHHHHHHHHHHHHHHHhCCceeeeE Q lcl|NC_019932. 
260 I--RKDGFRFWGNRTC-SDDPLFAFENYTRTAQVIADTMAEAHMWANDKPLTPVLVRDIIAGINAKFRELVSAGYLLGAS 336 (389) Q Consensus 260 ~--~~~G~~~wG~rT~-~~d~~~~~i~vrR~~~~i~~~~~~~~~~~v~e~n~~~~~~~i~~~i~~~l~~l~~~gal~g~~ 336 (389) + +++|+++||+||+ +.|++|+||++||||+||+++|++.++|++||||++.+|++|+++++.||++||++|+|.||+ T Consensus 612 ~~~~~~G~~~wG~rT~~s~d~~~~~i~vrR~~~~i~~si~~~~~~~v~e~n~~~~~~~i~~~i~~fL~~l~~~gal~~~~ 691 (743) T protein:vir:10 612 VSLRGQGITLFGDKTALAAPSAFDRINVRRLFLNLEKRARRLAEGVLFEQNDATTRAGFSSALNSYLSEVQARRGVTDYL 691 (743) T ss_pred EEecCCeEEEEcccccCCCCcccceEeehhhHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhcCceeeeE Confidence 5 5789999999998 568999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEecCCCCHHHhhCCEEEEEEEEEecccceEEEEEEEE--cchHHHHHHHH Q lcl|NC_019932. 337 CWYDDTANDKDTLKAGKLFIDYDYTPVPPLEDLTLRQRI--TDSYLANFAAS 386 (389) Q Consensus 337 v~~d~~~n~~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~--~~~~~~~~~~~ 386 (389) |+||+++||+++|++|+|+++|+++|++|+|||+|+|++ +..+|+|++++ T Consensus 692 V~~d~~~nt~~~i~~G~~~~~i~~~p~~pae~I~~~~~~~~~~~~~~e~~~~ 743 (743) T protein:vir:10 692 VICDESNNTPDIIDRNEFVAEVYVKPTRSINFITITFTATKTGVTFSEVVGR 743 (743) T ss_pred EEEcCCCCCHHHhhCCeEEEEEEEEecCCcceEEEEEEEeecCcchHhhhcC Confidence 999999999999999999999999999999999999986 45578888888 No 30 >protein:vir:98824 Length: 774 # NCBI annotation: putative phage tail sheath protein # Family: family:all:661 # MgeID: mge:1530 # MgeName: Ma-LMM01 # Cross-refs: genbank:acc:YP_851105;genbank:gi:117530262;genbank:GeneID:4484488 Probab=100.00 E-value=1.6e-80 Score=458.03 Aligned_cols=373 Identities=17% Similarity=0.127 Sum_probs=283.9 Q ss_pred CCC-C-CCCEEEEECCCCCccccc-ccccceeeeecccccccccccccccEEEecchhhhhhhccc-chhHH---H---- Q lcl|NC_019932. 1 MSD-Y-HHGVRVVEINDGTRTIST-VSTAIVGMVCTADDADAAAFPLNEPVLLTNVLSAIGKAGKK-GTLAA---A---- 69 (389) Q Consensus 1 M~~-~-~~GV~v~~v~~~~~~~~~-v~t~v~~~~g~a~~~~~~~~~~~~~vli~~~~~~~~~~~~~-gtl~~---~---- 69 (389) |+= + .|||||+|+.++++++.. |.|++.+|+|.++. .|.++|++++++.++..+|+.- |.+.. 
+ T Consensus 279 ~~~~v~~~GVYVEEVpSGvrtIeGGV~TSVAAFVG~A~r-----GPvn~PvlITS~aD~~~~Fg~~~GGl~GassA~r~~ 353 (774) T protein:vir:98 279 ITRNVEDNGVVIQLEPALTGSISNRFSFYVTANDNTANR-----GFTTSPALVTTIPDPAIHFTSFQGGLDGPRSAFRDF 353 (774) T ss_pred eEEEEecCceEEEEeCCCCccccccccceeeeecccccC-----CCCCcCEEEeehhHhhhhhccccCCccccceeeeee Confidence 442 3 299999999999999987 99999999998854 5789999999999977666420 00000 0 Q ss_pred ----------HHhhh--cccCceEEEE------------Ee-------------------c---------c--------- Q lcl|NC_019932. 70 ----------LQAIA--DQAKPVTVVV------------RV-------------------A---------E--------- 88 (389) Q Consensus 70 ----------v~~~~--~~~~~~~~v~------------~~-------------------~---------~--------- 88 (389) +.+.. ..+....+.+ .. . + T Consensus 354 ~~~sG~~~L~i~A~~pGawGN~ItV~I~~~t~~~~~l~v~~~~~s~f~~~~a~e~~tv~~~~~~~~~~v~e~~dn~~i~~ 433 (774) T protein:vir:98 354 YTFNGTPLLRLQAVSEGNWGNQVTVSIYPVNNSEFRLNVQDLNGSAFNPPLADEVYTVKLGDTNESGELNALLDSKFIRG 433 (774) T ss_pred eeecccceEEEEEeecCcCCCceEEEEEecCCceeEEEEEecCCccccccccceeEEEecccccccceeeeeeceeeEee Confidence 00000 0000000000 00 0 0 Q ss_pred -cccccc-----------------ccc------------------------cccccccccccchhhHHHHHHhhhhhhhh Q lcl|NC_019932. 89 -GATPAE-----------------TTS------------------------NIIGTTDENGRYTGMKALLSAQTQLGVKP 126 (389) Q Consensus 89 -~~~~~~-----------------t~~------------------------~~~~~~d~~~~~tGl~a~~~~~~~~~~~p 126 (389) ...... ... ...++.+. ..+....+..+........ T Consensus 434 ~~~~~~~~~in~vs~lv~~~~~~~a~~d~~~~~~~~~~~~~~~~~~~~v~v~lagG~Dg--~~tt~~~igg~~~~~~~tg 511 (774) T protein:vir:98 434 FFLPKSIDSINYDAALVRQSPLRLAPPDESETDVENPAHVDFYGPNVLVDVTLENGYDG--PPVTNDDYVSIIRTLENQP 511 (774) T ss_pred cccccccccccccccccccchhcccccccccccccccccccccCCcceEEEeecCCCCc--ccccchheecccccccccc Confidence 000000 000 00011110 0011111111111111111 Q ss_pred hhhccccccchHHHHHHHHhhhhc-----CceeeeccCCCccHHHHHHhhhcccCceeEEeeeeEEEEeecCCCceEEeh Q lcl|NC_019932. 
127 RILGVPGLDALEVSTALASIAQQL-----RAFAYVSAWGCKTLSEAMAYRENFSQRELMVIWPDFISWNTTANQSETAYA 201 (389) Q Consensus 127 ~~~~apg~~~~~v~~al~~~~~~~-----~~~~i~d~~~~~t~~~a~~~~~~~~s~~~~~~~p~~~~~~~~~~~~~~~p~ 201 (389) -.++..+....+++.++..+|+.+ .+++++|.|++.+.+++++++++++|.+.++||||++++|+..+..+++|| T Consensus 512 i~aLl~a~~~~~V~~aii~~~e~~~~~~~~r~avid~p~g~t~~~Ai~~r~~f~S~~aal~~Pwvkv~D~~~g~~~~vPp 591 (774) T protein:vir:98 512 VHILLVGTTNVGVQQALITEAERASDSDGLRIAVLAAPPRTTPTLAASVTRGFNSTRAVMVAGWFTYAGQPNSSRYGVPG 591 (774) T ss_pred eeEEEcCccchhhHHHHHHHHHHhhhcccceEEEEECCCCCCHHHHHHHHhccCCceEEEEeCcEEEeccCCCceeecCh Confidence 122334555667888887777754 578999999999999999999999999999999999999999999999999 Q ss_pred hHHHHHHHHhhhccccceeccCCceeccceeccccccccccCCcchhhhhcccceEEE---EcCCCEEEEcCccCCCCcc Q lcl|NC_019932. 202 TARALGLRAKIDTDTGWHKTLSNVGVNGVTGISASVFWDLQQTGTDADLLNEACVTTL---IRKDGFRFWGNRTCSDDPL 278 (389) Q Consensus 202 s~~~Ag~~a~~d~~~g~~~span~~l~gv~~~~~~~~~~~~~~~~~~~~l~~~~i~~~---~~~~G~~~wG~rT~~~d~~ 278 (389) |+++||++||+| ||+||+|++|+|++++..+........+.+++.|+.++||++ +.++|+++||+||+++||+ T Consensus 592 Sg~vAGl~ArtD----v~kSPANk~I~Givg~ai~~~l~~~~t~ae~d~Ln~~gIN~i~itt~g~G~rvWG~RTlssDp~ 667 (774) T protein:vir:98 592 AAVYAGKLAAID----FFVSPAARSLVGPLFNIIESDTDNYTSRSNQDIYSAARLEVLSLDTVDRTYRFASGVTLSTDPA 667 (774) T ss_pred hHHHHHHHHhcC----cccccCCceeecceeccccccccccccchhhhhhcccccceeEEEEcCCcEEEEcccccCCCcc Confidence 999999999999 999999999999999988877777778888999999999986 4579999999999999999 Q ss_pred cceeehhhHHHHHHHHHHHHHHHHhhcCCCHHHHHHHHHHHHHHHHHHHhCCceeeeE-EEEecCCCCHHHhhCCEEEEE Q lcl|NC_019932. 
279 FAFENYTRTAQVIADTMAEAHMWANDKPLTPVLVRDIIAGINAKFRELVSAGYLLGAS-CWYDDTANDKDTLKAGKLFID 357 (389) Q Consensus 279 ~~~i~vrR~~~~i~~~~~~~~~~~v~e~n~~~~~~~i~~~i~~~l~~l~~~gal~g~~-v~~d~~~n~~~~i~~G~~~~~ 357 (389) |+||++|||++||+++|++.++|++||||++.+|++|+++++.||++||++|+|.|++ |+||+++||+++|++|+|+++ T Consensus 668 wr~InVRRlfd~Ie~SI~~~~~~~VfEPNd~~l~~~I~~sI~~fL~~L~~~GaL~G~~~V~~D~etNt~~dI~~G~l~i~ 747 (774) T protein:vir:98 668 WERIYLRRVHDVVRQGAHAILRNYVAMPNSRLVRNQIAAALNAFMGELKRNGNIVSFRPAIIDGSNNSTAAYFSRELYVS 747 (774) T ss_pred cceEeehhhHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhCCceecceEEEEcCCCCCHHHhhCCEEEEE Confidence 9999999999999999999999999999999999999999999999999999999997 899999999999999999999 Q ss_pred EEEEecccceEEEEEEEEcchHHHHHHHH Q lcl|NC_019932. 358 YDYTPVPPLEDLTLRQRITDSYLANFAAS 386 (389) Q Consensus 358 i~~~p~~p~e~i~~~~~~~~~~~~~~~~~ 386 (389) |+++|++|+|||+|+++++.++.+ |++ T Consensus 748 I~vaP~~PAEfIilri~q~t~~~~--l~E 774 (774) T protein:vir:98 748 LQFQPLYSADYIYVTISRDTETSP--LGE 774 (774) T ss_pred EEEEecCCcceEEEEEEEeeccee--ccC Confidence 999999999999999999888633 333 No 31 >protein:vir:104477 Length: 749 # NCBI annotation: gp18 # Family: family:all:661 # MgeID: mge:1548 # MgeName: P-SSM4 # Cross-refs: genbank:acc:YP_214663;genbank:gi:61806304;genbank:GeneID:3294532 Probab=100.00 E-value=1.6e-76 Score=436.10 Aligned_cols=378 Identities=13% Similarity=0.053 Sum_probs=245.5 Q ss_pred CCCCCC--------------CEEEEEC--CCCCcccccc------------cccceeeeeccccccccc---ccccccEE Q lcl|NC_019932. 1 MSDYHH--------------GVRVVEI--NDGTRTISTV------------STAIVGMVCTADDADAAA---FPLNEPVL 49 (389) Q Consensus 1 M~~~~~--------------GV~v~~v--~~~~~~~~~v------------~t~v~~~~g~a~~~~~~~---~~~~~~vl 49 (389) -.+-+| |-.++.. .+.+...+.. ..+...+.+.++...... ........ 
T Consensus 320 ~~D~~~v~v~~~~g~~~~~~g~v~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~v~~~~~~~~~~~~~~~~~~~~~~~ 399 (749) T protein:vir:10 320 HRDEMHVILVDIDGGVTGTVGALLERYIDVSKASDAKTSVGETNYYAEVIKQKSEFIYWAEHESTLYAATSSASDGLFGQ 399 (749) T ss_pred CCCceEEEEecCCCeeeecccceeeeeeeccccccccccccccchhhhhhccCCCEEEEEeccccccccccccccccccc Confidence 111111 1111000 0000000000 000000111110000000 00000000 Q ss_pred Eecchhhhhhhcccc--hhHHHHHhhhcccCceEEEEEeccccccccccccccccccccccchhhHHHHHHhhhhhhhhh Q lcl|NC_019932. 50 LTNVLSAIGKAGKKG--TLAAALQAIADQAKPVTVVVRVAEGATPAETTSNIIGTTDENGRYTGMKALLSAQTQLGVKPR 127 (389) Q Consensus 50 i~~~~~~~~~~~~~g--tl~~~v~~~~~~~~~~~~v~~~~~~~~~~~t~~~~~~~~d~~~~~tGl~a~~~~~~~~~~~p~ 127 (389) ......+.......+ .+......... .....+.+++..+.+...+...... .......++.++. ........+. T Consensus 400 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~gg~d~~~~~~~~~~--~~~~~~~~~~~l~-~~~~~~~~~l 475 (749) T protein:vir:10 400 TAANRQFNLFRSAAGSVDYPAGVTTLGS-KNNATYYYRLSGGVNYTVSAGQYTI--TNTDIGSAYELIG-DPESQIVDFI 475 (749) T ss_pred ccccceeeccccccccceeccccccccc-cCCcEEEEEccCCcccccccccccc--cchhHHHHHHHhh-hhhhcccceE Confidence 000000000000011 11111111111 2334455566655554333222111 1122223333333 3333333344 Q ss_pred hhcccccc---chHHHHHHHHhhhhcCc-eeeeccCCCcc---------HHHHHHhhh-cccCceeEEeeeeEEEEeecC Q lcl|NC_019932. 128 ILGVPGLD---ALEVSTALASIAQQLRA-FAYVSAWGCKT---------LSEAMAYRE-NFSQRELMVIWPDFISWNTTA 193 (389) Q Consensus 128 ~~~apg~~---~~~v~~al~~~~~~~~~-~~i~d~~~~~t---------~~~a~~~~~-~~~s~~~~~~~p~~~~~~~~~ 193 (389) ++..|+++ ..+++.+|.++|++++. ++++|.|.+.. ..++..++. 
..+|.+.++||||++++|+.+ T Consensus 476 i~~~~~~~~~~~~~v~~al~~~~~~~~~~~~~~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~ 555 (749) T protein:vir:10 476 ISGPSGTSDANALAKITSLVNIAEERRDCMVFVSPRRGNVIGISNTTTITTNIVDFFKKLPSSSYMVFDSGYKYIYDKYN 555 (749) T ss_pred EEecCCCCcchhHHHHHHHHHHHhhcCCEEEEEcCCCCcccccccchhhhhHHHHHHhhccCceeEEEEccceeeecccc Confidence 44455554 34689999999998865 56666655422 234445554 457889999999999999999 Q ss_pred CCceEEehhHHHHHHHHhhhccccceeccCCceeccceeccccccccccCCcchhhhhcccceEEEE--cCCCEEEEcCc Q lcl|NC_019932. 194 NQSETAYATARALGLRAKIDTDTGWHKTLSNVGVNGVTGISASVFWDLQQTGTDADLLNEACVTTLI--RKDGFRFWGNR 271 (389) Q Consensus 194 ~~~~~~p~s~~~Ag~~a~~d~~~g~~~span~~l~gv~~~~~~~~~~~~~~~~~~~~l~~~~i~~~~--~~~G~~~wG~r 271 (389) +..+++|||+++||++||+|.++|||+||||+++.++.+.. ..+..+++.|++.||++|||+++ +++|+++||+| T Consensus 556 ~~~~~~p~s~~vAGl~Ar~D~~~g~~~SPan~~~~~i~g~~---~~~~~~~~~e~~~Ln~~gIn~i~~~~g~G~~~wG~r 632 (749) T protein:vir:10 556 DVYRYIPCNGDTAGLCLQTNEISEPWFSPAGFQRGVLRNAI---KLAYTPNKAQRDQLYANRVNPIVSFPGQGVVLYGDK 632 (749) T ss_pred CceEEechHHHHHHHHHHhhccCCcEECcCCceeeeeeccc---cceeecChhHHHhhhhCCceEEEEecCCeEEEEcce Confidence 99999999999999999999999999999999876666653 23455678899999999999996 46899999999 Q ss_pred cC-CCCcccceeehhhHHHHHHHHHHHHHHHHhhcCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEecCCCCHHHhh Q lcl|NC_019932. 
272 TC-SDDPLFAFENYTRTAQVIADTMAEAHMWANDKPLTPVLVRDIIAGINAKFRELVSAGYLLGASCWYDDTANDKDTLK 350 (389) Q Consensus 272 T~-~~d~~~~~i~vrR~~~~i~~~~~~~~~~~v~e~n~~~~~~~i~~~i~~~l~~l~~~gal~g~~v~~d~~~n~~~~i~ 350 (389) |+ +.|++|+||||||||+||+++|++.++|++||||++.+|++|+++++.||++||++|+|.||+|+||+++||+++|+ T Consensus 633 T~~s~d~~~~~i~vRRl~~~ie~si~~~~~~~v~epn~~~l~~~i~~~i~~fL~~l~~~G~i~~f~V~~d~~~Nt~~~i~ 712 (749) T protein:vir:10 633 TALGFASAFDRINIRRLFLTVERVISTAAKAQLFEQNDEAQRSLFINIVEPYLRDVQGRRGVVDFLVKCDSTNNTPEAVD 712 (749) T ss_pred ecCCCCcccceeehhhhHHHHHHHHHHHHHHhhcCCCCHHHHHHHHHHHHHHHHHHHhcCCeeeeEEEEcCCCCCHHHhh Confidence 97 67899999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCEEEEEEEEEecccceEEEEEEEEcch--HHHHHHH Q lcl|NC_019932. 351 AGKLFIDYDYTPVPPLEDLTLRQRITDS--YLANFAA 385 (389) Q Consensus 351 ~G~~~~~i~~~p~~p~e~i~~~~~~~~~--~~~~~~~ 385 (389) +|+|+++|+++|++|+|||+|+|++... +++|+.+ T Consensus 713 ~G~~~~~i~~~P~~pae~I~~~~~~~~~~~~~~e~~s 749 (749) T protein:vir:10 713 RGEFYAEVFLKPTRTINYVQLTFVATRTGVSFAEVAS 749 (749) T ss_pred CCEEEEEEEEEecCCccEEEEEEEEeecCcchHHHhC Confidence 9999999999999999999999988765 5666665 No 32 >protein:vir:5833 Length: 742 # NCBI annotation: similar to tail sheath protein # Family: family:all:661 # MgeID: mge:123 # MgeName: RM 378 # Cross-refs: genbank:acc:NP_835619;genbank:gi:30044022 Probab=100.00 E-value=3.5e-75 Score=428.76 Aligned_cols=363 Identities=12% Similarity=0.082 Sum_probs=255.0 Q ss_pred CCCCCCCEEEEECCCCCcccc-----------------cccccceeeeecccccccccccccccEEEecchhhhhhhccc Q lcl|NC_019932. 1 MSDYHHGVRVVEINDGTRTIS-----------------TVSTAIVGMVCTADDADAAAFPLNEPVLLTNVLSAIGKAGKK 63 (389) Q Consensus 1 M~~~~~GV~v~~v~~~~~~~~-----------------~v~t~v~~~~g~a~~~~~~~~~~~~~vli~~~~~~~~~~~~~ 63 (389) ..+-+..|.+-...+.+++.. .++.+..++............+++.++++.+........... 
T Consensus 350 ~~~~~~~v~~~~t~~~~~pp~~~~~~e~v~~ngG~~f~v~s~~~~g~~i~~~~as~~~s~ln~~~~V~Gt~aa~~~~d~~ 429 (742) T protein:vir:58 350 IVNELTNVSIPVTDSAIIPPMRFTRIEQITLSGGASFSVISNQPYGFNIQDSRHSYWLSPFKDDELIIGTELVLPALDVS 429 (742) T ss_pred eccccccceeeccccccCCcccccccceeecccCcceEEEEecccCcceeccCcceEEeccCCceEEEeehhhccccccc Confidence 111123344444443333211 111111222111111122233566666554433221111100 Q ss_pred chhHHHHHhhhcccCceEEEEEeccccccc----ccccccccccc----ccccchhhHHHHHHhhhhhhhhhhhcccccc Q lcl|NC_019932. 64 GTLAAALQAIADQAKPVTVVVRVAEGATPA----ETTSNIIGTTD----ENGRYTGMKALLSAQTQLGVKPRILGVPGLD 135 (389) Q Consensus 64 gtl~~~v~~~~~~~~~~~~v~~~~~~~~~~----~t~~~~~~~~d----~~~~~tGl~a~~~~~~~~~~~p~~~~apg~~ 135 (389) ....+.................++.+.. ....+..+..+ ..+.++|++++++... +.++++||++ T Consensus 430 --t~~~v~s~~~alp~~a~sv~laGG~dg~v~v~~~~~D~iG~~~~~d~~~adrTGL~ALlev~e-----VtILiAPG~t 502 (742) T protein:vir:58 430 --TEFGVSSWEEALPEFSFLMPFQGGSDGYIRVDENEPDTIGRVKITPALLANYERLLPLLTEDQ-----FDLVLTPYLT 502 (742) T ss_pred --hheeccccccccceeeEEEeecCCccccccccCCCcccccccccccccccchhHHHHhhhcCC-----CcEEEEcCCC Confidence 0000000001111112223333333221 11122222222 1345789999987653 6899999999 Q ss_pred chHHHHHHHHhhhhc--CceeeeccCCCcc-HHHHHHhhhcccCceeEEeeeeEEEEeecCCCceEEehhHHHHHHHHhh Q lcl|NC_019932. 136 ALEVSTALASIAQQL--RAFAYVSAWGCKT-LSEAMAYRENFSQRELMVIWPDFISWNTTANQSETAYATARALGLRAKI 212 (389) Q Consensus 136 ~~~v~~al~~~~~~~--~~~~i~d~~~~~t-~~~a~~~~~~~~s~~~~~~~p~~~~~~~~~~~~~~~p~s~~~Ag~~a~~ 212 (389) +..++.++.++|+.. 
+..++.|.|.+.+ .+++.+++..++|.+++++|||+++++ .+..+++|||+++||++||+ T Consensus 503 ~~~v~aav~A~la~a~~Rl~vL~D~P~~~tt~~~A~a~r~~~nSsraaly~PwVkv~d--~~~~r~vPpSgaIAGL~ARt 580 (742) T protein:vir:58 503 FADHAGTVNAFINRAENRFLYLFDIAGDDDTENLAISLAGYINSSFATTFFPWVRRLT--NKGMRTVPASLAAYRSIRTT 580 (742) T ss_pred chHHHHHHHHHHHhhcCCeEEEEecCCCCchHHHHHHHHhccCCceEEEEeceeeecc--CCcceeechHHHHHHHHHHh Confidence 988888888887764 4456777777654 467888999999999999999999875 46778999999999999999 Q ss_pred hccccceeccCCceeccceeccccccccccCCcchhhhhcccceEEEEcC-CCEEEEcCccC-CCCcccceeehhhHHHH Q lcl|NC_019932. 213 DTDTGWHKTLSNVGVNGVTGISASVFWDLQQTGTDADLLNEACVTTLIRK-DGFRFWGNRTC-SDDPLFAFENYTRTAQV 290 (389) Q Consensus 213 d~~~g~~~span~~l~gv~~~~~~~~~~~~~~~~~~~~l~~~~i~~~~~~-~G~~~wG~rT~-~~d~~~~~i~vrR~~~~ 290 (389) |.++|+|+||+|+.+.+.. ...+.|++.||++||+++++. +||++||+||+ +.|++|+||+|||+|+| T Consensus 581 D~erGvw~SPANrgii~~~----------~~s~se~d~LN~~GINtIrsfG~G~rlWGnRTlassDs~wryInVRRlfd~ 650 (742) T protein:vir:58 581 DPETGLAPVGARRGVVTGE----------PVRQVDWEDLYNNRINPIVRVGNDVLLFGQKTMLNVNSALNRINVRRLLIV 650 (742) T ss_pred ccCCceEecCCcceeeecc----------ccchhhHHHHhhCCceEEEECCCcEEEEcceecCCCCcccceEeehhhHHH Confidence 9999999999998653222 124678899999999999774 79999999998 67999999999999999 Q ss_pred HHHHHHHHHHHHhhcCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEecCCCCHHHhhCCEEEEEEEEEecccceEEE Q lcl|NC_019932. 
291 IADTMAEAHMWANDKPLTPVLVRDIIAGINAKFRELVSAGYLLGASCWYDDTANDKDTLKAGKLFIDYDYTPVPPLEDLT 370 (389) Q Consensus 291 i~~~~~~~~~~~v~e~n~~~~~~~i~~~i~~~l~~l~~~gal~g~~v~~d~~~n~~~~i~~G~~~~~i~~~p~~p~e~i~ 370 (389) |+++|+++++|++|||||+.+|++|+++++.||++||++|+|.||+|+||+ +||+++|++|+|+++|+++|++|||||+ T Consensus 651 Ie~SI~~a~q~~VfEPNd~~L~~sIk~sInafL~~L~aqGALlGfrV~lDe-tNTpeDI~~Gklvv~I~vAP~~PAEfI~ 729 (742) T protein:vir:58 651 MRNRISQILSSYLFENNTSENRLRAEALVRQYLESLRLRGAVTDYEVAIDS-VTTPTDIDNNTLRARVTVQPARSIEYID 729 (742) T ss_pred HHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEcC-CCCHHHhhCCEEEEEEEEEccCCcceEE Confidence 999999999999999999999999999999999999999999999999995 5889999999999999999999999999 Q ss_pred EEEEEcchHHHHHHH Q lcl|NC_019932. 371 LRQRITDSYLANFAA 385 (389) Q Consensus 371 ~~~~~~~~~~~~~~~ 385 (389) |++.++..+.+ |+ T Consensus 730 lrf~it~tga~--Fs 742 (742) T protein:vir:58 730 ITFVITPTGVE--IT 742 (742) T ss_pred EEEEEEecccc--cC Confidence 99999877655 33 No 33 >protein:vir:79798 Length: 717 # NCBI annotation: tail sheath subunit # Family: family:all:12069 # MgeID: mge:1874 # MgeName: 0305phi8-36 # Cross-refs: genbank:acc:YP_001429630;genbank:gi:156564120;genbank:GeneID:5525563 Probab=100.00 E-value=2.8e-50 Score=292.25 Aligned_cols=353 Identities=13% Similarity=0.094 Sum_probs=185.9 Q ss_pred CCCCCCCEEEEE------------------CCCCCcc----cccccccceeeeecccccccccccccccEEEecchhhhh Q lcl|NC_019932. 1 MSDYHHGVRVVE------------------INDGTRT----ISTVSTAIVGMVCTADDADAAAFPLNEPVLLTNVLSAIG 58 (389) Q Consensus 1 M~~~~~GV~v~~------------------v~~~~~~----~~~v~t~v~~~~g~a~~~~~~~~~~~~~vli~~~~~~~~ 58 (389) |.+--+|+.-+. |.+.+.+ ....++..+++.-+....+...++.....+.....+... 
T Consensus 330 ~~~~~~g~~~~~pl~~ts~dy~~~~~~vdgI~~~~~~~V~~~g~~s~a~a~~~~g~~s~d~a~f~Gg~dgl~~~~ee~Y~ 409 (717) T protein:vir:79 330 KPESKRGMISEDPLVFKSGDYTNFKMLVDAINNHPFNNVVRARTKPEFEATFTSTLQAAADAKFSGGKDELSLDKEEMYK 409 (717) T ss_pred cccccCcceeccccccccCceeeeeeeecccccCchhheeeeecccccceeeeecccCchhhccCCCccccccchhhhhc Confidence 333333333222 1111222 111122222222222122212222111111000011110 Q ss_pred hh----cccchhH-HHHHhhhcccCceEEEEEeccccccccccccccccccccccchhhHHHHHHhhhhhhhhhhh---- Q lcl|NC_019932. 59 KA----GKKGTLA-AALQAIADQAKPVTVVVRVAEGATPAETTSNIIGTTDENGRYTGMKALLSAQTQLGVKPRIL---- 129 (389) Q Consensus 59 ~~----~~~gtl~-~~v~~~~~~~~~~~~v~~~~~~~~~~~t~~~~~~~~d~~~~~tGl~a~~~~~~~~~~~p~~~---- 129 (389) .. .+.+.+. .......... ...+++......+ .......+. ..+++...-....... .+.+. T Consensus 410 ~lGgk~~d~g~lt~~aays~LE~~-dVDlVil~ga~ad-----tt~ga~~d~--va~alad~caalSal~-r~ai~VI~l 480 (717) T protein:vir:79 410 RLGGEKNEEGFVTKQGAYQYLENY-EVDYVIPLGVHAD-----TKLIGKYDD--FAYQLALACAVMSHYN-SVTIGIIPT 480 (717) T ss_pred cccccccccccccchhhhhhcCcc-eeEEEEecCcccc-----ccccchhhh--HHHHHHHHHHHhhhcc-ccceeeecc Confidence 00 1111110 0111111110 1111111110000 000000000 0011111000000000 01111 Q ss_pred -ccccccchHHHHHHHHhhhhcCceeeeccCCCccHHHHHHhhhcccCceeEEeeeeEEEEeecCCCceEEehhHHHHHH Q lcl|NC_019932. 130 -GVPGLDALEVSTALASIAQQLRAFAYVSAWGCKTLSEAMAYRENFSQRELMVIWPDFISWNTTANQSETAYATARALGL 208 (389) Q Consensus 130 -~apg~~~~~v~~al~~~~~~~~~~~i~d~~~~~t~~~a~~~~~~~~s~~~~~~~p~~~~~~~~~~~~~~~p~s~~~Ag~ 208 (389) ..+......+......+..........+...... 
.+.......+++ +...++++..++.+..+.....|++|++||+ T Consensus 481 ~sp~D~~~AtVe~~~~kLs~~Aaa~~~~d~~~a~a-~~~~~~~idis~-y~~vv~~~~~iv~~~~~~~~~~p~AG~vAGl 558 (717) T protein:vir:79 481 TTPSDISLAGVEEHVKKLENYANEFYMRDRFGNII-FDADRNKIDLGQ-FIEVVAGPDFIVRNTRLGQMASTPDASYIGM 558 (717) T ss_pred ccccccchhhHHHHHHHHHhhhhhhhhhcchhccc-cccccccccccc-eeeeeecceeEEEcCCCceeecCHHHHHHHH Confidence 0111111111111111100000000000000000 000000111222 3333344444444456667777887666666 Q ss_pred HHhhhccccceeccCCceeccceeccccccccccCCcchhhhhcccceEEEE--cCCCEEEEcCccCCCCc-ccceeehh Q lcl|NC_019932. 209 RAKIDTDTGWHKTLSNVGVNGVTGISASVFWDLQQTGTDADLLNEACVTTLI--RKDGFRFWGNRTCSDDP-LFAFENYT 285 (389) Q Consensus 209 ~a~~d~~~g~~~span~~l~gv~~~~~~~~~~~~~~~~~~~~l~~~~i~~~~--~~~G~~~wG~rT~~~d~-~~~~i~vr 285 (389) . ..+|+|+||+|+.+.|+.++...+ +..|++.||++||+++. +++|+++||+||+++++ .|+||++| T Consensus 559 d----A~rGVwkSPANk~I~GVvgLa~~l------T~sE~d~Ln~aGIntIr~~~GrGirVWGaRTtasd~sdWryInVR 628 (717) T protein:vir:79 559 V----SQLKTQSAPTNKPLPSVTALRYTY------SANQLNRLTKARFATFKYKQDGSIGVVDAPTSAHAGSDYTRLSTA 628 (717) T ss_pred H----hcCCcccccccceecccccCcccC------CHHHHHHHhhCCeEEEEEeCCceEEEEeeeecCCCCcccceeehh Confidence 4 557999999999999999986553 66788999999999985 57899999999998776 59999999 Q ss_pred hHHHHHHHHHHHHHHHHhhcCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEecCCCCHHHhhCCEEEEEEEEEeccc Q lcl|NC_019932. 
286 RTAQVIADTMAEAHMWANDKPLTPVLVRDIIAGINAKFRELVSAGYLLGASCWYDDTANDKDTLKAGKLFIDYDYTPVPP 365 (389) Q Consensus 286 R~~~~i~~~~~~~~~~~v~e~n~~~~~~~i~~~i~~~l~~l~~~gal~g~~v~~d~~~n~~~~i~~G~~~~~i~~~p~~p 365 (389) |++++|++++++.++|++||||++.+|.+|+.+|+.||++||++|+|.||++.+ +||++++++|+++++++++|++| T Consensus 629 Rl~D~Ie~sIr~al~~yVgEPNd~~tr~~Ik~sI~afL~~L~r~GAI~Gykvdv---tnT~~di~~G~l~V~I~vaPv~P 705 (717) T protein:vir:79 629 RIVKEAVNAVREVADPFIGEPNDTGNRNALTAAVDKRLSKMIENKALLGFDFRL---VVTPQQELLGEGSIELSLEAPNE 705 (717) T ss_pred hhHHHHHHHHHHHHHHhccccCCHHHHHHHHHHHHHHHHHHHhcCceecceeeE---ecChhHhhCCEEEEEEEEEecCc Confidence 999999999999999999999999999999999999999999999999999865 89999999999999999999999 Q ss_pred ceEEEEEEEEcc Q lcl|NC_019932. 366 LEDLTLRQRITD 377 (389) Q Consensus 366 ~e~i~~~~~~~~ 377 (389) +|||++++..+. T Consensus 706 aEfI~ititITA 717 (717) T protein:vir:79 706 LRRLTTIVSLSA 717 (717) T ss_pred ccEEEEEEEEeC Confidence 999999999988 No 34 >protein:vir:63742 Length: 562 # NCBI annotation: Tsh # Family: family:all:2449 # MgeID: mge:1517 # MgeName: P100 # Cross-refs: genbank:gi:82547629;genbank:GeneID:3783475 Probab=100.00 E-value=6.5e-39 Score=229.95 Aligned_cols=359 Identities=11% Similarity=0.033 Sum_probs=257.6 Q ss_pred CCCC-CCCEEEEECCCCCcccccccccceeeeecccccccccccccccEEEecchhhhhhhcccchhHHHHHhhh----c Q lcl|NC_019932. 1 MSDY-HHGVRVVEINDGTRTISTVSTAIVGMVCTADDADAAAFPLNEPVLLTNVLSAIGKAGKKGTLAAALQAIA----D 75 (389) Q Consensus 1 M~~~-~~GV~v~~v~~~~~~~~~v~t~v~~~~g~a~~~~~~~~~~~~~vli~~~~~~~~~~~~~gtl~~~v~~~~----~ 75 (389) |-.| +||||+++..++++++..+++++.+|+|.+... |.++|++++++.++...|+. |.|..++...+ . 
T Consensus 8 ~~~~~~Pgv~~~~~~s~~~~~~~~~~~~~~~iG~a~~G-----~~~~~~~~~~~~~~~~~fg~-g~l~~~i~~a~~~~~~ 81 (562) T protein:vir:63 8 RKPVSRPHTEISVDTSGIGGSSSGSEKILCLVGSATGG-----KPNAVYKVRNYSQAKSVFRS-GELLDAIERAWNPGEG 81 (562) T ss_pred CCcccCCceEEEEecCCCcccCCCCCceEEEEEEeCCC-----CCceeEEEccHHHHHHHhcC-CchHHHHHHhcccccc Confidence 7777 589999999999999999999999999999655 67999999999999999987 55766765555 5 Q ss_pred ccCceEEEEEeccccccccccc---------------------------------------------c------------ Q lcl|NC_019932. 76 QAKPVTVVVRVAEGATPAETTS---------------------------------------------N------------ 98 (389) Q Consensus 76 ~~~~~~~v~~~~~~~~~~~t~~---------------------------------------------~------------ 98 (389) +++..++++|+........+.. + T Consensus 82 ~g~~~~~~~rv~~a~~a~~~~~~~~~~a~~~g~~~n~i~v~~~~~~~~~~~~~~v~~~~~~~~ev~~~~g~V~~i~y~g~ 161 (562) T protein:vir:63 82 TGAGDILAMRVEEAKEATFEAEGVKVSSTIYGADANDIQVALEDNTITGTKRLSIVFAKERVNQVYDNLGSIFSIKYKGT 161 (562) T ss_pred CCceEEEEEEcCCCccceeEecceeEEEeecccCCCeEEEEEecCCCCCCcceEEEecCCCcchhhhhccceeeeeeecc Confidence 7777777766532111000000 0 Q ss_pred -------c--------------ccccc--------ccccchhhHHHHHH-----------------------------h- Q lcl|NC_019932. 99 -------I--------------IGTTD--------ENGRYTGMKALLSA-----------------------------Q- 119 (389) Q Consensus 99 -------~--------------~~~~d--------~~~~~tGl~a~~~~-----------------------------~- 119 (389) + .++.. ..+..+...+.... . T Consensus 162 ~~~~~~~v~~~~~~~~a~~l~~~~g~~~v~~~~L~~g~~~~~~~l~~~in~~~~~~aky~~~~gn~i~~~~~d~~~~~~v 241 (562) T protein:vir:63 162 EASATFTVAVDPVTFKATKLTLKAGDKTVKEYDLGSGAYAETNVLISDINNLPDFEAKFFPIGDKNLTTDNFDAQIDVDI 241 (562) T ss_pred cccceEEEEecCcceeEEEEEeecCCcceeEEEecCCccchhHHHHHhhccccceEEEeeccCCceeeeeccccccccch Confidence 0 00000 00000001000000 0 Q ss_pred -hh-----------------hhh----------------------------------------hhhhhccccccchHHHH Q lcl|NC_019932. 
120 -TQ-----------------LGV----------------------------------------KPRILGVPGLDALEVST 141 (389) Q Consensus 120 -~~-----------------~~~----------------------------------------~p~~~~apg~~~~~v~~ 141 (389) .. ..+ .....+.|..+..+++. T Consensus 242 kt~~~~v~t~~~d~~~~~~~~~~v~~~~~~~~~la~~~~~~LtGG~dGt~~~~~~~al~ale~~~~~~i~~~t~d~av~~ 321 (562) T protein:vir:63 242 KTKEAYVKAVGGDIEKQTAYNGYVDFEFDRSKEIANFPLTKLTGGDNGTIPESWADKFSYFANEGGYYLVPLTSKQAVHA 321 (562) T ss_pred hhhhhhhhhhhhhhhhcccccceeeeeeccccceecccceeeecCCCCCchhhHHHHHHHHHhCCcEEEEecCCCHHHHH Confidence 00 000 00000111112234667 Q ss_pred HHHHhhhhc-----CceeeeccCCCccHHHHHHhhhcccCceeEEeeeeEEEEeecCCCceEEeh---hHHHHHHHHhhh Q lcl|NC_019932. 142 ALASIAQQL-----RAFAYVSAWGCKTLSEAMAYRENFSQRELMVIWPDFISWNTTANQSETAYA---TARALGLRAKID 213 (389) Q Consensus 142 al~~~~~~~-----~~~~i~d~~~~~t~~~a~~~~~~~~s~~~~~~~p~~~~~~~~~~~~~~~p~---s~~~Ag~~a~~d 213 (389) ++.+++.++ ..+++++.+.+.+.+++......+++.+.+.+.|+....+. .+....+|+ ++++||++|..| T Consensus 322 ~l~a~vkr~~~~g~~~~aVlg~~~~~~~~~~~~~a~~~n~ervv~v~~~~~~~~~-~~~~~~~~~~~~aa~vAGl~A~~~ 400 (562) T protein:vir:63 322 EALQFVRDCSYNGNPMRVFVGGGIGESMEQLFTRAIGLQNERAGLIGFSGTVKMD-DGRSLKMPGYMFAAQVAGLTCGLE 400 (562) T ss_pred HHHHHHHHHHhCCCcEEEEecCCCCCCHHHHHHHhhhcCCCcEEEEecCeeEECC-CCceeeechhHHHHHHHHHhhcCc Confidence 777777665 35888988888889999888899999999999888765443 455666777 789999999886 Q ss_pred ccccceeccCCceeccceeccccccccccCCcchhhhhcccceEEEEc--CCCEEEEcC-c-----cCCCCcccceeehh Q lcl|NC_019932. 214 TDTGWHKTLSNVGVNGVTGISASVFWDLQQTGTDADLLNEACVTTLIR--KDGFRFWGN-R-----TCSDDPLFAFENYT 285 (389) Q Consensus 214 ~~~g~~~span~~l~gv~~~~~~~~~~~~~~~~~~~~l~~~~i~~~~~--~~G~~~wG~-r-----T~~~d~~~~~i~vr 285 (389) +++||.|+.+. ..++. ..++..|.+.++++|+.++.. +++.++|.. 
+ |...|+.|++|+++ T Consensus 401 ----~~~SlT~~~i~-~~~v~------~~~t~~e~~~li~~Gv~~l~~~~~~~v~~~~iv~~itT~t~~~~~~~~ki~vi 469 (562) T protein:vir:63 401 ----IGEAITFKNIA-IETLD------TIYEGSQLDQLNESGIITAEFVRNRAVTNFRIVDDVTTFNDKTDPVKSEIGVG 469 (562) T ss_pred ----hhcCccceeec-ccccc------ccCCHHHHHHHHhCCeEEEEEecCCcEEEEEeeccceecCCCCCchhhhhhhh Confidence 78899999986 45553 345788999999999999843 455566643 2 23557889999999 Q ss_pred hHHHHHHHHHHHHHH-HHhhcCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEecCCCCHHHhhCCEEEEEEEEEecc Q lcl|NC_019932. 286 RTAQVIADTMAEAHM-WANDKPLTPVLVRDIIAGINAKFRELVSAGYLLGASCWYDDTANDKDTLKAGKLFIDYDYTPVP 364 (389) Q Consensus 286 R~~~~i~~~~~~~~~-~~v~e~n~~~~~~~i~~~i~~~l~~l~~~gal~g~~v~~d~~~n~~~~i~~G~~~~~i~~~p~~ 364 (389) |++|+|.+.++..+. +|+++||+...|..|+..+..||.+|++.|+|.+|... +-+..+..+++++++.++|+. T Consensus 470 Rv~D~i~~dir~~~~~~yiGk~Nn~~~r~~v~~~i~~~L~~l~~~gaI~~~~~~-----dv~v~~~~d~~~v~~~v~pv~ 544 (562) T protein:vir:63 470 EANDFLVSELKISLDNEYIGTKIIDTSASLVKNFVQSFLDRKKLAKEIQDYSPE-----EVQVVIEGDVARISLTVFPIR 544 (562) T ss_pred HHHHHHHHHHHHHHHhcCCccccChHHHHHHHHHHHHHHHHHHhCCcccCCCcc-----ceEEEecCCEEEEEEEEEEcc Confidence 999999999988765 89999999999999999999999999999999998532 122335678899999999999 Q ss_pred cceEEEEEEEEcchHHHH Q lcl|NC_019932. 365 PLEDLTLRQRITDSYLAN 382 (389) Q Consensus 365 p~e~i~~~~~~~~~~~~~ 382 (389) |+|+|.+++.+..+-++. T Consensus 545 ~mekIy~ti~~~~~~~~~ 562 (562) T protein:vir:63 545 SMKKIEVSLVYRQQILTA 562 (562) T ss_pred cceEEEEEEEEeeeeecC Confidence 999999999999998876 No 35 >protein:vir:80779 Length: 569 # NCBI annotation: putative tail sheath protein # Family: family:all:2449 # MgeID: mge:1885 # MgeName: phiEF24C # Cross-refs: genbank:acc:YP_001504132;genbank:gi:158079319;genbank:GeneID:5666428 Probab=100.00 E-value=2e-37 Score=221.81 Aligned_cols=359 Identities=13% Similarity=0.084 Sum_probs=252.4 Q ss_pred CC-------CC-CCCEEEEECCCCCcccccccccceeeeecccccccccccccccEEEecchhhhhhhcccchhHHHHHh Q lcl|NC_019932. 
1 MS-------DY-HHGVRVVEINDGTRTISTVSTAIVGMVCTADDADAAAFPLNEPVLLTNVLSAIGKAGKKGTLAAALQA 72 (389) Q Consensus 1 M~-------~~-~~GV~v~~v~~~~~~~~~v~t~v~~~~g~a~~~~~~~~~~~~~vli~~~~~~~~~~~~~gtl~~~v~~ 72 (389) |+ .+ .||||+++..+++++++.+++++.+|+|.++.. |.+++++++++.++...|+. |.|...+.. T Consensus 1 ~~~~~~~~~~~~~Pgv~~~~~~~~~~~~~~~~~~~~~~ig~a~~G-----~~~~~~~~~~~~~~~~~f~~-g~l~~a~~~ 74 (569) T protein:vir:80 1 MAVEQFPRKKVSRPHTEITVDTSGIGGSSSSSDKTLMLVGSAKGG-----KPDTVYRFRNYQQAKQVLRS-GDLLDAIEL 74 (569) T ss_pred CeeeeecCCccccCceEEEEecCCCcCCCCCCceeEEEEEEeCCC-----CCceeEEecCHHHHHHHhcC-CchhHHHHh Confidence 43 33 699999999999999999999999999999655 66999999999999999876 446666544 Q ss_pred hh------cccCceEEEEEecccccccc----------------------------ccc----------------c-c-- Q lcl|NC_019932. 73 IA------DQAKPVTVVVRVAEGATPAE----------------------------TTS----------------N-I-- 99 (389) Q Consensus 73 ~~------~~~~~~~~v~~~~~~~~~~~----------------------------t~~----------------~-~-- 99 (389) .+ .+++..++++++........ +.. + + T Consensus 75 a~~~~~~~~~~~~~~~~~rv~~a~~a~~~~~~~~~~a~~~g~~~n~i~v~l~~~~~~~~~~~~v~~~~~~~~~~~~~ig~ 154 (569) T protein:vir:80 75 AWNASDVNTASAGDILAVRVEDAKNATLTKGGLTFASTIYGVDANEIQVALEDNNLTHTKRLTVAFSKDGYKKVFDNLGK 154 (569) T ss_pred hccCccccccCceEEEEEEcCCCeeeeeeccceeeeeeeccCCCceEEEEEecCcCCcceeeEEeeecCCCccccccccc Confidence 43 23444455544321000000 000 0 0 Q ss_pred ----c--c-----------c-------------ccc---------------------------------cccc--hh--- Q lcl|NC_019932. 100 ----I--G-----------T-------------TDE---------------------------------NGRY--TG--- 111 (389) Q Consensus 100 ----~--~-----------~-------------~d~---------------------------------~~~~--tG--- 111 (389) . + + .+. +... 
.+ T Consensus 155 v~si~ytg~~~~a~~~~~~~~~~~~a~~l~~~~g~~~~~~~~v~~~~~~~~~~~~~~~lv~~~~~~~~f~a~~~~~~~~~ 234 (569) T protein:vir:80 155 IFSIQYKGSEAQANFTIAQDSISKKATTLTLNVGSEPESTTEVMKYELGQGVYSETNVLVSAINSLPDWEAKFFPIGDKN 234 (569) T ss_pred eeeEEEeeccccceEEeecCcCcceeEEEEEEecCCcceeEEEEeeccCCccchhhhhhhhhcCCccCceEEEEecCCCc Confidence 0 0 0 000 0000 00 Q ss_pred --------------------hH--------------------------------------------HHHHHhhhhhhhhh Q lcl|NC_019932. 112 --------------------MK--------------------------------------------ALLSAQTQLGVKPR 127 (389) Q Consensus 112 --------------------l~--------------------------------------------a~~~~~~~~~~~p~ 127 (389) +. ...++...+.-... T Consensus 235 ~~~~~~d~~~~~~~~t~~~~~~~~~~di~~~~~~~~~v~~~~~~~~~l~~~~~~~LtGG~dG~~~~~~~~~l~~le~~~~ 314 (569) T protein:vir:80 235 LPTDALEAVTKVDVKTEAVFVGALAGDIAKQLEYNDYVTVAVDATKPVEDFELTNLTGGSDGTAPESWANKFPLLANEGG 314 (569) T ss_pred ceehhccchhheeccccceeeehhHHHHHHhhcCCceEEEEecCCcceeeecceeecCCCCCCccchHHHHHHHHhhCCc Confidence 00 00000000000000 Q ss_pred hhccccccchHHHHHHHHhhhhc-----CceeeeccCCCccHHHHHHhhhcccCceeEEeeeeEEEEeecCCCceEEeh- Q lcl|NC_019932. 128 ILGVPGLDALEVSTALASIAQQL-----RAFAYVSAWGCKTLSEAMAYRENFSQRELMVIWPDFISWNTTANQSETAYA- 201 (389) Q Consensus 128 ~~~apg~~~~~v~~al~~~~~~~-----~~~~i~d~~~~~t~~~a~~~~~~~~s~~~~~~~p~~~~~~~~~~~~~~~p~- 201 (389) ..+.+......++.++.++|+++ ..+++++.+.+.+.+++...+..+++.+.++++|+..+.+. .+....+|+ T Consensus 315 ~~i~~~t~d~av~~~l~a~vkr~r~~g~~~~aVvg~~~~~~~~~~~~~a~~~n~e~vv~v~~~~~~~~~-~g~~~~~~~~ 393 (569) T protein:vir:80 315 YYLVPLTDKQAVHSEALAFVKDRTDNGDPMRIIVGGGTNETVEESITRATNLRDPRASLVGFSGTRKMD-DGRLLKLPGY 393 (569) T ss_pred EEEEecCCChHHHHHHHHHHHHHHhCCCcEEEEecCCCCCCHHHHHHHHhhcCCCeEEEEecCceeecC-CCcceeechh Confidence 11112222345778888888776 35889999888899999999999999999999999887653 344455655 Q ss_pred --hHHHHHHHHhhhccccceeccCCceeccceeccccccccccCCcchhhhhcccceEEEEc--CCCEEEEcC------c Q lcl|NC_019932. 
202 --TARALGLRAKIDTDTGWHKTLSNVGVNGVTGISASVFWDLQQTGTDADLLNEACVTTLIR--KDGFRFWGN------R 271 (389) Q Consensus 202 --s~~~Ag~~a~~d~~~g~~~span~~l~gv~~~~~~~~~~~~~~~~~~~~l~~~~i~~~~~--~~G~~~wG~------r 271 (389) ++++||++|..+ +++||.|+.+. +.++.. .++..|.+.+++.|+.++.. +++.++|.. + T Consensus 394 ~~aa~vAG~~A~~~----~~~S~T~k~i~-~~~i~~------~lt~~e~~~li~~G~~~l~~~~~~~~~v~~~vn~itT~ 462 (569) T protein:vir:80 394 MMASQIAGIASGLE----VGEAITFKHFN-VTSVDR------VFESSQLDMLNESGVISIEFVRNRTLTAFRVVQDVTTY 462 (569) T ss_pred hHHHHHHHHHhcCc----cccCccceeec-cccccc------cCCHHHHHHHHhCCeEEEEEecCceEEEEEEeccceec Confidence 677888888765 88899999986 455543 35788999999999999854 344455533 2 Q ss_pred cCCCCcccceeehhhHHHHHHHHHHHHH-HHHhhcCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEecCCCCHHHhh Q lcl|NC_019932. 272 TCSDDPLFAFENYTRTAQVIADTMAEAH-MWANDKPLTPVLVRDIIAGINAKFRELVSAGYLLGASCWYDDTANDKDTLK 350 (389) Q Consensus 272 T~~~d~~~~~i~vrR~~~~i~~~~~~~~-~~~v~e~n~~~~~~~i~~~i~~~l~~l~~~gal~g~~v~~d~~~n~~~~i~ 350 (389) |...|+.|++++++|++|+|.+.++..+ .+|+++||+...|..++..++.||.+||++|+|.||... +-+..+. T Consensus 463 t~~~~~~~~~i~viRv~D~i~~dir~~~~~~yiGk~nn~~~r~~v~~~i~~~L~~l~~~gaI~~~~~~-----dv~v~~~ 537 (569) T protein:vir:80 463 NDKSDPVKNEMSVGEANDFLVSELKIELDNNFIGTKVIDTSASLIKNFIQSFLDNKKRAREIQDYTPE-----EVQVVLE 537 (569) T ss_pred CCCCCchhhhhhhhHHHHHHHHHHHHHHHhhcCcccCChhHHHHHHHHHHHHHHHHHhCCcccCCCcc-----ceEEEec Confidence 2345778999999999999999998876 589999999999999999999999999999999998532 1223456 Q ss_pred CCEEEEEEEEEecccceEEEEEEEEcchHHHH Q lcl|NC_019932. 351 AGKLFIDYDYTPVPPLEDLTLRQRITDSYLAN 382 (389) Q Consensus 351 ~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~ 382 (389) .+++++++.++|..|+|+|.+++.+..+-++. 
T Consensus 538 ~d~~~v~~~v~Pv~~~ekI~~ti~~~~~~~~~ 569 (569) T protein:vir:80 538 GDVASISMTVMPIRSLNKITVQLVYKQQILTA 569 (569) T ss_pred CCEEEEEEEEEEcccccEEEEEEEEeeeeecC Confidence 78999999999999999999999999998876 No 36 >protein:vir:80488 Length: 562 # NCBI annotation: Tsh # Family: family:all:2449 # MgeID: mge:1883 # MgeName: A511 # Cross-refs: genbank:acc:YP_001468473;genbank:gi:157325048;genbank:GeneID:5601446 Probab=100.00 E-value=3.3e-37 Score=220.62 Aligned_cols=359 Identities=11% Similarity=0.025 Sum_probs=254.4 Q ss_pred CCCC-CCCEEEEECCCCCcccccccccceeeeecccccccccccccccEEEecchhhhhhhcccchhHHHHHhhh----c Q lcl|NC_019932. 1 MSDY-HHGVRVVEINDGTRTISTVSTAIVGMVCTADDADAAAFPLNEPVLLTNVLSAIGKAGKKGTLAAALQAIA----D 75 (389) Q Consensus 1 M~~~-~~GV~v~~v~~~~~~~~~v~t~v~~~~g~a~~~~~~~~~~~~~vli~~~~~~~~~~~~~gtl~~~v~~~~----~ 75 (389) |..+ +||||+++..++.+++..+++++.+|+|.+... |.++|++++++.++...|+. |.|...+...+ . T Consensus 8 ~~~~~~pgv~~~~~~s~~~~~~~~~~~~~~~ig~a~~G-----~~~~~~~~~~~~~~~~~f~~-g~l~~~i~~a~~~~~~ 81 (562) T protein:vir:80 8 RKPVSRPHTEISVDTSGIGGSSSGSEKILCLVGSATGG-----KPNAVYKVRNYSQAKSVFRS-GELLDAIERAWNPGEG 81 (562) T ss_pred CCcccCCceEEEEecCCcccCCCCCCceEEEEEEeCCC-----CcceeEEEccHHHHHHHhcC-CChHHHHHHhcccccc Confidence 4445 599999999999999999999999999999655 66999999999999999987 55666665555 4 Q ss_pred ccCceEEEEEeccccccccccc---------------------------------------------c------------ Q lcl|NC_019932. 76 QAKPVTVVVRVAEGATPAETTS---------------------------------------------N------------ 98 (389) Q Consensus 76 ~~~~~~~v~~~~~~~~~~~t~~---------------------------------------------~------------ 98 (389) +++..++++|+........+.. 
+ T Consensus 82 ~g~~~~~~~rv~~a~~a~~~~~~~~~~~~~~g~~~n~i~v~~~~~~~~~~~~~~v~~~~~~~~ev~~~~g~v~~i~y~g~ 161 (562) T protein:vir:80 82 TGAGDILAMRVEEAKEATFEAEGVKVSSTIYGADANDIQVALEDNTITGTKRLSIVFAKERVNQVYDNLGSIFSIKYKGT 161 (562) T ss_pred cCceEEEEEEcCCCCcceEEecceEEEEeecccCCCceEEEEecCCCCCCcceEEEecCCcceEEeeccCceeeeeeccc Confidence 6666667666532111000000 0 Q ss_pred -------c--------------ccccc--------ccccchhhHHHHHH-----------------------------hh Q lcl|NC_019932. 99 -------I--------------IGTTD--------ENGRYTGMKALLSA-----------------------------QT 120 (389) Q Consensus 99 -------~--------------~~~~d--------~~~~~tGl~a~~~~-----------------------------~~ 120 (389) + .++.. ..+..+...+.... .. T Consensus 162 ~~~a~~~i~~~~~~~~a~~l~~~~g~~~v~~~~l~~g~~~~~~~l~~~i~~~~~~tAky~g~~~n~i~~~~~d~~~~~~~ 241 (562) T protein:vir:80 162 EASATFTVAVDPVTFKATKLTLKAGDKTVKEYDLGSGAYAETNVLISDINNLPDFEAKFFPIGDKNLTTDNFDAQIDVDI 241 (562) T ss_pred cccceeEEEecCccceEEEEEEecCCcceeEEEeCCCccchhhhhhhhhccccceEEEecccCCceeeecccccchhhhc Confidence 0 00000 00000001110000 00 Q ss_pred h-------------------hhhh----------------------------------------hhhhccccccchHHHH Q lcl|NC_019932. 121 Q-------------------LGVK----------------------------------------PRILGVPGLDALEVST 141 (389) Q Consensus 121 ~-------------------~~~~----------------------------------------p~~~~apg~~~~~v~~ 141 (389) + ..++ ....+.+......++. T Consensus 242 kt~~~~v~~~~~d~~~~~~~n~~v~~~~~~~~~la~~~~~~LtGG~dG~~~~~~~dal~~Le~~~~~~i~~~t~d~ai~~ 321 (562) T protein:vir:80 242 KTKEAYVKAVGGDIEKQTAYNGYVEFEFDRSKEIANFPLTKLTGGDNGTIPESWADKFSYFANEGGYYLVPLTSKQAVHA 321 (562) T ss_pred ccceeeeeehhhhhhhcccccceEEEEeccCccccccceeeeeCCCCCCccccHHHHHHHHHhCCcEEEEecCCChHHHH Confidence 0 0000 0000000111234567 Q ss_pred HHHHhhhhc-----CceeeeccCCCccHHHHHHhhhcccCceeEEeeeeEEEEeecCCCceEEeh---hHHHHHHHHhhh Q lcl|NC_019932. 
142 ALASIAQQL-----RAFAYVSAWGCKTLSEAMAYRENFSQRELMVIWPDFISWNTTANQSETAYA---TARALGLRAKID 213 (389) Q Consensus 142 al~~~~~~~-----~~~~i~d~~~~~t~~~a~~~~~~~~s~~~~~~~p~~~~~~~~~~~~~~~p~---s~~~Ag~~a~~d 213 (389) .+.++|.++ +.+++++.+.+.+.+++......+++.+.+.+.|+..+.+. .+.....|+ ++++||++|+.+ T Consensus 322 ~~~a~vkr~r~~g~~~~aVvg~~~~~~~~~~~~~a~~~n~e~vv~v~~~~~~~~~-~~~~~~~~~~~~aa~vAGl~Ag~~ 400 (562) T protein:vir:80 322 EALQFVRDCSYNGNPMRVFVGGGIGESMEQLFTRAIGLQNERAGLIGFSGTVKMD-DGRSLKMPGYMFAAQVAGLTCGLE 400 (562) T ss_pred HHHHHHHHHHhCCCeEEEEecCCCCCCHHHHHHHhhhcCCCeEEEEecCeeEECC-CCceeeechhHHHHHHHHHHhcCc Confidence 777777665 36788988888899999999999999999999888766544 345566666 889999999887 Q ss_pred ccccceeccCCceeccceeccccccccccCCcchhhhhcccceEEEEc--CCCEEEEcC-c---cC--CCCcccceeehh Q lcl|NC_019932. 214 TDTGWHKTLSNVGVNGVTGISASVFWDLQQTGTDADLLNEACVTTLIR--KDGFRFWGN-R---TC--SDDPLFAFENYT 285 (389) Q Consensus 214 ~~~g~~~span~~l~gv~~~~~~~~~~~~~~~~~~~~l~~~~i~~~~~--~~G~~~wG~-r---T~--~~d~~~~~i~vr 285 (389) +++||.|+.+.+ .++. ..+...|.+.|++.|+.++.. +++.+.|.. + |. +.|+.|++|+++ T Consensus 401 ----~~~S~T~~~i~~-~~v~------~~lt~~e~~~li~~G~l~l~~~~~~~v~~~riv~~itT~t~~~~~~~~ki~vi 469 (562) T protein:vir:80 401 ----IGEAITFKNIAI-ETLD------TIYEGSQLDQLNESGIITAEFVRNRAVTNFRIVDDVTTFNDKTDPVKSEIGVG 469 (562) T ss_pred ----cccCccceeecc-cccc------ccCCHHHHHHHHhCCeEEEEEecCCcEEEEEeeccceeccCCCCchhhhhhhh Confidence 778999999875 3443 235778899999999999854 444555522 2 22 457889999999 Q ss_pred hHHHHHHHHHHHHH-HHHhhcCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEecCCCCHHHhhCCEEEEEEEEEecc Q lcl|NC_019932. 286 RTAQVIADTMAEAH-MWANDKPLTPVLVRDIIAGINAKFRELVSAGYLLGASCWYDDTANDKDTLKAGKLFIDYDYTPVP 364 (389) Q Consensus 286 R~~~~i~~~~~~~~-~~~v~e~n~~~~~~~i~~~i~~~l~~l~~~gal~g~~v~~d~~~n~~~~i~~G~~~~~i~~~p~~ 364 (389) |++|+|.+.+++.+ .+|+++||+...|..++..+..||.+|++.|+|.+|... +-+...+++++++++.++|.. 
T Consensus 470 Rv~D~i~~dir~~~~~~yIGk~Nn~~~r~~v~~~i~~~L~~l~~~gaI~~~~~~-----dv~v~~~~d~~~v~~~v~Pv~ 544 (562) T protein:vir:80 470 EANDFLVSELKISLDNEYIGTKIIDTSASLVKNFVQSFLDRKKLAKEIQDYSPE-----EVQVVIEGDIARISLTVFPIR 544 (562) T ss_pred HHHHHHHHHHHHHHHhcCCccccChHHHHHHHHHHHHHHHHHHhCCcccCCCcc-----ceEEEecCCEEEEEEEEEEcc Confidence 99999999999887 589999999999999999999999999999999998532 122335778899999999999 Q ss_pred cceEEEEEEEEcchHHHH Q lcl|NC_019932. 365 PLEDLTLRQRITDSYLAN 382 (389) Q Consensus 365 p~e~i~~~~~~~~~~~~~ 382 (389) |+|+|.+++.+..+-++. T Consensus 545 ~mekIy~ti~~~~~~~~~ 562 (562) T protein:vir:80 545 SMKKIEVSLVYRQQILTA 562 (562) T ss_pred cceEEEEEEEEEeeeecC Confidence 999999999999998876 No 37 >protein:vir:103168 Length: 641 # NCBI annotation: gp122 # Family: family:all:661 # MgeID: mge:1583 # MgeName: Syn9 # Cross-refs: genbank:acc:YP_717789;genbank:gi:113200626;genbank:GeneID:4239177 Probab=100.00 E-value=7.6e-37 Score=218.59 Aligned_cols=264 Identities=10% Similarity=0.027 Sum_probs=181.0 Q ss_pred CCCCC-CCEEEEECCCCCcccccccccceeeeecccccccccccccccEEEecchhhhhhhcc---cchhHHHHHhhhcc Q lcl|NC_019932. 1 MSDYH-HGVRVVEINDGTRTISTVSTAIVGMVCTADDADAAAFPLNEPVLLTNVLSAIGKAGK---KGTLAAALQAIADQ 76 (389) Q Consensus 1 M~~~~-~GV~v~~v~~~~~~~~~v~t~v~~~~g~a~~~~~~~~~~~~~vli~~~~~~~~~~~~---~gtl~~~v~~~~~~ 76 (389) |++|+ |||||+|+..+ +++..+.+++.+|+|.++ ..|.++|++++++.+|...||. ...+.+.+..+|.+ T Consensus 3 m~~~~sPGVyv~E~~~~-~~i~~v~tsvaafvG~~~-----~GP~~~p~~v~s~~d~~~~FG~~~~~~~l~~av~~fF~n 76 (641) T protein:vir:10 3 VSNQLSPGVVIQERDLT-AVTTPIGLNVGVLAAPFT-----KGPVEEIFEVSTERDLASVFGEPNDYNYEYWFTASQFLS 76 (641) T ss_pred CccccCCceEEEEecCC-CcccccCCccceEEeccc-----CCCCCccEEecCHHHHHHHcCCcCCCcchHHHHHHHHHh Confidence 99986 89999999876 689999999999999985 4588999999999999999984 46788999999999 Q ss_pred cCceEEEEEecccccccc---------------------------------------------c--cccc---------- Q lcl|NC_019932. 
77 AKPVTVVVRVAEGATPAE---------------------------------------------T--TSNI---------- 99 (389) Q Consensus 77 ~~~~~~v~~~~~~~~~~~---------------------------------------------t--~~~~---------- 99 (389) ++..|+++|......... + ..+. T Consensus 77 gG~~~~vvRv~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~A~~pG~~gn~l~v~i~~~~~~~~~~~~~~~~~ 156 (641) T protein:vir:10 77 YGGVLKAIRLNAASLKNSVDSGTAPLIKNLQEYETTYESSNSNTFKFASRDAGALGNSVGIFITDAGPDQIAVLPAPGTG 156 (641) T ss_pred cCCEEEEEEecCccccccccccchhhccccccccccccCcCccccEEEeccCCCcCCceEEEEEcCCCcceeeeeccccc Confidence 999999998742110000 0 0000 Q ss_pred -----------------------------------------------------------------------ccc------ Q lcl|NC_019932. 100 -----------------------------------------------------------------------IGT------ 102 (389) Q Consensus 100 -----------------------------------------------------------------------~~~------ 102 (389) .++ T Consensus 157 ~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~~~~~~~~~~a~~~~~~i~~~~~g~~g~~~ 236 (641) T protein:vir:10 157 NEWEFVADEAVTAASGASAKVYRYSIRLTLTNVVGTFVPGGATTISISGSDESVDVLAWDAGNKYLEIALPAGGVTGIFA 236 (641) T ss_pred ccceeccceeeeeccCcccccccccccccccceeeecccCCcceeEecccccccccccccCCcceeeeeecCCcceeeee Confidence 000 Q ss_pred -c-cc-------ccc-----------------------------------------------chh-----------hHHH Q lcl|NC_019932. 103 -T-DE-------NGR-----------------------------------------------YTG-----------MKAL 115 (389) Q Consensus 103 -~-d~-------~~~-----------------------------------------------~tG-----------l~a~ 115 (389) . .. +.. .+| ...+ T Consensus 237 ~~~~~t~gt~~~t~a~~g~~~~~~~~~~~~~ia~aat~ag~~g~~~~v~~~~v~d~~~a~~~~~g~~~~~va~~~gts~~ 316 (641) T protein:vir:10 237 DAQVVTQGTNTAAIASSGIERRLYIGKDSGSINFAATDAVVDTNATSATISSVRNEYAEREYLPGSKWVNVAARPGTSLY 316 (641) T ss_pred eeeeccCCccceeeecccchhhhhhccccccceeeeecccccccceeeEeeeeeeeecccccccccccccccccchhhhh Confidence 0 00 000 000 0000 Q ss_pred -----------------------------HHHhhhhh------------------------------------------- Q lcl|NC_019932. 
116 -----------------------------LSAQTQLG------------------------------------------- 123 (389) Q Consensus 116 -----------------------------~~~~~~~~------------------------------------------- 123 (389) .+.+.... T Consensus 317 a~~~g~~~D~~~~lv~d~~~~~~g~~g~v~e~~~~~s~~~~~~~~~~~~~~~~~~~~~~s~~v~~~~~~~~~~~~~~~~~ 396 (641) T protein:vir:10 317 ANSVGGVNDELHVLVIDVDGKITGNPGSVLERFIGVSKASDAKTSIGEVNYYKEVIKQQSAYVYWGSHETAPFLGTAANA 396 (641) T ss_pred hhhcCCcccceEEEEEeecceeeccccceeeeeecccccCCcccccccceeeeeeeccccceEEEecccccccccccccc Confidence 00000000 Q ss_pred -----------------------------------------------------------------------------hhh Q lcl|NC_019932. 124 -----------------------------------------------------------------------------VKP 126 (389) Q Consensus 124 -----------------------------------------------------------------------------~~p 126 (389) ... T Consensus 397 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~gG~d~~~~~~~~~~~~~~~~tg~~~~~~~e~~~i 476 (641) T protein:vir:10 397 AAGDWGASALNRRYNLLRSTAGTTSFPAGAVTVGSKNNATHYYRLANGADYSASGALYNLSNVDIATAYELIEDPESQVI 476 (641) T ss_pred cccccccccccccccccccccccccccccccccCCCCcceeEEEeecCcccccccccccccchhHHHHHHHhhhhhhhcc Confidence 000 Q ss_pred hhhccc-c----ccchHHHHHHHHhhhhcC-ceeeeccCCCc---------cHHHHHHhhhc-ccCceeEEeeeeEEEEe Q lcl|NC_019932. 127 RILGVP-G----LDALEVSTALASIAQQLR-AFAYVSAWGCK---------TLSEAMAYREN-FSQRELMVIWPDFISWN 190 (389) Q Consensus 127 ~~~~ap-g----~~~~~v~~al~~~~~~~~-~~~i~d~~~~~---------t~~~a~~~~~~-~~s~~~~~~~p~~~~~~ 190 (389) .++++| + .....++.+++++|++++ +|+++|.|.+. ..+.+++|+.. 
.+|+|+++||||++++| T Consensus 477 ~~l~~~~~~~~~~~~~~v~~~~i~~ce~~~d~~ailD~p~~~~~~~~~~~~~~~~~~~~~~~~~~s~yaa~y~P~~~v~d 556 (641) T protein:vir:10 477 DYVLSGPAGADEAAAIAKATTITTIVESRKDCMAFLSPLRSDVIGVSNTTTVTENLVNYFNQLPSSNYVVFDSGYKYIYD 556 (641) T ss_pred ceeeecCCCCCcchhHHHHHHHHHHHHhcCCEEEEEcCCcccccCCCchhhHHHHHHHHHhhcCCCceEEEEeceeEeec Confidence 000110 0 111346778889999886 89999998653 23556677764 58899999999999999 Q ss_pred ecCCCceEEehhHHHHHHHHhhhccccceeccCCc---eeccceeccccccccccCCcchhhhhcccceEEEE--cCCCE Q lcl|NC_019932. 191 TTANQSETAYATARALGLRAKIDTDTGWHKTLSNV---GVNGVTGISASVFWDLQQTGTDADLLNEACVTTLI--RKDGF 265 (389) Q Consensus 191 ~~~~~~~~~p~s~~~Ag~~a~~d~~~g~~~span~---~l~gv~~~~~~~~~~~~~~~~~~~~l~~~~i~~~~--~~~G~ 265 (389) +.+++.+++||||++||+|||+|.++||||||||. .|+|+++++.. .++.|++.||++|||||+ +++|+ T Consensus 557 p~~~~~~~vPpsG~iAGv~ArtD~~rGvwkAPAn~~~~~i~gv~~l~~~------~~~~e~~~Lnp~gIN~ir~fpg~G~ 630 (641) T protein:vir:10 557 KYNDVYRYVPCNGDIAGLILETGLEEEPWFSPAGFQRGVLRNAVKLAYS------PNKTQRDRLYANRINPVVSFPGHAM 630 (641) T ss_pred ccCCceeEecCCHHHHHHHHhhhccCCceECcCCcccceeeeeeeeeEe------cChhHHhhhhhcccceEEecCCcee Confidence 99999999999999999999999999999999998 47888888654 467888999999999995 45554 Q ss_pred EEEcCccCCCCccc Q lcl|NC_019932. 266 RFWGNRTCSDDPLF 279 (389) Q Consensus 266 ~~wG~rT~~~d~~~ 279 (389) + ++.- ....+. T Consensus 631 v--~~~~-~~~~~~ 641 (641) T protein:vir:10 631 I--NNNI-AFHTKL 641 (641) T ss_pred e--ccee-eeeecC Confidence 3 2211 001011 No 38 >protein:vir:107310 Length: 581 # NCBI annotation: gp123 # Family: family:all:1196 # MgeID: mge:1550 # MgeName: Catera # Cross-refs: genbank:acc:YP_656131;genbank:gi:109393333;genbank:GeneID:4156791 Probab=100.00 E-value=3.3e-35 Score=209.60 Aligned_cols=367 Identities=11% Similarity=0.061 Sum_probs=207.6 Q ss_pred CCCCCCC------------------------------------EEEEECCCCCccccccc-ccceeeeeccccccccccc Q lcl|NC_019932. 
1 MSDYHHG------------------------------------VRVVEINDGTRTISTVS-TAIVGMVCTADDADAAAFP 43 (389) Q Consensus 1 M~~~~~G------------------------------------V~v~~v~~~~~~~~~v~-t~v~~~~g~a~~~~~~~~~ 43 (389) +.+-..| +...+.......-...+ ..++...+..... + T Consensus 149 V~~~~~g~~~~~~~~s~~gi~~~~~~l~~~~~~~~~~~gsd~~~~~~~~~~~~~~~~~~D~~t~~~~~~g~~~~-----~ 223 (581) T protein:vir:10 149 IASEQTGVPAMNRALAKKGIKTDTIRVVNPNSGQVYVLGTDYVVTRVNAGEDGEANTRDDLYTIQRVVDGGHID-----P 223 (581) T ss_pred EeccccCcccccccccccccccccccccccccCcceeccccceeeecccCccccccccccceeeeeeecccccc-----c Confidence 1111111 11111111110000000 0011111100000 0 Q ss_pred ccccEEEecchhh----hhhhcccchhHHHHHhhhcccCceEE-EEEeccccccccccccccccccc-------cccchh Q lcl|NC_019932. 44 LNEPVLLTNVLSA----IGKAGKKGTLAAALQAIADQAKPVTV-VVRVAEGATPAETTSNIIGTTDE-------NGRYTG 111 (389) Q Consensus 44 ~~~~vli~~~~~~----~~~~~~~gtl~~~v~~~~~~~~~~~~-v~~~~~~~~~~~t~~~~~~~~d~-------~~~~tG 111 (389) .+...+..+..+. ...|.+.......+...++..+...- +...............+.++.+. ..-.++ T Consensus 224 ~~v~~~~~~~~d~~~~~~v~~~~~~~~~~~~~~~~~~~g~~~~~~t~~~~~~~tn~~~~~l~~gvd~~g~tvt~~dy~~A 303 (581) T protein:vir:10 224 GDIVQLSYRYTDPNYHEVIRFTDPDDIQDFYGPAFDEAGNVQSEITLCAQLAITNGASTILACAVDPEGDTVTMGDYQNA 303 (581) T ss_pred ceEEEEEEEeecCCcceeEEeecCcchhhhhhhhhhccCccccchhhhheeeeecccceeEEeeccCCCCccchHHHHHH Confidence 0000000000000 00111111111111111111111000 00000000000001111111111 111233 Q ss_pred hHHHHHHhhhhhhhhhhhccccccchHHHHHHHHhhhhcC-----cee---eeccCCCccHHHHHHhhhcccCceeEEee Q lcl|NC_019932. 112 MKALLSAQTQLGVKPRILGVPGLDALEVSTALASIAQQLR-----AFA---YVSAWGCKTLSEAMAYRENFSQRELMVIW 183 (389) Q Consensus 112 l~a~~~~~~~~~~~p~~~~apg~~~~~v~~al~~~~~~~~-----~~~---i~d~~~~~t~~~a~~~~~~~~s~~~~~~~ 183 (389) |.+++... -..++.|+....++++.+.+++.++. 
..+ +...+...+...+.+....+++.|..+++ T Consensus 304 l~ale~~~------~~~ivv~~t~~~~v~a~l~ahv~~~s~~~~~~ravigV~g~~~~~~~~~~~~~a~~~n~~Rvvlv~ 377 (581) T protein:vir:10 304 LNKFRDED------EIAIIVAGTGAQPIQALVQQHVSAQSNNKYERRAILGMDGSVTPVPSATRIANAQSIKDQRVALIS 377 (581) T ss_pred HHHHhcCC------ceEEEEeCCCCHHHHHHHHHHHHHHHhccCCcEEEEEecCCCCCccHHHHHHhhccCCCceEEEEe Confidence 43333321 13345777777788888888876642 233 33444555666777778899999999999 Q ss_pred eeEEEEeecCCC-ceEEehhHHHHHHHHhhhccccceeccCCceeccceeccccccccccCCcchhhhhcccceEEEE-- Q lcl|NC_019932. 184 PDFISWNTTANQ-SETAYATARALGLRAKIDTDTGWHKTLSNVGVNGVTGISASVFWDLQQTGTDADLLNEACVTTLI-- 260 (389) Q Consensus 184 p~~~~~~~~~~~-~~~~p~s~~~Ag~~a~~d~~~g~~~span~~l~gv~~~~~~~~~~~~~~~~~~~~l~~~~i~~~~-- 260 (389) |+....+..... ...+|+ .++|+.+|.+.....+.+||.|++++|+.++.. .++..|.+.|+++|++++. T Consensus 378 p~~~~~~g~~~~~~v~lp~-y~~AA~vAGl~a~~~~~~slT~~~i~gi~~l~~------~~s~~e~e~ll~~Gv~~l~~~ 450 (581) T protein:vir:10 378 PSSFVYYAPELNREVVLGG-QFMAAAVAGKSVSAIAAMPLTRKVIRGFSGPAE------VQRDGEKSRESSEGLMVIEKT 450 (581) T ss_pred cCceeecCcccCceeccch-hhHHHHHHHHhhccccccCcccccccccccccc------cCCHHHHHHHHhCCeEEEEEe Confidence 998887665443 344555 233443344444445788999999999987743 3466789999999999994 Q ss_pred cCCCEEE-EcCccCCCCcccceeehhhHHHHHHHHHHHHHH--HHhhcCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEE Q lcl|NC_019932. 261 RKDGFRF-WGNRTCSDDPLFAFENYTRTAQVIADTMAEAHM--WANDKPLTPVLVRDIIAGINAKFRELVSAGYLLGASC 337 (389) Q Consensus 261 ~~~G~~~-wG~rT~~~d~~~~~i~vrR~~~~i~~~~~~~~~--~~v~e~n~~~~~~~i~~~i~~~l~~l~~~gal~g~~v 337 (389) +++|+++ ||-.|+..|+.|++|++||++|++.+.+++.++ +|+++||+..+|.+|+..+..||..||++|+|.||+. 
T Consensus 451 ~~~~v~Iv~gItT~~s~~~~~~i~~iR~~D~v~~~ir~~~~~~~fIG~~n~~~~r~~ik~~i~~~L~~l~~~g~I~~~~~ 530 (581) T protein:vir:10 451 PRNLVHVRHGVTTDPTSLHTREWNIIGQQDVMVYRIRDYLDADGLIGMPIYDTTIVQVKASAEAALVWLVDNNIIRGYRN 530 (581) T ss_pred cCCeEEEEeeeecCCCCCcceeeeeehhhhHHHHHHHHHhhhhcCCCcccCHHHHHHHHHHHHHHHHHHHhcCcccCCcc Confidence 5677776 677888889999999999999999999999975 5888999999999999999999999999999999864 Q ss_pred EEecCCCCHHHhhCCEEEEEEEEEecccceEEEEEEEEcchH--HHHHHHHhcC Q lcl|NC_019932. 338 WYDDTANDKDTLKAGKLFIDYDYTPVPPLEDLTLRQRITDSY--LANFAASVNS 389 (389) Q Consensus 338 ~~d~~~n~~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~--~~~~~~~~~~ 389 (389) . ..++.+.+.+.+++++.++|.+|+|||.+++++.++. ++.-++.... T Consensus 531 ~----~~~~~~~~~d~v~V~i~v~Pv~~i~~I~vti~~~p~~~~~~~~~~~~~~ 580 (581) T protein:vir:10 531 L----KARQIERQPDVIEVRYEWRPAYPLNYIVVRYSIAPETGDITSTIEGTTS 580 (581) T ss_pred c----eeeeeecCCCEEEEEEEEEecccceEEEEEEEEecCCCceEEEEecccc Confidence 3 2344567889999999999999999999999999883 2211111111 No 39 >protein:vir:7653 Length: 581 # NCBI annotation: gp124 # Family: family:all:1196 # MgeID: mge:148 # MgeName: Bxz1 # Cross-refs: genbank:acc:NP_818197;genbank:gi:29566631;genbank:GeneID:1259809 Probab=100.00 E-value=3.2e-35 Score=209.73 Aligned_cols=363 Identities=11% Similarity=0.079 Sum_probs=212.5 Q ss_pred CCCC-C--CCEE-----EEECCCC-----------------Ccccccccccceeeeeccccc---ccccc---------c Q lcl|NC_019932. 1 MSDY-H--HGVR-----VVEINDG-----------------TRTISTVSTAIVGMVCTADDA---DAAAF---------P 43 (389) Q Consensus 1 M~~~-~--~GV~-----v~~v~~~-----------------~~~~~~v~t~v~~~~g~a~~~---~~~~~---------~ 43 (389) ..+| + -|++ ++....+ .........-+.+......+. ....+ . 
T Consensus 158 ~~~~~l~~~g~~~~~~~~~~s~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~g~~~~~~~i~~~~~~~~D~~ 237 (581) T protein:vir:76 158 AMNRALAKKGIKTDTIRVVNPNSGQVYVLGTDYVVTRVNAGEDGEANTRDDLYTIQRVVDGGHIDPGDIVQLSYRYTDPN 237 (581) T ss_pred CcCceeeeccccccccceeecCCcceeeecccccceeeccCcccceeeeeeeeeeEeecccccccceeEEEEEEEeecCC Confidence 1112 1 1332 1010110 000000000000111000000 00000 0 Q ss_pred ccccEEEecchhhhhhh--------cccchhHHHHHhhhcccCceEEEEEeccccccccccccccccccccccchhhHHH Q lcl|NC_019932. 44 LNEPVLLTNVLSAIGKA--------GKKGTLAAALQAIADQAKPVTVVVRVAEGATPAETTSNIIGTTDENGRYTGMKAL 115 (389) Q Consensus 44 ~~~~vli~~~~~~~~~~--------~~~gtl~~~v~~~~~~~~~~~~v~~~~~~~~~~~t~~~~~~~~d~~~~~tGl~a~ 115 (389) ..+.+.+....+....+ +.+..+.......+.++....+..... +.... .....-.++|.++ T Consensus 238 ~~~~v~~~~~~~~~~~~~~~~~~~g~~~~e~~~~~~~~~t~~~~~~l~~gvd-~~g~t---------vt~~dy~~aL~al 307 (581) T protein:vir:76 238 YHEVIRFTDPDDIQDFYGPAFDEAGNVQSEITLCAQLAITNGASTILACAVD-PEGDT---------VTMGDYQNALNKF 307 (581) T ss_pred ccceEEEecccccccceeeehhhcCccccchhhhhheeeccccceEEEeeec-CCCCc---------cchHHHHHHHHHH Confidence 00111111111111010 000111111111111111111111000 00000 0001112334444 Q ss_pred HHHhhhhhhhhhhhccccccchHHHHHHHHhhhhcC-----c---eeeeccCCCccHHHHHHhhhcccCceeEEeeeeEE Q lcl|NC_019932. 116 LSAQTQLGVKPRILGVPGLDALEVSTALASIAQQLR-----A---FAYVSAWGCKTLSEAMAYRENFSQRELMVIWPDFI 187 (389) Q Consensus 116 ~~~~~~~~~~p~~~~apg~~~~~v~~al~~~~~~~~-----~---~~i~d~~~~~t~~~a~~~~~~~~s~~~~~~~p~~~ 187 (389) +... -..++.|+....++++.+.+++.++. . +++...+...+...+......+++.|..+++|+.. 
T Consensus 308 e~~~------~~~ivvp~t~~~~i~a~l~ahv~~~s~~~~~~ra~igv~g~~~~~~~~~~~~~a~~~ns~Rvvlv~p~~~ 381 (581) T protein:vir:76 308 RDED------EIAIIVAGTGAQPIQALVQQHVSAQSNNKYERRAILGMDGSVTPVPSATRIANAQSIKDQRVALISPSSF 381 (581) T ss_pred hcCC------eEEEEEecCCChHHHHHHHHHHHHHHhccCCceEEEEeeCCCCCchHHHHHHhhcccCCCcEEEEEcCce Confidence 3321 12345677777778888877775542 2 23444445556667777788899999999999998 Q ss_pred EEeecCCCceEEehhHHHHHHHHhhhccccceeccCCceeccceeccccccccccCCcchhhhhcccceEEEE--cCCCE Q lcl|NC_019932. 188 SWNTTANQSETAYATARALGLRAKIDTDTGWHKTLSNVGVNGVTGISASVFWDLQQTGTDADLLNEACVTTLI--RKDGF 265 (389) Q Consensus 188 ~~~~~~~~~~~~p~s~~~Ag~~a~~d~~~g~~~span~~l~gv~~~~~~~~~~~~~~~~~~~~l~~~~i~~~~--~~~G~ 265 (389) +++..........|..++|+.+|....+..+.+||.|++++|+.++... ++..|.+.|+++|++++. +++++ T Consensus 382 ~~~g~~~~~~~~lp~~~~AA~vAG~~a~~~~~~slT~~~i~g~~~~~~~------~s~~e~e~ll~~Gv~~l~~~~~~~v 455 (581) T protein:vir:76 382 VYYAPELNREVVLGGQFMAAAVAGKSVSAIAAMPLTRKVIRGFSGPAEV------QRDGEKSRESSEGLMVIEKTPRNLV 455 (581) T ss_pred EeccccCCcceecchhhhhhhHHhhhhccccccCccccccccccccccc------CCHHHHHHHHhCCeEEEEEecCCeE Confidence 8877655444444555666666666667778999999999999876433 466788999999999994 56777 Q ss_pred EE-EcCccCCCCcccceeehhhHHHHHHHHHHHHHH--HHhhcCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEecC Q lcl|NC_019932. 266 RF-WGNRTCSDDPLFAFENYTRTAQVIADTMAEAHM--WANDKPLTPVLVRDIIAGINAKFRELVSAGYLLGASCWYDDT 342 (389) Q Consensus 266 ~~-wG~rT~~~d~~~~~i~vrR~~~~i~~~~~~~~~--~~v~e~n~~~~~~~i~~~i~~~l~~l~~~gal~g~~v~~d~~ 342 (389) ++ ||-.|+..++.|++++++|++|++.+.+++.++ +|+++||+..+|.+|+..+..||..||+.|+|.||... 
T Consensus 456 ~Iv~gItT~~s~~~~k~i~viR~~D~v~~~vr~~~~~~~fiG~~n~~~~r~~ik~~i~~~L~~l~~~g~I~g~~~~---- 531 (581) T protein:vir:76 456 HVRHGVTTDPTSLHTREWNIIGQQDVMVYRIRDYLDADGLIGMPIYDTTIVQVKASAEAALVWLVDNNIIRGYRNL---- 531 (581) T ss_pred EEEEeeecCCCCCccceeeehhhhHHHHHHHHHHHhhhcCCCcccChHHHHHHHHHHHHHHHHHHhcCcccCcccc---- Confidence 65 888899999999999999999999999999975 58889999999999999999999999999999998632 Q ss_pred CCCHHHhhCCEEEEEEEEEecccceEEEEEEEEcchH--HHHHHHHhcC Q lcl|NC_019932. 343 ANDKDTLKAGKLFIDYDYTPVPPLEDLTLRQRITDSY--LANFAASVNS 389 (389) Q Consensus 343 ~n~~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~--~~~~~~~~~~ 389 (389) ..++...+.+++++++.++|++|+|||.+++++.++. ++.-++.... T Consensus 532 ~~~~~~~~~d~v~V~i~v~Pv~~ie~I~vt~~~~p~~~~~~~~~~~~~~ 580 (581) T protein:vir:76 532 KARQIERQPDVIEVRYEWRPAYPLNYIVVRYSIAPETGDITSTIEGTTS 580 (581) T ss_pred eeeEEecCCCEEEEEEEEEecccceEEEEEEEEeeCCCceEEEEecccc Confidence 3455567889999999999999999999999998873 2211111111 No 40 >protein:vir:95741 Length: 587 # NCBI annotation: ORF011 # Family: family:all:2449 # MgeID: mge:1577 # MgeName: G1 # Cross-refs: genbank:acc:YP_240910;genbank:gi:66394960;genbank:GeneID:5132682 Probab=100.00 E-value=1.9e-33 Score=199.98 Aligned_cols=359 Identities=13% Similarity=0.082 Sum_probs=250.8 Q ss_pred CC-------CC-CCCEEEEECCCCCcccccccccceeeeecccccccccccccccEEEecchhhhhhhcccchhHHHHHh Q lcl|NC_019932. 1 MS-------DY-HHGVRVVEINDGTRTISTVSTAIVGMVCTADDADAAAFPLNEPVLLTNVLSAIGKAGKKGTLAAALQA 72 (389) Q Consensus 1 M~-------~~-~~GV~v~~v~~~~~~~~~v~t~v~~~~g~a~~~~~~~~~~~~~vli~~~~~~~~~~~~~gtl~~~v~~ 72 (389) |+ .+ +||||+++..++.++...+++++.+|+|.+... |.++|+.+++..++...|+. |.|...+.. 
T Consensus 1 ~a~~~~~~~~~~~pgv~~~~~~~~~~~~~~~~~~~~~~~g~a~~G-----~~~~~~~~~~~~~~~~~~~~-g~l~~~~~~ 74 (587) T protein:vir:95 1 MAVEPFPRRPITRPHASIEVDTSGIGGSAGSSEKVFCLIGQAEGG-----EPNTVYELRNYSQAKRLFRS-GELLDAIEL 74 (587) T ss_pred CcccccCCcccccCceEEEEecCCccccCCCCCceEEEEEEeCCC-----CCceeEEeccHHHHHHHhcC-cchHHHHHH Confidence 43 34 599999999999999999999999999999655 67899999999999999977 456666655 Q ss_pred hh----cccCceEEEEEeccccccccccc--------------------------------------------cccc--- Q lcl|NC_019932. 73 IA----DQAKPVTVVVRVAEGATPAETTS--------------------------------------------NIIG--- 101 (389) Q Consensus 73 ~~----~~~~~~~~v~~~~~~~~~~~t~~--------------------------------------------~~~~--- 101 (389) .+ .+++..++++|+........+.. +.++ T Consensus 75 a~~~~~~~g~~~~~~~rv~~~~~a~~~~~~l~~~a~~~G~~gN~i~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~v~ 154 (587) T protein:vir:95 75 AWGSNPNYTAGRILAMRIEDAKPASAEIGGLKITSKIYGNVANNIQVGLEKNTLSDSLRLRVIFQDDRFNEVYDNIGNIF 154 (587) T ss_pred HhccccCCCceEEEEEEcCCCceeEEEecCeEEEEecccccccceEEEEecCCCCCceeEEEEEecccceeeeeecccee Confidence 55 35555555554321111000000 0000 Q ss_pred -----cc----------cc-cc------------------cchh-----hHHHHHH------------------------ Q lcl|NC_019932. 102 -----TT----------DE-NG------------------RYTG-----MKALLSA------------------------ 118 (389) Q Consensus 102 -----~~----------d~-~~------------------~~tG-----l~a~~~~------------------------ 118 (389) +. +. +. ..+| ..+..+. T Consensus 155 si~y~g~~~~~~~~v~~~~~t~~a~~~~l~~g~~~v~~yrL~~g~~~~~~~~~~~in~~~~~tAky~g~~~~~i~~~~~~ 234 (587) T protein:vir:95 155 TIKYKGEEANATFSVEHDEETQKASRLVLKVGDQEVKSYDLTGGAYDYTNAIITDINQLPDFEAKLSPFGDKNLESSKLD 234 (587) T ss_pred eeeeeccccccceeeeecccceeeeeeeeecCCceEEEEEecCCchHHHHHHHHhhccccceEEEEecccCceeEEeecC Confidence 00 00 00 0000 0000000 Q ss_pred -------------------------------------------------------------------------------- Q lcl|NC_019932. 
119 -------------------------------------------------------------------------------- 118 (389) Q Consensus 119 -------------------------------------------------------------------------------- 118 (389) T Consensus 235 ~~~~~~v~~~~~~v~a~~~d~~~~~~~~~~v~~~~~~g~~~~~~~~~~~~~~~~a~~~~~~~~~~~a~~~~t~LtGG~dG 314 (587) T protein:vir:95 235 KIENANIKDKAVYVKAVFGDLEKQTAYNGIVSFEQLNAEGEVPSNVEVEAGEESATVTATSPIKTIEPFELTKLKGGTNG 314 (587) T ss_pred cccccceehhhhhhhhhhcceeeeeeceeeeeeecccccceeccchhhhhcccchheeccccccceeccceeeeecCCCC Confidence Q ss_pred ---------hhhhhhhhhhhccccccchHHHHHHHHhhhhc-----CceeeeccCCCccHHHHHHhhhcccCceeEEeee Q lcl|NC_019932. 119 ---------QTQLGVKPRILGVPGLDALEVSTALASIAQQL-----RAFAYVSAWGCKTLSEAMAYRENFSQRELMVIWP 184 (389) Q Consensus 119 ---------~~~~~~~p~~~~apg~~~~~v~~al~~~~~~~-----~~~~i~d~~~~~t~~~a~~~~~~~~s~~~~~~~p 184 (389) ...+...-...+.|..+.+.+++++.++++++ ..+++++.+.+.+.+++...+..+++.+.+.+++ T Consensus 315 ~~~~~y~~~l~ale~~~~~~i~~~t~d~~v~a~l~a~vk~~~~~g~~~~aVvg~~~~~~~~~~~~~a~~~n~ervi~v~~ 394 (587) T protein:vir:95 315 EPPATWADKLDKFAHEGGYYIVPLSSKQSVHAEVASFVKERSDAGEPMRAIVGGGFNESKEQLFGRQESLSNPRVSLVAN 394 (587) T ss_pred CCcccHHHHHHHHHhCCcEEEEecCCCHHHHHHHHHHHHHHHhCCCcEEEEEcCCCCCCHHHHHHHHhhcCCCcEEEecc Confidence 00000000001112223345777788887665 3578888888889999999999999999988887 Q ss_pred eEEEEeecCCCceEEeh---hHHHHHHHHhhhccccceeccCCceeccceeccccccccccCCcchhhhhcccceEEEE- Q lcl|NC_019932. 185 DFISWNTTANQSETAYA---TARALGLRAKIDTDTGWHKTLSNVGVNGVTGISASVFWDLQQTGTDADLLNEACVTTLI- 260 (389) Q Consensus 185 ~~~~~~~~~~~~~~~p~---s~~~Ag~~a~~d~~~g~~~span~~l~gv~~~~~~~~~~~~~~~~~~~~l~~~~i~~~~- 260 (389) ...+. ...+....+|+ ++++||++|..| +.+||.|+++. ..++. ..++..|.+.+.++|+.++. 
T Consensus 395 ~~~~~-~~dg~~~~~~~~~~aa~vAGl~Ag~~----~~~SlT~~~i~-~~~v~------~~~t~~e~e~ai~~Gvl~l~~ 462 (587) T protein:vir:95 395 SGTFV-MDDGRKNHVPAYMVAVALGGLASGLE----IGESITFKPLR-VSSLD------QIYESIDLDELNENGIISIEF 462 (587) T ss_pred cceEe-cCCCceeeechHHHHHHHHHHHhcCc----hhcCccceeee-ccccc------ccCCHHHHHHHHhCCeEEEEE Confidence 75543 22445566777 688999999886 66799999986 34443 34578889999999999984 Q ss_pred -cCCC---EEE-EcCccC--CCCcccceeehhhHHHHHHHHHHHHH-HHHhhcCCCHHHHHHHHHHHHHHHHHHHhCCce Q lcl|NC_019932. 261 -RKDG---FRF-WGNRTC--SDDPLFAFENYTRTAQVIADTMAEAH-MWANDKPLTPVLVRDIIAGINAKFRELVSAGYL 332 (389) Q Consensus 261 -~~~G---~~~-wG~rT~--~~d~~~~~i~vrR~~~~i~~~~~~~~-~~~v~e~n~~~~~~~i~~~i~~~l~~l~~~gal 332 (389) ++++ +++ .+-.|. ..|+.|++++++|++|+|.+.+++.+ .+|+++||+...|..++..+..||.+|++.|+| T Consensus 463 ~~~~~~~~vriv~~itT~t~~~d~~~~~i~viRv~D~i~~dir~~~~~~~iGk~nn~~~r~~v~~~i~~~L~~l~~~gaI 542 (587) T protein:vir:95 463 VRNRTNTFFRIVDDVTTFNDKSDPVKAEMAVGEANDFLVSELKVQLEDQFIGTRTINTSASIIKDFIQSYLGRKKRDNEI 542 (587) T ss_pred ecCCcceEEEEeecceeccCCCCcchhhhhhhhhHHHHHHHHHHHHHhhCCccccchHHHHHHHHHHHHHHHHHHhCCcc Confidence 3332 443 444443 55778999999999999999999886 599999999999999999999999999999999 Q ss_pred eeeEEEEecCCCCHHHhhCCEEEEEEEEEecccceEEEEEEEEcchHHHH Q lcl|NC_019932. 333 LGASCWYDDTANDKDTLKAGKLFIDYDYTPVPPLEDLTLRQRITDSYLAN 382 (389) Q Consensus 333 ~g~~v~~d~~~n~~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~ 382 (389) .+|... +.+-++...++++++.++|+.|+|+|.+++.+..+-++. 
T Consensus 543 ~~~~~~-----dv~v~~~~d~~~v~~~v~Pv~~mekI~vt~~~~~~~~~~ 587 (587) T protein:vir:95 543 QDFPAE-----DVQVIVEGNEARISMTVYPIRSFKKISVSLVYKQQTLQA 587 (587) T ss_pred cCCCcc-----ceEEEecCCEEEEEEEEEEcccceEEEEEEEEeeeeecC Confidence 998542 222234566899999999999999999999999887776 No 41 >protein:vir:96586 Length: 587 # NCBI annotation: ORF011 # Family: family:all:2449 # MgeID: mge:1623 # MgeName: Twort # Cross-refs: genbank:acc:YP_238553;genbank:gi:66391266;genbank:GeneID:5130368 Probab=100.00 E-value=3.1e-33 Score=198.85 Aligned_cols=359 Identities=11% Similarity=0.053 Sum_probs=248.1 Q ss_pred CCCC-CCCEEEEECCCCCcccccccccceeeeecccccccccccccccEEEecchhhhhhhcccchhHHHHHhhh----c Q lcl|NC_019932. 1 MSDY-HHGVRVVEINDGTRTISTVSTAIVGMVCTADDADAAAFPLNEPVLLTNVLSAIGKAGKKGTLAAALQAIA----D 75 (389) Q Consensus 1 M~~~-~~GV~v~~v~~~~~~~~~v~t~v~~~~g~a~~~~~~~~~~~~~vli~~~~~~~~~~~~~gtl~~~v~~~~----~ 75 (389) |..| +||||++...++..+.+..++++.+|+|.+... |.++|+++++..++...|+. |.|..++...+ . T Consensus 8 ~~~~~~Pgv~~~~~~~~~~~~~~~~~~~~~~iG~a~~g-----~~~~~~~~~~~~~~~~~~g~-G~l~~ai~~a~~~~~~ 81 (587) T protein:vir:96 8 RRPIQRPHASIEVDSSGIGGSASNSEKILCLIGKAEGG-----EPNTVYQVRNYAQAKSVFRS-GELLDAIELAWGSNPQ 81 (587) T ss_pred CCcccCCceEEEEecCCccCCCCCCCceEEEEEEecCC-----CCceeEEEcChHHHHHhhcC-CcHHHHHHHHhccCcC Confidence 5556 589999999999999999999999999999544 67999999999999999877 34666665444 4 Q ss_pred ccCceEEEEEecccccccc----------------------------------c----ccc------ccc--------cc Q lcl|NC_019932. 76 QAKPVTVVVRVAEGATPAE----------------------------------T----TSN------IIG--------TT 103 (389) Q Consensus 76 ~~~~~~~v~~~~~~~~~~~----------------------------------t----~~~------~~~--------~~ 103 (389) +++..++.+|+........ + ... -.+ +. 
T Consensus 82 ~g~~~~~a~rv~~~~~a~~~~~~~~~~~~~~g~~~n~i~v~~~~~~~~~~~~~~~~~~~~~~~~~~~n~G~v~~i~y~g~ 161 (587) T protein:vir:96 82 YTAGKILAMRVEDAKASQLEKGGLRVTSKIFGSVSNDIQVALEKNTITDSLRLRVVFQKDNYQEVFDNLGNIFSINYKGE 161 (587) T ss_pred CCceEEEEEecCCCccceeecccccccccccCCCCceEEEEEEeccCCCccceEEEEecCCceeeccccCceEEEEeccc Confidence 5555555554321110000 0 000 000 00 Q ss_pred ----------c-----cc-----cc---------chh-----hHHHHHH---------hh-------------------- Q lcl|NC_019932. 104 ----------D-----EN-----GR---------YTG-----MKALLSA---------QT-------------------- 120 (389) Q Consensus 104 ----------d-----~~-----~~---------~tG-----l~a~~~~---------~~-------------------- 120 (389) + +. +. .+| ..+..+. ++ T Consensus 162 ~~~a~~~~~~~~~~~~A~~l~l~gg~~~v~~yrl~~g~~~~~~~~~~~~~~~~~~tAky~g~~~n~~~v~v~d~~~~~~~ 241 (587) T protein:vir:96 162 GEKATFSVEKDKETQEAKRLVLKVDEKEVKAYELNGGAYSFTNEIITDINELPDFEAKLSPFGDKNLESRKLDEATDVDI 241 (587) T ss_pred ccceeEeeccCcccceeeeeEEEecCceEEEEEeCCCchhhhhhhhhhhccccceEEEeecccCceeEEEeecccccccc Confidence 0 00 00 000 0000000 00 Q ss_pred -------------------------------------------------------------------------------- Q lcl|NC_019932. 121 -------------------------------------------------------------------------------- 120 (389) Q Consensus 121 -------------------------------------------------------------------------------- 120 (389) T Consensus 242 k~~~~y~~t~~~di~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~aLtGG~dG~~~~~y~ 321 (587) T protein:vir:96 242 KGKAVYVKAVFGDIENQTQYNQYVKFEQLPEQASEPSDVEVHAETESATVTATSKPKAIEPFELTKLSGGTNGEPPTSWS 321 (587) T ss_pred ceEEEeehhhhhhhhhhhccccceeeccccchhhhhhcccccccccceeeeecccccccccccceeeecCCCCCCcccHH Confidence Q ss_pred ----hhhhhhhhhccccccchHHHHHHHHhhhhc-----CceeeeccCCCccHHHHHHhhhcccCceeEEeeeeEEEEee Q lcl|NC_019932. 
121 ----QLGVKPRILGVPGLDALEVSTALASIAQQL-----RAFAYVSAWGCKTLSEAMAYRENFSQRELMVIWPDFISWNT 191 (389) Q Consensus 121 ----~~~~~p~~~~apg~~~~~v~~al~~~~~~~-----~~~~i~d~~~~~t~~~a~~~~~~~~s~~~~~~~p~~~~~~~ 191 (389) .+...-...+.+....+.+++.+.+++.++ ..++++..+.+.+.+++...+..+++.+.+.++++..+.+. T Consensus 322 ~~l~ale~~~~~~i~~~t~d~ai~~~l~a~vk~~r~~gk~~~aVlg~~~~~~~~~~~~~a~~~n~e~vi~v~~~~~~~~~ 401 (587) T protein:vir:96 322 AKLEKFKNEGGYYIVPLTDRQSVHSEVATFVKNRSDAGEPMRAIVGGGTSETKEKLFGRQAILNNPRVALVANSGKFVMG 401 (587) T ss_pred HHHHHHhhCCcEEEEecCCCHHHHHHHHHHHHHHHhCCCeEEEEecCCCCCCHHHHHHHHhhcCCCcEEEEecceEEecC Confidence 000000000111112235677777777665 36788888888899999999999999999988887776654 Q ss_pred cCCCceEEeh---hHHHHHHHHhhhccccceeccCCceeccceeccccccccccCCcchhhhhcccceEEEEc--CCCEE Q lcl|NC_019932. 192 TANQSETAYA---TARALGLRAKIDTDTGWHKTLSNVGVNGVTGISASVFWDLQQTGTDADLLNEACVTTLIR--KDGFR 266 (389) Q Consensus 192 ~~~~~~~~p~---s~~~Ag~~a~~d~~~g~~~span~~l~gv~~~~~~~~~~~~~~~~~~~~l~~~~i~~~~~--~~G~~ 266 (389) .+....+|+ ++++||++|..+ +.+||.|+.+.+ .++. ..++..|.+.+.+.|+.++.. +++.+ T Consensus 402 -~~~~~~~~~~~~aa~vAG~~Ag~~----~~~S~T~~~~~~-~~v~------~~~t~~e~~~~i~~G~~~l~~~~~~~~~ 469 (587) T protein:vir:96 402 -NGRILQAPAYMVASAVAGLVSGLD----IGESITFKPLFV-NSLD------KVYESEELDELNENGIITIEFVRNRMTT 469 (587) T ss_pred -CCceeeechhhHHHHHHHHHhcCc----cccCccceeeec-cccc------ccCCHHHHHHHHhCCeEEEEEecCCcEE Confidence 234444543 678899999876 778999999874 3443 235778899999999999843 44556 Q ss_pred EEcC-ccC-----CCCcccceeehhhHHHHHHHHHHHHH-HHHhhcCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEE Q lcl|NC_019932. 267 FWGN-RTC-----SDDPLFAFENYTRTAQVIADTMAEAH-MWANDKPLTPVLVRDIIAGINAKFRELVSAGYLLGASCWY 339 (389) Q Consensus 267 ~wG~-rT~-----~~d~~~~~i~vrR~~~~i~~~~~~~~-~~~v~e~n~~~~~~~i~~~i~~~l~~l~~~gal~g~~v~~ 339 (389) +|.. +++ ..++.|++|+++|++|+|.+.+++.+ .+|+++||+...|..++..+..||.+|++.|+|.+|... 
T Consensus 470 v~~~vnsitT~t~~~~~~~~~i~virv~D~i~~di~~~~~~~yiGk~nn~~~r~~v~~~i~~~L~~l~~~g~I~~~~~~- 548 (587) T protein:vir:96 470 MFRIVDDVTTFPDKNDPVKSEMALGEANDFLVSELKILLEEQYIGTRTINTSASQIKDFVQSYLGRKKRDNEIQDFPPE- 548 (587) T ss_pred EEEeeccceecCCCCCchhhhhhhHHHHHHHHHHHHHHHHhcCCccccCHHHHHHHHHHHHHHHHHHHhCCcccCCCcc- Confidence 6633 332 34667999999999999999999887 589999999999999999999999999999999998542 Q ss_pred ecCCCCHHHhhCCEEEEEEEEEecccceEEEEEEEEcchHHHH Q lcl|NC_019932. 340 DDTANDKDTLKAGKLFIDYDYTPVPPLEDLTLRQRITDSYLAN 382 (389) Q Consensus 340 d~~~n~~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~ 382 (389) +-+-.+...++++++.++|..|+|+|.+++.+..+-++. T Consensus 549 ----dv~v~~~~D~~~v~~~v~Pv~~mekIy~tv~~~~~~~~~ 587 (587) T protein:vir:96 549 ----DVQVIIEGNEARISLTIFPIRALKKISVSLVYRQQTLQA 587 (587) T ss_pred ----ceEEEecCCEEEEEEEEEEcccceEEEEEEEEEeeeecC Confidence 112223455799999999999999999999998887776 No 42 >protein:vir:99306 Length: 587 # NCBI annotation: putative major tail sheath protein # Family: family:all:2449 # MgeID: mge:1655 # MgeName: K # Cross-refs: genbank:acc:YP_024479;genbank:gi:48696438;genbank:GeneID:2948034 Probab=100.00 E-value=6.1e-33 Score=197.20 Aligned_cols=359 Identities=13% Similarity=0.075 Sum_probs=250.3 Q ss_pred CC-------CC-CCCEEEEECCCCCcccccccccceeeeecccccccccccccccEEEecchhhhhhhcccchhHHHHHh Q lcl|NC_019932. 1 MS-------DY-HHGVRVVEINDGTRTISTVSTAIVGMVCTADDADAAAFPLNEPVLLTNVLSAIGKAGKKGTLAAALQA 72 (389) Q Consensus 1 M~-------~~-~~GV~v~~v~~~~~~~~~v~t~v~~~~g~a~~~~~~~~~~~~~vli~~~~~~~~~~~~~gtl~~~v~~ 72 (389) |+ .+ +||||+++..++..+...+++++.+|+|.+... |.++++++++..++...|+. |.|.+.+.. 
T Consensus 1 ~a~~~~~~~~~~~pgv~~~~~~~~~~~~~~~~~~~~~~~g~a~~G-----~~~~~~~~~~~~~~~~~~~~-g~l~~~~~~ 74 (587) T protein:vir:99 1 MAVEPFPRRPITRPHASIEVDTSGIGGSAGSSEKVFCLIGQAEGG-----EPNTVYELRNYSQAKRLFRS-GELLDAIEL 74 (587) T ss_pred CcccccCCcccccCceEEEEecCCccccCCCCCceEEEEEEecCC-----ccceeEEeccHHHHHHHhcC-cchHHHHHH Confidence 43 34 599999999999999999999999999999655 66899999999999999977 457667655 Q ss_pred hh----cccCceEEEEEeccccccccc----------------------------cc----------------cccc--- Q lcl|NC_019932. 73 IA----DQAKPVTVVVRVAEGATPAET----------------------------TS----------------NIIG--- 101 (389) Q Consensus 73 ~~----~~~~~~~~v~~~~~~~~~~~t----------------------------~~----------------~~~~--- 101 (389) .+ .+++..++++|+........+ .. +.++ T Consensus 75 a~~~~~~~g~~~~~~~rv~~~~~a~~~~~~l~~~a~~~G~~gN~i~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~v~ 154 (587) T protein:vir:99 75 AWGSNPNYTAGRILAMRIEDAKPASAEIGGLKITSKIYGNVANNIQVGLEKNTLSDSLRLRVIFQDDRFNEVYDNIGNIF 154 (587) T ss_pred HhccccCCCceEEEEEEcCCCceeEEEecCeEEEEeeccccccceEEEEccCCCCcceeEEEEEecccceeeeeecccee Confidence 55 355555555543211110000 00 0000 Q ss_pred -----cc----------cc-cc------------------cchh-----hHHHHHH------------------------ Q lcl|NC_019932. 102 -----TT----------DE-NG------------------RYTG-----MKALLSA------------------------ 118 (389) Q Consensus 102 -----~~----------d~-~~------------------~~tG-----l~a~~~~------------------------ 118 (389) +. ++ +. ..+| ..+.... T Consensus 155 ~i~y~g~~~~a~~~v~~~~~t~~a~~~~l~~g~~~v~~yrL~~g~~~~~~~~~~~i~~~~~~tAky~~~~~~~i~~~~~~ 234 (587) T protein:vir:99 155 TIKYKGEEANATFSVEHDEETQKASRLVLKVGDQEVKSYDLTGGAYDYTNAIITDINQLPDFEAKLSPFGDKNLESSKLD 234 (587) T ss_pred eEEeecccccceeeEeecCcceeeeeeeeecCCceeEEEEecCCchHHHHHHHhhhccccceeEEeeccCCceeEeeccc Confidence 00 00 00 0000 0000000 Q ss_pred -------------------------------------------------------------------------------- Q lcl|NC_019932. 
119 -------------------------------------------------------------------------------- 118 (389) Q Consensus 119 -------------------------------------------------------------------------------- 118 (389) T Consensus 235 ~~~~~~v~~~~~~v~a~~~D~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~t~LtGG~dG 314 (587) T protein:vir:99 235 KIENANIKDKAVYVKAVFGDLEKQTAYNGIVSFEQLNAEGEVPSNVEVEAGEESATVTATSPIKTIEPFELTKLKGGTNG 314 (587) T ss_pred ccccceeeeeeeeeehhccceeeecccceeeeeeecccccchhhhhhhhhccccceeeeeccccceecccceeeecCCCC Confidence Q ss_pred ---------hhhhhhhhhhhccccccchHHHHHHHHhhhhc-----CceeeeccCCCccHHHHHHhhhcccCceeEEeee Q lcl|NC_019932. 119 ---------QTQLGVKPRILGVPGLDALEVSTALASIAQQL-----RAFAYVSAWGCKTLSEAMAYRENFSQRELMVIWP 184 (389) Q Consensus 119 ---------~~~~~~~p~~~~apg~~~~~v~~al~~~~~~~-----~~~~i~d~~~~~t~~~a~~~~~~~~s~~~~~~~p 184 (389) ...+...-...+.|..+.+.+++++.++++++ ..+++++.+.+.+.+++......+++.+.+.+.+ T Consensus 315 ~~~~sy~~al~ale~~~~~~i~~~t~d~~i~a~l~a~vk~~r~~g~~~~aVlg~~~~~~~~~~~~~a~~~n~e~vi~v~~ 394 (587) T protein:vir:99 315 EPPATWADKLDKFAHEGGYYIVPLSSKQSVHAEVASFVKERSDAGEPMRAIVGGGFNESKEQLFGRQASLSNPRVSLVAN 394 (587) T ss_pred CccccHHHHHHHHhhCCcEEEEecCCCHHHHHHHHHHHHHHHhCCCcEEEEecCCCCCCHHHHHHHhhhcCCCcEEEEec Confidence 00000000001122223345677788887665 3678888888889999999999999999988887 Q ss_pred eEEEEeecCCCceEEeh---hHHHHHHHHhhhccccceeccCCceeccceeccccccccccCCcchhhhhcccceEEEE- Q lcl|NC_019932. 185 DFISWNTTANQSETAYA---TARALGLRAKIDTDTGWHKTLSNVGVNGVTGISASVFWDLQQTGTDADLLNEACVTTLI- 260 (389) Q Consensus 185 ~~~~~~~~~~~~~~~p~---s~~~Ag~~a~~d~~~g~~~span~~l~gv~~~~~~~~~~~~~~~~~~~~l~~~~i~~~~- 260 (389) +..... ..+....+|+ ++++||++|..+ +.+||.|+.+. ..++. ..++..|.+.+.++|+.++. 
T Consensus 395 ~~~~~~-~dg~~~~~~~~~~aa~vAGl~Ag~~----~~~SlT~~~i~-~~~v~------~~~t~~e~e~li~~Gvl~l~~ 462 (587) T protein:vir:99 395 SGTFVM-DDGRKNHVPAYMVAVALGGLASGLE----IGESITFKPLR-VSSLD------QIYESIDLDELNENGIISIEF 462 (587) T ss_pred cceEec-CCCceeeechHHHHHHHHHHHhcCc----hhcCccceeee-ccccc------ccCCHHHHHHHHhCCeEEEEE Confidence 755432 2345566776 688899999886 77899999986 44543 34578889999999999984 Q ss_pred -cCC---CEEE-EcCccC--CCCcccceeehhhHHHHHHHHHHHHH-HHHhhcCCCHHHHHHHHHHHHHHHHHHHhCCce Q lcl|NC_019932. 261 -RKD---GFRF-WGNRTC--SDDPLFAFENYTRTAQVIADTMAEAH-MWANDKPLTPVLVRDIIAGINAKFRELVSAGYL 332 (389) Q Consensus 261 -~~~---G~~~-wG~rT~--~~d~~~~~i~vrR~~~~i~~~~~~~~-~~~v~e~n~~~~~~~i~~~i~~~l~~l~~~gal 332 (389) +++ ++++ .+=.|. ..|+.|++++++|++|+|.+.+++.+ .+|+++||+...|..|+..+..||.+|++.|+| T Consensus 463 ~~~~~~~~vriv~~ItT~t~~~~~~~~~i~viRv~D~i~~di~~~~~~~yiGk~Nn~~~r~~i~~~i~~~L~~l~~~gaI 542 (587) T protein:vir:99 463 VRNRTNTFFRIVDDVTTFNDKSDPVKAEMAVGEANDFLVSELKVQLEDQFIGTRTINTSASIIKDFIQSYLGRKKRDNEI 542 (587) T ss_pred ecCCcceEEEEeeceeeccCCCCchhhhhhhhhhHHHHHHHHHHHHHhhCCccccchHHHHHHHHHHHHHHHHHHhCCcc Confidence 333 2443 444443 45778999999999999999999886 589999999999999999999999999999999 Q ss_pred eeeEEEEecCCCCHHHhhCCEEEEEEEEEecccceEEEEEEEEcchHHHH Q lcl|NC_019932. 333 LGASCWYDDTANDKDTLKAGKLFIDYDYTPVPPLEDLTLRQRITDSYLAN 382 (389) Q Consensus 333 ~g~~v~~d~~~n~~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~ 382 (389) .+|... +.+-+....++++++.++|+.|+|+|.+++.+..+-++. 
T Consensus 543 ~~~~~~-----dv~v~~~~d~~~v~~~v~Pv~~mekIy~tv~~~~~~~~~ 587 (587) T protein:vir:99 543 QDFPAE-----DVQVIVEGNEARISMTVYPIRSFKKISVSLVYKQQTLQA 587 (587) T ss_pred cCCCcc-----ceEEEecCCEEEEEEEEEEcccceEEEEEEEEEeeeecC Confidence 998542 111123455799999999999999999999999987776 No 43 >protein:vir:102819 Length: 648 # NCBI annotation: tail sheath protein # Family: family:all:2449 # MgeID: mge:1610 # MgeName: YS40 # Cross-refs: genbank:acc:YP_874082;genbank:gi:118197689;genbank:GeneID:4496011 Probab=100.00 E-value=3.1e-31 Score=187.84 Aligned_cols=357 Identities=16% Similarity=0.123 Sum_probs=228.8 Q ss_pred CC--CC-------CCCEEEEECCCCCcccccccccceeeeecccccccccccccccEEEecchhhhhhhcccchhHHHHH Q lcl|NC_019932. 1 MS--DY-------HHGVRVVEINDGTRTISTVSTAIVGMVCTADDADAAAFPLNEPVLLTNVLSAIGKAGKKGTLAAALQ 71 (389) Q Consensus 1 M~--~~-------~~GV~v~~v~~~~~~~~~v~t~v~~~~g~a~~~~~~~~~~~~~vli~~~~~~~~~~~~~gtl~~~v~ 71 (389) |+ .| +|||||+++.++.++++.+++++.+|+|.+.. .|.++|++++++.++...||. +.|.+++. T Consensus 1 ma~~~yf~~~~~~~PGVyvee~~sg~~~i~gv~tsva~fvG~a~~-----Gp~~~p~~v~s~~~~~~~fgg-g~l~~av~ 74 (648) T protein:vir:10 1 MAISVYFDGKLIKQLGAYVKTDLSAVKQINGVGTGIVALLGLAEG-----GETYKPYRLTSFAEAVSIFKG-GPLLEHIK 74 (648) T ss_pred CeeeeeeCCCCccCCceEEEEeccccccccCCCCceEEEEEeeCC-----CCCceeEEecCHHHHHHHhcC-ccHHHHHH Confidence 44 34 38999999999999999999999999999954 478999999999999999986 67999999 Q ss_pred hhhcccCceEEEEEecccccccccc-------------------------cccc--------------------ccc--- Q lcl|NC_019932. 72 AIADQAKPVTVVVRVAEGATPAETT-------------------------SNII--------------------GTT--- 103 (389) Q Consensus 72 ~~~~~~~~~~~v~~~~~~~~~~~t~-------------------------~~~~--------------------~~~--- 103 (389) .+|.+++..++++|+........+. .... ... 
T Consensus 75 ~~F~nGg~~~~~vRv~~~~~a~~~~~~~~~~a~~~g~~gn~i~~~v~~~~~~~~~~~~l~v~~~~~~~~~d~~v~~i~~~ 154 (648) T protein:vir:10 75 AAFIGGAGEVVAVRIGNPTTASVSIPVAQNTSDTSPANLNFVSYEASTRSNQIYVSFDLDENFTSANEADDTIIFTIYQK 154 (648) T ss_pred HHHhCCCcEEEEEEcCCCcccceecceeEEeecccCCCCCceEEEEEEcCCCcCceeEEEEEecCCCcccceeEEEeccC Confidence 9999999999999864322211000 0000 000 Q ss_pred ---------ccccc----------c----------hhhHHHH---------------HH--------------------- Q lcl|NC_019932. 104 ---------DENGR----------Y----------TGMKALL---------------SA--------------------- 118 (389) Q Consensus 104 ---------d~~~~----------~----------tGl~a~~---------------~~--------------------- 118 (389) ..+.. + ..+.+.. .. T Consensus 155 ~~~y~gt~~~~t~~v~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~s~~~~~d~~ 234 (648) T protein:vir:10 155 HPDFSVTRETFTFPRKFTTPTVLVKRGSTLFFVDRSIVNAALAAGPAFQTALINLLKEQLQPTDVVQIFDASDTNPVDIP 234 (648) T ss_pred CCcccccceeccccccccccccccccccceeecCccchhhhhccCccchhhhhhchhhhhhhhhhheecccccccccccc Confidence 00000 0 0000000 00 Q ss_pred ----------------------hhhhhhh--------------------------------------------------- Q lcl|NC_019932. 119 ----------------------QTQLGVK--------------------------------------------------- 125 (389) Q Consensus 119 ----------------------~~~~~~~--------------------------------------------------- 125 (389) ....+.. T Consensus 235 ~~~~~~~a~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~~tp~~~~~~~~~~~~~~~~~~~~~v~~~~~~~l~~~~~ 314 (648) T protein:vir:10 235 LGLFVYEVLYGGLFGFTKSRLVKTSFGTVDDLLSNPLLFNLSATPFFDGSDYQDYTSLSDPANWFAKDAYTINHLVDTTI 314 (648) T ss_pred cccccccccchhhhcCCcchhhhhhhccccccccccceecccccccccccceeeeeccccccceeeeeccchhhcccccc Confidence 0000000 Q ss_pred -hhhhc-------------cc-------------cc--------------------------------cchHHHHHHHHh Q lcl|NC_019932. 126 -PRILG-------------VP-------------GL--------------------------------DALEVSTALASI 146 (389) Q Consensus 126 -p~~~~-------------ap-------------g~--------------------------------~~~~v~~al~~~ 146 (389) |.... 
.| .| ..+++++.++++ T Consensus 315 ~p~~~~~~~t~L~GGtdG~~p~s~~~~~~~~~~~d~~d~l~~~~~~~~~~ivp~~~~~~~~~~~~~lt~~q~i~a~a~sh 394 (648) T protein:vir:10 315 NPHILATRIFSLSGGTNGDDGTGYYQTAVSNYINIWSQGLATLEEEEVNFVIPAYKFTNVTQLNDRLTIFKGIASTFLSH 394 (648) T ss_pred cCcccccccceecccccCCCcccccccccccchhhHHHHhhhccCCCceEEEeecccccccccccccCCccchHHHHHHH Confidence 00000 00 00 112344444455 Q ss_pred hhhc----------CceeeeccCCCccHHHH--HHhhhcccCceeEE-e--------eeeEEEEeecCCCceEEeh---h Q lcl|NC_019932. 147 AQQL----------RAFAYVSAWGCKTLSEA--MAYRENFSQRELMV-I--------WPDFISWNTTANQSETAYA---T 202 (389) Q Consensus 147 ~~~~----------~~~~i~d~~~~~t~~~a--~~~~~~~~s~~~~~-~--------~p~~~~~~~~~~~~~~~p~---s 202 (389) +..+ ..+++...++..+..+. ...+..++..+... . .|+.-.+-...++...+|| . T Consensus 395 v~~~s~~~~~~~r~~~~~~vg~~~~es~~~se~~~~~~~~~~~~a~~~~~d~~~~~~~~~~~~~~~~~G~~~~~p~~~~A 474 (648) T protein:vir:10 395 VQTMSQVNRRKARVGVFGLPAPSPNESVTASEYLYNRNILNTISAMFGGTDRAQAVVFPFYSNVFNDEGKVELLGGEFFA 474 (648) T ss_pred HHHhhhccccccccCeEEEeCCCCchhHHHHHHHhhhhcccccceeeeecCCceEEeecccceeECCCCcEEecchhhHH Confidence 4322 12444444444443222 22222233322211 1 1111111123455667888 6 Q ss_pred HHHHHHHHhhhccccceeccCCceeccceeccccccccccCCcchhhhhcccceEEEEc--CC----CEEEEcCccC--- Q lcl|NC_019932. 203 ARALGLRAKIDTDTGWHKTLSNVGVNGVTGISASVFWDLQQTGTDADLLNEACVTTLIR--KD----GFRFWGNRTC--- 273 (389) Q Consensus 203 ~~~Ag~~a~~d~~~g~~~span~~l~gv~~~~~~~~~~~~~~~~~~~~l~~~~i~~~~~--~~----G~~~wG~rT~--- 273 (389) +++||+++++ .++.||.||+++++ ++.+. ...++.|.+.|++.|++++.. ++ ++++--+-|. T Consensus 475 a~VAGl~a~l----~~~~s~T~k~i~~~-~id~~----~~~t~~qld~L~~~Gv~~ie~~~~~~~~~~~rvv~gITT~~~ 545 (648) T protein:vir:10 475 SYVAGMHANR----EPQDSITFLPISGI-GAEPL----YNWTYTQKDDLISNRVLFVEKVKTSFGGIVYRIHHNPTTWLG 545 (648) T ss_pred HHHHhhhhcc----ccccCcccceeecc-ccccc----cCCCHHHHHHHhcCCcEEEEEecCCcceeeEEEeccceeecC Confidence 7788888875 58889999999843 33221 224678899999999999843 22 2334323222 Q ss_pred CCCcccceeehhhHHHHHHHHHHH-HHHHHhhcCCCHHHHHHHHHHHHHHHHHHHhCCceeeeE---EEEecCCCCHHHh Q lcl|NC_019932. 
274 SDDPLFAFENYTRTAQVIADTMAE-AHMWANDKPLTPVLVRDIIAGINAKFRELVSAGYLLGAS---CWYDDTANDKDTL 349 (389) Q Consensus 274 ~~d~~~~~i~vrR~~~~i~~~~~~-~~~~~v~e~n~~~~~~~i~~~i~~~l~~l~~~gal~g~~---v~~d~~~n~~~~i 349 (389) +.++.|+.|+++|++|++.+.+++ ...+|+++||+...|.++++.+.+||.++++.++|++|. +.++ . T Consensus 546 ~~~~~~~eisv~ri~D~l~~~vr~~l~~~fIG~~n~~~~~~~ik~~i~~~L~~~~~~~~I~~y~~~~v~~~--------~ 617 (648) T protein:vir:10 546 PVTQGFQEFVLRRIDDFLQSYVYKNLQEQFIGRKSYGRKTENDIKVYTEALLSNLVGKQIVAYKDVKVTSN--------E 617 (648) T ss_pred CCCcceeeeeeeehhhHHHHHHHHHHhhhcCcccccHHHHHHHHHHHHHHHhhHhhcCcccCcccceEEEE--------e Confidence 457889999999999999999987 555999999999999999999999999999999999974 4443 2 Q ss_pred hCCEEEEEEEEEecccceEEEEEEEEcchHH Q lcl|NC_019932. 350 KAGKLFIDYDYTPVPPLEDLTLRQRITDSYL 380 (389) Q Consensus 350 ~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~ 380 (389) +++++++++.++|.+|++||.+++..+.+.- T Consensus 618 ~~~vv~V~~~v~Pv~~i~~I~vti~it~~~~ 648 (648) T protein:vir:10 618 DKTVYYVEFFYQPVTEIKFILVTMKVTFDLE 648 (648) T ss_pred cCCEEEEEEEEEecceeeEEEEEEEEEeccC Confidence 4599999999999999999999999887743 No 44 >protein:vir:102957 Length: 437 # NCBI annotation: sheath tail protein # Family: family:all:632 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945292;genbank:gi:39653727;uniprot:Q708M0;genbank:GeneID:2672871 Probab=99.95 E-value=6.5e-29 Score=175.12 Aligned_cols=357 Identities=11% Similarity=0.064 Sum_probs=222.1 Q ss_pred CC--------CCCCCEEEEECCCCCcccccccccceeeeecccccccccccccccEEEecchhhhhhhcccchhH--HHH Q lcl|NC_019932. 1 MS--------DYHHGVRVVEINDGTRTISTVSTAIVGMVCTADDADAAAFPLNEPVLLTNVLSAIGKAGKKGTLA--AAL 70 (389) Q Consensus 1 M~--------~~~~GV~v~~v~~~~~~~~~v~t~v~~~~g~a~~~~~~~~~~~~~vli~~~~~~~~~~~~~gtl~--~~v 70 (389) |+ --+||||+.++..+.+++..+.+++.+|++.+. -.|.++|+.+++..++...||...+-. 
..+ T Consensus 1 m~gg~~~~~~k~~PGvYi~~~~~~~~~i~~~~~~~~a~~~~~~-----~Gp~~~~~~i~s~~d~~~~fG~~~~~~~~~~~ 75 (437) T protein:vir:10 1 MAGGIWKRQNKVRPGAYINVKSKDIAMTRLGGDGVVTVPLALS-----FGQSKKLMKIRRGEDLFKKLGYEQESPQLLLL 75 (437) T ss_pred CCcceecccceecCceeEEEecCCcceeeccCCcEEEEEEEec-----CCCCceeEEEecHHHHHHHcCCccchhHHHHH Confidence 33 347999999999999999999999999999883 457899999999999999999754321 222 Q ss_pred HhhhcccCceEEEEEecccccccccccc-------ccccc------------cccccc-----hh--------hHHHHHH Q lcl|NC_019932. 71 QAIADQAKPVTVVVRVAEGATPAETTSN-------IIGTT------------DENGRY-----TG--------MKALLSA 118 (389) Q Consensus 71 ~~~~~~~~~~~~v~~~~~~~~~~~t~~~-------~~~~~------------d~~~~~-----tG--------l~a~~~~ 118 (389) ...+ +++..++++|+..+.....+..+ ..+.. ++.... .| +....+. T Consensus 76 ~~~~-~g~~~~~~~R~~~g~~a~~tl~~~~~~~A~~~G~~gn~i~v~v~~~~~d~~~~~v~~~~~~~~~d~~~v~~~~~~ 154 (437) T protein:vir:10 76 NEAF-KRVSEVLLYRLNTGEKANVSLSDNVTAQAKYSGVRGNDITVTVKTNVDDPSSFDVVTFLDTVVMDLQTVKVLADL 154 (437) T ss_pred HHHh-cCCCEEEEEECCCCceeeEeeccceEEEeccCCcccceeEEEEeeccCCccceEEEEecCcceeeeeehhhhhhh Confidence 3333 56677888888665432221111 00000 000000 00 0011110 Q ss_pred hhhh----------hhhhhhhccccccc----hHHHHHHHHhhhhcCceeeeccCCCccHHHHHHhhhcc----cCceeE Q lcl|NC_019932. 119 QTQL----------GVKPRILGVPGLDA----LEVSTALASIAQQLRAFAYVSAWGCKTLSEAMAYRENF----SQRELM 180 (389) Q Consensus 119 ~~~~----------~~~p~~~~apg~~~----~~v~~al~~~~~~~~~~~i~d~~~~~t~~~a~~~~~~~----~s~~~~ 180 (389) .... ..........|.+. .....+|............++.......+.+.+|.... +-...+ T Consensus 155 ~~n~~v~~~~~~~l~~~a~~~LtGG~dg~~t~~dy~~al~~le~~~~n~l~~~~~d~~~~t~~~~~ik~~r~~~g~~~~~ 234 (437) T protein:vir:10 155 KNNALVEFSGTGELQPVAGAKLTGGTDGAISTQDYLEYFKALETVEFNYMALPVEDASIKKAAINFIKRMREDEGLGAQL 234 (437) T ss_pred hhhcccccccccccccccceeeeccccCCCChhHHHHHHHHhccCcceEEEecCCChhHHHHHHHHHHHHHhccCceEEE Confidence 0000 00011111112111 12345555543322222222222222223333342211 111111 Q ss_pred Eeee------eEE-EEeec-CCCceEEe---hhHHHHHHHHhhhccccceeccCCceeccceeccccccccccCCcchhh Q lcl|NC_019932. 
181 VIWP------DFI-SWNTT-ANQSETAY---ATARALGLRAKIDTDTGWHKTLSNVGVNGVTGISASVFWDLQQTGTDAD 249 (389) Q Consensus 181 ~~~p------~~~-~~~~~-~~~~~~~p---~s~~~Ag~~a~~d~~~g~~~span~~l~gv~~~~~~~~~~~~~~~~~~~ 249 (389) +.++ .+. +.... ......++ ..+.+||++|.+ ++.+|+.|+.+.|+..+.. .++..|.+ T Consensus 235 V~~~~~~d~e~Iin~~n~~~~~~~~~~~~~~~~a~vAG~~Ag~----~~~~S~t~~~~~~~~~v~~------~~t~~e~~ 304 (437) T protein:vir:10 235 VVADSDADSEAVINVKNGVILSDKTVIDKTKATVWVAAASANA----GVEKSLTYEKYEDSVDVVG------RLSHTETE 304 (437) T ss_pred EeCCCCCCCceEEEeecceeecCcceechhhHHHHHHHHhccC----ccccCccccccCCcccccc------cCCHHHHH Confidence 1111 111 11110 01111222 347788999877 4777999999988877643 35678888 Q ss_pred hhcccceEEEEcCCC--EEEEcCccCCC-----CcccceeehhhHHHHHHHHHHHHHH-HHhhc-CCCHHHHHHHHHHHH Q lcl|NC_019932. 250 LLNEACVTTLIRKDG--FRFWGNRTCSD-----DPLFAFENYTRTAQVIADTMAEAHM-WANDK-PLTPVLVRDIIAGIN 320 (389) Q Consensus 250 ~l~~~~i~~~~~~~G--~~~wG~rT~~~-----d~~~~~i~vrR~~~~i~~~~~~~~~-~~v~e-~n~~~~~~~i~~~i~ 320 (389) .+.++|+.++.+.+| +..+|-.|+.+ ++.|+.|.++|++|+|.+.+++.+. .|+++ ||+...|..++..++ T Consensus 305 ~~i~~G~~vl~~~~~~v~i~~gInTltt~~~~~~~~~~ki~vir~~D~i~~di~~~~~~~yiGk~~N~~~~r~~~~~~i~ 384 (437) T protein:vir:10 305 DALLKGQFVFTARRGRAVVEQDINSHVSFTIEKNQDFRKNRILRTLDDIVNDTRYAFSEYFLGKVSNNEDGRQAFKANRI 384 (437) T ss_pred HHHhCCcEEEEEeCCeEEEEEccccccccCCCCCchhhhhhHHHHHHHHHHHHHHHHHhccccccCCCHHHHHHHHHHHH Confidence 999999998866444 44478777633 5679999999999999999999877 49998 799999999999999 Q ss_pred HHHHHHHhCCceeeeEEEEecCCCCHHHhhCCEEEEEEEEEecccceEEEEEEEEc Q lcl|NC_019932. 321 AKFRELVSAGYLLGASCWYDDTANDKDTLKAGKLFIDYDYTPVPPLEDLTLRQRIT 376 (389) Q Consensus 321 ~~l~~l~~~gal~g~~v~~d~~~n~~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~ 376 (389) .||.+|+++|+|.+|.+......+.. ....+++++.++|..++|+|.+++... 
T Consensus 385 ~yl~~l~~~g~I~~~~~~d~~v~~~~---~~~~v~v~~~v~~~dame~iy~ti~v~ 437 (437) T protein:vir:10 385 RYFKDLEARGAIEDFKVEDIEVLRGE---LKESVVVNVKVKPVDSMEKLYMTVTVE 437 (437) T ss_pred HHHHHHHhCCCccCCCceeEEeecCC---CCCEEEEEEEEEEeeeeeeEEEEEEec Confidence 99999999999999988766554322 347899999999999999999999888 No 45 >protein:vir:100829 Length: 607 # NCBI annotation: hypothetical protein # Family: family:all:2449 # MgeID: mge:1633 # MgeName: LP65 # Cross-refs: genbank:acc:YP_164738;genbank:gi:56693151;genbank:GeneID:3197462 Probab=99.94 E-value=5.9e-28 Score=169.85 Aligned_cols=364 Identities=12% Similarity=0.082 Sum_probs=247.6 Q ss_pred CCCC-CCCEEEEECCCCCcccccccccceeeeecccccccccccccccEEEecchhhhhhhcccchhHHHHHhhh----- Q lcl|NC_019932. 1 MSDY-HHGVRVVEINDGTRTISTVSTAIVGMVCTADDADAAAFPLNEPVLLTNVLSAIGKAGKKGTLAAALQAIA----- 74 (389) Q Consensus 1 M~~~-~~GV~v~~v~~~~~~~~~v~t~v~~~~g~a~~~~~~~~~~~~~vli~~~~~~~~~~~~~gtl~~~v~~~~----- 74 (389) |-.+ +||||+....++..+....++++.+|+|.+... |.++++++++..++...|+. |.|...+...+ T Consensus 17 ~~~~~~pgv~~~~~~~~~~~~~~~~~~~~~~iG~a~~G-----~~~~~~~~~~~~~a~~~f~~-g~l~~a~~~a~~~~~~ 90 (607) T protein:vir:10 17 LFYDSRPHVETNFDDSRLSNTASDSAKNIFMLGSATNG-----DPTKVYEIRTSQQATKIFGS-GDLVDGIKLAFDPTGN 90 (607) T ss_pred CCCccCCceEEEEecCcCcCCCCCCcceEEEEEEeCCC-----CCceEEEEcchhHHHHhhcC-cchHHHHHHhhccccC Confidence 4444 699999999999999999999999999999544 67899999999999988876 44555554444 Q ss_pred -cccCceEEEEEeccccccc---------------------------------ccc----cc------------------ Q lcl|NC_019932. 75 -DQAKPVTVVVRVAEGATPA---------------------------------ETT----SN------------------ 98 (389) Q Consensus 75 -~~~~~~~~v~~~~~~~~~~---------------------------------~t~----~~------------------ 98 (389) .+++..++++|+....... .+. .. 
T Consensus 91 ~~~g~~~~~~~rv~~~~~a~~~~~~~~~~~~~~~~~~~~i~~~l~~~~~~~~~~~~~~~~d~~~~~~~n~g~~~~i~y~g 170 (607) T protein:vir:10 91 SVTNGGTVYALRVDNAKQASLVKDGLTFTSSIFGTNANQVSVALDNDVFGVPRITVNYSPDNYERTYTNIGQMFSITYSG 170 (607) T ss_pred CccCCceEEEEeCCCccccceecccccccccccccCCCceEEEEEecCCCccceeEEeecccceeeeeeccceeecccCc Confidence 5666667766642110000 000 00 Q ss_pred --------cc----cc---------ccccc-------------cchhhHHHHH--------------------------- Q lcl|NC_019932. 99 --------II----GT---------TDENG-------------RYTGMKALLS--------------------------- 117 (389) Q Consensus 99 --------~~----~~---------~d~~~-------------~~tGl~a~~~--------------------------- 117 (389) +. +. .+... ..+..++..+ T Consensus 171 ~~~~a~~~v~~~~~g~~~~lt~~~~~~~~~~~~V~~~~l~~~~~~t~~~l~~din~~~~~~A~~~g~~~i~tky~d~~~~ 250 (607) T protein:vir:10 171 KSASAGYTVSHDTDGKAILLTLGSGDSIDKLTNVATFDLTMSKYDTIAKLMQAISATPNFSASVVGSPSVNTSYLDEVTS 250 (607) T ss_pred ccccccceeeecCCCceeEEEecCCCccceeeeeecccccccccchHHHHHHHhhcCCceEEEEecccceeeeccccccc Confidence 00 00 00000 0000000000 Q ss_pred -------------------------------------------------------------------------------- Q lcl|NC_019932. 118 -------------------------------------------------------------------------------- 117 (389) Q Consensus 118 -------------------------------------------------------------------------------- 117 (389) T Consensus 251 ~i~V~~~~~iv~a~~~D~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~a~~~LtGGtdG~~~~ 330 (607) T protein:vir:10 251 PVDVKTAPAVVTAKIGDAISKLGYDPYVVVTQTSNNKPIVNGVSAGTGSATASVTTAPESFPANFDTAFLTGGSTGDVPV 330 (607) T ss_pred eeEEEEeeeeechhhhhhhhcccccceEEeeecccchhhhhhhhccccceeeeeeccccccccccceeeeeCCCCCCchh Confidence Q ss_pred ----HhhhhhhhhhhhccccccchHHHHHHHHhhhhc-----CceeeeccCCCccHHHHHHhhhcccCceeEEeeeeEEE Q lcl|NC_019932. 
118 ----AQTQLGVKPRILGVPGLDALEVSTALASIAQQL-----RAFAYVSAWGCKTLSEAMAYRENFSQRELMVIWPDFIS 188 (389) Q Consensus 118 ----~~~~~~~~p~~~~apg~~~~~v~~al~~~~~~~-----~~~~i~d~~~~~t~~~a~~~~~~~~s~~~~~~~p~~~~ 188 (389) +...+...-...+.+......+++++.+++.++ +..+++..+.+.+.+++......+++.+...+.|+..+ T Consensus 331 ty~dal~aLe~~e~~~i~~~t~d~ai~~~l~a~vkr~~~~g~~~~aVlg~~~~~t~~~~~t~a~~~N~ervv~V~~~~~~ 410 (607) T protein:vir:10 331 SWADKFNGAIGNNVYYIIPLTSEENIHAELQAFIDEQHVLGYNYHAFVGGGFAEPLEQILSRQVNINDSRFGLVGQSGHV 410 (607) T ss_pred hHHHHHHHHhhcCceEEEecCCCHHHHHHHHHHHHHHHhCCCcEEEEecCCCCCCHHHHHHHHHhhCCCcEEEEecCeeE Confidence 000000000000011112234667777777665 35788888888899999999999999999999888766 Q ss_pred EeecCCCceEEeh---hHHHHHHHHhhhccccceeccCCceeccceeccccccccccCCcchhhhhcccceEEEEc---- Q lcl|NC_019932. 189 WNTTANQSETAYA---TARALGLRAKIDTDTGWHKTLSNVGVNGVTGISASVFWDLQQTGTDADLLNEACVTTLIR---- 261 (389) Q Consensus 189 ~~~~~~~~~~~p~---s~~~Ag~~a~~d~~~g~~~span~~l~gv~~~~~~~~~~~~~~~~~~~~l~~~~i~~~~~---- 261 (389) .+ .+.....|+ ++++||++|..+ +.+||.|+.+. ..++.. .+...|.+.+.++|+.++.. T Consensus 411 ~~--~G~~~~~~~~~~Aa~vAGl~Ag~~----~~~SlT~k~i~-~~~v~~------~lt~~e~e~ai~~Gv~~l~~~~~~ 477 (607) T protein:vir:10 411 QE--GGESVHVPAYLMAAYVGGLSSSLG----VAVPITNKKLA-LVDLDQ------NFSGDDLNTLNQNGVIGIEHLVNR 477 (607) T ss_pred ee--CCcceeccHHHHHHHHHHHHhcCc----cccCcccceec-cccccc------cCCHHHHHHHHhCCeEEEEEccCc Confidence 44 345555654 688899999876 67799999986 445533 35778899999999998843 Q ss_pred --CCCEEEEcCccC---CCCcccceeehhhHHHHHHHHHHHHHH-HHhhcCCCHHHHHHHHHHHHHHHHHHHh--CCcee Q lcl|NC_019932. 262 --KDGFRFWGNRTC---SDDPLFAFENYTRTAQVIADTMAEAHM-WANDKPLTPVLVRDIIAGINAKFRELVS--AGYLL 333 (389) Q Consensus 262 --~~G~~~wG~rT~---~~d~~~~~i~vrR~~~~i~~~~~~~~~-~~v~e~n~~~~~~~i~~~i~~~l~~l~~--~gal~ 333 (389) +++++++.+-|. ..++.|++++++|++|+|.+.+++.+. +|++++|+...|.+++..+..+|..+|. .|+|. 
T Consensus 478 ~~~~~vrIv~~ItT~t~~~~~~~~~i~viRv~D~i~~dir~~~~~~yIGk~nnd~~~~~vk~~i~~~L~~~~l~~~gaI~ 557 (607) T protein:vir:10 478 NATGGYYIVQDVSTNTVSSSHVDGSLYLGELTDFLFDNLRFVLRDTYIGSNIRSTSADDIKSTVASYLYSEMNNDDGLIV 557 (607) T ss_pred cccceEEEeeeeeeccCCCCcchheeehhhhHHHHHHHHHHHHhhcCCcccCCcchHHHHHHHHHHHHHHHHHHhcCcee Confidence 235788766554 446789999999999999999998875 8999999999999999999999976554 68999 Q ss_pred eeEEEEecCCCCHHHhhCCEEEEEEEEEecccceEEEEEEEEcchHHHHHHHHhc Q lcl|NC_019932. 334 GASCWYDDTANDKDTLKAGKLFIDYDYTPVPPLEDLTLRQRITDSYLANFAASVN 388 (389) Q Consensus 334 g~~v~~d~~~n~~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~~~~~~~ 388 (389) +|... +-+-.....++++++.++|..++|+|.+++.+.++-++..=...- T Consensus 558 df~~e-----dv~v~~~~D~v~v~~~v~Pv~~iekIyvtv~v~~~~~~~~~~~~~ 607 (607) T protein:vir:10 558 DFSES-----DIVVTISGTVVYIQFAVAPTQEIKNIVVSGTYSNYSATSEDNTTK 607 (607) T ss_pred CCCcc-----ccEEeeCCCEEEEEEEEEEcccceEEEEEEEEEEEEEeeccCCCC Confidence 87421 112233556899999999999999999999999886553222222 No 46 >protein:vir:101326 Length: 529 # NCBI annotation: gp22 # Family: family:all:9453 # MgeID: mge:1512 # MgeName: P1 # Cross-refs: genbank:acc:YP_006525;genbank:gi:46401679;genbank:GeneID:2777386 Probab=99.90 E-value=4.4e-25 Score=154.09 Aligned_cols=355 Identities=13% Similarity=0.060 Sum_probs=199.9 Q ss_pred CCC----CCCCEEEE------------ECCCCCcccccccccceeeeecccccccccc----c-ccccEEEecc----hh Q lcl|NC_019932. 1 MSD----YHHGVRVV------------EINDGTRTISTVSTAIVGMVCTADDADAAAF----P-LNEPVLLTNV----LS 55 (389) Q Consensus 1 M~~----~~~GV~v~------------~v~~~~~~~~~v~t~v~~~~g~a~~~~~~~~----~-~~~~vli~~~----~~ 55 (389) |.. |.+|.+++ .-+.+-.+...++-+..--....+......+ . ....++-+.. 
.+ T Consensus 112 ~~~~~s~~~~s~~~~l~~G~~~~iy~~Dgd~~~s~~~~l~i~~~~ads~g~e~~~l~~~~~~~~g~~~~let~~~sl~~~ 191 (529) T protein:vir:10 112 GEPAYSALPYGSEIELDSGEAFAIYVDDGDPCISPTRELTIETATADSAGNERFLLKLTQTTSLGVVTTLETHTVSLAEE 191 (529) T ss_pred ccchhhcccccccccccccceEEEEEecCcCccCCceEEEEEeeccccCCCccceeeEEEEeecCCceEEEEEEeeeeec Confidence 222 22333322 1111101111111000000000000000000 0 0111221111 23 Q ss_pred hhhhhcccchhHHHHHhhhcccCceEEEEEeccccccc-cc--cccccccccccccchhhHHHHHHhhhh---hhhhhhh Q lcl|NC_019932. 56 AIGKAGKKGTLAAALQAIADQAKPVTVVVRVAEGATPA-ET--TSNIIGTTDENGRYTGMKALLSAQTQL---GVKPRIL 129 (389) Q Consensus 56 ~~~~~~~~gtl~~~v~~~~~~~~~~~~v~~~~~~~~~~-~t--~~~~~~~~d~~~~~tGl~a~~~~~~~~---~~~p~~~ 129 (389) +....|....++..+... +..-.-++......... .+ +-...++.|.....-.-.+...+-..+ -+.-..+ T Consensus 192 a~dd~G~~~yl~svle~~---s~~l~ai~~~e~~~t~~~~t~~d~~f~~GtdG~~~~i~s~~y~~A~~~L~n~p~d~~~i 268 (529) T protein:vir:10 192 AKDDMGRLCYLPTALEAR---SKYLRAVVNEELISTAKVTNKKSLAFTGGTNGDQSKISTAAYLRAVKVLNNAPYMYTAV 268 (529) T ss_pred hhhhcCCccchhHHHhhc---cCceeeeeeeccccccchhhhhhhhccCCccccccccchHHHHHHHHHhcCCcceeeee Confidence 334444444454443321 12111111111111100 11 112333333221100001111111111 1222344 Q ss_pred ccccccchHHHHHHHHhhhhcCceeeeccCCCccHHHHHHhhhcccC------ceeEEeeeeEEEEeecCCCceEEehhH Q lcl|NC_019932. 130 GVPGLDALEVSTALASIAQQLRAFAYVSAWGCKTLSEAMAYRENFSQ------RELMVIWPDFISWNTTANQSETAYATA 203 (389) Q Consensus 130 ~apg~~~~~v~~al~~~~~~~~~~~i~d~~~~~t~~~a~~~~~~~~s------~~~~~~~p~~~~~~~~~~~~~~~p~s~ 203 (389) +.-|.....+..+|..+|.+.+..++.|.|+..|++.|+++.+..+- .+...+|||. 
+.|+.++....+++|| T Consensus 269 l~~g~y~~a~I~~L~~ic~~~~~d~f~DV~~~LT~~aA~~~~e~~gl~~~~~~~~s~y~~P~~-~~D~~tg~k~~~GlsG 347 (529) T protein:vir:10 269 LGLGCYDNAAITALGKICADRLIDGFFDVKPTLTYAEALPAVEDTGLLGTDYVSCSVYHYPFS-CKDKWTQSRVVFGLSG 347 (529) T ss_pred eccCCccHHHHHHHHHHHhhhhhcEEEcCCCCcCHHHHHHHHHhcCccccCceeeEEEEccee-eccccccCceeeCCCc Confidence 55555567789999999999888888899999999999999987653 3456778886 8899999999999999 Q ss_pred H--HHHHH--HhhhccccceeccCCceec-----cceeccccccccccCCcchhhhhcccceEEEEcC-CC-----EEEE Q lcl|NC_019932. 204 R--ALGLR--AKIDTDTGWHKTLSNVGVN-----GVTGISASVFWDLQQTGTDADLLNEACVTTLIRK-DG-----FRFW 268 (389) Q Consensus 204 ~--~Ag~~--a~~d~~~g~~~span~~l~-----gv~~~~~~~~~~~~~~~~~~~~l~~~~i~~~~~~-~G-----~~~w 268 (389) . +|+.. ++.....|++++|||+... ||..+.. .++.|...|-...||++.-+ .| -.+| T Consensus 348 ~A~~akargv~~na~v~g~hY~pAGe~r~~inr~~I~~ly~-------~d~~e~~~lv~~riNPV~~~~~g~~~idDsLt 420 (529) T protein:vir:10 348 VAYAAKARGVKKNSDVGGWHYSPAGEERAVIARASIQPLYP-------EDTPDEEAMVKGRLNKVSVGTSGQMIIDDALT 420 (529) T ss_pred ceeeccccceeecccccccccccCCCccceeecccceeccC-------CCccCHHHHHhhccCeeeeeccCcceeeeeec Confidence 4 33222 2333444579999999633 3333321 22333334555566666432 22 3556 Q ss_pred cCccCCCCcccceeehhhHHHHHHHHHHHHHHHHhhcCCCHHHHHHHHHHHHHHHHHHHhCCceee-----------eEE Q lcl|NC_019932. 269 GNRTCSDDPLFAFENYTRTAQVIADTMAEAHMWANDKPLTPVLVRDIIAGINAKFRELVSAGYLLG-----------ASC 337 (389) Q Consensus 269 G~rT~~~d~~~~~i~vrR~~~~i~~~~~~~~~~~v~e~n~~~~~~~i~~~i~~~l~~l~~~gal~g-----------~~v 337 (389) ++++ |+-|||+|+++|+++|.+.+.+..++.+|||++..+|. 
+++-+..+|..+|+.|+|++ |.+ T Consensus 421 ~~~k---nny~R~~hv~~lmn~I~~~~~k~a~~~~~~Pd~it~~g-l~~~l~~~L~r~~asgalv~prdp~~~G~epy~~ 496 (529) T protein:vir:10 421 CCTQ---DNYLHFQHVPSLMNAISRFFVQLARQMKHSPDGITAAG-LTKGMTKLLDRFVASGALVAPRDPDADGTEPYVL 496 (529) T ss_pred eeee---CCchhhhhHHHHHHHHHHHHHHHHHHHhhCCChHHHHH-HHHhHHHHHHHHHhcCceecccCccCCCCCceEE Confidence 6664 77899999999999999999999999999999888887 99999999999999999976 333 Q ss_pred EEecCCCCHHHhhCCEEEEEEEEEecccceEEEEEEEEcc Q lcl|NC_019932. 338 WYDDTANDKDTLKAGKLFIDYDYTPVPPLEDLTLRQRITD 377 (389) Q Consensus 338 ~~d~~~n~~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~ 377 (389) .. ...+.+++.+++.++|+..+++|...=..-. T Consensus 497 ~V-------~q~d~D~~~v~~~~~ptGv~Rri~~~p~l~~ 529 (529) T protein:vir:10 497 KV-------TQAEFDKWEVVWACCPTGVARRIQGVPLLIK 529 (529) T ss_pred EE-------eecccCeEEEEEEeecCCceeeEEeeeeecC Confidence 33 1345599999999999999999987655544 No 47 >protein:vir:105470 Length: 451 # NCBI annotation: putative phage sheath tail protein # Family: family:all:632 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529880;genbank:gi:90592620;genbank:GeneID:3974534 Probab=99.86 E-value=1.5e-22 Score=140.27 Aligned_cols=355 Identities=12% Similarity=0.034 Sum_probs=210.2 Q ss_pred CCC--------CCCCEEEEECCCCCcccccccccceeeeecccccccccccccccEEEecchhhhhhhcccchh--HHHH Q lcl|NC_019932. 1 MSD--------YHHGVRVVEINDGTRTISTVSTAIVGMVCTADDADAAAFPLNEPVLLTNVLSAIGKAGKKGTL--AAAL 70 (389) Q Consensus 1 M~~--------~~~GV~v~~v~~~~~~~~~v~t~v~~~~g~a~~~~~~~~~~~~~vli~~~~~~~~~~~~~gtl--~~~v 70 (389) |+- -+||||+.++..+.+++..+++..+++++.+ ..++.+.++.+.+..++...||...+. 
...+ T Consensus 1 magg~~~~~~K~~PGvYi~~~~~~~~~~~~~~~~~~~~i~~~-----~~~g~~~~v~i~~~~d~~~~fG~~~~~~~~~~~ 75 (451) T protein:vir:10 1 MAGGTWKAQDKRRPGTYINVVGNGQREAASSLGRVLLIRDKG-----LGWGKNGVIEVEANSDFTKKLGTTLDDPSLTAL 75 (451) T ss_pred CCceeeccceeecCceEEEEeccCcceeeccCCcEEEEEeee-----cCCCCcccEEeecHHHHHHHcCCcccchhHHHH Confidence 443 4799999999999999999999999998865 344457788999999999999965442 2344 Q ss_pred HhhhcccCceEEEEEeccccccccccc--------cccccc------------cccccc--------------h--hhHH Q lcl|NC_019932. 71 QAIADQAKPVTVVVRVAEGATPAETTS--------NIIGTT------------DENGRY--------------T--GMKA 114 (389) Q Consensus 71 ~~~~~~~~~~~~v~~~~~~~~~~~t~~--------~~~~~~------------d~~~~~--------------t--Gl~a 114 (389) ...+. ++...+++++.++.....+.. -..+.. ++.... + ...+ T Consensus 76 ~~~~~-g~~~v~~yrl~~g~~a~~t~~~~~~~~~Aky~G~~Gn~i~v~v~~~~~d~~~~~v~t~~g~~~vd~qtv~~~~~ 154 (451) T protein:vir:10 76 KETLK-GASKVLVLNPNEGTAATLTKEGLPWTVTANYPGEKGNQITVSVEVSPADQNAATVSTIFGTKLVDEQSIKFNEL 154 (451) T ss_pred HHHhc-CCcEEEEEEcCCCceEEEEeecCceEEEEeeCCcCCceEEEEEecccCCcCceEEEEEECCeEEEEEEeeccch Confidence 44444 556677777766553322210 000000 000000 0 0000 Q ss_pred HHHHhhhhh-------hhhh-----hhccc--c----ccchHHHHHHHHhhhh-cCceeeeccCCC-ccH-HHHHHhh-- Q lcl|NC_019932. 115 LLSAQTQLG-------VKPR-----ILGVP--G----LDALEVSTALASIAQQ-LRAFAYVSAWGC-KTL-SEAMAYR-- 171 (389) Q Consensus 115 ~~~~~~~~~-------~~p~-----~~~ap--g----~~~~~v~~al~~~~~~-~~~~~i~d~~~~-~t~-~~a~~~~-- 171 (389) .+....... ..+. .+..+ | -+...-..+|...... .+.++ +.+... ... ..+.+|. T Consensus 155 ~el~~nd~V~a~~~~~g~~~~~~~~~l~~~~~gg~~~~~~~~~~~~l~~~e~~~~n~l~-~~~~~~~~~i~~~~~a~ik~ 233 (451) T protein:vir:10 155 DKFKGNDYITAKVVEEGSSKPVAFTNVSGTLTGGTTTESNKVESLLNDALENEEYAVVT-TAGFEPSSNMNKLVVEAVKR 233 (451) T ss_pred hhccCCceEEEEecccccccceeeeecccccccccccCCccchHHHHHHhccceeeEEE-EccCCCchHHHHHHHHHHHH Confidence 000000000 0000 00000 0 0001112233332222 12222 221111 111 1222332 Q ss_pred --hcccCceeE-Eee--------eeEE-EEeec-CCCceEEeh---hHHHHHHHHhhhccccceeccCCceeccceeccc Q lcl|NC_019932. 
172 --ENFSQRELM-VIW--------PDFI-SWNTT-ANQSETAYA---TARALGLRAKIDTDTGWHKTLSNVGVNGVTGISA 235 (389) Q Consensus 172 --~~~~s~~~~-~~~--------p~~~-~~~~~-~~~~~~~p~---s~~~Ag~~a~~d~~~g~~~span~~l~gv~~~~~ 235 (389) +.-+-..-+ ++. ..+. +.+.. ......+++ .+.+||++|.+ ++.+|+.|+.+.|+..+.. T Consensus 234 ~r~~~g~~~~aVl~~~~~~~~d~egiinv~n~~~~~dg~~~~~~~~~~~vAG~~Ag~----~~~~S~T~~~~~~~~~v~~ 309 (451) T protein:vir:10 234 LRENEGRKVRGVIPTDADTTYNYEGISTVVNGYTLSDGTNVDVKDATGYFAGISASA----DVATSLTYFEVEDAVSAYP 309 (451) T ss_pred HHHhcCCeEEEEecCccCCCCCCcceEEeecceEecCceeechhhhHHHHHHHHccc----ccccCccceecCCceeeee Confidence 222222211 211 1111 11111 111223344 37888999987 4667999999988777743 Q ss_pred cccccccCCcchhhhhcccceEEE-Ec-CCCEEE-EcCccCCC-----CcccceeehhhHHHHHHHHHHHHHHH-Hhhc- Q lcl|NC_019932. 236 SVFWDLQQTGTDADLLNEACVTTL-IR-KDGFRF-WGNRTCSD-----DPLFAFENYTRTAQVIADTMAEAHMW-ANDK- 305 (389) Q Consensus 236 ~~~~~~~~~~~~~~~l~~~~i~~~-~~-~~G~~~-wG~rT~~~-----d~~~~~i~vrR~~~~i~~~~~~~~~~-~v~e- 305 (389) .++..|.+.+.++|..++ ++ ++++++ +|-.|+.+ ++.|+.|.++|++|+|.+.+++.+.. |+++ T Consensus 310 ------~~t~~e~~~~i~~G~lvl~~~~g~~v~i~~~INTltt~~~~k~~~~~ki~vir~~D~i~~di~~~~~~~yiGk~ 383 (451) T protein:vir:10 310 ------KFDNEKTIKALDAGQIVFTTRPGQRVVIEQDINSLHKFTAEKPQAFSKNRVIRTLDEIATNTENTFERTYLGNV 383 (451) T ss_pred ------eCCHHHHHHHHhCCeEEEEEEcCCeEEEEEccccceecCCCCCcchhhhhHHHHHHHHHHHHHHHhhhccceec Confidence 356788888999999886 34 445554 78777632 55799999999999999999999874 9886 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEE-EecCCCCHHHhhCCEEEEEEEEEecccceEEEEEEEEc Q lcl|NC_019932. 306 PLTPVLVRDIIAGINAKFRELVSAGYLLGASCW-YDDTANDKDTLKAGKLFIDYDYTPVPPLEDLTLRQRIT 376 (389) Q Consensus 306 ~n~~~~~~~i~~~i~~~l~~l~~~gal~g~~v~-~d~~~n~~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~ 376 (389) ||+..-|..++..|+.||.+|+++|+|..|... .+-..+ -....+++++.++|...||+|.+.+++. 
T Consensus 384 ~N~~~gr~~~~~~i~~yl~~l~~~g~i~~~~~~d~~v~~~----~~~~~v~v~~~v~pvdame~iy~t~~v~ 451 (451) T protein:vir:10 384 GNNAAGRDLFKADRIAYLTSLQNRNMIQSFANTDITVEAG----NDMDSIVVNLAVTPVDAMEKLYMTMVVR 451 (451) T ss_pred CCCHHHHHHHHHHHHHHHHHHHhCCCccCCCccceEEeec----CCCCEEEEEEEEEEEeeeeeEEEEEEEc Confidence 699999999999999999999999999987632 111111 1357799999999999999999999888 No 48 >protein:vir:78986 Length: 436 # NCBI annotation: putative sheath tail protein # Family: family:all:632 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110731;genbank:gi:134287348;genbank:GeneID:4955160 Probab=99.64 E-value=2.3e-16 Score=106.28 Aligned_cols=352 Identities=13% Similarity=0.069 Sum_probs=199.6 Q ss_pred CCC--------CCCCEEEEECCCCCcccccccccceeeeecccccccccccccccEEEecch---hhhhhhcccchhHHH Q lcl|NC_019932. 1 MSD--------YHHGVRVVEINDGTRTISTVSTAIVGMVCTADDADAAAFPLNEPVLLTNVL---SAIGKAGKKGTLAAA 69 (389) Q Consensus 1 M~~--------~~~GV~v~~v~~~~~~~~~v~t~v~~~~g~a~~~~~~~~~~~~~vli~~~~---~~~~~~~~~gtl~~~ 69 (389) |+- -+||+|+.-...+...+......+.++...+ .-.|.++++.+++.. +....||.+.+.... T Consensus 3 magg~~~~~~K~~PG~Y~n~~~~~~~~~~~~~rGi~a~p~~~-----~wGp~~~v~~i~~~~~~~~~~~~~G~~~~~~~~ 77 (436) T protein:vir:78 3 LGGGTFVTQNKVLPGSYINFVSATRATSSLSDRGIVAMPLEL-----DWGIDEEVFQVTSDDFEKYSTKYFGYDYTHEKL 77 (436) T ss_pred ccceeeccceeecCceEEEEEecCcceeeccCCeEEEEEEEe-----cCCCCceeEEeecccchHHHHHHhcCccchHHH Confidence 222 3699999887666666667777777776655 445778888887743 556668877654322 Q ss_pred --HHhhhcccCceEEEEEecccccccccc--ccccc------------cccccccc-------------hhhHHHHHHhh Q lcl|NC_019932. 70 --LQAIADQAKPVTVVVRVAEGATPAETT--SNIIG------------TTDENGRY-------------TGMKALLSAQT 120 (389) Q Consensus 70 --v~~~~~~~~~~~~v~~~~~~~~~~~t~--~~~~~------------~~d~~~~~-------------tGl~a~~~~~~ 120 (389) +...+.+ ....+.+|+..+.....+. +...+ ..++.... .-...+.+... 
T Consensus 78 ~~l~~~~~~-~~tv~~yrl~~G~~a~~~v~~Aky~g~~gn~i~v~v~~~~~d~~~~dv~~~~g~~~~d~~~~~~~~~l~~ 156 (436) T protein:vir:78 78 KGLRDLFKN-IRLGYFYKLNKGVKASCSIATARCSGIRGNDLKVIVTTNIDDNAKFDVVTLLDNKKVDTQIAKVITELQD 156 (436) T ss_pred HHHHHHhcC-CCEEEEEECCCcceeeeeeeeeecCCCCCcEEEEEecccccccCceEEEEEecchhhhhhhHHHHhhccC Confidence 3333332 2334455554333221110 00000 00000000 00011111111 Q ss_pred h----------hhhhhhhhccccccc-----hHHHHHHHHhhhhc-CceeeeccCCCccHHHHHHhh----hcccCceeE Q lcl|NC_019932. 121 Q----------LGVKPRILGVPGLDA-----LEVSTALASIAQQL-RAFAYVSAWGCKTLSEAMAYR----ENFSQRELM 180 (389) Q Consensus 121 ~----------~~~~p~~~~apg~~~-----~~v~~al~~~~~~~-~~~~i~d~~~~~t~~~a~~~~----~~~~s~~~~ 180 (389) . ...........|.+. ..-..+|..+.... +.+++ +.......+.+.+|. +..+-..-+ T Consensus 157 n~~V~~~~~g~la~~a~~~LtGG~dG~~~T~~dy~~al~~le~~~fn~l~~-~~~d~~~~~~~~a~ikr~re~~g~~~~a 235 (436) T protein:vir:78 157 NDYVTWKKEATLEATAGLTFTNGTNGEAVTGTEYQAFLDKIESYSFNALGC-LATTAEIKSLFVEFTKRMRDKVGAKFQT 235 (436) T ss_pred CceEEEEecccccccceeeeeccccccccchHHHHHHHHHHcccceeEEEe-cCCChHHHHHHHHHHHHHHhhcCCeEEE Confidence 0 111111222333332 22344554433222 22222 221111122223332 222212211 Q ss_pred Eeee-------eEE-EEeecCCCceEE--ehhHHHHHHHHhhhccccceeccCCceeccceeccccccccccCCcchhhh Q lcl|NC_019932. 181 VIWP-------DFI-SWNTTANQSETA--YATARALGLRAKIDTDTGWHKTLSNVGVNGVTGISASVFWDLQQTGTDADL 250 (389) Q Consensus 181 ~~~p-------~~~-~~~~~~~~~~~~--p~s~~~Ag~~a~~d~~~g~~~span~~l~gv~~~~~~~~~~~~~~~~~~~~ 250 (389) +..+ .+. +-... .+..+- -..+.+||++|.++ +.+|+.|+.+.++..+.. .++..|.+. 
T Consensus 236 V~~~~~~~d~EgIInv~n~v-~g~~~~~~~~~a~vAG~~Ag~~----~~~S~T~~~~~~~~~v~~------~~t~~e~~~ 304 (436) T protein:vir:78 236 VLYKKNDADYEGVVSVENKI-KDTGLLESSLIYWTTGAIAGCD----INKSNTNKRYDGEFDVDV------NYTQIHLEE 304 (436) T ss_pred EecCCCCCCCceEEEeeccc-CCceechhHHHHHHHHHHhcCc----cccCccceecCccccccc------cCCHHHHHH Confidence 1111 111 11111 111222 24577888888775 556999999888776643 346778888 Q ss_pred hcccceEEEEcC-CCEEEE-cCccCC-----CCcccceeehhhHHHHHHHHHHHHHH-HHhhc-CCCHHHHHHHHHHHHH Q lcl|NC_019932. 251 LNEACVTTLIRK-DGFRFW-GNRTCS-----DDPLFAFENYTRTAQVIADTMAEAHM-WANDK-PLTPVLVRDIIAGINA 321 (389) Q Consensus 251 l~~~~i~~~~~~-~G~~~w-G~rT~~-----~d~~~~~i~vrR~~~~i~~~~~~~~~-~~v~e-~n~~~~~~~i~~~i~~ 321 (389) +.++|..++.+. ++.++- |=.|+. .+..|+.|.++|++|+|.+.+++.+. .|+++ ||+..-|..++..++. T Consensus 305 ai~~G~lvl~~d~~~v~I~~~VNTltt~~~~k~~~~~kI~vir~~D~i~~di~~~~~~~yiGKv~N~~dgr~~l~~~i~~ 384 (436) T protein:vir:78 305 ALKTGKFIFHKVGDEVHVLEDINTFVSFTDEKNDDFSSNQSVRVLDQIANDIATLFNTKYLGEVPNDKSGRISFWNDVVK 384 (436) T ss_pred HHhCCeEEEEEeCCeEEEEEccccceecCCCCCcchhhhhHHHHHHHHHHHHHHHhhhccccccCCCHHHHHHHHHHHHH Confidence 899999888654 555554 444542 24569999999999999999998876 59997 5999999999999999 Q ss_pred HHHHHHhCCceeeeE---EEEecCCCCHHHhhCCEEEEEEEEEecccceEEEEEEEEc Q lcl|NC_019932. 322 KFRELVSAGYLLGAS---CWYDDTANDKDTLKAGKLFIDYDYTPVPPLEDLTLRQRIT 376 (389) Q Consensus 322 ~l~~l~~~gal~g~~---v~~d~~~n~~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~ 376 (389) ||.+|.+.|+|..|. +..+.. + ....+++++.++|.-.+|+|.+++... 
T Consensus 385 yl~~L~~~g~I~~f~~~Dv~v~~~-~-----~~~~v~v~~~v~pvdamekiy~ti~v~ 436 (436) T protein:vir:78 385 HHEQLQNMRAIEDFKADDVSVEPG-S-----DKKTVVVSDAVKVISAMSKLYMTVSVS 436 (436) T ss_pred HHHHHHhCCcccCCCCcceEEeec-C-----CCCEEEEEEEEEEEEeeeeEEEEEEEC Confidence 999999999999876 333321 1 356789999999999999999999888 No 49 >protein:vir:102359 Length: 356 # NCBI annotation: XkdK protein # Family: family:all:632 # MgeID: mge:1566 # MgeName: phi CD119 # Cross-refs: genbank:acc:YP_529565;genbank:gi:90592650;genbank:GeneID:3974491 Probab=99.18 E-value=1.3e-11 Score=80.18 Aligned_cols=320 Identities=16% Similarity=0.104 Sum_probs=179.0 Q ss_pred CCCCCCCEEEEECCCCCcccccccccceeeeecccccccccccccccEEEecchhhhhhhcccchhHHHHHhhhcccCce Q lcl|NC_019932. 1 MSDYHHGVRVVEINDGTRTISTVSTAIVGMVCTADDADAAAFPLNEPVLLTNVLSAIGKAGKKGTLAAALQAIADQAKPV 80 (389) Q Consensus 1 M~~~~~GV~v~~v~~~~~~~~~v~t~v~~~~g~a~~~~~~~~~~~~~vli~~~~~~~~~~~~~gtl~~~v~~~~~~~~~~ 80 (389) |+= +|++.++=..-....+..-.-+++.++-.- ... .. ..++...+...++.. .....+...+..+... T Consensus 1 ~~g-lp~i~i~f~~~a~ta~~~g~rGiv~~il~d---~~~--~~---~~~~~~~~v~~~~~~--~n~~~i~~~~~g~~~~ 69 (356) T protein:vir:10 1 MAG-LVNINIEFKELATSFIQRSKAGIVAIILKD---TTK--MY---KELTSEDDIPISLSA--DNKKYIKYGFVGATDN 69 (356) T ss_pred CCC-CCceeEEEeecceeeccCCccceEEEEEec---CCc--ce---eEEeccccchhHHHH--HHHHHHHHHhhccccc Confidence 774 699888765555555544333344443221 000 01 111222221112211 1223333333332221 Q ss_pred EEEEEeccccccccccccccccccccccchhhHHHHHHhhhhhhhhhhhccccccchHHHHHHHHhhhhcCc-----e-e Q lcl|NC_019932. 81 TVVVRVAEGATPAETTSNIIGTTDENGRYTGMKALLSAQTQLGVKPRILGVPGLDALEVSTALASIAQQLRA-----F-A 154 (389) Q Consensus 81 ~~v~~~~~~~~~~~t~~~~~~~~d~~~~~tGl~a~~~~~~~~~~~p~~~~apg~~~~~v~~al~~~~~~~~~-----~-~ 154 (389) ........+.... ......-.+.|.+++. ...+.+..|+. ...+.+.+.++..+++. 
+ + T Consensus 70 ~~~~~p~~~~~~~--------~~t~~~y~~aL~~le~------~~fn~l~~~~~-d~~~~~~~~a~ikr~r~~~~~~~~~ 134 (356) T protein:vir:10 70 EKVLRPSKVIIST--------FTEDGKVEDILEELES------VEFNYLCMPEA-IEAEKTKIVTWIKKIREEESTEAKA 134 (356) T ss_pred cccccceeeeeec--------ccCchhHHHHHHHhcC------ccceEEEecCC-ChHHHHHHHHHHHHHHhcCCcEEEE Confidence 1111111000000 0001111223333332 23345566653 45567777777766531 1 2 Q ss_pred eeccCCCccHHHHHHhhhcccCceeEEeeeeEEEEeecCCCceE--EehhHHHHHHHHhhhccccceeccCCceecccee Q lcl|NC_019932. 155 YVSAWGCKTLSEAMAYRENFSQRELMVIWPDFISWNTTANQSET--AYATARALGLRAKIDTDTGWHKTLSNVGVNGVTG 232 (389) Q Consensus 155 i~d~~~~~t~~~a~~~~~~~~s~~~~~~~p~~~~~~~~~~~~~~--~p~s~~~Ag~~a~~d~~~g~~~span~~l~gv~~ 232 (389) ++.. . ..+.....-+-... +++ +..+ ...++.+||++|.+. .-+|+.|+.+.++.. T Consensus 135 V~~~-~------------~aD~EgIInv~n~~-~~~----g~~~t~~~~~~~vAG~~Ag~~----~n~S~T~~~~~~~~~ 192 (356) T protein:vir:10 135 VLAN-I------------KADNEAIINFTENV-VVD----GEEITAEKYTTRVASLIASTP----NTQSITYAPLDEVES 192 (356) T ss_pred EecC-C------------CCCCceeEEeecCe-Eec----ceeechhHHHHHHHHHHhccc----hhccccceecCCccc Confidence 2211 1 01222222111111 111 1122 234578999999885 455899998887554 Q ss_pred ccccccccccCCcchhhhhcccceEEEEcCCC-EEE-EcCccC---C--CCcccceeehhhHHHHHHHHHHHHHH-HHhh Q lcl|NC_019932. 233 ISASVFWDLQQTGTDADLLNEACVTTLIRKDG-FRF-WGNRTC---S--DDPLFAFENYTRTAQVIADTMAEAHM-WAND 304 (389) Q Consensus 233 ~~~~~~~~~~~~~~~~~~l~~~~i~~~~~~~G-~~~-wG~rT~---~--~d~~~~~i~vrR~~~~i~~~~~~~~~-~~v~ 304 (389) .. .+...|.+..-.+|-.++.+.+| .++ -|=.|+ + .+..|+.|.+.|++|.|.+.+++.+. 
.|++ T Consensus 193 ~~-------~~t~~e~~~ai~~G~lvl~~d~~~V~I~~~VNSltt~t~~k~~~f~Kirvvr~~D~i~~Di~~~f~~~yiG 265 (356) T protein:vir:10 193 IV-------KIDKASADAKVQAGELILRRLSGKIRIARGINSLTTLTAEKGEIFQKIKLVDTKDLISKDIKNIYVEKYLR 265 (356) T ss_pred cc-------cCCHHHHHHHHhCCeEEEEEEcCeEEEEecCccceecCCCCCcchhhhHHHHHHHHHHHHHHHHHhhcccc Confidence 32 13456777777888888866544 444 455554 1 23459999999999999999999886 6999 Q ss_pred c-CCCHHHHHHHHHHHHHHHHHHHhCCcee-eeEEEEecCCC--------------CHHHhhC----CEEEEEEEEEecc Q lcl|NC_019932. 305 K-PLTPVLVRDIIAGINAKFRELVSAGYLL-GASCWYDDTAN--------------DKDTLKA----GKLFIDYDYTPVP 364 (389) Q Consensus 305 e-~n~~~~~~~i~~~i~~~l~~l~~~gal~-g~~v~~d~~~n--------------~~~~i~~----G~~~~~i~~~p~~ 364 (389) + ||+..-+..+...++.||.+|.+.|+|. ++.++.|.+.. +...+.. -.+.+++.+.|.- T Consensus 266 Kv~N~~dgr~~l~~ai~~y~~~L~~~~~I~~~~~~eid~e~q~~~~~~~g~d~~~~~d~~v~~~~~~~~v~~~~~v~~vd 345 (356) T protein:vir:10 266 KCPNTYDNKCLFIVAVQSYLTELAKQELIDSNFTVEIDLEKQKEYLEGKKIAVSKMKENEIKEANTGSNGFYLINLKLVD 345 (356) T ss_pred ccCCCHHHHHHHHHHHHHHHHHHHhCCccccCceeEecccchHHHhhhccccccccccceeecccCCcEEEEEEEEEEEe Confidence 8 5999999999999999999999999995 67777765432 2222222 3478999999999 Q ss_pred cceEEEEEEEE Q lcl|NC_019932. 365 PLEDLTLRQRI 375 (389) Q Consensus 365 p~e~i~~~~~~ 375 (389) .+|.|.+.+.. T Consensus 346 amE~iy~ti~v 356 (356) T protein:vir:10 346 AMEDINIRVQM 356 (356) T ss_pred eeeeEEeEEeC Confidence 99999999988 No 50 >protein:vir:95263 Length: 450 # NCBI annotation: Phage conserved protein # Family: family:all:396 # MgeID: mge:1561 # MgeName: Felix 01 # Cross-refs: genbank:acc:NP_944896;genbank:gi:38707836;genbank:GeneID:2744049 Probab=98.82 E-value=3.5e-08 Score=61.47 Aligned_cols=357 Identities=13% Similarity=0.068 Sum_probs=191.2 Q ss_pred CCCCCCCEEEEECCC--CCcccccccccceeeeecccccccccccccccEEEecchhhhhhhcccchhHHHHHhhhcccC Q lcl|NC_019932. 
1 MSDYHHGVRVVEIND--GTRTISTVSTAIVGMVCTADDADAAAFPLNEPVLLTNVLSAIGKAGKKGTLAAALQAIADQAK 78 (389) Q Consensus 1 M~~~~~GV~v~~v~~--~~~~~~~v~t~v~~~~g~a~~~~~~~~~~~~~vli~~~~~~~~~~~~~gtl~~~v~~~~~~~~ 78 (389) |=+- +++|.- .+.++...+-..+.++|... .+.......++..+....||.+.........+|.+.. T Consensus 1 ~~s~-----iVnV~i~~~~~a~~~~~f~~~l~~~~~~------~~~~r~~~yss~~~V~~~FG~~S~ey~aA~~yF~q~p 69 (450) T protein:vir:95 1 MWNP-----IVNVDITLNTAGTTREGFGLPLFLASTD------NFEERVRGYTSLTEVAEDFDENTAAYKAAKQLWSQTP 69 (450) T ss_pred CCCc-----eEEEeecccccccccccceeEEEEcCCC------CCccceeeecCHHHHHHhcCCCcHHHHHHHHHHhCCC Confidence 6543 333332 34444455556666666543 2223333455666777788888888888888887755 Q ss_pred ceEEEE--Eecccc-----------------------ccccccccccccccccccchhhHHHHHHhhhh----------- Q lcl|NC_019932. 79 PVTVVV--RVAEGA-----------------------TPAETTSNIIGTTDENGRYTGMKALLSAQTQL----------- 122 (389) Q Consensus 79 ~~~~v~--~~~~~~-----------------------~~~~t~~~~~~~~d~~~~~tGl~a~~~~~~~~----------- 122 (389) .+..+. |..... ......-+.....+.....+.+.+........ T Consensus 70 ~p~~l~igr~~~~~t~~~~~~~~~~~~g~lt~tv~G~~~~~~~i~~s~a~s~~~va~~~~tai~~~~~~~~~~~~~s~g~ 149 (450) T protein:vir:95 70 KVTQLYIGRRAMQYTVSIPDAVTESTDYSITVAAGGGISQPYQYTAQSSDTAENVLQQFKTQIEADPTIKDKVSVNVTGS 149 (450) T ss_pred cccEEEEEeeccchhhhhhhhhccccceeEEEEecceeeeeeEEEEEecCChhhHHHHhhhhhcccceeeeeeeeeeecc Confidence 433222 111100 00000000000000000011111111100000 Q ss_pred -----------------hh--hhhhhccccccchHHHHHHHHhhhhcC-ceeeeccCCCccHHHHHHhhhcccC-ceeEE Q lcl|NC_019932. 123 -----------------GV--KPRILGVPGLDALEVSTALASIAQQLR-AFAYVSAWGCKTLSEAMAYRENFSQ-RELMV 181 (389) Q Consensus 123 -----------------~~--~p~~~~apg~~~~~v~~al~~~~~~~~-~~~i~d~~~~~t~~~a~~~~~~~~s-~~~~~ 181 (389) .+ ....+...|.....+..++.++..... .+++. . ...+.+++.+...-..+ .+.+. 
T Consensus 150 ~~~~t~~~~~~~~~~~~~l~~~~~~~~~~g~~aet~~~a~~a~~~~~~~w~~~~-~-~~~~~~~i~a~a~w~~a~~~~f~ 227 (450) T protein:vir:95 150 NGSATMIIAKAGDNDFVKVTTTAQTVYIASTTADTASTALAAIEAYSTDWYFIA-A-EDRTQQFVLAMASEIQARKKIFF 227 (450) T ss_pred cceeeeeeeccccchhhccccccceeEecccccccHHHHHHHHHHhhCCeEEEE-e-cCCCHHHHHHHHHHHhhcCcEEE Confidence 00 011111222222334555555443332 23222 2 22344444433221111 12222 Q ss_pred eeeeE-EEEeec--------------C--CCce-EE-------ehhHHHHHHHHhhhccccceeccCCceeccceecccc Q lcl|NC_019932. 182 IWPDF-ISWNTT--------------A--NQSE-TA-------YATARALGLRAKIDTDTGWHKTLSNVGVNGVTGISAS 236 (389) Q Consensus 182 ~~p~~-~~~~~~--------------~--~~~~-~~-------p~s~~~Ag~~a~~d~~~g~~~span~~l~gv~~~~~~ 236 (389) +..|- .+.+.. . .+.. ++ .+.+.++|.....+. | -.+..+|.+.||..-... T Consensus 228 ~~~~~~~~~~~~~~~~~~~i~~~l~~~~~~~t~~~y~~~~~~~~~~aa~~g~~~~~~~--g-~~T~~fk~l~Gv~~~v~~ 304 (450) T protein:vir:95 228 TANSDVTALQGTELASANDVPAQLAKNMYTRTVCLWHHAAAEDYPEMAYIAYGAPYDA--G-SIAWGNAQLTGVAASLQP 304 (450) T ss_pred EEcCCchhhhhhhhhcccchHHHHHhccCCeeEEEeeCCCchhHHHHHHHHHhhhccc--c-eeeeccccccceeeeccC Confidence 22211 111000 0 0111 11 133344444333322 2 124457777777754322 Q ss_pred ccccccCCcchhhhhcccceEEEEcCCC-EEEEcCccCCCCcccceeehhhHHHHHHHHHHHHHHHHhh-----c-CCCH Q lcl|NC_019932. 237 VFWDLQQTGTDADLLNEACVTTLIRKDG-FRFWGNRTCSDDPLFAFENYTRTAQVIADTMAEAHMWAND-----K-PLTP 309 (389) Q Consensus 237 ~~~~~~~~~~~~~~l~~~~i~~~~~~~G-~~~wG~rT~~~d~~~~~i~vrR~~~~i~~~~~~~~~~~v~-----e-~n~~ 309 (389) .....++..|.+.|..+|+|.....+| -.++.++|++++ ||-++|..+|++..|+..+..++- + |-|. 
T Consensus 305 -~~~~~lt~~~~~al~~~~~n~y~~~~~~~~~~~G~~~~G~----~iD~~~~~~wl~~~iq~~l~~ll~~~~~~KiPy~~ 379 (450) T protein:vir:95 305 -SNQRPLTSIQKSALDVRHCNFIDLDGGVPVVRRGITSGGE----WIDIIRGVDWLESDLKTSLRDLLINQKGGKITYDD 379 (450) T ss_pred -ccccccchHHHHHHHhCCcEEEEEecCceeeeCCeeeCcc----hhHHHHHHHHHHHHHHHHHHHHHHhcCCCCCccCh Confidence 222446788899999999998866555 456888888873 788999999999999999887662 2 6777 Q ss_pred HHHHHHHHHHHHHHHHHHhCCceeeeEEEEec-CCCCHHHhhCCEEE-EEEEEEecccceEEEEEEEEcch Q lcl|NC_019932. 310 VLVRDIIAGINAKFRELVSAGYLLGASCWYDD-TANDKDTLKAGKLF-IDYDYTPVPPLEDLTLRQRITDS 378 (389) Q Consensus 310 ~~~~~i~~~i~~~l~~l~~~gal~g~~v~~d~-~~n~~~~i~~G~~~-~~i~~~p~~p~e~i~~~~~~~~~ 378 (389) .-...|+..|+.-|++..++|.|.||+|.... +..++.++.++++. +.+.+.....++.++++....=+ T Consensus 380 ~G~~~i~a~i~~~l~~a~~~G~Ia~~~V~~~~~~~~~~~dr~~R~~~~i~~~~~laGAIh~~~i~~~v~~~ 450 (450) T protein:vir:95 380 TGITRIRQVIETSLQRAVNRNFLSSYTVNVPKASQVALADKKARILKDVTFAGILAGAILDVDLKGTVAYE 450 (450) T ss_pred hhHHHHHHHHHHHHHHHHhcCcccceeEecCChHhcCHHHHhccCCCCeeEEEEEccceEEEEEEEEEEeC Confidence 88888999999999999999999999988764 67788998888865 88888889999998887766544 No 51 >protein:vir:3751 Length: 376 # NCBI annotation: orf23 # Family: family:all:669 # MgeID: mge:79 # MgeName: HP1 # Cross-refs: genbank:acc:NP_043492;genbank:gi:9628627;genbank:GeneID:1261129 Probab=98.77 E-value=2.7e-08 Score=62.03 Aligned_cols=340 Identities=11% Similarity=0.011 Sum_probs=199.9 Q ss_pred CCCCEEEEECCCCCcccccccccceeeeecccccccccccccccEEEecchhhhhhhcc-cchhHHHHHhhhcccCceEE Q lcl|NC_019932. 4 YHHGVRVVEINDGTRTISTVSTAIVGMVCTADDADAAAFPLNEPVLLTNVLSAIGKAGK-KGTLAAALQAIADQAKPVTV 82 (389) Q Consensus 4 ~~~GV~v~~v~~~~~~~~~v~t~v~~~~g~a~~~~~~~~~~~~~vli~~~~~~~~~~~~-~gtl~~~v~~~~~~~~~~~~ 82 (389) -+|-|.|.+.+.+--++..+.- ..-|+|+++..... ...+....++...+|+ +..|...+.+...+.|..-. 
T Consensus 1 ~~~~v~vn~ln~~qg~~~~ver-~~lfig~~~~~~~~------~~~~~~~sdld~~lg~~ds~lk~~v~aa~~naG~~w~ 73 (376) T protein:vir:37 1 MFPSVQINALNQLSGETKEIER-HALFVGVGTTNQGK------LLALTPDSDFDKVFGETDTDLKKQVRAAMLNAGQNWF 73 (376) T ss_pred CCCeEEEeeeeccCCCcccccc-eEEEeeccccccCc------eEEecCCCChHHhhCCCchhHHHHHHHHHhCCCCceE Confidence 4577888888777777766654 55688887644332 3334456677777764 46677777777777654321 Q ss_pred EEEeccccccccccccccccccccccchhhHHHHHHhhhhhhhhhhhccccccchHHHHHHHHh----hhh--cCceeee Q lcl|NC_019932. 83 VVRVAEGATPAETTSNIIGTTDENGRYTGMKALLSAQTQLGVKPRILGVPGLDALEVSTALASI----AQQ--LRAFAYV 156 (389) Q Consensus 83 v~~~~~~~~~~~t~~~~~~~~d~~~~~tGl~a~~~~~~~~~~~p~~~~apg~~~~~v~~al~~~----~~~--~~~~~i~ 156 (389) ........+ ..+-+.+++.+.....+.--.+..|--+......++.+. ..+ +..++++ T Consensus 74 a~~~~p~~~----------------~~~~~~Av~~a~~~~s~E~V~v~~p~~t~~a~i~a~qa~a~el~~~~~R~vffil 137 (376) T protein:vir:37 74 AHVYIAQED----------------GYDFVECVKKANQTASFEYCVNTRYLGVDKASIGKLQECYAELLAKFGRRTFFIQ 137 (376) T ss_pred EEEEecCCC----------------hhhHHHHHHHHHhhCCeeEEEEecCcchhHHHHHHHHHHHHHHHHhcCCeEEEEE Confidence 111111100 012234444443322222222333322233333333333 333 2456666 Q ss_pred ccCC-------CccHHHH----HHhhhcccCceeEEeeeeEEEEeecCCCceEEehhHHHHHHHHhhhccccceeccCCc Q lcl|NC_019932. 157 SAWG-------CKTLSEA----MAYRENFSQRELMVIWPDFISWNTTANQSETAYATARALGLRAKIDTDTGWHKTLSNV 225 (389) Q Consensus 157 d~~~-------~~t~~~a----~~~~~~~~s~~~~~~~p~~~~~~~~~~~~~~~p~s~~~Ag~~a~~d~~~g~~~span~ 225 (389) .++. +.+.++- .+-++++.+.+..++.. .|. -..|.+||.+|.. ..-+..||... 
T Consensus 138 e~~g~d~~~~~ge~w~~y~~~l~a~~~gia~~~V~vV~~---~~g---------n~~G~~aGRl~na--aVsVadspgRV 203 (376) T protein:vir:37 138 AVQGINHDQSDGETWDQYVQKLTTLQQTIVADHVCLVPL---LFG---------NETGVLAGRLANR--AVTVADSPARV 203 (376) T ss_pred eccCCCCcccccCCHHHHHHHHHHHhccccccceeeeee---ecc---------chHHHHHHHHHhC--CcchhcCccce Confidence 6542 1233332 23345667766665532 121 2367889988753 22357888876 Q ss_pred eeccceecc---cccc-ccccCCcchhhhhcccceEEEEc--C-CCEEEEcCccCCCC-cccceeehhhHHHHHHHHHHH Q lcl|NC_019932. 226 GVNGVTGIS---ASVF-WDLQQTGTDADLLNEACVTTLIR--K-DGFRFWGNRTCSDD-PLFAFENYTRTAQVIADTMAE 297 (389) Q Consensus 226 ~l~gv~~~~---~~~~-~~~~~~~~~~~~l~~~~i~~~~~--~-~G~~~wG~rT~~~d-~~~~~i~vrR~~~~i~~~~~~ 297 (389) .-..+.++. .+.+ ....++......|..+|..+... + .|+-.=.+||++.+ +.+++|..+|.+|-+.|.++. T Consensus 204 ~tGai~gl~~~~~p~d~~g~el~~a~l~aLd~arysvpr~Y~gydG~Yw~dg~tl~~~gsDYq~ie~~RVvdKa~R~vR~ 283 (376) T protein:vir:37 204 QTGALVSLGSANKPLDKDGNELTLAHLKSLETARYSVPMWYPDYDGYYRADGRTLDVEGGDYQVIENLRVVDKVARKVRL 283 (376) T ss_pred eecccccccccccccccCCcccchHHHHHHHhCCCeEEEeeCCCCceEEeCCeEeccCCCCeeeehhchHHHHHHHHHHH Confidence 543344432 2221 12223556667799999998743 4 46666678888643 569999999999999999887 Q ss_pred HHHHHhhcC---CCHHHHHHHHHHHHHHHHHHHhCCceeeeE--EEEecCCCCHHHh-----hCCEEEEEEEEEecccce Q lcl|NC_019932. 298 AHMWANDKP---LTPVLVRDIIAGINAKFRELVSAGYLLGAS--CWYDDTANDKDTL-----KAGKLFIDYDYTPVPPLE 367 (389) Q Consensus 298 ~~~~~v~e~---n~~~~~~~i~~~i~~~l~~l~~~gal~g~~--v~~d~~~n~~~~i-----~~G~~~~~i~~~p~~p~e 367 (389) ..-..+..+ .++.-.+..+..+..=|+.|.+.+-|.|.. =+|..-.. +|| ...++.+-+.++|.--.+ T Consensus 284 ~Ai~~i~Dr~lnstp~sia~~~~~~~~pLr~M~ks~ei~g~~fpgei~~P~d--~dI~i~w~sk~~V~I~~~vrPy~cpk 361 (376) T protein:vir:37 284 LAIGKIADRSFNSTTSSTEYHKNYFAKPLRDMSKSATINGKDFPGECMPPKD--DAITIVWQSKTKVTIYIKVRPYDCPK 361 (376) T ss_pred HHHHHhcCccccCChhHHHHHHHHHhHHHHHHHhhhhhccccccceeecCCC--CceEEEeccCceEEEEEEEeeecCcc Confidence 666555543 366677888888888999999999998843 22321111 122 356788888999998899 Q ss_pred EEEEEEEEcchHHHH Q lcl|NC_019932. 
368 DLTLRQRITDSYLAN 382 (389) Q Consensus 368 ~i~~~~~~~~~~~~~ 382 (389) .|+..+..|-+-+.+ T Consensus 362 ~i~~~I~LDls~~~~ 376 (376) T protein:vir:37 362 EITANIFLDLDSLGE 376 (376) T ss_pred eeEEEEEEecCCCCC Confidence 999999887664443 No 52 >protein:vir:5260 Length: 502 # NCBI annotation: hypothetical protein # Family: family:all:396 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852765;genbank:gi:31544040;uniprot:Q7Y5T5;genbank:GeneID:2753559 Probab=98.72 E-value=8.7e-08 Score=59.30 Aligned_cols=361 Identities=15% Similarity=0.060 Sum_probs=191.3 Q ss_pred CCCCCCCEEEEECCCCCcccccccccceeeeecccccccccccccccEE-EecchhhhhhhcccchhHHHHHhhhcccCc Q lcl|NC_019932. 1 MSDYHHGVRVVEINDGTRTISTVSTAIVGMVCTADDADAAAFPLNEPVL-LTNVLSAIGKAGKKGTLAAALQAIADQAKP 79 (389) Q Consensus 1 M~~~~~GV~v~~v~~~~~~~~~v~t~v~~~~g~a~~~~~~~~~~~~~vl-i~~~~~~~~~~~~~gtl~~~v~~~~~~~~~ 79 (389) |+-++--|.=+.+.-.+..+....-....++++.... ..+...+.++ .++..+....||.+.........+|.+... T Consensus 1 msip~s~ivnV~i~~~~~a~~~~~f~~~l~l~~~~~~--~~~~~~~r~~~~~s~~~V~~~FG~~s~ey~aA~~yF~q~p~ 78 (502) T protein:vir:52 1 MALSISHIVNVQLNTVPKSAARKSFGIVALFTPEAGQ--AFADEKTRYVYVENQRDVEQLFGTNSETAKAAQPFFAQSPR 78 (502) T ss_pred CCCCccceeEEeeccccccccccccCceEEEeeccCc--cccCCccceEEecCHHHHHHhcCCChHHHHHHHHHhcCCCc Confidence 8877644433333344455555556666777654321 1122223333 344455666677666666666666655432 Q ss_pred eEE--EEEecccccc-----------ccc----------cccccccccc----------------cccc----------- Q lcl|NC_019932. 80 VTV--VVRVAEGATP-----------AET----------TSNIIGTTDE----------------NGRY----------- 109 (389) Q Consensus 80 ~~~--v~~~~~~~~~-----------~~t----------~~~~~~~~d~----------------~~~~----------- 109 (389) +.. +.|....... ..+ ........+. .... 
T Consensus 79 P~~l~igR~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~G~l~i~i~g~~~t~~~i~lS~~ts~~~vA~~i~~~l~~~~ 158 (502) T protein:vir:52 79 AKQLIVARWQKSASTIEATKNTLSGATLSDDLERFKSVVNGRFSLTIGGDVKKVDGLSFARLADFNAVATKIQEKLTTLS 158 (502) T ss_pred cceEEEEeccccccceeechhhhhhhhhHHhHHHhhhhcCceeEEEecceeeeeeccccccccchhHHHHHHHhhhcccc Confidence 221 1111100000 000 0000000000 0000 Q ss_pred -------------------------------------hh--hHHHHHHhhhhhhhhhhhccccccchHHHHHHHHhhhhc Q lcl|NC_019932. 110 -------------------------------------TG--MKALLSAQTQLGVKPRILGVPGLDALEVSTALASIAQQL 150 (389) Q Consensus 110 -------------------------------------tG--l~a~~~~~~~~~~~p~~~~apg~~~~~v~~al~~~~~~~ 150 (389) +| +..+.........++......|........+|.++.... T Consensus 159 ~~~tv~~d~~~~~F~i~s~ttg~~~~~~~~~a~~~~~~gt~~a~~l~l~~~~~av~v~~~~~g~~aet~~~al~a~~~~~ 238 (502) T protein:vir:52 159 VAVSIAYDETGNRFIVSANVAGEDKKTEIDYAIDEGGEGEYIGALLKLENGQASRKVGKNSVSLKKETLGEALFNVAEVN 238 (502) T ss_pred cceEEEEecCCceEEEEeccCCCcceeEEEEeecCCcchhHHHHHhccccccceeeeeeecccccccCHHHHHHHHHhcc Confidence 00 000000000000000001112222333445555444333 Q ss_pred CceeeeccCCCccHHHHH---HhhhcccCceeEEeeeeEE-EEeec----------CC--Cce-E-----EehhHHHHHH Q lcl|NC_019932. 151 RAFAYVSAWGCKTLSEAM---AYRENFSQRELMVIWPDFI-SWNTT----------AN--QSE-T-----AYATARALGL 208 (389) Q Consensus 151 ~~~~i~d~~~~~t~~~a~---~~~~~~~s~~~~~~~p~~~-~~~~~----------~~--~~~-~-----~p~s~~~Ag~ 208 (389) .....+......+.+++. +|.+.-+ +.+.+..+-. +.+.. .+ +.. + -.+.+.+.|. T Consensus 239 ~~w~~~~~a~~~~~~~~la~a~~iea~~--~~f~~~~~d~~~~~~~~~~i~~~l~a~~~~~t~~~y~~~~~~~~aa~~g~ 316 (502) T protein:vir:52 239 NTWYGFTVAAQLTDSEVEAAAKYAQANT--KLFGANVIRAEQIEWSADNIYKKLYDAGLDHTLAMFDKNDMYPVSSALAR 316 (502) T ss_pred CceEEEEEeecCChhHHHHHHHHHhhcC--cEEEEEecCcceeccccchHHHHHHhccCceeEEEecCCcchhHHHHHHH Confidence 222222222223334333 3333211 1122211100 00000 00 000 1 1244556788 Q ss_pred HHhhhcccc-ceeccCCceeccceeccccccccccCCcchhhhhcccceEEEEcCCCE-EEEcCccCCCCcccceeehhh Q lcl|NC_019932. 
209 RAKIDTDTG-WHKTLSNVGVNGVTGISASVFWDLQQTGTDADLLNEACVTTLIRKDGF-RFWGNRTCSDDPLFAFENYTR 286 (389) Q Consensus 209 ~a~~d~~~g-~~~span~~l~gv~~~~~~~~~~~~~~~~~~~~l~~~~i~~~~~~~G~-~~wG~rT~~~d~~~~~i~vrR 286 (389) ++.+|...- -.....+|.+.||... .++..|++.|..+|+|+..+.+|. .+..+++++++ ||-+.+ T Consensus 317 ~as~~f~~~~g~iT~~fk~l~GV~~~--------~lt~t~~~al~~~~~N~y~~~~~~~~~~~G~~~~G~----~iD~~~ 384 (502) T protein:vir:52 317 LLSTNFAANNSTLTLKFKQQPTITAD--------EITATEFAKAKRLGINVYTYFDDVAMIAEGTVIGGK----FADEIV 384 (502) T ss_pred HHhcCCCcCcceeeecccccCCcccC--------cCCHHHHHHHHhcCceEEEEecCeeEEecCeeeCCc----hhhHHH Confidence 887775432 3344566777777643 257888999999999998665553 45688888874 777888 Q ss_pred HHHHHHHHHHHHHHHHhhc-----CCCHHHHHHHHHHHHHHHHHHHhCCcee--------------------eeEEEEe- Q lcl|NC_019932. 287 TAQVIADTMAEAHMWANDK-----PLTPVLVRDIIAGINAKFRELVSAGYLL--------------------GASCWYD- 340 (389) Q Consensus 287 ~~~~i~~~~~~~~~~~v~e-----~n~~~~~~~i~~~i~~~l~~l~~~gal~--------------------g~~v~~d- 340 (389) -.+|++..++..+...++. |-|..=...|+..|+.-|++-+++|.|. ||.+... T Consensus 385 ~~~Wl~~~lq~~l~~~L~~s~~kIPy~~~G~~~l~a~i~~~l~~a~~~G~I~~G~~~~~~~g~~~~~d~~~~gy~v~~~~ 464 (502) T protein:vir:52 385 ILDWFVDAVQKEVFARLYKSPTKIPLTDKGQAILIAAVEKVCLEGINNGAFAPGKWTGAGFGNLSTGDYLDKGFYVWAAP 464 (502) T ss_pred HHHHHHHHHHHHHHHHHHhcCCCcccChhHHHHHHHHHHHHHHHHHhcCccccccccCcccceeeecccccCceEEEeCc Confidence 9999999999888776652 4577778999999999999999999984 5777665 Q ss_pred cCCCCHHHhhCCEE-EEEEEEEecccceEEEEEEEEcc Q lcl|NC_019932. 341 DTANDKDTLKAGKL-FIDYDYTPVPPLEDLTLRQRITD 377 (389) Q Consensus 341 ~~~n~~~~i~~G~~-~~~i~~~p~~p~e~i~~~~~~~~ 377 (389) .++.++.++.+++. 
-+.+.+.+...+++|++.+..++ T Consensus 465 ~~~~s~~dr~~R~~~~~~~~~~~aGaIh~v~i~~nv~~ 502 (502) T protein:vir:52 465 MDTLSDSDRQARRATPIQTAVKLAGAIHSSDVIVNYNR 502 (502) T ss_pred hhhCCHHHHHcccCCCeEEEEEECceEEEEEEEEEEeC Confidence 56789999999998 89999999999999999998888 No 53 >protein:vir:3788 Length: 376 # NCBI annotation: tail sheath # Family: family:all:669 # MgeID: mge:328 # MgeName: HP2 # Cross-refs: genbank:acc:NP_536828;genbank:gi:17981837;genbank:GeneID:929216 Probab=98.70 E-value=4.7e-08 Score=60.78 Aligned_cols=342 Identities=11% Similarity=0.021 Sum_probs=191.6 Q ss_pred CCCCEEEEECCCCCcccccccccceeeeecccccccccccccccEEEecchhhhhhhcc-cchhHHHHHhhhcccCceEE Q lcl|NC_019932. 4 YHHGVRVVEINDGTRTISTVSTAIVGMVCTADDADAAAFPLNEPVLLTNVLSAIGKAGK-KGTLAAALQAIADQAKPVTV 82 (389) Q Consensus 4 ~~~GV~v~~v~~~~~~~~~v~t~v~~~~g~a~~~~~~~~~~~~~vli~~~~~~~~~~~~-~gtl~~~v~~~~~~~~~~~~ 82 (389) -+|-|.|.+.+.+-.+..++.- ..-|+|.++......+ .++...+....+|. +..|...+.+...+.+..-. T Consensus 1 ~~~~v~vn~~n~~~g~~~~~er-~~Lfig~~~~~~~~~~------~~~~~sdld~~lg~~~~~lk~~v~aa~~naG~~~~ 73 (376) T protein:vir:37 1 MFPSVQINALNQLSGETKEIER-HALFVGVGTTNQGKLL------ALTPDSDFDKVFGETDTDLKKQVRAAMLNAGQNWF 73 (376) T ss_pred CCCeEEEecccccCCCcccccc-eEEeecccccccccee------eecCccchHhhhCCCchHHHHHHHHHHhCCCCcEE Confidence 4567888888877777776654 5667887754333333 33445555556654 46777778777777665432 Q ss_pred EEEeccccccccccccccccccccccchhhHHHHHHhhhhhhhhhhhccccccchHHHHHHHHhh----hh--cCceeee Q lcl|NC_019932. 83 VVRVAEGATPAETTSNIIGTTDENGRYTGMKALLSAQTQLGVKPRILGVPGLDALEVSTALASIA----QQ--LRAFAYV 156 (389) Q Consensus 83 v~~~~~~~~~~~t~~~~~~~~d~~~~~tGl~a~~~~~~~~~~~p~~~~apg~~~~~v~~al~~~~----~~--~~~~~i~ 156 (389) ........+. .+-+.+++.+.......--.+..|--+.++-..++.+.. 
.+ +..++++ T Consensus 74 ~~~~~~~~~~----------------~~~~~Av~~a~~~~s~E~V~v~~pv~t~~a~i~aa~~~a~el~~~~~Rpv~fil 137 (376) T protein:vir:37 74 AHVYIAQEDG----------------YDFVECVKKANQTASFEYCVNTRYLGVDKASIGKLQECYAELLAKFGRRTFFIQ 137 (376) T ss_pred EEEEeecCCc----------------hHHHHHHHHhhhhcCceEEEEeccccccHHHHHHHHHHHHHHHHhcCCeEEEEE Confidence 2221111100 012333333322222222233333112233333333333 33 2345666 Q ss_pred ccCC-------CccHHHH----HHhhhcccCceeEEeeeeEEEEeecCCCceEEehhHHHHHHHHhhhccccceeccCCc Q lcl|NC_019932. 157 SAWG-------CKTLSEA----MAYRENFSQRELMVIWPDFISWNTTANQSETAYATARALGLRAKIDTDTGWHKTLSNV 225 (389) Q Consensus 157 d~~~-------~~t~~~a----~~~~~~~~s~~~~~~~p~~~~~~~~~~~~~~~p~s~~~Ag~~a~~d~~~g~~~span~ 225 (389) ..+. +.+.++- .+-++++.+.+..+.. ..|. ..-|.+||.+|+. ..-+..||... T Consensus 138 e~r~~~~~~~~~e~w~~y~~~~~al~~gia~~~V~~V~---~~~g---------n~~G~~aGRl~~a--aVsVadspgRV 203 (376) T protein:vir:37 138 AVQGINHDQSDGETWDQYVQKLTTLQQTIVADHVCLVP---LLFG---------NETGVLAGRLANR--AVTVADSPARV 203 (376) T ss_pred eccCcCcccccccCHHHHHHHHHHhhcccccccceeee---eehh---------hhHHHHHHHHhhc--ccchhhCccce Confidence 6542 1222322 2233445555444321 1121 2257888888654 22256777765 Q ss_pred eeccceec---cccc-cccccCCcchhhhhcccceEEEEc--C-CCEEEEcCccCCCC-cccceeehhhHHHHHHHHHHH Q lcl|NC_019932. 226 GVNGVTGI---SASV-FWDLQQTGTDADLLNEACVTTLIR--K-DGFRFWGNRTCSDD-PLFAFENYTRTAQVIADTMAE 297 (389) Q Consensus 226 ~l~gv~~~---~~~~-~~~~~~~~~~~~~l~~~~i~~~~~--~-~G~~~wG~rT~~~d-~~~~~i~vrR~~~~i~~~~~~ 297 (389) .-..+.++ ..+. .....++....+.|..+|.++... + .|+-+=.+||++.+ +.+++|..+|..|-+.|.++. 
T Consensus 204 ~tG~l~gl~~~~lp~d~~~~~l~~a~l~aLd~agy~vp~~Y~gy~G~Y~~d~~tl~~~gsDY~~ie~~RVvdKa~R~vR~ 283 (376) T protein:vir:37 204 QTGALVSLGSANKPLDKDRNELTLAHLKSLETARYSVPMWYPDYDGYYWADGRTLDVEGGDYQVIENLRVVDKVARKVRL 283 (376) T ss_pred eccccccccccccccCcCcccCCHHHHHHHHhCCCeEEEeeCCCCceEEeCceEeccCCCChhhhhhhhHHHHHHHHHHH Confidence 43323332 2222 222344566677899999999844 4 46666677888653 569999999999999999988 Q ss_pred HHHHHhhcCC---CHHHHHHHHHHHHHHHHHHHhCCceeee----EEEEecCCC-CHHHhhCCEEEEEEEEEecccceEE Q lcl|NC_019932. 298 AHMWANDKPL---TPVLVRDIIAGINAKFRELVSAGYLLGA----SCWYDDTAN-DKDTLKAGKLFIDYDYTPVPPLEDL 369 (389) Q Consensus 298 ~~~~~v~e~n---~~~~~~~i~~~i~~~l~~l~~~gal~g~----~v~~d~~~n-~~~~i~~G~~~~~i~~~p~~p~e~i 369 (389) .+-+++...- ++.-.+..+.-+..=|+++.+..-+.|. ++....+.+ +..-+...++.+.+.+.|.--.+.| T Consensus 284 ~ai~~i~D~~lnst~~sia~~~~yi~~pLr~M~~s~~i~g~~fpGeI~~p~d~Di~i~w~s~~~V~I~~~v~P~~~pk~I 363 (376) T protein:vir:37 284 LAIGKIADRSFNSTTSSTEYHKNYFAKPLRDMSKSATINGKDFPGECMPPKDDAITIVWQSKTKVTIYIKVRPYDCPKEI 363 (376) T ss_pred HHHHHhCCcccCcchhhHHHHHHHHHHHHHHHHhcchhccccccceeecCCCCCceEEeeccceEEEEEEEEeccCCceE Confidence 7777765432 3444555565566678888887777773 343333322 2223477889999999999999999 Q ss_pred EEEEEEcchHHHH Q lcl|NC_019932. 370 TLRQRITDSYLAN 382 (389) Q Consensus 370 ~~~~~~~~~~~~~ 382 (389) +..+..+-.-.-+ T Consensus 364 tv~I~Ldlsn~~~ 376 (376) T protein:vir:37 364 TANIFLDLDSLGE 376 (376) T ss_pred EEEEEeecCCCCC Confidence 9776665442222 No 54 >protein:vir:80052 Length: 331 # NCBI annotation: gp14 # Family: family:all:396 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468718;genbank:gi:157325298;genbank:GeneID:5601743 Probab=98.65 E-value=1.4e-07 Score=58.15 Aligned_cols=314 Identities=12% Similarity=0.075 Sum_probs=178.7 Q ss_pred CCCCCCCEEEEECCCCCcccccccccceeeeecccccccccccccccEEEecchhhhhhhcccchhHHHHHhhhcccCce Q lcl|NC_019932. 
1 MSDYHHGVRVVEINDGTRTISTVSTAIVGMVCTADDADAAAFPLNEPVLLTNVLSAIGKAGKKGTLAAALQAIADQAKPV 80 (389) Q Consensus 1 M~~~~~GV~v~~v~~~~~~~~~v~t~v~~~~g~a~~~~~~~~~~~~~vli~~~~~~~~~~~~~gtl~~~v~~~~~~~~~~ 80 (389) |-+-+-+|.|.--...+... ..-..+.+..+... ......++..+....|+.+..+.......+.++... T Consensus 1 ~~~~iv~V~v~~~~~~~~~~--~~~~~~~~~~~~t~--------~~~~~y~s~~~v~~d~~~~~~~Ykaa~~~f~Q~~~~ 70 (331) T protein:vir:80 1 MVETITDVRVHISVLYPSPR--IGLGRPAIFVKGTA--------MGYKEYTTLEELKDTFADNTEVYAKAKAVFLQKDRP 70 (331) T ss_pred Cccceecceeeecccccccc--cccCcceeEEeccc--------cceEEEechhhhccCCCCCcHHHHHHHHHHhccCcc Confidence 65554444332111111111 11223333222111 123445555665566777777888888888887655 Q ss_pred EEEEEeccccccccccccccccccccccchhhHHHHHHhhhhhhhhhhhccccccchHHHHHHHHhhhhc-CceeeeccC Q lcl|NC_019932. 81 TVVVRVAEGATPAETTSNIIGTTDENGRYTGMKALLSAQTQLGVKPRILGVPGLDALEVSTALASIAQQL-RAFAYVSAW 159 (389) Q Consensus 81 ~~v~~~~~~~~~~~t~~~~~~~~d~~~~~tGl~a~~~~~~~~~~~p~~~~apg~~~~~v~~al~~~~~~~-~~~~i~d~~ 159 (389) ..+....... + +-+.++..... .... .+...+. ...-..++....+.. ..+.+.+.. T Consensus 71 ~~i~v~~~~~---------------~---~~~~a~~a~~~-~~w~--~~~~~~~-~~~~~~a~a~~~~a~~~~f~~~~~~ 128 (331) T protein:vir:80 71 DTVAVITYED---------------T---KLLEAAEAYFL-KSWH--FALLAEF-KAADALALSNLIEEQKFKFAVFQVT 128 (331) T ss_pred ceEEEeccch---------------H---HHHHHHHHhcc-Ccee--EEEeecC-CHHHHHHHHHHHhhCCcEEEEEecC Confidence 4332211100 0 11222221111 1111 1111122 222223343433333 334444332 Q ss_pred CCccHHHHHHhhhcccCceeEEeeeeEEEEeecCCCceEEehhHHHHHHHHhhhccccceeccCCc-eeccceecccccc Q lcl|NC_019932. 160 GCKTLSEAMAYRENFSQRELMVIWPDFISWNTTANQSETAYATARALGLRAKIDTDTGWHKTLSNV-GVNGVTGISASVF 238 (389) Q Consensus 160 ~~~t~~~a~~~~~~~~s~~~~~~~p~~~~~~~~~~~~~~~p~s~~~Ag~~a~~d~~~g~~~span~-~l~gv~~~~~~~~ 238 (389) . ......... .+....+++ .... - .+.+.+.|..+..+..+- +-.++ +|.||... 
T Consensus 129 ~---~~~~~~~~~--~~~t~~~~~-------~~~~---~-~~~aa~~g~~~~~~~g~~---t~~fk~~l~GV~~~----- 184 (331) T protein:vir:80 129 A---VADITPLAK--NTRTIAIVH-------SKTG---E-KLDAALIGNVASLPVGSA---TWKGRHGLAGITSE----- 184 (331) T ss_pred c---hHHHHHhhc--cccEEEEEc-------CCcc---c-hhHHHHHHHHHhcCccce---eeeeecccCCCCCC----- Confidence 1 222222111 111112221 1111 1 134556677766665331 23455 36666643 Q ss_pred ccccCCcchhhhhcccceEEEEcCCC-EEEEcCccCCCCcccceeehhhHHHHHHHHHHHHHHHHhhc----CCCHHHHH Q lcl|NC_019932. 239 WDLQQTGTDADLLNEACVTTLIRKDG-FRFWGNRTCSDDPLFAFENYTRTAQVIADTMAEAHMWANDK----PLTPVLVR 313 (389) Q Consensus 239 ~~~~~~~~~~~~l~~~~i~~~~~~~G-~~~wG~rT~~~d~~~~~i~vrR~~~~i~~~~~~~~~~~v~e----~n~~~~~~ 313 (389) .++..|.+.|..+|+|+..+..| -.++.+.|++++ ||-+.+-.+|++..++..+...+-. |-|..=.. T Consensus 185 ---~lt~t~~~al~~~~~N~y~~~~~~~~~~~G~~~~G~----~iD~~~~~dWl~~~lq~~l~~ll~~~~kiPy~~~G~~ 257 (331) T protein:vir:80 185 ---ELKVSEIDAIQKAGGMCYIEKAGIAQTSEGKTVSGE----FIDSIHGDDWIKATIETRLQKLLTETDKLTFDARGIA 257 (331) T ss_pred ---CCCHHHHHHHHhcCceEEEEecCeeEEecceEeCch----hHHHHHHHHHHHHHHHHHHHHHHHhCCCCccChhhHH Confidence 25788899999999999977656 455788888874 7999999999999999888876643 44677788 Q ss_pred HHHHHHHHHHHHHHhCCcee--------eeEEEEe-cCCCCHHHhhCCEEE-EEEEEEecccceEEEEEEEEcc Q lcl|NC_019932. 314 DIIAGINAKFRELVSAGYLL--------GASCWYD-DTANDKDTLKAGKLF-IDYDYTPVPPLEDLTLRQRITD 377 (389) Q Consensus 314 ~i~~~i~~~l~~l~~~gal~--------g~~v~~d-~~~n~~~~i~~G~~~-~~i~~~p~~p~e~i~~~~~~~~ 377 (389) .|+..++.-|++-+++|.|. ||.+... .++.|++|+.+++.. +.+...+...+++|++....+. 
T Consensus 258 ~l~a~i~~~~~~av~~G~I~~g~~~~~~~~~v~~~~~~~~s~~dr~~R~~~~~~~~~~~~gaI~~v~i~~~v~~ 331 (331) T protein:vir:80 258 LLQSELTTVLNEGFANGIIDSNDETGEPNFSITALQRSDLNDDDIAKRNYKGLSFRYKRSGAIHSVDVYGEVEV 331 (331) T ss_pred HHHHHHHHHHHHHHhCCceecCccCCCcceEEEeCchhcCCHHHHhccCCCCeEEEEEEcceEEEEEEEEEEeC Confidence 99999999999999999995 5777765 456799999999976 7888899999999999999888 No 55 >protein:vir:78782 Length: 370 # NCBI annotation: putative tail sheath protein # Family: family:all:669 # MgeID: mge:1857 # MgeName: phiO18P # Cross-refs: genbank:acc:YP_001285652;genbank:gi:148727158;genbank:GeneID:5220132 Probab=98.62 E-value=1e-07 Score=58.91 Aligned_cols=339 Identities=13% Similarity=0.051 Sum_probs=189.1 Q ss_pred CCCCEEEEECCCCCcccccccccceeeeecccccccccccccccEEEecchhhhhhhcc-cchhHHHHHhhhcccCceEE Q lcl|NC_019932. 4 YHHGVRVVEINDGTRTISTVSTAIVGMVCTADDADAAAFPLNEPVLLTNVLSAIGKAGK-KGTLAAALQAIADQAKPVTV 82 (389) Q Consensus 4 ~~~GV~v~~v~~~~~~~~~v~t~v~~~~g~a~~~~~~~~~~~~~vli~~~~~~~~~~~~-~gtl~~~v~~~~~~~~~~~~ 82 (389) -.|-|.|.+.+.+-.+..++.- ..-|+|+++......+ .+....++...+|+ +..|...+.+...+++..-. T Consensus 1 ~~~~v~vn~~n~~~g~~~~~er-~~lfig~~~~~~g~~~------~~~~~sdld~~l~~~ds~lk~~v~aa~~naG~~~~ 73 (370) T protein:vir:78 1 MWPYVQIYNLNQMQGPVTEVER-HLLFIGSAASNTGKLL------SLNAQSDFDQLLGAADSELKANLLAARDNAGQNWS 73 (370) T ss_pred CCceEEEeeccccCCCcCccce-eEEEEecccccccceE------eecCccCHHHhcCCcChhHHHHHHHHHhCCCCceE Confidence 4477888888887777777654 5668888764433333 34456677777764 46677777777766654322 Q ss_pred --EEEeccccccccccccccccccccccchhhHHHHHHhhhhhhhhhhhccccc-cchHHHHHHHHhhhh----c--Cce Q lcl|NC_019932. 83 --VVRVAEGATPAETTSNIIGTTDENGRYTGMKALLSAQTQLGVKPRILGVPGL-DALEVSTALASIAQQ----L--RAF 153 (389) Q Consensus 83 --v~~~~~~~~~~~t~~~~~~~~d~~~~~tGl~a~~~~~~~~~~~p~~~~apg~-~~~~v~~al~~~~~~----~--~~~ 153 (389) +..... ...-+.|++.+.... .+-.+..-+- +.+....++.+.++. 
+ ..+ T Consensus 74 ~~~~p~~~-------------------~~d~~~Av~~a~~~~--s~E~V~v~~~~s~~a~~~a~~~~a~el~n~~~Rpv~ 132 (370) T protein:vir:78 74 AAAYVLPT-------------------DKPWLDAARDAQQTQ--SFEGVVVLGQEWHQAAINAAHALNQELIAKWGRWQF 132 (370) T ss_pred EEEEEecC-------------------chhHHHHHHHHHhhC--CccEEEEecCcchHHHHHHHHHHHHHHHHhcCCeEE Confidence 211111 112345555443322 2222223332 333444444444433 2 345 Q ss_pred eeeccCC---CccHHH----HHHhhhcccCceeEEeeeeEEEEeecCCCceEEehhHHHHHHHHhhhccccceeccCCce Q lcl|NC_019932. 154 AYVSAWG---CKTLSE----AMAYRENFSQRELMVIWPDFISWNTTANQSETAYATARALGLRAKIDTDTGWHKTLSNVG 226 (389) Q Consensus 154 ~i~d~~~---~~t~~~----a~~~~~~~~s~~~~~~~p~~~~~~~~~~~~~~~p~s~~~Ag~~a~~d~~~g~~~span~~ 226 (389) +++..+. +.+.++ ..+-++++.+.+..++.-| |. -.-|.+||.+|.. ..-+..||.-.. T Consensus 133 file~~~~~~~e~w~~y~~~l~al~~gia~~~V~vvp~~---~g---------~~~G~~aGRL~na--avsVadsP~Rv~ 198 (370) T protein:vir:78 133 MLLAVPAIADEQDWATYEAELATLQDGIAASSVSLIPQL---WP---------TLAGAYAGRLCNR--AVSIADSPCRVK 198 (370) T ss_pred EEEeecCCCCcCCHHHHHHHHHHhhhccccccceEEeee---cc---------ccHHHHHHHHhcC--eeeecccceeee Confidence 5555433 223222 3344556667666665332 11 1136788876643 222667776543 Q ss_pred eccceecc--ccccccccCCcchhhhhcccceEEEEc--C-CCEEEEcCccCCCC-cccceeehhhHHHHHHHHHHHHHH Q lcl|NC_019932. 227 VNGVTGIS--ASVFWDLQQTGTDADLLNEACVTTLIR--K-DGFRFWGNRTCSDD-PLFAFENYTRTAQVIADTMAEAHM 300 (389) Q Consensus 227 l~gv~~~~--~~~~~~~~~~~~~~~~l~~~~i~~~~~--~-~G~~~wG~rT~~~d-~~~~~i~vrR~~~~i~~~~~~~~~ 300 (389) ..-+.+.. ........++....+.|..+|.++... 
+ .|+-+=.+||++.+ +.++||..+|..+-+.|.++..+- T Consensus 199 tG~l~gl~~~p~d~~~~~l~~a~l~aLd~agy~vp~~Y~gy~G~Y~~d~~tl~~~gsDYq~ie~~RVvdKa~R~vR~~ai 278 (370) T protein:vir:78 199 TGALVGLGNKPVGKDGIPLPLATLQTLEANRYSVPMWYPDYDGIYWADGRTLDAEGGDYQVIENLRIAYKVARRMRLRAI 278 (370) T ss_pred ccccccccccccccCCcccCHHHHHHHHhCCCeEEEeeCCCCceEEeCceEeccCCCChhhhhhhhHHHHHHHHHHHHHH Confidence 22222221 111223335556777899999999844 4 46666677888643 569999999999999999995554 Q ss_pred HHh-hcCCC--HHHHHHHHHHHHHHHHHHHhCCceee--eEEEEecCCC---CHHHhhCCEEEEEEEEEecccceEEEEE Q lcl|NC_019932. 301 WAN-DKPLT--PVLVRDIIAGINAKFRELVSAGYLLG--ASCWYDDTAN---DKDTLKAGKLFIDYDYTPVPPLEDLTLR 372 (389) Q Consensus 301 ~~v-~e~n~--~~~~~~i~~~i~~~l~~l~~~gal~g--~~v~~d~~~n---~~~~i~~G~~~~~i~~~p~~p~e~i~~~ 372 (389) ..+ ++-.+ +......+..+..=|+++...+.+.| |.-++....+ +..-+..+++.+.+.+.|.--.+.|+.. T Consensus 279 ~~i~D~~lnst~gsia~~~~~~~~~L~ema~s~~i~~~~fpgeI~~p~d~Di~i~w~s~~~v~I~~~v~P~~~pk~Itv~ 358 (370) T protein:vir:78 279 ARIGDRSFNSTPGSTAAAITYFGKDLREMAKSTTINGQPFPGDIASPQDGDIRIQWVAKNLVSVFVVVRTVDCPKGITVN 358 (370) T ss_pred HHhCCcccCCCCcchhHHHHHHHhhHHHHHhhhhhcccccceeEeccCCCcceEEeeccceEEEEEEEEeccCCceEEEE Confidence 444 43222 22223333334444555556776666 4344433221 1122477889999999999999999999 Q ss_pred EEEcchHHHHHH Q lcl|NC_019932. 373 QRITDSYLANFA 384 (389) Q Consensus 373 ~~~~~~~~~~~~ 384 (389) +..|-..-++-- T Consensus 359 I~LDls~e~~~~ 370 (370) T protein:vir:78 359 IMLDLSLNNGEG 370 (370) T ss_pred EEEeeccccCCC Confidence 877654322222 No 56 >protein:vir:3165 Length: 426 # NCBI annotation: capsid protein CP67 # Family: family:all:28419 # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665936;genbank:gi:22091122;genbank:GeneID:951267 Probab=98.31 E-value=1.4e-06 Score=52.62 Aligned_cols=363 Identities=14% Similarity=0.000 Sum_probs=175.8 Q ss_pred CCCCCCCEEEEECCCCCcccccccccceeeeecccccccccccccccEEEecchhhhhhhcccchhHHHHHhhhcccCce Q lcl|NC_019932. 
1 MSDYHHGVRVVEINDGTRTISTVSTAIVGMVCTADDADAAAFPLNEPVLLTNVLSAIGKAGKKGTLAAALQAIADQAKPV 80 (389) Q Consensus 1 M~~~~~GV~v~~v~~~~~~~~~v~t~v~~~~g~a~~~~~~~~~~~~~vli~~~~~~~~~~~~~gtl~~~v~~~~~~~~~~ 80 (389) |.+ -|.=+++...+.+++....+.+.|+|.+...++.. .+++....++..+....||.+...+.+....|.|+... T Consensus 1 m~~---~iVnV~Is~~t~A~~~~~Fg~~liigs~~~~~p~~-~f~~~~~Yss~~~V~~Dfg~~s~~Y~AA~~~f~Q~~~~ 76 (426) T protein:vir:31 1 MPK---QIVEIELTAEIADRPQETFTDAAIVGTAEEEPPDA-EFGEVNQYSTSTSVGDDYGEDSDVYTASEAIEEMGAEQ 76 (426) T ss_pred CCc---ceEEEEeecccccccccccceeeeeeecccccccc-ccchhhhhhhHHHHHhcCCCChHHHHHHHHHHhCCcee Confidence 885 35445666777778888889999999885443222 14455566777777888999998999999999887432 Q ss_pred EEEEEec------ccccccccccccc--ccccc----cccchhhHHHHHHhhhhhhh-hh-------------------- Q lcl|NC_019932. 81 TVVVRVA------EGATPAETTSNII--GTTDE----NGRYTGMKALLSAQTQLGVK-PR-------------------- 127 (389) Q Consensus 81 ~~v~~~~------~~~~~~~t~~~~~--~~~d~----~~~~tGl~a~~~~~~~~~~~-p~-------------------- 127 (389) ....... .......+..... +..+. ....+++.+..+..+..... .. T Consensus 77 ~r~~v~~at~~~~~~~t~~~tv~g~~~s~~a~~~~~a~~i~~~~~~~~~~~~~~~~~~~~t~~g~~t~~~~~~~~~~s~~ 156 (426) T protein:vir:31 77 WRVMVLEATEVTEEELSDGDTIDKVPILGNHEVESPDGDIEFTTDDDPDVEDFDAEIVINSATGDVATSEDSIELTYFHA 156 (426) T ss_pred EEeeccccceeeeccCCcceeecceeeeecccCcchHHHHHHhhccccccccceeeeEeccccceeeccccceeeeeccC Confidence 2211100 0000000000000 00000 00011111111110000000 00 Q ss_pred -----------h----h--ccccccchHHHHHHHHhhhhcCceeeeccCCCccHHHHHHhhhcccCcee-EEeeeeEEEE Q lcl|NC_019932. 128 -----------I----L--GVPGLDALEVSTALASIAQQLRAFAYVSAWGCKTLSEAMAYRENFSQREL-MVIWPDFISW 189 (389) Q Consensus 128 -----------~----~--~apg~~~~~v~~al~~~~~~~~~~~i~d~~~~~t~~~a~~~~~~~~s~~~-~~~~p~~~~~ 189 (389) + + ...++....+.+.+....+..+.+.+.......+.... ...+.+.+. 
.-|.|-...+ T Consensus 157 dw~~~~~~~s~~~~~~ia~~~~~~~~~~~~~~~~~wa~~~~i~~va~~~e~~~~~~~---~~~~a~~~~~~~y~p~~~~~ 233 (426) T protein:vir:31 157 DWSQLDEFPSDVNNFAVADRRFDLKGVGVLDETHSWASDEDMGMIANGVNVDDYDSV---DEAMDVAHEVAGYVPSGDLM 233 (426) T ss_pred cchhhhcccccchhhhhhccccchhhhhhhHhhhhhhhhcceeeeeeccchhhhcch---hhhhhhhhcccccccchhhe Confidence 0 0 00000000000001110111111111110000000000 011111111 1122221111 Q ss_pred eecCCCceEEehhHHHHHHHHhhhccccceeccCCceecccee---ccccccccccCCcchhhhhcccceEEEEcC-CCE Q lcl|NC_019932. 190 NTTANQSETAYATARALGLRAKIDTDTGWHKTLSNVGVNGVTG---ISASVFWDLQQTGTDADLLNEACVTTLIRK-DGF 265 (389) Q Consensus 190 ~~~~~~~~~~p~s~~~Ag~~a~~d~~~g~~~span~~l~gv~~---~~~~~~~~~~~~~~~~~~l~~~~i~~~~~~-~G~ 265 (389) . ..... .--..+++++.++..+ ||..|.=..+.+... .....+........+...++ +..|.+... ++. T Consensus 234 ~-~~~~~-~~~~~~~~~~~~aa~~----~~~~~~~~~~~~~~~~~~~~~~~gv~~t~~~~~~A~~~-~~~n~~~~~~~~~ 306 (426) T protein:vir:31 234 M-IVDAS-DDDLAAYQLGKFAVSE----PWYNPLWNELPAGETVSKNVGDPEEQGTFEGGDEAEGE-GPVNVLIDVSDAN 306 (426) T ss_pred e-ehhcc-ccchhhHHhhhhhhhc----cccchhhhhccccccceeeccccccccccchhhhhhhc-CCceEEEEecCce Confidence 0 00000 0012457778877766 455553122111111 11111111111111222333 455777553 445 Q ss_pred EEEcCccC-CCCcccceeehhhHHHHHHHHHHHHHHHHhh---c-CCCHHHHHHHHHHHHHHHHHHHhCCc--eeeeEEE Q lcl|NC_019932. 266 RFWGNRTC-SDDPLFAFENYTRTAQVIADTMAEAHMWAND---K-PLTPVLVRDIIAGINAKFRELVSAGY--LLGASCW 338 (389) Q Consensus 266 ~~wG~rT~-~~d~~~~~i~vrR~~~~i~~~~~~~~~~~v~---e-~n~~~~~~~i~~~i~~~l~~l~~~ga--l~g~~v~ 338 (389) .+|-.-|. .....-.||-++|..+|+++.++..+...+= + |-+..-+..|+..|+.-|++.++.|. +.+|.+. 
T Consensus 307 ~i~~~~~~~G~~~~G~~iD~~~g~dwl~~~iq~~l~~ll~~~~KIpyt~~Gi~~I~~~i~~~L~~~v~~~g~~~~~y~v~ 386 (426) T protein:vir:31 307 RVSNAVTTAGADSDTSFFDIRRTKVYTAEMLELDLESLQVSDDDVPFTEDGQAMIEDAIKGTMSGLTGSVGQPLAEYEVD 386 (426) T ss_pred eeecceeecccccchhhhhhHHHHHHHHHHHHHHHHHHhhcCCCCccchhHHHHHHHHHHHHHHHHhcCCCccccceeec Confidence 55644443 3334457999999999999999998886663 2 67888888999999999999988643 4568777 Q ss_pred EecCCCCHHHhhCCEEE-EEEEEEecccceEEEEEEEEcc Q lcl|NC_019932. 339 YDDTANDKDTLKAGKLF-IDYDYTPVPPLEDLTLRQRITD 377 (389) Q Consensus 339 ~d~~~n~~~~i~~G~~~-~~i~~~p~~p~e~i~~~~~~~~ 377 (389) .-....++.|..+-++. +++.......+.++++...... T Consensus 387 ~P~~~~~~~dra~R~~~~i~~~~~laGAIh~v~I~g~v~v 426 (426) T protein:vir:31 387 VPEWDDDDVDRVNRNWGGIDLDARLAQRAHTFSLGLNVSV 426 (426) T ss_pred CCCccccchhhhhhccCCceEEEEEeCcEEEEEEEEEEeC Confidence 55444455677776666 8888889999999999999888 No 57 >protein:vir:106984 Length: 743 # NCBI annotation: contractile sheath protein gp18 # Family: family:all:661 # MgeID: mge:1459 # MgeName: S-PM2 # Cross-refs: genbank:acc:YP_195136;genbank:gi:58532913;uniprot:Q5GQN6;genbank:GeneID:3260481 Probab=98.28 E-value=1.8e-09 Score=68.56 Aligned_cols=352 Identities=13% Similarity=-0.010 Sum_probs=98.9 Q ss_pred CCCCC-CCEEEEECCCCCcccccccccceeeeecccccccccccccccEEEecchhhhhhhcc---cchhHHHHHhhhcc Q lcl|NC_019932. 1 MSDYH-HGVRVVEINDGTRTISTVSTAIVGMVCTADDADAAAFPLNEPVLLTNVLSAIGKAGK---KGTLAAALQAIADQ 76 (389) Q Consensus 1 M~~~~-~GV~v~~v~~~~~~~~~v~t~v~~~~g~a~~~~~~~~~~~~~vli~~~~~~~~~~~~---~gtl~~~v~~~~~~ 76 (389) |++|+ |||||+|+..+++++..+.+++.+|+|+++. .|.++|++++++.+|...||. 
...+.+.+..+|.+ T Consensus 1 m~~~~~Pgvyv~e~~~~~~~i~~~~t~~~~~vg~~~~-----Gp~~~p~~i~s~~~~~~~fG~~~~~~~~~~~v~~~f~n 75 (743) T protein:vir:10 1 MASQVSPGILIKERDLTNAVVTGALQIRAAHASTFAK-----GPIGDIVNINTQKELVSVFGEPKEDNAEDWMVASEFLN 75 (743) T ss_pred CccccCCceEEEEecCCCceeccCCcceeEEEEeccC-----CCCCcCEEecCHHHHHHHcCCccCCcchHHHHHHHHHh Confidence 99997 8999999999999999999999999999854 478999999999999999994 46789999999999 Q ss_pred cCceEEEEEecccccccccccccccc---ccc--cccchhhHHHHHHhhhhhhhhhhhccccccchHHHHHHHHhhhhcC Q lcl|NC_019932. 77 AKPVTVVVRVAEGATPAETTSNIIGT---TDE--NGRYTGMKALLSAQTQLGVKPRILGVPGLDALEVSTALASIAQQLR 151 (389) Q Consensus 77 ~~~~~~v~~~~~~~~~~~t~~~~~~~---~d~--~~~~tGl~a~~~~~~~~~~~p~~~~apg~~~~~v~~al~~~~~~~~ 151 (389) ++..++++|.........+....... ... ....+++... ..-||-..+.+.-.+..... + T Consensus 76 gg~~~~vvrv~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~-------------a~~~G~~gN~i~V~v~~~~~--d 140 (743) T protein:vir:10 76 YGGRLAVVRAETTGVLNATTGSAGVLVKNRESWDAGSGNGEVFV-------------ARTAGSWGNSLMGVLVDRGA--D 140 (743) T ss_pred CCceEEEEEccCccccccccccccccccccccccccccceeEEE-------------EeeccccccceEEEEecCCC--c Confidence 99999999998655433322111000 000 0000001000 01122111110000000000 0 Q ss_pred ceeeeccCC-CccHHHHHHhhhcccCceeEEeeeeEEEEeecCCC---------ceEEehhHHHHHHHHhhhc--cccce Q lcl|NC_019932. 152 AFAYVSAWG-CKTLSEAMAYRENFSQRELMVIWPDFISWNTTANQ---------SETAYATARALGLRAKIDT--DTGWH 219 (389) Q Consensus 152 ~~~i~d~~~-~~t~~~a~~~~~~~~s~~~~~~~p~~~~~~~~~~~---------~~~~p~s~~~Ag~~a~~d~--~~g~~ 219 (389) .......+. .................. ..+...+...+. ......+...++..+.... ..... 
T Consensus 141 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~ 215 (743) T protein:vir:10 141 YIVTFAATPTDTAVGTQLLFSYSGTLVT-----GEILSYDSATNTATITASGTLTSQYLLDTPEQGLIGSFTDNSTTEVG 215 (743) T ss_pred ceeeeeccccccccceeeeecccccccc-----cceeeeeecCcceeeeeccccceeeeccccccccccccccccccccc Confidence 000000000 000000000000000000 000000000000 0000000111111100000 00000 Q ss_pred eccC---CceeccceeccccccccccC-----CcchhhhhcccceEEEEcCCCEEE-----EcCcc--CCCCcccceeeh Q lcl|NC_019932. 220 KTLS---NVGVNGVTGISASVFWDLQQ-----TGTDADLLNEACVTTLIRKDGFRF-----WGNRT--CSDDPLFAFENY 284 (389) Q Consensus 220 ~spa---n~~l~gv~~~~~~~~~~~~~-----~~~~~~~l~~~~i~~~~~~~G~~~-----wG~rT--~~~d~~~~~i~v 284 (389) ..+. +....+.............. .......+...+-.......+-.. .+..+ ......|.-+.. T Consensus 216 ~~~~~~~~~~~~~~~~~~~~~tv~v~~~~~~vg~~v~~~~~~~~~~~~~~~~~~v~~~~~~~~t~~~~~~~~~~~g~~~~ 295 (743) T protein:vir:10 216 RTPGTYSNVPASGGTGTGATFNVVVADAGGGVGGSVVVTLANPGTGYNQGETLTIASAATGDGTDILVTVATLSDGTIAI 295 (743) T ss_pred ccccceeeEEecccccccccccccccccccccccccccccccccceeeeccccccccccccccccchhheecccccceee Confidence 0000 00000000000000000000 000000000000000000000000 00000 000001111110 Q ss_pred hhHHHHHHHHHHHHHHHHhhcCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEecCCCCHHHhhCCEEEEEEEEEecc Q lcl|NC_019932. 285 TRTAQVIADTMAEAHMWANDKPLTPVLVRDIIAGINAKFRELVSAGYLLGASCWYDDTANDKDTLKAGKLFIDYDYTPVP 364 (389) Q Consensus 285 rR~~~~i~~~~~~~~~~~v~e~n~~~~~~~i~~~i~~~l~~l~~~gal~g~~v~~d~~~n~~~~i~~G~~~~~i~~~p~~ 364 (389) --..++-.+......... ..+..+.. ....+-... .+.-..+.+......... ....|.+...+.+.-.. 
T Consensus 296 ~a~~~~~~~~~~~~~~~~-~~~~~~~~------~t~~~~~~~--~~~~d~~~v~v~~~~~~~-~~~~~~v~~~~~~~s~~ 365 (743) T protein:vir:10 296 TELKDWYLNTEIGSTGIK-LGDIGPRP------GTSQFATDN--GITDDQVHFAVIDTTGEL-TGTANTIVERLTYLSKL 365 (743) T ss_pred eecccccccchhhccccc-cccccccc------eeeeccccc--cccccceEEEEecCccee-eeccCceeEEEeeeecc Confidence 000000000000000000 00000000 000000000 000001111111111000 00111111111111000 Q ss_pred ----------------cceEEEEEEEEcchHHHHHHHHhcC Q lcl|NC_019932. 365 ----------------PLEDLTLRQRITDSYLANFAASVNS 389 (389) Q Consensus 365 ----------------p~e~i~~~~~~~~~~~~~~~~~~~~ 389 (389) ..+...+-....... .+...+.. T Consensus 366 ~~~~~~~~~~~~~~~~~~~~s~~~~~~~~~~--~~~~~~~~ 404 (743) T protein:vir:10 366 SDARSEENANIYYKNVINEQSAYLYHGNDAA--VQIAASGE 404 (743) T ss_pred cccccccCcceeecceeccccceeeccCccc--ceeeeccc Confidence 000000000000000 00000000 No 58 >protein:vir:276 Length: 369 # NCBI annotation: putative tail sheath protein # Family: family:all:669 # MgeID: mge:7 # MgeName: K139 # Cross-refs: genbank:acc:NP_536656;genbank:gi:17975134;genbank:GeneID:929090 Probab=98.00 E-value=7.7e-06 Score=48.62 Aligned_cols=339 Identities=11% Similarity=0.036 Sum_probs=187.1 Q ss_pred CCCCCCCEEEEECCCCCcccccccccceeeeecccccccccccccccEEEecchhhhhhhcc-cchhHHHHHhhhcccCc Q lcl|NC_019932. 1 MSDYHHGVRVVEINDGTRTISTVSTAIVGMVCTADDADAAAFPLNEPVLLTNVLSAIGKAGK-KGTLAAALQAIADQAKP 79 (389) Q Consensus 1 M~~~~~GV~v~~v~~~~~~~~~v~t~v~~~~g~a~~~~~~~~~~~~~vli~~~~~~~~~~~~-~gtl~~~v~~~~~~~~~ 79 (389) |+- |-|.|.+.+.+-.+...+. ....|+|+.+... -.++...++...++...+|+ +..|...+.+...+.+. 
T Consensus 1 m~~--~~V~in~~n~~qg~~~~ve-r~~lfig~g~~~~----~~g~~~~~~~~sdld~~lg~~ds~lk~~v~aa~~naG~ 73 (369) T protein:vir:27 1 MAW--PTVIIKILNLMNGPIADIE-CHFLFVIRGTVSG----EVRNLIMVDSTSDLDDVLAEASAEGLAIVKAAQLNGKQ 73 (369) T ss_pred CCC--CceEEecccccCCCccccc-ceEEEEEeccccc----cccceEEecCccchHhhcCCcChhHHHHHHHHHhCCCC Confidence 774 6788888777777666654 4667885543211 12233345566677777764 45677778777777654 Q ss_pred eEE--EEEeccccccccccccccccccccccchhhHHHHHHhhhhhhhhhhhccccccchHHHHHHH----Hhhhhc--C Q lcl|NC_019932. 80 VTV--VVRVAEGATPAETTSNIIGTTDENGRYTGMKALLSAQTQLGVKPRILGVPGLDALEVSTALA----SIAQQL--R 151 (389) Q Consensus 80 ~~~--v~~~~~~~~~~~t~~~~~~~~d~~~~~tGl~a~~~~~~~~~~~p~~~~apg~~~~~v~~al~----~~~~~~--~ 151 (389) .-. +....... +-+.+++.+.....+.--.+..|- +......++. ....++ . T Consensus 74 ~w~a~~~p~~~~~-------------------~~~~Av~~a~~~~s~E~V~v~~p~-t~~a~i~aaq~~a~el~~~~~R~ 133 (369) T protein:vir:27 74 AWTAGVMILSEED-------------------NWQDAVKKANEVSSFEFVVLGFDA-ETKAMIEDAITLRTELKNSLGRE 133 (369) T ss_pred ceEEEEEEeCCch-------------------hHHHHHHhhhhhCCccEEEEecCc-ccHHHHHHHHHHHHHHHHhcCCe Confidence 322 21111100 112333333222122222222221 2223233333 333332 3 Q ss_pred ceeeeccCC-------CccHHH----HHHhhhcccCceeEEeeeeEEEEeecCCCceEEehhHHHHHHHHhhhcccccee Q lcl|NC_019932. 152 AFAYVSAWG-------CKTLSE----AMAYRENFSQRELMVIWPDFISWNTTANQSETAYATARALGLRAKIDTDTGWHK 220 (389) Q Consensus 152 ~~~i~d~~~-------~~t~~~----a~~~~~~~~s~~~~~~~p~~~~~~~~~~~~~~~p~s~~~Ag~~a~~d~~~g~~~ 220 (389) .++++..+. +.+.++ ..+-++++.+.+..+..-+... -.-.|.+||.+|.. ..-+.. T Consensus 134 vffi~e~~~~~~~~~~~e~w~dy~a~l~al~~g~a~~~V~vv~~~~~~----------gn~~G~~aGRl~n~--aVsIad 201 (369) T protein:vir:27 134 VGVLCQLPAINNDPTNGQTWSEWLADTVDIPKDVASEYISVVPNVHAA----------GDTLGKYAGRLANK--EVSIAD 201 (369) T ss_pred EEEEEeccccCCCccccCCHHHHHHHHHHHhhccCcccceeeeeeccc----------cchHHHHHHHHHhc--ccchhc Confidence 455554321 123332 3344567778777765222111 12457788888753 222577 Q ss_pred ccCCceeccceeccc--cccccccCCcchhhhhcccceEEEEc--C-CCEEEEcCccCCCC-cccceeehhhHHHHHHHH Q lcl|NC_019932. 
221 TLSNVGVNGVTGISA--SVFWDLQQTGTDADLLNEACVTTLIR--K-DGFRFWGNRTCSDD-PLFAFENYTRTAQVIADT 294 (389) Q Consensus 221 span~~l~gv~~~~~--~~~~~~~~~~~~~~~l~~~~i~~~~~--~-~G~~~wG~rT~~~d-~~~~~i~vrR~~~~i~~~ 294 (389) ||....-..+.|... .......++......|..+|.++... + .|+-+=.+||++.+ +.++||..+|..|-+.|. T Consensus 202 sp~RVktG~l~g~~~~p~d~~g~~l~~a~l~aLd~agysvp~~Y~gy~G~Yw~d~~tl~~~gsDYq~iE~~RVvdKa~R~ 281 (369) T protein:vir:27 202 SPARVQTGSVLGNTELMKDKAGKALDLATLKALESNRIAVPMWYPDYPGQYWTTGRTLDVPGGDYQDIRHIRVAMKAARK 281 (369) T ss_pred CcceeeecccccccccccCCCCcccCHHHHHHHHhCCCeEEEeeCCCCceEEeCceEeccCCCCeehhhhhhHHHHHHHH Confidence 887654333344321 11122224455677799999999843 4 46666677888643 569999999999999999 Q ss_pred HHHHHHHHhhcCC---CHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEecCCC-CHHHhhCCEEEEEEEEEecccceEEE Q lcl|NC_019932. 295 MAEAHMWANDKPL---TPVLVRDIIAGINAKFRELVSAGYLLGASCWYDDTAN-DKDTLKAGKLFIDYDYTPVPPLEDLT 370 (389) Q Consensus 295 ~~~~~~~~v~e~n---~~~~~~~i~~~i~~~l~~l~~~gal~g~~v~~d~~~n-~~~~i~~G~~~~~i~~~p~~p~e~i~ 370 (389) ++..+-+.+..+. ++.-.+..+.-+..=|++|...+ ..+++.-..+.. +..-....++.+-+.+.|.--.+.|+ T Consensus 282 vR~~Ai~~i~Dr~lnstp~sia~~~~~~~~pLr~M~ks~--fpgei~~P~d~dI~i~w~~k~~V~I~~~vrP~~~pk~it 359 (369) T protein:vir:27 282 VRIRAIARIADRTLNSTPQSIAAAKLYFTQDLRTMALTG--VPGEIYPPEDEDIQIKWVNSTDVEIYMSVQPYECPVKIT 359 (369) T ss_pred HHHHHHHHhcCcccccChhHHHHHHHHHhhHHHHHHhhc--CCeEEecCCCCceEEEeeccceEEEEEEEeeccCCceEE Confidence 9877776665442 45555666666777777775442 222232221110 00111445788888999999999999 Q ss_pred EEEEEcchHH Q lcl|NC_019932. 371 LRQRITDSYL 380 (389) Q Consensus 371 ~~~~~~~~~~ 380 (389) ..+..|-.-. 
T Consensus 360 ~~I~ldl~~~ 369 (369) T protein:vir:27 360 IAISVKQGDY 369 (369) T ss_pred EEEEEeccCC Confidence 9999875533 No 59 >protein:vir:4463 Length: 498 # NCBI annotation: Tail Sheath protein # Family: family:all:369 # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700386;genbank:gi:23505458;genbank:GeneID:955665 Probab=97.98 E-value=2.7e-06 Score=51.08 Aligned_cols=353 Identities=15% Similarity=0.125 Sum_probs=162.4 Q ss_pred CC------CCCCCEEEEECCCCCcccccccccceeeeecccccccccccc-cccEEEecchh-hhhhhcccchhHHHHHh Q lcl|NC_019932. 1 MS------DYHHGVRVVEINDGTRTISTVSTAIVGMVCTADDADAAAFPL-NEPVLLTNVLS-AIGKAGKKGTLAAALQA 72 (389) Q Consensus 1 M~------~~~~GV~v~~v~~~~~~~~~v~t~v~~~~g~a~~~~~~~~~~-~~~vli~~~~~-~~~~~~~~gtl~~~v~~ 72 (389) |. +...-++++-+.|.. ....+..+.+.|++...-.....+ +..+.+..... ...... ..+...+.. T Consensus 74 M~~a~~~~n~~~~l~~i~~~D~a---G~aAtg~it~tg~at~~G~l~l~Igg~~v~v~V~~gdTaa~vA--~al~aaina 148 (498) T protein:vir:44 74 MVGAYRKTDPFGELYVIAVPEST---GAAATVALTVTGEATETGTVNVYTGRTRVQAPVTSGDDAAAVA--VSIKDAVNA 148 (498) T ss_pred HHHHHHHhCCCceeEEEecCCcc---cceeEEEEEeecccCCCcEEEEEECCEEEEEEecCCCCHHHHH--HHHHHHHhC Confidence 22 233447787777632 345567788888876543333222 33333322111 000000 011111111 Q ss_pred h---------------------hcccCceEEEEEeccccccccccccccccccccccchhhHHHHHHhhhhhhhh-hhhc Q lcl|NC_019932. 73 I---------------------ADQAKPVTVVVRVAEGATPAETTSNIIGTTDENGRYTGMKALLSAQTQLGVKP-RILG 130 (389) Q Consensus 73 ~---------------------~~~~~~~~~v~~~~~~~~~~~t~~~~~~~~d~~~~~tGl~a~~~~~~~~~~~p-~~~~ 130 (389) . ...+....+-+++-.......+...+..........+|-..+..+..-.+... 
.+++ T Consensus 149 ~~~lPVTA~~~~~~vtlTAr~kG~~GN~I~l~~~~~~~~~ge~~p~Glt~titamsgGag~PDia~alaal~~~~~~~i~ 228 (498) T protein:vir:44 149 NPDLPFTATSEAGVVTLTARHKGLYGNEIPVTLNYYGFGGGEVLPAGVNITVASGVKGAGAPALNDAVAAMGDEPFDYIG 228 (498) T ss_pred CCCCceEEeeccceEEEEEeccCcccCcceEEEeeccCccccccccceeEEEEcccCCccCchhHHHHHhhccCCccEEE Confidence 0 00011111111111100001111111000111111112222222222222222 3333 Q ss_pred cccccchHHHHHHHHhh----------hhcCceeeeccCCCccHHHHHHhhhcccCceeEEeeeeEEEEeecCCCceEEe Q lcl|NC_019932. 131 VPGLDALEVSTALASIA----------QQLRAFAYVSAWGCKTLSEAMAYRENFSQRELMVIWPDFISWNTTANQSETAY 200 (389) Q Consensus 131 apg~~~~~v~~al~~~~----------~~~~~~~i~d~~~~~t~~~a~~~~~~~~s~~~~~~~p~~~~~~~~~~~~~~~p 200 (389) .| |++..-..++.+++ +++.++++... .-|..+...+....++.+..+.+. . ....-| T Consensus 229 ~p-~~D~asl~al~~~L~~~sgRw~~~~q~~g~~~~a~--~gT~a~l~t~g~~~N~~~it~~~~-------~--~~~~sp 296 (498) T protein:vir:44 229 LP-FNDTASVNSMATEMNDSSGRWSYVRQLYGHVYTAK--TGTLSELVAAGDQFNLQHITLAGY-------E--KDTQTP 296 (498) T ss_pred Ee-ecCHHHHHHHHHHHhhhhcchHHHhhcCeEEEEec--cCCHHHHHHhhhccCCceEEEEec-------C--CCCCCH Confidence 43 33333333333332 23344444433 335777777777777766544321 1 111113 Q ss_pred hhH---HHHHHHHhhhccccceeccCCceeccceeccccccccccCCcchhhhhcccceEEEEcCCCE-EEEcCccC--- Q lcl|NC_019932. 201 ATA---RALGLRAKIDTDTGWHKTLSNVGVNGVTGISASVFWDLQQTGTDADLLNEACVTTLIRKDGF-RFWGNRTC--- 273 (389) Q Consensus 201 ~s~---~~Ag~~a~~d~~~g~~~span~~l~gv~~~~~~~~~~~~~~~~~~~~l~~~~i~~~~~~~G~-~~wG~rT~--- 273 (389) +-. .+||..+.- .+..|-..--...|.|+..+.... .....|.|.|...||.+..-+.|- .+--..|. 
T Consensus 297 ~~~~AAa~a~~aA~~-l~~DPArPL~tl~L~Gi~~p~~~~----r~~~~ern~LL~~Gist~~V~~G~V~I~R~ITTY~~ 371 (498) T protein:vir:44 297 ADELAASRTARAAVF-IRNDPARPTQTGELVDMLPAPKGK----RFTTTEQQTLLSHGVATAYVESGVLRIQRDITTYRK 371 (498) T ss_pred HHHHHHHHHHHHHHH-hhcccccccCceeecccccCCchh----cCChHHHHHHHhcCcceEEEcCCeEEEEeeeeeeee Confidence 322 333333310 133344444456788888775433 346778889999999999667772 22223232 Q ss_pred ----CCCcccceeehhhHHHHHHHHHHHHHHH-HhhcCCCH-----------HHHHHHHHHHHHHHHHHHhCCceeee-- Q lcl|NC_019932. 274 ----SDDPLFAFENYTRTAQVIADTMAEAHMW-ANDKPLTP-----------VLVRDIIAGINAKFRELVSAGYLLGA-- 335 (389) Q Consensus 274 ----~~d~~~~~i~vrR~~~~i~~~~~~~~~~-~v~e~n~~-----------~~~~~i~~~i~~~l~~l~~~gal~g~-- 335 (389) ..|+.|..|...|+.+|+++.++..+.. |--+.... -+-..|+..+-.-++.|..+|-+..+ T Consensus 372 n~~G~~D~syLDi~T~~tl~yvr~~~r~~i~~kfpR~KLa~d~~~~~~gq~IvTp~~ir~eli~~y~~le~~givEn~~~ 451 (498) T protein:vir:44 372 NAYGVADNSYLDSETLHTSAYVLRRLKSVITSKYGRHKLANDGTRFGSGQAIVTPAVIRGELGSTYRQMEREGIVENFDL 451 (498) T ss_pred cCCCCcchhhhhhhhHHHHHHHHHHHHHHhhhhcCCcccccCCcccCCCcccccHHHHHHHHHHHHHhhhhhccccChhh Confidence 3477899999999999999999977652 22222222 26678899999999999988888664 Q ss_pred -E--EEEecCCCCHHHhhCCEEEEEEEEEecccc----eEEEEEEEEcchHHHHHHHHhcC Q lcl|NC_019932. 336 -S--CWYDDTANDKDTLKAGKLFIDYDYTPVPPL----EDLTLRQRITDSYLANFAASVNS 389 (389) Q Consensus 336 -~--v~~d~~~n~~~~i~~G~~~~~i~~~p~~p~----e~i~~~~~~~~~~~~~~~~~~~~ 389 (389) + +++.+.-+ +..|+.+.+-....-.. -.|+|+++++.. 
+| T Consensus 452 ~~~~LiVerd~~-----dpnRln~~~p~d~vn~L~V~A~~~~f~lq~~~~---------~~ 498 (498) T protein:vir:44 452 FQQHLIVERNAN-----DSNRLDVLFPPDYVNQLRVFAVLNQFRLQYSEE---------AA 498 (498) T ss_pred hcceeEEEECCC-----CCcEEEEEecccccCchhhhhhhhhhhhhhhhh---------cC Confidence 2 33433322 22445444433333333 334444444333 22 No 60 >protein:vir:4517 Length: 498 # NCBI annotation: tail sheath protein # Family: family:all:369 # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599043;genbank:gi:19549001;genbank:GeneID:935236 Probab=97.96 E-value=3e-06 Score=50.88 Aligned_cols=354 Identities=15% Similarity=0.124 Sum_probs=164.1 Q ss_pred CC------CCCCCEEEEECCCCCcccccccccceeeeecccccccccccc-cccEEEecchh-hhhhhcccchhHHHHHh Q lcl|NC_019932. 1 MS------DYHHGVRVVEINDGTRTISTVSTAIVGMVCTADDADAAAFPL-NEPVLLTNVLS-AIGKAGKKGTLAAALQA 72 (389) Q Consensus 1 M~------~~~~GV~v~~v~~~~~~~~~v~t~v~~~~g~a~~~~~~~~~~-~~~vli~~~~~-~~~~~~~~gtl~~~v~~ 72 (389) |. +...-++++-+.|+. .+..+..+.+.|++.........+ +..+.+..... ...... ..+...+.. T Consensus 74 M~~a~~~~n~~~~l~~i~~~d~a---G~aA~g~it~tg~at~~G~l~l~Igg~~v~v~V~~gdTaa~vA--~al~aaina 148 (498) T protein:vir:45 74 MVEAYRQTDPFGELYVIAVPEAT---GAAATVTLTVTGEATESGTVNVYVGRTRVQAPVTNGDNVTTIA--SSIQDAINA 148 (498) T ss_pred HHHHHHHhCCcceEEEEeeCCcc---cceeEEEEEeecccCCCcEEEEEECCEEEEEEecCCCCHHHHH--HHHHHHHhC Confidence 32 223457777777642 244566788888775443333222 33333322111 000000 011111111 Q ss_pred h---------------------hcccCceEEEEEeccccccccccccccccccccccchhhHHHHHHhhhhhhhh-hhhc Q lcl|NC_019932. 73 I---------------------ADQAKPVTVVVRVAEGATPAETTSNIIGTTDENGRYTGMKALLSAQTQLGVKP-RILG 130 (389) Q Consensus 73 ~---------------------~~~~~~~~~v~~~~~~~~~~~t~~~~~~~~d~~~~~tGl~a~~~~~~~~~~~p-~~~~ 130 (389) . ...+....+-+++-.......+...+..........+|-..+..+..-.+... 
..++ T Consensus 149 ~~~lPVTA~~~~~~VtlTAr~kG~~GN~I~l~~~~~~~~~ge~~p~Glt~~itamagGag~PD~a~alaal~~~~~~~I~ 228 (498) T protein:vir:45 149 VPTLPFTASSSAGVVTLTARHKGLCGNEIPVSLNYYGFGGGEVLPAGVQIAVATGTAGTGAPVLTGAVAAMADEPFDYIG 228 (498) T ss_pred CCCCceEEEecCceEEEEeeccCccccceeEEEeeccccccccccceeeEEEEccCCCccCchhHHHHHHhccCCccEEE Confidence 0 00000111111110000001111111100111111111111222212122222 2333 Q ss_pred cccccchHHHHHHHHh----------hhhcCceeeeccCCCccHHHHHHhhhcccCceeEEeeeeEEEEeecCCCceEEe Q lcl|NC_019932. 131 VPGLDALEVSTALASI----------AQQLRAFAYVSAWGCKTLSEAMAYRENFSQRELMVIWPDFISWNTTANQSETAY 200 (389) Q Consensus 131 apg~~~~~v~~al~~~----------~~~~~~~~i~d~~~~~t~~~a~~~~~~~~s~~~~~~~p~~~~~~~~~~~~~~~p 200 (389) .| |++..-..++.++ .+++.++++.-. .-|..+...+....++.+..+.+. .+...-| T Consensus 229 ~p-~~D~asL~al~~~L~~~sgRw~~~~q~~g~~~~a~--~gT~~~l~t~g~~~N~~~it~~~~---------~~~~~sp 296 (498) T protein:vir:45 229 LP-FNDTASVNTLVTEMNDTSGRWSYARQLYGHVYTAK--TGTLSELVNAGDQFNQQHITLAGY---------EKETQTP 296 (498) T ss_pred Ee-eCCHHHHHHHHHHHhhhhhhhhHHhhcCeEEEEec--cCCHHHHHHhhhccCCceEEEEec---------CCCCCCh Confidence 33 3443333333333 233444454433 236788888888888877655321 1111123 Q ss_pred hhHHHHHHHHhhhc--cccceeccCCceeccceeccccccccccCCcchhhhhcccceEEEEcCCCE-EEEcCccC---- Q lcl|NC_019932. 201 ATARALGLRAKIDT--DTGWHKTLSNVGVNGVTGISASVFWDLQQTGTDADLLNEACVTTLIRKDGF-RFWGNRTC---- 273 (389) Q Consensus 201 ~s~~~Ag~~a~~d~--~~g~~~span~~l~gv~~~~~~~~~~~~~~~~~~~~l~~~~i~~~~~~~G~-~~wG~rT~---- 273 (389) +-...|++.++... +..|-..--...|.|+..+.... .....|.|.|...||.+..-+.|- .+--..|. 
T Consensus 297 ~~~~AAa~aa~~A~~l~~DPArPL~tl~L~Gi~~p~~~~----r~~~~ern~LL~~Gist~~V~~G~V~I~R~ITTY~~n 372 (498) T protein:vir:45 297 ADELAASRTARAAVFIRNDPARPTQTGELVGMLPAPKGK----RFTMTEQQTLLSHGVATAYVESGVLRIQRDVTTYRKN 372 (498) T ss_pred HHHHHHHHHHHHHHHhhcccccccCceeecceecCCchh----cCChHHHHHHHhCCcceEEEcCCeEEEEeeeeeeeec Confidence 32333332222221 33344444456788888775433 346778889999999999667773 22233232 Q ss_pred ---CCCcccceeehhhHHHHHHHHHHHHHHHH-hhcCCCHH-----------HHHHHHHHHHHHHHHHHhCCceeee--- Q lcl|NC_019932. 274 ---SDDPLFAFENYTRTAQVIADTMAEAHMWA-NDKPLTPV-----------LVRDIIAGINAKFRELVSAGYLLGA--- 335 (389) Q Consensus 274 ---~~d~~~~~i~vrR~~~~i~~~~~~~~~~~-v~e~n~~~-----------~~~~i~~~i~~~l~~l~~~gal~g~--- 335 (389) ..|+.|..|...|+.+|+++.++..+... --+....+ |-..|+..+-.-++.|..+|-+..+ T Consensus 373 ~~G~~D~syLDi~T~~tl~yvr~~~r~~i~~kfpR~KLa~dg~~~~~gq~IvTp~~ir~ell~~y~~le~~givEn~~~~ 452 (498) T protein:vir:45 373 AYGVADNSYLDSETLHTSAYVLRKLKSVITSKYGRHKLASDGTRFGPGQAIVTPAVIKGELLATYRQLERAGIVENYELF 452 (498) T ss_pred CCCCcchhhhhhhhHHHHHHHHHHHHHHhhhhcCCeeecccCcccCCCCcccchHHHHHHHHHHHHhhhhhccccChhhh Confidence 34778999999999999999999776633 22222222 6678899988889999888888664 Q ss_pred E--EEEecCCCCHHHhhCCEEEEEEEEEeccc----ceEEEEEEEEcchHH Q lcl|NC_019932. 336 S--CWYDDTANDKDTLKAGKLFIDYDYTPVPP----LEDLTLRQRITDSYL 380 (389) Q Consensus 336 ~--v~~d~~~n~~~~i~~G~~~~~i~~~p~~p----~e~i~~~~~~~~~~~ 380 (389) + +++.++.+. ..|+.+.+-....-. 
+-.|+|+++++...- T Consensus 453 ~~~LiVerd~~d-----pnRln~~~p~d~vn~L~V~A~~~~f~lq~~~~~~ 498 (498) T protein:vir:45 453 KQYLVVERDASV-----PNRLNTLFPPDYVNQLRVFAVVNQFRLQYSEESA 498 (498) T ss_pred cceeEEEECCCC-----CcEEEEEecccccCchhhhhhhhhhheehhhcCC Confidence 2 333333221 234444433333222 344566666654422 No 61 >protein:vir:489 Length: 498 # NCBI annotation: putative sheath protein # Family: family:all:369 # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543096;swissprot:trembl:q8w623;genbank:gi:18249908;uniprot:Q8W623;genbank:GeneID:929698 Probab=97.59 E-value=4.1e-05 Score=44.66 Aligned_cols=350 Identities=15% Similarity=0.130 Sum_probs=159.3 Q ss_pred CC------CCCCCEEEEECCCCCcccccccccceeeeecccccccccccc-cccEEEecchhhhhhhcccchhHHHHHh- Q lcl|NC_019932. 1 MS------DYHHGVRVVEINDGTRTISTVSTAIVGMVCTADDADAAAFPL-NEPVLLTNVLSAIGKAGKKGTLAAALQA- 72 (389) Q Consensus 1 M~------~~~~GV~v~~v~~~~~~~~~v~t~v~~~~g~a~~~~~~~~~~-~~~vli~~~~~~~~~~~~~gtl~~~v~~- 72 (389) |. +...-++++-+.|.. .+..+..+.+.|++...-.....+ +..+.+...... +...+...+.. T Consensus 74 M~~a~~~~n~~~~l~~i~~~D~a---g~aA~g~it~tg~at~~G~l~l~Igg~~v~v~V~~gd-----Taa~vA~al~aa 145 (498) T protein:vir:48 74 MVDVYRQTDPFGELYVIAVPEAR---GAAATVRVTVTGEAEESGTLSLYVGRSSVQVPVVNGD-----DATAVATAIKEA 145 (498) T ss_pred HHHHHHHhCCCceeEEEeeCCcc---cceeEEEEEecccccCCceEEEEECCEEEEEeecCCC-----CHHHHHHHHHHH Confidence 32 223347777777642 244566788888775443322222 223322211110 00111111111 Q ss_pred hhcc----------cCceEEEEEeccccc--------------cccccccccccccccccchhhHHHHHHhhhhhhhh-h Q lcl|NC_019932. 73 IADQ----------AKPVTVVVRVAEGAT--------------PAETTSNIIGTTDENGRYTGMKALLSAQTQLGVKP-R 127 (389) Q Consensus 73 ~~~~----------~~~~~~v~~~~~~~~--------------~~~t~~~~~~~~d~~~~~tGl~a~~~~~~~~~~~p-~ 127 (389) +... .+...+..+.....+ ...+...+.-........+|-.-+..+..-.+... . 
T Consensus 146 i~a~~~lPVTA~~~~~~VtlTAr~kG~~GN~I~l~~~~~~~~~ge~~p~Glt~~itamsgGag~PDia~aLaal~~~~~~ 225 (498) T protein:vir:48 146 VNGVITLPFAASSDAGVVTLTARHKGLYGNELPVCLNYYGSGGGEILPAGLQVVTEAGTAGSGAPDLTAAVAAMGDEAFD 225 (498) T ss_pred HhCCCCcceEEEecCcEEEEEeeecccccccceeeeeeccCcccccccceeeEEEEcccCCccCcchHHHHHhhccCCcc Confidence 1110 011111111110000 00000000000000000111111111111112122 2 Q ss_pred hhccccccchHHHHHHHHhh----------hhcCceeeeccCCCccHHHHHHhhhcccCceeEEeeeeEEEEeecCCCce Q lcl|NC_019932. 128 ILGVPGLDALEVSTALASIA----------QQLRAFAYVSAWGCKTLSEAMAYRENFSQRELMVIWPDFISWNTTANQSE 197 (389) Q Consensus 128 ~~~apg~~~~~v~~al~~~~----------~~~~~~~i~d~~~~~t~~~a~~~~~~~~s~~~~~~~p~~~~~~~~~~~~~ 197 (389) +++.| |++..-..++.+++ +++.++++.-. .-|..+...+....++.+..+.+ .+. .. T Consensus 226 ~I~~p-~~D~asl~al~~~L~~~sgRw~~~~q~~g~~~~a~--~gT~~~l~t~g~~~N~~~it~~~--------~~~-~~ 293 (498) T protein:vir:48 226 FIGLP-FNDAASINMMMTEMNDSSGRWSYARQLYGHVYTAK--LGTLSELVNAGDMHNQQHITLAG--------YEK-ET 293 (498) T ss_pred EEEEe-ecCHHHHHHHHHHHhhhhhhhhHHhhcCeEEEEec--cCCHHHHHHhhhccCCceEEEEe--------cCC-CC Confidence 33333 33333333333332 33344444433 23677888888888877765442 111 11 Q ss_pred EEehhH---HHHHHHHhhhccccceeccCCceeccceeccccccccccCCcchhhhhcccceEEEEcCCC-EEEEcCccC Q lcl|NC_019932. 198 TAYATA---RALGLRAKIDTDTGWHKTLSNVGVNGVTGISASVFWDLQQTGTDADLLNEACVTTLIRKDG-FRFWGNRTC 273 (389) Q Consensus 198 ~~p~s~---~~Ag~~a~~d~~~g~~~span~~l~gv~~~~~~~~~~~~~~~~~~~~l~~~~i~~~~~~~G-~~~wG~rT~ 273 (389) .-|+.. ..|+..+ ...+..|-..--...|.|+..+.... .....|.|.|...||.+..-.+| ..+--..|. 
T Consensus 294 ~~p~~~~AAa~a~~aA-~~l~~DPArPLqtl~L~Gi~~p~~~~----r~~~~ern~LL~~Gist~~V~~G~V~I~R~ITT 368 (498) T protein:vir:48 294 QSPVDELVASRLAREA-VFIRNDPARPTQTGELVGMLPAPKGK----RFIMTEQQTLLSHGVATAYVEGGTLRIQRSVTT 368 (498) T ss_pred CChHHHHHHHHHHHHH-HhhhccccccccceeeeccccCCchh----cCChHHHHHHHhcCcceEEEcCCeEEEEeeeee Confidence 123332 2233333 11123343344445788888775443 34677888999999999855555 333333332 Q ss_pred -------CCCcccceeehhhHHHHHHHHHHHHHHH-HhhcCCCHH-----------HHHHHHHHHHHHHHHHHhCCceee Q lcl|NC_019932. 274 -------SDDPLFAFENYTRTAQVIADTMAEAHMW-ANDKPLTPV-----------LVRDIIAGINAKFRELVSAGYLLG 334 (389) Q Consensus 274 -------~~d~~~~~i~vrR~~~~i~~~~~~~~~~-~v~e~n~~~-----------~~~~i~~~i~~~l~~l~~~gal~g 334 (389) ..|+.|..|...|+.+|+++.++..+.. |--+....+ |-..|+..+-.-++.|..+|-+.. T Consensus 369 Y~~n~~G~~D~syLDi~T~~tl~yvr~~~r~~i~~kfpR~KLa~dg~~~~~gq~IvTp~~ir~eli~~y~~le~~given 448 (498) T protein:vir:48 369 YKKNAYGVADNSYLDSETLHTSAYVLRKLKSVITSKYGRHKLANDGTRFGPGQAIVTPAVIKGELLATYRQMERAGIVEN 448 (498) T ss_pred eeecCCCCcchhhhhhhhHHHHHHHHHHHHHHhhhhcCCceecccCcccCCCCcccchHHHHHHHHHHHHhhhhhccccC Confidence 3477899999999999999999977653 322222222 667889998888999988888766 Q ss_pred e---E--EEEecCCCCHHHhhCCEEEEEEEEEeccc----ceEEEEEEEEcchHHHHHHHHhcC Q lcl|NC_019932. 335 A---S--CWYDDTANDKDTLKAGKLFIDYDYTPVPP----LEDLTLRQRITDSYLANFAASVNS 389 (389) Q Consensus 335 ~---~--v~~d~~~n~~~~i~~G~~~~~i~~~p~~p----~e~i~~~~~~~~~~~~~~~~~~~~ 389 (389) + + +++.+..+. ..|+.+.+-....-. +-.|+|+++++.. 
+| T Consensus 449 ~~~~~~~LiVerd~~d-----pnRln~~~p~d~vn~L~V~A~~~~f~lq~~~~---------~~ 498 (498) T protein:vir:48 449 YDLFKQYLIVERDADN-----PNRLNTLFPPDYVNQLRVFAVVNQFRLQYSEE---------SA 498 (498) T ss_pred hhhhcceeEEEECCCC-----CcEEEEEecccccCchhhhhhhhhhhhhhhhc---------CC Confidence 4 2 334333221 234444433333332 2334445544333 22 No 62 >protein:vir:104477 Length: 749 # NCBI annotation: gp18 # Family: family:all:661 # MgeID: mge:1548 # MgeName: P-SSM4 # Cross-refs: genbank:acc:YP_214663;genbank:gi:61806304;genbank:GeneID:3294532 Probab=97.35 E-value=3e-06 Score=50.88 Aligned_cols=191 Identities=15% Similarity=0.023 Sum_probs=82.8 Q ss_pred CCC-CC-CCEEEEECCCCCcccccccccceeeeecccccccccccccccEEEecchhhhhhhcc---cchhHHHHHhhhc Q lcl|NC_019932. 1 MSD-YH-HGVRVVEINDGTRTISTVSTAIVGMVCTADDADAAAFPLNEPVLLTNVLSAIGKAGK---KGTLAAALQAIAD 75 (389) Q Consensus 1 M~~-~~-~GV~v~~v~~~~~~~~~v~t~v~~~~g~a~~~~~~~~~~~~~vli~~~~~~~~~~~~---~gtl~~~v~~~~~ 75 (389) |+. |+ |||||+|+..+ +.+..+.+++.+|+|.++ ..|.++|++++++.+|...||. ...+.+.+..+|. T Consensus 1 M~~~~~~PgVyv~e~~~~-~~~~~~~t~~~~fvG~~~-----~Gp~~~p~~v~s~~~~~~~fG~~~~~~~~~~~v~~~F~ 74 (749) T protein:vir:10 1 MATNQSSPGVVIQERDLT-TVSTIPTANVGVIAAPFT-----KGPVEEVIEITSERQLAEKFGEPNESNYEYWFSAAQFL 74 (749) T ss_pred CCccccCCeeEEEEecCC-cccccccCceeEEEeccC-----CCCCccCEEcCCHHHHHHHcCCccCCcccHHHHHHHHh Confidence 995 76 99999999776 557788999999999985 4478999999999999999985 3568889999999 Q ss_pred ccCceEEEEEeccccccccccccccccccccccchhhHHHHHHhhhhhhhhhhhccccccchHHHHHHHHhhhhcCceee Q lcl|NC_019932. 76 QAKPVTVVVRVAEGATPAETTSNIIGTTDENGRYTGMKALLSAQTQLGVKPRILGVPGLDALEVSTALASIAQQLRAFAY 155 (389) Q Consensus 76 ~~~~~~~v~~~~~~~~~~~t~~~~~~~~d~~~~~tGl~a~~~~~~~~~~~p~~~~apg~~~~~v~~al~~~~~~~~~~~i 155 (389) +++..++++|.........+. ..++.. +. .. ..+... . 
T Consensus 75 ngg~~~~vvRv~~~~~~~a~~-----------~~~~~~-~~----------~~---~~~~~~-----------------~ 112 (749) T protein:vir:10 75 SYGGLLKTIRVNSSSLKNAVD-----------TGTAPL-VK----------NL---QDYETS-----------------I 112 (749) T ss_pred hcCCeEEEEEccCcccccccc-----------cccccc-cc----------cc---cccccc-----------------c Confidence 999999999986544211110 000000 00 00 000000 0 Q ss_pred eccCCCccHHHHHHhhhcccCceeEEeeeeEEEEeecCCCceEEehhHHHHHHHHhhhccccceeccCCceec-cceecc Q lcl|NC_019932. 156 VSAWGCKTLSEAMAYRENFSQRELMVIWPDFISWNTTANQSETAYATARALGLRAKIDTDTGWHKTLSNVGVN-GVTGIS 234 (389) Q Consensus 156 ~d~~~~~t~~~a~~~~~~~~s~~~~~~~p~~~~~~~~~~~~~~~p~s~~~Ag~~a~~d~~~g~~~span~~l~-gv~~~~ 234 (389) .... ...+...-+| |- ..|. |. -|.. T Consensus 113 ~~~~---------------~~~~~~a~~p--------------------------------G~---~gn~-l~v~v~~-- 139 (749) T protein:vir:10 113 EDAS---------------NNFSWVARTP--------------------------------GD---TGNS-IGIFVTD-- 139 (749) T ss_pred cccc---------------cceEEEeccC--------------------------------CC---cCCc-eEEEEEc-- Confidence 0000 0000000000 10 0110 00 0000 Q ss_pred ccccccccCCcchhhhhcccceEEEEcCCCEEEEcCccCCCCcccceeehhhHHHHHHHHHHHHHHHHhhcCCCHHHHHH Q lcl|NC_019932. 235 ASVFWDLQQTGTDADLLNEACVTTLIRKDGFRFWGNRTCSDDPLFAFENYTRTAQVIADTMAEAHMWANDKPLTPVLVRD 314 (389) Q Consensus 235 ~~~~~~~~~~~~~~~~l~~~~i~~~~~~~G~~~wG~rT~~~d~~~~~i~vrR~~~~i~~~~~~~~~~~v~e~n~~~~~~~ 314 (389) ...+ .+........ .| T Consensus 140 ---------~~~~-------~~~~~~~~~~--~~---------------------------------------------- 155 (749) T protein:vir:10 140 ---------AGAD-------QVVVVPAPGS--GN---------------------------------------------- 155 (749) T ss_pred ---------CCCc-------eeeeeecCCc--cc---------------------------------------------- Confidence 0000 0000000000 00 Q ss_pred HHHHHHHHHHHHHhCCceeeeEEEEecCCCCHHHhhCCEEEEEEEEEecccceEEEEEEEEcchHHHHHHHHhcC Q lcl|NC_019932. 
315 IIAGINAKFRELVSAGYLLGASCWYDDTANDKDTLKAGKLFIDYDYTPVPPLEDLTLRQRITDSYLANFAASVNS 389 (389) Q Consensus 315 i~~~i~~~l~~l~~~gal~g~~v~~d~~~n~~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~~~~~~~~ 389 (389) +..+..+. ..+. ..| -......-.+.+.+..... ..+....+ T Consensus 156 -------------------~~~~~~~~-~~~~---~~~-------~~~~~~~~~~~~~~~~~~~---~~~~~~~~ 197 (749) T protein:vir:10 156 -------------------EHEFVADA-AVSA---ASG-------AAGKVFKYSIILTIDDVVG---TFAPGSAT 197 (749) T ss_pred -------------------eeeEEeee-cccc---ccc-------ccccccccceeeeeccccc---eeecccce Confidence 00000000 0000 001 1111112222222221111 11111111 No 63 >protein:vir:1996 Length: 495 # NCBI annotation: major tail subunit # Family: family:all:369 # MgeID: mge:320 # MgeName: Mu # Cross-refs: genbank:acc:NP_050643;genbank:gi:9633530;genbank:GeneID:2636269 Probab=96.59 E-value=0.00048 Score=38.76 Aligned_cols=346 Identities=12% Similarity=0.043 Sum_probs=159.8 Q ss_pred CCC------CCCCEEEEECCCCCcccccccccceeeeecccccccccccc-cccEEEecch-hhhhhh--------cccc Q lcl|NC_019932. 1 MSD------YHHGVRVVEINDGTRTISTVSTAIVGMVCTADDADAAAFPL-NEPVLLTNVL-SAIGKA--------GKKG 64 (389) Q Consensus 1 M~~------~~~GV~v~~v~~~~~~~~~v~t~v~~~~g~a~~~~~~~~~~-~~~vli~~~~-~~~~~~--------~~~g 64 (389) |.. ...-++++-+.|.. ....+..+.+.|++...-.....+ +..+.+.... +..... .... T Consensus 77 M~~a~~~~n~~~~l~~i~~~D~a---G~aA~g~it~tg~at~~G~l~l~I~g~~v~v~V~~gdTaa~vA~al~aaina~~ 153 (495) T protein:vir:19 77 MADAFLNANRVAELWCIPQGNGT---GNAAVGEISLSGTAGENGSLVTYIAGQRLAVSVAAGATGAALADLLVARIKGQP 153 (495) T ss_pred HHHHHHHhCCcceEEEEeeCChh---hceeEEEEEEeecCCCCcEEEEEECCEEEEEEecCCCCHHHHHHHHHHHhcCCc Confidence 322 22346777777642 234456778888776443332222 2233222211 111000 0001 Q ss_pred hhHHHHHh---------------hhcccC---ceEEEEEecccc--c--cccccccccccccccccchhhHHHHHHhhhh Q lcl|NC_019932. 65 TLAAALQA---------------IADQAK---PVTVVVRVAEGA--T--PAETTSNIIGTTDENGRYTGMKALLSAQTQL 122 (389) Q Consensus 65 tl~~~v~~---------------~~~~~~---~~~~v~~~~~~~--~--~~~t~~~~~~~~d~~~~~tGl~a~~~~~~~~ 122 (389) .|+.+... -..+.| ...+.+++-.+. 
+ -..+.....++.......+.|.++ T Consensus 154 ~lPvTA~~~~~~~~~~a~~~VtlTAr~kG~~n~idi~~~~~~ge~~p~Glt~titamsgGag~PDia~alaal------- 226 (495) T protein:vir:19 154 DLPVTAEVRADSGDDDTHADVVLSAKFTGALSAVDVRWNYYAGETTPYGIITAFKAASGKNGNPDISASIAGM------- 226 (495) T ss_pred cCceEEEeeccCCCCcCceeEEEEEeeccccccceeEEEeecccccccceeEEEEecCCCCCCcchHHHHHHh------- Confidence 11110000 000000 000111110000 0 001111111111111111222222 Q ss_pred hhhhhhhccccccchHHHHHHHHhhhh-------cCceeeeccCCCccHHHHHHhhhcccCceeEEeeeeEEEEeecCCC Q lcl|NC_019932. 123 GVKPRILGVPGLDALEVSTALASIAQQ-------LRAFAYVSAWGCKTLSEAMAYRENFSQRELMVIWPDFISWNTTANQ 195 (389) Q Consensus 123 ~~~p~~~~apg~~~~~v~~al~~~~~~-------~~~~~i~d~~~~~t~~~a~~~~~~~~s~~~~~~~p~~~~~~~~~~~ 195 (389) +...-..+.--|++.....+|..+++. +.++++.-. .-|..+...+....++.+..+.+ .++ T Consensus 227 ~~~~~~~I~~P~tD~asL~al~~~l~~rw~~~~q~~g~~~~a~--~gT~~~l~t~g~~~N~~~it~~~--------~~g- 295 (495) T protein:vir:19 227 GDLQYKYIVMPYTDEPNLNLLRTELQERWGPVNQADGFAVTVL--SGTYGDISTFGVSRNDHLISCMG--------IAG- 295 (495) T ss_pred ccCCCcEEEEecCcHHHHHHHHHHHHHhhhHHHhcCeEEEEee--cCCHHHHHHhhhccCCceEEEEe--------cCC- Confidence 222222222234444444455444333 334444432 22567777777777776655432 111 Q ss_pred ceEEehhHHHHHHHHhhh--ccccceeccCCceeccceeccccccccccCCcchhhhhcccceEEEEc-CCCE-EEEcCc Q lcl|NC_019932. 196 SETAYATARALGLRAKID--TDTGWHKTLSNVGVNGVTGISASVFWDLQQTGTDADLLNEACVTTLIR-KDGF-RFWGNR 271 (389) Q Consensus 196 ~~~~p~s~~~Ag~~a~~d--~~~g~~~span~~l~gv~~~~~~~~~~~~~~~~~~~~l~~~~i~~~~~-~~G~-~~wG~r 271 (389) ..-||....|++.+..- .+..|-..--...|.|+..+.... .....|.|.|...||.+... .+|. .+--.. 
T Consensus 296 -sp~~~~~~AAA~aa~~A~~l~~DPArPL~tl~L~Gi~~p~~~~----r~~~~ern~LL~~Gist~~V~~~G~V~I~R~I 370 (495) T protein:vir:19 296 -APEPSYLYAATLCAVASQALSIDPARPLQTLTLPGRMPPAVGD----RFTWSERNALLFDGISTFNVNDGGEMQIERMI 370 (495) T ss_pred -CCCcHHHHHHHHHHHHHHHhhcccccccCceeecceecCCccc----cCChHHHHHHHhCCcceEEECCCCeEEEEeee Confidence 12334333333333321 233454455566788888775443 34678889999999999854 4552 333333 Q ss_pred cC-------CCCcccceeehhhHHHHHHHHHHHHHHHHh-hcCCCHH-----------HHHHHHHHHHHHHHHHHhCCce Q lcl|NC_019932. 272 TC-------SDDPLFAFENYTRTAQVIADTMAEAHMWAN-DKPLTPV-----------LVRDIIAGINAKFRELVSAGYL 332 (389) Q Consensus 272 T~-------~~d~~~~~i~vrR~~~~i~~~~~~~~~~~v-~e~n~~~-----------~~~~i~~~i~~~l~~l~~~gal 332 (389) |. ..|+.|..|++-|+.+|+++.++......- -+....+ |-..|+..+-.-++.|..+|-+ T Consensus 371 TTY~~n~~G~~D~syLDi~T~~tl~yvr~~~r~~i~~kfpR~KLa~d~~~~~~gq~IvTp~~ir~ell~~~~~le~~giv 450 (495) T protein:vir:19 371 TMYRTNKYGDSDPSYLNVNTIATLSYLRYSLRTRITQKFPNYKLASDGTRFATGQAVVTPSVIKTELLALFEEWENAGLV 450 (495) T ss_pred eeeeecCCCCcchhhhhhHHHHHHHHHHHHHHHHHhhhcCCcccccCCCCCCCcccccChHHHHHHHHHHHHhhhhhccc Confidence 32 347789999999999999999997665332 2333222 5677899988889999888887 Q ss_pred eee---E--EEEecCCCCHHHhhCCEEEEEEEEEecccceEEEEEEEEcc Q lcl|NC_019932. 
333 LGA---S--CWYDDTANDKDTLKAGKLFIDYDYTPVPPLEDLTLRQRITD 377 (389) Q Consensus 333 ~g~---~--v~~d~~~n~~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~ 377 (389) ..+ + +++.+.-+ +.+|+.+.+-....-...-+-.++++-= T Consensus 451 en~~~~~~~LiVerd~~-----dpnRln~~~p~d~vn~L~V~A~~i~f~L 495 (495) T protein:vir:19 451 EDFDTFKEELYVARNKD-----DKDRLDVLCGPNLINQFRIFAAQVQFIL 495 (495) T ss_pred cChhhhcceeEEEECCC-----CCcEEEEEecceeeCceeeeeeeeeeeC Confidence 664 2 33333322 2245555554444444333222222211 No 64 >protein:vir:96104 Length: 504 # NCBI annotation: hypothetical protein ORF029 # Family: family:all:396 # MgeID: mge:1597 # MgeName: F8 # Cross-refs: genbank:acc:YP_001294446;genbank:gi:149408343;genbank:GeneID:5237223 Probab=95.32 E-value=0.0023 Score=35.06 Aligned_cols=356 Identities=13% Similarity=0.063 Sum_probs=176.8 Q ss_pred CCCCCCCEEEEECCCCCcccccc--cccceeeeecccccccccccccccEEEecchhhhhhhcccchhHHHHHhhhcccC Q lcl|NC_019932. 1 MSDYHHGVRVVEINDGTRTISTV--STAIVGMVCTADDADAAAFPLNEPVLLTNVLSAIGKAGKKGTLAAALQAIADQAK 78 (389) Q Consensus 1 M~~~~~GV~v~~v~~~~~~~~~v--~t~v~~~~g~a~~~~~~~~~~~~~vli~~~~~~~~~~~~~gtl~~~v~~~~~~~~ 78 (389) |-.- =++++|..+..+.... .....-+.++.+ .+|.......++..+....||.+.........+|.+.. T Consensus 1 mip~---s~iV~V~~~v~~~~~~~~~~~~~l~l~~~~-----~~~~~r~~~y~s~~~V~~~FG~~S~ey~aA~~yF~~~~ 72 (504) T protein:vir:96 1 MISQ---SRYIRIISGVGAGAPVAGRKLILRVMTTNN-----VIPPGIVIEFDNANAVLSYFGAQSEEYQRAAAYFKFIS 72 (504) T ss_pred CCCc---cceeEeeecccccccccccccceeEeeccc-----CCCccceEEecCHHHHHHhcCCChHHHHHHHHHhhcCC Confidence 6542 2344444433333222 233334444442 33444444556666667778887777777777777633 Q ss_pred ------ceEEEEEecccccccc----------------cccccccccccc----------------ccchhhHHHHHHh- Q lcl|NC_019932. 79 ------PVTVVVRVAEGATPAE----------------TTSNIIGTTDEN----------------GRYTGMKALLSAQ- 119 (389) Q Consensus 79 ------~~~~v~~~~~~~~~~~----------------t~~~~~~~~d~~----------------~~~tGl~a~~~~~- 119 (389) ...++.|......... +...+....+.. ...+-+.+..... 
T Consensus 73 ~~~~~P~~l~igR~~~~a~~~~l~g~~~~~~~~~~~~i~~G~lsitv~G~~~~~~~i~~S~~ts~~~vA~~i~~al~~~~ 152 (504) T protein:vir:96 73 KSVNSPSSISFARWVNTAIAPMVVGDNLPKTIADFAGFSAGVLTIMVGAAEKNITAIDTSAATSMDNVASIIQTEIRKNT 152 (504) T ss_pred CCCccccEEEEEeecCcCccceEEechhHHHHHHHhhhhceEEEEEEcceeeeecccccccccchHHHHHHHHhhhhccc Confidence 3334444331111000 000000000000 0000000000000 Q ss_pred ------------------------------------------hhhhhh-hhhhccccccchHHHHHHHHhhhhc---Cce Q lcl|NC_019932. 120 ------------------------------------------TQLGVK-PRILGVPGLDALEVSTALASIAQQL---RAF 153 (389) Q Consensus 120 ------------------------------------------~~~~~~-p~~~~apg~~~~~v~~al~~~~~~~---~~~ 153 (389) ...++. +......|........+|.++.... ..+ T Consensus 153 ~~~~~~~tv~~d~~~~~f~its~~tg~~~~~~~~~a~~~~~~~~lgl~~~~~~~v~g~~aet~~~al~al~~~~~~Wy~f 232 (504) T protein:vir:96 153 DPQLAQATVTWNPNTNQFTLVGATIGTGVLAVAKSADPQDMSTALGWSTSNVVNVAGQAADLPDAAVAKSTNVSNNFGSF 232 (504) T ss_pred ccccccceEEEeccCCeEEEEeeccccceeEEEeeccccchhhhhhcccccceEEeecccccHHHHHHHHHhhcCCeEEE Confidence 000000 1111111222222334444433332 223 Q ss_pred eeeccCCCccHHHHH---HhhhcccCceeEEeeeeEEEEee-------cCCCceEE----------ehhHHHHHHHHhhh Q lcl|NC_019932. 154 AYVSAWGCKTLSEAM---AYRENFSQRELMVIWPDFISWNT-------TANQSETA----------YATARALGLRAKID 213 (389) Q Consensus 154 ~i~d~~~~~t~~~a~---~~~~~~~s~~~~~~~p~~~~~~~-------~~~~~~~~----------p~s~~~Ag~~a~~d 213 (389) .+.+.+. +.+++. +|.+..+..+. |..+....+. .....+.+ -++....|..+.+| T Consensus 233 ~~a~~~~--~dd~ilalA~w~ea~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~as~~ 308 (504) T protein:vir:96 233 LFAGATL--DNDQIKAVSAWNAAQNNQFI--YTVATSLANLGALFDLVKGNSGTALNVLSATASNDFVEQCPSEILAATN 308 (504) T ss_pred EEEeccC--CHHHHHHHHHHHhhcCceEE--EEEeecccchhhHHHhhhhcceeEEEEeecCccchhHHHHHHHHHHhcC Confidence 3344322 233333 33333222222 2222110000 00011111 12344456666776 Q ss_pred ccc-cceeccCCceeccceeccccccccccCCcchhhhhcccceEEEEc--CCC--EEEE-cCccCCCCcccceeehhhH Q lcl|NC_019932. 
214 TDT-GWHKTLSNVGVNGVTGISASVFWDLQQTGTDADLLNEACVTTLIR--KDG--FRFW-GNRTCSDDPLFAFENYTRT 287 (389) Q Consensus 214 ~~~-g~~~span~~l~gv~~~~~~~~~~~~~~~~~~~~l~~~~i~~~~~--~~G--~~~w-G~rT~~~d~~~~~i~vrR~ 287 (389) ... .--.+...|.+.||... .++..+.+.|..+|+|+... +.| +.+| .+.++++.-.|.+|.+-+- T Consensus 309 f~~~ng~~T~~fk~l~GVta~--------~lt~t~~~aL~~~~~N~y~~~a~~~~~~~~~~~G~~~gG~~~~~wiDv~~~ 380 (504) T protein:vir:96 309 YDEPGASQNYMYYQFPGRNIT--------VSDDTAANTVDKSRGNYIGVTQANGQQLAFYQRGILCGGPTDAVDMNVYAN 380 (504) T ss_pred cCcccccccccccccCCcCcc--------cCCHHHHHHHHhcCCeEEEEeecccceeeEEecCeeeCCccccchhhhhhh Confidence 333 12223455667777543 25788999999999998833 333 5665 6667676546788899999 Q ss_pred HHHHHHHHHHHHHHHhhc----CCCHHHHHHHHHHHHHHHHHHHhCCcee-----------------------------e Q lcl|NC_019932. 288 AQVIADTMAEAHMWANDK----PLTPVLVRDIIAGINAKFRELVSAGYLL-----------------------------G 334 (389) Q Consensus 288 ~~~i~~~~~~~~~~~v~e----~n~~~~~~~i~~~i~~~l~~l~~~gal~-----------------------------g 334 (389) .+|+++.++..+....-. |-|..=...|+..++.-|++-+++|.|. | T Consensus 381 ~~WL~~~lq~~l~~l~~~~~kIPyt~~Gi~~l~a~i~~vl~~av~~G~I~~Gv~~~~~q~~~I~~~~~~d~~~~~~~~~G 460 (504) T protein:vir:96 381 EIWLKSAIAQALLDLFLNVNAVPASMVGEAMTLAVLQPVLDKATSNGTFTYGKDISAVQQQYITQITGDRRAWRQVQTLG 460 (504) T ss_pred HHHHHHHHHHHHHHHHhcCCCcccCHhhHHHHHHHHHHHHHHHHhcceeccCccCCccchheecccccccccccceeccc Confidence 999999999888775433 4477888899999999999999999772 3 Q ss_pred eEEEEec-CCCCH-HHhhCCEEEEEEEEEecccceEEEEEEEEc Q lcl|NC_019932. 335 ASCWYDD-TANDK-DTLKAGKLFIDYDYTPVPPLEDLTLRQRIT 376 (389) Q Consensus 335 ~~v~~d~-~~n~~-~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~ 376 (389) |.+.... 
++-++ +.-.++...+.+...--..+++|++....- T Consensus 461 Yyv~~~~~s~~s~~~r~~R~~~~~~~~y~~~gaI~~v~~~~~~v 504 (504) T protein:vir:96 461 YWINITFSSYTNSNTGLTEWKANYTLIYSKGDAIRFVEGSDVMI 504 (504) T ss_pred eEEEecChhccChhHhhhccccceEEEEEECCeEEEEEeccccC Confidence 6666542 33343 344555577777788888888888776555 No 65 >protein:vir:3636 Length: 501 # NCBI annotation: gp04 # Family: family:all:396 # MgeID: mge:75 # MgeName: Bcep781 # Cross-refs: genbank:acc:NP_705631;genbank:gi:23752316;genbank:GeneID:955753 Probab=94.68 E-value=0.0038 Score=33.87 Aligned_cols=363 Identities=13% Similarity=0.063 Sum_probs=172.6 Q ss_pred CCCC-CCCEEEEECCCCCcccccccccceeeeecccccccccccccccEEEecchhhhhhhcccchhHHHHHhhhc---- Q lcl|NC_019932. 1 MSDY-HHGVRVVEINDGTRTISTVSTAIVGMVCTADDADAAAFPLNEPVLLTNVLSAIGKAGKKGTLAAALQAIAD---- 75 (389) Q Consensus 1 M~~~-~~GV~v~~v~~~~~~~~~v~t~v~~~~g~a~~~~~~~~~~~~~vli~~~~~~~~~~~~~gtl~~~v~~~~~---- 75 (389) |+.= +|=-++++|..+..+.....-...+++-++ ....|.+.....++..+....||.+.........+|. T Consensus 1 m~~~~ip~s~iV~V~~~v~~~~~~~~~~~~lllt~----~~~~~~~r~~~y~s~~~V~~~FG~~S~ey~aA~~yFs~~~~ 76 (501) T protein:vir:36 1 MPTTTIPIDQIVQMLPGVIGAGGAPGRLTGLVLTQ----DTSVQPGQLADFFQETDVENWFGALSNEAKIADAYFPGIVN 76 (501) T ss_pred CCcCCcccceEEEEeeeeccCCCcceeeeeEEEec----cCCCCCcceeeecCHHHHHHhcCCChHHHHHHHHHhhcccC Confidence 7731 444456666655444433333333333221 1223455455566667777778877666666666664 Q ss_pred ccCc--eEEEEEecccccc-------------------------------ccccccccccccccccchhhHHHHHHhh-- Q lcl|NC_019932. 76 QAKP--VTVVVRVAEGATP-------------------------------AETTSNIIGTTDENGRYTGMKALLSAQT-- 120 (389) Q Consensus 76 ~~~~--~~~v~~~~~~~~~-------------------------------~~t~~~~~~~~d~~~~~tGl~a~~~~~~-- 120 (389) +... ..++.|....... .....+.....+.....+.+.+...... 
T Consensus 77 q~~~P~~l~igR~~~~a~~~~l~g~~l~~~~~a~~~~~sg~l~vti~g~~~~~~i~lS~~ts~~~vA~~i~~al~~~~~t 156 (501) T protein:vir:36 77 GGQLPYDLKFARYVAADAPASVYGIPLTGVTLAQLQGYSGTLTVTTAAQHVSANISLAAATSFANAATLIEAAFTSPDFV 156 (501) T ss_pred CCccccEEEEEeecCcCcceeEeccchhhhhhhhccceeEEEEEEecceeeeeecccccccCHHHHHHHHhhhhcCcceE Confidence 2221 1122221100000 0000000000000111112222211110 Q ss_pred ----------------------------------hhhhh---hhhhccccccchHHHHHHHHhhhhcC---ceeeeccCC Q lcl|NC_019932. 121 ----------------------------------QLGVK---PRILGVPGLDALEVSTALASIAQQLR---AFAYVSAWG 160 (389) Q Consensus 121 ----------------------------------~~~~~---p~~~~apg~~~~~v~~al~~~~~~~~---~~~i~d~~~ 160 (389) ..++. +..+...|........+|..+..... .+.+.+.+. T Consensus 157 v~~d~~~~~f~i~s~t~G~~~~i~~~t~~~~ia~~l~Lt~~~~a~v~~~g~~~et~~~al~a~~~~s~~Wy~f~~a~~~~ 236 (501) T protein:vir:36 157 VAYDALRNRFTVVTNATGTAAAISAVTGTNNFADEIGLSAAAGATLQAAGVAADTPASAMNRAVGLSRNWATFTTAWTAV 236 (501) T ss_pred EEEcCcceeEEEEeccCCcceeeEeeecccchhhhhcccccCcceEEecccccccHHHHHHHHHhccCceEEEEEecCCC Confidence 00000 01112222222333445544433332 334444333 Q ss_pred CccHHHHHHhhhcccCceeEEeeee-EEEEee-----------cCCCceEE------ehhHHHHHHHHhhhcccc-ceec Q lcl|NC_019932. 161 CKTLSEAMAYRENFSQRELMVIWPD-FISWNT-----------TANQSETA------YATARALGLRAKIDTDTG-WHKT 221 (389) Q Consensus 161 ~~t~~~a~~~~~~~~s~~~~~~~p~-~~~~~~-----------~~~~~~~~------p~s~~~Ag~~a~~d~~~g-~~~s 221 (389) ....-.+-+|.+.-+..+.+..+.. ....+. ..+-.+.+ .+.+.+.|..+.+|-.+- =-.+ T Consensus 237 ~~~~la~A~wiea~~~~f~~~~~~~~~~~~~~~~~~~i~~~l~~~~y~~t~~~y~~~~~~aa~~g~~as~nf~~~~g~~T 316 (501) T protein:vir:36 237 IADRLAFASWNSGQAYKYMYVAPDLEAASIVSNNAASFGAQVFAAPYQGTLPLYGDQATAGAVMGYAASINFQLRNGRTV 316 (501) T ss_pred hHHHHHHHHHHhhcCceEEEEEecCchhhhhccchhhHHHHHHhcCCCcEEEEcCCCCHHHHHHHHHHhcCcccCcceee Confidence 2222222233333222222221100 000000 00001111 244555677777664331 1112 Q ss_pred cCCcee-ccceeccccccccccCCcchhhhhcccceEEE--EcC--CCEEEEcCccCCCCcccceeehhhHHHHHHHHHH Q lcl|NC_019932. 
222 LSNVGV-NGVTGISASVFWDLQQTGTDADLLNEACVTTL--IRK--DGFRFWGNRTCSDDPLFAFENYTRTAQVIADTMA 296 (389) Q Consensus 222 pan~~l-~gv~~~~~~~~~~~~~~~~~~~~l~~~~i~~~--~~~--~G~~~wG~rT~~~d~~~~~i~vrR~~~~i~~~~~ 296 (389) ..+|.+ .|+.. ..++..+++.|..+|+|+. +.+ +.+.+|-.-+++++ |.+|.+.+-.+|++..++ T Consensus 317 ~~fkq~~~Gi~a--------~~l~~t~a~al~~~~~N~y~~~~~~~~~~~~~~~G~~sG~--~~wiD~~~g~dWL~~~iq 386 (501) T protein:vir:36 317 LAFRQFNAGVPA--------TVHDLPTANALRSNNYTYIGAYANAANNYTIAYDGKLSGK--FLWVDTYLDQIYLNAELQ 386 (501) T ss_pred eeccccCCCcCc--------CcCCHHHHHHHHhcCCcEEEEEecccceeeEEEcCeeecc--chhhhHHHhHHHHHHHHH Confidence 234443 23332 2256788999999999986 433 44788754466665 556888888899999999 Q ss_pred HHHHHHhhc----CCCHHHHHHHHHHHHHHHHHHHhCCcee-----------------------------eeEEEEecCC Q lcl|NC_019932. 297 EAHMWANDK----PLTPVLVRDIIAGINAKFRELVSAGYLL-----------------------------GASCWYDDTA 343 (389) Q Consensus 297 ~~~~~~v~e----~n~~~~~~~i~~~i~~~l~~l~~~gal~-----------------------------g~~v~~d~~~ 343 (389) ..+...+-. |-|..=...|+..++.-|++-+++|.|. ||.+..+... T Consensus 387 ~~l~~ll~~~~KIPytd~G~~~l~a~i~~~l~~av~nG~I~~Gv~~~~~q~~~I~~~~g~~~~~~~v~~~Gyy~~~~~~~ 466 (501) T protein:vir:36 387 RAEFEAMLAYNSLPYNEDGYTGLYRAGVDVIDAAVTSGIIRAGVTLTNSQLQQIDKAARVAGAGQLVQTRGWYFLIGDPA 466 (501) T ss_pred HHHHHHHhcCCCCccChhhHHHHHHHHHHHHHHHHhCceeecCCCCCcccceeecccccccccccceeccceEEeeCccc Confidence 888876543 4577778889999999999999999883 2444444433 Q ss_pred C-CHHHhhCCEEEEEEEEEecccceEEEEEEEEcc Q lcl|NC_019932. 344 N-DKDTLKAGKLFIDYDYTPVPPLEDLTLRQRITD 377 (389) Q Consensus 344 n-~~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~ 377 (389) . +++.-.++...+.+...--..+++|++-..--. 
T Consensus 467 ~~~~~R~~R~~p~~~~~y~~~gaIh~v~i~s~~v~ 501 (501) T protein:vir:36 467 NPGQARQNRTTPACTLWYSDGGSIQSLTIGSNAVI 501 (501) T ss_pred CChhhhhhcccCcEEEEEEeCCceeEEEeeeeeeC Confidence 3 334444444667777777788888876554333 No 66 >protein:vir:106730 Length: 501 # NCBI annotation: gp04 # Family: family:all:396 # MgeID: mge:1599 # MgeName: Bcep1 # Cross-refs: genbank:acc:NP_944312;genbank:gi:38638611;genbank:GeneID:2657359 Probab=93.27 E-value=0.0082 Score=32.03 Aligned_cols=362 Identities=12% Similarity=0.046 Sum_probs=173.7 Q ss_pred CCCC-CCCEEEEECCCCCcccccccccceee-eecccccccccccccccEEEecchhhhhhhcccchhHHHHHhhhc--- Q lcl|NC_019932. 1 MSDY-HHGVRVVEINDGTRTISTVSTAIVGM-VCTADDADAAAFPLNEPVLLTNVLSAIGKAGKKGTLAAALQAIAD--- 75 (389) Q Consensus 1 M~~~-~~GV~v~~v~~~~~~~~~v~t~v~~~-~g~a~~~~~~~~~~~~~vli~~~~~~~~~~~~~gtl~~~v~~~~~--- 75 (389) |+.= +|=-++++|..+..+.........++ +++. ...|.+.....++..+....||.+.........+|. T Consensus 1 m~~~~ip~s~iV~V~~~v~~~~~~~~~f~~lll~~~-----~~~~~~r~~~y~s~~~V~~~FG~~S~ey~aA~~yFsg~~ 75 (501) T protein:vir:10 1 MPTTTIPIDQIVQMLPGVIGAGGAPGRLTGLVLTQD-----TSVQPGQLADFFQKTDVENWFGALSNEAKIADAYFPGIV 75 (501) T ss_pred CCcCccccceEEEEeeecccCCCcccccceEEEecc-----cCCCccceeeecCHHHHHHhcCCChHHHHHHHHHhhhhc Confidence 7731 34445566655544333332222232 2222 223445455566667777778877666666666664 Q ss_pred -ccCc--eEEEEEeccccc---------ccc----------------------ccccccccccccccchhhHHHHHHhh- Q lcl|NC_019932. 76 -QAKP--VTVVVRVAEGAT---------PAE----------------------TTSNIIGTTDENGRYTGMKALLSAQT- 120 (389) Q Consensus 76 -~~~~--~~~v~~~~~~~~---------~~~----------------------t~~~~~~~~d~~~~~tGl~a~~~~~~- 120 (389) +... ..++.|...... ... ..-+.....+.....+.+.+...... 
T Consensus 76 ~q~p~P~~l~igR~~~~~~~~~l~g~~l~~~~la~~~~~~g~l~i~i~g~~~~~~i~~s~ats~~~vA~~i~~al~~~~~ 155 (501) T protein:vir:10 76 NGGQLPYDLKFARYVAADAPASVYGIPLTGITLAQLQGYSGTLTVTTAAQHVSANISLAAATSFANAATLIEAAFTSPDF 155 (501) T ss_pred CCCccccEEEEEeecccCccceeeeceehhhhhhhhhheeeEEEEeeccceeeeccccccccCHHHHHHHHHHhhcCCce Confidence 2221 122222211100 000 00001000010111122222221110 Q ss_pred -----------------------------------hhhhh---hhhhccccccchHHHHHHHHhhhhcC---ceeeeccC Q lcl|NC_019932. 121 -----------------------------------QLGVK---PRILGVPGLDALEVSTALASIAQQLR---AFAYVSAW 159 (389) Q Consensus 121 -----------------------------------~~~~~---p~~~~apg~~~~~v~~al~~~~~~~~---~~~i~d~~ 159 (389) .+++. +..+...|........+|.++..... .+.+.+.+ T Consensus 156 tv~~d~~~~~f~i~~~t~G~~~~i~~~t~~~d~a~~l~Lt~~~~a~v~~~g~~aet~~~Al~a~~~~~~~Wy~f~~a~~~ 235 (501) T protein:vir:10 156 VVAYDALRNRFTVVTNTTGTAAAISAVTGTNNLADELGLSAAAGATLQAAGVAADTPASAMNRAVGLSRNWATFTTAWTA 235 (501) T ss_pred EEEEecccceEEEEecccCcceeEEEeeccccchhhhcccccCceeEEecCcccccHHHHHHHHHhcccceEEEEEEecC Confidence 00000 11122222322333445554433332 33444443 Q ss_pred CCccHHHHHHhhhcccCceeEEeee---eEEEEee---------cCCCceE------EehhHHHHHHHHhhhccccc-ee Q lcl|NC_019932. 160 GCKTLSEAMAYRENFSQRELMVIWP---DFISWNT---------TANQSET------AYATARALGLRAKIDTDTGW-HK 220 (389) Q Consensus 160 ~~~t~~~a~~~~~~~~s~~~~~~~p---~~~~~~~---------~~~~~~~------~p~s~~~Ag~~a~~d~~~g~-~~ 220 (389) .....-.+-+|.+.-+..+.+..+. ....... .++-.+. -++.+.+.|..+.+|.+.-+ -. T Consensus 236 ~~~~~la~A~wi~a~~~~f~~~~~~~~~~~~~~~~~~~i~~~l~~~~y~~t~~~y~~~~~~aa~~g~~as~nf~~~~g~~ 315 (501) T protein:vir:10 236 VIADRLAFAAWNSGQAYKYMYVAPDLEAASIVTNNAASFGAQVFAAPYQGTLPLYGDQATAGAVMGYAASINFQLRNGRT 315 (501) T ss_pred ChHHHHHHHHHHHhcCceEEEEEecCcceeeecccchhHHHHHHhcCCCceEEECCCCCHHHHHHHHHHhcCcccCccee Confidence 3322222333444333333222211 1111000 0011111 13556667777777654311 12 Q ss_pred ccCCcee-ccceeccccccccccCCcchhhhhcccceEEE--EcC--CCEEEEcCccCCCCcccceeehhhHHHHHHHHH Q lcl|NC_019932. 
221 TLSNVGV-NGVTGISASVFWDLQQTGTDADLLNEACVTTL--IRK--DGFRFWGNRTCSDDPLFAFENYTRTAQVIADTM 295 (389) Q Consensus 221 span~~l-~gv~~~~~~~~~~~~~~~~~~~~l~~~~i~~~--~~~--~G~~~wG~rT~~~d~~~~~i~vrR~~~~i~~~~ 295 (389) +...|.+ .|+.. ..++..+++.|..+|+|+. +.+ +.+.+|-.-+++++ |.+|.+.+-.+|+++.+ T Consensus 316 T~~fkql~~Gv~a--------~~l~~t~a~al~~~~~N~y~~~~~~~~~~~~~~~G~~sG~--~~wiD~~~g~dWl~~~i 385 (501) T protein:vir:10 316 VLAFRQFNAGVPA--------TAHDLPTANALRSNNYTYIGAYANAANNYTIAYDGKLSGK--FLWVDTYLDQIYLNAEL 385 (501) T ss_pred eeeecccCCCcCc--------ccCCHHHHHHHHhcCCeEEEEEecccceeeEEEcceeecc--ceehhhHhhHHHHHHHH Confidence 2233443 23332 2257788999999999987 333 34888844446665 55688888889999888 Q ss_pred HHHHHHHhhc----CCCHHHHHHHHHHHHHHHHHHHhCCceee-----------------------------eEEEEecC Q lcl|NC_019932. 296 AEAHMWANDK----PLTPVLVRDIIAGINAKFRELVSAGYLLG-----------------------------ASCWYDDT 342 (389) Q Consensus 296 ~~~~~~~v~e----~n~~~~~~~i~~~i~~~l~~l~~~gal~g-----------------------------~~v~~d~~ 342 (389) +..+...+-. |-|..=...|+..++.-|++-+++|.|.- |.+..+.. T Consensus 386 q~~l~~ll~~~~kIPyt~~G~~~l~a~i~~~l~~av~nG~Ia~Gv~l~~~q~~~i~~~~g~~~~~~~v~~~Gyy~~~~~~ 465 (501) T protein:vir:10 386 QRAEFEAMLAYNSLPYNEDGYTANYRAGVDVIDAAVTSGIIRAGVTLTNSQLQQIDAVAGVAGAGALVQTRGWYFLIGNP 465 (501) T ss_pred HHHHHHHHhcCCCcccCHHHHHHHHHHHHHHHHHHHhCcceecCcccCcccceeecccccccccccceeccceEEeeCcc Confidence 8888765433 44677888899999999999999998832 34444433 Q ss_pred -CCCHHHhhCCEEEEEEEEEecccceEEEEEEEEcc Q lcl|NC_019932. 343 -ANDKDTLKAGKLFIDYDYTPVPPLEDLTLRQRITD 377 (389) Q Consensus 343 -~n~~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~ 377 (389) ..+++.-.++...+.+...--..+++|++-..--. 
T Consensus 466 ~~~~~~R~~R~~p~~~~~y~~~gaIh~v~i~s~~v~ 501 (501) T protein:vir:10 466 ANPGQARQNRTSPACTLWYSDGGSIQELTIGSNAVI 501 (501) T ss_pred cCChhhhhhcccCceEEEEEeCCceeEEEeeeeecC Confidence 23344444445667777777788888876554333 No 67 >protein:vir:101576 Length: 501 # NCBI annotation: gp04 # Family: family:all:396 # MgeID: mge:1477 # MgeName: Bcep43 # Cross-refs: genbank:acc:NP_958108;genbank:gi:41057654;genbank:GeneID:2716834 Probab=92.26 E-value=0.012 Score=31.08 Aligned_cols=362 Identities=13% Similarity=0.051 Sum_probs=171.9 Q ss_pred CCC-CCCCEEEEECCCCCcccccccccceee-eecccccccccccccccEEEecchhhhhhhcccchhHHHHHhhhc--- Q lcl|NC_019932. 1 MSD-YHHGVRVVEINDGTRTISTVSTAIVGM-VCTADDADAAAFPLNEPVLLTNVLSAIGKAGKKGTLAAALQAIAD--- 75 (389) Q Consensus 1 M~~-~~~GV~v~~v~~~~~~~~~v~t~v~~~-~g~a~~~~~~~~~~~~~vli~~~~~~~~~~~~~gtl~~~v~~~~~--- 75 (389) |+. =+|=-++++|..+..+.........++ +++. ...|..+....++..+....||.+.........+|. T Consensus 1 m~~~~ip~s~iV~V~~~v~~~~~~~~~~~~l~l~~~-----~~~~~~~~~~~~s~~~V~~~FG~~S~ey~aA~~yFsg~~ 75 (501) T protein:vir:10 1 MPTTTIPIDQIVQMLPGVIGAGGAPGRLTGLVLTQD-----TSVQPGQLADFFQETDVENWFGALSNEAKIADAYFPGIV 75 (501) T ss_pred CCCCCcccceEEEEeeecccCCCccccceeEEEecc-----CCCCccceEEecCHHHHHHhcCCChHHHHHHHHHhhhhc Confidence 773 245445666665544433333223333 2322 345666666677777777778877666666666664 Q ss_pred -ccCc--eEEEEEeccccc---------cccccc----------------------cccccccccccchhhHHHHHHhh- Q lcl|NC_019932. 76 -QAKP--VTVVVRVAEGAT---------PAETTS----------------------NIIGTTDENGRYTGMKALLSAQT- 120 (389) Q Consensus 76 -~~~~--~~~v~~~~~~~~---------~~~t~~----------------------~~~~~~d~~~~~tGl~a~~~~~~- 120 (389) +... ..++.|...... ...+.. +.....+.....+.+.+...... 
T Consensus 76 ~q~p~P~~l~igR~~~~~~~~~l~g~~l~~~~la~~~~~sg~l~vti~g~~~~~~i~ls~ats~~~vAs~i~~al~~~~~ 155 (501) T protein:vir:10 76 NGGQLPYDLKFARYVAADAPASVYGIPLTGVTLAQLQGYSGTLTVTTAAQHVSANISLAAATSFANAATLIEAAFTSPDF 155 (501) T ss_pred CCCccccEEEEEeecCCCccceEeccchhhhhhhhcceeeeEEEEeeccceeecccccccccCHHHHHHHHhhhccCCce Confidence 3222 122222110000 000000 00000000011111111111110 Q ss_pred -----------------------------------hhhhh---hhhhccccccchHHHHHHHHhhhhc---CceeeeccC Q lcl|NC_019932. 121 -----------------------------------QLGVK---PRILGVPGLDALEVSTALASIAQQL---RAFAYVSAW 159 (389) Q Consensus 121 -----------------------------------~~~~~---p~~~~apg~~~~~v~~al~~~~~~~---~~~~i~d~~ 159 (389) .+++. +..+...|........++..+.... -.+.+.+.+ T Consensus 156 tv~~d~~~~~f~its~ttG~~~~i~~~~~~~~la~~l~Lt~~~~a~v~~~g~~aet~~~a~~a~~~~~~~Wy~f~~a~~~ 235 (501) T protein:vir:10 156 VVAYDALRNRFTVVTNATGTAAAISAVTGTNNLADELGLSAAAGATLQAAGVAADTPASAMNRAVGLSRNWATFTTAWTA 235 (501) T ss_pred EEEEcccCceEEEEeeccCCceeEEEeeCchhhhhhcCccccccceEEecCcccccHHHHHHHHHhccCceEEEEEecCC Confidence 00000 1112222222233344444443332 233444543 Q ss_pred CCccHHHHHHhhhcccCceeEEeee---eEEEEee---------cCCCceEEe------hhHHHHHHHHhhhccccc-ee Q lcl|NC_019932. 160 GCKTLSEAMAYRENFSQRELMVIWP---DFISWNT---------TANQSETAY------ATARALGLRAKIDTDTGW-HK 220 (389) Q Consensus 160 ~~~t~~~a~~~~~~~~s~~~~~~~p---~~~~~~~---------~~~~~~~~p------~s~~~Ag~~a~~d~~~g~-~~ 220 (389) .....-.+-+|.+.-+..+.+..+. ....... .++-.+.+| +.+.+.|..+.+|.++-. -. T Consensus 236 ~~~~~la~A~wiea~~~~f~~~~~~~~~~~~~~~~~~~i~~~l~~~~y~~t~~~y~~~~~~aa~~g~~as~nf~~~~g~~ 315 (501) T protein:vir:10 236 VIADRLAFAAWNSGQAYKYMYVAPDLEAASIVTNNAASFGAQVFAAPYQGTLPLYGDQATAGAVMGYAASINFQLRNGRT 315 (501) T ss_pred ChHHHHHHHHHHHhcCceEEEEEecCchhhhhhhhhhhHHHHHHhcCCCceEEECCCCcHHHHHHHHHHhhCcccCccce Confidence 3222222223333322222222110 0000000 011112222 455666777777654321 12 Q ss_pred ccCCceec-cceeccccccccccCCcchhhhhcccceEEEEc--C--CCEEEEcCccCCCCcccceeehhhHHHHHHHHH Q lcl|NC_019932. 
221 TLSNVGVN-GVTGISASVFWDLQQTGTDADLLNEACVTTLIR--K--DGFRFWGNRTCSDDPLFAFENYTRTAQVIADTM 295 (389) Q Consensus 221 span~~l~-gv~~~~~~~~~~~~~~~~~~~~l~~~~i~~~~~--~--~G~~~wG~rT~~~d~~~~~i~vrR~~~~i~~~~ 295 (389) +...|.+. |+.. ..++..+++.|..+|+|+... + .-+.+|-.-+++++ |.+|.+-+-.+|+++.+ T Consensus 316 T~~fkq~~~Gi~a--------~~lt~t~a~al~~~~~N~y~~~~~~~~~~~~~~~G~~sG~--~~wiD~~~~~~Wl~~~i 385 (501) T protein:vir:10 316 VLAFRQFNAGVPA--------TAHDLPTANALRSNNYTYIGAYANAANNYTIAYDGKLSGK--FLWVDTYLDQIYLNAEL 385 (501) T ss_pred eeeccccCCCcCc--------ccCCHHHHHHHHhcCCeEEEEeccccceeeEEecCeeecc--ceeehhhhhHHHHHHHH Confidence 22334432 3322 225778899999999999843 3 33788844455665 45577777777777777 Q ss_pred HHHHHHHhhc----CCCHHHHHHHHHHHHHHHHHHHhCCceee-----------------------------eEEEEecC Q lcl|NC_019932. 296 AEAHMWANDK----PLTPVLVRDIIAGINAKFRELVSAGYLLG-----------------------------ASCWYDDT 342 (389) Q Consensus 296 ~~~~~~~v~e----~n~~~~~~~i~~~i~~~l~~l~~~gal~g-----------------------------~~v~~d~~ 342 (389) +..+...+-. |-|..=...|+..++.-|++-+++|.|.- |.+..+.. T Consensus 386 q~~l~~ll~~~~kIPyt~~G~~~l~a~v~~~l~~av~nG~I~~Gv~l~~~q~~~i~~~~g~~~~~~~v~~~Gyy~~~~~~ 465 (501) T protein:vir:10 386 QRAEFEAMLAYNSLPYNEDGYTALYRAGVDVIDAAVTSGIIRAGVTLTNSQLQQIDAAAGVAGAGQLVQTRGWYFLIGDP 465 (501) T ss_pred HHHHHHHHHhcCCcccCHHHHHHHHHHHHHHHHHHHhCceeecCCCCCcccceeeccccCccccccceeccceeEeeccc Confidence 7666654332 56788888899999999999999998832 44444433 Q ss_pred C-CCHHHhhCCEEEEEEEEEecccceEEEEEEEEcc Q lcl|NC_019932. 343 A-NDKDTLKAGKLFIDYDYTPVPPLEDLTLRQRITD 377 (389) Q Consensus 343 ~-n~~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~ 377 (389) . .+++.-.++...+.+...--..+++|++-..--. 
T Consensus 466 ~~~~~~R~~R~~p~~~~~y~~~gaIh~v~i~s~~v~ 501 (501) T protein:vir:10 466 ANPGQARQNRTTPACTLWYSDGGSIQQLTIGSNAVI 501 (501) T ss_pred cCChhhhhhccccceEEEEEeCCceeEEEeeeeecC Confidence 2 3344444444667777777777888876544333 No 68 >protein:vir:99586 Length: 507 # NCBI annotation: hypothetical protein # Family: family:all:396 # MgeID: mge:1544 # MgeName: BcepF1 # Cross-refs: genbank:acc:YP_001039795;genbank:gi:126011045;genbank:GeneID:4818281 Probab=89.82 E-value=0.024 Score=29.44 Aligned_cols=351 Identities=12% Similarity=0.029 Sum_probs=142.0 Q ss_pred CCCC-----CCC-EEEEECCCCCcccccccccceeeeecccccccccccccccEEEecchhhhh---hhcccch------ Q lcl|NC_019932. 1 MSDY-----HHG-VRVVEINDGTRTISTVSTAIVGMVCTADDADAAAFPLNEPVLLTNVLSAIG---KAGKKGT------ 65 (389) Q Consensus 1 M~~~-----~~G-V~v~~v~~~~~~~~~v~t~v~~~~g~a~~~~~~~~~~~~~vli~~~~~~~~---~~~~~gt------ 65 (389) +..+ .|+ ++|-+-....++.. +.|-.-.+..............+-....++.. .+..... T Consensus 68 Fsq~p~~~~~P~~L~igR~~~~~~~a~-----l~g~~~~~~l~~~~~~~~G~lti~v~G~~~t~~~i~lS~~ts~~~vAs 142 (507) T protein:vir:99 68 MSFISKSINSPSYISFARWVNAAIASM-----IVGDSLVKNLPALKAVATPTLSLSIGGTVVPIAGIDLTAALTLTDVAA 142 (507) T ss_pred hccCCCCCcccceEEEEeecCccccce-----eecchhhhhHHHHhhhcceeEEEEEcCceeEeccccccccCCHHHHHH Confidence 2222 122 23333221111110 00000000000000000000000000000000 0000000 Q ss_pred -hHHHHHhh-----------hcccCceEEEEEeccccccccccccccccccccccchhhHHHHHHhhhhhhhhhhhcccc Q lcl|NC_019932. 66 -LAAALQAI-----------ADQAKPVTVVVRVAEGATPAETTSNIIGTTDENGRYTGMKALLSAQTQLGVKPRILGVPG 133 (389) Q Consensus 66 -l~~~v~~~-----------~~~~~~~~~v~~~~~~~~~~~t~~~~~~~~d~~~~~tGl~a~~~~~~~~~~~p~~~~apg 133 (389) +...+... ++..+..-.+.....+..... .........+.+..+..... 
.......| T Consensus 143 ~i~~~l~a~~~~~~~~~tv~~d~~~~~F~v~s~~tG~~s~i------~~at~~~~gt~~s~l~~~~~-----~~a~~~~g 211 (507) T protein:vir:99 143 TLQTKIRASANAELATATVTFNTTTNQFVLNGTTTGALAPT------ITAVRTDPATDISSLLGWTN-----TGTVFVKG 211 (507) T ss_pred HHHHhhhccccccccceEEEEecCCceEEEEeeecccccee------EEEEcCCchhhHHHHhcccc-----ccceEeec Confidence 11111110 000111111111111111111 00000111112222211111 11122223 Q ss_pred ccchHHHHHHHHhhhhc---CceeeeccCCCccHHHHH---HhhhcccCceeEEeeeeE---------------EEEeec Q lcl|NC_019932. 134 LDALEVSTALASIAQQL---RAFAYVSAWGCKTLSEAM---AYRENFSQRELMVIWPDF---------------ISWNTT 192 (389) Q Consensus 134 ~~~~~v~~al~~~~~~~---~~~~i~d~~~~~t~~~a~---~~~~~~~s~~~~~~~p~~---------------~~~~~~ 192 (389) ........++..+.... -.+.+.+.+. .+.++.+ +|.+.-+..+.+..+... ...... T Consensus 212 ~~aet~~~a~~a~~~~~~nW~~~~~a~~~~-~td~~~lalA~wiea~~~~f~~~~~~~~a~~~~~~~~~~~~~~~~~~~~ 290 (507) T protein:vir:99 212 QAAETPDTSISKSAAISTNFGSFIYTSTPA-LTNDQITAVASWNASQNNMYMYSVPTTIANIGTLYAAVKGFSGCALNIT 290 (507) T ss_pred ccccCHHHHHHHHHhhcCCeEEEEEEeccc-cChHHHHHHHHHHhhcCcEEEEEEecCchhhhhhhhhhhhcceeEEEee Confidence 33333333443333222 1222223221 2222222 222222222211110000 000000 Q ss_pred CCCceEEehhHHHHHHHHhhhcccc-ceeccCCceeccceeccccccccccCCcchhhhhcccceEEEEcC----CCEEE Q lcl|NC_019932. 193 ANQSETAYATARALGLRAKIDTDTG-WHKTLSNVGVNGVTGISASVFWDLQQTGTDADLLNEACVTTLIRK----DGFRF 267 (389) Q Consensus 193 ~~~~~~~p~s~~~Ag~~a~~d~~~g-~~~span~~l~gv~~~~~~~~~~~~~~~~~~~~l~~~~i~~~~~~----~G~~~ 267 (389) ....-..++.+.+.|.++.+|-.+- =-.+...|.+.||..- .++..|++.|..+|+|+.... 
+.+.+ T Consensus 291 ~~~~~~~~~~aa~~g~~as~nf~~~ng~~T~~fk~l~GV~a~--------~lt~t~a~al~~~n~N~y~~~a~~~~~~~~ 362 (507) T protein:vir:99 291 SDSLPVDYIEQSPCEILAATDYTRVNATQNYMYYQFPSRNIT--------VSDDTTANLVDANRGNYIGQTQSAGQSLAF 362 (507) T ss_pred cccccchhHHHHHHHHHHhhccCcCccceeecccccCCcccc--------cCCHHHHHHHHhcCCeEEEEeccccceeeE Confidence 0111112244566677777764321 1122334555555533 257888999999999998543 23777 Q ss_pred E-cCccCCCCcccceeehhhHHHHHHHHHHHHHHHHhhc----CCCHHHHHHHHHHHHHHHHHHHhCCceee-------- Q lcl|NC_019932. 268 W-GNRTCSDDPLFAFENYTRTAQVIADTMAEAHMWANDK----PLTPVLVRDIIAGINAKFRELVSAGYLLG-------- 334 (389) Q Consensus 268 w-G~rT~~~d~~~~~i~vrR~~~~i~~~~~~~~~~~v~e----~n~~~~~~~i~~~i~~~l~~l~~~gal~g-------- 334 (389) | .+.++++.-+|.++.+-+-.+|++..++..+....-. |-|..=...|+..++.-|++-+++|.|.. T Consensus 363 ~~~G~~~gG~~~fid~d~~~~~~WL~~~iq~~l~~l~~~~~kIPyt~~G~~~l~a~i~~~l~~av~nG~I~~Gvtl~~~q 442 (507) T protein:vir:99 363 YQRGILCGGPNDAVDMNIYANEIWLKSAISAQILSLFLNVPRVPANETGESMLLSVIQSVVNTAKNNGTISAGKNLNVIQ 442 (507) T ss_pred EecCeeeCCcccceeeeeecchHHHHHHHHHHHHHHHhcCCCCccChhhHHHHHHHHHHHHHHHHhccccccCCcccccc Confidence 6 5555555335777777777778888888777764332 45778888899999999999999888843 Q ss_pred ---------------------eEEEEe-cCCCC-HHHhhCCEEEEEEEEEecccceEEEEEEEEc Q lcl|NC_019932. 335 ---------------------ASCWYD-DTAND-KDTLKAGKLFIDYDYTPVPPLEDLTLRQRIT 376 (389) Q Consensus 335 ---------------------~~v~~d-~~~n~-~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~ 376 (389) |.+... .+..+ .+...++...+.+...--..+++|++..... 
T Consensus 443 ~~~in~~~~~~~~~~~~~~~Gyy~~~~~~s~~~~~~r~~r~~~~~~~~y~~~gaI~~v~~~~~~v 507 (507) T protein:vir:99 443 QQYITQISGDANAWRQVANIGYWLNITFSSYTNPNTQLTEWKASYQLIYSKDDAIRFVEGTDTLI 507 (507) T ss_pred hheecccccccccccceeccceEEEeCChHhcChhhhhccccceEEEEEEeCCeEEEEEeeeecC Confidence 334443 23334 3444566777788888888888888877655 No 69 >protein:vir:78611 Length: 501 # NCBI annotation: BcepNY3gp03 # Family: family:all:396 # MgeID: mge:1854 # MgeName: BcepNY3 # Cross-refs: genbank:acc:YP_001294840;genbank:gi:149882903;genbank:GeneID:5291082 Probab=89.01 E-value=0.029 Score=29.02 Aligned_cols=362 Identities=12% Similarity=0.053 Sum_probs=169.9 Q ss_pred CCCC-CCCEEEEECCCCCcccccccccceeee-ecccccccccccccccEEEecchhhhhhhcccchhHHHHHhhhc--- Q lcl|NC_019932. 1 MSDY-HHGVRVVEINDGTRTISTVSTAIVGMV-CTADDADAAAFPLNEPVLLTNVLSAIGKAGKKGTLAAALQAIAD--- 75 (389) Q Consensus 1 M~~~-~~GV~v~~v~~~~~~~~~v~t~v~~~~-g~a~~~~~~~~~~~~~vli~~~~~~~~~~~~~gtl~~~v~~~~~--- 75 (389) |+.= +|=-++++|..+..+.........+++ ++. ...|.+.....++..+....||.+.........+|. T Consensus 1 m~~~~ip~s~iV~V~~~v~~~~~~~~~~~~lll~~~-----~~~~~~r~~~y~s~~~V~~~FG~~S~ey~aA~~yFs~~~ 75 (501) T protein:vir:78 1 MPTTTIPIDQIVQMLPGVIGAGGAPGRLTGLVLTQD-----TSIQPGQLADFFQKTDVENWFGGLSNEAVIADAYFPGIV 75 (501) T ss_pred CCcCccccceEEEEeeecccCCCcceeeeeEEEecC-----CCCCccceeeecCHHHHHHhcCCChHHHHHHHHHhhcCC Confidence 7731 344456666555444333332222332 222 223444445566667777778877666666666664 Q ss_pred -ccCce--EEEEEeccccc-------------------------------cccccccccccccccccchhhHHHHHHhh- Q lcl|NC_019932. 76 -QAKPV--TVVVRVAEGAT-------------------------------PAETTSNIIGTTDENGRYTGMKALLSAQT- 120 (389) Q Consensus 76 -~~~~~--~~v~~~~~~~~-------------------------------~~~t~~~~~~~~d~~~~~tGl~a~~~~~~- 120 (389) +...+ .++.|...... ...+..+.....+.....+.+.+...... 
T Consensus 76 ~q~~~P~~l~igR~~~~a~~~~l~g~~l~~~~la~~~~~~G~l~iti~g~~~~~~i~~S~~ts~~~vA~~i~~al~a~~~ 155 (501) T protein:vir:78 76 NGGQLPYDLKFARYVAADAPASVYGIPLTGVTLTQLQGYSGTLTVTTAAQHVSSNISLAAATSFANAATLIEAAFTSPDF 155 (501) T ss_pred CCCcccceEEEEeecccCcceeEeccceeccchhhhceeeeEEEEEeccceeeeccccccccCHHHHHHHHHhhhcCcce Confidence 32221 12222110000 00000000000010111111222211110 Q ss_pred -----------------------------------hhhh---hhhhhccccccchHHHHHHHHhhhhc---CceeeeccC Q lcl|NC_019932. 121 -----------------------------------QLGV---KPRILGVPGLDALEVSTALASIAQQL---RAFAYVSAW 159 (389) Q Consensus 121 -----------------------------------~~~~---~p~~~~apg~~~~~v~~al~~~~~~~---~~~~i~d~~ 159 (389) .+++ .+..+...|........++..+.... -.+.+.+.+ T Consensus 156 tv~~ds~~~~f~its~t~G~~~~i~~~t~~~~~a~~l~Lt~~~~a~v~~~g~~aet~~~a~~a~~~~~~~Wy~f~~a~~~ 235 (501) T protein:vir:78 156 VVSYDALRNRFVVNTNATGTAAAISAVTGTNNLADELGLSAAAGASLQAAGVAADTPASAMNRAVGLSRNWATFTTAWTA 235 (501) T ss_pred EEEEccccceEEEEeeecCCceeEEEEecccchhhhhcccccCceeeEeccccccCHHHHHHHHHhccCceEEEEEecCC Confidence 0000 01112222332333344554443333 233444443 Q ss_pred CCccHHHHHHhhhcccCceeEEee---eeEEEEee---------cCCCceEEe------hhHHHHHHHHhhhccccc-ee Q lcl|NC_019932. 160 GCKTLSEAMAYRENFSQRELMVIW---PDFISWNT---------TANQSETAY------ATARALGLRAKIDTDTGW-HK 220 (389) Q Consensus 160 ~~~t~~~a~~~~~~~~s~~~~~~~---p~~~~~~~---------~~~~~~~~p------~s~~~Ag~~a~~d~~~g~-~~ 220 (389) .....-.+-+|.+.-+..+.+..+ +....... ..+-.+.+| +.+.+.|..+.+|.++-. -. T Consensus 236 ~~~~~lalA~wiea~~~~f~~~~~~~~~~~~~~~~~~~i~~~l~a~~y~~t~~~y~~~~~~aa~~g~~as~nf~~~~g~~ 315 (501) T protein:vir:78 236 VIADRLALASWNSGQAYKYMYVAPDLEPASIVTNNSASFGAQVFAAPYQGTLPLYGDQATAGAVMGYAASINFQLRNGRT 315 (501) T ss_pred CHHHHHHHHHHHHhcCceEEEEEecCCcceeecccchhHHHHHhhcCCCceEEEcCCcchHHHHHHHHHhcCcccCccee Confidence 322222222343332222222211 11111000 011112222 345556666777654321 12 Q ss_pred ccCCcee-ccceeccccccccccCCcchhhhhcccceEEEE--cC--CCEEEEcCccCCCCcccceeehhhHHHHHHHHH Q lcl|NC_019932. 
221 TLSNVGV-NGVTGISASVFWDLQQTGTDADLLNEACVTTLI--RK--DGFRFWGNRTCSDDPLFAFENYTRTAQVIADTM 295 (389) Q Consensus 221 span~~l-~gv~~~~~~~~~~~~~~~~~~~~l~~~~i~~~~--~~--~G~~~wG~rT~~~d~~~~~i~vrR~~~~i~~~~ 295 (389) +...|.+ .|+.. ..++..+++.|..+|+|+.. .+ +.+.+|-.-+++++ |.+|.+-+-.+|+++.+ T Consensus 316 T~~fkq~~~Gv~a--------~~l~~t~a~al~~~~~N~y~~~~~~~~~~~~~~~G~~sG~--~~wiD~~~~~~Wl~~~i 385 (501) T protein:vir:78 316 VLAFRQFNAGVPA--------TAHDLGTANALRSNNYTYIGAYANAANNYTIAYDGKLSGK--FLWVDTYLDQIYLNAEL 385 (501) T ss_pred eeeccccCCCcCc--------ccCCHHHHHHHHhcCCeEEEEEecccceeeEEEcCeeecc--ceeehhhhhHHHHHHHH Confidence 2233443 23322 22577889999999999873 33 34888844455665 45577777777777777 Q ss_pred HHHHHHHhhc----CCCHHHHHHHHHHHHHHHHHHHhCCceee-----------------------------eEEEEecC Q lcl|NC_019932. 296 AEAHMWANDK----PLTPVLVRDIIAGINAKFRELVSAGYLLG-----------------------------ASCWYDDT 342 (389) Q Consensus 296 ~~~~~~~v~e----~n~~~~~~~i~~~i~~~l~~l~~~gal~g-----------------------------~~v~~d~~ 342 (389) +..+...+-. |-|..=...|+..++.-|++-+++|.|.- |.+..+.. T Consensus 386 q~~l~~ll~~~~kIPyt~~G~~~l~a~v~~~l~~av~nG~I~~Gv~l~~~q~~~I~~~~g~~~~~~~~~~~Gyy~~~~~~ 465 (501) T protein:vir:78 386 QRAEFEAMLAYNSLPYNEDGYTALYRAGVDVIDAAVTSGIIRAGVTLTNSQLQQIDAAAGVAGAGQLVQMRGWYFLIGDP 465 (501) T ss_pred HHHHHHHHHhCCCcccCHHHHHHHHHHHHHHHHHHHhCceeecCCCCCCccceeeccccCccccccceeccceEEeeccc Confidence 7776654322 55788888899999999999999998832 34444433 Q ss_pred -CCCHHHhhCCEEEEEEEEEecccceEEEEEEEEcc Q lcl|NC_019932. 343 -ANDKDTLKAGKLFIDYDYTPVPPLEDLTLRQRITD 377 (389) Q Consensus 343 -~n~~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~ 377 (389) ..+++.-.++...+.+...--..+++|++-..--. 
T Consensus 466 ~~~~~~R~~R~~p~~~~~y~~~gaIh~v~i~s~~v~ 501 (501) T protein:vir:78 466 ANPGQARQNRTTPTCTLWYSDGGSIQELTIGSNAVI 501 (501) T ss_pred cCChhhhhhcccCcEEEEEEeCCceeEEEeeeeecC Confidence 23344444445667777777778888876544333 No 70 >protein:vir:94073 Length: 494 # NCBI annotation: hypothetical protein # Family: family:all:396 # MgeID: mge:1493 # MgeName: OP2 # Cross-refs: genbank:acc:YP_453619;genbank:gi:84662655;genbank:GeneID:5142583 Probab=65.56 E-value=0.28 Score=23.60 Aligned_cols=352 Identities=14% Similarity=0.070 Sum_probs=154.2 Q ss_pred CCCCCCCEEEEECCCCCccc--ccccccceeeeecccccccccccccccEEEecchhhhhhhcccchhHHHHHhhhc--- Q lcl|NC_019932. 1 MSDYHHGVRVVEINDGTRTI--STVSTAIVGMVCTADDADAAAFPLNEPVLLTNVLSAIGKAGKKGTLAAALQAIAD--- 75 (389) Q Consensus 1 M~~~~~GV~v~~v~~~~~~~--~~v~t~v~~~~g~a~~~~~~~~~~~~~vli~~~~~~~~~~~~~gtl~~~v~~~~~--- 75 (389) |.. +|==++++|..+..+. ....-...-|.+ . ...|.......++..+....||.+.........+|. T Consensus 1 m~~-ip~s~iV~V~~~v~~~~~~~~~f~~~l~~~-~-----~~~~~~r~~~y~s~~~V~~~FG~~S~ey~aA~~yFs~~~ 73 (494) T protein:vir:94 1 MPN-IPISQIVSINPQVVSAGGTQGTLDGLLLTQ-A-----TGFPVTQPQVYFSAADVGTAFGLTSDEYNAALVYFAGIL 73 (494) T ss_pred CCC-CCcccEEEeeeeccccCCcccccceeEeec-C-----ccCCccceeeecCHHHHHHhcCCChHHHHHHHHHhhhcc Confidence 652 2212344444433332 222222222222 1 223333344455666667778877666666666665 Q ss_pred -ccCc--eEEEEEecccc--------ccccccc----------------------cccccccccccchhhHHHHHH---- Q lcl|NC_019932. 76 -QAKP--VTVVVRVAEGA--------TPAETTS----------------------NIIGTTDENGRYTGMKALLSA---- 118 (389) Q Consensus 76 -~~~~--~~~v~~~~~~~--------~~~~t~~----------------------~~~~~~d~~~~~tGl~a~~~~---- 118 (389) +... ..++.|..... ....+.. +.....+..+..+.+.+.... 
T Consensus 74 ~q~p~P~~l~igR~~~~a~~~~l~g~~~~~tl~~~~~~~g~l~iti~g~~~~~~i~lS~~ts~~~vA~~i~~ai~~a~~~ 153 (494) T protein:vir:94 74 GGGQQPASLTIGRYASAATSAAVFGAPLTLSLAQLQTLSGTLIVTTDTQRTSAAINLSGATSFANAASLMTSGFTTPNFA 153 (494) T ss_pred CCCccccEEEEEeecCccccceeeccchhhhHHhhhhcceEEEEEEcceEEEeeecccccCChhhHHHHHhhhhccccce Confidence 3322 22222222110 0000000 000000000000111111000 Q ss_pred ------------------------------hhhhhhh---hhhhccccccchHHHHHHHHhhhhc---CceeeeccCCCc Q lcl|NC_019932. 119 ------------------------------QTQLGVK---PRILGVPGLDALEVSTALASIAQQL---RAFAYVSAWGCK 162 (389) Q Consensus 119 ------------------------------~~~~~~~---p~~~~apg~~~~~v~~al~~~~~~~---~~~~i~d~~~~~ 162 (389) -..+++. ...+...|........++..+.... ..+.+.+. . T Consensus 154 v~~d~~~~~f~v~s~ttG~~s~is~~t~~~a~~l~lt~~~~a~v~~~g~~aet~~~a~~a~~~~~~~Wy~f~~~~~---~ 230 (494) T protein:vir:94 154 ITYDAQRRRFVLSTTATGTTASVSAVTGTLADGVGLSTASGAYVEGSGLAADTAASALDRLAASSSTWAIFTTAWA---A 230 (494) T ss_pred EEEcccCcEEEEEEccCCceeEEEEeccchhhhhhhhccccceEeecCcccccHHHHHHHHHhccCceEEEEEecC---C Confidence 0000000 0001112222222334444433222 23334443 2 Q ss_pred cHHHHHH---hhhcccCceeEEeeee-----EEEEeecC---------C--CceE----EehhHHHHHHHHhhhccccce Q lcl|NC_019932. 163 TLSEAMA---YRENFSQRELMVIWPD-----FISWNTTA---------N--QSET----AYATARALGLRAKIDTDTGWH 219 (389) Q Consensus 163 t~~~a~~---~~~~~~s~~~~~~~p~-----~~~~~~~~---------~--~~~~----~p~s~~~Ag~~a~~d~~~g~~ 219 (389) +.+++.+ |.+.-+..+ +|..| .......+ + +... ..+.+.+.|..+.+|-+. T Consensus 231 ~~~~ilalA~wiea~~~~~--~~~~~~~d~~~~~~~~~~~i~~~l~~~~y~~t~~~y~~~~~~aa~~g~~aa~~~~~--- 305 (494) T protein:vir:94 231 SLSDRTALAQWTSDQVFRR--IYAAWDQDAAGLSVNNVSSFGNIVKTTPFSNTIPVYGLLANAMIVLAWGASTNLQI--- 305 (494) T ss_pred CHHHHHHHHHHHhhcCccE--EEEEecCCcceeecccchhHHHHHHhhcCCceEEEcCCCChHHHHHHHHHhccccc--- Confidence 3344443 333322222 22222 11100000 0 1111 124456666666666433 Q ss_pred eccCCceecc---ceeccccccccccCCcchhhhhcccceEEEEcCCC----EEEEcCccCCCCcccceeehhhHHHHHH Q lcl|NC_019932. 
220 KTLSNVGVNG---VTGISASVFWDLQQTGTDADLLNEACVTTLIRKDG----FRFWGNRTCSDDPLFAFENYTRTAQVIA 292 (389) Q Consensus 220 ~span~~l~g---v~~~~~~~~~~~~~~~~~~~~l~~~~i~~~~~~~G----~~~wG~rT~~~d~~~~~i~vrR~~~~i~ 292 (389) .+.+..++. .-++ ....++..+++.|..+|+|+.....| +.+|.+-+++++-.|- -+-+-.+|++ T Consensus 306 -~~g~~T~~~k~q~~gi-----~~~~l~~t~a~al~~~~~N~y~~~~~~~~~~~~~~gg~~sG~~~~i--d~~~~~~WL~ 377 (494) T protein:vir:94 306 -AEGRTTLALRSPVSSA-----GVRVDNLANANALLSNGYTYLGKYASATNTYTVTYNGAIGGQFLWA--DTALGWIALR 377 (494) T ss_pred -cCcceeEEeeccCCCC-----CCccCCHHHHHHHHhcCCeEEEEecccCceEEEecCceecccccee--eeeccHHHHH Confidence 333333321 1111 11225678899999999999855432 7888777777664442 2222334555 Q ss_pred HHHHHHHHHHhh----cCCCHHHHHHHHHHHHHHHHHHHhCCceee----------------------------eEEEE- Q lcl|NC_019932. 293 DTMAEAHMWAND----KPLTPVLVRDIIAGINAKFRELVSAGYLLG----------------------------ASCWY- 339 (389) Q Consensus 293 ~~~~~~~~~~v~----e~n~~~~~~~i~~~i~~~l~~l~~~gal~g----------------------------~~v~~- 339 (389) +.++..+...+- =|-|..=...|+..++.-|++-+++|.|.- |.+.. T Consensus 378 ~~iq~~l~~ll~~~~KIPytd~G~~~l~a~i~~~l~~av~nG~I~~Gv~~~~~q~~~i~~~~G~~~~~~~~~kGyy~~~~ 457 (494) T protein:vir:94 378 RNLQQALFETLLAYRSLPYNADGYNALYQGAQDVVSQFVAAGVIRAGVALSASQRAQIDQAAGVPISGDVVDKGWYLQVI 457 (494) T ss_pred HHHHHHHHHHHHhCCCcccChhhHHHHHHHHHHHHHHHHhCceeecccccCcchhhhhhhhhcCccccceeccceeeecc Confidence 555555544332 266888888999999999999999999942 23332 Q ss_pred e-cCCCCHHHhhCCEEEEEEEEEecccceEEEEEEEEcc Q lcl|NC_019932. 340 D-DTANDKDTLKAGKLFIDYDYTPVPPLEDLTLRQRITD 377 (389) Q Consensus 340 d-~~~n~~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~ 377 (389) + .+.|++.+...-++.+.. .--..+++|++.....- T Consensus 458 ~~~s~~~ra~R~~~~~~~~y--~~~GAIh~v~i~~~~v~ 494 (494) T protein:vir:94 458 DPITTTVRTDRGSPTVNFWY--CDGGSIQRVVVSATTVI 494 (494) T ss_pred CCCChhhhhccccCCceEEE--EecCcEEEEEEeeEEeC Confidence 2 344555544444444443 34778888887776555 Done!