Query lcl|NC_021557.1_cdsid_YP_008129849.1 [gene=RHYG_00035] [protein=tail sheath protein] [protein_id=YP_008129849.1] [location=23344..24603] Match_columns 419 No_of_seqs 164 out of 811 Neff 9.7 Searched_HMMs 1612 Date Thu Nov 7 17:50:26 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_35 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_35_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:107865 Length: 477 100.0 3E-110 2E-113 620.7 37.8 405 1-418 1-477 (477) 2 protein:vir:79092 Length: 477 100.0 3E-109 2E-112 616.0 38.2 405 1-418 1-477 (477) 3 protein:vir:103993 Length: 390 100.0 1E-108 7E-112 612.3 36.3 388 1-419 1-389 (390) 4 protein:vir:78206 Length: 390 100.0 1E-108 7E-112 612.3 36.3 388 1-419 1-389 (390) 5 protein:vir:79181 Length: 390 100.0 2E-107 1E-110 605.6 37.2 388 1-419 1-389 (390) 6 protein:vir:79141 Length: 391 100.0 1E-106 8E-110 601.2 35.4 388 1-419 1-389 (391) 7 protein:vir:98553 Length: 395 100.0 7E-106 4E-109 597.1 37.5 393 1-419 1-394 (395) 8 protein:vir:5711 Length: 396 # 100.0 2E-105 1E-108 594.2 37.4 393 1-419 1-394 (396) 9 protein:vir:2035 Length: 396 # 100.0 2E-105 1E-108 594.6 36.6 393 1-419 1-394 (396) 10 protein:vir:1172 Length: 391 # 100.0 1E-105 9E-109 595.5 35.3 388 1-419 2-390 (391) 11 protein:vir:6079 Length: 396 # 100.0 6E-105 4E-108 591.9 38.3 393 1-419 1-394 (396) 12 protein:vir:1845 Length: 392 # 100.0 6E-105 4E-108 592.0 37.2 390 1-419 1-391 (392) 13 protein:vir:100323 Length: 393 100.0 1E-104 7E-108 590.6 36.0 386 1-419 3-391 (393) 14 protein:vir:10336 Length: 386 100.0 2E-102 1E-105 577.8 34.9 384 1-415 1-386 (386) 15 protein:vir:96740 Length: 388 100.0 8E-102 5E-105 574.8 35.7 384 1-419 1-388 (388) 16 protein:vir:106984 Length: 743 100.0 2.7E-92 1.7E-95 522.6 36.8 396 1-417 1-743 (743) 17 protein:vir:6894 Length: 660 # 100.0 1.8E-91 1.1E-94 518.1 38.5 400 1-419 1-659 (660) 18 protein:vir:6594 Length: 666 # 100.0 3.1E-91 1.9E-94 516.8 37.1 400 1-419 1-664 (666) 19 protein:vir:80984 Length: 666 100.0 6.1E-91 3.8E-94 515.2 37.5 400 1-419 1-664 (666) 20 protein:vir:106427 Length: 679 100.0 8.8E-91 5.5E-94 514.3 37.3 400 1-419 1-678 (679) 21 protein:vir:98263 Length: 664 100.0 5.7E-91 3.5E-94 515.3 36.1 395 1-419 1-659 (664) 22 protein:vir:103456 Length: 659 100.0 7.6E-90 4.7E-93 509.2 37.2 400 1-419 1-659 (659) 23 protein:vir:108052 Length: 660 100.0 8.1E-90 5E-93 509.0 37.2 400 1-419 1-660 (660) 24 protein:vir:7206 Length: 659 # 100.0 5.3E-90 3.3E-93 510.0 35.9 400 1-419 1-659 (659) 25 protein:vir:104858 Length: 729 100.0 1.3E-89 7.9E-93 507.9 36.3 397 1-418 1-729 (729) 26 protein:vir:5663 Length: 671 # 100.0 2.1E-89 1.3E-92 506.8 35.4 398 1-419 1-670 (671) 27 protein:vir:101187 Length: 663 100.0 3E-89 1.9E-92 505.9 36.3 400 1-419 1-661 (663) 28 protein:vir:101804 Length: 663 100.0 5.9E-89 3.7E-92 504.3 36.1 400 1-419 1-661 (663) 29 protein:vir:100539 Length: 663 100.0 1.8E-87 1.1E-90 496.2 34.1 400 1-419 1-661 (663) 30 protein:vir:104477 Length: 749 100.0 4.9E-87 3.1E-90 493.7 36.2 394 1-416 1-749 (749) 31 protein:vir:98824 Length: 774 100.0 1.5E-82 9.2E-86 469.2 28.7 391 1-417 279-774 (774) 32 protein:vir:5833 Length: 742 # 100.0 1.3E-72 8.1E-76 414.7 27.1 376 1-416 343-742 (742) 33 protein:vir:79798 Length: 717 100.0 1.7E-50 1.1E-53 293.4 23.4 357 1-408 330-717 (717) 34 protein:vir:103168 Length: 641 100.0 1E-45 6.2E-49 267.3 21.4 282 1-311 3-641 (641) 35 protein:vir:63742 Length: 562 100.0 5.4E-43 3.4E-46 252.3 28.1 374 1-413 8-562 (562) 36 protein:vir:80779 Length: 569 100.0 5.7E-42 3.5E-45 246.7 27.6 374 1-413 1-569 (569) 37 protein:vir:80488 Length: 562 100.0 5.8E-42 3.6E-45 246.6 27.6 374 1-413 1-562 (562) 38 protein:vir:102819 Length: 648 100.0 1.1E-37 6.8E-41 223.2 31.4 373 1-412 1-648 (648) 39 protein:vir:95741 Length: 587 100.0 1.5E-37 9.5E-41 222.4 27.6 373 1-413 1-587 (587) 40 protein:vir:96586 Length: 587 100.0 5.8E-37 3.6E-40 219.2 28.3 374 1-413 1-587 (587) 41 protein:vir:99306 Length: 587 100.0 1.1E-36 7E-40 217.6 28.1 373 1-413 1-587 (587) 42 protein:vir:107310 Length: 581 100.0 7.8E-34 4.8E-37 202.1 20.9 369 1-419 160-572 (581) 43 protein:vir:100829 Length: 607 100.0 7.9E-33 4.9E-36 196.6 25.8 379 1-419 17-607 (607) 44 protein:vir:7653 Length: 581 # 100.0 7.2E-33 4.5E-36 196.8 22.2 367 1-419 159-572 (581) 45 protein:vir:102957 Length: 437 100.0 7.6E-30 4.7E-33 180.2 28.4 376 1-407 1-437 (437) 46 protein:vir:105470 Length: 451 99.9 1E-26 6.5E-30 163.0 26.6 375 1-407 1-451 (451) 47 protein:vir:101326 Length: 529 99.9 4.5E-23 2.8E-26 143.1 24.9 378 1-408 1-529 (529) 48 protein:vir:78986 Length: 436 99.7 3.4E-16 2.1E-19 105.4 28.9 372 1-407 3-436 (436) 49 protein:vir:102359 Length: 356 99.0 3.9E-09 2.4E-12 66.7 24.8 325 1-406 1-356 (356) 50 protein:vir:276 Length: 369 # 98.9 1.4E-08 8.6E-12 63.7 26.6 345 1-411 1-369 (369) 51 protein:vir:3788 Length: 376 # 98.9 1.8E-08 1.1E-11 63.0 24.8 349 5-413 1-376 (376) 52 protein:vir:78782 Length: 370 98.8 1.6E-08 9.9E-12 63.3 22.5 348 5-416 1-370 (370) 53 protein:vir:95263 Length: 450 98.7 1.3E-07 8E-11 58.4 27.4 371 1-409 1-450 (450) 54 protein:vir:4517 Length: 498 # 98.6 1.5E-07 9.5E-11 57.9 24.3 367 1-411 1-498 (498) 55 protein:vir:3751 Length: 376 # 98.6 1.6E-07 9.8E-11 57.9 25.7 344 5-413 1-376 (376) 56 protein:vir:4463 Length: 498 # 98.6 1.7E-07 1.1E-10 57.6 23.4 370 1-419 1-498 (498) 57 protein:vir:489 Length: 498 # 98.6 2.8E-07 1.7E-10 56.5 23.9 366 1-411 1-498 (498) 58 protein:vir:1996 Length: 495 # 98.4 7.4E-07 4.6E-10 54.2 26.9 373 1-408 1-495 (495) 59 protein:vir:5260 Length: 502 # 98.4 1.1E-06 6.7E-10 53.3 27.8 375 1-408 1-502 (502) 60 protein:vir:80052 Length: 331 98.1 3.6E-06 2.2E-09 50.4 24.4 312 1-408 1-331 (331) 61 protein:vir:3165 Length: 426 # 98.0 6.7E-06 4.2E-09 48.9 24.3 370 1-408 1-426 (426) 62 protein:vir:3636 Length: 501 # 93.1 0.0086 5.4E-06 31.9 26.9 373 1-408 1-501 (501) 63 protein:vir:101576 Length: 501 93.1 0.0088 5.5E-06 31.9 28.3 373 1-408 1-501 (501) 64 protein:vir:106730 Length: 501 89.5 0.026 1.6E-05 29.3 27.6 375 1-408 1-501 (501) 65 protein:vir:78611 Length: 501 87.5 0.038 2.4E-05 28.4 28.7 376 1-408 1-501 (501) 66 protein:vir:99586 Length: 507 85.0 0.056 3.5E-05 27.5 27.0 373 1-407 1-507 (507) 67 protein:vir:94073 Length: 494 81.7 0.084 5.2E-05 26.5 24.3 368 1-408 1-494 (494) 68 protein:vir:96104 Length: 504 78.5 0.11 7.1E-05 25.8 26.2 373 1-407 1-504 (504) 69 protein:vir:107720 Length: 515 59.6 0.39 0.00024 22.8 23.1 379 1-407 1-515 (515) 70 protein:vir:108311 Length: 249 28.7 1.7 0.0011 19.3 8.8 100 315-419 1-121 (249) No 1 >protein:vir:107865 Length: 477 # NCBI annotation: gp39 # Family: family:all:115 # MgeID: mge:1565 # MgeName: BcepMu # Cross-refs: genbank:acc:YP_024712;genbank:gi:48696949;genbank:GeneID:2845953 Probab=100.00 E-value=3.5e-110 Score=620.67 Aligned_cols=405 Identities=38% Similarity=0.595 Sum_probs=362.2 Q ss_pred CCCccCCCeEEEEcCCCccCccccCccceEEEEccccccccccccccccccCcceeecchHHHHHHhcccchhhhHHHHH Q lcl|NC_021557. 1 MAATFHHGPEVIEHKDGVTVVRDVKSAVTYVNGTAPIQDVHATALAREDYINKRVIIRSRAEGAAAFGVHKAGYTIPAAL 80 (419) Q Consensus 1 Ma~~~~hGVyv~e~~~~~~~i~~v~tav~~~Vgta~~a~~~~~~~~~~~~~n~pv~its~~e~~~~fg~~~~~~~l~~al 80 (419) ||++|+||||++|+++++++|+.++|+|++|||+++.++ .|+|++|+|+.++.. ||.....++|++++ T Consensus 1 M~~~~~pGVyv~E~~~~~~~i~~v~T~v~~~VG~a~~gp-----------~n~pv~its~~d~~~-~g~~~~~~tL~~Av 68 (477) T protein:vir:10 1 MAANYLHGVETIEKETGSRPVKVVKSAVIGLIGTAPIGP-----------VNTPVQSLSDVDAAQ-FGPQLAGFTIPQAL 68 (477) T ss_pred CcccCCCCeEEEEccCCcccccccCCceeEEEecccCCC-----------CCcCEEEccHHHHHH-hccCCCCCcHHHHH Confidence 999999999999999999999999999999999998764 489999999999965 77777789999999 Q ss_pred HHHhhccCCcEEEEeecccccccccc------------------------------------------------------ Q lcl|NC_021557. 81 DAIFDQGDGGTIIVNNVFDPDVHKEG------------------------------------------------------ 106 (419) Q Consensus 81 ~~~~~~~~~~~~v~~~~~~~~~~~~~------------------------------------------------------ 106 (419) +.+|++++..++++++.......... T Consensus 69 ~~~f~nGg~~~~vVrV~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~a~~~~~~~~~~~~~~~~~~~~~~~~ 148 (477) T protein:vir:10 69 DAVYDYGSGTVIVINVLDPAVHKSNAANEPVTFDAATGRAKLAHPAAANLVLKNDSGGTTYAEGTDYAVDLINGVITRIK 148 (477) T ss_pred HHHHhccceEEEEEecCccccccccccccccccccccceecccccccccccccccccccccccchhhhhhhccccceecc Confidence 99999999999988774332100000 Q ss_pred ---------------ccccccccccceeccccccccccccchhhhhhhhhccccccccccchhhhhhhHHHHHHHHhhcc Q lcl|NC_021557. 107 ---------------ANPDPSKVTTVDINGTISPAGLASGFSGAYECYNNFGYFPKLIIAPGYSPAAAVRAEMDVVASRL 171 (419) Q Consensus 107 ---------------~~~~~~~~t~~~~~g~~~~~~~~tg~~a~~~~~~~~~~~p~~~~ap~~~~~~~v~a~l~~~~~~~ 171 (419) ...++.......+++..+.++..+|++++.++++.+...|.++.+|+++..++|.++|..+|++. T Consensus 149 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~tGl~al~~~~~~~~~~~~~l~apg~~~~~~v~~~l~~~~~~~ 228 (477) T protein:vir:10 149 TGTIPPGATAAKATYDYADPTKVTAADIIGAVNAAGMRTGMKALKDTYNLYGYFSKILIAPAYCTQNSVSVELEAMAVQL 228 (477) T ss_pred cccccccceeeeeccccccccccccccccccccccchhhhhhhhhhhhhhcchhcccccccccccchhhHHHHHHHHhhC Confidence 00011112223344444556678899999999999999999999999999999999999999999 Q ss_pred ceeEEEEeccCCCHHHHHhhhhhc--cccccCccceEEecceeEeeccccccceeeechHHHHHHHHHhhhhccCceecc Q lcl|NC_021557. 172 HALAIADLPLGLTKQQAVAARGVA--GTANTSSARTVLTYPHVVIEDTTGATETRLDPLSSRLAGVIIATDLNEGWQNSP 249 (419) Q Consensus 172 ~~~~i~d~p~~~~~~~~~~~~~~~--~~~~~~s~~~~~~~p~~~~~~~~~~~~~~~~p~s~~vAg~~a~~D~~~g~~~sp 249 (419) ++++++|+|.+.+.+++++++... ...+++|+|++++|||++++|+. ++..+++|||+++||++||+|.++|||||| T Consensus 229 ~~~~~~d~p~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~-~~~~~~~p~s~~~ag~~a~~d~~~g~~~sp 307 (477) T protein:vir:10 229 GAIAYIDAPIGTTLAQALAGRGPAGTINFNTSSDRVRLCYPHVKVYDTA-TNAERLEPLSSRAAGLRARVDLDKGYWWSS 307 (477) T ss_pred CEEEEEecCCCCCHHHHHhhhhhccccccccccceEEEEcCeEEEeccc-CCceeEEchHHHHHHHHHHhhhcCCceecc Confidence 999999999988888898888653 34567899999999999999864 556789999999999999999999999999 Q ss_pred cCceeeceeecceecccccCCcchhhccccCCceEEEEEecCCcEEEEeccccCCCCCcccceeeehhhHHHHHHHHHHH Q lcl|NC_021557. 250 SNREIKGVVDLEVPINFYPSDYQNDTNFLNEAGIVTAMRSFATGIRVFGNRSAAFPTSSHVENFIHARRILDMIHEAIIF 329 (419) Q Consensus 250 an~~l~gv~~~~~~~~~~~~~~~~~~~~L~~~gI~~i~~~~~~G~~~wG~rT~~~~s~~~~~~~i~vrR~~~~i~~~~~~ 329 (419) +|++|.||.++++++.+.+++.++|+++||++|||+|++++++|+++||+||++++++|+.|+|+++|||+++|+++|++ T Consensus 308 an~~~~gi~~~~~~~~~~~~~~~~~~~~L~~~gi~~i~~~~~~G~~~wG~rT~~~~~~~~~~~~~~vrR~~~~i~~~~~~ 387 (477) T protein:vir:10 308 SNQQLVGVTGVERPLSAMIDDPQSDVNMLNEQGITTVFSSYGSGLRLWGNRTAAWPTVTHMRNFENVRRTGDVINESLRY 387 (477) T ss_pred CCceeccccccccccccccCCChhhHHHHhhCCceEEEEecCCcEEEEcccccCCCCCCcccceeehhhHHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHhhcCCCCHHHHHHHHHHHHHHHHHHHhhc-ccceEEEEecccCCHHHhhCCEEEEEEEEEeccCceEEEEEEEEcc Q lcl|NC_021557. 330 YTMNYVDRLGSPMTVEAAEEGVNAYLRSKTGIA-IYGGTFRFDRQKNTAEQIADGKFYYRLECHPISVMERITIDSYVDT 408 (419) Q Consensus 330 ~~~~~v~e~n~~~~~~~i~~~i~~~L~~l~~~g-~~~~~v~~d~~~n~~~~i~~G~~~~~v~~~p~~p~e~i~~~~~~~~ 408 (419) .++|+|||||++.+|++|+++++.||++||++| ++||+|+||+++||+++|++|+|+++|+++|++|+|||+|++++++ T Consensus 388 ~~~~~v~~~~~~~~~~~i~~~i~~~l~~l~~~g~l~g~~v~~~~~~nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~ 467 (477) T protein:vir:10 388 FSQQFVDAPIDQGLIDSLVESVNGFGRKLIGDGALLGFKAWFDPARNPKEELAAGHLLINYKYTVPPPLERLTYETEITS 467 (477) T ss_pred HHHHhccCCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEecCCCCHHHhhCCeEEEEEEEEecCCcceEEEEEEEcc Confidence 999999999999999999999999999999865 8899999999999999999999999999999999999999999999 Q ss_pred hHHHHHHHhc Q lcl|NC_021557. 409 KFISNALSLA 418 (419) Q Consensus 409 ~~~~~~~~~~ 418 (419) +||+++|+.- T Consensus 468 ~~~~~~~~g~ 477 (477) T protein:vir:10 468 EYLLTLKGGN 477 (477) T ss_pred hHHhhhhcCC Confidence 9999999888 No 2 >protein:vir:79092 Length: 477 # NCBI annotation: gp13, phage tail sheath protein # Family: family:all:115 # MgeID: mge:1862 # MgeName: phiE255 # Cross-refs: genbank:acc:YP_001111213;genbank:gi:134288804;genbank:GeneID:4960766 Probab=100.00 E-value=2.5e-109 Score=615.97 Aligned_cols=405 Identities=38% Similarity=0.603 Sum_probs=359.5 Q ss_pred CCCccCCCeEEEEcCCCccCccccCccceEEEEccccccccccccccccccCcceeecchHHHHHHhcccchhhhHHHHH Q lcl|NC_021557. 1 MAATFHHGPEVIEHKDGVTVVRDVKSAVTYVNGTAPIQDVHATALAREDYINKRVIIRSRAEGAAAFGVHKAGYTIPAAL 80 (419) Q Consensus 1 Ma~~~~hGVyv~e~~~~~~~i~~v~tav~~~Vgta~~a~~~~~~~~~~~~~n~pv~its~~e~~~~fg~~~~~~~l~~al 80 (419) ||++|+|||||+|+++++++|+.++|+|++|||++++++ .|+|++|+|+.|+.. ||.....++|+.++ T Consensus 1 M~~~~~pGVyv~E~~~g~~~I~~v~Tsv~~~VG~a~~~p-----------~n~pv~its~~d~~~-~g~~~~~~tL~~Av 68 (477) T protein:vir:79 1 MAANYLHGVETIEKETGSRPVKVVKSAVIGLIGTAPIGP-----------VNTPVQSLSDVDAAQ-FGPQLAGFTIPQAL 68 (477) T ss_pred CcCCCCCCeEEEEecCCcccccccCCceEEEEeecccCC-----------CcccEEEccHHHHHH-hcCCCCCCcHHHHH Confidence 999999999999999999999999999999999998773 489999999999986 56666778999999 Q ss_pred HHHhhccCCcEEEEeecccccccccc------------------------------------------------------ Q lcl|NC_021557. 81 DAIFDQGDGGTIIVNNVFDPDVHKEG------------------------------------------------------ 106 (419) Q Consensus 81 ~~~~~~~~~~~~v~~~~~~~~~~~~~------------------------------------------------------ 106 (419) +.+|++++..+++++........... T Consensus 69 ~~~f~ngg~~~~vvrV~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 148 (477) T protein:vir:79 69 DAVYDYGSGTVIVINVLDPAVHKSNAASESVTFDAATGRAKLAHPAAANLVLKNDSGGTTYTEGTDYAVDLINGVITRIK 148 (477) T ss_pred HHHhhcCCceEEEEeccCCccccccccccccccccccccccccccccceeEEeecccccccccCccccccccchhhhhhh Confidence 99999999999988764322100000 Q ss_pred ---------------ccccccccccceeccccccccccccchhhhhhhhhccccccccccchhhhhhhHHHHHHHHhhcc Q lcl|NC_021557. 107 ---------------ANPDPSKVTTVDINGTISPAGLASGFSGAYECYNNFGYFPKLIIAPGYSPAAAVRAEMDVVASRL 171 (419) Q Consensus 107 ---------------~~~~~~~~t~~~~~g~~~~~~~~tg~~a~~~~~~~~~~~p~~~~ap~~~~~~~v~a~l~~~~~~~ 171 (419) ...+.......+.++..+..+..+|++++..++..+...|.++.+|+++..+.+.++|..+|++. T Consensus 149 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~a~~~~tg~~al~~~~~~~~~~~~iv~apg~~~~~~v~~~l~~~~~~~ 228 (477) T protein:vir:79 149 TGTIPAAATAAKATYDYADPTKVTAADIIGAVNAAGMRTGMKALKDTYNLYGYFSKILIAPAYCTQNSVSVELEAMAVQL 228 (477) T ss_pred ccccccccceeeceeccCCcccceeeeecccccccccchhhhhhhhhhhhcccccceeeccccccchhHHHHHHHHHhhc Confidence 00001111222333444445667888899999999999999999999999999999999999999 Q ss_pred ceeEEEEeccCCCHHHHHhhhhhcc--ccccCccceEEecceeEeeccccccceeeechHHHHHHHHHhhhhccCceecc Q lcl|NC_021557. 172 HALAIADLPLGLTKQQAVAARGVAG--TANTSSARTVLTYPHVVIEDTTGATETRLDPLSSRLAGVIIATDLNEGWQNSP 249 (419) Q Consensus 172 ~~~~i~d~p~~~~~~~~~~~~~~~~--~~~~~s~~~~~~~p~~~~~~~~~~~~~~~~p~s~~vAg~~a~~D~~~g~~~sp 249 (419) ++++++|+|.+.+..++.+++.... ..+++|+|++++|||++++|+. ++..+++|||+++||++||+|.++|||+|| T Consensus 229 ~~~a~~d~p~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~~~~-~~~~~~~p~s~~~ag~~a~~d~~~g~~~sp 307 (477) T protein:vir:79 229 GAIAYIDAPIGTTLAQALAGRGPAGTINFNTSSDRVRLCYPHVKVYDIA-TNAERLEPLSSRAAGLRARVDLDKGYWWSS 307 (477) T ss_pred CeEEEEecCCCCChHHHhhhhhhccccccccccceEEEEcCeeEEeccc-CCceeeechHHHHHHHHHHhhccCCceEcc Confidence 9999999998888888888876533 4568899999999999999864 556788999999999999999999999999 Q ss_pred cCceeeceeecceecccccCCcchhhccccCCceEEEEEecCCcEEEEeccccCCCCCcccceeeehhhHHHHHHHHHHH Q lcl|NC_021557. 250 SNREIKGVVDLEVPINFYPSDYQNDTNFLNEAGIVTAMRSFATGIRVFGNRSAAFPTSSHVENFIHARRILDMIHEAIIF 329 (419) Q Consensus 250 an~~l~gv~~~~~~~~~~~~~~~~~~~~L~~~gI~~i~~~~~~G~~~wG~rT~~~~s~~~~~~~i~vrR~~~~i~~~~~~ 329 (419) +|++|.||.++++++.+.+++.++|++.||++|||+|++++++|+++||+||++++++++.||||++||++++|+++|++ T Consensus 308 an~~~~gv~~~~~~~~~~~~~~~~~~~~L~~~~i~~i~~~~~~G~~~wG~rT~~~~~~~~~~~~i~vrR~~~~i~~~~~~ 387 (477) T protein:vir:79 308 SNQQLVGVTGVERPLSAMIDDPQSDVNMLNEQGITTVFSSYGSGLRLWGNRTAAWPTVTHMRNFENVRRTGDVINESLRY 387 (477) T ss_pred CCceeecceecccccccccCCChhhHHHHhhCCceEEEEecCCcEEEEcccccCCCCCCccceeeehhhHHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHhhcCCCCHHHHHHHHHHHHHHHHHHHhh-cccceEEEEecccCCHHHhhCCEEEEEEEEEeccCceEEEEEEEEcc Q lcl|NC_021557. 330 YTMNYVDRLGSPMTVEAAEEGVNAYLRSKTGI-AIYGGTFRFDRQKNTAEQIADGKFYYRLECHPISVMERITIDSYVDT 408 (419) Q Consensus 330 ~~~~~v~e~n~~~~~~~i~~~i~~~L~~l~~~-g~~~~~v~~d~~~n~~~~i~~G~~~~~v~~~p~~p~e~i~~~~~~~~ 408 (419) .++|++||||++.+|++|+++++.||++||++ +++||+|+||+++||+++|++|+|+++|+++|++|+|||+|++++++ T Consensus 388 ~~~~~v~e~~~~~~~~~i~~~i~~~l~~l~~~g~l~g~~v~~~~~~nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~ 467 (477) T protein:vir:79 388 FSQQFVDAPIDQGLIDSLVESVNGFGRKLIGDGALLGFKAWFDPARNPKEELAAGHLLINYKYTVPPPLERLTYETEITS 467 (477) T ss_pred HHHHhccCCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEecCCCCHHHhhCCeEEEEEEEEecCCceeEEEEEEEec Confidence 99999999999999999999999999999986 48899999999999999999999999999999999999999999999 Q ss_pred hHHHHHHHhc Q lcl|NC_021557. 409 KFISNALSLA 418 (419) Q Consensus 409 ~~~~~~~~~~ 418 (419) +||+++++.- T Consensus 468 ~~~~~~~~~~ 477 (477) T protein:vir:79 468 EYLLTLKGGN 477 (477) T ss_pred hHHhhhccCC Confidence 9999999888 No 3 >protein:vir:103993 Length: 390 # NCBI annotation: phage tail sheath protein # Family: family:all:115 # MgeID: mge:1665 # MgeName: phi52237 # Cross-refs: genbank:acc:YP_293729;genbank:gi:72537699;genbank:GeneID:3608123 Probab=100.00 E-value=1.2e-108 Score=612.33 Aligned_cols=388 Identities=26% Similarity=0.370 Sum_probs=357.5 Q ss_pred CCCccCCCeEEEEcCCCccCccccCccceEEEEccccccccccccccccccCcceeecchHHHHHHhcccchhhhHHHHH Q lcl|NC_021557. 1 MAATFHHGPEVIEHKDGVTVVRDVKSAVTYVNGTAPIQDVHATALAREDYINKRVIIRSRAEGAAAFGVHKAGYTIPAAL 80 (419) Q Consensus 1 Ma~~~~hGVyv~e~~~~~~~i~~v~tav~~~Vgta~~a~~~~~~~~~~~~~n~pv~its~~e~~~~fg~~~~~~~l~~al 80 (419) ||++|+|||||+|++++++++..++|++|+|||++++++...+++ |+|++++|+.++...||. .++|.+++ T Consensus 1 M~~~~~~Gv~v~e~~~g~~~i~~~~tav~g~vg~a~~ad~~~~pl------n~pv~i~s~~~~~~~~g~---~gtL~~al 71 (390) T protein:vir:10 1 MPQDYHHGVRVIEINEGGRPIRSVSTAVLGVVCTAADADASAFPL------NTPVLLTNVVAALGKAGK---KGTLRRTL 71 (390) T ss_pred CcccccCCeEEEEcCCCcccccccCcceeEEEEcccCcCcccccc------ccceEeccHHHHHhhcCC---Cceehhhh Confidence 999999999999999999999999999999999999998877664 899999999999999886 47899999 Q ss_pred HHHhhccCCcEEEEeeccccccccccccccccccccceeccccccccccccchhhhhhhhhccccccccccchhhhhhhH Q lcl|NC_021557. 81 DAIFDQGDGGTIIVNNVFDPDVHKEGANPDPSKVTTVDINGTISPAGLASGFSGAYECYNNFGYFPKLIIAPGYSPAAAV 160 (419) Q Consensus 81 ~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~t~~~~~g~~~~~~~~tg~~a~~~~~~~~~~~p~~~~ap~~~~~~~v 160 (419) +.++++++..++++.... .++...+..++++....++..+|++++...++.++..|.++.+|++++.+ + T Consensus 72 ~~~~~~gg~~~~vv~v~~----------~~~~~~~~~~~ig~~~~~~~~tg~~al~~~~~~~~~~p~il~ap~~~~~~-v 140 (390) T protein:vir:10 72 DAIGKQTKPLTVVVRVAE----------GKDADETTSNVIGTVTPDGKYTGIKALLAAQGALGVKPRILAAPGLDTQP-V 140 (390) T ss_pred hhhccccCceEEEEEecc----------cccccccccccccccccccccchhhhhhhhhhhhcceehhhcccccchHH-H Confidence 999999999998887532 23345556667777777888999999999999999999999999998764 7 Q ss_pred HHHHHHHhhccceeEEEEeccCCCHHHHHhhhhhccccccCccceEEecceeEeeccccccceeeechHHHHHHHHHhhh Q lcl|NC_021557. 161 RAEMDVVASRLHALAIADLPLGLTKQQAVAARGVAGTANTSSARTVLTYPHVVIEDTTGATETRLDPLSSRLAGVIIATD 240 (419) Q Consensus 161 ~a~l~~~~~~~~~~~i~d~p~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~~~~~~~~~~~~p~s~~vAg~~a~~D 240 (419) .++|..+|+++++++++|+|.+.+..+++.+|. +++|+|+++||||++++|+. ++..+++|||+++||+++++| T Consensus 141 ~~~l~~~a~~~~~~aivD~p~~~t~~~a~~~~~-----~~~s~~~~~~~p~~~~~d~~-~~~~~~~p~s~~~Agl~a~~D 214 (390) T protein:vir:10 141 AAALAATAQSLRAMAYVSASGCKTKEEAAAYRK-----QFGQREIMVIWPDWLGWDDT-TNSTAVIPAPAIAAGLRAKID 214 (390) T ss_pred HHHHHHhhcccceEEEEecCCCCCHHHHHHHhh-----ccCCceEEEEcCceEeeccc-CCcccccchHHHHHHHHHHhh Confidence 777888889999999999998888899998874 58899999999999999864 556789999999999999999 Q ss_pred hccCceecccCceeeceeecceecccccCCcchhhccccCCceEEEEEecCCcEEEEeccccCCCCCcccceeeehhhHH Q lcl|NC_021557. 241 LNEGWQNSPSNREIKGVVDLEVPINFYPSDYQNDTNFLNEAGIVTAMRSFATGIRVFGNRSAAFPTSSHVENFIHARRIL 320 (419) Q Consensus 241 ~~~g~~~span~~l~gv~~~~~~~~~~~~~~~~~~~~L~~~gI~~i~~~~~~G~~~wG~rT~~~~s~~~~~~~i~vrR~~ 320 (419) .++|||+||+|+.|.|+.++++++++..++..+|+++||++||+++++ ++||++||+||++ +|++|+||++|||+ T Consensus 215 ~~~g~~~spaN~~l~gi~~~~~~~~~~~~~~~~~~~~ln~~gi~t~~~--~~G~~~wG~rT~s---~d~~~~~i~~rR~~ 289 (390) T protein:vir:10 215 NDIGWHKTISNVVVNGVSGISADVSWDLQDPATDAGYLNEHEVTTLVN--RNGFRFWGERTCS---DDPKFAFENYTRTA 289 (390) T ss_pred cCCCcEECcCCceeeceeecceecccccccccchhhhhhhcCcEEEEc--CCCEEEEcccccC---CCcccceeehhhHH Confidence 999999999999999999999999999999999999999999999975 5899999999994 47789999999999 Q ss_pred HHHHHHHHHHHHHhhcCCCCHHHHHHHHHHHHHHHHHHHhhc-ccceEEEEecccCCHHHhhCCEEEEEEEEEeccCceE Q lcl|NC_021557. 321 DMIHEAIIFYTMNYVDRLGSPMTVEAAEEGVNAYLRSKTGIA-IYGGTFRFDRQKNTAEQIADGKFYYRLECHPISVMER 399 (419) Q Consensus 321 ~~i~~~~~~~~~~~v~e~n~~~~~~~i~~~i~~~L~~l~~~g-~~~~~v~~d~~~n~~~~i~~G~~~~~v~~~p~~p~e~ 399 (419) ++|+++|+++++|++||||++.+|++|++++++||++||++| ++||+|+||+++||+++|++|+|+++|+++|++|+|| T Consensus 290 ~~i~~~i~~~~~~~v~e~n~~~~~~~i~~~i~~~L~~l~~~g~l~g~~v~~d~~~nt~~~i~~G~~~~~v~~~p~~pae~ 369 (390) T protein:vir:10 290 QVAGDSIAEAQMPVVDGPLNPSLARDIVESINGWFRQQVANGYLIGGSAWIDPEPNTADILASGKAYIDYDYTPVPPLEN 369 (390) T ss_pred HHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEccCCCCHHHhhCCeEEEEEEEEecCCcce Confidence 999999999999999999999999999999999999999865 7899999999999999999999999999999999999 Q ss_pred EEEEEEEcchHHHHHHHhcC Q lcl|NC_021557. 400 ITIDSYVDTKFISNALSLAA 419 (419) Q Consensus 400 i~~~~~~~~~~~~~~~~~~a 419 (419) |+|+++++++||+++|++++ T Consensus 370 I~~~~~~~~~~~~~~~~~~~ 389 (390) T protein:vir:10 370 LVLRQRITDRFLADFPARVA 389 (390) T ss_pred EEEEEEEchHHHHHHHHHhc Confidence 99999999999999999999 No 4 >protein:vir:78206 Length: 390 # NCBI annotation: gp33, phage tail sheath protein # Family: family:all:115 # MgeID: mge:1848 # MgeName: phiE12-2 # Cross-refs: genbank:acc:YP_001111183;genbank:gi:134288694;genbank:GeneID:4960666 Probab=100.00 E-value=1.2e-108 Score=612.33 Aligned_cols=388 Identities=26% Similarity=0.370 Sum_probs=357.5 Q ss_pred CCCccCCCeEEEEcCCCccCccccCccceEEEEccccccccccccccccccCcceeecchHHHHHHhcccchhhhHHHHH Q lcl|NC_021557. 1 MAATFHHGPEVIEHKDGVTVVRDVKSAVTYVNGTAPIQDVHATALAREDYINKRVIIRSRAEGAAAFGVHKAGYTIPAAL 80 (419) Q Consensus 1 Ma~~~~hGVyv~e~~~~~~~i~~v~tav~~~Vgta~~a~~~~~~~~~~~~~n~pv~its~~e~~~~fg~~~~~~~l~~al 80 (419) ||++|+|||||+|++++++++..++|++|+|||++++++...+++ |+|++++|+.++...||. .++|.+++ T Consensus 1 M~~~~~~Gv~v~e~~~g~~~i~~~~tav~g~vg~a~~ad~~~~pl------n~pv~i~s~~~~~~~~g~---~gtL~~al 71 (390) T protein:vir:78 1 MPQDYHHGVRVIEINEGGRPIRSVSTAVLGVVCTAADADASAFPL------NTPVLLTNVVAALGKAGK---KGTLRRTL 71 (390) T ss_pred CcccccCCeEEEEcCCCcccccccCcceeEEEEcccCcCcccccc------ccceEeccHHHHHhhcCC---Cceehhhh Confidence 999999999999999999999999999999999999998877664 899999999999999886 47899999 Q ss_pred HHHhhccCCcEEEEeeccccccccccccccccccccceeccccccccccccchhhhhhhhhccccccccccchhhhhhhH Q lcl|NC_021557. 81 DAIFDQGDGGTIIVNNVFDPDVHKEGANPDPSKVTTVDINGTISPAGLASGFSGAYECYNNFGYFPKLIIAPGYSPAAAV 160 (419) Q Consensus 81 ~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~t~~~~~g~~~~~~~~tg~~a~~~~~~~~~~~p~~~~ap~~~~~~~v 160 (419) +.++++++..++++.... .++...+..++++....++..+|++++...++.++..|.++.+|++++.+ + T Consensus 72 ~~~~~~gg~~~~vv~v~~----------~~~~~~~~~~~ig~~~~~~~~tg~~al~~~~~~~~~~p~il~ap~~~~~~-v 140 (390) T protein:vir:78 72 DAIGKQTKPLTVVVRVAE----------GKDADETTSNVIGTVTPDGKYTGIKALLAAQGALGVKPRILAAPGLDTQP-V 140 (390) T ss_pred hhhccccCceEEEEEecc----------cccccccccccccccccccccchhhhhhhhhhhhcceehhhcccccchHH-H Confidence 999999999998887532 23345556667777777888999999999999999999999999998764 7 Q ss_pred HHHHHHHhhccceeEEEEeccCCCHHHHHhhhhhccccccCccceEEecceeEeeccccccceeeechHHHHHHHHHhhh Q lcl|NC_021557. 161 RAEMDVVASRLHALAIADLPLGLTKQQAVAARGVAGTANTSSARTVLTYPHVVIEDTTGATETRLDPLSSRLAGVIIATD 240 (419) Q Consensus 161 ~a~l~~~~~~~~~~~i~d~p~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~~~~~~~~~~~~p~s~~vAg~~a~~D 240 (419) .++|..+|+++++++++|+|.+.+..+++.+|. +++|+|+++||||++++|+. ++..+++|||+++||+++++| T Consensus 141 ~~~l~~~a~~~~~~aivD~p~~~t~~~a~~~~~-----~~~s~~~~~~~p~~~~~d~~-~~~~~~~p~s~~~Agl~a~~D 214 (390) T protein:vir:78 141 AAALAATAQSLRAMAYVSASGCKTKEEAAAYRK-----QFGQREIMVIWPDWLGWDDT-TNSTAVIPAPAIAAGLRAKID 214 (390) T ss_pred HHHHHHhhcccceEEEEecCCCCCHHHHHHHhh-----ccCCceEEEEcCceEeeccc-CCcccccchHHHHHHHHHHhh Confidence 777888889999999999998888899998874 58899999999999999864 556789999999999999999 Q ss_pred hccCceecccCceeeceeecceecccccCCcchhhccccCCceEEEEEecCCcEEEEeccccCCCCCcccceeeehhhHH Q lcl|NC_021557. 241 LNEGWQNSPSNREIKGVVDLEVPINFYPSDYQNDTNFLNEAGIVTAMRSFATGIRVFGNRSAAFPTSSHVENFIHARRIL 320 (419) Q Consensus 241 ~~~g~~~span~~l~gv~~~~~~~~~~~~~~~~~~~~L~~~gI~~i~~~~~~G~~~wG~rT~~~~s~~~~~~~i~vrR~~ 320 (419) .++|||+||+|+.|.|+.++++++++..++..+|+++||++||+++++ ++||++||+||++ +|++|+||++|||+ T Consensus 215 ~~~g~~~spaN~~l~gi~~~~~~~~~~~~~~~~~~~~ln~~gi~t~~~--~~G~~~wG~rT~s---~d~~~~~i~~rR~~ 289 (390) T protein:vir:78 215 NDIGWHKTISNVVVNGVSGISADVSWDLQDPATDAGYLNEHEVTTLVN--RNGFRFWGERTCS---DDPKFAFENYTRTA 289 (390) T ss_pred cCCCcEECcCCceeeceeecceecccccccccchhhhhhhcCcEEEEc--CCCEEEEcccccC---CCcccceeehhhHH Confidence 999999999999999999999999999999999999999999999975 5899999999994 47789999999999 Q ss_pred HHHHHHHHHHHHHhhcCCCCHHHHHHHHHHHHHHHHHHHhhc-ccceEEEEecccCCHHHhhCCEEEEEEEEEeccCceE Q lcl|NC_021557. 321 DMIHEAIIFYTMNYVDRLGSPMTVEAAEEGVNAYLRSKTGIA-IYGGTFRFDRQKNTAEQIADGKFYYRLECHPISVMER 399 (419) Q Consensus 321 ~~i~~~~~~~~~~~v~e~n~~~~~~~i~~~i~~~L~~l~~~g-~~~~~v~~d~~~n~~~~i~~G~~~~~v~~~p~~p~e~ 399 (419) ++|+++|+++++|++||||++.+|++|++++++||++||++| ++||+|+||+++||+++|++|+|+++|+++|++|+|| T Consensus 290 ~~i~~~i~~~~~~~v~e~n~~~~~~~i~~~i~~~L~~l~~~g~l~g~~v~~d~~~nt~~~i~~G~~~~~v~~~p~~pae~ 369 (390) T protein:vir:78 290 QVAGDSIAEAQMPVVDGPLNPSLARDIVESINGWFRQQVANGYLIGGSAWIDPEPNTADILASGKAYIDYDYTPVPPLEN 369 (390) T ss_pred HHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEccCCCCHHHhhCCeEEEEEEEEecCCcce Confidence 999999999999999999999999999999999999999865 7899999999999999999999999999999999999 Q ss_pred EEEEEEEcchHHHHHHHhcC Q lcl|NC_021557. 400 ITIDSYVDTKFISNALSLAA 419 (419) Q Consensus 400 i~~~~~~~~~~~~~~~~~~a 419 (419) |+|+++++++||+++|++++ T Consensus 370 I~~~~~~~~~~~~~~~~~~~ 389 (390) T protein:vir:78 370 LVLRQRITDRFLADFPARVA 389 (390) T ss_pred EEEEEEEchHHHHHHHHHhc Confidence 99999999999999999999 No 5 >protein:vir:79181 Length: 390 # NCBI annotation: gp30, phage tail sheath protein # Family: family:all:115 # MgeID: mge:1866 # MgeName: phiE202 # Cross-refs: genbank:acc:YP_001111061;genbank:gi:134288772;genbank:GeneID:4960700 Probab=100.00 E-value=2e-107 Score=605.60 Aligned_cols=388 Identities=26% Similarity=0.372 Sum_probs=355.9 Q ss_pred CCCccCCCeEEEEcCCCccCccccCccceEEEEccccccccccccccccccCcceeecchHHHHHHhcccchhhhHHHHH Q lcl|NC_021557. 1 MAATFHHGPEVIEHKDGVTVVRDVKSAVTYVNGTAPIQDVHATALAREDYINKRVIIRSRAEGAAAFGVHKAGYTIPAAL 80 (419) Q Consensus 1 Ma~~~~hGVyv~e~~~~~~~i~~v~tav~~~Vgta~~a~~~~~~~~~~~~~n~pv~its~~e~~~~fg~~~~~~~l~~al 80 (419) |+++|+|||||+|++++++++..++|++|+|||++++++...+++ |+|++++|+.++..+||. .++|.+++ T Consensus 1 M~~~~~~Gv~v~e~~~~~~~i~~~~tav~~~vg~a~dad~~~~p~------n~pv~its~~~~~~~~g~---~~tL~~al 71 (390) T protein:vir:79 1 MPQDYHHGVRVIEINEGGRPIRSVSTAVLGVVCTAADADASAFPL------NTPVLLTNVVAALGKAGK---KGTLRRTL 71 (390) T ss_pred CccccCCCeEEEEcCCCcccccccCCceeEEEEecCCCCcccccc------ccceEeecHHHHHHhcCC---Cccchhhh Confidence 999999999999999999999999999999999999998877764 899999999999999987 47889999 Q ss_pred HHHhhccCCcEEEEeeccccccccccccccccccccceeccccccccccccchhhhhhhhhccccccccccchhhhhhhH Q lcl|NC_021557. 81 DAIFDQGDGGTIIVNNVFDPDVHKEGANPDPSKVTTVDINGTISPAGLASGFSGAYECYNNFGYFPKLIIAPGYSPAAAV 160 (419) Q Consensus 81 ~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~t~~~~~g~~~~~~~~tg~~a~~~~~~~~~~~p~~~~ap~~~~~~~v 160 (419) +.+|.+++..++++..... .....+..+.++....++..+|++++.+.++.+...|.++.+|++++. ++ T Consensus 72 ~~~~~~~~~~~~vv~v~~~----------~~~~~~~~~~ig~~~~~~~~tgl~al~~~~~~~~~~p~il~ap~~~~~-~v 140 (390) T protein:vir:79 72 DAIGKQTKPLTVVVRVAEG----------KDADETTSNVIGTVTPDGKYTGIKALLAAQGALGVKPRILAAPGLDTQ-PV 140 (390) T ss_pred hhhcccccceEEEEeeccc----------cccccccceeeecccccccchhhhhhhhhhhhhccccccccCCcccch-HH Confidence 9999999999888875422 122334455566666778899999999999999999999999999865 57 Q ss_pred HHHHHHHhhccceeEEEEeccCCCHHHHHhhhhhccccccCccceEEecceeEeeccccccceeeechHHHHHHHHHhhh Q lcl|NC_021557. 161 RAEMDVVASRLHALAIADLPLGLTKQQAVAARGVAGTANTSSARTVLTYPHVVIEDTTGATETRLDPLSSRLAGVIIATD 240 (419) Q Consensus 161 ~a~l~~~~~~~~~~~i~d~p~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~~~~~~~~~~~~p~s~~vAg~~a~~D 240 (419) .++|..+|++.++++++|+|.+.+..++.+||. +++|+|+++||||++++|+. ++..+++|||+++||++||+| T Consensus 141 ~~~l~~~a~~~~~~ai~D~p~~~t~~~a~~~~~-----~~~s~~~~~~~p~~~~~d~~-~~~~~~~p~s~~~Ag~~a~~D 214 (390) T protein:vir:79 141 AAALAATAQSLRAMAYVSASGCKTKEEAAAYRR-----QFGQREIMVIWPDWLGWDDT-TNSTAVIPAPAIAAGLRAKID 214 (390) T ss_pred HHHHHHhhhhcceEEEEEccCCCCHHHHHHHhc-----CCCCceEEEEcCceeecccc-cCceeEeehHHHHHHHHHhhh Confidence 777888999999999999998888889988873 58899999999999999864 566889999999999999999 Q ss_pred hccCceecccCceeeceeecceecccccCCcchhhccccCCceEEEEEecCCcEEEEeccccCCCCCcccceeeehhhHH Q lcl|NC_021557. 241 LNEGWQNSPSNREIKGVVDLEVPINFYPSDYQNDTNFLNEAGIVTAMRSFATGIRVFGNRSAAFPTSSHVENFIHARRIL 320 (419) Q Consensus 241 ~~~g~~~span~~l~gv~~~~~~~~~~~~~~~~~~~~L~~~gI~~i~~~~~~G~~~wG~rT~~~~s~~~~~~~i~vrR~~ 320 (419) +++|||+||+|+.|+|+.++++++++.+++.++|+++||++||+++++ ++||++||+||++ +|+.|+||++|||+ T Consensus 215 ~~~g~~~spsN~~i~gi~~~~~~~~~~~~~~~~~a~~Ln~~gi~t~~~--~~G~~~wG~rT~~---~d~~~~~i~vrR~~ 289 (390) T protein:vir:79 215 NDIGWHKTISNVVVNGVSGISADVSWDLQDPATDAGYLNEHEVTTLVN--RNGFRFWGERTCS---DDPKFAFENYTRTA 289 (390) T ss_pred ccCCcEEccCCceeeccceeeeeccccccccchhhhhhhhcCcEEEEc--CCCEEEEeccccC---CCcccceeeehhhH Confidence 999999999999999999999999999999999999999999999875 6899999999995 46789999999999 Q ss_pred HHHHHHHHHHHHHhhcCCCCHHHHHHHHHHHHHHHHHHHhhc-ccceEEEEecccCCHHHhhCCEEEEEEEEEeccCceE Q lcl|NC_021557. 321 DMIHEAIIFYTMNYVDRLGSPMTVEAAEEGVNAYLRSKTGIA-IYGGTFRFDRQKNTAEQIADGKFYYRLECHPISVMER 399 (419) Q Consensus 321 ~~i~~~~~~~~~~~v~e~n~~~~~~~i~~~i~~~L~~l~~~g-~~~~~v~~d~~~n~~~~i~~G~~~~~v~~~p~~p~e~ 399 (419) +||+++|+++++|++||||++.+|++|+++++.||++||++| ++||+|+||+++||++++++|+|+++|+++|++|+|| T Consensus 290 ~~i~~~i~~~~~~~v~e~~~~~~~~~i~~~i~~~L~~l~~~gal~g~~v~~d~~~nt~~~i~~G~~~~~i~~~p~~p~e~ 369 (390) T protein:vir:79 290 QVAADSIAEAQMPVVDGPLNPSLARDIVESINGWFRQQVANGYLIGGSAWIDPEPNTADILASGKAYIDYDYTPVPPLEN 369 (390) T ss_pred HHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEecCCCCHHHhhCCEEEEEEEEEecCCcce Confidence 999999999999999999999999999999999999999865 8899999999999999999999999999999999999 Q ss_pred EEEEEEEcchHHHHHHHhcC Q lcl|NC_021557. 400 ITIDSYVDTKFISNALSLAA 419 (419) Q Consensus 400 i~~~~~~~~~~~~~~~~~~a 419 (419) |+|+++++++||+++|+++| T Consensus 370 i~~~~~~~~~~~~~~~~~v~ 389 (390) T protein:vir:79 370 LVLRQRITDRFLADFPARVA 389 (390) T ss_pred EEEEEEEchHHHHHHHHHhc Confidence 99999999999999999999 No 6 >protein:vir:79141 Length: 391 # NCBI annotation: major tail seath protein gpFI # Family: family:all:115 # MgeID: mge:1863 # MgeName: RSA1 # Cross-refs: genbank:acc:YP_001165276;genbank:gi:145708101;genbank:GeneID:5247152 Probab=100.00 E-value=1.2e-106 Score=601.19 Aligned_cols=388 Identities=25% Similarity=0.360 Sum_probs=354.6 Q ss_pred CCCccCCCeEEEEcCCCccCccccCccceEEEEccccccccccccccccccCcceeecchHHHHHHhcccchhhhHHHHH Q lcl|NC_021557. 1 MAATFHHGPEVIEHKDGVTVVRDVKSAVTYVNGTAPIQDVHATALAREDYINKRVIIRSRAEGAAAFGVHKAGYTIPAAL 80 (419) Q Consensus 1 Ma~~~~hGVyv~e~~~~~~~i~~v~tav~~~Vgta~~a~~~~~~~~~~~~~n~pv~its~~e~~~~fg~~~~~~~l~~al 80 (419) |+++|+|||||+|++++++++..++|++|+||||+++++...+++ |+|++++|+.++...||. .+++.+++ T Consensus 1 M~~~~~pGv~v~e~~~~~~~i~~~~tav~~~vg~a~~a~~~~~p~------n~pv~iss~~~~~~~~g~---~gtl~~al 71 (391) T protein:vir:79 1 MPTDYHHGVRVVELNDGTRPIRTIETAVAGIVCTADDADAATFPL------DTPVLLTNPQAYIGKAGD---KGTLAHTL 71 (391) T ss_pred CCCCCCCCeEEEECCCCcccccccCCceEEEEeeccccccccccc------ccCEEeccHHHHHHhcCC---ccccchhh Confidence 999999999999999999999999999999999999998877765 899999999999999986 47889999 Q ss_pred HHHhhccCCcEEEEeeccccccccccccccccccccceeccccccccccccchhhhhhhhhccccccccccchhhhhhhH Q lcl|NC_021557. 81 DAIFDQGDGGTIIVNNVFDPDVHKEGANPDPSKVTTVDINGTISPAGLASGFSGAYECYNNFGYFPKLIIAPGYSPAAAV 160 (419) Q Consensus 81 ~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~t~~~~~g~~~~~~~~tg~~a~~~~~~~~~~~p~~~~ap~~~~~~~v 160 (419) +.++++++..++++...... ....+..++++..+.++..+|+.++.++++.++..|.++.+|++++.. + T Consensus 72 ~~~~~~gg~~~~vv~~~~~~----------~~~~~~~~~~g~~~~~~~~tGl~~l~~~~~~~~~~p~~l~~p~~~~~~-v 140 (391) T protein:vir:79 72 DAITDQTNPLTVVVRVAGGA----------SEAETTSNLIGTTNAAGRYTGMKALLTARNRFGVAPRILAVPGLDSLP-V 140 (391) T ss_pred hhhhcccccceeeecccccc----------ccccccccccccccchhhhHHHhhhhhhhhhhcccchhhcCCccchhH-H Confidence 99999999999887754322 233445566677777888999999999999999999999999998654 6 Q ss_pred HHHHHHHhhccceeEEEEeccCCCHHHHHhhhhhccccccCccceEEecceeEeeccccccceeeechHHHHHHHHHhhh Q lcl|NC_021557. 161 RAEMDVVASRLHALAIADLPLGLTKQQAVAARGVAGTANTSSARTVLTYPHVVIEDTTGATETRLDPLSSRLAGVIIATD 240 (419) Q Consensus 161 ~a~l~~~~~~~~~~~i~d~p~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~~~~~~~~~~~~p~s~~vAg~~a~~D 240 (419) .++|..+|++.++++++|+|.+.+..+++.++. +++|+|+++||||++++|+. ++..+++|||+++||++|++| T Consensus 141 ~~al~~~~~~~~~~ai~d~p~~~t~~~a~~~~~-----~~~s~~~a~~~P~~~~~d~~-~~~~~~~p~s~~~AG~~a~~D 214 (391) T protein:vir:79 141 GTELVTIAQKLRAFAYLSAYGCQTKEEAVAYRS-----NFGQREAMVMWPDFVGWDTA-ANAETTLWATARAVGLRAKID 214 (391) T ss_pred HHHHHHHHhhcCcEEEEECCCCCCHHHHHHHHh-----ccCCceeEEecceeeeecCc-CCceeeechHHHHHHHHHHhh Confidence 666667778889999999998888899988874 57899999999999999854 566889999999999999999 Q ss_pred hccCceecccCceeeceeecceecccccCCcchhhccccCCceEEEEEecCCcEEEEeccccCCCCCcccceeeehhhHH Q lcl|NC_021557. 241 LNEGWQNSPSNREIKGVVDLEVPINFYPSDYQNDTNFLNEAGIVTAMRSFATGIRVFGNRSAAFPTSSHVENFIHARRIL 320 (419) Q Consensus 241 ~~~g~~~span~~l~gv~~~~~~~~~~~~~~~~~~~~L~~~gI~~i~~~~~~G~~~wG~rT~~~~s~~~~~~~i~vrR~~ 320 (419) .++|||+||+|+.|.||+++++++++..++..++++.||++||+++++ ++||++||+||++ +|+.|+||++|||+ T Consensus 215 ~~~g~~~spaN~~l~gi~~~~~~~~~~~~~~~~~~~~Ln~~~I~t~~~--~~G~~~wG~rT~~---~d~~~~~i~~rR~~ 289 (391) T protein:vir:79 215 NDTGWHKTLSNVAVGGVTGLSRDVFWDLQDPATDAGYLNANEVTTLVH--RDGYRFWGSRTCS---ADPLFAFENYTRTA 289 (391) T ss_pred hcccceeccCCceehhhhccccccccccccccchhhhhhhcCceEEEC--CCcEEEEcccccC---CCcccceeehhhHH Confidence 999999999999999999999999999999999999999999999864 6899999999995 46789999999999 Q ss_pred HHHHHHHHHHHHHhhcCCCCHHHHHHHHHHHHHHHHHHHhhc-ccceEEEEecccCCHHHhhCCEEEEEEEEEeccCceE Q lcl|NC_021557. 321 DMIHEAIIFYTMNYVDRLGSPMTVEAAEEGVNAYLRSKTGIA-IYGGTFRFDRQKNTAEQIADGKFYYRLECHPISVMER 399 (419) Q Consensus 321 ~~i~~~~~~~~~~~v~e~n~~~~~~~i~~~i~~~L~~l~~~g-~~~~~v~~d~~~n~~~~i~~G~~~~~v~~~p~~p~e~ 399 (419) ++|+++|+++++|++||||++.+|++|++++++||++||++| ++||+++||+++||++++++|+|+++|+++|++|+|| T Consensus 290 ~~i~~~i~~~~~~~v~epn~~~~~~~i~~~i~~~l~~l~~~g~l~g~~v~~~~~~nt~~~i~~G~~~~~i~~~p~~p~e~ 369 (391) T protein:vir:79 290 QVLADTMAEAHMWANDLPMTPTLVRDLLEGINAKLRMLTRNGYLLGGAAWFDADANSKDTLKAGQLAIDYDYTPVPPLEN 369 (391) T ss_pred HHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhCCceeceEEEEecCCCCHHHhhCCEEEEEEEEEecCCcce Confidence 999999999999999999999999999999999999999865 8899999999999999999999999999999999999 Q ss_pred EEEEEEEcchHHHHHHHhcC Q lcl|NC_021557. 400 ITIDSYVDTKFISNALSLAA 419 (419) Q Consensus 400 i~~~~~~~~~~~~~~~~~~a 419 (419) |+|+++++++||++++++++ T Consensus 370 i~~~~~~~~~~~~~~~~~v~ 389 (391) T protein:vir:79 370 LTFRQRITDRYLMQFAEAVK 389 (391) T ss_pred EEEEEEEchHHHHHHHHHhh Confidence 99999999999999999998 No 7 >protein:vir:98553 Length: 395 # NCBI annotation: gp21 # Family: family:all:115 # MgeID: mge:1533 # MgeName: PSP3 # Cross-refs: genbank:acc:NP_958076;genbank:gi:41057373;genbank:GeneID:2744224 Probab=100.00 E-value=7e-106 Score=597.09 Aligned_cols=393 Identities=25% Similarity=0.340 Sum_probs=356.7 Q ss_pred CCCccCCCeEEEEcCCCccCccccCccceEEEEccccccccccccccccccCcceeecchHHHHHHhcccchhhhHHHHH Q lcl|NC_021557. 1 MAATFHHGPEVIEHKDGVTVVRDVKSAVTYVNGTAPIQDVHATALAREDYINKRVIIRSRAEGAAAFGVHKAGYTIPAAL 80 (419) Q Consensus 1 Ma~~~~hGVyv~e~~~~~~~i~~v~tav~~~Vgta~~a~~~~~~~~~~~~~n~pv~its~~e~~~~fg~~~~~~~l~~al 80 (419) |+++ +|||||+|++++++++..++|++++||||+++++...+++ |+|++++++.++...||.. ++|..++ T Consensus 1 m~~~-~~GV~v~e~~~g~~~v~~v~tav~~~vgta~~~~~~~~p~------~~pv~v~s~~~~~~~~g~~---~tl~~al 70 (395) T protein:vir:98 1 MSDF-HHGTQVIEINDGTRVISTVATAVVGMVCTASDADATLFPL------NEPVLITNVQSAIAKAGKK---GTLAASL 70 (395) T ss_pred CCCC-CCCeEEEEcCCCcccccccCcceEEEEeeccCCCcccccc------ccceEeechHHhHhhcccc---cchhhHH Confidence 9875 7999999999999999999999999999999998877765 8999999999999999874 6789999 Q ss_pred HHHhhccCCcEEEEeeccccccccccccccccccccceeccccccccccccchhhhhhhhhccccccccccchhhhhhhH Q lcl|NC_021557. 81 DAIFDQGDGGTIIVNNVFDPDVHKEGANPDPSKVTTVDINGTISPAGLASGFSGAYECYNNFGYFPKLIIAPGYSPAAAV 160 (419) Q Consensus 81 ~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~t~~~~~g~~~~~~~~tg~~a~~~~~~~~~~~p~~~~ap~~~~~~~v 160 (419) +.++++++..++++......... .....+.+...+++.....+.++|++++.++.+.++..|.++.+|++++. .+ T Consensus 71 ~~~~~~~~~~~~vv~~~~~~~~~----~~~~~a~~~~~i~g~~~~~~~~Tgl~al~~~~~~~~~~p~il~ap~~~~~-~v 145 (395) T protein:vir:98 71 QAIADQSKPVTVVVRVEDGTGDD----EEAALAQTVSNIIGGTDENGKYTGIKALLTAQAVTGVKPRILGVPGLDTK-EV 145 (395) T ss_pred HHHhhccCceEEEeecccccccc----ccccccccccccccccccccchhHHHHHhhhhhhhccchhhccccccccc-HH Confidence 99999999999887654332221 12233445556677777788999999999999999999999999999765 46 Q ss_pred HHHHHHHhhccceeEEEEeccCCCHHHHHhhhhhccccccCccceEEecceeEeeccccccceeeechHHHHHHHHHhhh Q lcl|NC_021557. 161 RAEMDVVASRLHALAIADLPLGLTKQQAVAARGVAGTANTSSARTVLTYPHVVIEDTTGATETRLDPLSSRLAGVIIATD 240 (419) Q Consensus 161 ~a~l~~~~~~~~~~~i~d~p~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~~~~~~~~~~~~p~s~~vAg~~a~~D 240 (419) .++|..+|++.++++++|+|.+.+.+++++||. +++|+|+++||||++++|+. ++..+++|||+++||+++++| T Consensus 146 ~~al~~~~~~~~~~~~~d~p~~~t~~~a~~~~~-----~~~s~~~~~~~p~~~~~d~~-~~~~~~~p~s~~~AG~~a~~d 219 (395) T protein:vir:98 146 AVALASAAIKLRAFAYVSAWGCKTISEAMEYRK-----NFSQRELMVIWPDFLAWDTV-KNTTATAYATARALGLRAYID 219 (395) T ss_pred HHHHHHHhhhcCcEEEEEcCCCCCHHHHHHHHh-----ccCCceEEEEecceeEeccc-CCceeeechHHHHHHHHHHhh Confidence 788888999999999999998889999999884 57899999999999999864 567889999999999999999 Q ss_pred hccCceecccCceeeceeecceecccccCCcchhhccccCCceEEEEEecCCcEEEEeccccCCCCCcccceeeehhhHH Q lcl|NC_021557. 241 LNEGWQNSPSNREIKGVVDLEVPINFYPSDYQNDTNFLNEAGIVTAMRSFATGIRVFGNRSAAFPTSSHVENFIHARRIL 320 (419) Q Consensus 241 ~~~g~~~span~~l~gv~~~~~~~~~~~~~~~~~~~~L~~~gI~~i~~~~~~G~~~wG~rT~~~~s~~~~~~~i~vrR~~ 320 (419) .++|||+||+|+.|+||+++++++++.+++.++|++.||++|||++++ ++|+++||+||++ +|+.|+||++|||+ T Consensus 220 ~~~g~~~spaN~~i~gi~~~~~~~~~~~~~~~~~~~~Ln~~gI~~~~~--~~G~~~wG~rT~s---~d~~~~~i~~rR~~ 294 (395) T protein:vir:98 220 QTVGWHKTLSNVGVQGVTGISASVFWDLQASGTDADLLNEAGVTTLVR--KDGFRFWGNRTCS---DDPLFLFENYTRTA 294 (395) T ss_pred cccCcEeccCCceeecccccceecccccCCCcchHHhhhhcCcEEEEc--CCCEEEEcccccC---CCcccceeehhhHH Confidence 999999999999999999999999999999999999999999999964 6899999999995 46789999999999 Q ss_pred HHHHHHHHHHHHHhhcCCCCHHHHHHHHHHHHHHHHHHHhhc-ccceEEEEecccCCHHHhhCCEEEEEEEEEeccCceE Q lcl|NC_021557. 321 DMIHEAIIFYTMNYVDRLGSPMTVEAAEEGVNAYLRSKTGIA-IYGGTFRFDRQKNTAEQIADGKFYYRLECHPISVMER 399 (419) Q Consensus 321 ~~i~~~~~~~~~~~v~e~n~~~~~~~i~~~i~~~L~~l~~~g-~~~~~v~~d~~~n~~~~i~~G~~~~~v~~~p~~p~e~ 399 (419) ++|+++|++.++|++||||++.+|++|+++++.||++||++| ++||+|+||+++||+++|++|+|+++|+++|++|+|| T Consensus 295 ~~i~~~i~~~~~~~v~e~~~~~~~~~i~~~i~~~L~~l~~~g~l~g~~v~~d~~~nt~~~i~~G~~~~~i~~~p~~p~e~ 374 (395) T protein:vir:98 295 QVLADTMAEAHMWAVDKPITATLIRDIVDGINAKFRELKSNGYIVEGKCWFDEESNDKETLKAGKLYIDYDYTPVPPLES 374 (395) T ss_pred HHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhCCceeceEEEEecCCCCHHHhhCCeEEEEEEEEecCCcce Confidence 999999999999999999999999999999999999999865 8899999999999999999999999999999999999 Q ss_pred EEEEEEEcchHHHHHHHhcC Q lcl|NC_021557. 400 ITIDSYVDTKFISNALSLAA 419 (419) Q Consensus 400 i~~~~~~~~~~~~~~~~~~a 419 (419) |+|+++++.+||+++|++++ T Consensus 375 I~~~~~~~~~~~~~~~~~~~ 394 (395) T protein:vir:98 375 LTLRQRITDKYLVNLAESVN 394 (395) T ss_pred EEEEEEEchHHHHHHHHHhc Confidence 99999999999999999999 No 8 >protein:vir:5711 Length: 396 # NCBI annotation: gpFI # Family: family:all:115 # MgeID: mge:120 # MgeName: L-413C # Cross-refs: genbank:acc:NP_839870;genbank:gi:30065725;genbank:GeneID:1260618 Probab=100.00 E-value=2.3e-105 Score=594.20 Aligned_cols=393 Identities=24% Similarity=0.356 Sum_probs=357.1 Q ss_pred CCCccCCCeEEEEcCCCccCccccCccceEEEEccccccccccccccccccCcceeecchHHHHHHhcccchhhhHHHHH Q lcl|NC_021557. 1 MAATFHHGPEVIEHKDGVTVVRDVKSAVTYVNGTAPIQDVHATALAREDYINKRVIIRSRAEGAAAFGVHKAGYTIPAAL 80 (419) Q Consensus 1 Ma~~~~hGVyv~e~~~~~~~i~~v~tav~~~Vgta~~a~~~~~~~~~~~~~n~pv~its~~e~~~~fg~~~~~~~l~~al 80 (419) |++ |+|||||+|++++++++..+++++++|||++++++...+++ |+|++++++.++...+|.. ++|..++ T Consensus 1 m~~-~~~GV~v~e~~~g~~~i~~v~tav~~~vg~a~~~d~~~~~~------~~pv~i~s~~~~~~~~g~~---~tl~~al 70 (396) T protein:vir:57 1 MSD-YHHGVQVLEINDGTRVISTVSTAIVGMVCTASDADAETFPL------NKPVLITNVQSAIAKAGKK---GTLAASL 70 (396) T ss_pred CCC-CCCceEEEEcCCCcccccccCCceEEEEEeccCCCcccccC------ccCeEeecchhhhhhcccc---cchHHHH Confidence 987 67999999999999999999999999999999998877764 8999999999999998874 6789999 Q ss_pred HHHhhccCCcEEEEeeccccccccccccccccccccceeccccccccccccchhhhhhhhhccccccccccchhhhhhhH Q lcl|NC_021557. 81 DAIFDQGDGGTIIVNNVFDPDVHKEGANPDPSKVTTVDINGTISPAGLASGFSGAYECYNNFGYFPKLIIAPGYSPAAAV 160 (419) Q Consensus 81 ~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~t~~~~~g~~~~~~~~tg~~a~~~~~~~~~~~p~~~~ap~~~~~~~v 160 (419) +.++++++..++++........ ........+..+++|....++..+|++++.++++.++..|.++.+|++... .+ T Consensus 71 ~~~~~~~~~~~~vv~~~~~~~~----~~~~~~a~t~~~iiG~~~~~~~~tgl~al~~~~~~~~~~p~i~~ap~~~~~-~v 145 (396) T protein:vir:57 71 QAIADQSKPVTVVVRVEDGTGD----DEETKLAQTVSNIIGTTDENGQYTGLKALMGAESVTGVKPRILGVPGLDTK-EV 145 (396) T ss_pred HHhhhcCCceeEeeeccccccc----cccccccccceeeeeeccccccchhhhhhhhcccceeEEeccccCcccchh-HH Confidence 9999999998888765433321 122334556677777777788999999999999999999999999998764 56 Q ss_pred HHHHHHHhhccceeEEEEeccCCCHHHHHhhhhhccccccCccceEEecceeEeeccccccceeeechHHHHHHHHHhhh Q lcl|NC_021557. 161 RAEMDVVASRLHALAIADLPLGLTKQQAVAARGVAGTANTSSARTVLTYPHVVIEDTTGATETRLDPLSSRLAGVIIATD 240 (419) Q Consensus 161 ~a~l~~~~~~~~~~~i~d~p~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~~~~~~~~~~~~p~s~~vAg~~a~~D 240 (419) .++|..+|++.++++++|+|.+.+.+++++||. +++|+|+++||||++++|+. ++..+++|||+++||++||+| T Consensus 146 ~~al~~~~~~~~~~~~~d~p~~~~~~~~~~~~~-----~~~s~~~~~~~p~~~~~d~~-~~~~~~~p~s~~~Ag~~a~~d 219 (396) T protein:vir:57 146 AVALASVCQELNAFGYISAWGCKTISEVKAYRQ-----NFSQRELMVIWPDFLAWDTV-TSTTATAYATARALGLRAKID 219 (396) T ss_pred HHHHHHHhhhCceEEEEcCCCCCCHHHHHHHHh-----ccCCceEEEEcceeeeeccc-CCceeEEehhHHHHHHHHHhh Confidence 777888888999999999999989999999984 57899999999999999864 566889999999999999999 Q ss_pred hccCceecccCceeeceeecceecccccCCcchhhccccCCceEEEEEecCCcEEEEeccccCCCCCcccceeeehhhHH Q lcl|NC_021557. 241 LNEGWQNSPSNREIKGVVDLEVPINFYPSDYQNDTNFLNEAGIVTAMRSFATGIRVFGNRSAAFPTSSHVENFIHARRIL 320 (419) Q Consensus 241 ~~~g~~~span~~l~gv~~~~~~~~~~~~~~~~~~~~L~~~gI~~i~~~~~~G~~~wG~rT~~~~s~~~~~~~i~vrR~~ 320 (419) .++|+|+||+|++|+||.++++.+++.+++.++|+++||++|||++++ ++||++||+||++ +|+.|+||++|||+ T Consensus 220 ~~~g~~~spaN~~l~gi~~~~~~~~~~~~~~~~~~~~Ln~~gi~t~~~--~~G~~~wG~rT~~---~d~~~~~i~vrR~~ 294 (396) T protein:vir:57 220 QEQGWHKTLSNVGVNGVTGISASVFWDLQKPGTDADLLNEAGVTTLVR--RDGFRFWGNRTCS---DDPLFLFESYTRTA 294 (396) T ss_pred hccCcEeccCCceeccccccceecccccCCcchhhhhhhhcCcEEEEc--CCCEEEEcccccC---CCcccceeehhhHH Confidence 999999999999999999999999999999999999999999999975 5899999999995 46789999999999 Q ss_pred HHHHHHHHHHHHHhhcCCCCHHHHHHHHHHHHHHHHHHHhhc-ccceEEEEecccCCHHHhhCCEEEEEEEEEeccCceE Q lcl|NC_021557. 321 DMIHEAIIFYTMNYVDRLGSPMTVEAAEEGVNAYLRSKTGIA-IYGGTFRFDRQKNTAEQIADGKFYYRLECHPISVMER 399 (419) Q Consensus 321 ~~i~~~~~~~~~~~v~e~n~~~~~~~i~~~i~~~L~~l~~~g-~~~~~v~~d~~~n~~~~i~~G~~~~~v~~~p~~p~e~ 399 (419) ++|+++|+++++|++||||++.+|++|+++++.||++||++| ++||+|+||+++||+++|++|+|+++|+++|++|+|| T Consensus 295 ~~i~~~i~~~~~~~v~e~n~~~~~~~i~~~i~~~l~~l~~~gal~g~~v~~d~~~n~~~~i~~G~~~~~v~~~p~~p~e~ 374 (396) T protein:vir:57 295 QVLADTMAEAHMWAIDKPITATLIRDIIDGINAKFRELKNNGYIVDGTCWFSEESNDAETLKAGKLYIDYDYTPVPPLEN 374 (396) T ss_pred HHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhCCceeceEEEEecCCCCHHHhhCCeEEEEEEEEecCCcce Confidence 999999999999999999999999999999999999999865 7899999999999999999999999999999999999 Q ss_pred EEEEEEEcchHHHHHHHhcC Q lcl|NC_021557. 400 ITIDSYVDTKFISNALSLAA 419 (419) Q Consensus 400 i~~~~~~~~~~~~~~~~~~a 419 (419) |+|+++++.+||+++|++++ T Consensus 375 I~~~~~~~~~~~~~~~~~~~ 394 (396) T protein:vir:57 375 LTLRQRITSRYLASLVTSVN 394 (396) T ss_pred EEEEEEEchHHHHHHHHHhh Confidence 99999999999999999999 No 9 >protein:vir:2035 Length: 396 # NCBI annotation: gpFI # Family: family:all:115 # MgeID: mge:315 # MgeName: P2 # Cross-refs: genbank:acc:NP_046778;genbank:gi:9630349;genbank:GeneID:1261516 Probab=100.00 E-value=2e-105 Score=594.61 Aligned_cols=393 Identities=24% Similarity=0.351 Sum_probs=355.7 Q ss_pred CCCccCCCeEEEEcCCCccCccccCccceEEEEccccccccccccccccccCcceeecchHHHHHHhcccchhhhHHHHH Q lcl|NC_021557. 1 MAATFHHGPEVIEHKDGVTVVRDVKSAVTYVNGTAPIQDVHATALAREDYINKRVIIRSRAEGAAAFGVHKAGYTIPAAL 80 (419) Q Consensus 1 Ma~~~~hGVyv~e~~~~~~~i~~v~tav~~~Vgta~~a~~~~~~~~~~~~~n~pv~its~~e~~~~fg~~~~~~~l~~al 80 (419) |+++ +|||||+|+++++++++.++|++++|||++++++...+++ |+|++++++.++...||.. ..|..++ T Consensus 1 m~~~-~~GV~v~e~~~g~~~i~~v~tav~~~vg~a~~a~~~~~~l------~~pvlvts~~~~~~~~g~~---~tL~~al 70 (396) T protein:vir:20 1 MSDY-HHGVQVLEINEGTRVISTVSTAIVGMVCTASDADAETFPL------NKPVLITNVQSAISKAGKK---GTLAASL 70 (396) T ss_pred CCCC-CCCeEEEEcCCCcceeeecCCceeEEEeeeccCCCccccC------ccCEEeechHHHHhhcccc---cchhhhh Confidence 9984 6999999999999999999999999999999998776654 8999999999999999864 7788999 Q ss_pred HHHhhccCCcEEEEeeccccccccccccccccccccceeccccccccccccchhhhhhhhhccccccccccchhhhhhhH Q lcl|NC_021557. 81 DAIFDQGDGGTIIVNNVFDPDVHKEGANPDPSKVTTVDINGTISPAGLASGFSGAYECYNNFGYFPKLIIAPGYSPAAAV 160 (419) Q Consensus 81 ~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~t~~~~~g~~~~~~~~tg~~a~~~~~~~~~~~p~~~~ap~~~~~~~v 160 (419) +.+|++++..++++....... .........+...+++....++..++++++.++++.....|.+..+|++... .| T Consensus 71 ~~~~~ngg~~~~v~~~~~~~~----~~~~~~~a~t~~~~~~~~~~~~~~tg~~al~~~~~~~~~~p~i~~ap~~~~~-~v 145 (396) T protein:vir:20 71 QAIADQSKPVTVVMRVEDGTG----DDEETKLAQTVSNIIGTTDENGQYTGLKAMLAAESVTGVKPRILGVPGLDTK-EV 145 (396) T ss_pred hhhhccCceeEEEEecccccc----ccccccccccccccccccccccccchhhhhhhhccccccchhhhhhhhhccH-HH Confidence 999999999888776543322 1223344456666777777788899999999999999999999999999765 46 Q ss_pred HHHHHHHhhccceeEEEEeccCCCHHHHHhhhhhccccccCccceEEecceeEeeccccccceeeechHHHHHHHHHhhh Q lcl|NC_021557. 161 RAEMDVVASRLHALAIADLPLGLTKQQAVAARGVAGTANTSSARTVLTYPHVVIEDTTGATETRLDPLSSRLAGVIIATD 240 (419) Q Consensus 161 ~a~l~~~~~~~~~~~i~d~p~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~~~~~~~~~~~~p~s~~vAg~~a~~D 240 (419) ..+|..+|++.++++++|+|.+.+.++++++|. +++|+|+++||||++++|+. ++..+++|||+++||++||+| T Consensus 146 ~~al~~~~~~~~~~~~iD~p~~~~~~~a~~~r~-----~~~s~~~~~~~P~~~~~d~~-~~~~~~~p~s~~~Ag~~a~~d 219 (396) T protein:vir:20 146 AVALASVCQKLRAFGYISAWGCKTISEVKAYRQ-----NFSQRELMVIWPDFLAWDTV-TSTTATAYATARALGLRAKID 219 (396) T ss_pred HHHHHHHHhcCCcEEEEecCCCCCHHHHHHHhh-----CCCCceEEEEcCccccccCc-CCcceeechhHHHHHHHHHhh Confidence 677777888999999999999889999999884 57899999999999999864 567899999999999999999 Q ss_pred hccCceecccCceeeceeecceecccccCCcchhhccccCCceEEEEEecCCcEEEEeccccCCCCCcccceeeehhhHH Q lcl|NC_021557. 241 LNEGWQNSPSNREIKGVVDLEVPINFYPSDYQNDTNFLNEAGIVTAMRSFATGIRVFGNRSAAFPTSSHVENFIHARRIL 320 (419) Q Consensus 241 ~~~g~~~span~~l~gv~~~~~~~~~~~~~~~~~~~~L~~~gI~~i~~~~~~G~~~wG~rT~~~~s~~~~~~~i~vrR~~ 320 (419) .++|+|+||+|++|+||.++++++.+.+++.++|++.||++|||++++ ++||++||+||++ .|+.|+||++|||+ T Consensus 220 ~~~g~~~spaN~~l~gi~~~~~~~~~~~~~~~~~~~~Ln~~gi~~~~~--~~G~~~wG~rT~s---~d~~~~~i~~rR~~ 294 (396) T protein:vir:20 220 QEQGWHKTLSNVGVNGVTGISASVFWDLQESGTDADLLNESGVTTLIR--RDGFRFWGNRTCS---DDPLFLFENYTRTA 294 (396) T ss_pred hhcCcEeccCCceeccceecceecccccCCCcchhhhhhhcCcEEEEc--CCCEEEEcccccC---CCcccceeehhhHH Confidence 999999999999999999999999999999999999999999999964 6899999999995 46789999999999 Q ss_pred HHHHHHHHHHHHHhhcCCCCHHHHHHHHHHHHHHHHHHHhhc-ccceEEEEecccCCHHHhhCCEEEEEEEEEeccCceE Q lcl|NC_021557. 321 DMIHEAIIFYTMNYVDRLGSPMTVEAAEEGVNAYLRSKTGIA-IYGGTFRFDRQKNTAEQIADGKFYYRLECHPISVMER 399 (419) Q Consensus 321 ~~i~~~~~~~~~~~v~e~n~~~~~~~i~~~i~~~L~~l~~~g-~~~~~v~~d~~~n~~~~i~~G~~~~~v~~~p~~p~e~ 399 (419) +||+++|++.++|++||||++.+|++|+++++.||++||++| ++||+|+||+++||+++|++|+|+++|+++|++|+|| T Consensus 295 ~~i~~~~~~~~~~~v~e~~~~~~~~~i~~~i~~~L~~l~~~G~l~g~~v~~d~~~nt~~~i~~G~~~~~i~~~p~~p~e~ 374 (396) T protein:vir:20 295 QVVADTMAEAHMWAVDKPITATLIRDIVDGINAKFRELKTNGYIVDATCWFSEESNDAETLKAGKLYIDYDYTPVPPLEN 374 (396) T ss_pred HHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhCcceeceEEEEecCCCCHHHhhCCEEEEEEEEEecCCcce Confidence 999999999999999999999999999999999999999865 8899999999999999999999999999999999999 Q ss_pred EEEEEEEcchHHHHHHHhcC Q lcl|NC_021557. 400 ITIDSYVDTKFISNALSLAA 419 (419) Q Consensus 400 i~~~~~~~~~~~~~~~~~~a 419 (419) |+|+++++++||+++|++++ T Consensus 375 i~~~~~~~~~~~~~~~~~~~ 394 (396) T protein:vir:20 375 LTLRQRITDKYLANLVTSVN 394 (396) T ss_pred EEEEEEEchHHHHHHHHHhh Confidence 99999999999999999999 No 10 >protein:vir:1172 Length: 391 # NCBI annotation: hypothetical protein # Family: family:all:115 # MgeID: mge:24 # MgeName: phi CTX # Cross-refs: genbank:acc:NP_490621;genbank:gi:17313241;genbank:GeneID:927300 Probab=100.00 E-value=1.4e-105 Score=595.45 Aligned_cols=388 Identities=24% Similarity=0.367 Sum_probs=356.3 Q ss_pred CCCccCCCeEEEEcCCCccCccccCccceEEEEccccccccccccccccccCcceeecchHHHHHHhcccchhhhHHHHH Q lcl|NC_021557. 1 MAATFHHGPEVIEHKDGVTVVRDVKSAVTYVNGTAPIQDVHATALAREDYINKRVIIRSRAEGAAAFGVHKAGYTIPAAL 80 (419) Q Consensus 1 Ma~~~~hGVyv~e~~~~~~~i~~v~tav~~~Vgta~~a~~~~~~~~~~~~~n~pv~its~~e~~~~fg~~~~~~~l~~al 80 (419) |+++++|||||+|++++++++..+++++++|||++++++...+++ |+|++++++.++...||. .+.+.+++ T Consensus 2 ~~~~~~~GV~v~e~~~~~~~i~~v~tavig~vg~a~~a~~~~~~~------~~p~~v~s~~~~~~~~g~---~~tl~~al 72 (391) T protein:vir:11 2 AADQYHHGVRVQEINDGTRPIRTIATAIIGLVATAEDADATAFPL------DTPVLITNVQAAIGKAGT---SGTLPASL 72 (391) T ss_pred CCCcCCCcEEEEECCCCcceecccCCceeEEEEecCCCCCccccc------cccEEEecchhhheecCC---Cccchhhh Confidence 666889999999999999999999999999999999998877654 899999999999988886 47788999 Q ss_pred HHHhhccCCcEEEEeeccccccccccccccccccccceeccccccccccccchhhhhhhhhccccccccccchhhhhhhH Q lcl|NC_021557. 81 DAIFDQGDGGTIIVNNVFDPDVHKEGANPDPSKVTTVDINGTISPAGLASGFSGAYECYNNFGYFPKLIIAPGYSPAAAV 160 (419) Q Consensus 81 ~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~t~~~~~g~~~~~~~~tg~~a~~~~~~~~~~~p~~~~ap~~~~~~~v 160 (419) +.+|++++..++++... +.+....+..++++..+..+..++++++.++++.+...|.++.+|++++.+ + T Consensus 73 ~~~~~~~g~~~~vv~~~----------~~~~~~~t~~d~~g~~~a~~~~~g~~a~~~~~~~~~~~p~~~~ap~~~~~~-v 141 (391) T protein:vir:11 73 QAIADQANAATVVVRVK----------PGEDEAATNSAVIGGVSADGKYTGMKALLAAKARLGVVPRILGVPGLDTQP-V 141 (391) T ss_pred hhhhccccceeEEeeec----------ccccccccchhhhcccccccchhhhhhhhhhhhhheeccccccccccccHH-H Confidence 99999999998887653 234455667778888888889999999999999999999999999998765 6 Q ss_pred HHHHHHHhhccceeEEEEeccCCCHHHHHhhhhhccccccCccceEEecceeEeeccccccceeeechHHHHHHHHHhhh Q lcl|NC_021557. 161 RAEMDVVASRLHALAIADLPLGLTKQQAVAARGVAGTANTSSARTVLTYPHVVIEDTTGATETRLDPLSSRLAGVIIATD 240 (419) Q Consensus 161 ~a~l~~~~~~~~~~~i~d~p~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~~~~~~~~~~~~p~s~~vAg~~a~~D 240 (419) .++|..+|++.++++++|+|.+.+.++++.+|. +++|+|+++||||++++|+. ++..+++|||+++||+++|+| T Consensus 142 ~~al~~~~~~~~~~~i~D~p~~~t~~~a~~~r~-----~~~s~~~~~~~p~~~~~~~~-~~~~~~~p~s~~~ag~~a~~d 215 (391) T protein:vir:11 142 ATALIAIAQQLRAFAYVSASGCKTKEEATAYRE-----NFAAREAMVIWPDFLTWSTV-VNQTVPAPAVAQALGLRARID 215 (391) T ss_pred HHHHHHhhcccceEEEEEcCCCCCHHHHHHHhh-----hcCCceEEEEcCcceecccc-cCceEEechHHHHHHHHHHhh Confidence 667777888889999999999889999999884 58899999999999999854 567899999999999999999 Q ss_pred hccCceecccCceeeceeecceecccccCCcchhhccccCCceEEEEEecCCcEEEEeccccCCCCCcccceeeehhhHH Q lcl|NC_021557. 241 LNEGWQNSPSNREIKGVVDLEVPINFYPSDYQNDTNFLNEAGIVTAMRSFATGIRVFGNRSAAFPTSSHVENFIHARRIL 320 (419) Q Consensus 241 ~~~g~~~span~~l~gv~~~~~~~~~~~~~~~~~~~~L~~~gI~~i~~~~~~G~~~wG~rT~~~~s~~~~~~~i~vrR~~ 320 (419) .++|||+||||+.|+||.+++.++++.+++.+.|+++||++||++++ +++||++||+||++ .|+.|+||++|||+ T Consensus 216 ~~~g~~~span~~l~gi~~~~~~~~~~~~~~~~~~~~Ln~~gi~~~~--~~~G~~~wG~rT~~---~d~~~~~i~vrR~~ 290 (391) T protein:vir:11 216 QEVGWHKTLSNVAVNGVTGISADVFWDLQSPSTDANYLNENEVTTLV--QEGGFRFWGSRTCS---DDPLFAFENYTRTA 290 (391) T ss_pred ccCCcEEccCCceeeceeecccccccccCCCcchhhhhhhcCcEEEE--cCCCEEEEcccccC---CCcccceeehhhHH Confidence 99999999999999999999999999999999999999999999986 46899999999995 46789999999999 Q ss_pred HHHHHHHHHHHHHhhcCCCCHHHHHHHHHHHHHHHHHHHhhc-ccceEEEEecccCCHHHhhCCEEEEEEEEEeccCceE Q lcl|NC_021557. 321 DMIHEAIIFYTMNYVDRLGSPMTVEAAEEGVNAYLRSKTGIA-IYGGTFRFDRQKNTAEQIADGKFYYRLECHPISVMER 399 (419) Q Consensus 321 ~~i~~~~~~~~~~~v~e~n~~~~~~~i~~~i~~~L~~l~~~g-~~~~~v~~d~~~n~~~~i~~G~~~~~v~~~p~~p~e~ 399 (419) ++|+++|++.++|++||||++.+|++|+++++.||++||++| ++||+++||+++||+++|++|+|+++|+++|++|+|| T Consensus 291 ~~i~~~~~~~~~~~v~e~n~~~~~~~i~~~i~~~l~~l~~~g~l~g~~~~~~~~~n~~~~i~~G~~~~~i~~~p~~p~e~ 370 (391) T protein:vir:11 291 QVLADTIAEAHMWAVDKPMHPSLVRDILEGVNAKFRELKGLGLIIDAQAWYDPNVNDKDTLKAGKLRITYDYTPVPPLED 370 (391) T ss_pred HHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhccceeceEEEEecCCCCHHHhhCCeEEEEEEEEecCCcce Confidence 999999999999999999999999999999999999999865 7899999999999999999999999999999999999 Q ss_pred EEEEEEEcchHHHHHHHhcC Q lcl|NC_021557. 400 ITIDSYVDTKFISNALSLAA 419 (419) Q Consensus 400 i~~~~~~~~~~~~~~~~~~a 419 (419) |+++++++++||+++++.++ T Consensus 371 i~~~~~~~~~~~~~~~~~~~ 390 (391) T protein:vir:11 371 LTFFQKITDSYLVDFASRVN 390 (391) T ss_pred EEEEEEEchHHHHHHHHHhc Confidence 99999999999999999999 No 11 >protein:vir:6079 Length: 396 # NCBI annotation: gpFI # Family: family:all:115 # MgeID: mge:126 # MgeName: WPhi # Cross-refs: genbank:acc:NP_878220;genbank:gi:33438919;genbank:GeneID:1457754 Probab=100.00 E-value=6.2e-105 Score=591.87 Aligned_cols=393 Identities=25% Similarity=0.352 Sum_probs=357.4 Q ss_pred CCCccCCCeEEEEcCCCccCccccCccceEEEEccccccccccccccccccCcceeecchHHHHHHhcccchhhhHHHHH Q lcl|NC_021557. 1 MAATFHHGPEVIEHKDGVTVVRDVKSAVTYVNGTAPIQDVHATALAREDYINKRVIIRSRAEGAAAFGVHKAGYTIPAAL 80 (419) Q Consensus 1 Ma~~~~hGVyv~e~~~~~~~i~~v~tav~~~Vgta~~a~~~~~~~~~~~~~n~pv~its~~e~~~~fg~~~~~~~l~~al 80 (419) |+++ +|||||+|++++++++..++|++|+|||++++++...++. ++|++++++.++...||.. ++|.+++ T Consensus 1 m~~~-~~Gv~v~e~~~~~~~v~~~~tav~~fvGta~~~~~~~~p~------~~p~~v~s~~~~~~~~g~~---~tl~~a~ 70 (396) T protein:vir:60 1 MSDY-HHGVQVLEINEGTRVISTVSTAIVGMVCTASDADAEIFPL------NKPVLITNVQSAIAKAGKK---GTLAASL 70 (396) T ss_pred CCCC-CCCeEEEEcCCCcccccccCceeEEEEecccccccccccC------ccCeEeechHHHHHhhcCc---chhHHHH Confidence 9985 6999999999999999999999999999999998877764 8999999999999999864 6899999 Q ss_pred HHHhhccCCcEEEEeeccccccccccccccccccccceeccccccccccccchhhhhhhhhccccccccccchhhhhhhH Q lcl|NC_021557. 81 DAIFDQGDGGTIIVNNVFDPDVHKEGANPDPSKVTTVDINGTISPAGLASGFSGAYECYNNFGYFPKLIIAPGYSPAAAV 160 (419) Q Consensus 81 ~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~t~~~~~g~~~~~~~~tg~~a~~~~~~~~~~~p~~~~ap~~~~~~~v 160 (419) +.+|++++..++++....... .........+...+++..+.++..+|++++.+.++.+...|.++.+|++.. ..| T Consensus 71 ~~~~~~gg~~~~vv~~~~~~~----~~~~~~~~~~~~~~~~~~d~~~~~tg~~al~~~~~~~~~~~~il~ap~~~~-~~v 145 (396) T protein:vir:60 71 QAIADQSKPVTVVVRVEDGTG----EDEETKLAQTVSNIIGTTDENGQYTGLKALLAAESVTGVKPRILGVPGLDT-KEV 145 (396) T ss_pred HHHhhccCceEEEEecccccc----cccccccccccccccccccccccccchhhhhhcccceeeeeeecccccccc-HHH Confidence 999999999999887654332 122333345556667777778889999999999999999999999999865 567 Q ss_pred HHHHHHHhhccceeEEEEeccCCCHHHHHhhhhhccccccCccceEEecceeEeeccccccceeeechHHHHHHHHHhhh Q lcl|NC_021557. 161 RAEMDVVASRLHALAIADLPLGLTKQQAVAARGVAGTANTSSARTVLTYPHVVIEDTTGATETRLDPLSSRLAGVIIATD 240 (419) Q Consensus 161 ~a~l~~~~~~~~~~~i~d~p~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~~~~~~~~~~~~p~s~~vAg~~a~~D 240 (419) +.+|..+|++.++++++|+|.+.+.++++++|. +++|+|+++||||++++|+. ++..+++|||+++||++|++| T Consensus 146 ~~al~~~~~~~~~~~i~d~p~~~~~~~a~~~~~-----~~~s~~~~~~~p~~~~~d~~-~~~~~~~p~s~~~AG~~a~~d 219 (396) T protein:vir:60 146 AVALASVCQKLRAFGYISAWGCKTISEVKAYRQ-----NFSQRELMVIWPDFLAWDTV-ASTTATAYATARALGLRAKID 219 (396) T ss_pred HHHHHHHhccCCeEEEEeCCCCCCHHHHHHHHh-----hcCCceEEEEeCceeeeccc-CCceeEEchhHHHHHHHHHhh Confidence 778888889999999999999989999999884 57899999999999999864 667889999999999999999 Q ss_pred hccCceecccCceeeceeecceecccccCCcchhhccccCCceEEEEEecCCcEEEEeccccCCCCCcccceeeehhhHH Q lcl|NC_021557. 241 LNEGWQNSPSNREIKGVVDLEVPINFYPSDYQNDTNFLNEAGIVTAMRSFATGIRVFGNRSAAFPTSSHVENFIHARRIL 320 (419) Q Consensus 241 ~~~g~~~span~~l~gv~~~~~~~~~~~~~~~~~~~~L~~~gI~~i~~~~~~G~~~wG~rT~~~~s~~~~~~~i~vrR~~ 320 (419) .++|+|+||+|++|+||.++++++++.+++.++|+++||++|||++++ ++|+++||+||++ .|+.|+||++|||+ T Consensus 220 ~~~g~~~spaN~~l~gi~~~~~~~~~~~~~~~~~~~~Ln~~gI~~~~~--~~G~~~wG~rT~~---~d~~~~~i~~rR~~ 294 (396) T protein:vir:60 220 QEQGWHKTLSNVGVNGVTGISASVFWDLQESGTDADLLNESGVTTLIR--RDGFRFWGNRTCS---DDPLFLFENYTRTA 294 (396) T ss_pred hccCcEeCcCCceecceeeceeecccccCCCcchhhhhhhcCcEEEEc--CCCEEEEcccccC---CCcccceeehhhHH Confidence 999999999999999999999999999999999999999999999964 6899999999995 46789999999999 Q ss_pred HHHHHHHHHHHHHhhcCCCCHHHHHHHHHHHHHHHHHHHhhc-ccceEEEEecccCCHHHhhCCEEEEEEEEEeccCceE Q lcl|NC_021557. 321 DMIHEAIIFYTMNYVDRLGSPMTVEAAEEGVNAYLRSKTGIA-IYGGTFRFDRQKNTAEQIADGKFYYRLECHPISVMER 399 (419) Q Consensus 321 ~~i~~~~~~~~~~~v~e~n~~~~~~~i~~~i~~~L~~l~~~g-~~~~~v~~d~~~n~~~~i~~G~~~~~v~~~p~~p~e~ 399 (419) +||+++|++.++|++||||++.+|++|+++++.||++||++| ++||+++||+++||+++|++|+|+++|+++|++|+|| T Consensus 295 ~~i~~~i~~~~~~~v~e~n~~~~~~~i~~~i~~~l~~l~~~gal~g~~~~~d~~~nt~~~i~~G~~~~~i~~~p~~pae~ 374 (396) T protein:vir:60 295 QVLADTMAEAHMWAVDKPITATLIRDIVDGINAKFRELKTNGYIVDATCWFSEESNDAETLKAGKLYIDYDYTPVPPLEN 374 (396) T ss_pred HHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhCCceeceEEEEecCCCCHHHhhCCEEEEEEEEEecCCcce Confidence 999999999999999999999999999999999999999865 7899999999999999999999999999999999999 Q ss_pred EEEEEEEcchHHHHHHHhcC Q lcl|NC_021557. 400 ITIDSYVDTKFISNALSLAA 419 (419) Q Consensus 400 i~~~~~~~~~~~~~~~~~~a 419 (419) |+|+++++++||+++|+++. T Consensus 375 I~~~~~~~~~~~~~~~~~~~ 394 (396) T protein:vir:60 375 LTLRQRITDKYLANLVTSVN 394 (396) T ss_pred EEEEEEEchHHHHHHHHHhh Confidence 99999999999999999999 No 12 >protein:vir:1845 Length: 392 # NCBI annotation: tail sheath protein FI # Family: family:all:115 # MgeID: mge:324 # MgeName: 186 # Cross-refs: genbank:acc:NP_052269;genbank:gi:9634076;genbank:GeneID:1262448 Probab=100.00 E-value=5.8e-105 Score=592.05 Aligned_cols=390 Identities=26% Similarity=0.358 Sum_probs=357.1 Q ss_pred CCCccCCCeEEEEcCCCccCccccCccceEEEEccccccccccccccccccCcceeecchHHHHHHhcccchhhhHHHHH Q lcl|NC_021557. 1 MAATFHHGPEVIEHKDGVTVVRDVKSAVTYVNGTAPIQDVHATALAREDYINKRVIIRSRAEGAAAFGVHKAGYTIPAAL 80 (419) Q Consensus 1 Ma~~~~hGVyv~e~~~~~~~i~~v~tav~~~Vgta~~a~~~~~~~~~~~~~n~pv~its~~e~~~~fg~~~~~~~l~~al 80 (419) |+++ +|||||+|++++++++..++|++++||||+++++...+++ |+|++++++.++...||.. +.+..++ T Consensus 1 m~~~-~~Gv~v~e~~~g~~~i~~~~tav~g~vgta~~~~~~~~~~------~~p~~its~~~~~~~~g~~---gtl~~al 70 (392) T protein:vir:18 1 MSDF-HHGTKVIEINDGTRVISTVATAIVGMVWTASDADAETFPL------NEPVLITNVQSAIAKAGKK---GTLSASL 70 (392) T ss_pred CCCC-CCCeEEEEcCCCceeeeccCcceeEEEEeccCCCCccccc------ccceEeechHHHHhhcCCC---cchHHHH Confidence 9985 7999999999999999999999999999999988776654 8999999999999999873 6788999 Q ss_pred HHHhhccCCcEEEEeeccccccccccccccccccccceeccccccccccccchhhhhhhhhccccccccccchhhhhhhH Q lcl|NC_021557. 81 DAIFDQGDGGTIIVNNVFDPDVHKEGANPDPSKVTTVDINGTISPAGLASGFSGAYECYNNFGYFPKLIIAPGYSPAAAV 160 (419) Q Consensus 81 ~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~t~~~~~g~~~~~~~~tg~~a~~~~~~~~~~~p~~~~ap~~~~~~~v 160 (419) +.+|++++..++++..... ...+....+..+++|....++..++++++.++++.....|+++.+|++++. .+ T Consensus 71 ~~~~~ngg~~~~vv~v~~~-------~~~~~~~~t~~dliG~~~~~~~~tg~~al~~~~~~~~~~p~il~ap~~~~~-~v 142 (392) T protein:vir:18 71 QAIADQSKPVTVVVRVAEG-------TGDDAEAQTTSNIIGGTDENGKYTGIKALLTAEAVTGVKPRILGVPGLDTQ-EV 142 (392) T ss_pred HHhhcccCceEEEeccccc-------ccccccccchhhheecccccchhhhHHHHHhhhhhhceeehhcccCccchH-HH Confidence 9999999988887754322 233455677777888777888999999999999999999999999999864 57 Q ss_pred HHHHHHHhhccceeEEEEeccCCCHHHHHhhhhhccccccCccceEEecceeEeeccccccceeeechHHHHHHHHHhhh Q lcl|NC_021557. 161 RAEMDVVASRLHALAIADLPLGLTKQQAVAARGVAGTANTSSARTVLTYPHVVIEDTTGATETRLDPLSSRLAGVIIATD 240 (419) Q Consensus 161 ~a~l~~~~~~~~~~~i~d~p~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~~~~~~~~~~~~p~s~~vAg~~a~~D 240 (419) .++|..+|++.++++++|+|.+.+.+++.+||. +++|+|+++||||++++|+. ++..+++|||+++||+++++| T Consensus 143 ~~~l~~~~~~~~~~~~~d~~~~~~~~~a~~~~~-----~~~s~~~~~~~p~~~~~d~~-~~~~~~~p~s~~~AG~~a~~d 216 (392) T protein:vir:18 143 ATALASVCISLRAFGYVSAWGCKTISEAMAYRE-----NFSQRELMVIWPDFLAWDTT-ANATATAYATARALGLRAYID 216 (392) T ss_pred HHHHHHHHhhcCcEEEEecCCCCCHHHHHHHHh-----hccCceEEEEeCceeeeccc-CCceEEechHHHHHHHHHhhh Confidence 788888889999999999999999999999884 57899999999999999865 566789999999999999999 Q ss_pred hccCceecccCceeeceeecceecccccCCcchhhccccCCceEEEEEecCCcEEEEeccccCCCCCcccceeeehhhHH Q lcl|NC_021557. 241 LNEGWQNSPSNREIKGVVDLEVPINFYPSDYQNDTNFLNEAGIVTAMRSFATGIRVFGNRSAAFPTSSHVENFIHARRIL 320 (419) Q Consensus 241 ~~~g~~~span~~l~gv~~~~~~~~~~~~~~~~~~~~L~~~gI~~i~~~~~~G~~~wG~rT~~~~s~~~~~~~i~vrR~~ 320 (419) .++|||+||+|++|+||.++++++++.+++.++|++.||++|||++++ ++|+++||+||++ .|+.||||++|||+ T Consensus 217 ~~~g~~~spaN~~l~gi~~~~~~~~~~~~~~~~~~~~Ln~~gI~t~~~--~~G~~~wG~rT~~---~d~~~~~i~~rR~~ 291 (392) T protein:vir:18 217 QTIGWHKTLSNVGVQGVTGISASVFWDLQASGTDADLLNEAGVTTLVR--KDGFRFWGNRTCS---DDPLFLFENYTRTA 291 (392) T ss_pred ccCCceEccCCceeeceeecceecccccCCCcchhhhhhhcCceEEEc--CCCEEEEcccccC---CCcccceeehhhHH Confidence 999999999999999999999999999999999999999999999964 6899999999995 46789999999999 Q ss_pred HHHHHHHHHHHHHhhcCCCCHHHHHHHHHHHHHHHHHHHhhc-ccceEEEEecccCCHHHhhCCEEEEEEEEEeccCceE Q lcl|NC_021557. 321 DMIHEAIIFYTMNYVDRLGSPMTVEAAEEGVNAYLRSKTGIA-IYGGTFRFDRQKNTAEQIADGKFYYRLECHPISVMER 399 (419) Q Consensus 321 ~~i~~~~~~~~~~~v~e~n~~~~~~~i~~~i~~~L~~l~~~g-~~~~~v~~d~~~n~~~~i~~G~~~~~v~~~p~~p~e~ 399 (419) ++|+++|++.++|++||||++.+|++|++++++||++||++| ++||+|+||+++||++++++|+|+++|+++|++|+|| T Consensus 292 ~~i~~~i~~~~~~~v~e~n~~~~~~~i~~~i~~~L~~l~~~gal~g~~v~~d~~~nt~~~i~~G~~~~~v~~~p~~p~e~ 371 (392) T protein:vir:18 292 QVLADTMAEAHMWAVDKPITASLIRDIVDGINAKFRELKSNGYIVDGECWFDEESNDKETLKAGKLYIDYDYTPVPPLES 371 (392) T ss_pred HHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhcCcccceEEEEecCCCCHHHhhCCeEEEEEEEEecCCcce Confidence 999999999999999999999999999999999999999865 7899999999999999999999999999999999999 Q ss_pred EEEEEEEcchHHHHHHHhcC Q lcl|NC_021557. 400 ITIDSYVDTKFISNALSLAA 419 (419) Q Consensus 400 i~~~~~~~~~~~~~~~~~~a 419 (419) |+|+++++++||+++|++++ T Consensus 372 I~~~~~~~~~~~~~~~~~~~ 391 (392) T protein:vir:18 372 LTLRQRITDKYLVNLAESVN 391 (392) T ss_pred EEEEEEEchHHHHHHHHHhc Confidence 99999999999999999999 No 13 >protein:vir:100323 Length: 393 # NCBI annotation: tail sheath protein FI # Family: family:all:115 # MgeID: mge:1484 # MgeName: phi-MhaA1-PHL101 # Cross-refs: genbank:acc:YP_655492;genbank:gi:109289960;genbank:GeneID:4157366 Probab=100.00 E-value=1.1e-104 Score=590.56 Aligned_cols=386 Identities=25% Similarity=0.304 Sum_probs=343.9 Q ss_pred CCCccCCCeEEEEcCCCccCccccCccceEEEEccccccccccccccccccCcceeecchHHHHHHhcccchhhhHHHHH Q lcl|NC_021557. 1 MAATFHHGPEVIEHKDGVTVVRDVKSAVTYVNGTAPIQDVHATALAREDYINKRVIIRSRAEGAAAFGVHKAGYTIPAAL 80 (419) Q Consensus 1 Ma~~~~hGVyv~e~~~~~~~i~~v~tav~~~Vgta~~a~~~~~~~~~~~~~n~pv~its~~e~~~~fg~~~~~~~l~~al 80 (419) |+++|+|||||+|.+++++++..++|++|+||||+++++...+++ |+|++++++.++...||. .+.|.+++ T Consensus 3 m~~~~~~GV~v~e~~~g~~~i~~~~tav~~~vgta~~~~~~~~pl------n~pv~i~s~~~~~~~~g~---~g~L~~al 73 (393) T protein:vir:10 3 ILDTYLHGVEVVEVNAGGVTISTAATSVIGVVCTGDQADAETFPL------NTPVLITNPLNYLEKAGS---TGTLRRTL 73 (393) T ss_pred CCCccCCCeEEEEcCCCcceecccCcceeEEEeeccCcCcccccC------ccceEecchHHHHHhhCC---ccchhhhh Confidence 889999999999999999999999999999999999998877665 899999999999999986 47889999 Q ss_pred HHHhhccCCcEEEEeeccccccccccccccccccccceeccccccccccccchhhhhhhhhccccccccccchhhhhhhH Q lcl|NC_021557. 81 DAIFDQGDGGTIIVNNVFDPDVHKEGANPDPSKVTTVDINGTISPAGLASGFSGAYECYNNFGYFPKLIIAPGYSPAAAV 160 (419) Q Consensus 81 ~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~t~~~~~g~~~~~~~~tg~~a~~~~~~~~~~~p~~~~ap~~~~~~~v 160 (419) +.++++++..++++.... .+....+..+++|. ..++..+|++++.++++.++..|+++.+|++++...+ T Consensus 74 ~~~~~~~~~~~~vv~v~~----------~~~~~~t~~~iig~-~~~~~~tgl~al~~~~~~~~~~p~li~apg~~~~~~~ 142 (393) T protein:vir:10 74 NSIGSIVKTPTVIVRVAE----------SDDSDTLTANIVGT-QENGKFTGIKALLTAQSTVFVKPKLLCVPQHDNQAVA 142 (393) T ss_pred hhhhcccCceEEEeeccc----------Cccccccccccccc-cccchhhHHHHHHhhhhhcceeeeeeeeccccchHHH Confidence 999999999998876532 23334455555554 3356789999999999999999999999999877655 Q ss_pred HHHHHHHhhccceeEEEEeccCCCHHHHHhhhhhccccccCccceEEecceeEeeccccccceeeechHHHHHHHHHhhh Q lcl|NC_021557. 161 RAEMDVVASRLHALAIADLPLGLTKQQAVAARGVAGTANTSSARTVLTYPHVVIEDTTGATETRLDPLSSRLAGVIIATD 240 (419) Q Consensus 161 ~a~l~~~~~~~~~~~i~d~p~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~~~~~~~~~~~~p~s~~vAg~~a~~D 240 (419) ++|..+|++.+++++++.|.+.+.++++.++. +++|+++++||||++++|+. ++..+++|||+++||++|++| T Consensus 143 -~al~~~~~~~~~~~~v~d~~~~t~~~ai~~~~-----~~~s~~~~~~~P~~~~~d~~-~~~~~~~p~s~~~Ag~~a~~d 215 (393) T protein:vir:10 143 -TELLSVAKKLNAFAFISDNGATTKEQAYTYRQ-----NFSQREGMMIFGDWKSYNTD-KKAYDTDYAVARACALQAYID 215 (393) T ss_pred -HHHHHHhhccCcEEEEEcCCCCCHHHHHHHhh-----hcCCceEEEEeccccccccc-CCceeEeehhHHHHHHHHHhh Confidence 55555556666666666666888888988874 57889999999999998864 567889999999999999999 Q ss_pred hccCceecccCceeeceeecceecccccCCcchhhccccCCceEEEEEecCCcEEEEeccccCCCCCcccceeeehhhHH Q lcl|NC_021557. 241 LNEGWQNSPSNREIKGVVDLEVPINFYPSDYQNDTNFLNEAGIVTAMRSFATGIRVFGNRSAAFPTSSHVENFIHARRIL 320 (419) Q Consensus 241 ~~~g~~~span~~l~gv~~~~~~~~~~~~~~~~~~~~L~~~gI~~i~~~~~~G~~~wG~rT~~~~s~~~~~~~i~vrR~~ 320 (419) .++|||+||+|++|.||+++++.+++.+++.++|+++||++|||++++ ++|+++||+||++ .|+.|+||++|||+ T Consensus 216 ~~~G~~~spaN~~l~gi~~~~~~~~~~~~~~~~~~~~Ln~~gI~t~~~--~~G~~~wG~rT~s---~d~~~~~i~vrR~~ 290 (393) T protein:vir:10 216 KTVGWHKNISNVELDGVTGITKAVEFDINESSTEANYLNEKGITICLN--HNGFRYWGSRTLA---TDTRWAFQQSVRTA 290 (393) T ss_pred cCCCcEEccCCceeeceeecceecccccCCCcchhHhHhhcCceEEEc--CCCEEEEcccccC---CCcccceeehhhHH Confidence 999999999999999999999999999999999999999999999964 5899999999995 46789999999999 Q ss_pred HHHHHHHHHHHHHhhcCCCCHHHHHHHHHHHHHHHHHHHhhc---ccceEEEEecccCCHHHhhCCEEEEEEEEEeccCc Q lcl|NC_021557. 321 DMIHEAIIFYTMNYVDRLGSPMTVEAAEEGVNAYLRSKTGIA---IYGGTFRFDRQKNTAEQIADGKFYYRLECHPISVM 397 (419) Q Consensus 321 ~~i~~~~~~~~~~~v~e~n~~~~~~~i~~~i~~~L~~l~~~g---~~~~~v~~d~~~n~~~~i~~G~~~~~v~~~p~~p~ 397 (419) ++|+++|++.++|++||||++.+|++|+++++.||++||++| ++||+|+||++ ||+++|++|+|+++|+++|++|+ T Consensus 291 ~~i~~~i~~~~~~~v~e~~~~~~~~~i~~~i~~~L~~l~~~g~~al~g~~v~~~~~-nt~~~i~~G~~~~~i~~~p~~p~ 369 (393) T protein:vir:10 291 QIIKETIGAGLAWAVDMPLTPLRVKTMLEAINNKLRSWASGDDPRILGARVWVAEE-ITADIIKSGKFVIKYDYHWIPSL 369 (393) T ss_pred HHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhccccccccceEEecCC-CCHHHhhCCEEEEEEEEEecCCc Confidence 999999999999999999999999999999999999999744 88999999875 88899999999999999999999 Q ss_pred eEEEEEEEEcchHHHHHHHhcC Q lcl|NC_021557. 398 ERITIDSYVDTKFISNALSLAA 419 (419) Q Consensus 398 e~i~~~~~~~~~~~~~~~~~~a 419 (419) |||+|+++++++||+++|++++ T Consensus 370 e~I~~~~~~~~~~~~~l~~~v~ 391 (393) T protein:vir:10 370 ESLGLEQRVNDEYVVDLVNTLK 391 (393) T ss_pred ceEEEEEEEchHHHHHHHHHHh Confidence 9999999999999999999999 No 14 >protein:vir:10336 Length: 386 # NCBI annotation: ORF39 # Family: family:all:115 # MgeID: mge:182 # MgeName: VHML # Cross-refs: genbank:acc:NP_758931;genbank:gi:27311206;genbank:GeneID:956116 Probab=100.00 E-value=2.3e-102 Score=577.77 Aligned_cols=384 Identities=29% Similarity=0.392 Sum_probs=348.0 Q ss_pred CCCccCCCeEEEEcCCCccCccccCccceEEEEccccccccccccccccccCcceeecchHHHHHHhcccchhhhHHHHH Q lcl|NC_021557. 1 MAATFHHGPEVIEHKDGVTVVRDVKSAVTYVNGTAPIQDVHATALAREDYINKRVIIRSRAEGAAAFGVHKAGYTIPAAL 80 (419) Q Consensus 1 Ma~~~~hGVyv~e~~~~~~~i~~v~tav~~~Vgta~~a~~~~~~~~~~~~~n~pv~its~~e~~~~fg~~~~~~~l~~al 80 (419) ||++|+|||||+|++++++++.+++|++++|||++++++...+++ |+|++++++.++...+|.. +.+..++ T Consensus 1 M~~~~~~Gv~v~ev~~~~~~i~~v~tav~~~vg~a~~a~~~~~~~------~~pv~i~s~~~~~~~~g~~---~tl~~a~ 71 (386) T protein:vir:10 1 MAEQYLHGAEVVEIDNGARPIRTAQSGVIGLVGTAPDADATAFPL------NTPVLIAGSRREAAKLGAG---GTLPQAI 71 (386) T ss_pred CccccCCCeEEEEcCCCcccccccCcceeEEEEecCCCCCccccc------ccceEecchHHHHhhcCCC---cchhHHH Confidence 999999999999999999999999999999999999988776665 8999999999999998864 7889999 Q ss_pred HHHhhccCCcEEEEeeccccccccccccccccccccceecc-ccccccccccchhhhhhhhhccccccccccchhhhhhh Q lcl|NC_021557. 81 DAIFDQGDGGTIIVNNVFDPDVHKEGANPDPSKVTTVDING-TISPAGLASGFSGAYECYNNFGYFPKLIIAPGYSPAAA 159 (419) Q Consensus 81 ~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~t~~~~~g-~~~~~~~~tg~~a~~~~~~~~~~~p~~~~ap~~~~~~~ 159 (419) +.++.+++..++++.... ......+..++++ ....+...+|+.++.+.+..++..|++..+|+++...+ T Consensus 72 ~~~~~~gg~~~~vv~~~~----------~~~~~~t~~~~ig~~~~~t~~~tgl~~l~~~~~~~~~~p~i~~ap~~~~~~~ 141 (386) T protein:vir:10 72 DGIFDQTGAVVVVIRVDE----------GVDSAATQSNVIGKVDADTEQYTGILALLSAENTVKVQPRILIAPGFSNQKA 141 (386) T ss_pred HHHhccCceeEEEeeccc----------cccccccchhhhcccccccchhhhhHHhhhhcccccccccccccccccchhH Confidence 999999999888876432 2222333444444 44457788999999999999999999999999999999 Q ss_pred HHHHHHHHhhccceeEEEEeccCCCHHHHHhhhhhccccccCccceEEecceeEeeccccccceeeechHHHHHHHHHhh Q lcl|NC_021557. 160 VRAEMDVVASRLHALAIADLPLGLTKQQAVAARGVAGTANTSSARTVLTYPHVVIEDTTGATETRLDPLSSRLAGVIIAT 239 (419) Q Consensus 160 v~a~l~~~~~~~~~~~i~d~p~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~~~~~~~~~~~~p~s~~vAg~~a~~ 239 (419) +.+.|..+++++..+.+.|++ +.+.+++..++ .+++|+++++||||++++|+. ++..+++|||+++||+++|+ T Consensus 142 v~~~l~~~~~~~~~~~~~~~~-~~~~~~a~~~~-----~~~~s~~~~~~~p~~~v~~~~-~~~~~~~p~s~~~ag~~a~~ 214 (386) T protein:vir:10 142 VADQLVSVADTAAWLCHSGWS-NTTDAAAITYR-----ELFGSRRCEVVDPWYKVWDVE-TSAHIIQPPSARHAGVMAKV 214 (386) T ss_pred HHHHHHHhhcceEEEEEeCCC-CCchHHHHHhh-----hcccccceEEecCceeeeccc-cccceeechHHHHHHHHHHh Confidence 999999999999999998887 66677777776 357899999999999999865 55678999999999999999 Q ss_pred hhccCceecccCceeeceeecceecccccCCcchhhccccCCceEEEEEecCCcEEEEeccccCCCCCcccceeeehhhH Q lcl|NC_021557. 240 DLNEGWQNSPSNREIKGVVDLEVPINFYPSDYQNDTNFLNEAGIVTAMRSFATGIRVFGNRSAAFPTSSHVENFIHARRI 319 (419) Q Consensus 240 D~~~g~~~span~~l~gv~~~~~~~~~~~~~~~~~~~~L~~~gI~~i~~~~~~G~~~wG~rT~~~~s~~~~~~~i~vrR~ 319 (419) |.++|||+||+|++|.||+++++++.+.+++.++|+++||++||++++ +++|+++||+||++. |+.|+||++||| T Consensus 215 D~~~G~~~spaN~~l~gv~~~~~~~~~~~~~~~~~~~~l~~~gi~~~~--~~~G~~~wG~rT~~~---d~~~~~i~vrR~ 289 (386) T protein:vir:10 215 HNTLGFWWSNSNQEILGIDGLCRPVDFKLDDPTCRANLLNAKEVTTTI--QQNGFRVWGDRTCSA---DSKWAFKNVVIT 289 (386) T ss_pred hhcCCcEEccCCceeecccccceecccccccCcchhhhhhhcCcEEEE--cCCCEEEEcccccCC---CcccceeehhhH Confidence 999999999999999999999999999999999999999999999886 579999999999954 668999999999 Q ss_pred HHHHHHHHHHHHHHhhcCCCCHHHHHHHHHHHHHHHHHHHhhc-ccceEEEEecccCCHHHhhCCEEEEEEEEEeccCce Q lcl|NC_021557. 320 LDMIHEAIIFYTMNYVDRLGSPMTVEAAEEGVNAYLRSKTGIA-IYGGTFRFDRQKNTAEQIADGKFYYRLECHPISVME 398 (419) Q Consensus 320 ~~~i~~~~~~~~~~~v~e~n~~~~~~~i~~~i~~~L~~l~~~g-~~~~~v~~d~~~n~~~~i~~G~~~~~v~~~p~~p~e 398 (419) +++|+++|+++++|++||||++.+|++|++++++||++||++| ++||+|+||+++||++++++|+|+++|+++|++|+| T Consensus 290 ~~~i~~~~~~~~~~~v~e~~~~~~~~~i~~~i~~~L~~l~~~g~l~g~~v~~d~~~nt~~~~~~G~~~~~i~~~p~~p~e 369 (386) T protein:vir:10 290 NDMIADSLVRNHLWAVDRNITKTYVEDVTEGVNNYLRHLKNIGAIAGGECWVDPELNSPDQIQQGKVYFDYDFSAYAPAE 369 (386) T ss_pred HHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEcccCCCHHHhhCCeEEEEEEEEecCCce Confidence 9999999999999999999999999999999999999999865 889999999999999999999999999999999999 Q ss_pred EEEEEEEEcchHHHHHH Q lcl|NC_021557. 399 RITIDSYVDTKFISNAL 415 (419) Q Consensus 399 ~i~~~~~~~~~~~~~~~ 415 (419) ||+|+++++.+||++++ T Consensus 370 ~i~~~~~~~~~~~~~~~ 386 (386) T protein:vir:10 370 HITFRSHMVNGYLTEVV 386 (386) T ss_pred eEEEEEEEehhHHHhhC Confidence 99999999999999999 No 15 >protein:vir:96740 Length: 388 # NCBI annotation: phage tail sheath protein FI-like # Family: family:all:115 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039839;genbank:gi:126010913;genbank:GeneID:5076250 Probab=100.00 E-value=8.1e-102 Score=574.81 Aligned_cols=384 Identities=20% Similarity=0.226 Sum_probs=344.2 Q ss_pred CCC--ccCCCeEEEEcCCCccCccccCccceEEEEccccccccccccccccccCcceeecchHHHHHHhcccchhhhHHH Q lcl|NC_021557. 1 MAA--TFHHGPEVIEHKDGVTVVRDVKSAVTYVNGTAPIQDVHATALAREDYINKRVIIRSRAEGAAAFGVHKAGYTIPA 78 (419) Q Consensus 1 Ma~--~~~hGVyv~e~~~~~~~i~~v~tav~~~Vgta~~a~~~~~~~~~~~~~n~pv~its~~e~~~~fg~~~~~~~l~~ 78 (419) |+. +|+|||||+|+++++++|.++++++++||||+++++.. ++ .++|+++.++.++...++.....+++.. T Consensus 1 m~~~~~~~hGv~v~ev~~g~~~i~~~~tavi~~Vgta~~ad~~-~p------~~~~~~i~~~~d~~~~~~~~~~~gtl~~ 73 (388) T protein:vir:96 1 MPVIDQFEHNGISIETHEPPPPMGPPGDNVVAWVVTAPDKHAD-VA------FSVPFRVANTADAQYLDSTGNELGTGWH 73 (388) T ss_pred CCCCCCCCCceEEEEcCCCcccccccCcceeEEEEecCCCccc-cc------cccceeeecchhhhhhhccccccccchh Confidence 984 68999999999999999999999999999999988763 33 4889999999999999888878899999 Q ss_pred HHHHHhhccCCcEEEEeeccccccccccccccccccccceecc-ccccccccccchhhhhhhhhccccccccccchhhhh Q lcl|NC_021557. 79 ALDAIFDQGDGGTIIVNNVFDPDVHKEGANPDPSKVTTVDING-TISPAGLASGFSGAYECYNNFGYFPKLIIAPGYSPA 157 (419) Q Consensus 79 al~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~t~~~~~g-~~~~~~~~tg~~a~~~~~~~~~~~p~~~~ap~~~~~ 157 (419) +++.++++++..++++..... +....+..+++| ....++.++|++++.+.+ ..|+++++|++++. T Consensus 74 al~~~~~~~~~~~~vv~v~~g----------~~~~at~a~iig~~~~~tg~~~gl~al~~~~----~~p~il~aPg~s~~ 139 (388) T protein:vir:96 74 AASETLKKTSVPQYFIVVPEG----------ADDAATMANIIGGIDPTTGRRTGIAALTECT----ERPTLIGAPGFSQN 139 (388) T ss_pred hhHhhhccCCceEEEEEeccc----------cccccccceeeeecccccchhhHHHHhhhcc----cceeEEEeeccccc Confidence 999999999888877765322 122334444454 344567778887776644 46899999999999 Q ss_pred hhHHHHHHHHhhccceeEEEEeccCCCHHHHHhhhhhccccccCccceEEecceeEeeccccccceeeechHHHHHHHHH Q lcl|NC_021557. 158 AAVRAEMDVVASRLHALAIADLPLGLTKQQAVAARGVAGTANTSSARTVLTYPHVVIEDTTGATETRLDPLSSRLAGVII 237 (419) Q Consensus 158 ~~v~a~l~~~~~~~~~~~i~d~p~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~~~~~~~~~~~~p~s~~vAg~~a 237 (419) ++|+++|..+|++.++|+++|+|. .+..++.+++...+..+++|+|+++||||++++|+. ++..+++|||+++||++| T Consensus 140 ~~v~~al~~~~~~~~~~~i~D~p~-~~~~~~~~~~~~~~~~~~~s~~~~~~~P~~~~~d~~-~~~~~~~p~s~~~AG~~a 217 (388) T protein:vir:96 140 KAVIDALASMAKRLKCRAVIDGPS-GSTQDAIDLSGLLGGEGTGHDRVYMVDPMPAIYSRK-AQGNIYVPPSTIAMGAVA 217 (388) T ss_pred hHHHHHHHHHHhhcCcEEEEeccC-CchhHHHHHHhhhhccCcCcceEEEEeCceeeeccc-CCceeeechHHHHHHHHH Confidence 999999999999999999999995 456677888888888999999999999999999864 566899999999999999 Q ss_pred hhhhccCceecccCceeeceeecceecccccCCcchhhccccCCceEEEEEecCCcEEEEeccccCCCCCcccceeeehh Q lcl|NC_021557. 238 ATDLNEGWQNSPSNREIKGVVDLEVPINFYPSDYQNDTNFLNEAGIVTAMRSFATGIRVFGNRSAAFPTSSHVENFIHAR 317 (419) Q Consensus 238 ~~D~~~g~~~span~~l~gv~~~~~~~~~~~~~~~~~~~~L~~~gI~~i~~~~~~G~~~wG~rT~~~~s~~~~~~~i~vr 317 (419) ++| +||||+|+++ ++.|+++.+.+..++.++|+++||++|||+|++++++|+++||+||++ |+||++| T Consensus 218 ~~D----~~~spaN~~i-~i~g~~~~~~~~~~~~~~~~~~Ln~~gI~~i~~~~~~G~~~wG~rT~~-------~~~i~vr 285 (388) T protein:vir:96 218 AVK----PWESPGNQGV-LIQDVARVIDYNILDKSTEGDLLNRNGVSYFARTSMGGFSLIGNRTVT-------GKFISFV 285 (388) T ss_pred hhc----CcccccCeeE-EeeeecccccccccCChhhHHhhhhcCceEEEEecCCcEEEEcccccC-------Ccceeeh Confidence 999 6999999998 699999999999999999999999999999999999999999999983 9999999 Q ss_pred hHHHHHHHHHHHHHHHhhcCCCCHHHHHHHHHHHHHHHHHHHhhc-ccceEEEEecccCCHHHhhCCEEEEEEEEEeccC Q lcl|NC_021557. 318 RILDMIHEAIIFYTMNYVDRLGSPMTVEAAEEGVNAYLRSKTGIA-IYGGTFRFDRQKNTAEQIADGKFYYRLECHPISV 396 (419) Q Consensus 318 R~~~~i~~~~~~~~~~~v~e~n~~~~~~~i~~~i~~~L~~l~~~g-~~~~~v~~d~~~n~~~~i~~G~~~~~v~~~p~~p 396 (419) ||++||+++|++.++|++||||++.+|++|+++++.||++||++| ++||+++||+++||+++|++|+|+++|+++|++| T Consensus 286 R~~~~i~~si~~~~~~~v~epn~~~~~~~i~~~i~~fL~~l~~~Gal~g~~~~~d~~~nt~~~i~~G~~~~~i~~~p~~p 365 (388) T protein:vir:96 286 GLEDAIARKLEAASQRAMSKQLTKSFMEQEIKKINLFMQDLVAAEIIPGGEVYLHPTLNTVERYKNGSWYIVIDYGRYSP 365 (388) T ss_pred hhHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEecCCCCHHHhhCCEEEEEEEEEecCC Confidence 999999999999999999999999999999999999999999865 7899999999999999999999999999999999 Q ss_pred ceEEEEEEEEcchHHHHHHHhcC Q lcl|NC_021557. 397 MERITIDSYVDTKFISNALSLAA 419 (419) Q Consensus 397 ~e~i~~~~~~~~~~~~~~~~~~a 419 (419) +|||+|+++++++||+++|+++= T Consensus 366 ae~I~~~~~~~~~~~~~~~~~~~ 388 (388) T protein:vir:96 366 NEHMIFHLNAVDRIVEEFIEEVL 388 (388) T ss_pred cceEEEEEEEchHHHHHHHHHhC Confidence 99999999999999999999999 No 16 >protein:vir:106984 Length: 743 # NCBI annotation: contractile sheath protein gp18 # Family: family:all:661 # MgeID: mge:1459 # MgeName: S-PM2 # Cross-refs: genbank:acc:YP_195136;genbank:gi:58532913;uniprot:Q5GQN6;genbank:GeneID:3260481 Probab=100.00 E-value=2.7e-92 Score=522.61 Aligned_cols=396 Identities=18% Similarity=0.157 Sum_probs=306.8 Q ss_pred CCCccCCCeEEEEcCCCccCccccCccceEEEEccccccccccccccccccCcceeecchHHHHHHhcccchhhhHHHHH Q lcl|NC_021557. 1 MAATFHHGPEVIEHKDGVTVVRDVKSAVTYVNGTAPIQDVHATALAREDYINKRVIIRSRAEGAAAFGVHKAGYTIPAAL 80 (419) Q Consensus 1 Ma~~~~hGVyv~e~~~~~~~i~~v~tav~~~Vgta~~a~~~~~~~~~~~~~n~pv~its~~e~~~~fg~~~~~~~l~~al 80 (419) ||+++.|||||+|++.++++|..+.|++++|||++++++ .|+|++|+|+.||...||.......++.++ T Consensus 1 m~~~~~Pgvyv~e~~~~~~~i~~~~t~~~~~vg~~~~Gp-----------~~~p~~i~s~~~~~~~fG~~~~~~~~~~~v 69 (743) T protein:vir:10 1 MASQVSPGILIKERDLTNAVVTGALQIRAAHASTFAKGP-----------IGDIVNINTQKELVSVFGEPKEDNAEDWMV 69 (743) T ss_pred CccccCCceEEEEecCCCceeccCCcceeEEEEeccCCC-----------CCcCEEecCHHHHHHHcCCccCCcchHHHH Confidence 999999999999999999999999999999999988775 489999999999999999988888999999 Q ss_pred HHHhhccCCcEEEEeeccccccccc---------------------------cc--------------ccc--------- Q lcl|NC_021557. 81 DAIFDQGDGGTIIVNNVFDPDVHKE---------------------------GA--------------NPD--------- 110 (419) Q Consensus 81 ~~~~~~~~~~~~v~~~~~~~~~~~~---------------------------~~--------------~~~--------- 110 (419) ..+|.+++..|++++.......... .. ..+ T Consensus 70 ~~~f~ngg~~~~vvrv~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~G~~gN~i~V~v~~~~~d~~~~~~~~~ 149 (743) T protein:vir:10 70 ASEFLNYGGRLAVVRAETTGVLNATTGSAGVLVKNRESWDAGSGNGEVFVARTAGSWGNSLMGVLVDRGADYIVTFAATP 149 (743) T ss_pred HHHHHhCCceEEEEEccCccccccccccccccccccccccccccceeEEEEeeccccccceEEEEecCCCcceeeeeccc Confidence 9999999999999987532100000 00 000 Q ss_pred ------------cc------------------------------------------------------------------ Q lcl|NC_021557. 111 ------------PS------------------------------------------------------------------ 112 (419) Q Consensus 111 ------------~~------------------------------------------------------------------ 112 (419) .. T Consensus 150 ~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~ 229 (743) T protein:vir:10 150 TDTAVGTQLLFSYSGTLVTGEILSYDSATNTATITASGTLTSQYLLDTPEQGLIGSFTDNSTTEVGRTPGTYSNVPASGG 229 (743) T ss_pred cccccceeeeecccccccccceeeeeecCcceeeeeccccceeeecccccccccccccccccccccccccceeeEEeccc Confidence 00 Q ss_pred ccc---c----ceec---c-c------------------------cc------------cccc----------------c Q lcl|NC_021557. 113 KVT---T----VDIN---G-T------------------------IS------------PAGL----------------A 129 (419) Q Consensus 113 ~~t---~----~~~~---g-~------------------------~~------------~~~~----------------~ 129 (419) ... . .... + . .. ..+. . T Consensus 230 ~~~~~~~tv~v~~~~~~vg~~v~~~~~~~~~~~~~~~~~~v~~~~~~~~t~~~~~~~~~~~g~~~~~a~~~~~~~~~~~~ 309 (743) T protein:vir:10 230 TGTGATFNVVVADAGGGVGGSVVVTLANPGTGYNQGETLTIASAATGDGTDILVTVATLSDGTIAITELKDWYLNTEIGS 309 (743) T ss_pred ccccccccccccccccccccccccccccccceeeeccccccccccccccccchhheecccccceeeeecccccccchhhc Confidence 000 0 0000 0 0 00 0000 0 Q ss_pred cc-----------------------------------------------chhh--------------------------- Q lcl|NC_021557. 130 SG-----------------------------------------------FSGA--------------------------- 135 (419) Q Consensus 130 tg-----------------------------------------------~~a~--------------------------- 135 (419) ++ +..+ T Consensus 310 ~~~~~~~~~~~~~t~~~~~~~~~~~d~~~v~v~~~~~~~~~~~~~v~~~~~~~s~~~~~~~~~~~~~~~~~~~~~~s~~~ 389 (743) T protein:vir:10 310 TGIKLGDIGPRPGTSQFATDNGITDDQVHFAVIDTTGELTGTANTIVERLTYLSKLSDARSEENANIYYKNVINEQSAYL 389 (743) T ss_pred cccccccccccceeeeccccccccccceEEEEecCcceeeeccCceeEEEeeeecccccccccCcceeecceecccccee Confidence 00 0000 Q ss_pred -----------------------------------------------------------hhhhhhcc-ccccccccchhh Q lcl|NC_021557. 136 -----------------------------------------------------------YECYNNFG-YFPKLIIAPGYS 155 (419) Q Consensus 136 -----------------------------------------------------------~~~~~~~~-~~p~~~~ap~~~ 155 (419) ...+.... ..+.++++|++. T Consensus 390 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gG~d~~~~~~~~~~~~~~~~~~~~~~~~~ll~~p~~~ 469 (743) T protein:vir:10 390 YHGNDAAVQIAASGEAWGQSSDQVLADAGTAFSRTTGYWVNLAGGNDDFAYDAGEFGAAMDLFLDTEETEIDFVLMGGSM 469 (743) T ss_pred eccCcccceeeeccccCccccceeeeecccccccccceEEEeecCccccccchhHHHHHHHHhhhccccCcceEEecCcc Confidence 00000000 012455566653 Q ss_pred hh----hh-HHHHHHHHhhccceeEEEEeccCCC--------------HHHHHhhhhhccccccCccceEEecceeEeec Q lcl|NC_021557. 156 PA----AA-VRAEMDVVASRLHALAIADLPLGLT--------------KQQAVAARGVAGTANTSSARTVLTYPHVVIED 216 (419) Q Consensus 156 ~~----~~-v~a~l~~~~~~~~~~~i~d~p~~~~--------------~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~~ 216 (419) .. .+ +.+++.+|.++.+||+++|+|.+.. ..+...++. .+++|+|+++||||++++| T Consensus 470 ~~~~~~~~v~~a~~~~~~~~~~~~a~~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~s~~~~~~~p~~~~~d 545 (743) T protein:vir:10 470 ADEADTKSKATKVIAIAASRKDALAFVSPHKGNQIASTGNVALSSAQQKENTIAFFS----DLTSTSYAVFDSGYKYVYD 545 (743) T ss_pred cCccchHHHHHHHHHHHHhhCCeEEEEecCCCccccccccccccccccchHHHHHHH----hccCCeeEEEEccceeeec Confidence 22 23 4455566666778999999996532 233444432 4578999999999999998 Q ss_pred cccccceeeechHHHHHHHHHhhhhccCceecccCceeeceeecceecccccCCcchhhccccCCceEEEEEecCCcEEE Q lcl|NC_021557. 217 TTGATETRLDPLSSRLAGVIIATDLNEGWQNSPSNREIKGVVDLEVPINFYPSDYQNDTNFLNEAGIVTAMRSFATGIRV 296 (419) Q Consensus 217 ~~~~~~~~~~p~s~~vAg~~a~~D~~~g~~~span~~l~gv~~~~~~~~~~~~~~~~~~~~L~~~gI~~i~~~~~~G~~~ 296 (419) +. ++..+++|||+++||++||+|.++||||||+|+.+.||.++.. ......++|++.||++|||+|++|+++|+++ T Consensus 546 ~~-~~~~~~~p~s~~~AGl~a~~D~~~g~~~span~~~~gi~g~~~---~~~~~~~~~~~~Ln~~gIn~i~~~~~~G~~~ 621 (743) T protein:vir:10 546 RF-TDKYRYIPCNGDVAGLCVQTSNQLDDWYSPAGLNRGGILNAVK---LAYNPNKADRDELYQNRINPVVSLRGQGITL 621 (743) T ss_pred cc-cCceeEechhHHHHHHHHHhhccCCcEEccCCeeeeeeecccc---ceecCChhHHHhHhhCCceEEEEecCCeEEE Confidence 64 6778999999999999999999999999999999888887632 3344567789999999999999999999999 Q ss_pred EeccccCCCCCcccceeeehhhHHHHHHHHHHHHHHHhhcCCCCHHHHHHHHHHHHHHHHHHHhh-cccceEEEEecccC Q lcl|NC_021557. 297 FGNRSAAFPTSSHVENFIHARRILDMIHEAIIFYTMNYVDRLGSPMTVEAAEEGVNAYLRSKTGI-AIYGGTFRFDRQKN 375 (419) Q Consensus 297 wG~rT~~~~s~~~~~~~i~vrR~~~~i~~~~~~~~~~~v~e~n~~~~~~~i~~~i~~~L~~l~~~-g~~~~~v~~d~~~n 375 (419) ||+||++ ++|+.|+||++|||++||+++|++.++|+|||||++.+|++|++++++||++||++ ++++|+|+||+++| T Consensus 622 wG~rT~~--s~d~~~~~i~vrR~~~~i~~si~~~~~~~v~e~n~~~~~~~i~~~i~~fL~~l~~~gal~~~~V~~d~~~n 699 (743) T protein:vir:10 622 FGDKTAL--AAPSAFDRINVRRLFLNLEKRARRLAEGVLFEQNDATTRAGFSSALNSYLSEVQARRGVTDYLVICDESNN 699 (743) T ss_pred EcccccC--CCCcccceEeehhhHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhcCceeeeEEEEcCCCC Confidence 9999995 45789999999999999999999999999999999999999999999999999985 58899999999999 Q ss_pred CHHHhhCCEEEEEEEEEeccCceEEEEEEE--EcchHHHHHHHh Q lcl|NC_021557. 376 TAEQIADGKFYYRLECHPISVMERITIDSY--VDTKFISNALSL 417 (419) Q Consensus 376 ~~~~i~~G~~~~~v~~~p~~p~e~i~~~~~--~~~~~~~~~~~~ 417 (419) |+++|++|+|+++|+++|++|+|||+|+|. ....+|++++++ T Consensus 700 t~~~i~~G~~~~~i~~~p~~pae~I~~~~~~~~~~~~~~e~~~~ 743 (743) T protein:vir:10 700 TPDIIDRNEFVAEVYVKPTRSINFITITFTATKTGVTFSEVVGR 743 (743) T ss_pred CHHHhhCCeEEEEEEEEecCCcceEEEEEEEeecCcchHhhhcC Confidence 999999999999999999999999999987 456679999999 No 17 >protein:vir:6894 Length: 660 # NCBI annotation: gp18 tail sheath protein # Family: family:all:661 # MgeID: mge:140 # MgeName: RB69 # Cross-refs: genbank:acc:NP_861870;genbank:gi:32453661;genbank:GeneID:1494296 Probab=100.00 E-value=1.8e-91 Score=518.05 Aligned_cols=400 Identities=17% Similarity=0.159 Sum_probs=301.6 Q ss_pred CCCccCCCeEEEEcCCCccCccccCccceEEEEccccccccccccccccccCcceeecchHHHHHHhcccchhhhHHHHH Q lcl|NC_021557. 1 MAATFHHGPEVIEHKDGVTVVRDVKSAVTYVNGTAPIQDVHATALAREDYINKRVIIRSRAEGAAAFGVHKAGYTIPAAL 80 (419) Q Consensus 1 Ma~~~~hGVyv~e~~~~~~~i~~v~tav~~~Vgta~~a~~~~~~~~~~~~~n~pv~its~~e~~~~fg~~~~~~~l~~al 80 (419) |+ +..|||||+|+ +++++|..+.|++.+|||.+++++ .|+|++|+|+.|+...||.......+..++ T Consensus 1 ~~-~~~PgVyv~e~-~~~~~i~~v~ts~~~fvG~~~~Gp-----------~~~p~~i~s~~~~~~~fG~~~~~~~~~~~~ 67 (660) T protein:vir:68 1 MA-LLSPGVELKET-TVQSTVVNNSTGTAALAGKFQWGP-----------AFQIKQITDEVALVDMFGTPNTDTADYFMS 67 (660) T ss_pred Cc-cccCceEEEEe-cCCcccccCCCcceeEEecccCCC-----------CccCEEecCHHHHHHhcCCccCccchhHHH Confidence 76 44699999999 589999999999999999988765 489999999999999999987788888999 Q ss_pred HHHhhccCCcEEEEeeccccccc---------------------------ccccc------cc----------------- Q lcl|NC_021557. 81 DAIFDQGDGGTIIVNNVFDPDVH---------------------------KEGAN------PD----------------- 110 (419) Q Consensus 81 ~~~~~~~~~~~~v~~~~~~~~~~---------------------------~~~~~------~~----------------- 110 (419) ..+|.++|..+++++........ ..... .. T Consensus 68 ~~~f~~~g~~~~vvRv~~~~~~~~~~~~~~~~~~t~~~~g~~~~~g~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~t 147 (660) T protein:vir:68 68 AMNFLQYGNDLRVVRAVDRDTAKNSSPVAGNINFTISSAGTNYRVGDKVVVKYSTDIIEPDGEVTSVDSDGKILNIFIPS 147 (660) T ss_pred HHHHHhCCCeEEEEEecccccccccccccccceeeeeccCcceeeeeeeeeecccccccccccceeeeecCceeeeeecc Confidence 99999999999998764211000 00000 00 Q ss_pred -------------ccccc--cceec-------------ccc--------------------------------------- Q lcl|NC_021557. 111 -------------PSKVT--TVDIN-------------GTI--------------------------------------- 123 (419) Q Consensus 111 -------------~~~~t--~~~~~-------------g~~--------------------------------------- 123 (419) +.... ...+. +.. T Consensus 148 a~~~~~a~~~~~~~~~~~~~~~~v~~~~~~~~~~~~v~~~~~d~~~~~~~~~ta~~~~~~~~~~~~~~~~~~~~~~A~~~ 227 (660) T protein:vir:68 148 GKIIAKAKEIGEYPELGSNWTAEMSGSSSGLSAVITIDSVVMDSGILLTEVETSEEAITSLTFQESIKKYGVPGVVALYP 227 (660) T ss_pred ccccccceeeccccccccceeEEeecccccceeeeeeccccccccceeeeeccccccccccceeeeecccCccccccccc Confidence 00000 00000 000 Q ss_pred cccc-c-----------------------------cccchhh---------------h---------------------- Q lcl|NC_021557. 124 SPAG-L-----------------------------ASGFSGA---------------Y---------------------- 136 (419) Q Consensus 124 ~~~~-~-----------------------------~tg~~a~---------------~---------------------- 136 (419) ...+ . ..+...+ . T Consensus 228 g~~G~~i~v~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~ 307 (660) T protein:vir:68 228 GELGDQLEIEIVSKADYDKGASAQLKIYPDGGTRYSTAKAIFGYGPQTDDQYAIIVRRNDSVVQSVVLSTKRGERDIYGS 307 (660) T ss_pred cccccceEEEEeccccccccccccceeeecccccccceeeEeecccccccceeeeeecCCcceeeeeeeccccccccccc Confidence 0000 0 0000000 0 Q ss_pred ---------hhhhhc----------------ccc----------------------------cccccc-----chhhhhh Q lcl|NC_021557. 137 ---------ECYNNF----------------GYF----------------------------PKLIIA-----PGYSPAA 158 (419) Q Consensus 137 ---------~~~~~~----------------~~~----------------------------p~~~~a-----p~~~~~~ 158 (419) .....+ .+. +.++.. +...... T Consensus 308 ~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~gg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~ 387 (660) T protein:vir:68 308 NIFIDDFFAKGASNYIFATAQGWPKGFSGVIKLNGGLSSNETVEAGDLMEAWDLFADRESVNAQLFIAGSCAGESLEVAS 387 (660) T ss_pred ceeeehhhccCcccEEEEeecCCCccccceeeeccccccccccccchhhhHHHHhhhhhccccceeeccccCCCchHHHH Confidence 000000 000 000000 0001112 Q ss_pred hHHH-HHHHHhhccceeEEEEec--------cCCCHHHHHhhhhhcc-----ccccCccceEEecceeEeecccccccee Q lcl|NC_021557. 159 AVRA-EMDVVASRLHALAIADLP--------LGLTKQQAVAARGVAG-----TANTSSARTVLTYPHVVIEDTTGATETR 224 (419) Q Consensus 159 ~v~a-~l~~~~~~~~~~~i~d~p--------~~~~~~~~~~~~~~~~-----~~~~~s~~~~~~~p~~~~~~~~~~~~~~ 224 (419) .++. ++.+|.++.+||+++|.| .+.+.+++.+||...+ ..+++|.|+++||||++++|+. ++..+ T Consensus 388 ~v~~~l~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~-~~~~~ 466 (660) T protein:vir:68 388 TVQKHVVAIGDSRQDCLVLCSPPRAAVVGIPVNRAVDNLVDWRTASGTYTDNNFNISSTYAAIDGNYKYQYDKY-NDVNR 466 (660) T ss_pred HHHHHHHHHHHhhCCeEEEEcccceeEecCCCCCCHHHHHHHHhhcccccccccccCcceEEEEcCceEEeccc-CCceE Confidence 3444 445566666788888765 4567788888886543 3478899999999999999965 66789 Q ss_pred eechHHHHHHHHHhhhhccCceecccCceeeceeecceecccccCCcchhhccccCCceEEEEEecCCcEEEEeccccCC Q lcl|NC_021557. 225 LDPLSSRLAGVIIATDLNEGWQNSPSNREIKGVVDLEVPINFYPSDYQNDTNFLNEAGIVTAMRSFATGIRVFGNRSAAF 304 (419) Q Consensus 225 ~~p~s~~vAg~~a~~D~~~g~~~span~~l~gv~~~~~~~~~~~~~~~~~~~~L~~~gI~~i~~~~~~G~~~wG~rT~~~ 304 (419) ++|||+++||++||+|.++||||||+|+++.+|.+.. .......++|++.||++|||+|++++++|+++||+||++. T Consensus 467 ~~p~sg~~AGl~Ar~d~~~g~~~span~~~~~i~g~~---~~~~~~~~~~~~~Ln~~gIn~i~~~~~~G~~~wG~rT~~~ 543 (660) T protein:vir:68 467 WVPLAADIAGLCARTDNISQPWMSPAGYNRGQILNVI---KLAIETRQAQRDRLYQEAINPVTGTGGDGYVLYGDKTATS 543 (660) T ss_pred EechhHHHHHHHHHHhccCCcEEccCCeeeceeeccc---eeeecCChhHHHHHhhCCceEEEEecCCeEEEEcceecCC Confidence 9999999999999999999999999999987777653 2334446778999999999999999999999999999964 Q ss_pred CCCcccceeeehhhHHHHHHHHHHHHHHHhhcCCCCHHHHHHHHHHHHHHHHHHHhhc-ccceEEEEecccCCHHHhhCC Q lcl|NC_021557. 305 PTSSHVENFIHARRILDMIHEAIIFYTMNYVDRLGSPMTVEAAEEGVNAYLRSKTGIA-IYGGTFRFDRQKNTAEQIADG 383 (419) Q Consensus 305 ~s~~~~~~~i~vrR~~~~i~~~~~~~~~~~v~e~n~~~~~~~i~~~i~~~L~~l~~~g-~~~~~v~~d~~~n~~~~i~~G 383 (419) + +..|+|||+||||+||+++|++.++|+|||||++.+|++|+++|+.||++||++| ++||+|+||+++||+++|++| T Consensus 544 ~--~s~~~~i~vrR~~~~i~~si~~~~~~~v~epn~~~~~~~i~~~i~~~L~~l~~~gal~gf~V~~d~~~nt~~~i~~G 621 (660) T protein:vir:68 544 V--PSPFDRINVRRLFNMVKTNIGSASKYRLFELNNAFTRSSFRTETSQYLQGIKALGGVYNFKVVCDTTNNTPAVIDRN 621 (660) T ss_pred C--CcccceEehhhHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhcCceeeeEEEEecCCCCHHHhhCC Confidence 3 2369999999999999999999999999999999999999999999999999865 789999999999999999999 Q ss_pred EEEEEEEEEeccCceEEEEEEEEcch--HHHHHHHhcC Q lcl|NC_021557. 384 KFYYRLECHPISVMERITIDSYVDTK--FISNALSLAA 419 (419) Q Consensus 384 ~~~~~v~~~p~~p~e~i~~~~~~~~~--~~~~~~~~~a 419 (419) +|+++|+++|++|+|||+|++.+... ++++++++|+ T Consensus 622 ~~~~~i~~~p~~pae~i~l~~~~~~~~~~~~e~~~~v~ 659 (660) T protein:vir:68 622 EFVATFYLQPARSINYITLNFVATATGADFDELIGAVG 659 (660) T ss_pred eEEEEEEEEecCCcceEEEEEEEeecCccHHHHHHhhc Confidence 99999999999999999999887744 8999999999 No 18 >protein:vir:6594 Length: 666 # NCBI annotation: tail sheath protein # Family: family:all:661 # MgeID: mge:139 # MgeName: RB49 # Cross-refs: genbank:acc:NP_891725;genbank:gi:33620670;genbank:GeneID:1725344 Probab=100.00 E-value=3.1e-91 Score=516.78 Aligned_cols=400 Identities=16% Similarity=0.143 Sum_probs=304.7 Q ss_pred CCCccCCCeEEEEcCCCccCccccCccceEEEEccccccccccccccccccCcceeecchHHHHHHhcccchhhhHHHHH Q lcl|NC_021557. 1 MAATFHHGPEVIEHKDGVTVVRDVKSAVTYVNGTAPIQDVHATALAREDYINKRVIIRSRAEGAAAFGVHKAGYTIPAAL 80 (419) Q Consensus 1 Ma~~~~hGVyv~e~~~~~~~i~~v~tav~~~Vgta~~a~~~~~~~~~~~~~n~pv~its~~e~~~~fg~~~~~~~l~~al 80 (419) |. +..|||||+|+ +++++|..+.|++++|||++++++ .|+|++|+|+.||+..||.......+..++ T Consensus 1 ~~-~~~PgVyv~e~-~~~~~i~~v~ts~~~fvG~~~~Gp-----------~~~p~~v~s~~~~~~~fG~~~~~~~~~~~~ 67 (666) T protein:vir:65 1 MT-LLSPGFETKET-TLSTTIVQSETGRAALVGKFQWGP-----------AFQIIQVTNEVELVNKFGQPDNNTADYFMS 67 (666) T ss_pred Cc-eecCceEEEEe-cCcccccccCcccceEEecccCCC-----------CccCEEecCHHHHHHHcCCccccchhHHHH Confidence 65 45799999999 588899999999999999988774 489999999999999999987778888999 Q ss_pred HHHhhccCCcEEEEeeccccccc---------------------------ccc------c-c--c----c---------- Q lcl|NC_021557. 81 DAIFDQGDGGTIIVNNVFDPDVH---------------------------KEG------A-N--P----D---------- 110 (419) Q Consensus 81 ~~~~~~~~~~~~v~~~~~~~~~~---------------------------~~~------~-~--~----~---------- 110 (419) ..+|.++|..|++++........ ... . . . . T Consensus 68 ~~~f~ngg~~~~vvrv~~~~~~~~~~~~~~~~~~~~~~~g~~~~~g~~~~V~~~~~~~~~~~~~~~~~~~~~~~g~~~~t 147 (666) T protein:vir:65 68 GANFLQYGNDLRVVRVLNKEKAKNATALAGNVEFEITNEGSNYEVGDTIKIKHNRQDIETAGKVTKVDGDGKVKGVFIPT 147 (666) T ss_pred HHHHHhcCceEEEEEccCcccccccccccCceeeeEeeccccccccceEEEEeccccccccccccccccccccccccccc Confidence 99999999999998763211000 000 0 0 0 0 Q ss_pred -------cc-----------------------------------cc------------cc----------------ceec Q lcl|NC_021557. 111 -------PS-----------------------------------KV------------TT----------------VDIN 120 (419) Q Consensus 111 -------~~-----------------------------------~~------------t~----------------~~~~ 120 (419) .. .. +. .... T Consensus 148 ~~~~~~~~~~g~~~~l~~~~~~~~~~~~~~~~~a~sv~~~~~~g~~~~~~~~~a~~~~~~~~~~~~~~~~~~~a~~A~~~ 227 (666) T protein:vir:65 148 GKIIAHAKAIGVYPELDGGWTAEFTSSSGNGSAALSVTKIVTDSGLLLTDLETSRANITNQTFLTKLKKYDMPAVSAIYA 227 (666) T ss_pred ceeeccccccCcceeEeeccceeecccCcccccceeeeecccccceeeeeecccccccccccccccccccccceeeeeec Confidence 00 00 00 0000 Q ss_pred ccccc---------cc------c--------------------------------cccc--h------------------ Q lcl|NC_021557. 121 GTISP---------AG------L--------------------------------ASGF--S------------------ 133 (419) Q Consensus 121 g~~~~---------~~------~--------------------------------~tg~--~------------------ 133 (419) +.... .. . ..|. + T Consensus 228 g~~g~~i~v~i~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~e~~~~~~~~~~~~~~~~~~ 307 (666) T protein:vir:65 228 GEIGNSLEVEILARSAFKNTAPDLTMYPYGGERTAARNLIPYAPQNDNQYAFIVRRDGVVVESYVLSTLKGDKDVYGNSI 307 (666) T ss_pred cccccceeEEeecccccccccccccccccccccccceeeecccccccccceeeeecCCcccceeecccCcccccccchhh Confidence 00000 00 0 0000 0 Q ss_pred ----hh-h--------------------------------------------------hhhh-hccccccccccchhhh- Q lcl|NC_021557. 134 ----GA-Y--------------------------------------------------ECYN-NFGYFPKLIIAPGYSP- 156 (419) Q Consensus 134 ----a~-~--------------------------------------------------~~~~-~~~~~p~~~~ap~~~~- 156 (419) .+ . +.+. .....++++++|+++. T Consensus 308 ~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~l~~p~~~~~ 387 (666) T protein:vir:65 308 YMDDFFARGSSQYIYATAQGWVDGFSGIISLAGGVSANEATTGGVGADPFIGAMMQGWDLFAERESIHVNLLIAGACAGE 387 (666) T ss_pred hhhhhhcccccceeeeecccccccccceEEccCCCCcCcccccccccccccccHHHHHHHHhhhhhccCCceeecCcCCc Confidence 00 0 0000 0001133444554433 Q ss_pred ---hhhHHHHHHH-HhhccceeEEEEec--------cCCCHHHHHhhhhhcc-----ccccCccceEEecceeEeecccc Q lcl|NC_021557. 157 ---AAAVRAEMDV-VASRLHALAIADLP--------LGLTKQQAVAARGVAG-----TANTSSARTVLTYPHVVIEDTTG 219 (419) Q Consensus 157 ---~~~v~a~l~~-~~~~~~~~~i~d~p--------~~~~~~~~~~~~~~~~-----~~~~~s~~~~~~~p~~~~~~~~~ 219 (419) ...+..+|.. |.++.++|+++|+| .+.+.+++++||...+ ..+++|+|+++||||++++|+. T Consensus 388 ~~~~~~v~~~l~~~~~~~~~~~a~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~- 466 (666) T protein:vir:65 388 GDAFSTVQKHAVSIGDERQDCLVMVSPPRSTVVNIPVTTAIDNLIAWREGSGNYNENNMNINTTYAVIDGNYKYQYDKY- 466 (666) T ss_pred cchhHHHHHHHHHHHhhccceEEEeccccceeeecCCCCCHHHHHHHHHhcccccccccccCcceEEEEcCceEEeccc- Confidence 2455555555 44556788888765 4567888999886543 3468899999999999999865 Q ss_pred ccceeeechHHHHHHHHHhhhhccCceecccCceeeceeecceecccccCCcchhhccccCCceEEEEEecCCcEEEEec Q lcl|NC_021557. 220 ATETRLDPLSSRLAGVIIATDLNEGWQNSPSNREIKGVVDLEVPINFYPSDYQNDTNFLNEAGIVTAMRSFATGIRVFGN 299 (419) Q Consensus 220 ~~~~~~~p~s~~vAg~~a~~D~~~g~~~span~~l~gv~~~~~~~~~~~~~~~~~~~~L~~~gI~~i~~~~~~G~~~wG~ 299 (419) ++..+++|||+++||++||+|.++|||+||+|+++.||.+.. ++.+. ..+.|++.||++|||+|++++++|+++||+ T Consensus 467 ~~~~~~~p~sg~vAGl~Ar~D~~~g~~~span~~~~~i~g~~-~~~~~--~~~~~~~~Ln~~gIn~i~~~~~~G~~~wG~ 543 (666) T protein:vir:65 467 NDVNRWVPLAADIAGLCARTDAVSQPWMSPAGYNRGQIMNVV-KLAIE--PRKAHRDRLYQAAINPVIGAGGEGFILMGD 543 (666) T ss_pred CCceeEechHHHHHHHHHHHhccCCcEEccCCeecceeeccc-cceee--cChhHHHhhhhCCceEEEEeCCCeEEEEec Confidence 567899999999999999999999999999999877776653 23333 346788999999999999999999999999 Q ss_pred cccCCCCCcccceeeehhhHHHHHHHHHHHHHHHhhcCCCCHHHHHHHHHHHHHHHHHHHhhc-ccceEEEEecccCCHH Q lcl|NC_021557. 300 RSAAFPTSSHVENFIHARRILDMIHEAIIFYTMNYVDRLGSPMTVEAAEEGVNAYLRSKTGIA-IYGGTFRFDRQKNTAE 378 (419) Q Consensus 300 rT~~~~s~~~~~~~i~vrR~~~~i~~~~~~~~~~~v~e~n~~~~~~~i~~~i~~~L~~l~~~g-~~~~~v~~d~~~n~~~ 378 (419) ||++. ++++|+||++||||+||+++|++.++|+|||||++.||++|+++|+.||++||++| ++||+|+||+++||++ T Consensus 544 rT~~~--~~s~~~~i~vrR~~~~i~~si~~~~~~~v~epn~~~l~~~i~~~i~~~L~~l~~~gal~g~~V~~d~~~nt~~ 621 (666) T protein:vir:65 544 KTATT--VPSPFDRINVRRLFNMLKKNIGDSSKYKLFENNDNFTRASFRMEVSQYLSTIRSLGGIYDFRVQCDTTNNTPD 621 (666) T ss_pred ccCCC--CCcccceEehhhHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEcCCCCCHH Confidence 99964 33479999999999999999999999999999999999999999999999999865 7899999999999999 Q ss_pred HhhCCEEEEEEEEEeccCceEEEEEEEEcch--HHHHHHHhcC Q lcl|NC_021557. 379 QIADGKFYYRLECHPISVMERITIDSYVDTK--FISNALSLAA 419 (419) Q Consensus 379 ~i~~G~~~~~v~~~p~~p~e~i~~~~~~~~~--~~~~~~~~~a 419 (419) +|++|+|+++|+++|++|+|||+|++.+... .++++++++| T Consensus 622 ~i~~G~~~~~i~~~p~~pae~i~~~~~~~~~~~~~~e~~~~~~ 664 (666) T protein:vir:65 622 VIDRNEFVASMFIKPAKSINYIMLNFTAVATGSDFDEIIGPAN 664 (666) T ss_pred HhhCCeEEEEEEEEecCCcceEEEEEEEeecCccHHHHHHHHh Confidence 9999999999999999999999999988866 6999999999 No 19 >protein:vir:80984 Length: 666 # NCBI annotation: gp18 tail sheath protein # Family: family:all:661 # MgeID: mge:1888 # MgeName: Phi1 # Cross-refs: genbank:acc:YP_001469499;genbank:gi:157311456;genbank:GeneID:5602117 Probab=100.00 E-value=6.1e-91 Score=515.18 Aligned_cols=400 Identities=17% Similarity=0.152 Sum_probs=301.7 Q ss_pred CCCccCCCeEEEEcCCCccCccccCccceEEEEccccccccccccccccccCcceeecchHHHHHHhcccchhhhHHHHH Q lcl|NC_021557. 1 MAATFHHGPEVIEHKDGVTVVRDVKSAVTYVNGTAPIQDVHATALAREDYINKRVIIRSRAEGAAAFGVHKAGYTIPAAL 80 (419) Q Consensus 1 Ma~~~~hGVyv~e~~~~~~~i~~v~tav~~~Vgta~~a~~~~~~~~~~~~~n~pv~its~~e~~~~fg~~~~~~~l~~al 80 (419) |+ ++.|||||+|+ ++++++..+.|++.+|||.+++++ .|+|++|+|+.|+...||.......+..++ T Consensus 1 ~~-~~~Pgvyv~e~-~~~~~~~~~~t~~~~~vg~~~~gp-----------~~~p~~i~~~~~~~~~fg~~~~~~~~~~~~ 67 (666) T protein:vir:80 1 MT-LLSPGFETKET-TLSTTIVQSATGRAALVGKFQWGP-----------AFQIIQVTNEVELVNKFGQPDNNTADYFMS 67 (666) T ss_pred Cc-eecCceEEEEe-cCCccccccCcccceEEeccccCC-----------CccceEecCHHHHHHhcCCccCccchHHHH Confidence 66 45699999999 588999999999999999988765 489999999999999999887778888899 Q ss_pred HHHhhccCCcEEEEeeccccccccc------------ccc------------------------------c--------- Q lcl|NC_021557. 81 DAIFDQGDGGTIIVNNVFDPDVHKE------------GAN------------------------------P--------- 109 (419) Q Consensus 81 ~~~~~~~~~~~~v~~~~~~~~~~~~------------~~~------------------------------~--------- 109 (419) ..+|.++|..+++++.......... ..+ . T Consensus 68 ~~~f~~~g~~~~v~R~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~i~~~~~~~~~~~~t 147 (666) T protein:vir:80 68 GANFLQYGNDLRVVRVLNKEKAKNATALAGNIEFEITNEGSNYEVGDTIKIKHNRQDIETAGKVTKVDGDGKVKGVFIPT 147 (666) T ss_pred HHHHhcCCCeEEEEEecCccccccccccccceeEEEeeccccccccccccccccCcccccCcceEEEeecceeeeeecch Confidence 9999999999999886422110000 000 0 Q ss_pred ----------------cc----------------cccc-----------c--------------------------ceec Q lcl|NC_021557. 110 ----------------DP----------------SKVT-----------T--------------------------VDIN 120 (419) Q Consensus 110 ----------------~~----------------~~~t-----------~--------------------------~~~~ 120 (419) .. ..+. . .... T Consensus 148 a~~~~~a~~~~~~~~v~~~~~~~~~~~~~~~~~a~~V~~~~~~~~~~~~~~~~a~~~~t~~~~~~~~~~~~~~a~~a~~~ 227 (666) T protein:vir:80 148 GKIIAHAKAIGVYPELDGDWTAEFTSSSGNGSAALSVTKIVTDSGLLLTDLETSRANITNQTFLTKLQKYDMPAVSAIYA 227 (666) T ss_pred hhhccccccccccceeeccceeeeccccccceeeeeeeeeecCCccceeeeccccccccccccccccccccchhhhhhcc Confidence 00 0000 0 0000 Q ss_pred ccccc-----------------------------------------ccc------cccc----------------h---- Q lcl|NC_021557. 121 GTISP-----------------------------------------AGL------ASGF----------------S---- 133 (419) Q Consensus 121 g~~~~-----------------------------------------~~~------~tg~----------------~---- 133 (419) +.... ... ..+. . T Consensus 228 g~~g~~l~v~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~e~~~~~~~~~~~~~~~~~~ 307 (666) T protein:vir:80 228 GEIGNSLEVEILARSAFKNTAPDLTMYPYGGERTAARNLIPYAPQNDNQYAFIVRRDGVVVESYVLSTLKGDKDVYGNSI 307 (666) T ss_pred cccccceeeeeccccccccccccceeeeccccccccceeeeeccccccceeeEeccCCccceeeecccccccccccchhh Confidence 00000 000 0000 0 Q ss_pred hhhhh----h------------h---------------------------------hcc-------ccccccccchhhh- Q lcl|NC_021557. 134 GAYEC----Y------------N---------------------------------NFG-------YFPKLIIAPGYSP- 156 (419) Q Consensus 134 a~~~~----~------------~---------------------------------~~~-------~~p~~~~ap~~~~- 156 (419) .+.+. . + ..+ ....++++|++.. T Consensus 308 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~l~~p~~~~~ 387 (666) T protein:vir:80 308 YMDDFFGRGSSQYIYATAQGWVDGFSGIISLAGGVSANEATTGGVGADPFIGAMMQGWGLFAERESIHVNLLIAGACAGE 387 (666) T ss_pred hhhhhhccccceeeeecccccccccceEEEecCCCCcccccccccccccccccchhhhhhhhhhcccccceEeecCcCCc Confidence 00000 0 0 000 0012233444332 Q ss_pred ---hhhHHHHH-HHHhhccceeEEEEe--------ccCCCHHHHHhhhhhcc-----ccccCccceEEecceeEeecccc Q lcl|NC_021557. 157 ---AAAVRAEM-DVVASRLHALAIADL--------PLGLTKQQAVAARGVAG-----TANTSSARTVLTYPHVVIEDTTG 219 (419) Q Consensus 157 ---~~~v~a~l-~~~~~~~~~~~i~d~--------p~~~~~~~~~~~~~~~~-----~~~~~s~~~~~~~p~~~~~~~~~ 219 (419) ..+++.+| .+|.++.++|+++|. |.+.+.+++++||...+ ..+++|+|+++||||++++|+. T Consensus 388 ~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~vd~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~l~~p~~~~~d~~- 466 (666) T protein:vir:80 388 GDAFSTVQKHAVSIGDERQDCLVMVSPPRSTVVNIPVTTAIDNLIAWREGSGNYNENNMNINTTYAVIDGNYKYQYDKY- 466 (666) T ss_pred ccchHHHHHHHHHHHHhhcceEEEeecceeEEeecCCCCCHHHHHHHHHhcccchhhhcccCcceEEEEcCceEEeccc- Confidence 23454444 445556667766665 45678889999987543 3578999999999999999965 Q ss_pred ccceeeechHHHHHHHHHhhhhccCceecccCceeeceeecceecccccCCcchhhccccCCceEEEEEecCCcEEEEec Q lcl|NC_021557. 220 ATETRLDPLSSRLAGVIIATDLNEGWQNSPSNREIKGVVDLEVPINFYPSDYQNDTNFLNEAGIVTAMRSFATGIRVFGN 299 (419) Q Consensus 220 ~~~~~~~p~s~~vAg~~a~~D~~~g~~~span~~l~gv~~~~~~~~~~~~~~~~~~~~L~~~gI~~i~~~~~~G~~~wG~ 299 (419) ++..+++|||+++||++||+|.++|||+||+|+++.++.+.. .+ .....+.|++.||++|||+|++|+++|+++||+ T Consensus 467 ~~~~~~~p~sg~~AGl~Ar~D~~~g~~~sPan~~~~~i~g~~-~~--~~~~~~~~~~~Ln~~gIn~i~~~~g~G~~~wG~ 543 (666) T protein:vir:80 467 NDVNRWVPLAADIAGLCARTDAVSQPWMSPAGYNRGQIMNVV-KL--AIEPRKAHRDRLYQAAINPVIGAGGEGFILMGD 543 (666) T ss_pred CCceeEechHHHHHHHHHHHhhcCCceEccCCeecceeeccc-cc--eeecChhHHHhhhhCCeeEEEEeCCCeEEEEcc Confidence 577899999999999999999999999999999876666542 22 233356788999999999999999999999999 Q ss_pred cccCCCCCcccceeeehhhHHHHHHHHHHHHHHHhhcCCCCHHHHHHHHHHHHHHHHHHHhhc-ccceEEEEecccCCHH Q lcl|NC_021557. 300 RSAAFPTSSHVENFIHARRILDMIHEAIIFYTMNYVDRLGSPMTVEAAEEGVNAYLRSKTGIA-IYGGTFRFDRQKNTAE 378 (419) Q Consensus 300 rT~~~~s~~~~~~~i~vrR~~~~i~~~~~~~~~~~v~e~n~~~~~~~i~~~i~~~L~~l~~~g-~~~~~v~~d~~~n~~~ 378 (419) ||++.. +++|+||||||||+||+++|++.++|+|||||++.||.+|+++++.||++||++| ++||+|+||+++||++ T Consensus 544 rT~~~~--~s~~~~i~vRRl~~~i~~si~~~~~~~v~epn~~~l~~~i~~~i~~~L~~l~~~gal~g~~V~~d~~~nt~~ 621 (666) T protein:vir:80 544 KTATTV--PSPFDRINVRRLFNMLKKNIGDSSKYKLFENNDNFTRASFRMEVSQYLSTIRSLGGIYDFRVQCDTTNNTPD 621 (666) T ss_pred ccCCCC--CcccceeehhhHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhcCceeeeEEEEcCCCCCHH Confidence 999643 3479999999999999999999999999999999999999999999999999865 7899999999999999 Q ss_pred HhhCCEEEEEEEEEeccCceEEEEEEEEcch--HHHHHHHhcC Q lcl|NC_021557. 379 QIADGKFYYRLECHPISVMERITIDSYVDTK--FISNALSLAA 419 (419) Q Consensus 379 ~i~~G~~~~~v~~~p~~p~e~i~~~~~~~~~--~~~~~~~~~a 419 (419) +|++|+|+++|+++|++|+|||+|++++... .++++.++|| T Consensus 622 di~~G~~~~~i~~~P~~Pae~I~~~~~~~~~~~~~~e~~~~~~ 664 (666) T protein:vir:80 622 VIDRNEFVASMFIKPAKSINYIMLNFTAVATGSDFDEIIGPVN 664 (666) T ss_pred HhhCCeEEEEEEEEecCCcceEEEEEEEeecCccHHHHHHHHh Confidence 9999999999999999999999999988766 6899999999 No 20 >protein:vir:106427 Length: 679 # NCBI annotation: gp18 tail sheath protein # Family: family:all:661 # MgeID: mge:1474 # MgeName: Aeh1 # Cross-refs: genbank:acc:NP_944106;genbank:gi:38640150;genbank:GeneID:2658188 Probab=100.00 E-value=8.8e-91 Score=514.30 Aligned_cols=400 Identities=17% Similarity=0.164 Sum_probs=304.0 Q ss_pred CCCccCCCeEEEEcCCCccCccccCccceEEEEccccccccccccccccccCcceeecchHHHHHHhcccchhhhHHHHH Q lcl|NC_021557. 1 MAATFHHGPEVIEHKDGVTVVRDVKSAVTYVNGTAPIQDVHATALAREDYINKRVIIRSRAEGAAAFGVHKAGYTIPAAL 80 (419) Q Consensus 1 Ma~~~~hGVyv~e~~~~~~~i~~v~tav~~~Vgta~~a~~~~~~~~~~~~~n~pv~its~~e~~~~fg~~~~~~~l~~al 80 (419) |. +..|||||+|++ ++++|..+.|++.+|||.+++++ .|+|++|+|+.|+...||.......+..++ T Consensus 1 ~~-~~~Pgvyv~e~~-~~~~i~~~~t~~~~~vg~~~~gp-----------~~~p~~i~s~~~~~~~fg~~~~~~~~~~~~ 67 (679) T protein:vir:10 1 MT-LLSPGVETKEIN-LQTTIARSSTGRAALVGKFNWGP-----------AYQISQVVSEVDLVDKFGRPDDQTADSFFS 67 (679) T ss_pred Cc-eecCceEEEeec-CCcccccCccccceeeecccCCC-----------CccCEEecCHHHHHHHcCCcccccchHHHH Confidence 65 456999999995 89999999999999999988764 489999999999999999988888899999 Q ss_pred HHHhhccCCcEEEEeecccccccc---------------------------------------ccccc------------ Q lcl|NC_021557. 81 DAIFDQGDGGTIIVNNVFDPDVHK---------------------------------------EGANP------------ 109 (419) Q Consensus 81 ~~~~~~~~~~~~v~~~~~~~~~~~---------------------------------------~~~~~------------ 109 (419) ..+|.++|..|++++......... ..... T Consensus 68 ~~~f~~gg~~~~vvrv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~s~~~~~~~~~~~~~~~~~~~~v~~~ 147 (679) T protein:vir:10 68 GVNFLNYGNDLRLVRVLNETKSRNSSALYQSLSYTITSPGVDYKVGDVVNVLQGGNVIATGKVTVVNASGGIVAFYVPTA 147 (679) T ss_pred HHHHHhCCCeEEEEEccCcccccccccccccccccccccccccccccceeeeeCCCcccceeEEEeeccCceeeeeeccc Confidence 999999999999997632110000 00000 Q ss_pred ---------------cc------ccccc----------ceec-------------------------------------- Q lcl|NC_021557. 110 ---------------DP------SKVTT----------VDIN-------------------------------------- 120 (419) Q Consensus 110 ---------------~~------~~~t~----------~~~~-------------------------------------- 120 (419) .. ...+. ..+. T Consensus 148 ~~~~~a~~~~~~~~l~~a~~~~~~~~t~~~g~~~~~~v~~v~~~~~~~~~~~~~a~~~i~~~~~~~~t~~~~~~~~~~~~ 227 (679) T protein:vir:10 148 AIIDKAKSLNDYPALDNAWQIQFAAGGPGAGQAATATVVGINLDSTIFVPNDEYAMSAISERSETKRTFIDICEEMKVPA 227 (679) T ss_pred ccccccccccccceecccceeeeeeccccccceeeeeeeeeccCCceeeccccccccccccccccchhhhhhhhccccce Confidence 00 00000 0000 Q ss_pred ------c------------------cccc-----------c----cc--------------------------cccc--h Q lcl|NC_021557. 121 ------G------------------TISP-----------A----GL--------------------------ASGF--S 133 (419) Q Consensus 121 ------g------------------~~~~-----------~----~~--------------------------~tg~--~ 133 (419) + .... . +. ..+. + T Consensus 228 ~~A~~~g~~gn~i~v~~va~~~~~~~~~~~a~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vvv~~~g~~~~ 307 (679) T protein:vir:10 228 IVARYAGTYGDNIKVLMIAYKDYYKFNEAGKIVSVNTINPKVFPTGLDYGNVTPSSYLEFGPQNESQFAFIVFNNGVAVE 307 (679) T ss_pred eeeecccccCCcceEEEEeecccccccccccccccccccccccccccccccceeeeecccccccccceeeEEeccccccc Confidence 0 0000 0 00 0000 0 Q ss_pred -----h-------------hhh--------------------------------------------hhhhcc----cccc Q lcl|NC_021557. 134 -----G-------------AYE--------------------------------------------CYNNFG----YFPK 147 (419) Q Consensus 134 -----a-------------~~~--------------------------------------------~~~~~~----~~p~ 147 (419) . +.. ....+. ..++ T Consensus 308 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~gg~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 387 (679) T protein:vir:10 308 SKILSTKPGDRDIYGTSIYINEYFGNGYSSFVQGVAESWPVGYTGVLAFGGGQSSNTDISAAEFMKGWDMFADREHTDVN 387 (679) T ss_pred ceeeecccccccccchhhhhhhhhcCcccceeeeccccccccccceeeccCCccCCCccchhhhhhhhhhhhcccccccc Confidence 0 000 000000 0123 Q ss_pred ccccchhhh-----hhhHHHH-HHHHhhccceeEEEEeccCC--------CHHHHHhhhhhc--------cccccCccce Q lcl|NC_021557. 148 LIIAPGYSP-----AAAVRAE-MDVVASRLHALAIADLPLGL--------TKQQAVAARGVA--------GTANTSSART 205 (419) Q Consensus 148 ~~~ap~~~~-----~~~v~a~-l~~~~~~~~~~~i~d~p~~~--------~~~~~~~~~~~~--------~~~~~~s~~~ 205 (419) ++++|+... ...|+.+ +.+|.++.+||+++|+|.+. +.+++.+||... ...+++|.|+ T Consensus 388 ~l~~p~~~~~~~~~~~~v~~~l~~~~~~~~~~~ai~d~p~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~s~~~ 467 (679) T protein:vir:10 388 LFIAGAVAGEGAQIASTVQKAVVAIADERRDCLVLISPPREYMINQPAASVVRKLVDWRRGVNQAGISLDDNMNIGTTYA 467 (679) T ss_pred eEEecCCCCCchhhhHHHHHHHHHHHHhhCCeEEEEeccccccccccccchHHHHHHHHhhcccccchhhhhhccCcceE Confidence 344555432 2334444 46666667899999998543 346677777543 2457889999 Q ss_pred EEecceeEeeccccccceeeechHHHHHHHHHhhhhccCceecccCceeeceeecceecccccCCcchhhccccCCceEE Q lcl|NC_021557. 206 VLTYPHVVIEDTTGATETRLDPLSSRLAGVIIATDLNEGWQNSPSNREIKGVVDLEVPINFYPSDYQNDTNFLNEAGIVT 285 (419) Q Consensus 206 ~~~~p~~~~~~~~~~~~~~~~p~s~~vAg~~a~~D~~~g~~~span~~l~gv~~~~~~~~~~~~~~~~~~~~L~~~gI~~ 285 (419) ++||||++++|+. ++..+++|||+++||++||+|.++||||||+|+.+.+|.+.. ++.+. ..+.|++.||++|||+ T Consensus 468 ~~~~p~~~~~d~~-~~~~~~~p~sg~vAGl~Ar~D~~~g~~~sPan~~~~~i~g~~-~~~~~--~~~~~~~~Ln~~gin~ 543 (679) T protein:vir:10 468 SVDGNYKYQYDKY-NDVNRWIPLAADIAGLCARTDTVGQPWQSPAGFNRGQIVNVI-KLAVD--TRQAHRDEMYTNGINP 543 (679) T ss_pred EEEccceeeeccc-CCceEEechHHHHHHHHHHhhccCCcEECcCCeeeccccccc-cceee--cChhhHHhhhhCCceE Confidence 9999999999864 677899999999999999999999999999999987777653 23333 3567889999999999 Q ss_pred EEEecCCcEEEEeccccCCCCCcccceeeehhhHHHHHHHHHHHHHHHhhcCCCCHHHHHHHHHHHHHHHHHHHhhc-cc Q lcl|NC_021557. 286 AMRSFATGIRVFGNRSAAFPTSSHVENFIHARRILDMIHEAIIFYTMNYVDRLGSPMTVEAAEEGVNAYLRSKTGIA-IY 364 (419) Q Consensus 286 i~~~~~~G~~~wG~rT~~~~s~~~~~~~i~vrR~~~~i~~~~~~~~~~~v~e~n~~~~~~~i~~~i~~~L~~l~~~g-~~ 364 (419) |++|+++|+++||+||++.. +..|+|||+|||++||+++|++.++|+|||||++.+|.+|+++|++||++||++| ++ T Consensus 544 i~~~~g~G~~~wG~rT~~~~--~s~~~~i~vrR~~~~i~~si~~~~~~~v~epn~~~~~~~i~~~i~~fL~~l~~~gal~ 621 (679) T protein:vir:10 544 IVGFAGQGYILYGDKTASQA--PTPFDRINVRRLFNLLKKSISESAKYKLFELNDAFTRSSFRSEVGSYLDTIRSLGGIY 621 (679) T ss_pred EEEecCCeEEEEcccccCCC--CcccceEehhhHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhCCcee Confidence 99999999999999999643 3479999999999999999999999999999999999999999999999999865 78 Q ss_pred ceEEEEecccCCHHHhhCCEEEEEEEEEeccCceEEEEEEEEcch--HHHHHHHhcC Q lcl|NC_021557. 365 GGTFRFDRQKNTAEQIADGKFYYRLECHPISVMERITIDSYVDTK--FISNALSLAA 419 (419) Q Consensus 365 ~~~v~~d~~~n~~~~i~~G~~~~~v~~~p~~p~e~i~~~~~~~~~--~~~~~~~~~a 419 (419) ||+|+||+++||+++|++|+|+++|+++|++|+|||+|++++... +++++.++++ T Consensus 622 gf~v~~d~~~nt~~~i~~G~~~~~i~~~p~~pae~i~~~~~~~~~~~~~~e~~~~~~ 678 (679) T protein:vir:10 622 DFRVVCDESNNTPAVIDRNEFVATILIKPARSINYITLSFVATSTGADFDELVGSFQ 678 (679) T ss_pred eeEEEEcCCCCCHHHhhCCeEEEEEEEEecCCccEEEEEEEEeecCccHHHHHHHhc Confidence 999999999999999999999999999999999999999887554 7899999988 No 21 >protein:vir:98263 Length: 664 # NCBI annotation: gp18 tail sheath monomer # Family: family:all:661 # MgeID: mge:1667 # MgeName: RB43 # Cross-refs: genbank:acc:YP_239196;genbank:gi:66391671;genbank:GeneID:3416365 Probab=100.00 E-value=5.7e-91 Score=515.34 Aligned_cols=395 Identities=16% Similarity=0.126 Sum_probs=305.7 Q ss_pred CCCccCCCeEEEEcCCCccCccccCccceEEEEccccccccccccccccccCcceeecchHHHHHHhcccchhhhHHHHH Q lcl|NC_021557. 1 MAATFHHGPEVIEHKDGVTVVRDVKSAVTYVNGTAPIQDVHATALAREDYINKRVIIRSRAEGAAAFGVHKAGYTIPAAL 80 (419) Q Consensus 1 Ma~~~~hGVyv~e~~~~~~~i~~v~tav~~~Vgta~~a~~~~~~~~~~~~~n~pv~its~~e~~~~fg~~~~~~~l~~al 80 (419) |+ +..|||||+|++ ++++|..+.|++.+|||++++++ .|+|++|+|+.|+.+.||.......+..++ T Consensus 1 ma-~~~PgVyv~E~~-~~~~i~~~~ts~~~~vG~~~~Gp-----------~~~p~~i~s~~d~~~~fG~~~~~~~~~~~v 67 (664) T protein:vir:98 1 MA-LQSPGIETKETS-VQSTVVRNSTGRAAIVGKFSWGP-----------AYQIRQISNEVELVNYFGAPDNLTADYFMS 67 (664) T ss_pred Cc-eecCceEEEecC-CCcccccccccceEEEeeccCCC-----------CCccEEecCHHHHHHhcCCccccchhHHHH Confidence 99 668999999995 89999999999999999988765 489999999999999999988888999999 Q ss_pred HHHhhccCCcEEEEeeccccccc----------------------------------------ccccccc---------- Q lcl|NC_021557. 81 DAIFDQGDGGTIIVNNVFDPDVH----------------------------------------KEGANPD---------- 110 (419) Q Consensus 81 ~~~~~~~~~~~~v~~~~~~~~~~----------------------------------------~~~~~~~---------- 110 (419) ..+|.++|..|++++........ ....+.. T Consensus 68 ~~~f~ngg~~~~vvRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~gn~~~v~i~~ 147 (664) T protein:vir:98 68 AVNFLQYGNDLRLVRVVDKDAAKNASAIFNQIKTTIASQGSNYNVGDVIKVKYSNTVVEENGKVTQVDSNGKILAVTIPK 147 (664) T ss_pred HHHHHhcCCeEEEEEecCccccccccccccccceeeccCCcccccccceeEeecCcccccceeeccCCCCCceeeEeecc Confidence 99999999999999874321000 0000000 Q ss_pred -------------------------------cc------cc----c------cc-------------------------- Q lcl|NC_021557. 111 -------------------------------PS------KV----T------TV-------------------------- 117 (419) Q Consensus 111 -------------------------------~~------~~----t------~~-------------------------- 117 (419) .. .. . .. T Consensus 148 ~~~~~~~~~~~~~~~~~~~~~~~s~~~~s~g~a~a~~v~~v~~d~~~~~~~~~~a~~~i~~~~~~~~~~~~~~~~~~a~~ 227 (664) T protein:vir:98 148 RKKSLLVLNRSVLTQIFLLVGTTEIVSQSSGVSASITIDGIESDSGITLLNLDIAKETIQGTSFQTLTQKYQIPSVVALY 227 (664) T ss_pred Cccceeecccccccccceecccceeeeeecccceeeecccccccceeeccccceeeeccccccceeeeeccccceeeeee Confidence 00 00 0 00 Q ss_pred ----------eec-------c--------------c----------ccc------------------------------- Q lcl|NC_021557. 118 ----------DIN-------G--------------T----------ISP------------------------------- 125 (419) Q Consensus 118 ----------~~~-------g--------------~----------~~~------------------------------- 125 (419) .+. + . .+. T Consensus 228 ~G~~Gn~isv~i~s~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~ 307 (664) T protein:vir:98 228 PGELGSTVQVEIISKAAYDTGAMISGYPSGISVKNSGRSVMTYGPQTDNQYAFVVRRGGIVQESFIVSTDKTDKDIYGVN 307 (664) T ss_pred cccccceeeeeecccccccCcceEeeccCceecccceeeeeeccccCccceeEEEecCCceeeeEEeecccCcccceeee Confidence 000 0 0 000 Q ss_pred --------------------------------------------ccccccchhhhhhhhhccccccccccchhhhhh--- Q lcl|NC_021557. 126 --------------------------------------------AGLASGFSGAYECYNNFGYFPKLIIAPGYSPAA--- 158 (419) Q Consensus 126 --------------------------------------------~~~~tg~~a~~~~~~~~~~~p~~~~ap~~~~~~--- 158 (419) .+..+|++++. ......|+++++|+++..+ T Consensus 308 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~~~~~~g~~~~~tgl~~l~---~~~~~~~~ll~~p~~~~~~~~~ 384 (664) T protein:vir:98 308 IYMDDFFANGGSQYVFGTSMNWPKGFSGILEFGGGLSSNDTVGADELMTGWDMFA---DREALHVPLLIAGGCAGESVEI 384 (664) T ss_pred eechhheecccceeeeeecccCCcccceeEeccCccccccccCchhHHHHHHhhh---cccccccceEEecCCCCCcHHH Confidence 00000000000 0111235666677765442 Q ss_pred --hHHHHH-HHHhhccceeEEEEecc--------CCCHHHHHhhhhh---------ccccccCccceEEecceeEeeccc Q lcl|NC_021557. 159 --AVRAEM-DVVASRLHALAIADLPL--------GLTKQQAVAARGV---------AGTANTSSARTVLTYPHVVIEDTT 218 (419) Q Consensus 159 --~v~a~l-~~~~~~~~~~~i~d~p~--------~~~~~~~~~~~~~---------~~~~~~~s~~~~~~~p~~~~~~~~ 218 (419) ++..+| .+|.++.++|+++|.|. +.+.+++++||.. +...+++|+|+++||||++++|+. T Consensus 385 ~~~v~~al~~~a~~~~~~~a~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~l~~p~~~~~d~~ 464 (664) T protein:vir:98 385 ASTVQKHVISIGDERQDCTVFVSPPRSLLVNIPLATAVDNIVEWRTGYKISGGTPVDNNLNVSSSYGFLDGNYKYQYDKY 464 (664) T ss_pred HHHHHHHHHHHHHhcCCeEEEEccccceeccCCccccHHHHHHHhhhccccccchhhhhcCCccceEEEEcCeEEEeccc Confidence 355555 45556668999999873 4567778888753 234678999999999999999865 Q ss_pred cccceeeechHHHHHHHHHhhhhccCceecccCceeeceeecceecccccCCcchhhccccCCceEEEEEecC-CcEEEE Q lcl|NC_021557. 219 GATETRLDPLSSRLAGVIIATDLNEGWQNSPSNREIKGVVDLEVPINFYPSDYQNDTNFLNEAGIVTAMRSFA-TGIRVF 297 (419) Q Consensus 219 ~~~~~~~~p~s~~vAg~~a~~D~~~g~~~span~~l~gv~~~~~~~~~~~~~~~~~~~~L~~~gI~~i~~~~~-~G~~~w 297 (419) ++..+++|||+++||++||+|.++||||||+|+++.|+.+.. ++.+.+ .+.|++.||++|||+|++|++ +|+++| T Consensus 465 -~~~~~~~p~sg~~AGl~A~~D~~~g~~~span~~~~~i~g~~-~~~~~~--~~~~~~~Ln~~gIn~i~~~~~~~G~~~w 540 (664) T protein:vir:98 465 -NDVNRWVPLAGDIAGLCVYTDSVANPWMSPAGYNRGQIRNCI-KLAIEP--RTAHRDAMYQVQINPVTGFAGGSGFVLY 540 (664) T ss_pred -CCceEEechHHHHHHHHHHhhhcCCcEECcCCceeeeeeccc-cceeec--ChhhHHHHHhCCCeEEEEeeCCCcEEEE Confidence 567899999999999999999999999999999887777653 333333 456888999999999999987 799999 Q ss_pred eccccCCCCCcccceeeehhhHHHHHHHHHHHHHHHhhcCCCCHHHHHHHHHHHHHHHHHHHhhc-ccceEEEEecccCC Q lcl|NC_021557. 298 GNRSAAFPTSSHVENFIHARRILDMIHEAIIFYTMNYVDRLGSPMTVEAAEEGVNAYLRSKTGIA-IYGGTFRFDRQKNT 376 (419) Q Consensus 298 G~rT~~~~s~~~~~~~i~vrR~~~~i~~~~~~~~~~~v~e~n~~~~~~~i~~~i~~~L~~l~~~g-~~~~~v~~d~~~n~ 376 (419) |+||++. +++.|+|||+||||+||+++|++.++|+|||||++.||++|+++|+.||++||++| ++||+|+||+++|| T Consensus 541 G~rT~~~--~~s~~~~i~vrR~~~~i~~si~~~~~~~v~epn~~~l~~~i~~~i~~~L~~l~~~gal~g~~V~~d~~~nt 618 (664) T protein:vir:98 541 GDKTLTS--VPSPFDRINVRRLFNMIKKDIGDNAKYKLFENNDDFTRASFRMDTGQYMTNIRALGGCYDYRVICDTTNNT 618 (664) T ss_pred cccccCC--CCcccceEeehhHHHHHHHHHHHHHHHhhcCCCCHHHHHHHHHHHHHHHHHHHhcCceeeeEEEEcCCCCC Confidence 9999963 23479999999999999999999999999999999999999999999999999865 88999999999999 Q ss_pred HHHhhCCEEEEEEEEEeccCceEEEEEEEEcchHHHHHHHhcC Q lcl|NC_021557. 377 AEQIADGKFYYRLECHPISVMERITIDSYVDTKFISNALSLAA 419 (419) Q Consensus 377 ~~~i~~G~~~~~v~~~p~~p~e~i~~~~~~~~~~~~~~~~~~a 419 (419) +++|++|+|+++|+++|++|+|||+|++++...+ ..|++++ T Consensus 619 ~~~i~~G~~~~~i~~~p~~pae~I~~~~~q~~~~--~~~~e~~ 659 (664) T protein:vir:98 619 PDVIDRNEFVATVYVKPPRSINYITLNFVATSTG--ADFDELV 659 (664) T ss_pred HHHhhCCeEEEEEEEEecCCcceEEEEEEEeecC--cchhHhc Confidence 9999999999999999999999999999988775 3555555 No 22 >protein:vir:103456 Length: 659 # NCBI annotation: tail sheath monomer # Family: family:all:661 # MgeID: mge:1542 # MgeName: RB32 # Cross-refs: genbank:acc:YP_803108;genbank:gi:116326388;genbank:GeneID:4405485 Probab=100.00 E-value=7.6e-90 Score=509.18 Aligned_cols=400 Identities=18% Similarity=0.161 Sum_probs=302.8 Q ss_pred CCCccCCCeEEEEcCCCccCccccCccceEEEEccccccccccccccccccCcceeecchHHHHHHhcccchhhhHHHHH Q lcl|NC_021557. 1 MAATFHHGPEVIEHKDGVTVVRDVKSAVTYVNGTAPIQDVHATALAREDYINKRVIIRSRAEGAAAFGVHKAGYTIPAAL 80 (419) Q Consensus 1 Ma~~~~hGVyv~e~~~~~~~i~~v~tav~~~Vgta~~a~~~~~~~~~~~~~n~pv~its~~e~~~~fg~~~~~~~l~~al 80 (419) |+ +..|||||+|++.+++++.. .|++.+|||++++++ .|+|++|+|+.||...||.......+..++ T Consensus 1 ~~-~~~PgVyv~e~~~~~~~~~~-~ts~~~fvG~~~~Gp-----------~~~p~~i~s~~d~~~~fG~~~~~~~~~~~v 67 (659) T protein:vir:10 1 MT-LLSPGIELKETTVQSTVVNN-STGTAALAGKFQWGP-----------AFQIKQVTNEVDLVNTFGQPTAETADYFMS 67 (659) T ss_pred Cc-eecCceEEEEecCCceeccc-CccceEEEecccCCC-----------CCccEEecCHHHHHHHcCCcCCCcchhHHH Confidence 76 44699999999999987754 799999999988775 489999999999999999998889999999 Q ss_pred HHHhhccCCcEEEEeecccccccc---------------------------ccccc------------------------ Q lcl|NC_021557. 81 DAIFDQGDGGTIIVNNVFDPDVHK---------------------------EGANP------------------------ 109 (419) Q Consensus 81 ~~~~~~~~~~~~v~~~~~~~~~~~---------------------------~~~~~------------------------ 109 (419) ..+|.+++..|++++......... ..... T Consensus 68 ~~~f~ngg~~~~vvRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~v~~vd~~~~~~~~~i~~ 147 (659) T protein:vir:10 68 AMNFLQYGNDLRVVRAVDRDTAKNSSPIAGNIEYTISTPGSNYAVGDKITVKYVSDAIETEGKITEVDTDGKIKKINIPT 147 (659) T ss_pred HHHHhhCCCeEEEEEccCcccccccccccccceeeEeecccccccccceeeeecCCCccccceeeEEecccccceeeecc Confidence 999999999999997632110000 00000 Q ss_pred -----------ccccc------------c--ccee----------------c---------------------------- Q lcl|NC_021557. 110 -----------DPSKV------------T--TVDI----------------N---------------------------- 120 (419) Q Consensus 110 -----------~~~~~------------t--~~~~----------------~---------------------------- 120 (419) ..... + .... . T Consensus 148 ~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~a~t~~~~~~~~~~~~~~~v~a~~~ 227 (659) T protein:vir:10 148 AKIIAKAKEVGEYPTLGSNWTAEISSSSSGLAAVITLGKIITDSGILLAEIENAEAAMTAVDFQANLKKYGIPGVVALYP 227 (659) T ss_pred cccccccccccccceeeeeeeeeeeeeccccceeeEEeeeecCCceeEEeeccccccccccccccceeeccccccccccc Confidence 00000 0 0000 0 Q ss_pred c---------------------------------------------cccc--------c-------------cccc--c- Q lcl|NC_021557. 121 G---------------------------------------------TISP--------A-------------GLAS--G- 131 (419) Q Consensus 121 g---------------------------------------------~~~~--------~-------------~~~t--g- 131 (419) + .... . +... + T Consensus 228 G~~g~~~tv~~~~~a~~~~~~~v~v~~~~~~~~~a~~~t~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~ 307 (659) T protein:vir:10 228 GELGDKIEIEIVSKADYAKGASALLPIYPGGGTRASTAKAVFGYGPQTDSQYAIIVRRNDAIVQSVVLSTKRGEKDIYDS 307 (659) T ss_pred ceecccceEEEechhhccccceeeeeeeeecccccccceeeeeeccccccchhhccccccceeeeeeeeccccccccccc Confidence 0 0000 0 0000 0 Q ss_pred ---chhh-----------------------------------------hhh---hh-hccccccccccchhhhh-----h Q lcl|NC_021557. 132 ---FSGA-----------------------------------------YEC---YN-NFGYFPKLIIAPGYSPA-----A 158 (419) Q Consensus 132 ---~~a~-----------------------------------------~~~---~~-~~~~~p~~~~ap~~~~~-----~ 158 (419) +... ... +. .-...++++++|++... . T Consensus 308 ~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~l~gg~~~~~~~~~~~~~~~~~~~~~~~~~~~~il~~p~~~~~~~~~~~ 387 (659) T protein:vir:10 308 NIYIDDFFAKGGSEYIFATAQNWPEGFSGILTLSGGLSSNAEVTAGDLMEAWDFFADRESVDVQLFIAGSCAGESLETAS 387 (659) T ss_pred hhhhhhhhccCcccEEEEeecccCCCccceeeecccccccccccchhHHHHHHHhhhccccceeEEEecCCCCcchhhhH Confidence 0000 000 00 00012445566665432 3 Q ss_pred hHHH-HHHHHhhccceeEEEEecc--------CCCHHHHHhhhhhcc-----ccccCccceEEecceeEeecccccccee Q lcl|NC_021557. 159 AVRA-EMDVVASRLHALAIADLPL--------GLTKQQAVAARGVAG-----TANTSSARTVLTYPHVVIEDTTGATETR 224 (419) Q Consensus 159 ~v~a-~l~~~~~~~~~~~i~d~p~--------~~~~~~~~~~~~~~~-----~~~~~s~~~~~~~p~~~~~~~~~~~~~~ 224 (419) .+.. ++.+|+++.++|+++|+|. +.+.+++.+||...+ ..+++|+|+++||||++++|+. ++..+ T Consensus 388 ~v~~al~~~~~~~~~~~~~~d~p~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~s~~~~l~~p~~~~~d~~-~~~~~ 466 (659) T protein:vir:10 388 TVQKHVVSIGDARQDCLVLCSPPRETVVGIPVTRAVDNLVNWRTAAGSYTDNNFNISSTYAAIDGNYKYQYDKY-NDVNR 466 (659) T ss_pred HHHHHHHHHHHhhCCeEEEEcCccccccCCCcccCHHHHHHHHHhcccccccccccCcceEEEEeCcEEEeccc-CCceE Confidence 3444 4566666778999999884 356678888886543 3468899999999999999964 56789 Q ss_pred eechHHHHHHHHHhhhhccCceecccCceeeceeecceecccccCCcchhhccccCCceEEEEEecCCcEEEEeccccCC Q lcl|NC_021557. 225 LDPLSSRLAGVIIATDLNEGWQNSPSNREIKGVVDLEVPINFYPSDYQNDTNFLNEAGIVTAMRSFATGIRVFGNRSAAF 304 (419) Q Consensus 225 ~~p~s~~vAg~~a~~D~~~g~~~span~~l~gv~~~~~~~~~~~~~~~~~~~~L~~~gI~~i~~~~~~G~~~wG~rT~~~ 304 (419) ++|||+++||++||+|.++||||||+|+++.++.+... . .....+.|++.||++|||+|++++++|+++||+||++. T Consensus 467 ~~p~sg~~AGl~Ar~D~~~g~~~span~~~~~i~g~~~-~--~~~~~~~~~~~Ln~~gIn~i~~~~~~G~~~wG~rT~~~ 543 (659) T protein:vir:10 467 WVPLAADIAGLCARTDNVSQTWMSPAGYNRGQILNVIK-L--AIETRQAQRDRLYQEAINPVTGTGGDGYVLYGDKTATS 543 (659) T ss_pred EechHHHHHHHHHHHhccCCceEccCCceeeeeecccc-c--eecCCHhHHHHHhhCCeeEEEEeCCCeEEEEcccccCC Confidence 99999999999999999999999999998777776532 2 23345678999999999999999999999999999964 Q ss_pred CCCcccceeeehhhHHHHHHHHHHHHHHHhhcCCCCHHHHHHHHHHHHHHHHHHHhhc-ccceEEEEecccCCHHHhhCC Q lcl|NC_021557. 305 PTSSHVENFIHARRILDMIHEAIIFYTMNYVDRLGSPMTVEAAEEGVNAYLRSKTGIA-IYGGTFRFDRQKNTAEQIADG 383 (419) Q Consensus 305 ~s~~~~~~~i~vrR~~~~i~~~~~~~~~~~v~e~n~~~~~~~i~~~i~~~L~~l~~~g-~~~~~v~~d~~~n~~~~i~~G 383 (419) + +..|+|||+|||++||+++|++.++|+|||||++.||++|+++|+.||++||++| +++|+|+||+++||+++|++| T Consensus 544 ~--~s~~~~i~vrR~~~~i~~si~~~~~~~v~e~n~~~l~~~i~~~i~~fL~~l~~~gal~~~~V~~d~~~nt~~~i~~G 621 (659) T protein:vir:10 544 V--PSPFDRINVRRLFNMLKTNIGRSSKYRLFELNNAFTRSSFRTETAQYLQGIKALGGIYEYRVVCDTTNNTPSVIDRN 621 (659) T ss_pred C--CcccceEehhhHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhcCceeeEEEEEcCCCCCHHHhhCC Confidence 3 3479999999999999999999999999999999999999999999999999865 789999999999999999999 Q ss_pred EEEEEEEEEeccCceEEEEEEEEcchH--HHHHHHhcC Q lcl|NC_021557. 384 KFYYRLECHPISVMERITIDSYVDTKF--ISNALSLAA 419 (419) Q Consensus 384 ~~~~~v~~~p~~p~e~i~~~~~~~~~~--~~~~~~~~a 419 (419) +|+++|+++|++|+|||+|++++.... ++++.+++- T Consensus 622 ~~~~~i~~~p~~pae~i~~~~~~~~~~~~~~e~~~~~~ 659 (659) T protein:vir:10 622 EFVATFYIQPARSINYITLNFVATATGADFDELTGLAG 659 (659) T ss_pred eEEEEEEEEecCCcceEEEEEEEEecCcchHHhhccCC Confidence 999999999999999999999988554 444444444 No 23 >protein:vir:108052 Length: 660 # NCBI annotation: gp18 tail sheath protein # Family: family:all:661 # MgeID: mge:2002 # MgeName: JS98 # Cross-refs: genbank:acc:YP_001595294;genbank:gi:161622600;genbank:GeneID:5783771 Probab=100.00 E-value=8.1e-90 Score=509.01 Aligned_cols=400 Identities=19% Similarity=0.157 Sum_probs=301.7 Q ss_pred CCCccCCCeEEEEcCCCccCccccCccceEEEEccccccccccccccccccCcceeecchHHHHHHhcccchhhhHHHHH Q lcl|NC_021557. 1 MAATFHHGPEVIEHKDGVTVVRDVKSAVTYVNGTAPIQDVHATALAREDYINKRVIIRSRAEGAAAFGVHKAGYTIPAAL 80 (419) Q Consensus 1 Ma~~~~hGVyv~e~~~~~~~i~~v~tav~~~Vgta~~a~~~~~~~~~~~~~n~pv~its~~e~~~~fg~~~~~~~l~~al 80 (419) |+ +..|||||+|++ ++++|..+.|++.+|||.+++++ .|+|++|+|+.|+...||.......+..++ T Consensus 1 ~~-~~~Pgvyv~e~~-~~~~i~~~~t~~~~~vg~~~~gp-----------~~~p~~v~s~~~~~~~fg~~~~~~~~~~~~ 67 (660) T protein:vir:10 1 MA-LLSPGIELKETS-VQSTVVRNATGRAALVGKFQWGP-----------AFQVTQITNEVELVDLFGGPNNEVADYFMS 67 (660) T ss_pred Cc-eecCceEEEeec-CCccccCCCcccceEEeecCCCC-----------CccCeEcCCHHHHHHHcCCcCCCchhHHHH Confidence 55 457999999995 88999999999999999988775 488999999999999999988888888999 Q ss_pred HHHhhccCCcEEEEeeccccccc---------------------------ccccc-------------cc---------- Q lcl|NC_021557. 81 DAIFDQGDGGTIIVNNVFDPDVH---------------------------KEGAN-------------PD---------- 110 (419) Q Consensus 81 ~~~~~~~~~~~~v~~~~~~~~~~---------------------------~~~~~-------------~~---------- 110 (419) ..+|.++|..|++++........ ..... .. T Consensus 68 ~~~f~~~g~~~~vvrv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~v~~~~~~~~~~~~v~~~~a~g~~~~~~~~t 147 (660) T protein:vir:10 68 GMNFLQYGNDLRTVRVVSREFAKNASPIAGNIETTITTAGSNYAVGDKINIKYNQTVVESEGRVTSVDTDGKILSVFIPS 147 (660) T ss_pred HHHHHhCCceEEEEEecccccccccccccccceeEEeeccccccccceeeEeeccccccccccceeeccccceeeecccc Confidence 99999999999998763322100 00000 00 Q ss_pred ------------------------------cc---ccc-----------c----------------------cee----c Q lcl|NC_021557. 111 ------------------------------PS---KVT-----------T----------------------VDI----N 120 (419) Q Consensus 111 ------------------------------~~---~~t-----------~----------------------~~~----~ 120 (419) .. .+. . ... . T Consensus 148 a~~~~~a~~v~~~~~~~~~~~~~~~~~~~~~~~a~sv~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~ 227 (660) T protein:vir:10 148 AKIIAYARSLNQYPTLGPAWTAEVTSASSGVSGTITVGKIVTDSGILLTEAENSEEAITSLEFQAALKKFAMPGVVALYP 227 (660) T ss_pred ccccccccccccccccccceeEEEecccCccccceeeeeeeccCcceEEeeeccccccccccceeeccccccceeeeecc Confidence 00 000 0 000 0 Q ss_pred ccc-------------------------ccccc--------------c----------ccc--h---------------- Q lcl|NC_021557. 121 GTI-------------------------SPAGL--------------A----------SGF--S---------------- 133 (419) Q Consensus 121 g~~-------------------------~~~~~--------------~----------tg~--~---------------- 133 (419) +.. ..... . .+. + T Consensus 228 g~~G~~i~v~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~~~~~~~~~~~~~~~~~~ 307 (660) T protein:vir:10 228 GEIGSTLEVEIVSKAAYEAGSSKMLDVYPGGGTRASIAKAVFNYGPQTDDQYAIIVRRDGAIVESVVLSTKEGEKDVYGN 307 (660) T ss_pred cccCcceeEEEeeccccCCcceeEEeeeeccceeeEEeeeecccccccccccccccccCCcccceeeeeccccccccccc Confidence 000 00000 0 000 0 Q ss_pred ---h---h-h------------------------------------------hhhh-hccccccccccchhhh-----hh Q lcl|NC_021557. 134 ---G---A-Y------------------------------------------ECYN-NFGYFPKLIIAPGYSP-----AA 158 (419) Q Consensus 134 ---a---~-~------------------------------------------~~~~-~~~~~p~~~~ap~~~~-----~~ 158 (419) . + . +.+. .....++++++|++.. .. T Consensus 308 ~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~l~gg~~~~~~~t~~~~~~~~~~~~~~~~~~~~~l~~p~~~~~~~~~~~ 387 (660) T protein:vir:10 308 NIYLDDYFAKGTSNYIYATSLNWPKGFSGIINLSGGISANDKVTAGDLMQGWDLFADREALHINLLIAGAVAGEGDEVAS 387 (660) T ss_pred eeeeehhhcCCCccEEEEEeccCCCCcccceeeeccccCccccccchhhhhhhhhhhhhhcccceEEEcCcCCCchhhhH Confidence 0 0 0 0000 0001123334444332 23 Q ss_pred hHHHHHHH-HhhccceeEEEEeccC--------CCHHHHHhhhhhcc-----ccccCccceEEecceeEeecccccccee Q lcl|NC_021557. 159 AVRAEMDV-VASRLHALAIADLPLG--------LTKQQAVAARGVAG-----TANTSSARTVLTYPHVVIEDTTGATETR 224 (419) Q Consensus 159 ~v~a~l~~-~~~~~~~~~i~d~p~~--------~~~~~~~~~~~~~~-----~~~~~s~~~~~~~p~~~~~~~~~~~~~~ 224 (419) +++.+|.+ |.++.+||+++|+|.+ .+.+++.+||...+ ..+++|+|+++||||.+++|+. ++..+ T Consensus 388 ~v~~al~~~~~~~~~~~aiid~p~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~-~~~~~ 466 (660) T protein:vir:10 388 TVQKHVVSIADERQDCLAFISPPKGLLVNVPLTRAVDNLIDWRTGAGTFDANNMNISTTYAAIDGNYKYQYDKY-NDVNR 466 (660) T ss_pred HHHHHHHHHHHhhCCEEEEEecCcccccccccccCHHHHHHHHhhcccccccccccCcceEEEEcCceEEeccc-CCcee Confidence 45555555 4555679999999954 35678888886543 3468899999999999999964 56789 Q ss_pred eechHHHHHHHHHhhhhccCceecccCceeeceeecceecccccCCcchhhccccCCceEEEEEecC-CcEEEEeccccC Q lcl|NC_021557. 225 LDPLSSRLAGVIIATDLNEGWQNSPSNREIKGVVDLEVPINFYPSDYQNDTNFLNEAGIVTAMRSFA-TGIRVFGNRSAA 303 (419) Q Consensus 225 ~~p~s~~vAg~~a~~D~~~g~~~span~~l~gv~~~~~~~~~~~~~~~~~~~~L~~~gI~~i~~~~~-~G~~~wG~rT~~ 303 (419) ++|||+++||++||+|.++||||||+|+++.++.+.. ++ .....+.|++.||++|||+|++|++ +||++||+||++ T Consensus 467 ~~p~sg~~AGl~Ar~D~~~g~~~sPan~~~~~i~g~~-~~--~~~~~~~e~~~Ln~~gIn~i~~~~~~~G~~~wG~rT~~ 543 (660) T protein:vir:10 467 WVPLAADLAGLCARTDDVSQPWMSPAGYNRGQILNVL-KL--AIEPRQAQRDRMYQEAINPVVGFAGGDGFVLFGDKTAT 543 (660) T ss_pred EechhHHHHHHHHHhhccCCcEEccCCeeeceeeccc-ee--eecCChhhHHhHhhCCceEEEEeeCCCcEEEEcccccC Confidence 9999999999999999999999999999877666643 22 2334567889999999999999986 799999999985 Q ss_pred CCCCcccceeeehhhHHHHHHHHHHHHHHHhhcCCCCHHHHHHHHHHHHHHHHHHHhhc-ccceEEEEecccCCHHHhhC Q lcl|NC_021557. 304 FPTSSHVENFIHARRILDMIHEAIIFYTMNYVDRLGSPMTVEAAEEGVNAYLRSKTGIA-IYGGTFRFDRQKNTAEQIAD 382 (419) Q Consensus 304 ~~s~~~~~~~i~vrR~~~~i~~~~~~~~~~~v~e~n~~~~~~~i~~~i~~~L~~l~~~g-~~~~~v~~d~~~n~~~~i~~ 382 (419) .+ +..|+||||||||+||+++|++.++|+|||||++.||++|+++++.||++||++| +.||+|+||+++||+++|++ T Consensus 544 ~~--~s~~~~i~vrR~~~~i~~si~~~~~~~v~epn~~~l~~~i~~~i~~fL~~l~~~gal~g~~V~~d~~~nt~~di~~ 621 (660) T protein:vir:10 544 KV--PSPMDHINVRRLFNMLKKNIGDASKYKLFELNDNFTRSSFRMEVSQYLDGIKALGGIYEGRVVCDTTVNTPAVIDR 621 (660) T ss_pred CC--CcccceEehhhHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEcCCCCCHHHhhC Confidence 43 2369999999999999999999999999999999999999999999999999865 78999999999999999999 Q ss_pred CEEEEEEEEEeccCceEEEEEEEEcchH--HHHHHHhcC Q lcl|NC_021557. 383 GKFYYRLECHPISVMERITIDSYVDTKF--ISNALSLAA 419 (419) Q Consensus 383 G~~~~~v~~~p~~p~e~i~~~~~~~~~~--~~~~~~~~a 419 (419) |+|+++|+++|++|+|||+|++++...+ ++++++++- T Consensus 622 G~~~~~i~~~P~~pae~I~~~~~~~~~~~~~~e~~~~~~ 660 (660) T protein:vir:10 622 NEFIANIYVKPARSINYITLNFVATSTGADFDELIGPLV 660 (660) T ss_pred CeEEEEEEEEecCCccEEEEEEEEeecCccHHHHhhhcC Confidence 9999999999999999999998887665 667777777 No 24 >protein:vir:7206 Length: 659 # NCBI annotation: gp18 tail sheath protein # Family: family:all:661 # MgeID: mge:142 # MgeName: T4 # Cross-refs: genbank:acc:NP_049780;genbank:gi:9632592;genbank:GeneID:1258597 Probab=100.00 E-value=5.3e-90 Score=510.03 Aligned_cols=400 Identities=19% Similarity=0.185 Sum_probs=302.3 Q ss_pred CCCccCCCeEEEEcCCCccCccccCccceEEEEccccccccccccccccccCcceeecchHHHHHHhcccchhhhHHHHH Q lcl|NC_021557. 1 MAATFHHGPEVIEHKDGVTVVRDVKSAVTYVNGTAPIQDVHATALAREDYINKRVIIRSRAEGAAAFGVHKAGYTIPAAL 80 (419) Q Consensus 1 Ma~~~~hGVyv~e~~~~~~~i~~v~tav~~~Vgta~~a~~~~~~~~~~~~~n~pv~its~~e~~~~fg~~~~~~~l~~al 80 (419) |. +..|||||+|++.+++++ .+.|++++|||++++++ .|+|++|+|+.||.+.||.......+..++ T Consensus 1 ~~-~~~PgVyvee~~~~~~~~-~~~ts~~~fvG~~~~Gp-----------~~~p~~i~s~~~~~~~fG~~~~~~~~~~~~ 67 (659) T protein:vir:72 1 MT-LLSPGIELKETTVQSTVV-NNSTGTAALAGKFQWGP-----------AFQIKQVTNEVDLVNTFGQPTAETADYFMS 67 (659) T ss_pred Cc-eecCceEEEEecCCcccc-cCCCcceEEEeecCCCC-----------CcccEEecCHHHHHHHcCCcCCCCchhHHH Confidence 65 446999999999998655 55999999999988775 488999999999999999988888889999 Q ss_pred HHHhhccCCcEEEEeecccccccccc---------------------------c----cc-------c------------ Q lcl|NC_021557. 81 DAIFDQGDGGTIIVNNVFDPDVHKEG---------------------------A----NP-------D------------ 110 (419) Q Consensus 81 ~~~~~~~~~~~~v~~~~~~~~~~~~~---------------------------~----~~-------~------------ 110 (419) ..+|.++|..|++++........... . +. + T Consensus 68 ~~~f~ngg~~~~vvRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~g~v~~~~~~~~~~~~~v~t 147 (659) T protein:vir:72 68 AMNFLQYGNDLRVVRAVDRDTAKNSSPIAGNIDYTISTPGSNYAVGDKITVKYVSDDIETEGKITEVDADGKIKKINIPT 147 (659) T ss_pred HHHHHhCCceEEEEEccCCcccccccccccccceeecccccccccceeeeeeeccccccccceEEEeeccccceeeeecc Confidence 99999999999998763311000000 0 00 0 Q ss_pred ---------------------------cc---c------c------ccceec---------------------------- Q lcl|NC_021557. 111 ---------------------------PS---K------V------TTVDIN---------------------------- 120 (419) Q Consensus 111 ---------------------------~~---~------~------t~~~~~---------------------------- 120 (419) .. . . ...... T Consensus 148 ~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~v~v~~~~~~~~~~v~~~~~a~~~~~~~~~v~~~~~~~~~a~~~ 227 (659) T protein:vir:72 148 GKNYAKAKEVGEYPTLGSNWTAEISSSSSGLAAVITLGKIITDSGILLAEIENAEAAMTAVDFQANLKKYGIPGVVALYP 227 (659) T ss_pred ccccccccccccccccccceeeEEeeccccccceEEEEEeecCcceeeeeccccchhhhcccccccccccccceeeeccc Confidence 00 0 0 000000 Q ss_pred ccccc-----------------------------cccc------------------------------------------ Q lcl|NC_021557. 121 GTISP-----------------------------AGLA------------------------------------------ 129 (419) Q Consensus 121 g~~~~-----------------------------~~~~------------------------------------------ 129 (419) +.... .... T Consensus 228 gt~g~~~tv~i~~~~~~~~~~~~~v~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 307 (659) T protein:vir:72 228 GELGDKIEIEIVSKADYAKGASALLPIYPGGGTRASTAKAVFGYGPQTDSQYAIIVRRNDAIVQSVVLSTKRGEKDIYDS 307 (659) T ss_pred cccccceeEEEccccccccceeeeeecccccccccccceeeeeeecccccccceeeecccceeeeeeeeeccccccccch Confidence 00000 0000 Q ss_pred ----------cc-----------------c-------hh--------hhhhhhh----ccccccccccchhhhh-----h Q lcl|NC_021557. 130 ----------SG-----------------F-------SG--------AYECYNN----FGYFPKLIIAPGYSPA-----A 158 (419) Q Consensus 130 ----------tg-----------------~-------~a--------~~~~~~~----~~~~p~~~~ap~~~~~-----~ 158 (419) .+ . .. +...... -...++++++|++... . T Consensus 308 ~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~l~gg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~p~~~~~~~~~~~ 387 (659) T protein:vir:72 308 NIYIDDFFAKGGSEYIFATAQNWPEGFSGILTLSGGLSSNAEVTAGDLMEAWDFFADRESVDVQLFIAGSCAGESLETAS 387 (659) T ss_pred hhhhhhhhhcCCceEEEEEecccCCcccccccccccccccccccchhHHHHHHHhhhccccceeEEEecCCCCcchhhhH Confidence 00 0 00 0000000 0112455566665432 2 Q ss_pred hHHHH-HHHHhhccceeEEEEecc--------CCCHHHHHhhhhhc-----cccccCccceEEecceeEeecccccccee Q lcl|NC_021557. 159 AVRAE-MDVVASRLHALAIADLPL--------GLTKQQAVAARGVA-----GTANTSSARTVLTYPHVVIEDTTGATETR 224 (419) Q Consensus 159 ~v~a~-l~~~~~~~~~~~i~d~p~--------~~~~~~~~~~~~~~-----~~~~~~s~~~~~~~p~~~~~~~~~~~~~~ 224 (419) .+..+ +.+|.++.++|+++|+|. +.+.+++.+||... ...+++|+|+++||||++++|+. ++..+ T Consensus 388 ~v~~~l~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~-~~~~~ 466 (659) T protein:vir:72 388 TVQKHVVSIGDARQDCLVLCSPPRETVVGIPVTRAVDNLVNWRTAAGSYTDNNFNISSTYAAIDGNHKYQYDKY-NDVNR 466 (659) T ss_pred HHHHHHHHHHhhhCCEEEEEcCccccccCCCcccCHHHHHHHHhhccccccccccccceeEEEEcCceeecccc-CCceE Confidence 34454 455666678999999884 45567888888653 23578899999999999999864 56789 Q ss_pred eechHHHHHHHHHhhhhccCceecccCceeeceeecceecccccCCcchhhccccCCceEEEEEecCCcEEEEeccccCC Q lcl|NC_021557. 225 LDPLSSRLAGVIIATDLNEGWQNSPSNREIKGVVDLEVPINFYPSDYQNDTNFLNEAGIVTAMRSFATGIRVFGNRSAAF 304 (419) Q Consensus 225 ~~p~s~~vAg~~a~~D~~~g~~~span~~l~gv~~~~~~~~~~~~~~~~~~~~L~~~gI~~i~~~~~~G~~~wG~rT~~~ 304 (419) ++|||+++||++||+|.++||||||+|+++.++.++.. .....++.|++.||++|||+|++++++|+++||+||++. T Consensus 467 ~~p~sg~vAGl~Ar~D~~~G~~~span~~~~~i~g~~~---~~~~~~~~~~~~Ln~~gIn~i~~~~g~G~~~wG~rT~~~ 543 (659) T protein:vir:72 467 WVPLAADIAGLCARTDNVSQTWMSPAGYNRGQILNVIK---LAIETRQAQRDRLYQEAINPVTGTGGDGYVLYGDKTATS 543 (659) T ss_pred EechHHHHHHHHHHhhccCCcEEccCCeeeceeecccc---ccccCChhHHHHHhhCCceEEEEecCCeEEEEcccccCC Confidence 99999999999999999999999999999877777532 233445778899999999999999999999999999964 Q ss_pred CCCcccceeeehhhHHHHHHHHHHHHHHHhhcCCCCHHHHHHHHHHHHHHHHHHHhhc-ccceEEEEecccCCHHHhhCC Q lcl|NC_021557. 305 PTSSHVENFIHARRILDMIHEAIIFYTMNYVDRLGSPMTVEAAEEGVNAYLRSKTGIA-IYGGTFRFDRQKNTAEQIADG 383 (419) Q Consensus 305 ~s~~~~~~~i~vrR~~~~i~~~~~~~~~~~v~e~n~~~~~~~i~~~i~~~L~~l~~~g-~~~~~v~~d~~~n~~~~i~~G 383 (419) + +..|+||++|||++||+++|++.++|+|||||++.||++|+++|+.||++||++| +++|+|+||+++||+++|++| T Consensus 544 ~--~s~~~~i~vrR~~~~i~~si~~~~~~~v~e~n~~~l~~~i~~~i~~fL~~l~~~gal~~~~V~~d~~~nt~~~i~~G 621 (659) T protein:vir:72 544 V--PSPFDRINVRRLFNMLKTNIGRSSKYRLFELNNAFTRSSFRTETAQYLQGNKALGGIYEYRVVCDTTNNTPSVIDRN 621 (659) T ss_pred C--CcccceEeehhHHHHHHHHHHHHHHHhhcCCCCHHHHHHHHHHHHHHHHHHHhcCceeeEEEEEcCCCCCHHHhhCC Confidence 3 2379999999999999999999999999999999999999999999999999865 789999999999999999999 Q ss_pred EEEEEEEEEeccCceEEEEEEEEcch--HHHHHHHhcC Q lcl|NC_021557. 384 KFYYRLECHPISVMERITIDSYVDTK--FISNALSLAA 419 (419) Q Consensus 384 ~~~~~v~~~p~~p~e~i~~~~~~~~~--~~~~~~~~~a 419 (419) +|+++|+++|++|+|||+|+|++... +++++.+++- T Consensus 622 ~~~~~i~~~p~~pae~I~~~~~~~~~~~~~~e~~~~~~ 659 (659) T protein:vir:72 622 EFVATFYIQPARSINYITLNFVATATGADFDELTGLAG 659 (659) T ss_pred eEEEEEEEEecCCccEEEEEEEEeecCcchHHhcccCC Confidence 99999999999999999999887544 4666666655 No 25 >protein:vir:104858 Length: 729 # NCBI annotation: T4-like tail sheath protein # Family: family:all:661 # MgeID: mge:1630 # MgeName: P-SSM2 # Cross-refs: genbank:acc:YP_214361;genbank:gi:61806001;genbank:GeneID:3294378 Probab=100.00 E-value=1.3e-89 Score=507.93 Aligned_cols=397 Identities=17% Similarity=0.114 Sum_probs=294.7 Q ss_pred CCC-ccCCCeEEEEcCCCccCccccCccceEEEEccccccccccccccccccCcceeecchHHHHHHhcccc--hhhhHH Q lcl|NC_021557. 1 MAA-TFHHGPEVIEHKDGVTVVRDVKSAVTYVNGTAPIQDVHATALAREDYINKRVIIRSRAEGAAAFGVHK--AGYTIP 77 (419) Q Consensus 1 Ma~-~~~hGVyv~e~~~~~~~i~~v~tav~~~Vgta~~a~~~~~~~~~~~~~n~pv~its~~e~~~~fg~~~--~~~~l~ 77 (419) ||. +..|||||+|++.++++|..+.|++++|||++++++ .|+|++|+|+.||.+.||... ....++ T Consensus 1 m~~~~~~PgVyv~e~~~~~~~i~~v~ts~~~fvG~~~~Gp-----------~~~p~~i~s~~~~~~~fG~~~~~~~~~~~ 69 (729) T protein:vir:10 1 MPLNLASPGIVVREVDLTIGRVDPTSGSIGALVAPFAKGP-----------VNDPQLIESEEDLLQTFGQPYSTDKHYEY 69 (729) T ss_pred CCccccCCceEEEEecCCCcccccccccceeEEeccccCC-----------CccCeEcCCHHHHHHHcCccccCCcchhH Confidence 995 567999999999999999999999999999988764 489999999999999999853 345567 Q ss_pred HHHHHHhhccCCcEEEEeeccccccc----------------------------------------c----ccc------ Q lcl|NC_021557. 78 AALDAIFDQGDGGTIIVNNVFDPDVH----------------------------------------K----EGA------ 107 (419) Q Consensus 78 ~al~~~~~~~~~~~~v~~~~~~~~~~----------------------------------------~----~~~------ 107 (419) .++..+|.++|..|++++........ . ... T Consensus 70 ~~~~~~f~ngg~~~~vvRv~~~~~~~a~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~G~~gn~ 149 (729) T protein:vir:10 70 WMVASSYLAYGGTMQVVRADDYNTQTGVGLKNAFVTGGVGVGATMLRITSNTHYNQLGYDENTISGVSVAAKNPGTWANG 149 (729) T ss_pred HHHHHHHHhCCceEEEEecCcccccccccccccccccccccccccccccccccccccccccCCCcceEEEEeccCccccc Confidence 88999999999999999863210000 0 000 Q ss_pred -------------------------------------c-----------------cccc--c---cc----------cce Q lcl|NC_021557. 108 -------------------------------------N-----------------PDPS--K---VT----------TVD 118 (419) Q Consensus 108 -------------------------------------~-----------------~~~~--~---~t----------~~~ 118 (419) . .+.. . .. ... T Consensus 150 ~~v~v~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~s~~~~~~~~~~~~~ 229 (729) T protein:vir:10 150 IKVAIIDGKADQILTVASGNTTAVGSAVTQSISKTIGTATGTTTIDGVLKGIVTGSTDTTLEVKVISHISAAGVETAVEY 229 (729) T ss_pred eeeEEecccCcceeeeeccccccceeeeeeeccccccccccceeeeeeecccccccccccccceecccccccccceeccc Confidence 0 0000 0 00 000 Q ss_pred ecccc---------------------------------------ccc--------------------------------- Q lcl|NC_021557. 119 INGTI---------------------------------------SPA--------------------------------- 126 (419) Q Consensus 119 ~~g~~---------------------------------------~~~--------------------------------- 126 (419) ..... ... T Consensus 230 ~~~~~~~~~~~~s~~~~a~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~~d~~~~~~~d 309 (729) T protein:vir:10 230 QQNGTYTFDNSGSVNVIAAGSSGSGSAKSYTAQTDWFESQNIVLSNSTLEWDSIADAPGTSTYVSTRGGKNDEIHVLVID 309 (729) T ss_pred cccceeeecccCccceeeeccccccccccceeeeccccccccccccccccccccccccccccccccccccccccceeeec Confidence 00000 000 Q ss_pred ------ccc-------ccchhhhh-------------h------------------------------------------ Q lcl|NC_021557. 127 ------GLA-------SGFSGAYE-------------C------------------------------------------ 138 (419) Q Consensus 127 ------~~~-------tg~~a~~~-------------~------------------------------------------ 138 (419) +.. ..+..... . T Consensus 310 ~~~~~~~~~g~vve~~~~~s~~~~~~~~~~~~~~~~~vi~~~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~ 389 (729) T protein:vir:10 310 DKGTITGNSGTILEKHLSLSKAKDAEYSVGSSSYWRDFLATNSKYIFGGGATSGITTTGYSVSSTNTLDTDSGWDQNAEG 389 (729) T ss_pred cccccccCcccceeeeeeeeeccccccccccccccceeeccccceeeecccccccccccccccccceecccccccccccc Confidence 000 00000000 0 Q ss_pred -----------------------------------------hhhc------cccccccccch--hhhhhhHH-HHHHHHh Q lcl|NC_021557. 139 -----------------------------------------YNNF------GYFPKLIIAPG--YSPAAAVR-AEMDVVA 168 (419) Q Consensus 139 -----------------------------------------~~~~------~~~p~~~~ap~--~~~~~~v~-a~l~~~~ 168 (419) ...+ ...+-+...+. ......+. +.+.+|. T Consensus 390 ~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~g~~~l~~~~~~~~~~~~~~~~~~~~~~~~~v~~a~~~~~~ 469 (729) T protein:vir:10 390 VNFGASGVATLTLAGGTNYGDKTDLTTSGALSSGVDDIISGYTLFENTEEIEVDFILMGAAHHPKEQSQAVAEKVTAVAE 469 (729) T ss_pred ccccccceeEEEeecccccccccccccccccccchhHHHHHHHHhhcccccccceeeecCCCCCccchHHHHHHHHHHHH Confidence 0000 00000000000 01122343 3445566 Q ss_pred hccceeEEEEeccCCC-----------------HHHHHhhhhhccccccCccceEEecceeEeeccccccceeeechHHH Q lcl|NC_021557. 169 SRLHALAIADLPLGLT-----------------KQQAVAARGVAGTANTSSARTVLTYPHVVIEDTTGATETRLDPLSSR 231 (419) Q Consensus 169 ~~~~~~~i~d~p~~~~-----------------~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~~~~~~~~~~~~p~s~~ 231 (419) .+.++++++|.|.... ..++..++.. ..+++|+++||||++++|+. ++..+++|||++ T Consensus 470 ~~~~~~a~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~p~~~~~d~~-~~~~~~~p~s~~ 544 (729) T protein:vir:10 470 ARKDAVAFISPYRQAFLNDTVSGTVTVSNIDQTTENVVGFYAP----LSSSTYSVFDSGYKYMFDRF-NNTFRYVPLNGD 544 (729) T ss_pred hcCCeEEEecccccccccccccccccccccchhhHHHHHHHhh----ccCCceEEEEcCeeEEeccc-CCceEEechhHH Confidence 6778999999884422 2333444432 23577999999999999964 677899999999 Q ss_pred HHHHHHhhhhccCceecccCceeeceeecceecccccCCcchhhccccCCceEEEEEecCCcEEEEeccccCCCCCcccc Q lcl|NC_021557. 232 LAGVIIATDLNEGWQNSPSNREIKGVVDLEVPINFYPSDYQNDTNFLNEAGIVTAMRSFATGIRVFGNRSAAFPTSSHVE 311 (419) Q Consensus 232 vAg~~a~~D~~~g~~~span~~l~gv~~~~~~~~~~~~~~~~~~~~L~~~gI~~i~~~~~~G~~~wG~rT~~~~s~~~~~ 311 (419) +||++||+|.++||||||+|+++.||.+... + .....++|++.||++|||+|++|+++|+++||+||++ ++|+.| T Consensus 545 ~aGl~a~~d~~~g~~~span~~~~~i~g~~~-~--~~~~~~~~~~~Ln~~gIn~i~~~~~~G~~~wG~rT~~--~~d~~~ 619 (729) T protein:vir:10 545 IAGTCARTDIEQFPWFSPAGTARGPILNSVK-L--VYNPGKKQRDILYSNRINPVILSPGAGIILFGDKTGF--GKSSAF 619 (729) T ss_pred HHHHHHHhhccCCcEEccCCccccceecccc-e--eeecChhhHhhhhhCCceEEEEecCCeEEEEcceecC--CCCccc Confidence 9999999999999999999999888877543 2 2334567889999999999999999999999999995 467899 Q ss_pred eeeehhhHHHHHHHHHHHHHHHhhcCCCCHHHHHHHHHHHHHHHHHHHhhc-ccceEEEEecccCCHHHhhCCEEEEEEE Q lcl|NC_021557. 312 NFIHARRILDMIHEAIIFYTMNYVDRLGSPMTVEAAEEGVNAYLRSKTGIA-IYGGTFRFDRQKNTAEQIADGKFYYRLE 390 (419) Q Consensus 312 ~~i~vrR~~~~i~~~~~~~~~~~v~e~n~~~~~~~i~~~i~~~L~~l~~~g-~~~~~v~~d~~~n~~~~i~~G~~~~~v~ 390 (419) +||++|||++||+++|++.++|+|||||++.+|++|+++|++||++||++| ++||+|+||+++||+++|++|+|+++|+ T Consensus 620 ~~i~vrR~~~~i~~si~~~~~~~v~epn~~~~~~~i~~~i~~~L~~l~~~g~l~g~~v~~d~~~nt~~~i~~G~~~~~v~ 699 (729) T protein:vir:10 620 DRINVRRLFIYLEDAISAAAKDQLFEFNDELTRTNFVNIVEPFLRDVQAKRGIFDFVVICDETNNTAAVIDSNEFVADIF 699 (729) T ss_pred ceeehhhhHHHHHHHHHHHHHHhhcCCCCHHHHHHHHHHHHHHHHHHHhccceeeeEEEEcCCCCCHHHhhCCeEEEEEE Confidence 999999999999999999999999999999999999999999999999855 8899999999999999999999999999 Q ss_pred EEeccCceEEEEEEEEcch--HHHHHHHhc Q lcl|NC_021557. 391 CHPISVMERITIDSYVDTK--FISNALSLA 418 (419) Q Consensus 391 ~~p~~p~e~i~~~~~~~~~--~~~~~~~~~ 418 (419) ++|++|+|||+|++++... +|+++++++ T Consensus 700 ~~p~~p~e~i~~~~~~~~~~~~~~e~~~~~ 729 (729) T protein:vir:10 700 IKPARSINFIGLTFVATRTGVAFEEVIGSV 729 (729) T ss_pred EEecCCccEEEEEEEEeecCccHHHHHhcC Confidence 9999999999999988775 689999999 No 26 >protein:vir:5663 Length: 671 # NCBI annotation: tail sheath protein # Family: family:all:661 # MgeID: mge:119 # MgeName: KVP40 # Cross-refs: genbank:acc:NP_899602;genbank:gi:34419589;genbank:GeneID:2545776 Probab=100.00 E-value=2.1e-89 Score=506.77 Aligned_cols=398 Identities=15% Similarity=0.151 Sum_probs=299.2 Q ss_pred CCCccCCCeEEEEcCCCccCccccCccceEEEEccccccccccccccccccCcceeecchHHHHHHhcccchhhhHHHHH Q lcl|NC_021557. 1 MAATFHHGPEVIEHKDGVTVVRDVKSAVTYVNGTAPIQDVHATALAREDYINKRVIIRSRAEGAAAFGVHKAGYTIPAAL 80 (419) Q Consensus 1 Ma~~~~hGVyv~e~~~~~~~i~~v~tav~~~Vgta~~a~~~~~~~~~~~~~n~pv~its~~e~~~~fg~~~~~~~l~~al 80 (419) |. +..|||||+|++ ++++|..+.|++.+|||++++++ .|+|++|+|+.||.+.||.......+..++ T Consensus 1 ~~-~~~Pgvyv~e~~-~~~~i~~v~t~~~~fvG~~~~Gp-----------~~~p~~v~s~~~~~~~fG~~~~~~~~~~~v 67 (671) T protein:vir:56 1 MT-LLSPGIENKEIN-LASAIGRAATGRAAMVGKFEWGP-----------AYSITQVTSESDLVTIFGRPNDYTAASFMT 67 (671) T ss_pred Cc-eecCceEEEeec-CcccccccCcccceEEecccCCC-----------CccCEEcCCHHHHHHHcCCcCCCcchhHHH Confidence 66 446999999995 89999999999999999988765 489999999999999999988888899999 Q ss_pred HHHhhccCCcEEEEeeccccccccc------------------------------c---cc------------------- Q lcl|NC_021557. 81 DAIFDQGDGGTIIVNNVFDPDVHKE------------------------------G---AN------------------- 108 (419) Q Consensus 81 ~~~~~~~~~~~~v~~~~~~~~~~~~------------------------------~---~~------------------- 108 (419) ..+|.++|..|++++.......... . .. T Consensus 68 ~~~f~ngg~~~~vvrv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~ 147 (671) T protein:vir:56 68 ANNFLKYGNDLRLVRICDATTAQNATPLYNAVEYTIGASNGCVVGDDITITYSGVGALTAKGKVLEVDAGNNNAASKIFL 147 (671) T ss_pred HHHHHhcCCeEEEEEecCccccccchhhccccccccccCcceeeceeeeeecCcccccccCcceeEEeeeccceeeeeec Confidence 9999999999999986432110000 0 00 Q ss_pred -----------------cc-----c---------ccccc-------------------------------------ceec Q lcl|NC_021557. 109 -----------------PD-----P---------SKVTT-------------------------------------VDIN 120 (419) Q Consensus 109 -----------------~~-----~---------~~~t~-------------------------------------~~~~ 120 (419) .. . ..... ..+. T Consensus 148 ~~~~~v~~~~~~~~~~~~~~~t~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 227 (671) T protein:vir:56 148 PSAEIVAAAKSDGNYPSVGTITLQPTQGDIALTNIEIIDTGSVYFPNIELAFDALTAIETEGGALKYADLIEKQGFPRLS 227 (671) T ss_pred cceeEEEeeeccccccccccccccccccceeeeeecccccceEEEeccccccccccccccccccccchhhhhcccccccc Confidence 00 0 00000 0000 Q ss_pred ----cc------------cccc---cccc--------------------------------------------------- Q lcl|NC_021557. 121 ----GT------------ISPA---GLAS--------------------------------------------------- 130 (419) Q Consensus 121 ----g~------------~~~~---~~~t--------------------------------------------------- 130 (419) +. .... .... T Consensus 228 a~~~g~~g~~~~v~v~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~~~~~ 307 (671) T protein:vir:56 228 ARYVGDFGDAISVEIINYADYQTAFAFAAGHTLGDIELPIYPDGGTRSINLSSYFTFGPSNSNQYAVIVRVSGEVEEAFI 307 (671) T ss_pred cccccccCcceEEEEecccccccccccccceeeeeccccccccccccccccceeecccccccccceeEEeecCccceeEE Confidence 00 0000 0000 Q ss_pred ---------------------------------------------------------cchhhhhhhhhccccccccccch Q lcl|NC_021557. 131 ---------------------------------------------------------GFSGAYECYNNFGYFPKLIIAPG 153 (419) Q Consensus 131 ---------------------------------------------------------g~~a~~~~~~~~~~~p~~~~ap~ 153 (419) ...++..........|.++.+++ T Consensus 308 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~d~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~ 387 (671) T protein:vir:56 308 VSTNPGDKDVNGQSIFIDEYFENSGSAYITAIAEGWKTESGAYNFGGGSDANAGADDWMFGLDMLSDPEVLYTNLVIAGN 387 (671) T ss_pred EeecccccccchhhhhhhhhhcccCceEEEecCcccCCccccccccCccccccchhHHHHHHHhhhhccccceeEEEcCC Confidence 00000000000111122233333 Q ss_pred hhhh------hhHHHHHHHHhh-ccceeEEEEeccC--------CCHHHHHhhhhhc---------cccccCccceEEec Q lcl|NC_021557. 154 YSPA------AAVRAEMDVVAS-RLHALAIADLPLG--------LTKQQAVAARGVA---------GTANTSSARTVLTY 209 (419) Q Consensus 154 ~~~~------~~v~a~l~~~~~-~~~~~~i~d~p~~--------~~~~~~~~~~~~~---------~~~~~~s~~~~~~~ 209 (419) .... ..+.+++.+|++ +.++++++|.|.. .+.+++.+|+... ...+++|+|+++|| T Consensus 388 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~ 467 (671) T protein:vir:56 388 AAAEEVSIASTVQKYAIDSVGNVRQDCVVFVSPPQSLIVNKQAGTAVANIQGWRTGIDPTNGQAVVDNLNVSTTYAVIDG 467 (671) T ss_pred CCCccchhHHHHHHHHHHHHHhhcCCEEEEEeccccccccccccccHHHHHHHhhhccccchhhhhhhccCCcceEEEec Confidence 2222 123455666654 6689999998853 4566777777533 23568899999999 Q ss_pred ceeEeeccccccceeeechHHHHHHHHHhhhhccCceecccCceeeceeecceecccccCCcchhhccccCCceEEEEEe Q lcl|NC_021557. 210 PHVVIEDTTGATETRLDPLSSRLAGVIIATDLNEGWQNSPSNREIKGVVDLEVPINFYPSDYQNDTNFLNEAGIVTAMRS 289 (419) Q Consensus 210 p~~~~~~~~~~~~~~~~p~s~~vAg~~a~~D~~~g~~~span~~l~gv~~~~~~~~~~~~~~~~~~~~L~~~gI~~i~~~ 289 (419) ||++++|+. ++..+++|||+++||++||+|.++||||||+|+.+.++.+... +.+.+ .+.|++.||++|||+|+++ T Consensus 468 p~~~~~d~~-~~~~~~~p~s~~~AGl~Ar~D~~~g~~~span~~~~~i~g~~~-~~~~~--~~~~~~~Ln~~gIn~i~~~ 543 (671) T protein:vir:56 468 NYKYQYDKY-NDRNRWVPLAGDIAGLCAYTDQVSQPWMSPAGFNRGQIKGVNR-LAVDL--RRAHRDALYQIGINPVVGF 543 (671) T ss_pred CceEEeccc-CCceeEechHHHHHHHHHHhhccCCcEECcCCceecccccccc-ceeec--ChhHHHHHhhCCceEEEEe Confidence 999999965 5678999999999999999999999999999998766655432 33333 3457889999999999999 Q ss_pred cCCcEEEEeccccCCCCCcccceeeehhhHHHHHHHHHHHHHHHhhcCCCCHHHHHHHHHHHHHHHHHHHhhc-ccceEE Q lcl|NC_021557. 290 FATGIRVFGNRSAAFPTSSHVENFIHARRILDMIHEAIIFYTMNYVDRLGSPMTVEAAEEGVNAYLRSKTGIA-IYGGTF 368 (419) Q Consensus 290 ~~~G~~~wG~rT~~~~s~~~~~~~i~vrR~~~~i~~~~~~~~~~~v~e~n~~~~~~~i~~~i~~~L~~l~~~g-~~~~~v 368 (419) +++|+++||+||++. .+++|+||++|||++||+++|++.++|+|||||++.||++|+++|+.||++||++| ++||+| T Consensus 544 ~~~G~~~wG~rT~~~--~~~~~~~i~vrR~~~~i~~si~~~~~~~v~epn~~~~~~~i~~~i~~fL~~l~~~gal~g~~v 621 (671) T protein:vir:56 544 AGQGFVLYGDKTATQ--QASAFDRINVRRLFNLLKKAISDAAKYRLFELNDEFTRSSFKSEIDAYLTNIQDLGGVYDFRV 621 (671) T ss_pred cCCeEEEEcceecCC--CCcccceEehhhHHHHHHHHHHHHHHHhcCCCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEE Confidence 999999999999953 34589999999999999999999999999999999999999999999999999865 889999 Q ss_pred EEecccCCHHHhhCCEEEEEEEEEeccCceEEEEEEEEcchHHHHHHHhcC Q lcl|NC_021557. 369 RFDRQKNTAEQIADGKFYYRLECHPISVMERITIDSYVDTKFISNALSLAA 419 (419) Q Consensus 369 ~~d~~~n~~~~i~~G~~~~~v~~~p~~p~e~i~~~~~~~~~~~~~~~~~~a 419 (419) +||+++||+++|++|+|+++|+++|++|+|||+|++++.....+ |++++ T Consensus 622 ~~d~~~nt~~~i~~G~~~~~i~~~p~~Pae~I~~~~~~~~~~~~--f~e~~ 670 (671) T protein:vir:56 622 VCDETNNPGSVIDRNEFVASIYVKPAKSINFITLNFVATSTDAD--FAEII 670 (671) T ss_pred EEcCCCCCHHHhhCCeEEEEEEEEecCCcceEEEEEEEeecCcc--hhhhc Confidence 99999999999999999999999999999999999999888653 66666 No 27 >protein:vir:101187 Length: 663 # NCBI annotation: tail sheath protein # Family: family:all:661 # MgeID: mge:1582 # MgeName: 44RR2.8t # Cross-refs: genbank:acc:NP_932509;genbank:gi:37651635;genbank:GeneID:2610680 Probab=100.00 E-value=3e-89 Score=505.89 Aligned_cols=400 Identities=18% Similarity=0.135 Sum_probs=297.9 Q ss_pred CCCccCCCeEEEEcCCCccCccccCccceEEEEccccccccccccccccccCcceeecchHHHHHHhcccchhhhHHHHH Q lcl|NC_021557. 1 MAATFHHGPEVIEHKDGVTVVRDVKSAVTYVNGTAPIQDVHATALAREDYINKRVIIRSRAEGAAAFGVHKAGYTIPAAL 80 (419) Q Consensus 1 Ma~~~~hGVyv~e~~~~~~~i~~v~tav~~~Vgta~~a~~~~~~~~~~~~~n~pv~its~~e~~~~fg~~~~~~~l~~al 80 (419) |+ +..|||||+|+ +++++|..+.|++.+|||.+++++ .|+|++|+|+.|+...||.......+..++ T Consensus 1 ~~-~~~PgVyv~e~-~~~~~i~~v~ts~~~fvG~~~~Gp-----------~~~p~~i~s~~d~~~~fG~~~~~~~~~~~v 67 (663) T protein:vir:10 1 MA-LLSPGIEMKET-SINSTVVRSATGRAAIVGKFAWGP-----------AYEVRQVTNEVELVDMFGSPDNVTAPYFMS 67 (663) T ss_pred Cc-eecCceEEEEe-cCcccccccCccceeEEeeeccCC-----------CCccEEecCHHHHHHHhCCcCccchhHHHH Confidence 66 44699999999 599999999999999999988775 489999999999999999988888889999 Q ss_pred HHHhhccCCcEEEEeeccccccc---------------------------cccccc-------------cc--------- Q lcl|NC_021557. 81 DAIFDQGDGGTIIVNNVFDPDVH---------------------------KEGANP-------------DP--------- 111 (419) Q Consensus 81 ~~~~~~~~~~~~v~~~~~~~~~~---------------------------~~~~~~-------------~~--------- 111 (419) ..+|.++|..+++++........ ...... +. T Consensus 68 ~~~f~ngg~~~~vvRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~g~~~~v~~~~~~~~~~~~~~~~~~n~~~~~v~~~~ 147 (663) T protein:vir:10 68 AMNFLQYGNDLRLVRVIDMEKAKNASPLVNQVSVTITTEGQGYTVGDAITVKYNNATITEAGKVTAVDSDGKIKSLFVPT 147 (663) T ss_pred HHHHHhCCCeEEEEEccCCcccccccccCCcceeeeeccccCccccccccccccccccccccceeeeccCCceEEEEecc Confidence 99999999999999764221000 000000 00 Q ss_pred ------------------------------c-------------------------ccccce------------ec---- Q lcl|NC_021557. 112 ------------------------------S-------------------------KVTTVD------------IN---- 120 (419) Q Consensus 112 ------------------------------~-------------------------~~t~~~------------~~---- 120 (419) . ..+... +. T Consensus 148 a~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~v~~vv~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~a~~~ 227 (663) T protein:vir:10 148 AEIIAKTRQLGTYPTLGDNWRIDVSGASGGSAAALALGNIVVDSGVTFGNSEDAPAVMTSPAVMEKYAKFGMPLVSAVYP 227 (663) T ss_pred ccccccccccceeeeccccceeEeeeccccccccccccceecccceeeEeeccccccccccchhhhcccccceeeeeecc Confidence 0 000000 00 Q ss_pred cccccc-----------------------c-------------------------------------ccccc------h- Q lcl|NC_021557. 121 GTISPA-----------------------G-------------------------------------LASGF------S- 133 (419) Q Consensus 121 g~~~~~-----------------------~-------------------------------------~~tg~------~- 133 (419) |..... + ...+. . T Consensus 228 G~~Gn~i~v~i~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~~~~~~~ 307 (663) T protein:vir:10 228 GEIGSTVEVEIVSKTAFNSGAQQTIYPFGGTRTSNARGVIQYGPMTDDQFAIIVRRDGIVVESTVLSTRKGDRDVYGSNI 307 (663) T ss_pred cccccceeEEecccccccccccccccccccccccccceeeeeccccccceeEEEecCCcceeeeeeeecccccccchhhh Confidence 000000 0 00000 0 Q ss_pred hhhhh--hhh------------------------------------------c----cccccccccc--hh---hhhhhH Q lcl|NC_021557. 134 GAYEC--YNN------------------------------------------F----GYFPKLIIAP--GY---SPAAAV 160 (419) Q Consensus 134 a~~~~--~~~------------------------------------------~----~~~p~~~~ap--~~---~~~~~v 160 (419) ..... ... + ...+.+++++ +. .....| T Consensus 308 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~gg~d~~~~~~~~~~~~~~~~l~~~~~~~~~~~i~~~~~~~~~~~~~~v 387 (663) T protein:vir:10 308 FMDDYFRNGGSNFIFASSEGWPAGFTGIIQLGGGTSANADVGADELIKGWDLFSDREALHVNLMIAGACGSDGAEIASTV 387 (663) T ss_pred hhhhhhccCcceEEEEeecccCccccceeEcccccCCCccccchhhHHHHHhhhcccccceeEEEeccCCCCchhhHHHH Confidence 00000 000 0 0000111111 11 111345 Q ss_pred HHHHHH-HhhccceeEEEEeccCC--------CHHHHHhhhhhc--------cccccCccceEEecceeEeeccccccce Q lcl|NC_021557. 161 RAEMDV-VASRLHALAIADLPLGL--------TKQQAVAARGVA--------GTANTSSARTVLTYPHVVIEDTTGATET 223 (419) Q Consensus 161 ~a~l~~-~~~~~~~~~i~d~p~~~--------~~~~~~~~~~~~--------~~~~~~s~~~~~~~p~~~~~~~~~~~~~ 223 (419) +.+|.. |.++.++|+++|+|.+. +..++.+||... ...+++|+|+++||||++++|+. ++.. T Consensus 388 ~~al~~~a~~~~~~~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~l~~P~~~~~d~~-~~~~ 466 (663) T protein:vir:10 388 QKYVVSLADDRQDCVAIVNPPAELMVGIPTSTAVKNIVEWRNGMTGSGEVVDNNMNISSTYAFIIGNYKYQYDKY-NDIN 466 (663) T ss_pred HHHHHHHHHhhCCEEEEEecCcccccccccccchHHHHHHHHhccccccchhhccccCcceEEEEccceEEeccc-CCce Confidence 555544 44555799999999643 456677777543 24678999999999999999864 5678 Q ss_pred eeechHHHHHHHHHhhhhccCceecccCceeeceeecceecccccCCcchhhccccCCceEEEEEecC-CcEEEEecccc Q lcl|NC_021557. 224 RLDPLSSRLAGVIIATDLNEGWQNSPSNREIKGVVDLEVPINFYPSDYQNDTNFLNEAGIVTAMRSFA-TGIRVFGNRSA 302 (419) Q Consensus 224 ~~~p~s~~vAg~~a~~D~~~g~~~span~~l~gv~~~~~~~~~~~~~~~~~~~~L~~~gI~~i~~~~~-~G~~~wG~rT~ 302 (419) +++|||+++||++||+|.++||||||+|+.+.++.+.. .+. ....+.|++.||++|||+|+++++ +|+++||+||+ T Consensus 467 ~~~p~s~~vAGl~Ar~D~~~g~~~sPan~~~~~i~g~~-~~~--~~~~~~~~~~Ln~~gin~i~~~~~~~G~~~wG~rT~ 543 (663) T protein:vir:10 467 RWVPLAADIAGLCAYTDQVSHPWMSPAGYRRGQIRNCI-KLA--IEPKQSMRDTMYQVAINPVTGFAGGDGFVLFGDKMA 543 (663) T ss_pred EEechhHHHHHHHHHhhccCCceEccCCceeccccccc-cce--eccChhHHHHHhhCCceEEEEEeCCCcEEEEccccc Confidence 99999999999999999999999999999866555542 222 233567889999999999999987 79999999999 Q ss_pred CCCCCcccceeeehhhHHHHHHHHHHHHHHHhhcCCCCHHHHHHHHHHHHHHHHHHHhhc-ccceEEEEecccCCHHHhh Q lcl|NC_021557. 303 AFPTSSHVENFIHARRILDMIHEAIIFYTMNYVDRLGSPMTVEAAEEGVNAYLRSKTGIA-IYGGTFRFDRQKNTAEQIA 381 (419) Q Consensus 303 ~~~s~~~~~~~i~vrR~~~~i~~~~~~~~~~~v~e~n~~~~~~~i~~~i~~~L~~l~~~g-~~~~~v~~d~~~n~~~~i~ 381 (419) +..+ ..|+|||+||||+||+++|++.++|+|||||++.+|++|+++|+.||++||++| ++||+|+||+++||+++|+ T Consensus 544 s~~~--s~~~~i~vrR~~~~i~~si~~~~~~~v~e~n~~~l~~~i~~~i~~~L~~l~~~gal~g~~v~~d~~~nt~~~i~ 621 (663) T protein:vir:10 544 TQVP--SPFDRINVRRLFNMLKKNIGDTSKYELFENNDAFTRQSFRMETSQYLDGIRSLGGCYDFRVVCDTTNNTPNVID 621 (663) T ss_pred CCCC--cccceEehhhHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhcCceeeeEEEEcCCCCCHHHhh Confidence 6432 379999999999999999999999999999999999999999999999999865 8899999999999999999 Q ss_pred CCEEEEEEEEEeccCceEEEEEEEEcch--HHHHHHHhcC Q lcl|NC_021557. 382 DGKFYYRLECHPISVMERITIDSYVDTK--FISNALSLAA 419 (419) Q Consensus 382 ~G~~~~~v~~~p~~p~e~i~~~~~~~~~--~~~~~~~~~a 419 (419) +|+|+++|+++|++|+|||+|++++... .+++++++++ T Consensus 622 ~G~~~~~i~~~p~~pae~i~~~~~~~~~~~~~~e~~~~~~ 661 (663) T protein:vir:10 622 RNEFVGTIYVKPPRSINYITLNMVATSTGANFDELIGPMQ 661 (663) T ss_pred CCeEEEEEEEEecCCcceEEEEEEEeecCccHHHHHHHHh Confidence 9999999999999999999999987764 4888888888 No 28 >protein:vir:101804 Length: 663 # NCBI annotation: gp18 # Family: family:all:661 # MgeID: mge:1580 # MgeName: 31 # Cross-refs: genbank:acc:YP_238881;genbank:gi:66391956;genbank:GeneID:3416631 Probab=100.00 E-value=5.9e-89 Score=504.28 Aligned_cols=400 Identities=17% Similarity=0.115 Sum_probs=297.3 Q ss_pred CCCccCCCeEEEEcCCCccCccccCccceEEEEccccccccccccccccccCcceeecchHHHHHHhcccchhhhHHHHH Q lcl|NC_021557. 1 MAATFHHGPEVIEHKDGVTVVRDVKSAVTYVNGTAPIQDVHATALAREDYINKRVIIRSRAEGAAAFGVHKAGYTIPAAL 80 (419) Q Consensus 1 Ma~~~~hGVyv~e~~~~~~~i~~v~tav~~~Vgta~~a~~~~~~~~~~~~~n~pv~its~~e~~~~fg~~~~~~~l~~al 80 (419) |. ++.|||||+|++ ++++|..+.|++.+|||++++++ .|+|++|+|+.|+.+.||.......+..++ T Consensus 1 ~~-~~~Pgvyv~e~~-~~~~i~~~~t~~~~~vG~~~~Gp-----------~~~p~~v~~~~~~~~~fg~~~~~~~~~~~~ 67 (663) T protein:vir:10 1 MA-LLSPGIEMKETS-INSTVVRSATGRAAIVGKFAWGP-----------AYEVRQVTNEVELVDMFGSPDNVTAPYFMS 67 (663) T ss_pred Cc-eecCceEEEEec-CCccccccCcccceeEeecccCC-----------CCccEEecCHHHHHHhcCCcCCcchhHHHH Confidence 65 456999999995 89999999999999999988775 489999999999999999988888899999 Q ss_pred HHHhhccCCcEEEEeeccccccc---------------------------cccccc-------------cc--cc----- Q lcl|NC_021557. 81 DAIFDQGDGGTIIVNNVFDPDVH---------------------------KEGANP-------------DP--SK----- 113 (419) Q Consensus 81 ~~~~~~~~~~~~v~~~~~~~~~~---------------------------~~~~~~-------------~~--~~----- 113 (419) ..+|.++|..|++++........ ...... +. .. T Consensus 68 ~~~f~ngg~~~~vvRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~g~~i~v~~~~~~~~~~~~~~~~~~~~~~~~v~~~t 147 (663) T protein:vir:10 68 AMNFLQYGNDLRLVRVIDMEKAKNASPLVNQVSVTITTEGQGYTVGDAITVKYNNATITEAGKVTAVDSDGKIKSLFVPT 147 (663) T ss_pred HHHHHhCCCeEEEEEccCCccccccccccccceeEEeecccccccccccccccccccccccccceeeecccceEEEeecc Confidence 99999999999999864211000 000000 00 00 Q ss_pred ---------c-------------------------ccceecccc------------------------------------ Q lcl|NC_021557. 114 ---------V-------------------------TTVDINGTI------------------------------------ 123 (419) Q Consensus 114 ---------~-------------------------t~~~~~g~~------------------------------------ 123 (419) + ....+.... T Consensus 148 a~~~~~~~~v~~~~~~~~~~~~~~s~~s~~~~~a~~v~~v~~d~~~~v~~~~~a~~~~t~~~~~~~~~~~~~~~i~A~~~ 227 (663) T protein:vir:10 148 AEIIAKTRQLGTYPTLGDNWRIDVSGASGGSAAALALGNIVVDSGVTFGNSEDAPAVMTSPAVMEKYAKFGMPLISAVYP 227 (663) T ss_pred ccccccccccccceeeccceeeEeeeccCccccccccceeccccceEEeeccccccccccccccccccccccceEEeccC Confidence 0 000000000 Q ss_pred ------------ccccc--------------------------------------ccc-------chh------------ Q lcl|NC_021557. 124 ------------SPAGL--------------------------------------ASG-------FSG------------ 134 (419) Q Consensus 124 ------------~~~~~--------------------------------------~tg-------~~a------------ 134 (419) ..... ..+ +.. T Consensus 228 G~~Gn~i~V~i~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~~~~~~~ 307 (663) T protein:vir:10 228 GEIGSTVEVEIVSKTAFNSGAQQTIYPFGGTRTSNARGVIQYGPMTDDQFAIIVRRDGIVVESTVLSTRKGDRDVYGSNI 307 (663) T ss_pred CcccceeeeeeccccccccccccceecccccccccccceeecccccccceeeEeecCCcceeeecccccccccccccchh Confidence 00000 000 000 Q ss_pred -hhhhhh------------------------------------------------hccccccccccch--h---hhhhhH Q lcl|NC_021557. 135 -AYECYN------------------------------------------------NFGYFPKLIIAPG--Y---SPAAAV 160 (419) Q Consensus 135 -~~~~~~------------------------------------------------~~~~~p~~~~ap~--~---~~~~~v 160 (419) ..+... .-.+.+++++++. . ...+++ T Consensus 308 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~gg~d~~~~~~~~~~~~~~~~l~~~~~~~~~~~i~~~~~~~~~~~~~~v 387 (663) T protein:vir:10 308 FMDDYFRNGGSNFIFASSEGWPAGFTGIIQLGGGTSANADVGADELIKGWDLFSDREALHVNLMIAGACGSDGAEIASTV 387 (663) T ss_pred hhhhhhcCCcceEEEEeecccCccccceeEeccccCCccccchhhhHHHHHhhhcccccceeEEEeccCCCCchhhHHHH Confidence 000000 0000011111111 1 111335 Q ss_pred HHHHHH-HhhccceeEEEEeccCC--------CHHHHHhhhhh--------ccccccCccceEEecceeEeeccccccce Q lcl|NC_021557. 161 RAEMDV-VASRLHALAIADLPLGL--------TKQQAVAARGV--------AGTANTSSARTVLTYPHVVIEDTTGATET 223 (419) Q Consensus 161 ~a~l~~-~~~~~~~~~i~d~p~~~--------~~~~~~~~~~~--------~~~~~~~s~~~~~~~p~~~~~~~~~~~~~ 223 (419) +.+|.. |.++.++|+++|+|.+. +.+++..|+.. ....+++|+|+++||||++++|+. ++.. T Consensus 388 ~~~l~~~a~~~~~~~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~-~~~~ 466 (663) T protein:vir:10 388 QKYVVSLADDRQDCVAIVNPPAELMVGIPTSTAVKNIVEWRNGMTGSGEVVDNNMNISSTYAFIIGNYKYQYDKY-NDIN 466 (663) T ss_pred HHHHHHHHHhhCCEEEEEecCcccccccccccchHHHHHHHHhccccccchhhhcccCccceEEEcCceEEeccc-CCce Confidence 555544 45556799999999643 34556666643 234678999999999999999864 6778 Q ss_pred eeechHHHHHHHHHhhhhccCceecccCceeeceeecceecccccCCcchhhccccCCceEEEEEecC-CcEEEEecccc Q lcl|NC_021557. 224 RLDPLSSRLAGVIIATDLNEGWQNSPSNREIKGVVDLEVPINFYPSDYQNDTNFLNEAGIVTAMRSFA-TGIRVFGNRSA 302 (419) Q Consensus 224 ~~~p~s~~vAg~~a~~D~~~g~~~span~~l~gv~~~~~~~~~~~~~~~~~~~~L~~~gI~~i~~~~~-~G~~~wG~rT~ 302 (419) +++|||+++||++||+|.++||||||+|+++.++.+.. .+ .....+.|++.||++|||+|++|++ +|+++||+||+ T Consensus 467 ~~~p~sg~vAGl~Ar~D~~~g~~~sPan~~~~~i~g~~-~~--~~~~~~~e~~~Ln~~gin~i~~~~~~~G~~~wG~rT~ 543 (663) T protein:vir:10 467 RWVPLAADIAGLCAYTDQVSHPWMSPAGYRRGQIRNCI-KL--AIEPKQSMRDTMYQVAINPVTGFAGGDGFVLFGDKMA 543 (663) T ss_pred EEechhHHHHHHHHHhhccCCceEccCCceeccccccc-cc--eeecChhHHHHHhhCCceEEEEEeCCCcEEEEccccc Confidence 99999999999999999999999999999866555542 22 2333567889999999999999987 79999999999 Q ss_pred CCCCCcccceeeehhhHHHHHHHHHHHHHHHhhcCCCCHHHHHHHHHHHHHHHHHHHhhc-ccceEEEEecccCCHHHhh Q lcl|NC_021557. 303 AFPTSSHVENFIHARRILDMIHEAIIFYTMNYVDRLGSPMTVEAAEEGVNAYLRSKTGIA-IYGGTFRFDRQKNTAEQIA 381 (419) Q Consensus 303 ~~~s~~~~~~~i~vrR~~~~i~~~~~~~~~~~v~e~n~~~~~~~i~~~i~~~L~~l~~~g-~~~~~v~~d~~~n~~~~i~ 381 (419) +.. +..|+|||+||||+||+++|++.++|+|||||++.+|.+|+++|+.||++||++| ++||+|+||+++||+++|+ T Consensus 544 ~~~--~s~~~~i~vrR~~~~i~~si~~~~~~~v~epn~~~l~~~i~~~i~~~L~~l~~~gal~g~~v~~d~~~nt~~~i~ 621 (663) T protein:vir:10 544 TQV--PSPFDRINVRRLFNMLKKNIGDTSKYELFENNDAFTRQSFRMETSQYLDGIRSLGGCYDFRVVCDTTNNTPNVID 621 (663) T ss_pred CCC--CcccceEehhhHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEcCCCCCHHHhh Confidence 643 2379999999999999999999999999999999999999999999999999865 8899999999999999999 Q ss_pred CCEEEEEEEEEeccCceEEEEEEEEcch--HHHHHHHhcC Q lcl|NC_021557. 382 DGKFYYRLECHPISVMERITIDSYVDTK--FISNALSLAA 419 (419) Q Consensus 382 ~G~~~~~v~~~p~~p~e~i~~~~~~~~~--~~~~~~~~~a 419 (419) +|+|+++|+++|++|+|||+|++++... .++++++.++ T Consensus 622 ~G~~~~~i~~~p~~pae~i~~~~~~~~~~~~~~e~~~~~~ 661 (663) T protein:vir:10 622 RNEFVGTIYVKPPRSINYITLNMVATSTGANFDELIGPMQ 661 (663) T ss_pred CCeEEEEEEEEecCCcceEEEEEEEeecCccHHHHHHHHh Confidence 9999999999999999999999998764 4888888888 No 29 >protein:vir:100539 Length: 663 # NCBI annotation: gp18 tail sheath monomer # Family: family:all:661 # MgeID: mge:1488 # MgeName: 25 # Cross-refs: genbank:acc:YP_656380;genbank:gi:109290131;genbank:GeneID:4156517 Probab=100.00 E-value=1.8e-87 Score=496.20 Aligned_cols=400 Identities=17% Similarity=0.116 Sum_probs=295.8 Q ss_pred CCCccCCCeEEEEcCCCccCccccCccceEEEEccccccccccccccccccCcceeecchHHHHHHhcccchhhhHHHHH Q lcl|NC_021557. 1 MAATFHHGPEVIEHKDGVTVVRDVKSAVTYVNGTAPIQDVHATALAREDYINKRVIIRSRAEGAAAFGVHKAGYTIPAAL 80 (419) Q Consensus 1 Ma~~~~hGVyv~e~~~~~~~i~~v~tav~~~Vgta~~a~~~~~~~~~~~~~n~pv~its~~e~~~~fg~~~~~~~l~~al 80 (419) |. ++.|||||+|++ ++++|..+.|++.+|||.+++++ .|+|++|+|+.|+...||.......+..++ T Consensus 1 ~~-~~~Pgvyv~e~~-~~~~~~~v~t~~~~fvG~~~~gp-----------~~~p~~i~s~~~~~~~fG~~~~~~~~~~~v 67 (663) T protein:vir:10 1 MA-LLSPGIEMKETS-INSTVVRSATGRAALVGKFAWGP-----------AYEIRQVTNEVELVDMFGSPDNVTAPYFMS 67 (663) T ss_pred Cc-cccCceEEEEec-CcccccccccccceeeeccccCC-----------CCcCEEecCHHHHHHHcCCcccccchHHHH Confidence 66 446999999995 88899999999999999988764 489999999999999999988888899999 Q ss_pred HHHhhccCCcEEEEeeccccccc-c--------------------------cc----------------c--------c- Q lcl|NC_021557. 81 DAIFDQGDGGTIIVNNVFDPDVH-K--------------------------EG----------------A--------N- 108 (419) Q Consensus 81 ~~~~~~~~~~~~v~~~~~~~~~~-~--------------------------~~----------------~--------~- 108 (419) ..+|.++|..|+++++....... . .. . . T Consensus 68 ~~~f~ngg~~~~vvRv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~v~~~~~~~~~~~~~~~~~~~g~~~~~~~~~ 147 (663) T protein:vir:10 68 AMNFLQYGNDLRLVRVIDMEQAKNASPLFNQIEVTITTEGQGYTVGDTVSIKHNTTTVTEEGKVTKVDADGKIKALFVPS 147 (663) T ss_pred HHHHHhCCCeEEEEecCCcccccccccccccceeeEeecccCccccceeeecccccccccCcceeeeccCCceeEEEecc Confidence 99999999999999875321000 0 00 0 0 Q ss_pred ---------c--------c--------------c---------------------cccccceec---------------- Q lcl|NC_021557. 109 ---------P--------D--------------P---------------------SKVTTVDIN---------------- 120 (419) Q Consensus 109 ---------~--------~--------------~---------------------~~~t~~~~~---------------- 120 (419) . . . ...+..... T Consensus 148 a~~~~~a~~~~~~~~~~~a~~~~v~~~~~~~~~a~av~~i~~dg~vt~~~~~~a~~~~~~~~~~~~~~~~~~~~~~a~~~ 227 (663) T protein:vir:10 148 SAVIAKAKQLGTYPVLGDNWRAEVSGASGGSAATLTLGGIVVDSGVTFGNSEEAPDVMTSTKVLANFAKYGMPLISAVYP 227 (663) T ss_pred ccccccccccccccccccceeeEEeeccccccccceeEeeecCCceeEEeeeccccccccceeeeeccccccceeeeecc Confidence 0 0 0 000000000 Q ss_pred cc---------cccc--------------cc------------------------ccc-------c-------------- Q lcl|NC_021557. 121 GT---------ISPA--------------GL------------------------ASG-------F-------------- 132 (419) Q Consensus 121 g~---------~~~~--------------~~------------------------~tg-------~-------------- 132 (419) +. ...+ +. ..+ + T Consensus 228 g~~G~~i~v~~~~~~~~~~~~~~~v~~~~g~~~~~~~~~~~~g~~~~~~~~~~~~~~g~~~e~~~ls~~~~~~~~~~~~~ 307 (663) T protein:vir:10 228 GEIGSTVEVEVISKTAFQSGAAQPIYPFGGTRASNARSVIQYGPMTDDQFAIIVRRDGIVVESTVLSTRRGDRDVYGNNI 307 (663) T ss_pred cccCcceeEeecccccccccceeeecccCcccccccccccccccccchhhcccccCCCcccceeeeeccccccccchhhh Confidence 00 0000 00 000 0 Q ss_pred ---hhhhhhhh-------------------------------------------------hccccccccccchhhhhhhH Q lcl|NC_021557. 133 ---SGAYECYN-------------------------------------------------NFGYFPKLIIAPGYSPAAAV 160 (419) Q Consensus 133 ---~a~~~~~~-------------------------------------------------~~~~~p~~~~ap~~~~~~~v 160 (419) ..+..... ...+++.....++.+..+.| T Consensus 308 ~~~~~~~~~~s~~v~~~~~~~~~~~~~~~~l~gg~~~~~~~~~~d~~~~~~~l~~~~~~~~~~~i~~~~~~~~~~~~~~v 387 (663) T protein:vir:10 308 FMDDYFRNGSSNFIYASSVNWPAGFTGIIQLGGGASANNAVGSDELIAGWDLFADREALHVNLMIAGACKSDGVAVASTV 387 (663) T ss_pred hhhhhhcCcccceeEeeccccCcccceeEEecccccCcccchhhhhhhHHhhhccccccCceEEEeecCCCCchhhHHHH Confidence 00000000 00000000112222333455 Q ss_pred HHHHHH-HhhccceeEEEEeccCCC--------HHHHHhhhhh--------ccccccCccceEEecceeEeeccccccce Q lcl|NC_021557. 161 RAEMDV-VASRLHALAIADLPLGLT--------KQQAVAARGV--------AGTANTSSARTVLTYPHVVIEDTTGATET 223 (419) Q Consensus 161 ~a~l~~-~~~~~~~~~i~d~p~~~~--------~~~~~~~~~~--------~~~~~~~s~~~~~~~p~~~~~~~~~~~~~ 223 (419) +++|.. |.++.+||+++|+|.+.. .+++.+||.. ....+++|+|+++||||++++|+. ++.. T Consensus 388 ~~~l~~~~~~~~~~~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~l~~p~~~~~d~~-~~~~ 466 (663) T protein:vir:10 388 QKHVVALADDRQDCVAFVNPPSELLVGVPTTQAVKNIVEWRNGVTTGGEVVDNNMNISSTYAFISGNYKYQYDKY-NDIN 466 (663) T ss_pred HHHHHHHHHhhCCEEEEEecCcccccccchhhhHHHHHHHhhhccccchhhhhhcccCcceEEEEecceeEeccc-CCce Confidence 555544 555567999999997643 2455666643 234678999999999999999864 5678 Q ss_pred eeechHHHHHHHHHhhhhccCceecccCceeeceeecceecccccCCcchhhccccCCceEEEEEecC-CcEEEEecccc Q lcl|NC_021557. 224 RLDPLSSRLAGVIIATDLNEGWQNSPSNREIKGVVDLEVPINFYPSDYQNDTNFLNEAGIVTAMRSFA-TGIRVFGNRSA 302 (419) Q Consensus 224 ~~~p~s~~vAg~~a~~D~~~g~~~span~~l~gv~~~~~~~~~~~~~~~~~~~~L~~~gI~~i~~~~~-~G~~~wG~rT~ 302 (419) +++|||+++||++||+|.++||||||+|+.+.++.++. ++.+.+ .+.|.+.||++|||+|+++++ +||++||+||+ T Consensus 467 ~~~p~s~~vAGl~Ar~D~~~g~~~span~~~~~i~g~~-~~~~~~--~~~~~~~Ln~~gin~i~~~~~~~G~~~wG~rT~ 543 (663) T protein:vir:10 467 RWVPLSADIAGLCAYTDQVGHPWMSPAGYRRGQLRNTI-KLAIEP--KQSLRDTMYQVSINPVTGFAGGDGFVLFGDKMA 543 (663) T ss_pred EEechHHHHHHHHHHhhccCCcEEccCCeeecceeccc-cceeec--CchhHHHHHhCCCcEEEEeeCCCcEEEEccccc Confidence 99999999999999999999999999999877776653 233333 456778999999999999987 79999999999 Q ss_pred CCCCCcccceeeehhhHHHHHHHHHHHHHHHhhcCCCCHHHHHHHHHHHHHHHHHHHhhc-ccceEEEEecccCCHHHhh Q lcl|NC_021557. 303 AFPTSSHVENFIHARRILDMIHEAIIFYTMNYVDRLGSPMTVEAAEEGVNAYLRSKTGIA-IYGGTFRFDRQKNTAEQIA 381 (419) Q Consensus 303 ~~~s~~~~~~~i~vrR~~~~i~~~~~~~~~~~v~e~n~~~~~~~i~~~i~~~L~~l~~~g-~~~~~v~~d~~~n~~~~i~ 381 (419) +.. ++.|+||++|||++||+++|++.++|+|||||++.+|++|+++|+.||++||++| ++||+|+||+++||+++|+ T Consensus 544 s~~--~s~~~~i~vrR~~~~i~~si~~~~~~~v~epn~~~l~~~i~~~i~~~L~~l~~~gal~gf~V~~d~~~nt~~~i~ 621 (663) T protein:vir:10 544 TQV--PSPFDRINVRRLFNMLKKNIGDTSKYELFENNDAFTRQSFRMEVSQYLDNIRSLGGVYDFRVVCDTTNNTPQVID 621 (663) T ss_pred CCC--CcccceEehhhHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEcCCCCCHHHhh Confidence 543 3479999999999999999999999999999999999999999999999999865 7899999999999999999 Q ss_pred CCEEEEEEEEEeccCceEEEEEEEEcchH--HHHHHHhcC Q lcl|NC_021557. 382 DGKFYYRLECHPISVMERITIDSYVDTKF--ISNALSLAA 419 (419) Q Consensus 382 ~G~~~~~v~~~p~~p~e~i~~~~~~~~~~--~~~~~~~~a 419 (419) +|+|+++|+++|++|+|||+|++++...+ ++++.+++= T Consensus 622 ~G~~~~~i~~~p~~pae~I~~~~~~~~~~~~f~e~~~~~~ 661 (663) T protein:vir:10 622 SNEFVATIYIKAPRSINYITLNFVATSTGANFDELIGPAQ 661 (663) T ss_pred CCeEEEEEEEEecCCcceEEEEEEEEecCccHHHHHHHHh Confidence 99999999999999999999999988665 444444433 No 30 >protein:vir:104477 Length: 749 # NCBI annotation: gp18 # Family: family:all:661 # MgeID: mge:1548 # MgeName: P-SSM4 # Cross-refs: genbank:acc:YP_214663;genbank:gi:61806304;genbank:GeneID:3294532 Probab=100.00 E-value=4.9e-87 Score=493.74 Aligned_cols=394 Identities=15% Similarity=0.101 Sum_probs=289.1 Q ss_pred CCCcc-CCCeEEEEcCCCccCccccCccceEEEEccccccccccccccccccCcceeecchHHHHHHhcccchhhhHHHH Q lcl|NC_021557. 1 MAATF-HHGPEVIEHKDGVTVVRDVKSAVTYVNGTAPIQDVHATALAREDYINKRVIIRSRAEGAAAFGVHKAGYTIPAA 79 (419) Q Consensus 1 Ma~~~-~hGVyv~e~~~~~~~i~~v~tav~~~Vgta~~a~~~~~~~~~~~~~n~pv~its~~e~~~~fg~~~~~~~l~~a 79 (419) ||.+| .|||||+|++.+ +++..+.|++.+|||.+++++ .|+|++|+||.|+.+.||.......++.+ T Consensus 1 M~~~~~~PgVyv~e~~~~-~~~~~~~t~~~~fvG~~~~Gp-----------~~~p~~v~s~~~~~~~fG~~~~~~~~~~~ 68 (749) T protein:vir:10 1 MATNQSSPGVVIQERDLT-TVSTIPTANVGVIAAPFTKGP-----------VEEVIEITSERQLAEKFGEPNESNYEYWF 68 (749) T ss_pred CCccccCCeeEEEEecCC-cccccccCceeEEEeccCCCC-----------CccCEEcCCHHHHHHHcCCccCCcccHHH Confidence 99854 699999999876 568889999999999887775 48999999999999999998888889999 Q ss_pred HHHHhhccCCcEEEEeeccccccccc------------------------------ccc--------------------- Q lcl|NC_021557. 80 LDAIFDQGDGGTIIVNNVFDPDVHKE------------------------------GAN--------------------- 108 (419) Q Consensus 80 l~~~~~~~~~~~~v~~~~~~~~~~~~------------------------------~~~--------------------- 108 (419) +..+|.++|..|++++.......+.. ..+ T Consensus 69 v~~~F~ngg~~~~vvRv~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~pG~~gn~l~v~v~~~~~~~~~~~ 148 (749) T protein:vir:10 69 SAAQFLSYGGLLKTIRVNSSSLKNAVDTGTAPLVKNLQDYETSIEDASNNFSWVARTPGDTGNSIGIFVTDAGADQVVVV 148 (749) T ss_pred HHHHHhhcCCeEEEEEccCccccccccccccccccccccccccccccccceEEEeccCCCcCCceEEEEEcCCCceeeee Confidence 99999999999999986321100000 000 Q ss_pred ----------------------------------------------------------------ccccc-cccce----- Q lcl|NC_021557. 109 ----------------------------------------------------------------PDPSK-VTTVD----- 118 (419) Q Consensus 109 ----------------------------------------------------------------~~~~~-~t~~~----- 118 (419) .+... ..... T Consensus 149 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~a~~~~~~~~~~~~~~~~~~s~~ 228 (749) T protein:vir:10 149 PAPGSGNEHEFVADAAVSAASGAAGKVFKYSIILTIDDVVGTFAPGSATTITIGGSAESVNVLAYDATNKKLEIGLPSGG 228 (749) T ss_pred ecCCccceeeEEeeecccccccccccccccceeeeeccccceeecccceeeeccCcccccccccccCCcceEEEeeeccc Confidence 00000 00000 Q ss_pred eccc------------------------------------cc---------------------cccccccc--------- Q lcl|NC_021557. 119 INGT------------------------------------IS---------------------PAGLASGF--------- 132 (419) Q Consensus 119 ~~g~------------------------------------~~---------------------~~~~~tg~--------- 132 (419) ..+. .. .....++. T Consensus 229 ~~~~~a~~~~v~~~~~~~~~~~~i~~~~~~~~~~~~~~~a~~~~~~~~~g~~~~it~v~~~~~~~~~~t~~~~~~~a~~~ 308 (749) T protein:vir:10 229 VTGILADNQVITQGTNTAKINVTIERKLLVALNKSSIEFAASDVVQDTNSTNITITSVRDEYTEREYLPGVKWINVAPRP 308 (749) T ss_pred ccceeeeeecccccccccccccccccchhhhhccccceeeccccccCCccceeEEEeeeccccccccccceeeccccccc Confidence 0000 00 00000000 Q ss_pred -----------------------------------hh---hh---hhh-h---------------hcc------------ Q lcl|NC_021557. 133 -----------------------------------SG---AY---ECY-N---------------NFG------------ 143 (419) Q Consensus 133 -----------------------------------~a---~~---~~~-~---------------~~~------------ 143 (419) +. +. ... . .+. T Consensus 309 gt~~~~~~~~g~~D~~~v~v~~~~g~~~~~~g~v~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~v~~~~~~~~~~~~ 388 (749) T protein:vir:10 309 GTSLYANGVGGHRDEMHVILVDIDGGVTGTVGALLERYIDVSKASDAKTSVGETNYYAEVIKQKSEFIYWAEHESTLYAA 388 (749) T ss_pred cceeeeecccCCCCceEEEEecCCCeeeecccceeeeeeeccccccccccccccchhhhhhccCCCEEEEEecccccccc Confidence 00 00 000 0 000 Q ss_pred -------------------------------------------------------------------------------- Q lcl|NC_021557. 144 -------------------------------------------------------------------------------- 143 (419) Q Consensus 144 -------------------------------------------------------------------------------- 143 (419) T Consensus 389 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~d~~~~~~~~~~~~~~~~~~~~~l~~~~ 468 (749) T protein:vir:10 389 TSSASDGLFGQTAANRQFNLFRSAAGSVDYPAGVTTLGSKNNATYYYRLSGGVNYTVSAGQYTITNTDIGSAYELIGDPE 468 (749) T ss_pred cccccccccccccccceeeccccccccceeccccccccccCCcEEEEEccCCcccccccccccccchhHHHHHHHhhhhh Confidence Q ss_pred --cccc-ccccchhh--hhhhHH-HHHHHHhhccceeEEEEeccCCCH---------HHHHhhhhhccccccCccceEEe Q lcl|NC_021557. 144 --YFPK-LIIAPGYS--PAAAVR-AEMDVVASRLHALAIADLPLGLTK---------QQAVAARGVAGTANTSSARTVLT 208 (419) Q Consensus 144 --~~p~-~~~ap~~~--~~~~v~-a~l~~~~~~~~~~~i~d~p~~~~~---------~~~~~~~~~~~~~~~~s~~~~~~ 208 (419) .++. ++..++.+ ...+++ +++.+|.++.++++++|+|.+... .++..++. .+++++|+++| T Consensus 469 ~~~~~~li~~~~~~~~~~~~~v~~al~~~~~~~~~~~~~~d~p~~~~~~~~~~~~~~~~~~~~~~----~~~~s~~~~~~ 544 (749) T protein:vir:10 469 SQIVDFIISGPSGTSDANALAKITSLVNIAEERRDCMVFVSPRRGNVIGISNTTTITTNIVDFFK----KLPSSSYMVFD 544 (749) T ss_pred hcccceEEEecCCCCcchhHHHHHHHHHHHhhcCCEEEEEcCCCCcccccccchhhhhHHHHHHh----hccCceeEEEE Confidence 0000 00011111 112343 445556667788999998865432 22333332 34678899999 Q ss_pred cceeEeeccccccceeeechHHHHHHHHHhhhhccCceecccCceeeceeecceecccccCCcchhhccccCCceEEEEE Q lcl|NC_021557. 209 YPHVVIEDTTGATETRLDPLSSRLAGVIIATDLNEGWQNSPSNREIKGVVDLEVPINFYPSDYQNDTNFLNEAGIVTAMR 288 (419) Q Consensus 209 ~p~~~~~~~~~~~~~~~~p~s~~vAg~~a~~D~~~g~~~span~~l~gv~~~~~~~~~~~~~~~~~~~~L~~~gI~~i~~ 288 (419) |||++++|+. ++..+++|||+++||++||+|.++||||||+|+++.++.+.. .+.+. ..+.|++.||++|||+|++ T Consensus 545 ~p~~~~~d~~-~~~~~~~p~s~~vAGl~Ar~D~~~g~~~SPan~~~~~i~g~~-~~~~~--~~~~e~~~Ln~~gIn~i~~ 620 (749) T protein:vir:10 545 SGYKYIYDKY-NDVYRYIPCNGDTAGLCLQTNEISEPWFSPAGFQRGVLRNAI-KLAYT--PNKAQRDQLYANRVNPIVS 620 (749) T ss_pred ccceeeeccc-cCceEEechHHHHHHHHHHhhccCCcEECcCCceeeeeeccc-cceee--cChhHHHhhhhCCceEEEE Confidence 9999999864 677899999999999999999999999999999865555432 22222 2466789999999999999 Q ss_pred ecCCcEEEEeccccCCCCCcccceeeehhhHHHHHHHHHHHHHHHhhcCCCCHHHHHHHHHHHHHHHHHHHhh-cccceE Q lcl|NC_021557. 289 SFATGIRVFGNRSAAFPTSSHVENFIHARRILDMIHEAIIFYTMNYVDRLGSPMTVEAAEEGVNAYLRSKTGI-AIYGGT 367 (419) Q Consensus 289 ~~~~G~~~wG~rT~~~~s~~~~~~~i~vrR~~~~i~~~~~~~~~~~v~e~n~~~~~~~i~~~i~~~L~~l~~~-g~~~~~ 367 (419) |+++|+++||+||++ ++|++|+|||||||++||+++|++.++|+|||||++.||++|+++++.||++||++ ++++|+ T Consensus 621 ~~g~G~~~wG~rT~~--s~d~~~~~i~vRRl~~~ie~si~~~~~~~v~epn~~~l~~~i~~~i~~fL~~l~~~G~i~~f~ 698 (749) T protein:vir:10 621 FPGQGVVLYGDKTAL--GFASAFDRINIRRLFLTVERVISTAAKAQLFEQNDEAQRSLFINIVEPYLRDVQGRRGVVDFL 698 (749) T ss_pred ecCCeEEEEcceecC--CCCcccceeehhhhHHHHHHHHHHHHHHhhcCCCCHHHHHHHHHHHHHHHHHHHhcCCeeeeE Confidence 999999999999996 45678999999999999999999999999999999999999999999999999985 488999 Q ss_pred EEEecccCCHHHhhCCEEEEEEEEEeccCceEEEEEEEEcch--HHHHHHH Q lcl|NC_021557. 368 FRFDRQKNTAEQIADGKFYYRLECHPISVMERITIDSYVDTK--FISNALS 416 (419) Q Consensus 368 v~~d~~~n~~~~i~~G~~~~~v~~~p~~p~e~i~~~~~~~~~--~~~~~~~ 416 (419) |+||+++||+++|++|+|+++|+++|++|+|||+|++++... +++|+.+ T Consensus 699 V~~d~~~Nt~~~i~~G~~~~~i~~~P~~pae~I~~~~~~~~~~~~~~e~~s 749 (749) T protein:vir:10 699 VKCDSTNNTPEAVDRGEFYAEVFLKPTRTINYVQLTFVATRTGVSFAEVAS 749 (749) T ss_pred EEEcCCCCCHHHhhCCEEEEEEEEEecCCccEEEEEEEEeecCcchHHHhC Confidence 999999999999999999999999999999999999988765 5666666 No 31 >protein:vir:98824 Length: 774 # NCBI annotation: putative phage tail sheath protein # Family: family:all:661 # MgeID: mge:1530 # MgeName: Ma-LMM01 # Cross-refs: genbank:acc:YP_851105;genbank:gi:117530262;genbank:GeneID:4484488 Probab=100.00 E-value=1.5e-82 Score=469.19 Aligned_cols=391 Identities=15% Similarity=0.132 Sum_probs=273.7 Q ss_pred CCCcc-CCCeEEEEcCCCccCccc-cCccceEEEEccccccccccccccccccCcceeecchHHHHHHhcccchhhhHHH Q lcl|NC_021557. 1 MAATF-HHGPEVIEHKDGVTVVRD-VKSAVTYVNGTAPIQDVHATALAREDYINKRVIIRSRAEGAAAFGVHKAGYTIPA 78 (419) Q Consensus 1 Ma~~~-~hGVyv~e~~~~~~~i~~-v~tav~~~Vgta~~a~~~~~~~~~~~~~n~pv~its~~e~~~~fg~~~~~~~l~~ 78 (419) |.-++ .|||||+|+++++++|.. +.|++.+|||.++.++ .|+|++|+||.|+..+|+.......... T Consensus 279 ~~~~v~~~GVYVEEVpSGvrtIeGGV~TSVAAFVG~A~rGP-----------vn~PvlITS~aD~~~~Fg~~~GGl~Gas 347 (774) T protein:vir:98 279 ITRNVEDNGVVIQLEPALTGSISNRFSFYVTANDNTANRGF-----------TTSPALVTTIPDPAIHFTSFQGGLDGPR 347 (774) T ss_pred eEEEEecCceEEEEeCCCCccccccccceeeeecccccCCC-----------CCcCEEEeehhHhhhhhccccCCccccc Confidence 66555 599999999999999987 9999999999887765 4899999999997777753211000000 Q ss_pred -HHHHHhhccCCcEEEEeeccccc----------------c--ccc----------c---------ccccccc-cc---c Q lcl|NC_021557. 79 -ALDAIFDQGDGGTIIVNNVFDPD----------------V--HKE----------G---------ANPDPSK-VT---T 116 (419) Q Consensus 79 -al~~~~~~~~~~~~v~~~~~~~~----------------~--~~~----------~---------~~~~~~~-~t---~ 116 (419) +...++...+...+.+.....+. . ... . ...+... .. . T Consensus 348 sA~r~~~~~sG~~~L~i~A~~pGawGN~ItV~I~~~t~~~~~l~v~~~~~s~f~~~~a~e~~tv~~~~~~~~~~v~e~~d 427 (774) T protein:vir:98 348 SAFRDFYTFNGTPLLRLQAVSEGNWGNQVTVSIYPVNNSEFRLNVQDLNGSAFNPPLADEVYTVKLGDTNESGELNALLD 427 (774) T ss_pred eeeeeeeeecccceEEEEEeecCcCCCceEEEEEecCCceeEEEEEecCCccccccccceeEEEecccccccceeeeeec Confidence 11111111111100000000000 0 000 0 0000000 00 0 Q ss_pred ce-ecc-------------------------cc----------------------------ccccccccchhhhhhhhhc Q lcl|NC_021557. 117 VD-ING-------------------------TI----------------------------SPAGLASGFSGAYECYNNF 142 (419) Q Consensus 117 ~~-~~g-------------------------~~----------------------------~~~~~~tg~~a~~~~~~~~ 142 (419) .. +.+ .. ..++..+....+....... T Consensus 428 n~~i~~~~~~~~~~~in~vs~lv~~~~~~~a~~d~~~~~~~~~~~~~~~~~~~~v~v~lagG~Dg~~tt~~~igg~~~~~ 507 (774) T protein:vir:98 428 SKFIRGFFLPKSIDSINYDAALVRQSPLRLAPPDESETDVENPAHVDFYGPNVLVDVTLENGYDGPPVTNDDYVSIIRTL 507 (774) T ss_pred eeeEeecccccccccccccccccccchhcccccccccccccccccccccCCcceEEEeecCCCCcccccchheecccccc Confidence 00 000 00 0000000000011111111 Q ss_pred cccccccccchhhhhhhHHHHHHHHh----hccceeEEEEeccCCCHHHHHhhhhhccccccCccceEEecceeEeeccc Q lcl|NC_021557. 143 GYFPKLIIAPGYSPAAAVRAEMDVVA----SRLHALAIADLPLGLTKQQAVAARGVAGTANTSSARTVLTYPHVVIEDTT 218 (419) Q Consensus 143 ~~~p~~~~ap~~~~~~~v~a~l~~~~----~~~~~~~i~d~p~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~~~~ 218 (419) ....-..+..+........+.+.+|. .+.++++++|.|.+.+.++++.+|. +++|+|+++||||++++|+. T Consensus 508 ~~tgi~aLl~a~~~~~V~~aii~~~e~~~~~~~~r~avid~p~g~t~~~Ai~~r~-----~f~S~~aal~~Pwvkv~D~~ 582 (774) T protein:vir:98 508 ENQPVHILLVGTTNVGVQQALITEAERASDSDGLRIAVLAAPPRTTPTLAASVTR-----GFNSTRAVMVAGWFTYAGQP 582 (774) T ss_pred cccceeEEEcCccchhhHHHHHHHHHHhhhcccceEEEEECCCCCCHHHHHHHHh-----ccCCceEEEEeCcEEEeccC Confidence 11100011112222222233334443 2467899999999999999999884 58899999999999999964 Q ss_pred cccceeeechHHHHHHHHHhhhhccCceecccCceeeceeecceecccccCCcchhhccccCCceEEEE-EecCCcEEEE Q lcl|NC_021557. 219 GATETRLDPLSSRLAGVIIATDLNEGWQNSPSNREIKGVVDLEVPINFYPSDYQNDTNFLNEAGIVTAM-RSFATGIRVF 297 (419) Q Consensus 219 ~~~~~~~~p~s~~vAg~~a~~D~~~g~~~span~~l~gv~~~~~~~~~~~~~~~~~~~~L~~~gI~~i~-~~~~~G~~~w 297 (419) ++..+++|||+++||++||+| ||+||+|++|+|+++++.++.......+.+.+.|+.++||+++ .++++|+++| T Consensus 583 -~g~~~~vPpSg~vAGl~ArtD----v~kSPANk~I~Givg~ai~~~l~~~~t~ae~d~Ln~~gIN~i~itt~g~G~rvW 657 (774) T protein:vir:98 583 -NSSRYGVPGAAVYAGKLAAID----FFVSPAARSLVGPLFNIIESDTDNYTSRSNQDIYSAARLEVLSLDTVDRTYRFA 657 (774) T ss_pred -CCceeecChhHHHHHHHHhcC----cccccCCceeecceeccccccccccccchhhhhhcccccceeEEEEcCCcEEEE Confidence 556789999999999999999 9999999999999999888777666677788899999999986 5789999999 Q ss_pred eccccCCCCCcccceeeehhhHHHHHHHHHHHHHHHhhcCCCCHHHHHHHHHHHHHHHHHHHhhc-ccceE-EEEecccC Q lcl|NC_021557. 298 GNRSAAFPTSSHVENFIHARRILDMIHEAIIFYTMNYVDRLGSPMTVEAAEEGVNAYLRSKTGIA-IYGGT-FRFDRQKN 375 (419) Q Consensus 298 G~rT~~~~s~~~~~~~i~vrR~~~~i~~~~~~~~~~~v~e~n~~~~~~~i~~~i~~~L~~l~~~g-~~~~~-v~~d~~~n 375 (419) |+||++ +|+.|+||++|||++||+++|.+.++|+|||||++.+|++|+++++.||++||++| ++|++ |+||+++| T Consensus 658 G~RTls---sDp~wr~InVRRlfd~Ie~SI~~~~~~~VfEPNd~~l~~~I~~sI~~fL~~L~~~GaL~G~~~V~~D~etN 734 (774) T protein:vir:98 658 SGVTLS---TDPAWERIYLRRVHDVVRQGAHAILRNYVAMPNSRLVRNQIAAALNAFMGELKRNGNIVSFRPAIIDGSNN 734 (774) T ss_pred cccccC---CCcccceEeehhhHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhCCceecceEEEEcCCCC Confidence 999994 57889999999999999999999999999999999999999999999999999876 77876 89999999 Q ss_pred CHHHhhCCEEEEEEEEEeccCceEEEEEEEEcchHHHHHHHh Q lcl|NC_021557. 376 TAEQIADGKFYYRLECHPISVMERITIDSYVDTKFISNALSL 417 (419) Q Consensus 376 ~~~~i~~G~~~~~v~~~p~~p~e~i~~~~~~~~~~~~~~~~~ 417 (419) |+++|++|+|+++|+++|++|+|||+|+++++.++. -|.+ T Consensus 735 t~~dI~~G~l~i~I~vaP~~PAEfIilri~q~t~~~--~l~E 774 (774) T protein:vir:98 735 STAAYFSRELYVSLQFQPLYSADYIYVTISRDTETS--PLGE 774 (774) T ss_pred CHHHhhCCEEEEEEEEEecCCcceEEEEEEEeecce--eccC Confidence 999999999999999999999999999999999862 2222 No 32 >protein:vir:5833 Length: 742 # NCBI annotation: similar to tail sheath protein # Family: family:all:661 # MgeID: mge:123 # MgeName: RM 378 # Cross-refs: genbank:acc:NP_835619;genbank:gi:30044022 Probab=100.00 E-value=1.3e-72 Score=414.67 Aligned_cols=376 Identities=14% Similarity=0.044 Sum_probs=238.2 Q ss_pred CCCc------cCCCeEEEEcCCCccCcccc--CccceEE-------EEcccccccccccc--ccccccCcceeecchHHH Q lcl|NC_021557. 1 MAAT------FHHGPEVIEHKDGVTVVRDV--KSAVTYV-------NGTAPIQDVHATAL--AREDYINKRVIIRSRAEG 63 (419) Q Consensus 1 Ma~~------~~hGVyv~e~~~~~~~i~~v--~tav~~~-------Vgta~~a~~~~~~~--~~~~~~n~pv~its~~e~ 63 (419) |... -...|++.-..+.+++.... ...++.. ++..+.+.....+. ....+.+.++++...... T Consensus 343 ~v~d~~~~~~~~~~v~~~~t~~~~~pp~~~~~~e~v~~ngG~~f~v~s~~~~g~~i~~~~as~~~s~ln~~~~V~Gt~aa 422 (742) T protein:vir:58 343 SVGDFTVIVNELTNVSIPVTDSAIIPPMRFTRIEQITLSGGASFSVISNQPYGFNIQDSRHSYWLSPFKDDELIIGTELV 422 (742) T ss_pred cccceeeeccccccceeeccccccCCcccccccceeecccCcceEEEEecccCcceeccCcceEEeccCCceEEEeehhh Confidence 1110 11244554444443331111 0111111 11010000000000 001134555554322211 Q ss_pred HHHhcccchhhhHHHHHHHHhhccCCcEEEEeeccccccccccccccccccccceecc----ccccccccccchhhhhhh Q lcl|NC_021557. 64 AAAFGVHKAGYTIPAALDAIFDQGDGGTIIVNNVFDPDVHKEGANPDPSKVTTVDING----TISPAGLASGFSGAYECY 139 (419) Q Consensus 64 ~~~fg~~~~~~~l~~al~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~t~~~~~g----~~~~~~~~tg~~a~~~~~ 139 (419) ......... .... ...+............+... ..........+.++ .....++++|++++.+.. T Consensus 423 ~~~~d~~t~------~~v~-s~~~alp~~a~sv~laGG~d----g~v~v~~~~~D~iG~~~~~d~~~adrTGL~ALlev~ 491 (742) T protein:vir:58 423 LPALDVSTE------FGVS-SWEEALPEFSFLMPFQGGSD----GYIRVDENEPDTIGRVKITPALLANYERLLPLLTED 491 (742) T ss_pred ccccccchh------eecc-ccccccceeeEEEeecCCcc----ccccccCCCcccccccccccccccchhHHHHhhhcC Confidence 111100000 0000 00000000000000000000 00000111111111 122345678888887765 Q ss_pred hhccccccccccchhhhhhhHHHHHHHHhhcc-ceeEEEEeccCCCH-HHHHhhhhhccccccCccceEEecceeEeecc Q lcl|NC_021557. 140 NNFGYFPKLIIAPGYSPAAAVRAEMDVVASRL-HALAIADLPLGLTK-QQAVAARGVAGTANTSSARTVLTYPHVVIEDT 217 (419) Q Consensus 140 ~~~~~~p~~~~ap~~~~~~~v~a~l~~~~~~~-~~~~i~d~p~~~~~-~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~~~ 217 (419) . +.++++|++++.+.+.+.+++|+... +.+++.|+|.+.+. .++..++ .+++|+|+++||||++..+. T Consensus 492 e-----VtILiAPG~t~~~v~aav~A~la~a~~Rl~vL~D~P~~~tt~~~A~a~r-----~~~nSsraaly~PwVkv~d~ 561 (742) T protein:vir:58 492 Q-----FDLVLTPYLTFADHAGTVNAFINRAENRFLYLFDIAGDDDTENLAISLA-----GYINSSFATTFFPWVRRLTN 561 (742) T ss_pred C-----CcEEEEcCCCchHHHHHHHHHHHhhcCCeEEEEecCCCCchHHHHHHHH-----hccCCceEEEEeceeeeccC Confidence 3 57889999987666655556665433 45667788866543 3445554 45789999999999988763 Q ss_pred ccccceeeechHHHHHHHHHhhhhccCceecccCceeeceeecceecccccCCcchhhccccCCceEEEEEecCCcEEEE Q lcl|NC_021557. 218 TGATETRLDPLSSRLAGVIIATDLNEGWQNSPSNREIKGVVDLEVPINFYPSDYQNDTNFLNEAGIVTAMRSFATGIRVF 297 (419) Q Consensus 218 ~~~~~~~~~p~s~~vAg~~a~~D~~~g~~~span~~l~gv~~~~~~~~~~~~~~~~~~~~L~~~gI~~i~~~~~~G~~~w 297 (419) +..+++|||+++||++||+|.++|+|+||+|+.+.+... ..+.|++.||++|||+|+++ |+|+++| T Consensus 562 ---~~~r~vPpSgaIAGL~ARtD~erGvw~SPANrgii~~~~----------~s~se~d~LN~~GINtIrsf-G~G~rlW 627 (742) T protein:vir:58 562 ---KGMRTVPASLAAYRSIRTTDPETGLAPVGARRGVVTGEP----------VRQVDWEDLYNNRINPIVRV-GNDVLLF 627 (742) T ss_pred ---CcceeechHHHHHHHHHHhccCCceEecCCcceeeeccc----------cchhhHHHHhhCCceEEEEC-CCcEEEE Confidence 456889999999999999999999999999986533221 13567889999999999876 7899999 Q ss_pred eccccCCCCCcccceeeehhhHHHHHHHHHHHHHHHhhcCCCCHHHHHHHHHHHHHHHHHHHhhc-ccceEEEEecccCC Q lcl|NC_021557. 298 GNRSAAFPTSSHVENFIHARRILDMIHEAIIFYTMNYVDRLGSPMTVEAAEEGVNAYLRSKTGIA-IYGGTFRFDRQKNT 376 (419) Q Consensus 298 G~rT~~~~s~~~~~~~i~vrR~~~~i~~~~~~~~~~~v~e~n~~~~~~~i~~~i~~~L~~l~~~g-~~~~~v~~d~~~n~ 376 (419) |+||++ ++|+.|+||++|||++||+++|+++++|++||||++.||++|++++++||++||++| ++||+|+||+ +|| T Consensus 628 GnRTla--ssDs~wryInVRRlfd~Ie~SI~~a~q~~VfEPNd~~L~~sIk~sInafL~~L~aqGALlGfrV~lDe-tNT 704 (742) T protein:vir:58 628 GQKTML--NVNSALNRINVRRLLIVMRNRISQILSSYLFENNTSENRLRAEALVRQYLESLRLRGAVTDYEVAIDS-VTT 704 (742) T ss_pred cceecC--CCCcccceEeehhhHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEcC-CCC Confidence 999996 457789999999999999999999999999999999999999999999999999865 7899999995 588 Q ss_pred HHHhhCCEEEEEEEEEeccCceEEEEEEEEcchHHHHHHH Q lcl|NC_021557. 377 AEQIADGKFYYRLECHPISVMERITIDSYVDTKFISNALS 416 (419) Q Consensus 377 ~~~i~~G~~~~~v~~~p~~p~e~i~~~~~~~~~~~~~~~~ 416 (419) +++|++|+|+++|+++|++|||||+|++.+...+.+ |+ T Consensus 705 peDI~~Gklvv~I~vAP~~PAEfI~lrf~it~tga~--Fs 742 (742) T protein:vir:58 705 PTDIDNNTLRARVTVQPARSIEYIDITFVITPTGVE--IT 742 (742) T ss_pred HHHhhCCEEEEEEEEEccCCcceEEEEEEEEecccc--cC Confidence 999999999999999999999999999888777653 22 No 33 >protein:vir:79798 Length: 717 # NCBI annotation: tail sheath subunit # Family: family:all:12069 # MgeID: mge:1874 # MgeName: 0305phi8-36 # Cross-refs: genbank:acc:YP_001429630;genbank:gi:156564120;genbank:GeneID:5525563 Probab=100.00 E-value=1.7e-50 Score=293.38 Aligned_cols=357 Identities=14% Similarity=0.043 Sum_probs=198.0 Q ss_pred CCCccCCCeEEEEcCCCccCccccCccceEEEEccccccccccccccccccCcceeecchHHHHHHhcccchhhhHHHHH Q lcl|NC_021557. 1 MAATFHHGPEVIEHKDGVTVVRDVKSAVTYVNGTAPIQDVHATALAREDYINKRVIIRSRAEGAAAFGVHKAGYTIPAAL 80 (419) Q Consensus 1 Ma~~~~hGVyv~e~~~~~~~i~~v~tav~~~Vgta~~a~~~~~~~~~~~~~n~pv~its~~e~~~~fg~~~~~~~l~~al 80 (419) |++ ..+|+.-+. |-.....+.+.|+.-- ++ ++ ..+.++.+..+++.++...|-..... .- T Consensus 330 ~~~-~~~g~~~~~------pl~~ts~dy~~~~~~v-dg-I~------~~~~~~V~~~g~~s~a~a~~~~g~~s-----~d 389 (717) T protein:vir:79 330 KPE-SKRGMISED------PLVFKSGDYTNFKMLV-DA-IN------NHPFNNVVRARTKPEFEATFTSTLQA-----AA 389 (717) T ss_pred ccc-ccCcceecc------ccccccCceeeeeeee-cc-cc------cCchhheeeeecccccceeeeecccC-----ch Confidence 553 346665433 1111122222222210 01 00 01235555566555554433211000 00 Q ss_pred HHHhhccCCcEEEEeeccccccccccccccccccccceeccccccccccccchhhhhhhhhccccccccccc-hhhhhhh Q lcl|NC_021557. 81 DAIFDQGDGGTIIVNNVFDPDVHKEGANPDPSKVTTVDINGTISPAGLASGFSGAYECYNNFGYFPKLIIAP-GYSPAAA 159 (419) Q Consensus 81 ~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~t~~~~~g~~~~~~~~tg~~a~~~~~~~~~~~p~~~~ap-~~~~~~~ 159 (419) +..|..+...........+.. ......+....+ ..+++..++.....+...|+..... ....... T Consensus 390 ~a~f~Gg~dgl~~~~ee~Y~~--lGgk~~d~g~lt------------~~aays~LE~~dVDlVil~ga~adtt~ga~~d~ 455 (717) T protein:vir:79 390 DAKFSGGKDELSLDKEEMYKR--LGGEKNEEGFVT------------KQGAYQYLENYEVDYVIPLGVHADTKLIGKYDD 455 (717) T ss_pred hhccCCCccccccchhhhhcc--cccccccccccc------------chhhhhhcCcceeEEEEecCccccccccchhhh Confidence 111221111111111000000 000000000000 0001111111111111111110000 0011112 Q ss_pred HH-HHHHHHhhc----cceeEEEE--eccCCCHHHHHhhhhh----------------------ccccccCccceEEecc Q lcl|NC_021557. 160 VR-AEMDVVASR----LHALAIAD--LPLGLTKQQAVAARGV----------------------AGTANTSSARTVLTYP 210 (419) Q Consensus 160 v~-a~l~~~~~~----~~~~~i~d--~p~~~~~~~~~~~~~~----------------------~~~~~~~s~~~~~~~p 210 (419) +. +..++|+.+ ..++.+++ .|.+...+...+++.. ....+++ .+...+++ T Consensus 456 va~alad~caalSal~r~ai~VI~l~sp~D~~~AtVe~~~~kLs~~Aaa~~~~d~~~a~a~~~~~~~idis-~y~~vv~~ 534 (717) T protein:vir:79 456 FAYQLALACAVMSHYNSVTIGIIPTTTPSDISLAGVEEHVKKLENYANEFYMRDRFGNIIFDADRNKIDLG-QFIEVVAG 534 (717) T ss_pred HHHHHHHHHHHhhhccccceeeeccccccccchhhHHHHHHHHHhhhhhhhhhcchhcccccccccccccc-ceeeeeec Confidence 22 233333321 12233333 2333322222111110 0001111 23333333 Q ss_pred eeEeeccccccceeeechHHHHHHHHHhhhhccCceecccCceeeceeecceecccccCCcchhhccccCCceEEEEEec Q lcl|NC_021557. 211 HVVIEDTTGATETRLDPLSSRLAGVIIATDLNEGWQNSPSNREIKGVVDLEVPINFYPSDYQNDTNFLNEAGIVTAMRSF 290 (419) Q Consensus 211 ~~~~~~~~~~~~~~~~p~s~~vAg~~a~~D~~~g~~~span~~l~gv~~~~~~~~~~~~~~~~~~~~L~~~gI~~i~~~~ 290 (419) +..+..+. .......||+|++||+ |.++|+|+||+|++|.|+.++.+.+ +..|.+.||++|||+|++++ T Consensus 535 ~~~iv~~~-~~~~~~~p~AG~vAGl----dA~rGVwkSPANk~I~GVvgLa~~l------T~sE~d~Ln~aGIntIr~~~ 603 (717) T protein:vir:79 535 PDFIVRNT-RLGQMASTPDASYIGM----VSQLKTQSAPTNKPLPSVTALRYTY------SANQLNRLTKARFATFKYKQ 603 (717) T ss_pred ceeEEEcC-CCceeecCHHHHHHHH----HhcCCcccccccceecccccCcccC------CHHHHHHHhhCCeEEEEEeC Confidence 33333333 3345667776666655 5667999999999999999998877 45678899999999999999 Q ss_pred CCcEEEEeccccCCCCCcccceeeehhhHHHHHHHHHHHHHHHhhcCCCCHHHHHHHHHHHHHHHHHHHhhc-ccceEEE Q lcl|NC_021557. 291 ATGIRVFGNRSAAFPTSSHVENFIHARRILDMIHEAIIFYTMNYVDRLGSPMTVEAAEEGVNAYLRSKTGIA-IYGGTFR 369 (419) Q Consensus 291 ~~G~~~wG~rT~~~~s~~~~~~~i~vrR~~~~i~~~~~~~~~~~v~e~n~~~~~~~i~~~i~~~L~~l~~~g-~~~~~v~ 369 (419) ++|+++||+||++.++ ..|+||++||++++|+++|++.++|++||||++.+|..|+.+|++||++||++| |.||+++ T Consensus 604 GrGirVWGaRTtasd~--sdWryInVRRl~D~Ie~sIr~al~~yVgEPNd~~tr~~Ik~sI~afL~~L~r~GAI~Gykvd 681 (717) T protein:vir:79 604 DGSIGVVDAPTSAHAG--SDYTRLSTARIVKEAVNAVREVADPFIGEPNDTGNRNALTAAVDKRLSKMIENKALLGFDFR 681 (717) T ss_pred CceEEEEeeeecCCCC--cccceeehhhhHHHHHHHHHHHHHHhccccCCHHHHHHHHHHHHHHHHHHHhcCceecceee Confidence 9999999999996543 359999999999999999999999999999999999999999999999999865 7788875 Q ss_pred EecccCCHHHhhCCEEEEEEEEEeccCceEEEEEEEEcc Q lcl|NC_021557. 370 FDRQKNTAEQIADGKFYYRLECHPISVMERITIDSYVDT 408 (419) Q Consensus 370 ~d~~~n~~~~i~~G~~~~~v~~~p~~p~e~i~~~~~~~~ 408 (419) + +||++++++|+++++|.++|++|+|||+|+++..- T Consensus 682 v---tnT~~di~~G~l~V~I~vaPv~PaEfI~ititITA 717 (717) T protein:vir:79 682 L---VVTPQQELLGEGSIELSLEAPNELRRLTTIVSLSA 717 (717) T ss_pred E---ecChhHhhCCEEEEEEEEEecCcccEEEEEEEEeC Confidence 5 89999999999999999999999999999999887 No 34 >protein:vir:103168 Length: 641 # NCBI annotation: gp122 # Family: family:all:661 # MgeID: mge:1583 # MgeName: Syn9 # Cross-refs: genbank:acc:YP_717789;genbank:gi:113200626;genbank:GeneID:4239177 Probab=100.00 E-value=1e-45 Score=267.29 Aligned_cols=282 Identities=17% Similarity=0.097 Sum_probs=188.8 Q ss_pred CCCccCCCeEEEEcCCCccCccccCccceEEEEccccccccccccccccccCcceeecchHHHHHHhcccchhhhHHHHH Q lcl|NC_021557. 1 MAATFHHGPEVIEHKDGVTVVRDVKSAVTYVNGTAPIQDVHATALAREDYINKRVIIRSRAEGAAAFGVHKAGYTIPAAL 80 (419) Q Consensus 1 Ma~~~~hGVyv~e~~~~~~~i~~v~tav~~~Vgta~~a~~~~~~~~~~~~~n~pv~its~~e~~~~fg~~~~~~~l~~al 80 (419) |+++..|||||+|++.+ ++|..+.|++.+|||++++++ .|+|++|+||.||.+.||.......+..++ T Consensus 3 m~~~~sPGVyv~E~~~~-~~i~~v~tsvaafvG~~~~GP-----------~~~p~~v~s~~d~~~~FG~~~~~~~l~~av 70 (641) T protein:vir:10 3 VSNQLSPGVVIQERDLT-AVTTPIGLNVGVLAAPFTKGP-----------VEEIFEVSTERDLASVFGEPNDYNYEYWFT 70 (641) T ss_pred CccccCCceEEEEecCC-CcccccCCccceEEecccCCC-----------CCccEEecCHHHHHHHcCCcCCCcchHHHH Confidence 99888999999999976 689999999999999987764 589999999999999999988889999999 Q ss_pred HHHhhccCCcEEEEeeccccccccc-----------------------------c--c--------------cccc---- Q lcl|NC_021557. 81 DAIFDQGDGGTIIVNNVFDPDVHKE-----------------------------G--A--------------NPDP---- 111 (419) Q Consensus 81 ~~~~~~~~~~~~v~~~~~~~~~~~~-----------------------------~--~--------------~~~~---- 111 (419) ..+|.|+|..|++++.......... . . ..+. T Consensus 71 ~~fF~ngG~~~~vvRv~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~A~~pG~~gn~l~v~i~~~~~~~~~~~ 150 (641) T protein:vir:10 71 ASQFLSYGGVLKAIRLNAASLKNSVDSGTAPLIKNLQEYETTYESSNSNTFKFASRDAGALGNSVGIFITDAGPDQIAVL 150 (641) T ss_pred HHHHHhcCCEEEEEEecCccccccccccchhhccccccccccccCcCccccEEEeccCCCcCCceEEEEEcCCCcceeee Confidence 9999999999999987421100000 0 0 0000 Q ss_pred ---------------------------------------------c-c--------------------cc---------- Q lcl|NC_021557. 112 ---------------------------------------------S-K--------------------VT---------- 115 (419) Q Consensus 112 ---------------------------------------------~-~--------------------~t---------- 115 (419) . . .+ T Consensus 151 ~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~~~~~~~~~~a~~~~~~i~~~~~g 230 (641) T protein:vir:10 151 PAPGTGNEWEFVADEAVTAASGASAKVYRYSIRLTLTNVVGTFVPGGATTISISGSDESVDVLAWDAGNKYLEIALPAGG 230 (641) T ss_pred ecccccccceeccceeeeeccCcccccccccccccccceeeecccCCcceeEecccccccccccccCCcceeeeeecCCc Confidence 0 0 00 Q ss_pred -------cceec-cccccc-----------------------------------------c------cccc--------- Q lcl|NC_021557. 116 -------TVDIN-GTISPA-----------------------------------------G------LASG--------- 131 (419) Q Consensus 116 -------~~~~~-g~~~~~-----------------------------------------~------~~tg--------- 131 (419) ..... +....+ . ..++ T Consensus 231 ~~g~~~~~~~~t~gt~~~t~a~~g~~~~~~~~~~~~~ia~aat~ag~~g~~~~v~~~~v~d~~~a~~~~~g~~~~~va~~ 310 (641) T protein:vir:10 231 VTGIFADAQVVTQGTNTAAIASSGIERRLYIGKDSGSINFAATDAVVDTNATSATISSVRNEYAEREYLPGSKWVNVAAR 310 (641) T ss_pred ceeeeeeeeeccCCccceeeecccchhhhhhccccccceeeeecccccccceeeEeeeeeeeeccccccccccccccccc Confidence 00000 000000 0 0000 Q ss_pred --chh-----------------------------hhhhhhhcc------------------------------------- Q lcl|NC_021557. 132 --FSG-----------------------------AYECYNNFG------------------------------------- 143 (419) Q Consensus 132 --~~a-----------------------------~~~~~~~~~------------------------------------- 143 (419) ... +.+.+..+. T Consensus 311 ~gts~~a~~~g~~~D~~~~lv~d~~~~~~g~~g~v~e~~~~~s~~~~~~~~~~~~~~~~~~~~~~s~~v~~~~~~~~~~~ 390 (641) T protein:vir:10 311 PGTSLYANSVGGVNDELHVLVIDVDGKITGNPGSVLERFIGVSKASDAKTSIGEVNYYKEVIKQQSAYVYWGSHETAPFL 390 (641) T ss_pred chhhhhhhhcCCcccceEEEEEeecceeeccccceeeeeecccccCCcccccccceeeeeeeccccceEEEecccccccc Confidence 000 000000000 Q ss_pred --------------------------------------------------c----------------------------- Q lcl|NC_021557. 144 --------------------------------------------------Y----------------------------- 144 (419) Q Consensus 144 --------------------------------------------------~----------------------------- 144 (419) + T Consensus 391 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~gG~d~~~~~~~~~~~~~~~~tg~~~~~~ 470 (641) T protein:vir:10 391 GTAANAAAGDWGASALNRRYNLLRSTAGTTSFPAGAVTVGSKNNATHYYRLANGADYSASGALYNLSNVDIATAYELIED 470 (641) T ss_pred cccccccccccccccccccccccccccccccccccccccCCCCcceeEEEeecCcccccccccccccchhHHHHHHHhhh Confidence 0 Q ss_pred -----cccccccch----hhhhhhHHHHHHHHhhccceeEEEEeccCCC---------HHHHHhhhhhccccccCccceE Q lcl|NC_021557. 145 -----FPKLIIAPG----YSPAAAVRAEMDVVASRLHALAIADLPLGLT---------KQQAVAARGVAGTANTSSARTV 206 (419) Q Consensus 145 -----~p~~~~ap~----~~~~~~v~a~l~~~~~~~~~~~i~d~p~~~~---------~~~~~~~~~~~~~~~~~s~~~~ 206 (419) +..+++.+. ......+.+++.+|+++.+||+++|+|.+.. .++...||. .+++|+|++ T Consensus 471 ~e~~~i~~l~~~~~~~~~~~~~~v~~~~i~~ce~~~d~~ailD~p~~~~~~~~~~~~~~~~~~~~~~----~~~~s~yaa 546 (641) T protein:vir:10 471 PESQVIDYVLSGPAGADEAAAIAKATTITTIVESRKDCMAFLSPLRSDVIGVSNTTTVTENLVNYFN----QLPSSNYVV 546 (641) T ss_pred hhhhccceeeecCCCCCcchhHHHHHHHHHHHHhcCCEEEEEcCCcccccCCCchhhHHHHHHHHHh----hcCCCceEE Confidence 000000000 0111234456677888888999999997542 234556653 357899999 Q ss_pred EecceeEeeccccccceeeechHHHHHHHHHhhhhccCceecccCc---eeeceeecceecccccCCcchhhccccCCce Q lcl|NC_021557. 207 LTYPHVVIEDTTGATETRLDPLSSRLAGVIIATDLNEGWQNSPSNR---EIKGVVDLEVPINFYPSDYQNDTNFLNEAGI 283 (419) Q Consensus 207 ~~~p~~~~~~~~~~~~~~~~p~s~~vAg~~a~~D~~~g~~~span~---~l~gv~~~~~~~~~~~~~~~~~~~~L~~~gI 283 (419) +||||++++|+. +++.+++||||++||+|||+|.+|||||||||. .|+|+++++..+ ++.|++.||++|| T Consensus 547 ~y~P~~~v~dp~-~~~~~~vPpsG~iAGv~ArtD~~rGvwkAPAn~~~~~i~gv~~l~~~~------~~~e~~~Lnp~gI 619 (641) T protein:vir:10 547 FDSGYKYIYDKY-NDVYRYVPCNGDIAGLILETGLEEEPWFSPAGFQRGVLRNAVKLAYSP------NKTQRDRLYANRI 619 (641) T ss_pred EEeceeEeeccc-CCceeEecCCHHHHHHHHhhhccCCceECcCCcccceeeeeeeeeEec------ChhHHhhhhhccc Confidence 999999999975 577899999999999999999999999999998 489999998887 3567889999999 Q ss_pred EEEEEecCCcEEEEeccccCCCCCcccc Q lcl|NC_021557. 284 VTAMRSFATGIRVFGNRSAAFPTSSHVE 311 (419) Q Consensus 284 ~~i~~~~~~G~~~wG~rT~~~~s~~~~~ 311 (419) ||||.|+|+|++- +.-. . +... T Consensus 620 N~ir~fpg~G~v~--~~~~-~---~~~~ 641 (641) T protein:vir:10 620 NPVVSFPGHAMIN--NNIA-F---HTKL 641 (641) T ss_pred ceEEecCCceeec--ceee-e---eecC Confidence 9999999999631 2111 0 0000 No 35 >protein:vir:63742 Length: 562 # NCBI annotation: Tsh # Family: family:all:2449 # MgeID: mge:1517 # MgeName: P100 # Cross-refs: genbank:gi:82547629;genbank:GeneID:3783475 Probab=100.00 E-value=5.4e-43 Score=252.31 Aligned_cols=374 Identities=13% Similarity=0.104 Sum_probs=252.7 Q ss_pred CCCccCCCeEEEEcCCCccCccccCccceEEEEccccccccccccccccccCcceeecchHHHHHHhcccchhhhHHHHH Q lcl|NC_021557. 1 MAATFHHGPEVIEHKDGVTVVRDVKSAVTYVNGTAPIQDVHATALAREDYINKRVIIRSRAEGAAAFGVHKAGYTIPAAL 80 (419) Q Consensus 1 Ma~~~~hGVyv~e~~~~~~~i~~v~tav~~~Vgta~~a~~~~~~~~~~~~~n~pv~its~~e~~~~fg~~~~~~~l~~al 80 (419) |-++.+||||++|.+++++++..+.+++.+|||.++.+++ ++|+++++|.++.+.||. +.|.+++ T Consensus 8 ~~~~~~Pgv~~~~~~s~~~~~~~~~~~~~~~iG~a~~G~~-----------~~~~~~~~~~~~~~~fg~----g~l~~~i 72 (562) T protein:vir:63 8 RKPVSRPHTEISVDTSGIGGSSSGSEKILCLVGSATGGKP-----------NAVYKVRNYSQAKSVFRS----GELLDAI 72 (562) T ss_pred CCcccCCceEEEEecCCCcccCCCCCceEEEEEEeCCCCC-----------ceeEEEccHHHHHHHhcC----CchHHHH Confidence 5667789999999999999999999999999999887743 899999999999999988 4466777 Q ss_pred HHHh----hccCCcEEEEeecccccccccccccc----------------------c----------------------- Q lcl|NC_021557. 81 DAIF----DQGDGGTIIVNNVFDPDVHKEGANPD----------------------P----------------------- 111 (419) Q Consensus 81 ~~~~----~~~~~~~~v~~~~~~~~~~~~~~~~~----------------------~----------------------- 111 (419) .+.| .+++..++.+++...........+.. + T Consensus 73 ~~a~~~~~~~g~~~~~~~rv~~a~~a~~~~~~~~~~a~~~g~~~n~i~v~~~~~~~~~~~~~~v~~~~~~~~ev~~~~g~ 152 (562) T protein:vir:63 73 ERAWNPGEGTGAGDILAMRVEEAKEATFEAEGVKVSSTIYGADANDIQVALEDNTITGTKRLSIVFAKERVNQVYDNLGS 152 (562) T ss_pred HHhccccccCCceEEEEEEcCCCccceeEecceeEEEeecccCCCeEEEEEecCCCCCCcceEEEecCCCcchhhhhccc Confidence 6666 57777777665522111000000000 0 Q ss_pred ---------------------cccccc---eecccc-------------------------------------------- Q lcl|NC_021557. 112 ---------------------SKVTTV---DINGTI-------------------------------------------- 123 (419) Q Consensus 112 ---------------------~~~t~~---~~~g~~-------------------------------------------- 123 (419) ...+.. ...|.. T Consensus 153 V~~i~y~g~~~~~~~~v~~~~~~~~a~~l~~~~g~~~v~~~~L~~g~~~~~~~l~~~in~~~~~~aky~~~~gn~i~~~~ 232 (562) T protein:vir:63 153 IFSIKYKGTEASATFTVAVDPVTFKATKLTLKAGDKTVKEYDLGSGAYAETNVLISDINNLPDFEAKFFPIGDKNLTTDN 232 (562) T ss_pred eeeeeeecccccceEEEEecCcceeEEEEEeecCCcceeEEEecCCccchhHHHHHhhccccceEEEeeccCCceeeeec Confidence 000000 000000 Q ss_pred ----ccccccccch-----------------------------------------------hhhhhhhhccccccccccc Q lcl|NC_021557. 124 ----SPAGLASGFS-----------------------------------------------GAYECYNNFGYFPKLIIAP 152 (419) Q Consensus 124 ----~~~~~~tg~~-----------------------------------------------a~~~~~~~~~~~p~~~~ap 152 (419) .....++... ...+++..+.......+.+ T Consensus 233 ~d~~~~~~vkt~~~~v~t~~~d~~~~~~~~~~v~~~~~~~~~la~~~~~~LtGG~dGt~~~~~~~al~ale~~~~~~i~~ 312 (562) T protein:vir:63 233 FDAQIDVDIKTKEAYVKAVGGDIEKQTAYNGYVDFEFDRSKEIANFPLTKLTGGDNGTIPESWADKFSYFANEGGYYLVP 312 (562) T ss_pred cccccccchhhhhhhhhhhhhhhhhcccccceeeeeeccccceecccceeeecCCCCCchhhHHHHHHHHHhCCcEEEEe Confidence 0000000000 0000000000000000011 Q ss_pred hhhhhhhHHHHHHHHhh-----ccceeEEEEeccCCCHHHHHhhhhhccccccCccceEEecceeEeeccccccceeeec Q lcl|NC_021557. 153 GYSPAAAVRAEMDVVAS-----RLHALAIADLPLGLTKQQAVAARGVAGTANTSSARTVLTYPHVVIEDTTGATETRLDP 227 (419) Q Consensus 153 ~~~~~~~v~a~l~~~~~-----~~~~~~i~d~p~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~~~~~~~~~~~~p 227 (419) .+..+++++++.++++ ...++++++.+.+.+.++..... ..+++.+.+.++|+....+. .+....+| T Consensus 313 -~t~d~av~~~l~a~vkr~~~~g~~~~aVlg~~~~~~~~~~~~~a-----~~~n~ervv~v~~~~~~~~~--~~~~~~~~ 384 (562) T protein:vir:63 313 -LTSKQAVHAEALQFVRDCSYNGNPMRVFVGGGIGESMEQLFTRA-----IGLQNERAGLIGFSGTVKMD--DGRSLKMP 384 (562) T ss_pred -cCCCHHHHHHHHHHHHHHHhCCCcEEEEecCCCCCCHHHHHHHh-----hhcCCCcEEEEecCeeEECC--CCceeeec Confidence 1222344454444332 23468888887777776665543 35788999999998766543 33445567 Q ss_pred h---HHHHHHHHHhhhhccCceecccCceeeceeecceecccccCCcchhhccccCCceEEEEEecCCcEEEEec-cccC Q lcl|NC_021557. 228 L---SSRLAGVIIATDLNEGWQNSPSNREIKGVVDLEVPINFYPSDYQNDTNFLNEAGIVTAMRSFATGIRVFGN-RSAA 303 (419) Q Consensus 228 ~---s~~vAg~~a~~D~~~g~~~span~~l~gv~~~~~~~~~~~~~~~~~~~~L~~~gI~~i~~~~~~G~~~wG~-rT~~ 303 (419) + ++++||+++.+| +++||.|+.+. ..++...+ .+.|.+.|+++|++++....+++.++|.. +++. T Consensus 385 ~~~~aa~vAGl~A~~~----~~~SlT~~~i~-~~~v~~~~------t~~e~~~li~~Gv~~l~~~~~~~v~~~~iv~~it 453 (562) T protein:vir:63 385 GYMFAAQVAGLTCGLE----IGEAITFKNIA-IETLDTIY------EGSQLDQLNESGIITAEFVRNRAVTNFRIVDDVT 453 (562) T ss_pred hhHHHHHHHHHhhcCc----hhcCccceeec-cccccccC------CHHHHHHHHhCCeEEEEEecCCcEEEEEeeccce Confidence 6 889999999887 88999999986 55665443 56788899999999998877777777754 3322 Q ss_pred --CCCCcccceeeehhhHHHHHHHHHHHHHH-HhhcCCCCHHHHHHHHHHHHHHHHHHHhhc-ccceEEEEecccCCHHH Q lcl|NC_021557. 304 --FPTSSHVENFIHARRILDMIHEAIIFYTM-NYVDRLGSPMTVEAAEEGVNAYLRSKTGIA-IYGGTFRFDRQKNTAEQ 379 (419) Q Consensus 304 --~~s~~~~~~~i~vrR~~~~i~~~~~~~~~-~~v~e~n~~~~~~~i~~~i~~~L~~l~~~g-~~~~~v~~d~~~n~~~~ 379 (419) ..++++.|++|+++|++|+|.+.++..+. +|++|||+...|..++..+..||.+|++.| |.+|... +-+.. T Consensus 454 T~t~~~~~~~~ki~viRv~D~i~~dir~~~~~~yiGk~Nn~~~r~~v~~~i~~~L~~l~~~gaI~~~~~~-----dv~v~ 528 (562) T protein:vir:63 454 TFNDKTDPVKSEIGVGEANDFLVSELKISLDNEYIGTKIIDTSASLVKNFVQSFLDRKKLAKEIQDYSPE-----EVQVV 528 (562) T ss_pred ecCCCCCchhhhhhhhHHHHHHHHHHHHHHHhcCCccccChHHHHHHHHHHHHHHHHHHhCCcccCCCcc-----ceEEE Confidence 23567889999999999999999988865 999999999999999999999999999977 5555321 11223 Q ss_pred hhCCEEEEEEEEEeccCceEEEEEEEEcchHHHH Q lcl|NC_021557. 380 IADGKFYYRLECHPISVMERITIDSYVDTKFISN 413 (419) Q Consensus 380 i~~G~~~~~v~~~p~~p~e~i~~~~~~~~~~~~~ 413 (419) +..+++++++.++|+.|+|+|.+++.+.++-+++ T Consensus 529 ~~~d~~~v~~~v~pv~~mekIy~ti~~~~~~~~~ 562 (562) T protein:vir:63 529 IEGDVARISLTVFPIRSMKKIEVSLVYRQQILTA 562 (562) T ss_pred ecCCEEEEEEEEEEcccceEEEEEEEEeeeeecC Confidence 5668899999999999999999999999998877 No 36 >protein:vir:80779 Length: 569 # NCBI annotation: putative tail sheath protein # Family: family:all:2449 # MgeID: mge:1885 # MgeName: phiEF24C # Cross-refs: genbank:acc:YP_001504132;genbank:gi:158079319;genbank:GeneID:5666428 Probab=100.00 E-value=5.7e-42 Score=246.70 Aligned_cols=374 Identities=11% Similarity=0.089 Sum_probs=251.9 Q ss_pred CC-------CccCCCeEEEEcCCCccCccccCccceEEEEccccccccccccccccccCcceeecchHHHHHHhcccchh Q lcl|NC_021557. 1 MA-------ATFHHGPEVIEHKDGVTVVRDVKSAVTYVNGTAPIQDVHATALAREDYINKRVIIRSRAEGAAAFGVHKAG 73 (419) Q Consensus 1 Ma-------~~~~hGVyv~e~~~~~~~i~~v~tav~~~Vgta~~a~~~~~~~~~~~~~n~pv~its~~e~~~~fg~~~~~ 73 (419) || ..-+||||+++.++++.++..+.+++.+|||.|+.+++ |+|+++++|.++.+.||. T Consensus 1 ~~~~~~~~~~~~~Pgv~~~~~~~~~~~~~~~~~~~~~~ig~a~~G~~-----------~~~~~~~~~~~~~~~f~~---- 65 (569) T protein:vir:80 1 MAVEQFPRKKVSRPHTEITVDTSGIGGSSSSSDKTLMLVGSAKGGKP-----------DTVYRFRNYQQAKQVLRS---- 65 (569) T ss_pred CeeeeecCCccccCceEEEEecCCCcCCCCCCceeEEEEEEeCCCCC-----------ceeEEecCHHHHHHHhcC---- Confidence 66 33479999999999999999999999999999887753 899999999999999987 Q ss_pred hhHHHHHHHHh------hccCCcEEEEeeccccccccccc--------------------------------------cc Q lcl|NC_021557. 74 YTIPAALDAIF------DQGDGGTIIVNNVFDPDVHKEGA--------------------------------------NP 109 (419) Q Consensus 74 ~~l~~al~~~~------~~~~~~~~v~~~~~~~~~~~~~~--------------------------------------~~ 109 (419) +.|.+++...| .+++..+++++............ .. T Consensus 66 g~l~~a~~~a~~~~~~~~~~~~~~~~~rv~~a~~a~~~~~~~~~~a~~~g~~~n~i~v~l~~~~~~~~~~~~v~~~~~~~ 145 (569) T protein:vir:80 66 GDLLDAIELAWNASDVNTASAGDILAVRVEDAKNATLTKGGLTFASTIYGVDANEIQVALEDNNLTHTKRLTVAFSKDGY 145 (569) T ss_pred CchhHHHHhhccCccccccCceEEEEEEcCCCeeeeeeccceeeeeeeccCCCceEEEEEecCcCCcceeeEEeeecCCC Confidence 44777777665 35555566665421000000000 00 Q ss_pred cc-----------------c---------cc--ccce---ecccccc-------------------------------cc Q lcl|NC_021557. 110 DP-----------------S---------KV--TTVD---INGTISP-------------------------------AG 127 (419) Q Consensus 110 ~~-----------------~---------~~--t~~~---~~g~~~~-------------------------------~~ 127 (419) +. . .. +... ..+.... +. T Consensus 146 ~~~~~~ig~v~si~ytg~~~~a~~~~~~~~~~~~a~~l~~~~g~~~~~~~~v~~~~~~~~~~~~~~~lv~~~~~~~~f~a 225 (569) T protein:vir:80 146 KKVFDNLGKIFSIQYKGSEAQANFTIAQDSISKKATTLTLNVGSEPESTTEVMKYELGQGVYSETNVLVSAINSLPDWEA 225 (569) T ss_pred ccccccccceeeEEEeeccccceEEeecCcCcceeEEEEEEecCCcceeEEEEeeccCCccchhhhhhhhhcCCccCceE Confidence 00 0 00 0000 0000000 00 Q ss_pred ----------------------ccc---cch--------------------------------------------hhhhh Q lcl|NC_021557. 128 ----------------------LAS---GFS--------------------------------------------GAYEC 138 (419) Q Consensus 128 ----------------------~~t---g~~--------------------------------------------a~~~~ 138 (419) .++ .+. ...++ T Consensus 226 ~~~~~~~~~~~~~~~d~~~~~~~~t~~~~~~~~~~di~~~~~~~~~v~~~~~~~~~l~~~~~~~LtGG~dG~~~~~~~~~ 305 (569) T protein:vir:80 226 KFFPIGDKNLPTDALEAVTKVDVKTEAVFVGALAGDIAKQLEYNDYVTVAVDATKPVEDFELTNLTGGSDGTAPESWANK 305 (569) T ss_pred EEEecCCCcceehhccchhheeccccceeeehhHHHHHHhhcCCceEEEEecCCcceeeecceeecCCCCCCccchHHHH Confidence 000 000 00000 Q ss_pred hhhccccccccccchhhhhhhHHHHHHHHhhc-----cceeEEEEeccCCCHHHHHhhhhhccccccCccceEEecceeE Q lcl|NC_021557. 139 YNNFGYFPKLIIAPGYSPAAAVRAEMDVVASR-----LHALAIADLPLGLTKQQAVAARGVAGTANTSSARTVLTYPHVV 213 (419) Q Consensus 139 ~~~~~~~p~~~~ap~~~~~~~v~a~l~~~~~~-----~~~~~i~d~p~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~ 213 (419) +..+.......+.+ .+..+++++++.+++++ ..++++++.+.+.+.+++.... .++++.+.++++||.. T Consensus 306 l~~le~~~~~~i~~-~t~d~av~~~l~a~vkr~r~~g~~~~aVvg~~~~~~~~~~~~~a-----~~~n~e~vv~v~~~~~ 379 (569) T protein:vir:80 306 FPLLANEGGYYLVP-LTDKQAVHSEALAFVKDRTDNGDPMRIIVGGGTNETVEESITRA-----TNLRDPRASLVGFSGT 379 (569) T ss_pred HHHHhhCCcEEEEe-cCCChHHHHHHHHHHHHHHhCCCcEEEEecCCCCCCHHHHHHHH-----hhcCCCeEEEEecCce Confidence 00000000000111 12234555555554432 3478889888888877777654 4688999999999987 Q ss_pred eeccccccceeeech---HHHHHHHHHhhhhccCceecccCceeeceeecceecccccCCcchhhccccCCceEEEEEec Q lcl|NC_021557. 214 IEDTTGATETRLDPL---SSRLAGVIIATDLNEGWQNSPSNREIKGVVDLEVPINFYPSDYQNDTNFLNEAGIVTAMRSF 290 (419) Q Consensus 214 ~~~~~~~~~~~~~p~---s~~vAg~~a~~D~~~g~~~span~~l~gv~~~~~~~~~~~~~~~~~~~~L~~~gI~~i~~~~ 290 (419) +.+. .+....+|+ ++++||+++.++ +++||.|+.+. +.++...+ ...|.+.|+++|+.+++..+ T Consensus 380 ~~~~--~g~~~~~~~~~~aa~vAG~~A~~~----~~~S~T~k~i~-~~~i~~~l------t~~e~~~li~~G~~~l~~~~ 446 (569) T protein:vir:80 380 RKMD--DGRLLKLPGYMMASQIAGIASGLE----VGEAITFKHFN-VTSVDRVF------ESSQLDMLNESGVISIEFVR 446 (569) T ss_pred eecC--CCcceeechhhHHHHHHHHHhcCc----cccCccceeec-cccccccC------CHHHHHHHHhCCeEEEEEec Confidence 7653 223344555 678888887776 99999999987 56666654 46788899999999998887 Q ss_pred CCcEEEEec---cccCCCCCcccceeeehhhHHHHHHHHHHHHH-HHhhcCCCCHHHHHHHHHHHHHHHHHHHhhc-ccc Q lcl|NC_021557. 291 ATGIRVFGN---RSAAFPTSSHVENFIHARRILDMIHEAIIFYT-MNYVDRLGSPMTVEAAEEGVNAYLRSKTGIA-IYG 365 (419) Q Consensus 291 ~~G~~~wG~---rT~~~~s~~~~~~~i~vrR~~~~i~~~~~~~~-~~~v~e~n~~~~~~~i~~~i~~~L~~l~~~g-~~~ 365 (419) +++.++|.. -|.-...+++.|++++++|++|+|.+.++..+ .+|++|||+...|..++..+..||.+||+.| |.+ T Consensus 447 ~~~~~v~~~vn~itT~t~~~~~~~~~i~viRv~D~i~~dir~~~~~~yiGk~nn~~~r~~v~~~i~~~L~~l~~~gaI~~ 526 (569) T protein:vir:80 447 NRTLTAFRVVQDVTTYNDKSDPVKNEMSVGEANDFLVSELKIELDNNFIGTKVIDTSASLIKNFIQSFLDNKKRAREIQD 526 (569) T ss_pred CceEEEEEEeccceecCCCCCchhhhhhhhHHHHHHHHHHHHHHHhhcCcccCChhHHHHHHHHHHHHHHHHHhCCcccC Confidence 777777743 22223356788999999999999999999876 5899999999999999999999999999987 555 Q ss_pred eEEEEecccCCHHHhhCCEEEEEEEEEeccCceEEEEEEEEcchHHHH Q lcl|NC_021557. 366 GTFRFDRQKNTAEQIADGKFYYRLECHPISVMERITIDSYVDTKFISN 413 (419) Q Consensus 366 ~~v~~d~~~n~~~~i~~G~~~~~v~~~p~~p~e~i~~~~~~~~~~~~~ 413 (419) |... +-+.++..+++++++.++|+.|+|+|.+++.+.++-+++ T Consensus 527 ~~~~-----dv~v~~~~d~~~v~~~v~Pv~~~ekI~~ti~~~~~~~~~ 569 (569) T protein:vir:80 527 YTPE-----EVQVVLEGDVASISMTVMPIRSLNKITVQLVYKQQILTA 569 (569) T ss_pred CCcc-----ceEEEecCCEEEEEEEEEEcccccEEEEEEEEeeeeecC Confidence 5321 122335678999999999999999999999999998877 No 37 >protein:vir:80488 Length: 562 # NCBI annotation: Tsh # Family: family:all:2449 # MgeID: mge:1883 # MgeName: A511 # Cross-refs: genbank:acc:YP_001468473;genbank:gi:157325048;genbank:GeneID:5601446 Probab=100.00 E-value=5.8e-42 Score=246.65 Aligned_cols=374 Identities=14% Similarity=0.116 Sum_probs=250.8 Q ss_pred CC-------CccCCCeEEEEcCCCccCccccCccceEEEEccccccccccccccccccCcceeecchHHHHHHhcccchh Q lcl|NC_021557. 1 MA-------ATFHHGPEVIEHKDGVTVVRDVKSAVTYVNGTAPIQDVHATALAREDYINKRVIIRSRAEGAAAFGVHKAG 73 (419) Q Consensus 1 Ma-------~~~~hGVyv~e~~~~~~~i~~v~tav~~~Vgta~~a~~~~~~~~~~~~~n~pv~its~~e~~~~fg~~~~~ 73 (419) || .+.+||||++|.++++.++..+.+++.+|||.++.+++ |+|++++++.++.+.||. T Consensus 1 ~~~~~~~~~~~~~pgv~~~~~~s~~~~~~~~~~~~~~~ig~a~~G~~-----------~~~~~~~~~~~~~~~f~~---- 65 (562) T protein:vir:80 1 MAIEIYPRKPVSRPHTEISVDTSGIGGSSSGSEKILCLVGSATGGKP-----------NAVYKVRNYSQAKSVFRS---- 65 (562) T ss_pred CeeeeeCCCcccCCceEEEEecCCcccCCCCCCceEEEEEEeCCCCc-----------ceeEEEccHHHHHHHhcC---- Confidence 65 44579999999999999999999999999999887753 899999999999999987 Q ss_pred hhHHHHHHHHh----hccCCcEEEEeecccccccccccccc-----------------------ccc------------- Q lcl|NC_021557. 74 YTIPAALDAIF----DQGDGGTIIVNNVFDPDVHKEGANPD-----------------------PSK------------- 113 (419) Q Consensus 74 ~~l~~al~~~~----~~~~~~~~v~~~~~~~~~~~~~~~~~-----------------------~~~------------- 113 (419) +.|.+++.+.| .+++..++.+++...........+.. .++ T Consensus 66 g~l~~~i~~a~~~~~~~g~~~~~~~rv~~a~~a~~~~~~~~~~~~~~g~~~n~i~v~~~~~~~~~~~~~~v~~~~~~~~e 145 (562) T protein:vir:80 66 GELLDAIERAWNPGEGTGAGDILAMRVEEAKEATFEAEGVKVSSTIYGADANDIQVALEDNTITGTKRLSIVFAKERVNQ 145 (562) T ss_pred CChHHHHHHhcccccccCceEEEEEEcCCCCcceEEecceEEEEeecccCCCceEEEEecCCCCCCcceEEEecCCcceE Confidence 44666666666 47777776665522111110000000 000 Q ss_pred -------c---------------------c--cce--e-ccccc------ccc-ccccchhh------------------ Q lcl|NC_021557. 114 -------V---------------------T--TVD--I-NGTIS------PAG-LASGFSGA------------------ 135 (419) Q Consensus 114 -------~---------------------t--~~~--~-~g~~~------~~~-~~tg~~a~------------------ 135 (419) + + ... + .|... ..+ ..+...+. T Consensus 146 v~~~~g~v~~i~y~g~~~~a~~~i~~~~~~~~a~~l~~~~g~~~v~~~~l~~g~~~~~~~l~~~i~~~~~~tAky~g~~~ 225 (562) T protein:vir:80 146 VYDNLGSIFSIKYKGTEASATFTVAVDPVTFKATKLTLKAGDKTVKEYDLGSGAYAETNVLISDINNLPDFEAKFFPIGD 225 (562) T ss_pred EeeccCceeeeeeccccccceeEEEecCccceEEEEEEecCCcceeEEEeCCCccchhhhhhhhhccccceEEEecccCC Confidence 0 0 000 0 00000 000 00000000 Q ss_pred ----------------------------------------------------------------------hhhhhhcccc Q lcl|NC_021557. 136 ----------------------------------------------------------------------YECYNNFGYF 145 (419) Q Consensus 136 ----------------------------------------------------------------------~~~~~~~~~~ 145 (419) .+++..+... T Consensus 226 n~i~~~~~d~~~~~~~kt~~~~v~~~~~d~~~~~~~n~~v~~~~~~~~~la~~~~~~LtGG~dG~~~~~~~dal~~Le~~ 305 (562) T protein:vir:80 226 KNLTTDNFDAQIDVDIKTKEAYVKAVGGDIEKQTAYNGYVEFEFDRSKEIANFPLTKLTGGDNGTIPESWADKFSYFANE 305 (562) T ss_pred ceeeecccccchhhhcccceeeeeehhhhhhhcccccceEEEEeccCccccccceeeeeCCCCCCccccHHHHHHHHHhC Confidence 0000000000 Q ss_pred ccccccchhhhhhhHHHHHHHHhh-----ccceeEEEEeccCCCHHHHHhhhhhccccccCccceEEecceeEeeccccc Q lcl|NC_021557. 146 PKLIIAPGYSPAAAVRAEMDVVAS-----RLHALAIADLPLGLTKQQAVAARGVAGTANTSSARTVLTYPHVVIEDTTGA 220 (419) Q Consensus 146 p~~~~ap~~~~~~~v~a~l~~~~~-----~~~~~~i~d~p~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~~~~~~ 220 (419) ....+. ..+..+++++++.++++ ...+++++..+.+.+.+++.... ..+++.+.+.++|+..+.+. . T Consensus 306 ~~~~i~-~~t~d~ai~~~~~a~vkr~r~~g~~~~aVvg~~~~~~~~~~~~~a-----~~~n~e~vv~v~~~~~~~~~--~ 377 (562) T protein:vir:80 306 GGYYLV-PLTSKQAVHAEALQFVRDCSYNGNPMRVFVGGGIGESMEQLFTRA-----IGLQNERAGLIGFSGTVKMD--D 377 (562) T ss_pred CcEEEE-ecCCChHHHHHHHHHHHHHHhCCCeEEEEecCCCCCCHHHHHHHh-----hhcCCCeEEEEecCeeEECC--C Confidence 000000 11223444454444332 23467888888777777776643 35788999999998766543 2 Q ss_pred cceeeech---HHHHHHHHHhhhhccCceecccCceeeceeecceecccccCCcchhhccccCCceEEEEEecCCcEEEE Q lcl|NC_021557. 221 TETRLDPL---SSRLAGVIIATDLNEGWQNSPSNREIKGVVDLEVPINFYPSDYQNDTNFLNEAGIVTAMRSFATGIRVF 297 (419) Q Consensus 221 ~~~~~~p~---s~~vAg~~a~~D~~~g~~~span~~l~gv~~~~~~~~~~~~~~~~~~~~L~~~gI~~i~~~~~~G~~~w 297 (419) +.....|+ ++++||++|.+| +++||.|+.+.+ .++...+ .+.|.+.|+++|+.+++...+++.++| T Consensus 378 ~~~~~~~~~~~aa~vAGl~Ag~~----~~~S~T~~~i~~-~~v~~~l------t~~e~~~li~~G~l~l~~~~~~~v~~~ 446 (562) T protein:vir:80 378 GRSLKMPGYMFAAQVAGLTCGLE----IGEAITFKNIAI-ETLDTIY------EGSQLDQLNESGIITAEFVRNRAVTNF 446 (562) T ss_pred CceeeechhHHHHHHHHHHhcCc----cccCccceeecc-ccccccC------CHHHHHHHHhCCeEEEEEecCCcEEEE Confidence 33445566 889999999987 889999999985 4554443 467888999999999988777777777 Q ss_pred ec-ccc--CCCCCcccceeeehhhHHHHHHHHHHHHH-HHhhcCCCCHHHHHHHHHHHHHHHHHHHhhc-ccceEEEEec Q lcl|NC_021557. 298 GN-RSA--AFPTSSHVENFIHARRILDMIHEAIIFYT-MNYVDRLGSPMTVEAAEEGVNAYLRSKTGIA-IYGGTFRFDR 372 (419) Q Consensus 298 G~-rT~--~~~s~~~~~~~i~vrR~~~~i~~~~~~~~-~~~v~e~n~~~~~~~i~~~i~~~L~~l~~~g-~~~~~v~~d~ 372 (419) .. +++ ...++++.|++|+++|++|+|.+.+++.+ .||++|||+...|..++..+..||.+|++.| |.+|... T Consensus 447 riv~~itT~t~~~~~~~~ki~viRv~D~i~~dir~~~~~~yIGk~Nn~~~r~~v~~~i~~~L~~l~~~gaI~~~~~~--- 523 (562) T protein:vir:80 447 RIVDDVTTFNDKTDPVKSEIGVGEANDFLVSELKISLDNEYIGTKIIDTSASLVKNFVQSFLDRKKLAKEIQDYSPE--- 523 (562) T ss_pred EeeccceeccCCCCchhhhhhhhHHHHHHHHHHHHHHHhcCCccccChHHHHHHHHHHHHHHHHHHhCCcccCCCcc--- Confidence 22 222 23356789999999999999999999887 5899999999999999999999999999877 5555421 Q ss_pred ccCCHHHhhCCEEEEEEEEEeccCceEEEEEEEEcchHHHH Q lcl|NC_021557. 373 QKNTAEQIADGKFYYRLECHPISVMERITIDSYVDTKFISN 413 (419) Q Consensus 373 ~~n~~~~i~~G~~~~~v~~~p~~p~e~i~~~~~~~~~~~~~ 413 (419) +-+-+..++++++++.++|+.|+|+|.+++.+.++-+++ T Consensus 524 --dv~v~~~~d~~~v~~~v~Pv~~mekIy~ti~~~~~~~~~ 562 (562) T protein:vir:80 524 --EVQVVIEGDIARISLTVFPIRSMKKIEVSLVYRQQILTA 562 (562) T ss_pred --ceEEEecCCEEEEEEEEEEcccceEEEEEEEEEeeeecC Confidence 112235678899999999999999999999999998877 No 38 >protein:vir:102819 Length: 648 # NCBI annotation: tail sheath protein # Family: family:all:2449 # MgeID: mge:1610 # MgeName: YS40 # Cross-refs: genbank:acc:YP_874082;genbank:gi:118197689;genbank:GeneID:4496011 Probab=100.00 E-value=1.1e-37 Score=223.20 Aligned_cols=373 Identities=13% Similarity=0.070 Sum_probs=229.6 Q ss_pred CCCc--c------CCCeEEEEcCCCccCccccCccceEEEEccccccccccccccccccCcceeecchHHHHHHhcccch Q lcl|NC_021557. 1 MAAT--F------HHGPEVIEHKDGVTVVRDVKSAVTYVNGTAPIQDVHATALAREDYINKRVIIRSRAEGAAAFGVHKA 72 (419) Q Consensus 1 Ma~~--~------~hGVyv~e~~~~~~~i~~v~tav~~~Vgta~~a~~~~~~~~~~~~~n~pv~its~~e~~~~fg~~~~ 72 (419) ||-. | +|||||+|++++.+++..+.|++.+|||.++.++ .|+|++++||.+|...||. T Consensus 1 ma~~~yf~~~~~~~PGVyvee~~sg~~~i~gv~tsva~fvG~a~~Gp-----------~~~p~~v~s~~~~~~~fgg--- 66 (648) T protein:vir:10 1 MAISVYFDGKLIKQLGAYVKTDLSAVKQINGVGTGIVALLGLAEGGE-----------TYKPYRLTSFAEAVSIFKG--- 66 (648) T ss_pred CeeeeeeCCCCccCCceEEEEeccccccccCCCCceEEEEEeeCCCC-----------CceeEEecCHHHHHHHhcC--- Confidence 7742 2 3999999999999999999999999999988774 4899999999999999986 Q ss_pred hhhHHHHHHHHhhccCCcEEEEeecccccccccc------------------------ccccc----------------- Q lcl|NC_021557. 73 GYTIPAALDAIFDQGDGGTIIVNNVFDPDVHKEG------------------------ANPDP----------------- 111 (419) Q Consensus 73 ~~~l~~al~~~~~~~~~~~~v~~~~~~~~~~~~~------------------------~~~~~----------------- 111 (419) +.|.++++.+|.+|+..++++++.......... .+... T Consensus 67 -g~l~~av~~~F~nGg~~~~~vRv~~~~~a~~~~~~~~~~a~~~g~~gn~i~~~v~~~~~~~~~~~~l~v~~~~~~~~~d 145 (648) T protein:vir:10 67 -GPLLEHIKAAFIGGAGEVVAVRIGNPTTASVSIPVAQNTSDTSPANLNFVSYEASTRSNQIYVSFDLDENFTSANEADD 145 (648) T ss_pred -ccHHHHHHHHHhCCCcEEEEEEcCCCcccceecceeEEeecccCCCCCceEEEEEEcCCCcCceeEEEEEecCCCcccc Confidence 568999999999999999998763211100000 00000 Q ss_pred ------------------c---cc--ccce--------------------ecc----------c---------------- Q lcl|NC_021557. 112 ------------------S---KV--TTVD--------------------ING----------T---------------- 122 (419) Q Consensus 112 ------------------~---~~--t~~~--------------------~~g----------~---------------- 122 (419) . .. +... ..+ . T Consensus 146 ~~v~~i~~~~~~y~gt~~~~t~~v~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~ 225 (648) T protein:vir:10 146 TIIFTIYQKHPDFSVTRETFTFPRKFTTPTVLVKRGSTLFFVDRSIVNAALAAGPAFQTALINLLKEQLQPTDVVQIFDA 225 (648) T ss_pred eeEEEeccCCCcccccceeccccccccccccccccccceeecCccchhhhhccCccchhhhhhchhhhhhhhhhheeccc Confidence 0 00 0000 000 0 Q ss_pred -----cccccccccch----------------hhh-hhh----------------------------------------- Q lcl|NC_021557. 123 -----ISPAGLASGFS----------------GAY-ECY----------------------------------------- 139 (419) Q Consensus 123 -----~~~~~~~tg~~----------------a~~-~~~----------------------------------------- 139 (419) .+......+.. .+. ++. T Consensus 226 s~~~~~d~~~~~~~~~a~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~~tp~~~~~~~~~~~~~~~~~~~~~v~~~~ 305 (648) T protein:vir:10 226 SDTNPVDIPLGLFVYEVLYGGLFGFTKSRLVKTSFGTVDDLLSNPLLFNLSATPFFDGSDYQDYTSLSDPANWFAKDAYT 305 (648) T ss_pred ccccccccccccccccccchhhhcCCcchhhhhhhccccccccccceecccccccccccceeeeeccccccceeeeeccc Confidence 00000000000 000 000 Q ss_pred ----hhccccccccc-------------cc-------------h-------------------------------hhhhh Q lcl|NC_021557. 140 ----NNFGYFPKLII-------------AP-------------G-------------------------------YSPAA 158 (419) Q Consensus 140 ----~~~~~~p~~~~-------------ap-------------~-------------------------------~~~~~ 158 (419) ......|.+.. .| + .+..+ T Consensus 306 ~~~l~~~~~~p~~~~~~~t~L~GGtdG~~p~s~~~~~~~~~~~d~~d~l~~~~~~~~~~ivp~~~~~~~~~~~~~lt~~q 385 (648) T protein:vir:10 306 INHLVDTTINPHILATRIFSLSGGTNGDDGTGYYQTAVSNYINIWSQGLATLEEEEVNFVIPAYKFTNVTQLNDRLTIFK 385 (648) T ss_pred hhhcccccccCcccccccceecccccCCCcccccccccccchhhHHHHhhhccCCCceEEEeecccccccccccccCCcc Confidence 00000000000 00 0 01112 Q ss_pred hHHHH-HHHHhhc---------cceeEEEEeccCCCHHHHHhhhhhccccccCccceEEe-----------cceeEeecc Q lcl|NC_021557. 159 AVRAE-MDVVASR---------LHALAIADLPLGLTKQQAVAARGVAGTANTSSARTVLT-----------YPHVVIEDT 217 (419) Q Consensus 159 ~v~a~-l~~~~~~---------~~~~~i~d~p~~~~~~~~~~~~~~~~~~~~~s~~~~~~-----------~p~~~~~~~ 217 (419) ++++. +.++..+ ...++++-.+.+.+..+.-..+.. ..++..+++.. +.+.. +. T Consensus 386 ~i~a~a~shv~~~s~~~~~~~r~~~~~~vg~~~~es~~~se~~~~~---~~~~~~~a~~~~~d~~~~~~~~~~~~~-~~- 460 (648) T protein:vir:10 386 GIASTFLSHVQTMSQVNRRKARVGVFGLPAPSPNESVTASEYLYNR---NILNTISAMFGGTDRAQAVVFPFYSNV-FN- 460 (648) T ss_pred chHHHHHHHHHHhhhccccccccCeEEEeCCCCchhHHHHHHHhhh---hcccccceeeeecCCceEEeeccccee-EC- Confidence 33333 3333211 112444444434433221111111 11222222111 11111 11 Q ss_pred ccccceeeech---HHHHHHHHHhhhhccCceecccCceeeceeecceecccccCCcchhhccccCCceEEEEEecCC-- Q lcl|NC_021557. 218 TGATETRLDPL---SSRLAGVIIATDLNEGWQNSPSNREIKGVVDLEVPINFYPSDYQNDTNFLNEAGIVTAMRSFAT-- 292 (419) Q Consensus 218 ~~~~~~~~~p~---s~~vAg~~a~~D~~~g~~~span~~l~gv~~~~~~~~~~~~~~~~~~~~L~~~gI~~i~~~~~~-- 292 (419) ..++...+|| .+++||+++++ .++.||.||+|+++ ++... +. .++.|.+.|+++||++|....++ T Consensus 461 -~~G~~~~~p~~~~Aa~VAGl~a~l----~~~~s~T~k~i~~~-~id~~--~~--~t~~qld~L~~~Gv~~ie~~~~~~~ 530 (648) T protein:vir:10 461 -DEGKVELLGGEFFASYVAGMHANR----EPQDSITFLPISGI-GAEPL--YN--WTYTQKDDLISNRVLFVEKVKTSFG 530 (648) T ss_pred -CCCcEEecchhhHHHHHHhhhhcc----ccccCcccceeecc-ccccc--cC--CCHHHHHHHhcCCcEEEEEecCCcc Confidence 1334555787 77889998876 49999999999844 33321 11 24568889999999999877664 Q ss_pred --cEEEEeccccCCCCCcccceeeehhhHHHHHHHHHHHH-HHHhhcCCCCHHHHHHHHHHHHHHHHHHHh-hcccceE- Q lcl|NC_021557. 293 --GIRVFGNRSAAFPTSSHVENFIHARRILDMIHEAIIFY-TMNYVDRLGSPMTVEAAEEGVNAYLRSKTG-IAIYGGT- 367 (419) Q Consensus 293 --G~~~wG~rT~~~~s~~~~~~~i~vrR~~~~i~~~~~~~-~~~~v~e~n~~~~~~~i~~~i~~~L~~l~~-~g~~~~~- 367 (419) ++++--+-|....++++.|+.++++|++|++.+.+++. ..+|+++||+...|..++..+.+||.++++ ++|.+|. T Consensus 531 ~~~~rvv~gITT~~~~~~~~~~eisv~ri~D~l~~~vr~~l~~~fIG~~n~~~~~~~ik~~i~~~L~~~~~~~~I~~y~~ 610 (648) T protein:vir:10 531 GIVYRIHHNPTTWLGPVTQGFQEFVLRRIDDFLQSYVYKNLQEQFIGRKSYGRKTENDIKVYTEALLSNLVGKQIVAYKD 610 (648) T ss_pred eeeEEEeccceeecCCCCcceeeeeeeehhhHHHHHHHHHHhhhcCcccccHHHHHHHHHHHHHHHhhHhhcCcccCccc Confidence 35565555666667889999999999999999999875 459999999999999999999999998887 4577763 Q ss_pred --EEEecccCCHHHhhCCEEEEEEEEEeccCceEEEEEEEEcchHHH Q lcl|NC_021557. 368 --FRFDRQKNTAEQIADGKFYYRLECHPISVMERITIDSYVDTKFIS 412 (419) Q Consensus 368 --v~~d~~~n~~~~i~~G~~~~~v~~~p~~p~e~i~~~~~~~~~~~~ 412 (419) +.++. .++++++++.+.|++|++||.++++++.+ |+ T Consensus 611 ~~v~~~~--------~~~vv~V~~~v~Pv~~i~~I~vti~it~~-~~ 648 (648) T protein:vir:10 611 VKVTSNE--------DKTVYYVEFFYQPVTEIKFILVTMKVTFD-LE 648 (648) T ss_pred ceEEEEe--------cCCEEEEEEEEEecceeeEEEEEEEEEec-cC Confidence 55543 45999999999999999999998888877 33 No 39 >protein:vir:95741 Length: 587 # NCBI annotation: ORF011 # Family: family:all:2449 # MgeID: mge:1577 # MgeName: G1 # Cross-refs: genbank:acc:YP_240910;genbank:gi:66394960;genbank:GeneID:5132682 Probab=100.00 E-value=1.5e-37 Score=222.43 Aligned_cols=373 Identities=14% Similarity=0.110 Sum_probs=245.4 Q ss_pred CCC-------ccCCCeEEEEcCCCccCccccCccceEEEEccccccccccccccccccCcceeecchHHHHHHhcccchh Q lcl|NC_021557. 1 MAA-------TFHHGPEVIEHKDGVTVVRDVKSAVTYVNGTAPIQDVHATALAREDYINKRVIIRSRAEGAAAFGVHKAG 73 (419) Q Consensus 1 Ma~-------~~~hGVyv~e~~~~~~~i~~v~tav~~~Vgta~~a~~~~~~~~~~~~~n~pv~its~~e~~~~fg~~~~~ 73 (419) ||- .-+||||+++.+++..++..+.+++.+|||.++.+++ ++|++++++.|+.+.||. T Consensus 1 ~a~~~~~~~~~~~pgv~~~~~~~~~~~~~~~~~~~~~~~g~a~~G~~-----------~~~~~~~~~~~~~~~~~~---- 65 (587) T protein:vir:95 1 MAVEPFPRRPITRPHASIEVDTSGIGGSAGSSEKVFCLIGQAEGGEP-----------NTVYELRNYSQAKRLFRS---- 65 (587) T ss_pred CcccccCCcccccCceEEEEecCCccccCCCCCceEEEEEEeCCCCC-----------ceeEEeccHHHHHHHhcC---- Confidence 763 3479999999999999999999999999999887754 899999999999999987 Q ss_pred hhHHHHHHHHh----hccCCcEEEEeeccccccccccccc------------------------c-----------cccc Q lcl|NC_021557. 74 YTIPAALDAIF----DQGDGGTIIVNNVFDPDVHKEGANP------------------------D-----------PSKV 114 (419) Q Consensus 74 ~~l~~al~~~~----~~~~~~~~v~~~~~~~~~~~~~~~~------------------------~-----------~~~~ 114 (419) +.|.+++.+.| .+++..++.+++............. + .... T Consensus 66 g~l~~~~~~a~~~~~~~g~~~~~~~rv~~~~~a~~~~~~l~~~a~~~G~~gN~i~v~~~~~~~~~~~~~~~~~~~~~~~~ 145 (587) T protein:vir:95 66 GELLDAIELAWGSNPNYTAGRILAMRIEDAKPASAEIGGLKITSKIYGNVANNIQVGLEKNTLSDSLRLRVIFQDDRFNE 145 (587) T ss_pred cchHHHHHHHhccccCCCceEEEEEEcCCCceeEEEecCeEEEEecccccccceEEEEecCCCCCceeEEEEEeccccee Confidence 44777777777 4677666666542111100000000 0 0000 Q ss_pred c---------------c------------------------------ceeccc---------------cccccccccc-- Q lcl|NC_021557. 115 T---------------T------------------------------VDINGT---------------ISPAGLASGF-- 132 (419) Q Consensus 115 t---------------~------------------------------~~~~g~---------------~~~~~~~tg~-- 132 (419) . . ..+... .+.+..+.+. T Consensus 146 ~~~~~g~v~si~y~g~~~~~~~~v~~~~~t~~a~~~~l~~g~~~v~~yrL~~g~~~~~~~~~~~in~~~~~tAky~g~~~ 225 (587) T protein:vir:95 146 VYDNIGNIFTIKYKGEEANATFSVEHDEETQKASRLVLKVGDQEVKSYDLTGGAYDYTNAIITDINQLPDFEAKLSPFGD 225 (587) T ss_pred eeeeccceeeeeeeccccccceeeeecccceeeeeeeeecCCceEEEEEecCCchHHHHHHHHhhccccceEEEEecccC Confidence 0 0 000000 0000000000 Q ss_pred --------hhh--------------------------------------------------------------------- Q lcl|NC_021557. 133 --------SGA--------------------------------------------------------------------- 135 (419) Q Consensus 133 --------~a~--------------------------------------------------------------------- 135 (419) ..+ T Consensus 226 ~~i~~~~~~~~~~~~v~~~~~~v~a~~~d~~~~~~~~~~v~~~~~~g~~~~~~~~~~~~~~~~a~~~~~~~~~~~a~~~~ 305 (587) T protein:vir:95 226 KNLESSKLDKIENANIKDKAVYVKAVFGDLEKQTAYNGIVSFEQLNAEGEVPSNVEVEAGEESATVTATSPIKTIEPFEL 305 (587) T ss_pred ceeEEeecCcccccceehhhhhhhhhhcceeeeeeceeeeeeecccccceeccchhhhhcccchheeccccccceeccce Confidence 000 Q ss_pred ---------------hhhhhhccccccccccchhhhhhhHHHHHHHHhh-----ccceeEEEEeccCCCHHHHHhhhhhc Q lcl|NC_021557. 136 ---------------YECYNNFGYFPKLIIAPGYSPAAAVRAEMDVVAS-----RLHALAIADLPLGLTKQQAVAARGVA 195 (419) Q Consensus 136 ---------------~~~~~~~~~~p~~~~ap~~~~~~~v~a~l~~~~~-----~~~~~~i~d~p~~~~~~~~~~~~~~~ 195 (419) .+++..+.......+.+ .+..+.+++++.++++ ...+++++..+.+.+.+++.... T Consensus 306 t~LtGG~dG~~~~~y~~~l~ale~~~~~~i~~-~t~d~~v~a~l~a~vk~~~~~g~~~~aVvg~~~~~~~~~~~~~a--- 381 (587) T protein:vir:95 306 TKLKGGTNGEPPATWADKLDKFAHEGGYYIVP-LSSKQSVHAEVASFVKERSDAGEPMRAIVGGGFNESKEQLFGRQ--- 381 (587) T ss_pred eeeecCCCCCCcccHHHHHHHHHhCCcEEEEe-cCCCHHHHHHHHHHHHHHHhCCCcEEEEEcCCCCCCHHHHHHHH--- Confidence 00000000000000011 1122344454444332 23467888877777777776644 Q ss_pred cccccCccceEEecceeEeeccccccceeeech---HHHHHHHHHhhhhccCceecccCceeeceeecceecccccCCcc Q lcl|NC_021557. 196 GTANTSSARTVLTYPHVVIEDTTGATETRLDPL---SSRLAGVIIATDLNEGWQNSPSNREIKGVVDLEVPINFYPSDYQ 272 (419) Q Consensus 196 ~~~~~~s~~~~~~~p~~~~~~~~~~~~~~~~p~---s~~vAg~~a~~D~~~g~~~span~~l~gv~~~~~~~~~~~~~~~ 272 (419) ..+++.+.++++++..+... .+....+|| ++++||+++.+| +.+||.|+++. ..++...+ .. T Consensus 382 --~~~n~ervi~v~~~~~~~~~--dg~~~~~~~~~~aa~vAGl~Ag~~----~~~SlT~~~i~-~~~v~~~~------t~ 446 (587) T protein:vir:95 382 --ESLSNPRVSLVANSGTFVMD--DGRKNHVPAYMVAVALGGLASGLE----IGESITFKPLR-VSSLDQIY------ES 446 (587) T ss_pred --hhcCCCcEEEecccceEecC--CCceeeechHHHHHHHHHHHhcCc----hhcCccceeee-cccccccC------CH Confidence 34788899999887654321 233455676 789999999887 88899999987 44554433 46 Q ss_pred hhhccccCCceEEEEEecCCc---EEE-EeccccCCCCCcccceeeehhhHHHHHHHHHHHHH-HHhhcCCCCHHHHHHH Q lcl|NC_021557. 273 NDTNFLNEAGIVTAMRSFATG---IRV-FGNRSAAFPTSSHVENFIHARRILDMIHEAIIFYT-MNYVDRLGSPMTVEAA 347 (419) Q Consensus 273 ~~~~~L~~~gI~~i~~~~~~G---~~~-wG~rT~~~~s~~~~~~~i~vrR~~~~i~~~~~~~~-~~~v~e~n~~~~~~~i 347 (419) .|.+.|+++|++++...++++ +++ .|-.|.. .++++.|++++++|++|+|.+.+++.+ .+|++|||+...|..+ T Consensus 447 ~e~e~ai~~Gvl~l~~~~~~~~~~vriv~~itT~t-~~~d~~~~~i~viRv~D~i~~dir~~~~~~~iGk~nn~~~r~~v 525 (587) T protein:vir:95 447 IDLDELNENGIISIEFVRNRTNTFFRIVDDVTTFN-DKSDPVKAEMAVGEANDFLVSELKVQLEDQFIGTRTINTSASII 525 (587) T ss_pred HHHHHHHhCCeEEEEEecCCcceEEEEeecceecc-CCCCcchhhhhhhhhHHHHHHHHHHHHHhhCCccccchHHHHHH Confidence 788899999999988766664 332 3445543 356789999999999999999999886 5999999999999999 Q ss_pred HHHHHHHHHHHHhhc-ccceEEEEecccCCHHHhhCCEEEEEEEEEeccCceEEEEEEEEcchHHHH Q lcl|NC_021557. 348 EEGVNAYLRSKTGIA-IYGGTFRFDRQKNTAEQIADGKFYYRLECHPISVMERITIDSYVDTKFISN 413 (419) Q Consensus 348 ~~~i~~~L~~l~~~g-~~~~~v~~d~~~n~~~~i~~G~~~~~v~~~p~~p~e~i~~~~~~~~~~~~~ 413 (419) +..+..||..|++.| |.+|... +.+-++...++++++.++|+.|+|+|.+++++.++-++. T Consensus 526 ~~~i~~~L~~l~~~gaI~~~~~~-----dv~v~~~~d~~~v~~~v~Pv~~mekI~vt~~~~~~~~~~ 587 (587) T protein:vir:95 526 KDFIQSYLGRKKRDNEIQDFPAE-----DVQVIVEGNEARISMTVYPIRSFKKISVSLVYKQQTLQA 587 (587) T ss_pred HHHHHHHHHHHHhCCcccCCCcc-----ceEEEecCCEEEEEEEEEEcccceEEEEEEEEeeeeecC Confidence 999999999999877 4555331 122234556799999999999999999999999887776 No 40 >protein:vir:96586 Length: 587 # NCBI annotation: ORF011 # Family: family:all:2449 # MgeID: mge:1623 # MgeName: Twort # Cross-refs: genbank:acc:YP_238553;genbank:gi:66391266;genbank:GeneID:5130368 Probab=100.00 E-value=5.8e-37 Score=219.24 Aligned_cols=374 Identities=14% Similarity=0.100 Sum_probs=244.1 Q ss_pred CC-------CccCCCeEEEEcCCCccCccccCccceEEEEccccccccccccccccccCcceeecchHHHHHHhcccchh Q lcl|NC_021557. 1 MA-------ATFHHGPEVIEHKDGVTVVRDVKSAVTYVNGTAPIQDVHATALAREDYINKRVIIRSRAEGAAAFGVHKAG 73 (419) Q Consensus 1 Ma-------~~~~hGVyv~e~~~~~~~i~~v~tav~~~Vgta~~a~~~~~~~~~~~~~n~pv~its~~e~~~~fg~~~~~ 73 (419) || .+.+||||+++.++++.++....+++.+|||++..+++ ++|++++++.++.+.||. T Consensus 1 ~~~~~~~~~~~~~Pgv~~~~~~~~~~~~~~~~~~~~~~iG~a~~g~~-----------~~~~~~~~~~~~~~~~g~---- 65 (587) T protein:vir:96 1 MAKDIFPRRPIQRPHASIEVDSSGIGGSASNSEKILCLIGKAEGGEP-----------NTVYQVRNYAQAKSVFRS---- 65 (587) T ss_pred CeeeeeCCCcccCCceEEEEecCCccCCCCCCCceEEEEEEecCCCC-----------ceeEEEcChHHHHHhhcC---- Confidence 65 44579999999999999999999999999999887754 889999999999999987 Q ss_pred hhHHHHHHHHhh----ccCCcEEEEeecccccccccccc------------------------------------cc--- Q lcl|NC_021557. 74 YTIPAALDAIFD----QGDGGTIIVNNVFDPDVHKEGAN------------------------------------PD--- 110 (419) Q Consensus 74 ~~l~~al~~~~~----~~~~~~~v~~~~~~~~~~~~~~~------------------------------------~~--- 110 (419) +.|.+++...|+ +++..++.+++............ .. T Consensus 66 G~l~~ai~~a~~~~~~~g~~~~~a~rv~~~~~a~~~~~~~~~~~~~~g~~~n~i~v~~~~~~~~~~~~~~~~~~~~~~~~ 145 (587) T protein:vir:96 66 GELLDAIELAWGSNPQYTAGKILAMRVEDAKASQLEKGGLRVTSKIFGSVSNDIQVALEKNTITDSLRLRVVFQKDNYQE 145 (587) T ss_pred CcHHHHHHHHhccCcCCCceEEEEEecCCCccceeecccccccccccCCCCceEEEEEEeccCCCccceEEEEecCCcee Confidence 458888887774 66666665544111110000000 00 Q ss_pred ----cc---cc----ccc----eec--------------cc----------c-----------------ccccccccchh Q lcl|NC_021557. 111 ----PS---KV----TTV----DIN--------------GT----------I-----------------SPAGLASGFSG 134 (419) Q Consensus 111 ----~~---~~----t~~----~~~--------------g~----------~-----------------~~~~~~tg~~a 134 (419) .. .+ +.. ... +. . .-+..+.|... T Consensus 146 ~~~n~G~v~~i~y~g~~~~a~~~~~~~~~~~~A~~l~l~gg~~~v~~yrl~~g~~~~~~~~~~~~~~~~~~tAky~g~~~ 225 (587) T protein:vir:96 146 VFDNLGNIFSINYKGEGEKATFSVEKDKETQEAKRLVLKVDEKEVKAYELNGGAYSFTNEIITDINELPDFEAKLSPFGD 225 (587) T ss_pred eccccCceEEEEecccccceeEeeccCcccceeeeeEEEecCceEEEEEeCCCchhhhhhhhhhhccccceEEEeecccC Confidence 00 00 000 000 00 0 00000000000 Q ss_pred ---------------------hh-----h--------------------------------------------------- Q lcl|NC_021557. 135 ---------------------AY-----E--------------------------------------------------- 137 (419) Q Consensus 135 ---------------------~~-----~--------------------------------------------------- 137 (419) .. + T Consensus 226 n~~~v~v~d~~~~~~~k~~~~y~~t~~~di~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~ 305 (587) T protein:vir:96 226 KNLESRKLDEATDVDIKGKAVYVKAVFGDIENQTQYNQYVKFEQLPEQASEPSDVEVHAETESATVTATSKPKAIEPFEL 305 (587) T ss_pred ceeEEEeeccccccccceEEEeehhhhhhhhhhhccccceeeccccchhhhhhcccccccccceeeeecccccccccccc Confidence 00 0 Q ss_pred -----------------hhhhccccccccccchhhhhhhHHHHHHHHhh-----ccceeEEEEeccCCCHHHHHhhhhhc Q lcl|NC_021557. 138 -----------------CYNNFGYFPKLIIAPGYSPAAAVRAEMDVVAS-----RLHALAIADLPLGLTKQQAVAARGVA 195 (419) Q Consensus 138 -----------------~~~~~~~~p~~~~ap~~~~~~~v~a~l~~~~~-----~~~~~~i~d~p~~~~~~~~~~~~~~~ 195 (419) ++..+.......+.+ .+..+.+++++.++++ ...+++++..+.+.+.++....+ T Consensus 306 ~aLtGG~dG~~~~~y~~~l~ale~~~~~~i~~-~t~d~ai~~~l~a~vk~~r~~gk~~~aVlg~~~~~~~~~~~~~a--- 381 (587) T protein:vir:96 306 TKLSGGTNGEPPTSWSAKLEKFKNEGGYYIVP-LTDRQSVHSEVATFVKNRSDAGEPMRAIVGGGTSETKEKLFGRQ--- 381 (587) T ss_pred eeeecCCCCCCcccHHHHHHHHhhCCcEEEEe-cCCCHHHHHHHHHHHHHHHhCCCeEEEEecCCCCCCHHHHHHHH--- Confidence 000000000000000 1112334444433332 13367777777666666665543 Q ss_pred cccccCccceEEecceeEeeccccccceeeech---HHHHHHHHHhhhhccCceecccCceeeceeecceecccccCCcc Q lcl|NC_021557. 196 GTANTSSARTVLTYPHVVIEDTTGATETRLDPL---SSRLAGVIIATDLNEGWQNSPSNREIKGVVDLEVPINFYPSDYQ 272 (419) Q Consensus 196 ~~~~~~s~~~~~~~p~~~~~~~~~~~~~~~~p~---s~~vAg~~a~~D~~~g~~~span~~l~gv~~~~~~~~~~~~~~~ 272 (419) ..+++.+.+.++++..+.+.. ......|+ ++++||++|.++ +.+||.|+.+.+ .++...+ .. T Consensus 382 --~~~n~e~vi~v~~~~~~~~~~--~~~~~~~~~~~aa~vAG~~Ag~~----~~~S~T~~~~~~-~~v~~~~------t~ 446 (587) T protein:vir:96 382 --AILNNPRVALVANSGKFVMGN--GRILQAPAYMVASAVAGLVSGLD----IGESITFKPLFV-NSLDKVY------ES 446 (587) T ss_pred --hhcCCCcEEEEecceEEecCC--CceeeechhhHHHHHHHHHhcCc----cccCccceeeec-ccccccC------CH Confidence 347788999999987776532 22333443 688999999886 889999999875 4555443 46 Q ss_pred hhhccccCCceEEEEEecCCcEEEEec-cccC--CCCCcccceeeehhhHHHHHHHHHHHHH-HHhhcCCCCHHHHHHHH Q lcl|NC_021557. 273 NDTNFLNEAGIVTAMRSFATGIRVFGN-RSAA--FPTSSHVENFIHARRILDMIHEAIIFYT-MNYVDRLGSPMTVEAAE 348 (419) Q Consensus 273 ~~~~~L~~~gI~~i~~~~~~G~~~wG~-rT~~--~~s~~~~~~~i~vrR~~~~i~~~~~~~~-~~~v~e~n~~~~~~~i~ 348 (419) .|.+.|.++|+.+++...+++.++|.. +++. ....++.|++++++|++|+|.+.+++.+ .+|++|||+...|..++ T Consensus 447 ~e~~~~i~~G~~~l~~~~~~~~~v~~~vnsitT~t~~~~~~~~~i~virv~D~i~~di~~~~~~~yiGk~nn~~~r~~v~ 526 (587) T protein:vir:96 447 EELDELNENGIITIEFVRNRMTTMFRIVDDVTTFPDKNDPVKSEMALGEANDFLVSELKILLEEQYIGTRTINTSASQIK 526 (587) T ss_pred HHHHHHHhCCeEEEEEecCCcEEEEEeeccceecCCCCCchhhhhhhHHHHHHHHHHHHHHHHhcCCccccCHHHHHHHH Confidence 678899999999988777777777743 3333 2345678999999999999999999987 58999999999999999 Q ss_pred HHHHHHHHHHHhhc-ccceEEEEecccCCHHHhhCCEEEEEEEEEeccCceEEEEEEEEcchHHHH Q lcl|NC_021557. 349 EGVNAYLRSKTGIA-IYGGTFRFDRQKNTAEQIADGKFYYRLECHPISVMERITIDSYVDTKFISN 413 (419) Q Consensus 349 ~~i~~~L~~l~~~g-~~~~~v~~d~~~n~~~~i~~G~~~~~v~~~p~~p~e~i~~~~~~~~~~~~~ 413 (419) ..+..||.+|++.| |.+|... +-.-++...++++++.++|+.|+|+|.+++.+.++-++. T Consensus 527 ~~i~~~L~~l~~~g~I~~~~~~-----dv~v~~~~D~~~v~~~v~Pv~~mekIy~tv~~~~~~~~~ 587 (587) T protein:vir:96 527 DFVQSYLGRKKRDNEIQDFPPE-----DVQVIIEGNEARISLTIFPIRALKKISVSLVYRQQTLQA 587 (587) T ss_pred HHHHHHHHHHHhCCcccCCCcc-----ceEEEecCCEEEEEEEEEEcccceEEEEEEEEEeeeecC Confidence 99999999999877 5555331 111123445799999999999999999999999887776 No 41 >protein:vir:99306 Length: 587 # NCBI annotation: putative major tail sheath protein # Family: family:all:2449 # MgeID: mge:1655 # MgeName: K # Cross-refs: genbank:acc:YP_024479;genbank:gi:48696438;genbank:GeneID:2948034 Probab=100.00 E-value=1.1e-36 Score=217.64 Aligned_cols=373 Identities=15% Similarity=0.118 Sum_probs=244.4 Q ss_pred CCC-------ccCCCeEEEEcCCCccCccccCccceEEEEccccccccccccccccccCcceeecchHHHHHHhcccchh Q lcl|NC_021557. 1 MAA-------TFHHGPEVIEHKDGVTVVRDVKSAVTYVNGTAPIQDVHATALAREDYINKRVIIRSRAEGAAAFGVHKAG 73 (419) Q Consensus 1 Ma~-------~~~hGVyv~e~~~~~~~i~~v~tav~~~Vgta~~a~~~~~~~~~~~~~n~pv~its~~e~~~~fg~~~~~ 73 (419) ||- .-+||||+++.+++..++..+.+++.+|||.+..+++ ++|++++++.|+.+.||. T Consensus 1 ~a~~~~~~~~~~~pgv~~~~~~~~~~~~~~~~~~~~~~~g~a~~G~~-----------~~~~~~~~~~~~~~~~~~---- 65 (587) T protein:vir:99 1 MAVEPFPRRPITRPHASIEVDTSGIGGSAGSSEKVFCLIGQAEGGEP-----------NTVYELRNYSQAKRLFRS---- 65 (587) T ss_pred CcccccCCcccccCceEEEEecCCccccCCCCCceEEEEEEecCCcc-----------ceeEEeccHHHHHHHhcC---- Confidence 763 3479999999999999999999999999999887754 889999999999999987 Q ss_pred hhHHHHHHHHh----hccCCcEEEEeeccccccccccccc------------------------ccc-----------cc Q lcl|NC_021557. 74 YTIPAALDAIF----DQGDGGTIIVNNVFDPDVHKEGANP------------------------DPS-----------KV 114 (419) Q Consensus 74 ~~l~~al~~~~----~~~~~~~~v~~~~~~~~~~~~~~~~------------------------~~~-----------~~ 114 (419) +.|.+++.+.| .+++..++.++.............. +.. .. T Consensus 66 g~l~~~~~~a~~~~~~~g~~~~~~~rv~~~~~a~~~~~~l~~~a~~~G~~gN~i~v~~~~~~~~~~~~~~~~~~~~~~~~ 145 (587) T protein:vir:99 66 GELLDAIELAWGSNPNYTAGRILAMRIEDAKPASAEIGGLKITSKIYGNVANNIQVGLEKNTLSDSLRLRVIFQDDRFNE 145 (587) T ss_pred cchHHHHHHHhccccCCCceEEEEEEcCCCceeEEEecCeEEEEeeccccccceEEEEccCCCCcceeEEEEEeccccee Confidence 45777887777 4666666665442111100000000 000 00 Q ss_pred c---------------c------------------------------ceeccc---------------cccccccccc-- Q lcl|NC_021557. 115 T---------------T------------------------------VDINGT---------------ISPAGLASGF-- 132 (419) Q Consensus 115 t---------------~------------------------------~~~~g~---------------~~~~~~~tg~-- 132 (419) + . ..+... .+-+..+.+. T Consensus 146 ~~~~~g~v~~i~y~g~~~~a~~~v~~~~~t~~a~~~~l~~g~~~v~~yrL~~g~~~~~~~~~~~i~~~~~~tAky~~~~~ 225 (587) T protein:vir:99 146 VYDNIGNIFTIKYKGEEANATFSVEHDEETQKASRLVLKVGDQEVKSYDLTGGAYDYTNAIITDINQLPDFEAKLSPFGD 225 (587) T ss_pred eeeeccceeeEEeecccccceeeEeecCcceeeeeeeeecCCceeEEEEecCCchHHHHHHHhhhccccceeEEeeccCC Confidence 0 0 000000 0000000000 Q ss_pred -------------------------------------------------------------------------------- Q lcl|NC_021557. 133 -------------------------------------------------------------------------------- 132 (419) Q Consensus 133 -------------------------------------------------------------------------------- 132 (419) T Consensus 226 ~~i~~~~~~~~~~~~v~~~~~~v~a~~~D~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~ 305 (587) T protein:vir:99 226 KNLESSKLDKIENANIKDKAVYVKAVFGDLEKQTAYNGIVSFEQLNAEGEVPSNVEVEAGEESATVTATSPIKTIEPFEL 305 (587) T ss_pred ceeEeecccccccceeeeeeeeeehhccceeeecccceeeeeeecccccchhhhhhhhhccccceeeeeccccceecccc Confidence Q ss_pred ---hh---------hhhhhhhccccccccccchhhhhhhHHHHHHHHhh-----ccceeEEEEeccCCCHHHHHhhhhhc Q lcl|NC_021557. 133 ---SG---------AYECYNNFGYFPKLIIAPGYSPAAAVRAEMDVVAS-----RLHALAIADLPLGLTKQQAVAARGVA 195 (419) Q Consensus 133 ---~a---------~~~~~~~~~~~p~~~~ap~~~~~~~v~a~l~~~~~-----~~~~~~i~d~p~~~~~~~~~~~~~~~ 195 (419) .. ..+++..+.......+++ .+..+.+++++.++++ ...+++++..+.+.+.+++.... T Consensus 306 t~LtGG~dG~~~~sy~~al~ale~~~~~~i~~-~t~d~~i~a~l~a~vk~~r~~g~~~~aVlg~~~~~~~~~~~~~a--- 381 (587) T protein:vir:99 306 TKLKGGTNGEPPATWADKLDKFAHEGGYYIVP-LSSKQSVHAEVASFVKERSDAGEPMRAIVGGGFNESKEQLFGRQ--- 381 (587) T ss_pred eeeecCCCCCccccHHHHHHHHhhCCcEEEEe-cCCCHHHHHHHHHHHHHHHhCCCcEEEEecCCCCCCHHHHHHHh--- Confidence 00 000000000000000011 1122344444444332 23467888877777777776654 Q ss_pred cccccCccceEEecceeEeeccccccceeeech---HHHHHHHHHhhhhccCceecccCceeeceeecceecccccCCcc Q lcl|NC_021557. 196 GTANTSSARTVLTYPHVVIEDTTGATETRLDPL---SSRLAGVIIATDLNEGWQNSPSNREIKGVVDLEVPINFYPSDYQ 272 (419) Q Consensus 196 ~~~~~~s~~~~~~~p~~~~~~~~~~~~~~~~p~---s~~vAg~~a~~D~~~g~~~span~~l~gv~~~~~~~~~~~~~~~ 272 (419) ..+++.+.+.++++...... .+....+|+ ++++||+++..| +++||.|+.+. ..++...+ .. T Consensus 382 --~~~n~e~vi~v~~~~~~~~~--dg~~~~~~~~~~aa~vAGl~Ag~~----~~~SlT~~~i~-~~~v~~~~------t~ 446 (587) T protein:vir:99 382 --ASLSNPRVSLVANSGTFVMD--DGRKNHVPAYMVAVALGGLASGLE----IGESITFKPLR-VSSLDQIY------ES 446 (587) T ss_pred --hhcCCCcEEEEeccceEecC--CCceeeechHHHHHHHHHHHhcCc----hhcCccceeee-cccccccC------CH Confidence 34778899999887554321 233455676 788999999887 88999999987 45555443 46 Q ss_pred hhhccccCCceEEEEEecCCc---EEE-EeccccCCCCCcccceeeehhhHHHHHHHHHHHHH-HHhhcCCCCHHHHHHH Q lcl|NC_021557. 273 NDTNFLNEAGIVTAMRSFATG---IRV-FGNRSAAFPTSSHVENFIHARRILDMIHEAIIFYT-MNYVDRLGSPMTVEAA 347 (419) Q Consensus 273 ~~~~~L~~~gI~~i~~~~~~G---~~~-wG~rT~~~~s~~~~~~~i~vrR~~~~i~~~~~~~~-~~~v~e~n~~~~~~~i 347 (419) .|.+.|+++|++++...++++ +++ .|-.|.. .++++.|++++++|++|+|.+.+++.+ .+|++|||+...|..+ T Consensus 447 ~e~e~li~~Gvl~l~~~~~~~~~~vriv~~ItT~t-~~~~~~~~~i~viRv~D~i~~di~~~~~~~yiGk~Nn~~~r~~i 525 (587) T protein:vir:99 447 IDLDELNENGIISIEFVRNRTNTFFRIVDDVTTFN-DKSDPVKAEMAVGEANDFLVSELKVQLEDQFIGTRTINTSASII 525 (587) T ss_pred HHHHHHHhCCeEEEEEecCCcceEEEEeeceeecc-CCCCchhhhhhhhhhHHHHHHHHHHHHHhhCCccccchHHHHHH Confidence 788899999999988766654 433 3444443 456788999999999999999999987 5899999999999999 Q ss_pred HHHHHHHHHHHHhhc-ccceEEEEecccCCHHHhhCCEEEEEEEEEeccCceEEEEEEEEcchHHHH Q lcl|NC_021557. 348 EEGVNAYLRSKTGIA-IYGGTFRFDRQKNTAEQIADGKFYYRLECHPISVMERITIDSYVDTKFISN 413 (419) Q Consensus 348 ~~~i~~~L~~l~~~g-~~~~~v~~d~~~n~~~~i~~G~~~~~v~~~p~~p~e~i~~~~~~~~~~~~~ 413 (419) +..+..||..|++.| |.+|... ...-+....++++++.+.|+.|+|+|.+++.+.++-|++ T Consensus 526 ~~~i~~~L~~l~~~gaI~~~~~~-----dv~v~~~~d~~~v~~~v~Pv~~mekIy~tv~~~~~~~~~ 587 (587) T protein:vir:99 526 KDFIQSYLGRKKRDNEIQDFPAE-----DVQVIVEGNEARISMTVYPIRSFKKISVSLVYKQQTLQA 587 (587) T ss_pred HHHHHHHHHHHHhCCcccCCCcc-----ceEEEecCCEEEEEEEEEEcccceEEEEEEEEEeeeecC Confidence 999999999999877 4555331 111123445799999999999999999999999987777 No 42 >protein:vir:107310 Length: 581 # NCBI annotation: gp123 # Family: family:all:1196 # MgeID: mge:1550 # MgeName: Catera # Cross-refs: genbank:acc:YP_656131;genbank:gi:109393333;genbank:GeneID:4156791 Probab=100.00 E-value=7.8e-34 Score=202.10 Aligned_cols=369 Identities=12% Similarity=0.053 Sum_probs=200.2 Q ss_pred CCCccCCCeEEE------------------------EcCCCccCccccC-ccceEEEEccc--ccccccccccccccc-C Q lcl|NC_021557. 1 MAATFHHGPEVI------------------------EHKDGVTVVRDVK-SAVTYVNGTAP--IQDVHATALAREDYI-N 52 (419) Q Consensus 1 Ma~~~~hGVyv~------------------------e~~~~~~~i~~v~-tav~~~Vgta~--~a~~~~~~~~~~~~~-n 52 (419) .++--.+|++.. .-++........+ -+++..++..- .+++....-...++. . T Consensus 160 ~~~~s~~gi~~~~~~l~~~~~~~~~~~gsd~~~~~~~~~~~~~~~~~~D~~t~~~~~~g~~~~~~~v~~~~~~~~d~~~~ 239 (581) T protein:vir:10 160 NRALAKKGIKTDTIRVVNPNSGQVYVLGTDYVVTRVNAGEDGEANTRDDLYTIQRVVDGGHIDPGDIVQLSYRYTDPNYH 239 (581) T ss_pred cccccccccccccccccccccCcceeccccceeeecccCccccccccccceeeeeeecccccccceEEEEEEEeecCCcc Confidence 010001122111 1110110000000 01122121110 001100000000000 0 Q ss_pred cceeecchHHHHHHhcccchhhhHHHHHHHHhhccCCcEEEEeeccccccccccccccccccccceeccccccccccccc Q lcl|NC_021557. 53 KRVIIRSRAEGAAAFGVHKAGYTIPAALDAIFDQGDGGTIIVNNVFDPDVHKEGANPDPSKVTTVDINGTISPAGLASGF 132 (419) Q Consensus 53 ~pv~its~~e~~~~fg~~~~~~~l~~al~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~t~~~~~g~~~~~~~~tg~ 132 (419) ..|.++...+....++. .++..+...--+.... ...........+.+..+..+..... T Consensus 240 ~~v~~~~~~~~~~~~~~-------------~~~~~g~~~~~~t~~~---------~~~~tn~~~~~l~~gvd~~g~tvt~ 297 (581) T protein:vir:10 240 EVIRFTDPDDIQDFYGP-------------AFDEAGNVQSEITLCA---------QLAITNGASTILACAVDPEGDTVTM 297 (581) T ss_pred eeEEeecCcchhhhhhh-------------hhhccCccccchhhhh---------eeeeecccceeEEeeccCCCCccch Confidence 12333333333222221 1111111100000000 0000000001111111111110111 Q ss_pred hhhhhhhhhcccccccc-ccchhhhhhhHHHHHHHHhhc-----cceeEEEEeccCCC---HHHHHhhhhhccccccCcc Q lcl|NC_021557. 133 SGAYECYNNFGYFPKLI-IAPGYSPAAAVRAEMDVVASR-----LHALAIADLPLGLT---KQQAVAARGVAGTANTSSA 203 (419) Q Consensus 133 ~a~~~~~~~~~~~p~~~-~ap~~~~~~~v~a~l~~~~~~-----~~~~~i~d~p~~~~---~~~~~~~~~~~~~~~~~s~ 203 (419) ..+..++..+...+.+. ++|+ +..+.++++|.+++++ ..+.+++..+.... .++.++ ...++++. T Consensus 298 ~dy~~Al~ale~~~~~~ivv~~-t~~~~v~a~l~ahv~~~s~~~~~~ravigV~g~~~~~~~~~~~~-----~a~~~n~~ 371 (581) T protein:vir:10 298 GDYQNALNKFRDEDEIAIIVAG-TGAQPIQALVQQHVSAQSNNKYERRAILGMDGSVTPVPSATRIA-----NAQSIKDQ 371 (581) T ss_pred HHHHHHHHHHhcCCceEEEEeC-CCCHHHHHHHHHHHHHHHhccCCcEEEEEecCCCCCccHHHHHH-----hhccCCCc Confidence 12222333333333222 2333 3445666655554432 34566666553332 233332 22468899 Q ss_pred ceEEecceeEeeccccccceeeech---HHHHHHHHHhhhhccCceecccCceeeceeecceecccccCCcchhhccccC Q lcl|NC_021557. 204 RTVLTYPHVVIEDTTGATETRLDPL---SSRLAGVIIATDLNEGWQNSPSNREIKGVVDLEVPINFYPSDYQNDTNFLNE 280 (419) Q Consensus 204 ~~~~~~p~~~~~~~~~~~~~~~~p~---s~~vAg~~a~~D~~~g~~~span~~l~gv~~~~~~~~~~~~~~~~~~~~L~~ 280 (419) |..+++|+....+.........+|+ .+++||+++.. .+++||.|++++|+.++...+ +..|.+.|++ T Consensus 372 Rvvlv~p~~~~~~g~~~~~~v~lp~y~~AA~vAGl~a~~----~~~~slT~~~i~gi~~l~~~~------s~~e~e~ll~ 441 (581) T protein:vir:10 372 RVALISPSSFVYYAPELNREVVLGGQFMAAAVAGKSVSA----IAAMPLTRKVIRGFSGPAEVQ------RDGEKSRESS 441 (581) T ss_pred eEEEEecCceeecCcccCceeccchhhHHHHHHHHhhcc----ccccCcccccccccccccccC------CHHHHHHHHh Confidence 9999999998887654444555555 45555666555 589999999999998876555 3567889999 Q ss_pred CceEEEEEecCCcEEE-EeccccCCCCCcccceeeehhhHHHHHHHHHHHHHH--HhhcCCCCHHHHHHHHHHHHHHHHH Q lcl|NC_021557. 281 AGIVTAMRSFATGIRV-FGNRSAAFPTSSHVENFIHARRILDMIHEAIIFYTM--NYVDRLGSPMTVEAAEEGVNAYLRS 357 (419) Q Consensus 281 ~gI~~i~~~~~~G~~~-wG~rT~~~~s~~~~~~~i~vrR~~~~i~~~~~~~~~--~~v~e~n~~~~~~~i~~~i~~~L~~ 357 (419) +|++++...+++|+++ ||-.|+ .+++.|++|++||++|++.+.+++.++ .|++|||+..+|.+|+..+..||.. T Consensus 442 ~Gv~~l~~~~~~~v~Iv~gItT~---~s~~~~~~i~~iR~~D~v~~~ir~~~~~~~fIG~~n~~~~r~~ik~~i~~~L~~ 518 (581) T protein:vir:10 442 EGLMVIEKTPRNLVHVRHGVTTD---PTSLHTREWNIIGQQDVMVYRIRDYLDADGLIGMPIYDTTIVQVKASAEAALVW 518 (581) T ss_pred CCeEEEEEecCCeEEEEeeeecC---CCCCcceeeeeehhhhHHHHHHHHHhhhhcCCCcccCHHHHHHHHHHHHHHHHH Confidence 9999998888899986 555665 346789999999999999999999985 5888999999999999999999999 Q ss_pred HHhhc-ccceEEEEecccCCHHHhhCCEEEEEEEEEeccCceEEEEEEEEcchHHHHHHHhcC Q lcl|NC_021557. 358 KTGIA-IYGGTFRFDRQKNTAEQIADGKFYYRLECHPISVMERITIDSYVDTKFISNALSLAA 419 (419) Q Consensus 358 l~~~g-~~~~~v~~d~~~n~~~~i~~G~~~~~v~~~p~~p~e~i~~~~~~~~~~~~~~~~~~a 419 (419) ||+.| |.+|+.. +.++.+.+.+.+++++.++|++|+|||.+++++.++. +++. T Consensus 519 l~~~g~I~~~~~~----~~~~~~~~~d~v~V~i~v~Pv~~i~~I~vti~~~p~~-----~~~~ 572 (581) T protein:vir:10 519 LVDNNIIRGYRNL----KARQIERQPDVIEVRYEWRPAYPLNYIVVRYSIAPET-----GDIT 572 (581) T ss_pred HHhcCcccCCccc----eeeeeecCCCEEEEEEEEEecccceEEEEEEEEecCC-----CceE Confidence 99876 5666432 2345567889999999999999999999999999873 2222 No 43 >protein:vir:100829 Length: 607 # NCBI annotation: hypothetical protein # Family: family:all:2449 # MgeID: mge:1633 # MgeName: LP65 # Cross-refs: genbank:acc:YP_164738;genbank:gi:56693151;genbank:GeneID:3197462 Probab=100.00 E-value=7.9e-33 Score=196.57 Aligned_cols=379 Identities=15% Similarity=0.161 Sum_probs=245.1 Q ss_pred CCCccCCCeEEEEcCCCccCccccCccceEEEEccccccccccccccccccCcceeecchHHHHHHhcccchhhhHHHHH Q lcl|NC_021557. 1 MAATFHHGPEVIEHKDGVTVVRDVKSAVTYVNGTAPIQDVHATALAREDYINKRVIIRSRAEGAAAFGVHKAGYTIPAAL 80 (419) Q Consensus 1 Ma~~~~hGVyv~e~~~~~~~i~~v~tav~~~Vgta~~a~~~~~~~~~~~~~n~pv~its~~e~~~~fg~~~~~~~l~~al 80 (419) |=..-+||||+++.+++..++....+.+.+|||.+..+++ |+|++++++.++...||. +.|.+++ T Consensus 17 ~~~~~~pgv~~~~~~~~~~~~~~~~~~~~~~iG~a~~G~~-----------~~~~~~~~~~~a~~~f~~----g~l~~a~ 81 (607) T protein:vir:10 17 LFYDSRPHVETNFDDSRLSNTASDSAKNIFMLGSATNGDP-----------TKVYEIRTSQQATKIFGS----GDLVDGI 81 (607) T ss_pred CCCccCCceEEEEecCcCcCCCCCCcceEEEEEEeCCCCC-----------ceEEEEcchhHHHHhhcC----cchHHHH Confidence 4444579999999999999999999999999999887754 889999999999999986 4566777 Q ss_pred HHHh------hccCCcEEEEeeccccccccccc-----------------------------------cc---------- Q lcl|NC_021557. 81 DAIF------DQGDGGTIIVNNVFDPDVHKEGA-----------------------------------NP---------- 109 (419) Q Consensus 81 ~~~~------~~~~~~~~v~~~~~~~~~~~~~~-----------------------------------~~---------- 109 (419) ...| .++++.++.++............ .. T Consensus 82 ~~a~~~~~~~~~g~~~~~~~rv~~~~~a~~~~~~~~~~~~~~~~~~~~i~~~l~~~~~~~~~~~~~~~~d~~~~~~~n~g 161 (607) T protein:vir:10 82 KLAFDPTGNSVTNGGTVYALRVDNAKQASLVKDGLTFTSSIFGTNANQVSVALDNDVFGVPRITVNYSPDNYERTYTNIG 161 (607) T ss_pred HHhhccccCCccCCceEEEEeCCCccccceecccccccccccccCCCceEEEEEecCCCccceeEEeecccceeeeeecc Confidence 6666 57777777776421000000000 00 Q ss_pred --------------------c--ccc--c-------------------------ccc----e----------eccc---- Q lcl|NC_021557. 110 --------------------D--PSK--V-------------------------TTV----D----------INGT---- 122 (419) Q Consensus 110 --------------------~--~~~--~-------------------------t~~----~----------~~g~---- 122 (419) + ... . +.. + ..+. T Consensus 162 ~~~~i~y~g~~~~a~~~v~~~~~g~~~~lt~~~~~~~~~~~~V~~~~l~~~~~~t~~~l~~din~~~~~~A~~~g~~~i~ 241 (607) T protein:vir:10 162 QMFSITYSGKSASAGYTVSHDTDGKAILLTLGSGDSIDKLTNVATFDLTMSKYDTIAKLMQAISATPNFSASVVGSPSVN 241 (607) T ss_pred ceeecccCcccccccceeeecCCCceeEEEecCCCccceeeeeecccccccccchHHHHHHHhhcCCceEEEEeccccee Confidence 0 000 0 000 0 0000 Q ss_pred ---ccccc----ccc--cc-----hhh----------------------------------------------------- Q lcl|NC_021557. 123 ---ISPAG----LAS--GF-----SGA----------------------------------------------------- 135 (419) Q Consensus 123 ---~~~~~----~~t--g~-----~a~----------------------------------------------------- 135 (419) .+..+ ... .+ ... T Consensus 242 tky~d~~~~~i~V~~~~~iv~a~~~D~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~a~~~Lt 321 (607) T protein:vir:10 242 TSYLDEVTSPVDVKTAPAVVTAKIGDAISKLGYDPYVVVTQTSNNKPIVNGVSAGTGSATASVTTAPESFPANFDTAFLT 321 (607) T ss_pred eeccccccceeEEEEeeeeechhhhhhhhcccccceEEeeecccchhhhhhhhccccceeeeeeccccccccccceeeee Confidence 00000 000 00 000 Q ss_pred -----------hhhhhhccccccccccchhhhhhhHHHHHHHHhh-----ccceeEEEEeccCCCHHHHHhhhhhccccc Q lcl|NC_021557. 136 -----------YECYNNFGYFPKLIIAPGYSPAAAVRAEMDVVAS-----RLHALAIADLPLGLTKQQAVAARGVAGTAN 199 (419) Q Consensus 136 -----------~~~~~~~~~~p~~~~ap~~~~~~~v~a~l~~~~~-----~~~~~~i~d~p~~~~~~~~~~~~~~~~~~~ 199 (419) .+++..+.......+.+ .+..+.+++++.++++ ...+.+++..+.+.+.+++.... .. T Consensus 322 GGtdG~~~~ty~dal~aLe~~e~~~i~~-~t~d~ai~~~l~a~vkr~~~~g~~~~aVlg~~~~~t~~~~~t~a-----~~ 395 (607) T protein:vir:10 322 GGSTGDVPVSWADKFNGAIGNNVYYIIP-LTSEENIHAELQAFIDEQHVLGYNYHAFVGGGFAEPLEQILSRQ-----VN 395 (607) T ss_pred CCCCCCchhhHHHHHHHHhhcCceEEEe-cCCCHHHHHHHHHHHHHHHhCCCcEEEEecCCCCCCHHHHHHHH-----Hh Confidence 00000000000000000 1112334444433322 23467777777677766666644 34 Q ss_pred cCccceEEecceeEeeccccccceeeech---HHHHHHHHHhhhhccCceecccCceeeceeecceecccccCCcchhhc Q lcl|NC_021557. 200 TSSARTVLTYPHVVIEDTTGATETRLDPL---SSRLAGVIIATDLNEGWQNSPSNREIKGVVDLEVPINFYPSDYQNDTN 276 (419) Q Consensus 200 ~~s~~~~~~~p~~~~~~~~~~~~~~~~p~---s~~vAg~~a~~D~~~g~~~span~~l~gv~~~~~~~~~~~~~~~~~~~ 276 (419) +++.+.+.+.|+..+.+. +.....|+ ++++||++|.++ +.+||.|+.+. ..++...+ ...|.+ T Consensus 396 ~N~ervv~V~~~~~~~~~---G~~~~~~~~~~Aa~vAGl~Ag~~----~~~SlT~k~i~-~~~v~~~l------t~~e~e 461 (607) T protein:vir:10 396 INDSRFGLVGQSGHVQEG---GESVHVPAYLMAAYVGGLSSSLG----VAVPITNKKLA-LVDLDQNF------SGDDLN 461 (607) T ss_pred hCCCcEEEEecCeeEeeC---CcceeccHHHHHHHHHHHHhcCc----cccCcccceec-cccccccC------CHHHHH Confidence 778899999998766542 23344454 788999998887 88899999986 45665554 466788 Q ss_pred cccCCceEEEEEec----CCcEEEEeccccCCCCCcccceeeehhhHHHHHHHHHHHHHH-HhhcCCCCHHHHHHHHHHH Q lcl|NC_021557. 277 FLNEAGIVTAMRSF----ATGIRVFGNRSAAFPTSSHVENFIHARRILDMIHEAIIFYTM-NYVDRLGSPMTVEAAEEGV 351 (419) Q Consensus 277 ~L~~~gI~~i~~~~----~~G~~~wG~rT~~~~s~~~~~~~i~vrR~~~~i~~~~~~~~~-~~v~e~n~~~~~~~i~~~i 351 (419) .|.++|+.++...+ +++++++..-|.-..++++.|++++++|++|+|.+.+++.+. +|++|+|+..+|..++..+ T Consensus 462 ~ai~~Gv~~l~~~~~~~~~~~vrIv~~ItT~t~~~~~~~~~i~viRv~D~i~~dir~~~~~~yIGk~nnd~~~~~vk~~i 541 (607) T protein:vir:10 462 TLNQNGVIGIEHLVNRNATGGYYIVQDVSTNTVSSSHVDGSLYLGELTDFLFDNLRFVLRDTYIGSNIRSTSADDIKSTV 541 (607) T ss_pred HHHhCCeEEEEEccCccccceEEEeeeeeeccCCCCcchheeehhhhHHHHHHHHHHHHhhcCCcccCCcchHHHHHHHH Confidence 99999999886543 346888777776666778899999999999999999998875 8999999999999999999 Q ss_pred HHHHHHHHh--hc-ccceEEEEecccCCHHHhhCCEEEEEEEEEeccCceEEEEEEEEcchHHHHHHHhcC Q lcl|NC_021557. 352 NAYLRSKTG--IA-IYGGTFRFDRQKNTAEQIADGKFYYRLECHPISVMERITIDSYVDTKFISNALSLAA 419 (419) Q Consensus 352 ~~~L~~l~~--~g-~~~~~v~~d~~~n~~~~i~~G~~~~~v~~~p~~p~e~i~~~~~~~~~~~~~~~~~~a 419 (419) ..||..+|. .| |.+|.. + +-+-.....++++++.+.|+.++|+|.+++++.++-|++.-+.-- T Consensus 542 ~~~L~~~~l~~~gaI~df~~----e-dv~v~~~~D~v~v~~~v~Pv~~iekIyvtv~v~~~~~~~~~~~~~ 607 (607) T protein:vir:10 542 ASYLYSEMNNDDGLIVDFSE----S-DIVVTISGTVVYIQFAVAPTQEIKNIVVSGTYSNYSATSEDNTTK 607 (607) T ss_pred HHHHHHHHHHhcCceeCCCc----c-ccEEeeCCCEEEEEEEEEEcccceEEEEEEEEEEEEEeeccCCCC Confidence 999987764 35 444421 1 111123456899999999999999999999999886654433322 No 44 >protein:vir:7653 Length: 581 # NCBI annotation: gp124 # Family: family:all:1196 # MgeID: mge:148 # MgeName: Bxz1 # Cross-refs: genbank:acc:NP_818197;genbank:gi:29566631;genbank:GeneID:1259809 Probab=100.00 E-value=7.2e-33 Score=196.79 Aligned_cols=367 Identities=12% Similarity=0.068 Sum_probs=204.3 Q ss_pred CCCcc-CCCeEEE---EcCCCccCccc-----------------cC-----ccceEEEEccccc--ccccccccccc-cc Q lcl|NC_021557. 1 MAATF-HHGPEVI---EHKDGVTVVRD-----------------VK-----SAVTYVNGTAPIQ--DVHATALARED-YI 51 (419) Q Consensus 1 Ma~~~-~hGVyv~---e~~~~~~~i~~-----------------v~-----tav~~~Vgta~~a--~~~~~~~~~~~-~~ 51 (419) |.-.. .-|++.. +..+.--.... .+ -++...++-.-.. +.........+ .. T Consensus 159 ~~~~l~~~g~~~~~~~~~~s~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~g~~~~~~~i~~~~~~~~D~~~ 238 (581) T protein:vir:76 159 MNRALAKKGIKTDTIRVVNPNSGQVYVLGTDYVVTRVNAGEDGEANTRDDLYTIQRVVDGGHIDPGDIVQLSYRYTDPNY 238 (581) T ss_pred cCceeeeccccccccceeecCCcceeeecccccceeeccCcccceeeeeeeeeeEeecccccccceeEEEEEEEeecCCc Confidence 11010 0133311 11000000000 00 0111111100000 00000000000 01 Q ss_pred CcceeecchHHHHHHhcccchh-----hhHHHHHHHHhhccCCcEEEEeeccccccccccccccccccccceeccccccc Q lcl|NC_021557. 52 NKRVIIRSRAEGAAAFGVHKAG-----YTIPAALDAIFDQGDGGTIIVNNVFDPDVHKEGANPDPSKVTTVDINGTISPA 126 (419) Q Consensus 52 n~pv~its~~e~~~~fg~~~~~-----~~l~~al~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~t~~~~~g~~~~~ 126 (419) ...+++..+.+....++..... ..+.......+.++... .+.+..+.. T Consensus 239 ~~~v~~~~~~~~~~~~~~~~~~~g~~~~e~~~~~~~~~t~~~~~---------------------------~l~~gvd~~ 291 (581) T protein:vir:76 239 HEVIRFTDPDDIQDFYGPAFDEAGNVQSEITLCAQLAITNGAST---------------------------ILACAVDPE 291 (581) T ss_pred cceEEEecccccccceeeehhhcCccccchhhhhheeeccccce---------------------------EEEeeecCC Confidence 2334444444444433322110 00111111111111111 111111110 Q ss_pred cccccchhhhhhhhhcccccccc-ccchhhhhhhHHHHHHHHhhc-----cceeEEEEeccCCC---HHHHHhhhhhccc Q lcl|NC_021557. 127 GLASGFSGAYECYNNFGYFPKLI-IAPGYSPAAAVRAEMDVVASR-----LHALAIADLPLGLT---KQQAVAARGVAGT 197 (419) Q Consensus 127 ~~~tg~~a~~~~~~~~~~~p~~~-~ap~~~~~~~v~a~l~~~~~~-----~~~~~i~d~p~~~~---~~~~~~~~~~~~~ 197 (419) +.......+..++..+...+... ++| .+..+.+++++.+++++ ..+.+++..+.... .++.+. .. T Consensus 292 g~tvt~~dy~~aL~ale~~~~~~ivvp-~t~~~~i~a~l~ahv~~~s~~~~~~ra~igv~g~~~~~~~~~~~~-----~a 365 (581) T protein:vir:76 292 GDTVTMGDYQNALNKFRDEDEIAIIVA-GTGAQPIQALVQQHVSAQSNNKYERRAILGMDGSVTPVPSATRIA-----NA 365 (581) T ss_pred CCccchHHHHHHHHHHhcCCeEEEEEe-cCCChHHHHHHHHHHHHHHhccCCceEEEEeeCCCCCchHHHHHH-----hh Confidence 10111112222333333333222 233 33445566655444321 23456666553333 233332 23 Q ss_pred cccCccceEEecceeEeeccccccceeeechHHHHHHHHHhhhhccCceecccCceeeceeecceecccccCCcchhhcc Q lcl|NC_021557. 198 ANTSSARTVLTYPHVVIEDTTGATETRLDPLSSRLAGVIIATDLNEGWQNSPSNREIKGVVDLEVPINFYPSDYQNDTNF 277 (419) Q Consensus 198 ~~~~s~~~~~~~p~~~~~~~~~~~~~~~~p~s~~vAg~~a~~D~~~g~~~span~~l~gv~~~~~~~~~~~~~~~~~~~~ 277 (419) .++++.|..++||+..+++.........+| ..++|+.+|.+..+..+++||.|++++|+.++...+ +..|.+. T Consensus 366 ~~~ns~Rvvlv~p~~~~~~g~~~~~~~~lp-~~~~AA~vAG~~a~~~~~~slT~~~i~g~~~~~~~~------s~~e~e~ 438 (581) T protein:vir:76 366 QSIKDQRVALISPSSFVYYAPELNREVVLG-GQFMAAAVAGKSVSAIAAMPLTRKVIRGFSGPAEVQ------RDGEKSR 438 (581) T ss_pred cccCCCcEEEEEcCceEeccccCCcceecc-hhhhhhhHHhhhhccccccCcccccccccccccccC------CHHHHHH Confidence 467899999999999888755444444444 445555556666666799999999999998776665 3567889 Q ss_pred ccCCceEEEEEecCCcEEE-EeccccCCCCCcccceeeehhhHHHHHHHHHHHHHH--HhhcCCCCHHHHHHHHHHHHHH Q lcl|NC_021557. 278 LNEAGIVTAMRSFATGIRV-FGNRSAAFPTSSHVENFIHARRILDMIHEAIIFYTM--NYVDRLGSPMTVEAAEEGVNAY 354 (419) Q Consensus 278 L~~~gI~~i~~~~~~G~~~-wG~rT~~~~s~~~~~~~i~vrR~~~~i~~~~~~~~~--~~v~e~n~~~~~~~i~~~i~~~ 354 (419) |+++|++++...+++|+++ ||-+|+ .+++.|+++++||++|++.+.+++.++ .|++|||+..+|.+|+..+..| T Consensus 439 ll~~Gv~~l~~~~~~~v~Iv~gItT~---~s~~~~k~i~viR~~D~v~~~vr~~~~~~~fiG~~n~~~~r~~ik~~i~~~ 515 (581) T protein:vir:76 439 ESSEGLMVIEKTPRNLVHVRHGVTTD---PTSLHTREWNIIGQQDVMVYRIRDYLDADGLIGMPIYDTTIVQVKASAEAA 515 (581) T ss_pred HHhCCeEEEEEecCCeEEEEEeeecC---CCCCccceeeehhhhHHHHHHHHHHHhhhcCCCcccChHHHHHHHHHHHHH Confidence 9999999998888889875 777776 356789999999999999999999986 5788999999999999999999 Q ss_pred HHHHHhhc-ccceEEEEecccCCHHHhhCCEEEEEEEEEeccCceEEEEEEEEcchHHHHHHHhcC Q lcl|NC_021557. 355 LRSKTGIA-IYGGTFRFDRQKNTAEQIADGKFYYRLECHPISVMERITIDSYVDTKFISNALSLAA 419 (419) Q Consensus 355 L~~l~~~g-~~~~~v~~d~~~n~~~~i~~G~~~~~v~~~p~~p~e~i~~~~~~~~~~~~~~~~~~a 419 (419) |..||+.| |.+|+ ..+.+....+.+++++++.++|++|+|||.+++++.++. +++. T Consensus 516 L~~l~~~g~I~g~~----~~~~~~~~~~~d~v~V~i~v~Pv~~ie~I~vt~~~~p~~-----~~~~ 572 (581) T protein:vir:76 516 LVWLVDNNIIRGYR----NLKARQIERQPDVIEVRYEWRPAYPLNYIVVRYSIAPET-----GDIT 572 (581) T ss_pred HHHHHhcCcccCcc----cceeeEEecCCCEEEEEEEEEecccceEEEEEEEEeeCC-----CceE Confidence 99999876 55554 223455567889999999999999999999999998863 2222 No 45 >protein:vir:102957 Length: 437 # NCBI annotation: sheath tail protein # Family: family:all:632 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945292;genbank:gi:39653727;uniprot:Q708M0;genbank:GeneID:2672871 Probab=99.96 E-value=7.6e-30 Score=180.22 Aligned_cols=376 Identities=10% Similarity=0.022 Sum_probs=220.6 Q ss_pred CCC-c------cCCCeEEEEcCCCccCccccCccceEEEEccccccccccccccccccCcceeecchHHHHHHhcccchh Q lcl|NC_021557. 1 MAA-T------FHHGPEVIEHKDGVTVVRDVKSAVTYVNGTAPIQDVHATALAREDYINKRVIIRSRAEGAAAFGVHKAG 73 (419) Q Consensus 1 Ma~-~------~~hGVyv~e~~~~~~~i~~v~tav~~~Vgta~~a~~~~~~~~~~~~~n~pv~its~~e~~~~fg~~~~~ 73 (419) |+- + ..||||+++.+.+.+++..+++++.+|+|.+..+ |.++|+.|+|+.++...||..... T Consensus 1 m~gg~~~~~~k~~PGvYi~~~~~~~~~i~~~~~~~~a~~~~~~~G-----------p~~~~~~i~s~~d~~~~fG~~~~~ 69 (437) T protein:vir:10 1 MAGGIWKRQNKVRPGAYINVKSKDIAMTRLGGDGVVTVPLALSFG-----------QSKKLMKIRRGEDLFKKLGYEQES 69 (437) T ss_pred CCcceecccceecCceeEEEecCCcceeeccCCcEEEEEEEecCC-----------CCceeEEEecHHHHHHHcCCccch Confidence 773 1 3699999999999999999999999999976655 458999999999999999976543 Q ss_pred hhHHHHHHHHhhccCCcEEEEeecccccccccc----------ccccc--ccc---------ccceecccccccc-cccc Q lcl|NC_021557. 74 YTIPAALDAIFDQGDGGTIIVNNVFDPDVHKEG----------ANPDP--SKV---------TTVDINGTISPAG-LASG 131 (419) Q Consensus 74 ~~l~~al~~~~~~~~~~~~v~~~~~~~~~~~~~----------~~~~~--~~~---------t~~~~~g~~~~~~-~~tg 131 (419) ..+. .+..+| +++..+++++........... .+... ..+ +..++........ +... T Consensus 70 ~~~~-~~~~~~-~g~~~~~~~R~~~g~~a~~tl~~~~~~~A~~~G~~gn~i~v~v~~~~~d~~~~~v~~~~~~~~~d~~~ 147 (437) T protein:vir:10 70 PQLL-LLNEAF-KRVSEVLLYRLNTGEKANVSLSDNVTAQAKYSGVRGNDITVTVKTNVDDPSSFDVVTFLDTVVMDLQT 147 (437) T ss_pred hHHH-HHHHHh-cCCCEEEEEECCCCceeeEeeccceEEEeccCCcccceeEEEEeeccCCccceEEEEecCcceeeeee Confidence 3332 344444 567777777754321111100 00000 000 0000000000000 0000 Q ss_pred chhhhh----hhhh------ccccccccccchhh---hhhhHHHHHHHHhhccceeEEEEeccCC--CHHHHHhhhhhcc Q lcl|NC_021557. 132 FSGAYE----CYNN------FGYFPKLIIAPGYS---PAAAVRAEMDVVASRLHALAIADLPLGL--TKQQAVAARGVAG 196 (419) Q Consensus 132 ~~a~~~----~~~~------~~~~p~~~~ap~~~---~~~~v~a~l~~~~~~~~~~~i~d~p~~~--~~~~~~~~~~~~~ 196 (419) +....+ .... +...+...+.-+.+ .......+|..++.. + +-++-+|... ......+|-.... T Consensus 148 v~~~~~~~~n~~v~~~~~~~l~~~a~~~LtGG~dg~~t~~dy~~al~~le~~-~-~n~l~~~~~d~~~~t~~~~~ik~~r 225 (437) T protein:vir:10 148 VKVLADLKNNALVEFSGTGELQPVAGAKLTGGTDGAISTQDYLEYFKALETV-E-FNYMALPVEDASIKKAAINFIKRMR 225 (437) T ss_pred hhhhhhhhhhcccccccccccccccceeeeccccCCCChhHHHHHHHHhccC-c-ceEEEecCCChhHHHHHHHHHHHHH Confidence 000000 0000 00001111111111 111244566666532 2 2333344322 1223333321111 Q ss_pred ccccCccceEEecce-------e-Eeeccccccceeee---chHHHHHHHHHhhhhccCceecccCceeeceeecceecc Q lcl|NC_021557. 197 TANTSSARTVLTYPH-------V-VIEDTTGATETRLD---PLSSRLAGVIIATDLNEGWQNSPSNREIKGVVDLEVPIN 265 (419) Q Consensus 197 ~~~~~s~~~~~~~p~-------~-~~~~~~~~~~~~~~---p~s~~vAg~~a~~D~~~g~~~span~~l~gv~~~~~~~~ 265 (419) .. ...+...+-+. + .+...........+ -..+.+||++|.++ +.+|+.|+.+.|+..+...+ T Consensus 226 ~~--~g~~~~~V~~~~~~d~e~Iin~~n~~~~~~~~~~~~~~~~a~vAG~~Ag~~----~~~S~t~~~~~~~~~v~~~~- 298 (437) T protein:vir:10 226 ED--EGLGAQLVVADSDADSEAVINVKNGVILSDKTVIDKTKATVWVAAASANAG----VEKSLTYEKYEDSVDVVGRL- 298 (437) T ss_pred hc--cCceEEEEeCCCCCCCceEEEeecceeecCcceechhhHHHHHHHHhccCc----cccCccccccCCcccccccC- Confidence 10 11122111111 0 00000000001112 23577888888774 88899999999887766554 Q ss_pred cccCCcchhhccccCCceEEEEEecCCcEE-EEeccccCC--CCCcccceeeehhhHHHHHHHHHHHHHH-HhhcC-CCC Q lcl|NC_021557. 266 FYPSDYQNDTNFLNEAGIVTAMRSFATGIR-VFGNRSAAF--PTSSHVENFIHARRILDMIHEAIIFYTM-NYVDR-LGS 340 (419) Q Consensus 266 ~~~~~~~~~~~~L~~~gI~~i~~~~~~G~~-~wG~rT~~~--~s~~~~~~~i~vrR~~~~i~~~~~~~~~-~~v~e-~n~ 340 (419) +..|.+.|.++|+.++.+ .+++++ ++|-.|+.. ...++.|++|.++|++|+|.+.++..+. .|+++ ||+ T Consensus 299 -----t~~e~~~~i~~G~~vl~~-~~~~v~i~~gInTltt~~~~~~~~~~ki~vir~~D~i~~di~~~~~~~yiGk~~N~ 372 (437) T protein:vir:10 299 -----SHTETEDALLKGQFVFTA-RRGRAVVEQDINSHVSFTIEKNQDFRKNRILRTLDDIVNDTRYAFSEYFLGKVSNN 372 (437) T ss_pred -----CHHHHHHHHhCCcEEEEE-eCCeEEEEEccccccccCCCCCchhhhhhHHHHHHHHHHHHHHHHHhccccccCCC Confidence 456788899999998765 455444 477666553 2346689999999999999999999887 59998 799 Q ss_pred HHHHHHHHHHHHHHHHHHHhhcc-cceEEEEecccCCHHHhhCCEEEEEEEEEeccCceEEEEEEEEc Q lcl|NC_021557. 341 PMTVEAAEEGVNAYLRSKTGIAI-YGGTFRFDRQKNTAEQIADGKFYYRLECHPISVMERITIDSYVD 407 (419) Q Consensus 341 ~~~~~~i~~~i~~~L~~l~~~g~-~~~~v~~d~~~n~~~~i~~G~~~~~v~~~p~~p~e~i~~~~~~~ 407 (419) ...|..++..+..||.+|+++|+ ..|.++.++..+.. ....+++++.++|+.+||+|.+++... T Consensus 373 ~~~r~~~~~~i~~yl~~l~~~g~I~~~~~~d~~v~~~~---~~~~v~v~~~v~~~dame~iy~ti~v~ 437 (437) T protein:vir:10 373 EDGRQAFKANRIRYFKDLEARGAIEDFKVEDIEVLRGE---LKESVVVNVKVKPVDSMEKLYMTVTVE 437 (437) T ss_pred HHHHHHHHHHHHHHHHHHHhCCCccCCCceeEEeecCC---CCCEEEEEEEEEEeeeeeeEEEEEEec Confidence 99999999999999999998775 45666655543222 356889999999999999999999887 No 46 >protein:vir:105470 Length: 451 # NCBI annotation: putative phage sheath tail protein # Family: family:all:632 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529880;genbank:gi:90592620;genbank:GeneID:3974534 Probab=99.93 E-value=1e-26 Score=163.03 Aligned_cols=375 Identities=12% Similarity=0.020 Sum_probs=213.7 Q ss_pred CCC-c------cCCCeEEEEcCCCccCccccCccceEEEEccccccccccccccccccCcceeecchHHHHHHhcccchh Q lcl|NC_021557. 1 MAA-T------FHHGPEVIEHKDGVTVVRDVKSAVTYVNGTAPIQDVHATALAREDYINKRVIIRSRAEGAAAFGVHKAG 73 (419) Q Consensus 1 Ma~-~------~~hGVyv~e~~~~~~~i~~v~tav~~~Vgta~~a~~~~~~~~~~~~~n~pv~its~~e~~~~fg~~~~~ 73 (419) |+- + .+||||+++++++.+++..+.++++++||.+... . .++|+.++|+.++...||..... T Consensus 1 magg~~~~~~K~~PGvYi~~~~~~~~~~~~~~~~~~~~i~~~~~~----------g-~~~~v~i~~~~d~~~~fG~~~~~ 69 (451) T protein:vir:10 1 MAGGTWKAQDKRRPGTYINVVGNGQREAASSLGRVLLIRDKGLGW----------G-KNGVIEVEANSDFTKKLGTTLDD 69 (451) T ss_pred CCceeeccceeecCceEEEEeccCcceeeccCCcEEEEEeeecCC----------C-CcccEEeecHHHHHHHcCCcccc Confidence 774 1 3699999999999999999999999999853322 1 26799999999999999976554 Q ss_pred hhHHHHHHHHhhccCCcEEEEeeccccccccc-----------cccccc--ccc---------ccceecccccccc-ccc Q lcl|NC_021557. 74 YTIPAALDAIFDQGDGGTIIVNNVFDPDVHKE-----------GANPDP--SKV---------TTVDINGTISPAG-LAS 130 (419) Q Consensus 74 ~~l~~al~~~~~~~~~~~~v~~~~~~~~~~~~-----------~~~~~~--~~~---------t~~~~~g~~~~~~-~~t 130 (419) ..+ .+++.++. ++...++++.......... +.+... -.+ +..++.-..+.+. +.. T Consensus 70 ~~~-~~~~~~~~-g~~~v~~yrl~~g~~a~~t~~~~~~~~~Aky~G~~Gn~i~v~v~~~~~d~~~~~v~t~~g~~~vd~q 147 (451) T protein:vir:10 70 PSL-TALKETLK-GASKVLVLNPNEGTAATLTKEGLPWTVTANYPGEKGNQITVSVEVSPADQNAATVSTIFGTKLVDEQ 147 (451) T ss_pred hhH-HHHHHHhc-CCcEEEEEEcCCCceEEEEeecCceEEEEeeCCcCCceEEEEEecccCCcCceEEEEEECCeEEEEE Confidence 433 36666664 5566666654322211111 010000 000 0000000000000 000 Q ss_pred cc---hhhhhhhhhc-----------ccccccc-ccc--hhh---hhhhHHHHHHHHhh-ccceeEEEEeccCCCHH-HH Q lcl|NC_021557. 131 GF---SGAYECYNNF-----------GYFPKLI-IAP--GYS---PAAAVRAEMDVVAS-RLHALAIADLPLGLTKQ-QA 188 (419) Q Consensus 131 g~---~a~~~~~~~~-----------~~~p~~~-~ap--~~~---~~~~v~a~l~~~~~-~~~~~~i~d~p~~~~~~-~~ 188 (419) .. .........+ ....... ..+ +.. .......+|..... ..+.+++.....+.... .+ T Consensus 148 tv~~~~~~el~~nd~V~a~~~~~g~~~~~~~~~l~~~~~gg~~~~~~~~~~~~l~~~e~~~~n~l~~~~~~~~~~i~~~~ 227 (451) T protein:vir:10 148 SIKFNELDKFKGNDYITAKVVEEGSSKPVAFTNVSGTLTGGTTTESNKVESLLNDALENEEYAVVTTAGFEPSSNMNKLV 227 (451) T ss_pred EeeccchhhccCCceEEEEecccccccceeeeecccccccccccCCccchHHHHHHhccceeeEEEEccCCCchHHHHHH Confidence 00 0000000000 0000000 000 000 00112234544433 22333322111111112 22 Q ss_pred Hhhhhhccccc---------------cCccceEEecceeEeeccccccceeeech---HHHHHHHHHhhhhccCceeccc Q lcl|NC_021557. 189 VAARGVAGTAN---------------TSSARTVLTYPHVVIEDTTGATETRLDPL---SSRLAGVIIATDLNEGWQNSPS 250 (419) Q Consensus 189 ~~~~~~~~~~~---------------~~s~~~~~~~p~~~~~~~~~~~~~~~~p~---s~~vAg~~a~~D~~~g~~~spa 250 (419) .+|-....... ++....+.+.+..... + ...+++ .+.+||++|.+. +.+|+. T Consensus 228 ~a~ik~~r~~~g~~~~aVl~~~~~~~~d~egiinv~n~~~~~-----d-g~~~~~~~~~~~vAG~~Ag~~----~~~S~T 297 (451) T protein:vir:10 228 VEAVKRLRENEGRKVRGVIPTDADTTYNYEGISTVVNGYTLS-----D-GTNVDVKDATGYFAGISASAD----VATSLT 297 (451) T ss_pred HHHHHHHHHhcCCeEEEEecCccCCCCCCcceEEeecceEec-----C-ceeechhhhHHHHHHHHcccc----cccCcc Confidence 33322111100 1111112222211111 1 122344 478889988874 778999 Q ss_pred CceeeceeecceecccccCCcchhhccccCCceEEEEEecCCcEEE-EeccccCCC--CCcccceeeehhhHHHHHHHHH Q lcl|NC_021557. 251 NREIKGVVDLEVPINFYPSDYQNDTNFLNEAGIVTAMRSFATGIRV-FGNRSAAFP--TSSHVENFIHARRILDMIHEAI 327 (419) Q Consensus 251 n~~l~gv~~~~~~~~~~~~~~~~~~~~L~~~gI~~i~~~~~~G~~~-wG~rT~~~~--s~~~~~~~i~vrR~~~~i~~~~ 327 (419) |+.+.|+..+...+ +..|.+.+.++|..++....++++++ .|-.|+... ..++.|+.|.++|++|+|.+.+ T Consensus 298 ~~~~~~~~~v~~~~------t~~e~~~~i~~G~lvl~~~~g~~v~i~~~INTltt~~~~k~~~~~ki~vir~~D~i~~di 371 (451) T protein:vir:10 298 YFEVEDAVSAYPKF------DNEKTIKALDAGQIVFTTRPGQRVVIEQDINSLHKFTAEKPQAFSKNRVIRTLDEIATNT 371 (451) T ss_pred ceecCCceeeeeeC------CHHHHHHHHhCCeEEEEEEcCCeEEEEEccccceecCCCCCcchhhhhHHHHHHHHHHHH Confidence 99999888776555 46677889999998765556777754 777776532 2355799999999999999999 Q ss_pred HHHHHH-hhcC-CCCHHHHHHHHHHHHHHHHHHHhhccc-ceEEEEecccCCHHHhhCCEEEEEEEEEeccCceEEEEEE Q lcl|NC_021557. 328 IFYTMN-YVDR-LGSPMTVEAAEEGVNAYLRSKTGIAIY-GGTFRFDRQKNTAEQIADGKFYYRLECHPISVMERITIDS 404 (419) Q Consensus 328 ~~~~~~-~v~e-~n~~~~~~~i~~~i~~~L~~l~~~g~~-~~~v~~d~~~n~~~~i~~G~~~~~v~~~p~~p~e~i~~~~ 404 (419) +..+.. |+++ ||+...|..++..|..||.+|++.|++ .|.. .|.+. ...-....+++++.++|+..||+|.+++ T Consensus 372 ~~~~~~~yiGk~~N~~~gr~~~~~~i~~yl~~l~~~g~i~~~~~-~d~~v--~~~~~~~~v~v~~~v~pvdame~iy~t~ 448 (451) T protein:vir:10 372 ENTFERTYLGNVGNNAAGRDLFKADRIAYLTSLQNRNMIQSFAN-TDITV--EAGNDMDSIVVNLAVTPVDAMEKLYMTM 448 (451) T ss_pred HHHhhhccceecCCCHHHHHHHHHHHHHHHHHHHhCCCccCCCc-cceEE--eecCCCCEEEEEEEEEEEeeeeeEEEEE Confidence 999874 9886 699999999999999999999997755 4432 12111 1111367799999999999999999998 Q ss_pred EEc Q lcl|NC_021557. 405 YVD 407 (419) Q Consensus 405 ~~~ 407 (419) ++- T Consensus 449 ~v~ 451 (451) T protein:vir:10 449 VVR 451 (451) T ss_pred EEc Confidence 887 No 47 >protein:vir:101326 Length: 529 # NCBI annotation: gp22 # Family: family:all:9453 # MgeID: mge:1512 # MgeName: P1 # Cross-refs: genbank:acc:YP_006525;genbank:gi:46401679;genbank:GeneID:2777386 Probab=99.87 E-value=4.5e-23 Score=143.07 Aligned_cols=378 Identities=14% Similarity=0.076 Sum_probs=224.5 Q ss_pred CCCcc-------CCCeEEEEcCCCc--cCccccCccceEEEEccccccccccccccccccCcceeec--chHHHHHHhcc Q lcl|NC_021557. 1 MAATF-------HHGPEVIEHKDGV--TVVRDVKSAVTYVNGTAPIQDVHATALAREDYINKRVIIR--SRAEGAAAFGV 69 (419) Q Consensus 1 Ma~~~-------~hGVyv~e~~~~~--~~i~~v~tav~~~Vgta~~a~~~~~~~~~~~~~n~pv~it--s~~e~~~~fg~ 69 (419) |+.|- .-||-|.+++... ..-....+++-++||-+.++. +.+|.+++ +|.++...+.. T Consensus 1 ~~~ysi~q~ig~aSGvav~pi~~d~t~~~~~g~g~~v~a~Vgif~RG~-----------i~k~~~Vt~~n~~~~LGep~~ 69 (529) T protein:vir:10 1 MSQYSIQQSLGNASGVAVSPINADATLSTGVALNSSLWAGIGVFARGK-----------PFTVLAVTESNYEDVLGEPLK 69 (529) T ss_pred CCceehhhhhhhhcccccCCcCcccccchheecCceEEEEEEEeecCC-----------CcceEEEchhHHHHHhccccC Confidence 87531 1399998776322 223455788999999776664 46788887 77776655554 Q ss_pred cchhhhHHHHHHH-HhhccCCcEEEEeecccccccccccc-------------------------------cccc----- Q lcl|NC_021557. 70 HKAGYTIPAALDA-IFDQGDGGTIIVNNVFDPDVHKEGAN-------------------------------PDPS----- 112 (419) Q Consensus 70 ~~~~~~l~~al~~-~~~~~~~~~~v~~~~~~~~~~~~~~~-------------------------------~~~~----- 112 (419) .... ...+.+.+ ++...+..+++++++..........- .++. T Consensus 70 ~~~g-a~~E~~~h~~eA~~~~s~yVVRvv~~dak~p~i~~~~~~~~~~s~~~~s~~~~l~~G~~~~iy~~Dgd~~~s~~~ 148 (529) T protein:vir:10 70 PSSG-SQFEPIRHVYEAIQQTSGYVVRAVPDDAKFPIIMFDESGEPAYSALPYGSEIELDSGEAFAIYVDDGDPCISPTR 148 (529) T ss_pred CCcc-hhhhhHhhhhhhhcCCceEEEEEcccccCCceEEecCCccchhhcccccccccccccceEEEEEecCcCccCCce Confidence 4333 33333333 33334445777776554432221000 0000 Q ss_pred ----ccccceecccc------------------------------ccccc----cccchh-------------------- Q lcl|NC_021557. 113 ----KVTTVDINGTI------------------------------SPAGL----ASGFSG-------------------- 134 (419) Q Consensus 113 ----~~t~~~~~g~~------------------------------~~~~~----~tg~~a-------------------- 134 (419) ..+.++..|.. ++-+. .+.++. T Consensus 149 ~l~i~~~~ads~g~e~~~l~~~~~~~~g~~~~let~~~sl~~~a~dd~G~~~yl~svle~~s~~l~ai~~~e~~~t~~~~ 228 (529) T protein:vir:10 149 ELTIETATADSAGNERFLLKLTQTTSLGVVTTLETHTVSLAEEAKDDMGRLCYLPTALEARSKYLRAVVNEELISTAKVT 228 (529) T ss_pred EEEEEeeccccCCCccceeeEEEEeecCCceEEEEEEeeeeechhhhcCCccchhHHHhhccCceeeeeeeccccccchh Confidence 00000000000 00000 000000 Q ss_pred ----------------------hhhhhhhccccc--cccccchhhhhhhHHHHHHHHhhccceeEEEEeccCCCHHHHHh Q lcl|NC_021557. 135 ----------------------AYECYNNFGYFP--KLIIAPGYSPAAAVRAEMDVVASRLHALAIADLPLGLTKQQAVA 190 (419) Q Consensus 135 ----------------------~~~~~~~~~~~p--~~~~ap~~~~~~~v~a~l~~~~~~~~~~~i~d~p~~~~~~~~~~ 190 (419) ...+-..+...| ...+.-.....+.+.++|..+|++.++.+..|+|..+++.++.. T Consensus 229 t~~d~~f~~GtdG~~~~i~s~~y~~A~~~L~n~p~d~~~il~~g~y~~a~I~~L~~ic~~~~~d~f~DV~~~LT~~aA~~ 308 (529) T protein:vir:10 229 NKKSLAFTGGTNGDQSKISTAAYLRAVKVLNNAPYMYTAVLGLGCYDNAAITALGKICADRLIDGFFDVKPTLTYAEALP 308 (529) T ss_pred hhhhhhccCCccccccccchHHHHHHHHHhcCCcceeeeeeccCCccHHHHHHHHHHHhhhhhcEEEcCCCCcCHHHHHH Confidence 000000010001 00011122335666777888888888888889999999999999 Q ss_pred hhhhccccccCccc-eEEecceeEeeccccccceeeechHHH--HHHHHH--hhhhccCceecccCceeeceeecceecc Q lcl|NC_021557. 191 ARGVAGTANTSSAR-TVLTYPHVVIEDTTGATETRLDPLSSR--LAGVII--ATDLNEGWQNSPSNREIKGVVDLEVPIN 265 (419) Q Consensus 191 ~~~~~~~~~~~s~~-~~~~~p~~~~~~~~~~~~~~~~p~s~~--vAg~~a--~~D~~~g~~~span~~l~gv~~~~~~~~ 265 (419) |....+...-.+-+ +..+|||. ..|+. ++....+++||. +|...+ +.-.-.|.|.+||++. ++++.- ..|. T Consensus 309 ~~e~~gl~~~~~~~~s~y~~P~~-~~D~~-tg~k~~~GlsG~A~~akargv~~na~v~g~hY~pAGe~-r~~inr-~~I~ 384 (529) T protein:vir:10 309 AVEDTGLLGTDYVSCSVYHYPFS-CKDKW-TQSRVVFGLSGVAYAAKARGVKKNSDVGGWHYSPAGEE-RAVIAR-ASIQ 384 (529) T ss_pred HHHhcCccccCceeeEEEEccee-ecccc-ccCceeeCCCcceeeccccceeecccccccccccCCCc-cceeec-ccce Confidence 98765543333323 34667776 55654 444558899984 332222 2222334599999986 444332 2333 Q ss_pred cccCCcchhhccccCCceEEEEEecCCcE----EEEeccccCCCCCcccceeeehhhHHHHHHHHHHHHHHHhhcCCCCH Q lcl|NC_021557. 266 FYPSDYQNDTNFLNEAGIVTAMRSFATGI----RVFGNRSAAFPTSSHVENFIHARRILDMIHEAIIFYTMNYVDRLGSP 341 (419) Q Consensus 266 ~~~~~~~~~~~~L~~~gI~~i~~~~~~G~----~~wG~rT~~~~s~~~~~~~i~vrR~~~~i~~~~~~~~~~~v~e~n~~ 341 (419) .-....+.|...|-.++||++.-..++++ .+||+|+ +..|||+|+++|+++|++.+.+..++.+|||++. T Consensus 385 ~ly~~d~~e~~~lv~~riNPV~~~~~g~~~idDsLt~~~k------nny~R~~hv~~lmn~I~~~~~k~a~~~~~~Pd~i 458 (529) T protein:vir:10 385 PLYPEDTPDEEAMVKGRLNKVSVGTSGQMIIDDALTCCTQ------DNYLHFQHVPSLMNAISRFFVQLARQMKHSPDGI 458 (529) T ss_pred eccCCCccCHHHHHhhccCeeeeeccCcceeeeeeceeee------CCchhhhhHHHHHHHHHHHHHHHHHHHhhCCChH Confidence 22333444555677888888876665554 4555554 4579999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHhhcccc------------eEEEEecccCCHHHhhCCEEEEEEEEEeccCceEEEEEEEEcc Q lcl|NC_021557. 342 MTVEAAEEGVNAYLRSKTGIAIYG------------GTFRFDRQKNTAEQIADGKFYYRLECHPISVMERITIDSYVDT 408 (419) Q Consensus 342 ~~~~~i~~~i~~~L~~l~~~g~~~------------~~v~~d~~~n~~~~i~~G~~~~~v~~~p~~p~e~i~~~~~~~~ 408 (419) .+|. +++-++.+|..+|+.|++- |.+.. + ..+.++|.+++.++|+..+..|.+.-..-. T Consensus 459 t~~g-l~~~l~~~L~r~~asgalv~prdp~~~G~epy~~~V-----~--q~d~D~~~v~~~~~ptGv~Rri~~~p~l~~ 529 (529) T protein:vir:10 459 TAAG-LTKGMTKLLDRFVASGALVAPRDPDADGTEPYVLKV-----T--QAEFDKWEVVWACCPTGVARRIQGVPLLIK 529 (529) T ss_pred HHHH-HHHhHHHHHHHHHhcCceecccCccCCCCCceEEEE-----e--ecccCeEEEEEEeecCCceeeEEeeeeecC Confidence 9987 9999999999999877542 22222 2 334599999999999999999987655444 No 48 >protein:vir:78986 Length: 436 # NCBI annotation: putative sheath tail protein # Family: family:all:632 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110731;genbank:gi:134287348;genbank:GeneID:4955160 Probab=99.68 E-value=3.4e-16 Score=105.40 Aligned_cols=372 Identities=12% Similarity=0.020 Sum_probs=202.4 Q ss_pred CCC------c-cCCCeEEEEcCCCccCccccCccceEEEEccccccccccccccccccCcceeecch---HHHHHHhccc Q lcl|NC_021557. 1 MAA------T-FHHGPEVIEHKDGVTVVRDVKSAVTYVNGTAPIQDVHATALAREDYINKRVIIRSR---AEGAAAFGVH 70 (419) Q Consensus 1 Ma~------~-~~hGVyv~e~~~~~~~i~~v~tav~~~Vgta~~a~~~~~~~~~~~~~n~pv~its~---~e~~~~fg~~ 70 (419) |+- + .+||+|+.-.+.....+.....++.++...+ .|++.++++.|++. .+....||.. T Consensus 3 magg~~~~~~K~~PG~Y~n~~~~~~~~~~~~~rGi~a~p~~~-----------~wGp~~~v~~i~~~~~~~~~~~~~G~~ 71 (436) T protein:vir:78 3 LGGGTFVTQNKVLPGSYINFVSATRATSSLSDRGIVAMPLEL-----------DWGIDEEVFQVTSDDFEKYSTKYFGYD 71 (436) T ss_pred ccceeeccceeecCceEEEEEecCcceeeccCCeEEEEEEEe-----------cCCCCceeEEeecccchHHHHHHhcCc Confidence 442 2 3699999999888888888888888877653 35566889999874 4667778876 Q ss_pred chhhhHHHHHHHHhhccCCcEEEEeecccccccc-cc-----ccccc--cc---------cccceeccccccccc-cccc Q lcl|NC_021557. 71 KAGYTIPAALDAIFDQGDGGTIIVNNVFDPDVHK-EG-----ANPDP--SK---------VTTVDINGTISPAGL-ASGF 132 (419) Q Consensus 71 ~~~~~l~~al~~~~~~~~~~~~v~~~~~~~~~~~-~~-----~~~~~--~~---------~t~~~~~g~~~~~~~-~tg~ 132 (419) .....+. .++..|.+. ...+.++.. .+.... .. .+... -. .+..++.-..+.... .... T Consensus 72 ~~~~~~~-~l~~~~~~~-~tv~~yrl~-~G~~a~~~v~~Aky~g~~gn~i~v~v~~~~~d~~~~dv~~~~g~~~~d~~~~ 148 (436) T protein:vir:78 72 YTHEKLK-GLRDLFKNI-RLGYFYKLN-KGVKASCSIATARCSGIRGNDLKVIVTTNIDDNAKFDVVTLLDNKKVDTQIA 148 (436) T ss_pred cchHHHH-HHHHHhcCC-CEEEEEECC-CcceeeeeeeeeecCCCCCcEEEEEecccccccCceEEEEEecchhhhhhhH Confidence 4433332 344444322 333333332 221110 00 00000 00 000000000000000 0000 Q ss_pred hhhhh-hhh---------hccccccccccchhhh----hhhHHHHHHHHhhccceeEEEEeccCCC--HHHHHhhhhhcc Q lcl|NC_021557. 133 SGAYE-CYN---------NFGYFPKLIIAPGYSP----AAAVRAEMDVVASRLHALAIADLPLGLT--KQQAVAARGVAG 196 (419) Q Consensus 133 ~a~~~-~~~---------~~~~~p~~~~ap~~~~----~~~v~a~l~~~~~~~~~~~i~d~p~~~~--~~~~~~~~~~~~ 196 (419) ....+ ... .+.......+..|.+. ......+|..++.. + |-++-+|.... ...+.+|-.... T Consensus 149 ~~~~~l~~n~~V~~~~~g~la~~a~~~LtGG~dG~~~T~~dy~~al~~le~~-~-fn~l~~~~~d~~~~~~~~a~ikr~r 226 (436) T protein:vir:78 149 KVITELQDNDYVTWKKEATLEATAGLTFTNGTNGEAVTGTEYQAFLDKIESY-S-FNALGCLATTAEIKSLFVEFTKRMR 226 (436) T ss_pred HHHhhccCCceEEEEecccccccceeeeeccccccccchHHHHHHHHHHccc-c-eeEEEecCCChHHHHHHHHHHHHHH Confidence 00000 000 0001111111111111 12344566665443 2 33444443211 223333322111 Q ss_pred ccccCccceEEecceeEeeccc------cc-cceee--echHHHHHHHHHhhhhccCceecccCceeeceeecceecccc Q lcl|NC_021557. 197 TANTSSARTVLTYPHVVIEDTT------GA-TETRL--DPLSSRLAGVIIATDLNEGWQNSPSNREIKGVVDLEVPINFY 267 (419) Q Consensus 197 ~~~~~s~~~~~~~p~~~~~~~~------~~-~~~~~--~p~s~~vAg~~a~~D~~~g~~~span~~l~gv~~~~~~~~~~ 267 (419) .. .+-+..++.++ ....|.. .+ ....+ .-..+.+||++|.++ +.+|+.|+.+.++.++...+ T Consensus 227 e~-~g~~~~aV~~~-~~~~d~EgIInv~n~v~g~~~~~~~~~a~vAG~~Ag~~----~~~S~T~~~~~~~~~v~~~~--- 297 (436) T protein:vir:78 227 DK-VGAKFQTVLYK-KNDADYEGVVSVENKIKDTGLLESSLIYWTTGAIAGCD----INKSNTNKRYDGEFDVDVNY--- 297 (436) T ss_pred hh-cCCeEEEEecC-CCCCCCceEEEeecccCCceechhHHHHHHHHHHhcCc----cccCccceecCccccccccC--- Confidence 11 11111111111 0001100 00 11111 124677888888775 77799999998887665554 Q ss_pred cCCcchhhccccCCceEEEEEecCCcEEEEecc-ccC--CCCCcccceeeehhhHHHHHHHHHHHHHH-HhhcC-CCCHH Q lcl|NC_021557. 268 PSDYQNDTNFLNEAGIVTAMRSFATGIRVFGNR-SAA--FPTSSHVENFIHARRILDMIHEAIIFYTM-NYVDR-LGSPM 342 (419) Q Consensus 268 ~~~~~~~~~~L~~~gI~~i~~~~~~G~~~wG~r-T~~--~~s~~~~~~~i~vrR~~~~i~~~~~~~~~-~~v~e-~n~~~ 342 (419) +..|.+.+.++|..++.+ .++++++--+- |+. ....+..|+.|.++|++|+|.+.+++.+. .|+++ ||+.. T Consensus 298 ---t~~e~~~ai~~G~lvl~~-d~~~v~I~~~VNTltt~~~~k~~~~~kI~vir~~D~i~~di~~~~~~~yiGKv~N~~d 373 (436) T protein:vir:78 298 ---TQIHLEEALKTGKFIFHK-VGDEVHVLEDINTFVSFTDEKNDDFSSNQSVRVLDQIANDIATLFNTKYLGEVPNDKS 373 (436) T ss_pred ---CHHHHHHHHhCCeEEEEE-eCCeEEEEEccccceecCCCCCcchhhhhHHHHHHHHHHHHHHHhhhccccccCCCHH Confidence 456778889999887764 46666554433 332 22335579999999999999999999876 59997 69999 Q ss_pred HHHHHHHHHHHHHHHHHhhccc-ceE---EEEecccCCHHHhhCCEEEEEEEEEeccCceEEEEEEEEc Q lcl|NC_021557. 343 TVEAAEEGVNAYLRSKTGIAIY-GGT---FRFDRQKNTAEQIADGKFYYRLECHPISVMERITIDSYVD 407 (419) Q Consensus 343 ~~~~i~~~i~~~L~~l~~~g~~-~~~---v~~d~~~n~~~~i~~G~~~~~v~~~p~~p~e~i~~~~~~~ 407 (419) -|..++..+..||.+|.+.|++ .|. ++.++. -....+++++.+.|+..||+|.++++.. T Consensus 374 gr~~l~~~i~~yl~~L~~~g~I~~f~~~Dv~v~~~------~~~~~v~v~~~v~pvdamekiy~ti~v~ 436 (436) T protein:vir:78 374 GRISFWNDVVKHHEQLQNMRAIEDFKADDVSVEPG------SDKKTVVVSDAVKVISAMSKLYMTVSVS 436 (436) T ss_pred HHHHHHHHHHHHHHHHHhCCcccCCCCcceEEeec------CCCCEEEEEEEEEEEEeeeeEEEEEEEC Confidence 9999999999999999987755 343 233221 1456788999999999999999999988 No 49 >protein:vir:102359 Length: 356 # NCBI annotation: XkdK protein # Family: family:all:632 # MgeID: mge:1566 # MgeName: phi CD119 # Cross-refs: genbank:acc:YP_529565;genbank:gi:90592650;genbank:GeneID:3974491 Probab=98.96 E-value=3.9e-09 Score=66.66 Aligned_cols=325 Identities=14% Similarity=0.115 Sum_probs=173.4 Q ss_pred CCCccCCCeEEEEcCCCccCccccCccceEEEEccccccccccccccccccCcceeecchHHHHHHhcccchhhhHHHHH Q lcl|NC_021557. 1 MAATFHHGPEVIEHKDGVTVVRDVKSAVTYVNGTAPIQDVHATALAREDYINKRVIIRSRAEGAAAFGVHKAGYTIPAAL 80 (419) Q Consensus 1 Ma~~~~hGVyv~e~~~~~~~i~~v~tav~~~Vgta~~a~~~~~~~~~~~~~n~pv~its~~e~~~~fg~~~~~~~l~~al 80 (419) || =.|+++++=..-....+....-++..+|-... . .....+++..+....+. ......+ T Consensus 1 ~~--glp~i~i~f~~~a~ta~~~g~rGiv~~il~d~---~-----------~~~~~~~~~~~v~~~~~-----~~n~~~i 59 (356) T protein:vir:10 1 MA--GLVNINIEFKELATSFIQRSKAGIVAIILKDT---T-----------KMYKELTSEDDIPISLS-----ADNKKYI 59 (356) T ss_pred CC--CCCceeEEEeecceeeccCCccceEEEEEecC---C-----------cceeEEeccccchhHHH-----HHHHHHH Confidence 88 46899998776666666655555555553211 0 11112233322222221 1122233 Q ss_pred HHHhhccCCcEEEEeeccccccccccccccccccccceeccccc-cccccccchhhhhhhhhccccccccccchhhhhhh Q lcl|NC_021557. 81 DAIFDQGDGGTIIVNNVFDPDVHKEGANPDPSKVTTVDINGTIS-PAGLASGFSGAYECYNNFGYFPKLIIAPGYSPAAA 159 (419) Q Consensus 81 ~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~t~~~~~g~~~-~~~~~tg~~a~~~~~~~~~~~p~~~~ap~~~~~~~ 159 (419) ...|..+........ +..+ +.+... .......+..++.. ..+.+..|+. .+. T Consensus 60 ~~~~~g~~~~~~~~~---------------p~~~----~~~~~~t~~~y~~aL~~le~~------~fn~l~~~~~--d~~ 112 (356) T protein:vir:10 60 KYGFVGATDNEKVLR---------------PSKV----IISTFTEDGKVEDILEELESV------EFNYLCMPEA--IEA 112 (356) T ss_pred HHHhhcccccccccc---------------ceee----eeecccCchhHHHHHHHhcCc------cceEEEecCC--ChH Confidence 333332211110000 0000 000000 11111122222211 1223344432 345 Q ss_pred HHHHHHHHhhcc-----ceeEEEEeccCCCHHHHHhhhhhccccccCccceEEecceeEeeccccccceeeechHHHHHH Q lcl|NC_021557. 160 VRAEMDVVASRL-----HALAIADLPLGLTKQQAVAARGVAGTANTSSARTVLTYPHVVIEDTTGATETRLDPLSSRLAG 234 (419) Q Consensus 160 v~a~l~~~~~~~-----~~~~i~d~p~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~~~~~~~~~~~~p~s~~vAg 234 (419) .++.+.+...++ ..+..+- + +.. .+....+-+..... .+ +...-..-..+.+|| T Consensus 113 ~~~~~~a~ikr~r~~~~~~~~~V~-~-~~~---------------aD~EgIInv~n~~~-~~---g~~~t~~~~~~~vAG 171 (356) T protein:vir:10 113 EKTKIVTWIKKIREEESTEAKAVL-A-NIK---------------ADNEAIINFTENVV-VD---GEEITAEKYTTRVAS 171 (356) T ss_pred HHHHHHHHHHHHHhcCCcEEEEEe-c-CCC---------------CCCceeEEeecCeE-ec---ceeechhHHHHHHHH Confidence 556555544322 1222221 1 110 11111111111111 11 000111233678999 Q ss_pred HHHhhhhccCceecccCceeeceeecceecccccCCcchhhccccCCceEEEEEecCCcEEEEec-cccC--CCCCcccc Q lcl|NC_021557. 235 VIIATDLNEGWQNSPSNREIKGVVDLEVPINFYPSDYQNDTNFLNEAGIVTAMRSFATGIRVFGN-RSAA--FPTSSHVE 311 (419) Q Consensus 235 ~~a~~D~~~g~~~span~~l~gv~~~~~~~~~~~~~~~~~~~~L~~~gI~~i~~~~~~G~~~wG~-rT~~--~~s~~~~~ 311 (419) ++|.+. .-+|+.|+.+.++.... ++ ...|.+..-++|--++.+. ++..++--+ .|+. ....+..| T Consensus 172 ~~Ag~~----~n~S~T~~~~~~~~~~~-~~------t~~e~~~ai~~G~lvl~~d-~~~V~I~~~VNSltt~t~~k~~~f 239 (356) T protein:vir:10 172 LIASTP----NTQSITYAPLDEVESIV-KI------DKASADAKVQAGELILRRL-SGKIRIARGINSLTTLTAEKGEIF 239 (356) T ss_pred HHhccc----hhccccceecCCccccc-cC------CHHHHHHHHhCCeEEEEEE-cCeEEEEecCccceecCCCCCcch Confidence 998886 66689999988754332 23 3456677778898877654 454544333 3332 12234569 Q ss_pred eeeehhhHHHHHHHHHHHHHH-HhhcC-CCCHHHHHHHHHHHHHHHHHHHhhccc--ceEEEEeccc------------- Q lcl|NC_021557. 312 NFIHARRILDMIHEAIIFYTM-NYVDR-LGSPMTVEAAEEGVNAYLRSKTGIAIY--GGTFRFDRQK------------- 374 (419) Q Consensus 312 ~~i~vrR~~~~i~~~~~~~~~-~~v~e-~n~~~~~~~i~~~i~~~L~~l~~~g~~--~~~v~~d~~~------------- 374 (419) +.|.+.|++|.|.+.++..+. .|+++ ||+..-|..+...+..||.+|.+.|++ .+.++.|.+. T Consensus 240 ~Kirvvr~~D~i~~Di~~~f~~~yiGKv~N~~dgr~~l~~ai~~y~~~L~~~~~I~~~~~~eid~e~q~~~~~~~g~d~~ 319 (356) T protein:vir:10 240 QKIKLVDTKDLISKDIKNIYVEKYLRKCPNTYDNKCLFIVAVQSYLTELAKQELIDSNFTVEIDLEKQKEYLEGKKIAVS 319 (356) T ss_pred hhhHHHHHHHHHHHHHHHHHhhccccccCCCHHHHHHHHHHHHHHHHHHHhCCccccCceeEecccchHHHhhhcccccc Confidence 999999999999999999986 69999 699999999999999999999998876 3556665532 Q ss_pred -CCHHHhh----CCEEEEEEEEEeccCceEEEEEEEE Q lcl|NC_021557. 375 -NTAEQIA----DGKFYYRLECHPISVMERITIDSYV 406 (419) Q Consensus 375 -n~~~~i~----~G~~~~~v~~~p~~p~e~i~~~~~~ 406 (419) ++...+. .-.+.+...+.|+-.||.|.+++.. T Consensus 320 ~~~d~~v~~~~~~~~v~~~~~v~~vdamE~iy~ti~v 356 (356) T protein:vir:10 320 KMKENEIKEANTGSNGFYLINLKLVDAMEDINIRVQM 356 (356) T ss_pred ccccceeecccCCcEEEEEEEEEEEeeeeeEEeEEeC Confidence 2222222 2457899999999999999999988 No 50 >protein:vir:276 Length: 369 # NCBI annotation: putative tail sheath protein # Family: family:all:669 # MgeID: mge:7 # MgeName: K139 # Cross-refs: genbank:acc:NP_536656;genbank:gi:17975134;genbank:GeneID:929090 Probab=98.92 E-value=1.4e-08 Score=63.66 Aligned_cols=345 Identities=12% Similarity=0.017 Sum_probs=196.4 Q ss_pred CCCccCCCeEEEEcCCCccCccccCccceEEEEccccccccccccccccccCcceeecchHHHHHHhcccchhhhHHHHH Q lcl|NC_021557. 1 MAATFHHGPEVIEHKDGVTVVRDVKSAVTYVNGTAPIQDVHATALAREDYINKRVIIRSRAEGAAAFGVHKAGYTIPAAL 80 (419) Q Consensus 1 Ma~~~~hGVyv~e~~~~~~~i~~v~tav~~~Vgta~~a~~~~~~~~~~~~~n~pv~its~~e~~~~fg~~~~~~~l~~al 80 (419) |+ .|=|.|...+.+--++..++- ...|||+..... -.++...+....+....+|.. ...|..-+ T Consensus 1 m~---~~~V~in~~n~~qg~~~~ver-~~lfig~g~~~~----------~~g~~~~~~~~sdld~~lg~~--ds~lk~~v 64 (369) T protein:vir:27 1 MA---WPTVIIKILNLMNGPIADIEC-HFLFVIRGTVSG----------EVRNLIMVDSTSDLDDVLAEA--SAEGLAIV 64 (369) T ss_pred CC---CCceEEecccccCCCcccccc-eEEEEEeccccc----------cccceEEecCccchHhhcCCc--ChhHHHHH Confidence 98 588999988888888887774 788996533211 123445566666777778774 35678888 Q ss_pred HHHhhccCCcEEEEeeccccccccccccccccccccceeccccccccccccchhhhhhhhhccccccccccchh-hhhhh Q lcl|NC_021557. 81 DAIFDQGDGGTIIVNNVFDPDVHKEGANPDPSKVTTVDINGTISPAGLASGFSGAYECYNNFGYFPKLIIAPGY-SPAAA 159 (419) Q Consensus 81 ~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~t~~~~~g~~~~~~~~tg~~a~~~~~~~~~~~p~~~~ap~~-~~~~~ 159 (419) .....++|+.......+.... .+ -..++..+... ..+..+..-+. +.... T Consensus 65 ~aa~~naG~~w~a~~~p~~~~-------~~--------------------~~~Av~~a~~~--~s~E~V~v~~p~t~~a~ 115 (369) T protein:vir:27 65 KAAQLNGKQAWTAGVMILSEE-------DN--------------------WQDAVKKANEV--SSFEFVVLGFDAETKAM 115 (369) T ss_pred HHHHhCCCCceEEEEEEeCCc-------hh--------------------HHHHHHhhhhh--CCccEEEEecCcccHHH Confidence 888888887654332211100 00 01111111110 01111111111 11122 Q ss_pred HHH---HHHHHhhc--cceeEEEEecc-C------CCHHHHHhhhhhccccccCccceEEecceeEeeccccccceeeec Q lcl|NC_021557. 160 VRA---EMDVVASR--LHALAIADLPL-G------LTKQQAVAARGVAGTANTSSARTVLTYPHVVIEDTTGATETRLDP 227 (419) Q Consensus 160 v~a---~l~~~~~~--~~~~~i~d~p~-~------~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~~~~~~~~~~~~p 227 (419) +.+ .......+ ...|+|+.++. + .+-.+..... ..-..++.+.+..++..++... . T Consensus 116 i~aaq~~a~el~~~~~R~vffi~e~~~~~~~~~~~e~w~dy~a~l-~al~~g~a~~~V~vv~~~~~~g-----------n 183 (369) T protein:vir:27 116 IEDAITLRTELKNSLGREVGVLCQLPAINNDPTNGQTWSEWLADT-VDIPKDVASEYISVVPNVHAAG-----------D 183 (369) T ss_pred HHHHHHHHHHHHHhcCCeEEEEEeccccCCCccccCCHHHHHHHH-HHHhhccCcccceeeeeecccc-----------c Confidence 222 22222223 35677777542 1 1112222211 1123456777777763332211 2 Q ss_pred hHHHHHHHHHhhhhccCceecccCceeeceeecc-eec-ccccCCcchhhccccCCceEEEEEecCC-cEEEEeccccCC Q lcl|NC_021557. 228 LSSRLAGVIIATDLNEGWQNSPSNREIKGVVDLE-VPI-NFYPSDYQNDTNFLNEAGIVTAMRSFAT-GIRVFGNRSAAF 304 (419) Q Consensus 228 ~s~~vAg~~a~~D~~~g~~~span~~l~gv~~~~-~~~-~~~~~~~~~~~~~L~~~gI~~i~~~~~~-G~~~wG~rT~~~ 304 (419) -.|.+||.+|.. ..-+..||.-..--.+.|+. .+. .............|..+|..+.+.++|. |+-+-..||++. T Consensus 184 ~~G~~aGRl~n~--aVsIadsp~RVktG~l~g~~~~p~d~~g~~l~~a~l~aLd~agysvp~~Y~gy~G~Yw~d~~tl~~ 261 (369) T protein:vir:27 184 TLGKYAGRLANK--EVSIADSPARVQTGSVLGNTELMKDKAGKALDLATLKALESNRIAVPMWYPDYPGQYWTTGRTLDV 261 (369) T ss_pred hHHHHHHHHHhc--ccchhcCcceeeecccccccccccCCCCcccCHHHHHHHHhCCCeEEEeeCCCCceEEeCceEecc Confidence 367778888652 22367788765432233322 111 1111123345567999999999998874 766666789887 Q ss_pred CCCcccceeeehhhHHHHHHHHHHHHHHHhhcCC---CCHHHHHHHHHHHHHHHHHHHhhcccceEEEEecccCCHHHh- Q lcl|NC_021557. 305 PTSSHVENFIHARRILDMIHEAIIFYTMNYVDRL---GSPMTVEAAEEGVNAYLRSKTGIAIYGGTFRFDRQKNTAEQI- 380 (419) Q Consensus 305 ~s~~~~~~~i~vrR~~~~i~~~~~~~~~~~v~e~---n~~~~~~~i~~~i~~~L~~l~~~g~~~~~v~~d~~~n~~~~i- 380 (419) +.+| ++||..+|+.|-+.|.++...-..+..+ .++.-.+..+..+..=|++|.+.+ +.|++.-.++ +|| T Consensus 262 ~gsD--Yq~iE~~RVvdKa~R~vR~~Ai~~i~Dr~lnstp~sia~~~~~~~~pLr~M~ks~-fpgei~~P~d----~dI~ 334 (369) T protein:vir:27 262 PGGD--YQDIRHIRVAMKAARKVRIRAIARIADRTLNSTPQSIAAAKLYFTQDLRTMALTG-VPGEIYPPED----EDIQ 334 (369) T ss_pred CCCC--eehhhhhhHHHHHHHHHHHHHHHHhcCcccccChhHHHHHHHHHhhHHHHHHhhc-CCeEEecCCC----CceE Confidence 7666 9999999999999988877666555443 345555666666666777775443 5555554332 133 Q ss_pred ----hCCEEEEEEEEEeccCceEEEEEEEEcchHH Q lcl|NC_021557. 381 ----ADGKFYYRLECHPISVMERITIDSYVDTKFI 411 (419) Q Consensus 381 ----~~G~~~~~v~~~p~~p~e~i~~~~~~~~~~~ 411 (419) ...++.+-+.+.|.---+.|+..+..|-.-+ T Consensus 335 i~w~~k~~V~I~~~vrP~~~pk~it~~I~ldl~~~ 369 (369) T protein:vir:27 335 IKWVNSTDVEIYMSVQPYECPVKITIAISVKQGDY 369 (369) T ss_pred EEeeccceEEEEEEEeeccCCceEEEEEEEeccCC Confidence 3457888888889888999999999986544 No 51 >protein:vir:3788 Length: 376 # NCBI annotation: tail sheath # Family: family:all:669 # MgeID: mge:328 # MgeName: HP2 # Cross-refs: genbank:acc:NP_536828;genbank:gi:17981837;genbank:GeneID:929216 Probab=98.85 E-value=1.8e-08 Score=63.05 Aligned_cols=349 Identities=11% Similarity=0.049 Sum_probs=193.7 Q ss_pred cCCCeEEEEcCCCccCccccCccceEEEEccccccccccccccccccCcceeecchHHHHHHhcccchhhhHHHHHHHHh Q lcl|NC_021557. 5 FHHGPEVIEHKDGVTVVRDVKSAVTYVNGTAPIQDVHATALAREDYINKRVIIRSRAEGAAAFGVHKAGYTIPAALDAIF 84 (419) Q Consensus 5 ~~hGVyv~e~~~~~~~i~~v~tav~~~Vgta~~a~~~~~~~~~~~~~n~pv~its~~e~~~~fg~~~~~~~l~~al~~~~ 84 (419) ..|=|.|...+.+--++..++- ...|||.+.+.. .+...+..-.+....+|.. ...|..-+.... T Consensus 1 ~~~~v~vn~~n~~~g~~~~~er-~~Lfig~~~~~~------------~~~~~~~~~sdld~~lg~~--~~~lk~~v~aa~ 65 (376) T protein:vir:37 1 MFPSVQINALNQLSGETKEIER-HALFVGVGTTNQ------------GKLLALTPDSDFDKVFGET--DTDLKKQVRAAM 65 (376) T ss_pred CCCeEEEecccccCCCcccccc-eEEeeccccccc------------cceeeecCccchHhhhCCC--chHHHHHHHHHH Confidence 4578999988888888888774 678999765432 2233445555666677764 377777888888 Q ss_pred hccCCcEEEEeeccccccccccccccccccccceeccccccccccccchhhhhhhhhcccccccc-ccchhhhhhhHHHH Q lcl|NC_021557. 85 DQGDGGTIIVNNVFDPDVHKEGANPDPSKVTTVDINGTISPAGLASGFSGAYECYNNFGYFPKLI-IAPGYSPAAAVRAE 163 (419) Q Consensus 85 ~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~t~~~~~g~~~~~~~~tg~~a~~~~~~~~~~~p~~~-~ap~~~~~~~v~a~ 163 (419) .|+|+.......+.... .. .-+.++..+..... +..+. .-|..+....+.++ T Consensus 66 ~naG~~~~~~~~~~~~~---------~~-----------------~~~~Av~~a~~~~s-~E~V~v~~pv~t~~a~i~aa 118 (376) T protein:vir:37 66 LNAGQNWFAHVYIAQED---------GY-----------------DFVECVKKANQTAS-FEYCVNTRYLGVDKASIGKL 118 (376) T ss_pred hCCCCcEEEEEEeecCC---------ch-----------------HHHHHHHHhhhhcC-ceEEEEeccccccHHHHHHH Confidence 88887654333221100 00 00111111111110 01111 11111112222222 Q ss_pred H---HHHhh--ccceeEEEEecc-C------CCHHHHHhhhhhccccccCccceEEecceeEeeccccccceeeechHHH Q lcl|NC_021557. 164 M---DVVAS--RLHALAIADLPL-G------LTKQQAVAARGVAGTANTSSARTVLTYPHVVIEDTTGATETRLDPLSSR 231 (419) Q Consensus 164 l---~~~~~--~~~~~~i~d~p~-~------~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~~~~~~~~~~~~p~s~~ 231 (419) . ..... +.-.|+|+..+. + .+-.+....+. .-..++.+.+..++.- . + + -.-|. T Consensus 119 ~~~a~el~~~~~Rpv~file~r~~~~~~~~~e~w~~y~~~~~-al~~gia~~~V~~V~~-~--~----g------n~~G~ 184 (376) T protein:vir:37 119 QECYAELLAKFGRRTFFIQAVQGINHDQSDGETWDQYVQKLT-TLQQTIVADHVCLVPL-L--F----G------NETGV 184 (376) T ss_pred HHHHHHHHHhcCCeEEEEEeccCcCcccccccCHHHHHHHHH-Hhhcccccccceeeee-e--h----h------hhHHH Confidence 1 12222 245678888762 1 11122222221 1123455555443321 0 0 0 23677 Q ss_pred HHHHHHhhhhccCceecccCce---eeceeecceeccc-ccCCcchhhccccCCceEEEEEecCC-cEEEEeccccCCCC Q lcl|NC_021557. 232 LAGVIIATDLNEGWQNSPSNRE---IKGVVDLEVPINF-YPSDYQNDTNFLNEAGIVTAMRSFAT-GIRVFGNRSAAFPT 306 (419) Q Consensus 232 vAg~~a~~D~~~g~~~span~~---l~gv~~~~~~~~~-~~~~~~~~~~~L~~~gI~~i~~~~~~-G~~~wG~rT~~~~s 306 (419) +||.+|+. ..-++.||.... |.|+.....+.+. .....+.....|.++|..+.+.++|. |+-+-..|||+.+. T Consensus 185 ~aGRl~~a--aVsVadspgRV~tG~l~gl~~~~lp~d~~~~~l~~a~l~aLd~agy~vp~~Y~gy~G~Y~~d~~tl~~~g 262 (376) T protein:vir:37 185 LAGRLANR--AVTVADSPARVQTGALVSLGSANKPLDKDRNELTLAHLKSLETARYSVPMWYPDYDGYYWADGRTLDVEG 262 (376) T ss_pred HHHHHhhc--ccchhhCccceeccccccccccccccCcCcccCCHHHHHHHHhCCCeEEEeeCCCCceEEeCceEeccCC Confidence 88887644 223677887654 3333222222211 11223455667999999999998875 76666678888776 Q ss_pred CcccceeeehhhHHHHHHHHHHHHHHHhhcCCC---CHHHHHHHHHHHHHHHHHHHhhccc-----ceEEEEecccCC-H Q lcl|NC_021557. 307 SSHVENFIHARRILDMIHEAIIFYTMNYVDRLG---SPMTVEAAEEGVNAYLRSKTGIAIY-----GGTFRFDRQKNT-A 377 (419) Q Consensus 307 ~~~~~~~i~vrR~~~~i~~~~~~~~~~~v~e~n---~~~~~~~i~~~i~~~L~~l~~~g~~-----~~~v~~d~~~n~-~ 377 (419) +| ++||..+|+.|-+.|.++...-..+...- ++.-.+..+.-+..-|++|.+.+.+ .|+|.-.++.+. . T Consensus 263 sD--Y~~ie~~RVvdKa~R~vR~~ai~~i~D~~lnst~~sia~~~~yi~~pLr~M~~s~~i~g~~fpGeI~~p~d~Di~i 340 (376) T protein:vir:37 263 GD--YQVIENLRVVDKVARKVRLLAIGKIADRSFNSTTSSTEYHKNYFAKPLRDMSKSATINGKDFPGECMPPKDDAITI 340 (376) T ss_pred CC--hhhhhhhhHHHHHHHHHHHHHHHHhCCcccCcchhhHHHHHHHHHHHHHHHHhcchhccccccceeecCCCCCceE Confidence 66 99999999999999888877666654422 3444555566566668888664422 244554443211 1 Q ss_pred HHhhCCEEEEEEEEEeccCceEEEEEEEEcchHHHH Q lcl|NC_021557. 378 EQIADGKFYYRLECHPISVMERITIDSYVDTKFISN 413 (419) Q Consensus 378 ~~i~~G~~~~~v~~~p~~p~e~i~~~~~~~~~~~~~ 413 (419) .-+...++.+.+.+.|.--.++|+..+..|-.-.-+ T Consensus 341 ~w~s~~~V~I~~~v~P~~~pk~Itv~I~Ldlsn~~~ 376 (376) T protein:vir:37 341 VWQSKTKVTIYIKVRPYDCPKEITANIFLDLDSLGE 376 (376) T ss_pred EeeccceEEEEEEEEeccCCceEEEEEEeecCCCCC Confidence 224678899999999999999999887776442222 No 52 >protein:vir:78782 Length: 370 # NCBI annotation: putative tail sheath protein # Family: family:all:669 # MgeID: mge:1857 # MgeName: phiO18P # Cross-refs: genbank:acc:YP_001285652;genbank:gi:148727158;genbank:GeneID:5220132 Probab=98.79 E-value=1.6e-08 Score=63.34 Aligned_cols=348 Identities=13% Similarity=0.071 Sum_probs=187.7 Q ss_pred cCCCeEEEEcCCCccCccccCccceEEEEccccccccccccccccccCcceeecchHHHHHHhcccchhhhHHHHHHHHh Q lcl|NC_021557. 5 FHHGPEVIEHKDGVTVVRDVKSAVTYVNGTAPIQDVHATALAREDYINKRVIIRSRAEGAAAFGVHKAGYTIPAALDAIF 84 (419) Q Consensus 5 ~~hGVyv~e~~~~~~~i~~v~tav~~~Vgta~~a~~~~~~~~~~~~~n~pv~its~~e~~~~fg~~~~~~~l~~al~~~~ 84 (419) ..|=|.|...+.+-.++..++- ...|||++.... .+...+..-.++...+|.. ...|..-+.... T Consensus 1 ~~~~v~vn~~n~~~g~~~~~er-~~lfig~~~~~~------------g~~~~~~~~sdld~~l~~~--ds~lk~~v~aa~ 65 (370) T protein:vir:78 1 MWPYVQIYNLNQMQGPVTEVER-HLLFIGSAASNT------------GKLLSLNAQSDFDQLLGAA--DSELKANLLAAR 65 (370) T ss_pred CCceEEEeeccccCCCcCccce-eEEEEecccccc------------cceEeecCccCHHHhcCCc--ChhHHHHHHHHH Confidence 4588999999988888888874 688999876432 2233455556667777764 366777788888 Q ss_pred hccCCcEEEEeeccccccccccccccccccccceeccccccccccccchhhhhhhhhccccccccccchhhhhhhHHHHH Q lcl|NC_021557. 85 DQGDGGTIIVNNVFDPDVHKEGANPDPSKVTTVDINGTISPAGLASGFSGAYECYNNFGYFPKLIIAPGYSPAAAVRAEM 164 (419) Q Consensus 85 ~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~t~~~~~g~~~~~~~~tg~~a~~~~~~~~~~~p~~~~ap~~~~~~~v~a~l 164 (419) .++|+.......+.... . .-+.++..+... ..+..+..-+-....+...++ T Consensus 66 ~naG~~~~~~~~p~~~~------------------------~---d~~~Av~~a~~~--~s~E~V~v~~~~s~~a~~~a~ 116 (370) T protein:vir:78 66 DNAGQNWSAAAYVLPTD------------------------K---PWLDAARDAQQT--QSFEGVVVLGQEWHQAAINAA 116 (370) T ss_pred hCCCCceEEEEEEecCc------------------------h---hHHHHHHHHHhh--CCccEEEEecCcchHHHHHHH Confidence 88887654333221100 0 112222222111 111111111211111222223 Q ss_pred HHH----hhc--cceeEEEEeccCCCHH---HHHhhhhhccccccCccceEEecceeEeeccccccceeeechHHHHHHH Q lcl|NC_021557. 165 DVV----ASR--LHALAIADLPLGLTKQ---QAVAARGVAGTANTSSARTVLTYPHVVIEDTTGATETRLDPLSSRLAGV 235 (419) Q Consensus 165 ~~~----~~~--~~~~~i~d~p~~~~~~---~~~~~~~~~~~~~~~s~~~~~~~p~~~~~~~~~~~~~~~~p~s~~vAg~ 235 (419) .+. ..+ ...|+++.++.-...+ +..... ..-..++.+.+..++-.|+. -.-|.+||. T Consensus 117 ~~~a~el~n~~~Rpv~file~~~~~~~e~w~~y~~~l-~al~~gia~~~V~vvp~~~g-------------~~~G~~aGR 182 (370) T protein:vir:78 117 HALNQELIAKWGRWQFMLLAVPAIADEQDWATYEAEL-ATLQDGIAASSVSLIPQLWP-------------TLAGAYAGR 182 (370) T ss_pred HHHHHHHHHhcCCeEEEEEeecCCCCcCCHHHHHHHH-HHhhhccccccceEEeeecc-------------ccHHHHHHH Confidence 222 222 3567777776422222 222211 11224466666666544321 113667887 Q ss_pred HHhhhhccCceecccCceeeceeecc-eec-ccccCCcchhhccccCCceEEEEEecCC-cEEEEeccccCCCCCcccce Q lcl|NC_021557. 236 IIATDLNEGWQNSPSNREIKGVVDLE-VPI-NFYPSDYQNDTNFLNEAGIVTAMRSFAT-GIRVFGNRSAAFPTSSHVEN 312 (419) Q Consensus 236 ~a~~D~~~g~~~span~~l~gv~~~~-~~~-~~~~~~~~~~~~~L~~~gI~~i~~~~~~-G~~~wG~rT~~~~s~~~~~~ 312 (419) +|.. .--+..+|.-...--+.++. .++ .............|..+|..+.+.++|. |+-+-..|||+.+.+| ++ T Consensus 183 L~na--avsVadsP~Rv~tG~l~gl~~~p~d~~~~~l~~a~l~aLd~agy~vp~~Y~gy~G~Y~~d~~tl~~~gsD--Yq 258 (370) T protein:vir:78 183 LCNR--AVSIADSPCRVKTGALVGLGNKPVGKDGIPLPLATLQTLEANRYSVPMWYPDYDGIYWADGRTLDAEGGD--YQ 258 (370) T ss_pred HhcC--eeeecccceeeeccccccccccccccCCcccCHHHHHHHHhCCCeEEEeeCCCCceEEeCceEeccCCCC--hh Confidence 6542 22267788754322222211 111 1122223455678999999999998874 7666667888877666 99 Q ss_pred eeehhhHHHHHHHHHHHH-HHHhhcCCCCHHHHHHHHHHHHHH---HHHHHhhcccc-----eEEEEeccc-CCHHHhhC Q lcl|NC_021557. 313 FIHARRILDMIHEAIIFY-TMNYVDRLGSPMTVEAAEEGVNAY---LRSKTGIAIYG-----GTFRFDRQK-NTAEQIAD 382 (419) Q Consensus 313 ~i~vrR~~~~i~~~~~~~-~~~~v~e~n~~~~~~~i~~~i~~~---L~~l~~~g~~~-----~~v~~d~~~-n~~~~i~~ 382 (419) ||..+|+.|-+.|.++.. +....++-.++.. ..+......| |+++.+.+-+. +++.-.++. =+..-+.. T Consensus 259 ~ie~~RVvdKa~R~vR~~ai~~i~D~~lnst~-gsia~~~~~~~~~L~ema~s~~i~~~~fpgeI~~p~d~Di~i~w~s~ 337 (370) T protein:vir:78 259 VIENLRIAYKVARRMRLRAIARIGDRSFNSTP-GSTAAAITYFGKDLREMAKSTTINGQPFPGDIASPQDGDIRIQWVAK 337 (370) T ss_pred hhhhhhHHHHHHHHHHHHHHHHhCCcccCCCC-cchhHHHHHHHhhHHHHHhhhhhcccccceeEeccCCCcceEEeecc Confidence 999999999999999844 4444554333221 3333444444 44443333222 233322211 01112367 Q ss_pred CEEEEEEEEEeccCceEEEEEEEEcchHHHHHHH Q lcl|NC_021557. 383 GKFYYRLECHPISVMERITIDSYVDTKFISNALS 416 (419) Q Consensus 383 G~~~~~v~~~p~~p~e~i~~~~~~~~~~~~~~~~ 416 (419) +++.+.+.+.|.--.+.|+..+..|... ++=-+ T Consensus 338 ~~v~I~~~v~P~~~pk~Itv~I~LDls~-e~~~~ 370 (370) T protein:vir:78 338 NLVSVFVVVRTVDCPKGITVNIMLDLSL-NNGEG 370 (370) T ss_pred ceEEEEEEEEeccCCceEEEEEEEeecc-ccCCC Confidence 8899999999999999999999887542 22111 No 53 >protein:vir:95263 Length: 450 # NCBI annotation: Phage conserved protein # Family: family:all:396 # MgeID: mge:1561 # MgeName: Felix 01 # Cross-refs: genbank:acc:NP_944896;genbank:gi:38707836;genbank:GeneID:2744049 Probab=98.67 E-value=1.3e-07 Score=58.36 Aligned_cols=371 Identities=11% Similarity=-0.013 Sum_probs=190.0 Q ss_pred CCCccCCCeEEEEcCCCccCccccCccceEEEEccccccccccccccccccCcceeecchHHHHHHhcccchhhhHHHHH Q lcl|NC_021557. 1 MAATFHHGPEVIEHKDGVTVVRDVKSAVTYVNGTAPIQDVHATALAREDYINKRVIIRSRAEGAAAFGVHKAGYTIPAAL 80 (419) Q Consensus 1 Ma~~~~hGVyv~e~~~~~~~i~~v~tav~~~Vgta~~a~~~~~~~~~~~~~n~pv~its~~e~~~~fg~~~~~~~l~~al 80 (419) |=+. =|.|. ++-.+.++...+...+.|+|.... +.......++..+....||..... +.+. T Consensus 1 ~~s~---iVnV~-i~~~~~a~~~~~f~~~l~~~~~~~------------~~~r~~~yss~~~V~~~FG~~S~e---y~aA 61 (450) T protein:vir:95 1 MWNP---IVNVD-ITLNTAGTTREGFGLPLFLASTDN------------FEERVRGYTSLTEVAEDFDENTAA---YKAA 61 (450) T ss_pred CCCc---eEEEe-ecccccccccccceeEEEEcCCCC------------CccceeeecCHHHHHHhcCCCcHH---HHHH Confidence 6543 24444 344566677778888888885321 123345567888888999986443 4455 Q ss_pred HHHhhccCCcEEEE--eecccccccccc--------------c-------cccccccc---------cceeccc------ Q lcl|NC_021557. 81 DAIFDQGDGGTIIV--NNVFDPDVHKEG--------------A-------NPDPSKVT---------TVDINGT------ 122 (419) Q Consensus 81 ~~~~~~~~~~~~v~--~~~~~~~~~~~~--------------~-------~~~~~~~t---------~~~~~g~------ 122 (419) ..+|.+......++ +........... . ..+.+..+ ...+.+. T Consensus 62 ~~yF~q~p~p~~l~igr~~~~~t~~~~~~~~~~~~g~lt~tv~G~~~~~~~i~~s~a~s~~~va~~~~tai~~~~~~~~~ 141 (450) T protein:vir:95 62 KQLWSQTPKVTQLYIGRRAMQYTVSIPDAVTESTDYSITVAAGGGISQPYQYTAQSSDTAENVLQQFKTQIEADPTIKDK 141 (450) T ss_pred HHHHhCCCcccEEEEEeeccchhhhhhhhhccccceeEEEEecceeeeeeEEEEEecCChhhHHHHhhhhhcccceeeee Confidence 66666543332221 111110000000 0 00000000 0000000 Q ss_pred ----cccccccccchhhhhhhhhcc-c--cccccccchhhhhhhHHHHHHHHhhc-cceeEEEEeccCCCHHHHHhhhhh Q lcl|NC_021557. 123 ----ISPAGLASGFSGAYECYNNFG-Y--FPKLIIAPGYSPAAAVRAEMDVVASR-LHALAIADLPLGLTKQQAVAARGV 194 (419) Q Consensus 123 ----~~~~~~~tg~~a~~~~~~~~~-~--~p~~~~ap~~~~~~~v~a~l~~~~~~-~~~~~i~d~p~~~~~~~~~~~~~~ 194 (419) ....+....+........... + ........+.. ......+|..+... .+++.+. .+ ..+.++..+...+ T Consensus 142 ~~~~s~g~~~~~t~~~~~~~~~~~~~l~~~~~~~~~~g~~-aet~~~a~~a~~~~~~~w~~~~-~~-~~~~~~i~a~a~w 218 (450) T protein:vir:95 142 VSVNVTGSNGSATMIIAKAGDNDFVKVTTTAQTVYIASTT-ADTASTALAAIEAYSTDWYFIA-AE-DRTQQFVLAMASE 218 (450) T ss_pred eeeeeecccceeeeeeeccccchhhccccccceeEecccc-cccHHHHHHHHHHhhCCeEEEE-ec-CCCHHHHHHHHHH Confidence 000000000000000000000 0 01111111111 12344566666543 3454443 33 3444444443332 Q ss_pred ccccccCccceEEeccee-Eeeccc-------------cc--cce-eee-------chHHHHHHHHHhhhhccCceeccc Q lcl|NC_021557. 195 AGTANTSSARTVLTYPHV-VIEDTT-------------GA--TET-RLD-------PLSSRLAGVIIATDLNEGWQNSPS 250 (419) Q Consensus 195 ~~~~~~~s~~~~~~~p~~-~~~~~~-------------~~--~~~-~~~-------p~s~~vAg~~a~~D~~~g~~~spa 250 (419) ....+ +...+..|- ...+.. .+ .++ .++ .|.+.++|.....+.-+ ..-. T Consensus 219 ~~a~~----~~f~~~~~~~~~~~~~~~~~~~~i~~~l~~~~~~~t~~~y~~~~~~~~~~aa~~g~~~~~~~g~---~T~~ 291 (450) T protein:vir:95 219 IQARK----KIFFTANSDVTALQGTELASANDVPAQLAKNMYTRTVCLWHHAAAEDYPEMAYIAYGAPYDAGS---IAWG 291 (450) T ss_pred HhhcC----cEEEEEcCCchhhhhhhhhcccchHHHHHhccCCeeEEEeeCCCchhHHHHHHHHHhhhcccce---eeec Confidence 22111 222222211 110000 00 001 111 23444444433332222 2233 Q ss_pred CceeeceeecceecccccCCcchhhccccCCceEEEEEecCCcEEEEeccccCCCCCcccceeeehhhHHHHHHHHHHHH Q lcl|NC_021557. 251 NREIKGVVDLEVPINFYPSDYQNDTNFLNEAGIVTAMRSFATGIRVFGNRSAAFPTSSHVENFIHARRILDMIHEAIIFY 330 (419) Q Consensus 251 n~~l~gv~~~~~~~~~~~~~~~~~~~~L~~~gI~~i~~~~~~G~~~wG~rT~~~~s~~~~~~~i~vrR~~~~i~~~~~~~ 330 (419) +|.+.||..-... ......+..|.+.|.++++|++.++-+.+ .++..+|+.+ .||-++|-.+|++..|+.. T Consensus 292 fk~l~Gv~~~v~~-~~~~~lt~~~~~al~~~~~n~y~~~~~~~-~~~~G~~~~G-------~~iD~~~~~~wl~~~iq~~ 362 (450) T protein:vir:95 292 NAQLTGVAASLQP-SNQRPLTSIQKSALDVRHCNFIDLDGGVP-VVRRGITSGG-------EWIDIIRGVDWLESDLKTS 362 (450) T ss_pred cccccceeeeccC-ccccccchHHHHHHHhCCcEEEEEecCce-eeeCCeeeCc-------chhHHHHHHHHHHHHHHHH Confidence 6777777643322 12233456788999999999999886665 4788888754 2588999999999999999 Q ss_pred HHHhh--c---C-CCCHHHHHHHHHHHHHHHHHHHhhccc-ceEEEEec-ccCCHHHhhCCEEE-EEEEEEeccCceEEE Q lcl|NC_021557. 331 TMNYV--D---R-LGSPMTVEAAEEGVNAYLRSKTGIAIY-GGTFRFDR-QKNTAEQIADGKFY-YRLECHPISVMERIT 401 (419) Q Consensus 331 ~~~~v--~---e-~n~~~~~~~i~~~i~~~L~~l~~~g~~-~~~v~~d~-~~n~~~~i~~G~~~-~~v~~~p~~p~e~i~ 401 (419) +...+ - + |.+..-...|+..|+.-|++..++|++ +|.|...+ +..++.|+.++++. +.+.+.....++++. T Consensus 363 l~~ll~~~~~~KiPy~~~G~~~i~a~i~~~l~~a~~~G~Ia~~~V~~~~~~~~~~~dr~~R~~~~i~~~~~laGAIh~~~ 442 (450) T protein:vir:95 363 LRDLLINQKGGKITYDDTGITRIRQVIETSLQRAVNRNFLSSYTVNVPKASQVALADKKARILKDVTFAGILAGAILDVD 442 (450) T ss_pred HHHHHHhcCCCCCccChhhHHHHHHHHHHHHHHHHhcCcccceeEecCChHhcCHHHHhccCCCCeeEEEEEccceEEEE Confidence 88766 2 2 778888889999999999998888866 56677654 78888998888755 777778888898877 Q ss_pred EEEEEcch Q lcl|NC_021557. 402 IDSYVDTK 409 (419) Q Consensus 402 ~~~~~~~~ 409 (419) ++....=+ T Consensus 443 i~~~v~~~ 450 (450) T protein:vir:95 443 LKGTVAYE 450 (450) T ss_pred EEEEEEeC Confidence 76655544 No 54 >protein:vir:4517 Length: 498 # NCBI annotation: tail sheath protein # Family: family:all:369 # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599043;genbank:gi:19549001;genbank:GeneID:935236 Probab=98.64 E-value=1.5e-07 Score=57.95 Aligned_cols=367 Identities=11% Similarity=0.046 Sum_probs=166.3 Q ss_pred CC-------Ccc-CCCeEEEEcCCCccCccccCccceEEEEccccccccccccccccccCcceeecchHHHHHHhcccch Q lcl|NC_021557. 1 MA-------ATF-HHGPEVIEHKDGVTVVRDVKSAVTYVNGTAPIQDVHATALAREDYINKRVIIRSRAEGAAAFGVHKA 72 (419) Q Consensus 1 Ma-------~~~-~hGVyv~e~~~~~~~i~~v~tav~~~Vgta~~a~~~~~~~~~~~~~n~pv~its~~e~~~~fg~~~~ 72 (419) |. ..+ .||+|+|-.++...+.... .-..+||-.-. ....+.++|++++|..++...||.. T Consensus 1 M~IsF~~IP~~iRvP~~y~E~dns~A~~~~~~--q~vLiiGq~la--------~gs~~~~~~v~v~s~~~a~~lfG~G-- 68 (498) T protein:vir:45 1 MTISFNTIPSNTLVPLFYAEMDNQAANTAQDS--GASLLIGHANN--------GAEIVANSLVLMPSADYARQICGAG-- 68 (498) T ss_pred CCCchhhcCcccccCeEEEEEeCCCCCCCCCC--cceEEEEecCC--------ccccccceeEEecCHHHHHHhcCcC-- Confidence 44 444 5999999877777555433 35677774321 1122358999999999999999975 Q ss_pred hhhHHHHHHHHhhccCCcEEEEeeccccc-------cccc----------------------ccccccc----------- Q lcl|NC_021557. 73 GYTIPAALDAIFDQGDGGTIIVNNVFDPD-------VHKE----------------------GANPDPS----------- 112 (419) Q Consensus 73 ~~~l~~al~~~~~~~~~~~~v~~~~~~~~-------~~~~----------------------~~~~~~~----------- 112 (419) +-|..-...+.++.....+.+....+.. .+.. ..+..++ T Consensus 69 -Sml~~M~~a~~~~n~~~~l~~i~~~d~aG~aA~g~it~tg~at~~G~l~l~Igg~~v~v~V~~gdTaa~vA~al~aain 147 (498) T protein:vir:45 69 -SQLARMVEAYRQTDPFGELYVIAVPEATGAAATVTLTVTGEATESGTVNVYVGRTRVQAPVTNGDNVTTIASSIQDAIN 147 (498) T ss_pred -cHHHHHHHHHHHhCCcceEEEEeeCCcccceeEEEEEeecccCCCcEEEEEECCEEEEEEecCCCCHHHHHHHHHHHHh Confidence 2233333333322222211111101000 0000 0000000 Q ss_pred -----ccccceeccccccccccccchh-----hh--------hhhhhccccccccccchhhhhhhHHHHHHHHhhcc--- Q lcl|NC_021557. 113 -----KVTTVDINGTISPAGLASGFSG-----AY--------ECYNNFGYFPKLIIAPGYSPAAAVRAEMDVVASRL--- 171 (419) Q Consensus 113 -----~~t~~~~~g~~~~~~~~tg~~a-----~~--------~~~~~~~~~p~~~~ap~~~~~~~v~a~l~~~~~~~--- 171 (419) .++.....+...-+...+|... .. +..+ -++...+....+....+.+..+|++..... T Consensus 148 a~~~lPVTA~~~~~~VtlTAr~kG~~GN~I~l~~~~~~~~~ge~~p-~Glt~~itamagGag~PD~a~alaal~~~~~~~ 226 (498) T protein:vir:45 148 AVPTLPFTASSSAGVVTLTARHKGLCGNEIPVSLNYYGFGGGEVLP-AGVQIAVATGTAGTGAPVLTGAVAAMADEPFDY 226 (498) T ss_pred CCCCCceEEEecCceEEEEeeccCccccceeEEEeecccccccccc-ceeeEEEEccCCCccCchhHHHHHHhccCCccE Confidence 0011100111111111111110 00 0000 011111111222333334444444443222 Q ss_pred -----------------------------ceeEEEEeccCCCHHHHHhhhhhccccccCccceEEecceeEeeccccccc Q lcl|NC_021557. 172 -----------------------------HALAIADLPLGLTKQQAVAARGVAGTANTSSARTVLTYPHVVIEDTTGATE 222 (419) Q Consensus 172 -----------------------------~~~~i~d~p~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~~~~~~~~ 222 (419) .+.++.-.+..-+..+...+- ...++.|..+.+... . T Consensus 227 I~~p~~D~asL~al~~~L~~~sgRw~~~~q~~g~~~~a~~gT~~~l~t~g-----~~~N~~~it~~~~~~-------~-- 292 (498) T protein:vir:45 227 IGLPFNDTASVNTLVTEMNDTSGRWSYARQLYGHVYTAKTGTLSELVNAG-----DQFNQQHITLAGYEK-------E-- 292 (498) T ss_pred EEEeeCCHHHHHHHHHHHhhhhhhhhHHhhcCeEEEEeccCCHHHHHHhh-----hccCCceEEEEecCC-------C-- Confidence 111222111222233333322 123344444432100 0 Q ss_pred eeeech---HHHHHHHHH---hhhhccCceecccCceeeceeecceecccccCCcchhhccccCCceEEEEEecCCc-EE Q lcl|NC_021557. 223 TRLDPL---SSRLAGVII---ATDLNEGWQNSPSNREIKGVVDLEVPINFYPSDYQNDTNFLNEAGIVTAMRSFATG-IR 295 (419) Q Consensus 223 ~~~~p~---s~~vAg~~a---~~D~~~g~~~span~~l~gv~~~~~~~~~~~~~~~~~~~~L~~~gI~~i~~~~~~G-~~ 295 (419) ..-|| ++.+||+.+ +.| |-..--...|.|+..+.....+ ...|.|.|..+||.++.-. .| .. T Consensus 293 -~~sp~~~~AAa~aa~~A~~l~~D----PArPL~tl~L~Gi~~p~~~~r~----~~~ern~LL~~Gist~~V~--~G~V~ 361 (498) T protein:vir:45 293 -TQTPADELAASRTARAAVFIRND----PARPTQTGELVGMLPAPKGKRF----TMTEQQTLLSHGVATAYVE--SGVLR 361 (498) T ss_pred -CCChHHHHHHHHHHHHHHHhhcc----cccccCceeecceecCCchhcC----ChHHHHHHHhCCcceEEEc--CCeEE Confidence 01133 233333444 344 4332234568888866543333 3567788999999998643 44 33 Q ss_pred EEecccc----CCCCCcccceeeehhhHHHHHHHHHHHHHHHh-hcCCCCHH-----------HHHHHHHHHHHHHHHHH Q lcl|NC_021557. 296 VFGNRSA----AFPTSSHVENFIHARRILDMIHEAIIFYTMNY-VDRLGSPM-----------TVEAAEEGVNAYLRSKT 359 (419) Q Consensus 296 ~wG~rT~----~~~s~~~~~~~i~vrR~~~~i~~~~~~~~~~~-v~e~n~~~-----------~~~~i~~~i~~~L~~l~ 359 (419) +--..|. .....|+.|..|+..|+.+|+++.++..+... --+..... |-..|+..+-.-++.|. T Consensus 362 I~R~ITTY~~n~~G~~D~syLDi~T~~tl~yvr~~~r~~i~~kfpR~KLa~dg~~~~~gq~IvTp~~ir~ell~~y~~le 441 (498) T protein:vir:45 362 IQRDVTTYRKNAYGVADNSYLDSETLHTSAYVLRKLKSVITSKYGRHKLASDGTRFGPGQAIVTPAVIKGELLATYRQLE 441 (498) T ss_pred EEeeeeeeeecCCCCcchhhhhhhhHHHHHHHHHHHHHHhhhhcCCeeecccCcccCCCCcccchHHHHHHHHHHHHhhh Confidence 3333332 12245788999999999999999999887643 22222222 56788888888888888 Q ss_pred hhccc----ceE--EEEecccCCHHHhhCCEEEEEEEEEeccCc----eEEEEEEEEcchHH Q lcl|NC_021557. 360 GIAIY----GGT--FRFDRQKNTAEQIADGKFYYRLECHPISVM----ERITIDSYVDTKFI 411 (419) Q Consensus 360 ~~g~~----~~~--v~~d~~~n~~~~i~~G~~~~~v~~~p~~p~----e~i~~~~~~~~~~~ 411 (419) ..|++ .|+ ..+.++.+ +..|+.+.+-...+-+. -.|.|+++++...- T Consensus 442 ~~givEn~~~~~~~LiVerd~~-----dpnRln~~~p~d~vn~L~V~A~~~~f~lq~~~~~~ 498 (498) T protein:vir:45 442 RAGIVENYELFKQYLVVERDAS-----VPNRLNTLFPPDYVNQLRVFAVVNQFRLQYSEESA 498 (498) T ss_pred hhccccChhhhcceeEEEECCC-----CCcEEEEEecccccCchhhhhhhhhhheehhhcCC Confidence 77754 232 33333322 12334443333333332 23566666654422 No 55 >protein:vir:3751 Length: 376 # NCBI annotation: orf23 # Family: family:all:669 # MgeID: mge:79 # MgeName: HP1 # Cross-refs: genbank:acc:NP_043492;genbank:gi:9628627;genbank:GeneID:1261129 Probab=98.64 E-value=1.6e-07 Score=57.88 Aligned_cols=344 Identities=12% Similarity=0.085 Sum_probs=198.3 Q ss_pred cCCCeEEEEcCCCccCccccCccceEEEEccccccccccccccccccCcceeecchHHHHHHhcccchhhhHHHHHHHHh Q lcl|NC_021557. 5 FHHGPEVIEHKDGVTVVRDVKSAVTYVNGTAPIQDVHATALAREDYINKRVIIRSRAEGAAAFGVHKAGYTIPAALDAIF 84 (419) Q Consensus 5 ~~hGVyv~e~~~~~~~i~~v~tav~~~Vgta~~a~~~~~~~~~~~~~n~pv~its~~e~~~~fg~~~~~~~l~~al~~~~ 84 (419) ..|=|.|...+.+--++..++- .-.|||.+.+.. .+...+..-.++...||.. ...|..-+.... T Consensus 1 ~~~~v~vn~ln~~qg~~~~ver-~~lfig~~~~~~------------~~~~~~~~~sdld~~lg~~--ds~lk~~v~aa~ 65 (376) T protein:vir:37 1 MFPSVQINALNQLSGETKEIER-HALFVGVGTTNQ------------GKLLALTPDSDFDKVFGET--DTDLKKQVRAAM 65 (376) T ss_pred CCCeEEEeeeeccCCCcccccc-eEEEeecccccc------------CceEEecCCCChHHhhCCC--chhHHHHHHHHH Confidence 4578999988888888877764 578999866432 2334455566777778773 477778888888 Q ss_pred hccCCcEEEEeeccccccccccccccccccccceeccccccccccccchhhhhhhhhccccccccccch--hhhhhhHHH Q lcl|NC_021557. 85 DQGDGGTIIVNNVFDPDVHKEGANPDPSKVTTVDINGTISPAGLASGFSGAYECYNNFGYFPKLIIAPG--YSPAAAVRA 162 (419) Q Consensus 85 ~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~t~~~~~g~~~~~~~~tg~~a~~~~~~~~~~~p~~~~ap~--~~~~~~v~a 162 (419) .|+|+...........+ .. .-+.++..+... ..+..+..-+ ......+.+ T Consensus 66 ~naG~~w~a~~~~p~~~-----------------------~~---~~~~Av~~a~~~--~s~E~V~v~~p~~t~~a~i~a 117 (376) T protein:vir:37 66 LNAGQNWFAHVYIAQED-----------------------GY---DFVECVKKANQT--ASFEYCVNTRYLGVDKASIGK 117 (376) T ss_pred hCCCCceEEEEEecCCC-----------------------hh---hHHHHHHHHHhh--CCeeEEEEecCcchhHHHHHH Confidence 88877543322111000 00 011222222111 1111111111 111112211 Q ss_pred ---HHHHHhhc--cceeEEEEecc-C------CCHHHHHhhhhhccccccCccceEEecceeEeeccccccceeeechHH Q lcl|NC_021557. 163 ---EMDVVASR--LHALAIADLPL-G------LTKQQAVAARGVAGTANTSSARTVLTYPHVVIEDTTGATETRLDPLSS 230 (419) Q Consensus 163 ---~l~~~~~~--~~~~~i~d~p~-~------~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~~~~~~~~~~~~p~s~ 230 (419) .......+ ...|+++.++. + .+-.+..... ..-..++.+.+..++...+ + -..| T Consensus 118 ~qa~a~el~~~~~R~vffile~~g~d~~~~~ge~w~~y~~~l-~a~~~gia~~~V~vV~~~~-------g------n~~G 183 (376) T protein:vir:37 118 LQECYAELLAKFGRRTFFIQAVQGINHDQSDGETWDQYVQKL-TTLQQTIVADHVCLVPLLF-------G------NETG 183 (376) T ss_pred HHHHHHHHHHhcCCeEEEEEeccCCCCcccccCCHHHHHHHH-HHHhccccccceeeeeeec-------c------chHH Confidence 12222232 35688888762 1 1222222221 1223456666766653211 0 1367 Q ss_pred HHHHHHHhhhhccCceecccCceeeceeec---ceeccc-ccCCcchhhccccCCceEEEEEecCC-cEEEEeccccCCC Q lcl|NC_021557. 231 RLAGVIIATDLNEGWQNSPSNREIKGVVDL---EVPINF-YPSDYQNDTNFLNEAGIVTAMRSFAT-GIRVFGNRSAAFP 305 (419) Q Consensus 231 ~vAg~~a~~D~~~g~~~span~~l~gv~~~---~~~~~~-~~~~~~~~~~~L~~~gI~~i~~~~~~-G~~~wG~rT~~~~ 305 (419) .+||.+|.. ..-++.||....--.+.++ ..+.+. ...........|..+|..+.+.++|. |+.+-..||++.+ T Consensus 184 ~~aGRl~na--aVsVadspgRV~tGai~gl~~~~~p~d~~g~el~~a~l~aLd~arysvpr~Y~gydG~Yw~dg~tl~~~ 261 (376) T protein:vir:37 184 VLAGRLANR--AVTVADSPARVQTGALVSLGSANKPLDKDGNELTLAHLKSLETARYSVPMWYPDYDGYYRADGRTLDVE 261 (376) T ss_pred HHHHHHHhC--CcchhcCccceeecccccccccccccccCCcccchHHHHHHHhCCCeEEEeeCCCCceEEeCCeEeccC Confidence 888888752 3346889987653223332 222211 11113345567999999999998874 7666667899877 Q ss_pred CCcccceeeehhhHHHHHHHHHHHHHHHhhcC---CCCHHHHHHHHHHHHHHHHHHHhhc-ccc----eEEEEecccCCH Q lcl|NC_021557. 306 TSSHVENFIHARRILDMIHEAIIFYTMNYVDR---LGSPMTVEAAEEGVNAYLRSKTGIA-IYG----GTFRFDRQKNTA 377 (419) Q Consensus 306 s~~~~~~~i~vrR~~~~i~~~~~~~~~~~v~e---~n~~~~~~~i~~~i~~~L~~l~~~g-~~~----~~v~~d~~~n~~ 377 (419) .+| ++||..+|+.|-+.|.++...-..+.. +.++.-....+..+..=|++|.+.+ +.| |+|.-.++ T Consensus 262 gsD--Yq~ie~~RVvdKa~R~vR~~Ai~~i~Dr~lnstp~sia~~~~~~~~pLr~M~ks~ei~g~~fpgei~~P~d---- 335 (376) T protein:vir:37 262 GGD--YQVIENLRVVDKVARKVRLLAIGKIADRSFNSTTSSTEYHKNYFAKPLRDMSKSATINGKDFPGECMPPKD---- 335 (376) T ss_pred CCC--eeeehhchHHHHHHHHHHHHHHHHhcCccccCChhHHHHHHHHHhHHHHHHHhhhhhccccccceeecCCC---- Confidence 766 999999999998888877665544443 3456677788888888899997653 433 44554332 Q ss_pred HHh-----hCCEEEEEEEEEeccCceEEEEEEEEcchHHHH Q lcl|NC_021557. 378 EQI-----ADGKFYYRLECHPISVMERITIDSYVDTKFISN 413 (419) Q Consensus 378 ~~i-----~~G~~~~~v~~~p~~p~e~i~~~~~~~~~~~~~ 413 (419) +|| ...++.+-+.+.|.---+.|+..+..|-+-+.+ T Consensus 336 ~dI~i~w~sk~~V~I~~~vrPy~cpk~i~~~I~LDls~~~~ 376 (376) T protein:vir:37 336 DAITIVWQSKTKVTIYIKVRPYDCPKEITANIFLDLDSLGE 376 (376) T ss_pred CceEEEeccCceEEEEEEEeeecCcceeEEEEEEecCCCCC Confidence 122 246677888888888889999999988664433 No 56 >protein:vir:4463 Length: 498 # NCBI annotation: Tail Sheath protein # Family: family:all:369 # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700386;genbank:gi:23505458;genbank:GeneID:955665 Probab=98.62 E-value=1.7e-07 Score=57.65 Aligned_cols=370 Identities=13% Similarity=0.060 Sum_probs=167.5 Q ss_pred CC-------Ccc-CCCeEEEEcCCCccCccccCccceEEEEccccccccccccccccccCcceeecchHHHHHHhcccch Q lcl|NC_021557. 1 MA-------ATF-HHGPEVIEHKDGVTVVRDVKSAVTYVNGTAPIQDVHATALAREDYINKRVIIRSRAEGAAAFGVHKA 72 (419) Q Consensus 1 Ma-------~~~-~hGVyv~e~~~~~~~i~~v~tav~~~Vgta~~a~~~~~~~~~~~~~n~pv~its~~e~~~~fg~~~~ 72 (419) |. ..+ .||+|+|-.++...+.... .-..+||..-. ....+.++|++++|..++...||.. T Consensus 1 M~IsF~~IP~~iRvP~~y~E~dns~A~~~~~~--q~vLiiGq~la--------~gs~~~~~~v~v~s~~~a~~~fG~G-- 68 (498) T protein:vir:44 1 MAISFNSIPSDTRVPLFYAEMDNSAANTARDS--GASLLIGHASN--------DASIAVNSLVLVSSVDYARQICGAG-- 68 (498) T ss_pred CCCchhhcCcccccCeEEEEEeCCCCCCCcCC--cceEEEEecCc--------ccccccceeEeecCHHHHHHhcCcc-- Confidence 44 444 5999999877666544433 34677775321 1122358999999999999999975 Q ss_pred hhhHHHHHHHHhhccCCcEEEEeecccccccccccc-----------------------------ccccccc-------- Q lcl|NC_021557. 73 GYTIPAALDAIFDQGDGGTIIVNNVFDPDVHKEGAN-----------------------------PDPSKVT-------- 115 (419) Q Consensus 73 ~~~l~~al~~~~~~~~~~~~v~~~~~~~~~~~~~~~-----------------------------~~~~~~t-------- 115 (419) +-|..-...+.++.....+.+....+..-...... ..++.+. T Consensus 69 -Sml~~M~~a~~~~n~~~~l~~i~~~D~aG~aAtg~it~tg~at~~G~l~l~Igg~~v~v~V~~gdTaa~vA~al~aain 147 (498) T protein:vir:44 69 -SQLARMVGAYRKTDPFGELYVIAVPESTGAAATVALTVTGEATETGTVNVYTGRTRVQAPVTSGDDAAAVAVSIKDAVN 147 (498) T ss_pred -cHHHHHHHHHHHhCCCceeEEEecCCcccceeEEEEEeecccCCCcEEEEEECCEEEEEEecCCCCHHHHHHHHHHHHh Confidence 33444444444443333222222221110000000 0000000 Q ss_pred --------cceeccccccccccccchhhhhhhhh-------------ccccccccccchhhhhhhHHHHHHHHhhcccee Q lcl|NC_021557. 116 --------TVDINGTISPAGLASGFSGAYECYNN-------------FGYFPKLIIAPGYSPAAAVRAEMDVVASRLHAL 174 (419) Q Consensus 116 --------~~~~~g~~~~~~~~tg~~a~~~~~~~-------------~~~~p~~~~ap~~~~~~~v~a~l~~~~~~~~~~ 174 (419) ..-..+...-+...+|... .+..-. -++...+....+....+.+..+|+++....-- T Consensus 148 a~~~lPVTA~~~~~~vtlTAr~kG~~G-N~I~l~~~~~~~~~ge~~p~Glt~titamsgGag~PDia~alaal~~~~~~- 225 (498) T protein:vir:44 148 ANPDLPFTATSEAGVVTLTARHKGLYG-NEIPVTLNYYGFGGGEVLPAGVNITVASGVKGAGAPALNDAVAAMGDEPFD- 225 (498) T ss_pred CCCCCceEEeeccceEEEEEeccCccc-CcceEEEeeccCccccccccceeEEEEcccCCccCchhHHHHHhhccCCcc- Confidence 0000000000000111000 000000 01111112223334455666666666544322 Q ss_pred EEEEeccCCC-HHHHHh-h------hhhc---------------------cccccCccceEEecceeEeeccccccceee Q lcl|NC_021557. 175 AIADLPLGLT-KQQAVA-A------RGVA---------------------GTANTSSARTVLTYPHVVIEDTTGATETRL 225 (419) Q Consensus 175 ~i~d~p~~~~-~~~~~~-~------~~~~---------------------~~~~~~s~~~~~~~p~~~~~~~~~~~~~~~ 225 (419) ++-+|...+ .-.+.. . |+.. -....++.|..+.+.. .. .. T Consensus 226 -~i~~p~~D~asl~al~~~L~~~sgRw~~~~q~~g~~~~a~~gT~a~l~t~g~~~N~~~it~~~~~-------~~---~~ 294 (498) T protein:vir:44 226 -YIGLPFNDTASVNSMATEMNDSSGRWSYVRQLYGHVYTAKTGTLSELVAAGDQFNLQHITLAGYE-------KD---TQ 294 (498) T ss_pred -EEEEeecCHHHHHHHHHHHhhhhcchHHHhhcCeEEEEeccCCHHHHHHhhhccCCceEEEEecC-------CC---CC Confidence 222232211 111111 0 1000 0011222333222110 00 00 Q ss_pred ech---HHHHHHHHH---hhhhccCceecccCceeeceeecceecccccCCcchhhccccCCceEEEEEecCCc-EEEEe Q lcl|NC_021557. 226 DPL---SSRLAGVII---ATDLNEGWQNSPSNREIKGVVDLEVPINFYPSDYQNDTNFLNEAGIVTAMRSFATG-IRVFG 298 (419) Q Consensus 226 ~p~---s~~vAg~~a---~~D~~~g~~~span~~l~gv~~~~~~~~~~~~~~~~~~~~L~~~gI~~i~~~~~~G-~~~wG 298 (419) -|+ ++.+||+.+ +.| |-..--...|.|+..+.....+ ...|.|.|..+||.++.-. .| ..+-- T Consensus 295 sp~~~~AAa~a~~aA~~l~~D----PArPL~tl~L~Gi~~p~~~~r~----~~~ern~LL~~Gist~~V~--~G~V~I~R 364 (498) T protein:vir:44 295 TPADELAASRTARAAVFIRND----PARPTQTGELVDMLPAPKGKRF----TTTEQQTLLSHGVATAYVE--SGVLRIQR 364 (498) T ss_pred CHHHHHHHHHHHHHHHHhhcc----cccccCceeecccccCCchhcC----ChHHHHHHHhcCcceEEEc--CCeEEEEe Confidence 122 223333333 334 4332234568888866443333 3567788999999998643 44 33333 Q ss_pred cccc----CCCCCcccceeeehhhHHHHHHHHHHHHHHHh-hcCCCCH-----------HHHHHHHHHHHHHHHHHHhhc Q lcl|NC_021557. 299 NRSA----AFPTSSHVENFIHARRILDMIHEAIIFYTMNY-VDRLGSP-----------MTVEAAEEGVNAYLRSKTGIA 362 (419) Q Consensus 299 ~rT~----~~~s~~~~~~~i~vrR~~~~i~~~~~~~~~~~-v~e~n~~-----------~~~~~i~~~i~~~L~~l~~~g 362 (419) ..|. .....|+.|..|+..|+.+|+++.++..+... --+.... .|-..|+..+-.-++.|...| T Consensus 365 ~ITTY~~n~~G~~D~syLDi~T~~tl~yvr~~~r~~i~~kfpR~KLa~d~~~~~~gq~IvTp~~ir~eli~~y~~le~~g 444 (498) T protein:vir:44 365 DITTYRKNAYGVADNSYLDSETLHTSAYVLRRLKSVITSKYGRHKLANDGTRFGSGQAIVTPAVIRGELGSTYRQMEREG 444 (498) T ss_pred eeeeeeecCCCCcchhhhhhhhHHHHHHHHHHHHHHhhhhcCCcccccCCcccCCCcccccHHHHHHHHHHHHHhhhhhc Confidence 3332 12245788999999999999999999887532 2223222 266789999988888888777 Q ss_pred cc----ceE--EEEecccCCHHHhhCCEEEEEEEEEeccCce----EEEEEEEEcchHHHHHHHhcC Q lcl|NC_021557. 363 IY----GGT--FRFDRQKNTAEQIADGKFYYRLECHPISVME----RITIDSYVDTKFISNALSLAA 419 (419) Q Consensus 363 ~~----~~~--v~~d~~~n~~~~i~~G~~~~~v~~~p~~p~e----~i~~~~~~~~~~~~~~~~~~a 419 (419) ++ .|+ ..+.++.+ +..|+.+.+-...+-... .|.|+++++.. -| T Consensus 445 ivEn~~~~~~~LiVerd~~-----dpnRln~~~p~d~vn~L~V~A~~~~f~lq~~~~--------~~ 498 (498) T protein:vir:44 445 IVENFDLFQQHLIVERNAN-----DSNRLDVLFPPDYVNQLRVFAVLNQFRLQYSEE--------AA 498 (498) T ss_pred cccChhhhcceeEEEECCC-----CCcEEEEEecccccCchhhhhhhhhhhhhhhhh--------cC Confidence 54 232 33333322 223444444333333333 23444444432 12 No 57 >protein:vir:489 Length: 498 # NCBI annotation: putative sheath protein # Family: family:all:369 # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543096;swissprot:trembl:q8w623;genbank:gi:18249908;uniprot:Q8W623;genbank:GeneID:929698 Probab=98.56 E-value=2.8e-07 Score=56.49 Aligned_cols=366 Identities=12% Similarity=0.031 Sum_probs=168.4 Q ss_pred CC-------Ccc-CCCeEEEEcCCCccCccccCccceEEEEccccccccccccccccccCcceeecchHHHHHHhcccch Q lcl|NC_021557. 1 MA-------ATF-HHGPEVIEHKDGVTVVRDVKSAVTYVNGTAPIQDVHATALAREDYINKRVIIRSRAEGAAAFGVHKA 72 (419) Q Consensus 1 Ma-------~~~-~hGVyv~e~~~~~~~i~~v~tav~~~Vgta~~a~~~~~~~~~~~~~n~pv~its~~e~~~~fg~~~~ 72 (419) |. .++ .||+|+|-.++...+-... .-..+||..-. ....+.++|++++|..++...||.. T Consensus 1 M~IsF~~IP~~iRvP~~y~E~dns~A~~~~~~--qrvLiiGq~la--------~gt~~~~~~v~v~s~~~a~~~fG~G-- 68 (498) T protein:vir:48 1 MTISFSAVPSDTLVPLFYAEMDNSAANTAVTS--APALLIGHASN--------DAAIEVNSLVLMPSADYARQICGAG-- 68 (498) T ss_pred CCccccccCcccccceEEEEEecCCCccccCC--cceEEEeecCc--------cccccccceEEecCHHHHHHhcCcc-- Confidence 44 444 5999999877777654443 34677774321 1122358999999999999999975 Q ss_pred hhhHHHHHHHHhhccCCcEEEEeecccccccccccc-----------------------------ccccc---------- Q lcl|NC_021557. 73 GYTIPAALDAIFDQGDGGTIIVNNVFDPDVHKEGAN-----------------------------PDPSK---------- 113 (419) Q Consensus 73 ~~~l~~al~~~~~~~~~~~~v~~~~~~~~~~~~~~~-----------------------------~~~~~---------- 113 (419) +-+..-++.+.++.....+.+....+..-...... ..+.. T Consensus 69 -S~l~~M~~a~~~~n~~~~l~~i~~~D~ag~aA~g~it~tg~at~~G~l~l~Igg~~v~v~V~~gdTaa~vA~al~aai~ 147 (498) T protein:vir:48 69 -SQLARMVDVYRQTDPFGELYVIAVPEARGAAATVRVTVTGEAEESGTLSLYVGRSSVQVPVVNGDDATAVATAIKEAVN 147 (498) T ss_pred -cHHHHHHHHHHHhCCCceeEEEeeCCcccceeEEEEEecccccCCceEEEEECCEEEEEeecCCCCHHHHHHHHHHHHh Confidence 33444444444433333222222111110000000 00000 Q ss_pred ------cccceeccccccccccccchhhhhhhhhc-------------cccccccccchhhhhhhHHHHHHHHhhcccee Q lcl|NC_021557. 114 ------VTTVDINGTISPAGLASGFSGAYECYNNF-------------GYFPKLIIAPGYSPAAAVRAEMDVVASRLHAL 174 (419) Q Consensus 114 ------~t~~~~~g~~~~~~~~tg~~a~~~~~~~~-------------~~~p~~~~ap~~~~~~~v~a~l~~~~~~~~~~ 174 (419) ++..-..+...-+...+|... .+..-.. ++...+....+....+.+..+|+++....-- T Consensus 148 a~~~lPVTA~~~~~~VtlTAr~kG~~G-N~I~l~~~~~~~~~ge~~p~Glt~~itamsgGag~PDia~aLaal~~~~~~- 225 (498) T protein:vir:48 148 GVITLPFAASSDAGVVTLTARHKGLYG-NELPVCLNYYGSGGGEILPAGLQVVTEAGTAGSGAPDLTAAVAAMGDEAFD- 225 (498) T ss_pred CCCCcceEEEecCcEEEEEeeeccccc-ccceeeeeeccCcccccccceeeEEEEcccCCccCcchHHHHHhhccCCcc- Confidence 000000000000011111100 0000000 1111111122334445555666665443322 Q ss_pred EEEEeccCC----------------------------------CHHHHHhhhhhccccccCccceEEecceeEeeccccc Q lcl|NC_021557. 175 AIADLPLGL----------------------------------TKQQAVAARGVAGTANTSSARTVLTYPHVVIEDTTGA 220 (419) Q Consensus 175 ~i~d~p~~~----------------------------------~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~~~~~~ 220 (419) ++-+|... +..+...+- ...++.|..+.+. + T Consensus 226 -~I~~p~~D~asl~al~~~L~~~sgRw~~~~q~~g~~~~a~~gT~~~l~t~g-----~~~N~~~it~~~~---------~ 290 (498) T protein:vir:48 226 -FIGLPFNDAASINMMMTEMNDSSGRWSYARQLYGHVYTAKLGTLSELVNAG-----DMHNQQHITLAGY---------E 290 (498) T ss_pred -EEEEeecCHHHHHHHHHHHhhhhhhhhHHhhcCeEEEEeccCCHHHHHHhh-----hccCCceEEEEec---------C Confidence 22223221 122222111 1122333332221 0 Q ss_pred cceeeechHH---HHHHHHH---hhhhccCceecccCceeeceeecceecccccCCcchhhccccCCceEEEEEecCCcE Q lcl|NC_021557. 221 TETRLDPLSS---RLAGVII---ATDLNEGWQNSPSNREIKGVVDLEVPINFYPSDYQNDTNFLNEAGIVTAMRSFATGI 294 (419) Q Consensus 221 ~~~~~~p~s~---~vAg~~a---~~D~~~g~~~span~~l~gv~~~~~~~~~~~~~~~~~~~~L~~~gI~~i~~~~~~G~ 294 (419) .. ..-|+.. ..|++.+ +.| |-..--...|.|+..+.....+ ...|.|.|..+||.++.- .++-. T Consensus 291 ~~-~~~p~~~~AAa~a~~aA~~l~~D----PArPLqtl~L~Gi~~p~~~~r~----~~~ern~LL~~Gist~~V-~~G~V 360 (498) T protein:vir:48 291 KE-TQSPVDELVASRLAREAVFIRND----PARPTQTGELVGMLPAPKGKRF----IMTEQQTLLSHGVATAYV-EGGTL 360 (498) T ss_pred CC-CCChHHHHHHHHHHHHHHhhhcc----ccccccceeeeccccCCchhcC----ChHHHHHHHhcCcceEEE-cCCeE Confidence 00 0013322 3333333 444 3222223568888866544333 356778899999999864 54434 Q ss_pred EEEeccccC----CCCCcccceeeehhhHHHHHHHHHHHHHHHh-hcCCCCHH-----------HHHHHHHHHHHHHHHH Q lcl|NC_021557. 295 RVFGNRSAA----FPTSSHVENFIHARRILDMIHEAIIFYTMNY-VDRLGSPM-----------TVEAAEEGVNAYLRSK 358 (419) Q Consensus 295 ~~wG~rT~~----~~s~~~~~~~i~vrR~~~~i~~~~~~~~~~~-v~e~n~~~-----------~~~~i~~~i~~~L~~l 358 (419) .+--..|.- ....|+.|..|+..|+.+|+++.++..+... --+..... |-..|+..+-.-++.| T Consensus 361 ~I~R~ITTY~~n~~G~~D~syLDi~T~~tl~yvr~~~r~~i~~kfpR~KLa~dg~~~~~gq~IvTp~~ir~eli~~y~~l 440 (498) T protein:vir:48 361 RIQRSVTTYKKNAYGVADNSYLDSETLHTSAYVLRKLKSVITSKYGRHKLANDGTRFGPGQAIVTPAVIKGELLATYRQM 440 (498) T ss_pred EEEeeeeeeeecCCCCcchhhhhhhhHHHHHHHHHHHHHHhhhhcCCceecccCcccCCCCcccchHHHHHHHHHHHHhh Confidence 444444431 2244788999999999999999999887633 22233222 6678899888888888 Q ss_pred Hhhccc----ceE--EEEecccCCHHHhhCCEEEEEEEEEeccCce----EEEEEEEEcchHH Q lcl|NC_021557. 359 TGIAIY----GGT--FRFDRQKNTAEQIADGKFYYRLECHPISVME----RITIDSYVDTKFI 411 (419) Q Consensus 359 ~~~g~~----~~~--v~~d~~~n~~~~i~~G~~~~~v~~~p~~p~e----~i~~~~~~~~~~~ 411 (419) ...|++ .|+ ..+.++.+ +..|+.+.+-....-+.. .|.|+++++...- T Consensus 441 e~~given~~~~~~~LiVerd~~-----dpnRln~~~p~d~vn~L~V~A~~~~f~lq~~~~~~ 498 (498) T protein:vir:48 441 ERAGIVENYDLFKQYLIVERDAD-----NPNRLNTLFPPDYVNQLRVFAVVNQFRLQYSEESA 498 (498) T ss_pred hhhccccChhhhcceeEEEECCC-----CCcEEEEEecccccCchhhhhhhhhhhhhhhhcCC Confidence 877754 232 33333322 223444443333333332 3455555543311 No 58 >protein:vir:1996 Length: 495 # NCBI annotation: major tail subunit # Family: family:all:369 # MgeID: mge:320 # MgeName: Mu # Cross-refs: genbank:acc:NP_050643;genbank:gi:9633530;genbank:GeneID:2636269 Probab=98.42 E-value=7.4e-07 Score=54.18 Aligned_cols=373 Identities=11% Similarity=0.001 Sum_probs=173.7 Q ss_pred CCC--------cc-CCCeEEEEcCCCccCccccCccceEEEEccccccccccccccccccCcceeecchHHHHHHhcccc Q lcl|NC_021557. 1 MAA--------TF-HHGPEVIEHKDGVTVVRDVKSAVTYVNGTAPIQDVHATALAREDYINKRVIIRSRAEGAAAFGVHK 71 (419) Q Consensus 1 Ma~--------~~-~hGVyv~e~~~~~~~i~~v~tav~~~Vgta~~a~~~~~~~~~~~~~n~pv~its~~e~~~~fg~~~ 71 (419) |++ .+ .||+|+|-.++...+-......-..+||..-. ....+.++|++++|..++...||.. T Consensus 1 m~~i~F~~IP~~iRvP~~y~E~dns~A~~g~~~~~q~vLiiGq~la--------~gs~~~~~pv~v~s~~~a~~~fG~G- 71 (495) T protein:vir:19 1 MSDISFNAIPSDVRVPLTYIEFDNSNAVSGTPAPRQRVLMFGQSGS--------KASAAPNVPVRIRSGSQASAAFGQG- 71 (495) T ss_pred CCCCchhhCCcccccCeEEEEEccCCCCcCCcCCCceEEEEEecCc--------ccccccceeEEecCHHHHHHhcCcC- Confidence 553 33 59999998877666443344445677885321 1223458999999999999999975 Q ss_pred hhhhHHHHHHHHhhccCCcEEEEeeccccc-------ccc----------------------ccccccccccc------- Q lcl|NC_021557. 72 AGYTIPAALDAIFDQGDGGTIIVNNVFDPD-------VHK----------------------EGANPDPSKVT------- 115 (419) Q Consensus 72 ~~~~l~~al~~~~~~~~~~~~v~~~~~~~~-------~~~----------------------~~~~~~~~~~t------- 115 (419) +-+..-++.+.++.....+.+....+.. .+. ...+..++.+. T Consensus 72 --S~la~M~~a~~~~n~~~~l~~i~~~D~aG~aA~g~it~tg~at~~G~l~l~I~g~~v~v~V~~gdTaa~vA~al~aai 149 (495) T protein:vir:19 72 --SMLALMADAFLNANRVAELWCIPQGNGTGNAAVGEISLSGTAGENGSLVTYIAGQRLAVSVAAGATGAALADLLVARI 149 (495) T ss_pred --cHHHHHHHHHHHhCCcceEEEEeeCChhhceeEEEEEEeecCCCCcEEEEEECCEEEEEEecCCCCHHHHHHHHHHHh Confidence 2233333333332222211111111000 000 00000111000 Q ss_pred --------ccee---------ccccccccccccchhhhhhhhhc----------cccccccccchhhhhhhHHHHHHHHh Q lcl|NC_021557. 116 --------TVDI---------NGTISPAGLASGFSGAYECYNNF----------GYFPKLIIAPGYSPAAAVRAEMDVVA 168 (419) Q Consensus 116 --------~~~~---------~g~~~~~~~~tg~~a~~~~~~~~----------~~~p~~~~ap~~~~~~~v~a~l~~~~ 168 (419) ++.. .+...-+...+|- . .+....+ ++...+....+....+.+..+|++.. T Consensus 150 na~~~lPvTA~~~~~~~~~~a~~~VtlTAr~kG~-~-n~idi~~~~~~ge~~p~Glt~titamsgGag~PDia~alaal~ 227 (495) T protein:vir:19 150 KGQPDLPVTAEVRADSGDDDTHADVVLSAKFTGA-L-SAVDVRWNYYAGETTPYGIITAFKAASGKNGNPDISASIAGMG 227 (495) T ss_pred cCCccCceEEEeeccCCCCcCceeEEEEEeeccc-c-ccceeEEEeecccccccceeEEEEecCCCCCCcchHHHHHHhc Confidence 0000 0111112222221 1 1111110 11112222334445556666666665 Q ss_pred hccceeEEEEeccCCCH-HHH----Hhhhhhc---------------------cccccCccceEEecceeEeeccccccc Q lcl|NC_021557. 169 SRLHALAIADLPLGLTK-QQA----VAARGVA---------------------GTANTSSARTVLTYPHVVIEDTTGATE 222 (419) Q Consensus 169 ~~~~~~~i~d~p~~~~~-~~~----~~~~~~~---------------------~~~~~~s~~~~~~~p~~~~~~~~~~~~ 222 (419) .... -++-+|...+. -++ ++.|+.. -....++.|..+.+- ++ T Consensus 228 ~~~~--~~I~~P~tD~asL~al~~~l~~rw~~~~q~~g~~~~a~~gT~~~l~t~g~~~N~~~it~~~~---------~g- 295 (495) T protein:vir:19 228 DLQY--KYIVMPYTDEPNLNLLRTELQERWGPVNQADGFAVTVLSGTYGDISTFGVSRNDHLISCMGI---------AG- 295 (495) T ss_pred cCCC--cEEEEecCcHHHHHHHHHHHHHhhhHHHhcCeEEEEeecCCHHHHHHhhhccCCceEEEEec---------CC- Confidence 4432 22223322211 111 1111100 001122333333210 00 Q ss_pred eeeechHHHHHHHHHhhh--hccCceecccCceeeceeecceecccccCCcchhhccccCCceEEEEEecCCcEEEEecc Q lcl|NC_021557. 223 TRLDPLSSRLAGVIIATD--LNEGWQNSPSNREIKGVVDLEVPINFYPSDYQNDTNFLNEAGIVTAMRSFATGIRVFGNR 300 (419) Q Consensus 223 ~~~~p~s~~vAg~~a~~D--~~~g~~~span~~l~gv~~~~~~~~~~~~~~~~~~~~L~~~gI~~i~~~~~~G~~~wG~r 300 (419) ..-||....|++.++.- .+..|-..--...|.|+..+.....+ ...|.|.|..+||.++.-..++=..+--.. T Consensus 296 -sp~~~~~~AAA~aa~~A~~l~~DPArPL~tl~L~Gi~~p~~~~r~----~~~ern~LL~~Gist~~V~~~G~V~I~R~I 370 (495) T protein:vir:19 296 -APEPSYLYAATLCAVASQALSIDPARPLQTLTLPGRMPPAVGDRF----TWSERNALLFDGISTFNVNDGGEMQIERMI 370 (495) T ss_pred -CCCcHHHHHHHHHHHHHHHhhcccccccCceeecceecCCccccC----ChHHHHHHHhCCcceEEECCCCeEEEEeee Confidence 11244443333333221 12224333334678888866544333 356778899999998865444323343333 Q ss_pred ccC----CCCCcccceeeehhhHHHHHHHHHHHHHHHhhc-CCCCHH-----------HHHHHHHHHHHHHHHHHhhccc Q lcl|NC_021557. 301 SAA----FPTSSHVENFIHARRILDMIHEAIIFYTMNYVD-RLGSPM-----------TVEAAEEGVNAYLRSKTGIAIY 364 (419) Q Consensus 301 T~~----~~s~~~~~~~i~vrR~~~~i~~~~~~~~~~~v~-e~n~~~-----------~~~~i~~~i~~~L~~l~~~g~~ 364 (419) |.- ....|+.|..|+.-|+.+|+++.++..+...-. +..... |-+.|+..+-+-++.|...|++ T Consensus 371 TTY~~n~~G~~D~syLDi~T~~tl~yvr~~~r~~i~~kfpR~KLa~d~~~~~~gq~IvTp~~ir~ell~~~~~le~~giv 450 (495) T protein:vir:19 371 TMYRTNKYGDSDPSYLNVNTIATLSYLRYSLRTRITQKFPNYKLASDGTRFATGQAVVTPSVIKTELLALFEEWENAGLV 450 (495) T ss_pred eeeeecCCCCcchhhhhhHHHHHHHHHHHHHHHHHhhhcCCcccccCCCCCCCcccccChHHHHHHHHHHHHhhhhhccc Confidence 331 224477899999999999999999988763322 233322 5677898888888888777754 Q ss_pred ----ceE--EEEecccCCHHHhhCCEEEEEEEEEeccCceEEEEEEEEcc Q lcl|NC_021557. 365 ----GGT--FRFDRQKNTAEQIADGKFYYRLECHPISVMERITIDSYVDT 408 (419) Q Consensus 365 ----~~~--v~~d~~~n~~~~i~~G~~~~~v~~~p~~p~e~i~~~~~~~~ 408 (419) .|+ ..+.++-+ +.+|+.+.+-...+....-+-.++++-- T Consensus 451 en~~~~~~~LiVerd~~-----dpnRln~~~p~d~vn~L~V~A~~i~f~L 495 (495) T protein:vir:19 451 EDFDTFKEELYVARNKD-----DKDRLDVLCGPNLINQFRIFAAQVQFIL 495 (495) T ss_pred cChhhhcceeEEEECCC-----CCcEEEEEecceeeCceeeeeeeeeeeC Confidence 232 33333221 2345555555555555544333333222 No 59 >protein:vir:5260 Length: 502 # NCBI annotation: hypothetical protein # Family: family:all:396 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852765;genbank:gi:31544040;uniprot:Q7Y5T5;genbank:GeneID:2753559 Probab=98.36 E-value=1.1e-06 Score=53.28 Aligned_cols=375 Identities=10% Similarity=-0.048 Sum_probs=188.9 Q ss_pred CCCccCCCeEEEEcCCCccCccccCccceEEEEccccccccccccccccccCccee-ecchHHHHHHhcccchhhhHHHH Q lcl|NC_021557. 1 MAATFHHGPEVIEHKDGVTVVRDVKSAVTYVNGTAPIQDVHATALAREDYINKRVI-IRSRAEGAAAFGVHKAGYTIPAA 79 (419) Q Consensus 1 Ma~~~~hGVyv~e~~~~~~~i~~v~tav~~~Vgta~~a~~~~~~~~~~~~~n~pv~-its~~e~~~~fg~~~~~~~l~~a 79 (419) |+=-...=|.|. ++-.+.++...+...+.|+|+...-.+. .....++ .+|..+....||....+ +.+ T Consensus 1 msip~s~ivnV~-i~~~~~a~~~~~f~~~l~l~~~~~~~~~--------~~~~r~~~~~s~~~V~~~FG~~s~e---y~a 68 (502) T protein:vir:52 1 MALSISHIVNVQ-LNTVPKSAARKSFGIVALFTPEAGQAFA--------DEKTRYVYVENQRDVEQLFGTNSET---AKA 68 (502) T ss_pred CCCCccceeEEe-eccccccccccccCceEEEeeccCcccc--------CCccceEEecCHHHHHHhcCCChHH---HHH Confidence 884333444444 3444566677777788888853222111 0122333 46778888888875333 334 Q ss_pred HHHHhhccCCcEEE-E-eeccccc-----------------------------------cccccccc---------ccc- Q lcl|NC_021557. 80 LDAIFDQGDGGTII-V-NNVFDPD-----------------------------------VHKEGANP---------DPS- 112 (419) Q Consensus 80 l~~~~~~~~~~~~v-~-~~~~~~~-----------------------------------~~~~~~~~---------~~~- 112 (419) ...+|.+......+ + +...... ......+. +.. T Consensus 69 A~~yF~q~p~P~~l~igR~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~G~l~i~i~g~~~t~~~i~lS~~ts~~~vA~ 148 (502) T protein:vir:52 69 AQPFFAQSPRAKQLIVARWQKSASTIEATKNTLSGATLSDDLERFKSVVNGRFSLTIGGDVKKVDGLSFARLADFNAVAT 148 (502) T ss_pred HHHHhcCCCccceEEEEeccccccceeechhhhhhhhhHHhHHHhhhhcCceeEEEecceeeeeeccccccccchhHHHH Confidence 44445443222111 1 1100000 00000000 000 Q ss_pred ----ccccce-------------ecccccccccccc--c-hhh------hhhhhhcccccccc------ccchhhhhhhH Q lcl|NC_021557. 113 ----KVTTVD-------------INGTISPAGLASG--F-SGA------YECYNNFGYFPKLI------IAPGYSPAAAV 160 (419) Q Consensus 113 ----~~t~~~-------------~~g~~~~~~~~tg--~-~a~------~~~~~~~~~~p~~~------~ap~~~~~~~v 160 (419) +..... ++-....++..+. + .+. ...-..+.+....- ...+.. .... T Consensus 149 ~i~~~l~~~~~~~tv~~d~~~~~F~i~s~ttg~~~~~~~~~a~~~~~~gt~~a~~l~l~~~~~av~v~~~~~g~~-aet~ 227 (502) T protein:vir:52 149 KIQEKLTTLSVAVSIAYDETGNRFIVSANVAGEDKKTEIDYAIDEGGEGEYIGALLKLENGQASRKVGKNSVSLK-KETL 227 (502) T ss_pred HHHhhhcccccceEEEEecCCceEEEEeccCCCcceeEEEEeecCCcchhHHHHHhccccccceeeeeeeccccc-ccCH Confidence 000000 0000000000000 0 000 00001111111110 111111 2233 Q ss_pred HHHHHHHhhccceeEEEEeccCCCHHHHHhhhhhccccccCccceEEecce-eEeeccc------------cccceee-- Q lcl|NC_021557. 161 RAEMDVVASRLHALAIADLPLGLTKQQAVAARGVAGTANTSSARTVLTYPH-VVIEDTT------------GATETRL-- 225 (419) Q Consensus 161 ~a~l~~~~~~~~~~~i~d~p~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~-~~~~~~~------------~~~~~~~-- 225 (419) ..+|.++....+-+..+....+.+.++..+.-.+....+ +...+..+ ....+.. +.....+ T Consensus 228 ~~al~a~~~~~~~w~~~~~a~~~~~~~~la~a~~iea~~----~~f~~~~~d~~~~~~~~~~i~~~l~a~~~~~t~~~y~ 303 (502) T protein:vir:52 228 GEALFNVAEVNNTWYGFTVAAQLTDSEVEAAAKYAQANT----KLFGANVIRAEQIEWSADNIYKKLYDAGLDHTLAMFD 303 (502) T ss_pred HHHHHHHHhccCceEEEEEeecCChhHHHHHHHHHhhcC----cEEEEEecCcceeccccchHHHHHHhccCceeEEEec Confidence 445566554332222222222333333333222221111 11122111 0000000 0000011 Q ss_pred ---echHHHHHHHHHhhhhccC-ceecccCceeeceeecceecccccCCcchhhccccCCceEEEEEecCCcEEEEeccc Q lcl|NC_021557. 226 ---DPLSSRLAGVIIATDLNEG-WQNSPSNREIKGVVDLEVPINFYPSDYQNDTNFLNEAGIVTAMRSFATGIRVFGNRS 301 (419) Q Consensus 226 ---~p~s~~vAg~~a~~D~~~g-~~~span~~l~gv~~~~~~~~~~~~~~~~~~~~L~~~gI~~i~~~~~~G~~~wG~rT 301 (419) -.|.+.+.|.++.+|...- -...-.+|.+.||.... .+.+|++.|..+++|++.++-+.+ .+...++ T Consensus 304 ~~~~~~~aa~~g~~as~~f~~~~g~iT~~fk~l~GV~~~~--------lt~t~~~al~~~~~N~y~~~~~~~-~~~~G~~ 374 (502) T protein:vir:52 304 KNDMYPVSSALARLLSTNFAANNSTLTLKFKQQPTITADE--------ITATEFAKAKRLGINVYTYFDDVA-MIAEGTV 374 (502) T ss_pred CCcchhHHHHHHHHHhcCCCcCcceeeecccccCCcccCc--------CCHHHHHHHHhcCceEEEEecCee-EEecCee Confidence 1356667788888875432 23334567777775322 246788899999999999886555 4666677 Q ss_pred cCCCCCcccceeeehhhHHHHHHHHHHHHHHHhhcC-----CCCHHHHHHHHHHHHHHHHHHHhhccc------------ Q lcl|NC_021557. 302 AAFPTSSHVENFIHARRILDMIHEAIIFYTMNYVDR-----LGSPMTVEAAEEGVNAYLRSKTGIAIY------------ 364 (419) Q Consensus 302 ~~~~s~~~~~~~i~vrR~~~~i~~~~~~~~~~~v~e-----~n~~~~~~~i~~~i~~~L~~l~~~g~~------------ 364 (419) +.+ + ||-+.+-.+|++..|+..+...++. |.|+.=...|+..++.-|++-.++|++ T Consensus 375 ~~G-----~--~iD~~~~~~Wl~~~lq~~l~~~L~~s~~kIPy~~~G~~~l~a~i~~~l~~a~~~G~I~~G~~~~~~~g~ 447 (502) T protein:vir:52 375 IGG-----K--FADEIVILDWFVDAVQKEVFARLYKSPTKIPLTDKGQAILIAAVEKVCLEGINNGAFAPGKWTGAGFGN 447 (502) T ss_pred eCC-----c--hhhHHHHHHHHHHHHHHHHHHHHHhcCCCcccChhHHHHHHHHHHHHHHHHHhcCccccccccCcccce Confidence 643 2 5778888899999999988766542 567777888999999999988877754 Q ss_pred ---------ceEEEEe-cccCCHHHhhCCEE-EEEEEEEeccCceEEEEEEEEcc Q lcl|NC_021557. 365 ---------GGTFRFD-RQKNTAEQIADGKF-YYRLECHPISVMERITIDSYVDT 408 (419) Q Consensus 365 ---------~~~v~~d-~~~n~~~~i~~G~~-~~~v~~~p~~p~e~i~~~~~~~~ 408 (419) ||.+... .++.++.|+.+++. -+.+.+.+...+++|+|.+..+. T Consensus 448 ~~~~d~~~~gy~v~~~~~~~~s~~dr~~R~~~~~~~~~~~aGaIh~v~i~~nv~~ 502 (502) T protein:vir:52 448 LSTGDYLDKGFYVWAAPMDTLSDSDRQARRATPIQTAVKLAGAIHSSDVIVNYNR 502 (502) T ss_pred eeecccccCceEEEeCchhhCCHHHHHcccCCCeEEEEEECceEEEEEEEEEEeC Confidence 3566665 46889999999888 89999999999999999998887 No 60 >protein:vir:80052 Length: 331 # NCBI annotation: gp14 # Family: family:all:396 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468718;genbank:gi:157325298;genbank:GeneID:5601743 Probab=98.15 E-value=3.6e-06 Score=50.41 Aligned_cols=312 Identities=11% Similarity=0.045 Sum_probs=172.7 Q ss_pred CCCccCCCeEEEEcCCCccCccccCccceEEEEccccccccccccccccccCcceeecchHHHHHHhcccchhhhHHHHH Q lcl|NC_021557. 1 MAATFHHGPEVIEHKDGVTVVRDVKSAVTYVNGTAPIQDVHATALAREDYINKRVIIRSRAEGAAAFGVHKAGYTIPAAL 80 (419) Q Consensus 1 Ma~~~~hGVyv~e~~~~~~~i~~v~tav~~~Vgta~~a~~~~~~~~~~~~~n~pv~its~~e~~~~fg~~~~~~~l~~al 80 (419) |-+.. -+|.|.-....+.+.. .-.++.+..+... -..+..++..+...-|+. ...++.+. T Consensus 1 ~~~~i-v~V~v~~~~~~~~~~~--~~~~~~~~~~~t~--------------~~~~~y~s~~~v~~d~~~---~~~~Ykaa 60 (331) T protein:vir:80 1 MVETI-TDVRVHISVLYPSPRI--GLGRPAIFVKGTA--------------MGYKEYTTLEELKDTFAD---NTEVYAKA 60 (331) T ss_pred Cccce-ecceeeeccccccccc--ccCcceeEEeccc--------------cceEEEechhhhccCCCC---CcHHHHHH Confidence 66654 2555543322222222 2233433321100 123555666665555554 35566777 Q ss_pred HHHhhccCCcEEEEeeccccccccccccccccccccceeccccccccccccchhhhhh--hhhccccccccccchhhhhh Q lcl|NC_021557. 81 DAIFDQGDGGTIIVNNVFDPDVHKEGANPDPSKVTTVDINGTISPAGLASGFSGAYEC--YNNFGYFPKLIIAPGYSPAA 158 (419) Q Consensus 81 ~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~t~~~~~g~~~~~~~~tg~~a~~~~--~~~~~~~p~~~~ap~~~~~~ 158 (419) ..+|.++.....+........ +.+.++.+. ...+++... ..+ . T Consensus 61 ~~~f~Q~~~~~~i~v~~~~~~----------------------------~~~~a~~a~~~~~w~~~~~~-----~~~--~ 105 (331) T protein:vir:80 61 KAVFLQKDRPDTVAVITYEDT----------------------------KLLEAAEAYFLKSWHFALLA-----EFK--A 105 (331) T ss_pred HHHHhccCccceEEEeccchH----------------------------HHHHHHHHhccCceeEEEee-----cCC--H Confidence 778887765433221110000 001111110 001111111 111 1 Q ss_pred hHHHHHHH-HhhccceeEEEEeccCCCHHHHHhhhhhccccccCccceEEecceeEeeccccccceeeechHHHHHHHHH Q lcl|NC_021557. 159 AVRAEMDV-VASRLHALAIADLPLGLTKQQAVAARGVAGTANTSSARTVLTYPHVVIEDTTGATETRLDPLSSRLAGVII 237 (419) Q Consensus 159 ~v~a~l~~-~~~~~~~~~i~d~p~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~~~~~~~~~~~~p~s~~vAg~~a 237 (419) +...+++. ++.+...|.+++.. +........ . .+....+++.. .+ - -+.+.+.|..+ T Consensus 106 ~~~~a~a~~~~a~~~~f~~~~~~---~~~~~~~~~-----~--~~~t~~~~~~~-------~~----~-~~~aa~~g~~~ 163 (331) T protein:vir:80 106 ADALALSNLIEEQKFKFAVFQVT---AVADITPLA-----K--NTRTIAIVHSK-------TG----E-KLDAALIGNVA 163 (331) T ss_pred HHHHHHHHHHhhCCcEEEEEecC---chHHHHHhh-----c--cccEEEEEcCC-------cc----c-hhHHHHHHHHH Confidence 11222333 23344556655432 122222110 1 11222233221 11 1 24556667777 Q ss_pred hhhhccCceecccCc-eeeceeecceecccccCCcchhhccccCCceEEEEEecCCcEEEEeccccCCCCCcccceeeeh Q lcl|NC_021557. 238 ATDLNEGWQNSPSNR-EIKGVVDLEVPINFYPSDYQNDTNFLNEAGIVTAMRSFATGIRVFGNRSAAFPTSSHVENFIHA 316 (419) Q Consensus 238 ~~D~~~g~~~span~-~l~gv~~~~~~~~~~~~~~~~~~~~L~~~gI~~i~~~~~~G~~~wG~rT~~~~s~~~~~~~i~v 316 (419) ..|.-+--| .++ .|.||.... .+.+|.+.|..+|+|++.++.+.. .++...|+.+ .||.+ T Consensus 164 ~~~~g~~t~---~fk~~l~GV~~~~--------lt~t~~~al~~~~~N~y~~~~~~~-~~~~G~~~~G-------~~iD~ 224 (331) T protein:vir:80 164 SLPVGSATW---KGRHGLAGITSEE--------LKVSEIDAIQKAGGMCYIEKAGIA-QTSEGKTVSG-------EFIDS 224 (331) T ss_pred hcCccceee---eeecccCCCCCCC--------CCHHHHHHHHhcCceEEEEecCee-EEecceEeCc-------hhHHH Confidence 776533222 344 366665321 246788999999999999875544 4666677643 25889 Q ss_pred hhHHHHHHHHHHHHHHHhhcC----CCCHHHHHHHHHHHHHHHHHHHhhccc---------ceEEEEe-cccCCHHHhhC Q lcl|NC_021557. 317 RRILDMIHEAIIFYTMNYVDR----LGSPMTVEAAEEGVNAYLRSKTGIAIY---------GGTFRFD-RQKNTAEQIAD 382 (419) Q Consensus 317 rR~~~~i~~~~~~~~~~~v~e----~n~~~~~~~i~~~i~~~L~~l~~~g~~---------~~~v~~d-~~~n~~~~i~~ 382 (419) .+-.+|++..|+..+...+-. |-++.-...|+..++.-|++-.+.|++ +|.|... .++.+++|+.+ T Consensus 225 ~~~~dWl~~~lq~~l~~ll~~~~kiPy~~~G~~~l~a~i~~~~~~av~~G~I~~g~~~~~~~~~v~~~~~~~~s~~dr~~ 304 (331) T protein:vir:80 225 IHGDDWIKATIETRLQKLLTETDKLTFDARGIALLQSELTTVLNEGFANGIIDSNDETGEPNFSITALQRSDLNDDDIAK 304 (331) T ss_pred HHHHHHHHHHHHHHHHHHHHhCCCCccChhhHHHHHHHHHHHHHHHHhCCceecCccCCCcceEEEeCchhcCCHHHHhc Confidence 999999999999888766543 556777788999999999998887765 4667765 36779999999 Q ss_pred CEEE-EEEEEEeccCceEEEEEEEEcc Q lcl|NC_021557. 383 GKFY-YRLECHPISVMERITIDSYVDT 408 (419) Q Consensus 383 G~~~-~~v~~~p~~p~e~i~~~~~~~~ 408 (419) ++.. +.+.+.+...+++|+|....+. T Consensus 305 R~~~~~~~~~~~~gaI~~v~i~~~v~~ 331 (331) T protein:vir:80 305 RNYKGLSFRYKRSGAIHSVDVYGEVEV 331 (331) T ss_pred cCCCCeEEEEEEcceEEEEEEEEEEeC Confidence 8877 7888899999999999999887 No 61 >protein:vir:3165 Length: 426 # NCBI annotation: capsid protein CP67 # Family: family:all:28419 # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665936;genbank:gi:22091122;genbank:GeneID:951267 Probab=98.02 E-value=6.7e-06 Score=48.94 Aligned_cols=370 Identities=13% Similarity=0.028 Sum_probs=175.8 Q ss_pred CCCccCCCeEEEEcCCCccCccccCccceEEEEccccccccccccccccccCcceeecchHHHHHHhcccchhhhHHHHH Q lcl|NC_021557. 1 MAATFHHGPEVIEHKDGVTVVRDVKSAVTYVNGTAPIQDVHATALAREDYINKRVIIRSRAEGAAAFGVHKAGYTIPAAL 80 (419) Q Consensus 1 Ma~~~~hGVyv~e~~~~~~~i~~v~tav~~~Vgta~~a~~~~~~~~~~~~~n~pv~its~~e~~~~fg~~~~~~~l~~al 80 (419) |+.. =|.| .++-.+.++..-..+.|.|||+....++... .+...+.+|..+...-||.. ...+.+. T Consensus 1 m~~~---iVnV-~Is~~t~A~~~~~Fg~~liigs~~~~~p~~~-------f~~~~~Yss~~~V~~Dfg~~---s~~Y~AA 66 (426) T protein:vir:31 1 MPKQ---IVEI-ELTAEIADRPQETFTDAAIVGTAEEEPPDAE-------FGEVNQYSTSTSVGDDYGED---SDVYTAS 66 (426) T ss_pred CCcc---eEEE-Eeecccccccccccceeeeeeeccccccccc-------cchhhhhhhHHHHHhcCCCC---hHHHHHH Confidence 9953 2333 3566777788888999999998755544321 25667788999988888864 5667777 Q ss_pred HHHhhccCCcEEEEeeccccccccccccccccccccceeccccccc----cccccchhhhhhhhh--------------- Q lcl|NC_021557. 81 DAIFDQGDGGTIIVNNVFDPDVHKEGANPDPSKVTTVDINGTISPA----GLASGFSGAYECYNN--------------- 141 (419) Q Consensus 81 ~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~t~~~~~g~~~~~----~~~tg~~a~~~~~~~--------------- 141 (419) ..+|.++-...... +...+........+...+....+.+..... ....++.+-.+..+. T Consensus 67 ~~~f~Q~~~~~r~~--v~~at~~~~~~~t~~~tv~g~~~s~~a~~~~~a~~i~~~~~~~~~~~~~~~~~~~~t~~g~~t~ 144 (426) T protein:vir:31 67 EAIEEMGAEQWRVM--VLEATEVTEEELSDGDTIDKVPILGNHEVESPDGDIEFTTDDDPDVEDFDAEIVINSATGDVAT 144 (426) T ss_pred HHHHhCCceeEEee--ccccceeeeccCCcceeecceeeeecccCcchHHHHHHhhccccccccceeeeEeccccceeec Confidence 78887763221111 000000000000000000000000000000 000000000000000 Q ss_pred ----------------ccccc-cccccchhhhhh--hHHHHHHHH---hhccceeEEEEeccCC---CHHHHHhhhhhcc Q lcl|NC_021557. 142 ----------------FGYFP-KLIIAPGYSPAA--AVRAEMDVV---ASRLHALAIADLPLGL---TKQQAVAARGVAG 196 (419) Q Consensus 142 ----------------~~~~p-~~~~ap~~~~~~--~v~a~l~~~---~~~~~~~~i~d~p~~~---~~~~~~~~~~~~~ 196 (419) +..++ +...+...+... .....++.+ +...+.+.+...-... ....++.+ T Consensus 145 ~~~~~~~~~s~~dw~~~~~~~s~~~~~~ia~~~~~~~~~~~~~~~~~wa~~~~i~~va~~~e~~~~~~~~~~~a~----- 219 (426) T protein:vir:31 145 SEDSIELTYFHADWSQLDEFPSDVNNFAVADRRFDLKGVGVLDETHSWASDEDMGMIANGVNVDDYDSVDEAMDV----- 219 (426) T ss_pred cccceeeeeccCcchhhhcccccchhhhhhccccchhhhhhhHhhhhhhhhcceeeeeeccchhhhcchhhhhhh----- Confidence 00000 000000000000 000011111 1111111111111000 01111111 Q ss_pred ccccCccceE-EecceeEeeccccccceeeechHHHHHHHHHhhhhccCceecccCceeeceeecc---eecccccCCcc Q lcl|NC_021557. 197 TANTSSARTV-LTYPHVVIEDTTGATETRLDPLSSRLAGVIIATDLNEGWQNSPSNREIKGVVDLE---VPINFYPSDYQ 272 (419) Q Consensus 197 ~~~~~s~~~~-~~~p~~~~~~~~~~~~~~~~p~s~~vAg~~a~~D~~~g~~~span~~l~gv~~~~---~~~~~~~~~~~ 272 (419) ++.. -|.|........... .--..+++++.++.++ ||..|.-..+.+..... .+.+....... T Consensus 220 ------~~~~~~y~p~~~~~~~~~~~---~~~~~~~~~~~~aa~~----~~~~~~~~~~~~~~~~~~~~~~~gv~~t~~~ 286 (426) T protein:vir:31 220 ------AHEVAGYVPSGDLMMIVDAS---DDDLAAYQLGKFAVSE----PWYNPLWNELPAGETVSKNVGDPEEQGTFEG 286 (426) T ss_pred ------hhcccccccchhheeehhcc---ccchhhHHhhhhhhhc----cccchhhhhccccccceeeccccccccccch Confidence 1111 122221111000000 0013667888887776 56665422221211111 11111100001 Q ss_pred hhhccccCCceEEEEEecCCcEEEEeccccCCCCCcccceeeehhhHHHHHHHHHHHHHHHhhc---C-CCCHHHHHHHH Q lcl|NC_021557. 273 NDTNFLNEAGIVTAMRSFATGIRVFGNRSAAFPTSSHVENFIHARRILDMIHEAIIFYTMNYVD---R-LGSPMTVEAAE 348 (419) Q Consensus 273 ~~~~~L~~~gI~~i~~~~~~G~~~wG~rT~~~~s~~~~~~~i~vrR~~~~i~~~~~~~~~~~v~---e-~n~~~~~~~i~ 348 (419) .+.-.++ +..|+++.+. ++..+|-.-|..+...+ -.||-++|..+|+++.++..++..+= + |.+..-...|+ T Consensus 287 ~~~A~~~-~~~n~~~~~~-~~~~i~~~~~~~G~~~~--G~~iD~~~g~dwl~~~iq~~l~~ll~~~~KIpyt~~Gi~~I~ 362 (426) T protein:vir:31 287 GDEAEGE-GPVNVLIDVS-DANRVSNAVTTAGADSD--TSFFDIRRTKVYTAEMLELDLESLQVSDDDVPFTEDGQAMIE 362 (426) T ss_pred hhhhhhc-CCceEEEEec-Cceeeecceeecccccc--hhhhhhHHHHHHHHHHHHHHHHHHhhcCCCCccchhHHHHHH Confidence 1212344 5678887764 45566665555543333 34699999999999999999886653 3 77888899999 Q ss_pred HHHHHHHHHHHh-hc--ccceEEEEecccCCHHHhhCCEEE-EEEEEEeccCceEEEEEEEEcc Q lcl|NC_021557. 349 EGVNAYLRSKTG-IA--IYGGTFRFDRQKNTAEQIADGKFY-YRLECHPISVMERITIDSYVDT 408 (419) Q Consensus 349 ~~i~~~L~~l~~-~g--~~~~~v~~d~~~n~~~~i~~G~~~-~~v~~~p~~p~e~i~~~~~~~~ 408 (419) ..|+.-|++.++ +| +-+|.+.......++.|..+-++. +++.......+.++.|+..... T Consensus 363 ~~i~~~L~~~v~~~g~~~~~y~v~~P~~~~~~~dra~R~~~~i~~~~~laGAIh~v~I~g~v~v 426 (426) T protein:vir:31 363 DAIKGTMSGLTGSVGQPLAEYEVDVPEWDDDDVDRVNRNWGGIDLDARLAQRAHTFSLGLNVSV 426 (426) T ss_pred HHHHHHHHHHhcCCCccccceeecCCCccccchhhhhhccCCceEEEEEeCcEEEEEEEEEEeC Confidence 999999998876 34 445777655544455677776666 7788888899999999998887 No 62 >protein:vir:3636 Length: 501 # NCBI annotation: gp04 # Family: family:all:396 # MgeID: mge:75 # MgeName: Bcep781 # Cross-refs: genbank:acc:NP_705631;genbank:gi:23752316;genbank:GeneID:955753 Probab=93.14 E-value=0.0086 Score=31.90 Aligned_cols=373 Identities=11% Similarity=-0.015 Sum_probs=164.1 Q ss_pred CCCccCCCeEEEEcCCCccCccccCccceEEEEccccccccccccccccccCcceeecchHHHHHHhcccchhhhHHHHH Q lcl|NC_021557. 1 MAATFHHGPEVIEHKDGVTVVRDVKSAVTYVNGTAPIQDVHATALAREDYINKRVIIRSRAEGAAAFGVHKAGYTIPAAL 80 (419) Q Consensus 1 Ma~~~~hGVyv~e~~~~~~~i~~v~tav~~~Vgta~~a~~~~~~~~~~~~~n~pv~its~~e~~~~fg~~~~~~~l~~al 80 (419) ||-+=.|=-++..+.-+..+.....-...+++-+... ..+.+.....+|..+....||....++ .+. T Consensus 1 m~~~~ip~s~iV~V~~~v~~~~~~~~~~~~lllt~~~----------~~~~~r~~~y~s~~~V~~~FG~~S~ey---~aA 67 (501) T protein:vir:36 1 MPTTTIPIDQIVQMLPGVIGAGGAPGRLTGLVLTQDT----------SVQPGQLADFFQETDVENWFGALSNEA---KIA 67 (501) T ss_pred CCcCCcccceEEEEeeeeccCCCcceeeeeEEEeccC----------CCCCcceeeecCHHHHHHhcCCChHHH---HHH Confidence 9943233333333333333333333223333332211 123355666788899999999865443 344 Q ss_pred HHHhh----ccCC--cEEEEeeccccccccc-----------------------------cccccccccc---------c Q lcl|NC_021557. 81 DAIFD----QGDG--GTIIVNNVFDPDVHKE-----------------------------GANPDPSKVT---------T 116 (419) Q Consensus 81 ~~~~~----~~~~--~~~v~~~~~~~~~~~~-----------------------------~~~~~~~~~t---------~ 116 (419) ..+|. +... ..++-+.......... ..+.+-+..+ . T Consensus 68 ~~yFs~~~~q~~~P~~l~igR~~~~a~~~~l~g~~l~~~~~a~~~~~sg~l~vti~g~~~~~~i~lS~~ts~~~vA~~i~ 147 (501) T protein:vir:36 68 DAYFPGIVNGGQLPYDLKFARYVAADAPASVYGIPLTGVTLAQLQGYSGTLTVTTAAQHVSANISLAAATSFANAATLIE 147 (501) T ss_pred HHHhhcccCCCccccEEEEEeecCcCcceeEeccchhhhhhhhccceeEEEEEEecceeeeeecccccccCHHHHHHHHh Confidence 44453 2222 1122222111000000 0000000000 0 Q ss_pred ceeccc-----cc-------cccccccchhh-------hhhhhhcccccc---ccccchhhhhhhHHHHHHHHhhc-cce Q lcl|NC_021557. 117 VDINGT-----IS-------PAGLASGFSGA-------YECYNNFGYFPK---LIIAPGYSPAAAVRAEMDVVASR-LHA 173 (419) Q Consensus 117 ~~~~g~-----~~-------~~~~~tg~~a~-------~~~~~~~~~~p~---~~~ap~~~~~~~v~a~l~~~~~~-~~~ 173 (419) ..+... .+ -+...+|.... ...-..+++... .....+.... ....+|.++... .++ T Consensus 148 ~al~~~~~tv~~d~~~~~f~i~s~t~G~~~~i~~~t~~~~ia~~l~Lt~~~~a~v~~~g~~~e-t~~~al~a~~~~s~~W 226 (501) T protein:vir:36 148 AAFTSPDFVVAYDALRNRFTVVTNATGTAAAISAVTGTNNFADEIGLSAAAGATLQAAGVAAD-TPASAMNRAVGLSRNW 226 (501) T ss_pred hhhcCcceEEEEcCcceeEEEEeccCCcceeeEeeecccchhhhhcccccCcceEEecccccc-cHHHHHHHHHhccCce Confidence 000000 00 00000000000 000000111111 0111111111 223445555432 333 Q ss_pred --eEEEEeccCCCHHHHHhhhhhccccccCccceEEecceeE---eecc----------ccccceee------echHHHH Q lcl|NC_021557. 174 --LAIADLPLGLTKQQAVAARGVAGTANTSSARTVLTYPHVV---IEDT----------TGATETRL------DPLSSRL 232 (419) Q Consensus 174 --~~i~d~p~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~---~~~~----------~~~~~~~~------~p~s~~v 232 (419) |.+++-+.+........|-. ..+ +..+|..|-. ..+. +.....+. ..+.+++ T Consensus 227 y~f~~a~~~~~~~~la~A~wie---a~~----~~f~~~~~~~~~~~~~~~~~~~i~~~l~~~~y~~t~~~y~~~~~~aa~ 299 (501) T protein:vir:36 227 ATFTTAWTAVIADRLAFASWNS---GQA----YKYMYVAPDLEAASIVSNNAASFGAQVFAAPYQGTLPLYGDQATAGAV 299 (501) T ss_pred EEEEEecCCChHHHHHHHHHHh---hcC----ceEEEEEecCchhhhhccchhhHHHHHHhcCCCcEEEEcCCCCHHHHH Confidence 45555443322222223322 111 1112221100 0000 00000011 2456667 Q ss_pred HHHHHhhhhcc--CceecccCcee-eceeecceecccccCCcchhhccccCCceEEEEEecC--CcEEEEeccccCCCCC Q lcl|NC_021557. 233 AGVIIATDLNE--GWQNSPSNREI-KGVVDLEVPINFYPSDYQNDTNFLNEAGIVTAMRSFA--TGIRVFGNRSAAFPTS 307 (419) Q Consensus 233 Ag~~a~~D~~~--g~~~span~~l-~gv~~~~~~~~~~~~~~~~~~~~L~~~gI~~i~~~~~--~G~~~wG~rT~~~~s~ 307 (419) .|..+.+|.++ | -..-.+|.+ .|+.. + .....+++.|..+|+|++..+-+ ..+.+|-.-++++ T Consensus 300 ~g~~as~nf~~~~g-~~T~~fkq~~~Gi~a---~-----~l~~t~a~al~~~~~N~y~~~~~~~~~~~~~~~G~~sG--- 367 (501) T protein:vir:36 300 MGYAASINFQLRNG-RTVLAFRQFNAGVPA---T-----VHDLPTANALRSNNYTYIGAYANAANNYTIAYDGKLSG--- 367 (501) T ss_pred HHHHHhcCcccCcc-eeeeeccccCCCcCc---C-----cCCHHHHHHHHhcCCcEEEEEecccceeeEEEcCeeec--- Confidence 77778777544 2 111123333 23322 1 12467889999999998877654 3477776556643 Q ss_pred cccceeeehhhHHHHHHHHHHHHHHHhhcC----CCCHHHHHHHHHHHHHHHHHHHhhccc------------------- Q lcl|NC_021557. 308 SHVENFIHARRILDMIHEAIIFYTMNYVDR----LGSPMTVEAAEEGVNAYLRSKTGIAIY------------------- 364 (419) Q Consensus 308 ~~~~~~i~vrR~~~~i~~~~~~~~~~~v~e----~n~~~~~~~i~~~i~~~L~~l~~~g~~------------------- 364 (419) + +.+|.+.+-.+|++..++..+....-. |.|..-...|+..++.-|++-.++|++ T Consensus 368 ~--~~wiD~~~g~dWL~~~iq~~l~~ll~~~~KIPytd~G~~~l~a~i~~~l~~av~nG~I~~Gv~~~~~q~~~I~~~~g 445 (501) T protein:vir:36 368 K--FLWVDTYLDQIYLNAELQRAEFEAMLAYNSLPYNEDGYTGLYRAGVDVIDAAVTSGIIRAGVTLTNSQLQQIDKAAR 445 (501) T ss_pred c--chhhhHHHhHHHHHHHHHHHHHHHHhcCCCCccChhhHHHHHHHHHHHHHHHHhCceeecCCCCCcccceeeccccc Confidence 3 445888888899999888888765533 567777788888888888887766643 Q ss_pred -----------ceEEEEecccCC-HHHhhCCEEEEEEEEEeccCceEEEEEEEEcc Q lcl|NC_021557. 365 -----------GGTFRFDRQKNT-AEQIADGKFYYRLECHPISVMERITIDSYVDT 408 (419) Q Consensus 365 -----------~~~v~~d~~~n~-~~~i~~G~~~~~v~~~p~~p~e~i~~~~~~~~ 408 (419) ||.++.+....+ ++.-.++...+.+.+.--..+++|++-..--. T Consensus 446 ~~~~~~~v~~~Gyy~~~~~~~~~~~~R~~R~~p~~~~~y~~~gaIh~v~i~s~~v~ 501 (501) T protein:vir:36 446 VAGAGQLVQTRGWYFLIGDPANPGQARQNRTTPACTLWYSDGGSIQSLTIGSNAVI 501 (501) T ss_pred ccccccceeccceEEeeCcccCChhhhhhcccCcEEEEEEeCCceeEEEeeeeeeC Confidence 233444433233 33444455666666667777777776444332 No 63 >protein:vir:101576 Length: 501 # NCBI annotation: gp04 # Family: family:all:396 # MgeID: mge:1477 # MgeName: Bcep43 # Cross-refs: genbank:acc:NP_958108;genbank:gi:41057654;genbank:GeneID:2716834 Probab=93.09 E-value=0.0088 Score=31.85 Aligned_cols=373 Identities=10% Similarity=-0.018 Sum_probs=161.3 Q ss_pred CCCccCCCeEEEEcCCCccCccccCccceE-EEEccccccccccccccccccCcceeecchHHHHHHhcccchhhhHHHH Q lcl|NC_021557. 1 MAATFHHGPEVIEHKDGVTVVRDVKSAVTY-VNGTAPIQDVHATALAREDYINKRVIIRSRAEGAAAFGVHKAGYTIPAA 79 (419) Q Consensus 1 Ma~~~~hGVyv~e~~~~~~~i~~v~tav~~-~Vgta~~a~~~~~~~~~~~~~n~pv~its~~e~~~~fg~~~~~~~l~~a 79 (419) ||-+=.|=-++..+.-+..+.........+ ++++.. ..+.+.....+|..+....||....++. + T Consensus 1 m~~~~ip~s~iV~V~~~v~~~~~~~~~~~~l~l~~~~-----------~~~~~~~~~~~s~~~V~~~FG~~S~ey~---a 66 (501) T protein:vir:10 1 MPTTTIPIDQIVQMLPGVIGAGGAPGRLTGLVLTQDT-----------SVQPGQLADFFQETDVENWFGALSNEAK---I 66 (501) T ss_pred CCCCCcccceEEEEeeecccCCCccccceeEEEeccC-----------CCCccceEEecCHHHHHHhcCCChHHHH---H Confidence 993212333333333333332222222222 233211 1234666778999999999998754443 4 Q ss_pred HHHHhh----ccCCc--EEEEeecccccccccccc-c-------------------cccccc-cceeccccccccccccc Q lcl|NC_021557. 80 LDAIFD----QGDGG--TIIVNNVFDPDVHKEGAN-P-------------------DPSKVT-TVDINGTISPAGLASGF 132 (419) Q Consensus 80 l~~~~~----~~~~~--~~v~~~~~~~~~~~~~~~-~-------------------~~~~~t-~~~~~g~~~~~~~~tg~ 132 (419) ...+|. +.... .++-+............. . +..... ..++.....-.+..+.+ T Consensus 67 A~~yFsg~~~q~p~P~~l~igR~~~~~~~~~l~g~~l~~~~la~~~~~sg~l~vti~g~~~~~~i~ls~ats~~~vAs~i 146 (501) T protein:vir:10 67 ADAYFPGIVNGGQLPYDLKFARYVAADAPASVYGIPLTGVTLAQLQGYSGTLTVTTAAQHVSANISLAAATSFANAATLI 146 (501) T ss_pred HHHHhhhhcCCCccccEEEEEeecCCCccceEeccchhhhhhhhcceeeeEEEEeeccceeecccccccccCHHHHHHHH Confidence 444443 22221 222222211110000000 0 000000 00000000000000000 Q ss_pred hhhhh------------------------------------hhhhccccc---cccccchhhhhhhHHHHHHHHhh---c Q lcl|NC_021557. 133 SGAYE------------------------------------CYNNFGYFP---KLIIAPGYSPAAAVRAEMDVVAS---R 170 (419) Q Consensus 133 ~a~~~------------------------------------~~~~~~~~p---~~~~ap~~~~~~~v~a~l~~~~~---~ 170 (419) ..... .-..+++.- ......+... .....+|..+.. . T Consensus 147 ~~al~~~~~tv~~d~~~~~f~its~ttG~~~~i~~~~~~~~la~~l~Lt~~~~a~v~~~g~~a-et~~~a~~a~~~~~~~ 225 (501) T protein:vir:10 147 EAAFTSPDFVVAYDALRNRFTVVTNATGTAAAISAVTGTNNLADELGLSAAAGATLQAAGVAA-DTPASAMNRAVGLSRN 225 (501) T ss_pred hhhccCCceEEEEcccCceEEEEeeccCCceeEEEeeCchhhhhhcCccccccceEEecCccc-ccHHHHHHHHHhccCc Confidence 00000 000011111 0011111111 122344555443 3 Q ss_pred cceeEEEEeccCCCHHHHHhhhhhccccccCccceEEecceeE-----eec--------cccccceeee------chHHH Q lcl|NC_021557. 171 LHALAIADLPLGLTKQQAVAARGVAGTANTSSARTVLTYPHVV-----IED--------TTGATETRLD------PLSSR 231 (419) Q Consensus 171 ~~~~~i~d~p~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~-----~~~--------~~~~~~~~~~------p~s~~ 231 (419) +-.|.+++-+.+.+......|-.. .+ +-.+|+.|-. ... -......+.+ .+.+. T Consensus 226 Wy~f~~a~~~~~~~~la~A~wiea---~~----~~f~~~~~~~~~~~~~~~~~~~i~~~l~~~~y~~t~~~y~~~~~~aa 298 (501) T protein:vir:10 226 WATFTTAWTAVIADRLAFAAWNSG---QA----YKYMYVAPDLEAASIVTNNAASFGAQVFAAPYQGTLPLYGDQATAGA 298 (501) T ss_pred eEEEEEecCCChHHHHHHHHHHHh---cC----ceEEEEEecCchhhhhhhhhhhHHHHHHhcCCCceEEECCCCcHHHH Confidence 334555665433333333333221 11 1111221100 000 0001111222 35667 Q ss_pred HHHHHHhhhhccCc-eecccCceee-ceeecceecccccCCcchhhccccCCceEEEEEecCCc--EEEEeccccCCCCC Q lcl|NC_021557. 232 LAGVIIATDLNEGW-QNSPSNREIK-GVVDLEVPINFYPSDYQNDTNFLNEAGIVTAMRSFATG--IRVFGNRSAAFPTS 307 (419) Q Consensus 232 vAg~~a~~D~~~g~-~~span~~l~-gv~~~~~~~~~~~~~~~~~~~~L~~~gI~~i~~~~~~G--~~~wG~rT~~~~s~ 307 (419) +.|..+.+|.++-. -.+-..|.+. |+.. + .....+++.|..+|+|+...+.+.| +.+|-.-++++ T Consensus 299 ~~g~~as~nf~~~~g~~T~~fkq~~~Gi~a---~-----~lt~t~a~al~~~~~N~y~~~~~~~~~~~~~~~G~~sG--- 367 (501) T protein:vir:10 299 VMGYAASINFQLRNGRTVLAFRQFNAGVPA---T-----AHDLPTANALRSNNYTYIGAYANAANNYTIAYDGKLSG--- 367 (501) T ss_pred HHHHHHhhCcccCccceeeeccccCCCcCc---c-----cCCHHHHHHHHhcCCeEEEEeccccceeeEEecCeeec--- Confidence 77888888754421 1122233332 2321 1 1246788999999999998876544 77885555543 Q ss_pred cccceeeehhhHHHHHHHHHHHHHHHhhc---C-CCCHHHHHHHHHHHHHHHHHHHhhcccc------------------ Q lcl|NC_021557. 308 SHVENFIHARRILDMIHEAIIFYTMNYVD---R-LGSPMTVEAAEEGVNAYLRSKTGIAIYG------------------ 365 (419) Q Consensus 308 ~~~~~~i~vrR~~~~i~~~~~~~~~~~v~---e-~n~~~~~~~i~~~i~~~L~~l~~~g~~~------------------ 365 (419) + |.+|.+-+=.+|++..++..+....- + |.+..-...|+..++.-|++-.++|+++ T Consensus 368 ~--~~wiD~~~~~~Wl~~~iq~~l~~ll~~~~kIPyt~~G~~~l~a~v~~~l~~av~nG~I~~Gv~l~~~q~~~i~~~~g 445 (501) T protein:vir:10 368 K--FLWVDTYLDQIYLNAELQRAEFEAMLAYNSLPYNEDGYTALYRAGVDVIDAAVTSGIIRAGVTLTNSQLQQIDAAAG 445 (501) T ss_pred c--ceeehhhhhHHHHHHHHHHHHHHHHHhcCCcccCHHHHHHHHHHHHHHHHHHHhCceeecCCCCCcccceeeccccC Confidence 2 44466766667777777666654332 2 6678888888888888888877766442 Q ss_pred ------------eEEEEecc-cCCHHHhhCCEEEEEEEEEeccCceEEEEEEEEcc Q lcl|NC_021557. 366 ------------GTFRFDRQ-KNTAEQIADGKFYYRLECHPISVMERITIDSYVDT 408 (419) Q Consensus 366 ------------~~v~~d~~-~n~~~~i~~G~~~~~v~~~p~~p~e~i~~~~~~~~ 408 (419) |.++.+.. ..+++.-.++...+.+.+.--..+++|++-..--. T Consensus 446 ~~~~~~~v~~~Gyy~~~~~~~~~~~~R~~R~~p~~~~~y~~~gaIh~v~i~s~~v~ 501 (501) T protein:vir:10 446 VAGAGQLVQTRGWYFLIGDPANPGQARQNRTTPACTLWYSDGGSIQQLTIGSNAVI 501 (501) T ss_pred ccccccceeccceeEeeccccCChhhhhhccccceEEEEEeCCceeEEEeeeeecC Confidence 23333333 22333444455666666666677777766444322 No 64 >protein:vir:106730 Length: 501 # NCBI annotation: gp04 # Family: family:all:396 # MgeID: mge:1599 # MgeName: Bcep1 # Cross-refs: genbank:acc:NP_944312;genbank:gi:38638611;genbank:GeneID:2657359 Probab=89.53 E-value=0.026 Score=29.29 Aligned_cols=375 Identities=10% Similarity=-0.006 Sum_probs=163.9 Q ss_pred CCCccCCCeEEEEcCCCccCccccCccceEE-EEccccccccccccccccccCcceeecchHHHHHHhcccchhhhHHHH Q lcl|NC_021557. 1 MAATFHHGPEVIEHKDGVTVVRDVKSAVTYV-NGTAPIQDVHATALAREDYINKRVIIRSRAEGAAAFGVHKAGYTIPAA 79 (419) Q Consensus 1 Ma~~~~hGVyv~e~~~~~~~i~~v~tav~~~-Vgta~~a~~~~~~~~~~~~~n~pv~its~~e~~~~fg~~~~~~~l~~a 79 (419) ||-+=.|=-++..+.-+..+.....-...++ +++... .+.+.....+|..+....||....++ .+ T Consensus 1 m~~~~ip~s~iV~V~~~v~~~~~~~~~f~~lll~~~~~-----------~~~~r~~~y~s~~~V~~~FG~~S~ey---~a 66 (501) T protein:vir:10 1 MPTTTIPIDQIVQMLPGVIGAGGAPGRLTGLVLTQDTS-----------VQPGQLADFFQKTDVENWFGALSNEA---KI 66 (501) T ss_pred CCcCccccceEEEEeeecccCCCcccccceEEEecccC-----------CCccceeeecCHHHHHHhcCCChHHH---HH Confidence 9943233333333333332222222222222 222111 23356667789899999999875444 34 Q ss_pred HHHHhh----ccCC--cEEEEeecccccccc------cccc------c--------ccccc-ccceeccccccccccccc Q lcl|NC_021557. 80 LDAIFD----QGDG--GTIIVNNVFDPDVHK------EGAN------P--------DPSKV-TTVDINGTISPAGLASGF 132 (419) Q Consensus 80 l~~~~~----~~~~--~~~v~~~~~~~~~~~------~~~~------~--------~~~~~-t~~~~~g~~~~~~~~tg~ 132 (419) ...+|. +... ..++-+......... .... . +.... ...++.....-.+..+.+ T Consensus 67 A~~yFsg~~~q~p~P~~l~igR~~~~~~~~~l~g~~l~~~~la~~~~~~g~l~i~i~g~~~~~~i~~s~ats~~~vA~~i 146 (501) T protein:vir:10 67 ADAYFPGIVNGGQLPYDLKFARYVAADAPASVYGIPLTGITLAQLQGYSGTLTVTTAAQHVSANISLAAATSFANAATLI 146 (501) T ss_pred HHHHhhhhcCCCccccEEEEEeecccCccceeeeceehhhhhhhhhheeeEEEEeeccceeeeccccccccCHHHHHHHH Confidence 444443 2222 122222221111000 0000 0 00000 000000000000000000 Q ss_pred hhhhh------------------------------------hhhhcccc---ccccccchhhhhhhHHHHHHHHhh---c Q lcl|NC_021557. 133 SGAYE------------------------------------CYNNFGYF---PKLIIAPGYSPAAAVRAEMDVVAS---R 170 (419) Q Consensus 133 ~a~~~------------------------------------~~~~~~~~---p~~~~ap~~~~~~~v~a~l~~~~~---~ 170 (419) ..... .-..+++. +..+...+.... ....+|.++.. . T Consensus 147 ~~al~~~~~tv~~d~~~~~f~i~~~t~G~~~~i~~~t~~~d~a~~l~Lt~~~~a~v~~~g~~ae-t~~~Al~a~~~~~~~ 225 (501) T protein:vir:10 147 EAAFTSPDFVVAYDALRNRFTVVTNTTGTAAAISAVTGTNNLADELGLSAAAGATLQAAGVAAD-TPASAMNRAVGLSRN 225 (501) T ss_pred HHhhcCCceEEEEecccceEEEEecccCcceeEEEeeccccchhhhcccccCceeEEecCcccc-cHHHHHHHHHhcccc Confidence 00000 00011111 011111111111 12344555543 3 Q ss_pred cceeEEEEeccCCCHHHHHhhhhhccccccCccceEEecc---eeEeec--------cccccceee------echHHHHH Q lcl|NC_021557. 171 LHALAIADLPLGLTKQQAVAARGVAGTANTSSARTVLTYP---HVVIED--------TTGATETRL------DPLSSRLA 233 (419) Q Consensus 171 ~~~~~i~d~p~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p---~~~~~~--------~~~~~~~~~------~p~s~~vA 233 (419) +-.|.+++-+.+.+...+..|-.. .+ .++....+- -..... -..+...+. .+|.+++. T Consensus 226 Wy~f~~a~~~~~~~~la~A~wi~a---~~--~~f~~~~~~~~~~~~~~~~~~~i~~~l~~~~y~~t~~~y~~~~~~aa~~ 300 (501) T protein:vir:10 226 WATFTTAWTAVIADRLAFAAWNSG---QA--YKYMYVAPDLEAASIVTNNAASFGAQVFAAPYQGTLPLYGDQATAGAVM 300 (501) T ss_pred eEEEEEEecCChHHHHHHHHHHHh---cC--ceEEEEEecCcceeeecccchhHHHHHHhcCCCceEEECCCCCHHHHHH Confidence 334555654433333333333221 11 111111110 000000 000011111 25677778 Q ss_pred HHHHhhhhccCc-eecccCcee-eceeecceecccccCCcchhhccccCCceEEEEEecCCc--EEEEeccccCCCCCcc Q lcl|NC_021557. 234 GVIIATDLNEGW-QNSPSNREI-KGVVDLEVPINFYPSDYQNDTNFLNEAGIVTAMRSFATG--IRVFGNRSAAFPTSSH 309 (419) Q Consensus 234 g~~a~~D~~~g~-~~span~~l-~gv~~~~~~~~~~~~~~~~~~~~L~~~gI~~i~~~~~~G--~~~wG~rT~~~~s~~~ 309 (419) |..+.+|.++-+ -.+-..|.+ .|+.. + .....+++.|..+|+|++..+-+.| +.+|-.-++++ + T Consensus 301 g~~as~nf~~~~g~~T~~fkql~~Gv~a---~-----~l~~t~a~al~~~~~N~y~~~~~~~~~~~~~~~G~~sG---~- 368 (501) T protein:vir:10 301 GYAASINFQLRNGRTVLAFRQFNAGVPA---T-----AHDLPTANALRSNNYTYIGAYANAANNYTIAYDGKLSG---K- 368 (501) T ss_pred HHHHhcCcccCcceeeeeecccCCCcCc---c-----cCCHHHHHHHHhcCCeEEEEEecccceeeEEEcceeec---c- Confidence 888888754421 111223333 23321 1 1246788999999999988876544 78885555543 2 Q ss_pred cceeeehhhHHHHHHHHHHHHHHHhhcC----CCCHHHHHHHHHHHHHHHHHHHhhcccc-------------------- Q lcl|NC_021557. 310 VENFIHARRILDMIHEAIIFYTMNYVDR----LGSPMTVEAAEEGVNAYLRSKTGIAIYG-------------------- 365 (419) Q Consensus 310 ~~~~i~vrR~~~~i~~~~~~~~~~~v~e----~n~~~~~~~i~~~i~~~L~~l~~~g~~~-------------------- 365 (419) +.+|.+.+-.+|++..++..+....-. |-|..-...|...++.-|++-.++|+++ T Consensus 369 -~~wiD~~~g~dWl~~~iq~~l~~ll~~~~kIPyt~~G~~~l~a~i~~~l~~av~nG~Ia~Gv~l~~~q~~~i~~~~g~~ 447 (501) T protein:vir:10 369 -FLWVDTYLDQIYLNAELQRAEFEAMLAYNSLPYNEDGYTANYRAGVDVIDAAVTSGIIRAGVTLTNSQLQQIDAVAGVA 447 (501) T ss_pred -ceehhhHhhHHHHHHHHHHHHHHHHhcCCCcccCHHHHHHHHHHHHHHHHHHHhCcceecCcccCcccceeeccccccc Confidence 445788888888888888887765432 5567777888888888888877766432 Q ss_pred ----------eEEEEecc-cCCHHHhhCCEEEEEEEEEeccCceEEEEEEEEcc Q lcl|NC_021557. 366 ----------GTFRFDRQ-KNTAEQIADGKFYYRLECHPISVMERITIDSYVDT 408 (419) Q Consensus 366 ----------~~v~~d~~-~n~~~~i~~G~~~~~v~~~p~~p~e~i~~~~~~~~ 408 (419) |.++.+.. +.+++.-.++...+.+.+.--..+++|++-..--. T Consensus 448 ~~~~~v~~~Gyy~~~~~~~~~~~~R~~R~~p~~~~~y~~~gaIh~v~i~s~~v~ 501 (501) T protein:vir:10 448 GAGALVQTRGWYFLIGNPANPGQARQNRTSPACTLWYSDGGSIQELTIGSNAVI 501 (501) T ss_pred ccccceeccceEEeeCcccCChhhhhhcccCceEEEEEeCCceeEEEeeeeecC Confidence 23334332 23333444455666666666677777766444332 No 65 >protein:vir:78611 Length: 501 # NCBI annotation: BcepNY3gp03 # Family: family:all:396 # MgeID: mge:1854 # MgeName: BcepNY3 # Cross-refs: genbank:acc:YP_001294840;genbank:gi:149882903;genbank:GeneID:5291082 Probab=87.52 E-value=0.038 Score=28.36 Aligned_cols=376 Identities=10% Similarity=-0.015 Sum_probs=159.5 Q ss_pred CCCccCCCeEEEEcCCCccCccccCccceEEEEccccccccccccccccccCcceeecchHHHHHHhcccchhhhHHHHH Q lcl|NC_021557. 1 MAATFHHGPEVIEHKDGVTVVRDVKSAVTYVNGTAPIQDVHATALAREDYINKRVIIRSRAEGAAAFGVHKAGYTIPAAL 80 (419) Q Consensus 1 Ma~~~~hGVyv~e~~~~~~~i~~v~tav~~~Vgta~~a~~~~~~~~~~~~~n~pv~its~~e~~~~fg~~~~~~~l~~al 80 (419) ||-+=.|=-++..+.-+..+.........+++-+.. ...+.+.....+|..+....||....++ .+. T Consensus 1 m~~~~ip~s~iV~V~~~v~~~~~~~~~~~~lll~~~----------~~~~~~r~~~y~s~~~V~~~FG~~S~ey---~aA 67 (501) T protein:vir:78 1 MPTTTIPIDQIVQMLPGVIGAGGAPGRLTGLVLTQD----------TSIQPGQLADFFQKTDVENWFGGLSNEA---VIA 67 (501) T ss_pred CCcCccccceEEEEeeecccCCCcceeeeeEEEecC----------CCCCccceeeecCHHHHHHhcCCChHHH---HHH Confidence 994323333333333333333322222223322211 1113355666788899999999865443 344 Q ss_pred HHHhh----ccCCc--EEEEeeccccccccccc------------cc--------ccccc-ccceeccccccccccccch Q lcl|NC_021557. 81 DAIFD----QGDGG--TIIVNNVFDPDVHKEGA------------NP--------DPSKV-TTVDINGTISPAGLASGFS 133 (419) Q Consensus 81 ~~~~~----~~~~~--~~v~~~~~~~~~~~~~~------------~~--------~~~~~-t~~~~~g~~~~~~~~tg~~ 133 (419) ..+|. +.... .++-+............ .. +.... +..++.....-.+..+.+. T Consensus 68 ~~yFs~~~~q~~~P~~l~igR~~~~a~~~~l~g~~l~~~~la~~~~~~G~l~iti~g~~~~~~i~~S~~ts~~~vA~~i~ 147 (501) T protein:vir:78 68 DAYFPGIVNGGQLPYDLKFARYVAADAPASVYGIPLTGVTLTQLQGYSGTLTVTTAAQHVSSNISLAAATSFANAATLIE 147 (501) T ss_pred HHHhhcCCCCCcccceEEEEeecccCcceeEeccceeccchhhhceeeeEEEEEeccceeeeccccccccCHHHHHHHHH Confidence 44554 22221 12222211100000000 00 00000 0000000000000000000 Q ss_pred hhhhh------------------------------------hhhcccc---ccccccchhhhhhhHHHHHHHHhh---cc Q lcl|NC_021557. 134 GAYEC------------------------------------YNNFGYF---PKLIIAPGYSPAAAVRAEMDVVAS---RL 171 (419) Q Consensus 134 a~~~~------------------------------------~~~~~~~---p~~~~ap~~~~~~~v~a~l~~~~~---~~ 171 (419) ..... -..+++. +..+...+... .....+|.++.. .+ T Consensus 148 ~al~a~~~tv~~ds~~~~f~its~t~G~~~~i~~~t~~~~~a~~l~Lt~~~~a~v~~~g~~a-et~~~a~~a~~~~~~~W 226 (501) T protein:vir:78 148 AAFTSPDFVVSYDALRNRFVVNTNATGTAAAISAVTGTNNLADELGLSAAAGASLQAAGVAA-DTPASAMNRAVGLSRNW 226 (501) T ss_pred hhhcCcceEEEEccccceEEEEeeecCCceeEEEEecccchhhhhcccccCceeeEeccccc-cCHHHHHHHHHhccCce Confidence 00000 0001111 01111111111 122344555543 33 Q ss_pred ceeEEEEeccCCCHHHHHhhhhhccccccCccceEEec---ceeEeec--------cccccceeee------chHHHHHH Q lcl|NC_021557. 172 HALAIADLPLGLTKQQAVAARGVAGTANTSSARTVLTY---PHVVIED--------TTGATETRLD------PLSSRLAG 234 (419) Q Consensus 172 ~~~~i~d~p~~~~~~~~~~~~~~~~~~~~~s~~~~~~~---p~~~~~~--------~~~~~~~~~~------p~s~~vAg 234 (419) -.|.+++-+.+.+......|-.. .+ .+|....+ +...... -......+.+ .+.+.+.| T Consensus 227 y~f~~a~~~~~~~~lalA~wiea---~~--~~f~~~~~~~~~~~~~~~~~~~i~~~l~a~~y~~t~~~y~~~~~~aa~~g 301 (501) T protein:vir:78 227 ATFTTAWTAVIADRLALASWNSG---QA--YKYMYVAPDLEPASIVTNNSASFGAQVFAAPYQGTLPLYGDQATAGAVMG 301 (501) T ss_pred EEEEEecCCCHHHHHHHHHHHHh---cC--ceEEEEEecCCcceeecccchhHHHHHhhcCCCceEEEcCCcchHHHHHH Confidence 33555665433322233333221 11 11111111 0000000 0001111222 24566677 Q ss_pred HHHhhhhccCc-eecccCcee-eceeecceecccccCCcchhhccccCCceEEEEEecCCc--EEEEeccccCCCCCccc Q lcl|NC_021557. 235 VIIATDLNEGW-QNSPSNREI-KGVVDLEVPINFYPSDYQNDTNFLNEAGIVTAMRSFATG--IRVFGNRSAAFPTSSHV 310 (419) Q Consensus 235 ~~a~~D~~~g~-~~span~~l-~gv~~~~~~~~~~~~~~~~~~~~L~~~gI~~i~~~~~~G--~~~wG~rT~~~~s~~~~ 310 (419) ..+.+|.++-+ -.+-..|.+ .|+.. + .....+++.|..+|+|++..+-+.| +.+|-.-++++ + T Consensus 302 ~~as~nf~~~~g~~T~~fkq~~~Gv~a---~-----~l~~t~a~al~~~~~N~y~~~~~~~~~~~~~~~G~~sG---~-- 368 (501) T protein:vir:78 302 YAASINFQLRNGRTVLAFRQFNAGVPA---T-----AHDLGTANALRSNNYTYIGAYANAANNYTIAYDGKLSG---K-- 368 (501) T ss_pred HHHhcCcccCcceeeeeccccCCCcCc---c-----cCCHHHHHHHHhcCCeEEEEEecccceeeEEEcCeeec---c-- Confidence 77777754321 111123332 23321 1 1246788999999999988776544 78885555543 2 Q ss_pred ceeeehhhHHHHHHHHHHHHHHHhhc---C-CCCHHHHHHHHHHHHHHHHHHHhhcccc--------------------- Q lcl|NC_021557. 311 ENFIHARRILDMIHEAIIFYTMNYVD---R-LGSPMTVEAAEEGVNAYLRSKTGIAIYG--------------------- 365 (419) Q Consensus 311 ~~~i~vrR~~~~i~~~~~~~~~~~v~---e-~n~~~~~~~i~~~i~~~L~~l~~~g~~~--------------------- 365 (419) |.+|.+-+=.+|++..++..+....- + |.+..-...|+..++.-|++-.++|+++ T Consensus 369 ~~wiD~~~~~~Wl~~~iq~~l~~ll~~~~kIPyt~~G~~~l~a~v~~~l~~av~nG~I~~Gv~l~~~q~~~I~~~~g~~~ 448 (501) T protein:vir:78 369 FLWVDTYLDQIYLNAELQRAEFEAMLAYNSLPYNEDGYTALYRAGVDVIDAAVTSGIIRAGVTLTNSQLQQIDAAAGVAG 448 (501) T ss_pred ceeehhhhhHHHHHHHHHHHHHHHHHhCCCcccCHHHHHHHHHHHHHHHHHHHhCceeecCCCCCCccceeeccccCccc Confidence 44466666667777777776654432 2 6677778888888888888877766442 Q ss_pred ---------eEEEEecc-cCCHHHhhCCEEEEEEEEEeccCceEEEEEEEEcc Q lcl|NC_021557. 366 ---------GTFRFDRQ-KNTAEQIADGKFYYRLECHPISVMERITIDSYVDT 408 (419) Q Consensus 366 ---------~~v~~d~~-~n~~~~i~~G~~~~~v~~~p~~p~e~i~~~~~~~~ 408 (419) |.++.+.. +.+++.-.++...+.+.+.--..+++|++-..--. T Consensus 449 ~~~~~~~~Gyy~~~~~~~~~~~~R~~R~~p~~~~~y~~~gaIh~v~i~s~~v~ 501 (501) T protein:vir:78 449 AGQLVQMRGWYFLIGDPANPGQARQNRTTPTCTLWYSDGGSIQELTIGSNAVI 501 (501) T ss_pred cccceeccceEEeeccccCChhhhhhcccCcEEEEEEeCCceeEEEeeeeecC Confidence 23334332 23333444455666666666677777766444322 No 66 >protein:vir:99586 Length: 507 # NCBI annotation: hypothetical protein # Family: family:all:396 # MgeID: mge:1544 # MgeName: BcepF1 # Cross-refs: genbank:acc:YP_001039795;genbank:gi:126011045;genbank:GeneID:4818281 Probab=85.05 E-value=0.056 Score=27.46 Aligned_cols=373 Identities=11% Similarity=0.004 Sum_probs=161.1 Q ss_pred CCCccCCCeEEEEcCCCccCcc--ccCccceEEEEccccccccccccccccccCcceeecchHHHHHHhcccchhhhHHH Q lcl|NC_021557. 1 MAATFHHGPEVIEHKDGVTVVR--DVKSAVTYVNGTAPIQDVHATALAREDYINKRVIIRSRAEGAAAFGVHKAGYTIPA 78 (419) Q Consensus 1 Ma~~~~hGVyv~e~~~~~~~i~--~v~tav~~~Vgta~~a~~~~~~~~~~~~~n~pv~its~~e~~~~fg~~~~~~~l~~ 78 (419) |-.- =++..+.-+..+.. ........|++.. ...+..+....+|..+....||....+ +. T Consensus 1 mip~----s~iVnV~~~v~~~a~~~~~~~~~lilt~~-----------~~~~~~r~~~y~s~~~V~~~FG~~S~e---y~ 62 (507) T protein:vir:99 1 MISQ----SRYVRIVSGVGAGAPVAQRRLIMRVMTTN-----------AVLPPGVVFESSSADAVGAYFGMASEE---YK 62 (507) T ss_pred CCCc----cceeEEeeeccccCcccccccceeeeccc-----------cCCCccceEeecCHHHHHHhcCCChHH---HH Confidence 6531 11222222222222 2222334444321 111234556678888889999986444 34 Q ss_pred HHHHHhhccCC------cEEEEeecccccccccccc--------------------ccccccccc--eec---------- Q lcl|NC_021557. 79 ALDAIFDQGDG------GTIIVNNVFDPDVHKEGAN--------------------PDPSKVTTV--DIN---------- 120 (419) Q Consensus 79 al~~~~~~~~~------~~~v~~~~~~~~~~~~~~~--------------------~~~~~~t~~--~~~---------- 120 (419) +...+|.+... ..++-+............. .+....+.. ++. T Consensus 63 aA~~yFsq~p~~~~~P~~L~igR~~~~~~~a~l~g~~~~~~l~~~~~~~~G~lti~v~G~~~t~~~i~lS~~ts~~~vAs 142 (507) T protein:vir:99 63 RAKAYMSFISKSINSPSYISFARWVNAAIASMIVGDSLVKNLPALKAVATPTLSLSIGGTVVPIAGIDLTAALTLTDVAA 142 (507) T ss_pred HHHHHhccCCCCCcccceEEEEeecCccccceeecchhhhhHHHHhhhcceeEEEEEcCceeEeccccccccCCHHHHHH Confidence 55555655441 2222222211110000000 000000000 000 Q ss_pred ------c-----------------------cccccc------ccccchhhhhhhhhcccc-ccccccchhhhhhhHHHHH Q lcl|NC_021557. 121 ------G-----------------------TISPAG------LASGFSGAYECYNNFGYF-PKLIIAPGYSPAAAVRAEM 164 (419) Q Consensus 121 ------g-----------------------~~~~~~------~~tg~~a~~~~~~~~~~~-p~~~~ap~~~~~~~v~a~l 164 (419) + ....++ ..+.......+...+++. .......+.... ....+| T Consensus 143 ~i~~~l~a~~~~~~~~~tv~~d~~~~~F~v~s~~tG~~s~i~~at~~~~gt~~s~l~~~~~~~a~~~~g~~ae-t~~~a~ 221 (507) T protein:vir:99 143 TLQTKIRASANAELATATVTFNTTTNQFVLNGTTTGALAPTITAVRTDPATDISSLLGWTNTGTVFVKGQAAE-TPDTSI 221 (507) T ss_pred HHHHhhhccccccccceEEEEecCCceEEEEeeeccccceeEEEEcCCchhhHHHHhccccccceEeeccccc-CHHHHH Confidence 0 000000 000000000000000000 011111122222 233455 Q ss_pred HHHhh-cccee--EEEEeccCCCHHHHHhhhhhccccccCccceEEecceeEeec--c--------------ccccceee Q lcl|NC_021557. 165 DVVAS-RLHAL--AIADLPLGLTKQQAVAARGVAGTANTSSARTVLTYPHVVIED--T--------------TGATETRL 225 (419) Q Consensus 165 ~~~~~-~~~~~--~i~d~p~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~~--~--------------~~~~~~~~ 225 (419) ..+.. ..+++ ...+.| ..+.++..+.-.+....+ ..+ +|..|..... . ........ T Consensus 222 ~a~~~~~~nW~~~~~a~~~-~~td~~~lalA~wiea~~--~~f--~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~ 296 (507) T protein:vir:99 222 SKSAAISTNFGSFIYTSTP-ALTNDQITAVASWNASQN--NMY--MYSVPTTIANIGTLYAAVKGFSGCALNITSDSLPV 296 (507) T ss_pred HHHHhhcCCeEEEEEEecc-ccChHHHHHHHHHHhhcC--cEE--EEEEecCchhhhhhhhhhhhcceeEEEeecccccc Confidence 55543 33443 445554 233333332221111111 111 1111110000 0 00001111 Q ss_pred echHHHHHHHHHhhhhccC-ceecccCceeeceeecceecccccCCcchhhccccCCceEEEEEecCCc--EEEEecccc Q lcl|NC_021557. 226 DPLSSRLAGVIIATDLNEG-WQNSPSNREIKGVVDLEVPINFYPSDYQNDTNFLNEAGIVTAMRSFATG--IRVFGNRSA 302 (419) Q Consensus 226 ~p~s~~vAg~~a~~D~~~g-~~~span~~l~gv~~~~~~~~~~~~~~~~~~~~L~~~gI~~i~~~~~~G--~~~wG~rT~ 302 (419) ..+.+.+.|.++.+|.++- =-.+-..|.+.||..-. ..+.|++.|.++|+|+...+.+.| +.+|-.-.+ T Consensus 297 ~~~~aa~~g~~as~nf~~~ng~~T~~fk~l~GV~a~~--------lt~t~a~al~~~n~N~y~~~a~~~~~~~~~~~G~~ 368 (507) T protein:vir:99 297 DYIEQSPCEILAATDYTRVNATQNYMYYQFPSRNITV--------SDDTTANLVDANRGNYIGQTQSAGQSLAFYQRGIL 368 (507) T ss_pred hhHHHHHHHHHHhhccCcCccceeecccccCCccccc--------CCHHHHHHHHhcCCeEEEEeccccceeeEEecCee Confidence 2356677777777774331 11122344556655221 256788999999999998886644 666655444 Q ss_pred CCCCCcccceeeehhhHHHHHHHHHHHHHHHhhc---C-CCCHHHHHHHHHHHHHHHHHHHhhccc-------------- Q lcl|NC_021557. 303 AFPTSSHVENFIHARRILDMIHEAIIFYTMNYVD---R-LGSPMTVEAAEEGVNAYLRSKTGIAIY-------------- 364 (419) Q Consensus 303 ~~~s~~~~~~~i~vrR~~~~i~~~~~~~~~~~v~---e-~n~~~~~~~i~~~i~~~L~~l~~~g~~-------------- 364 (419) ++... .|..+.+-+=.+|++..++..+....- + |-+..-...|+..++.-|++-+++|++ T Consensus 369 ~gG~~--~fid~d~~~~~~WL~~~iq~~l~~l~~~~~kIPyt~~G~~~l~a~i~~~l~~av~nG~I~~Gvtl~~~q~~~i 446 (507) T protein:vir:99 369 CGGPN--DAVDMNIYANEIWLKSAISAQILSLFLNVPRVPANETGESMLLSVIQSVVNTAKNNGTISAGKNLNVIQQQYI 446 (507) T ss_pred eCCcc--cceeeeeecchHHHHHHHHHHHHHHHhcCCCCccChhhHHHHHHHHHHHHHHHHhccccccCCcccccchhee Confidence 32111 244455555555777777666665332 2 667777788888888888877766543 Q ss_pred ----------------ceEEEEec-ccCCH-HHhhCCEEEEEEEEEeccCceEEEEEEEEc Q lcl|NC_021557. 365 ----------------GGTFRFDR-QKNTA-EQIADGKFYYRLECHPISVMERITIDSYVD 407 (419) Q Consensus 365 ----------------~~~v~~d~-~~n~~-~~i~~G~~~~~v~~~p~~p~e~i~~~~~~~ 407 (419) ||.++.+. ++.++ +...++...+.+-+.--..+++|++....- T Consensus 447 n~~~~~~~~~~~~~~~Gyy~~~~~~s~~~~~~r~~r~~~~~~~~y~~~gaI~~v~~~~~~v 507 (507) T protein:vir:99 447 TQISGDANAWRQVANIGYWLNITFSSYTNPNTQLTEWKASYQLIYSKDDAIRFVEGTDTLI 507 (507) T ss_pred cccccccccccceeccceEEEeCChHhcChhhhhccccceEEEEEEeCCeEEEEEeeeecC Confidence 13344432 33343 444567777777777778888887776654 No 67 >protein:vir:94073 Length: 494 # NCBI annotation: hypothetical protein # Family: family:all:396 # MgeID: mge:1493 # MgeName: OP2 # Cross-refs: genbank:acc:YP_453619;genbank:gi:84662655;genbank:GeneID:5142583 Probab=81.65 E-value=0.084 Score=26.49 Aligned_cols=368 Identities=10% Similarity=-0.021 Sum_probs=150.0 Q ss_pred CCCc-cCCCeEEEEcCCCccCccccCccceEEEEccccccccccccccccccCcceeecchHHHHHHhcccchhhhHHHH Q lcl|NC_021557. 1 MAAT-FHHGPEVIEHKDGVTVVRDVKSAVTYVNGTAPIQDVHATALAREDYINKRVIIRSRAEGAAAFGVHKAGYTIPAA 79 (419) Q Consensus 1 Ma~~-~~hGVyv~e~~~~~~~i~~v~tav~~~Vgta~~a~~~~~~~~~~~~~n~pv~its~~e~~~~fg~~~~~~~l~~a 79 (419) |+.- ...=|.|. ++-.+.++...+...+.|.+. ...+.......+|..+....||....++ .+ T Consensus 1 m~~ip~s~iV~V~-~~v~~~~~~~~~f~~~l~~~~------------~~~~~~r~~~y~s~~~V~~~FG~~S~ey---~a 64 (494) T protein:vir:94 1 MPNIPISQIVSIN-PQVVSAGGTQGTLDGLLLTQA------------TGFPVTQPQVYFSAADVGTAFGLTSDEY---NA 64 (494) T ss_pred CCCCCcccEEEee-eeccccCCcccccceeEeecC------------ccCCccceeeecCHHHHHHhcCCChHHH---HH Confidence 7731 11112221 111222333444444443331 1122345556678888899999865443 34 Q ss_pred HHHHhh----ccCCc--EEEEeeccccccccccc----------------------------cccccccc---------c Q lcl|NC_021557. 80 LDAIFD----QGDGG--TIIVNNVFDPDVHKEGA----------------------------NPDPSKVT---------T 116 (419) Q Consensus 80 l~~~~~----~~~~~--~~v~~~~~~~~~~~~~~----------------------------~~~~~~~t---------~ 116 (419) ...+|. +.... .++-+............ ..+-+..+ . T Consensus 65 A~~yFs~~~~q~p~P~~l~igR~~~~a~~~~l~g~~~~~tl~~~~~~~g~l~iti~g~~~~~~i~lS~~ts~~~vA~~i~ 144 (494) T protein:vir:94 65 ALVYFAGILGGGQQPASLTIGRYASAATSAAVFGAPLTLSLAQLQTLSGTLIVTTDTQRTSAAINLSGATSFANAASLMT 144 (494) T ss_pred HHHHhhhccCCCccccEEEEEeecCccccceeeccchhhhHHhhhhcceEEEEEEcceEEEeeecccccCChhhHHHHHh Confidence 444454 22221 22222221110000000 00000000 0 Q ss_pred ceec--c--------------ccccccccccchhh-hhhhhhcccccc---ccccchhhhhhhHHHHHHHHhh---ccce Q lcl|NC_021557. 117 VDIN--G--------------TISPAGLASGFSGA-YECYNNFGYFPK---LIIAPGYSPAAAVRAEMDVVAS---RLHA 173 (419) Q Consensus 117 ~~~~--g--------------~~~~~~~~tg~~a~-~~~~~~~~~~p~---~~~ap~~~~~~~v~a~l~~~~~---~~~~ 173 (419) ..+. + ....++....+... ...-..+++... .+...+.. ......+|..+.. .+-. T Consensus 145 ~ai~~a~~~v~~d~~~~~f~v~s~ttG~~s~is~~t~~~a~~l~lt~~~~a~v~~~g~~-aet~~~a~~a~~~~~~~Wy~ 223 (494) T protein:vir:94 145 SGFTTPNFAITYDAQRRRFVLSTTATGTTASVSAVTGTLADGVGLSTASGAYVEGSGLA-ADTAASALDRLAASSSTWAI 223 (494) T ss_pred hhhccccceEEEcccCcEEEEEEccCCceeEEEEeccchhhhhhhhccccceEeecCcc-cccHHHHHHHHHhccCceEE Confidence 0000 0 00000000000000 000000111100 00111111 1122344555543 2334 Q ss_pred eEEEEeccCCCHHHHHhhhhhccccccCccceEEecce-----eEeecc-----------ccccceee---echHHHHHH Q lcl|NC_021557. 174 LAIADLPLGLTKQQAVAARGVAGTANTSSARTVLTYPH-----VVIEDT-----------TGATETRL---DPLSSRLAG 234 (419) Q Consensus 174 ~~i~d~p~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~-----~~~~~~-----------~~~~~~~~---~p~s~~vAg 234 (419) |.+.+.+.+.+......|-.. .+ ...+|..| ...... .+..-..+ ..|.+++.| T Consensus 224 f~~~~~~~~~~ilalA~wiea---~~----~~~~~~~~~~d~~~~~~~~~~~i~~~l~~~~y~~t~~~y~~~~~~aa~~g 296 (494) T protein:vir:94 224 FTTAWAASLSDRTALAQWTSD---QV----FRRIYAAWDQDAAGLSVNNVSSFGNIVKTTPFSNTIPVYGLLANAMIVLA 296 (494) T ss_pred EEEecCCCHHHHHHHHHHHhh---cC----ccEEEEEecCCcceeecccchhHHHHHHhhcCCceEEEcCCCChHHHHHH Confidence 455554433222233333221 11 11122211 100000 00000011 134566677 Q ss_pred HHHhhhhccCceecccCceeec---eeecceecccccCCcchhhccccCCceEEEEEecCCc--EEEEeccccCCCCCcc Q lcl|NC_021557. 235 VIIATDLNEGWQNSPSNREIKG---VVDLEVPINFYPSDYQNDTNFLNEAGIVTAMRSFATG--IRVFGNRSAAFPTSSH 309 (419) Q Consensus 235 ~~a~~D~~~g~~~span~~l~g---v~~~~~~~~~~~~~~~~~~~~L~~~gI~~i~~~~~~G--~~~wG~rT~~~~s~~~ 309 (419) ..+.+|-+. .+.+..+.. .-++..+ .....+++.|..+|+|+...+.+.+ +.+|...++.++. T Consensus 297 ~~aa~~~~~----~~g~~T~~~k~q~~gi~~~-----~l~~t~a~al~~~~~N~y~~~~~~~~~~~~~~gg~~sG~~--- 364 (494) T protein:vir:94 297 WGASTNLQI----AEGRTTLALRSPVSSAGVR-----VDNLANANALLSNGYTYLGKYASATNTYTVTYNGAIGGQF--- 364 (494) T ss_pred HHHhccccc----cCcceeEEeeccCCCCCCc-----cCCHHHHHHHHhcCCeEEEEecccCceEEEecCceecccc--- Confidence 777776332 333333321 1121111 1245688899999999998875433 5778666765432 Q ss_pred cceeeehhhHHHHHHHHHHHHHHHhh---cC-CCCHHHHHHHHHHHHHHHHHHHhhcccc-------------------- Q lcl|NC_021557. 310 VENFIHARRILDMIHEAIIFYTMNYV---DR-LGSPMTVEAAEEGVNAYLRSKTGIAIYG-------------------- 365 (419) Q Consensus 310 ~~~~i~vrR~~~~i~~~~~~~~~~~v---~e-~n~~~~~~~i~~~i~~~L~~l~~~g~~~-------------------- 365 (419) .| |-+-+=.+|++..++..+...+ .+ |.|..-...|+..++.-|++-.++|+++ T Consensus 365 ~~--id~~~~~~WL~~~iq~~l~~ll~~~~KIPytd~G~~~l~a~i~~~l~~av~nG~I~~Gv~~~~~q~~~i~~~~G~~ 442 (494) T protein:vir:94 365 LW--ADTALGWIALRRNLQQALFETLLAYRSLPYNADGYNALYQGAQDVVSQFVAAGVIRAGVALSASQRAQIDQAAGVP 442 (494) T ss_pred ce--eeeeccHHHHHHHHHHHHHHHHHhCCCcccChhhHHHHHHHHHHHHHHHHhCceeecccccCcchhhhhhhhhcCc Confidence 12 2222222355555555544332 33 7788888888999998888887776542 Q ss_pred ---------eEEEE-e-cccCCHHHhhCCEEEEEEEEEeccCceEEEEEEEEcc Q lcl|NC_021557. 366 ---------GTFRF-D-RQKNTAEQIADGKFYYRLECHPISVMERITIDSYVDT 408 (419) Q Consensus 366 ---------~~v~~-d-~~~n~~~~i~~G~~~~~v~~~p~~p~e~i~~~~~~~~ 408 (419) |.+.. + .+.++..+...-++.+.+.. -..+++|++...--. T Consensus 443 ~~~~~~~kGyy~~~~~~~s~~~ra~R~~~~~~~~y~~--~GAIh~v~i~~~~v~ 494 (494) T protein:vir:94 443 ISGDVVDKGWYLQVIDPITTTVRTDRGSPTVNFWYCD--GGSIQRVVVSATTVI 494 (494) T ss_pred cccceeccceeeeccCCCChhhhhccccCCceEEEEe--cCcEEEEEEeeEEeC Confidence 12222 2 24455555555555554444 677777777766554 No 68 >protein:vir:96104 Length: 504 # NCBI annotation: hypothetical protein ORF029 # Family: family:all:396 # MgeID: mge:1597 # MgeName: F8 # Cross-refs: genbank:acc:YP_001294446;genbank:gi:149408343;genbank:GeneID:5237223 Probab=78.51 E-value=0.11 Score=25.76 Aligned_cols=373 Identities=10% Similarity=0.010 Sum_probs=162.5 Q ss_pred CCCccCCCeEEEEcCCCccCccccCccceEEEEccccccccccccccccccCcceeecchHHHHHHhcccchhhhHHHHH Q lcl|NC_021557. 1 MAATFHHGPEVIEHKDGVTVVRDVKSAVTYVNGTAPIQDVHATALAREDYINKRVIIRSRAEGAAAFGVHKAGYTIPAAL 80 (419) Q Consensus 1 Ma~~~~hGVyv~e~~~~~~~i~~v~tav~~~Vgta~~a~~~~~~~~~~~~~n~pv~its~~e~~~~fg~~~~~~~l~~al 80 (419) |-.- -.=|.|. ++-.+.........++.|+++-.. .+..+....+|..+....||....+ +.+. T Consensus 1 mip~-s~iV~V~-~~v~~~~~~~~~~~~~l~l~~~~~-----------~~~~r~~~y~s~~~V~~~FG~~S~e---y~aA 64 (504) T protein:vir:96 1 MISQ-SRYIRII-SGVGAGAPVAGRKLILRVMTTNNV-----------IPPGIVIEFDNANAVLSYFGAQSEE---YQRA 64 (504) T ss_pred CCCc-cceeEee-ecccccccccccccceeEeecccC-----------CCccceEEecCHHHHHHhcCCChHH---HHHH Confidence 6531 0112221 111122222223344455553211 1224456668888888999986544 4455 Q ss_pred HHHhhccC------CcEEEEeeccccccccccccc-c-----ccccc----cceecccc---------ccc---cccccc Q lcl|NC_021557. 81 DAIFDQGD------GGTIIVNNVFDPDVHKEGANP-D-----PSKVT----TVDINGTI---------SPA---GLASGF 132 (419) Q Consensus 81 ~~~~~~~~------~~~~v~~~~~~~~~~~~~~~~-~-----~~~~t----~~~~~g~~---------~~~---~~~tg~ 132 (419) ..+|.+.. ...++-+.............. . ...++ ...+.|.. ..+ +..+.+ T Consensus 65 ~~yF~~~~~~~~~P~~l~igR~~~~a~~~~l~g~~~~~~~~~~~~i~~G~lsitv~G~~~~~~~i~~S~~ts~~~vA~~i 144 (504) T protein:vir:96 65 AAYFKFISKSVNSPSSISFARWVNTAIAPMVVGDNLPKTIADFAGFSAGVLTIMVGAAEKNITAIDTSAATSMDNVASII 144 (504) T ss_pred HHHhhcCCCCCccccEEEEEeecCcCccceEEechhHHHHHHHhhhhceEEEEEEcceeeeecccccccccchHHHHHHH Confidence 56666533 233333332211111000000 0 00000 00000000 000 000000 Q ss_pred h----h---------------------------------h------hhhhhhcccc-ccccccchhhhhhhHHHHHHHHh Q lcl|NC_021557. 133 S----G---------------------------------A------YECYNNFGYF-PKLIIAPGYSPAAAVRAEMDVVA 168 (419) Q Consensus 133 ~----a---------------------------------~------~~~~~~~~~~-p~~~~ap~~~~~~~v~a~l~~~~ 168 (419) . . . .+....+++. +......+.... ....+|..+. T Consensus 145 ~~al~~~~~~~~~~~tv~~d~~~~~f~its~~tg~~~~~~~~~a~~~~~~~~lgl~~~~~~~v~g~~ae-t~~~al~al~ 223 (504) T protein:vir:96 145 QTEIRKNTDPQLAQATVTWNPNTNQFTLVGATIGTGVLAVAKSADPQDMSTALGWSTSNVVNVAGQAAD-LPDAAVAKST 223 (504) T ss_pred HhhhhcccccccccceEEEeccCCeEEEEeeccccceeEEEeeccccchhhhhhcccccceEEeecccc-cHHHHHHHHH Confidence 0 0 0 0000000000 111111111111 1223444544 Q ss_pred h---ccceeEEEEeccCCCHHHHHhhhhhccccccCccceEEecceeEeec------cccccceeee----------chH Q lcl|NC_021557. 169 S---RLHALAIADLPLGLTKQQAVAARGVAGTANTSSARTVLTYPHVVIED------TTGATETRLD----------PLS 229 (419) Q Consensus 169 ~---~~~~~~i~d~p~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~~------~~~~~~~~~~----------p~s 229 (419) . .+-.|.+.+.+.. .++..+.-.+....+ .++ ++..+....+ .......+.. -++ T Consensus 224 ~~~~~Wy~f~~a~~~~~--dd~ilalA~w~ea~~--~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 297 (504) T protein:vir:96 224 NVSNNFGSFLFAGATLD--NDQIKAVSAWNAAQN--NQF--IYTVATSLANLGALFDLVKGNSGTALNVLSATASNDFVE 297 (504) T ss_pred hhcCCeEEEEEEeccCC--HHHHHHHHHHHhhcC--ceE--EEEEeecccchhhHHHhhhhcceeEEEEeecCccchhHH Confidence 3 3344455555432 233222211111111 111 1111111000 0000111111 134 Q ss_pred HHHHHHHHhhhhccC-ceecccCceeeceeecceecccccCCcchhhccccCCceEEEEEecCCc--EEEE-eccccCCC Q lcl|NC_021557. 230 SRLAGVIIATDLNEG-WQNSPSNREIKGVVDLEVPINFYPSDYQNDTNFLNEAGIVTAMRSFATG--IRVF-GNRSAAFP 305 (419) Q Consensus 230 ~~vAg~~a~~D~~~g-~~~span~~l~gv~~~~~~~~~~~~~~~~~~~~L~~~gI~~i~~~~~~G--~~~w-G~rT~~~~ 305 (419) .+..|.++.+|.++- --.+-..|.+.||.... ...+|++.|..+|+|++..+-+.| +.+| ...++.+. T Consensus 298 ~~~~~~~as~~f~~~ng~~T~~fk~l~GVta~~--------lt~t~~~aL~~~~~N~y~~~a~~~~~~~~~~~G~~~gG~ 369 (504) T protein:vir:96 298 QCPSEILAATNYDEPGASQNYMYYQFPGRNITV--------SDDTAANTVDKSRGNYIGVTQANGQQLAFYQRGILCGGP 369 (504) T ss_pred HHHHHHHHhcCcCcccccccccccccCCcCccc--------CCHHHHHHHHhcCCeEEEEeecccceeeEEecCeeeCCc Confidence 555677777774331 11223356667775321 256788999999999998876555 4555 34444331 Q ss_pred CCcccceeeehhhHHHHHHHHHHHHHHHhhcC----CCCHHHHHHHHHHHHHHHHHHHhhccc----------------- Q lcl|NC_021557. 306 TSSHVENFIHARRILDMIHEAIIFYTMNYVDR----LGSPMTVEAAEEGVNAYLRSKTGIAIY----------------- 364 (419) Q Consensus 306 s~~~~~~~i~vrR~~~~i~~~~~~~~~~~v~e----~n~~~~~~~i~~~i~~~L~~l~~~g~~----------------- 364 (419) . .|..|.+-+-.+|++..++..+....-. |.|+.-...|+..++.-|++-+++|++ T Consensus 370 -~--~~~wiDv~~~~~WL~~~lq~~l~~l~~~~~kIPyt~~Gi~~l~a~i~~vl~~av~~G~I~~Gv~~~~~q~~~I~~~ 446 (504) T protein:vir:96 370 -T--DAVDMNVYANEIWLKSAIAQALLDLFLNVNAVPASMVGEAMTLAVLQPVLDKATSNGTFTYGKDISAVQQQYITQI 446 (504) T ss_pred -c--ccchhhhhhhHHHHHHHHHHHHHHHHhcCCCcccCHhhHHHHHHHHHHHHHHHHhcceeccCccCCccchheeccc Confidence 1 2444777777788888888887764433 557777888888888888887766643 Q ss_pred -------------ceEEEEec-ccCCHH-HhhCCEEEEEEEEEeccCceEEEEEEEEc Q lcl|NC_021557. 365 -------------GGTFRFDR-QKNTAE-QIADGKFYYRLECHPISVMERITIDSYVD 407 (419) Q Consensus 365 -------------~~~v~~d~-~~n~~~-~i~~G~~~~~v~~~p~~p~e~i~~~~~~~ 407 (419) ||.+..+. ++-+++ .-.++...+.+...--..+++|++....- T Consensus 447 ~~~d~~~~~~~~~GYyv~~~~~s~~s~~~r~~R~~~~~~~~y~~~gaI~~v~~~~~~v 504 (504) T protein:vir:96 447 TGDRRAWRQVQTLGYWINITFSSYTNSNTGLTEWKANYTLIYSKGDAIRFVEGSDVMI 504 (504) T ss_pred ccccccccceeccceEEEecChhccChhHhhhccccceEEEEEECCeEEEEEeccccC Confidence 24455432 233333 44455566666666667788777665543 No 69 >protein:vir:107720 Length: 515 # NCBI annotation: gp17 # Family: family:all:396 # MgeID: mge:1520 # MgeName: BcepB1A # Cross-refs: genbank:acc:YP_024865;genbank:gi:48697507;genbank:GeneID:2948336 Probab=59.65 E-value=0.39 Score=22.84 Aligned_cols=379 Identities=9% Similarity=-0.009 Sum_probs=149.0 Q ss_pred CCCccCCCeEEEEcCCCccCccccCc-cceEEEEccccccccccccccccccCcceeecchHHHHHHhcccchhhhHHHH Q lcl|NC_021557. 1 MAATFHHGPEVIEHKDGVTVVRDVKS-AVTYVNGTAPIQDVHATALAREDYINKRVIIRSRAEGAAAFGVHKAGYTIPAA 79 (419) Q Consensus 1 Ma~~~~hGVyv~e~~~~~~~i~~v~t-av~~~Vgta~~a~~~~~~~~~~~~~n~pv~its~~e~~~~fg~~~~~~~l~~a 79 (419) || +.+=++|. ++-++.....+.. ...+++-+... ..+.......+|..+....||....++. + T Consensus 1 m~--I~~~~~V~-i~~~v~aa~~~~~~~f~~li~t~~~----------~~p~~r~~~y~s~~~V~~~FG~~S~ey~---a 64 (515) T protein:vir:10 1 MP--ISFDKYVA-ITSGVAAQQQIAARSFAIRVYTPNP----------MVSVDRLITATSAADVGAYFGTASEEYK---R 64 (515) T ss_pred CC--CCceeEEE-eecccccCCccccccceeeeeeccc----------CCCccceeeecCHHHHHHhcCCChHHHH---H Confidence 88 44445554 4444433333222 12233322111 1122445567888889999998755543 3 Q ss_pred HHHHhh----ccCC--cEEEEeecccccccc-ccccccc---------c-ccccceecccc----------cc---cccc Q lcl|NC_021557. 80 LDAIFD----QGDG--GTIIVNNVFDPDVHK-EGANPDP---------S-KVTTVDINGTI----------SP---AGLA 129 (419) Q Consensus 80 l~~~~~----~~~~--~~~v~~~~~~~~~~~-~~~~~~~---------~-~~t~~~~~g~~----------~~---~~~~ 129 (419) ...+|. +... ..++-+......... ....... + ......+.|.. .. .+.. T Consensus 65 A~~yFsg~~~q~p~P~~L~igR~~~~a~~~~l~g~~~~~~~l~~~~~is~G~ltitidG~~~~t~s~i~~S~ats~~~vA 144 (515) T protein:vir:10 65 AVKNFGFISKKTRRPTSIQFARWQREAGPVAIYGGAKKAAALATLQAVTAGAISFLFGGATTVTVSGISFSAATSLADVA 144 (515) T ss_pred HHHHhhhccCCcccccEEEEEeccCcccceEEEeccchhhhHHhhhcccceeEEEEEcceEEEEeeccccccccCHHHHH Confidence 444443 2222 122222111111000 0000000 0 00000000000 00 0000 Q ss_pred ccchhhhhh----------------h-------------------------------hhcccccc--ccccchhhhhhhH Q lcl|NC_021557. 130 SGFSGAYEC----------------Y-------------------------------NNFGYFPK--LIIAPGYSPAAAV 160 (419) Q Consensus 130 tg~~a~~~~----------------~-------------------------------~~~~~~p~--~~~ap~~~~~~~v 160 (419) +.+...... . ..+++... .....+.. .... T Consensus 145 s~i~tal~~~~~~~~~~~tv~~d~~~~~F~v~s~~tG~~~~is~~~~t~~~~~t~~a~~lglt~~~~av~~~g~a-aet~ 223 (515) T protein:vir:10 145 SELQTALRANADANLATCTVSYDPVGARFNFAGSPSDDTVQESISIVPQSNPAIDVAQLLGWNSAQGASYIAASP-VVSP 223 (515) T ss_pred HHHHhhhccccccccceeEEEEecCCCeEEEEEeecCCceeEEEEEecCCCchhhHHHHhccccccceEEecccc-cccH Confidence 000000000 0 00000000 00000111 1112 Q ss_pred HHHHHHHhh---ccceeEEEEeccC-CCHHHHHh---hhhhcccccc---CccceEEe-cceeEeeccccccce-eee-- Q lcl|NC_021557. 161 RAEMDVVAS---RLHALAIADLPLG-LTKQQAVA---ARGVAGTANT---SSARTVLT-YPHVVIEDTTGATET-RLD-- 226 (419) Q Consensus 161 ~a~l~~~~~---~~~~~~i~d~p~~-~~~~~~~~---~~~~~~~~~~---~s~~~~~~-~p~~~~~~~~~~~~~-~~~-- 226 (419) ..+|.++.+ .+-.|.+.+-+.. .+..++.. |-...+..-+ ........ +.-.... ....... ... T Consensus 224 ~~a~~a~~~~s~nWy~f~~a~~~~~~~~~a~~~a~a~~~e~~~~~~~~~~~~~~~~~~~~~a~~~~-~~~~~~~~~~~~~ 302 (515) T protein:vir:10 224 VDTLIASVAGNNNFGSILFTKNGGTGITLSDAEAIALQNQSYNVAYKFQVGVDDTTYSSWQAALAA-IGGVNMIYSPVAL 302 (515) T ss_pred HHHHHHHHhccCCeEEEEEeecCccccchhHHHHHHHHHhhcCceEEEEeccCccceechhhhhhh-hhhcCceEEEEec Confidence 345555543 3444555554422 22222221 1111000000 00000000 0000000 0000000 100 Q ss_pred ---chHHHHHHHHHhhhhccC-ceecccCceeeceeecceecccccCCcchhhccccCCceEEEEEecCC--cEEEEecc Q lcl|NC_021557. 227 ---PLSSRLAGVIIATDLNEG-WQNSPSNREIKGVVDLEVPINFYPSDYQNDTNFLNEAGIVTAMRSFAT--GIRVFGNR 300 (419) Q Consensus 227 ---p~s~~vAg~~a~~D~~~g-~~~span~~l~gv~~~~~~~~~~~~~~~~~~~~L~~~gI~~i~~~~~~--G~~~wG~r 300 (419) .+.....|..+.+|.++- =...-..|.+.||.--. .++.+++.|..+|+|+...+.++ .+.+|-.= T Consensus 303 ~~~~~~a~~~g~~asvnf~~~ng~iT~kfKq~~Gita~~--------lt~t~a~al~~~~~N~Y~~~~~~~~~~~~~~~G 374 (515) T protein:vir:10 303 AAEYHDMQDGIIEAATDFTQQGGATGYMYVQFNNQTPAV--------NDDTLSGILDDLNINYYGQTQVNGTNLSFYQDG 374 (515) T ss_pred cCcchHHHHHHHHHhcCCCccchhheeccccCCCCcccc--------CCHHHHHHHHhcCCeEEEEEeccCceEEEEeCC Confidence 123355667777764332 12223345555554222 24678899999999999887654 48888655 Q ss_pred ccCCCCCcccceeeehhhHHHHHHHHHHHHHHHhhcC----CCCHHHHHHHHHHHH-HHHHHHHhhcccce--------- Q lcl|NC_021557. 301 SAAFPTSSHVENFIHARRILDMIHEAIIFYTMNYVDR----LGSPMTVEAAEEGVN-AYLRSKTGIAIYGG--------- 366 (419) Q Consensus 301 T~~~~s~~~~~~~i~vrR~~~~i~~~~~~~~~~~v~e----~n~~~~~~~i~~~i~-~~L~~l~~~g~~~~--------- 366 (419) ++++...+ |.+|.+.|-.+|++..++..+....-. |.+..=...|+..+. +-|+.-+++|++.- T Consensus 375 ~~~gG~~~--~~WiD~~~g~~WL~~~iq~~l~~L~~s~~KIPytd~G~a~i~a~v~q~vl~~av~nG~I~~Gv~ls~~Q~ 452 (515) T protein:vir:10 375 VMMGGPTD--PRDSNVYANEQWLKSYAGASFMSLQLAQGKIPANIEGRGLLLGKMTKDIIPAAKLNGTFSIGKTLTVDQQ 452 (515) T ss_pred eeeCCccc--hhHHHHHhhHHHHHHHHHHHHHHHHhcCCCCccChhhHHHHHHHHHHHHHHHHHhCCeeecCcccchhHH Confidence 55443333 445899999999999999988764322 345555555665553 45655555443321 Q ss_pred ---------------------EEEE-ecccCCHHHhhCCEEEEEEEEEeccCceEEEEEEEEc Q lcl|NC_021557. 367 ---------------------TFRF-DRQKNTAEQIADGKFYYRLECHPISVMERITIDSYVD 407 (419) Q Consensus 367 ---------------------~v~~-d~~~n~~~~i~~G~~~~~v~~~p~~p~e~i~~~~~~~ 407 (419) .+.. +.+..++.+...+.+.+..-..-=-.+++|+....-- T Consensus 453 ~~i~~~~g~d~~~~~~~~~Gyy~~~~~~~~~~~~~r~~~~~~~~~~y~~g~~i~~i~~~~~~v 515 (515) T protein:vir:10 453 LFVTELTGDDTAWQKVQNLGYWYDVQISSFVDTGGTTKYQAVYSLVYSKDDLIRKVVGTHTLI 515 (515) T ss_pred HHHHhhhcCcccccchhhcceeEecCcCCCCCcccccccCceeEEEEEcCceEEEEEeeeecC Confidence 1111 1122223334444443333333344455555544433 No 70 >protein:vir:108311 Length: 249 # NCBI annotation: hypothetical protein # Family: family:all:28027 # MgeID: mge:2007 # MgeName: BA3 # Cross-refs: genbank:acc:YP_001552279;genbank:gi:160700604;genbank:GeneID:5758827 Probab=28.66 E-value=1.7 Score=19.28 Aligned_cols=100 Identities=14% Similarity=0.035 Sum_probs=68.9 Q ss_pred ehhhHHHHHHHHHHHHHHHhhcCCCCHHHHHHHHHHHHHHHHHHHhhcccceEEEEecccCCHHHhhCCEEEEEEE---- Q lcl|NC_021557. 315 HARRILDMIHEAIIFYTMNYVDRLGSPMTVEAAEEGVNAYLRSKTGIAIYGGTFRFDRQKNTAEQIADGKFYYRLE---- 390 (419) Q Consensus 315 ~vrR~~~~i~~~~~~~~~~~v~e~n~~~~~~~i~~~i~~~L~~l~~~g~~~~~v~~d~~~n~~~~i~~G~~~~~v~---- 390 (419) -.|-.-+.|..++++..--.++|+-.....++....++..|.++...++.-+.+. +- + .-+..|.....|+ T Consensus 1 ~sqt~~~II~~ALk~aGvla~Getp~aee~~DA~~~Ln~Ml~~W~~~rl~V~~~~---~~-t-~vl~~G~~~YtVGi~~~ 75 (249) T protein:vir:10 1 MARTVGDIIRSSMRKIGVLAAGEPLPANEGDDALEVFAQMVDAWTNETLLIPVVN---VV-T-KVLVENQPEYTIGIYPE 75 (249) T ss_pred CccCHHHHHHHHHHHccccccCCCCCHhHHHHHHHHHHHHHHHHHhCceeEEeee---ee-e-eeccCCcceEEeeeccc Confidence 2223347888999999999999999999999999999999999888886544331 10 0 0145677777777 Q ss_pred ----------EEeccCc--eEEEEEEEEcchHHHHH-----HHhcC Q lcl|NC_021557. 391 ----------CHPISVM--ERITIDSYVDTKFISNA-----LSLAA 419 (419) Q Consensus 391 ----------~~p~~p~--e~i~~~~~~~~~~~~~~-----~~~~a 419 (419) +--.+|. ++-.|+-..|.+|+... ++.++ T Consensus 76 ~~~~~~p~~~i~~~RP~~i~sA~~r~~~d~~~~~~~i~~EdY~rI~ 121 (249) T protein:vir:10 76 PVPDPLPSNHIETGRPERILSAFIRDRYDTDYIQEIIDVETYSRIS 121 (249) T ss_pred cccccCCCCceEeecchheeeeeeecccccchhhhhhchhhhhhcC Confidence 2234555 66677777888887766 34444 Done!