Query         lcl|NC_017984.1_cdsid_YP_006383790.1 [gene=A323_gp40] [protein=hypothetical protein] [protein_id=YP_006383790.1] [location=18625..20088]
Match_columns 487
No_of_seqs    165 out of 206
Neff          7.7
Searched_HMMs 1612
Date          Thu Nov 7 13:46:40 2013
Command       /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_40 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_40_vs_rec_db.hhr

 No Hit                             Prob  E-value  P-value  Score    SS Cols  Query HMM Template HMM
  1 protein:vir:106730 Length: 501 100.0   3E-162   2E-165  906.2  50.8  486  1-487     1-501 (501)
  2 protein:vir:101576 Length: 501 100.0   8E-162   5E-165  903.8  50.4  486  1-487     1-501 (501)
  3 protein:vir:3636 Length: 501 # 100.0   3E-161   2E-164  900.9  51.3  486  1-487     1-501 (501)
  4 protein:vir:78611 Length: 501  100.0   5E-161   3E-164  899.5  51.1  486  1-487     1-501 (501)
  5 protein:vir:94073 Length: 494  100.0   2E-157   1E-160  879.3  49.2  483  3-487     1-494 (494)
  6 protein:vir:99586 Length: 507  100.0   2E-150   1E-153  841.5  45.6  477  5-486     1-507 (507)
  7 protein:vir:96104 Length: 504  100.0   2E-148   1E-151  830.9  48.0  476  5-486     1-504 (504)
  8 protein:vir:107720 Length: 515 100.0   1E-148   8E-152  831.3  46.5  480  1-486     1-515 (515)
  9 protein:vir:5260 Length: 502 # 100.0   7E-141   4E-144  789.1  48.8  465  1-487     1-502 (502)
 10 protein:vir:95263 Length: 450  100.0   3E-120   2E-123  675.6  43.3  433  7-487     1-449 (450)
 11 protein:vir:80052 Length: 331  100.0  1.2E-84  7.6E-88  480.6  36.8  319  7-487     1-330 (331)
 12 protein:vir:3165 Length: 426 # 100.0  1.3E-66    8E-70  381.8  30.7  395  1-487     1-426 (426)
 13 protein:vir:1996 Length: 495 #  99.0  5.1E-09  3.2E-12   66.0  30.0  426  1-487     4-492 (495)
 14 protein:vir:489 Length: 498 #   99.0  7.8E-09  4.9E-12   65.0  31.3  430  1-475     3-498 (498)
 15 protein:vir:4463 Length: 498 #  98.9  1.1E-08    7E-12   64.1  31.7  433  1-475     3-498 (498)
 16 protein:vir:4517 Length: 498 #  98.9  1.4E-08  8.7E-12   63.6  33.0  434  1-475     3-498 (498)
 17 protein:vir:102957 Length: 437  98.7  6.9E-08  4.3E-11   59.9  28.8  393  1-487     1-437 (437)
 18 protein:vir:105470 Length: 451  98.6  2.9E-07  1.8E-10   56.5  32.7  409  1-487     1-451 (451)
 19 protein:vir:78986 Length: 436   98.6    3E-07  1.8E-10   56.4  28.3  395  1-486     1-436 (436)
 20 protein:vir:99306 Length: 587   98.4  1.1E-06  6.7E-10   53.3  35.7  434  1-487     1-582 (587)
 21 protein:vir:95741 Length: 587   98.3  1.4E-06  8.8E-10   52.6  36.5  432  1-487     1-582 (587)
 22 protein:vir:102359 Length: 356  98.2  2.5E-06  1.6E-09   51.3  21.9  331  91-485    1-356 (356)
 23 protein:vir:107865 Length: 477  98.0  6.4E-06    4E-09   49.1  31.0  420  1-487     1-466 (477)
 24 protein:vir:80779 Length: 569   98.0  7.4E-06  4.6E-09   48.7  34.8  430  1-487     1-564 (569)
 25 protein:vir:79092 Length: 477   98.0  8.7E-06  5.4E-09   48.3  31.4  413  1-487     1-466 (477)
 26 protein:vir:63742 Length: 562   97.7  3.2E-05    2E-08   45.2  36.0  434  1-487     1-557 (562)
 27 protein:vir:80488 Length: 562   97.6  3.8E-05  2.4E-08   44.8  39.7  434  1-487     1-557 (562)
 28 protein:vir:6079 Length: 396 #  97.5    5E-05  3.1E-08   44.1  29.2  364  1-487     1-383 (396)
 29 protein:vir:2035 Length: 396 #  97.5  6.2E-05  3.8E-08   43.7  20.6  352  73-487    1-383 (396)
 30 protein:vir:5711 Length: 396 #  97.5  6.3E-05  3.9E-08   43.6  20.5  349  73-487    1-383 (396)
 31 protein:vir:10336 Length: 386   97.4  8.3E-05  5.1E-08   43.0  25.2  360  1-487     1-377 (386)
 32 protein:vir:1845 Length: 392 #  97.2  0.00012  7.7E-08   42.0  23.5  347  63-487    1-380 (392)
 33 protein:vir:96586 Length: 587   97.2  0.00013  7.8E-08   42.0  37.1  435  1-487     1-582 (587)
 34 protein:vir:107310 Length: 581  97.1  0.00016    1E-07   41.4  30.9  416  1-487     52-566 (581)
 35 protein:vir:98553 Length: 395   97.1  0.00018  1.1E-07   41.1  22.5  350  63-487    1-383 (395)
 36 protein:vir:79141 Length: 391   96.9  0.00026  1.6E-07   40.3  25.5  361  1-487     1-378 (391)
 37 protein:vir:78206 Length: 390   96.5  0.00052  3.2E-07   38.6  27.4  361  1-487     1-378 (390)
 38 protein:vir:103993 Length: 390  96.5  0.00052  3.2E-07   38.6  27.4  361  1-487     1-378 (390)
 39 protein:vir:80984 Length: 666   96.5  0.00055  3.4E-07   38.5  24.3  431  1-487     145-649 (666)
 40 protein:vir:106984 Length: 743  96.3  0.00074  4.6E-07   37.8  29.1  426  1-487     214-730 (743)
 41 protein:vir:98263 Length: 664   96.3  0.00075  4.6E-07   37.7  23.4  433  1-487     104-648 (664)
 42 protein:vir:1172 Length: 391 #  95.7   0.0015  9.5E-07   36.0  27.8  361  1-487     2-379 (391)
 43 protein:vir:104858 Length: 729  95.6   0.0017  1.1E-06   35.7  26.4  423  1-487     210-715 (729)
 44 protein:vir:79181 Length: 390   95.4   0.0022  1.4E-06   35.2  26.9  358  1-487     1-378 (390)
 45 protein:vir:100829 Length: 607  95.2   0.0025  1.6E-06   34.8  38.0  435  1-487     15-596 (607)
 46 protein:vir:98824 Length: 774   94.9   0.0032    2E-06   34.2  18.5  420  1-487     274-765 (774)
 47 protein:vir:108052 Length: 660  94.9   0.0033    2E-06   34.2  28.3  423  1-487     120-645 (660)
 48 protein:vir:7653 Length: 581 #  94.8   0.0035  2.2E-06   34.1  29.5  430  1-487     20-566 (581)
 49 protein:vir:103456 Length: 659  94.6   0.0039  2.4E-06   33.8  26.8  429  1-487     124-644 (659)
 50 protein:vir:102819 Length: 648  94.5   0.0043  2.7E-06   33.6  22.8  420  1-487     122-645 (648)
 51 protein:vir:5833 Length: 742 #  94.5   0.0044  2.7E-06   33.5  29.5  388  1-487     308-736 (742)
 52 protein:vir:96740 Length: 388   93.8   0.0063  3.9E-06   32.7  25.3  351  63-487    1-377 (388)
 53 protein:vir:6894 Length: 660 #  93.7   0.0065    4E-06   32.6  26.6  425  1-487     119-644 (660)
 54 protein:vir:7206 Length: 659 #  93.5   0.0075  4.7E-06   32.2  26.5  421  1-487     119-644 (659)
 55 protein:vir:101804 Length: 663  93.4   0.0077  4.8E-06   32.2  27.6  420  1-487     120-646 (663)
 56 protein:vir:100323 Length: 393  92.7     0.01  6.4E-06   31.5  30.0  357  1-487     1-380 (393)
 57 protein:vir:101187 Length: 663  92.1    0.013    8E-06   30.9  26.4  420  1-487     127-646 (663)
 58 protein:vir:6594 Length: 666 #  92.1    0.013  8.1E-06   30.9  26.3  422  1-487     153-649 (666)
 59 protein:vir:100539 Length: 663  91.5    0.015  9.6E-06   30.5  24.4  419  1-487     120-646 (663)
 60 protein:vir:104477 Length: 749  90.6     0.02  1.2E-05   29.9  27.9  426  1-487     206-737 (749)
 61 protein:vir:5663 Length: 671 #  90.6     0.02  1.2E-05   29.9  27.0  418  1-487     143-659 (671)
 62 protein:vir:106427 Length: 679  86.7    0.044  2.7E-05   28.0  28.4  430  1-487     120-663 (679)
 63 protein:vir:79798 Length: 717   80.0    0.099  6.2E-05   26.1  24.2  415  1-487     192-717 (717)
 64 protein:vir:3788 Length: 376 #  67.2     0.26  0.00016   23.8  21.6  329  119-487   1-373 (376)
 65 protein:vir:78782 Length: 370   51.9     0.57  0.00035   21.9  25.5  339  67-487    1-369 (370)
 66 protein:vir:3751 Length: 376 #  36.6      1.2  0.00072   20.2  21.4  333  77-480    1-376 (376)

No 1 >protein:vir:106730 Length: 501 # NCBI annotation: gp04 # Family: family:all:396 # MgeID: mge:1599 # MgeName: Bcep1 # Cross-refs: genbank:acc:NP_944312;genbank:gi:38638611;genbank:GeneID:2657359 Probab=100.00 E-value=2.8e-162 Score=906.23 Aligned_cols=486 Identities=32% Similarity=0.487 Sum_probs=458.1 Q ss_pred CCcCCccccceEEEeeeeecccccccccceeEEecCcceee---eeeccHHHHHHhcCCChHHHHHHHHHhhcccCCccC Q lcl|NC_017984. 1 MQFNSIPASNIAAVYPAVIGGGGNPLGLNTNLFVQDAIYPN---YEYFSNTLVGQHYGLESPIYKFATVYFNGFRNATTR 77 (487) Q Consensus 1 ~~~~~ip~s~iV~V~~~~~~~~~~~~~~~~ll~~~~~~~~~---~~y~s~~~V~~~fg~~s~ey~aA~~yF~g~~~q~p~ 77 (487) ||+|+||+||||||+|+|+++++++++|++|||++++.+|+ ++|+++++|++|||.+||||+||++||+||+||+|| T Consensus 1 m~~~~ip~s~iV~V~~~v~~~~~~~~~f~~lll~~~~~~~~~r~~~y~s~~~V~~~FG~~S~ey~aA~~yFsg~~~q~p~ 80 (501) T protein:vir:10 1 MPTTTIPIDQIVQMLPGVIGAGGAPGRLTGLVLTQDTSVQPGQLADFFQKTDVENWFGALSNEAKIADAYFPGIVNGGQL 80 (501) T ss_pred CCcCccccceEEEEeeecccCCCcccccceEEEecccCCCccceeeecCHHHHHHhcCCChHHHHHHHHHhhhhcCCCcc Confidence 99999999999999999999999999999999999998884 579999999999999999999999999999999999 Q ss_pred CCEEEEEeeecccceeeEeeccccccchhhheeeeeEEEEEEccceEEEEeeccccCchHHHHHhhhhhhee---eEEEe Q lcl|NC_017984. 
78 PNSLFITKYNLTDVPASLIGGDITSTTLADLKLINGTLTIVVDGVSKSVPVDLATANSYSDAAALIATALTL---PCTYE 154 (487) Q Consensus 78 P~~l~igr~~~~~~~~~l~g~~~~~~~~~~~~~~~g~~~iti~g~~~~~~i~~s~ats~~~vA~~i~t~l~a---~vt~d 154 (487) |++||||||++++++++|+|+++++.+++.++.++|+|+|++||+.++..||||.+++|+++|+.|+++|++ +|+|| T Consensus 81 P~~l~igR~~~~~~~~~l~g~~l~~~~la~~~~~~g~l~i~i~g~~~~~~i~~s~ats~~~vA~~i~~al~~~~~tv~~d 160 (501) T protein:vir:10 81 PYDLKFARYVAADAPASVYGIPLTGITLAQLQGYSGTLTVTTAAQHVSANISLAAATSFANAATLIEAAFTSPDFVVAYD 160 (501) T ss_pred ccEEEEEeecccCccceeeeceehhhhhhhhhheeeEEEEeeccceeeeccccccccCHHHHHHHHHHhhcCCceEEEEe Confidence 999999999999999999999999999999999999999999999888789999999999999999999974 69999 Q ss_pred cccceEEEEecccccceeEEeccc--chhhhhhhccccce-eEecCcccccHHHHHHHHHhcccceeEEEEEeccCChhH Q lcl|NC_017984. 155 STVKGFVIKSGTSGANSTISFATG--DISDDLKLTQETGA-VLNNHTAADTPTTGALNALAFSQNFVNITYSEGVFNEDA 231 (487) Q Consensus 155 ~~~~~f~its~t~g~~stit~atg--d~a~~l~lt~~~gA-~~~~G~aaet~~~al~a~~~~~~~wy~~~~~~~~~~~~~ 231 (487) ++.+||+|+++++|..++|+++++ +++.+|||++++++ +.++|.++|+|.++|.++.+++++||+|.++++ .++++ T Consensus 161 ~~~~~f~i~~~t~G~~~~i~~~t~~~d~a~~l~Lt~~~~a~v~~~g~~aet~~~Al~a~~~~~~~Wy~f~~a~~-~~~~~ 239 (501) T protein:vir:10 161 ALRNRFTVVTNTTGTAAAISAVTGTNNLADELGLSAAAGATLQAAGVAADTPASAMNRAVGLSRNWATFTTAWT-AVIAD 239 (501) T ss_pred cccceEEEEecccCcceeEEEeeccccchhhhcccccCceeEEecCcccccHHHHHHHHHhcccceEEEEEEec-CChHH Confidence 999999999999999999999975 49999999999988 469999999999999999999999999998875 56788 Q ss_pred HHHHHHHHhccCceEEEEEcccccccc-ccchHH-HHHHHhCCcceEEEecCCCchHHHHHHHHHhcCcCcCCceeeeee Q lcl|NC_017984. 232 LKDLALWVTSQNSRFKLYTWGLDPVAL-GQSGAS-FGEWAKENTSGVVPLYGTFDKAAFFCGVSGSINYQEENGRTTTAF 309 (487) Q Consensus 232 i~a~A~w~~a~~~~~~~~~~~~~~~~~-~~~~~~-~~~l~~~~~~~t~~~y~~~~~~a~~~g~~as~~~~~~~gs~T~~f 309 (487) ++++|+|+|++++||++..++.+.... ...++. 
...|+.++|.|++++||+++++++++|+++++||++.||++|||| T Consensus 240 ~la~A~wi~a~~~~f~~~~~~~~~~~~~~~~~~~i~~~l~~~~y~~t~~~y~~~~~~aa~~g~~as~nf~~~~g~~T~~f 319 (501) T protein:vir:10 240 RLAFAAWNSGQAYKYMYVAPDLEAASIVTNNAASFGAQVFAAPYQGTLPLYGDQATAGAVMGYAASINFQLRNGRTVLAF 319 (501) T ss_pred HHHHHHHHHhcCceEEEEEecCcceeeecccchhHHHHHHhcCCCceEEECCCCCHHHHHHHHHHhcCcccCcceeeeee Confidence 889999999999999999888875443 444444 456788899999999999999999999999999999999999999 Q ss_pred eec-CcccccCCCHHHHHHHHhCCceEEEEecCCCceEEEEECCEEcCCceehHHHHHHHHHHHHHHHHHHHHHHhcCCC Q lcl|NC_017984. 310 RSQ-DGLVPDVTNEADAETLVKNGYSFYGAWATANDRFQFAGNGSVTGQYKWIDNFDFQVFLRTQLQLAYMNMFQAQKTI 388 (487) Q Consensus 310 k~l-~Gv~~~~lt~t~~~al~~~~~n~y~~~~~~~~~~~~~~~G~~sgg~~~iD~~~~~dWl~~~iq~~l~~ll~~~~kI 388 (487) ||+ +||+|++++++|+++|++||||||+.|++.+++++|+++|+|||+++|||+++|+|||+++||++|++||.+++|| T Consensus 320 kql~~Gv~a~~l~~t~a~al~~~~~N~y~~~~~~~~~~~~~~~G~~sG~~~wiD~~~g~dWl~~~iq~~l~~ll~~~~kI 399 (501) T protein:vir:10 320 RQFNAGVPATAHDLPTANALRSNNYTYIGAYANAANNYTIAYDGKLSGKFLWVDTYLDQIYLNAELQRAEFEAMLAYNSL 399 (501) T ss_pred cccCCCcCcccCCHHHHHHHHhcCCeEEEEEecccceeeEEEcceeeccceehhhHhhHHHHHHHHHHHHHHHHhcCCCc Confidence 996 8999999999999999999999999999999999999999999998899999999999999999999999999999 Q ss_pred CcCHhhHHHHHHHHHHHHHHHHhcCccccCcccCccccccccccccCc-cccceeeeeeEEeccC--CCHHHHhhcccCC Q lcl|NC_017984. 389 PYNDQGIATVRAYSQDPIDQGINFGGIRAGVNLSNAQKFQVNQEAGFD-AASQLFTKGWALSVTL--PDSQTRVARESFI 465 (487) Q Consensus 389 Pyt~~G~~~l~~~v~~vl~~a~~nG~Ia~Gv~l~~~q~~~~~~~~g~~-~~~~~~~~Gy~~~~~~--~s~~dra~R~~~~ 465 (487) |||+.|+++|+++|+++|+||++||+|+|||+|++.|+++++++.|.+ .++++++||||+++.. 
.++++|++|++|+ T Consensus 400 Pyt~~G~~~l~a~i~~~l~~av~nG~Ia~Gv~l~~~q~~~i~~~~g~~~~~~~v~~~Gyy~~~~~~~~~~~~R~~R~~p~ 479 (501) T protein:vir:10 400 PYNEDGYTANYRAGVDVIDAAVTSGIIRAGVTLTNSQLQQIDAVAGVAGAGALVQTRGWYFLIGNPANPGQARQNRTSPA 479 (501) T ss_pred ccCHHHHHHHHHHHHHHHHHHHhCcceecCcccCcccceeecccccccccccceeccceEEeeCcccCChhhhhhcccCc Confidence 999999999999999999999999999999999999999999999987 5688999999999865 4578999999999 Q ss_pred eEEEEEECCeEEEEEEEEEeeC Q lcl|NC_017984. 466 IKLFYTDGSSMQRLEMTATNVQ 487 (487) Q Consensus 466 i~~~~~~aGAIh~v~i~gt~vq 487 (487) ++|+|+++||||+|+|..++|= T Consensus 480 ~~~~y~~~gaIh~v~i~s~~v~ 501 (501) T protein:vir:10 480 CTLWYSDGGSIQELTIGSNAVI 501 (501) T ss_pred eEEEEEeCCceeEEEeeeeecC Confidence 9999999999999999998888 No 2 >protein:vir:101576 Length: 501 # NCBI annotation: gp04 # Family: family:all:396 # MgeID: mge:1477 # MgeName: Bcep43 # Cross-refs: genbank:acc:NP_958108;genbank:gi:41057654;genbank:GeneID:2716834 Probab=100.00 E-value=7.8e-162 Score=903.85 Aligned_cols=486 Identities=31% Similarity=0.491 Sum_probs=458.2 Q ss_pred CCcCCccccceEEEeeeeecccccccccceeEEecCcceee---eeeccHHHHHHhcCCChHHHHHHHHHhhcccCCccC Q lcl|NC_017984. 1 MQFNSIPASNIAAVYPAVIGGGGNPLGLNTNLFVQDAIYPN---YEYFSNTLVGQHYGLESPIYKFATVYFNGFRNATTR 77 (487) Q Consensus 1 ~~~~~ip~s~iV~V~~~~~~~~~~~~~~~~ll~~~~~~~~~---~~y~s~~~V~~~fg~~s~ey~aA~~yF~g~~~q~p~ 77 (487) ||+|+||+||||||+|+|+++++++++|++|||++++.+|+ .+|+|+++|++|||.+||||+||++||+||+||+|| T Consensus 1 m~~~~ip~s~iV~V~~~v~~~~~~~~~~~~l~l~~~~~~~~~~~~~~~s~~~V~~~FG~~S~ey~aA~~yFsg~~~q~p~ 80 (501) T protein:vir:10 1 MPTTTIPIDQIVQMLPGVIGAGGAPGRLTGLVLTQDTSVQPGQLADFFQETDVENWFGALSNEAKIADAYFPGIVNGGQL 80 (501) T ss_pred CCCCCcccceEEEEeeecccCCCccccceeEEEeccCCCCccceEEecCHHHHHHhcCCChHHHHHHHHHhhhhcCCCcc Confidence 99999999999999999999999999999999999999885 479999999999999999999999999999999999 Q ss_pred CCEEEEEeeecccceeeEeeccccccchhhheeeeeEEEEEEccceEEEEeeccccCchHHHHHhhhhhhe---eeEEEe Q lcl|NC_017984. 
78 PNSLFITKYNLTDVPASLIGGDITSTTLADLKLINGTLTIVVDGVSKSVPVDLATANSYSDAAALIATALT---LPCTYE 154 (487) Q Consensus 78 P~~l~igr~~~~~~~~~l~g~~~~~~~~~~~~~~~g~~~iti~g~~~~~~i~~s~ats~~~vA~~i~t~l~---a~vt~d 154 (487) |++||||||++++++++|+|++++..+++++++++|+|+|++||+.++..||||.+++|+++|+.|+++|+ ++|+|| T Consensus 81 P~~l~igR~~~~~~~~~l~g~~l~~~~la~~~~~sg~l~vti~g~~~~~~i~ls~ats~~~vAs~i~~al~~~~~tv~~d 160 (501) T protein:vir:10 81 PYDLKFARYVAADAPASVYGIPLTGVTLAQLQGYSGTLTVTTAAQHVSANISLAAATSFANAATLIEAAFTSPDFVVAYD 160 (501) T ss_pred ccEEEEEeecCCCccceEeccchhhhhhhhcceeeeEEEEeeccceeecccccccccCHHHHHHHHhhhccCCceEEEEc Confidence 99999999999999999999999999999999999999999999988877999999999999999999997 469999 Q ss_pred cccceEEEEecccccceeEEecccc--hhhhhhhcccccee-EecCcccccHHHHHHHHHhcccceeEEEEEeccCChhH Q lcl|NC_017984. 155 STVKGFVIKSGTSGANSTISFATGD--ISDDLKLTQETGAV-LNNHTAADTPTTGALNALAFSQNFVNITYSEGVFNEDA 231 (487) Q Consensus 155 ~~~~~f~its~t~g~~stit~atgd--~a~~l~lt~~~gA~-~~~G~aaet~~~al~a~~~~~~~wy~~~~~~~~~~~~~ 231 (487) ++.+||+|+++++|..++|+++++. ++.+|||++++++. .++|.++|+|.++|.++.+++++||+|.++++ .++++ T Consensus 161 ~~~~~f~its~ttG~~~~i~~~~~~~~la~~l~Lt~~~~a~v~~~g~~aet~~~a~~a~~~~~~~Wy~f~~a~~-~~~~~ 239 (501) T protein:vir:10 161 ALRNRFTVVTNATGTAAAISAVTGTNNLADELGLSAAAGATLQAAGVAADTPASAMNRAVGLSRNWATFTTAWT-AVIAD 239 (501) T ss_pred ccCceEEEEeeccCCceeEEEeeCchhhhhhcCccccccceEEecCcccccHHHHHHHHHhccCceEEEEEecC-CChHH Confidence 9999999999999999999999764 89999999999885 69999999999999999999999999998875 56788 Q ss_pred HHHHHHHHhccCceEEEEEccccccc-cccchHH-HHHHHhCCcceEEEecCCCchHHHHHHHHHhcCcCcCCceeeeee Q lcl|NC_017984. 232 LKDLALWVTSQNSRFKLYTWGLDPVA-LGQSGAS-FGEWAKENTSGVVPLYGTFDKAAFFCGVSGSINYQEENGRTTTAF 309 (487) Q Consensus 232 i~a~A~w~~a~~~~~~~~~~~~~~~~-~~~~~~~-~~~l~~~~~~~t~~~y~~~~~~a~~~g~~as~~~~~~~gs~T~~f 309 (487) ++++|+|+|++++||+++.++.+... .....+. 
...|+.++|.|++++||+++++++++|+++++||++.+|++|||| T Consensus 240 ~la~A~wiea~~~~f~~~~~~~~~~~~~~~~~~~i~~~l~~~~y~~t~~~y~~~~~~aa~~g~~as~nf~~~~g~~T~~f 319 (501) T protein:vir:10 240 RLAFAAWNSGQAYKYMYVAPDLEAASIVTNNAASFGAQVFAAPYQGTLPLYGDQATAGAVMGYAASINFQLRNGRTVLAF 319 (501) T ss_pred HHHHHHHHHhcCceEEEEEecCchhhhhhhhhhhHHHHHHhcCCCceEEECCCCcHHHHHHHHHHhhCcccCccceeeec Confidence 88999999999999999888876433 3444444 456778899999999999999999999999999999999999999 Q ss_pred eecC-cccccCCCHHHHHHHHhCCceEEEEecCCCceEEEEECCEEcCCceehHHHHHHHHHHHHHHHHHHHHHHhcCCC Q lcl|NC_017984. 310 RSQD-GLVPDVTNEADAETLVKNGYSFYGAWATANDRFQFAGNGSVTGQYKWIDNFDFQVFLRTQLQLAYMNMFQAQKTI 388 (487) Q Consensus 310 k~l~-Gv~~~~lt~t~~~al~~~~~n~y~~~~~~~~~~~~~~~G~~sgg~~~iD~~~~~dWl~~~iq~~l~~ll~~~~kI 388 (487) ||+| ||+|++++++|+++|++||||||++|++.+++++|+++|+|||+++|||+++|+|||+++||++|++||++++|| T Consensus 320 kq~~~Gi~a~~lt~t~a~al~~~~~N~y~~~~~~~~~~~~~~~G~~sG~~~wiD~~~~~~Wl~~~iq~~l~~ll~~~~kI 399 (501) T protein:vir:10 320 RQFNAGVPATAHDLPTANALRSNNYTYIGAYANAANNYTIAYDGKLSGKFLWVDTYLDQIYLNAELQRAEFEAMLAYNSL 399 (501) T ss_pred cccCCCcCcccCCHHHHHHHHhcCCeEEEEeccccceeeEEecCeeeccceeehhhhhHHHHHHHHHHHHHHHHHhcCCc Confidence 9986 899999999999999999999999999999999999999999998899999999999999999999999999999 Q ss_pred CcCHhhHHHHHHHHHHHHHHHHhcCccccCcccCccccccccccccCc-cccceeeeeeEEeccC--CCHHHHhhcccCC Q lcl|NC_017984. 389 PYNDQGIATVRAYSQDPIDQGINFGGIRAGVNLSNAQKFQVNQEAGFD-AASQLFTKGWALSVTL--PDSQTRVARESFI 465 (487) Q Consensus 389 Pyt~~G~~~l~~~v~~vl~~a~~nG~Ia~Gv~l~~~q~~~~~~~~g~~-~~~~~~~~Gy~~~~~~--~s~~dra~R~~~~ 465 (487) |||+.|+++|+++|+++|+||++||+|+|||+|++.|+++++++.|.+ .++++++||||+++.. 
.+++||++|++|+ T Consensus 400 Pyt~~G~~~l~a~v~~~l~~av~nG~I~~Gv~l~~~q~~~i~~~~g~~~~~~~v~~~Gyy~~~~~~~~~~~~R~~R~~p~ 479 (501) T protein:vir:10 400 PYNEDGYTALYRAGVDVIDAAVTSGIIRAGVTLTNSQLQQIDAAAGVAGAGQLVQTRGWYFLIGDPANPGQARQNRTTPA 479 (501) T ss_pred ccCHHHHHHHHHHHHHHHHHHHhCceeecCCCCCcccceeeccccCccccccceeccceeEeeccccCChhhhhhccccc Confidence 999999999999999999999999999999999999999999999987 5688999999999855 5688999999999 Q ss_pred eEEEEEECCeEEEEEEEEEeeC Q lcl|NC_017984. 466 IKLFYTDGSSMQRLEMTATNVQ 487 (487) Q Consensus 466 i~~~~~~aGAIh~v~i~gt~vq 487 (487) ++|+|+++||||+|+|..++|= T Consensus 480 ~~~~y~~~gaIh~v~i~s~~v~ 501 (501) T protein:vir:10 480 CTLWYSDGGSIQQLTIGSNAVI 501 (501) T ss_pred eEEEEEeCCceeEEEeeeeecC Confidence 9999999999999999998888 No 3 >protein:vir:3636 Length: 501 # NCBI annotation: gp04 # Family: family:all:396 # MgeID: mge:75 # MgeName: Bcep781 # Cross-refs: genbank:acc:NP_705631;genbank:gi:23752316;genbank:GeneID:955753 Probab=100.00 E-value=2.6e-161 Score=900.94 Aligned_cols=486 Identities=31% Similarity=0.477 Sum_probs=457.8 Q ss_pred CCcCCccccceEEEeeeeecccccccccceeEEecCccee---eeeeccHHHHHHhcCCChHHHHHHHHHhhcccCCccC Q lcl|NC_017984. 1 MQFNSIPASNIAAVYPAVIGGGGNPLGLNTNLFVQDAIYP---NYEYFSNTLVGQHYGLESPIYKFATVYFNGFRNATTR 77 (487) Q Consensus 1 ~~~~~ip~s~iV~V~~~~~~~~~~~~~~~~ll~~~~~~~~---~~~y~s~~~V~~~fg~~s~ey~aA~~yF~g~~~q~p~ 77 (487) ||+|+||+||||||+|+|+++++++++|++|||++++.+| +++|+++++|++|||.+||||+||++||+||+||+|| T Consensus 1 m~~~~ip~s~iV~V~~~v~~~~~~~~~~~~lllt~~~~~~~~r~~~y~s~~~V~~~FG~~S~ey~aA~~yFs~~~~q~~~ 80 (501) T protein:vir:36 1 MPTTTIPIDQIVQMLPGVIGAGGAPGRLTGLVLTQDTSVQPGQLADFFQETDVENWFGALSNEAKIADAYFPGIVNGGQL 80 (501) T ss_pred CCcCCcccceEEEEeeeeccCCCcceeeeeEEEeccCCCCCcceeeecCHHHHHHhcCCChHHHHHHHHHhhcccCCCcc Confidence 9999999999999999999999999999999999988877 4679999999999999999999999999999999999 Q ss_pred CCEEEEEeeecccceeeEeeccccccchhhheeeeeEEEEEEccceEEEEeeccccCchHHHHHhhhhhhee---eEEEe Q lcl|NC_017984. 
78 PNSLFITKYNLTDVPASLIGGDITSTTLADLKLINGTLTIVVDGVSKSVPVDLATANSYSDAAALIATALTL---PCTYE 154 (487) Q Consensus 78 P~~l~igr~~~~~~~~~l~g~~~~~~~~~~~~~~~g~~~iti~g~~~~~~i~~s~ats~~~vA~~i~t~l~a---~vt~d 154 (487) |++||||||++++++++|+|++++..+++++++++|+|+|++||+.++..||||.+++++++|+.|+++|++ +|+|| T Consensus 81 P~~l~igR~~~~a~~~~l~g~~l~~~~~a~~~~~sg~l~vti~g~~~~~~i~lS~~ts~~~vA~~i~~al~~~~~tv~~d 160 (501) T protein:vir:36 81 PYDLKFARYVAADAPASVYGIPLTGVTLAQLQGYSGTLTVTTAAQHVSANISLAAATSFANAATLIEAAFTSPDFVVAYD 160 (501) T ss_pred ccEEEEEeecCcCcceeEeccchhhhhhhhccceeEEEEEEecceeeeeecccccccCHHHHHHHHhhhhcCcceEEEEc Confidence 999999999999999999999999999999999999999999999888779999999999999999999974 68999 Q ss_pred cccceEEEEecccccceeEEecccc--hhhhhhhccccce-eEecCcccccHHHHHHHHHhcccceeEEEEEeccCChhH Q lcl|NC_017984. 155 STVKGFVIKSGTSGANSTISFATGD--ISDDLKLTQETGA-VLNNHTAADTPTTGALNALAFSQNFVNITYSEGVFNEDA 231 (487) Q Consensus 155 ~~~~~f~its~t~g~~stit~atgd--~a~~l~lt~~~gA-~~~~G~aaet~~~al~a~~~~~~~wy~~~~~~~~~~~~~ 231 (487) +..+||+++++++|..++|+++++. ++.+|+|++++++ +.++|.++|+|.++|.++.+.+++||+|.++++ .++++ T Consensus 161 ~~~~~f~i~s~t~G~~~~i~~~t~~~~ia~~l~Lt~~~~a~v~~~g~~~et~~~al~a~~~~s~~Wy~f~~a~~-~~~~~ 239 (501) T protein:vir:36 161 ALRNRFTVVTNATGTAAAISAVTGTNNFADEIGLSAAAGATLQAAGVAADTPASAMNRAVGLSRNWATFTTAWT-AVIAD 239 (501) T ss_pred CcceeEEEEeccCCcceeeEeeecccchhhhhcccccCcceEEecccccccHHHHHHHHHhccCceEEEEEecC-CChHH Confidence 9999999999999999999999764 8999999999988 569999999999999999999999999998876 56778 Q ss_pred HHHHHHHHhccCceEEEEEccccccc-cccchH-HHHHHHhCCcceEEEecCCCchHHHHHHHHHhcCcCcCCceeeeee Q lcl|NC_017984. 232 LKDLALWVTSQNSRFKLYTWGLDPVA-LGQSGA-SFGEWAKENTSGVVPLYGTFDKAAFFCGVSGSINYQEENGRTTTAF 309 (487) Q Consensus 232 i~a~A~w~~a~~~~~~~~~~~~~~~~-~~~~~~-~~~~l~~~~~~~t~~~y~~~~~~a~~~g~~as~~~~~~~gs~T~~f 309 (487) ++++|+|+|++++||+++.++.+... 
.....+ +...|+.++|.|++++||+.+++++++|+++++||++.||++|||| T Consensus 240 ~la~A~wiea~~~~f~~~~~~~~~~~~~~~~~~~i~~~l~~~~y~~t~~~y~~~~~~aa~~g~~as~nf~~~~g~~T~~f 319 (501) T protein:vir:36 240 RLAFASWNSGQAYKYMYVAPDLEAASIVSNNAASFGAQVFAAPYQGTLPLYGDQATAGAVMGYAASINFQLRNGRTVLAF 319 (501) T ss_pred HHHHHHHHhhcCceEEEEEecCchhhhhccchhhHHHHHHhcCCCcEEEEcCCCCHHHHHHHHHHhcCcccCcceeeeec Confidence 88999999999999999888876543 344444 4566778899999999999999999999999999999999999999 Q ss_pred eec-CcccccCCCHHHHHHHHhCCceEEEEecCCCceEEEEECCEEcCCceehHHHHHHHHHHHHHHHHHHHHHHhcCCC Q lcl|NC_017984. 310 RSQ-DGLVPDVTNEADAETLVKNGYSFYGAWATANDRFQFAGNGSVTGQYKWIDNFDFQVFLRTQLQLAYMNMFQAQKTI 388 (487) Q Consensus 310 k~l-~Gv~~~~lt~t~~~al~~~~~n~y~~~~~~~~~~~~~~~G~~sgg~~~iD~~~~~dWl~~~iq~~l~~ll~~~~kI 388 (487) ||+ +||+|++++++|+++|++||||||+.|++.+++++|+++|+|||+++|||++||+||||++||++|++||.+++|| T Consensus 320 kq~~~Gi~a~~l~~t~a~al~~~~~N~y~~~~~~~~~~~~~~~G~~sG~~~wiD~~~g~dWL~~~iq~~l~~ll~~~~KI 399 (501) T protein:vir:36 320 RQFNAGVPATVHDLPTANALRSNNYTYIGAYANAANNYTIAYDGKLSGKFLWVDTYLDQIYLNAELQRAEFEAMLAYNSL 399 (501) T ss_pred cccCCCcCcCcCCHHHHHHHHhcCCcEEEEEecccceeeEEEcCeeeccchhhhHHHhHHHHHHHHHHHHHHHHhcCCCC Confidence 996 8999999999999999999999999999999999999999999998899999999999999999999999999999 Q ss_pred CcCHhhHHHHHHHHHHHHHHHHhcCccccCcccCccccccccccccCc-cccceeeeeeEEeccC--CCHHHHhhcccCC Q lcl|NC_017984. 389 PYNDQGIATVRAYSQDPIDQGINFGGIRAGVNLSNAQKFQVNQEAGFD-AASQLFTKGWALSVTL--PDSQTRVARESFI 465 (487) Q Consensus 389 Pyt~~G~~~l~~~v~~vl~~a~~nG~Ia~Gv~l~~~q~~~~~~~~g~~-~~~~~~~~Gy~~~~~~--~s~~dra~R~~~~ 465 (487) |||+.|+++|+++|+++|+||++||+|+|||++++.|+++|+++.|.+ .++++++||||+++.. 
.+++||++|++|+ T Consensus 400 Pytd~G~~~l~a~i~~~l~~av~nG~I~~Gv~~~~~q~~~I~~~~g~~~~~~~v~~~Gyy~~~~~~~~~~~~R~~R~~p~ 479 (501) T protein:vir:36 400 PYNEDGYTGLYRAGVDVIDAAVTSGIIRAGVTLTNSQLQQIDKAARVAGAGQLVQTRGWYFLIGDPANPGQARQNRTTPA 479 (501) T ss_pred ccChhhHHHHHHHHHHHHHHHHhCceeecCCCCCcccceeecccccccccccceeccceEEeeCcccCChhhhhhcccCc Confidence 999999999999999999999999999999999999999999999987 5678999999999854 5688999999999 Q ss_pred eEEEEEECCeEEEEEEEEEeeC Q lcl|NC_017984. 466 IKLFYTDGSSMQRLEMTATNVQ 487 (487) Q Consensus 466 i~~~~~~aGAIh~v~i~gt~vq 487 (487) ++|+|+++||||+|+|..++|= T Consensus 480 ~~~~y~~~gaIh~v~i~s~~v~ 501 (501) T protein:vir:36 480 CTLWYSDGGSIQSLTIGSNAVI 501 (501) T ss_pred EEEEEEeCCceeEEEeeeeeeC Confidence 9999999999999999998888 No 4 >protein:vir:78611 Length: 501 # NCBI annotation: BcepNY3gp03 # Family: family:all:396 # MgeID: mge:1854 # MgeName: BcepNY3 # Cross-refs: genbank:acc:YP_001294840;genbank:gi:149882903;genbank:GeneID:5291082 Probab=100.00 E-value=4.8e-161 Score=899.49 Aligned_cols=486 Identities=31% Similarity=0.485 Sum_probs=457.3 Q ss_pred CCcCCccccceEEEeeeeecccccccccceeEEecCccee---eeeeccHHHHHHhcCCChHHHHHHHHHhhcccCCccC Q lcl|NC_017984. 1 MQFNSIPASNIAAVYPAVIGGGGNPLGLNTNLFVQDAIYP---NYEYFSNTLVGQHYGLESPIYKFATVYFNGFRNATTR 77 (487) Q Consensus 1 ~~~~~ip~s~iV~V~~~~~~~~~~~~~~~~ll~~~~~~~~---~~~y~s~~~V~~~fg~~s~ey~aA~~yF~g~~~q~p~ 77 (487) ||+|+||+||||||+|+|+++++++++|++|||++++.+| +++|+++++|++|||.+||||+||++||+|++||+|| T Consensus 1 m~~~~ip~s~iV~V~~~v~~~~~~~~~~~~lll~~~~~~~~~r~~~y~s~~~V~~~FG~~S~ey~aA~~yFs~~~~q~~~ 80 (501) T protein:vir:78 1 MPTTTIPIDQIVQMLPGVIGAGGAPGRLTGLVLTQDTSIQPGQLADFFQKTDVENWFGGLSNEAVIADAYFPGIVNGGQL 80 (501) T ss_pred CCcCccccceEEEEeeecccCCCcceeeeeEEEecCCCCCccceeeecCHHHHHHhcCCChHHHHHHHHHhhcCCCCCcc Confidence 9999999999999999999999999999999999988887 4579999999999999999999999999999999999 Q ss_pred CCEEEEEeeecccceeeEeeccccccchhhheeeeeEEEEEEccceEEEEeeccccCchHHHHHhhhhhhee---eEEEe Q lcl|NC_017984. 
78 PNSLFITKYNLTDVPASLIGGDITSTTLADLKLINGTLTIVVDGVSKSVPVDLATANSYSDAAALIATALTL---PCTYE 154 (487) Q Consensus 78 P~~l~igr~~~~~~~~~l~g~~~~~~~~~~~~~~~g~~~iti~g~~~~~~i~~s~ats~~~vA~~i~t~l~a---~vt~d 154 (487) |++||||||++++++++|+|++++..++++++.++|+|+|++||+.++..||||.+++++++|+.|+++|++ +|+|| T Consensus 81 P~~l~igR~~~~a~~~~l~g~~l~~~~la~~~~~~G~l~iti~g~~~~~~i~~S~~ts~~~vA~~i~~al~a~~~tv~~d 160 (501) T protein:vir:78 81 PYDLKFARYVAADAPASVYGIPLTGVTLTQLQGYSGTLTVTTAAQHVSSNISLAAATSFANAATLIEAAFTSPDFVVSYD 160 (501) T ss_pred cceEEEEeecccCcceeEeccceeccchhhhceeeeEEEEEeccceeeeccccccccCHHHHHHHHHhhhcCcceEEEEc Confidence 999999999999999999999999999999999999999999998777779999999999999999999974 59999 Q ss_pred cccceEEEEecccccceeEEecccc--hhhhhhhcccccee-EecCcccccHHHHHHHHHhcccceeEEEEEeccCChhH Q lcl|NC_017984. 155 STVKGFVIKSGTSGANSTISFATGD--ISDDLKLTQETGAV-LNNHTAADTPTTGALNALAFSQNFVNITYSEGVFNEDA 231 (487) Q Consensus 155 ~~~~~f~its~t~g~~stit~atgd--~a~~l~lt~~~gA~-~~~G~aaet~~~al~a~~~~~~~wy~~~~~~~~~~~~~ 231 (487) ++.+||+++++++|..++|+++++. ++.+|||++++++. .++|.++|+|.++|.++.+.+++||+|.++++ .++++ T Consensus 161 s~~~~f~its~t~G~~~~i~~~t~~~~~a~~l~Lt~~~~a~v~~~g~~aet~~~a~~a~~~~~~~Wy~f~~a~~-~~~~~ 239 (501) T protein:vir:78 161 ALRNRFVVNTNATGTAAAISAVTGTNNLADELGLSAAAGASLQAAGVAADTPASAMNRAVGLSRNWATFTTAWT-AVIAD 239 (501) T ss_pred cccceEEEEeeecCCceeEEEEecccchhhhhcccccCceeeEeccccccCHHHHHHHHHhccCceEEEEEecC-CCHHH Confidence 9999999999999999999999764 79999999999874 59999999999999999999999999998876 57888 Q ss_pred HHHHHHHHhccCceEEEEEccccccc-cccchHH-HHHHHhCCcceEEEecCCCchHHHHHHHHHhcCcCcCCceeeeee Q lcl|NC_017984. 232 LKDLALWVTSQNSRFKLYTWGLDPVA-LGQSGAS-FGEWAKENTSGVVPLYGTFDKAAFFCGVSGSINYQEENGRTTTAF 309 (487) Q Consensus 232 i~a~A~w~~a~~~~~~~~~~~~~~~~-~~~~~~~-~~~l~~~~~~~t~~~y~~~~~~a~~~g~~as~~~~~~~gs~T~~f 309 (487) ++++|+|+|++++||+++.++.+... ...+++. 
...|+.++|.|++++||+++.+++++|+++++||++.+|++|||| T Consensus 240 ~lalA~wiea~~~~f~~~~~~~~~~~~~~~~~~~i~~~l~a~~y~~t~~~y~~~~~~aa~~g~~as~nf~~~~g~~T~~f 319 (501) T protein:vir:78 240 RLALASWNSGQAYKYMYVAPDLEPASIVTNNSASFGAQVFAAPYQGTLPLYGDQATAGAVMGYAASINFQLRNGRTVLAF 319 (501) T ss_pred HHHHHHHHHhcCceEEEEEecCCcceeecccchhHHHHHhhcCCCceEEEcCCcchHHHHHHHHHhcCcccCcceeeeec Confidence 88999999999999999888876543 3444444 466788899999999999999999999999999999999999999 Q ss_pred eec-CcccccCCCHHHHHHHHhCCceEEEEecCCCceEEEEECCEEcCCceehHHHHHHHHHHHHHHHHHHHHHHhcCCC Q lcl|NC_017984. 310 RSQ-DGLVPDVTNEADAETLVKNGYSFYGAWATANDRFQFAGNGSVTGQYKWIDNFDFQVFLRTQLQLAYMNMFQAQKTI 388 (487) Q Consensus 310 k~l-~Gv~~~~lt~t~~~al~~~~~n~y~~~~~~~~~~~~~~~G~~sgg~~~iD~~~~~dWl~~~iq~~l~~ll~~~~kI 388 (487) ||+ +||+|++++++|+++|++||||||+.|++.+++++|+++|+|||+++|||+++|+|||+++||++|++||.+++|| T Consensus 320 kq~~~Gv~a~~l~~t~a~al~~~~~N~y~~~~~~~~~~~~~~~G~~sG~~~wiD~~~~~~Wl~~~iq~~l~~ll~~~~kI 399 (501) T protein:vir:78 320 RQFNAGVPATAHDLGTANALRSNNYTYIGAYANAANNYTIAYDGKLSGKFLWVDTYLDQIYLNAELQRAEFEAMLAYNSL 399 (501) T ss_pred cccCCCcCcccCCHHHHHHHHhcCCeEEEEEecccceeeEEEcCeeeccceeehhhhhHHHHHHHHHHHHHHHHHhCCCc Confidence 996 8999999999999999999999999999999999999999999998899999999999999999999999999999 Q ss_pred CcCHhhHHHHHHHHHHHHHHHHhcCccccCcccCccccccccccccCc-cccceeeeeeEEeccC--CCHHHHhhcccCC Q lcl|NC_017984. 389 PYNDQGIATVRAYSQDPIDQGINFGGIRAGVNLSNAQKFQVNQEAGFD-AASQLFTKGWALSVTL--PDSQTRVARESFI 465 (487) Q Consensus 389 Pyt~~G~~~l~~~v~~vl~~a~~nG~Ia~Gv~l~~~q~~~~~~~~g~~-~~~~~~~~Gy~~~~~~--~s~~dra~R~~~~ 465 (487) |||+.|+++|+++|+++|+||++||+|+|||+|++.|+++++++.|.+ .++++++||||+++.. 
.++++|++|++|+ T Consensus 400 Pyt~~G~~~l~a~v~~~l~~av~nG~I~~Gv~l~~~q~~~I~~~~g~~~~~~~~~~~Gyy~~~~~~~~~~~~R~~R~~p~ 479 (501) T protein:vir:78 400 PYNEDGYTALYRAGVDVIDAAVTSGIIRAGVTLTNSQLQQIDAAAGVAGAGQLVQMRGWYFLIGDPANPGQARQNRTTPT 479 (501) T ss_pred ccCHHHHHHHHHHHHHHHHHHHhCceeecCCCCCCccceeeccccCccccccceeccceEEeeccccCChhhhhhcccCc Confidence 999999999999999999999999999999999999999999999987 5688999999999865 4578999999999 Q ss_pred eEEEEEECCeEEEEEEEEEeeC Q lcl|NC_017984. 466 IKLFYTDGSSMQRLEMTATNVQ 487 (487) Q Consensus 466 i~~~~~~aGAIh~v~i~gt~vq 487 (487) ++|+|+++||||+|+|..++|= T Consensus 480 ~~~~y~~~gaIh~v~i~s~~v~ 501 (501) T protein:vir:78 480 CTLWYSDGGSIQELTIGSNAVI 501 (501) T ss_pred EEEEEEeCCceeEEEeeeeecC Confidence 9999999999999999998888 No 5 >protein:vir:94073 Length: 494 # NCBI annotation: hypothetical protein # Family: family:all:396 # MgeID: mge:1493 # MgeName: OP2 # Cross-refs: genbank:acc:YP_453619;genbank:gi:84662655;genbank:GeneID:5142583 Probab=100.00 E-value=2.3e-157 Score=879.29 Aligned_cols=483 Identities=31% Similarity=0.477 Sum_probs=455.7 Q ss_pred cCCccccceEEEeeeeecccccccccceeEEecCccee---eeeeccHHHHHHhcCCChHHHHHHHHHhhcccCCccCCC Q lcl|NC_017984. 3 FNSIPASNIAAVYPAVIGGGGNPLGLNTNLFVQDAIYP---NYEYFSNTLVGQHYGLESPIYKFATVYFNGFRNATTRPN 79 (487) Q Consensus 3 ~~~ip~s~iV~V~~~~~~~~~~~~~~~~ll~~~~~~~~---~~~y~s~~~V~~~fg~~s~ey~aA~~yF~g~~~q~p~P~ 79 (487) ||+||+||||||+|+|+++++++|+|++|||++++.+| +++|+++++|++|||.+||||+||++||+||+||+|||+ T Consensus 1 m~~ip~s~iV~V~~~v~~~~~~~~~f~~~l~~~~~~~~~~r~~~y~s~~~V~~~FG~~S~ey~aA~~yFs~~~~q~p~P~ 80 (494) T protein:vir:94 1 MPNIPISQIVSINPQVVSAGGTQGTLDGLLLTQATGFPVTQPQVYFSAADVGTAFGLTSDEYNAALVYFAGILGGGQQPA 80 (494) T ss_pred CCCCCcccEEEeeeeccccCCcccccceeEeecCccCCccceeeecCHHHHHHhcCCChHHHHHHHHHhhhccCCCcccc Confidence 99999999999999999999999999999999988877 468999999999999999999999999999999999999 Q ss_pred EEEEEeeecccceeeEeeccccccchhhheeeeeEEEEEEccceEEEEeeccccCchHHHHHhhhhhhe---eeEEEecc Q lcl|NC_017984. 
80 SLFITKYNLTDVPASLIGGDITSTTLADLKLINGTLTIVVDGVSKSVPVDLATANSYSDAAALIATALT---LPCTYEST 156 (487) Q Consensus 80 ~l~igr~~~~~~~~~l~g~~~~~~~~~~~~~~~g~~~iti~g~~~~~~i~~s~ats~~~vA~~i~t~l~---a~vt~d~~ 156 (487) +||||||++++++++|+|+++. .+++.++..+|+|+|+|||+.++..||||.+++++++|+.|+++|+ ++|+||++ T Consensus 81 ~l~igR~~~~a~~~~l~g~~~~-~tl~~~~~~~g~l~iti~g~~~~~~i~lS~~ts~~~vA~~i~~ai~~a~~~v~~d~~ 159 (494) T protein:vir:94 81 SLTIGRYASAATSAAVFGAPLT-LSLAQLQTLSGTLIVTTDTQRTSAAINLSGATSFANAASLMTSGFTTPNFAITYDAQ 159 (494) T ss_pred EEEEEeecCccccceeeccchh-hhHHhhhhcceEEEEEEcceEEEeeecccccCChhhHHHHHhhhhccccceEEEccc Confidence 9999999999999999999985 6788888889999999999766666999999999999999999996 36999999 Q ss_pred cceEEEEecccccceeEEecccchhhhhhhccccce-eEecCcccccHHHHHHHHHhcccceeEEEEEeccCChhHHHHH Q lcl|NC_017984. 157 VKGFVIKSGTSGANSTISFATGDISDDLKLTQETGA-VLNNHTAADTPTTGALNALAFSQNFVNITYSEGVFNEDALKDL 235 (487) Q Consensus 157 ~~~f~its~t~g~~stit~atgd~a~~l~lt~~~gA-~~~~G~aaet~~~al~a~~~~~~~wy~~~~~~~~~~~~~i~a~ 235 (487) .+||+|+++++|+.++|++++++++..|||++++++ +.++|.++|+|.++|.++.+++++||+|.++++ .++++++++ T Consensus 160 ~~~f~v~s~ttG~~s~is~~t~~~a~~l~lt~~~~a~v~~~g~~aet~~~a~~a~~~~~~~Wy~f~~~~~-~~~~~ilal 238 (494) T protein:vir:94 160 RRRFVLSTTATGTTASVSAVTGTLADGVGLSTASGAYVEGSGLAADTAASALDRLAASSSTWAIFTTAWA-ASLSDRTAL 238 (494) T ss_pred CcEEEEEEccCCceeEEEEeccchhhhhhhhccccceEeecCcccccHHHHHHHHHhccCceEEEEEecC-CCHHHHHHH Confidence 999999999999999999999999999999999888 567999999999999999999999999999876 567888899 Q ss_pred HHHHhccCceEEEEEccccccc-cc-cchHHHHHHHhCCcceEEEecCCCchHHHHHHHHHhcCcCcCCceeeeeee-ec Q lcl|NC_017984. 236 ALWVTSQNSRFKLYTWGLDPVA-LG-QSGASFGEWAKENTSGVVPLYGTFDKAAFFCGVSGSINYQEENGRTTTAFR-SQ 312 (487) Q Consensus 236 A~w~~a~~~~~~~~~~~~~~~~-~~-~~~~~~~~l~~~~~~~t~~~y~~~~~~a~~~g~~as~~~~~~~gs~T~~fk-~l 312 (487) |+|+|+++++|+|+.++.+... .. 
..+++...|+.++|.|++++||+..++++++|++++++|+..+|++||||| ++ T Consensus 239 A~wiea~~~~~~~~~~~~d~~~~~~~~~~~i~~~l~~~~y~~t~~~y~~~~~~aa~~g~~aa~~~~~~~g~~T~~~k~q~ 318 (494) T protein:vir:94 239 AQWTSDQVFRRIYAAWDQDAAGLSVNNVSSFGNIVKTTPFSNTIPVYGLLANAMIVLAWGASTNLQIAEGRTTLALRSPV 318 (494) T ss_pred HHHHhhcCccEEEEEecCCcceeecccchhHHHHHHhhcCCceEEEcCCCChHHHHHHHHHhccccccCcceeEEeeccC Confidence 9999999999999888876543 33 344555678888999999999999999999999999999999999999999 68 Q ss_pred CcccccCCCHHHHHHHHhCCceEEEEecCCCceEEEEECCEEcCCceehHHHHHHHHHHHHHHHHHHHHHHhcCCCCcCH Q lcl|NC_017984. 313 DGLVPDVTNEADAETLVKNGYSFYGAWATANDRFQFAGNGSVTGQYKWIDNFDFQVFLRTQLQLAYMNMFQAQKTIPYND 392 (487) Q Consensus 313 ~Gv~~~~lt~t~~~al~~~~~n~y~~~~~~~~~~~~~~~G~~sgg~~~iD~~~~~dWl~~~iq~~l~~ll~~~~kIPyt~ 392 (487) +|++|++++++|+++|++||||||++|+++++++.|+++|+|+|.+.|||.+++++|||++||++|++||++++|||||+ T Consensus 319 ~gi~~~~l~~t~a~al~~~~~N~y~~~~~~~~~~~~~~gg~~sG~~~~id~~~~~~WL~~~iq~~l~~ll~~~~KIPytd 398 (494) T protein:vir:94 319 SSAGVRVDNLANANALLSNGYTYLGKYASATNTYTVTYNGAIGGQFLWADTALGWIALRRNLQQALFETLLAYRSLPYNA 398 (494) T ss_pred CCCCCccCCHHHHHHHHhcCCeEEEEecccCceEEEecCceeccccceeeeeccHHHHHHHHHHHHHHHHHhCCCcccCh Confidence 99999999999999999999999999999999999999999887788999999999999999999999999999999999 Q ss_pred hhHHHHHHHHHHHHHHHHhcCccccCcccCccccccccccccCccccceeeeeeEEec-cCCCHHHHhhcccCCeEEEEE Q lcl|NC_017984. 
393 QGIATVRAYSQDPIDQGINFGGIRAGVNLSNAQKFQVNQEAGFDAASQLFTKGWALSV-TLPDSQTRVARESFIIKLFYT 471 (487) Q Consensus 393 ~G~~~l~~~v~~vl~~a~~nG~Ia~Gv~l~~~q~~~~~~~~g~~~~~~~~~~Gy~~~~-~~~s~~dra~R~~~~i~~~~~ 471 (487) .|+++|+++|+++|+||++||+|+|||+|++.|++++++++|+++++++++||||+++ .++|+++|++|.+|+++|+|+ T Consensus 399 ~G~~~l~a~i~~~l~~av~nG~I~~Gv~~~~~q~~~i~~~~G~~~~~~~~~kGyy~~~~~~~s~~~ra~R~~~~~~~~y~ 478 (494) T protein:vir:94 399 DGYNALYQGAQDVVSQFVAAGVIRAGVALSASQRAQIDQAAGVPISGDVVDKGWYLQVIDPITTTVRTDRGSPTVNFWYC 478 (494) T ss_pred hhHHHHHHHHHHHHHHHHhCceeecccccCcchhhhhhhhhcCccccceeccceeeeccCCCChhhhhccccCCceEEEE Confidence 9999999999999999999999999999999999999999999999999999999997 569999999999999999999 Q ss_pred ECCeEEEEEEEEEeeC Q lcl|NC_017984. 472 DGSSMQRLEMTATNVQ 487 (487) Q Consensus 472 ~aGAIh~v~i~gt~vq 487 (487) ++||||+|+|++|+|= T Consensus 479 ~~GAIh~v~i~~~~v~ 494 (494) T protein:vir:94 479 DGGSIQRVVVSATTVI 494 (494) T ss_pred ecCcEEEEEEeeEEeC Confidence 9999999999999999 No 6 >protein:vir:99586 Length: 507 # NCBI annotation: hypothetical protein # Family: family:all:396 # MgeID: mge:1544 # MgeName: BcepF1 # Cross-refs: genbank:acc:YP_001039795;genbank:gi:126011045;genbank:GeneID:4818281 Probab=100.00 E-value=1.8e-150 Score=841.51 Aligned_cols=477 Identities=21% Similarity=0.280 Sum_probs=432.3 Q ss_pred CccccceEEEeeeeeccccccccc-ceeEEecCccee---eeeeccHHHHHHhcCCChHHHHHHHHHhhcccCCccCCCE Q lcl|NC_017984. 
5 SIPASNIAAVYPAVIGGGGNPLGL-NTNLFVQDAIYP---NYEYFSNTLVGQHYGLESPIYKFATVYFNGFRNATTRPNS 80 (487) Q Consensus 5 ~ip~s~iV~V~~~~~~~~~~~~~~-~~ll~~~~~~~~---~~~y~s~~~V~~~fg~~s~ey~aA~~yF~g~~~q~p~P~~ 80 (487) =||+||||||+|+|+++++.+++| ++|||+.++++| +++|+++++|++|||.+||||+||++||++++|+.+||++ T Consensus 1 mip~s~iVnV~~~v~~~a~~~~~~~~~lilt~~~~~~~~r~~~y~s~~~V~~~FG~~S~ey~aA~~yFsq~p~~~~~P~~ 80 (507) T protein:vir:99 1 MISQSRYVRIVSGVGAGAPVAQRRLIMRVMTTNAVLPPGVVFESSSADAVGAYFGMASEEYKRAKAYMSFISKSINSPSY 80 (507) T ss_pred CCCccceeEEeeeccccCcccccccceeeeccccCCCccceEeecCHHHHHHhcCCChHHHHHHHHHhccCCCCCcccce Confidence 489999999999999999999987 578888888877 4689999999999999999999999999999999889999 Q ss_pred EEEEeeecccceeeEeeccccccchhhheeeeeEEEEEEccceEEEE-eeccccCchHHHHHhhhhhhe---------ee Q lcl|NC_017984. 81 LFITKYNLTDVPASLIGGDITSTTLADLKLINGTLTIVVDGVSKSVP-VDLATANSYSDAAALIATALT---------LP 150 (487) Q Consensus 81 l~igr~~~~~~~~~l~g~~~~~~~~~~~~~~~g~~~iti~g~~~~~~-i~~s~ats~~~vA~~i~t~l~---------a~ 150 (487) ||||||++++++++|+|++++....+++++.+|+|+|+|||+.+.++ ||||.+++|+++|+.|+++|+ +. T Consensus 81 L~igR~~~~~~~a~l~g~~~~~~l~~~~~~~~G~lti~v~G~~~t~~~i~lS~~ts~~~vAs~i~~~l~a~~~~~~~~~t 160 (507) T protein:vir:99 81 ISFARWVNAAIASMIVGDSLVKNLPALKAVATPTLSLSIGGTVVPIAGIDLTAALTLTDVAATLQTKIRASANAELATAT 160 (507) T ss_pred EEEEeecCccccceeecchhhhhHHHHhhhcceeEEEEEcCceeEeccccccccCCHHHHHHHHHHhhhccccccccceE Confidence 99999999999999999999998888999999999999999998875 999999999999999999997 46 Q ss_pred EEEecccceEEEEecccccceeEEecc----c-chhhhhhhccccceeEecCcccccHHHHHHHHHhcccceeEEEEEec Q lcl|NC_017984. 151 CTYESTVKGFVIKSGTSGANSTISFAT----G-DISDDLKLTQETGAVLNNHTAADTPTTGALNALAFSQNFVNITYSEG 225 (487) Q Consensus 151 vt~d~~~~~f~its~t~g~~stit~at----g-d~a~~l~lt~~~gA~~~~G~aaet~~~al~a~~~~~~~wy~~~~~~~ 225 (487) |+||++.+||+++++++|+.++++|++ | +++.++++ +++++++++|.++|+|.++|.++.+.+++||+|.+++. 
T Consensus 161 v~~d~~~~~F~v~s~~tG~~s~i~~at~~~~gt~~s~l~~~-~~~~a~~~~g~~aet~~~a~~a~~~~~~nW~~~~~a~~ 239 (507) T protein:vir:99 161 VTFNTTTNQFVLNGTTTGALAPTITAVRTDPATDISSLLGW-TNTGTVFVKGQAAETPDTSISKSAAISTNFGSFIYTST 239 (507) T ss_pred EEEecCCceEEEEeeeccccceeEEEEcCCchhhHHHHhcc-ccccceEeecccccCHHHHHHHHHhhcCCeEEEEEEec Confidence 999999999999999999999999986 3 35555555 57899999999999999999999999999999988765 Q ss_pred c-CChhHHHHHHHHHhccCceEEEEEccccccccccchHHHHH-HHhCCcceEEEec--CCCchHHHHHHHHHhcCcCcC Q lcl|NC_017984. 226 V-FNEDALKDLALWVTSQNSRFKLYTWGLDPVALGQSGASFGE-WAKENTSGVVPLY--GTFDKAAFFCGVSGSINYQEE 301 (487) Q Consensus 226 ~-~~~~~i~a~A~w~~a~~~~~~~~~~~~~~~~~~~~~~~~~~-l~~~~~~~t~~~y--~~~~~~a~~~g~~as~~~~~~ 301 (487) . +++++++++|+|+|++++||++..++.+..... .... +....+.++...+ +.++++++++|+++++||++. T Consensus 240 ~~~td~~~lalA~wiea~~~~f~~~~~~~~a~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~aa~~g~~as~nf~~~ 315 (507) T protein:vir:99 240 PALTNDQITAVASWNASQNNMYMYSVPTTIANIGT----LYAAVKGFSGCALNITSDSLPVDYIEQSPCEILAATDYTRV 315 (507) T ss_pred cccChHHHHHHHHHHhhcCcEEEEEEecCchhhhh----hhhhhhhcceeEEEeecccccchhHHHHHHHHHHhhccCcC Confidence 4 678889999999999999999988876543222 2222 2233344444332 346778999999999999999 Q ss_pred CceeeeeeeecCcccccCCCHHHHHHHHhCCceEEEEecCCCceEEEEECCEEcCCc-e--ehHHHHHHHHHHHHHHHHH Q lcl|NC_017984. 
302 NGRTTTAFRSQDGLVPDVTNEADAETLVKNGYSFYGAWATANDRFQFAGNGSVTGQY-K--WIDNFDFQVFLRTQLQLAY 378 (487) Q Consensus 302 ~gs~T~~fk~l~Gv~~~~lt~t~~~al~~~~~n~y~~~~~~~~~~~~~~~G~~sgg~-~--~iD~~~~~dWl~~~iq~~l 378 (487) ||++|||||++|||+|++++++|+++|++||||||++|++.+++++|+++|+|+||+ + |||.++++|||+++||++| T Consensus 316 ng~~T~~fk~l~GV~a~~lt~t~a~al~~~n~N~y~~~a~~~~~~~~~~~G~~~gG~~~fid~d~~~~~~WL~~~iq~~l 395 (507) T protein:vir:99 316 NATQNYMYYQFPSRNITVSDDTTANLVDANRGNYIGQTQSAGQSLAFYQRGILCGGPNDAVDMNIYANEIWLKSAISAQI 395 (507) T ss_pred ccceeecccccCCcccccCCHHHHHHHHhcCCeEEEEeccccceeeEEecCeeeCCcccceeeeeecchHHHHHHHHHHH Confidence 999999999999999999999999999999999999999999999999999999985 3 5667889999999999999 Q ss_pred HHHHHhcCCCCcCHhhHHHHHHHHHHHHHHHHhcCccccCcccCccccccccccccC-ccccceeeeeeEEeccC---CC Q lcl|NC_017984. 379 MNMFQAQKTIPYNDQGIATVRAYSQDPIDQGINFGGIRAGVNLSNAQKFQVNQEAGF-DAASQLFTKGWALSVTL---PD 454 (487) Q Consensus 379 ~~ll~~~~kIPyt~~G~~~l~~~v~~vl~~a~~nG~Ia~Gv~l~~~q~~~~~~~~g~-~~~~~~~~~Gy~~~~~~---~s 454 (487) ++||.+++|||||+.|+++|+++|+++|+||++||+|+|||+|++.|+++++++.|. ++++++++||||+++.. ++ T Consensus 396 ~~l~~~~~kIPyt~~G~~~l~a~i~~~l~~av~nG~I~~Gvtl~~~q~~~in~~~~~~~~~~~~~~~Gyy~~~~~~s~~~ 475 (507) T protein:vir:99 396 LSLFLNVPRVPANETGESMLLSVIQSVVNTAKNNGTISAGKNLNVIQQQYITQISGDANAWRQVANIGYWLNITFSSYTN 475 (507) T ss_pred HHHHhcCCCCccChhhHHHHHHHHHHHHHHHHhccccccCCcccccchheecccccccccccceeccceEEEeCChHhcC Confidence 999999999999999999999999999999999999999999999999999999986 59999999999999754 67 Q ss_pred HHHHhhcccCCeEEEEEECCeEEEEEEEEEee Q lcl|NC_017984. 
455 SQTRVARESFIIKLFYTDGSSMQRLEMTATNV 486 (487) Q Consensus 455 ~~dra~R~~~~i~~~~~~aGAIh~v~i~gt~v 486 (487) +++|++|++|+++|+|+++|+||+|+|++++| T Consensus 476 ~~~r~~r~~~~~~~~y~~~gaI~~v~~~~~~v 507 (507) T protein:vir:99 476 PNTQLTEWKASYQLIYSKDDAIRFVEGTDTLI 507 (507) T ss_pred hhhhhccccceEEEEEEeCCeEEEEEeeeecC Confidence 89999999999999999999999999999999 No 7 >protein:vir:96104 Length: 504 # NCBI annotation: hypothetical protein ORF029 # Family: family:all:396 # MgeID: mge:1597 # MgeName: F8 # Cross-refs: genbank:acc:YP_001294446;genbank:gi:149408343;genbank:GeneID:5237223 Probab=100.00 E-value=1.6e-148 Score=830.92 Aligned_cols=476 Identities=20% Similarity=0.286 Sum_probs=434.3 Q ss_pred CccccceEEEeeeeecccccccccc-eeEEecCccee---eeeeccHHHHHHhcCCChHHHHHHHHHhhcccCCccCCCE Q lcl|NC_017984. 5 SIPASNIAAVYPAVIGGGGNPLGLN-TNLFVQDAIYP---NYEYFSNTLVGQHYGLESPIYKFATVYFNGFRNATTRPNS 80 (487) Q Consensus 5 ~ip~s~iV~V~~~~~~~~~~~~~~~-~ll~~~~~~~~---~~~y~s~~~V~~~fg~~s~ey~aA~~yF~g~~~q~p~P~~ 80 (487) =||+||||||+|+|+++++.+++|+ +|||++++++| +++|+++++|++|||++||||+||++||++++|+.|||++ T Consensus 1 mip~s~iV~V~~~v~~~~~~~~~~~~~l~l~~~~~~~~~r~~~y~s~~~V~~~FG~~S~ey~aA~~yF~~~~~~~~~P~~ 80 (504) T protein:vir:96 1 MISQSRYIRIISGVGAGAPVAGRKLILRVMTTNNVIPPGIVIEFDNANAVLSYFGAQSEEYQRAAAYFKFISKSVNSPSS 80 (504) T ss_pred CCCccceeEeeecccccccccccccceeEeecccCCCccceEEecCHHHHHHhcCCChHHHHHHHHHhhcCCCCCccccE Confidence 4899999999999999999999886 67888888887 4689999999999999999999999999999999999999 Q ss_pred EEEEeeecccceeeEeeccccccchhhheeeeeEEEEEEccceEEEE-eeccccCchHHHHHhhhhhhee---------e Q lcl|NC_017984. 81 LFITKYNLTDVPASLIGGDITSTTLADLKLINGTLTIVVDGVSKSVP-VDLATANSYSDAAALIATALTL---------P 150 (487) Q Consensus 81 l~igr~~~~~~~~~l~g~~~~~~~~~~~~~~~g~~~iti~g~~~~~~-i~~s~ats~~~vA~~i~t~l~a---------~ 150 (487) ||||||++++++++|+|++++....+++++.+|+|+|+|||+.+.+. 
||||.+++|+++|+.|+++|++ + T Consensus 81 l~igR~~~~a~~~~l~g~~~~~~~~~~~~i~~G~lsitv~G~~~~~~~i~~S~~ts~~~vA~~i~~al~~~~~~~~~~~t 160 (504) T protein:vir:96 81 ISFARWVNTAIAPMVVGDNLPKTIADFAGFSAGVLTIMVGAAEKNITAIDTSAATSMDNVASIIQTEIRKNTDPQLAQAT 160 (504) T ss_pred EEEEeecCcCccceEEechhHHHHHHHhhhhceEEEEEEcceeeeecccccccccchHHHHHHHHhhhhcccccccccce Confidence 99999999999999999999998888999999999999999999886 9999999999999999999974 5 Q ss_pred EEEecccceEEEEecccccceeEEecc---cchhhhhhhccccceeEecCcccccHHHHHHHHHhcccceeEEEEEeccC Q lcl|NC_017984. 151 CTYESTVKGFVIKSGTSGANSTISFAT---GDISDDLKLTQETGAVLNNHTAADTPTTGALNALAFSQNFVNITYSEGVF 227 (487) Q Consensus 151 vt~d~~~~~f~its~t~g~~stit~at---gd~a~~l~lt~~~gA~~~~G~aaet~~~al~a~~~~~~~wy~~~~~~~~~ 227 (487) |+||++.+||+|+++++|..+.....+ .+++.+++++.+ ++++++|.++|+|.++|.++.+++++||+|.++++.. T Consensus 161 v~~d~~~~~f~its~~tg~~~~~~~~~a~~~~~~~~lgl~~~-~~~~v~g~~aet~~~al~al~~~~~~Wy~f~~a~~~~ 239 (504) T protein:vir:96 161 VTWNPNTNQFTLVGATIGTGVLAVAKSADPQDMSTALGWSTS-NVVNVAGQAADLPDAAVAKSTNVSNNFGSFLFAGATL 239 (504) T ss_pred EEEeccCCeEEEEeeccccceeEEEeeccccchhhhhhcccc-cceEEeecccccHHHHHHHHHhhcCCeEEEEEEeccC Confidence 999999999999999999877666654 368999999854 5668999999999999999999999999999998888 Q ss_pred ChhHHHHHHHHHhccCceEEEEEccccccccccchHHHHHHHhCCcceEEEecCC----CchHHHHHHHHHhcCcCcCCc Q lcl|NC_017984. 228 NEDALKDLALWVTSQNSRFKLYTWGLDPVALGQSGASFGEWAKENTSGVVPLYGT----FDKAAFFCGVSGSINYQEENG 303 (487) Q Consensus 228 ~~~~i~a~A~w~~a~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~t~~~y~~----~~~~a~~~g~~as~~~~~~~g 303 (487) ++++++++|+|+|++++||+|+.++.+. +..+.+. +.++++.+++++++. 
.+.+++++++++++||++.|| T Consensus 240 ~dd~ilalA~w~ea~~~~~~~~~~~~~~----~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~as~~f~~~ng 314 (504) T protein:vir:96 240 DNDQIKAVSAWNAAQNNQFIYTVATSLA----NLGALFD-LVKGNSGTALNVLSATASNDFVEQCPSEILAATNYDEPGA 314 (504) T ss_pred CHHHHHHHHHHHhhcCceEEEEEeeccc----chhhHHH-hhhhcceeEEEEeecCccchhHHHHHHHHHHhcCcCcccc Confidence 8999999999999999999999886543 2223333 344444555555542 234677899999999999999 Q ss_pred eeeeeeeecCcccccCCCHHHHHHHHhCCceEEEEecCCCceEEEEECCEEcCCce---ehHHHHHHHHHHHHHHHHHHH Q lcl|NC_017984. 304 RTTTAFRSQDGLVPDVTNEADAETLVKNGYSFYGAWATANDRFQFAGNGSVTGQYK---WIDNFDFQVFLRTQLQLAYMN 380 (487) Q Consensus 304 s~T~~fk~l~Gv~~~~lt~t~~~al~~~~~n~y~~~~~~~~~~~~~~~G~~sgg~~---~iD~~~~~dWl~~~iq~~l~~ 380 (487) ++|||||++|||+|++++++|+++|+++|||||+.+++.+++++|+++|+|+||+. |||+++++||||++||++|++ T Consensus 315 ~~T~~fk~l~GVta~~lt~t~~~aL~~~~~N~y~~~a~~~~~~~~~~~G~~~gG~~~~~wiDv~~~~~WL~~~lq~~l~~ 394 (504) T protein:vir:96 315 SQNYMYYQFPGRNITVSDDTAANTVDKSRGNYIGVTQANGQQLAFYQRGILCGGPTDAVDMNVYANEIWLKSAIAQALLD 394 (504) T ss_pred cccccccccCCcCcccCCHHHHHHHHhcCCeEEEEeecccceeeEEecCeeeCCccccchhhhhhhHHHHHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999863 899999999999999999999 Q ss_pred HHHhcCCCCcCHhhHHHHHHHHHHHHHHHHhcCccccCcccCccccccccccccCccc-cceeeeeeEEecc---CCCHH Q lcl|NC_017984. 381 MFQAQKTIPYNDQGIATVRAYSQDPIDQGINFGGIRAGVNLSNAQKFQVNQEAGFDAA-SQLFTKGWALSVT---LPDSQ 456 (487) Q Consensus 381 ll~~~~kIPyt~~G~~~l~~~v~~vl~~a~~nG~Ia~Gv~l~~~q~~~~~~~~g~~~~-~~~~~~Gy~~~~~---~~s~~ 456 (487) ||.+++|||||+.|+++|+++|+++|++|++||+|+|||++++.|+++++.+.|.+.. +++++||||+++. 
+++++ T Consensus 395 l~~~~~kIPyt~~Gi~~l~a~i~~vl~~av~~G~I~~Gv~~~~~q~~~I~~~~~~d~~~~~~~~~GYyv~~~~~s~~s~~ 474 (504) T protein:vir:96 395 LFLNVNAVPASMVGEAMTLAVLQPVLDKATSNGTFTYGKDISAVQQQYITQITGDRRAWRQVQTLGYWINITFSSYTNSN 474 (504) T ss_pred HHhcCCCcccCHhhHHHHHHHHHHHHHHHHhcceeccCccCCccchheecccccccccccceeccceEEEecChhccChh Confidence 9999999999999999999999999999999999999999999999999999999865 8899999999974 66799 Q ss_pred HHhhcccCCeEEEEEECCeEEEEEEEEEee Q lcl|NC_017984. 457 TRVARESFIIKLFYTDGSSMQRLEMTATNV 486 (487) Q Consensus 457 dra~R~~~~i~~~~~~aGAIh~v~i~gt~v 486 (487) ||++|++|+++|+|+++||||+|+|++++| T Consensus 475 ~r~~R~~~~~~~~y~~~gaI~~v~~~~~~v 504 (504) T protein:vir:96 475 TGLTEWKANYTLIYSKGDAIRFVEGSDVMI 504 (504) T ss_pred HhhhccccceEEEEEECCeEEEEEeccccC Confidence 999999999999999999999999999999 No 8 >protein:vir:107720 Length: 515 # NCBI annotation: gp17 # Family: family:all:396 # MgeID: mge:1520 # MgeName: BcepB1A # Cross-refs: genbank:acc:YP_024865;genbank:gi:48697507;genbank:GeneID:2948336 Probab=100.00 E-value=1.3e-148 Score=831.28 Aligned_cols=480 Identities=18% Similarity=0.240 Sum_probs=430.1 Q ss_pred CCcCCccccceEEEeeeeecccc-cccccceeEEecCcceee---eeeccHHHHHHhcCCChHHHHHHHHHhhcccCCcc Q lcl|NC_017984. 1 MQFNSIPASNIAAVYPAVIGGGG-NPLGLNTNLFVQDAIYPN---YEYFSNTLVGQHYGLESPIYKFATVYFNGFRNATT 76 (487) Q Consensus 1 ~~~~~ip~s~iV~V~~~~~~~~~-~~~~~~~ll~~~~~~~~~---~~y~s~~~V~~~fg~~s~ey~aA~~yF~g~~~q~p 76 (487) || ||.+++|+|++++..+++ ..|+|++|||+.++.+|+ ++|+|+++|++|||++||||+||++||+||+||+| T Consensus 1 m~---I~~~~~V~i~~~v~aa~~~~~~~f~~li~t~~~~~p~~r~~~y~s~~~V~~~FG~~S~ey~aA~~yFsg~~~q~p 77 (515) T protein:vir:10 1 MP---ISFDKYVAITSGVAAQQQIAARSFAIRVYTPNPMVSVDRLITATSAADVGAYFGTASEEYKRAVKNFGFISKKTR 77 (515) T ss_pred CC---CCceeEEEeecccccCCccccccceeeeeecccCCCccceeeecCHHHHHHhcCCChHHHHHHHHHhhhccCCcc Confidence 55 888888888888765555 456999999999998884 68999999999999999999999999999999999 Q ss_pred CCCEEEEEeeecccceeeEeeccccccchhhheeee-eEEEEEEccceE-EE-EeeccccCchHHHHHhhhhhhe----- Q lcl|NC_017984. 
77 RPNSLFITKYNLTDVPASLIGGDITSTTLADLKLIN-GTLTIVVDGVSK-SV-PVDLATANSYSDAAALIATALT----- 148 (487) Q Consensus 77 ~P~~l~igr~~~~~~~~~l~g~~~~~~~~~~~~~~~-g~~~iti~g~~~-~~-~i~~s~ats~~~vA~~i~t~l~----- 148 (487) ||++||||||++++++++|+|+.+.+.+++.++.++ |+|+|+|||+.+ .+ .||||.+++|+++|+.|+++|+ T Consensus 78 ~P~~L~igR~~~~a~~~~l~g~~~~~~~l~~~~~is~G~ltitidG~~~~t~s~i~~S~ats~~~vAs~i~tal~~~~~~ 157 (515) T protein:vir:10 78 RPTSIQFARWQREAGPVAIYGGAKKAAALATLQAVTAGAISFLFGGATTVTVSGISFSAATSLADVASELQTALRANADA 157 (515) T ss_pred cccEEEEEeccCcccceEEEeccchhhhHHhhhcccceeEEEEEcceEEEEeeccccccccCHHHHHHHHHhhhcccccc Confidence 999999999999999999999999999998888775 999999999875 44 5999999999999999999997 Q ss_pred ----eeEEEecccceEEEEecccccceeEEeccc-------chhhhhhhccccceeEecCcccccHHHHHHHHHhcccce Q lcl|NC_017984. 149 ----LPCTYESTVKGFVIKSGTSGANSTISFATG-------DISDDLKLTQETGAVLNNHTAADTPTTGALNALAFSQNF 217 (487) Q Consensus 149 ----a~vt~d~~~~~f~its~t~g~~stit~atg-------d~a~~l~lt~~~gA~~~~G~aaet~~~al~a~~~~~~~w 217 (487) ++|+||++.+||+|+++++|..++|++..+ +++.+|||++++++++++|.++|+|.++|.++.+.+++| T Consensus 158 ~~~~~tv~~d~~~~~F~v~s~~tG~~~~is~~~~t~~~~~t~~a~~lglt~~~~av~~~g~aaet~~~a~~a~~~~s~nW 237 (515) T protein:vir:10 158 NLATCTVSYDPVGARFNFAGSPSDDTVQESISIVPQSNPAIDVAQLLGWNSAQGASYIAASPVVSPVDTLIASVAGNNNF 237 (515) T ss_pred ccceeEEEEecCCCeEEEEEeecCCceeEEEEEecCCCchhhHHHHhccccccceEEecccccccHHHHHHHHHhccCCe Confidence 479999999999999999999999988743 479999999999999999999999999999999999999 Q ss_pred eEEEEEecc---CChhHHHHHHHHHhccCceEEEEEccccccccccchHHHHHHHhCCcceEEEecC--CCchHHHHHHH Q lcl|NC_017984. 218 VNITYSEGV---FNEDALKDLALWVTSQNSRFKLYTWGLDPVALGQSGASFGEWAKENTSGVVPLYG--TFDKAAFFCGV 292 (487) Q Consensus 218 y~~~~~~~~---~~~~~i~a~A~w~~a~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~t~~~y~--~~~~~a~~~g~ 292 (487) |+|.+++++ .+++++++++.|+|+++++|++...+.+. ..+...+..... 
+.+.++..+++ .++++++++|+ T Consensus 238 y~f~~a~~~~~~~~~a~~~a~a~~~e~~~~~~~~~~~~~~~-~~~~~~a~~~~~--~~~~~~~~~~~~~~~~~~a~~~g~ 314 (515) T protein:vir:10 238 GSILFTKNGGTGITLSDAEAIALQNQSYNVAYKFQVGVDDT-TYSSWQAALAAI--GGVNMIYSPVALAAEYHDMQDGII 314 (515) T ss_pred EEEEEeecCccccchhHHHHHHHHHhhcCceEEEEeccCcc-ceechhhhhhhh--hhcCceEEEEeccCcchHHHHHHH Confidence 999998653 45678889999999999998876654433 333333333332 33344443332 34778899999 Q ss_pred HHhcCcCcCCceeeeeeeecCcccccCCCHHHHHHHHhCCceEEEEecCCCceEEEEECCEEcCCc---eehHHHHHHHH Q lcl|NC_017984. 293 SGSINYQEENGRTTTAFRSQDGLVPDVTNEADAETLVKNGYSFYGAWATANDRFQFAGNGSVTGQY---KWIDNFDFQVF 369 (487) Q Consensus 293 ~as~~~~~~~gs~T~~fk~l~Gv~~~~lt~t~~~al~~~~~n~y~~~~~~~~~~~~~~~G~~sgg~---~~iD~~~~~dW 369 (487) ++++||++.+|++||||||+|||+|++++++|+++|++||||||+.|+++++.++|++||+|+||+ +|||++||+|| T Consensus 315 ~asvnf~~~ng~iT~kfKq~~Gita~~lt~t~a~al~~~~~N~Y~~~~~~~~~~~~~~~G~~~gG~~~~~WiD~~~g~~W 394 (515) T protein:vir:10 315 EAATDFTQQGGATGYMYVQFNNQTPAVNDDTLSGILDDLNINYYGQTQVNGTNLSFYQDGVMMGGPTDPRDSNVYANEQW 394 (515) T ss_pred HHhcCCCccchhheeccccCCCCccccCCHHHHHHHHhcCCeEEEEEeccCceEEEEeCCeeeCCccchhHHHHHhhHHH Confidence 999999999999999999999999999999999999999999999999999999999999999974 48999999999 Q ss_pred HHHHHHHHHHHHHHhcCCCCcCHhhHHHHHHHH-HHHHHHHHhcCccccCcccCccccccccccccCc-cccceeeeeeE Q lcl|NC_017984. 
370 LRTQLQLAYMNMFQAQKTIPYNDQGIATVRAYS-QDPIDQGINFGGIRAGVNLSNAQKFQVNQEAGFD-AASQLFTKGWA 447 (487) Q Consensus 370 l~~~iq~~l~~ll~~~~kIPyt~~G~~~l~~~v-~~vl~~a~~nG~Ia~Gv~l~~~q~~~~~~~~g~~-~~~~~~~~Gy~ 447 (487) |+++||++|++||.+++||||||.|+++|+++| +++|+||++||+|+|||+|++.|+++|++..|+| +++++++|||| T Consensus 395 L~~~iq~~l~~L~~s~~KIPytd~G~a~i~a~v~q~vl~~av~nG~I~~Gv~ls~~Q~~~i~~~~g~d~~~~~~~~~Gyy 474 (515) T protein:vir:10 395 LKSYAGASFMSLQLAQGKIPANIEGRGLLLGKMTKDIIPAAKLNGTFSIGKTLTVDQQLFVTELTGDDTAWQKVQNLGYW 474 (515) T ss_pred HHHHHHHHHHHHHhcCCCCccChhhHHHHHHHHHHHHHHHHHhCCeeecCcccchhHHHHHHhhhcCcccccchhhccee Confidence 999999999999999999999999999999987 5799999999999999999999999999999998 78999999999 Q ss_pred EeccCCCHHHHhhcccCCe--EEEEEECCeEEEEEEEEEee Q lcl|NC_017984. 448 LSVTLPDSQTRVARESFII--KLFYTDGSSMQRLEMTATNV 486 (487) Q Consensus 448 ~~~~~~s~~dra~R~~~~i--~~~~~~aGAIh~v~i~gt~v 486 (487) +++..++.+.|..|+.|.+ .|||+++|+||+|++++|+| T Consensus 475 ~~~~~~~~~~~~~r~~~~~~~~~~y~~g~~i~~i~~~~~~v 515 (515) T protein:vir:10 475 YDVQISSFVDTGGTTKYQAVYSLVYSKDDLIRKVVGTHTLI 515 (515) T ss_pred EecCcCCCCCcccccccCceeEEEEEcCceEEEEEeeeecC Confidence 9999999999999998855 69999999999999999999 No 9 >protein:vir:5260 Length: 502 # NCBI annotation: hypothetical protein # Family: family:all:396 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852765;genbank:gi:31544040;uniprot:Q7Y5T5;genbank:GeneID:2753559 Probab=100.00 E-value=6.6e-141 Score=789.09 Aligned_cols=465 Identities=15% Similarity=0.203 Sum_probs=417.0 Q ss_pred CCcCCccccceEEEeeeeecccccccccceeEEe-c---Ccc----eeeeeeccHHHHHHhcCCChHHHHHHHHHhhccc Q lcl|NC_017984. 1 MQFNSIPASNIAAVYPAVIGGGGNPLGLNTNLFV-Q---DAI----YPNYEYFSNTLVGQHYGLESPIYKFATVYFNGFR 72 (487) Q Consensus 1 ~~~~~ip~s~iV~V~~~~~~~~~~~~~~~~ll~~-~---~~~----~~~~~y~s~~~V~~~fg~~s~ey~aA~~yF~g~~ 72 (487) |+ ||+||||||+|++.+.++.+++|+.+||. 
+ +.+ .++++|+|+++|++|||++|||||||++||+ T Consensus 1 ms---ip~s~ivnV~i~~~~~a~~~~~f~~~l~l~~~~~~~~~~~~~r~~~~~s~~~V~~~FG~~s~ey~aA~~yF~--- 74 (502) T protein:vir:52 1 MA---LSISHIVNVQLNTVPKSAARKSFGIVALFTPEAGQAFADEKTRYVYVENQRDVEQLFGTNSETAKAAQPFFA--- 74 (502) T ss_pred CC---CCccceeEEeeccccccccccccCceEEEeeccCccccCCccceEEecCHHHHHHhcCCChHHHHHHHHHhc--- Confidence 64 99999999999999999999999976554 2 222 2367899999999999999999999999998 Q ss_pred CCccCCCEEEEEeeecccceeeEeeccccccchh-----hheeeeeEEEEEEccceEEEE-eeccccCchHHHHHhhhhh Q lcl|NC_017984. 73 NATTRPNSLFITKYNLTDVPASLIGGDITSTTLA-----DLKLINGTLTIVVDGVSKSVP-VDLATANSYSDAAALIATA 146 (487) Q Consensus 73 ~q~p~P~~l~igr~~~~~~~~~l~g~~~~~~~~~-----~~~~~~g~~~iti~g~~~~~~-i~~s~ats~~~vA~~i~t~ 146 (487) |+|||.+||||||+++++.++|+++.+.+..++ +..+.+|+|+++|||+.++++ ||||.+++++++|+.|+++ T Consensus 75 -q~p~P~~l~igR~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~G~l~i~i~g~~~t~~~i~lS~~ts~~~vA~~i~~~ 153 (502) T protein:vir:52 75 -QSPRAKQLIVARWQKSASTIEATKNTLSGATLSDDLERFKSVVNGRFSLTIGGDVKKVDGLSFARLADFNAVATKIQEK 153 (502) T ss_pred -CCCccceEEEEeccccccceeechhhhhhhhhHHhHHHhhhhcCceeEEEecceeeeeeccccccccchhHHHHHHHhh Confidence 999999999999999999999999998876653 334579999999999999886 9999999999999999999 Q ss_pred hee-----eEEEecccceEEEEecccccceeEEeccc--------chhhhhhhccccceeEe----cCcccccHHHHHHH Q lcl|NC_017984. 
147 LTL-----PCTYESTVKGFVIKSGTSGANSTISFATG--------DISDDLKLTQETGAVLN----NHTAADTPTTGALN 209 (487) Q Consensus 147 l~a-----~vt~d~~~~~f~its~t~g~~stit~atg--------d~a~~l~lt~~~gA~~~----~G~aaet~~~al~a 209 (487) +++ .|+||++.+||+++++++|..+++++..+ +++.+|+++...+++.+ +|.++|+|.++|.+ T Consensus 154 l~~~~~~~tv~~d~~~~~F~i~s~ttg~~~~~~~~~a~~~~~~gt~~a~~l~l~~~~~av~v~~~~~g~~aet~~~al~a 233 (502) T protein:vir:52 154 LTTLSVAVSIAYDETGNRFIVSANVAGEDKKTEIDYAIDEGGEGEYIGALLKLENGQASRKVGKNSVSLKKETLGEALFN 233 (502) T ss_pred hcccccceEEEEecCCceEEEEeccCCCcceeEEEEeecCCcchhHHHHHhccccccceeeeeeecccccccCHHHHHHH Confidence 963 68999999999999999998777555321 37899999999888764 58889999999999 Q ss_pred HHhcccceeEEEEEeccCChhHHHHHHHHHhccCceEEEEEccccccccccchHHHHHHHhCCcceEEEecCC--CchHH Q lcl|NC_017984. 210 ALAFSQNFVNITYSEGVFNEDALKDLALWVTSQNSRFKLYTWGLDPVALGQSGASFGEWAKENTSGVVPLYGT--FDKAA 287 (487) Q Consensus 210 ~~~~~~~wy~~~~~~~~~~~~~i~a~A~w~~a~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~t~~~y~~--~~~~a 287 (487) +.+++++||+|.++++ .++++++++|+|+|+++|+|++.+++.+. .....+++++.|+.++|.|++++||+ +++.+ T Consensus 234 ~~~~~~~w~~~~~a~~-~~~~~~la~a~~iea~~~~f~~~~~d~~~-~~~~~~~i~~~l~a~~~~~t~~~y~~~~~~~~a 311 (502) T protein:vir:52 234 VAEVNNTWYGFTVAAQ-LTDSEVEAAAKYAQANTKLFGANVIRAEQ-IEWSADNIYKKLYDAGLDHTLAMFDKNDMYPVS 311 (502) T ss_pred HHhccCceEEEEEeec-CChhHHHHHHHHHhhcCcEEEEEecCcce-eccccchHHHHHHhccCceeEEEecCCcchhHH Confidence 9999999999999876 46777889999999999988887766654 45557788889999999999999995 45677 Q ss_pred HHHHHHHhcCcCcCCceeeeeeeecCcccccCCCHHHHHHHHhCCceEEEEecCCCceEEEEECCEEcCCceehHHHHHH Q lcl|NC_017984. 
288 FFCGVSGSINYQEENGRTTTAFRSQDGLVPDVTNEADAETLVKNGYSFYGAWATANDRFQFAGNGSVTGQYKWIDNFDFQ 367 (487) Q Consensus 288 ~~~g~~as~~~~~~~gs~T~~fk~l~Gv~~~~lt~t~~~al~~~~~n~y~~~~~~~~~~~~~~~G~~sgg~~~iD~~~~~ 367 (487) +++|+++++||++.+|++|||||+++||+|++++++|+++|++||||||+++.+ ..++++|++++|+ |||++||+ T Consensus 312 a~~g~~as~~f~~~~g~iT~~fk~l~GV~~~~lt~t~~~al~~~~~N~y~~~~~----~~~~~~G~~~~G~-~iD~~~~~ 386 (502) T protein:vir:52 312 SALARLLSTNFAANNSTLTLKFKQQPTITADEITATEFAKAKRLGINVYTYFDD----VAMIAEGTVIGGK-FADEIVIL 386 (502) T ss_pred HHHHHHHhcCCCcCcceeeecccccCCcccCcCCHHHHHHHHhcCceEEEEecC----eeEEecCeeeCCc-hhhHHHHH Confidence 899999999999999999999999999999999999999999999999999865 3588999999994 99999999 Q ss_pred HHHHHHHHHHHHHHHHh-cCCCCcCHhhHHHHHHHHHHHHHHHHhcCccccCcccCccccccccccccCccccceeeeee Q lcl|NC_017984. 368 VFLRTQLQLAYMNMFQA-QKTIPYNDQGIATVRAYSQDPIDQGINFGGIRAGVNLSNAQKFQVNQEAGFDAASQLFTKGW 446 (487) Q Consensus 368 dWl~~~iq~~l~~ll~~-~~kIPyt~~G~~~l~~~v~~vl~~a~~nG~Ia~Gv~l~~~q~~~~~~~~g~~~~~~~~~~Gy 446 (487) |||+++||++|+++|.+ ++|||||+.|+++|+++|+++|+||++||+|+||+++++ ..|.+.+++++.+|| T Consensus 387 ~Wl~~~lq~~l~~~L~~s~~kIPy~~~G~~~l~a~i~~~l~~a~~~G~I~~G~~~~~--------~~g~~~~~d~~~~gy 458 (502) T protein:vir:52 387 DWFVDAVQKEVFARLYKSPTKIPLTDKGQAILIAAVEKVCLEGINNGAFAPGKWTGA--------GFGNLSTGDYLDKGF 458 (502) T ss_pred HHHHHHHHHHHHHHHHhcCCCcccChhHHHHHHHHHHHHHHHHHhcCccccccccCc--------ccceeeecccccCce Confidence 99999999999998865 689999999999999999999999999999999996654 457889999999999 Q ss_pred EEe---ccCCCHHHHhhcccCCeEEEEEECCeEEEEEEEEEeeC Q lcl|NC_017984. 
447 ALS---VTLPDSQTRVARESFIIKLFYTDGSSMQRLEMTATNVQ 487 (487) Q Consensus 447 ~~~---~~~~s~~dra~R~~~~i~~~~~~aGAIh~v~i~gt~vq 487 (487) |++ +++++++||++|++|+++|+|+++||||+|+|+++++| T Consensus 459 ~v~~~~~~~~s~~dr~~R~~~~~~~~~~~aGaIh~v~i~~nv~~ 502 (502) T protein:vir:52 459 YVWAAPMDTLSDSDRQARRATPIQTAVKLAGAIHSSDVIVNYNR 502 (502) T ss_pred EEEeCchhhCCHHHHHcccCCCeEEEEEECceEEEEEEEEEEeC Confidence 998 67899999999999999999999999999999999999 No 10 >protein:vir:95263 Length: 450 # NCBI annotation: Phage conserved protein # Family: family:all:396 # MgeID: mge:1561 # MgeName: Felix 01 # Cross-refs: genbank:acc:NP_944896;genbank:gi:38707836;genbank:GeneID:2744049 Probab=100.00 E-value=3.4e-120 Score=675.55 Aligned_cols=433 Identities=15% Similarity=0.126 Sum_probs=388.2 Q ss_pred cccceEEEeeeeecccccccccceeEEecCccee---eeeeccHHHHHHhcCCChHHHHHHHHHhhcccCCccCCCEEEE Q lcl|NC_017984. 7 PASNIAAVYPAVIGGGGNPLGLNTNLFVQDAIYP---NYEYFSNTLVGQHYGLESPIYKFATVYFNGFRNATTRPNSLFI 83 (487) Q Consensus 7 p~s~iV~V~~~~~~~~~~~~~~~~ll~~~~~~~~---~~~y~s~~~V~~~fg~~s~ey~aA~~yF~g~~~q~p~P~~l~i 83 (487) --||||||+|+|.++++++|+|+++||++++..| +++|+++++|++|||.+|||||||++||+ |+|+|.+||| T Consensus 1 ~~s~iVnV~i~~~~~a~~~~~f~~~l~~~~~~~~~~r~~~yss~~~V~~~FG~~S~ey~aA~~yF~----q~p~p~~l~i 76 (450) T protein:vir:95 1 MWNPIVNVDITLNTAGTTREGFGLPLFLASTDNFEERVRGYTSLTEVAEDFDENTAAYKAAKQLWS----QTPKVTQLYI 76 (450) T ss_pred CCCceEEEeecccccccccccceeEEEEcCCCCCccceeeecCHHHHHHhcCCCcHHHHHHHHHHh----CCCcccEEEE Confidence 4699999999999999999999999998877655 56899999999999999999999999999 9999999999 Q ss_pred EeeecccceeeEeeccccccchhhheeeeeEEEEEEccceEEE-EeeccccCchHHHHHhhhhhheeeEEEecccceEEE Q lcl|NC_017984. 84 TKYNLTDVPASLIGGDITSTTLADLKLINGTLTIVVDGVSKSV-PVDLATANSYSDAAALIATALTLPCTYESTVKGFVI 162 (487) Q Consensus 84 gr~~~~~~~~~l~g~~~~~~~~~~~~~~~g~~~iti~g~~~~~-~i~~s~ats~~~vA~~i~t~l~a~vt~d~~~~~f~i 162 (487) |||+++++++++.+. ..+.+|+|+++|+|+.+.. .++++.+++++++|+.+++++.+. 
+....+|.+ T Consensus 77 gr~~~~~t~~~~~~~---------~~~~~g~lt~tv~G~~~~~~~i~~s~a~s~~~va~~~~tai~~~---~~~~~~~~~ 144 (450) T protein:vir:95 77 GRRAMQYTVSIPDAV---------TESTDYSITVAAGGGISQPYQYTAQSSDTAENVLQQFKTQIEAD---PTIKDKVSV 144 (450) T ss_pred Eeeccchhhhhhhhh---------ccccceeEEEEecceeeeeeEEEEEecCChhhHHHHhhhhhccc---ceeeeeeee Confidence 999999888777653 3567899999999988755 599999999999999999999754 223356888 Q ss_pred EecccccceeEEecccchhhhhhhccccceeEecCcccccHHHHHHHHHhcccceeEEEEEeccCChhHHHHHHHHHhcc Q lcl|NC_017984. 163 KSGTSGANSTISFATGDISDDLKLTQETGAVLNNHTAADTPTTGALNALAFSQNFVNITYSEGVFNEDALKDLALWVTSQ 242 (487) Q Consensus 163 ts~t~g~~stit~atgd~a~~l~lt~~~gA~~~~G~aaet~~~al~a~~~~~~~wy~~~~~~~~~~~~~i~a~A~w~~a~ 242 (487) ++..++..+++++..+..+.+++++...+++.++|.++|++.++|.++.+++++||+|.+.+ .++++++++|+|+|++ T Consensus 145 ~s~g~~~~~t~~~~~~~~~~~~~l~~~~~~~~~~g~~aet~~~a~~a~~~~~~~w~~~~~~~--~~~~~i~a~a~w~~a~ 222 (450) T protein:vir:95 145 NVTGSNGSATMIIAKAGDNDFVKVTTTAQTVYIASTTADTASTALAAIEAYSTDWYFIAAED--RTQQFVLAMASEIQAR 222 (450) T ss_pred eeecccceeeeeeeccccchhhccccccceeEecccccccHHHHHHHHHHhhCCeEEEEecC--CCHHHHHHHHHHHhhc Confidence 88888889999999988899999999999999999999999999999999999999887643 5788899999999999 Q ss_pred CceEEEEEcccccc---ccccchHHHHHHHhCCcceEEEecCCCchHHHHHHHHHhcCcCcCCceeeeeeeecCccccc- Q lcl|NC_017984. 243 NSRFKLYTWGLDPV---ALGQSGASFGEWAKENTSGVVPLYGTFDKAAFFCGVSGSINYQEENGRTTTAFRSQDGLVPD- 318 (487) Q Consensus 243 ~~~~~~~~~~~~~~---~~~~~~~~~~~l~~~~~~~t~~~y~~~~~~a~~~g~~as~~~~~~~gs~T~~fk~l~Gv~~~- 318 (487) +++|+++.++.+.. 
..+..++...+|++++|.|++++||+..+.+++++++++.+|+..+|++|||||+++||+|+ T Consensus 223 ~~~f~~~~~~~~~~~~~~~~~~~~i~~~l~~~~~~~t~~~y~~~~~~~~~~aa~~g~~~~~~~g~~T~~fk~l~Gv~~~v 302 (450) T protein:vir:95 223 KKIFFTANSDVTALQGTELASANDVPAQLAKNMYTRTVCLWHHAAAEDYPEMAYIAYGAPYDAGSIAWGNAQLTGVAASL 302 (450) T ss_pred CcEEEEEcCCchhhhhhhhhcccchHHHHHhccCCeeEEEeeCCCchhHHHHHHHHHhhhcccceeeeccccccceeeec Confidence 99999988876543 33556678888999999999999999888889999999999999999999999999999995 Q ss_pred ------CCCHHHHHHHHhCCceEEEEecCCCceEEEEECCEEcCCceehHHHHHHHHHHHHHHHHHHHHHHhc--CCCCc Q lcl|NC_017984. 319 ------VTNEADAETLVKNGYSFYGAWATANDRFQFAGNGSVTGQYKWIDNFDFQVFLRTQLQLAYMNMFQAQ--KTIPY 390 (487) Q Consensus 319 ------~lt~t~~~al~~~~~n~y~~~~~~~~~~~~~~~G~~sgg~~~iD~~~~~dWl~~~iq~~l~~ll~~~--~kIPy 390 (487) +|+++|+++|+++|||||+++.+ ..++++|++++|+ |||++||+|||+++||++|++||+++ +|||| T Consensus 303 ~~~~~~~lt~~~~~al~~~~~n~y~~~~~----~~~~~~G~~~~G~-~iD~~~~~~wl~~~iq~~l~~ll~~~~~~KiPy 377 (450) T protein:vir:95 303 QPSNQRPLTSIQKSALDVRHCNFIDLDGG----VPVVRRGITSGGE-WIDIIRGVDWLESDLKTSLRDLLINQKGGKITY 377 (450) T ss_pred cCccccccchHHHHHHHhCCcEEEEEecC----ceeeeCCeeeCcc-hhHHHHHHHHHHHHHHHHHHHHHHhcCCCCCcc Confidence 58999999999999999999865 3589999999995 99999999999999999999999865 59999 Q ss_pred CHhhHHHHHHHHHHHHHHHHhcCccccCcccCccccccccccccCccccceeeeeeEEeccCCCHHHHhhcccCCeEEEE Q lcl|NC_017984. 391 NDQGIATVRAYSQDPIDQGINFGGIRAGVNLSNAQKFQVNQEAGFDAASQLFTKGWALSVTLPDSQTRVARESFIIKLFY 470 (487) Q Consensus 391 t~~G~~~l~~~v~~vl~~a~~nG~Ia~Gv~l~~~q~~~~~~~~g~~~~~~~~~~Gy~~~~~~~s~~dra~R~~~~i~~~~ 470 (487) |+.|+++|+++|+++|+|+++||+|+.. 
+-+.+.+++++++||++|++|+++|+| T Consensus 378 ~~~G~~~i~a~i~~~l~~a~~~G~Ia~~-------------------------~V~~~~~~~~~~~dr~~R~~~~i~~~~ 432 (450) T protein:vir:95 378 DDTGITRIRQVIETSLQRAVNRNFLSSY-------------------------TVNVPKASQVALADKKARILKDVTFAG 432 (450) T ss_pred ChhhHHHHHHHHHHHHHHHHhcCcccce-------------------------eEecCChHhcCHHHHhccCCCCeeEEE Confidence 9999999999999999999999999621 112345889999999999999999999 Q ss_pred EECCeEEEEEEEEEeeC Q lcl|NC_017984. 471 TDGSSMQRLEMTATNVQ 487 (487) Q Consensus 471 ~~aGAIh~v~i~gt~vq 487 (487) +|+||||.++|+|+|== T Consensus 433 ~laGAIh~~~i~~~v~~ 449 (450) T protein:vir:95 433 ILAGAILDVDLKGTVAY 449 (450) T ss_pred EEccceEEEEEEEEEEe Confidence 99999999999998765 No 11 >protein:vir:80052 Length: 331 # NCBI annotation: gp14 # Family: family:all:396 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468718;genbank:gi:157325298;genbank:GeneID:5601743 Probab=100.00 E-value=1.2e-84 Score=480.61 Aligned_cols=319 Identities=14% Similarity=0.173 Sum_probs=263.7 Q ss_pred cccceEEEeeeeecc---cccccccceeEEecCcceeeeeeccHHHHHHhcCCChHHHHHHHHHhhcccCCccCCCEEEE Q lcl|NC_017984. 7 PASNIAAVYPAVIGG---GGNPLGLNTNLFVQDAIYPNYEYFSNTLVGQHYGLESPIYKFATVYFNGFRNATTRPNSLFI 83 (487) Q Consensus 7 p~s~iV~V~~~~~~~---~~~~~~~~~ll~~~~~~~~~~~y~s~~~V~~~fg~~s~ey~aA~~yF~g~~~q~p~P~~l~i 83 (487) =+||||+|++.+... +...++|+++|+++.. .+.++|+++++|+.|||.++|+|++|.++|+ |.|+|.++++ T Consensus 1 ~~~~iv~V~v~~~~~~~~~~~~~~~~~~~~~~t~-~~~~~y~s~~~v~~d~~~~~~~Ykaa~~~f~----Q~~~~~~i~v 75 (331) T protein:vir:80 1 MVETITDVRVHISVLYPSPRIGLGRPAIFVKGTA-MGYKEYTTLEELKDTFADNTEVYAKAKAVFL----QKDRPDTVAV 75 (331) T ss_pred CccceecceeeecccccccccccCcceeEEeccc-cceEEEechhhhccCCCCCcHHHHHHHHHHh----ccCccceEEE Confidence 689999999998743 3445577777766544 5678899999999999999999999999999 9999999999 Q ss_pred EeeecccceeeEeeccccccchhhheeeeeEEEEEEccceEEEEeeccccCchHHHHHhhhhhheeeEEEecccceEEEE Q lcl|NC_017984. 
84 TKYNLTDVPASLIGGDITSTTLADLKLINGTLTIVVDGVSKSVPVDLATANSYSDAAALIATALTLPCTYESTVKGFVIK 163 (487) Q Consensus 84 gr~~~~~~~~~l~g~~~~~~~~~~~~~~~g~~~iti~g~~~~~~i~~s~ats~~~vA~~i~t~l~a~vt~d~~~~~f~it 163 (487) ++|.++ T Consensus 76 ~~~~~~-------------------------------------------------------------------------- 81 (331) T protein:vir:80 76 ITYEDT-------------------------------------------------------------------------- 81 (331) T ss_pred eccchH-------------------------------------------------------------------------- Confidence 865321 Q ss_pred ecccccceeEEecccchhhhhhhccccceeEecCcccccHHHHHHHHHhc-ccceeEEEEEeccCChhHHHHHHHHHhcc Q lcl|NC_017984. 164 SGTSGANSTISFATGDISDDLKLTQETGAVLNNHTAADTPTTGALNALAF-SQNFVNITYSEGVFNEDALKDLALWVTSQ 242 (487) Q Consensus 164 s~t~g~~stit~atgd~a~~l~lt~~~gA~~~~G~aaet~~~al~a~~~~-~~~wy~~~~~~~~~~~~~i~a~A~w~~a~ 242 (487) +.+.++.+. +++||++.+.+ .++++++++|+|+|++ T Consensus 82 -----------------------------------------~~~~a~~a~~~~~w~~~~~~~--~~~~~~~a~a~~~~a~ 118 (331) T protein:vir:80 82 -----------------------------------------KLLEAAEAYFLKSWHFALLAE--FKAADALALSNLIEEQ 118 (331) T ss_pred -----------------------------------------HHHHHHHHhccCceeEEEeec--CCHHHHHHHHHHHhhC Confidence 012233333 45677665543 5678889999999999 Q ss_pred CceEEEEEccccccccccchHHHHHHHhCCcceEEEecCC---CchHHHHHHHHHhcCcCcCCceeeeeeee-cCccccc Q lcl|NC_017984. 
243 NSRFKLYTWGLDPVALGQSGASFGEWAKENTSGVVPLYGT---FDKAAFFCGVSGSINYQEENGRTTTAFRS-QDGLVPD 318 (487) Q Consensus 243 ~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~t~~~y~~---~~~~a~~~g~~as~~~~~~~gs~T~~fk~-l~Gv~~~ 318 (487) +++|+++.++.+ ...+...++.++++++|+ ++.+++++|++++++ +|++|||||+ |+||+|+ T Consensus 119 ~~~f~~~~~~~~----------~~~~~~~~~~~t~~~~~~~~~~~~~aa~~g~~~~~~----~g~~t~~fk~~l~GV~~~ 184 (331) T protein:vir:80 119 KFKFAVFQVTAV----------ADITPLAKNTRTIAIVHSKTGEKLDAALIGNVASLP----VGSATWKGRHGLAGITSE 184 (331) T ss_pred CcEEEEEecCch----------HHHHHhhccccEEEEEcCCccchhHHHHHHHHHhcC----ccceeeeeecccCCCCCC Confidence 999988765322 222333456777777763 355677788888765 5899999997 8999999 Q ss_pred CCCHHHHHHHHhCCceEEEEecCCCceEEEEECCEEcCCceehHHHHHHHHHHHHHHHHHHHHHHhcCCCCcCHhhHHHH Q lcl|NC_017984. 319 VTNEADAETLVKNGYSFYGAWATANDRFQFAGNGSVTGQYKWIDNFDFQVFLRTQLQLAYMNMFQAQKTIPYNDQGIATV 398 (487) Q Consensus 319 ~lt~t~~~al~~~~~n~y~~~~~~~~~~~~~~~G~~sgg~~~iD~~~~~dWl~~~iq~~l~~ll~~~~kIPyt~~G~~~l 398 (487) +++.+|+++|+++|||||+++++ ..++++|++++|+ |||++||+|||+++||++|++||++++|||||+.|+++| T Consensus 185 ~lt~t~~~al~~~~~N~y~~~~~----~~~~~~G~~~~G~-~iD~~~~~dWl~~~lq~~l~~ll~~~~kiPy~~~G~~~l 259 (331) T protein:vir:80 185 ELKVSEIDAIQKAGGMCYIEKAG----IAQTSEGKTVSGE-FIDSIHGDDWIKATIETRLQKLLTETDKLTFDARGIALL 259 (331) T ss_pred CCCHHHHHHHHhcCceEEEEecC----eeEEecceEeCch-hHHHHHHHHHHHHHHHHHHHHHHHhCCCCccChhhHHHH Confidence 99999999999999999999875 4689999999995 999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHhcCccccCcccCccccccccccccCccccceeeeeeEE---eccCCCHHHHhhcccCCeEEEEEECCe Q lcl|NC_017984. 399 RAYSQDPIDQGINFGGIRAGVNLSNAQKFQVNQEAGFDAASQLFTKGWAL---SVTLPDSQTRVARESFIIKLFYTDGSS 475 (487) Q Consensus 399 ~~~v~~vl~~a~~nG~Ia~Gv~l~~~q~~~~~~~~g~~~~~~~~~~Gy~~---~~~~~s~~dra~R~~~~i~~~~~~aGA 475 (487) +++|+++|+||++||+|+||+++++. 
|||+ .++++|++||++|++|+++|+|+++|| T Consensus 260 ~a~i~~~~~~av~~G~I~~g~~~~~~--------------------~~~v~~~~~~~~s~~dr~~R~~~~~~~~~~~~ga 319 (331) T protein:vir:80 260 QSELTTVLNEGFANGIIDSNDETGEP--------------------NFSITALQRSDLNDDDIAKRNYKGLSFRYKRSGA 319 (331) T ss_pred HHHHHHHHHHHHhCCceecCccCCCc--------------------ceEEEeCchhcCCHHHHhccCCCCeEEEEEEcce Confidence 99999999999999999999865432 3444 578999999999999999999999999 Q ss_pred EEEEEEEEEeeC Q lcl|NC_017984. 476 MQRLEMTATNVQ 487 (487) Q Consensus 476 Ih~v~i~gt~vq 487 (487) ||+|+|+|+ |+ T Consensus 320 I~~v~i~~~-v~ 330 (331) T protein:vir:80 320 IHSVDVYGE-VE 330 (331) T ss_pred EEEEEEEEE-Ee Confidence 999999985 45 No 12 >protein:vir:3165 Length: 426 # NCBI annotation: capsid protein CP67 # Family: family:all:28419 # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665936;genbank:gi:22091122;genbank:GeneID:951267 Probab=100.00 E-value=1.3e-66 Score=381.81 Aligned_cols=395 Identities=12% Similarity=0.021 Sum_probs=263.5 Q ss_pred CCcCCccccceEEEeeeeecccccccccceeEEec-Ccce-------eeeeeccHHHHHHhcCCChHHHHHHHHHhhccc Q lcl|NC_017984. 1 MQFNSIPASNIAAVYPAVIGGGGNPLGLNTNLFVQ-DAIY-------PNYEYFSNTLVGQHYGLESPIYKFATVYFNGFR 72 (487) Q Consensus 1 ~~~~~ip~s~iV~V~~~~~~~~~~~~~~~~ll~~~-~~~~-------~~~~y~s~~~V~~~fg~~s~ey~aA~~yF~g~~ 72 (487) || .+||||+|+++++++..++|+.+||.+ +..+ +++.|+|+++|++|||.+||+||||++||+ T Consensus 1 m~------~~iVnV~Is~~t~A~~~~~Fg~~liigs~~~~~p~~~f~~~~~Yss~~~V~~Dfg~~s~~Y~AA~~~f~--- 71 (426) T protein:vir:31 1 MP------KQIVEIELTAEIADRPQETFTDAAIVGTAEEEPPDAEFGEVNQYSTSTSVGDDYGEDSDVYTASEAIEE--- 71 (426) T ss_pred CC------cceEEEEeecccccccccccceeeeeeeccccccccccchhhhhhhHHHHHhcCCCChHHHHHHHHHHh--- Confidence 66 699999999999999999999665553 4443 355699999999999999999999999999 Q ss_pred CCccCCCEEEEEeeecccceeeEeeccccccchhhheeeeeEEEEEEccceEEEEeeccccCchHHHHHhhhhhheeeEE Q lcl|NC_017984. 
73 NATTRPNSLFITKYNLTDVPASLIGGDITSTTLADLKLINGTLTIVVDGVSKSVPVDLATANSYSDAAALIATALTLPCT 152 (487) Q Consensus 73 ~q~p~P~~l~igr~~~~~~~~~l~g~~~~~~~~~~~~~~~g~~~iti~g~~~~~~i~~s~ats~~~vA~~i~t~l~a~vt 152 (487) |. +++||+.....+ .+ ..... ....+|+|+.. +.+....+.++.|...+.+. T Consensus 72 -Q~-----~~~~r~~v~~at-~~----------~~~~~---t~~~tv~g~~~------s~~a~~~~~a~~i~~~~~~~-- 123 (426) T protein:vir:31 72 -MG-----AEQWRVMVLEAT-EV----------TEEEL---SDGDTIDKVPI------LGNHEVESPDGDIEFTTDDD-- 123 (426) T ss_pred -CC-----ceeEEeeccccc-ee----------eeccC---Ccceeecceee------eecccCcchHHHHHHhhccc-- Confidence 76 456776322111 11 00011 22245666443 33444567777777777543 Q ss_pred EecccceEEEEecccccc-------eeEEecccchhhhhhhccccceeEecCcccccHHHHHHHHHhcccceeEEEEEec Q lcl|NC_017984. 153 YESTVKGFVIKSGTSGAN-------STISFATGDISDDLKLTQETGAVLNNHTAADTPTTGALNALAFSQNFVNITYSEG 225 (487) Q Consensus 153 ~d~~~~~f~its~t~g~~-------stit~atgd~a~~l~lt~~~gA~~~~G~aaet~~~al~a~~~~~~~wy~~~~~~~ 225 (487) +|.....+.+...++... ..+++..+|++.+.++.+. +-.........||.... T Consensus 124 ~~~~~~~~~~~~~t~~g~~t~~~~~~~~~~s~~dw~~~~~~~s~----------------~~~~~ia~~~~~~~~~~--- 184 (426) T protein:vir:31 124 PDVEDFDAEIVINSATGDVATSEDSIELTYFHADWSQLDEFPSD----------------VNNFAVADRRFDLKGVG--- 184 (426) T ss_pred cccccceeeeEeccccceeeccccceeeeeccCcchhhhccccc----------------chhhhhhccccchhhhh--- Confidence 333333334333322211 2233444555443333221 11122333445543221 Q ss_pred cCChhHHHHHHHHHhccCceEEEEEccccccccccchHHHHHHHhCCcc--eEEEecCC---CchHHHHHHHHHhcCc-- Q lcl|NC_017984. 226 VFNEDALKDLALWVTSQNSRFKLYTWGLDPVALGQSGASFGEWAKENTS--GVVPLYGT---FDKAAFFCGVSGSINY-- 298 (487) Q Consensus 226 ~~~~~~i~a~A~w~~a~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~--~t~~~y~~---~~~~a~~~g~~as~~~-- 298 (487) + ..++..|.+.+.++++. ++.......+..++...++..++|. +.+++|.. 
+......++.+++.++ T Consensus 185 ---~--~~~~~~wa~~~~i~~va-~~~e~~~~~~~~~~~a~~~~~~~y~p~~~~~~~~~~~~~~~~~~~~~~~aa~~~~~ 258 (426) T protein:vir:31 185 ---V--LDETHSWASDEDMGMIA-NGVNVDDYDSVDEAMDVAHEVAGYVPSGDLMMIVDASDDDLAAYQLGKFAVSEPWY 258 (426) T ss_pred ---h--hHhhhhhhhhcceeeee-eccchhhhcchhhhhhhhhcccccccchhheeehhccccchhhHHhhhhhhhcccc Confidence 1 22577888887665543 4433344444455556667777774 45555542 3335567777777654 Q ss_pred -----CcCCceeeeeeeecCcccccCCCHHHHHHHHhCCceEEEEecCCCc-eEEEEECCEEcCCceehHHHHHHHHHHH Q lcl|NC_017984. 299 -----QEENGRTTTAFRSQDGLVPDVTNEADAETLVKNGYSFYGAWATAND-RFQFAGNGSVTGQYKWIDNFDFQVFLRT 372 (487) Q Consensus 299 -----~~~~gs~T~~fk~l~Gv~~~~lt~t~~~al~~~~~n~y~~~~~~~~-~~~~~~~G~~sgg~~~iD~~~~~dWl~~ 372 (487) +..+++..++|++.||+.. .+..++...+ ++++|+|+.+.+.+. ...+.++|++++|+ |||++|++|||++ T Consensus 259 ~~~~~~~~~~~~~~~~~~~~gv~~-t~~~~~~A~~-~~~~n~~~~~~~~~~i~~~~~~~G~~~~G~-~iD~~~g~dwl~~ 335 (426) T protein:vir:31 259 NPLWNELPAGETVSKNVGDPEEQG-TFEGGDEAEG-EGPVNVLIDVSDANRVSNAVTTAGADSDTS-FFDIRRTKVYTAE 335 (426) T ss_pred chhhhhccccccceeecccccccc-ccchhhhhhh-cCCceEEEEecCceeeecceeecccccchh-hhhhHHHHHHHHH Confidence 4456777788999999983 3334444444 588999999877642 22345678888885 9999999999999 Q ss_pred HHHHHHHHHHHhcCCCCcCHhhHHHHHHHHHHHHHHHHhcCccccCcccCccccccccccccCccccceeeeeeEEe--- Q lcl|NC_017984. 373 QLQLAYMNMFQAQKTIPYNDQGIATVRAYSQDPIDQGINFGGIRAGVNLSNAQKFQVNQEAGFDAASQLFTKGWALS--- 449 (487) Q Consensus 373 ~iq~~l~~ll~~~~kIPyt~~G~~~l~~~v~~vl~~a~~nG~Ia~Gv~l~~~q~~~~~~~~g~~~~~~~~~~Gy~~~--- 449 (487) +||++|++||.|.+|||||+.|++||++.|+.+|+++++.|.. +..+|++. T Consensus 336 ~iq~~l~~ll~~~~KIpyt~~Gi~~I~~~i~~~L~~~v~~~g~--------------------------~~~~y~v~~P~ 389 (426) T protein:vir:31 336 MLELDLESLQVSDDDVPFTEDGQAMIEDAIKGTMSGLTGSVGQ--------------------------PLAEYEVDVPE 389 (426) T ss_pred HHHHHHHHHhhcCCCCccchhHHHHHHHHHHHHHHHHhcCCCc--------------------------cccceeecCCC Confidence 9999999999999999999999999999999999999875531 11234443 Q ss_pred ccCCCHHHHhhcccCCeEEEEEECCeEEEEEEEEEeeC Q lcl|NC_017984. 
450 VTLPDSQTRVARESFIIKLFYTDGSSMQRLEMTATNVQ 487 (487) Q Consensus 450 ~~~~s~~dra~R~~~~i~~~~~~aGAIh~v~i~gt~vq 487 (487) +++ +++||++|++++|+|.|+|+||||.++|+|+|== T Consensus 390 ~~~-~~~dra~R~~~~i~~~~~laGAIh~v~I~g~v~v 426 (426) T protein:vir:31 390 WDD-DDVDRVNRNWGGIDLDARLAQRAHTFSLGLNVSV 426 (426) T ss_pred ccc-cchhhhhhccCCceEEEEEeCcEEEEEEEEEEeC Confidence 444 4579999999999999999999999999997644 No 13 >protein:vir:1996 Length: 495 # NCBI annotation: major tail subunit # Family: family:all:369 # MgeID: mge:320 # MgeName: Mu # Cross-refs: genbank:acc:NP_050643;genbank:gi:9633530;genbank:GeneID:2636269 Probab=99.02 E-value=5.1e-09 Score=66.04 Aligned_cols=426 Identities=16% Similarity=0.183 Sum_probs=205.1 Q ss_pred CCcCCccccceEE-Eeeeeecccc---ccccc-ceeEEec-------CcceeeeeeccHHHHHHhcCCChHHHHHHHHHh Q lcl|NC_017984. 1 MQFNSIPASNIAA-VYPAVIGGGG---NPLGL-NTNLFVQ-------DAIYPNYEYFSNTLVGQHYGLESPIYKFATVYF 68 (487) Q Consensus 1 ~~~~~ip~s~iV~-V~~~~~~~~~---~~~~~-~~ll~~~-------~~~~~~~~y~s~~~V~~~fg~~s~ey~aA~~yF 68 (487) |.|+.||-|--|= +-+.+....+ .+.+- -.|||.+ ....|+ .-.|.+++...||.+|-...|++.|. T Consensus 4 i~F~~IP~~iRvP~~y~E~dns~A~~g~~~~~q~vLiiGq~la~gs~~~~~pv-~v~s~~~a~~~fG~GS~la~M~~a~~ 82 (495) T protein:vir:19 4 ISFNAIPSDVRVPLTYIEFDNSNAVSGTPAPRQRVLMFGQSGSKASAAPNVPV-RIRSGSQASAAFGQGSMLALMADAFL 82 (495) T ss_pred CchhhCCcccccCeEEEEEccCCCCcCCcCCCceEEEEEecCcccccccceeE-EecCHHHHHHhcCcCcHHHHHHHHHH Confidence 6677777763221 2222222211 11222 2344432 122344 36688899999999999999999999 Q ss_pred hcccCCccCCCEEEEEeeecc-cceeeEeeccccccchhhheeeeeEEEEEEccceEEEEeeccccCchHHHHHhhhhhh Q lcl|NC_017984. 69 NGFRNATTRPNSLFITKYNLT-DVPASLIGGDITSTTLADLKLINGTLTIVVDGVSKSVPVDLATANSYSDAAALIATAL 147 (487) Q Consensus 69 ~g~~~q~p~P~~l~igr~~~~-~~~~~l~g~~~~~~~~~~~~~~~g~~~iti~g~~~~~~i~~s~ats~~~vA~~i~t~l 147 (487) . ..| =..||+=..... 
.+.+ .| .++ +.-.....|.+.+.|+|....+.+ ....+.+.+|+.+.+++ T Consensus 83 ~----~n~-~~~l~~i~~~D~aG~aA--~g-~it---~tg~at~~G~l~l~I~g~~v~v~V--~~gdTaa~vA~al~aai 149 (495) T protein:vir:19 83 N----ANR-VAELWCIPQGNGTGNAA--VG-EIS---LSGTAGENGSLVTYIAGQRLAVSV--AAGATGAALADLLVARI 149 (495) T ss_pred H----hCC-cceEEEEeeCChhhcee--EE-EEE---EeecCCCCcEEEEEECCEEEEEEe--cCCCCHHHHHHHHHHHh Confidence 7 433 344555333321 1111 11 111 112223579999999998766544 55667788888888888 Q ss_pred eee----EEEe--------cccceEEEEecccccce----eEEecccchhhhhhhccccceeEecCcccccHHHHHHHHH Q lcl|NC_017984. 148 TLP----CTYE--------STVKGFVIKSGTSGANS----TISFATGDISDDLKLTQETGAVLNNHTAADTPTTGALNAL 211 (487) Q Consensus 148 ~a~----vt~d--------~~~~~f~its~t~g~~s----tit~atgd~a~~l~lt~~~gA~~~~G~aaet~~~al~a~~ 211 (487) .+. ||.. +.....++|..-.|+.. .+.|-.|. ...-+++-.. .....|...-.+.++++++- T Consensus 150 na~~~lPvTA~~~~~~~~~~a~~~VtlTAr~kG~~n~idi~~~~~~ge-~~p~Glt~ti-tamsgGag~PDia~alaal~ 227 (495) T protein:vir:19 150 KGQPDLPVTAEVRADSGDDDTHADVVLSAKFTGALSAVDVRWNYYAGE-TTPYGIITAF-KAASGKNGNPDISASIAGMG 227 (495) T ss_pred cCCccCceEEEeeccCCCCcCceeEEEEEeeccccccceeEEEeeccc-ccccceeEEE-EecCCCCCCcchHHHHHHhc Confidence 642 2221 12234445554444321 11221111 0111121110 11223444456777777776 Q ss_pred hcccceeEEEEEeccCChhHHHHHHHHHhcc----CceEEEEEccccccccccchHHHHHHH-hCCcceEEEe-cC-CCc Q lcl|NC_017984. 212 AFSQNFVNITYSEGVFNEDALKDLALWVTSQ----NSRFKLYTWGLDPVALGQSGASFGEWA-KENTSGVVPL-YG-TFD 284 (487) Q Consensus 212 ~~~~~wy~~~~~~~~~~~~~i~a~A~w~~a~----~~~~~~~~~~~~~~~~~~~~~~~~~l~-~~~~~~t~~~-y~-~~~ 284 (487) +...+|....+ .+.+.+.++-.+++.. +.++..+.... ..+ -..+..+. ..|..+..++ +. .+. 
T Consensus 228 ~~~~~~I~~P~----tD~asL~al~~~l~~rw~~~~q~~g~~~~a~----~gT-~~~l~t~g~~~N~~~it~~~~~gsp~ 298 (495) T protein:vir:19 228 DLQYKYIVMPY----TDEPNLNLLRTELQERWGPVNQADGFAVTVL----SGT-YGDISTFGVSRNDHLISCMGIAGAPE 298 (495) T ss_pred cCCCcEEEEec----CcHHHHHHHHHHHHHhhhHHHhcCeEEEEee----cCC-HHHHHHhhhccCCceEEEEecCCCCC Confidence 55444433322 3455566666666641 12221111111 111 11222222 2344444433 32 222 Q ss_pred h----HHHHHHHHH---hcCcCcCCceeeeeeeecCccccc----CCCHHHHHHHHhCCceEEEEe-cCCCceEEEEECC Q lcl|NC_017984. 285 K----AAFFCGVSG---SINYQEENGRTTTAFRSQDGLVPD----VTNEADAETLVKNGYSFYGAW-ATANDRFQFAGNG 352 (487) Q Consensus 285 ~----~a~~~g~~a---s~~~~~~~gs~T~~fk~l~Gv~~~----~lt~t~~~al~~~~~n~y~~~-~~~~~~~~~~~~G 352 (487) . ++.+.++++ ..|+.+. +.--.|+||.|. .++.+|.+.|-.+|+..|..- .|.-+-.+.+..= T Consensus 299 ~~~~~AAA~aa~~A~~l~~DPArP-----L~tl~L~Gi~~p~~~~r~~~~ern~LL~~Gist~~V~~~G~V~I~R~ITTY 373 (495) T protein:vir:19 299 PSYLYAATLCAVASQALSIDPARP-----LQTLTLPGRMPPAVGDRFTWSERNALLFDGISTFNVNDGGEMQIERMITMY 373 (495) T ss_pred cHHHHHHHHHHHHHHHhhcccccc-----cCceeecceecCCccccCChHHHHHHHhCCcceEEECCCCeEEEEeeeeee Confidence 1 122333322 2343333 333457888753 478999999999999887543 2221111111110 Q ss_pred EEc-CC---cee--hHHHHHHHHHHHHHHHHHHHHHHhcCCCCcCHh----h-----HHHHHHHHHHHHHHHHhcCcccc Q lcl|NC_017984. 353 SVT-GQ---YKW--IDNFDFQVFLRTQLQLAYMNMFQAQKTIPYNDQ----G-----IATVRAYSQDPIDQGINFGGIRA 417 (487) Q Consensus 353 ~~s-gg---~~~--iD~~~~~dWl~~~iq~~l~~ll~~~~kIPyt~~----G-----~~~l~~~v~~vl~~a~~nG~Ia~ 417 (487) +.. .| ..| |-.++-++++...+++.+..-|-.. |+--+.. 
| -..|++.+-..+++....|++.- T Consensus 374 ~~n~~G~~D~syLDi~T~~tl~yvr~~~r~~i~~kfpR~-KLa~d~~~~~~gq~IvTp~~ir~ell~~~~~le~~given 452 (495) T protein:vir:19 374 RTNKYGDSDPSYLNVNTIATLSYLRYSLRTRITQKFPNY-KLASDGTRFATGQAVVTPSVIKTELLALFEEWENAGLVED 452 (495) T ss_pred eecCCCCcchhhhhhHHHHHHHHHHHHHHHHHhhhcCCc-ccccCCCCCCCcccccChHHHHHHHHHHHHhhhhhccccC Confidence 111 11 124 5678999999999999988766432 2222211 1 25789999999999999998842 Q ss_pred CcccCccccccccccccCccccceeeeeeEEeccCCCHHHHhhcccCCeEEEEEECCeEEEEEEEEEeeC Q lcl|NC_017984. 418 GVNLSNAQKFQVNQEAGFDAASQLFTKGWALSVTLPDSQTRVARESFIIKLFYTDGSSMQRLEMTATNVQ 487 (487) Q Consensus 418 Gv~l~~~q~~~~~~~~g~~~~~~~~~~Gy~~~~~~~s~~dra~R~~~~i~~~~~~aGAIh~v~i~gt~vq 487 (487) + +. +-+.=.+.|+...+ .|-+=..|+ .+ |+...|-...+| T Consensus 453 -~-----------~~---------~~~~LiVerd~~dp-nRln~~~p~-----d~---vn~L~V~A~~i~ 492 (495) T protein:vir:19 453 -F-----------DT---------FKEELYVARNKDDK-DRLDVLCGP-----NL---INQFRIFAAQVQ 492 (495) T ss_pred -h-----------hh---------hcceeEEEECCCCC-cEEEEEecc-----ee---eCceeeeeeeee Confidence 0 00 00100112222222 233222221 11 222223333334 No 14 >protein:vir:489 Length: 498 # NCBI annotation: putative sheath protein # Family: family:all:369 # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543096;swissprot:trembl:q8w623;genbank:gi:18249908;uniprot:Q8W623;genbank:GeneID:929698 Probab=98.98 E-value=7.8e-09 Score=65.02 Aligned_cols=430 Identities=13% Similarity=0.135 Sum_probs=205.3 Q ss_pred CCcCCccccceEE-Eeeeeecccccc--cccceeEEec-------CcceeeeeeccHHHHHHhcCCChHHHHHHHHHhhc Q lcl|NC_017984. 1 MQFNSIPASNIAA-VYPAVIGGGGNP--LGLNTNLFVQ-------DAIYPNYEYFSNTLVGQHYGLESPIYKFATVYFNG 70 (487) Q Consensus 1 ~~~~~ip~s~iV~-V~~~~~~~~~~~--~~~~~ll~~~-------~~~~~~~~y~s~~~V~~~fg~~s~ey~aA~~yF~g 70 (487) |.|+.||-|--|= +-+.+....+.. ...-.|||.+ ....|+ .-+|.+++...||.+|-...|++.|.. 
T Consensus 3 IsF~~IP~~iRvP~~y~E~dns~A~~~~~~qrvLiiGq~la~gt~~~~~~v-~v~s~~~a~~~fG~GS~l~~M~~a~~~- 80 (498) T protein:vir:48 3 ISFSAVPSDTLVPLFYAEMDNSAANTAVTSAPALLIGHASNDAAIEVNSLV-LMPSADYARQICGAGSQLARMVDVYRQ- 80 (498) T ss_pred ccccccCcccccceEEEEEecCCCccccCCcceEEEeecCccccccccceE-EecCHHHHHHhcCcccHHHHHHHHHHH- Confidence 7888999885432 222222222211 1112444432 112333 467899999999999999999999987 Q ss_pred ccCCccCCCEEEEEeeecccceeeEeeccccccchhhheeeeeEEEEEEccceEEEEeeccccCchHHHHHhhhhhhee- Q lcl|NC_017984. 71 FRNATTRPNSLFITKYNLTDVPASLIGGDITSTTLADLKLINGTLTIVVDGVSKSVPVDLATANSYSDAAALIATALTL- 149 (487) Q Consensus 71 ~~~q~p~P~~l~igr~~~~~~~~~l~g~~~~~~~~~~~~~~~g~~~iti~g~~~~~~i~~s~ats~~~vA~~i~t~l~a- 149 (487) ..| =..|++=.... +....-.| .++ +.-.....|.+.+.|+|....+.+ ....+.+.+|+.+.+++.+ T Consensus 81 ---~n~-~~~l~~i~~~D-~ag~aA~g-~it---~tg~at~~G~l~l~Igg~~v~v~V--~~gdTaa~vA~al~aai~a~ 149 (498) T protein:vir:48 81 ---TDP-FGELYVIAVPE-ARGAAATV-RVT---VTGEAEESGTLSLYVGRSSVQVPV--VNGDDATAVATAIKEAVNGV 149 (498) T ss_pred ---hCC-CceeEEEeeCC-cccceeEE-EEE---ecccccCCceEEEEECCEEEEEee--cCCCCHHHHHHHHHHHHhCC Confidence 443 34455533332 11111111 111 122223579999999998776554 4555778888888888864 Q ss_pred ---eEEEecccceEEEEeccccc---ceeEEec--c---cchhhhhhhcccccee---EecCcccccHHHHHHHHHhccc Q lcl|NC_017984. 150 ---PCTYESTVKGFVIKSGTSGA---NSTISFA--T---GDISDDLKLTQETGAV---LNNHTAADTPTTGALNALAFSQ 215 (487) Q Consensus 150 ---~vt~d~~~~~f~its~t~g~---~stit~a--t---gd~a~~l~lt~~~gA~---~~~G~aaet~~~al~a~~~~~~ 215 (487) .||........++|..-.|. +..+... . |. ... .+.... ...|...-.+.++++++-+... 
T Consensus 150 ~~lPVTA~~~~~~VtlTAr~kG~~GN~I~l~~~~~~~~~ge-~~p----~Glt~~itamsgGag~PDia~aLaal~~~~~ 224 (498) T protein:vir:48 150 ITLPFAASSDAGVVTLTARHKGLYGNELPVCLNYYGSGGGE-ILP----AGLQVVTEAGTAGSGAPDLTAAVAAMGDEAF 224 (498) T ss_pred CCcceEEEecCcEEEEEeeecccccccceeeeeeccCcccc-ccc----ceeeEEEEcccCCccCcchHHHHHhhccCCc Confidence 23333333344444433332 1122111 0 10 011 111111 1234444466666666655544 Q ss_pred ceeEEEEEeccCChhHHHHHHHHHhc-------cCceEEEEEccccccccccchHHHHHHH-hCCcceEEEec-CC-C-- Q lcl|NC_017984. 216 NFVNITYSEGVFNEDALKDLALWVTS-------QNSRFKLYTWGLDPVALGQSGASFGEWA-KENTSGVVPLY-GT-F-- 283 (487) Q Consensus 216 ~wy~~~~~~~~~~~~~i~a~A~w~~a-------~~~~~~~~~~~~~~~~~~~~~~~~~~l~-~~~~~~t~~~y-~~-~-- 283 (487) +|....+ .+.+.+.++..+.+. -+.++..+.... ..+ -..+..+. ..|..|..+++ .. + T Consensus 225 ~~I~~p~----~D~asl~al~~~L~~~sgRw~~~~q~~g~~~~a~----~gT-~~~l~t~g~~~N~~~it~~~~~~~~~~ 295 (498) T protein:vir:48 225 DFIGLPF----NDAASINMMMTEMNDSSGRWSYARQLYGHVYTAK----LGT-LSELVNAGDMHNQQHITLAGYEKETQS 295 (498) T ss_pred cEEEEee----cCHHHHHHHHHHHhhhhhhhhHHhhcCeEEEEec----cCC-HHHHHHhhhccCCceEEEEecCCCCCC Confidence 4433322 345566667666653 122222211111 111 11122222 23555554433 21 1 Q ss_pred --ch-HHHHHHHHHhcCcCcCCceeeeeeeecCccccc----CCCHHHHHHHHhCCceEEEEecCCCceEEEEECCEEc- Q lcl|NC_017984. 284 --DK-AAFFCGVSGSINYQEENGRTTTAFRSQDGLVPD----VTNEADAETLVKNGYSFYGAWATANDRFQFAGNGSVT- 355 (487) Q Consensus 284 --~~-~a~~~g~~as~~~~~~~gs~T~~fk~l~Gv~~~----~lt~t~~~al~~~~~n~y~~~~~~~~~~~~~~~G~~s- 355 (487) +. ++++.++++ .....+|.+ .+.=-.|+||.|. .++.+|.+.|-.+|+..|..-.| +.. 
+.+..++ T Consensus 296 p~~~~AAa~a~~aA-~~l~~DPAr-PLqtl~L~Gi~~p~~~~r~~~~ern~LL~~Gist~~V~~G---~V~-I~R~ITTY 369 (498) T protein:vir:48 296 PVDELVASRLAREA-VFIRNDPAR-PTQTGELVGMLPAPKGKRFIMTEQQTLLSHGVATAYVEGG---TLR-IQRSVTTY 369 (498) T ss_pred hHHHHHHHHHHHHH-Hhhhccccc-cccceeeeccccCCchhcCChHHHHHHHhcCcceEEEcCC---eEE-EEeeeeee Confidence 11 223333333 211223322 2222357788754 46889999999999988854222 222 2333221 Q ss_pred -----CC----ceehHHHHHHHHHHHHHHHHHHHHHHhcCCCCcCHhh---------HHHHHHHHHHHHHHHHhcCcccc Q lcl|NC_017984. 356 -----GQ----YKWIDNFDFQVFLRTQLQLAYMNMFQAQKTIPYNDQG---------IATVRAYSQDPIDQGINFGGIRA 417 (487) Q Consensus 356 -----gg----~~~iD~~~~~dWl~~~iq~~l~~ll~~~~kIPyt~~G---------~~~l~~~v~~vl~~a~~nG~Ia~ 417 (487) |- +..|-.++-++++...++..+-.-|-. .|+--+..+ -.+|++.+-..+++....|++.- T Consensus 370 ~~n~~G~~D~syLDi~T~~tl~yvr~~~r~~i~~kfpR-~KLa~dg~~~~~gq~IvTp~~ir~eli~~y~~le~~given 448 (498) T protein:vir:48 370 KKNAYGVADNSYLDSETLHTSAYVLRKLKSVITSKYGR-HKLANDGTRFGPGQAIVTPAVIKGELLATYRQMERAGIVEN 448 (498) T ss_pred eecCCCCcchhhhhhhhHHHHHHHHHHHHHHhhhhcCC-ceecccCcccCCCCcccchHHHHHHHHHHHHhhhhhccccC Confidence 11 224567899999999999998876632 233322111 25788999999999999998852 Q ss_pred CcccCccccccccccccCccccceeeeeeEEeccCCCHHHHhhcccC---CeEEEEEECCe Q lcl|NC_017984. 418 GVNLSNAQKFQVNQEAGFDAASQLFTKGWALSVTLPDSQTRVARESF---IIKLFYTDGSS 475 (487) Q Consensus 418 Gv~l~~~q~~~~~~~~g~~~~~~~~~~Gy~~~~~~~s~~dra~R~~~---~i~~~~~~aGA 475 (487) + +.-.+.-+-.-.+.|-.. +++-..++---.-|... 
.+.+.|..++| T Consensus 449 -~--~~~~~~LiVerd~~dpnR--------ln~~~p~d~vn~L~V~A~~~~f~lq~~~~~~ 498 (498) T protein:vir:48 449 -Y--DLFKQYLIVERDADNPNR--------LNTLFPPDYVNQLRVFAVVNQFRLQYSEESA 498 (498) T ss_pred -h--hhhcceeEEEECCCCCcE--------EEEEecccccCchhhhhhhhhhhhhhhhcCC Confidence 1 000011111111111000 00000000000001111 12333444444 No 15 >protein:vir:4463 Length: 498 # NCBI annotation: Tail Sheath protein # Family: family:all:369 # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700386;genbank:gi:23505458;genbank:GeneID:955665 Probab=98.94 E-value=1.1e-08 Score=64.14 Aligned_cols=433 Identities=12% Similarity=0.107 Sum_probs=206.5 Q ss_pred CCcCCccccceEE-Eeeeee-cccccccccc-eeEEec-------CcceeeeeeccHHHHHHhcCCChHHHHHHHHHhhc Q lcl|NC_017984. 1 MQFNSIPASNIAA-VYPAVI-GGGGNPLGLN-TNLFVQ-------DAIYPNYEYFSNTLVGQHYGLESPIYKFATVYFNG 70 (487) Q Consensus 1 ~~~~~ip~s~iV~-V~~~~~-~~~~~~~~~~-~ll~~~-------~~~~~~~~y~s~~~V~~~fg~~s~ey~aA~~yF~g 70 (487) |.|+.||-|--|= +-+.+. ..+..+.+-- .|||.+ ....|+ .-+|.+++...||.+|-...|++.|.. T Consensus 3 IsF~~IP~~iRvP~~y~E~dns~A~~~~~~q~vLiiGq~la~gs~~~~~~v-~v~s~~~a~~~fG~GSml~~M~~a~~~- 80 (498) T protein:vir:44 3 ISFNSIPSDTRVPLFYAEMDNSAANTARDSGASLLIGHASNDASIAVNSLV-LVSSVDYARQICGAGSQLARMVGAYRK- 80 (498) T ss_pred CchhhcCcccccCeEEEEEeCCCCCCCcCCcceEEEEecCcccccccceeE-eecCHHHHHHhcCcccHHHHHHHHHHH- Confidence 7788999884432 222222 2222333322 444432 112333 467999999999999999999999997 Q ss_pred ccCCccCCCEEEEEeeecccceeeEeeccccccchhhheeeeeEEEEEEccceEEEEeeccccCchHHHHHhhhhhheee Q lcl|NC_017984. 71 FRNATTRPNSLFITKYNLTDVPASLIGGDITSTTLADLKLINGTLTIVVDGVSKSVPVDLATANSYSDAAALIATALTLP 150 (487) Q Consensus 71 ~~~q~p~P~~l~igr~~~~~~~~~l~g~~~~~~~~~~~~~~~g~~~iti~g~~~~~~i~~s~ats~~~vA~~i~t~l~a~ 150 (487) ..| =..||+=...- +....-.| .++ +.-.....|.+.+.|+|....+.+ ....+.+.+|+.+.+++.+. 
T Consensus 81 ---~n~-~~~l~~i~~~D-~aG~aAtg-~it---~tg~at~~G~l~l~Igg~~v~v~V--~~gdTaa~vA~al~aaina~ 149 (498) T protein:vir:44 81 ---TDP-FGELYVIAVPE-STGAAATV-ALT---VTGEATETGTVNVYTGRTRVQAPV--TSGDDAAAVAVSIKDAVNAN 149 (498) T ss_pred ---hCC-CceeEEEecCC-cccceeEE-EEE---eecccCCCcEEEEEECCEEEEEEe--cCCCCHHHHHHHHHHHHhCC Confidence 333 34455543332 11111111 111 122233579999999998776554 45557788998888888642 Q ss_pred ----EEEecccceEEEEeccccc---ceeEEe--cc--cchhhhhhhccccceeEecCcccccHHHHHHHHHhcccceeE Q lcl|NC_017984. 151 ----CTYESTVKGFVIKSGTSGA---NSTISF--AT--GDISDDLKLTQETGAVLNNHTAADTPTTGALNALAFSQNFVN 219 (487) Q Consensus 151 ----vt~d~~~~~f~its~t~g~---~stit~--at--gd~a~~l~lt~~~gA~~~~G~aaet~~~al~a~~~~~~~wy~ 219 (487) ||........++|..-.|. +..+.. -. ++-...-+++-. -...+.|+....+.++++++-+...+|.. T Consensus 150 ~~lPVTA~~~~~~vtlTAr~kG~~GN~I~l~~~~~~~~~ge~~p~Glt~t-itamsgGag~PDia~alaal~~~~~~~i~ 228 (498) T protein:vir:44 150 PDLPFTATSEAGVVTLTARHKGLYGNEIPVTLNYYGFGGGEVLPAGVNIT-VASGVKGAGAPALNDAVAAMGDEPFDYIG 228 (498) T ss_pred CCCceEEeeccceEEEEEeccCcccCcceEEEeeccCccccccccceeEE-EEcccCCccCchhHHHHHhhccCCccEEE Confidence 3333333344455443332 212211 11 011111111100 01122344444666766666555444433 Q ss_pred EEEEeccCChhHHHHHHHHHhcc-------CceEEEEEccccccccccchHHHHHHH-hCCcceEEEe-cCC--Cch--- Q lcl|NC_017984. 220 ITYSEGVFNEDALKDLALWVTSQ-------NSRFKLYTWGLDPVALGQSGASFGEWA-KENTSGVVPL-YGT--FDK--- 285 (487) Q Consensus 220 ~~~~~~~~~~~~i~a~A~w~~a~-------~~~~~~~~~~~~~~~~~~~~~~~~~l~-~~~~~~t~~~-y~~--~~~--- 285 (487) ..+ .+...+.++..+.+.. +.++..+.... ..+- ..+..+. ..|..|..++ +.. 
+.+ T Consensus 229 ~p~----~D~asl~al~~~L~~~sgRw~~~~q~~g~~~~a~----~gT~-a~l~t~g~~~N~~~it~~~~~~~~~sp~~~ 299 (498) T protein:vir:44 229 LPF----NDTASVNSMATEMNDSSGRWSYVRQLYGHVYTAK----TGTL-SELVAAGDQFNLQHITLAGYEKDTQTPADE 299 (498) T ss_pred Eee----cCHHHHHHHHHHHhhhhcchHHHhhcCeEEEEec----cCCH-HHHHHhhhccCCceEEEEecCCCCCCHHHH Confidence 322 3455666676666431 22222111111 1111 1122222 2354554443 321 112 Q ss_pred -HHHHHHHHHhcCcCcCCceeeeeeeecCccccc----CCCHHHHHHHHhCCceEEEEecCCCceEEEEECCEEc----- Q lcl|NC_017984. 286 -AAFFCGVSGSINYQEENGRTTTAFRSQDGLVPD----VTNEADAETLVKNGYSFYGAWATANDRFQFAGNGSVT----- 355 (487) Q Consensus 286 -~a~~~g~~as~~~~~~~gs~T~~fk~l~Gv~~~----~lt~t~~~al~~~~~n~y~~~~~~~~~~~~~~~G~~s----- 355 (487) ++.+.++++ .....+|.+ .+.=-.|+||.|. .++.+|.+.|-.+|+..|..- .+ +.. +.+..++ T Consensus 300 ~AAa~a~~aA-~~l~~DPAr-PL~tl~L~Gi~~p~~~~r~~~~ern~LL~~Gist~~V~--~G-~V~-I~R~ITTY~~n~ 373 (498) T protein:vir:44 300 LAASRTARAA-VFIRNDPAR-PTQTGELVDMLPAPKGKRFTTTEQQTLLSHGVATAYVE--SG-VLR-IQRDITTYRKNA 373 (498) T ss_pred HHHHHHHHHH-HHhhccccc-ccCceeecccccCCchhcCChHHHHHHHhcCcceEEEc--CC-eEE-EEeeeeeeeecC Confidence 223333333 211223322 2333357888754 478999999999999888542 23 222 2333321 Q ss_pred CC-----ceehHHHHHHHHHHHHHHHHHHHHHHhcCCCCcCH----hhH-----HHHHHHHHHHHHHHHhcCccccCccc Q lcl|NC_017984. 356 GQ-----YKWIDNFDFQVFLRTQLQLAYMNMFQAQKTIPYND----QGI-----ATVRAYSQDPIDQGINFGGIRAGVNL 421 (487) Q Consensus 356 gg-----~~~iD~~~~~dWl~~~iq~~l~~ll~~~~kIPyt~----~G~-----~~l~~~v~~vl~~a~~nG~Ia~Gv~l 421 (487) .| +..|-.++-.+++...++..+..-|-. .|+-=++ .|. 
..|++.+-..+++....|++.- + T Consensus 374 ~G~~D~syLDi~T~~tl~yvr~~~r~~i~~kfpR-~KLa~d~~~~~~gq~IvTp~~ir~eli~~y~~le~~givEn-~-- 449 (498) T protein:vir:44 374 YGVADNSYLDSETLHTSAYVLRRLKSVITSKYGR-HKLANDGTRFGSGQAIVTPAVIRGELGSTYRQMEREGIVEN-F-- 449 (498) T ss_pred CCCcchhhhhhhhHHHHHHHHHHHHHHhhhhcCC-cccccCCcccCCCcccccHHHHHHHHHHHHHhhhhhccccC-h-- Confidence 11 224667999999999999999665522 2322111 122 4788999999999999998852 1 Q ss_pred CccccccccccccCc-cccceeeeeeEEeccCCCHHHHhhcccC---CeEEEEEECCe Q lcl|NC_017984. 422 SNAQKFQVNQEAGFD-AASQLFTKGWALSVTLPDSQTRVARESF---IIKLFYTDGSS 475 (487) Q Consensus 422 ~~~q~~~~~~~~g~~-~~~~~~~~Gy~~~~~~~s~~dra~R~~~---~i~~~~~~aGA 475 (487) +.-.+.-+..-.+.| ..=+.+ ..++---.-|... .+.+.|..+.| T Consensus 450 ~~~~~~LiVerd~~dpnRln~~---------~p~d~vn~L~V~A~~~~f~lq~~~~~~ 498 (498) T protein:vir:44 450 DLFQQHLIVERNANDSNRLDVL---------FPPDYVNQLRVFAVLNQFRLQYSEEAA 498 (498) T ss_pred hhhcceeEEEECCCCCcEEEEE---------ecccccCchhhhhhhhhhhhhhhhhcC Confidence 000011111111111 000000 0000000001110 12233333333 No 16 >protein:vir:4517 Length: 498 # NCBI annotation: tail sheath protein # Family: family:all:369 # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599043;genbank:gi:19549001;genbank:GeneID:935236 Probab=98.92 E-value=1.4e-08 Score=63.63 Aligned_cols=434 Identities=14% Similarity=0.135 Sum_probs=210.2 Q ss_pred CCcCCccccceEE-Eeeeeec-ccccccccc-eeEEec-------CcceeeeeeccHHHHHHhcCCChHHHHHHHHHhhc Q lcl|NC_017984. 1 MQFNSIPASNIAA-VYPAVIG-GGGNPLGLN-TNLFVQ-------DAIYPNYEYFSNTLVGQHYGLESPIYKFATVYFNG 70 (487) Q Consensus 1 ~~~~~ip~s~iV~-V~~~~~~-~~~~~~~~~-~ll~~~-------~~~~~~~~y~s~~~V~~~fg~~s~ey~aA~~yF~g 70 (487) |.|+.||-|--|= +-+.+.. .+..+.+-- .|||.+ ....|+ .-+|.+++...||.+|-...|++.|.. 
T Consensus 3 IsF~~IP~~iRvP~~y~E~dns~A~~~~~~q~vLiiGq~la~gs~~~~~~v-~v~s~~~a~~lfG~GSml~~M~~a~~~- 80 (498) T protein:vir:45 3 ISFNTIPSNTLVPLFYAEMDNQAANTAQDSGASLLIGHANNGAEIVANSLV-LMPSADYARQICGAGSQLARMVEAYRQ- 80 (498) T ss_pred CchhhcCcccccCeEEEEEeCCCCCCCCCCcceEEEEecCCccccccceeE-EecCHHHHHHhcCcCcHHHHHHHHHHH- Confidence 7788999885432 2222222 222222222 444432 122333 467999999999999999999999997 Q ss_pred ccCCccCCCEEEEEeeecccceeeEeeccccccchhhheeeeeEEEEEEccceEEEEeeccccCchHHHHHhhhhhhee- Q lcl|NC_017984. 71 FRNATTRPNSLFITKYNLTDVPASLIGGDITSTTLADLKLINGTLTIVVDGVSKSVPVDLATANSYSDAAALIATALTL- 149 (487) Q Consensus 71 ~~~q~p~P~~l~igr~~~~~~~~~l~g~~~~~~~~~~~~~~~g~~~iti~g~~~~~~i~~s~ats~~~vA~~i~t~l~a- 149 (487) ..| =..||+=.... +....-.| .++ +.-.....|.+.+.|+|....+.+ ....+.+.+|+.+.+++.+ T Consensus 81 ---~n~-~~~l~~i~~~d-~aG~aA~g-~it---~tg~at~~G~l~l~Igg~~v~v~V--~~gdTaa~vA~al~aaina~ 149 (498) T protein:vir:45 81 ---TDP-FGELYVIAVPE-ATGAAATV-TLT---VTGEATESGTVNVYVGRTRVQAPV--TNGDNVTTIASSIQDAINAV 149 (498) T ss_pred ---hCC-cceEEEEeeCC-cccceeEE-EEE---eecccCCCcEEEEEECCEEEEEEe--cCCCCHHHHHHHHHHHHhCC Confidence 443 35666544332 11111111 111 122223579999999998776554 4555778899888888864 Q ss_pred ---eEEEecccceEEEEeccccc---ceeEEec--c--cchhhhhhhccccceeEecCcccccHHHHHHHHHhcccceeE Q lcl|NC_017984. 150 ---PCTYESTVKGFVIKSGTSGA---NSTISFA--T--GDISDDLKLTQETGAVLNNHTAADTPTTGALNALAFSQNFVN 219 (487) Q Consensus 150 ---~vt~d~~~~~f~its~t~g~---~stit~a--t--gd~a~~l~lt~~~gA~~~~G~aaet~~~al~a~~~~~~~wy~ 219 (487) .||........++|..-.|. +..+... . ++-...-+++-. -.....|.....+.++++++-+...+|.. 
T Consensus 150 ~~lPVTA~~~~~~VtlTAr~kG~~GN~I~l~~~~~~~~~ge~~p~Glt~~-itamagGag~PD~a~alaal~~~~~~~I~ 228 (498) T protein:vir:45 150 PTLPFTASSSAGVVTLTARHKGLCGNEIPVSLNYYGFGGGEVLPAGVQIA-VATGTAGTGAPVLTGAVAAMADEPFDYIG 228 (498) T ss_pred CCCceEEEecCceEEEEeeccCccccceeEEEeeccccccccccceeeEE-EEccCCCccCchhHHHHHHhccCCccEEE Confidence 24433333444555544432 2122111 1 011111112110 01122344444677777776655444443 Q ss_pred EEEEeccCChhHHHHHHHHHhcc-------CceEEEEEccccccccccchHHHHHHH-hCCcceEEEe-cC-C-Cch--- Q lcl|NC_017984. 220 ITYSEGVFNEDALKDLALWVTSQ-------NSRFKLYTWGLDPVALGQSGASFGEWA-KENTSGVVPL-YG-T-FDK--- 285 (487) Q Consensus 220 ~~~~~~~~~~~~i~a~A~w~~a~-------~~~~~~~~~~~~~~~~~~~~~~~~~l~-~~~~~~t~~~-y~-~-~~~--- 285 (487) ..+ .+.+.+.++..+.+.. +.++..+.... ..+- ..+..+. ..|..|..++ +. . +.+ T Consensus 229 ~p~----~D~asL~al~~~L~~~sgRw~~~~q~~g~~~~a~----~gT~-~~l~t~g~~~N~~~it~~~~~~~~~sp~~~ 299 (498) T protein:vir:45 229 LPF----NDTASVNTLVTEMNDTSGRWSYARQLYGHVYTAK----TGTL-SELVNAGDQFNQQHITLAGYEKETQTPADE 299 (498) T ss_pred Eee----CCHHHHHHHHHHHhhhhhhhhHHhhcCeEEEEec----cCCH-HHHHHhhhccCCceEEEEecCCCCCChHHH Confidence 333 3455666666666531 22222211111 1111 1122222 2355555443 32 1 212 Q ss_pred -HHHHHHHHHhcCcCcCCceeeeeeeecCccccc----CCCHHHHHHHHhCCceEEEEecCCCceEEEEECCEEc----- Q lcl|NC_017984. 286 -AAFFCGVSGSINYQEENGRTTTAFRSQDGLVPD----VTNEADAETLVKNGYSFYGAWATANDRFQFAGNGSVT----- 355 (487) Q Consensus 286 -~a~~~g~~as~~~~~~~gs~T~~fk~l~Gv~~~----~lt~t~~~al~~~~~n~y~~~~~~~~~~~~~~~G~~s----- 355 (487) ++.+.++++ .....+|.+ .+.=-.|+||.|. .++.+|.+.|-.+|+..|..- .+ +.. 
+.+..++ T Consensus 300 ~AAa~aa~~A-~~l~~DPAr-PL~tl~L~Gi~~p~~~~r~~~~ern~LL~~Gist~~V~--~G-~V~-I~R~ITTY~~n~ 373 (498) T protein:vir:45 300 LAASRTARAA-VFIRNDPAR-PTQTGELVGMLPAPKGKRFTMTEQQTLLSHGVATAYVE--SG-VLR-IQRDVTTYRKNA 373 (498) T ss_pred HHHHHHHHHH-HHhhccccc-ccCceeecceecCCchhcCChHHHHHHHhCCcceEEEc--CC-eEE-EEeeeeeeeecC Confidence 223333333 211223322 2333357788754 468999999999999888542 23 222 2333321 Q ss_pred -CC----ceehHHHHHHHHHHHHHHHHHHHHHHhcCCCCcCHh---------hHHHHHHHHHHHHHHHHhcCccccCccc Q lcl|NC_017984. 356 -GQ----YKWIDNFDFQVFLRTQLQLAYMNMFQAQKTIPYNDQ---------GIATVRAYSQDPIDQGINFGGIRAGVNL 421 (487) Q Consensus 356 -gg----~~~iD~~~~~dWl~~~iq~~l~~ll~~~~kIPyt~~---------G~~~l~~~v~~vl~~a~~nG~Ia~Gv~l 421 (487) |- +..|-.++-.+++...++..+-.-|-. .|+--+.. =-.+|++.+-..+++....|++.- + T Consensus 374 ~G~~D~syLDi~T~~tl~yvr~~~r~~i~~kfpR-~KLa~dg~~~~~gq~IvTp~~ir~ell~~y~~le~~givEn-~-- 449 (498) T protein:vir:45 374 YGVADNSYLDSETLHTSAYVLRKLKSVITSKYGR-HKLASDGTRFGPGQAIVTPAVIKGELLATYRQLERAGIVEN-Y-- 449 (498) T ss_pred CCCcchhhhhhhhHHHHHHHHHHHHHHhhhhcCC-eeecccCcccCCCCcccchHHHHHHHHHHHHhhhhhccccC-h-- Confidence 11 224567899999999999998876532 23332211 125788999999999999998852 1 Q ss_pred CccccccccccccCccccceeeeeeEEeccCCCHHHHhhcccC---CeEEEEEECCe Q lcl|NC_017984. 422 SNAQKFQVNQEAGFDAASQLFTKGWALSVTLPDSQTRVARESF---IIKLFYTDGSS 475 (487) Q Consensus 422 ~~~q~~~~~~~~g~~~~~~~~~~Gy~~~~~~~s~~dra~R~~~---~i~~~~~~aGA 475 (487) +.-.+.-+..-.+.|-.. +++-..++---.-|... 
.+.+.|..++| T Consensus 450 ~~~~~~LiVerd~~dpnR--------ln~~~p~d~vn~L~V~A~~~~f~lq~~~~~~ 498 (498) T protein:vir:45 450 ELFKQYLVVERDASVPNR--------LNTLFPPDYVNQLRVFAVVNQFRLQYSEESA 498 (498) T ss_pred hhhcceeEEEECCCCCcE--------EEEEecccccCchhhhhhhhhhheehhhcCC Confidence 000011111111111000 00000000000011111 23334444444 No 17 >protein:vir:102957 Length: 437 # NCBI annotation: sheath tail protein # Family: family:all:632 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945292;genbank:gi:39653727;uniprot:Q708M0;genbank:GeneID:2672871 Probab=98.74 E-value=6.9e-08 Score=59.85 Aligned_cols=393 Identities=15% Similarity=0.120 Sum_probs=192.0 Q ss_pred CCcCC------ccccceEEEee-eeecccccccccceeEEecCccee---eeeeccHHHHHHhcCCC--hHHHHHHHHHh Q lcl|NC_017984. 1 MQFNS------IPASNIAAVYP-AVIGGGGNPLGLNTNLFVQDAIYP---NYEYFSNTLVGQHYGLE--SPIYKFATVYF 68 (487) Q Consensus 1 ~~~~~------ip~s~iV~V~~-~~~~~~~~~~~~~~ll~~~~~~~~---~~~y~s~~~V~~~fg~~--s~ey~aA~~yF 68 (487) |+=.. +-=--+++..+ ...+.++..++.-+++... ..-| +..-+|-+|....||.. .+.|++.+.+| T Consensus 1 m~gg~~~~~~k~~PGvYi~~~~~~~~~i~~~~~~~~a~~~~~-~~Gp~~~~~~i~s~~d~~~~fG~~~~~~~~~~~~~~~ 79 (437) T protein:vir:10 1 MAGGIWKRQNKVRPGAYINVKSKDIAMTRLGGDGVVTVPLAL-SFGQSKKLMKIRRGEDLFKKLGYEQESPQLLLLNEAF 79 (437) T ss_pred CCcceecccceecCceeEEEecCCcceeeccCCcEEEEEEEe-cCCCCceeEEEecHHHHHHHcCCccchhHHHHHHHHh Confidence 33211 11123445332 2223334445543333321 2222 33467788999999965 45667777777 Q ss_pred hcccCCccCCCEEEEEeeecccceeeEeeccccccchhhheeeeeE----EEEEEccceEEEEeeccccCchHHHHHhhh Q lcl|NC_017984. 69 NGFRNATTRPNSLFITKYNLTDVPASLIGGDITSTTLADLKLINGT----LTIVVDGVSKSVPVDLATANSYSDAAALIA 144 (487) Q Consensus 69 ~g~~~q~p~P~~l~igr~~~~~~~~~l~g~~~~~~~~~~~~~~~g~----~~iti~g~~~~~~i~~s~ats~~~vA~~i~ 144 (487) . .+.++++-|-.. .+.+...- +. .+...+...|. ++++|.- +....+.+ .+. 
T Consensus 80 ~-------g~~~~~~~R~~~-g~~a~~tl---~~-~~~~~A~~~G~~gn~i~v~v~~-------~~~d~~~~-----~v~ 135 (437) T protein:vir:10 80 K-------RVSEVLLYRLNT-GEKANVSL---SD-NVTAQAKYSGVRGNDITVTVKT-------NVDDPSSF-----DVV 135 (437) T ss_pred c-------CCCEEEEEECCC-CceeeEee---cc-ceEEEeccCCcccceeEEEEee-------ccCCccce-----EEE Confidence 5 367899999643 22222110 00 00000111121 2222211 00000000 000 Q ss_pred hhheeeEEEecccc--eEEEEecccccce-eEEec-ccchhhhhhhccccceeEecCcc----cccHHHHHHHHHhcccc Q lcl|NC_017984. 145 TALTLPCTYESTVK--GFVIKSGTSGANS-TISFA-TGDISDDLKLTQETGAVLNNHTA----ADTPTTGALNALAFSQN 216 (487) Q Consensus 145 t~l~a~vt~d~~~~--~f~its~t~g~~s-tit~a-tgd~a~~l~lt~~~gA~~~~G~a----aet~~~al~a~~~~~~~ 216 (487) ++..... .+.+......... .+.+. .+.++ ...++....|.+ .+...++|++++...-+ T Consensus 136 -------~~~~~~~~d~~~v~~~~~~~~n~~v~~~~~~~l~------~~a~~~LtGG~dg~~t~~dy~~al~~le~~~~n 202 (437) T protein:vir:10 136 -------TFLDTVVMDLQTVKVLADLKNNALVEFSGTGELQ------PVAGAKLTGGTDGAISTQDYLEYFKALETVEFN 202 (437) T ss_pred -------EecCcceeeeeehhhhhhhhhhcccccccccccc------cccceeeeccccCCCChhHHHHHHHHhccCcce Confidence 0000000 0001100000000 00000 01000 011112222222 34567888888766544 Q ss_pred eeEEEEEeccCChhHHHHHHHHHhc----cCceEEEEEccccccccccchHHHHHHHhCCcceEEEe-----cCCCchHH Q lcl|NC_017984. 217 FVNITYSEGVFNEDALKDLALWVTS----QNSRFKLYTWGLDPVALGQSGASFGEWAKENTSGVVPL-----YGTFDKAA 287 (487) Q Consensus 217 wy~~~~~~~~~~~~~i~a~A~w~~a----~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~t~~~-----y~~~~~~a 287 (487) | +++.. .+.....++..|++. ..+++..+......... +.. +....... 
|.....++ T Consensus 203 ~--l~~~~--~d~~~~t~~~~~ik~~r~~~g~~~~~V~~~~~~d~e-------~Ii---n~~n~~~~~~~~~~~~~~~~a 268 (437) T protein:vir:10 203 Y--MALPV--EDASIKKAAINFIKRMREDEGLGAQLVVADSDADSE-------AVI---NVKNGVILSDKTVIDKTKATV 268 (437) T ss_pred E--EEecC--CChhHHHHHHHHHHHHHhccCceEEEEeCCCCCCCc-------eEE---EeecceeecCcceechhhHHH Confidence 4 33332 345566789999874 24555555433211100 000 00000011 12222344 Q ss_pred HHHHHHHhcCcCcCCceeeeeeeecCccc-c-cCCCHHHHHHHHhCCceEEEEecCCCceEEEEECCEEc----C----- Q lcl|NC_017984. 288 FFCGVSGSINYQEENGRTTTAFRSQDGLV-P-DVTNEADAETLVKNGYSFYGAWATANDRFQFAGNGSVT----G----- 356 (487) Q Consensus 288 ~~~g~~as~~~~~~~gs~T~~fk~l~Gv~-~-~~lt~t~~~al~~~~~n~y~~~~~~~~~~~~~~~G~~s----g----- 356 (487) .+.|..|+...++ + .-||.++|+. . ..++.+|++.+.++|...+..- +..+.+ .+|.-+ + T Consensus 269 ~vAG~~Ag~~~~~---S--~t~~~~~~~~~v~~~~t~~e~~~~i~~G~~vl~~~---~~~v~i-~~gInTltt~~~~~~~ 339 (437) T protein:vir:10 269 WVAAASANAGVEK---S--LTYEKYEDSVDVVGRLSHTETEDALLKGQFVFTAR---RGRAVV-EQDINSHVSFTIEKNQ 339 (437) T ss_pred HHHHHhccCcccc---C--ccccccCCcccccccCCHHHHHHHHhCCcEEEEEe---CCeEEE-EEccccccccCCCCCc Confidence 5566666654332 3 3478899874 3 5789999999999999887543 233444 345311 1 Q ss_pred CceehHHHHHHHHHHHHHHHHHHHHHHhcCCCCcCHhhHHHHHHHHHHHHHHHHhcCccccCcccCccccccccccccCc Q lcl|NC_017984. 357 QYKWIDNFDFQVFLRTQLQLAYMNMFQAQKTIPYNDQGIATVRAYSQDPIDQGINFGGIRAGVNLSNAQKFQVNQEAGFD 436 (487) Q Consensus 357 g~~~iD~~~~~dWl~~~iq~~l~~ll~~~~kIPyt~~G~~~l~~~v~~vl~~a~~nG~Ia~Gv~l~~~q~~~~~~~~g~~ 436 (487) .+..|-.++-.|.+.+.++..+-+.++ +|+|=+..|..++++.|...|++..+.|.|.+.... 
T Consensus 340 ~~~ki~vir~~D~i~~di~~~~~~~yi--Gk~~N~~~~r~~~~~~i~~yl~~l~~~g~I~~~~~~--------------- 402 (437) T protein:vir:10 340 DFRKNRILRTLDDIVNDTRYAFSEYFL--GKVSNNEDGRQAFKANRIRYFKDLEARGAIEDFKVE--------------- 402 (437) T ss_pred hhhhhhHHHHHHHHHHHHHHHHHhccc--cccCCCHHHHHHHHHHHHHHHHHHHhCCCccCCCce--------------- Confidence 112356677777777777665554444 699999999999999999999999999999753100 Q ss_pred cccceeeeeeEEeccCCCHHHHhhcccCCeEEEEEECCeEEEEEEEEEeeC Q lcl|NC_017984. 437 AASQLFTKGWALSVTLPDSQTRVARESFIIKLFYTDGSSMQRLEMTATNVQ 487 (487) Q Consensus 437 ~~~~~~~~Gy~~~~~~~s~~dra~R~~~~i~~~~~~aGAIh~v~i~gt~vq 487 (487) + +.+.+. ++ +..--+++.++.-.++.++.++. .|| T Consensus 403 ---d-------~~v~~~--~~---~~~v~v~~~v~~~dame~iy~ti-~v~ 437 (437) T protein:vir:10 403 ---D-------IEVLRG--EL---KESVVVNVKVKPVDSMEKLYMTV-TVE 437 (437) T ss_pred ---e-------EEeecC--CC---CCEEEEEEEEEEeeeeeeEEEEE-Eec Confidence 0 011111 11 11223889999999999999996 677 No 18 >protein:vir:105470 Length: 451 # NCBI annotation: putative phage sheath tail protein # Family: family:all:632 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529880;genbank:gi:90592620;genbank:GeneID:3974534 Probab=98.56 E-value=2.9e-07 Score=56.45 Aligned_cols=409 Identities=12% Similarity=0.041 Sum_probs=190.7 Q ss_pred CCcCC-ccccceEE-Eeeeeecccccc-ccc--ceeEEec-----CcceeeeeeccHHHHHHhcC--CChHHHHHHHHHh Q lcl|NC_017984. 1 MQFNS-IPASNIAA-VYPAVIGGGGNP-LGL--NTNLFVQ-----DAIYPNYEYFSNTLVGQHYG--LESPIYKFATVYF 68 (487) Q Consensus 1 ~~~~~-ip~s~iV~-V~~~~~~~~~~~-~~~--~~ll~~~-----~~~~~~~~y~s~~~V~~~fg--~~s~ey~aA~~yF 68 (487) |+=.. .-..|+.= |-+...+.+..+ .+. 
.+.++.+ -+..|+ .-.+-+|....|| .+++.|++-+.+| T Consensus 1 magg~~~~~~K~~PGvYi~~~~~~~~~~~~~~~~~~~~i~~~~~~g~~~~v-~i~~~~d~~~~fG~~~~~~~~~~~~~~~ 79 (451) T protein:vir:10 1 MAGGTWKAQDKRRPGTYINVVGNGQREAASSLGRVLLIRDKGLGWGKNGVI-EVEANSDFTKKLGTTLDDPSLTALKETL 79 (451) T ss_pred CCceeeccceeecCceEEEEeccCcceeeccCCcEEEEEeeecCCCCcccE-EeecHHHHHHHcCCcccchhHHHHHHHh Confidence 43111 12233211 333333332211 122 1222222 222333 3566688888999 4466788888888 Q ss_pred hcccCCccCCCEEEEEeeecccceeeEeeccccccchhhheeeeeE----EEEEEccc-----eEEEEe--eccccCchH Q lcl|NC_017984. 69 NGFRNATTRPNSLFITKYNLTDVPASLIGGDITSTTLADLKLINGT----LTIVVDGV-----SKSVPV--DLATANSYS 137 (487) Q Consensus 69 ~g~~~q~p~P~~l~igr~~~~~~~~~l~g~~~~~~~~~~~~~~~g~----~~iti~g~-----~~~~~i--~~s~ats~~ 137 (487) . .|.++++-|-.. .+.+...- ....+...+...|. ++++|.-. .+.+.+ +-......+ T Consensus 80 ~-------g~~~v~~yrl~~-g~~a~~t~---~~~~~~~~Aky~G~~Gn~i~v~v~~~~~d~~~~~v~t~~g~~~vd~qt 148 (451) T protein:vir:10 80 K-------GASKVLVLNPNE-GTAATLTK---EGLPWTVTANYPGEKGNQITVSVEVSPADQNAATVSTIFGTKLVDEQS 148 (451) T ss_pred c-------CCcEEEEEEcCC-CceEEEEe---ecCceEEEEeeCCcCCceEEEEEecccCCcCceEEEEEECCeEEEEEE Confidence 5 367899988643 22222111 00011111111222 44433211 111110 000000000 Q ss_pred HHHHhhhhhheeeEEEecccceEEEEecccccceeEEecccchhhhhhhccccceeEecCcccccHHHHHHHHHhcccce Q lcl|NC_017984. 138 DAAALIATALTLPCTYESTVKGFVIKSGTSGANSTISFATGDISDDLKLTQETGAVLNNHTAADTPTTGALNALAFSQNF 217 (487) Q Consensus 138 ~vA~~i~t~l~a~vt~d~~~~~f~its~t~g~~stit~atgd~a~~l~lt~~~gA~~~~G~aaet~~~al~a~~~~~~~w 217 (487) .-... ...+. +...-.|.......+... ....++.. 
......+...+...+++.++....-+| T Consensus 149 v~~~~-~~el~-----~nd~V~a~~~~~g~~~~~----------~~~~l~~~-~~gg~~~~~~~~~~~~l~~~e~~~~n~ 211 (451) T protein:vir:10 149 IKFNE-LDKFK-----GNDYITAKVVEEGSSKPV----------AFTNVSGT-LTGGTTTESNKVESLLNDALENEEYAV 211 (451) T ss_pred eeccc-hhhcc-----CCceEEEEecccccccce----------eeeecccc-cccccccCCccchHHHHHHhccceeeE Confidence 00000 00000 000001111111111000 00001000 001112234556667777777766555 Q ss_pred eEEEEEeccCChhHHHHHHHHHhc----cCceEEEEEccccccccccchHHHHHHH-hCC---cceEEEecCCCchHHHH Q lcl|NC_017984. 218 VNITYSEGVFNEDALKDLALWVTS----QNSRFKLYTWGLDPVALGQSGASFGEWA-KEN---TSGVVPLYGTFDKAAFF 289 (487) Q Consensus 218 y~~~~~~~~~~~~~i~a~A~w~~a----~~~~~~~~~~~~~~~~~~~~~~~~~~l~-~~~---~~~t~~~y~~~~~~a~~ 289 (487) +.+.....+.+....+..|+.. ..+++..+....... ..+. .+... .++ ..+ ..|.....++.+ T Consensus 212 --l~~~~~~~~~~i~~~~~a~ik~~r~~~g~~~~aVl~~~~~~-~~d~---egiinv~n~~~~~dg--~~~~~~~~~~~v 283 (451) T protein:vir:10 212 --VTTAGFEPSSNMNKLVVEAVKRLRENEGRKVRGVIPTDADT-TYNY---EGISTVVNGYTLSDG--TNVDVKDATGYF 283 (451) T ss_pred --EEEccCCCchHHHHHHHHHHHHHHHhcCCeEEEEecCccCC-CCCC---cceEEeecceEecCc--eeechhhhHHHH Confidence 3333222233345568899985 245555544321110 0000 00000 000 000 112233344556 Q ss_pred HHHHHhcCcCcCCceeeeeeeecCccc-c-cCCCHHHHHHHHhCCceEEEEecCCCceEEEEECCEEc----C-----Cc Q lcl|NC_017984. 290 CGVSGSINYQEENGRTTTAFRSQDGLV-P-DVTNEADAETLVKNGYSFYGAWATANDRFQFAGNGSVT----G-----QY 358 (487) Q Consensus 290 ~g~~as~~~~~~~gs~T~~fk~l~Gv~-~-~~lt~t~~~al~~~~~n~y~~~~~~~~~~~~~~~G~~s----g-----g~ 358 (487) .|..|+..++ .++ -||.++|+. . ..++.+|+..+.++|..++..-.+. .+.+. 
+|.-+ + .+ T Consensus 284 AG~~Ag~~~~---~S~--T~~~~~~~~~v~~~~t~~e~~~~i~~G~lvl~~~~g~--~v~i~-~~INTltt~~~~k~~~~ 355 (451) T protein:vir:10 284 AGISASADVA---TSL--TYFEVEDAVSAYPKFDNEKTIKALDAGQIVFTTRPGQ--RVVIE-QDINSLHKFTAEKPQAF 355 (451) T ss_pred HHHHcccccc---cCc--cceecCCceeeeeeCCHHHHHHHHhCCeEEEEEEcCC--eEEEE-EccccceecCCCCCcch Confidence 6777765433 333 466888863 3 5799999999999998766433332 23333 45322 1 12 Q ss_pred eehHHHHHHHHHHHHHHHHHHHHHHhcCCCCcCHhhHHHHHHHHHHHHHHHHhcCccccCcccCccccccccccccCccc Q lcl|NC_017984. 359 KWIDNFDFQVFLRTQLQLAYMNMFQAQKTIPYNDQGIATVRAYSQDPIDQGINFGGIRAGVNLSNAQKFQVNQEAGFDAA 438 (487) Q Consensus 359 ~~iD~~~~~dWl~~~iq~~l~~ll~~~~kIPyt~~G~~~l~~~v~~vl~~a~~nG~Ia~Gv~l~~~q~~~~~~~~g~~~~ 438 (487) ..|-+++-.|-+.+.++..+-+.++ +|+|=+..|..++++.|+.-|++..+.|.|.++... +. T Consensus 356 ~ki~vir~~D~i~~di~~~~~~~yi--Gk~~N~~~gr~~~~~~i~~yl~~l~~~g~i~~~~~~--------------d~- 418 (451) T protein:vir:10 356 SKNRVIRTLDEIATNTENTFERTYL--GNVGNNAAGRDLFKADRIAYLTSLQNRNMIQSFANT--------------DI- 418 (451) T ss_pred hhhhHHHHHHHHHHHHHHHhhhccc--eecCCCHHHHHHHHHHHHHHHHHHHhCCCccCCCcc--------------ce- Confidence 2366777777777777664433333 699999999999999999999999999999764311 00 Q ss_pred cceeeeeeEEeccCCCHHHHhhcccCCeEEEEEECCeEEEEEEEEEeeC Q lcl|NC_017984. 439 SQLFTKGWALSVTLPDSQTRVARESFIIKLFYTDGSSMQRLEMTATNVQ 487 (487) Q Consensus 439 ~~~~~~Gy~~~~~~~s~~dra~R~~~~i~~~~~~aGAIh~v~i~gt~vq 487 (487) .+.... .+..--+++..+.-.+|.++.++. 
.|| T Consensus 419 ----------~v~~~~-----~~~~v~v~~~v~pvdame~iy~t~-~v~ 451 (451) T protein:vir:10 419 ----------TVEAGN-----DMDSIVVNLAVTPVDAMEKLYMTM-VVR 451 (451) T ss_pred ----------EEeecC-----CCCEEEEEEEEEEEeeeeeEEEEE-EEc Confidence 011101 122234888899999999999996 677 No 19 >protein:vir:78986 Length: 436 # NCBI annotation: putative sheath tail protein # Family: family:all:632 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110731;genbank:gi:134287348;genbank:GeneID:4955160 Probab=98.55 E-value=3e-07 Score=56.36 Aligned_cols=395 Identities=14% Similarity=0.083 Sum_probs=195.9 Q ss_pred CCcCCc---cccc-----eEEEeeee-ecccccccccceeEEecCcceeeee---ecc---HHHHHHhcCCC--hHHHHH Q lcl|NC_017984. 1 MQFNSI---PASN-----IAAVYPAV-IGGGGNPLGLNTNLFVQDAIYPNYE---YFS---NTLVGQHYGLE--SPIYKF 63 (487) Q Consensus 1 ~~~~~i---p~s~-----iV~V~~~~-~~~~~~~~~~~~ll~~~~~~~~~~~---y~s---~~~V~~~fg~~--s~ey~a 63 (487) |+|.+- -..| ++|+...- ...+...|+.-++.+. ..--|+.+ -++ ..++...||.+ .|..+. T Consensus 1 ~~magg~~~~~~K~~PG~Y~n~~~~~~~~~~~~~rGi~a~p~~-~~wGp~~~v~~i~~~~~~~~~~~~~G~~~~~~~~~~ 79 (436) T protein:vir:78 1 MALGGGTFVTQNKVLPGSYINFVSATRATSSLSDRGIVAMPLE-LDWGIDEEVFQVTSDDFEKYSTKYFGYDYTHEKLKG 79 (436) T ss_pred CcccceeeccceeecCceEEEEEecCcceeeccCCeEEEEEEE-ecCCCCceeEEeecccchHHHHHHhcCccchHHHHH Confidence 988873 2333 23332211 2223344554444443 22333322 233 34677779975 344455 Q ss_pred HHHHhhcccCCccCCCEEEEEeeecccceeeEeeccccccchhhheeeee-EEEEEEccc-----eEEEEeeccccCchH Q lcl|NC_017984. 64 ATVYFNGFRNATTRPNSLFITKYNLTDVPASLIGGDITSTTLADLKLING-TLTIVVDGV-----SKSVPVDLATANSYS 137 (487) Q Consensus 64 A~~yF~g~~~q~p~P~~l~igr~~~~~~~~~l~g~~~~~~~~~~~~~~~g-~~~iti~g~-----~~~~~i~~s~ats~~ 137 (487) .+.+|. .|++|++-|......++.. + .++--...-| .++|+|.-. .+.+.+=+.+.--.. 
T Consensus 80 l~~~~~-------~~~tv~~yrl~~G~~a~~~----v---~~Aky~g~~gn~i~v~v~~~~~d~~~~dv~~~~g~~~~d~ 145 (436) T protein:vir:78 80 LRDLFK-------NIRLGYFYKLNKGVKASCS----I---ATARCSGIRGNDLKVIVTTNIDDNAKFDVVTLLDNKKVDT 145 (436) T ss_pred HHHHhc-------CCCEEEEEECCCcceeeee----e---eeeecCCCCCcEEEEEecccccccCceEEEEEecchhhhh Confidence 666775 5789999997532111111 0 1111112223 355555332 122211000000011 Q ss_pred HHHHhhhhhheeeEEEecccceEEEEecccccceeEEecccchhhhhhhccccceeEecCc--ccccHHHHHHHHHhccc Q lcl|NC_017984. 138 DAAALIATALTLPCTYESTVKGFVIKSGTSGANSTISFATGDISDDLKLTQETGAVLNNHT--AADTPTTGALNALAFSQ 215 (487) Q Consensus 138 ~vA~~i~t~l~a~vt~d~~~~~f~its~t~g~~stit~atgd~a~~l~lt~~~gA~~~~G~--aaet~~~al~a~~~~~~ 215 (487) .++..+ ..+. .+.|+ +...+|. +...++ ..|+.+ ..|. ..+...++|++++... T Consensus 146 ~~~~~~-~~l~--------~n~~V-~~~~~g~---la~~a~-----~~LtGG-----~dG~~~T~~dy~~al~~le~~~- 201 (436) T protein:vir:78 146 QIAKVI-TELQ--------DNDYV-TWKKEAT---LEATAG-----LTFTNG-----TNGEAVTGTEYQAFLDKIESYS- 201 (436) T ss_pred hhHHHH-hhcc--------CCceE-EEEeccc---ccccce-----eeeecc-----ccccccchHHHHHHHHHHcccc- Confidence 111111 1111 12221 1111111 000000 001111 1121 2456778888887664 Q ss_pred ceeEEEEEeccCChhHHHHHHHHHhc----cCceEEEEEccccccccccchHHHHHHH-hCCcceEEEecCCCchHHHHH Q lcl|NC_017984. 216 NFVNITYSEGVFNEDALKDLALWVTS----QNSRFKLYTWGLDPVALGQSGASFGEWA-KENTSGVVPLYGTFDKAAFFC 290 (487) Q Consensus 216 ~wy~~~~~~~~~~~~~i~a~A~w~~a----~~~~~~~~~~~~~~~~~~~~~~~~~~l~-~~~~~~t~~~y~~~~~~a~~~ 290 (487) |..+++.. .+++....++.|+.. .++++..+.......... +.+. .++..++ .|.....++.+. 
T Consensus 202 -fn~l~~~~--~d~~~~~~~~a~ikr~re~~g~~~~aV~~~~~~~d~E------gIInv~n~v~g~--~~~~~~~~a~vA 270 (436) T protein:vir:78 202 -FNALGCLA--TTAEIKSLFVEFTKRMRDKVGAKFQTVLYKKNDADYE------GVVSVENKIKDT--GLLESSLIYWTT 270 (436) T ss_pred -eeEEEecC--CChHHHHHHHHHHHHHHhhcCCeEEEEecCCCCCCCc------eEEEeecccCCc--eechhHHHHHHH Confidence 54454443 345566788999884 345666554332111111 1100 0111111 122223344556 Q ss_pred HHHHhcCcCcCCceeeeeeeecCccc-c-cCCCHHHHHHHHhCCceEEEEecCCCceEEEEECCEEc----CC-----ce Q lcl|NC_017984. 291 GVSGSINYQEENGRTTTAFRSQDGLV-P-DVTNEADAETLVKNGYSFYGAWATANDRFQFAGNGSVT----GQ-----YK 359 (487) Q Consensus 291 g~~as~~~~~~~gs~T~~fk~l~Gv~-~-~~lt~t~~~al~~~~~n~y~~~~~~~~~~~~~~~G~~s----gg-----~~ 359 (487) |..|+..+ +.++| ||.++|+. . ..++.+|++.+.++|...+..- + ....+ .+|+-+ +. +. T Consensus 271 G~~Ag~~~---~~S~T--~~~~~~~~~v~~~~t~~e~~~ai~~G~lvl~~d-~--~~v~I-~~~VNTltt~~~~k~~~~~ 341 (436) T protein:vir:78 271 GAIAGCDI---NKSNT--NKRYDGEFDVDVNYTQIHLEEALKTGKFIFHKV-G--DEVHV-LEDINTFVSFTDEKNDDFS 341 (436) T ss_pred HHHhcCcc---ccCcc--ceecCccccccccCCHHHHHHHHhCCeEEEEEe-C--CeEEE-EEccccceecCCCCCcchh Confidence 66666543 33433 77888873 3 4699999999999998777532 2 23333 445422 11 12 Q ss_pred ehHHHHHHHHHHHHHHHHHHHHHHhcCCCCcCHhhHHHHHHHHHHHHHHHHhcCccccCcccCccccccccccccCcccc Q lcl|NC_017984. 360 WIDNFDFQVFLRTQLQLAYMNMFQAQKTIPYNDQGIATVRAYSQDPIDQGINFGGIRAGVNLSNAQKFQVNQEAGFDAAS 439 (487) Q Consensus 360 ~iD~~~~~dWl~~~iq~~l~~ll~~~~kIPyt~~G~~~l~~~v~~vl~~a~~nG~Ia~Gv~l~~~q~~~~~~~~g~~~~~ 439 (487) -|-.++-.|-+.+.++..+-+.++ +|+|=+.+|..++.+.|+.-|++..+.|.|.+.. . .|. T Consensus 342 kI~vir~~D~i~~di~~~~~~~yi--GKv~N~~dgr~~l~~~i~~yl~~L~~~g~I~~f~---~-----------~Dv-- 403 (436) T protein:vir:78 342 SNQSVRVLDQIANDIATLFNTKYL--GEVPNDKSGRISFWNDVVKHHEQLQNMRAIEDFK---A-----------DDV-- 403 (436) T ss_pred hhhHHHHHHHHHHHHHHHhhhccc--cccCCCHHHHHHHHHHHHHHHHHHHhCCcccCCC---C-----------cce-- Confidence 366777777777777665444333 6999999999999999999999999999996421 0 010 Q ss_pred ceeeeeeEEeccCCCHHHHhhcccCCeEEEEEECCeEEEEEEEEEee Q lcl|NC_017984. 
440 QLFTKGWALSVTLPDSQTRVARESFIIKLFYTDGSSMQRLEMTATNV 486 (487) Q Consensus 440 ~~~~~Gy~~~~~~~s~~dra~R~~~~i~~~~~~aGAIh~v~i~gt~v 486 (487) .+ .+.+ . +..--+++..+.-.|+.++.++..+= T Consensus 404 ---------~v---~~~~-~-~~~v~v~~~v~pvdamekiy~ti~v~ 436 (436) T protein:vir:78 404 ---------SV---EPGS-D-KKTVVVSDAVKVISAMSKLYMTVSVS 436 (436) T ss_pred ---------EE---eecC-C-CCEEEEEEEEEEEEeeeeEEEEEEEC Confidence 01 0000 1 11223778888889999999988766 No 20 >protein:vir:99306 Length: 587 # NCBI annotation: putative major tail sheath protein # Family: family:all:2449 # MgeID: mge:1655 # MgeName: K # Cross-refs: genbank:acc:YP_024479;genbank:gi:48696438;genbank:GeneID:2948034 Probab=98.36 E-value=1.1e-06 Score=53.29 Aligned_cols=434 Identities=13% Similarity=0.109 Sum_probs=211.2 Q ss_pred CCcCCccccceEE--Eeeeeeccccc---ccccceeEEec-Cccee---eeeeccHHHHHHhcCCChHHHHHHHHHhhcc Q lcl|NC_017984. 1 MQFNSIPASNIAA--VYPAVIGGGGN---PLGLNTNLFVQ-DAIYP---NYEYFSNTLVGQHYGLESPIYKFATVYFNGF 71 (487) Q Consensus 1 ~~~~~ip~s~iV~--V~~~~~~~~~~---~~~~~~ll~~~-~~~~~---~~~y~s~~~V~~~fg~~s~ey~aA~~yF~g~ 71 (487) |+..--|--+++. |.+....-+.. ..+.+.+.|.+ ...-| +..+++-+|.-+-||.+. --.+....|..+ T Consensus 1 ~a~~~~~~~~~~~pgv~~~~~~~~~~~~~~~~~~~~~~~g~a~~G~~~~~~~~~~~~~~~~~~~~g~-l~~~~~~a~~~~ 79 (587) T protein:vir:99 1 MAVEPFPRRPITRPHASIEVDTSGIGGSAGSSEKVFCLIGQAEGGEPNTVYELRNYSQAKRLFRSGE-LLDAIELAWGSN 79 (587) T ss_pred CcccccCCcccccCceEEEEecCCccccCCCCCceEEEEEEecCCccceeEEeccHHHHHHHhcCcc-hHHHHHHHhccc Confidence 8866567777766 55555443332 23344454443 33333 345899999999999875 555666666522 Q ss_pred cCCccCCCEEEEEeeecccceeeEeecccc--ccc----hhhhe------eeeeEE----------------------EE Q lcl|NC_017984. 72 RNATTRPNSLFITKYNLTDVPASLIGGDIT--STT----LADLK------LINGTL----------------------TI 117 (487) Q Consensus 72 ~~q~p~P~~l~igr~~~~~~~~~l~g~~~~--~~~----~~~~~------~~~g~~----------------------~i 117 (487) . ...++++|+-|-. .+.++.+.-+.+. +.. -..++ .++++. 
+| T Consensus 80 ~--~~g~~~~~~~rv~-~~~~a~~~~~~l~~~a~~~G~~gN~i~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~v~~i 156 (587) T protein:vir:99 80 P--NYTAGRILAMRIE-DAKPASAEIGGLKITSKIYGNVANNIQVGLEKNTLSDSLRLRVIFQDDRFNEVYDNIGNIFTI 156 (587) T ss_pred c--CCCceEEEEEEcC-CCceeEEEecCeEEEEeeccccccceEEEEccCCCCcceeEEEEEecccceeeeeeccceeeE Confidence 2 2366889998873 3333433222221 000 00000 011221 12 Q ss_pred EEccceEE-------------E-------------Eeecccc--CchHHHHHhhhhh--heee----------EEEeccc Q lcl|NC_017984. 118 VVDGVSKS-------------V-------------PVDLATA--NSYSDAAALIATA--LTLP----------CTYESTV 157 (487) Q Consensus 118 ti~g~~~~-------------~-------------~i~~s~a--ts~~~vA~~i~t~--l~a~----------vt~d~~~ 157 (487) .-.|...+ . .+.|... .........|.+- +.+. ..+-... T Consensus 157 ~y~g~~~~a~~~v~~~~~t~~a~~~~l~~g~~~v~~yrL~~g~~~~~~~~~~~i~~~~~~tAky~~~~~~~i~~~~~~~~ 236 (587) T protein:vir:99 157 KYKGEEANATFSVEHDEETQKASRLVLKVGDQEVKSYDLTGGAYDYTNAIITDINQLPDFEAKLSPFGDKNLESSKLDKI 236 (587) T ss_pred EeecccccceeeEeecCcceeeeeeeeecCCceeEEEEecCCchHHHHHHHhhhccccceeEEeeccCCceeEeeccccc Confidence 11221110 0 0111110 0001111111110 0010 0000000 Q ss_pred ceEEEEeccc------cc-------ce--eEEecccc------h-------------hhhhhhcccc-ceeEecCcc--- Q lcl|NC_017984. 158 KGFVIKSGTS------GA-------NS--TISFATGD------I-------------SDDLKLTQET-GAVLNNHTA--- 199 (487) Q Consensus 158 ~~f~its~t~------g~-------~s--tit~atgd------~-------------a~~l~lt~~~-gA~~~~G~a--- 199 (487) ..|.++.... ++ .. .++...++ . ....+..... ......|.+ T Consensus 237 ~~~~v~~~~~~v~a~~~D~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~t~LtGG~dG~~ 316 (587) T protein:vir:99 237 ENANIKDKAVYVKAVFGDLEKQTAYNGIVSFEQLNAEGEVPSNVEVEAGEESATVTATSPIKTIEPFELTKLKGGTNGEP 316 (587) T ss_pred ccceeeeeeeeeehhccceeeecccceeeeeeecccccchhhhhhhhhccccceeeeeccccceecccceeeecCCCCCc Confidence 1112221100 00 00 00000000 0 0000000000 111333433 Q ss_pred cccHHHHHHHHHhcccceeEEEEEeccCChhHHHHHHHHHhcc---CceEEEEEccccccccccchHHHHHHHhCCcceE Q lcl|NC_017984. 
200 ADTPTTGALNALAFSQNFVNITYSEGVFNEDALKDLALWVTSQ---NSRFKLYTWGLDPVALGQSGASFGEWAKENTSGV 276 (487) Q Consensus 200 aet~~~al~a~~~~~~~wy~~~~~~~~~~~~~i~a~A~w~~a~---~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~t 276 (487) +++..++++++..+ +|+.+...+ .+...+.++..|++.. .++...+...... .+.......-...++.|. T Consensus 317 ~~sy~~al~ale~~--~~~~i~~~t--~d~~i~a~l~a~vk~~r~~g~~~~aVlg~~~~---~~~~~~~~~a~~~n~e~v 389 (587) T protein:vir:99 317 PATWADKLDKFAHE--GGYYIVPLS--SKQSVHAEVASFVKERSDAGEPMRAIVGGGFN---ESKEQLFGRQASLSNPRV 389 (587) T ss_pred cccHHHHHHHHhhC--CcEEEEecC--CCHHHHHHHHHHHHHHHhCCCcEEEEecCCCC---CCHHHHHHHhhhcCCCcE Confidence 44567888888775 455554322 2334445699998652 2333333322111 111112222233466665 Q ss_pred EEecCC-----------Cch----HHHHHHHHHhcCcCcCCceeeeeeeecC--cccccCCCHHHHHHHHhCCceEEEEe Q lcl|NC_017984. 277 VPLYGT-----------FDK----AAFFCGVSGSINYQEENGRTTTAFRSQD--GLVPDVTNEADAETLVKNGYSFYGAW 339 (487) Q Consensus 277 ~~~y~~-----------~~~----~a~~~g~~as~~~~~~~gs~T~~fk~l~--Gv~~~~lt~t~~~al~~~~~n~y~~~ 339 (487) ..+... .++ ++++.|..++.++.+ ++| ||.++ ++. ..++.+|++.+..+|++.+... T Consensus 390 i~v~~~~~~~~~dg~~~~~~~~~~aa~vAGl~Ag~~~~~---SlT--~~~i~~~~v~-~~~t~~e~e~li~~Gvl~l~~~ 463 (587) T protein:vir:99 390 SLVANSGTFVMDDGRKNHVPAYMVAVALGGLASGLEIGE---SIT--FKPLRVSSLD-QIYESIDLDELNENGIISIEFV 463 (587) T ss_pred EEEeccceEecCCCceeeechHHHHHHHHHHHhcCchhc---Ccc--ceeeeccccc-ccCCHHHHHHHHhCCeEEEEEe Confidence 433211 022 345556666665433 333 34444 444 4799999999999999988766 Q ss_pred cCCCce-EEEEECCEEc----CCcee--hHHHHHHHHHHHHHHHHHHHHHHhcCCCCcCHhhHHHHHHHHHHHHHHHHhc Q lcl|NC_017984. 340 ATANDR-FQFAGNGSVT----GQYKW--IDNFDFQVFLRTQLQLAYMNMFQAQKTIPYNDQGIATVRAYSQDPIDQGINF 412 (487) Q Consensus 340 ~~~~~~-~~~~~~G~~s----gg~~~--iD~~~~~dWl~~~iq~~l~~ll~~~~kIPyt~~G~~~l~~~v~~vl~~a~~n 412 (487) .++... ..+. ++..+ .++.| |-.++-.|.+.+.++..+-+.+.- | |=++.|...|++.|+..|++-.+. 
T Consensus 464 ~~~~~~~vriv-~~ItT~t~~~~~~~~~i~viRv~D~i~~di~~~~~~~yiG--k-~Nn~~~r~~i~~~i~~~L~~l~~~ 539 (587) T protein:vir:99 464 RNRTNTFFRIV-DDVTTFNDKSDPVKAEMAVGEANDFLVSELKVQLEDQFIG--T-RTINTSASIIKDFIQSYLGRKKRD 539 (587) T ss_pred cCCcceEEEEe-eceeeccCCCCchhhhhhhhhhHHHHHHHHHHHHHhhCCc--c-ccchHHHHHHHHHHHHHHHHHHhC Confidence 554322 2332 23222 23334 678999999999998887776654 3 567899999999999999999999 Q ss_pred CccccCcccCccccccccccccCccccceeeeeeEEeccCCCHHHHhhcccCCeEEEEEECCeEEEEEEEEEeeC Q lcl|NC_017984. 413 GGIRAGVNLSNAQKFQVNQEAGFDAASQLFTKGWALSVTLPDSQTRVARESFIIKLFYTDGSSMQRLEMTATNVQ 487 (487) Q Consensus 413 G~Ia~Gv~l~~~q~~~~~~~~g~~~~~~~~~~Gy~~~~~~~s~~dra~R~~~~i~~~~~~aGAIh~v~i~gt~vq 487 (487) |.|.-. +..+ + .+. ...|+ . -+++.+..-.+|++|-++.+.-| T Consensus 540 gaI~~~-~~~d-----------------v-------~v~--~~~d~---~--~v~~~v~Pv~~mekIy~tv~~~~ 582 (587) T protein:vir:99 540 NEIQDF-PAED-----------------V-------QVI--VEGNE---A--RISMTVYPIRSFKKISVSLVYKQ 582 (587) T ss_pred CcccCC-Cccc-----------------e-------EEE--ecCCE---E--EEEEEEEEcccceEEEEEEEEEe Confidence 999532 1100 0 010 01121 1 37889999999999999999988 No 21 >protein:vir:95741 Length: 587 # NCBI annotation: ORF011 # Family: family:all:2449 # MgeID: mge:1577 # MgeName: G1 # Cross-refs: genbank:acc:YP_240910;genbank:gi:66394960;genbank:GeneID:5132682 Probab=98.31 E-value=1.4e-06 Score=52.64 Aligned_cols=432 Identities=13% Similarity=0.092 Sum_probs=211.8 Q ss_pred CCcCCccccceEE--Eeeeeeccccc---ccccceeEEec-Cccee---eeeeccHHHHHHhcCCChHHHHHHHHHhhcc Q lcl|NC_017984. 1 MQFNSIPASNIAA--VYPAVIGGGGN---PLGLNTNLFVQ-DAIYP---NYEYFSNTLVGQHYGLESPIYKFATVYFNGF 71 (487) Q Consensus 1 ~~~~~ip~s~iV~--V~~~~~~~~~~---~~~~~~ll~~~-~~~~~---~~~y~s~~~V~~~fg~~s~ey~aA~~yF~g~ 71 (487) |+..--|--+++. |.+...+-+.. ....+.+.|.+ ...-| +..+++-+|.-+-||.+. 
--.+.+..|..+ T Consensus 1 ~a~~~~~~~~~~~pgv~~~~~~~~~~~~~~~~~~~~~~~g~a~~G~~~~~~~~~~~~~~~~~~~~g~-l~~~~~~a~~~~ 79 (587) T protein:vir:95 1 MAVEPFPRRPITRPHASIEVDTSGIGGSAGSSEKVFCLIGQAEGGEPNTVYELRNYSQAKRLFRSGE-LLDAIELAWGSN 79 (587) T ss_pred CcccccCCcccccCceEEEEecCCccccCCCCCceEEEEEEeCCCCCceeEEeccHHHHHHHhcCcc-hHHHHHHHhccc Confidence 8865557777766 55555433332 23344444443 33333 345899999999999864 445556666522 Q ss_pred cCCccCCCEEEEEeeecccceeeEeeccccccc------hhhhe------eeeeEEEEE--------------------- Q lcl|NC_017984. 72 RNATTRPNSLFITKYNLTDVPASLIGGDITSTT------LADLK------LINGTLTIV--------------------- 118 (487) Q Consensus 72 ~~q~p~P~~l~igr~~~~~~~~~l~g~~~~~~~------~~~~~------~~~g~~~it--------------------- 118 (487) . ...++++|+-|- ..+.++.+.-+.+.-.. -..++ .+.++..++ T Consensus 80 ~--~~g~~~~~~~rv-~~~~~a~~~~~~l~~~a~~~G~~gN~i~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~v~si 156 (587) T protein:vir:95 80 P--NYTAGRILAMRI-EDAKPASAEIGGLKITSKIYGNVANNIQVGLEKNTLSDSLRLRVIFQDDRFNEVYDNIGNIFTI 156 (587) T ss_pred c--CCCceEEEEEEc-CCCceeEEEecCeEEEEecccccccceEEEEecCCCCCceeEEEEEecccceeeeeeccceeee Confidence 2 236688999885 44445544333222000 00000 012222111 Q ss_pred -EccceE--------------------------EEEeeccccCchHHHHHhhhhhhe------ee----------EEEec Q lcl|NC_017984. 119 -VDGVSK--------------------------SVPVDLATANSYSDAAALIATALT------LP----------CTYES 155 (487) Q Consensus 119 -i~g~~~--------------------------~~~i~~s~ats~~~vA~~i~t~l~------a~----------vt~d~ 155 (487) -.|... ...+.|.... ...+..+...+. +. +++-. T Consensus 157 ~y~g~~~~~~~~v~~~~~t~~a~~~~l~~g~~~v~~yrL~~g~--~~~~~~~~~~in~~~~~tAky~g~~~~~i~~~~~~ 234 (587) T protein:vir:95 157 KYKGEEANATFSVEHDEETQKASRLVLKVGDQEVKSYDLTGGA--YDYTNAIITDINQLPDFEAKLSPFGDKNLESSKLD 234 (587) T ss_pred eeeccccccceeeeecccceeeeeeeeecCCceEEEEEecCCc--hHHHHHHHHhhccccceEEEEecccCceeEEeecC Confidence 111110 0001111100 001111111110 11 11101 Q ss_pred ccceEEEEeccc-----ccce--------eEE--ecccc-------------h------hhhhhhcccc-ceeEecCcc- Q lcl|NC_017984. 
156 TVKGFVIKSGTS-----GANS--------TIS--FATGD-------------I------SDDLKLTQET-GAVLNNHTA- 199 (487) Q Consensus 156 ~~~~f~its~t~-----g~~s--------tit--~atgd-------------~------a~~l~lt~~~-gA~~~~G~a- 199 (487) ...-|.++.... ..+. .+. ...++ . ....+..... ......|.+ T Consensus 235 ~~~~~~v~~~~~~v~a~~~d~~~~~~~~~~v~~~~~~g~~~~~~~~~~~~~~~~a~~~~~~~~~~~a~~~~t~LtGG~dG 314 (587) T protein:vir:95 235 KIENANIKDKAVYVKAVFGDLEKQTAYNGIVSFEQLNAEGEVPSNVEVEAGEESATVTATSPIKTIEPFELTKLKGGTNG 314 (587) T ss_pred cccccceehhhhhhhhhhcceeeeeeceeeeeeecccccceeccchhhhhcccchheeccccccceeccceeeeecCCCC Confidence 111111111000 0000 000 00000 0 0000000000 111333443 Q ss_pred --cccHHHHHHHHHhcccceeEEEEEeccCChhHHHHHHHHHhcc---CceEEEEEccccccccccchHHHHHHHhCCcc Q lcl|NC_017984. 200 --ADTPTTGALNALAFSQNFVNITYSEGVFNEDALKDLALWVTSQ---NSRFKLYTWGLDPVALGQSGASFGEWAKENTS 274 (487) Q Consensus 200 --aet~~~al~a~~~~~~~wy~~~~~~~~~~~~~i~a~A~w~~a~---~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~ 274 (487) +.+..++++++..+ +|..+...+ .+...+.++..|++.. .++...+...... .+.......-..-++. T Consensus 315 ~~~~~y~~~l~ale~~--~~~~i~~~t--~d~~v~a~l~a~vk~~~~~g~~~~aVvg~~~~---~~~~~~~~~a~~~n~e 387 (587) T protein:vir:95 315 EPPATWADKLDKFAHE--GGYYIVPLS--SKQSVHAEVASFVKERSDAGEPMRAIVGGGFN---ESKEQLFGRQESLSNP 387 (587) T ss_pred CCcccHHHHHHHHHhC--CcEEEEecC--CCHHHHHHHHHHHHHHHhCCCcEEEEEcCCCC---CCHHHHHHHHhhcCCC Confidence 44567888888775 455554322 2334445689998642 2333333322111 1111122222233666 Q ss_pred eEEEecCC---------------CchHHHHHHHHHhcCcCcCCceeeeeeeecC--cccccCCCHHHHHHHHhCCceEEE Q lcl|NC_017984. 275 GVVPLYGT---------------FDKAAFFCGVSGSINYQEENGRTTTAFRSQD--GLVPDVTNEADAETLVKNGYSFYG 337 (487) Q Consensus 275 ~t~~~y~~---------------~~~~a~~~g~~as~~~~~~~gs~T~~fk~l~--Gv~~~~lt~t~~~al~~~~~n~y~ 337 (487) |...+... ...++++.|..++.+..+ ++| ||.++ ++. ..++.+|++.+..+|++.+. 
T Consensus 388 rvi~v~~~~~~~~~dg~~~~~~~~~~aa~vAGl~Ag~~~~~---SlT--~~~i~~~~v~-~~~t~~e~e~ai~~Gvl~l~ 461 (587) T protein:vir:95 388 RVSLVANSGTFVMDDGRKNHVPAYMVAVALGGLASGLEIGE---SIT--FKPLRVSSLD-QIYESIDLDELNENGIISIE 461 (587) T ss_pred cEEEecccceEecCCCceeeechHHHHHHHHHHHhcCchhc---Ccc--ceeeeccccc-ccCCHHHHHHHHhCCeEEEE Confidence 66543221 112345566667665443 333 34444 444 47899999999999999887 Q ss_pred EecCCCce-EEEEECCEEc----CCcee--hHHHHHHHHHHHHHHHHHHHHHHhcCCCCcCHhhHHHHHHHHHHHHHHHH Q lcl|NC_017984. 338 AWATANDR-FQFAGNGSVT----GQYKW--IDNFDFQVFLRTQLQLAYMNMFQAQKTIPYNDQGIATVRAYSQDPIDQGI 410 (487) Q Consensus 338 ~~~~~~~~-~~~~~~G~~s----gg~~~--iD~~~~~dWl~~~iq~~l~~ll~~~~kIPyt~~G~~~l~~~v~~vl~~a~ 410 (487) ...++... ..+. ++..+ .++.| |-.++-.|.+.+.++..+-+.+.- | |=++.|...|++.++..|++-. T Consensus 462 ~~~~~~~~~vriv-~~itT~t~~~d~~~~~i~viRv~D~i~~dir~~~~~~~iG--k-~nn~~~r~~v~~~i~~~L~~l~ 537 (587) T protein:vir:95 462 FVRNRTNTFFRIV-DDVTTFNDKSDPVKAEMAVGEANDFLVSELKVQLEDQFIG--T-RTINTSASIIKDFIQSYLGRKK 537 (587) T ss_pred EecCCcceEEEEe-ecceeccCCCCcchhhhhhhhhHHHHHHHHHHHHHhhCCc--c-ccchHHHHHHHHHHHHHHHHHH Confidence 65554322 2222 23222 23334 778999999999998887766654 4 5688999999999999999999 Q ss_pred hcCccccCcccCccccccccccccCccccceeeeeeEEeccCCCHHHHhhcccCCeEEEEEECCeEEEEEEEEEeeC Q lcl|NC_017984. 411 NFGGIRAGVNLSNAQKFQVNQEAGFDAASQLFTKGWALSVTLPDSQTRVARESFIIKLFYTDGSSMQRLEMTATNVQ 487 (487) Q Consensus 411 ~nG~Ia~Gv~l~~~q~~~~~~~~g~~~~~~~~~~Gy~~~~~~~s~~dra~R~~~~i~~~~~~aGAIh~v~i~gt~vq 487 (487) +.|.|.-.. .. ++ .+. 
...|+ --++|.+...-++++|.++.+.-| T Consensus 538 ~~gaI~~~~-~~-----------------dv-------~v~--~~~d~-----~~v~~~v~Pv~~mekI~vt~~~~~ 582 (587) T protein:vir:95 538 RDNEIQDFP-AE-----------------DV-------QVI--VEGNE-----ARISMTVYPIRSFKKISVSLVYKQ 582 (587) T ss_pred hCCcccCCC-cc-----------------ce-------EEE--ecCCE-----EEEEEEEEEcccceEEEEEEEEee Confidence 999995321 00 00 010 01121 247888999999999999999988 No 22 >protein:vir:102359 Length: 356 # NCBI annotation: XkdK protein # Family: family:all:632 # MgeID: mge:1566 # MgeName: phi CD119 # Cross-refs: genbank:acc:YP_529565;genbank:gi:90592650;genbank:GeneID:3974491 Probab=98.21 E-value=2.5e-06 Score=51.29 Aligned_cols=331 Identities=11% Similarity=0.084 Sum_probs=157.0 Q ss_pred ceeeEeeccccccchhhhe---eeeeEEEEE-EccceEEEEeeccccCchHHHHHhhhhhheeeEEEecccceEEEEecc Q lcl|NC_017984. 91 VPASLIGGDITSTTLADLK---LINGTLTIV-VDGVSKSVPVDLATANSYSDAAALIATALTLPCTYESTVKGFVIKSGT 166 (487) Q Consensus 91 ~~~~l~g~~~~~~~~~~~~---~~~g~~~it-i~g~~~~~~i~~s~ats~~~vA~~i~t~l~a~vt~d~~~~~f~its~t 166 (487) .++. -+-.+.-..++..+ ..-|-+-+. .|.+.... .+...+ .+-......... .....| .+.+ T Consensus 1 ~~gl-p~i~i~f~~~a~ta~~~g~rGiv~~il~d~~~~~~--~~~~~~---~v~~~~~~~n~~-----~i~~~~--~g~~ 67 (356) T protein:vir:10 1 MAGL-VNINIEFKELATSFIQRSKAGIVAIILKDTTKMYK--ELTSED---DIPISLSADNKK-----YIKYGF--VGAT 67 (356) T ss_pred CCCC-CceeEEEeecceeeccCCccceEEEEEecCCccee--EEeccc---cchhHHHHHHHH-----HHHHHh--hccc Confidence 0000 01111111111111 111322222 12111100 011111 110000000000 000000 0000 Q ss_pred cccceeEEecccchhhhhhhccccceeEecCcccccHHHHHHHHHhcccceeEEEEEeccCChhHHHHHHHHHhc----c Q lcl|NC_017984. 167 SGANSTISFATGDISDDLKLTQETGAVLNNHTAADTPTTGALNALAFSQNFVNITYSEGVFNEDALKDLALWVTS----Q 242 (487) Q Consensus 167 ~g~~stit~atgd~a~~l~lt~~~gA~~~~G~aaet~~~al~a~~~~~~~wy~~~~~~~~~~~~~i~a~A~w~~a----~ 242 (487) .+ ...+ ....+....+...++..++|++++.+..+| |.+.. .+++....++.|+.. . 
T Consensus 68 ~~---~~~~------------~p~~~~~~~~~t~~~y~~aL~~le~~~fn~--l~~~~--~d~~~~~~~~a~ikr~r~~~ 128 (356) T protein:vir:10 68 DN---EKVL------------RPSKVIISTFTEDGKVEDILEELESVEFNY--LCMPE--AIEAEKTKIVTWIKKIREEE 128 (356) T ss_pred cc---cccc------------cceeeeeecccCchhHHHHHHHhcCccceE--EEecC--CChHHHHHHHHHHHHHHhcC Confidence 00 0000 000011111234567899999998776565 44432 345666789999984 3 Q ss_pred CceEEEEEccccccccccchHHHHHHH-hCCcceEEEecCCCchHHHHHHHHHhcCcCcCCceeeeeeeecCcccc-cCC Q lcl|NC_017984. 243 NSRFKLYTWGLDPVALGQSGASFGEWA-KENTSGVVPLYGTFDKAAFFCGVSGSINYQEENGRTTTAFRSQDGLVP-DVT 320 (487) Q Consensus 243 ~~~~~~~~~~~~~~~~~~~~~~~~~l~-~~~~~~t~~~y~~~~~~a~~~g~~as~~~~~~~gs~T~~fk~l~Gv~~-~~l 320 (487) .+++..+........ .+.+. .++..-.-..|.....++.+.|..|+... +.|+| |+.++++.. +.+ T Consensus 129 ~~~~~~V~~~~~aD~-------EgIInv~n~~~~~g~~~t~~~~~~~vAG~~Ag~~~---n~S~T--~~~~~~~~~~~~~ 196 (356) T protein:vir:10 129 STEAKAVLANIKADN-------EAIINFTENVVVDGEEITAEKYTTRVASLIASTPN---TQSIT--YAPLDEVESIVKI 196 (356) T ss_pred CcEEEEEecCCCCCC-------ceeEEeecCeEecceeechhHHHHHHHHHHhccch---hcccc--ceecCCccccccC Confidence 455655543322111 11110 01100000122222334456666776643 33443 557777654 358 Q ss_pred CHHHHHHHHhCCceEEEEecCCCceEEEEECCEEc----CC-----ceehHHHHHHHHHHHHHHHHHHHHHHhcCCCCcC Q lcl|NC_017984. 321 NEADAETLVKNGYSFYGAWATANDRFQFAGNGSVT----GQ-----YKWIDNFDFQVFLRTQLQLAYMNMFQAQKTIPYN 391 (487) Q Consensus 321 t~t~~~al~~~~~n~y~~~~~~~~~~~~~~~G~~s----gg-----~~~iD~~~~~dWl~~~iq~~l~~ll~~~~kIPyt 391 (487) +.+|++.+.++|.-.+..-++ ... +.+|.-| +. 
+.-|-+++.+|-+.+.++..+-+.++ +|+|=+ T Consensus 197 t~~e~~~ai~~G~lvl~~d~~---~V~-I~~~VNSltt~t~~k~~~f~Kirvvr~~D~i~~Di~~~f~~~yi--GKv~N~ 270 (356) T protein:vir:10 197 DKASADAKVQAGELILRRLSG---KIR-IARGINSLTTLTAEKGEIFQKIKLVDTKDLISKDIKNIYVEKYL--RKCPNT 270 (356) T ss_pred CHHHHHHHHhCCeEEEEEEcC---eEE-EEecCccceecCCCCCcchhhhHHHHHHHHHHHHHHHHHhhccc--cccCCC Confidence 899999999999887754322 233 3445311 22 11277777777777766653322222 799999 Q ss_pred HhhHHHHHHHHHHHHHHHHhcCccccCcccC---ccccccccccccCccccceeeeeeEEeccCCCHHHHhh---cccCC Q lcl|NC_017984. 392 DQGIATVRAYSQDPIDQGINFGGIRAGVNLS---NAQKFQVNQEAGFDAASQLFTKGWALSVTLPDSQTRVA---RESFI 465 (487) Q Consensus 392 ~~G~~~l~~~v~~vl~~a~~nG~Ia~Gv~l~---~~q~~~~~~~~g~~~~~~~~~~Gy~~~~~~~s~~dra~---R~~~~ 465 (487) .+|..++.+.++.-+++..+.|.|.++.+.. +.|+.|.. ..|.+ .+++++.+... ...-- T Consensus 271 ~dgr~~l~~ai~~y~~~L~~~~~I~~~~~~eid~e~q~~~~~-~~g~d-------------~~~~~d~~v~~~~~~~~v~ 336 (356) T protein:vir:10 271 YDNKCLFIVAVQSYLTELAKQELIDSNFTVEIDLEKQKEYLE-GKKIA-------------VSKMKENEIKEANTGSNGF 336 (356) T ss_pred HHHHHHHHHHHHHHHHHHHhCCccccCceeEecccchHHHhh-hcccc-------------ccccccceeecccCCcEEE Confidence 9999999999999999999999998764311 33444433 12222 12222222111 12223 Q ss_pred eEEEEEECCeEEEEEEEEEe Q lcl|NC_017984. 466 IKLFYTDGSSMQRLEMTATN 485 (487) Q Consensus 466 i~~~~~~aGAIh~v~i~gt~ 485 (487) +++..+.-.|+..+.++..+ T Consensus 337 ~~~~v~~vdamE~iy~ti~v 356 (356) T protein:vir:10 337 YLINLKLVDAMEDINIRVQM 356 (356) T ss_pred EEEEEEEEeeeeeEEeEEeC Confidence 77888999999999999887 No 23 >protein:vir:107865 Length: 477 # NCBI annotation: gp39 # Family: family:all:115 # MgeID: mge:1565 # MgeName: BcepMu # Cross-refs: genbank:acc:YP_024712;genbank:gi:48696949;genbank:GeneID:2845953 Probab=98.03 E-value=6.4e-06 Score=49.06 Aligned_cols=420 Identities=12% Similarity=0.038 Sum_probs=178.6 Q ss_pred CCcCCccccceEEEeeeeeccccccccccee-EEecCcc----eeeeeeccHHHHHHhcC--CChHHHHHHHHHhhcccC Q lcl|NC_017984. 
1 MQFNSIPASNIAAVYPAVIGGGGNPLGLNTN-LFVQDAI----YPNYEYFSNTLVGQHYG--LESPIYKFATVYFNGFRN 73 (487) Q Consensus 1 ~~~~~ip~s~iV~V~~~~~~~~~~~~~~~~l-l~~~~~~----~~~~~y~s~~~V~~~fg--~~s~ey~aA~~yF~g~~~ 73 (487) ||.+-.|==.+..+..+.. .....+-+.. |+...+. .|+. -+|..|...++| .....+.+...||. T Consensus 1 M~~~~~pGVyv~E~~~~~~--~i~~v~T~v~~~VG~a~~gp~n~pv~-its~~d~~~~g~~~~~~tL~~Av~~~f~---- 73 (477) T protein:vir:10 1 MAANYLHGVETIEKETGSR--PVKVVKSAVIGLIGTAPIGPVNTPVQ-SLSDVDAAQFGPQLAGFTIPQALDAVYD---- 73 (477) T ss_pred CcccCCCCeEEEEccCCcc--cccccCCceeEEEecccCCCCCcCEE-EccHHHHHHhccCCCCCcHHHHHHHHHh---- Confidence 8855455422332322222 1222222222 2222222 2333 456556555433 34678889999997 Q ss_pred CccCCCEEEEEeeecccceee-EeeccccccchhhheeeeeEEEEEEccceEEEEeecc-ccCchHHHHHhhhhhheeeE Q lcl|NC_017984. 74 ATTRPNSLFITKYNLTDVPAS-LIGGDITSTTLADLKLINGTLTIVVDGVSKSVPVDLA-TANSYSDAAALIATALTLPC 151 (487) Q Consensus 74 q~p~P~~l~igr~~~~~~~~~-l~g~~~~~~~~~~~~~~~g~~~iti~g~~~~~~i~~s-~ats~~~vA~~i~t~l~a~v 151 (487) +. ...+++-|-........ +............ ... ...+...........+.. ................ T Consensus 74 nG--g~~~~vVrV~~~~~~~~~~~~~~~~~~~~~~--~~~-~~~~~~~~~~v~~~a~~~~~~~~~~~~~~~~~~~~---- 144 (477) T protein:vir:10 74 YG--SGTVIVINVLDPAVHKSNAANEPVTFDAATG--RAK-LAHPAAANLVLKNDSGGTTYAEGTDYAVDLINGVI---- 144 (477) T ss_pred cc--ceEEEEEecCccccccccccccccccccccc--eec-ccccccccccccccccccccccchhhhhhhccccc---- Confidence 33 35567766543221111 0000000000000 000 000000000000000000 0000001111111000 Q ss_pred EEecccceEEEEeccccc-ceeEEecccchhhhhhhccccceeEecCcccccHHHHHHHHHhcc---cceeEEEEE-ecc Q lcl|NC_017984. 152 TYESTVKGFVIKSGTSGA-NSTISFATGDISDDLKLTQETGAVLNNHTAADTPTTGALNALAFS---QNFVNITYS-EGV 226 (487) Q Consensus 152 t~d~~~~~f~its~t~g~-~stit~atgd~a~~l~lt~~~gA~~~~G~aaet~~~al~a~~~~~---~~wy~~~~~-~~~ 226 (487) ..........+. ...+.+..++... ... .......+.....+.+.++.... ..-..++.. ... 
T Consensus 145 ------~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~---~~~~g~~~~~~~~tGl~al~~~~~~~~~~~~~l~apg~~ 212 (477) T protein:vir:10 145 ------TRIKTGTIPPGATAAKATYDYADPTK---VTA---ADIIGAVNAAGMRTGMKALKDTYNLYGYFSKILIAPAYC 212 (477) T ss_pred ------eecccccccccceeeeeccccccccc---ccc---ccccccccccchhhhhhhhhhhhhhcchhcccccccccc Confidence 000000000000 0111111111000 000 00000001111111222222211 100011111 111 Q ss_pred CChhHHHHHHHHHhccCceEEEEEccccccccccchHHHHHHH--hC--CcceEEEecC------C-------CchHHHH Q lcl|NC_017984. 227 FNEDALKDLALWVTSQNSRFKLYTWGLDPVALGQSGASFGEWA--KE--NTSGVVPLYG------T-------FDKAAFF 289 (487) Q Consensus 227 ~~~~~i~a~A~w~~a~~~~~~~~~~~~~~~~~~~~~~~~~~l~--~~--~~~~t~~~y~------~-------~~~~a~~ 289 (487) .......++...++.- +.+.++-...+.. ............ .. +..+..+.|. . -..+..+ T Consensus 213 ~~~~v~~~l~~~~~~~-~~~~~~d~p~~~~-~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~~ 290 (477) T protein:vir:10 213 TQNSVSVELEAMAVQL-GAIAYIDAPIGTT-LAQALAGRGPAGTINFNTSSDRVRLCYPHVKVYDTATNAERLEPLSSRA 290 (477) T ss_pred cchhhHHHHHHHHhhC-CEEEEEecCCCCC-HHHHHhhhhhccccccccccceEEEEcCeEEEecccCCceeEEchHHHH Confidence 1122223344444422 2232221111100 000000001010 01 1222332221 0 1235666 Q ss_pred HHHHHhcCcCcCCceeeeeeeecCcccc---c-----CCCHHHHHHHHhCCceEEEEecCCCceEEEEECCEEcCC---- Q lcl|NC_017984. 290 CGVSGSINYQEENGRTTTAFRSQDGLVP---D-----VTNEADAETLVKNGYSFYGAWATANDRFQFAGNGSVTGQ---- 357 (487) Q Consensus 290 ~g~~as~~~~~~~gs~T~~fk~l~Gv~~---~-----~lt~t~~~al~~~~~n~y~~~~~~~~~~~~~~~G~~sgg---- 357 (487) +|..+.++-.+. =.....+|.+.||.. . ..+++|.+.|.++++|++..+.+.+ +.+|-.-++++. 
T Consensus 291 ag~~a~~d~~~g-~~~span~~~~gi~~~~~~~~~~~~~~~~~~~~L~~~gi~~i~~~~~~G--~~~wG~rT~~~~~~~~ 367 (477) T protein:vir:10 291 AGLRARVDLDKG-YWWSSSNQQLVGVTGVERPLSAMIDDPQSDVNMLNEQGITTVFSSYGSG--LRLWGNRTAAWPTVTH 367 (477) T ss_pred HHHHHHhhhcCC-ceeccCCceeccccccccccccccCCChhhHHHHhhCCceEEEEecCCc--EEEEcccccCCCCCCc Confidence 777777763221 122334455555532 2 2367899999999999999987655 445543333221 Q ss_pred -ceehHHHHHHHHHHHHHHHHHHHHHHhcCCCCcCHhhHHHHHHHHHHHHHHHHhcCccccCcccCccccccccccccCc Q lcl|NC_017984. 358 -YKWIDNFDFQVFLRTQLQLAYMNMFQAQKTIPYNDQGIATVRAYSQDPIDQGINFGGIRAGVNLSNAQKFQVNQEAGFD 436 (487) Q Consensus 358 -~~~iD~~~~~dWl~~~iq~~l~~ll~~~~kIPyt~~G~~~l~~~v~~vl~~a~~nG~Ia~Gv~l~~~q~~~~~~~~g~~ 436 (487) +.||-+.+-.+|+...|+..+...+-. |.|..=...|+..|+.-|+..++.|.|.. T Consensus 368 ~~~~~~vrR~~~~i~~~~~~~~~~~v~~----~~~~~~~~~i~~~i~~~l~~l~~~g~l~g------------------- 424 (477) T protein:vir:10 368 MRNFENVRRTGDVINESLRYFSQQFVDA----PIDQGLIDSLVESVNGFGRKLIGDGALLG------------------- 424 (477) T ss_pred ccceeehhhHHHHHHHHHHHHHHHhccC----CCCHHHHHHHHHHHHHHHHHHHhCCceee------------------- Confidence 237888899999999888888764432 55777889999999999999999998852 Q ss_pred cccceeeeeeEE--eccCCCHHHHhhcccCCeEEEEEECCeEEEEEEEEEeeC Q lcl|NC_017984. 437 AASQLFTKGWAL--SVTLPDSQTRVARESFIIKLFYTDGSSMQRLEMTATNVQ 487 (487) Q Consensus 437 ~~~~~~~~Gy~~--~~~~~s~~dra~R~~~~i~~~~~~aGAIh~v~i~gt~vq 487 (487) |++ ..+..|++|+.+.+.. +.+.+.....+++|.+.... 
| T Consensus 425 ---------~~v~~~~~~nt~~~i~~G~~~-~~i~~~p~~p~e~i~~~~~~-~ 466 (477) T protein:vir:10 425 ---------FKAWFDPARNPKEELAAGHLL-INYKYTVPPPLERLTYETEI-T 466 (477) T ss_pred ---------eEEEEecCCCCHHHhhCCeEE-EEEEEEecCCcceEEEEEEE-c Confidence 233 4566789999998884 99999999999999887532 3 No 24 >protein:vir:80779 Length: 569 # NCBI annotation: putative tail sheath protein # Family: family:all:2449 # MgeID: mge:1885 # MgeName: phiEF24C # Cross-refs: genbank:acc:YP_001504132;genbank:gi:158079319;genbank:GeneID:5666428 Probab=98.00 E-value=7.4e-06 Score=48.71 Aligned_cols=430 Identities=12% Similarity=0.072 Sum_probs=211.4 Q ss_pred CCcCCccccceEE--Eeeeeeccc---ccccccceeEEec-Cccee---eeeeccHHHHHHhcCCChHHHHHHHHHhhcc Q lcl|NC_017984. 1 MQFNSIPASNIAA--VYPAVIGGG---GNPLGLNTNLFVQ-DAIYP---NYEYFSNTLVGQHYGLESPIYKFATVYFNGF 71 (487) Q Consensus 1 ~~~~~ip~s~iV~--V~~~~~~~~---~~~~~~~~ll~~~-~~~~~---~~~y~s~~~V~~~fg~~s~ey~aA~~yF~g~ 71 (487) |+..=.|--+++. |.+....-+ ....+.+.+.|.+ ...-| +..+++.++.-+-||.+. --.+..++|... T Consensus 1 ~~~~~~~~~~~~~Pgv~~~~~~~~~~~~~~~~~~~~~~ig~a~~G~~~~~~~~~~~~~~~~~f~~g~-l~~a~~~a~~~~ 79 (569) T protein:vir:80 1 MAVEQFPRKKVSRPHTEITVDTSGIGGSSSSSDKTLMLVGSAKGGKPDTVYRFRNYQQAKQVLRSGD-LLDAIELAWNAS 79 (569) T ss_pred CeeeeecCCccccCceEEEEecCCCcCCCCCCceeEEEEEEeCCCCCceeEEecCHHHHHHHhcCCc-hhHHHHhhccCc Confidence 8887788888876 444443222 2233445555443 33333 345889999999998863 666778888644 Q ss_pred cCCccCCCEEEEEeeecccceeeEeecccccc--------chhhhe----eeeeEEE----------------------E Q lcl|NC_017984. 72 RNATTRPNSLFITKYNLTDVPASLIGGDITST--------TLADLK----LINGTLT----------------------I 117 (487) Q Consensus 72 ~~q~p~P~~l~igr~~~~~~~~~l~g~~~~~~--------~~~~~~----~~~g~~~----------------------i 117 (487) ......|+++|+-|-.. +.++.+.-+.+... ...... ...|+.. 
| T Consensus 80 ~~~~~~~~~~~~~rv~~-a~~a~~~~~~~~~~a~~~g~~~n~i~v~l~~~~~~~~~~~~v~~~~~~~~~~~~~ig~v~si 158 (569) T protein:vir:80 80 DVNTASAGDILAVRVED-AKNATLTKGGLTFASTIYGVDANEIQVALEDNNLTHTKRLTVAFSKDGYKKVFDNLGKIFSI 158 (569) T ss_pred cccccCceEEEEEEcCC-CeeeeeeccceeeeeeeccCCCceEEEEEecCcCCcceeeEEeeecCCCccccccccceeeE Confidence 43455678888887633 22232221111100 000000 0011111 1 Q ss_pred EEccceEEEE--------------eecc---------------ccCchHHHHHhhhhhheeeEEEecccceEEEEecccc Q lcl|NC_017984. 118 VVDGVSKSVP--------------VDLA---------------TANSYSDAAALIATALTLPCTYESTVKGFVIKSGTSG 168 (487) Q Consensus 118 ti~g~~~~~~--------------i~~s---------------~ats~~~vA~~i~t~l~a~vt~d~~~~~f~its~t~g 168 (487) +..|+..... +.+- ...+...++..+.+++.. ...|..+....+ T Consensus 159 ~ytg~~~~a~~~~~~~~~~~~a~~l~~~~g~~~~~~~~v~~~~~~~~~~~~~~~lv~~~~~-------~~~f~a~~~~~~ 231 (569) T protein:vir:80 159 QYKGSEAQANFTIAQDSISKKATTLTLNVGSEPESTTEVMKYELGQGVYSETNVLVSAINS-------LPDWEAKFFPIG 231 (569) T ss_pred EEeeccccceEEeecCcCcceeEEEEEEecCCcceeEEEEeeccCCccchhhhhhhhhcCC-------ccCceEEEEecC Confidence 1111100000 0000 000000111112222211 001111110000 Q ss_pred ----------cc---------eeEEecccchhhhhh--------------hccccceeEecCc---ccccHHHHHHHHHh Q lcl|NC_017984. 169 ----------AN---------STISFATGDISDDLK--------------LTQETGAVLNNHT---AADTPTTGALNALA 212 (487) Q Consensus 169 ----------~~---------stit~atgd~a~~l~--------------lt~~~gA~~~~G~---aaet~~~al~a~~~ 212 (487) .. ..++...+++...+. +....+.....|. ..++..+++++++. T Consensus 232 ~~~~~~~~~d~~~~~~~~t~~~~~~~~~~di~~~~~~~~~v~~~~~~~~~l~~~~~~~LtGG~dG~~~~~~~~~l~~le~ 311 (569) T protein:vir:80 232 DKNLPTDALEAVTKVDVKTEAVFVGALAGDIAKQLEYNDYVTVAVDATKPVEDFELTNLTGGSDGTAPESWANKFPLLAN 311 (569) T ss_pred CCcceehhccchhheeccccceeeehhHHHHHHhhcCCceEEEEecCCcceeeecceeecCCCCCCccchHHHHHHHHhh Confidence 00 001111112111110 0001111122233 34456778888876 Q ss_pred cccceeEEEEEeccCChhHHHHHHHHHhcc---CceEEEEEccccccccccchHHHHHHHhCCcceEEEecCC------- Q lcl|NC_017984. 
213 FSQNFVNITYSEGVFNEDALKDLALWVTSQ---NSRFKLYTWGLDPVALGQSGASFGEWAKENTSGVVPLYGT------- 282 (487) Q Consensus 213 ~~~~wy~~~~~~~~~~~~~i~a~A~w~~a~---~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~t~~~y~~------- 282 (487) +. |..+.... .+.....++..|++.. .++.+.+...... .+..........-++.|...++.. T Consensus 312 ~~--~~~i~~~t--~d~av~~~l~a~vkr~r~~g~~~~aVvg~~~~---~~~~~~~~~a~~~n~e~vv~v~~~~~~~~~~ 384 (569) T protein:vir:80 312 EG--GYYLVPLT--DKQAVHSEALAFVKDRTDNGDPMRIIVGGGTN---ETVEESITRATNLRDPRASLVGFSGTRKMDD 384 (569) T ss_pred CC--cEEEEecC--CChHHHHHHHHHHHHHHhCCCcEEEEecCCCC---CCHHHHHHHHhhcCCCeEEEEecCceeecCC Confidence 54 44443322 2344456789999853 2333333322111 111122222223366666554321 Q ss_pred ----Cch----HHHHHHHHHhcCcCcCCceeeeeeeecCcccc-cCCCHHHHHHHHhCCceEEEEecCCCceE-EEEECC Q lcl|NC_017984. 283 ----FDK----AAFFCGVSGSINYQEENGRTTTAFRSQDGLVP-DVTNEADAETLVKNGYSFYGAWATANDRF-QFAGNG 352 (487) Q Consensus 283 ----~~~----~a~~~g~~as~~~~~~~gs~T~~fk~l~Gv~~-~~lt~t~~~al~~~~~n~y~~~~~~~~~~-~~~~~G 352 (487) .+. ++.+.|..++.++. -++| ||.++++.. ..++.+|++.+.++|++.+....+..... +.. ++ T Consensus 385 g~~~~~~~~~~aa~vAG~~A~~~~~---~S~T--~k~i~~~~i~~~lt~~e~~~li~~G~~~l~~~~~~~~~v~~~v-n~ 458 (569) T protein:vir:80 385 GRLLKLPGYMMASQIAGIASGLEVG---EAIT--FKHFNVTSVDRVFESSQLDMLNESGVISIEFVRNRTLTAFRVV-QD 458 (569) T ss_pred CcceeechhhHHHHHHHHHhcCccc---cCcc--ceeeccccccccCCHHHHHHHHhCCeEEEEEecCceEEEEEEe-cc Confidence 122 33445555655433 3444 455554332 36899999999999999997665543221 222 23 Q ss_pred EEc----CCce--ehHHHHHHHHHHHHHHHHHHHHHHhcCCCCcCHhhHHHHHHHHHHHHHHHHhcCccccCcccCcccc Q lcl|NC_017984. 353 SVT----GQYK--WIDNFDFQVFLRTQLQLAYMNMFQAQKTIPYNDQGIATVRAYSQDPIDQGINFGGIRAGVNLSNAQK 426 (487) Q Consensus 353 ~~s----gg~~--~iD~~~~~dWl~~~iq~~l~~ll~~~~kIPyt~~G~~~l~~~v~~vl~~a~~nG~Ia~Gv~l~~~q~ 426 (487) ..+ .++. +|-.++-.|.+.+.|+..+-+.+.- | |=++.|...|++.++..|++-.+.|.|.- .+ +. 
T Consensus 459 itT~t~~~~~~~~~i~viRv~D~i~~dir~~~~~~yiG--k-~nn~~~r~~v~~~i~~~L~~l~~~gaI~~-~~--~~-- 530 (569) T protein:vir:80 459 VTTYNDKSDPVKNEMSVGEANDFLVSELKIELDNNFIG--T-KVIDTSASLIKNFIQSFLDNKKRAREIQD-YT--PE-- 530 (569) T ss_pred ceecCCCCCchhhhhhhhHHHHHHHHHHHHHHHhhcCc--c-cCChhHHHHHHHHHHHHHHHHHhCCcccC-CC--cc-- Confidence 322 1222 4888888888888888777665543 4 67889999999999999999999999952 11 00 Q ss_pred ccccccccCccccceeeeeeEEeccCCCHHHHhhcccCCeEEEEEECCeEEEEEEEEEeeC Q lcl|NC_017984. 427 FQVNQEAGFDAASQLFTKGWALSVTLPDSQTRVARESFIIKLFYTDGSSMQRLEMTATNVQ 487 (487) Q Consensus 427 ~~~~~~~g~~~~~~~~~~Gy~~~~~~~s~~dra~R~~~~i~~~~~~aGAIh~v~i~gt~vq 487 (487) +++ +.+ ..| |. -+.|.+..--++++|.++.+.-| T Consensus 531 -------------dv~-----v~~----~~d---~~--~v~~~v~Pv~~~ekI~~ti~~~~ 564 (569) T protein:vir:80 531 -------------EVQ-----VVL----EGD---VA--SISMTVMPIRSLNKITVQLVYKQ 564 (569) T ss_pred -------------ceE-----EEe----cCC---EE--EEEEEEEEcccccEEEEEEEEee Confidence 000 001 111 22 37888999999999999999999 No 25 >protein:vir:79092 Length: 477 # NCBI annotation: gp13, phage tail sheath protein # Family: family:all:115 # MgeID: mge:1862 # MgeName: phiE255 # Cross-refs: genbank:acc:YP_001111213;genbank:gi:134288804;genbank:GeneID:4960766 Probab=97.97 E-value=8.7e-06 Score=48.33 Aligned_cols=413 Identities=11% Similarity=0.042 Sum_probs=179.0 Q ss_pred CCcCCccccceEEEeeeeecccccccccceeEEe-cCcceee---eeeccHHHHHHhcC--CChHHHHHHHHHhhcccCC Q lcl|NC_017984. 1 MQFNSIPASNIAAVYPAVIGGGGNPLGLNTNLFV-QDAIYPN---YEYFSNTLVGQHYG--LESPIYKFATVYFNGFRNA 74 (487) Q Consensus 1 ~~~~~ip~s~iV~V~~~~~~~~~~~~~~~~ll~~-~~~~~~~---~~y~s~~~V~~~fg--~~s~ey~aA~~yF~g~~~q 74 (487) |+-+-.|==.+..+..+.. +.....-....|. .....|+ ..-+|..|-...|| .+...+.+...||. 
+ T Consensus 1 M~~~~~pGVyv~E~~~g~~--~I~~v~Tsv~~~VG~a~~~p~n~pv~its~~d~~~~g~~~~~~tL~~Av~~~f~----n 74 (477) T protein:vir:79 1 MAANYLHGVETIEKETGSR--PVKVVKSAVIGLIGTAPIGPVNTPVQSLSDVDAAQFGPQLAGFTIPQALDAVYD----Y 74 (477) T ss_pred CcCCCCCCeEEEEecCCcc--cccccCCceEEEEeecccCCCcccEEEccHHHHHHhcCCCCCCcHHHHHHHHhh----c Confidence 8865556322222222221 2222222222222 2222222 13556666555544 34667888888886 3 Q ss_pred ccCCCEEEEEeeeccccee---eEeeccccccch--hhheeeeeEEEEEEccceEEEEeeccccCchHHH---HHhhhh- Q lcl|NC_017984. 75 TTRPNSLFITKYNLTDVPA---SLIGGDITSTTL--ADLKLINGTLTIVVDGVSKSVPVDLATANSYSDA---AALIAT- 145 (487) Q Consensus 75 ~p~P~~l~igr~~~~~~~~---~l~g~~~~~~~~--~~~~~~~g~~~iti~g~~~~~~i~~s~ats~~~v---A~~i~t- 145 (487) . -.++++-|-....... ....+....... .........+.+..+....... .........+ ...+.. T Consensus 75 g--g~~~~vvrV~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~ 150 (477) T protein:vir:79 75 G--SGTVIVINVLDPAVHKSNAASESVTFDAATGRAKLAHPAAANLVLKNDSGGTTYT--EGTDYAVDLINGVITRIKTG 150 (477) T ss_pred C--CceEEEEeccCCccccccccccccccccccccccccccccceeEEeecccccccc--cCccccccccchhhhhhhcc Confidence 2 2557776653221111 111111110000 0000111111111111100000 0000000000 000000 Q ss_pred hheeeEEEecccceEEEEecccccceeEEecccchhhhhhhccccceeEec---CcccccHHHHHHHHHhcccceeEEEE Q lcl|NC_017984. 146 ALTLPCTYESTVKGFVIKSGTSGANSTISFATGDISDDLKLTQETGAVLNN---HTAADTPTTGALNALAFSQNFVNITY 222 (487) Q Consensus 146 ~l~a~vt~d~~~~~f~its~t~g~~stit~atgd~a~~l~lt~~~gA~~~~---G~aaet~~~al~a~~~~~~~wy~~~~ 222 (487) ..+... ......+..++.. ....+.... .....+..+++.........--.++. 
T Consensus 151 ~~~~~~-----------------~~~~~~~~~~~~~------~~~~~~~~g~~~a~~~~tg~~al~~~~~~~~~~~~iv~ 207 (477) T protein:vir:79 151 TIPAAA-----------------TAAKATYDYADPT------KVTAADIIGAVNAAGMRTGMKALKDTYNLYGYFSKILI 207 (477) T ss_pred cccccc-----------------ceeeceeccCCcc------cceeeeecccccccccchhhhhhhhhhhhcccccceee Confidence 000000 0000000000000 000000000 00111112222222211111111111 Q ss_pred Eecc-CChhHHHHHHHHHhccCceEEEEEccccccccccchHHHHHHHh--C--CcceEEEecC------C-------Cc Q lcl|NC_017984. 223 SEGV-FNEDALKDLALWVTSQNSRFKLYTWGLDPVALGQSGASFGEWAK--E--NTSGVVPLYG------T-------FD 284 (487) Q Consensus 223 ~~~~-~~~~~i~a~A~w~~a~~~~~~~~~~~~~~~~~~~~~~~~~~l~~--~--~~~~t~~~y~------~-------~~ 284 (487) .... .......++...++.. +++..+....... .....+....+.+ . +..+..+.|. . -. T Consensus 208 apg~~~~~~v~~~l~~~~~~~-~~~a~~d~p~~~~-~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~~~~~~~~~~~p 285 (477) T protein:vir:79 208 APAYCTQNSVSVELEAMAVQL-GAIAYIDAPIGTT-LAQALAGRGPAGTINFNTSSDRVRLCYPHVKVYDIATNAERLEP 285 (477) T ss_pred ccccccchhHHHHHHHHHhhc-CeEEEEecCCCCC-hHHHhhhhhhccccccccccceEEEEcCeeEEecccCCceeeec Confidence 1111 1122233344444322 2222221111000 0000000000000 0 1122222221 0 12 Q ss_pred hHHHHHHHHHhcCcCcCCc-eeeeeeeecCcccc--------cCCCHHHHHHHHhCCceEEEEecCCCceEEEEECCEEc Q lcl|NC_017984. 285 KAAFFCGVSGSINYQEENG-RTTTAFRSQDGLVP--------DVTNEADAETLVKNGYSFYGAWATANDRFQFAGNGSVT 355 (487) Q Consensus 285 ~~a~~~g~~as~~~~~~~g-s~T~~fk~l~Gv~~--------~~lt~t~~~al~~~~~n~y~~~~~~~~~~~~~~~G~~s 355 (487) ....++|..+.++-+ .| ......|.+.||.. 
...+++|.+.|.++++|.+..+.+.+ +.+|- +.++ T Consensus 286 ~s~~~ag~~a~~d~~--~g~~~span~~~~gv~~~~~~~~~~~~~~~~~~~~L~~~~i~~i~~~~~~G--~~~wG-~rT~ 360 (477) T protein:vir:79 286 LSSRAAGLRARVDLD--KGYWWSSSNQQLVGVTGVERPLSAMIDDPQSDVNMLNEQGITTVFSSYGSG--LRLWG-NRTA 360 (477) T ss_pred hHHHHHHHHHHhhcc--CCceEccCCceeecceecccccccccCCChhhHHHHhhCCceEEEEecCCc--EEEEc-cccc Confidence 345667777776533 22 22334555666542 22357899999999999999887755 34553 3443 Q ss_pred C----C--ceehHHHHHHHHHHHHHHHHHHHHHHhcCCCCcCHhhHHHHHHHHHHHHHHHHhcCccccCcccCccccccc Q lcl|NC_017984. 356 G----Q--YKWIDNFDFQVFLRTQLQLAYMNMFQAQKTIPYNDQGIATVRAYSQDPIDQGINFGGIRAGVNLSNAQKFQV 429 (487) Q Consensus 356 g----g--~~~iD~~~~~dWl~~~iq~~l~~ll~~~~kIPyt~~G~~~l~~~v~~vl~~a~~nG~Ia~Gv~l~~~q~~~~ 429 (487) . . +.||-+.+-.+|+...|+..+..++-. |.|..=...|+..|+.-|++.++.|.|.. T Consensus 361 ~~~~~~~~~~~i~vrR~~~~i~~~~~~~~~~~v~e----~~~~~~~~~i~~~i~~~l~~l~~~g~l~g------------ 424 (477) T protein:vir:79 361 AWPTVTHMRNFENVRRTGDVINESLRYFSQQFVDA----PIDQGLIDSLVESVNGFGRKLIGDGALLG------------ 424 (477) T ss_pred CCCCCCccceeeehhhHHHHHHHHHHHHHHHhccC----CCCHHHHHHHHHHHHHHHHHHHhCCceee------------ Confidence 1 1 347888899999999999888765432 45777789999999999999999998852 Q ss_pred cccccCccccceeeeeeEE--eccCCCHHHHhhcccCCeEEEEEECCeEEEEEEEEEeeC Q lcl|NC_017984. 430 NQEAGFDAASQLFTKGWAL--SVTLPDSQTRVARESFIIKLFYTDGSSMQRLEMTATNVQ 487 (487) Q Consensus 430 ~~~~g~~~~~~~~~~Gy~~--~~~~~s~~dra~R~~~~i~~~~~~aGAIh~v~i~gt~vq 487 (487) |.+ ..+..+++|+.+.+. -+.+.+.....+++|.+.... 
+ T Consensus 425 ----------------~~v~~~~~~nt~~~i~~G~~-~~~i~~~p~~p~e~i~~~~~~-~ 466 (477) T protein:vir:79 425 ----------------FKAWFDPARNPKEELAAGHL-LINYKYTVPPPLERLTYETEI-T 466 (477) T ss_pred ----------------eEEEEecCCCCHHHhhCCeE-EEEEEEEecCCceeEEEEEEE-e Confidence 233 456788999998887 499999999999999887532 3 No 26 >protein:vir:63742 Length: 562 # NCBI annotation: Tsh # Family: family:all:2449 # MgeID: mge:1517 # MgeName: P100 # Cross-refs: genbank:gi:82547629;genbank:GeneID:3783475 Probab=97.66 E-value=3.2e-05 Score=45.24 Aligned_cols=434 Identities=11% Similarity=0.026 Sum_probs=208.7 Q ss_pred CCcCCccccceEE--Eeeeeecccc---cccccceeEEec-Cccee---eeeeccHHHHHHhcCCChHHHHHHHHHhhcc Q lcl|NC_017984. 1 MQFNSIPASNIAA--VYPAVIGGGG---NPLGLNTNLFVQ-DAIYP---NYEYFSNTLVGQHYGLESPIYKFATVYFNGF 71 (487) Q Consensus 1 ~~~~~ip~s~iV~--V~~~~~~~~~---~~~~~~~ll~~~-~~~~~---~~~y~s~~~V~~~fg~~s~ey~aA~~yF~g~ 71 (487) |+..=-|..+++. |.+....-+. ...+.+.+.|.+ ...-| +..+++-+|.-+-||.+. .-.+..++|..+ T Consensus 1 ~~~~~~~~~~~~~Pgv~~~~~~s~~~~~~~~~~~~~~~iG~a~~G~~~~~~~~~~~~~~~~~fg~g~-l~~~i~~a~~~~ 79 (562) T protein:vir:63 1 MAIEIYPRKPVSRPHTEISVDTSGIGGSSSGSEKILCLVGSATGGKPNAVYKVRNYSQAKSVFRSGE-LLDAIERAWNPG 79 (562) T ss_pred CeeeeeCCCcccCCceEEEEecCCCcccCCCCCceEEEEEEeCCCCCceeEEEccHHHHHHHhcCCc-hHHHHHHhcccc Confidence 8877777777766 5554433222 233445555543 22333 345899999999998864 445566666422 Q ss_pred cCCccCCCEEEEEeeecccceeeEeecccc-----------------------ccc-h-------hhheeee--e----- Q lcl|NC_017984. 72 RNATTRPNSLFITKYNLTDVPASLIGGDIT-----------------------STT-L-------ADLKLIN--G----- 113 (487) Q Consensus 72 ~~q~p~P~~l~igr~~~~~~~~~l~g~~~~-----------------------~~~-~-------~~~~~~~--g----- 113 (487) +.. --+++|+-|-. .+.++.+.-+.+. ... 
+ .+..+.+ | T Consensus 80 ~~~--g~~~~~~~rv~-~a~~a~~~~~~~~~~a~~~g~~~n~i~v~~~~~~~~~~~~~~v~~~~~~~~ev~~~~g~V~~i 156 (562) T protein:vir:63 80 EGT--GAGDILAMRVE-EAKEATFEAEGVKVSSTIYGADANDIQVALEDNTITGTKRLSIVFAKERVNQVYDNLGSIFSI 156 (562) T ss_pred ccC--CceEEEEEEcC-CCccceeEecceeEEEeecccCCCeEEEEEecCCCCCCcceEEEecCCCcchhhhhccceeee Confidence 211 12457777763 3333332222111 000 0 0000100 1 Q ss_pred ---------------------EEEEEEccc-eEE--EEeeccccCchHHHHHhhhhhheeeEEEecccceEEEEeccccc Q lcl|NC_017984. 114 ---------------------TLTIVVDGV-SKS--VPVDLATANSYSDAAALIATALTLPCTYESTVKGFVIKSGTSGA 169 (487) Q Consensus 114 ---------------------~~~iti~g~-~~~--~~i~~s~ats~~~vA~~i~t~l~a~vt~d~~~~~f~its~t~g~ 169 (487) ++.+.+.+. .+. +.+..............|.+.......|-.. ..+.++...... T Consensus 157 ~y~g~~~~~~~~v~~~~~~~~a~~l~~~~g~~~v~~~~L~~g~~~~~~~l~~~in~~~~~~aky~~~-~gn~i~~~~~d~ 235 (562) T protein:vir:63 157 KYKGTEASATFTVAVDPVTFKATKLTLKAGDKTVKEYDLGSGAYAETNVLISDINNLPDFEAKFFPI-GDKNLTTDNFDA 235 (562) T ss_pred eeecccccceEEEEecCcceeEEEEEeecCCcceeEEEecCCccchhHHHHHhhccccceEEEeecc-CCceeeeecccc Confidence 011111111 111 1122222222333333443322222222111 011111100000 Q ss_pred --cee-------EEecccchh---------hh-----hhhccccceeEecCcc---cccHHHHHHHHHhcccceeEEEEE Q lcl|NC_017984. 170 --NST-------ISFATGDIS---------DD-----LKLTQETGAVLNNHTA---ADTPTTGALNALAFSQNFVNITYS 223 (487) Q Consensus 170 --~st-------it~atgd~a---------~~-----l~lt~~~gA~~~~G~a---aet~~~al~a~~~~~~~wy~~~~~ 223 (487) ... ++...+|+. .. -.+....+.....|.+ +++..+++++++.. +|+.+... T Consensus 236 ~~~~~vkt~~~~v~t~~~d~~~~~~~~~~v~~~~~~~~~la~~~~~~LtGG~dGt~~~~~~~al~ale~~--~~~~i~~~ 313 (562) T protein:vir:63 236 QIDVDIKTKEAYVKAVGGDIEKQTAYNGYVDFEFDRSKEIANFPLTKLTGGDNGTIPESWADKFSYFANE--GGYYLVPL 313 (562) T ss_pred ccccchhhhhhhhhhhhhhhhhcccccceeeeeeccccceecccceeeecCCCCCchhhHHHHHHHHHhC--CcEEEEec Confidence 000 000000100 00 0011112233333333 34556788888765 35544432 Q ss_pred eccCChhHHHHHHHHHhc---cCceEEEEEccccccccccchHHHHHHHhCCcceEEEecCC-----------CchH--- Q lcl|NC_017984. 
224 EGVFNEDALKDLALWVTS---QNSRFKLYTWGLDPVALGQSGASFGEWAKENTSGVVPLYGT-----------FDKA--- 286 (487) Q Consensus 224 ~~~~~~~~i~a~A~w~~a---~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~t~~~y~~-----------~~~~--- 286 (487) ..+.....++..|++. ..++...+...... .+.......-...++.|...+... .+++ T Consensus 314 --t~d~av~~~l~a~vkr~~~~g~~~~aVlg~~~~---~~~~~~~~~a~~~n~ervv~v~~~~~~~~~~~~~~~~~~~~~ 388 (562) T protein:vir:63 314 --TSKQAVHAEALQFVRDCSYNGNPMRVFVGGGIG---ESMEQLFTRAIGLQNERAGLIGFSGTVKMDDGRSLKMPGYMF 388 (562) T ss_pred --CCCHHHHHHHHHHHHHHHhCCCcEEEEecCCCC---CCHHHHHHHhhhcCCCcEEEEecCeeEECCCCceeeechhHH Confidence 2233344568899854 22333333322111 111122222223366666554321 1223 Q ss_pred -HHHHHHHHhcCcCcCCceeeeeeeecCccc-ccCCCHHHHHHHHhCCceEEEEecCCCceE-EEEECCEEc----CCce Q lcl|NC_017984. 287 -AFFCGVSGSINYQEENGRTTTAFRSQDGLV-PDVTNEADAETLVKNGYSFYGAWATANDRF-QFAGNGSVT----GQYK 359 (487) Q Consensus 287 -a~~~g~~as~~~~~~~gs~T~~fk~l~Gv~-~~~lt~t~~~al~~~~~n~y~~~~~~~~~~-~~~~~G~~s----gg~~ 359 (487) +.+.|..++.+. +.++| ||.++++. ...++.+|++.+..+|++.+....+..... ++. ++..+ .++. T Consensus 389 aa~vAGl~A~~~~---~~SlT--~~~i~~~~v~~~~t~~e~~~li~~Gv~~l~~~~~~~v~~~~iv-~~itT~t~~~~~~ 462 (562) T protein:vir:63 389 AAQVAGLTCGLEI---GEAIT--FKNIAIETLDTIYEGSQLDQLNESGIITAEFVRNRAVTNFRIV-DDVTTFNDKTDPV 462 (562) T ss_pred HHHHHHHhhcCch---hcCcc--ceeeccccccccCCHHHHHHHHhCCeEEEEEecCCcEEEEEee-ccceecCCCCCch Confidence 344454555443 33434 44555332 257999999999999999997765543321 222 23222 1222 Q ss_pred --ehHHHHHHHHHHHHHHHHHHHHHHhcCCCCcCHhhHHHHHHHHHHHHHHHHhcCccccCcccCccccccccccccCcc Q lcl|NC_017984. 360 --WIDNFDFQVFLRTQLQLAYMNMFQAQKTIPYNDQGIATVRAYSQDPIDQGINFGGIRAGVNLSNAQKFQVNQEAGFDA 437 (487) Q Consensus 360 --~iD~~~~~dWl~~~iq~~l~~ll~~~~kIPyt~~G~~~l~~~v~~vl~~a~~nG~Ia~Gv~l~~~q~~~~~~~~g~~~ 437 (487) .|-+++-.|.+.+.++..+-+.+.- | |=++.|...|++.++..|++-.+.|.|.-. + +. 
T Consensus 463 ~~ki~viRv~D~i~~dir~~~~~~yiG--k-~Nn~~~r~~v~~~i~~~L~~l~~~gaI~~~-~--~~------------- 523 (562) T protein:vir:63 463 KSEIGVGEANDFLVSELKISLDNEYIG--T-KIIDTSASLVKNFVQSFLDRKKLAKEIQDY-S--PE------------- 523 (562) T ss_pred hhhhhhhHHHHHHHHHHHHHHHhcCCc--c-ccChHHHHHHHHHHHHHHHHHHhCCcccCC-C--cc------------- Confidence 4778888888888887776655543 4 568899999999999999999999999521 0 00 Q ss_pred ccceeeeeeEEeccCCCHHHHhhcccCCeEEEEEECCeEEEEEEEEEeeC Q lcl|NC_017984. 438 ASQLFTKGWALSVTLPDSQTRVARESFIIKLFYTDGSSMQRLEMTATNVQ 487 (487) Q Consensus 438 ~~~~~~~Gy~~~~~~~s~~dra~R~~~~i~~~~~~aGAIh~v~i~gt~vq 487 (487) ++ .+. ...|+ . -+++.+...-++|+|.++.+.-| T Consensus 524 --dv-------~v~--~~~d~---~--~v~~~v~pv~~mekIy~ti~~~~ 557 (562) T protein:vir:63 524 --EV-------QVV--IEGDV---A--RISLTVFPIRSMKKIEVSLVYRQ 557 (562) T ss_pred --ce-------EEE--ecCCE---E--EEEEEEEEcccceEEEEEEEEee Confidence 00 010 11122 1 36788899999999999999999 No 27 >protein:vir:80488 Length: 562 # NCBI annotation: Tsh # Family: family:all:2449 # MgeID: mge:1883 # MgeName: A511 # Cross-refs: genbank:acc:YP_001468473;genbank:gi:157325048;genbank:GeneID:5601446 Probab=97.61 E-value=3.8e-05 Score=44.80 Aligned_cols=434 Identities=10% Similarity=0.014 Sum_probs=212.3 Q ss_pred CCcCCccccceEE--Eeeeeeccccc---ccccceeEEec-Cccee---eeeeccHHHHHHhcCCChHHHHHHHHHhhcc Q lcl|NC_017984. 1 MQFNSIPASNIAA--VYPAVIGGGGN---PLGLNTNLFVQ-DAIYP---NYEYFSNTLVGQHYGLESPIYKFATVYFNGF 71 (487) Q Consensus 1 ~~~~~ip~s~iV~--V~~~~~~~~~~---~~~~~~ll~~~-~~~~~---~~~y~s~~~V~~~fg~~s~ey~aA~~yF~g~ 71 (487) |+..=-|-.+++. |.+....-+.. ..+.+.+.|.+ ...-| +..+++-++.-+-||.+. 
--.+...+|..+ T Consensus 1 ~~~~~~~~~~~~~pgv~~~~~~s~~~~~~~~~~~~~~~ig~a~~G~~~~~~~~~~~~~~~~~f~~g~-l~~~i~~a~~~~ 79 (562) T protein:vir:80 1 MAIEIYPRKPVSRPHTEISVDTSGIGGSSSGSEKILCLVGSATGGKPNAVYKVRNYSQAKSVFRSGE-LLDAIERAWNPG 79 (562) T ss_pred CeeeeeCCCcccCCceEEEEecCCcccCCCCCCceEEEEEEeCCCCcceeEEEccHHHHHHHhcCCC-hHHHHHHhcccc Confidence 8855447666665 44444322222 23345555554 33333 345899999999998763 334556666422 Q ss_pred cCCccCCCEEEEEeeecccceeeEeecccc-----------------------ccc-h-------hhheee--------- Q lcl|NC_017984. 72 RNATTRPNSLFITKYNLTDVPASLIGGDIT-----------------------STT-L-------ADLKLI--------- 111 (487) Q Consensus 72 ~~q~p~P~~l~igr~~~~~~~~~l~g~~~~-----------------------~~~-~-------~~~~~~--------- 111 (487) +... -+++|+-|-. .+.++.+.-+.+. ... . .+..+. T Consensus 80 ~~~g--~~~~~~~rv~-~a~~a~~~~~~~~~~~~~~g~~~n~i~v~~~~~~~~~~~~~~v~~~~~~~~ev~~~~g~v~~i 156 (562) T protein:vir:80 80 EGTG--AGDILAMRVE-EAKEATFEAEGVKVSSTIYGADANDIQVALEDNTITGTKRLSIVFAKERVNQVYDNLGSIFSI 156 (562) T ss_pred cccC--ceEEEEEEcC-CCCcceEEecceEEEEeecccCCCceEEEEecCCCCCCcceEEEecCCcceEEeeccCceeee Confidence 2121 2346666652 2333332222111 000 0 000110 Q ss_pred ------------------ee-EEEEEEc-cceEEEE--eeccccCchHHHHHhhhhhheeeEEEecccceEEEEecccc- Q lcl|NC_017984. 112 ------------------NG-TLTIVVD-GVSKSVP--VDLATANSYSDAAALIATALTLPCTYESTVKGFVIKSGTSG- 168 (487) Q Consensus 112 ------------------~g-~~~iti~-g~~~~~~--i~~s~ats~~~vA~~i~t~l~a~vt~d~~~~~f~its~t~g- 168 (487) ++ ++.+.+. |..+... +..............|++.......|-.. ..+.|+..... T Consensus 157 ~y~g~~~~a~~~i~~~~~~~~a~~l~~~~g~~~v~~~~l~~g~~~~~~~l~~~i~~~~~~tAky~g~-~~n~i~~~~~d~ 235 (562) T protein:vir:80 157 KYKGTEASATFTVAVDPVTFKATKLTLKAGDKTVKEYDLGSGAYAETNVLISDINNLPDFEAKFFPI-GDKNLTTDNFDA 235 (562) T ss_pred eeccccccceeEEEecCccceEEEEEEecCCcceeEEEeCCCccchhhhhhhhhccccceEEEeccc-CCceeeeccccc Confidence 00 1222221 1122222 33232333333344444332222222111 11122211100 Q ss_pred --------cceeEEecccchhh------hh--------hhccccceeEecCcc---cccHHHHHHHHHhcccceeEEEEE Q lcl|NC_017984. 
169 --------ANSTISFATGDISD------DL--------KLTQETGAVLNNHTA---ADTPTTGALNALAFSQNFVNITYS 223 (487) Q Consensus 169 --------~~stit~atgd~a~------~l--------~lt~~~gA~~~~G~a---aet~~~al~a~~~~~~~wy~~~~~ 223 (487) ....++...+|+.. +. .++.-.++....|.+ +++..++++++... +|+.+... T Consensus 236 ~~~~~~kt~~~~v~~~~~d~~~~~~~n~~v~~~~~~~~~la~~~~~~LtGG~dG~~~~~~~dal~~Le~~--~~~~i~~~ 313 (562) T protein:vir:80 236 QIDVDIKTKEAYVKAVGGDIEKQTAYNGYVEFEFDRSKEIANFPLTKLTGGDNGTIPESWADKFSYFANE--GGYYLVPL 313 (562) T ss_pred chhhhcccceeeeeehhhhhhhcccccceEEEEeccCccccccceeeeeCCCCCCccccHHHHHHHHHhC--CcEEEEec Confidence 00111111122100 00 011112233334443 44567888888865 45545433 Q ss_pred eccCChhHHHHHHHHHhc---cCceEEEEEccccccccccchHHHHHHHhCCcceEEEecCC-----------Cch---- Q lcl|NC_017984. 224 EGVFNEDALKDLALWVTS---QNSRFKLYTWGLDPVALGQSGASFGEWAKENTSGVVPLYGT-----------FDK---- 285 (487) Q Consensus 224 ~~~~~~~~i~a~A~w~~a---~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~t~~~y~~-----------~~~---- 285 (487) . .+......+..|++. ..++...+.-.... .+.......-..-++.|...+... .++ T Consensus 314 t--~d~ai~~~~~a~vkr~r~~g~~~~aVvg~~~~---~~~~~~~~~a~~~n~e~vv~v~~~~~~~~~~~~~~~~~~~~~ 388 (562) T protein:vir:80 314 T--SKQAVHAEALQFVRDCSYNGNPMRVFVGGGIG---ESMEQLFTRAIGLQNERAGLIGFSGTVKMDDGRSLKMPGYMF 388 (562) T ss_pred C--CChHHHHHHHHHHHHHHhCCCeEEEEecCCCC---CCHHHHHHHhhhcCCCeEEEEecCeeEECCCCceeeechhHH Confidence 2 223344668889864 23334433322111 111122222223366676654321 122 Q ss_pred HHHHHHHHHhcCcCcCCceeeeeeeecCcccc-cCCCHHHHHHHHhCCceEEEEecCCCceE-EEEECCEEc----CCce Q lcl|NC_017984. 286 AAFFCGVSGSINYQEENGRTTTAFRSQDGLVP-DVTNEADAETLVKNGYSFYGAWATANDRF-QFAGNGSVT----GQYK 359 (487) Q Consensus 286 ~a~~~g~~as~~~~~~~gs~T~~fk~l~Gv~~-~~lt~t~~~al~~~~~n~y~~~~~~~~~~-~~~~~G~~s----gg~~ 359 (487) ++.+.|..++.++. .++ .||.++++.. ..++.+|++.+..+|++.+....+..... ++. ++..+ .++. 
T Consensus 389 aa~vAGl~Ag~~~~---~S~--T~~~i~~~~v~~~lt~~e~~~li~~G~l~l~~~~~~~v~~~riv-~~itT~t~~~~~~ 462 (562) T protein:vir:80 389 AAQVAGLTCGLEIG---EAI--TFKNIAIETLDTIYEGSQLDQLNESGIITAEFVRNRAVTNFRIV-DDVTTFNDKTDPV 462 (562) T ss_pred HHHHHHHHhcCccc---cCc--cceeeccccccccCCHHHHHHHHhCCeEEEEEecCCcEEEEEee-ccceeccCCCCch Confidence 33455555555433 333 4466665432 46899999999999999997765543322 222 23222 1223 Q ss_pred --ehHHHHHHHHHHHHHHHHHHHHHHhcCCCCcCHhhHHHHHHHHHHHHHHHHhcCccccCcccCccccccccccccCcc Q lcl|NC_017984. 360 --WIDNFDFQVFLRTQLQLAYMNMFQAQKTIPYNDQGIATVRAYSQDPIDQGINFGGIRAGVNLSNAQKFQVNQEAGFDA 437 (487) Q Consensus 360 --~iD~~~~~dWl~~~iq~~l~~ll~~~~kIPyt~~G~~~l~~~v~~vl~~a~~nG~Ia~Gv~l~~~q~~~~~~~~g~~~ 437 (487) .|-+++-.|.+.+.|+..+-+.+.- | |=++.|...|++.++..|++..+.|.|.-.. +. T Consensus 463 ~~ki~viRv~D~i~~dir~~~~~~yIG--k-~Nn~~~r~~v~~~i~~~L~~l~~~gaI~~~~---~~------------- 523 (562) T protein:vir:80 463 KSEIGVGEANDFLVSELKISLDNEYIG--T-KIIDTSASLVKNFVQSFLDRKKLAKEIQDYS---PE------------- 523 (562) T ss_pred hhhhhhhHHHHHHHHHHHHHHHhcCCc--c-ccChHHHHHHHHHHHHHHHHHHhCCcccCCC---cc------------- Confidence 3678888888888887776665543 4 5688999999999999999999999995311 00 Q ss_pred ccceeeeeeEEeccCCCHHHHhhcccCCeEEEEEECCeEEEEEEEEEeeC Q lcl|NC_017984. 438 ASQLFTKGWALSVTLPDSQTRVARESFIIKLFYTDGSSMQRLEMTATNVQ 487 (487) Q Consensus 438 ~~~~~~~Gy~~~~~~~s~~dra~R~~~~i~~~~~~aGAIh~v~i~gt~vq 487 (487) ++ .+. .++|+ . 
-+++.+...-++++|.++.+.-| T Consensus 524 --dv-------~v~--~~~d~---~--~v~~~v~Pv~~mekIy~ti~~~~ 557 (562) T protein:vir:80 524 --EV-------QVV--IEGDI---A--RISLTVFPIRSMKKIEVSLVYRQ 557 (562) T ss_pred --ce-------EEE--ecCCE---E--EEEEEEEEcccceEEEEEEEEEe Confidence 00 011 12222 1 37888999999999999999988 No 28 >protein:vir:6079 Length: 396 # NCBI annotation: gpFI # Family: family:all:115 # MgeID: mge:126 # MgeName: WPhi # Cross-refs: genbank:acc:NP_878220;genbank:gi:33438919;genbank:GeneID:1457754 Probab=97.53 E-value=5e-05 Score=44.14 Aligned_cols=364 Identities=11% Similarity=0.041 Sum_probs=187.1 Q ss_pred CCcCCccccceEEEeeeeecccccccccceeEEec-C-----ccee---eeeeccHHHHHHhcCCChHHHHHHHHHhhcc Q lcl|NC_017984. 1 MQFNSIPASNIAAVYPAVIGGGGNPLGLNTNLFVQ-D-----AIYP---NYEYFSNTLVGQHYGLESPIYKFATVYFNGF 71 (487) Q Consensus 1 ~~~~~ip~s~iV~V~~~~~~~~~~~~~~~~ll~~~-~-----~~~~---~~~y~s~~~V~~~fg~~s~ey~aA~~yF~g~ 71 (487) |+.- +|=-.+..+.....+ ....+.....|.+ . ...| ....++..+-...||.++..+.+...+|. T Consensus 1 m~~~-~~Gv~v~e~~~~~~~--v~~~~tav~~fvGta~~~~~~~~p~~~p~~v~s~~~~~~~~g~~~tl~~a~~~~~~-- 75 (396) T protein:vir:60 1 MSDY-HHGVQVLEINEGTRV--ISTVSTAIVGMVCTASDADAEIFPLNKPVLITNVQSAIAKAGKKGTLAASLQAIAD-- 75 (396) T ss_pred CCCC-CCCeEEEEcCCCccc--ccccCceeEEEEecccccccccccCccCeEeechHHHHHhhcCcchhHHHHHHHhh-- Confidence 7742 354444444333332 2333333222221 1 1122 12477888888889999999999999996 Q ss_pred cCCccCCCEEEEEeeecccceeeEeeccccccchhhheeeeeEEEEEEccceEEEEeeccccCchHHHHHhhhhhheeeE Q lcl|NC_017984. 72 RNATTRPNSLFITKYNLTDVPASLIGGDITSTTLADLKLINGTLTIVVDGVSKSVPVDLATANSYSDAAALIATALTLPC 151 (487) Q Consensus 72 ~~q~p~P~~l~igr~~~~~~~~~l~g~~~~~~~~~~~~~~~g~~~iti~g~~~~~~i~~s~ats~~~vA~~i~t~l~a~v 151 (487) +.. ...++-+......... . +....+. ..+... 
T Consensus 76 --~gg--~~~~vv~~~~~~~~~~----------~-----------------------~~~~~~~---------~~~~~~- 108 (396) T protein:vir:60 76 --QSK--PVTVVVRVEDGTGEDE----------E-----------------------TKLAQTV---------SNIIGT- 108 (396) T ss_pred --ccC--ceEEEEeccccccccc----------c-----------------------ccccccc---------cccccc- Confidence 321 2233322211100000 0 0000000 000000 Q ss_pred EEecccceEEEEecccccceeEEecccchhhhhhhccccceeEecCcccccHHHHHHHHHhcccceeEEEEEeccCChhH Q lcl|NC_017984. 152 TYESTVKGFVIKSGTSGANSTISFATGDISDDLKLTQETGAVLNNHTAADTPTTGALNALAFSQNFVNITYSEGVFNEDA 231 (487) Q Consensus 152 t~d~~~~~f~its~t~g~~stit~atgd~a~~l~lt~~~gA~~~~G~aaet~~~al~a~~~~~~~wy~~~~~~~~~~~~~ 231 (487) .|.. ...+|-..-. +.....+.. .......+........++..+ ......+.+.+.+... . T Consensus 109 -~d~~-------~~~tg~~al~-----~~~~~~~~~--~~il~ap~~~~~~v~~al~~~---~~~~~~~~i~d~p~~~-~ 169 (396) T protein:vir:60 109 -TDEN-------GQYTGLKALL-----AAESVTGVK--PRILGVPGLDTKEVAVALASV---CQKLRAFGYISAWGCK-T 169 (396) T ss_pred -cccc-------ccccchhhhh-----hcccceeee--eeeccccccccHHHHHHHHHH---hccCCeEEEEeCCCCC-C Confidence 0000 0000000000 000000000 011112223222333333333 3333445555554332 2 Q ss_pred HHHHHHHHhccCceEEEEEccccccccccchHHHHHHHhCCcceEEEecCCCchHHHHHHHHHhcCcCcCCceeeeeeee Q lcl|NC_017984. 232 LKDLALWVTSQNSRFKLYTWGLDPVALGQSGASFGEWAKENTSGVVPLYGTFDKAAFFCGVSGSINYQEENGRTTTAFRS 311 (487) Q Consensus 232 i~a~A~w~~a~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~t~~~y~~~~~~a~~~g~~as~~~~~~~gs~T~~fk~ 311 (487) ..++-+|.+.-+.++..+.+.--...+.. .+..+.+ .....+.|..+.++-++. -......|. T Consensus 170 ~~~a~~~~~~~~s~~~~~~~p~~~~~d~~----------~~~~~~~------p~s~~~AG~~a~~d~~~g-~~~spaN~~ 232 (396) T protein:vir:60 170 ISEVKAYRQNFSQRELMVIWPDFLAWDTV----------ASTTATA------YATARALGLRAKIDQEQG-WHKTLSNVG 232 (396) T ss_pred HHHHHHHHhhcCCceEEEEeCceeeeccc----------CCceeEE------chhHHHHHHHHHhhhccC-cEeCcCCce Confidence 33455666654444444433221111000 0111111 235667788887764432 122234666 Q ss_pred cCcccc--------cCCCHHHHHHHHhCCceEEEEecCCCceEEEEECCEEcCC--ceehHHHHHHHHHHHHHHHHHHHH Q lcl|NC_017984. 
312 QDGLVP--------DVTNEADAETLVKNGYSFYGAWATANDRFQFAGNGSVTGQ--YKWIDNFDFQVFLRTQLQLAYMNM 381 (487) Q Consensus 312 l~Gv~~--------~~lt~t~~~al~~~~~n~y~~~~~~~~~~~~~~~G~~sgg--~~~iD~~~~~dWl~~~iq~~l~~l 381 (487) +.||.. ...+.+|++.|..+|+|+... +.+ +.+|-.-+++++ ..||-+.+-.+|++..|+..+... T Consensus 233 l~gi~~~~~~~~~~~~~~~~~~~~Ln~~gI~~~~~--~~G--~~~wG~rT~~~d~~~~~i~~rR~~~~i~~~i~~~~~~~ 308 (396) T protein:vir:60 233 VNGVTGISASVFWDLQESGTDADLLNESGVTTLIR--RDG--FRFWGNRTCSDDPLFLFENYTRTAQVLADTMAEAHMWA 308 (396) T ss_pred ecceeeceeecccccCCCcchhhhhhhcCcEEEEc--CCC--EEEEcccccCCCcccceeehhhHHHHHHHHHHHHHHHh Confidence 777642 335778999999999999854 333 456643334433 237889999999999999888765 Q ss_pred HHhcCCCCcCHhhHHHHHHHHHHHHHHHHhcCccccCcccCccccccccccccCccccceeeeeeEEeccCCCHHHHhhc Q lcl|NC_017984. 382 FQAQKTIPYNDQGIATVRAYSQDPIDQGINFGGIRAGVNLSNAQKFQVNQEAGFDAASQLFTKGWALSVTLPDSQTRVAR 461 (487) Q Consensus 382 l~~~~kIPyt~~G~~~l~~~v~~vl~~a~~nG~Ia~Gv~l~~~q~~~~~~~~g~~~~~~~~~~Gy~~~~~~~s~~dra~R 461 (487) +-. |.|..-...|+..|+.-|+.-+++|.|..+. -|...+..+++|+.+. T Consensus 309 v~e----~n~~~~~~~i~~~i~~~l~~l~~~gal~g~~--------------------------~~~d~~~nt~~~i~~G 358 (396) T protein:vir:60 309 VDK----PITATLIRDIVDGINAKFRELKTNGYIVDAT--------------------------CWFSEESNDAETLKAG 358 (396) T ss_pred ccC----CCCHHHHHHHHHHHHHHHHHHHhCCceeceE--------------------------EEEecCCCCHHHhhCC Confidence 432 6788889999999999999999999985310 0234677889998887 Q ss_pred ccCCeEEEEEECCeEEEEEEEEEeeC Q lcl|NC_017984. 462 ESFIIKLFYTDGSSMQRLEMTATNVQ 487 (487) Q Consensus 462 ~~~~i~~~~~~aGAIh~v~i~gt~vq 487 (487) +.. 
+.+.+.....+++|.+.....= T Consensus 359 ~~~-~~i~~~p~~pae~I~~~~~~~~ 383 (396) T protein:vir:60 359 KLY-IDYDYTPVPPLENLTLRQRITD 383 (396) T ss_pred EEE-EEEEEEecCCcceEEEEEEEch Confidence 774 8888899999999998865322 No 29 >protein:vir:2035 Length: 396 # NCBI annotation: gpFI # Family: family:all:115 # MgeID: mge:315 # MgeName: P2 # Cross-refs: genbank:acc:NP_046778;genbank:gi:9630349;genbank:GeneID:1261516 Probab=97.46 E-value=6.2e-05 Score=43.65 Aligned_cols=352 Identities=13% Similarity=0.055 Sum_probs=148.5 Q ss_pred CCccCCCEEEEEeeecccceeeEeeccccccchhhheeeeeEEEEEEccceEEEE------eeccccCchHHHHHhhhhh Q lcl|NC_017984. 73 NATTRPNSLFITKYNLTDVPASLIGGDITSTTLADLKLINGTLTIVVDGVSKSVP------VDLATANSYSDAAALIATA 146 (487) Q Consensus 73 ~q~p~P~~l~igr~~~~~~~~~l~g~~~~~~~~~~~~~~~g~~~iti~g~~~~~~------i~~s~ats~~~vA~~i~t~ 146 (487) =..+.| =+||=+....+.+...-...+....-....+-.+.+-+ + +.+-++ ..+.....+.+....+ T Consensus 1 m~~~~~-GV~v~e~~~g~~~i~~v~tav~~~vg~a~~a~~~~~~l--~-~pvlvts~~~~~~~~g~~~tL~~al~~~--- 73 (396) T protein:vir:20 1 MSDYHH-GVQVLEINEGTRVISTVSTAIVGMVCTASDADAETFPL--N-KPVLITNVQSAISKAGKKGTLAASLQAI--- 73 (396) T ss_pred CCCCCC-CeEEEEcCCCcceeeecCCceeEEEeeeccCCCccccC--c-cCEEeechHHHHhhcccccchhhhhhhh--- Confidence 022344 25665554433221110000000000000000000000 0 000000 0111111111111111 Q ss_pred heeeEEEecccceEEEEeccccc--ceeEEecccchhhhhhhccccceeEecCcccccHHHHHHHHHhcccceeEEEEEe Q lcl|NC_017984. 147 LTLPCTYESTVKGFVIKSGTSGA--NSTISFATGDISDDLKLTQETGAVLNNHTAADTPTTGALNALAFSQNFVNITYSE 224 (487) Q Consensus 147 l~a~vt~d~~~~~f~its~t~g~--~stit~atgd~a~~l~lt~~~gA~~~~G~aaet~~~al~a~~~~~~~wy~~~~~~ 224 (487) +|.......+.....+. ......+ .+..............+..+++..........-.+.... 
T Consensus 74 ------~~ngg~~~~v~~~~~~~~~~~~~~~a---------~t~~~~~~~~~~~~~~tg~~al~~~~~~~~~~p~i~~ap 138 (396) T protein:vir:20 74 ------ADQSKPVTVVMRVEDGTGDDEETKLA---------QTVSNIIGTTDENGQYTGLKAMLAAESVTGVKPRILGVP 138 (396) T ss_pred ------hccCceeEEEEecccccccccccccc---------ccccccccccccccccchhhhhhhhccccccchhhhhhh Confidence 11111110111110000 0000000 000000000000000111111211111111110111111 Q ss_pred ccCChhHHHHHHHHHhccCceEEEEEccccccccccchHHHHHHHhCCcceEEEecC-----C--------CchHHHHHH Q lcl|NC_017984. 225 GVFNEDALKDLALWVTSQNSRFKLYTWGLDPVALGQSGASFGEWAKENTSGVVPLYG-----T--------FDKAAFFCG 291 (487) Q Consensus 225 ~~~~~~~i~a~A~w~~a~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~t~~~y~-----~--------~~~~a~~~g 291 (487) .........++...++... .++.+ +.+. ..+..+.......-+-.+....|. + ...+..+.| T Consensus 139 ~~~~~~v~~al~~~~~~~~-~~~~i--D~p~--~~~~~~a~~~r~~~~s~~~~~~~P~~~~~d~~~~~~~~~p~s~~~Ag 213 (396) T protein:vir:20 139 GLDTKEVAVALASVCQKLR-AFGYI--SAWG--CKTISEVKAYRQNFSQRELMVIWPDFLAWDTVTSTTATAYATARALG 213 (396) T ss_pred hhccHHHHHHHHHHHhcCC-cEEEE--ecCC--CCCHHHHHHHhhCCCCceEEEEcCccccccCcCCcceeechhHHHHH Confidence 1112223333333333221 11111 1110 001111111111111122222111 0 123556677 Q ss_pred HHHhcCcCcCCceeeeeeeecCcccc--------cCCCHHHHHHHHhCCceEEEEecCCCceEEEEECCEEcCC--ceeh Q lcl|NC_017984. 292 VSGSINYQEENGRTTTAFRSQDGLVP--------DVTNEADAETLVKNGYSFYGAWATANDRFQFAGNGSVTGQ--YKWI 361 (487) Q Consensus 292 ~~as~~~~~~~gs~T~~fk~l~Gv~~--------~~lt~t~~~al~~~~~n~y~~~~~~~~~~~~~~~G~~sgg--~~~i 361 (487) .++.++.++. -......|.+.||.. ..++++|++.|..+|+|.... 
+.+ +.+|-.-+++++ ..|| T Consensus 214 ~~a~~d~~~g-~~~spaN~~l~gi~~~~~~~~~~~~~~~~~~~~Ln~~gi~~~~~--~~G--~~~wG~rT~s~d~~~~~i 288 (396) T protein:vir:20 214 LRAKIDQEQG-WHKTLSNVGVNGVTGISASVFWDLQESGTDADLLNESGVTTLIR--RDG--FRFWGNRTCSDDPLFLFE 288 (396) T ss_pred HHHHhhhhcC-cEeccCCceeccceecceecccccCCCcchhhhhhhcCcEEEEc--CCC--EEEEcccccCCCccccee Confidence 7777764432 222344566777642 235678999999999999864 333 556643334433 2378 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCCCCcCHhhHHHHHHHHHHHHHHHHhcCccccCcccCccccccccccccCccccce Q lcl|NC_017984. 362 DNFDFQVFLRTQLQLAYMNMFQAQKTIPYNDQGIATVRAYSQDPIDQGINFGGIRAGVNLSNAQKFQVNQEAGFDAASQL 441 (487) Q Consensus 362 D~~~~~dWl~~~iq~~l~~ll~~~~kIPyt~~G~~~l~~~v~~vl~~a~~nG~Ia~Gv~l~~~q~~~~~~~~g~~~~~~~ 441 (487) -+.+-.+|+...|+..+...+=. |.|+.=...|+..++.-|++-+++|.|..+. T Consensus 289 ~~rR~~~~i~~~~~~~~~~~v~e----~~~~~~~~~i~~~i~~~L~~l~~~G~l~g~~---------------------- 342 (396) T protein:vir:20 289 NYTRTAQVVADTMAEAHMWAVDK----PITATLIRDIVDGINAKFRELKTNGYIVDAT---------------------- 342 (396) T ss_pred ehhhHHHHHHHHHHHHHHHhccC----CCCHHHHHHHHHHHHHHHHHHHhCcceeceE---------------------- Confidence 88999999999998888764432 6788889999999999999999999985321 Q ss_pred eeeeeEEeccCCCHHHHhhcccCCeEEEEEECCeEEEEEEEEEeeC Q lcl|NC_017984. 442 FTKGWALSVTLPDSQTRVARESFIIKLFYTDGSSMQRLEMTATNVQ 487 (487) Q Consensus 442 ~~~Gy~~~~~~~s~~dra~R~~~~i~~~~~~aGAIh~v~i~gt~vq 487 (487) -+...+..|++|+.+.+.. 
+.+.+.....+++|+++....= T Consensus 343 ----v~~d~~~nt~~~i~~G~~~-~~i~~~p~~p~e~i~~~~~~~~ 383 (396) T protein:vir:20 343 ----CWFSEESNDAETLKAGKLY-IDYDYTPVPPLENLTLRQRITD 383 (396) T ss_pred ----EEEecCCCCHHHhhCCEEE-EEEEEEecCCcceEEEEEEEch Confidence 0235677889998888874 8899999999999998854221 No 30 >protein:vir:5711 Length: 396 # NCBI annotation: gpFI # Family: family:all:115 # MgeID: mge:120 # MgeName: L-413C # Cross-refs: genbank:acc:NP_839870;genbank:gi:30065725;genbank:GeneID:1260618 Probab=97.46 E-value=6.3e-05 Score=43.62 Aligned_cols=349 Identities=14% Similarity=0.076 Sum_probs=147.0 Q ss_pred CCccCCCEEEEEeeecccceeeEeeccccccchhhh---eeeeeEE-E----EEEccceEEEEeeccccCchHHHHHhhh Q lcl|NC_017984. 73 NATTRPNSLFITKYNLTDVPASLIGGDITSTTLADL---KLINGTL-T----IVVDGVSKSVPVDLATANSYSDAAALIA 144 (487) Q Consensus 73 ~q~p~P~~l~igr~~~~~~~~~l~g~~~~~~~~~~~---~~~~g~~-~----iti~g~~~~~~i~~s~ats~~~vA~~i~ 144 (487) =..+.| =+||=+......+.. + +.+....+. ...++.. - +.+.+... ....+.....+..+...+. T Consensus 1 m~~~~~-GV~v~e~~~g~~~i~--~--v~tav~~~vg~a~~~d~~~~~~~~pv~i~s~~~-~~~~~g~~~tl~~al~~~~ 74 (396) T protein:vir:57 1 MSDYHH-GVQVLEINDGTRVIS--T--VSTAIVGMVCTASDADAETFPLNKPVLITNVQS-AIAKAGKKGTLAASLQAIA 74 (396) T ss_pred CCCCCC-ceEEEEcCCCccccc--c--cCCceEEEEEeccCCCcccccCccCeEeecchh-hhhhcccccchHHHHHHhh Confidence 022344 255544433222111 0 000000000 0000000 0 00000000 0000111111111111111 Q ss_pred hhheeeEEEecccceEEEEecccccceeEEecccchhhhhhhccccceeEecCcccccHHHHHHHHHhcccc---eeEEE Q lcl|NC_017984. 145 TALTLPCTYESTVKGFVIKSGTSGANSTISFATGDISDDLKLTQETGAVLNNHTAADTPTTGALNALAFSQN---FVNIT 221 (487) Q Consensus 145 t~l~a~vt~d~~~~~f~its~t~g~~stit~atgd~a~~l~lt~~~gA~~~~G~aaet~~~al~a~~~~~~~---wy~~~ 221 (487) ...+..+ .+.....+....-. ... ..+...+..+.......+.+.++.+.... -..+. 
T Consensus 75 ~~~~~~~---------~vv~~~~~~~~~~~-------~~~---a~t~~~iiG~~~~~~~~tgl~al~~~~~~~~~~p~i~ 135 (396) T protein:vir:57 75 DQSKPVT---------VVVRVEDGTGDDEE-------TKL---AQTVSNIIGTTDENGQYTGLKALMGAESVTGVKPRIL 135 (396) T ss_pred hcCCcee---------Eeeecccccccccc-------ccc---cccceeeeeeccccccchhhhhhhhcccceeEEeccc Confidence 1101111 11110000000000 000 00000000000001111112222222111 11111 Q ss_pred EEeccCChhHHHHHHHHHhccCceEEEEEccccccccccchHHHHHHHhCCcceEEEecC------C-------CchHHH Q lcl|NC_017984. 222 YSEGVFNEDALKDLALWVTSQNSRFKLYTWGLDPVALGQSGASFGEWAKENTSGVVPLYG------T-------FDKAAF 288 (487) Q Consensus 222 ~~~~~~~~~~i~a~A~w~~a~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~t~~~y~------~-------~~~~a~ 288 (487) .+..........++...++.. .++.++. .+.. .+..........-+..+..+.|. + ...+.. T Consensus 136 ~ap~~~~~~v~~al~~~~~~~-~~~~~~d--~p~~--~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~ 210 (396) T protein:vir:57 136 GVPGLDTKEVAVALASVCQEL-NAFGYIS--AWGC--KTISEVKAYRQNFSQRELMVIWPDFLAWDTVTSTTATAYATAR 210 (396) T ss_pred cCcccchhHHHHHHHHHhhhC-ceEEEEc--CCCC--CCHHHHHHHHhccCCceEEEEcceeeeecccCCceeEEehhHH Confidence 111111122222333333322 2222211 1100 00011111111112222222221 0 123566 Q ss_pred HHHHHHhcCcCcCCceeeeeeeecCcccc--------cCCCHHHHHHHHhCCceEEEEecCCCceEEEEECCEEcCCc-- Q lcl|NC_017984. 289 FCGVSGSINYQEENGRTTTAFRSQDGLVP--------DVTNEADAETLVKNGYSFYGAWATANDRFQFAGNGSVTGQY-- 358 (487) Q Consensus 289 ~~g~~as~~~~~~~gs~T~~fk~l~Gv~~--------~~lt~t~~~al~~~~~n~y~~~~~~~~~~~~~~~G~~sgg~-- 358 (487) +.|..+.++..+. -.....+|.+.||.. ...+.+|++.|..+|+|+... +.+ +.+|-.-++++.. 
T Consensus 211 ~Ag~~a~~d~~~g-~~~spaN~~l~gi~~~~~~~~~~~~~~~~~~~~Ln~~gi~t~~~--~~G--~~~wG~rT~~~d~~~ 285 (396) T protein:vir:57 211 ALGLRAKIDQEQG-WHKTLSNVGVNGVTGISASVFWDLQKPGTDADLLNEAGVTTLVR--RDG--FRFWGNRTCSDDPLF 285 (396) T ss_pred HHHHHHHhhhccC-cEeccCCceeccccccceecccccCCcchhhhhhhhcCcEEEEc--CCC--EEEEcccccCCCccc Confidence 7777777764432 233344677777653 234678999999999999864 333 5566433343332 Q ss_pred eehHHHHHHHHHHHHHHHHHHHHHHhcCCCCcCHhhHHHHHHHHHHHHHHHHhcCccccCcccCccccccccccccCccc Q lcl|NC_017984. 359 KWIDNFDFQVFLRTQLQLAYMNMFQAQKTIPYNDQGIATVRAYSQDPIDQGINFGGIRAGVNLSNAQKFQVNQEAGFDAA 438 (487) Q Consensus 359 ~~iD~~~~~dWl~~~iq~~l~~ll~~~~kIPyt~~G~~~l~~~v~~vl~~a~~nG~Ia~Gv~l~~~q~~~~~~~~g~~~~ 438 (487) .||-+.+-.+|++..|+..+...+=. |.++.=...|+..|+.-|+.-+++|.|..+. T Consensus 286 ~~i~vrR~~~~i~~~i~~~~~~~v~e----~n~~~~~~~i~~~i~~~l~~l~~~gal~g~~------------------- 342 (396) T protein:vir:57 286 LFESYTRTAQVLADTMAEAHMWAIDK----PITATLIRDIIDGINAKFRELKNNGYIVDGT------------------- 342 (396) T ss_pred ceeehhhHHHHHHHHHHHHHHHhccC----CCCHHHHHHHHHHHHHHHHHHHhCCceeceE------------------- Confidence 37888889999998888887764432 6788889999999999999999999996421 Q ss_pred cceeeeeeEEeccCCCHHHHhhcccCCeEEEEEECCeEEEEEEEEEeeC Q lcl|NC_017984. 439 SQLFTKGWALSVTLPDSQTRVARESFIIKLFYTDGSSMQRLEMTATNVQ 487 (487) Q Consensus 439 ~~~~~~Gy~~~~~~~s~~dra~R~~~~i~~~~~~aGAIh~v~i~gt~vq 487 (487) + +...+..+++++.+.+.. 
+.+.+.....+++|.++....= T Consensus 343 --v-----~~d~~~n~~~~i~~G~~~-~~v~~~p~~p~e~I~~~~~~~~ 383 (396) T protein:vir:57 343 --C-----WFSEESNDAETLKAGKLY-IDYDYTPVPPLENLTLRQRITS 383 (396) T ss_pred --E-----EEecCCCCHHHhhCCeEE-EEEEEEecCCcceEEEEEEEch Confidence 0 234567788998888874 8899999999999998854222 No 31 >protein:vir:10336 Length: 386 # NCBI annotation: ORF39 # Family: family:all:115 # MgeID: mge:182 # MgeName: VHML # Cross-refs: genbank:acc:NP_758931;genbank:gi:27311206;genbank:GeneID:956116 Probab=97.37 E-value=8.3e-05 Score=42.96 Aligned_cols=360 Identities=9% Similarity=-0.001 Sum_probs=190.3 Q ss_pred CCcCCccccceEEEeeeeecccccccccceeEEecC----cceee---eeeccHHHHHHhcCCChHHHHHHHHHhhcccC Q lcl|NC_017984. 1 MQFNSIPASNIAAVYPAVIGGGGNPLGLNTNLFVQD----AIYPN---YEYFSNTLVGQHYGLESPIYKFATVYFNGFRN 73 (487) Q Consensus 1 ~~~~~ip~s~iV~V~~~~~~~~~~~~~~~~ll~~~~----~~~~~---~~y~s~~~V~~~fg~~s~ey~aA~~yF~g~~~ 73 (487) ||.+-.|=-.++.|.-...+.........+++-+.. ...|. ...++..+....||..-..+.+...+|. T Consensus 1 M~~~~~~Gv~v~ev~~~~~~i~~v~tav~~~vg~a~~a~~~~~~~~~pv~i~s~~~~~~~~g~~~tl~~a~~~~~~---- 76 (386) T protein:vir:10 1 MAEQYLHGAEVVEIDNGARPIRTAQSGVIGLVGTAPDADATAFPLNTPVLIAGSRREAAKLGAGGTLPQAIDGIFD---- 76 (386) T ss_pred CccccCCCeEEEEcCCCcccccccCcceeEEEEecCCCCCcccccccceEecchHHHHhhcCCCcchhHHHHHHhc---- Confidence 987766766666665544443333333333333211 11221 2367777777889999899999999996 Q ss_pred CccCCCEEEEEeeecccceeeEeeccccccchhhheeeeeEEEEEEccceEEEEeeccccCchHHHHHhhhhhheeeEEE Q lcl|NC_017984. 74 ATTRPNSLFITKYNLTDVPASLIGGDITSTTLADLKLINGTLTIVVDGVSKSVPVDLATANSYSDAAALIATALTLPCTY 153 (487) Q Consensus 74 q~p~P~~l~igr~~~~~~~~~l~g~~~~~~~~~~~~~~~g~~~iti~g~~~~~~i~~s~ats~~~vA~~i~t~l~a~vt~ 153 (487) +.. ...++-+...... ...+.. . .+++. . ..+.. -..+....... 
T Consensus 77 ~gg--~~~~vv~~~~~~~---------~~~t~~--~--------~ig~~-~-------~~t~~---~tgl~~l~~~~--- 121 (386) T protein:vir:10 77 QTG--AVVVVIRVDEGVD---------SAATQS--N--------VIGKV-D-------ADTEQ---YTGILALLSAE--- 121 (386) T ss_pred cCc--eeEEEeecccccc---------ccccch--h--------hhccc-c-------cccch---hhhhHHhhhhc--- Confidence 332 2344443321110 000000 0 00000 0 00000 00000000000 Q ss_pred ecccceEEEEecccccceeEEecccchhhhhhhccccceeEecCcccccHHHHHHHHHhcccceeEEEEEeccCChhHHH Q lcl|NC_017984. 154 ESTVKGFVIKSGTSGANSTISFATGDISDDLKLTQETGAVLNNHTAADTPTTGALNALAFSQNFVNITYSEGVFNEDALK 233 (487) Q Consensus 154 d~~~~~f~its~t~g~~stit~atgd~a~~l~lt~~~gA~~~~G~aaet~~~al~a~~~~~~~wy~~~~~~~~~~~~~i~ 233 (487) ..+.+. ..+..+ .+. .......+.+....+.+..+...+...+.. . T Consensus 122 ----~~~~~~-------p~i~~a-------------------p~~--~~~~~v~~~l~~~~~~~~~~~~~~~~~~~~--~ 167 (386) T protein:vir:10 122 ----NTVKVQ-------PRILIA-------------------PGF--SNQKAVADQLVSVADTAAWLCHSGWSNTTD--A 167 (386) T ss_pred ----cccccc-------cccccc-------------------ccc--cchhHHHHHHHHhhcceEEEEEeCCCCCch--H Confidence 000000 000000 000 001122333444444455555544432222 2 Q ss_pred HHHHHHhccCceEEEEEccccccccccchHHHHHHHhCCcceEEEecCCCchHHHHHHHHHhcCcCcCCceeeeeeeecC Q lcl|NC_017984. 234 DLALWVTSQNSRFKLYTWGLDPVALGQSGASFGEWAKENTSGVVPLYGTFDKAAFFCGVSGSINYQEENGRTTTAFRSQD 313 (487) Q Consensus 234 a~A~w~~a~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~t~~~y~~~~~~a~~~g~~as~~~~~~~gs~T~~fk~l~ 313 (487) ....|.+.-++.+..+.+..-...... .+-.+.+ .....++|..+.++.++. =......|.+. T Consensus 168 ~a~~~~~~~~s~~~~~~~p~~~v~~~~----------~~~~~~~------p~s~~~ag~~a~~D~~~G-~~~spaN~~l~ 230 (386) T protein:vir:10 168 AAITYRELFGSRRCEVVDPWYKVWDVE----------TSAHIIQ------PPSARHAGVMAKVHNTLG-FWWSNSNQEIL 230 (386) T ss_pred HHHHhhhcccccceEEecCceeeeccc----------cccceee------chHHHHHHHHHHhhhcCC-cEEccCCceee Confidence 344566655554444333211110000 0111111 235667788888765431 12234456666 Q ss_pred cccc--------cCCCHHHHHHHHhCCceEEEEecCCCceEEEEECCEEcCCc--eehHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_017984. 
314 GLVP--------DVTNEADAETLVKNGYSFYGAWATANDRFQFAGNGSVTGQY--KWIDNFDFQVFLRTQLQLAYMNMFQ 383 (487) Q Consensus 314 Gv~~--------~~lt~t~~~al~~~~~n~y~~~~~~~~~~~~~~~G~~sgg~--~~iD~~~~~dWl~~~iq~~l~~ll~ 383 (487) ||.- ...++.|.+.|.++|+|.... +.+ +.+|-.-+++++. .||-+.+-.+|+...|+..+...+= T Consensus 231 gv~~~~~~~~~~~~~~~~~~~~l~~~gi~~~~~--~~G--~~~wG~rT~~~d~~~~~i~vrR~~~~i~~~~~~~~~~~v~ 306 (386) T protein:vir:10 231 GIDGLCRPVDFKLDDPTCRANLLNAKEVTTTIQ--QNG--FRVWGDRTCSADSKWAFKNVVITNDMIADSLVRNHLWAVD 306 (386) T ss_pred cccccceecccccccCcchhhhhhhcCcEEEEc--CCC--EEEEcccccCCCcccceeehhhHHHHHHHHHHHHHHHhcc Confidence 6642 234688999999999998853 333 5666444444442 3788888999999888888876443 Q ss_pred hcCCCCcCHhhHHHHHHHHHHHHHHHHhcCccccCcccCccccccccccccCccccceeeeeeEEeccCCCHHHHhhccc Q lcl|NC_017984. 384 AQKTIPYNDQGIATVRAYSQDPIDQGINFGGIRAGVNLSNAQKFQVNQEAGFDAASQLFTKGWALSVTLPDSQTRVARES 463 (487) Q Consensus 384 ~~~kIPyt~~G~~~l~~~v~~vl~~a~~nG~Ia~Gv~l~~~q~~~~~~~~g~~~~~~~~~~Gy~~~~~~~s~~dra~R~~ 463 (487) . |.++.=...|+..|+.-|+.-+++|.|..+. + +++.+..+++|+.+.+. T Consensus 307 e----~~~~~~~~~i~~~i~~~L~~l~~~g~l~g~~---------------------v-----~~d~~~nt~~~~~~G~~ 356 (386) T protein:vir:10 307 R----NITKTYVEDVTEGVNNYLRHLKNIGAIAGGE---------------------C-----WVDPELNSPDQIQQGKV 356 (386) T ss_pred C----CCCHHHHHHHHHHHHHHHHHHHhCCceeeeE---------------------E-----EEcccCCCHHHhhCCeE Confidence 2 6788889999999999999999999885310 0 23466788999888888 Q ss_pred CCeEEEEEECCeEEEEEEEEEeeC Q lcl|NC_017984. 464 FIIKLFYTDGSSMQRLEMTATNVQ 487 (487) Q Consensus 464 ~~i~~~~~~aGAIh~v~i~gt~vq 487 (487) . +.+.+.....+++++++. 
.| T Consensus 357 ~-~~i~~~p~~p~e~i~~~~--~~ 377 (386) T protein:vir:10 357 Y-FDYDFSAYAPAEHITFRS--HM 377 (386) T ss_pred E-EEEEEEecCCceeEEEEE--EE Confidence 5 899999999999999874 44 No 32 >protein:vir:1845 Length: 392 # NCBI annotation: tail sheath protein FI # Family: family:all:115 # MgeID: mge:324 # MgeName: 186 # Cross-refs: genbank:acc:NP_052269;genbank:gi:9634076;genbank:GeneID:1262448 Probab=97.23 E-value=0.00012 Score=42.00 Aligned_cols=347 Identities=15% Similarity=0.086 Sum_probs=146.4 Q ss_pred HHHHHhhcccCCccCCCEEEEEeeecccceeeEeeccccccchhhheeeeeEEEEEEccce--EEEEeeccccCchHHHH Q lcl|NC_017984. 63 FATVYFNGFRNATTRPNSLFITKYNLTDVPASLIGGDITSTTLADLKLINGTLTIVVDGVS--KSVPVDLATANSYSDAA 140 (487) Q Consensus 63 aA~~yF~g~~~q~p~P~~l~igr~~~~~~~~~l~g~~~~~~~~~~~~~~~g~~~iti~g~~--~~~~i~~s~ats~~~vA 140 (487) |+ .+.| =++|=+......+.. + +.+.... .-|+.. .-++.. ...++-+ ++..+.. T Consensus 1 m~----------~~~~-Gv~v~e~~~g~~~i~--~--~~tav~g----~vgta~-~~~~~~~~~~~p~~i---ts~~~~~ 57 (392) T protein:vir:18 1 MS----------DFHH-GTKVIEINDGTRVIS--T--VATAIVG----MVWTAS-DADAETFPLNEPVLI---TNVQSAI 57 (392) T ss_pred CC----------CCCC-CeEEEEcCCCceeee--c--cCcceeE----EEEecc-CCCCcccccccceEe---echHHHH Confidence 11 1222 144433332221111 0 0000000 000000 000000 0000000 0111111 Q ss_pred Hhhhhh--heee--EEEecccc-eEEEEecccccceeEEecccchhhhhhhccccceeEecCcccccHHHHHHHHHhccc Q lcl|NC_017984. 141 ALIATA--LTLP--CTYESTVK-GFVIKSGTSGANSTISFATGDISDDLKLTQETGAVLNNHTAADTPTTGALNALAFSQ 215 (487) Q Consensus 141 ~~i~t~--l~a~--vt~d~~~~-~f~its~t~g~~stit~atgd~a~~l~lt~~~gA~~~~G~aaet~~~al~a~~~~~~ 215 (487) ...-.. +..+ .-++.... .+++.. ..+.. ......+.. .+..+.+.....+.+.++.+... 
T Consensus 58 ~~~g~~gtl~~al~~~~~ngg~~~~vv~v-~~~~~----------~~~~~~t~~---dliG~~~~~~~~tg~~al~~~~~ 123 (392) T protein:vir:18 58 AKAGKKGTLSASLQAIADQSKPVTVVVRV-AEGTG----------DDAEAQTTS---NIIGGTDENGKYTGIKALLTAEA 123 (392) T ss_pred hhcCCCcchHHHHHHhhcccCceEEEecc-ccccc----------ccccccchh---hheecccccchhhhHHHHHhhhh Confidence 000000 0000 00000000 001100 00000 000000000 00111111111222222222111 Q ss_pred -cee--EEEEEeccCChhHHHHHHHHHhccCceEEEEEccccccccccchHHHHHHHhCCcceEEEecC-----C----- Q lcl|NC_017984. 216 -NFV--NITYSEGVFNEDALKDLALWVTSQNSRFKLYTWGLDPVALGQSGASFGEWAKENTSGVVPLYG-----T----- 282 (487) Q Consensus 216 -~wy--~~~~~~~~~~~~~i~a~A~w~~a~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~t~~~y~-----~----- 282 (487) .+. .++.+..........++...++.- ..+..+.. +. ..+..........-+..+..+.|. + T Consensus 124 ~~~~~p~il~ap~~~~~~v~~~l~~~~~~~-~~~~~~d~--~~--~~~~~~a~~~~~~~~s~~~~~~~p~~~~~d~~~~~ 198 (392) T protein:vir:18 124 VTGVKPRILGVPGLDTQEVATALASVCISL-RAFGYVSA--WG--CKTISEAMAYRENFSQRELMVIWPDFLAWDTTANA 198 (392) T ss_pred hhceeehhcccCccchHHHHHHHHHHHhhc-CcEEEEec--CC--CCCHHHHHHHHhhccCceEEEEeCceeeecccCCc Confidence 111 111121111222233333333321 12222111 00 001111111111111122222221 0 Q ss_pred ---CchHHHHHHHHHhcCcCcCCceeeeeeeecCcccc--------cCCCHHHHHHHHhCCceEEEEecCCCceEEEEEC Q lcl|NC_017984. 283 ---FDKAAFFCGVSGSINYQEENGRTTTAFRSQDGLVP--------DVTNEADAETLVKNGYSFYGAWATANDRFQFAGN 351 (487) Q Consensus 283 ---~~~~a~~~g~~as~~~~~~~gs~T~~fk~l~Gv~~--------~~lt~t~~~al~~~~~n~y~~~~~~~~~~~~~~~ 351 (487) ..+.+.+.|..+.++.++. =.....+|.+.||.. ..++..|++.|..+|+|++.. +.+ +.+|-. 
T Consensus 199 ~~~~p~s~~~AG~~a~~d~~~g-~~~spaN~~l~gi~~~~~~~~~~~~~~~~~~~~Ln~~gI~t~~~--~~G--~~~wG~ 273 (392) T protein:vir:18 199 TATAYATARALGLRAYIDQTIG-WHKTLSNVGVQGVTGISASVFWDLQASGTDADLLNEAGVTTLVR--KDG--FRFWGN 273 (392) T ss_pred eEEechHHHHHHHHHhhhccCC-ceEccCCceeeceeecceecccccCCCcchhhhhhhcCceEEEc--CCC--EEEEcc Confidence 1235666777777764332 222334566777642 234678999999999999864 333 556643 Q ss_pred CEEcCCc--eehHHHHHHHHHHHHHHHHHHHHHHhcCCCCcCHhhHHHHHHHHHHHHHHHHhcCccccCcccCccccccc Q lcl|NC_017984. 352 GSVTGQY--KWIDNFDFQVFLRTQLQLAYMNMFQAQKTIPYNDQGIATVRAYSQDPIDQGINFGGIRAGVNLSNAQKFQV 429 (487) Q Consensus 352 G~~sgg~--~~iD~~~~~dWl~~~iq~~l~~ll~~~~kIPyt~~G~~~l~~~v~~vl~~a~~nG~Ia~Gv~l~~~q~~~~ 429 (487) =+++++. .||-+.+-.+|+...|+..+...+= | |.++.-...|+..++.-|++-+++|.|..+. T Consensus 274 rT~~~d~~~~~i~~rR~~~~i~~~i~~~~~~~v~---e-~n~~~~~~~i~~~i~~~L~~l~~~gal~g~~---------- 339 (392) T protein:vir:18 274 RTCSDDPLFLFENYTRTAQVLADTMAEAHMWAVD---K-PITASLIRDIVDGINAKFRELKSNGYIVDGE---------- 339 (392) T ss_pred cccCCCcccceeehhhHHHHHHHHHHHHHHHhcc---C-CCCHHHHHHHHHHHHHHHHHHHhcCcccceE---------- Confidence 3344332 3788999999999999888876443 2 7899999999999999999999999996421 Q ss_pred cccccCccccceeeeeeEEeccCCCHHHHhhcccCCeEEEEEECCeEEEEEEEEEeeC Q lcl|NC_017984. 430 NQEAGFDAASQLFTKGWALSVTLPDSQTRVARESFIIKLFYTDGSSMQRLEMTATNVQ 487 (487) Q Consensus 430 ~~~~g~~~~~~~~~~Gy~~~~~~~s~~dra~R~~~~i~~~~~~aGAIh~v~i~gt~vq 487 (487) - |+.....+++|+.+.+.. 
+.+.+.....+++++++....= T Consensus 340 ---------------v-~~d~~~nt~~~i~~G~~~-~~v~~~p~~p~e~I~~~~~~~~ 380 (392) T protein:vir:18 340 ---------------C-WFDEESNDKETLKAGKLY-IDYDYTPVPPLESLTLRQRITD 380 (392) T ss_pred ---------------E-EEecCCCCHHHhhCCeEE-EEEEEEecCCcceEEEEEEEch Confidence 0 234667889998888874 8888899999999998864321 No 33 >protein:vir:96586 Length: 587 # NCBI annotation: ORF011 # Family: family:all:2449 # MgeID: mge:1623 # MgeName: Twort # Cross-refs: genbank:acc:YP_238553;genbank:gi:66391266;genbank:GeneID:5130368 Probab=97.22 E-value=0.00013 Score=41.97 Aligned_cols=435 Identities=11% Similarity=0.068 Sum_probs=205.1 Q ss_pred CCcCCccccceEE--Eeeeeecc--cccc-cccceeEEec-Cccee---eeeeccHHHHHHhcCCChHHHHHHHHHhhcc Q lcl|NC_017984. 1 MQFNSIPASNIAA--VYPAVIGG--GGNP-LGLNTNLFVQ-DAIYP---NYEYFSNTLVGQHYGLESPIYKFATVYFNGF 71 (487) Q Consensus 1 ~~~~~ip~s~iV~--V~~~~~~~--~~~~-~~~~~ll~~~-~~~~~---~~~y~s~~~V~~~fg~~s~ey~aA~~yF~g~ 71 (487) |+..=-|-.++.. |.+....- ...+ .+.+.+.|.+ ...-| +..+++.++.-+-||.+ +-+.|..+.|... T Consensus 1 ~~~~~~~~~~~~~Pgv~~~~~~~~~~~~~~~~~~~~~~iG~a~~g~~~~~~~~~~~~~~~~~~g~G-~l~~ai~~a~~~~ 79 (587) T protein:vir:96 1 MAKDIFPRRPIQRPHASIEVDSSGIGGSASNSEKILCLIGKAEGGEPNTVYQVRNYAQAKSVFRSG-ELLDAIELAWGSN 79 (587) T ss_pred CeeeeeCCCcccCCceEEEEecCCccCCCCCCCceEEEEEEecCCCCceeEEEcChHHHHHhhcCC-cHHHHHHHHhccC Confidence 8876677777765 44444322 2222 2334454443 22223 33488889989999987 4677778888532 Q ss_pred cCCccCCCEEEEEeeecccceeeEeeccccc--------------cc-------hhhhe----------eee--e-EEEE Q lcl|NC_017984. 72 RNATTRPNSLFITKYNLTDVPASLIGGDITS--------------TT-------LADLK----------LIN--G-TLTI 117 (487) Q Consensus 72 ~~q~p~P~~l~igr~~~~~~~~~l~g~~~~~--------------~~-------~~~~~----------~~~--g-~~~i 117 (487) +-. -...+|.=| +..++++.+.-+.+.. .. ...+. 
+.+ | -++| T Consensus 80 ~~~--g~~~~~a~r-v~~~~~a~~~~~~~~~~~~~~g~~~n~i~v~~~~~~~~~~~~~~~~~~~~~~~~~~~n~G~v~~i 156 (587) T protein:vir:96 80 PQY--TAGKILAMR-VEDAKASQLEKGGLRVTSKIFGSVSNDIQVALEKNTITDSLRLRVVFQKDNYQEVFDNLGNIFSI 156 (587) T ss_pred cCC--CceEEEEEe-cCCCccceeecccccccccccCCCCceEEEEEEeccCCCccceEEEEecCCceeeccccCceEEE Confidence 211 234455444 3334444322221110 00 00000 000 1 1122 Q ss_pred EEccceEE----------------------------EEeeccccCchHHHHHhhhhhheeeEEEecc-cceEEEEe---c Q lcl|NC_017984. 118 VVDGVSKS----------------------------VPVDLATANSYSDAAALIATALTLPCTYEST-VKGFVIKS---G 165 (487) Q Consensus 118 ti~g~~~~----------------------------~~i~~s~ats~~~vA~~i~t~l~a~vt~d~~-~~~f~its---~ 165 (487) ...|+... +.++-............++.-......|-+. .+-++++. . T Consensus 157 ~y~g~~~~a~~~~~~~~~~~~A~~l~l~gg~~~v~~yrl~~g~~~~~~~~~~~~~~~~~~tAky~g~~~n~~~v~v~d~~ 236 (587) T protein:vir:96 157 NYKGEGEKATFSVEKDKETQEAKRLVLKVDEKEVKAYELNGGAYSFTNEIITDINELPDFEAKLSPFGDKNLESRKLDEA 236 (587) T ss_pred EecccccceeEeeccCcccceeeeeEEEecCceEEEEEeCCCchhhhhhhhhhhccccceEEEeecccCceeEEEeeccc Confidence 22222111 1111111000011111111100000011110 01111110 0 Q ss_pred ccccceeEE-eccc---chh--------------------------------hhhhhccc-------cceeEecC---cc Q lcl|NC_017984. 166 TSGANSTIS-FATG---DIS--------------------------------DDLKLTQE-------TGAVLNNH---TA 199 (487) Q Consensus 166 t~g~~stit-~atg---d~a--------------------------------~~l~lt~~-------~gA~~~~G---~a 199 (487) +.....+.. |.+. ++. .....+.+ .......| .. T Consensus 237 ~~~~~k~~~~y~~t~~~di~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~aLtGG~dG~~ 316 (587) T protein:vir:96 237 TDVDIKGKAVYVKAVFGDIENQTQYNQYVKFEQLPEQASEPSDVEVHAETESATVTATSKPKAIEPFELTKLSGGTNGEP 316 (587) T ss_pred cccccceEEEeehhhhhhhhhhhccccceeeccccchhhhhhcccccccccceeeeecccccccccccceeeecCCCCCC Confidence 000000000 1100 000 00000000 00112223 33 Q ss_pred cccHHHHHHHHHhcccceeEEEEEeccCChhHHHHHHHHHhc---cCceEEEEEccccccccccchHHHHHHHhCCcceE Q lcl|NC_017984. 
200 ADTPTTGALNALAFSQNFVNITYSEGVFNEDALKDLALWVTS---QNSRFKLYTWGLDPVALGQSGASFGEWAKENTSGV 276 (487) Q Consensus 200 aet~~~al~a~~~~~~~wy~~~~~~~~~~~~~i~a~A~w~~a---~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~t 276 (487) +++..+++++++.+. |+.+.... .+...+..+..|++. ..++...+...... .+.......-..-++.|. T Consensus 317 ~~~y~~~l~ale~~~--~~~i~~~t--~d~ai~~~l~a~vk~~r~~gk~~~aVlg~~~~---~~~~~~~~~a~~~n~e~v 389 (587) T protein:vir:96 317 PTSWSAKLEKFKNEG--GYYIVPLT--DRQSVHSEVATFVKNRSDAGEPMRAIVGGGTS---ETKEKLFGRQAILNNPRV 389 (587) T ss_pred cccHHHHHHHHhhCC--cEEEEecC--CCHHHHHHHHHHHHHHHhCCCeEEEEecCCCC---CCHHHHHHHHhhcCCCcE Confidence 446688899887764 44443322 233445568999954 23444444422211 111122222233356665 Q ss_pred EEecCC-----------Cc----hHHHHHHHHHhcCcCcCCceeeeeeeecCcccc-cCCCHHHHHHHHhCCceEEEEec Q lcl|NC_017984. 277 VPLYGT-----------FD----KAAFFCGVSGSINYQEENGRTTTAFRSQDGLVP-DVTNEADAETLVKNGYSFYGAWA 340 (487) Q Consensus 277 ~~~y~~-----------~~----~~a~~~g~~as~~~~~~~gs~T~~fk~l~Gv~~-~~lt~t~~~al~~~~~n~y~~~~ 340 (487) ..+.+. .+ .++++.|..++.+.+ -++| ||.++++.. ..++.+|++.+.++|++++.... T Consensus 390 i~v~~~~~~~~~~~~~~~~~~~~~aa~vAG~~Ag~~~~---~S~T--~~~~~~~~v~~~~t~~e~~~~i~~G~~~l~~~~ 464 (587) T protein:vir:96 390 ALVANSGKFVMGNGRILQAPAYMVASAVAGLVSGLDIG---ESIT--FKPLFVNSLDKVYESEELDELNENGIITIEFVR 464 (587) T ss_pred EEEecceEEecCCCceeeechhhHHHHHHHHHhcCccc---cCcc--ceeeecccccccCCHHHHHHHHhCCeEEEEEec Confidence 543321 11 234556666666543 3334 445544322 47999999999999999987765 Q ss_pred CCCce-EEEEECCEEc-C---C--ceehHHHHHHHHHHHHHHHHHHHHHHhcCCCCcCHhhHHHHHHHHHHHHHHHHhcC Q lcl|NC_017984. 341 TANDR-FQFAGNGSVT-G---Q--YKWIDNFDFQVFLRTQLQLAYMNMFQAQKTIPYNDQGIATVRAYSQDPIDQGINFG 413 (487) Q Consensus 341 ~~~~~-~~~~~~G~~s-g---g--~~~iD~~~~~dWl~~~iq~~l~~ll~~~~kIPyt~~G~~~l~~~v~~vl~~a~~nG 413 (487) +.... .++. ++.++ . 
+ +..|-.++-.|.+...|+..+-+.+.- | |=++.|...|++.++..|++..+.| T Consensus 465 ~~~~~v~~~v-nsitT~t~~~~~~~~~i~virv~D~i~~di~~~~~~~yiG--k-~nn~~~r~~v~~~i~~~L~~l~~~g 540 (587) T protein:vir:96 465 NRMTTMFRIV-DDVTTFPDKNDPVKSEMALGEANDFLVSELKILLEEQYIG--T-RTINTSASQIKDFVQSYLGRKKRDN 540 (587) T ss_pred CCcEEEEEee-ccceecCCCCCchhhhhhhHHHHHHHHHHHHHHHHhcCCc--c-ccCHHHHHHHHHHHHHHHHHHHhCC Confidence 54322 1222 22222 1 1 224778888888888887776655543 5 5688999999999999999999999 Q ss_pred ccccCcccCccccccccccccCccccceeeeeeEEeccCCCHHHHhhcccCCeEEEEEECCeEEEEEEEEEeeC Q lcl|NC_017984. 414 GIRAGVNLSNAQKFQVNQEAGFDAASQLFTKGWALSVTLPDSQTRVARESFIIKLFYTDGSSMQRLEMTATNVQ 487 (487) Q Consensus 414 ~Ia~Gv~l~~~q~~~~~~~~g~~~~~~~~~~Gy~~~~~~~s~~dra~R~~~~i~~~~~~aGAIh~v~i~gt~vq 487 (487) .|.-. +..+ + .+. ..+|+ --+++.+...-+|++|-++.+.-| T Consensus 541 ~I~~~-~~~d-----------------v-------~v~--~~~D~-----~~v~~~v~Pv~~mekIy~tv~~~~ 582 (587) T protein:vir:96 541 EIQDF-PPED-----------------V-------QVI--IEGNE-----ARISLTIFPIRALKKISVSLVYRQ 582 (587) T ss_pred cccCC-Cccc-----------------e-------EEE--ecCCE-----EEEEEEEEEcccceEEEEEEEEEe Confidence 99531 1100 0 010 01121 137889999999999999999988 No 34 >protein:vir:107310 Length: 581 # NCBI annotation: gp123 # Family: family:all:1196 # MgeID: mge:1550 # MgeName: Catera # Cross-refs: genbank:acc:YP_656131;genbank:gi:109393333;genbank:GeneID:4156791 Probab=97.12 E-value=0.00016 Score=41.38 Aligned_cols=416 Identities=11% Similarity=0.036 Sum_probs=167.0 Q ss_pred CCcCCccccceEEEeeeeecccccccccceeEEecCcceeeeeeccHHHHHHhcCCChHHHHHHHHHhhcccCCccCCCE Q lcl|NC_017984. 1 MQFNSIPASNIAAVYPAVIGGGGNPLGLNTNLFVQDAIYPNYEYFSNTLVGQHYGLESPIYKFATVYFNGFRNATTRPNS 80 (487) Q Consensus 1 ~~~~~ip~s~iV~V~~~~~~~~~~~~~~~~ll~~~~~~~~~~~y~s~~~V~~~fg~~s~ey~aA~~yF~g~~~q~p~P~~ 80 (487) -| .+-+.+.+-.|++.-.+.+++ |. |-|-+.+..+ =.|..+..+-+.|-.-...+ .+.. 
T Consensus 52 ~p-~~~~~~e~q~v~~~~~~t~Gt---Ft-Lsf~G~tT~~-----------I~~~asa~~v~~AL~~L~~i-----~~~~ 110 (581) T protein:vir:10 52 NP-DTGETITTQILALVGEPTGGS---FK-LSLAGEPTGN-----------IPFNATQGQVQSALRALPNV-----EDDE 110 (581) T ss_pred CC-CCCCccceEEEEEEecCCCce---EE-EEeCceeccc-----------ccccCCHHHHHHHHhccCCC-----Ccce Confidence 11 111112222222221122111 11 1111111111 11222333333333222211 1111 Q ss_pred EEEEeeecccceeeEeeccccccchhhheee-eeEEEEEEccceEEEE-------------------------------- Q lcl|NC_017984. 81 LFITKYNLTDVPASLIGGDITSTTLADLKLI-NGTLTIVVDGVSKSVP-------------------------------- 127 (487) Q Consensus 81 l~igr~~~~~~~~~l~g~~~~~~~~~~~~~~-~g~~~iti~g~~~~~~-------------------------------- 127 (487) +-+-.-.-..-...+.|. ...........+ ....++++.+..+... T Consensus 111 v~v~g~~g~~~~VtF~g~-~~~l~~~~~~lt~g~~~~vtV~~~~~g~~~~~~~~s~~gi~~~~~~l~~~~~~~~~~~gsd 189 (581) T protein:vir:10 111 VTVLGDPGGPWTVTFTKA-VAALTKDVTGLTGGDDPDLNIASEQTGVPAMNRALAKKGIKTDTIRVVNPNSGQVYVLGTD 189 (581) T ss_pred EEEECCCCceEEEEEcCC-ccceeeeeceecCCCceeEEEeccccCcccccccccccccccccccccccccCcceecccc Confidence 111000000000000000 000000000000 0112222222221110 Q ss_pred -----eecc---ccCchHHH---HHhhhhhhe-----eeEEE--ecccceEEEEeccccccee-----EEecccc----h Q lcl|NC_017984. 128 -----VDLA---TANSYSDA---AALIATALT-----LPCTY--ESTVKGFVIKSGTSGANST-----ISFATGD----I 180 (487) Q Consensus 128 -----i~~s---~ats~~~v---A~~i~t~l~-----a~vt~--d~~~~~f~its~t~g~~st-----it~atgd----~ 180 (487) .++. .+..-.++ ...+-+++. ..+.| ......=++... .+.... .+..+|. + T Consensus 190 ~~~~~~~~~~~~~~~~~~D~~t~~~~~~g~~~~~~~v~~~~~~~~d~~~~~~v~~~-~~~~~~~~~~~~~~~~g~~~~~~ 268 (581) T protein:vir:10 190 YVVTRVNAGEDGEANTRDDLYTIQRVVDGGHIDPGDIVQLSYRYTDPNYHEVIRFT-DPDDIQDFYGPAFDEAGNVQSEI 268 (581) T ss_pred ceeeecccCccccccccccceeeeeeecccccccceEEEEEEEeecCCcceeEEee-cCcchhhhhhhhhhccCccccch Confidence 0000 00000000 000000000 00011 010000000000 000000 0000000 0 Q ss_pred --hhhhhhccccceeEecCcccc-------cHHHHHHHHHhcccceeEEEEEeccCChhHHHHHHHHHhcc-----CceE Q lcl|NC_017984. 
181 --SDDLKLTQETGAVLNNHTAAD-------TPTTGALNALAFSQNFVNITYSEGVFNEDALKDLALWVTSQ-----NSRF 246 (487) Q Consensus 181 --a~~l~lt~~~gA~~~~G~aae-------t~~~al~a~~~~~~~wy~~~~~~~~~~~~~i~a~A~w~~a~-----~~~~ 246 (487) ...+.++....+....|.+++ ...++|++++++..+. ++..+. .......++..|++.. ..|- T Consensus 269 t~~~~~~~tn~~~~~l~~gvd~~g~tvt~~dy~~Al~ale~~~~~~--ivv~~t-~~~~v~a~l~ahv~~~s~~~~~~ra 345 (581) T protein:vir:10 269 TLCAQLAITNGASTILACAVDPEGDTVTMGDYQNALNKFRDEDEIA--IIVAGT-GAQPIQALVQQHVSAQSNNKYERRA 345 (581) T ss_pred hhhheeeeecccceeEEeeccCCCCccchHHHHHHHHHHhcCCceE--EEEeCC-CCHHHHHHHHHHHHHHHhccCCcEE Confidence 011112333333443343332 3456777776654333 223222 2233334577777542 1121 Q ss_pred -EEEEccccccccccchHHHHHHHhCCcceEEEecCC------C-------c----hHHHHHHHHHhcCcCcCCceeeee Q lcl|NC_017984. 247 -KLYTWGLDPVALGQSGASFGEWAKENTSGVVPLYGT------F-------D----KAAFFCGVSGSINYQEENGRTTTA 308 (487) Q Consensus 247 -~~~~~~~~~~~~~~~~~~~~~l~~~~~~~t~~~y~~------~-------~----~~a~~~g~~as~~~~~~~gs~T~~ 308 (487) .++.-.... .........-..-+..|...++.. . . .++.+.|..++.+ -...+- T Consensus 346 vigV~g~~~~---~~~~~~~~~a~~~n~~Rvvlv~p~~~~~~g~~~~~~v~lp~y~~AA~vAGl~a~~~-----~~~slT 417 (581) T protein:vir:10 346 ILGMDGSVTP---VPSATRIANAQSIKDQRVALISPSSFVYYAPELNREVVLGGQFMAAAVAGKSVSAI-----AAMPLT 417 (581) T ss_pred EEEecCCCCC---ccHHHHHHhhccCCCceEEEEecCceeecCcccCceeccchhhHHHHHHHHhhccc-----cccCcc Confidence 222111100 111111222222355666655421 1 1 2334445555543 334567 Q ss_pred eeecCcccc--cCCCHHHHHHHHhCCceEEEEecCCCceEEEEECCEEc----CCceehHHHHHHHHHHHHHHHHHHH-H Q lcl|NC_017984. 309 FRSQDGLVP--DVTNEADAETLVKNGYSFYGAWATANDRFQFAGNGSVT----GQYKWIDNFDFQVFLRTQLQLAYMN-M 381 (487) Q Consensus 309 fk~l~Gv~~--~~lt~t~~~al~~~~~n~y~~~~~~~~~~~~~~~G~~s----gg~~~iD~~~~~dWl~~~iq~~l~~-l 381 (487) ||.++|+.. ..++.+|++.|..+|++.+....+.. ..+. +|..+ .+++.|-.++-.|.+...+++.+.. . 
T Consensus 418 ~~~i~gi~~l~~~~s~~e~e~ll~~Gv~~l~~~~~~~--v~Iv-~gItT~~s~~~~~~i~~iR~~D~v~~~ir~~~~~~~ 494 (581) T protein:vir:10 418 RKVIRGFSGPAEVQRDGEKSRESSEGLMVIEKTPRNL--VHVR-HGVTTDPTSLHTREWNIIGQQDVMVYRIRDYLDADG 494 (581) T ss_pred cccccccccccccCCHHHHHHHHhCCeEEEEEecCCe--EEEE-eeeecCCCCCcceeeeeehhhhHHHHHHHHHhhhhc Confidence 888888864 46899999999999999998765543 3332 34322 2345689999999999999998863 4 Q ss_pred HHhcCCCCcCHhhHHHHHHHHHHHHHHHHhcCccccCcccCccccccccccccCccccceeeeeeEEeccCCCHHHHhhc Q lcl|NC_017984. 382 FQAQKTIPYNDQGIATVRAYSQDPIDQGINFGGIRAGVNLSNAQKFQVNQEAGFDAASQLFTKGWALSVTLPDSQTRVAR 461 (487) Q Consensus 382 l~~~~kIPyt~~G~~~l~~~v~~vl~~a~~nG~Ia~Gv~l~~~q~~~~~~~~g~~~~~~~~~~Gy~~~~~~~s~~dra~R 461 (487) |.. | |=++.|...|++.+++.|++-.++|.|.....+.+ ++.++.. T Consensus 495 fIG--~-~n~~~~r~~ik~~i~~~L~~l~~~g~I~~~~~~~~------------------------------~~~~~~~- 540 (581) T protein:vir:10 495 LIG--M-PIYDTTIVQVKASAEAALVWLVDNNIIRGYRNLKA------------------------------RQIERQP- 540 (581) T ss_pred CCC--c-ccCHHHHHHHHHHHHHHHHHHHhcCcccCCcccee------------------------------eeeecCC- Confidence 553 4 77889999999999999999999999975321111 1111111 Q ss_pred ccCCeEEEEEECCeEEEEEEEEEeeC Q lcl|NC_017984. 462 ESFIIKLFYTDGSSMQRLEMTATNVQ 487 (487) Q Consensus 462 ~~~~i~~~~~~aGAIh~v~i~gt~vq 487 (487) -.--+.|.+...-+|++|.++.-.+= T Consensus 541 d~v~V~i~v~Pv~~i~~I~vti~~~p 566 (581) T protein:vir:10 541 DVIEVRYEWRPAYPLNYIVVRYSIAP 566 (581) T ss_pred CEEEEEEEEEecccceEEEEEEEEec Confidence 11237778888888888888766665 No 35 >protein:vir:98553 Length: 395 # NCBI annotation: gp21 # Family: family:all:115 # MgeID: mge:1533 # MgeName: PSP3 # Cross-refs: genbank:acc:NP_958076;genbank:gi:41057373;genbank:GeneID:2744224 Probab=97.09 E-value=0.00018 Score=41.15 Aligned_cols=350 Identities=15% Similarity=0.083 Sum_probs=149.2 Q ss_pred HHHHHhhcccCCccCCCEEEEEeeecccceee-----Eeeccccccchhhhe--eeeeEEEEEEccceEEEEeeccccCc Q lcl|NC_017984. 
63 FATVYFNGFRNATTRPNSLFITKYNLTDVPAS-----LIGGDITSTTLADLK--LINGTLTIVVDGVSKSVPVDLATANS 135 (487) Q Consensus 63 aA~~yF~g~~~q~p~P~~l~igr~~~~~~~~~-----l~g~~~~~~~~~~~~--~~~g~~~iti~g~~~~~~i~~s~ats 135 (487) |+ .+.|+ +||=+....+.+.. +.|-... ...+... ..+.-. .+.+.. .....+..... T Consensus 1 m~----------~~~~G-V~v~e~~~g~~~v~~v~tav~~~vgt-a~~~~~~~~p~~~pv--~v~s~~-~~~~~~g~~~t 65 (395) T protein:vir:98 1 MS----------DFHHG-TQVIEINDGTRVISTVATAVVGMVCT-ASDADATLFPLNEPV--LITNVQ-SAIAKAGKKGT 65 (395) T ss_pred CC----------CCCCC-eEEEEcCCCcccccccCcceEEEEee-ccCCCccccccccce--EeechH-HhHhhcccccc Confidence 22 22221 34433322221110 0000000 0000000 000000 000000 00000111111 Q ss_pred hHHHHHhhhhhheeeEEEecccceEEEEecccccceeEEecccchhhhhhhccccceeEecCcccccHHHHHHHHHhccc Q lcl|NC_017984. 136 YSDAAALIATALTLPCTYESTVKGFVIKSGTSGANSTISFATGDISDDLKLTQETGAVLNNHTAADTPTTGALNALAFSQ 215 (487) Q Consensus 136 ~~~vA~~i~t~l~a~vt~d~~~~~f~its~t~g~~stit~atgd~a~~l~lt~~~gA~~~~G~aaet~~~al~a~~~~~~ 215 (487) +......+. +.......+.+...+....- ... +.. +......+.......+.+.++.+... T Consensus 66 l~~al~~~~---------~~~~~~~~vv~~~~~~~~~~---~~~----~a~---~~~~i~g~~~~~~~~Tgl~al~~~~~ 126 (395) T protein:vir:98 66 LAASLQAIA---------DQSKPVTVVVRVEDGTGDDE---EAA----LAQ---TVSNIIGGTDENGKYTGIKALLTAQA 126 (395) T ss_pred hhhHHHHHh---------hccCceEEEeeccccccccc---ccc----ccc---cccccccccccccchhHHHHHhhhhh Confidence 111111111 11111111111111100000 000 000 00000011111112223333333222 Q ss_pred ce---eEEEEEeccCChhHHHHHHHHHhccCceEEEEEccccccccccchHHHHHHHhCCcceEEEecC-----C----- Q lcl|NC_017984. 216 NF---VNITYSEGVFNEDALKDLALWVTSQNSRFKLYTWGLDPVALGQSGASFGEWAKENTSGVVPLYG-----T----- 282 (487) Q Consensus 216 ~w---y~~~~~~~~~~~~~i~a~A~w~~a~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~t~~~y~-----~----- 282 (487) .+ ..+..+..........++...++.- ..+.++-. +. ..+..+.......-+..+..+.|. 
+ T Consensus 127 ~~~~~p~il~ap~~~~~~v~~al~~~~~~~-~~~~~~d~--p~--~~t~~~a~~~~~~~~s~~~~~~~p~~~~~d~~~~~ 201 (395) T protein:vir:98 127 VTGVKPRILGVPGLDTKEVAVALASAAIKL-RAFAYVSA--WG--CKTISEAMEYRKNFSQRELMVIWPDFLAWDTVKNT 201 (395) T ss_pred hhccchhhcccccccccHHHHHHHHHhhhc-CcEEEEEc--CC--CCCHHHHHHHHhccCCceEEEEecceeEecccCCc Confidence 11 1222222222233333444444422 22222111 00 011111111111112222222221 1 Q ss_pred ---CchHHHHHHHHHhcCcCcCCceeeeeeeecCcccc--------cCCCHHHHHHHHhCCceEEEEecCCCceEEEEEC Q lcl|NC_017984. 283 ---FDKAAFFCGVSGSINYQEENGRTTTAFRSQDGLVP--------DVTNEADAETLVKNGYSFYGAWATANDRFQFAGN 351 (487) Q Consensus 283 ---~~~~a~~~g~~as~~~~~~~gs~T~~fk~l~Gv~~--------~~lt~t~~~al~~~~~n~y~~~~~~~~~~~~~~~ 351 (487) ......+.|..+.++..+. -......|.+.||.. ..++.+|++.|.++|+|.+.. +.+ +.+|-. T Consensus 202 ~~~~p~s~~~AG~~a~~d~~~g-~~~spaN~~i~gi~~~~~~~~~~~~~~~~~~~~Ln~~gI~~~~~--~~G--~~~wG~ 276 (395) T protein:vir:98 202 TATAYATARALGLRAYIDQTVG-WHKTLSNVGVQGVTGISASVFWDLQASGTDADLLNEAGVTTLVR--KDG--FRFWGN 276 (395) T ss_pred eeeechHHHHHHHHHHhhcccC-cEeccCCceeecccccceecccccCCCcchHHhhhhcCcEEEEc--CCC--EEEEcc Confidence 1234566777777764331 111223556666532 234688999999999999954 333 556643 Q ss_pred CEEcCC--ceehHHHHHHHHHHHHHHHHHHHHHHhcCCCCcCHhhHHHHHHHHHHHHHHHHhcCccccCcccCccccccc Q lcl|NC_017984. 352 GSVTGQ--YKWIDNFDFQVFLRTQLQLAYMNMFQAQKTIPYNDQGIATVRAYSQDPIDQGINFGGIRAGVNLSNAQKFQV 429 (487) Q Consensus 352 G~~sgg--~~~iD~~~~~dWl~~~iq~~l~~ll~~~~kIPyt~~G~~~l~~~v~~vl~~a~~nG~Ia~Gv~l~~~q~~~~ 429 (487) -+++++ ..||-+.+-.+|+...|+..+...+-. |.|+.=...|+..|+.-|++-+++|.|..+. 
T Consensus 277 rT~s~d~~~~~i~~rR~~~~i~~~i~~~~~~~v~e----~~~~~~~~~i~~~i~~~L~~l~~~g~l~g~~---------- 342 (395) T protein:vir:98 277 RTCSDDPLFLFENYTRTAQVLADTMAEAHMWAVDK----PITATLIRDIVDGINAKFRELKSNGYIVEGK---------- 342 (395) T ss_pred cccCCCcccceeehhhHHHHHHHHHHHHHHHhccC----CCCHHHHHHHHHHHHHHHHHHHhCCceeceE---------- Confidence 334433 237888999999999998888764432 6778778899999999999999999985310 Q ss_pred cccccCccccceeeeeeEEeccCCCHHHHhhcccCCeEEEEEECCeEEEEEEEEEeeC Q lcl|NC_017984. 430 NQEAGFDAASQLFTKGWALSVTLPDSQTRVARESFIIKLFYTDGSSMQRLEMTATNVQ 487 (487) Q Consensus 430 ~~~~g~~~~~~~~~~Gy~~~~~~~s~~dra~R~~~~i~~~~~~aGAIh~v~i~gt~vq 487 (487) + ++..+..+++|+.+.+.. +.+.+..-..+++|+++....= T Consensus 343 -----------v-----~~d~~~nt~~~i~~G~~~-~~i~~~p~~p~e~I~~~~~~~~ 383 (395) T protein:vir:98 343 -----------C-----WFDEESNDKETLKAGKLY-IDYDYTPVPPLESLTLRQRITD 383 (395) T ss_pred -----------E-----EEecCCCCHHHhhCCeEE-EEEEEEecCCcceEEEEEEEch Confidence 0 234567889999888874 8999999999999998865322 No 36 >protein:vir:79141 Length: 391 # NCBI annotation: major tail seath protein gpFI # Family: family:all:115 # MgeID: mge:1863 # MgeName: RSA1 # Cross-refs: genbank:acc:YP_001165276;genbank:gi:145708101;genbank:GeneID:5247152 Probab=96.92 E-value=0.00026 Score=40.27 Aligned_cols=361 Identities=11% Similarity=0.021 Sum_probs=185.1 Q ss_pred CCcCCccccceEEEeeeeecccccccccceeEEecC----cceee---eeeccHHHHHHhcCCChHHHHHHHHHhhcccC Q lcl|NC_017984. 1 MQFNSIPASNIAAVYPAVIGGGGNPLGLNTNLFVQD----AIYPN---YEYFSNTLVGQHYGLESPIYKFATVYFNGFRN 73 (487) Q Consensus 1 ~~~~~ip~s~iV~V~~~~~~~~~~~~~~~~ll~~~~----~~~~~---~~y~s~~~V~~~fg~~s~ey~aA~~yF~g~~~ 73 (487) |+.+-.|==.++.+.....+.........+++-+.. ...|. ...++.++-...||.....+.+-+.+|. 
T Consensus 1 M~~~~~pGv~v~e~~~~~~~i~~~~tav~~~vg~a~~a~~~~~p~n~pv~iss~~~~~~~~g~~gtl~~al~~~~~---- 76 (391) T protein:vir:79 1 MPTDYHHGVRVVELNDGTRPIRTIETAVAGIVCTADDADAATFPLDTPVLLTNPQAYIGKAGDKGTLAHTLDAITD---- 76 (391) T ss_pred CCCCCCCCeEEEECCCCcccccccCCceEEEEeecccccccccccccCEEeccHHHHHHhcCCccccchhhhhhhc---- Confidence 876544544555554444433333333334433321 12221 2478888888888988777777777876 Q ss_pred CccCCCEEEEEeeecccceeeEeeccccccchhhheeeeeEEEEEEccceEEEEeeccccCchHHHHHhhhhhheeeEEE Q lcl|NC_017984. 74 ATTRPNSLFITKYNLTDVPASLIGGDITSTTLADLKLINGTLTIVVDGVSKSVPVDLATANSYSDAAALIATALTLPCTY 153 (487) Q Consensus 74 q~p~P~~l~igr~~~~~~~~~l~g~~~~~~~~~~~~~~~g~~~iti~g~~~~~~i~~s~ats~~~vA~~i~t~l~a~vt~ 153 (487) +.-.+ .++-+...... . .....++......++..+.+.....+ T Consensus 77 ~gg~~--~~vv~~~~~~~-----------~--------------------~~~~~~~~g~~~~~~~~tGl~~l~~~---- 119 (391) T protein:vir:79 77 QTNPL--TVVVRVAGGAS-----------E--------------------AETTSNLIGTTNAAGRYTGMKALLTA---- 119 (391) T ss_pred ccccc--eeeeccccccc-----------c--------------------ccccccccccccchhhhHHHhhhhhh---- Confidence 32111 11111100000 0 00000000000001111111110000 Q ss_pred ecccceEEEEecccccceeEEecccchhhhhhhccccceeEecCcccccHHHHHHHHHhcccceeEEEEEeccCChhHHH Q lcl|NC_017984. 154 ESTVKGFVIKSGTSGANSTISFATGDISDDLKLTQETGAVLNNHTAADTPTTGALNALAFSQNFVNITYSEGVFNEDALK 233 (487) Q Consensus 154 d~~~~~f~its~t~g~~stit~atgd~a~~l~lt~~~gA~~~~G~aaet~~~al~a~~~~~~~wy~~~~~~~~~~~~~i~ 233 (487) +.+ .+ .... .....+........++... ...+..+.+.+.+... ... T Consensus 120 ------~~~----~~------~~p~-------------~l~~p~~~~~~v~~al~~~---~~~~~~~ai~d~p~~~-t~~ 166 (391) T protein:vir:79 120 ------RNR----FG------VAPR-------------ILAVPGLDSLPVGTELVTI---AQKLRAFAYLSAYGCQ-TKE 166 (391) T ss_pred ------hhh----hc------ccch-------------hhcCCccchhHHHHHHHHH---HhhcCcEEEEECCCCC-CHH Confidence 000 00 0000 0001111112223333332 2334445555554322 223 Q ss_pred HHHHHHhccCceEEEEEccccccccccchHHHHHHHhCCcceEEEecCCCchHHHHHHHHHhcCcCcCCceeeeeeeecC Q lcl|NC_017984. 
234 DLALWVTSQNSRFKLYTWGLDPVALGQSGASFGEWAKENTSGVVPLYGTFDKAAFFCGVSGSINYQEENGRTTTAFRSQD 313 (487) Q Consensus 234 a~A~w~~a~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~t~~~y~~~~~~a~~~g~~as~~~~~~~gs~T~~fk~l~ 313 (487) .+-.|.+.-+.++..+.+.--...... ++-.+.+ .....+.|..+.++-++.+ ......|.+. T Consensus 167 ~a~~~~~~~~s~~~a~~~P~~~~~d~~----------~~~~~~~------p~s~~~AG~~a~~D~~~g~-~~spaN~~l~ 229 (391) T protein:vir:79 167 EAVAYRSNFGQREAMVMWPDFVGWDTA----------ANAETTL------WATARAVGLRAKIDNDTGW-HKTLSNVAVG 229 (391) T ss_pred HHHHHHhccCCceeEEecceeeeecCc----------CCceeee------chHHHHHHHHHHhhhcccc-eeccCCceeh Confidence 456666655555544443321111100 0111111 2356677888887733211 1223335666 Q ss_pred cccc--------cCCCHHHHHHHHhCCceEEEEecCCCceEEEEECCEEcCC--ceehHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_017984. 314 GLVP--------DVTNEADAETLVKNGYSFYGAWATANDRFQFAGNGSVTGQ--YKWIDNFDFQVFLRTQLQLAYMNMFQ 383 (487) Q Consensus 314 Gv~~--------~~lt~t~~~al~~~~~n~y~~~~~~~~~~~~~~~G~~sgg--~~~iD~~~~~dWl~~~iq~~l~~ll~ 383 (487) ||.. +..+.+|.+.|..+++|.+.. +.+ +.+|-.-+++++ ..||-+.+-.+|+...|+..+...+- T Consensus 230 gi~~~~~~~~~~~~~~~~~~~~Ln~~~I~t~~~--~~G--~~~wG~rT~~~d~~~~~i~~rR~~~~i~~~i~~~~~~~v~ 305 (391) T protein:vir:79 230 GVTGLSRDVFWDLQDPATDAGYLNANEVTTLVH--RDG--YRFWGSRTCSADPLFAFENYTRTAQVLADTMAEAHMWAND 305 (391) T ss_pred hhhccccccccccccccchhhhhhhcCceEEEC--CCc--EEEEcccccCCCcccceeehhhHHHHHHHHHHHHHHHhcc Confidence 6642 234566888999999999854 333 556644344443 23789999999999999988876443 Q ss_pred hcCCCCcCHhhHHHHHHHHHHHHHHHHhcCccccCcccCccccccccccccCccccceeeeeeEEeccCCCHHHHhhccc Q lcl|NC_017984. 384 AQKTIPYNDQGIATVRAYSQDPIDQGINFGGIRAGVNLSNAQKFQVNQEAGFDAASQLFTKGWALSVTLPDSQTRVARES 463 (487) Q Consensus 384 ~~~kIPyt~~G~~~l~~~v~~vl~~a~~nG~Ia~Gv~l~~~q~~~~~~~~g~~~~~~~~~~Gy~~~~~~~s~~dra~R~~ 463 (487) . |.|+.-...|+..|+.-|+.-+++|.|..+. - +...+..+++|+.+-+. 
T Consensus 306 e----pn~~~~~~~i~~~i~~~l~~l~~~g~l~g~~-------------------------v-~~~~~~nt~~~i~~G~~ 355 (391) T protein:vir:79 306 L----PMTPTLVRDLLEGINAKLRMLTRNGYLLGGA-------------------------A-WFDADANSKDTLKAGQL 355 (391) T ss_pred C----CCCHHHHHHHHHHHHHHHHHHHhCCceeceE-------------------------E-EEecCCCCHHHhhCCEE Confidence 2 7899999999999999999999999996421 0 23456778888877776 Q ss_pred CCeEEEEEECCeEEEEEEEEEeeC Q lcl|NC_017984. 464 FIIKLFYTDGSSMQRLEMTATNVQ 487 (487) Q Consensus 464 ~~i~~~~~~aGAIh~v~i~gt~vq 487 (487) . +.+.+...-.+++++++....= T Consensus 356 ~-~~i~~~p~~p~e~i~~~~~~~~ 378 (391) T protein:vir:79 356 A-IDYDYTPVPPLENLTFRQRITD 378 (391) T ss_pred E-EEEEEEecCCcceEEEEEEEch Confidence 4 8888889999999998854322 No 37 >protein:vir:78206 Length: 390 # NCBI annotation: gp33, phage tail sheath protein # Family: family:all:115 # MgeID: mge:1848 # MgeName: phiE12-2 # Cross-refs: genbank:acc:YP_001111183;genbank:gi:134288694;genbank:GeneID:4960666 Probab=96.55 E-value=0.00052 Score=38.59 Aligned_cols=361 Identities=11% Similarity=0.062 Sum_probs=185.4 Q ss_pred CCcCCccccceEEEeeeeecccccccccceeEEec----Ccceee---eeeccHHHHHHhcCCChHHHHHHHHHhhcccC Q lcl|NC_017984. 1 MQFNSIPASNIAAVYPAVIGGGGNPLGLNTNLFVQ----DAIYPN---YEYFSNTLVGQHYGLESPIYKFATVYFNGFRN 73 (487) Q Consensus 1 ~~~~~ip~s~iV~V~~~~~~~~~~~~~~~~ll~~~----~~~~~~---~~y~s~~~V~~~fg~~s~ey~aA~~yF~g~~~ 73 (487) |+-+=.|==.++.|..+-.+.........+++-+. ....|. ...++..+....||.....+.+...+|. T Consensus 1 M~~~~~~Gv~v~e~~~g~~~i~~~~tav~g~vg~a~~ad~~~~pln~pv~i~s~~~~~~~~g~~gtL~~al~~~~~---- 76 (390) T protein:vir:78 1 MPQDYHHGVRVIEINEGGRPIRSVSTAVLGVVCTAADADASAFPLNTPVLLTNVVAALGKAGKKGTLRRTLDAIGK---- 76 (390) T ss_pred CcccccCCeEEEEcCCCcccccccCcceeEEEEcccCcCccccccccceEeccHHHHHhhcCCCceehhhhhhhcc---- Confidence 88655554455555443333333333333333321 112221 2367778888889998888888888886 Q ss_pred CccCCCEEEEEeeecccceeeEeeccccccchhhheeeeeEEEEEEccceEEEEeeccccCchHHHHHhhhhhheeeEEE Q lcl|NC_017984. 
74 ATTRPNSLFITKYNLTDVPASLIGGDITSTTLADLKLINGTLTIVVDGVSKSVPVDLATANSYSDAAALIATALTLPCTY 153 (487) Q Consensus 74 q~p~P~~l~igr~~~~~~~~~l~g~~~~~~~~~~~~~~~g~~~iti~g~~~~~~i~~s~ats~~~vA~~i~t~l~a~vt~ 153 (487) +.. ...++-|...- .-...+.. .. +++. +-... .++.- .|.. .. T Consensus 77 ~gg--~~~~vv~v~~~---------~~~~~~~~---~~-------ig~~------~~~~~--~tg~~-----al~~--~~ 120 (390) T protein:vir:78 77 QTK--PLTVVVRVAEG---------KDADETTS---NV-------IGTV------TPDGK--YTGIK-----ALLA--AQ 120 (390) T ss_pred ccC--ceEEEEEeccc---------cccccccc---cc-------cccc------ccccc--cchhh-----hhhh--hh Confidence 332 22344332110 00000000 00 0000 00000 00000 0000 00 Q ss_pred ecccceEEEEecccccceeEEecccchhhhhhhccccceeEecCcccccHHHHHHHHHhcccceeEEEEEeccCChhHHH Q lcl|NC_017984. 154 ESTVKGFVIKSGTSGANSTISFATGDISDDLKLTQETGAVLNNHTAADTPTTGALNALAFSQNFVNITYSEGVFNEDALK 233 (487) Q Consensus 154 d~~~~~f~its~t~g~~stit~atgd~a~~l~lt~~~gA~~~~G~aaet~~~al~a~~~~~~~wy~~~~~~~~~~~~~i~ 233 (487) +. .+....+ ....+........++..+ ......+.+.+.+.. +... T Consensus 121 ~~-----------~~~~p~i-------------------l~ap~~~~~~v~~~l~~~---a~~~~~~aivD~p~~-~t~~ 166 (390) T protein:vir:78 121 GA-----------LGVKPRI-------------------LAAPGLDTQPVAAALAAT---AQSLRAMAYVSASGC-KTKE 166 (390) T ss_pred hh-----------hcceehh-------------------hcccccchHHHHHHHHHh---hcccceEEEEecCCC-CCHH Confidence 00 0000000 001111111222222222 223334555554432 2334 Q ss_pred HHHHHHhccCceEEEEEccccccccccchHHHHHHHhCCcceEEEecCCCchHHHHHHHHHhcCcCcCCceeeeeeeecC Q lcl|NC_017984. 234 DLALWVTSQNSRFKLYTWGLDPVALGQSGASFGEWAKENTSGVVPLYGTFDKAAFFCGVSGSINYQEENGRTTTAFRSQD 313 (487) Q Consensus 234 a~A~w~~a~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~t~~~y~~~~~~a~~~g~~as~~~~~~~gs~T~~fk~l~ 313 (487) ++..|.+.-+..+..+.+..-...+.. .+-.+. ....+.+.|..+.++.++- =.....+|.+. 
T Consensus 167 ~a~~~~~~~~s~~~~~~~p~~~~~d~~----------~~~~~~------~p~s~~~Agl~a~~D~~~g-~~~spaN~~l~ 229 (390) T protein:vir:78 167 EAAAYRKQFGQREIMVIWPDWLGWDDT----------TNSTAV------IPAPAIAAGLRAKIDNDIG-WHKTISNVVVN 229 (390) T ss_pred HHHHHhhccCCceEEEEcCceEeeccc----------CCcccc------cchHHHHHHHHHHhhcCCC-cEECcCCceee Confidence 566777765555555444321111110 000111 1345677788888874321 12233456666 Q ss_pred cccc--------cCCCHHHHHHHHhCCceEEEEecCCCceEEEEECCEEcCC--ceehHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_017984. 314 GLVP--------DVTNEADAETLVKNGYSFYGAWATANDRFQFAGNGSVTGQ--YKWIDNFDFQVFLRTQLQLAYMNMFQ 383 (487) Q Consensus 314 Gv~~--------~~lt~t~~~al~~~~~n~y~~~~~~~~~~~~~~~G~~sgg--~~~iD~~~~~dWl~~~iq~~l~~ll~ 383 (487) |+.- +..+..|.+.|..+|+|.+.... + +.+|-.-+++++ ..||-+.+-.+|+.+.|+..+...+= T Consensus 230 gi~~~~~~~~~~~~~~~~~~~~ln~~gi~t~~~~~--G--~~~wG~rT~s~d~~~~~i~~rR~~~~i~~~i~~~~~~~v~ 305 (390) T protein:vir:78 230 GVSGISADVSWDLQDPATDAGYLNEHEVTTLVNRN--G--FRFWGERTCSDDPKFAFENYTRTAQVAGDSIAEAQMPVVD 305 (390) T ss_pred ceeecceecccccccccchhhhhhhcCcEEEEcCC--C--EEEEcccccCCCcccceeehhhHHHHHHHHHHHHHHHhcc Confidence 6653 23456678899999999986533 3 456633333443 23789999999999999888876432 Q ss_pred hcCCCCcCHhhHHHHHHHHHHHHHHHHhcCccccCcccCccccccccccccCccccceeeeeeEEeccCCCHHHHhhccc Q lcl|NC_017984. 384 AQKTIPYNDQGIATVRAYSQDPIDQGINFGGIRAGVNLSNAQKFQVNQEAGFDAASQLFTKGWALSVTLPDSQTRVARES 463 (487) Q Consensus 384 ~~~kIPyt~~G~~~l~~~v~~vl~~a~~nG~Ia~Gv~l~~~q~~~~~~~~g~~~~~~~~~~Gy~~~~~~~s~~dra~R~~ 463 (487) | |.|+.-...|+..++.-|+.-+++|.|..+. + +++.+..+++|+.+-+. T Consensus 306 ---e-~n~~~~~~~i~~~i~~~L~~l~~~g~l~g~~---------------------v-----~~d~~~nt~~~i~~G~~ 355 (390) T protein:vir:78 306 ---G-PLNPSLARDIVESINGWFRQQVANGYLIGGS---------------------A-----WIDPEPNTADILASGKA 355 (390) T ss_pred ---C-CCCHHHHHHHHHHHHHHHHHHHhCCceeeeE---------------------E-----EEccCCCCHHHhhCCeE Confidence 2 7899999999999999999999999885320 0 12345678888877777 Q ss_pred CCeEEEEEECCeEEEEEEEEEeeC Q lcl|NC_017984. 
464 FIIKLFYTDGSSMQRLEMTATNVQ 487 (487) Q Consensus 464 ~~i~~~~~~aGAIh~v~i~gt~vq 487 (487) . +.+.+...-.++++++.....= T Consensus 356 ~-~~v~~~p~~pae~I~~~~~~~~ 378 (390) T protein:vir:78 356 Y-IDYDYTPVPPLENLVLRQRITD 378 (390) T ss_pred E-EEEEEEecCCcceEEEEEEEch Confidence 4 8888889999999888754211 No 38 >protein:vir:103993 Length: 390 # NCBI annotation: phage tail sheath protein # Family: family:all:115 # MgeID: mge:1665 # MgeName: phi52237 # Cross-refs: genbank:acc:YP_293729;genbank:gi:72537699;genbank:GeneID:3608123 Probab=96.55 E-value=0.00052 Score=38.59 Aligned_cols=361 Identities=11% Similarity=0.062 Sum_probs=185.4 Q ss_pred CCcCCccccceEEEeeeeecccccccccceeEEec----Ccceee---eeeccHHHHHHhcCCChHHHHHHHHHhhcccC Q lcl|NC_017984. 1 MQFNSIPASNIAAVYPAVIGGGGNPLGLNTNLFVQ----DAIYPN---YEYFSNTLVGQHYGLESPIYKFATVYFNGFRN 73 (487) Q Consensus 1 ~~~~~ip~s~iV~V~~~~~~~~~~~~~~~~ll~~~----~~~~~~---~~y~s~~~V~~~fg~~s~ey~aA~~yF~g~~~ 73 (487) |+-+=.|==.++.|..+-.+.........+++-+. ....|. ...++..+....||.....+.+...+|. T Consensus 1 M~~~~~~Gv~v~e~~~g~~~i~~~~tav~g~vg~a~~ad~~~~pln~pv~i~s~~~~~~~~g~~gtL~~al~~~~~---- 76 (390) T protein:vir:10 1 MPQDYHHGVRVIEINEGGRPIRSVSTAVLGVVCTAADADASAFPLNTPVLLTNVVAALGKAGKKGTLRRTLDAIGK---- 76 (390) T ss_pred CcccccCCeEEEEcCCCcccccccCcceeEEEEcccCcCccccccccceEeccHHHHHhhcCCCceehhhhhhhcc---- Confidence 88655554455555443333333333333333321 112221 2367778888889998888888888886 Q ss_pred CccCCCEEEEEeeecccceeeEeeccccccchhhheeeeeEEEEEEccceEEEEeeccccCchHHHHHhhhhhheeeEEE Q lcl|NC_017984. 74 ATTRPNSLFITKYNLTDVPASLIGGDITSTTLADLKLINGTLTIVVDGVSKSVPVDLATANSYSDAAALIATALTLPCTY 153 (487) Q Consensus 74 q~p~P~~l~igr~~~~~~~~~l~g~~~~~~~~~~~~~~~g~~~iti~g~~~~~~i~~s~ats~~~vA~~i~t~l~a~vt~ 153 (487) +.. ...++-|...- .-...+.. .. +++. +-... .++.- .|.. .. 
T Consensus 77 ~gg--~~~~vv~v~~~---------~~~~~~~~---~~-------ig~~------~~~~~--~tg~~-----al~~--~~ 120 (390) T protein:vir:10 77 QTK--PLTVVVRVAEG---------KDADETTS---NV-------IGTV------TPDGK--YTGIK-----ALLA--AQ 120 (390) T ss_pred ccC--ceEEEEEeccc---------cccccccc---cc-------cccc------ccccc--cchhh-----hhhh--hh Confidence 332 22344332110 00000000 00 0000 00000 00000 0000 00 Q ss_pred ecccceEEEEecccccceeEEecccchhhhhhhccccceeEecCcccccHHHHHHHHHhcccceeEEEEEeccCChhHHH Q lcl|NC_017984. 154 ESTVKGFVIKSGTSGANSTISFATGDISDDLKLTQETGAVLNNHTAADTPTTGALNALAFSQNFVNITYSEGVFNEDALK 233 (487) Q Consensus 154 d~~~~~f~its~t~g~~stit~atgd~a~~l~lt~~~gA~~~~G~aaet~~~al~a~~~~~~~wy~~~~~~~~~~~~~i~ 233 (487) +. .+....+ ....+........++..+ ......+.+.+.+.. +... T Consensus 121 ~~-----------~~~~p~i-------------------l~ap~~~~~~v~~~l~~~---a~~~~~~aivD~p~~-~t~~ 166 (390) T protein:vir:10 121 GA-----------LGVKPRI-------------------LAAPGLDTQPVAAALAAT---AQSLRAMAYVSASGC-KTKE 166 (390) T ss_pred hh-----------hcceehh-------------------hcccccchHHHHHHHHHh---hcccceEEEEecCCC-CCHH Confidence 00 0000000 001111111222222222 223334555554432 2334 Q ss_pred HHHHHHhccCceEEEEEccccccccccchHHHHHHHhCCcceEEEecCCCchHHHHHHHHHhcCcCcCCceeeeeeeecC Q lcl|NC_017984. 234 DLALWVTSQNSRFKLYTWGLDPVALGQSGASFGEWAKENTSGVVPLYGTFDKAAFFCGVSGSINYQEENGRTTTAFRSQD 313 (487) Q Consensus 234 a~A~w~~a~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~t~~~y~~~~~~a~~~g~~as~~~~~~~gs~T~~fk~l~ 313 (487) ++..|.+.-+..+..+.+..-...+.. .+-.+. ....+.+.|..+.++.++- =.....+|.+. T Consensus 167 ~a~~~~~~~~s~~~~~~~p~~~~~d~~----------~~~~~~------~p~s~~~Agl~a~~D~~~g-~~~spaN~~l~ 229 (390) T protein:vir:10 167 EAAAYRKQFGQREIMVIWPDWLGWDDT----------TNSTAV------IPAPAIAAGLRAKIDNDIG-WHKTISNVVVN 229 (390) T ss_pred HHHHHhhccCCceEEEEcCceEeeccc----------CCcccc------cchHHHHHHHHHHhhcCCC-cEECcCCceee Confidence 566777765555555444321111110 000111 1345677788888874321 12233456666 Q ss_pred cccc--------cCCCHHHHHHHHhCCceEEEEecCCCceEEEEECCEEcCC--ceehHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_017984. 
314 GLVP--------DVTNEADAETLVKNGYSFYGAWATANDRFQFAGNGSVTGQ--YKWIDNFDFQVFLRTQLQLAYMNMFQ 383 (487) Q Consensus 314 Gv~~--------~~lt~t~~~al~~~~~n~y~~~~~~~~~~~~~~~G~~sgg--~~~iD~~~~~dWl~~~iq~~l~~ll~ 383 (487) |+.- +..+..|.+.|..+|+|.+.... + +.+|-.-+++++ ..||-+.+-.+|+.+.|+..+...+= T Consensus 230 gi~~~~~~~~~~~~~~~~~~~~ln~~gi~t~~~~~--G--~~~wG~rT~s~d~~~~~i~~rR~~~~i~~~i~~~~~~~v~ 305 (390) T protein:vir:10 230 GVSGISADVSWDLQDPATDAGYLNEHEVTTLVNRN--G--FRFWGERTCSDDPKFAFENYTRTAQVAGDSIAEAQMPVVD 305 (390) T ss_pred ceeecceecccccccccchhhhhhhcCcEEEEcCC--C--EEEEcccccCCCcccceeehhhHHHHHHHHHHHHHHHhcc Confidence 6653 23456678899999999986533 3 456633333443 23789999999999999888876432 Q ss_pred hcCCCCcCHhhHHHHHHHHHHHHHHHHhcCccccCcccCccccccccccccCccccceeeeeeEEeccCCCHHHHhhccc Q lcl|NC_017984. 384 AQKTIPYNDQGIATVRAYSQDPIDQGINFGGIRAGVNLSNAQKFQVNQEAGFDAASQLFTKGWALSVTLPDSQTRVARES 463 (487) Q Consensus 384 ~~~kIPyt~~G~~~l~~~v~~vl~~a~~nG~Ia~Gv~l~~~q~~~~~~~~g~~~~~~~~~~Gy~~~~~~~s~~dra~R~~ 463 (487) | |.|+.-...|+..++.-|+.-+++|.|..+. + +++.+..+++|+.+-+. T Consensus 306 ---e-~n~~~~~~~i~~~i~~~L~~l~~~g~l~g~~---------------------v-----~~d~~~nt~~~i~~G~~ 355 (390) T protein:vir:10 306 ---G-PLNPSLARDIVESINGWFRQQVANGYLIGGS---------------------A-----WIDPEPNTADILASGKA 355 (390) T ss_pred ---C-CCCHHHHHHHHHHHHHHHHHHHhCCceeeeE---------------------E-----EEccCCCCHHHhhCCeE Confidence 2 7899999999999999999999999885320 0 12345678888877777 Q ss_pred CCeEEEEEECCeEEEEEEEEEeeC Q lcl|NC_017984. 464 FIIKLFYTDGSSMQRLEMTATNVQ 487 (487) Q Consensus 464 ~~i~~~~~~aGAIh~v~i~gt~vq 487 (487) . 
+.+.+...-.++++++.....= T Consensus 356 ~-~~v~~~p~~pae~I~~~~~~~~ 378 (390) T protein:vir:10 356 Y-IDYDYTPVPPLENLVLRQRITD 378 (390) T ss_pred E-EEEEEEecCCcceEEEEEEEch Confidence 4 8888889999999888754211 No 39 >protein:vir:80984 Length: 666 # NCBI annotation: gp18 tail sheath protein # Family: family:all:661 # MgeID: mge:1888 # MgeName: Phi1 # Cross-refs: genbank:acc:YP_001469499;genbank:gi:157311456;genbank:GeneID:5602117 Probab=96.52 E-value=0.00055 Score=38.46 Aligned_cols=431 Identities=12% Similarity=0.056 Sum_probs=161.8 Q ss_pred CC-----cCCccccceEEE----eeeeeccccccc-ccce-eEEecCcceee-eeeccHHHHHHhcCCChHHH---HHHH Q lcl|NC_017984. 1 MQ-----FNSIPASNIAAV----YPAVIGGGGNPL-GLNT-NLFVQDAIYPN-YEYFSNTLVGQHYGLESPIY---KFAT 65 (487) Q Consensus 1 ~~-----~~~ip~s~iV~V----~~~~~~~~~~~~-~~~~-ll~~~~~~~~~-~~y~s~~~V~~~fg~~s~ey---~aA~ 65 (487) .+ ....++-++-.+ ............ .... -++.+...... ....--..-...+....+.. ..+. T Consensus 145 ~~ta~~~~~a~~~~~~~~v~~~~~~~~~~~~~~~~~a~~V~~~~~~~~~~~~~~~~a~~~~t~~~~~~~~~~~~~~a~~a 224 (666) T protein:vir:80 145 IPTGKIIAHAKAIGVYPELDGDWTAEFTSSSGNGSAALSVTKIVTDSGLLLTDLETSRANITNQTFLTKLQKYDMPAVSA 224 (666) T ss_pred cchhhhccccccccccceeeccceeeeccccccceeeeeeeeeecCCccceeeeccccccccccccccccccccchhhhh Confidence 00 000010000000 000000000000 0000 00111111110 00000000000000000000 0011 Q ss_pred HHhhcccCCccCCCEEEEEeeec---ccceeeEeecccccc------chhhheeeee--EEEEEEccceEEEEeeccccC Q lcl|NC_017984. 66 VYFNGFRNATTRPNSLFITKYNL---TDVPASLIGGDITST------TLADLKLING--TLTIVVDGVSKSVPVDLATAN 134 (487) Q Consensus 66 ~yF~g~~~q~p~P~~l~igr~~~---~~~~~~l~g~~~~~~------~~~~~~~~~g--~~~iti~g~~~~~~i~~s~at 134 (487) .+-. . ....+.+.-... ...+..+..-..... ........+. .+.+..+|... .+..++... 
T Consensus 225 ~~~g-~-----~g~~l~v~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~-e~~~~~~~~ 297 (666) T protein:vir:80 225 IYAG-E-----IGNSLEVEILARSAFKNTAPDLTMYPYGGERTAARNLIPYAPQNDNQYAFIVRRDGVVV-ESYVLSTLK 297 (666) T ss_pred hccc-c-----cccceeeeeccccccccccccceeeeccccccccceeeeeccccccceeeEeccCCccc-eeeeccccc Confidence 1110 0 001111100000 000001100000000 0000001111 12222223211 011111111 Q ss_pred c---hHHHHHhhhhhheeeEEEecccceEEEEecc---cccceeEEecccchhhhhhhccccceeEecCcccccHHHHHH Q lcl|NC_017984. 135 S---YSDAAALIATALTLPCTYESTVKGFVIKSGT---SGANSTISFATGDISDDLKLTQETGAVLNNHTAADTPTTGAL 208 (487) Q Consensus 135 s---~~~vA~~i~t~l~a~vt~d~~~~~f~its~t---~g~~stit~atgd~a~~l~lt~~~gA~~~~G~aaet~~~al~ 208 (487) . .......+...+ +....++...... .+....+.+..++-... ..+...+.....| .......+. T Consensus 298 ~~~~~~~~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~-~~~~~~~~~~~~g--~~~~~~~~~ 368 (666) T protein:vir:80 298 GDKDVYGNSIYMDDFF------GRGSSQYIYATAQGWVDGFSGIISLAGGVSANE-ATTGGVGADPFIG--AMMQGWGLF 368 (666) T ss_pred ccccccchhhhhhhhh------ccccceeeeecccccccccceEEEecCCCCccc-ccccccccccccc--cchhhhhhh Confidence 1 000000010000 1111121111111 11112222322211000 0000111100001 001111222 Q ss_pred HHHhcccceeEEEEEecc-----CChhHHHHHHHHHhccCceEEEEEccc----cccccccchHHHHHHHhC-------- Q lcl|NC_017984. 209 NALAFSQNFVNITYSEGV-----FNEDALKDLALWVTSQNSRFKLYTWGL----DPVALGQSGASFGEWAKE-------- 271 (487) Q Consensus 209 a~~~~~~~wy~~~~~~~~-----~~~~~i~a~A~w~~a~~~~~~~~~~~~----~~~~~~~~~~~~~~l~~~-------- 271 (487) ++.+.. ... ++++... .......++...++....++.+..... +.....+..+........ 
T Consensus 369 ~~~~~~-~~~-~l~~p~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~vd~~~~~~~~~~~~~~~~~~~~~~~~~ 446 (666) T protein:vir:80 369 AERESI-HVN-LLIAGACAGEGDAFSTVQKHAVSIGDERQDCLVMVSPPRSTVVNIPVTTAIDNLIAWREGSGNYNENNM 446 (666) T ss_pred hhhccc-ccc-eEeecCcCCcccchHHHHHHHHHHHHhhcceEEEeecceeEEeecCCCCCHHHHHHHHHhcccchhhhc Confidence 222221 121 2222111 112333456666665544443321110 011111112222221111 Q ss_pred --CcceEEEecC-----CC--------chHHHHHHHHHhcCcCcCCceeeeeeeecCccc-----ccCCCHHHHHHHHhC Q lcl|NC_017984. 272 --NTSGVVPLYG-----TF--------DKAAFFCGVSGSINYQEENGRTTTAFRSQDGLV-----PDVTNEADAETLVKN 331 (487) Q Consensus 272 --~~~~t~~~y~-----~~--------~~~a~~~g~~as~~~~~~~gs~T~~fk~l~Gv~-----~~~lt~t~~~al~~~ 331 (487) +..+....|. ++ ..+..+.|.++.++.++.+ ......|.+.||. .-.+++.|.+.|..+ T Consensus 447 ~~~s~~~~l~~p~~~~~d~~~~~~~~~p~sg~~AGl~Ar~D~~~g~-~~sPan~~~~~i~g~~~~~~~~~~~~~~~Ln~~ 525 (666) T protein:vir:80 447 NINTTYAVIDGNYKYQYDKYNDVNRWVPLAADIAGLCARTDAVSQP-WMSPAGYNRGQIMNVVKLAIEPRKAHRDRLYQA 525 (666) T ss_pred ccCcceEEEEcCceEEecccCCceeEechHHHHHHHHHHHhhcCCc-eEccCCeecceeeccccceeecChhHHHhhhhC Confidence 1122222221 11 2345667778877644321 1112245444442 135789999999999 Q ss_pred CceEEEEecCCCceEEEEECCEEcCC---ceehHHHHHHHHHHHHHHHHHHHHHHhcCCCCcCHhhHHHHHHHHHHHHHH Q lcl|NC_017984. 332 GYSFYGAWATANDRFQFAGNGSVTGQ---YKWIDNFDFQVFLRTQLQLAYMNMFQAQKTIPYNDQGIATVRAYSQDPIDQ 408 (487) Q Consensus 332 ~~n~y~~~~~~~~~~~~~~~G~~sgg---~~~iD~~~~~dWl~~~iq~~l~~ll~~~~kIPyt~~G~~~l~~~v~~vl~~ 408 (487) |+|+...+.+.+ +.+|-.-++++. +.||-+.+-.+|+++.|+..++-.+=. 
|.++.=...|+..|+.-|++ T Consensus 526 gIn~i~~~~g~G--~~~wG~rT~~~~~s~~~~i~vRRl~~~i~~si~~~~~~~v~e----pn~~~l~~~i~~~i~~~L~~ 599 (666) T protein:vir:80 526 AINPVIGAGGEG--FILMGDKTATTVPSPFDRINVRRLFNMLKKNIGDSSKYKLFE----NNDNFTRASFRMEVSQYLST 599 (666) T ss_pred CeeEEEEeCCCe--EEEEccccCCCCCcccceeehhhHHHHHHHHHHHHHHHhccC----CCCHHHHHHHHHHHHHHHHH Confidence 999999887755 456544333433 347889999999999988888764432 56777788999999999999 Q ss_pred HHhcCccccCcccCccccccccccccCccccceeeeeeEEe--ccCCCHHHHhhcccCCeEEEEEECCeEEEEEEEEEee Q lcl|NC_017984. 409 GINFGGIRAGVNLSNAQKFQVNQEAGFDAASQLFTKGWALS--VTLPDSQTRVARESFIIKLFYTDGSSMQRLEMTATNV 486 (487) Q Consensus 409 a~~nG~Ia~Gv~l~~~q~~~~~~~~g~~~~~~~~~~Gy~~~--~~~~s~~dra~R~~~~i~~~~~~aGAIh~v~i~gt~v 486 (487) -+++|.|. ||++. .+..+++|+.+.+. -+.+.+...-.+++|+++ ++ T Consensus 600 l~~~gal~----------------------------g~~V~~d~~~nt~~di~~G~~-~~~i~~~P~~Pae~I~~~--~~ 648 (666) T protein:vir:80 600 IRSLGGIY----------------------------DFRVQCDTTNNTPDVIDRNEF-VASMFIKPAKSINYIMLN--FT 648 (666) T ss_pred HHhcCcee----------------------------eeEEEEcCCCCCHHHhhCCeE-EEEEEEEecCCcceEEEE--EE Confidence 99999885 34554 56678999988888 499999999999999998 44 Q ss_pred C Q lcl|NC_017984. 487 Q 487 (487) Q Consensus 487 q 487 (487) | T Consensus 649 ~ 649 (666) T protein:vir:80 649 A 649 (666) T ss_pred E Confidence 6 No 40 >protein:vir:106984 Length: 743 # NCBI annotation: contractile sheath protein gp18 # Family: family:all:661 # MgeID: mge:1459 # MgeName: S-PM2 # Cross-refs: genbank:acc:YP_195136;genbank:gi:58532913;uniprot:Q5GQN6;genbank:GeneID:3260481 Probab=96.33 E-value=0.00074 Score=37.76 Aligned_cols=426 Identities=11% Similarity=0.042 Sum_probs=170.4 Q ss_pred CCcCCc-----cccceEE----Eeeeeeccccc-ccccceeEEecCccee-------ee----eeccHHH-H-H------ Q lcl|NC_017984. 1 MQFNSI-----PASNIAA----VYPAVIGGGGN-PLGLNTNLFVQDAIYP-------NY----EYFSNTL-V-G------ 51 (487) Q Consensus 1 ~~~~~i-----p~s~iV~----V~~~~~~~~~~-~~~~~~ll~~~~~~~~-------~~----~y~s~~~-V-~------ 51 (487) ..+... +.+-... +++.+....+. 
.......+........ +. ....... + . T Consensus 214 ~~~~~~~~~~~~~~~~~~~~~~~tv~v~~~~~~vg~~v~~~~~~~~~~~~~~~~~~v~~~~~~~~t~~~~~~~~~~~g~~ 293 (743) T protein:vir:10 214 VGRTPGTYSNVPASGGTGTGATFNVVVADAGGGVGGSVVVTLANPGTGYNQGETLTIASAATGDGTDILVTVATLSDGTI 293 (743) T ss_pred ccccccceeeEEecccccccccccccccccccccccccccccccccceeeeccccccccccccccccchhheecccccce Confidence 111110 0000000 00000000000 0000000000000000 00 0000000 0 0 Q ss_pred ----HhcCCChHHHHHHHHHhhcccCCccCCCEEEEEeeecccceeeEeeccccccchhhheeee--eEEEEEEccceEE Q lcl|NC_017984. 52 ----QHYGLESPIYKFATVYFNGFRNATTRPNSLFITKYNLTDVPASLIGGDITSTTLADLKLIN--GTLTIVVDGVSKS 125 (487) Q Consensus 52 ----~~fg~~s~ey~aA~~yF~g~~~q~p~P~~l~igr~~~~~~~~~l~g~~~~~~~~~~~~~~~--g~~~iti~g~~~~ 125 (487) ......++.+......+. +..++|..+.++.-. ...+..+ ...+.+ |..+ ...+.... T Consensus 294 ~~~a~~~~~~~~~~~~~~~~~~---~~~~~~~t~~~~~~~------~~~~d~~------~v~v~~~~~~~~-~~~~~v~~ 357 (743) T protein:vir:10 294 AITELKDWYLNTEIGSTGIKLG---DIGPRPGTSQFATDN------GITDDQV------HFAVIDTTGELT-GTANTIVE 357 (743) T ss_pred eeeecccccccchhhccccccc---cccccceeeeccccc------cccccce------EEEEecCcceee-eccCceeE Confidence 000112233333322222 233444444432100 0000000 000111 1100 01111100 Q ss_pred EEeeccccCchH---HHHHhhhhhheeeEEEecccceEEEEeccccc---------ceeEEecccchhhhhhhcccccee Q lcl|NC_017984. 126 VPVDLATANSYS---DAAALIATALTLPCTYESTVKGFVIKSGTSGA---------NSTISFATGDISDDLKLTQETGAV 193 (487) Q Consensus 126 ~~i~~s~ats~~---~vA~~i~t~l~a~vt~d~~~~~f~its~t~g~---------~stit~atgd~a~~l~lt~~~gA~ 193 (487) .-..++..+... .....+...+ ...+++.......+. .....+...+.............. 
T Consensus 358 ~~~~~s~~~~~~~~~~~~~~~~~~~-------~~~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 430 (743) T protein:vir:10 358 RLTYLSKLSDARSEENANIYYKNVI-------NEQSAYLYHGNDAAVQIAASGEAWGQSSDQVLADAGTAFSRTTGYWVN 430 (743) T ss_pred EEeeeecccccccccCcceeeccee-------ccccceeeccCcccceeeeccccCccccceeeeecccccccccceEEE Confidence 001111111100 0000000000 001111111100000 000000000000000000011112 Q ss_pred EecCccccc-----HHHHHHHHHhcccceeEEEEEecc-----CChhHHHHHHHHHhccCceEEEEEcccccc------- Q lcl|NC_017984. 194 LNNHTAADT-----PTTGALNALAFSQNFVNITYSEGV-----FNEDALKDLALWVTSQNSRFKLYTWGLDPV------- 256 (487) Q Consensus 194 ~~~G~aaet-----~~~al~a~~~~~~~wy~~~~~~~~-----~~~~~i~a~A~w~~a~~~~~~~~~~~~~~~------- 256 (487) ...|.+..+ ...++..+.....-...++++... .......++...++...+++.++....... T Consensus 431 ~~gG~d~~~~~~~~~~~~~~~~~~~~~~~~~ll~~p~~~~~~~~~~~v~~a~~~~~~~~~~~~a~~d~p~~~~~~~~~~~ 510 (743) T protein:vir:10 431 LAGGNDDFAYDAGEFGAAMDLFLDTEETEIDFVLMGGSMADEADTKSKATKVIAIAASRKDALAFVSPHKGNQIASTGNV 510 (743) T ss_pred eecCccccccchhHHHHHHHHhhhccccCcceEEecCcccCccchHHHHHHHHHHHHhhCCeEEEEecCCCccccccccc Confidence 223333221 233444444332222233333211 112345566677776555555443221100 Q ss_pred --cc-ccchHHHHHHHhC-CcceEEEec------CC-------CchHHHHHHHHHhcCcCcCCceeeeeeeecCcccc-- Q lcl|NC_017984. 257 --AL-GQSGASFGEWAKE-NTSGVVPLY------GT-------FDKAAFFCGVSGSINYQEENGRTTTAFRSQDGLVP-- 317 (487) Q Consensus 257 --~~-~~~~~~~~~l~~~-~~~~t~~~y------~~-------~~~~a~~~g~~as~~~~~~~gs~T~~fk~l~Gv~~-- 317 (487) .. .+........... +..+..+.| ++ ......++|.++.++.++. 
=.....+|.+.||.- T Consensus 511 ~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~~AGl~a~~D~~~g-~~~span~~~~gi~g~~ 589 (743) T protein:vir:10 511 ALSSAQQKENTIAFFSDLTSTSYAVFDSGYKYVYDRFTDKYRYIPCNGDVAGLCVQTSNQLD-DWYSPAGLNRGGILNAV 589 (743) T ss_pred cccccccchHHHHHHHhccCCeeEEEEccceeeeccccCceeEechhHHHHHHHHHhhccCC-cEEccCCeeeeeeeccc Confidence 00 0111112222111 122222221 11 1234567788887764432 122334455555531 Q ss_pred ---cCCCHHHHHHHHhCCceEEEEecCCCceEEEEECCEEcCC---ceehHHHHHHHHHHHHHHHHHHHHHHhcCCCCcC Q lcl|NC_017984. 318 ---DVTNEADAETLVKNGYSFYGAWATANDRFQFAGNGSVTGQ---YKWIDNFDFQVFLRTQLQLAYMNMFQAQKTIPYN 391 (487) Q Consensus 318 ---~~lt~t~~~al~~~~~n~y~~~~~~~~~~~~~~~G~~sgg---~~~iD~~~~~dWl~~~iq~~l~~ll~~~~kIPyt 391 (487) ..+++.|++.|..+++|+...+.+.+ +.+|-.-++.+. +.||-+.+-.+|++..|+..+...+=. |.| T Consensus 590 ~~~~~~~~~~~~~Ln~~gIn~i~~~~~~G--~~~wG~rT~~s~d~~~~~i~vrR~~~~i~~si~~~~~~~v~e----~n~ 663 (743) T protein:vir:10 590 KLAYNPNKADRDELYQNRINPVVSLRGQG--ITLFGDKTALAAPSAFDRINVRRLFLNLEKRARRLAEGVLFE----QND 663 (743) T ss_pred cceecCChhHHHhHhhCCceEEEEecCCe--EEEEcccccCCCCcccceEeehhhHHHHHHHHHHHHHHhccC----CCC Confidence 34789999999999999999887665 556644433332 348899999999999999988764432 457 Q ss_pred HhhHHHHHHHHHHHHHHHHhcCccccCcccCccccccccccccCccccceeeeeeEEe--ccCCCHHHHhhcccCCeEEE Q lcl|NC_017984. 392 DQGIATVRAYSQDPIDQGINFGGIRAGVNLSNAQKFQVNQEAGFDAASQLFTKGWALS--VTLPDSQTRVARESFIIKLF 469 (487) Q Consensus 392 ~~G~~~l~~~v~~vl~~a~~nG~Ia~Gv~l~~~q~~~~~~~~g~~~~~~~~~~Gy~~~--~~~~s~~dra~R~~~~i~~~ 469 (487) +.=...|+..|+.-|+.-+++|.|. ||.+. .+..+++|+.+.+.. +.+. T Consensus 664 ~~~~~~i~~~i~~fL~~l~~~gal~----------------------------~~~V~~d~~~nt~~~i~~G~~~-~~i~ 714 (743) T protein:vir:10 664 ATTRAGFSSALNSYLSEVQARRGVT----------------------------DYLVICDESNNTPDIIDRNEFV-AEVY 714 (743) T ss_pred HHHHHHHHHHHHHHHHHHHhcCcee----------------------------eeEEEEcCCCCCHHHhhCCeEE-EEEE Confidence 8888899999999999999999773 33443 556788898888884 9999 Q ss_pred EEECCeEEEEEEEEEeeC Q lcl|NC_017984. 
470 YTDGSSMQRLEMTATNVQ 487 (487) Q Consensus 470 ~~~aGAIh~v~i~gt~vq 487 (487) ++..-.+++|.++- +| T Consensus 715 ~~p~~pae~I~~~~--~~ 730 (743) T protein:vir:10 715 VKPTRSINFITITF--TA 730 (743) T ss_pred EEecCCcceEEEEE--EE Confidence 99999999999884 46 No 41 >protein:vir:98263 Length: 664 # NCBI annotation: gp18 tail sheath monomer # Family: family:all:661 # MgeID: mge:1667 # MgeName: RB43 # Cross-refs: genbank:acc:YP_239196;genbank:gi:66391671;genbank:GeneID:3416365 Probab=96.32 E-value=0.00075 Score=37.72 Aligned_cols=433 Identities=9% Similarity=-0.001 Sum_probs=159.9 Q ss_pred CC-cCCccccceEEEeeeee--------cccccccccceeEEecCccee--------ee--------ee----------- Q lcl|NC_017984. 1 MQ-FNSIPASNIAAVYPAVI--------GGGGNPLGLNTNLFVQDAIYP--------NY--------EY----------- 44 (487) Q Consensus 1 ~~-~~~ip~s~iV~V~~~~~--------~~~~~~~~~~~ll~~~~~~~~--------~~--------~y----------- 44 (487) +. -.+.-..+-+.|..+=. ..+....+-..+.+.....-. .. ++ T Consensus 104 ~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~gn~~~v~i~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~s~g~a~a~ 183 (664) T protein:vir:98 104 ASQGSNYNVGDVIKVKYSNTVVEENGKVTQVDSNGKILAVTIPKRKKSLLVLNRSVLTQIFLLVGTTEIVSQSSGVSASI 183 (664) T ss_pred ccCCcccccccceeEeecCcccccceeeccCCCCCceeeEeeccCccceeecccccccccceecccceeeeeecccceee Confidence 00 00011111111111000 000000000000111000000 00 00 Q ss_pred ---ccHHHHHHhcCCChHHHHHHHHHhhcccCCccCCCEEEEEeeecccceeeEeeccccccchhhheeeeeEEEEEE-- Q lcl|NC_017984. 45 ---FSNTLVGQHYGLESPIYKFATVYFNGFRNATTRPNSLFITKYNLTDVPASLIGGDITSTTLADLKLINGTLTIVV-- 119 (487) Q Consensus 45 ---~s~~~V~~~fg~~s~ey~aA~~yF~g~~~q~p~P~~l~igr~~~~~~~~~l~g~~~~~~~~~~~~~~~g~~~iti-- 119 (487) ....+....++.. 
..-...+....+.+...........+...-..|-.+.....+ .........+++ T Consensus 184 ~v~~v~~d~~~~~~~~-------~~a~~~i~~~~~~~~~~~~~~~~~~a~~~G~~Gn~isv~i~s-~~~~~~~~~i~~~~ 255 (664) T protein:vir:98 184 TIDGIESDSGITLLNL-------DIAKETIQGTSFQTLTQKYQIPSVVALYPGELGSTVQVEIIS-KAAYDTGAMISGYP 255 (664) T ss_pred ecccccccceeecccc-------ceeeeccccccceeeeeccccceeeeeecccccceeeeeecc-cccccCcceEeecc Confidence 0000000000000 000000011111110000000000000000000000000000 000000011111 Q ss_pred ccceEEEE----eeccccCchHHHHHhhhhhheeeEEEe-cccceEEEEecccccceeEE--ecc-----cc----hhhh Q lcl|NC_017984. 120 DGVSKSVP----VDLATANSYSDAAALIATALTLPCTYE-STVKGFVIKSGTSGANSTIS--FAT-----GD----ISDD 183 (487) Q Consensus 120 ~g~~~~~~----i~~s~ats~~~vA~~i~t~l~a~vt~d-~~~~~f~its~t~g~~stit--~at-----gd----~a~~ 183 (487) .+.....+ +....+. .......+..+ .....|.+...........+ +.. +. .+.. T Consensus 256 ~~~~~~~~~~~~~~~~~~~---------~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 326 (664) T protein:vir:98 256 SGISVKNSGRSVMTYGPQT---------DNQYAFVVRRGGIVQESFIVSTDKTDKDIYGVNIYMDDFFANGGSQYVFGTS 326 (664) T ss_pred CceecccceeeeeeccccC---------ccceeEEEecCCceeeeEEeecccCcccceeeeeechhheecccceeeeeec Confidence 11100000 0000000 00000000000 01111222111111100000 000 00 0000 Q ss_pred hhhccc-cceeEec-Cc------ccccHHHHHHHHHhcccceeEEEEEeccCCh------hHHHHHHHHHhccCceEEEE Q lcl|NC_017984. 184 LKLTQE-TGAVLNN-HT------AADTPTTGALNALAFSQNFVNITYSEGVFNE------DALKDLALWVTSQNSRFKLY 249 (487) Q Consensus 184 l~lt~~-~gA~~~~-G~------aaet~~~al~a~~~~~~~wy~~~~~~~~~~~------~~i~a~A~w~~a~~~~~~~~ 249 (487) ...... ....... |. ......+.+.++.+...---.++++.....+ ....++...++....+|.+. 
T Consensus 327 ~~~~~~~~~~~~~~gg~~~~~~~g~~~~~tgl~~l~~~~~~~~~ll~~p~~~~~~~~~~~~v~~al~~~a~~~~~~~a~~ 406 (664) T protein:vir:98 327 MNWPKGFSGILEFGGGLSSNDTVGADELMTGWDMFADREALHVPLLIAGGCAGESVEIASTVQKHVISIGDERQDCTVFV 406 (664) T ss_pred ccCCcccceeEeccCccccccccCchhHHHHHHhhhcccccccceEEecCCCCCcHHHHHHHHHHHHHHHHhcCCeEEEE Confidence 000000 0000001 11 1122334555555432211123333221111 12344556666555555433 Q ss_pred Eccccc----cccccchHHHHHHH------------hCC--cceEEEecC------C-------CchHHHHHHHHHhcCc Q lcl|NC_017984. 250 TWGLDP----VALGQSGASFGEWA------------KEN--TSGVVPLYG------T-------FDKAAFFCGVSGSINY 298 (487) Q Consensus 250 ~~~~~~----~~~~~~~~~~~~l~------------~~~--~~~t~~~y~------~-------~~~~a~~~g~~as~~~ 298 (487) ...... ....+..+...... ..+ ..+....|. . -.....++|..|.++. T Consensus 407 d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~l~~p~~~~~d~~~~~~~~~p~sg~~AGl~A~~D~ 486 (664) T protein:vir:98 407 SPPRSLLVNIPLATAVDNIVEWRTGYKISGGTPVDNNLNVSSSYGFLDGNYKYQYDKYNDVNRWVPLAGDIAGLCVYTDS 486 (664) T ss_pred ccccceeccCCccccHHHHHHHhhhccccccchhhhhcCCccceEEEEcCeEEEecccCCceEEechHHHHHHHHHHhhh Confidence 211100 00111111111110 111 122222221 1 1245667888888764 Q ss_pred CcCCceeeeeeeecCccc-----ccCCCHHHHHHHHhCCceEEEEecCCCceEEEEECCEEcCC---ceehHHHHHHHHH Q lcl|NC_017984. 299 QEENGRTTTAFRSQDGLV-----PDVTNEADAETLVKNGYSFYGAWATANDRFQFAGNGSVTGQ---YKWIDNFDFQVFL 370 (487) Q Consensus 299 ~~~~gs~T~~fk~l~Gv~-----~~~lt~t~~~al~~~~~n~y~~~~~~~~~~~~~~~G~~sgg---~~~iD~~~~~dWl 370 (487) ++.+ ......|.+.||. ...+++.|.+.|..+|+|....+-+. +.+.+|-.-++++. 
+.||-+.+-.+|+ T Consensus 487 ~~g~-~~span~~~~~i~g~~~~~~~~~~~~~~~Ln~~gIn~i~~~~~~-~G~~~wG~rT~~~~~s~~~~i~vrR~~~~i 564 (664) T protein:vir:98 487 VANP-WMSPAGYNRGQIRNCIKLAIEPRTAHRDAMYQVQINPVTGFAGG-SGFVLYGDKTLTSVPSPFDRINVRRLFNMI 564 (664) T ss_pred cCCc-EECcCCceeeeeeccccceeecChhhHHHHHhCCCeEEEEeeCC-CcEEEEcccccCCCCcccceEeehhHHHHH Confidence 4321 1222334434432 24578899999999999999887552 22455544334443 3468888999999 Q ss_pred HHHHHHHHHHHHHhcCCCCcCHhhHHHHHHHHHHHHHHHHhcCccccCcccCccccccccccccCccccceeeeeeEEe- Q lcl|NC_017984. 371 RTQLQLAYMNMFQAQKTIPYNDQGIATVRAYSQDPIDQGINFGGIRAGVNLSNAQKFQVNQEAGFDAASQLFTKGWALS- 449 (487) Q Consensus 371 ~~~iq~~l~~ll~~~~kIPyt~~G~~~l~~~v~~vl~~a~~nG~Ia~Gv~l~~~q~~~~~~~~g~~~~~~~~~~Gy~~~- 449 (487) ...|+..++..+-. |.++.=...|+..|+.-|+.-+++|.|. ||++. T Consensus 565 ~~si~~~~~~~v~e----pn~~~l~~~i~~~i~~~L~~l~~~gal~----------------------------g~~V~~ 612 (664) T protein:vir:98 565 KKDIGDNAKYKLFE----NNDDFTRASFRMDTGQYMTNIRALGGCY----------------------------DYRVIC 612 (664) T ss_pred HHHHHHHHHHhhcC----CCCHHHHHHHHHHHHHHHHHHHhcCcee----------------------------eeEEEE Confidence 98888887764432 6678888899999999999999999884 35554 Q ss_pred -ccCCCHHHHhhcccCCeEEEEEECCeEEEEEEEEEeeC Q lcl|NC_017984. 450 -VTLPDSQTRVARESFIIKLFYTDGSSMQRLEMTATNVQ 487 (487) Q Consensus 450 -~~~~s~~dra~R~~~~i~~~~~~aGAIh~v~i~gt~vq 487 (487) .+..+++|+.+.+. 
-+.+.+...-.+++|.++ ++| T Consensus 613 d~~~nt~~~i~~G~~-~~~i~~~p~~pae~I~~~--~~q 648 (664) T protein:vir:98 613 DTTNNTPDVIDRNEF-VATVYVKPPRSINYITLN--FVA 648 (664) T ss_pred cCCCCCHHHhhCCeE-EEEEEEEecCCcceEEEE--EEE Confidence 45678999988888 499999999999999988 556 No 42 >protein:vir:1172 Length: 391 # NCBI annotation: hypothetical protein # Family: family:all:115 # MgeID: mge:24 # MgeName: phi CTX # Cross-refs: genbank:acc:NP_490621;genbank:gi:17313241;genbank:GeneID:927300 Probab=95.74 E-value=0.0015 Score=36.01 Aligned_cols=361 Identities=11% Similarity=0.055 Sum_probs=186.2 Q ss_pred CCcCCccccceEEEeeeeecccccccccceeEEec----Ccceee---eeeccHHHHHHhcCCChHHHHHHHHHhhcccC Q lcl|NC_017984. 1 MQFNSIPASNIAAVYPAVIGGGGNPLGLNTNLFVQ----DAIYPN---YEYFSNTLVGQHYGLESPIYKFATVYFNGFRN 73 (487) Q Consensus 1 ~~~~~ip~s~iV~V~~~~~~~~~~~~~~~~ll~~~----~~~~~~---~~y~s~~~V~~~fg~~s~ey~aA~~yF~g~~~ 73 (487) |+.+-.|=-.+++|.-...+.....-...+++-+. ....|. ...++..+....||.....+.+.+.+|. T Consensus 2 ~~~~~~~GV~v~e~~~~~~~i~~v~tavig~vg~a~~a~~~~~~~~~p~~v~s~~~~~~~~g~~~tl~~al~~~~~---- 77 (391) T protein:vir:11 2 AADQYHHGVRVQEINDGTRPIRTIATAIIGLVATAEDADATAFPLDTPVLITNVQAAIGKAGTSGTLPASLQAIAD---- 77 (391) T ss_pred CCCcCCCcEEEEECCCCcceecccCCceeEEEEecCCCCCccccccccEEEecchhhheecCCCccchhhhhhhhc---- Confidence 44455565566655554444444333333333321 112232 2466777777779998888888888886 Q ss_pred CccCCCEEEEEeeecccceeeEeeccccccchhhheeeeeEEEEEEccceEEEEeeccccCchHHHHHhhhhhheeeEEE Q lcl|NC_017984. 74 ATTRPNSLFITKYNLTDVPASLIGGDITSTTLADLKLINGTLTIVVDGVSKSVPVDLATANSYSDAAALIATALTLPCTY 153 (487) Q Consensus 74 q~p~P~~l~igr~~~~~~~~~l~g~~~~~~~~~~~~~~~g~~~iti~g~~~~~~i~~s~ats~~~vA~~i~t~l~a~vt~ 153 (487) +.. ...++-+... +.-...+.. .+.|. ++ ......+.. ..+.+. 
T Consensus 78 ~~g--~~~~vv~~~~---------~~~~~~t~~-----------d~~g~-----~~--a~~~~~g~~----a~~~~~--- 121 (391) T protein:vir:11 78 QAN--AATVVVRVKP---------GEDEAATNS-----------AVIGG-----VS--ADGKYTGMK----ALLAAK--- 121 (391) T ss_pred ccc--ceeEEeeecc---------cccccccch-----------hhhcc-----cc--cccchhhhh----hhhhhh--- Confidence 321 2234433211 000000000 00000 00 000000000 000000 Q ss_pred ecccceEEEEecccccceeEEecccchhhhhhhccccceeEecCcccccHHHHHHHHHhcccceeEEEEEeccCChhHHH Q lcl|NC_017984. 154 ESTVKGFVIKSGTSGANSTISFATGDISDDLKLTQETGAVLNNHTAADTPTTGALNALAFSQNFVNITYSEGVFNEDALK 233 (487) Q Consensus 154 d~~~~~f~its~t~g~~stit~atgd~a~~l~lt~~~gA~~~~G~aaet~~~al~a~~~~~~~wy~~~~~~~~~~~~~i~ 233 (487) . ...... ......+........++..+. ...-.|.+.+.+... ... T Consensus 122 -------~----------~~~~~p-------------~~~~ap~~~~~~v~~al~~~~---~~~~~~~i~D~p~~~-t~~ 167 (391) T protein:vir:11 122 -------A----------RLGVVP-------------RILGVPGLDTQPVATALIAIA---QQLRAFAYVSASGCK-TKE 167 (391) T ss_pred -------h----------hheecc-------------ccccccccccHHHHHHHHHhh---cccceEEEEEcCCCC-CHH Confidence 0 000000 000111111122333333333 333455555544332 234 Q ss_pred HHHHHHhccCceEEEEEccccccccccchHHHHHHHhCCcceEEEecCCCchHHHHHHHHHhcCcCcCCceeeeeeeecC Q lcl|NC_017984. 234 DLALWVTSQNSRFKLYTWGLDPVALGQSGASFGEWAKENTSGVVPLYGTFDKAAFFCGVSGSINYQEENGRTTTAFRSQD 313 (487) Q Consensus 234 a~A~w~~a~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~t~~~y~~~~~~a~~~g~~as~~~~~~~gs~T~~fk~l~ 313 (487) ++-.|-+.-+..+..+.+..-..... .+.....+ .....+.|..+.++.++- =......|.+. T Consensus 168 ~a~~~r~~~~s~~~~~~~p~~~~~~~------------~~~~~~~~----p~s~~~ag~~a~~d~~~g-~~~span~~l~ 230 (391) T protein:vir:11 168 EATAYRENFAAREAMVIWPDFLTWST------------VVNQTVPA----PAVAQALGLRARIDQEVG-WHKTLSNVAVN 230 (391) T ss_pred HHHHHhhhcCCceEEEEcCcceeccc------------ccCceEEe----chHHHHHHHHHHhhccCC-cEEccCCceee Confidence 45566665555554444332111100 01111111 235556777776653321 11222345666 Q ss_pred cccc--------cCCCHHHHHHHHhCCceEEEEecCCCceEEEEECCEEcCC--ceehHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_017984. 
314 GLVP--------DVTNEADAETLVKNGYSFYGAWATANDRFQFAGNGSVTGQ--YKWIDNFDFQVFLRTQLQLAYMNMFQ 383 (487) Q Consensus 314 Gv~~--------~~lt~t~~~al~~~~~n~y~~~~~~~~~~~~~~~G~~sgg--~~~iD~~~~~dWl~~~iq~~l~~ll~ 383 (487) ||.. ..+++.|.+.|..+|+|.... +.+ +.+|-.-+++++ +.||-+.+-.+|++..|+..+...+= T Consensus 231 gi~~~~~~~~~~~~~~~~~~~~Ln~~gi~~~~~--~~G--~~~wG~rT~~~d~~~~~i~vrR~~~~i~~~~~~~~~~~v~ 306 (391) T protein:vir:11 231 GVTGISADVFWDLQSPSTDANYLNENEVTTLVQ--EGG--FRFWGSRTCSDDPLFAFENYTRTAQVLADTIAEAHMWAVD 306 (391) T ss_pred ceeecccccccccCCCcchhhhhhhcCcEEEEc--CCC--EEEEcccccCCCcccceeehhhHHHHHHHHHHHHHHHhcc Confidence 6643 234678999999999999853 333 556644334443 24788999999999988888775432 Q ss_pred hcCCCCcCHhhHHHHHHHHHHHHHHHHhcCccccCcccCccccccccccccCccccceeeeeeEEeccCCCHHHHhhccc Q lcl|NC_017984. 384 AQKTIPYNDQGIATVRAYSQDPIDQGINFGGIRAGVNLSNAQKFQVNQEAGFDAASQLFTKGWALSVTLPDSQTRVARES 463 (487) Q Consensus 384 ~~~kIPyt~~G~~~l~~~v~~vl~~a~~nG~Ia~Gv~l~~~q~~~~~~~~g~~~~~~~~~~Gy~~~~~~~s~~dra~R~~ 463 (487) . |.++.=...|+..|+.-|+.-+++|.|..+. . +...+..+++|+.+.+. T Consensus 307 e----~n~~~~~~~i~~~i~~~l~~l~~~g~l~g~~---------------------~-----~~~~~~n~~~~i~~G~~ 356 (391) T protein:vir:11 307 K----PMHPSLVRDILEGVNAKFRELKGLGLIIDAQ---------------------A-----WYDPNVNDKDTLKAGKL 356 (391) T ss_pred C----CCCHHHHHHHHHHHHHHHHHHHhccceeceE---------------------E-----EEecCCCCHHHhhCCeE Confidence 2 6788888999999999999999999886320 0 23456778999888877 Q ss_pred CCeEEEEEECCeEEEEEEEEEeeC Q lcl|NC_017984. 464 FIIKLFYTDGSSMQRLEMTATNVQ 487 (487) Q Consensus 464 ~~i~~~~~~aGAIh~v~i~gt~vq 487 (487) . 
+.+.+.....++++.++....= T Consensus 357 ~-~~i~~~p~~p~e~i~~~~~~~~ 379 (391) T protein:vir:11 357 R-ITYDYTPVPPLEDLTFFQKITD 379 (391) T ss_pred E-EEEEEEecCCcceEEEEEEEch Confidence 4 8999999999999999854221 No 43 >protein:vir:104858 Length: 729 # NCBI annotation: T4-like tail sheath protein # Family: family:all:661 # MgeID: mge:1630 # MgeName: P-SSM2 # Cross-refs: genbank:acc:YP_214361;genbank:gi:61806001;genbank:GeneID:3294378 Probab=95.63 E-value=0.0017 Score=35.74 Aligned_cols=423 Identities=12% Similarity=0.055 Sum_probs=159.9 Q ss_pred CCc---CCccccceEEEeeeeeccccc------ccccceeEEecCcceeeeeeccHHHHHHhcCCChHH-HHHHHHHhhc Q lcl|NC_017984. 1 MQF---NSIPASNIAAVYPAVIGGGGN------PLGLNTNLFVQDAIYPNYEYFSNTLVGQHYGLESPI-YKFATVYFNG 70 (487) Q Consensus 1 ~~~---~~ip~s~iV~V~~~~~~~~~~------~~~~~~ll~~~~~~~~~~~y~s~~~V~~~fg~~s~e-y~aA~~yF~g 70 (487) .++ ..+|++... ......... ...++.+-............+ +..++...... -......+.. T Consensus 210 ~~v~~~s~~~~~~~~---~~~~~~~~~~~~~~~~~s~~~~a~~~~~~~~~~~~t----~~~~~~~~~~~~~~~~~~~~~~ 282 (729) T protein:vir:10 210 LEVKVISHISAAGVE---TAVEYQQNGTYTFDNSGSVNVIAAGSSGSGSAKSYT----AQTDWFESQNIVLSNSTLEWDS 282 (729) T ss_pred ccceecccccccccc---eeccccccceeeecccCccceeeeccccccccccce----eeeccccccccccccccccccc Confidence 111 112222111 000000000 000000000000000000000 00001000000 0000011110 Q ss_pred ccCCccCCCEEEEEeeecccceeeEeeccccccchhhheeeeeEEEEEEccceEEEEeeccccCc----h---HHHHHhh Q lcl|NC_017984. 71 FRNATTRPNSLFITKYNLTDVPASLIGGDITSTTLADLKLINGTLTIVVDGVSKSVPVDLATANS----Y---SDAAALI 143 (487) Q Consensus 71 ~~~q~p~P~~l~igr~~~~~~~~~l~g~~~~~~~~~~~~~~~g~~~iti~g~~~~~~i~~s~ats----~---~~vA~~i 143 (487) -.+.|...-.+-- .......+.+ .... ..|.+. ...|........++.+.+ . 
......+ T Consensus 283 ---~~~~~~t~~~~~~-~~~~~d~~~~-----~~~d----~~~~~~-~~~g~vve~~~~~s~~~~~~~~~~~~~~~~~vi 348 (729) T protein:vir:10 283 ---IADAPGTSTYVST-RGGKNDEIHV-----LVID----DKGTIT-GNSGTILEKHLSLSKAKDAEYSVGSSSYWRDFL 348 (729) T ss_pred ---ccccccccccccc-ccccccccce-----eeec----cccccc-cCcccceeeeeeeeeccccccccccccccceee Confidence 0111111100000 0000000000 0000 000000 000000000000000000 0 0000000 Q ss_pred hhhheeeEEEecccceEEEE-----------------------ecccccceeEEeccc-chhhhhhhccccceeEecCcc Q lcl|NC_017984. 144 ATALTLPCTYESTVKGFVIK-----------------------SGTSGANSTISFATG-DISDDLKLTQETGAVLNNHTA 199 (487) Q Consensus 144 ~t~l~a~vt~d~~~~~f~it-----------------------s~t~g~~stit~atg-d~a~~l~lt~~~gA~~~~G~a 199 (487) ... ...+.+.......... .........+..+.+ +.......+...+ . ... T Consensus 349 ~~~-s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~--~--~~~ 423 (729) T protein:vir:10 349 ATN-SKYIFGGGATSGITTTGYSVSSTNTLDTDSGWDQNAEGVNFGASGVATLTLAGGTNYGDKTDLTTSGA--L--SSG 423 (729) T ss_pred ccc-cceeeecccccccccccccccccceeccccccccccccccccccceeEEEeecccccccccccccccc--c--ccc Confidence 000 0000000000000000 000000111111111 1111111100000 0 001 Q ss_pred cccHHHHHHHHHhccc-ceeEEEEEe----ccCChhHHHHHHHHHhccCceEEEEEccccc----------ccc---ccc Q lcl|NC_017984. 200 ADTPTTGALNALAFSQ-NFVNITYSE----GVFNEDALKDLALWVTSQNSRFKLYTWGLDP----------VAL---GQS 261 (487) Q Consensus 200 aet~~~al~a~~~~~~-~wy~~~~~~----~~~~~~~i~a~A~w~~a~~~~~~~~~~~~~~----------~~~---~~~ 261 (487) .+...+++.++.+... ....+.... .........++...++....++.++...... ... ... 
T Consensus 424 ~~~~~~g~~~l~~~~~~~~~~~~~~~~~~~~~~~~~v~~a~~~~~~~~~~~~a~~~~~~~~~i~~~~~~~~~~~~~~~~~ 503 (729) T protein:vir:10 424 VDDIISGYTLFENTEEIEVDFILMGAAHHPKEQSQAVAEKVTAVAEARKDAVAFISPYRQAFLNDTVSGTVTVSNIDQTT 503 (729) T ss_pred hhHHHHHHHHhhcccccccceeeecCCCCCccchHHHHHHHHHHHHhcCCeEEEecccccccccccccccccccccchhh Confidence 1223445555544321 121111111 1122334456666777655444433211000 000 000 Q ss_pred hHHHHHHHhCCcceEEEecCC--------------CchHHHHHHHHHhcCcCcCCceeeeeeeecCcccc-----cCCCH Q lcl|NC_017984. 262 GASFGEWAKENTSGVVPLYGT--------------FDKAAFFCGVSGSINYQEENGRTTTAFRSQDGLVP-----DVTNE 322 (487) Q Consensus 262 ~~~~~~l~~~~~~~t~~~y~~--------------~~~~a~~~g~~as~~~~~~~gs~T~~fk~l~Gv~~-----~~lt~ 322 (487) .+...........+-..+|++ -.....++|.++.++.++. =.....+|.+.||.- ..+++ T Consensus 504 ~~~~~~~~~~~~~~~~~~~~p~~~~~d~~~~~~~~~p~s~~~aGl~a~~d~~~g-~~~span~~~~~i~g~~~~~~~~~~ 582 (729) T protein:vir:10 504 ENVVGFYAPLSSSTYSVFDSGYKYMFDRFNNTFRYVPLNGDIAGTCARTDIEQF-PWFSPAGTARGPILNSVKLVYNPGK 582 (729) T ss_pred HHHHHHHhhccCCceEEEEcCeeEEecccCCceEEechhHHHHHHHHHhhccCC-cEEccCCccccceecccceeeecCh Confidence 111111111111111112211 1234567788888875442 122334444444422 35788 Q ss_pred HHHHHHHhCCceEEEEecCCCceEEEEECCEEcC-C--ceehHHHHHHHHHHHHHHHHHHHHHHhcCCCCcCHhhHHHHH Q lcl|NC_017984. 323 ADAETLVKNGYSFYGAWATANDRFQFAGNGSVTG-Q--YKWIDNFDFQVFLRTQLQLAYMNMFQAQKTIPYNDQGIATVR 399 (487) Q Consensus 323 t~~~al~~~~~n~y~~~~~~~~~~~~~~~G~~sg-g--~~~iD~~~~~dWl~~~iq~~l~~ll~~~~kIPyt~~G~~~l~ 399 (487) .|.+.|..+++|++..+.+.+ +.+|-.-++++ + +.||-+.+-.+|++..|+..+...+=. 
|.++.=...|+ T Consensus 583 ~~~~~Ln~~gIn~i~~~~~~G--~~~wG~rT~~~~d~~~~~i~vrR~~~~i~~si~~~~~~~v~e----pn~~~~~~~i~ 656 (729) T protein:vir:10 583 KQRDILYSNRINPVILSPGAG--IILFGDKTGFGKSSAFDRINVRRLFIYLEDAISAAAKDQLFE----FNDELTRTNFV 656 (729) T ss_pred hhHhhhhhCCceEEEEecCCe--EEEEcceecCCCCcccceeehhhhHHHHHHHHHHHHHHhhcC----CCCHHHHHHHH Confidence 999999999999999987655 44554433333 2 358999999999999999888764432 56788889999 Q ss_pred HHHHHHHHHHHhcCccccCcccCccccccccccccCccccceeeeeeEEe--ccCCCHHHHhhcccCCeEEEEEECCeEE Q lcl|NC_017984. 400 AYSQDPIDQGINFGGIRAGVNLSNAQKFQVNQEAGFDAASQLFTKGWALS--VTLPDSQTRVARESFIIKLFYTDGSSMQ 477 (487) Q Consensus 400 ~~v~~vl~~a~~nG~Ia~Gv~l~~~q~~~~~~~~g~~~~~~~~~~Gy~~~--~~~~s~~dra~R~~~~i~~~~~~aGAIh 477 (487) ..|+.-|+.-+++|.|. ||.+. .+..+++|+.+.+.. +.+.+...-.++ T Consensus 657 ~~i~~~L~~l~~~g~l~----------------------------g~~v~~d~~~nt~~~i~~G~~~-~~v~~~p~~p~e 707 (729) T protein:vir:10 657 NIVEPFLRDVQAKRGIF----------------------------DFVVICDETNNTAAVIDSNEFV-ADIFIKPARSIN 707 (729) T ss_pred HHHHHHHHHHHhcccee----------------------------eeEEEEcCCCCCHHHhhCCeEE-EEEEEEecCCcc Confidence 99999999999999884 34454 456789998888874 999999999999 Q ss_pred EEEEEEEeeC Q lcl|NC_017984. 478 RLEMTATNVQ 487 (487) Q Consensus 478 ~v~i~gt~vq 487 (487) +|.++ ++| T Consensus 708 ~i~~~--~~~ 715 (729) T protein:vir:10 708 FIGLT--FVA 715 (729) T ss_pred EEEEE--EEE Confidence 99996 666 No 44 >protein:vir:79181 Length: 390 # NCBI annotation: gp30, phage tail sheath protein # Family: family:all:115 # MgeID: mge:1866 # MgeName: phiE202 # Cross-refs: genbank:acc:YP_001111061;genbank:gi:134288772;genbank:GeneID:4960700 Probab=95.38 E-value=0.0022 Score=35.17 Aligned_cols=358 Identities=11% Similarity=0.058 Sum_probs=187.1 Q ss_pred CCcCCccccceEEEeeeeecccccccccceeEEec-C---cceee---eeeccHHHHHHhcCCChHHHHHHHHHhhcccC Q lcl|NC_017984. 
1 MQFNSIPASNIAAVYPAVIGGGGNPLGLNTNLFVQ-D---AIYPN---YEYFSNTLVGQHYGLESPIYKFATVYFNGFRN 73 (487) Q Consensus 1 ~~~~~ip~s~iV~V~~~~~~~~~~~~~~~~ll~~~-~---~~~~~---~~y~s~~~V~~~fg~~s~ey~aA~~yF~g~~~ 73 (487) |+-+=.|==.++.+.....+.........+++.+. . ...|. ...++..+....||.+-..+.+...+|. T Consensus 1 M~~~~~~Gv~v~e~~~~~~~i~~~~tav~~~vg~a~dad~~~~p~n~pv~its~~~~~~~~g~~~tL~~al~~~~~---- 76 (390) T protein:vir:79 1 MPQDYHHGVRVIEINEGGRPIRSVSTAVLGVVCTAADADASAFPLNTPVLLTNVVAALGKAGKKGTLRRTLDAIGK---- 76 (390) T ss_pred CccccCCCeEEEEcCCCcccccccCCceeEEEEecCCCCccccccccceEeecHHHHHHhcCCCccchhhhhhhcc---- Confidence 88666665556555544444444333333333321 1 12232 2367777777889988888888888886 Q ss_pred CccCCCEEEEEeeecccceeeEeeccccccchhhheeeeeEEEEEEccceEEEEeeccccCchHHHHHhhhhhheeeEEE Q lcl|NC_017984. 74 ATTRPNSLFITKYNLTDVPASLIGGDITSTTLADLKLINGTLTIVVDGVSKSVPVDLATANSYSDAAALIATALTLPCTY 153 (487) Q Consensus 74 q~p~P~~l~igr~~~~~~~~~l~g~~~~~~~~~~~~~~~g~~~iti~g~~~~~~i~~s~ats~~~vA~~i~t~l~a~vt~ 153 (487) +.. ...++-+...... ...+. . -.+.+ .+ ... ..+.+.+ + T Consensus 77 ~~~--~~~~vv~v~~~~~---------~~~~~-----~-----~~ig~------~~--~~~--------~~tgl~a---l 116 (390) T protein:vir:79 77 QTK--PLTVVVRVAEGKD---------ADETT-----S-----NVIGT------VT--PDG--------KYTGIKA---L 116 (390) T ss_pred ccc--ceEEEEeeccccc---------ccccc-----c-----eeeec------cc--ccc--------cchhhhh---h Confidence 332 2344443321100 00000 0 00000 00 000 0000000 0 Q ss_pred ecccceEEEEecccccceeEEecccchhhhhhhccccceeEecCcccccHHHHHHHHHhcccceeEEEEEeccCChhHHH Q lcl|NC_017984. 154 ESTVKGFVIKSGTSGANSTISFATGDISDDLKLTQETGAVLNNHTAADTPTTGALNALAFSQNFVNITYSEGVFNEDALK 233 (487) Q Consensus 154 d~~~~~f~its~t~g~~stit~atgd~a~~l~lt~~~gA~~~~G~aaet~~~al~a~~~~~~~wy~~~~~~~~~~~~~i~ 233 (487) ... ...++.. .......+........++. .....+..+.+.+.+.... .. 
T Consensus 117 ~~~------------------------~~~~~~~--p~il~ap~~~~~~v~~~l~---~~a~~~~~~ai~D~p~~~t-~~ 166 (390) T protein:vir:79 117 LAA------------------------QGALGVK--PRILAAPGLDTQPVAAALA---ATAQSLRAMAYVSASGCKT-KE 166 (390) T ss_pred hhh------------------------hhhhccc--cccccCCcccchHHHHHHH---HhhhhcceEEEEEccCCCC-HH Confidence 000 0000000 0000111111122223333 3333444566665543322 23 Q ss_pred HHHHHHhccCceEEEEEccccccccccchHHHHHHHhCCcceEEEecCCCchHHHHHHHHHhcCcCcCCceeeee---ee Q lcl|NC_017984. 234 DLALWVTSQNSRFKLYTWGLDPVALGQSGASFGEWAKENTSGVVPLYGTFDKAAFFCGVSGSINYQEENGRTTTA---FR 310 (487) Q Consensus 234 a~A~w~~a~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~t~~~y~~~~~~a~~~g~~as~~~~~~~gs~T~~---fk 310 (487) ++..|.+.-+..+..+.+..-...+.. .+-.+. ....+.+.|.++.++.++ | -|+ .| T Consensus 167 ~a~~~~~~~~s~~~~~~~p~~~~~d~~----------~~~~~~------~p~s~~~Ag~~a~~D~~~--g--~~~spsN~ 226 (390) T protein:vir:79 167 EAAAYRRQFGQREIMVIWPDWLGWDDT----------TNSTAV------IPAPAIAAGLRAKIDNDI--G--WHKTISNV 226 (390) T ss_pred HHHHHhcCCCCceEEEEcCceeecccc----------cCceeE------eehHHHHHHHHHhhhccC--C--cEEccCCc Confidence 456777765555555444321111000 011111 123566778888776322 2 343 66 Q ss_pred ecCcccc--------cCCCHHHHHHHHhCCceEEEEecCCCceEEEEECCEEcCC--ceehHHHHHHHHHHHHHHHHHHH Q lcl|NC_017984. 311 SQDGLVP--------DVTNEADAETLVKNGYSFYGAWATANDRFQFAGNGSVTGQ--YKWIDNFDFQVFLRTQLQLAYMN 380 (487) Q Consensus 311 ~l~Gv~~--------~~lt~t~~~al~~~~~n~y~~~~~~~~~~~~~~~G~~sgg--~~~iD~~~~~dWl~~~iq~~l~~ 380 (487) .+.|+.. +..+..|++.|..+|+|..... .+ +.+|-.-+++++ ..||-+.+-.+|+...|+..+.. 
T Consensus 227 ~i~gi~~~~~~~~~~~~~~~~~a~~Ln~~gi~t~~~~--~G--~~~wG~rT~~~d~~~~~i~vrR~~~~i~~~i~~~~~~ 302 (390) T protein:vir:79 227 VVNGVSGISADVSWDLQDPATDAGYLNEHEVTTLVNR--NG--FRFWGERTCSDDPKFAFENYTRTAQVAADSIAEAQMP 302 (390) T ss_pred eeeccceeeeeccccccccchhhhhhhhcCcEEEEcC--CC--EEEEeccccCCCcccceeeehhhHHHHHHHHHHHHHH Confidence 6666532 2345668888999999998643 23 556543333333 24788999999999999888876 Q ss_pred HHHhcCCCCcCHhhHHHHHHHHHHHHHHHHhcCccccCcccCccccccccccccCccccceeeeeeEEeccCCCHHHHhh Q lcl|NC_017984. 381 MFQAQKTIPYNDQGIATVRAYSQDPIDQGINFGGIRAGVNLSNAQKFQVNQEAGFDAASQLFTKGWALSVTLPDSQTRVA 460 (487) Q Consensus 381 ll~~~~kIPyt~~G~~~l~~~v~~vl~~a~~nG~Ia~Gv~l~~~q~~~~~~~~g~~~~~~~~~~Gy~~~~~~~s~~dra~ 460 (487) .+=. |.++.=...|+..|+.-|+.-+++|.|..+. + ++..+..+++|+.+ T Consensus 303 ~v~e----~~~~~~~~~i~~~i~~~L~~l~~~gal~g~~---------------------v-----~~d~~~nt~~~i~~ 352 (390) T protein:vir:79 303 VVDG----PLNPSLARDIVESINGWFRQQVANGYLIGGS---------------------A-----WIDPEPNTADILAS 352 (390) T ss_pred hccC----CCCHHHHHHHHHHHHHHHHHHHhCCceeeeE---------------------E-----EEecCCCCHHHhhC Confidence 4432 7788889999999999999999999986421 0 23356678888887 Q ss_pred cccCCeEEEEEECCeEEEEEEEEEeeC Q lcl|NC_017984. 461 RESFIIKLFYTDGSSMQRLEMTATNVQ 487 (487) Q Consensus 461 R~~~~i~~~~~~aGAIh~v~i~gt~vq 487 (487) -+.. +.+.+.....++++++.....= T Consensus 353 G~~~-~~i~~~p~~p~e~i~~~~~~~~ 378 (390) T protein:vir:79 353 GKAY-IDYDYTPVPPLENLVLRQRITD 378 (390) T ss_pred CEEE-EEEEEEecCCcceEEEEEEEch Confidence 7774 8888889999999988754321 No 45 >protein:vir:100829 Length: 607 # NCBI annotation: hypothetical protein # Family: family:all:2449 # MgeID: mge:1633 # MgeName: LP65 # Cross-refs: genbank:acc:YP_164738;genbank:gi:56693151;genbank:GeneID:3197462 Probab=95.21 E-value=0.0025 Score=34.82 Aligned_cols=435 Identities=13% Similarity=0.093 Sum_probs=198.2 Q ss_pred CCcCCccccceEEEeeeeeccccc-ccccceeEEec-Ccceee---eeeccHHHHHHhcCCChHHHHHHHHHhhcccCCc Q lcl|NC_017984. 
1 MQFNSIPASNIAAVYPAVIGGGGN-PLGLNTNLFVQ-DAIYPN---YEYFSNTLVGQHYGLESPIYKFATVYFNGFRNAT 75 (487) Q Consensus 1 ~~~~~ip~s~iV~V~~~~~~~~~~-~~~~~~ll~~~-~~~~~~---~~y~s~~~V~~~fg~~s~ey~aA~~yF~g~~~q~ 75 (487) .|++.++--- |-|.+.-.+-... ..+.+.+.|.+ ....|+ ..+++.++.-+-||.+ +-..+..+.|.-.+... T Consensus 15 ~~~~~~~~pg-v~~~~~~~~~~~~~~~~~~~~~~iG~a~~G~~~~~~~~~~~~~a~~~f~~g-~l~~a~~~a~~~~~~~~ 92 (607) T protein:vir:10 15 YPLFYDSRPH-VETNFDDSRLSNTASDSAKNIFMLGSATNGDPTKVYEIRTSQQATKIFGSG-DLVDGIKLAFDPTGNSV 92 (607) T ss_pred hCCCCccCCc-eEEEEecCcCcCCCCCCcceEEEEEEeCCCCCceEEEEcchhHHHHhhcCc-chHHHHHHhhccccCCc Confidence 3333222111 1122222222222 23344444443 333343 4588899999999886 46667778885333334 Q ss_pred cCCCEEEEEeeecccceeeE-----------eeccccccchhh---------h----------eeee--e---------- Q lcl|NC_017984. 76 TRPNSLFITKYNLTDVPASL-----------IGGDITSTTLAD---------L----------KLIN--G---------- 113 (487) Q Consensus 76 p~P~~l~igr~~~~~~~~~l-----------~g~~~~~~~~~~---------~----------~~~~--g---------- 113 (487) ..|+.+|.=|-.. ++++.+ .|........+. + .+.+ | T Consensus 93 ~g~~~~~~~rv~~-~~~a~~~~~~~~~~~~~~~~~~~~i~~~l~~~~~~~~~~~~~~~~d~~~~~~~n~g~~~~i~y~g~ 171 (607) T protein:vir:10 93 TNGGTVYALRVDN-AKQASLVKDGLTFTSSIFGTNANQVSVALDNDVFGVPRITVNYSPDNYERTYTNIGQMFSITYSGK 171 (607) T ss_pred cCCceEEEEeCCC-ccccceecccccccccccccCCCceEEEEEecCCCccceeEEeecccceeeeeeccceeecccCcc Confidence 5677888887532 111111 111000000000 0 0000 0 Q ss_pred ----EEEEEEc--cceEEEEe---------------eccc--cCchHHHHHhhhhhh--eee--------EE-Eecccce Q lcl|NC_017984. 114 ----TLTIVVD--GVSKSVPV---------------DLAT--ANSYSDAAALIATAL--TLP--------CT-YESTVKG 159 (487) Q Consensus 114 ----~~~iti~--g~~~~~~i---------------~~s~--ats~~~vA~~i~t~l--~a~--------vt-~d~~~~~ 159 (487) .+++..| |..+..++ +|.. -.+.+.+...|++.- .++ .. .|...+. 
T Consensus 172 ~~~a~~~v~~~~~g~~~~lt~~~~~~~~~~~~V~~~~l~~~~~~t~~~l~~din~~~~~~A~~~g~~~i~tky~d~~~~~ 251 (607) T protein:vir:10 172 SASAGYTVSHDTDGKAILLTLGSGDSIDKLTNVATFDLTMSKYDTIAKLMQAISATPNFSASVVGSPSVNTSYLDEVTSP 251 (607) T ss_pred cccccceeeecCCCceeEEEecCCCccceeeeeecccccccccchHHHHHHHhhcCCceEEEEecccceeeeccccccce Confidence 0122222 33222211 1111 112222222222221 111 01 2333444 Q ss_pred EEEEeccc------cc-------ceeEE--ecccc--hhh----------hhhhcc-------ccceeEecCcc---ccc Q lcl|NC_017984. 160 FVIKSGTS------GA-------NSTIS--FATGD--ISD----------DLKLTQ-------ETGAVLNNHTA---ADT 202 (487) Q Consensus 160 f~its~t~------g~-------~stit--~atgd--~a~----------~l~lt~-------~~gA~~~~G~a---aet 202 (487) |.|+.... ++ ...+. ...+. +.. ..+.+. -.+.....|.+ +++ T Consensus 252 i~V~~~~~iv~a~~~D~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~a~~~LtGGtdG~~~~t 331 (607) T protein:vir:10 252 VDVKTAPAVVTAKIGDAISKLGYDPYVVVTQTSNNKPIVNGVSAGTGSATASVTTAPESFPANFDTAFLTGGSTGDVPVS 331 (607) T ss_pred eEEEEeeeeechhhhhhhhcccccceEEeeecccchhhhhhhhccccceeeeeeccccccccccceeeeeCCCCCCchhh Confidence 45443210 00 00000 00000 000 000000 01111233343 345 Q ss_pred HHHHHHHHHhcccceeEEEEEeccCChhHHHHHHHHHhc---cCceEEEEEccccccccccchHHHHHHHhCCcceEEEe Q lcl|NC_017984. 203 PTTGALNALAFSQNFVNITYSEGVFNEDALKDLALWVTS---QNSRFKLYTWGLDPVALGQSGASFGEWAKENTSGVVPL 279 (487) Q Consensus 203 ~~~al~a~~~~~~~wy~~~~~~~~~~~~~i~a~A~w~~a---~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~t~~~ 279 (487) ..++++++... +|+.|.... .+...+.++..|++. ..+++..+....... +.......-..-++.|...+ T Consensus 332 y~dal~aLe~~--e~~~i~~~t--~d~ai~~~l~a~vkr~~~~g~~~~aVlg~~~~~---t~~~~~t~a~~~N~ervv~V 404 (607) T protein:vir:10 332 WADKFNGAIGN--NVYYIIPLT--SEENIHAELQAFIDEQHVLGYNYHAFVGGGFAE---PLEQILSRQVNINDSRFGLV 404 (607) T ss_pred HHHHHHHHhhc--CceEEEecC--CCHHHHHHHHHHHHHHHhCCCcEEEEecCCCCC---CHHHHHHHHHhhCCCcEEEE Confidence 57788888775 355554432 234445679999864 334444443222111 11112222223366666543 Q ss_pred cCC----------Cc----hHHHHHHHHHhcCcCcCCceeeeeeeecC--cccccCCCHHHHHHHHhCCceEEEEecCCC Q lcl|NC_017984. 
280 YGT----------FD----KAAFFCGVSGSINYQEENGRTTTAFRSQD--GLVPDVTNEADAETLVKNGYSFYGAWATAN 343 (487) Q Consensus 280 y~~----------~~----~~a~~~g~~as~~~~~~~gs~T~~fk~l~--Gv~~~~lt~t~~~al~~~~~n~y~~~~~~~ 343 (487) ... .+ .++++.|..++.+. +.++| ||.++ ++. ..++.+|++.+..+|+..+....+.+ T Consensus 405 ~~~~~~~~~G~~~~~~~~~~Aa~vAGl~Ag~~~---~~SlT--~k~i~~~~v~-~~lt~~e~e~ai~~Gv~~l~~~~~~~ 478 (607) T protein:vir:10 405 GQSGHVQEGGESVHVPAYLMAAYVGGLSSSLGV---AVPIT--NKKLALVDLD-QNFSGDDLNTLNQNGVIGIEHLVNRN 478 (607) T ss_pred ecCeeEeeCCcceeccHHHHHHHHHHHHhcCcc---ccCcc--cceecccccc-ccCCHHHHHHHHhCCeEEEEEccCcc Confidence 211 12 23444565665543 33444 44444 444 36999999999999998886544332 Q ss_pred --ceEEEEECCEEc----CCce--ehHHHHHHHHHHHHHHHHHHHHHHhcCCCCcCHhhHHHHHHHHHHHHHH--HHhcC Q lcl|NC_017984. 344 --DRFQFAGNGSVT----GQYK--WIDNFDFQVFLRTQLQLAYMNMFQAQKTIPYNDQGIATVRAYSQDPIDQ--GINFG 413 (487) Q Consensus 344 --~~~~~~~~G~~s----gg~~--~iD~~~~~dWl~~~iq~~l~~ll~~~~kIPyt~~G~~~l~~~v~~vl~~--a~~nG 413 (487) ....+. +|..+ .+.. .|-.++-+|.+.+.++..+-+.+. .|++. +.....++..+...|.. -...| T Consensus 479 ~~~~vrIv-~~ItT~t~~~~~~~~~i~viRv~D~i~~dir~~~~~~yI--Gk~nn-d~~~~~vk~~i~~~L~~~~l~~~g 554 (607) T protein:vir:10 479 ATGGYYIV-QDVSTNTVSSSHVDGSLYLGELTDFLFDNLRFVLRDTYI--GSNIR-STSADDIKSTVASYLYSEMNNDDG 554 (607) T ss_pred ccceEEEe-eeeeeccCCCCcchheeehhhhHHHHHHHHHHHHhhcCC--cccCC-cchHHHHHHHHHHHHHHHHHHhcC Confidence 112332 33222 1222 378888999988888877766555 34444 45667788888888743 33457 Q ss_pred ccccCcccCccccccccccccCccccceeeeeeEEeccCCCHHHHhhcccCCeEEEEEECCeEEEEEEEEEeeC Q lcl|NC_017984. 414 GIRAGVNLSNAQKFQVNQEAGFDAASQLFTKGWALSVTLPDSQTRVARESFIIKLFYTDGSSMQRLEMTATNVQ 487 (487) Q Consensus 414 ~Ia~Gv~l~~~q~~~~~~~~g~~~~~~~~~~Gy~~~~~~~s~~dra~R~~~~i~~~~~~aGAIh~v~i~gt~vq 487 (487) .|. +.+. .++ .+. 
.+.| | --+++.+..-.+|+++.++.++.+ T Consensus 555 aI~-df~~-----------------edv-------~v~--~~~D---~--v~v~~~v~Pv~~iekIyvtv~v~~ 596 (607) T protein:vir:10 555 LIV-DFSE-----------------SDI-------VVT--ISGT---V--VYIQFAVAPTQEIKNIVVSGTYSN 596 (607) T ss_pred cee-CCCc-----------------ccc-------EEe--eCCC---E--EEEEEEEEEcccceEEEEEEEEEE Confidence 673 1110 011 011 1112 1 237889999999999999999999 No 46 >protein:vir:98824 Length: 774 # NCBI annotation: putative phage tail sheath protein # Family: family:all:661 # MgeID: mge:1530 # MgeName: Ma-LMM01 # Cross-refs: genbank:acc:YP_851105;genbank:gi:117530262;genbank:GeneID:4484488 Probab=94.90 E-value=0.0032 Score=34.25 Aligned_cols=420 Identities=13% Similarity=0.064 Sum_probs=167.7 Q ss_pred CCcCCccccceEE-Eeeeeecccccc-cc-c--c-eeEEecCcceee---eeeccHHHHHHhc----CCChHHHHHHHHH Q lcl|NC_017984. 1 MQFNSIPASNIAA-VYPAVIGGGGNP-LG-L--N-TNLFVQDAIYPN---YEYFSNTLVGQHY----GLESPIYKFATVY 67 (487) Q Consensus 1 ~~~~~ip~s~iV~-V~~~~~~~~~~~-~~-~--~-~ll~~~~~~~~~---~~y~s~~~V~~~f----g~~s~ey~aA~~y 67 (487) -|++.|-++=.=. |-+...+.+... .+ . . +-++..+..-|+ .--+|..|....| |.-... ..+| T Consensus 274 ~~~~~~~~~v~~~GVYVEEVpSGvrtIeGGV~TSVAAFVG~A~rGPvn~PvlITS~aD~~~~Fg~~~GGl~Ga---ssA~ 350 (774) T protein:vir:98 274 EPFGEITRNVEDNGVVIQLEPALTGSISNRFSFYVTANDNTANRGFTTSPALVTTIPDPAIHFTSFQGGLDGP---RSAF 350 (774) T ss_pred ccccceEEEEecCceEEEEeCCCCccccccccceeeeecccccCCCCCcCEEEeehhHhhhhhccccCCcccc---ceee Confidence 3444443321100 222222222111 11 1 1 112222222222 1255666644444 322221 1122 Q ss_pred hhcccCCccCCCEEEE-----EeeecccceeeEeeccccccchhhheeeeeEEEEEEccceE----------EEEeeccc Q lcl|NC_017984. 68 FNGFRNATTRPNSLFI-----TKYNLTDVPASLIGGDITSTTLADLKLINGTLTIVVDGVSK----------SVPVDLAT 132 (487) Q Consensus 68 F~g~~~q~p~P~~l~i-----gr~~~~~~~~~l~g~~~~~~~~~~~~~~~g~~~iti~g~~~----------~~~i~~s~ 132 (487) +..+.. .-.| .|.| |.|... ....+.-. .++.+.+.+.-... ...+.... 
T Consensus 351 r~~~~~-sG~~-~L~i~A~~pGawGN~-ItV~I~~~------------t~~~~~l~v~~~~~s~f~~~~a~e~~tv~~~~ 415 (774) T protein:vir:98 351 RDFYTF-NGTP-LLRLQAVSEGNWGNQ-VTVSIYPV------------NNSEFRLNVQDLNGSAFNPPLADEVYTVKLGD 415 (774) T ss_pred eeeeee-cccc-eEEEEEeecCcCCCc-eEEEEEec------------CCceeEEEEEecCCccccccccceeEEEeccc Confidence 221111 1112 2222 111110 00111000 00111111100000 00000000 Q ss_pred cCchHHHHHhhh------hhheee---EEEeccc---ceEEEEecccccceeEEecccchhhhhhhcccccee-EecCcc Q lcl|NC_017984. 133 ANSYSDAAALIA------TALTLP---CTYESTV---KGFVIKSGTSGANSTISFATGDISDDLKLTQETGAV-LNNHTA 199 (487) Q Consensus 133 ats~~~vA~~i~------t~l~a~---vt~d~~~---~~f~its~t~g~~stit~atgd~a~~l~lt~~~gA~-~~~G~a 199 (487) ......+.+... ..+... +.+.+.. +.+.... .+.....+. .......... .....+ ...|.+ T Consensus 416 ~~~~~~v~e~~dn~~i~~~~~~~~~~~in~vs~lv~~~~~~~a~-~d~~~~~~~--~~~~~~~~~~-~~~v~v~lagG~D 491 (774) T protein:vir:98 416 TNESGELNALLDSKFIRGFFLPKSIDSINYDAALVRQSPLRLAP-PDESETDVE--NPAHVDFYGP-NVLVDVTLENGYD 491 (774) T ss_pred ccccceeeeeeceeeEeecccccccccccccccccccchhcccc-ccccccccc--ccccccccCC-cceEEEeecCCCC Confidence 000000000000 000000 0000000 0000000 000000000 0000000000 000000 112222 Q ss_pred ccc-HHHHHHHHHh--cccceeEEEEEeccCChhHHHHHHHHHhc----cCceEEEEEccccccccccchHHHHHHHhCC Q lcl|NC_017984. 200 ADT-PTTGALNALA--FSQNFVNITYSEGVFNEDALKDLALWVTS----QNSRFKLYTWGLDPVALGQSGASFGEWAKEN 272 (487) Q Consensus 200 aet-~~~al~a~~~--~~~~wy~~~~~~~~~~~~~i~a~A~w~~a----~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~ 272 (487) ... ..+.+....+ ....++.+.... .......++-.+++. ...|+..+... . 
..+..........-+ T Consensus 492 g~~tt~~~igg~~~~~~~tgi~aLl~a~--~~~~V~~aii~~~e~~~~~~~~r~avid~p--~--g~t~~~Ai~~r~~f~ 565 (774) T protein:vir:98 492 GPPVTNDDYVSIIRTLENQPVHILLVGT--TNVGVQQALITEAERASDSDGLRIAVLAAP--P--RTTPTLAASVTRGFN 565 (774) T ss_pred cccccchheecccccccccceeEEEcCc--cchhhHHHHHHHHHHhhhcccceEEEEECC--C--CCCHHHHHHHHhccC Confidence 111 1111111111 123444443322 222333345455553 23344433211 1 111112222111112 Q ss_pred cceEEEecC-----CC--------chHHHHHHHHHhcCcCcCCceeeeeeeecCccc--------ccCCCHHHHHHHHhC Q lcl|NC_017984. 273 TSGVVPLYG-----TF--------DKAAFFCGVSGSINYQEENGRTTTAFRSQDGLV--------PDVTNEADAETLVKN 331 (487) Q Consensus 273 ~~~t~~~y~-----~~--------~~~a~~~g~~as~~~~~~~gs~T~~fk~l~Gv~--------~~~lt~t~~~al~~~ 331 (487) ..+....|. ++ .++..++|..+.+++... ..+|.+.|+. .+..++.+.+.|..+ T Consensus 566 S~~aal~~Pwvkv~D~~~g~~~~vPpSg~vAGl~ArtDv~kS-----PANk~I~Givg~ai~~~l~~~~t~ae~d~Ln~~ 640 (774) T protein:vir:98 566 STRAVMVAGWFTYAGQPNSSRYGVPGAAVYAGKLAAIDFFVS-----PAARSLVGPLFNIIESDTDNYTSRSNQDIYSAA 640 (774) T ss_pred CceEEEEeCcEEEeccCCCceeecChhHHHHHHHHhcCcccc-----cCCceeecceeccccccccccccchhhhhhccc Confidence 233333332 11 235677888887775443 3355666654 223567888889999 Q ss_pred CceEEE-EecCCCceEEEEECCEEcCCc--eehHHHHHHHHHHHHHHHHHHHHHHhcCCCCcCHhhHHHHHHHHHHHHHH Q lcl|NC_017984. 332 GYSFYG-AWATANDRFQFAGNGSVTGQY--KWIDNFDFQVFLRTQLQLAYMNMFQAQKTIPYNDQGIATVRAYSQDPIDQ 408 (487) Q Consensus 332 ~~n~y~-~~~~~~~~~~~~~~G~~sgg~--~~iD~~~~~dWl~~~iq~~l~~ll~~~~kIPyt~~G~~~l~~~v~~vl~~ 408 (487) ++|..+ .+.+.+ +.+|-.-+++++. .||-+.+-.+|++..|+..+..++ .+ |.|+.....|+..++.-|+. 
T Consensus 641 gIN~i~itt~g~G--~rvWG~RTlssDp~wr~InVRRlfd~Ie~SI~~~~~~~V---fE-PNd~~l~~~I~~sI~~fL~~ 714 (774) T protein:vir:98 641 RLEVLSLDTVDRT--YRFASGVTLSTDPAWERIYLRRVHDVVRQGAHAILRNYV---AM-PNSRLVRNQIAAALNAFMGE 714 (774) T ss_pred ccceeEEEEcCCc--EEEEcccccCCCcccceEeehhhHHHHHHHHHHHHHHhc---cC-CCCHHHHHHHHHHHHHHHHH Confidence 999886 344444 5566444444443 478899999999998888776643 34 78999999999999999999 Q ss_pred HHhcCccccCcccCccccccccccccCccccceeeeeeEEeccCCCHHHHhhcccCCeEEEEEECCeEEEEEEEEEeeC Q lcl|NC_017984. 409 GINFGGIRAGVNLSNAQKFQVNQEAGFDAASQLFTKGWALSVTLPDSQTRVARESFIIKLFYTDGSSMQRLEMTATNVQ 487 (487) Q Consensus 409 a~~nG~Ia~Gv~l~~~q~~~~~~~~g~~~~~~~~~~Gy~~~~~~~s~~dra~R~~~~i~~~~~~aGAIh~v~i~gt~vq 487 (487) -++.|.|.-+. +. +++.+..+++++.+.+.. +.+.+.....+++|.++ +-| T Consensus 715 L~~~GaL~G~~----------------~V---------~~D~etNt~~dI~~G~l~-i~I~vaP~~PAEfIilr--i~q 765 (774) T protein:vir:98 715 LKRNGNIVSFR----------------PA---------IIDGSNNSTAAYFSRELY-VSLQFQPLYSADYIYVT--ISR 765 (774) T ss_pred HHhCCceecce----------------EE---------EEcCCCCCHHHhhCCEEE-EEEEEEecCCcceEEEE--EEE Confidence 99999985321 00 234556788888877664 88888999999999886 455 No 47 >protein:vir:108052 Length: 660 # NCBI annotation: gp18 tail sheath protein # Family: family:all:661 # MgeID: mge:2002 # MgeName: JS98 # Cross-refs: genbank:acc:YP_001595294;genbank:gi:161622600;genbank:GeneID:5783771 Probab=94.88 E-value=0.0033 Score=34.21 Aligned_cols=423 Identities=12% Similarity=0.060 Sum_probs=160.2 Q ss_pred CCc-------CCccccceEEEe-eeeecccccccccceeEEecCccee--eeeeccHH-------HHHHhc--------- Q lcl|NC_017984. 1 MQF-------NSIPASNIAAVY-PAVIGGGGNPLGLNTNLFVQDAIYP--NYEYFSNT-------LVGQHY--------- 54 (487) Q Consensus 1 ~~~-------~~ip~s~iV~V~-~~~~~~~~~~~~~~~ll~~~~~~~~--~~~y~s~~-------~V~~~f--------- 54 (487) +.- +..+++-.-.+. ..+..+.. ..++..+........ ...+.+.. 
.|...+ T Consensus 120 ~~~~~~~~~~~v~~~~a~g~~~~~~~~ta~~--~~~a~~v~~~~~~~~~~~~~~~~~~~~~~~a~sv~~~v~~~~~~~~~ 197 (660) T protein:vir:10 120 YNQTVVESEGRVTSVDTDGKILSVFIPSAKI--IAYARSLNQYPTLGPAWTAEVTSASSGVSGTITVGKIVTDSGILLTE 197 (660) T ss_pred eccccccccccceeeccccceeeeccccccc--cccccccccccccccceeEEEecccCccccceeeeeeeccCcceEEe Confidence 110 000000000000 00000000 000000000000000 00000000 000000 Q ss_pred ------CCChHHHHHHHHHhhcccCCccCCCEEEEEeeecccceeeEeeccccccchhhheeeeeEEEEEEccceEEEEe Q lcl|NC_017984. 55 ------GLESPIYKFATVYFNGFRNATTRPNSLFITKYNLTDVPASLIGGDITSTTLADLKLINGTLTIVVDGVSKSVPV 128 (487) Q Consensus 55 ------g~~s~ey~aA~~yF~g~~~q~p~P~~l~igr~~~~~~~~~l~g~~~~~~~~~~~~~~~g~~~iti~g~~~~~~i 128 (487) +...+++. .+... -..+.....+.|.|...-............ .....+.+...+..+.... T Consensus 198 ~~~~~~~~~~~~~~---~~~~~--~~~~~~~a~~~g~~G~~i~v~i~~~~~~~~-------~~~~~~~~~~~~~~~~~~~ 265 (660) T protein:vir:10 198 AENSEEAITSLEFQ---AALKK--FAMPGVVALYPGEIGSTLEVEIVSKAAYEA-------GSSKMLDVYPGGGTRASIA 265 (660) T ss_pred eeccccccccccce---eeccc--cccceeeeecccccCcceeEEEeeccccCC-------cceeEEeeeeccceeeEEe Confidence 00000000 00000 001111111222221110000000000000 0000011111111111100 Q ss_pred eccc--cCchHHHHHhhhhhheeeEEEe-cccceEEEEec-----------------ccccceeEEecc-cchhhhhhhc Q lcl|NC_017984. 129 DLAT--ANSYSDAAALIATALTLPCTYE-STVKGFVIKSG-----------------TSGANSTISFAT-GDISDDLKLT 187 (487) Q Consensus 129 ~~s~--ats~~~vA~~i~t~l~a~vt~d-~~~~~f~its~-----------------t~g~~stit~at-gd~a~~l~lt 187 (487) +... .....+.-..+ +..+ .....|.+... ..|....+.... +.... T Consensus 266 ~~~~~~~~~~~~~~~~~-------v~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~----- 333 (660) T protein:vir:10 266 KAVFNYGPQTDDQYAII-------VRRDGAIVESVVLSTKEGEKDVYGNNIYLDDYFAKGTSNYIYATSLNWPKG----- 333 (660) T ss_pred eeecccccccccccccc-------cccCCcccceeeeeccccccccccceeeeehhhcCCCccEEEEEeccCCCC----- Confidence 0000 00000000000 0000 00001111100 000000000000 00000 Q ss_pred cccceeEecCcc---c---ccHHHHHHHHHhcccceeEEEEEeccC------ChhHHHHHHHHHhccCceEEEEEccccc Q lcl|NC_017984. 
188 QETGAVLNNHTA---A---DTPTTGALNALAFSQNFVNITYSEGVF------NEDALKDLALWVTSQNSRFKLYTWGLDP 255 (487) Q Consensus 188 ~~~gA~~~~G~a---a---et~~~al~a~~~~~~~wy~~~~~~~~~------~~~~i~a~A~w~~a~~~~~~~~~~~~~~ 255 (487) .........|.+ . .....++.++......-..+++..... ......++...++....++..+-..... T Consensus 334 ~~~~~~l~gg~~~~~~~t~~~~~~~~~~~~~~~~~~~~~l~~p~~~~~~~~~~~~v~~al~~~~~~~~~~~aiid~p~~~ 413 (660) T protein:vir:10 334 FSGIINLSGGISANDKVTAGDLMQGWDLFADREALHINLLIAGAVAGEGDEVASTVQKHVVSIADERQDCLAFISPPKGL 413 (660) T ss_pred cccceeeeccccCccccccchhhhhhhhhhhhhhcccceEEEcCcCCCchhhhHHHHHHHHHHHHhhCCEEEEEecCccc Confidence 000001111111 1 122334444443322222222222111 1123445667777665565544321110 Q ss_pred ---cccccchHHHHHHHh-C--------C--cceEEEecC------C-------CchHHHHHHHHHhcCcCcCCceeeee Q lcl|NC_017984. 256 ---VALGQSGASFGEWAK-E--------N--TSGVVPLYG------T-------FDKAAFFCGVSGSINYQEENGRTTTA 308 (487) Q Consensus 256 ---~~~~~~~~~~~~l~~-~--------~--~~~t~~~y~------~-------~~~~a~~~g~~as~~~~~~~gs~T~~ 308 (487) .........+..+.. . + ..+..+.|. + -.....+.|.++.++.++. =..... T Consensus 414 ~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~sg~~AGl~Ar~D~~~g-~~~sPa 492 (660) T protein:vir:10 414 LVNVPLTRAVDNLIDWRTGAGTFDANNMNISTTYAAIDGNYKYQYDKYNDVNRWVPLAADLAGLCARTDDVSQ-PWMSPA 492 (660) T ss_pred ccccccccCHHHHHHHHhhcccccccccccCcceEEEEcCceEEecccCCceeEechhHHHHHHHHHhhccCC-cEEccC Confidence 001111111112211 1 1 122222221 1 1235677888888764432 111123 Q ss_pred eeecCccc-----ccCCCHHHHHHHHhCCceEEEEecC-CCceEEEEECCEEcCC---ceehHHHHHHHHHHHHHHHHHH Q lcl|NC_017984. 309 FRSQDGLV-----PDVTNEADAETLVKNGYSFYGAWAT-ANDRFQFAGNGSVTGQ---YKWIDNFDFQVFLRTQLQLAYM 379 (487) Q Consensus 309 fk~l~Gv~-----~~~lt~t~~~al~~~~~n~y~~~~~-~~~~~~~~~~G~~sgg---~~~iD~~~~~dWl~~~iq~~l~ 379 (487) +|.+.||. ...+++.|.+.|..+|+|+...+-+ .+ +.+|-.-++++. 
+.||-+.+-.+|+.+.|+...+ T Consensus 493 n~~~~~i~g~~~~~~~~~~~e~~~Ln~~gIn~i~~~~~~~G--~~~wG~rT~~~~~s~~~~i~vrR~~~~i~~si~~~~~ 570 (660) T protein:vir:10 493 GYNRGQILNVLKLAIEPRQAQRDRMYQEAINPVVGFAGGDG--FVLFGDKTATKVPSPMDHINVRRLFNMLKKNIGDASK 570 (660) T ss_pred CeeeceeeccceeeecCChhhHHhHhhCCceEEEEeeCCCc--EEEEcccccCCCCcccceEehhhHHHHHHHHHHHHHH Confidence 55544442 1357899999999999999988754 33 445544334443 3578999999999999998887 Q ss_pred HHHHhcCCCCcCHhhHHHHHHHHHHHHHHHHhcCccccCcccCccccccccccccCccccceeeeeeEEe--ccCCCHHH Q lcl|NC_017984. 380 NMFQAQKTIPYNDQGIATVRAYSQDPIDQGINFGGIRAGVNLSNAQKFQVNQEAGFDAASQLFTKGWALS--VTLPDSQT 457 (487) Q Consensus 380 ~ll~~~~kIPyt~~G~~~l~~~v~~vl~~a~~nG~Ia~Gv~l~~~q~~~~~~~~g~~~~~~~~~~Gy~~~--~~~~s~~d 457 (487) ..+-. |.++.-...|+..|+.-|+.-+++|.|.. |++. .+..+++| T Consensus 571 ~~v~e----pn~~~l~~~i~~~i~~fL~~l~~~gal~g----------------------------~~V~~d~~~nt~~d 618 (660) T protein:vir:10 571 YKLFE----LNDNFTRSSFRMEVSQYLDGIKALGGIYE----------------------------GRVVCDTTVNTPAV 618 (660) T ss_pred HhccC----CCCHHHHHHHHHHHHHHHHHHHhCCceee----------------------------eEEEEcCCCCCHHH Confidence 75433 56888899999999999999999998852 4444 45678999 Q ss_pred HhhcccCCeEEEEEECCeEEEEEEEEEeeC Q lcl|NC_017984. 458 RVARESFIIKLFYTDGSSMQRLEMTATNVQ 487 (487) Q Consensus 458 ra~R~~~~i~~~~~~aGAIh~v~i~gt~vq 487 (487) +.+.+.. +.+.+...-.+++|.++ ++| T Consensus 619 i~~G~~~-~~i~~~P~~pae~I~~~--~~~ 645 (660) T protein:vir:10 619 IDRNEFI-ANIYVKPARSINYITLN--FVA 645 (660) T ss_pred hhCCeEE-EEEEEEecCCccEEEEE--EEE Confidence 9888884 99999999999999998 456 No 48 >protein:vir:7653 Length: 581 # NCBI annotation: gp124 # Family: family:all:1196 # MgeID: mge:148 # MgeName: Bxz1 # Cross-refs: genbank:acc:NP_818197;genbank:gi:29566631;genbank:GeneID:1259809 Probab=94.79 E-value=0.0035 Score=34.06 Aligned_cols=430 Identities=12% Similarity=0.053 Sum_probs=168.6 Q ss_pred CCcCCccccceEEEee--------------eeecccccccccceeEEecCcce--eeeeeccHHHHHHhcCCChHHHHHH Q lcl|NC_017984. 
1 MQFNSIPASNIAAVYP--------------AVIGGGGNPLGLNTNLFVQDAIY--PNYEYFSNTLVGQHYGLESPIYKFA 64 (487) Q Consensus 1 ~~~~~ip~s~iV~V~~--------------~~~~~~~~~~~~~~ll~~~~~~~--~~~~y~s~~~V~~~fg~~s~ey~aA 64 (487) -+|.++--+--+.+.+ ..-|.+.+...---+-++.++.- -...|....-..=.|..+..+-|.| T Consensus 20 ~~~~g~~~~~~~~~~i~g~~~g~~g~~~s~r~~p~~~~~~evq~v~~~~~~t~G~ftLt~~g~tT~~I~~~asa~~v~~A 99 (581) T protein:vir:76 20 APQLGIRSSVPTAVAIFGTAVGYQTYRESIRINPDTGETITTQILALVGEPTGGSFKLSLAGEPTGNIPFNATQGQVQSA 99 (581) T ss_pred ccccCcceeeeeeeeecccccccccccceeeecCCCCCCCceEEEEEeecCCcceEEEEeCceeccccccCCCHHHHHHH Confidence 1122211100001110 11111111100000001111000 0001111111111233333444444 Q ss_pred HHHhhcccCCccCCCEEEEE-----eeeccc---------ceeeEeeccccccchhhh-ee-eeeEEEEEEccceEEEEe Q lcl|NC_017984. 65 TVYFNGFRNATTRPNSLFIT-----KYNLTD---------VPASLIGGDITSTTLADL-KL-INGTLTIVVDGVSKSVPV 128 (487) Q Consensus 65 ~~yF~g~~~q~p~P~~l~ig-----r~~~~~---------~~~~l~g~~~~~~~~~~~-~~-~~g~~~iti~g~~~~~~i 128 (487) -.-... ..+..+-+- .|..+. ....|.|+.-........ +. ....+++..+|...+. + T Consensus 100 L~~L~~-----i~~~~v~vtg~~~~~~~V~F~g~~~~~~~~~~~ltg~~~~~~~V~~~~~G~~~~~~~l~~~g~~~~~-~ 173 (581) T protein:vir:76 100 LRALPN-----VEDDEVTVLGDPGGPWTVTFTKAVAALTKDVTGLTGGDNPDLNIASEQTGVPAMNRALAKKGIKTDT-I 173 (581) T ss_pred HhhccC-----CCCceEEEEcCCCceEEEEEcCCccceeEeeeeeecCCcceeEEEEEecCcCCcCceeeeccccccc-c Confidence 433331 122222221 011111 011222321111111100 00 0112344444432111 0 Q ss_pred eccccCchHHHHHhhhhhhee----------eEEEec--------------ccceEEEEecccc--cceeEEecc----- Q lcl|NC_017984. 129 DLATANSYSDAAALIATALTL----------PCTYES--------------TVKGFVIKSGTSG--ANSTISFAT----- 177 (487) Q Consensus 129 ~~s~ats~~~vA~~i~t~l~a----------~vt~d~--------------~~~~f~its~t~g--~~stit~at----- 177 (487) +..... .+.-..+.+.+++ .-+++. ....+.+....++ ..=.+.|.. 
T Consensus 174 ~~~~s~--~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~g~~~~~~~i~~~~~~~~D~~~~~~v~~~~~~~~~ 251 (581) T protein:vir:76 174 RVVNPN--SGQVYVLGTDYVVTRVNAGEDGEANTRDDLYTIQRVVDGGHIDPGDIVQLSYRYTDPNYHEVIRFTDPDDIQ 251 (581) T ss_pred ceeecC--CcceeeecccccceeeccCcccceeeeeeeeeeEeecccccccceeEEEEEEEeecCCccceEEEecccccc Confidence 000000 0000001111000 000000 0000011100000 000111111 Q ss_pred ----------cc------hhhhhhhccccceeEecCcccc-------cHHHHHHHHHhcccceeEEEEEeccCChhHHHH Q lcl|NC_017984. 178 ----------GD------ISDDLKLTQETGAVLNNHTAAD-------TPTTGALNALAFSQNFVNITYSEGVFNEDALKD 234 (487) Q Consensus 178 ----------gd------~a~~l~lt~~~gA~~~~G~aae-------t~~~al~a~~~~~~~wy~~~~~~~~~~~~~i~a 234 (487) |. ....+.++....+....|.+++ ...++|+++.++..+. ++.... .+.....+ T Consensus 252 ~~~~~~~~~~g~~~~e~~~~~~~~~t~~~~~~l~~gvd~~g~tvt~~dy~~aL~ale~~~~~~--ivvp~t-~~~~i~a~ 328 (581) T protein:vir:76 252 DFYGPAFDEAGNVQSEITLCAQLAITNGASTILACAVDPEGDTVTMGDYQNALNKFRDEDEIA--IIVAGT-GAQPIQAL 328 (581) T ss_pred cceeeehhhcCccccchhhhhheeeccccceEEEeeecCCCCccchHHHHHHHHHHhcCCeEE--EEEecC-CChHHHHH Confidence 10 1111223334444444444432 2446666666654333 223222 22232344 Q ss_pred HHHHHhcc---Cc-eE--EEEEccccccccccchHHHHHHHhCCcceEEEecCCC----------------c-hHHHHHH Q lcl|NC_017984. 235 LALWVTSQ---NS-RF--KLYTWGLDPVALGQSGASFGEWAKENTSGVVPLYGTF----------------D-KAAFFCG 291 (487) Q Consensus 235 ~A~w~~a~---~~-~~--~~~~~~~~~~~~~~~~~~~~~l~~~~~~~t~~~y~~~----------------~-~~a~~~g 291 (487) +..|++.. .+ +. +++.-.... ....+.......-+..|...++... + .++++.| T Consensus 329 l~ahv~~~s~~~~~~ra~igv~g~~~~---~~~~~~~~~a~~~ns~Rvvlv~p~~~~~~g~~~~~~~~lp~~~~AA~vAG 405 (581) T protein:vir:76 329 VQQHVSAQSNNKYERRAILGMDGSVTP---VPSATRIANAQSIKDQRVALISPSSFVYYAPELNREVVLGGQFMAAAVAG 405 (581) T ss_pred HHHHHHHHHhccCCceEEEEeeCCCCC---chHHHHHHhhcccCCCcEEEEEcCceEeccccCCcceecchhhhhhhHHh Confidence 66777543 12 21 222111111 1111112222223566666554210 1 1223334 Q ss_pred HHHhcCcCcCCceeeeeeeecCcccc--cCCCHHHHHHHHhCCceEEEEecCCCceEEEEECCEEc----CCceehHHHH Q lcl|NC_017984. 
292 VSGSINYQEENGRTTTAFRSQDGLVP--DVTNEADAETLVKNGYSFYGAWATANDRFQFAGNGSVT----GQYKWIDNFD 365 (487) Q Consensus 292 ~~as~~~~~~~gs~T~~fk~l~Gv~~--~~lt~t~~~al~~~~~n~y~~~~~~~~~~~~~~~G~~s----gg~~~iD~~~ 365 (487) ..++.++ ...+-||.++|+.. ..++.+|++.|..+|++.+....+.. +.+. +|..+ ..++.|-.++ T Consensus 406 ~~a~~~~-----~~slT~~~i~g~~~~~~~~s~~e~e~ll~~Gv~~l~~~~~~~--v~Iv-~gItT~~s~~~~k~i~viR 477 (581) T protein:vir:76 406 KSVSAIA-----AMPLTRKVIRGFSGPAEVQRDGEKSRESSEGLMVIEKTPRNL--VHVR-HGVTTDPTSLHTREWNIIG 477 (581) T ss_pred hhhcccc-----ccCcccccccccccccccCCHHHHHHHHhCCeEEEEEecCCe--EEEE-EeeecCCCCCccceeeehh Confidence 4444433 23456888888864 46899999999999999998765543 3332 34322 2335688999 Q ss_pred HHHHHHHHHHHHHHH-HHHhcCCCCcCHhhHHHHHHHHHHHHHHHHhcCccccCcccCccccccccccccCccccceeee Q lcl|NC_017984. 366 FQVFLRTQLQLAYMN-MFQAQKTIPYNDQGIATVRAYSQDPIDQGINFGGIRAGVNLSNAQKFQVNQEAGFDAASQLFTK 444 (487) Q Consensus 366 ~~dWl~~~iq~~l~~-ll~~~~kIPyt~~G~~~l~~~v~~vl~~a~~nG~Ia~Gv~l~~~q~~~~~~~~g~~~~~~~~~~ 444 (487) -.|.+...+++.+.. .|.. | |=++.|...|++.+++.|++..++|.|.......+ T Consensus 478 ~~D~v~~~vr~~~~~~~fiG--~-~n~~~~r~~ik~~i~~~L~~l~~~g~I~g~~~~~~--------------------- 533 (581) T protein:vir:76 478 QQDVMVYRIRDYLDADGLIG--M-PIYDTTIVQVKASAEAALVWLVDNNIIRGYRNLKA--------------------- 533 (581) T ss_pred hhHHHHHHHHHHHhhhcCCC--c-ccChHHHHHHHHHHHHHHHHHHhcCcccCccccee--------------------- Confidence 999999999988854 3543 3 77889999999999999999999999964211110 Q ss_pred eeEEeccCCCHHHHh-hcccCCeEEEEEECCeEEEEEEEEEeeC Q lcl|NC_017984. 
445 GWALSVTLPDSQTRV-ARESFIIKLFYTDGSSMQRLEMTATNVQ 487 (487) Q Consensus 445 Gy~~~~~~~s~~dra-~R~~~~i~~~~~~aGAIh~v~i~gt~vq 487 (487) +..+++ .| --+.+.+...-+|.+|.++.-.+= T Consensus 534 ---------~~~~~~~d~--v~V~i~v~Pv~~ie~I~vt~~~~p 566 (581) T protein:vir:76 534 ---------RQIERQPDV--IEVRYEWRPAYPLNYIVVRYSIAP 566 (581) T ss_pred ---------eEEecCCCE--EEEEEEEEecccceEEEEEEEEee Confidence 000111 11 135667777777777777766655 No 49 >protein:vir:103456 Length: 659 # NCBI annotation: tail sheath monomer # Family: family:all:661 # MgeID: mge:1542 # MgeName: RB32 # Cross-refs: genbank:acc:YP_803108;genbank:gi:116326388;genbank:GeneID:4405485 Probab=94.63 E-value=0.0039 Score=33.79 Aligned_cols=429 Identities=10% Similarity=0.000 Sum_probs=158.7 Q ss_pred CCcCCccccceEEE-------eee------eecccccccccc--ee--EEecCc----ceeee--------eeccHHHHH Q lcl|NC_017984. 1 MQFNSIPASNIAAV-------YPA------VIGGGGNPLGLN--TN--LFVQDA----IYPNY--------EYFSNTLVG 51 (487) Q Consensus 1 ~~~~~ip~s~iV~V-------~~~------~~~~~~~~~~~~--~l--l~~~~~----~~~~~--------~y~s~~~V~ 51 (487) .+-..+.+. ++.. .+. ........+.+. .. +..... .+... .+....... T Consensus 124 ~~g~~~~v~-~vd~~~~~~~~~i~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~ 202 (659) T protein:vir:10 124 AIETEGKIT-EVDTDGKIKKINIPTAKIIAKAKEVGEYPTLGSNWTAEISSSSSGLAAVITLGKIITDSGILLAEIENAE 202 (659) T ss_pred Cccccceee-EEecccccceeeecccccccccccccccceeeeeeeeeeeeeccccceeeEEeeeecCCceeEEeecccc Confidence 111111110 0000 000 000000000000 00 000000 00000 000000000 Q ss_pred HhcCCChHHHHHHHHHhhcccCCccCCCEEEEEeeecccceeeEeeccccccchhhheeeeeEEEEEEccceE----EEE Q lcl|NC_017984. 52 QHYGLESPIYKFATVYFNGFRNATTRPNSLFITKYNLTDVPASLIGGDITSTTLADLKLINGTLTIVVDGVSK----SVP 127 (487) Q Consensus 52 ~~fg~~s~ey~aA~~yF~g~~~q~p~P~~l~igr~~~~~~~~~l~g~~~~~~~~~~~~~~~g~~~iti~g~~~----~~~ 127 (487) .... ++......... ..+.+..++-|-+...-+...... ........+.+.+...+... ... 
T Consensus 203 ~a~t--~~~~~~~~~~~-----~~~~v~a~~~G~~g~~~tv~~~~~-------a~~~~~~~v~v~~~~~~~~~a~~~t~~ 268 (659) T protein:vir:10 203 AAMT--AVDFQANLKKY-----GIPGVVALYPGELGDKIEIEIVSK-------ADYAKGASALLPIYPGGGTRASTAKAV 268 (659) T ss_pred cccc--ccccccceeec-----ccccccccccceecccceEEEech-------hhccccceeeeeeeeecccccccceee Confidence 0000 00000000000 011111111111110000000000 00000000111111000000 000 Q ss_pred eeccccCc-hHHHHHhhhhhheeeEEE--ecccc------eEEEEecccccceeEEecccchhhhhhhccccce-eEecC Q lcl|NC_017984. 128 VDLATANS-YSDAAALIATALTLPCTY--ESTVK------GFVIKSGTSGANSTISFATGDISDDLKLTQETGA-VLNNH 197 (487) Q Consensus 128 i~~s~ats-~~~vA~~i~t~l~a~vt~--d~~~~------~f~its~t~g~~stit~atgd~a~~l~lt~~~gA-~~~~G 197 (487) +.+..... ...++-.....+..+... ..... .+.......+. +...+... .... . ...+. ....| T Consensus 269 ~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~v~~~~-~~~~--~-~~~~~~~l~gg 343 (659) T protein:vir:10 269 FGYGPQTDSQYAIIVRRNDAIVQSVVLSTKRGEKDIYDSNIYIDDFFAKGG-SEYIFATA-QNWP--E-GFSGILTLSGG 343 (659) T ss_pred eeeccccccchhhccccccceeeeeeeeccccccccccchhhhhhhhccCc-ccEEEEee-cccC--C-Cccceeeeccc Confidence 00000000 000000000000000000 00000 00000000000 00000000 0000 0 00000 11111 Q ss_pred cc------cccHHHHHHHHHhcccceeEEEEEeccCC------hhHHHHHHHHHhccCceEEEEEccccccc---cccch Q lcl|NC_017984. 198 TA------ADTPTTGALNALAFSQNFVNITYSEGVFN------EDALKDLALWVTSQNSRFKLYTWGLDPVA---LGQSG 262 (487) Q Consensus 198 ~a------aet~~~al~a~~~~~~~wy~~~~~~~~~~------~~~i~a~A~w~~a~~~~~~~~~~~~~~~~---~~~~~ 262 (487) .+ ......++.++.....---.++++..... .....++...++....+|.+......... ..... 
T Consensus 344 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~il~~p~~~~~~~~~~~~v~~al~~~~~~~~~~~~~~d~p~~~~~~~~~~~~~ 423 (659) T protein:vir:10 344 LSSNAEVTAGDLMEAWDFFADRESVDVQLFIAGSCAGESLETASTVQKHVVSIGDARQDCLVLCSPPRETVVGIPVTRAV 423 (659) T ss_pred ccccccccchhHHHHHHHhhhccccceeEEEecCCCCcchhhhHHHHHHHHHHHHhhCCeEEEEcCccccccCCCcccCH Confidence 11 11123344444333221122333322211 22345566777766666554332211111 01111 Q ss_pred HHHHHHHhC---------C--cceEEEecC------C-------CchHHHHHHHHHhcCcCcCCceeeeeeeecCccc-- Q lcl|NC_017984. 263 ASFGEWAKE---------N--TSGVVPLYG------T-------FDKAAFFCGVSGSINYQEENGRTTTAFRSQDGLV-- 316 (487) Q Consensus 263 ~~~~~l~~~---------~--~~~t~~~y~------~-------~~~~a~~~g~~as~~~~~~~gs~T~~fk~l~Gv~-- 316 (487) ..+..+.+. + ..+....|. + -....+++|..+.++.++.+ .....+|.+.||. T Consensus 424 ~~~~~~r~~~~~~~~~~~~~~s~~~~l~~p~~~~~d~~~~~~~~~p~sg~~AGl~Ar~D~~~g~-~~span~~~~~i~g~ 502 (659) T protein:vir:10 424 DNLVNWRTAAGSYTDNNFNISSTYAAIDGNYKYQYDKYNDVNRWVPLAADIAGLCARTDNVSQT-WMSPAGYNRGQILNV 502 (659) T ss_pred HHHHHHHHhcccccccccccCcceEEEEeCcEEEecccCCceEEechHHHHHHHHHHHhccCCc-eEccCCceeeeeecc Confidence 112112110 1 122222211 1 12346677888877644321 1122334433332 Q ss_pred ---ccCCCHHHHHHHHhCCceEEEEecCCCceEEEEECCEEcCC---ceehHHHHHHHHHHHHHHHHHHHHHHhcCCCCc Q lcl|NC_017984. 317 ---PDVTNEADAETLVKNGYSFYGAWATANDRFQFAGNGSVTGQ---YKWIDNFDFQVFLRTQLQLAYMNMFQAQKTIPY 390 (487) Q Consensus 317 ---~~~lt~t~~~al~~~~~n~y~~~~~~~~~~~~~~~G~~sgg---~~~iD~~~~~dWl~~~iq~~l~~ll~~~~kIPy 390 (487) ...+++.|.+.|..+++|+...+.+.+ +.+|-.-++++. +.||-+.+-.+|+...|+..+.-.+= =|. 
T Consensus 503 ~~~~~~~~~~~~~~Ln~~gIn~i~~~~~~G--~~~wG~rT~~~~~s~~~~i~vrR~~~~i~~si~~~~~~~v~----e~n 576 (659) T protein:vir:10 503 IKLAIETRQAQRDRLYQEAINPVTGTGGDG--YVLYGDKTATSVPSPFDRINVRRLFNMLKTNIGRSSKYRLF----ELN 576 (659) T ss_pred ccceecCCHhHHHHHhhCCeeEEEEeCCCe--EEEEcccccCCCCcccceEehhhHHHHHHHHHHHHHHHhcc----CCC Confidence 235789999999999999999887655 456654444433 35788889999999988888766432 256 Q ss_pred CHhhHHHHHHHHHHHHHHHHhcCccccCcccCccccccccccccCccccceeeeeeEEe--ccCCCHHHHhhcccCCeEE Q lcl|NC_017984. 391 NDQGIATVRAYSQDPIDQGINFGGIRAGVNLSNAQKFQVNQEAGFDAASQLFTKGWALS--VTLPDSQTRVARESFIIKL 468 (487) Q Consensus 391 t~~G~~~l~~~v~~vl~~a~~nG~Ia~Gv~l~~~q~~~~~~~~g~~~~~~~~~~Gy~~~--~~~~s~~dra~R~~~~i~~ 468 (487) ++.=...|+..|+.-|+.-+++|.|. ||++. .+..+++|+.+.+.. +.+ T Consensus 577 ~~~l~~~i~~~i~~fL~~l~~~gal~----------------------------~~~V~~d~~~nt~~~i~~G~~~-~~i 627 (659) T protein:vir:10 577 NAFTRSSFRTETAQYLQGIKALGGIY----------------------------EYRVVCDTTNNTPSVIDRNEFV-ATF 627 (659) T ss_pred CHHHHHHHHHHHHHHHHHHHhcCcee----------------------------eEEEEEcCCCCCHHHhhCCeEE-EEE Confidence 77778899999999999999999884 34554 456788898888874 999 Q ss_pred EEEECCeEEEEEEEEEeeC Q lcl|NC_017984. 469 FYTDGSSMQRLEMTATNVQ 487 (487) Q Consensus 469 ~~~~aGAIh~v~i~gt~vq 487 (487) .+...-.+++|.++ ++| T Consensus 628 ~~~p~~pae~i~~~--~~~ 644 (659) T protein:vir:10 628 YIQPARSINYITLN--FVA 644 (659) T ss_pred EEEecCCcceEEEE--EEE Confidence 99999999999997 556 No 50 >protein:vir:102819 Length: 648 # NCBI annotation: tail sheath protein # Family: family:all:2449 # MgeID: mge:1610 # MgeName: YS40 # Cross-refs: genbank:acc:YP_874082;genbank:gi:118197689;genbank:GeneID:4496011 Probab=94.49 E-value=0.0043 Score=33.57 Aligned_cols=420 Identities=15% Similarity=0.085 Sum_probs=153.8 Q ss_pred CCcCCccccceEEEeeeeecccccccccceeEEec---------------------Ccceeee------eeccHHHHHHh Q lcl|NC_017984. 
1 MQFNSIPASNIAAVYPAVIGGGGNPLGLNTNLFVQ---------------------DAIYPNY------EYFSNTLVGQH 53 (487) Q Consensus 1 ~~~~~ip~s~iV~V~~~~~~~~~~~~~~~~ll~~~---------------------~~~~~~~------~y~s~~~V~~~ 53 (487) ..-+.++-.. ++++...... .-|-.+++.. ...+..+ .+. .+.+... T Consensus 122 ~~~~~~~~~~--~l~v~~~~~~---~~~d~~v~~i~~~~~~y~gt~~~~t~~v~~~~~~~~~~~~~~~~~~~-~~~v~~~ 195 (648) T protein:vir:10 122 TRSNQIYVSF--DLDENFTSAN---EADDTIIFTIYQKHPDFSVTRETFTFPRKFTTPTVLVKRGSTLFFVD-RSIVNAA 195 (648) T ss_pred EcCCCcCcee--EEEEEecCCC---cccceeEEEeccCCCcccccceeccccccccccccccccccceeecC-ccchhhh Confidence 2223333222 2222221111 1111111110 0000000 000 0111111 Q ss_pred cCCChHHH--------------HHHHHHhhcccCCc-cCCCEEEEEeeecccceeeEeeccccccchhhheeeeeEEEEE Q lcl|NC_017984. 54 YGLESPIY--------------KFATVYFNGFRNAT-TRPNSLFITKYNLTDVPASLIGGDITSTTLADLKLINGTLTIV 118 (487) Q Consensus 54 fg~~s~ey--------------~aA~~yF~g~~~q~-p~P~~l~igr~~~~~~~~~l~g~~~~~~~~~~~~~~~g~~~it 118 (487) -. .++.+ ......|. .+ ..|..+-.+-..-+ .+.+...........+...|.+. T Consensus 196 ~~-~~~~~~~~~v~~~~~~~~~~~~~~~~~----~s~~~~~d~~~~~~~~~----a~~~~~~~~~~~~~~~~~~gd~~-- 264 (648) T protein:vir:10 196 LA-AGPAFQTALINLLKEQLQPTDVVQIFD----ASDTNPVDIPLGLFVYE----VLYGGLFGFTKSRLVKTSFGTVD-- 264 (648) T ss_pred hc-cCccchhhhhhchhhhhhhhhhheecc----ccccccccccccccccc----ccchhhhcCCcchhhhhhhcccc-- Confidence 00 01111 11111111 00 00000000000000 00111111000000011111110 Q ss_pred EccceEEEEeeccccCchHHHH-----------H------hhhhhheeeEEEecccceE---EEEecccccceeE----- Q lcl|NC_017984. 119 VDGVSKSVPVDLATANSYSDAA-----------A------LIATALTLPCTYESTVKGF---VIKSGTSGANSTI----- 173 (487) Q Consensus 119 i~g~~~~~~i~~s~ats~~~vA-----------~------~i~t~l~a~vt~d~~~~~f---~its~t~g~~sti----- 173 (487) +....+.-++++....+.+.- . ....+|.. 
.+-.+..-.| .++..++|...+- T Consensus 265 -~~~~~~~~~~~~~tp~~~~~~~~~~~~~~~~~~~~~v~~~~~~~l~~-~~~~p~~~~~~~t~L~GGtdG~~p~s~~~~~ 342 (648) T protein:vir:10 265 -DLLSNPLLFNLSATPFFDGSDYQDYTSLSDPANWFAKDAYTINHLVD-TTINPHILATRIFSLSGGTNGDDGTGYYQTA 342 (648) T ss_pred -ccccccceecccccccccccceeeeeccccccceeeeeccchhhccc-ccccCcccccccceecccccCCCcccccccc Confidence 000011112222111111100 0 00000000 0000000111 1223333322100 Q ss_pred -EecccchhhhhhhccccceeE-ec-------------Ccccc-cHHHHHHHHHhccc--------ceeEEEEEeccCCh Q lcl|NC_017984. 174 -SFATGDISDDLKLTQETGAVL-NN-------------HTAAD-TPTTGALNALAFSQ--------NFVNITYSEGVFNE 229 (487) Q Consensus 174 -t~atgd~a~~l~lt~~~gA~~-~~-------------G~aae-t~~~al~a~~~~~~--------~wy~~~~~~~~~~~ 229 (487) +-..+|+++.|.+.+..+... +. .+... -..+++..+.+.+. .|..++. ....+ T Consensus 343 ~~~~~~d~~d~l~~~~~~~~~~ivp~~~~~~~~~~~~~lt~~q~i~a~a~shv~~~s~~~~~~~r~~~~~~vg--~~~~e 420 (648) T protein:vir:10 343 VSNYINIWSQGLATLEEEEVNFVIPAYKFTNVTQLNDRLTIFKGIASTFLSHVQTMSQVNRRKARVGVFGLPA--PSPNE 420 (648) T ss_pred cccchhhHHHHhhhccCCCceEEEeecccccccccccccCCccchHHHHHHHHHHhhhccccccccCeEEEeC--CCCch Confidence 011234544444443322211 10 00000 01122222222211 1211111 11011 Q ss_pred h--HHHHHHHHHhccCceEEEEEccccccccccchHHHHHHHhCCcceEEEecCCCchHHHHHHHHHhcCcCcCCceeee Q lcl|NC_017984. 230 D--ALKDLALWVTSQNSRFKLYTWGLDPVALGQSGASFGEWAKENTSGVVPLYGTFDKAAFFCGVSGSINYQEENGRTTT 307 (487) Q Consensus 230 ~--~i~a~A~w~~a~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~t~~~y~~~~~~a~~~g~~as~~~~~~~gs~T~ 307 (487) . +.+-+-.-...+..+.++.-.+...+...- +......+ .+-+.+....+-++++.|..+++.+... . 
T Consensus 421 s~~~se~~~~~~~~~~~~a~~~~~d~~~~~~~~----~~~~~~~~-~G~~~~~p~~~~Aa~VAGl~a~l~~~~s-----~ 490 (648) T protein:vir:10 421 SVTASEYLYNRNILNTISAMFGGTDRAQAVVFP----FYSNVFND-EGKVELLGGEFFASYVAGMHANREPQDS-----I 490 (648) T ss_pred hHHHHHHHhhhhcccccceeeeecCCceEEeec----ccceeECC-CCcEEecchhhHHHHHHhhhhccccccC-----c Confidence 0 000000000011122222222111111100 00000000 1112222333445666777777654443 4 Q ss_pred eeeecCccc--c-cCCCHHHHHHHHhCCceEEEEecCCCce--EEEEECCEEcCC------ceehHHHHHHHHHHHHHHH Q lcl|NC_017984. 308 AFRSQDGLV--P-DVTNEADAETLVKNGYSFYGAWATANDR--FQFAGNGSVTGQ------YKWIDNFDFQVFLRTQLQL 376 (487) Q Consensus 308 ~fk~l~Gv~--~-~~lt~t~~~al~~~~~n~y~~~~~~~~~--~~~~~~G~~sgg------~~~iD~~~~~dWl~~~iq~ 376 (487) -||.++++. + ..++++|++.|..+|++++....+++.+ +++. .|.++-+ ..-|-+++-.|.+...++. T Consensus 491 T~k~i~~~~id~~~~~t~~qld~L~~~Gv~~ie~~~~~~~~~~~rvv-~gITT~~~~~~~~~~eisv~ri~D~l~~~vr~ 569 (648) T protein:vir:10 491 TFLPISGIGAEPLYNWTYTQKDDLISNRVLFVEKVKTSFGGIVYRIH-HNPTTWLGPVTQGFQEFVLRRIDDFLQSYVYK 569 (648) T ss_pred ccceeeccccccccCCCHHHHHHHhcCCcEEEEEecCCcceeeEEEe-ccceeecCCCCcceeeeeeeehhhHHHHHHHH Confidence 466665553 3 4789999999999999999888776543 3333 3443322 1247889999999999999 Q ss_pred HHHHHHHhcCCCCcCHhhHHHHHHHHHHHHHHHHhcCccccCcccCccccccccccccCccccceeeeeeEEeccCCCHH Q lcl|NC_017984. 377 AYMNMFQAQKTIPYNDQGIATVRAYSQDPIDQGINFGGIRAGVNLSNAQKFQVNQEAGFDAASQLFTKGWALSVTLPDSQ 456 (487) Q Consensus 377 ~l~~ll~~~~kIPyt~~G~~~l~~~v~~vl~~a~~nG~Ia~Gv~l~~~q~~~~~~~~g~~~~~~~~~~Gy~~~~~~~s~~ 456 (487) .+.+.|+-. |=++.....|++.+..-|.+-++.+-|.+-.+++. ... .. T Consensus 570 ~l~~~fIG~---~n~~~~~~~ik~~i~~~L~~~~~~~~I~~y~~~~v-------------------------~~~--~~- 618 (648) T protein:vir:10 570 NLQEQFIGR---KSYGRKTENDIKVYTEALLSNLVGKQIVAYKDVKV-------------------------TSN--ED- 618 (648) T ss_pred HHhhhcCcc---cccHHHHHHHHHHHHHHHhhHhhcCcccCcccceE-------------------------EEE--ec- Confidence 999998864 45677888999999999888888777764211110 000 01 Q ss_pred HHhhcccCCeEEEEEECCeEEEEEEEEEeeC Q lcl|NC_017984. 
457 TRVARESFIIKLFYTDGSSMQRLEMTATNVQ 487 (487) Q Consensus 457 dra~R~~~~i~~~~~~aGAIh~v~i~gt~vq 487 (487) ..|. -|.|.+....+|++|.++..+.= T Consensus 619 --~~vv--~V~~~v~Pv~~i~~I~vti~it~ 645 (648) T protein:vir:10 619 --KTVY--YVEFFYQPVTEIKFILVTMKVTF 645 (648) T ss_pred --CCEE--EEEEEEEecceeeEEEEEEEEEe Confidence 1232 58899999999999999988777 No 51 >protein:vir:5833 Length: 742 # NCBI annotation: similar to tail sheath protein # Family: family:all:661 # MgeID: mge:123 # MgeName: RM 378 # Cross-refs: genbank:acc:NP_835619;genbank:gi:30044022 Probab=94.46 E-value=0.0044 Score=33.53 Aligned_cols=388 Identities=10% Similarity=0.078 Sum_probs=149.7 Q ss_pred CCcCCccccceEEEeeeee--cccc---cccccceeEEecC----------cceeee--------eeccHHHHHHhcCCC Q lcl|NC_017984. 1 MQFNSIPASNIAAVYPAVI--GGGG---NPLGLNTNLFVQD----------AIYPNY--------EYFSNTLVGQHYGLE 57 (487) Q Consensus 1 ~~~~~ip~s~iV~V~~~~~--~~~~---~~~~~~~ll~~~~----------~~~~~~--------~y~s~~~V~~~fg~~ 57 (487) .+.+. | +..+.|.-... +++. ....+.+++-.++ ..+++. ++.-.+++-.. |. T Consensus 308 ~~n~~-~-~~~~~~~~~~~~~~~~~s~~~~~~~~~~~~v~d~~~~~~~~~~v~~~~t~~~~~pp~~~~~~e~v~~n-gG- 383 (742) T protein:vir:58 308 YPNQV-P-FLRVVVSQDIKQNVAGVEKWVPVGFEGIYSVGDFTVIVNELTNVSIPVTDSAIIPPMRFTRIEQITLS-GG- 383 (742) T ss_pred ccccc-c-ceeeEeccccCcCccceeEEEeccccccccccceeeeccccccceeeccccccCCcccccccceeecc-cC- Confidence 11111 1 01111111110 0000 0111222222211 111111 11111111000 00 Q ss_pred hHHHHHHHHHhhcccCCccCCCEEEEEeeecccceeeEeeccccccchhhheeeeeEEEEEEccceEEEEeeccccCchH Q lcl|NC_017984. 58 SPIYKFATVYFNGFRNATTRPNSLFITKYNLTDVPASLIGGDITSTTLADLKLINGTLTIVVDGVSKSVPVDLATANSYS 137 (487) Q Consensus 58 s~ey~aA~~yF~g~~~q~p~P~~l~igr~~~~~~~~~l~g~~~~~~~~~~~~~~~g~~~iti~g~~~~~~i~~s~ats~~ 137 (487) ..|.-..++ |..-.+.-+|.. .....+++......-+.... .. 
T Consensus 384 --------~~f~v~s~~-~~g~~i~~~~as--------------------------~~~s~ln~~~~V~Gt~aa~~--~~ 426 (742) T protein:vir:58 384 --------ASFSVISNQ-PYGFNIQDSRHS--------------------------YWLSPFKDDELIIGTELVLP--AL 426 (742) T ss_pred --------cceEEEEec-ccCcceeccCcc--------------------------eEEeccCCceEEEeehhhcc--cc Confidence 011100000 000001111000 00000111000000000000 00 Q ss_pred HHHHhhhhhheeeEEEecccceEEEEec-ccccceeEEecccchhhhhhhccccceeEecCcccccHHHHHHHHHhcccc Q lcl|NC_017984. 138 DAAALIATALTLPCTYESTVKGFVIKSG-TSGANSTISFATGDISDDLKLTQETGAVLNNHTAADTPTTGALNALAFSQN 216 (487) Q Consensus 138 ~vA~~i~t~l~a~vt~d~~~~~f~its~-t~g~~stit~atgd~a~~l~lt~~~gA~~~~G~aaet~~~al~a~~~~~~~ 216 (487) +.. +....+ +++.....+.++.. .-|.+..+...... .+..+. ........ ...+.|.++.+.. + T Consensus 427 d~~----t~~~v~-s~~~alp~~a~sv~laGG~dg~v~v~~~~-~D~iG~-----~~~~d~~~--adrTGL~ALlev~-e 492 (742) T protein:vir:58 427 DVS----TEFGVS-SWEEALPEFSFLMPFQGGSDGYIRVDENE-PDTIGR-----VKITPALL--ANYERLLPLLTED-Q 492 (742) T ss_pred ccc----hheecc-ccccccceeeEEEeecCCccccccccCCC-cccccc-----cccccccc--cchhHHHHhhhcC-C Confidence 000 000000 00000011111100 00111111100000 000000 00000000 0112233332221 1 Q ss_pred eeEEEEEeccCChhHHHHHHHHHhccCceEEEEEccccccccccchHHHHHHHhCCcceEEEecCC-----------Cch Q lcl|NC_017984. 217 FVNITYSEGVFNEDALKDLALWVTSQNSRFKLYTWGLDPVALGQSGASFGEWAKENTSGVVPLYGT-----------FDK 285 (487) Q Consensus 217 wy~~~~~~~~~~~~~i~a~A~w~~a~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~t~~~y~~-----------~~~ 285 (487) ..++.+......+...++...++...+|++... +.+... .............+..+....|.- -.. 
T Consensus 493 -VtILiAPG~t~~~v~aav~A~la~a~~Rl~vL~-D~P~~~-tt~~~A~a~r~~~nSsraaly~PwVkv~d~~~~r~vPp 569 (742) T protein:vir:58 493 -FDLVLTPYLTFADHAGTVNAFINRAENRFLYLF-DIAGDD-DTENLAISLAGYINSSFATTFFPWVRRLTNKGMRTVPA 569 (742) T ss_pred -CcEEEEcCCCchHHHHHHHHHHHhhcCCeEEEE-ecCCCC-chHHHHHHHHhccCCceEEEEeceeeeccCCcceeech Confidence 122222222222333334444443333333221 111100 000111111111122333322210 123 Q ss_pred HHHHHHHHHhcCcCcCCceeeeeee-ecCcccccCCCHHHHHHHHhCCceEEEEecCCCceEEEEECCEEc-C-C--cee Q lcl|NC_017984. 286 AAFFCGVSGSINYQEENGRTTTAFR-SQDGLVPDVTNEADAETLVKNGYSFYGAWATANDRFQFAGNGSVT-G-Q--YKW 360 (487) Q Consensus 286 ~a~~~g~~as~~~~~~~gs~T~~fk-~l~Gv~~~~lt~t~~~al~~~~~n~y~~~~~~~~~~~~~~~G~~s-g-g--~~~ 360 (487) ...+.|.+|.++.+ +|- |+-- ....+.....+++|++.|..+++|+...+ +.+ +.+|- +.++ + + +.| T Consensus 570 SgaIAGL~ARtD~e--rGv--w~SPANrgii~~~~~s~se~d~LN~~GINtIrsf-G~G--~rlWG-nRTlassDs~wry 641 (742) T protein:vir:58 570 SLAAYRSIRTTDPE--TGL--APVGARRGVVTGEPVRQVDWEDLYNNRINPIVRV-GND--VLLFG-QKTMLNVNSALNR 641 (742) T ss_pred HHHHHHHHHHhccC--Cce--EecCCcceeeeccccchhhHHHHhhCCceEEEEC-CCc--EEEEc-ceecCCCCcccce Confidence 45677777777643 331 2211 11223334578899999999999999876 433 55664 4443 3 2 357 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHhcCCCCcCHhhHHHHHHHHHHHHHHHHhcCccccCcccCccccccccccccCccccc Q lcl|NC_017984. 361 IDNFDFQVFLRTQLQLAYMNMFQAQKTIPYNDQGIATVRAYSQDPIDQGINFGGIRAGVNLSNAQKFQVNQEAGFDAASQ 440 (487) Q Consensus 361 iD~~~~~dWl~~~iq~~l~~ll~~~~kIPyt~~G~~~l~~~v~~vl~~a~~nG~Ia~Gv~l~~~q~~~~~~~~g~~~~~~ 440 (487) |-+.+-.+|++..|+..+...+-. |.|+.-...|+..|+.-|+.-+++|.|.- T Consensus 642 InVRRlfd~Ie~SI~~a~q~~VfE----PNd~~L~~sIk~sInafL~~L~aqGALlG----------------------- 694 (742) T protein:vir:58 642 INVRRLLIVMRNRISQILSSYLFE----NNTSENRLRAEALVRQYLESLRLRGAVTD----------------------- 694 (742) T ss_pred EeehhhHHHHHHHHHHHHHHhccC----CCCHHHHHHHHHHHHHHHHHHHhCCceee----------------------- Confidence 999999999999888887664322 67888899999999999999999998852 Q ss_pred eeeeeeEEec-cCCCHHHHhhcccCCeEEEEEECCeEEEEEEEEEeeC Q lcl|NC_017984. 
441 LFTKGWALSV-TLPDSQTRVARESFIIKLFYTDGSSMQRLEMTATNVQ 487 (487) Q Consensus 441 ~~~~Gy~~~~-~~~s~~dra~R~~~~i~~~~~~aGAIh~v~i~gt~vq 487 (487) |.+.. ++.+++|+.+-+. -+.+.+...-.+++|+++-+..| T Consensus 695 -----frV~lDetNTpeDI~~Gkl-vv~I~vAP~~PAEfI~lrf~it~ 736 (742) T protein:vir:58 695 -----YEVAIDSVTTPTDIDNNTL-RARVTVQPARSIEYIDITFVITP 736 (742) T ss_pred -----eEEEEcCCCCHHHhhCCEE-EEEEEEEccCCcceEEEEEEEEe Confidence 33332 3467778776665 48888899999999999988888 No 52 >protein:vir:96740 Length: 388 # NCBI annotation: phage tail sheath protein FI-like # Family: family:all:115 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039839;genbank:gi:126010913;genbank:GeneID:5076250 Probab=93.81 E-value=0.0063 Score=32.65 Aligned_cols=351 Identities=13% Similarity=0.082 Sum_probs=148.1 Q ss_pred HHHHHhhcccCCccCCCEEEEEeeecccceeeEeeccccccchhhheeeeeEEEEEE-ccceEEEEeeccc-cCchHHHH Q lcl|NC_017984. 63 FATVYFNGFRNATTRPNSLFITKYNLTDVPASLIGGDITSTTLADLKLINGTLTIVV-DGVSKSVPVDLAT-ANSYSDAA 140 (487) Q Consensus 63 aA~~yF~g~~~q~p~P~~l~igr~~~~~~~~~l~g~~~~~~~~~~~~~~~g~~~iti-~g~~~~~~i~~s~-ats~~~vA 140 (487) |+.. ....|+ ++|=+......+..... .... | +..+- ++. ...+++... ..+..+.+ T Consensus 1 m~~~-------~~~~hG-v~v~ev~~g~~~i~~~~----tavi-------~-~Vgta~~ad-~~~p~~~~~~i~~~~d~~ 59 (388) T protein:vir:96 1 MPVI-------DQFEHN-GISIETHEPPPPMGPPG----DNVV-------A-WVVTAPDKH-ADVAFSVPFRVANTADAQ 59 (388) T ss_pred CCCC-------CCCCCc-eEEEEcCCCcccccccC----ccee-------E-EEEecCCCc-cccccccceeeecchhhh Confidence 1100 000111 23333322221111000 0000 0 00000 000 000000000 00011111 Q ss_pred HhhhhhheeeEE-EecccceEEEEeccccc-ceeEEecccchhhhhhhccccceeEecCccccc-HHHHHHHHHhcccce Q lcl|NC_017984. 141 ALIATALTLPCT-YESTVKGFVIKSGTSGA-NSTISFATGDISDDLKLTQETGAVLNNHTAADT-PTTGALNALAFSQNF 217 (487) Q Consensus 141 ~~i~t~l~a~vt-~d~~~~~f~its~t~g~-~stit~atgd~a~~l~lt~~~gA~~~~G~aaet-~~~al~a~~~~~~~w 217 (487) .+........+ +++. ..|. ...+. ...+.+..++.. ..+.+.+..+.++.+ ..+.+.++..... 
- T Consensus 60 -~~~~~~~~~gtl~~al-~~~~---~~~~~~~~vv~v~~g~~~------~at~a~iig~~~~~tg~~~gl~al~~~~~-~ 127 (388) T protein:vir:96 60 -YLDSTGNELGTGWHAA-SETL---KKTSVPQYFIVVPEGADD------AATMANIIGGIDPTTGRRTGIAALTECTE-R 127 (388) T ss_pred -hhhccccccccchhhh-Hhhh---ccCCceEEEEEecccccc------ccccceeeeecccccchhhHHHHhhhccc-c Confidence 11111100000 0000 0000 00000 000111111000 000111111221111 1223333333221 1 Q ss_pred eEEEEEecc-CChhHHHHHHHHHhccCceEEEEEccccccccccchHHHHHHHhCCc--ceEEEecC------C------ Q lcl|NC_017984. 218 VNITYSEGV-FNEDALKDLALWVTSQNSRFKLYTWGLDPVALGQSGASFGEWAKENT--SGVVPLYG------T------ 282 (487) Q Consensus 218 y~~~~~~~~-~~~~~i~a~A~w~~a~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~--~~t~~~y~------~------ 282 (487) ..++.+... .......++...++.-+ .|..+. .+.................++ .+....|. + T Consensus 128 p~il~aPg~s~~~~v~~al~~~~~~~~-~~~i~D--~p~~~~~~~~~~~~~~~~~~~~s~~~~~~~P~~~~~d~~~~~~~ 204 (388) T protein:vir:96 128 PTLIGAPGFSQNKAVIDALASMAKRLK-CRAVID--GPSGSTQDAIDLSGLLGGEGTGHDRVYMVDPMPAIYSRKAQGNI 204 (388) T ss_pred eeEEEeeccccchHHHHHHHHHHhhcC-cEEEEe--ccCCchhHHHHHHhhhhccCcCcceEEEEeCceeeecccCCcee Confidence 122222111 12222334444444321 222211 111000000000111111111 22222221 0 Q ss_pred -CchHHHHHHHHHhcCcCcCCceeeeeeeecCcccc-----cCCCHHHHHHHHhCCceEEEEecCCCceEEEEECCEEcC Q lcl|NC_017984. 283 -FDKAAFFCGVSGSINYQEENGRTTTAFRSQDGLVP-----DVTNEADAETLVKNGYSFYGAWATANDRFQFAGNGSVTG 356 (487) Q Consensus 283 -~~~~a~~~g~~as~~~~~~~gs~T~~fk~l~Gv~~-----~~lt~t~~~al~~~~~n~y~~~~~~~~~~~~~~~G~~sg 356 (487) ...+..+.|..+.+++...+.-..+. +.|+.- ..++.+|++.|..+|+|++..+.+.+ +.+| ...++. 
T Consensus 205 ~~p~s~~~AG~~a~~D~~~spaN~~i~---i~g~~~~~~~~~~~~~~~~~~Ln~~gI~~i~~~~~~G--~~~w-G~rT~~ 278 (388) T protein:vir:96 205 YVPPSTIAMGAVAAVKPWESPGNQGVL---IQDVARVIDYNILDKSTEGDLLNRNGVSYFARTSMGG--FSLI-GNRTVT 278 (388) T ss_pred eechHHHHHHHHHhhcCcccccCeeEE---eeeecccccccccCChhhHHhhhhcCceEEEEecCCc--EEEE-cccccC Confidence 12356677888877764444332221 344431 23477899999999999999887655 4455 444555 Q ss_pred CceehHHHHHHHHHHHHHHHHHHHHHHhcCCCCcCHhhHHHHHHHHHHHHHHHHhcCccccCcccCccccccccccccCc Q lcl|NC_017984. 357 QYKWIDNFDFQVFLRTQLQLAYMNMFQAQKTIPYNDQGIATVRAYSQDPIDQGINFGGIRAGVNLSNAQKFQVNQEAGFD 436 (487) Q Consensus 357 g~~~iD~~~~~dWl~~~iq~~l~~ll~~~~kIPyt~~G~~~l~~~v~~vl~~a~~nG~Ia~Gv~l~~~q~~~~~~~~g~~ 436 (487) . .||-+.+-.+|++..|+..+...+ .+ |.|+.=...|+..|+.-|+.-+++|.|..+. T Consensus 279 ~-~~i~vrR~~~~i~~si~~~~~~~v---~e-pn~~~~~~~i~~~i~~fL~~l~~~Gal~g~~----------------- 336 (388) T protein:vir:96 279 G-KFISFVGLEDAIARKLEAASQRAM---SK-QLTKSFMEQEIKKINLFMQDLVAAEIIPGGE----------------- 336 (388) T ss_pred C-cceeehhhHHHHHHHHHHHHHHhc---cC-CCCHHHHHHHHHHHHHHHHHHHhCCceeeeE----------------- Confidence 4 699999999999999988877543 23 6788888999999999999999999886321 Q ss_pred cccceeeeeeEEeccCCCHHHHhhcccCCeEEEEEECCeEEEEEEEEEeeC Q lcl|NC_017984. 437 AASQLFTKGWALSVTLPDSQTRVARESFIIKLFYTDGSSMQRLEMTATNVQ 487 (487) Q Consensus 437 ~~~~~~~~Gy~~~~~~~s~~dra~R~~~~i~~~~~~aGAIh~v~i~gt~vq 487 (487) - |+..+..+++|+.+-+.. 
+.+.+...-.+++|+++....= T Consensus 337 --------~-~~d~~~nt~~~i~~G~~~-~~i~~~p~~pae~I~~~~~~~~ 377 (388) T protein:vir:96 337 --------V-YLHPTLNTVERYKNGSWY-IVIDYGRYSPNEHMIFHLNAVD 377 (388) T ss_pred --------E-EEecCCCCHHHhhCCEEE-EEEEEEecCCcceEEEEEEEch Confidence 0 235667788888877764 8888888999999988754321 No 53 >protein:vir:6894 Length: 660 # NCBI annotation: gp18 tail sheath protein # Family: family:all:661 # MgeID: mge:140 # MgeName: RB69 # Cross-refs: genbank:acc:NP_861870;genbank:gi:32453661;genbank:GeneID:1494296 Probab=93.75 E-value=0.0065 Score=32.57 Aligned_cols=425 Identities=12% Similarity=0.025 Sum_probs=154.6 Q ss_pred CCcCCccccc-----------eEEEeeeeecc---ccc-----cccccee-EEecCcceeeeeeccHHHHHHhcCCChHH Q lcl|NC_017984. 1 MQFNSIPASN-----------IAAVYPAVIGG---GGN-----PLGLNTN-LFVQDAIYPNYEYFSNTLVGQHYGLESPI 60 (487) Q Consensus 1 ~~~~~ip~s~-----------iV~V~~~~~~~---~~~-----~~~~~~l-l~~~~~~~~~~~y~s~~~V~~~fg~~s~e 60 (487) ..-..++... +..+-...... +.. ...-+.. -+..........+. ...+-.+-+..-+. T Consensus 119 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~ta~~~~~a~~~~~~~~~~~~~~~~v~~~~~~~~~~~~-v~~~~~d~~~~~~~ 197 (660) T protein:vir:68 119 KYSTDIIEPDGEVTSVDSDGKILNIFIPSGKIIAKAKEIGEYPELGSNWTAEMSGSSSGLSAVIT-IDSVVMDSGILLTE 197 (660) T ss_pred ecccccccccccceeeeecCceeeeeeccccccccceeeccccccccceeEEeecccccceeeee-eccccccccceeee Confidence 0001111100 00000000000 000 0000000 00000000000000 00000000000000 Q ss_pred HHHHH------HHhhcc-cCCccCCCEEEEEeeecccceeeEeeccccccchhhheeeeeEEEEEEcc-----ceEEEEe Q lcl|NC_017984. 61 YKFAT------VYFNGF-RNATTRPNSLFITKYNLTDVPASLIGGDITSTTLADLKLINGTLTIVVDG-----VSKSVPV 128 (487) Q Consensus 61 y~aA~------~yF~g~-~~q~p~P~~l~igr~~~~~~~~~l~g~~~~~~~~~~~~~~~g~~~iti~g-----~~~~~~i 128 (487) -..+. .+-... ....+.+...+.|.|... ++-........ 
.......+.+..++ ..+.+ + T Consensus 198 ~~ta~~~~~~~~~~~~~~~~~~~~~~A~~~g~~G~~-----i~v~~~~~a~~--~~~~~~~~~~~~~~~~~~~~~~~~-~ 269 (660) T protein:vir:68 198 VETSEEAITSLTFQESIKKYGVPGVVALYPGELGDQ-----LEIEIVSKADY--DKGASAQLKIYPDGGTRYSTAKAI-F 269 (660) T ss_pred eccccccccccceeeeecccCccccccccccccccc-----eEEEEeccccc--cccccccceeeecccccccceeeE-e Confidence 00000 000000 000000000000111000 00000000000 00000000000000 00000 0 Q ss_pred eccccCchHHHHHhhhhhheeeEEE-ecccceEEEEec-----------------ccccceeEEec-ccchhhhhhhccc Q lcl|NC_017984. 129 DLATANSYSDAAALIATALTLPCTY-ESTVKGFVIKSG-----------------TSGANSTISFA-TGDISDDLKLTQE 189 (487) Q Consensus 129 ~~s~ats~~~vA~~i~t~l~a~vt~-d~~~~~f~its~-----------------t~g~~stit~a-tgd~a~~l~lt~~ 189 (487) .....++ + .....+.. +.....|.++.. ..+....+... .+...... T Consensus 270 ~~~~~~~--~-------~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~----- 335 (660) T protein:vir:68 270 GYGPQTD--D-------QYAIIVRRNDSVVQSVVLSTKRGERDIYGSNIFIDDFFAKGASNYIFATAQGWPKGFS----- 335 (660) T ss_pred ecccccc--c-------ceeeeeecCCcceeeeeeecccccccccccceeeehhhccCcccEEEEeecCCCcccc----- Confidence 0000000 0 00000000 000011111100 00101101000 00000000 Q ss_pred cceeEecCcc------cccHHHHHHHHHhcccceeEEEEEeccCC--h----hHHHHHHHHHhccCceEEEEEcccc--- Q lcl|NC_017984. 190 TGAVLNNHTA------ADTPTTGALNALAFSQNFVNITYSEGVFN--E----DALKDLALWVTSQNSRFKLYTWGLD--- 254 (487) Q Consensus 190 ~gA~~~~G~a------aet~~~al~a~~~~~~~wy~~~~~~~~~~--~----~~i~a~A~w~~a~~~~~~~~~~~~~--- 254 (487) .......|.+ ..+...++..+.....---.++....... . ....++...++....+|..+..... 
T Consensus 336 ~~~~~~gg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~v~~~l~~~~~~~~~~~~~~d~~~~~~~ 415 (660) T protein:vir:68 336 GVIKLNGGLSSNETVEAGDLMEAWDLFADRESVNAQLFIAGSCAGESLEVASTVQKHVVAIGDSRQDCLVLCSPPRAAVV 415 (660) T ss_pred ceeeeccccccccccccchhhhHHHHhhhhhccccceeeccccCCCchHHHHHHHHHHHHHHHhhCCeEEEEcccceeEe Confidence 0001111111 11223334443333221111222211111 1 2244566666665555443321110 Q ss_pred -ccccccchHHHHHHHhC----------CcceEEEecC------C-------CchHHHHHHHHHhcCcCcCCc-eeeeee Q lcl|NC_017984. 255 -PVALGQSGASFGEWAKE----------NTSGVVPLYG------T-------FDKAAFFCGVSGSINYQEENG-RTTTAF 309 (487) Q Consensus 255 -~~~~~~~~~~~~~l~~~----------~~~~t~~~y~------~-------~~~~a~~~g~~as~~~~~~~g-s~T~~f 309 (487) .....+..+........ +..+..+.|. + -.....++|.++.++-++ | ...... T Consensus 416 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~sg~~AGl~Ar~d~~~--g~~~span 493 (660) T protein:vir:68 416 GIPVNRAVDNLVDWRTASGTYTDNNFNISSTYAAIDGNYKYQYDKYNDVNRWVPLAADIAGLCARTDNIS--QPWMSPAG 493 (660) T ss_pred cCCCCCCHHHHHHHHhhcccccccccccCcceEEEEcCceEEecccCCceEEechhHHHHHHHHHHhccC--CcEEccCC Confidence 11111111221111111 1223332222 1 023566788888876433 3 112234 Q ss_pred eecCccc-----ccCCCHHHHHHHHhCCceEEEEecCCCceEEEEECCEEcCC---ceehHHHHHHHHHHHHHHHHHHHH Q lcl|NC_017984. 310 RSQDGLV-----PDVTNEADAETLVKNGYSFYGAWATANDRFQFAGNGSVTGQ---YKWIDNFDFQVFLRTQLQLAYMNM 381 (487) Q Consensus 310 k~l~Gv~-----~~~lt~t~~~al~~~~~n~y~~~~~~~~~~~~~~~G~~sgg---~~~iD~~~~~dWl~~~iq~~l~~l 381 (487) |.+.||. ...++++|.+.|..+++|+...+.+.+ +.+|-.-++++. +.||-+.+-.+|+...|+..+.-. 
T Consensus 494 ~~~~~i~g~~~~~~~~~~~~~~~Ln~~gIn~i~~~~~~G--~~~wG~rT~~~~~s~~~~i~vrR~~~~i~~si~~~~~~~ 571 (660) T protein:vir:68 494 YNRGQILNVIKLAIETRQAQRDRLYQEAINPVTGTGGDG--YVLYGDKTATSVPSPFDRINVRRLFNMVKTNIGSASKYR 571 (660) T ss_pred eeeceeeccceeeecCChhHHHHHhhCCceEEEEecCCe--EEEEcceecCCCCcccceEehhhHHHHHHHHHHHHHHHh Confidence 5544442 124789999999999999999887765 455544344443 357888899999998888888764 Q ss_pred HHhcCCCCcCHhhHHHHHHHHHHHHHHHHhcCccccCcccCccccccccccccCccccceeeeeeEE--eccCCCHHHHh Q lcl|NC_017984. 382 FQAQKTIPYNDQGIATVRAYSQDPIDQGINFGGIRAGVNLSNAQKFQVNQEAGFDAASQLFTKGWAL--SVTLPDSQTRV 459 (487) Q Consensus 382 l~~~~kIPyt~~G~~~l~~~v~~vl~~a~~nG~Ia~Gv~l~~~q~~~~~~~~g~~~~~~~~~~Gy~~--~~~~~s~~dra 459 (487) +-. |.++.=...|+..|+.-|+.-+++|.|. ||++ +.+..+++|+. T Consensus 572 v~e----pn~~~~~~~i~~~i~~~L~~l~~~gal~----------------------------gf~V~~d~~~nt~~~i~ 619 (660) T protein:vir:68 572 LFE----LNNAFTRSSFRTETSQYLQGIKALGGVY----------------------------NFKVVCDTTNNTPAVID 619 (660) T ss_pred ccC----CCCHHHHHHHHHHHHHHHHHHHhcCcee----------------------------eeEEEEecCCCCHHHhh Confidence 432 4566668899999999999999999885 2444 35678899999 Q ss_pred hcccCCeEEEEEECCeEEEEEEEEEeeC Q lcl|NC_017984. 460 ARESFIIKLFYTDGSSMQRLEMTATNVQ 487 (487) Q Consensus 460 ~R~~~~i~~~~~~aGAIh~v~i~gt~vq 487 (487) +.+.. +.+.+...-.+++|.++- +| T Consensus 620 ~G~~~-~~i~~~p~~pae~i~l~~--~~ 644 (660) T protein:vir:68 620 RNEFV-ATFYLQPARSINYITLNF--VA 644 (660) T ss_pred CCeEE-EEEEEEecCCcceEEEEE--EE Confidence 88884 999999999999999884 45 No 54 >protein:vir:7206 Length: 659 # NCBI annotation: gp18 tail sheath protein # Family: family:all:661 # MgeID: mge:142 # MgeName: T4 # Cross-refs: genbank:acc:NP_049780;genbank:gi:9632592;genbank:GeneID:1258597 Probab=93.45 E-value=0.0075 Score=32.23 Aligned_cols=421 Identities=11% Similarity=0.035 Sum_probs=158.1 Q ss_pred CCcCCccccce--EEEe-------eeee--------cccccccccc----eeEEecCc----ceee-eeeccHHHHHHhc Q lcl|NC_017984. 
1 MQFNSIPASNI--AAVY-------PAVI--------GGGGNPLGLN----TNLFVQDA----IYPN-YEYFSNTLVGQHY 54 (487) Q Consensus 1 ~~~~~ip~s~i--V~V~-------~~~~--------~~~~~~~~~~----~ll~~~~~----~~~~-~~y~s~~~V~~~f 54 (487) ..-...|..+. ..+. +.+. ........+. ..+..... .+.+ ........+..+. T Consensus 119 ~~~~~~~g~~g~v~~~~~~~~~~~~~v~t~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~v~v~~~~~~~~~~v 198 (659) T protein:vir:72 119 KYVSDDIETEGKITEVDADGKIKKINIPTGKNYAKAKEVGEYPTLGSNWTAEISSSSSGLAAVITLGKIITDSGILLAEI 198 (659) T ss_pred eeccccccccceEEEeeccccceeeeeccccccccccccccccccccceeeEEeeccccccceEEEEEeecCcceeeeec Confidence 00000111100 0000 0000 0000000000 00000000 0000 0000000000000 Q ss_pred CCChHHHHHHHHHhhcccCCccCCCEEEEEeeecccceeeEeeccccccchhhheeeeeEEEEEEc-------------- Q lcl|NC_017984. 55 GLESPIYKFATVYFNGFRNATTRPNSLFITKYNLTDVPASLIGGDITSTTLADLKLINGTLTIVVD-------------- 120 (487) Q Consensus 55 g~~s~ey~aA~~yF~g~~~q~p~P~~l~igr~~~~~~~~~l~g~~~~~~~~~~~~~~~g~~~iti~-------------- 120 (487) ..++..-.. .-+... -........++....+. +..++.. ........+...+.+. T Consensus 199 -~~~~~a~~~-~~~~~~--v~~~~~~~~~a~~~gt~------g~~~tv~-i~~~~~~~~~~~~~v~~~~~~~~~a~~~~~ 267 (659) T protein:vir:72 199 -ENAEAAMTA-VDFQAN--LKKYGIPGVVALYPGEL------GDKIEIE-IVSKADYAKGASALLPIYPGGGTRASTAKA 267 (659) T ss_pred -cccchhhhc-cccccc--ccccccceeeecccccc------ccceeEE-Eccccccccceeeeeeccccccccccccee Confidence 000000000 000000 00000000011100000 0000000 0000000000000000 Q ss_pred -----cce------------EEE-Eeeccc---cCchHHHHHhhhhhheeeEEEecccceEEEEecccccceeEEecccc Q lcl|NC_017984. 121 -----GVS------------KSV-PVDLAT---ANSYSDAAALIATALTLPCTYESTVKGFVIKSGTSGANSTISFATGD 179 (487) Q Consensus 121 -----g~~------------~~~-~i~~s~---ats~~~vA~~i~t~l~a~vt~d~~~~~f~its~t~g~~stit~atgd 179 (487) +.. ... ...++. ..+.......+...+ +...+.++........ 
.+ T Consensus 268 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~~v~~~~~~~~--------~~ 333 (659) T protein:vir:72 268 VFGYGPQTDSQYAIIVRRNDAIVQSVVLSTKRGEKDIYDSNIYIDDFF------AKGGSEYIFATAQNWP--------EG 333 (659) T ss_pred eeeeecccccccceeeecccceeeeeeeeeccccccccchhhhhhhhh------hcCCceEEEEEecccC--------Cc Confidence 000 000 000000 000000000000000 0011111111100000 00 Q ss_pred hhhhhhhccccceeEecCcccccHHHHHHHHHhcccceeEEEEEeccCC------hhHHHHHHHHHhccCceEEEEEccc Q lcl|NC_017984. 180 ISDDLKLTQETGAVLNNHTAADTPTTGALNALAFSQNFVNITYSEGVFN------EDALKDLALWVTSQNSRFKLYTWGL 253 (487) Q Consensus 180 ~a~~l~lt~~~gA~~~~G~aaet~~~al~a~~~~~~~wy~~~~~~~~~~------~~~i~a~A~w~~a~~~~~~~~~~~~ 253 (487) .+..+.+..+.. ...+........++.++.....--..++.+..... .....++...++....++.+..... T Consensus 334 ~~~~~~l~gg~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~p~~~~~~~~~~~~v~~~l~~~~~~~~~~~~~~d~~~ 411 (659) T protein:vir:72 334 FSGILTLSGGLS--SNAEVTAGDLMEAWDFFADRESVDVQLFIAGSCAGESLETASTVQKHVVSIGDARQDCLVLCSPPR 411 (659) T ss_pred cccccccccccc--ccccccchhHHHHHHHhhhccccceeEEEecCCCCcchhhhHHHHHHHHHHHhhhCCEEEEEcCcc Confidence 000000100000 00011112233445544433221112333322111 1223445666666555555432211 Q ss_pred cccc---cccchHHHHHHHhC-----------CcceEEEecC-----CC--------chHHHHHHHHHhcCcCcCCc-ee Q lcl|NC_017984. 254 DPVA---LGQSGASFGEWAKE-----------NTSGVVPLYG-----TF--------DKAAFFCGVSGSINYQEENG-RT 305 (487) Q Consensus 254 ~~~~---~~~~~~~~~~l~~~-----------~~~~t~~~y~-----~~--------~~~a~~~g~~as~~~~~~~g-s~ 305 (487) .... .......+..+.+. +..+....|. ++ .....++|..+.++.++ | .. 
T Consensus 412 ~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~sg~vAGl~Ar~D~~~--G~~~ 489 (659) T protein:vir:72 412 ETVVGIPVTRAVDNLVNWRTAAGSYTDNNFNISSTYAAIDGNHKYQYDKYNDVNRWVPLAADIAGLCARTDNVS--QTWM 489 (659) T ss_pred ccccCCCcccCHHHHHHHHhhccccccccccccceeEEEEcCceeeccccCCceEEechHHHHHHHHHHhhccC--CcEE Confidence 1100 11111111111110 1223322221 01 23456778888776433 3 22 Q ss_pred eeeeeecCccc-----ccCCCHHHHHHHHhCCceEEEEecCCCceEEEEECCEEcCC---ceehHHHHHHHHHHHHHHHH Q lcl|NC_017984. 306 TTAFRSQDGLV-----PDVTNEADAETLVKNGYSFYGAWATANDRFQFAGNGSVTGQ---YKWIDNFDFQVFLRTQLQLA 377 (487) Q Consensus 306 T~~fk~l~Gv~-----~~~lt~t~~~al~~~~~n~y~~~~~~~~~~~~~~~G~~sgg---~~~iD~~~~~dWl~~~iq~~ 377 (487) ...+|.+.||. ...+++.|.+.|..+++|+...+.+.+ +.+|-.-++++. +.||-+.+-.+|+...|+.. T Consensus 490 span~~~~~i~g~~~~~~~~~~~~~~~Ln~~gIn~i~~~~g~G--~~~wG~rT~~~~~s~~~~i~vrR~~~~i~~si~~~ 567 (659) T protein:vir:72 490 SPAGYNRGQILNVIKLAIETRQAQRDRLYQEAINPVTGTGGDG--YVLYGDKTATSVPSPFDRINVRRLFNMLKTNIGRS 567 (659) T ss_pred ccCCeeeceeeccccccccCChhHHHHHhhCCceEEEEecCCe--EEEEcccccCCCCcccceEeehhHHHHHHHHHHHH Confidence 23344444442 235789999999999999999987765 456554444443 35888999999999998888 Q ss_pred HHHHHHhcCCCCcCHhhHHHHHHHHHHHHHHHHhcCccccCcccCccccccccccccCccccceeeeeeEEe--ccCCCH Q lcl|NC_017984. 378 YMNMFQAQKTIPYNDQGIATVRAYSQDPIDQGINFGGIRAGVNLSNAQKFQVNQEAGFDAASQLFTKGWALS--VTLPDS 455 (487) Q Consensus 378 l~~ll~~~~kIPyt~~G~~~l~~~v~~vl~~a~~nG~Ia~Gv~l~~~q~~~~~~~~g~~~~~~~~~~Gy~~~--~~~~s~ 455 (487) +...+=. |.++.=...|+..|+.-|++-+++|.|. ||++. .+..++ T Consensus 568 ~~~~v~e----~n~~~l~~~i~~~i~~fL~~l~~~gal~----------------------------~~~V~~d~~~nt~ 615 (659) T protein:vir:72 568 SKYRLFE----LNNAFTRSSFRTETAQYLQGNKALGGIY----------------------------EYRVVCDTTNNTP 615 (659) T ss_pred HHHhhcC----CCCHHHHHHHHHHHHHHHHHHHhcCcee----------------------------eEEEEEcCCCCCH Confidence 8764322 5678888899999999999999999883 34554 456788 Q ss_pred HHHhhcccCCeEEEEEECCeEEEEEEEEEeeC Q lcl|NC_017984. 
456 QTRVARESFIIKLFYTDGSSMQRLEMTATNVQ 487 (487) Q Consensus 456 ~dra~R~~~~i~~~~~~aGAIh~v~i~gt~vq 487 (487) +|+.+-+.. +.+.+...-.+++|.++ ++| T Consensus 616 ~~i~~G~~~-~~i~~~p~~pae~I~~~--~~~ 644 (659) T protein:vir:72 616 SVIDRNEFV-ATFYIQPARSINYITLN--FVA 644 (659) T ss_pred HHhhCCeEE-EEEEEEecCCccEEEEE--EEE Confidence 998888874 99999999999999997 557 No 55 >protein:vir:101804 Length: 663 # NCBI annotation: gp18 # Family: family:all:661 # MgeID: mge:1580 # MgeName: 31 # Cross-refs: genbank:acc:YP_238881;genbank:gi:66391956;genbank:GeneID:3416631 Probab=93.40 E-value=0.0077 Score=32.17 Aligned_cols=420 Identities=11% Similarity=0.056 Sum_probs=156.1 Q ss_pred CCcCCcccc-ce-------EEEeeeeeccccccccc--ce-eEEecCcceeeee----ecc---HHHHHHhcCC------ Q lcl|NC_017984. 1 MQFNSIPAS-NI-------AAVYPAVIGGGGNPLGL--NT-NLFVQDAIYPNYE----YFS---NTLVGQHYGL------ 56 (487) Q Consensus 1 ~~~~~ip~s-~i-------V~V~~~~~~~~~~~~~~--~~-ll~~~~~~~~~~~----y~s---~~~V~~~fg~------ 56 (487) ..-.+++.. ++ ..+.+...++.....+- +. .-+..+....... .+. ..++..+.+. T Consensus 120 ~~~~~~~~~~~~~~~~~~~~~~~v~~~ta~~~~~~~~v~~~~~~~~~~~~~~s~~s~~~~~a~~v~~v~~d~~~~v~~~~ 199 (663) T protein:vir:10 120 YNNATITEAGKVTAVDSDGKIKSLFVPTAEIIAKTRQLGTYPTLGDNWRIDVSGASGGSAAALALGNIVVDSGVTFGNSE 199 (663) T ss_pred cccccccccccceeeecccceEEEeeccccccccccccccceeeccceeeEeeeccCccccccccceeccccceEEeecc Confidence 110111000 00 00111111111000000 00 0000000000000 000 0000001000 Q ss_pred ------ChHHHHHHHHHhhcccCCccCCCEEEEEeeecccceeeEeecc-ccccch------------hhhe------ee Q lcl|NC_017984. 57 ------ESPIYKFATVYFNGFRNATTRPNSLFITKYNLTDVPASLIGGD-ITSTTL------------ADLK------LI 111 (487) Q Consensus 57 ------~s~ey~aA~~yF~g~~~q~p~P~~l~igr~~~~~~~~~l~g~~-~~~~~~------------~~~~------~~ 111 (487) ..+++......+. .+.-...+-|.|... ....+.... ...... .... .. 
T Consensus 200 ~a~~~~t~~~~~~~~~~~~-----~~~i~A~~~G~~Gn~-i~V~i~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~ 273 (663) T protein:vir:10 200 DAPAVMTSPAVMEKYAKFG-----MPLISAVYPGEIGST-VEVEIVSKTAFNSGAQQTIYPFGGTRTSNARGVIQYGPMT 273 (663) T ss_pred ccccccccccccccccccc-----cceEEeccCCcccce-eeeeeccccccccccccceecccccccccccceeeccccc Confidence 0011110000000 000000000111000 000000000 000000 0000 00 Q ss_pred eeE--EEEEEccceEEEEeeccccC---chHHHHHhhhhhheeeEEEecccceEEEEecccccceeEEecccchhhhhhh Q lcl|NC_017984. 112 NGT--LTIVVDGVSKSVPVDLATAN---SYSDAAALIATALTLPCTYESTVKGFVIKSGTSGANSTISFATGDISDDLKL 186 (487) Q Consensus 112 ~g~--~~iti~g~~~~~~i~~s~at---s~~~vA~~i~t~l~a~vt~d~~~~~f~its~t~g~~stit~atgd~a~~l~l 186 (487) +.. +.+..+|..... ..++... ........+...+ ....+.+......... T Consensus 274 ~~~~~~~~~~~~~~~~~-~~~s~~~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~~~~~----------------- 329 (663) T protein:vir:10 274 DDQFAIIVRRDGIVVES-TVLSTRKGDRDVYGSNIFMDDYF------RNGGSNFIFASSEGWP----------------- 329 (663) T ss_pred ccceeeEeecCCcceee-ecccccccccccccchhhhhhhh------cCCcceEEEEeecccC----------------- Confidence 000 000011100000 0000000 0000000000000 0001111111000000 Q ss_pred cccccee-EecCcc------cccHHHHHHHHHhcccceeEEEEEeccCC------hhHHHHHHHHHhccCceEEEEEccc Q lcl|NC_017984. 187 TQETGAV-LNNHTA------ADTPTTGALNALAFSQNFVNITYSEGVFN------EDALKDLALWVTSQNSRFKLYTWGL 253 (487) Q Consensus 187 t~~~gA~-~~~G~a------aet~~~al~a~~~~~~~wy~~~~~~~~~~------~~~i~a~A~w~~a~~~~~~~~~~~~ 253 (487) ....... ...|.+ ......++..+.+...-.-.++++..... .....++...++....+|....... T Consensus 330 ~~~~~~~~l~gg~d~~~~~~~~~~~~~~~~l~~~~~~~~~~~i~~~~~~~~~~~~~~v~~~l~~~a~~~~~~~ai~d~p~ 409 (663) T protein:vir:10 330 AGFTGIIQLGGGTSANADVGADELIKGWDLFSDREALHVNLMIAGACGSDGAEIASTVQKYVVSLADDRQDCVAIVNPPA 409 (663) T ss_pred ccccceeEeccccCCccccchhhhHHHHHhhhcccccceeEEEeccCCCCchhhHHHHHHHHHHHHHhhCCEEEEEecCc Confidence 0000000 111111 11223334444433221222333322211 1123345556665444544432211 Q ss_pred cccc---cccchHHHHHHHh------------C--CcceEEEecC------C-------CchHHHHHHHHHhcCcCcCCc Q lcl|NC_017984. 
254 DPVA---LGQSGASFGEWAK------------E--NTSGVVPLYG------T-------FDKAAFFCGVSGSINYQEENG 303 (487) Q Consensus 254 ~~~~---~~~~~~~~~~l~~------------~--~~~~t~~~y~------~-------~~~~a~~~g~~as~~~~~~~g 303 (487) .... .......+..+.. . +..+..+.|. + -....+++|.++.++.++.+ T Consensus 410 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~sg~vAGl~Ar~D~~~g~- 488 (663) T protein:vir:10 410 ELMVGIPTSTAVKNIVEWRNGMTGSGEVVDNNMNISSTYAFIIGNYKYQYDKYNDINRWVPLAADIAGLCAYTDQVSHP- 488 (663) T ss_pred ccccccccccchHHHHHHHHhccccccchhhhcccCccceEEEcCceEEecccCCceEEechhHHHHHHHHHhhccCCc- Confidence 1000 0111111111110 0 1122222221 1 12346678888888755421 Q ss_pred eeeeeeeecC---ccc--ccCCCHHHHHHHHhCCceEEEEecCCCceEEEEECCEEcCC---ceehHHHHHHHHHHHHHH Q lcl|NC_017984. 304 RTTTAFRSQD---GLV--PDVTNEADAETLVKNGYSFYGAWATANDRFQFAGNGSVTGQ---YKWIDNFDFQVFLRTQLQ 375 (487) Q Consensus 304 s~T~~fk~l~---Gv~--~~~lt~t~~~al~~~~~n~y~~~~~~~~~~~~~~~G~~sgg---~~~iD~~~~~dWl~~~iq 375 (487) .....+|.+. |+. ...+++.|.+.|..+|+|+...+-+. ..+.+|-.-++++. +.||-+.+-.+|+.+.|+ T Consensus 489 ~~sPan~~~~~i~g~~~~~~~~~~~e~~~Ln~~gin~i~~~~~~-~G~~~wG~rT~~~~~s~~~~i~vrR~~~~i~~si~ 567 (663) T protein:vir:10 489 WMSPAGYRRGQIRNCIKLAIEPKQSMRDTMYQVAINPVTGFAGG-DGFVLFGDKMATQVPSPFDRINVRRLFNMLKKNIG 567 (663) T ss_pred eEccCCceeccccccccceeecChhHHHHHhhCCceEEEEEeCC-CcEEEEcccccCCCCcccceEehhhHHHHHHHHHH Confidence 1122334433 332 24579999999999999998877542 12445544334443 357899999999999999 Q ss_pred HHHHHHHHhcCCCCcCHhhHHHHHHHHHHHHHHHHhcCccccCcccCccccccccccccCccccceeeeeeEEe--ccCC Q lcl|NC_017984. 376 LAYMNMFQAQKTIPYNDQGIATVRAYSQDPIDQGINFGGIRAGVNLSNAQKFQVNQEAGFDAASQLFTKGWALS--VTLP 453 (487) Q Consensus 376 ~~l~~ll~~~~kIPyt~~G~~~l~~~v~~vl~~a~~nG~Ia~Gv~l~~~q~~~~~~~~g~~~~~~~~~~Gy~~~--~~~~ 453 (487) ..+...+=. |.|+.=...|+..|+.-|++-+++|.|. ||++. .+.. 
T Consensus 568 ~~~~~~v~e----pn~~~l~~~i~~~i~~~L~~l~~~gal~----------------------------g~~v~~d~~~n 615 (663) T protein:vir:10 568 DTSKYELFE----NNDAFTRQSFRMETSQYLDGIRSLGGCY----------------------------DFRVVCDTTNN 615 (663) T ss_pred HHHHHhccC----CCCHHHHHHHHHHHHHHHHHHHhCCcee----------------------------eeEEEEcCCCC Confidence 888764322 6788888999999999999999999884 34554 4567 Q ss_pred CHHHHhhcccCCeEEEEEECCeEEEEEEEEEeeC Q lcl|NC_017984. 454 DSQTRVARESFIIKLFYTDGSSMQRLEMTATNVQ 487 (487) Q Consensus 454 s~~dra~R~~~~i~~~~~~aGAIh~v~i~gt~vq 487 (487) |++|+.+.+. -+.+.+...-.+++|.++ ++| T Consensus 616 t~~~i~~G~~-~~~i~~~p~~pae~i~~~--~~~ 646 (663) T protein:vir:10 616 TPNVIDRNEF-VGTIYVKPPRSINYITLN--MVA 646 (663) T ss_pred CHHHhhCCeE-EEEEEEEecCCcceEEEE--EEE Confidence 8999888888 499999999999999997 556 No 56 >protein:vir:100323 Length: 393 # NCBI annotation: tail sheath protein FI # Family: family:all:115 # MgeID: mge:1484 # MgeName: phi-MhaA1-PHL101 # Cross-refs: genbank:acc:YP_655492;genbank:gi:109289960;genbank:GeneID:4157366 Probab=92.69 E-value=0.01 Score=31.46 Aligned_cols=357 Identities=10% Similarity=0.038 Sum_probs=176.6 Q ss_pred CCcCC--ccccceEEEeeeeeccccccccccee--EEec----Ccceee---eeeccHHHHHHhcCCChHHHHHHHHHhh Q lcl|NC_017984. 1 MQFNS--IPASNIAAVYPAVIGGGGNPLGLNTN--LFVQ----DAIYPN---YEYFSNTLVGQHYGLESPIYKFATVYFN 69 (487) Q Consensus 1 ~~~~~--ip~s~iV~V~~~~~~~~~~~~~~~~l--l~~~----~~~~~~---~~y~s~~~V~~~fg~~s~ey~aA~~yF~ 69 (487) |+|.. +|==.++.+..+.. +....+.... +-+. ....|. ...++..+....||.....+.+-..+|. 
T Consensus 1 m~m~~~~~~GV~v~e~~~g~~--~i~~~~tav~~~vgta~~~~~~~~pln~pv~i~s~~~~~~~~g~~g~L~~al~~~~~ 78 (393) T protein:vir:10 1 MSILDTYLHGVEVVEVNAGGV--TISTAATSVIGVVCTGDQADAETFPLNTPVLITNPLNYLEKAGSTGTLRRTLNSIGS 78 (393) T ss_pred CCCCCccCCCeEEEEcCCCcc--eecccCcceeEEEeeccCcCcccccCccceEecchHHHHHhhCCccchhhhhhhhhc Confidence 88876 34445555544332 2222223322 2221 112232 2357777888889988888888888886 Q ss_pred cccCCccCCCEEEEEeeecccceeeEeeccccccchhhheeeeeEEEEEEccceEEEEeeccccCchHHHHHhhhhhhee Q lcl|NC_017984. 70 GFRNATTRPNSLFITKYNLTDVPASLIGGDITSTTLADLKLINGTLTIVVDGVSKSVPVDLATANSYSDAAALIATALTL 149 (487) Q Consensus 70 g~~~q~p~P~~l~igr~~~~~~~~~l~g~~~~~~~~~~~~~~~g~~~iti~g~~~~~~i~~s~ats~~~vA~~i~t~l~a 149 (487) +. -...++-+......+ .. .. -.+.|.. .....+++........ T Consensus 79 ----~~--~~~~~vv~v~~~~~~---------~~----------t~-~~iig~~--------~~~~~tgl~al~~~~~-- 122 (393) T protein:vir:10 79 ----IV--KTPTVIVRVAESDDS---------DT----------LT-ANIVGTQ--------ENGKFTGIKALLTAQS-- 122 (393) T ss_pred ----cc--CceEEEeecccCccc---------cc----------cc-ccccccc--------ccchhhHHHHHHhhhh-- Confidence 22 122233222111000 00 00 0000000 0000011110000000 Q ss_pred eEEEecccceEEEEecccccceeEEecccchhhhhhhccccceeEecCcccccHHHHHHHHHhcccceeEEEEEeccCCh Q lcl|NC_017984. 150 PCTYESTVKGFVIKSGTSGANSTISFATGDISDDLKLTQETGAVLNNHTAADTPTTGALNALAFSQNFVNITYSEGVFNE 229 (487) Q Consensus 150 ~vt~d~~~~~f~its~t~g~~stit~atgd~a~~l~lt~~~gA~~~~G~aaet~~~al~a~~~~~~~wy~~~~~~~~~~~ 229 (487) . .+ .. .......|........++..+.+.-+.- +.+.+.+.+ T Consensus 123 ---------~-------~~-----------------~~--p~li~apg~~~~~~~~al~~~~~~~~~~--~~v~d~~~~- 164 (393) T protein:vir:10 123 ---------T-------VF-----------------VK--PKLLCVPQHDNQAVATELLSVAKKLNAF--AFISDNGAT- 164 (393) T ss_pred ---------h-------cc-----------------ee--eeeeeeccccchHHHHHHHHHhhccCcE--EEEEcCCCC- Confidence 0 00 00 0011223433344444554444433322 222222221 Q ss_pred hHHHHHHHHHhccCceEEEEEccccccccccchHHHHHHHhCCcceEEEecCCCchHHHHHHHHHhcCcCcCCceeeeee Q lcl|NC_017984. 
230 DALKDLALWVTSQNSRFKLYTWGLDPVALGQSGASFGEWAKENTSGVVPLYGTFDKAAFFCGVSGSINYQEENGRTTTAF 309 (487) Q Consensus 230 ~~i~a~A~w~~a~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~t~~~y~~~~~~a~~~g~~as~~~~~~~gs~T~~f 309 (487) ...++-.|.+.-+..+..+.+.--..... ..+..+.+ .....++|..+.++-++. =...... T Consensus 165 -t~~~ai~~~~~~~s~~~~~~~P~~~~~d~----------~~~~~~~~------p~s~~~Ag~~a~~d~~~G-~~~spaN 226 (393) T protein:vir:10 165 -TKEQAYTYRQNFSQREGMMIFGDWKSYNT----------DKKAYDTD------YAVARACALQAYIDKTVG-WHKNISN 226 (393) T ss_pred -CHHHHHHHhhhcCCceEEEEecccccccc----------cCCceeEe------ehhHHHHHHHHHhhcCCC-cEEccCC Confidence 22344566665555444443322111000 01111111 234667788887763321 1223345 Q ss_pred eecCcccc--------cCCCHHHHHHHHhCCceEEEEecCCCceEEEEECCEEcCCc--eehHHHHHHHHHHHHHHHHHH Q lcl|NC_017984. 310 RSQDGLVP--------DVTNEADAETLVKNGYSFYGAWATANDRFQFAGNGSVTGQY--KWIDNFDFQVFLRTQLQLAYM 379 (487) Q Consensus 310 k~l~Gv~~--------~~lt~t~~~al~~~~~n~y~~~~~~~~~~~~~~~G~~sgg~--~~iD~~~~~dWl~~~iq~~l~ 379 (487) |.+.||.. ..++++|++.|..+|+|++.. +.+ +.+|-.-+++++. .||-+.+-.+|+++.|+..+. T Consensus 227 ~~l~gi~~~~~~~~~~~~~~~~~~~~Ln~~gI~t~~~--~~G--~~~wG~rT~s~d~~~~~i~vrR~~~~i~~~i~~~~~ 302 (393) T protein:vir:10 227 VELDGVTGITKAVEFDINESSTEANYLNEKGITICLN--HNG--FRYWGSRTLATDTRWAFQQSVRTAQIIKETIGAGLA 302 (393) T ss_pred ceeeceeecceecccccCCCcchhHhHhhcCceEEEc--CCC--EEEEcccccCCCcccceeehhhHHHHHHHHHHHHHH Confidence 66666653 234688999999999999854 333 5566443444432 378888889999888888877 Q ss_pred HHHHhcCCCCcCHhhHHHHHHHHHHHHHHHHhcC--ccccCcccCccccccccccccCccccceeeeeeEEeccCCCHHH Q lcl|NC_017984. 380 NMFQAQKTIPYNDQGIATVRAYSQDPIDQGINFG--GIRAGVNLSNAQKFQVNQEAGFDAASQLFTKGWALSVTLPDSQT 457 (487) Q Consensus 380 ~ll~~~~kIPyt~~G~~~l~~~v~~vl~~a~~nG--~Ia~Gv~l~~~q~~~~~~~~g~~~~~~~~~~Gy~~~~~~~s~~d 457 (487) ..+- | |.++.=...++..++.-|+.-+++| .|..+. 
+ ++ -++.+++| T Consensus 303 ~~v~---e-~~~~~~~~~i~~~i~~~L~~l~~~g~~al~g~~---------------------v-----~~-~~~nt~~~ 351 (393) T protein:vir:10 303 WAVD---M-PLTPLRVKTMLEAINNKLRSWASGDDPRILGAR---------------------V-----WV-AEEITADI 351 (393) T ss_pred Hhcc---C-CCCHHHHHHHHHHHHHHHHHHHhccccccccce---------------------E-----Ee-cCCCCHHH Confidence 6432 3 6788888889999999998887766 343210 0 11 23466777 Q ss_pred HhhcccCCeEEEEEECCeEEEEEEEEEeeC Q lcl|NC_017984. 458 RVARESFIIKLFYTDGSSMQRLEMTATNVQ 487 (487) Q Consensus 458 ra~R~~~~i~~~~~~aGAIh~v~i~gt~vq 487 (487) ..+-+.. +.+.+...-.++++++.....= T Consensus 352 i~~G~~~-~~i~~~p~~p~e~I~~~~~~~~ 380 (393) T protein:vir:10 352 IKSGKFV-IKYDYHWIPSLESLGLEQRVND 380 (393) T ss_pred hhCCEEE-EEEEEEecCCcceEEEEEEEch Confidence 6665553 7888899999999998853321 No 57 >protein:vir:101187 Length: 663 # NCBI annotation: tail sheath protein # Family: family:all:661 # MgeID: mge:1582 # MgeName: 44RR2.8t # Cross-refs: genbank:acc:NP_932509;genbank:gi:37651635;genbank:GeneID:2610680 Probab=92.07 E-value=0.013 Score=30.93 Aligned_cols=420 Identities=11% Similarity=0.067 Sum_probs=156.1 Q ss_pred CCcCCccc---cceEEEeeeeeccccccccccee--------------------------EEecCcceeeeeeccHHHHH Q lcl|NC_017984. 1 MQFNSIPA---SNIAAVYPAVIGGGGNPLGLNTN--------------------------LFVQDAIYPNYEYFSNTLVG 51 (487) Q Consensus 1 ~~~~~ip~---s~iV~V~~~~~~~~~~~~~~~~l--------------------------l~~~~~~~~~~~y~s~~~V~ 51 (487) =+.+...+ ...+.+.+.-.......+-.+.. ++.......... .... T Consensus 127 ~~~~~~~~~~n~~~~~v~~~~a~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~v~~vv~~~~~~~~~~----~~a~ 202 (663) T protein:vir:10 127 EAGKVTAVDSDGKIKSLFVPTAEIIAKTRQLGTYPTLGDNWRIDVSGASGGSAAALALGNIVVDSGVTFGNS----EDAP 202 (663) T ss_pred cccceeeeccCCceEEEEeccccccccccccceeeeccccceeEeeeccccccccccccceecccceeeEee----cccc Confidence 00000000 00011111000000000000000 000000000000 0000 Q ss_pred HhcCCChHHHHHHHHHhhcccCCccCCCEEEEEeeeccc------ceeeEeeccccc---------cchh--hhe-eeee Q lcl|NC_017984. 
52 QHYGLESPIYKFATVYFNGFRNATTRPNSLFITKYNLTD------VPASLIGGDITS---------TTLA--DLK-LING 113 (487) Q Consensus 52 ~~fg~~s~ey~aA~~yF~g~~~q~p~P~~l~igr~~~~~------~~~~l~g~~~~~---------~~~~--~~~-~~~g 113 (487) . +...+.+.....-. ..++-...+.|-|...- ....-.+..+.. .... ... ..+. T Consensus 203 ~--~~~~~~~~~~~~~~-----~~~~~~a~~~G~~Gn~i~v~i~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~ 275 (663) T protein:vir:10 203 A--VMTSPAVMEKYAKF-----GMPLVSAVYPGEIGSTVEVEIVSKTAFNSGAQQTIYPFGGTRTSNARGVIQYGPMTDD 275 (663) T ss_pred c--cccccchhhhcccc-----cceeeeeecccccccceeEEecccccccccccccccccccccccccceeeeecccccc Confidence 0 00000000000000 01111111111111000 000000000000 0000 000 0000 Q ss_pred --EEEEEEccceEEE-Eeecc-ccCchHHHHHhhhhhheeeEEEecccceEEEEeccc---ccceeEEecccchhhhhhh Q lcl|NC_017984. 114 --TLTIVVDGVSKSV-PVDLA-TANSYSDAAALIATALTLPCTYESTVKGFVIKSGTS---GANSTISFATGDISDDLKL 186 (487) Q Consensus 114 --~~~iti~g~~~~~-~i~~s-~ats~~~vA~~i~t~l~a~vt~d~~~~~f~its~t~---g~~stit~atgd~a~~l~l 186 (487) .+.+..+|..... .+.+. ...........+...+ ....+.+....... +....+....|.- T Consensus 276 ~~~~~~~~~~~~~~~~~~s~~~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~l~gg~d------ 343 (663) T protein:vir:10 276 QFAIIVRRDGIVVESTVLSTRKGDRDVYGSNIFMDDYF------RNGGSNFIFASSEGWPAGFTGIIQLGGGTS------ 343 (663) T ss_pred ceeEEEecCCcceeeeeeeecccccccchhhhhhhhhh------ccCcceEEEEeecccCccccceeEcccccC------ Confidence 0111111211100 00000 0000000000011111 00111111111000 0000011100000 Q ss_pred ccccceeEecCcccccHHHHHHHHHhcccceeEEEEEeccC--C----hhHHHHHHHHHhccCceEEEEEccccccc-c- Q lcl|NC_017984. 187 TQETGAVLNNHTAADTPTTGALNALAFSQNFVNITYSEGVF--N----EDALKDLALWVTSQNSRFKLYTWGLDPVA-L- 258 (487) Q Consensus 187 t~~~gA~~~~G~aaet~~~al~a~~~~~~~wy~~~~~~~~~--~----~~~i~a~A~w~~a~~~~~~~~~~~~~~~~-~- 258 (487) +. ..........+++++.+...-.-.+++...+. . .....++...++....+|........... . 
T Consensus 344 ----~~---~~~~~~~~~~~~~~l~~~~~~~~~~~i~~~~~~~~~~~~~~v~~al~~~a~~~~~~~ai~d~p~~~~~~~~ 416 (663) T protein:vir:10 344 ----AN---ADVGADELIKGWDLFSDREALHVNLMIAGACGSDGAEIASTVQKYVVSLADDRQDCVAIVNPPAELMVGIP 416 (663) T ss_pred ----CC---ccccchhhHHHHHhhhcccccceeEEEeccCCCCchhhHHHHHHHHHHHHHhhCCEEEEEecCcccccccc Confidence 00 00011222334444443322111222322211 1 11233445555554444443322111000 0 Q ss_pred -ccchHHHHHHHh------------CCc--ceEEEecC------C-------CchHHHHHHHHHhcCcCcCCceeeeeee Q lcl|NC_017984. 259 -GQSGASFGEWAK------------ENT--SGVVPLYG------T-------FDKAAFFCGVSGSINYQEENGRTTTAFR 310 (487) Q Consensus 259 -~~~~~~~~~l~~------------~~~--~~t~~~y~------~-------~~~~a~~~g~~as~~~~~~~gs~T~~fk 310 (487) ......+..+.+ .++ .+....|. + -.....++|.++.++.++.+ ......| T Consensus 417 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~l~~P~~~~~d~~~~~~~~~p~s~~vAGl~Ar~D~~~g~-~~sPan~ 495 (663) T protein:vir:10 417 TSTAVKNIVEWRNGMTGSGEVVDNNMNISSTYAFIIGNYKYQYDKYNDINRWVPLAADIAGLCAYTDQVSHP-WMSPAGY 495 (663) T ss_pred cccchHHHHHHHHhccccccchhhccccCcceEEEEccceEEecccCCceEEechhHHHHHHHHHhhccCCc-eEccCCc Confidence 001011111100 111 22222221 1 12356677888887754421 1112234 Q ss_pred ecCccc-----ccCCCHHHHHHHHhCCceEEEEecCCCceEEEEECCEEcCC---ceehHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_017984. 311 SQDGLV-----PDVTNEADAETLVKNGYSFYGAWATANDRFQFAGNGSVTGQ---YKWIDNFDFQVFLRTQLQLAYMNMF 382 (487) Q Consensus 311 ~l~Gv~-----~~~lt~t~~~al~~~~~n~y~~~~~~~~~~~~~~~G~~sgg---~~~iD~~~~~dWl~~~iq~~l~~ll 382 (487) .+.+|. ...+++.|.+.|..+|+|+...+-+. ..+.+|-.-++++. 
+.||-+.+-.+|+.+.|+..+...+ T Consensus 496 ~~~~i~g~~~~~~~~~~~~~~~Ln~~gin~i~~~~~~-~G~~~wG~rT~s~~~s~~~~i~vrR~~~~i~~si~~~~~~~v 574 (663) T protein:vir:10 496 RRGQIRNCIKLAIEPKQSMRDTMYQVAINPVTGFAGG-DGFVLFGDKMATQVPSPFDRINVRRLFNMLKKNIGDTSKYEL 574 (663) T ss_pred eeccccccccceeccChhHHHHHhhCCceEEEEEeCC-CcEEEEcccccCCCCcccceEehhhHHHHHHHHHHHHHHHhc Confidence 333332 24689999999999999999887542 12455544444443 3478899999999999888887643 Q ss_pred HhcCCCCcCHhhHHHHHHHHHHHHHHHHhcCccccCcccCccccccccccccCccccceeeeeeEEe--ccCCCHHHHhh Q lcl|NC_017984. 383 QAQKTIPYNDQGIATVRAYSQDPIDQGINFGGIRAGVNLSNAQKFQVNQEAGFDAASQLFTKGWALS--VTLPDSQTRVA 460 (487) Q Consensus 383 ~~~~kIPyt~~G~~~l~~~v~~vl~~a~~nG~Ia~Gv~l~~~q~~~~~~~~g~~~~~~~~~~Gy~~~--~~~~s~~dra~ 460 (487) =. |.|+.-...|+..|+.-|++-+++|.|. ||++. .+..|++|+.+ T Consensus 575 ~e----~n~~~l~~~i~~~i~~~L~~l~~~gal~----------------------------g~~v~~d~~~nt~~~i~~ 622 (663) T protein:vir:10 575 FE----NNDAFTRQSFRMETSQYLDGIRSLGGCY----------------------------DFRVVCDTTNNTPNVIDR 622 (663) T ss_pred cC----CCCHHHHHHHHHHHHHHHHHHHhcCcee----------------------------eeEEEEcCCCCCHHHhhC Confidence 22 6788889999999999999999999885 24454 45678999988 Q ss_pred cccCCeEEEEEECCeEEEEEEEEEeeC Q lcl|NC_017984. 461 RESFIIKLFYTDGSSMQRLEMTATNVQ 487 (487) Q Consensus 461 R~~~~i~~~~~~aGAIh~v~i~gt~vq 487 (487) .+.. +.+.+.....+++|+++ ++| T Consensus 623 G~~~-~~i~~~p~~pae~i~~~--~~~ 646 (663) T protein:vir:10 623 NEFV-GTIYVKPPRSINYITLN--MVA 646 (663) T ss_pred CeEE-EEEEEEecCCcceEEEE--EEE Confidence 8884 99999999999999987 556 No 58 >protein:vir:6594 Length: 666 # NCBI annotation: tail sheath protein # Family: family:all:661 # MgeID: mge:139 # MgeName: RB49 # Cross-refs: genbank:acc:NP_891725;genbank:gi:33620670;genbank:GeneID:1725344 Probab=92.05 E-value=0.013 Score=30.92 Aligned_cols=422 Identities=11% Similarity=0.066 Sum_probs=166.2 Q ss_pred CCcCCccccc--eEEEeeeeeccccc-cccccee-EEecCcc-eeeeeeccHHHHHHhcCCChHHHHHHHHHhhcccCCc Q lcl|NC_017984. 
1 MQFNSIPASN--IAAVYPAVIGGGGN-PLGLNTN-LFVQDAI-YPNYEYFSNTLVGQHYGLESPIYKFATVYFNGFRNAT 75 (487) Q Consensus 1 ~~~~~ip~s~--iV~V~~~~~~~~~~-~~~~~~l-l~~~~~~-~~~~~y~s~~~V~~~fg~~s~ey~aA~~yF~g~~~q~ 75 (487) +. +.++.-. +.+....+...... ...+... .+..... .+........-....+....+.. .. T Consensus 153 ~~-~~~g~~~~l~~~~~~~~~~~~~~~~~a~sv~~~~~~g~~~~~~~~~a~~~~~~~~~~~~~~~~------------~~ 219 (666) T protein:vir:65 153 HA-KAIGVYPELDGGWTAEFTSSSGNGSAALSVTKIVTDSGLLLTDLETSRANITNQTFLTKLKKY------------DM 219 (666) T ss_pred cc-cccCcceeEeeccceeecccCcccccceeeeecccccceeeeeeccccccccccccccccccc------------cc Confidence 21 1111000 00000011100000 0000000 0000000 00000000000000000000000 00 Q ss_pred cCCCEEEEEeeecccc------------eeeE----eecc--ccccchhhheeee--eEEEEEEccceEEEEeeccccCc Q lcl|NC_017984. 76 TRPNSLFITKYNLTDV------------PASL----IGGD--ITSTTLADLKLIN--GTLTIVVDGVSKSVPVDLATANS 135 (487) Q Consensus 76 p~P~~l~igr~~~~~~------------~~~l----~g~~--~~~~~~~~~~~~~--g~~~iti~g~~~~~~i~~s~ats 135 (487) +.-...+.|.|..... ...+ .+.. ............+ ..+.+..+|.... +..++.... T Consensus 220 ~a~~A~~~g~~g~~i~v~i~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~e-~~~~~~~~~ 298 (666) T protein:vir:65 220 PAVSAIYAGEIGNSLEVEILARSAFKNTAPDLTMYPYGGERTAARNLIPYAPQNDNQYAFIVRRDGVVVE-SYVLSTLKG 298 (666) T ss_pred ceeeeeeccccccceeEEeecccccccccccccccccccccccceeeecccccccccceeeeecCCcccc-eeecccCcc Confidence 0000111111111000 0000 0000 0000000000000 1122222332110 011111110 Q ss_pred ---hHHHHHhhhhhheeeEEEecccceEEEEe---cccccceeEEecccchhhhhhhccccceeEecCcc--cccHHHHH Q lcl|NC_017984. 136 ---YSDAAALIATALTLPCTYESTVKGFVIKS---GTSGANSTISFATGDISDDLKLTQETGAVLNNHTA--ADTPTTGA 207 (487) Q Consensus 136 ---~~~vA~~i~t~l~a~vt~d~~~~~f~its---~t~g~~stit~atgd~a~~l~lt~~~gA~~~~G~a--aet~~~al 207 (487) .......+...+. + ..++++... ...+....+.+..++-.. 
.......|.+ .......+ T Consensus 299 ~~~~~~~~~~~~~~~~-----~-~~~~~v~~~~~~~~~~~~~~~~~~~g~~~~-------~~~~~~~g~~~~~~~~~~~~ 365 (666) T protein:vir:65 299 DKDVYGNSIYMDDFFA-----R-GSSQYIYATAQGWVDGFSGIISLAGGVSAN-------EATTGGVGADPFIGAMMQGW 365 (666) T ss_pred cccccchhhhhhhhhc-----c-cccceeeeecccccccccceEEccCCCCcC-------cccccccccccccccHHHHH Confidence 0000001111110 0 011111111 111222233333322100 0000001111 12234555 Q ss_pred HHHHhcccceeEEEEEecc-----CChhHHHHHHHHHhccCceEEEEEcccc----ccccccchHHHHHHHhC------- Q lcl|NC_017984. 208 LNALAFSQNFVNITYSEGV-----FNEDALKDLALWVTSQNSRFKLYTWGLD----PVALGQSGASFGEWAKE------- 271 (487) Q Consensus 208 ~a~~~~~~~wy~~~~~~~~-----~~~~~i~a~A~w~~a~~~~~~~~~~~~~----~~~~~~~~~~~~~l~~~------- 271 (487) .++.+.......++.+... .......++...++....++........ .....+..+........ T Consensus 366 ~~~~~~~~~~~~~l~~p~~~~~~~~~~~v~~~l~~~~~~~~~~~a~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 445 (666) T protein:vir:65 366 DLFAERESIHVNLLIAGACAGEGDAFSTVQKHAVSIGDERQDCLVMVSPPRSTVVNIPVTTAIDNLIAWREGSGNYNENN 445 (666) T ss_pred HHHhhhhhccCCceeecCcCCccchhHHHHHHHHHHHhhccceEEEeccccceeeecCCCCCHHHHHHHHHhcccccccc Confidence 5555443222222222111 1233445566667765544433221110 11111111221111110 Q ss_pred ---CcceEEEecC------CC-------chHHHHHHHHHhcCcCcCCceeeeeeeecCccc-----ccCCCHHHHHHHHh Q lcl|NC_017984. 272 ---NTSGVVPLYG------TF-------DKAAFFCGVSGSINYQEENGRTTTAFRSQDGLV-----PDVTNEADAETLVK 330 (487) Q Consensus 272 ---~~~~t~~~y~------~~-------~~~a~~~g~~as~~~~~~~gs~T~~fk~l~Gv~-----~~~lt~t~~~al~~ 330 (487) +..+....|. +. .....+.|.++.++..+. =......|.+.||. .-.+++.|.+.|.. 
T Consensus 446 ~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~sg~vAGl~Ar~D~~~g-~~~span~~~~~i~g~~~~~~~~~~~~~~~Ln~ 524 (666) T protein:vir:65 446 MNINTTYAVIDGNYKYQYDKYNDVNRWVPLAADIAGLCARTDAVSQ-PWMSPAGYNRGQIMNVVKLAIEPRKAHRDRLYQ 524 (666) T ss_pred cccCcceEEEEcCceEEecccCCceeEechHHHHHHHHHHHhccCC-cEEccCCeecceeeccccceeecChhHHHhhhh Confidence 1122222221 10 234566777777764332 11122345444442 13578899999999 Q ss_pred CCceEEEEecCCCceEEEEECCEEcCC---ceehHHHHHHHHHHHHHHHHHHHHHHhcCCCCcCHhhHHHHHHHHHHHHH Q lcl|NC_017984. 331 NGYSFYGAWATANDRFQFAGNGSVTGQ---YKWIDNFDFQVFLRTQLQLAYMNMFQAQKTIPYNDQGIATVRAYSQDPID 407 (487) Q Consensus 331 ~~~n~y~~~~~~~~~~~~~~~G~~sgg---~~~iD~~~~~dWl~~~iq~~l~~ll~~~~kIPyt~~G~~~l~~~v~~vl~ 407 (487) +|+|++..+.+.+ +.+|-.-++++. +.||-+.+-.+|+++.|+..++-.+=. |.++.=...|+..|+.-|+ T Consensus 525 ~gIn~i~~~~~~G--~~~wG~rT~~~~~s~~~~i~vrR~~~~i~~si~~~~~~~v~e----pn~~~l~~~i~~~i~~~L~ 598 (666) T protein:vir:65 525 AAINPVIGAGGEG--FILMGDKTATTVPSPFDRINVRRLFNMLKKNIGDSSKYKLFE----NNDNFTRASFRMEVSQYLS 598 (666) T ss_pred CCceEEEEeCCCe--EEEEecccCCCCCcccceEehhhHHHHHHHHHHHHHHHhccC----CCCHHHHHHHHHHHHHHHH Confidence 9999999887655 455544444443 347888999999999998888764432 5678888999999999999 Q ss_pred HHHhcCccccCcccCccccccccccccCccccceeeeeeEEe--ccCCCHHHHhhcccCCeEEEEEECCeEEEEEEEEEe Q lcl|NC_017984. 408 QGINFGGIRAGVNLSNAQKFQVNQEAGFDAASQLFTKGWALS--VTLPDSQTRVARESFIIKLFYTDGSSMQRLEMTATN 485 (487) Q Consensus 408 ~a~~nG~Ia~Gv~l~~~q~~~~~~~~g~~~~~~~~~~Gy~~~--~~~~s~~dra~R~~~~i~~~~~~aGAIh~v~i~gt~ 485 (487) +.+++|.|. ||++. .+..+++|+.+.+. -+.+.+...-.+++|.++ + T Consensus 599 ~l~~~gal~----------------------------g~~V~~d~~~nt~~~i~~G~~-~~~i~~~p~~pae~i~~~--~ 647 (666) T protein:vir:65 599 TIRSLGGIY----------------------------DFRVQCDTTNNTPDVIDRNEF-VASMFIKPAKSINYIMLN--F 647 (666) T ss_pred HHHhCCcee----------------------------eeEEEEcCCCCCHHHhhCCeE-EEEEEEEecCCcceEEEE--E Confidence 999999885 34554 45678999988888 499999999999999998 4 Q ss_pred eC Q lcl|NC_017984. 
486 VQ 487 (487) Q Consensus 486 vq 487 (487) +| T Consensus 648 ~~ 649 (666) T protein:vir:65 648 TA 649 (666) T ss_pred EE Confidence 46 No 59 >protein:vir:100539 Length: 663 # NCBI annotation: gp18 tail sheath monomer # Family: family:all:661 # MgeID: mge:1488 # MgeName: 25 # Cross-refs: genbank:acc:YP_656380;genbank:gi:109290131;genbank:GeneID:4156517 Probab=91.52 E-value=0.015 Score=30.51 Aligned_cols=419 Identities=10% Similarity=0.057 Sum_probs=152.3 Q ss_pred CCcCCc----------cccceEEEeeeeeccccccccccee-EEecCcceeeeeeccH-------HHHHHh----c---- Q lcl|NC_017984. 1 MQFNSI----------PASNIAAVYPAVIGGGGNPLGLNTN-LFVQDAIYPNYEYFSN-------TLVGQH----Y---- 54 (487) Q Consensus 1 ~~~~~i----------p~s~iV~V~~~~~~~~~~~~~~~~l-l~~~~~~~~~~~y~s~-------~~V~~~----f---- 54 (487) ..-..+ +..+++.+.+.........+..... -+..+....+...+.. .++..+ + T Consensus 120 ~~~~~~~~~~~~~~~~~~g~~~~~~~~~a~~~~~a~~~~~~~~~~~a~~~~v~~~~~~~~~a~av~~i~~dg~vt~~~~~ 199 (663) T protein:vir:10 120 HNTTTVTEEGKVTKVDADGKIKALFVPSSAVIAKAKQLGTYPVLGDNWRAEVSGASGGSAATLTLGGIVVDSGVTFGNSE 199 (663) T ss_pred ccccccccCcceeeeccCCceeEEEeccccccccccccccccccccceeeEEeeccccccccceeEeeecCCceeEEeee Confidence 100000 0111111111100000000000000 0000000000000000 000000 0 Q ss_pred ----CCChHHHHHHHHHhhcccCCccCCCEEEEEeeecc-----------c----ceeeEeeccccccchhhh---eeee Q lcl|NC_017984. 55 ----GLESPIYKFATVYFNGFRNATTRPNSLFITKYNLT-----------D----VPASLIGGDITSTTLADL---KLIN 112 (487) Q Consensus 55 ----g~~s~ey~aA~~yF~g~~~q~p~P~~l~igr~~~~-----------~----~~~~l~g~~~~~~~~~~~---~~~~ 112 (487) +..+.+......-+ ..|+-...+.|-|... . .+..+.++.......... 
-..+ T Consensus 200 ~a~~~~~~~~~~~~~~~~-----~~~~~~a~~~g~~G~~i~v~~~~~~~~~~~~~~~v~~~~g~~~~~~~~~~~~g~~~~ 274 (663) T protein:vir:10 200 EAPDVMTSTKVLANFAKY-----GMPLISAVYPGEIGSTVEVEVISKTAFQSGAAQPIYPFGGTRASNARSVIQYGPMTD 274 (663) T ss_pred ccccccccceeeeecccc-----ccceeeeecccccCcceeEeecccccccccceeeecccCcccccccccccccccccc Confidence 00000000000000 0000000000100000 0 000000000000000000 0000 Q ss_pred e--EEEEEEccceE-EEEeec-cccCchHHHHHhhhhhheeeEEEecccceEEEEecc---cccceeEEecccchhhhhh Q lcl|NC_017984. 113 G--TLTIVVDGVSK-SVPVDL-ATANSYSDAAALIATALTLPCTYESTVKGFVIKSGT---SGANSTISFATGDISDDLK 185 (487) Q Consensus 113 g--~~~iti~g~~~-~~~i~~-s~ats~~~vA~~i~t~l~a~vt~d~~~~~f~its~t---~g~~stit~atgd~a~~l~ 185 (487) . .+.+..+|... ...+.. .......+....+...+. ...++++..... .+....+.. T Consensus 275 ~~~~~~~~~~g~~~e~~~ls~~~~~~~~~~~~~~~~~~~~------~~~s~~v~~~~~~~~~~~~~~~~l---------- 338 (663) T protein:vir:10 275 DQFAIIVRRDGIVVESTVLSTRRGDRDVYGNNIFMDDYFR------NGSSNFIYASSVNWPAGFTGIIQL---------- 338 (663) T ss_pred hhhcccccCCCcccceeeeeccccccccchhhhhhhhhhc------CcccceeEeeccccCcccceeEEe---------- Confidence 0 00011111100 000000 000000000000111110 000111100000 000001111 Q ss_pred hccccceeEecCccc------ccHHHHHHHHHhcc-cceeEEEEEeccCC-----hhHHHHHHHHHhccCceEEEEEccc Q lcl|NC_017984. 186 LTQETGAVLNNHTAA------DTPTTGALNALAFS-QNFVNITYSEGVFN-----EDALKDLALWVTSQNSRFKLYTWGL 253 (487) Q Consensus 186 lt~~~gA~~~~G~aa------et~~~al~a~~~~~-~~wy~~~~~~~~~~-----~~~i~a~A~w~~a~~~~~~~~~~~~ 253 (487) ..|.+. .....+++.+.+.. .+...++....... .....++...++....+|....... 
T Consensus 339 ---------~gg~~~~~~~~~~d~~~~~~~l~~~~~~~~~~~i~~~~~~~~~~~~~~v~~~l~~~~~~~~~~~ai~d~p~ 409 (663) T protein:vir:10 339 ---------GGGASANNAVGSDELIAGWDLFADREALHVNLMIAGACKSDGVAVASTVQKHVVALADDRQDCVAFVNPPS 409 (663) T ss_pred ---------cccccCcccchhhhhhhHHhhhccccccCceEEEeecCCCCchhhHHHHHHHHHHHHHhhCCEEEEEecCc Confidence 111111 11122233333221 22333322111111 1123345555565444444332211 Q ss_pred ccccc-ccch--HHHHHHH------------hC--CcceEEEecC-----CC--------chHHHHHHHHHhcCcCcCCc Q lcl|NC_017984. 254 DPVAL-GQSG--ASFGEWA------------KE--NTSGVVPLYG-----TF--------DKAAFFCGVSGSINYQEENG 303 (487) Q Consensus 254 ~~~~~-~~~~--~~~~~l~------------~~--~~~~t~~~y~-----~~--------~~~a~~~g~~as~~~~~~~g 303 (487) ..... .... ..+..+. .. +..+....|. ++ ....+++|.++.++.++. = T Consensus 410 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~l~~p~~~~~d~~~~~~~~~p~s~~vAGl~Ar~D~~~g-~ 488 (663) T protein:vir:10 410 ELLVGVPTTQAVKNIVEWRNGVTTGGEVVDNNMNISSTYAFISGNYKYQYDKYNDINRWVPLSADIAGLCAYTDQVGH-P 488 (663) T ss_pred ccccccchhhhHHHHHHHhhhccccchhhhhhcccCcceEEEEecceeEecccCCceEEechHHHHHHHHHHhhccCC-c Confidence 10000 0000 0000000 01 1123333221 01 234667888887774432 1 Q ss_pred eeeeeeeecCcccc-----cCCCHHHHHHHHhCCceEEEEecC-CCceEEEEECCEEcCC---ceehHHHHHHHHHHHHH Q lcl|NC_017984. 304 RTTTAFRSQDGLVP-----DVTNEADAETLVKNGYSFYGAWAT-ANDRFQFAGNGSVTGQ---YKWIDNFDFQVFLRTQL 374 (487) Q Consensus 304 s~T~~fk~l~Gv~~-----~~lt~t~~~al~~~~~n~y~~~~~-~~~~~~~~~~G~~sgg---~~~iD~~~~~dWl~~~i 374 (487) ......|.+.||.- ..+++.|.+.|..+|+|....+.+ .+ +.+|-.-++++. 
+.||-+.+-.+|+...| T Consensus 489 ~~span~~~~~i~g~~~~~~~~~~~~~~~Ln~~gin~i~~~~~~~G--~~~wG~rT~s~~~s~~~~i~vrR~~~~i~~si 566 (663) T protein:vir:10 489 WMSPAGYRRGQLRNTIKLAIEPKQSLRDTMYQVSINPVTGFAGGDG--FVLFGDKMATQVPSPFDRINVRRLFNMLKKNI 566 (663) T ss_pred EEccCCeeecceeccccceeecCchhHHHHHhCCCcEEEEeeCCCc--EEEEcccccCCCCcccceEehhhHHHHHHHHH Confidence 11223344444422 357889999999999999988754 33 445543334443 34788899999998888 Q ss_pred HHHHHHHHHhcCCCCcCHhhHHHHHHHHHHHHHHHHhcCccccCcccCccccccccccccCccccceeeeeeEEe--ccC Q lcl|NC_017984. 375 QLAYMNMFQAQKTIPYNDQGIATVRAYSQDPIDQGINFGGIRAGVNLSNAQKFQVNQEAGFDAASQLFTKGWALS--VTL 452 (487) Q Consensus 375 q~~l~~ll~~~~kIPyt~~G~~~l~~~v~~vl~~a~~nG~Ia~Gv~l~~~q~~~~~~~~g~~~~~~~~~~Gy~~~--~~~ 452 (487) +..+...+=. |.++.-...|+..|+.-|++-+++|.|. ||++. .+. T Consensus 567 ~~~~~~~v~e----pn~~~l~~~i~~~i~~~L~~l~~~gal~----------------------------gf~V~~d~~~ 614 (663) T protein:vir:10 567 GDTSKYELFE----NNDAFTRQSFRMEVSQYLDNIRSLGGVY----------------------------DFRVVCDTTN 614 (663) T ss_pred HHHHHHhccC----CCCHHHHHHHHHHHHHHHHHHHhCCcee----------------------------eeEEEEcCCC Confidence 8887764322 6788889999999999999999999885 34554 456 Q ss_pred CCHHHHhhcccCCeEEEEEECCeEEEEEEEEEeeC Q lcl|NC_017984. 453 PDSQTRVARESFIIKLFYTDGSSMQRLEMTATNVQ 487 (487) Q Consensus 453 ~s~~dra~R~~~~i~~~~~~aGAIh~v~i~gt~vq 487 (487) .+++|+.+-+. -+.+.+...-.+++|.++ .+| T Consensus 615 nt~~~i~~G~~-~~~i~~~p~~pae~I~~~--~~~ 646 (663) T protein:vir:10 615 NTPQVIDSNEF-VATIYIKAPRSINYITLN--FVA 646 (663) T ss_pred CCHHHhhCCeE-EEEEEEEecCCcceEEEE--EEE Confidence 78889888777 499999999999999997 446 No 60 >protein:vir:104477 Length: 749 # NCBI annotation: gp18 # Family: family:all:661 # MgeID: mge:1548 # MgeName: P-SSM4 # Cross-refs: genbank:acc:YP_214663;genbank:gi:61806304;genbank:GeneID:3294532 Probab=90.62 E-value=0.02 Score=29.91 Aligned_cols=426 Identities=11% Similarity=0.072 Sum_probs=163.8 Q ss_pred CCcCCccccceEE-Eeeeeeccccccc-ccceeEEe-cCcceeee-----eecc---------HHHHHHh-cCCC----- Q lcl|NC_017984. 
1 MQFNSIPASNIAA-VYPAVIGGGGNPL-GLNTNLFV-QDAIYPNY-----EYFS---------NTLVGQH-YGLE----- 57 (487) Q Consensus 1 ~~~~~ip~s~iV~-V~~~~~~~~~~~~-~~~~ll~~-~~~~~~~~-----~y~s---------~~~V~~~-fg~~----- 57 (487) -+...++.+-.-. +.+.+...++... .-...++. .+...... .+.. ..++..+ -|.. T Consensus 206 ~a~~~~~~~~~~~~~~~~~~s~~~~~~~a~~~~v~~~~~~~~~~~~i~~~~~~~~~~~~~~~a~~~~~~~~~g~~~~it~ 285 (749) T protein:vir:10 206 ESVNVLAYDATNKKLEIGLPSGGVTGILADNQVITQGTNTAKINVTIERKLLVALNKSSIEFAASDVVQDTNSTNITITS 285 (749) T ss_pred cccccccccCCcceEEEeeecccccceeeeeecccccccccccccccccchhhhhccccceeeccccccCCccceeEEEe Confidence 0011111111110 1111111111000 00000000 00000000 0000 0000000 0000 Q ss_pred -hHHHHHHHHHhhcc--cCCccCCCEEEEEeeecccceeeEeeccccccchhhheeeeeEEEEEEccc-------eEEEE Q lcl|NC_017984. 58 -SPIYKFATVYFNGF--RNATTRPNSLFITKYNLTDVPASLIGGDITSTTLADLKLINGTLTIVVDGV-------SKSVP 127 (487) Q Consensus 58 -s~ey~aA~~yF~g~--~~q~p~P~~l~igr~~~~~~~~~l~g~~~~~~~~~~~~~~~g~~~iti~g~-------~~~~~ 127 (487) +.+|.- +.++.++ ..-.|+|.+..++.- .|+ ....+.+ +.+..+|. ....- T Consensus 286 v~~~~~~-~~~~t~~~~~~~a~~~gt~~~~~~---------~~g-----~~D~~~v----~v~~~~g~~~~~~g~v~e~~ 346 (749) T protein:vir:10 286 VRDEYTE-REYLPGVKWINVAPRPGTSLYANG---------VGG-----HRDEMHV----ILVDIDGGVTGTVGALLERY 346 (749) T ss_pred eeccccc-cccccceeeccccccccceeeeec---------ccC-----CCCceEE----EEecCCCeeeecccceeeee Confidence 000000 0001000 001122222111110 000 0000000 01111121 10001 Q ss_pred eeccccCch-------HHHHHhhhhhheeeEEEecccceEEEEecc--cc----------------cceeEEecccch-- Q lcl|NC_017984. 128 VDLATANSY-------SDAAALIATALTLPCTYESTVKGFVIKSGT--SG----------------ANSTISFATGDI-- 180 (487) Q Consensus 128 i~~s~ats~-------~~vA~~i~t~l~a~vt~d~~~~~f~its~t--~g----------------~~stit~atgd~-- 180 (487) ++++.+.+. ..+...+...-. .+.+-.....+...+.+ ++ ...++.+..+.. 
T Consensus 347 ~~~~~~~~~~~~~~~~~~~~~~~~~~s~-~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 425 (749) T protein:vir:10 347 IDVSKASDAKTSVGETNYYAEVIKQKSE-FIYWAEHESTLYAATSSASDGLFGQTAANRQFNLFRSAAGSVDYPAGVTTL 425 (749) T ss_pred eeccccccccccccccchhhhhhccCCC-EEEEEecccccccccccccccccccccccceeeccccccccceeccccccc Confidence 222222211 111111111100 00000000000000000 00 000111110000 Q ss_pred ---hh---hhhhccccceeEecC---cccccHHHHHHHHHhccc-ceeEEEEEeccCC----hhHHHHHHHHHhccCceE Q lcl|NC_017984. 181 ---SD---DLKLTQETGAVLNNH---TAADTPTTGALNALAFSQ-NFVNITYSEGVFN----EDALKDLALWVTSQNSRF 246 (487) Q Consensus 181 ---a~---~l~lt~~~gA~~~~G---~aaet~~~al~a~~~~~~-~wy~~~~~~~~~~----~~~i~a~A~w~~a~~~~~ 246 (487) .. ...+..+.......+ ........++.++.+... ..-.++......+ .....++...+|....++ T Consensus 426 ~~~~~~~~~~~~~gg~d~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~li~~~~~~~~~~~~~v~~al~~~~~~~~~~~ 505 (749) T protein:vir:10 426 GSKNNATYYYRLSGGVNYTVSAGQYTITNTDIGSAYELIGDPESQIVDFIISGPSGTSDANALAKITSLVNIAEERRDCM 505 (749) T ss_pred cccCCcEEEEEccCCcccccccccccccchhHHHHHHHhhhhhhcccceEEEecCCCCcchhHHHHHHHHHHHhhcCCEE Confidence 00 000000000000000 011223444555544332 2222222222112 224456677777766555 Q ss_pred EEEEccccccc----cccchHHHHHHHhC--CcceEEEecC------CC-------chHHHHHHHHHhcCcCcCCceeee Q lcl|NC_017984. 247 KLYTWGLDPVA----LGQSGASFGEWAKE--NTSGVVPLYG------TF-------DKAAFFCGVSGSINYQEENGRTTT 307 (487) Q Consensus 247 ~~~~~~~~~~~----~~~~~~~~~~l~~~--~~~~t~~~y~------~~-------~~~a~~~g~~as~~~~~~~gs~T~ 307 (487) .+......... ............+. +..+..+.|. +. 
.....++|.++.++.++ | -| T Consensus 506 ~~~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~vAGl~Ar~D~~~--g--~~ 581 (749) T protein:vir:10 506 VFVSPRRGNVIGISNTTTITTNIVDFFKKLPSSSYMVFDSGYKYIYDKYNDVYRYIPCNGDTAGLCLQTNEIS--E--PW 581 (749) T ss_pred EEEcCCCCcccccccchhhhhHHHHHHhhccCceeEEEEccceeeeccccCceEEechHHHHHHHHHHhhccC--C--cE Confidence 44332211111 11111111111111 1222222221 10 23456777788776443 2 23 Q ss_pred e---eeecC---ccc--ccCCCHHHHHHHHhCCceEEEEecCCCceEEEEECCEEcCC---ceehHHHHHHHHHHHHHHH Q lcl|NC_017984. 308 A---FRSQD---GLV--PDVTNEADAETLVKNGYSFYGAWATANDRFQFAGNGSVTGQ---YKWIDNFDFQVFLRTQLQL 376 (487) Q Consensus 308 ~---fk~l~---Gv~--~~~lt~t~~~al~~~~~n~y~~~~~~~~~~~~~~~G~~sgg---~~~iD~~~~~dWl~~~iq~ 376 (487) | .|++. |+. ...+++.|.+.|..+|+|+...+.+.+ +.+|-.-++++. +.||-+.+-.+|++..|+. T Consensus 582 ~SPan~~~~~i~g~~~~~~~~~~~e~~~Ln~~gIn~i~~~~g~G--~~~wG~rT~~s~d~~~~~i~vRRl~~~ie~si~~ 659 (749) T protein:vir:10 582 FSPAGFQRGVLRNAIKLAYTPNKAQRDQLYANRVNPIVSFPGQG--VVLYGDKTALGFASAFDRINIRRLFLTVERVIST 659 (749) T ss_pred ECcCCceeeeeeccccceeecChhHHHhhhhCCceEEEEecCCe--EEEEcceecCCCCcccceeehhhhHHHHHHHHHH Confidence 3 45433 332 245789999999999999999887755 445544333332 4578999999999988888 Q ss_pred HHHHHHHhcCCCCcCHhhHHHHHHHHHHHHHHHHhcCccccCcccCccccccccccccCccccceeeeeeEEe--ccCCC Q lcl|NC_017984. 377 AYMNMFQAQKTIPYNDQGIATVRAYSQDPIDQGINFGGIRAGVNLSNAQKFQVNQEAGFDAASQLFTKGWALS--VTLPD 454 (487) Q Consensus 377 ~l~~ll~~~~kIPyt~~G~~~l~~~v~~vl~~a~~nG~Ia~Gv~l~~~q~~~~~~~~g~~~~~~~~~~Gy~~~--~~~~s 454 (487) .+...+=. |.++.=...|+..|+.-|+.-+++|.|. ||++. .+..+ T Consensus 660 ~~~~~v~e----pn~~~l~~~i~~~i~~fL~~l~~~G~i~----------------------------~f~V~~d~~~Nt 707 (749) T protein:vir:10 660 AAKAQLFE----QNDEAQRSLFINIVEPYLRDVQGRRGVV----------------------------DFLVKCDSTNNT 707 (749) T ss_pred HHHHhhcC----CCCHHHHHHHHHHHHHHHHHHHhcCCee----------------------------eeEEEEcCCCCC Confidence 87764332 5677778889999999999999888773 34454 45678 Q ss_pred HHHHhhcccCCeEEEEEECCeEEEEEEEEEeeC Q lcl|NC_017984. 
455 SQTRVARESFIIKLFYTDGSSMQRLEMTATNVQ 487 (487) Q Consensus 455 ~~dra~R~~~~i~~~~~~aGAIh~v~i~gt~vq 487 (487) ++|+.+.+. -+.+.++..-.+++|+++ ++| T Consensus 708 ~~~i~~G~~-~~~i~~~P~~pae~I~~~--~~~ 737 (749) T protein:vir:10 708 PEAVDRGEF-YAEVFLKPTRTINYVQLT--FVA 737 (749) T ss_pred HHHhhCCEE-EEEEEEEecCCccEEEEE--EEE Confidence 888888887 499999999999999997 557 No 61 >protein:vir:5663 Length: 671 # NCBI annotation: tail sheath protein # Family: family:all:661 # MgeID: mge:119 # MgeName: KVP40 # Cross-refs: genbank:acc:NP_899602;genbank:gi:34419589;genbank:GeneID:2545776 Probab=90.59 E-value=0.02 Score=29.89 Aligned_cols=418 Identities=13% Similarity=0.077 Sum_probs=150.9 Q ss_pred CCcCCccccceEEEe------eeeecccc--cccccceeEEecCcceeee------------------eeccHHHHHHhc Q lcl|NC_017984. 1 MQFNSIPASNIAAVY------PAVIGGGG--NPLGLNTNLFVQDAIYPNY------------------EYFSNTLVGQHY 54 (487) Q Consensus 1 ~~~~~ip~s~iV~V~------~~~~~~~~--~~~~~~~ll~~~~~~~~~~------------------~y~s~~~V~~~f 54 (487) .... .+-.+.+.+. ..+..... ................+.. .-.+..++...+ T Consensus 143 ~~~~-~~~~~~v~~~~~~~~~~~~~~~t~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 221 (671) T protein:vir:56 143 SKIF-LPSAEIVAAAKSDGNYPSVGTITLQPTQGDIALTNIEIIDTGSVYFPNIELAFDALTAIETEGGALKYADLIEKQ 221 (671) T ss_pred eeee-ccceeEEEeeeccccccccccccccccccceeeeeecccccceEEEeccccccccccccccccccccchhhhhcc Confidence 1110 1111111111 11110000 0000000000000000000 000011111111 Q ss_pred CCChHHHHHHHHHhhcccCCccCCCEEEEEe-eec---ccceeeEeeccccccc----hhhheee-----------e-eE Q lcl|NC_017984. 55 GLESPIYKFATVYFNGFRNATTRPNSLFITK-YNL---TDVPASLIGGDITSTT----LADLKLI-----------N-GT 114 (487) Q Consensus 55 g~~s~ey~aA~~yF~g~~~q~p~P~~l~igr-~~~---~~~~~~l~g~~~~~~~----~~~~~~~-----------~-g~ 114 (487) +. +.+. ..+-....+. -.+.+.. ... ........+..+.... ....... + .. 
T Consensus 222 ~~--~~~~--a~~~g~~g~~----~~v~v~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 293 (671) T protein:vir:56 222 GF--PRLS--ARYVGDFGDA----ISVEIINYADYQTAFAFAAGHTLGDIELPIYPDGGTRSINLSSYFTFGPSNSNQYA 293 (671) T ss_pred cc--cccc--cccccccCcc----eEEEEecccccccccccccceeeeeccccccccccccccccceeecccccccccce Confidence 11 0000 0000000000 0000000 000 0000000000000000 0000000 0 00 Q ss_pred EEEEEccceEEE-EeeccccCchHHHHHhhhhhheeeEEEecccceEEEEecccccceeEEecccchhhhhhhcccccee Q lcl|NC_017984. 115 LTIVVDGVSKSV-PVDLATANSYSDAAALIATALTLPCTYESTVKGFVIKSGTSGANSTISFATGDISDDLKLTQETGAV 193 (487) Q Consensus 115 ~~iti~g~~~~~-~i~~s~ats~~~vA~~i~t~l~a~vt~d~~~~~f~its~t~g~~stit~atgd~a~~l~lt~~~gA~ 193 (487) +.+..+|..... .+..............+...+ ...+....+........ ....... T Consensus 294 ~~v~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~-----------------~~~~~~~~~~~~~~~~~-----~~~~~~~ 351 (671) T protein:vir:56 294 VIVRVSGEVEEAFIVSTNPGDKDVNGQSIFIDEY-----------------FENSGSAYITAIAEGWK-----TESGAYN 351 (671) T ss_pred eEEeecCccceeEEEeecccccccchhhhhhhhh-----------------hcccCceEEEecCcccC-----Ccccccc Confidence 111111111000 000000000000000000000 00000000000000000 0000001 Q ss_pred EecCcc----cccHHHHHHHHHhcccceeEEEEEeccCChh-------HHHHHHHHHhccCceEEEEEcccccc---ccc Q lcl|NC_017984. 194 LNNHTA----ADTPTTGALNALAFSQNFVNITYSEGVFNED-------ALKDLALWVTSQNSRFKLYTWGLDPV---ALG 259 (487) Q Consensus 194 ~~~G~a----aet~~~al~a~~~~~~~wy~~~~~~~~~~~~-------~i~a~A~w~~a~~~~~~~~~~~~~~~---~~~ 259 (487) ...|.+ ..+...++.++.+...-.-.++.+.....++ ....+...++....++.++....... ... 
T Consensus 352 ~~gg~d~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~ 431 (671) T protein:vir:56 352 FGGGSDANAGADDWMFGLDMLSDPEVLYTNLVIAGNAAAEEVSIASTVQKYAIDSVGNVRQDCVVFVSPPQSLIVNKQAG 431 (671) T ss_pred ccCccccccchhHHHHHHHhhhhccccceeEEEcCCCCCccchhHHHHHHHHHHHHHhhcCCEEEEEecccccccccccc Confidence 111221 1223344444443322111222222111111 11112222223333333322110000 000 Q ss_pred cchHHHHHHH-------------hC--CcceEEEecC------CC-------chHHHHHHHHHhcCcCcCCceeeeeeee Q lcl|NC_017984. 260 QSGASFGEWA-------------KE--NTSGVVPLYG------TF-------DKAAFFCGVSGSINYQEENGRTTTAFRS 311 (487) Q Consensus 260 ~~~~~~~~l~-------------~~--~~~~t~~~y~------~~-------~~~a~~~g~~as~~~~~~~gs~T~~fk~ 311 (487) ..-..+..+. .. +..+....|. +. .....+.|.++.++.++. =......|. T Consensus 432 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~~AGl~Ar~D~~~g-~~~span~~ 510 (671) T protein:vir:56 432 TAVANIQGWRTGIDPTNGQAVVDNLNVSTTYAVIDGNYKYQYDKYNDRNRWVPLAGDIAGLCAYTDQVSQ-PWMSPAGFN 510 (671) T ss_pred ccHHHHHHHhhhccccchhhhhhhccCCcceEEEecCceEEecccCCceeEechHHHHHHHHHHhhccCC-cEECcCCce Confidence 0000000010 01 1122222221 10 235667788888764431 111122443 Q ss_pred cCc---cc--ccCCCHHHHHHHHhCCceEEEEecCCCceEEEEECCEEcCC---ceehHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_017984. 312 QDG---LV--PDVTNEADAETLVKNGYSFYGAWATANDRFQFAGNGSVTGQ---YKWIDNFDFQVFLRTQLQLAYMNMFQ 383 (487) Q Consensus 312 l~G---v~--~~~lt~t~~~al~~~~~n~y~~~~~~~~~~~~~~~G~~sgg---~~~iD~~~~~dWl~~~iq~~l~~ll~ 383 (487) +.+ +. ...+++.|.+.|..+|+|+...+.+.+ +.+|-.-++++. 
+.||-+.+-.+|+.+.|+..++..+= T Consensus 511 ~~~i~g~~~~~~~~~~~~~~~Ln~~gIn~i~~~~~~G--~~~wG~rT~~~~~~~~~~i~vrR~~~~i~~si~~~~~~~v~ 588 (671) T protein:vir:56 511 RGQIKGVNRLAVDLRRAHRDALYQIGINPVVGFAGQG--FVLYGDKTATQQASAFDRINVRRLFNLLKKAISDAAKYRLF 588 (671) T ss_pred eccccccccceeecChhHHHHHhhCCceEEEEecCCe--EEEEcceecCCCCcccceEehhhHHHHHHHHHHHHHHHhcC Confidence 333 32 245788999999999999999887665 455544333332 35899999999999999888876432 Q ss_pred hcCCCCcCHhhHHHHHHHHHHHHHHHHhcCccccCcccCccccccccccccCccccceeeeeeEEe--ccCCCHHHHhhc Q lcl|NC_017984. 384 AQKTIPYNDQGIATVRAYSQDPIDQGINFGGIRAGVNLSNAQKFQVNQEAGFDAASQLFTKGWALS--VTLPDSQTRVAR 461 (487) Q Consensus 384 ~~~kIPyt~~G~~~l~~~v~~vl~~a~~nG~Ia~Gv~l~~~q~~~~~~~~g~~~~~~~~~~Gy~~~--~~~~s~~dra~R 461 (487) . |.++.=...|+..|+.-|+.-+++|.|. ||++. .+..|++|+.+. T Consensus 589 e----pn~~~~~~~i~~~i~~fL~~l~~~gal~----------------------------g~~v~~d~~~nt~~~i~~G 636 (671) T protein:vir:56 589 E----LNDEFTRSSFKSEIDAYLTNIQDLGGVY----------------------------DFRVVCDETNNPGSVIDRN 636 (671) T ss_pred C----CCCHHHHHHHHHHHHHHHHHHHhCCcee----------------------------eeEEEEcCCCCCHHHhhCC Confidence 2 5577777889999999999999999884 34444 466788898888 Q ss_pred ccCCeEEEEEECCeEEEEEEEEEeeC Q lcl|NC_017984. 462 ESFIIKLFYTDGSSMQRLEMTATNVQ 487 (487) Q Consensus 462 ~~~~i~~~~~~aGAIh~v~i~gt~vq 487 (487) +. -+.+.+...-.+++|+++- +| T Consensus 637 ~~-~~~i~~~p~~Pae~I~~~~--~~ 659 (671) T protein:vir:56 637 EF-VASIYVKPAKSINFITLNF--VA 659 (671) T ss_pred eE-EEEEEEEecCCcceEEEEE--EE Confidence 87 4999999999999999974 57 No 62 >protein:vir:106427 Length: 679 # NCBI annotation: gp18 tail sheath protein # Family: family:all:661 # MgeID: mge:1474 # MgeName: Aeh1 # Cross-refs: genbank:acc:NP_944106;genbank:gi:38640150;genbank:GeneID:2658188 Probab=86.65 E-value=0.044 Score=28.02 Aligned_cols=430 Identities=11% Similarity=0.030 Sum_probs=156.3 Q ss_pred CC-------------------cCCccccceEEEee------eeecc---------ccccc--ccceeEEecCcceeeeee Q lcl|NC_017984. 
1 MQ-------------------FNSIPASNIAAVYP------AVIGG---------GGNPL--GLNTNLFVQDAIYPNYEY 44 (487) Q Consensus 1 ~~-------------------~~~ip~s~iV~V~~------~~~~~---------~~~~~--~~~~ll~~~~~~~~~~~y 44 (487) -+ ---+|.++.+...- .+... .+... ......+.......+... T Consensus 120 ~~~s~~~~~~~~~~~~~~~~~~~~v~~~~~~~~a~~~~~~~~l~~a~~~~~~~~t~~~g~~~~~~v~~v~~~~~~~~~~~ 199 (679) T protein:vir:10 120 QGGNVIATGKVTVVNASGGIVAFYVPTAAIIDKAKSLNDYPALDNAWQIQFAAGGPGAGQAATATVVGINLDSTIFVPND 199 (679) T ss_pred eCCCcccceeEEEeeccCceeeeeecccccccccccccccceecccceeeeeeccccccceeeeeeeeeccCCceeeccc Confidence 00 00011111111000 00000 00000 000000000000000000 Q ss_pred ccH-HHHHHhcCCChHHHHHHHHHhhcccCCccCCCEEEEEeeecccceeeEeeccccccchhhheeeeeEEEEEEcc-- Q lcl|NC_017984. 45 FSN-TLVGQHYGLESPIYKFATVYFNGFRNATTRPNSLFITKYNLTDVPASLIGGDITSTTLADLKLINGTLTIVVDG-- 121 (487) Q Consensus 45 ~s~-~~V~~~fg~~s~ey~aA~~yF~g~~~q~p~P~~l~igr~~~~~~~~~l~g~~~~~~~~~~~~~~~g~~~iti~g-- 121 (487) ... ..+...-+.. +........+ ..|...-..-|.|...-. ........ ......+...+.++. T Consensus 200 ~~a~~~i~~~~~~~-~t~~~~~~~~-----~~~~~~A~~~g~~gn~i~-v~~va~~~------~~~~~~~~a~v~~~~~~ 266 (679) T protein:vir:10 200 EYAMSAISERSETK-RTFIDICEEM-----KVPAIVARYAGTYGDNIK-VLMIAYKD------YYKFNEAGKIVSVNTIN 266 (679) T ss_pred cccccccccccccc-hhhhhhhhcc-----ccceeeeecccccCCcce-EEEEeecc------ccccccccccccccccc Confidence 000 0000000000 0000000000 011100011111110000 00000000 000000000000000 Q ss_pred -ceEEEEeeccccCchHHHHHhhhh--hheeeEEEe-cccceEEEEeccc-----------------ccceeEEecccch Q lcl|NC_017984. 122 -VSKSVPVDLATANSYSDAAALIAT--ALTLPCTYE-STVKGFVIKSGTS-----------------GANSTISFATGDI 180 (487) Q Consensus 122 -~~~~~~i~~s~ats~~~vA~~i~t--~l~a~vt~d-~~~~~f~its~t~-----------------g~~stit~atgd~ 180 (487) ............+.........+. .+...+.-+ .....|.++.... +....+ +.... 
T Consensus 267 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vvv~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v-~~~~~- 344 (679) T protein:vir:10 267 PKVFPTGLDYGNVTPSSYLEFGPQNESQFAFIVFNNGVAVESKILSTKPGDRDIYGTSIYINEYFGNGYSSFV-QGVAE- 344 (679) T ss_pred ccccccccccccceeeeecccccccccceeeEEecccccccceeeecccccccccchhhhhhhhhcCccccee-eeccc- Confidence 000000000000000000000000 000000000 0111111111100 000000 00000 Q ss_pred hhhhhhc-cccceeEe-cCccc------ccHHHHHHHHHhcccceeEEEEEeccC------ChhHHHHHHHHHhccCceE Q lcl|NC_017984. 181 SDDLKLT-QETGAVLN-NHTAA------DTPTTGALNALAFSQNFVNITYSEGVF------NEDALKDLALWVTSQNSRF 246 (487) Q Consensus 181 a~~l~lt-~~~gA~~~-~G~aa------et~~~al~a~~~~~~~wy~~~~~~~~~------~~~~i~a~A~w~~a~~~~~ 246 (487) ... ...+.+.. .|.+. .....++..+......---++++.... ......++-..++....+| T Consensus 345 ----~~~~~~~~~~~~~gg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~p~~~~~~~~~~~~v~~~l~~~~~~~~~~~ 420 (679) T protein:vir:10 345 ----SWPVGYTGVLAFGGGQSSNTDISAAEFMKGWDMFADREHTDVNLFIAGAVAGEGAQIASTVQKAVVAIADERRDCL 420 (679) T ss_pred ----cccccccceeeccCCccCCCccchhhhhhhhhhhhcccccccceEEecCCCCCchhhhHHHHHHHHHHHHhhCCeE Confidence 000 00011111 11111 111222222221111111223332211 1223445667777666666 Q ss_pred EEEEccccccc---cccchHHHHHHHh------------CC--cceEEEecC-----CC--------chHHHHHHHHHhc Q lcl|NC_017984. 247 KLYTWGLDPVA---LGQSGASFGEWAK------------EN--TSGVVPLYG-----TF--------DKAAFFCGVSGSI 296 (487) Q Consensus 247 ~~~~~~~~~~~---~~~~~~~~~~l~~------------~~--~~~t~~~y~-----~~--------~~~a~~~g~~as~ 296 (487) .+......... ..+....+..+.. .+ ..+....|. ++ .....+.|.++.+ T Consensus 421 ai~d~p~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~sg~vAGl~Ar~ 500 (679) T protein:vir:10 421 VLISPPREYMINQPAASVVRKLVDWRRGVNQAGISLDDNMNIGTTYASVDGNYKYQYDKYNDVNRWIPLAADIAGLCART 500 (679) T ss_pred EEEeccccccccccccchHHHHHHHHhhcccccchhhhhhccCcceEEEEccceeeecccCCceEEechHHHHHHHHHHh Confidence 55432111100 0111111111110 01 122232221 11 2246677888877 Q ss_pred CcCcCCceeeeeeeecCccc-----ccCCCHHHHHHHHhCCceEEEEecCCCceEEEEECCEEcCC---ceehHHHHHHH Q lcl|NC_017984. 
297 NYQEENGRTTTAFRSQDGLV-----PDVTNEADAETLVKNGYSFYGAWATANDRFQFAGNGSVTGQ---YKWIDNFDFQV 368 (487) Q Consensus 297 ~~~~~~gs~T~~fk~l~Gv~-----~~~lt~t~~~al~~~~~n~y~~~~~~~~~~~~~~~G~~sgg---~~~iD~~~~~d 368 (487) +.++. =.....+|.+.||. .-.+++.|.+.|..+++|+...+.+.+ +.+|-.-++++. +.||-+.+-.+ T Consensus 501 D~~~g-~~~sPan~~~~~i~g~~~~~~~~~~~~~~~Ln~~gin~i~~~~g~G--~~~wG~rT~~~~~s~~~~i~vrR~~~ 577 (679) T protein:vir:10 501 DTVGQ-PWQSPAGFNRGQIVNVIKLAVDTRQAHRDEMYTNGINPIVGFAGQG--YILYGDKTASQAPTPFDRINVRRLFN 577 (679) T ss_pred hccCC-cEECcCCeeeccccccccceeecChhhHHhhhhCCceEEEEecCCe--EEEEcccccCCCCcccceEehhhHHH Confidence 64332 11122344444442 234788999999999999999887765 445544434443 34788889999 Q ss_pred HHHHHHHHHHHHHHHhcCCCCcCHhhHHHHHHHHHHHHHHHHhcCccccCcccCccccccccccccCccccceeeeeeEE Q lcl|NC_017984. 369 FLRTQLQLAYMNMFQAQKTIPYNDQGIATVRAYSQDPIDQGINFGGIRAGVNLSNAQKFQVNQEAGFDAASQLFTKGWAL 448 (487) Q Consensus 369 Wl~~~iq~~l~~ll~~~~kIPyt~~G~~~l~~~v~~vl~~a~~nG~Ia~Gv~l~~~q~~~~~~~~g~~~~~~~~~~Gy~~ 448 (487) |++..|+..+...+=. |.|+.=...|+..|+.-|.+-+++|.|. ||++ T Consensus 578 ~i~~si~~~~~~~v~e----pn~~~~~~~i~~~i~~fL~~l~~~gal~----------------------------gf~v 625 (679) T protein:vir:10 578 LLKKSISESAKYKLFE----LNDAFTRSSFRSEVGSYLDTIRSLGGIY----------------------------DFRV 625 (679) T ss_pred HHHHHHHHHHHHhccC----CCCHHHHHHHHHHHHHHHHHHHhCCcee----------------------------eeEE Confidence 9998888888764432 5677778899999999999999999884 3555 Q ss_pred e--ccCCCHHHHhhcccCCeEEEEEECCeEEEEEEEEEeeC Q lcl|NC_017984. 449 S--VTLPDSQTRVARESFIIKLFYTDGSSMQRLEMTATNVQ 487 (487) Q Consensus 449 ~--~~~~s~~dra~R~~~~i~~~~~~aGAIh~v~i~gt~vq 487 (487) . .+..+++|+.+.+. 
-+.+.+...-.+++|.++ ++| T Consensus 626 ~~d~~~nt~~~i~~G~~-~~~i~~~p~~pae~i~~~--~~~ 663 (679) T protein:vir:10 626 VCDESNNTPAVIDRNEF-VATILIKPARSINYITLS--FVA 663 (679) T ss_pred EEcCCCCCHHHhhCCeE-EEEEEEEecCCccEEEEE--EEE Confidence 4 45678889888887 499999999999999997 446 No 63 >protein:vir:79798 Length: 717 # NCBI annotation: tail sheath subunit # Family: family:all:12069 # MgeID: mge:1874 # MgeName: 0305phi8-36 # Cross-refs: genbank:acc:YP_001429630;genbank:gi:156564120;genbank:GeneID:5525563 Probab=79.95 E-value=0.099 Score=26.08 Aligned_cols=415 Identities=13% Similarity=0.115 Sum_probs=154.6 Q ss_pred CCcCCccccceE-------------------------------EEeeeeeccccccccc--ceeEEecCcceee-ee--- Q lcl|NC_017984. 1 MQFNSIPASNIA-------------------------------AVYPAVIGGGGNPLGL--NTNLFVQDAIYPN-YE--- 43 (487) Q Consensus 1 ~~~~~ip~s~iV-------------------------------~V~~~~~~~~~~~~~~--~~ll~~~~~~~~~-~~--- 43 (487) |. ..|.+|--+ +|++-+.-.+-....| +++-+.....+-+ .+ T Consensus 192 ~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 270 (717) T protein:vir:79 192 ME-SEITVSYEFTYKDAQGETKTSEVLDNNTDKDGKPMIAKGADVTIKLEHVALAGLKLYADGIEVVDAKAFTVAGDQLT 270 (717) T ss_pred cc-ceeEEEEEEEeecccCcchhhhhhcCCCCCCCceeEEecccceeehhhhhhhhhHHhhcchhhhhhhheeeecceEE Confidence 32 222222111 1111111111111100 1111111110000 00 Q ss_pred -eccHH-HHHHhcCCChHHHHHHHHHhhcccCCccCCCEEEEEeeecccceeeEeeccccccchhhheeeeeE--EEEEE Q lcl|NC_017984. 44 -YFSNT-LVGQHYGLESPIYKFATVYFNGFRNATTRPNSLFITKYNLTDVPASLIGGDITSTTLADLKLINGT--LTIVV 119 (487) Q Consensus 44 -y~s~~-~V~~~fg~~s~ey~aA~~yF~g~~~q~p~P~~l~igr~~~~~~~~~l~g~~~~~~~~~~~~~~~g~--~~iti 119 (487) -+... .| |. |-|.+.+--..- --+.|++=| .+.||.+.....-.....++. +++|. 
T Consensus 271 ~~~~~~~~~----~~-~~~~~~~~~~~~-------~~~~~~~~~--------~~~~g~~~n~~~~~v~~~D~~~~~~~t~ 330 (717) T protein:vir:79 271 IHSNSKMKL----GA-SLEAQYAYNLVE-------VIQPVIELE--------SIFGGGVYNDIMRKVESKDGAVTVTITK 330 (717) T ss_pred EEecCCccc----ch-hhHHHHHhhHHH-------hhccceEEe--------ecccCceeeeeeeEEecCCceEEEEEec Confidence 00000 01 11 223221111111 001122211 111222111100000000000 11111 Q ss_pred c----cceEEEEeeccccCchHHHHHhhhhhheeeEEEecccceEEEEecccccceeEEec----ccchh------hhhh Q lcl|NC_017984. 120 D----GVSKSVPVDLATANSYSDAAALIATALTLPCTYESTVKGFVIKSGTSGANSTISFA----TGDIS------DDLK 185 (487) Q Consensus 120 ~----g~~~~~~i~~s~ats~~~vA~~i~t~l~a~vt~d~~~~~f~its~t~g~~stit~a----tgd~a------~~l~ 185 (487) - |..+..+++++.-. +...+-.+. .+.-|+-.+ +...++.....+++. +.+-+ +.+. T Consensus 331 ~~~~~g~~~~~pl~~ts~d-y~~~~~~vd-----gI~~~~~~~---V~~~g~~s~a~a~~~~g~~s~d~a~f~Gg~dgl~ 401 (717) T protein:vir:79 331 PESKRGMISEDPLVFKSGD-YTNFKMLVD-----AINNHPFNN---VVRARTKPEFEATFTSTLQAAADAKFSGGKDELS 401 (717) T ss_pred ccccCcceeccccccccCc-eeeeeeeec-----ccccCchhh---eeeeecccccceeeeecccCchhhccCCCccccc Confidence 0 00011122222111 000000000 000000000 001111111111111 11100 0011 Q ss_pred hccccceeEecCcccc-----cHHHHHHHHHhcccceeEEEEEeccC-----ChhHHHHHHHHHhccCc----eEEEEEc Q lcl|NC_017984. 186 LTQETGAVLNNHTAAD-----TPTTGALNALAFSQNFVNITYSEGVF-----NEDALKDLALWVTSQNS----RFKLYTW 251 (487) Q Consensus 186 lt~~~gA~~~~G~aae-----t~~~al~a~~~~~~~wy~~~~~~~~~-----~~~~i~a~A~w~~a~~~----~~~~~~~ 251 (487) +.... .....|...+ ++..+...+....-++..+.-..... .++...+++.+++.... ++..... 
T Consensus 402 ~~~ee-~Y~~lGgk~~d~g~lt~~aays~LE~~dVDlVil~ga~adtt~ga~~d~va~alad~caalSal~r~ai~VI~l 480 (717) T protein:vir:79 402 LDKEE-MYKRLGGEKNEEGFVTKQGAYQYLENYEVDYVIPLGVHADTKLIGKYDDFAYQLALACAVMSHYNSVTIGIIPT 480 (717) T ss_pred cchhh-hhccccccccccccccchhhhhhcCcceeEEEEecCccccccccchhhhHHHHHHHHHHHhhhccccceeeecc Confidence 10000 0001111111 12233444443333332221111110 12345567778776431 2211111 Q ss_pred cccc-cccccchHHHHHH---HhC-------------------C---cceEE-----EecCC------CchHHHHHHHHH Q lcl|NC_017984. 252 GLDP-VALGQSGASFGEW---AKE-------------------N---TSGVV-----PLYGT------FDKAAFFCGVSG 294 (487) Q Consensus 252 ~~~~-~~~~~~~~~~~~l---~~~-------------------~---~~~t~-----~~y~~------~~~~a~~~g~~a 294 (487) .... ...+........+ ... . |..++ .+... ...+..+.|..+ T Consensus 481 ~sp~D~~~AtVe~~~~kLs~~Aaa~~~~d~~~a~a~~~~~~~idis~y~~vv~~~~~iv~~~~~~~~~~p~AG~vAGldA 560 (717) T protein:vir:79 481 TTPSDISLAGVEEHVKKLENYANEFYMRDRFGNIIFDADRNKIDLGQFIEVVAGPDFIVRNTRLGQMASTPDASYIGMVS 560 (717) T ss_pred ccccccchhhHHHHHHHHHhhhhhhhhhcchhccccccccccccccceeeeeecceeEEEcCCCceeecCHHHHHHHHHh Confidence 1100 0001111111111 000 0 00000 00000 011233344444 Q ss_pred hcCcCcCCceeeeeeeecCccc--ccCCCHHHHHHHHhCCceEEEEecCCCceEEEEECCEEcCC---ceehHHHHHHHH Q lcl|NC_017984. 295 SINYQEENGRTTTAFRSQDGLV--PDVTNEADAETLVKNGYSFYGAWATANDRFQFAGNGSVTGQ---YKWIDNFDFQVF 369 (487) Q Consensus 295 s~~~~~~~gs~T~~fk~l~Gv~--~~~lt~t~~~al~~~~~n~y~~~~~~~~~~~~~~~G~~sgg---~~~iD~~~~~dW 369 (487) ...+...+ .+|.+.|+. ...++..|++.|..+|+|++..+.+.+ +.+|..=++++. 
+.||-+++-.|+ T Consensus 561 ~rGVwkSP-----ANk~I~GVvgLa~~lT~sE~d~Ln~aGIntIr~~~GrG--irVWGaRTtasd~sdWryInVRRl~D~ 633 (717) T protein:vir:79 561 QLKTQSAP-----TNKPLPSVTALRYTYSANQLNRLTKARFATFKYKQDGS--IGVVDAPTSAHAGSDYTRLSTARIVKE 633 (717) T ss_pred cCCccccc-----ccceecccccCcccCCHHHHHHHhhCCeEEEEEeCCce--EEEEeeeecCCCCcccceeehhhhHHH Confidence 44433333 367777765 356899999999999999998887655 556644334443 457899999999 Q ss_pred HHHHHHHHHHHHHHhcCCCCcCHhhHHHHHHHHHHHHHHHHhcCccccCcccCccccccccccccCccccceeeeeeEEe Q lcl|NC_017984. 370 LRTQLQLAYMNMFQAQKTIPYNDQGIATVRAYSQDPIDQGINFGGIRAGVNLSNAQKFQVNQEAGFDAASQLFTKGWALS 449 (487) Q Consensus 370 l~~~iq~~l~~ll~~~~kIPyt~~G~~~l~~~v~~vl~~a~~nG~Ia~Gv~l~~~q~~~~~~~~g~~~~~~~~~~Gy~~~ 449 (487) +...|+..+...+ .+ |-++.+...|+..|+.-|++-.+.|.|.- |.+. T Consensus 634 Ie~sIr~al~~yV---gE-PNd~~tr~~Ik~sI~afL~~L~r~GAI~G----------------------------ykvd 681 (717) T protein:vir:79 634 AVNAVREVADPFI---GE-PNDTGNRNALTAAVDKRLSKMIENKALLG----------------------------FDFR 681 (717) T ss_pred HHHHHHHHHHHhc---cc-cCCHHHHHHHHHHHHHHHHHHHhcCceec----------------------------ceee Confidence 9999988876533 23 77889999999999999999999999952 1222 Q ss_pred ccCCCHHHHhhcccCCeEEEEEECCeEEEEEEEEEeeC Q lcl|NC_017984. 450 VTLPDSQTRVARESFIIKLFYTDGSSMQRLEMTATNVQ 487 (487) Q Consensus 450 ~~~~s~~dra~R~~~~i~~~~~~aGAIh~v~i~gt~vq 487 (487) + .++++|..+=+. 
-+.+.+.....+++|.++-++== T Consensus 682 v-tnT~~di~~G~l-~V~I~vaPv~PaEfI~ititITA 717 (717) T protein:vir:79 682 L-VVTPQQELLGEG-SIELSLEAPNELRRLTTIVSLSA 717 (717) T ss_pred E-ecChhHhhCCEE-EEEEEEEecCcccEEEEEEEEeC Confidence 2 245555443222 37888889999999888743322 No 64 >protein:vir:3788 Length: 376 # NCBI annotation: tail sheath # Family: family:all:669 # MgeID: mge:328 # MgeName: HP2 # Cross-refs: genbank:acc:NP_536828;genbank:gi:17981837;genbank:GeneID:929216 Probab=67.18 E-value=0.26 Score=23.83 Aligned_cols=329 Identities=10% Similarity=0.075 Sum_probs=128.8 Q ss_pred EccceEEEEeeccccCchHHHHHhhhhh-heeeEEEecccceEEEEecccccceeEEecccchhhhh---hhcccc---c Q lcl|NC_017984. 119 VDGVSKSVPVDLATANSYSDAAALIATA-LTLPCTYESTVKGFVIKSGTSGANSTISFATGDISDDL---KLTQET---G 191 (487) Q Consensus 119 i~g~~~~~~i~~s~ats~~~vA~~i~t~-l~a~vt~d~~~~~f~its~t~g~~stit~atgd~a~~l---~lt~~~---g 191 (487) .=|+...-.+|+..-. +.. |+-. |=.....-...+-+.+...++. +.-.+-+...+-.-+ .+..+. . T Consensus 1 ~~~~v~vn~~n~~~g~-~~~----~er~~Lfig~~~~~~~~~~~~~~~sdl-d~~lg~~~~~lk~~v~aa~~naG~~~~~ 74 (376) T protein:vir:37 1 MFPSVQINALNQLSGE-TKE----IERHALFVGVGTTNQGKLLALTPDSDF-DKVFGETDTDLKKQVRAAMLNAGQNWFA 74 (376) T ss_pred CCCeEEEecccccCCC-ccc----ccceEEeeccccccccceeeecCccch-HhhhCCCchHHHHHHHHHHhCCCCcEEE Confidence 1111111112222111 000 0000 0000000001111112211211 111111111111111 111111 1 Q ss_pred eeEecCcccccHHHHHHHHHhcccceeEEEEEeccC-ChhHHH---HHHHHHhccCceEEEEEcccc-----ccccccch Q lcl|NC_017984. 192 AVLNNHTAADTPTTGALNALAFSQNFVNITYSEGVF-NEDALK---DLALWVTSQNSRFKLYTWGLD-----PVALGQSG 262 (487) Q Consensus 192 A~~~~G~aaet~~~al~a~~~~~~~wy~~~~~~~~~-~~~~i~---a~A~w~~a~~~~~~~~~~~~~-----~~~~~~~~ 262 (487) .+.....+.++..+++.... ....+.+..++..+. +..++. +.+.....+-.|+.++..... .....+=. 
T Consensus 75 ~~~~~~~~~~~~~~Av~~a~-~~~s~E~V~v~~pv~t~~a~i~aa~~~a~el~~~~~Rpv~file~r~~~~~~~~~e~w~ 153 (376) T protein:vir:37 75 HVYIAQEDGYDFVECVKKAN-QTASFEYCVNTRYLGVDKASIGKLQECYAELLAKFGRRTFFIQAVQGINHDQSDGETWD 153 (376) T ss_pred EEEeecCCchHHHHHHHHhh-hhcCceEEEEeccccccHHHHHHHHHHHHHHHHhcCCeEEEEEeccCcCcccccccCHH Confidence 12223333345666665542 334455555555432 233333 344455444344444433321 11111111 Q ss_pred HHHHHHHh--CC--cceE--EE-ecCCCchHHHHHHHHH--hcCcCcCCceeeeeeeecCc---------ccccCCCHHH Q lcl|NC_017984. 263 ASFGEWAK--EN--TSGV--VP-LYGTFDKAAFFCGVSG--SINYQEENGRTTTAFRSQDG---------LVPDVTNEAD 324 (487) Q Consensus 263 ~~~~~l~~--~~--~~~t--~~-~y~~~~~~a~~~g~~a--s~~~~~~~gs~T~~fk~l~G---------v~~~~lt~t~ 324 (487) +....+.. .+ ..++ ++ .|+ ...-.++|+++ ++.-...+|++.-- .+.| -....++... T Consensus 154 ~y~~~~~al~~gia~~~V~~V~~~~g--n~~G~~aGRl~~aaVsVadspgRV~tG--~l~gl~~~~lp~d~~~~~l~~a~ 229 (376) T protein:vir:37 154 QYVQKLTTLQQTIVADHVCLVPLLFG--NETGVLAGRLANRAVTVADSPARVQTG--ALVSLGSANKPLDKDRNELTLAH 229 (376) T ss_pred HHHHHHHHhhcccccccceeeeeehh--hhHHHHHHHHhhcccchhhCccceecc--ccccccccccccCcCcccCCHHH Confidence 22222222 11 1222 22 333 22445678763 44334456554311 1222 2335689999 Q ss_pred HHHHHhCCceEEEEecCCCceEEEEECCEEc----CCceehHHHHHHHHHHHHHHHHHHHHHHhcCCCCcCHhhHHHHHH Q lcl|NC_017984. 325 AETLVKNGYSFYGAWATANDRFQFAGNGSVT----GQYKWIDNFDFQVFLRTQLQLAYMNMFQAQKTIPYNDQGIATVRA 400 (487) Q Consensus 325 ~~al~~~~~n~y~~~~~~~~~~~~~~~G~~s----gg~~~iD~~~~~dWl~~~iq~~l~~ll~~~~kIPyt~~G~~~l~~ 400 (487) +.+|+++|+.+...|.+.. .-||.+|.|+ |+|.+|-..+=.|=....+.......+.. ...=-+..+++..+. 
T Consensus 230 l~aLd~agy~vp~~Y~gy~--G~Y~~d~~tl~~~gsDY~~ie~~RVvdKa~R~vR~~ai~~i~D-~~lnst~~sia~~~~ 306 (376) T protein:vir:37 230 LKSLETARYSVPMWYPDYD--GYYWADGRTLDVEGGDYQVIENLRVVDKVARKVRLLAIGKIAD-RSFNSTTSSTEYHKN 306 (376) T ss_pred HHHHHhCCCeEEEeeCCCC--ceEEeCceEeccCCCChhhhhhhhHHHHHHHHHHHHHHHHhCC-cccCcchhhHHHHHH Confidence 9999999999999998865 5689999988 33444544443333333333333332222 233346777888888 Q ss_pred HHHHHHHHHHhcCcccc----CcccCccccccccccccCccccceeeeeeEEeccCCCHHHHhhcccCCeEEEEEECCeE Q lcl|NC_017984. 401 YSQDPIDQGINFGGIRA----GVNLSNAQKFQVNQEAGFDAASQLFTKGWALSVTLPDSQTRVARESFIIKLFYTDGSSM 476 (487) Q Consensus 401 ~v~~vl~~a~~nG~Ia~----Gv~l~~~q~~~~~~~~g~~~~~~~~~~Gy~~~~~~~s~~dra~R~~~~i~~~~~~aGAI 476 (487) .+..+|++..+...|.. |. +.. ++.+ |+ ..+-... ..-.|.+..+.=|.- T Consensus 307 yi~~pLr~M~~s~~i~g~~fpGe-I~~-------p~d~-Di----------------~i~w~s~-~~V~I~~~v~P~~~p 360 (376) T protein:vir:37 307 YFAKPLRDMSKSATINGKDFPGE-CMP-------PKDD-AI----------------TIVWQSK-TKVTIYIKVRPYDCP 360 (376) T ss_pred HHHHHHHHHHhcchhccccccce-eec-------CCCC-Cc----------------eEEeecc-ceEEEEEEEEeccCC Confidence 89999999877766642 20 100 0000 11 0000011 111122222222222 Q ss_pred EEEEEEEEe--eC Q lcl|NC_017984. 477 QRLEMTATN--VQ 487 (487) Q Consensus 477 h~v~i~gt~--vq 487 (487) ..+.++.-| -= T Consensus 361 k~Itv~I~Ldlsn 373 (376) T protein:vir:37 361 KEITANIFLDLDS 373 (376) T ss_pred ceEEEEEEeecCC Confidence 222222100 00 No 65 >protein:vir:78782 Length: 370 # NCBI annotation: putative tail sheath protein # Family: family:all:669 # MgeID: mge:1857 # MgeName: phiO18P # Cross-refs: genbank:acc:YP_001285652;genbank:gi:148727158;genbank:GeneID:5220132 Probab=51.88 E-value=0.57 Score=21.92 Aligned_cols=339 Identities=9% Similarity=0.023 Sum_probs=128.1 Q ss_pred HhhcccCCccCCCEEEEEeeecccceeeEeeccccccchhhheeeeeEEEEEEccceEEEE-eeccccCchHHHHHhhhh Q lcl|NC_017984. 
67 YFNGFRNATTRPNSLFITKYNLTDVPASLIGGDITSTTLADLKLINGTLTIVVDGVSKSVP-VDLATANSYSDAAALIAT 145 (487) Q Consensus 67 yF~g~~~q~p~P~~l~igr~~~~~~~~~l~g~~~~~~~~~~~~~~~g~~~iti~g~~~~~~-i~~s~ats~~~vA~~i~t 145 (487) -| .++.|-..+. .-|.+. .+.=.+-+.--|.....+ +.+-+.++|..+...-.+ T Consensus 1 ~~----------~~v~vn~~n~-------~~g~~~--------~~er~~lfig~~~~~~g~~~~~~~~sdld~~l~~~ds 55 (370) T protein:vir:78 1 MW----------PYVQIYNLNQ-------MQGPVT--------EVERHLLFIGSAASNTGKLLSLNAQSDFDQLLGAADS 55 (370) T ss_pred CC----------ceEEEeeccc-------cCCCcC--------ccceeEEEEecccccccceEeecCccCHHHhcCCcCh Confidence 00 0111111000 000000 000011111011110111 222333333333321122 Q ss_pred hheeeE---EEecccceEEEEecccccceeEEecccchhhhhhhccccceeEecCcccccHHHHHHHHHhcccceeEEEE Q lcl|NC_017984. 146 ALTLPC---TYESTVKGFVIKSGTSGANSTISFATGDISDDLKLTQETGAVLNNHTAADTPTTGALNALAFSQNFVNITY 222 (487) Q Consensus 146 ~l~a~v---t~d~~~~~f~its~t~g~~stit~atgd~a~~l~lt~~~gA~~~~G~aaet~~~al~a~~~~~~~wy~~~~ 222 (487) .|...+ --++-.+.|. ........+...+|+..+. ....+.+..+ T Consensus 56 ~lk~~v~aa~~naG~~~~~-------------------------------~~~p~~~~~d~~~Av~~a~-~~~s~E~V~v 103 (370) T protein:vir:78 56 ELKANLLAARDNAGQNWSA-------------------------------AAYVLPTDKPWLDAARDAQ-QTQSFEGVVV 103 (370) T ss_pred hHHHHHHHHHhCCCCceEE-------------------------------EEEEecCchhHHHHHHHHH-hhCCccEEEE Confidence 221100 0000111111 1111122234555555443 2344444445 Q ss_pred EeccCChhHHHHHHHHHhc---cC-ceEEEEEccccccccccchHHHHHH---Hh---CCcceEEEecCCCchHHHHHHH Q lcl|NC_017984. 223 SEGVFNEDALKDLALWVTS---QN-SRFKLYTWGLDPVALGQSGASFGEW---AK---ENTSGVVPLYGTFDKAAFFCGV 292 (487) Q Consensus 223 ~~~~~~~~~i~a~A~w~~a---~~-~~~~~~~~~~~~~~~~~~~~~~~~l---~~---~~~~~t~~~y~~~~~~a~~~g~ 292 (487) ++...+...+.++..-.+. .- +..++..-........+-.+....+ .+ ..+-..++.++... 
.-.++|+ T Consensus 104 ~~~~s~~a~~~a~~~~a~el~n~~~Rpv~file~~~~~~~e~w~~y~~~l~al~~gia~~~V~vvp~~~g~~-~G~~aGR 182 (370) T protein:vir:78 104 LGQEWHQAAINAAHALNQELIAKWGRWQFMLLAVPAIADEQDWATYEAELATLQDGIAASSVSLIPQLWPTL-AGAYAGR 182 (370) T ss_pred ecCcchHHHHHHHHHHHHHHHHhcCCeEEEEEeecCCCCcCCHHHHHHHHHHhhhccccccceEEeeecccc-HHHHHHH Confidence 5443344444444333322 22 2233333222211111111112222 21 23334455555432 2345666 Q ss_pred HH--hcCcCcCCceee-eeeee---cCc-ccccCCCHHHHHHHHhCCceEEEEecCCCceEEEEECCEEcC--CceehHH Q lcl|NC_017984. 293 SG--SINYQEENGRTT-TAFRS---QDG-LVPDVTNEADAETLVKNGYSFYGAWATANDRFQFAGNGSVTG--QYKWIDN 363 (487) Q Consensus 293 ~a--s~~~~~~~gs~T-~~fk~---l~G-v~~~~lt~t~~~al~~~~~n~y~~~~~~~~~~~~~~~G~~sg--g~~~iD~ 363 (487) ++ ++.-...++++. -.-+. +|- -....++.+.+++|+++|+.+...|.+.. .-||.+|.|+. |.+|=-. T Consensus 183 L~naavsVadsP~Rv~tG~l~gl~~~p~d~~~~~l~~a~l~aLd~agy~vp~~Y~gy~--G~Y~~d~~tl~~~gsDYq~i 260 (370) T protein:vir:78 183 LCNRAVSIADSPCRVKTGALVGLGNKPVGKDGIPLPLATLQTLEANRYSVPMWYPDYD--GIYWADGRTLDAEGGDYQVI 260 (370) T ss_pred HhcCeeeecccceeeeccccccccccccccCCcccCHHHHHHHHhCCCeEEEeeCCCC--ceEEeCceEeccCCCChhhh Confidence 53 221122333321 11111 110 02245888999999999999999998865 56899999882 2334334 Q ss_pred HHHHHHHHHHHHHHHHHHHH-hcCCCCcCHhhHHHHHHHHHHHHHHHHhcCccccCcccCccccccccccccCcccccee Q lcl|NC_017984. 364 FDFQVFLRTQLQLAYMNMFQ-AQKTIPYNDQGIATVRAYSQDPIDQGINFGGIRAGVNLSNAQKFQVNQEAGFDAASQLF 442 (487) Q Consensus 364 ~~~~dWl~~~iq~~l~~ll~-~~~kIPyt~~G~~~l~~~v~~vl~~a~~nG~Ia~Gv~l~~~q~~~~~~~~g~~~~~~~~ 442 (487) =+.+.+-|..-+.++..+.. ...++==++..++..+.....+|++..+.+-|.. ++... 
.+...-..|+ T Consensus 261 e~~RVvdKa~R~vR~~ai~~i~D~~lnst~gsia~~~~~~~~~L~ema~s~~i~~-~~fpg----eI~~p~d~Di----- 330 (370) T protein:vir:78 261 ENLRIAYKVARRMRLRAIARIGDRSFNSTPGSTAAAITYFGKDLREMAKSTTING-QPFPG----DIASPQDGDI----- 330 (370) T ss_pred hhhhHHHHHHHHHHHHHHHHhCCcccCCCCcchhHHHHHHHhhHHHHHhhhhhcc-cccce----eEeccCCCcc----- Confidence 44455555544444433322 2223434557788888889999999888887763 00000 0000000011 Q ss_pred eeeeEEeccCCCHHHHhhcccCCeEEEEEECCeEEEEE------EEEEeeC Q lcl|NC_017984. 443 TKGWALSVTLPDSQTRVARESFIIKLFYTDGSSMQRLE------MTATNVQ 487 (487) Q Consensus 443 ~~Gy~~~~~~~s~~dra~R~~~~i~~~~~~aGAIh~v~------i~gt~vq 487 (487) ..+-.+.+ .-.|.+..+.=|.-..+. .+...=| T Consensus 331 -----------~i~w~s~~-~v~I~~~v~P~~~pk~Itv~I~LDls~e~~~ 369 (370) T protein:vir:78 331 -----------RIQWVAKN-LVSVFVVVRTVDCPKGITVNIMLDLSLNNGE 369 (370) T ss_pred -----------eEEeeccc-eEEEEEEEEeccCCceEEEEEEEeeccccCC Confidence 11111111 112333333222222222 2333333 No 66 >protein:vir:3751 Length: 376 # NCBI annotation: orf23 # Family: family:all:669 # MgeID: mge:79 # MgeName: HP1 # Cross-refs: genbank:acc:NP_043492;genbank:gi:9628627;genbank:GeneID:1261129 Probab=36.64 E-value=1.2 Score=20.23 Aligned_cols=333 Identities=13% Similarity=0.094 Sum_probs=132.6 Q ss_pred CCCEEEEEeeecccceeeEeeccccccchhhheeeeeEEEEEEccceEEEE-eeccccCchHHHHHhhhhhheeeEEEec Q lcl|NC_017984. 77 RPNSLFITKYNLTDVPASLIGGDITSTTLADLKLINGTLTIVVDGVSKSVP-VDLATANSYSDAAALIATALTLPCTYES 155 (487) Q Consensus 77 ~P~~l~igr~~~~~~~~~l~g~~~~~~~~~~~~~~~g~~~iti~g~~~~~~-i~~s~ats~~~vA~~i~t~l~a~vt~d~ 155 (487) -=.++.|-..+ +..|.++. +.=.|-+.--|.....+ +.+-+.++|..+-..-.+.|...+. 
T Consensus 1 ~~~~v~vn~ln-------~~qg~~~~--------ver~~lfig~~~~~~~~~~~~~~~sdld~~lg~~ds~lk~~v~--- 62 (376) T protein:vir:37 1 MFPSVQINALN-------QLSGETKE--------IERHALFVGVGTTNQGKLLALTPDSDFDKVFGETDTDLKKQVR--- 62 (376) T ss_pred CCCeEEEeeee-------ccCCCccc--------ccceEEEeeccccccCceEEecCCCChHHhhCCCchhHHHHHH--- Confidence 00011111110 00111110 00011111111111111 2233333333333111122211000 Q ss_pred ccceEEEEecccccceeEEecccchhhhhhhcc-ccceeEecCcccccHHHHHHHHHhcccceeEEEEEeccC-ChhHHH Q lcl|NC_017984. 156 TVKGFVIKSGTSGANSTISFATGDISDDLKLTQ-ETGAVLNNHTAADTPTTGALNALAFSQNFVNITYSEGVF-NEDALK 233 (487) Q Consensus 156 ~~~~f~its~t~g~~stit~atgd~a~~l~lt~-~~gA~~~~G~aaet~~~al~a~~~~~~~wy~~~~~~~~~-~~~~i~ 233 (487) ++.+.... -...+...+.+.++..+++..+ +....+.+..++..+. +...+. T Consensus 63 -------------------------aa~~naG~~w~a~~~~p~~~~~~~~~Av~~a-~~~~s~E~V~v~~p~~t~~a~i~ 116 (376) T protein:vir:37 63 -------------------------AAMLNAGQNWFAHVYIAQEDGYDFVECVKKA-NQTASFEYCVNTRYLGVDKASIG 116 (376) T ss_pred -------------------------HHHhCCCCceEEEEEecCCChhhHHHHHHHH-HhhCCeeEEEEecCcchhHHHHH Confidence 00000000 0011112223334566666665 3345555555555432 234444 Q ss_pred HH---HHHHhccCceEEEEEcccc-----ccccccchHHHHHHH---h---CCcceEEEecCCCchHHHHHHHHH--hcC Q lcl|NC_017984. 234 DL---ALWVTSQNSRFKLYTWGLD-----PVALGQSGASFGEWA---K---ENTSGVVPLYGTFDKAAFFCGVSG--SIN 297 (487) Q Consensus 234 a~---A~w~~a~~~~~~~~~~~~~-----~~~~~~~~~~~~~l~---~---~~~~~t~~~y~~~~~~a~~~g~~a--s~~ 297 (487) ++ +.-..++-.|+.|+..... .....+=.+....+. + ..+-..++.++.. ..-.++|+++ ++- T Consensus 117 a~qa~a~el~~~~~R~vffile~~g~d~~~~~ge~w~~y~~~l~a~~~gia~~~V~vV~~~~gn-~~G~~aGRl~naaVs 195 (376) T protein:vir:37 117 KLQECYAELLAKFGRRTFFIQAVQGINHDQSDGETWDQYVQKLTTLQQTIVADHVCLVPLLFGN-ETGVLAGRLANRAVT 195 (376) T ss_pred HHHHHHHHHHHhcCCeEEEEEeccCCCCcccccCCHHHHHHHHHHHhccccccceeeeeeeccc-hHHHHHHHHHhCCcc Confidence 43 3333333223333332221 111111122222232 1 1233333443332 3345678874 443 Q ss_pred cCcCCceeeeeeeecCccc----c-----cCCCHHHHHHHHhCCceEEEEecCCCceEEEEECCEEc----CCceehHHH Q lcl|NC_017984. 
298 YQEENGRTTTAFRSQDGLV----P-----DVTNEADAETLVKNGYSFYGAWATANDRFQFAGNGSVT----GQYKWIDNF 364 (487) Q Consensus 298 ~~~~~gs~T~~fk~l~Gv~----~-----~~lt~t~~~al~~~~~n~y~~~~~~~~~~~~~~~G~~s----gg~~~iD~~ 364 (487) -...++++--- -+.|+- | ..++...+.+|+++|+.+.-.|.+.. .-+|.+|.|+ |+|.+|-.+ T Consensus 196 VadspgRV~tG--ai~gl~~~~~p~d~~g~el~~a~l~aLd~arysvpr~Y~gyd--G~Yw~dg~tl~~~gsDYq~ie~~ 271 (376) T protein:vir:37 196 VADSPARVQTG--ALVSLGSANKPLDKDGNELTLAHLKSLETARYSVPMWYPDYD--GYYRADGRTLDVEGGDYQVIENL 271 (376) T ss_pred hhcCccceeec--ccccccccccccccCCcccchHHHHHHHhCCCeEEEeeCCCC--ceEEeCCeEeccCCCCeeeehhc Confidence 34456665221 123331 1 23788999999999999999998754 4588999987 335566666 Q ss_pred HHHHHHHHHHHHHHHHHHHhcCCCCcCHhhHHHHHHHHHHHHHHHHhcCccc----cCcccCccccccccccccCccccc Q lcl|NC_017984. 365 DFQVFLRTQLQLAYMNMFQAQKTIPYNDQGIATVRAYSQDPIDQGINFGGIR----AGVNLSNAQKFQVNQEAGFDAASQ 440 (487) Q Consensus 365 ~~~dWl~~~iq~~l~~ll~~~~kIPyt~~G~~~l~~~v~~vl~~a~~nG~Ia----~Gv~l~~~q~~~~~~~~g~~~~~~ 440 (487) +=.|=...+++.....-+ ....+.-|+.+++..+..+..+|++..+.+-|. ||- +.. |..++ T Consensus 272 RVvdKa~R~vR~~Ai~~i-~Dr~lnstp~sia~~~~~~~~pLr~M~ks~ei~g~~fpge---------i~~----P~d~d 337 (376) T protein:vir:37 272 RVVDKVARKVRLLAIGKI-ADRSFNSTTSSTEYHKNYFAKPLRDMSKSATINGKDFPGE---------CMP----PKDDA 337 (376) T ss_pred hHHHHHHHHHHHHHHHHh-cCccccCChhHHHHHHHHHhHHHHHHHhhhhhccccccce---------eec----CCCCc Confidence 655555555554443333 344577889999999999999999987665443 331 111 11111 Q ss_pred eeeeeeEEeccCCCHHH----Hhhccc--C-CeEEEEEECCeEEEEE Q lcl|NC_017984. 441 LFTKGWALSVTLPDSQT----RVARES--F-IIKLFYTDGSSMQRLE 480 (487) Q Consensus 441 ~~~~Gy~~~~~~~s~~d----ra~R~~--~-~i~~~~~~aGAIh~v~ 480 (487) +. |.-.+... -.-|-+ | .|+....|.=+ ...+ T Consensus 338 I~-------i~w~sk~~V~I~~~vrPy~cpk~i~~~I~LDls-~~~~ 376 (376) T protein:vir:37 338 IT-------IVWQSKTKVTIYIKVRPYDCPKEITANIFLDLD-SLGE 376 (376) T ss_pred eE-------EEeccCceEEEEEEEeeecCcceeEEEEEEecC-CCCC Confidence 10 00000000 001111 1 11111111000 0000 Done!