Query         lcl|NC_019451.1_cdsid_YP_007002577.1 [gene=F356_gp31] [protein=hypothetical protein] [protein_id=YP_007002577.1] [location=18929..20443]
Match_columns 504
No_of_seqs    163 out of 202
Neff          7.7
Searched_HMMs 1612
Date          Thu Nov 7 17:18:32 2013
Command       /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_31 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_31_vs_rec_db.hhr

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 protein:vir:96104 Length: 504  100.0  4E-178  2E-181  993.2  51.4  504     1-504  1-504 (504)
  2 protein:vir:99586 Length: 507  100.0  3E-175  2E-178  977.2  50.9  504     1-504  1-507 (507)
  3 protein:vir:107720 Length: 515 100.0  2E-159  1E-162  890.2  47.1  503     1-504  2-515 (515)
  4 protein:vir:78611 Length: 501  100.0  1E-158  8E-162  886.1  50.1  487     1-504  5-500 (501)
  5 protein:vir:106730 Length: 501 100.0  5E-158  3E-161  883.1  50.8  487     1-504  5-500 (501)
  6 protein:vir:3636 Length: 501 # 100.0  9E-158  5E-161  881.6  51.0  488     1-504  5-500 (501)
  7 protein:vir:101576 Length: 501 100.0  9E-157  6E-160  876.1  49.9  487     1-504  5-500 (501)
  8 protein:vir:94073 Length: 494  100.0  4E-150  3E-153  839.5  47.6  483     1-504  3-493 (494)
  9 protein:vir:5260 Length: 502 # 100.0  4E-143  2E-146  801.5  50.1  478     1-504  2-501 (502)
 10 protein:vir:95263 Length: 450  100.0  3E-118  2E-121  664.6  43.3  430     3-504  1-448 (450)
 11 protein:vir:80052 Length: 331  100.0 1.5E-89 9.3E-93  507.6  36.3  328     3-504  1-330 (331)
 12 protein:vir:3165 Length: 426 # 100.0 4.8E-68   3E-71  389.7  29.0  405     2-504  1-425 (426)
 13 protein:vir:105470 Length: 451  99.2 1.7E-10 1.1E-13   74.1  28.8  413     1-504  6-451 (451)
 14 protein:vir:102957 Length: 437  99.2 6.6E-10 4.1E-13   70.9  31.8  403     1-504  6-437 (437)
 15 protein:vir:4517 Length: 498 #  99.1 1.7E-09   1E-12   68.7  35.1  443     1-493  7-498 (498)
 16 protein:vir:99306 Length: 587   99.1 2.6E-09 1.6E-12   67.6  34.3  445     1-504  8-581 (587)
 17 protein:vir:4463 Length: 498 #  99.0 3.9E-09 2.4E-12   66.7  32.9  443     1-493  7-498 (498)
 18 protein:vir:489 Length: 498 #   99.0 8.9E-09 5.5E-12   64.7  34.1  446     1-493  7-498 (498)
 19 protein:vir:95741 Length: 587   98.8 2.8E-08 1.7E-11   62.0  36.8  444     1-504  8-581 (587)
 20 protein:vir:78986 Length: 436   98.8   5E-08 3.1E-11   60.6  31.5  400     1-504  8-436 (436)
 21 protein:vir:1996 Length: 495 #  98.7 1.1E-07 6.7E-11   58.8  31.5  438     1-498  8-495 (495)
 22 protein:vir:107865 Length: 477  98.6 2.1E-07 1.3E-10   57.2  32.0  422     1-504  1-466 (477)
 23 protein:vir:79092 Length: 477   98.6   3E-07 1.9E-10   56.3  34.5  427     1-504  1-466 (477)
 24 protein:vir:6079 Length: 396 #  98.5 3.7E-07 2.3E-10   55.8  29.0  366     1-504  1-382 (396)
 25 protein:vir:96586 Length: 587   98.5 5.3E-07 3.3E-10   55.0  35.9  448     1-504  5-581 (587)
 26 protein:vir:63742 Length: 562   98.4   1E-06 6.3E-10   53.4  36.6  438     1-504  8-556 (562)
 27 protein:vir:80488 Length: 562   98.4 1.1E-06 6.8E-10   53.3  36.9  442     1-504  1-556 (562)
 28 protein:vir:80779 Length: 569   98.3 1.4E-06 8.4E-10   52.8  35.3  442     1-504  5-563 (569)
 29 protein:vir:1172 Length: 391 #  98.3   2E-06 1.3E-09   51.8  27.0  362     1-504  1-378 (391)
 30 protein:vir:98824 Length: 774   98.2 9.4E-07 5.8E-10   53.6  18.5  442     1-504  273-766 (774)
 31 protein:vir:100829 Length: 607  98.2 3.5E-06 2.2E-09   50.5  33.0  450     1-504  14-595 (607)
 32 protein:vir:102359 Length: 356  98.1 4.4E-06 2.7E-09   50.0  22.4  328   119-503  1-356 (356)
 33 protein:vir:2035 Length: 396 #  98.0 6.3E-06 3.9E-09   49.1  28.2  366     1-504  1-382 (396)
 34 protein:vir:1845 Length: 392 #  98.0 8.5E-06 5.3E-09   48.4  29.1  363     1-504  1-379 (392)
 35 protein:vir:103993 Length: 390  97.8 1.9E-05 1.2E-08   46.5  28.5  359     1-504  1-377 (390)
 36 protein:vir:78206 Length: 390   97.8 1.9E-05 1.2E-08   46.5  28.5  359     1-504  1-377 (390)
 37 protein:vir:79141 Length: 391   97.6   4E-05 2.5E-08   44.7  27.0  359     1-504  1-377 (391)
 38 protein:vir:108052 Length: 660  97.6 4.2E-05 2.6E-08   44.6  32.3  458     1-504  1-646 (660)
 39 protein:vir:98553 Length: 395   97.5 6.1E-05 3.8E-08   43.7  30.2  365     1-504  1-382 (395)
 40 protein:vir:5711 Length: 396 #  97.3 9.5E-05 5.9E-08   42.6  29.6  365     1-504  1-382 (396)
 41 protein:vir:104858 Length: 729  97.2 0.00012 7.4E-08   42.1  23.0  446     1-504  204-716 (729)
 42 protein:vir:10336 Length: 386   97.1 0.00018 1.1E-07   41.2  28.0  360     1-504  1-378 (386)
 43 protein:vir:101187 Length: 663  97.1 0.00018 1.1E-07   41.2  31.8  458     1-504  1-647 (663)
 44 protein:vir:96740 Length: 388   97.0 0.00022 1.3E-07   40.7  24.0  349    71-504  1-376 (388)
 45 protein:vir:6594 Length: 666 #  96.9 0.00027 1.7E-07   40.1  34.2  458     1-504  1-650 (666)
 46 protein:vir:79181 Length: 390   96.9  0.0003 1.8E-07   39.9  29.1  359     1-504  1-377 (390)
 47 protein:vir:5833 Length: 742 #  96.6 0.00045 2.8E-07   38.9  25.9  435     1-504  198-735 (742)
 48 protein:vir:100323 Length: 393  96.5 0.00055 3.4E-07   38.5  31.2  359     1-504  1-379 (393)
 49 protein:vir:80984 Length: 666   96.5 0.00057 3.5E-07   38.4  31.6  456     1-504  1-650 (666)
 50 protein:vir:100539 Length: 663  96.4 0.00064   4E-07   38.1  31.2  457     1-504  1-647 (663)
 51 protein:vir:101804 Length: 663  96.0  0.0011   7E-07   36.8  33.7  460     1-504  1-647 (663)
 52 protein:vir:107310 Length: 581  95.7  0.0016   1E-06   35.9  26.2  417     1-504  100-565 (581)
 53 protein:vir:7206 Length: 659 #  95.7  0.0017   1E-06   35.8  36.9  457     1-504  1-645 (659)
 54 protein:vir:106984 Length: 743  95.6  0.0018 1.1E-06   35.7  28.2  439     1-504  216-731 (743)
 55 protein:vir:79798 Length: 717   95.6  0.0018 1.1E-06   35.6  20.1  415     1-504  228-716 (717)
 56 protein:vir:5663 Length: 671 #  95.4  0.0021 1.3E-06   35.3  34.3  456     1-504  1-660 (671)
 57 protein:vir:6894 Length: 660 #  95.0  0.0029 1.8E-06   34.5  35.6  458     1-504  1-645 (660)
 58 protein:vir:98263 Length: 664   94.9  0.0033 2.1E-06   34.2  34.2  457     1-504  1-649 (664)
 59 protein:vir:106427 Length: 679  94.1  0.0053 3.3E-06   33.1  28.2  443     1-504  143-664 (679)
 60 protein:vir:104477 Length: 749  93.9  0.0059 3.6E-06   32.8  28.2  403     1-504  276-738 (749)
 61 protein:vir:103456 Length: 659  93.9  0.0061 3.8E-06   32.7  34.1  457     1-504  1-645 (659)
 62 protein:vir:7653 Length: 581 #  88.4   0.032   2E-05   28.7  26.9  417     1-504  100-565 (581)
 63 protein:vir:3788 Length: 376 #  67.2    0.26 0.00016   23.8  26.1  321   119-504  1-370 (376)
 64
protein:vir:102819 Length: 648 66.1 0.27 0.00017 23.7 30.9 447 1-504 9-644 (648) No 1 >protein:vir:96104 Length: 504 # NCBI annotation: hypothetical protein ORF029 # Family: family:all:396 # MgeID: mge:1597 # MgeName: F8 # Cross-refs: genbank:acc:YP_001294446;genbank:gi:149408343;genbank:GeneID:5237223 Probab=100.00 E-value=4e-178 Score=993.16 Aligned_cols=504 Identities=97% Similarity=1.357 Sum_probs=490.6 Q ss_pred CCCccceEEEeeeecccccccccccceEEEecccccCccceEEecCHHHHHHhcCCCcHHHHHHHHHhccCCCCCcccce Q lcl|NC_019451. 1 MISQSRYIRIISGVGAGAPVAGRKLILRVMTTNNVIPPGIVIEFDNANAVLSYFGAQSEEYQRAAAYFKFISKSVNSPSS 80 (504) Q Consensus 1 mip~s~iv~V~~~v~~~~~~~~~~~~~l~l~~~~~~~~~r~~~y~s~~~V~~~Fg~~s~ey~aA~~yF~~~~~~~~~P~~ 80 (504) |||+||||||+|+|+++++.+++|+++|||++|+++|+||+|+|+|+++|++|||.+||||+||++||+|+||+++||++ T Consensus 1 mip~s~iV~V~~~v~~~~~~~~~~~~~l~l~~~~~~~~~r~~~y~s~~~V~~~FG~~S~ey~aA~~yF~~~~~~~~~P~~ 80 (504) T protein:vir:96 1 MISQSRYIRIISGVGAGAPVAGRKLILRVMTTNNVIPPGIVIEFDNANAVLSYFGAQSEEYQRAAAYFKFISKSVNSPSS 80 (504) T ss_pred CCCccceeEeeecccccccccccccceeEeecccCCCccceEEecCHHHHHHhcCCChHHHHHHHHHhhcCCCCCccccE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEeeeccCCcceeeecccchhhHhHhhccCceEEEEEcccceeeeeeccccccchHHHHHHHHhhhhcccccccceeE Q lcl|NC_019451. 
81 ISFARWVNTAIAPMVVGDNLPKTIADFAGFSAGVLTIMVGAAEQNITAIDTSAATSMDNVASIIQTEIRKNADPQLAQAT 160 (504) Q Consensus 81 l~igr~~~~a~~~~l~g~~~~~~~~~~~~~~~g~~titi~g~~~~~~~i~~s~ats~~~vA~~i~~~i~a~~~~~~~~a~ 160 (504) ||||||++++++++|+|+++++++.+++++++|+|+|+|+|+.+++.+||||.+++|+++|+.|++++++...+...+++ T Consensus 81 l~igR~~~~a~~~~l~g~~~~~~~~~~~~i~~G~lsitv~G~~~~~~~i~~S~~ts~~~vA~~i~~al~~~~~~~~~~~t 160 (504) T protein:vir:96 81 ISFARWVNTAIAPMVVGDNLPKTIADFAGFSAGVLTIMVGAAEKNITAIDTSAATSMDNVASIIQTEIRKNTDPQLAQAT 160 (504) T ss_pred EEEEeecCcCccceEEechhHHHHHHHhhhhceEEEEEEcceeeeecccccccccchHHHHHHHHhhhhcccccccccce Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEecccceeEEecccCcceeEEEEeeccchhhhhhhhcccCcceeecccccccHHHHHHHHHhhccceeEEEEEeccCC Q lcl|NC_019451. 161 VTWNQNTNQFTLVGATIGTGVLAVAKSADPQDMSTALGWSTSNVVNVAGQAADLPDAAVAKSTNVSNNFGSFLFAGAPLD 240 (504) Q Consensus 161 vt~d~~~~~F~its~t~ga~s~~~~~sa~~~~ia~~l~~t~~~~~~~~g~aaet~~~al~~~~~~~~~wy~~~~~~~~~~ 240 (504) |+||.+.++|+++++++|+.+.....++.+++++.++|++.++.+.++|.++|+|.++|+++.+++++||+|+++++.++ T Consensus 161 v~~d~~~~~f~its~~tg~~~~~~~~~a~~~~~~~~lgl~~~~~~~v~g~~aet~~~al~al~~~~~~Wy~f~~a~~~~~ 240 (504) T protein:vir:96 161 VTWNPNTNQFTLVGATIGTGVLAVAKSADPQDMSTALGWSTSNVVNVAGQAADLPDAAVAKSTNVSNNFGSFLFAGATLD 240 (504) T ss_pred EEEeccCCeEEEEeeccccceeEEEeeccccchhhhhhcccccceEEeecccccHHHHHHHHHhhcCCeEEEEEEeccCC Confidence 99999999999999999999988888899999999999999888999999999999999999999999999999988889 Q ss_pred HHHHHHHHHHHhhcCCcEEEEEeccccchhHHHHHHhhhcceeEEEEeccCCCccHHHHHHHHHHhcCcCcCCceeeecc Q lcl|NC_019451. 
241 NDQIKAVSAWNAAQNNQFIYTVATSLANLGTLFTLVNGNAGTALNVLSATAANDFVEQCPSEILAATNYDEPGASQNYMY 320 (504) Q Consensus 241 ~~~~~a~A~w~e~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~aa~~g~~as~nf~~~~g~~T~kf 320 (504) +++++++|+|+|+++++|+|++++++++.............+...+++....+++++++++++++++||++.||++|||| T Consensus 241 dd~ilalA~w~ea~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~as~~f~~~ng~~T~~f 320 (504) T protein:vir:96 241 NDQIKAVSAWNAAQNNQFIYTVATSLANLGALFDLVKGNSGTALNVLSATASNDFVEQCPSEILAATNYDEPGASQNYMY 320 (504) T ss_pred HHHHHHHHHHHhhcCceEEEEEeecccchhhHHHhhhhcceeEEEEeecCccchhHHHHHHHHHHhcCcCcccccccccc Confidence 99999999999999999999999988877766666666677777777777888999999999999999999999999999 Q ss_pred cccCccccccCCHHHHHHHHhCCCeEEEEEeeccceeeEEEcCEEeCCccccchhhhhhhHHHHHHHHHHHHHHHHhcCC Q lcl|NC_019451. 321 YQFPGRNITVSDDTVANTVDKSRGNYIGVTQANGQQLAFYQRGILCGGPTDAVDMNVYANEIWLKSAIAQALLDLFLNVN 400 (504) Q Consensus 321 k~l~Gv~a~~lt~t~~~al~~~~~n~y~~~~~~~~~~~~~~~G~~~~G~y~~~wiD~~~~~dwl~~~lq~~l~~l~~~~~ 400 (504) |+++||+|++++++|+++|+++|||||+.|++++++++|++||+|+||+|+|+|||+++|+||||++||++|++||++++ T Consensus 321 k~l~GVta~~lt~t~~~aL~~~~~N~y~~~a~~~~~~~~~~~G~~~gG~~~~~wiDv~~~~~WL~~~lq~~l~~l~~~~~ 400 (504) T protein:vir:96 321 YQFPGRNITVSDDTAANTVDKSRGNYIGVTQANGQQLAFYQRGILCGGPTDAVDMNVYANEIWLKSAIAQALLDLFLNVN 400 (504) T ss_pred cccCCcCcccCCHHHHHHHHhcCCeEEEEeecccceeeEEecCeeeCCccccchhhhhhhHHHHHHHHHHHHHHHHhcCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCCcCHHHHHHHHHHHHHHHHHHHhcCccccccccCcccceeeccccCCcccccceeecceEEEecchHhCCHHHHhhcc Q lcl|NC_019451. 
401 AVPASSTGEAMTLAVLQPVLDKATANGTFTYGKEISAVQQQYITQVTGDRRAWRQVQTLGYWINITFSSYTNSNTGLTEW 480 (504) Q Consensus 401 kIPyt~~G~~~l~~~v~~vl~~a~~nG~Ia~G~~~~~~q~~~i~~~~g~~~~~~~~~~~GY~v~~~~~~~~s~~dr~~R~ 480 (504) |||||+.|++||+++|+++|++|++||+|+||+|+++.|+++|+++.|++++++++++|||||++|++++++++||++|+ T Consensus 401 kIPyt~~Gi~~l~a~i~~vl~~av~~G~I~~Gv~~~~~q~~~I~~~~~~d~~~~~~~~~GYyv~~~~~s~~s~~~r~~R~ 480 (504) T protein:vir:96 401 AVPASMVGEAMTLAVLQPVLDKATSNGTFTYGKDISAVQQQYITQITGDRRAWRQVQTLGYWINITFSSYTNSNTGLTEW 480 (504) T ss_pred CcccCHhhHHHHHHHHHHHHHHHHhcceeccCccCCccchheecccccccccccceeccceEEEecChhccChhHhhhcc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cCceEEEEEECCeEEEEEeeeeeC Q lcl|NC_019451. 481 KANYTLIYSKGDAIRFVEGSDVMI 504 (504) Q Consensus 481 ~~~i~~~~~~aGAIh~v~i~~~~v 504 (504) +|+|+|+|+||||||+|+|+++|| T Consensus 481 ~~~~~~~y~~~gaI~~v~~~~~~v 504 (504) T protein:vir:96 481 KANYTLIYSKGDAIRFVEGSDVMI 504 (504) T ss_pred ccceEEEEEECCeEEEEEeccccC Confidence 999999999999999999999999 No 2 >protein:vir:99586 Length: 507 # NCBI annotation: hypothetical protein # Family: family:all:396 # MgeID: mge:1544 # MgeName: BcepF1 # Cross-refs: genbank:acc:YP_001039795;genbank:gi:126011045;genbank:GeneID:4818281 Probab=100.00 E-value=3.2e-175 Score=977.25 Aligned_cols=504 Identities=59% Similarity=0.965 Sum_probs=486.6 Q ss_pred CCCccceEEEeeeecccccccccccceEEEecccccCccceEEecCHHHHHHhcCCCcHHHHHHHHHhccCCCCCcccce Q lcl|NC_019451. 
1 MISQSRYIRIISGVGAGAPVAGRKLILRVMTTNNVIPPGIVIEFDNANAVLSYFGAQSEEYQRAAAYFKFISKSVNSPSS 80 (504) Q Consensus 1 mip~s~iv~V~~~v~~~~~~~~~~~~~l~l~~~~~~~~~r~~~y~s~~~V~~~Fg~~s~ey~aA~~yF~~~~~~~~~P~~ 80 (504) |||+||||||+|+|+++++++++|+++|||++|+++|+||+|+|+|+++|++|||.+||||+||++||+|+||+++||++ T Consensus 1 mip~s~iVnV~~~v~~~a~~~~~~~~~lilt~~~~~~~~r~~~y~s~~~V~~~FG~~S~ey~aA~~yFsq~p~~~~~P~~ 80 (507) T protein:vir:99 1 MISQSRYVRIVSGVGAGAPVAQRRLIMRVMTTNAVLPPGVVFESSSADAVGAYFGMASEEYKRAKAYMSFISKSINSPSY 80 (507) T ss_pred CCCccceeEEeeeccccCcccccccceeeeccccCCCccceEeecCHHHHHHhcCCChHHHHHHHHHhccCCCCCcccce Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEeeeccCCcceeeecccchhhHhHhhccCceEEEEEcccceeeeeeccccccchHHHHHHHHhhhhcccccccceeE Q lcl|NC_019451. 81 ISFARWVNTAIAPMVVGDNLPKTIADFAGFSAGVLTIMVGAAEQNITAIDTSAATSMDNVASIIQTEIRKNADPQLAQAT 160 (504) Q Consensus 81 l~igr~~~~a~~~~l~g~~~~~~~~~~~~~~~g~~titi~g~~~~~~~i~~s~ats~~~vA~~i~~~i~a~~~~~~~~a~ 160 (504) ||||||++++++++|+|++++..+.+++++++|+|+|+|||+.+++.+||||++++|+++|+.|+++|+++..+...+++ T Consensus 81 L~igR~~~~~~~a~l~g~~~~~~l~~~~~~~~G~lti~v~G~~~t~~~i~lS~~ts~~~vAs~i~~~l~a~~~~~~~~~t 160 (507) T protein:vir:99 81 ISFARWVNAAIASMIVGDSLVKNLPALKAVATPTLSLSIGGTVVPIAGIDLTAALTLTDVAATLQTKIRASANAELATAT 160 (507) T ss_pred EEEEeecCccccceeecchhhhhHHHHhhhcceeEEEEEcCceeEeccccccccCCHHHHHHHHHHhhhccccccccceE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEecccceeEEecccCcceeEEEEee--ccchhhhhhhhcccCcceeecccccccHHHHHHHHHhhccceeEEEEEec- Q lcl|NC_019451. 161 VTWNQNTNQFTLVGATIGTGVLAVAKS--ADPQDMSTALGWSTSNVVNVAGQAADLPDAAVAKSTNVSNNFGSFLFAGA- 237 (504) Q Consensus 161 vt~d~~~~~F~its~t~ga~s~~~~~s--a~~~~ia~~l~~t~~~~~~~~g~aaet~~~al~~~~~~~~~wy~~~~~~~- 237 (504) |+||.+.++|++++.++|+.+.+...+ ..+++++.+++++.++.+.++|.++|+|.++|+++++.++|||+|.+++. 
T Consensus 161 v~~d~~~~~F~v~s~~tG~~s~i~~at~~~~gt~~s~l~~~~~~~a~~~~g~~aet~~~a~~a~~~~~~nW~~~~~a~~~ 240 (507) T protein:vir:99 161 VTFNTTTNQFVLNGTTTGALAPTITAVRTDPATDISSLLGWTNTGTVFVKGQAAETPDTSISKSAAISTNFGSFIYTSTP 240 (507) T ss_pred EEEecCCceEEEEeeeccccceeEEEEcCCchhhHHHHhccccccceEeecccccCHHHHHHHHHhhcCCeEEEEEEecc Confidence 999999999999999999877665554 35788999999998888999999999999999999999999999998765 Q ss_pred cCCHHHHHHHHHHHhhcCCcEEEEEeccccchhHHHHHHhhhcceeEEEEeccCCCccHHHHHHHHHHhcCcCcCCceee Q lcl|NC_019451. 238 PLDNDQIKAVSAWNAAQNNQFIYTVATSLANLGTLFTLVNGNAGTALNVLSATAANDFVEQCPSEILAATNYDEPGASQN 317 (504) Q Consensus 238 ~~~~~~~~a~A~w~e~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~aa~~g~~as~nf~~~~g~~T 317 (504) .+++++++++|+|+|+|+++|+|..+++++........+.......+....+..+.+|+++++||+++++||++.||++| T Consensus 241 ~~td~~~lalA~wiea~~~~f~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~aa~~g~~as~nf~~~ng~~T 320 (507) T protein:vir:99 241 ALTNDQITAVASWNASQNNMYMYSVPTTIANIGTLYAAVKGFSGCALNITSDSLPVDYIEQSPCEILAATDYTRVNATQN 320 (507) T ss_pred ccChHHHHHHHHHHhhcCcEEEEEEecCchhhhhhhhhhhhcceeEEEeecccccchhHHHHHHHHHHhhccCcCcccee Confidence 47899999999999999999999999998887777777777777777777778888999999999999999999999999 Q ss_pred ecccccCccccccCCHHHHHHHHhCCCeEEEEEeeccceeeEEEcCEEeCCccccchhhhhhhHHHHHHHHHHHHHHHHh Q lcl|NC_019451. 
318 YMYYQFPGRNITVSDDTVANTVDKSRGNYIGVTQANGQQLAFYQRGILCGGPTDAVDMNVYANEIWLKSAIAQALLDLFL 397 (504) Q Consensus 318 ~kfk~l~Gv~a~~lt~t~~~al~~~~~n~y~~~~~~~~~~~~~~~G~~~~G~y~~~wiD~~~~~dwl~~~lq~~l~~l~~ 397 (504) ||||++|||+|++++++|+++|++||||||+.|++++++++|+++|+|+||+|+|+|||+++|+||||++||.+|++||+ T Consensus 321 ~~fk~l~GV~a~~lt~t~a~al~~~n~N~y~~~a~~~~~~~~~~~G~~~gG~~~fid~d~~~~~~WL~~~iq~~l~~l~~ 400 (507) T protein:vir:99 321 YMYYQFPSRNITVSDDTTANLVDANRGNYIGQTQSAGQSLAFYQRGILCGGPNDAVDMNIYANEIWLKSAISAQILSLFL 400 (507) T ss_pred ecccccCCcccccCCHHHHHHHHhcCCeEEEEeccccceeeEEecCeeeCCcccceeeeeecchHHHHHHHHHHHHHHHh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cCCCCCcCHHHHHHHHHHHHHHHHHHHhcCccccccccCcccceeeccccCCcccccceeecceEEEecchHhCCHHHHh Q lcl|NC_019451. 398 NVNAVPASSTGEAMTLAVLQPVLDKATANGTFTYGKEISAVQQQYITQVTGDRRAWRQVQTLGYWINITFSSYTNSNTGL 477 (504) Q Consensus 398 ~~~kIPyt~~G~~~l~~~v~~vl~~a~~nG~Ia~G~~~~~~q~~~i~~~~g~~~~~~~~~~~GY~v~~~~~~~~s~~dr~ 477 (504) +++|||||+.|+++|+++|+++|++|++||+|+||+++++.|+++|+++.|+++.++++++|||||++|+++.+++++|+ T Consensus 401 ~~~kIPyt~~G~~~l~a~i~~~l~~av~nG~I~~Gvtl~~~q~~~in~~~~~~~~~~~~~~~Gyy~~~~~~s~~~~~~r~ 480 (507) T protein:vir:99 401 NVPRVPANETGESMLLSVIQSVVNTAKNNGTISAGKNLNVIQQQYITQISGDANAWRQVANIGYWLNITFSSYTNPNTQL 480 (507) T ss_pred cCCCCccChhhHHHHHHHHHHHHHHHHhccccccCCcccccchheecccccccccccceeccceEEEeCChHhcChhhhh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hcccCceEEEEEECCeEEEEEeeeeeC Q lcl|NC_019451. 
478 TEWKANYTLIYSKGDAIRFVEGSDVMI 504 (504) Q Consensus 478 ~R~~~~i~~~~~~aGAIh~v~i~~~~v 504 (504) +|++|+|+|||++||+||+|+|++++| T Consensus 481 ~r~~~~~~~~y~~~gaI~~v~~~~~~v 507 (507) T protein:vir:99 481 TEWKASYQLIYSKDDAIRFVEGTDTLI 507 (507) T ss_pred ccccceEEEEEEeCCeEEEEEeeeecC Confidence 999999999999999999999999999 No 3 >protein:vir:107720 Length: 515 # NCBI annotation: gp17 # Family: family:all:396 # MgeID: mge:1520 # MgeName: BcepB1A # Cross-refs: genbank:acc:YP_024865;genbank:gi:48697507;genbank:GeneID:2948336 Probab=100.00 E-value=2.4e-159 Score=890.18 Aligned_cols=503 Identities=34% Similarity=0.571 Sum_probs=450.5 Q ss_pred CCCccceEEEeeeecccccccccccceEEEecccccCccceEEecCHHHHHHhcCCCcHHHHHHHHHhccCCCCCcccce Q lcl|NC_019451. 1 MISQSRYIRIISGVGAGAPVAGRKLILRVMTTNNVIPPGIVIEFDNANAVLSYFGAQSEEYQRAAAYFKFISKSVNSPSS 80 (504) Q Consensus 1 mip~s~iv~V~~~v~~~~~~~~~~~~~l~l~~~~~~~~~r~~~y~s~~~V~~~Fg~~s~ey~aA~~yF~~~~~~~~~P~~ 80 (504) =||.+++|+|+|+|.++++.+++.+.+|+|++++++|+||+|+|+|+++|++|||++||||+||++||+++.|++|||++ T Consensus 2 ~I~~~~~V~i~~~v~aa~~~~~~~f~~li~t~~~~~p~~r~~~y~s~~~V~~~FG~~S~ey~aA~~yFsg~~~q~p~P~~ 81 (515) T protein:vir:10 2 PISFDKYVAITSGVAAQQQIAARSFAIRVYTPNPMVSVDRLITATSAADVGAYFGTASEEYKRAVKNFGFISKKTRRPTS 81 (515) T ss_pred CCCceeEEEeecccccCCccccccceeeeeecccCCCccceeeecCHHHHHHhcCCChHHHHHHHHHhhhccCCcccccE Confidence 34444555555566555555554334789999999999999999999999999999999999999999999999999999 Q ss_pred EEEEeeeccCCcceeeecccc-hhhHhHhhccCceEEEEEcccce-eeeeeccccccchHHHHHHHHhhhhcccccccce Q lcl|NC_019451. 81 ISFARWVNTAIAPMVVGDNLP-KTIADFAGFSAGVLTIMVGAAEQ-NITAIDTSAATSMDNVASIIQTEIRKNADPQLAQ 158 (504) Q Consensus 81 l~igr~~~~a~~~~l~g~~~~-~~~~~~~~~~~g~~titi~g~~~-~~~~i~~s~ats~~~vA~~i~~~i~a~~~~~~~~ 158 (504) ||||||++++++++|+|+++. .++.+|+.+++|+|+|+|||+.+ ++++||||.+++|+++|+.|+++|.++.++.... 
T Consensus 82 L~igR~~~~a~~~~l~g~~~~~~~l~~~~~is~G~ltitidG~~~~t~s~i~~S~ats~~~vAs~i~tal~~~~~~~~~~ 161 (515) T protein:vir:10 82 IQFARWQREAGPVAIYGGAKKAAALATLQAVTAGAISFLFGGATTVTVSGISFSAATSLADVASELQTALRANADANLAT 161 (515) T ss_pred EEEEeccCcccceEEEeccchhhhHHhhhcccceeEEEEEcceEEEEeeccccccccCHHHHHHHHHhhhccccccccce Confidence 999999999999999999885 67889999999999999999886 7899999999999999999999999999999999 Q ss_pred eEEEEecccceeEEecccCcceeEEEEe----eccchhhhhhhhcccC-cceeecccccccHHHHHHHHHhhccceeEEE Q lcl|NC_019451. 159 ATVTWNQNTNQFTLVGATIGTGVLAVAK----SADPQDMSTALGWSTS-NVVNVAGQAADLPDAAVAKSTNVSNNFGSFL 233 (504) Q Consensus 159 a~vt~d~~~~~F~its~t~ga~s~~~~~----sa~~~~ia~~l~~t~~-~~~~~~g~aaet~~~al~~~~~~~~~wy~~~ 233 (504) ++|+||.+.++|++++.++|..+++... ++.+++++.+|||+.+ +.+.++|.++|+|.++|+++.+.++|||+|+ T Consensus 162 ~tv~~d~~~~~F~v~s~~tG~~~~is~~~~t~~~~~t~~a~~lglt~~~~av~~~g~aaet~~~a~~a~~~~s~nWy~f~ 241 (515) T protein:vir:10 162 CTVSYDPVGARFNFAGSPSDDTVQESISIVPQSNPAIDVAQLLGWNSAQGASYIAASPVVSPVDTLIASVAGNNNFGSIL 241 (515) T ss_pred eEEEEecCCCeEEEEEeecCCceeEEEEEecCCCchhhHHHHhccccccceEEecccccccHHHHHHHHHhccCCeEEEE Confidence 9999999999999999999987766443 3457889999999865 5688999999999999999999999999999 Q ss_pred EEec---cCCHHHHHHHHHHHhhcCCcEEEEEeccccchhHHHHHHhhhcceeEEEEeccCCCccHHHHHHHHHHhcCcC Q lcl|NC_019451. 
234 FAGA---PLDNDQIKAVSAWNAAQNNQFIYTVATSLANLGTLFTLVNGNAGTALNVLSATAANDFVEQCPSEILAATNYD 310 (504) Q Consensus 234 ~~~~---~~~~~~~~a~A~w~e~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~aa~~g~~as~nf~ 310 (504) ++++ ..++++++++|+|+|+++++|+|...+.+.................+.+..+....+|++++++|+++++||+ T Consensus 242 ~a~~~~~~~~~a~~~a~a~~~e~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~a~~~g~~asvnf~ 321 (515) T protein:vir:10 242 FTKNGGTGITLSDAEAIALQNQSYNVAYKFQVGVDDTTYSSWQAALAAIGGVNMIYSPVALAAEYHDMQDGIIEAATDFT 321 (515) T ss_pred EeecCccccchhHHHHHHHHHhhcCceEEEEeccCccceechhhhhhhhhhcCceEEEEeccCcchHHHHHHHHHhcCCC Confidence 9864 3568899999999999999999988877666554544444455566666666677889999999999999999 Q ss_pred cCCceeeecccccCccccccCCHHHHHHHHhCCCeEEEEEeeccceeeEEEcCEEeCCccccchhhhhhhHHHHHHHHHH Q lcl|NC_019451. 311 EPGASQNYMYYQFPGRNITVSDDTVANTVDKSRGNYIGVTQANGQQLAFYQRGILCGGPTDAVDMNVYANEIWLKSAIAQ 390 (504) Q Consensus 311 ~~~g~~T~kfk~l~Gv~a~~lt~t~~~al~~~~~n~y~~~~~~~~~~~~~~~G~~~~G~y~~~wiD~~~~~dwl~~~lq~ 390 (504) +.||++||||||+|||+|++++++|+++|++||||||+.|.++|++++|++||+|+||+|+|+|||++||+||||++||+ T Consensus 322 ~~ng~iT~kfKq~~Gita~~lt~t~a~al~~~~~N~Y~~~~~~~~~~~~~~~G~~~gG~~~~~WiD~~~g~~WL~~~iq~ 401 (515) T protein:vir:10 322 QQGGATGYMYVQFNNQTPAVNDDTLSGILDDLNINYYGQTQVNGTNLSFYQDGVMMGGPTDPRDSNVYANEQWLKSYAGA 401 (515) T ss_pred ccchhheeccccCCCCccccCCHHHHHHHHhcCCeEEEEEeccCceEEEEeCCeeeCCccchhHHHHHhhHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHhcCCCCCcCHHHHHHHHHHH-HHHHHHHHhcCccccccccCcccceeeccccCCcccccceeecceEEEecchH Q lcl|NC_019451. 
391 ALLDLFLNVNAVPASSTGEAMTLAVL-QPVLDKATANGTFTYGKEISAVQQQYITQVTGDRRAWRQVQTLGYWINITFSS 469 (504) Q Consensus 391 ~l~~l~~~~~kIPyt~~G~~~l~~~v-~~vl~~a~~nG~Ia~G~~~~~~q~~~i~~~~g~~~~~~~~~~~GY~v~~~~~~ 469 (504) +|++||++++|||||+.|+++|++.| +++|+||++||+|+|||||++.|+++|+++.|+|..++++++||||+++|+++ T Consensus 402 ~l~~L~~s~~KIPytd~G~a~i~a~v~q~vl~~av~nG~I~~Gv~ls~~Q~~~i~~~~g~d~~~~~~~~~Gyy~~~~~~~ 481 (515) T protein:vir:10 402 SFMSLQLAQGKIPANIEGRGLLLGKMTKDIIPAAKLNGTFSIGKTLTVDQQLFVTELTGDDTAWQKVQNLGYWYDVQISS 481 (515) T ss_pred HHHHHHhcCCCCccChhhHHHHHHHHHHHHHHHHHhCCeeecCcccchhHHHHHHhhhcCcccccchhhcceeEecCcCC Confidence 99999999999999999999999987 57999999999999999999999999999999999999999999999999876 Q ss_pred hCCHHHHhhcccCceEEEEEECCeEEEEEeeeeeC Q lcl|NC_019451. 470 YTNSNTGLTEWKANYTLIYSKGDAIRFVEGSDVMI 504 (504) Q Consensus 470 ~~s~~dr~~R~~~~i~~~~~~aGAIh~v~i~~~~v 504 (504) .++..+ ..|..+++.|||+++|+||+|++++|+| T Consensus 482 ~~~~~~-r~~~~~~~~~~y~~g~~i~~i~~~~~~v 515 (515) T protein:vir:10 482 FVDTGG-TTKYQAVYSLVYSKDDLIRKVVGTHTLI 515 (515) T ss_pred CCCccc-ccccCceeEEEEEcCceEEEEEeeeecC Confidence 665332 2233446789999999999999999999 No 4 >protein:vir:78611 Length: 501 # NCBI annotation: BcepNY3gp03 # Family: family:all:396 # MgeID: mge:1854 # MgeName: BcepNY3 # Cross-refs: genbank:acc:YP_001294840;genbank:gi:149882903;genbank:GeneID:5291082 Probab=100.00 E-value=1.4e-158 Score=886.07 Aligned_cols=487 Identities=23% Similarity=0.319 Sum_probs=446.6 Q ss_pred CCCccceEEEeeeecccccccccccceEEEecccccCccceEEecCHHHHHHhcCCCcHHHHHHHHHhccCCCCCcccce Q lcl|NC_019451. 
1 MISQSRYIRIISGVGAGAPVAGRKLILRVMTTNNVIPPGIVIEFDNANAVLSYFGAQSEEYQRAAAYFKFISKSVNSPSS 80 (504) Q Consensus 1 mip~s~iv~V~~~v~~~~~~~~~~~~~l~l~~~~~~~~~r~~~y~s~~~V~~~Fg~~s~ey~aA~~yF~~~~~~~~~P~~ 80 (504) =||+||||||+|+|+++++++++|+ +|+|+++.++|+||+|+|+++++|++|||.+||||+||++||++++|++|||++ T Consensus 5 ~ip~s~iV~V~~~v~~~~~~~~~~~-~lll~~~~~~~~~r~~~y~s~~~V~~~FG~~S~ey~aA~~yFs~~~~q~~~P~~ 83 (501) T protein:vir:78 5 TIPIDQIVQMLPGVIGAGGAPGRLT-GLVLTQDTSIQPGQLADFFQKTDVENWFGGLSNEAVIADAYFPGIVNGGQLPYD 83 (501) T ss_pred ccccceEEEEeeecccCCCcceeee-eEEEecCCCCCccceeeecCHHHHHHhcCCChHHHHHHHHHhhcCCCCCcccce Confidence 3899999999999999999999876 788999999999999999999999999999999999999999999999999999 Q ss_pred EEEEeeeccCCcceeeecccch-hhHhHhhccCceEEEEEcccceeeeeeccccccchHHHHHHHHhhhhccccccccee Q lcl|NC_019451. 81 ISFARWVNTAIAPMVVGDNLPK-TIADFAGFSAGVLTIMVGAAEQNITAIDTSAATSMDNVASIIQTEIRKNADPQLAQA 159 (504) Q Consensus 81 l~igr~~~~a~~~~l~g~~~~~-~~~~~~~~~~g~~titi~g~~~~~~~i~~s~ats~~~vA~~i~~~i~a~~~~~~~~a 159 (504) ||||||++++++++|+|++++. .+..|+.+ +|+|+|+|+|+ +.+.+||+|++++++++|+.|+++|+++ .+ T Consensus 84 l~igR~~~~a~~~~l~g~~l~~~~la~~~~~-~G~l~iti~g~-~~~~~i~~S~~ts~~~vA~~i~~al~a~------~~ 155 (501) T protein:vir:78 84 LKFARYVAADAPASVYGIPLTGVTLTQLQGY-SGTLTVTTAAQ-HVSSNISLAAATSFANAATLIEAAFTSP------DF 155 (501) T ss_pred EEEEeecccCcceeEeccceeccchhhhcee-eeEEEEEeccc-eeeeccccccccCHHHHHHHHHhhhcCc------ce Confidence 9999999999999999999976 57788888 69999999997 6678899999999999999999999763 46 Q ss_pred EEEEecccceeEEecccCcceeEEEEeeccchhhhhhhhcccCcc--eeecccccccHHHHHHHHHhhccceeEEEEEec Q lcl|NC_019451. 
160 TVTWNQNTNQFTLVGATIGTGVLAVAKSADPQDMSTALGWSTSNV--VNVAGQAADLPDAAVAKSTNVSNNFGSFLFAGA 237 (504) Q Consensus 160 ~vt~d~~~~~F~its~t~ga~s~~~~~sa~~~~ia~~l~~t~~~~--~~~~g~aaet~~~al~~~~~~~~~wy~~~~~~~ 237 (504) +|+||++.++|++++.++|..+++...+ .+++++..||++.+.+ +.++|.++|+|.++|+++.+.+++||+|.++++ T Consensus 156 tv~~ds~~~~f~its~t~G~~~~i~~~t-~~~~~a~~l~Lt~~~~a~v~~~g~~aet~~~a~~a~~~~~~~Wy~f~~a~~ 234 (501) T protein:vir:78 156 VVSYDALRNRFVVNTNATGTAAAISAVT-GTNNLADELGLSAAAGASLQAAGVAADTPASAMNRAVGLSRNWATFTTAWT 234 (501) T ss_pred EEEEccccceEEEEeeecCCceeEEEEe-cccchhhhhcccccCceeeEeccccccCHHHHHHHHHhccCceEEEEEecC Confidence 8999999999999999999988887766 4668899999997643 568899999999999999999999999998876 Q ss_pred cCCHHHHHHHHHHHhhcCCcEEEEEeccccch-----hHHHHHHhhhcceeEEEEeccCCCccHHHHHHHHHHhcCcCcC Q lcl|NC_019451. 238 PLDNDQIKAVSAWNAAQNNQFIYTVATSLANL-----GTLFTLVNGNAGTALNVLSATAANDFVEQCPSEILAATNYDEP 312 (504) Q Consensus 238 ~~~~~~~~a~A~w~e~~~~~~~~~~~~~d~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~aa~~g~~as~nf~~~ 312 (504) ++++|++++|+|+|+|+++|+|..+++++.. .+.+....+..++.+++..+ .+++++++++|+++++||++. T Consensus 235 -~~~~~~lalA~wiea~~~~f~~~~~~~~~~~~~~~~~~~i~~~l~a~~y~~t~~~y--~~~~~~aa~~g~~as~nf~~~ 311 (501) T protein:vir:78 235 -AVIADRLALASWNSGQAYKYMYVAPDLEPASIVTNNSASFGAQVFAAPYQGTLPLY--GDQATAGAVMGYAASINFQLR 311 (501) T ss_pred -CCHHHHHHHHHHHHhcCceEEEEEecCCcceeecccchhHHHHHhhcCCCceEEEc--CCcchHHHHHHHHHhcCcccC Confidence 6899999999999999999999999987543 23333333333444444433 378899999999999999999 Q ss_pred Cceeeeccccc-CccccccCCHHHHHHHHhCCCeEEEEEeeccceeeEEEcCEEeCCccccchhhhhhhHHHHHHHHHHH Q lcl|NC_019451. 
313 GASQNYMYYQF-PGRNITVSDDTVANTVDKSRGNYIGVTQANGQQLAFYQRGILCGGPTDAVDMNVYANEIWLKSAIAQA 391 (504) Q Consensus 313 ~g~~T~kfk~l-~Gv~a~~lt~t~~~al~~~~~n~y~~~~~~~~~~~~~~~G~~~~G~y~~~wiD~~~~~dwl~~~lq~~ 391 (504) ||++||||||+ +||+|++++++|+++|++||||||+.|++++++++|+++|+|+| +|+|||+++|+||||++||.+ T Consensus 312 ~g~~T~~fkq~~~Gv~a~~l~~t~a~al~~~~~N~y~~~~~~~~~~~~~~~G~~sG---~~~wiD~~~~~~Wl~~~iq~~ 388 (501) T protein:vir:78 312 NGRTVLAFRQFNAGVPATAHDLGTANALRSNNYTYIGAYANAANNYTIAYDGKLSG---KFLWVDTYLDQIYLNAELQRA 388 (501) T ss_pred cceeeeeccccCCCcCcccCCHHHHHHHHhcCCeEEEEEecccceeeEEEcCeeec---cceeehhhhhHHHHHHHHHHH Confidence 99999999996 89999999999999999999999999999999999999999986 567999999999999999999 Q ss_pred HHHHHhcCCCCCcCHHHHHHHHHHHHHHHHHHHhcCccccccccCcccceeeccccCCcccccceeecceEEEecchHhC Q lcl|NC_019451. 392 LLDLFLNVNAVPASSTGEAMTLAVLQPVLDKATANGTFTYGKEISAVQQQYITQVTGDRRAWRQVQTLGYWINITFSSYT 471 (504) Q Consensus 392 l~~l~~~~~kIPyt~~G~~~l~~~v~~vl~~a~~nG~Ia~G~~~~~~q~~~i~~~~g~~~~~~~~~~~GY~v~~~~~~~~ 471 (504) +++||++++|||||+.|+++|++.|+++|+||++||+|+||+|+++.|+++|+++.|++++++++++|||||++++++.+ T Consensus 389 l~~ll~~~~kIPyt~~G~~~l~a~v~~~l~~av~nG~I~~Gv~l~~~q~~~I~~~~g~~~~~~~~~~~Gyy~~~~~~~~~ 468 (501) T protein:vir:78 389 EFEAMLAYNSLPYNEDGYTALYRAGVDVIDAAVTSGIIRAGVTLTNSQLQQIDAAAGVAGAGQLVQMRGWYFLIGDPANP 468 (501) T ss_pred HHHHHHhCCCcccCHHHHHHHHHHHHHHHHHHHhCceeecCCCCCCccceeeccccCccccccceeccceEEeeccccCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999988765 Q ss_pred CHHHHhhcccCceEEEEEECCeEEEEEeeeeeC Q lcl|NC_019451. 
472 NSNTGLTEWKANYTLIYSKGDAIRFVEGSDVMI 504 (504) Q Consensus 472 s~~dr~~R~~~~i~~~~~~aGAIh~v~i~~~~v 504 (504) + ++|++|++|+|+|+|+||||||+|+|.+++| T Consensus 469 ~-~~R~~R~~p~~~~~y~~~gaIh~v~i~s~~v 500 (501) T protein:vir:78 469 G-QARQNRTTPTCTLWYSDGGSIQELTIGSNAV 500 (501) T ss_pred h-hhhhhcccCcEEEEEEeCCceeEEEeeeeec Confidence 5 7899999999999999999999999999999 No 5 >protein:vir:106730 Length: 501 # NCBI annotation: gp04 # Family: family:all:396 # MgeID: mge:1599 # MgeName: Bcep1 # Cross-refs: genbank:acc:NP_944312;genbank:gi:38638611;genbank:GeneID:2657359 Probab=100.00 E-value=4.7e-158 Score=883.09 Aligned_cols=487 Identities=25% Similarity=0.334 Sum_probs=443.3 Q ss_pred CCCccceEEEeeeecccccccccccceEEEecccccCccceEEecCHHHHHHhcCCCcHHHHHHHHHhccCCCCCcccce Q lcl|NC_019451. 1 MISQSRYIRIISGVGAGAPVAGRKLILRVMTTNNVIPPGIVIEFDNANAVLSYFGAQSEEYQRAAAYFKFISKSVNSPSS 80 (504) Q Consensus 1 mip~s~iv~V~~~v~~~~~~~~~~~~~l~l~~~~~~~~~r~~~y~s~~~V~~~Fg~~s~ey~aA~~yF~~~~~~~~~P~~ 80 (504) =||+||||||+|+|+++++++++|+ +|+|++++++|+||+|+|+|+++|++|||.+||||+||++||+++.|++|||++ T Consensus 5 ~ip~s~iV~V~~~v~~~~~~~~~f~-~lll~~~~~~~~~r~~~y~s~~~V~~~FG~~S~ey~aA~~yFsg~~~q~p~P~~ 83 (501) T protein:vir:10 5 TIPIDQIVQMLPGVIGAGGAPGRLT-GLVLTQDTSVQPGQLADFFQKTDVENWFGALSNEAKIADAYFPGIVNGGQLPYD 83 (501) T ss_pred ccccceEEEEeeecccCCCcccccc-eEEEecccCCCccceeeecCHHHHHHhcCCChHHHHHHHHHhhhhcCCCccccE Confidence 3899999999999999999999887 778899999999999999999999999999999999999999999999999999 Q ss_pred EEEEeeeccCCcceeeecccchh-hHhHhhccCceEEEEEcccceeeeeeccccccchHHHHHHHHhhhhccccccccee Q lcl|NC_019451. 81 ISFARWVNTAIAPMVVGDNLPKT-IADFAGFSAGVLTIMVGAAEQNITAIDTSAATSMDNVASIIQTEIRKNADPQLAQA 159 (504) Q Consensus 81 l~igr~~~~a~~~~l~g~~~~~~-~~~~~~~~~g~~titi~g~~~~~~~i~~s~ats~~~vA~~i~~~i~a~~~~~~~~a 159 (504) ||||||++++++++|+|++++.. +..++.+ +|+|+|+|+|+. 
...+||+|.+++|+++|+.|+++|++ ..+ T Consensus 84 l~igR~~~~~~~~~l~g~~l~~~~la~~~~~-~g~l~i~i~g~~-~~~~i~~s~ats~~~vA~~i~~al~~------~~~ 155 (501) T protein:vir:10 84 LKFARYVAADAPASVYGIPLTGITLAQLQGY-SGTLTVTTAAQH-VSANISLAAATSFANAATLIEAAFTS------PDF 155 (501) T ss_pred EEEEeecccCccceeeeceehhhhhhhhhhe-eeEEEEeeccce-eeeccccccccCHHHHHHHHHHhhcC------Cce Confidence 99999999999999999999865 5555555 699999999975 55789999999999999999999975 346 Q ss_pred EEEEecccceeEEecccCcceeEEEEeeccchhhhhhhhcccCcc--eeecccccccHHHHHHHHHhhccceeEEEEEec Q lcl|NC_019451. 160 TVTWNQNTNQFTLVGATIGTGVLAVAKSADPQDMSTALGWSTSNV--VNVAGQAADLPDAAVAKSTNVSNNFGSFLFAGA 237 (504) Q Consensus 160 ~vt~d~~~~~F~its~t~ga~s~~~~~sa~~~~ia~~l~~t~~~~--~~~~g~aaet~~~al~~~~~~~~~wy~~~~~~~ 237 (504) +|+||++.++|++++.++|..+.+...+. +.+++.+||++.+.+ +.++|.++|+|.++|+++.+++++||+|.++++ T Consensus 156 tv~~d~~~~~f~i~~~t~G~~~~i~~~t~-~~d~a~~l~Lt~~~~a~v~~~g~~aet~~~Al~a~~~~~~~Wy~f~~a~~ 234 (501) T protein:vir:10 156 VVAYDALRNRFTVVTNTTGTAAAISAVTG-TNNLADELGLSAAAGATLQAAGVAADTPASAMNRAVGLSRNWATFTTAWT 234 (501) T ss_pred EEEEecccceEEEEecccCcceeEEEeec-cccchhhhcccccCceeEEecCcccccHHHHHHHHHhcccceEEEEEEec Confidence 89999999999999999999888776654 468999999997643 568899999999999999999999999998876 Q ss_pred cCCHHHHHHHHHHHhhcCCcEEEEEeccccch-----hHHHHHHhhhcceeEEEEeccCCCccHHHHHHHHHHhcCcCcC Q lcl|NC_019451. 238 PLDNDQIKAVSAWNAAQNNQFIYTVATSLANL-----GTLFTLVNGNAGTALNVLSATAANDFVEQCPSEILAATNYDEP 312 (504) Q Consensus 238 ~~~~~~~~a~A~w~e~~~~~~~~~~~~~d~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~aa~~g~~as~nf~~~ 312 (504) ++++|++++|+|+|+++++|+|..+++++.. .+.+....+..++.+++..+ .++.++++++|+++++||++. 
T Consensus 235 -~~~~~~la~A~wi~a~~~~f~~~~~~~~~~~~~~~~~~~i~~~l~~~~y~~t~~~y--~~~~~~aa~~g~~as~nf~~~ 311 (501) T protein:vir:10 235 -AVIADRLAFAAWNSGQAYKYMYVAPDLEAASIVTNNAASFGAQVFAAPYQGTLPLY--GDQATAGAVMGYAASINFQLR 311 (501) T ss_pred -CChHHHHHHHHHHHhcCceEEEEEecCcceeeecccchhHHHHHHhcCCCceEEEC--CCCCHHHHHHHHHHhcCcccC Confidence 6899999999999999999999999987543 23333333334444444433 356788999999999999999 Q ss_pred Cceeeeccccc-CccccccCCHHHHHHHHhCCCeEEEEEeeccceeeEEEcCEEeCCccccchhhhhhhHHHHHHHHHHH Q lcl|NC_019451. 313 GASQNYMYYQF-PGRNITVSDDTVANTVDKSRGNYIGVTQANGQQLAFYQRGILCGGPTDAVDMNVYANEIWLKSAIAQA 391 (504) Q Consensus 313 ~g~~T~kfk~l-~Gv~a~~lt~t~~~al~~~~~n~y~~~~~~~~~~~~~~~G~~~~G~y~~~wiD~~~~~dwl~~~lq~~ 391 (504) ||++||||||+ +||+|++++++|+++|++||||||+.|++++++++|+++|+|+| +|.|||+++|+||||++||.+ T Consensus 312 ~g~~T~~fkql~~Gv~a~~l~~t~a~al~~~~~N~y~~~~~~~~~~~~~~~G~~sG---~~~wiD~~~g~dWl~~~iq~~ 388 (501) T protein:vir:10 312 NGRTVLAFRQFNAGVPATAHDLPTANALRSNNYTYIGAYANAANNYTIAYDGKLSG---KFLWVDTYLDQIYLNAELQRA 388 (501) T ss_pred cceeeeeecccCCCcCcccCCHHHHHHHHhcCCeEEEEEecccceeeEEEcceeec---cceehhhHhhHHHHHHHHHHH Confidence 99999999996 89999999999999999999999999999999999999999987 567999999999999999999 Q ss_pred HHHHHhcCCCCCcCHHHHHHHHHHHHHHHHHHHhcCccccccccCcccceeeccccCCcccccceeecceEEEecchHhC Q lcl|NC_019451. 
392 LLDLFLNVNAVPASSTGEAMTLAVLQPVLDKATANGTFTYGKEISAVQQQYITQVTGDRRAWRQVQTLGYWINITFSSYT 471 (504) Q Consensus 392 l~~l~~~~~kIPyt~~G~~~l~~~v~~vl~~a~~nG~Ia~G~~~~~~q~~~i~~~~g~~~~~~~~~~~GY~v~~~~~~~~ 471 (504) +++||++++|||||+.|+++|++.|+++|+||++||+|+||||+++.|+++|++++|++++++++++|||||++++++.+ T Consensus 389 l~~ll~~~~kIPyt~~G~~~l~a~i~~~l~~av~nG~Ia~Gv~l~~~q~~~i~~~~g~~~~~~~v~~~Gyy~~~~~~~~~ 468 (501) T protein:vir:10 389 EFEAMLAYNSLPYNEDGYTANYRAGVDVIDAAVTSGIIRAGVTLTNSQLQQIDAVAGVAGAGALVQTRGWYFLIGNPANP 468 (501) T ss_pred HHHHHhcCCCcccCHHHHHHHHHHHHHHHHHHHhCcceecCcccCcccceeecccccccccccceeccceEEeeCcccCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999988765 Q ss_pred CHHHHhhcccCceEEEEEECCeEEEEEeeeeeC Q lcl|NC_019451. 472 NSNTGLTEWKANYTLIYSKGDAIRFVEGSDVMI 504 (504) Q Consensus 472 s~~dr~~R~~~~i~~~~~~aGAIh~v~i~~~~v 504 (504) + ++|++|++|+|+|+|+||||||+|+|.+++| T Consensus 469 ~-~~R~~R~~p~~~~~y~~~gaIh~v~i~s~~v 500 (501) T protein:vir:10 469 G-QARQNRTSPACTLWYSDGGSIQELTIGSNAV 500 (501) T ss_pred h-hhhhhcccCceEEEEEeCCceeEEEeeeeec Confidence 5 7899999999999999999999999999999 No 6 >protein:vir:3636 Length: 501 # NCBI annotation: gp04 # Family: family:all:396 # MgeID: mge:75 # MgeName: Bcep781 # Cross-refs: genbank:acc:NP_705631;genbank:gi:23752316;genbank:GeneID:955753 Probab=100.00 E-value=8.7e-158 Score=881.65 Aligned_cols=488 Identities=23% Similarity=0.301 Sum_probs=444.1 Q ss_pred CCCccceEEEeeeecccccccccccceEEEecccccCccceEEecCHHHHHHhcCCCcHHHHHHHHHhccCCCCCcccce Q lcl|NC_019451. 
1 MISQSRYIRIISGVGAGAPVAGRKLILRVMTTNNVIPPGIVIEFDNANAVLSYFGAQSEEYQRAAAYFKFISKSVNSPSS 80 (504) Q Consensus 1 mip~s~iv~V~~~v~~~~~~~~~~~~~l~l~~~~~~~~~r~~~y~s~~~V~~~Fg~~s~ey~aA~~yF~~~~~~~~~P~~ 80 (504) =||+||||||+|+|.++++++++|+ +|+|++++++|+||+|+|+++++|++|||.+||||+||++||++++|++|||++ T Consensus 5 ~ip~s~iV~V~~~v~~~~~~~~~~~-~lllt~~~~~~~~r~~~y~s~~~V~~~FG~~S~ey~aA~~yFs~~~~q~~~P~~ 83 (501) T protein:vir:36 5 TIPIDQIVQMLPGVIGAGGAPGRLT-GLVLTQDTSVQPGQLADFFQETDVENWFGALSNEAKIADAYFPGIVNGGQLPYD 83 (501) T ss_pred CcccceEEEEeeeeccCCCcceeee-eEEEeccCCCCCcceeeecCHHHHHHhcCCChHHHHHHHHHhhcccCCCccccE Confidence 3999999999999999999999876 789999999999999999999999999999999999999999999999999999 Q ss_pred EEEEeeeccCCcceeeecccchhhHhHhhccCceEEEEEcccceeeeeeccccccchHHHHHHHHhhhhcccccccceeE Q lcl|NC_019451. 81 ISFARWVNTAIAPMVVGDNLPKTIADFAGFSAGVLTIMVGAAEQNITAIDTSAATSMDNVASIIQTEIRKNADPQLAQAT 160 (504) Q Consensus 81 l~igr~~~~a~~~~l~g~~~~~~~~~~~~~~~g~~titi~g~~~~~~~i~~s~ats~~~vA~~i~~~i~a~~~~~~~~a~ 160 (504) ||||||++++++++|+|++++....+.++..+|+|+|+|+|+. ..++||||.+++|+++|+.|+++|++ ..++ T Consensus 84 l~igR~~~~a~~~~l~g~~l~~~~~a~~~~~sg~l~vti~g~~-~~~~i~lS~~ts~~~vA~~i~~al~~------~~~t 156 (501) T protein:vir:36 84 LKFARYVAADAPASVYGIPLTGVTLAQLQGYSGTLTVTTAAQH-VSANISLAAATSFANAATLIEAAFTS------PDFV 156 (501) T ss_pred EEEEeecCcCcceeEeccchhhhhhhhccceeEEEEEEeccee-eeeecccccccCHHHHHHHHhhhhcC------cceE Confidence 9999999999999999999987654444445699999999985 56899999999999999999999975 3468 Q ss_pred EEEecccceeEEecccCcceeEEEEeeccchhhhhhhhcccCcc--eeecccccccHHHHHHHHHhhccceeEEEEEecc Q lcl|NC_019451. 161 VTWNQNTNQFTLVGATIGTGVLAVAKSADPQDMSTALGWSTSNV--VNVAGQAADLPDAAVAKSTNVSNNFGSFLFAGAP 238 (504) Q Consensus 161 vt~d~~~~~F~its~t~ga~s~~~~~sa~~~~ia~~l~~t~~~~--~~~~g~aaet~~~al~~~~~~~~~wy~~~~~~~~ 238 (504) |+||++.++|++++.++|..+.+...+. 
+++++..|+++.+.+ +.++|.++|+|.++|+++.+.++|||+|.++++ T Consensus 157 v~~d~~~~~f~i~s~t~G~~~~i~~~t~-~~~ia~~l~Lt~~~~a~v~~~g~~~et~~~al~a~~~~s~~Wy~f~~a~~- 234 (501) T protein:vir:36 157 VAYDALRNRFTVVTNATGTAAAISAVTG-TNNFADEIGLSAAAGATLQAAGVAADTPASAMNRAVGLSRNWATFTTAWT- 234 (501) T ss_pred EEEcCcceeEEEEeccCCcceeeEeeec-ccchhhhhcccccCcceEEecccccccHHHHHHHHHhccCceEEEEEecC- Confidence 9999999999999999999888777654 567999999997754 568899999999999999999999999998876 Q ss_pred CCHHHHHHHHHHHhhcCCcEEEEEeccccch-----hHHHHHHhhhcceeEEEEeccCCCccHHHHHHHHHHhcCcCcCC Q lcl|NC_019451. 239 LDNDQIKAVSAWNAAQNNQFIYTVATSLANL-----GTLFTLVNGNAGTALNVLSATAANDFVEQCPSEILAATNYDEPG 313 (504) Q Consensus 239 ~~~~~~~a~A~w~e~~~~~~~~~~~~~d~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~aa~~g~~as~nf~~~~ 313 (504) ++++|++++|+|+|+++++|+|..+++++.. .+.+....+..++.+++..++ +++++++++|+++++||++.| T Consensus 235 ~~~~~~la~A~wiea~~~~f~~~~~~~~~~~~~~~~~~~i~~~l~~~~y~~t~~~y~--~~~~~aa~~g~~as~nf~~~~ 312 (501) T protein:vir:36 235 AVIADRLAFASWNSGQAYKYMYVAPDLEAASIVSNNAASFGAQVFAAPYQGTLPLYG--DQATAGAVMGYAASINFQLRN 312 (501) T ss_pred CChHHHHHHHHHHhhcCceEEEEEecCchhhhhccchhhHHHHHHhcCCCcEEEEcC--CCCHHHHHHHHHHhcCcccCc Confidence 6899999999999999999999999987543 233433344444444444333 567888999999999999999 Q ss_pred ceeeeccccc-CccccccCCHHHHHHHHhCCCeEEEEEeeccceeeEEEcCEEeCCccccchhhhhhhHHHHHHHHHHHH Q lcl|NC_019451. 
314 ASQNYMYYQF-PGRNITVSDDTVANTVDKSRGNYIGVTQANGQQLAFYQRGILCGGPTDAVDMNVYANEIWLKSAIAQAL 392 (504) Q Consensus 314 g~~T~kfk~l-~Gv~a~~lt~t~~~al~~~~~n~y~~~~~~~~~~~~~~~G~~~~G~y~~~wiD~~~~~dwl~~~lq~~l 392 (504) |++||||||+ +||+|++++++|+++|++||||||+.|++++++++|+++|+|+| +|.|||++||+||||++||+++ T Consensus 313 g~~T~~fkq~~~Gi~a~~l~~t~a~al~~~~~N~y~~~~~~~~~~~~~~~G~~sG---~~~wiD~~~g~dWL~~~iq~~l 389 (501) T protein:vir:36 313 GRTVLAFRQFNAGVPATVHDLPTANALRSNNYTYIGAYANAANNYTIAYDGKLSG---KFLWVDTYLDQIYLNAELQRAE 389 (501) T ss_pred ceeeeeccccCCCcCcCcCCHHHHHHHHhcCCcEEEEEecccceeeEEEcCeeec---cchhhhHHHhHHHHHHHHHHHH Confidence 9999999997 89999999999999999999999999999999999999999987 5679999999999999999999 Q ss_pred HHHHhcCCCCCcCHHHHHHHHHHHHHHHHHHHhcCccccccccCcccceeeccccCCcccccceeecceEEEecchHhCC Q lcl|NC_019451. 393 LDLFLNVNAVPASSTGEAMTLAVLQPVLDKATANGTFTYGKEISAVQQQYITQVTGDRRAWRQVQTLGYWINITFSSYTN 472 (504) Q Consensus 393 ~~l~~~~~kIPyt~~G~~~l~~~v~~vl~~a~~nG~Ia~G~~~~~~q~~~i~~~~g~~~~~~~~~~~GY~v~~~~~~~~s 472 (504) ++||++++|||||+.|+++|+++|+++|+||++||+|+||+|+++.|+++|+++.|++++++++++|||||++++++ ++ T Consensus 390 ~~ll~~~~KIPytd~G~~~l~a~i~~~l~~av~nG~I~~Gv~~~~~q~~~I~~~~g~~~~~~~v~~~Gyy~~~~~~~-~~ 468 (501) T protein:vir:36 390 FEAMLAYNSLPYNEDGYTGLYRAGVDVIDAAVTSGIIRAGVTLTNSQLQQIDKAARVAGAGQLVQTRGWYFLIGDPA-NP 468 (501) T ss_pred HHHHhcCCCCccChhhHHHHHHHHHHHHHHHHhCceeecCCCCCcccceeecccccccccccceeccceEEeeCccc-CC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999998877 46 Q ss_pred HHHHhhcccCceEEEEEECCeEEEEEeeeeeC Q lcl|NC_019451. 
473 SNTGLTEWKANYTLIYSKGDAIRFVEGSDVMI 504 (504) Q Consensus 473 ~~dr~~R~~~~i~~~~~~aGAIh~v~i~~~~v 504 (504) ++||++|++|+|+|+|+||||||+|+|.+++| T Consensus 469 ~~~R~~R~~p~~~~~y~~~gaIh~v~i~s~~v 500 (501) T protein:vir:36 469 GQARQNRTTPACTLWYSDGGSIQSLTIGSNAV 500 (501) T ss_pred hhhhhhcccCcEEEEEEeCCceeEEEeeeeee Confidence 67999999999999999999999999999999 No 7 >protein:vir:101576 Length: 501 # NCBI annotation: gp04 # Family: family:all:396 # MgeID: mge:1477 # MgeName: Bcep43 # Cross-refs: genbank:acc:NP_958108;genbank:gi:41057654;genbank:GeneID:2716834 Probab=100.00 E-value=9.1e-157 Score=876.05 Aligned_cols=487 Identities=24% Similarity=0.326 Sum_probs=444.7 Q ss_pred CCCccceEEEeeeecccccccccccceEEEecccccCccceEEecCHHHHHHhcCCCcHHHHHHHHHhccCCCCCcccce Q lcl|NC_019451. 1 MISQSRYIRIISGVGAGAPVAGRKLILRVMTTNNVIPPGIVIEFDNANAVLSYFGAQSEEYQRAAAYFKFISKSVNSPSS 80 (504) Q Consensus 1 mip~s~iv~V~~~v~~~~~~~~~~~~~l~l~~~~~~~~~r~~~y~s~~~V~~~Fg~~s~ey~aA~~yF~~~~~~~~~P~~ 80 (504) =||+||||||+|+|+++++++++|+ +|||++++++|++|++.|+|++||++|||.+||||+||++||+++.|++|||++ T Consensus 5 ~ip~s~iV~V~~~v~~~~~~~~~~~-~l~l~~~~~~~~~~~~~~~s~~~V~~~FG~~S~ey~aA~~yFsg~~~q~p~P~~ 83 (501) T protein:vir:10 5 TIPIDQIVQMLPGVIGAGGAPGRLT-GLVLTQDTSVQPGQLADFFQETDVENWFGALSNEAKIADAYFPGIVNGGQLPYD 83 (501) T ss_pred CcccceEEEEeeecccCCCccccce-eEEEeccCCCCccceEEecCHHHHHHhcCCChHHHHHHHHHhhhhcCCCccccE Confidence 3899999999999999999998875 789999999999999999999999999999999999999999999999999999 Q ss_pred EEEEeeeccCCcceeeecccchh-hHhHhhccCceEEEEEcccceeeeeeccccccchHHHHHHHHhhhhccccccccee Q lcl|NC_019451. 81 ISFARWVNTAIAPMVVGDNLPKT-IADFAGFSAGVLTIMVGAAEQNITAIDTSAATSMDNVASIIQTEIRKNADPQLAQA 159 (504) Q Consensus 81 l~igr~~~~a~~~~l~g~~~~~~-~~~~~~~~~g~~titi~g~~~~~~~i~~s~ats~~~vA~~i~~~i~a~~~~~~~~a 159 (504) ||||||++++++++|+|++++.. +.+++.+ +|+|+|+|+|+. 
...+||||.+++|+++|+.|+++|++ ..+ T Consensus 84 l~igR~~~~~~~~~l~g~~l~~~~la~~~~~-sg~l~vti~g~~-~~~~i~ls~ats~~~vAs~i~~al~~------~~~ 155 (501) T protein:vir:10 84 LKFARYVAADAPASVYGIPLTGVTLAQLQGY-SGTLTVTTAAQH-VSANISLAAATSFANAATLIEAAFTS------PDF 155 (501) T ss_pred EEEEeecCCCccceEeccchhhhhhhhccee-eeEEEEeeccce-eecccccccccCHHHHHHHHhhhccC------Cce Confidence 99999999999999999999865 6777777 599999999985 55789999999999999999999976 347 Q ss_pred EEEEecccceeEEecccCcceeEEEEeeccchhhhhhhhcccCcc--eeecccccccHHHHHHHHHhhccceeEEEEEec Q lcl|NC_019451. 160 TVTWNQNTNQFTLVGATIGTGVLAVAKSADPQDMSTALGWSTSNV--VNVAGQAADLPDAAVAKSTNVSNNFGSFLFAGA 237 (504) Q Consensus 160 ~vt~d~~~~~F~its~t~ga~s~~~~~sa~~~~ia~~l~~t~~~~--~~~~g~aaet~~~al~~~~~~~~~wy~~~~~~~ 237 (504) +|+||++.++|++++.++|..+++...+ .+++++..|||+.+.+ +.++|.++|+|.++|+++.+.+++||+|.++++ T Consensus 156 tv~~d~~~~~f~its~ttG~~~~i~~~~-~~~~la~~l~Lt~~~~a~v~~~g~~aet~~~a~~a~~~~~~~Wy~f~~a~~ 234 (501) T protein:vir:10 156 VVAYDALRNRFTVVTNATGTAAAISAVT-GTNNLADELGLSAAAGATLQAAGVAADTPASAMNRAVGLSRNWATFTTAWT 234 (501) T ss_pred EEEEcccCceEEEEeeccCCceeEEEee-CchhhhhhcCccccccceEEecCcccccHHHHHHHHHhccCceEEEEEecC Confidence 8999999999999999999988877665 4678999999997643 568899999999999999999999999998876 Q ss_pred cCCHHHHHHHHHHHhhcCCcEEEEEeccccchh-----HHHHHHhhhcceeEEEEeccCCCccHHHHHHHHHHhcCcCcC Q lcl|NC_019451. 238 PLDNDQIKAVSAWNAAQNNQFIYTVATSLANLG-----TLFTLVNGNAGTALNVLSATAANDFVEQCPSEILAATNYDEP 312 (504) Q Consensus 238 ~~~~~~~~a~A~w~e~~~~~~~~~~~~~d~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~aa~~g~~as~nf~~~ 312 (504) ++++|++++|+|+|+|+++|+|..+++++... ..+....+..++.+.+..+ .+++++++++|+++++||++. 
T Consensus 235 -~~~~~~la~A~wiea~~~~f~~~~~~~~~~~~~~~~~~~i~~~l~~~~y~~t~~~y--~~~~~~aa~~g~~as~nf~~~ 311 (501) T protein:vir:10 235 -AVIADRLAFAAWNSGQAYKYMYVAPDLEAASIVTNNAASFGAQVFAAPYQGTLPLY--GDQATAGAVMGYAASINFQLR 311 (501) T ss_pred -CChHHHHHHHHHHHhcCceEEEEEecCchhhhhhhhhhhHHHHHHhcCCCceEEEC--CCCcHHHHHHHHHHhhCcccC Confidence 68999999999999999999999999875432 2233333334444444443 467899999999999999999 Q ss_pred CceeeecccccC-ccccccCCHHHHHHHHhCCCeEEEEEeeccceeeEEEcCEEeCCccccchhhhhhhHHHHHHHHHHH Q lcl|NC_019451. 313 GASQNYMYYQFP-GRNITVSDDTVANTVDKSRGNYIGVTQANGQQLAFYQRGILCGGPTDAVDMNVYANEIWLKSAIAQA 391 (504) Q Consensus 313 ~g~~T~kfk~l~-Gv~a~~lt~t~~~al~~~~~n~y~~~~~~~~~~~~~~~G~~~~G~y~~~wiD~~~~~dwl~~~lq~~ 391 (504) ||++||||||++ ||+|++++++|+++|++||||||+.|++++++++|+++|+|+| +|+|||+++|+||||++||.+ T Consensus 312 ~g~~T~~fkq~~~Gi~a~~lt~t~a~al~~~~~N~y~~~~~~~~~~~~~~~G~~sG---~~~wiD~~~~~~Wl~~~iq~~ 388 (501) T protein:vir:10 312 NGRTVLAFRQFNAGVPATAHDLPTANALRSNNYTYIGAYANAANNYTIAYDGKLSG---KFLWVDTYLDQIYLNAELQRA 388 (501) T ss_pred ccceeeeccccCCCcCcccCCHHHHHHHHhcCCeEEEEeccccceeeEEecCeeec---cceeehhhhhHHHHHHHHHHH Confidence 999999999986 9999999999999999999999999999999999999999986 567999999999999999999 Q ss_pred HHHHHhcCCCCCcCHHHHHHHHHHHHHHHHHHHhcCccccccccCcccceeeccccCCcccccceeecceEEEecchHhC Q lcl|NC_019451. 392 LLDLFLNVNAVPASSTGEAMTLAVLQPVLDKATANGTFTYGKEISAVQQQYITQVTGDRRAWRQVQTLGYWINITFSSYT 471 (504) Q Consensus 392 l~~l~~~~~kIPyt~~G~~~l~~~v~~vl~~a~~nG~Ia~G~~~~~~q~~~i~~~~g~~~~~~~~~~~GY~v~~~~~~~~ 471 (504) +++||.+++|||||+.|+++|++.|+++|+||++||+|+||+|+++.|+++|++++|++++++++++|||||++++++. 
T Consensus 389 l~~ll~~~~kIPyt~~G~~~l~a~v~~~l~~av~nG~I~~Gv~l~~~q~~~i~~~~g~~~~~~~v~~~Gyy~~~~~~~~- 467 (501) T protein:vir:10 389 EFEAMLAYNSLPYNEDGYTALYRAGVDVIDAAVTSGIIRAGVTLTNSQLQQIDAAAGVAGAGQLVQTRGWYFLIGDPAN- 467 (501) T ss_pred HHHHHHhcCCcccCHHHHHHHHHHHHHHHHHHHhCceeecCCCCCcccceeeccccCccccccceeccceeEeeccccC- Confidence 9999999999999999999999999999999999999999999999999999999999999999999999999988775 Q ss_pred CHHHHhhcccCceEEEEEECCeEEEEEeeeeeC Q lcl|NC_019451. 472 NSNTGLTEWKANYTLIYSKGDAIRFVEGSDVMI 504 (504) Q Consensus 472 s~~dr~~R~~~~i~~~~~~aGAIh~v~i~~~~v 504 (504) +++||++|++|+|+|+|+||||||+|+|.+++| T Consensus 468 ~~~~R~~R~~p~~~~~y~~~gaIh~v~i~s~~v 500 (501) T protein:vir:10 468 PGQARQNRTTPACTLWYSDGGSIQQLTIGSNAV 500 (501) T ss_pred ChhhhhhccccceEEEEEeCCceeEEEeeeeec Confidence 557999999999999999999999999999999 No 8 >protein:vir:94073 Length: 494 # NCBI annotation: hypothetical protein # Family: family:all:396 # MgeID: mge:1493 # MgeName: OP2 # Cross-refs: genbank:acc:YP_453619;genbank:gi:84662655;genbank:GeneID:5142583 Probab=100.00 E-value=4.2e-150 Score=839.53 Aligned_cols=483 Identities=21% Similarity=0.233 Sum_probs=433.8 Q ss_pred CCCccceEEEeeeecccccccccccceEEEecccccCccceEEecCHHHHHHhcCCCcHHHHHHHHHhccCCCCCcccce Q lcl|NC_019451. 
1 MISQSRYIRIISGVGAGAPVAGRKLILRVMTTNNVIPPGIVIEFDNANAVLSYFGAQSEEYQRAAAYFKFISKSVNSPSS 80 (504) Q Consensus 1 mip~s~iv~V~~~v~~~~~~~~~~~~~l~l~~~~~~~~~r~~~y~s~~~V~~~Fg~~s~ey~aA~~yF~~~~~~~~~P~~ 80 (504) =||+||||||+|+|+++++++++|+.+||++ +..+|+||+|+|+++++|++|||.+||||+||++||+++.|++|||++ T Consensus 3 ~ip~s~iV~V~~~v~~~~~~~~~f~~~l~~~-~~~~~~~r~~~y~s~~~V~~~FG~~S~ey~aA~~yFs~~~~q~p~P~~ 81 (494) T protein:vir:94 3 NIPISQIVSINPQVVSAGGTQGTLDGLLLTQ-ATGFPVTQPQVYFSAADVGTAFGLTSDEYNAALVYFAGILGGGQQPAS 81 (494) T ss_pred CCCcccEEEeeeeccccCCcccccceeEeec-CccCCccceeeecCHHHHHHhcCCChHHHHHHHHHhhhccCCCccccE Confidence 3999999999999999999999888666554 556788999999999999999999999999999999999999999999 Q ss_pred EEEEeeeccCCcceeeecccchhhHhHhhccCceEEEEEcccceeeeeeccccccchHHHHHHHHhhhhcccccccceeE Q lcl|NC_019451. 81 ISFARWVNTAIAPMVVGDNLPKTIADFAGFSAGVLTIMVGAAEQNITAIDTSAATSMDNVASIIQTEIRKNADPQLAQAT 160 (504) Q Consensus 81 l~igr~~~~a~~~~l~g~~~~~~~~~~~~~~~g~~titi~g~~~~~~~i~~s~ats~~~vA~~i~~~i~a~~~~~~~~a~ 160 (504) ||||||++++++++|+|++++.++.+++.+ +|+|+|+|+| .+++++||||.+++++++|+.|++++++ .+++ T Consensus 82 l~igR~~~~a~~~~l~g~~~~~tl~~~~~~-~g~l~iti~g-~~~~~~i~lS~~ts~~~vA~~i~~ai~~------a~~~ 153 (494) T protein:vir:94 82 LTIGRYASAATSAAVFGAPLTLSLAQLQTL-SGTLIVTTDT-QRTSAAINLSGATSFANAASLMTSGFTT------PNFA 153 (494) T ss_pred EEEEeecCccccceeeccchhhhHHhhhhc-ceEEEEEEcc-eEEEeeecccccCChhhHHHHHhhhhcc------ccce Confidence 999999999999999999999999999998 7999999999 4789999999999999999999999975 3568 Q ss_pred EEEecccceeEEecccCcceeEEEEeeccchhhhhhhhcccCc--ceeecccccccHHHHHHHHHhhccceeEEEEEecc Q lcl|NC_019451. 161 VTWNQNTNQFTLVGATIGTGVLAVAKSADPQDMSTALGWSTSN--VVNVAGQAADLPDAAVAKSTNVSNNFGSFLFAGAP 238 (504) Q Consensus 161 vt~d~~~~~F~its~t~ga~s~~~~~sa~~~~ia~~l~~t~~~--~~~~~g~aaet~~~al~~~~~~~~~wy~~~~~~~~ 238 (504) |+||++.++|++++.++|+.+.+...+ .+++..|+++... 
.+.++|.++|+|.++|+++.+.+++||+|.++++ T Consensus 154 v~~d~~~~~f~v~s~ttG~~s~is~~t---~~~a~~l~lt~~~~a~v~~~g~~aet~~~a~~a~~~~~~~Wy~f~~~~~- 229 (494) T protein:vir:94 154 ITYDAQRRRFVLSTTATGTTASVSAVT---GTLADGVGLSTASGAYVEGSGLAADTAASALDRLAASSSTWAIFTTAWA- 229 (494) T ss_pred EEEcccCcEEEEEEccCCceeEEEEec---cchhhhhhhhccccceEeecCcccccHHHHHHHHHhccCceEEEEEecC- Confidence 999999999999999999888766554 4689999998654 3567899999999999999999999999999876 Q ss_pred CCHHHHHHHHHHHhhcCCcEEEEEeccccch-----hHHHHHHhhhcceeEEEEeccCCCccHHHHHHHHHHhcCcCcCC Q lcl|NC_019451. 239 LDNDQIKAVSAWNAAQNNQFIYTVATSLANL-----GTLFTLVNGNAGTALNVLSATAANDFVEQCPSEILAATNYDEPG 313 (504) Q Consensus 239 ~~~~~~~a~A~w~e~~~~~~~~~~~~~d~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~aa~~g~~as~nf~~~~ 313 (504) .+++|++++|+|+|+++++|+|+.+++|+.. ...+....+..++.+++..++ ++.++++++|++++.||+..+ T Consensus 230 ~~~~~ilalA~wiea~~~~~~~~~~~~d~~~~~~~~~~~i~~~l~~~~y~~t~~~y~--~~~~~aa~~g~~aa~~~~~~~ 307 (494) T protein:vir:94 230 ASLSDRTALAQWTSDQVFRRIYAAWDQDAAGLSVNNVSSFGNIVKTTPFSNTIPVYG--LLANAMIVLAWGASTNLQIAE 307 (494) T ss_pred CCHHHHHHHHHHHhhcCccEEEEEecCCcceeecccchhHHHHHHhhcCCceEEEcC--CCChHHHHHHHHHhccccccC Confidence 6899999999999999999999999987543 233333444444444444443 344678999999999999999 Q ss_pred ceeeeccc-ccCccccccCCHHHHHHHHhCCCeEEEEEeeccceeeEEEcCEEeCCccccchhhhhhhHHHHHHHHHHHH Q lcl|NC_019451. 
314 ASQNYMYY-QFPGRNITVSDDTVANTVDKSRGNYIGVTQANGQQLAFYQRGILCGGPTDAVDMNVYANEIWLKSAIAQAL 392 (504) Q Consensus 314 g~~T~kfk-~l~Gv~a~~lt~t~~~al~~~~~n~y~~~~~~~~~~~~~~~G~~~~G~y~~~wiD~~~~~dwl~~~lq~~l 392 (504) |++||||| +++|++|++++++|+++|++||||||+.|.+.++++.|+++|+ ++|+ |.|||+++|+||||++||.+| T Consensus 308 g~~T~~~k~q~~gi~~~~l~~t~a~al~~~~~N~y~~~~~~~~~~~~~~gg~-~sG~--~~~id~~~~~~WL~~~iq~~l 384 (494) T protein:vir:94 308 GRTTLALRSPVSSAGVRVDNLANANALLSNGYTYLGKYASATNTYTVTYNGA-IGGQ--FLWADTALGWIALRRNLQQAL 384 (494) T ss_pred cceeEEeeccCCCCCCccCCHHHHHHHHhcCCeEEEEecccCceEEEecCce-eccc--cceeeeeccHHHHHHHHHHHH Confidence 99999999 6899999999999999999999999999999998888877775 5674 579999999999999999999 Q ss_pred HHHHhcCCCCCcCHHHHHHHHHHHHHHHHHHHhcCccccccccCcccceeeccccCCcccccceeecceEEEecchHhCC Q lcl|NC_019451. 393 LDLFLNVNAVPASSTGEAMTLAVLQPVLDKATANGTFTYGKEISAVQQQYITQVTGDRRAWRQVQTLGYWINITFSSYTN 472 (504) Q Consensus 393 ~~l~~~~~kIPyt~~G~~~l~~~v~~vl~~a~~nG~Ia~G~~~~~~q~~~i~~~~g~~~~~~~~~~~GY~v~~~~~~~~s 472 (504) ++||.+++|||||+.|+++|+++|+++|+||++||+|+||+|+++.|+++|+++.|.+.. +++++||||+++- ..++ T Consensus 385 ~~ll~~~~KIPytd~G~~~l~a~i~~~l~~av~nG~I~~Gv~~~~~q~~~i~~~~G~~~~-~~~~~kGyy~~~~--~~~s 461 (494) T protein:vir:94 385 FETLLAYRSLPYNADGYNALYQGAQDVVSQFVAAGVIRAGVALSASQRAQIDQAAGVPIS-GDVVDKGWYLQVI--DPIT 461 (494) T ss_pred HHHHHhCCCcccChhhHHHHHHHHHHHHHHHHhCceeecccccCcchhhhhhhhhcCccc-cceeccceeeecc--CCCC Confidence 999999999999999999999999999999999999999999999999999999998755 8999999999972 4577 Q ss_pred HHHHhhcccCceEEEEEECCeEEEEEeeeeeC Q lcl|NC_019451. 
473 SNTGLTEWKANYTLIYSKGDAIRFVEGSDVMI 504 (504) Q Consensus 473 ~~dr~~R~~~~i~~~~~~aGAIh~v~i~~~~v 504 (504) +++|++|.+|+++|+|++|||||+|+|+||+| T Consensus 462 ~~~ra~R~~~~~~~~y~~~GAIh~v~i~~~~v 493 (494) T protein:vir:94 462 TTVRTDRGSPTVNFWYCDGGSIQRVVVSATTV 493 (494) T ss_pred hhhhhccccCCceEEEEecCcEEEEEEeeEEe Confidence 89999999999999999999999999999999 No 9 >protein:vir:5260 Length: 502 # NCBI annotation: hypothetical protein # Family: family:all:396 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852765;genbank:gi:31544040;uniprot:Q7Y5T5;genbank:GeneID:2753559 Probab=100.00 E-value=3.5e-143 Score=801.55 Aligned_cols=478 Identities=14% Similarity=0.167 Sum_probs=420.7 Q ss_pred CCCccceEEEeeeecccccccccccceEEEeccc---ccC-ccceEEecCHHHHHHhcCCCcHHHHHHHHHhccCCCCCc Q lcl|NC_019451. 1 MISQSRYIRIISGVGAGAPVAGRKLILRVMTTNN---VIP-PGIVIEFDNANAVLSYFGAQSEEYQRAAAYFKFISKSVN 76 (504) Q Consensus 1 mip~s~iv~V~~~v~~~~~~~~~~~~~l~l~~~~---~~~-~~r~~~y~s~~~V~~~Fg~~s~ey~aA~~yF~~~~~~~~ 76 (504) =||+||||||+|++.+.++.+++|+.+|||++++ +++ .||+|.|+|+++|++|||++|||||||++||+|+| T Consensus 2 sip~s~ivnV~i~~~~~a~~~~~f~~~l~l~~~~~~~~~~~~~r~~~~~s~~~V~~~FG~~s~ey~aA~~yF~q~p---- 77 (502) T protein:vir:52 2 ALSISHIVNVQLNTVPKSAARKSFGIVALFTPEAGQAFADEKTRYVYVENQRDVEQLFGTNSETAKAAQPFFAQSP---- 77 (502) T ss_pred CCCccceeEEeeccccccccccccCceEEEeeccCccccCCccceEEecCHHHHHHhcCCChHHHHHHHHHhcCCC---- Confidence 5999999999999999999999999999997654 555 49999999999999999999999999999999988 Q ss_pred ccceEEEEeeeccCCcceeeecccc-----hhhHhHhhccCceEEEEEcccceeeeeeccccccchHHHHHHHHhhhhcc Q lcl|NC_019451. 77 SPSSISFARWVNTAIAPMVVGDNLP-----KTIADFAGFSAGVLTIMVGAAEQNITAIDTSAATSMDNVASIIQTEIRKN 151 (504) Q Consensus 77 ~P~~l~igr~~~~a~~~~l~g~~~~-----~~~~~~~~~~~g~~titi~g~~~~~~~i~~s~ats~~~vA~~i~~~i~a~ 151 (504) ||++||||||+++++.++++|+++. ..+.+++.+++|+|+|+|+|+.+++++||+|.+++++++|+.|++++++. 
T Consensus 78 ~P~~l~igR~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~G~l~i~i~g~~~t~~~i~lS~~ts~~~vA~~i~~~l~~~ 157 (502) T protein:vir:52 78 RAKQLIVARWQKSASTIEATKNTLSGATLSDDLERFKSVVNGRFSLTIGGDVKKVDGLSFARLADFNAVATKIQEKLTTL 157 (502) T ss_pred ccceEEEEeccccccceeechhhhhhhhhHHhHHHhhhhcCceeEEEecceeeeeeccccccccchhHHHHHHHhhhccc Confidence 9999999999999998887766654 45678889999999999999999999999999999999999999999864 Q ss_pred cccccceeEEEEecccceeEEecccCcceeEEEE--e---eccchhhhhhhhcccCcc-ee----ecccccccHHHHHHH Q lcl|NC_019451. 152 ADPQLAQATVTWNQNTNQFTLVGATIGTGVLAVA--K---SADPQDMSTALGWSTSNV-VN----VAGQAADLPDAAVAK 221 (504) Q Consensus 152 ~~~~~~~a~vt~d~~~~~F~its~t~ga~s~~~~--~---sa~~~~ia~~l~~t~~~~-~~----~~g~aaet~~~al~~ 221 (504) .. .++|+||.+.++|+++++++|..+.+.. . .+++++++.+++++.... +. .+|.++|+|.++|++ T Consensus 158 ~~----~~tv~~d~~~~~F~i~s~ttg~~~~~~~~~a~~~~~~gt~~a~~l~l~~~~~av~v~~~~~g~~aet~~~al~a 233 (502) T protein:vir:52 158 SV----AVSIAYDETGNRFIVSANVAGEDKKTEIDYAIDEGGEGEYIGALLKLENGQASRKVGKNSVSLKKETLGEALFN 233 (502) T ss_pred cc----ceEEEEecCCceEEEEeccCCCcceeEEEEeecCCcchhHHHHHhccccccceeeeeeecccccccCHHHHHHH Confidence 32 4789999999999999999887655432 2 235688999999986543 33 357789999999999 Q ss_pred HHhhccceeEEEEEeccCCHHHHHHHHHHHhhcCCcEEEEEeccccch--hHHHHHHhhhcceeEEEEeccCCCccHHHH Q lcl|NC_019451. 222 STNVSNNFGSFLFAGAPLDNDQIKAVSAWNAAQNNQFIYTVATSLANL--GTLFTLVNGNAGTALNVLSATAANDFVEQC 299 (504) Q Consensus 222 ~~~~~~~wy~~~~~~~~~~~~~~~a~A~w~e~~~~~~~~~~~~~d~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~aa 299 (504) +.+++++||+|.++++ .++++++++|+|+|+++|+|+|..++.+... 
........+..++.+++..++..++|++++ T Consensus 234 ~~~~~~~w~~~~~a~~-~~~~~~la~a~~iea~~~~f~~~~~d~~~~~~~~~~i~~~l~a~~~~~t~~~y~~~~~~~~aa 312 (502) T protein:vir:52 234 VAEVNNTWYGFTVAAQ-LTDSEVEAAAKYAQANTKLFGANVIRAEQIEWSADNIYKKLYDAGLDHTLAMFDKNDMYPVSS 312 (502) T ss_pred HHhccCceEEEEEeec-CChhHHHHHHHHHhhcCcEEEEEecCcceeccccchHHHHHHhccCceeEEEecCCcchhHHH Confidence 9999999999999876 5899999999999999999988777665432 122222333344444555555678999999 Q ss_pred HHHHHHhcCcCcCCceeeecccccCccccccCCHHHHHHHHhCCCeEEEEEeeccceeeEEEcCEEeCCccccchhhhhh Q lcl|NC_019451. 300 PSEILAATNYDEPGASQNYMYYQFPGRNITVSDDTVANTVDKSRGNYIGVTQANGQQLAFYQRGILCGGPTDAVDMNVYA 379 (504) Q Consensus 300 ~~g~~as~nf~~~~g~~T~kfk~l~Gv~a~~lt~t~~~al~~~~~n~y~~~~~~~~~~~~~~~G~~~~G~y~~~wiD~~~ 379 (504) ++|+++++||++.||++|||||+++||+|++++++|+++|++||||||+.+.+ +.++++|+|++|+ |||++| T Consensus 313 ~~g~~as~~f~~~~g~iT~~fk~l~GV~~~~lt~t~~~al~~~~~N~y~~~~~----~~~~~~G~~~~G~----~iD~~~ 384 (502) T protein:vir:52 313 ALARLLSTNFAANNSTLTLKFKQQPTITADEITATEFAKAKRLGINVYTYFDD----VAMIAEGTVIGGK----FADEIV 384 (502) T ss_pred HHHHHHhcCCCcCcceeeecccccCCcccCcCCHHHHHHHHhcCceEEEEecC----eeEEecCeeeCCc----hhhHHH Confidence 99999999999999999999999999999999999999999999999998843 5689999999997 699999 Q ss_pred hHHHHHHHHHHHHHHHH-hcCCCCCcCHHHHHHHHHHHHHHHHHHHhcCccccccccCcccceeeccccCCcccccceee Q lcl|NC_019451. 380 NEIWLKSAIAQALLDLF-LNVNAVPASSTGEAMTLAVLQPVLDKATANGTFTYGKEISAVQQQYITQVTGDRRAWRQVQT 458 (504) Q Consensus 380 ~~dwl~~~lq~~l~~l~-~~~~kIPyt~~G~~~l~~~v~~vl~~a~~nG~Ia~G~~~~~~q~~~i~~~~g~~~~~~~~~~ 458 (504) |+|||+++||++|+++| ++++|||||+.|+++|+++|+++|+||++||+|+||+|+++.|.. 
+..+|+++ T Consensus 385 ~~~Wl~~~lq~~l~~~L~~s~~kIPy~~~G~~~l~a~i~~~l~~a~~~G~I~~G~~~~~~~g~---------~~~~d~~~ 455 (502) T protein:vir:52 385 ILDWFVDAVQKEVFARLYKSPTKIPLTDKGQAILIAAVEKVCLEGINNGAFAPGKWTGAGFGN---------LSTGDYLD 455 (502) T ss_pred HHHHHHHHHHHHHHHHHHhcCCCcccChhHHHHHHHHHHHHHHHHHhcCccccccccCcccce---------eeeccccc Confidence 99999999999999976 567899999999999999999999999999999999999987644 34467889 Q ss_pred cceEEEecchHhCCHHHHhhcccCceEEEEEECCeEEEEEeeeeeC Q lcl|NC_019451. 459 LGYWINITFSSYTNSNTGLTEWKANYTLIYSKGDAIRFVEGSDVMI 504 (504) Q Consensus 459 ~GY~v~~~~~~~~s~~dr~~R~~~~i~~~~~~aGAIh~v~i~~~~v 504 (504) +||||++|++++|+++||++|++|+++|+|+++||||.|+|+++.+ T Consensus 456 ~gy~v~~~~~~~~s~~dr~~R~~~~~~~~~~~aGaIh~v~i~~nv~ 501 (502) T protein:vir:52 456 KGFYVWAAPMDTLSDSDRQARRATPIQTAVKLAGAIHSSDVIVNYN 501 (502) T ss_pred CceEEEeCchhhCCHHHHHcccCCCeEEEEEECceEEEEEEEEEEe Confidence 9999999999999999999999999999999999999999999999 No 10 >protein:vir:95263 Length: 450 # NCBI annotation: Phage conserved protein # Family: family:all:396 # MgeID: mge:1561 # MgeName: Felix 01 # Cross-refs: genbank:acc:NP_944896;genbank:gi:38707836;genbank:GeneID:2744049 Probab=100.00 E-value=3.4e-118 Score=664.58 Aligned_cols=430 Identities=14% Similarity=0.137 Sum_probs=367.9 Q ss_pred CccceEEEeeeecccccccccccceEEEecccccCccceEEecCHHHHHHhcCCCcHHHHHHHHHhccCCCCCcccceEE Q lcl|NC_019451. 
3 SQSRYIRIISGVGAGAPVAGRKLILRVMTTNNVIPPGIVIEFDNANAVLSYFGAQSEEYQRAAAYFKFISKSVNSPSSIS 82 (504) Q Consensus 3 p~s~iv~V~~~v~~~~~~~~~~~~~l~l~~~~~~~~~r~~~y~s~~~V~~~Fg~~s~ey~aA~~yF~~~~~~~~~P~~l~ 82 (504) --||||||+|+|.++++.+++|+.+||++++.++ .||+|.|+++++|++|||.+|||||||++||+|.| +|.+|| T Consensus 1 ~~s~iVnV~i~~~~~a~~~~~f~~~l~~~~~~~~-~~r~~~yss~~~V~~~FG~~S~ey~aA~~yF~q~p----~p~~l~ 75 (450) T protein:vir:95 1 MWNPIVNVDITLNTAGTTREGFGLPLFLASTDNF-EERVRGYTSLTEVAEDFDENTAAYKAAKQLWSQTP----KVTQLY 75 (450) T ss_pred CCCceEEEeecccccccccccceeEEEEcCCCCC-ccceeeecCHHHHHHhcCCCcHHHHHHHHHHhCCC----cccEEE Confidence 4799999999999999999999999999999864 69999999999999999999999999999999988 999999 Q ss_pred EEeeeccCCcceeeecccchhhHhHhhccCceEEEEEcccceeeeeeccccccchHHHHHHHHhhhhcccccccceeEEE Q lcl|NC_019451. 83 FARWVNTAIAPMVVGDNLPKTIADFAGFSAGVLTIMVGAAEQNITAIDTSAATSMDNVASIIQTEIRKNADPQLAQATVT 162 (504) Q Consensus 83 igr~~~~a~~~~l~g~~~~~~~~~~~~~~~g~~titi~g~~~~~~~i~~s~ats~~~vA~~i~~~i~a~~~~~~~~a~vt 162 (504) ||||+++++++.+.+ ++..++|+|+++|+|+.+.+..+|++.+++++++|+.+++++.+. .+. T Consensus 76 igr~~~~~t~~~~~~---------~~~~~~g~lt~tv~G~~~~~~~i~~s~a~s~~~va~~~~tai~~~--------~~~ 138 (450) T protein:vir:95 76 IGRRAMQYTVSIPDA---------VTESTDYSITVAAGGGISQPYQYTAQSSDTAENVLQQFKTQIEAD--------PTI 138 (450) T ss_pred EEeeccchhhhhhhh---------hccccceeEEEEecceeeeeeEEEEEecCChhhHHHHhhhhhccc--------cee Confidence 999999887766544 567789999999999999999999999999999999999998753 233 Q ss_pred EecccceeEEecccCcceeEEEEeeccchhhhhhhhcc-cCcceeecccccccHHHHHHHHHhhccceeEEEEEeccCCH Q lcl|NC_019451. 163 WNQNTNQFTLVGATIGTGVLAVAKSADPQDMSTALGWS-TSNVVNVAGQAADLPDAAVAKSTNVSNNFGSFLFAGAPLDN 241 (504) Q Consensus 163 ~d~~~~~F~its~t~ga~s~~~~~sa~~~~ia~~l~~t-~~~~~~~~g~aaet~~~al~~~~~~~~~wy~~~~~~~~~~~ 241 (504) ++ .|.+++...+...+........ +..++++ ..+.+.+++.++|++.++|+++.+.+++||+|.+. 
..++ T Consensus 139 ~~----~~~~~s~g~~~~~t~~~~~~~~---~~~~~l~~~~~~~~~~g~~aet~~~a~~a~~~~~~~w~~~~~~--~~~~ 209 (450) T protein:vir:95 139 KD----KVSVNVTGSNGSATMIIAKAGD---NDFVKVTTTAQTVYIASTTADTASTALAAIEAYSTDWYFIAAE--DRTQ 209 (450) T ss_pred ee----eeeeeeecccceeeeeeecccc---chhhccccccceeEecccccccHHHHHHHHHHhhCCeEEEEec--CCCH Confidence 43 4677776666555544433322 3455665 34667889999999999999999999999988764 3689 Q ss_pred HHHHHHHHHHhhcCCcEEEEEeccccchh------HHHHHH-hh-hcceeEEEEeccCCCccHHHHHHHHHHhcCcCcCC Q lcl|NC_019451. 242 DQIKAVSAWNAAQNNQFIYTVATSLANLG------TLFTLV-NG-NAGTALNVLSATAANDFVEQCPSEILAATNYDEPG 313 (504) Q Consensus 242 ~~~~a~A~w~e~~~~~~~~~~~~~d~~~~------~~~~~~-~~-~~~~~~~~~~~~~~~~~~~aa~~g~~as~nf~~~~ 313 (504) ++++++|+|+|+|+++|++..++.+.... ..+... +. ...+....+++....+|++++++|+++ +..| T Consensus 210 ~~i~a~a~w~~a~~~~f~~~~~~~~~~~~~~~~~~~~i~~~l~~~~~~~t~~~y~~~~~~~~~~aa~~g~~~----~~~~ 285 (450) T protein:vir:95 210 QFVLAMASEIQARKKIFFTANSDVTALQGTELASANDVPAQLAKNMYTRTVCLWHHAAAEDYPEMAYIAYGA----PYDA 285 (450) T ss_pred HHHHHHHHHHhhcCcEEEEEcCCchhhhhhhhhcccchHHHHHhccCCeeEEEeeCCCchhHHHHHHHHHhh----hccc Confidence 99999999999999999988877654221 122222 22 234455566665566788888888765 5679 Q ss_pred ceeeecccccCccccc-------cCCHHHHHHHHhCCCeEEEEEeeccceeeEEEcCEEeCCccccchhhhhhhHHHHHH Q lcl|NC_019451. 
314 ASQNYMYYQFPGRNIT-------VSDDTVANTVDKSRGNYIGVTQANGQQLAFYQRGILCGGPTDAVDMNVYANEIWLKS 386 (504) Q Consensus 314 g~~T~kfk~l~Gv~a~-------~lt~t~~~al~~~~~n~y~~~~~~~~~~~~~~~G~~~~G~y~~~wiD~~~~~dwl~~ 386 (504) |++|||||+++||+|+ +|+++|+++|+++|||||+.+.+ +.++++|+|++|+ |||++||+|||++ T Consensus 286 g~~T~~fk~l~Gv~~~v~~~~~~~lt~~~~~al~~~~~n~y~~~~~----~~~~~~G~~~~G~----~iD~~~~~~wl~~ 357 (450) T protein:vir:95 286 GSIAWGNAQLTGVAASLQPSNQRPLTSIQKSALDVRHCNFIDLDGG----VPVVRRGITSGGE----WIDIIRGVDWLES 357 (450) T ss_pred ceeeeccccccceeeeccCccccccchHHHHHHHhCCcEEEEEecC----ceeeeCCeeeCcc----hhHHHHHHHHHHH Confidence 9999999999999996 58999999999999999998754 5789999999997 5999999999999 Q ss_pred HHHHHHHHHHhcC--CCCCcCHHHHHHHHHHHHHHHHHHHhcCccccccccCcccceeeccccCCcccccceeecceEEE Q lcl|NC_019451. 387 AIAQALLDLFLNV--NAVPASSTGEAMTLAVLQPVLDKATANGTFTYGKEISAVQQQYITQVTGDRRAWRQVQTLGYWIN 464 (504) Q Consensus 387 ~lq~~l~~l~~~~--~kIPyt~~G~~~l~~~v~~vl~~a~~nG~Ia~G~~~~~~q~~~i~~~~g~~~~~~~~~~~GY~v~ 464 (504) +||++|++||.++ +|||||+.|+++|+++|+++|+|+++||+|+ ||+|+ T Consensus 358 ~iq~~l~~ll~~~~~~KiPy~~~G~~~i~a~i~~~l~~a~~~G~Ia-----------------------------~~~V~ 408 (450) T protein:vir:95 358 DLKTSLRDLLINQKGGKITYDDTGITRIRQVIETSLQRAVNRNFLS-----------------------------SYTVN 408 (450) T ss_pred HHHHHHHHHHHhcCCCCCccChhhHHHHHHHHHHHHHHHHhcCccc-----------------------------ceeEe Confidence 9999999998654 5999999999999999999999999999996 59999 Q ss_pred ecchHhCCHHHHhhcccCceEEEEEECCeEEEEEeeeeeC Q lcl|NC_019451. 
465 ITFSSYTNSNTGLTEWKANYTLIYSKGDAIRFVEGSDVMI 504 (504) Q Consensus 465 ~~~~~~~s~~dr~~R~~~~i~~~~~~aGAIh~v~i~~~~v 504 (504) +|++++++++||++|++|+++|+|+|+||||.++|+|++= T Consensus 409 ~~~~~~~~~~dr~~R~~~~i~~~~~laGAIh~~~i~~~v~ 448 (450) T protein:vir:95 409 VPKASQVALADKKARILKDVTFAGILAGAILDVDLKGTVA 448 (450) T ss_pred cCChHhcCHHHHhccCCCCeeEEEEEccceEEEEEEEEEE Confidence 9999999999999999999999999999999999999988 No 11 >protein:vir:80052 Length: 331 # NCBI annotation: gp14 # Family: family:all:396 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468718;genbank:gi:157325298;genbank:GeneID:5601743 Probab=100.00 E-value=1.5e-89 Score=507.56 Aligned_cols=328 Identities=15% Similarity=0.130 Sum_probs=276.7 Q ss_pred CccceEEEeeeecccc-cccccccceEEEecccccCccceEEecCHHHHHHhcCCCcHHHHHHHHHhccCCCCCcccceE Q lcl|NC_019451. 3 SQSRYIRIISGVGAGA-PVAGRKLILRVMTTNNVIPPGIVIEFDNANAVLSYFGAQSEEYQRAAAYFKFISKSVNSPSSI 81 (504) Q Consensus 3 p~s~iv~V~~~v~~~~-~~~~~~~~~l~l~~~~~~~~~r~~~y~s~~~V~~~Fg~~s~ey~aA~~yF~~~~~~~~~P~~l 81 (504) -+||||+|++.+...+ ..+.+++.+++|..++. +|+|.|+++++|+.|||.++|+|++|..+|+|.| +|.++ T Consensus 1 ~~~~iv~V~v~~~~~~~~~~~~~~~~~~~~~~t~---~~~~~y~s~~~v~~d~~~~~~~Ykaa~~~f~Q~~----~~~~i 73 (331) T protein:vir:80 1 MVETITDVRVHISVLYPSPRIGLGRPAIFVKGTA---MGYKEYTTLEELKDTFADNTEVYAKAKAVFLQKD----RPDTV 73 (331) T ss_pred CccceecceeeecccccccccccCcceeEEeccc---cceEEEechhhhccCCCCCcHHHHHHHHHHhccC----ccceE Confidence 6899999999987433 23445666666665543 7999999999999999999999999999999988 99999 Q ss_pred EEEeeeccCCcceeeecccchhhHhHhhccCceEEEEEcccceeeeeeccccccchHHHHHHHHhhhhcccccccceeEE Q lcl|NC_019451. 82 SFARWVNTAIAPMVVGDNLPKTIADFAGFSAGVLTIMVGAAEQNITAIDTSAATSMDNVASIIQTEIRKNADPQLAQATV 161 (504) Q Consensus 82 ~igr~~~~a~~~~l~g~~~~~~~~~~~~~~~g~~titi~g~~~~~~~i~~s~ats~~~vA~~i~~~i~a~~~~~~~~a~v 161 (504) +++++.++. 
T Consensus 74 ~v~~~~~~~----------------------------------------------------------------------- 82 (331) T protein:vir:80 74 AVITYEDTK----------------------------------------------------------------------- 82 (331) T ss_pred EEeccchHH----------------------------------------------------------------------- Confidence 998764210 Q ss_pred EEecccceeEEecccCcceeEEEEeeccchhhhhhhhcccCcceeecccccccHHHHHHHHHhhccceeEEEEEeccCCH Q lcl|NC_019451. 162 TWNQNTNQFTLVGATIGTGVLAVAKSADPQDMSTALGWSTSNVVNVAGQAADLPDAAVAKSTNVSNNFGSFLFAGAPLDN 241 (504) Q Consensus 162 t~d~~~~~F~its~t~ga~s~~~~~sa~~~~ia~~l~~t~~~~~~~~g~aaet~~~al~~~~~~~~~wy~~~~~~~~~~~ 241 (504) ..+++. ...+++||++.+. .+++ T Consensus 83 -----------------------------------------------------~~~a~~--a~~~~~w~~~~~~--~~~~ 105 (331) T protein:vir:80 83 -----------------------------------------------------LLEAAE--AYFLKSWHFALLA--EFKA 105 (331) T ss_pred -----------------------------------------------------HHHHHH--HhccCceeEEEee--cCCH Confidence 001111 1135678876654 3689 Q ss_pred HHHHHHHHHHhhcCCcEEEEEeccccchhHHHHHHhhhcceeEEEEeccCCCccHHHHHHHHHHhcCcCcCCceeeeccc Q lcl|NC_019451. 242 DQIKAVSAWNAAQNNQFIYTVATSLANLGTLFTLVNGNAGTALNVLSATAANDFVEQCPSEILAATNYDEPGASQNYMYY 321 (504) Q Consensus 242 ~~~~a~A~w~e~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~aa~~g~~as~nf~~~~g~~T~kfk 321 (504) ++++++|+|+|+++++|+++.++.+++... .. ...+.....+...++|++++++|++++++| |++||||| T Consensus 106 ~~~~a~a~~~~a~~~~f~~~~~~~~~~~~~-----~~-~~~~t~~~~~~~~~~~~~aa~~g~~~~~~~----g~~t~~fk 175 (331) T protein:vir:80 106 ADALALSNLIEEQKFKFAVFQVTAVADITP-----LA-KNTRTIAIVHSKTGEKLDAALIGNVASLPV----GSATWKGR 175 (331) T ss_pred HHHHHHHHHHhhCCcEEEEEecCchHHHHH-----hh-ccccEEEEEcCCccchhHHHHHHHHHhcCc----cceeeeee Confidence 999999999999999998887765543322 11 223344444567789999999999999876 78999999 Q ss_pred c-cCccccccCCHHHHHHHHhCCCeEEEEEeeccceeeEEEcCEEeCCccccchhhhhhhHHHHHHHHHHHHHHHHhcCC Q lcl|NC_019451. 
322 Q-FPGRNITVSDDTVANTVDKSRGNYIGVTQANGQQLAFYQRGILCGGPTDAVDMNVYANEIWLKSAIAQALLDLFLNVN 400 (504) Q Consensus 322 ~-l~Gv~a~~lt~t~~~al~~~~~n~y~~~~~~~~~~~~~~~G~~~~G~y~~~wiD~~~~~dwl~~~lq~~l~~l~~~~~ 400 (504) + |+||+|++++.+|+++|+++|||||+.+.+ +.++++|+|++|+ |||++||+|||+++||++|++||.+++ T Consensus 176 ~~l~GV~~~~lt~t~~~al~~~~~N~y~~~~~----~~~~~~G~~~~G~----~iD~~~~~dWl~~~lq~~l~~ll~~~~ 247 (331) T protein:vir:80 176 HGLAGITSEELKVSEIDAIQKAGGMCYIEKAG----IAQTSEGKTVSGE----FIDSIHGDDWIKATIETRLQKLLTETD 247 (331) T ss_pred cccCCCCCCCCCHHHHHHHHhcCceEEEEecC----eeEEecceEeCch----hHHHHHHHHHHHHHHHHHHHHHHHhCC Confidence 7 899999999999999999999999988754 5789999999997 699999999999999999999999999 Q ss_pred CCCcCHHHHHHHHHHHHHHHHHHHhcCccccccccCcccceeeccccCCcccccceeecceEEEecchHhCCHHHHhhcc Q lcl|NC_019451. 401 AVPASSTGEAMTLAVLQPVLDKATANGTFTYGKEISAVQQQYITQVTGDRRAWRQVQTLGYWINITFSSYTNSNTGLTEW 480 (504) Q Consensus 401 kIPyt~~G~~~l~~~v~~vl~~a~~nG~Ia~G~~~~~~q~~~i~~~~g~~~~~~~~~~~GY~v~~~~~~~~s~~dr~~R~ 480 (504) |||||+.|+++|+++|+++|++|++||+|+||+|+++. ||+|++|++++++++||++|+ T Consensus 248 kiPy~~~G~~~l~a~i~~~~~~av~~G~I~~g~~~~~~---------------------~~~v~~~~~~~~s~~dr~~R~ 306 (331) T protein:vir:80 248 KLTFDARGIALLQSELTTVLNEGFANGIIDSNDETGEP---------------------NFSITALQRSDLNDDDIAKRN 306 (331) T ss_pred CCccChhhHHHHHHHHHHHHHHHHhCCceecCccCCCc---------------------ceEEEeCchhcCCHHHHhccC Confidence 99999999999999999999999999999999997642 699999999999999999999 Q ss_pred cCceEEEEEECCeEEEEEeeeeeC Q lcl|NC_019451. 
481 KANYTLIYSKGDAIRFVEGSDVMI 504 (504) Q Consensus 481 ~~~i~~~~~~aGAIh~v~i~~~~v 504 (504) +|+++|+|+++||||.|+|+|++= T Consensus 307 ~~~~~~~~~~~gaI~~v~i~~~v~ 330 (331) T protein:vir:80 307 YKGLSFRYKRSGAIHSVDVYGEVE 330 (331) T ss_pred CCCeEEEEEEcceEEEEEEEEEEe Confidence 999999999999999999999876 No 12 >protein:vir:3165 Length: 426 # NCBI annotation: capsid protein CP67 # Family: family:all:28419 # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665936;genbank:gi:22091122;genbank:GeneID:951267 Probab=100.00 E-value=4.8e-68 Score=389.67 Aligned_cols=405 Identities=10% Similarity=0.047 Sum_probs=267.1 Q ss_pred CCccceEEEeeeecccccccccccceEEEecccccCc----cceEEecCHHHHHHhcCCCcHHHHHHHHHhccCCCCCcc Q lcl|NC_019451. 2 ISQSRYIRIISGVGAGAPVAGRKLILRVMTTNNVIPP----GIVIEFDNANAVLSYFGAQSEEYQRAAAYFKFISKSVNS 77 (504) Q Consensus 2 ip~s~iv~V~~~v~~~~~~~~~~~~~l~l~~~~~~~~----~r~~~y~s~~~V~~~Fg~~s~ey~aA~~yF~~~~~~~~~ 77 (504) .| .+||||++++.++++..++|+.+|||+.|.++|+ +|+|.|+|+++|++|||.+||+||||..||+|.+ T Consensus 1 m~-~~iVnV~Is~~t~A~~~~~Fg~~liigs~~~~~p~~~f~~~~~Yss~~~V~~Dfg~~s~~Y~AA~~~f~Q~~----- 74 (426) T protein:vir:31 1 MP-KQIVEIELTAEIADRPQETFTDAAIVGTAEEEPPDAEFGEVNQYSTSTSVGDDYGEDSDVYTASEAIEEMGA----- 74 (426) T ss_pred CC-cceEEEEeecccccccccccceeeeeeeccccccccccchhhhhhhHHHHHhcCCCChHHHHHHHHHHhCCc----- Confidence 45 8999999999999999999999999999999875 5888899999999999999999999999999954 Q ss_pred cceEEEEeeeccCCcceeeecccchhhHhHhhccCceEEEEEcccceeeeeeccccccchHHHHHHHHhhhhcccccccc Q lcl|NC_019451. 78 PSSISFARWVNTAIAPMVVGDNLPKTIADFAGFSAGVLTIMVGAAEQNITAIDTSAATSMDNVASIIQTEIRKNADPQLA 157 (504) Q Consensus 78 P~~l~igr~~~~a~~~~l~g~~~~~~~~~~~~~~~g~~titi~g~~~~~~~i~~s~ats~~~vA~~i~~~i~a~~~~~~~ 157 (504) ++||+.. ..+.. +.. ... +.+.+|+| ++.+......+.++.|...+.+..+.... 
T Consensus 75 ----~~~r~~v-~~at~-----~~~-----~~~---t~~~tv~g-------~~~s~~a~~~~~a~~i~~~~~~~~~~~~~ 129 (426) T protein:vir:31 75 ----EQWRVMV-LEATE-----VTE-----EEL---SDGDTIDK-------VPILGNHEVESPDGDIEFTTDDDPDVEDF 129 (426) T ss_pred ----eeEEeec-cccce-----eee-----ccC---Ccceeecc-------eeeeecccCcchHHHHHHhhccccccccc Confidence 5777631 11111 100 111 23345665 44455666678888888777665443221 Q ss_pred eeEEEEecccceeEEecccCcceeEEEEe---eccchhhhhhhhcccCcceeecccccccHHHHHHHHHhhccceeEEEE Q lcl|NC_019451. 158 QATVTWNQNTNQFTLVGATIGTGVLAVAK---SADPQDMSTALGWSTSNVVNVAGQAADLPDAAVAKSTNVSNNFGSFLF 234 (504) Q Consensus 158 ~a~vt~d~~~~~F~its~t~ga~s~~~~~---sa~~~~ia~~l~~t~~~~~~~~g~aaet~~~al~~~~~~~~~wy~~~~ 234 (504) ...+..+ ..+|..+..... +-...+.+.+.++ .+++-.........||... T Consensus 130 ~~~~~~~----------t~~g~~t~~~~~~~~~~s~~dw~~~~~~---------------~s~~~~~~ia~~~~~~~~~- 183 (426) T protein:vir:31 130 DAEIVIN----------SATGDVATSEDSIELTYFHADWSQLDEF---------------PSDVNNFAVADRRFDLKGV- 183 (426) T ss_pred eeeeEec----------cccceeeccccceeeeeccCcchhhhcc---------------cccchhhhhhccccchhhh- Confidence 1111111 111111110000 0001111111111 1111112222344554221 Q ss_pred EeccCCHHHHHHHHHHHhhcCCcEEEEEeccc--cchhHHHHHHhh--hcceeEEEEec-cCCCccHHHHHHHHHHhcCc Q lcl|NC_019451. 235 AGAPLDNDQIKAVSAWNAAQNNQFIYTVATSL--ANLGTLFTLVNG--NAGTALNVLSA-TAANDFVEQCPSEILAATNY 309 (504) Q Consensus 235 ~~~~~~~~~~~a~A~w~e~~~~~~~~~~~~~d--~~~~~~~~~~~~--~~~~~~~~~~~-~~~~~~~~aa~~g~~as~nf 309 (504) .+..++..|.+.+.++++...-..+ ....+....... 
.+.....+.++ ....+...+..++.++++++ T Consensus 184 -------~~~~~~~~wa~~~~i~~va~~~e~~~~~~~~~~~a~~~~~~~y~p~~~~~~~~~~~~~~~~~~~~~~~aa~~~ 256 (426) T protein:vir:31 184 -------GVLDETHSWASDEDMGMIANGVNVDDYDSVDEAMDVAHEVAGYVPSGDLMMIVDASDDDLAAYQLGKFAVSEP 256 (426) T ss_pred -------hhhHhhhhhhhhcceeeeeeccchhhhcchhhhhhhhhcccccccchhheeehhccccchhhHHhhhhhhhcc Confidence 1233677888887776543222211 111222222111 11112222222 33445667778888887765 Q ss_pred -------CcCCceeeecccccCccccccCCHHHHHHHHhCCCeEEEEEeeccc-eeeEEEcCEEeCCccccchhhhhhhH Q lcl|NC_019451. 310 -------DEPGASQNYMYYQFPGRNITVSDDTVANTVDKSRGNYIGVTQANGQ-QLAFYQRGILCGGPTDAVDMNVYANE 381 (504) Q Consensus 310 -------~~~~g~~T~kfk~l~Gv~a~~lt~t~~~al~~~~~n~y~~~~~~~~-~~~~~~~G~~~~G~y~~~wiD~~~~~ 381 (504) +..++...++|++.+|+...-. .++... .++++|.|+.+.+.+. ...+.++|++++|+| ||++||+ T Consensus 257 ~~~~~~~~~~~~~~~~~~~~~~gv~~t~~-~~~~A~-~~~~~n~~~~~~~~~~i~~~~~~~G~~~~G~~----iD~~~g~ 330 (426) T protein:vir:31 257 WYNPLWNELPAGETVSKNVGDPEEQGTFE-GGDEAE-GEGPVNVLIDVSDANRVSNAVTTAGADSDTSF----FDIRRTK 330 (426) T ss_pred ccchhhhhccccccceeeccccccccccc-hhhhhh-hcCCceEEEEecCceeeecceeecccccchhh----hhhHHHH Confidence 3455677788999999984333 233334 4588999988876532 445678899999995 9999999 Q ss_pred HHHHHHHHHHHHHHHhcCCCCCcCHHHHHHHHHHHHHHHHHHHhcCccccccccCcccceeeccccCCcccccceeecce Q lcl|NC_019451. 
382 IWLKSAIAQALLDLFLNVNAVPASSTGEAMTLAVLQPVLDKATANGTFTYGKEISAVQQQYITQVTGDRRAWRQVQTLGY 461 (504) Q Consensus 382 dwl~~~lq~~l~~l~~~~~kIPyt~~G~~~l~~~v~~vl~~a~~nG~Ia~G~~~~~~q~~~i~~~~g~~~~~~~~~~~GY 461 (504) |||+++||++|++||.|.+|||||+.||+||++.|+++|+++++.|..- -.|| T Consensus 331 dwl~~~iq~~l~~ll~~~~KIpyt~~Gi~~I~~~i~~~L~~~v~~~g~~---------------------------~~~y 383 (426) T protein:vir:31 331 VYTAEMLELDLESLQVSDDDVPFTEDGQAMIEDAIKGTMSGLTGSVGQP---------------------------LAEY 383 (426) T ss_pred HHHHHHHHHHHHHHhhcCCCCccchhHHHHHHHHHHHHHHHHhcCCCcc---------------------------ccce Confidence 9999999999999999999999999999999999999999998755311 1269 Q ss_pred EEEecchHhCCHHHHhhcccCceEEEEEECCeEEEEEeeeeeC Q lcl|NC_019451. 462 WINITFSSYTNSNTGLTEWKANYTLIYSKGDAIRFVEGSDVMI 504 (504) Q Consensus 462 ~v~~~~~~~~s~~dr~~R~~~~i~~~~~~aGAIh~v~i~~~~v 504 (504) +|+.|.+++++ +||++|++|+|+|.|+|+||||.++|+|++= T Consensus 384 ~v~~P~~~~~~-~dra~R~~~~i~~~~~laGAIh~v~I~g~v~ 425 (426) T protein:vir:31 384 EVDVPEWDDDD-VDRVNRNWGGIDLDARLAQRAHTFSLGLNVS 425 (426) T ss_pred eecCCCccccc-hhhhhhccCCceEEEEEeCcEEEEEEEEEEe Confidence 99999888865 6999999999999999999999999999987 No 13 >protein:vir:105470 Length: 451 # NCBI annotation: putative phage sheath tail protein # Family: family:all:632 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529880;genbank:gi:90592620;genbank:GeneID:3974534 Probab=99.24 E-value=1.7e-10 Score=74.14 Aligned_cols=413 Identities=12% Similarity=0.040 Sum_probs=194.9 Q ss_pred CCCccc-----eEEEeee-ecccccccccccceEEEecccccCccceEEecCHHHHHHhcCCC--cHHHHHHHHHhccCC Q lcl|NC_019451. 1 MISQSR-----YIRIISG-VGAGAPVAGRKLILRVMTTNNVIPPGIVIEFDNANAVLSYFGAQ--SEEYQRAAAYFKFIS 72 (504) Q Consensus 1 mip~s~-----iv~V~~~-v~~~~~~~~~~~~~l~l~~~~~~~~~r~~~y~s~~~V~~~Fg~~--s~ey~aA~~yF~~~~ 72 (504) -.-.+| ++|+.++ ..+..+. .....+++.-..--|+.....-.+-+|...-||.. ++.|++-+.+|. 
T Consensus 6 ~~~~~K~~PGvYi~~~~~~~~~~~~~--~~~~~~~i~~~~~~g~~~~v~i~~~~d~~~~fG~~~~~~~~~~~~~~~~--- 80 (451) T protein:vir:10 6 WKAQDKRRPGTYINVVGNGQREAASS--LGRVLLIRDKGLGWGKNGVIEVEANSDFTKKLGTTLDDPSLTALKETLK--- 80 (451) T ss_pred eccceeecCceEEEEeccCcceeecc--CCcEEEEEeeecCCCCcccEEeecHHHHHHHcCCcccchhHHHHHHHhc--- Confidence 223344 3444332 2221111 12233444433333333333445667888999944 667888888886 Q ss_pred CCCcccceEEEEeeeccCCcceeeecccchhhHhHhhccCc----eEEEEEcccceeeeeeccccccchHHHHHHHHhhh Q lcl|NC_019451. 73 KSVNSPSSISFARWVNTAIAPMVVGDNLPKTIADFAGFSAG----VLTIMVGAAEQNITAIDTSAATSMDNVASIIQTEI 148 (504) Q Consensus 73 ~~~~~P~~l~igr~~~~a~~~~l~g~~~~~~~~~~~~~~~g----~~titi~g~~~~~~~i~~s~ats~~~vA~~i~~~i 148 (504) .|+++|+.|.+.- +.+.. ++......+++...| .|+|+|.-........+++.... T Consensus 81 ----g~~~v~~yrl~~g-~~a~~---t~~~~~~~~~Aky~G~~Gn~i~v~v~~~~~d~~~~~v~t~~g------------ 140 (451) T protein:vir:10 81 ----GASKVLVLNPNEG-TAATL---TKEGLPWTVTANYPGEKGNQITVSVEVSPADQNAATVSTIFG------------ 140 (451) T ss_pred ----CCcEEEEEEcCCC-ceEEE---EeecCceEEEEeeCCcCCceEEEEEecccCCcCceEEEEEEC------------ Confidence 4788999987532 21111 111101112222222 46666533221111111110000 Q ss_pred hcccccccceeEEEE---e-cccceeE-EecccCcceeEEEEeeccchhhhhhhhcccCcceeecccccccHHHHHHHHH Q lcl|NC_019451. 149 RKNADPQLAQATVTW---N-QNTNQFT-LVGATIGTGVLAVAKSADPQDMSTALGWSTSNVVNVAGQAADLPDAAVAKST 223 (504) Q Consensus 149 ~a~~~~~~~~a~vt~---d-~~~~~F~-its~t~ga~s~~~~~sa~~~~ia~~l~~t~~~~~~~~g~aaet~~~al~~~~ 223 (504) .. .....++.. + -..+.++ ++....+ .+ .......++.+......+.+.+...++|+++. 
T Consensus 141 -~~---~vd~qtv~~~~~~el~~nd~V~a~~~~~g----------~~-~~~~~~~l~~~~~gg~~~~~~~~~~~~l~~~e 205 (451) T protein:vir:10 141 -TK---LVDEQSIKFNELDKFKGNDYITAKVVEEG----------SS-KPVAFTNVSGTLTGGTTTESNKVESLLNDALE 205 (451) T ss_pred -Ce---EEEEEEeeccchhhccCCceEEEEecccc----------cc-cceeeeecccccccccccCCccchHHHHHHhc Confidence 00 000000000 0 0001111 0000000 00 00001111111011112234567778888887 Q ss_pred hhccceeEEEEEeccCCHHHHHHHHHHHhhc----CCcEEEEEecc---ccchhHHHHHHhhhcceeEEEEeccCCCccH Q lcl|NC_019451. 224 NVSNNFGSFLFAGAPLDNDQIKAVSAWNAAQ----NNQFIYTVATS---LANLGTLFTLVNGNAGTALNVLSATAANDFV 296 (504) Q Consensus 224 ~~~~~wy~~~~~~~~~~~~~~~a~A~w~e~~----~~~~~~~~~~~---d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 296 (504) ....|| +.+.....+.+.+..+.+|+... .+++..++... +++.+.-+ +-....... -....+.... T Consensus 206 ~~~~n~--l~~~~~~~~~~i~~~~~a~ik~~r~~~g~~~~aVl~~~~~~~~d~egii---nv~n~~~~~-dg~~~~~~~~ 279 (451) T protein:vir:10 206 NEEYAV--VTTAGFEPSSNMNKLVVEAVKRLRENEGRKVRGVIPTDADTTYNYEGIS---TVVNGYTLS-DGTNVDVKDA 279 (451) T ss_pred cceeeE--EEEccCCCchHHHHHHHHHHHHHHHhcCCeEEEEecCccCCCCCCcceE---EeecceEec-Cceeechhhh Confidence 765554 33322222345677889999863 34444443211 11111000 000111000 0011222333 Q ss_pred HHHHHHHHHhcCcCcCCceeeecccccCccc-c-ccCCHHHHHHHHhCCCeEEEEEeeccceeeEEEcCEEe----C--- Q lcl|NC_019451. 297 EQCPSEILAATNYDEPGASQNYMYYQFPGRN-I-TVSDDTVANTVDKSRGNYIGVTQANGQQLAFYQRGILC----G--- 367 (504) Q Consensus 297 ~aa~~g~~as~nf~~~~g~~T~kfk~l~Gv~-a-~~lt~t~~~al~~~~~n~y~~~~~~~~~~~~~~~G~~~----~--- 367 (504) .+.+.|..+++.++ .+ .-||.++|+. . ..++.+|++.+.++|........ +.... 
+.+|+.+ + T Consensus 280 ~~~vAG~~Ag~~~~---~S--~T~~~~~~~~~v~~~~t~~e~~~~i~~G~lvl~~~~--g~~v~-i~~~INTltt~~~~k 351 (451) T protein:vir:10 280 TGYFAGISASADVA---TS--LTYFEVEDAVSAYPKFDNEKTIKALDAGQIVFTTRP--GQRVV-IEQDINSLHKFTAEK 351 (451) T ss_pred HHHHHHHHcccccc---cC--ccceecCCceeeeeeCCHHHHHHHHhCCeEEEEEEc--CCeEE-EEEccccceecCCCC Confidence 45555677666443 33 3566888863 3 57999999999999987543222 22222 3456543 1 Q ss_pred CccccchhhhhhhHHHHHHHHHHHHHHHHhcCCCCCcCHHHHHHHHHHHHHHHHHHHhcCccccccccCcccceeecccc Q lcl|NC_019451. 368 GPTDAVDMNVYANEIWLKSAIAQALLDLFLNVNAVPASSTGEAMTLAVLQPVLDKATANGTFTYGKEISAVQQQYITQVT 447 (504) Q Consensus 368 G~y~~~wiD~~~~~dwl~~~lq~~l~~l~~~~~kIPyt~~G~~~l~~~v~~vl~~a~~nG~Ia~G~~~~~~q~~~i~~~~ 447 (504) ++ +|.=|-+++-.|-+++.++..+-+.+. +|+|=+..|..++.+.+..-|++..+.|.|.++... +. T Consensus 352 ~~-~~~ki~vir~~D~i~~di~~~~~~~yi--Gk~~N~~~gr~~~~~~i~~yl~~l~~~g~i~~~~~~-d~--------- 418 (451) T protein:vir:10 352 PQ-AFSKNRVIRTLDEIATNTENTFERTYL--GNVGNNAAGRDLFKADRIAYLTSLQNRNMIQSFANT-DI--------- 418 (451) T ss_pred Cc-chhhhhHHHHHHHHHHHHHHHhhhccc--eecCCCHHHHHHHHHHHHHHHHHHHhCCCccCCCcc-ce--------- Confidence 33 455577777788777777665444333 699999999999999999999999999999764321 10 Q ss_pred CCcccccceeecceEEEecchHhCCHHHHhhcccCceEEEEEECCeEEEEEeeeeeC Q lcl|NC_019451. 448 GDRRAWRQVQTLGYWINITFSSYTNSNTGLTEWKANYTLIYSKGDAIRFVEGSDVMI 504 (504) Q Consensus 448 g~~~~~~~~~~~GY~v~~~~~~~~s~~dr~~R~~~~i~~~~~~aGAIh~v~i~~~~v 504 (504) .+. . 
-..+...-+++..+.-.||.++.++...= T Consensus 419 ----------------~v~---~-----~~~~~~v~v~~~v~pvdame~iy~t~~v~ 451 (451) T protein:vir:10 419 ----------------TVE---A-----GNDMDSIVVNLAVTPVDAMEKLYMTMVVR 451 (451) T ss_pred ----------------EEe---e-----cCCCCEEEEEEEEEEEeeeeeEEEEEEEc Confidence 000 0 00122234666666667777776665444 No 14 >protein:vir:102957 Length: 437 # NCBI annotation: sheath tail protein # Family: family:all:632 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945292;genbank:gi:39653727;uniprot:Q708M0;genbank:GeneID:2672871 Probab=99.20 E-value=6.6e-10 Score=70.90 Aligned_cols=403 Identities=12% Similarity=0.027 Sum_probs=199.4 Q ss_pred CCCc-----cceEEEeee-ecccccccccccceEEEecccccCccceEEecCHHHHHHhcCCC--cHHHHHHHHHhccCC Q lcl|NC_019451. 1 MISQ-----SRYIRIISG-VGAGAPVAGRKLILRVMTTNNVIPPGIVIEFDNANAVLSYFGAQ--SEEYQRAAAYFKFIS 72 (504) Q Consensus 1 mip~-----s~iv~V~~~-v~~~~~~~~~~~~~l~l~~~~~~~~~r~~~y~s~~~V~~~Fg~~--s~ey~aA~~yF~~~~ 72 (504) ..-. --+++..+. ..+..+.. -+...|+..-.-=|++.+..-+|.+|....||.. .+.|++.+.+|. T Consensus 6 ~~~~~k~~PGvYi~~~~~~~~~i~~~~--~~~~a~~~~~~~Gp~~~~~~i~s~~d~~~~fG~~~~~~~~~~~~~~~~--- 80 (437) T protein:vir:10 6 WKRQNKVRPGAYINVKSKDIAMTRLGG--DGVVTVPLALSFGQSKKLMKIRRGEDLFKKLGYEQESPQLLLLNEAFK--- 80 (437) T ss_pred ecccceecCceeEEEecCCcceeeccC--CcEEEEEEEecCCCCceeEEEecHHHHHHHcCCccchhHHHHHHHHhc--- Confidence 1112 334554332 22222332 2444444443333666666667788999999975 356777777875 Q ss_pred CCCcccceEEEEeeeccCCcceeeecccchhhHhHhhcc----CceEEEEEcccceeeeeeccccccchHHHHHHHHhhh Q lcl|NC_019451. 73 KSVNSPSSISFARWVNTAIAPMVVGDNLPKTIADFAGFS----AGVLTIMVGAAEQNITAIDTSAATSMDNVASIIQTEI 148 (504) Q Consensus 73 ~~~~~P~~l~igr~~~~a~~~~l~g~~~~~~~~~~~~~~----~g~~titi~g~~~~~~~i~~s~ats~~~vA~~i~~~i 148 (504) -++++|+-|... ...+.. +++.. ..+++.. ...|+|+|.-........++. 
T Consensus 81 ----g~~~~~~~R~~~-g~~a~~---tl~~~-~~~~A~~~G~~gn~i~v~v~~~~~d~~~~~v~---------------- 135 (437) T protein:vir:10 81 ----RVSEVLLYRLNT-GEKANV---SLSDN-VTAQAKYSGVRGNDITVTVKTNVDDPSSFDVV---------------- 135 (437) T ss_pred ----CCCEEEEEECCC-CceeeE---eeccc-eEEEeccCCcccceeEEEEeeccCCccceEEE---------------- Confidence 478999999754 222221 11110 0111111 124555543221111011100 Q ss_pred hcccccccceeEEEEecccc--eeEEecccCcceeEEEEeeccchhhhhh--hhcccCcceeecccccccHHHHHHHHHh Q lcl|NC_019451. 149 RKNADPQLAQATVTWNQNTN--QFTLVGATIGTGVLAVAKSADPQDMSTA--LGWSTSNVVNVAGQAADLPDAAVAKSTN 224 (504) Q Consensus 149 ~a~~~~~~~~a~vt~d~~~~--~F~its~t~ga~s~~~~~sa~~~~ia~~--l~~t~~~~~~~~g~aaet~~~al~~~~~ 224 (504) ++..... ...+................. ..++.. ..++-+ .. .....++..++|+++.. T Consensus 136 -------------~~~~~~~~d~~~v~~~~~~~~n~~v~~~~~-~~l~~~a~~~LtGG--~d-g~~t~~dy~~al~~le~ 198 (437) T protein:vir:10 136 -------------TFLDTVVMDLQTVKVLADLKNNALVEFSGT-GELQPVAGAKLTGG--TD-GAISTQDYLEYFKALET 198 (437) T ss_pred -------------EecCcceeeeeehhhhhhhhhhcccccccc-cccccccceeeecc--cc-CCCChhHHHHHHHHhcc Confidence 0000000 000000000000000000000 000000 001100 00 01234567889999976 Q ss_pred hccceeEEEEEeccCCHHHHHHHHHHHhhc----CCcEEEEEeccccchhHHHHHHhhhcceeEEEEeccCCCccHHHHH Q lcl|NC_019451. 225 VSNNFGSFLFAGAPLDNDQIKAVSAWNAAQ----NNQFIYTVATSLANLGTLFTLVNGNAGTALNVLSATAANDFVEQCP 300 (504) Q Consensus 225 ~~~~wy~~~~~~~~~~~~~~~a~A~w~e~~----~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~aa~ 300 (504) ...+| +.+ ...+.+++.++.+|++.. .+++..+.....++.+.-. ........ 
.-....+.....+.+ T Consensus 199 ~~~n~--l~~--~~~d~~~~t~~~~~ik~~r~~~g~~~~~V~~~~~~d~e~Ii---n~~n~~~~-~~~~~~~~~~~~a~v 270 (437) T protein:vir:10 199 VEFNY--MAL--PVEDASIKKAAINFIKRMREDEGLGAQLVVADSDADSEAVI---NVKNGVIL-SDKTVIDKTKATVWV 270 (437) T ss_pred CcceE--EEe--cCCChhHHHHHHHHHHHHHhccCceEEEEeCCCCCCCceEE---Eeecceee-cCcceechhhHHHHH Confidence 65444 333 234677889999998853 3333344333322211000 00000000 000011222233444 Q ss_pred HHHHHhcCcCcCCceeeecccccCccc-c-ccCCHHHHHHHHhCCCeEEEEEeeccceeeEEEcCEEe--C-----Cccc Q lcl|NC_019451. 301 SEILAATNYDEPGASQNYMYYQFPGRN-I-TVSDDTVANTVDKSRGNYIGVTQANGQQLAFYQRGILC--G-----GPTD 371 (504) Q Consensus 301 ~g~~as~nf~~~~g~~T~kfk~l~Gv~-a-~~lt~t~~~al~~~~~n~y~~~~~~~~~~~~~~~G~~~--~-----G~y~ 371 (504) .|..++.+.++ .+-||.++|+. . ..++.+|++.+.++|...... .+... .+.+|+.+ + ++ + T Consensus 271 AG~~Ag~~~~~-----S~t~~~~~~~~~v~~~~t~~e~~~~i~~G~~vl~~---~~~~v-~i~~gInTltt~~~~~~~-~ 340 (437) T protein:vir:10 271 AAASANAGVEK-----SLTYEKYEDSVDVVGRLSHTETEDALLKGQFVFTA---RRGRA-VVEQDINSHVSFTIEKNQ-D 340 (437) T ss_pred HHHhccCcccc-----CccccccCCcccccccCCHHHHHHHHhCCcEEEEE---eCCeE-EEEEccccccccCCCCCc-h Confidence 56666654332 34578899874 3 578999999999999886632 23333 34456533 1 22 4 Q ss_pred cchhhhhhhHHHHHHHHHHHHHHHHhcCCCCCcCHHHHHHHHHHHHHHHHHHHhcCccccccccCcccceeeccccCCcc Q lcl|NC_019451. 372 AVDMNVYANEIWLKSAIAQALLDLFLNVNAVPASSTGEAMTLAVLQPVLDKATANGTFTYGKEISAVQQQYITQVTGDRR 451 (504) Q Consensus 372 ~~wiD~~~~~dwl~~~lq~~l~~l~~~~~kIPyt~~G~~~l~~~v~~vl~~a~~nG~Ia~G~~~~~~q~~~i~~~~g~~~ 451 (504) |.-|-+++-.|.+.+.++..+-+.+. +|+|=+..|..++++.+...|++..+.|.|.+.... 
T Consensus 341 ~~ki~vir~~D~i~~di~~~~~~~yi--Gk~~N~~~~r~~~~~~i~~yl~~l~~~g~I~~~~~~---------------- 402 (437) T protein:vir:10 341 FRKNRILRTLDDIVNDTRYAFSEYFL--GKVSNNEDGRQAFKANRIRYFKDLEARGAIEDFKVE---------------- 402 (437) T ss_pred hhhhhHHHHHHHHHHHHHHHHHhccc--cccCCCHHHHHHHHHHHHHHHHHHHhCCCccCCCce---------------- Confidence 55577777777777777765555444 689999999999999999999999999999752211 Q ss_pred cccceeecceEEEecchHhCCHHHHhhcccCceEEEEEECCeEEEEEeeeeeC Q lcl|NC_019451. 452 AWRQVQTLGYWINITFSSYTNSNTGLTEWKANYTLIYSKGDAIRFVEGSDVMI 504 (504) Q Consensus 452 ~~~~~~~~GY~v~~~~~~~~s~~dr~~R~~~~i~~~~~~aGAIh~v~i~~~~v 504 (504) + +.+ .+. ..+...-+++..+.-.++.++.++..+= T Consensus 403 ---d-------~~v---~~~-----~~~~~v~v~~~v~~~dame~iy~ti~v~ 437 (437) T protein:vir:10 403 ---D-------IEV---LRG-----ELKESVVVNVKVKPVDSMEKLYMTVTVE 437 (437) T ss_pred ---e-------EEe---ecC-----CCCCEEEEEEEEEEeeeeeeEEEEEEec Confidence 0 111 000 1122334777788888888888876655 No 15 >protein:vir:4517 Length: 498 # NCBI annotation: tail sheath protein # Family: family:all:369 # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599043;genbank:gi:19549001;genbank:GeneID:935236 Probab=99.12 E-value=1.7e-09 Score=68.70 Aligned_cols=443 Identities=13% Similarity=0.063 Sum_probs=232.3 Q ss_pred CCCccc-----eEEEeeeecccccccccccceEEEeccc---ccCc-cceEEecCHHHHHHhcCCCcHHHHHHHHHhccC Q lcl|NC_019451. 1 MISQSR-----YIRIISGVGAGAPVAGRKLILRVMTTNN---VIPP-GIVIEFDNANAVLSYFGAQSEEYQRAAAYFKFI 71 (504) Q Consensus 1 mip~s~-----iv~V~~~v~~~~~~~~~~~~~l~l~~~~---~~~~-~r~~~y~s~~~V~~~Fg~~s~ey~aA~~yF~~~ 71 (504) =||-|- ++.+.-+.. ..+.+.--.|+++... 
-.++ ..+|.+ |.++..+.||..|--..|++.|..-- T Consensus 7 ~IP~~iRvP~~y~E~dns~A---~~~~~~q~vLiiGq~la~gs~~~~~~v~v~-s~~~a~~lfG~GSml~~M~~a~~~~n 82 (498) T protein:vir:45 7 TIPSNTLVPLFYAEMDNQAA---NTAQDSGASLLIGHANNGAEIVANSLVLMP-SADYARQICGAGSQLARMVEAYRQTD 82 (498) T ss_pred hcCcccccCeEEEEEeCCCC---CCCCCCcceEEEEecCCccccccceeEEec-CHHHHHHhcCcCcHHHHHHHHHHHhC Confidence 344442 223333333 3333344566665532 2233 445665 67778999999999999999999975 Q ss_pred CCCCcccceEEEEeeeccCCcceeeecccchhhHhHhhccCceEEEEEcccceeeeeeccccccchHHHHHHHHhhhhcc Q lcl|NC_019451. 72 SKSVNSPSSISFARWVNTAIAPMVVGDNLPKTIADFAGFSAGVLTIMVGAAEQNITAIDTSAATSMDNVASIIQTEIRKN 151 (504) Q Consensus 72 ~~~~~~P~~l~igr~~~~a~~~~l~g~~~~~~~~~~~~~~~g~~titi~g~~~~~~~i~~s~ats~~~vA~~i~~~i~a~ 151 (504) | =..||+=..... . +.--.|+++-. -.+-.+|.+.+.|+|.... +......+.+.+|+.+.++|++. T Consensus 83 ~-----~~~l~~i~~~d~-a-G~aA~g~it~t---g~at~~G~l~l~Igg~~v~---v~V~~gdTaa~vA~al~aaina~ 149 (498) T protein:vir:45 83 P-----FGELYVIAVPEA-T-GAAATVTLTVT---GEATESGTVNVYVGRTRVQ---APVTNGDNVTTIASSIQDAINAV 149 (498) T ss_pred C-----cceEEEEeeCCc-c-cceeEEEEEee---cccCCCcEEEEEECCEEEE---EEecCCCCHHHHHHHHHHHHhCC Confidence 4 356776666432 1 11112233311 1223579999999998765 34556678889999999999986 Q ss_pred cccccceeEEEEecccceeEEecccCcc---eeEEEE--ee-ccchhhhhhhhcccCcceeecccccccHHHHHHHHHhh Q lcl|NC_019451. 152 ADPQLAQATVTWNQNTNQFTLVGATIGT---GVLAVA--KS-ADPQDMSTALGWSTSNVVNVAGQAADLPDAAVAKSTNV 225 (504) Q Consensus 152 ~~~~~~~a~vt~d~~~~~F~its~t~ga---~s~~~~--~s-a~~~~ia~~l~~t~~~~~~~~g~aaet~~~al~~~~~~ 225 (504) +.. .|+-........+|..-.|. ...... .. ..+..+...+.++.. ....|....++.++|+++.+. 
T Consensus 150 ~~l-----PVTA~~~~~~VtlTAr~kG~~GN~I~l~~~~~~~~~ge~~p~Glt~~it--amagGag~PD~a~alaal~~~ 222 (498) T protein:vir:45 150 PTL-----PFTASSSAGVVTLTARHKGLCGNEIPVSLNYYGFGGGEVLPAGVQIAVA--TGTAGTGAPVLTGAVAAMADE 222 (498) T ss_pred CCC-----ceEEEecCceEEEEeeccCccccceeEEEeeccccccccccceeeEEEE--ccCCCccCchhHHHHHHhccC Confidence 442 23322333444555433332 111111 00 001111111211111 112244444778888888654 Q ss_pred ccceeEEEEEeccCCHHHHHHHHHHHhh-------cCCcEEEEEeccccchhHHHHHHhhhcceeEEEEeccCCCccH-- Q lcl|NC_019451. 226 SNNFGSFLFAGAPLDNDQIKAVSAWNAA-------QNNQFIYTVATSLANLGTLFTLVNGNAGTALNVLSATAANDFV-- 296 (504) Q Consensus 226 ~~~wy~~~~~~~~~~~~~~~a~A~w~e~-------~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-- 296 (504) ||-++.. ...+.+.+.++-.+.+. -..++..++.-...+.....+.-...+....++..+....+-| T Consensus 223 ---~~~~I~~-p~~D~asL~al~~~L~~~sgRw~~~~q~~g~~~~a~~gT~~~l~t~g~~~N~~~it~~~~~~~~~sp~~ 298 (498) T protein:vir:45 223 ---PFDYIGL-PFNDTASVNTLVTEMNDTSGRWSYARQLYGHVYTAKTGTLSELVNAGDQFNQQHITLAGYEKETQTPAD 298 (498) T ss_pred ---CccEEEE-eeCCHHHHHHHHHHHhhhhhhhhHHhhcCeEEEEeccCCHHHHHHhhhccCCceEEEEecCCCCCChHH Confidence 4434332 22345556677777654 2334444444444444333333333344444444443232323 Q ss_pred --HHHHHHHHH---hcCcCcCCceeeecccccCccccc----cCCHHHHHHHHhCCCeEEEEEeeccceeeEEEcCEEe- Q lcl|NC_019451. 297 --EQCPSEILA---ATNYDEPGASQNYMYYQFPGRNIT----VSDDTVANTVDKSRGNYIGVTQANGQQLAFYQRGILC- 366 (504) Q Consensus 297 --~aa~~g~~a---s~nf~~~~g~~T~kfk~l~Gv~a~----~lt~t~~~al~~~~~n~y~~~~~~~~~~~~~~~G~~~- 366 (504) +++++++++ ..|+.+ ++.=-.|+||.|. 
.++.+|.+.|..+|+..+.+- .| ...+.+.+++ T Consensus 299 ~~AAa~aa~~A~~l~~DPAr-----PL~tl~L~Gi~~p~~~~r~~~~ern~LL~~Gist~~V~--~G--~V~I~R~ITTY 369 (498) T protein:vir:45 299 ELAASRTARAAVFIRNDPAR-----PTQTGELVGMLPAPKGKRFTMTEQQTLLSHGVATAYVE--SG--VLRIQRDVTTY 369 (498) T ss_pred HHHHHHHHHHHHHhhccccc-----ccCceeecceecCCchhcCChHHHHHHHhCCcceEEEc--CC--eEEEEeeeeee Confidence 334444444 334433 3333456788754 467999999999999999642 33 2444566553 Q ss_pred ----CCccccchhh--hhhhHHHHHHHHHHHHHHHHhcCCCCCcCHHH---------HHHHHHHHHHHHHHHHhcCcccc Q lcl|NC_019451. 367 ----GGPTDAVDMN--VYANEIWLKSAIAQALLDLFLNVNAVPASSTG---------EAMTLAVLQPVLDKATANGTFTY 431 (504) Q Consensus 367 ----~G~y~~~wiD--~~~~~dwl~~~lq~~l~~l~~~~~kIPyt~~G---------~~~l~~~v~~vl~~a~~nG~Ia~ 431 (504) .|.-|..|.| .++-.+++...++..+-.-|-. .|+--+... -.+|++.+-..+++....|++. T Consensus 370 ~~n~~G~~D~syLDi~T~~tl~yvr~~~r~~i~~kfpR-~KLa~dg~~~~~gq~IvTp~~ir~ell~~y~~le~~givE- 447 (498) T protein:vir:45 370 RKNAYGVADNSYLDSETLHTSAYVLRKLKSVITSKYGR-HKLASDGTRFGPGQAIVTPAVIKGELLATYRQLERAGIVE- 447 (498) T ss_pred eecCCCCcchhhhhhhhHHHHHHHHHHHHHHhhhhcCC-eeecccCcccCCCCcccchHHHHHHHHHHHHhhhhhcccc- Confidence 5666666666 4677788888888777765522 233322111 2578889999999999999986 Q ss_pred ccccCcccceeeccccCCcccccceeecceEEEecchHhCCHHHHhhcccCceEEEEEECCe Q lcl|NC_019451. 
432 GKEISAVQQQYITQVTGDRRAWRQVQTLGYWINITFSSYTNSNTGLTEWKANYTLIYSKGDA 493 (504) Q Consensus 432 G~~~~~~q~~~i~~~~g~~~~~~~~~~~GY~v~~~~~~~~s~~dr~~R~~~~i~~~~~~aGA 493 (504) +-+..|+.+..|-..+++.+ .-+.+|+ +.--+-|---..-.+.+.|..++| T Consensus 448 ---n~~~~~~~LiVerd~~dpnR------ln~~~p~--d~vn~L~V~A~~~~f~lq~~~~~~ 498 (498) T protein:vir:45 448 ---NYELFKQYLVVERDASVPNR------LNTLFPP--DYVNQLRVFAVVNQFRLQYSEESA 498 (498) T ss_pred ---ChhhhcceeEEEECCCCCcE------EEEEecc--cccCchhhhhhhhhhheehhhcCC Confidence 23333433333333333211 1223331 111122222222234555666666 No 16 >protein:vir:99306 Length: 587 # NCBI annotation: putative major tail sheath protein # Family: family:all:2449 # MgeID: mge:1655 # MgeName: K # Cross-refs: genbank:acc:YP_024479;genbank:gi:48696438;genbank:GeneID:2948034 Probab=99.08 E-value=2.6e-09 Score=67.63 Aligned_cols=445 Identities=11% Similarity=0.077 Sum_probs=214.7 Q ss_pred CCCccc---eEEEeee-ecccccccccccceEEEecccccCccceEEecCHHHHHHhcCCCcHHHHHHHHHhccCCCCCc Q lcl|NC_019451. 1 MISQSR---YIRIISG-VGAGAPVAGRKLILRVMTTNNVIPPGIVIEFDNANAVLSYFGAQSEEYQRAAAYFKFISKSVN 76 (504) Q Consensus 1 mip~s~---iv~V~~~-v~~~~~~~~~~~~~l~l~~~~~~~~~r~~~y~s~~~V~~~Fg~~s~ey~aA~~yF~~~~~~~~ 76 (504) +=|++| +|++..+ +.+..+ ....++.|++....=|++++..+++.++..+-||.. +--.+....|.+++ .. T Consensus 8 ~~~~~~pgv~~~~~~~~~~~~~~--~~~~~~~~~g~a~~G~~~~~~~~~~~~~~~~~~~~g-~l~~~~~~a~~~~~--~~ 82 (587) T protein:vir:99 8 RRPITRPHASIEVDTSGIGGSAG--SSEKVFCLIGQAEGGEPNTVYELRNYSQAKRLFRSG-ELLDAIELAWGSNP--NY 82 (587) T ss_pred CcccccCceEEEEecCCccccCC--CCCceEEEEEEecCCccceeEEeccHHHHHHHhcCc-chHHHHHHHhcccc--CC Confidence 333333 3344333 333333 356888888887766788989999999999999885 45677888998876 35 Q ss_pred ccceEEEEeeeccCCcceeeeccc--chhhHhHhhccCceEEEEEccccee----ee-eecc-ccccchHHHHHH--HH- Q lcl|NC_019451. 77 SPSSISFARWVNTAIAPMVVGDNL--PKTIADFAGFSAGVLTIMVGAAEQN----IT-AIDT-SAATSMDNVASI--IQ- 145 (504) Q Consensus 77 ~P~~l~igr~~~~a~~~~l~g~~~--~~~~~~~~~~~~g~~titi~g~~~~----~~-~i~~-s~ats~~~vA~~--i~- 145 (504) .++++|+.|... +.++.+.=+.+ .+... 
..-.+.+.+.+.-.... .. ...- ..-..+..+-.. |+ T Consensus 83 g~~~~~~~rv~~-~~~a~~~~~~l~~~a~~~---G~~gN~i~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~v~~i~y 158 (587) T protein:vir:99 83 TAGRILAMRIED-AKPASAEIGGLKITSKIY---GNVANNIQVGLEKNTLSDSLRLRVIFQDDRFNEVYDNIGNIFTIKY 158 (587) T ss_pred CceEEEEEEcCC-CceeEEEecCeEEEEeec---cccccceEEEEccCCCCcceeEEEEEecccceeeeeeccceeeEEe Confidence 779999999843 34443321211 11000 00011223311110000 00 0000 000111111110 00 Q ss_pred ------hhhhccccccccee-EEEEecc---cceeEEecccCcce--------------eE------------------- Q lcl|NC_019451. 146 ------TEIRKNADPQLAQA-TVTWNQN---TNQFTLVGATIGTG--------------VL------------------- 182 (504) Q Consensus 146 ------~~i~a~~~~~~~~a-~vt~d~~---~~~F~its~t~ga~--------------s~------------------- 182 (504) +.+..........+ ..+...- ...|.++++.+.+. +. T Consensus 159 ~g~~~~a~~~v~~~~~t~~a~~~~l~~g~~~v~~yrL~~g~~~~~~~~~~~i~~~~~~tAky~~~~~~~i~~~~~~~~~~ 238 (587) T protein:vir:99 159 KGEEANATFSVEHDEETQKASRLVLKVGDQEVKSYDLTGGAYDYTNAIITDINQLPDFEAKLSPFGDKNLESSKLDKIEN 238 (587) T ss_pred ecccccceeeEeecCcceeeeeeeeecCCceeEEEEecCCchHHHHHHHhhhccccceeEEeeccCCceeEeeccccccc Confidence 00000000000000 0111000 00122322210000 00 Q ss_pred --EEEe----eccchhh-------------------------h-----------hhhhc---c-cCcceeecccc---cc Q lcl|NC_019451. 183 --AVAK----SADPQDM-------------------------S-----------TALGW---S-TSNVVNVAGQA---AD 213 (504) Q Consensus 183 --~~~~----sa~~~~i-------------------------a-----------~~l~~---t-~~~~~~~~g~a---ae 213 (504) +... .+...++ . ...+. . ........|.+ .+ T Consensus 239 ~~v~~~~~~v~a~~~D~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~t~LtGG~dG~~~~ 318 (587) T protein:vir:99 239 ANIKDKAVYVKAVFGDLEKQTAYNGIVSFEQLNAEGEVPSNVEVEAGEESATVTATSPIKTIEPFELTKLKGGTNGEPPA 318 (587) T ss_pred ceeeeeeeeeehhccceeeecccceeeeeeecccccchhhhhhhhhccccceeeeeccccceecccceeeecCCCCCccc Confidence 0000 0000000 0 00000 0 00001222322 34 Q ss_pred cHHHHHHHHHhhccceeEEEEEeccCCHHHHHHHHHHHhhcC---CcEEEEEecc-ccchhHHHHHHhhh-cceeEEEEe Q lcl|NC_019451. 
214 LPDAAVAKSTNVSNNFGSFLFAGAPLDNDQIKAVSAWNAAQN---NQFIYTVATS-LANLGTLFTLVNGN-AGTALNVLS 288 (504) Q Consensus 214 t~~~al~~~~~~~~~wy~~~~~~~~~~~~~~~a~A~w~e~~~---~~~~~~~~~~-d~~~~~~~~~~~~~-~~~~~~~~~ 288 (504) +..++|+++..+ +|+.+.. +. .+.+.+.++.+|++... +++..++... +.+... ..+.... ...+...+. T Consensus 319 sy~~al~ale~~--~~~~i~~-~t-~d~~i~a~l~a~vk~~r~~g~~~~aVlg~~~~~~~~~-~~~~a~~~n~e~vi~v~ 393 (587) T protein:vir:99 319 TWADKLDKFAHE--GGYYIVP-LS-SKQSVHAEVASFVKERSDAGEPMRAIVGGGFNESKEQ-LFGRQASLSNPRVSLVA 393 (587) T ss_pred cHHHHHHHHhhC--CcEEEEe-cC-CCHHHHHHHHHHHHHHHhCCCcEEEEecCCCCCCHHH-HHHHhhhcCCCcEEEEe Confidence 568889998764 5665543 22 24455677999987542 3344333322 222222 2222222 222222221 Q ss_pred cc------C--CCccH----HHHHHHHHHhcCcCcCCceeeecccccC--ccccccCCHHHHHHHHhCCCeEEEEEeecc Q lcl|NC_019451. 289 AT------A--ANDFV----EQCPSEILAATNYDEPGASQNYMYYQFP--GRNITVSDDTVANTVDKSRGNYIGVTQANG 354 (504) Q Consensus 289 ~~------~--~~~~~----~aa~~g~~as~nf~~~~g~~T~kfk~l~--Gv~a~~lt~t~~~al~~~~~n~y~~~~~~~ 354 (504) .. . ...+| ++.+.|..+..+.+. ++| ||.++ ++. ..++.+|++.+..+|++......++. T Consensus 394 ~~~~~~~~dg~~~~~~~~~~aa~vAGl~Ag~~~~~---SlT--~~~i~~~~v~-~~~t~~e~e~li~~Gvl~l~~~~~~~ 467 (587) T protein:vir:99 394 NSGTFVMDDGRKNHVPAYMVAVALGGLASGLEIGE---SIT--FKPLRVSSLD-QIYESIDLDELNENGIISIEFVRNRT 467 (587) T ss_pred ccceEecCCCceeeechHHHHHHHHHHHhcCchhc---Ccc--ceeeeccccc-ccCCHHHHHHHHhCCeEEEEEecCCc Confidence 11 0 11233 345556777665443 333 34444 443 37899999999999999886554443 Q ss_pred ceeeEEEcCEEe---CCccccchhhhhhhHHHHHHHHHHHHHHHHhcCCCCCcCHHHHHHHHHHHHHHHHHHHhcCcccc Q lcl|NC_019451. 355 QQLAFYQRGILC---GGPTDAVDMNVYANEIWLKSAIAQALLDLFLNVNAVPASSTGEAMTLAVLQPVLDKATANGTFTY 431 (504) Q Consensus 355 ~~~~~~~~G~~~---~G~y~~~wiD~~~~~dwl~~~lq~~l~~l~~~~~kIPyt~~G~~~l~~~v~~vl~~a~~nG~Ia~ 431 (504) ...-...+|..+ +....|.-|-+++-.|.+.+.++..+-+.|.- | |=++.|...|++.+++.|++-.+.|.|.. 
T Consensus 468 ~~~vriv~~ItT~t~~~~~~~~~i~viRv~D~i~~di~~~~~~~yiG--k-~Nn~~~r~~i~~~i~~~L~~l~~~gaI~~ 544 (587) T protein:vir:99 468 NTFFRIVDDVTTFNDKSDPVKAEMAVGEANDFLVSELKVQLEDQFIG--T-RTINTSASIIKDFIQSYLGRKKRDNEIQD 544 (587) T ss_pred ceEEEEeeceeeccCCCCchhhhhhhhhhHHHHHHHHHHHHHhhCCc--c-ccchHHHHHHHHHHHHHHHHHHhCCcccC Confidence 322223345544 22224555788999999999998887766644 3 56789999999999999999999999963 Q ss_pred ccccCcccceeeccccCCcccccceeecceEEEecchHhCCHHHHhhcccCceEEEEEECCeEEEEEeeeeeC Q lcl|NC_019451. 432 GKEISAVQQQYITQVTGDRRAWRQVQTLGYWINITFSSYTNSNTGLTEWKANYTLIYSKGDAIRFVEGSDVMI 504 (504) Q Consensus 432 G~~~~~~q~~~i~~~~g~~~~~~~~~~~GY~v~~~~~~~~s~~dr~~R~~~~i~~~~~~aGAIh~v~i~~~~v 504 (504) . ...+.| +.. .+|+ .-+++.+..--+|++|-++.+.- T Consensus 545 ~-~~~dv~-----------------------v~~-------~~d~-----~~v~~~v~Pv~~mekIy~tv~~~ 581 (587) T protein:vir:99 545 F-PAEDVQ-----------------------VIV-------EGNE-----ARISMTVYPIRSFKKISVSLVYK 581 (587) T ss_pred C-CccceE-----------------------EEe-------cCCE-----EEEEEEEEEcccceEEEEEEEEE Confidence 1 111110 000 1111 13667777777888888777766 No 17 >protein:vir:4463 Length: 498 # NCBI annotation: Tail Sheath protein # Family: family:all:369 # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700386;genbank:gi:23505458;genbank:GeneID:955665 Probab=99.05 E-value=3.9e-09 Score=66.68 Aligned_cols=443 Identities=12% Similarity=0.061 Sum_probs=229.7 Q ss_pred CCCccc-----eEEEeeeecccccccccccceEEEeccc---ccCc-cceEEecCHHHHHHhcCCCcHHHHHHHHHhccC Q lcl|NC_019451. 1 MISQSR-----YIRIISGVGAGAPVAGRKLILRVMTTNN---VIPP-GIVIEFDNANAVLSYFGAQSEEYQRAAAYFKFI 71 (504) Q Consensus 1 mip~s~-----iv~V~~~v~~~~~~~~~~~~~l~l~~~~---~~~~-~r~~~y~s~~~V~~~Fg~~s~ey~aA~~yF~~~ 71 (504) =||-|- ++.+.-+ .+..+.+---.|+++... 
-.++ ..+|.+ |.++..+.||..|--..|++.|..-- T Consensus 7 ~IP~~iRvP~~y~E~dns---~A~~~~~~q~vLiiGq~la~gs~~~~~~v~v~-s~~~a~~~fG~GSml~~M~~a~~~~n 82 (498) T protein:vir:44 7 SIPSDTRVPLFYAEMDNS---AANTARDSGASLLIGHASNDASIAVNSLVLVS-SVDYARQICGAGSQLARMVGAYRKTD 82 (498) T ss_pred hcCcccccCeEEEEEeCC---CCCCCcCCcceEEEEecCcccccccceeEeec-CHHHHHHhcCcccHHHHHHHHHHHhC Confidence 244332 2223222 223444444566666532 2233 445664 77778999999999999999999974 Q ss_pred CCCCcccceEEEEeeeccCCcceeeecccchhhHhHhhccCceEEEEEcccceeeeeeccccccchHHHHHHHHhhhhcc Q lcl|NC_019451. 72 SKSVNSPSSISFARWVNTAIAPMVVGDNLPKTIADFAGFSAGVLTIMVGAAEQNITAIDTSAATSMDNVASIIQTEIRKN 151 (504) Q Consensus 72 ~~~~~~P~~l~igr~~~~a~~~~l~g~~~~~~~~~~~~~~~g~~titi~g~~~~~~~i~~s~ats~~~vA~~i~~~i~a~ 151 (504) | =..||+=..... . +.--.|+++-. -.+-.+|.+.+.|+|.... +......+.+.+|+.+.++|++. T Consensus 83 ~-----~~~l~~i~~~D~-a-G~aAtg~it~t---g~at~~G~l~l~Igg~~v~---v~V~~gdTaa~vA~al~aaina~ 149 (498) T protein:vir:44 83 P-----FGELYVIAVPES-T-GAAATVALTVT---GEATETGTVNVYTGRTRVQ---APVTSGDDAAAVAVSIKDAVNAN 149 (498) T ss_pred C-----CceeEEEecCCc-c-cceeEEEEEee---cccCCCcEEEEEECCEEEE---EEecCCCCHHHHHHHHHHHHhCC Confidence 3 445666555432 1 11112233211 1223579999999998765 44566678889999999999986 Q ss_pred cccccceeEEEEecccceeEEecccCcc---eeEEE--Eee-ccchhhhhhhhcccCcceeecccccccHHHHHHHHHhh Q lcl|NC_019451. 152 ADPQLAQATVTWNQNTNQFTLVGATIGT---GVLAV--AKS-ADPQDMSTALGWSTSNVVNVAGQAADLPDAAVAKSTNV 225 (504) Q Consensus 152 ~~~~~~~a~vt~d~~~~~F~its~t~ga---~s~~~--~~s-a~~~~ia~~l~~t~~~~~~~~g~aaet~~~al~~~~~~ 225 (504) +.. .|+-........+|..-.|. ..... +.. ..+..+...+.++.. ....|....++.++|+++.+. 
T Consensus 150 ~~l-----PVTA~~~~~~vtlTAr~kG~~GN~I~l~~~~~~~~~ge~~p~Glt~tit--amsgGag~PDia~alaal~~~ 222 (498) T protein:vir:44 150 PDL-----PFTATSEAGVVTLTARHKGLYGNEIPVTLNYYGFGGGEVLPAGVNITVA--SGVKGAGAPALNDAVAAMGDE 222 (498) T ss_pred CCC-----ceEEeeccceEEEEEeccCcccCcceEEEeeccCccccccccceeEEEE--cccCCccCchhHHHHHhhccC Confidence 542 22222223444454433332 11111 000 011111111211111 112233344778888887654 Q ss_pred ccceeEEEEEeccCCHHHHHHHHHHHhh-------cCCcEEEEEeccccchhHHHHHHhhhcceeEEEEeccCCCccH-- Q lcl|NC_019451. 226 SNNFGSFLFAGAPLDNDQIKAVSAWNAA-------QNNQFIYTVATSLANLGTLFTLVNGNAGTALNVLSATAANDFV-- 296 (504) Q Consensus 226 ~~~wy~~~~~~~~~~~~~~~a~A~w~e~-------~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-- 296 (504) ||-++.. ...+.+.+.++..+.+. -..++..+++....+.....+.-...+....++..+....+.| T Consensus 223 ---~~~~i~~-p~~D~asl~al~~~L~~~sgRw~~~~q~~g~~~~a~~gT~a~l~t~g~~~N~~~it~~~~~~~~~sp~~ 298 (498) T protein:vir:44 223 ---PFDYIGL-PFNDTASVNSMATEMNDSSGRWSYVRQLYGHVYTAKTGTLSELVAAGDQFNLQHITLAGYEKDTQTPAD 298 (498) T ss_pred ---CccEEEE-eecCHHHHHHHHHHHhhhhcchHHHhhcCeEEEEeccCCHHHHHHhhhccCCceEEEEecCCCCCCHHH Confidence 4433332 22345566677776653 2334444444444444433333333344445444443222222 Q ss_pred --HHHHHHHHH---hcCcCcCCceeeecccccCccccc----cCCHHHHHHHHhCCCeEEEEEeeccceeeEEEcCEEe- Q lcl|NC_019451. 297 --EQCPSEILA---ATNYDEPGASQNYMYYQFPGRNIT----VSDDTVANTVDKSRGNYIGVTQANGQQLAFYQRGILC- 366 (504) Q Consensus 297 --~aa~~g~~a---s~nf~~~~g~~T~kfk~l~Gv~a~----~lt~t~~~al~~~~~n~y~~~~~~~~~~~~~~~G~~~- 366 (504) +++++++++ ..|+.+ ++.=-.|+||.|. 
.++.+|.+.|..+|+..+.+- .| ...+.+.+++ T Consensus 299 ~~AAa~a~~aA~~l~~DPAr-----PL~tl~L~Gi~~p~~~~r~~~~ern~LL~~Gist~~V~--~G--~V~I~R~ITTY 369 (498) T protein:vir:44 299 ELAASRTARAAVFIRNDPAR-----PTQTGELVDMLPAPKGKRFTTTEQQTLLSHGVATAYVE--SG--VLRIQRDITTY 369 (498) T ss_pred HHHHHHHHHHHHHhhccccc-----ccCceeecccccCCchhcCChHHHHHHHhcCcceEEEc--CC--eEEEEeeeeee Confidence 234444444 334433 3333457888754 468999999999999999642 33 2444566553 Q ss_pred ----CCccccchhh--hhhhHHHHHHHHHHHHHHHHhcCCCCCcCH----HHH-----HHHHHHHHHHHHHHHhcCcccc Q lcl|NC_019451. 367 ----GGPTDAVDMN--VYANEIWLKSAIAQALLDLFLNVNAVPASS----TGE-----AMTLAVLQPVLDKATANGTFTY 431 (504) Q Consensus 367 ----~G~y~~~wiD--~~~~~dwl~~~lq~~l~~l~~~~~kIPyt~----~G~-----~~l~~~v~~vl~~a~~nG~Ia~ 431 (504) .|.-|..|.| .++-.+++...++..+-.-|-. .|+-=++ .|. ..|++.+-..+++....|+|.- T Consensus 370 ~~n~~G~~D~syLDi~T~~tl~yvr~~~r~~i~~kfpR-~KLa~d~~~~~~gq~IvTp~~ir~eli~~y~~le~~givEn 448 (498) T protein:vir:44 370 RKNAYGVADNSYLDSETLHTSAYVLRRLKSVITSKYGR-HKLANDGTRFGSGQAIVTPAVIRGELGSTYRQMEREGIVEN 448 (498) T ss_pred eecCCCCcchhhhhhhhHHHHHHHHHHHHHHhhhhcCC-cccccCCcccCCCcccccHHHHHHHHHHHHHhhhhhccccC Confidence 5666666655 5777888888888888665532 2322111 222 4788899999999999999862 Q ss_pred ccccCcccceeeccccCCcccccceeecceEEEecchHhCCHHHHhhcccCceEEEEEECCe Q lcl|NC_019451. 
432 GKEISAVQQQYITQVTGDRRAWRQVQTLGYWINITFSSYTNSNTGLTEWKANYTLIYSKGDA 493 (504) Q Consensus 432 G~~~~~~q~~~i~~~~g~~~~~~~~~~~GY~v~~~~~~~~s~~dr~~R~~~~i~~~~~~aGA 493 (504) -+..|+.+..+-..+++.+ .-+.+|+ +.--+-|---..-.+.+.|..+.| T Consensus 449 ----~~~~~~~LiVerd~~dpnR------ln~~~p~--d~vn~L~V~A~~~~f~lq~~~~~~ 498 (498) T protein:vir:44 449 ----FDLFQQHLIVERNANDSNR------LDVLFPP--DYVNQLRVFAVLNQFRLQYSEEAA 498 (498) T ss_pred ----hhhhcceeEEEECCCCCcE------EEEEecc--cccCchhhhhhhhhhhhhhhhhcC Confidence 2333333333332222211 1222221 111111211122234444555555 No 18 >protein:vir:489 Length: 498 # NCBI annotation: putative sheath protein # Family: family:all:369 # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543096;swissprot:trembl:q8w623;genbank:gi:18249908;uniprot:Q8W623;genbank:GeneID:929698 Probab=98.97 E-value=8.9e-09 Score=64.73 Aligned_cols=446 Identities=13% Similarity=0.054 Sum_probs=226.4 Q ss_pred CCCccceEE-Eeeeeccccccc-ccccceEEEeccc---ccCc-cceEEecCHHHHHHhcCCCcHHHHHHHHHhccCCCC Q lcl|NC_019451. 1 MISQSRYIR-IISGVGAGAPVA-GRKLILRVMTTNN---VIPP-GIVIEFDNANAVLSYFGAQSEEYQRAAAYFKFISKS 74 (504) Q Consensus 1 mip~s~iv~-V~~~v~~~~~~~-~~~~~~l~l~~~~---~~~~-~r~~~y~s~~~V~~~Fg~~s~ey~aA~~yF~~~~~~ 74 (504) =||-|--|= +-+.+....+.. ..-.-.|+++... -.++ ..+|.+ |.++..+.||..|--..|++.|...-| T Consensus 7 ~IP~~iRvP~~y~E~dns~A~~~~~~qrvLiiGq~la~gt~~~~~~v~v~-s~~~a~~~fG~GS~l~~M~~a~~~~n~-- 83 (498) T protein:vir:48 7 AVPSDTLVPLFYAEMDNSAANTAVTSAPALLIGHASNDAAIEVNSLVLMP-SADYARQICGAGSQLARMVDVYRQTDP-- 83 (498) T ss_pred ccCcccccceEEEEEecCCCccccCCcceEEEeecCccccccccceEEec-CHHHHHHhcCcccHHHHHHHHHHHhCC-- Confidence 344442211 112221111111 0011245555432 1233 445665 666789999999999999999988753 Q ss_pred CcccceEEEEeeeccCCcceeeecccchhhHhHhhccCceEEEEEcccceeeeeeccccccchHHHHHHHHhhhhccccc Q lcl|NC_019451. 75 VNSPSSISFARWVNTAIAPMVVGDNLPKTIADFAGFSAGVLTIMVGAAEQNITAIDTSAATSMDNVASIIQTEIRKNADP 154 (504) Q Consensus 75 ~~~P~~l~igr~~~~a~~~~l~g~~~~~~~~~~~~~~~g~~titi~g~~~~~~~i~~s~ats~~~vA~~i~~~i~a~~~~ 154 (504) =..||+=..... . 
+.--.|+++-. -.+-.+|.+.+.|+|.... +......+-+.+|+.+.+++++.... T Consensus 84 ---~~~l~~i~~~D~-a-g~aA~g~it~t---g~at~~G~l~l~Igg~~v~---v~V~~gdTaa~vA~al~aai~a~~~l 152 (498) T protein:vir:48 84 ---FGELYVIAVPEA-R-GAAATVRVTVT---GEAEESGTLSLYVGRSSVQ---VPVVNGDDATAVATAIKEAVNGVITL 152 (498) T ss_pred ---CceeEEEeeCCc-c-cceeEEEEEec---ccccCCceEEEEECCEEEE---EeecCCCCHHHHHHHHHHHHhCCCCc Confidence 445666555432 1 11112233211 1223579999999998765 34556668889999999999886543 Q ss_pred ccceeEEEEecccceeEEecccCcc---eeEEEE---eeccchhhhhhhhcccCcceeecccccccHHHHHHHHHhhccc Q lcl|NC_019451. 155 QLAQATVTWNQNTNQFTLVGATIGT---GVLAVA---KSADPQDMSTALGWSTSNVVNVAGQAADLPDAAVAKSTNVSNN 228 (504) Q Consensus 155 ~~~~a~vt~d~~~~~F~its~t~ga---~s~~~~---~sa~~~~ia~~l~~t~~~~~~~~g~aaet~~~al~~~~~~~~~ 228 (504) .|+-........+|..-.|. ...... ....+..+-..+.++.. ....|....++.++|+++.+. T Consensus 153 -----PVTA~~~~~~VtlTAr~kG~~GN~I~l~~~~~~~~~ge~~p~Glt~~it--amsgGag~PDia~aLaal~~~--- 222 (498) T protein:vir:48 153 -----PFAASSDAGVVTLTARHKGLYGNELPVCLNYYGSGGGEILPAGLQVVTE--AGTAGSGAPDLTAAVAAMGDE--- 222 (498) T ss_pred -----ceEEEecCcEEEEEeeecccccccceeeeeeccCcccccccceeeEEEE--cccCCccCcchHHHHHhhccC--- Confidence 23222233444454432332 111111 00001111111211111 112233444677888877654 Q ss_pred eeEEEEEeccCCHHHHHHHHHHHhh-------cCCcEEEEEeccccchhHHHHHHhhhcceeEEEEeccCCCc---c-HH Q lcl|NC_019451. 229 FGSFLFAGAPLDNDQIKAVSAWNAA-------QNNQFIYTVATSLANLGTLFTLVNGNAGTALNVLSATAAND---F-VE 297 (504) Q Consensus 229 wy~~~~~~~~~~~~~~~a~A~w~e~-------~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~-~~ 297 (504) ||.++.. ...+.+.+.++-.+.+. -+.++..++.-...+.....+.-...+....++..+..... 
+ .+ T Consensus 223 ~~~~I~~-p~~D~asl~al~~~L~~~sgRw~~~~q~~g~~~~a~~gT~~~l~t~g~~~N~~~it~~~~~~~~~~p~~~~A 301 (498) T protein:vir:48 223 AFDFIGL-PFNDAASINMMMTEMNDSSGRWSYARQLYGHVYTAKLGTLSELVNAGDMHNQQHITLAGYEKETQSPVDELV 301 (498) T ss_pred CccEEEE-eecCHHHHHHHHHHHhhhhhhhhHHhhcCeEEEEeccCCHHHHHHhhhccCCceEEEEecCCCCCChHHHHH Confidence 4444332 22345566677777653 23444444444443433333333333344444444332222 2 23 Q ss_pred HHHHHHHH---hcCcCcCCceeeecccccCccccc----cCCHHHHHHHHhCCCeEEEEEeeccceeeEEEcCEEe---- Q lcl|NC_019451. 298 QCPSEILA---ATNYDEPGASQNYMYYQFPGRNIT----VSDDTVANTVDKSRGNYIGVTQANGQQLAFYQRGILC---- 366 (504) Q Consensus 298 aa~~g~~a---s~nf~~~~g~~T~kfk~l~Gv~a~----~lt~t~~~al~~~~~n~y~~~~~~~~~~~~~~~G~~~---- 366 (504) ++++++++ ..|+.+. +.=-.|+||.|. .++.+|.+.|..+|+..+.+ . .| ...+.+.+++ T Consensus 302 Aa~a~~aA~~l~~DPArP-----Lqtl~L~Gi~~p~~~~r~~~~ern~LL~~Gist~~V-~-~G--~V~I~R~ITTY~~n 372 (498) T protein:vir:48 302 ASRLAREAVFIRNDPARP-----TQTGELVGMLPAPKGKRFIMTEQQTLLSHGVATAYV-E-GG--TLRIQRSVTTYKKN 372 (498) T ss_pred HHHHHHHHHhhhcccccc-----ccceeeeccccCCchhcCChHHHHHHHhcCcceEEE-c-CC--eEEEEeeeeeeeec Confidence 44444444 3444332 333356787754 35789999999999999965 2 22 2344555543 Q ss_pred -CCccccchhh--hhhhHHHHHHHHHHHHHHHHhcCCCCCcCHHH---------HHHHHHHHHHHHHHHHhcCccccccc Q lcl|NC_019451. 367 -GGPTDAVDMN--VYANEIWLKSAIAQALLDLFLNVNAVPASSTG---------EAMTLAVLQPVLDKATANGTFTYGKE 434 (504) Q Consensus 367 -~G~y~~~wiD--~~~~~dwl~~~lq~~l~~l~~~~~kIPyt~~G---------~~~l~~~v~~vl~~a~~nG~Ia~G~~ 434 (504) .|.-|..|.| .++-.+++...++..+-.-|-. 
.|+--+..+ -.+|++.+-..+++....|++.- T Consensus 373 ~~G~~D~syLDi~T~~tl~yvr~~~r~~i~~kfpR-~KLa~dg~~~~~gq~IvTp~~ir~eli~~y~~le~~given--- 448 (498) T protein:vir:48 373 AYGVADNSYLDSETLHTSAYVLRKLKSVITSKYGR-HKLANDGTRFGPGQAIVTPAVIKGELLATYRQMERAGIVEN--- 448 (498) T ss_pred CCCCcchhhhhhhhHHHHHHHHHHHHHHhhhhcCC-ceecccCcccCCCCcccchHHHHHHHHHHHHhhhhhccccC--- Confidence 5666666655 5677788888888877765532 233322221 25788899999999999999862 Q ss_pred cCcccceeeccccCCcccccceeecceEEEecchHhCCHHHHhhcccCceEEEEEECCe Q lcl|NC_019451. 435 ISAVQQQYITQVTGDRRAWRQVQTLGYWINITFSSYTNSNTGLTEWKANYTLIYSKGDA 493 (504) Q Consensus 435 ~~~~q~~~i~~~~g~~~~~~~~~~~GY~v~~~~~~~~s~~dr~~R~~~~i~~~~~~aGA 493 (504) -+..|+.+..|-..+++.+ .-+.+|+ +.--+-|---..-.+.+.|..++| T Consensus 449 -~~~~~~~LiVerd~~dpnR------ln~~~p~--d~vn~L~V~A~~~~f~lq~~~~~~ 498 (498) T protein:vir:48 449 -YDLFKQYLIVERDADNPNR------LNTLFPP--DYVNQLRVFAVVNQFRLQYSEESA 498 (498) T ss_pred -hhhhcceeEEEECCCCCcE------EEEEecc--cccCchhhhhhhhhhhhhhhhcCC Confidence 3333333333333332211 1222321 111111222222234455666666 No 19 >protein:vir:95741 Length: 587 # NCBI annotation: ORF011 # Family: family:all:2449 # MgeID: mge:1577 # MgeName: G1 # Cross-refs: genbank:acc:YP_240910;genbank:gi:66394960;genbank:GeneID:5132682 Probab=98.85 E-value=2.8e-08 Score=62.01 Aligned_cols=444 Identities=10% Similarity=0.058 Sum_probs=217.8 Q ss_pred CCCccc---eEEEeee-ecccccccccccceEEEecccccCccceEEecCHHHHHHhcCCCcHHHHHHHHHhccCCCCCc Q lcl|NC_019451. 1 MISQSR---YIRIISG-VGAGAPVAGRKLILRVMTTNNVIPPGIVIEFDNANAVLSYFGAQSEEYQRAAAYFKFISKSVN 76 (504) Q Consensus 1 mip~s~---iv~V~~~-v~~~~~~~~~~~~~l~l~~~~~~~~~r~~~y~s~~~V~~~Fg~~s~ey~aA~~yF~~~~~~~~ 76 (504) +=|+++ ++++..+ +.+.++. ...++.|++....=|++++..+++.++..+-||.. +--.+....|.+++ .. 
T Consensus 8 ~~~~~~pgv~~~~~~~~~~~~~~~--~~~~~~~~g~a~~G~~~~~~~~~~~~~~~~~~~~g-~l~~~~~~a~~~~~--~~ 82 (587) T protein:vir:95 8 RRPITRPHASIEVDTSGIGGSAGS--SEKVFCLIGQAEGGEPNTVYELRNYSQAKRLFRSG-ELLDAIELAWGSNP--NY 82 (587) T ss_pred CcccccCceEEEEecCCccccCCC--CCceEEEEEEeCCCCCceeEEeccHHHHHHHhcCc-chHHHHHHHhcccc--CC Confidence 333333 3444433 3334333 46888888887776788889999999999999885 35567788888876 35 Q ss_pred ccceEEEEeeeccCCcceeeecccc--h------------hhHhHhhccCce----------------------EEEEEc Q lcl|NC_019451. 77 SPSSISFARWVNTAIAPMVVGDNLP--K------------TIADFAGFSAGV----------------------LTIMVG 120 (504) Q Consensus 77 ~P~~l~igr~~~~a~~~~l~g~~~~--~------------~~~~~~~~~~g~----------------------~titi~ 120 (504) .++++|+.|. ..+.++.+.=+.+. + .+. +....++ ++|... T Consensus 83 g~~~~~~~rv-~~~~~a~~~~~~l~~~a~~~G~~gN~i~v~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~g~v~si~y~ 159 (587) T protein:vir:95 83 TAGRILAMRI-EDAKPASAEIGGLKITSKIYGNVANNIQVGLE--KNTLSDSLRLRVIFQDDRFNEVYDNIGNIFTIKYK 159 (587) T ss_pred CceEEEEEEc-CCCceeEEEecCeEEEEecccccccceEEEEe--cCCCCCceeEEEEEecccceeeeeeccceeeeeee Confidence 7799999997 44444443222121 1 000 0001111 112111 Q ss_pred ccc-------------------------eeeeeeccccccchHHHHHHHHhhhhcccc--cc-----cceeEEEE-eccc Q lcl|NC_019451. 121 AAE-------------------------QNITAIDTSAATSMDNVASIIQTEIRKNAD--PQ-----LAQATVTW-NQNT 167 (504) Q Consensus 121 g~~-------------------------~~~~~i~~s~ats~~~vA~~i~~~i~a~~~--~~-----~~~a~vt~-d~~~ 167 (504) |.. +.+....+... -...+..+...++..+. +. .....+.+ +.. T Consensus 160 g~~~~~~~~v~~~~~t~~a~~~~l~~g~~~v~~yrL~~g--~~~~~~~~~~~in~~~~~tAky~g~~~~~i~~~~~~~~- 236 (587) T protein:vir:95 160 GEEANATFSVEHDEETQKASRLVLKVGDQEVKSYDLTGG--AYDYTNAIITDINQLPDFEAKLSPFGDKNLESSKLDKI- 236 (587) T ss_pred ccccccceeeeecccceeeeeeeeecCCceEEEEEecCC--chHHHHHHHHhhccccceEEEEecccCceeEEeecCcc- Confidence 211 11111111110 00111122212211100 00 00011111 111 Q ss_pred ceeEEecc-------------------------cCcceeEEEEeec---cc-hhhhhhhhcc----cCcceeecccc--- Q lcl|NC_019451. 
168 NQFTLVGA-------------------------TIGTGVLAVAKSA---DP-QDMSTALGWS----TSNVVNVAGQA--- 211 (504) Q Consensus 168 ~~F~its~-------------------------t~ga~s~~~~~sa---~~-~~ia~~l~~t----~~~~~~~~g~a--- 211 (504) ..|.++.. ..+.......... .. .......+.. ........|.+ T Consensus 237 ~~~~v~~~~~~v~a~~~d~~~~~~~~~~v~~~~~~g~~~~~~~~~~~~~~~~a~~~~~~~~~~~a~~~~t~LtGG~dG~~ 316 (587) T protein:vir:95 237 ENANIKDKAVYVKAVFGDLEKQTAYNGIVSFEQLNAEGEVPSNVEVEAGEESATVTATSPIKTIEPFELTKLKGGTNGEP 316 (587) T ss_pred cccceehhhhhhhhhhcceeeeeeceeeeeeecccccceeccchhhhhcccchheeccccccceeccceeeeecCCCCCC Confidence 11111110 0000000000000 00 0000000000 00111222332 Q ss_pred cccHHHHHHHHHhhccceeEEEEEeccCCHHHHHHHHHHHhhc---CCcEEEEEeccccchhHHHHHHhhh-cceeEEEE Q lcl|NC_019451. 212 ADLPDAAVAKSTNVSNNFGSFLFAGAPLDNDQIKAVSAWNAAQ---NNQFIYTVATSLANLGTLFTLVNGN-AGTALNVL 287 (504) Q Consensus 212 aet~~~al~~~~~~~~~wy~~~~~~~~~~~~~~~a~A~w~e~~---~~~~~~~~~~~d~~~~~~~~~~~~~-~~~~~~~~ 287 (504) .++..++|+++..+ +|+.+.. +. .+.+.+.++.+|++.. .+++..++............+.... ...+...+ T Consensus 317 ~~~y~~~l~ale~~--~~~~i~~-~t-~d~~v~a~l~a~vk~~~~~g~~~~aVvg~~~~~~~~~~~~~a~~~n~ervi~v 392 (587) T protein:vir:95 317 PATWADKLDKFAHE--GGYYIVP-LS-SKQSVHAEVASFVKERSDAGEPMRAIVGGGFNESKEQLFGRQESLSNPRVSLV 392 (587) T ss_pred cccHHHHHHHHHhC--CcEEEEe-cC-CCHHHHHHHHHHHHHHHhCCCcEEEEEcCCCCCCHHHHHHHHhhcCCCcEEEe Confidence 34678899998764 5665543 22 2445567799998654 2334444332222211222222222 22222222 Q ss_pred ecc------C--CCccH----HHHHHHHHHhcCcCcCCceeeecccccC--ccccccCCHHHHHHHHhCCCeEEEEEeec Q lcl|NC_019451. 288 SAT------A--ANDFV----EQCPSEILAATNYDEPGASQNYMYYQFP--GRNITVSDDTVANTVDKSRGNYIGVTQAN 353 (504) Q Consensus 288 ~~~------~--~~~~~----~aa~~g~~as~nf~~~~g~~T~kfk~l~--Gv~a~~lt~t~~~al~~~~~n~y~~~~~~ 353 (504) +.. . ...+| ++.+.|..+..+.... +| ||.++ ++. 
..++.+|++.+..+|++......++ T Consensus 393 ~~~~~~~~~dg~~~~~~~~~~aa~vAGl~Ag~~~~~S---lT--~~~i~~~~v~-~~~t~~e~e~ai~~Gvl~l~~~~~~ 466 (587) T protein:vir:95 393 ANSGTFVMDDGRKNHVPAYMVAVALGGLASGLEIGES---IT--FKPLRVSSLD-QIYESIDLDELNENGIISIEFVRNR 466 (587) T ss_pred cccceEecCCCceeeechHHHHHHHHHHHhcCchhcC---cc--ceeeeccccc-ccCCHHHHHHHHhCCeEEEEEecCC Confidence 111 0 01133 3455577777665433 33 34444 444 3789999999999999988655444 Q ss_pred cceeeEEEcCEEe---CCccccchhhhhhhHHHHHHHHHHHHHHHHhcCCCCCcCHHHHHHHHHHHHHHHHHHHhcCccc Q lcl|NC_019451. 354 GQQLAFYQRGILC---GGPTDAVDMNVYANEIWLKSAIAQALLDLFLNVNAVPASSTGEAMTLAVLQPVLDKATANGTFT 430 (504) Q Consensus 354 ~~~~~~~~~G~~~---~G~y~~~wiD~~~~~dwl~~~lq~~l~~l~~~~~kIPyt~~G~~~l~~~v~~vl~~a~~nG~Ia 430 (504) ....-...+|..+ +....|.-|-+++-.|.+.+.++..+-+.|.- | |=++.|...|++.++..|++-.+.|.|. T Consensus 467 ~~~~vriv~~itT~t~~~d~~~~~i~viRv~D~i~~dir~~~~~~~iG--k-~nn~~~r~~v~~~i~~~L~~l~~~gaI~ 543 (587) T protein:vir:95 467 TNTFFRIVDDVTTFNDKSDPVKAEMAVGEANDFLVSELKVQLEDQFIG--T-RTINTSASIIKDFIQSYLGRKKRDNEIQ 543 (587) T ss_pred cceEEEEeecceeccCCCCcchhhhhhhhhHHHHHHHHHHHHHhhCCc--c-ccchHHHHHHHHHHHHHHHHHHhCCccc Confidence 3322222345444 22224555888999999999998887766644 4 5678999999999999999999999996 Q ss_pred cccccCcccceeeccccCCcccccceeecceEEEecchHhCCHHHHhhcccCceEEEEEECCeEEEEEeeeeeC Q lcl|NC_019451. 431 YGKEISAVQQQYITQVTGDRRAWRQVQTLGYWINITFSSYTNSNTGLTEWKANYTLIYSKGDAIRFVEGSDVMI 504 (504) Q Consensus 431 ~G~~~~~~q~~~i~~~~g~~~~~~~~~~~GY~v~~~~~~~~s~~dr~~R~~~~i~~~~~~aGAIh~v~i~~~~v 504 (504) -. ...+. .+. 
..+|+ .-++|.+...-++++|-++.+.- T Consensus 544 ~~-~~~dv-------------------------~v~-----~~~d~-----~~v~~~v~Pv~~mekI~vt~~~~ 581 (587) T protein:vir:95 544 DF-PAEDV-------------------------QVI-----VEGNE-----ARISMTVYPIRSFKKISVSLVYK 581 (587) T ss_pred CC-Cccce-------------------------EEE-----ecCCE-----EEEEEEEEEcccceEEEEEEEEe Confidence 32 11111 010 01111 23566777777788877777765 No 20 >protein:vir:78986 Length: 436 # NCBI annotation: putative sheath tail protein # Family: family:all:632 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110731;genbank:gi:134287348;genbank:GeneID:4955160 Probab=98.78 E-value=5e-08 Score=60.61 Aligned_cols=400 Identities=12% Similarity=0.072 Sum_probs=202.0 Q ss_pred CCCccc-----eEEEeeeeccc-ccccccccceEEEecccccCccceEEecC---HHHHHHhcCCCcH--HHHHHHHHhc Q lcl|NC_019451. 1 MISQSR-----YIRIISGVGAG-APVAGRKLILRVMTTNNVIPPGIVIEFDN---ANAVLSYFGAQSE--EYQRAAAYFK 69 (504) Q Consensus 1 mip~s~-----iv~V~~~v~~~-~~~~~~~~~~l~l~~~~~~~~~r~~~y~s---~~~V~~~Fg~~s~--ey~aA~~yF~ 69 (504) -.-.+| ++|+...-... +...++. ..+.+..+= =|++.+..-++ ..++...||.+-. ..+..+..|. T Consensus 8 ~~~~~K~~PG~Y~n~~~~~~~~~~~~~rGi-~a~p~~~~w-Gp~~~v~~i~~~~~~~~~~~~~G~~~~~~~~~~l~~~~~ 85 (436) T protein:vir:78 8 FVTQNKVLPGSYINFVSATRATSSLSDRGI-VAMPLELDW-GIDEEVFQVTSDDFEKYSTKYFGYDYTHEKLKGLRDLFK 85 (436) T ss_pred eccceeecCceEEEEEecCcceeeccCCeE-EEEEEEecC-CCCceeEEeecccchHHHHHHhcCccchHHHHHHHHHhc Confidence 112233 44544322222 2222222 222233332 24444444444 3467778998643 4456777886 Q ss_pred cCCCCCcccceEEEEeeeccCCcceeeecccchhhHhHhhccCceEEEEEcccceeeeeeccccccchHHHHHHHHhhhh Q lcl|NC_019451. 70 FISKSVNSPSSISFARWVNTAIAPMVVGDNLPKTIADFAGFSAGVLTIMVGAAEQNITAIDTSAATSMDNVASIIQTEIR 149 (504) Q Consensus 70 ~~~~~~~~P~~l~igr~~~~a~~~~l~g~~~~~~~~~~~~~~~g~~titi~g~~~~~~~i~~s~ats~~~vA~~i~~~i~ 149 (504) .|++||+-|...-..++. ++ .+..........|+|+|.-.....+..|+.....-..+ ..... 
T Consensus 86 -------~~~tv~~yrl~~G~~a~~----~v--~~Aky~g~~gn~i~v~v~~~~~d~~~~dv~~~~g~~~~----d~~~~ 148 (436) T protein:vir:78 86 -------NIRLGYFYKLNKGVKASC----SI--ATARCSGIRGNDLKVIVTTNIDDNAKFDVVTLLDNKKV----DTQIA 148 (436) T ss_pred -------CCCEEEEEECCCcceeee----ee--eeeecCCCCCcEEEEEecccccccCceEEEEEecchhh----hhhhH Confidence 689999999864211111 11 12233333334788888654433333333332221111 11111 Q ss_pred cccccccceeEEEEecccceeEEecccCcceeEEEEeeccchhhhhhhhcccCcceeecccccccHHHHHHHHHhhccce Q lcl|NC_019451. 150 KNADPQLAQATVTWNQNTNQFTLVGATIGTGVLAVAKSADPQDMSTALGWSTSNVVNVAGQAADLPDAAVAKSTNVSNNF 229 (504) Q Consensus 150 a~~~~~~~~a~vt~d~~~~~F~its~t~ga~s~~~~~sa~~~~ia~~l~~t~~~~~~~~g~aaet~~~al~~~~~~~~~w 229 (504) +..........|.+.. .+.-..+..+ .+.|-+.+ .....++..++|+++.... | T Consensus 149 ~~~~~l~~n~~V~~~~-----------~g~la~~a~~--------~LtGG~dG-----~~~T~~dy~~al~~le~~~--f 202 (436) T protein:vir:78 149 KVITELQDNDYVTWKK-----------EATLEATAGL--------TFTNGTNG-----EAVTGTEYQAFLDKIESYS--F 202 (436) T ss_pred HHHhhccCCceEEEEe-----------ccccccccee--------eeeccccc-----cccchHHHHHHHHHHcccc--e Confidence 1111001112333221 0100000000 00010111 1123467889999997664 5 Q ss_pred eEEEEEeccCCHHHHHHHHHHHhhc----CCcEEEEEecc-ccchhHHHHHHhhhcceeEEEEeccCCCcc----HHHHH Q lcl|NC_019451. 230 GSFLFAGAPLDNDQIKAVSAWNAAQ----NNQFIYTVATS-LANLGTLFTLVNGNAGTALNVLSATAANDF----VEQCP 300 (504) Q Consensus 230 y~~~~~~~~~~~~~~~a~A~w~e~~----~~~~~~~~~~~-d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~aa~ 300 (504) ..+.+. ..+.+.+..+++|+... ++++..+.... .++.+. 
+ .++........| ..+.+ T Consensus 203 n~l~~~--~~d~~~~~~~~a~ikr~re~~g~~~~aV~~~~~~~d~Eg-I----------Inv~n~v~g~~~~~~~~~a~v 269 (436) T protein:vir:78 203 NALGCL--ATTAEIKSLFVEFTKRMRDKVGAKFQTVLYKKNDADYEG-V----------VSVENKIKDTGLLESSLIYWT 269 (436) T ss_pred eEEEec--CCChHHHHHHHHHHHHHHhhcCCeEEEEecCCCCCCCce-E----------EEeecccCCceechhHHHHHH Confidence 545443 34677888999999844 34455554332 222211 0 011111122222 33344 Q ss_pred HHHHHhcCcCcCCceeeecccccCccc-c-ccCCHHHHHHHHhCCCeEEEEEeeccceeeEEEcCEEe----C---Cccc Q lcl|NC_019451. 301 SEILAATNYDEPGASQNYMYYQFPGRN-I-TVSDDTVANTVDKSRGNYIGVTQANGQQLAFYQRGILC----G---GPTD 371 (504) Q Consensus 301 ~g~~as~nf~~~~g~~T~kfk~l~Gv~-a-~~lt~t~~~al~~~~~n~y~~~~~~~~~~~~~~~G~~~----~---G~y~ 371 (504) .|..++...+ .+ .-||.++|+. . ..++.+|++.+.++|..... .. ++. ..+.+|+.+ + ++ + T Consensus 270 AG~~Ag~~~~---~S--~T~~~~~~~~~v~~~~t~~e~~~ai~~G~lvl~-~d--~~~-v~I~~~VNTltt~~~~k~~-~ 339 (436) T protein:vir:78 270 TGAIAGCDIN---KS--NTNKRYDGEFDVDVNYTQIHLEEALKTGKFIFH-KV--GDE-VHVLEDINTFVSFTDEKND-D 339 (436) T ss_pred HHHHhcCccc---cC--ccceecCccccccccCCHHHHHHHHhCCeEEEE-Ee--CCe-EEEEEccccceecCCCCCc-c Confidence 4555555433 33 3477888873 4 46899999999999987663 22 222 345666644 1 33 4 Q ss_pred cchhhhhhhHHHHHHHHHHHHHHHHhcCCCCCcCHHHHHHHHHHHHHHHHHHHhcCccccccccCcccceeeccccCCcc Q lcl|NC_019451. 372 AVDMNVYANEIWLKSAIAQALLDLFLNVNAVPASSTGEAMTLAVLQPVLDKATANGTFTYGKEISAVQQQYITQVTGDRR 451 (504) Q Consensus 372 ~~wiD~~~~~dwl~~~lq~~l~~l~~~~~kIPyt~~G~~~l~~~v~~vl~~a~~nG~Ia~G~~~~~~q~~~i~~~~g~~~ 451 (504) |.=|-+++..|-+.+.++..+-+.+. 
+|+|=+..|..++.+.+..-|++-.+.|.|.+- +..+ T Consensus 340 ~~kI~vir~~D~i~~di~~~~~~~yi--GKv~N~~dgr~~l~~~i~~yl~~L~~~g~I~~f-~~~D-------------- 402 (436) T protein:vir:78 340 FSSNQSVRVLDQIANDIATLFNTKYL--GEVPNDKSGRISFWNDVVKHHEQLQNMRAIEDF-KADD-------------- 402 (436) T ss_pred hhhhhHHHHHHHHHHHHHHHhhhccc--cccCCCHHHHHHHHHHHHHHHHHHHhCCcccCC-CCcc-------------- Confidence 55577777788777777665544443 699999999999999999999999999999741 1111 Q ss_pred cccceeecceEEEecchHhCCHHHHhhcccCceEEEEEECCeEEEEEeeeeeC Q lcl|NC_019451. 452 AWRQVQTLGYWINITFSSYTNSNTGLTEWKANYTLIYSKGDAIRFVEGSDVMI 504 (504) Q Consensus 452 ~~~~~~~~GY~v~~~~~~~~s~~dr~~R~~~~i~~~~~~aGAIh~v~i~~~~v 504 (504) +.+. +. ..+.+.-+++..+.=.||.++.++..+= T Consensus 403 -----------v~v~------~~--~~~~~v~v~~~v~pvdamekiy~ti~v~ 436 (436) T protein:vir:78 403 -----------VSVE------PG--SDKKTVVVSDAVKVISAMSKLYMTVSVS 436 (436) T ss_pred -----------eEEe------ec--CCCCEEEEEEEEEEEEeeeeEEEEEEEC Confidence 0110 00 0122223566666667777777665544 No 21 >protein:vir:1996 Length: 495 # NCBI annotation: major tail subunit # Family: family:all:369 # MgeID: mge:320 # MgeName: Mu # Cross-refs: genbank:acc:NP_050643;genbank:gi:9633530;genbank:GeneID:2636269 Probab=98.69 E-value=1.1e-07 Score=58.78 Aligned_cols=438 Identities=11% Similarity=0.061 Sum_probs=216.2 Q ss_pred CCCccc-----eEEEeeeecccccccccccceEEEecc---cccCc-cceEEecCHHHHHHhcCCCcHHHHHHHHHhccC Q lcl|NC_019451. 1 MISQSR-----YIRIISGVGAGAPVAGRKLILRVMTTN---NVIPP-GIVIEFDNANAVLSYFGAQSEEYQRAAAYFKFI 71 (504) Q Consensus 1 mip~s~-----iv~V~~~v~~~~~~~~~~~~~l~l~~~---~~~~~-~r~~~y~s~~~V~~~Fg~~s~ey~aA~~yF~~~ 71 (504) =||-|- ++.++.+..-. +.+.+---.|+++.. 
.-.++ ..+|.+ |.++..+.||..|--..|++.|..-- T Consensus 8 ~IP~~iRvP~~y~E~dns~A~~-g~~~~~q~vLiiGq~la~gs~~~~~pv~v~-s~~~a~~~fG~GS~la~M~~a~~~~n 85 (495) T protein:vir:19 8 AIPSDVRVPLTYIEFDNSNAVS-GTPAPRQRVLMFGQSGSKASAAPNVPVRIR-SGSQASAAFGQGSMLALMADAFLNAN 85 (495) T ss_pred hCCcccccCeEEEEEccCCCCc-CCcCCCceEEEEEecCcccccccceeEEec-CHHHHHHhcCcCcHHHHHHHHHHHhC Confidence 344442 22223222210 112222334555542 22333 456655 66678999999999999999999864 Q ss_pred CCCCcccceEEEEeeeccCCcceeeecccchhhHhHhhccCceEEEEEcccceeeeeeccccccchHHHHHHHHhhhhcc Q lcl|NC_019451. 72 SKSVNSPSSISFARWVNTAIAPMVVGDNLPKTIADFAGFSAGVLTIMVGAAEQNITAIDTSAATSMDNVASIIQTEIRKN 151 (504) Q Consensus 72 ~~~~~~P~~l~igr~~~~a~~~~l~g~~~~~~~~~~~~~~~g~~titi~g~~~~~~~i~~s~ats~~~vA~~i~~~i~a~ 151 (504) | =..||+=..... . +.-..|+++-. -.+-.+|.+.+.|+|.... +......+-+.+|+.+.++|++. T Consensus 86 ~-----~~~l~~i~~~D~-a-G~aA~g~it~t---g~at~~G~l~l~I~g~~v~---v~V~~gdTaa~vA~al~aaina~ 152 (495) T protein:vir:19 86 R-----VAELWCIPQGNG-T-GNAAVGEISLS---GTAGENGSLVTYIAGQRLA---VSVAAGATGAALADLLVARIKGQ 152 (495) T ss_pred C-----cceEEEEeeCCh-h-hceeEEEEEEe---ecCCCCcEEEEEECCEEEE---EEecCCCCHHHHHHHHHHHhcCC Confidence 3 445666655432 1 11112233311 1223579999999998765 45567778889999999999886 Q ss_pred cccccceeEEEEe----cccceeEEecccCcceeE----EEEeeccchhhhhhhhcccCcceeecccccccHHHHHHHHH Q lcl|NC_019451. 152 ADPQLAQATVTWN----QNTNQFTLVGATIGTGVL----AVAKSADPQDMSTALGWSTSNVVNVAGQAADLPDAAVAKST 223 (504) Q Consensus 152 ~~~~~~~a~vt~d----~~~~~F~its~t~ga~s~----~~~~sa~~~~ia~~l~~t~~~~~~~~g~aaet~~~al~~~~ 223 (504) ...-. .+++.-+ +......+|..-.|+... ..+.. +..+.. |++..-.....|....++.++|+++. 
T Consensus 153 ~~lPv-TA~~~~~~~~~~a~~~VtlTAr~kG~~n~idi~~~~~~--ge~~p~--Glt~titamsgGag~PDia~alaal~ 227 (495) T protein:vir:19 153 PDLPV-TAEVRADSGDDDTHADVVLSAKFTGALSAVDVRWNYYA--GETTPY--GIITAFKAASGKNGNPDISASIAGMG 227 (495) T ss_pred ccCce-EEEeeccCCCCcCceeEEEEEeeccccccceeEEEeec--cccccc--ceeEEEEecCCCCCCcchHHHHHHhc Confidence 54321 1222111 122334444444443211 01100 001111 11111111223444456788888876 Q ss_pred hhccceeEEEEEeccCCHHHHHHHHHHHhhc----CCcEEEEEeccccchhHHHHHHhhhcceeEEEEeccCCCccHH-- Q lcl|NC_019451. 224 NVSNNFGSFLFAGAPLDNDQIKAVSAWNAAQ----NNQFIYTVATSLANLGTLFTLVNGNAGTALNVLSATAANDFVE-- 297 (504) Q Consensus 224 ~~~~~wy~~~~~~~~~~~~~~~a~A~w~e~~----~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-- 297 (504) + .||.+++.. ..+.+.+.++-.+++.- +.++..++.-...+.....+.....+....++..+. ..+.+. T Consensus 228 ~---~~~~~I~~P-~tD~asL~al~~~l~~rw~~~~q~~g~~~~a~~gT~~~l~t~g~~~N~~~it~~~~~-gsp~~~~~ 302 (495) T protein:vir:19 228 D---LQYKYIVMP-YTDEPNLNLLRTELQERWGPVNQADGFAVTVLSGTYGDISTFGVSRNDHLISCMGIA-GAPEPSYL 302 (495) T ss_pred c---CCCcEEEEe-cCcHHHHHHHHHHHHHhhhHHHhcCeEEEEeecCCHHHHHHhhhccCCceEEEEecC-CCCCcHHH Confidence 4 455555432 23445556666666542 233333333333333333222222233333333321 233222 Q ss_pred --HHHHHHHH---hcCcCcCCceeeecccccCccccc----cCCHHHHHHHHhCCCeEEEEEeeccceeeEEEcCEEe-- Q lcl|NC_019451. 298 --QCPSEILA---ATNYDEPGASQNYMYYQFPGRNIT----VSDDTVANTVDKSRGNYIGVTQANGQQLAFYQRGILC-- 366 (504) Q Consensus 298 --aa~~g~~a---s~nf~~~~g~~T~kfk~l~Gv~a~----~lt~t~~~al~~~~~n~y~~~~~~~~~~~~~~~G~~~-- 366 (504) ++++++++ ..|+.+ ++.=-.|+||.|. .++.+|.+.|..+|+..+.+-.+ . 
...+.+.+++ T Consensus 303 ~AAA~aa~~A~~l~~DPAr-----PL~tl~L~Gi~~p~~~~r~~~~ern~LL~~Gist~~V~~~--G-~V~I~R~ITTY~ 374 (495) T protein:vir:19 303 YAATLCAVASQALSIDPAR-----PLQTLTLPGRMPPAVGDRFTWSERNALLFDGISTFNVNDG--G-EMQIERMITMYR 374 (495) T ss_pred HHHHHHHHHHHHhhccccc-----ccCceeecceecCCccccCChHHHHHHHhCCcceEEECCC--C-eEEEEeeeeeee Confidence 23333322 234333 3444467888744 46899999999999998864322 1 2233444433 Q ss_pred ---CCccccchhh--hhhhHHHHHHHHHHHHHHHHhcCCCCCcCHH----H-----HHHHHHHHHHHHHHHHhcCccccc Q lcl|NC_019451. 367 ---GGPTDAVDMN--VYANEIWLKSAIAQALLDLFLNVNAVPASST----G-----EAMTLAVLQPVLDKATANGTFTYG 432 (504) Q Consensus 367 ---~G~y~~~wiD--~~~~~dwl~~~lq~~l~~l~~~~~kIPyt~~----G-----~~~l~~~v~~vl~~a~~nG~Ia~G 432 (504) .|.-|..|.| .++-.+++...++..+-.-|-.. |+--+.. | -.+|++.+-..+++....|++.- T Consensus 375 ~n~~G~~D~syLDi~T~~tl~yvr~~~r~~i~~kfpR~-KLa~d~~~~~~gq~IvTp~~ir~ell~~~~~le~~given- 452 (495) T protein:vir:19 375 TNKYGDSDPSYLNVNTIATLSYLRYSLRTRITQKFPNY-KLASDGTRFATGQAVVTPSVIKTELLALFEEWENAGLVED- 452 (495) T ss_pred ecCCCCcchhhhhhHHHHHHHHHHHHHHHHHhhhcCCc-ccccCCCCCCCcccccChHHHHHHHHHHHHhhhhhccccC- Confidence 5666666655 57778888888888887766442 3322211 1 24688889999999999999862 Q ss_pred cccCcccceeeccccCCcccccceeecceEEEecchHhCCHHHHhhcccCceEE--EEEECCeEEEEE Q lcl|NC_019451. 
433 KEISAVQQQYITQVTGDRRAWRQVQTLGYWINITFSSYTNSNTGLTEWKANYTL--IYSKGDAIRFVE 498 (504) Q Consensus 433 ~~~~~~q~~~i~~~~g~~~~~~~~~~~GY~v~~~~~~~~s~~dr~~R~~~~i~~--~~~~aGAIh~v~ 498 (504) -+..|+.+..|-..+++ .|-+=..|+--+ -.-.+|.|+.+= T Consensus 453 ---~~~~~~~LiVerd~~dp----------------------nRln~~~p~d~vn~L~V~A~~i~f~L 495 (495) T protein:vir:19 453 ---FDTFKEELYVARNKDDK----------------------DRLDVLCGPNLINQFRIFAAQVQFIL 495 (495) T ss_pred ---hhhhcceeEEEECCCCC----------------------cEEEEEecceeeCceeeeeeeeeeeC Confidence 22222222222222211 122212211100 012233333322 No 22 >protein:vir:107865 Length: 477 # NCBI annotation: gp39 # Family: family:all:115 # MgeID: mge:1565 # MgeName: BcepMu # Cross-refs: genbank:acc:YP_024712;genbank:gi:48696949;genbank:GeneID:2845953 Probab=98.60 E-value=2.1e-07 Score=57.22 Aligned_cols=422 Identities=10% Similarity=0.026 Sum_probs=194.1 Q ss_pred CCCccceEEEeeeec---ccccccccccceEEEecccccCccceEEecCHHHHHHhcC--CCcHHHHHHHHHhccCCCCC Q lcl|NC_019451. 1 MISQSRYIRIISGVG---AGAPVAGRKLILRVMTTNNVIPPGIVIEFDNANAVLSYFG--AQSEEYQRAAAYFKFISKSV 75 (504) Q Consensus 1 mip~s~iv~V~~~v~---~~~~~~~~~~~~l~l~~~~~~~~~r~~~y~s~~~V~~~Fg--~~s~ey~aA~~yF~~~~~~~ 75 (504) |--. -.=-|.+... +............|++....-|+..-..-+|..|...++| .++..+.+...||.+ T Consensus 1 M~~~-~~pGVyv~E~~~~~~~i~~v~T~v~~~VG~a~~gp~n~pv~its~~d~~~~g~~~~~~tL~~Av~~~f~n----- 74 (477) T protein:vir:10 1 MAAN-YLHGVETIEKETGSRPVKVVKSAVIGLIGTAPIGPVNTPVQSLSDVDAAQFGPQLAGFTIPQALDAVYDY----- 74 (477) T ss_pred Cccc-CCCCeEEEEccCCcccccccCCceeEEEecccCCCCCcCEEEccHHHHHHhccCCCCCcHHHHHHHHHhc----- Confidence 5421 0111222222 2223333456777888777777654344455555545433 346788899999997 Q ss_pred cccceEEEEeeeccCCcce-eeecccchhhHhHhhccCceEEEEEcccceeeee-eccccccch----HHHHHHHHhhhh Q lcl|NC_019451. 76 NSPSSISFARWVNTAIAPM-VVGDNLPKTIADFAGFSAGVLTIMVGAAEQNITA-IDTSAATSM----DNVASIIQTEIR 149 (504) Q Consensus 76 ~~P~~l~igr~~~~a~~~~-l~g~~~~~~~~~~~~~~~g~~titi~g~~~~~~~-i~~s~ats~----~~vA~~i~~~i~ 149 (504) -...+|+-|......... +.... .....+.-.....+....... ...+..... 
........... T Consensus 75 -Gg~~~~vVrV~~~~~~~~~~~~~~--------~~~~~~~~~~~~~~~~~~~~~v~~~a~~~~~~~~~~~~~~~~~~~~- 144 (477) T protein:vir:10 75 -GSGTVIVINVLDPAVHKSNAANEP--------VTFDAATGRAKLAHPAAANLVLKNDSGGTTYAEGTDYAVDLINGVI- 144 (477) T ss_pred -cceEEEEEecCccccccccccccc--------cccccccceecccccccccccccccccccccccchhhhhhhccccc- Confidence 456788888754321111 00000 000000000000000000000 000000000 00000000000 Q ss_pred cccccccceeEEEEecccceeEEecccCcc-eeEEEEeeccchh--hhhhhhcccCcceeecccccccHHHHHHHHHhhc Q lcl|NC_019451. 150 KNADPQLAQATVTWNQNTNQFTLVGATIGT-GVLAVAKSADPQD--MSTALGWSTSNVVNVAGQAADLPDAAVAKSTNVS 226 (504) Q Consensus 150 a~~~~~~~~a~vt~d~~~~~F~its~t~ga-~s~~~~~sa~~~~--ia~~l~~t~~~~~~~~g~aaet~~~al~~~~~~~ 226 (504) ..........+. ..........+.. .....+.. ......+-.+++..+.... T Consensus 145 ------------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~-------~~~~~~tGl~al~~~~~~~ 199 (477) T protein:vir:10 145 ------------------TRIKTGTIPPGATAAKATYDYADPTKVTAADIIGAV-------NAAGMRTGMKALKDTYNLY 199 (477) T ss_pred ------------------eecccccccccceeeeeccccccccccccccccccc-------cccchhhhhhhhhhhhhhc Confidence 000000000000 0000000000000 00000000 0000011112222221111 Q ss_pred cceeEEEEEeccCCH-HHHHHHHHHHhhcCCcEEEEEeccccchhHHHHHHh-------hh-cceeEEEEecc------C Q lcl|NC_019451. 227 NNFGSFLFAGAPLDN-DQIKAVSAWNAAQNNQFIYTVATSLANLGTLFTLVN-------GN-AGTALNVLSAT------A 291 (504) Q Consensus 227 ~~wy~~~~~~~~~~~-~~~~a~A~w~e~~~~~~~~~~~~~d~~~~~~~~~~~-------~~-~~~~~~~~~~~------~ 291 (504) .---..+.+.....+ +-..++...++.. ..+.++-...+........... .. ..+...+++.. . 
T Consensus 200 ~~~~~~l~apg~~~~~~v~~~l~~~~~~~-~~~~~~d~p~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~ 278 (477) T protein:vir:10 200 GYFSKILIAPAYCTQNSVSVELEAMAVQL-GAIAYIDAPIGTTLAQALAGRGPAGTINFNTSSDRVRLCYPHVKVYDTAT 278 (477) T ss_pred chhcccccccccccchhhHHHHHHHHhhC-CEEEEEecCCCCCHHHHHhhhhhccccccccccceEEEEcCeEEEecccC Confidence 000011111111111 1122333333322 2222221111111111111000 00 11112222211 0 Q ss_pred --CCc-cHHHHHHHHHHhcCcCcCCc-eeeecccccCccccc--------cCCHHHHHHHHhCCCeEEEEEeeccceeeE Q lcl|NC_019451. 292 --AND-FVEQCPSEILAATNYDEPGA-SQNYMYYQFPGRNIT--------VSDDTVANTVDKSRGNYIGVTQANGQQLAF 359 (504) Q Consensus 292 --~~~-~~~aa~~g~~as~nf~~~~g-~~T~kfk~l~Gv~a~--------~lt~t~~~al~~~~~n~y~~~~~~~~~~~~ 359 (504) ... .+.+.++|..+.+|-. .| ......|.+.||..- ..+++|.+.|.++++|.+..+.+.|. . T Consensus 279 ~~~~~~p~s~~~ag~~a~~d~~--~g~~~span~~~~gi~~~~~~~~~~~~~~~~~~~~L~~~gi~~i~~~~~~G~---~ 353 (477) T protein:vir:10 279 NAERLEPLSSRAAGLRARVDLD--KGYWWSSSNQQLVGVTGVERPLSAMIDDPQSDVNMLNEQGITTVFSSYGSGL---R 353 (477) T ss_pred CceeEEchHHHHHHHHHHhhhc--CCceeccCCceeccccccccccccccCCChhhHHHHhhCCceEEEEecCCcE---E Confidence 011 2456677888877632 23 234445566555422 23568999999999999988876543 3 Q ss_pred EEcCEEeCCc---cccchhhhhhhHHHHHHHHHHHHHHHHhcCCCCCcCHHHHHHHHHHHHHHHHHHHhcCccccccccC Q lcl|NC_019451. 360 YQRGILCGGP---TDAVDMNVYANEIWLKSAIAQALLDLFLNVNAVPASSTGEAMTLAVLQPVLDKATANGTFTYGKEIS 436 (504) Q Consensus 360 ~~~G~~~~G~---y~~~wiD~~~~~dwl~~~lq~~l~~l~~~~~kIPyt~~G~~~l~~~v~~vl~~a~~nG~Ia~G~~~~ 436 (504) ++.++++.+. ..|.+|-+.+-.+|+++.|+..+....-. |.|..=...|+..++.-|+.-++.|.|. 
T Consensus 354 ~wG~rT~~~~~~~~~~~~~~vrR~~~~i~~~~~~~~~~~v~~----~~~~~~~~~i~~~i~~~l~~l~~~g~l~------ 423 (477) T protein:vir:10 354 LWGNRTAAWPTVTHMRNFENVRRTGDVINESLRYFSQQFVDA----PIDQGLIDSLVESVNGFGRKLIGDGALL------ 423 (477) T ss_pred EEcccccCCCCCCcccceeehhhHHHHHHHHHHHHHHHhccC----CCCHHHHHHHHHHHHHHHHHHHhCCcee------ Confidence 5777877442 23667888899999999999888874332 4578888999999999999999999996 Q ss_pred cccceeeccccCCcccccceeecceEEEecchHhCCHHHHhhcccCceEEEEEECCeEEEEEeeeeeC Q lcl|NC_019451. 437 AVQQQYITQVTGDRRAWRQVQTLGYWINITFSSYTNSNTGLTEWKANYTLIYSKGDAIRFVEGSDVMI 504 (504) Q Consensus 437 ~~q~~~i~~~~g~~~~~~~~~~~GY~v~~~~~~~~s~~dr~~R~~~~i~~~~~~aGAIh~v~i~~~~v 504 (504) ||++.+. .++.|++|+.+.+. .+.+.+.-...+++|++.-... T Consensus 424 -----------------------g~~v~~~-~~~nt~~~i~~G~~-~~~i~~~p~~p~e~i~~~~~~~ 466 (477) T protein:vir:10 424 -----------------------GFKAWFD-PARNPKEELAAGHL-LINYKYTVPPPLERLTYETEIT 466 (477) T ss_pred -----------------------eeEEEEe-cCCCCHHHhhCCeE-EEEEEEEecCCcceEEEEEEEc Confidence 4788874 56888999998888 5999999999999999988877 No 23 >protein:vir:79092 Length: 477 # NCBI annotation: gp13, phage tail sheath protein # Family: family:all:115 # MgeID: mge:1862 # MgeName: phiE255 # Cross-refs: genbank:acc:YP_001111213;genbank:gi:134288804;genbank:GeneID:4960766 Probab=98.55 E-value=3e-07 Score=56.34 Aligned_cols=427 Identities=10% Similarity=0.021 Sum_probs=197.6 Q ss_pred CCC-ccceEEEee-eecccccccccccceEEEecccccCccceEEecCHHHHHHhcC--CCcHHHHHHHHHhccCCCCCc Q lcl|NC_019451. 1 MIS-QSRYIRIIS-GVGAGAPVAGRKLILRVMTTNNVIPPGIVIEFDNANAVLSYFG--AQSEEYQRAAAYFKFISKSVN 76 (504) Q Consensus 1 mip-~s~iv~V~~-~v~~~~~~~~~~~~~l~l~~~~~~~~~r~~~y~s~~~V~~~Fg--~~s~ey~aA~~yF~~~~~~~~ 76 (504) |-. 
..-=|-|.- .-.+............|++....-|......-+|..|-...|| .+..-+.+...||.+ T Consensus 1 M~~~~~pGVyv~E~~~g~~~I~~v~Tsv~~~VG~a~~~p~n~pv~its~~d~~~~g~~~~~~tL~~Av~~~f~n------ 74 (477) T protein:vir:79 1 MAANYLHGVETIEKETGSRPVKVVKSAVIGLIGTAPIGPVNTPVQSLSDVDAAQFGPQLAGFTIPQALDAVYDY------ 74 (477) T ss_pred CcCCCCCCeEEEEecCCcccccccCCceEEEEeecccCCCcccEEEccHHHHHHhcCCCCCCcHHHHHHHHhhc------ Confidence 652 111111111 1112233344557778888777777654333455555555444 346678899999987 Q ss_pred ccceEEEEeeeccCCcce---eeecccchhhHhHh--hccCceEEEEEcccceeeeeeccccccchHHHHHHHHhhhhcc Q lcl|NC_019451. 77 SPSSISFARWVNTAIAPM---VVGDNLPKTIADFA--GFSAGVLTIMVGAAEQNITAIDTSAATSMDNVASIIQTEIRKN 151 (504) Q Consensus 77 ~P~~l~igr~~~~a~~~~---l~g~~~~~~~~~~~--~~~~g~~titi~g~~~~~~~i~~s~ats~~~vA~~i~~~i~a~ 151 (504) --.++|+-|......... ...+.......... ......+.+..+..... ...........+........... T Consensus 75 gg~~~~vvrV~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~ 151 (477) T protein:vir:79 75 GSGTVIVINVLDPAVHKSNAASESVTFDAATGRAKLAHPAAANLVLKNDSGGTT---YTEGTDYAVDLINGVITRIKTGT 151 (477) T ss_pred CCceEEEEeccCCccccccccccccccccccccccccccccceeEEeecccccc---cccCccccccccchhhhhhhccc Confidence 346688888743221111 00000000000000 00001111111100000 00000000000000000000000 Q ss_pred cccccceeEEEEecccceeEEecccCcceeEEEEeeccchhhhhhhhcccCcceeecccccccHHHHHHHHHhhccceeE Q lcl|NC_019451. 152 ADPQLAQATVTWNQNTNQFTLVGATIGTGVLAVAKSADPQDMSTALGWSTSNVVNVAGQAADLPDAAVAKSTNVSNNFGS 231 (504) Q Consensus 152 ~~~~~~~a~vt~d~~~~~F~its~t~ga~s~~~~~sa~~~~ia~~l~~t~~~~~~~~g~aaet~~~al~~~~~~~~~wy~ 231 (504) .........+.++..... .. ......+. .......+..+++..+......--. 
T Consensus 152 ~~~~~~~~~~~~~~~~~~----------~~----------~~~~~~g~-------~~a~~~~tg~~al~~~~~~~~~~~~ 204 (477) T protein:vir:79 152 IPAAATAAKATYDYADPT----------KV----------TAADIIGA-------VNAAGMRTGMKALKDTYNLYGYFSK 204 (477) T ss_pred cccccceeeceeccCCcc----------cc----------eeeeeccc-------ccccccchhhhhhhhhhhhcccccc Confidence 000000001111100000 00 00000000 0000111222222222221111111 Q ss_pred EEEEecc-CCHHHHHHHHHHHhhcCCcEEEEEeccccchhHHHHHHh-------hh-cceeEEEEecc------C--CC- Q lcl|NC_019451. 232 FLFAGAP-LDNDQIKAVSAWNAAQNNQFIYTVATSLANLGTLFTLVN-------GN-AGTALNVLSAT------A--AN- 293 (504) Q Consensus 232 ~~~~~~~-~~~~~~~a~A~w~e~~~~~~~~~~~~~d~~~~~~~~~~~-------~~-~~~~~~~~~~~------~--~~- 293 (504) ++..... ....-..++...++.. ..+.++-............... .. ..+...++++. . .. T Consensus 205 iv~apg~~~~~~v~~~l~~~~~~~-~~~a~~d~p~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~~~~~~~~~~ 283 (477) T protein:vir:79 205 ILIAPAYCTQNSVSVELEAMAVQL-GAIAYIDAPIGTTLAQALAGRGPAGTINFNTSSDRVRLCYPHVKVYDIATNAERL 283 (477) T ss_pred eeeccccccchhHHHHHHHHHhhc-CeEEEEecCCCCChHHHhhhhhhccccccccccceEEEEcCeeEEecccCCceee Confidence 1111111 1112223333333322 2222211111111111111000 00 11112222110 0 01 Q ss_pred ccHHHHHHHHHHhcCcCcCCc-eeeecccccCcccc---c-----cCCHHHHHHHHhCCCeEEEEEeeccceeeEEEcCE Q lcl|NC_019451. 294 DFVEQCPSEILAATNYDEPGA-SQNYMYYQFPGRNI---T-----VSDDTVANTVDKSRGNYIGVTQANGQQLAFYQRGI 364 (504) Q Consensus 294 ~~~~aa~~g~~as~nf~~~~g-~~T~kfk~l~Gv~a---~-----~lt~t~~~al~~~~~n~y~~~~~~~~~~~~~~~G~ 364 (504) .-+.+.++|.++.+|-+ .| ......|.+.||.. + ..+++|.+.|.++|+|....+.+.|. 
.++.++ T Consensus 284 ~p~s~~~ag~~a~~d~~--~g~~~span~~~~gv~~~~~~~~~~~~~~~~~~~~L~~~~i~~i~~~~~~G~---~~wG~r 358 (477) T protein:vir:79 284 EPLSSRAAGLRARVDLD--KGYWWSSSNQQLVGVTGVERPLSAMIDDPQSDVNMLNEQGITTVFSSYGSGL---RLWGNR 358 (477) T ss_pred echHHHHHHHHHHhhcc--CCceEccCCceeecceecccccccccCCChhhHHHHhhCCceEEEEecCCcE---EEEccc Confidence 12457777888877632 33 23444566666542 1 23568999999999999988876553 357788 Q ss_pred EeCC---ccccchhhhhhhHHHHHHHHHHHHHHHHhcCCCCCcCHHHHHHHHHHHHHHHHHHHhcCccccccccCcccce Q lcl|NC_019451. 365 LCGG---PTDAVDMNVYANEIWLKSAIAQALLDLFLNVNAVPASSTGEAMTLAVLQPVLDKATANGTFTYGKEISAVQQQ 441 (504) Q Consensus 365 ~~~G---~y~~~wiD~~~~~dwl~~~lq~~l~~l~~~~~kIPyt~~G~~~l~~~v~~vl~~a~~nG~Ia~G~~~~~~q~~ 441 (504) ++.+ ...|.+|-+.+-.+|+++.|+..+..++-. |-|..=...|+..++.-|++-++.|.|. T Consensus 359 T~~~~~~~~~~~~i~vrR~~~~i~~~~~~~~~~~v~e----~~~~~~~~~i~~~i~~~l~~l~~~g~l~----------- 423 (477) T protein:vir:79 359 TAAWPTVTHMRNFENVRRTGDVINESLRYFSQQFVDA----PIDQGLIDSLVESVNGFGRKLIGDGALL----------- 423 (477) T ss_pred ccCCCCCCccceeeehhhHHHHHHHHHHHHHHHhccC----CCCHHHHHHHHHHHHHHHHHHHhCCcee----------- Confidence 7732 224667889999999999999988874432 3477788999999999999999999995 Q ss_pred eeccccCCcccccceeecceEEEecchHhCCHHHHhhcccCceEEEEEECCeEEEEEeeeeeC Q lcl|NC_019451. 442 YITQVTGDRRAWRQVQTLGYWINITFSSYTNSNTGLTEWKANYTLIYSKGDAIRFVEGSDVMI 504 (504) Q Consensus 442 ~i~~~~g~~~~~~~~~~~GY~v~~~~~~~~s~~dr~~R~~~~i~~~~~~aGAIh~v~i~~~~v 504 (504) ||.+.+ +.++.+++|+.+.+. .+.+.+.-...+++|++.-... 
T Consensus 424 ------------------g~~v~~-~~~~nt~~~i~~G~~-~~~i~~~p~~p~e~i~~~~~~~ 466 (477) T protein:vir:79 424 ------------------GFKAWF-DPARNPKEELAAGHL-LINYKYTVPPPLERLTYETEIT 466 (477) T ss_pred ------------------eeEEEE-ecCCCCHHHhhCCeE-EEEEEEEecCCceeEEEEEEEe Confidence 477877 457888999988887 5999999999999999988777 No 24 >protein:vir:6079 Length: 396 # NCBI annotation: gpFI # Family: family:all:115 # MgeID: mge:126 # MgeName: WPhi # Cross-refs: genbank:acc:NP_878220;genbank:gi:33438919;genbank:GeneID:1457754 Probab=98.52 E-value=3.7e-07 Score=55.83 Aligned_cols=366 Identities=13% Similarity=0.071 Sum_probs=199.0 Q ss_pred CCCc---cceEEEeeeecccccccccccceEEEeccc-----ccCccceEEecCHHHHHHhcCCCcHHHHHHHHHhccCC Q lcl|NC_019451. 1 MISQ---SRYIRIISGVGAGAPVAGRKLILRVMTTNN-----VIPPGIVIEFDNANAVLSYFGAQSEEYQRAAAYFKFIS 72 (504) Q Consensus 1 mip~---s~iv~V~~~v~~~~~~~~~~~~~l~l~~~~-----~~~~~r~~~y~s~~~V~~~Fg~~s~ey~aA~~yF~~~~ 72 (504) |-.- =+|..+... +.+........+.|++..+ .+|...-...++..+-...||.++..+.+...+|.+.. T Consensus 1 m~~~~~Gv~v~e~~~~--~~~v~~~~tav~~fvGta~~~~~~~~p~~~p~~v~s~~~~~~~~g~~~tl~~a~~~~~~~gg 78 (396) T protein:vir:60 1 MSDYHHGVQVLEINEG--TRVISTVSTAIVGMVCTASDADAEIFPLNKPVLITNVQSAIAKAGKKGTLAASLQAIADQSK 78 (396) T ss_pred CCCCCCCeEEEEcCCC--cccccccCceeEEEEecccccccccccCccCeEeechHHHHHhhcCcchhHHHHHHHhhccC Confidence 4332 122222222 2333344566677776543 23443334556667777889999999999999998742 Q ss_pred CCCcccceEEEEeeeccCCcceeeecccchhhHhHhhccCceEEEEEcccceeeeeeccccccchHHHHHHHHhhhhccc Q lcl|NC_019451. 73 KSVNSPSSISFARWVNTAIAPMVVGDNLPKTIADFAGFSAGVLTIMVGAAEQNITAIDTSAATSMDNVASIIQTEIRKNA 152 (504) Q Consensus 73 ~~~~~P~~l~igr~~~~a~~~~l~g~~~~~~~~~~~~~~~g~~titi~g~~~~~~~i~~s~ats~~~vA~~i~~~i~a~~ 152 (504) ...++-+......... . ... ..+. .. +. 
T Consensus 79 ------~~~~vv~~~~~~~~~~------~-----------~~~-----------------~~~~-~~----~~------- 106 (396) T protein:vir:60 79 ------PVTVVVRVEDGTGEDE------E-----------TKL-----------------AQTV-SN----II------- 106 (396) T ss_pred ------ceEEEEeccccccccc------c-----------ccc-----------------cccc-cc----cc------- Confidence 2344444321100000 0 000 0000 00 00 Q ss_pred ccccceeEEEEecccceeEEecccCcceeEEEEeeccchhhhhhhhcccCcceeecccccccHHHHHHHHHhhccceeEE Q lcl|NC_019451. 153 DPQLAQATVTWNQNTNQFTLVGATIGTGVLAVAKSADPQDMSTALGWSTSNVVNVAGQAADLPDAAVAKSTNVSNNFGSF 232 (504) Q Consensus 153 ~~~~~~a~vt~d~~~~~F~its~t~ga~s~~~~~sa~~~~ia~~l~~t~~~~~~~~g~aaet~~~al~~~~~~~~~wy~~ 232 (504) ...|..+ ..++. ... . +.....+. ........+........++..+.+ ....+ T Consensus 107 --------~~~d~~~-------~~tg~--~al-~-----~~~~~~~~-~~~il~ap~~~~~~v~~al~~~~~---~~~~~ 159 (396) T protein:vir:60 107 --------GTTDENG-------QYTGL--KAL-L-----AAESVTGV-KPRILGVPGLDTKEVAVALASVCQ---KLRAF 159 (396) T ss_pred --------ccccccc-------cccch--hhh-h-----hcccceee-eeeeccccccccHHHHHHHHHHhc---cCCeE Confidence 0000000 00000 000 0 00000000 011111112222223333333333 33344 Q ss_pred EEEeccCCHHHHHHHHHHHhhcCCcEEEEEeccccchhHHHHHHhhhcceeEEEEeccCCCccHHHHHHHHHHhcCcCcC Q lcl|NC_019451. 233 LFAGAPLDNDQIKAVSAWNAAQNNQFIYTVATSLANLGTLFTLVNGNAGTALNVLSATAANDFVEQCPSEILAATNYDEP 312 (504) Q Consensus 233 ~~~~~~~~~~~~~a~A~w~e~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~aa~~g~~as~nf~~~ 312 (504) .+.+.+.. .+..++-+|-+.-+.++...++----. . ...... ..+ ..+.+.++|.++.+|-++. T Consensus 160 ~i~d~p~~-~~~~~a~~~~~~~~s~~~~~~~p~~~~---~----d~~~~~-~~~-------~p~s~~~AG~~a~~d~~~g 223 (396) T protein:vir:60 160 GYISAWGC-KTISEVKAYRQNFSQRELMVIWPDFLA---W----DTVAST-TAT-------AYATARALGLRAKIDQEQG 223 (396) T ss_pred EEEeCCCC-CCHHHHHHHHhhcCCceEEEEeCceee---e----cccCCc-eeE-------EchhHHHHHHHHHhhhccC Confidence 45554422 223344455555444444443321100 0 000000 001 1245667788888875431 Q ss_pred CceeeecccccCcccc--------ccCCHHHHHHHHhCCCeEEEEEeeccceeeEEEcCEEeCCccccchhhhhhhHHHH Q lcl|NC_019451. 
313 GASQNYMYYQFPGRNI--------TVSDDTVANTVDKSRGNYIGVTQANGQQLAFYQRGILCGGPTDAVDMNVYANEIWL 384 (504) Q Consensus 313 ~g~~T~kfk~l~Gv~a--------~~lt~t~~~al~~~~~n~y~~~~~~~~~~~~~~~G~~~~G~y~~~wiD~~~~~dwl 384 (504) -......|.+.||.. ...+++|.+.|..+|+|.. +.+.| + .++.+.++++.-.|.+|-+.+-.+|+ T Consensus 224 -~~~spaN~~l~gi~~~~~~~~~~~~~~~~~~~~Ln~~gI~~~--~~~~G--~-~~wG~rT~~~d~~~~~i~~rR~~~~i 297 (396) T protein:vir:60 224 -WHKTLSNVGVNGVTGISASVFWDLQESGTDADLLNESGVTTL--IRRDG--F-RFWGNRTCSDDPLFLFENYTRTAQVL 297 (396) T ss_pred -cEeCcCCceecceeeceeecccccCCCcchhhhhhhcCcEEE--EcCCC--E-EEEcccccCCCcccceeehhhHHHHH Confidence 122334667777642 2346789999999999987 33333 3 45788899998888889999999999 Q ss_pred HHHHHHHHHHHHhcCCCCCcCHHHHHHHHHHHHHHHHHHHhcCccccccccCcccceeeccccCCcccccceeecceEEE Q lcl|NC_019451. 385 KSAIAQALLDLFLNVNAVPASSTGEAMTLAVLQPVLDKATANGTFTYGKEISAVQQQYITQVTGDRRAWRQVQTLGYWIN 464 (504) Q Consensus 385 ~~~lq~~l~~l~~~~~kIPyt~~G~~~l~~~v~~vl~~a~~nG~Ia~G~~~~~~q~~~i~~~~g~~~~~~~~~~~GY~v~ 464 (504) ++.|+..+...+-. |-|..-...|+..++.-|+.-+++|.|. ||.++ T Consensus 298 ~~~i~~~~~~~v~e----~n~~~~~~~i~~~i~~~l~~l~~~gal~-----------------------------g~~~~ 344 (396) T protein:vir:60 298 ADTMAEAHMWAVDK----PITATLIRDIVDGINAKFRELKTNGYIV-----------------------------DATCW 344 (396) T ss_pred HHHHHHHHHHhccC----CCCHHHHHHHHHHHHHHHHHHHhCCcee-----------------------------ceEEE Confidence 99999998874432 6788999999999999999999999996 35566 Q ss_pred ecchHhCCHHHHhhcccCceEEEEEECCeEEEEEeeeeeC Q lcl|NC_019451. 465 ITFSSYTNSNTGLTEWKANYTLIYSKGDAIRFVEGSDVMI 504 (504) Q Consensus 465 ~~~~~~~s~~dr~~R~~~~i~~~~~~aGAIh~v~i~~~~v 504 (504) .. .+..+++|+.+.+. .+.+.+..--.++.|++..... 
T Consensus 345 ~d-~~~nt~~~i~~G~~-~~~i~~~p~~pae~I~~~~~~~ 382 (396) T protein:vir:60 345 FS-EESNDAETLKAGKL-YIDYDYTPVPPLENLTLRQRIT 382 (396) T ss_pred Ee-cCCCCHHHhhCCEE-EEEEEEEecCCcceEEEEEEEc Confidence 54 36778888877776 5888899999999999999988 No 25 >protein:vir:96586 Length: 587 # NCBI annotation: ORF011 # Family: family:all:2449 # MgeID: mge:1623 # MgeName: Twort # Cross-refs: genbank:acc:YP_238553;genbank:gi:66391266;genbank:GeneID:5130368 Probab=98.47 E-value=5.3e-07 Score=55.01 Aligned_cols=448 Identities=12% Similarity=0.056 Sum_probs=210.2 Q ss_pred CCCccceEE------Eee-eecccccccccccceEEEecccccCccceEEecCHHHHHHhcCCCcHHHHHHHHHhccCCC Q lcl|NC_019451. 1 MISQSRYIR------IIS-GVGAGAPVAGRKLILRVMTTNNVIPPGIVIEFDNANAVLSYFGAQSEEYQRAAAYFKFISK 73 (504) Q Consensus 1 mip~s~iv~------V~~-~v~~~~~~~~~~~~~l~l~~~~~~~~~r~~~y~s~~~V~~~Fg~~s~ey~aA~~yF~~~~~ 73 (504) .-|-.++.. +.- ++.+..+ ...+++.|++....=|++++..+++.++.-+.||.. +-+.|..+.|..++ T Consensus 5 ~~~~~~~~~Pgv~~~~~~~~~~~~~~--~~~~~~~~iG~a~~g~~~~~~~~~~~~~~~~~~g~G-~l~~ai~~a~~~~~- 80 (587) T protein:vir:96 5 IFPRRPIQRPHASIEVDSSGIGGSAS--NSEKILCLIGKAEGGEPNTVYQVRNYAQAKSVFRSG-ELLDAIELAWGSNP- 80 (587) T ss_pred eeCCCcccCCceEEEEecCCccCCCC--CCCceEEEEEEecCCCCceeEEEcChHHHHHhhcCC-cHHHHHHHHhccCc- Confidence 123344332 222 2222322 246788888887777888888899999999999987 46778888897654 Q ss_pred CCcccceEEEEeeeccCCcceeeecccc--------------hhhHhHhhccC-------------------c-eEEEEE Q lcl|NC_019451. 74 SVNSPSSISFARWVNTAIAPMVVGDNLP--------------KTIADFAGFSA-------------------G-VLTIMV 119 (504) Q Consensus 74 ~~~~P~~l~igr~~~~a~~~~l~g~~~~--------------~~~~~~~~~~~-------------------g-~~titi 119 (504) ..-..++|.=|. ..++++.+.-+.+. ..+.+.+.... | -++|.. 
T Consensus 81 -~~g~~~~~a~rv-~~~~~a~~~~~~~~~~~~~~g~~~n~i~v~~~~~~~~~~~~~~~~~~~~~~~~~~~n~G~v~~i~y 158 (587) T protein:vir:96 81 -QYTAGKILAMRV-EDAKASQLEKGGLRVTSKIFGSVSNDIQVALEKNTITDSLRLRVVFQKDNYQEVFDNLGNIFSINY 158 (587) T ss_pred -CCCceEEEEEec-CCCccceeecccccccccccCCCCceEEEEEEeccCCCccceEEEEecCCceeeccccCceEEEEe Confidence 335566776665 34444433222111 11111000000 1 122322 Q ss_pred cccce-------------------------eeeeeccccccchHHHHHHHHhhhhccccc-------ccceeEEEE-ecc Q lcl|NC_019451. 120 GAAEQ-------------------------NITAIDTSAATSMDNVASIIQTEIRKNADP-------QLAQATVTW-NQN 166 (504) Q Consensus 120 ~g~~~-------------------------~~~~i~~s~ats~~~vA~~i~~~i~a~~~~-------~~~~a~vt~-d~~ 166 (504) .|+.. .+...-+.. .-...+..+...++..+.. .....++.+ |.. T Consensus 159 ~g~~~~a~~~~~~~~~~~~A~~l~l~gg~~~v~~yrl~~--g~~~~~~~~~~~~~~~~~~tAky~g~~~n~~~v~v~d~~ 236 (587) T protein:vir:96 159 KGEGEKATFSVEKDKETQEAKRLVLKVDEKEVKAYELNG--GAYSFTNEIITDINELPDFEAKLSPFGDKNLESRKLDEA 236 (587) T ss_pred cccccceeEeeccCcccceeeeeEEEecCceEEEEEeCC--CchhhhhhhhhhhccccceEEEeecccCceeEEEeeccc Confidence 22221 111111100 0000111111111111000 000111111 000 Q ss_pred ----cceeEEecccC-cc-------ee-EEEEeeccc--------------hh---hhhhhh-cc-cCcceeec---ccc Q lcl|NC_019451. 167 ----TNQFTLVGATI-GT-------GV-LAVAKSADP--------------QD---MSTALG-WS-TSNVVNVA---GQA 211 (504) Q Consensus 167 ----~~~F~its~t~-ga-------~s-~~~~~sa~~--------------~~---ia~~l~-~t-~~~~~~~~---g~a 211 (504) .....+-.++. +. .. ......... .. .....+ +. ........ |.. T Consensus 237 ~~~~~k~~~~y~~t~~~di~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~aLtGG~dG~~ 316 (587) T protein:vir:96 237 TDVDIKGKAVYVKAVFGDIENQTQYNQYVKFEQLPEQASEPSDVEVHAETESATVTATSKPKAIEPFELTKLSGGTNGEP 316 (587) T ss_pred cccccceEEEeehhhhhhhhhhhccccceeeccccchhhhhhcccccccccceeeeecccccccccccceeeecCCCCCC Confidence 00000000000 00 00 000000000 00 000000 00 00000111 222 Q ss_pred cccHHHHHHHHHhhccceeEEEEEeccCCHHHHHHHHHHHhhc---CCcEEEEEeccccchhHHHHHHhhhc-ceeEEEE Q lcl|NC_019451. 
212 ADLPDAAVAKSTNVSNNFGSFLFAGAPLDNDQIKAVSAWNAAQ---NNQFIYTVATSLANLGTLFTLVNGNA-GTALNVL 287 (504) Q Consensus 212 aet~~~al~~~~~~~~~wy~~~~~~~~~~~~~~~a~A~w~e~~---~~~~~~~~~~~d~~~~~~~~~~~~~~-~~~~~~~ 287 (504) .++..++++++..+ +|+.+.. ...+.+.+..+.+|++.. .+++..++............+..... ..+...+ T Consensus 317 ~~~y~~~l~ale~~--~~~~i~~--~t~d~ai~~~l~a~vk~~r~~gk~~~aVlg~~~~~~~~~~~~~a~~~n~e~vi~v 392 (587) T protein:vir:96 317 PTSWSAKLEKFKNE--GGYYIVP--LTDRQSVHSEVATFVKNRSDAGEPMRAIVGGGTSETKEKLFGRQAILNNPRVALV 392 (587) T ss_pred cccHHHHHHHHhhC--CcEEEEe--cCCCHHHHHHHHHHHHHHHhCCCeEEEEecCCCCCCHHHHHHHHhhcCCCcEEEE Confidence 44678899998765 5555433 223455667799999644 33444444332222222222222222 2222222 Q ss_pred ec------------cCCCccHHHHHHHHHHhcCcCcCCceeeecccccCccc-cccCCHHHHHHHHhCCCeEEEEEeecc Q lcl|NC_019451. 288 SA------------TAANDFVEQCPSEILAATNYDEPGASQNYMYYQFPGRN-ITVSDDTVANTVDKSRGNYIGVTQANG 354 (504) Q Consensus 288 ~~------------~~~~~~~~aa~~g~~as~nf~~~~g~~T~kfk~l~Gv~-a~~lt~t~~~al~~~~~n~y~~~~~~~ 354 (504) .+ .++..+.++.+.|..++.+.+. ++| ||.++++. ...++.+|++.+.++|++......+.. T Consensus 393 ~~~~~~~~~~~~~~~~~~~~~aa~vAG~~Ag~~~~~---S~T--~~~~~~~~v~~~~t~~e~~~~i~~G~~~l~~~~~~~ 467 (587) T protein:vir:96 393 ANSGKFVMGNGRILQAPAYMVASAVAGLVSGLDIGE---SIT--FKPLFVNSLDKVYESEELDELNENGIITIEFVRNRM 467 (587) T ss_pred ecceEEecCCCceeeechhhHHHHHHHHHhcCcccc---Ccc--ceeeecccccccCCHHHHHHHHhCCeEEEEEecCCc Confidence 21 1112234455567777766543 333 44554432 237899999999999999886544432 Q ss_pred ceeeEEEcCEEeCC---ccccchhhhhhhHHHHHHHHHHHHHHHHhcCCCCCcCHHHHHHHHHHHHHHHHHHHhcCcccc Q lcl|NC_019451. 355 QQLAFYQRGILCGG---PTDAVDMNVYANEIWLKSAIAQALLDLFLNVNAVPASSTGEAMTLAVLQPVLDKATANGTFTY 431 (504) Q Consensus 355 ~~~~~~~~G~~~~G---~y~~~wiD~~~~~dwl~~~lq~~l~~l~~~~~kIPyt~~G~~~l~~~v~~vl~~a~~nG~Ia~ 431 (504) ...-..-++.++=. 
.-+|.-|-+++-.|.+.+.++..+-+.|.- | |=++.|...|++.++..|++..+.|.|.- T Consensus 468 ~~v~~~vnsitT~t~~~~~~~~~i~virv~D~i~~di~~~~~~~yiG--k-~nn~~~r~~v~~~i~~~L~~l~~~g~I~~ 544 (587) T protein:vir:96 468 TTMFRIVDDVTTFPDKNDPVKSEMALGEANDFLVSELKILLEEQYIG--T-RTINTSASQIKDFVQSYLGRKKRDNEIQD 544 (587) T ss_pred EEEEEeeccceecCCCCCchhhhhhhHHHHHHHHHHHHHHHHhcCCc--c-ccCHHHHHHHHHHHHHHHHHHHhCCcccC Confidence 22111223444311 113556788888888888888776655543 4 56889999999999999999999999963 Q ss_pred ccccCcccceeeccccCCcccccceeecceEEEecchHhCCHHHHhhcccCceEEEEEECCeEEEEEeeeeeC Q lcl|NC_019451. 432 GKEISAVQQQYITQVTGDRRAWRQVQTLGYWINITFSSYTNSNTGLTEWKANYTLIYSKGDAIRFVEGSDVMI 504 (504) Q Consensus 432 G~~~~~~q~~~i~~~~g~~~~~~~~~~~GY~v~~~~~~~~s~~dr~~R~~~~i~~~~~~aGAIh~v~i~~~~v 504 (504) ....+. .+.+ .+|+ --+.+.+...-+|++|-++.+.- T Consensus 545 -~~~~dv-----------------------~v~~-------~~D~-----~~v~~~v~Pv~~mekIy~tv~~~ 581 (587) T protein:vir:96 545 -FPPEDV-----------------------QVII-------EGNE-----ARISLTIFPIRALKKISVSLVYR 581 (587) T ss_pred -CCccce-----------------------EEEe-------cCCE-----EEEEEEEEEcccceEEEEEEEEE Confidence 111111 0111 1111 13677777888888888877765 No 26 >protein:vir:63742 Length: 562 # NCBI annotation: Tsh # Family: family:all:2449 # MgeID: mge:1517 # MgeName: P100 # Cross-refs: genbank:gi:82547629;genbank:GeneID:3783475 Probab=98.37 E-value=1e-06 Score=53.44 Aligned_cols=438 Identities=10% Similarity=0.047 Sum_probs=208.8 Q ss_pred CCCccc---eEEEe-eeecccccccccccceEEEecccccCccceEEecCHHHHHHhcCCCcHHHHHHHHHhccCCCCCc Q lcl|NC_019451. 1 MISQSR---YIRII-SGVGAGAPVAGRKLILRVMTTNNVIPPGIVIEFDNANAVLSYFGAQSEEYQRAAAYFKFISKSVN 76 (504) Q Consensus 1 mip~s~---iv~V~-~~v~~~~~~~~~~~~~l~l~~~~~~~~~r~~~y~s~~~V~~~Fg~~s~ey~aA~~yF~~~~~~~~ 76 (504) |-+.+| +|.+. .++.+..+ ....++.|++....=|++++..+++-++.-.-||... .-.+..++|..++.. 
T Consensus 8 ~~~~~~Pgv~~~~~~s~~~~~~~--~~~~~~~~iG~a~~G~~~~~~~~~~~~~~~~~fg~g~-l~~~i~~a~~~~~~~-- 82 (562) T protein:vir:63 8 RKPVSRPHTEISVDTSGIGGSSS--GSEKILCLVGSATGGKPNAVYKVRNYSQAKSVFRSGE-LLDAIERAWNPGEGT-- 82 (562) T ss_pred CCcccCCceEEEEecCCCcccCC--CCCceEEEEEEeCCCCCceeEEEccHHHHHHHhcCCc-hHHHHHHhccccccC-- Confidence 444443 23333 23333333 3567888888776667799999999999989998853 556677777655522 Q ss_pred ccceEEEEeeeccCCcceeeecccc--------------hhh-----H---hHh---------hccC--c-eEE------ Q lcl|NC_019451. 77 SPSSISFARWVNTAIAPMVVGDNLP--------------KTI-----A---DFA---------GFSA--G-VLT------ 116 (504) Q Consensus 77 ~P~~l~igr~~~~a~~~~l~g~~~~--------------~~~-----~---~~~---------~~~~--g-~~t------ 116 (504) --+++|+-|... +.++.+.=+.+. ..+ . .|+ .+-+ | -|+ T Consensus 83 g~~~~~~~rv~~-a~~a~~~~~~~~~~a~~~g~~~n~i~v~~~~~~~~~~~~~~v~~~~~~~~ev~~~~g~V~~i~y~g~ 161 (562) T protein:vir:63 83 GAGDILAMRVEE-AKEATFEAEGVKVSSTIYGADANDIQVALEDNTITGTKRLSIVFAKERVNQVYDNLGSIFSIKYKGT 161 (562) T ss_pred CceEEEEEEcCC-CccceeEecceeEEEeecccCCCeEEEEEecCCCCCCcceEEEecCCCcchhhhhccceeeeeeecc Confidence 335688888844 333332222111 000 0 111 0000 0 112 Q ss_pred -------------------EEEcccceeeeeeccccccchHHHHHHHHhhhhcccccccceeEEEEecc-cceeEE---- Q lcl|NC_019451. 117 -------------------IMVGAAEQNITAIDTSAATSMDNVASIIQTEIRKNADPQLAQATVTWNQN-TNQFTL---- 172 (504) Q Consensus 117 -------------------iti~g~~~~~~~i~~s~ats~~~vA~~i~~~i~a~~~~~~~~a~vt~d~~-~~~F~i---- 172 (504) +.+.+..+++..+.+.... ...+..+...++.... .+..|-.. ++.+.+ T Consensus 162 ~~~~~~~v~~~~~~~~a~~l~~~~g~~~v~~~~L~~g~--~~~~~~l~~~in~~~~-----~~aky~~~~gn~i~~~~~d 234 (562) T protein:vir:63 162 EASATFTVAVDPVTFKATKLTLKAGDKTVKEYDLGSGA--YAETNVLISDINNLPD-----FEAKFFPIGDKNLTTDNFD 234 (562) T ss_pred cccceEEEEecCcceeEEEEEeecCCcceeEEEecCCc--cchhHHHHHhhccccc-----eEEEeeccCCceeeeeccc Confidence 2222222222222222110 0111222222322111 01111100 001111 Q ss_pred -------eccc-C------------cceeEEEEeeccchhhhhhhhcccCcceeec---ccccccHHHHHHHHHhhccce Q lcl|NC_019451. 
173 -------VGAT-I------------GTGVLAVAKSADPQDMSTALGWSTSNVVNVA---GQAADLPDAAVAKSTNVSNNF 229 (504) Q Consensus 173 -------ts~t-~------------ga~s~~~~~sa~~~~ia~~l~~t~~~~~~~~---g~aaet~~~al~~~~~~~~~w 229 (504) .+.. + .....+.........++ ......... |...++..++++++... +| T Consensus 235 ~~~~~~vkt~~~~v~t~~~d~~~~~~~~~~v~~~~~~~~~la-----~~~~~~LtGG~dGt~~~~~~~al~ale~~--~~ 307 (562) T protein:vir:63 235 AQIDVDIKTKEAYVKAVGGDIEKQTAYNGYVDFEFDRSKEIA-----NFPLTKLTGGDNGTIPESWADKFSYFANE--GG 307 (562) T ss_pred cccccchhhhhhhhhhhhhhhhhcccccceeeeeecccccee-----cccceeeecCCCCCchhhHHHHHHHHHhC--Cc Confidence 0000 0 00000000000000000 001111112 22334567888888764 56 Q ss_pred eEEEEEeccCCHHHHHHHHHHHhhc---CCcEEEEEeccccchhHHHHHHhhh-cceeEEEEecc--------CCCccH- Q lcl|NC_019451. 230 GSFLFAGAPLDNDQIKAVSAWNAAQ---NNQFIYTVATSLANLGTLFTLVNGN-AGTALNVLSAT--------AANDFV- 296 (504) Q Consensus 230 y~~~~~~~~~~~~~~~a~A~w~e~~---~~~~~~~~~~~d~~~~~~~~~~~~~-~~~~~~~~~~~--------~~~~~~- 296 (504) +.+... ..+.+-+.++.+|++.. .++++.++............+.... ...+...+... ....+| T Consensus 308 ~~i~~~--t~d~av~~~l~a~vkr~~~~g~~~~aVlg~~~~~~~~~~~~~a~~~n~ervv~v~~~~~~~~~~~~~~~~~~ 385 (562) T protein:vir:63 308 YYLVPL--TSKQAVHAEALQFVRDCSYNGNPMRVFVGGGIGESMEQLFTRAIGLQNERAGLIGFSGTVKMDDGRSLKMPG 385 (562) T ss_pred EEEEec--CCCHHHHHHHHHHHHHHHhCCCcEEEEecCCCCCCHHHHHHHhhhcCCCcEEEEecCeeEECCCCceeeech Confidence 555432 22344567799999543 3334444432222222222222222 22222222211 112243 Q ss_pred ---HHHHHHHHHhcCcCcCCceeeecccccCccc-cccCCHHHHHHHHhCCCeEEEEEeeccceeeEEEcCEEeCC---c Q lcl|NC_019451. 297 ---EQCPSEILAATNYDEPGASQNYMYYQFPGRN-ITVSDDTVANTVDKSRGNYIGVTQANGQQLAFYQRGILCGG---P 369 (504) Q Consensus 297 ---~aa~~g~~as~nf~~~~g~~T~kfk~l~Gv~-a~~lt~t~~~al~~~~~n~y~~~~~~~~~~~~~~~G~~~~G---~ 369 (504) ++.+.|..+..+.+ .++ .||.++++. ...++.+|++.+..+|++......+.+......-++.++-+ . 
T Consensus 386 ~~~aa~vAGl~A~~~~~---~Sl--T~~~i~~~~v~~~~t~~e~~~li~~Gv~~l~~~~~~~v~~~~iv~~itT~t~~~~ 460 (562) T protein:vir:63 386 YMFAAQVAGLTCGLEIG---EAI--TFKNIAIETLDTIYEGSQLDQLNESGIITAEFVRNRAVTNFRIVDDVTTFNDKTD 460 (562) T ss_pred hHHHHHHHHHhhcCchh---cCc--cceeeccccccccCCHHHHHHHHhCCeEEEEEecCCcEEEEEeeccceecCCCCC Confidence 34455555555433 333 344554332 34789999999999999988654433222111224444321 1 Q ss_pred cccchhhhhhhHHHHHHHHHHHHHHHHhcCCCCCcCHHHHHHHHHHHHHHHHHHHhcCccccccccCcccceeeccccCC Q lcl|NC_019451. 370 TDAVDMNVYANEIWLKSAIAQALLDLFLNVNAVPASSTGEAMTLAVLQPVLDKATANGTFTYGKEISAVQQQYITQVTGD 449 (504) Q Consensus 370 y~~~wiD~~~~~dwl~~~lq~~l~~l~~~~~kIPyt~~G~~~l~~~v~~vl~~a~~nG~Ia~G~~~~~~q~~~i~~~~g~ 449 (504) ..|.-|-+++-.|.+.+.++..+-+.|.- | |=++.|...|++.++..|++-.+.|.|.- .+..+ T Consensus 461 ~~~~ki~viRv~D~i~~dir~~~~~~yiG--k-~Nn~~~r~~v~~~i~~~L~~l~~~gaI~~-~~~~d------------ 524 (562) T protein:vir:63 461 PVKSEIGVGEANDFLVSELKISLDNEYIG--T-KIIDTSASLVKNFVQSFLDRKKLAKEIQD-YSPEE------------ 524 (562) T ss_pred chhhhhhhhHHHHHHHHHHHHHHHhcCCc--c-ccChHHHHHHHHHHHHHHHHHHhCCcccC-CCccc------------ Confidence 13555778888888888887766654443 4 56889999999999999999999999952 11111 Q ss_pred cccccceeecceEEEecchHhCCHHHHhhcccCceEEEEEECCeEEEEEeeeeeC Q lcl|NC_019451. 450 RRAWRQVQTLGYWINITFSSYTNSNTGLTEWKANYTLIYSKGDAIRFVEGSDVMI 504 (504) Q Consensus 450 ~~~~~~~~~~GY~v~~~~~~~~s~~dr~~R~~~~i~~~~~~aGAIh~v~i~~~~v 504 (504) + .+.. 
.+|+ .-+.+.+...-++|+|-++.+.- T Consensus 525 ------v-----~v~~-------~~d~-----~~v~~~v~pv~~mekIy~ti~~~ 556 (562) T protein:vir:63 525 ------V-----QVVI-------EGDV-----ARISLTVFPIRSMKKIEVSLVYR 556 (562) T ss_pred ------e-----EEEe-------cCCE-----EEEEEEEEEcccceEEEEEEEEe Confidence 0 0100 1122 23567778888888888888877 No 27 >protein:vir:80488 Length: 562 # NCBI annotation: Tsh # Family: family:all:2449 # MgeID: mge:1883 # MgeName: A511 # Cross-refs: genbank:acc:YP_001468473;genbank:gi:157325048;genbank:GeneID:5601446 Probab=98.36 E-value=1.1e-06 Score=53.28 Aligned_cols=442 Identities=10% Similarity=0.025 Sum_probs=209.8 Q ss_pred CC----Cccce------EEE-eeeecccccccccccceEEEecccccCccceEEecCHHHHHHhcCCCcHHHHHHHHHhc Q lcl|NC_019451. 1 MI----SQSRY------IRI-ISGVGAGAPVAGRKLILRVMTTNNVIPPGIVIEFDNANAVLSYFGAQSEEYQRAAAYFK 69 (504) Q Consensus 1 mi----p~s~i------v~V-~~~v~~~~~~~~~~~~~l~l~~~~~~~~~r~~~y~s~~~V~~~Fg~~s~ey~aA~~yF~ 69 (504) |. |--++ |.+ ..++.+..+ ....++.|++....=|++++..+++.++.-.-||... --.+...+|. T Consensus 1 ~~~~~~~~~~~~~pgv~~~~~~s~~~~~~~--~~~~~~~~ig~a~~G~~~~~~~~~~~~~~~~~f~~g~-l~~~i~~a~~ 77 (562) T protein:vir:80 1 MAIEIYPRKPVSRPHTEISVDTSGIGGSSS--GSEKILCLVGSATGGKPNAVYKVRNYSQAKSVFRSGE-LLDAIERAWN 77 (562) T ss_pred CeeeeeCCCcccCCceEEEEecCCcccCCC--CCCceEEEEEEeCCCCcceeEEEccHHHHHHHhcCCC-hHHHHHHhcc Confidence 21 22222 222 333333333 3467888888776667799999999999889998753 3355777787 Q ss_pred cCCCCCcccceEEEEeeeccCCcceeeecccc--------------hhh-----Hh---Hh---------hcc------- Q lcl|NC_019451. 70 FISKSVNSPSSISFARWVNTAIAPMVVGDNLP--------------KTI-----AD---FA---------GFS------- 111 (504) Q Consensus 70 ~~~~~~~~P~~l~igr~~~~a~~~~l~g~~~~--------------~~~-----~~---~~---------~~~------- 111 (504) .++.. --+++|+-|... +.++.+.=+.+. ..+ .. 
|+ .+- T Consensus 78 ~~~~~--g~~~~~~~rv~~-a~~a~~~~~~~~~~~~~~g~~~n~i~v~~~~~~~~~~~~~~v~~~~~~~~ev~~~~g~v~ 154 (562) T protein:vir:80 78 PGEGT--GAGDILAMRVEE-AKEATFEAEGVKVSSTIYGADANDIQVALEDNTITGTKRLSIVFAKERVNQVYDNLGSIF 154 (562) T ss_pred ccccc--CceEEEEEEcCC-CCcceEEecceEEEEeecccCCCceEEEEecCCCCCCcceEEEecCCcceEEeeccCcee Confidence 65522 234688888744 333333222111 000 00 10 000 Q ss_pred --------------------Cc-eEEEEEcccceeeeeeccccccchHHHHHHHHhhhhcccccccceeEEEEeccccee Q lcl|NC_019451. 112 --------------------AG-VLTIMVGAAEQNITAIDTSAATSMDNVASIIQTEIRKNADPQLAQATVTWNQNTNQF 170 (504) Q Consensus 112 --------------------~g-~~titi~g~~~~~~~i~~s~ats~~~vA~~i~~~i~a~~~~~~~~a~vt~d~~~~~F 170 (504) ++ .+.+.+.+..+++..+.+.... ...+..+...++.... .+..|.+.. .+ T Consensus 155 ~i~y~g~~~~a~~~i~~~~~~~~a~~l~~~~g~~~v~~~~l~~g~--~~~~~~l~~~i~~~~~-----~tAky~g~~-~n 226 (562) T protein:vir:80 155 SIKYKGTEASATFTVAVDPVTFKATKLTLKAGDKTVKEYDLGSGA--YAETNVLISDINNLPD-----FEAKFFPIG-DK 226 (562) T ss_pred eeeeccccccceeEEEecCccceEEEEEEecCCcceeEEEeCCCc--cchhhhhhhhhccccc-----eEEEecccC-Cc Confidence 00 1222222222222222222111 0011111222221110 011111000 00 Q ss_pred EEecc-------cCcceeEEEEeeccc--------hhhhh----hhh-ccc-Ccceeeccc---ccccHHHHHHHHHhhc Q lcl|NC_019451. 171 TLVGA-------TIGTGVLAVAKSADP--------QDMST----ALG-WST-SNVVNVAGQ---AADLPDAAVAKSTNVS 226 (504) Q Consensus 171 ~its~-------t~ga~s~~~~~sa~~--------~~ia~----~l~-~t~-~~~~~~~g~---aaet~~~al~~~~~~~ 226 (504) .++.. ..........+...+ .+.-. ..+ ++. .......|. ..++..++++++.+. T Consensus 227 ~i~~~~~d~~~~~~~kt~~~~v~~~~~d~~~~~~~n~~v~~~~~~~~~la~~~~~~LtGG~dG~~~~~~~dal~~Le~~- 305 (562) T protein:vir:80 227 NLTTDNFDAQIDVDIKTKEAYVKAVGGDIEKQTAYNGYVEFEFDRSKEIANFPLTKLTGGDNGTIPESWADKFSYFANE- 305 (562) T ss_pred eeeecccccchhhhcccceeeeeehhhhhhhcccccceEEEEeccCccccccceeeeeCCCCCCccccHHHHHHHHHhC- Confidence 01000 000000000000000 00000 000 000 011111222 245678899998765 Q ss_pred cceeEEEEEeccCCHHHHHHHHHHHhhc---CCcEEEEEeccccchhHHHHHHhh-hcceeEEEEecc--------CCCc Q lcl|NC_019451. 
227 NNFGSFLFAGAPLDNDQIKAVSAWNAAQ---NNQFIYTVATSLANLGTLFTLVNG-NAGTALNVLSAT--------AAND 294 (504) Q Consensus 227 ~~wy~~~~~~~~~~~~~~~a~A~w~e~~---~~~~~~~~~~~d~~~~~~~~~~~~-~~~~~~~~~~~~--------~~~~ 294 (504) +|+.+... ..+.+.+.+++.|++.. .++++.++................ ....+...+... .... T Consensus 306 -~~~~i~~~--t~d~ai~~~~~a~vkr~r~~g~~~~aVvg~~~~~~~~~~~~~a~~~n~e~vv~v~~~~~~~~~~~~~~~ 382 (562) T protein:vir:80 306 -GGYYLVPL--TSKQAVHAEALQFVRDCSYNGNPMRVFVGGGIGESMEQLFTRAIGLQNERAGLIGFSGTVKMDDGRSLK 382 (562) T ss_pred -CcEEEEec--CCChHHHHHHHHHHHHHHhCCCeEEEEecCCCCCCHHHHHHHhhhcCCCeEEEEecCeeEECCCCceee Confidence 56555432 22455567899999644 333443332222221122222222 222232222211 0112 Q ss_pred c----HHHHHHHHHHhcCcCcCCceeeecccccCccc-cccCCHHHHHHHHhCCCeEEEEEeeccceeeEEEcCEEeC-- Q lcl|NC_019451. 295 F----VEQCPSEILAATNYDEPGASQNYMYYQFPGRN-ITVSDDTVANTVDKSRGNYIGVTQANGQQLAFYQRGILCG-- 367 (504) Q Consensus 295 ~----~~aa~~g~~as~nf~~~~g~~T~kfk~l~Gv~-a~~lt~t~~~al~~~~~n~y~~~~~~~~~~~~~~~G~~~~-- 367 (504) + .++.+.|..+..+.+. + ..||.++++. ...++.+|++.+..+|++......+........-++.++- T Consensus 383 ~~~~~~aa~vAGl~Ag~~~~~---S--~T~~~i~~~~v~~~lt~~e~~~li~~G~l~l~~~~~~~v~~~riv~~itT~t~ 457 (562) T protein:vir:80 383 MPGYMFAAQVAGLTCGLEIGE---A--ITFKNIAIETLDTIYEGSQLDQLNESGIITAEFVRNRAVTNFRIVDDVTTFND 457 (562) T ss_pred echhHHHHHHHHHHhcCcccc---C--ccceeeccccccccCCHHHHHHHHhCCeEEEEEecCCcEEEEEeeccceeccC Confidence 2 3445556666665432 3 3445666543 2478999999999999998865444322221223444431 Q ss_pred -CccccchhhhhhhHHHHHHHHHHHHHHHHhcCCCCCcCHHHHHHHHHHHHHHHHHHHhcCccccccccCcccceeeccc Q lcl|NC_019451. 368 -GPTDAVDMNVYANEIWLKSAIAQALLDLFLNVNAVPASSTGEAMTLAVLQPVLDKATANGTFTYGKEISAVQQQYITQV 446 (504) Q Consensus 368 -G~y~~~wiD~~~~~dwl~~~lq~~l~~l~~~~~kIPyt~~G~~~l~~~v~~vl~~a~~nG~Ia~G~~~~~~q~~~i~~~ 446 (504) -...|.-|-+++-.|.+.+.++..+-+.|.- | |=++.|...|++.++..|++-.+.|.|.-. 
...+ T Consensus 458 ~~~~~~~ki~viRv~D~i~~dir~~~~~~yIG--k-~Nn~~~r~~v~~~i~~~L~~l~~~gaI~~~-~~~d--------- 524 (562) T protein:vir:80 458 KTDPVKSEIGVGEANDFLVSELKISLDNEYIG--T-KIIDTSASLVKNFVQSFLDRKKLAKEIQDY-SPEE--------- 524 (562) T ss_pred CCCchhhhhhhhHHHHHHHHHHHHHHHhcCCc--c-ccChHHHHHHHHHHHHHHHHHHhCCcccCC-Cccc--------- Confidence 1224555778888888888887766665543 4 568899999999999999999999999621 1100 Q ss_pred cCCcccccceeecceEEEecchHhCCHHHHhhcccCceEEEEEECCeEEEEEeeeeeC Q lcl|NC_019451. 447 TGDRRAWRQVQTLGYWINITFSSYTNSNTGLTEWKANYTLIYSKGDAIRFVEGSDVMI 504 (504) Q Consensus 447 ~g~~~~~~~~~~~GY~v~~~~~~~~s~~dr~~R~~~~i~~~~~~aGAIh~v~i~~~~v 504 (504) + .+. ..+|+ . -+++.+..--++++|-++.+.- T Consensus 525 ---------v-------~v~-----~~~d~---~--~v~~~v~Pv~~mekIy~ti~~~ 556 (562) T protein:vir:80 525 ---------V-------QVV-----IEGDI---A--RISLTVFPIRSMKKIEVSLVYR 556 (562) T ss_pred ---------e-------EEE-----ecCCE---E--EEEEEEEEcccceEEEEEEEEE Confidence 0 110 01222 1 3677788888888888888766 No 28 >protein:vir:80779 Length: 569 # NCBI annotation: putative tail sheath protein # Family: family:all:2449 # MgeID: mge:1885 # MgeName: phiEF24C # Cross-refs: genbank:acc:YP_001504132;genbank:gi:158079319;genbank:GeneID:5666428 Probab=98.32 E-value=1.4e-06 Score=52.76 Aligned_cols=442 Identities=10% Similarity=0.057 Sum_probs=212.9 Q ss_pred CCCccceEE--Ee-----eeecccccccccccceEEEecccccCccceEEecCHHHHHHhcCCCcHHHHHHHHHhccCCC Q lcl|NC_019451. 1 MISQSRYIR--II-----SGVGAGAPVAGRKLILRVMTTNNVIPPGIVIEFDNANAVLSYFGAQSEEYQRAAAYFKFISK 73 (504) Q Consensus 1 mip~s~iv~--V~-----~~v~~~~~~~~~~~~~l~l~~~~~~~~~r~~~y~s~~~V~~~Fg~~s~ey~aA~~yF~~~~~ 73 (504) ..|--++++ |. .++.+. ......++.|++....=|++++..+++.++.-+-||.. +--.|..++|.-.+. 
T Consensus 5 ~~~~~~~~~Pgv~~~~~~~~~~~~--~~~~~~~~~~ig~a~~G~~~~~~~~~~~~~~~~~f~~g-~l~~a~~~a~~~~~~ 81 (569) T protein:vir:80 5 QFPRKKVSRPHTEITVDTSGIGGS--SSSSDKTLMLVGSAKGGKPDTVYRFRNYQQAKQVLRSG-DLLDAIELAWNASDV 81 (569) T ss_pred eecCCccccCceEEEEecCCCcCC--CCCCceeEEEEEEeCCCCCceeEEecCHHHHHHHhcCC-chhHHHHhhccCccc Confidence 234444433 22 222223 33357788888887777889999999999998999885 366677888876665 Q ss_pred CCcccceEEEEeeeccCCcceeeec--------------ccchhhHh--------Hhh---------ccC---ceEEEEE Q lcl|NC_019451. 74 SVNSPSSISFARWVNTAIAPMVVGD--------------NLPKTIAD--------FAG---------FSA---GVLTIMV 119 (504) Q Consensus 74 ~~~~P~~l~igr~~~~a~~~~l~g~--------------~~~~~~~~--------~~~---------~~~---g~~titi 119 (504) ...-|+++|+-|...+ .++.+.-+ .+...+.. ++. +.+ .-++|++ T Consensus 82 ~~~~~~~~~~~rv~~a-~~a~~~~~~~~~~a~~~g~~~n~i~v~l~~~~~~~~~~~~v~~~~~~~~~~~~~ig~v~si~y 160 (569) T protein:vir:80 82 NTASAGDILAVRVEDA-KNATLTKGGLTFASTIYGVDANEIQVALEDNNLTHTKRLTVAFSKDGYKKVFDNLGKIFSIQY 160 (569) T ss_pred cccCceEEEEEEcCCC-eeeeeeccceeeeeeeccCCCceEEEEEecCcCCcceeeEEeeecCCCccccccccceeeEEE Confidence 5668889999988442 22221101 01000100 000 000 0122222 Q ss_pred cccceee-------------eeecccc------c-------cc--hHHHHHHHHhhhhcccccccceeEEE-Eecc---- Q lcl|NC_019451. 120 GAAEQNI-------------TAIDTSA------A-------TS--MDNVASIIQTEIRKNADPQLAQATVT-WNQN---- 166 (504) Q Consensus 120 ~g~~~~~-------------~~i~~s~------a-------ts--~~~vA~~i~~~i~a~~~~~~~~a~vt-~d~~---- 166 (504) .|+.... ..+.+.. . ++ -...+..+.+++..... ..+++. .... T Consensus 161 tg~~~~a~~~~~~~~~~~~a~~l~~~~g~~~~~~~~v~~~~~~~~~~~~~~~lv~~~~~~~~---f~a~~~~~~~~~~~~ 237 (569) T protein:vir:80 161 KGSEAQANFTIAQDSISKKATTLTLNVGSEPESTTEVMKYELGQGVYSETNVLVSAINSLPD---WEAKFFPIGDKNLPT 237 (569) T ss_pred eeccccceEEeecCcCcceeEEEEEEecCCcceeEEEEeeccCCccchhhhhhhhhcCCccC---ceEEEEecCCCccee Confidence 2211100 0000000 0 00 00011112222211110 011110 0000 Q ss_pred -----cceeEEecccCcceeEEEEeeccchhhhhh------hhcc---------cCcceee---cccccccHHHHHHHHH Q lcl|NC_019451. 
167 -----TNQFTLVGATIGTGVLAVAKSADPQDMSTA------LGWS---------TSNVVNV---AGQAADLPDAAVAKST 223 (504) Q Consensus 167 -----~~~F~its~t~ga~s~~~~~sa~~~~ia~~------l~~t---------~~~~~~~---~g~aaet~~~al~~~~ 223 (504) ...+.+++.... ......++... .... ....... .|...++..++|+++. T Consensus 238 ~~~d~~~~~~~~t~~~~-------~~~~~~di~~~~~~~~~v~~~~~~~~~l~~~~~~~LtGG~dG~~~~~~~~~l~~le 310 (569) T protein:vir:80 238 DALEAVTKVDVKTEAVF-------VGALAGDIAKQLEYNDYVTVAVDATKPVEDFELTNLTGGSDGTAPESWANKFPLLA 310 (569) T ss_pred hhccchhheecccccee-------eehhHHHHHHhhcCCceEEEEecCCcceeeecceeecCCCCCCccchHHHHHHHHh Confidence 000111110000 00000000000 0000 0000111 1223446788899887 Q ss_pred hhccceeEEEEEeccCCHHHHHHHHHHHhhcC---CcEEEEEeccccchhHHHHHHhhh-cceeEEEEecc--------C Q lcl|NC_019451. 224 NVSNNFGSFLFAGAPLDNDQIKAVSAWNAAQN---NQFIYTVATSLANLGTLFTLVNGN-AGTALNVLSAT--------A 291 (504) Q Consensus 224 ~~~~~wy~~~~~~~~~~~~~~~a~A~w~e~~~---~~~~~~~~~~d~~~~~~~~~~~~~-~~~~~~~~~~~--------~ 291 (504) .. +|+.+.. ...+.+.+.++.+|++... ++++.++............+.... ...+...+... . T Consensus 311 ~~--~~~~i~~--~t~d~av~~~l~a~vkr~r~~g~~~~aVvg~~~~~~~~~~~~~a~~~n~e~vv~v~~~~~~~~~~g~ 386 (569) T protein:vir:80 311 NE--GGYYLVP--LTDKQAVHSEALAFVKDRTDNGDPMRIIVGGGTNETVEESITRATNLRDPRASLVGFSGTRKMDDGR 386 (569) T ss_pred hC--CcEEEEe--cCCChHHHHHHHHHHHHHHhCCCcEEEEecCCCCCCHHHHHHHHhhcCCCeEEEEecCceeecCCCc Confidence 65 4554433 2234566778999998653 334444433222222222222222 22222222211 0 Q ss_pred CCccH----HHHHHHHHHhcCcCcCCceeeecccccCcccc-ccCCHHHHHHHHhCCCeEEEEEeeccceeeEEEcCEEe Q lcl|NC_019451. 292 ANDFV----EQCPSEILAATNYDEPGASQNYMYYQFPGRNI-TVSDDTVANTVDKSRGNYIGVTQANGQQLAFYQRGILC 366 (504) Q Consensus 292 ~~~~~----~aa~~g~~as~nf~~~~g~~T~kfk~l~Gv~a-~~lt~t~~~al~~~~~n~y~~~~~~~~~~~~~~~G~~~ 366 (504) ...++ .+.+.|..++.+++. ++| ||.++++.. 
..++.+|++.+..+|++.+....+.....-..-++.++ T Consensus 387 ~~~~~~~~~aa~vAG~~A~~~~~~---S~T--~k~i~~~~i~~~lt~~e~~~li~~G~~~l~~~~~~~~~v~~~vn~itT 461 (569) T protein:vir:80 387 LLKLPGYMMASQIAGIASGLEVGE---AIT--FKHFNVTSVDRVFESSQLDMLNESGVISIEFVRNRTLTAFRVVQDVTT 461 (569) T ss_pred ceeechhhHHHHHHHHHhcCcccc---Ccc--ceeeccccccccCCHHHHHHHHhCCeEEEEEecCceEEEEEEecccee Confidence 11233 344456666655433 333 455554332 36899999999999999886544332221112244444 Q ss_pred ---CCccccchhhhhhhHHHHHHHHHHHHHHHHhcCCCCCcCHHHHHHHHHHHHHHHHHHHhcCccccccccCcccceee Q lcl|NC_019451. 367 ---GGPTDAVDMNVYANEIWLKSAIAQALLDLFLNVNAVPASSTGEAMTLAVLQPVLDKATANGTFTYGKEISAVQQQYI 443 (504) Q Consensus 367 ---~G~y~~~wiD~~~~~dwl~~~lq~~l~~l~~~~~kIPyt~~G~~~l~~~v~~vl~~a~~nG~Ia~G~~~~~~q~~~i 443 (504) .-..+|.-|-+++-.|.+.+.++..+-+.|.- | |=++.|...|++.++..|++-.+.|.|.- .+..+.| T Consensus 462 ~t~~~~~~~~~i~viRv~D~i~~dir~~~~~~yiG--k-~nn~~~r~~v~~~i~~~L~~l~~~gaI~~-~~~~dv~---- 533 (569) T protein:vir:80 462 YNDKSDPVKNEMSVGEANDFLVSELKIELDNNFIG--T-KVIDTSASLIKNFIQSFLDNKKRAREIQD-YTPEEVQ---- 533 (569) T ss_pred cCCCCCchhhhhhhhHHHHHHHHHHHHHHHhhcCc--c-cCChhHHHHHHHHHHHHHHHHHhCCcccC-CCccceE---- Confidence 21224555888888888888888776665543 4 67889999999999999999999999952 2111110 Q ss_pred ccccCCcccccceeecceEEEecchHhCCHHHHhhcccCceEEEEEECCeEEEEEeeeeeC Q lcl|NC_019451. 444 TQVTGDRRAWRQVQTLGYWINITFSSYTNSNTGLTEWKANYTLIYSKGDAIRFVEGSDVMI 504 (504) Q Consensus 444 ~~~~g~~~~~~~~~~~GY~v~~~~~~~~s~~dr~~R~~~~i~~~~~~aGAIh~v~i~~~~v 504 (504) +.+ .+| |. 
-+.+.+..--++++|-++.+.- T Consensus 534 -------------------v~~-------~~d---~~--~v~~~v~Pv~~~ekI~~ti~~~ 563 (569) T protein:vir:80 534 -------------------VVL-------EGD---VA--SISMTVMPIRSLNKITVQLVYK 563 (569) T ss_pred -------------------EEe-------cCC---EE--EEEEEEEEcccccEEEEEEEEe Confidence 100 011 22 3677777888888888887776 No 29 >protein:vir:1172 Length: 391 # NCBI annotation: hypothetical protein # Family: family:all:115 # MgeID: mge:24 # MgeName: phi CTX # Cross-refs: genbank:acc:NP_490621;genbank:gi:17313241;genbank:GeneID:927300 Probab=98.25 E-value=2e-06 Score=51.80 Aligned_cols=362 Identities=11% Similarity=0.028 Sum_probs=194.8 Q ss_pred CCCccceEEEeeeecc---cccccccccceEEEeccc-----ccCccceEEecCHHHHHHhcCCCcHHHHHHHHHhccCC Q lcl|NC_019451. 1 MISQSRYIRIISGVGA---GAPVAGRKLILRVMTTNN-----VIPPGIVIEFDNANAVLSYFGAQSEEYQRAAAYFKFIS 72 (504) Q Consensus 1 mip~s~iv~V~~~v~~---~~~~~~~~~~~l~l~~~~-----~~~~~r~~~y~s~~~V~~~Fg~~s~ey~aA~~yF~~~~ 72 (504) |-+-...--|.+.... ............|.+... .+|...-...++..+-...||.....+.+...+|.+. T Consensus 1 M~~~~~~~GV~v~e~~~~~~~i~~v~tavig~vg~a~~a~~~~~~~~~p~~v~s~~~~~~~~g~~~tl~~al~~~~~~~- 79 (391) T protein:vir:11 1 MAADQYHHGVRVQEINDGTRPIRTIATAIIGLVATAEDADATAFPLDTPVLITNVQAAIGKAGTSGTLPASLQAIADQA- 79 (391) T ss_pred CCCCcCCCcEEEEECCCCcceecccCCceeEEEEecCCCCCccccccccEEEecchhhheecCCCccchhhhhhhhccc- Confidence 6654443333332222 222233344555554443 3444333345666666677999999999999999873 Q ss_pred CCCcccceEEEEeeeccCCcceeeecccchhhHhHhhccCceEEEEEcccceeeeeeccccccchHHHHHHHHhhhhccc Q lcl|NC_019451. 73 KSVNSPSSISFARWVNTAIAPMVVGDNLPKTIADFAGFSAGVLTIMVGAAEQNITAIDTSAATSMDNVASIIQTEIRKNA 152 (504) Q Consensus 73 ~~~~~P~~l~igr~~~~a~~~~l~g~~~~~~~~~~~~~~~g~~titi~g~~~~~~~i~~s~ats~~~vA~~i~~~i~a~~ 152 (504) ....++-+.... .....+..+.. |.. + +.....+.....+ +. 
T Consensus 80 -----g~~~~vv~~~~~--------~~~~~t~~d~~----g~~----~------------a~~~~~g~~a~~~----~~- 121 (391) T protein:vir:11 80 -----NAATVVVRVKPG--------EDEAATNSAVI----GGV----S------------ADGKYTGMKALLA----AK- 121 (391) T ss_pred -----cceeEEeeeccc--------ccccccchhhh----ccc----c------------cccchhhhhhhhh----hh- Confidence 334566554211 00000000000 000 0 0000000000000 00 Q ss_pred ccccceeEEEEecccceeEEecccCcceeEEEEeeccchhhhhhhhcccCcceeecccccccHHHHHHHHHhhccceeEE Q lcl|NC_019451. 153 DPQLAQATVTWNQNTNQFTLVGATIGTGVLAVAKSADPQDMSTALGWSTSNVVNVAGQAADLPDAAVAKSTNVSNNFGSF 232 (504) Q Consensus 153 ~~~~~~a~vt~d~~~~~F~its~t~ga~s~~~~~sa~~~~ia~~l~~t~~~~~~~~g~aaet~~~al~~~~~~~~~wy~~ 232 (504) ..+... +.....++........++..+.+. .-.+ T Consensus 122 ---------------~~~~~~----------------------------p~~~~ap~~~~~~v~~al~~~~~~---~~~~ 155 (391) T protein:vir:11 122 ---------------ARLGVV----------------------------PRILGVPGLDTQPVATALIAIAQQ---LRAF 155 (391) T ss_pred ---------------hhheec----------------------------cccccccccccHHHHHHHHHhhcc---cceE Confidence 000000 000000111111223334333332 3345 Q ss_pred EEEeccCCHHHHHHHHHHHhhcCCcEEEEEeccccchhHHHHHHhhhcceeEEEEeccCCCccHHHHHHHHHHhcCcCcC Q lcl|NC_019451. 233 LFAGAPLDNDQIKAVSAWNAAQNNQFIYTVATSLANLGTLFTLVNGNAGTALNVLSATAANDFVEQCPSEILAATNYDEP 312 (504) Q Consensus 233 ~~~~~~~~~~~~~a~A~w~e~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~aa~~g~~as~nf~~~ 312 (504) .+.+.+.. ....++-+|-+.-+..+..+++-.--.. ...... .. ..-+.+.+.|..+.+|.+.- T Consensus 156 ~i~D~p~~-~t~~~a~~~r~~~~s~~~~~~~p~~~~~-------~~~~~~-~~-------~~p~s~~~ag~~a~~d~~~g 219 (391) T protein:vir:11 156 AYVSASGC-KTKEEATAYRENFAAREAMVIWPDFLTW-------STVVNQ-TV-------PAPAVAQALGLRARIDQEVG 219 (391) T ss_pred EEEEcCCC-CCHHHHHHHhhhcCCceEEEEcCcceec-------ccccCc-eE-------EechHHHHHHHHHHhhccCC Confidence 55554322 2233444555554555544443211000 000000 00 11245666777777763321 Q ss_pred CceeeecccccCccccc--------cCCHHHHHHHHhCCCeEEEEEeeccceeeEEEcCEEeCCccccchhhhhhhHHHH Q lcl|NC_019451. 
313 GASQNYMYYQFPGRNIT--------VSDDTVANTVDKSRGNYIGVTQANGQQLAFYQRGILCGGPTDAVDMNVYANEIWL 384 (504) Q Consensus 313 ~g~~T~kfk~l~Gv~a~--------~lt~t~~~al~~~~~n~y~~~~~~~~~~~~~~~G~~~~G~y~~~wiD~~~~~dwl 384 (504) =......|.+.||..- ..++.|.+.|..+|+|.. +.+.| + .++.+.++++.-.|.+|-+.+-.+|+ T Consensus 220 -~~~span~~l~gi~~~~~~~~~~~~~~~~~~~~Ln~~gi~~~--~~~~G--~-~~wG~rT~~~d~~~~~i~vrR~~~~i 293 (391) T protein:vir:11 220 -WHKTLSNVAVNGVTGISADVFWDLQSPSTDANYLNENEVTTL--VQEGG--F-RFWGSRTCSDDPLFAFENYTRTAQVL 293 (391) T ss_pred -cEEccCCceeeceeecccccccccCCCcchhhhhhhcCcEEE--EcCCC--E-EEEcccccCCCcccceeehhhHHHHH Confidence 1222335666666532 235789999999999986 33333 3 45788889888788889999999999 Q ss_pred HHHHHHHHHHHHhcCCCCCcCHHHHHHHHHHHHHHHHHHHhcCccccccccCcccceeeccccCCcccccceeecceEEE Q lcl|NC_019451. 385 KSAIAQALLDLFLNVNAVPASSTGEAMTLAVLQPVLDKATANGTFTYGKEISAVQQQYITQVTGDRRAWRQVQTLGYWIN 464 (504) Q Consensus 385 ~~~lq~~l~~l~~~~~kIPyt~~G~~~l~~~v~~vl~~a~~nG~Ia~G~~~~~~q~~~i~~~~g~~~~~~~~~~~GY~v~ 464 (504) ++.|+..+...+-. |-++.=...|+..++.-|+.-+++|.|. ||.+. T Consensus 294 ~~~~~~~~~~~v~e----~n~~~~~~~i~~~i~~~l~~l~~~g~l~-----------------------------g~~~~ 340 (391) T protein:vir:11 294 ADTIAEAHMWAVDK----PMHPSLVRDILEGVNAKFRELKGLGLII-----------------------------DAQAW 340 (391) T ss_pred HHHHHHHHHHhccC----CCCHHHHHHHHHHHHHHHHHHHhcccee-----------------------------ceEEE Confidence 99999888764322 5688889999999999999999999986 34454 Q ss_pred ecchHhCCHHHHhhcccCceEEEEEECCeEEEEEeeeeeC Q lcl|NC_019451. 465 ITFSSYTNSNTGLTEWKANYTLIYSKGDAIRFVEGSDVMI 504 (504) Q Consensus 465 ~~~~~~~s~~dr~~R~~~~i~~~~~~aGAIh~v~i~~~~v 504 (504) .. .+..+++|+.+.+. .+.+.+.-...++.|++..... 
T Consensus 341 ~~-~~~n~~~~i~~G~~-~~~i~~~p~~p~e~i~~~~~~~ 378 (391) T protein:vir:11 341 YD-PNVNDKDTLKAGKL-RITYDYTPVPPLEDLTFFQKIT 378 (391) T ss_pred Ee-cCCCCHHHhhCCeE-EEEEEEEecCCcceEEEEEEEc Confidence 43 36778888887776 5899999999999999998877 No 30 >protein:vir:98824 Length: 774 # NCBI annotation: putative phage tail sheath protein # Family: family:all:661 # MgeID: mge:1530 # MgeName: Ma-LMM01 # Cross-refs: genbank:acc:YP_851105;genbank:gi:117530262;genbank:GeneID:4484488 Probab=98.23 E-value=9.4e-07 Score=53.64 Aligned_cols=442 Identities=9% Similarity=-0.018 Sum_probs=173.9 Q ss_pred CCCccce-EE-----Eeeeecccc---ccc-ccccceEEEecccccCccceEEecCHHHHHHhcC----CCcHHHHHHHH Q lcl|NC_019451. 1 MISQSRY-IR-----IISGVGAGA---PVA-GRKLILRVMTTNNVIPPGIVIEFDNANAVLSYFG----AQSEEYQRAAA 66 (504) Q Consensus 1 mip~s~i-v~-----V~~~v~~~~---~~~-~~~~~~l~l~~~~~~~~~r~~~y~s~~~V~~~Fg----~~s~ey~aA~~ 66 (504) .=|...| |+ |-+...+.+ +.. ....+.-|++....=|..+-..-+|..|....|| --.....|-.. T Consensus 273 ~~~~~~~~~~v~~~GVYVEEVpSGvrtIeGGV~TSVAAFVG~A~rGPvn~PvlITS~aD~~~~Fg~~~GGl~GassA~r~ 352 (774) T protein:vir:98 273 VEPFGEITRNVEDNGVVIQLEPALTGSISNRFSFYVTANDNTANRGFTTSPALVTTIPDPAIHFTSFQGGLDGPRSAFRD 352 (774) T ss_pred cccccceEEEEecCceEEEEeCCCCccccccccceeeeecccccCCCCCcCEEEeehhHhhhhhccccCCccccceeeee Confidence 1121111 11 122222221 111 2345666776665556655555566667444453 22111111111 Q ss_pred HhccCCCCCcccceEEE-EeeeccCCcceeeecccchhhHhHhhccCceEEEEEcccceeeeeeccccccch--HHHHHH Q lcl|NC_019451. 67 YFKFISKSVNSPSSISF-ARWVNTAIAPMVVGDNLPKTIADFAGFSAGVLTIMVGAAEQNITAIDTSAATSM--DNVASI 143 (504) Q Consensus 67 yF~~~~~~~~~P~~l~i-gr~~~~a~~~~l~g~~~~~~~~~~~~~~~g~~titi~g~~~~~~~i~~s~ats~--~~vA~~ 143 (504) ++.-.. .| .|.+ ++. ++. .|-.+...+. ....+.+.+.+.-....... +......+ ...... 
T Consensus 353 ~~~~sG----~~-~L~i~A~~-----pGa-wGN~ItV~I~---~~t~~~~~l~v~~~~~s~f~-~~~a~e~~tv~~~~~~ 417 (774) T protein:vir:98 353 FYTFNG----TP-LLRLQAVS-----EGN-WGNQVTVSIY---PVNNSEFRLNVQDLNGSAFN-PPLADEVYTVKLGDTN 417 (774) T ss_pred eeeecc----cc-eEEEEEee-----cCc-CCCceEEEEE---ecCCceeEEEEEecCCcccc-ccccceeEEEeccccc Confidence 111111 11 1111 111 000 0111111110 00111111111100000000 00000000 000000 Q ss_pred HHhhhhcccccccc-e--eEEEEeccc--ceeEEecccCcceeEEEEeeccchhhhhhhhcccC--cceee-cccc-ccc Q lcl|NC_019451. 144 IQTEIRKNADPQLA-Q--ATVTWNQNT--NQFTLVGATIGTGVLAVAKSADPQDMSTALGWSTS--NVVNV-AGQA-ADL 214 (504) Q Consensus 144 i~~~i~a~~~~~~~-~--a~vt~d~~~--~~F~its~t~ga~s~~~~~sa~~~~ia~~l~~t~~--~~~~~-~g~a-aet 214 (504) ....+.+....... . .....+... ..|...+....... .......+ .++..-..... ..+.. .|.+ +++ T Consensus 418 ~~~~v~e~~dn~~i~~~~~~~~~~~in~vs~lv~~~~~~~a~~-d~~~~~~~-~~~~~~~~~~~~~v~v~lagG~Dg~~t 495 (774) T protein:vir:98 418 ESGELNALLDSKFIRGFFLPKSIDSINYDAALVRQSPLRLAPP-DESETDVE-NPAHVDFYGPNVLVDVTLENGYDGPPV 495 (774) T ss_pred ccceeeeeeceeeEeecccccccccccccccccccchhccccc-cccccccc-ccccccccCCcceEEEeecCCCCcccc Confidence 00000000000000 0 000000000 00000000000000 00000000 00000000000 00001 1111 111 Q ss_pred HHHHHHHHHh--hccceeEEEEEeccCCHHHHHHHHHHHhhc---CCcEEEEEecc-ccchhHHHHHHhhh-cceeEEEE Q lcl|NC_019451. 215 PDAAVAKSTN--VSNNFGSFLFAGAPLDNDQIKAVSAWNAAQ---NNQFIYTVATS-LANLGTLFTLVNGN-AGTALNVL 287 (504) Q Consensus 215 ~~~al~~~~~--~~~~wy~~~~~~~~~~~~~~~a~A~w~e~~---~~~~~~~~~~~-d~~~~~~~~~~~~~-~~~~~~~~ 287 (504) ..+.+....+ ....++.+..... ......++..+++.. .+..+.++... +.+......-.... 
..+...++ T Consensus 496 t~~~igg~~~~~~~tgi~aLl~a~~--~~~V~~aii~~~e~~~~~~~~r~avid~p~g~t~~~Ai~~r~~f~S~~aal~~ 573 (774) T protein:vir:98 496 TNDDYVSIIRTLENQPVHILLVGTT--NVGVQQALITEAERASDSDGLRIAVLAAPPRTTPTLAASVTRGFNSTRAVMVA 573 (774) T ss_pred cchheecccccccccceeEEEcCcc--chhhHHHHHHHHHHhhhcccceEEEEECCCCCCHHHHHHHHhccCCceEEEEe Confidence 1111211111 1235554443222 233344555555532 23333333222 22221111111111 11222233 Q ss_pred eccC-----C----CccHHHHHHHHHHhcCcCcCCceeeecccccCccc--------cccCCHHHHHHHHhCCCeEEE-E Q lcl|NC_019451. 288 SATA-----A----NDFVEQCPSEILAATNYDEPGASQNYMYYQFPGRN--------ITVSDDTVANTVDKSRGNYIG-V 349 (504) Q Consensus 288 ~~~~-----~----~~~~~aa~~g~~as~nf~~~~g~~T~kfk~l~Gv~--------a~~lt~t~~~al~~~~~n~y~-~ 349 (504) +... . ...|.+.++|.++.+|+...+ ..|.+.|+. .+..++.+.+.|..+++|..+ . T Consensus 574 Pwvkv~D~~~g~~~~vPpSg~vAGl~ArtDv~kSP-----ANk~I~Givg~ai~~~l~~~~t~ae~d~Ln~~gIN~i~it 648 (774) T protein:vir:98 574 GWFTYAGQPNSSRYGVPGAAVYAGKLAAIDFFVSP-----AARSLVGPLFNIIESDTDNYTSRSNQDIYSAARLEVLSLD 648 (774) T ss_pred CcEEEeccCCCceeecChhHHHHHHHHhcCccccc-----CCceeecceeccccccccccccchhhhhhcccccceeEEE Confidence 2110 1 123567888999988864433 355666654 223467788889999999886 3 Q ss_pred EeeccceeeEEEcCEEeCCccccchhhhhhhHHHHHHHHHHHHHHHHhcCCCCCcCHHHHHHHHHHHHHHHHHHHhcCcc Q lcl|NC_019451. 350 TQANGQQLAFYQRGILCGGPTDAVDMNVYANEIWLKSAIAQALLDLFLNVNAVPASSTGEAMTLAVLQPVLDKATANGTF 429 (504) Q Consensus 350 ~~~~~~~~~~~~~G~~~~G~y~~~wiD~~~~~dwl~~~lq~~l~~l~~~~~kIPyt~~G~~~l~~~v~~vl~~a~~nG~I 429 (504) +.+.| . .++.+.++++.-.|.+|-+.+-.+|+++.|+..+.... 
.+ |.|+.....|+..++.-|+.-++.|.| T Consensus 649 t~g~G--~-rvWG~RTlssDp~wr~InVRRlfd~Ie~SI~~~~~~~V---fE-PNd~~l~~~I~~sI~~fL~~L~~~GaL 721 (774) T protein:vir:98 649 TVDRT--Y-RFASGVTLSTDPAWERIYLRRVHDVVRQGAHAILRNYV---AM-PNSRLVRNQIAAALNAFMGELKRNGNI 721 (774) T ss_pred EcCCc--E-EEEcccccCCCcccceEeehhhHHHHHHHHHHHHHHhc---cC-CCCHHHHHHHHHHHHHHHHHHHhCCce Confidence 54444 3 34677778887789999999999999999998877643 23 679999999999999999999999999 Q ss_pred ccccccCcccceeeccccCCcccccceeecceE-EEecchHhCCHHHHhhcccCceEEEEEECCeEEEEEeeeeeC Q lcl|NC_019451. 430 TYGKEISAVQQQYITQVTGDRRAWRQVQTLGYW-INITFSSYTNSNTGLTEWKANYTLIYSKGDAIRFVEGSDVMI 504 (504) Q Consensus 430 a~G~~~~~~q~~~i~~~~g~~~~~~~~~~~GY~-v~~~~~~~~s~~dr~~R~~~~i~~~~~~aGAIh~v~i~~~~v 504 (504) .- |+ +.++ .+..+++++.+.+. -+.+.+...-.+++|.++-.-- T Consensus 722 ~G-----------------------------~~~V~~D-~etNt~~dI~~G~l-~i~I~vaP~~PAEfIilri~q~ 766 (774) T protein:vir:98 722 VS-----------------------------FRPAIID-GSNNSTAAYFSREL-YVSLQFQPLYSADYIYVTISRD 766 (774) T ss_pred ec-----------------------------ceEEEEc-CCCCCHHHhhCCEE-EEEEEEEecCCcceEEEEEEEe Confidence 63 22 2222 24455556555444 3566666666666666543322 No 31 >protein:vir:100829 Length: 607 # NCBI annotation: hypothetical protein # Family: family:all:2449 # MgeID: mge:1633 # MgeName: LP65 # Cross-refs: genbank:acc:YP_164738;genbank:gi:56693151;genbank:GeneID:3197462 Probab=98.15 E-value=3.5e-06 Score=50.47 Aligned_cols=450 Identities=10% Similarity=0.049 Sum_probs=211.5 Q ss_pred CCCccceE------EEee-eecccccccccccceEEEecccccCccceEEecCHHHHHHhcCCCcHHHHHHHHHhccCCC Q lcl|NC_019451. 1 MISQSRYI------RIIS-GVGAGAPVAGRKLILRVMTTNNVIPPGIVIEFDNANAVLSYFGAQSEEYQRAAAYFKFISK 73 (504) Q Consensus 1 mip~s~iv------~V~~-~v~~~~~~~~~~~~~l~l~~~~~~~~~r~~~y~s~~~V~~~Fg~~s~ey~aA~~yF~~~~~ 73 (504) .-|..+++ .+.- ++.+.++ ...+.+.|++....=|++++..+++.++.-+-||.. +-..+..+.|.-++. 
T Consensus 14 ~~~~~~~~~pgv~~~~~~~~~~~~~~--~~~~~~~~iG~a~~G~~~~~~~~~~~~~a~~~f~~g-~l~~a~~~a~~~~~~ 90 (607) T protein:vir:10 14 IYPLFYDSRPHVETNFDDSRLSNTAS--DSAKNIFMLGSATNGDPTKVYEIRTSQQATKIFGSG-DLVDGIKLAFDPTGN 90 (607) T ss_pred HhCCCCccCCceEEEEecCcCcCCCC--CCcceEEEEEEeCCCCCceEEEEcchhHHHHhhcCc-chHHHHHHhhccccC Confidence 33333333 3322 3333333 357888888887777889999999999999999875 456677888976666 Q ss_pred CCcccceEEEEeeecc-CCcceeeecccc------------hhh----HhHhhcc------------------------- Q lcl|NC_019451. 74 SVNSPSSISFARWVNT-AIAPMVVGDNLP------------KTI----ADFAGFS------------------------- 111 (504) Q Consensus 74 ~~~~P~~l~igr~~~~-a~~~~l~g~~~~------------~~~----~~~~~~~------------------------- 111 (504) ....++.+|.-|...+ +..+..-|...+ -.+ ..-+.++ T Consensus 91 ~~~g~~~~~~~rv~~~~~a~~~~~~~~~~~~~~~~~~~~i~~~l~~~~~~~~~~~~~~~~d~~~~~~~n~g~~~~i~y~g 170 (607) T protein:vir:10 91 SVTNGGTVYALRVDNAKQASLVKDGLTFTSSIFGTNANQVSVALDNDVFGVPRITVNYSPDNYERTYTNIGQMFSITYSG 170 (607) T ss_pred CccCCceEEEEeCCCccccceecccccccccccccCCCceEEEEEecCCCccceeEEeecccceeeeeeccceeecccCc Confidence 6678999999997442 111111111100 000 0000000 Q ss_pred ---CceEEEEEc--ccceeeeeeccccc--c------------chHHHHHHHHhhhhcccccc-----cceeEEE-Eecc Q lcl|NC_019451. 112 ---AGVLTIMVG--AAEQNITAIDTSAA--T------------SMDNVASIIQTEIRKNADPQ-----LAQATVT-WNQN 166 (504) Q Consensus 112 ---~g~~titi~--g~~~~~~~i~~s~a--t------------s~~~vA~~i~~~i~a~~~~~-----~~~a~vt-~d~~ 166 (504) ...++|..+ |..+.++ +..... . .+.-+..+++. |+..+... ....... .|.. T Consensus 171 ~~~~a~~~v~~~~~g~~~~lt-~~~~~~~~~~~~V~~~~l~~~~~~t~~~l~~d-in~~~~~~A~~~g~~~i~tky~d~~ 248 (607) T protein:vir:10 171 KSASAGYTVSHDTDGKAILLT-LGSGDSIDKLTNVATFDLTMSKYDTIAKLMQA-ISATPNFSASVVGSPSVNTSYLDEV 248 (607) T ss_pred ccccccceeeecCCCceeEEE-ecCCCccceeeeeecccccccccchHHHHHHH-hhcCCceEEEEecccceeeeccccc Confidence 001222222 3322221 111000 0 01112222221 22211100 0000111 1222 Q ss_pred cceeEEecccC------cc--------eeEEEEeeccchhhhh----------hhhcc--------cCcceeeccc---c Q lcl|NC_019451. 
167 TNQFTLVGATI------GT--------GVLAVAKSADPQDMST----------ALGWS--------TSNVVNVAGQ---A 211 (504) Q Consensus 167 ~~~F~its~t~------ga--------~s~~~~~sa~~~~ia~----------~l~~t--------~~~~~~~~g~---a 211 (504) ...|.++.... +. .............+.. ....+ ..+.....|. . T Consensus 249 ~~~i~V~~~~~iv~a~~~D~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~a~~~LtGGtdG~~ 328 (607) T protein:vir:10 249 TSPVDVKTAPAVVTAKIGDAISKLGYDPYVVVTQTSNNKPIVNGVSAGTGSATASVTTAPESFPANFDTAFLTGGSTGDV 328 (607) T ss_pred cceeEEEEeeeeechhhhhhhhcccccceEEeeecccchhhhhhhhccccceeeeeeccccccccccceeeeeCCCCCCc Confidence 23333332100 00 0000000000000000 00000 0011112222 2 Q ss_pred cccHHHHHHHHHhhccceeEEEEEeccCCHHHHHHHHHHHhhc---CCcEEEEEeccccchhHHHHHHhhh-cceeEEEE Q lcl|NC_019451. 212 ADLPDAAVAKSTNVSNNFGSFLFAGAPLDNDQIKAVSAWNAAQ---NNQFIYTVATSLANLGTLFTLVNGN-AGTALNVL 287 (504) Q Consensus 212 aet~~~al~~~~~~~~~wy~~~~~~~~~~~~~~~a~A~w~e~~---~~~~~~~~~~~d~~~~~~~~~~~~~-~~~~~~~~ 287 (504) .++..++++++.+. +|+.+... . .+.+.+.++.+|++.. .+++..++............+.... ...+...+ T Consensus 329 ~~ty~dal~aLe~~--e~~~i~~~-t-~d~ai~~~l~a~vkr~~~~g~~~~aVlg~~~~~t~~~~~t~a~~~N~ervv~V 404 (607) T protein:vir:10 329 PVSWADKFNGAIGN--NVYYIIPL-T-SEENIHAELQAFIDEQHVLGYNYHAFVGGGFAEPLEQILSRQVNINDSRFGLV 404 (607) T ss_pred hhhHHHHHHHHhhc--CceEEEec-C-CCHHHHHHHHHHHHHHHhCCCcEEEEecCCCCCCHHHHHHHHHhhCCCcEEEE Confidence 34567888888765 56655442 2 2455567899998644 3445444433222222222222222 22222221 Q ss_pred ec----c-------CCCccHHHHHHHHHHhcCcCcCCceeeecccccC--ccccccCCHHHHHHHHhCCCeEEEEEeecc Q lcl|NC_019451. 288 SA----T-------AANDFVEQCPSEILAATNYDEPGASQNYMYYQFP--GRNITVSDDTVANTVDKSRGNYIGVTQANG 354 (504) Q Consensus 288 ~~----~-------~~~~~~~aa~~g~~as~nf~~~~g~~T~kfk~l~--Gv~a~~lt~t~~~al~~~~~n~y~~~~~~~ 354 (504) .. . ++..+.++.+.|..++.+.+ .++| ||.++ ++.+ .++.+|++.+..+|+..+....... 
T Consensus 405 ~~~~~~~~~G~~~~~~~~~~Aa~vAGl~Ag~~~~---~SlT--~k~i~~~~v~~-~lt~~e~e~ai~~Gv~~l~~~~~~~ 478 (607) T protein:vir:10 405 GQSGHVQEGGESVHVPAYLMAAYVGGLSSSLGVA---VPIT--NKKLALVDLDQ-NFSGDDLNTLNQNGVIGIEHLVNRN 478 (607) T ss_pred ecCeeEeeCCcceeccHHHHHHHHHHHHhcCccc---cCcc--cceeccccccc-cCCHHHHHHHHhCCeEEEEEccCcc Confidence 11 0 11112344455666666543 3333 44444 4443 6999999999999998775433221 Q ss_pred cee-eEEEcCEEeCC---ccccchhhhhhhHHHHHHHHHHHHHHHHhcCCCCCcCHHHHHHHHHHHHHHHHHH--HhcCc Q lcl|NC_019451. 355 QQL-AFYQRGILCGG---PTDAVDMNVYANEIWLKSAIAQALLDLFLNVNAVPASSTGEAMTLAVLQPVLDKA--TANGT 428 (504) Q Consensus 355 ~~~-~~~~~G~~~~G---~y~~~wiD~~~~~dwl~~~lq~~l~~l~~~~~kIPyt~~G~~~l~~~v~~vl~~a--~~nG~ 428 (504) ..- -.+.+|..+=+ ...|..|-+++-+|.+.+.++..+-+.|. +|++. +.....++..+...|..- ...|. T Consensus 479 ~~~~vrIv~~ItT~t~~~~~~~~~i~viRv~D~i~~dir~~~~~~yI--Gk~nn-d~~~~~vk~~i~~~L~~~~l~~~ga 555 (607) T protein:vir:10 479 ATGGYYIVQDVSTNTVSSSHVDGSLYLGELTDFLFDNLRFVLRDTYI--GSNIR-STSADDIKSTVASYLYSEMNNDDGL 555 (607) T ss_pred ccceEEEeeeeeeccCCCCcchheeehhhhHHHHHHHHHHHHhhcCC--cccCC-cchHHHHHHHHHHHHHHHHHHhcCc Confidence 111 12345554421 22455588888899888888877766554 34444 456677888888887443 34566 Q ss_pred cccccccCcccceeeccccCCcccccceeecceEEEecchHhCCHHHHhhcccCceEEEEEECCeEEEEEeeeeeC Q lcl|NC_019451. 429 FTYGKEISAVQQQYITQVTGDRRAWRQVQTLGYWINITFSSYTNSNTGLTEWKANYTLIYSKGDAIRFVEGSDVMI 504 (504) Q Consensus 429 Ia~G~~~~~~q~~~i~~~~g~~~~~~~~~~~GY~v~~~~~~~~s~~dr~~R~~~~i~~~~~~aGAIh~v~i~~~~v 504 (504) |. +.... + +.++ ..+| | .-+.+.+..-.+|++|-++.+.. 
T Consensus 556 I~-df~~e------------------d-------v~v~-----~~~D---~--v~v~~~v~Pv~~iekIyvtv~v~ 595 (607) T protein:vir:10 556 IV-DFSES------------------D-------IVVT-----ISGT---V--VYIQFAVAPTQEIKNIVVSGTYS 595 (607) T ss_pred ee-CCCcc------------------c-------cEEe-----eCCC---E--EEEEEEEEEcccceEEEEEEEEE Confidence 64 11110 0 1111 0112 2 23788888999999999998888 No 32 >protein:vir:102359 Length: 356 # NCBI annotation: XkdK protein # Family: family:all:632 # MgeID: mge:1566 # MgeName: phi CD119 # Cross-refs: genbank:acc:YP_529565;genbank:gi:90592650;genbank:GeneID:3974491 Probab=98.11 E-value=4.4e-06 Score=49.97 Aligned_cols=328 Identities=10% Similarity=0.058 Sum_probs=163.9 Q ss_pred EcccceeeeeeccccccchHH-HHHHHHhhhhcccccccceeEEEEecccceeEEecccCcceeEEEEeeccchhhhhhh Q lcl|NC_019451. 119 VGAAEQNITAIDTSAATSMDN-VASIIQTEIRKNADPQLAQATVTWNQNTNQFTLVGATIGTGVLAVAKSADPQDMSTAL 197 (504) Q Consensus 119 i~g~~~~~~~i~~s~ats~~~-vA~~i~~~i~a~~~~~~~~a~vt~d~~~~~F~its~t~ga~s~~~~~sa~~~~ia~~l 197 (504) ..|-+ +.--.|-. .+++++..... -.+-+-.|....-..++..+.- ......+....+ T Consensus 1 ~~glp--------~i~i~f~~~a~ta~~~g~rG------iv~~il~d~~~~~~~~~~~~~v-------~~~~~~~n~~~i 59 (356) T protein:vir:10 1 MAGLV--------NINIEFKELATSFIQRSKAG------IVAIILKDTTKMYKELTSEDDI-------PISLSADNKKYI 59 (356) T ss_pred CCCCC--------ceeEEEeecceeeccCCccc------eEEEEEecCCcceeEEeccccc-------hhHHHHHHHHHH Confidence 12211 11111111 11111111100 0111222322221222221110 000001111122 Q ss_pred hccc-----------Ccce-eecccccccHHHHHHHHHhhccceeEEEEEeccCCHHHHHHHHHHHhhc----CCcEEEE Q lcl|NC_019451. 198 GWST-----------SNVV-NVAGQAADLPDAAVAKSTNVSNNFGSFLFAGAPLDNDQIKAVSAWNAAQ----NNQFIYT 261 (504) Q Consensus 198 ~~t~-----------~~~~-~~~g~aaet~~~al~~~~~~~~~wy~~~~~~~~~~~~~~~a~A~w~e~~----~~~~~~~ 261 (504) .+.- +..+ ...+...++..++|+++.....|| +.+ ...+++++..+++|+... 
.+++..+ T Consensus 60 ~~~~~g~~~~~~~~~p~~~~~~~~~t~~~y~~aL~~le~~~fn~--l~~--~~~d~~~~~~~~a~ikr~r~~~~~~~~~V 135 (356) T protein:vir:10 60 KYGFVGATDNEKVLRPSKVIISTFTEDGKVEDILEELESVEFNY--LCM--PEAIEAEKTKIVTWIKKIREEESTEAKAV 135 (356) T ss_pred HHHhhccccccccccceeeeeecccCchhHHHHHHHhcCccceE--EEe--cCCChHHHHHHHHHHHHHHhcCCcEEEEE Confidence 1110 0001 111224578999999998776665 433 334678889999999843 3444444 Q ss_pred EeccccchhHHHHHHhhhcceeEEEEeccCCCccHHHHHHHHHHhcCcCcCCceeeecccccCccccc-cCCHHHHHHHH Q lcl|NC_019451. 262 VATSLANLGTLFTLVNGNAGTALNVLSATAANDFVEQCPSEILAATNYDEPGASQNYMYYQFPGRNIT-VSDDTVANTVD 340 (504) Q Consensus 262 ~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~aa~~g~~as~nf~~~~g~~T~kfk~l~Gv~a~-~lt~t~~~al~ 340 (504) .....++.+--+. + ...... ....++..-..+.+.|..++.+.++ + +-|+.++++... .++.+|++.+. T Consensus 136 ~~~~~aD~EgIIn-v--~n~~~~--~g~~~t~~~~~~~vAG~~Ag~~~n~---S--~T~~~~~~~~~~~~~t~~e~~~ai 205 (356) T protein:vir:10 136 LANIKADNEAIIN-F--TENVVV--DGEEITAEKYTTRVASLIASTPNTQ---S--ITYAPLDEVESIVKIDKASADAKV 205 (356) T ss_pred ecCCCCCCceeEE-e--ecCeEe--cceeechhHHHHHHHHHHhccchhc---c--ccceecCCccccccCCHHHHHHHH Confidence 4333322221111 0 011111 1111122222345556777665433 3 345677776533 58899999999 Q ss_pred hCCCeEEEEEeeccceeeEEEcCEEe----C---CccccchhhhhhhHHHHHHHHHHHHHHHHhcCCCCCcCHHHHHHHH Q lcl|NC_019451. 341 KSRGNYIGVTQANGQQLAFYQRGILC----G---GPTDAVDMNVYANEIWLKSAIAQALLDLFLNVNAVPASSTGEAMTL 413 (504) Q Consensus 341 ~~~~n~y~~~~~~~~~~~~~~~G~~~----~---G~y~~~wiD~~~~~dwl~~~lq~~l~~l~~~~~kIPyt~~G~~~l~ 413 (504) ++|.-.+. ..+ .. -.+.+|+.+ + ++ +|.=|-+++.+|-+.+.++..+-+.+. +|+|=+..|..++. 
T Consensus 206 ~~G~lvl~-~d~--~~-V~I~~~VNSltt~t~~k~~-~f~Kirvvr~~D~i~~Di~~~f~~~yi--GKv~N~~dgr~~l~ 278 (356) T protein:vir:10 206 QAGELILR-RLS--GK-IRIARGINSLTTLTAEKGE-IFQKIKLVDTKDLISKDIKNIYVEKYL--RKCPNTYDNKCLFI 278 (356) T ss_pred hCCeEEEE-EEc--Ce-EEEEecCccceecCCCCCc-chhhhHHHHHHHHHHHHHHHHHhhccc--cccCCCHHHHHHHH Confidence 99877663 222 22 344566533 1 33 466688888888777777654433332 79999999999999 Q ss_pred HHHHHHHHHHHhcCccccccc---cCcccceeeccccCCcccccceeecceEEEecchHhCCHHHHhhcccCceEEEEEE Q lcl|NC_019451. 414 AVLQPVLDKATANGTFTYGKE---ISAVQQQYITQVTGDRRAWRQVQTLGYWINITFSSYTNSNTGLTEWKANYTLIYSK 490 (504) Q Consensus 414 ~~v~~vl~~a~~nG~Ia~G~~---~~~~q~~~i~~~~g~~~~~~~~~~~GY~v~~~~~~~~s~~dr~~R~~~~i~~~~~~ 490 (504) +.+..-+++-.+.|+|.++.. .-+.|++++.. .|.+.. ...++.-.........-+....+. T Consensus 279 ~ai~~y~~~L~~~~~I~~~~~~eid~e~q~~~~~~-~g~d~~--------------~~~d~~v~~~~~~~~v~~~~~v~~ 343 (356) T protein:vir:10 279 VAVQSYLTELAKQELIDSNFTVEIDLEKQKEYLEG-KKIAVS--------------KMKENEIKEANTGSNGFYLINLKL 343 (356) T ss_pred HHHHHHHHHHHhCCccccCceeEecccchHHHhhh-cccccc--------------ccccceeecccCCcEEEEEEEEEE Confidence 999999999999999988643 22445544331 121111 011111111222334447777888 Q ss_pred CCeEEEEEeeeee Q lcl|NC_019451. 491 GDAIRFVEGSDVM 503 (504) Q Consensus 491 aGAIh~v~i~~~~ 503 (504) -.|+.++.++-.+ T Consensus 344 vdamE~iy~ti~v 356 (356) T protein:vir:10 344 VDAMEDINIRVQM 356 (356) T ss_pred EeeeeeEEeEEeC Confidence 8999999999888 No 33 >protein:vir:2035 Length: 396 # NCBI annotation: gpFI # Family: family:all:115 # MgeID: mge:315 # MgeName: P2 # Cross-refs: genbank:acc:NP_046778;genbank:gi:9630349;genbank:GeneID:1261516 Probab=98.04 E-value=6.3e-06 Score=49.09 Aligned_cols=366 Identities=11% Similarity=0.039 Sum_probs=197.0 Q ss_pred CCCc---cceEEEeeeecccccccccccceEEEecc-----cccCccceEEecCHHHHHHhcCCCcHHHHHHHHHhccCC Q lcl|NC_019451. 
1 MISQ---SRYIRIISGVGAGAPVAGRKLILRVMTTN-----NVIPPGIVIEFDNANAVLSYFGAQSEEYQRAAAYFKFIS 72 (504) Q Consensus 1 mip~---s~iv~V~~~v~~~~~~~~~~~~~l~l~~~-----~~~~~~r~~~y~s~~~V~~~Fg~~s~ey~aA~~yF~~~~ 72 (504) |-.- =+|+.+..+..+ .........-|++.. ..+|.+.-...++..+-...||.....+.+...+|.+- T Consensus 1 m~~~~~GV~v~e~~~g~~~--i~~v~tav~~~vg~a~~a~~~~~~l~~pvlvts~~~~~~~~g~~~tL~~al~~~~~ng- 77 (396) T protein:vir:20 1 MSDYHHGVQVLEINEGTRV--ISTVSTAIVGMVCTASDADAETFPLNKPVLITNVQSAISKAGKKGTLAASLQAIADQS- 77 (396) T ss_pred CCCCCCCeEEEEcCCCcce--eeecCCceeEEEeeeccCCCccccCccCEEeechHHHHhhcccccchhhhhhhhhccC- Confidence 4321 222333222222 222223444444432 23444444556777888888998888887777777753 Q ss_pred CCCcccceEEEEeeeccCCcceeeecccchhhHhHhhccCceEEEEEcccceeeeeeccccccchHHHHHHHHhhhhccc Q lcl|NC_019451. 73 KSVNSPSSISFARWVNTAIAPMVVGDNLPKTIADFAGFSAGVLTIMVGAAEQNITAIDTSAATSMDNVASIIQTEIRKNA 152 (504) Q Consensus 73 ~~~~~P~~l~igr~~~~a~~~~l~g~~~~~~~~~~~~~~~g~~titi~g~~~~~~~i~~s~ats~~~vA~~i~~~i~a~~ 152 (504) -...++-+......... . .. ...+...+ .+.. T Consensus 78 -----g~~~~v~~~~~~~~~~~------~---~~-~a~t~~~~----~~~~----------------------------- 109 (396) T protein:vir:20 78 -----KPVTVVMRVEDGTGDDE------E---TK-LAQTVSNI----IGTT----------------------------- 109 (396) T ss_pred -----ceeEEEEeccccccccc------c---cc-cccccccc----cccc----------------------------- Confidence 22234433311000000 0 00 00000000 0000 Q ss_pred ccccceeEEEEecccceeEEecccCcceeEEEEeeccchhhhhhhhcccCcceeecccccccHHHHHHHHHhhccceeEE Q lcl|NC_019451. 153 DPQLAQATVTWNQNTNQFTLVGATIGTGVLAVAKSADPQDMSTALGWSTSNVVNVAGQAADLPDAAVAKSTNVSNNFGSF 232 (504) Q Consensus 153 ~~~~~~a~vt~d~~~~~F~its~t~ga~s~~~~~sa~~~~ia~~l~~t~~~~~~~~g~aaet~~~al~~~~~~~~~wy~~ 232 (504) +.. ...++.. ... +.....+. .+......+........+|..+.+.... 
+ T Consensus 110 -----------~~~-------~~~tg~~---al~-----~~~~~~~~-~p~i~~ap~~~~~~v~~al~~~~~~~~~---~ 159 (396) T protein:vir:20 110 -----------DEN-------GQYTGLK---AML-----AAESVTGV-KPRILGVPGLDTKEVAVALASVCQKLRA---F 159 (396) T ss_pred -----------ccc-------cccchhh---hhh-----hhcccccc-chhhhhhhhhccHHHHHHHHHHHhcCCc---E Confidence 000 0000000 000 00000000 0011111122223344555555544333 3 Q ss_pred EEEeccCCHHHHHHHHHHHhhcCCcEEEEEeccccchhHHHHHHhhhcceeEEEEeccCCCccHHHHHHHHHHhcCcCcC Q lcl|NC_019451. 233 LFAGAPLDNDQIKAVSAWNAAQNNQFIYTVATSLANLGTLFTLVNGNAGTALNVLSATAANDFVEQCPSEILAATNYDEP 312 (504) Q Consensus 233 ~~~~~~~~~~~~~a~A~w~e~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~aa~~g~~as~nf~~~ 312 (504) .+.+.+.. .+..++.+|-+.-+.++...++-.-- . ....... . ....+.+.+.|.++.+|.++- T Consensus 160 ~~iD~p~~-~~~~~a~~~r~~~~s~~~~~~~P~~~---~----~d~~~~~-~-------~~~p~s~~~Ag~~a~~d~~~g 223 (396) T protein:vir:20 160 GYISAWGC-KTISEVKAYRQNFSQRELMVIWPDFL---A----WDTVTST-T-------ATAYATARALGLRAKIDQEQG 223 (396) T ss_pred EEEecCCC-CCHHHHHHHhhCCCCceEEEEcCccc---c----ccCcCCc-c-------eeechhHHHHHHHHHhhhhcC Confidence 34444432 23345556766655555555432110 0 0000000 0 011245667778887764331 Q ss_pred CceeeecccccCcccc--------ccCCHHHHHHHHhCCCeEEEEEeeccceeeEEEcCEEeCCccccchhhhhhhHHHH Q lcl|NC_019451. 313 GASQNYMYYQFPGRNI--------TVSDDTVANTVDKSRGNYIGVTQANGQQLAFYQRGILCGGPTDAVDMNVYANEIWL 384 (504) Q Consensus 313 ~g~~T~kfk~l~Gv~a--------~~lt~t~~~al~~~~~n~y~~~~~~~~~~~~~~~G~~~~G~y~~~wiD~~~~~dwl 384 (504) -......|.+.||.. ..++++|++.|..+|+|.... 
+ ..+ .++.+.++++.-.|.+|-+.+-.+|+ T Consensus 224 -~~~spaN~~l~gi~~~~~~~~~~~~~~~~~~~~Ln~~gi~~~~~--~--~G~-~~wG~rT~s~d~~~~~i~~rR~~~~i 297 (396) T protein:vir:20 224 -WHKTLSNVGVNGVTGISASVFWDLQESGTDADLLNESGVTTLIR--R--DGF-RFWGNRTCSDDPLFLFENYTRTAQVV 297 (396) T ss_pred -cEeccCCceeccceecceecccccCCCcchhhhhhhcCcEEEEc--C--CCE-EEEcccccCCCcccceeehhhHHHHH Confidence 223345666777642 235678999999999998732 3 233 45788889988788889999999999 Q ss_pred HHHHHHHHHHHHhcCCCCCcCHHHHHHHHHHHHHHHHHHHhcCccccccccCcccceeeccccCCcccccceeecceEEE Q lcl|NC_019451. 385 KSAIAQALLDLFLNVNAVPASSTGEAMTLAVLQPVLDKATANGTFTYGKEISAVQQQYITQVTGDRRAWRQVQTLGYWIN 464 (504) Q Consensus 385 ~~~lq~~l~~l~~~~~kIPyt~~G~~~l~~~v~~vl~~a~~nG~Ia~G~~~~~~q~~~i~~~~g~~~~~~~~~~~GY~v~ 464 (504) .+.|+..+...+-. |-|..=+..|+..++.-|++-++.|.|. ||.+. T Consensus 298 ~~~~~~~~~~~v~e----~~~~~~~~~i~~~i~~~L~~l~~~G~l~-----------------------------g~~v~ 344 (396) T protein:vir:20 298 ADTMAEAHMWAVDK----PITATLIRDIVDGINAKFRELKTNGYIV-----------------------------DATCW 344 (396) T ss_pred HHHHHHHHHHhccC----CCCHHHHHHHHHHHHHHHHHHHhCccee-----------------------------ceEEE Confidence 99999988874432 5688889999999999999999999996 46666 Q ss_pred ecchHhCCHHHHhhcccCceEEEEEECCeEEEEEeeeeeC Q lcl|NC_019451. 465 ITFSSYTNSNTGLTEWKANYTLIYSKGDAIRFVEGSDVMI 504 (504) Q Consensus 465 ~~~~~~~s~~dr~~R~~~~i~~~~~~aGAIh~v~i~~~~v 504 (504) .. +++.+++|+.+.+. -+.+.+.....++.|++..... 
T Consensus 345 ~d-~~~nt~~~i~~G~~-~~~i~~~p~~p~e~i~~~~~~~ 382 (396) T protein:vir:20 345 FS-EESNDAETLKAGKL-YIDYDYTPVPPLENLTLRQRIT 382 (396) T ss_pred Ee-cCCCCHHHhhCCEE-EEEEEEEecCCcceEEEEEEEc Confidence 64 46788888888887 4889999999999999998877 No 34 >protein:vir:1845 Length: 392 # NCBI annotation: tail sheath protein FI # Family: family:all:115 # MgeID: mge:324 # MgeName: 186 # Cross-refs: genbank:acc:NP_052269;genbank:gi:9634076;genbank:GeneID:1262448 Probab=97.97 E-value=8.5e-06 Score=48.38 Aligned_cols=363 Identities=11% Similarity=0.050 Sum_probs=193.9 Q ss_pred CCC---ccceEEEeeeecccccccccccceEEEeccc-----ccCccceEEecCHHHHHHhcCCCcHHHHHHHHHhccCC Q lcl|NC_019451. 1 MIS---QSRYIRIISGVGAGAPVAGRKLILRVMTTNN-----VIPPGIVIEFDNANAVLSYFGAQSEEYQRAAAYFKFIS 72 (504) Q Consensus 1 mip---~s~iv~V~~~v~~~~~~~~~~~~~l~l~~~~-----~~~~~r~~~y~s~~~V~~~Fg~~s~ey~aA~~yF~~~~ 72 (504) |-. ==.++.|..+. .+.........-|++..+ .+|...-...++.++-...||.++....+...+|.+-. T Consensus 1 m~~~~~Gv~v~e~~~g~--~~i~~~~tav~g~vgta~~~~~~~~~~~~p~~its~~~~~~~~g~~gtl~~al~~~~~ngg 78 (392) T protein:vir:18 1 MSDFHHGTKVIEINDGT--RVISTVATAIVGMVWTASDADAETFPLNEPVLITNVQSAIAKAGKKGTLSASLQAIADQSK 78 (392) T ss_pred CCCCCCCeEEEEcCCCc--eeeeccCcceeEEEEeccCCCCcccccccceEeechHHHHhhcCCCcchHHHHHHhhcccC Confidence 433 11233333332 222222333333343332 33444334556777777888998888888888887632 Q ss_pred CCCcccceEEEEeeeccCCcceeeecccchhhHhHhhccCceEEEEEcccceeeeeeccccccchHHHHHHHHhhhhccc Q lcl|NC_019451. 73 KSVNSPSSISFARWVNTAIAPMVVGDNLPKTIADFAGFSAGVLTIMVGAAEQNITAIDTSAATSMDNVASIIQTEIRKNA 152 (504) Q Consensus 73 ~~~~~P~~l~igr~~~~a~~~~l~g~~~~~~~~~~~~~~~g~~titi~g~~~~~~~i~~s~ats~~~vA~~i~~~i~a~~ 152 (504) ...++.+..... . ....+.+..++ +++. +..+..+ +. ..+..... 
T Consensus 79 ------~~~~vv~v~~~~---~--~~~~~~t~~dl-----------iG~~-------~~~~~~t--g~-~al~~~~~--- 123 (392) T protein:vir:18 79 ------PVTVVVRVAEGT---G--DDAEAQTTSNI-----------IGGT-------DENGKYT--GI-KALLTAEA--- 123 (392) T ss_pred ------ceEEEecccccc---c--ccccccchhhh-----------eecc-------cccchhh--hH-HHHHhhhh--- Confidence 223333321100 0 00000000000 0000 0000000 00 00000000 Q ss_pred ccccceeEEEEecccceeEEecccCcceeEEEEeeccchhhhhhhhcccCcceeecccccccHHHHHHHHHhhccceeEE Q lcl|NC_019451. 153 DPQLAQATVTWNQNTNQFTLVGATIGTGVLAVAKSADPQDMSTALGWSTSNVVNVAGQAADLPDAAVAKSTNVSNNFGSF 232 (504) Q Consensus 153 ~~~~~~a~vt~d~~~~~F~its~t~ga~s~~~~~sa~~~~ia~~l~~t~~~~~~~~g~aaet~~~al~~~~~~~~~wy~~ 232 (504) .....+.+ ...++........+|..+.+. +..+ T Consensus 124 ------------~~~~~p~i--------------------------------l~ap~~~~~~v~~~l~~~~~~---~~~~ 156 (392) T protein:vir:18 124 ------------VTGVKPRI--------------------------------LGVPGLDTQEVATALASVCIS---LRAF 156 (392) T ss_pred ------------hhceeehh--------------------------------cccCccchHHHHHHHHHHHhh---cCcE Confidence 00000000 000011111223334433333 2334 Q ss_pred EEEeccCCHHHHHHHHHHHhhcCCcEEEEEeccccchhHHHHHHhhhcceeEEEEeccCCCccHHHHHHHHHHhcCcCcC Q lcl|NC_019451. 233 LFAGAPLDNDQIKAVSAWNAAQNNQFIYTVATSLANLGTLFTLVNGNAGTALNVLSATAANDFVEQCPSEILAATNYDEP 312 (504) Q Consensus 233 ~~~~~~~~~~~~~a~A~w~e~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~aa~~g~~as~nf~~~ 312 (504) .+.+.+. ..+..++.+|-+..+.++...++----.. ....+. . ...-|.+.+.|..+.+|.++- T Consensus 157 ~~~d~~~-~~~~~~a~~~~~~~~s~~~~~~~p~~~~~-------d~~~~~-~-------~~~p~s~~~AG~~a~~d~~~g 220 (392) T protein:vir:18 157 GYVSAWG-CKTISEAMAYRENFSQRELMVIWPDFLAW-------DTTANA-T-------ATAYATARALGLRAYIDQTIG 220 (392) T ss_pred EEEecCC-CCCHHHHHHHHhhccCceEEEEeCceeee-------cccCCc-e-------EEechHHHHHHHHHhhhccCC Confidence 4444432 22344455677665555555443211000 000000 0 011245677788887764331 Q ss_pred CceeeecccccCcccc--------ccCCHHHHHHHHhCCCeEEEEEeeccceeeEEEcCEEeCCccccchhhhhhhHHHH Q lcl|NC_019451. 
313 GASQNYMYYQFPGRNI--------TVSDDTVANTVDKSRGNYIGVTQANGQQLAFYQRGILCGGPTDAVDMNVYANEIWL 384 (504) Q Consensus 313 ~g~~T~kfk~l~Gv~a--------~~lt~t~~~al~~~~~n~y~~~~~~~~~~~~~~~G~~~~G~y~~~wiD~~~~~dwl 384 (504) =......|.+.||.. ...+..|.+.|..+|+|... .+ ..+ .++.+.++++.-+|.+|-+.+-.+|+ T Consensus 221 -~~~spaN~~l~gi~~~~~~~~~~~~~~~~~~~~Ln~~gI~t~~--~~--~G~-~~wG~rT~~~d~~~~~i~~rR~~~~i 294 (392) T protein:vir:18 221 -WHKTLSNVGVQGVTGISASVFWDLQASGTDADLLNEAGVTTLV--RK--DGF-RFWGNRTCSDDPLFLFENYTRTAQVL 294 (392) T ss_pred -ceEccCCceeeceeecceecccccCCCcchhhhhhhcCceEEE--cC--CCE-EEEcccccCCCcccceeehhhHHHHH Confidence 123345667777642 22457889999999999873 23 233 45788999988788889999999999 Q ss_pred HHHHHHHHHHHHhcCCCCCcCHHHHHHHHHHHHHHHHHHHhcCccccccccCcccceeeccccCCcccccceeecceEEE Q lcl|NC_019451. 385 KSAIAQALLDLFLNVNAVPASSTGEAMTLAVLQPVLDKATANGTFTYGKEISAVQQQYITQVTGDRRAWRQVQTLGYWIN 464 (504) Q Consensus 385 ~~~lq~~l~~l~~~~~kIPyt~~G~~~l~~~v~~vl~~a~~nG~Ia~G~~~~~~q~~~i~~~~g~~~~~~~~~~~GY~v~ 464 (504) ++.|+..+...+-. |-++.-...|+..++.-|++-+++|.|. ||.++ T Consensus 295 ~~~i~~~~~~~v~e----~n~~~~~~~i~~~i~~~L~~l~~~gal~-----------------------------g~~v~ 341 (392) T protein:vir:18 295 ADTMAEAHMWAVDK----PITASLIRDIVDGINAKFRELKSNGYIV-----------------------------DGECW 341 (392) T ss_pred HHHHHHHHHHhccC----CCCHHHHHHHHHHHHHHHHHHHhcCccc-----------------------------ceEEE Confidence 99999988774432 6799999999999999999999999996 35555 Q ss_pred ecchHhCCHHHHhhcccCceEEEEEECCeEEEEEeeeeeC Q lcl|NC_019451. 465 ITFSSYTNSNTGLTEWKANYTLIYSKGDAIRFVEGSDVMI 504 (504) Q Consensus 465 ~~~~~~~s~~dr~~R~~~~i~~~~~~aGAIh~v~i~~~~v 504 (504) .. +...+++|+.+.+. .+.+.+.....+++|++..... 
T Consensus 342 ~d-~~~nt~~~i~~G~~-~~~v~~~p~~p~e~I~~~~~~~ 379 (392) T protein:vir:18 342 FD-EESNDKETLKAGKL-YIDYDYTPVPPLESLTLRQRIT 379 (392) T ss_pred Ee-cCCCCHHHhhCCeE-EEEEEEEecCCcceEEEEEEEc Confidence 53 36778888888777 4888889999999999998887 No 35 >protein:vir:103993 Length: 390 # NCBI annotation: phage tail sheath protein # Family: family:all:115 # MgeID: mge:1665 # MgeName: phi52237 # Cross-refs: genbank:acc:YP_293729;genbank:gi:72537699;genbank:GeneID:3608123 Probab=97.79 E-value=1.9e-05 Score=46.50 Aligned_cols=359 Identities=13% Similarity=0.087 Sum_probs=195.2 Q ss_pred CCC----ccceEEEeeeecccccccccccceEEEeccc-----ccCccceEEecCHHHHHHhcCCCcHHHHHHHHHhccC Q lcl|NC_019451. 1 MIS----QSRYIRIISGVGAGAPVAGRKLILRVMTTNN-----VIPPGIVIEFDNANAVLSYFGAQSEEYQRAAAYFKFI 71 (504) Q Consensus 1 mip----~s~iv~V~~~v~~~~~~~~~~~~~l~l~~~~-----~~~~~r~~~y~s~~~V~~~Fg~~s~ey~aA~~yF~~~ 71 (504) |-. ==.|+.|. ..+.+........+.|++..+ .+|...-...++..+....||.....+.+...+|.+. T Consensus 1 M~~~~~~Gv~v~e~~--~g~~~i~~~~tav~g~vg~a~~ad~~~~pln~pv~i~s~~~~~~~~g~~gtL~~al~~~~~~g 78 (390) T protein:vir:10 1 MPQDYHHGVRVIEIN--EGGRPIRSVSTAVLGVVCTAADADASAFPLNTPVLLTNVVAALGKAGKKGTLRRTLDAIGKQT 78 (390) T ss_pred CcccccCCeEEEEcC--CCcccccccCcceeEEEEcccCcCccccccccceEeccHHHHHhhcCCCceehhhhhhhcccc Confidence 542 11122222 222233333445555554332 2344333456777777789999988888888888874 Q ss_pred CCCCcccceEEEEeeeccCCcceeeecccchhhHhHhhccCceEEEEEcccceeeeeeccccccchHHHHHHHHhhhhcc Q lcl|NC_019451. 72 SKSVNSPSSISFARWVNTAIAPMVVGDNLPKTIADFAGFSAGVLTIMVGAAEQNITAIDTSAATSMDNVASIIQTEIRKN 151 (504) Q Consensus 72 ~~~~~~P~~l~igr~~~~a~~~~l~g~~~~~~~~~~~~~~~g~~titi~g~~~~~~~i~~s~ats~~~vA~~i~~~i~a~ 151 (504) . ...|+-+...- ..-..+..++ +++.. ..+..+ + +....... 
T Consensus 79 g------~~~~vv~v~~~--------~~~~~~~~~~-----------ig~~~-------~~~~~t--g----~~al~~~~ 120 (390) T protein:vir:10 79 K------PLTVVVRVAEG--------KDADETTSNV-----------IGTVT-------PDGKYT--G----IKALLAAQ 120 (390) T ss_pred C------ceEEEEEeccc--------cccccccccc-----------ccccc-------cccccc--h----hhhhhhhh Confidence 3 33566554211 1101000000 00000 000000 0 00000000 Q ss_pred cccccceeEEEEecccceeEEecccCcceeEEEEeeccchhhhhhhhcccCcceeecccccccHHHHHHHHHhhccceeE Q lcl|NC_019451. 152 ADPQLAQATVTWNQNTNQFTLVGATIGTGVLAVAKSADPQDMSTALGWSTSNVVNVAGQAADLPDAAVAKSTNVSNNFGS 231 (504) Q Consensus 152 ~~~~~~~a~vt~d~~~~~F~its~t~ga~s~~~~~sa~~~~ia~~l~~t~~~~~~~~g~aaet~~~al~~~~~~~~~wy~ 231 (504) ..+.+. + .....++........++..+.+ .... T Consensus 121 ----------------~~~~~~----------------p------------~il~ap~~~~~~v~~~l~~~a~---~~~~ 153 (390) T protein:vir:10 121 ----------------GALGVK----------------P------------RILAAPGLDTQPVAAALAATAQ---SLRA 153 (390) T ss_pred ----------------hhhcce----------------e------------hhhcccccchHHHHHHHHHhhc---ccce Confidence 000000 0 0000001111112222333322 2233 Q ss_pred EEEEeccCCHHHHHHHHHHHhhcCCcEEEEEeccccchhHHHHHHhhhcceeEEEEeccCCCccHHHHHHHHHHhcCcCc Q lcl|NC_019451. 232 FLFAGAPLDNDQIKAVSAWNAAQNNQFIYTVATSLANLGTLFTLVNGNAGTALNVLSATAANDFVEQCPSEILAATNYDE 311 (504) Q Consensus 232 ~~~~~~~~~~~~~~a~A~w~e~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~aa~~g~~as~nf~~ 311 (504) +.+.+.+. .....++.+|.+..+.++...++-.--. .. .... ......|.+.+.|..+.+|.+ T Consensus 154 ~aivD~p~-~~t~~~a~~~~~~~~s~~~~~~~p~~~~---~d----~~~~--------~~~~~p~s~~~Agl~a~~D~~- 216 (390) T protein:vir:10 154 MAYVSASG-CKTKEEAAAYRKQFGQREIMVIWPDWLG---WD----DTTN--------STAVIPAPAIAAGLRAKIDND- 216 (390) T ss_pred EEEEecCC-CCCHHHHHHHhhccCCceEEEEcCceEe---ec----ccCC--------cccccchHHHHHHHHHHhhcC- Confidence 44555442 2233445567666565555554321000 00 0000 001112457777888888743 Q ss_pred CCc-eeeecccccCcccc--------ccCCHHHHHHHHhCCCeEEEEEeeccceeeEEEcCEEeCCccccchhhhhhhHH Q lcl|NC_019451. 
312 PGA-SQNYMYYQFPGRNI--------TVSDDTVANTVDKSRGNYIGVTQANGQQLAFYQRGILCGGPTDAVDMNVYANEI 382 (504) Q Consensus 312 ~~g-~~T~kfk~l~Gv~a--------~~lt~t~~~al~~~~~n~y~~~~~~~~~~~~~~~G~~~~G~y~~~wiD~~~~~d 382 (504) .| ......|.+.||.- +..+..|.+.|..+|+|.... +.| + .++.+.++++.-.|.+|-+.+-.+ T Consensus 217 -~g~~~spaN~~l~gi~~~~~~~~~~~~~~~~~~~~ln~~gi~t~~~--~~G--~-~~wG~rT~s~d~~~~~i~~rR~~~ 290 (390) T protein:vir:10 217 -IGWHKTISNVVVNGVSGISADVSWDLQDPATDAGYLNEHEVTTLVN--RNG--F-RFWGERTCSDDPKFAFENYTRTAQ 290 (390) T ss_pred -CCcEECcCCceeeceeecceecccccccccchhhhhhhcCcEEEEc--CCC--E-EEEcccccCCCcccceeehhhHHH Confidence 22 23344666666653 233466788999999998743 333 3 347888888887888899999999 Q ss_pred HHHHHHHHHHHHHHhcCCCCCcCHHHHHHHHHHHHHHHHHHHhcCccccccccCcccceeeccccCCcccccceeecceE Q lcl|NC_019451. 383 WLKSAIAQALLDLFLNVNAVPASSTGEAMTLAVLQPVLDKATANGTFTYGKEISAVQQQYITQVTGDRRAWRQVQTLGYW 462 (504) Q Consensus 383 wl~~~lq~~l~~l~~~~~kIPyt~~G~~~l~~~v~~vl~~a~~nG~Ia~G~~~~~~q~~~i~~~~g~~~~~~~~~~~GY~ 462 (504) |+++.|+..+...+-. |.|+.-...|+..++.-|+.-+++|.|. ||. T Consensus 291 ~i~~~i~~~~~~~v~e----~n~~~~~~~i~~~i~~~L~~l~~~g~l~-----------------------------g~~ 337 (390) T protein:vir:10 291 VAGDSIAEAQMPVVDG----PLNPSLARDIVESINGWFRQQVANGYLI-----------------------------GGS 337 (390) T ss_pred HHHHHHHHHHHHhccC----CCCHHHHHHHHHHHHHHHHHHHhCCcee-----------------------------eeE Confidence 9999999988874322 6799999999999999999999999986 477 Q ss_pred EEecchHhCCHHHHhhcccCceEEEEEECCeEEEEEeeeeeC Q lcl|NC_019451. 463 INITFSSYTNSNTGLTEWKANYTLIYSKGDAIRFVEGSDVMI 504 (504) Q Consensus 463 v~~~~~~~~s~~dr~~R~~~~i~~~~~~aGAIh~v~i~~~~v 504 (504) +... .+..+++|+.+-+. 
-+.+.+.---.+++|++.-..- T Consensus 338 v~~d-~~~nt~~~i~~G~~-~~~v~~~p~~pae~I~~~~~~~ 377 (390) T protein:vir:10 338 AWID-PEPNTADILASGKA-YIDYDYTPVPPLENLVLRQRIT 377 (390) T ss_pred EEEc-cCCCCHHHhhCCeE-EEEEEEEecCCcceEEEEEEEc Confidence 7765 35778888777666 4888888888999999888877 No 36 >protein:vir:78206 Length: 390 # NCBI annotation: gp33, phage tail sheath protein # Family: family:all:115 # MgeID: mge:1848 # MgeName: phiE12-2 # Cross-refs: genbank:acc:YP_001111183;genbank:gi:134288694;genbank:GeneID:4960666 Probab=97.79 E-value=1.9e-05 Score=46.50 Aligned_cols=359 Identities=13% Similarity=0.087 Sum_probs=195.2 Q ss_pred CCC----ccceEEEeeeecccccccccccceEEEeccc-----ccCccceEEecCHHHHHHhcCCCcHHHHHHHHHhccC Q lcl|NC_019451. 1 MIS----QSRYIRIISGVGAGAPVAGRKLILRVMTTNN-----VIPPGIVIEFDNANAVLSYFGAQSEEYQRAAAYFKFI 71 (504) Q Consensus 1 mip----~s~iv~V~~~v~~~~~~~~~~~~~l~l~~~~-----~~~~~r~~~y~s~~~V~~~Fg~~s~ey~aA~~yF~~~ 71 (504) |-. ==.|+.|. ..+.+........+.|++..+ .+|...-...++..+....||.....+.+...+|.+. T Consensus 1 M~~~~~~Gv~v~e~~--~g~~~i~~~~tav~g~vg~a~~ad~~~~pln~pv~i~s~~~~~~~~g~~gtL~~al~~~~~~g 78 (390) T protein:vir:78 1 MPQDYHHGVRVIEIN--EGGRPIRSVSTAVLGVVCTAADADASAFPLNTPVLLTNVVAALGKAGKKGTLRRTLDAIGKQT 78 (390) T ss_pred CcccccCCeEEEEcC--CCcccccccCcceeEEEEcccCcCccccccccceEeccHHHHHhhcCCCceehhhhhhhcccc Confidence 542 11122222 222233333445555554332 2344333456777777789999988888888888874 Q ss_pred CCCCcccceEEEEeeeccCCcceeeecccchhhHhHhhccCceEEEEEcccceeeeeeccccccchHHHHHHHHhhhhcc Q lcl|NC_019451. 72 SKSVNSPSSISFARWVNTAIAPMVVGDNLPKTIADFAGFSAGVLTIMVGAAEQNITAIDTSAATSMDNVASIIQTEIRKN 151 (504) Q Consensus 72 ~~~~~~P~~l~igr~~~~a~~~~l~g~~~~~~~~~~~~~~~g~~titi~g~~~~~~~i~~s~ats~~~vA~~i~~~i~a~ 151 (504) . ...|+-+...- ..-..+..++ +++.. ..+..+ + +....... 
T Consensus 79 g------~~~~vv~v~~~--------~~~~~~~~~~-----------ig~~~-------~~~~~t--g----~~al~~~~ 120 (390) T protein:vir:78 79 K------PLTVVVRVAEG--------KDADETTSNV-----------IGTVT-------PDGKYT--G----IKALLAAQ 120 (390) T ss_pred C------ceEEEEEeccc--------cccccccccc-----------ccccc-------cccccc--h----hhhhhhhh Confidence 3 33566554211 1101000000 00000 000000 0 00000000 Q ss_pred cccccceeEEEEecccceeEEecccCcceeEEEEeeccchhhhhhhhcccCcceeecccccccHHHHHHHHHhhccceeE Q lcl|NC_019451. 152 ADPQLAQATVTWNQNTNQFTLVGATIGTGVLAVAKSADPQDMSTALGWSTSNVVNVAGQAADLPDAAVAKSTNVSNNFGS 231 (504) Q Consensus 152 ~~~~~~~a~vt~d~~~~~F~its~t~ga~s~~~~~sa~~~~ia~~l~~t~~~~~~~~g~aaet~~~al~~~~~~~~~wy~ 231 (504) ..+.+. + .....++........++..+.+ .... T Consensus 121 ----------------~~~~~~----------------p------------~il~ap~~~~~~v~~~l~~~a~---~~~~ 153 (390) T protein:vir:78 121 ----------------GALGVK----------------P------------RILAAPGLDTQPVAAALAATAQ---SLRA 153 (390) T ss_pred ----------------hhhcce----------------e------------hhhcccccchHHHHHHHHHhhc---ccce Confidence 000000 0 0000001111112222333322 2233 Q ss_pred EEEEeccCCHHHHHHHHHHHhhcCCcEEEEEeccccchhHHHHHHhhhcceeEEEEeccCCCccHHHHHHHHHHhcCcCc Q lcl|NC_019451. 232 FLFAGAPLDNDQIKAVSAWNAAQNNQFIYTVATSLANLGTLFTLVNGNAGTALNVLSATAANDFVEQCPSEILAATNYDE 311 (504) Q Consensus 232 ~~~~~~~~~~~~~~a~A~w~e~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~aa~~g~~as~nf~~ 311 (504) +.+.+.+. .....++.+|.+..+.++...++-.--. .. .... ......|.+.+.|..+.+|.+ T Consensus 154 ~aivD~p~-~~t~~~a~~~~~~~~s~~~~~~~p~~~~---~d----~~~~--------~~~~~p~s~~~Agl~a~~D~~- 216 (390) T protein:vir:78 154 MAYVSASG-CKTKEEAAAYRKQFGQREIMVIWPDWLG---WD----DTTN--------STAVIPAPAIAAGLRAKIDND- 216 (390) T ss_pred EEEEecCC-CCCHHHHHHHhhccCCceEEEEcCceEe---ec----ccCC--------cccccchHHHHHHHHHHhhcC- Confidence 44555442 2233445567666565555554321000 00 0000 001112457777888888743 Q ss_pred CCc-eeeecccccCcccc--------ccCCHHHHHHHHhCCCeEEEEEeeccceeeEEEcCEEeCCccccchhhhhhhHH Q lcl|NC_019451. 
312 PGA-SQNYMYYQFPGRNI--------TVSDDTVANTVDKSRGNYIGVTQANGQQLAFYQRGILCGGPTDAVDMNVYANEI 382 (504) Q Consensus 312 ~~g-~~T~kfk~l~Gv~a--------~~lt~t~~~al~~~~~n~y~~~~~~~~~~~~~~~G~~~~G~y~~~wiD~~~~~d 382 (504) .| ......|.+.||.- +..+..|.+.|..+|+|.... +.| + .++.+.++++.-.|.+|-+.+-.+ T Consensus 217 -~g~~~spaN~~l~gi~~~~~~~~~~~~~~~~~~~~ln~~gi~t~~~--~~G--~-~~wG~rT~s~d~~~~~i~~rR~~~ 290 (390) T protein:vir:78 217 -IGWHKTISNVVVNGVSGISADVSWDLQDPATDAGYLNEHEVTTLVN--RNG--F-RFWGERTCSDDPKFAFENYTRTAQ 290 (390) T ss_pred -CCcEECcCCceeeceeecceecccccccccchhhhhhhcCcEEEEc--CCC--E-EEEcccccCCCcccceeehhhHHH Confidence 22 23344666666653 233466788999999998743 333 3 347888888887888899999999 Q ss_pred HHHHHHHHHHHHHHhcCCCCCcCHHHHHHHHHHHHHHHHHHHhcCccccccccCcccceeeccccCCcccccceeecceE Q lcl|NC_019451. 383 WLKSAIAQALLDLFLNVNAVPASSTGEAMTLAVLQPVLDKATANGTFTYGKEISAVQQQYITQVTGDRRAWRQVQTLGYW 462 (504) Q Consensus 383 wl~~~lq~~l~~l~~~~~kIPyt~~G~~~l~~~v~~vl~~a~~nG~Ia~G~~~~~~q~~~i~~~~g~~~~~~~~~~~GY~ 462 (504) |+++.|+..+...+-. |.|+.-...|+..++.-|+.-+++|.|. ||. T Consensus 291 ~i~~~i~~~~~~~v~e----~n~~~~~~~i~~~i~~~L~~l~~~g~l~-----------------------------g~~ 337 (390) T protein:vir:78 291 VAGDSIAEAQMPVVDG----PLNPSLARDIVESINGWFRQQVANGYLI-----------------------------GGS 337 (390) T ss_pred HHHHHHHHHHHHhccC----CCCHHHHHHHHHHHHHHHHHHHhCCcee-----------------------------eeE Confidence 9999999988874322 6799999999999999999999999986 477 Q ss_pred EEecchHhCCHHHHhhcccCceEEEEEECCeEEEEEeeeeeC Q lcl|NC_019451. 463 INITFSSYTNSNTGLTEWKANYTLIYSKGDAIRFVEGSDVMI 504 (504) Q Consensus 463 v~~~~~~~~s~~dr~~R~~~~i~~~~~~aGAIh~v~i~~~~v 504 (504) +... .+..+++|+.+-+. 
-+.+.+.---.+++|++.-..- T Consensus 338 v~~d-~~~nt~~~i~~G~~-~~~v~~~p~~pae~I~~~~~~~ 377 (390) T protein:vir:78 338 AWID-PEPNTADILASGKA-YIDYDYTPVPPLENLVLRQRIT 377 (390) T ss_pred EEEc-cCCCCHHHhhCCeE-EEEEEEEecCCcceEEEEEEEc Confidence 7765 35778888777666 4888888888999999888877 No 37 >protein:vir:79141 Length: 391 # NCBI annotation: major tail seath protein gpFI # Family: family:all:115 # MgeID: mge:1863 # MgeName: RSA1 # Cross-refs: genbank:acc:YP_001165276;genbank:gi:145708101;genbank:GeneID:5247152 Probab=97.59 E-value=4e-05 Score=44.67 Aligned_cols=359 Identities=12% Similarity=0.043 Sum_probs=189.7 Q ss_pred CCC----ccceEEEeeeecccccccccccceEEEeccc-----ccCccceEEecCHHHHHHhcCCCcHHHHHHHHHhccC Q lcl|NC_019451. 1 MIS----QSRYIRIISGVGAGAPVAGRKLILRVMTTNN-----VIPPGIVIEFDNANAVLSYFGAQSEEYQRAAAYFKFI 71 (504) Q Consensus 1 mip----~s~iv~V~~~v~~~~~~~~~~~~~l~l~~~~-----~~~~~r~~~y~s~~~V~~~Fg~~s~ey~aA~~yF~~~ 71 (504) |-- ==.|+.+..+. .+........+-|++..+ .+|...-...++..+-...||.....+.+-..+|.+- T Consensus 1 M~~~~~pGv~v~e~~~~~--~~i~~~~tav~~~vg~a~~a~~~~~p~n~pv~iss~~~~~~~~g~~gtl~~al~~~~~~g 78 (391) T protein:vir:79 1 MPTDYHHGVRVVELNDGT--RPIRTIETAVAGIVCTADDADAATFPLDTPVLLTNPQAYIGKAGDKGTLAHTLDAITDQT 78 (391) T ss_pred CCCCCCCCeEEEECCCCc--ccccccCCceEEEEeecccccccccccccCEEeccHHHHHHhcCCccccchhhhhhhccc Confidence 432 11233332222 222333345555555443 4454333456777777778898877777778888764 Q ss_pred CCCCcccceEEEEeeeccCCcceeeecccchhhHhHhhccCceEEEEEcccceeeeeeccccccchHHHHHHHHhhhhcc Q lcl|NC_019451. 72 SKSVNSPSSISFARWVNTAIAPMVVGDNLPKTIADFAGFSAGVLTIMVGAAEQNITAIDTSAATSMDNVASIIQTEIRKN 151 (504) Q Consensus 72 ~~~~~~P~~l~igr~~~~a~~~~l~g~~~~~~~~~~~~~~~g~~titi~g~~~~~~~i~~s~ats~~~vA~~i~~~i~a~ 151 (504) - + ..|+-+......... . ..++......++..+.+....... 
T Consensus 79 g-----~-~~~vv~~~~~~~~~~--------~------------------------~~~~~g~~~~~~~~tGl~~l~~~~ 120 (391) T protein:vir:79 79 N-----P-LTVVVRVAGGASEAE--------T------------------------TSNLIGTTNAAGRYTGMKALLTAR 120 (391) T ss_pred c-----c-ceeeecccccccccc--------c------------------------cccccccccchhhhHHHhhhhhhh Confidence 2 1 122222211100000 0 000000000001111111100000 Q ss_pred cccccceeEEEEecccceeEEecccCcceeEEEEeeccchhhhhhhhcccCcceeecccccccHHHHHHHHHhhccceeE Q lcl|NC_019451. 152 ADPQLAQATVTWNQNTNQFTLVGATIGTGVLAVAKSADPQDMSTALGWSTSNVVNVAGQAADLPDAAVAKSTNVSNNFGS 231 (504) Q Consensus 152 ~~~~~~~a~vt~d~~~~~F~its~t~ga~s~~~~~sa~~~~ia~~l~~t~~~~~~~~g~aaet~~~al~~~~~~~~~wy~ 231 (504) . .+.+. +.. ....+........++..+.+ .+.. T Consensus 121 ~----------------~~~~~----------------p~~------------l~~p~~~~~~v~~al~~~~~---~~~~ 153 (391) T protein:vir:79 121 N----------------RFGVA----------------PRI------------LAVPGLDSLPVGTELVTIAQ---KLRA 153 (391) T ss_pred h----------------hhccc----------------chh------------hcCCccchhHHHHHHHHHHh---hcCc Confidence 0 00000 000 00001111122223333332 3334 Q ss_pred EEEEeccCCHHHHHHHHHHHhhcCCcEEEEEeccccchhHHHHHHhhhcceeEEEEeccCCCccHHHHHHHHHHhcCcCc Q lcl|NC_019451. 232 FLFAGAPLDNDQIKAVSAWNAAQNNQFIYTVATSLANLGTLFTLVNGNAGTALNVLSATAANDFVEQCPSEILAATNYDE 311 (504) Q Consensus 232 ~~~~~~~~~~~~~~a~A~w~e~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~aa~~g~~as~nf~~ 311 (504) +.+.+.+.. .....+-+|-+.-+.++...++..-- .. ...... . ....+.+.++|..+.+|-+ T Consensus 154 ~ai~d~p~~-~t~~~a~~~~~~~~s~~~a~~~P~~~---~~----d~~~~~-~-------~~~p~s~~~AG~~a~~D~~- 216 (391) T protein:vir:79 154 FAYLSAYGC-QTKEEAVAYRSNFGQREAMVMWPDFV---GW----DTAANA-E-------TTLWATARAVGLRAKIDND- 216 (391) T ss_pred EEEEECCCC-CCHHHHHHHHhccCCceeEEecceee---ee----cCcCCc-e-------eeechHHHHHHHHHHhhhc- Confidence 455554321 22234455666555555444332110 00 000000 0 1113457777888888743 Q ss_pred CCc-eeeecccccCcccc--------ccCCHHHHHHHHhCCCeEEEEEeeccceeeEEEcCEEeCCccccchhhhhhhHH Q lcl|NC_019451. 
312 PGA-SQNYMYYQFPGRNI--------TVSDDTVANTVDKSRGNYIGVTQANGQQLAFYQRGILCGGPTDAVDMNVYANEI 382 (504) Q Consensus 312 ~~g-~~T~kfk~l~Gv~a--------~~lt~t~~~al~~~~~n~y~~~~~~~~~~~~~~~G~~~~G~y~~~wiD~~~~~d 382 (504) .| ......|.+.||.. +..+.+|.+.|..+++|++.. + ..+ .++.+.++++.-.|.+|-+.+-.+ T Consensus 217 -~g~~~spaN~~l~gi~~~~~~~~~~~~~~~~~~~~Ln~~~I~t~~~--~--~G~-~~wG~rT~~~d~~~~~i~~rR~~~ 290 (391) T protein:vir:79 217 -TGWHKTLSNVAVGGVTGLSRDVFWDLQDPATDAGYLNANEVTTLVH--R--DGY-RFWGSRTCSADPLFAFENYTRTAQ 290 (391) T ss_pred -ccceeccCCceehhhhccccccccccccccchhhhhhhcCceEEEC--C--CcE-EEEcccccCCCcccceeehhhHHH Confidence 23 22333456666642 234566788999999998732 2 333 457888888887788899999999 Q ss_pred HHHHHHHHHHHHHHhcCCCCCcCHHHHHHHHHHHHHHHHHHHhcCccccccccCcccceeeccccCCcccccceeecceE Q lcl|NC_019451. 383 WLKSAIAQALLDLFLNVNAVPASSTGEAMTLAVLQPVLDKATANGTFTYGKEISAVQQQYITQVTGDRRAWRQVQTLGYW 462 (504) Q Consensus 383 wl~~~lq~~l~~l~~~~~kIPyt~~G~~~l~~~v~~vl~~a~~nG~Ia~G~~~~~~q~~~i~~~~g~~~~~~~~~~~GY~ 462 (504) |+.+.|+..+...+-. |-|+.-...|+..++.-|+.-+++|.|. ||. T Consensus 291 ~i~~~i~~~~~~~v~e----pn~~~~~~~i~~~i~~~l~~l~~~g~l~-----------------------------g~~ 337 (391) T protein:vir:79 291 VLADTMAEAHMWANDL----PMTPTLVRDLLEGINAKLRMLTRNGYLL-----------------------------GGA 337 (391) T ss_pred HHHHHHHHHHHHhccC----CCCHHHHHHHHHHHHHHHHHHHhCCcee-----------------------------ceE Confidence 9999999998874432 6799999999999999999999999996 355 Q ss_pred EEecchHhCCHHHHhhcccCceEEEEEECCeEEEEEeeeeeC Q lcl|NC_019451. 463 INITFSSYTNSNTGLTEWKANYTLIYSKGDAIRFVEGSDVMI 504 (504) Q Consensus 463 v~~~~~~~~s~~dr~~R~~~~i~~~~~~aGAIh~v~i~~~~v 504 (504) +... .+..+++|+.+-+. -+.+.+.-.-.++.|++..... 
T Consensus 338 v~~~-~~~nt~~~i~~G~~-~~~i~~~p~~p~e~i~~~~~~~ 377 (391) T protein:vir:79 338 AWFD-ADANSKDTLKAGQL-AIDYDYTPVPPLENLTFRQRIT 377 (391) T ss_pred EEEe-cCCCCHHHhhCCEE-EEEEEEEecCCcceEEEEEEEc Confidence 5553 35677777776665 4888888889999999998877 No 38 >protein:vir:108052 Length: 660 # NCBI annotation: gp18 tail sheath protein # Family: family:all:661 # MgeID: mge:2002 # MgeName: JS98 # Cross-refs: genbank:acc:YP_001595294;genbank:gi:161622600;genbank:GeneID:5783771 Probab=97.58 E-value=4.2e-05 Score=44.59 Aligned_cols=458 Identities=13% Similarity=-0.000 Sum_probs=219.9 Q ss_pred CCCccceEEEe-eeecccccccccccceEEEecccccCccceEEecCHHHHHHhcCCC---cHHHHHHHHHhccCCCCCc Q lcl|NC_019451. 1 MISQSRYIRII-SGVGAGAPVAGRKLILRVMTTNNVIPPGIVIEFDNANAVLSYFGAQ---SEEYQRAAAYFKFISKSVN 76 (504) Q Consensus 1 mip~s~iv~V~-~~v~~~~~~~~~~~~~l~l~~~~~~~~~r~~~y~s~~~V~~~Fg~~---s~ey~aA~~yF~~~~~~~~ 76 (504) |--++-=|-|. ++ .+..........+.|++....=|++.....+|..|....||.- +.++.+...||-+. T Consensus 1 ~~~~~Pgvyv~e~~-~~~~i~~~~t~~~~~vg~~~~gp~~~p~~v~s~~~~~~~fg~~~~~~~~~~~~~~~f~~~----- 74 (660) T protein:vir:10 1 MALLSPGIELKETS-VQSTVVRNATGRAALVGKFQWGPAFQVTQITNEVELVDLFGGPNNEVADYFMSGMNFLQY----- 74 (660) T ss_pred CceecCceEEEeec-CCccccCCCcccceEEeecCCCCCccCeEcCCHHHHHHHcCCcCCCchhHHHHHHHHHhC----- Confidence 66666656554 33 2445555566778888887777777777889999999999852 45567777777663 Q ss_pred ccceEEEEeeeccCCcce--eeecccc--hhhHhHhhccCceEEEEEccccee----eeeecccc--ccchHHHHHHHHh Q lcl|NC_019451. 77 SPSSISFARWVNTAIAPM--VVGDNLP--KTIADFAGFSAGVLTIMVGAAEQN----ITAIDTSA--ATSMDNVASIIQT 146 (504) Q Consensus 77 ~P~~l~igr~~~~a~~~~--l~g~~~~--~~~~~~~~~~~g~~titi~g~~~~----~~~i~~s~--ats~~~vA~~i~~ 146 (504) =+++||-|......+.- -.+..+. .............+.+.+.+.... +...+-.. ...+...+..... 
T Consensus 75 -g~~~~vvrv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~v~~~~~~~~~~~~v~~~~a~g~~~~~~~~ta~~~~~ 153 (660) T protein:vir:10 75 -GNDLRTVRVVSREFAKNASPIAGNIETTITTAGSNYAVGDKINIKYNQTVVESEGRVTSVDTDGKILSVFIPSAKIIAY 153 (660) T ss_pred -CceEEEEEecccccccccccccccceeEEeeccccccccceeeEeeccccccccccceeeccccceeeecccccccccc Confidence 35699999865422110 0000000 000000000011233333322211 00111100 0000001110000 Q ss_pred h----------------hhc-ccccccceeEEE--Eecccc---------------eeEEecccCccee----------- Q lcl|NC_019451. 147 E----------------IRK-NADPQLAQATVT--WNQNTN---------------QFTLVGATIGTGV----------- 181 (504) Q Consensus 147 ~----------------i~a-~~~~~~~~a~vt--~d~~~~---------------~F~its~t~ga~s----------- 181 (504) . +.. ....... .++. +..... .+........... T Consensus 154 a~~v~~~~~~~~~~~~~~~~~~~~~~~a-~sv~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~g~~G~ 232 (660) T protein:vir:10 154 ARSLNQYPTLGPAWTAEVTSASSGVSGT-ITVGKIVTDSGILLTEAENSEEAITSLEFQAALKKFAMPGVVALYPGEIGS 232 (660) T ss_pred ccccccccccccceeEEEecccCccccc-eeeeeeeccCcceEEeeeccccccccccceeeccccccceeeeecccccCc Confidence 0 000 0000000 0000 000000 0111110000000 Q ss_pred --EEEEeec----------------c----------------chh---h--------------hhhhhc----------- Q lcl|NC_019451. 182 --LAVAKSA----------------D----------------PQD---M--------------STALGW----------- 199 (504) Q Consensus 182 --~~~~~sa----------------~----------------~~~---i--------------a~~l~~----------- 199 (504) ....... . ..+ + ....+. T Consensus 233 ~i~v~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~~~~~~~~~~~~~~~~~~~~~~~ 312 (660) T protein:vir:10 233 TLEVEIVSKAAYEAGSSKMLDVYPGGGTRASIAKAVFNYGPQTDDQYAIIVRRDGAIVESVVLSTKEGEKDVYGNNIYLD 312 (660) T ss_pred ceeEEEeeccccCCcceeEEeeeeccceeeEEeeeecccccccccccccccccCCcccceeeeeccccccccccceeeee Confidence 0000000 0 000 0 000000 Q ss_pred ---ccC--cce---------------e-ecccc------cccHHHHHHHHHhhccceeEEEEEecc--CCHH----HHHH Q lcl|NC_019451. 
200 ---STS--NVV---------------N-VAGQA------ADLPDAAVAKSTNVSNNFGSFLFAGAP--LDND----QIKA 246 (504) Q Consensus 200 ---t~~--~~~---------------~-~~g~a------aet~~~al~~~~~~~~~wy~~~~~~~~--~~~~----~~~a 246 (504) ..+ ..+ . ..+.+ ......++..+.+....-..+++.... ..++ -..+ T Consensus 313 ~~~~~~~~~~v~~~~~~~~~~~~~~~~l~gg~~~~~~~t~~~~~~~~~~~~~~~~~~~~~l~~p~~~~~~~~~~~~v~~a 392 (660) T protein:vir:10 313 DYFAKGTSNYIYATSLNWPKGFSGIINLSGGISANDKVTAGDLMQGWDLFADREALHINLLIAGAVAGEGDEVASTVQKH 392 (660) T ss_pred hhhcCCCccEEEEEeccCCCCcccceeeeccccCccccccchhhhhhhhhhhhhhcccceEEEcCcCCCchhhhHHHHHH Confidence 000 000 0 00000 001122333333222111122222111 1122 2334 Q ss_pred HHHHHhhcCCcEEEEEecc-----cc--chhHHHHHHhh------------hcceeEEEEeccC-----C----CccHHH Q lcl|NC_019451. 247 VSAWNAAQNNQFIYTVATS-----LA--NLGTLFTLVNG------------NAGTALNVLSATA-----A----NDFVEQ 298 (504) Q Consensus 247 ~A~w~e~~~~~~~~~~~~~-----d~--~~~~~~~~~~~------------~~~~~~~~~~~~~-----~----~~~~~a 298 (504) +...+|....++.++-.-. .. ........... ...+...++++.. . ...+.. T Consensus 393 l~~~~~~~~~~~aiid~p~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~sg 472 (660) T protein:vir:10 393 VVSIADERQDCLAFISPPKGLLVNVPLTRAVDNLIDWRTGAGTFDANNMNISTTYAAIDGNYKYQYDKYNDVNRWVPLAA 472 (660) T ss_pred HHHHHHhhCCEEEEEecCcccccccccccCHHHHHHHHhhcccccccccccCcceEEEEcCceEEecccCCceeEechhH Confidence 5566666554443331100 00 00011111110 0112222332211 0 013567 Q ss_pred HHHHHHHhcCcCcCCceeeecccccCccc-----cccCCHHHHHHHHhCCCeEEEEEeeccceeeEEEcCEEeCCcc-cc Q lcl|NC_019451. 299 CPSEILAATNYDEPGASQNYMYYQFPGRN-----ITVSDDTVANTVDKSRGNYIGVTQANGQQLAFYQRGILCGGPT-DA 372 (504) Q Consensus 299 a~~g~~as~nf~~~~g~~T~kfk~l~Gv~-----a~~lt~t~~~al~~~~~n~y~~~~~~~~~~~~~~~G~~~~G~y-~~ 372 (504) .++|.++.+|-++. =...-.+|.+.||. ...+++.|.+.|..+|+|+...+-+. 
+.+ .++...++++.- +| T Consensus 473 ~~AGl~Ar~D~~~g-~~~sPan~~~~~i~g~~~~~~~~~~~e~~~Ln~~gIn~i~~~~~~-~G~-~~wG~rT~~~~~s~~ 549 (660) T protein:vir:10 473 DLAGLCARTDDVSQ-PWMSPAGYNRGQILNVLKLAIEPRQAQRDRMYQEAINPVVGFAGG-DGF-VLFGDKTATKVPSPM 549 (660) T ss_pred HHHHHHHHhhccCC-cEEccCCeeeceeeccceeeecCChhhHHhHhhCCceEEEEeeCC-CcE-EEEcccccCCCCccc Confidence 77888888874331 11122355554442 23578999999999999998877542 222 346777777753 67 Q ss_pred chhhhhhhHHHHHHHHHHHHHHHHhcCCCCCcCHHHHHHHHHHHHHHHHHHHhcCccccccccCcccceeeccccCCccc Q lcl|NC_019451. 373 VDMNVYANEIWLKSAIAQALLDLFLNVNAVPASSTGEAMTLAVLQPVLDKATANGTFTYGKEISAVQQQYITQVTGDRRA 452 (504) Q Consensus 373 ~wiD~~~~~dwl~~~lq~~l~~l~~~~~kIPyt~~G~~~l~~~v~~vl~~a~~nG~Ia~G~~~~~~q~~~i~~~~g~~~~ 452 (504) .||-+.+-.+|+++.|+...+...-. |.++.-...|+..++.-|+.-+++|.|. T Consensus 550 ~~i~vrR~~~~i~~si~~~~~~~v~e----pn~~~l~~~i~~~i~~fL~~l~~~gal~---------------------- 603 (660) T protein:vir:10 550 DHINVRRLFNMLKKNIGDASKYKLFE----LNDNFTRSSFRMEVSQYLDGIKALGGIY---------------------- 603 (660) T ss_pred ceEehhhHHHHHHHHHHHHHHHhccC----CCCHHHHHHHHHHHHHHHHHHHhCCcee---------------------- Confidence 88999999999999999988874433 4688899999999999999999999996 Q ss_pred ccceeecceEEEecchHhCCHHHHhhcccCceEEEEEECCeEEEEEeeeeeC Q lcl|NC_019451. 453 WRQVQTLGYWINITFSSYTNSNTGLTEWKANYTLIYSKGDAIRFVEGSDVMI 504 (504) Q Consensus 453 ~~~~~~~GY~v~~~~~~~~s~~dr~~R~~~~i~~~~~~aGAIh~v~i~~~~v 504 (504) ||+|.++ .++.+++|..+.+. .+.+.++-.-.++.|.++-+-. 
T Consensus 604 -------g~~V~~d-~~~nt~~di~~G~~-~~~i~~~P~~pae~I~~~~~~~ 646 (660) T protein:vir:10 604 -------EGRVVCD-TTVNTPAVIDRNEF-IANIYVKPARSINYITLNFVAT 646 (660) T ss_pred -------eeEEEEc-CCCCCHHHhhCCeE-EEEEEEEecCCccEEEEEEEEe Confidence 4888887 57888888887777 5899999999999999885555 No 39 >protein:vir:98553 Length: 395 # NCBI annotation: gp21 # Family: family:all:115 # MgeID: mge:1533 # MgeName: PSP3 # Cross-refs: genbank:acc:NP_958076;genbank:gi:41057373;genbank:GeneID:2744224 Probab=97.47 E-value=6.1e-05 Score=43.70 Aligned_cols=365 Identities=10% Similarity=0.030 Sum_probs=194.9 Q ss_pred CCCc---cceEEEeeeecccccccccccceEEEeccc-----ccCc-cceEEecCHHHHHHhcCCCcHHHHHHHHHhccC Q lcl|NC_019451. 1 MISQ---SRYIRIISGVGAGAPVAGRKLILRVMTTNN-----VIPP-GIVIEFDNANAVLSYFGAQSEEYQRAAAYFKFI 71 (504) Q Consensus 1 mip~---s~iv~V~~~v~~~~~~~~~~~~~l~l~~~~-----~~~~-~r~~~y~s~~~V~~~Fg~~s~ey~aA~~yF~~~ 71 (504) |-.- -+++.+..+ +.+..........|++..+ .+|. +.+ ..++..+....||.....+.+-..+|.+. T Consensus 1 m~~~~~GV~v~e~~~g--~~~v~~v~tav~~~vgta~~~~~~~~p~~~pv-~v~s~~~~~~~~g~~~tl~~al~~~~~~~ 77 (395) T protein:vir:98 1 MSDFHHGTQVIEINDG--TRVISTVATAVVGMVCTASDADATLFPLNEPV-LITNVQSAIAKAGKKGTLAASLQAIADQS 77 (395) T ss_pred CCCCCCCeEEEEcCCC--cccccccCcceEEEEeeccCCCccccccccce-EeechHHhHhhcccccchhhHHHHHhhcc Confidence 4332 112222222 2222233344444444332 2333 234 34677777788999988888888888874 Q ss_pred CCCCcccceEEEEeeeccCCcceeeecccchhhHhHhhccCceEEEEEcccceeeeeeccccccchHHHHHHHHhhhhcc Q lcl|NC_019451. 72 SKSVNSPSSISFARWVNTAIAPMVVGDNLPKTIADFAGFSAGVLTIMVGAAEQNITAIDTSAATSMDNVASIIQTEIRKN 151 (504) Q Consensus 72 ~~~~~~P~~l~igr~~~~a~~~~l~g~~~~~~~~~~~~~~~g~~titi~g~~~~~~~i~~s~ats~~~vA~~i~~~i~a~ 151 (504) . ...++-+......... ....+ .+ ...+.|... 
.....+++.......- T Consensus 78 ~------~~~~vv~~~~~~~~~~--~~~~a--------~~----~~~i~g~~~--------~~~~~Tgl~al~~~~~--- 126 (395) T protein:vir:98 78 K------PVTVVVRVEDGTGDDE--EAALA--------QT----VSNIIGGTD--------ENGKYTGIKALLTAQA--- 126 (395) T ss_pred C------ceEEEeeccccccccc--ccccc--------cc----ccccccccc--------cccchhHHHHHhhhhh--- Confidence 3 3344444422111000 00000 00 000011000 0000011111000000 Q ss_pred cccccceeEEEEecccceeEEecccCcceeEEEEeeccchhhhhhhhcccCcceeecccccccHHHHHHHHHhhccceeE Q lcl|NC_019451. 152 ADPQLAQATVTWNQNTNQFTLVGATIGTGVLAVAKSADPQDMSTALGWSTSNVVNVAGQAADLPDAAVAKSTNVSNNFGS 231 (504) Q Consensus 152 ~~~~~~~a~vt~d~~~~~F~its~t~ga~s~~~~~sa~~~~ia~~l~~t~~~~~~~~g~aaet~~~al~~~~~~~~~wy~ 231 (504) .+.+ .+.....++........+|..+.+.... T Consensus 127 -----------------~~~~----------------------------~p~il~ap~~~~~~v~~al~~~~~~~~~--- 158 (395) T protein:vir:98 127 -----------------VTGV----------------------------KPRILGVPGLDTKEVAVALASAAIKLRA--- 158 (395) T ss_pred -----------------hhcc----------------------------chhhcccccccccHHHHHHHHHhhhcCc--- Confidence 0000 0000000111112233444444443333 Q ss_pred EEEEeccCCHHHHHHHHHHHhhcCCcEEEEEeccccchhHHHHHHhhhcceeEEEEeccCCCccHHHHHHHHHHhcCcCc Q lcl|NC_019451. 232 FLFAGAPLDNDQIKAVSAWNAAQNNQFIYTVATSLANLGTLFTLVNGNAGTALNVLSATAANDFVEQCPSEILAATNYDE 311 (504) Q Consensus 232 ~~~~~~~~~~~~~~a~A~w~e~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~aa~~g~~as~nf~~ 311 (504) +.+.+.+.. ....++-+|-+.-+.++...++----- . ...... ....-+.+.+.|.++.+|.++ T Consensus 159 ~~~~d~p~~-~t~~~a~~~~~~~~s~~~~~~~p~~~~---~----d~~~~~--------~~~~p~s~~~AG~~a~~d~~~ 222 (395) T protein:vir:98 159 FAYVSAWGC-KTISEAMEYRKNFSQRELMVIWPDFLA---W----DTVKNT--------TATAYATARALGLRAYIDQTV 222 (395) T ss_pred EEEEEcCCC-CCHHHHHHHHhccCCceEEEEecceeE---e----cccCCc--------eeeechHHHHHHHHHHhhccc Confidence 344444322 223344556665555555544321100 0 000000 001124566777888776433 Q ss_pred CCceeeecccccCcccc--------ccCCHHHHHHHHhCCCeEEEEEeeccceeeEEEcCEEeCCccccchhhhhhhHHH Q lcl|NC_019451. 
312 PGASQNYMYYQFPGRNI--------TVSDDTVANTVDKSRGNYIGVTQANGQQLAFYQRGILCGGPTDAVDMNVYANEIW 383 (504) Q Consensus 312 ~~g~~T~kfk~l~Gv~a--------~~lt~t~~~al~~~~~n~y~~~~~~~~~~~~~~~G~~~~G~y~~~wiD~~~~~dw 383 (504) . =......|.+.||.. ...+.+|++.|.++|+|... .+ ..+ .++.+.++++.-+|.+|-+.+-.+| T Consensus 223 g-~~~spaN~~i~gi~~~~~~~~~~~~~~~~~~~~Ln~~gI~~~~--~~--~G~-~~wG~rT~s~d~~~~~i~~rR~~~~ 296 (395) T protein:vir:98 223 G-WHKTLSNVGVQGVTGISASVFWDLQASGTDADLLNEAGVTTLV--RK--DGF-RFWGNRTCSDDPLFLFENYTRTAQV 296 (395) T ss_pred C-cEeccCCceeecccccceecccccCCCcchHHhhhhcCcEEEE--cC--CCE-EEEcccccCCCcccceeehhhHHHH Confidence 1 122234556666532 23468899999999999873 23 333 4578888888888888999999999 Q ss_pred HHHHHHHHHHHHHhcCCCCCcCHHHHHHHHHHHHHHHHHHHhcCccccccccCcccceeeccccCCcccccceeecceEE Q lcl|NC_019451. 384 LKSAIAQALLDLFLNVNAVPASSTGEAMTLAVLQPVLDKATANGTFTYGKEISAVQQQYITQVTGDRRAWRQVQTLGYWI 463 (504) Q Consensus 384 l~~~lq~~l~~l~~~~~kIPyt~~G~~~l~~~v~~vl~~a~~nG~Ia~G~~~~~~q~~~i~~~~g~~~~~~~~~~~GY~v 463 (504) +.+.|+..+...+-. |-|+.=...|+..++.-|++-+++|.|. ||.+ T Consensus 297 i~~~i~~~~~~~v~e----~~~~~~~~~i~~~i~~~L~~l~~~g~l~-----------------------------g~~v 343 (395) T protein:vir:98 297 LADTMAEAHMWAVDK----PITATLIRDIVDGINAKFRELKSNGYIV-----------------------------EGKC 343 (395) T ss_pred HHHHHHHHHHHhccC----CCCHHHHHHHHHHHHHHHHHHHhCCcee-----------------------------ceEE Confidence 999999988874433 5688888999999999999999999996 4666 Q ss_pred EecchHhCCHHHHhhcccCceEEEEEECCeEEEEEeeeeeC Q lcl|NC_019451. 464 NITFSSYTNSNTGLTEWKANYTLIYSKGDAIRFVEGSDVMI 504 (504) Q Consensus 464 ~~~~~~~~s~~dr~~R~~~~i~~~~~~aGAIh~v~i~~~~v 504 (504) ... ++..++++..+.+. .+.+.+..-..++.|++..... 
T Consensus 344 ~~d-~~~nt~~~i~~G~~-~~~i~~~p~~p~e~I~~~~~~~ 382 (395) T protein:vir:98 344 WFD-EESNDKETLKAGKL-YIDYDYTPVPPLESLTLRQRIT 382 (395) T ss_pred EEe-cCCCCHHHhhCCeE-EEEEEEEecCCcceEEEEEEEc Confidence 654 36778888888887 4899999999999999999888 No 40 >protein:vir:5711 Length: 396 # NCBI annotation: gpFI # Family: family:all:115 # MgeID: mge:120 # MgeName: L-413C # Cross-refs: genbank:acc:NP_839870;genbank:gi:30065725;genbank:GeneID:1260618 Probab=97.32 E-value=9.5e-05 Score=42.64 Aligned_cols=365 Identities=10% Similarity=0.056 Sum_probs=194.8 Q ss_pred CCCc---cceEEEeeeecccccccccccceEEEeccc-----ccCc-cceEEecCHHHHHHhcCCCcHHHHHHHHHhccC Q lcl|NC_019451. 1 MISQ---SRYIRIISGVGAGAPVAGRKLILRVMTTNN-----VIPP-GIVIEFDNANAVLSYFGAQSEEYQRAAAYFKFI 71 (504) Q Consensus 1 mip~---s~iv~V~~~v~~~~~~~~~~~~~l~l~~~~-----~~~~-~r~~~y~s~~~V~~~Fg~~s~ey~aA~~yF~~~ 71 (504) |-.- =.|+.|..+ +.+..........|++... .+|. +.++. ++..+....||..+..+.+-+.+|.+. T Consensus 1 m~~~~~GV~v~e~~~g--~~~i~~v~tav~~~vg~a~~~d~~~~~~~~pv~i-~s~~~~~~~~g~~~tl~~al~~~~~~~ 77 (396) T protein:vir:57 1 MSDYHHGVQVLEINDG--TRVISTVSTAIVGMVCTASDADAETFPLNKPVLI-TNVQSAIAKAGKKGTLAASLQAIADQS 77 (396) T ss_pred CCCCCCceEEEEcCCC--cccccccCCceEEEEEeccCCCcccccCccCeEe-ecchhhhhhcccccchHHHHHHhhhcC Confidence 4321 122222222 2222333345555554432 3344 34444 566677788899888888888888763 Q ss_pred CCCCcccceEEEEeeeccCCcceeeecccchhhHhHhhccCceEEEEEcccceeeeeeccccccchHHHHHHHHhhhhcc Q lcl|NC_019451. 72 SKSVNSPSSISFARWVNTAIAPMVVGDNLPKTIADFAGFSAGVLTIMVGAAEQNITAIDTSAATSMDNVASIIQTEIRKN 151 (504) Q Consensus 72 ~~~~~~P~~l~igr~~~~a~~~~l~g~~~~~~~~~~~~~~~g~~titi~g~~~~~~~i~~s~ats~~~vA~~i~~~i~a~ 151 (504) . ...++-+......... ... . .....-.+++... ....+++... ..+. 
T Consensus 78 ~------~~~~vv~~~~~~~~~~---------~~~-~---a~t~~~iiG~~~~---------~~~~tgl~al----~~~~ 125 (396) T protein:vir:57 78 K------PVTVVVRVEDGTGDDE---------ETK-L---AQTVSNIIGTTDE---------NGQYTGLKAL----MGAE 125 (396) T ss_pred C------ceeEeeeccccccccc---------ccc-c---cccceeeeeeccc---------cccchhhhhh----hhcc Confidence 2 2344444321110000 000 0 0000001111000 0000111000 0000 Q ss_pred cccccceeEEEEecccceeEEecccCcceeEEEEeeccchhhhhhhhcccCcceeecccccccHHHHHHHHHhhccceeE Q lcl|NC_019451. 152 ADPQLAQATVTWNQNTNQFTLVGATIGTGVLAVAKSADPQDMSTALGWSTSNVVNVAGQAADLPDAAVAKSTNVSNNFGS 231 (504) Q Consensus 152 ~~~~~~~a~vt~d~~~~~F~its~t~ga~s~~~~~sa~~~~ia~~l~~t~~~~~~~~g~aaet~~~al~~~~~~~~~wy~ 231 (504) ..+.+ .+. .....+........+|..+.+.. .. T Consensus 126 ----------------~~~~~--------------~p~--------------i~~ap~~~~~~v~~al~~~~~~~---~~ 158 (396) T protein:vir:57 126 ----------------SVTGV--------------KPR--------------ILGVPGLDTKEVAVALASVCQEL---NA 158 (396) T ss_pred ----------------cceeE--------------Eec--------------cccCcccchhHHHHHHHHHhhhC---ce Confidence 00000 000 00000111112333444444332 34 Q ss_pred EEEEeccCCHHHHHHHHHHHhhcCCcEEEEEeccccchhHHHHHHhhhcceeEEEEeccCCCccHHHHHHHHHHhcCcCc Q lcl|NC_019451. 232 FLFAGAPLDNDQIKAVSAWNAAQNNQFIYTVATSLANLGTLFTLVNGNAGTALNVLSATAANDFVEQCPSEILAATNYDE 311 (504) Q Consensus 232 ~~~~~~~~~~~~~~a~A~w~e~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~aa~~g~~as~nf~~ 311 (504) +.+.+.+.. .+..++.+|-+.-+..+...++--- ... ....+. ... ..+.+.+.|.++.+|..+ T Consensus 159 ~~~~d~p~~-~~~~~~~~~~~~~~s~~~~~~~p~~---~~~----d~~~~~-~~~-------~p~s~~~Ag~~a~~d~~~ 222 (396) T protein:vir:57 159 FGYISAWGC-KTISEVKAYRQNFSQRELMVIWPDF---LAW----DTVTST-TAT-------AYATARALGLRAKIDQEQ 222 (396) T ss_pred EEEEcCCCC-CCHHHHHHHHhccCCceEEEEccee---eee----cccCCc-eeE-------EehhHHHHHHHHHhhhcc Confidence 445544422 2234455676665555554443110 000 000000 001 124566777888776433 Q ss_pred CCceeeecccccCccccc--------cCCHHHHHHHHhCCCeEEEEEeeccceeeEEEcCEEeCCccccchhhhhhhHHH Q lcl|NC_019451. 
312 PGASQNYMYYQFPGRNIT--------VSDDTVANTVDKSRGNYIGVTQANGQQLAFYQRGILCGGPTDAVDMNVYANEIW 383 (504) Q Consensus 312 ~~g~~T~kfk~l~Gv~a~--------~lt~t~~~al~~~~~n~y~~~~~~~~~~~~~~~G~~~~G~y~~~wiD~~~~~dw 383 (504) .--.....|.+.||..- ..+++|.+.|..+|+|+.. .+. .+ .++.+.++++.-.|.+|-+.+-.+| T Consensus 223 -g~~~spaN~~l~gi~~~~~~~~~~~~~~~~~~~~Ln~~gi~t~~--~~~--G~-~~wG~rT~~~d~~~~~i~vrR~~~~ 296 (396) T protein:vir:57 223 -GWHKTLSNVGVNGVTGISASVFWDLQKPGTDADLLNEAGVTTLV--RRD--GF-RFWGNRTCSDDPLFLFESYTRTAQV 296 (396) T ss_pred -CcEeccCCceeccccccceecccccCCcchhhhhhhhcCcEEEE--cCC--CE-EEEcccccCCCcccceeehhhHHHH Confidence 12334456777776532 2357899999999999873 232 33 4578889998878888999999999 Q ss_pred HHHHHHHHHHHHHhcCCCCCcCHHHHHHHHHHHHHHHHHHHhcCccccccccCcccceeeccccCCcccccceeecceEE Q lcl|NC_019451. 384 LKSAIAQALLDLFLNVNAVPASSTGEAMTLAVLQPVLDKATANGTFTYGKEISAVQQQYITQVTGDRRAWRQVQTLGYWI 463 (504) Q Consensus 384 l~~~lq~~l~~l~~~~~kIPyt~~G~~~l~~~v~~vl~~a~~nG~Ia~G~~~~~~q~~~i~~~~g~~~~~~~~~~~GY~v 463 (504) +++.|+..+...+-. |-++.=...|+..|+.-|+.-+++|.|. ||.+ T Consensus 297 i~~~i~~~~~~~v~e----~n~~~~~~~i~~~i~~~l~~l~~~gal~-----------------------------g~~v 343 (396) T protein:vir:57 297 LADTMAEAHMWAIDK----PITATLIRDIIDGINAKFRELKNNGYIV-----------------------------DGTC 343 (396) T ss_pred HHHHHHHHHHHhccC----CCCHHHHHHHHHHHHHHHHHHHhCCcee-----------------------------ceEE Confidence 999999888774422 5688999999999999999999999996 3566 Q ss_pred EecchHhCCHHHHhhcccCceEEEEEECCeEEEEEeeeeeC Q lcl|NC_019451. 464 NITFSSYTNSNTGLTEWKANYTLIYSKGDAIRFVEGSDVMI 504 (504) Q Consensus 464 ~~~~~~~~s~~dr~~R~~~~i~~~~~~aGAIh~v~i~~~~v 504 (504) ..+ ++..+++++.+.+. -+.+.+.-...++.|.+..... 
T Consensus 344 ~~d-~~~n~~~~i~~G~~-~~~v~~~p~~p~e~I~~~~~~~ 382 (396) T protein:vir:57 344 WFS-EESNDAETLKAGKL-YIDYDYTPVPPLENLTLRQRIT 382 (396) T ss_pred EEe-cCCCCHHHhhCCeE-EEEEEEEecCCcceEEEEEEEc Confidence 654 35678888877776 5899999999999999998887 No 41 >protein:vir:104858 Length: 729 # NCBI annotation: T4-like tail sheath protein # Family: family:all:661 # MgeID: mge:1630 # MgeName: P-SSM2 # Cross-refs: genbank:acc:YP_214361;genbank:gi:61806001;genbank:GeneID:3294378 Probab=97.24 E-value=0.00012 Score=42.09 Aligned_cols=446 Identities=10% Similarity=0.041 Sum_probs=165.5 Q ss_pred CCCccceE-EEe---eeecccccccccccceEEEecccccC--------ccceEEecCHHHHHHhcCCCcH-HHHHHHHH Q lcl|NC_019451. 1 MISQSRYI-RII---SGVGAGAPVAGRKLILRVMTTNNVIP--------PGIVIEFDNANAVLSYFGAQSE-EYQRAAAY 67 (504) Q Consensus 1 mip~s~iv-~V~---~~v~~~~~~~~~~~~~l~l~~~~~~~--------~~r~~~y~s~~~V~~~Fg~~s~-ey~aA~~y 67 (504) ..+.+.+. .+. +......+................++ .+-.+..+... ++..... +-...... T Consensus 204 ~~~~~~~~v~~~s~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~a~~~~~~~~~~~~t~~~----~~~~~~~~~~~~~~~~ 279 (729) T protein:vir:10 204 GSTDTTLEVKVISHISAAGVETAVEYQQNGTYTFDNSGSVNVIAAGSSGSGSAKSYTAQT----DWFESQNIVLSNSTLE 279 (729) T ss_pred ccccccccceecccccccccceeccccccceeeecccCccceeeeccccccccccceeee----cccccccccccccccc Confidence 11111110 000 00000000000000000000000000 00001111000 0000000 00000111 Q ss_pred hccCCCCCcccceEEEEeeeccCCcceeeecccchhhHhHhhccCceE-----EEEEcccceeeeeeccccccchHHHH- Q lcl|NC_019451. 68 FKFISKSVNSPSSISFARWVNTAIAPMVVGDNLPKTIADFAGFSAGVL-----TIMVGAAEQNITAIDTSAATSMDNVA- 141 (504) Q Consensus 68 F~~~~~~~~~P~~l~igr~~~~a~~~~l~g~~~~~~~~~~~~~~~g~~-----titi~g~~~~~~~i~~s~ats~~~vA- 141 (504) +.... +.|...-.+.-.. ...-.+.+. +......+.. ..|.. .++.....+............+..-. 
T Consensus 280 ~~~~~---~~~~t~~~~~~~~-~~~d~~~~~-~~d~~~~~~~-~~g~vve~~~~~s~~~~~~~~~~~~~~~~~vi~~~s~ 353 (729) T protein:vir:10 280 WDSIA---DAPGTSTYVSTRG-GKNDEIHVL-VIDDKGTITG-NSGTILEKHLSLSKAKDAEYSVGSSSYWRDFLATNSK 353 (729) T ss_pred ccccc---ccccccccccccc-cccccccee-eecccccccc-Ccccceeeeeeeeeccccccccccccccceeeccccc Confidence 11111 1111100000000 000000000 0000000000 00000 00000000000000000000000000 Q ss_pred ---------HHHHhhhhcccccccceeEEEEecccceeEEecccCcceeEEEEeeccchhhhhhhhcccCcceeeccccc Q lcl|NC_019451. 142 ---------SIIQTEIRKNADPQLAQATVTWNQNTNQFTLVGATIGTGVLAVAKSADPQDMSTALGWSTSNVVNVAGQAA 212 (504) Q Consensus 142 ---------~~i~~~i~a~~~~~~~~a~vt~d~~~~~F~its~t~ga~s~~~~~sa~~~~ia~~l~~t~~~~~~~~g~aa 212 (504) ..+............ .....+......... ... ..... ....+.+.......+.... ..... T Consensus 354 ~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~a~~~~~--~~~-~~~~~--~~~~g~~~~~~~~~~~~~~---~~~~~ 424 (729) T protein:vir:10 354 YIFGGGATSGITTTGYSVSSTNTL-DTDSGWDQNAEGVNF--GAS-GVATL--TLAGGTNYGDKTDLTTSGA---LSSGV 424 (729) T ss_pred eeeeccccccccccccccccccee-ccccccccccccccc--ccc-ceeEE--Eeecccccccccccccccc---cccch Confidence 000000000000000 000000000000000 000 00000 0000111000000000000 00112 Q ss_pred ccHHHHHHHHHhhcc-ceeEEEEEec----cCCHHHHHHHHHHHhhcCCcEEEEEec--------------cccchh--H Q lcl|NC_019451. 213 DLPDAAVAKSTNVSN-NFGSFLFAGA----PLDNDQIKAVSAWNAAQNNQFIYTVAT--------------SLANLG--T 271 (504) Q Consensus 213 et~~~al~~~~~~~~-~wy~~~~~~~----~~~~~~~~a~A~w~e~~~~~~~~~~~~--------------~d~~~~--~ 271 (504) +...+++.++.+... .......... ........++...++....++.+.-.- .+.... . 
T Consensus 425 ~~~~~g~~~l~~~~~~~~~~~~~~~~~~~~~~~~~v~~a~~~~~~~~~~~~a~~~~~~~~~i~~~~~~~~~~~~~~~~~~ 504 (729) T protein:vir:10 425 DDIISGYTLFENTEEIEVDFILMGAAHHPKEQSQAVAEKVTAVAEARKDAVAFISPYRQAFLNDTVSGTVTVSNIDQTTE 504 (729) T ss_pred hHHHHHHHHhhcccccccceeeecCCCCCccchHHHHHHHHHHHHhcCCeEEEecccccccccccccccccccccchhhH Confidence 234455666554321 1211111111 112333456666666654443322100 000000 0 Q ss_pred HHHHHh-hhcc--eeEEEEecc--C---CC---c-cHHHHHHHHHHhcCcCcCCceeeecccccCccc-----cccCCHH Q lcl|NC_019451. 272 LFTLVN-GNAG--TALNVLSAT--A---AN---D-FVEQCPSEILAATNYDEPGASQNYMYYQFPGRN-----ITVSDDT 334 (504) Q Consensus 272 ~~~~~~-~~~~--~~~~~~~~~--~---~~---~-~~~aa~~g~~as~nf~~~~g~~T~kfk~l~Gv~-----a~~lt~t 334 (504) ...... .... +...++++. + .+ . -+.+.++|.++.+|.++. =......|.+.||. ...+++. T Consensus 505 ~~~~~~~~~~~~~~~~~~~p~~~~~d~~~~~~~~~p~s~~~aGl~a~~d~~~g-~~~span~~~~~i~g~~~~~~~~~~~ 583 (729) T protein:vir:10 505 NVVGFYAPLSSSTYSVFDSGYKYMFDRFNNTFRYVPLNGDIAGTCARTDIEQF-PWFSPAGTARGPILNSVKLVYNPGKK 583 (729) T ss_pred HHHHHHhhccCCceEEEEcCeeEEecccCCceEEechhHHHHHHHHHhhccCC-cEEccCCccccceecccceeeecChh Confidence 000110 1111 111221111 0 11 1 244667788888875432 12334455555543 2357899 Q ss_pred HHHHHHhCCCeEEEEEeeccceeeEEEcCEEe-CCccccchhhhhhhHHHHHHHHHHHHHHHHhcCCCCCcCHHHHHHHH Q lcl|NC_019451. 335 VANTVDKSRGNYIGVTQANGQQLAFYQRGILC-GGPTDAVDMNVYANEIWLKSAIAQALLDLFLNVNAVPASSTGEAMTL 413 (504) Q Consensus 335 ~~~al~~~~~n~y~~~~~~~~~~~~~~~G~~~-~G~y~~~wiD~~~~~dwl~~~lq~~l~~l~~~~~kIPyt~~G~~~l~ 413 (504) |.+.|..+++|+...+.+.|. .++.++++ +...+|.+|-+.+-.+|+++.|+..++..+-. 
|-|+.=...|+ T Consensus 584 ~~~~Ln~~gIn~i~~~~~~G~---~~wG~rT~~~~d~~~~~i~vrR~~~~i~~si~~~~~~~v~e----pn~~~~~~~i~ 656 (729) T protein:vir:10 584 QRDILYSNRINPVILSPGAGI---ILFGDKTGFGKSSAFDRINVRRLFIYLEDAISAAAKDQLFE----FNDELTRTNFV 656 (729) T ss_pred hHhhhhhCCceEEEEecCCeE---EEEcceecCCCCcccceeehhhhHHHHHHHHHHHHHHhhcC----CCCHHHHHHHH Confidence 999999999999988876553 34677765 44557888999999999999999998874432 56888899999 Q ss_pred HHHHHHHHHHHhcCccccccccCcccceeeccccCCcccccceeecceEEEecchHhCCHHHHhhcccCceEEEEEECCe Q lcl|NC_019451. 414 AVLQPVLDKATANGTFTYGKEISAVQQQYITQVTGDRRAWRQVQTLGYWINITFSSYTNSNTGLTEWKANYTLIYSKGDA 493 (504) Q Consensus 414 ~~v~~vl~~a~~nG~Ia~G~~~~~~q~~~i~~~~g~~~~~~~~~~~GY~v~~~~~~~~s~~dr~~R~~~~i~~~~~~aGA 493 (504) ..|+.-|+.-+++|.|. ||.|.++ .+..+++|+.+-+. .+.+.+.-.-- T Consensus 657 ~~i~~~L~~l~~~g~l~-----------------------------g~~v~~d-~~~nt~~~i~~G~~-~~~v~~~p~~p 705 (729) T protein:vir:10 657 NIVEPFLRDVQAKRGIF-----------------------------DFVVICD-ETNNTAAVIDSNEF-VADIFIKPARS 705 (729) T ss_pred HHHHHHHHHHHhcccee-----------------------------eeEEEEc-CCCCCHHHhhCCeE-EEEEEEEecCC Confidence 99999999999999985 5888886 57788888887777 48889999999 Q ss_pred EEEEEeeeeeC Q lcl|NC_019451. 494 IRFVEGSDVMI 504 (504) Q Consensus 494 Ih~v~i~~~~v 504 (504) +++|+++-.-. T Consensus 706 ~e~i~~~~~~~ 716 (729) T protein:vir:10 706 INFIGLTFVAT 716 (729) T ss_pred ccEEEEEEEEe Confidence 99998874444 No 42 >protein:vir:10336 Length: 386 # NCBI annotation: ORF39 # Family: family:all:115 # MgeID: mge:182 # MgeName: VHML # Cross-refs: genbank:acc:NP_758931;genbank:gi:27311206;genbank:GeneID:956116 Probab=97.09 E-value=0.00018 Score=41.17 Aligned_cols=360 Identities=11% Similarity=0.029 Sum_probs=194.9 Q ss_pred CC----CccceEEEeeeecccccccccccceEEEeccc-----ccCccceEEecCHHHHHHhcCCCcHHHHHHHHHhccC Q lcl|NC_019451. 
1 MI----SQSRYIRIISGVGAGAPVAGRKLILRVMTTNN-----VIPPGIVIEFDNANAVLSYFGAQSEEYQRAAAYFKFI 71 (504) Q Consensus 1 mi----p~s~iv~V~~~v~~~~~~~~~~~~~l~l~~~~-----~~~~~r~~~y~s~~~V~~~Fg~~s~ey~aA~~yF~~~ 71 (504) |- |=-.++.|.....+... .....+.|++... .+|...-...++..+....||..-.-+.+...+|.+- T Consensus 1 M~~~~~~Gv~v~ev~~~~~~i~~--v~tav~~~vg~a~~a~~~~~~~~~pv~i~s~~~~~~~~g~~~tl~~a~~~~~~~g 78 (386) T protein:vir:10 1 MAEQYLHGAEVVEIDNGARPIRT--AQSGVIGLVGTAPDADATAFPLNTPVLIAGSRREAAKLGAGGTLPQAIDGIFDQT 78 (386) T ss_pred CccccCCCeEEEEcCCCcccccc--cCcceeEEEEecCCCCCcccccccceEecchHHHHhhcCCCcchhHHHHHHhccC Confidence 54 23334444333322222 2234444444332 2344444455666666778899989999999999873 Q ss_pred CCCCcccceEEEEeeeccCCcceeeecccchhhHhHhhccCceEEEEEcccceeeeeeccccccchHHHHHHHHhhhhcc Q lcl|NC_019451. 72 SKSVNSPSSISFARWVNTAIAPMVVGDNLPKTIADFAGFSAGVLTIMVGAAEQNITAIDTSAATSMDNVASIIQTEIRKN 151 (504) Q Consensus 72 ~~~~~~P~~l~igr~~~~a~~~~l~g~~~~~~~~~~~~~~~g~~titi~g~~~~~~~i~~s~ats~~~vA~~i~~~i~a~ 151 (504) ....++-+....... ..+.... +++.. ..+.. ...+....... T Consensus 79 ------g~~~~vv~~~~~~~~--------~~t~~~~-----------ig~~~---------~~t~~---~tgl~~l~~~~ 121 (386) T protein:vir:10 79 ------GAVVVVIRVDEGVDS--------AATQSNV-----------IGKVD---------ADTEQ---YTGILALLSAE 121 (386) T ss_pred ------ceeEEEeeccccccc--------cccchhh-----------hcccc---------cccch---hhhhHHhhhhc Confidence 334555554321110 0000000 00000 00000 00000000000 Q ss_pred cccccceeEEEEecccceeEEecccCcceeEEEEeeccchhhhhhhhcccCcceeecccccccHHHHHHHHHhhccceeE Q lcl|NC_019451. 152 ADPQLAQATVTWNQNTNQFTLVGATIGTGVLAVAKSADPQDMSTALGWSTSNVVNVAGQAADLPDAAVAKSTNVSNNFGS 231 (504) Q Consensus 152 ~~~~~~~a~vt~d~~~~~F~its~t~ga~s~~~~~sa~~~~ia~~l~~t~~~~~~~~g~aaet~~~al~~~~~~~~~wy~ 231 (504) ..+... + ++.. .++. .......+.+......+-. 
T Consensus 122 ----------------~~~~~~----------------p-~i~~-----------ap~~--~~~~~v~~~l~~~~~~~~~ 155 (386) T protein:vir:10 122 ----------------NTVKVQ----------------P-RILI-----------APGF--SNQKAVADQLVSVADTAAW 155 (386) T ss_pred ----------------cccccc----------------c-cccc-----------cccc--cchhHHHHHHHHhhcceEE Confidence 000000 0 0000 0000 0011122233333334444 Q ss_pred EEEEeccCCHHHHHHHHHHHhhcCCcEEEEEeccccchhHHHHHHhhhcceeEEEEeccCCCccHHHHHHHHHHhcCcCc Q lcl|NC_019451. 232 FLFAGAPLDNDQIKAVSAWNAAQNNQFIYTVATSLANLGTLFTLVNGNAGTALNVLSATAANDFVEQCPSEILAATNYDE 311 (504) Q Consensus 232 ~~~~~~~~~~~~~~a~A~w~e~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~aa~~g~~as~nf~~ 311 (504) +...+... .......+|.+.-+..+...++-.- ... ...... . -...+.+.++|.++.+|... T Consensus 156 ~~~~~~~~--~~~~~a~~~~~~~~s~~~~~~~p~~---~v~----~~~~~~-~-------~~~p~s~~~ag~~a~~D~~~ 218 (386) T protein:vir:10 156 LCHSGWSN--TTDAAAITYRELFGSRRCEVVDPWY---KVW----DVETSA-H-------IIQPPSARHAGVMAKVHNTL 218 (386) T ss_pred EEEeCCCC--CchHHHHHhhhcccccceEEecCce---eee----cccccc-c-------eeechHHHHHHHHHHhhhcC Confidence 44443321 1222334565555555444432110 000 000000 0 01124567778888887533 Q ss_pred CCc-eeeecccccCccccc--------cCCHHHHHHHHhCCCeEEEEEeeccceeeEEEcCEEeCCccccchhhhhhhHH Q lcl|NC_019451. 312 PGA-SQNYMYYQFPGRNIT--------VSDDTVANTVDKSRGNYIGVTQANGQQLAFYQRGILCGGPTDAVDMNVYANEI 382 (504) Q Consensus 312 ~~g-~~T~kfk~l~Gv~a~--------~lt~t~~~al~~~~~n~y~~~~~~~~~~~~~~~G~~~~G~y~~~wiD~~~~~d 382 (504) | ......|.+.||.-- ..++.|.+.|.++|++.. +.+. 
.+ .++.+.+++++-.|.+|-+.+-.+ T Consensus 219 --G~~~spaN~~l~gv~~~~~~~~~~~~~~~~~~~~l~~~gi~~~--~~~~--G~-~~wG~rT~~~d~~~~~i~vrR~~~ 291 (386) T protein:vir:10 219 --GFWWSNSNQEILGIDGLCRPVDFKLDDPTCRANLLNAKEVTTT--IQQN--GF-RVWGDRTCSADSKWAFKNVVITND 291 (386) T ss_pred --CcEEccCCceeecccccceecccccccCcchhhhhhhcCcEEE--EcCC--CE-EEEcccccCCCcccceeehhhHHH Confidence 3 234456667766422 246889999999999976 3333 33 456888888777788899999999 Q ss_pred HHHHHHHHHHHHHHhcCCCCCcCHHHHHHHHHHHHHHHHHHHhcCccccccccCcccceeeccccCCcccccceeecceE Q lcl|NC_019451. 383 WLKSAIAQALLDLFLNVNAVPASSTGEAMTLAVLQPVLDKATANGTFTYGKEISAVQQQYITQVTGDRRAWRQVQTLGYW 462 (504) Q Consensus 383 wl~~~lq~~l~~l~~~~~kIPyt~~G~~~l~~~v~~vl~~a~~nG~Ia~G~~~~~~q~~~i~~~~g~~~~~~~~~~~GY~ 462 (504) |+.+.|+..+...+-. |.|+.=...|+..++.-|+.-+++|.|. ||. T Consensus 292 ~i~~~~~~~~~~~v~e----~~~~~~~~~i~~~i~~~L~~l~~~g~l~-----------------------------g~~ 338 (386) T protein:vir:10 292 MIADSLVRNHLWAVDR----NITKTYVEDVTEGVNNYLRHLKNIGAIA-----------------------------GGE 338 (386) T ss_pred HHHHHHHHHHHHhccC----CCCHHHHHHHHHHHHHHHHHHHhCCcee-----------------------------eeE Confidence 9999999888874332 6789999999999999999999999996 477 Q ss_pred EEecchHhCCHHHHhhcccCceEEEEEECCeEEEEEeeeeeC Q lcl|NC_019451. 463 INITFSSYTNSNTGLTEWKANYTLIYSKGDAIRFVEGSDVMI 504 (504) Q Consensus 463 v~~~~~~~~s~~dr~~R~~~~i~~~~~~aGAIh~v~i~~~~v 504 (504) +.+. +++.+++|+.+.+. 
.+.+.+.-..-+++|++...-- T Consensus 339 v~~d-~~~nt~~~~~~G~~-~~~i~~~p~~p~e~i~~~~~~~ 378 (386) T protein:vir:10 339 CWVD-PELNSPDQIQQGKV-YFDYDFSAYAPAEHITFRSHMV 378 (386) T ss_pred EEEc-ccCCCHHHhhCCeE-EEEEEEEecCCceeEEEEEEEe Confidence 8876 57888899888887 4899999999999999888766 No 43 >protein:vir:101187 Length: 663 # NCBI annotation: tail sheath protein # Family: family:all:661 # MgeID: mge:1582 # MgeName: 44RR2.8t # Cross-refs: genbank:acc:NP_932509;genbank:gi:37651635;genbank:GeneID:2610680 Probab=97.09 E-value=0.00018 Score=41.15 Aligned_cols=458 Identities=13% Similarity=0.052 Sum_probs=214.2 Q ss_pred CCCccceEEEeeeecccccccccccceEEEecccccCccceEEecCHHHHHHhcCC---CcHHHHHHHHHhccCCCCCcc Q lcl|NC_019451. 1 MISQSRYIRIISGVGAGAPVAGRKLILRVMTTNNVIPPGIVIEFDNANAVLSYFGA---QSEEYQRAAAYFKFISKSVNS 77 (504) Q Consensus 1 mip~s~iv~V~~~v~~~~~~~~~~~~~l~l~~~~~~~~~r~~~y~s~~~V~~~Fg~---~s~ey~aA~~yF~~~~~~~~~ 77 (504) |--++-=|-|.--=.+............|++.-..=|++....-+|..|..+.||. .+.++.+...||-+- T Consensus 1 ~~~~~PgVyv~e~~~~~~i~~v~ts~~~fvG~~~~Gp~~~p~~i~s~~d~~~~fG~~~~~~~~~~~v~~~f~ng------ 74 (663) T protein:vir:10 1 MALLSPGIEMKETSINSTVVRSATGRAAIVGKFAWGPAYEVRQVTNEVELVDMFGSPDNVTAPYFMSAMNFLQY------ 74 (663) T ss_pred CceecCceEEEEecCcccccccCccceeEEeeeccCCCCccEEecCHHHHHHHhCCcCccchhHHHHHHHHHhC------ Confidence 66666555553211233344445677788887776677666777889999999998 566778888888863 Q ss_pred cceEEEEeeecc---CCcceeeecc-cch---------------hh-----H---hHhhcc-Cc-eEEEEEccc-----c Q lcl|NC_019451. 78 PSSISFARWVNT---AIAPMVVGDN-LPK---------------TI-----A---DFAGFS-AG-VLTIMVGAA-----E 123 (504) Q Consensus 78 P~~l~igr~~~~---a~~~~l~g~~-~~~---------------~~-----~---~~~~~~-~g-~~titi~g~-----~ 123 (504) -+++||.|.... +.+..+.++. +.. .. . ....+. ++ .+.+.+... . 
T Consensus 75 g~~~~vvRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~g~~~~v~~~~~~~~~~~~~~~~~~n~~~~~v~~~~a~~~~~~ 154 (663) T protein:vir:10 75 GNDLRLVRVIDMEKAKNASPLVNQVSVTITTEGQGYTVGDAITVKYNNATITEAGKVTAVDSDGKIKSLFVPTAEIIAKT 154 (663) T ss_pred CCeEEEEEccCCcccccccccCCcceeeeeccccCccccccccccccccccccccceeeeccCCceEEEEeccccccccc Confidence 457999998542 1111111100 000 00 0 000000 00 011111000 0 Q ss_pred eeee---------eeccccccc-------------------------hHHHHH-HHHhhhhccccccc-------c--ee Q lcl|NC_019451. 124 QNIT---------AIDTSAATS-------------------------MDNVAS-IIQTEIRKNADPQL-------A--QA 159 (504) Q Consensus 124 ~~~~---------~i~~s~ats-------------------------~~~vA~-~i~~~i~a~~~~~~-------~--~a 159 (504) ..+. ...++.... ..+... ...........+.. . .. T Consensus 155 ~~v~~~~~~~~~~~~~~~~~~~~~~~~~~v~~vv~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~a~~~G~~Gn~i 234 (663) T protein:vir:10 155 RQLGTYPTLGDNWRIDVSGASGGSAAALALGNIVVDSGVTFGNSEDAPAVMTSPAVMEKYAKFGMPLVSAVYPGEIGSTV 234 (663) T ss_pred cccceeeeccccceeEeeeccccccccccccceecccceeeEeeccccccccccchhhhcccccceeeeeecccccccce Confidence 0000 000000000 000000 00000000000000 0 00 Q ss_pred EEEE-ec-------------------------------ccceeEEecccCcceeEEEEeec---c----chhhhhhhhcc Q lcl|NC_019451. 160 TVTW-NQ-------------------------------NTNQFTLVGATIGTGVLAVAKSA---D----PQDMSTALGWS 200 (504) Q Consensus 160 ~vt~-d~-------------------------------~~~~F~its~t~ga~s~~~~~sa---~----~~~ia~~l~~t 200 (504) .+.. +. ....|.+.-...+........+. . +........+. T Consensus 235 ~v~i~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~~~~~~~~~~~~~~ 314 (663) T protein:vir:10 235 EVEIVSKTAFNSGAQQTIYPFGGTRTSNARGVIQYGPMTDDQFAIIVRRDGIVVESTVLSTRKGDRDVYGSNIFMDDYFR 314 (663) T ss_pred eEEecccccccccccccccccccccccccceeeeeccccccceeEEEecCCcceeeeeeeecccccccchhhhhhhhhhc Confidence 0000 00 00000000000000000000000 0 00000000000 Q ss_pred cCc-----------------cee-ecccc------cccHHHHHHHHHhhccceeEEEEEecc--CCHHH----HHHHHHH Q lcl|NC_019451. 
201 TSN-----------------VVN-VAGQA------ADLPDAAVAKSTNVSNNFGSFLFAGAP--LDNDQ----IKAVSAW 250 (504) Q Consensus 201 ~~~-----------------~~~-~~g~a------aet~~~al~~~~~~~~~wy~~~~~~~~--~~~~~----~~a~A~w 250 (504) .+. ... ..|.+ ..+...+++.+.+...-.-.+++.... ...++ ..++... T Consensus 315 ~~~~~~~~~~~~~~~~~~~~~~~l~gg~d~~~~~~~~~~~~~~~~l~~~~~~~~~~~i~~~~~~~~~~~~~~v~~al~~~ 394 (663) T protein:vir:10 315 NGGSNFIFASSEGWPAGFTGIIQLGGGTSANADVGADELIKGWDLFSDREALHVNLMIAGACGSDGAEIASTVQKYVVSL 394 (663) T ss_pred cCcceEEEEeecccCccccceeEcccccCCCccccchhhHHHHHhhhcccccceeEEEeccCCCCchhhHHHHHHHHHHH Confidence 000 000 01111 122333444444432111112222211 12222 2344555 Q ss_pred HhhcCCcEEEEEeccc--------c-chhHHHHHHh--------------hhcceeEEEEeccC-----CC----ccHHH Q lcl|NC_019451. 251 NAAQNNQFIYTVATSL--------A-NLGTLFTLVN--------------GNAGTALNVLSATA-----AN----DFVEQ 298 (504) Q Consensus 251 ~e~~~~~~~~~~~~~d--------~-~~~~~~~~~~--------------~~~~~~~~~~~~~~-----~~----~~~~a 298 (504) ++....++. ++.... . .......-.. ....+...++++.. .+ .-+.+ T Consensus 395 a~~~~~~~a-i~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~l~~P~~~~~d~~~~~~~~~p~s~ 473 (663) T protein:vir:10 395 ADDRQDCVA-IVNPPAELMVGIPTSTAVKNIVEWRNGMTGSGEVVDNNMNISSTYAFIIGNYKYQYDKYNDINRWVPLAA 473 (663) T ss_pred HHhhCCEEE-EEecCcccccccccccchHHHHHHHHhccccccchhhccccCcceEEEEccceEEecccCCceEEechhH Confidence 555433333 221110 0 0000000000 00112223332210 11 13556 Q ss_pred HHHHHHHhcCcCcCCc-eeeecccccCccc-----cccCCHHHHHHHHhCCCeEEEEEeeccceeeEEEcCEEeCCcc-c Q lcl|NC_019451. 299 CPSEILAATNYDEPGA-SQNYMYYQFPGRN-----ITVSDDTVANTVDKSRGNYIGVTQANGQQLAFYQRGILCGGPT-D 371 (504) Q Consensus 299 a~~g~~as~nf~~~~g-~~T~kfk~l~Gv~-----a~~lt~t~~~al~~~~~n~y~~~~~~~~~~~~~~~G~~~~G~y-~ 371 (504) .++|.++.+|.++ | ......|.+.+|. ...+++.|.+.|..+|+|....+-+. 
+.+ .++..+++++.- + T Consensus 474 ~vAGl~Ar~D~~~--g~~~sPan~~~~~i~g~~~~~~~~~~~~~~~Ln~~gin~i~~~~~~-~G~-~~wG~rT~s~~~s~ 549 (663) T protein:vir:10 474 DIAGLCAYTDQVS--HPWMSPAGYRRGQIRNCIKLAIEPKQSMRDTMYQVAINPVTGFAGG-DGF-VLFGDKMATQVPSP 549 (663) T ss_pred HHHHHHHHhhccC--CceEccCCceeccccccccceeccChhHHHHHhhCCceEEEEEeCC-CcE-EEEcccccCCCCcc Confidence 7778888887543 3 1122334433332 34678999999999999998877542 223 346777777652 6 Q ss_pred cchhhhhhhHHHHHHHHHHHHHHHHhcCCCCCcCHHHHHHHHHHHHHHHHHHHhcCccccccccCcccceeeccccCCcc Q lcl|NC_019451. 372 AVDMNVYANEIWLKSAIAQALLDLFLNVNAVPASSTGEAMTLAVLQPVLDKATANGTFTYGKEISAVQQQYITQVTGDRR 451 (504) Q Consensus 372 ~~wiD~~~~~dwl~~~lq~~l~~l~~~~~kIPyt~~G~~~l~~~v~~vl~~a~~nG~Ia~G~~~~~~q~~~i~~~~g~~~ 451 (504) |++|-+.+-.+|+.+.|+...+...-. |.|+.-...|+..|+.-|++-+++|.|. T Consensus 550 ~~~i~vrR~~~~i~~si~~~~~~~v~e----~n~~~l~~~i~~~i~~~L~~l~~~gal~--------------------- 604 (663) T protein:vir:10 550 FDRINVRRLFNMLKKNIGDTSKYELFE----NNDAFTRQSFRMETSQYLDGIRSLGGCY--------------------- 604 (663) T ss_pred cceEehhhHHHHHHHHHHHHHHHhccC----CCCHHHHHHHHHHHHHHHHHHHhcCcee--------------------- Confidence 778999999999999999888773322 5788899999999999999999999986 Q ss_pred cccceeecceEEEecchHhCCHHHHhhcccCceEEEEEECCeEEEEEeeeeeC Q lcl|NC_019451. 452 AWRQVQTLGYWINITFSSYTNSNTGLTEWKANYTLIYSKGDAIRFVEGSDVMI 504 (504) Q Consensus 452 ~~~~~~~~GY~v~~~~~~~~s~~dr~~R~~~~i~~~~~~aGAIh~v~i~~~~v 504 (504) ||++.++ .+..+++|+.+-+. .+.+.++-.-.+++|+++-... 
T Consensus 605 --------g~~v~~d-~~~nt~~~i~~G~~-~~~i~~~p~~pae~i~~~~~~~ 647 (663) T protein:vir:10 605 --------DFRVVCD-TTNNTPNVIDRNEF-VGTIYVKPPRSINYITLNMVAT 647 (663) T ss_pred --------eeEEEEc-CCCCCHHHhhCCeE-EEEEEEEecCCcceEEEEEEEe Confidence 5889987 57788888888777 5889999999999998875544 No 44 >protein:vir:96740 Length: 388 # NCBI annotation: phage tail sheath protein FI-like # Family: family:all:115 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039839;genbank:gi:126010913;genbank:GeneID:5076250 Probab=97.00 E-value=0.00022 Score=40.68 Aligned_cols=349 Identities=8% Similarity=-0.026 Sum_probs=152.7 Q ss_pred CCCCC-cccceEEEEeeeccCCcceeeecccchhhHhHhhccCceEEEEEcccceeeeeecccc---ccchHHHHHHHHh Q lcl|NC_019451. 71 ISKSV-NSPSSISFARWVNTAIAPMVVGDNLPKTIADFAGFSAGVLTIMVGAAEQNITAIDTSA---ATSMDNVASIIQT 146 (504) Q Consensus 71 ~~~~~-~~P~~l~igr~~~~a~~~~l~g~~~~~~~~~~~~~~~g~~titi~g~~~~~~~i~~s~---ats~~~vA~~i~~ 146 (504) -|--+ ..| -+++=+......+-.. +.+...-+ |+..........+.. ..+..+.+.. .+ T Consensus 1 m~~~~~~~h-Gv~v~ev~~g~~~i~~--------------~~tavi~~-Vgta~~ad~~~p~~~~~~i~~~~d~~~~-~~ 63 (388) T protein:vir:96 1 MPVIDQFEH-NGISIETHEPPPPMGP--------------PGDNVVAW-VVTAPDKHADVAFSVPFRVANTADAQYL-DS 63 (388) T ss_pred CCCCCCCCC-ceEEEEcCCCcccccc--------------cCcceeEE-EEecCCCccccccccceeeecchhhhhh-hc Confidence 11000 011 1244444332211110 00110000 000000000000000 0111111111 11 Q ss_pred hhhcccccccceeEEEEecccceeEEecccCcceeEEEEeeccchhhhhhhhcccCcceeeccccccc-HHHHHHHHHhh Q lcl|NC_019451. 147 EIRKNADPQLAQATVTWNQNTNQFTLVGATIGTGVLAVAKSADPQDMSTALGWSTSNVVNVAGQAADL-PDAAVAKSTNV 225 (504) Q Consensus 147 ~i~a~~~~~~~~a~vt~d~~~~~F~its~t~ga~s~~~~~sa~~~~ia~~l~~t~~~~~~~~g~aaet-~~~al~~~~~~ 225 (504) ...... .-++++..-|. .+....+......+.+... +. .-.+.+.++.+ .-..+.++.+. 
T Consensus 64 ~~~~~g--------tl~~al~~~~~-----~~~~~~~vv~v~~g~~~~a----t~--a~iig~~~~~tg~~~gl~al~~~ 124 (388) T protein:vir:96 64 TGNELG--------TGWHAASETLK-----KTSVPQYFIVVPEGADDAA----TM--ANIIGGIDPTTGRRTGIAALTEC 124 (388) T ss_pred cccccc--------cchhhhHhhhc-----cCCceEEEEEecccccccc----cc--ceeeeecccccchhhHHHHhhhc Confidence 000000 00000000000 0000001100000000000 00 00011111111 11223333332 Q ss_pred ccceeEEEEEecc-CCHHHHHHHHHHHhhcCCcEEEEEeccccch-hHHH--HHHhh----hcceeEEEEeccC------ Q lcl|NC_019451. 226 SNNFGSFLFAGAP-LDNDQIKAVSAWNAAQNNQFIYTVATSLANL-GTLF--TLVNG----NAGTALNVLSATA------ 291 (504) Q Consensus 226 ~~~wy~~~~~~~~-~~~~~~~a~A~w~e~~~~~~~~~~~~~d~~~-~~~~--~~~~~----~~~~~~~~~~~~~------ 291 (504) . .-..++.+... ....-..++...++.-+ .| ++.+..... .... ..... ...+...++++.. T Consensus 125 ~-~~p~il~aPg~s~~~~v~~al~~~~~~~~-~~--~i~D~p~~~~~~~~~~~~~~~~~~~~s~~~~~~~P~~~~~d~~~ 200 (388) T protein:vir:96 125 T-ERPTLIGAPGFSQNKAVIDALASMAKRLK-CR--AVIDGPSGSTQDAIDLSGLLGGEGTGHDRVYMVDPMPAIYSRKA 200 (388) T ss_pred c-cceeEEEeeccccchHHHHHHHHHHhhcC-cE--EEEeccCCchhHHHHHHhhhhccCcCcceEEEEeCceeeecccC Confidence 2 11233333221 11222334444444321 22 222221111 1100 00000 1112222222110 Q ss_pred ---CCccHHHHHHHHHHhcCcCcCCceeeecccccCcccc-----ccCCHHHHHHHHhCCCeEEEEEeeccceeeEEEcC Q lcl|NC_019451. 292 ---ANDFVEQCPSEILAATNYDEPGASQNYMYYQFPGRNI-----TVSDDTVANTVDKSRGNYIGVTQANGQQLAFYQRG 363 (504) Q Consensus 292 ---~~~~~~aa~~g~~as~nf~~~~g~~T~kfk~l~Gv~a-----~~lt~t~~~al~~~~~n~y~~~~~~~~~~~~~~~G 363 (504) -...+...++|..+.+|+...+.-..+. 
+.|+.- ..++++|++.|..+|+|.+..+.+.|- .++.+ T Consensus 201 ~~~~~~p~s~~~AG~~a~~D~~~spaN~~i~---i~g~~~~~~~~~~~~~~~~~~Ln~~gI~~i~~~~~~G~---~~wG~ 274 (388) T protein:vir:96 201 QGNIYVPPSTIAMGAVAAVKPWESPGNQGVL---IQDVARVIDYNILDKSTEGDLLNRNGVSYFARTSMGGF---SLIGN 274 (388) T ss_pred CceeeechHHHHHHHHHhhcCcccccCeeEE---eeeecccccccccCChhhHHhhhhcCceEEEEecCCcE---EEEcc Confidence 1123567777888888865444432222 344431 234678999999999999988866553 34788 Q ss_pred EEeCCccccchhhhhhhHHHHHHHHHHHHHHHHhcCCCCCcCHHHHHHHHHHHHHHHHHHHhcCccccccccCcccceee Q lcl|NC_019451. 364 ILCGGPTDAVDMNVYANEIWLKSAIAQALLDLFLNVNAVPASSTGEAMTLAVLQPVLDKATANGTFTYGKEISAVQQQYI 443 (504) Q Consensus 364 ~~~~G~y~~~wiD~~~~~dwl~~~lq~~l~~l~~~~~kIPyt~~G~~~l~~~v~~vl~~a~~nG~Ia~G~~~~~~q~~~i 443 (504) .+++..| |-+.+-.+|+++.|+..+....- + |.|+.=...|+..++.-|+.-+++|.|.. T Consensus 275 rT~~~~~----i~vrR~~~~i~~si~~~~~~~v~---e-pn~~~~~~~i~~~i~~fL~~l~~~Gal~g------------ 334 (388) T protein:vir:96 275 RTVTGKF----ISFVGLEDAIARKLEAASQRAMS---K-QLTKSFMEQEIKKINLFMQDLVAAEIIPG------------ 334 (388) T ss_pred cccCCcc----eeehhhHHHHHHHHHHHHHHhcc---C-CCCHHHHHHHHHHHHHHHHHHHhCCceee------------ Confidence 8887544 89999999999999988876332 2 57888899999999999999999999862 Q ss_pred ccccCCcccccceeecceEEEecchHhCCHHHHhhcccCceEEEEEECCeEEEEEeeeeeC Q lcl|NC_019451. 444 TQVTGDRRAWRQVQTLGYWINITFSSYTNSNTGLTEWKANYTLIYSKGDAIRFVEGSDVMI 504 (504) Q Consensus 444 ~~~~g~~~~~~~~~~~GY~v~~~~~~~~s~~dr~~R~~~~i~~~~~~aGAIh~v~i~~~~v 504 (504) |.++. .++..+++|+.+-+. 
.+.+.+.-.-.++.|++....- T Consensus 335 -----------------~~~~~-d~~~nt~~~i~~G~~-~~~i~~~p~~pae~I~~~~~~~ 376 (388) T protein:vir:96 335 -----------------GEVYL-HPTLNTVERYKNGSW-YIVIDYGRYSPNEHMIFHLNAV 376 (388) T ss_pred -----------------eEEEE-ecCCCCHHHhhCCEE-EEEEEEEecCCcceEEEEEEEc Confidence 44444 345677777777666 4888888888899998887766 No 45 >protein:vir:6594 Length: 666 # NCBI annotation: tail sheath protein # Family: family:all:661 # MgeID: mge:139 # MgeName: RB49 # Cross-refs: genbank:acc:NP_891725;genbank:gi:33620670;genbank:GeneID:1725344 Probab=96.90 E-value=0.00027 Score=40.15 Aligned_cols=458 Identities=11% Similarity=0.014 Sum_probs=216.3 Q ss_pred CCCccceEEEeeeecccccccccccceEEEecccccCccceEEecCHHHHHHhcCC---CcHHHHHHHHHhccCCCCCcc Q lcl|NC_019451. 1 MISQSRYIRIISGVGAGAPVAGRKLILRVMTTNNVIPPGIVIEFDNANAVLSYFGA---QSEEYQRAAAYFKFISKSVNS 77 (504) Q Consensus 1 mip~s~iv~V~~~v~~~~~~~~~~~~~l~l~~~~~~~~~r~~~y~s~~~V~~~Fg~---~s~ey~aA~~yF~~~~~~~~~ 77 (504) |--++-=|-|.--=.+..........+.|++....=|++....-+|..|..+.||. .+.++.+...||-+- T Consensus 1 ~~~~~PgVyv~e~~~~~~i~~v~ts~~~fvG~~~~Gp~~~p~~v~s~~~~~~~fG~~~~~~~~~~~~~~~f~ng------ 74 (666) T protein:vir:65 1 MTLLSPGFETKETTLSTTIVQSETGRAALVGKFQWGPAFQIIQVTNEVELVNKFGQPDNNTADYFMSGANFLQY------ 74 (666) T ss_pred CceecCceEEEEecCcccccccCcccceEEecccCCCCccCEEecCHHHHHHHcCCccccchhHHHHHHHHHhc------ Confidence 65555544443211233344445677888888777777777888889999999994 466778888888764 Q ss_pred cceEEEEeeeccC-Cc-ceeeecccch--hhHhHhhccCceEEEEEcccceee----eeecccc---------------c Q lcl|NC_019451. 78 PSSISFARWVNTA-IA-PMVVGDNLPK--TIADFAGFSAGVLTIMVGAAEQNI----TAIDTSA---------------A 134 (504) Q Consensus 78 P~~l~igr~~~~a-~~-~~l~g~~~~~--~~~~~~~~~~g~~titi~g~~~~~----~~i~~s~---------------a 134 (504) =+++||-|..... .. +...++.... ....-.......+.+...+..... ..++..+ . 
T Consensus 75 g~~~~vvrv~~~~~~~~~~~~~~~~~~~~~~~g~~~~~g~~~~V~~~~~~~~~~~~~~~~~~~~~~~g~~~~t~~~~~~~ 154 (666) T protein:vir:65 75 GNDLRVVRVLNKEKAKNATALAGNVEFEITNEGSNYEVGDTIKIKHNRQDIETAGKVTKVDGDGKVKGVFIPTGKIIAHA 154 (666) T ss_pred CceEEEEEccCcccccccccccCceeeeEeeccccccccceEEEEecccccccccccccccccccccccccccceeeccc Confidence 3569999984421 11 1111110000 000000000001222221111100 0000000 0 Q ss_pred cchHHHHHHHHhh---hhcccccccceeEE--EEec---------------ccceeEEecccC-------------ccee Q lcl|NC_019451. 135 TSMDNVASIIQTE---IRKNADPQLAQATV--TWNQ---------------NTNQFTLVGATI-------------GTGV 181 (504) Q Consensus 135 ts~~~vA~~i~~~---i~a~~~~~~~~a~v--t~d~---------------~~~~F~its~t~-------------ga~s 181 (504) .+....+..+... +............+ ++.. ....+....... +... T Consensus 155 ~~~g~~~~l~~~~~~~~~~~~~~~~~a~sv~~~~~~g~~~~~~~~~a~~~~~~~~~~~~~~~~~~~a~~A~~~g~~g~~i 234 (666) T protein:vir:65 155 KAIGVYPELDGGWTAEFTSSSGNGSAALSVTKIVTDSGLLLTDLETSRANITNQTFLTKLKKYDMPAVSAIYAGEIGNSL 234 (666) T ss_pred cccCcceeEeeccceeecccCcccccceeeeecccccceeeeeecccccccccccccccccccccceeeeeeccccccce Confidence 0000000000000 00000000000000 0000 000111110000 0000 Q ss_pred EEEEeecc-chhhhhhhh------------------------------------------------------------c- Q lcl|NC_019451. 182 LAVAKSAD-PQDMSTALG------------------------------------------------------------W- 199 (504) Q Consensus 182 ~~~~~sa~-~~~ia~~l~------------------------------------------------------------~- 199 (504) .+...... ....+..++ + T Consensus 235 ~v~i~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~e~~~~~~~~~~~~~~~~~~~~~~~~~ 314 (666) T protein:vir:65 235 EVEILARSAFKNTAPDLTMYPYGGERTAARNLIPYAPQNDNQYAFIVRRDGVVVESYVLSTLKGDKDVYGNSIYMDDFFA 314 (666) T ss_pred eEEeecccccccccccccccccccccccceeeecccccccccceeeeecCCcccceeecccCcccccccchhhhhhhhhc Confidence 00000000 000000000 0 Q ss_pred -ccCccee----------------ecccc--------------cccHHHHHHHHHhhccceeEEEEEecc-----CCHHH Q lcl|NC_019451. 
200 -STSNVVN----------------VAGQA--------------ADLPDAAVAKSTNVSNNFGSFLFAGAP-----LDNDQ 243 (504) Q Consensus 200 -t~~~~~~----------------~~g~a--------------aet~~~al~~~~~~~~~wy~~~~~~~~-----~~~~~ 243 (504) .....+. ..+.+ .......+..+.+.......++.+... ....- T Consensus 315 ~~~~~~v~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~l~~p~~~~~~~~~~~v 394 (666) T protein:vir:65 315 RGSSQYIYATAQGWVDGFSGIISLAGGVSANEATTGGVGADPFIGAMMQGWDLFAERESIHVNLLIAGACAGEGDAFSTV 394 (666) T ss_pred ccccceeeeecccccccccceEEccCCCCcCcccccccccccccccHHHHHHHHhhhhhccCCceeecCcCCccchhHHH Confidence 0000000 00000 001223333333322111122221110 01222 Q ss_pred HHHHHHHHhhcCCcEEEEEe------ccc--cchhHHHHHHhh-----------hcceeEEEEeccC-----CC---c-c Q lcl|NC_019451. 244 IKAVSAWNAAQNNQFIYTVA------TSL--ANLGTLFTLVNG-----------NAGTALNVLSATA-----AN---D-F 295 (504) Q Consensus 244 ~~a~A~w~e~~~~~~~~~~~------~~d--~~~~~~~~~~~~-----------~~~~~~~~~~~~~-----~~---~-~ 295 (504) ..++...++....++.+.-. +.. .+.......... ...+...++++.. .+ . - T Consensus 395 ~~~l~~~~~~~~~~~a~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p 474 (666) T protein:vir:65 395 QKHAVSIGDERQDCLVMVSPPRSTVVNIPVTTAIDNLIAWREGSGNYNENNMNINTTYAVIDGNYKYQYDKYNDVNRWVP 474 (666) T ss_pred HHHHHHHHhhccceEEEeccccceeeecCCCCCHHHHHHHHHhcccccccccccCcceEEEEcCceEEecccCCceeEec Confidence 34455555554443322110 111 111111110100 0112222222110 11 1 2 Q ss_pred HHHHHHHHHHhcCcCcCCc-eeeecccccCccc-----cccCCHHHHHHHHhCCCeEEEEEeeccceeeEEEcCEEeCCc Q lcl|NC_019451. 296 VEQCPSEILAATNYDEPGA-SQNYMYYQFPGRN-----ITVSDDTVANTVDKSRGNYIGVTQANGQQLAFYQRGILCGGP 369 (504) Q Consensus 296 ~~aa~~g~~as~nf~~~~g-~~T~kfk~l~Gv~-----a~~lt~t~~~al~~~~~n~y~~~~~~~~~~~~~~~G~~~~G~ 369 (504) +...+.|.++.+|..+ | ......|.+.||. .-.+++.|.+.|..+|+|+...+.+.|- .++.+.++++. 
T Consensus 475 ~sg~vAGl~Ar~D~~~--g~~~span~~~~~i~g~~~~~~~~~~~~~~~Ln~~gIn~i~~~~~~G~---~~wG~rT~~~~ 549 (666) T protein:vir:65 475 LAADIAGLCARTDAVS--QPWMSPAGYNRGQIMNVVKLAIEPRKAHRDRLYQAAINPVIGAGGEGF---ILMGDKTATTV 549 (666) T ss_pred hHHHHHHHHHHHhccC--CcEEccCCeecceeeccccceeecChhHHHhhhhCCceEEEEeCCCeE---EEEecccCCCC Confidence 4566678888887433 3 1223345544442 2356889999999999999988866543 45788887775 Q ss_pred c-ccchhhhhhhHHHHHHHHHHHHHHHHhcCCCCCcCHHHHHHHHHHHHHHHHHHHhcCccccccccCcccceeeccccC Q lcl|NC_019451. 370 T-DAVDMNVYANEIWLKSAIAQALLDLFLNVNAVPASSTGEAMTLAVLQPVLDKATANGTFTYGKEISAVQQQYITQVTG 448 (504) Q Consensus 370 y-~~~wiD~~~~~dwl~~~lq~~l~~l~~~~~kIPyt~~G~~~l~~~v~~vl~~a~~nG~Ia~G~~~~~~q~~~i~~~~g 448 (504) - +|.+|-+.+-.+|+++.|+...+-.+-. |.++.=...|+..|+.-|++-+++|.|. T Consensus 550 ~s~~~~i~vrR~~~~i~~si~~~~~~~v~e----pn~~~l~~~i~~~i~~~L~~l~~~gal~------------------ 607 (666) T protein:vir:65 550 PSPFDRINVRRLFNMLKKNIGDSSKYKLFE----NNDNFTRASFRMEVSQYLSTIRSLGGIY------------------ 607 (666) T ss_pred CcccceEehhhHHHHHHHHHHHHHHHhccC----CCCHHHHHHHHHHHHHHHHHHHhCCcee------------------ Confidence 3 5788999999999999999988874433 4688888999999999999999999996 Q ss_pred CcccccceeecceEEEecchHhCCHHHHhhcccCceEEEEEECCeEEEEEeeeeeC Q lcl|NC_019451. 449 DRRAWRQVQTLGYWINITFSSYTNSNTGLTEWKANYTLIYSKGDAIRFVEGSDVMI 504 (504) Q Consensus 449 ~~~~~~~~~~~GY~v~~~~~~~~s~~dr~~R~~~~i~~~~~~aGAIh~v~i~~~~v 504 (504) ||+|.++ .++.+++|+.+.+. .+.+.++-.-.++.|+++-... 
T Consensus 608 -----------g~~V~~d-~~~nt~~~i~~G~~-~~~i~~~p~~pae~i~~~~~~~ 650 (666) T protein:vir:65 608 -----------DFRVQCD-TTNNTPDVIDRNEF-VASMFIKPAKSINYIMLNFTAV 650 (666) T ss_pred -----------eeEEEEc-CCCCCHHHhhCCeE-EEEEEEEecCCcceEEEEEEEe Confidence 5899987 57888888887777 5999999999999999886555 No 46 >protein:vir:79181 Length: 390 # NCBI annotation: gp30, phage tail sheath protein # Family: family:all:115 # MgeID: mge:1866 # MgeName: phiE202 # Cross-refs: genbank:acc:YP_001111061;genbank:gi:134288772;genbank:GeneID:4960700 Probab=96.85 E-value=0.0003 Score=39.92 Aligned_cols=359 Identities=12% Similarity=0.071 Sum_probs=191.4 Q ss_pred CCC-c---cceEEEeeeecccccccccccceEEEeccc-----ccCccceEEecCHHHHHHhcCCCcHHHHHHHHHhccC Q lcl|NC_019451. 1 MIS-Q---SRYIRIISGVGAGAPVAGRKLILRVMTTNN-----VIPPGIVIEFDNANAVLSYFGAQSEEYQRAAAYFKFI 71 (504) Q Consensus 1 mip-~---s~iv~V~~~v~~~~~~~~~~~~~l~l~~~~-----~~~~~r~~~y~s~~~V~~~Fg~~s~ey~aA~~yF~~~ 71 (504) |-. . =.++.+..+ +.+........+-|++..+ .+|.+.-...++..+....||..-..+.+...+|.+- T Consensus 1 M~~~~~~Gv~v~e~~~~--~~~i~~~~tav~~~vg~a~dad~~~~p~n~pv~its~~~~~~~~g~~~tL~~al~~~~~~~ 78 (390) T protein:vir:79 1 MPQDYHHGVRVIEINEG--GRPIRSVSTAVLGVVCTAADADASAFPLNTPVLLTNVVAALGKAGKKGTLRRTLDAIGKQT 78 (390) T ss_pred CccccCCCeEEEEcCCC--cccccccCCceeEEEEecCCCCccccccccceEeecHHHHHHhcCCCccchhhhhhhcccc Confidence 433 1 112222222 2222222233444444322 3454333344666677778999888888888888873 Q ss_pred CCCCcccceEEEEeeeccCCcceeeecccchhhHhHhhccCceEEEEEcccceeeeeeccccccchHHHHHHHHhhhhcc Q lcl|NC_019451. 72 SKSVNSPSSISFARWVNTAIAPMVVGDNLPKTIADFAGFSAGVLTIMVGAAEQNITAIDTSAATSMDNVASIIQTEIRKN 151 (504) Q Consensus 72 ~~~~~~P~~l~igr~~~~a~~~~l~g~~~~~~~~~~~~~~~g~~titi~g~~~~~~~i~~s~ats~~~vA~~i~~~i~a~ 151 (504) . ...|+-+..+... -.... .-.+++.. ...-.+++.. +. ... 
T Consensus 79 ~------~~~~vv~v~~~~~--------~~~~~-----------~~~ig~~~---------~~~~~tgl~a-l~---~~~ 120 (390) T protein:vir:79 79 K------PLTVVVRVAEGKD--------ADETT-----------SNVIGTVT---------PDGKYTGIKA-LL---AAQ 120 (390) T ss_pred c------ceEEEEeeccccc--------ccccc-----------ceeeeccc---------ccccchhhhh-hh---hhh Confidence 2 3456665532110 00000 00000000 0000000000 00 000 Q ss_pred cccccceeEEEEecccceeEEecccCcceeEEEEeeccchhhhhhhhcccCcceeecccccccHHHHHHHHHhhccceeE Q lcl|NC_019451. 152 ADPQLAQATVTWNQNTNQFTLVGATIGTGVLAVAKSADPQDMSTALGWSTSNVVNVAGQAADLPDAAVAKSTNVSNNFGS 231 (504) Q Consensus 152 ~~~~~~~a~vt~d~~~~~F~its~t~ga~s~~~~~sa~~~~ia~~l~~t~~~~~~~~g~aaet~~~al~~~~~~~~~wy~ 231 (504) ..+.+ .+......+........++..+.+ .+.. T Consensus 121 ----------------~~~~~----------------------------~p~il~ap~~~~~~v~~~l~~~a~---~~~~ 153 (390) T protein:vir:79 121 ----------------GALGV----------------------------KPRILAAPGLDTQPVAAALAATAQ---SLRA 153 (390) T ss_pred ----------------hhhcc----------------------------ccccccCCcccchHHHHHHHHhhh---hcce Confidence 00000 000000011111122333333333 3445 Q ss_pred EEEEeccCCHHHHHHHHHHHhhcCCcEEEEEeccccchhHHHHHHhhhcceeEEEEeccCCCccHHHHHHHHHHhcCcCc Q lcl|NC_019451. 232 FLFAGAPLDNDQIKAVSAWNAAQNNQFIYTVATSLANLGTLFTLVNGNAGTALNVLSATAANDFVEQCPSEILAATNYDE 311 (504) Q Consensus 232 ~~~~~~~~~~~~~~a~A~w~e~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~aa~~g~~as~nf~~ 311 (504) +.+.+.+.. ....++.+|-+..+..+...++--- ... ...... ... ..+.+.+.|.++.+|.++ T Consensus 154 ~ai~D~p~~-~t~~~a~~~~~~~~s~~~~~~~p~~---~~~----d~~~~~-~~~-------~p~s~~~Ag~~a~~D~~~ 217 (390) T protein:vir:79 154 MAYVSASGC-KTKEEAAAYRRQFGQREIMVIWPDW---LGW----DDTTNS-TAV-------IPAPAIAAGLRAKIDNDI 217 (390) T ss_pred EEEEEccCC-CCHHHHHHHhcCCCCceEEEEcCce---eec----ccccCc-eeE-------eehHHHHHHHHHhhhccC Confidence 666555422 1233445666665555554443210 000 000000 011 124567778888887322 Q ss_pred CCc-eeeecccccCcccc--------ccCCHHHHHHHHhCCCeEEEEEeeccceeeEEEcCEEeCCccccchhhhhhhHH Q lcl|NC_019451. 
312 PGA-SQNYMYYQFPGRNI--------TVSDDTVANTVDKSRGNYIGVTQANGQQLAFYQRGILCGGPTDAVDMNVYANEI 382 (504) Q Consensus 312 ~~g-~~T~kfk~l~Gv~a--------~~lt~t~~~al~~~~~n~y~~~~~~~~~~~~~~~G~~~~G~y~~~wiD~~~~~d 382 (504) | ......|.+.|+.. +..+..|++.|..+|+|... .+. .+ .++.+.++++.-.|.+|-+.+-.+ T Consensus 218 --g~~~spsN~~i~gi~~~~~~~~~~~~~~~~~a~~Ln~~gi~t~~--~~~--G~-~~wG~rT~~~d~~~~~i~vrR~~~ 290 (390) T protein:vir:79 218 --GWHKTISNVVVNGVSGISADVSWDLQDPATDAGYLNEHEVTTLV--NRN--GF-RFWGERTCSDDPKFAFENYTRTAQ 290 (390) T ss_pred --CcEEccCCceeeccceeeeeccccccccchhhhhhhhcCcEEEE--cCC--CE-EEEeccccCCCcccceeeehhhHH Confidence 2 11222666666531 23356688899999999873 232 33 457888888887888899999999 Q ss_pred HHHHHHHHHHHHHHhcCCCCCcCHHHHHHHHHHHHHHHHHHHhcCccccccccCcccceeeccccCCcccccceeecceE Q lcl|NC_019451. 383 WLKSAIAQALLDLFLNVNAVPASSTGEAMTLAVLQPVLDKATANGTFTYGKEISAVQQQYITQVTGDRRAWRQVQTLGYW 462 (504) Q Consensus 383 wl~~~lq~~l~~l~~~~~kIPyt~~G~~~l~~~v~~vl~~a~~nG~Ia~G~~~~~~q~~~i~~~~g~~~~~~~~~~~GY~ 462 (504) |+.+.|+..+...+-. |-|..=...|+..++.-|+.-+++|.|. ||. T Consensus 291 ~i~~~i~~~~~~~v~e----~~~~~~~~~i~~~i~~~L~~l~~~gal~-----------------------------g~~ 337 (390) T protein:vir:79 291 VAADSIAEAQMPVVDG----PLNPSLARDIVESINGWFRQQVANGYLI-----------------------------GGS 337 (390) T ss_pred HHHHHHHHHHHHhccC----CCCHHHHHHHHHHHHHHHHHHHhCCcee-----------------------------eeE Confidence 9999999988874432 6789999999999999999999999996 466 Q ss_pred EEecchHhCCHHHHhhcccCceEEEEEECCeEEEEEeeeeeC Q lcl|NC_019451. 463 INITFSSYTNSNTGLTEWKANYTLIYSKGDAIRFVEGSDVMI 504 (504) Q Consensus 463 v~~~~~~~~s~~dr~~R~~~~i~~~~~~aGAIh~v~i~~~~v 504 (504) +.+. .++.+++|+.+-+. -+.+.+.-.-.++.|++.-... 
T Consensus 338 v~~d-~~~nt~~~i~~G~~-~~~i~~~p~~p~e~i~~~~~~~ 377 (390) T protein:vir:79 338 AWID-PEPNTADILASGKA-YIDYDYTPVPPLENLVLRQRIT 377 (390) T ss_pred EEEe-cCCCCHHHhhCCEE-EEEEEEEecCCcceEEEEEEEc Confidence 6665 35778888777766 4888888888999999888877 No 47 >protein:vir:5833 Length: 742 # NCBI annotation: similar to tail sheath protein # Family: family:all:661 # MgeID: mge:123 # MgeName: RM 378 # Cross-refs: genbank:acc:NP_835619;genbank:gi:30044022 Probab=96.63 E-value=0.00045 Score=38.92 Aligned_cols=435 Identities=11% Similarity=0.044 Sum_probs=167.5 Q ss_pred CCC-------cc-------------------c-----eEEEeeeecccccccccccceEEEecccccCc--c--ceEEec Q lcl|NC_019451. 1 MIS-------QS-------------------R-----YIRIISGVGAGAPVAGRKLILRVMTTNNVIPP--G--IVIEFD 45 (504) Q Consensus 1 mip-------~s-------------------~-----iv~V~~~v~~~~~~~~~~~~~l~l~~~~~~~~--~--r~~~y~ 45 (504) -+| -+ | |-+++++.-+..-.-..|-...-+-.--..|. | |.|.. T Consensus 198 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~- 276 (742) T protein:vir:58 198 WVYFAEYGTPTSSLTLYKGFYLEGIDLNSFNKQFVVSIENITVNREKGQVLYPSFDVVVHFRDIRGVSANTEYIRFRQV- 276 (742) T ss_pred cccccccCCCccceeeeecccccccccCcccceeeEEEeeeeecccCCceeccceeEEEEEeeccCCCCCccceeeeee- Confidence 000 00 0 22223333322222211111111110000010 1 11110 Q ss_pred CHHHHHHhcCCCcHHHH------------H-----HHHHhccCCCCCc------ccceEEEEeeeccCCcceeeecccch Q lcl|NC_019451. 46 NANAVLSYFGAQSEEYQ------------R-----AAAYFKFISKSVN------SPSSISFARWVNTAIAPMVVGDNLPK 102 (504) Q Consensus 46 s~~~V~~~Fg~~s~ey~------------a-----A~~yF~~~~~~~~------~P~~l~igr~~~~a~~~~l~g~~~~~ 102 (504) ++-..||.|. . ...|+++.|--.. ++.-.-+.+|......+. . 
T Consensus 277 -------~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~n~~~~~~~~~~~~~~~~~~~~s~~~~~~~~~~-------~ 342 (742) T protein:vir:58 277 -------NLNPESPNYIERVIGNMTFEFDGERIVTGGEYPNQVPFLRVVVSQDIKQNVAGVEKWVPVGFEGI-------Y 342 (742) T ss_pred -------ecCCCCcceeeecccceeeeeccceeeecccccccccceeeEeccccCcCccceeEEEecccccc-------c Confidence 1122222221 0 1122222110000 000011112211100000 0 Q ss_pred hhHhHhhccCceEEEEEcccceeee---------eeccccccchHHHHHHHHhhhhcccccccc-eeEEEEeccc----- Q lcl|NC_019451. 103 TIADFAGFSAGVLTIMVGAAEQNIT---------AIDTSAATSMDNVASIIQTEIRKNADPQLA-QATVTWNQNT----- 167 (504) Q Consensus 103 ~~~~~~~~~~g~~titi~g~~~~~~---------~i~~s~ats~~~vA~~i~~~i~a~~~~~~~-~a~vt~d~~~----- 167 (504) +..++.-+.+....+++-.+..... .+......++.-+... ......... ...+...-.. T Consensus 343 ~v~d~~~~~~~~~~v~~~~t~~~~~pp~~~~~~e~v~~ngG~~f~v~s~~-----~~g~~i~~~~as~~~s~ln~~~~V~ 417 (742) T protein:vir:58 343 SVGDFTVIVNELTNVSIPVTDSAIIPPMRFTRIEQITLSGGASFSVISNQ-----PYGFNIQDSRHSYWLSPFKDDELII 417 (742) T ss_pred cccceeeeccccccceeeccccccCCcccccccceeecccCcceEEEEec-----ccCcceeccCcceEEeccCCceEEE Confidence 0001111111111111111111000 0000000000000000 000000000 0000000000 Q ss_pred ------------ceeEEecccCccee-EEEEeeccchhhhhhhhcccCcceeecccccc----cHHHHHHHHHhhcccee Q lcl|NC_019451. 168 ------------NQFTLVGATIGTGV-LAVAKSADPQDMSTALGWSTSNVVNVAGQAAD----LPDAAVAKSTNVSNNFG 230 (504) Q Consensus 168 ------------~~F~its~t~ga~s-~~~~~sa~~~~ia~~l~~t~~~~~~~~g~aae----t~~~al~~~~~~~~~wy 230 (504) .++...+....... ........+.+-... ..........+.... ..-+.|.++.+. .+ . T Consensus 418 Gt~aa~~~~d~~t~~~v~s~~~alp~~a~sv~laGG~dg~v~--v~~~~~D~iG~~~~~d~~~adrTGL~ALlev-~e-V 493 (742) T protein:vir:58 418 GTELVLPALDVSTEFGVSSWEEALPEFSFLMPFQGGSDGYIR--VDENEPDTIGRVKITPALLANYERLLPLLTE-DQ-F 493 (742) T ss_pred eehhhccccccchheeccccccccceeeEEEeecCCcccccc--ccCCCcccccccccccccccchhHHHHhhhc-CC-C Confidence 01111111000000 000000000000000 000000000000000 012345555443 22 2 Q ss_pred EEEEEeccCCHHHHHHHHHHHhhcCCcEEEEEeccccchhH--HHHHHhhh--cceeEEEEeccCC-------CccHHHH Q lcl|NC_019451. 
231 SFLFAGAPLDNDQIKAVSAWNAAQNNQFIYTVATSLANLGT--LFTLVNGN--AGTALNVLSATAA-------NDFVEQC 299 (504) Q Consensus 231 ~~~~~~~~~~~~~~~a~A~w~e~~~~~~~~~~~~~d~~~~~--~~~~~~~~--~~~~~~~~~~~~~-------~~~~~aa 299 (504) .++.+......+...++.+.++...+++.... +.+..... ........ ..+...++++... ..-+.+. T Consensus 494 tILiAPG~t~~~v~aav~A~la~a~~Rl~vL~-D~P~~~tt~~~A~a~r~~~nSsraaly~PwVkv~d~~~~r~vPpSga 572 (742) T protein:vir:58 494 DLVLTPYLTFADHAGTVNAFINRAENRFLYLF-DIAGDDDTENLAISLAGYINSSFATTFFPWVRRLTNKGMRTVPASLA 572 (742) T ss_pred cEEEEcCCCchHHHHHHHHHHHhhcCCeEEEE-ecCCCCchHHHHHHHHhccCCceEEEEeceeeeccCCcceeechHHH Confidence 34444332233334456666666555544333 22211111 11111111 2222333332111 0123456 Q ss_pred HHHHHHhcCcCcCCceeeeccc-ccCccccccCCHHHHHHHHhCCCeEEEEEeeccceeeEEEcCEEe-CCccccchhhh Q lcl|NC_019451. 300 PSEILAATNYDEPGASQNYMYY-QFPGRNITVSDDTVANTVDKSRGNYIGVTQANGQQLAFYQRGILC-GGPTDAVDMNV 377 (504) Q Consensus 300 ~~g~~as~nf~~~~g~~T~kfk-~l~Gv~a~~lt~t~~~al~~~~~n~y~~~~~~~~~~~~~~~G~~~-~G~y~~~wiD~ 377 (504) ++|.++.+|.+ +|- |+-- ....+.....+++|.+.|..+++|+...+ + +.+ .++.+.++ +-.-+|.+|.+ T Consensus 573 IAGL~ARtD~e--rGv--w~SPANrgii~~~~~s~se~d~LN~~GINtIrsf-G--~G~-rlWGnRTlassDs~wryInV 644 (742) T protein:vir:58 573 AYRSIRTTDPE--TGL--APVGARRGVVTGEPVRQVDWEDLYNNRINPIVRV-G--NDV-LLFGQKTMLNVNSALNRINV 644 (742) T ss_pred HHHHHHHhccC--Cce--EecCCcceeeeccccchhhHHHHhhCCceEEEEC-C--CcE-EEEcceecCCCCcccceEee Confidence 77888888743 331 2211 11223344567899999999999998776 3 334 34677776 44556788999 Q ss_pred hhhHHHHHHHHHHHHHHHHhcCCCCCcCHHHHHHHHHHHHHHHHHHHhcCccccccccCcccceeeccccCCccccccee Q lcl|NC_019451. 378 YANEIWLKSAIAQALLDLFLNVNAVPASSTGEAMTLAVLQPVLDKATANGTFTYGKEISAVQQQYITQVTGDRRAWRQVQ 457 (504) Q Consensus 378 ~~~~dwl~~~lq~~l~~l~~~~~kIPyt~~G~~~l~~~v~~vl~~a~~nG~Ia~G~~~~~~q~~~i~~~~g~~~~~~~~~ 457 (504) .+-.+|+++.|+..+....-. |-|+.-...|+..|+.-|+.-+++|.|. 
T Consensus 645 RRlfd~Ie~SI~~a~q~~VfE----PNd~~L~~sIk~sInafL~~L~aqGALl--------------------------- 693 (742) T protein:vir:58 645 RRLLIVMRNRISQILSSYLFE----NNTSENRLRAEALVRQYLESLRLRGAVT--------------------------- 693 (742) T ss_pred hhhHHHHHHHHHHHHHHhccC----CCCHHHHHHHHHHHHHHHHHHHhCCcee--------------------------- Confidence 999999999999888764322 6788899999999999999999999986 Q ss_pred ecceEEEecchHhCCHHHHhhcccCceEEEEEECCeEEEEEeeeeeC Q lcl|NC_019451. 458 TLGYWINITFSSYTNSNTGLTEWKANYTLIYSKGDAIRFVEGSDVMI 504 (504) Q Consensus 458 ~~GY~v~~~~~~~~s~~dr~~R~~~~i~~~~~~aGAIh~v~i~~~~v 504 (504) ||.|.+. ++.+++|+.+-+. -+.+.+.-.-.++.|+++-+.. T Consensus 694 --GfrV~lD--etNTpeDI~~Gkl-vv~I~vAP~~PAEfI~lrf~it 735 (742) T protein:vir:58 694 --DYEVAID--SVTTPTDIDNNTL-RARVTVQPARSIEYIDITFVIT 735 (742) T ss_pred --eeEEEEc--CCCCHHHhhCCEE-EEEEEEEccCCcceEEEEEEEE Confidence 4777775 3577777765554 5777788888888888766655 No 48 >protein:vir:100323 Length: 393 # NCBI annotation: tail sheath protein FI # Family: family:all:115 # MgeID: mge:1484 # MgeName: phi-MhaA1-PHL101 # Cross-refs: genbank:acc:YP_655492;genbank:gi:109289960;genbank:GeneID:4157366 Probab=96.52 E-value=0.00055 Score=38.48 Aligned_cols=359 Identities=9% Similarity=0.041 Sum_probs=187.6 Q ss_pred CCCccceE-EEee---eecccccccccccceEEEecccc-----cCccceEEecCHHHHHHhcCCCcHHHHHHHHHhccC Q lcl|NC_019451. 1 MISQSRYI-RIIS---GVGAGAPVAGRKLILRVMTTNNV-----IPPGIVIEFDNANAVLSYFGAQSEEYQRAAAYFKFI 71 (504) Q Consensus 1 mip~s~iv-~V~~---~v~~~~~~~~~~~~~l~l~~~~~-----~~~~r~~~y~s~~~V~~~Fg~~s~ey~aA~~yF~~~ 71 (504) |-=.+.+. -|.+ .-.+.+........+.|++..+. +|.+.-...++..+-...||.....+.+-..+|.+. 
T Consensus 1 m~m~~~~~~GV~v~e~~~g~~~i~~~~tav~~~vgta~~~~~~~~pln~pv~i~s~~~~~~~~g~~g~L~~al~~~~~~~ 80 (393) T protein:vir:10 1 MSILDTYLHGVEVVEVNAGGVTISTAATSVIGVVCTGDQADAETFPLNTPVLITNPLNYLEKAGSTGTLRRTLNSIGSIV 80 (393) T ss_pred CCCCCccCCCeEEEEcCCCcceecccCcceeEEEeeccCcCcccccCccceEecchHHHHHhhCCccchhhhhhhhhccc Confidence 43333321 2222 22222233333455555554332 344333344666777788999888888888888873 Q ss_pred CCCCcccceEEEEeeeccCCcceeeecccchhhHhHhhccCceEEEEEcccceeeeeeccccccchHHHHHHHHhhhhcc Q lcl|NC_019451. 72 SKSVNSPSSISFARWVNTAIAPMVVGDNLPKTIADFAGFSAGVLTIMVGAAEQNITAIDTSAATSMDNVASIIQTEIRKN 151 (504) Q Consensus 72 ~~~~~~P~~l~igr~~~~a~~~~l~g~~~~~~~~~~~~~~~g~~titi~g~~~~~~~i~~s~ats~~~vA~~i~~~i~a~ 151 (504) -...++-+......+.. ... .+.|... ....+++..... +. T Consensus 81 ------~~~~~vv~v~~~~~~~~--------t~~------------~iig~~~---------~~~~tgl~al~~----~~ 121 (393) T protein:vir:10 81 ------KTPTVIVRVAESDDSDT--------LTA------------NIVGTQE---------NGKFTGIKALLT----AQ 121 (393) T ss_pred ------CceEEEeecccCccccc--------ccc------------ccccccc---------cchhhHHHHHHh----hh Confidence 34445555432111000 000 0011000 000111111110 00 Q ss_pred cccccceeEEEEecccceeEEecccCcceeEEEEeeccchhhhhhhhcccCcceeecccccccHHHHHHHHHhhccceeE Q lcl|NC_019451. 152 ADPQLAQATVTWNQNTNQFTLVGATIGTGVLAVAKSADPQDMSTALGWSTSNVVNVAGQAADLPDAAVAKSTNVSNNFGS 231 (504) Q Consensus 152 ~~~~~~~a~vt~d~~~~~F~its~t~ga~s~~~~~sa~~~~ia~~l~~t~~~~~~~~g~aaet~~~al~~~~~~~~~wy~ 231 (504) . .+... + .....++........++..+.+.-.+-+ T Consensus 122 ~----------------~~~~~----------------p------------~li~apg~~~~~~~~al~~~~~~~~~~~- 156 (393) T protein:vir:10 122 S----------------TVFVK----------------P------------KLLCVPQHDNQAVATELLSVAKKLNAFA- 156 (393) T ss_pred h----------------hccee----------------e------------eeeeeccccchHHHHHHHHHhhccCcEE- Confidence 0 00000 0 0000111112233444555444444322 Q ss_pred EEEEeccCCHHHHHHHHHHHhhcCCcEEEEEeccccchhHHHHHHhhhcceeEEEEeccCCCccHHHHHHHHHHhcCcCc Q lcl|NC_019451. 
232 FLFAGAPLDNDQIKAVSAWNAAQNNQFIYTVATSLANLGTLFTLVNGNAGTALNVLSATAANDFVEQCPSEILAATNYDE 311 (504) Q Consensus 232 ~~~~~~~~~~~~~~a~A~w~e~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~aa~~g~~as~nf~~ 311 (504) .+.+.+ +....++-+|-+..+..+..+++-.- .... .... .... ..+.+.++|.++.+|-. T Consensus 157 -~v~d~~--~~t~~~ai~~~~~~~s~~~~~~~P~~---~~~d----~~~~-~~~~-------~p~s~~~Ag~~a~~d~~- 217 (393) T protein:vir:10 157 -FISDNG--ATTKEQAYTYRQNFSQREGMMIFGDW---KSYN----TDKK-AYDT-------DYAVARACALQAYIDKT- 217 (393) T ss_pred -EEEcCC--CCCHHHHHHHhhhcCCceEEEEeccc---cccc----ccCC-ceeE-------eehhHHHHHHHHHhhcC- Confidence 222221 11223444666665555554443211 0000 0000 1111 12446777888887632 Q ss_pred CCc-eeeecccccCccccc--------cCCHHHHHHHHhCCCeEEEEEeeccceeeEEEcCEEeCCccccchhhhhhhHH Q lcl|NC_019451. 312 PGA-SQNYMYYQFPGRNIT--------VSDDTVANTVDKSRGNYIGVTQANGQQLAFYQRGILCGGPTDAVDMNVYANEI 382 (504) Q Consensus 312 ~~g-~~T~kfk~l~Gv~a~--------~lt~t~~~al~~~~~n~y~~~~~~~~~~~~~~~G~~~~G~y~~~wiD~~~~~d 382 (504) .| ...-..|.+.||..- .++++|++.|..+|+|+.. .+ ..+ .++.+.++++.-.|.+|-+.+-.+ T Consensus 218 -~G~~~spaN~~l~gi~~~~~~~~~~~~~~~~~~~~Ln~~gI~t~~--~~--~G~-~~wG~rT~s~d~~~~~i~vrR~~~ 291 (393) T protein:vir:10 218 -VGWHKNISNVELDGVTGITKAVEFDINESSTEANYLNEKGITICL--NH--NGF-RYWGSRTLATDTRWAFQQSVRTAQ 291 (393) T ss_pred -CCcEEccCCceeeceeecceecccccCCCcchhHhHhhcCceEEE--cC--CCE-EEEcccccCCCcccceeehhhHHH Confidence 22 233456667776532 2458899999999999873 23 334 346888888877788899999999 Q ss_pred HHHHHHHHHHHHHHhcCCCCCcCHHHHHHHHHHHHHHHHHHHhcC--ccccccccCcccceeeccccCCcccccceeecc Q lcl|NC_019451. 383 WLKSAIAQALLDLFLNVNAVPASSTGEAMTLAVLQPVLDKATANG--TFTYGKEISAVQQQYITQVTGDRRAWRQVQTLG 460 (504) Q Consensus 383 wl~~~lq~~l~~l~~~~~kIPyt~~G~~~l~~~v~~vl~~a~~nG--~Ia~G~~~~~~q~~~i~~~~g~~~~~~~~~~~G 460 (504) |+++.|+..+...+-. |.++.=+..++..++.-|+.-+++| .|. 
| T Consensus 292 ~i~~~i~~~~~~~v~e----~~~~~~~~~i~~~i~~~L~~l~~~g~~al~-----------------------------g 338 (393) T protein:vir:10 292 IIKETIGAGLAWAVDM----PLTPLRVKTMLEAINNKLRSWASGDDPRIL-----------------------------G 338 (393) T ss_pred HHHHHHHHHHHHhccC----CCCHHHHHHHHHHHHHHHHHHHhccccccc-----------------------------c Confidence 9999999888764422 5688888899999999998887766 232 3 Q ss_pred eEEEecchHhCCHHHHhhcccCceEEEEEECCeEEEEEeeeeeC Q lcl|NC_019451. 461 YWINITFSSYTNSNTGLTEWKANYTLIYSKGDAIRFVEGSDVMI 504 (504) Q Consensus 461 Y~v~~~~~~~~s~~dr~~R~~~~i~~~~~~aGAIh~v~i~~~~v 504 (504) |.+..+ ++.+++|..+-+. -+.+.+...-.++.|++..... T Consensus 339 ~~v~~~--~~nt~~~i~~G~~-~~~i~~~p~~p~e~I~~~~~~~ 379 (393) T protein:vir:10 339 ARVWVA--EEITADIIKSGKF-VIKYDYHWIPSLESLGLEQRVN 379 (393) T ss_pred ceEEec--CCCCHHHhhCCEE-EEEEEEEecCCcceEEEEEEEc Confidence 445443 3466777666544 4788888888999998888777 No 49 >protein:vir:80984 Length: 666 # NCBI annotation: gp18 tail sheath protein # Family: family:all:661 # MgeID: mge:1888 # MgeName: Phi1 # Cross-refs: genbank:acc:YP_001469499;genbank:gi:157311456;genbank:GeneID:5602117 Probab=96.49 E-value=0.00057 Score=38.37 Aligned_cols=456 Identities=13% Similarity=0.072 Sum_probs=213.8 Q ss_pred CCCccceEEEeeeecccccccccccceEEEecccccCccceEEecCHHHHHHhcCCC---cHHHHHHHHHhccCCCCCcc Q lcl|NC_019451. 
1 MISQSRYIRIISGVGAGAPVAGRKLILRVMTTNNVIPPGIVIEFDNANAVLSYFGAQ---SEEYQRAAAYFKFISKSVNS 77 (504) Q Consensus 1 mip~s~iv~V~~~v~~~~~~~~~~~~~l~l~~~~~~~~~r~~~y~s~~~V~~~Fg~~---s~ey~aA~~yF~~~~~~~~~ 77 (504) |--++-=|-|.--=.+..........+-|++....=|++..+.-+|..|....||.- +.++.++..+|-+.- T Consensus 1 ~~~~~Pgvyv~e~~~~~~~~~~~t~~~~~vg~~~~gp~~~p~~i~~~~~~~~~fg~~~~~~~~~~~~~~~f~~~g----- 75 (666) T protein:vir:80 1 MTLLSPGFETKETTLSTTIVQSATGRAALVGKFQWGPAFQIIQVTNEVELVNKFGQPDNNTADYFMSGANFLQYG----- 75 (666) T ss_pred CceecCceEEEEecCCccccccCcccceEEeccccCCCccceEecCHHHHHHhcCCccCccchHHHHHHHHhcCC----- Confidence 666665555432112233333445677788877766777778888999999999952 445566667776543 Q ss_pred cceEEEEeeeccCC--cceeeecccc-----------------hh----h-----------------------Hh----- Q lcl|NC_019451. 78 PSSISFARWVNTAI--APMVVGDNLP-----------------KT----I-----------------------AD----- 106 (504) Q Consensus 78 P~~l~igr~~~~a~--~~~l~g~~~~-----------------~~----~-----------------------~~----- 106 (504) +++||-|...... .+.-.++.+. .. . .. T Consensus 76 -~~~~v~R~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~i~~~~~~~~~~~~ta~~~~~a 154 (666) T protein:vir:80 76 -NDLRVVRVLNKEKAKNATALAGNIEFEITNEGSNYEVGDTIKIKHNRQDIETAGKVTKVDGDGKVKGVFIPTGKIIAHA 154 (666) T ss_pred -CeEEEEEecCccccccccccccceeEEEeeccccccccccccccccCcccccCcceEEEeecceeeeeecchhhhcccc Confidence 4689998753210 0000000000 00 0 00 Q ss_pred ----------------HhhccC-ceEEEEEcc----cceeeeeecc----------ccccchHHHH---HHHHhhhhccc Q lcl|NC_019451. 107 ----------------FAGFSA-GVLTIMVGA----AEQNITAIDT----------SAATSMDNVA---SIIQTEIRKNA 152 (504) Q Consensus 107 ----------------~~~~~~-g~~titi~g----~~~~~~~i~~----------s~ats~~~vA---~~i~~~i~a~~ 152 (504) +..... ....+.+.+ ........+. ...+...+.. +.......... 
T Consensus 155 ~~~~~~~~v~~~~~~~~~~~~~~~~~a~~V~~~~~~~~~~~~~~~~a~~~~t~~~~~~~~~~~~~~a~~a~~~g~~g~~l 234 (666) T protein:vir:80 155 KAIGVYPELDGDWTAEFTSSSGNGSAALSVTKIVTDSGLLLTDLETSRANITNQTFLTKLQKYDMPAVSAIYAGEIGNSL 234 (666) T ss_pred ccccccceeeccceeeeccccccceeeeeeeeeecCCccceeeeccccccccccccccccccccchhhhhhcccccccce Confidence 000000 000011100 0000000000 0000000000 10100000000 Q ss_pred ccc------------------------cceeEEEEe-cccceeEEecccCccee-EEEEeec------cchh--hhhhhh Q lcl|NC_019451. 153 DPQ------------------------LAQATVTWN-QNTNQFTLVGATIGTGV-LAVAKSA------DPQD--MSTALG 198 (504) Q Consensus 153 ~~~------------------------~~~a~vt~d-~~~~~F~its~t~ga~s-~~~~~sa------~~~~--ia~~l~ 198 (504) ... ......+.. .....|.++....+... ....... .+.. +..... T Consensus 235 ~v~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~e~~~~~~~~~~~~~~~~~~~~~~~~~ 314 (666) T protein:vir:80 235 EVEILARSAFKNTAPDLTMYPYGGERTAARNLIPYAPQNDNQYAFIVRRDGVVVESYVLSTLKGDKDVYGNSIYMDDFFG 314 (666) T ss_pred eeeeccccccccccccceeeeccccccccceeeeeccccccceeeEeccCCccceeeecccccccccccchhhhhhhhhc Confidence 000 000000000 00111221111111000 0000000 0000 000000 Q ss_pred ----------cc---cC-cce--eeccccc----------c------cHHHHHHHHHhhccceeEEEEEeccC-----CH Q lcl|NC_019451. 199 ----------WS---TS-NVV--NVAGQAA----------D------LPDAAVAKSTNVSNNFGSFLFAGAPL-----DN 241 (504) Q Consensus 199 ----------~t---~~-~~~--~~~g~aa----------e------t~~~al~~~~~~~~~wy~~~~~~~~~-----~~ 241 (504) .. .. ..+ ...|.+. . .....+-++.+. .++. ++++.... .. T Consensus 315 ~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~-~~~~-~l~~p~~~~~~~~~~ 392 (666) T protein:vir:80 315 RGSSQYIYATAQGWVDGFSGIISLAGGVSANEATTGGVGADPFIGAMMQGWGLFAERES-IHVN-LLIAGACAGEGDAFS 392 (666) T ss_pred cccceeeeecccccccccceEEEecCCCCcccccccccccccccccchhhhhhhhhhcc-cccc-eEeecCcCCcccchH Confidence 00 00 000 0001000 0 001112222222 2222 22222111 12 Q ss_pred HHHHHHHHHHhhcCCcEEEE-------Eecc-ccchhHHHHHHhh-----------hcceeEEEEeccC-----CC---c Q lcl|NC_019451. 
242 DQIKAVSAWNAAQNNQFIYT-------VATS-LANLGTLFTLVNG-----------NAGTALNVLSATA-----AN---D 294 (504) Q Consensus 242 ~~~~a~A~w~e~~~~~~~~~-------~~~~-d~~~~~~~~~~~~-----------~~~~~~~~~~~~~-----~~---~ 294 (504) .-..++...++....++.+. +... +.+.......... ...+...++++.. .. . T Consensus 393 ~v~~~~~~~~~~~~~~~~~~~~~~~~~vd~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~l~~p~~~~~d~~~~~~~~ 472 (666) T protein:vir:80 393 TVQKHAVSIGDERQDCLVMVSPPRSTVVNIPVTTAIDNLIAWREGSGNYNENNMNINTTYAVIDGNYKYQYDKYNDVNRW 472 (666) T ss_pred HHHHHHHHHHHhhcceEEEeecceeEEeecCCCCCHHHHHHHHHhcccchhhhcccCcceEEEEcCceEEecccCCceeE Confidence 23345666666655443222 1111 1111111111100 0112222222210 11 1 Q ss_pred -cHHHHHHHHHHhcCcCcCCc-eeeecccccCccc-----cccCCHHHHHHHHhCCCeEEEEEeeccceeeEEEcCEEeC Q lcl|NC_019451. 295 -FVEQCPSEILAATNYDEPGA-SQNYMYYQFPGRN-----ITVSDDTVANTVDKSRGNYIGVTQANGQQLAFYQRGILCG 367 (504) Q Consensus 295 -~~~aa~~g~~as~nf~~~~g-~~T~kfk~l~Gv~-----a~~lt~t~~~al~~~~~n~y~~~~~~~~~~~~~~~G~~~~ 367 (504) -|...++|.++.+|.++ | ......|.+.||. .-.+++.|.+.|..+|+|....+.+.|. .++.+++++ T Consensus 473 ~p~sg~~AGl~Ar~D~~~--g~~~sPan~~~~~i~g~~~~~~~~~~~~~~~Ln~~gIn~i~~~~g~G~---~~wG~rT~~ 547 (666) T protein:vir:80 473 VPLAADIAGLCARTDAVS--QPWMSPAGYNRGQIMNVVKLAIEPRKAHRDRLYQAAINPVIGAGGEGF---ILMGDKTAT 547 (666) T ss_pred echHHHHHHHHHHHhhcC--CceEccCCeecceeeccccceeecChhHHHhhhhCCeeEEEEeCCCeE---EEEccccCC Confidence 24566778888887433 3 1122345444442 2356899999999999999988876553 457888877 Q ss_pred Ccc-ccchhhhhhhHHHHHHHHHHHHHHHHhcCCCCCcCHHHHHHHHHHHHHHHHHHHhcCccccccccCcccceeeccc Q lcl|NC_019451. 368 GPT-DAVDMNVYANEIWLKSAIAQALLDLFLNVNAVPASSTGEAMTLAVLQPVLDKATANGTFTYGKEISAVQQQYITQV 446 (504) Q Consensus 368 G~y-~~~wiD~~~~~dwl~~~lq~~l~~l~~~~~kIPyt~~G~~~l~~~v~~vl~~a~~nG~Ia~G~~~~~~q~~~i~~~ 446 (504) +.- +|.+|-+.+-.+|+++.|+...+-.+-. |.|+.=...|+..|+.-|++-+++|.|. 
T Consensus 548 ~~~s~~~~i~vRRl~~~i~~si~~~~~~~v~e----pn~~~l~~~i~~~i~~~L~~l~~~gal~---------------- 607 (666) T protein:vir:80 548 TVPSPFDRINVRRLFNMLKKNIGDSSKYKLFE----NNDNFTRASFRMEVSQYLSTIRSLGGIY---------------- 607 (666) T ss_pred CCCcccceeehhhHHHHHHHHHHHHHHHhccC----CCCHHHHHHHHHHHHHHHHHHHhcCcee---------------- Confidence 753 6788999999999999999988874332 4678888999999999999999999996 Q ss_pred cCCcccccceeecceEEEecchHhCCHHHHhhcccCceEEEEEECCeEEEEEeeeeeC Q lcl|NC_019451. 447 TGDRRAWRQVQTLGYWINITFSSYTNSNTGLTEWKANYTLIYSKGDAIRFVEGSDVMI 504 (504) Q Consensus 447 ~g~~~~~~~~~~~GY~v~~~~~~~~s~~dr~~R~~~~i~~~~~~aGAIh~v~i~~~~v 504 (504) ||.+.++ .++.+++|+.+.+. .+.+.++---.++.|+++-+-. T Consensus 608 -------------g~~V~~d-~~~nt~~di~~G~~-~~~i~~~P~~Pae~I~~~~~~~ 650 (666) T protein:vir:80 608 -------------DFRVQCD-TTNNTPDVIDRNEF-VASMFIKPAKSINYIMLNFTAV 650 (666) T ss_pred -------------eeEEEEc-CCCCCHHHhhCCeE-EEEEEEEecCCcceEEEEEEEe Confidence 5889887 57888889888877 5999999999999999986655 No 50 >protein:vir:100539 Length: 663 # NCBI annotation: gp18 tail sheath monomer # Family: family:all:661 # MgeID: mge:1488 # MgeName: 25 # Cross-refs: genbank:acc:YP_656380;genbank:gi:109290131;genbank:GeneID:4156517 Probab=96.42 E-value=0.00064 Score=38.10 Aligned_cols=457 Identities=11% Similarity=0.008 Sum_probs=213.2 Q ss_pred CCCccceEEEeeeecccccccccccceEEEecccccCccceEEecCHHHHHHhcCC---CcHHHHHHHHHhccCCCCCcc Q lcl|NC_019451. 1 MISQSRYIRIISGVGAGAPVAGRKLILRVMTTNNVIPPGIVIEFDNANAVLSYFGA---QSEEYQRAAAYFKFISKSVNS 77 (504) Q Consensus 1 mip~s~iv~V~~~v~~~~~~~~~~~~~l~l~~~~~~~~~r~~~y~s~~~V~~~Fg~---~s~ey~aA~~yF~~~~~~~~~ 77 (504) |--++-=|-|.--=.+.......-..+.|++....=|++....-+|..|..+.||. 
.+.++.+...||-+- T Consensus 1 ~~~~~Pgvyv~e~~~~~~~~~v~t~~~~fvG~~~~gp~~~p~~i~s~~~~~~~fG~~~~~~~~~~~v~~~f~ng------ 74 (663) T protein:vir:10 1 MALLSPGIEMKETSINSTVVRSATGRAALVGKFAWGPAYEIRQVTNEVELVDMFGSPDNVTAPYFMSAMNFLQY------ 74 (663) T ss_pred CccccCceEEEEecCcccccccccccceeeeccccCCCCcCEEecCHHHHHHHcCCcccccchHHHHHHHHHhC------ Confidence 66666555553211222333444567788888777777777788889999999996 367789999999874 Q ss_pred cceEEEEeeeccC---Ccceeeec-ccchhhHhHhhccC-ceEEE----------------EEcccceeeeeecccc--- Q lcl|NC_019451. 78 PSSISFARWVNTA---IAPMVVGD-NLPKTIADFAGFSA-GVLTI----------------MVGAAEQNITAIDTSA--- 133 (504) Q Consensus 78 P~~l~igr~~~~a---~~~~l~g~-~~~~~~~~~~~~~~-g~~ti----------------ti~g~~~~~~~i~~s~--- 133 (504) -+++||-|..... .+.-|.++ +... ...-....- ..+.+ ...|..+.+ .++... T Consensus 75 g~~~~vvRv~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~g~~~~v~~~~~~~~~~~~~~~~~~~g~~~~~-~~~~a~~~~ 152 (663) T protein:vir:10 75 GNDLRLVRVIDMEQAKNASPLFNQIEVTI-TTEGQGYTVGDTVSIKHNTTTVTEEGKVTKVDADGKIKAL-FVPSSAVIA 152 (663) T ss_pred CCeEEEEecCCcccccccccccccceeeE-eecccCccccceeeecccccccccCcceeeeccCCceeEE-Eeccccccc Confidence 4689999986431 11111111 0000 000000000 01111 111111110 000000 Q ss_pred -ccchHHHHHHHHhh---h-hcccccccc-eeEEEEe--------------cc-cceeEEecccCcceeEE--------- Q lcl|NC_019451. 134 -ATSMDNVASIIQTE---I-RKNADPQLA-QATVTWN--------------QN-TNQFTLVGATIGTGVLA--------- 183 (504) Q Consensus 134 -ats~~~vA~~i~~~---i-~a~~~~~~~-~a~vt~d--------------~~-~~~F~its~t~ga~s~~--------- 183 (504) +............. + ......... .+.-.++ .. ...+............. 
T Consensus 153 ~a~~~~~~~~~~~a~~~~v~~~~~~~~~a~av~~i~~dg~vt~~~~~~a~~~~~~~~~~~~~~~~~~~~~~a~~~g~~G~ 232 (663) T protein:vir:10 153 KAKQLGTYPVLGDNWRAEVSGASGGSAATLTLGGIVVDSGVTFGNSEEAPDVMTSTKVLANFAKYGMPLISAVYPGEIGS 232 (663) T ss_pred cccccccccccccceeeEEeeccccccccceeEeeecCCceeEEeeeccccccccceeeeeccccccceeeeecccccCc Confidence 00000000000000 0 000000000 0000000 00 00111111100000000 Q ss_pred ----EEeecc-------------ch-----------------h-----------------hhhh------------h--h Q lcl|NC_019451. 184 ----VAKSAD-------------PQ-----------------D-----------------MSTA------------L--G 198 (504) Q Consensus 184 ----~~~sa~-------------~~-----------------~-----------------ia~~------------l--~ 198 (504) ...... +. + ++.. + - T Consensus 233 ~i~v~~~~~~~~~~~~~~~v~~~~g~~~~~~~~~~~~g~~~~~~~~~~~~~~g~~~e~~~ls~~~~~~~~~~~~~~~~~~ 312 (663) T protein:vir:10 233 TVEVEVISKTAFQSGAAQPIYPFGGTRASNARSVIQYGPMTDDQFAIIVRRDGIVVESTVLSTRRGDRDVYGNNIFMDDY 312 (663) T ss_pred ceeEeecccccccccceeeecccCcccccccccccccccccchhhcccccCCCcccceeeeeccccccccchhhhhhhhh Confidence 000000 00 0 0000 0 0 Q ss_pred cccC--cce---------------e-eccccc------ccHHHHHHHHHhh-ccceeEEEEEeccC-CHH----HHHHHH Q lcl|NC_019451. 199 WSTS--NVV---------------N-VAGQAA------DLPDAAVAKSTNV-SNNFGSFLFAGAPL-DND----QIKAVS 248 (504) Q Consensus 199 ~t~~--~~~---------------~-~~g~aa------et~~~al~~~~~~-~~~wy~~~~~~~~~-~~~----~~~a~A 248 (504) +..+ ..+ . ..|.+. .+...+++.+.+. ..+...++...... ..+ -..++. T Consensus 313 ~~~~~s~~v~~~~~~~~~~~~~~~~l~gg~~~~~~~~~~d~~~~~~~l~~~~~~~~~~~i~~~~~~~~~~~~~~v~~~l~ 392 (663) T protein:vir:10 313 FRNGSSNFIYASSVNWPAGFTGIIQLGGGASANNAVGSDELIAGWDLFADREALHVNLMIAGACKSDGVAVASTVQKHVV 392 (663) T ss_pred hcCcccceeEeeccccCcccceeEEecccccCcccchhhhhhhHHhhhccccccCceEEEeecCCCCchhhHHHHHHHHH Confidence 0000 000 0 000000 0011112222221 11233322211111 111 123344 Q ss_pred HHHhhcCCcEEEEEecc-c--cch---------hHHHHHH----------h-hhcceeEEEEeccC-----CC----ccH Q lcl|NC_019451. 
249 AWNAAQNNQFIYTVATS-L--ANL---------GTLFTLV----------N-GNAGTALNVLSATA-----AN----DFV 296 (504) Q Consensus 249 ~w~e~~~~~~~~~~~~~-d--~~~---------~~~~~~~----------~-~~~~~~~~~~~~~~-----~~----~~~ 296 (504) ..+|....++. ++... . ... ..+...+ . ....+...++++.. .+ --| T Consensus 393 ~~~~~~~~~~a-i~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~l~~p~~~~~d~~~~~~~~~p~ 471 (663) T protein:vir:10 393 ALADDRQDCVA-FVNPPSELLVGVPTTQAVKNIVEWRNGVTTGGEVVDNNMNISSTYAFISGNYKYQYDKYNDINRWVPL 471 (663) T ss_pred HHHHhhCCEEE-EEecCcccccccchhhhHHHHHHHhhhccccchhhhhhcccCcceEEEEecceeEecccCCceEEech Confidence 44444333332 22111 0 000 0000000 0 01112223333211 01 124 Q ss_pred HHHHHHHHHhcCcCcCCceeeecccccCccc-----cccCCHHHHHHHHhCCCeEEEEEeeccceeeEEEcCEEeCCc-c Q lcl|NC_019451. 297 EQCPSEILAATNYDEPGASQNYMYYQFPGRN-----ITVSDDTVANTVDKSRGNYIGVTQANGQQLAFYQRGILCGGP-T 370 (504) Q Consensus 297 ~aa~~g~~as~nf~~~~g~~T~kfk~l~Gv~-----a~~lt~t~~~al~~~~~n~y~~~~~~~~~~~~~~~G~~~~G~-y 370 (504) .+.++|.++.+|.++. =......|.+.||. ...+++.|.+.|..+|+|....+-+. +.+ .++..+++++. . T Consensus 472 s~~vAGl~Ar~D~~~g-~~~span~~~~~i~g~~~~~~~~~~~~~~~Ln~~gin~i~~~~~~-~G~-~~wG~rT~s~~~s 548 (663) T protein:vir:10 472 SADIAGLCAYTDQVGH-PWMSPAGYRRGQLRNTIKLAIEPKQSLRDTMYQVSINPVTGFAGG-DGF-VLFGDKMATQVPS 548 (663) T ss_pred HHHHHHHHHHhhccCC-cEEccCCeeecceeccccceeecCchhHHHHHhCCCcEEEEeeCC-CcE-EEEcccccCCCCc Confidence 5777788888874331 12223334444443 23578899999999999998877542 122 45677777765 2 Q ss_pred ccchhhhhhhHHHHHHHHHHHHHHHHhcCCCCCcCHHHHHHHHHHHHHHHHHHHhcCccccccccCcccceeeccccCCc Q lcl|NC_019451. 371 DAVDMNVYANEIWLKSAIAQALLDLFLNVNAVPASSTGEAMTLAVLQPVLDKATANGTFTYGKEISAVQQQYITQVTGDR 450 (504) Q Consensus 371 ~~~wiD~~~~~dwl~~~lq~~l~~l~~~~~kIPyt~~G~~~l~~~v~~vl~~a~~nG~Ia~G~~~~~~q~~~i~~~~g~~ 450 (504) +|.+|-+.+-.+|+++.|+..+....-. |-++.-...|+..|+.-|++-+++|.|. 
T Consensus 549 ~~~~i~vrR~~~~i~~si~~~~~~~v~e----pn~~~l~~~i~~~i~~~L~~l~~~gal~-------------------- 604 (663) T protein:vir:10 549 PFDRINVRRLFNMLKKNIGDTSKYELFE----NNDAFTRQSFRMEVSQYLDNIRSLGGVY-------------------- 604 (663) T ss_pred ccceEehhhHHHHHHHHHHHHHHHhccC----CCCHHHHHHHHHHHHHHHHHHHhCCcee-------------------- Confidence 5788999999999999999888763322 5788899999999999999999999996 Q ss_pred ccccceeecceEEEecchHhCCHHHHhhcccCceEEEEEECCeEEEEEeeeeeC Q lcl|NC_019451. 451 RAWRQVQTLGYWINITFSSYTNSNTGLTEWKANYTLIYSKGDAIRFVEGSDVMI 504 (504) Q Consensus 451 ~~~~~~~~~GY~v~~~~~~~~s~~dr~~R~~~~i~~~~~~aGAIh~v~i~~~~v 504 (504) ||.|.++ .+..+++|..+-+. -+.+.++-.-.++.|+++-..+ T Consensus 605 ---------gf~V~~d-~~~nt~~~i~~G~~-~~~i~~~p~~pae~I~~~~~~~ 647 (663) T protein:vir:10 605 ---------DFRVVCD-TTNNTPQVIDSNEF-VATIYIKAPRSINYITLNFVAT 647 (663) T ss_pred ---------eeEEEEc-CCCCCHHHhhCCeE-EEEEEEEecCCcceEEEEEEEE Confidence 5888887 57888888887777 5999999999999999886666 No 51 >protein:vir:101804 Length: 663 # NCBI annotation: gp18 # Family: family:all:661 # MgeID: mge:1580 # MgeName: 31 # Cross-refs: genbank:acc:YP_238881;genbank:gi:66391956;genbank:GeneID:3416631 Probab=96.01 E-value=0.0011 Score=36.76 Aligned_cols=460 Identities=13% Similarity=0.031 Sum_probs=214.6 Q ss_pred CCCccceEEEeeeecccccccccccceEEEecccccCccceEEecCHHHHHHhcCCC---cHHHHHHHHHhccCCCCCcc Q lcl|NC_019451. 
1 MISQSRYIRIISGVGAGAPVAGRKLILRVMTTNNVIPPGIVIEFDNANAVLSYFGAQ---SEEYQRAAAYFKFISKSVNS 77 (504) Q Consensus 1 mip~s~iv~V~~~v~~~~~~~~~~~~~l~l~~~~~~~~~r~~~y~s~~~V~~~Fg~~---s~ey~aA~~yF~~~~~~~~~ 77 (504) |--++-=|-|.--=.+..........+-|++.-..=|++..+.-+|..|..+.||.- +.++.+...||-+- T Consensus 1 ~~~~~Pgvyv~e~~~~~~i~~~~t~~~~~vG~~~~Gp~~~p~~v~~~~~~~~~fg~~~~~~~~~~~~~~~f~ng------ 74 (663) T protein:vir:10 1 MALLSPGIEMKETSINSTVVRSATGRAAIVGKFAWGPAYEVRQVTNEVELVDMFGSPDNVTAPYFMSAMNFLQY------ 74 (663) T ss_pred CceecCceEEEEecCCccccccCcccceeEeecccCCCCccEEecCHHHHHHhcCCcCCcchhHHHHHHHHHhC------ Confidence 666666555532113444555556778888887777777777778899999999864 45677788888763 Q ss_pred cceEEEEeeeccC---Ccceeeecc----------------cchhhHh--------Hhhcc-Cc-eEEEEEccc-----c Q lcl|NC_019451. 78 PSSISFARWVNTA---IAPMVVGDN----------------LPKTIAD--------FAGFS-AG-VLTIMVGAA-----E 123 (504) Q Consensus 78 P~~l~igr~~~~a---~~~~l~g~~----------------~~~~~~~--------~~~~~-~g-~~titi~g~-----~ 123 (504) =+++||-|..... .+..+.++. +...... ...+. ++ ...+.+... . T Consensus 75 g~~~~vvRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~g~~i~v~~~~~~~~~~~~~~~~~~~~~~~~v~~~ta~~~~~~ 154 (663) T protein:vir:10 75 GNDLRLVRVIDMEKAKNASPLVNQVSVTITTEGQGYTVGDAITVKYNNATITEAGKVTAVDSDGKIKSLFVPTAEIIAKT 154 (663) T ss_pred CCeEEEEEccCCccccccccccccceeEEeecccccccccccccccccccccccccceeeecccceEEEeeccccccccc Confidence 4578999985321 111111100 0000000 00000 00 000100000 0 Q ss_pred eee---------eeeccccccchHHHHHHHHhhhhcc-----------------c---------ccc-------c--cee Q lcl|NC_019451. 124 QNI---------TAIDTSAATSMDNVASIIQTEIRKN-----------------A---------DPQ-------L--AQA 159 (504) Q Consensus 124 ~~~---------~~i~~s~ats~~~vA~~i~~~i~a~-----------------~---------~~~-------~--~~a 159 (504) ..+ ..+.++........+..+...+... . .+. . ... 
T Consensus 155 ~~v~~~~~~~~~~~~~~s~~s~~~~~a~~v~~v~~d~~~~v~~~~~a~~~~t~~~~~~~~~~~~~~~i~A~~~G~~Gn~i 234 (663) T protein:vir:10 155 RQLGTYPTLGDNWRIDVSGASGGSAAALALGNIVVDSGVTFGNSEDAPAVMTSPAVMEKYAKFGMPLISAVYPGEIGSTV 234 (663) T ss_pred cccccceeeccceeeEeeeccCccccccccceeccccceEEeeccccccccccccccccccccccceEEeccCCccccee Confidence 000 0000000000000000000000000 0 000 0 000 Q ss_pred EEEEec--------------------------------ccceeEEecccCcceeEEEEeec-------cchhhhhhhhcc Q lcl|NC_019451. 160 TVTWNQ--------------------------------NTNQFTLVGATIGTGVLAVAKSA-------DPQDMSTALGWS 200 (504) Q Consensus 160 ~vt~d~--------------------------------~~~~F~its~t~ga~s~~~~~sa-------~~~~ia~~l~~t 200 (504) .+.... ..+.|.+.-...+........+. .+......-.+. T Consensus 235 ~V~i~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~~~~~~~~~~~~~~ 314 (663) T protein:vir:10 235 EVEIVSKTAFNSGAQQTIYPFGGTRTSNARGVIQYGPMTDDQFAIIVRRDGIVVESTVLSTRKGDRDVYGSNIFMDDYFR 314 (663) T ss_pred eeeeccccccccccccceecccccccccccceeecccccccceeeEeecCCcceeeecccccccccccccchhhhhhhhc Confidence 000000 00000000000000000000000 000000000000 Q ss_pred cCc-----------------ceee-cccc------cccHHHHHHHHHhhccceeEEEEEeccC--CHH----HHHHHHHH Q lcl|NC_019451. 201 TSN-----------------VVNV-AGQA------ADLPDAAVAKSTNVSNNFGSFLFAGAPL--DND----QIKAVSAW 250 (504) Q Consensus 201 ~~~-----------------~~~~-~g~a------aet~~~al~~~~~~~~~wy~~~~~~~~~--~~~----~~~a~A~w 250 (504) .+. .... .|.+ ..+...+++.+.+...-.-.++++.... ..+ -..++... T Consensus 315 ~~~~~~~~~~~~~~~~~~~~~~~l~gg~d~~~~~~~~~~~~~~~~l~~~~~~~~~~~i~~~~~~~~~~~~~~v~~~l~~~ 394 (663) T protein:vir:10 315 NGGSNFIFASSEGWPAGFTGIIQLGGGTSANADVGADELIKGWDLFSDREALHVNLMIAGACGSDGAEIASTVQKYVVSL 394 (663) T ss_pred CCcceEEEEeecccCccccceeEeccccCCccccchhhhHHHHHhhhcccccceeEEEeccCCCCchhhHHHHHHHHHHH Confidence 000 0011 1111 1123334444444322112223332211 111 23345556 Q ss_pred HhhcCCcEEEEEeccc-------cchhHHHHH-Hh--------------hhcceeEEEEeccC-----CC----ccHHHH Q lcl|NC_019451. 
251 NAAQNNQFIYTVATSL-------ANLGTLFTL-VN--------------GNAGTALNVLSATA-----AN----DFVEQC 299 (504) Q Consensus 251 ~e~~~~~~~~~~~~~d-------~~~~~~~~~-~~--------------~~~~~~~~~~~~~~-----~~----~~~~aa 299 (504) ++....++.+.-.-.. ......... .. ....+...++++.. .+ .-|.+. T Consensus 395 a~~~~~~~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~sg~ 474 (663) T protein:vir:10 395 ADDRQDCVAIVNPPAELMVGIPTSTAVKNIVEWRNGMTGSGEVVDNNMNISSTYAFIIGNYKYQYDKYNDINRWVPLAAD 474 (663) T ss_pred HHhhCCEEEEEecCcccccccccccchHHHHHHHHhccccccchhhhcccCccceEEEcCceEEecccCCceEEechhHH Confidence 6654444333211000 000000000 00 01112223332210 11 124567 Q ss_pred HHHHHHhcCcCcCCceeeecccccC---ccc--cccCCHHHHHHHHhCCCeEEEEEeeccceeeEEEcCEEeCCc-cccc Q lcl|NC_019451. 300 PSEILAATNYDEPGASQNYMYYQFP---GRN--ITVSDDTVANTVDKSRGNYIGVTQANGQQLAFYQRGILCGGP-TDAV 373 (504) Q Consensus 300 ~~g~~as~nf~~~~g~~T~kfk~l~---Gv~--a~~lt~t~~~al~~~~~n~y~~~~~~~~~~~~~~~G~~~~G~-y~~~ 373 (504) ++|.++.+|.++.+ ......|.+. |+. ...+++.|.+.|..+|+|+...+-+. +.+ .++...++++. .+|. T Consensus 475 vAGl~Ar~D~~~g~-~~sPan~~~~~i~g~~~~~~~~~~~e~~~Ln~~gin~i~~~~~~-~G~-~~wG~rT~~~~~s~~~ 551 (663) T protein:vir:10 475 IAGLCAYTDQVSHP-WMSPAGYRRGQIRNCIKLAIEPKQSMRDTMYQVAINPVTGFAGG-DGF-VLFGDKMATQVPSPFD 551 (663) T ss_pred HHHHHHHhhccCCc-eEccCCceeccccccccceeecChhHHHHHhhCCceEEEEEeCC-CcE-EEEcccccCCCCcccc Confidence 77888888754421 1223344433 332 34578999999999999998777542 122 34677777665 2678 Q ss_pred hhhhhhhHHHHHHHHHHHHHHHHhcCCCCCcCHHHHHHHHHHHHHHHHHHHhcCccccccccCcccceeeccccCCcccc Q lcl|NC_019451. 374 DMNVYANEIWLKSAIAQALLDLFLNVNAVPASSTGEAMTLAVLQPVLDKATANGTFTYGKEISAVQQQYITQVTGDRRAW 453 (504) Q Consensus 374 wiD~~~~~dwl~~~lq~~l~~l~~~~~kIPyt~~G~~~l~~~v~~vl~~a~~nG~Ia~G~~~~~~q~~~i~~~~g~~~~~ 453 (504) +|-+.+-.+|+.+.|+.......-. |-|+.=...|+..|+.-|++-+++|.|. 
T Consensus 552 ~i~vrR~~~~i~~si~~~~~~~v~e----pn~~~l~~~i~~~i~~~L~~l~~~gal~----------------------- 604 (663) T protein:vir:10 552 RINVRRLFNMLKKNIGDTSKYELFE----NNDAFTRQSFRMETSQYLDGIRSLGGCY----------------------- 604 (663) T ss_pred eEehhhHHHHHHHHHHHHHHHhccC----CCCHHHHHHHHHHHHHHHHHHHhCCcee----------------------- Confidence 8999999999999999988773322 5788889999999999999999999996 Q ss_pred cceeecceEEEecchHhCCHHHHhhcccCceEEEEEECCeEEEEEeeeeeC Q lcl|NC_019451. 454 RQVQTLGYWINITFSSYTNSNTGLTEWKANYTLIYSKGDAIRFVEGSDVMI 504 (504) Q Consensus 454 ~~~~~~GY~v~~~~~~~~s~~dr~~R~~~~i~~~~~~aGAIh~v~i~~~~v 504 (504) ||.+.++ .++.+++|..+-+. .+.+.++-.-.+++|.++-... T Consensus 605 ------g~~v~~d-~~~nt~~~i~~G~~-~~~i~~~p~~pae~i~~~~~~~ 647 (663) T protein:vir:10 605 ------DFRVVCD-TTNNTPNVIDRNEF-VGTIYVKPPRSINYITLNMVAT 647 (663) T ss_pred ------eeEEEEc-CCCCCHHHhhCCeE-EEEEEEEecCCcceEEEEEEEe Confidence 5889987 57888888887777 5999999999999998875544 No 52 >protein:vir:107310 Length: 581 # NCBI annotation: gp123 # Family: family:all:1196 # MgeID: mge:1550 # MgeName: Catera # Cross-refs: genbank:acc:YP_656131;genbank:gi:109393333;genbank:GeneID:4156791 Probab=95.68 E-value=0.0016 Score=35.88 Aligned_cols=417 Identities=10% Similarity=0.030 Sum_probs=168.1 Q ss_pred CCCccceEEEeeeecccccccccccceEEEecccccCccceEEecCHHHHHHhcCCCcHHHHHHHHHhccCCCCCcccce Q lcl|NC_019451. 1 MISQSRYIRIISGVGAGAPVAGRKLILRVMTTNNVIPPGIVIEFDNANAVLSYFGAQSEEYQRAAAYFKFISKSVNSPSS 80 (504) Q Consensus 1 mip~s~iv~V~~~v~~~~~~~~~~~~~l~l~~~~~~~~~r~~~y~s~~~V~~~Fg~~s~ey~aA~~yF~~~~~~~~~P~~ 80 (504) .-.|+.|--..+.|...++.. ...-|.+.-..++. ++...-|.. .| . 
T Consensus 100 L~~L~~i~~~~v~v~g~~g~~---~~VtF~g~~~~l~~----------~~~~lt~g~-------------------~~-~ 146 (581) T protein:vir:10 100 LRALPNVEDDEVTVLGDPGGP---WTVTFTKAVAALTK----------DVTGLTGGD-------------------DP-D 146 (581) T ss_pred HhccCCCCcceEEEECCCCce---EEEEEcCCccceee----------eeceecCCC-------------------ce-e Confidence 223344422222222222111 11111111111110 000000000 01 2 Q ss_pred EEEEeeeccCCcceeeecc---cchhhHhHhhccCceEEEEEcccceeeeeecccc--ccc-hHH---HHHHHHhhhhcc Q lcl|NC_019451. 81 ISFARWVNTAIAPMVVGDN---LPKTIADFAGFSAGVLTIMVGAAEQNITAIDTSA--ATS-MDN---VASIIQTEIRKN 151 (504) Q Consensus 81 l~igr~~~~a~~~~l~g~~---~~~~~~~~~~~~~g~~titi~g~~~~~~~i~~s~--ats-~~~---vA~~i~~~i~a~ 151 (504) |-+++-.+.. .+.....+ +...+..+.....|. +.+.|....+...++.. .++ -.| +...+.+.+... T Consensus 147 vtV~~~~~g~-~~~~~~~s~~gi~~~~~~l~~~~~~~--~~~~gsd~~~~~~~~~~~~~~~~~~D~~t~~~~~~g~~~~~ 223 (581) T protein:vir:10 147 LNIASEQTGV-PAMNRALAKKGIKTDTIRVVNPNSGQ--VYVLGTDYVVTRVNAGEDGEANTRDDLYTIQRVVDGGHIDP 223 (581) T ss_pred EEEeccccCc-ccccccccccccccccccccccccCc--ceeccccceeeecccCccccccccccceeeeeeeccccccc Confidence 2333222111 00000000 011111111111121 22333333333322211 111 001 111111111000 Q ss_pred cccccceeEEEEecccce--eEEecccCcc-----eeEEEEeeccchhhhhhhhcccC-cceeeccc-------ccccHH Q lcl|NC_019451. 152 ADPQLAQATVTWNQNTNQ--FTLVGATIGT-----GVLAVAKSADPQDMSTALGWSTS-NVVNVAGQ-------AADLPD 216 (504) Q Consensus 152 ~~~~~~~a~vt~d~~~~~--F~its~t~ga-----~s~~~~~sa~~~~ia~~l~~t~~-~~~~~~g~-------aaet~~ 216 (504) ... .....-|...+.. ..++....-. .................+-++.+ ......+. ..+... 
T Consensus 224 ~~v--~~~~~~~~d~~~~~~v~~~~~~~~~~~~~~~~~~~g~~~~~~t~~~~~~~tn~~~~~l~~gvd~~g~tvt~~dy~ 301 (581) T protein:vir:10 224 GDI--VQLSYRYTDPNYHEVIRFTDPDDIQDFYGPAFDEAGNVQSEITLCAQLAITNGASTILACAVDPEGDTVTMGDYQ 301 (581) T ss_pred ceE--EEEEEEeecCCcceeEEeecCcchhhhhhhhhhccCccccchhhhheeeeecccceeEEeeccCCCCccchHHHH Confidence 000 0001111111110 0110000000 00000000000000011111111 11112222 223567 Q ss_pred HHHHHHHhhccceeEEEEEeccCCHHHHHHHHHHHhhcC---C-cEE-EEEe-ccc-cchhHHHHHHhhhcceeEEEEec Q lcl|NC_019451. 217 AAVAKSTNVSNNFGSFLFAGAPLDNDQIKAVSAWNAAQN---N-QFI-YTVA-TSL-ANLGTLFTLVNGNAGTALNVLSA 289 (504) Q Consensus 217 ~al~~~~~~~~~wy~~~~~~~~~~~~~~~a~A~w~e~~~---~-~~~-~~~~-~~d-~~~~~~~~~~~~~~~~~~~~~~~ 289 (504) ++|+++.++. ...+++... .+.+-+.++.+|++... + ++. ..+- ... ................++.++.+ T Consensus 302 ~Al~ale~~~--~~~ivv~~t-~~~~v~a~l~ahv~~~s~~~~~~ravigV~g~~~~~~~~~~~~~a~~~n~~Rvvlv~p 378 (581) T protein:vir:10 302 NALNKFRDED--EIAIIVAGT-GAQPIQALVQQHVSAQSNNKYERRAILGMDGSVTPVPSATRIANAQSIKDQRVALISP 378 (581) T ss_pred HHHHHHhcCC--ceEEEEeCC-CCHHHHHHHHHHHHHHHhccCCcEEEEEecCCCCCccHHHHHHhhccCCCceEEEEec Confidence 8888887753 222333332 23333456888886542 2 222 2221 111 12222222222223344443332 Q ss_pred c--------------CCCccHHHHHHHHHHhcCcCcCCceeeecccccCccc--cccCCHHHHHHHHhCCCeEEEEEeec Q lcl|NC_019451. 290 T--------------AANDFVEQCPSEILAATNYDEPGASQNYMYYQFPGRN--ITVSDDTVANTVDKSRGNYIGVTQAN 353 (504) Q Consensus 290 ~--------------~~~~~~~aa~~g~~as~nf~~~~g~~T~kfk~l~Gv~--a~~lt~t~~~al~~~~~n~y~~~~~~ 353 (504) . .+..+.++.+.|..+..+. ...+-||.++|+. ...++.+|++.|..+|++.+....+. 
T Consensus 379 ~~~~~~g~~~~~~v~lp~y~~AA~vAGl~a~~~~-----~~slT~~~i~gi~~l~~~~s~~e~e~ll~~Gv~~l~~~~~~ 453 (581) T protein:vir:10 379 SSFVYYAPELNREVVLGGQFMAAAVAGKSVSAIA-----AMPLTRKVIRGFSGPAEVQRDGEKSRESSEGLMVIEKTPRN 453 (581) T ss_pred CceeecCcccCceeccchhhHHHHHHHHhhcccc-----ccCcccccccccccccccCCHHHHHHHHhCCeEEEEEecCC Confidence 1 1112234555566655443 3456788888886 44789999999999999999765443 Q ss_pred cceeeEEEcCEEe-CCccccchhhhhhhHHHHHHHHHHHHHH-HHhcCCCCCcCHHHHHHHHHHHHHHHHHHHhcCcccc Q lcl|NC_019451. 354 GQQLAFYQRGILC-GGPTDAVDMNVYANEIWLKSAIAQALLD-LFLNVNAVPASSTGEAMTLAVLQPVLDKATANGTFTY 431 (504) Q Consensus 354 ~~~~~~~~~G~~~-~G~y~~~wiD~~~~~dwl~~~lq~~l~~-l~~~~~kIPyt~~G~~~l~~~v~~vl~~a~~nG~Ia~ 431 (504) + . .+.+|+.+ .-.-+|+-|.+++-.|.|...+++.+.. .|.. | |=++.|...|++.+++.|++-.++|+|.. T Consensus 454 ~--v-~Iv~gItT~~s~~~~~~i~~iR~~D~v~~~ir~~~~~~~fIG--~-~n~~~~r~~ik~~i~~~L~~l~~~g~I~~ 527 (581) T protein:vir:10 454 L--V-HVRHGVTTDPTSLHTREWNIIGQQDVMVYRIRDYLDADGLIG--M-PIYDTTIVQVKASAEAALVWLVDNNIIRG 527 (581) T ss_pred e--E-EEEeeeecCCCCCcceeeeeehhhhHHHHHHHHHhhhhcCCC--c-ccCHHHHHHHHHHHHHHHHHHHhcCcccC Confidence 2 2 23466544 1112455688999999999999988863 4543 4 77889999999999999999999999985 Q ss_pred ccccCcccceeeccccCCcccccceeecceEEEecchHhCCHHHHhhcccCceEEEEEECCeEEEEEeeeeeC Q lcl|NC_019451. 432 GKEISAVQQQYITQVTGDRRAWRQVQTLGYWINITFSSYTNSNTGLTEWKANYTLIYSKGDAIRFVEGSDVMI 504 (504) Q Consensus 432 G~~~~~~q~~~i~~~~g~~~~~~~~~~~GY~v~~~~~~~~s~~dr~~R~~~~i~~~~~~aGAIh~v~i~~~~v 504 (504) .....+.| .++. 
+-.--+.|.+.-.-+|++|-++--.+ T Consensus 528 ~~~~~~~~----------------------------------~~~~-~d~v~V~i~v~Pv~~i~~I~vti~~~ 565 (581) T protein:vir:10 528 YRNLKARQ----------------------------------IERQ-PDVIEVRYEWRPAYPLNYIVVRYSIA 565 (581) T ss_pred Cccceeee----------------------------------eecC-CCEEEEEEEEEecccceEEEEEEEEe Confidence 32211111 0010 11123566666666666666666666 No 53 >protein:vir:7206 Length: 659 # NCBI annotation: gp18 tail sheath protein # Family: family:all:661 # MgeID: mge:142 # MgeName: T4 # Cross-refs: genbank:acc:NP_049780;genbank:gi:9632592;genbank:GeneID:1258597 Probab=95.66 E-value=0.0017 Score=35.81 Aligned_cols=457 Identities=13% Similarity=0.067 Sum_probs=211.1 Q ss_pred CCCccceEEEeeeecccccccccccceEEEecccccCccceEEecCHHHHHHhcC---CCcHHHHHHHHHhccCCCCCcc Q lcl|NC_019451. 1 MISQSRYIRIISGVGAGAPVAGRKLILRVMTTNNVIPPGIVIEFDNANAVLSYFG---AQSEEYQRAAAYFKFISKSVNS 77 (504) Q Consensus 1 mip~s~iv~V~~~v~~~~~~~~~~~~~l~l~~~~~~~~~r~~~y~s~~~V~~~Fg---~~s~ey~aA~~yF~~~~~~~~~ 77 (504) |-=++-=|-|.--=.+.......-..+.|++....=|++.....+|..|..+.|| ..+.++.+...||-+- T Consensus 1 ~~~~~PgVyvee~~~~~~~~~~~ts~~~fvG~~~~Gp~~~p~~i~s~~~~~~~fG~~~~~~~~~~~~~~~f~ng------ 74 (659) T protein:vir:72 1 MTLLSPGIELKETTVQSTVVNNSTGTAALAGKFQWGPAFQIKQVTNEVDLVNTFGQPTAETADYFMSAMNFLQY------ 74 (659) T ss_pred CceecCceEEEEecCCcccccCCCcceEEEeecCCCCCcccEEecCHHHHHHHcCCcCCCCchhHHHHHHHHhC------ Confidence 4444433333211011111122356778888877777777888899999999999 4577788888888763 Q ss_pred cceEEEEeeeccCCc--ceeeecccchhhHhHh-hccC-ceEEEEEcc----cceeeeeeccccc---------cchHH- Q lcl|NC_019451. 78 PSSISFARWVNTAIA--PMVVGDNLPKTIADFA-GFSA-GVLTIMVGA----AEQNITAIDTSAA---------TSMDN- 139 (504) Q Consensus 78 P~~l~igr~~~~a~~--~~l~g~~~~~~~~~~~-~~~~-g~~titi~g----~~~~~~~i~~s~a---------ts~~~- 139 (504) =+++||-|....... +...+..+......-. .... -...+...+ ..-.+..++.+.. ..++. 
T Consensus 75 g~~~~vvRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~g~v~~~~~~~~~~~~~v~t~~~~a~~ 154 (659) T protein:vir:72 75 GNDLRVVRAVDRDTAKNSSPIAGNIDYTISTPGSNYAVGDKITVKYVSDDIETEGKITEVDADGKIKKINIPTGKNYAKA 154 (659) T ss_pred CceEEEEEccCCcccccccccccccceeecccccccccceeeeeeeccccccccceEEEeeccccceeeeeccccccccc Confidence 356999998542111 1111111100000000 0000 000111110 0000111111100 00000 Q ss_pred -----HHHHHHh---hhhcccc--cccceeE----------------------EEEecccceeEEec------ccCccee Q lcl|NC_019451. 140 -----VASIIQT---EIRKNAD--PQLAQAT----------------------VTWNQNTNQFTLVG------ATIGTGV 181 (504) Q Consensus 140 -----vA~~i~~---~i~a~~~--~~~~~a~----------------------vt~d~~~~~F~its------~t~ga~s 181 (504) ....... .+..... ....... ..++.......... .+.+... T Consensus 155 ~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~v~v~~~~~~~~~~v~~~~~a~~~~~~~~~v~~~~~~~~~a~~~gt~g~~~ 234 (659) T protein:vir:72 155 KEVGEYPTLGSNWTAEISSSSSGLAAVITLGKIITDSGILLAEIENAEAAMTAVDFQANLKKYGIPGVVALYPGELGDKI 234 (659) T ss_pred cccccccccccceeeEEeeccccccceEEEEEeecCcceeeeeccccchhhhcccccccccccccceeeeccccccccce Confidence 0000000 0000000 0000000 00000000000000 0000000 Q ss_pred EEEEee--------------------------------ccchh---hh--------------h--------------hhh Q lcl|NC_019451. 182 LAVAKS--------------------------------ADPQD---MS--------------T--------------ALG 198 (504) Q Consensus 182 ~~~~~s--------------------------------a~~~~---ia--------------~--------------~l~ 198 (504) .+.... ....+ +. . ... T Consensus 235 tv~i~~~~~~~~~~~~~v~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 314 (659) T protein:vir:72 235 EIEIVSKADYAKGASALLPIYPGGGTRASTAKAVFGYGPQTDSQYAIIVRRNDAIVQSVVLSTKRGEKDIYDSNIYIDDF 314 (659) T ss_pred eEEEccccccccceeeeeecccccccccccceeeeeeecccccccceeeecccceeeeeeeeeccccccccchhhhhhhh Confidence 000000 00000 00 0 000 Q ss_pred cccCcc--ee---------------e-cccc------cccHHHHHHHHHhhc-cceeEEEEEeccC--CHHH----HHHH Q lcl|NC_019451. 
199 WSTSNV--VN---------------V-AGQA------ADLPDAAVAKSTNVS-NNFGSFLFAGAPL--DNDQ----IKAV 247 (504) Q Consensus 199 ~t~~~~--~~---------------~-~g~a------aet~~~al~~~~~~~-~~wy~~~~~~~~~--~~~~----~~a~ 247 (504) +..... +. . .|.+ ..+...++..+.+.. .+. .++.+.... ..++ ..++ T Consensus 315 ~~~~~~~~v~~~~~~~~~~~~~~~~l~gg~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~l~~p~~~~~~~~~~~~v~~~l 393 (659) T protein:vir:72 315 FAKGGSEYIFATAQNWPEGFSGILTLSGGLSSNAEVTAGDLMEAWDFFADRESVDV-QLFIAGSCAGESLETASTVQKHV 393 (659) T ss_pred hhcCCceEEEEEecccCCcccccccccccccccccccchhHHHHHHHhhhccccce-eEEEecCCCCcchhhhHHHHHHH Confidence 000000 00 0 0000 011223333333221 122 222222111 1112 2334 Q ss_pred HHHHhhcCCcEEEEEecc--------ccchhHHHHHHhh-----------hcceeEEEEeccC-----CC---c-cHHHH Q lcl|NC_019451. 248 SAWNAAQNNQFIYTVATS--------LANLGTLFTLVNG-----------NAGTALNVLSATA-----AN---D-FVEQC 299 (504) Q Consensus 248 A~w~e~~~~~~~~~~~~~--------d~~~~~~~~~~~~-----------~~~~~~~~~~~~~-----~~---~-~~~aa 299 (504) ...+|....++.+.-.-. ..+......-... ...+...++++.. .+ . -|... T Consensus 394 ~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~sg~ 473 (659) T protein:vir:72 394 VSIGDARQDCLVLCSPPRETVVGIPVTRAVDNLVNWRTAAGSYTDNNFNISSTYAAIDGNHKYQYDKYNDVNRWVPLAAD 473 (659) T ss_pred HHHHhhhCCEEEEEcCccccccCCCcccCHHHHHHHHhhccccccccccccceeEEEEcCceeeccccCCceEEechHHH Confidence 455555444443321100 0011111110000 0112223333211 01 1 24567 Q ss_pred HHHHHHhcCcCcCCc-eeeecccccCccc-----cccCCHHHHHHHHhCCCeEEEEEeeccceeeEEEcCEEeCCcc-cc Q lcl|NC_019451. 300 PSEILAATNYDEPGA-SQNYMYYQFPGRN-----ITVSDDTVANTVDKSRGNYIGVTQANGQQLAFYQRGILCGGPT-DA 372 (504) Q Consensus 300 ~~g~~as~nf~~~~g-~~T~kfk~l~Gv~-----a~~lt~t~~~al~~~~~n~y~~~~~~~~~~~~~~~G~~~~G~y-~~ 372 (504) ++|.++.+|.++ | ......|.+.||. ...+++.|.+.|..+++|+...+.+.|. 
.++...++++.- +| T Consensus 474 vAGl~Ar~D~~~--G~~~span~~~~~i~g~~~~~~~~~~~~~~~Ln~~gIn~i~~~~g~G~---~~wG~rT~~~~~s~~ 548 (659) T protein:vir:72 474 IAGLCARTDNVS--QTWMSPAGYNRGQILNVIKLAIETRQAQRDRLYQEAINPVTGTGGDGY---VLYGDKTATSVPSPF 548 (659) T ss_pred HHHHHHHhhccC--CcEEccCCeeeceeeccccccccCChhHHHHHhhCCceEEEEecCCeE---EEEcccccCCCCccc Confidence 778888887433 3 2233445544443 2357899999999999999988876553 457778877763 67 Q ss_pred chhhhhhhHHHHHHHHHHHHHHHHhcCCCCCcCHHHHHHHHHHHHHHHHHHHhcCccccccccCcccceeeccccCCccc Q lcl|NC_019451. 373 VDMNVYANEIWLKSAIAQALLDLFLNVNAVPASSTGEAMTLAVLQPVLDKATANGTFTYGKEISAVQQQYITQVTGDRRA 452 (504) Q Consensus 373 ~wiD~~~~~dwl~~~lq~~l~~l~~~~~kIPyt~~G~~~l~~~v~~vl~~a~~nG~Ia~G~~~~~~q~~~i~~~~g~~~~ 452 (504) .+|.+.+-.+|+.+.|+.......-. |.|+.=...|+..|+.-|++-+++|.|. T Consensus 549 ~~i~vrR~~~~i~~si~~~~~~~v~e----~n~~~l~~~i~~~i~~fL~~l~~~gal~---------------------- 602 (659) T protein:vir:72 549 DRINVRRLFNMLKTNIGRSSKYRLFE----LNNAFTRSSFRTETAQYLQGNKALGGIY---------------------- 602 (659) T ss_pred ceEeehhHHHHHHHHHHHHHHHhhcC----CCCHHHHHHHHHHHHHHHHHHHhcCcee---------------------- Confidence 88999999999999999888763322 5688888999999999999999999984 Q ss_pred ccceeecceEEEecchHhCCHHHHhhcccCceEEEEEECCeEEEEEeeeeeC Q lcl|NC_019451. 453 WRQVQTLGYWINITFSSYTNSNTGLTEWKANYTLIYSKGDAIRFVEGSDVMI 504 (504) Q Consensus 453 ~~~~~~~GY~v~~~~~~~~s~~dr~~R~~~~i~~~~~~aGAIh~v~i~~~~v 504 (504) ||.|.++ .++.+++|+.+-+. .+.+.++-.-.+++|.++=+-. 
T Consensus 603 -------~~~V~~d-~~~nt~~~i~~G~~-~~~i~~~p~~pae~I~~~~~~~ 645 (659) T protein:vir:72 603 -------EYRVVCD-TTNNTPSVIDRNEF-VATFYIQPARSINYITLNFVAT 645 (659) T ss_pred -------eEEEEEc-CCCCCHHHhhCCeE-EEEEEEEecCCccEEEEEEEEe Confidence 4889887 57888888887777 5899999999999998875544 No 54 >protein:vir:106984 Length: 743 # NCBI annotation: contractile sheath protein gp18 # Family: family:all:661 # MgeID: mge:1459 # MgeName: S-PM2 # Cross-refs: genbank:acc:YP_195136;genbank:gi:58532913;uniprot:Q5GQN6;genbank:GeneID:3260481 Probab=95.60 E-value=0.0018 Score=35.67 Aligned_cols=439 Identities=11% Similarity=-0.005 Sum_probs=172.0 Q ss_pred CCCccceEEE---------eeeecccccccc---cccceEEEeccc--ccCccce-----EEecCHHH-----------H Q lcl|NC_019451. 1 MISQSRYIRI---------ISGVGAGAPVAG---RKLILRVMTTNN--VIPPGIV-----IEFDNANA-----------V 50 (504) Q Consensus 1 mip~s~iv~V---------~~~v~~~~~~~~---~~~~~l~l~~~~--~~~~~r~-----~~y~s~~~-----------V 50 (504) |.+.+...-. .+.+..+..... .....+...... ..+..++ ........ + T Consensus 216 ~~~~~~~~~~~~~~~~~~~~~tv~v~~~~~~vg~~v~~~~~~~~~~~~~~~~~~v~~~~~~~~t~~~~~~~~~~~g~~~~ 295 (743) T protein:vir:10 216 RTPGTYSNVPASGGTGTGATFNVVVADAGGGVGGSVVVTLANPGTGYNQGETLTIASAATGDGTDILVTVATLSDGTIAI 295 (743) T ss_pred ccccceeeEEecccccccccccccccccccccccccccccccccceeeeccccccccccccccccchhheecccccceee Confidence 3222111100 000000000000 000000000000 0000000 00000000 0 Q ss_pred HH-hcCCCcHHHHHHHHHhccCCCCCcccceEEEEeeeccCCcceeeecccchhhHhHhhccCceEEEEEcccceeeeee Q lcl|NC_019451. 51 LS-YFGAQSEEYQRAAAYFKFISKSVNSPSSISFARWVNTAIAPMVVGDNLPKTIADFAGFSAGVLTIMVGAAEQNITAI 129 (504) Q Consensus 51 ~~-~Fg~~s~ey~aA~~yF~~~~~~~~~P~~l~igr~~~~a~~~~l~g~~~~~~~~~~~~~~~g~~titi~g~~~~~~~i 129 (504) .. .....++.+......+.. ..++|..+.++.-. ...+..+.....+ ..+.++.+ .+.... .-. 
T Consensus 296 ~a~~~~~~~~~~~~~~~~~~~---~~~~~~t~~~~~~~------~~~~d~~~v~v~~----~~~~~~~~-~~~v~~-~~~ 360 (743) T protein:vir:10 296 TELKDWYLNTEIGSTGIKLGD---IGPRPGTSQFATDN------GITDDQVHFAVID----TTGELTGT-ANTIVE-RLT 360 (743) T ss_pred eecccccccchhhcccccccc---ccccceeeeccccc------cccccceEEEEec----Ccceeeec-cCceeE-EEe Confidence 00 001122223322222221 12333333322100 0000000000000 00000000 000000 000 Q ss_pred ccccccch---HHHHHHHHhhhhcccccccceeEEEEecccceeEEeccc-CcceeEEEEeeccchhhhhhhhcccCcce Q lcl|NC_019451. 130 DTSAATSM---DNVASIIQTEIRKNADPQLAQATVTWNQNTNQFTLVGAT-IGTGVLAVAKSADPQDMSTALGWSTSNVV 205 (504) Q Consensus 130 ~~s~ats~---~~vA~~i~~~i~a~~~~~~~~a~vt~d~~~~~F~its~t-~ga~s~~~~~sa~~~~ia~~l~~t~~~~~ 205 (504) .++..... .+....+...+.. ....+.+............. .+............. .. ....... T Consensus 361 ~~s~~~~~~~~~~~~~~~~~~~~~------~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~---~~~~~~~ 429 (743) T protein:vir:10 361 YLSKLSDARSEENANIYYKNVINE------QSAYLYHGNDAAVQIAASGEAWGQSSDQVLADAGTA--FS---RTTGYWV 429 (743) T ss_pred eeecccccccccCcceeecceecc------ccceeeccCcccceeeeccccCccccceeeeecccc--cc---cccceEE Confidence 01110000 0000000000000 00001111000000000000 000000000000000 00 0000001 Q ss_pred e-ecccc-----cccHHHHHHHHHhhccceeEEEEEecc-----CCHHHHHHHHHHHhhcCCcEEEEEeccc-------- Q lcl|NC_019451. 206 N-VAGQA-----ADLPDAAVAKSTNVSNNFGSFLFAGAP-----LDNDQIKAVSAWNAAQNNQFIYTVATSL-------- 266 (504) Q Consensus 206 ~-~~g~a-----aet~~~al~~~~~~~~~wy~~~~~~~~-----~~~~~~~a~A~w~e~~~~~~~~~~~~~d-------- 266 (504) . ..|.+ ......++..+.+...-...++.+... ....-..++.+.++....++.++-.-.. 
T Consensus 430 ~~~gG~d~~~~~~~~~~~~~~~~~~~~~~~~~ll~~p~~~~~~~~~~~v~~a~~~~~~~~~~~~a~~d~p~~~~~~~~~~ 509 (743) T protein:vir:10 430 NLAGGNDDFAYDAGEFGAAMDLFLDTEETEIDFVLMGGSMADEADTKSKATKVIAIAASRKDALAFVSPHKGNQIASTGN 509 (743) T ss_pred EeecCccccccchhHHHHHHHHhhhccccCcceEEecCcccCccchHHHHHHHHHHHHhhCCeEEEEecCCCcccccccc Confidence 1 11222 122334444444322111233333211 1233345666666665544433311100 Q ss_pred ------cchhHHHHHHhhhc--ceeEEEEecc--C---CC----ccHHHHHHHHHHhcCcCcCCceeeecccccCccc-- Q lcl|NC_019451. 267 ------ANLGTLFTLVNGNA--GTALNVLSAT--A---AN----DFVEQCPSEILAATNYDEPGASQNYMYYQFPGRN-- 327 (504) Q Consensus 267 ------~~~~~~~~~~~~~~--~~~~~~~~~~--~---~~----~~~~aa~~g~~as~nf~~~~g~~T~kfk~l~Gv~-- 327 (504) .............. .+...++++. + .. .-+.+.++|.++.+|.++. =......|.+.||. T Consensus 510 ~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~~AGl~a~~D~~~g-~~~span~~~~gi~g~ 588 (743) T protein:vir:10 510 VALSSAQQKENTIAFFSDLTSTSYAVFDSGYKYVYDRFTDKYRYIPCNGDVAGLCVQTSNQLD-DWYSPAGLNRGGILNA 588 (743) T ss_pred ccccccccchHHHHHHHhccCCeeEEEEccceeeeccccCceeEechhHHHHHHHHHhhccCC-cEEccCCeeeeeeecc Confidence 00011111111111 1222222211 0 01 1244667788888874331 12234455655553 Q ss_pred ---cccCCHHHHHHHHhCCCeEEEEEeeccceeeEEEcCEEeCC-ccccchhhhhhhHHHHHHHHHHHHHHHHhcCCCCC Q lcl|NC_019451. 328 ---ITVSDDTVANTVDKSRGNYIGVTQANGQQLAFYQRGILCGG-PTDAVDMNVYANEIWLKSAIAQALLDLFLNVNAVP 403 (504) Q Consensus 328 ---a~~lt~t~~~al~~~~~n~y~~~~~~~~~~~~~~~G~~~~G-~y~~~wiD~~~~~dwl~~~lq~~l~~l~~~~~kIP 403 (504) ...+++.|.+.|..+++|+...+.+.| + .++..+++.+ .-+|.+|-+.+-.+|+++.|+..++..+-. 
| T Consensus 589 ~~~~~~~~~~~~~~Ln~~gIn~i~~~~~~G--~-~~wG~rT~~s~d~~~~~i~vrR~~~~i~~si~~~~~~~v~e----~ 661 (743) T protein:vir:10 589 VKLAYNPNKADRDELYQNRINPVVSLRGQG--I-TLFGDKTALAAPSAFDRINVRRLFLNLEKRARRLAEGVLFE----Q 661 (743) T ss_pred ccceecCChhHHHhHhhCCceEEEEecCCe--E-EEEcccccCCCCcccceEeehhhHHHHHHHHHHHHHHhccC----C Confidence 134789999999999999998887655 3 3467777644 446778999999999999999999874432 4 Q ss_pred cCHHHHHHHHHHHHHHHHHHHhcCccccccccCcccceeeccccCCcccccceeecceEEEecchHhCCHHHHhhcccCc Q lcl|NC_019451. 404 ASSTGEAMTLAVLQPVLDKATANGTFTYGKEISAVQQQYITQVTGDRRAWRQVQTLGYWINITFSSYTNSNTGLTEWKAN 483 (504) Q Consensus 404 yt~~G~~~l~~~v~~vl~~a~~nG~Ia~G~~~~~~q~~~i~~~~g~~~~~~~~~~~GY~v~~~~~~~~s~~dr~~R~~~~ 483 (504) .|+.=...|+..|+.-|+.-+++|.|. ||.|.+. .+..+++|..+-+. . T Consensus 662 n~~~~~~~i~~~i~~fL~~l~~~gal~-----------------------------~~~V~~d-~~~nt~~~i~~G~~-~ 710 (743) T protein:vir:10 662 NDATTRAGFSSALNSYLSEVQARRGVT-----------------------------DYLVICD-ESNNTPDIIDRNEF-V 710 (743) T ss_pred CCHHHHHHHHHHHHHHHHHHHhcCcee-----------------------------eeEEEEc-CCCCCHHHhhCCeE-E Confidence 588888999999999999999999873 5889986 57888888887777 5 Q ss_pred eEEEEEECCeEEEEEeeeeeC Q lcl|NC_019451. 484 YTLIYSKGDAIRFVEGSDVMI 504 (504) Q Consensus 484 i~~~~~~aGAIh~v~i~~~~v 504 (504) +.+.++----+++|+++=.-. T Consensus 711 ~~i~~~p~~pae~I~~~~~~~ 731 (743) T protein:vir:10 711 AEVYVKPTRSINFITITFTAT 731 (743) T ss_pred EEEEEEecCCcceEEEEEEEe Confidence 889999999999988775433 No 55 >protein:vir:79798 Length: 717 # NCBI annotation: tail sheath subunit # Family: family:all:12069 # MgeID: mge:1874 # MgeName: 0305phi8-36 # Cross-refs: genbank:acc:YP_001429630;genbank:gi:156564120;genbank:GeneID:5525563 Probab=95.57 E-value=0.0018 Score=35.60 Aligned_cols=415 Identities=11% Similarity=0.010 Sum_probs=169.6 Q ss_pred CCCccceEEEeeeecccccccccccceEEEecccccCc--cceEEecCHH-HHHHhcCCCcHHHHHHHHHhccCCCCCcc Q lcl|NC_019451. 
1 MISQSRYIRIISGVGAGAPVAGRKLILRVMTTNNVIPP--GIVIEFDNAN-AVLSYFGAQSEEYQRAAAYFKFISKSVNS 77 (504) Q Consensus 1 mip~s~iv~V~~~v~~~~~~~~~~~~~l~l~~~~~~~~--~r~~~y~s~~-~V~~~Fg~~s~ey~aA~~yF~~~~~~~~~ 77 (504) ||----.|.|...-.+-++.. .|-+++-+..-.++.. +.+..-+... .|++ |-|.+-+-..-- - T Consensus 228 ~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~-------~ 294 (717) T protein:vir:79 228 MIAKGADVTIKLEHVALAGLK-LYADGIEVVDAKAFTVAGDQLTIHSNSKMKLGA-----SLEAQYAYNLVE-------V 294 (717) T ss_pred eEEecccceeehhhhhhhhhH-HhhcchhhhhhhheeeecceEEEEecCCcccch-----hhHHHHHhhHHH-------h Confidence 443333344443333333322 2333333322222221 2333333222 2322 233221111111 1 Q ss_pred cceEEEEeeeccCCcceeeecccchhhHhHhhccCceEEEEEcccceeeeeeccccccchHHHHHHHHhhhhcccccccc Q lcl|NC_019451. 78 PSSISFARWVNTAIAPMVVGDNLPKTIADFAGFSAGVLTIMVGAAEQNITAIDTSAATSMDNVASIIQTEIRKNADPQLA 157 (504) Q Consensus 78 P~~l~igr~~~~a~~~~l~g~~~~~~~~~~~~~~~g~~titi~g~~~~~~~i~~s~ats~~~vA~~i~~~i~a~~~~~~~ 157 (504) -+.++.-|= ..||.+- ..++++++-.+..+. ..+.-.+.-.++ +...+..... T Consensus 295 ~~~~~~~~~--------~~~g~~~-----------n~~~~~v~~~D~~~~-~~~t~~~~~~g~-------~~~~pl~~ts 347 (717) T protein:vir:79 295 IQPVIELES--------IFGGGVY-----------NDIMRKVESKDGAVT-VTITKPESKRGM-------ISEDPLVFKS 347 (717) T ss_pred hccceEEee--------cccCcee-----------eeeeeEEecCCceEE-EEEecccccCcc-------eecccccccc Confidence 111222211 0112111 112222221111000 000000000000 0000000011 Q ss_pred eeEEEE----ecc-----cceeEEeccc-CcceeEEEEeeccchhhhhhhhcccCcce-----e-ecccccc-----cHH Q lcl|NC_019451. 158 QATVTW----NQN-----TNQFTLVGAT-IGTGVLAVAKSADPQDMSTALGWSTSNVV-----N-VAGQAAD-----LPD 216 (504) Q Consensus 158 ~a~vt~----d~~-----~~~F~its~t-~ga~s~~~~~sa~~~~ia~~l~~t~~~~~-----~-~~g~aae-----t~~ 216 (504) ++++.+ |.. ..-+...+.. ..+.......+.+ -..+.|...+-.. + ..|...+ ++. 
T Consensus 348 ~dy~~~~~~vdgI~~~~~~~V~~~g~~s~a~a~~~~g~~s~d---~a~f~Gg~dgl~~~~ee~Y~~lGgk~~d~g~lt~~ 424 (717) T protein:vir:79 348 GDYTNFKMLVDAINNHPFNNVVRARTKPEFEATFTSTLQAAA---DAKFSGGKDELSLDKEEMYKRLGGEKNEEGFVTKQ 424 (717) T ss_pred CceeeeeeeecccccCchhheeeeecccccceeeeecccCch---hhccCCCccccccchhhhhccccccccccccccch Confidence 111111 111 0111111100 0000000000000 0001111110000 0 0011111 112 Q ss_pred HHHHHHHhhccceeEEEEEecc-----CCHHHHHHHHHHHhhcCCc---EEEEEe-c--cccch---hHHHHHHhhh--- Q lcl|NC_019451. 217 AAVAKSTNVSNNFGSFLFAGAP-----LDNDQIKAVSAWNAAQNNQ---FIYTVA-T--SLANL---GTLFTLVNGN--- 279 (504) Q Consensus 217 ~al~~~~~~~~~wy~~~~~~~~-----~~~~~~~a~A~w~e~~~~~---~~~~~~-~--~d~~~---~~~~~~~~~~--- 279 (504) .++..+....-+|-.+.-.... .-++...+++.+++++... .+-+.. . .|... ......+... T Consensus 425 aays~LE~~dVDlVil~ga~adtt~ga~~d~va~alad~caalSal~r~ai~VI~l~sp~D~~~AtVe~~~~kLs~~Aaa 504 (717) T protein:vir:79 425 GAYQYLENYEVDYVIPLGVHADTKLIGKYDDFAYQLALACAVMSHYNSVTIGIIPTTTPSDISLAGVEEHVKKLENYANE 504 (717) T ss_pred hhhhhcCcceeEEEEecCccccccccchhhhHHHHHHHHHHHhhhccccceeeeccccccccchhhHHHHHHHHHhhhhh Confidence 3344443332333221111110 0123356778888765421 111111 0 01100 1100000000 Q ss_pred ---------------------cceeEE------EEeccCCC---ccHHHHHHHHHHhcCcCcCCceeeecccccCccc-- Q lcl|NC_019451. 280 ---------------------AGTALN------VLSATAAN---DFVEQCPSEILAATNYDEPGASQNYMYYQFPGRN-- 327 (504) Q Consensus 280 ---------------------~~~~~~------~~~~~~~~---~~~~aa~~g~~as~nf~~~~g~~T~kfk~l~Gv~-- 327 (504) ..+... +....... ..+++.+.|..+..++...+ .+|.+.|+. T Consensus 505 ~~~~d~~~a~a~~~~~~~idis~y~~vv~~~~~iv~~~~~~~~~~p~AG~vAGldA~rGVwkSP-----ANk~I~GVvgL 579 (717) T protein:vir:79 505 FYMRDRFGNIIFDADRNKIDLGQFIEVVAGPDFIVRNTRLGQMASTPDASYIGMVSQLKTQSAP-----TNKPLPSVTAL 579 (717) T ss_pred hhhhcchhccccccccccccccceeeeeecceeEEEcCCCceeecCHHHHHHHHHhcCCccccc-----ccceecccccC Confidence 000000 00011111 12344455555554443333 367777775 Q ss_pred cccCCHHHHHHHHhCCCeEEEEEeeccceeeEEEcCEEeCCcc-ccchhhhhhhHHHHHHHHHHHHHHHHhcCCCCCcCH Q lcl|NC_019451. 
328 ITVSDDTVANTVDKSRGNYIGVTQANGQQLAFYQRGILCGGPT-DAVDMNVYANEIWLKSAIAQALLDLFLNVNAVPASS 406 (504) Q Consensus 328 a~~lt~t~~~al~~~~~n~y~~~~~~~~~~~~~~~G~~~~G~y-~~~wiD~~~~~dwl~~~lq~~l~~l~~~~~kIPyt~ 406 (504) ...++..|++.|..+|+|++..+.+.| + .++.++++++.. +|..|-+.+-.|++...|+..+.... .+ |-++ T Consensus 580 a~~lT~sE~d~Ln~aGIntIr~~~GrG--i-rVWGaRTtasd~sdWryInVRRl~D~Ie~sIr~al~~yV---gE-PNd~ 652 (717) T protein:vir:79 580 RYTYSANQLNRLTKARFATFKYKQDGS--I-GVVDAPTSAHAGSDYTRLSTARIVKEAVNAVREVADPFI---GE-PNDT 652 (717) T ss_pred cccCCHHHHHHHhhCCeEEEEEeCCce--E-EEEeeeecCCCCcccceeehhhhHHHHHHHHHHHHHHhc---cc-cCCH Confidence 457899999999999999998776654 3 456888887653 47778999999999999988877533 22 6788 Q ss_pred HHHHHHHHHHHHHHHHHHhcCccccccccCcccceeeccccCCcccccceeecceEEEecchHhCCHHHHhhcccCceEE Q lcl|NC_019451. 407 TGEAMTLAVLQPVLDKATANGTFTYGKEISAVQQQYITQVTGDRRAWRQVQTLGYWINITFSSYTNSNTGLTEWKANYTL 486 (504) Q Consensus 407 ~G~~~l~~~v~~vl~~a~~nG~Ia~G~~~~~~q~~~i~~~~g~~~~~~~~~~~GY~v~~~~~~~~s~~dr~~R~~~~i~~ 486 (504) .+...|+..|+.-|++-.+.|.|. ||.+.+ .++++|..+= .--+.+ T Consensus 653 ~tr~~Ik~sI~afL~~L~r~GAI~-----------------------------Gykvdv----tnT~~di~~G-~l~V~I 698 (717) T protein:vir:79 653 GNRNALTAAVDKRLSKMIENKALL-----------------------------GFDFRL----VVTPQQELLG-EGSIEL 698 (717) T ss_pred HHHHHHHHHHHHHHHHHHhcCcee-----------------------------cceeeE----ecChhHhhCC-EEEEEE Confidence 999999999999999999999996 233332 2344554432 224788 Q ss_pred EEEECCeEEEEEeeeeeC Q lcl|NC_019451. 
487 IYSKGDAIRFVEGSDVMI 504 (504) Q Consensus 487 ~~~~aGAIh~v~i~~~~v 504 (504) .+.....+++|.++-++= T Consensus 699 ~vaPv~PaEfI~ititIT 716 (717) T protein:vir:79 699 SLEAPNELRRLTTIVSLS 716 (717) T ss_pred EEEecCcccEEEEEEEEe Confidence 888889999988875444 No 56 >protein:vir:5663 Length: 671 # NCBI annotation: tail sheath protein # Family: family:all:661 # MgeID: mge:119 # MgeName: KVP40 # Cross-refs: genbank:acc:NP_899602;genbank:gi:34419589;genbank:GeneID:2545776 Probab=95.43 E-value=0.0021 Score=35.29 Aligned_cols=456 Identities=11% Similarity=-0.009 Sum_probs=214.2 Q ss_pred CCCccceEEEeeeecccccccccccceEEEecccccCccceEEecCHHHHHHhcCC---CcHHHHHHHHHhccCCCCCcc Q lcl|NC_019451. 1 MISQSRYIRIISGVGAGAPVAGRKLILRVMTTNNVIPPGIVIEFDNANAVLSYFGA---QSEEYQRAAAYFKFISKSVNS 77 (504) Q Consensus 1 mip~s~iv~V~~~v~~~~~~~~~~~~~l~l~~~~~~~~~r~~~y~s~~~V~~~Fg~---~s~ey~aA~~yF~~~~~~~~~ 77 (504) |--++-=|-|.--=.+..........+.|++....=|++.-..-+|..|..+.||. .+.++.+...||-+- T Consensus 1 ~~~~~Pgvyv~e~~~~~~i~~v~t~~~~fvG~~~~Gp~~~p~~v~s~~~~~~~fG~~~~~~~~~~~v~~~f~ng------ 74 (671) T protein:vir:56 1 MTLLSPGIENKEINLASAIGRAATGRAAMVGKFEWGPAYSITQVTSESDLVTIFGRPNDYTAASFMTANNFLKY------ 74 (671) T ss_pred CceecCceEEEeecCcccccccCcccceEEecccCCCCccCEEcCCHHHHHHHcCCcCCCcchhHHHHHHHHhc------ Confidence 66666555554322345566666778888888777677777778889999999997 677888999999874 Q ss_pred cceEEEEeeeccCCc--ce-ee---------------ecccch------------hhH---------------------- Q lcl|NC_019451. 78 PSSISFARWVNTAIA--PM-VV---------------GDNLPK------------TIA---------------------- 105 (504) Q Consensus 78 P~~l~igr~~~~a~~--~~-l~---------------g~~~~~------------~~~---------------------- 105 (504) =+++||-|....... +. +. |-.+.. ... 
T Consensus 75 g~~~~vvrv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~v~ 154 (671) T protein:vir:56 75 GNDLRLVRICDATTAQNATPLYNAVEYTIGASNGCVVGDDITITYSGVGALTAKGKVLEVDAGNNNAASKIFLPSAEIVA 154 (671) T ss_pred CCeEEEEEecCccccccchhhccccccccccCcceeeceeeeeecCcccccccCcceeEEeeeccceeeeeeccceeEEE Confidence 356899987542110 00 00 000000 000 Q ss_pred ---------hHhh--c--cCceEEEEEc-----ccce------eee---ee----------ccccccchHHHHHHH---- Q lcl|NC_019451. 106 ---------DFAG--F--SAGVLTIMVG-----AAEQ------NIT---AI----------DTSAATSMDNVASII---- 144 (504) Q Consensus 106 ---------~~~~--~--~~g~~titi~-----g~~~------~~~---~i----------~~s~ats~~~vA~~i---- 144 (504) .+.. . ..+...+.+. +... ... .+ +........+....- T Consensus 155 ~~~~~~~~~~~~~~t~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~g~~ 234 (671) T protein:vir:56 155 AAKSDGNYPSVGTITLQPTQGDIALTNIEIIDTGSVYFPNIELAFDALTAIETEGGALKYADLIEKQGFPRLSARYVGDF 234 (671) T ss_pred eeeccccccccccccccccccceeeeeecccccceEEEeccccccccccccccccccccchhhhhccccccccccccccc Confidence 0000 0 0000000000 0000 000 00 000000000000000 Q ss_pred -----------Hhhhhccccccc-----ceeEEEEec----------------ccceeEEecccCcc-eeEEEEeeccc- Q lcl|NC_019451. 145 -----------QTEIRKNADPQL-----AQATVTWNQ----------------NTNQFTLVGATIGT-GVLAVAKSADP- 190 (504) Q Consensus 145 -----------~~~i~a~~~~~~-----~~a~vt~d~----------------~~~~F~its~t~ga-~s~~~~~sa~~- 190 (504) ............ ....++-+. ..+.|.+.-...+. ..........+ T Consensus 235 g~~~~v~v~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~~~~~~~~~~~~ 314 (671) T protein:vir:56 235 GDAISVEIINYADYQTAFAFAAGHTLGDIELPIYPDGGTRSINLSSYFTFGPSNSNQYAVIVRVSGEVEEAFIVSTNPGD 314 (671) T ss_pred CcceEEEEecccccccccccccceeeeeccccccccccccccccceeecccccccccceeEEeecCccceeEEEeecccc Confidence 000000000000 000000000 00011110000000 00000000000 Q ss_pred -----hhhhhhhhcccCcce-----------------eecccc----cccHHHHHHHHHhhccceeE-EEEEeccCCH-- Q lcl|NC_019451. 
191 -----QDMSTALGWSTSNVV-----------------NVAGQA----ADLPDAAVAKSTNVSNNFGS-FLFAGAPLDN-- 241 (504) Q Consensus 191 -----~~ia~~l~~t~~~~~-----------------~~~g~a----aet~~~al~~~~~~~~~wy~-~~~~~~~~~~-- 241 (504) ...........+... ...|.+ ..+..+++..+.+.. ...- ++.+.....+ T Consensus 315 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~d~~~~~~~~~~~~~~~~~~~-~~~~~~~~a~~~~~~~~ 393 (671) T protein:vir:56 315 KDVNGQSIFIDEYFENSGSAYITAIAEGWKTESGAYNFGGGSDANAGADDWMFGLDMLSDPE-VLYTNLVIAGNAAAEEV 393 (671) T ss_pred cccchhhhhhhhhhcccCceEEEecCcccCCccccccccCccccccchhHHHHHHHhhhhcc-ccceeEEEcCCCCCccc Confidence 000000000011000 011111 122334455554432 2221 2222111111 Q ss_pred --H--HHH-HHHHHHhhcCCcEEEEEeccc------c-ch-hHHHHHHh----------------hhcceeEEEEeccC- Q lcl|NC_019451. 242 --D--QIK-AVSAWNAAQNNQFIYTVATSL------A-NL-GTLFTLVN----------------GNAGTALNVLSATA- 291 (504) Q Consensus 242 --~--~~~-a~A~w~e~~~~~~~~~~~~~d------~-~~-~~~~~~~~----------------~~~~~~~~~~~~~~- 291 (504) . ... .+...++. .+..+.++.... . .. ........ ....+...++++.. T Consensus 394 ~~~~~~~~~~~~~~~~~-~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~ 472 (671) T protein:vir:56 394 SIASTVQKYAIDSVGNV-RQDCVVFVSPPQSLIVNKQAGTAVANIQGWRTGIDPTNGQAVVDNLNVSTTYAVIDGNYKYQ 472 (671) T ss_pred hhHHHHHHHHHHHHHhh-cCCEEEEEeccccccccccccccHHHHHHHhhhccccchhhhhhhccCCcceEEEecCceEE Confidence 1 111 12222223 233333322110 0 00 00000000 00112222322211 Q ss_pred ----CC---c-cHHHHHHHHHHhcCcCcCCc-eeeecccccCc---cc--cccCCHHHHHHHHhCCCeEEEEEeecccee Q lcl|NC_019451. 292 ----AN---D-FVEQCPSEILAATNYDEPGA-SQNYMYYQFPG---RN--ITVSDDTVANTVDKSRGNYIGVTQANGQQL 357 (504) Q Consensus 292 ----~~---~-~~~aa~~g~~as~nf~~~~g-~~T~kfk~l~G---v~--a~~lt~t~~~al~~~~~n~y~~~~~~~~~~ 357 (504) .+ . -|...++|.++.+|.++ | ......|.+.+ +. 
...+++.|.+.|..+|+|....+.+.| T Consensus 473 ~d~~~~~~~~~p~s~~~AGl~Ar~D~~~--g~~~span~~~~~i~g~~~~~~~~~~~~~~~Ln~~gIn~i~~~~~~G--- 547 (671) T protein:vir:56 473 YDKYNDRNRWVPLAGDIAGLCAYTDQVS--QPWMSPAGFNRGQIKGVNRLAVDLRRAHRDALYQIGINPVVGFAGQG--- 547 (671) T ss_pred ecccCCceeEechHHHHHHHHHHhhccC--CcEECcCCceeccccccccceeecChhHHHHHhhCCceEEEEecCCe--- Confidence 11 1 15677778888887443 2 11122444333 32 345788999999999999998887655 Q ss_pred eEEEcCEEeCCcc-ccchhhhhhhHHHHHHHHHHHHHHHHhcCCCCCcCHHHHHHHHHHHHHHHHHHHhcCccccccccC Q lcl|NC_019451. 358 AFYQRGILCGGPT-DAVDMNVYANEIWLKSAIAQALLDLFLNVNAVPASSTGEAMTLAVLQPVLDKATANGTFTYGKEIS 436 (504) Q Consensus 358 ~~~~~G~~~~G~y-~~~wiD~~~~~dwl~~~lq~~l~~l~~~~~kIPyt~~G~~~l~~~v~~vl~~a~~nG~Ia~G~~~~ 436 (504) -.++..+++++.- +|.+|-+.+-.+|+.+.|+..++...-. |.++.=...|+..|+.-|+.-+++|.|. T Consensus 548 ~~~wG~rT~~~~~~~~~~i~vrR~~~~i~~si~~~~~~~v~e----pn~~~~~~~i~~~i~~fL~~l~~~gal~------ 617 (671) T protein:vir:56 548 FVLYGDKTATQQASAFDRINVRRLFNLLKKAISDAAKYRLFE----LNDEFTRSSFKSEIDAYLTNIQDLGGVY------ 617 (671) T ss_pred EEEEcceecCCCCcccceEehhhHHHHHHHHHHHHHHHhcCC----CCCHHHHHHHHHHHHHHHHHHHhCCcee------ Confidence 2457778887763 6888999999999999999988873322 4577778899999999999999999986 Q ss_pred cccceeeccccCCcccccceeecceEEEecchHhCCHHHHhhcccCceEEEEEECCeEEEEEeeeeeC Q lcl|NC_019451. 437 AVQQQYITQVTGDRRAWRQVQTLGYWINITFSSYTNSNTGLTEWKANYTLIYSKGDAIRFVEGSDVMI 504 (504) Q Consensus 437 ~~q~~~i~~~~g~~~~~~~~~~~GY~v~~~~~~~~s~~dr~~R~~~~i~~~~~~aGAIh~v~i~~~~v 504 (504) ||.+.+. .++.+++|+.+-+. .+.+.++-.--+++|+++-.-. 
T Consensus 618 -----------------------g~~v~~d-~~~nt~~~i~~G~~-~~~i~~~p~~Pae~I~~~~~~~ 660 (671) T protein:vir:56 618 -----------------------DFRVVCD-ETNNPGSVIDRNEF-VASIYVKPAKSINFITLNFVAT 660 (671) T ss_pred -----------------------eeEEEEc-CCCCCHHHhhCCeE-EEEEEEEecCCcceEEEEEEEe Confidence 5888887 57888888887776 5888888888899888875544 No 57 >protein:vir:6894 Length: 660 # NCBI annotation: gp18 tail sheath protein # Family: family:all:661 # MgeID: mge:140 # MgeName: RB69 # Cross-refs: genbank:acc:NP_861870;genbank:gi:32453661;genbank:GeneID:1494296 Probab=95.03 E-value=0.0029 Score=34.47 Aligned_cols=458 Identities=12% Similarity=-0.008 Sum_probs=214.5 Q ss_pred CCCccceEEEeeeecccccccccccceEEEecccccCccceEEecCHHHHHHhcCC---CcHHHHHHHHHhccCCCCCcc Q lcl|NC_019451. 1 MISQSRYIRIISGVGAGAPVAGRKLILRVMTTNNVIPPGIVIEFDNANAVLSYFGA---QSEEYQRAAAYFKFISKSVNS 77 (504) Q Consensus 1 mip~s~iv~V~~~v~~~~~~~~~~~~~l~l~~~~~~~~~r~~~y~s~~~V~~~Fg~---~s~ey~aA~~yF~~~~~~~~~ 77 (504) |--++-=|-|.--=.+...........-|++....=|.+....-+|..|..+.||. .+.++.+...+|-+- T Consensus 1 ~~~~~PgVyv~e~~~~~~i~~v~ts~~~fvG~~~~Gp~~~p~~i~s~~~~~~~fG~~~~~~~~~~~~~~~f~~~------ 74 (660) T protein:vir:68 1 MALLSPGVELKETTVQSTVVNNSTGTAALAGKFQWGPAFQIKQITDEVALVDMFGTPNTDTADYFMSAMNFLQY------ 74 (660) T ss_pred CccccCceEEEEecCCcccccCCCcceeEEecccCCCCccCEEecCHHHHHHhcCCccCccchhHHHHHHHHhC------ Confidence 66666555543211233444455678888888777677777777889999999994 466777888888763 Q ss_pred cceEEEEeeeccCCcc--e-----------------eeecccchhhHhHhhccCce-EEEEEcccce------------- Q lcl|NC_019451. 78 PSSISFARWVNTAIAP--M-----------------VVGDNLPKTIADFAGFSAGV-LTIMVGAAEQ------------- 124 (504) Q Consensus 78 P~~l~igr~~~~a~~~--~-----------------l~g~~~~~~~~~~~~~~~g~-~titi~g~~~------------- 124 (504) =+++||-|........ . ..|..+............+. ..+..++... 
T Consensus 75 g~~~~vvRv~~~~~~~~~~~~~~~~~~t~~~~g~~~~~g~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~ta~~~~~a 154 (660) T protein:vir:68 75 GNDLRVVRAVDRDTAKNSSPVAGNINFTISSAGTNYRVGDKVVVKYSTDIIEPDGEVTSVDSDGKILNIFIPSGKIIAKA 154 (660) T ss_pred CCeEEEEEecccccccccccccccceeeeeccCcceeeeeeeeeecccccccccccceeeeecCceeeeeeccccccccc Confidence 3568999985422110 0 00100000000000000000 0000000000 Q ss_pred -eee---------eeccccccchHHHHHHHHhhhh----------cccc-----------------------cccc--ee Q lcl|NC_019451. 125 -NIT---------AIDTSAATSMDNVASIIQTEIR----------KNAD-----------------------PQLA--QA 159 (504) Q Consensus 125 -~~~---------~i~~s~ats~~~vA~~i~~~i~----------a~~~-----------------------~~~~--~a 159 (504) .+. ...+.........+..++.... .... .... .. T Consensus 155 ~~~~~~~~~~~~~~~~v~~~~~~~~~~~~v~~~~~d~~~~~~~~~ta~~~~~~~~~~~~~~~~~~~~~~A~~~g~~G~~i 234 (660) T protein:vir:68 155 KEIGEYPELGSNWTAEMSGSSSGLSAVITIDSVVMDSGILLTEVETSEEAITSLTFQESIKKYGVPGVVALYPGELGDQL 234 (660) T ss_pred eeeccccccccceeEEeecccccceeeeeeccccccccceeeeeccccccccccceeeeecccCccccccccccccccce Confidence 000 0000000000000000000000 0000 0000 00 Q ss_pred EEEEeccc-------ce---eEEecccCcceeEEEEeec------------------------c-c--hhhhhh----hh Q lcl|NC_019451. 160 TVTWNQNT-------NQ---FTLVGATIGTGVLAVAKSA------------------------D-P--QDMSTA----LG 198 (504) Q Consensus 160 ~vt~d~~~-------~~---F~its~t~ga~s~~~~~sa------------------------~-~--~~ia~~----l~ 198 (504) ++...... .. +.......+....+....+ . . ...... .. T Consensus 235 ~v~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 314 (660) T protein:vir:68 235 EIEIVSKADYDKGASAQLKIYPDGGTRYSTAKAIFGYGPQTDDQYAIIVRRNDSVVQSVVLSTKRGERDIYGSNIFIDDF 314 (660) T ss_pred EEEEeccccccccccccceeeecccccccceeeEeecccccccceeeeeecCCcceeeeeeecccccccccccceeeehh Confidence 00000000 00 0000000000000000000 0 0 000000 00 Q ss_pred ccc--Ccce----------------eeccccc------ccHHHHHHHHHhhc-cceeEEEEEecc-CCHHH----HHHHH Q lcl|NC_019451. 
199 WST--SNVV----------------NVAGQAA------DLPDAAVAKSTNVS-NNFGSFLFAGAP-LDNDQ----IKAVS 248 (504) Q Consensus 199 ~t~--~~~~----------------~~~g~aa------et~~~al~~~~~~~-~~wy~~~~~~~~-~~~~~----~~a~A 248 (504) ... ...+ ...|.+. .+...++..+.+.. .....++..... .+.++ +.++. T Consensus 315 ~~~~~~~~v~~~~~~~~~~~~~~~~~~gg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~v~~~l~ 394 (660) T protein:vir:68 315 FAKGASNYIFATAQGWPKGFSGVIKLNGGLSSNETVEAGDLMEAWDLFADRESVNAQLFIAGSCAGESLEVASTVQKHVV 394 (660) T ss_pred hccCcccEEEEeecCCCccccceeeeccccccccccccchhhhHHHHhhhhhccccceeeccccCCCchHHHHHHHHHHH Confidence 000 0000 0111110 11222333332221 122222211111 11222 33455 Q ss_pred HHHhhcCCcEEEEEe------ccccch-hHHHHHHhh------------hcceeEEEEeccC-----CC----ccHHHHH Q lcl|NC_019451. 249 AWNAAQNNQFIYTVA------TSLANL-GTLFTLVNG------------NAGTALNVLSATA-----AN----DFVEQCP 300 (504) Q Consensus 249 ~w~e~~~~~~~~~~~------~~d~~~-~~~~~~~~~------------~~~~~~~~~~~~~-----~~----~~~~aa~ 300 (504) ..++....++.+.-. +..... ......... ...+...++++.. .+ --|...+ T Consensus 395 ~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~sg~~ 474 (660) T protein:vir:68 395 AIGDSRQDCLVLCSPPRAAVVGIPVNRAVDNLVDWRTASGTYTDNNFNISSTYAAIDGNYKYQYDKYNDVNRWVPLAADI 474 (660) T ss_pred HHHHhhCCeEEEEcccceeEecCCCCCCHHHHHHHHhhcccccccccccCcceEEEEcCceEEecccCCceEEechhHHH Confidence 666665544432210 111111 111111111 0112223332210 11 1245777 Q ss_pred HHHHHhcCcCcCCc-eeeecccccCccc-----cccCCHHHHHHHHhCCCeEEEEEeeccceeeEEEcCEEeCCcc-ccc Q lcl|NC_019451. 301 SEILAATNYDEPGA-SQNYMYYQFPGRN-----ITVSDDTVANTVDKSRGNYIGVTQANGQQLAFYQRGILCGGPT-DAV 373 (504) Q Consensus 301 ~g~~as~nf~~~~g-~~T~kfk~l~Gv~-----a~~lt~t~~~al~~~~~n~y~~~~~~~~~~~~~~~G~~~~G~y-~~~ 373 (504) +|.++.+|-++ | ......|.+.||. ...+++.|.+.|..+++|+...+.+.|- .++..+++++.- +|. 
T Consensus 475 AGl~Ar~d~~~--g~~~span~~~~~i~g~~~~~~~~~~~~~~~Ln~~gIn~i~~~~~~G~---~~wG~rT~~~~~s~~~ 549 (660) T protein:vir:68 475 AGLCARTDNIS--QPWMSPAGYNRGQILNVIKLAIETRQAQRDRLYQEAINPVTGTGGDGY---VLYGDKTATSVPSPFD 549 (660) T ss_pred HHHHHHHhccC--CcEEccCCeeeceeeccceeeecCChhHHHHHhhCCceEEEEecCCeE---EEEcceecCCCCcccc Confidence 78888887433 3 1222355544442 1246899999999999999988876552 457888888763 678 Q ss_pred hhhhhhhHHHHHHHHHHHHHHHHhcCCCCCcCHHHHHHHHHHHHHHHHHHHhcCccccccccCcccceeeccccCCcccc Q lcl|NC_019451. 374 DMNVYANEIWLKSAIAQALLDLFLNVNAVPASSTGEAMTLAVLQPVLDKATANGTFTYGKEISAVQQQYITQVTGDRRAW 453 (504) Q Consensus 374 wiD~~~~~dwl~~~lq~~l~~l~~~~~kIPyt~~G~~~l~~~v~~vl~~a~~nG~Ia~G~~~~~~q~~~i~~~~g~~~~~ 453 (504) +|-+.+-.+|+++.|+..++-..-. |.++.=...|+..|+.-|+.-+++|.|. T Consensus 550 ~i~vrR~~~~i~~si~~~~~~~v~e----pn~~~~~~~i~~~i~~~L~~l~~~gal~----------------------- 602 (660) T protein:vir:68 550 RINVRRLFNMVKTNIGSASKYRLFE----LNNAFTRSSFRTETSQYLQGIKALGGVY----------------------- 602 (660) T ss_pred eEehhhHHHHHHHHHHHHHHHhccC----CCCHHHHHHHHHHHHHHHHHHHhcCcee----------------------- Confidence 8999999999999999888874322 4577778899999999999999999996 Q ss_pred cceeecceEEEecchHhCCHHHHhhcccCceEEEEEECCeEEEEEeeeeeC Q lcl|NC_019451. 454 RQVQTLGYWINITFSSYTNSNTGLTEWKANYTLIYSKGDAIRFVEGSDVMI 504 (504) Q Consensus 454 ~~~~~~GY~v~~~~~~~~s~~dr~~R~~~~i~~~~~~aGAIh~v~i~~~~v 504 (504) ||.|.+ +.+..+++|+.+.+. .+.+.++-.-.+++|+++-... 
T Consensus 603 ------gf~V~~-d~~~nt~~~i~~G~~-~~~i~~~p~~pae~i~l~~~~~ 645 (660) T protein:vir:68 603 ------NFKVVC-DTTNNTPAVIDRNEF-VATFYLQPARSINYITLNFVAT 645 (660) T ss_pred ------eeEEEE-ecCCCCHHHhhCCeE-EEEEEEEecCCcceEEEEEEEe Confidence 588887 467888888888777 5999999999999999886666 No 58 >protein:vir:98263 Length: 664 # NCBI annotation: gp18 tail sheath monomer # Family: family:all:661 # MgeID: mge:1667 # MgeName: RB43 # Cross-refs: genbank:acc:YP_239196;genbank:gi:66391671;genbank:GeneID:3416365 Probab=94.87 E-value=0.0033 Score=34.18 Aligned_cols=457 Identities=13% Similarity=0.066 Sum_probs=218.6 Q ss_pred CCCccceEEEee-eecccccccccccceEEEecccccCccceEEecCHHHHHHhcCC---CcHHHHHHHHHhccCCCCCc Q lcl|NC_019451. 1 MISQSRYIRIIS-GVGAGAPVAGRKLILRVMTTNNVIPPGIVIEFDNANAVLSYFGA---QSEEYQRAAAYFKFISKSVN 76 (504) Q Consensus 1 mip~s~iv~V~~-~v~~~~~~~~~~~~~l~l~~~~~~~~~r~~~y~s~~~V~~~Fg~---~s~ey~aA~~yF~~~~~~~~ 76 (504) |.-+|-=|-|.- + .+.......-....|++.-..=|++.....+|..|..+-||. .+.++.+...||-+- T Consensus 1 ma~~~PgVyv~E~~-~~~~i~~~~ts~~~~vG~~~~Gp~~~p~~i~s~~d~~~~fG~~~~~~~~~~~v~~~f~ng----- 74 (664) T protein:vir:98 1 MALQSPGIETKETS-VQSTVVRNSTGRAAIVGKFSWGPAYQIRQISNEVELVNYFGAPDNLTADYFMSAVNFLQY----- 74 (664) T ss_pred CceecCceEEEecC-CCcccccccccceEEEeeccCCCCCccEEecCHHHHHHhcCCccccchhHHHHHHHHHhc----- Confidence 888887776652 3 345555556778888888777778777888889999999994 366788888888864 Q ss_pred ccceEEEEeeeccCC---ccee----------------eecccch----h--hHhHhh---ccCc-eEEEEE-cccceee Q lcl|NC_019451. 77 SPSSISFARWVNTAI---APMV----------------VGDNLPK----T--IADFAG---FSAG-VLTIMV-GAAEQNI 126 (504) Q Consensus 77 ~P~~l~igr~~~~a~---~~~l----------------~g~~~~~----~--~~~~~~---~~~g-~~titi-~g~~~~~ 126 (504) =+++|+-|...... +..+ .|..+.. . ...+.. 
-..| .+.+.+ ++....+ T Consensus 75 -g~~~~vvRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~gn~~~v~i~~~~~~~~ 153 (664) T protein:vir:98 75 -GNDLRLVRVVDKDAAKNASAIFNQIKTTIASQGSNYNVGDVIKVKYSNTVVEENGKVTQVDSNGKILAVTIPKRKKSLL 153 (664) T ss_pred -CCeEEEEEecCccccccccccccccceeeccCCcccccccceeEeecCcccccceeeccCCCCCceeeEeeccCcccee Confidence 45699999853210 1100 0000000 0 000000 0000 111111 0000000 Q ss_pred e--------------eecc-------ccccc----hH---------HHH-HHHHhh-------------hh--------- Q lcl|NC_019451. 127 T--------------AIDT-------SAATS----MD---------NVA-SIIQTE-------------IR--------- 149 (504) Q Consensus 127 ~--------------~i~~-------s~ats----~~---------~vA-~~i~~~-------------i~--------- 149 (504) . ..++ +.... .. +.+ ..+... +. T Consensus 154 ~~~~~~~~~~~~~~~~~s~~~~s~g~a~a~~v~~v~~d~~~~~~~~~~a~~~i~~~~~~~~~~~~~~~~~~a~~~G~~Gn 233 (664) T protein:vir:98 154 VLNRSVLTQIFLLVGTTEIVSQSSGVSASITIDGIESDSGITLLNLDIAKETIQGTSFQTLTQKYQIPSVVALYPGELGS 233 (664) T ss_pred ecccccccccceecccceeeeeecccceeeecccccccceeeccccceeeeccccccceeeeeccccceeeeeecccccc Confidence 0 0000 00000 00 000 000000 00 Q ss_pred -------cccc----------------cccceeEEEEec-ccceeEEecccCcceeEEE--Eeec--------------- Q lcl|NC_019451. 150 -------KNAD----------------PQLAQATVTWNQ-NTNQFTLVGATIGTGVLAV--AKSA--------------- 188 (504) Q Consensus 150 -------a~~~----------------~~~~~a~vt~d~-~~~~F~its~t~ga~s~~~--~~sa--------------- 188 (504) .... .......+.+.. ..+.|.++-...+...... .... T Consensus 234 ~isv~i~s~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~ 313 (664) T protein:vir:98 234 TVQVEIISKAAYDTGAMISGYPSGISVKNSGRSVMTYGPQTDNQYAFVVRRGGIVQESFIVSTDKTDKDIYGVNIYMDDF 313 (664) T ss_pred eeeeeecccccccCcceEeeccCceecccceeeeeeccccCccceeEEEecCCceeeeEEeecccCcccceeeeeechhh Confidence 0000 000000000000 0011111000000000000 0000 Q ss_pred --cchh---hhhhhhcccC--ccee-ecccc------cccHHHHHHHHHhhcc-ceeEEEEEeccC--CHHH----HHHH Q lcl|NC_019451. 
189 --DPQD---MSTALGWSTS--NVVN-VAGQA------ADLPDAAVAKSTNVSN-NFGSFLFAGAPL--DNDQ----IKAV 247 (504) Q Consensus 189 --~~~~---ia~~l~~t~~--~~~~-~~g~a------aet~~~al~~~~~~~~-~wy~~~~~~~~~--~~~~----~~a~ 247 (504) .... .+........ .... ..|.+ .+...+++.++.+... +- .++++.... ..+. ..++ T Consensus 314 ~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~~~~~~g~~~~~tgl~~l~~~~~~~~-~ll~~p~~~~~~~~~~~~v~~al 392 (664) T protein:vir:98 314 FANGGSQYVFGTSMNWPKGFSGILEFGGGLSSNDTVGADELMTGWDMFADREALHV-PLLIAGGCAGESVEIASTVQKHV 392 (664) T ss_pred eecccceeeeeecccCCcccceeEeccCccccccccCchhHHHHHHhhhccccccc-ceEEecCCCCCcHHHHHHHHHHH Confidence 0000 0000000000 0000 00111 1223345555554321 11 233332211 1222 3344 Q ss_pred HHHHhhcCCcEEEEEe------ccccc-hhHHHHHH------------h----hhcceeEEEEecc-----CCC----cc Q lcl|NC_019451. 248 SAWNAAQNNQFIYTVA------TSLAN-LGTLFTLV------------N----GNAGTALNVLSAT-----AAN----DF 295 (504) Q Consensus 248 A~w~e~~~~~~~~~~~------~~d~~-~~~~~~~~------------~----~~~~~~~~~~~~~-----~~~----~~ 295 (504) ...++....+|.+.-. +.... ........ . ....+...++++. ..+ .- T Consensus 393 ~~~a~~~~~~~a~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~l~~p~~~~~d~~~~~~~~~p 472 (664) T protein:vir:98 393 ISIGDERQDCTVFVSPPRSLLVNIPLATAVDNIVEWRTGYKISGGTPVDNNLNVSSSYGFLDGNYKYQYDKYNDVNRWVP 472 (664) T ss_pred HHHHHhcCCeEEEEccccceeccCCccccHHHHHHHhhhccccccchhhhhcCCccceEEEEcCeEEEecccCCceEEec Confidence 4555554444332210 00000 00000000 0 0011222232211 011 12 Q ss_pred HHHHHHHHHHhcCcCcCCc-eeeecccccCccc-----cccCCHHHHHHHHhCCCeEEEEEeeccceeeEEEcCEEeCCc Q lcl|NC_019451. 296 VEQCPSEILAATNYDEPGA-SQNYMYYQFPGRN-----ITVSDDTVANTVDKSRGNYIGVTQANGQQLAFYQRGILCGGP 369 (504) Q Consensus 296 ~~aa~~g~~as~nf~~~~g-~~T~kfk~l~Gv~-----a~~lt~t~~~al~~~~~n~y~~~~~~~~~~~~~~~G~~~~G~ 369 (504) |.+.++|.++.+|.++ | ......|.+.||. ...+++.|.+.|..+|+|....+-+. +.+ .++..+++++. 
T Consensus 473 ~sg~~AGl~A~~D~~~--g~~~span~~~~~i~g~~~~~~~~~~~~~~~Ln~~gIn~i~~~~~~-~G~-~~wG~rT~~~~ 548 (664) T protein:vir:98 473 LAGDIAGLCVYTDSVA--NPWMSPAGYNRGQIRNCIKLAIEPRTAHRDAMYQVQINPVTGFAGG-SGF-VLYGDKTLTSV 548 (664) T ss_pred hHHHHHHHHHHhhhcC--CcEECcCCceeeeeeccccceeecChhhHHHHHhCCCeEEEEeeCC-CcE-EEEcccccCCC Confidence 5667788888887433 3 2222334444442 24578899999999999999877542 222 35677877765 Q ss_pred c-ccchhhhhhhHHHHHHHHHHHHHHHHhcCCCCCcCHHHHHHHHHHHHHHHHHHHhcCccccccccCcccceeeccccC Q lcl|NC_019451. 370 T-DAVDMNVYANEIWLKSAIAQALLDLFLNVNAVPASSTGEAMTLAVLQPVLDKATANGTFTYGKEISAVQQQYITQVTG 448 (504) Q Consensus 370 y-~~~wiD~~~~~dwl~~~lq~~l~~l~~~~~kIPyt~~G~~~l~~~v~~vl~~a~~nG~Ia~G~~~~~~q~~~i~~~~g 448 (504) - +|.||-+.+-.+|+++.|+..++...-. |.++.=...|+..|+.-|+.-+++|.|. T Consensus 549 ~s~~~~i~vrR~~~~i~~si~~~~~~~v~e----pn~~~l~~~i~~~i~~~L~~l~~~gal~------------------ 606 (664) T protein:vir:98 549 PSPFDRINVRRLFNMIKKDIGDNAKYKLFE----NNDDFTRASFRMDTGQYMTNIRALGGCY------------------ 606 (664) T ss_pred CcccceEeehhHHHHHHHHHHHHHHHhhcC----CCCHHHHHHHHHHHHHHHHHHHhcCcee------------------ Confidence 3 6788999999999999999888774332 5688888999999999999999999885 Q ss_pred CcccccceeecceEEEecchHhCCHHHHhhcccCceEEEEEECCeEEEEEeeeeeC Q lcl|NC_019451. 449 DRRAWRQVQTLGYWINITFSSYTNSNTGLTEWKANYTLIYSKGDAIRFVEGSDVMI 504 (504) Q Consensus 449 ~~~~~~~~~~~GY~v~~~~~~~~s~~dr~~R~~~~i~~~~~~aGAIh~v~i~~~~v 504 (504) ||+|.++ .++.+++|..+-+. .+.+.++-.-.+++|+++-... 
T Consensus 607 -----------g~~V~~d-~~~nt~~~i~~G~~-~~~i~~~p~~pae~I~~~~~q~ 649 (664) T protein:vir:98 607 -----------DYRVICD-TTNNTPDVIDRNEF-VATVYVKPPRSINYITLNFVAT 649 (664) T ss_pred -----------eeEEEEc-CCCCCHHHhhCCeE-EEEEEEEecCCcceEEEEEEEe Confidence 5899987 57888888887777 5899999999999998885554 No 59 >protein:vir:106427 Length: 679 # NCBI annotation: gp18 tail sheath protein # Family: family:all:661 # MgeID: mge:1474 # MgeName: Aeh1 # Cross-refs: genbank:acc:NP_944106;genbank:gi:38640150;genbank:GeneID:2658188 Probab=94.13 E-value=0.0053 Score=33.05 Aligned_cols=443 Identities=10% Similarity=-0.008 Sum_probs=163.5 Q ss_pred CCCccceEEEeeeecccccccccccceEEE--ec----------ccccC-ccceE-EecCHH-HHHHhcCCCcHHHHHHH Q lcl|NC_019451. 1 MISQSRYIRIISGVGAGAPVAGRKLILRVM--TT----------NNVIP-PGIVI-EFDNAN-AVLSYFGAQSEEYQRAA 65 (504) Q Consensus 1 mip~s~iv~V~~~v~~~~~~~~~~~~~l~l--~~----------~~~~~-~~r~~-~y~s~~-~V~~~Fg~~s~ey~aA~ 65 (504) -+|.++.+.........+... .....-+. +. ....+ .+... ....+. .+...-+.. .....+. T Consensus 143 ~v~~~~~~~~a~~~~~~~~l~-~a~~~~~~~~t~~~g~~~~~~v~~v~~~~~~~~~~~~~a~~~i~~~~~~~-~t~~~~~ 220 (679) T protein:vir:10 143 YVPTAAIIDKAKSLNDYPALD-NAWQIQFAAGGPGAGQAATATVVGINLDSTIFVPNDEYAMSAISERSETK-RTFIDIC 220 (679) T ss_pred eecccccccccccccccceec-ccceeeeeeccccccceeeeeeeeeccCCceeeccccccccccccccccc-hhhhhhh Confidence 112222111100000000000 00000000 00 00000 00000 000000 000000000 0000000 Q ss_pred HHhccCCCCCcccceEEEEeeeccCCcceeeecccchhhHhHhhccCceEEEEEcccceeeeeeccccccchHHHHHHHH Q lcl|NC_019451. 66 AYFKFISKSVNSPSSISFARWVNTAIAPMVVGDNLPKTIADFAGFSAGVLTIMVGAAEQNITAIDTSAATSMDNVASIIQ 145 (504) Q Consensus 66 ~yF~~~~~~~~~P~~l~igr~~~~a~~~~l~g~~~~~~~~~~~~~~~g~~titi~g~~~~~~~i~~s~ats~~~vA~~i~ 145 (504) . ....+ ...-..-|.+..... ... +.. ..+....++.-.+.++....... .. ......+.. . 
T Consensus 221 ~-~~~~~----~~~A~~~g~~gn~i~-v~~----va~--~~~~~~~~~~a~v~~~~~~~~~~---~~-~~~~~~~~~--~ 282 (679) T protein:vir:10 221 E-EMKVP----AIVARYAGTYGDNIK-VLM----IAY--KDYYKFNEAGKIVSVNTINPKVF---PT-GLDYGNVTP--S 282 (679) T ss_pred h-ccccc----eeeeecccccCCcce-EEE----Eee--ccccccccccccccccccccccc---cc-cccccccee--e Confidence 0 00000 000011111100000 000 000 00000000000000000000000 00 000000000 0 Q ss_pred hhhhcccccc-cceeEEEEe-cccceeEEecccC-----------------cceeEEEEeec--cchhhhhhhhcccCcc Q lcl|NC_019451. 146 TEIRKNADPQ-LAQATVTWN-QNTNQFTLVGATI-----------------GTGVLAVAKSA--DPQDMSTALGWSTSNV 204 (504) Q Consensus 146 ~~i~a~~~~~-~~~a~vt~d-~~~~~F~its~t~-----------------ga~s~~~~~sa--~~~~ia~~l~~t~~~~ 204 (504) ..+....... .....+.-+ .....|.++.... +....+ .... .+...+....+..+. T Consensus 283 ~~~~~~~~~~~~~~vvv~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v-~~~~~~~~~~~~~~~~~~gg~- 360 (679) T protein:vir:10 283 SYLEFGPQNESQFAFIVFNNGVAVESKILSTKPGDRDIYGTSIYINEYFGNGYSSFV-QGVAESWPVGYTGVLAFGGGQ- 360 (679) T ss_pred eecccccccccceeeEEecccccccceeeecccccccccchhhhhhhhhcCccccee-eeccccccccccceeeccCCc- Confidence 0000000000 000000000 0001111111000 000000 0000 000000000000000 Q ss_pred eeecccccccHHHHHHHHHhhccceeEEEEEeccC--C----HHHHHHHHHHHhhcCCcEEEEEec-----cccchh--H Q lcl|NC_019451. 205 VNVAGQAADLPDAAVAKSTNVSNNFGSFLFAGAPL--D----NDQIKAVSAWNAAQNNQFIYTVAT-----SLANLG--T 271 (504) Q Consensus 205 ~~~~g~aaet~~~al~~~~~~~~~wy~~~~~~~~~--~----~~~~~a~A~w~e~~~~~~~~~~~~-----~d~~~~--~ 271 (504) -.............+..+......--.++++.... . ..-..++...++....+|.+.-.- .++... . 
T Consensus 361 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~p~~~~~~~~~~~~v~~~l~~~~~~~~~~~ai~d~p~~~~~~~~~~~~~~ 440 (679) T protein:vir:10 361 SSNTDISAAEFMKGWDMFADREHTDVNLFIAGAVAGEGAQIASTVQKAVVAIADERRDCLVLISPPREYMINQPAASVVR 440 (679) T ss_pred cCCCccchhhhhhhhhhhhcccccccceEEecCCCCCchhhhHHHHHHHHHHHHhhCCeEEEEeccccccccccccchHH Confidence 00000011112222222221111111223322211 1 122345556666655554433110 000000 1 Q ss_pred HHHHHhh---------------hcceeEEEEeccC-----CC----ccHHHHHHHHHHhcCcCcCCceeeecccccCccc Q lcl|NC_019451. 272 LFTLVNG---------------NAGTALNVLSATA-----AN----DFVEQCPSEILAATNYDEPGASQNYMYYQFPGRN 327 (504) Q Consensus 272 ~~~~~~~---------------~~~~~~~~~~~~~-----~~----~~~~aa~~g~~as~nf~~~~g~~T~kfk~l~Gv~ 327 (504) ....... ...+...++++.. .+ .-|.+.+.|.++.+|.++. =......|.+.||. T Consensus 441 ~~~~~r~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~sg~vAGl~Ar~D~~~g-~~~sPan~~~~~i~ 519 (679) T protein:vir:10 441 KLVDWRRGVNQAGISLDDNMNIGTTYASVDGNYKYQYDKYNDVNRWIPLAADIAGLCARTDTVGQ-PWQSPAGFNRGQIV 519 (679) T ss_pred HHHHHHhhcccccchhhhhhccCcceEEEEccceeeecccCCceEEechHHHHHHHHHHhhccCC-cEECcCCeeecccc Confidence 1100000 0112222222211 11 1235667788888874331 12223344544442 Q ss_pred -----cccCCHHHHHHHHhCCCeEEEEEeeccceeeEEEcCEEeCCcc-ccchhhhhhhHHHHHHHHHHHHHHHHhcCCC Q lcl|NC_019451. 328 -----ITVSDDTVANTVDKSRGNYIGVTQANGQQLAFYQRGILCGGPT-DAVDMNVYANEIWLKSAIAQALLDLFLNVNA 401 (504) Q Consensus 328 -----a~~lt~t~~~al~~~~~n~y~~~~~~~~~~~~~~~G~~~~G~y-~~~wiD~~~~~dwl~~~lq~~l~~l~~~~~k 401 (504) .-.+++.|.+.|..+++|....+.+.|. .++..+++++.- +|.+|-+.+-.+|+++.|+.......-. 
T Consensus 520 g~~~~~~~~~~~~~~~Ln~~gin~i~~~~g~G~---~~wG~rT~~~~~s~~~~i~vrR~~~~i~~si~~~~~~~v~e--- 593 (679) T protein:vir:10 520 NVIKLAVDTRQAHRDEMYTNGINPIVGFAGQGY---ILYGDKTASQAPTPFDRINVRRLFNLLKKSISESAKYKLFE--- 593 (679) T ss_pred ccccceeecChhhHHhhhhCCceEEEEecCCeE---EEEcccccCCCCcccceEehhhHHHHHHHHHHHHHHHhccC--- Confidence 2356899999999999999988876653 457888887753 6788899999999999999888774332 Q ss_pred CCcCHHHHHHHHHHHHHHHHHHHhcCccccccccCcccceeeccccCCcccccceeecceEEEecchHhCCHHHHhhccc Q lcl|NC_019451. 402 VPASSTGEAMTLAVLQPVLDKATANGTFTYGKEISAVQQQYITQVTGDRRAWRQVQTLGYWINITFSSYTNSNTGLTEWK 481 (504) Q Consensus 402 IPyt~~G~~~l~~~v~~vl~~a~~nG~Ia~G~~~~~~q~~~i~~~~g~~~~~~~~~~~GY~v~~~~~~~~s~~dr~~R~~ 481 (504) |.|+.=...|+..|+.-|++-+++|.|. ||.+.+. .++.+++|+.+-+. T Consensus 594 -pn~~~~~~~i~~~i~~fL~~l~~~gal~-----------------------------gf~v~~d-~~~nt~~~i~~G~~ 642 (679) T protein:vir:10 594 -LNDAFTRSSFRSEVGSYLDTIRSLGGIY-----------------------------DFRVVCD-ESNNTPAVIDRNEF 642 (679) T ss_pred -CCCHHHHHHHHHHHHHHHHHHHhCCcee-----------------------------eeEEEEc-CCCCCHHHhhCCeE Confidence 4678888999999999999999999986 5899987 57888888887777 Q ss_pred CceEEEEEECCeEEEEEeeeeeC Q lcl|NC_019451. 482 ANYTLIYSKGDAIRFVEGSDVMI 504 (504) Q Consensus 482 ~~i~~~~~~aGAIh~v~i~~~~v 504 (504) .+.+.++-.-.+++|+++-+.. T Consensus 643 -~~~i~~~p~~pae~i~~~~~~~ 664 (679) T protein:vir:10 643 -VATILIKPARSINYITLSFVAT 664 (679) T ss_pred -EEEEEEEecCCccEEEEEEEEe Confidence 5899999999999998875554 No 60 >protein:vir:104477 Length: 749 # NCBI annotation: gp18 # Family: family:all:661 # MgeID: mge:1548 # MgeName: P-SSM4 # Cross-refs: genbank:acc:YP_214663;genbank:gi:61806304;genbank:GeneID:3294532 Probab=93.95 E-value=0.0059 Score=32.82 Aligned_cols=403 Identities=9% Similarity=0.019 Sum_probs=170.3 Q ss_pred CCCccceEEE---------------eeeecccccccc---------cccceEEEeccccc--Cccc-eEEecCHHHH-HH Q lcl|NC_019451. 
1 MISQSRYIRI---------------ISGVGAGAPVAG---------RKLILRVMTTNNVI--PPGI-VIEFDNANAV-LS 52 (504) Q Consensus 1 mip~s~iv~V---------------~~~v~~~~~~~~---------~~~~~l~l~~~~~~--~~~r-~~~y~s~~~V-~~ 52 (504) |+.-+.-+.. .+.+.++.+... .....++......+ ..+. +..|..++.- .. T Consensus 276 ~~g~~~~it~v~~~~~~~~~~t~~~~~~~a~~~gt~~~~~~~~g~~D~~~v~v~~~~g~~~~~~g~v~e~~~~~~~~~~~ 355 (749) T protein:vir:10 276 TNSTNITITSVRDEYTEREYLPGVKWINVAPRPGTSLYANGVGGHRDEMHVILVDIDGGVTGTVGALLERYIDVSKASDA 355 (749) T ss_pred CccceeEEEeeeccccccccccceeeccccccccceeeeecccCCCCceEEEEecCCCeeeecccceeeeeeeccccccc Confidence 1111110000 001111111100 00001111111100 0011 1222222110 11 Q ss_pred hcCCCcHHHHHHHHHhccCCCCCcccceEEEEeeeccCCcceeeecccchhhHhHhhccCceEEEEEcccceeeeeeccc Q lcl|NC_019451. 53 YFGAQSEEYQRAAAYFKFISKSVNSPSSISFARWVNTAIAPMVVGDNLPKTIADFAGFSAGVLTIMVGAAEQNITAIDTS 132 (504) Q Consensus 53 ~Fg~~s~ey~aA~~yF~~~~~~~~~P~~l~igr~~~~a~~~~l~g~~~~~~~~~~~~~~~g~~titi~g~~~~~~~i~~s 132 (504) -+....+.|- ...+.+ ....++++.-.... ... ... ..++.......+...... .+ T Consensus 356 ~~~~~~~~~~--~~~~~~------~s~~v~~~~~~~~~---~~~--~~~--------~~~~~~~~~~~~~~~~~~---~~ 411 (749) T protein:vir:10 356 KTSVGETNYY--AEVIKQ------KSEFIYWAEHESTL---YAA--TSS--------ASDGLFGQTAANRQFNLF---RS 411 (749) T ss_pred cccccccchh--hhhhcc------CCCEEEEEeccccc---ccc--ccc--------ccccccccccccceeecc---cc Confidence 1222333332 222222 12233333221100 000 000 000100000000000000 00 Q ss_pred cccchHHHHHHHHhhhhcccccccceeEEEEecccceeEEecccCcceeEEEEeeccchhhhhhhhcccCcceeeccccc Q lcl|NC_019451. 133 AATSMDNVASIIQTEIRKNADPQLAQATVTWNQNTNQFTLVGATIGTGVLAVAKSADPQDMSTALGWSTSNVVNVAGQAA 212 (504) Q Consensus 133 ~ats~~~vA~~i~~~i~a~~~~~~~~a~vt~d~~~~~F~its~t~ga~s~~~~~sa~~~~ia~~l~~t~~~~~~~~g~aa 212 (504) .. .+..+-.....+. ......+......+.+.... .+. ..... 
T Consensus 412 ~~------------------------~~~~~~~~~~~~~-----~~~~~~~~~~~~gg~d~~~~----~~~----~~~~~ 454 (749) T protein:vir:10 412 AA------------------------GSVDYPAGVTTLG-----SKNNATYYYRLSGGVNYTVS----AGQ----YTITN 454 (749) T ss_pred cc------------------------ccceecccccccc-----ccCCcEEEEEccCCcccccc----ccc----ccccc Confidence 00 0000000000000 00000111111111110000 000 00011 Q ss_pred ccHHHHHHHHHhhc-cceeEEEEEeccCCH----HHHHHHHHHHhhcCCcEEEEEecccc-------chhHHHHHHh--h Q lcl|NC_019451. 213 DLPDAAVAKSTNVS-NNFGSFLFAGAPLDN----DQIKAVSAWNAAQNNQFIYTVATSLA-------NLGTLFTLVN--G 278 (504) Q Consensus 213 et~~~al~~~~~~~-~~wy~~~~~~~~~~~----~~~~a~A~w~e~~~~~~~~~~~~~d~-------~~~~~~~~~~--~ 278 (504) .....+++.+.+.. ..+-.++......++ ....++...+|....++.+.-...+. .......... . T Consensus 455 ~~~~~~~~~l~~~~~~~~~~li~~~~~~~~~~~~~v~~al~~~~~~~~~~~~~~d~p~~~~~~~~~~~~~~~~~~~~~~~ 534 (749) T protein:vir:10 455 TDIGSAYELIGDPESQIVDFIISGPSGTSDANALAKITSLVNIAEERRDCMVFVSPRRGNVIGISNTTTITTNIVDFFKK 534 (749) T ss_pred hhHHHHHHHhhhhhhcccceEEEecCCCCcchhHHHHHHHHHHHhhcCCEEEEEcCCCCcccccccchhhhhHHHHHHhh Confidence 22334444444322 222222222222222 23455666666655543333211100 0001111100 1 Q ss_pred --hcceeEEEEeccC-----CC----ccHHHHHHHHHHhcCcCcCCc-eeeecccccC---ccc--cccCCHHHHHHHHh Q lcl|NC_019451. 279 --NAGTALNVLSATA-----AN----DFVEQCPSEILAATNYDEPGA-SQNYMYYQFP---GRN--ITVSDDTVANTVDK 341 (504) Q Consensus 279 --~~~~~~~~~~~~~-----~~----~~~~aa~~g~~as~nf~~~~g-~~T~kfk~l~---Gv~--a~~lt~t~~~al~~ 341 (504) ...+...++++.. .+ .-|.+.++|.++.+|.++ | ......|++. |+. ...+++.|.+.|.. 
T Consensus 535 ~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~vAGl~Ar~D~~~--g~~~SPan~~~~~i~g~~~~~~~~~~~e~~~Ln~ 612 (749) T protein:vir:10 535 LPSSSYMVFDSGYKYIYDKYNDVYRYIPCNGDTAGLCLQTNEIS--EPWFSPAGFQRGVLRNAIKLAYTPNKAQRDQLYA 612 (749) T ss_pred ccCceeEEEEccceeeeccccCceEEechHHHHHHHHHHhhccC--CcEECcCCceeeeeeccccceeecChhHHHhhhh Confidence 1112222222211 11 124567778888887543 3 1112255433 332 34568999999999 Q ss_pred CCCeEEEEEeeccceeeEEEcCEEe-CCccccchhhhhhhHHHHHHHHHHHHHHHHhcCCCCCcCHHHHHHHHHHHHHHH Q lcl|NC_019451. 342 SRGNYIGVTQANGQQLAFYQRGILC-GGPTDAVDMNVYANEIWLKSAIAQALLDLFLNVNAVPASSTGEAMTLAVLQPVL 420 (504) Q Consensus 342 ~~~n~y~~~~~~~~~~~~~~~G~~~-~G~y~~~wiD~~~~~dwl~~~lq~~l~~l~~~~~kIPyt~~G~~~l~~~v~~vl 420 (504) +|+|....+.+.| + .++.++++ +..-.|.+|-+.+-.+|+++.|+..+...+-. |.++.=...|+..|+.-| T Consensus 613 ~gIn~i~~~~g~G--~-~~wG~rT~~s~d~~~~~i~vRRl~~~ie~si~~~~~~~v~e----pn~~~l~~~i~~~i~~fL 685 (749) T protein:vir:10 613 NRVNPIVSFPGQG--V-VLYGDKTALGFASAFDRINIRRLFLTVERVISTAAKAQLFE----QNDEAQRSLFINIVEPYL 685 (749) T ss_pred CCceEEEEecCCe--E-EEEcceecCCCCcccceeehhhhHHHHHHHHHHHHHHhhcC----CCCHHHHHHHHHHHHHHH Confidence 9999998887655 2 34677775 44546888999999999999999888774322 468888899999999999 Q ss_pred HHHHhcCccccccccCcccceeeccccCCcccccceeecceEEEecchHhCCHHHHhhcccCceEEEEEECCeEEEEEee Q lcl|NC_019451. 421 DKATANGTFTYGKEISAVQQQYITQVTGDRRAWRQVQTLGYWINITFSSYTNSNTGLTEWKANYTLIYSKGDAIRFVEGS 500 (504) Q Consensus 421 ~~a~~nG~Ia~G~~~~~~q~~~i~~~~g~~~~~~~~~~~GY~v~~~~~~~~s~~dr~~R~~~~i~~~~~~aGAIh~v~i~ 500 (504) +.-+++|.|. ||.|.++ .+..+++|..+.+. .+.+.++---.++.|+++ T Consensus 686 ~~l~~~G~i~-----------------------------~f~V~~d-~~~Nt~~~i~~G~~-~~~i~~~P~~pae~I~~~ 734 (749) T protein:vir:10 686 RDVQGRRGVV-----------------------------DFLVKCD-STNNTPEAVDRGEF-YAEVFLKPTRTINYVQLT 734 (749) T ss_pred HHHHhcCCee-----------------------------eeEEEEc-CCCCCHHHhhCCEE-EEEEEEEecCCccEEEEE Confidence 9999888773 5889887 57788888887777 589999999999998887 Q ss_pred eeeC Q lcl|NC_019451. 501 DVMI 504 (504) Q Consensus 501 ~~~v 504 (504) -.-. 
T Consensus 735 ~~~~ 738 (749) T protein:vir:10 735 FVAT 738 (749) T ss_pred EEEe Confidence 5443 No 61 >protein:vir:103456 Length: 659 # NCBI annotation: tail sheath monomer # Family: family:all:661 # MgeID: mge:1542 # MgeName: RB32 # Cross-refs: genbank:acc:YP_803108;genbank:gi:116326388;genbank:GeneID:4405485 Probab=93.88 E-value=0.0061 Score=32.74 Aligned_cols=457 Identities=12% Similarity=0.018 Sum_probs=208.6 Q ss_pred CCCccceEEEeeeecccccccccccceEEEecccccCccceEEecCHHHHHHhcCCC---cHHHHHHHHHhccCCCCCcc Q lcl|NC_019451. 1 MISQSRYIRIISGVGAGAPVAGRKLILRVMTTNNVIPPGIVIEFDNANAVLSYFGAQ---SEEYQRAAAYFKFISKSVNS 77 (504) Q Consensus 1 mip~s~iv~V~~~v~~~~~~~~~~~~~l~l~~~~~~~~~r~~~y~s~~~V~~~Fg~~---s~ey~aA~~yF~~~~~~~~~ 77 (504) |-=++-=|-|.--=.+...........-|++....=|++.....+|..|..+.||.- +.++.+.+.||-+- T Consensus 1 ~~~~~PgVyv~e~~~~~~~~~~~ts~~~fvG~~~~Gp~~~p~~i~s~~d~~~~fG~~~~~~~~~~~v~~~f~ng------ 74 (659) T protein:vir:10 1 MTLLSPGIELKETTVQSTVVNNSTGTAALAGKFQWGPAFQIKQVTNEVDLVNTFGQPTAETADYFMSAMNFLQY------ 74 (659) T ss_pred CceecCceEEEEecCCceecccCccceEEEecccCCCCCccEEecCHHHHHHHcCCcCCCcchhHHHHHHHhhC------ Confidence 544444333321111111112235678888887777777778888899999999875 44567788888763 Q ss_pred cceEEEEeeeccCC--cceeeecccch--hhHhHhhccCceEEEEEcc----cceeeeeecccc---------ccchHHH Q lcl|NC_019451. 78 PSSISFARWVNTAI--APMVVGDNLPK--TIADFAGFSAGVLTIMVGA----AEQNITAIDTSA---------ATSMDNV 140 (504) Q Consensus 78 P~~l~igr~~~~a~--~~~l~g~~~~~--~~~~~~~~~~g~~titi~g----~~~~~~~i~~s~---------ats~~~v 140 (504) =+++||-|...... .+...++.+.. ....-.........++... ..-.+..++.+. ....+ . 
T Consensus 75 g~~~~vvRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~v~~vd~~~~~~~~~i~~~~~~~-~ 153 (659) T protein:vir:10 75 GNDLRVVRAVDRDTAKNSSPIAGNIEYTISTPGSNYAVGDKITVKYVSDAIETEGKITEVDTDGKIKKINIPTAKIIA-K 153 (659) T ss_pred CCeEEEEEccCcccccccccccccceeeEeecccccccccceeeeecCCCccccceeeEEecccccceeeeccccccc-c Confidence 35799999854211 01111111110 0000000000011111110 000111111110 00000 0 Q ss_pred HHHHHhh----------hhccc--ccccceeEEEEecccceeE--------Eeccc-------Ccc-------------e Q lcl|NC_019451. 141 ASIIQTE----------IRKNA--DPQLAQATVTWNQNTNQFT--------LVGAT-------IGT-------------G 180 (504) Q Consensus 141 A~~i~~~----------i~a~~--~~~~~~a~vt~d~~~~~F~--------its~t-------~ga-------------~ 180 (504) +...... +.... ...........+.....+. .+... .+. . T Consensus 154 ~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~a~t~~~~~~~~~~~~~~~v~a~~~G~~g~~ 233 (659) T protein:vir:10 154 AKEVGEYPTLGSNWTAEISSSSSGLAAVITLGKIITDSGILLAEIENAEAAMTAVDFQANLKKYGIPGVVALYPGELGDK 233 (659) T ss_pred cccccccceeeeeeeeeeeeeccccceeeEEeeeecCCceeEEeeccccccccccccccceeecccccccccccceeccc Confidence 0000000 00000 0000000000000000010 00000 000 0 Q ss_pred eEEEEeecc-----------------------------------chhhh--------------hhhhc------------ Q lcl|NC_019451. 181 VLAVAKSAD-----------------------------------PQDMS--------------TALGW------------ 199 (504) Q Consensus 181 s~~~~~sa~-----------------------------------~~~ia--------------~~l~~------------ 199 (504) ......... ..++. ...+. T Consensus 234 ~tv~~~~~a~~~~~~~v~v~~~~~~~~~a~~~t~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 313 (659) T protein:vir:10 234 IEIEIVSKADYAKGASALLPIYPGGGTRASTAKAVFGYGPQTDSQYAIIVRRNDAIVQSVVLSTKRGEKDIYDSNIYIDD 313 (659) T ss_pred ceEEEechhhccccceeeeeeeeecccccccceeeeeeccccccchhhccccccceeeeeeeeccccccccccchhhhhh Confidence 000000000 00000 00000 Q ss_pred --ccCcc--e---------------ee-cccc------cccHHHHHHHHHhhc-cceeEEEEEeccC--C----HHHHHH Q lcl|NC_019451. 
200 --STSNV--V---------------NV-AGQA------ADLPDAAVAKSTNVS-NNFGSFLFAGAPL--D----NDQIKA 246 (504) Q Consensus 200 --t~~~~--~---------------~~-~g~a------aet~~~al~~~~~~~-~~wy~~~~~~~~~--~----~~~~~a 246 (504) ..... + .. .|.+ ......++..+.+.. .+. .++.+.... . ..-..+ T Consensus 314 ~~~~~~~~~v~~~~~~~~~~~~~~~~l~gg~~~~~~~~~~~~~~~~~~~~~~~~~~~-~il~~p~~~~~~~~~~~~v~~a 392 (659) T protein:vir:10 314 FFAKGGSEYIFATAQNWPEGFSGILTLSGGLSSNAEVTAGDLMEAWDFFADRESVDV-QLFIAGSCAGESLETASTVQKH 392 (659) T ss_pred hhccCcccEEEEeecccCCCccceeeecccccccccccchhHHHHHHHhhhccccce-eEEEecCCCCcchhhhHHHHHH Confidence 00000 0 00 0000 001122222222211 122 122222211 1 112334 Q ss_pred HHHHHhhcCCcEEEEEecc----c----cchhHHHHHHhh-----------hcceeEEEEeccC-----CC----ccHHH Q lcl|NC_019451. 247 VSAWNAAQNNQFIYTVATS----L----ANLGTLFTLVNG-----------NAGTALNVLSATA-----AN----DFVEQ 298 (504) Q Consensus 247 ~A~w~e~~~~~~~~~~~~~----d----~~~~~~~~~~~~-----------~~~~~~~~~~~~~-----~~----~~~~a 298 (504) +...++....++.+.-.-. + ........-... ...+...++++.. .+ .-|.. T Consensus 393 l~~~~~~~~~~~~~~d~p~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~s~~~~l~~p~~~~~d~~~~~~~~~p~sg 472 (659) T protein:vir:10 393 VVSIGDARQDCLVLCSPPRETVVGIPVTRAVDNLVNWRTAAGSYTDNNFNISSTYAAIDGNYKYQYDKYNDVNRWVPLAA 472 (659) T ss_pred HHHHHHhhCCeEEEEcCccccccCCCcccCHHHHHHHHHhcccccccccccCcceEEEEeCcEEEecccCCceEEechHH Confidence 5555665554443321100 0 001111000000 0112233332110 11 12446 Q ss_pred HHHHHHHhcCcCcCCceeeecccccCccc-----cccCCHHHHHHHHhCCCeEEEEEeeccceeeEEEcCEEeCCcc-cc Q lcl|NC_019451. 299 CPSEILAATNYDEPGASQNYMYYQFPGRN-----ITVSDDTVANTVDKSRGNYIGVTQANGQQLAFYQRGILCGGPT-DA 372 (504) Q Consensus 299 a~~g~~as~nf~~~~g~~T~kfk~l~Gv~-----a~~lt~t~~~al~~~~~n~y~~~~~~~~~~~~~~~G~~~~G~y-~~ 372 (504) .++|.++.+|.++.+ ......|.+.||. ...+++.|.+.|..+++|....+.+.|. 
.++...++++.- +| T Consensus 473 ~~AGl~Ar~D~~~g~-~~span~~~~~i~g~~~~~~~~~~~~~~~Ln~~gIn~i~~~~~~G~---~~wG~rT~~~~~s~~ 548 (659) T protein:vir:10 473 DIAGLCARTDNVSQT-WMSPAGYNRGQILNVIKLAIETRQAQRDRLYQEAINPVTGTGGDGY---VLYGDKTATSVPSPF 548 (659) T ss_pred HHHHHHHHHhccCCc-eEccCCceeeeeeccccceecCCHhHHHHHhhCCeeEEEEeCCCeE---EEEcccccCCCCccc Confidence 777888888654321 2223344444432 2357899999999999999887766542 456777777653 68 Q ss_pred chhhhhhhHHHHHHHHHHHHHHHHhcCCCCCcCHHHHHHHHHHHHHHHHHHHhcCccccccccCcccceeeccccCCccc Q lcl|NC_019451. 373 VDMNVYANEIWLKSAIAQALLDLFLNVNAVPASSTGEAMTLAVLQPVLDKATANGTFTYGKEISAVQQQYITQVTGDRRA 452 (504) Q Consensus 373 ~wiD~~~~~dwl~~~lq~~l~~l~~~~~kIPyt~~G~~~l~~~v~~vl~~a~~nG~Ia~G~~~~~~q~~~i~~~~g~~~~ 452 (504) .+|.+.+-.+|+.+.|+....-..-. |.|+.=...|+..|+.-|+.-+++|.|. T Consensus 549 ~~i~vrR~~~~i~~si~~~~~~~v~e----~n~~~l~~~i~~~i~~fL~~l~~~gal~---------------------- 602 (659) T protein:vir:10 549 DRINVRRLFNMLKTNIGRSSKYRLFE----LNNAFTRSSFRTETAQYLQGIKALGGIY---------------------- 602 (659) T ss_pred ceEehhhHHHHHHHHHHHHHHHhccC----CCCHHHHHHHHHHHHHHHHHHHhcCcee---------------------- Confidence 88999999999999999888763322 5678888999999999999999999884 Q ss_pred ccceeecceEEEecchHhCCHHHHhhcccCceEEEEEECCeEEEEEeeeeeC Q lcl|NC_019451. 453 WRQVQTLGYWINITFSSYTNSNTGLTEWKANYTLIYSKGDAIRFVEGSDVMI 504 (504) Q Consensus 453 ~~~~~~~GY~v~~~~~~~~s~~dr~~R~~~~i~~~~~~aGAIh~v~i~~~~v 504 (504) ||+|.++. +..+++|+.+-+. .+.+.++-.-.+++|.++=+.. 
T Consensus 603 -------~~~V~~d~-~~nt~~~i~~G~~-~~~i~~~p~~pae~i~~~~~~~ 645 (659) T protein:vir:10 603 -------EYRVVCDT-TNNTPSVIDRNEF-VATFYIQPARSINYITLNFVAT 645 (659) T ss_pred -------eEEEEEcC-CCCCHHHhhCCeE-EEEEEEEecCCcceEEEEEEEE Confidence 48898874 7788888887776 5888899999999988875544 No 62 >protein:vir:7653 Length: 581 # NCBI annotation: gp124 # Family: family:all:1196 # MgeID: mge:148 # MgeName: Bxz1 # Cross-refs: genbank:acc:NP_818197;genbank:gi:29566631;genbank:GeneID:1259809 Probab=88.41 E-value=0.032 Score=28.74 Aligned_cols=417 Identities=10% Similarity=0.027 Sum_probs=151.8 Q ss_pred CCCccceEEEeeeecccccccccccceEEEecccccCccceEEecCHHHHHHhcCCCcHHHHHHHHHhccCCCCCcccce Q lcl|NC_019451. 1 MISQSRYIRIISGVGAGAPVAGRKLILRVMTTNNVIPPGIVIEFDNANAVLSYFGAQSEEYQRAAAYFKFISKSVNSPSS 80 (504) Q Consensus 1 mip~s~iv~V~~~v~~~~~~~~~~~~~l~l~~~~~~~~~r~~~y~s~~~V~~~Fg~~s~ey~aA~~yF~~~~~~~~~P~~ 80 (504) .--|+.|-...+.|.+.++. --..-|.+.-..++. ++...-|..+ | . T Consensus 100 L~~L~~i~~~~v~vtg~~~~---~~~V~F~g~~~~~~~----------~~~~ltg~~~-------------------~-~ 146 (581) T protein:vir:76 100 LRALPNVEDDEVTVLGDPGG---PWTVTFTKAVAALTK----------DVTGLTGGDN-------------------P-D 146 (581) T ss_pred HhhccCCCCceEEEEcCCCc---eEEEEEcCCccceeE----------eeeeeecCCc-------------------c-e Confidence 11222222212222211110 011111111000000 0000000000 0 0 Q ss_pred EEEEeeeccCCc--c--eeeecccchhhHhHhhccCceEEEEEcccceeeeeeccccc--cchHHHHHHHHhhhhccccc Q lcl|NC_019451. 81 ISFARWVNTAIA--P--MVVGDNLPKTIADFAGFSAGVLTIMVGAAEQNITAIDTSAA--TSMDNVASIIQTEIRKNADP 154 (504) Q Consensus 81 l~igr~~~~a~~--~--~l~g~~~~~~~~~~~~~~~g~~titi~g~~~~~~~i~~s~a--ts~~~vA~~i~~~i~a~~~~ 154 (504) +-+..-.+-..+ . .++|.. .... ....++.=.+.+-|....+...++... ++..+..-.++..+...... 
T Consensus 147 ~~V~~~~~G~~~~~~~l~~~g~~--~~~~--~~~~s~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~g~~~~ 222 (581) T protein:vir:76 147 LNIASEQTGVPAMNRALAKKGIK--TDTI--RVVNPNSGQVYVLGTDYVVTRVNAGEDGEANTRDDLYTIQRVVDGGHID 222 (581) T ss_pred eEEEEEecCcCCcCceeeecccc--cccc--ceeecCCcceeeecccccceeeccCcccceeeeeeeeeeEeeccccccc Confidence 111110000000 0 000000 0000 000000000000011111111111100 00000000000000000000 Q ss_pred ccceeEEEE---eccc-ceeEEecccCcceeEE-EEeecc----chhhhhhhhcccC-cceeeccc-------ccccHHH Q lcl|NC_019451. 155 QLAQATVTW---NQNT-NQFTLVGATIGTGVLA-VAKSAD----PQDMSTALGWSTS-NVVNVAGQ-------AADLPDA 217 (504) Q Consensus 155 ~~~~a~vt~---d~~~-~~F~its~t~ga~s~~-~~~sa~----~~~ia~~l~~t~~-~~~~~~g~-------aaet~~~ 217 (504) ......+.| |+.. ..+.+........... ....+. .......+-++.+ ......+. ..+...+ T Consensus 223 ~~~i~~~~~~~~D~~~~~~v~~~~~~~~~~~~~~~~~~~g~~~~e~~~~~~~~~t~~~~~~l~~gvd~~g~tvt~~dy~~ 302 (581) T protein:vir:76 223 PGDIVQLSYRYTDPNYHEVIRFTDPDDIQDFYGPAFDEAGNVQSEITLCAQLAITNGASTILACAVDPEGDTVTMGDYQN 302 (581) T ss_pred ceeEEEEEEEeecCCccceEEEecccccccceeeehhhcCccccchhhhhheeeccccceEEEeeecCCCCccchHHHHH Confidence 000001111 1100 1122222111110000 000000 0011111111111 11222222 2234678 Q ss_pred HHHHHHhhccceeEEEEEeccCCHHHH-HHHHHHHhhcC---Cc-EE-EEEe-ccc-cchhHHHHHHhhhcceeEEEEec Q lcl|NC_019451. 218 AVAKSTNVSNNFGSFLFAGAPLDNDQI-KAVSAWNAAQN---NQ-FI-YTVA-TSL-ANLGTLFTLVNGNAGTALNVLSA 289 (504) Q Consensus 218 al~~~~~~~~~wy~~~~~~~~~~~~~~-~a~A~w~e~~~---~~-~~-~~~~-~~d-~~~~~~~~~~~~~~~~~~~~~~~ 289 (504) +|+++.++. +..+++... .++.+ .++..|++... +. +. ..+- ... 
................++.++.+ T Consensus 303 aL~ale~~~--~~~ivvp~t--~~~~i~a~l~ahv~~~s~~~~~~ra~igv~g~~~~~~~~~~~~~a~~~ns~Rvvlv~p 378 (581) T protein:vir:76 303 ALNKFRDED--EIAIIVAGT--GAQPIQALVQQHVSAQSNNKYERRAILGMDGSVTPVPSATRIANAQSIKDQRVALISP 378 (581) T ss_pred HHHHHhcCC--eEEEEEecC--CChHHHHHHHHHHHHHHhccCCceEEEEeeCCCCCchHHHHHHhhcccCCCcEEEEEc Confidence 888887653 333333322 33444 45777776553 22 21 1221 111 12222222112222333333321 Q ss_pred c--------------CCCccHHHHHHHHHHhcCcCcCCceeeecccccCccc--cccCCHHHHHHHHhCCCeEEEEEeec Q lcl|NC_019451. 290 T--------------AANDFVEQCPSEILAATNYDEPGASQNYMYYQFPGRN--ITVSDDTVANTVDKSRGNYIGVTQAN 353 (504) Q Consensus 290 ~--------------~~~~~~~aa~~g~~as~nf~~~~g~~T~kfk~l~Gv~--a~~lt~t~~~al~~~~~n~y~~~~~~ 353 (504) . .+..+.++.+.|..+..+. ...+-||.++|+. ...++.+|++.|..+|++.+....+. T Consensus 379 ~~~~~~g~~~~~~~~lp~~~~AA~vAG~~a~~~~-----~~slT~~~i~g~~~~~~~~s~~e~e~ll~~Gv~~l~~~~~~ 453 (581) T protein:vir:76 379 SSFVYYAPELNREVVLGGQFMAAAVAGKSVSAIA-----AMPLTRKVIRGFSGPAEVQRDGEKSRESSEGLMVIEKTPRN 453 (581) T ss_pred CceEeccccCCcceecchhhhhhhHHhhhhcccc-----ccCcccccccccccccccCCHHHHHHHHhCCeEEEEEecCC Confidence 1 0112233444455555433 3355688888886 44678999999999999999765443 Q ss_pred cceeeEEEcCEEe-CCccccchhhhhhhHHHHHHHHHHHHHH-HHhcCCCCCcCHHHHHHHHHHHHHHHHHHHhcCcccc Q lcl|NC_019451. 354 GQQLAFYQRGILC-GGPTDAVDMNVYANEIWLKSAIAQALLD-LFLNVNAVPASSTGEAMTLAVLQPVLDKATANGTFTY 431 (504) Q Consensus 354 ~~~~~~~~~G~~~-~G~y~~~wiD~~~~~dwl~~~lq~~l~~-l~~~~~kIPyt~~G~~~l~~~v~~vl~~a~~nG~Ia~ 431 (504) + .. +.+|+.+ .-.-.|+-|.+++-.|.+...+++.+.. .|.. | |=++.|...|++.+++.|++..++|+|.. 
T Consensus 454 ~--v~-Iv~gItT~~s~~~~k~i~viR~~D~v~~~vr~~~~~~~fiG--~-~n~~~~r~~ik~~i~~~L~~l~~~g~I~g 527 (581) T protein:vir:76 454 L--VH-VRHGVTTDPTSLHTREWNIIGQQDVMVYRIRDYLDADGLIG--M-PIYDTTIVQVKASAEAALVWLVDNNIIRG 527 (581) T ss_pred e--EE-EEEeeecCCCCCccceeeehhhhHHHHHHHHHHHhhhcCCC--c-ccChHHHHHHHHHHHHHHHHHHhcCcccC Confidence 2 22 2456544 1111345588889999999888888753 3543 3 77889999999999999999999999975 Q ss_pred ccccCcccceeeccccCCcccccceeecceEEEecchHhCCHHHHhhcccCceEEEEEECCeEEEEEeeeeeC Q lcl|NC_019451. 432 GKEISAVQQQYITQVTGDRRAWRQVQTLGYWINITFSSYTNSNTGLTEWKANYTLIYSKGDAIRFVEGSDVMI 504 (504) Q Consensus 432 G~~~~~~q~~~i~~~~g~~~~~~~~~~~GY~v~~~~~~~~s~~dr~~R~~~~i~~~~~~aGAIh~v~i~~~~v 504 (504) -....+.| ....+ | | .-+.+.+...-+|.+|-++--.| T Consensus 528 ~~~~~~~~----~~~~~--------------------------d---~--v~V~i~v~Pv~~ie~I~vt~~~~ 565 (581) T protein:vir:76 528 YRNLKARQ----IERQP--------------------------D---V--IEVRYEWRPAYPLNYIVVRYSIA 565 (581) T ss_pred cccceeeE----EecCC--------------------------C---E--EEEEEEEEecccceEEEEEEEEe Confidence 22111111 00000 0 1 12344444444444444444444 No 63 >protein:vir:3788 Length: 376 # NCBI annotation: tail sheath # Family: family:all:669 # MgeID: mge:328 # MgeName: HP2 # Cross-refs: genbank:acc:NP_536828;genbank:gi:17981837;genbank:GeneID:929216 Probab=67.21 E-value=0.26 Score=23.83 Aligned_cols=321 Identities=11% Similarity=0.011 Sum_probs=134.1 Q ss_pred EcccceeeeeeccccccchHHHHHHHHhhhhcccccccceeEEEEecccceeEEecc-cCcceeEEEEeeccchhhhhhh Q lcl|NC_019451. 119 VGAAEQNITAIDTSAATSMDNVASIIQTEIRKNADPQLAQATVTWNQNTNQFTLVGA-TIGTGVLAVAKSADPQDMSTAL 197 (504) Q Consensus 119 i~g~~~~~~~i~~s~ats~~~vA~~i~~~i~a~~~~~~~~a~vt~d~~~~~F~its~-t~ga~s~~~~~sa~~~~ia~~l 197 (504) .-|+. .+..+|+ .|+.+ ...-..|.+... 
+.........- .-.|+-..+ T Consensus 1 ~~~~v-~vn~~n~------------~~g~~---------------~~~er~~Lfig~~~~~~~~~~~~~--~~sdld~~l 50 (376) T protein:vir:37 1 MFPSV-QINALNQ------------LSGET---------------KEIERHALFVGVGTTNQGKLLALT--PDSDFDKVF 50 (376) T ss_pred CCCeE-EEecccc------------cCCCc---------------ccccceEEeeccccccccceeeec--CccchHhhh Confidence 11111 1111111 11111 122223333321 11122221111 111222222 Q ss_pred hccc--------------Cc----ceeecccccccHHHHHHHHHhhccceeEEEEEecc-CCHHHHHH---HHHHHhhcC Q lcl|NC_019451. 198 GWST--------------SN----VVNVAGQAADLPDAAVAKSTNVSNNFGSFLFAGAP-LDNDQIKA---VSAWNAAQN 255 (504) Q Consensus 198 ~~t~--------------~~----~~~~~g~aaet~~~al~~~~~~~~~wy~~~~~~~~-~~~~~~~a---~A~w~e~~~ 255 (504) |..+ ++ .+.......++..+++..+. ...++.+..++..+ .+.+++.+ ++.....+- T Consensus 51 g~~~~~lk~~v~aa~~naG~~~~~~~~~~~~~~~~~~~Av~~a~-~~~s~E~V~v~~pv~t~~a~i~aa~~~a~el~~~~ 129 (376) T protein:vir:37 51 GETDTDLKKQVRAAMLNAGQNWFAHVYIAQEDGYDFVECVKKAN-QTASFEYCVNTRYLGVDKASIGKLQECYAELLAKF 129 (376) T ss_pred CCCchHHHHHHHHHHhCCCCcEEEEEEeecCCchHHHHHHHHhh-hhcCceEEEEeccccccHHHHHHHHHHHHHHHHhc Confidence 2211 11 11222223345666666553 33455555554432 23555544 455555554 Q ss_pred CcEEEEEecc---c--c----chhHHHHHHh----hhcceeEEEEeccCCCccHHHHHHHHH--HhcCcCcCCceeeec- Q lcl|NC_019451. 256 NQFIYTVATS---L--A----NLGTLFTLVN----GNAGTALNVLSATAANDFVEQCPSEIL--AATNYDEPGASQNYM- 319 (504) Q Consensus 256 ~~~~~~~~~~---d--~----~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~aa~~g~~--as~nf~~~~g~~T~k- 319 (504) .++.++.... + + +...+..... +....+..+++..+. +....++||. 
+++--...++++.-- T Consensus 130 ~Rpv~file~r~~~~~~~~~e~w~~y~~~~~al~~gia~~~V~~V~~~~g--n~~G~~aGRl~~aaVsVadspgRV~tG~ 207 (376) T protein:vir:37 130 GRRTFFIQAVQGINHDQSDGETWDQYVQKLTTLQQTIVADHVCLVPLLFG--NETGVLAGRLANRAVTVADSPARVQTGA 207 (376) T ss_pred CCeEEEEEeccCcCcccccccCHHHHHHHHHHhhcccccccceeeeeehh--hhHHHHHHHHhhcccchhhCccceeccc Confidence 5555555433 1 1 1111121111 112223333333222 3355667886 344323445443211 Q ss_pred ccc------cCccccccCCHHHHHHHHhCCCeEEEEEeeccceeeEEEcCEEeC---CccccchhhhhhhHHHHHHHHHH Q lcl|NC_019451. 320 YYQ------FPGRNITVSDDTVANTVDKSRGNYIGVTQANGQQLAFYQRGILCG---GPTDAVDMNVYANEIWLKSAIAQ 390 (504) Q Consensus 320 fk~------l~Gv~a~~lt~t~~~al~~~~~n~y~~~~~~~~~~~~~~~G~~~~---G~y~~~wiD~~~~~dwl~~~lq~ 390 (504) ... ...-....++...+.+|+++|+.+...|.++. --++.+|.|+. |.| .-|- +.+.+-|..=+. T Consensus 208 l~gl~~~~lp~d~~~~~l~~a~l~aLd~agy~vp~~Y~gy~--G~Y~~d~~tl~~~gsDY--~~ie--~~RVvdKa~R~v 281 (376) T protein:vir:37 208 LVSLGSANKPLDKDRNELTLAHLKSLETARYSVPMWYPDYD--GYYWADGRTLDVEGGDY--QVIE--NLRVVDKVARKV 281 (376) T ss_pred cccccccccccCcCcccCCHHHHHHHHhCCCeEEEeeCCCC--ceEEeCceEeccCCCCh--hhhh--hhhHHHHHHHHH Confidence 111 12333457889999999999999999999876 34788999984 444 2233 356666665444 Q ss_pred HHHHHH-hcCCCCCcCHHHHHHHHHHHHHHHHHHHhcCccccccccCcccceeeccccCCcccccceeecceEEEecchH Q lcl|NC_019451. 391 ALLDLF-LNVNAVPASSTGEAMTLAVLQPVLDKATANGTFTYGKEISAVQQQYITQVTGDRRAWRQVQTLGYWINITFSS 469 (504) Q Consensus 391 ~l~~l~-~~~~kIPyt~~G~~~l~~~v~~vl~~a~~nG~Ia~G~~~~~~q~~~i~~~~g~~~~~~~~~~~GY~v~~~~~~ 469 (504) .+..+- .....+=-+..+++..+..+..+|++..+...|.. +. +. | .|..|.-. T Consensus 282 R~~ai~~i~D~~lnst~~sia~~~~yi~~pLr~M~~s~~i~g-~~----fp-------------------G-eI~~p~d~ 336 (376) T protein:vir:37 282 RLLAIGKIADRSFNSTTSSTEYHKNYFAKPLRDMSKSATING-KD----FP-------------------G-ECMPPKDD 336 (376) T ss_pred HHHHHHHhCCcccCcchhhHHHHHHHHHHHHHHHHhcchhcc-cc----cc-------------------c-eeecCCCC Confidence 433221 12233444677889999999999999988777653 11 00 0 12221111 Q ss_pred hCCHHHHhhcccCceEEEEEECCeEEEEEeeeeeC Q lcl|NC_019451. 
470 YTNSNTGLTEWKANYTLIYSKGDAIRFVEGSDVMI 504 (504) Q Consensus 470 ~~s~~dr~~R~~~~i~~~~~~aGAIh~v~i~~~~v 504 (504) ++..+-.. +....|.+..+-=|.-..++++--|= T Consensus 337 Di~i~w~s-~~~V~I~~~v~P~~~pk~Itv~I~Ld 370 (376) T protein:vir:37 337 AITIVWQS-KTKVTIYIKVRPYDCPKEITANIFLD 370 (376) T ss_pred CceEEeec-cceEEEEEEEEeccCCceEEEEEEee Confidence 11111000 11111222111111111111110000 No 64 >protein:vir:102819 Length: 648 # NCBI annotation: tail sheath protein # Family: family:all:2449 # MgeID: mge:1610 # MgeName: YS40 # Cross-refs: genbank:acc:YP_874082;genbank:gi:118197689;genbank:GeneID:4496011 Probab=66.09 E-value=0.27 Score=23.67 Aligned_cols=447 Identities=11% Similarity=0.045 Sum_probs=202.9 Q ss_pred CCCccc---eEEEeee-ecccccccccccceEEEecccccCccceEEecCHHHHHHhcCCCcHHHHHHHHHhccCCCCCc Q lcl|NC_019451. 1 MISQSR---YIRIISG-VGAGAPVAGRKLILRVMTTNNVIPPGIVIEFDNANAVLSYFGAQSEEYQRAAAYFKFISKSVN 76 (504) Q Consensus 1 mip~s~---iv~V~~~-v~~~~~~~~~~~~~l~l~~~~~~~~~r~~~y~s~~~V~~~Fg~~s~ey~aA~~yF~~~~~~~~ 76 (504) |-|+++ +|...++ +.+..+ .......|++....=|++.....+|.++....||. ++.-.|...||.. T Consensus 9 ~~~~~~PGVyvee~~sg~~~i~g--v~tsva~fvG~a~~Gp~~~p~~v~s~~~~~~~fgg-g~l~~av~~~F~n------ 79 (648) T protein:vir:10 9 GKLIKQLGAYVKTDLSAVKQING--VGTGIVALLGLAEGGETYKPYRLTSFAEAVSIFKG-GPLLEHIKAAFIG------ 79 (648) T ss_pred CCCccCCceEEEEeccccccccC--CCCceEEEEEeeCCCCCceeEEecCHHHHHHHhcC-ccHHHHHHHHHhC------ Confidence 222222 3333332 333333 34677888888877788888999999999999986 4567788899975 Q ss_pred ccceEEEEeeeccCCcceee--ecccchh--------hH---hH-hhccCceEEEEEc---------------------- Q lcl|NC_019451. 77 SPSSISFARWVNTAIAPMVV--GDNLPKT--------IA---DF-AGFSAGVLTIMVG---------------------- 120 (504) Q Consensus 77 ~P~~l~igr~~~~a~~~~l~--g~~~~~~--------~~---~~-~~~~~g~~titi~---------------------- 120 (504) =-+++|+-|.... +.+.+. |-.+.+. +. .. ..-..+.+.+++. 
T Consensus 80 Gg~~~~~vRv~~~-~~a~~~~~~~~~~a~~~g~~gn~i~~~v~~~~~~~~~~~~l~v~~~~~~~~~d~~v~~i~~~~~~y 158 (648) T protein:vir:10 80 GAGEVVAVRIGNP-TTASVSIPVAQNTSDTSPANLNFVSYEASTRSNQIYVSFDLDENFTSANEADDTIIFTIYQKHPDF 158 (648) T ss_pred CCcEEEEEEcCCC-cccceecceeEEeecccCCCCCceEEEEEEcCCCcCceeEEEEEecCCCcccceeEEEeccCCCcc Confidence 4578999998542 111111 1111000 00 00 0000111222110 Q ss_pred -ccceeeeeecccccc-----chH-----HHHHHHHhhhhcccccccc-----eeEEEEecccceeEE------------ Q lcl|NC_019451. 121 -AAEQNITAIDTSAAT-----SMD-----NVASIIQTEIRKNADPQLA-----QATVTWNQNTNQFTL------------ 172 (504) Q Consensus 121 -g~~~~~~~i~~s~at-----s~~-----~vA~~i~~~i~a~~~~~~~-----~a~vt~d~~~~~F~i------------ 172 (504) |+..... .+....+ ... .-...+...+.+....... .....|+.....|.. T Consensus 159 ~gt~~~~t-~~v~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~s~~~~~d~~~~~ 237 (648) T protein:vir:10 159 SVTRETFT-FPRKFTTPTVLVKRGSTLFFVDRSIVNAALAAGPAFQTALINLLKEQLQPTDVVQIFDASDTNPVDIPLGL 237 (648) T ss_pred cccceecc-ccccccccccccccccceeecCccchhhhhccCccchhhhhhchhhhhhhhhhheeccccccccccccccc Confidence 0000000 0000000 000 0000000000000000000 000000000000000 Q ss_pred -------------ecc-----cCccee--------EEEEeec--------------cchh-------hhhhhhccc--Cc Q lcl|NC_019451. 173 -------------VGA-----TIGTGV--------LAVAKSA--------------DPQD-------MSTALGWST--SN 203 (504) Q Consensus 173 -------------ts~-----t~ga~s--------~~~~~sa--------------~~~~-------ia~~l~~t~--~~ 203 (504) +.. -.|... ....+++ .+.+ ....+.... +. T Consensus 238 ~~~~a~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~~tp~~~~~~~~~~~~~~~~~~~~~v~~~~~~~l~~~~~~p~ 317 (648) T protein:vir:10 238 FVYEVLYGGLFGFTKSRLVKTSFGTVDDLLSNPLLFNLSATPFFDGSDYQDYTSLSDPANWFAKDAYTINHLVDTTINPH 317 (648) T ss_pred ccccccchhhhcCCcchhhhhhhccccccccccceecccccccccccceeeeeccccccceeeeeccchhhcccccccCc Confidence 000 000000 0000000 0000 000000000 00 Q ss_pred c------eeecccc---------------cccHHHHHHHHHhhccceeEEEEEe---------ccCC--HHHHHHHHHHH Q lcl|NC_019451. 
204 V------VNVAGQA---------------ADLPDAAVAKSTNVSNNFGSFLFAG---------APLD--NDQIKAVSAWN 251 (504) Q Consensus 204 ~------~~~~g~a---------------aet~~~al~~~~~~~~~wy~~~~~~---------~~~~--~~~~~a~A~w~ 251 (504) . ....|.+ ..+..++++.+++....| .+... +.++ .+-+.++-+|+ T Consensus 318 ~~~~~~t~L~GGtdG~~p~s~~~~~~~~~~~d~~d~l~~~~~~~~~~--ivp~~~~~~~~~~~~~lt~~q~i~a~a~shv 395 (648) T protein:vir:10 318 ILATRIFSLSGGTNGDDGTGYYQTAVSNYINIWSQGLATLEEEEVNF--VIPAYKFTNVTQLNDRLTIFKGIASTFLSHV 395 (648) T ss_pred ccccccceecccccCCCcccccccccccchhhHHHHhhhccCCCceE--EEeecccccccccccccCCccchHHHHHHHH Confidence 0 0111222 223567787776654332 22200 0011 22233333566 Q ss_pred hhcC--C----c---EEEEEeccccchhHHHHHHhh--hcc-e---------eEEE-------Ee-----ccCCCccHHH Q lcl|NC_019451. 252 AAQN--N----Q---FIYTVATSLANLGTLFTLVNG--NAG-T---------ALNV-------LS-----ATAANDFVEQ 298 (504) Q Consensus 252 e~~~--~----~---~~~~~~~~d~~~~~~~~~~~~--~~~-~---------~~~~-------~~-----~~~~~~~~~a 298 (504) ..+. + + +..+.-............... ... + +... +. ...+..+.++ T Consensus 396 ~~~s~~~~~~~r~~~~~~vg~~~~es~~~se~~~~~~~~~~~~a~~~~~d~~~~~~~~~~~~~~~~~G~~~~~p~~~~Aa 475 (648) T protein:vir:10 396 QTMSQVNRRKARVGVFGLPAPSPNESVTASEYLYNRNILNTISAMFGGTDRAQAVVFPFYSNVFNDEGKVELLGGEFFAS 475 (648) T ss_pred HHhhhccccccccCeEEEeCCCCchhHHHHHHHhhhhcccccceeeeecCCceEEeecccceeECCCCcEEecchhhHHH Confidence 5432 1 2 222211111111000000000 000 0 0000 00 0023445567 Q ss_pred HHHHHHHhcCcCcCCceeeecccccCccc--c-ccCCHHHHHHHHhCCCeEEEEEeecccee-eEEEcCEEeCCc---cc Q lcl|NC_019451. 299 CPSEILAATNYDEPGASQNYMYYQFPGRN--I-TVSDDTVANTVDKSRGNYIGVTQANGQQL-AFYQRGILCGGP---TD 371 (504) Q Consensus 299 a~~g~~as~nf~~~~g~~T~kfk~l~Gv~--a-~~lt~t~~~al~~~~~n~y~~~~~~~~~~-~~~~~G~~~~G~---y~ 371 (504) ++.|..+.++.... .-||.++++. + ..++++|++.|..+|+++.+...+++... -....|.++-+. +. 
T Consensus 476 ~VAGl~a~l~~~~s-----~T~k~i~~~~id~~~~~t~~qld~L~~~Gv~~ie~~~~~~~~~~~rvv~gITT~~~~~~~~ 550 (648) T protein:vir:10 476 YVAGMHANREPQDS-----ITFLPISGIGAEPLYNWTYTQKDDLISNRVLFVEKVKTSFGGIVYRIHHNPTTWLGPVTQG 550 (648) T ss_pred HHHhhhhccccccC-----cccceeeccccccccCCCHHHHHHHhcCCcEEEEEecCCcceeeEEEeccceeecCCCCcc Confidence 77888877655443 4466665543 3 47899999999999999998776654432 223467776552 22 Q ss_pred cchhhhhhhHHHHHHHHHHHHHHHHhcCCCCCcCHHHHHHHHHHHHHHHHHHHhcCccccccccCcccceeeccccCCcc Q lcl|NC_019451. 372 AVDMNVYANEIWLKSAIAQALLDLFLNVNAVPASSTGEAMTLAVLQPVLDKATANGTFTYGKEISAVQQQYITQVTGDRR 451 (504) Q Consensus 372 ~~wiD~~~~~dwl~~~lq~~l~~l~~~~~kIPyt~~G~~~l~~~v~~vl~~a~~nG~Ia~G~~~~~~q~~~i~~~~g~~~ 451 (504) |.-|-+.+-.|.+...++..+.+.|.-. |=++.....|++.+..-|.+-++.+-|.+-..+ T Consensus 551 ~~eisv~ri~D~l~~~vr~~l~~~fIG~---~n~~~~~~~ik~~i~~~L~~~~~~~~I~~y~~~---------------- 611 (648) T protein:vir:10 551 FQEFVLRRIDDFLQSYVYKNLQEQFIGR---KSYGRKTENDIKVYTEALLSNLVGKQIVAYKDV---------------- 611 (648) T ss_pred eeeeeeeehhhHHHHHHHHHHhhhcCcc---cccHHHHHHHHHHHHHHHhhHhhcCcccCcccc---------------- Confidence 3347888889999999999999988663 556778999999999998888888877641111 Q ss_pred cccceeecceEEEecchHhCCHHHHhhcccCceEEEEEECCeEEEEEeeeeeC Q lcl|NC_019451. 452 AWRQVQTLGYWINITFSSYTNSNTGLTEWKANYTLIYSKGDAIRFVEGSDVMI 504 (504) Q Consensus 452 ~~~~~~~~GY~v~~~~~~~~s~~dr~~R~~~~i~~~~~~aGAIh~v~i~~~~v 504 (504) .++... .+ .| .-|.|.+.-.-+|++|.++-.+. T Consensus 612 ----------~v~~~~-----~~---~v--v~V~~~v~Pv~~i~~I~vti~it 644 (648) T protein:vir:10 612 ----------KVTSNE-----DK---TV--YYVEFFYQPVTEIKFILVTMKVT 644 (648) T ss_pred ----------eEEEEe-----cC---CE--EEEEEEEEecceeeEEEEEEEEE Confidence 122110 01 22 25888999999999998888777 Done!