Query         lcl|NC_011273.1_cdsid_YP_002225031.1 [gene=120] [protein=119] [protein_id=YP_002225031.1] [location=68214..68684]
Match_columns 156
No_of_seqs    2 out of 5
Neff          1.3
Searched_HMMs 1612
Date          Thu Nov 7 14:56:22 2013
Command       /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_113 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_113_vs_rec_db.hhr

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 protein:vir:94654 Length: 142   78.5   0.013 7.8E-06   31.0   3.2  116    1-143     1-142 (142)
  2 protein:vir:102441 Length: 137  78.0   0.015 9.4E-06   30.5   3.5  113    1-144     9-137 (137)
  3 protein:vir:8669 Length: 142 #  72.6   0.038 2.3E-05   28.4   4.2  115    1-143    13-142 (142)
  4 protein:vir:99101 Length: 142   72.6   0.038 2.3E-05   28.4   4.2  115    1-143    13-142 (142)
  5 protein:vir:97982 Length: 140   67.2   0.059 3.6E-05   27.3   4.0  112    1-140    12-140 (140)
  6 protein:vir:107545 Length: 140  67.2   0.059 3.6E-05   27.3   4.0  112    1-140    12-140 (140)
  7 protein:vir:106041 Length: 137  65.9   0.029 1.8E-05   29.0   2.1  116    1-140     1-137 (137)
  8 protein:vir:96829 Length: 135   59.1    0.16 9.9E-05   25.0   4.8  113    1-138     1-135 (135)
  9 protein:vir:106506 Length: 137  57.1   0.084 5.2E-05   26.5   3.0  114    1-147    14-137 (137)
 10 protein:vir:107099 Length: 137  36.1     0.4 0.00025   22.8   3.2  110    1-138     1-137 (137)
 11 protein:vir:95062 Length: 116   36.0    0.59 0.00036   21.9   4.1  114   11-146     1-116 (116)
 12 protein:vir:5978 Length: 144 #  33.8    0.69 0.00043   21.5   4.1  117    1-147     1-144 (144)
 13 protein:vir:78077 Length: 141   28.0     1.7   0.001   19.4   5.2  133    1-146     8-141 (141)
 14 protein:vir:105330 Length: 137  27.4    0.81 0.00051   21.1   3.3  115    1-138     1-137 (137)
 15 protein:vir:105916 Length: 149  26.1     1.3 0.00082   19.9   4.3  108    1-138    13-149 (149)
 16 protein:vir:94108 Length: 149   24.0       1 0.00064   20.5   3.3  108    1-138    13-149 (149)
 17 protein:vir:94796 Length: 137   23.1     1.3 0.00081   19.9   3.6  114    1-138     1-137 (137)
 18 protein:vir:96121 Length: 137   22.4       1 0.00064   20.5   2.9  124    1-138     1-137 (137)
 19 protein:vir:97327 
Length: 116 21.1 1.3 0.00084 19.9 3.3 114 10-138 1-116 (116) 20 protein:vir:1243 Length: 116 # 21.1 1.3 0.00084 19.9 3.3 114 10-138 1-116 (116) No 1 >protein:vir:94654 Length: 142 # NCBI annotation: tail component protein # Family: family:all:1084 # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579211;genbank:gi:93007447;genbank:GeneID:5076773 Probab=78.46 E-value=0.013 Score=31.00 Aligned_cols=116 Identities=21% Similarity=0.203 Sum_probs=50.1 Q ss_pred CCcc----cc----------cHHHHHHHHHHHHHHHHhhh----hccCccccccccc-----ccccc---ccchHHHHHH Q lcl|NC_011273. 1 MPLV----GL----------PAQTCAVISQMATTKARQDV----MGRGWRSAGALQP-----VSNQG---EVGIRSTMKH 54 (156) Q Consensus 1 mplv----g~----------p~~~~~vi~~~a~~~ar~d~----~grgw~s~galqp-----~s~~g---~vgi~stmkh 54 (156) |--+ |+ +.+...+ ...+++.+=+++ +-.-=..||.|.- ++..| .+.+.+++.+ T Consensus 1 Ma~~~~~~~~~~l~~~l~~~~~~~~~~-~~~~l~~~a~~i~~~ak~~aPv~TG~Lr~SI~~~~~~~g~~~~~~v~~~~~Y 79 (142) T protein:vir:94 1 MAGLNYRVNSTEFQGALRAALDRLTGA-AREATEAAANDMVNMAKGLCPVDTGRLRSSIQAVPSGGRFSFSVTIGTNVTY 79 (142) T ss_pred CceeEEEecHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHhCCccchhhhccceeeeccCCceEEEEEecCccc Confidence 2111 21 1111111 111222111111 1111113455532 12222 4556678888 Q ss_pred HhhhcCCCceeeeeeecCeeeeeecCccceeeccccCCCCceeeeCCcccccccccccCCCCchhHHHHHHHHHHHHhhH Q lcl|NC_011273. 55 LLYQNSGVKSFLMYWVEGRTVPITDKTGTHFVRGREVGKPGYVNIPGRGRVWRNQKWRYPGLQPKRFIESSIAQAVKENR 134 (156) Q Consensus 55 ll~qn~g~~~f~m~wvegr~vpitdktgt~~irgre~gkpgyv~ipgrg~iwr~qkwr~pgl~pkrf~e~~ia~aike~~ 134 (156) ..+++.|-.+.. |++. ++++. ..++.+. +. ..=+|||.+|..||+.|+.++-.+-+ T Consensus 80 A~~vE~Gt~~~~-------------------i~pk--~~k~l-~~~~~~~-~~-~~v~~pG~~~~pfl~~A~~~~~~~i~ 135 (142) T protein:vir:94 80 AADVEYGTAPHV-------------------IVPK--DKKAL-YWPGAAH-PV-AKVNHPGTRAQPFMRPAIAAASTFLR 135 (142) T ss_pred chhhhccCCCce-------------------eccC--CCccc-eecccce-ee-eeeeecCCCCCcchhHHHHHHHHHHH Confidence 888888865532 2211 12221 2333332 11 12268999999999999876544444 Q ss_pred HHHHHHHHh Q lcl|NC_011273. 
135 ELIRSSVIR 143 (156) Q Consensus 135 ~~ir~~~~~ 143 (156) +.|+.- | T Consensus 136 ~~~~~~--~ 142 (142) T protein:vir:94 136 NHAKGI--R 142 (142) T ss_pred HHHHhc--C Confidence 433321 1 No 2 >protein:vir:102441 Length: 137 # NCBI annotation: gp26 # Family: family:all:1084 # MgeID: mge:1618 # MgeName: Pipefish # Cross-refs: genbank:acc:YP_655303;genbank:gi:109521866;genbank:GeneID:4157756 Probab=78.04 E-value=0.015 Score=30.55 Aligned_cols=113 Identities=19% Similarity=0.311 Sum_probs=54.6 Q ss_pred CCcccccHHHHHHHHHHHHHHHHhhhhccC-----cccccccccc------ccccc----cchHHHHHHHhhhcCCCcee Q lcl|NC_011273. 1 MPLVGLPAQTCAVISQMATTKARQDVMGRG-----WRSAGALQPV------SNQGE----VGIRSTMKHLLYQNSGVKSF 65 (156) Q Consensus 1 mplvg~p~~~~~vi~~~a~~~ar~d~~grg-----w~s~galqp~------s~~g~----vgi~stmkhll~qn~g~~~f 65 (156) .-..++..++..++ ++++..+-.++...- + .+|.|+-- ++.++ .++-+++++-.|.+-| T Consensus 9 ~~~~~~~~~~~~v~-r~~l~~~a~~v~~~Ak~~aPv-~tG~Lr~SI~~~~~~~~~~~~~~~~V~~~~~YA~~ve~G---- 82 (137) T protein:vir:10 9 RNPVGEARQFQVIA-RRRLSRITRGTANQARADVPV-KTGNLGRSIREDPIVVAGPLRLDSGVTAHADYARYVHDG---- 82 (137) T ss_pred cCchhHHHHHHHHH-HHHHHHHHHHHHHHHHhcCCc-cchhhhcCceeeeeeccccceEEEEecCCCccceeeecC---- Confidence 22334444554444 445555444443221 1 13333321 12111 1233444444443333 Q ss_pred eeeeecCeeeeeecCccceeeccccCCCCceeeeCCcccc-cccccccCCCCchhHHHHHHHHHHHHhhHHHHHHHHHhh Q lcl|NC_011273. 66 LMYWVEGRTVPITDKTGTHFVRGREVGKPGYVNIPGRGRV-WRNQKWRYPGLQPKRFIESSIAQAVKENRELIRSSVIRS 144 (156) Q Consensus 66 ~m~wvegr~vpitdktgt~~irgre~gkpgyv~ipgrg~i-wr~qkwr~pgl~pkrf~e~~ia~aike~~~~ir~~~~~~ 144 (156) |+-|.|+.+ .+.|+++.++.|+. .+. 
+=+|||.+|..||+-|+.+++.+-- - ++ T Consensus 83 ---------------T~ph~I~Pk--~~k~~l~~~~~g~~vf~k-~V~hPG~~a~PfL~~A~~~~~~~~~--~-----~~ 137 (137) T protein:vir:10 83 ---------------TRAHVIRPR--RPGGVLRFTVGGRVVYAR-RVNHPGTRARPFLRNAAERVVARET--A-----TS 137 (137) T ss_pred ---------------CCCceeecc--ccceeeeEeeCCeeEecc-eeecCCCCCCchHHHHHHHhhhhhc--c-----cC Confidence 344556543 24667777766653 233 3459999999999988877653211 1 11 No 3 >protein:vir:8669 Length: 142 # NCBI annotation: gp27 # Family: family:all:1084 # MgeID: mge:156 # MgeName: Rosebush # Cross-refs: genbank:acc:NP_817788;genbank:gi:29566220;genbank:GeneID:1259476 Probab=72.63 E-value=0.038 Score=28.38 Aligned_cols=115 Identities=18% Similarity=0.250 Sum_probs=46.9 Q ss_pred CCcccccHHHHHHHHHHHHHHHHhhhhccCcc----cccccccc-c-----ccc----ccchHHHHHHHhhhcCCCceee Q lcl|NC_011273. 1 MPLVGLPAQTCAVISQMATTKARQDVMGRGWR----SAGALQPV-S-----NQG----EVGIRSTMKHLLYQNSGVKSFL 66 (156) Q Consensus 1 mplvg~p~~~~~vi~~~a~~~ar~d~~grgw~----s~galqp~-s-----~~g----~vgi~stmkhll~qn~g~~~f~ 66 (156) --|..++.+.. .+.+.++..+-.++...--. .+|.|.-- . +.. ..|+-|+..+-.|.+.| T Consensus 13 ~~l~~~~~~~~-~~~~~~i~~~a~~v~~~Ak~~aPv~tG~Lr~SI~~~~~~~~~~~~~~~~v~~~a~YA~~ve~G----- 86 (142) T protein:vir:86 13 YNPVGAAAQVG-PILRRTHSSLTRQIANETRARVPVLTGHLGRSVREDPQVMVTPFHVSGGVTAHAKYAAAVHEG----- 86 (142) T ss_pred hhHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhCCccchhhhcceeeeeccccccceEEEEeccCccccceeccC----- Confidence 11222333332 23344444443333322211 12333210 0 000 12233444444444333 Q ss_pred eeeecCeeeeeecCccceeeccccCCCCce-eeeCCcccccccccccCCCCchhHHHHHHHHHHHHhhHHHHHHHHHh Q lcl|NC_011273. 67 MYWVEGRTVPITDKTGTHFVRGREVGKPGY-VNIPGRGRVWRNQKWRYPGLQPKRFIESSIAQAVKENRELIRSSVIR 143 (156) Q Consensus 67 m~wvegr~vpitdktgt~~irgre~gkpgy-v~ipgrg~iwr~qkwr~pgl~pkrf~e~~ia~aike~~~~ir~~~~~ 143 (156) |+-|.|+.+ .++.. .+.+|+. +.. 
.+=+|||-+|..|++-|+.+..++- +.-++| T Consensus 87 --------------T~ph~i~pk--~~~al~f~~~g~~-~~~-k~v~hpG~~a~Pfl~~A~~~~~~~~----~~~~~r 142 (142) T protein:vir:86 87 --------------TRPHVIRAK--HAQALHFWWRGRE-VFV-RQVNHPGTRARPYLRNAGEAVVRRD----RRIRVR 142 (142) T ss_pred --------------Cccceeccc--cCceeeEecCCce-eee-eeeecCCCCCCchhHHHHHHHHhhh----hhhccC Confidence 344555544 22221 1233322 222 2345999999999998876555432 222333 No 4 >protein:vir:99101 Length: 142 # NCBI annotation: gp25 # Family: family:all:1084 # MgeID: mge:1608 # MgeName: Qyrzula # Cross-refs: genbank:acc:YP_655705;genbank:gi:109521783;genbank:GeneID:4157823 Probab=72.63 E-value=0.038 Score=28.38 Aligned_cols=115 Identities=18% Similarity=0.250 Sum_probs=46.9 Q ss_pred CCcccccHHHHHHHHHHHHHHHHhhhhccCcc----cccccccc-c-----ccc----ccchHHHHHHHhhhcCCCceee Q lcl|NC_011273. 1 MPLVGLPAQTCAVISQMATTKARQDVMGRGWR----SAGALQPV-S-----NQG----EVGIRSTMKHLLYQNSGVKSFL 66 (156) Q Consensus 1 mplvg~p~~~~~vi~~~a~~~ar~d~~grgw~----s~galqp~-s-----~~g----~vgi~stmkhll~qn~g~~~f~ 66 (156) --|..++.+.. .+.+.++..+-.++...--. .+|.|.-- . +.. ..|+-|+..+-.|.+.| T Consensus 13 ~~l~~~~~~~~-~~~~~~i~~~a~~v~~~Ak~~aPv~tG~Lr~SI~~~~~~~~~~~~~~~~v~~~a~YA~~ve~G----- 86 (142) T protein:vir:99 13 YNPVGAAAQVG-PILRRTHSSLTRQIANETRARVPVLTGHLGRSVREDPQVMVTPFHVSGGVTAHAKYAAAVHEG----- 86 (142) T ss_pred hhHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhCCccchhhhcceeeeeccccccceEEEEeccCccccceeccC----- Confidence 11222333332 23344444443333322211 12333210 0 000 12233444444444333 Q ss_pred eeeecCeeeeeecCccceeeccccCCCCce-eeeCCcccccccccccCCCCchhHHHHHHHHHHHHhhHHHHHHHHHh Q lcl|NC_011273. 67 MYWVEGRTVPITDKTGTHFVRGREVGKPGY-VNIPGRGRVWRNQKWRYPGLQPKRFIESSIAQAVKENRELIRSSVIR 143 (156) Q Consensus 67 m~wvegr~vpitdktgt~~irgre~gkpgy-v~ipgrg~iwr~qkwr~pgl~pkrf~e~~ia~aike~~~~ir~~~~~ 143 (156) |+-|.|+.+ .++.. .+.+|+. +.. 
.+=+|||-+|..|++-|+.+..++- +.-++| T Consensus 87 --------------T~ph~i~pk--~~~al~f~~~g~~-~~~-k~v~hpG~~a~Pfl~~A~~~~~~~~----~~~~~r 142 (142) T protein:vir:99 87 --------------TRPHVIRAK--HAQALHFWWRGRE-VFV-RQVNHPGTRARPYLRNAGEAVVRRD----RRIRVR 142 (142) T ss_pred --------------Cccceeccc--cCceeeEecCCce-eee-eeeecCCCCCCchhHHHHHHHHhhh----hhhccC Confidence 344555544 22221 1233322 222 2345999999999998876555432 222333 No 5 >protein:vir:97982 Length: 140 # NCBI annotation: gp25 # Family: family:all:1084 # MgeID: mge:1482 # MgeName: Orion # Cross-refs: genbank:acc:YP_655121;genbank:gi:109391871;genbank:GeneID:4157345 Probab=67.24 E-value=0.059 Score=27.33 Aligned_cols=112 Identities=18% Similarity=0.189 Sum_probs=51.2 Q ss_pred CCcccccHHH----HHHHHH---HHHHHHHhhhhccCcccccccc------cccccc--c-cchHHHHHHHhhhcCCCce Q lcl|NC_011273. 1 MPLVGLPAQT----CAVISQ---MATTKARQDVMGRGWRSAGALQ------PVSNQG--E-VGIRSTMKHLLYQNSGVKS 64 (156) Q Consensus 1 mplvg~p~~~----~~vi~~---~a~~~ar~d~~grgw~s~galq------p~s~~g--~-vgi~stmkhll~qn~g~~~ 64 (156) ...-.+.-++ .+++.+ +-...|++..- | .+|.|. +..+.+ - ..+.+++++-.|.+.|=.| T Consensus 12 ~~~~~~~~~~~~~~~~~~~~~~~~v~~~ak~~aP---v-dtG~Lr~SI~~~~~~~~~~~~~~~v~~~a~YA~~Ve~GT~p 87 (140) T protein:vir:97 12 IDEAALERESGEHLRAFHRSLTRRIANQSRVAVP---V-RTGNLGRTIGELPQVYTPFRVRGGVEATADYAAPVHEGSRP 87 (140) T ss_pred eCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC---c-cchhhhccceeeeeeCCCceEEEEecCCccchhhhccCCCC Confidence 1111111111 111111 11122222211 1 233332 222221 1 2345667777777777554 Q ss_pred eeeeeecCeeeeeecCccceeeccccCCCCc-eeeeCCcccccccccccCCCCchhHHHHHHHHHHHHhhHHHHHHH Q lcl|NC_011273. 65 FLMYWVEGRTVPITDKTGTHFVRGREVGKPG-YVNIPGRGRVWRNQKWRYPGLQPKRFIESSIAQAVKENRELIRSS 140 (156) Q Consensus 65 f~m~wvegr~vpitdktgt~~irgre~gkpg-yv~ipgrg~iwr~qkwr~pgl~pkrf~e~~ia~aike~~~~ir~~ 140 (156) . .|+.+ +++. ..+..|+ -+--.+=.|||.+|..|++-|+.+. +.|++.|+.. 
T Consensus 88 h-------------------~I~pk--~~k~L~~~~~G~--~~~~k~V~hpG~~a~Pfl~~A~~~~-~~~~~~i~~~ 140 (140) T protein:vir:97 88 H-------------------AIRAR--NAQYLHFWWHGR--EMFRKSVWHPGTRARPFMRNSAQRV-VTNDPRVRMT 140 (140) T ss_pred c-------------------eeecC--CCccceeecCCC--EEEeeeeecCCCCCChhHHHHHHHH-hhhhhhccCC Confidence 3 33333 2222 1233333 2222234699999999999999776 5677888765 No 6 >protein:vir:107545 Length: 140 # NCBI annotation: gp25 # Family: family:all:1084 # MgeID: mge:1481 # MgeName: PG1 # Cross-refs: genbank:acc:NP_943803;genbank:gi:38638428;genbank:GeneID:2657225 Probab=67.24 E-value=0.059 Score=27.33 Aligned_cols=112 Identities=18% Similarity=0.189 Sum_probs=51.2 Q ss_pred CCcccccHHH----HHHHHH---HHHHHHHhhhhccCcccccccc------cccccc--c-cchHHHHHHHhhhcCCCce Q lcl|NC_011273. 1 MPLVGLPAQT----CAVISQ---MATTKARQDVMGRGWRSAGALQ------PVSNQG--E-VGIRSTMKHLLYQNSGVKS 64 (156) Q Consensus 1 mplvg~p~~~----~~vi~~---~a~~~ar~d~~grgw~s~galq------p~s~~g--~-vgi~stmkhll~qn~g~~~ 64 (156) ...-.+.-++ .+++.+ +-...|++..- | .+|.|. +..+.+ - ..+.+++++-.|.+.|=.| T Consensus 12 ~~~~~~~~~~~~~~~~~~~~~~~~v~~~ak~~aP---v-dtG~Lr~SI~~~~~~~~~~~~~~~v~~~a~YA~~Ve~GT~p 87 (140) T protein:vir:10 12 IDEAALERESGEHLRAFHRSLTRRIANQSRVAVP---V-RTGNLGRTIGELPQVYTPFRVRGGVEATADYAAPVHEGSRP 87 (140) T ss_pred eCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC---c-cchhhhccceeeeeeCCCceEEEEecCCccchhhhccCCCC Confidence 1111111111 111111 11122222211 1 233332 222221 1 2345667777777777554 Q ss_pred eeeeeecCeeeeeecCccceeeccccCCCCc-eeeeCCcccccccccccCCCCchhHHHHHHHHHHHHhhHHHHHHH Q lcl|NC_011273. 65 FLMYWVEGRTVPITDKTGTHFVRGREVGKPG-YVNIPGRGRVWRNQKWRYPGLQPKRFIESSIAQAVKENRELIRSS 140 (156) Q Consensus 65 f~m~wvegr~vpitdktgt~~irgre~gkpg-yv~ipgrg~iwr~qkwr~pgl~pkrf~e~~ia~aike~~~~ir~~ 140 (156) . .|+.+ +++. ..+..|+ -+--.+=.|||.+|..|++-|+.+. +.|++.|+.. 
T Consensus 88 h-------------------~I~pk--~~k~L~~~~~G~--~~~~k~V~hpG~~a~Pfl~~A~~~~-~~~~~~i~~~ 140 (140) T protein:vir:10 88 H-------------------AIRAR--NAQYLHFWWHGR--EMFRKSVWHPGTRARPFMRNSAQRV-VTNDPRVRMT 140 (140) T ss_pred c-------------------eeecC--CCccceeecCCC--EEEeeeeecCCCCCChhHHHHHHHH-hhhhhhccCC Confidence 3 33333 2222 1233333 2222234699999999999999776 5677888765 No 7 >protein:vir:106041 Length: 137 # NCBI annotation: gp23 # Family: family:all:1084 # MgeID: mge:1505 # MgeName: Cooper # Cross-refs: genbank:acc:YP_654920;genbank:gi:109392376;genbank:GeneID:4157069 Probab=65.90 E-value=0.029 Score=29.03 Aligned_cols=116 Identities=16% Similarity=0.213 Sum_probs=49.6 Q ss_pred CCc--------ccccHHHHHHHHHHHHHHHHhhhhccCc----cccccccc------ccccc---ccchHHHHHHHhhhc Q lcl|NC_011273. 1 MPL--------VGLPAQTCAVISQMATTKARQDVMGRGW----RSAGALQP------VSNQG---EVGIRSTMKHLLYQN 59 (156) Q Consensus 1 mpl--------vg~p~~~~~vi~~~a~~~ar~d~~grgw----~s~galqp------~s~~g---~vgi~stmkhll~qn 59 (156) ||. ..+.-++-.+ .+.++..+-.++....+ -.+|.|.- ..+.+ .+.+.++.++-.|.+ T Consensus 1 m~~s~~i~i~~~~l~~~v~~~-~k~~l~~~a~~i~~~ak~~aPv~tG~Lr~SI~~~~~~~~~~~~~~~v~~~~~YA~~ve 79 (137) T protein:vir:10 1 MPVTARIHINEPELERQTGAI-FRGKHRSITRRIATQARADVPVRTGNLGRGIQEMPQTYRPFHVGGGVEDNVDYAAPVH 79 (137) T ss_pred CCeeEEEeeCHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHhCCcccchhhcCceeeeeccccceEEEEEecCCCceeeee Confidence 442 2222222222 12222222222222111 12344432 11111 123445555555555 Q ss_pred CCCceeeeeeecCeeeeeecCccceeeccccCCCCceeeeCCcccccccccccCCCCchhHHHHHHHHHHHHhhHHHHHH Q lcl|NC_011273. 60 SGVKSFLMYWVEGRTVPITDKTGTHFVRGREVGKPGYVNIPGRGRVWRNQKWRYPGLQPKRFIESSIAQAVKENRELIRS 139 (156) Q Consensus 60 ~g~~~f~m~wvegr~vpitdktgt~~irgre~gkpgyv~ipgrg~iwr~qkwr~pgl~pkrf~e~~ia~aike~~~~ir~ 139 (156) -|-.| |.|+-+ ++. ++..+..|+-.--.+=+|||.+|..|++-|+.+. 
..+++.|+- T Consensus 80 ~GT~p-------------------h~I~pk--~~k-~l~f~~~G~~v~~k~v~hpG~~a~Pfl~~A~~~~-~~~~~ri~~ 136 (137) T protein:vir:10 80 EGSRP-------------------HRITAR--HAN-ALHFFWHGREVFRKSVWHPGVRPRPFLRNAARRV-VAADPDIHM 136 (137) T ss_pred ecCCC-------------------ceeecc--cCc-eeeeeeCCceEEeeeeecCCCCCCchHHHHHHHH-hhccccccC Confidence 55443 233221 111 1111112221111233599999999999999875 567888875 Q ss_pred H Q lcl|NC_011273. 140 S 140 (156) Q Consensus 140 ~ 140 (156) . T Consensus 137 ~ 137 (137) T protein:vir:10 137 T 137 (137) T ss_pred C Confidence 4 No 8 >protein:vir:96829 Length: 135 # NCBI annotation: ORF033 # Family: family:all:180 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240161;genbank:gi:66395838;genbank:GeneID:5133170 Probab=59.06 E-value=0.16 Score=24.95 Aligned_cols=113 Identities=19% Similarity=0.200 Sum_probs=50.8 Q ss_pred CCcc--cc----------cHHHHHHHHHHHHHHHHhhhh----ccCccccccccc-----ccccc-ccchHHHHHHHhhh Q lcl|NC_011273. 1 MPLV--GL----------PAQTCAVISQMATTKARQDVM----GRGWRSAGALQP-----VSNQG-EVGIRSTMKHLLYQ 58 (156) Q Consensus 1 mplv--g~----------p~~~~~vi~~~a~~~ar~d~~----grgw~s~galqp-----~s~~g-~vgi~stmkhll~q 58 (156) |--+ |+ |-++..++ +.|++.+-+++. -.-=..||.|.- +++.| ..-|.|+..|.++. T Consensus 1 Ma~~~~Gl~~l~~~l~~~~~~~~~~~-~~al~~~a~~v~~~ak~~apvdTG~Lr~SI~~~~~~~g~~~~V~~~~~YA~~v 79 (135) T protein:vir:96 1 MAKVKYGADSIVVDLEKYSKDMEKWV-KKGITKTTLKIYNTAIHLMPVDTGFLRQSTTVDFENGGFTGVVKIGSNYAVYV 79 (135) T ss_pred CchhhhhHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHhCCccchhhhcceeEEeecCcEEEEEecCCCccchh Confidence 3322 21 33433332 223332222221 111112343322 12222 11233666666666 Q ss_pred cCCCceeeeeeecCeeeeeecCccceeeccccCCCCceeeeCCcccccccccccCCCCchhHHHHHHHHHHHHhhHHHHH Q lcl|NC_011273. 
59 NSGVKSFLMYWVEGRTVPITDKTGTHFVRGREVGKPGYVNIPGRGRVWRNQKWRYPGLQPKRFIESSIAQAVKENRELIR 138 (156) Q Consensus 59 n~g~~~f~m~wvegr~vpitdktgt~~irgre~gkpgyv~ipgrg~iwr~qkwr~pgl~pkrf~e~~ia~aike~~~~ir 138 (156) +.| ||.|.++++.--+=++.+....|+ .-++||..|..||.-|+.+.-++-.+.|- T Consensus 80 e~G-------------------T~~~~~~~~~~~~~~~~~~~~~g~-----~~~~~~~~a~pfl~~A~~~~~~~~~~~i~ 135 (135) T protein:vir:96 80 NYG-------------------TGIYATKGSRAHKIPWTYKDPNGK-----WHTTYGQMPQPFWEPAIDAGRQTFEQYFS 135 (135) T ss_pred hcc-------------------cccccCCCccccccccccccCCcc-----eeecCCcCCCcchhHHHHHHHHHHHHhcC Confidence 666 555555543222222222222232 12478999999999888776666555554 No 9 >protein:vir:106506 Length: 137 # NCBI annotation: Pas21 # Family: family:all:1084 # MgeID: mge:1680 # MgeName: phiAsp2 # Cross-refs: genbank:acc:YP_024807;genbank:gi:48697422;genbank:GeneID:2846163 Probab=57.07 E-value=0.084 Score=26.47 Aligned_cols=114 Identities=12% Similarity=0.119 Sum_probs=47.6 Q ss_pred CCcccccH-HHHHHHHHHHHHHHHhhhhccCcccccccc------ccccccc---cchHHHHHHHhhhcCCCceeeeeee Q lcl|NC_011273. 1 MPLVGLPA-QTCAVISQMATTKARQDVMGRGWRSAGALQ------PVSNQGE---VGIRSTMKHLLYQNSGVKSFLMYWV 70 (156) Q Consensus 1 mplvg~p~-~~~~vi~~~a~~~ar~d~~grgw~s~galq------p~s~~g~---vgi~stmkhll~qn~g~~~f~m~wv 70 (156) ..+.|.=+ +.+.-++......|++..- ..+|.|. +..+.|. .++.+++++-.|.+.|= T Consensus 14 ~~~~~~~~~~~~~~~a~~ve~~ak~~aP----v~TG~Lr~SI~~~~~~~~g~~v~~~V~~~~~YA~~ve~GT-------- 81 (137) T protein:vir:10 14 HGLGMDEARKAVNRVVRRTFTRSQILAP----VDTGYLRASGRLVLGRERGAVVIGSVEYTARYAAAVHNGR-------- 81 (137) T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHhcCC----cCchhhhccceeeeeeccccEEEEEecCCcccceeeecCC-------- Confidence 11111111 1111222222223333321 1123222 2112221 22335555555555443 Q ss_pred cCeeeeeecCccceeeccccCCCCceeeeCCcccccccccccCCCCchhHHHHHHHHHHHHhhHHHHHHHHHhhhcc Q lcl|NC_011273. 
71 EGRTVPITDKTGTHFVRGREVGKPGYVNIPGRGRVWRNQKWRYPGLQPKRFIESSIAQAVKENRELIRSSVIRSVTE 147 (156) Q Consensus 71 egr~vpitdktgt~~irgre~gkpgyv~ipgrg~iwr~qkwr~pgl~pkrf~e~~ia~aike~~~~ir~~~~~~~s~ 147 (156) +-|.|+.+.- ..+..+..|+..--.+=+|||.+|..|++-|+.+.+.. ...++.-+ T Consensus 82 -----------~ph~I~pk~~---kaL~f~~~G~~vf~k~V~hPG~k~~PfL~~Al~~~~~~-------~~~~~~~~ 137 (137) T protein:vir:10 82 -----------RALTIRAKGN---GRLKFTVEGRTVYARSVHQPARAGRPYLSQALREVAPQ-------EGFRVTIG 137 (137) T ss_pred -----------CCceeecCCC---ccceeecCCeeEeccceecCCCCCChhhHHHHHHhhcc-------cceeEeeC Confidence 3355655521 23323333443333456899999999999988765432 22222222 No 10 >protein:vir:107099 Length: 137 # NCBI annotation: conserved phage protein # Family: family:all:180 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950610;genbank:gi:119953690;genbank:GeneID:4643108 Probab=36.12 E-value=0.4 Score=22.79 Aligned_cols=110 Identities=22% Similarity=0.156 Sum_probs=39.7 Q ss_pred CCcc--cc----------cHHHHHHHHHHHHHHHHhhhhccCcccccccccccccccc---------------chHHHHH Q lcl|NC_011273. 1 MPLV--GL----------PAQTCAVISQMATTKARQDVMGRGWRSAGALQPVSNQGEV---------------GIRSTMK 53 (156) Q Consensus 1 mplv--g~----------p~~~~~vi~~~a~~~ar~d~~grgw~s~galqp~s~~g~v---------------gi~stmk 53 (156) |--| |+ |-++.+.+ +.|++.+=+++.. .+=++-|+ |||.. -|-|+.. T Consensus 1 Ma~~~~Gl~~l~~~l~~~~~~~~~~~-~~al~~~a~~i~~----~ak~~aPv-dTG~Lr~SI~~~~~~~~~~~~V~~~~~ 74 (137) T protein:vir:10 1 MAKVKYGNWELVKELEDFEKETIRWA-KKGIAKTTTIIHN----SIVSNMPV-DTGYLRESVSMDFKKGGLTGVINIGSE 74 (137) T ss_pred CchhHhhHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHH----HHHHhCCc-CcchhhcCeeEEeeCCcEEEEEecCCC Confidence 4333 22 22222221 2233222222221 11112232 23332 2223344 Q ss_pred HHhhhcCCCceeeeeeecCeeeeeecCccceeeccccCCCCceeeeCCcccccccccccCCCCchhHHHHHHHHHHHHhh Q lcl|NC_011273. 
54 HLLYQNSGVKSFLMYWVEGRTVPITDKTGTHFVRGREVGKPGYVNIPGRGRVWRNQKWRYPGLQPKRFIESSIAQAVKEN 133 (156) Q Consensus 54 hll~qn~g~~~f~m~wvegr~vpitdktgt~~irgre~gkpgyv~ipgrg~iwr~qkwr~pgl~pkrf~e~~ia~aike~ 133 (156) |-+|.+.| ||-|-+++ .++ .+...+..++...-..-.++|..|..||+-|+.+.-.+- T Consensus 75 Ya~~vE~G-------------------T~~~~~~~--~~~-~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA~~~~~~~i 132 (137) T protein:vir:10 75 YAVYVNYG-------------------TGIYAVGP--GGS-RAKNIPWCYKDADGHWHTTKGQHAQPFWEPAIDEGRAFF 132 (137) T ss_pred cccccccC-------------------ccccccCC--Ccc-ccccccceeeccccceeccCCCCCCcchhHHHHHHHHHH Confidence 44443333 22221111 111 122222222222212224799999999997765544433 Q ss_pred HHHHH Q lcl|NC_011273. 134 RELIR 138 (156) Q Consensus 134 ~~~ir 138 (156) .++|- T Consensus 133 ~k~i~ 137 (137) T protein:vir:10 133 NKYFS 137 (137) T ss_pred HHhcC Confidence 33332 No 11 >protein:vir:95062 Length: 116 # NCBI annotation: ORF044 # Family: family:all:180 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240827;genbank:gi:66394711;genbank:GeneID:5133856 Probab=36.00 E-value=0.59 Score=21.85 Aligned_cols=114 Identities=13% Similarity=0.167 Sum_probs=46.3 Q ss_pred HHHHHHHHHHHHHhhhhccCccccccccccccccccchHHHHHHHhhhcCCCceeeeeeecCeeeeeecCccceeeccc- Q lcl|NC_011273. 11 CAVISQMATTKARQDVMGRGWRSAGALQPVSNQGEVGIRSTMKHLLYQNSGVKSFLMYWVEGRTVPITDKTGTHFVRGR- 89 (156) Q Consensus 11 ~~vi~~~a~~~ar~d~~grgw~s~galqp~s~~g~vgi~stmkhll~qn~g~~~f~m~wvegr~vpitdktgt~~irgr- 89 (156) -.-+.+.++.++-.+|.. .+=++-|+ |||+..=|=+ + -.++.|+.--+-- -...-..+-.-||-|-++|. T Consensus 1 v~~~v~~~~~~~~~~i~~----~ak~~apv-~TG~Lr~SI~--~-~~~~~~~~~~V~~-~~~Ya~yvE~GTg~~~~~~~~ 71 (116) T protein:vir:95 1 MERWVKRGIAKTTAKIHN----TIISLMPV-DTGYLRESVT--M-DFKDGGFTGVINI-GSEYAIYVNYGTGIYATGAGG 71 (116) T ss_pred ChHHHHHHHHHHHHHHHH----HHHhhCCc-ccccccccee--E-EeecCcEEEEEec-CCCccceeecCccccccCCCc Confidence 222333344333333322 12223342 4555432211 1 1123332211100 00000111222555555543 Q ss_pred -cCCCCceeeeCCcccccccccccCCCCchhHHHHHHHHHHHHhhHHHHHHHHHhhhc Q lcl|NC_011273. 
90 -EVGKPGYVNIPGRGRVWRNQKWRYPGLQPKRFIESSIAQAVKENRELIRSSVIRSVT 146 (156) Q Consensus 90 -e~gkpgyv~ipgrg~iwr~qkwr~pgl~pkrf~e~~ia~aike~~~~ir~~~~~~~s 146 (156) ...+..+.|-...|+ | -+++|..|..||+-|+.+.-++-.++ || T Consensus 72 ~~~~~~~~~~~~~~g~-~----~~t~g~~a~Pfl~pA~~~~~~~i~k~--------is 116 (116) T protein:vir:95 72 SRAKNIPWSYKDANGK-W----HTTKGQHAQPFWEPAIDAGRAFFNKY--------FS 116 (116) T ss_pred cccccccceeecCccc-e----eeCCCCCCCcchHHHHHHHHHHHHHh--------hC Confidence 334444555554443 1 25889999999998765544433333 33 No 12 >protein:vir:5978 Length: 144 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690678;genbank:geneid:6329146;genbank:gi:22855072;interpro:IPR011693;uniprot:O48447;genbank:GeneID:955318 Probab=33.82 E-value=0.69 Score=21.48 Aligned_cols=117 Identities=21% Similarity=0.273 Sum_probs=45.4 Q ss_pred CCccc-----------------ccHHHHHHHHHHHHHHHHhhhhc----cCccccccccc-----ccccc-ccchHHHHH Q lcl|NC_011273. 1 MPLVG-----------------LPAQTCAVISQMATTKARQDVMG----RGWRSAGALQP-----VSNQG-EVGIRSTMK 53 (156) Q Consensus 1 mplvg-----------------~p~~~~~vi~~~a~~~ar~d~~g----rgw~s~galqp-----~s~~g-~vgi~stmk 53 (156) |+..- +|-++...|. .|+..+-+++.. .-=..||.|.- +++.| .+-|-|+.. T Consensus 1 m~~ms~~i~~~g~~~l~~~l~~~~~~~~~~v~-~~l~~~a~~i~~~ak~~apv~TG~Lr~SI~~~~~~~g~~~~V~~~~~ 79 (144) T protein:vir:59 1 MALMSVRIDPSWRRIMSRNVRTFSGHVLTQVE-QVIIKTAEKIAGLAASLAPVDEGNLKNSIQIDYKNNGLTAEITVGAE 79 (144) T ss_pred CCcceeeehhHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHhCCccchhhhcCeeEEeecCcEEEEEecCCC Confidence 32211 1233333322 344443333332 11112333321 01111 111222222 Q ss_pred HHhhhcCCCceeeeeeecCeeeeeecCccceeeccccCCCCceeeeCCcccccccccccCCCCchhHHHHHHHHHHHHhh Q lcl|NC_011273. 54 HLLYQNSGVKSFLMYWVEGRTVPITDKTGTHFVRGREVGKPGYVNIPGRGRVWRNQKWRYPGLQPKRFIESSIAQAVKEN 133 (156) Q Consensus 54 hll~qn~g~~~f~m~wvegr~vpitdktgt~~irgre~gkpgyv~ipgrg~iwr~qkwr~pgl~pkrf~e~~ia~aike~ 133 (156) +-.|.+.| ||.|-+++.--..|..-+.++.|+.+ +++|..|.-||.-|+.+. 
T Consensus 80 YA~~vE~G-------------------T~~~~~~~~~~~~~~~~~~~~~g~~~-----~t~g~~a~Pfl~pA~~~~---- 131 (144) T protein:vir:59 80 YAIYVEYG-------------------TGIYAVDGNGRKTPWTYYSPKLGRYV-----RTQGAPAQPFFWPAVEEG---- 131 (144) T ss_pred ccchhhcC-------------------ccccccCCCcccccccccccccccee-----cCCCCCCCcchhHHHHHH---- Confidence 22222222 33344433322223333444555433 468999999999877654 Q ss_pred HHHHHHHHHhhhcc Q lcl|NC_011273. 134 RELIRSSVIRSVTE 147 (156) Q Consensus 134 ~~~ir~~~~~~~s~ 147 (156) ++.|..-+-+++ | T Consensus 132 ~~~~~~~i~~~~-g 144 (144) T protein:vir:59 132 GEYFEREMRRLR-G 144 (144) T ss_pred HHHHHHHHHHhc-C Confidence 444444333333 2 No 13 >protein:vir:78077 Length: 141 # NCBI annotation: gp9 # Family: family:all:180 # MgeID: mge:1844 # MgeName: P35 # Cross-refs: genbank:acc:YP_001468793;genbank:gi:157325374;genbank:GeneID:5601839 Probab=27.98 E-value=1.7 Score=19.37 Aligned_cols=133 Identities=11% Similarity=0.075 Sum_probs=53.6 Q ss_pred CCcccccHHHHHHHHHHHHHH-HHhhhhccCccccccccccccccccchHHHHHHHhhhcCCCceeeeeeecCeeeeeec Q lcl|NC_011273. 1 MPLVGLPAQTCAVISQMATTK-ARQDVMGRGWRSAGALQPVSNQGEVGIRSTMKHLLYQNSGVKSFLMYWVEGRTVPITD 79 (156) Q Consensus 1 mplvg~p~~~~~vi~~~a~~~-ar~d~~grgw~s~galqp~s~~g~vgi~stmkhll~qn~g~~~f~m~wvegr~vpitd 79 (156) +-.-.+-.++...+- .++.. |.+...=+-=.-+..+-|+ |+|+.-=| ++|. ++..|..--+-.= .-.-+.+-. T Consensus 8 ~~~~~~~~~~~k~~~-~~~~~~a~~~~~~~ie~~ak~~~pv-dtG~L~~S--I~~~-v~~~g~~~~V~~~-~~YA~yVE~ 81 (141) T protein:vir:78 8 SNIPKARKLIEKKVL-QALEDIGEHMTTELAEGGHGVTSNN-DTGEYAQK--SGYK-VRKSSKEVIVGNS-SDYAIYYEF 81 (141) T ss_pred HHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHhhhhcccc-ccchhhcc--eeee-eecCCcEEEEecC-CCccceeec Confidence 111111111221111 11111 1111100001123344553 57765422 2222 2333332111000 001112223 Q ss_pred CccceeeccccCCCCceeeeCCcccccccccccCCCCchhHHHHHHHHHHHHhhHHHHHHHHHhhhc Q lcl|NC_011273. 
80 KTGTHFVRGREVGKPGYVNIPGRGRVWRNQKWRYPGLQPKRFIESSIAQAVKENRELIRSSVIRSVT 146 (156) Q Consensus 80 ktgt~~irgre~gkpgyv~ipgrg~iwr~qkwr~pgl~pkrf~e~~ia~aike~~~~ir~~~~~~~s 146 (156) -||.+-++|.- .|.++.|....|+ |. +..|..|..||..|+.+.-.+..+.|..+.-.+ . T Consensus 82 GTG~~~~~~~g-rk~~w~y~~~~g~-~~----~t~G~~aqpFl~~A~~~~~~~i~~~i~~~~~~l-~ 141 (141) T protein:vir:78 82 GTGEKSERGGG-KAGGWFYMDKKGH-WH----FTRGSQASKRMRYTFRDEQDKVRVFTERALRGI-N 141 (141) T ss_pred CCcccccCCCC-CcCcceeecCCCe-eE----eccCCCCchhhhhhHHhhHHHHHHHHHHHhhcc-C Confidence 36766666533 3555666655565 32 356999999998877766555555555443322 2 No 14 >protein:vir:105330 Length: 137 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950673;genbank:gi:119967843;genbank:GeneID:4643209 Probab=27.35 E-value=0.81 Score=21.07 Aligned_cols=115 Identities=21% Similarity=0.139 Sum_probs=41.3 Q ss_pred CCcc--cc----------cHHHHHHHHHHHHHHHHhhhhc----cCccccccccc-----ccccc-ccchHHHHHHHhhh Q lcl|NC_011273. 1 MPLV--GL----------PAQTCAVISQMATTKARQDVMG----RGWRSAGALQP-----VSNQG-EVGIRSTMKHLLYQ 58 (156) Q Consensus 1 mplv--g~----------p~~~~~vi~~~a~~~ar~d~~g----rgw~s~galqp-----~s~~g-~vgi~stmkhll~q 58 (156) |--+ |+ |-++.+.+ ..|++.+=.++.. .-=..||.|.- +++.| ..-|-|+..|-.+. T Consensus 1 Ma~~~~G~~~l~~~l~~~~~~~~~~~-~~al~~~a~~i~~~ak~~aPv~TG~Lr~SI~~~~~~~~~~~~V~~~~~YA~~v 79 (137) T protein:vir:10 1 MAKVKYGNWDLVKELEEFEKETIRWA-KKGIAKTTTIIHNSIVSNMPVDTGYLRESVSMDFKKGGLTGVINIGSEYAVYV 79 (137) T ss_pred CccchhCHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHhCCcCcchhhcCeeeEecCCcEEEEEecCCcccccc Confidence 3322 22 33333332 2333333222221 11112333321 11111 00111222222222 Q ss_pred cCCCceeeeeeecCeeeeeecCccceeeccccCCCCceeeeCCcccccccccccCCCCchhHHHHHHHHHHHHhhHHHHH Q lcl|NC_011273. 
59 NSGVKSFLMYWVEGRTVPITDKTGTHFVRGREVGKPGYVNIPGRGRVWRNQKWRYPGLQPKRFIESSIAQAVKENRELIR 138 (156) Q Consensus 59 n~g~~~f~m~wvegr~vpitdktgt~~irgre~gkpgyv~ipgrg~iwr~qkwr~pgl~pkrf~e~~ia~aike~~~~ir 138 (156) +.| ||.|-+++ .+ ......|.+++.+...--++||..|..||+-|+.+..++-.++|- T Consensus 80 E~G-------------------T~~~~~~~--~~-~~~~~~~~~~~~~~~~~~~t~g~~a~Pfl~pA~~~~~~~i~k~i~ 137 (137) T protein:vir:10 80 NYG-------------------TGIYAVGP--GG-SRAKNIPWRYKDADGHWHTTKGQHAQPFWEPAIDEGRAFFNKYFS 137 (137) T ss_pred ccC-------------------ccccccCC--Cc-ccccccceeeeccccccccCCCCCCCcchhHHHHHHHHHHHHhhC Confidence 222 22222221 11 222333433433332222579999999999776554443333332 No 15 >protein:vir:105916 Length: 149 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004379;genbank:gi:122891834;genbank:GeneID:4712387 Probab=26.07 E-value=1.3 Score=19.91 Aligned_cols=108 Identities=21% Similarity=0.251 Sum_probs=41.6 Q ss_pred CCcc--cc----------cHHHHHHHHHHHHHHHHhhhhccCccccccccccccccccc---------------hHHHHH Q lcl|NC_011273. 1 MPLV--GL----------PAQTCAVISQMATTKARQDVMGRGWRSAGALQPVSNQGEVG---------------IRSTMK 53 (156) Q Consensus 1 mplv--g~----------p~~~~~vi~~~a~~~ar~d~~grgw~s~galqp~s~~g~vg---------------i~stmk 53 (156) |-=| |+ +.++.. ..+.|+..+-+++.+ .+-++-|+ |||+.- |-|+.. T Consensus 13 Ma~v~~Gld~l~~~l~~~~~~~~~-~~~~~l~~~a~~v~~----~ak~~aPv-dTG~L~~SI~~~~~~~g~~~~V~~~~~ 86 (149) T protein:vir:10 13 MAKVKYGADSMVVELDKFDKKIEE-WVKKGIAKTTTKIYN----TAVALAPV-DLGFLEESIDFKYFDGGLSSVISVGAD 86 (149) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHH----HHHHhCCc-ccchhhccceEEecCCcEEEEEecCCC Confidence 2111 21 222222 112222222222211 11222233 344421 112222 Q ss_pred HHhhhcCCCceeeeeeecCeeeeeecCccceeec--cccCCCCceeeeCCcccccccccccCCCCchhHHHHHHHHHHHH Q lcl|NC_011273. 
54 HLLYQNSGVKSFLMYWVEGRTVPITDKTGTHFVR--GREVGKPGYVNIPGRGRVWRNQKWRYPGLQPKRFIESSIAQAVK 131 (156) Q Consensus 54 hll~qn~g~~~f~m~wvegr~vpitdktgt~~ir--gre~gkpgyv~ipgrg~iwr~qkwr~pgl~pkrf~e~~ia~aik 131 (156) |-++.+.| ||.+-++ ++...+....|..+.|+. -++||.+|..||+-|+.+.-+ T Consensus 87 YA~~vE~G-------------------T~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~t~g~~a~PFl~pA~~~~k~ 142 (149) T protein:vir:10 87 YAIYVEYG-------------------TGIYATGPGGSRATKIPWSFKGDDGEW-----YTTYGQAPQPFWNPAIDAGRK 142 (149) T ss_pred cccccccC-------------------ccccccCCcccccccccceeeccccce-----ecCCCCCCCcchhHHHHHHHH Confidence 22222222 2222222 122222223333333321 257999999999988877766 Q ss_pred hhHHHHH Q lcl|NC_011273. 132 ENRELIR 138 (156) Q Consensus 132 e~~~~ir 138 (156) +-.++|- T Consensus 143 ~i~~~i~ 149 (149) T protein:vir:10 143 TFEQYFS 149 (149) T ss_pred HHHHhhC Confidence 6666655 No 16 >protein:vir:94108 Length: 149 # NCBI annotation: ORF029 # Family: family:all:180 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240238;genbank:gi:66395914;genbank:GeneID:5133277 Probab=24.04 E-value=1 Score=20.50 Aligned_cols=108 Identities=21% Similarity=0.258 Sum_probs=39.9 Q ss_pred CCcc--c----------ccHHHHHHHHHHHHHHHHhhhhccCccccccccccccccccc---------------hHHHHH Q lcl|NC_011273. 1 MPLV--G----------LPAQTCAVISQMATTKARQDVMGRGWRSAGALQPVSNQGEVG---------------IRSTMK 53 (156) Q Consensus 1 mplv--g----------~p~~~~~vi~~~a~~~ar~d~~grgw~s~galqp~s~~g~vg---------------i~stmk 53 (156) |-=| | ++.++.. ....|++.+-+++.. .+-++-|+ |||+.- |-|+.. T Consensus 13 Ma~~~~Gld~l~~~L~~~~~~~~~-~~~~al~~~a~~v~~----~ak~~aPv-dTG~Lr~SI~~~~~~~g~~~~V~~~~~ 86 (149) T protein:vir:94 13 MAKVKYGADSMVVELDKFDKKIEE-WVKKGIAKTTTKIYN----TAVALAPV-DLGFLEESIDFKYFDGGLSSVISVGAD 86 (149) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHH----HHHHhCCc-ccchhhcCeeEEeeCCcEEEEEecCCC Confidence 2111 1 1222221 112222222222211 12223333 344432 112222 Q ss_pred HHhhhcCCCceeeeeeecCeeeeeecCccceeec--cccCCCCceeeeCCcccccccccccCCCCchhHHHHHHHHHHHH Q lcl|NC_011273. 
54 HLLYQNSGVKSFLMYWVEGRTVPITDKTGTHFVR--GREVGKPGYVNIPGRGRVWRNQKWRYPGLQPKRFIESSIAQAVK 131 (156) Q Consensus 54 hll~qn~g~~~f~m~wvegr~vpitdktgt~~ir--gre~gkpgyv~ipgrg~iwr~qkwr~pgl~pkrf~e~~ia~aik 131 (156) |-++.+.| ||-+-++ ++..++-...|..+.|+ .-++||.+|..||+-|+.+..+ T Consensus 87 YA~~VE~G-------------------T~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~g~~a~PFl~pA~~~~~~ 142 (149) T protein:vir:94 87 YAIYVEYG-------------------TGIYATGPGGSRATKIPWSFKGDDGE-----WYTTYGQAPQPFWNPAIDAGRK 142 (149) T ss_pred cccccccC-------------------ccccccCCCccccccccceeecCccc-----eecCCCCCCCcchHHHHHHHHH Confidence 22222222 1212111 12222222233333332 1237999999999988777665 Q ss_pred hhHHHHH Q lcl|NC_011273. 132 ENRELIR 138 (156) Q Consensus 132 e~~~~ir 138 (156) +-.++|- T Consensus 143 ~i~~~i~ 149 (149) T protein:vir:94 143 TFEQYFS 149 (149) T ss_pred HHHHhhC Confidence 5555554 No 17 >protein:vir:94796 Length: 137 # NCBI annotation: ORF050 # Family: family:all:180 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240540;genbank:gi:66396237;genbank:GeneID:5133576 Probab=23.09 E-value=1.3 Score=19.94 Aligned_cols=114 Identities=18% Similarity=0.115 Sum_probs=39.3 Q ss_pred CCcc--cc----------cHHHHHHHHHHHHHHHHhhh----hccCccccccccc------cccccccchHHHHHHHhhh Q lcl|NC_011273. 1 MPLV--GL----------PAQTCAVISQMATTKARQDV----MGRGWRSAGALQP------VSNQGEVGIRSTMKHLLYQ 58 (156) Q Consensus 1 mplv--g~----------p~~~~~vi~~~a~~~ar~d~----~grgw~s~galqp------~s~~g~vgi~stmkhll~q 58 (156) |--| |+ +-++.+.+ +.|++.+-.++ +..-=..||.|.- -.+...+-|-|+..|-.|. T Consensus 1 Ma~~~~G~~~l~~~L~~~~~~~~~~~-~~al~~~a~~v~~~ak~~aPvdTG~Lr~SI~~~~~~~~~~~~V~~~~~YA~~v 79 (137) T protein:vir:94 1 MAKVKYGNWDLVKELENYERDIERWV-KRGIAKTTVKIHNTIISLMPVDTGYLRESVTMDFKDGGFTGVINIGSEYAIYV 79 (137) T ss_pred CchhHHhHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHhCCcCcchhhcCceeEeecCcEEEEEecCCCccccc Confidence 3222 21 22332221 12222222111 1111112333321 1110011122333333333 Q ss_pred cCCCceeeeeeecCeeeeeecCccceeeccccCCCCceeeeCCcccccccccc-cCCCCchhHHHHHHHHHHHHhhHHHH Q lcl|NC_011273. 
59 NSGVKSFLMYWVEGRTVPITDKTGTHFVRGREVGKPGYVNIPGRGRVWRNQKW-RYPGLQPKRFIESSIAQAVKENRELI 137 (156) Q Consensus 59 n~g~~~f~m~wvegr~vpitdktgt~~irgre~gkpgyv~ipgrg~iwr~qkw-r~pgl~pkrf~e~~ia~aike~~~~i 137 (156) +.| ||.|-++|+- .-+.-.+..++... .+| +.+|..|..||+-|+.+.-++-.++| T Consensus 80 E~G-------------------T~~~~~~~~~---~~~~~~~~~~~~~~-~~~~~t~g~~a~PFl~pA~~~~~~~~~~~l 136 (137) T protein:vir:94 80 NYG-------------------TGIYATGAGG---SRAKKIPWSYKDAN-GKWHTTKGQHAQPFWEPAIDAGRVFFNKYF 136 (137) T ss_pred ccC-------------------ccccccCCCc---ccccccccceeccC-CceeecCCcCCCcchHHHHHHHHHHHHHhh Confidence 322 4444444321 01111111111111 122 25689999999988766555544444 Q ss_pred H Q lcl|NC_011273. 138 R 138 (156) Q Consensus 138 r 138 (156) - T Consensus 137 ~ 137 (137) T protein:vir:94 137 S 137 (137) T ss_pred C Confidence 3 No 18 >protein:vir:96121 Length: 137 # NCBI annotation: ORF040 # Family: family:all:180 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240082;genbank:gi:66395767;genbank:GeneID:5133101 Probab=22.39 E-value=1 Score=20.50 Aligned_cols=124 Identities=18% Similarity=0.200 Sum_probs=39.0 Q ss_pred CCcc--cc----------cHHHHHHHHHHHHHHHHhhhhccCccccccccccccccccchHHHHHHHhhhcCCCceeeee Q lcl|NC_011273. 1 MPLV--GL----------PAQTCAVISQMATTKARQDVMGRGWRSAGALQPVSNQGEVGIRSTMKHLLYQNSGVKSFLMY 68 (156) Q Consensus 1 mplv--g~----------p~~~~~vi~~~a~~~ar~d~~grgw~s~galqp~s~~g~vgi~stmkhll~qn~g~~~f~m~ 68 (156) |--| |+ +-++.+++ +.|+..+-.++ ...+-++-|+ |||..-=| .. .-..++|+..-+-- T Consensus 1 Ma~~~~G~~~l~~~l~~~~~~~~~~~-~~~l~~~a~~~----~~~ak~~~pv-dTG~L~~S--i~-~~~~~~g~~~~V~~ 71 (137) T protein:vir:96 1 MAKVKYGNWDLVAELEDYRDEMEEWV-KKGILKTTLAI----YNTAVALAPV-DLGFLKES--ID-FKVTDGGFSSVISV 71 (137) T ss_pred CchhHhhHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHH----HHHHHHhCCc-CccchhcC--ce-eEeecCceEEEEec Confidence 3221 11 12222221 12222221111 1122233343 45543211 11 11122222110000 Q ss_pred eecCeeeeeecCccceeeccccCCCCceeeeCCcccccccccc-cCCCCchhHHHHHHHHHHHHhhHHHHH Q lcl|NC_011273. 
69 WVEGRTVPITDKTGTHFVRGREVGKPGYVNIPGRGRVWRNQKW-RYPGLQPKRFIESSIAQAVKENRELIR 138 (156) Q Consensus 69 wvegr~vpitdktgt~~irgre~gkpgyv~ipgrg~iwr~qkw-r~pgl~pkrf~e~~ia~aike~~~~ir 138 (156) =+ .--..+-.-||-|-.+|+ +++ +...+..++ +-+.+| ++||.+|..||.-|+.+.-++-.++|- T Consensus 72 ~~-~YA~yvE~GT~~~~~~~~--~~~-~~~~~~~~~-~~~~~~~~t~g~~a~pFl~pA~~~~~~~i~k~i~ 137 (137) T protein:vir:96 72 GA-EYAIYVEFGTGIYATGPG--GSR-ARKLPWTYK-GDDGEWHTTYGQQAQPFWNPAIDEGRKVFNRYFS 137 (137) T ss_pred CC-CcccccccCccccccCCC--ccc-cccccceee-ccCcceeecCCCCCCcchhHHHHHHHHHHHHhhC Confidence 00 000001111222222221 111 111111111 122333 479999999999877655544444443 No 19 >protein:vir:97327 Length: 116 # NCBI annotation: ORF041 # Family: family:all:180 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240615;genbank:gi:66396305;genbank:GeneID:5133683 Probab=21.08 E-value=1.3 Score=19.87 Aligned_cols=114 Identities=13% Similarity=0.153 Sum_probs=40.9 Q ss_pred HHHHHHHHHHHHHHhhhhccCccccccccccccccccchHHHHHHHhhhcCCCceeeeeeecCeeeeeecCccceeeccc Q lcl|NC_011273. 10 TCAVISQMATTKARQDVMGRGWRSAGALQPVSNQGEVGIRSTMKHLLYQNSGVKSFLMYWVEGRTVPITDKTGTHFVRGR 89 (156) Q Consensus 10 ~~~vi~~~a~~~ar~d~~grgw~s~galqp~s~~g~vgi~stmkhll~qn~g~~~f~m~wvegr~vpitdktgt~~irgr 89 (156) +.. +.+.++.++-.++... +=++-|+ |||+..=|= ++ -.++.|+.--+-- -.-.-..+-.-||-|-++|. T Consensus 1 v~~-~v~~~~~~~~~~i~~~----ak~~aPv-~TG~Lr~SI--~~-~~~~~~~~~~V~~-~~~YA~yvE~GTg~~~~~~~ 70 (116) T protein:vir:97 1 MER-WVKRGIAKTTAKIHNT----IISLMPV-DTGYLRESV--TM-DFKDGGFTGVINI-GSEYAIYVNYGTGIYATGAG 70 (116) T ss_pred ChH-HHHHHHHHHHHHHHHH----HHHhCCc-Ccccccccc--eE-EeecCcEEEEEec-CCCcccccccCCcccccCCC Confidence 222 2233333333222211 1112232 445442211 11 1122222111000 00000011112444444443 Q ss_pred c--CCCCceeeeCCcccccccccccCCCCchhHHHHHHHHHHHHhhHHHHH Q lcl|NC_011273. 
90 E--VGKPGYVNIPGRGRVWRNQKWRYPGLQPKRFIESSIAQAVKENRELIR 138 (156) Q Consensus 90 e--~gkpgyv~ipgrg~iwr~qkwr~pgl~pkrf~e~~ia~aike~~~~ir 138 (156) - .-+.+..|....|+ | -+++|.+|..||.-|+.+.-++-.++|- T Consensus 71 ~~~~~~~~~~~~~~~g~-~----~~t~g~~a~Pfl~pA~~~~~~~i~k~i~ 116 (116) T protein:vir:97 71 GSRAKKIPWSYKDANGK-W----HTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) T ss_pred cccccccceeeecCCce-e----eecCCcCCCcchHHHHHHHHHHHHHhhC Confidence 1 12223334333333 1 1478999999999876655444433332 No 20 >protein:vir:1243 Length: 116 # NCBI annotation: similar to phage Spp1 gp16.1 # Family: family:all:180 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510942;genbank:gi:17426276;genbank:GeneID:927389 Probab=21.08 E-value=1.3 Score=19.87 Aligned_cols=114 Identities=13% Similarity=0.153 Sum_probs=40.9 Q ss_pred HHHHHHHHHHHHHHhhhhccCccccccccccccccccchHHHHHHHhhhcCCCceeeeeeecCeeeeeecCccceeeccc Q lcl|NC_011273. 10 TCAVISQMATTKARQDVMGRGWRSAGALQPVSNQGEVGIRSTMKHLLYQNSGVKSFLMYWVEGRTVPITDKTGTHFVRGR 89 (156) Q Consensus 10 ~~~vi~~~a~~~ar~d~~grgw~s~galqp~s~~g~vgi~stmkhll~qn~g~~~f~m~wvegr~vpitdktgt~~irgr 89 (156) +.. +.+.++.++-.++... +=++-|+ |||+..=|= ++ -.++.|+.--+-- -.-.-..+-.-||-|-++|. T Consensus 1 v~~-~v~~~~~~~~~~i~~~----ak~~aPv-~TG~Lr~SI--~~-~~~~~~~~~~V~~-~~~YA~yvE~GTg~~~~~~~ 70 (116) T protein:vir:12 1 MER-WVKRGIAKTTAKIHNT----IISLMPV-DTGYLRESV--TM-DFKDGGFTGVINI-GSEYAIYVNYGTGIYATGAG 70 (116) T ss_pred ChH-HHHHHHHHHHHHHHHH----HHHhCCc-Ccccccccc--eE-EeecCcEEEEEec-CCCcccccccCCcccccCCC Confidence 222 2233333333222211 1112232 445442211 11 1122222111000 00000011112444444443 Q ss_pred c--CCCCceeeeCCcccccccccccCCCCchhHHHHHHHHHHHHhhHHHHH Q lcl|NC_011273. 
90 E--VGKPGYVNIPGRGRVWRNQKWRYPGLQPKRFIESSIAQAVKENRELIR 138 (156) Q Consensus 90 e--~gkpgyv~ipgrg~iwr~qkwr~pgl~pkrf~e~~ia~aike~~~~ir 138 (156) - .-+.+..|....|+ | -+++|.+|..||.-|+.+.-++-.++|- T Consensus 71 ~~~~~~~~~~~~~~~g~-~----~~t~g~~a~Pfl~pA~~~~~~~i~k~i~ 116 (116) T protein:vir:12 71 GSRAKKIPWSYKDANGK-W----HTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) T ss_pred cccccccceeeecCCce-e----eecCCcCCCcchHHHHHHHHHHHHHhhC Confidence 1 12223334333333 1 1478999999999876655444433332 Done!