Query         lcl|NC_013694.1_cdsid_YP_003358723.1 [gene=20] [protein=gp20] [protein_id=YP_003358723.1] [location=13568..13960]
Match_columns 130
No_of_seqs    20 out of 23
Neff          3.6
Searched_HMMs 1612
Date          Thu Nov 7 13:43:53 2013
Command       /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_20 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_20_vs_rec_db.hhr

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM Template HMM
  1 protein:vir:78298 Length: 131  100.0 1.5E-65 9.1E-69  376.0  10.4  130 1-130     1-130 (131)
  2 protein:vir:78503 Length: 131  100.0 1.5E-65 9.1E-69  376.0  10.4  130 1-130     1-130 (131)
  3 protein:vir:2347 Length: 131 # 100.0 1.5E-65 9.1E-69  376.0  10.4  130 1-130     1-130 (131)
  4 protein:vir:7776 Length: 119 # 100.0 5.7E-59 3.6E-62  339.9   9.3  117 1-117     1-119 (119)
  5 protein:vir:104091 Length: 111 100.0 2.7E-49 1.7E-52  286.8   7.5  110 1-119     1-111 (111)
  6 protein:vir:4230 Length: 111 # 100.0 4.6E-49 2.9E-52  285.6   8.5  110 1-120     1-111 (111)
  7 protein:vir:2435 Length: 111 # 100.0 4.8E-49   3E-52  285.5   8.1  110 1-120     1-111 (111)
  8 protein:vir:96121 Length: 137   88.0   0.023 1.4E-05   29.6   8.8  114 1-130     1-122 (137)
  9 protein:vir:94490 Length: 137   86.4   0.033   2E-05   28.7   8.7  118 1-130     1-126 (137)
 10 protein:vir:97427 Length: 137   86.4   0.033   2E-05   28.7   8.7  118 1-130     1-126 (137)
 11 protein:vir:93738 Length: 137   86.4   0.033   2E-05   28.7   8.7  118 1-130     1-126 (137)
 12 protein:vir:105330 Length: 137  84.7   0.051 3.2E-05   27.7   8.9  114 1-130     1-122 (137)
 13 protein:vir:105467 Length: 144  83.9  0.0066 4.1E-06   32.5   3.7  114 1-130     1-129 (144)
 14 protein:vir:94796 Length: 137   82.8   0.068 4.2E-05   27.0   8.7  118 1-130     1-126 (137)
 15 protein:vir:95894 Length: 137   76.5    0.13 8.4E-05   25.4   8.6  118 1-130     1-126 (137)
 16 protein:vir:107099 Length: 137  74.5    0.16 9.8E-05   25.0   8.8  118 1-130     1-126 (137)
 17 protein:vir:94108 Length: 149   67.7    0.24 0.00015   24.0   7.5  114 1-130     13-134 (149)
 18 protein:vir:96829 Length: 135   66.5    0.27 0.00017   23.7   8.7  118 1-130     1-124 (135)
 19 protein:vir:94654 Length: 142   65.7    0.27 0.00017   23.7   7.4  114 1-130     1-124 (142)
 20 protein:vir:105916 Length: 149  65.7    0.27 0.00017   23.7   7.4  114 1-130     13-134 (149)
 21 protein:vir:5978 Length: 144 #  57.5    0.43 0.00027   22.6   8.7  110 1-130     4-129 (144)
 22 protein:vir:107545 Length: 140  54.8     0.5 0.00031   22.3   6.9  121 1-130     1-138 (140)
 23 protein:vir:97982 Length: 140   54.8     0.5 0.00031   22.3   6.9  121 1-130     1-138 (140)
 24 protein:vir:99924 Length: 87 #  29.3     1.4 0.00085   19.8   4.9   87 1-128     1-87 (87)
 25 protein:vir:102441 Length: 137  28.8    0.73 0.00045   21.3   3.3  116 1-130     4-127 (137)
 26 protein:vir:102963 Length: 163  25.6     1.7  0.0011   19.3   4.8  116 1-130     1-151 (163)
 27 protein:vir:79034 Length: 141   24.7     1.6   0.001   19.4   4.5  116 1-130     1-132 (141)
 28 protein:vir:95062 Length: 116   23.2     2.3  0.0014   18.6   7.6   94 20-130    1-101 (116)
 29 protein:vir:102338 Length: 116  22.1     2.2  0.0013   18.7   4.6  105 20-130    1-112 (116)

No 1
>protein:vir:78298 Length: 131 # NCBI annotation: gp18 # Family: family:all:2819 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491670;genbank:gi:157786494;genbank:GeneID:5625776
Probab=100.00 E-value=1.5e-65 Score=376.03 Aligned_cols=130 Identities=52% Similarity=0.787 Sum_probs=129.6

Q ss_pred CcccccchhhhhhhhcchhHHHHHHHHHHHHHHHHHHHHHHHhhccCCCcccCccccceeecccCCcceEEeecCCCccc
Q lcl|NC_013694.
1 MAKLIPRRRLNHIVAHLAETKAAIRREAREVEGRARRNLAEARASTTHSKIVGPGHLTKIGSAADDPDVLVYMDAPNPMA 80 (130) Q Consensus 1 MA~lyg~~~~n~vva~~~gv~~~v~~ea~~i~~RAeanLa~arast~~~kI~g~~~~t~I~~~~G~vD~~V~L~d~nAlA 80 (130) ||+||++++||++|||||+||++|+.|+|+|.+|||+||++||++|.|+||+||+|+++||+++|||||||+|||||||| T Consensus 1 ma~~~~~~~ln~vvA~l~~vk~~v~~E~~~v~~RA~~NLa~Arast~~~k~~gp~hla~I~~~~gdvD~~v~Ldapna~a 80 (131) T protein:vir:78 1 MPLYYGRSGLNKVVSHLPGVVHEMRSEADEVADRAKANLAAARASTQWEKIHGPDHLTKITRTNGSVDAYVNMEAPSPES 80 (131) T ss_pred CcccccchhhhhhhhhchhHHHHHHHHHHhhhHHHHHHHHHHhhcCcccccccCCCcceeeeeeCCcceEEeecCCCchh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eeeccCccccccCCCCCCCCCCCCcceeeccccccCcccccCcccccCCC Q lcl|NC_013694. 81 IEYGHGPSGYFDPDKYGKVTKAPAGLYILNRAAGIAGSMVTPSMGRRGVK 130 (130) Q Consensus 81 IEfGh~psg~f~p~k~G~~tka~eGLyILT~AA~l~~~~~~~~~~kRgv~ 130 (130) |||||+|||||+|+|||+++|+|+||||||+||||+|++.|||++|||-+ T Consensus 81 IEfGhapsgvf~p~k~G~~tkapeglYILTrAA~lgg~~~~~t~~~RGkr 130 (131) T protein:vir:78 81 IEYGHYPSGVFDPEKYGRVTKAPQGLYILTGAAGFGGQTAISTGAKRGKR 130 (131) T ss_pred heeccccccccCCcccCcccCCCCcceeeecccccccccccccccccCCC Confidence 99999999999999999999999999999999999999999999999999 No 2 >protein:vir:78503 Length: 131 # NCBI annotation: gp18 # Family: family:all:2819 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491589;genbank:gi:157786412;genbank:GeneID:5625655 Probab=100.00 E-value=1.5e-65 Score=376.03 Aligned_cols=130 Identities=52% Similarity=0.787 Sum_probs=129.6 Q ss_pred CcccccchhhhhhhhcchhHHHHHHHHHHHHHHHHHHHHHHHhhccCCCcccCccccceeecccCCcceEEeecCCCccc Q lcl|NC_013694. 
1 MAKLIPRRRLNHIVAHLAETKAAIRREAREVEGRARRNLAEARASTTHSKIVGPGHLTKIGSAADDPDVLVYMDAPNPMA 80 (130) Q Consensus 1 MA~lyg~~~~n~vva~~~gv~~~v~~ea~~i~~RAeanLa~arast~~~kI~g~~~~t~I~~~~G~vD~~V~L~d~nAlA 80 (130) ||+||++++||++|||||+||++|+.|+|+|.+|||+||++||++|.|+||+||+|+++||+++|||||||+|||||||| T Consensus 1 ma~~~~~~~ln~vvA~l~~vk~~v~~E~~~v~~RA~~NLa~Arast~~~k~~gp~hla~I~~~~gdvD~~v~Ldapna~a 80 (131) T protein:vir:78 1 MPLYYGRSGLNKVVSHLPGVVHEMRSEADEVADRAKANLAAARASTQWEKIHGPDHLTKITRTNGSVDAYVNMEAPSPES 80 (131) T ss_pred CcccccchhhhhhhhhchhHHHHHHHHHHhhhHHHHHHHHHHhhcCcccccccCCCcceeeeeeCCcceEEeecCCCchh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eeeccCccccccCCCCCCCCCCCCcceeeccccccCcccccCcccccCCC Q lcl|NC_013694. 81 IEYGHGPSGYFDPDKYGKVTKAPAGLYILNRAAGIAGSMVTPSMGRRGVK 130 (130) Q Consensus 81 IEfGh~psg~f~p~k~G~~tka~eGLyILT~AA~l~~~~~~~~~~kRgv~ 130 (130) |||||+|||||+|+|||+++|+|+||||||+||||+|++.|||++|||-+ T Consensus 81 IEfGhapsgvf~p~k~G~~tkapeglYILTrAA~lgg~~~~~t~~~RGkr 130 (131) T protein:vir:78 81 IEYGHYPSGVFDPEKYGRVTKAPQGLYILTGAAGFGGQTAISTGAKRGKR 130 (131) T ss_pred heeccccccccCCcccCcccCCCCcceeeecccccccccccccccccCCC Confidence 99999999999999999999999999999999999999999999999999 No 3 >protein:vir:2347 Length: 131 # NCBI annotation: gp17 # Family: family:all:2819 # MgeID: mge:51 # MgeName: Bxb1 # Cross-refs: genbank:acc:NP_075284;genbank:gi:12657871;genbank:GeneID:920131 Probab=100.00 E-value=1.5e-65 Score=376.03 Aligned_cols=130 Identities=52% Similarity=0.787 Sum_probs=129.6 Q ss_pred CcccccchhhhhhhhcchhHHHHHHHHHHHHHHHHHHHHHHHhhccCCCcccCccccceeecccCCcceEEeecCCCccc Q lcl|NC_013694. 
1 MAKLIPRRRLNHIVAHLAETKAAIRREAREVEGRARRNLAEARASTTHSKIVGPGHLTKIGSAADDPDVLVYMDAPNPMA 80 (130) Q Consensus 1 MA~lyg~~~~n~vva~~~gv~~~v~~ea~~i~~RAeanLa~arast~~~kI~g~~~~t~I~~~~G~vD~~V~L~d~nAlA 80 (130) ||+||++++||++|||||+||++|+.|+|+|.+|||+||++||++|.|+||+||+|+++||+++|||||||+|||||||| T Consensus 1 ma~~~~~~~ln~vvA~l~~vk~~v~~E~~~v~~RA~~NLa~Arast~~~k~~gp~hla~I~~~~gdvD~~v~Ldapna~a 80 (131) T protein:vir:23 1 MPLYYGRSGLNKVVSHLPGVVHEMRSEADEVADRAKANLAAARASTQWEKIHGPDHLTKITRTNGSVDAYVNMEAPSPES 80 (131) T ss_pred CcccccchhhhhhhhhchhHHHHHHHHHHhhhHHHHHHHHHHhhcCcccccccCCCcceeeeeeCCcceEEeecCCCchh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eeeccCccccccCCCCCCCCCCCCcceeeccccccCcccccCcccccCCC Q lcl|NC_013694. 81 IEYGHGPSGYFDPDKYGKVTKAPAGLYILNRAAGIAGSMVTPSMGRRGVK 130 (130) Q Consensus 81 IEfGh~psg~f~p~k~G~~tka~eGLyILT~AA~l~~~~~~~~~~kRgv~ 130 (130) |||||+|||||+|+|||+++|+|+||||||+||||+|++.|||++|||-+ T Consensus 81 IEfGhapsgvf~p~k~G~~tkapeglYILTrAA~lgg~~~~~t~~~RGkr 130 (131) T protein:vir:23 81 IEYGHYPSGVFDPEKYGRVTKAPQGLYILTGAAGFGGQTAISTGAKRGKR 130 (131) T ss_pred heeccccccccCCcccCcccCCCCcceeeecccccccccccccccccCCC Confidence 99999999999999999999999999999999999999999999999999 No 4 >protein:vir:7776 Length: 119 # NCBI annotation: gp21 # Family: family:all:2819 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817610;genbank:gi:29566040;genbank:GeneID:1259234 Probab=100.00 E-value=5.7e-59 Score=339.88 Aligned_cols=117 Identities=50% Similarity=0.828 Sum_probs=115.6 Q ss_pred CcccccchhhhhhhhcchhHHHHHHHHHHHHHHHHHHHHHHHhhccCCCcccCccccceeecccCCcceEEeecCCCccc Q lcl|NC_013694. 
1 MAKLIPRRRLNHIVAHLAETKAAIRREAREVEGRARRNLAEARASTTHSKIVGPGHLTKIGSAADDPDVLVYMDAPNPMA 80 (130) Q Consensus 1 MA~lyg~~~~n~vva~~~gv~~~v~~ea~~i~~RAeanLa~arast~~~kI~g~~~~t~I~~~~G~vD~~V~L~d~nAlA 80 (130) ||++|++++||++|||||+||++|++|+|+|.+|||+||++||+||.|+||.||+|+++||+++|||||||+|||||||| T Consensus 1 Ma~~y~~~~ln~vvA~l~~v~~~v~~E~~~v~~RA~~NLa~Arast~~~k~~gp~~~~~Id~a~gdvD~~v~l~apna~a 80 (119) T protein:vir:77 1 MARLIGQKAMNHVISHLDGVKDAVYAEAKERGRKAEANLAQARASTRWHKIFGPDHLTRVTVTRGDVDSFINLEAPNAMA 80 (119) T ss_pred CcccccccchhhhhhhchhHHHHHHHHHHhhhHHHHHHHHHHhhcccccceecCCCcceeccccCCcceEEeecCCCchh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eeeccCccccccC-CCCCC-CCCCCCcceeeccccccCc Q lcl|NC_013694. 81 IEYGHGPSGYFDP-DKYGK-VTKAPAGLYILNRAAGIAG 117 (130) Q Consensus 81 IEfGh~psg~f~p-~k~G~-~tka~eGLyILT~AA~l~~ 117 (130) |||||+|||||+| ++||+ |||+|+||||||+||||+| T Consensus 81 IEfGhapsgvf~pG~~yg~vdtkapeglYILTrAA~l~g 119 (119) T protein:vir:77 81 IEFGHQPSGVFGPGGMFGHLDTKAPEGLYIITSAAGLRG 119 (119) T ss_pred hcccccccceecccccccccCCCCCCCeEeeecccccCC Confidence 9999999999999 89988 5999999999999999998 No 5 >protein:vir:104091 Length: 111 # NCBI annotation: gp23 # Family: family:all:2819 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655602;genbank:gi:109392473;genbank:GeneID:4156959 Probab=100.00 E-value=2.7e-49 Score=286.82 Aligned_cols=110 Identities=45% Similarity=0.648 Sum_probs=103.8 Q ss_pred CcccccchhhhhhhhcchhHHHHHHHHHHHHHHHHHHHHHHHhhccCCCcccCcccc-ceeecccCCcceEEeecCCCcc Q lcl|NC_013694. 1 MAKLIPRRRLNHIVAHLAETKAAIRREAREVEGRARRNLAEARASTTHSKIVGPGHL-TKIGSAADDPDVLVYMDAPNPM 79 (130) Q Consensus 1 MA~lyg~~~~n~vva~~~gv~~~v~~ea~~i~~RAeanLa~arast~~~kI~g~~~~-t~I~~~~G~vD~~V~L~d~nAl 79 (130) ||++|.+ .|+++|||++||++|++|++++.+|||+||+|||++|.|. 
+++|+ ++|++++||||+||+||+|||| T Consensus 1 ma~vy~~--~n~vva~l~~vk~avr~e~~~v~~RAr~nLa~Arastr~~---~~G~~pt~I~~a~gDVD~~v~l~apnam 75 (111) T protein:vir:10 1 MAKVYAN--ANEAAARHVDTKRAVRRVNRDVEGRARSNLAQANSTTRVT---PTGYFPAEIDSSEHDVDCYTTLHAPNAM 75 (111) T ss_pred Ccccccc--cCcEEeechhhHHHHHHHHhhhhhHHHHHHHHhhhccccc---ccCcccceeeeecCCcceEEEecCCCch Confidence 9999999 8899999999999999999999999999999999998655 45677 6999999999999999999999 Q ss_pred ceeeccCccccccCCCCCCCCCCCCcceeeccccccCccc Q lcl|NC_013694. 80 AIEYGHGPSGYFDPDKYGKVTKAPAGLYILNRAAGIAGSM 119 (130) Q Consensus 80 AIEfGh~psg~f~p~k~G~~tka~eGLyILT~AA~l~~~~ 119 (130) ||||||+|||||. |.|||+|+||||||+||+.|.|| T Consensus 76 aiEfGh~psG~f~----~~~tkap~glyILTrAA~gg~t~ 111 (111) T protein:vir:10 76 ALEFGHEPSGVFA----GTDTKSPDPQYILTRAAYGGHTM 111 (111) T ss_pred hhhhccCccceec----ccccCCCCCceeeeecccccccC Confidence 9999999999994 99999999999999999877777 No 6 >protein:vir:4230 Length: 111 # NCBI annotation: predicted 12.0Kd protein # Family: family:all:2819 # MgeID: mge:89 # MgeName: L5 # Cross-refs: genbank:acc:NP_039685;swissprot:sw:q05227;genbank:gi:9625451;uniprot:Q05227;genbank:GeneID:2942925 Probab=100.00 E-value=4.6e-49 Score=285.59 Aligned_cols=110 Identities=41% Similarity=0.681 Sum_probs=103.9 Q ss_pred CcccccchhhhhhhhcchhHHHHHHHHHHHHHHHHHHHHHHHhhccCCCcccCccccc-eeecccCCcceEEeecCCCcc Q lcl|NC_013694. 
1 MAKLIPRRRLNHIVAHLAETKAAIRREAREVEGRARRNLAEARASTTHSKIVGPGHLT-KIGSAADDPDVLVYMDAPNPM 79 (130) Q Consensus 1 MA~lyg~~~~n~vva~~~gv~~~v~~ea~~i~~RAeanLa~arast~~~kI~g~~~~t-~I~~~~G~vD~~V~L~d~nAl 79 (130) ||++|.+ .|++++||++|++++++|++++.+||++|||++|+|| +|++++|++ .|+.++||||+||+|++|||| T Consensus 1 makvyan--aN~v~a~~~~~k~avr~E~~~v~~RAraNLA~a~ast---ri~~~g~~p~~it~~~gdvD~~~~l~APnam 75 (111) T protein:vir:42 1 MAKVYAN--ANKVAARYVETRDAVRDERNKVTRRAKANLARQNSTT---RITDEGYFPATITEQDGDVDFHTILNAPNAL 75 (111) T ss_pred Ccceecc--hhhhhhhchhHHHHHHHHHhhhhhhHHHhHHHhhhcc---ccccccccCceeecccCCcceEEEecCCChh Confidence 9999999 8999999999999999999999999999999999998 455556665 388999999999999999999 Q ss_pred ceeeccCccccccCCCCCCCCCCCCcceeeccccccCcccc Q lcl|NC_013694. 80 AIEYGHGPSGYFDPDKYGKVTKAPAGLYILNRAAGIAGSMV 120 (130) Q Consensus 80 AIEfGh~psg~f~p~k~G~~tka~eGLyILT~AA~l~~~~~ 120 (130) ||||||+|||||. |.|||+|+||||||+|| +||||| T Consensus 76 AiEfGH~PSG~F~----g~dTKaPe~~YILt~AA-iggt~~ 111 (111) T protein:vir:42 76 ALEFGHAPSGFFA----GTDTKPPEATYILTRAA-IGGTVS 111 (111) T ss_pred hhhcccCCcceec----ccccCCCCceeeeeccc-cccccC Confidence 9999999999995 99999999999999999 999999 No 7 >protein:vir:2435 Length: 111 # NCBI annotation: gp21 # Family: family:all:2819 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046837;genbank:gi:9630405;genbank:GeneID:1261628 Probab=100.00 E-value=4.8e-49 Score=285.50 Aligned_cols=110 Identities=39% Similarity=0.600 Sum_probs=103.6 Q ss_pred CcccccchhhhhhhhcchhHHHHHHHHHHHHHHHHHHHHHHHhhccCCCcc-cCccccceeecccCCcceEEeecCCCcc Q lcl|NC_013694. 
1 MAKLIPRRRLNHIVAHLAETKAAIRREAREVEGRARRNLAEARASTTHSKI-VGPGHLTKIGSAADDPDVLVYMDAPNPM 79 (130) Q Consensus 1 MA~lyg~~~~n~vva~~~gv~~~v~~ea~~i~~RAeanLa~arast~~~kI-~g~~~~t~I~~~~G~vD~~V~L~d~nAl 79 (130) ||++|.+ .|++++||++|++++++|++|+.+||++|||+||+||.|.|| ++|+ +|+...||||+||+|++|||| T Consensus 1 makvyan--aN~v~ahl~~vk~avr~Ea~ev~~RAr~NLA~arastri~k~g~~P~---~I~~~~gdvD~~~~l~APnam 75 (111) T protein:vir:24 1 MAKVYAN--ANKVAARHVDVRKRVKEERDGVTRRARTNLARANKTTRITKEGYFPA---SIEEVDGDVDFHTVLHAPNAF 75 (111) T ss_pred Ccccccc--hhhHhhhchhHHHHHHHHHhhhhhhHHHhHHHhhhcceecccccCcc---ccccccCCcceEEEecCCChh Confidence 9999999 899999999999999999999999999999999999976666 4444 488888999999999999999 Q ss_pred ceeeccCccccccCCCCCCCCCCCCcceeeccccccCcccc Q lcl|NC_013694. 80 AIEYGHGPSGYFDPDKYGKVTKAPAGLYILNRAAGIAGSMV 120 (130) Q Consensus 80 AIEfGh~psg~f~p~k~G~~tka~eGLyILT~AA~l~~~~~ 120 (130) ||||||+|||||. |.|||+|+||||||+|| +||||| T Consensus 76 AiEfGH~PSG~F~----g~dTKaP~glYILt~AA-~~g~~~ 111 (111) T protein:vir:24 76 ALEFGHAPSGFFA----GTDTKPPDPEYILTRAA-IGGTVS 111 (111) T ss_pred hhhccCCCcceec----ccccCCCCCceeeeccc-cccccC Confidence 9999999999995 99999999999999999 999999 No 8 >protein:vir:96121 Length: 137 # NCBI annotation: ORF040 # Family: family:all:180 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240082;genbank:gi:66395767;genbank:GeneID:5133101 Probab=87.96 E-value=0.023 Score=29.56 Aligned_cols=114 Identities=14% Similarity=0.068 Sum_probs=57.7 Q ss_pred CcccccchhhhhhhhcchhHHHHHHHHHHHHHHH-HHHHHHHHhhccCCCcccCccccc-eeec--ccCCcceEEeecCC Q lcl|NC_013694. 1 MAKLIPRRRLNHIVAHLAETKAAIRREAREVEGR-ARRNLAEARASTTHSKIVGPGHLT-KIGS--AADDPDVLVYMDAP 76 (130) Q Consensus 1 MA~lyg~~~~n~vva~~~gv~~~v~~ea~~i~~R-AeanLa~arast~~~kI~g~~~~t-~I~~--~~G~vD~~V~L~d~ 76 (130) ||++.- .+.+++..++.....+++.......+ |......++...+. ..+.|. +|+. 
..+..-..|.-+.+ T Consensus 1 Ma~~~~--G~~~l~~~l~~~~~~~~~~~~~~l~~~a~~~~~~ak~~~pv----dTG~L~~Si~~~~~~~g~~~~V~~~~~ 74 (137) T protein:vir:96 1 MAKVKY--GNWDLVAELEDYRDEMEEWVKKGILKTTLAIYNTAVALAPV----DLGFLKESIDFKVTDGGFSSVISVGAE 74 (137) T ss_pred CchhHh--hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCc----CccchhcCceeEeecCceEEEEecCCC Confidence 999973 36667777766555555554444333 34444455554442 234444 4543 55666777888888 Q ss_pred CccceeeccCccccccCCCCCCCCC----CCCcceeeccccccCcccccCcccccCCC Q lcl|NC_013694. 77 NPMAIEYGHGPSGYFDPDKYGKVTK----APAGLYILNRAAGIAGSMVTPSMGRRGVK 130 (130) Q Consensus 77 nAlAIEfGh~psg~f~p~k~G~~tk----a~eGLyILT~AA~l~~~~~~~~~~kRgv~ 130 (130) -|.-+||||.+...-........++ .+.|-++-|+. ...+.=+. T Consensus 75 YA~yvE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g----------~~a~pFl~ 122 (137) T protein:vir:96 75 YAIYVEFGTGIYATGPGGSRARKLPWTYKGDDGEWHTTYG----------QQAQPFWN 122 (137) T ss_pred cccccccCccccccCCCccccccccceeeccCcceeecCC----------CCCCcchh Confidence 9999999986654221111111111 22333333221 01111111 No 9 >protein:vir:94490 Length: 137 # NCBI annotation: ORF043 # Family: family:all:180 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240680;genbank:gi:66396374;genbank:GeneID:5133754 Probab=86.40 E-value=0.033 Score=28.71 Aligned_cols=118 Identities=13% Similarity=0.121 Sum_probs=57.5 Q ss_pred CcccccchhhhhhhhcchhHHHHHHHHHHHHHHH-HHHHHHHHhhccCCCcccCccccc-eeec--ccCCcceEEeecCC Q lcl|NC_013694. 1 MAKLIPRRRLNHIVAHLAETKAAIRREAREVEGR-ARRNLAEARASTTHSKIVGPGHLT-KIGS--AADDPDVLVYMDAP 76 (130) Q Consensus 1 MA~lyg~~~~n~vva~~~gv~~~v~~ea~~i~~R-AeanLa~arast~~~kI~g~~~~t-~I~~--~~G~vD~~V~L~d~ 76 (130) ||++.-. +.+++..+......+.+.......+ |+....+++...+. ..+.|- +|+. 
..+.+-.-|.-..+ T Consensus 1 Ma~~~~g--~~~l~~~l~~~~~~~~~~~~~~~~~~a~~i~~~ak~~aPv----dTG~Lr~SI~~~~~~~~~~~~V~~~~~ 74 (137) T protein:vir:94 1 MAKVKYG--NWDLVKELENYERDMERWVKRGIAKTTAKIHNTIISLMPV----DTGYLRESVTMDFKDSGFTGVINIGSE 74 (137) T ss_pred CchhHHh--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCc----cccchhccceeEeecCceEEEEecCCC Confidence 9998733 5566666555544554444333333 23333445554442 134444 4553 44555566777778 Q ss_pred CccceeeccCccccccCCCCCCCCC----CCCcceeeccccccCcccccCcccccCCC Q lcl|NC_013694. 77 NPMAIEYGHGPSGYFDPDKYGKVTK----APAGLYILNRAAGIAGSMVTPSMGRRGVK 130 (130) Q Consensus 77 nAlAIEfGh~psg~f~p~k~G~~tk----a~eGLyILT~AA~l~~~~~~~~~~kRgv~ 130 (130) -|.-+||||.|+.....+..+..++ .+.|-++-|+ +.++ -|-. ++.+. T Consensus 75 YA~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~--g~~a---~PFl-~pA~~ 126 (137) T protein:vir:94 75 YAIYVNYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTK--GQHA---QPFW-EPAID 126 (137) T ss_pred cccccccCccccccCCCcccccccccceeccCcceeecC--CCCC---Ccch-HHHHH Confidence 8999999998876555443333222 2333333322 0000 0000 01111 No 10 >protein:vir:97427 Length: 137 # NCBI annotation: ORF043 # Family: family:all:180 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240753;genbank:gi:66396447;genbank:GeneID:5133783 Probab=86.40 E-value=0.033 Score=28.71 Aligned_cols=118 Identities=13% Similarity=0.121 Sum_probs=57.5 Q ss_pred CcccccchhhhhhhhcchhHHHHHHHHHHHHHHH-HHHHHHHHhhccCCCcccCccccc-eeec--ccCCcceEEeecCC Q lcl|NC_013694. 1 MAKLIPRRRLNHIVAHLAETKAAIRREAREVEGR-ARRNLAEARASTTHSKIVGPGHLT-KIGS--AADDPDVLVYMDAP 76 (130) Q Consensus 1 MA~lyg~~~~n~vva~~~gv~~~v~~ea~~i~~R-AeanLa~arast~~~kI~g~~~~t-~I~~--~~G~vD~~V~L~d~ 76 (130) ||++.-. +.+++..+......+.+.......+ |+....+++...+. ..+.|- +|+. 
..+.+-.-|.-..+ T Consensus 1 Ma~~~~g--~~~l~~~l~~~~~~~~~~~~~~~~~~a~~i~~~ak~~aPv----dTG~Lr~SI~~~~~~~~~~~~V~~~~~ 74 (137) T protein:vir:97 1 MAKVKYG--NWDLVKELENYERDMERWVKRGIAKTTAKIHNTIISLMPV----DTGYLRESVTMDFKDSGFTGVINIGSE 74 (137) T ss_pred CchhHHh--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCc----cccchhccceeEeecCceEEEEecCCC Confidence 9998733 5566666555544554444333333 23333445554442 134444 4553 44555566777778 Q ss_pred CccceeeccCccccccCCCCCCCCC----CCCcceeeccccccCcccccCcccccCCC Q lcl|NC_013694. 77 NPMAIEYGHGPSGYFDPDKYGKVTK----APAGLYILNRAAGIAGSMVTPSMGRRGVK 130 (130) Q Consensus 77 nAlAIEfGh~psg~f~p~k~G~~tk----a~eGLyILT~AA~l~~~~~~~~~~kRgv~ 130 (130) -|.-+||||.|+.....+..+..++ .+.|-++-|+ +.++ -|-. ++.+. T Consensus 75 YA~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~--g~~a---~PFl-~pA~~ 126 (137) T protein:vir:97 75 YAIYVNYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTK--GQHA---QPFW-EPAID 126 (137) T ss_pred cccccccCccccccCCCcccccccccceeccCcceeecC--CCCC---Ccch-HHHHH Confidence 8999999998876555443333222 2333333322 0000 0000 01111 No 11 >protein:vir:93738 Length: 137 # NCBI annotation: ORF041 # Family: family:all:180 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240463;genbank:gi:66396153;genbank:GeneID:5133507 Probab=86.40 E-value=0.033 Score=28.71 Aligned_cols=118 Identities=13% Similarity=0.121 Sum_probs=57.5 Q ss_pred CcccccchhhhhhhhcchhHHHHHHHHHHHHHHH-HHHHHHHHhhccCCCcccCccccc-eeec--ccCCcceEEeecCC Q lcl|NC_013694. 1 MAKLIPRRRLNHIVAHLAETKAAIRREAREVEGR-ARRNLAEARASTTHSKIVGPGHLT-KIGS--AADDPDVLVYMDAP 76 (130) Q Consensus 1 MA~lyg~~~~n~vva~~~gv~~~v~~ea~~i~~R-AeanLa~arast~~~kI~g~~~~t-~I~~--~~G~vD~~V~L~d~ 76 (130) ||++.-. +.+++..+......+.+.......+ |+....+++...+. ..+.|- +|+. 
..+.+-.-|.-..+ T Consensus 1 Ma~~~~g--~~~l~~~l~~~~~~~~~~~~~~~~~~a~~i~~~ak~~aPv----dTG~Lr~SI~~~~~~~~~~~~V~~~~~ 74 (137) T protein:vir:93 1 MAKVKYG--NWDLVKELENYERDMERWVKRGIAKTTAKIHNTIISLMPV----DTGYLRESVTMDFKDSGFTGVINIGSE 74 (137) T ss_pred CchhHHh--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCc----cccchhccceeEeecCceEEEEecCCC Confidence 9998733 5566666555544554444333333 23333445554442 134444 4553 44555566777778 Q ss_pred CccceeeccCccccccCCCCCCCCC----CCCcceeeccccccCcccccCcccccCCC Q lcl|NC_013694. 77 NPMAIEYGHGPSGYFDPDKYGKVTK----APAGLYILNRAAGIAGSMVTPSMGRRGVK 130 (130) Q Consensus 77 nAlAIEfGh~psg~f~p~k~G~~tk----a~eGLyILT~AA~l~~~~~~~~~~kRgv~ 130 (130) -|.-+||||.|+.....+..+..++ .+.|-++-|+ +.++ -|-. ++.+. T Consensus 75 YA~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~--g~~a---~PFl-~pA~~ 126 (137) T protein:vir:93 75 YAIYVNYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTK--GQHA---QPFW-EPAID 126 (137) T ss_pred cccccccCccccccCCCcccccccccceeccCcceeecC--CCCC---Ccch-HHHHH Confidence 8999999998876555443333222 2333333322 0000 0000 01111 No 12 >protein:vir:105330 Length: 137 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950673;genbank:gi:119967843;genbank:GeneID:4643209 Probab=84.70 E-value=0.051 Score=27.67 Aligned_cols=114 Identities=13% Similarity=0.091 Sum_probs=58.6 Q ss_pred CcccccchhhhhhhhcchhHHHHHHHHHHHHHHHHHHHH-HHHhhccCCCcccCccccc-eee--cccCCcceEEeecCC Q lcl|NC_013694. 1 MAKLIPRRRLNHIVAHLAETKAAIRREAREVEGRARRNL-AEARASTTHSKIVGPGHLT-KIG--SAADDPDVLVYMDAP 76 (130) Q Consensus 1 MA~lyg~~~~n~vva~~~gv~~~v~~ea~~i~~RAeanL-a~arast~~~kI~g~~~~t-~I~--~~~G~vD~~V~L~d~ 76 (130) ||++.-. +.++...|......+.+.......++-..| .+++...+. ..+.|. 
+|+ ...+.+-..|.-+.+ T Consensus 1 Ma~~~~G--~~~l~~~l~~~~~~~~~~~~~al~~~a~~i~~~ak~~aPv----~TG~Lr~SI~~~~~~~~~~~~V~~~~~ 74 (137) T protein:vir:10 1 MAKVKYG--NWDLVKELEEFEKETIRWAKKGIAKTTTIIHNSIVSNMPV----DTGYLRESVSMDFKKGGLTGVINIGSE 74 (137) T ss_pred CccchhC--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCc----CcchhhcCeeeEecCCcEEEEEecCCc Confidence 9999633 566666666666666666555555544333 334433331 234444 454 355556677788888 Q ss_pred CccceeeccCccccccCCCCCCCCC----CCCcceeeccccccCcccccCcccccCCC Q lcl|NC_013694. 77 NPMAIEYGHGPSGYFDPDKYGKVTK----APAGLYILNRAAGIAGSMVTPSMGRRGVK 130 (130) Q Consensus 77 nAlAIEfGh~psg~f~p~k~G~~tk----a~eGLyILT~AA~l~~~~~~~~~~kRgv~ 130 (130) -|.-+|||+.+.+.....+.+..++ .+.|-|+-|+. ...+.=+. T Consensus 75 YA~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g----------~~a~Pfl~ 122 (137) T protein:vir:10 75 YAVYVNYGTGIYAVGPGGSRAKNIPWRYKDADGHWHTTKG----------QHAQPFWE 122 (137) T ss_pred cccccccCccccccCCCcccccccceeeeccccccccCCC----------CCCCcchh Confidence 9999999987776433222222222 12333322210 01111111 No 13 >protein:vir:105467 Length: 144 # NCBI annotation: hypothetical protein # Family: family:all:1892 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529877;genbank:gi:90592617;genbank:GeneID:3974531 Probab=83.88 E-value=0.0066 Score=32.54 Aligned_cols=114 Identities=14% Similarity=0.180 Sum_probs=60.7 Q ss_pred Cccc-ccchhhhhhhhcchh------HHHHHHHHHHHHHHHHHHHHHHHhhccCCCcccCccccc-eee-----cccCCc Q lcl|NC_013694. 1 MAKL-IPRRRLNHIVAHLAE------TKAAIRREAREVEGRARRNLAEARASTTHSKIVGPGHLT-KIG-----SAADDP 67 (130) Q Consensus 1 MA~l-yg~~~~n~vva~~~g------v~~~v~~ea~~i~~RAeanLa~arast~~~kI~g~~~~t-~I~-----~~~G~v 67 (130) |+.- +--+.+++++..+.. ++..+.+...+++.+.. ..+.+.|| | +.+++- +|+ .+.+.. 
T Consensus 1 Ms~~~id~~gl~~~~~~l~~~~~~~~~~~~~~~~l~~~~~~~~---~~vk~~tP---V-dTG~Lr~S~~~~~~~~~~~~~ 73 (144) T protein:vir:10 1 MSLGHVDDAQFQQFASRVRQKIDSGYVKQELGKSSRRIGTQSL---RILEANTP---V-KQGNLRRSWTAEGPTYGCGGW 73 (144) T ss_pred CCCCCccHHHHHHHHHHHHHHHhhcchHHHHHHHHHHHHHHHH---HHHHHhCC---C-CcchhccceeecceeeecCee Confidence 6542 112345555554443 34444444555544444 34444454 2 233433 343 333444 Q ss_pred ceEEeecCCCccceeeccCc-c-ccccCCCCCCCCCCCCcceeeccccccCcccccCcccccCCC Q lcl|NC_013694. 68 DVLVYMDAPNPMAIEYGHGP-S-GYFDPDKYGKVTKAPAGLYILNRAAGIAGSMVTPSMGRRGVK 130 (130) Q Consensus 68 D~~V~L~d~nAlAIEfGh~p-s-g~f~p~k~G~~tka~eGLyILT~AA~l~~~~~~~~~~kRgv~ 130 (130) -.-|.=..+-|--+||||-. . .++-..+.+....|++|.|.|.+|... -++-+. T Consensus 74 ~~~V~n~~~YA~~VE~Ghr~~~G~~v~~~~~~~~~g~V~G~~~~~~a~~~---------~~~~~~ 129 (144) T protein:vir:10 74 TIKLINNAEYASYVESGHRQTPGRYVPVLKKRLVRDWVPGQFYMKKSIPQ---------IQRQLP 129 (144) T ss_pred EEEEecCCCcccccccceeecCCcccccCCCccccceecCccchHHHHHH---------HHHHHH Confidence 44445556779999999942 2 344445556677899999999999854 222222 No 14 >protein:vir:94796 Length: 137 # NCBI annotation: ORF050 # Family: family:all:180 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240540;genbank:gi:66396237;genbank:GeneID:5133576 Probab=82.76 E-value=0.068 Score=26.97 Aligned_cols=118 Identities=13% Similarity=0.093 Sum_probs=54.3 Q ss_pred CcccccchhhhhhhhcchhHHHHHHHHHHHHHHH-HHHHHHHHhhccCCCcccCccccc-eeec--ccCCcceEEeecCC Q lcl|NC_013694. 1 MAKLIPRRRLNHIVAHLAETKAAIRREAREVEGR-ARRNLAEARASTTHSKIVGPGHLT-KIGS--AADDPDVLVYMDAP 76 (130) Q Consensus 1 MA~lyg~~~~n~vva~~~gv~~~v~~ea~~i~~R-AeanLa~arast~~~kI~g~~~~t-~I~~--~~G~vD~~V~L~d~ 76 (130) ||++.- .+.++...|......+.+.......+ |+.....++...+. ..+.|- +|+. 
..+.+-..|.-+.+ T Consensus 1 Ma~~~~--G~~~l~~~L~~~~~~~~~~~~~al~~~a~~v~~~ak~~aPv----dTG~Lr~SI~~~~~~~~~~~~V~~~~~ 74 (137) T protein:vir:94 1 MAKVKY--GNWDLVKELENYERDIERWVKRGIAKTTVKIHNTIISLMPV----DTGYLRESVTMDFKDGGFTGVINIGSE 74 (137) T ss_pred CchhHH--hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCc----CcchhhcCceeEeecCcEEEEEecCCC Confidence 999952 24555555544444444333322222 22333344444432 134444 4543 55556677777788 Q ss_pred CccceeeccCccccccCCCCCCCCC----CCCcceeeccccccCcccccCcccccCCC Q lcl|NC_013694. 77 NPMAIEYGHGPSGYFDPDKYGKVTK----APAGLYILNRAAGIAGSMVTPSMGRRGVK 130 (130) Q Consensus 77 nAlAIEfGh~psg~f~p~k~G~~tk----a~eGLyILT~AA~l~~~~~~~~~~kRgv~ 130 (130) -|.-+||||.|+........+..++ .+.|-++-|+. .+ .-|- =++++. T Consensus 75 YA~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g--~~---a~PF-l~pA~~ 126 (137) T protein:vir:94 75 YAIYVNYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKG--QH---AQPF-WEPAID 126 (137) T ss_pred cccccccCccccccCCCcccccccccceeccCCceeecCC--cC---CCcc-hHHHHH Confidence 8999999998876554333222222 22233322220 00 0000 001111 No 15 >protein:vir:95894 Length: 137 # NCBI annotation: ORF046 # Family: family:all:180 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240389;genbank:gi:66396083;genbank:GeneID:5133405 Probab=76.49 E-value=0.13 Score=25.35 Aligned_cols=118 Identities=13% Similarity=0.108 Sum_probs=55.5 Q ss_pred CcccccchhhhhhhhcchhHHHHHHHHHHHHHHHH-HHHHHHHhhccCCCcccCccccc-eee--cccCCcceEEeecCC Q lcl|NC_013694. 1 MAKLIPRRRLNHIVAHLAETKAAIRREAREVEGRA-RRNLAEARASTTHSKIVGPGHLT-KIG--SAADDPDVLVYMDAP 76 (130) Q Consensus 1 MA~lyg~~~~n~vva~~~gv~~~v~~ea~~i~~RA-eanLa~arast~~~kI~g~~~~t-~I~--~~~G~vD~~V~L~d~ 76 (130) ||++.-. +.++...+......+.+.......++ +.....++..++.. .+.|. 
+|+ ...+..-..|.-+.+ T Consensus 1 Ma~~~~G--~~~l~~~l~~~~~~~~~~~~~~~~~~a~~v~~~ak~~aPv~----TG~L~~Si~~~~~~~~~~~~V~~~~~ 74 (137) T protein:vir:95 1 MAKVKYG--NWDLVKELENYERDMERWVKRGIAKTTAKIHNTIISLMPVD----TGYLRESVTMDFKDGGFTGVINIGSE 74 (137) T ss_pred CchhHHh--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcc----chhhhcCeeeEeeCCceEEEEecCCC Confidence 9998733 55666666554444444333333332 22333444444421 34444 454 344555556666678 Q ss_pred CccceeeccCccccccCCCCCCCCC----CCCcceeeccccccCcccccCcccccCCC Q lcl|NC_013694. 77 NPMAIEYGHGPSGYFDPDKYGKVTK----APAGLYILNRAAGIAGSMVTPSMGRRGVK 130 (130) Q Consensus 77 nAlAIEfGh~psg~f~p~k~G~~tk----a~eGLyILT~AA~l~~~~~~~~~~kRgv~ 130 (130) -|.-+||||.++........+..++ .+.|-++-|+ +.++ -|- =++.+. T Consensus 75 YA~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~--g~~a---~PF-l~pA~~ 126 (137) T protein:vir:95 75 YAIYVNYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTK--GQHA---QPF-WEPAID 126 (137) T ss_pred cccccccCccccccCCCcccccccccceeccCcceeecC--CCCC---Ccc-hHHHHH Confidence 8999999998877555433222222 2333333222 0000 000 001111 No 16 >protein:vir:107099 Length: 137 # NCBI annotation: conserved phage protein # Family: family:all:180 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950610;genbank:gi:119953690;genbank:GeneID:4643108 Probab=74.53 E-value=0.16 Score=24.99 Aligned_cols=118 Identities=14% Similarity=0.107 Sum_probs=54.8 Q ss_pred CcccccchhhhhhhhcchhHHHHHHHHHHHHHHHHH-HHHHHHhhccCCCcccCccccc-eee--cccCCcceEEeecCC Q lcl|NC_013694. 1 MAKLIPRRRLNHIVAHLAETKAAIRREAREVEGRAR-RNLAEARASTTHSKIVGPGHLT-KIG--SAADDPDVLVYMDAP 76 (130) Q Consensus 1 MA~lyg~~~~n~vva~~~gv~~~v~~ea~~i~~RAe-anLa~arast~~~kI~g~~~~t-~I~--~~~G~vD~~V~L~d~ 76 (130) ||++.-. +.+++..+.....++.+.......++- ....+++...+. 
..+.|- +|+ ...+.+-..|.-+.+ T Consensus 1 Ma~~~~G--l~~l~~~l~~~~~~~~~~~~~al~~~a~~i~~~ak~~aPv----dTG~Lr~SI~~~~~~~~~~~~V~~~~~ 74 (137) T protein:vir:10 1 MAKVKYG--NWELVKELEDFEKETIRWAKKGIAKTTTIIHNSIVSNMPV----DTGYLRESVSMDFKKGGLTGVINIGSE 74 (137) T ss_pred CchhHhh--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCc----CcchhhcCeeEEeeCCcEEEEEecCCC Confidence 9999522 445555554444444444433333322 222334443332 134444 454 456677778888888 Q ss_pred CccceeeccCccccccCCCCCCCCC----CCCcceeeccccccCcccccCcccccCCC Q lcl|NC_013694. 77 NPMAIEYGHGPSGYFDPDKYGKVTK----APAGLYILNRAAGIAGSMVTPSMGRRGVK 130 (130) Q Consensus 77 nAlAIEfGh~psg~f~p~k~G~~tk----a~eGLyILT~AA~l~~~~~~~~~~kRgv~ 130 (130) -|.-+|||+.+.+.-...+.+..++ .+.|-++-|+ +.+ .-|= =+++++ T Consensus 75 Ya~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~--g~~---a~PF-l~pA~~ 126 (137) T protein:vir:10 75 YAVYVNYGTGIYAVGPGGSRAKNIPWCYKDADGHWHTTK--GQH---AQPF-WEPAID 126 (137) T ss_pred cccccccCccccccCCCccccccccceeeccccceeccC--CCC---CCcc-hhHHHH Confidence 9999999987665333222222222 2333333222 000 0000 011111 No 17 >protein:vir:94108 Length: 149 # NCBI annotation: ORF029 # Family: family:all:180 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240238;genbank:gi:66395914;genbank:GeneID:5133277 Probab=67.72 E-value=0.24 Score=23.98 Aligned_cols=114 Identities=15% Similarity=0.076 Sum_probs=48.3 Q ss_pred CcccccchhhhhhhhcchhHHHHHHHHHHHHHHH-HHHHHHHHhhccCCCcccCccccc-eee--cccCCcceEEeecCC Q lcl|NC_013694. 1 MAKLIPRRRLNHIVAHLAETKAAIRREAREVEGR-ARRNLAEARASTTHSKIVGPGHLT-KIG--SAADDPDVLVYMDAP 76 (130) Q Consensus 1 MA~lyg~~~~n~vva~~~gv~~~v~~ea~~i~~R-AeanLa~arast~~~kI~g~~~~t-~I~--~~~G~vD~~V~L~d~ 76 (130) ||++.- -++++...+......+.+.......+ |+....+|+...+. ..+.+. 
+|+ .+.+.+-..|.-+.+ T Consensus 13 Ma~~~~--Gld~l~~~L~~~~~~~~~~~~~al~~~a~~v~~~ak~~aPv----dTG~Lr~SI~~~~~~~g~~~~V~~~~~ 86 (149) T protein:vir:94 13 MAKVKY--GADSMVVELDKFDKKIEEWVKKGIAKTTTKIYNTAVALAPV----DLGFLEESIDFKYFDGGLSSVISVGAD 86 (149) T ss_pred HHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCc----ccchhhcCeeEEeeCCcEEEEEecCCC Confidence 988752 24455544443333333222222222 22223334444331 123444 454 344556667777788 Q ss_pred CccceeeccCccccccCCCCCCCCCC----CCcceeeccccccCcccccCcccccCCC Q lcl|NC_013694. 77 NPMAIEYGHGPSGYFDPDKYGKVTKA----PAGLYILNRAAGIAGSMVTPSMGRRGVK 130 (130) Q Consensus 77 nAlAIEfGh~psg~f~p~k~G~~tka----~eGLyILT~AA~l~~~~~~~~~~kRgv~ 130 (130) -|.-+||||.+.+.-...+.+..+.+ +.|-++-|+ + +..+.=++ T Consensus 87 YA~~VE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--------g--~~a~PFl~ 134 (149) T protein:vir:94 87 YAIYVEYGTGIYATGPGGSRATKIPWSFKGDDGEWYTTY--------G--QAPQPFWN 134 (149) T ss_pred cccccccCccccccCCCccccccccceeecCccceecCC--------C--CCCCcchH Confidence 89999999987663222221111110 111111111 0 01111111 No 18 >protein:vir:96829 Length: 135 # NCBI annotation: ORF033 # Family: family:all:180 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240161;genbank:gi:66395838;genbank:GeneID:5133170 Probab=66.47 E-value=0.27 Score=23.73 Aligned_cols=118 Identities=15% Similarity=0.139 Sum_probs=51.7 Q ss_pred CcccccchhhhhhhhcchhHHHHHHHHHHHHHHH-HHHHHHHHhhccCCCcccCccccc-eeec--ccCCcceEEeecCC Q lcl|NC_013694. 1 MAKLIPRRRLNHIVAHLAETKAAIRREAREVEGR-ARRNLAEARASTTHSKIVGPGHLT-KIGS--AADDPDVLVYMDAP 76 (130) Q Consensus 1 MA~lyg~~~~n~vva~~~gv~~~v~~ea~~i~~R-AeanLa~arast~~~kI~g~~~~t-~I~~--~~G~vD~~V~L~d~ 76 (130) ||++.- -++++.+.+......+.+.......+ |+....+|+...+ + ..+.|- +|+. 
+.+..-.-|.=+.+ T Consensus 1 Ma~~~~--Gl~~l~~~l~~~~~~~~~~~~~al~~~a~~v~~~ak~~ap---v-dTG~Lr~SI~~~~~~~g~~~~V~~~~~ 74 (135) T protein:vir:96 1 MAKVKY--GADSIVVDLEKYSKDMEKWVKKGITKTTLKIYNTAIHLMP---V-DTGFLRQSTTVDFENGGFTGVVKIGSN 74 (135) T ss_pred Cchhhh--hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC---c-cchhhhcceeEEeecCcEEEEEecCCC Confidence 999742 36666666555544444443333222 2223334444433 1 123333 4543 44444455565677 Q ss_pred CccceeeccCccccccCC-CC-CCCCCCCCcceeeccccccCcccccCcccccCCC Q lcl|NC_013694. 77 NPMAIEYGHGPSGYFDPD-KY-GKVTKAPAGLYILNRAAGIAGSMVTPSMGRRGVK 130 (130) Q Consensus 77 nAlAIEfGh~psg~f~p~-k~-G~~tka~eGLyILT~AA~l~~~~~~~~~~kRgv~ 130 (130) -|.-+||||.|...-.+. +. ....+-+.|.++-|+ +.+++ |- =++.+. T Consensus 75 YA~~ve~GT~~~~~~~~~~~~~~~~~~~~~g~~~~~~--~~~a~---pf-l~~A~~ 124 (135) T protein:vir:96 75 YAVYVNYGTGIYATKGSRAHKIPWTYKDPNGKWHTTY--GQMPQ---PF-WEPAID 124 (135) T ss_pred ccchhhcccccccCCCccccccccccccCCcceeecC--CcCCC---cc-hhHHHH Confidence 899999999887653321 11 111112233333322 00000 00 000111 No 19 >protein:vir:94654 Length: 142 # NCBI annotation: tail component protein # Family: family:all:1084 # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579211;genbank:gi:93007447;genbank:GeneID:5076773 Probab=65.72 E-value=0.27 Score=23.71 Aligned_cols=114 Identities=16% Similarity=0.147 Sum_probs=46.0 Q ss_pred Ccccc---cchhhhhhhhcchh-HHHHHHHHHHHHHHHHHHHHHHHhhccCCCcccCccccc-eeec----ccCCcceEE Q lcl|NC_013694. 1 MAKLI---PRRRLNHIVAHLAE-TKAAIRREAREVEGRARRNLAEARASTTHSKIVGPGHLT-KIGS----AADDPDVLV 71 (130) Q Consensus 1 MA~ly---g~~~~n~vva~~~g-v~~~v~~ea~~i~~RAeanLa~arast~~~kI~g~~~~t-~I~~----~~G~vD~~V 71 (130) ||.+= +-..+...+..++. ++.++.+.-.+ -|+....+++..+++. .+.|- +|+. 
+-+.+-..| T Consensus 1 Ma~~~~~~~~~~l~~~l~~~~~~~~~~~~~~l~~---~a~~i~~~ak~~aPv~----TG~Lr~SI~~~~~~~g~~~~~~v 73 (142) T protein:vir:94 1 MAGLNYRVNSTEFQGALRAALDRLTGAAREATEA---AANDMVNMAKGLCPVD----TGRLRSSIQAVPSGGRFSFSVTI 73 (142) T ss_pred CceeEEEecHHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHhCCcc----chhhhccceeeeccCCceEEEEE Confidence 99883 44333333332211 23322222222 2333334445444421 34444 3543 222234556 Q ss_pred eecCCCccceeeccCccccccCCCCCCCCCCCCcceeeccccccCcccccC-cccccCCC Q lcl|NC_013694. 72 YMDAPNPMAIEYGHGPSGYFDPDKYGKVTKAPAGLYILNRAAGIAGSMVTP-SMGRRGVK 130 (130) Q Consensus 72 ~L~d~nAlAIEfGh~psg~f~p~k~G~~tka~eGLyILT~AA~l~~~~~~~-~~~kRgv~ 130 (130) .-..+-|.-+||||.|+-+... ..+-.+|+-+-+ .....-+| +..+.=++ T Consensus 74 ~~~~~YA~~vE~Gt~~~~i~pk--~~k~l~~~~~~~-------~~~~v~~pG~~~~pfl~ 124 (142) T protein:vir:94 74 GTNVTYAADVEYGTAPHVIVPK--DKKALYWPGAAH-------PVAKVNHPGTRAQPFMR 124 (142) T ss_pred ecCcccchhhhccCCCceeccC--CCccceecccce-------eeeeeeecCCCCCcchh Confidence 6667889999999988765431 121122221111 11111000 11111111 No 20 >protein:vir:105916 Length: 149 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004379;genbank:gi:122891834;genbank:GeneID:4712387 Probab=65.67 E-value=0.27 Score=23.67 Aligned_cols=114 Identities=15% Similarity=0.088 Sum_probs=48.2 Q ss_pred CcccccchhhhhhhhcchhHHHHHHHHHHHHHHHH-HHHHHHHhhccCCCcccCccccc-eee--cccCCcceEEeecCC Q lcl|NC_013694. 1 MAKLIPRRRLNHIVAHLAETKAAIRREAREVEGRA-RRNLAEARASTTHSKIVGPGHLT-KIG--SAADDPDVLVYMDAP 76 (130) Q Consensus 1 MA~lyg~~~~n~vva~~~gv~~~v~~ea~~i~~RA-eanLa~arast~~~kI~g~~~~t-~I~--~~~G~vD~~V~L~d~ 76 (130) ||++.- -++++...+......+.+.......++ +....+|+...+. ..+.+. 
+|+ ...+.+-..|.-+.+ T Consensus 13 Ma~v~~--Gld~l~~~l~~~~~~~~~~~~~~l~~~a~~v~~~ak~~aPv----dTG~L~~SI~~~~~~~g~~~~V~~~~~ 86 (149) T protein:vir:10 13 MAKVKY--GADSMVVELDKFDKKIEEWVKKGIAKTTTKIYNTAVALAPV----DLGFLEESIDFKYFDGGLSSVISVGAD 86 (149) T ss_pred hHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCc----ccchhhccceEEecCCcEEEEEecCCC Confidence 999831 245555555433333333322222222 1122234433331 123444 454 344556677777788 Q ss_pred CccceeeccCccccccCCCCCCCCC----CCCcceeeccccccCcccccCcccccCCC Q lcl|NC_013694. 77 NPMAIEYGHGPSGYFDPDKYGKVTK----APAGLYILNRAAGIAGSMVTPSMGRRGVK 130 (130) Q Consensus 77 nAlAIEfGh~psg~f~p~k~G~~tk----a~eGLyILT~AA~l~~~~~~~~~~kRgv~ 130 (130) -|.-+||||.+.+.-...+.+..+. -+.|-++-|+ ++ ..+.=++ T Consensus 87 YA~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~--------g~--~a~PFl~ 134 (149) T protein:vir:10 87 YAIYVEYGTGIYATGPGGSRATKIPWSFKGDDGEWYTTY--------GQ--APQPFWN 134 (149) T ss_pred cccccccCccccccCCcccccccccceeeccccceecCC--------CC--CCCcchh Confidence 9999999997665322112111111 0112222111 00 1111111 No 21 >protein:vir:5978 Length: 144 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690678;genbank:geneid:6329146;genbank:gi:22855072;interpro:IPR011693;uniprot:O48447;genbank:GeneID:955318 Probab=57.48 E-value=0.43 Score=22.57 Aligned_cols=110 Identities=21% Similarity=0.269 Sum_probs=45.4 Q ss_pred Ccccc---cchhhhhhhhcc-----hhHHHHHHHHHHHHHHHHHHHHHHHhhccCCCcccCccccc-eeec--ccCCcce Q lcl|NC_013694. 1 MAKLI---PRRRLNHIVAHL-----AETKAAIRREAREVEGRARRNLAEARASTTHSKIVGPGHLT-KIGS--AADDPDV 69 (130) Q Consensus 1 MA~ly---g~~~~n~vva~~-----~gv~~~v~~ea~~i~~RAeanLa~arast~~~kI~g~~~~t-~I~~--~~G~vD~ 69 (130) |+--+ |.+.+...+... +.|+.+|.+-+.++. .+++...+.. -+.|. +|+. +.+.+.. 
T Consensus 4 ms~~i~~~g~~~l~~~l~~~~~~~~~~v~~~l~~~a~~i~-------~~ak~~apv~----TG~Lr~SI~~~~~~~g~~~ 72 (144) T protein:vir:59 4 MSVRIDPSWRRIMSRNVRTFSGHVLTQVEQVIIKTAEKIA-------GLAASLAPVD----EGNLKNSIQIDYKNNGLTA 72 (144) T ss_pred ceeeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-------HHHHHhCCcc----chhhhcCeeEEeecCcEEE Confidence 44434 223333322222 123333333333333 3344433321 23333 4553 4455667 Q ss_pred EEeecCCCccceeeccCccccccCCCCCCCCC-----CCCcceeeccccccCcccccCcccccCCC Q lcl|NC_013694. 70 LVYMDAPNPMAIEYGHGPSGYFDPDKYGKVTK-----APAGLYILNRAAGIAGSMVTPSMGRRGVK 130 (130) Q Consensus 70 ~V~L~d~nAlAIEfGh~psg~f~p~k~G~~tk-----a~eGLyILT~AA~l~~~~~~~~~~kRgv~ 130 (130) -|.-..+-|.-+|||+.|.+.... ++.++ ...|.|+-|+ +.+++ |- =++.++ T Consensus 73 ~V~~~~~YA~~vE~GT~~~~~~~~---~~~~~~~~~~~~~g~~~~t~--g~~a~---Pf-l~pA~~ 129 (144) T protein:vir:59 73 EITVGAEYAIYVEYGTGIYAVDGN---GRKTPWTYYSPKLGRYVRTQ--GAPAQ---PF-FWPAVE 129 (144) T ss_pred EEecCCCccchhhcCccccccCCC---ccccccccccccccceecCC--CCCCC---cc-hhHHHH Confidence 777788899999999988764332 11111 1123333322 00000 00 000111 No 22 >protein:vir:107545 Length: 140 # NCBI annotation: gp25 # Family: family:all:1084 # MgeID: mge:1481 # MgeName: PG1 # Cross-refs: genbank:acc:NP_943803;genbank:gi:38638428;genbank:GeneID:2657225 Probab=54.76 E-value=0.5 Score=22.25 Aligned_cols=121 Identities=17% Similarity=0.184 Sum_probs=60.9 Q ss_pred CcccccchhhhhhhhcchhHHHHHHHHHHHHHHHHHHHH-HHHhhccCCCcccCccccc-eeeccc--CCcceEEeec-- Q lcl|NC_013694. 1 MAKLIPRRRLNHIVAHLAETKAAIRREAREVEGRARRNL-AEARASTTHSKIVGPGHLT-KIGSAA--DDPDVLVYMD-- 74 (130) Q Consensus 1 MA~lyg~~~~n~vva~~~gv~~~v~~ea~~i~~RAeanL-a~arast~~~kI~g~~~~t-~I~~~~--G~vD~~V~L~-- 74 (130) |+++-.+..+. .....+.+.+....+.+..++-.++ +++++..++ ..++|- +|+... +..-.++... 
T Consensus 1 ~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~v~~~ak~~aPv----dtG~Lr~SI~~~~~~~~~~~~~~~v~~ 73 (140) T protein:vir:10 1 MATIRARARIE---IDEAALERESGEHLRAFHRSLTRRIANQSRVAVPV----RTGNLGRTIGELPQVYTPFRVRGGVEA 73 (140) T ss_pred Ceeeeeeeeee---eCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCc----cchhhhccceeeeeeCCCceEEEEecC Confidence 88877664333 2222333444444444444444433 344444442 245555 576533 2222334443 Q ss_pred -CCCccceeeccCccccccCCCCCCCCC-CCCcceeeccccccCcccccCcc---------cccCCC Q lcl|NC_013694. 75 -APNPMAIEYGHGPSGYFDPDKYGKVTK-APAGLYILNRAAGIAGSMVTPSM---------GRRGVK 130 (130) Q Consensus 75 -d~nAlAIEfGh~psg~f~p~k~G~~tk-a~eGLyILT~AA~l~~~~~~~~~---------~kRgv~ 130 (130) .+-|.-+|||..|+-+.... ++..+ ...|.++..+--+-||+-.-|-- .++-+| T Consensus 74 ~a~YA~~Ve~GT~ph~I~pk~--~k~L~~~~~G~~~~~k~V~hpG~~a~Pfl~~A~~~~~~~~~~i~ 138 (140) T protein:vir:10 74 TADYAAPVHEGSRPHAIRARN--AQYLHFWWHGREMFRKSVWHPGTRARPFMRNSAQRVVTNDPRVR 138 (140) T ss_pred CccchhhhccCCCCceeecCC--CccceeecCCCEEEeeeeecCCCCCChhHHHHHHHHhhhhhhcc Confidence 34699999999887654422 23222 46788888887777755433321 111222 No 23 >protein:vir:97982 Length: 140 # NCBI annotation: gp25 # Family: family:all:1084 # MgeID: mge:1482 # MgeName: Orion # Cross-refs: genbank:acc:YP_655121;genbank:gi:109391871;genbank:GeneID:4157345 Probab=54.76 E-value=0.5 Score=22.25 Aligned_cols=121 Identities=17% Similarity=0.184 Sum_probs=60.9 Q ss_pred CcccccchhhhhhhhcchhHHHHHHHHHHHHHHHHHHHH-HHHhhccCCCcccCccccc-eeeccc--CCcceEEeec-- Q lcl|NC_013694. 1 MAKLIPRRRLNHIVAHLAETKAAIRREAREVEGRARRNL-AEARASTTHSKIVGPGHLT-KIGSAA--DDPDVLVYMD-- 74 (130) Q Consensus 1 MA~lyg~~~~n~vva~~~gv~~~v~~ea~~i~~RAeanL-a~arast~~~kI~g~~~~t-~I~~~~--G~vD~~V~L~-- 74 (130) |+++-.+..+. .....+.+.+....+.+..++-.++ +++++..++ ..++|- +|+... +..-.++... 
T Consensus 1 ~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~v~~~ak~~aPv----dtG~Lr~SI~~~~~~~~~~~~~~~v~~ 73 (140) T protein:vir:97 1 MATIRARARIE---IDEAALERESGEHLRAFHRSLTRRIANQSRVAVPV----RTGNLGRTIGELPQVYTPFRVRGGVEA 73 (140) T ss_pred Ceeeeeeeeee---eCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCc----cchhhhccceeeeeeCCCceEEEEecC Confidence 88877664333 2222333444444444444444433 344444442 245555 576533 2222334443 Q ss_pred -CCCccceeeccCccccccCCCCCCCCC-CCCcceeeccccccCcccccCcc---------cccCCC Q lcl|NC_013694. 75 -APNPMAIEYGHGPSGYFDPDKYGKVTK-APAGLYILNRAAGIAGSMVTPSM---------GRRGVK 130 (130) Q Consensus 75 -d~nAlAIEfGh~psg~f~p~k~G~~tk-a~eGLyILT~AA~l~~~~~~~~~---------~kRgv~ 130 (130) .+-|.-+|||..|+-+.... ++..+ ...|.++..+--+-||+-.-|-- .++-+| T Consensus 74 ~a~YA~~Ve~GT~ph~I~pk~--~k~L~~~~~G~~~~~k~V~hpG~~a~Pfl~~A~~~~~~~~~~i~ 138 (140) T protein:vir:97 74 TADYAAPVHEGSRPHAIRARN--AQYLHFWWHGREMFRKSVWHPGTRARPFMRNSAQRVVTNDPRVR 138 (140) T ss_pred CccchhhhccCCCCceeecCC--CccceeecCCCEEEeeeeecCCCCCChhHHHHHHHHhhhhhhcc Confidence 34699999999887654422 23222 46788888887777755433321 111222 No 24 >protein:vir:99924 Length: 87 # NCBI annotation: gp11 # Family: family:all:1171 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655528;genbank:gi:109392298;genbank:GeneID:4157093 Probab=29.25 E-value=1.4 Score=19.83 Aligned_cols=87 Identities=24% Similarity=0.182 Sum_probs=61.7 Q ss_pred CcccccchhhhhhhhcchhHHHHHHHHHHHHHHHHHHHHHHHhhccCCCcccCccccceeecccCCcceEEeecCCCccc Q lcl|NC_013694. 
1 MAKLIPRRRLNHIVAHLAETKAAIRREAREVEGRARRNLAEARASTTHSKIVGPGHLTKIGSAADDPDVLVYMDAPNPMA 80 (130) Q Consensus 1 MA~lyg~~~~n~vva~~~gv~~~v~~ea~~i~~RAeanLa~arast~~~kI~g~~~~t~I~~~~G~vD~~V~L~d~nAlA 80 (130) |.+|-=+-.=++.|-++|+|+++++.-.-+|.++|-+- + + -+++|.++.+.-+..+-.||.-+-..+|- T Consensus 1 ~~~l~~P~SehrKIR~lpev~kacq~lga~ia~~Ag~i-----A-g-----a~DgYgte~~vGrdR~Rv~V~a~~~aaik 69 (87) T protein:vir:99 1 MGKLKIPISDDKKIRRSSEVRKACQTIGATIAVTAGRI-----A-G-----DSDGYGVEETVGSDRTRVNVYAQHNKTMK 69 (87) T ss_pred CCcccCCcccchhhccchhHHHHHHHhhhHHhhhhccc-----c-C-----CCCCceeeeeecCceeEEEEeecCCceee Confidence 77776554457889999999999999999998887432 1 1 14688888888888888888888788888 Q ss_pred eeeccCccccccCCCCCCCCCCCCcceeeccccccCcccccCcccccC Q lcl|NC_013694. 81 IEYGHGPSGYFDPDKYGKVTKAPAGLYILNRAAGIAGSMVTPSMGRRG 128 (130) Q Consensus 81 IEfGh~psg~f~p~k~G~~tka~eGLyILT~AA~l~~~~~~~~~~kRg 128 (130) .|.|-.|-- +.|| ++-|- T Consensus 70 aE~~~aPlm--------------------q~aA----------~~~~~ 87 (87) T protein:vir:99 70 AEAGATPPL--------------------QQAA----------MRVRK 87 (87) T ss_pred eccCCCchh--------------------hhhh----------hhhcC Confidence 888776532 2222 11111 No 25 >protein:vir:102441 Length: 137 # NCBI annotation: gp26 # Family: family:all:1084 # MgeID: mge:1618 # MgeName: Pipefish # Cross-refs: genbank:acc:YP_655303;genbank:gi:109521866;genbank:GeneID:4157756 Probab=28.76 E-value=0.73 Score=21.32 Aligned_cols=116 Identities=16% Similarity=0.117 Sum_probs=47.2 Q ss_pred Ccccc-cchhhhhhhhcchhHHHHHHHHHHHHHHHHHHHHHHHhhccCCCcccCccccc-eee------cccCCcceEEe Q lcl|NC_013694. 1 MAKLI-PRRRLNHIVAHLAETKAAIRREAREVEGRARRNLAEARASTTHSKIVGPGHLT-KIG------SAADDPDVLVY 72 (130) Q Consensus 1 MA~ly-g~~~~n~vva~~~gv~~~v~~ea~~i~~RAeanLa~arast~~~kI~g~~~~t-~I~------~~~G~vD~~V~ 72 (130) -||+= ..+.+++-+ .+-.+..|.+.+..+...|++|. ++ ..+++- +|+ ...+.++..|. 
T Consensus 4 ~~~~~~~~~~~~~~~--~~v~r~~l~~~a~~v~~~Ak~~a-------Pv----~tG~Lr~SI~~~~~~~~~~~~~~~~V~ 70 (137) T protein:vir:10 4 TARYERNPVGEARQF--QVIARRRLSRITRGTANQARADV-------PV----KTGNLGRSIREDPIVVAGPLRLDSGVT 70 (137) T ss_pred EEEeccCchhHHHHH--HHHHHHHHHHHHHHHHHHHHhcC-------Cc----cchhhhcCceeeeeeccccceEEEEec Confidence 11111 112222110 11223344444444444443322 11 123333 343 23445667777 Q ss_pred ecCCCccceeeccCccccccCCCCCCCCCCCCcceeeccccccCcccccCcccccCCC Q lcl|NC_013694. 73 MDAPNPMAIEYGHGPSGYFDPDKYGKVTKAPAGLYILNRAAGIAGSMVTPSMGRRGVK 130 (130) Q Consensus 73 L~d~nAlAIEfGh~psg~f~p~k~G~~tka~eGLyILT~AA~l~~~~~~~~~~kRgv~ 130 (130) -+.+-|.-+|||..|+-+.-..+.+...-+..|-++..+--+-||+-.-|-- ++.++ T Consensus 71 ~~~~YA~~ve~GT~ph~I~Pk~~k~~l~~~~~g~~vf~k~V~hPG~~a~PfL-~~A~~ 127 (137) T protein:vir:10 71 AHADYARYVHDGTRAHVIRPRRPGGVLRFTVGGRVVYARRVNHPGTRARPFL-RNAAE 127 (137) T ss_pred CCCccceeeecCCCCceeeccccceeeeEeeCCeeEecceeecCCCCCCchH-HHHHH Confidence 7888899999999887765433222211122344454444333322111100 00011 No 26 >protein:vir:102963 Length: 163 # NCBI annotation: hypothetical protein # Family: family:all:1892 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945289;genbank:gi:39653724;uniprot:Q708M3;genbank:GeneID:2672877 Probab=25.57 E-value=1.7 Score=19.26 Aligned_cols=116 Identities=15% Similarity=0.139 Sum_probs=56.1 Q ss_pred Ccccccchhhhhhhhcchh------HHHHHHHHHHHHHHHHHHHHHHHhhccCCCccc---------------------- Q lcl|NC_013694. 1 MAKLIPRRRLNHIVAHLAE------TKAAIRREAREVEGRARRNLAEARASTTHSKIV---------------------- 52 (130) Q Consensus 1 MA~lyg~~~~n~vva~~~g------v~~~v~~ea~~i~~RAeanLa~arast~~~kI~---------------------- 52 (130) |.-=+.-+.+.++.-.+.. ++..+++.+++++ +.+|+.+...||..... 
T Consensus 1 m~~~~d~~~l~~f~k~l~~~~~~~~~~~~~~~~~~e~a---~~ll~~vk~rtPv~~~~~~~~~~~~~~~k~~k~~~~~~~ 77 (163) T protein:vir:10 1 MSGGFDYRSFAKFANNFNRNANHAKVDRFMRQTLNYEG---TELKSKVKERTPVGVYTDHWVEFTTKDGKHVKFWASAHG 77 (163) T ss_pred CCCccCHHHHHHHHHHHHHHhhhcchHHHHHHHHHHHH---HHHHHHHHHhCCcccchhhhhhhhhcccchhhhhccccc Confidence 8777777777776666543 2334445555543 34456666666542110 Q ss_pred -Cccccc------eeecccCCcceEEeecCCCccceeeccCccccccCCCCCCCCCCCCcceeeccccccCcccccCccc Q lcl|NC_013694. 53 -GPGHLT------KIGSAADDPDVLVYMDAPNPMAIEYGHGPSGYFDPDKYGKVTKAPAGLYILNRAAGIAGSMVTPSMG 125 (130) Q Consensus 53 -g~~~~t------~I~~~~G~vD~~V~L~d~nAlAIEfGh~psg~f~p~k~G~~tka~eGLyILT~AA~l~~~~~~~~~~ 125 (130) ..+++- ++..+.+..-.=|.=..+-|--||+||--.. ..+++|.|+|+++-.---+. .|.-- T Consensus 78 k~tG~lr~swk~~~~~k~~~~~~v~v~N~~~YA~~VE~GHR~~~----------gGfV~G~fml~~s~~~~~~~-~~~~~ 146 (163) T protein:vir:10 78 KQGGTLQKGWSKSRIEVSGRTYKQKVYNKVYYAPHVEYGHKTVN----------GGFVPGQFFLHKTVEDTKSD-MEKRV 146 (163) T ss_pred cccchhhccceecceeecCCceEEEEEecCCccchhhcceeecC----------CceeccchhhHHHHHHHHHH-HHHHH Confidence 112222 1222212111112223456888999996321 25889999999987431000 01111 Q ss_pred ccCCC Q lcl|NC_013694. 126 RRGVK 130 (130) Q Consensus 126 kRgv~ 130 (130) +.-++ T Consensus 147 e~~l~ 151 (163) T protein:vir:10 147 RDKYD 151 (163) T ss_pred HHHHH Confidence 11111 No 27 >protein:vir:79034 Length: 141 # NCBI annotation: hypothetical protein # Family: family:all:1892 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110729;genbank:gi:134287346;genbank:GeneID:4955208 Probab=24.69 E-value=1.6 Score=19.44 Aligned_cols=116 Identities=15% Similarity=0.162 Sum_probs=51.0 Q ss_pred Ccccccc--hhhhhhhhcchhH-HHHHHHHHHHHHHH-HHHHHHHHhhccCCCcccCcccc------------ceeeccc Q lcl|NC_013694. 1 MAKLIPR--RRLNHIVAHLAET-KAAIRREAREVEGR-ARRNLAEARASTTHSKIVGPGHL------------TKIGSAA 64 (130) Q Consensus 1 MA~lyg~--~~~n~vva~~~gv-~~~v~~ea~~i~~R-AeanLa~arast~~~kI~g~~~~------------t~I~~~~ 64 (130) ||+.-+- +.++++...+... ...+..+.+..... 
|...|..+...|| | +.+.+ .+++.+. T Consensus 1 M~~~~~~d~~gl~~~~~~l~~~~~~~~~~~~~~~~~~~a~~l~~~vk~~tP---V-dTG~Lr~sw~~~~~~~~~~~~~~g 76 (141) T protein:vir:79 1 MARWGSVDFREFKRVCKKMEKLTKIDLDKFCKDAARELAARLLGKVIRRTP---V-DTGFLRQGWNGVAYARSLPVYKQG 76 (141) T ss_pred CCCCccCcHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhCC---C-cchhhcccccccccccccceeecC Confidence 8875432 4566666655432 22344333332222 3333444444443 1 11221 1222222 Q ss_pred CCcceEEeecCCCccceeeccCccccccCCCCCCCCCCCCcceeeccccccCcccccCcccccCCC Q lcl|NC_013694. 65 DDPDVLVYMDAPNPMAIEYGHGPSGYFDPDKYGKVTKAPAGLYILNRAAGIAGSMVTPSMGRRGVK 130 (130) Q Consensus 65 G~vD~~V~L~d~nAlAIEfGh~psg~f~p~k~G~~tka~eGLyILT~AA~l~~~~~~~~~~kRgv~ 130 (130) +..-.-|.=..+-|--+|+||.-. .|+ .+++|.|+|+++..-- .--.+.--++.++ T Consensus 77 ~~~~v~v~n~~~YA~~VE~Ghr~~-------~~~--gfV~G~fml~~s~~~~-~~~~~~~~~~~l~ 132 (141) T protein:vir:79 77 NNYIIEVVNPTEYASYVNFGHRTK-------DGK--GWVKGQHFLTISEMEL-QSQVDKIIEKKLL 132 (141) T ss_pred CeeEEEEecCCcchhhhhcceeec-------CCc--ceeCCchhHHHHHHHH-HHHHHHHHHHHHH Confidence 222222333345688899999521 132 5889999998875320 0000111112222 No 28 >protein:vir:95062 Length: 116 # NCBI annotation: ORF044 # Family: family:all:180 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240827;genbank:gi:66394711;genbank:GeneID:5133856 Probab=23.21 E-value=2.3 Score=18.56 Aligned_cols=94 Identities=9% Similarity=0.088 Sum_probs=43.1 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhccCCCcccCccccc-eee--cccCCcceEEeecCCCccceeeccCccccccCCCC Q lcl|NC_013694. 20 TKAAIRREAREVEGRARRNLAEARASTTHSKIVGPGHLT-KIG--SAADDPDVLVYMDAPNPMAIEYGHGPSGYFDPDKY 96 (130) Q Consensus 20 v~~~v~~ea~~i~~RAeanLa~arast~~~kI~g~~~~t-~I~--~~~G~vD~~V~L~d~nAlAIEfGh~psg~f~p~k~ 96 (130) |+..++....+.....+++.. +.+ ++ ..++|. +|+ ...+.+-..|.-+.+-|.-+|||+.+.+.-..... 
T Consensus 1 v~~~v~~~~~~~~~~i~~~ak-~~a--pv----~TG~Lr~SI~~~~~~~~~~~~V~~~~~Ya~yvE~GTg~~~~~~~~~~ 73 (116) T protein:vir:95 1 MERWVKRGIAKTTAKIHNTII-SLM--PV----DTGYLRESVTMDFKDGGFTGVINIGSEYAIYVNYGTGIYATGAGGSR 73 (116) T ss_pred ChHHHHHHHHHHHHHHHHHHH-hhC--Cc----cccccccceeEEeecCcEEEEEecCCCccceeecCccccccCCCccc Confidence 555555554444444433322 122 21 134444 454 35666778888888899999999988774322221 Q ss_pred CCCCC----CCCcceeeccccccCcccccCcccccCCC Q lcl|NC_013694. 97 GKVTK----APAGLYILNRAAGIAGSMVTPSMGRRGVK 130 (130) Q Consensus 97 G~~tk----a~eGLyILT~AA~l~~~~~~~~~~kRgv~ 130 (130) +..+. .+.|.++-|+. . ..+.=++ T Consensus 74 ~~~~~~~~~~~~g~~~~t~g--~--------~a~Pfl~ 101 (116) T protein:vir:95 74 AKNIPWSYKDANGKWHTTKG--Q--------HAQPFWE 101 (116) T ss_pred cccccceeecCccceeeCCC--C--------CCCcchH Confidence 21111 12222222210 0 0010011 No 29 >protein:vir:102338 Length: 116 # NCBI annotation: hypothetical protein # Family: family:all:26573 # MgeID: mge:1566 # MgeName: phi CD119 # Cross-refs: genbank:acc:YP_529563;genbank:gi:90592648;genbank:GeneID:3974470 Probab=22.07 E-value=2.2 Score=18.74 Aligned_cols=105 Identities=18% Similarity=0.185 Sum_probs=51.5 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhccCCCcccCccccc---eeecccCCcceEEeecCCCccceeeccCcc-c--cccC Q lcl|NC_013694. 20 TKAAIRREAREVEGRARRNLAEARASTTHSKIVGPGHLT---KIGSAADDPDVLVYMDAPNPMAIEYGHGPS-G--YFDP 93 (130) Q Consensus 20 v~~~v~~ea~~i~~RAeanLa~arast~~~kI~g~~~~t---~I~~~~G~vD~~V~L~d~nAlAIEfGh~ps-g--~f~p 93 (130) +...+.+...+++ ...|..+...||-.++ ..+++- +|...+..=|- |.=..+-|--+||||=-. | .+.| T Consensus 1 l~~~~~~~~~~~a---~~l~~~vk~rTPv~~~-d~G~LR~sW~~g~v~k~~~~-v~N~~eYA~~VE~GHRq~~g~g~~~~ 75 (116) T protein:vir:10 1 MSKNLRRAKNNIG---NKLLRKVKPKTPVAKI-DGGTARKSWKYKELNLFDGV-VSNNVEYIHHLEYGHRTRQGTGTSEN 75 (116) T ss_pred CchHHHHHHHHHH---HHHHHHHHhhCCCCcC-CCcccccCceeeeeeccCce-eecCCcccccccCCceeeCCcceecc Confidence 6667777777664 4456666666763322 123333 12111111111 333345688999999652 3 3554 Q ss_pred CCCCC-CCCCCCcceeeccccccCcccccCcccccCCC Q lcl|NC_013694. 
94 DKYGK-VTKAPAGLYILNRAAGIAGSMVTPSMGRRGVK 130 (130)
Q Consensus 94 ~k~G~-~tka~eGLyILT~AA~l~~~~~~~~~~kRgv~ 130 (130)
..-.+ ..+|++|.|.|+++-.-= ..-.|.--+.-+.
T Consensus 76 ~~~~~~~~~~V~G~fml~~s~~e~-~~~~~~~~~~~~~ 112 (116)
T protein:vir:10 76 YRPKPNGISFVPGVFMLARSVDEM-SSIIDDELNQIII 112 (116)
T ss_pred cccccccCCccCceehHHHHHHHH-HHHHHHHHHHHHH
Confidence 35444322 356999999999886331 0000111111111

Done!