Query lcl|NC_021331.1_cdsid_YP_008059795.1 [gene=M186_gp73] [protein=hypothetical protein] [protein_id=YP_008059795.1] [location=34020..34418] Match_columns 132 No_of_seqs 62 out of 66 Neff 6.4 Searched_HMMs 1612 Date Thu Nov 7 17:31:34 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_73 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_73_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:107704 Length: 132 100.0 2E-51 1.2E-54 298.6 15.5 131 1-132 1-131 (132) 2 protein:vir:103278 Length: 169 100.0 7.6E-51 4.7E-54 295.3 15.3 129 1-129 38-169 (169) 3 protein:vir:79637 Length: 130 100.0 8.3E-50 5.1E-53 289.7 15.0 130 1-130 1-130 (130) 4 protein:vir:104348 Length: 129 100.0 3.4E-47 2.1E-50 275.3 15.5 129 1-129 1-129 (129) 5 protein:vir:94921 Length: 125 99.9 1.6E-24 9.8E-28 151.1 14.6 124 1-130 1-125 (125) 6 protein:vir:80429 Length: 150 99.7 2.3E-20 1.5E-23 128.2 12.3 130 1-132 1-150 (150) 7 protein:vir:97211 Length: 150 99.7 4.9E-19 3.1E-22 120.9 11.6 130 1-132 1-150 (150) 8 protein:vir:95155 Length: 151 99.6 1.6E-18 1E-21 118.1 11.8 128 1-132 2-151 (151) 9 protein:vir:78379 Length: 139 99.0 9E-13 5.6E-16 86.6 5.7 130 1-132 1-137 (139) 10 protein:vir:94997 Length: 139 98.9 4.9E-12 3.1E-15 82.6 5.7 130 1-132 1-137 (139) 11 protein:vir:100242 Length: 114 97.0 1.4E-05 8.8E-09 47.2 8.7 114 1-126 1-114 (114) 12 protein:vir:94096 Length: 141 95.6 0.00087 5.4E-07 37.4 11.6 122 1-132 1-137 (141) 13 protein:vir:96260 Length: 141 95.6 0.00087 5.4E-07 37.4 11.6 122 1-132 1-137 (141) 14 protein:vir:105892 Length: 141 95.6 0.00087 5.4E-07 37.4 11.6 122 1-132 1-137 (141) 15 protein:vir:97325 Length: 145 94.9 0.0021 1.3E-06 35.3 11.5 122 1-132 1-135 (145) 16 protein:vir:95961 Length: 145 94.8 0.0024 1.5E-06 35.0 11.6 122 1-132 1-137 (145) 17 protein:vir:94794 Length: 145 94.8 0.0024 1.5E-06 34.9 11.6 122 1-132 1-137 (145) 18 protein:vir:96125 Length: 140 94.8 0.0022 1.3E-06 35.2 11.2 123 1-132 3-137 (140) 19 protein:vir:107096 Length: 145 94.6 0.0024 1.5E-06 34.9 11.1 123 1-132 1-137 (145) 20 protein:vir:105337 Length: 145 94.6 0.0025 1.5E-06 34.9 11.1 123 1-132 1-137 (145) 21 protein:vir:95111 Length: 145 94.5 0.0031 1.9E-06 34.3 11.5 122 1-132 1-137 (145) 22 protein:vir:93736 Length: 145 94.4 0.0035 2.2E-06 34.0 11.5 121 1-132 1-137 (145) 23 protein:vir:94488 Length: 145 94.4 0.0035 2.2E-06 34.0 11.5 121 1-132 1-137 (145) 24 protein:vir:97421 Length: 145 94.4 0.0035 2.2E-06 34.0 11.5 121 1-132 1-137 (145) 25 protein:vir:96894 Length: 140 94.3 0.0043 2.7E-06 33.5 11.8 121 1-132 1-137 (140) 26 protein:vir:1244 Length: 145 # 94.1 0.0035 2.2E-06 34.0 10.8 123 1-132 1-137 (145) 27 protein:vir:80371 Length: 115 91.9 0.0029 1.8E-06 34.5 7.2 113 1-126 1-115 (115) 28 protein:vir:4348 Length: 121 # 91.2 0.017 1.1E-05 30.3 11.9 116 1-130 1-121 (121) 29 protein:vir:1438 Length: 115 # 87.6 0.014 8.6E-06 30.8 7.3 112 1-126 1-115 (115) 30 protein:vir:10368 Length: 118 87.4 0.039 2.4E-05 28.3 12.4 113 1-130 1-118 (118) 31 protein:vir:5979 Length: 134 # 86.7 0.044 2.7E-05 28.0 10.4 121 1-131 1-134 (134) 32 protein:vir:397 Length: 132 # 86.3 0.046 2.9E-05 27.9 11.7 123 1-128 1-132 (132) 33 protein:vir:100116 Length: 115 86.3 0.019 1.2E-05 30.1 7.3 112 1-126 1-115 (115) 34 protein:vir:1892 Length: 121 # 86.2 0.048 2.9E-05 27.8 11.5 117 1-130 1-121 (121) 35 protein:vir:81066 Length: 118 83.1 0.071 4.4E-05 26.9 12.1 113 1-130 1-118 (118) 36 protein:vir:3428 Length: 131 # 80.9 0.09 5.6E-05 26.3 12.3 124 1-128 1-131 (131) 37 protein:vir:96800 Length: 127 80.0 0.05 3.1E-05 27.7 7.0 124 1-130 1-127 (127) 38 protein:vir:97070 Length: 118 76.7 0.13 8.2E-05 25.4 11.9 115 1-131 1-118 (118) 39 protein:vir:93602 Length: 114 70.9 0.2 0.00013 24.4 11.1 114 1-128 1-114 (114) 40 protein:vir:79047 Length: 145 68.3 0.24 0.00015 24.0 12.4 119 1-132 1-124 (145) 41 protein:vir:195 Length: 115 # 67.4 0.25 0.00016 23.9 10.1 113 1-126 1-115 (115) 42 protein:vir:3874 Length: 114 # 64.9 0.28 0.00017 23.6 7.3 106 1-110 1-114 (114) 43 protein:vir:79571 Length: 137 47.4 0.7 0.00044 21.4 12.0 125 1-128 6-137 (137) 44 protein:vir:80105 Length: 162 41.9 0.91 0.00056 20.8 11.6 123 1-132 8-146 (162) 45 protein:vir:105826 Length: 134 35.5 0.67 0.00041 21.5 4.3 113 1-132 1-120 (134) 46 protein:vir:102609 Length: 134 35.5 0.67 0.00041 21.5 4.3 113 1-132 1-120 (134) 47 protein:vir:7994 Length: 134 # 35.4 0.68 0.00042 21.5 4.3 113 1-132 1-120 (134) 48 protein:vir:98343 Length: 126 27.8 1.8 0.0011 19.2 8.3 112 1-132 1-120 (126) 49 protein:vir:9415 Length: 126 # 27.8 1.8 0.0011 19.2 8.3 112 1-132 1-120 (126) 50 protein:vir:8331 Length: 150 # 20.2 1.7 0.001 19.3 3.6 110 1-132 16-134 (150) No 1 >protein:vir:107704 Length: 132 # NCBI annotation: hypothetical protein # Family: family:all:5121 # MgeID: mge:1518 # MgeName: T1 # Cross-refs: genbank:acc:YP_003903;genbank:gi:45686319;genbank:GeneID:2773044 Probab=100.00 E-value=2e-51 Score=298.59 Aligned_cols=131 Identities=59% Similarity=1.030 Sum_probs=127.4 Q ss_pred CCHHHHHHHHHHHhhcccCCCcEEcCCCCCCCCCCCccEEEEEEccCCceeeecCCCceEEEEEEEEEEEEeCCCChHHH Q lcl|NC_021331. 1 MHYELMLSARKALATEYETRFMIAYENVEFTPPGDGSPWLKFDYAEVDTEYLSLDRKCVSYIGMIQVGIVFPPGYGTDRP 80 (132) Q Consensus 1 M~~el~~~al~~~a~a~~~~~pva~pN~~F~Pp~~g~~yLr~~~~pa~t~~~~L~~~~~~~~G~~qv~v~~p~G~G~~~a 80 (132) ||||+..+++++++. +..++||||||+.|+||++|++|||+++||++|++.+|+++|+.|+|+|||+||+|+|+|+.++ T Consensus 1 ~hyE~~~a~r~~la~-~~~~lpVA~eNv~F~Pp~~G~~yLr~~~lpa~T~~~~L~~d~r~y~Gv~QI~Vv~paG~G~~~a 79 (132) T protein:vir:10 1 MHYELSAAARAAFLS-KYRDFPHYMENRNFTPPKDGGMWLRFNYIEGDTLYLSIDRKCKSYIAIVQIGVVFPPGSGVDEA 79 (132) T ss_pred CchHHHHHHHHHHHh-hhcCCcEeecCCCcCCCCCCceEEEEEEccCCceeeeccCcCcEEEEEEEEEEEecCCCCcchh Confidence 999999999998775 3457999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHhhhcCceeeeEEEecCccccCceeCCCeEEEEEEEEEEeecCC Q lcl|NC_021331. 81 RVLAKEIAQFFYDGKMLEHGYIYEGARVHKPLKSESGWILPIRFYVRIETKE 132 (132) Q Consensus 81 ~~lA~~iaa~F~~g~~l~~~~i~~~p~~~~~~~~~~~~~ipVsi~yRadt~~ 132 (132) .+|||+|+++|++|++|++|+|+++|+++|+|+++.+|++||||.||||||| T Consensus 80 ~~iAd~i~~~F~~g~~l~~Gyi~~~~~~~p~i~~~s~~~iPvrf~yR~Dt~~ 131 (132) T protein:vir:10 80 RLKAKEIADFFKDGKMLNVGYIFEGAIVHQIVKHESGWMIPVRFTVRVDTKE 131 (132) T ss_pred HHHHHHHHHhccCcceeecceecCCCccCCceeCCcceEEEEEEEEEecccC Confidence 9999999999999999999999999999999999999999999999999999 No 2 >protein:vir:103278 Length: 169 # NCBI annotation: phage-related conserved hypothetical protein # Family: family:all:5121 # MgeID: mge:1605 # MgeName: JK06 # Cross-refs: genbank:acc:YP_277458;genbank:gi:71834100;genbank:GeneID:3562389 Probab=100.00 E-value=7.6e-51 Score=295.34 Aligned_cols=129 Identities=53% Similarity=1.019 Sum_probs=124.4 Q ss_pred CCHHHHH---HHHHHHhhcccCCCcEEcCCCCCCCCCCCccEEEEEEccCCceeeecCCCceEEEEEEEEEEEEeCCCCh Q lcl|NC_021331. 1 MHYELML---SARKALATEYETRFMIAYENVEFTPPGDGSPWLKFDYAEVDTEYLSLDRKCVSYIGMIQVGIVFPPGYGT 77 (132) Q Consensus 1 M~~el~~---~al~~~a~a~~~~~pva~pN~~F~Pp~~g~~yLr~~~~pa~t~~~~L~~~~~~~~G~~qv~v~~p~G~G~ 77 (132) ||||+.. |++.+|+.++++++||||||+.|+||++|++|||++++|++|.+.+|+++|+.|+|+|||+||+|+|+|+ T Consensus 38 ~h~ei~~a~rk~l~~~a~a~~~~LpVA~ENVaFtPp~dG~~YLr~~~lPadT~~~~L~gd~R~y~GVfQIsVV~PaGtG~ 117 (169) T protein:vir:10 38 VHYEMMVAARKLVSDAAVDIAGSLPVAYENCGFTPPKNGSSWLKFDYTEVDSVTWGLQRTCRYYVGMVQVSIFFSPGEGT 117 (169) T ss_pred hHHHHHHHHHHHHHHHHhhcccCCcEeeCCCCcCCCCCCccEEEEEEecCCceeeeccCCCceEEEEEEEEEEecCCCCc Confidence 9999955 4555678888899999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHhhhcCceeeeEEEecCccccCceeCCCeEEEEEEEEEEee Q lcl|NC_021331. 78 DRPRVLAKEIAQFFYDGKMLEHGYIYEGARVHKPLKSESGWILPIRFYVRIE 129 (132) Q Consensus 78 ~~a~~lA~~iaa~F~~g~~l~~~~i~~~p~~~~~~~~~~~~~ipVsi~yRad 129 (132) +++.++||||+++|++|++|++|||++.|+++|+|+++.+|+||||+.|||| T Consensus 118 ~ka~qiAdeiadlF~~gt~L~~Gyi~~~~~~~p~i~~~s~~~iPvr~~~R~D 169 (169) T protein:vir:10 118 DRPRQLAGRLSEAFADGTMLDSGYIYEGGSVFPPVKSQSGWFIPVRFYVRMD 169 (169) T ss_pred chhHHHHHHHHHhhhCCceeeceeecCCCeECCeeecCCceEEeEEEEEEeC Confidence 9999999999999999999999999999999999999999999999999999 No 3 >protein:vir:79637 Length: 130 # NCBI annotation: gp41 # Family: family:all:5121 # MgeID: mge:1872 # MgeName: TLS # Cross-refs: genbank:acc:YP_001285530;genbank:gi:148734513;genbank:GeneID:5219995 Probab=100.00 E-value=8.3e-50 Score=289.66 Aligned_cols=130 Identities=65% Similarity=1.146 Sum_probs=128.1 Q ss_pred CCHHHHHHHHHHHhhcccCCCcEEcCCCCCCCCCCCccEEEEEEccCCceeeecCCCceEEEEEEEEEEEEeCCCChHHH Q lcl|NC_021331. 1 MHYELMLSARKALATEYETRFMIAYENVEFTPPGDGSPWLKFDYAEVDTEYLSLDRKCVSYIGMIQVGIVFPPGYGTDRP 80 (132) Q Consensus 1 M~~el~~~al~~~a~a~~~~~pva~pN~~F~Pp~~g~~yLr~~~~pa~t~~~~L~~~~~~~~G~~qv~v~~p~G~G~~~a 80 (132) ||||+..+||++.+......+||||||+.|+||++|++|||++++|++|++.+|+++|++|+|+|||+|++|+|+|++++ T Consensus 1 ~~~e~~~aaR~~~~~~~~~~lpVA~ENv~FtPp~~G~~YLr~~~lpa~T~~~~L~~d~r~y~Gv~QI~VV~paG~G~~~a 80 (130) T protein:vir:79 1 MHYELSVAARMALAQEYESEYMIAYENVEFTPPKGGGIWLKYDYKEADTIIHDLKRKCISYIGMVQIGIEFPPGSGIDKA 80 (130) T ss_pred CcchhhHHHHHHHHhhhhhhCceeecCCCcCCCCCCceEEEEEecCCCceeeeccCCCceEEEEEEEEEEecCCCCcchh Confidence 99999999999998877778999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHhhhcCceeeeEEEecCccccCceeCCCeEEEEEEEEEEeec Q lcl|NC_021331. 81 RVLAKEIAQFFYDGKMLEHGYIYEGARVHKPLKSESGWILPIRFYVRIET 130 (132) Q Consensus 81 ~~lA~~iaa~F~~g~~l~~~~i~~~p~~~~~~~~~~~~~ipVsi~yRadt 130 (132) ++|||+|+++|++|++|++|||++.|+++|+++++.+|+||||+.||||- T Consensus 81 ~~iA~ei~dlF~~g~~L~~Gyi~~~~~~~p~i~~~~~~~iPvr~~~R~d~ 130 (130) T protein:vir:79 81 RKLAKNIADFFEDGKMLSNGYISEGAKVHQVQKSESGWFYPVRFYVRYDG 130 (130) T ss_pred hHHHHHHHHhccCCceeeceeecCCCeECCeeecCCceEEeEEEEEEecC Confidence 99999999999999999999999999999999999999999999999999 No 4 >protein:vir:104348 Length: 129 # NCBI annotation: hypothetical protein # Family: family:all:5121 # MgeID: mge:1593 # MgeName: RTP # Cross-refs: genbank:acc:YP_398976;genbank:gi:81343960;genbank:GeneID:3778880 Probab=100.00 E-value=3.4e-47 Score=275.34 Aligned_cols=129 Identities=47% Similarity=0.913 Sum_probs=124.1 Q ss_pred CCHHHHHHHHHHHhhcccCCCcEEcCCCCCCCCCCCccEEEEEEccCCceeeecCCCceEEEEEEEEEEEEeCCCChHHH Q lcl|NC_021331. 1 MHYELMLSARKALATEYETRFMIAYENVEFTPPGDGSPWLKFDYAEVDTEYLSLDRKCVSYIGMIQVGIVFPPGYGTDRP 80 (132) Q Consensus 1 M~~el~~~al~~~a~a~~~~~pva~pN~~F~Pp~~g~~yLr~~~~pa~t~~~~L~~~~~~~~G~~qv~v~~p~G~G~~~a 80 (132) |+-.+.+....+++..++..+||||||+.|+||++|++|||++++|++|++.+|+++|++|+|+|||+|++|+|+|++++ T Consensus 1 ~s~aar~~v~d~~~~~~~~~lpVA~eNv~FtPp~~G~~YLr~~~lpa~T~~~~L~~d~r~y~Gv~QI~Vv~p~G~G~~~a 80 (129) T protein:vir:10 1 MSLAARKFVNDLLVNEFPVRYPVAWENAAFTPPADGSIWLKYDYTEVDTVTYGLSRKCKYYVGMVQISVFFSPGTGIDKP 80 (129) T ss_pred CchHHHHHHHHHHHHhhcCCCcEeecCCCcCCCCCCceEEEEEecCCCceeeeccCCCceEEEEEEEEEEecCCCCcchh Confidence 88777776666777778889999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHhhhcCceeeeEEEecCccccCceeCCCeEEEEEEEEEEee Q lcl|NC_021331. 81 RVLAKEIAQFFYDGKMLEHGYIYEGARVHKPLKSESGWILPIRFYVRIE 129 (132) Q Consensus 81 ~~lA~~iaa~F~~g~~l~~~~i~~~p~~~~~~~~~~~~~ipVsi~yRad 129 (132) ++|||+|+++|++|++|++|||++.|+++|+|+++.+|+||||+.|||| T Consensus 81 ~~iA~ei~d~F~~g~~L~~Gyi~~~~~~~p~i~~~~~~~ipvr~~~r~d 129 (129) T protein:vir:10 81 RQIANQLAESIVDGTMLDSGTIYESGVVNPVIKSKSGWFIPVRFYVRLD 129 (129) T ss_pred hHHHHHHHHhccCCceeeceeecCCCeECCeeecCCceEEeEEEEEEeC Confidence 9999999999999999999999999999999999999999999999999 No 5 >protein:vir:94921 Length: 125 # NCBI annotation: possible peptidoglycan binding protein # Family: family:all:5248 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239283;genbank:gi:66392065;genbank:GeneID:5076566 Probab=99.86 E-value=1.6e-24 Score=151.08 Aligned_cols=124 Identities=16% Similarity=0.152 Sum_probs=105.9 Q ss_pred CCHHHHHHHHHHHhhcccCCCcEEcCCCCCCCCCCCccEEEEEEccCCceeeecCCCceEEEEEEEEEEEEeCCCChHHH Q lcl|NC_021331. 1 MHYELMLSARKALATEYETRFMIAYENVEFTPPGDGSPWLKFDYAEVDTEYLSLDRKCVSYIGMIQVGIVFPPGYGTDRP 80 (132) Q Consensus 1 M~~el~~~al~~~a~a~~~~~pva~pN~~F~Pp~~g~~yLr~~~~pa~t~~~~L~~~~~~~~G~~qv~v~~p~G~G~~~a 80 (132) |+++.++.++.++..+..++.||+|||.+ ||. .++|+|+++.++++...++|+.|.+++|++.|+||+|.|.|...+ T Consensus 1 Mt~~q~r~~I~~r~~a~~~~~~I~~~N~p--p~~-~~~W~Rlti~~g~~~~a~iG~~~~~rtGli~iqiF~p~~~G~~~~ 77 (125) T protein:vir:94 1 MSYFQEKLDIENYFKANWPDTPIFYENRT--ANS-TGTWVRLTIQNGDAFQASNGEVSYRHPGVVFVQIFTKKEVGSGEA 77 (125) T ss_pred CCHHHHHHHHHHHHHhCCCccceeeCCCC--CCC-CCceEEEEeccCcccccccCCceeeeeeEEEEEeeecCCcChHHH Confidence 99999999999998888889999999974 444 469999999999999999998889999999999999999999999 Q ss_pred HHHHHHHHHhhhcCceeeeEEE-ecCccccCceeCCCeEEEEEEEEEEeec Q lcl|NC_021331. 81 RVLAKEIAQFFYDGKMLEHGYI-YEGARVHKPLKSESGWILPIRFYVRIET 130 (132) Q Consensus 81 ~~lA~~iaa~F~~g~~l~~~~i-~~~p~~~~~~~~~~~~~ipVsi~yRadt 130 (132) .++||++++||...+. |.+ ...++.-..-++++||+++|+|+||+-+ T Consensus 78 ~~~ad~~~~~f~~~~~---g~i~f~~~~~~~~g~~~gwyQ~Nv~I~f~~~~ 125 (125) T protein:vir:94 78 LKLADKVDALFRSKTL---GNIQFKVPQVQKVPSTTEWYQVNVSTEFYRGS 125 (125) T ss_pred HHHHHHHHHHHccCCC---CceEEeeceecCCCCCCCEEEEEEEEeeecCC Confidence 9999999999976532 221 1123333333579999999999999999 No 6 >protein:vir:80429 Length: 150 # NCBI annotation: BcepGomrgp11 # Family: family:all:5248 # MgeID: mge:1882 # MgeName: BcepGomr # Cross-refs: genbank:acc:YP_001210231;genbank:gi:146329923;genbank:GeneID:5123538 Probab=99.73 E-value=2.3e-20 Score=128.20 Aligned_cols=130 Identities=20% Similarity=0.212 Sum_probs=101.1 Q ss_pred CCHHHHHHH---HHHHhhccc---------CCCcEEcCCCCCCCCCCCc-cEEEEEEccCCceeeecCC----CceEEEE Q lcl|NC_021331. 1 MHYELMLSA---RKALATEYE---------TRFMIAYENVEFTPPGDGS-PWLKFDYAEVDTEYLSLDR----KCVSYIG 63 (132) Q Consensus 1 M~~el~~~a---l~~~a~a~~---------~~~pva~pN~~F~Pp~~g~-~yLr~~~~pa~t~~~~L~~----~~~~~~G 63 (132) |+.+.+++. +..+...|+ +.-.|+|||+.|.+|-+|+ +|.|+++.++++.+.+|++ .+.++.| T Consensus 1 ~~~~~~~ar~ei~~~f~~~W~~~~~~~~~g~~~~~~w~~~~~~~pP~g~~~WaRLti~h~~~~qA~~~~~~~gr~~~r~G 80 (150) T protein:vir:80 1 MIQDALQARSDINTMLFDQWSVADWSKVKGGKPNIAWEGRESARPPDGSAPYVAIFIKHVDGQQASLTDPDMLRRWSRDG 80 (150) T ss_pred CcchhhhhHHHHHHHHhhhhccCcchhhcCCcceeeecCcccCCcCCCCCceEEEEEecCCcccccccCCCCcceEeeCc Confidence 999988775 223333332 1124999999996666665 7999999999999999973 4678889 Q ss_pred EEEEEEEEe--CCCChHHHHHHHHHHHHhhhcCceeeeEEE-ecCccccCceeCCCeEEEEEEEEEEeecCC Q lcl|NC_021331. 64 MIQVGIVFP--PGYGTDRPRVLAKEIAQFFYDGKMLEHGYI-YEGARVHKPLKSESGWILPIRFYVRIETKE 132 (132) Q Consensus 64 ~~qv~v~~p--~G~G~~~a~~lA~~iaa~F~~g~~l~~~~i-~~~p~~~~~~~~~~~~~ipVsi~yRadt~~ 132 (132) ++.|+||+| .|+|..-+.++||++.+||.....- |.| ...+++-..-+++.||+++|+|+|+.|+.- T Consensus 81 lI~VQiF~p~~~G~G~~la~k~Ad~a~eaFe~~~t~--g~i~f~~as~~eiG~d~gWYQ~NV~ipF~yde~r 150 (150) T protein:vir:80 81 LITVQCFGMLSAGQGLEDATYQATIAMRAFEGKQSA--NGIWFRNARIKEIGSDRGWYQVNMIVEFEYDEVR 150 (150) T ss_pred EEEEEEeeeccCCchhhHHHHHHHHHHHHHhccCCC--CCcccccccccccCCCCceEEEEeEeeeeccccC Confidence 999999999 6999999999999999999765422 322 223444455567799999999999999999 No 7 >protein:vir:97211 Length: 150 # NCBI annotation: hypothetical protein ORF026 # Family: family:all:5248 # MgeID: mge:1657 # MgeName: M6 # Cross-refs: genbank:acc:YP_001294534;genbank:gi:149408255;genbank:GeneID:5237076 Probab=99.66 E-value=4.9e-19 Score=120.95 Aligned_cols=130 Identities=15% Similarity=0.200 Sum_probs=99.2 Q ss_pred CCHHHHHHHHH----HHhhcc-------cCCC--cEEcCCCCCCCCCCCc-cEEEEEEccCCceeeecC---CCceEEEE Q lcl|NC_021331. 1 MHYELMLSARK----ALATEY-------ETRF--MIAYENVEFTPPGDGS-PWLKFDYAEVDTEYLSLD---RKCVSYIG 63 (132) Q Consensus 1 M~~el~~~al~----~~a~a~-------~~~~--pva~pN~~F~Pp~~g~-~yLr~~~~pa~t~~~~L~---~~~~~~~G 63 (132) |+.--+..+|. .+...| .+.. .++|||++|.+|-+|+ +|.|+++.+.++...+|| +.+.++.| T Consensus 1 ~~~~tF~qaR~ei~t~f~~~W~a~~~a~~g~~p~~~~w~~~~~~~~P~g~~~WaRLti~~~~~~~as~G~~~gr~~~r~G 80 (150) T protein:vir:97 1 MTLPTFDSARDEILGLFNTKWITDTPALNGGAPIRVEWPGVDAGDPPPADKPYARITLRHTTSRQATFGPTGGRRFTRPG 80 (150) T ss_pred CCCCcHHHHHHHHHhhhhhhccccchhhcCCcceeeccCCcccCCCcCCCCceEEEEeeccccccccccCCCCcEEeeCc Confidence 76555555544 222222 2334 4999999988777765 699999999999999998 45678889 Q ss_pred EEEEEEEEe--CCCChHHHHHHHHHHHHhhhcCceeeeEEE-ecCccccCceeCCCeEEEEEEEEEEeecCC Q lcl|NC_021331. 64 MIQVGIVFP--PGYGTDRPRVLAKEIAQFFYDGKMLEHGYI-YEGARVHKPLKSESGWILPIRFYVRIETKE 132 (132) Q Consensus 64 ~~qv~v~~p--~G~G~~~a~~lA~~iaa~F~~g~~l~~~~i-~~~p~~~~~~~~~~~~~ipVsi~yRadt~~ 132 (132) ++.|+||+| .|+|..-+.++||++.+||..... .+.| ...+++-..-+++.||+++|+|+|+.|+.- T Consensus 81 li~VQiF~p~~~G~G~~la~~~Ad~a~eaFe~~~t--~g~i~f~~a~~~eig~~~gWyQ~Nv~i~Feyde~r 150 (150) T protein:vir:97 81 LITVQVFTPLSGGQGLSLAEKCAIIARDAFEGRGT--ASGIWFRNARIQEIGPDGAWYQMNVVVEFEYDELR 150 (150) T ss_pred EEEEEEeeeccCCchhhHHHHHHHHHHHHHhccCC--cCCeecccccccccCCCCceEEEEeEeeeeccccC Confidence 999999999 599999999999999999976542 2222 223455555567799999999999999999 No 8 >protein:vir:95155 Length: 151 # NCBI annotation: hypothetical protein ORF015 # Family: family:all:5248 # MgeID: mge:1552 # MgeName: PA73 # Cross-refs: genbank:acc:YP_001293422;genbank:gi:148912843;genbank:GeneID:5228230 Probab=99.64 E-value=1.6e-18 Score=118.09 Aligned_cols=128 Identities=14% Similarity=0.187 Sum_probs=92.6 Q ss_pred CCHHHHHHHHHHHhhcc----cCCCc-----EEcCCC-CCCCCCCCccEEEEEEccCCceeeecC-------CCceEEEE Q lcl|NC_021331. 1 MHYELMLSARKALATEY----ETRFM-----IAYENV-EFTPPGDGSPWLKFDYAEVDTEYLSLD-------RKCVSYIG 63 (132) Q Consensus 1 M~~el~~~al~~~a~a~----~~~~p-----va~pN~-~F~Pp~~g~~yLr~~~~pa~t~~~~L~-------~~~~~~~G 63 (132) |.+|+++..+..+..+. ++++. |+|||. .|+||....+|+|+++.++++.+.+|+ +.|.+++| T Consensus 2 mtf~q~R~~i~~~~~~~w~~~~~~~a~~~p~v~~~~~~~~d~P~g~~~WaRLti~h~~~~qA~ls~~~eigggp~~~rtG 81 (151) T protein:vir:95 2 IEFDQVNDEVNALFLATWNAGSAAIAGYVPEIRWQGVQYRDLPDGSKFWVRLSKQTVFEEQATLSTCEGVPGQRKYTASG 81 (151) T ss_pred ccHHHHHHHHHHHhhhhcccCchhhhccccccccCCCCCCCCCCCCCceEEEEeecCCCccccccccccCCCCceEeeCc Confidence 99999999766654333 23444 455555 366666567999999999999999883 35789999 Q ss_pred EEEEEEEEeCCCChHHHHHHHHHH----HHhhhcCceeeeEEE-ecCccccCceeCCCeEEEEEEEEEEeecCC Q lcl|NC_021331. 64 MIQVGIVFPPGYGTDRPRVLAKEI----AQFFYDGKMLEHGYI-YEGARVHKPLKSESGWILPIRFYVRIETKE 132 (132) Q Consensus 64 ~~qv~v~~p~G~G~~~a~~lA~~i----aa~F~~g~~l~~~~i-~~~p~~~~~~~~~~~~~ipVsi~yRadt~~ 132 (132) ++.|+||+|+|+|.. .++|+++ .+||....+. |.| ...+++...-+++.||+++|+|+|-.|+.- T Consensus 82 li~VQiF~p~~~G~~--Le~Adkla~~a~eaFe~~~t~--g~i~f~~~s~~eiG~~~gWyQ~Nv~i~f~y~e~~ 151 (151) T protein:vir:95 82 LVFVQIFCPKSNTQA--FELGQKLAKLARNAFRGKSTP--GKVWFRNTRINELPPEELYERFNVVTEFEYDEIG 151 (151) T ss_pred EEEEEEeeeccCchh--hHHHHHHHHHHHHHhhccCCC--CCceeeeeeecccCCCCCeEEEEeeeeecccccC Confidence 999999999988844 3555555 6788655432 212 123444444467799999999999999999 No 9 >protein:vir:78379 Length: 139 # NCBI annotation: hypothetical protein # Family: family:all:12033 # MgeID: mge:1851 # MgeName: SETP3 # Cross-refs: genbank:acc:YP_001110845;genbank:gi:134288606;genbank:GeneID:5179642 Probab=98.98 E-value=9e-13 Score=86.61 Aligned_cols=130 Identities=18% Similarity=0.198 Sum_probs=105.7 Q ss_pred CC--HHHHHHHHHHHhhcc--cCCCcEEcCCCCCCCCCCCccEEEEEEccCCceeeecCCCceEEEEEEEEEEEEeCCCC Q lcl|NC_021331. 1 MH--YELMLSARKALATEY--ETRFMIAYENVEFTPPGDGSPWLKFDYAEVDTEYLSLDRKCVSYIGMIQVGIVFPPGYG 76 (132) Q Consensus 1 M~--~el~~~al~~~a~a~--~~~~pva~pN~~F~Pp~~g~~yLr~~~~pa~t~~~~L~~~~~~~~G~~qv~v~~p~G~G 76 (132) |- +|-+-++......++ ..+++|+.||++ .|.+...|||..+++-.||++.+|-... ++.|++||+|.+-..-| T Consensus 1 matyfedltkafdtalvafgtnngikvalenid-aptstdtpylasymllsdteqadlfwte-qragvyqvdinvgsalg 78 (139) T protein:vir:78 1 MATYFEDLTKAFDTALVAFGTNNGIKVALENID-APTSTDTPYLASYMLLSDTEQADLFWTE-QRAGVYQVDINVGSALG 78 (139) T ss_pred CchhHHHHHHhhhheeeeeccCCceeEeeeccC-CCccCCcchhhheeeeccCcccceeeec-ccCceEEEeeecccccc Confidence 42 344444443333333 248999999998 4556667999999999999999998754 44799999999999999 Q ss_pred hHHHHHHHHHHHHhhhcCceeeeE---EEecCccccCceeCCCeEEEEEEEEEEeecCC Q lcl|NC_021331. 77 TDRPRVLAKEIAQFFYDGKMLEHG---YIYEGARVHKPLKSESGWILPIRFYVRIETKE 132 (132) Q Consensus 77 ~~~a~~lA~~iaa~F~~g~~l~~~---~i~~~p~~~~~~~~~~~~~ipVsi~yRadt~~ 132 (132) ......+||++.+.|..|-.+... .-.+..+.+|.|.+.+|-.-|+||+|-++|.. T Consensus 79 sapinrladklnaafaagncfsrneicaevqsvslgplivengwakrplsinfiaftar 137 (139) T protein:vir:78 79 SAPINRLADKLNAAFAAGNCFSRNEICAEVQSVSLGPLIVENGWAKRPLSINFIAFTAR 137 (139) T ss_pred cchhHHHHhhhhhhhhccccccchhhhhhhhhccccceeeccCcccCceeeeeeeeeee Confidence 999999999999999999777532 33567899999999999999999999999988 No 10 >protein:vir:94997 Length: 139 # NCBI annotation: hypothetical protein # Family: family:all:12033 # MgeID: mge:1547 # MgeName: KS7 # Cross-refs: genbank:acc:YP_224021;genbank:gi:62327308;genbank:GeneID:5176825 Probab=98.87 E-value=4.9e-12 Score=82.57 Aligned_cols=130 Identities=18% Similarity=0.204 Sum_probs=105.5 Q ss_pred CC--HHHHHHHHHHHhhcc--cCCCcEEcCCCCCCCCCCCccEEEEEEccCCceeeecCCCceEEEEEEEEEEEEeCCCC Q lcl|NC_021331. 1 MH--YELMLSARKALATEY--ETRFMIAYENVEFTPPGDGSPWLKFDYAEVDTEYLSLDRKCVSYIGMIQVGIVFPPGYG 76 (132) Q Consensus 1 M~--~el~~~al~~~a~a~--~~~~pva~pN~~F~Pp~~g~~yLr~~~~pa~t~~~~L~~~~~~~~G~~qv~v~~p~G~G 76 (132) |- +|-+-++......++ +.+++||.||+. .|.+...|||..+++-.||++.+|-... ++.|++||+|.+-..-| T Consensus 1 matyfedltkafdtalvtfgtdndikvalenid-aptstdapylasymllsdteqadlfwte-qragvyqvdinvgsalg 78 (139) T protein:vir:94 1 MATYFEDLTKAFDTALVTFGTDNDIKVALENID-APTSTDAPYLASYMLLSDTEQADLFWTE-QRAGVYQVDINVGSALG 78 (139) T ss_pred CchhHHHHHHhhhheeeeeccCCCceEEeeccC-CCcccCcchhhheeecccCcccceeeec-ccCceEEEeeecccccc Confidence 42 333444433322322 458999999998 4556678999999999999999998754 44799999999999999 Q ss_pred hHHHHHHHHHHHHhhhcCceeeeE---EEecCccccCceeCCCeEEEEEEEEEEeecCC Q lcl|NC_021331. 77 TDRPRVLAKEIAQFFYDGKMLEHG---YIYEGARVHKPLKSESGWILPIRFYVRIETKE 132 (132) Q Consensus 77 ~~~a~~lA~~iaa~F~~g~~l~~~---~i~~~p~~~~~~~~~~~~~ipVsi~yRadt~~ 132 (132) ......+||++..-|..|-.+.-. .-.+..+.+|.|.+.+|-.-|+||+|-++|.. T Consensus 79 sapinrladklnttfaagncfsrneicaevqsvslgplivengwakrplsinfiaftar 137 (139) T protein:vir:94 79 SAPINRLADKLNTTFAAGNCFSRNEICAEVQSVSLGPLIVENGWAKRPLSINFIAFTAR 137 (139) T ss_pred cchhHHHHHhhhhhhhccccccchhhhhhhhhccccceeeccCcccCceeeeeeeeeee Confidence 999999999999999998777532 33567899999999999999999999999988 No 11 >protein:vir:100242 Length: 114 # NCBI annotation: gp71 # Family: family:all:2712 # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355407;genbank:gi:77864697;genbank:GeneID:3725964 Probab=96.95 E-value=1.4e-05 Score=47.16 Aligned_cols=114 Identities=16% Similarity=0.072 Sum_probs=85.6 Q ss_pred CCHHHHHHHHHHHhhcccCCCcEEcCCCCCCCCCCCccEEEEEEccCCceeeecCCCceEEEEEEEEEEEEeCCCChHHH Q lcl|NC_021331. 1 MHYELMLSARKALATEYETRFMIAYENVEFTPPGDGSPWLKFDYAEVDTEYLSLDRKCVSYIGMIQVGIVFPPGYGTDRP 80 (132) Q Consensus 1 M~~el~~~al~~~a~a~~~~~pva~pN~~F~Pp~~g~~yLr~~~~pa~t~~~~L~~~~~~~~G~~qv~v~~p~G~G~~~a 80 (132) |+.=.+++|+-..+-+ -.|.++. |.....||+- +....+..-.+|+|..-..+|.|||+|+.+. -.+| T Consensus 1 ~~~~~i~~~l~~~~g~------~~~~~~a--P~~~~~Py~v-y~rvsg~p~~tL~G~~g~~~~r~QiD~yA~T---~~eA 68 (114) T protein:vir:10 1 MSALTIRDAIGIVGGA------KGYVSVA--SSAAQSPYYV-VSRVSGTRDMALGGATGGKSGMFQIDVYAKT---YTEA 68 (114) T ss_pred Cceeeeehhhcccccc------cccCCCC--CCCCCCceEE-EEeccCcccccccCCCCcceEEEEEEeeeCC---HHHH Confidence 8888888887775432 2344443 4455678887 4445566667899988888999999999765 5789 Q ss_pred HHHHHHHHHhhhcCceeeeEEEecCccccCceeCCCeEEEEEEEEE Q lcl|NC_021331. 81 RVLAKEIAQFFYDGKMLEHGYIYEGARVHKPLKSESGWILPIRFYV 126 (132) Q Consensus 81 ~~lA~~iaa~F~~g~~l~~~~i~~~p~~~~~~~~~~~~~ipVsi~y 126 (132) +++|+++..+-..+..|..+-+++.+........--+..+-+||.| T Consensus 69 ~~La~~~~~~l~~~~~f~~~~l~~~~d~ye~dT~l~Rvsld~si~f 114 (114) T protein:vir:10 69 DSLADQIIDRVESTGMFSVGGVSDLPDDYSSDTGVFRVSLEISVQF 114 (114) T ss_pred HHHHHHHHhhcccccCeeeeccccCCCCCCcccCceEEEEEEEEeC Confidence 9999999988877776776767777777776666677888899999 No 12 >protein:vir:94096 Length: 141 # NCBI annotation: ORF031 # Family: family:all:296 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240240;genbank:gi:66395916;genbank:GeneID:5133265 Probab=95.63 E-value=0.00087 Score=37.37 Aligned_cols=122 Identities=6% Similarity=0.002 Sum_probs=70.5 Q ss_pred CCHHH---HHHHHHHHhhcccCCC------cEEcCCCCCCCCCCCccEEEEEEccCCceeeecCCCc-eEEEEEEEEEEE Q lcl|NC_021331. 1 MHYEL---MLSARKALATEYETRF------MIAYENVEFTPPGDGSPWLKFDYAEVDTEYLSLDRKC-VSYIGMIQVGIV 70 (132) Q Consensus 1 M~~el---~~~al~~~a~a~~~~~------pva~pN~~F~Pp~~g~~yLr~~~~pa~t~~~~L~~~~-~~~~G~~qv~v~ 70 (132) |++.. +.+|+-++..+.. .+ +| |.++. . +.+| .++.-+..+..+.+.+| ....-.++|+|+ T Consensus 1 Msms~~~aLQ~Ai~~~L~ada-al~alvg~rI-~D~~P----~-~~~~--PYv~lG~~~~~~~~~~~~~g~~~~~ti~Vw 71 (141) T protein:vir:94 1 MWVSVEPELTNQIYKRLISDP-NINKLVDDRV-FDVVQ----D-DAVY--PYIVVGESNVTNNESSATMRETVGIVIHVY 71 (141) T ss_pred CccchhHHHHHHHHHHhhcCh-hhHhhcCCcc-ccCCc----c-CCCC--CEEEeCCceeeecCCCcccceEEEEEEEEE Confidence 55443 3444444433321 11 33 55543 2 2333 34555677777777766 566788999998 Q ss_pred EeCCCChHHHHHHHHHHHHhhhcCceeeeEEE-ecCccccCce--eCCCeE--EEEEEEEEEeecCC Q lcl|NC_021331. 71 FPPGYGTDRPRVLAKEIAQFFYDGKMLEHGYI-YEGARVHKPL--KSESGW--ILPIRFYVRIETKE 132 (132) Q Consensus 71 ~p~G~G~~~a~~lA~~iaa~F~~g~~l~~~~i-~~~p~~~~~~--~~~~~~--~ipVsi~yRadt~~ 132 (132) . .+.|..++++||+.|.+.....+.+..+++ .--......+ +|+..+ .+-+++.+|.+||+ T Consensus 72 s-~~~g~~eak~ia~av~~AL~~~l~l~~~~lv~l~~~~~~~~rd~dg~t~hgvl~~ra~v~~~~~~ 137 (141) T protein:vir:94 72 S-QFATQYEAKLILSAIGYVLNRPIEIDNYEFQFSRIDSQAVFPDIDRFTKHGTIRLLFKYRHKKKN 137 (141) T ss_pred E-cCCCHHHHHHHHHHHHHHhcccccCCCceEEEEEEeeeeeeecCCCceEEEEEEEEEEEEecccc Confidence 6 566899999999999998865555554332 1111111222 233333 46666667788887 No 13 >protein:vir:96260 Length: 141 # NCBI annotation: ORF027 # Family: family:all:296 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240317;genbank:gi:66395991;genbank:GeneID:5133337 Probab=95.63 E-value=0.00087 Score=37.37 Aligned_cols=122 Identities=6% Similarity=0.002 Sum_probs=70.5 Q ss_pred CCHHH---HHHHHHHHhhcccCCC------cEEcCCCCCCCCCCCccEEEEEEccCCceeeecCCCc-eEEEEEEEEEEE Q lcl|NC_021331. 1 MHYEL---MLSARKALATEYETRF------MIAYENVEFTPPGDGSPWLKFDYAEVDTEYLSLDRKC-VSYIGMIQVGIV 70 (132) Q Consensus 1 M~~el---~~~al~~~a~a~~~~~------pva~pN~~F~Pp~~g~~yLr~~~~pa~t~~~~L~~~~-~~~~G~~qv~v~ 70 (132) |++.. +.+|+-++..+.. .+ +| |.++. . +.+| .++.-+..+..+.+.+| ....-.++|+|+ T Consensus 1 Msms~~~aLQ~Ai~~~L~ada-al~alvg~rI-~D~~P----~-~~~~--PYv~lG~~~~~~~~~~~~~g~~~~~ti~Vw 71 (141) T protein:vir:96 1 MWVSVEPELTNQIYKRLISDP-NINKLVDDRV-FDVVQ----D-DAVY--PYIVVGESNVTNNESSATMRETVGIVIHVY 71 (141) T ss_pred CccchhHHHHHHHHHHhhcCh-hhHhhcCCcc-ccCCc----c-CCCC--CEEEeCCceeeecCCCcccceEEEEEEEEE Confidence 55443 3444444433321 11 33 55543 2 2333 34555677777777766 566788999998 Q ss_pred EeCCCChHHHHHHHHHHHHhhhcCceeeeEEE-ecCccccCce--eCCCeE--EEEEEEEEEeecCC Q lcl|NC_021331. 71 FPPGYGTDRPRVLAKEIAQFFYDGKMLEHGYI-YEGARVHKPL--KSESGW--ILPIRFYVRIETKE 132 (132) Q Consensus 71 ~p~G~G~~~a~~lA~~iaa~F~~g~~l~~~~i-~~~p~~~~~~--~~~~~~--~ipVsi~yRadt~~ 132 (132) . .+.|..++++||+.|.+.....+.+..+++ .--......+ +|+..+ .+-+++.+|.+||+ T Consensus 72 s-~~~g~~eak~ia~av~~AL~~~l~l~~~~lv~l~~~~~~~~rd~dg~t~hgvl~~ra~v~~~~~~ 137 (141) T protein:vir:96 72 S-QFATQYEAKLILSAIGYVLNRPIEIDNYEFQFSRIDSQAVFPDIDRFTKHGTIRLLFKYRHKKKN 137 (141) T ss_pred E-cCCCHHHHHHHHHHHHHHhcccccCCCceEEEEEEeeeeeeecCCCceEEEEEEEEEEEEecccc Confidence 6 566899999999999998865555554332 1111111222 233333 46666667788887 No 14 >protein:vir:105892 Length: 141 # NCBI annotation: tail protein # Family: family:all:296 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004380;genbank:gi:122891835;genbank:GeneID:4712363 Probab=95.63 E-value=0.00087 Score=37.37 Aligned_cols=122 Identities=6% Similarity=0.002 Sum_probs=70.5 Q ss_pred CCHHH---HHHHHHHHhhcccCCC------cEEcCCCCCCCCCCCccEEEEEEccCCceeeecCCCc-eEEEEEEEEEEE Q lcl|NC_021331. 1 MHYEL---MLSARKALATEYETRF------MIAYENVEFTPPGDGSPWLKFDYAEVDTEYLSLDRKC-VSYIGMIQVGIV 70 (132) Q Consensus 1 M~~el---~~~al~~~a~a~~~~~------pva~pN~~F~Pp~~g~~yLr~~~~pa~t~~~~L~~~~-~~~~G~~qv~v~ 70 (132) |++.. +.+|+-++..+.. .+ +| |.++. . +.+| .++.-+..+..+.+.+| ....-.++|+|+ T Consensus 1 Msms~~~aLQ~Ai~~~L~ada-al~alvg~rI-~D~~P----~-~~~~--PYv~lG~~~~~~~~~~~~~g~~~~~ti~Vw 71 (141) T protein:vir:10 1 MWVSVEPELTNQIYKRLISDP-NINKLVDDRV-FDVVQ----D-DAVY--PYIVVGESNVTNNESSATMRETVGIVIHVY 71 (141) T ss_pred CccchhHHHHHHHHHHhhcCh-hhHhhcCCcc-ccCCc----c-CCCC--CEEEeCCceeeecCCCcccceEEEEEEEEE Confidence 55443 3444444433321 11 33 55543 2 2333 34555677777777766 566788999998 Q ss_pred EeCCCChHHHHHHHHHHHHhhhcCceeeeEEE-ecCccccCce--eCCCeE--EEEEEEEEEeecCC Q lcl|NC_021331. 71 FPPGYGTDRPRVLAKEIAQFFYDGKMLEHGYI-YEGARVHKPL--KSESGW--ILPIRFYVRIETKE 132 (132) Q Consensus 71 ~p~G~G~~~a~~lA~~iaa~F~~g~~l~~~~i-~~~p~~~~~~--~~~~~~--~ipVsi~yRadt~~ 132 (132) . .+.|..++++||+.|.+.....+.+..+++ .--......+ +|+..+ .+-+++.+|.+||+ T Consensus 72 s-~~~g~~eak~ia~av~~AL~~~l~l~~~~lv~l~~~~~~~~rd~dg~t~hgvl~~ra~v~~~~~~ 137 (141) T protein:vir:10 72 S-QFATQYEAKLILSAIGYVLNRPIEIDNYEFQFSRIDSQAVFPDIDRFTKHGTIRLLFKYRHKKKN 137 (141) T ss_pred E-cCCCHHHHHHHHHHHHHHhcccccCCCceEEEEEEeeeeeeecCCCceEEEEEEEEEEEEecccc Confidence 6 566899999999999998865555554332 1111111222 233333 46666667788887 No 15 >protein:vir:97325 Length: 145 # NCBI annotation: ORF030 # Family: family:all:296 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240617;genbank:gi:66396297;genbank:GeneID:5133681 Probab=94.94 E-value=0.0021 Score=35.32 Aligned_cols=122 Identities=7% Similarity=-0.032 Sum_probs=71.7 Q ss_pred CCHHH---HHHHHHHHhhcccCCC------cEEcCCCCCCCCCCCccEEEEEEccCCceeeecCCCc-eEEEEEEEEEEE Q lcl|NC_021331. 1 MHYEL---MLSARKALATEYETRF------MIAYENVEFTPPGDGSPWLKFDYAEVDTEYLSLDRKC-VSYIGMIQVGIV 70 (132) Q Consensus 1 M~~el---~~~al~~~a~a~~~~~------pva~pN~~F~Pp~~g~~yLr~~~~pa~t~~~~L~~~~-~~~~G~~qv~v~ 70 (132) |++-+ +.+|+-++..+.. .+ +| |.++.=.+ -.|| +.-+.++..+.+.+| ....-.++|+|+ T Consensus 1 Ms~s~~~aLq~Ai~~~L~ad~-~l~alvggrV-~D~~P~~a---~~PY----v~lG~~~~~d~~~~~~~g~~~~~ti~Vw 71 (145) T protein:vir:97 1 MWVSVERYLFNKVYNKLKSNL-IIRKQLDGRV-FDCVQKDA---VYPY----IVVGETNVTNKETTTSMVEDVGITLHVY 71 (145) T ss_pred CcchHhHHHHHHHHHHhhcCh-hHHHhhcCce-ecCCccCC---CCCE----EEeCcceeeecCCCcccceEEEEEEEEE Confidence 99765 3334444333321 11 33 45544211 1354 444666666666665 566788999998 Q ss_pred EeCCCChHHHHHHHHHHHHhhhcCceeeeEEEe---cCccccCceeCCCeEEEEEEEEEEeecCC Q lcl|NC_021331. 71 FPPGYGTDRPRVLAKEIAQFFYDGKMLEHGYIY---EGARVHKPLKSESGWILPIRFYVRIETKE 132 (132) Q Consensus 71 ~p~G~G~~~a~~lA~~iaa~F~~g~~l~~~~i~---~~p~~~~~~~~~~~~~ipVsi~yRadt~~ 132 (132) .. ..|..++++||+.|.+.....+.|..+++. -.-+..--.+++..+..-++|.+|...+- T Consensus 72 s~-~~g~~eak~ia~av~~aL~~~l~l~~~~lv~l~~~~~~~~rd~dg~~~hgvl~fra~ve~~~ 135 (145) T protein:vir:97 72 SQ-ARNRDEASQIIQFLGFVLNNEIEIDYYSFIKSRIDTQEVITDIDQYTKHGIIRLVFKYRHNT 135 (145) T ss_pred Ec-CCCHHHHHHHHHHHHHHhccccCCCCCeEEEeEEeeeeEeecCCCceEEEEEEEEEEEecCc Confidence 75 668999999999999988766666654321 11111112245556666677777766655 No 16 >protein:vir:95961 Length: 145 # NCBI annotation: ORF032 # Family: family:all:296 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240391;genbank:gi:66396072;genbank:GeneID:5133472 Probab=94.81 E-value=0.0024 Score=34.96 Aligned_cols=122 Identities=7% Similarity=0.021 Sum_probs=68.7 Q ss_pred CCHHH---HHHHHHHHhhccc------CCCcEEcCCCCCCCCCCCccEEEEEEccCCceeeecCCCc-eEEEEEEEEEEE Q lcl|NC_021331. 1 MHYEL---MLSARKALATEYE------TRFMIAYENVEFTPPGDGSPWLKFDYAEVDTEYLSLDRKC-VSYIGMIQVGIV 70 (132) Q Consensus 1 M~~el---~~~al~~~a~a~~------~~~pva~pN~~F~Pp~~g~~yLr~~~~pa~t~~~~L~~~~-~~~~G~~qv~v~ 70 (132) |+|-+ +.+|+-++..+.+ +. +| |.++.=.+ --|| +.-+.+...+.+.+| ....-.|+|+|+ T Consensus 1 Ms~s~~~aLq~Ai~~~L~ada~l~alvgg-rV-~D~~P~~~---~~PY----v~lG~~~~~d~~~~~~~g~~~~~ti~Vw 71 (145) T protein:vir:95 1 MWVSVERYLFNKVYNKLKSNPIIQKQLDG-RV-FDCVQKDA---VYPY----IVVGETNVTNKETTTSMVEDVGITLHVY 71 (145) T ss_pred CchhHHHHHHHHHHHHhhcCHhHHHhhcc-cc-ccCCcCCC---CCCE----EEecCceeeecCCCcccceEEEEEEEEE Confidence 88665 3334444332221 11 32 45444111 1344 444666666666665 566788999998 Q ss_pred EeCCCChHHHHHHHHHHHHhhhcCceeeeEEEe-cCccccCc--eeCCCeE--EEEEEEEEEeecCC Q lcl|NC_021331. 71 FPPGYGTDRPRVLAKEIAQFFYDGKMLEHGYIY-EGARVHKP--LKSESGW--ILPIRFYVRIETKE 132 (132) Q Consensus 71 ~p~G~G~~~a~~lA~~iaa~F~~g~~l~~~~i~-~~p~~~~~--~~~~~~~--~ipVsi~yRadt~~ 132 (132) .. +.|..++++||+.|.+.....+.|..+++. --...... .+++..+ .+-+++.++-++|+ T Consensus 72 s~-~~g~~eak~ia~av~~aL~~~l~l~~~~lv~l~~~~~~~~rd~dg~~~hgvl~fra~ve~~~~~ 137 (145) T protein:vir:95 72 SQ-ARNRDEASQIIQFLGFVLNNEIEIDYYSFIKSRIDTQEVITDIDQYTKHGVIRLVFKYRHNTLQ 137 (145) T ss_pred Ec-CCCHHHHHHHHHHHHHHhccccCCCCCeEEEeEEeeeeEeecCCCceEEEEEEEEEEEEecccc Confidence 74 568999999999999988766666654321 11111111 2344444 45555556666766 No 17 >protein:vir:94794 Length: 145 # NCBI annotation: ORF028 # Family: family:all:296 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240542;genbank:gi:66396219;genbank:GeneID:5133574 Probab=94.79 E-value=0.0024 Score=34.92 Aligned_cols=122 Identities=8% Similarity=0.024 Sum_probs=68.8 Q ss_pred CCHHH---HHHHHHHHhhccc------CCCcEEcCCCCCCCCCCCccEEEEEEccCCceeeecCCCc-eEEEEEEEEEEE Q lcl|NC_021331. 1 MHYEL---MLSARKALATEYE------TRFMIAYENVEFTPPGDGSPWLKFDYAEVDTEYLSLDRKC-VSYIGMIQVGIV 70 (132) Q Consensus 1 M~~el---~~~al~~~a~a~~------~~~pva~pN~~F~Pp~~g~~yLr~~~~pa~t~~~~L~~~~-~~~~G~~qv~v~ 70 (132) |+|-+ +.+|+-++..+.+ +. +| |.++.=.+ --|| +.-+.+...+.+.+| ....-.|+|+|+ T Consensus 1 Ms~s~~~aLq~Ai~~~L~ada~l~alvgg-rV-~D~~P~~~---~~PY----v~lG~~~~~d~~~~~~~g~~~~~ti~Vw 71 (145) T protein:vir:94 1 MWVSVERYLFNKVYNKLKSNPIIQKQLDG-RV-FDCVQKDA---VYPY----IVVGETNVTNKETTTSMVEDVGITLHVY 71 (145) T ss_pred CchhHHHHHHHHHHHHhhcCHhHHHhhcc-cc-ccCCcCCC---CCCE----EEecCceeeecCCCcccceEEEEEEEEE Confidence 88665 3334444332221 11 32 45444111 1344 444666666666665 566788999998 Q ss_pred EeCCCChHHHHHHHHHHHHhhhcCceeeeEEEe-cCccccCc--eeCCCeE--EEEEEEEEEeecCC Q lcl|NC_021331. 71 FPPGYGTDRPRVLAKEIAQFFYDGKMLEHGYIY-EGARVHKP--LKSESGW--ILPIRFYVRIETKE 132 (132) Q Consensus 71 ~p~G~G~~~a~~lA~~iaa~F~~g~~l~~~~i~-~~p~~~~~--~~~~~~~--~ipVsi~yRadt~~ 132 (132) .. +.|..++++||+.|.+.....+.|..+++. --...... .+++..+ .+-+++.++-++|+ T Consensus 72 s~-~~g~~eak~ia~av~~aL~~~l~l~~~~lv~l~~~~~~~~rd~dg~~~hgvl~fra~ve~~~~~ 137 (145) T protein:vir:94 72 SQ-ARNRDEASQIIQFLGFVLNNEIEIDYYSFIKSRIDTQEVITDIDQYTKHGIIRLVFKYRHNTLQ 137 (145) T ss_pred Ec-CCCHHHHHHHHHHHHHHhccccCCCCCeEEEeEEeeeeEeecCCCceEEEEEEEEEEEEecccc Confidence 74 568999999999999988766666654321 11111111 2344444 45555556667766 No 18 >protein:vir:96125 Length: 140 # NCBI annotation: ORF038 # Family: family:all:296 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240084;genbank:gi:66395765;genbank:GeneID:5133106 Probab=94.76 E-value=0.0022 Score=35.18 Aligned_cols=123 Identities=11% Similarity=0.013 Sum_probs=70.3 Q ss_pred CCHHH-HHHHHHHHhhcccC-----CCcEEcCCCCCCCCCCCccEEEEEEccCCceeeecCCCc-eEEEEEEEEEEEEeC Q lcl|NC_021331. 1 MHYEL-MLSARKALATEYET-----RFMIAYENVEFTPPGDGSPWLKFDYAEVDTEYLSLDRKC-VSYIGMIQVGIVFPP 73 (132) Q Consensus 1 M~~el-~~~al~~~a~a~~~-----~~pva~pN~~F~Pp~~g~~yLr~~~~pa~t~~~~L~~~~-~~~~G~~qv~v~~p~ 73 (132) |+.+. +.+|+-+++.+... +-+ .|.+++ .++ .-||+ .-+.++..+-+.+| ....=.++|+|+. . T Consensus 3 msa~~aLq~Ai~~~L~ad~~l~alvggr-VyD~~P-~~~--~~PYV----~lG~~~~~~~~~~~~~g~~~~~tl~Vws-~ 73 (140) T protein:vir:96 3 VTAEPLLYNKIMNNLIENPITDKLVGGR-VFDCVQ-KDV--VYPYI----VVGESNVTESERSPGMREIIAITFHVYS-Q 73 (140) T ss_pred cchhHHHHHHHHHHhccChhHHhhcCcc-cccCCc-cCC--CCCEE----EeCCceeeecCCCcccceEEEEEEEEEE-c Confidence 77774 55555555443321 113 355544 111 13544 44566666666655 4456678889765 5 Q ss_pred CCChHHHHHHHHHHHHhhhcCceeeeEEE-ecCccccCce--eCCCeE--EEEEEEEEEeecCC Q lcl|NC_021331. 74 GYGTDRPRVLAKEIAQFFYDGKMLEHGYI-YEGARVHKPL--KSESGW--ILPIRFYVRIETKE 132 (132) Q Consensus 74 G~G~~~a~~lA~~iaa~F~~g~~l~~~~i-~~~p~~~~~~--~~~~~~--~ipVsi~yRadt~~ 132 (132) ..|..++.+||+.|.+.....+.++.+++ .-.-.....+ +|+..+ .+-+++.++.++|+ T Consensus 74 ~~g~~ea~~ia~ai~~aL~~~l~l~~~~lv~l~~~~~~~~rd~dg~t~hgvl~~ra~ve~~~~~ 137 (140) T protein:vir:96 74 YENGAEARELLKYLNYACRLNINFKDYELEWIKKDNSQVFTDIDQYTKHGVLRLLYKVRHKTLQ 137 (140) T ss_pred CCCHHHHHHHHHHHHHHhcCCccCCCceEEEEEEeeeEEeecCCCceEEEEEEEEEEEeecccc Confidence 77999999999999998865566654432 1111112222 233323 46777778888888 No 19 >protein:vir:107096 Length: 145 # NCBI annotation: conserved phage protein # Family: family:all:296 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950611;genbank:gi:119953691;genbank:GeneID:4643105 Probab=94.61 E-value=0.0024 Score=34.94 Aligned_cols=123 Identities=7% Similarity=0.017 Sum_probs=69.9 Q ss_pred CCHHH---HHHHHHHHhhcccCCCc-----EEcCCCCCCCCCCCccEEEEEEccCCceeeecCCCc-eEEEEEEEEEEEE Q lcl|NC_021331. 1 MHYEL---MLSARKALATEYETRFM-----IAYENVEFTPPGDGSPWLKFDYAEVDTEYLSLDRKC-VSYIGMIQVGIVF 71 (132) Q Consensus 1 M~~el---~~~al~~~a~a~~~~~p-----va~pN~~F~Pp~~g~~yLr~~~~pa~t~~~~L~~~~-~~~~G~~qv~v~~ 71 (132) |+|-+ +.+|+-++..+.+ ++. --|.++.=.+ --|| +.-+.+...+.+.+| ....-.++|+|+. T Consensus 1 Ms~s~~~aLq~Ai~~~L~ad~-al~alvg~rVyD~~P~~a---~~Py----V~lG~~~~~~~~~~~~~g~~~~~ti~Vws 72 (145) T protein:vir:10 1 MWVSVERYLFNKIYNKLKSNP-IIKKQLGGRVFDCVQKDA---VYPY----IVVGETNVTNKETTTSMFEDVGVTLHVYS 72 (145) T ss_pred CchhHHHHHHHHHHHHhhcCh-hHHHhhccccccCCccCC---CCCE----EEeCcceeeecCCCcccceEEEEEEEEEE Confidence 98654 3344444433321 111 1255543111 1244 444666666777665 4667889999987 Q ss_pred eCCCChHHHHHHHHHHHHhhhcCceeeeEEE-ecCccccCce--eCCCeE--EEEEEEEEEeecCC Q lcl|NC_021331. 72 PPGYGTDRPRVLAKEIAQFFYDGKMLEHGYI-YEGARVHKPL--KSESGW--ILPIRFYVRIETKE 132 (132) Q Consensus 72 p~G~G~~~a~~lA~~iaa~F~~g~~l~~~~i-~~~p~~~~~~--~~~~~~--~ipVsi~yRadt~~ 132 (132) . ..|..++.+||+.|.+.....+.+..+++ .-.......+ +++..+ .+-+++..+-++|+ T Consensus 73 ~-~~g~~ea~~ia~av~~aL~a~l~l~~~~lv~l~~~~~~~~rd~dg~~~hgvl~~ra~ve~~~~~ 137 (145) T protein:vir:10 73 Q-ARNRDEASQIIQYLGFVLNSEIEINNYSFIKSRIDTQEVITDIDQYTKHGIIRLIFKYRHNTLQ 137 (145) T ss_pred c-CCCHHHHHHHHHHHHHHhCCCcCCCCCeEEEEEEeeeeEeecCCCceEEEEEEEEEEEeecccc Confidence 4 66889999999999998865565654432 1111111222 233333 46666677788887 No 20 >protein:vir:105337 Length: 145 # NCBI annotation: conserved phage protein # Family: family:all:296 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950674;genbank:gi:119967844;genbank:GeneID:4643216 Probab=94.58 E-value=0.0025 Score=34.88 Aligned_cols=123 Identities=7% Similarity=0.014 Sum_probs=69.9 Q ss_pred CCHHH---HHHHHHHHhhcccCCCc-----EEcCCCCCCCCCCCccEEEEEEccCCceeeecCCCc-eEEEEEEEEEEEE Q lcl|NC_021331. 1 MHYEL---MLSARKALATEYETRFM-----IAYENVEFTPPGDGSPWLKFDYAEVDTEYLSLDRKC-VSYIGMIQVGIVF 71 (132) Q Consensus 1 M~~el---~~~al~~~a~a~~~~~p-----va~pN~~F~Pp~~g~~yLr~~~~pa~t~~~~L~~~~-~~~~G~~qv~v~~ 71 (132) |+|-+ +.+|+-++..+.+ ++. --|.++.=.+ --|| +.-+.+...+.+.+| ....-.++|+|+. T Consensus 1 Ms~s~~~aLq~Ai~~~L~ad~-al~alvg~rVyD~~P~~a---~~Py----V~lG~~~~~~~~~~~~~g~~~~~ti~Vws 72 (145) T protein:vir:10 1 MWVSVERYLFNKIYNKLKSNP-IVSKQLGGRVFDCVQKDA---VYPY----IVVGETNVTNKETTTSMFEDVGVTLHVYS 72 (145) T ss_pred CchhHHHHHHHHHHHHhhcCh-hHHHhhccccccCCccCC---CCCE----EEeCcceeeecCCCcccceEEEEEEEEEE Confidence 98654 3344444433321 111 1255543111 1244 444666666677665 4667889999987 Q ss_pred eCCCChHHHHHHHHHHHHhhhcCceeeeEEE-ecCccccCce--eCCCeE--EEEEEEEEEeecCC Q lcl|NC_021331. 72 PPGYGTDRPRVLAKEIAQFFYDGKMLEHGYI-YEGARVHKPL--KSESGW--ILPIRFYVRIETKE 132 (132) Q Consensus 72 p~G~G~~~a~~lA~~iaa~F~~g~~l~~~~i-~~~p~~~~~~--~~~~~~--~ipVsi~yRadt~~ 132 (132) . ..|..++.+||+.|.+.....+.+..+++ .-.......+ +++..+ .+-+++..+-++|+ T Consensus 73 ~-~~g~~ea~~ia~av~~aL~a~l~l~~~~lv~l~~~~~~~~rd~dg~~~hgvl~~ra~ve~~~~~ 137 (145) T protein:vir:10 73 Q-ARNRDEASQIIQYLGFVLNSEIEINNYSFIKSRIDTQEVITDIDQYTKHGIIRLIFKYRHNTLQ 137 (145) T ss_pred c-CCCHHHHHHHHHHHHHHhCCCcCCCCCeEEEEEEeeeeEeecCCCceEEEEEEEEEEEeecccc Confidence 4 66889999999999998865565654432 1111111222 233333 46666677788887 No 21 >protein:vir:95111 Length: 145 # NCBI annotation: ORF030 # Family: family:all:296 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240829;genbank:gi:66394699;genbank:GeneID:5133905 Probab=94.53 E-value=0.0031 Score=34.31 Aligned_cols=122 Identities=8% Similarity=0.013 Sum_probs=69.9 Q ss_pred CCHHH---HHHHHHHHhhcccCCC------cEEcCCCCCCCCCCCccEEEEEEccCCceeeecCCCc-eEEEEEEEEEEE Q lcl|NC_021331. 1 MHYEL---MLSARKALATEYETRF------MIAYENVEFTPPGDGSPWLKFDYAEVDTEYLSLDRKC-VSYIGMIQVGIV 70 (132) Q Consensus 1 M~~el---~~~al~~~a~a~~~~~------pva~pN~~F~Pp~~g~~yLr~~~~pa~t~~~~L~~~~-~~~~G~~qv~v~ 70 (132) |++-+ +.+|+-++..+.. .+ + -|.++.=.+ -.|| +.-+.....+.+.+| ....-.++|+|+ T Consensus 1 Ms~s~~~aLq~Ai~~~L~ada-~l~alvggr-V~D~~P~~a---~~PY----V~lG~~~~~~~~~~~~~g~~~~~ti~Vw 71 (145) T protein:vir:95 1 MWVSVERYLFNKVYNKLKSNS-IIQKQLDGR-VFDCVQKDA---VYPY----IVVGETNVTNKETTTSMVEDVGITLHVY 71 (145) T ss_pred CchhHHHHHHHHHHHHhhcCh-hHHHhhcCc-eecCCcCCC---CCCE----EEecCceeeecCCCcccceEEEEEEEEE Confidence 98765 3334444433321 11 3 345544211 1355 444556666666665 466788999988 Q ss_pred EeCCCChHHHHHHHHHHHHhhhcCceeeeEEEe-cCccccCc--eeCCCeE--EEEEEEEEEeecCC Q lcl|NC_021331. 71 FPPGYGTDRPRVLAKEIAQFFYDGKMLEHGYIY-EGARVHKP--LKSESGW--ILPIRFYVRIETKE 132 (132) Q Consensus 71 ~p~G~G~~~a~~lA~~iaa~F~~g~~l~~~~i~-~~p~~~~~--~~~~~~~--~ipVsi~yRadt~~ 132 (132) .. +.|..++++||+.|.+.....+.|+.+++. --...... .+++..+ .+-+++.++-|||+ T Consensus 72 s~-~~g~~eak~ia~av~~aL~~~l~l~~~~lv~l~~~~~~~~rd~dg~~~hgvl~~ra~ve~~~~~ 137 (145) T protein:vir:95 72 SQ-ARNRDEASQIIQFLGFVLNNEIEIDYYSFIKSRIDTQEVITDIDRYTKHGIIRLVFKYRHNTLQ 137 (145) T ss_pred Ec-CCCHHHHHHHHHHHHHHhccccCCCCCeEEEeEEeeeeEeecCCCceEEEEEEEEEEEEecccc Confidence 64 568999999999999988766667655321 11111111 2334344 45666667777777 No 22 >protein:vir:93736 Length: 145 # NCBI annotation: ORF030 # Family: family:all:296 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240465;genbank:gi:66396143;genbank:GeneID:5133505 Probab=94.38 E-value=0.0035 Score=34.03 Aligned_cols=121 Identities=9% Similarity=0.014 Sum_probs=69.9 Q ss_pred CCHHH---HHHHHHHHhhcccCCC------cEEcCCCCCCCCCCCccEEEEEEccCCceeeecCCCc-eEEEEEEEEEEE Q lcl|NC_021331. 1 MHYEL---MLSARKALATEYETRF------MIAYENVEFTPPGDGSPWLKFDYAEVDTEYLSLDRKC-VSYIGMIQVGIV 70 (132) Q Consensus 1 M~~el---~~~al~~~a~a~~~~~------pva~pN~~F~Pp~~g~~yLr~~~~pa~t~~~~L~~~~-~~~~G~~qv~v~ 70 (132) |++-+ +.+|+-++..+.. .+ + -|.++.=.+ -.|| +.-+.....+.+.+| ....-.++|+|+ T Consensus 1 Ms~s~~~aLq~Ai~~~L~ada-~l~alvggr-I~D~~P~~a---~~PY----V~lG~~~~~d~~~~~~~g~~~~~ti~Vw 71 (145) T protein:vir:93 1 MWVSVERYLFNKVYNKLKSNL-IIQKQLDGR-VFDCVQKDA---VYPY----IVVGETNVTNKETTTSMVEDVGITLHVY 71 (145) T ss_pred CchhHHHHHHHHHHHHhhcCh-hHHHhhcCc-eecCCcCCC---CCCE----EEeCCceeeecCCCcccceEEEEEEEEE Confidence 99765 3334444433321 11 3 355544211 1355 444566666666665 466788999998 Q ss_pred EeCCCChHHHHHHHHHHHHhhhcCceeeeEEEe-c---CccccCceeCCCeE--EEEEEEEEEeecCC Q lcl|NC_021331. 71 FPPGYGTDRPRVLAKEIAQFFYDGKMLEHGYIY-E---GARVHKPLKSESGW--ILPIRFYVRIETKE 132 (132) Q Consensus 71 ~p~G~G~~~a~~lA~~iaa~F~~g~~l~~~~i~-~---~p~~~~~~~~~~~~--~ipVsi~yRadt~~ 132 (132) .. +.|..++++||+.|.+.....+.+..+++. - .-.... .+++..+ .+-+++.++-|||+ T Consensus 72 s~-~~g~~eak~ia~av~~aL~~~l~l~~~~lv~l~~~~~~~~r-d~dg~~~hgvl~fra~ve~~~~~ 137 (145) T protein:vir:93 72 SQ-ARNRDEASQIIQFLGFVLNNEIEIDYYSFIKSRIDTQEVIT-DIDQYTKHGIIRLVFKYRHNTLQ 137 (145) T ss_pred Ec-CCCHHHHHHHHHHHHHHhccccCCCCCeEEEeEEeeeeEee-cCCcceEEEEEEEEEEEEecccc Confidence 75 668999999999999887666666655321 1 112221 2334334 45566667777777 No 23 >protein:vir:94488 Length: 145 # NCBI annotation: ORF032 # Family: family:all:296 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240682;genbank:gi:66396364;genbank:GeneID:5133752 Probab=94.38 E-value=0.0035 Score=34.03 Aligned_cols=121 Identities=9% Similarity=0.014 Sum_probs=69.9 Q ss_pred CCHHH---HHHHHHHHhhcccCCC------cEEcCCCCCCCCCCCccEEEEEEccCCceeeecCCCc-eEEEEEEEEEEE Q lcl|NC_021331. 1 MHYEL---MLSARKALATEYETRF------MIAYENVEFTPPGDGSPWLKFDYAEVDTEYLSLDRKC-VSYIGMIQVGIV 70 (132) Q Consensus 1 M~~el---~~~al~~~a~a~~~~~------pva~pN~~F~Pp~~g~~yLr~~~~pa~t~~~~L~~~~-~~~~G~~qv~v~ 70 (132) |++-+ +.+|+-++..+.. .+ + -|.++.=.+ -.|| +.-+.....+.+.+| ....-.++|+|+ T Consensus 1 Ms~s~~~aLq~Ai~~~L~ada-~l~alvggr-I~D~~P~~a---~~PY----V~lG~~~~~d~~~~~~~g~~~~~ti~Vw 71 (145) T protein:vir:94 1 MWVSVERYLFNKVYNKLKSNL-IIQKQLDGR-VFDCVQKDA---VYPY----IVVGETNVTNKETTTSMVEDVGITLHVY 71 (145) T ss_pred CchhHHHHHHHHHHHHhhcCh-hHHHhhcCc-eecCCcCCC---CCCE----EEeCCceeeecCCCcccceEEEEEEEEE Confidence 99765 3334444433321 11 3 355544211 1355 444566666666665 466788999998 Q ss_pred EeCCCChHHHHHHHHHHHHhhhcCceeeeEEEe-c---CccccCceeCCCeE--EEEEEEEEEeecCC Q lcl|NC_021331. 71 FPPGYGTDRPRVLAKEIAQFFYDGKMLEHGYIY-E---GARVHKPLKSESGW--ILPIRFYVRIETKE 132 (132) Q Consensus 71 ~p~G~G~~~a~~lA~~iaa~F~~g~~l~~~~i~-~---~p~~~~~~~~~~~~--~ipVsi~yRadt~~ 132 (132) .. +.|..++++||+.|.+.....+.+..+++. - .-.... .+++..+ .+-+++.++-|||+ T Consensus 72 s~-~~g~~eak~ia~av~~aL~~~l~l~~~~lv~l~~~~~~~~r-d~dg~~~hgvl~fra~ve~~~~~ 137 (145) T protein:vir:94 72 SQ-ARNRDEASQIIQFLGFVLNNEIEIDYYSFIKSRIDTQEVIT-DIDQYTKHGIIRLVFKYRHNTLQ 137 (145) T ss_pred Ec-CCCHHHHHHHHHHHHHHhccccCCCCCeEEEeEEeeeeEee-cCCcceEEEEEEEEEEEEecccc Confidence 75 668999999999999887666666655321 1 112221 2334334 45566667777777 No 24 >protein:vir:97421 Length: 145 # NCBI annotation: ORF030 # Family: family:all:296 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240755;genbank:gi:66396436;genbank:GeneID:5133777 Probab=94.38 E-value=0.0035 Score=34.03 Aligned_cols=121 Identities=9% Similarity=0.014 Sum_probs=69.9 Q ss_pred CCHHH---HHHHHHHHhhcccCCC------cEEcCCCCCCCCCCCccEEEEEEccCCceeeecCCCc-eEEEEEEEEEEE Q lcl|NC_021331. 1 MHYEL---MLSARKALATEYETRF------MIAYENVEFTPPGDGSPWLKFDYAEVDTEYLSLDRKC-VSYIGMIQVGIV 70 (132) Q Consensus 1 M~~el---~~~al~~~a~a~~~~~------pva~pN~~F~Pp~~g~~yLr~~~~pa~t~~~~L~~~~-~~~~G~~qv~v~ 70 (132) |++-+ +.+|+-++..+.. .+ + -|.++.=.+ -.|| +.-+.....+.+.+| ....-.++|+|+ T Consensus 1 Ms~s~~~aLq~Ai~~~L~ada-~l~alvggr-I~D~~P~~a---~~PY----V~lG~~~~~d~~~~~~~g~~~~~ti~Vw 71 (145) T protein:vir:97 1 MWVSVERYLFNKVYNKLKSNL-IIQKQLDGR-VFDCVQKDA---VYPY----IVVGETNVTNKETTTSMVEDVGITLHVY 71 (145) T ss_pred CchhHHHHHHHHHHHHhhcCh-hHHHhhcCc-eecCCcCCC---CCCE----EEeCCceeeecCCCcccceEEEEEEEEE Confidence 99765 3334444433321 11 3 355544211 1355 444566666666665 466788999998 Q ss_pred EeCCCChHHHHHHHHHHHHhhhcCceeeeEEEe-c---CccccCceeCCCeE--EEEEEEEEEeecCC Q lcl|NC_021331. 71 FPPGYGTDRPRVLAKEIAQFFYDGKMLEHGYIY-E---GARVHKPLKSESGW--ILPIRFYVRIETKE 132 (132) Q Consensus 71 ~p~G~G~~~a~~lA~~iaa~F~~g~~l~~~~i~-~---~p~~~~~~~~~~~~--~ipVsi~yRadt~~ 132 (132) .. +.|..++++||+.|.+.....+.+..+++. - .-.... .+++..+ .+-+++.++-|||+ T Consensus 72 s~-~~g~~eak~ia~av~~aL~~~l~l~~~~lv~l~~~~~~~~r-d~dg~~~hgvl~fra~ve~~~~~ 137 (145) T protein:vir:97 72 SQ-ARNRDEASQIIQFLGFVLNNEIEIDYYSFIKSRIDTQEVIT-DIDQYTKHGIIRLVFKYRHNTLQ 137 (145) T ss_pred Ec-CCCHHHHHHHHHHHHHHhccccCCCCCeEEEeEEeeeeEee-cCCcceEEEEEEEEEEEEecccc Confidence 75 668999999999999887666666655321 1 112221 2334334 45566667777777 No 25 >protein:vir:96894 Length: 140 # NCBI annotation: ORF029 # Family: family:all:296 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240162;genbank:gi:66395835;genbank:GeneID:5133235 Probab=94.28 E-value=0.0043 Score=33.53 Aligned_cols=121 Identities=8% Similarity=-0.047 Sum_probs=68.7 Q ss_pred CC--HHH-HHHHHHHHhhccc------CCCcEEcCCCCCCCCCCCccEEEEEEccCCceeeecCCCc-eEEEEEEEEEEE Q lcl|NC_021331. 1 MH--YEL-MLSARKALATEYE------TRFMIAYENVEFTPPGDGSPWLKFDYAEVDTEYLSLDRKC-VSYIGMIQVGIV 70 (132) Q Consensus 1 M~--~el-~~~al~~~a~a~~------~~~pva~pN~~F~Pp~~g~~yLr~~~~pa~t~~~~L~~~~-~~~~G~~qv~v~ 70 (132) |+ +++ +.+|+-++..+.. ++ +| |.++. . +.+| .++.-+..+..+.+.+| ....-.++|+|+ T Consensus 1 Msms~~~aLq~Ai~a~L~ada~l~alvg~-~V-yD~~P----~-~~~~--Pyv~lG~~~~~~~~~~~~~g~~~~~~i~Vw 71 (140) T protein:vir:96 1 MWVSVEPELTVQIYKRLKASPIINKFVGD-RV-FDVVQ----E-DAVY--PYIVVGESNVTNNESSTMMRETVGIVIHVY 71 (140) T ss_pred CCccHHHHHHHHHHHHhhcChhHHHhcCC-cc-ccCCc----c-CCCC--CEEEecCceeeecCCCcccceEEEEEEEEE Confidence 55 443 3444444433321 11 33 55543 2 2332 24445666677777665 566788999988 Q ss_pred EeCCCChHHHHHHHHHHHHhhhcCceeeeEEEe----cCccccCceeCCCeE--EEEEEEEEEeecCC Q lcl|NC_021331. 71 FPPGYGTDRPRVLAKEIAQFFYDGKMLEHGYIY----EGARVHKPLKSESGW--ILPIRFYVRIETKE 132 (132) Q Consensus 71 ~p~G~G~~~a~~lA~~iaa~F~~g~~l~~~~i~----~~p~~~~~~~~~~~~--~ipVsi~yRadt~~ 132 (132) . .+.|..++++||+.|.+.....+.++.+++. ..-.... .+|+..+ .+-+++.+|..+|. T Consensus 72 s-~~~g~~ea~~ia~av~~AL~~~l~l~~~~lv~l~~~~~~~~r-d~dg~~~hgvl~~r~~v~~~~~~ 137 (140) T protein:vir:96 72 S-QFATQYEAKQIISAIGYVLNRPIDIENYEFQFSRIDSQSVFP-DIDRFTKHGTIRLLFKYRHIKKG 137 (140) T ss_pred E-cCCCHHHHHHHHHHHHHHhCCCccCCCCeEEEEEEeeeEEEe-cCCCceEEEEEEEEEEEEeeccc Confidence 6 4668899999999999877655666544321 1111221 1334334 46666677777776 No 26 >protein:vir:1244 Length: 145 # NCBI annotation: similar to phage Spp1 gp17 # Family: family:all:296 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510943;genbank:gi:17426277;genbank:GeneID:927402 Probab=94.07 E-value=0.0035 Score=34.04 Aligned_cols=123 Identities=8% Similarity=0.024 Sum_probs=70.2 Q ss_pred CCHHH---HHHHHHHHhhccc---C--CCcEEcCCCCCCCCCCCccEEEEEEccCCceeeecCCCc-eEEEEEEEEEEEE Q lcl|NC_021331. 1 MHYEL---MLSARKALATEYE---T--RFMIAYENVEFTPPGDGSPWLKFDYAEVDTEYLSLDRKC-VSYIGMIQVGIVF 71 (132) Q Consensus 1 M~~el---~~~al~~~a~a~~---~--~~pva~pN~~F~Pp~~g~~yLr~~~~pa~t~~~~L~~~~-~~~~G~~qv~v~~ 71 (132) |++.. +.+++-++..+.. . +-+ .|++++ +++ .-|| +.-+.+...+.+.++ ....-.++|+|+. T Consensus 1 M~~s~~~aLq~ai~~~L~ad~~l~~lvg~~-vyD~~P-~~~--~~Py----V~lG~~~~~~~~t~~~~~~~~~lti~Vws 72 (145) T protein:vir:12 1 MWVSVERYLFNKVYNKLKSNPIIQKQLGGR-VFDCVQ-KDA--VYPY----IVVGETNVTNKETTTSMVEDVGITLHVYS 72 (145) T ss_pred CcccHHHHHHHHHHHHhhcChhHHHhcCcc-cccCCc-cCC--CCCE----EEeccceeeecCCCcccceEEEEEEEEEE Confidence 87543 3334444332211 0 123 355544 211 1355 445667777777665 4566778999887 Q ss_pred eCCCChHHHHHHHHHHHHhhhcCceeeeEEE-ecCccccCce--eCCCeE--EEEEEEEEEeecCC Q lcl|NC_021331. 72 PPGYGTDRPRVLAKEIAQFFYDGKMLEHGYI-YEGARVHKPL--KSESGW--ILPIRFYVRIETKE 132 (132) Q Consensus 72 p~G~G~~~a~~lA~~iaa~F~~g~~l~~~~i-~~~p~~~~~~--~~~~~~--~ipVsi~yRadt~~ 132 (132) . ..|..++.+||+.|.+.....+.+...++ .-..+....+ +++..+ .+.+++.+|-|||+ T Consensus 73 ~-~~gr~ea~~ia~ai~~aL~~~l~l~~~~lv~l~~~~~~~~rd~d~~~~hgvl~~ra~i~~~~~~ 137 (145) T protein:vir:12 73 Q-ARNRDEASQIIQFLGFVLNNEIEIDYYSFIKSRIDTQEVITDIDQYTKHGIIRLVFKYRHNTLQ 137 (145) T ss_pred c-CccHHHHHHHHHHHHHHhccccCCCCceEEEEEEeeEEEEecCCCceEEEEEEEEEEEEeCCcc Confidence 5 55899999999999987655555544322 1111111112 233233 57888889999999 No 27 >protein:vir:80371 Length: 115 # NCBI annotation: gp11 # Family: family:all:2712 # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111090;genbank:gi:134288622;genbank:GeneID:4960618 Probab=91.93 E-value=0.0029 Score=34.49 Aligned_cols=113 Identities=12% Similarity=0.027 Sum_probs=78.5 Q ss_pred CCHHHHHHHHHHHhhcccCCCcEEcCCCCCCCCCC-CccEEEEEEccCCceeeecCCCceEEEEEEEEEEEEeCCCChHH Q lcl|NC_021331. 1 MHYELMLSARKALATEYETRFMIAYENVEFTPPGD-GSPWLKFDYAEVDTEYLSLDRKCVSYIGMIQVGIVFPPGYGTDR 79 (132) Q Consensus 1 M~~el~~~al~~~a~a~~~~~pva~pN~~F~Pp~~-g~~yLr~~~~pa~t~~~~L~~~~~~~~G~~qv~v~~p~G~G~~~ 79 (132) |+--.++.||-...-+. .|.++. |++ -.||+-+..+. ...-..|.|......|.|||+++.+. -.+ T Consensus 1 ~~~~vir~al~~i~~~~------~~~~vA---p~~~~~pyivy~rvs-ga~e~~L~G~ag~~~~~~QID~yA~T---~~e 67 (115) T protein:vir:80 1 MSVIVVRDALQGIGGAK------GYLGVA---PEKAPARYFVVTRVH-GALDMALAGPTGGRSGSYQIDCYAPT---FTD 67 (115) T ss_pred Ceeeeeechhhhccccc------cceeec---cccCcCCeEEEeecC-CCccccccCCCCCceeEEEEeeecCC---HHH Confidence 88888888887764432 333443 221 24787655544 44556788877777999999999765 678 Q ss_pred HHHHHHHHHHh-hhcCceeeeEEEecCccccCceeCCCeEEEEEEEEE Q lcl|NC_021331. 80 PRVLAKEIAQF-FYDGKMLEHGYIYEGARVHKPLKSESGWILPIRFYV 126 (132) Q Consensus 80 a~~lA~~iaa~-F~~g~~l~~~~i~~~p~~~~~~~~~~~~~ipVsi~y 126 (132) +++||+++.+. |..-..+..|-+++-|..-.+...--+..+-|+|-| T Consensus 68 a~~La~~v~d~~~~~~~~~~vg~l~e~pd~Ye~DT~l~Rvs~dv~i~f 115 (115) T protein:vir:80 68 ADRLADLAVDRAMSVQDRFSVGGVDELPDDYSADTGLFRVSLELSVEF 115 (115) T ss_pred HHHHHHHHHHhhhCCccccceecccCCCcccccccceEEEEEEEEEeC Confidence 99999999983 322223456667777777777777778889999988 No 28 >protein:vir:4348 Length: 121 # NCBI annotation: Orf15 # Family: family:all:896 # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061511;genbank:gi:9635607;genbank:GeneID:1262874 Probab=91.17 E-value=0.017 Score=30.27 Aligned_cols=116 Identities=14% Similarity=0.037 Sum_probs=68.8 Q ss_pred CCHHHHHHHHHHHhh-cccCC-CcEEcC-CCCCCCCCC-CccEEEEEEccCCceeeecCCCceEEEEEEEEEEEEeCCCC Q lcl|NC_021331. 1 MHYELMLSARKALAT-EYETR-FMIAYE-NVEFTPPGD-GSPWLKFDYAEVDTEYLSLDRKCVSYIGMIQVGIVFPPGYG 76 (132) Q Consensus 1 M~~el~~~al~~~a~-a~~~~-~pva~p-N~~F~Pp~~-g~~yLr~~~~pa~t~~~~L~~~~~~~~G~~qv~v~~p~G~G 76 (132) |.+.+....++.-+. ++-+. -.-.|| ++. |.+ -.||+-.+.+-+.... .|+|.+-...+.+||+|+.. - T Consensus 1 m~~~i~~~l~~d~~v~allg~~~~Rvyp~~~a---P~~~~~Pyiv~q~vsg~p~~-~l~g~~~~~~~~vQIDvyA~---t 73 (121) T protein:vir:43 1 MYPPIFKVCSSSPAVTAILGASPLRMYQFGLA---PQLVVKPYATWQTISGSPEN-YLWGRPDADGFTIQVDIFSA---T 73 (121) T ss_pred CChHHHHHHhhChhhhhhhcCCCceeeccCCC---CCCCcCCeEEEEEecCcccc-eecCCCCcceeEEEEEeeeC---C Confidence 999888777664433 33222 224555 553 221 2478877777665544 48887777789999999954 4 Q ss_pred hHHHHHHHHHHHHhhhcCceeeeEEEecCccccCce-eCCCeEEEEEEEEEEeec Q lcl|NC_021331. 77 TDRPRVLAKEIAQFFYDGKMLEHGYIYEGARVHKPL-KSESGWILPIRFYVRIET 130 (132) Q Consensus 77 ~~~a~~lA~~iaa~F~~g~~l~~~~i~~~p~~~~~~-~~~~~~~ipVsi~yRadt 130 (132) ..+|++++++|.+.... . .+.+.... ..+ ++..-|.+-+-|+|--.- T Consensus 74 ~~~A~~l~~av~~Al~~-~---~~~~~~~~---~~ye~dT~lyR~s~Dv~w~~~r 121 (121) T protein:vir:43 74 AAEARDAAKAIRDAIEL-S---AYVVRWGG---ESVDPDTKTYRVSFDVDWIVQR 121 (121) T ss_pred HHHHHHHHHHHHHHhhh-c---CCcccCCC---CCCcccccceeeeeEEEEeecC Confidence 57889999999887742 1 11111111 112 333445555556654443 No 29 >protein:vir:1438 Length: 115 # NCBI annotation: hypothetical protein # Family: family:all:2712 # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536367;genbank:gi:17975172;genbank:GeneID:929144 Probab=87.63 E-value=0.014 Score=30.77 Aligned_cols=112 Identities=13% Similarity=0.040 Sum_probs=65.6 Q ss_pred CCHHHHHHHHHHHhhcccCCCcEEcCCCCCCCCCC-CccEEEEEEccCCceeeecCCCceEEEEEEEEEEEEeCCCChHH Q lcl|NC_021331. 1 MHYELMLSARKALATEYETRFMIAYENVEFTPPGD-GSPWLKFDYAEVDTEYLSLDRKCVSYIGMIQVGIVFPPGYGTDR 79 (132) Q Consensus 1 M~~el~~~al~~~a~a~~~~~pva~pN~~F~Pp~~-g~~yLr~~~~pa~t~~~~L~~~~~~~~G~~qv~v~~p~G~G~~~ 79 (132) |+.=++.++|...+ .-+ .||++. |.+ ..||+-...+-+.. ...|+|.+-...+.+||+|+.. ...+ T Consensus 1 ~~~~~i~~aL~~l~-----~~R-Vyp~~a---P~~~~~Pyiv~q~vsg~p-~~~L~G~~~~~~~~vQIDvyA~---t~~~ 67 (115) T protein:vir:14 1 MSVIVIRDALQGIG-----GAK-GYLGVA---PAKAPAPYFVVTRVHGAL-DMALAGLTGGRSGSYQIDCYAP---TFTD 67 (115) T ss_pred CeeEeeehhhcccc-----ccc-cccccC---CCCCCCCEEEEEeecCcc-cccccCCCCCcceEEEEEEeeC---CHHH Confidence 77666666665443 223 345553 222 24787766666544 5588988877799999999975 4567 Q ss_pred HHHHHHHHHHhhhcCce--eeeEEEecCccccCceeCCCeEEEEEEEEE Q lcl|NC_021331. 80 PRVLAKEIAQFFYDGKM--LEHGYIYEGARVHKPLKSESGWILPIRFYV 126 (132) Q Consensus 80 a~~lA~~iaa~F~~g~~--l~~~~i~~~p~~~~~~~~~~~~~ipVsi~y 126 (132) |+++++++.+.-. +.. +..+-+++.+....+...-.+..+-++|=| T Consensus 68 A~~l~~~v~~~~~-~~~~~~~~~~~~~~~d~ye~dt~lyR~s~D~~vWf 115 (115) T protein:vir:14 68 ADRLADLAVDRAM-SVQDRFSVGGVDELPDDYSEDTGLFRISLELSVEF 115 (115) T ss_pred HHHHHHHHHHHHh-cCccceeeeeecCCCCCCcccccceeeEEEEEEeC Confidence 7888888765421 111 122234444443443333455566666667 No 30 >protein:vir:10368 Length: 118 # NCBI annotation: conserved phage protein # Family: family:all:3244 # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858960;genbank:gi:32128425;genbank:GeneID:2648389 Probab=87.38 E-value=0.039 Score=28.30 Aligned_cols=113 Identities=7% Similarity=0.021 Sum_probs=64.7 Q ss_pred CCHHHHHHHH-HHHhhcccCCCcEEcCCCCCCCCCC-C-ccEEEEEEccCCceeeecCCC-ceEEEEEEEEEEEEeCCCC Q lcl|NC_021331. 1 MHYELMLSAR-KALATEYETRFMIAYENVEFTPPGD-G-SPWLKFDYAEVDTEYLSLDRK-CVSYIGMIQVGIVFPPGYG 76 (132) Q Consensus 1 M~~el~~~al-~~~a~a~~~~~pva~pN~~F~Pp~~-g-~~yLr~~~~pa~t~~~~L~~~-~~~~~G~~qv~v~~p~G~G 76 (132) |+.|-...++ .... +-+| ||++. |.+ - .||+-...+-+.. ...|+|. +......+||+|+.. - T Consensus 1 Ms~e~~l~a~L~~~~-----~~RV-yp~~a---P~~~~~~Pyiv~q~vsg~p-~~~l~G~~~~~~~~rvQIdvyA~---t 67 (118) T protein:vir:10 1 MSYGRVLKDLLDPVF-----SGRV-YADIP---PDSPPLDAYAIYQRVGGVP-VYWQEGGMPEKVNARVQIQIWSR---S 67 (118) T ss_pred CchHHHHHHHHhhhc-----CCcc-ccccC---CCCCCcCCEEEEEecCCcc-cccccCCCCccceeEEEEEEeeC---C Confidence 9977655554 3322 1233 44443 222 2 3788888877755 5558775 556678899999975 4 Q ss_pred hHHHHHHHHHHHHhhhcCceeeeEEEecCccccCceeCCCeEEEEEEEE-EEeec Q lcl|NC_021331. 77 TDRPRVLAKEIAQFFYDGKMLEHGYIYEGARVHKPLKSESGWILPIRFY-VRIET 130 (132) Q Consensus 77 ~~~a~~lA~~iaa~F~~g~~l~~~~i~~~p~~~~~~~~~~~~~ipVsi~-yRadt 130 (132) ..+|.+++++|+........+. -+..+.+..- ++..-|.+-+-|. |-+-| T Consensus 68 ~~~A~~l~~av~~al~~~~~~~--~~~~~~d~ye--~dt~l~r~~~Df~vw~~~~ 118 (118) T protein:vir:10 68 KQEAYLATVQVLRLVSEANDMQ--VLSQPIDDYV--REIKLYGSRVDISMWYNLT 118 (118) T ss_pred HHHHHHHHHHHHHHhhhcccce--eccCCCcccc--ccCCceEEEEEEEEeeecC Confidence 6788888888888775433221 1122222222 2334455554444 55556 No 31 >protein:vir:5979 Length: 134 # NCBI annotation: hypothetical protein # Family: family:all:296 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690679;genbank:geneid:6329147;genbank:gi:22855073;uniprot:O48448;genbank:GeneID:955319 Probab=86.66 E-value=0.044 Score=28.02 Aligned_cols=121 Identities=11% Similarity=-0.014 Sum_probs=66.0 Q ss_pred CCHHH----HHHHHHHHhhcccCCCc----EEcCCCCCCCCCCCccEEEEEEccCCceeeecCCCc-eEEEEEEEEEEEE Q lcl|NC_021331. 1 MHYEL----MLSARKALATEYETRFM----IAYENVEFTPPGDGSPWLKFDYAEVDTEYLSLDRKC-VSYIGMIQVGIVF 71 (132) Q Consensus 1 M~~el----~~~al~~~a~a~~~~~p----va~pN~~F~Pp~~g~~yLr~~~~pa~t~~~~L~~~~-~~~~G~~qv~v~~ 71 (132) |+|-. +.+|+-++..+.. .+. -.|.++.=.+ --|| +.-+.++..+.+.+| ....-.++|+|+. T Consensus 1 m~~~s~~~aLq~Ai~~~L~ad~-~l~alvg~I~D~~P~~~---~~PY----V~lG~~~~~d~~~~~~~g~~~~~ti~Vws 72 (134) T protein:vir:59 1 MTWKLASRALQKATVENLESYQ-PLMEMVNQVTESPGKDD---PYPY----VVIGDQSSTPFETKSSFGENITMDFHVWG 72 (134) T ss_pred CCccchhHHHHHHHHHHhhcCh-hHHHhhhhhhcCCCCCC---CCCE----EEeCCceeeecCCCcccceEEEEEEEEEE Confidence 98763 4555555544422 121 2455543111 1344 444566666676665 5667888999998 Q ss_pred eCCCChHHHHHHHHHHHHhhhc-CceeeeEEEe---cCccccCceeCCCeEEEEEEEEEEeecC Q lcl|NC_021331. 72 PPGYGTDRPRVLAKEIAQFFYD-GKMLEHGYIY---EGARVHKPLKSESGWILPIRFYVRIETK 131 (132) Q Consensus 72 p~G~G~~~a~~lA~~iaa~F~~-g~~l~~~~i~---~~p~~~~~~~~~~~~~ipVsi~yRadt~ 131 (132) .. |..++++||+.|.+...+ .+.|..+++. -.-+..--.+++..+..-++|..+-+.- T Consensus 73 ~~--g~~ea~~ia~av~~aL~~~~L~l~~~~lv~l~~~~~~~~rd~dg~~~hg~l~fra~ve~~ 134 (134) T protein:vir:59 73 GT--TRAEAQDISSRVLEALTYKPLMFEGFTFVAKKLVLAQVITDTDGVTKHGIIKVRFTINNN 134 (134) T ss_pred CC--ChHHHHHHHHHHHHHhcCCCcccCCceEEEeEEeeeeEEecCCCceEEEEEEEEEEEecC Confidence 65 557899999999998743 3445444321 1111111224455555555555444443 No 32 >protein:vir:397 Length: 132 # NCBI annotation: gp12 # Family: family:all:911 # MgeID: mge:325 # MgeName: N15 # Cross-refs: genbank:acc:NP_046907;genbank:gi:9630477;genbank:GeneID:1261651 Probab=86.33 E-value=0.046 Score=27.90 Aligned_cols=123 Identities=7% Similarity=-0.028 Sum_probs=74.6 Q ss_pred CCHHHHHHHHHHHhhcccCCCcEEcCCC-CCCCCCCCccEEEEEEccCCceeeecCCCceEEEEEEEEEEEEeCCCChHH Q lcl|NC_021331. 1 MHYELMLSARKALATEYETRFMIAYENV-EFTPPGDGSPWLKFDYAEVDTEYLSLDRKCVSYIGMIQVGIVFPPGYGTDR 79 (132) Q Consensus 1 M~~el~~~al~~~a~a~~~~~pva~pN~-~F~Pp~~g~~yLr~~~~pa~t~~~~L~~~~~~~~G~~qv~v~~p~G~G~~~ 79 (132) |+...+++++.+....--+..-..+.|. .|.-+. -.|=..|++-.+.....+|.++ ..+..|.|.||-|+.++.++ T Consensus 1 ~~ht~IR~~Vid~L~~~l~~~~~ffdGrP~fiDe~-elPAVAV~l~d~~~~~~~ld~~--~w~A~LhI~iyLka~~~ds~ 77 (132) T protein:vir:39 1 MKHRDIRKVIIDALESAIGTDAIYFDGRPAVLEEG-DFPAVAVYLTDAEYTGEELDAD--TWQAILHIEVFLEAQVPDSE 77 (132) T ss_pred CchHHHHHHHHHHHHhhCCCceEEecCcceeeccc-cCcEEEEEeecCCCCcceecCC--eeEEEEEEEEEeecCCCHHH Confidence 9999999998877665444555556665 454433 3576677777766666666544 78899999999999999999 Q ss_pred HHHHHHHHHHhhhcCc---eeeeE--EE-ecCccccCceeCCCeEEE--EEEEEEEe Q lcl|NC_021331. 80 PRVLAKEIAQFFYDGK---MLEHG--YI-YEGARVHKPLKSESGWIL--PIRFYVRI 128 (132) Q Consensus 80 a~~lA~~iaa~F~~g~---~l~~~--~i-~~~p~~~~~~~~~~~~~i--pVsi~yRa 128 (132) ..++|.++ .|+... .|..- .+ ...=+=.+-.+..+|... --+|+|.. T Consensus 78 LD~~aE~~--i~p~i~~~~~l~~l~~~~~~~gy~Y~rD~~~atW~sadL~y~ItY~~ 132 (132) T protein:vir:39 78 LDDWMETR--VYPVLAEVPGLESLITTMVQQGYDYQRDDDMALWSSADLKYSITYDM 132 (132) T ss_pred HHHHHHHH--hHhhhcccchhhhHhhhhhhcCCCcccccccceEEEEEEEEEEEEeC Confidence 99999987 332211 11110 01 000111111122345543 33455666 No 33 >protein:vir:100116 Length: 115 # NCBI annotation: gp10 # Family: family:all:2712 # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945040;genbank:gi:38707900;genbank:GeneID:2744163 Probab=86.25 E-value=0.019 Score=30.07 Aligned_cols=112 Identities=14% Similarity=0.053 Sum_probs=64.2 Q ss_pred CCHHHHHHHHHHHhhcccCCCcEEcCCCCCCCCCCCccEEEEEEccCCceeeecCCCceEEEEEEEEEEEEeCCCChHHH Q lcl|NC_021331. 1 MHYELMLSARKALATEYETRFMIAYENVEFTPPGDGSPWLKFDYAEVDTEYLSLDRKCVSYIGMIQVGIVFPPGYGTDRP 80 (132) Q Consensus 1 M~~el~~~al~~~a~a~~~~~pva~pN~~F~Pp~~g~~yLr~~~~pa~t~~~~L~~~~~~~~G~~qv~v~~p~G~G~~~a 80 (132) |+.=++.++|.... .- -.||++. |-....||+-...+-+.. ...|+|.+-...+.+||+|+... ..+| T Consensus 1 ~~~~~i~~aL~~l~-----~~-RVyp~~a--P~~~~~Pyiv~q~vsg~p-~~~L~G~~~~~~~~vQIDvyA~t---~~~A 68 (115) T protein:vir:10 1 MSVIVIRDALQGIG-----GA-KGYLGVA--PEKAPAPYFVVTRVHGAL-DMALAGLTGGRSGSYQIDCYAPT---FTDA 68 (115) T ss_pred CeeEEeehhhcccC-----Cc-eeecccC--CCCCCCCEEEEEeecCcc-ccccCCCCCCcceEEEEEEeeCC---HHHH Confidence 65555555544432 22 3446664 111235787777666644 55889888777999999999754 5677 Q ss_pred HHHHHHHHHh---hhcCceeeeEEEecCccccCceeCCCeEEEEEEEEE Q lcl|NC_021331. 81 RVLAKEIAQF---FYDGKMLEHGYIYEGARVHKPLKSESGWILPIRFYV 126 (132) Q Consensus 81 ~~lA~~iaa~---F~~g~~l~~~~i~~~p~~~~~~~~~~~~~ipVsi~y 126 (132) +++++++.+. ++.. +..+-+++.+....+...-.+..+-++|=| T Consensus 69 ~~l~~~v~~~~~~~~~~--~~~~~~~~~~d~ye~dt~lyR~s~D~~vWf 115 (115) T protein:vir:10 69 DRLADLAVDRAMSVQDR--FSVGGVDELPDDYSEDTGLFRISLELSVEF 115 (115) T ss_pred HHHHHHHHHHHhcCccc--eeEeeecCCCCCCcccccceeeEEEEEEeC Confidence 7778777643 2221 122234444443433333455566666667 No 34 >protein:vir:1892 Length: 121 # NCBI annotation: gp11 # Family: family:all:896 # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037672;genbank:gi:9634130;genbank:GeneID:1262490 Probab=86.16 E-value=0.048 Score=27.84 Aligned_cols=117 Identities=8% Similarity=0.005 Sum_probs=69.3 Q ss_pred CCHHHHHHHHHHHhh-cccC-CCcEEcC-CCCCCCCCCCccEEEEEEccCCceeeecCCCceEEEEEEEEEEEEeCCCCh Q lcl|NC_021331. 1 MHYELMLSARKALAT-EYET-RFMIAYE-NVEFTPPGDGSPWLKFDYAEVDTEYLSLDRKCVSYIGMIQVGIVFPPGYGT 77 (132) Q Consensus 1 M~~el~~~al~~~a~-a~~~-~~pva~p-N~~F~Pp~~g~~yLr~~~~pa~t~~~~L~~~~~~~~G~~qv~v~~p~G~G~ 77 (132) |++.+....++.-+. ++-+ .-+-.|| ++. |-..-.||+-.+.+-+ +....|+|.+-...+.+||+|+... . T Consensus 1 m~~~i~~~l~~d~~v~allg~~~~Rvyp~~~a--P~~~~~Pyiv~q~vsg-~p~~~l~G~~~~~~~~vQIDvyA~t---~ 74 (121) T protein:vir:18 1 MIAPIFSVCASSPEVTDLLGSNPVRIYPFGIQ--DDNVVYPYVVWQNITG-SPENYIAQRPDADFFTLQVDAYADT---V 74 (121) T ss_pred CchHHHHHHhcChhhhhhhcCCCceeeeccCC--CCcCcCCeEEEEEecC-cccceecCCCCcceeEEEEEeecCC---H Confidence 999998776554322 2222 1234555 543 2122247776665544 5566788877788899999999764 4 Q ss_pred HHHHHHHHHHHHhhhcCceeeeEEEe-cCccccCceeCCCeEEEEEEEEEEeec Q lcl|NC_021331. 78 DRPRVLAKEIAQFFYDGKMLEHGYIY-EGARVHKPLKSESGWILPIRFYVRIET 130 (132) Q Consensus 78 ~~a~~lA~~iaa~F~~g~~l~~~~i~-~~p~~~~~~~~~~~~~ipVsi~yRadt 130 (132) .+|.+++++|.+.... . +++. .... .=.++..-|.+-.-|.|--.- T Consensus 75 ~~A~~l~~avr~Ale~-~----~~~~~~~~~--~ye~dT~lyR~s~Dv~~~~~r 121 (121) T protein:vir:18 75 DEVIAVATALRDAIEP-H----AHITRWGGQ--ERDPETKRYRYSFDVDWIVTR 121 (121) T ss_pred HHHHHHHHHHHHHhhh-c----CcccCCCCC--CCcccccceeeeeEEEEeecC Confidence 5788999999987742 1 2221 1111 112344556666666665554 No 35 >protein:vir:81066 Length: 118 # NCBI annotation: p13 # Family: family:all:3244 # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285683;genbank:gi:148727191;genbank:GeneID:5247111 Probab=83.10 E-value=0.071 Score=26.87 Aligned_cols=113 Identities=7% Similarity=-0.002 Sum_probs=63.5 Q ss_pred CCHHHHHHH-HHHHhhcccCCCcEEcCCCCCCCCCC-C-ccEEEEEEccCCceeeecCCC-ceEEEEEEEEEEEEeCCCC Q lcl|NC_021331. 1 MHYELMLSA-RKALATEYETRFMIAYENVEFTPPGD-G-SPWLKFDYAEVDTEYLSLDRK-CVSYIGMIQVGIVFPPGYG 76 (132) Q Consensus 1 M~~el~~~a-l~~~a~a~~~~~pva~pN~~F~Pp~~-g-~~yLr~~~~pa~t~~~~L~~~-~~~~~G~~qv~v~~p~G~G 76 (132) |+.|-...+ |...+ +-+ .||++. |.+ - .||+-...+-+.. ...|+|. +......+||+|+.. . T Consensus 1 Ms~e~~l~a~L~~~~-----~~R-vyp~~a---P~~~~~~Pyiv~q~vsg~p-~~~l~G~~~~~~~~rvQIdvyA~---t 67 (118) T protein:vir:81 1 MSYGRVLKDLLDPVF-----SGR-VYADIP---PDSPPLDAYAIYQRVGGVP-VYWQEGGMPEKVNARVQIQIWSR---S 67 (118) T ss_pred CchHHHHHHHHHhhc-----CCc-cccccC---CCCCccCceEEEEecCCcc-cccccCCCCCccceeEEEEEeeC---C Confidence 996654444 43322 112 344333 222 2 3788888887755 4558775 445567899999975 4 Q ss_pred hHHHHHHHHHHHHhhhcCceeeeEEEecCccccCceeCCCeEEEEEEEE-EEeec Q lcl|NC_021331. 77 TDRPRVLAKEIAQFFYDGKMLEHGYIYEGARVHKPLKSESGWILPIRFY-VRIET 130 (132) Q Consensus 77 ~~~a~~lA~~iaa~F~~g~~l~~~~i~~~p~~~~~~~~~~~~~ipVsi~-yRadt 130 (132) ..+|++++++|++.......+ ..+..+....- ++..-|.+-+-|. |-..| T Consensus 68 ~~~A~~l~~av~~al~~~~~~--~~~~~~~d~ye--~dt~l~r~~~Df~iw~~~~ 118 (118) T protein:vir:81 68 KQEAYLATVQVLRLVSEAPDM--QVLSQPIDDYV--REIKLYGSRVDVSMWYPIT 118 (118) T ss_pred HHHHHHHHHHHHHHhhhccce--eeccCCccccc--cccCceeEEEEEEEEecCC Confidence 678889999998877543222 11122222111 3345555555555 44555 No 36 >protein:vir:3428 Length: 131 # NCBI annotation: tail component # Family: family:all:911 # MgeID: mge:70 # MgeName: lambda # Cross-refs: genbank:acc:NP_040591;genbank:gi:9626255;genbank:GeneID:2703486 Probab=80.93 E-value=0.09 Score=26.31 Aligned_cols=124 Identities=9% Similarity=0.023 Sum_probs=73.5 Q ss_pred CCHHHHHHHHHHHhhcccCCCcEEcCCC-CCCCCCCCccEEEEEEccCCceeeecCCCceEEEEEEEEEEEEeCCCChHH Q lcl|NC_021331. 1 MHYELMLSARKALATEYETRFMIAYENV-EFTPPGDGSPWLKFDYAEVDTEYLSLDRKCVSYIGMIQVGIVFPPGYGTDR 79 (132) Q Consensus 1 M~~el~~~al~~~a~a~~~~~pva~pN~-~F~Pp~~g~~yLr~~~~pa~t~~~~L~~~~~~~~G~~qv~v~~p~G~G~~~ 79 (132) |+...+++++.+....--+.+. .+.|. .|--+. -.|=..|++-.+.....+|.++ ..+..|.|.||-|+.++.++ T Consensus 1 ~~ht~IR~~Vid~L~~~l~~v~-~fdG~P~fide~-ElPAVAV~l~d~~~~~~~ld~~--~w~A~LhI~iyLka~~~ds~ 76 (131) T protein:vir:34 1 MKHTELRAAVLDALEKHDTGAT-FFDGRPAVFDEA-DFPAVAVYLTGAEYTGEELDSD--TWQAELHIEVFLPAQVPDSE 76 (131) T ss_pred CchHHHHHHHHHHHhccCCceE-EecCCceeeccc-cCcEEEEEeecCCCCcceecCC--eeEEEEEEEEEeecCCCHHH Confidence 9999999998887654334443 55554 343322 3466677776666666666544 78899999999999999999 Q ss_pred HHHHHHH-HHHhhhcCceeee--EEE-ecCccccCceeCCCeEEEE--EEEEEEe Q lcl|NC_021331. 80 PRVLAKE-IAQFFYDGKMLEH--GYI-YEGARVHKPLKSESGWILP--IRFYVRI 128 (132) Q Consensus 80 a~~lA~~-iaa~F~~g~~l~~--~~i-~~~p~~~~~~~~~~~~~ip--Vsi~yRa 128 (132) ..++|.+ |..-.+....|.. +.+ ...=+=.+-.+..+|...- -+|+|.. T Consensus 77 LD~~~E~~i~~v~~~~~~l~~l~~~~~~~gy~Y~rD~e~~tW~sadL~y~ItY~~ 131 (131) T protein:vir:34 77 LDAWMESRIYPVMSDIPALSDLITSMVASGYDYRRDDDAGLWSSADLTYVITYEM 131 (131) T ss_pred HHHHHHHHhHHHhhcchhhhhHhhhhhhccCCcccccccceEEEEEEEEEEEEeC Confidence 9999998 4444443222220 001 0001111111223455433 3455655 No 37 >protein:vir:96800 Length: 127 # NCBI annotation: hypothetical protein # Family: family:all:32155 # MgeID: mge:1629 # MgeName: phiHSIC # Cross-refs: genbank:acc:YP_224254;genbank:gi:62362389;genbank:GeneID:3345739 Probab=79.99 E-value=0.05 Score=27.72 Aligned_cols=124 Identities=15% Similarity=0.147 Sum_probs=87.3 Q ss_pred CCHHHHHHHHHHHhhcccCCCcEEcCCCCCCCCCCCccEEEEEEccCCceeeecCCCceEEEEEEEEEEEEeCCCChHHH Q lcl|NC_021331. 1 MHYELMLSARKALATEYETRFMIAYENVEFTPPGDGSPWLKFDYAEVDTEYLSLDRKCVSYIGMIQVGIVFPPGYGTDRP 80 (132) Q Consensus 1 M~~el~~~al~~~a~a~~~~~pva~pN~~F~Pp~~g~~yLr~~~~pa~t~~~~L~~~~~~~~G~~qv~v~~p~G~G~~~a 80 (132) |++--..+|...-..+. -.+|||-..+ |.+| .-+|+-+-.+|..-..|....+..+|-|-|.|-...|+..-+- T Consensus 1 mtfldavkafeddlkak-vnipvanksi----ptdg-vsmrvalnnadadglflnsgarvmtgqfnveisaelgtnkyam 74 (127) T protein:vir:96 1 MTFLDAVKAFEDDLKAK-VNIPVANKSI----PTDG-VSMRVALNNADADGLFLNSGARVMTGQFNVEISAELGTNKYAM 74 (127) T ss_pred Cchhhhhhhhhhcccee-eecccccccc----CcCc-eEEEEEeccCCcceeEeecCceeeeeeeeeEEeeccCCceeee Confidence 87654444444333332 2566664443 3665 7789999999999999998899999999999999999887776 Q ss_pred HHHHHHHHHhhhcCceeee--EEE-ecCccccCceeCCCeEEEEEEEEEEeec Q lcl|NC_021331. 81 RVLAKEIAQFFYDGKMLEH--GYI-YEGARVHKPLKSESGWILPIRFYVRIET 130 (132) Q Consensus 81 ~~lA~~iaa~F~~g~~l~~--~~i-~~~p~~~~~~~~~~~~~ipVsi~yRadt 130 (132) ..-|.++-+-+.+|-+... -++ .--....-+++.+.+-.|+|-|.|+.-. T Consensus 75 maeankvlavyergysvpvldrrvlilqanqstpypteahqkinviidfqitk 127 (127) T protein:vir:96 75 MAEANKVLAVYERGYSVPVLDRRVLILQANQSTPYPTEAHQKINVIIDFQITK 127 (127) T ss_pred eeccceeEEeeecCcccceecceEEEEEcCCCCCCcccccceeeEEEEEEEcC Confidence 6667777788877766432 121 1123445567778888899999888754 No 38 >protein:vir:97070 Length: 118 # NCBI annotation: hypothetical protein # Family: family:all:3244 # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453569;genbank:gi:84662604;genbank:GeneID:5142485 Probab=76.68 E-value=0.13 Score=25.39 Aligned_cols=115 Identities=8% Similarity=0.006 Sum_probs=63.7 Q ss_pred CCHHHHHHHHHHHhhcccCCCcEEcCCCCCCCCCC-C-ccEEEEEEccCCceeeecCCC-ceEEEEEEEEEEEEeCCCCh Q lcl|NC_021331. 1 MHYELMLSARKALATEYETRFMIAYENVEFTPPGD-G-SPWLKFDYAEVDTEYLSLDRK-CVSYIGMIQVGIVFPPGYGT 77 (132) Q Consensus 1 M~~el~~~al~~~a~a~~~~~pva~pN~~F~Pp~~-g-~~yLr~~~~pa~t~~~~L~~~-~~~~~G~~qv~v~~p~G~G~ 77 (132) |+.|-...++.+ +.. +-+ .||++. |.+ . .||+-...+-+.+. ..|.|. +..+...+||+|+.. .. T Consensus 1 M~~e~~l~a~L~---~~~-~~R-vyp~~a---P~~~~~~Pyiv~q~vsg~p~-~~ldG~~~~~~~~rvQIdvyA~---t~ 68 (118) T protein:vir:97 1 MSYGRMLKDLLD---PVF-SGR-VYADIP---PDSPPLDAYAIYQRVGGVPV-YWKEGGMPDKVNARVQVQIWSR---SK 68 (118) T ss_pred CchHHHHHHHHh---hhc-CCc-cccccC---CCCCCcCCEEEEEecCCccc-ccccCCCCCccceeEEEEEeeC---CH Confidence 998876665432 111 223 344444 332 2 37888887777555 558775 556677899999975 46 Q ss_pred HHHHHHHHHHHHhhhcCceeeeEEEecCccccCceeCCCeEEEEEEEEEEeecC Q lcl|NC_021331. 78 DRPRVLAKEIAQFFYDGKMLEHGYIYEGARVHKPLKSESGWILPIRFYVRIETK 131 (132) Q Consensus 78 ~~a~~lA~~iaa~F~~g~~l~~~~i~~~p~~~~~~~~~~~~~ipVsi~yRadt~ 131 (132) .+|.+++++|.........+. -+..+.+..- ++...|.+-+-|.-=.+|- T Consensus 69 ~~A~~l~~av~~al~~~~~~~--~~~~~~~~ye--~dt~lyr~~~Df~iw~~~~ 118 (118) T protein:vir:97 69 QEAYLATVQVLRIVSEANDMQ--VLSQPIDDYV--RELKLYGSRVDISMWYNLT 118 (118) T ss_pred HHHHHHHHHHHHHhhcccccc--cccCCccccc--ccCCceEEEEEEEEEeecC Confidence 788888888888774432221 0111221111 3344455544444333333 No 39 >protein:vir:93602 Length: 114 # NCBI annotation: putative structural component # Family: family:all:896 # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449300;genbank:gi:157166048;uniprot:Q6H9U1;genbank:GeneID:5580424 Probab=70.93 E-value=0.2 Score=24.39 Aligned_cols=114 Identities=10% Similarity=0.062 Sum_probs=59.5 Q ss_pred CCHHHHHHHHHHHhhcccCCCcEEcCCCCCCCCCCCccEEEEEEccCCceeeecCCCceEEEEEEEEEEEEeCCCChHHH Q lcl|NC_021331. 1 MHYELMLSARKALATEYETRFMIAYENVEFTPPGDGSPWLKFDYAEVDTEYLSLDRKCVSYIGMIQVGIVFPPGYGTDRP 80 (132) Q Consensus 1 M~~el~~~al~~~a~a~~~~~pva~pN~~F~Pp~~g~~yLr~~~~pa~t~~~~L~~~~~~~~G~~qv~v~~p~G~G~~~a 80 (132) |+.+.+.++|...+... -+|-.-|-..-.+ ....||+-...+-+.+ ...|+|.. .-.-.+||+|+.. ...+| T Consensus 1 M~e~~i~~lL~~~~~gR--vyp~~~P~~~~~~-~~~~Pyiv~q~vsg~p-~~~l~gp~-~~~~~vQIDvyA~---t~~~A 72 (114) T protein:vir:93 1 MTEADLYPHLAHLAGGQ--VYPYVVPLLDGRP-SVALPWVVFSLISSVS-ADVMGGQA-ESSVSVQIDVYAG---TVTQA 72 (114) T ss_pred CchHHHHHHHHhhcCcc--cccccCCcccCcC-CccCceEEEEeccCcc-cccccCcc-ccceEEEEEeeeC---CHHHH Confidence 99888888888765442 1222222221111 1224777766665544 34466633 3457999999975 56788 Q ss_pred HHHHHHHHHhhhcCceeeeEEEecCccccCceeCCCeEEEEEEEEEEe Q lcl|NC_021331. 81 RVLAKEIAQFFYDGKMLEHGYIYEGARVHKPLKSESGWILPIRFYVRI 128 (132) Q Consensus 81 ~~lA~~iaa~F~~g~~l~~~~i~~~p~~~~~~~~~~~~~ipVsi~yRa 128 (132) ++++++|+.......... +. ..+-.- ++..-|.+-+-|.+.. T Consensus 73 ~~l~~~v~~Al~~~~~~~---~~-~~~~ye--~dt~lyR~~~d~~v~~ 114 (114) T protein:vir:93 73 RQIRQDAREAIMLLAPGS---VS-EMQDYI--PENRCYRATLEFQVTV 114 (114) T ss_pred HHHHHHHHHHHhhcCcEe---ec-CCCccc--ccccceeeEEEEEEeC Confidence 999999988775322111 11 111111 2223333333333222 No 40 >protein:vir:79047 Length: 145 # NCBI annotation: hypothetical protein # Family: family:all:1535 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110730;genbank:gi:134287347;genbank:GeneID:4955221 Probab=68.25 E-value=0.24 Score=23.98 Aligned_cols=119 Identities=8% Similarity=0.051 Sum_probs=64.5 Q ss_pred CCHHHHHHHHHHHhhcccCCCcEEcCCCC--CCCCCCCccEEEEEEccCCceeeecCCCceEEEEEEEEEEEE-eCC-CC Q lcl|NC_021331. 1 MHYELMLSARKALATEYETRFMIAYENVE--FTPPGDGSPWLKFDYAEVDTEYLSLDRKCVSYIGMIQVGIVF-PPG-YG 76 (132) Q Consensus 1 M~~el~~~al~~~a~a~~~~~pva~pN~~--F~Pp~~g~~yLr~~~~pa~t~~~~L~~~~~~~~G~~qv~v~~-p~G-~G 76 (132) |+.|++.+--+.+...++..++|-=+.+. |.+|. =| +.+++.. ....-+. +|.=.+.++|.+ |.+ .. T Consensus 1 mi~dI~~aI~~~Lk~~Fp~~~~IY~e~i~Qgf~~Pc---FF--I~ll~~~--~~~~~~~--r~~r~~~~dI~Yfp~~~~~ 71 (145) T protein:vir:79 1 MLNNIIDGISVKLDKSFGEKYTIYSEDVEQGINEPC---FF--IVPLNPS--KTPYPSG--RELKKNSFDVHYFPRSEAK 71 (145) T ss_pred ChHHHHHHHHHHHHHhcCCceEEEecccccCccCCe---eE--EEEeccc--cccccCc--eEEEEEEEEEEEeecCCCC Confidence 99999888877777677666899999986 88763 12 2333322 2222222 233334444433 544 35 Q ss_pred hHHHHHHHHHHHHhhhcCceeeeEEEecCccccCceeC-CCeEEEEEEEEEEeecCC Q lcl|NC_021331. 77 TDRPRVLAKEIAQFFYDGKMLEHGYIYEGARVHKPLKS-ESGWILPIRFYVRIETKE 132 (132) Q Consensus 77 ~~~a~~lA~~iaa~F~~g~~l~~~~i~~~p~~~~~~~~-~~~~~ipVsi~yRadt~~ 132 (132) ..++.++|+++-+.|. ...+....+.. -.....+.| .-++.+ ++.|+..-.| T Consensus 72 ~~e~~ev~e~L~~~le-~i~v~~~~~~~-~~~~~eivDgvLhf~~--~~~~~~~k~~ 124 (145) T protein:vir:79 72 NFEINEIAEMLLEELE-YIEINGDLVRG-TNMNFEIIDNVLHFFV--DYNYFTIKSN 124 (145) T ss_pred chhHHHHHHHHHhhhc-ceeecCcEEee-ecceeEEeeceEEEEE--EEEEEEeeec Confidence 6689999999999994 34453333322 111222222 233333 3334333222 No 41 >protein:vir:195 Length: 115 # NCBI annotation: Gp11 # Family: family:all:896 # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037705;genbank:gi:9634170;genbank:GeneID:1262540 Probab=67.37 E-value=0.25 Score=23.85 Aligned_cols=113 Identities=12% Similarity=0.015 Sum_probs=61.2 Q ss_pred CCHHHHHHHHHHHhhcccCCCcEEcCCCCCCCCCCCccEEEEEEccCCceeeecCCCceEEEEEEEEEEEEeCCCChHHH Q lcl|NC_021331. 1 MHYELMLSARKALATEYETRFMIAYENVEFTPPGDGSPWLKFDYAEVDTEYLSLDRKCVSYIGMIQVGIVFPPGYGTDRP 80 (132) Q Consensus 1 M~~el~~~al~~~a~a~~~~~pva~pN~~F~Pp~~g~~yLr~~~~pa~t~~~~L~~~~~~~~G~~qv~v~~p~G~G~~~a 80 (132) |+.+.+-+.|...+..+ -+|-.-|-..-..|+...||+-...+-+.. ...|+|... -...+||+|+.+ ...+| T Consensus 1 M~e~~i~~lL~~l~~gR--vyp~~aP~~~~~~~~~~~Pyiv~q~vsg~p-~~~L~G~~~-~~~~vQIDvyA~---t~~~A 73 (115) T protein:vir:19 1 MNEDNIYALLSPLAEGR--VYPYVAPLGSDGKPSVSPPWIIFSIVDDVS-ADVLCGQAE-SRVSVQVDVYST---SIAES 73 (115) T ss_pred CchhHHHHHHhhhcCcc--cceeeccCCCCCCccccCCeEEEEeccCcc-cccccCCCc-cceEEEEEEeeC---ChHHH Confidence 99888888887665543 355555554333334445888766665533 344676433 456999998875 45678 Q ss_pred HHHHHHHHHhhhcCceeeeEEEecCccccCceeCCCeE--EEEEEEEE Q lcl|NC_021331. 81 RVLAKEIAQFFYDGKMLEHGYIYEGARVHKPLKSESGW--ILPIRFYV 126 (132) Q Consensus 81 ~~lA~~iaa~F~~g~~l~~~~i~~~p~~~~~~~~~~~~--~ipVsi~y 126 (132) ++++++|++........ ... ..+..- ++..-| .+-++|+= T Consensus 74 ~~l~~~i~~Al~~~~p~---~~~-~~~~ye--~dt~lyR~s~d~~V~~ 115 (115) T protein:vir:19 74 RSLRDLVLASLEPLTPT---EVV-KIPGYE--PDYRLYRATLDFKVTP 115 (115) T ss_pred HHHHHHHHHHhhhcCCE---Eec-CCCCcc--cchhceeeEEEEEecC Confidence 88888888776422111 111 111111 222222 33333322 No 42 >protein:vir:3874 Length: 114 # NCBI annotation: putative head-tail joining protein # Family: family:all:28620 # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680491;swissprot:trembl:p94215;genbank:gi:22296531;uniprot:P94215;genbank:GeneID:951676 Probab=64.87 E-value=0.28 Score=23.63 Aligned_cols=106 Identities=15% Similarity=0.088 Sum_probs=61.8 Q ss_pred CCHHHHHH----HHHHHhhcccCCCcEEcCCCCCCCCCCCccEEEEEEccCCceeeecCCCce-EEEEEEEEEEEEeCCC Q lcl|NC_021331. 1 MHYELMLS----ARKALATEYETRFMIAYENVEFTPPGDGSPWLKFDYAEVDTEYLSLDRKCV-SYIGMIQVGIVFPPGY 75 (132) Q Consensus 1 M~~el~~~----al~~~a~a~~~~~pva~pN~~F~Pp~~g~~yLr~~~~pa~t~~~~L~~~~~-~~~G~~qv~v~~p~G~ 75 (132) |-+|.-.+ +-...+.--=-+++..=|...|+| .+..||.|++.+|+|..... ++.+ ..---+||+ ||=.-. T Consensus 1 ~~PE~~vaDiLsad~~lv~~mYipift~tpdd~fik-~SsAPWiRiTpiPGDda~ya--DD~R~~EYPrVqVD-fWvr~e 76 (114) T protein:vir:38 1 MAPEKRVYDILSANLDIADKVYIGTPNFNNQTSATP-ESLAPWVRITYLPGDAADYA--DDSRILEYPKVQVD-FWVGIT 76 (114) T ss_pred CCchhhhhhhhccchhhhhheeccCCCCCCCCcccc-cccCCeeEeeecCCcccccc--ccceeeecCceeEE-EeeccC Confidence 88876333 211211111124666667778997 45689999999999976432 2322 222456777 455678 Q ss_pred ChHHHHHHHHHHHHhhh-cCcee--eeEEEecCccccC Q lcl|NC_021331. 76 GTDRPRVLAKEIAQFFY-DGKML--EHGYIYEGARVHK 110 (132) Q Consensus 76 G~~~a~~lA~~iaa~F~-~g~~l--~~~~i~~~p~~~~ 110 (132) |....+++-.+|=+... .|-+- ..-++.+-|+.+. T Consensus 77 ~~d~~e~iqe~IY~~Lha~gweRYY~nsY~D~~~~~~~ 114 (114) T protein:vir:38 77 DWDQQEKIETQIYQALHAADWERYYRNSYVDGIPQPFA 114 (114) T ss_pred ChhhHHHHHHHHHHHHHhcCcceeeeccccCCCCCCCC Confidence 88999988888766432 22221 1224455555555 No 43 >protein:vir:79571 Length: 137 # NCBI annotation: putative tail component # Family: family:all:911 # MgeID: mge:1871 # MgeName: cdtI # Cross-refs: genbank:acc:YP_001272522;genbank:gi:148609391;genbank:GeneID:5204407 Probab=47.43 E-value=0.7 Score=21.43 Aligned_cols=125 Identities=8% Similarity=0.007 Sum_probs=73.0 Q ss_pred CCHHHHHHHHHHHhhcccCCCcEEcCCC-CCCCCCCCccEEEEEEccCCceeeecCCCceEEEEEEEEEEEEeCCCChHH Q lcl|NC_021331. 1 MHYELMLSARKALATEYETRFMIAYENV-EFTPPGDGSPWLKFDYAEVDTEYLSLDRKCVSYIGMIQVGIVFPPGYGTDR 79 (132) Q Consensus 1 M~~el~~~al~~~a~a~~~~~pva~pN~-~F~Pp~~g~~yLr~~~~pa~t~~~~L~~~~~~~~G~~qv~v~~p~G~G~~~ 79 (132) -+...+++++.+....--+.....+.+. .|... .-.|=..|++--+.....+|..+ ..+..|.|.||-|+.++.++ T Consensus 6 ~iht~IR~~Vid~L~~~l~~~~~ffdGrP~fiDe-~ElPAVAV~l~da~~~~~~ld~~--~W~A~LhI~iyLka~~~ds~ 82 (137) T protein:vir:79 6 NRHTQIRQVVLARLREQCGDSATFFDGLPAFVDA-QELPAVSVWLSDAQYTGKMTDED--DWQAVLHIAVFIRAQAPDSE 82 (137) T ss_pred HHHHHHHHHHHHHHHhhcCCcEEEeCCccceech-hhCcEEEEEeecCCCCcceecCC--eeEEEEEEEEEeecCCCHHH Confidence 2456677777666554334555566666 56654 34576677776666666666554 58899999999999999999 Q ss_pred HHHHHHH-HHHhhhcCceeee--EEE-ecCccccCceeCCCeEEEEEE--EEEEe Q lcl|NC_021331. 80 PRVLAKE-IAQFFYDGKMLEH--GYI-YEGARVHKPLKSESGWILPIR--FYVRI 128 (132) Q Consensus 80 a~~lA~~-iaa~F~~g~~l~~--~~i-~~~p~~~~~~~~~~~~~ipVs--i~yRa 128 (132) ..++|.+ |..-.+....|.. +.+ ...=+=.+-.+..+|...-++ |.|.- T Consensus 83 LD~~~E~~I~~v~~~~~~l~~l~~~~~~~gY~Y~rD~e~~tW~sadL~y~ItYe~ 137 (137) T protein:vir:79 83 LDMWMESTIFPALNDVPALSGLIDTLIPLGFNYQRDNEMATWAMAEITYQITYTN 137 (137) T ss_pred HHHHHHHHHHHhhcchhhhhhHhhhhhcccCCcccccccceeEEEEEEEEEEEcC Confidence 9999997 5444443322221 111 011111111122356555444 44544 No 44 >protein:vir:80105 Length: 162 # NCBI annotation: gp13 # Family: family:all:2729 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468717;genbank:gi:157325297;genbank:GeneID:5601796 Probab=41.91 E-value=0.91 Score=20.82 Aligned_cols=123 Identities=12% Similarity=0.202 Sum_probs=74.5 Q ss_pred CCHHHHHHHHHHHhhcccCCCcEEcCCCCCCCCCCCccEEEEEE----ccCCceeeecCCCceEEEEEEEEEEEEeCCCC Q lcl|NC_021331. 1 MHYELMLSARKALATEYETRFMIAYENVEFTPPGDGSPWLKFDY----AEVDTEYLSLDRKCVSYIGMIQVGIVFPPGYG 76 (132) Q Consensus 1 M~~el~~~al~~~a~a~~~~~pva~pN~~F~Pp~~g~~yLr~~~----~pa~t~~~~L~~~~~~~~G~~qv~v~~p~G~G 76 (132) -.++++.+-+......+...+.+-..+-.-.-|+ -||....+ .|-+.. + .+...|.=.+|++|+.-.. T Consensus 8 ~~~~~lv~~ii~~i~~~~~gl~vI~~~~~g~~p~--yPF~TY~v~~pyi~~~~~---~-~~~e~~~~~isi~~~S~~~-- 79 (162) T protein:vir:80 8 YDYGKLVKTLINAVNELSGGLQLIESSSGGEQPE--YPFCQYTITSPYIAISPD---I-VEGEQFEIVISLTWRALSG-- 79 (162) T ss_pred ccHHHHHHHHHHHHHhhhcceeEEEccCCCCCCC--CCeEEEEEecCccccCCc---c-cCCcceEEEEEEEEEeCCH-- Confidence 5677777766544455556788888777766665 68888775 333322 2 1344666678999988665 Q ss_pred hHHHHHHHHHHHHhhhc--Cce-ee--eEE-E-e-cC---ccccCceeCCCeEEEEEEEEE-EeecCC Q lcl|NC_021331. 77 TDRPRVLAKEIAQFFYD--GKM-LE--HGY-I-Y-EG---ARVHKPLKSESGWILPIRFYV-RIETKE 132 (132) Q Consensus 77 ~~~a~~lA~~iaa~F~~--g~~-l~--~~~-i-~-~~---p~~~~~~~~~~~~~ipVsi~y-Radt~~ 132 (132) .+|.++|.++.++|.. +-. +. .|- + + +. =+....+.-+-+|-.=++|+| |-++++ T Consensus 80 -~eAl~la~~l~~~f~~~~~~~~~~~~~gIvvvdv~~~~~R~~~~~~~yerR~GFD~~~Rv~r~~e~~ 146 (162) T protein:vir:80 80 -HQALNLANITNKYFRSQKGRFFMQENGGIVVVSVQNSGLRDTFISIEYERSAGIDLRLRVVDSYSSE 146 (162) T ss_pred -HHHHHHHHHHHHHhhcCCceeeeeecCcEEEEecCCCccceeEeeeeeeeeecceEEEEEeeccccc Confidence 8999999999999953 111 11 121 1 1 11 223333444566666777775 344444 No 45 >protein:vir:105826 Length: 134 # NCBI annotation: gp10 # Family: family:all:2795 # MgeID: mge:1636 # MgeName: PMC # Cross-refs: genbank:acc:YP_655771;genbank:gi:109522094;genbank:GeneID:4157634 Probab=35.52 E-value=0.67 Score=21.55 Aligned_cols=113 Identities=10% Similarity=-0.059 Sum_probs=63.2 Q ss_pred CCHHH---HHHHHHHHhhcccCCCcEEcCCCCCCCCCCCccEEEEEEccCCceeeecCCCceEEEEEEEEEEEEeCCCCh Q lcl|NC_021331. 1 MHYEL---MLSARKALATEYETRFMIAYENVEFTPPGDGSPWLKFDYAEVDTEYLSLDRKCVSYIGMIQVGIVFPPGYGT 77 (132) Q Consensus 1 M~~el---~~~al~~~a~a~~~~~pva~pN~~F~Pp~~g~~yLr~~~~pa~t~~~~L~~~~~~~~G~~qv~v~~p~G~G~ 77 (132) |..+- +++-+.+|+... ..++=.- +|.+--|++-+.-+++.-... +-.-.+++||.+|. .|. T Consensus 1 m~~~saP~~e~~vv~WLsp~---~~va~~R----~~~~PLPf~~V~Rv~G~d~~e-----~~tD~avvsv~~fg---~~~ 65 (134) T protein:vir:10 1 MATDSAPSIHRVLVAWLSPL---GKVSTRR----LSGDPLPHRVVRRVDGRDVPE-----EGSDVAVVSVHTFA---ASD 65 (134) T ss_pred CCcccCCChheeeeeecccc---hhceecc----CCCCCCCeEEEEEeCCCCCcc-----cccccceEEEEEee---CCH Confidence 65443 444455554433 2233221 334446899998887654432 33446899999997 899 Q ss_pred HHHHHHHHHHHHhhh----cCceeeeEEEecCccccCceeCCCeEEEEEEEEEEeecCC Q lcl|NC_021331. 78 DRPRVLAKEIAQFFY----DGKMLEHGYIYEGARVHKPLKSESGWILPIRFYVRIETKE 132 (132) Q Consensus 78 ~~a~~lA~~iaa~F~----~g~~l~~~~i~~~p~~~~~~~~~~~~~ipVsi~yRadt~~ 132 (132) .+|..+|++...-.. +-.+. ..+.+... ..|-=..-..-|+.+.|--|+++ T Consensus 66 eaA~d~ad~vHrRM~kL~~~~~~~--~~~~gG~~--~~id~~~v~~~P~~~eY~dD~~~ 120 (134) T protein:vir:10 66 EAAENEAELTHQRMLELVVNPLTE--IPVGGGVV--ARIDYARVLMKPVLVEYDDDGHL 120 (134) T ss_pred HHhhHHHHHHHHHHHHHhcccccc--eecCCceE--EEeehhhhhccceeeeeCCCceE Confidence 999999999887542 22221 00000000 00111123356788888888777 No 46 >protein:vir:102609 Length: 134 # NCBI annotation: gp10 # Family: family:all:2795 # MgeID: mge:1661 # MgeName: Llij # Cross-refs: genbank:acc:YP_655006;genbank:gi:109392196;genbank:GeneID:4157231 Probab=35.52 E-value=0.67 Score=21.55 Aligned_cols=113 Identities=10% Similarity=-0.059 Sum_probs=63.2 Q ss_pred CCHHH---HHHHHHHHhhcccCCCcEEcCCCCCCCCCCCccEEEEEEccCCceeeecCCCceEEEEEEEEEEEEeCCCCh Q lcl|NC_021331. 1 MHYEL---MLSARKALATEYETRFMIAYENVEFTPPGDGSPWLKFDYAEVDTEYLSLDRKCVSYIGMIQVGIVFPPGYGT 77 (132) Q Consensus 1 M~~el---~~~al~~~a~a~~~~~pva~pN~~F~Pp~~g~~yLr~~~~pa~t~~~~L~~~~~~~~G~~qv~v~~p~G~G~ 77 (132) |..+- +++-+.+|+... ..++=.- +|.+--|++-+.-+++.-... +-.-.+++||.+|. .|. T Consensus 1 m~~~saP~~e~~vv~WLsp~---~~va~~R----~~~~PLPf~~V~Rv~G~d~~e-----~~tD~avvsv~~fg---~~~ 65 (134) T protein:vir:10 1 MATDSAPSIHRVLVAWLSPL---GKVSTRR----LSGDPLPHRVVRRVDGRDVPE-----EGSDVAVVSVHTFA---ASD 65 (134) T ss_pred CCcccCCChheeeeeecccc---hhceecc----CCCCCCCeEEEEEeCCCCCcc-----cccccceEEEEEee---CCH Confidence 65443 444455554433 2233221 334446899998887654432 33446899999997 899 Q ss_pred HHHHHHHHHHHHhhh----cCceeeeEEEecCccccCceeCCCeEEEEEEEEEEeecCC Q lcl|NC_021331. 78 DRPRVLAKEIAQFFY----DGKMLEHGYIYEGARVHKPLKSESGWILPIRFYVRIETKE 132 (132) Q Consensus 78 ~~a~~lA~~iaa~F~----~g~~l~~~~i~~~p~~~~~~~~~~~~~ipVsi~yRadt~~ 132 (132) .+|..+|++...-.. +-.+. ..+.+... ..|-=..-..-|+.+.|--|+++ T Consensus 66 eaA~d~ad~vHrRM~kL~~~~~~~--~~~~gG~~--~~id~~~v~~~P~~~eY~dD~~~ 120 (134) T protein:vir:10 66 EAAENEAELTHQRMLELVVNPLTE--IPVGGGVV--ARIDYARVLMKPVLVEYDDDGHL 120 (134) T ss_pred HHhhHHHHHHHHHHHHHhcccccc--eecCCceE--EEeehhhhhccceeeeeCCCceE Confidence 999999999887542 22221 00000000 00111123356788888888777 No 47 >protein:vir:7994 Length: 134 # NCBI annotation: gp10 # Family: family:all:2795 # MgeID: mge:151 # MgeName: Che8 # Cross-refs: genbank:acc:NP_817348;genbank:gi:29565776;genbank:GeneID:1259015 Probab=35.41 E-value=0.68 Score=21.50 Aligned_cols=113 Identities=10% Similarity=-0.074 Sum_probs=63.2 Q ss_pred CCHHH---HHHHHHHHhhcccCCCcEEcCCCCCCCCCCCccEEEEEEccCCceeeecCCCceEEEEEEEEEEEEeCCCCh Q lcl|NC_021331. 1 MHYEL---MLSARKALATEYETRFMIAYENVEFTPPGDGSPWLKFDYAEVDTEYLSLDRKCVSYIGMIQVGIVFPPGYGT 77 (132) Q Consensus 1 M~~el---~~~al~~~a~a~~~~~pva~pN~~F~Pp~~g~~yLr~~~~pa~t~~~~L~~~~~~~~G~~qv~v~~p~G~G~ 77 (132) |..+- +++-+.+|+... ..++=.- +|.+--|++-+.-+++.-... +-.-.+++||.+|. .|. T Consensus 1 m~~~saP~~e~~vv~WLsp~---~~va~~R----~~~~PLPf~~V~Rv~G~d~~e-----~~tD~avvsv~~fg---~~~ 65 (134) T protein:vir:79 1 MATDSAPSIHRVLVAWLSPL---GKVSTRR----LSGDPLPHRVVRRVDGRDVPE-----EGSDSAVVSVHTFA---ASD 65 (134) T ss_pred CCcccCCChheeeeeecccc---hhceecc----CCCCCCCeEEEEEeCCCCCcc-----ccccCceeEEEEee---CCH Confidence 65443 444455554433 2233221 334446899998887654433 33446899999997 899 Q ss_pred HHHHHHHHHHHHhhh----cCceeeeEEEecCccccCceeCCCeEEEEEEEEEEeecCC Q lcl|NC_021331. 78 DRPRVLAKEIAQFFY----DGKMLEHGYIYEGARVHKPLKSESGWILPIRFYVRIETKE 132 (132) Q Consensus 78 ~~a~~lA~~iaa~F~----~g~~l~~~~i~~~p~~~~~~~~~~~~~ipVsi~yRadt~~ 132 (132) .+|..+|++...-.. +-.+. ..+.+... ..|-=..-..-|+.+.|--|+++ T Consensus 66 eaA~d~ad~vHrRM~kL~~~~~~~--~~~~gG~~--~~id~~~vl~~P~~~eY~dD~~~ 120 (134) T protein:vir:79 66 EAAENEAELTHQRMLELVVNPLTE--IPVGGGVV--ARIDYARVLMKPVLVEYDDDGHL 120 (134) T ss_pred HHhhHHHHHHHHHHHHHhcccccc--eecCCceE--EEeehhhhhccceeeeeCCCceE Confidence 999999999887552 22211 00000000 00111123356788888888777 No 48 >protein:vir:98343 Length: 126 # NCBI annotation: hypothetical protein # Family: family:all:517 # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918935;genbank:gi:119443697;genbank:GeneID:4594505 Probab=27.77 E-value=1.8 Score=19.17 Aligned_cols=112 Identities=13% Similarity=0.040 Sum_probs=49.6 Q ss_pred CCH--HHHHHHHHHH--hhcc-cCCCcEEcCCCCCCCCCCCccEEEEEEccCCceeeecCCC-ceEEEEEEEEEEEEeCC Q lcl|NC_021331. 1 MHY--ELMLSARKAL--ATEY-ETRFMIAYENVEFTPPGDGSPWLKFDYAEVDTEYLSLDRK-CVSYIGMIQVGIVFPPG 74 (132) Q Consensus 1 M~~--el~~~al~~~--a~a~-~~~~pva~pN~~F~Pp~~g~~yLr~~~~pa~t~~~~L~~~-~~~~~G~~qv~v~~p~G 74 (132) |.. .++.++.+.- ..++ +-..||++.... ....||+.++-...... +-.++ -..-.=.+||+|+|..+ T Consensus 1 ~~~~~k~l~~~~I~~li~~~L~~~nvpv~~~~y~----~~~~tyItf~ey~~~~~--~yaDD~e~~t~~~iQVDIw~sk~ 74 (126) T protein:vir:98 1 MINVTKLIRNAIIANNITDEVNVFNYTIDDHFHE----KTDKPIIRIYPLPFNPD--TYADDNEISREYHYQIDVWWSQD 74 (126) T ss_pred CccchhhhhhhHHHHhhhhhhhccCceeeeeeec----CCCceEEEEEeecCCCC--cccccceeeeEEEEEEEEeecCC Confidence 221 1222222211 1111 126899998775 23468999998844432 22333 34444578999988887 Q ss_pred CChHHHHHHHHHHHHhhhc-CceeeeEEEecCccccCceeCCC-eEEEEEEEEEEeecCC Q lcl|NC_021331. 75 YGTDRPRVLAKEIAQFFYD-GKMLEHGYIYEGARVHKPLKSES-GWILPIRFYVRIETKE 132 (132) Q Consensus 75 ~G~~~a~~lA~~iaa~F~~-g~~l~~~~i~~~p~~~~~~~~~~-~~~ipVsi~yRadt~~ 132 (132) .-+ +|+.+|...+.. |....++. ..++.|+ -|.--.| ||.--+- T Consensus 75 d~~----~l~~~V~~lMk~~GF~r~~~~--------dlYE~DtklyHk~~R--F~~~~~~ 120 (126) T protein:vir:98 75 EPN----EQAEKIVELLKVINFQCYYRE--------PLYESDVMSFRHIIR--AKGSILS 120 (126) T ss_pred CHH----HHHHHHHHHHHHcCCeeeecC--------CCccchhhhheeeee--eeeeecc Confidence 733 356666665532 33222211 0111111 1111111 1111111 No 49 >protein:vir:9415 Length: 126 # NCBI annotation: phi PVL orf 12-like protein # Family: family:all:517 # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803393;genbank:gi:29028705;genbank:GeneID:1258143 Probab=27.77 E-value=1.8 Score=19.17 Aligned_cols=112 Identities=13% Similarity=0.040 Sum_probs=49.6 Q ss_pred CCH--HHHHHHHHHH--hhcc-cCCCcEEcCCCCCCCCCCCccEEEEEEccCCceeeecCCC-ceEEEEEEEEEEEEeCC Q lcl|NC_021331. 1 MHY--ELMLSARKAL--ATEY-ETRFMIAYENVEFTPPGDGSPWLKFDYAEVDTEYLSLDRK-CVSYIGMIQVGIVFPPG 74 (132) Q Consensus 1 M~~--el~~~al~~~--a~a~-~~~~pva~pN~~F~Pp~~g~~yLr~~~~pa~t~~~~L~~~-~~~~~G~~qv~v~~p~G 74 (132) |.. .++.++.+.- ..++ +-..||++.... ....||+.++-...... +-.++ -..-.=.+||+|+|..+ T Consensus 1 ~~~~~k~l~~~~I~~li~~~L~~~nvpv~~~~y~----~~~~tyItf~ey~~~~~--~yaDD~e~~t~~~iQVDIw~sk~ 74 (126) T protein:vir:94 1 MINVTKLIRNAIIANNITDEVNVFNYTIDDHFHE----KTDKPIIRIYPLPFNPD--TYADDNEISREYHYQIDVWWSQD 74 (126) T ss_pred CccchhhhhhhHHHHhhhhhhhccCceeeeeeec----CCCceEEEEEeecCCCC--cccccceeeeEEEEEEEEeecCC Confidence 221 1222222211 1111 126899998775 23468999998844432 22333 34444578999988887 Q ss_pred CChHHHHHHHHHHHHhhhc-CceeeeEEEecCccccCceeCCC-eEEEEEEEEEEeecCC Q lcl|NC_021331. 75 YGTDRPRVLAKEIAQFFYD-GKMLEHGYIYEGARVHKPLKSES-GWILPIRFYVRIETKE 132 (132) Q Consensus 75 ~G~~~a~~lA~~iaa~F~~-g~~l~~~~i~~~p~~~~~~~~~~-~~~ipVsi~yRadt~~ 132 (132) .-+ +|+.+|...+.. |....++. ..++.|+ -|.--.| ||.--+- T Consensus 75 d~~----~l~~~V~~lMk~~GF~r~~~~--------dlYE~DtklyHk~~R--F~~~~~~ 120 (126) T protein:vir:94 75 EPN----EQAEKIVELLKVINFQCYYRE--------PLYESDVMSFRHIIR--AKGSILS 120 (126) T ss_pred CHH----HHHHHHHHHHHHcCCeeeecC--------CCccchhhhheeeee--eeeeecc Confidence 733 356666665532 33222211 0111111 1111111 1111111 No 50 >protein:vir:8331 Length: 150 # NCBI annotation: gp48 # Family: family:all:2795 # MgeID: mge:154 # MgeName: Corndog # Cross-refs: genbank:acc:NP_817899;genbank:gi:29566332;genbank:GeneID:1259527 Probab=20.18 E-value=1.7 Score=19.35 Aligned_cols=110 Identities=13% Similarity=-0.018 Sum_probs=61.8 Q ss_pred CCHHH-------HHHHHHHHhhcccCCCcEEcCCCCCCCCCCCccEEEEEEccCCceeeecCCCceEEEEEEEEEEEEeC Q lcl|NC_021331. 1 MHYEL-------MLSARKALATEYETRFMIAYENVEFTPPGDGSPWLKFDYAEVDTEYLSLDRKCVSYIGMIQVGIVFPP 73 (132) Q Consensus 1 M~~el-------~~~al~~~a~a~~~~~pva~pN~~F~Pp~~g~~yLr~~~~pa~t~~~~L~~~~~~~~G~~qv~v~~p~ 73 (132) =.+|+ +++-+..|+.-+ .++|-.-..-+ ..|++-+.-+++.-....-+ + .-++||.+|... T Consensus 16 ~~~~~~~~sapdae~~vv~wLsp~---~rvA~~R~~~d----plPf~lv~rv~G~d~pde~t-d----~avvsv~~fg~~ 83 (150) T protein:vir:83 16 PEPEILNEGPADAETFVVKWLGEV---YRAANTRRPGD----PLPFLLIQQVAGKENLDEST-A----DPVVQVDILCDK 83 (150) T ss_pred CCcccccCCCccHHHHHHHHhhHH---hhhhhcccCCC----CCCeEEEEecCCCCCccccc-c----cceeeeeecccc Confidence 11122 444555554432 33333333322 25898888887654432222 2 357999999999 Q ss_pred CCChHHHHHHHHHHHHhhhcCce--eeeEEEecCccccCceeCCCeEEEEEEEEEEeecCC Q lcl|NC_021331. 74 GYGTDRPRVLAKEIAQFFYDGKM--LEHGYIYEGARVHKPLKSESGWILPIRFYVRIETKE 132 (132) Q Consensus 74 G~G~~~a~~lA~~iaa~F~~g~~--l~~~~i~~~p~~~~~~~~~~~~~ipVsi~yRadt~~ 132 (132) -.|..+|..+||+.........+ +..|.++. ..-..-|+++.|--|+-+ T Consensus 84 v~G~daA~~~ad~vH~RM~~l~r~tl~~Gtld~----------~~v~~aP~~leY~dD~vv 134 (150) T protein:vir:83 84 VDGEDAARDIKDRVHRRMLLLGRYLEMDGTLDW----------MKVFESPRRLEYTNDKVI 134 (150) T ss_pred ccchhhhhhhhhhHHHHHHHHhhhhccCCcchh----------hhhhccccccccCCCeEE Confidence 99999999999999875433221 22333221 133445666666666433 Done!