Query         lcl|NC_016566.1_cdsid_YP_004957451.1 [gene=EP23p15] [protein=hypothetical protein] [protein_id=YP_004957451.1] [location=10095..10517]
Match_columns 140
No_of_seqs    66 out of 71
Neff          6.6
Searched_HMMs 1612
Date          Thu Nov 7 13:07:16 2013
Command       /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_15 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_15_vs_rec_db.hhr

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 protein:vir:103278 Length: 169 100.0 4.2E-46 2.6E-49  269.4  15.3  133    1-139    37-169 (169)
  2 protein:vir:107704 Length: 132 100.0 3.2E-44   2E-47  259.1  15.2  129    2-140     1-129 (132)
  3 protein:vir:104348 Length: 129 100.0 2.4E-43 1.5E-46  254.2  14.7  129    1-139     1-129 (129)
  4 protein:vir:79637 Length: 130  100.0 2.1E-42 1.3E-45  249.1  14.5  130    2-140     1-130 (130)
  5 protein:vir:94921 Length: 125   99.9 2.1E-25 1.3E-28  155.9  15.0  125    1-140     1-125 (125)
  6 protein:vir:97211 Length: 150   99.7 8.6E-20 5.3E-23  125.1  13.4  134    1-140     1-148 (150)
  7 protein:vir:95155 Length: 151   99.7 2.8E-19 1.7E-22  122.3  13.3  134    1-140     2-149 (151)
  8 protein:vir:80429 Length: 150   99.7 2.6E-19 1.6E-22  122.5  12.3  134    1-140     1-148 (150)
  9 protein:vir:78379 Length: 139   99.0 3.2E-13   2E-16   89.1   5.5  133    1-140     1-135 (139)
 10 protein:vir:94997 Length: 139   98.9 2.2E-12 1.4E-15   84.5   5.4  133    1-140     1-135 (139)
 11 protein:vir:100242 Length: 114  96.7 2.8E-05 1.8E-08   45.5   8.9  114    1-136     1-114 (114)
 12 protein:vir:80371 Length: 115   94.6 0.00065   4E-07   38.1   7.9  114    1-136     1-115 (115)
 13 protein:vir:81066 Length: 118   93.0  0.0089 5.5E-06   31.8  11.2  113    1-139     1-118 (118)
 14 protein:vir:397 Length: 132 #   91.7   0.015 9.1E-06   30.6  12.1  125    1-138     1-132 (132)
 15 protein:vir:95961 Length: 145   90.9   0.018 1.1E-05   30.1  12.5  130    1-140     1-133 (145)
 16 protein:vir:94794 Length: 145   90.9   0.019 1.2E-05   30.1  12.5  130    1-140     1-133 (145)
 17 protein:vir:97325 Length: 145   90.8   0.019 1.2E-05   30.0  12.5  130    1-140     1-133 (145)
 18 protein:vir:105892 Length: 141  90.0   0.023 1.4E-05   29.5  12.6  128    1-140     1-133 (141)
 19 protein:vir:96260 Length: 141   90.0   0.023 1.4E-05   29.5  12.6  128    1-140     1-133 (141)
 20 protein:vir:94096 Length: 141   90.0   0.023 1.4E-05   29.5  12.6  128    1-140     1-133 (141)
 21 protein:vir:10368 Length: 118   89.5   0.026 1.6E-05   29.3  11.5  114    1-140     1-118 (118)
 22 protein:vir:97421 Length: 145   89.3   0.027 1.7E-05   29.1  12.4  130    1-140     1-133 (145)
 23 protein:vir:93736 Length: 145   89.3   0.027 1.7E-05   29.1  12.4  130    1-140     1-133 (145)
 24 protein:vir:94488 Length: 145   89.3   0.027 1.7E-05   29.1  12.4  130    1-140     1-133 (145)
 25 protein:vir:96894 Length: 140   89.2   0.027 1.7E-05   29.1  12.3  130    1-140     1-133 (140)
 26 protein:vir:95111 Length: 145   89.2   0.028 1.7E-05   29.1  12.5  130    1-140     1-133 (145)
 27 protein:vir:1438 Length: 115 #  88.9  0.0092 5.7E-06   31.8   7.2  114    1-136     1-115 (115)
 28 protein:vir:97070 Length: 118   88.2   0.033 2.1E-05   28.7  10.9  113    1-139     1-118 (118)
 29 protein:vir:3428 Length: 131 #  88.1   0.034 2.1E-05   28.6  13.1  124    1-138     1-131 (131)
 30 protein:vir:96800 Length: 127   87.9   0.011 6.5E-06   31.4   6.9  126    1-140     1-127 (127)
 31 protein:vir:1244 Length: 145 #  86.6   0.044 2.7E-05   28.0  12.1  129    1-140     1-135 (145)
 32 protein:vir:100116 Length: 115  86.4   0.014 8.6E-06   30.8   6.7  114    1-136     1-115 (115)
 33 protein:vir:107096 Length: 145  85.5   0.053 3.3E-05   27.6  12.7  130    1-140     1-133 (145)
 34 protein:vir:105337 Length: 145  85.3   0.054 3.3E-05   27.5  12.7  130    1-140     1-133 (145)
 35 protein:vir:79571 Length: 137   83.2    0.07 4.3E-05   26.9  11.1  126    1-138     5-137 (137)
 36 protein:vir:4348 Length: 121 #  75.8    0.14 8.8E-05   25.2  12.3  119    1-140     1-121 (121)
 37 protein:vir:195 Length: 115 #   73.2    0.17 0.00011   24.8   8.1  115    1-138     1-115 (115)
 38 protein:vir:96125 Length: 140   69.1    0.23 0.00014   24.1  12.1  126    1-140     3-133 (140)
 39 protein:vir:5979 Length: 134 #  63.0    0.33  0.0002   23.3  11.2  127    1-140     1-133 (134)
 40 protein:vir:93602 Length: 114   58.2    0.42 0.00026   22.7  10.3  111    1-138     1-114 (114)
 41 protein:vir:1892 Length: 121 #  50.0    0.62 0.00039   21.7  10.1  120    1-140     1-121 (121)
 42 protein:vir:105772 Length: 128  33.5     1.3 0.00084   19.9   9.7  121    1-140     1-126 (128)
 43 protein:vir:3874 Length: 114 #  30.6     1.6 0.00097   19.5   6.8  112    1-120     1-114 (114)
 44 protein:vir:10327 Length: 182   27.5     1.8  0.0011   19.1  14.0  132    1-140     1-140 (182)
 45 protein:vir:80105 Length: 162   22.6     2.4  0.0015   18.5  12.0  130    1-140     1-143 (162)

No 1 >protein:vir:103278 Length: 169 # NCBI annotation: phage-related conserved hypothetical protein # Family: family:all:5121 # MgeID: mge:1605 # MgeName: JK06 # Cross-refs: genbank:acc:YP_277458;genbank:gi:71834100;genbank:GeneID:3562389 Probab=100.00 E-value=4.2e-46 Score=269.37 Aligned_cols=133 Identities=18% Similarity=0.144 Sum_probs=128.0 Q ss_pred CchHHHHHHHHHHHHHHHHhhhhcCCCcceecCCCCCCCCCCCccEEEEEEcCCCccceecCCCCeEEEEEEEEEEEEec Q lcl|NC_016566. 1 MSNTAIRKALNSVVEELSVSLSTSQRPIQVNWENVSGDHANGSGVYLEPYLLPAPTQFVGFQQKGRIYAGVYQVAVVFPA 80 (140) Q Consensus 1 Ms~~~Ir~~~e~~~a~l~~~a~~~~~~lpva~pN~~F~pp~~~~~yLr~~~~pa~t~~~~l~~~~~~~~G~~qv~v~~p~ 80 (140) --|.+|.+++|+ +|..|+++..+++||||||+.|+||+++++|||++++|++|.+.+|++||++|+|+|||+|++|+ T Consensus 37 ~~h~ei~~a~rk---~l~~~a~a~~~~LpVA~ENVaFtPp~dG~~YLr~~~lPadT~~~~L~gd~R~y~GVfQIsVV~Pa 113 (169) T protein:vir:10 37 NVHYEMMVAARK---LVSDAAVDIAGSLPVAYENCGFTPPKNGSSWLKFDYTEVDSVTWGLQRTCRYYVGMVQVSIFFSP 113 (169) T ss_pred chHHHHHHHHHH---HHHHHHhhcccCCcEeeCCCCcCCCCCCccEEEEEEecCCceeeeccCCCceEEEEEEEEEEecC Confidence 228899999995 77889999999999999999999999998999999999999999999999999999999999999 Q ss_pred CCChHHHHHHHHHHHHhccCCccccCCeEEEEcCCccccCCccCCCeEEEEEEEEEEee Q lcl|NC_016566. 
81 GTGTQYASELADAIATSDKWQAVRISGAAFQLQDSPYTSSVVEDVDRARIVVTVPYTCC 139 (140) Q Consensus 81 G~G~~~a~~~A~~ia~~F~~~~~~~~g~~v~v~~~p~~~~~~~~~~~~~iPVsi~yra~ 139 (140) |+|+.+++++||||+++|++++++++| ||++.|+++|+++++..|++|||+.|||| T Consensus 114 GtG~~ka~qiAdeiadlF~~gt~L~~G---yi~~~~~~~p~i~~~s~~~iPvr~~~R~D 169 (169) T protein:vir:10 114 GEGTDRPRQLAGRLSEAFADGTMLDSG---YIYEGGSVFPPVKSQSGWFIPVRFYVRMD 169 (169) T ss_pred CCCcchhHHHHHHHHHhhhCCceeece---eecCCCeECCeeecCCceEEeEEEEEEeC Confidence 999999999999999999999999999 99999999999999999999999999999 No 2 >protein:vir:107704 Length: 132 # NCBI annotation: hypothetical protein # Family: family:all:5121 # MgeID: mge:1518 # MgeName: T1 # Cross-refs: genbank:acc:YP_003903;genbank:gi:45686319;genbank:GeneID:2773044 Probab=100.00 E-value=3.2e-44 Score=259.06 Aligned_cols=129 Identities=19% Similarity=0.202 Sum_probs=122.5 Q ss_pred chHHHHHHHHHHHHHHHHhhhhcCCCcceecCCCCCCCCCCCccEEEEEEcCCCccceecCCCCeEEEEEEEEEEEEecC Q lcl|NC_016566. 2 SNTAIRKALNSVVEELSVSLSTSQRPIQVNWENVSGDHANGSGVYLEPYLLPAPTQFVGFQQKGRIYAGVYQVAVVFPAG 81 (140) Q Consensus 2 s~~~Ir~~~e~~~a~l~~~a~~~~~~lpva~pN~~F~pp~~~~~yLr~~~~pa~t~~~~l~~~~~~~~G~~qv~v~~p~G 81 (140) -|.+|+.+++++++++. .++||||||+.|+||+++++|||+++||++|.+.+|+++|+.|+|+|||+|++|+| T Consensus 1 ~hyE~~~a~r~~la~~~-------~~lpVA~eNv~F~Pp~~G~~yLr~~~lpa~T~~~~L~~d~r~y~Gv~QI~Vv~paG 73 (132) T protein:vir:10 1 MHYELSAAARAAFLSKY-------RDFPHYMENRNFTPPKDGGMWLRFNYIEGDTLYLSIDRKCKSYIAIVQIGVVFPPG 73 (132) T ss_pred CchHHHHHHHHHHHhhh-------cCCcEeecCCCcCCCCCCceEEEEEEccCCceeeeccCcCcEEEEEEEEEEEecCC Confidence 58899999998766542 36899999999999999889999999999999999999999999999999999999 Q ss_pred CChHHHHHHHHHHHHhccCCccccCCeEEEEcCCccccCCccCCCeEEEEEEEEEEeeC Q lcl|NC_016566. 82 TGTQYASELADAIATSDKWQAVRISGAAFQLQDSPYTSSVVEDVDRARIVVTVPYTCCA 140 (140) Q Consensus 82 ~G~~~a~~~A~~ia~~F~~~~~~~~g~~v~v~~~p~~~~~~~~~~~~~iPVsi~yra~a 140 (140) +|+.+++++||+|+++|+++.++.+| ||+++|+++|+|+++..|++|||+.||||. 
T Consensus 74 ~G~~~a~~iAd~i~~~F~~g~~l~~G---yi~~~~~~~p~i~~~s~~~iPvrf~yR~Dt 129 (132) T protein:vir:10 74 SGVDEARLKAKEIADFFKDGKMLNVG---YIFEGAIVHQIVKHESGWMIPVRFTVRVDT 129 (132) T ss_pred CCcchhHHHHHHHHHhccCcceeecc---eecCCCccCCceeCCcceEEEEEEEEEecc Confidence 99999999999999999999999999 999999999999999999999999999999 No 3 >protein:vir:104348 Length: 129 # NCBI annotation: hypothetical protein # Family: family:all:5121 # MgeID: mge:1593 # MgeName: RTP # Cross-refs: genbank:acc:YP_398976;genbank:gi:81343960;genbank:GeneID:3778880 Probab=100.00 E-value=2.4e-43 Score=254.23 Aligned_cols=129 Identities=22% Similarity=0.231 Sum_probs=118.5 Q ss_pred CchHHHHHHHHHHHHHHHHhhhhcCCCcceecCCCCCCCCCCCccEEEEEEcCCCccceecCCCCeEEEEEEEEEEEEec Q lcl|NC_016566. 1 MSNTAIRKALNSVVEELSVSLSTSQRPIQVNWENVSGDHANGSGVYLEPYLLPAPTQFVGFQQKGRIYAGVYQVAVVFPA 80 (140) Q Consensus 1 Ms~~~Ir~~~e~~~a~l~~~a~~~~~~lpva~pN~~F~pp~~~~~yLr~~~~pa~t~~~~l~~~~~~~~G~~qv~v~~p~ 80 (140) ||-..++.+.+..+ +.|. ..+||||||+.|+||+++++|||++++|++|++.+|++||++|+|+|||+|++|+ T Consensus 1 ~s~aar~~v~d~~~---~~~~----~~lpVA~eNv~FtPp~~G~~YLr~~~lpa~T~~~~L~~d~r~y~Gv~QI~Vv~p~ 73 (129) T protein:vir:10 1 MSLAARKFVNDLLV---NEFP----VRYPVAWENAAFTPPADGSIWLKYDYTEVDTVTYGLSRKCKYYVGMVQISVFFSP 73 (129) T ss_pred CchHHHHHHHHHHH---Hhhc----CCCcEeecCCCcCCCCCCceEEEEEecCCCceeeeccCCCceEEEEEEEEEEecC Confidence 99876655554322 2232 3689999999999999999999999999999999999999999999999999999 Q ss_pred CCChHHHHHHHHHHHHhccCCccccCCeEEEEcCCccccCCccCCCeEEEEEEEEEEee Q lcl|NC_016566. 
81 GTGTQYASELADAIATSDKWQAVRISGAAFQLQDSPYTSSVVEDVDRARIVVTVPYTCC 139 (140) Q Consensus 81 G~G~~~a~~~A~~ia~~F~~~~~~~~g~~v~v~~~p~~~~~~~~~~~~~iPVsi~yra~ 139 (140) |+|+.+++++|+||+++|++++++.+| ||++.|+++|+++++..|++|||+.|||| T Consensus 74 G~G~~~a~~iA~ei~d~F~~g~~L~~G---yi~~~~~~~p~i~~~~~~~ipvr~~~r~d 129 (129) T protein:vir:10 74 GTGIDKPRQIANQLAESIVDGTMLDSG---TIYESGVVNPVIKSKSGWFIPVRFYVRLD 129 (129) T ss_pred CCCcchhhHHHHHHHHhccCCceeece---eecCCCeECCeeecCCceEEeEEEEEEeC Confidence 999999999999999999999999999 99999999999999999999999999999 No 4 >protein:vir:79637 Length: 130 # NCBI annotation: gp41 # Family: family:all:5121 # MgeID: mge:1872 # MgeName: TLS # Cross-refs: genbank:acc:YP_001285530;genbank:gi:148734513;genbank:GeneID:5219995 Probab=100.00 E-value=2.1e-42 Score=249.13 Aligned_cols=130 Identities=19% Similarity=0.201 Sum_probs=115.3 Q ss_pred chHHHHHHHHHHHHHHHHhhhhcCCCcceecCCCCCCCCCCCccEEEEEEcCCCccceecCCCCeEEEEEEEEEEEEecC Q lcl|NC_016566. 2 SNTAIRKALNSVVEELSVSLSTSQRPIQVNWENVSGDHANGSGVYLEPYLLPAPTQFVGFQQKGRIYAGVYQVAVVFPAG 81 (140) Q Consensus 2 s~~~Ir~~~e~~~a~l~~~a~~~~~~lpva~pN~~F~pp~~~~~yLr~~~~pa~t~~~~l~~~~~~~~G~~qv~v~~p~G 81 (140) -|.+|.++.++.++... + ..|||||||+.|+||+++++|||++++|++|++.+|++||++|+|+|||+|++|+| T Consensus 1 ~~~e~~~aaR~~~~~~~---~---~~lpVA~ENv~FtPp~~G~~YLr~~~lpa~T~~~~L~~d~r~y~Gv~QI~VV~paG 74 (130) T protein:vir:79 1 MHYELSVAARMALAQEY---E---SEYMIAYENVEFTPPKGGGIWLKYDYKEADTIIHDLKRKCISYIGMVQIGIEFPPG 74 (130) T ss_pred CcchhhHHHHHHHHhhh---h---hhCceeecCCCcCCCCCCceEEEEEecCCCceeeeccCCCceEEEEEEEEEEecCC Confidence 23344444443332221 1 14899999999999999999999999999999999999999999999999999999 Q ss_pred CChHHHHHHHHHHHHhccCCccccCCeEEEEcCCccccCCccCCCeEEEEEEEEEEeeC Q lcl|NC_016566. 
82 TGTQYASELADAIATSDKWQAVRISGAAFQLQDSPYTSSVVEDVDRARIVVTVPYTCCA 140 (140) Q Consensus 82 ~G~~~a~~~A~~ia~~F~~~~~~~~g~~v~v~~~p~~~~~~~~~~~~~iPVsi~yra~a 140 (140) +|+.+++++|++|+++|++++++.+| ||++.|+++|+++++..|++|||+.||||- T Consensus 75 ~G~~~a~~iA~ei~dlF~~g~~L~~G---yi~~~~~~~p~i~~~~~~~iPvr~~~R~d~ 130 (130) T protein:vir:79 75 SGIDKARKLAKNIADFFEDGKMLSNG---YISEGAKVHQVQKSESGWFYPVRFYVRYDG 130 (130) T ss_pred CCcchhhHHHHHHHHhccCCceeece---eecCCCeECCeeecCCceEEeEEEEEEecC Confidence 99999999999999999999999999 999999999999999999999999999999 No 5 >protein:vir:94921 Length: 125 # NCBI annotation: possible peptidoglycan binding protein # Family: family:all:5248 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239283;genbank:gi:66392065;genbank:GeneID:5076566 Probab=99.88 E-value=2.1e-25 Score=155.88 Aligned_cols=125 Identities=14% Similarity=0.140 Sum_probs=107.5 Q ss_pred CchHHHHHHHHHHHHHHHHhhhhcCCCcceecCCCCCCCCCCCccEEEEEEcCCCccceecCCCCeEEEEEEEEEEEEec Q lcl|NC_016566. 1 MSNTAIRKALNSVVEELSVSLSTSQRPIQVNWENVSGDHANGSGVYLEPYLLPAPTQFVGFQQKGRIYAGVYQVAVVFPA 80 (140) Q Consensus 1 Ms~~~Ir~~~e~~~a~l~~~a~~~~~~lpva~pN~~F~pp~~~~~yLr~~~~pa~t~~~~l~~~~~~~~G~~qv~v~~p~ 80 (140) |+..+||++|+++++ +.| ++.||.|||.+ |.+.++|.|++..++++...++|+.|.+++|++.|+||+|. T Consensus 1 Mt~~q~r~~I~~r~~--a~~-----~~~~I~~~N~p---p~~~~~W~Rlti~~g~~~~a~iG~~~~~rtGli~iqiF~p~ 70 (125) T protein:vir:94 1 MSYFQEKLDIENYFK--ANW-----PDTPIFYENRT---ANSTGTWVRLTIQNGDAFQASNGEVSYRHPGVVFVQIFTKK 70 (125) T ss_pred CCHHHHHHHHHHHHH--hCC-----CccceeeCCCC---CCCCCceEEEEeccCcccccccCCceeeeeeEEEEEeeecC Confidence 999999999998765 233 56789999974 44456899999999999999999878899999999999999 Q ss_pred CCChHHHHHHHHHHHHhccCCccccCCeEEEEcCCccccCCccCCCeEEEEEEEEEEeeC Q lcl|NC_016566. 81 GTGTQYASELADAIATSDKWQAVRISGAAFQLQDSPYTSSVVEDVDRARIVVTVPYTCCA 140 (140) Q Consensus 81 G~G~~~a~~~A~~ia~~F~~~~~~~~g~~v~v~~~p~~~~~~~~~~~~~iPVsi~yra~a 140 (140) |.|+..+.++||.+++||++++. 
.++.++..+.+.++ +++.||+++|+|+||+-+ T Consensus 71 ~~G~~~~~~~ad~~~~~f~~~~~--g~i~f~~~~~~~~g---~~~gwyQ~Nv~I~f~~~~ 125 (125) T protein:vir:94 71 EVGSGEALKLADKVDALFRSKTL--GNIQFKVPQVQKVP---STTEWYQVNVSTEFYRGS 125 (125) T ss_pred CcChHHHHHHHHHHHHHHccCCC--CceEEeeceecCCC---CCCCEEEEEEEEeeecCC Confidence 99999999999999999998754 45666655555555 468999999999999999 No 6 >protein:vir:97211 Length: 150 # NCBI annotation: hypothetical protein ORF026 # Family: family:all:5248 # MgeID: mge:1657 # MgeName: M6 # Cross-refs: genbank:acc:YP_001294534;genbank:gi:149408255;genbank:GeneID:5237076 Probab=99.71 E-value=8.6e-20 Score=125.12 Aligned_cols=134 Identities=17% Similarity=0.199 Sum_probs=104.4 Q ss_pred Cc---hHHHHHHHHHHHHHHHHhhhh---cCCC--cceecCCCCCCC-CCCCccEEEEEEcCCCccceecC---CCCeEE Q lcl|NC_016566. 1 MS---NTAIRKALNSVVEELSVSLST---SQRP--IQVNWENVSGDH-ANGSGVYLEPYLLPAPTQFVGFQ---QKGRIY 68 (140) Q Consensus 1 Ms---~~~Ir~~~e~~~a~l~~~a~~---~~~~--lpva~pN~~F~p-p~~~~~yLr~~~~pa~t~~~~l~---~~~~~~ 68 (140) |+ .++||+-+++++. +.|.+. ..++ +.++|||+.|.+ |.+..+|.|++..+......++| +.+.++ T Consensus 1 ~~~~tF~qaR~ei~t~f~--~~W~a~~~a~~g~~p~~~~w~~~~~~~~P~g~~~WaRLti~~~~~~~as~G~~~gr~~~r 78 (150) T protein:vir:97 1 MTLPTFDSARDEILGLFN--TKWITDTPALNGGAPIRVEWPGVDAGDPPPADKPYARITLRHTTSRQATFGPTGGRRFTR 78 (150) T ss_pred CCCCcHHHHHHHHHhhhh--hhccccchhhcCCcceeeccCCcccCCCcCCCCceEEEEeeccccccccccCCCCcEEee Confidence 65 7899998888643 567322 1223 349999998877 56667899999999999999998 344578 Q ss_pred EEEEEEEEEEe--cCCChHHHHHHHHHHHHhccCCccccCCeEEEEcCCccccCCccCCCeEEEEEEEEEEeeC Q lcl|NC_016566. 69 AGVYQVAVVFP--AGTGTQYASELADAIATSDKWQAVRISGAAFQLQDSPYTSSVVEDVDRARIVVTVPYTCCA 140 (140) Q Consensus 69 ~G~~qv~v~~p--~G~G~~~a~~~A~~ia~~F~~~~~~~~g~~v~v~~~p~~~~~~~~~~~~~iPVsi~yra~a 140 (140) .|++.|+||+| .|+|+..|.+.||+++++|++++- ..++.++. 
+++-..-+++.||+++|+|+|+-|- T Consensus 79 ~Gli~VQiF~p~~~G~G~~la~~~Ad~a~eaFe~~~t-~g~i~f~~---a~~~eig~~~gWyQ~Nv~i~Feyde 148 (150) T protein:vir:97 79 PGLITVQVFTPLSGGQGLSLAEKCAIIARDAFEGRGT-ASGIWFRN---ARIQEIGPDGAWYQMNVVVEFEYDE 148 (150) T ss_pred CcEEEEEEeeeccCCchhhHHHHHHHHHHHHHhccCC-cCCeeccc---ccccccCCCCceEEEEeEeeeeccc Confidence 89999999999 699999999999999999998752 23455553 2334444556899999999999999 No 7 >protein:vir:95155 Length: 151 # NCBI annotation: hypothetical protein ORF015 # Family: family:all:5248 # MgeID: mge:1552 # MgeName: PA73 # Cross-refs: genbank:acc:YP_001293422;genbank:gi:148912843;genbank:GeneID:5228230 Probab=99.69 E-value=2.8e-19 Score=122.28 Aligned_cols=134 Identities=11% Similarity=0.134 Sum_probs=96.1 Q ss_pred CchHHHHHHHHHHHHHHHHhhh----hcCCCcceecCCC-CCCCCCCCccEEEEEEcCCCccceec------CC-CCeEE Q lcl|NC_016566. 1 MSNTAIRKALNSVVEELSVSLS----TSQRPIQVNWENV-SGDHANGSGVYLEPYLLPAPTQFVGF------QQ-KGRIY 68 (140) Q Consensus 1 Ms~~~Ir~~~e~~~a~l~~~a~----~~~~~lpva~pN~-~F~pp~~~~~yLr~~~~pa~t~~~~l------~~-~~~~~ 68 (140) |+.++||+.+.+++ ++.|-+ .....-.|+|||. .|++|++..+|+|++.-++++...+| ++ .|.++ T Consensus 2 mtf~q~R~~i~~~~--~~~w~~~~~~~a~~~p~v~~~~~~~~d~P~g~~~WaRLti~h~~~~qA~ls~~~eigggp~~~r 79 (151) T protein:vir:95 2 IEFDQVNDEVNALF--LATWNAGSAAIAGYVPEIRWQGVQYRDLPDGSKFWVRLSKQTVFEEQATLSTCEGVPGQRKYTA 79 (151) T ss_pred ccHHHHHHHHHHHh--hhhcccCchhhhccccccccCCCCCCCCCCCCCceEEEEeecCCCccccccccccCCCCceEee Confidence 99999999888764 344511 1111124788885 45668778899999999988888876 33 46789 Q ss_pred EEEEEEEEEEecCCChHH--HHHHHHHHHHhccCCccccCCeEEEEcCCccccCCccCCCeEEEEEEEEEEeeC Q lcl|NC_016566. 69 AGVYQVAVVFPAGTGTQY--ASELADAIATSDKWQAVRISGAAFQLQDSPYTSSVVEDVDRARIVVTVPYTCCA 140 (140) Q Consensus 69 ~G~~qv~v~~p~G~G~~~--a~~~A~~ia~~F~~~~~~~~g~~v~v~~~p~~~~~~~~~~~~~iPVsi~yra~a 140 (140) +|++.|+||+|.|+|... 
|.++|+-++++|++++- ..++.++..+...+| +++.||+++|+|+|.-|- T Consensus 80 tGli~VQiF~p~~~G~~Le~Adkla~~a~eaFe~~~t-~g~i~f~~~s~~eiG---~~~gWyQ~Nv~i~f~y~e 149 (151) T protein:vir:95 80 SGLVFVQIFCPKSNTQAFELGQKLAKLARNAFRGKST-PGKVWFRNTRINELP---PEELYERFNVVTEFEYDE 149 (151) T ss_pred CcEEEEEEeeeccCchhhHHHHHHHHHHHHHhhccCC-CCCceeeeeeecccC---CCCCeEEEEeeeeecccc Confidence 999999999999888664 44444445799987651 234666644443333 556899999999999999 No 8 >protein:vir:80429 Length: 150 # NCBI annotation: BcepGomrgp11 # Family: family:all:5248 # MgeID: mge:1882 # MgeName: BcepGomr # Cross-refs: genbank:acc:YP_001210231;genbank:gi:146329923;genbank:GeneID:5123538 Probab=99.68 E-value=2.6e-19 Score=122.52 Aligned_cols=134 Identities=15% Similarity=0.155 Sum_probs=100.3 Q ss_pred CchH--HHHHHHHHHHHHHHHhhhhcC----CCcc-eecCCCCCC-CCCCCccEEEEEEcCCCccceecCC----CCeEE Q lcl|NC_016566. 1 MSNT--AIRKALNSVVEELSVSLSTSQ----RPIQ-VNWENVSGD-HANGSGVYLEPYLLPAPTQFVGFQQ----KGRIY 68 (140) Q Consensus 1 Ms~~--~Ir~~~e~~~a~l~~~a~~~~----~~lp-va~pN~~F~-pp~~~~~yLr~~~~pa~t~~~~l~~----~~~~~ 68 (140) |-+. +.|+-+++++. +.|-.+.+ ++.| |+|||+.|. ||++..+|.|++..+++....+|++ .+.++ T Consensus 1 ~~~~~~~ar~ei~~~f~--~~W~~~~~~~~~g~~~~~~w~~~~~~~pP~g~~~WaRLti~h~~~~qA~~~~~~~gr~~~r 78 (150) T protein:vir:80 1 MIQDALQARSDINTMLF--DQWSVADWSKVKGGKPNIAWEGRESARPPDGSAPYVAIFIKHVDGQQASLTDPDMLRRWSR 78 (150) T ss_pred CcchhhhhHHHHHHHHh--hhhccCcchhhcCCcceeeecCcccCCcCCCCCceEEEEEecCCcccccccCCCCcceEee Confidence 6432 34566666544 45643321 2233 999999995 4666668999999999999999973 34567 Q ss_pred EEEEEEEEEEe--cCCChHHHHHHHHHHHHhccCCccccCCeEEEEcCCccccCCccCCCeEEEEEEEEEEeeC Q lcl|NC_016566. 69 AGVYQVAVVFP--AGTGTQYASELADAIATSDKWQAVRISGAAFQLQDSPYTSSVVEDVDRARIVVTVPYTCCA 140 (140) Q Consensus 69 ~G~~qv~v~~p--~G~G~~~a~~~A~~ia~~F~~~~~~~~g~~v~v~~~p~~~~~~~~~~~~~iPVsi~yra~a 140 (140) .|++.|+||+| .|+|+..|.+.||+++++|++++-. .++.++. 
+++-..-+++.||+++|+|+|+-|- T Consensus 79 ~GlI~VQiF~p~~~G~G~~la~k~Ad~a~eaFe~~~t~-g~i~f~~---as~~eiG~d~gWYQ~NV~ipF~yde 148 (150) T protein:vir:80 79 DGLITVQCFGMLSAGQGLEDATYQATIAMRAFEGKQSA-NGIWFRN---ARIKEIGSDRGWYQVNMIVEFEYDE 148 (150) T ss_pred CcEEEEEEeeeccCCchhhHHHHHHHHHHHHHhccCCC-CCccccc---ccccccCCCCceEEEEeEeeeeccc Confidence 89999999999 6999999999999999999987522 3455553 2334444556899999999999999 No 9 >protein:vir:78379 Length: 139 # NCBI annotation: hypothetical protein # Family: family:all:12033 # MgeID: mge:1851 # MgeName: SETP3 # Cross-refs: genbank:acc:YP_001110845;genbank:gi:134288606;genbank:GeneID:5179642 Probab=99.04 E-value=3.2e-13 Score=89.08 Aligned_cols=133 Identities=19% Similarity=0.231 Sum_probs=109.1 Q ss_pred Cch--HHHHHHHHHHHHHHHHhhhhcCCCcceecCCCCCCCCCCCccEEEEEEcCCCccceecCCCCeEEEEEEEEEEEE Q lcl|NC_016566. 1 MSN--TAIRKALNSVVEELSVSLSTSQRPIQVNWENVSGDHANGSGVYLEPYLLPAPTQFVGFQQKGRIYAGVYQVAVVF 78 (140) Q Consensus 1 Ms~--~~Ir~~~e~~~a~l~~~a~~~~~~lpva~pN~~F~pp~~~~~yLr~~~~pa~t~~~~l~~~~~~~~G~~qv~v~~ 78 (140) |.. ...-+++.+ +|.+. +.-.+++|+.||++ .|.+...+||..+++-++|++.+|-..- ++.|++||+|.+ T Consensus 1 matyfedltkafdt---alvaf--gtnngikvalenid-aptstdtpylasymllsdteqadlfwte-qragvyqvdinv 73 (139) T protein:vir:78 1 MATYFEDLTKAFDT---ALVAF--GTNNGIKVALENID-APTSTDTPYLASYMLLSDTEQADLFWTE-QRAGVYQVDINV 73 (139) T ss_pred CchhHHHHHHhhhh---eeeee--ccCCceeEeeeccC-CCccCCcchhhheeeeccCcccceeeec-ccCceEEEeeec Confidence 663 334445553 33322 33468999999997 5555555699999999999999997763 678999999999 Q ss_pred ecCCChHHHHHHHHHHHHhccCCccccCCeEEEEcCCccccCCccCCCeEEEEEEEEEEeeC Q lcl|NC_016566. 79 PAGTGTQYASELADAIATSDKWQAVRISGAAFQLQDSPYTSSVVEDVDRARIVVTVPYTCCA 140 (140) Q Consensus 79 p~G~G~~~a~~~A~~ia~~F~~~~~~~~g~~v~v~~~p~~~~~~~~~~~~~iPVsi~yra~a 140 (140) -..-|+..+.++||++.+.|..+...+.+-.|-..++.+.||.+.+.+|-.-|+||+|-||. 
T Consensus 74 gsalgsapinrladklnaafaagncfsrneicaevqsvslgplivengwakrplsinfiaft 135 (139) T protein:vir:78 74 GSALGSAPINRLADKLNAAFAAGNCFSRNEICAEVQSVSLGPLIVENGWAKRPLSINFIAFT 135 (139) T ss_pred ccccccchhHHHHhhhhhhhhccccccchhhhhhhhhccccceeeccCcccCceeeeeeeee Confidence 99999999999999999999877777777667766788999999999999999999999998 No 10 >protein:vir:94997 Length: 139 # NCBI annotation: hypothetical protein # Family: family:all:12033 # MgeID: mge:1547 # MgeName: KS7 # Cross-refs: genbank:acc:YP_224021;genbank:gi:62327308;genbank:GeneID:5176825 Probab=98.92 E-value=2.2e-12 Score=84.46 Aligned_cols=133 Identities=20% Similarity=0.243 Sum_probs=107.9 Q ss_pred Cch--HHHHHHHHHHHHHHHHhhhhcCCCcceecCCCCCCCCCCCccEEEEEEcCCCccceecCCCCeEEEEEEEEEEEE Q lcl|NC_016566. 1 MSN--TAIRKALNSVVEELSVSLSTSQRPIQVNWENVSGDHANGSGVYLEPYLLPAPTQFVGFQQKGRIYAGVYQVAVVF 78 (140) Q Consensus 1 Ms~--~~Ir~~~e~~~a~l~~~a~~~~~~lpva~pN~~F~pp~~~~~yLr~~~~pa~t~~~~l~~~~~~~~G~~qv~v~~ 78 (140) |.. ...-+++.+ +|... +....++||.||+. .|.+...+||..+++-++|++.+|-..- ++.|++||+|.+ T Consensus 1 matyfedltkafdt---alvtf--gtdndikvalenid-aptstdapylasymllsdteqadlfwte-qragvyqvdinv 73 (139) T protein:vir:94 1 MATYFEDLTKAFDT---ALVTF--GTDNDIKVALENID-APTSTDAPYLASYMLLSDTEQADLFWTE-QRAGVYQVDINV 73 (139) T ss_pred CchhHHHHHHhhhh---eeeee--ccCCCceEEeeccC-CCcccCcchhhheeecccCcccceeeec-ccCceEEEeeec Confidence 653 233344443 33322 23457899999997 5555445699999999999999997763 678999999999 Q ss_pred ecCCChHHHHHHHHHHHHhccCCccccCCeEEEEcCCccccCCccCCCeEEEEEEEEEEeeC Q lcl|NC_016566. 79 PAGTGTQYASELADAIATSDKWQAVRISGAAFQLQDSPYTSSVVEDVDRARIVVTVPYTCCA 140 (140) Q Consensus 79 p~G~G~~~a~~~A~~ia~~F~~~~~~~~g~~v~v~~~p~~~~~~~~~~~~~iPVsi~yra~a 140 (140) -..-|+..+.++||++..-|..+...+.+-.|-..++.+.||.+.+.+|-.-|+||+|-||. 
T Consensus 74 gsalgsapinrladklnttfaagncfsrneicaevqsvslgplivengwakrplsinfiaft 135 (139) T protein:vir:94 74 GSALGSAPINRLADKLNTTFAAGNCFSRNEICAEVQSVSLGPLIVENGWAKRPLSINFIAFT 135 (139) T ss_pred ccccccchhHHHHHhhhhhhhccccccchhhhhhhhhccccceeeccCcccCceeeeeeeee Confidence 99999999999999999999877777777667766788999999999999999999999998 No 11 >protein:vir:100242 Length: 114 # NCBI annotation: gp71 # Family: family:all:2712 # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355407;genbank:gi:77864697;genbank:GeneID:3725964 Probab=96.75 E-value=2.8e-05 Score=45.51 Aligned_cols=114 Identities=19% Similarity=0.164 Sum_probs=78.0 Q ss_pred CchHHHHHHHHHHHHHHHHhhhhcCCCcceecCCCCCCCCCCCccEEEEEEcCCCccceecCCCCeEEEEEEEEEEEEec Q lcl|NC_016566. 1 MSNTAIRKALNSVVEELSVSLSTSQRPIQVNWENVSGDHANGSGVYLEPYLLPAPTQFVGFQQKGRIYAGVYQVAVVFPA 80 (140) Q Consensus 1 Ms~~~Ir~~~e~~~a~l~~~a~~~~~~lpva~pN~~F~pp~~~~~yLr~~~~pa~t~~~~l~~~~~~~~G~~qv~v~~p~ 80 (140) ||.-.||++|-. |+- +=.|.++. |...+.+|+- +...+...-.+|+|..-..+|.|||+++... T Consensus 1 ~~~~~i~~~l~~----~~g---------~~~~~~~a--P~~~~~Py~v-y~rvsg~p~~tL~G~~g~~~~r~QiD~yA~T 64 (114) T protein:vir:10 1 MSALTIRDAIGI----VGG---------AKGYVSVA--SSAAQSPYYV-VSRVSGTRDMALGGATGGKSGMFQIDVYAKT 64 (114) T ss_pred Cceeeeehhhcc----ccc---------ccccCCCC--CCCCCCceEE-EEeccCcccccccCCCCcceEEEEEEeeeCC Confidence 999999988863 321 12355554 5544556776 4445556667898888789999999999876 Q ss_pred CCChHHHHHHHHHHHHhccCCccccCCeEEEEcCCccccCCccCCCeEEEEEEEEE Q lcl|NC_016566. 
81 GTGTQYASELADAIATSDKWQAVRISGAAFQLQDSPYTSSVVEDVDRARIVVTVPY 136 (140) Q Consensus 81 G~G~~~a~~~A~~ia~~F~~~~~~~~g~~v~v~~~p~~~~~~~~~~~~~iPVsi~y 136 (140) -.+|+++|+++.++-.....++ .+.+++-+......+.-=+..+-+||.| T Consensus 65 ---~~eA~~La~~~~~~l~~~~~f~---~~~l~~~~d~ye~dT~l~Rvsld~si~f 114 (114) T protein:vir:10 65 ---YTEADSLADQIIDRVESTGMFS---VGGVSDLPDDYSSDTGVFRVSLEISVQF 114 (114) T ss_pred ---HHHHHHHHHHHHhhcccccCee---eeccccCCCCCCcccCceEEEEEEEEeC Confidence 4688999999987654322222 1346666666665555557888889999 No 12 >protein:vir:80371 Length: 115 # NCBI annotation: gp11 # Family: family:all:2712 # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111090;genbank:gi:134288622;genbank:GeneID:4960618 Probab=94.58 E-value=0.00065 Score=38.07 Aligned_cols=114 Identities=18% Similarity=0.179 Sum_probs=74.2 Q ss_pred CchHHHHHHHHHHHHHHHHhhhhcCCCcceecCCCCCCCCCCCccEEEEEEcCCCccceecCCCCeEEEEEEEEEEEEec Q lcl|NC_016566. 1 MSNTAIRKALNSVVEELSVSLSTSQRPIQVNWENVSGDHANGSGVYLEPYLLPAPTQFVGFQQKGRIYAGVYQVAVVFPA 80 (140) Q Consensus 1 Ms~~~Ir~~~e~~~a~l~~~a~~~~~~lpva~pN~~F~pp~~~~~yLr~~~~pa~t~~~~l~~~~~~~~G~~qv~v~~p~ 80 (140) ||-..||++|.++ .. -=.|..+. |-.-+.+|+-+..+- ......|.|+.-...|.|||+++.+. T Consensus 1 ~~~~vir~al~~i----~~---------~~~~~~vA--p~~~~~pyivy~rvs-ga~e~~L~G~ag~~~~~~QID~yA~T 64 (115) T protein:vir:80 1 MSVIVVRDALQGI----GG---------AKGYLGVA--PEKAPARYFVVTRVH-GALDMALAGPTGGRSGSYQIDCYAPT 64 (115) T ss_pred Ceeeeeechhhhc----cc---------cccceeec--cccCcCCeEEEeecC-CCccccccCCCCCceeEEEEeeecCC Confidence 9999999998742 11 12344443 222233476555444 34455677776678999999999765 Q ss_pred CCChHHHHHHHHHHHHhcc-CCccccCCeEEEEcCCccccCCccCCCeEEEEEEEEE Q lcl|NC_016566. 81 GTGTQYASELADAIATSDK-WQAVRISGAAFQLQDSPYTSSVVEDVDRARIVVTVPY 136 (140) Q Consensus 81 G~G~~~a~~~A~~ia~~F~-~~~~~~~g~~v~v~~~p~~~~~~~~~~~~~iPVsi~y 136 (140) ..+++++|+++++.-. 
.....+.| .+++-|..-...+.--+..+-++|-| T Consensus 65 ---~~ea~~La~~v~d~~~~~~~~~~vg---~l~e~pd~Ye~DT~l~Rvs~dv~i~f 115 (115) T protein:vir:80 65 ---FTDADRLADLAVDRAMSVQDRFSVG---GVDELPDDYSADTGLFRVSLELSVEF 115 (115) T ss_pred ---HHHHHHHHHHHHHhhhCCcccccee---cccCCCcccccccceEEEEEEEEEeC Confidence 5789999999988332 22233344 35555666666666668888999998 No 13 >protein:vir:81066 Length: 118 # NCBI annotation: p13 # Family: family:all:3244 # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285683;genbank:gi:148727191;genbank:GeneID:5247111 Probab=92.96 E-value=0.0089 Score=31.84 Aligned_cols=113 Identities=11% Similarity=-0.030 Sum_probs=62.0 Q ss_pred Cc-hHHHHHHHHHHHHHHHHhhhhcCCCcceecCCCCCCCCCCC-ccEEEEEEcCCCccceecCCC-CeEEEEEEEEEEE Q lcl|NC_016566. 1 MS-NTAIRKALNSVVEELSVSLSTSQRPIQVNWENVSGDHANGS-GVYLEPYLLPAPTQFVGFQQK-GRIYAGVYQVAVV 77 (140) Q Consensus 1 Ms-~~~Ir~~~e~~~a~l~~~a~~~~~~lpva~pN~~F~pp~~~-~~yLr~~~~pa~t~~~~l~~~-~~~~~G~~qv~v~ 77 (140) || +.+|+++|. .+. .. =.||.+. |...+ .+|+-...+-+.. ...|.|. +......+||+|+ T Consensus 1 Ms~e~~l~a~L~----~~~---~~------Rvyp~~a--P~~~~~~Pyiv~q~vsg~p-~~~l~G~~~~~~~~rvQIdvy 64 (118) T protein:vir:81 1 MSYGRVLKDLLD----PVF---SG------RVYADIP--PDSPPLDAYAIYQRVGGVP-VYWQEGGMPEKVNARVQIQIW 64 (118) T ss_pred CchHHHHHHHHH----hhc---CC------ccccccC--CCCCccCceEEEEecCCcc-cccccCCCCCccceeEEEEEe Confidence 99 666766654 221 11 2456553 33323 2588888877754 4457664 4455678999999 Q ss_pred EecCCChHHHHHHHHHHHHhccCCccccCCeEEEEcCCccccCCccCCCeEEE--EEEEEEEee Q lcl|NC_016566. 78 FPAGTGTQYASELADAIATSDKWQAVRISGAAFQLQDSPYTSSVVEDVDRARI--VVTVPYTCC 139 (140) Q Consensus 78 ~p~G~G~~~a~~~A~~ia~~F~~~~~~~~g~~v~v~~~p~~~~~~~~~~~~~i--PVsi~yra~ 139 (140) .. --.+|++++++|++...+.... ..+.++.... .++...|.. 
=++|-|.-- T Consensus 65 A~---t~~~A~~l~~av~~al~~~~~~-----~~~~~~~d~y--e~dt~l~r~~~Df~iw~~~~ 118 (118) T protein:vir:81 65 SR---SKQEAYLATVQVLRLVSEAPDM-----QVLSQPIDDY--VREIKLYGSRVDVSMWYPIT 118 (118) T ss_pred eC---CHHHHHHHHHHHHHHhhhccce-----eeccCCcccc--ccccCceeEEEEEEEEecCC Confidence 75 4578888888887766532211 1111222222 233344444 344444433 No 14 >protein:vir:397 Length: 132 # NCBI annotation: gp12 # Family: family:all:911 # MgeID: mge:325 # MgeName: N15 # Cross-refs: genbank:acc:NP_046907;genbank:gi:9630477;genbank:GeneID:1261651 Probab=91.67 E-value=0.015 Score=30.62 Aligned_cols=125 Identities=10% Similarity=0.037 Sum_probs=68.3 Q ss_pred CchHHHHHHHHHHHHHHHHhhhhcCCCcceecCCC--CCCCCCCCccEEEEEEcCCCccceecCCCCeEEEEEEEEEEEE Q lcl|NC_016566. 1 MSNTAIRKALNSVVEELSVSLSTSQRPIQVNWENV--SGDHANGSGVYLEPYLLPAPTQFVGFQQKGRIYAGVYQVAVVF 78 (140) Q Consensus 1 Ms~~~Ir~~~e~~~a~l~~~a~~~~~~lpva~pN~--~F~pp~~~~~yLr~~~~pa~t~~~~l~~~~~~~~G~~qv~v~~ 78 (140) |+|++||+++-. +|.... +. .+.|-|- .|...+.- +=..|++-.+.....+|.++ ..+..|.|.||- T Consensus 1 ~~ht~IR~~Vid---~L~~~l----~~-~~~ffdGrP~fiDe~el-PAVAV~l~d~~~~~~~ld~~--~w~A~LhI~iyL 69 (132) T protein:vir:39 1 MKHRDIRKVIID---ALESAI----GT-DAIYFDGRPAVLEEGDF-PAVAVYLTDAEYTGEELDAD--TWQAILHIEVFL 69 (132) T ss_pred CchHHHHHHHHH---HHHhhC----CC-ceEEecCcceeeccccC-cEEEEEeecCCCCcceecCC--eeEEEEEEEEEe Confidence 999999977654 443321 12 2443332 23333322 35577777777777777644 789999999999 Q ss_pred ecCCChHHHHHHHHHHHHhccC-Ccccc-CCe-EEEEcCCccccCCccCCCeEEE--EEEEEEEe Q lcl|NC_016566. 79 PAGTGTQYASELADAIATSDKW-QAVRI-SGA-AFQLQDSPYTSSVVEDVDRARI--VVTVPYTC 138 (140) Q Consensus 79 p~G~G~~~a~~~A~~ia~~F~~-~~~~~-~g~-~v~v~~~p~~~~~~~~~~~~~i--PVsi~yra 138 (140) |+..+..+-.++|.++. |+. ..... .++ .+.....=+=.+-.+..+|... 
--+|+|.- T Consensus 70 ka~~~ds~LD~~aE~~i--~p~i~~~~~l~~l~~~~~~~gy~Y~rD~~~atW~sadL~y~ItY~~ 132 (132) T protein:vir:39 70 EAQVPDSELDDWMETRV--YPVLAEVPGLESLITTMVQQGYDYQRDDDMALWSSADLKYSITYDM 132 (132) T ss_pred ecCCCHHHHHHHHHHHh--HhhhcccchhhhHhhhhhhcCCCcccccccceEEEEEEEEEEEEeC Confidence 99999999999999873 221 11100 010 0000000000111222235443 34566666 No 15 >protein:vir:95961 Length: 145 # NCBI annotation: ORF032 # Family: family:all:296 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240391;genbank:gi:66396072;genbank:GeneID:5133472 Probab=90.91 E-value=0.018 Score=30.09 Aligned_cols=130 Identities=11% Similarity=-0.032 Sum_probs=68.4 Q ss_pred CchHHHHHHHHHHHHHHHHhhh--hcCCCcceecCCCCCCCCCCCccEEEEEEcCCCccceecCCCC-eEEEEEEEEEEE Q lcl|NC_016566. 1 MSNTAIRKALNSVVEELSVSLS--TSQRPIQVNWENVSGDHANGSGVYLEPYLLPAPTQFVGFQQKG-RIYAGVYQVAVV 77 (140) Q Consensus 1 Ms~~~Ir~~~e~~~a~l~~~a~--~~~~~lpva~pN~~F~pp~~~~~yLr~~~~pa~t~~~~l~~~~-~~~~G~~qv~v~ 77 (140) ||-..=..+-+++.++|...++ +..++ . .|.+++ .++ +.+|+- -+.++..+.+.+| ....-.|+|+|+ T Consensus 1 Ms~s~~~aLq~Ai~~~L~ada~l~alvgg-r-V~D~~P-~~~--~~PYv~----lG~~~~~d~~~~~~~g~~~~~ti~Vw 71 (145) T protein:vir:95 1 MWVSVERYLFNKVYNKLKSNPIIQKQLDG-R-VFDCVQ-KDA--VYPYIV----VGETNVTNKETTTSMVEDVGITLHVY 71 (145) T ss_pred CchhHHHHHHHHHHHHhhcCHhHHHhhcc-c-cccCCc-CCC--CCCEEE----ecCceeeecCCCcccceEEEEEEEEE Confidence 9954444444444556543321 11111 2 244443 111 223543 3445555555555 357788999988 Q ss_pred EecCCChHHHHHHHHHHHHhccCCccccCCeEEEEcCCccccCCccCCCeEEEEEEEEEEeeC Q lcl|NC_016566. 78 FPAGTGTQYASELADAIATSDKWQAVRISGAAFQLQDSPYTSSVVEDVDRARIVVTVPYTCCA 140 (140) Q Consensus 78 ~p~G~G~~~a~~~A~~ia~~F~~~~~~~~g~~v~v~~~p~~~~~~~~~~~~~iPVsi~yra~a 140 (140) . 
.+.|...+.++|+.|.+...+...+..+..+.+.-.-+..-..+++..+..-++|.+|.-- T Consensus 72 s-~~~g~~eak~ia~av~~aL~~~l~l~~~~lv~l~~~~~~~~rd~dg~~~hgvl~fra~ve~ 133 (145) T protein:vir:95 72 S-QARNRDEASQIIQFLGFVLNNEIEIDYYSFIKSRIDTQEVITDIDQYTKHGVIRLVFKYRH 133 (145) T ss_pred E-cCCCHHHHHHHHHHHHHHhccccCCCCCeEEEeEEeeeeEeecCCCceEEEEEEEEEEEEe Confidence 6 5668999999999998777543333334323322221122233455556666666666555 No 16 >protein:vir:94794 Length: 145 # NCBI annotation: ORF028 # Family: family:all:296 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240542;genbank:gi:66396219;genbank:GeneID:5133574 Probab=90.86 E-value=0.019 Score=30.06 Aligned_cols=130 Identities=10% Similarity=-0.033 Sum_probs=68.3 Q ss_pred CchHHHHHHHHHHHHHHHHhhh--hcCCCcceecCCCCCCCCCCCccEEEEEEcCCCccceecCCCC-eEEEEEEEEEEE Q lcl|NC_016566. 1 MSNTAIRKALNSVVEELSVSLS--TSQRPIQVNWENVSGDHANGSGVYLEPYLLPAPTQFVGFQQKG-RIYAGVYQVAVV 77 (140) Q Consensus 1 Ms~~~Ir~~~e~~~a~l~~~a~--~~~~~lpva~pN~~F~pp~~~~~yLr~~~~pa~t~~~~l~~~~-~~~~G~~qv~v~ 77 (140) ||-..=..+-+++.++|...++ +..++ . .|.+++ .++ +.+|+- -+.++..+.+.+| ....-.|+|+|+ T Consensus 1 Ms~s~~~aLq~Ai~~~L~ada~l~alvgg-r-V~D~~P-~~~--~~PYv~----lG~~~~~d~~~~~~~g~~~~~ti~Vw 71 (145) T protein:vir:94 1 MWVSVERYLFNKVYNKLKSNPIIQKQLDG-R-VFDCVQ-KDA--VYPYIV----VGETNVTNKETTTSMVEDVGITLHVY 71 (145) T ss_pred CchhHHHHHHHHHHHHhhcCHhHHHhhcc-c-cccCCc-CCC--CCCEEE----ecCceeeecCCCcccceEEEEEEEEE Confidence 9954444444444556543321 11111 2 244443 111 223543 3445555555555 357788999988 Q ss_pred EecCCChHHHHHHHHHHHHhccCCccccCCeEEEEcCCccccCCccCCCeEEEEEEEEEEeeC Q lcl|NC_016566. 78 FPAGTGTQYASELADAIATSDKWQAVRISGAAFQLQDSPYTSSVVEDVDRARIVVTVPYTCCA 140 (140) Q Consensus 78 ~p~G~G~~~a~~~A~~ia~~F~~~~~~~~g~~v~v~~~p~~~~~~~~~~~~~iPVsi~yra~a 140 (140) . 
.+.|...+.++|+.|.+...+...+..+..+.+.-.-+..-..+++..+..-++|.+|.-- T Consensus 72 s-~~~g~~eak~ia~av~~aL~~~l~l~~~~lv~l~~~~~~~~rd~dg~~~hgvl~fra~ve~ 133 (145) T protein:vir:94 72 S-QARNRDEASQIIQFLGFVLNNEIEIDYYSFIKSRIDTQEVITDIDQYTKHGIIRLVFKYRH 133 (145) T ss_pred E-cCCCHHHHHHHHHHHHHHhccccCCCCCeEEEeEEeeeeEeecCCCceEEEEEEEEEEEEe Confidence 6 5668999999999998777543333334323322221122233455556666666666554 No 17 >protein:vir:97325 Length: 145 # NCBI annotation: ORF030 # Family: family:all:296 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240617;genbank:gi:66396297;genbank:GeneID:5133681 Probab=90.75 E-value=0.019 Score=29.99 Aligned_cols=130 Identities=11% Similarity=-0.017 Sum_probs=67.1 Q ss_pred CchHHHHHHHHHHHHHHHHhhh--hcCCCcceecCCCCCCCCCCCccEEEEEEcCCCccceecCCCC-eEEEEEEEEEEE Q lcl|NC_016566. 1 MSNTAIRKALNSVVEELSVSLS--TSQRPIQVNWENVSGDHANGSGVYLEPYLLPAPTQFVGFQQKG-RIYAGVYQVAVV 77 (140) Q Consensus 1 Ms~~~Ir~~~e~~~a~l~~~a~--~~~~~lpva~pN~~F~pp~~~~~yLr~~~~pa~t~~~~l~~~~-~~~~G~~qv~v~ 77 (140) ||-..=..+-+++.++|...++ +..++ . .|.+++ .++ +.+|+- -+.++..+.+.+| ....-.++|+|+ T Consensus 1 Ms~s~~~aLq~Ai~~~L~ad~~l~alvgg-r-V~D~~P-~~a--~~PYv~----lG~~~~~d~~~~~~~g~~~~~ti~Vw 71 (145) T protein:vir:97 1 MWVSVERYLFNKVYNKLKSNLIIRKQLDG-R-VFDCVQ-KDA--VYPYIV----VGETNVTNKETTTSMVEDVGITLHVY 71 (145) T ss_pred CcchHhHHHHHHHHHHhhcChhHHHhhcC-c-eecCCc-cCC--CCCEEE----eCcceeeecCCCcccceEEEEEEEEE Confidence 9954444444444556644321 11111 2 244443 111 223543 3445555555555 357788999998 Q ss_pred EecCCChHHHHHHHHHHHHhccCCccccCCeEEEEcCCccccCCccCCCeEEEEEEEEEEeeC Q lcl|NC_016566. 78 FPAGTGTQYASELADAIATSDKWQAVRISGAAFQLQDSPYTSSVVEDVDRARIVVTVPYTCCA 140 (140) Q Consensus 78 ~p~G~G~~~a~~~A~~ia~~F~~~~~~~~g~~v~v~~~p~~~~~~~~~~~~~iPVsi~yra~a 140 (140) . 
.+.|...+.++|+.|.+...+...+..+..+.+.-.-+..-..+++..+..-++|.+|.-= T Consensus 72 s-~~~g~~eak~ia~av~~aL~~~l~l~~~~lv~l~~~~~~~~rd~dg~~~hgvl~fra~ve~ 133 (145) T protein:vir:97 72 S-QARNRDEASQIIQFLGFVLNNEIEIDYYSFIKSRIDTQEVITDIDQYTKHGIIRLVFKYRH 133 (145) T ss_pred E-cCCCHHHHHHHHHHHHHHhccccCCCCCeEEEeEEeeeeEeecCCCceEEEEEEEEEEEec Confidence 7 4678999999999998777544333334323322121112233444445444555555443 No 18 >protein:vir:105892 Length: 141 # NCBI annotation: tail protein # Family: family:all:296 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004380;genbank:gi:122891835;genbank:GeneID:4712363 Probab=90.00 E-value=0.023 Score=29.55 Aligned_cols=128 Identities=9% Similarity=-0.100 Sum_probs=63.6 Q ss_pred CchHHHHHHHHHHHHHHHHhhh--hcCCCcceecCCCCCCCCCC-CccEEEEEEcCCCccceecCCCC-eEEEEEEEEEE Q lcl|NC_016566. 1 MSNTAIRKALNSVVEELSVSLS--TSQRPIQVNWENVSGDHANG-SGVYLEPYLLPAPTQFVGFQQKG-RIYAGVYQVAV 76 (140) Q Consensus 1 Ms~~~Ir~~~e~~~a~l~~~a~--~~~~~lpva~pN~~F~pp~~-~~~yLr~~~~pa~t~~~~l~~~~-~~~~G~~qv~v 76 (140) ||-..-..+-.++.++|..-++ +..++ +| |.+++ .+ +.+| ..-+..+..+.+.+| ....-.++|+| T Consensus 1 Msms~~~aLQ~Ai~~~L~adaal~alvg~-rI-~D~~P----~~~~~PY----v~lG~~~~~~~~~~~~~g~~~~~ti~V 70 (141) T protein:vir:10 1 MWVSVEPELTNQIYKRLISDPNINKLVDD-RV-FDVVQ----DDAVYPY----IVVGESNVTNNESSATMRETVGIVIHV 70 (141) T ss_pred CccchhHHHHHHHHHHhhcChhhHhhcCC-cc-ccCCc----cCCCCCE----EEeCCceeeecCCCcccceEEEEEEEE Confidence 7732222233333345533211 11111 22 44433 22 2234 334555556666555 35778899999 Q ss_pred EEecCCChHHHHHHHHHHHHhccCCccccCCeEEEEcC-CccccCCccCCCeEEEEEEEEEEeeC Q lcl|NC_016566. 77 VFPAGTGTQYASELADAIATSDKWQAVRISGAAFQLQD-SPYTSSVVEDVDRARIVVTVPYTCCA 140 (140) Q Consensus 77 ~~p~G~G~~~a~~~A~~ia~~F~~~~~~~~g~~v~v~~-~p~~~~~~~~~~~~~iPVsi~yra~a 140 (140) +. .+.|-..+.++|+.|.+...+...+..+..+...- .-.+. 
..+++..+.--++|.+|.-= T Consensus 71 ws-~~~g~~eak~ia~av~~AL~~~l~l~~~~lv~l~~~~~~~~-rd~dg~t~hgvl~~ra~v~~ 133 (141) T protein:vir:10 71 YS-QFATQYEAKLILSAIGYVLNRPIEIDNYEFQFSRIDSQAVF-PDIDRFTKHGTIRLLFKYRH 133 (141) T ss_pred EE-cCCCHHHHHHHHHHHHHHhcccccCCCceEEEEEEeeeeee-ecCCCceEEEEEEEEEEEEe Confidence 86 67799999999999987775332333332222221 11222 22344444444666555444 No 19 >protein:vir:96260 Length: 141 # NCBI annotation: ORF027 # Family: family:all:296 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240317;genbank:gi:66395991;genbank:GeneID:5133337 Probab=90.00 E-value=0.023 Score=29.55 Aligned_cols=128 Identities=9% Similarity=-0.100 Sum_probs=63.6 Q ss_pred CchHHHHHHHHHHHHHHHHhhh--hcCCCcceecCCCCCCCCCC-CccEEEEEEcCCCccceecCCCC-eEEEEEEEEEE Q lcl|NC_016566. 1 MSNTAIRKALNSVVEELSVSLS--TSQRPIQVNWENVSGDHANG-SGVYLEPYLLPAPTQFVGFQQKG-RIYAGVYQVAV 76 (140) Q Consensus 1 Ms~~~Ir~~~e~~~a~l~~~a~--~~~~~lpva~pN~~F~pp~~-~~~yLr~~~~pa~t~~~~l~~~~-~~~~G~~qv~v 76 (140) ||-..-..+-.++.++|..-++ +..++ +| |.+++ .+ +.+| ..-+..+..+.+.+| ....-.++|+| T Consensus 1 Msms~~~aLQ~Ai~~~L~adaal~alvg~-rI-~D~~P----~~~~~PY----v~lG~~~~~~~~~~~~~g~~~~~ti~V 70 (141) T protein:vir:96 1 MWVSVEPELTNQIYKRLISDPNINKLVDD-RV-FDVVQ----DDAVYPY----IVVGESNVTNNESSATMRETVGIVIHV 70 (141) T ss_pred CccchhHHHHHHHHHHhhcChhhHhhcCC-cc-ccCCc----cCCCCCE----EEeCCceeeecCCCcccceEEEEEEEE Confidence 7732222233333345533211 11111 22 44433 22 2234 334555556666555 35778899999 Q ss_pred EEecCCChHHHHHHHHHHHHhccCCccccCCeEEEEcC-CccccCCccCCCeEEEEEEEEEEeeC Q lcl|NC_016566. 77 VFPAGTGTQYASELADAIATSDKWQAVRISGAAFQLQD-SPYTSSVVEDVDRARIVVTVPYTCCA 140 (140) Q Consensus 77 ~~p~G~G~~~a~~~A~~ia~~F~~~~~~~~g~~v~v~~-~p~~~~~~~~~~~~~iPVsi~yra~a 140 (140) +. .+.|-..+.++|+.|.+...+...+..+..+...- .-.+. 
..+++..+.--++|.+|.-= T Consensus 71 ws-~~~g~~eak~ia~av~~AL~~~l~l~~~~lv~l~~~~~~~~-rd~dg~t~hgvl~~ra~v~~ 133 (141) T protein:vir:96 71 YS-QFATQYEAKLILSAIGYVLNRPIEIDNYEFQFSRIDSQAVF-PDIDRFTKHGTIRLLFKYRH 133 (141) T ss_pred EE-cCCCHHHHHHHHHHHHHHhcccccCCCceEEEEEEeeeeee-ecCCCceEEEEEEEEEEEEe Confidence 86 67799999999999987775332333332222221 11222 22344444444666555444 No 20 >protein:vir:94096 Length: 141 # NCBI annotation: ORF031 # Family: family:all:296 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240240;genbank:gi:66395916;genbank:GeneID:5133265 Probab=90.00 E-value=0.023 Score=29.55 Aligned_cols=128 Identities=9% Similarity=-0.100 Sum_probs=63.6 Q ss_pred CchHHHHHHHHHHHHHHHHhhh--hcCCCcceecCCCCCCCCCC-CccEEEEEEcCCCccceecCCCC-eEEEEEEEEEE Q lcl|NC_016566. 1 MSNTAIRKALNSVVEELSVSLS--TSQRPIQVNWENVSGDHANG-SGVYLEPYLLPAPTQFVGFQQKG-RIYAGVYQVAV 76 (140) Q Consensus 1 Ms~~~Ir~~~e~~~a~l~~~a~--~~~~~lpva~pN~~F~pp~~-~~~yLr~~~~pa~t~~~~l~~~~-~~~~G~~qv~v 76 (140) ||-..-..+-.++.++|..-++ +..++ +| |.+++ .+ +.+| ..-+..+..+.+.+| ....-.++|+| T Consensus 1 Msms~~~aLQ~Ai~~~L~adaal~alvg~-rI-~D~~P----~~~~~PY----v~lG~~~~~~~~~~~~~g~~~~~ti~V 70 (141) T protein:vir:94 1 MWVSVEPELTNQIYKRLISDPNINKLVDD-RV-FDVVQ----DDAVYPY----IVVGESNVTNNESSATMRETVGIVIHV 70 (141) T ss_pred CccchhHHHHHHHHHHhhcChhhHhhcCC-cc-ccCCc----cCCCCCE----EEeCCceeeecCCCcccceEEEEEEEE Confidence 7732222233333345533211 11111 22 44433 22 2234 334555556666555 35778899999 Q ss_pred EEecCCChHHHHHHHHHHHHhccCCccccCCeEEEEcC-CccccCCccCCCeEEEEEEEEEEeeC Q lcl|NC_016566. 77 VFPAGTGTQYASELADAIATSDKWQAVRISGAAFQLQD-SPYTSSVVEDVDRARIVVTVPYTCCA 140 (140) Q Consensus 77 ~~p~G~G~~~a~~~A~~ia~~F~~~~~~~~g~~v~v~~-~p~~~~~~~~~~~~~iPVsi~yra~a 140 (140) +. .+.|-..+.++|+.|.+...+...+..+..+...- .-.+. 
..+++..+.--++|.+|.-= T Consensus 71 ws-~~~g~~eak~ia~av~~AL~~~l~l~~~~lv~l~~~~~~~~-rd~dg~t~hgvl~~ra~v~~ 133 (141) T protein:vir:94 71 YS-QFATQYEAKLILSAIGYVLNRPIEIDNYEFQFSRIDSQAVF-PDIDRFTKHGTIRLLFKYRH 133 (141) T ss_pred EE-cCCCHHHHHHHHHHHHHHhcccccCCCceEEEEEEeeeeee-ecCCCceEEEEEEEEEEEEe Confidence 86 67799999999999987775332333332222221 11222 22344444444666555444 No 21 >protein:vir:10368 Length: 118 # NCBI annotation: conserved phage protein # Family: family:all:3244 # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858960;genbank:gi:32128425;genbank:GeneID:2648389 Probab=89.47 E-value=0.026 Score=29.26 Aligned_cols=114 Identities=9% Similarity=-0.046 Sum_probs=61.4 Q ss_pred Cc-hHHHHHHHHHHHHHHHHhhhhcCCCcceecCCCCCCCCCCC-ccEEEEEEcCCCccceecCCC-CeEEEEEEEEEEE Q lcl|NC_016566. 1 MS-NTAIRKALNSVVEELSVSLSTSQRPIQVNWENVSGDHANGS-GVYLEPYLLPAPTQFVGFQQK-GRIYAGVYQVAVV 77 (140) Q Consensus 1 Ms-~~~Ir~~~e~~~a~l~~~a~~~~~~lpva~pN~~F~pp~~~-~~yLr~~~~pa~t~~~~l~~~-~~~~~G~~qv~v~ 77 (140) || ++.|+++|. .+. +- =.||++. |...+ .+|+-...+-+.. ...|+|. +......+||+|+ T Consensus 1 Ms~e~~l~a~L~----~~~--------~~-RVyp~~a--P~~~~~~Pyiv~q~vsg~p-~~~l~G~~~~~~~~rvQIdvy 64 (118) T protein:vir:10 1 MSYGRVLKDLLD----PVF--------SG-RVYADIP--PDSPPLDAYAIYQRVGGVP-VYWQEGGMPEKVNARVQIQIW 64 (118) T ss_pred CchHHHHHHHHh----hhc--------CC-ccccccC--CCCCCcCCEEEEEecCCcc-cccccCCCCccceeEEEEEEe Confidence 99 666665554 332 11 2466554 33323 2588888777754 4558765 5566788999999 Q ss_pred EecCCChHHHHHHHHHHHHhccCCccccCCeEEEEcCCccccCCccCCCeEEEEEEEE-EEeeC Q lcl|NC_016566. 78 FPAGTGTQYASELADAIATSDKWQAVRISGAAFQLQDSPYTSSVVEDVDRARIVVTVP-YTCCA 140 (140) Q Consensus 78 ~p~G~G~~~a~~~A~~ia~~F~~~~~~~~g~~v~v~~~p~~~~~~~~~~~~~iPVsi~-yra~a 140 (140) .. --.+|.+++++|++...+..... -+..+.... .++...|...+-|. 
|-+-- T Consensus 65 A~---t~~~A~~l~~av~~al~~~~~~~-----~~~~~~d~y--e~dt~l~r~~~Df~vw~~~~ 118 (118) T protein:vir:10 65 SR---SKQEAYLATVQVLRLVSEANDMQ-----VLSQPIDDY--VREIKLYGSRVDISMWYNLT 118 (118) T ss_pred eC---CHHHHHHHHHHHHHHhhhcccce-----eccCCCccc--cccCCceEEEEEEEEeeecC Confidence 75 45788888888877665332111 111222222 22334444443333 32222 No 22 >protein:vir:97421 Length: 145 # NCBI annotation: ORF030 # Family: family:all:296 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240755;genbank:gi:66396436;genbank:GeneID:5133777 Probab=89.26 E-value=0.027 Score=29.15 Aligned_cols=130 Identities=11% Similarity=-0.017 Sum_probs=68.2 Q ss_pred CchHHHHHHHHHHHHHHHHhhh--hcCCCcceecCCCCCCCCCCCccEEEEEEcCCCccceecCCCC-eEEEEEEEEEEE Q lcl|NC_016566. 1 MSNTAIRKALNSVVEELSVSLS--TSQRPIQVNWENVSGDHANGSGVYLEPYLLPAPTQFVGFQQKG-RIYAGVYQVAVV 77 (140) Q Consensus 1 Ms~~~Ir~~~e~~~a~l~~~a~--~~~~~lpva~pN~~F~pp~~~~~yLr~~~~pa~t~~~~l~~~~-~~~~G~~qv~v~ 77 (140) ||-..-..+-+++.++|...++ +..++ . .|.+++ .++ +.+|+- -+.++..+.+.+| ....-.++|+|+ T Consensus 1 Ms~s~~~aLq~Ai~~~L~ada~l~alvgg-r-I~D~~P-~~a--~~PYV~----lG~~~~~d~~~~~~~g~~~~~ti~Vw 71 (145) T protein:vir:97 1 MWVSVERYLFNKVYNKLKSNLIIQKQLDG-R-VFDCVQ-KDA--VYPYIV----VGETNVTNKETTTSMVEDVGITLHVY 71 (145) T ss_pred CchhHHHHHHHHHHHHhhcChhHHHhhcC-c-eecCCc-CCC--CCCEEE----eCCceeeecCCCcccceEEEEEEEEE Confidence 9954444444444556643321 11111 2 244443 112 223543 3444555555554 357788999988 Q ss_pred EecCCChHHHHHHHHHHHHhccCCccccCCeEEEEcCCccccCCccCCCeEEEEEEEEEEeeC Q lcl|NC_016566. 78 FPAGTGTQYASELADAIATSDKWQAVRISGAAFQLQDSPYTSSVVEDVDRARIVVTVPYTCCA 140 (140) Q Consensus 78 ~p~G~G~~~a~~~A~~ia~~F~~~~~~~~g~~v~v~~~p~~~~~~~~~~~~~iPVsi~yra~a 140 (140) . 
.+.|...+.++|+.|.+...+...+..+..+.+.-.-+..-..+++..+..-++|.+|.-- T Consensus 72 s-~~~g~~eak~ia~av~~aL~~~l~l~~~~lv~l~~~~~~~~rd~dg~~~hgvl~fra~ve~ 133 (145) T protein:vir:97 72 S-QARNRDEASQIIQFLGFVLNNEIEIDYYSFIKSRIDTQEVITDIDQYTKHGIIRLVFKYRH 133 (145) T ss_pred E-cCCCHHHHHHHHHHHHHHhccccCCCCCeEEEeEEeeeeEeecCCcceEEEEEEEEEEEEe Confidence 6 4678999999999998776554344444322322221122233455555555666665554 No 23 >protein:vir:93736 Length: 145 # NCBI annotation: ORF030 # Family: family:all:296 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240465;genbank:gi:66396143;genbank:GeneID:5133505 Probab=89.26 E-value=0.027 Score=29.15 Aligned_cols=130 Identities=11% Similarity=-0.017 Sum_probs=68.2 Q ss_pred CchHHHHHHHHHHHHHHHHhhh--hcCCCcceecCCCCCCCCCCCccEEEEEEcCCCccceecCCCC-eEEEEEEEEEEE Q lcl|NC_016566. 1 MSNTAIRKALNSVVEELSVSLS--TSQRPIQVNWENVSGDHANGSGVYLEPYLLPAPTQFVGFQQKG-RIYAGVYQVAVV 77 (140) Q Consensus 1 Ms~~~Ir~~~e~~~a~l~~~a~--~~~~~lpva~pN~~F~pp~~~~~yLr~~~~pa~t~~~~l~~~~-~~~~G~~qv~v~ 77 (140) ||-..-..+-+++.++|...++ +..++ . .|.+++ .++ +.+|+- -+.++..+.+.+| ....-.++|+|+ T Consensus 1 Ms~s~~~aLq~Ai~~~L~ada~l~alvgg-r-I~D~~P-~~a--~~PYV~----lG~~~~~d~~~~~~~g~~~~~ti~Vw 71 (145) T protein:vir:93 1 MWVSVERYLFNKVYNKLKSNLIIQKQLDG-R-VFDCVQ-KDA--VYPYIV----VGETNVTNKETTTSMVEDVGITLHVY 71 (145) T ss_pred CchhHHHHHHHHHHHHhhcChhHHHhhcC-c-eecCCc-CCC--CCCEEE----eCCceeeecCCCcccceEEEEEEEEE Confidence 9954444444444556643321 11111 2 244443 112 223543 3444555555554 357788999988 Q ss_pred EecCCChHHHHHHHHHHHHhccCCccccCCeEEEEcCCccccCCccCCCeEEEEEEEEEEeeC Q lcl|NC_016566. 78 FPAGTGTQYASELADAIATSDKWQAVRISGAAFQLQDSPYTSSVVEDVDRARIVVTVPYTCCA 140 (140) Q Consensus 78 ~p~G~G~~~a~~~A~~ia~~F~~~~~~~~g~~v~v~~~p~~~~~~~~~~~~~iPVsi~yra~a 140 (140) . 
.+.|...+.++|+.|.+...+...+..+..+.+.-.-+..-..+++..+..-++|.+|.-- T Consensus 72 s-~~~g~~eak~ia~av~~aL~~~l~l~~~~lv~l~~~~~~~~rd~dg~~~hgvl~fra~ve~ 133 (145) T protein:vir:93 72 S-QARNRDEASQIIQFLGFVLNNEIEIDYYSFIKSRIDTQEVITDIDQYTKHGIIRLVFKYRH 133 (145) T ss_pred E-cCCCHHHHHHHHHHHHHHhccccCCCCCeEEEeEEeeeeEeecCCcceEEEEEEEEEEEEe Confidence 6 4678999999999998776554344444322322221122233455555555666665554 No 24 >protein:vir:94488 Length: 145 # NCBI annotation: ORF032 # Family: family:all:296 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240682;genbank:gi:66396364;genbank:GeneID:5133752 Probab=89.26 E-value=0.027 Score=29.15 Aligned_cols=130 Identities=11% Similarity=-0.017 Sum_probs=68.2 Q ss_pred CchHHHHHHHHHHHHHHHHhhh--hcCCCcceecCCCCCCCCCCCccEEEEEEcCCCccceecCCCC-eEEEEEEEEEEE Q lcl|NC_016566. 1 MSNTAIRKALNSVVEELSVSLS--TSQRPIQVNWENVSGDHANGSGVYLEPYLLPAPTQFVGFQQKG-RIYAGVYQVAVV 77 (140) Q Consensus 1 Ms~~~Ir~~~e~~~a~l~~~a~--~~~~~lpva~pN~~F~pp~~~~~yLr~~~~pa~t~~~~l~~~~-~~~~G~~qv~v~ 77 (140) ||-..-..+-+++.++|...++ +..++ . .|.+++ .++ +.+|+- -+.++..+.+.+| ....-.++|+|+ T Consensus 1 Ms~s~~~aLq~Ai~~~L~ada~l~alvgg-r-I~D~~P-~~a--~~PYV~----lG~~~~~d~~~~~~~g~~~~~ti~Vw 71 (145) T protein:vir:94 1 MWVSVERYLFNKVYNKLKSNLIIQKQLDG-R-VFDCVQ-KDA--VYPYIV----VGETNVTNKETTTSMVEDVGITLHVY 71 (145) T ss_pred CchhHHHHHHHHHHHHhhcChhHHHhhcC-c-eecCCc-CCC--CCCEEE----eCCceeeecCCCcccceEEEEEEEEE Confidence 9954444444444556643321 11111 2 244443 112 223543 3444555555554 357788999988 Q ss_pred EecCCChHHHHHHHHHHHHhccCCccccCCeEEEEcCCccccCCccCCCeEEEEEEEEEEeeC Q lcl|NC_016566. 78 FPAGTGTQYASELADAIATSDKWQAVRISGAAFQLQDSPYTSSVVEDVDRARIVVTVPYTCCA 140 (140) Q Consensus 78 ~p~G~G~~~a~~~A~~ia~~F~~~~~~~~g~~v~v~~~p~~~~~~~~~~~~~iPVsi~yra~a 140 (140) . 
.+.|...+.++|+.|.+...+...+..+..+.+.-.-+..-..+++..+..-++|.+|.-- T Consensus 72 s-~~~g~~eak~ia~av~~aL~~~l~l~~~~lv~l~~~~~~~~rd~dg~~~hgvl~fra~ve~ 133 (145) T protein:vir:94 72 S-QARNRDEASQIIQFLGFVLNNEIEIDYYSFIKSRIDTQEVITDIDQYTKHGIIRLVFKYRH 133 (145) T ss_pred E-cCCCHHHHHHHHHHHHHHhccccCCCCCeEEEeEEeeeeEeecCCcceEEEEEEEEEEEEe Confidence 6 4678999999999998776554344444322322221122233455555555666665554 No 25 >protein:vir:96894 Length: 140 # NCBI annotation: ORF029 # Family: family:all:296 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240162;genbank:gi:66395835;genbank:GeneID:5133235 Probab=89.25 E-value=0.027 Score=29.15 Aligned_cols=130 Identities=9% Similarity=-0.098 Sum_probs=63.2 Q ss_pred CchHHHHHHHHHHHHHHHHhh--hhcCCCcceecCCCCCCCCCCCccEEEEEEcCCCccceecCCCC-eEEEEEEEEEEE Q lcl|NC_016566. 1 MSNTAIRKALNSVVEELSVSL--STSQRPIQVNWENVSGDHANGSGVYLEPYLLPAPTQFVGFQQKG-RIYAGVYQVAVV 77 (140) Q Consensus 1 Ms~~~Ir~~~e~~~a~l~~~a--~~~~~~lpva~pN~~F~pp~~~~~yLr~~~~pa~t~~~~l~~~~-~~~~G~~qv~v~ 77 (140) ||-.--..+-+++.++|...+ .+..++ +| |.+.+ .++ +.+|+. -+..+..+.+.+| ....-.++|+|+ T Consensus 1 Msms~~~aLq~Ai~a~L~ada~l~alvg~-~V-yD~~P-~~~--~~Pyv~----lG~~~~~~~~~~~~~g~~~~~~i~Vw 71 (140) T protein:vir:96 1 MWVSVEPELTVQIYKRLKASPIINKFVGD-RV-FDVVQ-EDA--VYPYIV----VGESNVTNNESSTMMRETVGIVIHVY 71 (140) T ss_pred CCccHHHHHHHHHHHHhhcChhHHHhcCC-cc-ccCCc-cCC--CCCEEE----ecCceeeecCCCcccceEEEEEEEEE Confidence 772222233333344554332 111111 22 44433 111 223543 3555555566555 356788999988 Q ss_pred EecCCChHHHHHHHHHHHHhccCCccccCCeEEEEcCCccccCCccCCCeEEEEEEEEEEeeC Q lcl|NC_016566. 78 FPAGTGTQYASELADAIATSDKWQAVRISGAAFQLQDSPYTSSVVEDVDRARIVVTVPYTCCA 140 (140) Q Consensus 78 ~p~G~G~~~a~~~A~~ia~~F~~~~~~~~g~~v~v~~~p~~~~~~~~~~~~~iPVsi~yra~a 140 (140) . 
.+.|-..+.++|+.|.+.......+..+..+.+.-.-...-..+++..+.--++|.+|+-- T Consensus 72 s-~~~g~~ea~~ia~av~~AL~~~l~l~~~~lv~l~~~~~~~~rd~dg~~~hgvl~~r~~v~~ 133 (140) T protein:vir:96 72 S-QFATQYEAKQIISAIGYVLNRPIDIENYEFQFSRIDSQSVFPDIDRFTKHGTIRLLFKYRH 133 (140) T ss_pred E-cCCCHHHHHHHHHHHHHHhCCCccCCCCeEEEEEEeeeEEEecCCCceEEEEEEEEEEEEe Confidence 6 5668899999999997766532223333222322111111223444444444555555544 No 26 >protein:vir:95111 Length: 145 # NCBI annotation: ORF030 # Family: family:all:296 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240829;genbank:gi:66394699;genbank:GeneID:5133905 Probab=89.18 E-value=0.028 Score=29.11 Aligned_cols=130 Identities=10% Similarity=-0.034 Sum_probs=68.6 Q ss_pred CchHHHHHHHHHHHHHHHHhhh--hcCCCcceecCCCCCCCCCCCccEEEEEEcCCCccceecCCCC-eEEEEEEEEEEE Q lcl|NC_016566. 1 MSNTAIRKALNSVVEELSVSLS--TSQRPIQVNWENVSGDHANGSGVYLEPYLLPAPTQFVGFQQKG-RIYAGVYQVAVV 77 (140) Q Consensus 1 Ms~~~Ir~~~e~~~a~l~~~a~--~~~~~lpva~pN~~F~pp~~~~~yLr~~~~pa~t~~~~l~~~~-~~~~G~~qv~v~ 77 (140) ||-..-..+-+++.++|...++ +..++ . .|.+++ .++ +.+|+- -+.++..+.+.+| ....-.++|+|+ T Consensus 1 Ms~s~~~aLq~Ai~~~L~ada~l~alvgg-r-V~D~~P-~~a--~~PYV~----lG~~~~~~~~~~~~~g~~~~~ti~Vw 71 (145) T protein:vir:95 1 MWVSVERYLFNKVYNKLKSNSIIQKQLDG-R-VFDCVQ-KDA--VYPYIV----VGETNVTNKETTTSMVEDVGITLHVY 71 (145) T ss_pred CchhHHHHHHHHHHHHhhcChhHHHhhcC-c-eecCCc-CCC--CCCEEE----ecCceeeecCCCcccceEEEEEEEEE Confidence 9954444444444556644321 11111 2 244443 112 223543 3444455555554 356788999988 Q ss_pred EecCCChHHHHHHHHHHHHhccCCccccCCeEEEEcCCccccCCccCCCeEEEEEEEEEEeeC Q lcl|NC_016566. 78 FPAGTGTQYASELADAIATSDKWQAVRISGAAFQLQDSPYTSSVVEDVDRARIVVTVPYTCCA 140 (140) Q Consensus 78 ~p~G~G~~~a~~~A~~ia~~F~~~~~~~~g~~v~v~~~p~~~~~~~~~~~~~iPVsi~yra~a 140 (140) . 
.+.|...+.++|+.|.+...+...+..+..+.+.-.-+..-..+++..+..-++|.+|.-- T Consensus 72 s-~~~g~~eak~ia~av~~aL~~~l~l~~~~lv~l~~~~~~~~rd~dg~~~hgvl~~ra~ve~ 133 (145) T protein:vir:95 72 S-QARNRDEASQIIQFLGFVLNNEIEIDYYSFIKSRIDTQEVITDIDRYTKHGIIRLVFKYRH 133 (145) T ss_pred E-cCCCHHHHHHHHHHHHHHhccccCCCCCeEEEeEEeeeeEeecCCCceEEEEEEEEEEEEe Confidence 6 5679999999999998777643333344323322221222233555566666666666554 No 27 >protein:vir:1438 Length: 115 # NCBI annotation: hypothetical protein # Family: family:all:2712 # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536367;genbank:gi:17975172;genbank:GeneID:929144 Probab=88.93 E-value=0.0092 Score=31.75 Aligned_cols=114 Identities=18% Similarity=0.165 Sum_probs=65.0 Q ss_pred CchHHHHHHHHHHHHHHHHhhhhcCCCcceecCCCCCCCCCCCccEEEEEEcCCCccceecCCCCeEEEEEEEEEEEEec Q lcl|NC_016566. 1 MSNTAIRKALNSVVEELSVSLSTSQRPIQVNWENVSGDHANGSGVYLEPYLLPAPTQFVGFQQKGRIYAGVYQVAVVFPA 80 (140) Q Consensus 1 Ms~~~Ir~~~e~~~a~l~~~a~~~~~~lpva~pN~~F~pp~~~~~yLr~~~~pa~t~~~~l~~~~~~~~G~~qv~v~~p~ 80 (140) |+.-.|+++|..+.. .-.||++. |-..+.+|+-...+-+.. ...|+|.+....+.+||+|+... T Consensus 1 ~~~~~i~~aL~~l~~-------------~RVyp~~a--P~~~~~Pyiv~q~vsg~p-~~~L~G~~~~~~~~vQIDvyA~t 64 (115) T protein:vir:14 1 MSVIVIRDALQGIGG-------------AKGYLGVA--PAKAPAPYFVVTRVHGAL-DMALAGLTGGRSGSYQIDCYAPT 64 (115) T ss_pred CeeEeeehhhccccc-------------cccccccC--CCCCCCCEEEEEeecCcc-cccccCCCCCcceEEEEEEeeCC Confidence 999889988764211 13456664 322233477666665544 44888887778999999999754 Q ss_pred CCChHHHHHHHHHHHHhccCCccccCCeEEE-EcCCccccCCccCCCeEEEEEEEEE Q lcl|NC_016566. 81 GTGTQYASELADAIATSDKWQAVRISGAAFQ-LQDSPYTSSVVEDVDRARIVVTVPY 136 (140) Q Consensus 81 G~G~~~a~~~A~~ia~~F~~~~~~~~g~~v~-v~~~p~~~~~~~~~~~~~iPVsi~y 136 (140) -.+|+++++++.+.-.+.. ..+.+. 
+++.+......+.--+..+=++|=| T Consensus 65 ---~~~A~~l~~~v~~~~~~~~---~~~~~~~~~~~~d~ye~dt~lyR~s~D~~vWf 115 (115) T protein:vir:14 65 ---FTDADRLADLAVDRAMSVQ---DRFSVGGVDELPDDYSEDTGLFRISLELSVEF 115 (115) T ss_pred ---HHHHHHHHHHHHHHHhcCc---cceeeeeecCCCCCCcccccceeeEEEEEEeC Confidence 5677778888765332211 112222 2333333333333234555555556 No 28 >protein:vir:97070 Length: 118 # NCBI annotation: hypothetical protein # Family: family:all:3244 # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453569;genbank:gi:84662604;genbank:GeneID:5142485 Probab=88.25 E-value=0.033 Score=28.67 Aligned_cols=113 Identities=11% Similarity=-0.017 Sum_probs=60.2 Q ss_pred CchH-HHHHHHHHHHHHHHHhhhhcCCCcceecCCCCCCCCCCC-ccEEEEEEcCCCccceecCCC-CeEEEEEEEEEEE Q lcl|NC_016566. 1 MSNT-AIRKALNSVVEELSVSLSTSQRPIQVNWENVSGDHANGS-GVYLEPYLLPAPTQFVGFQQK-GRIYAGVYQVAVV 77 (140) Q Consensus 1 Ms~~-~Ir~~~e~~~a~l~~~a~~~~~~lpva~pN~~F~pp~~~-~~yLr~~~~pa~t~~~~l~~~-~~~~~G~~qv~v~ 77 (140) ||.. .|+++| +.+. +- =.||++. |...+ .+|+-...+.+... ..|.|. +......+||+|+ T Consensus 1 M~~e~~l~a~L----~~~~--------~~-Rvyp~~a--P~~~~~~Pyiv~q~vsg~p~-~~ldG~~~~~~~~rvQIdvy 64 (118) T protein:vir:97 1 MSYGRMLKDLL----DPVF--------SG-RVYADIP--PDSPPLDAYAIYQRVGGVPV-YWKEGGMPDKVNARVQVQIW 64 (118) T ss_pred CchHHHHHHHH----hhhc--------CC-ccccccC--CCCCCcCCEEEEEecCCccc-ccccCCCCCccceeEEEEEe Confidence 9954 333333 3332 11 2466654 32223 25887777777555 447664 5566788999999 Q ss_pred EecCCChHHHHHHHHHHHHhccCCccccCCeEEEEcCCccccCCccCCC--eEEEEEEEEEEee Q lcl|NC_016566. 78 FPAGTGTQYASELADAIATSDKWQAVRISGAAFQLQDSPYTSSVVEDVD--RARIVVTVPYTCC 139 (140) Q Consensus 78 ~p~G~G~~~a~~~A~~ia~~F~~~~~~~~g~~v~v~~~p~~~~~~~~~~--~~~iPVsi~yra~ 139 (140) .. .-.+|.+++++|++...+..... . +..+.+.. .++.. 
+..+=++|-|+-- T Consensus 65 A~---t~~~A~~l~~av~~al~~~~~~~----~-~~~~~~~y--e~dt~lyr~~~Df~iw~~~~ 118 (118) T protein:vir:97 65 SR---SKQEAYLATVQVLRIVSEANDMQ----V-LSQPIDDY--VRELKLYGSRVDISMWYNLT 118 (118) T ss_pred eC---CHHHHHHHHHHHHHHhhcccccc----c-ccCCcccc--cccCCceEEEEEEEEEeecC Confidence 75 45688888888877665432111 0 11121112 22223 4444555555554 No 29 >protein:vir:3428 Length: 131 # NCBI annotation: tail component # Family: family:all:911 # MgeID: mge:70 # MgeName: lambda # Cross-refs: genbank:acc:NP_040591;genbank:gi:9626255;genbank:GeneID:2703486 Probab=88.15 E-value=0.034 Score=28.63 Aligned_cols=124 Identities=10% Similarity=-0.002 Sum_probs=68.1 Q ss_pred CchHHHHHHHHHHHHHHHHhhhhcCCCcceecCCC--CCCCCCCCccEEEEEEcCCCccceecCCCCeEEEEEEEEEEEE Q lcl|NC_016566. 1 MSNTAIRKALNSVVEELSVSLSTSQRPIQVNWENV--SGDHANGSGVYLEPYLLPAPTQFVGFQQKGRIYAGVYQVAVVF 78 (140) Q Consensus 1 Ms~~~Ir~~~e~~~a~l~~~a~~~~~~lpva~pN~--~F~pp~~~~~yLr~~~~pa~t~~~~l~~~~~~~~G~~qv~v~~ 78 (140) |+|++||+++-. +|..- .++ |.|-|- .|...+.- +=..|++-.+.....+|.++ ..+..|.|.||- T Consensus 1 ~~ht~IR~~Vid---~L~~~----l~~--v~~fdG~P~fide~El-PAVAV~l~d~~~~~~~ld~~--~w~A~LhI~iyL 68 (131) T protein:vir:34 1 MKHTELRAAVLD---ALEKH----DTG--ATFFDGRPAVFDEADF-PAVAVYLTGAEYTGEELDSD--TWQAELHIEVFL 68 (131) T ss_pred CchHHHHHHHHH---HHhcc----CCc--eEEecCCceeeccccC-cEEEEEeecCCCCcceecCC--eeEEEEEEEEEe Confidence 999999977654 44321 122 334332 23333322 35577777777777777644 789999999999 Q ss_pred ecCCChHHHHHHHHHHH-HhccCCccccCCe--EEEEcCCccccCCccCCCeEEE--EEEEEEEe Q lcl|NC_016566. 79 PAGTGTQYASELADAIA-TSDKWQAVRISGA--AFQLQDSPYTSSVVEDVDRARI--VVTVPYTC 138 (140) Q Consensus 79 p~G~G~~~a~~~A~~ia-~~F~~~~~~~~g~--~v~v~~~p~~~~~~~~~~~~~i--PVsi~yra 138 (140) |+..+..+-.++|.++. .-...-+.+. ++ .+...+- +=.+-.+..+|... 
--+|+|.- T Consensus 69 ka~~~ds~LD~~~E~~i~~v~~~~~~l~-~l~~~~~~~gy-~Y~rD~e~~tW~sadL~y~ItY~~ 131 (131) T protein:vir:34 69 PAQVPDSELDAWMESRIYPVMSDIPALS-DLITSMVASGY-DYRRDDDAGLWSSADLTYVITYEM 131 (131) T ss_pred ecCCCHHHHHHHHHHHhHHHhhcchhhh-hHhhhhhhccC-CcccccccceEEEEEEEEEEEEeC Confidence 99999999999999853 3322211111 11 0000000 01111222245543 34566666 No 30 >protein:vir:96800 Length: 127 # NCBI annotation: hypothetical protein # Family: family:all:32155 # MgeID: mge:1629 # MgeName: phiHSIC # Cross-refs: genbank:acc:YP_224254;genbank:gi:62362389;genbank:GeneID:3345739 Probab=87.91 E-value=0.011 Score=31.43 Aligned_cols=126 Identities=16% Similarity=0.168 Sum_probs=91.0 Q ss_pred CchHHHHHHHHHHHHHHHHhhhhcCCCcceecCCCCCCCCCCCccEEEEEEcCCCccceecCCCCeEEEEEEEEEEEEec Q lcl|NC_016566. 1 MSNTAIRKALNSVVEELSVSLSTSQRPIQVNWENVSGDHANGSGVYLEPYLLPAPTQFVGFQQKGRIYAGVYQVAVVFPA 80 (140) Q Consensus 1 Ms~~~Ir~~~e~~~a~l~~~a~~~~~~lpva~pN~~F~pp~~~~~yLr~~~~pa~t~~~~l~~~~~~~~G~~qv~v~~p~ 80 (140) |+.-.--++||. .|. ++ ..+|||-..+ |..| +-+|+.+..++.+-..|....++.+|-|-|.|-... T Consensus 1 mtfldavkafed---dlk----ak-vnipvanksi---ptdg--vsmrvalnnadadglflnsgarvmtgqfnveisael 67 (127) T protein:vir:96 1 MTFLDAVKAFED---DLK----AK-VNIPVANKSI---PTDG--VSMRVALNNADADGLFLNSGARVMTGQFNVEISAEL 67 (127) T ss_pred Cchhhhhhhhhh---ccc----ee-eecccccccc---CcCc--eEEEEEeccCCcceeEeecCceeeeeeeeeEEeecc Confidence 988777777775 332 11 2566764433 3433 588999999999999999888999999999999999 Q ss_pred CCChHHHHHHHHHHHHhc-cCCccccCCeEEEEcCCccccCCccCCCeEEEEEEEEEEeeC Q lcl|NC_016566. 81 GTGTQYASELADAIATSD-KWQAVRISGAAFQLQDSPYTSSVVEDVDRARIVVTVPYTCCA 140 (140) Q Consensus 81 G~G~~~a~~~A~~ia~~F-~~~~~~~~g~~v~v~~~p~~~~~~~~~~~~~iPVsi~yra~a 140 (140) |+..-+-..-|.++-+.+ +|++.....-.+.|.+. 
...-+.+.+.+-.|.|-|.|+--- T Consensus 68 gtnkyammaeankvlavyergysvpvldrrvlilqa-nqstpypteahqkinviidfqitk 127 (127) T protein:vir:96 68 GTNKYAMMAEANKVLAVYERGYSVPVLDRRVLILQA-NQSTPYPTEAHQKINVIIDFQITK 127 (127) T ss_pred CCceeeeeeccceeEEeeecCcccceecceEEEEEc-CCCCCCcccccceeeEEEEEEEcC Confidence 888777666677776666 47776655545555443 334567777788899999888777 No 31 >protein:vir:1244 Length: 145 # NCBI annotation: similar to phage Spp1 gp17 # Family: family:all:296 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510943;genbank:gi:17426277;genbank:GeneID:927402 Probab=86.64 E-value=0.044 Score=28.01 Aligned_cols=129 Identities=12% Similarity=-0.029 Sum_probs=64.5 Q ss_pred CchHHHHHHHHHHHHHHHHhhh--hcCCCcceecCCCCCCCCCCCccEEEEEEcCCCccceecCCCCe-EEEEEEEEEEE Q lcl|NC_016566. 1 MSNTAIRKALNSVVEELSVSLS--TSQRPIQVNWENVSGDHANGSGVYLEPYLLPAPTQFVGFQQKGR-IYAGVYQVAVV 77 (140) Q Consensus 1 Ms~~~Ir~~~e~~~a~l~~~a~--~~~~~lpva~pN~~F~pp~~~~~yLr~~~~pa~t~~~~l~~~~~-~~~G~~qv~v~ 77 (140) ||-+.=..+.++++++|...++ +..+ -+ .|++.+ .++. .+|+. -+.++..+.+.++. ...-.++|+|+ T Consensus 1 M~~s~~~aLq~ai~~~L~ad~~l~~lvg-~~-vyD~~P-~~~~--~PyV~----lG~~~~~~~~t~~~~~~~~~lti~Vw 71 (145) T protein:vir:12 1 MWVSVERYLFNKVYNKLKSNPIIQKQLG-GR-VFDCVQ-KDAV--YPYIV----VGETNVTNKETTTSMVEDVGITLHVY 71 (145) T ss_pred CcccHHHHHHHHHHHHhhcChhHHHhcC-cc-cccCCc-cCCC--CCEEE----eccceeeecCCCcccceEEEEEEEEE Confidence 9854444444455556532211 1111 12 355544 2222 23543 35555556655543 46677899988 Q ss_pred EecCCChHHHHHHHHHHHHhccCCccccCCeEEEEcCC-ccccCCccCCCeEE--EEEEEEEEeeC Q lcl|NC_016566. 78 FPAGTGTQYASELADAIATSDKWQAVRISGAAFQLQDS-PYTSSVVEDVDRAR--IVVTVPYTCCA 140 (140) Q Consensus 78 ~p~G~G~~~a~~~A~~ia~~F~~~~~~~~g~~v~v~~~-p~~~~~~~~~~~~~--iPVsi~yra~a 140 (140) . ...|...+.++|+.|.+...+...+..+..+.+.-. -.+. ..+++..+. 
+.+++.+|=+- T Consensus 72 s-~~~gr~ea~~ia~ai~~aL~~~l~l~~~~lv~l~~~~~~~~-rd~d~~~~hgvl~~ra~i~~~~ 135 (145) T protein:vir:12 72 S-QARNRDEASQIIQFLGFVLNNEIEIDYYSFIKSRIDTQEVI-TDIDQYTKHGIIRLVFKYRHNT 135 (145) T ss_pred E-cCccHHHHHHHHHHHHHHhccccCCCCceEEEEEEeeEEEE-ecCCCceEEEEEEEEEEEEeCC Confidence 6 466899999999999776554333333322222211 1111 223333333 44444444444 No 32 >protein:vir:100116 Length: 115 # NCBI annotation: gp10 # Family: family:all:2712 # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945040;genbank:gi:38707900;genbank:GeneID:2744163 Probab=86.44 E-value=0.014 Score=30.77 Aligned_cols=114 Identities=18% Similarity=0.150 Sum_probs=64.1 Q ss_pred CchHHHHHHHHHHHHHHHHhhhhcCCCcceecCCCCCCCCCCCccEEEEEEcCCCccceecCCCCeEEEEEEEEEEEEec Q lcl|NC_016566. 1 MSNTAIRKALNSVVEELSVSLSTSQRPIQVNWENVSGDHANGSGVYLEPYLLPAPTQFVGFQQKGRIYAGVYQVAVVFPA 80 (140) Q Consensus 1 Ms~~~Ir~~~e~~~a~l~~~a~~~~~~lpva~pN~~F~pp~~~~~yLr~~~~pa~t~~~~l~~~~~~~~G~~qv~v~~p~ 80 (140) |+.-.|+++|..+. + .-.||++. |-..+.+|+-...+-+.. ...|+|.+....+.+||+++... T Consensus 1 ~~~~~i~~aL~~l~------------~-~RVyp~~a--P~~~~~Pyiv~q~vsg~p-~~~L~G~~~~~~~~vQIDvyA~t 64 (115) T protein:vir:10 1 MSVIVIRDALQGIG------------G-AKGYLGVA--PEKAPAPYFVVTRVHGAL-DMALAGLTGGRSGSYQIDCYAPT 64 (115) T ss_pred CeeEEeehhhcccC------------C-ceeecccC--CCCCCCCEEEEEeecCcc-ccccCCCCCCcceEEEEEEeeCC Confidence 99888888876321 1 13456664 222223477666665544 44888887778999999999754 Q ss_pred CCChHHHHHHHHHHHHhccCCccccCCeEEE-EcCCccccCCccCCCeEEEEEEEEE Q lcl|NC_016566. 81 GTGTQYASELADAIATSDKWQAVRISGAAFQ-LQDSPYTSSVVEDVDRARIVVTVPY 136 (140) Q Consensus 81 G~G~~~a~~~A~~ia~~F~~~~~~~~g~~v~-v~~~p~~~~~~~~~~~~~iPVsi~y 136 (140) -.+|+++++++.+.-.+. ...+.+. 
+++.+......+.--+..+=++|=| T Consensus 65 ---~~~A~~l~~~v~~~~~~~---~~~~~~~~~~~~~d~ye~dt~lyR~s~D~~vWf 115 (115) T protein:vir:10 65 ---FTDADRLADLAVDRAMSV---QDRFSVGGVDELPDDYSEDTGLFRISLELSVEF 115 (115) T ss_pred ---HHHHHHHHHHHHHHHhcC---ccceeEeeecCCCCCCcccccceeeEEEEEEeC Confidence 567777777776532211 1112222 2333333322322234555555556 No 33 >protein:vir:107096 Length: 145 # NCBI annotation: conserved phage protein # Family: family:all:296 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950611;genbank:gi:119953691;genbank:GeneID:4643105 Probab=85.46 E-value=0.053 Score=27.59 Aligned_cols=130 Identities=9% Similarity=-0.033 Sum_probs=64.0 Q ss_pred CchHHHHHHHHHHHHHHHHhhh--hcCCCcceecCCCCCCCCCCCccEEEEEEcCCCccceecCCCC-eEEEEEEEEEEE Q lcl|NC_016566. 1 MSNTAIRKALNSVVEELSVSLS--TSQRPIQVNWENVSGDHANGSGVYLEPYLLPAPTQFVGFQQKG-RIYAGVYQVAVV 77 (140) Q Consensus 1 Ms~~~Ir~~~e~~~a~l~~~a~--~~~~~lpva~pN~~F~pp~~~~~yLr~~~~pa~t~~~~l~~~~-~~~~G~~qv~v~ 77 (140) ||-..=..+-+++.++|...++ +..++ . .|.+.+ .++ +.+|+- -+.++..+.+.+| ....-.++|+|+ T Consensus 1 Ms~s~~~aLq~Ai~~~L~ad~al~alvg~-r-VyD~~P-~~a--~~PyV~----lG~~~~~~~~~~~~~g~~~~~ti~Vw 71 (145) T protein:vir:10 1 MWVSVERYLFNKIYNKLKSNPIIKKQLGG-R-VFDCVQ-KDA--VYPYIV----VGETNVTNKETTTSMFEDVGVTLHVY 71 (145) T ss_pred CchhHHHHHHHHHHHHhhcChhHHHhhcc-c-cccCCc-cCC--CCCEEE----eCcceeeecCCCcccceEEEEEEEEE Confidence 9944434444444456544321 11111 1 244443 111 223533 3445555555555 356778999988 Q ss_pred EecCCChHHHHHHHHHHHHhccCCccccCCeEEEEcCCccccCCccCCCeEEEEEEEEEEeeC Q lcl|NC_016566. 78 FPAGTGTQYASELADAIATSDKWQAVRISGAAFQLQDSPYTSSVVEDVDRARIVVTVPYTCCA 140 (140) Q Consensus 78 ~p~G~G~~~a~~~A~~ia~~F~~~~~~~~g~~v~v~~~p~~~~~~~~~~~~~iPVsi~yra~a 140 (140) . 
.+.|-..+.++|+.|.+.......+..+..+.+.-.-+..-...++..+.--++|.++.-- T Consensus 72 s-~~~g~~ea~~ia~av~~aL~a~l~l~~~~lv~l~~~~~~~~rd~dg~~~hgvl~~ra~ve~ 133 (145) T protein:vir:10 72 S-QARNRDEASQIIQYLGFVLNSEIEINNYSFIKSRIDTQEVITDIDQYTKHGIIRLIFKYRH 133 (145) T ss_pred E-cCCCHHHHHHHHHHHHHHhCCCcCCCCCeEEEEEEeeeeEeecCCCceEEEEEEEEEEEee Confidence 7 4668899999999998777533333333222332221111123344444444444443333 No 34 >protein:vir:105337 Length: 145 # NCBI annotation: conserved phage protein # Family: family:all:296 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950674;genbank:gi:119967844;genbank:GeneID:4643216 Probab=85.31 E-value=0.054 Score=27.54 Aligned_cols=130 Identities=9% Similarity=-0.033 Sum_probs=63.9 Q ss_pred CchHHHHHHHHHHHHHHHHhhh--hcCCCcceecCCCCCCCCCCCccEEEEEEcCCCccceecCCCC-eEEEEEEEEEEE Q lcl|NC_016566. 1 MSNTAIRKALNSVVEELSVSLS--TSQRPIQVNWENVSGDHANGSGVYLEPYLLPAPTQFVGFQQKG-RIYAGVYQVAVV 77 (140) Q Consensus 1 Ms~~~Ir~~~e~~~a~l~~~a~--~~~~~lpva~pN~~F~pp~~~~~yLr~~~~pa~t~~~~l~~~~-~~~~G~~qv~v~ 77 (140) ||-..=..+-+++.++|...++ +..++ . .|.+.+ .++ +.+|+- -+.++..+.+.+| ....-.++|+|+ T Consensus 1 Ms~s~~~aLq~Ai~~~L~ad~al~alvg~-r-VyD~~P-~~a--~~PyV~----lG~~~~~~~~~~~~~g~~~~~ti~Vw 71 (145) T protein:vir:10 1 MWVSVERYLFNKIYNKLKSNPIVSKQLGG-R-VFDCVQ-KDA--VYPYIV----VGETNVTNKETTTSMFEDVGVTLHVY 71 (145) T ss_pred CchhHHHHHHHHHHHHhhcChhHHHhhcc-c-cccCCc-cCC--CCCEEE----eCcceeeecCCCcccceEEEEEEEEE Confidence 9944434444444456544321 11111 1 244443 111 223533 3445555555555 356778999988 Q ss_pred EecCCChHHHHHHHHHHHHhccCCccccCCeEEEEcCCccccCCccCCCeEEEEEEEEEEeeC Q lcl|NC_016566. 78 FPAGTGTQYASELADAIATSDKWQAVRISGAAFQLQDSPYTSSVVEDVDRARIVVTVPYTCCA 140 (140) Q Consensus 78 ~p~G~G~~~a~~~A~~ia~~F~~~~~~~~g~~v~v~~~p~~~~~~~~~~~~~iPVsi~yra~a 140 (140) . 
.+.|-..+.++|+.|.+.......+..+..+.+.-.-+..-...++..+.--++|.++.-- T Consensus 72 s-~~~g~~ea~~ia~av~~aL~a~l~l~~~~lv~l~~~~~~~~rd~dg~~~hgvl~~ra~ve~ 133 (145) T protein:vir:10 72 S-QARNRDEASQIIQYLGFVLNSEIEINNYSFIKSRIDTQEVITDIDQYTKHGIIRLIFKYRH 133 (145) T ss_pred E-cCCCHHHHHHHHHHHHHHhCCCcCCCCCeEEEEEEeeeeEeecCCCceEEEEEEEEEEEee Confidence 7 4668899999999998877533333333222332221111123344444444444443333 No 35 >protein:vir:79571 Length: 137 # NCBI annotation: putative tail component # Family: family:all:911 # MgeID: mge:1871 # MgeName: cdtI # Cross-refs: genbank:acc:YP_001272522;genbank:gi:148609391;genbank:GeneID:5204407 Probab=83.24 E-value=0.07 Score=26.91 Aligned_cols=126 Identities=15% Similarity=-0.007 Sum_probs=69.7 Q ss_pred Cc-hHHHHHHHHHHHHHHHHhhhhcCCCcceecCCC--CCCCCCCCccEEEEEEcCCCccceecCCCCeEEEEEEEEEEE Q lcl|NC_016566. 1 MS-NTAIRKALNSVVEELSVSLSTSQRPIQVNWENV--SGDHANGSGVYLEPYLLPAPTQFVGFQQKGRIYAGVYQVAVV 77 (140) Q Consensus 1 Ms-~~~Ir~~~e~~~a~l~~~a~~~~~~lpva~pN~--~F~pp~~~~~yLr~~~~pa~t~~~~l~~~~~~~~G~~qv~v~ 77 (140) |+ |++||+++-. +|.... +.. +.|-|- .|.+.+. -+=..|++-.+.....+|..+ ..+..|.|.|| T Consensus 5 M~iht~IR~~Vid---~L~~~l----~~~-~~ffdGrP~fiDe~E-lPAVAV~l~da~~~~~~ld~~--~W~A~LhI~iy 73 (137) T protein:vir:79 5 MNRHTQIRQVVLA---RLREQC----GDS-ATFFDGLPAFVDAQE-LPAVSVWLSDAQYTGKMTDED--DWQAVLHIAVF 73 (137) T ss_pred hHHHHHHHHHHHH---HHHhhc----CCc-EEEeCCccceechhh-CcEEEEEeecCCCCcceecCC--eeEEEEEEEEE Confidence 66 9999977654 443222 222 444442 3554432 235677777777777777655 58999999999 Q ss_pred EecCCChHHHHHHHHH-HHHhccCCccccCCeEEEE-cCCccccCCccCCCeEEEE--EEEEEEe Q lcl|NC_016566. 78 FPAGTGTQYASELADA-IATSDKWQAVRISGAAFQL-QDSPYTSSVVEDVDRARIV--VTVPYTC 138 (140) Q Consensus 78 ~p~G~G~~~a~~~A~~-ia~~F~~~~~~~~g~~v~v-~~~p~~~~~~~~~~~~~iP--Vsi~yra 138 (140) -|+..+..+-.++|.+ |..-...-..+. 
++...+ ...=+=.+-.+..+|...- -+|+|.- T Consensus 74 Lka~~~ds~LD~~~E~~I~~v~~~~~~l~-~l~~~~~~~gY~Y~rD~e~~tW~sadL~y~ItYe~ 137 (137) T protein:vir:79 74 IRAQAPDSELDMWMESTIFPALNDVPALS-GLIDTLIPLGFNYQRDNEMATWAMAEITYQITYTN 137 (137) T ss_pred eecCCCHHHHHHHHHHHHHHhhcchhhhh-hHhhhhhcccCCcccccccceeEEEEEEEEEEEcC Confidence 9999999999999997 444333221111 110000 0000011112223455443 4566666 No 36 >protein:vir:4348 Length: 121 # NCBI annotation: Orf15 # Family: family:all:896 # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061511;genbank:gi:9635607;genbank:GeneID:1262874 Probab=75.81 E-value=0.14 Score=25.22 Aligned_cols=119 Identities=16% Similarity=0.081 Sum_probs=67.7 Q ss_pred CchHHHHHHHHHHHHHHHHhhhhcCCCcceecC-CCCCCCCCC-CccEEEEEEcCCCccceecCCCCeEEEEEEEEEEEE Q lcl|NC_016566. 1 MSNTAIRKALNSVVEELSVSLSTSQRPIQVNWE-NVSGDHANG-SGVYLEPYLLPAPTQFVGFQQKGRIYAGVYQVAVVF 78 (140) Q Consensus 1 Ms~~~Ir~~~e~~~a~l~~~a~~~~~~lpva~p-N~~F~pp~~-~~~yLr~~~~pa~t~~~~l~~~~~~~~G~~qv~v~~ 78 (140) |-. .|++++++= +++.+-+.+. + .-.|| ++. |.+ +.+|+-.+.+-+.... .|+|.+....+.+||+|+. T Consensus 1 m~~-~i~~~l~~d-~~v~allg~~--~-~Rvyp~~~a---P~~~~~Pyiv~q~vsg~p~~-~l~g~~~~~~~~vQIDvyA 71 (121) T protein:vir:43 1 MYP-PIFKVCSSS-PAVTAILGAS--P-LRMYQFGLA---PQLVVKPYATWQTISGSPEN-YLWGRPDADGFTIQVDIFS 71 (121) T ss_pred CCh-HHHHHHhhC-hhhhhhhcCC--C-ceeeccCCC---CCCCcCCeEEEEEecCcccc-eecCCCCcceeEEEEEeee Confidence 887 588777752 2332222211 1 23455 553 322 2347777766665444 4888776788999999995 Q ss_pred ecCCChHHHHHHHHHHHHhccCCccccCCeEEEEcCCccccCCccCCCeEEEEEEEEEEeeC Q lcl|NC_016566. 79 PAGTGTQYASELADAIATSDKWQAVRISGAAFQLQDSPYTSSVVEDVDRARIVVTVPYTCCA 140 (140) Q Consensus 79 p~G~G~~~a~~~A~~ia~~F~~~~~~~~g~~v~v~~~p~~~~~~~~~~~~~iPVsi~yra~a 140 (140) . --.+|.+++++|++...+. .+.+. ... 
..-.++..-|.+.+-|+|--.- T Consensus 72 ~---t~~~A~~l~~av~~Al~~~-----~~~~~--~~~--~~ye~dT~lyR~s~Dv~w~~~r 121 (121) T protein:vir:43 72 A---TAAEARDAAKAIRDAIELS-----AYVVR--WGG--ESVDPDTKTYRVSFDVDWIVQR 121 (121) T ss_pred C---CHHHHHHHHHHHHHHhhhc-----CCccc--CCC--CCCcccccceeeeeEEEEeecC Confidence 4 4578899999998766531 11111 111 1122333467777777764444 No 37 >protein:vir:195 Length: 115 # NCBI annotation: Gp11 # Family: family:all:896 # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037705;genbank:gi:9634170;genbank:GeneID:1262540 Probab=73.17 E-value=0.17 Score=24.75 Aligned_cols=115 Identities=13% Similarity=-0.046 Sum_probs=57.0 Q ss_pred CchHHHHHHHHHHHHHHHHhhhhcCCCcceecCCCCCCCCCCCccEEEEEEcCCCccceecCCCCeEEEEEEEEEEEEec Q lcl|NC_016566. 1 MSNTAIRKALNSVVEELSVSLSTSQRPIQVNWENVSGDHANGSGVYLEPYLLPAPTQFVGFQQKGRIYAGVYQVAVVFPA 80 (140) Q Consensus 1 Ms~~~Ir~~~e~~~a~l~~~a~~~~~~lpva~pN~~F~pp~~~~~yLr~~~~pa~t~~~~l~~~~~~~~G~~qv~v~~p~ 80 (140) |.+..|.++|.. |+ .++ -+|-.-|-..-..|+-+.+|+-...+-+.. ...|+|.. .....+||+++.+ T Consensus 1 M~e~~i~~lL~~----l~---~gR--vyp~~aP~~~~~~~~~~~Pyiv~q~vsg~p-~~~L~G~~-~~~~~vQIDvyA~- 68 (115) T protein:vir:19 1 MNEDNIYALLSP----LA---EGR--VYPYVAPLGSDGKPSVSPPWIIFSIVDDVS-ADVLCGQA-ESRVSVQVDVYST- 68 (115) T ss_pred CchhHHHHHHhh----hc---Ccc--cceeeccCCCCCCccccCCeEEEEeccCcc-cccccCCC-ccceEEEEEEeeC- Confidence 999999888753 22 122 234333332222333233577666554433 33466643 3556999998865 Q ss_pred CCChHHHHHHHHHHHHhccCCccccCCeEEEEcCCccccCCccCCCeEEEEEEEEEEe Q lcl|NC_016566. 81 GTGTQYASELADAIATSDKWQAVRISGAAFQLQDSPYTSSVVEDVDRARIVVTVPYTC 138 (140) Q Consensus 81 G~G~~~a~~~A~~ia~~F~~~~~~~~g~~v~v~~~p~~~~~~~~~~~~~iPVsi~yra 138 (140) ....|++++++|++......- +.+. .. 
..-.++-.-|+..+-|...- T Consensus 69 --t~~~A~~l~~~i~~Al~~~~p----~~~~---~~--~~ye~dt~lyR~s~d~~V~~ 115 (115) T protein:vir:19 69 --SIAESRSLRDLVLASLEPLTP----TEVV---KI--PGYEPDYRLYRATLDFKVTP 115 (115) T ss_pred --ChHHHHHHHHHHHHHhhhcCC----EEec---CC--CCcccchhceeeEEEEEecC Confidence 456788888888775542111 1110 10 11122223444333333333 No 38 >protein:vir:96125 Length: 140 # NCBI annotation: ORF038 # Family: family:all:296 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240084;genbank:gi:66395765;genbank:GeneID:5133106 Probab=69.08 E-value=0.23 Score=24.10 Aligned_cols=126 Identities=8% Similarity=-0.008 Sum_probs=58.5 Q ss_pred CchH-HHHHHHHHHHHHHHHhh--hhcCCCcceecCCCCCCCCCCCccEEEEEEcCCCccceecCCCC-eEEEEEEEEEE Q lcl|NC_016566. 1 MSNT-AIRKALNSVVEELSVSL--STSQRPIQVNWENVSGDHANGSGVYLEPYLLPAPTQFVGFQQKG-RIYAGVYQVAV 76 (140) Q Consensus 1 Ms~~-~Ir~~~e~~~a~l~~~a--~~~~~~lpva~pN~~F~pp~~~~~yLr~~~~pa~t~~~~l~~~~-~~~~G~~qv~v 76 (140) ||-. +.++++- ++|..-+ .+..++ + .|.+.+ .++ +.+|+.+ +.++..+-+.+| ....=.++|+| T Consensus 3 msa~~aLq~Ai~---~~L~ad~~l~alvgg-r-VyD~~P-~~~--~~PYV~l----G~~~~~~~~~~~~~g~~~~~tl~V 70 (140) T protein:vir:96 3 VTAEPLLYNKIM---NNLIENPITDKLVGG-R-VFDCVQ-KDV--VYPYIVV----GESNVTESERSPGMREIIAITFHV 70 (140) T ss_pred cchhHHHHHHHH---HHhccChhHHhhcCc-c-cccCCc-cCC--CCCEEEe----CCceeeecCCCcccceEEEEEEEE Confidence 5532 4444443 4553322 111111 2 244433 222 2235533 444444444444 24566788886 Q ss_pred EEecCCChHHHHHHHHHHHHhccCCccccCCeEE-EEcCCccccCCccCCCeEEEEEEEEEEeeC Q lcl|NC_016566. 77 VFPAGTGTQYASELADAIATSDKWQAVRISGAAF-QLQDSPYTSSVVEDVDRARIVVTVPYTCCA 140 (140) Q Consensus 77 ~~p~G~G~~~a~~~A~~ia~~F~~~~~~~~g~~v-~v~~~p~~~~~~~~~~~~~iPVsi~yra~a 140 (140) + ..+.|-.++.++|+.|.+.... 
.+.-.|+.+ .+.-.-...-..+++..+.--++|.+|--- T Consensus 71 w-s~~~g~~ea~~ia~ai~~aL~~-~l~l~~~~lv~l~~~~~~~~rd~dg~t~hgvl~~ra~ve~ 133 (140) T protein:vir:96 71 Y-SQYENGAEARELLKYLNYACRL-NINFKDYELEWIKKDNSQVFTDIDQYTKHGVLRLLYKVRH 133 (140) T ss_pred E-EcCCCHHHHHHHHHHHHHHhcC-CccCCCceEEEEEEeeeEEeecCCCceEEEEEEEEEEEee Confidence 6 4677999999999999777743 333344332 222111111122333333333444444333 No 39 >protein:vir:5979 Length: 134 # NCBI annotation: hypothetical protein # Family: family:all:296 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690679;genbank:geneid:6329147;genbank:gi:22855073;uniprot:O48448;genbank:GeneID:955319 Probab=62.98 E-value=0.33 Score=23.26 Aligned_cols=127 Identities=12% Similarity=-0.023 Sum_probs=65.2 Q ss_pred CchH----HHHHHHHHHHHHHHHhhhhcCCCcceecCCCCCCCCCCCccEEEEEEcCCCccceecCCCC-eEEEEEEEEE Q lcl|NC_016566. 1 MSNT----AIRKALNSVVEELSVSLSTSQRPIQVNWENVSGDHANGSGVYLEPYLLPAPTQFVGFQQKG-RIYAGVYQVA 75 (140) Q Consensus 1 Ms~~----~Ir~~~e~~~a~l~~~a~~~~~~lpva~pN~~F~pp~~~~~yLr~~~~pa~t~~~~l~~~~-~~~~G~~qv~ 75 (140) ||-. ..++++- ++|...++-. .-+.=.|.+++ .++ +.+|+- -+.++..+.+.+| ....-.++|+ T Consensus 1 m~~~s~~~aLq~Ai~---~~L~ad~~l~-alvg~I~D~~P-~~~--~~PYV~----lG~~~~~d~~~~~~~g~~~~~ti~ 69 (134) T protein:vir:59 1 MTWKLASRALQKATV---ENLESYQPLM-EMVNQVTESPG-KDD--PYPYVV----IGDQSSTPFETKSSFGENITMDFH 69 (134) T ss_pred CCccchhHHHHHHHH---HHhhcChhHH-HhhhhhhcCCC-CCC--CCCEEE----eCCceeeecCCCcccceEEEEEEE Confidence 8844 4454444 4443322110 00001344433 111 223543 3445555555554 3567788999 Q ss_pred EEEecCCChHHHHHHHHHHHHhccCCccc-cCCeEEEEcCCccccCCccCCCeEEEEEEEEEEeeC Q lcl|NC_016566. 76 VVFPAGTGTQYASELADAIATSDKWQAVR-ISGAAFQLQDSPYTSSVVEDVDRARIVVTVPYTCCA 140 (140) Q Consensus 76 v~~p~G~G~~~a~~~A~~ia~~F~~~~~~-~~g~~v~v~~~p~~~~~~~~~~~~~iPVsi~yra~a 140 (140) |+... |...++++|+.|.+...+..+. ..+..+.+.-.-...-..+++..+..-++|..+-.. 
T Consensus 70 Vws~~--g~~ea~~ia~av~~aL~~~~L~l~~~~lv~l~~~~~~~~rd~dg~~~hg~l~fra~ve~ 133 (134) T protein:vir:59 70 VWGGT--TRAEAQDISSRVLEALTYKPLMFEGFTFVAKKLVLAQVITDTDGVTKHGIIKVRFTINN 133 (134) T ss_pred EEECC--ChHHHHHHHHHHHHHhcCCCcccCCceEEEeEEeeeeEEecCCCceEEEEEEEEEEEec Confidence 99765 5578999999998887655543 444333333222222334555555555555555544 No 40 >protein:vir:93602 Length: 114 # NCBI annotation: putative structural component # Family: family:all:896 # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449300;genbank:gi:157166048;uniprot:Q6H9U1;genbank:GeneID:5580424 Probab=58.24 E-value=0.42 Score=22.67 Aligned_cols=111 Identities=10% Similarity=-0.021 Sum_probs=55.7 Q ss_pred CchHHHHHHHHHHHHHHHHhhhhcCCCcceecCCCCCCC---CCCCccEEEEEEcCCCccceecCCCCeEEEEEEEEEEE Q lcl|NC_016566. 1 MSNTAIRKALNSVVEELSVSLSTSQRPIQVNWENVSGDH---ANGSGVYLEPYLLPAPTQFVGFQQKGRIYAGVYQVAVV 77 (140) Q Consensus 1 Ms~~~Ir~~~e~~~a~l~~~a~~~~~~lpva~pN~~F~p---p~~~~~yLr~~~~pa~t~~~~l~~~~~~~~G~~qv~v~ 77 (140) |.+..|..+|.. +. .. =.||.+.=.- +....+|+-...+-+.. ...|+|. ..-.-.+||+|+ T Consensus 1 M~e~~i~~lL~~----~~---~g------Rvyp~~~P~~~~~~~~~~Pyiv~q~vsg~p-~~~l~gp-~~~~~~vQIDvy 65 (114) T protein:vir:93 1 MTEADLYPHLAH----LA---GG------QVYPYVVPLLDGRPSVALPWVVFSLISSVS-ADVMGGQ-AESSVSVQIDVY 65 (114) T ss_pred CchHHHHHHHHh----hc---Cc------ccccccCCcccCcCCccCceEEEEeccCcc-cccccCc-cccceEEEEEee Confidence 999999888753 22 11 2456643110 01112466655554433 3346663 334579999999 Q ss_pred EecCCChHHHHHHHHHHHHhccCCccccCCeEEEEcCCccccCCccCCCeEEEEEEEEEEe Q lcl|NC_016566. 78 FPAGTGTQYASELADAIATSDKWQAVRISGAAFQLQDSPYTSSVVEDVDRARIVVTVPYTC 138 (140) Q Consensus 78 ~p~G~G~~~a~~~A~~ia~~F~~~~~~~~g~~v~v~~~p~~~~~~~~~~~~~iPVsi~yra 138 (140) .. ....|++++++|++....... +.+ ....-. 
.++-.-|+..+-|.+.- T Consensus 66 A~---t~~~A~~l~~~v~~Al~~~~~----~~~---~~~~~y--e~dt~lyR~~~d~~v~~ 114 (114) T protein:vir:93 66 AG---TVTQARQIRQDAREAIMLLAP----GSV---SEMQDY--IPENRCYRATLEFQVTV 114 (114) T ss_pred eC---CHHHHHHHHHHHHHHHhhcCc----Eee---cCCCcc--cccccceeeEEEEEEeC Confidence 65 567888999999876542111 111 010111 22223333333333333 No 41 >protein:vir:1892 Length: 121 # NCBI annotation: gp11 # Family: family:all:896 # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037672;genbank:gi:9634130;genbank:GeneID:1262490 Probab=49.99 E-value=0.62 Score=21.71 Aligned_cols=120 Identities=13% Similarity=-0.017 Sum_probs=68.7 Q ss_pred CchHHHHHHHHHHHHHHHHhhhhcCCCcceecC-CCCCCCCCCCccEEEEEEcCCCccceecCCCCeEEEEEEEEEEEEe Q lcl|NC_016566. 1 MSNTAIRKALNSVVEELSVSLSTSQRPIQVNWE-NVSGDHANGSGVYLEPYLLPAPTQFVGFQQKGRIYAGVYQVAVVFP 79 (140) Q Consensus 1 Ms~~~Ir~~~e~~~a~l~~~a~~~~~~lpva~p-N~~F~pp~~~~~yLr~~~~pa~t~~~~l~~~~~~~~G~~qv~v~~p 79 (140) |-. -|.+++++- +.+.+-+.+. + +-.|| ++. |..-+.+|+-.+.+-+ .....|+|.+-...+.+||+++.. T Consensus 1 m~~-~i~~~l~~d-~~v~allg~~-~--~Rvyp~~~a--P~~~~~Pyiv~q~vsg-~p~~~l~G~~~~~~~~vQIDvyA~ 72 (121) T protein:vir:18 1 MIA-PIFSVCASS-PEVTDLLGSN-P--VRIYPFGIQ--DDNVVYPYVVWQNITG-SPENYIAQRPDADFFTLQVDAYAD 72 (121) T ss_pred Cch-HHHHHHhcC-hhhhhhhcCC-C--ceeeeccCC--CCcCcCCeEEEEEecC-cccceecCCCCcceeEEEEEeecC Confidence 876 477777643 3443332211 1 23455 553 2222334766655544 445667777777889999999976 Q ss_pred cCCChHHHHHHHHHHHHhccCCccccCCeEEEEcCCccccCCccCCCeEEEEEEEEEEeeC Q lcl|NC_016566. 80 AGTGTQYASELADAIATSDKWQAVRISGAAFQLQDSPYTSSVVEDVDRARIVVTVPYTCCA 140 (140) Q Consensus 80 ~G~G~~~a~~~A~~ia~~F~~~~~~~~g~~v~v~~~p~~~~~~~~~~~~~iPVsi~yra~a 140 (140) . -.+|.+++++|++..+. .++... .. 
...-.++..-|.+.+-|+|--.- T Consensus 73 t---~~~A~~l~~avr~Ale~-----~~~~~~--~~--~~~ye~dT~lyR~s~Dv~~~~~r 121 (121) T protein:vir:18 73 T---VDEVIAVATALRDAIEP-----HAHITR--WG--GQERDPETKRYRYSFDVDWIVTR 121 (121) T ss_pred C---HHHHHHHHHHHHHHhhh-----cCcccC--CC--CCCCcccccceeeeeEEEEeecC Confidence 5 45688999999886652 222111 11 12233445577777777775555 No 42 >protein:vir:105772 Length: 128 # NCBI annotation: gp15 # Family: family:all:10994 # MgeID: mge:1501 # MgeName: ES18 # Cross-refs: genbank:acc:YP_224153;genbank:gi:62362228;genbank:GeneID:3342525 Probab=33.55 E-value=1.3 Score=19.87 Aligned_cols=121 Identities=12% Similarity=0.092 Sum_probs=67.9 Q ss_pred CchHHHHHHHHHHHHHHHHhhhhcCCCcc---eecCCCCCCCCCCCccEEEEEEcCCCccceecCCCCeEEEEEEEEEEE Q lcl|NC_016566. 1 MSNTAIRKALNSVVEELSVSLSTSQRPIQ---VNWENVSGDHANGSGVYLEPYLLPAPTQFVGFQQKGRIYAGVYQVAVV 77 (140) Q Consensus 1 Ms~~~Ir~~~e~~~a~l~~~a~~~~~~lp---va~pN~~F~pp~~~~~yLr~~~~pa~t~~~~l~~~~~~~~G~~qv~v~ 77 (140) |+.+++-++++..+..- +.+.++. ..|.....+ ++. .|+-.-=..+. .-.++.++. -|+|.++ T Consensus 1 ~~~~~m~~~vr~~l~da-----GLt~GftvQl~~W~d~~g~--~~e-~~iV~qpNGGt-~i~d~~~~d-----y~~i~~V 66 (128) T protein:vir:10 1 MTRSEVYDALRVWLQSH-----GFDVGYRVQKRFWNEQEGT--EGE-RYLVIQQNGGG-KPEEAITRD-----FFRILVL 66 (128) T ss_pred CchhHHHHHHHHHHHhC-----CCcchheeeeeeeeccCCC--CCc-eEEEEecCCCC-chhhhcccc-----eeEEEEE Confidence 99888887777544331 2223333 467664322 121 36655544444 333443333 4778888 Q ss_pred EecCCCh-HHHHHHHHHHHHhccCCcccc-CCeEEEEcCCccccCCccCCCeEEEEEEEEEEeeC Q lcl|NC_016566. 78 FPAGTGT-QYASELADAIATSDKWQAVRI-SGAAFQLQDSPYTSSVVEDVDRARIVVTVPYTCCA 140 (140) Q Consensus 78 ~p~G~G~-~~a~~~A~~ia~~F~~~~~~~-~g~~v~v~~~p~~~~~~~~~~~~~iPVsi~yra~a 140 (140) ..++.|. .++++.|++|.++-......+ -| +|..--.+-|..++++++-. 
++.+||-- T Consensus 67 sg~~d~~~~~ve~ra~~Ii~yv~~np~~~cig---~i~n~Ggippi~T~EgR~if--rL~f~~i~ 126 (128) T protein:vir:10 67 SGQNDSDINEVEDRADAIRQAMIDDYRTECII---SMQPVGGITAIQTEEGRYLF--DISFQTII 126 (128) T ss_pred eecCCCcchhHHHHHHHHHHHHHhCccccccc---eeeccCCCCCccccCCceee--eehhhhhh Confidence 8888887 579999999999885444332 23 23211111256677776543 44555554 No 43 >protein:vir:3874 Length: 114 # NCBI annotation: putative head-tail joining protein # Family: family:all:28620 # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680491;swissprot:trembl:p94215;genbank:gi:22296531;uniprot:P94215;genbank:GeneID:951676 Probab=30.55 E-value=1.6 Score=19.51 Aligned_cols=112 Identities=10% Similarity=-0.004 Sum_probs=63.4 Q ss_pred Cc-hHHHHHHHHHHHHHHHHhhhhcCCCcceecCCCCCCCCCCCccEEEEEEcCCCccceecCCCCe-EEEEEEEEEEEE Q lcl|NC_016566. 1 MS-NTAIRKALNSVVEELSVSLSTSQRPIQVNWENVSGDHANGSGVYLEPYLLPAPTQFVGFQQKGR-IYAGVYQVAVVF 78 (140) Q Consensus 1 Ms-~~~Ir~~~e~~~a~l~~~a~~~~~~lpva~pN~~F~pp~~~~~yLr~~~~pa~t~~~~l~~~~~-~~~G~~qv~v~~ 78 (140) |. +.+|.+.|.+-. +|++- -.-+++..-|..+|.+.+.. +|.|++.+|++..... ++.+ .+---+||+ || T Consensus 1 ~~PE~~vaDiLsad~-~lv~~---mYipift~tpdd~fik~SsA-PWiRiTpiPGDda~ya--DD~R~~EYPrVqVD-fW 72 (114) T protein:vir:38 1 MAPEKRVYDILSANL-DIADK---VYIGTPNFNNQTSATPESLA-PWVRITYLPGDAADYA--DDSRILEYPKVQVD-FW 72 (114) T ss_pred CCchhhhhhhhccch-hhhhh---eeccCCCCCCCCcccccccC-CeeEeeecCCcccccc--ccceeeecCceeEE-Ee Confidence 65 556666665321 22211 12345556667788887776 4999999999865432 3333 233556776 56 Q ss_pred ecCCChHHHHHHHHHHHHhccCCccccCCeEEEEcCCccccC Q lcl|NC_016566. 79 PAGTGTQYASELADAIATSDKWQAVRISGAAFQLQDSPYTSS 120 (140) Q Consensus 79 p~G~G~~~a~~~A~~ia~~F~~~~~~~~g~~v~v~~~p~~~~ 120 (140) =.-.|....+++-.+|-+.....-.-+.-...|+.+-|+.+. 
T Consensus 73 vr~e~~d~~e~iqe~IY~~Lha~gweRYY~nsY~D~~~~~~~ 114 (114) T protein:vir:38 73 VGITDWDQQEKIETQIYQALHAADWERYYRNSYVDGIPQPFA 114 (114) T ss_pred eccCChhhHHHHHHHHHHHHHhcCcceeeeccccCCCCCCCC Confidence 677899999999888855432111112212224444444443 No 44 >protein:vir:10327 Length: 182 # NCBI annotation: ORF29 # Family: family:all:1090 # MgeID: mge:182 # MgeName: VHML # Cross-refs: genbank:acc:NP_758922;genbank:gi:27311196;genbank:GeneID:956141 Probab=27.51 E-value=1.8 Score=19.14 Aligned_cols=132 Identities=17% Similarity=0.043 Sum_probs=67.5 Q ss_pred CchHHHHHHHHHHHHHHHHhhhhcCCCcceec--CCCCCCCCCCCccEEEEEEcCCCccceecCCCCeEEEEEEEEEEEE Q lcl|NC_016566. 1 MSNTAIRKALNSVVEELSVSLSTSQRPIQVNW--ENVSGDHANGSGVYLEPYLLPAPTQFVGFQQKGRIYAGVYQVAVVF 78 (140) Q Consensus 1 Ms~~~Ir~~~e~~~a~l~~~a~~~~~~lpva~--pN~~F~pp~~~~~yLr~~~~pa~t~~~~l~~~~~~~~G~~qv~v~~ 78 (140) ||+..|-+..+++.+.|.+ ..|.+++.. |... .....+-+++.+.-+.-+. +.+..-......+.+.+++ T Consensus 1 mt~~~l~~lh~AI~~~Lk~----~~p~l~~~~~y~~~~-~~i~~PAv~vel~~~~~~~---d~~tGq~~~~~~~~a~~vv 72 (182) T protein:vir:10 1 MSQTTITEVHEAIKAKLRE----TFPKVTVDDYNPEPE-LSVLAPALLLELEEFPMGA---DVGDDRYPAACRFSVHCVL 72 (182) T ss_pred CCcCCHHHHHHHHHHHHHH----hcCCceeeecCcccc-CccccceeeeeeecCCcCC---CCCCCcEEEEEEEEEEEEe Confidence 9998877777766666653 345666543 3322 1111222466665554332 2222222344555555555 Q ss_pred e--cCCChHHHHHHHHHHHHhccCCcccc----CCeEEEEcCCccccCCccCCCeEEEEEEEEEEeeC Q lcl|NC_016566. 
79 P--AGTGTQYASELADAIATSDKWQAVRI----SGAAFQLQDSPYTSSVVEDVDRARIVVTVPYTCCA 140 (140) Q Consensus 79 p--~G~G~~~a~~~A~~ia~~F~~~~~~~----~g~~v~v~~~p~~~~~~~~~~~~~iPVsi~yra~a 140 (140) - ...-...++.+|.+|+.+-.+++.+= .+.-..|...|.-+.....++.-.--|+..=+..- T Consensus 73 ~~~~~~~~~~~~~lAa~l~~~v~~~~wGL~~~~v~~a~~i~a~p~~f~~~~~dgy~vW~VeW~Q~i~L 140 (182) T protein:vir:10 73 GWEVKSLALELWEFSAAVAQLIRKSGVWVKGGVLTKPEGLEVYPGSFRKDTQQGYDSRVVTWNQTLYL 140 (182) T ss_pred cccCCCchHHHHHHHHHHHHHHhcCcccCCccccCccceeeeccCccChhhcCceEEEEEEEEEEEee Confidence 4 44446889999999999888776642 23334455555554443323222222222222211 No 45 >protein:vir:80105 Length: 162 # NCBI annotation: gp13 # Family: family:all:2729 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468717;genbank:gi:157325297;genbank:GeneID:5601796 Probab=22.65 E-value=2.4 Score=18.48 Aligned_cols=130 Identities=15% Similarity=0.104 Sum_probs=62.7 Q ss_pred CchHHH----HHHHHHHHHHHHHhhhhcCCCcceecCCCCCCCCCCCccEEEEEE-cCCCccceecCCCCeEEEEEEEEE Q lcl|NC_016566. 1 MSNTAI----RKALNSVVEELSVSLSTSQRPIQVNWENVSGDHANGSGVYLEPYL-LPAPTQFVGFQQKGRIYAGVYQVA 75 (140) Q Consensus 1 Ms~~~I----r~~~e~~~a~l~~~a~~~~~~lpva~pN~~F~pp~~~~~yLr~~~-~pa~t~~~~l~~~~~~~~G~~qv~ 75 (140) |.+..- -.+++..+.+|.. ...++.+...+.....|+-| |....+ .|-.....++ .++..|.-.+|++ T Consensus 1 ~~~~~~~~~~~~lv~~ii~~i~~----~~~gl~vI~~~~~g~~p~yP--F~TY~v~~pyi~~~~~~-~~~e~~~~~isi~ 73 (162) T protein:vir:80 1 MPNDTAGYDYGKLVKTLINAVNE----LSGGLQLIESSSGGEQPEYP--FCQYTITSPYIAISPDI-VEGEQFEIVISLT 73 (162) T ss_pred CCCccccccHHHHHHHHHHHHHh----hhcceeEEEccCCCCCCCCC--eEEEEEecCccccCCcc-cCCcceEEEEEEE Confidence 553100 0111111122211 12367787777776677755 766664 2222222222 2445777889999 Q ss_pred EEEecCCChHHHHHHHHHHHHhccCCc---cc-cC-Ce-EEEEcCCc--cccCCccCCCeEEEEEEEEEEeeC Q lcl|NC_016566. 76 VVFPAGTGTQYASELADAIATSDKWQA---VR-IS-GA-AFQLQDSP--YTSSVVEDVDRARIVVTVPYTCCA 140 (140) Q Consensus 76 v~~p~G~G~~~a~~~A~~ia~~F~~~~---~~-~~-g~-~v~v~~~p--~~~~~~~~~~~~~iPVsi~yra~a 140 (140) ++.-.. .+|.++|.+|.++|+... .. .+ |. 
.+-+..+- +....++-+-+|-.=++|+|+-.- T Consensus 74 ~~S~~~---~eAl~la~~l~~~f~~~~~~~~~~~~~gIvvvdv~~~~~R~~~~~~~yerR~GFD~~~Rv~r~~ 143 (162) T protein:vir:80 74 WRALSG---HQALNLANITNKYFRSQKGRFFMQENGGIVVVSVQNSGLRDTFISIEYERSAGIDLRLRVVDSY 143 (162) T ss_pred EEeCCH---HHHHHHHHHHHHHhhcCCceeeeeecCcEEEEecCCCccceeEeeeeeeeeecceEEEEEeecc Confidence 988665 899999999999996432 22 22 32 22222211 111112222233444444443322 Done!