Query lcl|NC_018454.1_cdsid_YP_006590043.1 [gene=B887_gp38] [protein=hypothetical protein] [protein_id=YP_006590043.1] [location=24799..25212] Match_columns 137 No_of_seqs 74 out of 79 Neff 6.6 Searched_HMMs 1612 Date Thu Nov 7 12:58:24 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_38 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_38_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:103278 Length: 169 100.0 5E-43 3.1E-46 252.5 14.5 128 1-134 38-169 (169) 2 protein:vir:107704 Length: 132 100.0 8.8E-43 5.5E-46 251.1 15.2 129 1-137 1-131 (132) 3 protein:vir:104348 Length: 129 100.0 1.3E-40 7.8E-44 239.3 14.6 126 1-134 1-129 (129) 4 protein:vir:79637 Length: 130 100.0 5.2E-40 3.2E-43 235.9 13.7 124 1-135 5-130 (130) 5 protein:vir:94921 Length: 125 99.9 5.6E-27 3.5E-30 164.5 14.7 123 1-135 1-125 (125) 6 protein:vir:97211 Length: 150 99.8 7.3E-21 4.5E-24 131.0 12.9 130 1-137 1-150 (150) 7 protein:vir:80429 Length: 150 99.7 1.4E-20 8.9E-24 129.4 13.0 130 1-137 1-150 (150) 8 protein:vir:95155 Length: 151 99.7 2.4E-20 1.5E-23 128.2 12.9 130 1-137 1-151 (151) 9 protein:vir:78379 Length: 139 99.2 6.1E-15 3.8E-18 98.5 4.7 133 1-137 4-137 (139) 10 protein:vir:94997 Length: 139 99.2 2.3E-14 1.4E-17 95.3 4.8 133 1-137 4-137 (139) 11 protein:vir:95111 Length: 145 96.0 0.00072 4.4E-07 37.8 12.4 126 1-137 5-135 (145) 12 protein:vir:94488 Length: 145 95.9 0.00081 5E-07 37.5 12.5 126 1-137 5-135 (145) 13 protein:vir:97421 Length: 145 95.9 0.00081 5E-07 37.5 12.5 126 1-137 5-135 (145) 14 protein:vir:93736 Length: 145 95.9 0.00081 5E-07 37.5 12.5 126 1-137 5-135 (145) 15 protein:vir:94096 Length: 141 95.9 0.0012 7.3E-07 36.7 13.2 127 1-137 1-135 (141) 16 protein:vir:105892 Length: 141 95.9 0.0012 7.3E-07 36.7 13.2 127 1-137 1-135 (141) 17 protein:vir:96260 Length: 141 95.9 0.0012 7.3E-07 36.7 13.2 127 1-137 1-135 (141) 18 protein:vir:4348 Length: 121 # 95.6 0.0014 8.6E-07 36.2 12.7 118 1-135 1-121 (121) 19 protein:vir:96125 Length: 140 95.4 0.0021 1.3E-06 35.3 13.1 127 1-137 3-135 (140) 20 protein:vir:100242 Length: 114 95.3 0.00049 3E-07 38.7 9.1 112 1-131 1-114 (114) 21 protein:vir:1244 Length: 145 # 95.0 0.0021 1.3E-06 35.3 11.7 126 1-137 1-135 (145) 22 protein:vir:95961 Length: 145 94.9 0.0026 1.6E-06 34.7 11.9 127 1-137 5-135 (145) 23 protein:vir:94794 Length: 145 94.9 0.0027 1.7E-06 34.7 11.9 127 1-137 5-135 (145) 24 protein:vir:96894 Length: 140 94.6 0.0039 2.4E-06 33.8 13.3 127 1-137 1-137 (140) 25 protein:vir:97325 Length: 145 94.1 0.0053 3.3E-06 33.1 12.2 126 1-137 1-135 (145) 26 protein:vir:107096 Length: 145 94.1 0.0054 3.3E-06 33.0 12.2 126 1-137 1-135 (145) 27 protein:vir:105337 Length: 145 94.1 0.0055 3.4E-06 33.0 12.2 126 1-137 1-135 (145) 28 protein:vir:5979 Length: 134 # 93.4 0.0077 4.8E-06 32.2 11.8 127 1-136 1-134 (134) 29 protein:vir:1892 Length: 121 # 92.1 0.013 7.9E-06 31.0 11.8 118 1-135 1-121 (121) 30 protein:vir:97070 Length: 118 83.8 0.065 4.1E-05 27.1 12.1 116 1-136 1-118 (118) 31 protein:vir:80371 Length: 115 83.1 0.027 1.7E-05 29.2 6.7 113 1-131 1-115 (115) 32 protein:vir:10368 Length: 118 80.4 0.095 5.9E-05 26.2 13.1 116 1-136 1-118 (118) 33 protein:vir:100116 Length: 115 79.7 0.063 3.9E-05 27.2 7.4 114 1-131 1-115 (115) 34 protein:vir:1438 Length: 115 # 79.6 0.074 4.6E-05 26.8 7.8 114 1-131 1-115 (115) 35 protein:vir:195 Length: 115 # 74.0 0.16 0.0001 24.9 8.7 113 1-133 1-115 (115) 36 protein:vir:3428 Length: 131 # 71.2 0.2 0.00012 24.4 13.1 122 1-133 1-131 (131) 37 protein:vir:96800 Length: 127 67.1 0.18 0.00011 24.6 6.7 126 1-135 1-127 (127) 38 protein:vir:93602 Length: 114 66.6 0.26 0.00016 23.7 11.1 113 1-133 1-114 (114) 39 protein:vir:81066 Length: 118 51.5 0.58 0.00036 21.9 13.0 116 1-137 1-118 (118) 40 protein:vir:397 Length: 132 # 50.6 0.61 0.00038 21.8 12.8 122 1-133 1-132 (132) 41 protein:vir:79571 Length: 137 31.6 1.5 0.00092 19.6 12.0 122 1-133 5-137 (137) 42 protein:vir:79047 Length: 145 30.9 1.5 0.00096 19.6 12.3 121 1-137 1-126 (145) 43 protein:vir:80105 Length: 162 28.6 1.7 0.0011 19.3 11.4 127 1-137 13-145 (162) 44 protein:vir:105772 Length: 128 28.4 1.8 0.0011 19.2 9.7 121 1-137 1-128 (128) 45 protein:vir:3874 Length: 114 # 22.9 2.4 0.0015 18.5 7.1 108 1-120 1-114 (114) No 1 >protein:vir:103278 Length: 169 # NCBI annotation: phage-related conserved hypothetical protein # Family: family:all:5121 # MgeID: mge:1605 # MgeName: JK06 # Cross-refs: genbank:acc:YP_277458;genbank:gi:71834100;genbank:GeneID:3562389 Probab=100.00 E-value=5e-43 Score=252.51 Aligned_cols=128 Identities=24% Similarity=0.346 Sum_probs=121.7 Q ss_pred CchHHHHHHHHHHHHhhcC--CCCceeeCCCCCCCC--CCcEEEEEEcCCCceeeecCCCceEEEEEEEEEEEEecCCCh Q lcl|NC_018454. 1 MIPDIGAAMNARLGAWADG--QKIPLFIENSPGDKP--AGIFLESFDMPATPQTLDLGLTCHIYPGIFQVNVVVPVGSGT 76 (137) Q Consensus 1 m~~~Ir~al~~rl~~~a~~--~~~pva~pN~~F~pp--~~~ylr~~~~p~~t~~~~l~~~~~~~~G~~qI~v~~p~G~G~ 76 (137) |-.+|.+++|++|.+|++. .++||||||+.|+|| |++|||++++|++|...+|+++|+.|+|+|||+|++|+|+|+ T Consensus 38 ~h~ei~~a~rk~l~~~a~a~~~~LpVA~ENVaFtPp~dG~~YLr~~~lPadT~~~~L~gd~R~y~GVfQIsVV~PaGtG~ 117 (169) T protein:vir:10 38 VHYEMMVAARKLVSDAAVDIAGSLPVAYENCGFTPPKNGSSWLKFDYTEVDSVTWGLQRTCRYYVGMVQVSIFFSPGEGT 117 (169) T ss_pred hHHHHHHHHHHHHHHHHhhcccCCcEeeCCCCcCCCCCCccEEEEEEecCCceeeeccCCCceEEEEEEEEEEecCCCCc Confidence 9999999999999999885 589999999999997 468999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHhhhccceeccCCeEEEecccccccCCcccCCCCEEEEEEEEEEEEE Q lcl|NC_018454. 77 STGRALARQVAALFPEGQSVQGDGFACWISSQPSIYAGVLNPRNTRYSIPVSIPYRAD 134 (137) Q Consensus 77 ~~~~~~Ad~i~a~F~~g~~l~~~~~~v~v~~~p~v~~g~~~~~~~~~~ipVsi~yrad 134 (137) .++.++||+|+++|++|++|+.| ||++.|+++|.+ +.+..|.+|||+.|||| T Consensus 118 ~ka~qiAdeiadlF~~gt~L~~G----yi~~~~~~~p~i--~~~s~~~iPvr~~~R~D 169 (169) T protein:vir:10 118 DRPRQLAGRLSEAFADGTMLDSG----YIYEGGSVFPPV--KSQSGWFIPVRFYVRMD 169 (169) T ss_pred chhHHHHHHHHHhhhCCceeece----eecCCCeECCee--ecCCceEEeEEEEEEeC Confidence 99999999999999999999988 999999999988 44779999999999999 No 2 >protein:vir:107704 Length: 132 # NCBI annotation: hypothetical protein # Family: family:all:5121 # MgeID: mge:1518 # MgeName: T1 # Cross-refs: genbank:acc:YP_003903;genbank:gi:45686319;genbank:GeneID:2773044 Probab=100.00 E-value=8.8e-43 Score=251.14 Aligned_cols=129 Identities=26% Similarity=0.405 Sum_probs=122.6 Q ss_pred CchHHHHHHHHHHHHhhcCCCCceeeCCCCCCCC--CCcEEEEEEcCCCceeeecCCCceEEEEEEEEEEEEecCCChHH Q lcl|NC_018454. 1 MIPDIGAAMNARLGAWADGQKIPLFIENSPGDKP--AGIFLESFDMPATPQTLDLGLTCHIYPGIFQVNVVVPVGSGTST 78 (137) Q Consensus 1 m~~~Ir~al~~rl~~~a~~~~~pva~pN~~F~pp--~~~ylr~~~~p~~t~~~~l~~~~~~~~G~~qI~v~~p~G~G~~~ 78 (137) |-.+|++|+++|+++++.. +||||||+.|+|| |++|||++++|++|...+|+++|+.|+|+|||+|++|+|+|+.+ T Consensus 1 ~hyE~~~a~r~~la~~~~~--lpVA~eNv~F~Pp~~G~~yLr~~~lpa~T~~~~L~~d~r~y~Gv~QI~Vv~paG~G~~~ 78 (132) T protein:vir:10 1 MHYELSAAARAAFLSKYRD--FPHYMENRNFTPPKDGGMWLRFNYIEGDTLYLSIDRKCKSYIAIVQIGVVFPPGSGVDE 78 (132) T ss_pred CchHHHHHHHHHHHhhhcC--CcEeecCCCcCCCCCCceEEEEEEccCCceeeeccCcCcEEEEEEEEEEEecCCCCcch Confidence 9999999999999987655 8999999999997 46999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHhhhccceeccCCeEEEecccccccCCcccCCCCEEEEEEEEEEEEEecC Q lcl|NC_018454. 79 GRALARQVAALFPEGQSVQGDGFACWISSQPSIYAGVLNPRNTRYSIPVSIPYRADISS 137 (137) Q Consensus 79 ~~~~Ad~i~a~F~~g~~l~~~~~~v~v~~~p~v~~g~~~~~~~~~~ipVsi~yrad~~~ 137 (137) +.+|||+|+++|++|++|+.+ +|++.|+++|++ +++..|++|||+.|||||.. T Consensus 79 a~~iAd~i~~~F~~g~~l~~G----yi~~~~~~~p~i--~~~s~~~iPvrf~yR~Dt~~ 131 (132) T protein:vir:10 79 ARLKAKEIADFFKDGKMLNVG----YIFEGAIVHQIV--KHESGWMIPVRFTVRVDTKE 131 (132) T ss_pred hHHHHHHHHHhccCcceeecc----eecCCCccCCce--eCCcceEEEEEEEEEecccC Confidence 999999999999999999877 899999999988 55779999999999999998 No 3 >protein:vir:104348 Length: 129 # NCBI annotation: hypothetical protein # Family: family:all:5121 # MgeID: mge:1593 # MgeName: RTP # Cross-refs: genbank:acc:YP_398976;genbank:gi:81343960;genbank:GeneID:3778880 Probab=100.00 E-value=1.3e-40 Score=239.33 Aligned_cols=126 Identities=23% Similarity=0.304 Sum_probs=118.2 Q ss_pred CchHHHHHHHHHHHH-hhcCCCCceeeCCCCCCCC--CCcEEEEEEcCCCceeeecCCCceEEEEEEEEEEEEecCCChH Q lcl|NC_018454. 1 MIPDIGAAMNARLGA-WADGQKIPLFIENSPGDKP--AGIFLESFDMPATPQTLDLGLTCHIYPGIFQVNVVVPVGSGTS 77 (137) Q Consensus 1 m~~~Ir~al~~rl~~-~a~~~~~pva~pN~~F~pp--~~~ylr~~~~p~~t~~~~l~~~~~~~~G~~qI~v~~p~G~G~~ 77 (137) |....|+.++.++.+ |+.. +||||||+.|+|| |++|||++++|++|...+|+++|+.|+|+|||+|++|+|+|+. T Consensus 1 ~s~aar~~v~d~~~~~~~~~--lpVA~eNv~FtPp~~G~~YLr~~~lpa~T~~~~L~~d~r~y~Gv~QI~Vv~p~G~G~~ 78 (129) T protein:vir:10 1 MSLAARKFVNDLLVNEFPVR--YPVAWENAAFTPPADGSIWLKYDYTEVDTVTYGLSRKCKYYVGMVQISVFFSPGTGID 78 (129) T ss_pred CchHHHHHHHHHHHHhhcCC--CcEeecCCCcCCCCCCceEEEEEecCCCceeeeccCCCceEEEEEEEEEEecCCCCcc Confidence 999999999888877 7654 7999999999997 3689999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHhhhccceeccCCeEEEecccccccCCcccCCCCEEEEEEEEEEEEE Q lcl|NC_018454. 78 TGRALARQVAALFPEGQSVQGDGFACWISSQPSIYAGVLNPRNTRYSIPVSIPYRAD 134 (137) Q Consensus 78 ~~~~~Ad~i~a~F~~g~~l~~~~~~v~v~~~p~v~~g~~~~~~~~~~ipVsi~yrad 134 (137) +++++||+|+++|++|++|+.| ||++.|+++|.+ +.+..|.+|||+.|||| T Consensus 79 ~a~~iA~ei~d~F~~g~~L~~G----yi~~~~~~~p~i--~~~~~~~ipvr~~~r~d 129 (129) T protein:vir:10 79 KPRQIANQLAESIVDGTMLDSG----TIYESGVVNPVI--KSKSGWFIPVRFYVRLD 129 (129) T ss_pred hhhHHHHHHHHhccCCceeece----eecCCCeECCee--ecCCceEEeEEEEEEeC Confidence 9999999999999999999988 999999999988 44779999999999999 No 4 >protein:vir:79637 Length: 130 # NCBI annotation: gp41 # Family: family:all:5121 # MgeID: mge:1872 # MgeName: TLS # Cross-refs: genbank:acc:YP_001285530;genbank:gi:148734513;genbank:GeneID:5219995 Probab=100.00 E-value=5.2e-40 Score=235.95 Aligned_cols=124 Identities=24% Similarity=0.345 Sum_probs=113.9 Q ss_pred CchHHHHHHHHHHHHhhcCCCCceeeCCCCCCCC--CCcEEEEEEcCCCceeeecCCCceEEEEEEEEEEEEecCCChHH Q lcl|NC_018454. 1 MIPDIGAAMNARLGAWADGQKIPLFIENSPGDKP--AGIFLESFDMPATPQTLDLGLTCHIYPGIFQVNVVVPVGSGTST 78 (137) Q Consensus 1 m~~~Ir~al~~rl~~~a~~~~~pva~pN~~F~pp--~~~ylr~~~~p~~t~~~~l~~~~~~~~G~~qI~v~~p~G~G~~~ 78 (137) |.+.+|+++.++... .|||||||+.|+|| +++|||++++|++|...+|+++|+.|+|+|||+|++|+|+|+.+ T Consensus 5 ~~~aaR~~~~~~~~~-----~lpVA~ENv~FtPp~~G~~YLr~~~lpa~T~~~~L~~d~r~y~Gv~QI~VV~paG~G~~~ 79 (130) T protein:vir:79 5 LSVAARMALAQEYES-----EYMIAYENVEFTPPKGGGIWLKYDYKEADTIIHDLKRKCISYIGMVQIGIEFPPGSGIDK 79 (130) T ss_pred hhHHHHHHHHhhhhh-----hCceeecCCCcCCCCCCceEEEEEecCCCceeeeccCCCceEEEEEEEEEEecCCCCcch Confidence 777777777555443 37999999999997 36899999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHhhhccceeccCCeEEEecccccccCCcccCCCCEEEEEEEEEEEEEe Q lcl|NC_018454. 79 GRALARQVAALFPEGQSVQGDGFACWISSQPSIYAGVLNPRNTRYSIPVSIPYRADI 135 (137) Q Consensus 79 ~~~~Ad~i~a~F~~g~~l~~~~~~v~v~~~p~v~~g~~~~~~~~~~ipVsi~yrad~ 135 (137) ++++||+|+++|++|++|+.| ||++.|+++|.+ +.+..|.+|||+.||||- T Consensus 80 a~~iA~ei~dlF~~g~~L~~G----yi~~~~~~~p~i--~~~~~~~iPvr~~~R~d~ 130 (130) T protein:vir:79 80 ARKLAKNIADFFEDGKMLSNG----YISEGAKVHQVQ--KSESGWFYPVRFYVRYDG 130 (130) T ss_pred hhHHHHHHHHhccCCceeece----eecCCCeECCee--ecCCceEEeEEEEEEecC Confidence 999999999999999999988 999999999988 447799999999999999 No 5 >protein:vir:94921 Length: 125 # NCBI annotation: possible peptidoglycan binding protein # Family: family:all:5248 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239283;genbank:gi:66392065;genbank:GeneID:5076566 Probab=99.91 E-value=5.6e-27 Score=164.49 Aligned_cols=123 Identities=17% Similarity=0.190 Sum_probs=107.5 Q ss_pred Cc-hHHHHHHHHHHHHhhcCCCCceeeCCCCCCCC-CCcEEEEEEcCCCceeeecCCCceEEEEEEEEEEEEecCCChHH Q lcl|NC_018454. 1 MI-PDIGAAMNARLGAWADGQKIPLFIENSPGDKP-AGIFLESFDMPATPQTLDLGLTCHIYPGIFQVNVVVPVGSGTST 78 (137) Q Consensus 1 m~-~~Ir~al~~rl~~~a~~~~~pva~pN~~F~pp-~~~ylr~~~~p~~t~~~~l~~~~~~~~G~~qI~v~~p~G~G~~~ 78 (137) |. .|||++|++||.+ ..++.||+|||.+ || .+.|+|+++.++++..+++|+.|.+++|++.||||+|.|.|..+ T Consensus 1 Mt~~q~r~~I~~r~~a--~~~~~~I~~~N~p--p~~~~~W~Rlti~~g~~~~a~iG~~~~~rtGli~iqiF~p~~~G~~~ 76 (125) T protein:vir:94 1 MSYFQEKLDIENYFKA--NWPDTPIFYENRT--ANSTGTWVRLTIQNGDAFQASNGEVSYRHPGVVFVQIFTKKEVGSGE 76 (125) T ss_pred CCHHHHHHHHHHHHHh--CCCccceeeCCCC--CCCCCceEEEEeccCcccccccCCceeeeeeEEEEEeeecCCcChHH Confidence 75 7999999999985 4567799999975 55 57999999999999999999888899999999999999999999 Q ss_pred HHHHHHHHHHhhhccceeccCCeEEEecccccccCCcccCCCCEEEEEEEEEEEEEe Q lcl|NC_018454. 79 GRALARQVAALFPEGQSVQGDGFACWISSQPSIYAGVLNPRNTRYSIPVSIPYRADI 135 (137) Q Consensus 79 ~~~~Ad~i~a~F~~g~~l~~~~~~v~v~~~p~v~~g~~~~~~~~~~ipVsi~yrad~ 135 (137) +.++||++++||.+. +.+++.++..+.+.+. ++++|||++|+|+||+-+ T Consensus 77 ~~~~ad~~~~~f~~~---~~g~i~f~~~~~~~~g-----~~~gwyQ~Nv~I~f~~~~ 125 (125) T protein:vir:94 77 ALKLADKVDALFRSK---TLGNIQFKVPQVQKVP-----STTEWYQVNVSTEFYRGS 125 (125) T ss_pred HHHHHHHHHHHHccC---CCCceEEeeceecCCC-----CCCCEEEEEEEEeeecCC Confidence 999999999999555 5588888876655442 347799999999999999 No 6 >protein:vir:97211 Length: 150 # NCBI annotation: hypothetical protein ORF026 # Family: family:all:5248 # MgeID: mge:1657 # MgeName: M6 # Cross-refs: genbank:acc:YP_001294534;genbank:gi:149408255;genbank:GeneID:5237076 Probab=99.75 E-value=7.3e-21 Score=130.99 Aligned_cols=130 Identities=18% Similarity=0.224 Sum_probs=102.2 Q ss_pred Cc----hHHHHHHHHHHHH-h-hcC------CCCceeeCCCCCCC-C-C-CcEEEEEEcCCCceeeecC---CCceEEEE Q lcl|NC_018454. 1 MI----PDIGAAMNARLGA-W-ADG------QKIPLFIENSPGDK-P-A-GIFLESFDMPATPQTLDLG---LTCHIYPG 62 (137) Q Consensus 1 m~----~~Ir~al~~rl~~-~-a~~------~~~pva~pN~~F~p-p-~-~~ylr~~~~p~~t~~~~l~---~~~~~~~G 62 (137) |. -|||+.+++++.+ | |.. .++.++|||+.|.+ | + .+|+|+++.+.++...++| +.+.+++| T Consensus 1 ~~~~tF~qaR~ei~t~f~~~W~a~~~a~~g~~p~~~~w~~~~~~~~P~g~~~WaRLti~~~~~~~as~G~~~gr~~~r~G 80 (150) T protein:vir:97 1 MTLPTFDSARDEILGLFNTKWITDTPALNGGAPIRVEWPGVDAGDPPPADKPYARITLRHTTSRQATFGPTGGRRFTRPG 80 (150) T ss_pred CCCCcHHHHHHHHHhhhhhhccccchhhcCCcceeeccCCcccCCCcCCCCceEEEEeeccccccccccCCCCcEEeeCc Confidence 54 6999999999876 7 222 13449999999864 4 4 5799999999999999998 35668899 Q ss_pred EEEEEEEEe--cCCChHHHHHHHHHHHHhhhccceeccCCeEEEecccccccCCcccCCCCEEEEEEEEEEEEEecC Q lcl|NC_018454. 63 IFQVNVVVP--VGSGTSTGRALARQVAALFPEGQSVQGDGFACWISSQPSIYAGVLNPRNTRYSIPVSIPYRADISS 137 (137) Q Consensus 63 ~~qI~v~~p--~G~G~~~~~~~Ad~i~a~F~~g~~l~~~~~~v~v~~~p~v~~g~~~~~~~~~~ipVsi~yrad~~~ 137 (137) ++.||||+| .|+|...+.++||.+.+||+... +.+++.++.. .++..| ++++|||+||+|+|+.|--- T Consensus 81 li~VQiF~p~~~G~G~~la~~~Ad~a~eaFe~~~--t~g~i~f~~a--~~~eig---~~~gWyQ~Nv~i~Feyde~r 150 (150) T protein:vir:97 81 LITVQVFTPLSGGQGLSLAEKCAIIARDAFEGRG--TASGIWFRNA--RIQEIG---PDGAWYQMNVVVEFEYDELR 150 (150) T ss_pred EEEEEEeeeccCCchhhHHHHHHHHHHHHHhccC--CcCCeecccc--cccccC---CCCceEEEEeEeeeeccccC Confidence 999999999 59999999999999999996554 3456666533 223222 34589999999999998777 No 7 >protein:vir:80429 Length: 150 # NCBI annotation: BcepGomrgp11 # Family: family:all:5248 # MgeID: mge:1882 # MgeName: BcepGomr # Cross-refs: genbank:acc:YP_001210231;genbank:gi:146329923;genbank:GeneID:5123538 Probab=99.74 E-value=1.4e-20 Score=129.37 Aligned_cols=130 Identities=16% Similarity=0.171 Sum_probs=100.9 Q ss_pred Cch---HHHHHHHHHHHH-hhcCC------CC-ceeeCCCCCC-CC-C-CcEEEEEEcCCCceeeecCC----CceEEEE Q lcl|NC_018454. 1 MIP---DIGAAMNARLGA-WADGQ------KI-PLFIENSPGD-KP-A-GIFLESFDMPATPQTLDLGL----TCHIYPG 62 (137) Q Consensus 1 m~~---~Ir~al~~rl~~-~a~~~------~~-pva~pN~~F~-pp-~-~~ylr~~~~p~~t~~~~l~~----~~~~~~G 62 (137) |+. |.|+.|++++.+ |...+ .. .|+|||+.|. || + ++|+|+++.++.+.+++|++ .+.+++| T Consensus 1 ~~~~~~~ar~ei~~~f~~~W~~~~~~~~~g~~~~~~w~~~~~~~pP~g~~~WaRLti~h~~~~qA~~~~~~~gr~~~r~G 80 (150) T protein:vir:80 1 MIQDALQARSDINTMLFDQWSVADWSKVKGGKPNIAWEGRESARPPDGSAPYVAIFIKHVDGQQASLTDPDMLRRWSRDG 80 (150) T ss_pred CcchhhhhHHHHHHHHhhhhccCcchhhcCCcceeeecCcccCCcCCCCCceEEEEEecCCcccccccCCCCcceEeeCc Confidence 886 467778888865 74422 12 3999999984 44 4 48999999999999999984 4557889 Q ss_pred EEEEEEEEe--cCCChHHHHHHHHHHHHhhhccceeccCCeEEEecccccccCCcccCCCCEEEEEEEEEEEEEecC Q lcl|NC_018454. 63 IFQVNVVVP--VGSGTSTGRALARQVAALFPEGQSVQGDGFACWISSQPSIYAGVLNPRNTRYSIPVSIPYRADISS 137 (137) Q Consensus 63 ~~qI~v~~p--~G~G~~~~~~~Ad~i~a~F~~g~~l~~~~~~v~v~~~p~v~~g~~~~~~~~~~ipVsi~yrad~~~ 137 (137) ++.||||+| .|+|...+.++||.+.+||+... +.+++.++..+ ++..| ++++|||+||+|+|+.|--- T Consensus 81 lI~VQiF~p~~~G~G~~la~k~Ad~a~eaFe~~~--t~g~i~f~~as--~~eiG---~d~gWYQ~NV~ipF~yde~r 150 (150) T protein:vir:80 81 LITVQCFGMLSAGQGLEDATYQATIAMRAFEGKQ--SANGIWFRNAR--IKEIG---SDRGWYQVNMIVEFEYDEVR 150 (150) T ss_pred EEEEEEeeeccCCchhhHHHHHHHHHHHHHhccC--CCCCccccccc--ccccC---CCCceEEEEeEeeeeccccC Confidence 999999999 59999999999999999996654 33667665432 23222 34589999999999998777 No 8 >protein:vir:95155 Length: 151 # NCBI annotation: hypothetical protein ORF015 # Family: family:all:5248 # MgeID: mge:1552 # MgeName: PA73 # Cross-refs: genbank:acc:YP_001293422;genbank:gi:148912843;genbank:GeneID:5228230 Probab=99.73 E-value=2.4e-20 Score=128.19 Aligned_cols=130 Identities=16% Similarity=0.191 Sum_probs=96.1 Q ss_pred Cc--hHHHHHHHHHHHH-h-hcCCCCc-----eeeCCCC-CCCC-C-CcEEEEEEcCCCceeeecC-------CCceEEE Q lcl|NC_018454. 1 MI--PDIGAAMNARLGA-W-ADGQKIP-----LFIENSP-GDKP-A-GIFLESFDMPATPQTLDLG-------LTCHIYP 61 (137) Q Consensus 1 m~--~~Ir~al~~rl~~-~-a~~~~~p-----va~pN~~-F~pp-~-~~ylr~~~~p~~t~~~~l~-------~~~~~~~ 61 (137) |+ -|||+++.+++.+ | +.+++.. |+|||.. |++| + ++|+|+++.++++.+++|+ +.|.+++ T Consensus 1 ~mtf~q~R~~i~~~~~~~w~~~~~~~a~~~p~v~~~~~~~~d~P~g~~~WaRLti~h~~~~qA~ls~~~eigggp~~~rt 80 (151) T protein:vir:95 1 MIEFDQVNDEVNALFLATWNAGSAAIAGYVPEIRWQGVQYRDLPDGSKFWVRLSKQTVFEEQATLSTCEGVPGQRKYTAS 80 (151) T ss_pred CccHHHHHHHHHHHhhhhcccCchhhhccccccccCCCCCCCCCCCCCceEEEEeecCCCccccccccccCCCCceEeeC Confidence 43 7999999999965 6 4554443 6677765 6655 3 6999999999999999873 4577899 Q ss_pred EEEEEEEEEecCCChHH--HHHHHHHHHHhhhccceeccCCeEEEecccccccCCcccCCCCEEEEEEEEEEEEEecC Q lcl|NC_018454. 62 GIFQVNVVVPVGSGTST--GRALARQVAALFPEGQSVQGDGFACWISSQPSIYAGVLNPRNTRYSIPVSIPYRADISS 137 (137) Q Consensus 62 G~~qI~v~~p~G~G~~~--~~~~Ad~i~a~F~~g~~l~~~~~~v~v~~~p~v~~g~~~~~~~~~~ipVsi~yrad~~~ 137 (137) |++.||||+|+|+|... |+++|+-+++||+... +.+++.++..+.-++ | ++++|||+||+|+|--|--- T Consensus 81 Gli~VQiF~p~~~G~~Le~Adkla~~a~eaFe~~~--t~g~i~f~~~s~~ei--G---~~~gWyQ~Nv~i~f~y~e~~ 151 (151) T protein:vir:95 81 GLVFVQIFCPKSNTQAFELGQKLAKLARNAFRGKS--TPGKVWFRNTRINEL--P---PEELYERFNVVTEFEYDEIG 151 (151) T ss_pred cEEEEEEeeeccCchhhHHHHHHHHHHHHHhhccC--CCCCceeeeeeeccc--C---CCCCeEEEEeeeeecccccC Confidence 99999999999888553 5555555579996654 345666665544222 2 44589999999999998777 No 9 >protein:vir:78379 Length: 139 # NCBI annotation: hypothetical protein # Family: family:all:12033 # MgeID: mge:1851 # MgeName: SETP3 # Cross-refs: genbank:acc:YP_001110845;genbank:gi:134288606;genbank:GeneID:5179642 Probab=99.25 E-value=6.1e-15 Score=98.51 Aligned_cols=133 Identities=23% Similarity=0.288 Sum_probs=116.4 Q ss_pred CchHHHHHHHHHHHHhhcCCCCceeeCCCCCC-CCCCcEEEEEEcCCCceeeecCCCceEEEEEEEEEEEEecCCChHHH Q lcl|NC_018454. 1 MIPDIGAAMNARLGAWADGQKIPLFIENSPGD-KPAGIFLESFDMPATPQTLDLGLTCHIYPGIFQVNVVVPVGSGTSTG 79 (137) Q Consensus 1 m~~~Ir~al~~rl~~~a~~~~~pva~pN~~F~-pp~~~ylr~~~~p~~t~~~~l~~~~~~~~G~~qI~v~~p~G~G~~~~ 79 (137) --.+..+++.+.|.+|..-.+++|+.||+.-. ..+.|||..+++-.++++.+|-... .+.|++||+|.+-..-|...+ T Consensus 4 yfedltkafdtalvafgtnngikvalenidaptstdtpylasymllsdteqadlfwte-qragvyqvdinvgsalgsapi 82 (139) T protein:vir:78 4 YFEDLTKAFDTALVAFGTNNGIKVALENIDAPTSTDTPYLASYMLLSDTEQADLFWTE-QRAGVYQVDINVGSALGSAPI 82 (139) T ss_pred hHHHHHHhhhheeeeeccCCceeEeeeccCCCccCCcchhhheeeeccCcccceeeec-ccCceEEEeeecccccccchh Confidence 34677888999999998888999999999732 2367999999999999999997654 458999999999999999999 Q ss_pred HHHHHHHHHhhhccceeccCCeEEEecccccccCCcccCCCCEEEEEEEEEEEEEecC Q lcl|NC_018454. 80 RALARQVAALFPEGQSVQGDGFACWISSQPSIYAGVLNPRNTRYSIPVSIPYRADISS 137 (137) Q Consensus 80 ~~~Ad~i~a~F~~g~~l~~~~~~v~v~~~p~v~~g~~~~~~~~~~ipVsi~yrad~~~ 137 (137) .++||++.+.|..|.++.++..+..|.+ |+-|+..-+++|-.-|+||+|-|+|.- T Consensus 83 nrladklnaafaagncfsrneicaevqs---vslgplivengwakrplsinfiaftar 137 (139) T protein:vir:78 83 NRLADKLNAAFAAGNCFSRNEICAEVQS---VSLGPLIVENGWAKRPLSINFIAFTAR 137 (139) T ss_pred HHHHhhhhhhhhccccccchhhhhhhhh---ccccceeeccCcccCceeeeeeeeeee Confidence 9999999999999999999988887654 555667778899999999999999999 No 10 >protein:vir:94997 Length: 139 # NCBI annotation: hypothetical protein # Family: family:all:12033 # MgeID: mge:1547 # MgeName: KS7 # Cross-refs: genbank:acc:YP_224021;genbank:gi:62327308;genbank:GeneID:5176825 Probab=99.18 E-value=2.3e-14 Score=95.33 Aligned_cols=133 Identities=21% Similarity=0.286 Sum_probs=116.5 Q ss_pred CchHHHHHHHHHHHHhhcCCCCceeeCCCCCC-CCCCcEEEEEEcCCCceeeecCCCceEEEEEEEEEEEEecCCChHHH Q lcl|NC_018454. 1 MIPDIGAAMNARLGAWADGQKIPLFIENSPGD-KPAGIFLESFDMPATPQTLDLGLTCHIYPGIFQVNVVVPVGSGTSTG 79 (137) Q Consensus 1 m~~~Ir~al~~rl~~~a~~~~~pva~pN~~F~-pp~~~ylr~~~~p~~t~~~~l~~~~~~~~G~~qI~v~~p~G~G~~~~ 79 (137) --.+..+++.+.|..|.....++|+.||+.-. ..+.|||..+++-.++++.+|-... .+.|++||+|.+-..-|...+ T Consensus 4 yfedltkafdtalvtfgtdndikvalenidaptstdapylasymllsdteqadlfwte-qragvyqvdinvgsalgsapi 82 (139) T protein:vir:94 4 YFEDLTKAFDTALVTFGTDNDIKVALENIDAPTSTDAPYLASYMLLSDTEQADLFWTE-QRAGVYQVDINVGSALGSAPI 82 (139) T ss_pred hHHHHHHhhhheeeeeccCCCceEEeeccCCCcccCcchhhheeecccCcccceeeec-ccCceEEEeeecccccccchh Confidence 34677888999999999999999999999732 2367999999999999999997654 458999999999999999999 Q ss_pred HHHHHHHHHhhhccceeccCCeEEEecccccccCCcccCCCCEEEEEEEEEEEEEecC Q lcl|NC_018454. 80 RALARQVAALFPEGQSVQGDGFACWISSQPSIYAGVLNPRNTRYSIPVSIPYRADISS 137 (137) Q Consensus 80 ~~~Ad~i~a~F~~g~~l~~~~~~v~v~~~p~v~~g~~~~~~~~~~ipVsi~yrad~~~ 137 (137) .++||++..-|..|.++.++..+..|.+ |+-|+..-+++|-.-|+||+|-|+|+- T Consensus 83 nrladklnttfaagncfsrneicaevqs---vslgplivengwakrplsinfiaftar 137 (139) T protein:vir:94 83 NRLADKLNTTFAAGNCFSRNEICAEVQS---VSLGPLIVENGWAKRPLSINFIAFTAR 137 (139) T ss_pred HHHHHhhhhhhhccccccchhhhhhhhh---ccccceeeccCcccCceeeeeeeeeee Confidence 9999999999999999999988887654 555667778899999999999999999 No 11 >protein:vir:95111 Length: 145 # NCBI annotation: ORF030 # Family: family:all:296 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240829;genbank:gi:66394699;genbank:GeneID:5133905 Probab=95.97 E-value=0.00072 Score=37.83 Aligned_cols=126 Identities=5% Similarity=-0.038 Sum_probs=77.5 Q ss_pred CchHHHHHHHHHHHHhhcCCCC---ceeeCCCCCCCCCCcEEEEEEcCCCceeeecCCCc-eEEEEEEEEEEEEecCCCh Q lcl|NC_018454. 1 MIPDIGAAMNARLGAWADGQKI---PLFIENSPGDKPAGIFLESFDMPATPQTLDLGLTC-HIYPGIFQVNVVVPVGSGT 76 (137) Q Consensus 1 m~~~Ir~al~~rl~~~a~~~~~---pva~pN~~F~pp~~~ylr~~~~p~~t~~~~l~~~~-~~~~G~~qI~v~~p~G~G~ 76 (137) +...+++||.+||.+-+.-..+ + .|.+.+ .++..||+. -+.....+.+.+| ....-.++|+|+.. +.|. T Consensus 5 ~~~aLq~Ai~~~L~ada~l~alvggr-V~D~~P-~~a~~PYV~----lG~~~~~~~~~~~~~g~~~~~ti~Vws~-~~g~ 77 (145) T protein:vir:95 5 VERYLFNKVYNKLKSNSIIQKQLDGR-VFDCVQ-KDAVYPYIV----VGETNVTNKETTTSMVEDVGITLHVYSQ-ARNR 77 (145) T ss_pred HHHHHHHHHHHHhhcChhHHHhhcCc-eecCCc-CCCCCCEEE----ecCceeeecCCCcccceEEEEEEEEEEc-CCCH Confidence 4477888888888654322110 2 244443 222346654 3555556666555 35677899998864 5689 Q ss_pred HHHHHHHHHHHHhhhccceeccCCeEE-EecccccccCCcccCCCCEEEEEEEEEEEEEecC Q lcl|NC_018454. 77 STGRALARQVAALFPEGQSVQGDGFAC-WISSQPSIYAGVLNPRNTRYSIPVSIPYRADISS 137 (137) Q Consensus 77 ~~~~~~Ad~i~a~F~~g~~l~~~~~~v-~v~~~p~v~~g~~~~~~~~~~ipVsi~yrad~~~ 137 (137) .++.+||+.|.+..... +...++.+ .+.- .-..-..++++..+..-++|.+|..-++ T Consensus 78 ~eak~ia~av~~aL~~~--l~l~~~~lv~l~~--~~~~~~rd~dg~~~hgvl~~ra~ve~~~ 135 (145) T protein:vir:95 78 DEASQIIQFLGFVLNNE--IEIDYYSFIKSRI--DTQEVITDIDRYTKHGIIRLVFKYRHNT 135 (145) T ss_pred HHHHHHHHHHHHHhccc--cCCCCCeEEEeEE--eeeeEeecCCCceEEEEEEEEEEEEecc Confidence 99999999999988654 44444443 1111 1122234445557888888888888888 No 12 >protein:vir:94488 Length: 145 # NCBI annotation: ORF032 # Family: family:all:296 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240682;genbank:gi:66396364;genbank:GeneID:5133752 Probab=95.90 E-value=0.00081 Score=37.52 Aligned_cols=126 Identities=5% Similarity=-0.050 Sum_probs=77.4 Q ss_pred CchHHHHHHHHHHHHhhcCCC---CceeeCCCCCCCCCCcEEEEEEcCCCceeeecCCCc-eEEEEEEEEEEEEecCCCh Q lcl|NC_018454. 1 MIPDIGAAMNARLGAWADGQK---IPLFIENSPGDKPAGIFLESFDMPATPQTLDLGLTC-HIYPGIFQVNVVVPVGSGT 76 (137) Q Consensus 1 m~~~Ir~al~~rl~~~a~~~~---~pva~pN~~F~pp~~~ylr~~~~p~~t~~~~l~~~~-~~~~G~~qI~v~~p~G~G~ 76 (137) +...+++||.+||.+-+.-.. -+ .|.+.+ .++..||+. -+..+..+.+.+| ....-.++|+|+.. +.|. T Consensus 5 ~~~aLq~Ai~~~L~ada~l~alvggr-I~D~~P-~~a~~PYV~----lG~~~~~d~~~~~~~g~~~~~ti~Vws~-~~g~ 77 (145) T protein:vir:94 5 VERYLFNKVYNKLKSNLIIQKQLDGR-VFDCVQ-KDAVYPYIV----VGETNVTNKETTTSMVEDVGITLHVYSQ-ARNR 77 (145) T ss_pred HHHHHHHHHHHHhhcChhHHHhhcCc-eecCCc-CCCCCCEEE----eCCceeeecCCCcccceEEEEEEEEEEc-CCCH Confidence 447788888888865432111 02 244443 222346644 3555566666565 35677899999875 6789 Q ss_pred HHHHHHHHHHHHhhhccceeccCCeEE-EecccccccCCcccCCCCEEEEEEEEEEEEEecC Q lcl|NC_018454. 77 STGRALARQVAALFPEGQSVQGDGFAC-WISSQPSIYAGVLNPRNTRYSIPVSIPYRADISS 137 (137) Q Consensus 77 ~~~~~~Ad~i~a~F~~g~~l~~~~~~v-~v~~~p~v~~g~~~~~~~~~~ipVsi~yrad~~~ 137 (137) .++.+||+.|.+..... +...++.+ .+.- .-..-..++++..+..-++|.+|..-++ T Consensus 78 ~eak~ia~av~~aL~~~--l~l~~~~lv~l~~--~~~~~~rd~dg~~~hgvl~fra~ve~~~ 135 (145) T protein:vir:94 78 DEASQIIQFLGFVLNNE--IEIDYYSFIKSRI--DTQEVITDIDQYTKHGIIRLVFKYRHNT 135 (145) T ss_pred HHHHHHHHHHHHHhccc--cCCCCCeEEEeEE--eeeeEeecCCcceEEEEEEEEEEEEecc Confidence 99999999999888654 44444443 2111 1122233444556777788888888888 No 13 >protein:vir:97421 Length: 145 # NCBI annotation: ORF030 # Family: family:all:296 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240755;genbank:gi:66396436;genbank:GeneID:5133777 Probab=95.90 E-value=0.00081 Score=37.52 Aligned_cols=126 Identities=5% Similarity=-0.050 Sum_probs=77.4 Q ss_pred CchHHHHHHHHHHHHhhcCCC---CceeeCCCCCCCCCCcEEEEEEcCCCceeeecCCCc-eEEEEEEEEEEEEecCCCh Q lcl|NC_018454. 1 MIPDIGAAMNARLGAWADGQK---IPLFIENSPGDKPAGIFLESFDMPATPQTLDLGLTC-HIYPGIFQVNVVVPVGSGT 76 (137) Q Consensus 1 m~~~Ir~al~~rl~~~a~~~~---~pva~pN~~F~pp~~~ylr~~~~p~~t~~~~l~~~~-~~~~G~~qI~v~~p~G~G~ 76 (137) +...+++||.+||.+-+.-.. -+ .|.+.+ .++..||+. -+..+..+.+.+| ....-.++|+|+.. +.|. T Consensus 5 ~~~aLq~Ai~~~L~ada~l~alvggr-I~D~~P-~~a~~PYV~----lG~~~~~d~~~~~~~g~~~~~ti~Vws~-~~g~ 77 (145) T protein:vir:97 5 VERYLFNKVYNKLKSNLIIQKQLDGR-VFDCVQ-KDAVYPYIV----VGETNVTNKETTTSMVEDVGITLHVYSQ-ARNR 77 (145) T ss_pred HHHHHHHHHHHHhhcChhHHHhhcCc-eecCCc-CCCCCCEEE----eCCceeeecCCCcccceEEEEEEEEEEc-CCCH Confidence 447788888888865432111 02 244443 222346644 3555566666565 35677899999875 6789 Q ss_pred HHHHHHHHHHHHhhhccceeccCCeEE-EecccccccCCcccCCCCEEEEEEEEEEEEEecC Q lcl|NC_018454. 77 STGRALARQVAALFPEGQSVQGDGFAC-WISSQPSIYAGVLNPRNTRYSIPVSIPYRADISS 137 (137) Q Consensus 77 ~~~~~~Ad~i~a~F~~g~~l~~~~~~v-~v~~~p~v~~g~~~~~~~~~~ipVsi~yrad~~~ 137 (137) .++.+||+.|.+..... +...++.+ .+.- .-..-..++++..+..-++|.+|..-++ T Consensus 78 ~eak~ia~av~~aL~~~--l~l~~~~lv~l~~--~~~~~~rd~dg~~~hgvl~fra~ve~~~ 135 (145) T protein:vir:97 78 DEASQIIQFLGFVLNNE--IEIDYYSFIKSRI--DTQEVITDIDQYTKHGIIRLVFKYRHNT 135 (145) T ss_pred HHHHHHHHHHHHHhccc--cCCCCCeEEEeEE--eeeeEeecCCcceEEEEEEEEEEEEecc Confidence 99999999999888654 44444443 2111 1122233444556777788888888888 No 14 >protein:vir:93736 Length: 145 # NCBI annotation: ORF030 # Family: family:all:296 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240465;genbank:gi:66396143;genbank:GeneID:5133505 Probab=95.90 E-value=0.00081 Score=37.52 Aligned_cols=126 Identities=5% Similarity=-0.050 Sum_probs=77.4 Q ss_pred CchHHHHHHHHHHHHhhcCCC---CceeeCCCCCCCCCCcEEEEEEcCCCceeeecCCCc-eEEEEEEEEEEEEecCCCh Q lcl|NC_018454. 1 MIPDIGAAMNARLGAWADGQK---IPLFIENSPGDKPAGIFLESFDMPATPQTLDLGLTC-HIYPGIFQVNVVVPVGSGT 76 (137) Q Consensus 1 m~~~Ir~al~~rl~~~a~~~~---~pva~pN~~F~pp~~~ylr~~~~p~~t~~~~l~~~~-~~~~G~~qI~v~~p~G~G~ 76 (137) +...+++||.+||.+-+.-.. -+ .|.+.+ .++..||+. -+..+..+.+.+| ....-.++|+|+.. +.|. T Consensus 5 ~~~aLq~Ai~~~L~ada~l~alvggr-I~D~~P-~~a~~PYV~----lG~~~~~d~~~~~~~g~~~~~ti~Vws~-~~g~ 77 (145) T protein:vir:93 5 VERYLFNKVYNKLKSNLIIQKQLDGR-VFDCVQ-KDAVYPYIV----VGETNVTNKETTTSMVEDVGITLHVYSQ-ARNR 77 (145) T ss_pred HHHHHHHHHHHHhhcChhHHHhhcCc-eecCCc-CCCCCCEEE----eCCceeeecCCCcccceEEEEEEEEEEc-CCCH Confidence 447788888888865432111 02 244443 222346644 3555566666565 35677899999875 6789 Q ss_pred HHHHHHHHHHHHhhhccceeccCCeEE-EecccccccCCcccCCCCEEEEEEEEEEEEEecC Q lcl|NC_018454. 77 STGRALARQVAALFPEGQSVQGDGFAC-WISSQPSIYAGVLNPRNTRYSIPVSIPYRADISS 137 (137) Q Consensus 77 ~~~~~~Ad~i~a~F~~g~~l~~~~~~v-~v~~~p~v~~g~~~~~~~~~~ipVsi~yrad~~~ 137 (137) .++.+||+.|.+..... +...++.+ .+.- .-..-..++++..+..-++|.+|..-++ T Consensus 78 ~eak~ia~av~~aL~~~--l~l~~~~lv~l~~--~~~~~~rd~dg~~~hgvl~fra~ve~~~ 135 (145) T protein:vir:93 78 DEASQIIQFLGFVLNNE--IEIDYYSFIKSRI--DTQEVITDIDQYTKHGIIRLVFKYRHNT 135 (145) T ss_pred HHHHHHHHHHHHHhccc--cCCCCCeEEEeEE--eeeeEeecCCcceEEEEEEEEEEEEecc Confidence 99999999999888654 44444443 2111 1122233444556777788888888888 No 15 >protein:vir:94096 Length: 141 # NCBI annotation: ORF031 # Family: family:all:296 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240240;genbank:gi:66395916;genbank:GeneID:5133265 Probab=95.87 E-value=0.0012 Score=36.66 Aligned_cols=127 Identities=8% Similarity=-0.001 Sum_probs=74.9 Q ss_pred Cc----hHHHHHHHHHHHHhhcCCC---CceeeCCCCCCCCCCcEEEEEEcCCCceeeecCCCce-EEEEEEEEEEEEec Q lcl|NC_018454. 1 MI----PDIGAAMNARLGAWADGQK---IPLFIENSPGDKPAGIFLESFDMPATPQTLDLGLTCH-IYPGIFQVNVVVPV 72 (137) Q Consensus 1 m~----~~Ir~al~~rl~~~a~~~~---~pva~pN~~F~pp~~~ylr~~~~p~~t~~~~l~~~~~-~~~G~~qI~v~~p~ 72 (137) |. ...++||.+||.+-+.-.. -+| |.+++ .++..|| +.-+..+..+.+.+|. ...-.++|+|+. . T Consensus 1 Msms~~~aLQ~Ai~~~L~adaal~alvg~rI-~D~~P-~~~~~PY----v~lG~~~~~~~~~~~~~g~~~~~ti~Vws-~ 73 (141) T protein:vir:94 1 MWVSVEPELTNQIYKRLISDPNINKLVDDRV-FDVVQ-DDAVYPY----IVVGESNVTNNESSATMRETVGIVIHVYS-Q 73 (141) T ss_pred CccchhHHHHHHHHHHhhcChhhHhhcCCcc-ccCCc-cCCCCCE----EEeCCceeeecCCCcccceEEEEEEEEEE-c Confidence 44 6788999999976332111 122 44433 1122355 4446666677776663 567889999987 5 Q ss_pred CCChHHHHHHHHHHHHhhhccceeccCCeEEEecccccccCCcccCCCCEEEEEEEEEEEEEecC Q lcl|NC_018454. 73 GSGTSTGRALARQVAALFPEGQSVQGDGFACWISSQPSIYAGVLNPRNTRYSIPVSIPYRADISS 137 (137) Q Consensus 73 G~G~~~~~~~Ad~i~a~F~~g~~l~~~~~~v~v~~~p~v~~g~~~~~~~~~~ipVsi~yrad~~~ 137 (137) +.|..++.++|+.|.+..... +...++.+. .-...-..-..++++..+.--++|.+|.--|+ T Consensus 74 ~~g~~eak~ia~av~~AL~~~--l~l~~~~lv-~l~~~~~~~~rd~dg~t~hgvl~~ra~v~~~~ 135 (141) T protein:vir:94 74 FATQYEAKLILSAIGYVLNRP--IEIDNYEFQ-FSRIDSQAVFPDIDRFTKHGTIRLLFKYRHKK 135 (141) T ss_pred CCCHHHHHHHHHHHHHHhccc--ccCCCceEE-EEEEeeeeeeecCCCceEEEEEEEEEEEEecc Confidence 668999999999999998543 445555442 11111122233333334555566666666666 No 16 >protein:vir:105892 Length: 141 # NCBI annotation: tail protein # Family: family:all:296 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004380;genbank:gi:122891835;genbank:GeneID:4712363 Probab=95.87 E-value=0.0012 Score=36.66 Aligned_cols=127 Identities=8% Similarity=-0.001 Sum_probs=74.9 Q ss_pred Cc----hHHHHHHHHHHHHhhcCCC---CceeeCCCCCCCCCCcEEEEEEcCCCceeeecCCCce-EEEEEEEEEEEEec Q lcl|NC_018454. 1 MI----PDIGAAMNARLGAWADGQK---IPLFIENSPGDKPAGIFLESFDMPATPQTLDLGLTCH-IYPGIFQVNVVVPV 72 (137) Q Consensus 1 m~----~~Ir~al~~rl~~~a~~~~---~pva~pN~~F~pp~~~ylr~~~~p~~t~~~~l~~~~~-~~~G~~qI~v~~p~ 72 (137) |. ...++||.+||.+-+.-.. -+| |.+++ .++..|| +.-+..+..+.+.+|. ...-.++|+|+. . T Consensus 1 Msms~~~aLQ~Ai~~~L~adaal~alvg~rI-~D~~P-~~~~~PY----v~lG~~~~~~~~~~~~~g~~~~~ti~Vws-~ 73 (141) T protein:vir:10 1 MWVSVEPELTNQIYKRLISDPNINKLVDDRV-FDVVQ-DDAVYPY----IVVGESNVTNNESSATMRETVGIVIHVYS-Q 73 (141) T ss_pred CccchhHHHHHHHHHHhhcChhhHhhcCCcc-ccCCc-cCCCCCE----EEeCCceeeecCCCcccceEEEEEEEEEE-c Confidence 44 6788999999976332111 122 44433 1122355 4446666677776663 567889999987 5 Q ss_pred CCChHHHHHHHHHHHHhhhccceeccCCeEEEecccccccCCcccCCCCEEEEEEEEEEEEEecC Q lcl|NC_018454. 73 GSGTSTGRALARQVAALFPEGQSVQGDGFACWISSQPSIYAGVLNPRNTRYSIPVSIPYRADISS 137 (137) Q Consensus 73 G~G~~~~~~~Ad~i~a~F~~g~~l~~~~~~v~v~~~p~v~~g~~~~~~~~~~ipVsi~yrad~~~ 137 (137) +.|..++.++|+.|.+..... +...++.+. .-...-..-..++++..+.--++|.+|.--|+ T Consensus 74 ~~g~~eak~ia~av~~AL~~~--l~l~~~~lv-~l~~~~~~~~rd~dg~t~hgvl~~ra~v~~~~ 135 (141) T protein:vir:10 74 FATQYEAKLILSAIGYVLNRP--IEIDNYEFQ-FSRIDSQAVFPDIDRFTKHGTIRLLFKYRHKK 135 (141) T ss_pred CCCHHHHHHHHHHHHHHhccc--ccCCCceEE-EEEEeeeeeeecCCCceEEEEEEEEEEEEecc Confidence 668999999999999998543 445555442 11111122233333334555566666666666 No 17 >protein:vir:96260 Length: 141 # NCBI annotation: ORF027 # Family: family:all:296 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240317;genbank:gi:66395991;genbank:GeneID:5133337 Probab=95.87 E-value=0.0012 Score=36.66 Aligned_cols=127 Identities=8% Similarity=-0.001 Sum_probs=74.9 Q ss_pred Cc----hHHHHHHHHHHHHhhcCCC---CceeeCCCCCCCCCCcEEEEEEcCCCceeeecCCCce-EEEEEEEEEEEEec Q lcl|NC_018454. 1 MI----PDIGAAMNARLGAWADGQK---IPLFIENSPGDKPAGIFLESFDMPATPQTLDLGLTCH-IYPGIFQVNVVVPV 72 (137) Q Consensus 1 m~----~~Ir~al~~rl~~~a~~~~---~pva~pN~~F~pp~~~ylr~~~~p~~t~~~~l~~~~~-~~~G~~qI~v~~p~ 72 (137) |. ...++||.+||.+-+.-.. -+| |.+++ .++..|| +.-+..+..+.+.+|. ...-.++|+|+. . T Consensus 1 Msms~~~aLQ~Ai~~~L~adaal~alvg~rI-~D~~P-~~~~~PY----v~lG~~~~~~~~~~~~~g~~~~~ti~Vws-~ 73 (141) T protein:vir:96 1 MWVSVEPELTNQIYKRLISDPNINKLVDDRV-FDVVQ-DDAVYPY----IVVGESNVTNNESSATMRETVGIVIHVYS-Q 73 (141) T ss_pred CccchhHHHHHHHHHHhhcChhhHhhcCCcc-ccCCc-cCCCCCE----EEeCCceeeecCCCcccceEEEEEEEEEE-c Confidence 44 6788999999976332111 122 44433 1122355 4446666677776663 567889999987 5 Q ss_pred CCChHHHHHHHHHHHHhhhccceeccCCeEEEecccccccCCcccCCCCEEEEEEEEEEEEEecC Q lcl|NC_018454. 73 GSGTSTGRALARQVAALFPEGQSVQGDGFACWISSQPSIYAGVLNPRNTRYSIPVSIPYRADISS 137 (137) Q Consensus 73 G~G~~~~~~~Ad~i~a~F~~g~~l~~~~~~v~v~~~p~v~~g~~~~~~~~~~ipVsi~yrad~~~ 137 (137) +.|..++.++|+.|.+..... +...++.+. .-...-..-..++++..+.--++|.+|.--|+ T Consensus 74 ~~g~~eak~ia~av~~AL~~~--l~l~~~~lv-~l~~~~~~~~rd~dg~t~hgvl~~ra~v~~~~ 135 (141) T protein:vir:96 74 FATQYEAKLILSAIGYVLNRP--IEIDNYEFQ-FSRIDSQAVFPDIDRFTKHGTIRLLFKYRHKK 135 (141) T ss_pred CCCHHHHHHHHHHHHHHhccc--ccCCCceEE-EEEEeeeeeeecCCCceEEEEEEEEEEEEecc Confidence 668999999999999998543 445555442 11111122233333334555566666666666 No 18 >protein:vir:4348 Length: 121 # NCBI annotation: Orf15 # Family: family:all:896 # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061511;genbank:gi:9635607;genbank:GeneID:1262874 Probab=95.64 E-value=0.0014 Score=36.24 Aligned_cols=118 Identities=11% Similarity=0.091 Sum_probs=72.2 Q ss_pred CchHHHHHHHH--HHHHhhcCCCCceeeC-CCCCCCCCCcEEEEEEcCCCceeeecCCCceEEEEEEEEEEEEecCCChH Q lcl|NC_018454. 1 MIPDIGAAMNA--RLGAWADGQKIPLFIE-NSPGDKPAGIFLESFDMPATPQTLDLGLTCHIYPGIFQVNVVVPVGSGTS 77 (137) Q Consensus 1 m~~~Ir~al~~--rl~~~a~~~~~pva~p-N~~F~pp~~~ylr~~~~p~~t~~~~l~~~~~~~~G~~qI~v~~p~G~G~~ 77 (137) |.+-|+++|++ .+.++.+..+ .-.|| +++=.-+-.||+-...+.+..... |+|.+....+.+||+|+.. -.. T Consensus 1 m~~~i~~~l~~d~~v~allg~~~-~Rvyp~~~aP~~~~~Pyiv~q~vsg~p~~~-l~g~~~~~~~~vQIDvyA~---t~~ 75 (121) T protein:vir:43 1 MYPPIFKVCSSSPAVTAILGASP-LRMYQFGLAPQLVVKPYATWQTISGSPENY-LWGRPDADGFTIQVDIFSA---TAA 75 (121) T ss_pred CChHHHHHHhhChhhhhhhcCCC-ceeeccCCCCCCCcCCeEEEEEecCcccce-ecCCCCcceeEEEEEeeeC---CHH Confidence 99999999997 4455655433 24565 443011124788877777655554 7777667789999999954 457 Q ss_pred HHHHHHHHHHHhhhccceeccCCeEEEecccccccCCcccCCCCEEEEEEEEEEEEEe Q lcl|NC_018454. 78 TGRALARQVAALFPEGQSVQGDGFACWISSQPSIYAGVLNPRNTRYSIPVSIPYRADI 135 (137) Q Consensus 78 ~~~~~Ad~i~a~F~~g~~l~~~~~~v~v~~~p~v~~g~~~~~~~~~~ipVsi~yrad~ 135 (137) +|.+++++|++..+ +... .+. ... ... +++-.-|.+-+-|+|--.- T Consensus 76 ~A~~l~~av~~Al~-~~~~-----~~~--~~~---~~y-e~dT~lyR~s~Dv~w~~~r 121 (121) T protein:vir:43 76 EARDAAKAIRDAIE-LSAY-----VVR--WGG---ESV-DPDTKTYRVSFDVDWIVQR 121 (121) T ss_pred HHHHHHHHHHHHhh-hcCC-----ccc--CCC---CCC-cccccceeeeeEEEEeecC Confidence 88999999998874 2221 111 111 111 1222357766666664444 No 19 >protein:vir:96125 Length: 140 # NCBI annotation: ORF038 # Family: family:all:296 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240084;genbank:gi:66395765;genbank:GeneID:5133106 Probab=95.43 E-value=0.0021 Score=35.28 Aligned_cols=127 Identities=5% Similarity=-0.097 Sum_probs=74.9 Q ss_pred Cch--HHHHHHHHHHHHhhcCCCC---ceeeCCCCCCCCCCcEEEEEEcCCCceeeecCCCce-EEEEEEEEEEEEecCC Q lcl|NC_018454. 1 MIP--DIGAAMNARLGAWADGQKI---PLFIENSPGDKPAGIFLESFDMPATPQTLDLGLTCH-IYPGIFQVNVVVPVGS 74 (137) Q Consensus 1 m~~--~Ir~al~~rl~~~a~~~~~---pva~pN~~F~pp~~~ylr~~~~p~~t~~~~l~~~~~-~~~G~~qI~v~~p~G~ 74 (137) |.+ .+++||.+||.+-+.-..+ + .|.+.+ .++.-||+.+ +.++..+-+.+|. ...=.++|+|+. ... T Consensus 3 msa~~aLq~Ai~~~L~ad~~l~alvggr-VyD~~P-~~~~~PYV~l----G~~~~~~~~~~~~~g~~~~~tl~Vws-~~~ 75 (140) T protein:vir:96 3 VTAEPLLYNKIMNNLIENPITDKLVGGR-VFDCVQ-KDVVYPYIVV----GESNVTESERSPGMREIIAITFHVYS-QYE 75 (140) T ss_pred cchhHHHHHHHHHHhccChhHHhhcCcc-cccCCc-cCCCCCEEEe----CCceeeecCCCcccceEEEEEEEEEE-cCC Confidence 655 5788888888754322110 2 244433 2223466643 4555555555542 445678889765 577 Q ss_pred ChHHHHHHHHHHHHhhhccceeccCCeEEEecccccccCCcccCCCCEEEEEEEEEEEEEecC Q lcl|NC_018454. 75 GTSTGRALARQVAALFPEGQSVQGDGFACWISSQPSIYAGVLNPRNTRYSIPVSIPYRADISS 137 (137) Q Consensus 75 G~~~~~~~Ad~i~a~F~~g~~l~~~~~~v~v~~~p~v~~g~~~~~~~~~~ipVsi~yrad~~~ 137 (137) |..++.++|+.|.+.... .+..+++.+. .-.-.-..-..++++..+.--++|.+|-.-|+ T Consensus 76 g~~ea~~ia~ai~~aL~~--~l~l~~~~lv-~l~~~~~~~~rd~dg~t~hgvl~~ra~ve~~~ 135 (140) T protein:vir:96 76 NGAEARELLKYLNYACRL--NINFKDYELE-WIKKDNSQVFTDIDQYTKHGVLRLLYKVRHKT 135 (140) T ss_pred CHHHHHHHHHHHHHHhcC--CccCCCceEE-EEEEeeeEEeecCCCceEEEEEEEEEEEeecc Confidence 999999999999999843 5565665542 11111122233344444666688888888888 No 20 >protein:vir:100242 Length: 114 # NCBI annotation: gp71 # Family: family:all:2712 # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355407;genbank:gi:77864697;genbank:GeneID:3725964 Probab=95.29 E-value=0.00049 Score=38.72 Aligned_cols=112 Identities=16% Similarity=0.195 Sum_probs=69.6 Q ss_pred CchHHHHHHHHHHHHhhcCCCCceeeCCCCCCCCC--CcEEEEEEcCCCceeeecCCCceEEEEEEEEEEEEecCCChHH Q lcl|NC_018454. 1 MIPDIGAAMNARLGAWADGQKIPLFIENSPGDKPA--GIFLESFDMPATPQTLDLGLTCHIYPGIFQVNVVVPVGSGTST 78 (137) Q Consensus 1 m~~~Ir~al~~rl~~~a~~~~~pva~pN~~F~pp~--~~ylr~~~~p~~t~~~~l~~~~~~~~G~~qI~v~~p~G~G~~~ 78 (137) |..- .|.++|.++++.+ .|.++. |.+ .||+...-+ +...-.+|+|..-...|.|||+|+.+. -.+ T Consensus 1 ~~~~---~i~~~l~~~~g~~----~~~~~a--P~~~~~Py~vy~rv-sg~p~~tL~G~~g~~~~r~QiD~yA~T---~~e 67 (114) T protein:vir:10 1 MSAL---TIRDAIGIVGGAK----GYVSVA--SSAAQSPYYVVSRV-SGTRDMALGGATGGKSGMFQIDVYAKT---YTE 67 (114) T ss_pred Ccee---eeehhhccccccc----ccCCCC--CCCCCCceEEEEec-cCcccccccCCCCcceEEEEEEeeeCC---HHH Confidence 7654 4556666777653 344443 333 477764433 344456788887788999999999875 568 Q ss_pred HHHHHHHHHHhhhccceeccCCeEEEecccccccCCcccCCCCEEEEEEEEEE Q lcl|NC_018454. 79 GRALARQVAALFPEGQSVQGDGFACWISSQPSIYAGVLNPRNTRYSIPVSIPY 131 (137) Q Consensus 79 ~~~~Ad~i~a~F~~g~~l~~~~~~v~v~~~p~v~~g~~~~~~~~~~ipVsi~y 131 (137) |+++|+++.++-..+..|..+ .+.+.+...- .++.-=+.++-+||.| T Consensus 68 A~~La~~~~~~l~~~~~f~~~----~l~~~~d~ye--~dT~l~Rvsld~si~f 114 (114) T protein:vir:10 68 ADSLADQIIDRVESTGMFSVG----GVSDLPDDYS--SDTGVFRVSLEISVQF 114 (114) T ss_pred HHHHHHHHHhhcccccCeeee----ccccCCCCCC--cccCceEEEEEEEEeC Confidence 999999998887665544422 2444433321 1122125677788888 No 21 >protein:vir:1244 Length: 145 # NCBI annotation: similar to phage Spp1 gp17 # Family: family:all:296 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510943;genbank:gi:17426277;genbank:GeneID:927402 Probab=95.00 E-value=0.0021 Score=35.31 Aligned_cols=126 Identities=4% Similarity=-0.041 Sum_probs=73.1 Q ss_pred CchH----HHHHHHHHHHHhhc---CCCCceeeCCCCCCCCCCcEEEEEEcCCCceeeecCCCce-EEEEEEEEEEEEec Q lcl|NC_018454. 1 MIPD----IGAAMNARLGAWAD---GQKIPLFIENSPGDKPAGIFLESFDMPATPQTLDLGLTCH-IYPGIFQVNVVVPV 72 (137) Q Consensus 1 m~~~----Ir~al~~rl~~~a~---~~~~pva~pN~~F~pp~~~ylr~~~~p~~t~~~~l~~~~~-~~~G~~qI~v~~p~ 72 (137) |... +++||.+||.+-+. .-+.+ .|++.+ .++.-||+. -+.+...+.+.+|. ...=.++|+|+.. T Consensus 1 M~~s~~~aLq~ai~~~L~ad~~l~~lvg~~-vyD~~P-~~~~~PyV~----lG~~~~~~~~t~~~~~~~~~lti~Vws~- 73 (145) T protein:vir:12 1 MWVSVERYLFNKVYNKLKSNPIIQKQLGGR-VFDCVQ-KDAVYPYIV----VGETNVTNKETTTSMVEDVGITLHVYSQ- 73 (145) T ss_pred CcccHHHHHHHHHHHHhhcChhHHHhcCcc-cccCCc-cCCCCCEEE----eccceeeecCCCcccceEEEEEEEEEEc- Confidence 7644 46666666643211 11112 355544 233346754 45666666666653 4567889999875 Q ss_pred CCChHHHHHHHHHHHHhhhccceeccCCeEE-EecccccccCCcccCCCCEEEEEEEEEEEEEecC Q lcl|NC_018454. 73 GSGTSTGRALARQVAALFPEGQSVQGDGFAC-WISSQPSIYAGVLNPRNTRYSIPVSIPYRADISS 137 (137) Q Consensus 73 G~G~~~~~~~Ad~i~a~F~~g~~l~~~~~~v-~v~~~p~v~~g~~~~~~~~~~ipVsi~yrad~~~ 137 (137) ..|..++.++|+.|.+..... +...++.+ .+.- +-.....++++..+.--+++.++-..++ T Consensus 74 ~~gr~ea~~ia~ai~~aL~~~--l~l~~~~lv~l~~--~~~~~~rd~d~~~~hgvl~~ra~i~~~~ 135 (145) T protein:vir:12 74 ARNRDEASQIIQFLGFVLNNE--IEIDYYSFIKSRI--DTQEVITDIDQYTKHGIIRLVFKYRHNT 135 (145) T ss_pred CccHHHHHHHHHHHHHHhccc--cCCCCceEEEEEE--eeEEEEecCCCceEEEEEEEEEEEEeCC Confidence 558999999999999887544 44444443 2211 1122334444556666677777777777 No 22 >protein:vir:95961 Length: 145 # NCBI annotation: ORF032 # Family: family:all:296 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240391;genbank:gi:66396072;genbank:GeneID:5133472 Probab=94.86 E-value=0.0026 Score=34.72 Aligned_cols=127 Identities=6% Similarity=-0.021 Sum_probs=77.0 Q ss_pred CchHHHHHHHHHHHHhhcCCC---CceeeCCCCCCCCCCcEEEEEEcCCCceeeecCCCc-eEEEEEEEEEEEEecCCCh Q lcl|NC_018454. 1 MIPDIGAAMNARLGAWADGQK---IPLFIENSPGDKPAGIFLESFDMPATPQTLDLGLTC-HIYPGIFQVNVVVPVGSGT 76 (137) Q Consensus 1 m~~~Ir~al~~rl~~~a~~~~---~pva~pN~~F~pp~~~ylr~~~~p~~t~~~~l~~~~-~~~~G~~qI~v~~p~G~G~ 76 (137) +...+++||.+||.+-+.-.. -+| |.+.+ .++..||+. -+..+..+.+.+| ....-.++|+|+.. ..|. T Consensus 5 ~~~aLq~Ai~~~L~ada~l~alvggrV-~D~~P-~~~~~PYv~----lG~~~~~d~~~~~~~g~~~~~ti~Vws~-~~g~ 77 (145) T protein:vir:95 5 VERYLFNKVYNKLKSNPIIQKQLDGRV-FDCVQ-KDAVYPYIV----VGETNVTNKETTTSMVEDVGITLHVYSQ-ARNR 77 (145) T ss_pred HHHHHHHHHHHHhhcCHhHHHhhcccc-ccCCc-CCCCCCEEE----ecCceeeecCCCcccceEEEEEEEEEEc-CCCH Confidence 446778888888865332111 022 33333 122246644 4556666666665 35677899999874 5689 Q ss_pred HHHHHHHHHHHHhhhccceeccCCeEEEecccccccCCcccCCCCEEEEEEEEEEEEEecC Q lcl|NC_018454. 77 STGRALARQVAALFPEGQSVQGDGFACWISSQPSIYAGVLNPRNTRYSIPVSIPYRADISS 137 (137) Q Consensus 77 ~~~~~~Ad~i~a~F~~g~~l~~~~~~v~v~~~p~v~~g~~~~~~~~~~ipVsi~yrad~~~ 137 (137) .++.++|+.|.+..... +...++.+. .-.-.-..-..++++..+..-++|.+|..-|+ T Consensus 78 ~eak~ia~av~~aL~~~--l~l~~~~lv-~l~~~~~~~~rd~dg~~~hgvl~fra~ve~~~ 135 (145) T protein:vir:95 78 DEASQIIQFLGFVLNNE--IEIDYYSFI-KSRIDTQEVITDIDQYTKHGVIRLVFKYRHNT 135 (145) T ss_pred HHHHHHHHHHHHHhccc--cCCCCCeEE-EeEEeeeeEeecCCCceEEEEEEEEEEEEecc Confidence 99999999999988654 444444431 11111122234445557888888888888888 No 23 >protein:vir:94794 Length: 145 # NCBI annotation: ORF028 # Family: family:all:296 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240542;genbank:gi:66396219;genbank:GeneID:5133574 Probab=94.85 E-value=0.0027 Score=34.70 Aligned_cols=127 Identities=6% Similarity=-0.024 Sum_probs=77.0 Q ss_pred CchHHHHHHHHHHHHhhcCCC---CceeeCCCCCCCCCCcEEEEEEcCCCceeeecCCCc-eEEEEEEEEEEEEecCCCh Q lcl|NC_018454. 1 MIPDIGAAMNARLGAWADGQK---IPLFIENSPGDKPAGIFLESFDMPATPQTLDLGLTC-HIYPGIFQVNVVVPVGSGT 76 (137) Q Consensus 1 m~~~Ir~al~~rl~~~a~~~~---~pva~pN~~F~pp~~~ylr~~~~p~~t~~~~l~~~~-~~~~G~~qI~v~~p~G~G~ 76 (137) +...+++||.+||.+-+.-.. -+| |.+.+ .++..||+. -+..+..+.+.+| ....-.++|+|+.. ..|. T Consensus 5 ~~~aLq~Ai~~~L~ada~l~alvggrV-~D~~P-~~~~~PYv~----lG~~~~~d~~~~~~~g~~~~~ti~Vws~-~~g~ 77 (145) T protein:vir:94 5 VERYLFNKVYNKLKSNPIIQKQLDGRV-FDCVQ-KDAVYPYIV----VGETNVTNKETTTSMVEDVGITLHVYSQ-ARNR 77 (145) T ss_pred HHHHHHHHHHHHhhcCHhHHHhhcccc-ccCCc-CCCCCCEEE----ecCceeeecCCCcccceEEEEEEEEEEc-CCCH Confidence 446778888888865332111 022 33333 122246644 4555666666665 35677899999874 5689 Q ss_pred HHHHHHHHHHHHhhhccceeccCCeEEEecccccccCCcccCCCCEEEEEEEEEEEEEecC Q lcl|NC_018454. 77 STGRALARQVAALFPEGQSVQGDGFACWISSQPSIYAGVLNPRNTRYSIPVSIPYRADISS 137 (137) Q Consensus 77 ~~~~~~Ad~i~a~F~~g~~l~~~~~~v~v~~~p~v~~g~~~~~~~~~~ipVsi~yrad~~~ 137 (137) .++.++|+.|.+..... +...++.+. .-.-.-..-..++++..+..-++|.+|..-|+ T Consensus 78 ~eak~ia~av~~aL~~~--l~l~~~~lv-~l~~~~~~~~rd~dg~~~hgvl~fra~ve~~~ 135 (145) T protein:vir:94 78 DEASQIIQFLGFVLNNE--IEIDYYSFI-KSRIDTQEVITDIDQYTKHGIIRLVFKYRHNT 135 (145) T ss_pred HHHHHHHHHHHHHhccc--cCCCCCeEE-EeEEeeeeEeecCCCceEEEEEEEEEEEEecc Confidence 99999999999988654 444444431 11111122234445557888888888888888 No 24 >protein:vir:96894 Length: 140 # NCBI annotation: ORF029 # Family: family:all:296 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240162;genbank:gi:66395835;genbank:GeneID:5133235 Probab=94.64 E-value=0.0039 Score=33.80 Aligned_cols=127 Identities=9% Similarity=0.024 Sum_probs=69.9 Q ss_pred Cc----hHHHHHHHHHHHHhhcCCC---CceeeCCCCCCCCCCcEEEEEEcCCCceeeecCCCc-eEEEEEEEEEEEEec Q lcl|NC_018454. 1 MI----PDIGAAMNARLGAWADGQK---IPLFIENSPGDKPAGIFLESFDMPATPQTLDLGLTC-HIYPGIFQVNVVVPV 72 (137) Q Consensus 1 m~----~~Ir~al~~rl~~~a~~~~---~pva~pN~~F~pp~~~ylr~~~~p~~t~~~~l~~~~-~~~~G~~qI~v~~p~ 72 (137) |. ..+++||.+||.+-+.-.. -+| |.+.+ .++..||+. -+..+..+.+.+| ....-.++|+|+. . T Consensus 1 Msms~~~aLq~Ai~a~L~ada~l~alvg~~V-yD~~P-~~~~~Pyv~----lG~~~~~~~~~~~~~g~~~~~~i~Vws-~ 73 (140) T protein:vir:96 1 MWVSVEPELTVQIYKRLKASPIINKFVGDRV-FDVVQ-EDAVYPYIV----VGESNVTNNESSTMMRETVGIVIHVYS-Q 73 (140) T ss_pred CCccHHHHHHHHHHHHhhcChhHHHhcCCcc-ccCCc-cCCCCCEEE----ecCceeeecCCCcccceEEEEEEEEEE-c Confidence 55 5788888888876432111 122 44333 122235654 4566666666665 3567789999887 4 Q ss_pred CCChHHHHHHHHHHHHhhhccceeccCCeEEEecccccccCCcccCCCCEEE--EEEEEEEEEEecC Q lcl|NC_018454. 73 GSGTSTGRALARQVAALFPEGQSVQGDGFACWISSQPSIYAGVLNPRNTRYS--IPVSIPYRADISS 137 (137) Q Consensus 73 G~G~~~~~~~Ad~i~a~F~~g~~l~~~~~~v~v~~~p~v~~g~~~~~~~~~~--ipVsi~yrad~~~ 137 (137) ..|..++.++|+.|.+.... .+...++.+.--.- .-..-..++++..+. +-+++.+|...-- T Consensus 74 ~~g~~ea~~ia~av~~AL~~--~l~l~~~~lv~l~~-~~~~~~rd~dg~~~hgvl~~r~~v~~~~~~ 137 (140) T protein:vir:96 74 FATQYEAKQIISAIGYVLNR--PIDIENYEFQFSRI-DSQSVFPDIDRFTKHGTIRLLFKYRHIKKG 137 (140) T ss_pred CCCHHHHHHHHHHHHHHhCC--CccCCCCeEEEEEE-eeeEEEecCCCceEEEEEEEEEEEEeeccc Confidence 66889999999999988754 35555555421111 111222333333444 3444444444333 No 25 >protein:vir:97325 Length: 145 # NCBI annotation: ORF030 # Family: family:all:296 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240617;genbank:gi:66396297;genbank:GeneID:5133681 Probab=94.14 E-value=0.0053 Score=33.07 Aligned_cols=126 Identities=5% Similarity=-0.032 Sum_probs=75.5 Q ss_pred Cc----hHHHHHHHHHHHHhhcCCCC---ceeeCCCCCCCCCCcEEEEEEcCCCceeeecCCCc-eEEEEEEEEEEEEec Q lcl|NC_018454. 1 MI----PDIGAAMNARLGAWADGQKI---PLFIENSPGDKPAGIFLESFDMPATPQTLDLGLTC-HIYPGIFQVNVVVPV 72 (137) Q Consensus 1 m~----~~Ir~al~~rl~~~a~~~~~---pva~pN~~F~pp~~~ylr~~~~p~~t~~~~l~~~~-~~~~G~~qI~v~~p~ 72 (137) |. ..+++||.+||.+-+.-..+ +| |.+.+ .++-.||+. -+..+..+.+.+| ....-.++|+|+.. T Consensus 1 Ms~s~~~aLq~Ai~~~L~ad~~l~alvggrV-~D~~P-~~a~~PYv~----lG~~~~~d~~~~~~~g~~~~~ti~Vws~- 73 (145) T protein:vir:97 1 MWVSVERYLFNKVYNKLKSNLIIRKQLDGRV-FDCVQ-KDAVYPYIV----VGETNVTNKETTTSMVEDVGITLHVYSQ- 73 (145) T ss_pred CcchHhHHHHHHHHHHhhcChhHHHhhcCce-ecCCc-cCCCCCEEE----eCcceeeecCCCcccceEEEEEEEEEEc- Confidence 65 66777788888654321110 22 34333 122246644 3556666666665 35677899999875 Q ss_pred CCChHHHHHHHHHHHHhhhccceeccCCeEE-EecccccccCCcccCCCCEEEEEEEEEEEEEecC Q lcl|NC_018454. 73 GSGTSTGRALARQVAALFPEGQSVQGDGFAC-WISSQPSIYAGVLNPRNTRYSIPVSIPYRADISS 137 (137) Q Consensus 73 G~G~~~~~~~Ad~i~a~F~~g~~l~~~~~~v-~v~~~p~v~~g~~~~~~~~~~ipVsi~yrad~~~ 137 (137) ..|..++.++|+.|.+..... +...++.+ .+.- .-..-..++++..+..-++|.+|..-|. T Consensus 74 ~~g~~eak~ia~av~~aL~~~--l~l~~~~lv~l~~--~~~~~~rd~dg~~~hgvl~fra~ve~~~ 135 (145) T protein:vir:97 74 ARNRDEASQIIQFLGFVLNNE--IEIDYYSFIKSRI--DTQEVITDIDQYTKHGIIRLVFKYRHNT 135 (145) T ss_pred CCCHHHHHHHHHHHHHHhccc--cCCCCCeEEEeEE--eeeeEeecCCCceEEEEEEEEEEEecCc Confidence 668999999999999988654 44444443 2211 1122233344456777788887777776 No 26 >protein:vir:107096 Length: 145 # NCBI annotation: conserved phage protein # Family: family:all:296 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950611;genbank:gi:119953691;genbank:GeneID:4643105 Probab=94.11 E-value=0.0054 Score=33.03 Aligned_cols=126 Identities=4% Similarity=-0.036 Sum_probs=75.6 Q ss_pred Cc----hHHHHHHHHHHHHhhcCCC---CceeeCCCCCCCCCCcEEEEEEcCCCceeeecCCCce-EEEEEEEEEEEEec Q lcl|NC_018454. 1 MI----PDIGAAMNARLGAWADGQK---IPLFIENSPGDKPAGIFLESFDMPATPQTLDLGLTCH-IYPGIFQVNVVVPV 72 (137) Q Consensus 1 m~----~~Ir~al~~rl~~~a~~~~---~pva~pN~~F~pp~~~ylr~~~~p~~t~~~~l~~~~~-~~~G~~qI~v~~p~ 72 (137) |. ..+++||.+||.+-+.-.. -+ .|.+.+ .++..||+. -+.+...+.+.+|. ...-.++|+|+.. T Consensus 1 Ms~s~~~aLq~Ai~~~L~ad~al~alvg~r-VyD~~P-~~a~~PyV~----lG~~~~~~~~~~~~~g~~~~~ti~Vws~- 73 (145) T protein:vir:10 1 MWVSVERYLFNKIYNKLKSNPIIKKQLGGR-VFDCVQ-KDAVYPYIV----VGETNVTNKETTTSMFEDVGVTLHVYSQ- 73 (145) T ss_pred CchhHHHHHHHHHHHHhhcChhHHHhhccc-cccCCc-cCCCCCEEE----eCcceeeecCCCcccceEEEEEEEEEEc- Confidence 55 6677777777765432111 01 244333 122346644 35566666666653 5677899999875 Q ss_pred CCChHHHHHHHHHHHHhhhccceeccCCeEE-EecccccccCCcccCCCCEEEEEEEEEEEEEecC Q lcl|NC_018454. 73 GSGTSTGRALARQVAALFPEGQSVQGDGFAC-WISSQPSIYAGVLNPRNTRYSIPVSIPYRADISS 137 (137) Q Consensus 73 G~G~~~~~~~Ad~i~a~F~~g~~l~~~~~~v-~v~~~p~v~~g~~~~~~~~~~ipVsi~yrad~~~ 137 (137) ..|..++.++|+.|.+.... .+...++.+ .+.- .-..-..++++..+..-++|.++..-|+ T Consensus 74 ~~g~~ea~~ia~av~~aL~a--~l~l~~~~lv~l~~--~~~~~~rd~dg~~~hgvl~~ra~ve~~~ 135 (145) T protein:vir:10 74 ARNRDEASQIIQYLGFVLNS--EIEINNYSFIKSRI--DTQEVITDIDQYTKHGIIRLIFKYRHNT 135 (145) T ss_pred CCCHHHHHHHHHHHHHHhCC--CcCCCCCeEEEEEE--eeeeEeecCCCceEEEEEEEEEEEeecc Confidence 66889999999999999853 455555554 2211 1122234444556777788888877777 No 27 >protein:vir:105337 Length: 145 # NCBI annotation: conserved phage protein # Family: family:all:296 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950674;genbank:gi:119967844;genbank:GeneID:4643216 Probab=94.07 E-value=0.0055 Score=32.97 Aligned_cols=126 Identities=4% Similarity=-0.036 Sum_probs=75.6 Q ss_pred Cc----hHHHHHHHHHHHHhhcCCC---CceeeCCCCCCCCCCcEEEEEEcCCCceeeecCCCce-EEEEEEEEEEEEec Q lcl|NC_018454. 1 MI----PDIGAAMNARLGAWADGQK---IPLFIENSPGDKPAGIFLESFDMPATPQTLDLGLTCH-IYPGIFQVNVVVPV 72 (137) Q Consensus 1 m~----~~Ir~al~~rl~~~a~~~~---~pva~pN~~F~pp~~~ylr~~~~p~~t~~~~l~~~~~-~~~G~~qI~v~~p~ 72 (137) |. ..+++||.+||.+-+.-.. -+ .|.+.+ .++..||+. -+.+...+.+.+|. ...-.++|+|+.. T Consensus 1 Ms~s~~~aLq~Ai~~~L~ad~al~alvg~r-VyD~~P-~~a~~PyV~----lG~~~~~~~~~~~~~g~~~~~ti~Vws~- 73 (145) T protein:vir:10 1 MWVSVERYLFNKIYNKLKSNPIVSKQLGGR-VFDCVQ-KDAVYPYIV----VGETNVTNKETTTSMFEDVGVTLHVYSQ- 73 (145) T ss_pred CchhHHHHHHHHHHHHhhcChhHHHhhccc-cccCCc-cCCCCCEEE----eCcceeeecCCCcccceEEEEEEEEEEc- Confidence 55 6677777777765332111 01 244333 122346644 35566666666653 5677899999875 Q ss_pred CCChHHHHHHHHHHHHhhhccceeccCCeEE-EecccccccCCcccCCCCEEEEEEEEEEEEEecC Q lcl|NC_018454. 73 GSGTSTGRALARQVAALFPEGQSVQGDGFAC-WISSQPSIYAGVLNPRNTRYSIPVSIPYRADISS 137 (137) Q Consensus 73 G~G~~~~~~~Ad~i~a~F~~g~~l~~~~~~v-~v~~~p~v~~g~~~~~~~~~~ipVsi~yrad~~~ 137 (137) ..|..++.++|+.|.+.... .+...++.+ .+.- .-..-..++++..+..-++|.++..-|+ T Consensus 74 ~~g~~ea~~ia~av~~aL~a--~l~l~~~~lv~l~~--~~~~~~rd~dg~~~hgvl~~ra~ve~~~ 135 (145) T protein:vir:10 74 ARNRDEASQIIQYLGFVLNS--EIEINNYSFIKSRI--DTQEVITDIDQYTKHGIIRLIFKYRHNT 135 (145) T ss_pred CCCHHHHHHHHHHHHHHhCC--CcCCCCCeEEEEEE--eeeeEeecCCCceEEEEEEEEEEEeecc Confidence 66889999999999999853 455555554 2211 1122234444556777788888877777 No 28 >protein:vir:5979 Length: 134 # NCBI annotation: hypothetical protein # Family: family:all:296 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690679;genbank:geneid:6329147;genbank:gi:22855073;uniprot:O48448;genbank:GeneID:955319 Probab=93.41 E-value=0.0077 Score=32.18 Aligned_cols=127 Identities=6% Similarity=0.022 Sum_probs=72.1 Q ss_pred Cc-----hHHHHHHHHHHHHhhcCCCC-ceeeCCCCCCCCCCcEEEEEEcCCCceeeecCCCc-eEEEEEEEEEEEEecC Q lcl|NC_018454. 1 MI-----PDIGAAMNARLGAWADGQKI-PLFIENSPGDKPAGIFLESFDMPATPQTLDLGLTC-HIYPGIFQVNVVVPVG 73 (137) Q Consensus 1 m~-----~~Ir~al~~rl~~~a~~~~~-pva~pN~~F~pp~~~ylr~~~~p~~t~~~~l~~~~-~~~~G~~qI~v~~p~G 73 (137) |. ...++||.+||.+-+.-..+ -=.|.+.+ .++..||+. -+.++..+.+.+| ....-.++|+|+... T Consensus 1 m~~~s~~~aLq~Ai~~~L~ad~~l~alvg~I~D~~P-~~~~~PYV~----lG~~~~~d~~~~~~~g~~~~~ti~Vws~~- 74 (134) T protein:vir:59 1 MTWKLASRALQKATVENLESYQPLMEMVNQVTESPG-KDDPYPYVV----IGDQSSTPFETKSSFGENITMDFHVWGGT- 74 (134) T ss_pred CCccchhHHHHHHHHHHhhcChhHHHhhhhhhcCCC-CCCCCCEEE----eCCceeeecCCCcccceEEEEEEEEEECC- Confidence 43 36889999999764332211 01333333 122346654 3556666666555 356778899999865 Q ss_pred CChHHHHHHHHHHHHhhhccceeccCCeEEEecccccccCCcccCCCCEEEEEEEEEEEEEec Q lcl|NC_018454. 74 SGTSTGRALARQVAALFPEGQSVQGDGFACWISSQPSIYAGVLNPRNTRYSIPVSIPYRADIS 136 (137) Q Consensus 74 ~G~~~~~~~Ad~i~a~F~~g~~l~~~~~~v~v~~~p~v~~g~~~~~~~~~~ipVsi~yrad~~ 136 (137) |..+++++|+.|.+.. .+..|...+..+.--.- .-..-..++++..+..-++|.++-+-| T Consensus 75 -g~~ea~~ia~av~~aL-~~~~L~l~~~~lv~l~~-~~~~~~rd~dg~~~hg~l~fra~ve~~ 134 (134) T protein:vir:59 75 -TRAEAQDISSRVLEAL-TYKPLMFEGFTFVAKKL-VLAQVITDTDGVTKHGIIKVRFTINNN 134 (134) T ss_pred -ChHHHHHHHHHHHHHh-cCCCcccCCceEEEeEE-eeeeEEecCCCceEEEEEEEEEEEecC Confidence 5578999999999998 44556555444321111 112223344444566656555555555 No 29 >protein:vir:1892 Length: 121 # NCBI annotation: gp11 # Family: family:all:896 # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037672;genbank:gi:9634130;genbank:GeneID:1262490 Probab=92.11 E-value=0.013 Score=30.96 Aligned_cols=118 Identities=9% Similarity=0.017 Sum_probs=71.0 Q ss_pred CchHHHHHHHHH--HHHhhcCCCCceeeC-CCCCCCCCCcEEEEEEcCCCceeeecCCCceEEEEEEEEEEEEecCCChH Q lcl|NC_018454. 1 MIPDIGAAMNAR--LGAWADGQKIPLFIE-NSPGDKPAGIFLESFDMPATPQTLDLGLTCHIYPGIFQVNVVVPVGSGTS 77 (137) Q Consensus 1 m~~~Ir~al~~r--l~~~a~~~~~pva~p-N~~F~pp~~~ylr~~~~p~~t~~~~l~~~~~~~~G~~qI~v~~p~G~G~~ 77 (137) |.+-|+++|++= +.++.+... +-.|| ++.=.-+-.||+-.+.+.+ +....|+|.+-...+.+||+|+... .. T Consensus 1 m~~~i~~~l~~d~~v~allg~~~-~Rvyp~~~aP~~~~~Pyiv~q~vsg-~p~~~l~G~~~~~~~~vQIDvyA~t---~~ 75 (121) T protein:vir:18 1 MIAPIFSVCASSPEVTDLLGSNP-VRIYPFGIQDDNVVYPYVVWQNITG-SPENYIAQRPDADFFTLQVDAYADT---VD 75 (121) T ss_pred CchHHHHHHhcChhhhhhhcCCC-ceeeeccCCCCcCcCCeEEEEEecC-cccceecCCCCcceeEEEEEeecCC---HH Confidence 999999999764 345544322 23455 4431111247777766655 4445666766677899999999765 45 Q ss_pred HHHHHHHHHHHhhhccceeccCCeEEEecccccccCCcccCCCCEEEEEEEEEEEEEe Q lcl|NC_018454. 78 TGRALARQVAALFPEGQSVQGDGFACWISSQPSIYAGVLNPRNTRYSIPVSIPYRADI 135 (137) Q Consensus 78 ~~~~~Ad~i~a~F~~g~~l~~~~~~v~v~~~p~v~~g~~~~~~~~~~ipVsi~yrad~ 135 (137) +|.+++++|++..+ ... +... ... . .-+++-+-|.+-+-|+|--+- T Consensus 76 ~A~~l~~avr~Ale-~~~-----~~~~--~~~---~-~ye~dT~lyR~s~Dv~~~~~r 121 (121) T protein:vir:18 76 EVIAVATALRDAIE-PHA-----HITR--WGG---Q-ERDPETKRYRYSFDVDWIVTR 121 (121) T ss_pred HHHHHHHHHHHHhh-hcC-----cccC--CCC---C-CCcccccceeeeeEEEEeecC Confidence 78899999998874 222 1111 111 1 122333467777777775555 No 30 >protein:vir:97070 Length: 118 # NCBI annotation: hypothetical protein # Family: family:all:3244 # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453569;genbank:gi:84662604;genbank:GeneID:5142485 Probab=83.81 E-value=0.065 Score=27.08 Aligned_cols=116 Identities=14% Similarity=0.100 Sum_probs=68.1 Q ss_pred CchHHHHHHHHHHHHhhcCCCCceeeCCCC-CCCCCCcEEEEEEcCCCceeeecCCC-ceEEEEEEEEEEEEecCCChHH Q lcl|NC_018454. 1 MIPDIGAAMNARLGAWADGQKIPLFIENSP-GDKPAGIFLESFDMPATPQTLDLGLT-CHIYPGIFQVNVVVPVGSGTST 78 (137) Q Consensus 1 m~~~Ir~al~~rl~~~a~~~~~pva~pN~~-F~pp~~~ylr~~~~p~~t~~~~l~~~-~~~~~G~~qI~v~~p~G~G~~~ 78 (137) |+ +.+.|.+.|+.++..+ .||++. ...+..||+-...+.+... ..|.|. +......+||+|+.. ...+ T Consensus 1 M~--~e~~l~a~L~~~~~~R----vyp~~aP~~~~~~Pyiv~q~vsg~p~-~~ldG~~~~~~~~rvQIdvyA~---t~~~ 70 (118) T protein:vir:97 1 MS--YGRMLKDLLDPVFSGR----VYADIPPDSPPLDAYAIYQRVGGVPV-YWKEGGMPDKVNARVQVQIWSR---SKQE 70 (118) T ss_pred Cc--hHHHHHHHHhhhcCCc----cccccCCCCCCcCCEEEEEecCCccc-ccccCCCCCccceeEEEEEeeC---CHHH Confidence 76 3466777777766543 455543 2223348888887777555 447665 455667899999975 4568 Q ss_pred HHHHHHHHHHhhhccceeccCCeEEEecccccccCCcccCCCCEEEEEEEEEEEEEec Q lcl|NC_018454. 79 GRALARQVAALFPEGQSVQGDGFACWISSQPSIYAGVLNPRNTRYSIPVSIPYRADIS 136 (137) Q Consensus 79 ~~~~Ad~i~a~F~~g~~l~~~~~~v~v~~~p~v~~g~~~~~~~~~~ipVsi~yrad~~ 136 (137) |.+++++|+........+ .. +..+.+ .-+++.+.|.+-+-|.-=.+|| T Consensus 71 A~~l~~av~~al~~~~~~-----~~-~~~~~~----~ye~dt~lyr~~~Df~iw~~~~ 118 (118) T protein:vir:97 71 AYLATVQVLRIVSEANDM-----QV-LSQPID----DYVRELKLYGSRVDISMWYNLT 118 (118) T ss_pred HHHHHHHHHHHhhccccc-----cc-ccCCcc----cccccCCceEEEEEEEEEeecC Confidence 888888888877443221 11 111111 1123344677666666655666 No 31 >protein:vir:80371 Length: 115 # NCBI annotation: gp11 # Family: family:all:2712 # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111090;genbank:gi:134288622;genbank:GeneID:4960618 Probab=83.07 E-value=0.027 Score=29.15 Aligned_cols=113 Identities=12% Similarity=0.122 Sum_probs=64.3 Q ss_pred CchHHHHHHHHHHHHhhcCCCCceeeCCCCCCCCCCcEEEEEEcCCCceeeecCCCceEEEEEEEEEEEEecCCChHHHH Q lcl|NC_018454. 1 MIPDIGAAMNARLGAWADGQKIPLFIENSPGDKPAGIFLESFDMPATPQTLDLGLTCHIYPGIFQVNVVVPVGSGTSTGR 80 (137) Q Consensus 1 m~~~Ir~al~~rl~~~a~~~~~pva~pN~~F~pp~~~ylr~~~~p~~t~~~~l~~~~~~~~G~~qI~v~~p~G~G~~~~~ 80 (137) |... .++..|..++.... |+.++=.-...||+-+.-+.+..+ ..|.|+.....|.|||+++.+. ..+++ T Consensus 1 ~~~~---vir~al~~i~~~~~----~~~vAp~~~~~pyivy~rvsga~e-~~L~G~ag~~~~~~QID~yA~T---~~ea~ 69 (115) T protein:vir:80 1 MSVI---VVRDALQGIGGAKG----YLGVAPEKAPARYFVVTRVHGALD-MALAGPTGGRSGSYQIDCYAPT---FTDAD 69 (115) T ss_pred Ceee---eeechhhhcccccc----ceeeccccCcCCeEEEeecCCCcc-ccccCCCCCceeEEEEeeecCC---HHHHH Confidence 7654 33444555665543 333321111247776665555444 4566666677899999999875 67899 Q ss_pred HHHHHHHHhhhccce--eccCCeEEEecccccccCCcccCCCCEEEEEEEEEE Q lcl|NC_018454. 81 ALARQVAALFPEGQS--VQGDGFACWISSQPSIYAGVLNPRNTRYSIPVSIPY 131 (137) Q Consensus 81 ~~Ad~i~a~F~~g~~--l~~~~~~v~v~~~p~v~~g~~~~~~~~~~ipVsi~y 131 (137) ++|+++.+.- .+.. +..+ -+++-|.... .++..=+..+.|+|.| T Consensus 70 ~La~~v~d~~-~~~~~~~~vg----~l~e~pd~Ye--~DT~l~Rvs~dv~i~f 115 (115) T protein:vir:80 70 RLADLAVDRA-MSVQDRFSVG----GVDELPDDYS--ADTGLFRVSLELSVEF 115 (115) T ss_pred HHHHHHHHhh-hCCcccccee----cccCCCcccc--cccceEEEEEEEEEeC Confidence 9999999842 2211 1222 1334443322 1222236778888888 No 32 >protein:vir:10368 Length: 118 # NCBI annotation: conserved phage protein # Family: family:all:3244 # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858960;genbank:gi:32128425;genbank:GeneID:2648389 Probab=80.40 E-value=0.095 Score=26.18 Aligned_cols=116 Identities=14% Similarity=0.111 Sum_probs=66.7 Q ss_pred CchHHHHHHHHHHHHhhcCCCCceeeCCCC-CCCCCCcEEEEEEcCCCceeeecCCC-ceEEEEEEEEEEEEecCCChHH Q lcl|NC_018454. 1 MIPDIGAAMNARLGAWADGQKIPLFIENSP-GDKPAGIFLESFDMPATPQTLDLGLT-CHIYPGIFQVNVVVPVGSGTST 78 (137) Q Consensus 1 m~~~Ir~al~~rl~~~a~~~~~pva~pN~~-F~pp~~~ylr~~~~p~~t~~~~l~~~-~~~~~G~~qI~v~~p~G~G~~~ 78 (137) |+ +...|.+.|+.++..+ .||.+. -..|-.||+-...+.+.. ...|+|. +......+||+|+.. -..+ T Consensus 1 Ms--~e~~l~a~L~~~~~~R----Vyp~~aP~~~~~~Pyiv~q~vsg~p-~~~l~G~~~~~~~~rvQIdvyA~---t~~~ 70 (118) T protein:vir:10 1 MS--YGRVLKDLLDPVFSGR----VYADIPPDSPPLDAYAIYQRVGGVP-VYWQEGGMPEKVNARVQIQIWSR---SKQE 70 (118) T ss_pred Cc--hHHHHHHHHhhhcCCc----cccccCCCCCCcCCEEEEEecCCcc-cccccCCCCccceeEEEEEEeeC---CHHH Confidence 76 2345666666666542 455443 222334888888887765 4457775 455667899999975 4678 Q ss_pred HHHHHHHHHHhhhccceeccCCeEEEecccccccCCcccCCCCEEEEEEEEEEEEEec Q lcl|NC_018454. 79 GRALARQVAALFPEGQSVQGDGFACWISSQPSIYAGVLNPRNTRYSIPVSIPYRADIS 136 (137) Q Consensus 79 ~~~~Ad~i~a~F~~g~~l~~~~~~v~v~~~p~v~~g~~~~~~~~~~ipVsi~yrad~~ 136 (137) |.+++++|+........+ .. +..+.+ .-+++.+.|.+-+-|.-=-+|| T Consensus 71 A~~l~~av~~al~~~~~~-----~~-~~~~~d----~ye~dt~l~r~~~Df~vw~~~~ 118 (118) T protein:vir:10 71 AYLATVQVLRLVSEANDM-----QV-LSQPID----DYVREIKLYGSRVDISMWYNLT 118 (118) T ss_pred HHHHHHHHHHHhhhcccc-----ee-ccCCCc----cccccCCceEEEEEEEEeeecC Confidence 888888888887443221 11 111111 1123334666666666434444 No 33 >protein:vir:100116 Length: 115 # NCBI annotation: gp10 # Family: family:all:2712 # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945040;genbank:gi:38707900;genbank:GeneID:2744163 Probab=79.71 E-value=0.063 Score=27.18 Aligned_cols=114 Identities=14% Similarity=0.136 Sum_probs=57.2 Q ss_pred CchHHHHHHHHHHHHhhcCCCCceeeCCCCCCCCCCcEEEEEEcCCCceeeecCCCceEEEEEEEEEEEEecCCChHHHH Q lcl|NC_018454. 1 MIPDIGAAMNARLGAWADGQKIPLFIENSPGDKPAGIFLESFDMPATPQTLDLGLTCHIYPGIFQVNVVVPVGSGTSTGR 80 (137) Q Consensus 1 m~~~Ir~al~~rl~~~a~~~~~pva~pN~~F~pp~~~ylr~~~~p~~t~~~~l~~~~~~~~G~~qI~v~~p~G~G~~~~~ 80 (137) |..- .+.+.|..+... -.||++.=.-+..||+-...+-+... ..|+|.+....+.+||+|+... ..+|+ T Consensus 1 ~~~~---~i~~aL~~l~~~----RVyp~~aP~~~~~Pyiv~q~vsg~p~-~~L~G~~~~~~~~vQIDvyA~t---~~~A~ 69 (115) T protein:vir:10 1 MSVI---VIRDALQGIGGA----KGYLGVAPEKAPAPYFVVTRVHGALD-MALAGLTGGRSGSYQIDCYAPT---FTDAD 69 (115) T ss_pred CeeE---EeehhhcccCCc----eeecccCCCCCCCCEEEEEeecCccc-cccCCCCCCcceEEEEEEeeCC---HHHHH Confidence 4332 122222222221 34566531112248877777666544 4888877777899999999764 56777 Q ss_pred HHHHHHHHhhhccceeccCCeEEE-ecccccccCCcccCCCCEEEEEEEEEE Q lcl|NC_018454. 81 ALARQVAALFPEGQSVQGDGFACW-ISSQPSIYAGVLNPRNTRYSIPVSIPY 131 (137) Q Consensus 81 ~~Ad~i~a~F~~g~~l~~~~~~v~-v~~~p~v~~g~~~~~~~~~~ipVsi~y 131 (137) ++++++.+.- .+.. ..+.+. +++.+.... .++.-=+..+-++|=| T Consensus 70 ~l~~~v~~~~-~~~~---~~~~~~~~~~~~d~ye--~dt~lyR~s~D~~vWf 115 (115) T protein:vir:10 70 RLADLAVDRA-MSVQ---DRFSVGGVDELPDDYS--EDTGLFRISLELSVEF 115 (115) T ss_pred HHHHHHHHHH-hcCc---cceeEeeecCCCCCCc--ccccceeeEEEEEEeC Confidence 7777776532 1111 112222 222222111 1112124555566666 No 34 >protein:vir:1438 Length: 115 # NCBI annotation: hypothetical protein # Family: family:all:2712 # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536367;genbank:gi:17975172;genbank:GeneID:929144 Probab=79.62 E-value=0.074 Score=26.79 Aligned_cols=114 Identities=14% Similarity=0.128 Sum_probs=57.8 Q ss_pred CchHHHHHHHHHHHHhhcCCCCceeeCCCCCCCCCCcEEEEEEcCCCceeeecCCCceEEEEEEEEEEEEecCCChHHHH Q lcl|NC_018454. 1 MIPDIGAAMNARLGAWADGQKIPLFIENSPGDKPAGIFLESFDMPATPQTLDLGLTCHIYPGIFQVNVVVPVGSGTSTGR 80 (137) Q Consensus 1 m~~~Ir~al~~rl~~~a~~~~~pva~pN~~F~pp~~~ylr~~~~p~~t~~~~l~~~~~~~~G~~qI~v~~p~G~G~~~~~ 80 (137) |..- .+.+.|..++.. -.||++.=.-+..||+-...+-+... ..|+|.+....+.+||+|+... ..+|. T Consensus 1 ~~~~---~i~~aL~~l~~~----RVyp~~aP~~~~~Pyiv~q~vsg~p~-~~L~G~~~~~~~~vQIDvyA~t---~~~A~ 69 (115) T protein:vir:14 1 MSVI---VIRDALQGIGGA----KGYLGVAPAKAPAPYFVVTRVHGALD-MALAGLTGGRSGSYQIDCYAPT---FTDAD 69 (115) T ss_pred CeeE---eeehhhcccccc----ccccccCCCCCCCCEEEEEeecCccc-ccccCCCCCcceEEEEEEeeCC---HHHHH Confidence 5433 222333333332 24566531111247877776666544 4888887777899999999754 56777 Q ss_pred HHHHHHHHhhhccceeccCCeEEE-ecccccccCCcccCCCCEEEEEEEEEE Q lcl|NC_018454. 81 ALARQVAALFPEGQSVQGDGFACW-ISSQPSIYAGVLNPRNTRYSIPVSIPY 131 (137) Q Consensus 81 ~~Ad~i~a~F~~g~~l~~~~~~v~-v~~~p~v~~g~~~~~~~~~~ipVsi~y 131 (137) ++++++.+.- .+.. ..+.+. +++.+.... .++.-=+..+-++|=| T Consensus 70 ~l~~~v~~~~-~~~~---~~~~~~~~~~~~d~ye--~dt~lyR~s~D~~vWf 115 (115) T protein:vir:14 70 RLADLAVDRA-MSVQ---DRFSVGGVDELPDDYS--EDTGLFRISLELSVEF 115 (115) T ss_pred HHHHHHHHHH-hcCc---cceeeeeecCCCCCCc--ccccceeeEEEEEEeC Confidence 8888876543 1111 112222 222222111 1111114555556666 No 35 >protein:vir:195 Length: 115 # NCBI annotation: Gp11 # Family: family:all:896 # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037705;genbank:gi:9634170;genbank:GeneID:1262540 Probab=73.98 E-value=0.16 Score=24.89 Aligned_cols=113 Identities=17% Similarity=0.098 Sum_probs=63.1 Q ss_pred CchHHHHHHHHHHHHhhcCCCCceeeCCCCCCCCC--CcEEEEEEcCCCceeeecCCCceEEEEEEEEEEEEecCCChHH Q lcl|NC_018454. 1 MIPDIGAAMNARLGAWADGQKIPLFIENSPGDKPA--GIFLESFDMPATPQTLDLGLTCHIYPGIFQVNVVVPVGSGTST 78 (137) Q Consensus 1 m~~~Ir~al~~rl~~~a~~~~~pva~pN~~F~pp~--~~ylr~~~~p~~t~~~~l~~~~~~~~G~~qI~v~~p~G~G~~~ 78 (137) |+. +.|.+.|..++..+-+|-.-|...-..|+ .||+....+-+... ..|+|.. .....+||+|+.+ ...+ T Consensus 1 M~e---~~i~~lL~~l~~gRvyp~~aP~~~~~~~~~~~Pyiv~q~vsg~p~-~~L~G~~-~~~~~vQIDvyA~---t~~~ 72 (115) T protein:vir:19 1 MNE---DNIYALLSPLAEGRVYPYVAPLGSDGKPSVSPPWIIFSIVDDVSA-DVLCGQA-ESRVSVQVDVYST---SIAE 72 (115) T ss_pred Cch---hHHHHHHhhhcCcccceeeccCCCCCCccccCCeEEEEeccCccc-ccccCCC-ccceEEEEEEeeC---ChHH Confidence 985 57888888888887777766665433333 47776666655333 3366643 2456999999875 4567 Q ss_pred HHHHHHHHHHhhhccceeccCCeEEEecccccccCCcccCCCCEEEEEEEEEEEE Q lcl|NC_018454. 79 GRALARQVAALFPEGQSVQGDGFACWISSQPSIYAGVLNPRNTRYSIPVSIPYRA 133 (137) Q Consensus 79 ~~~~Ad~i~a~F~~g~~l~~~~~~v~v~~~p~v~~g~~~~~~~~~~ipVsi~yra 133 (137) |.+++++|++..+.-. .+.+. +. .++ +++-.-|+.-+-|.-.- T Consensus 73 A~~l~~~i~~Al~~~~-----p~~~~--~~----~~y-e~dt~lyR~s~d~~V~~ 115 (115) T protein:vir:19 73 SRSLRDLVLASLEPLT-----PTEVV--KI----PGY-EPDYRLYRATLDFKVTP 115 (115) T ss_pred HHHHHHHHHHHhhhcC-----CEEec--CC----CCc-ccchhceeeEEEEEecC Confidence 8888888888764211 12211 11 111 11112344333322222 No 36 >protein:vir:3428 Length: 131 # NCBI annotation: tail component # Family: family:all:911 # MgeID: mge:70 # MgeName: lambda # Cross-refs: genbank:acc:NP_040591;genbank:gi:9626255;genbank:GeneID:2703486 Probab=71.17 E-value=0.2 Score=24.43 Aligned_cols=122 Identities=15% Similarity=0.133 Sum_probs=73.3 Q ss_pred Cc-hHHHHHHHHHHHHhhcCCCCceeeCCCC-C-CCCCCcEEEEEEcCCCceeeecCCCceEEEEEEEEEEEEecCCChH Q lcl|NC_018454. 1 MI-PDIGAAMNARLGAWADGQKIPLFIENSP-G-DKPAGIFLESFDMPATPQTLDLGLTCHIYPGIFQVNVVVPVGSGTS 77 (137) Q Consensus 1 m~-~~Ir~al~~rl~~~a~~~~~pva~pN~~-F-~pp~~~ylr~~~~p~~t~~~~l~~~~~~~~G~~qI~v~~p~G~G~~ 77 (137) |+ .+||+++..+|.+ ..+... .|-|.+ | ++-..|=..|++-.+.....++. ....+..|.|.||-|+.++-. T Consensus 1 ~~ht~IR~~Vid~L~~--~l~~v~-~fdG~P~fide~ElPAVAV~l~d~~~~~~~ld--~~~w~A~LhI~iyLka~~~ds 75 (131) T protein:vir:34 1 MKHTELRAAVLDALEK--HDTGAT-FFDGRPAVFDEADFPAVAVYLTGAEYTGEELD--SDTWQAELHIEVFLPAQVPDS 75 (131) T ss_pred CchHHHHHHHHHHHhc--cCCceE-EecCCceeeccccCcEEEEEeecCCCCcceec--CCeeEEEEEEEEEeecCCCHH Confidence 77 6899999999976 122322 455543 3 42235777777777777666665 347789999999999999999 Q ss_pred HHHHHHHH-HHHhhhccceec--cCCeEEEecccccccCCcccC-CCCEEEEE--EEEEEEE Q lcl|NC_018454. 78 TGRALARQ-VAALFPEGQSVQ--GDGFACWISSQPSIYAGVLNP-RNTRYSIP--VSIPYRA 133 (137) Q Consensus 78 ~~~~~Ad~-i~a~F~~g~~l~--~~~~~v~v~~~p~v~~g~~~~-~~~~~~ip--Vsi~yra 133 (137) +..++|.+ |..-.+....|. .+ .+...+ ..=..|+ ..+|...- -+|+|.- T Consensus 76 ~LD~~~E~~i~~v~~~~~~l~~l~~--~~~~~g----y~Y~rD~e~~tW~sadL~y~ItY~~ 131 (131) T protein:vir:34 76 ELDAWMESRIYPVMSDIPALSDLIT--SMVASG----YDYRRDDDAGLWSSADLTYVITYEM 131 (131) T ss_pred HHHHHHHHHhHHHhhcchhhhhHhh--hhhhcc----CCcccccccceEEEEEEEEEEEEeC Confidence 99999998 445544333322 11 111111 1101111 22354433 4556665 No 37 >protein:vir:96800 Length: 127 # NCBI annotation: hypothetical protein # Family: family:all:32155 # MgeID: mge:1629 # MgeName: phiHSIC # Cross-refs: genbank:acc:YP_224254;genbank:gi:62362389;genbank:GeneID:3345739 Probab=67.08 E-value=0.18 Score=24.63 Aligned_cols=126 Identities=22% Similarity=0.212 Sum_probs=87.5 Q ss_pred Cc-hHHHHHHHHHHHHhhcCCCCceeeCCCCCCCCCCcEEEEEEcCCCceeeecCCCceEEEEEEEEEEEEecCCChHHH Q lcl|NC_018454. 1 MI-PDIGAAMNARLGAWADGQKIPLFIENSPGDKPAGIFLESFDMPATPQTLDLGLTCHIYPGIFQVNVVVPVGSGTSTG 79 (137) Q Consensus 1 m~-~~Ir~al~~rl~~~a~~~~~pva~pN~~F~pp~~~ylr~~~~p~~t~~~~l~~~~~~~~G~~qI~v~~p~G~G~~~~ 79 (137) |. -+.-+|+|.-|.+-. .+||+ |.. -|.++.-.|+.+-.++..-..|....+..+|.|-|.|-...|+..-+- T Consensus 1 mtfldavkafeddlkakv---nipva--nks-iptdgvsmrvalnnadadglflnsgarvmtgqfnveisaelgtnkyam 74 (127) T protein:vir:96 1 MTFLDAVKAFEDDLKAKV---NIPVA--NKS-IPTDGVSMRVALNNADADGLFLNSGARVMTGQFNVEISAELGTNKYAM 74 (127) T ss_pred Cchhhhhhhhhhccceee---ecccc--ccc-cCcCceEEEEEeccCCcceeEeecCceeeeeeeeeEEeeccCCceeee Confidence 32 122233333332211 23333 332 466889999999999999999988889999999999999999988877 Q ss_pred HHHHHHHHHhhhccceeccCCeEEEecccccccCCcccCCCCEEEEEEEEEEEEEe Q lcl|NC_018454. 80 RALARQVAALFPEGQSVQGDGFACWISSQPSIYAGVLNPRNTRYSIPVSIPYRADI 135 (137) Q Consensus 80 ~~~Ad~i~a~F~~g~~l~~~~~~v~v~~~p~v~~g~~~~~~~~~~ipVsi~yrad~ 135 (137) ..-|.++.+.+++|-+...-+-.+.|-+.-+ +.++ +.+.+-.|+|-|.|+--- T Consensus 75 maeankvlavyergysvpvldrrvlilqanq-stpy--pteahqkinviidfqitk 127 (127) T protein:vir:96 75 MAEANKVLAVYERGYSVPVLDRRVLILQANQ-STPY--PTEAHQKINVIIDFQITK 127 (127) T ss_pred eeccceeEEeeecCcccceecceEEEEEcCC-CCCC--cccccceeeEEEEEEEcC Confidence 7778888899998888776666666655432 3334 334577899999887654 No 38 >protein:vir:93602 Length: 114 # NCBI annotation: putative structural component # Family: family:all:896 # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449300;genbank:gi:157166048;uniprot:Q6H9U1;genbank:GeneID:5580424 Probab=66.62 E-value=0.26 Score=23.75 Aligned_cols=113 Identities=14% Similarity=0.089 Sum_probs=59.1 Q ss_pred CchHHHHHHHHHHHHhhcCCCCceeeCCCCCCCC-CCcEEEEEEcCCCceeeecCCCceEEEEEEEEEEEEecCCChHHH Q lcl|NC_018454. 1 MIPDIGAAMNARLGAWADGQKIPLFIENSPGDKP-AGIFLESFDMPATPQTLDLGLTCHIYPGIFQVNVVVPVGSGTSTG 79 (137) Q Consensus 1 m~~~Ir~al~~rl~~~a~~~~~pva~pN~~F~pp-~~~ylr~~~~p~~t~~~~l~~~~~~~~G~~qI~v~~p~G~G~~~~ 79 (137) |+. +.|.+.|..++..+-+|-.-|-....+. ..||+....+-+... ..|+|.. .-.-.+||+|+.. ...+| T Consensus 1 M~e---~~i~~lL~~~~~gRvyp~~~P~~~~~~~~~~Pyiv~q~vsg~p~-~~l~gp~-~~~~~vQIDvyA~---t~~~A 72 (114) T protein:vir:93 1 MTE---ADLYPHLAHLAGGQVYPYVVPLLDGRPSVALPWVVFSLISSVSA-DVMGGQA-ESSVSVQIDVYAG---TVTQA 72 (114) T ss_pred Cch---HHHHHHHHhhcCcccccccCCcccCcCCccCceEEEEeccCccc-ccccCcc-ccceEEEEEeeeC---CHHHH Confidence 985 5788888887776533333332221111 237776666655443 3356633 3456999999975 56788 Q ss_pred HHHHHHHHHhhhccceeccCCeEEEecccccccCCcccCCCCEEEEEEEEEEEE Q lcl|NC_018454. 80 RALARQVAALFPEGQSVQGDGFACWISSQPSIYAGVLNPRNTRYSIPVSIPYRA 133 (137) Q Consensus 80 ~~~Ad~i~a~F~~g~~l~~~~~~v~v~~~p~v~~g~~~~~~~~~~ipVsi~yra 133 (137) .+++++|+............ +. .+. +++-.-|..-+-|.+.- T Consensus 73 ~~l~~~v~~Al~~~~~~~~~-------~~----~~y-e~dt~lyR~~~d~~v~~ 114 (114) T protein:vir:93 73 RQIRQDAREAIMLLAPGSVS-------EM----QDY-IPENRCYRATLEFQVTV 114 (114) T ss_pred HHHHHHHHHHHhhcCcEeec-------CC----Ccc-cccccceeeEEEEEEeC Confidence 99999998877433221111 10 111 11112233332222222 No 39 >protein:vir:81066 Length: 118 # NCBI annotation: p13 # Family: family:all:3244 # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285683;genbank:gi:148727191;genbank:GeneID:5247111 Probab=51.54 E-value=0.58 Score=21.89 Aligned_cols=116 Identities=13% Similarity=0.063 Sum_probs=62.1 Q ss_pred CchHHHHHHHHHHHHhhcCCCCceeeCCCC-CCCCCCcEEEEEEcCCCceeeecCCC-ceEEEEEEEEEEEEecCCChHH Q lcl|NC_018454. 1 MIPDIGAAMNARLGAWADGQKIPLFIENSP-GDKPAGIFLESFDMPATPQTLDLGLT-CHIYPGIFQVNVVVPVGSGTST 78 (137) Q Consensus 1 m~~~Ir~al~~rl~~~a~~~~~pva~pN~~-F~pp~~~ylr~~~~p~~t~~~~l~~~-~~~~~G~~qI~v~~p~G~G~~~ 78 (137) |. +..+|.+.|+..+..+ .||.+. -.++-.||+-...+-+.. ...|.|. +......+||+|+.. -..+ T Consensus 1 Ms--~e~~l~a~L~~~~~~R----vyp~~aP~~~~~~Pyiv~q~vsg~p-~~~l~G~~~~~~~~rvQIdvyA~---t~~~ 70 (118) T protein:vir:81 1 MS--YGRVLKDLLDPVFSGR----VYADIPPDSPPLDAYAIYQRVGGVP-VYWQEGGMPEKVNARVQIQIWSR---SKQE 70 (118) T ss_pred Cc--hHHHHHHHHHhhcCCc----cccccCCCCCccCceEEEEecCCcc-cccccCCCCCccceeEEEEEeeC---CHHH Confidence 76 2245555566655432 455432 122224888888887765 4447665 444467899999975 4678 Q ss_pred HHHHHHHHHHhhhccceeccCCeEEEecccccccCCcccCCCCEEEEEEEEEEEEEecC Q lcl|NC_018454. 79 GRALARQVAALFPEGQSVQGDGFACWISSQPSIYAGVLNPRNTRYSIPVSIPYRADISS 137 (137) Q Consensus 79 ~~~~Ad~i~a~F~~g~~l~~~~~~v~v~~~p~v~~g~~~~~~~~~~ipVsi~yrad~~~ 137 (137) |.+++++|++.......+ . .+.++. + . -+++.+.|..-+-|.- -+.|| T Consensus 71 A~~l~~av~~al~~~~~~-----~-~~~~~~--d-~-ye~dt~l~r~~~Df~i-w~~~~ 118 (118) T protein:vir:81 71 AYLATVQVLRLVSEAPDM-----Q-VLSQPI--D-D-YVREIKLYGSRVDVSM-WYPIT 118 (118) T ss_pred HHHHHHHHHHHhhhccce-----e-eccCCc--c-c-cccccCceeEEEEEEE-EecCC Confidence 888898888887443221 1 111211 1 1 1223345555444331 12233 No 40 >protein:vir:397 Length: 132 # NCBI annotation: gp12 # Family: family:all:911 # MgeID: mge:325 # MgeName: N15 # Cross-refs: genbank:acc:NP_046907;genbank:gi:9630477;genbank:GeneID:1261651 Probab=50.57 E-value=0.61 Score=21.78 Aligned_cols=122 Identities=14% Similarity=0.108 Sum_probs=72.4 Q ss_pred Cc-hHHHHHHHHHHHHhhcCCCCceeeCCCC-CCCC-CCcEEEEEEcCCCceeeecCCCceEEEEEEEEEEEEecCCChH Q lcl|NC_018454. 1 MI-PDIGAAMNARLGAWADGQKIPLFIENSP-GDKP-AGIFLESFDMPATPQTLDLGLTCHIYPGIFQVNVVVPVGSGTS 77 (137) Q Consensus 1 m~-~~Ir~al~~rl~~~a~~~~~pva~pN~~-F~pp-~~~ylr~~~~p~~t~~~~l~~~~~~~~G~~qI~v~~p~G~G~~ 77 (137) |+ .+||+++..+|.+--. ..-..|.+.+ |... ..|=..|++-.+.....+|.+ ...+..|.|.||-|+.++-. T Consensus 1 ~~ht~IR~~Vid~L~~~l~--~~~~ffdGrP~fiDe~elPAVAV~l~d~~~~~~~ld~--~~w~A~LhI~iyLka~~~ds 76 (132) T protein:vir:39 1 MKHRDIRKVIIDALESAIG--TDAIYFDGRPAVLEEGDFPAVAVYLTDAEYTGEELDA--DTWQAILHIEVFLEAQVPDS 76 (132) T ss_pred CchHHHHHHHHHHHHhhCC--CceEEecCcceeeccccCcEEEEEeecCCCCcceecC--CeeEEEEEEEEEeecCCCHH Confidence 76 6899999999976322 1223455543 4333 357777777777766666653 47789999999999999999 Q ss_pred HHHHHHHHHHHhhhccceec-cCCeEEEecccccccCCcc---c-CCCCEEEE--EEEEEEEE Q lcl|NC_018454. 78 TGRALARQVAALFPEGQSVQ-GDGFACWISSQPSIYAGVL---N-PRNTRYSI--PVSIPYRA 133 (137) Q Consensus 78 ~~~~~Ad~i~a~F~~g~~l~-~~~~~v~v~~~p~v~~g~~---~-~~~~~~~i--pVsi~yra 133 (137) +..++|.++ .||.-.... .+++...+ ..+|+. | +..+|... --+|+|.. T Consensus 77 ~LD~~aE~~--i~p~i~~~~~l~~l~~~~-----~~~gy~Y~rD~~~atW~sadL~y~ItY~~ 132 (132) T protein:vir:39 77 ELDDWMETR--VYPVLAEVPGLESLITTM-----VQQGYDYQRDDDMALWSSADLKYSITYDM 132 (132) T ss_pred HHHHHHHHH--hHhhhcccchhhhHhhhh-----hhcCCCcccccccceEEEEEEEEEEEEeC Confidence 999999987 333222111 12221111 112221 1 22235443 33455555 No 41 >protein:vir:79571 Length: 137 # NCBI annotation: putative tail component # Family: family:all:911 # MgeID: mge:1871 # MgeName: cdtI # Cross-refs: genbank:acc:YP_001272522;genbank:gi:148609391;genbank:GeneID:5204407 Probab=31.60 E-value=1.5 Score=19.64 Aligned_cols=122 Identities=13% Similarity=0.075 Sum_probs=72.4 Q ss_pred Cc--hHHHHHHHHHHHHhhcCCCCceeeCCCC-CCCCC-CcEEEEEEcCCCceeeecCCCceEEEEEEEEEEEEecCCCh Q lcl|NC_018454. 1 MI--PDIGAAMNARLGAWADGQKIPLFIENSP-GDKPA-GIFLESFDMPATPQTLDLGLTCHIYPGIFQVNVVVPVGSGT 76 (137) Q Consensus 1 m~--~~Ir~al~~rl~~~a~~~~~pva~pN~~-F~pp~-~~ylr~~~~p~~t~~~~l~~~~~~~~G~~qI~v~~p~G~G~ 76 (137) |+ .+||+++..+|..-- +.....|.+.+ |.... .|=..|++-.+.....++..+ ..+..|.|.||-|+.++- T Consensus 5 M~iht~IR~~Vid~L~~~l--~~~~~ffdGrP~fiDe~ElPAVAV~l~da~~~~~~ld~~--~W~A~LhI~iyLka~~~d 80 (137) T protein:vir:79 5 MNRHTQIRQVVLARLREQC--GDSATFFDGLPAFVDAQELPAVSVWLSDAQYTGKMTDED--DWQAVLHIAVFIRAQAPD 80 (137) T ss_pred hHHHHHHHHHHHHHHHhhc--CCcEEEeCCccceechhhCcEEEEEeecCCCCcceecCC--eeEEEEEEEEEeecCCCH Confidence 65 699999999996532 22233455553 65543 476777776666666666444 578999999999999999 Q ss_pred HHHHHHHHH-HHHhhhccceeccCCeEEEecccccccCCcc---cC-CCCEEEEEE--EEEEEE Q lcl|NC_018454. 77 STGRALARQ-VAALFPEGQSVQGDGFACWISSQPSIYAGVL---NP-RNTRYSIPV--SIPYRA 133 (137) Q Consensus 77 ~~~~~~Ad~-i~a~F~~g~~l~~~~~~v~v~~~p~v~~g~~---~~-~~~~~~ipV--si~yra 133 (137) .+..++|.+ |..-.+....|. ++.-.+ ...|+. |+ ..+|...-+ +|+|.- T Consensus 81 s~LD~~~E~~I~~v~~~~~~l~--~l~~~~-----~~~gY~Y~rD~e~~tW~sadL~y~ItYe~ 137 (137) T protein:vir:79 81 SELDMWMESTIFPALNDVPALS--GLIDTL-----IPLGFNYQRDNEMATWAMAEITYQITYTN 137 (137) T ss_pred HHHHHHHHHHHHHhhcchhhhh--hHhhhh-----hcccCCcccccccceeEEEEEEEEEEEcC Confidence 999999997 555443333322 110000 011221 11 223554443 455544 No 42 >protein:vir:79047 Length: 145 # NCBI annotation: hypothetical protein # Family: family:all:1535 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110730;genbank:gi:134287347;genbank:GeneID:4955221 Probab=30.88 E-value=1.5 Score=19.55 Aligned_cols=121 Identities=7% Similarity=0.035 Sum_probs=62.4 Q ss_pred CchHHHHHHHHHHHH-hhcCCCCceeeCCCC--CCCCCCcEEEEEEcCCCceeeecCCCceEEEEEEEEEEEE-ecC-CC Q lcl|NC_018454. 1 MIPDIGAAMNARLGA-WADGQKIPLFIENSP--GDKPAGIFLESFDMPATPQTLDLGLTCHIYPGIFQVNVVV-PVG-SG 75 (137) Q Consensus 1 m~~~Ir~al~~rl~~-~a~~~~~pva~pN~~--F~pp~~~ylr~~~~p~~t~~~~l~~~~~~~~G~~qI~v~~-p~G-~G 75 (137) |+-+|+.++...|.+ +... ++|-=+.+. |.+|- -=+.+++.... ..+ +. ++.=.+.++|.+ |.+ .. T Consensus 1 mi~dI~~aI~~~Lk~~Fp~~--~~IY~e~i~Qgf~~Pc---FFI~ll~~~~~-~~~-~~--r~~r~~~~dI~Yfp~~~~~ 71 (145) T protein:vir:79 1 MLNNIIDGISVKLDKSFGEK--YTIYSEDVEQGINEPC---FFIVPLNPSKT-PYP-SG--RELKKNSFDVHYFPRSEAK 71 (145) T ss_pred ChHHHHHHHHHHHHHhcCCc--eEEEecccccCccCCe---eEEEEeccccc-ccc-Cc--eEEEEEEEEEEEeecCCCC Confidence 999999999999986 6422 467777764 66663 11233332221 122 12 222234444433 543 45 Q ss_pred hHHHHHHHHHHHHhhhccceeccCCeEEEecccccccCCcccCCCCEEEEEEEEEEEEEecC Q lcl|NC_018454. 76 TSTGRALARQVAALFPEGQSVQGDGFACWISSQPSIYAGVLNPRNTRYSIPVSIPYRADISS 137 (137) Q Consensus 76 ~~~~~~~Ad~i~a~F~~g~~l~~~~~~v~v~~~p~v~~g~~~~~~~~~~ipVsi~yrad~~~ 137 (137) ...+.++|++|-+.|++ ..+ ++-.+.+...- .-+. ++.-++.+.++......... T Consensus 72 ~~e~~ev~e~L~~~le~-i~v--~~~~~~~~~~~---~eiv-DgvLhf~~~~~~~~~k~~~~ 126 (145) T protein:vir:79 72 NFEINEIAEMLLEELEY-IEI--NGDLVRGTNMN---FEII-DNVLHFFVDYNYFTIKSNNA 126 (145) T ss_pred chhHHHHHHHHHhhhcc-eee--cCcEEeeecce---eEEe-eceEEEEEEEEEEEeeecCc Confidence 66899999999999943 443 34455554431 1122 23223433333322222111 No 43 >protein:vir:80105 Length: 162 # NCBI annotation: gp13 # Family: family:all:2729 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468717;genbank:gi:157325297;genbank:GeneID:5601796 Probab=28.63 E-value=1.7 Score=19.28 Aligned_cols=127 Identities=15% Similarity=0.127 Sum_probs=63.3 Q ss_pred CchHHHHHHHHHHHHhhcCCCCceeeCCCCCCCCCCcEEEEEEc-CCCceeeecCCCceEEEEEEEEEEEEecCCChHHH Q lcl|NC_018454. 1 MIPDIGAAMNARLGAWADGQKIPLFIENSPGDKPAGIFLESFDM-PATPQTLDLGLTCHIYPGIFQVNVVVPVGSGTSTG 79 (137) Q Consensus 1 m~~~Ir~al~~rl~~~a~~~~~pva~pN~~F~pp~~~ylr~~~~-p~~t~~~~l~~~~~~~~G~~qI~v~~p~G~G~~~~ 79 (137) |+.-|+..|.+ +. .++.+...+..-.-|.-||....++ |-.....++. ++..+.=.+|++|+.-.. .+| T Consensus 13 lv~~ii~~i~~----~~--~gl~vI~~~~~g~~p~yPF~TY~v~~pyi~~~~~~~-~~e~~~~~isi~~~S~~~---~eA 82 (162) T protein:vir:80 13 LVKTLINAVNE----LS--GGLQLIESSSGGEQPEYPFCQYTITSPYIAISPDIV-EGEQFEIVISLTWRALSG---HQA 82 (162) T ss_pred HHHHHHHHHHh----hh--cceeEEEccCCCCCCCCCeEEEEEecCccccCCccc-CCcceEEEEEEEEEeCCH---HHH Confidence 55555554433 22 2456777776666678888887752 2112222221 233555668888887655 899 Q ss_pred HHHHHHHHHhhhc--c-ceeccC-Ce-EEEecccccccCCcccCCCCEEEEEEEEEEEEEecC Q lcl|NC_018454. 80 RALARQVAALFPE--G-QSVQGD-GF-ACWISSQPSIYAGVLNPRNTRYSIPVSIPYRADISS 137 (137) Q Consensus 80 ~~~Ad~i~a~F~~--g-~~l~~~-~~-~v~v~~~p~v~~g~~~~~~~~~~ipVsi~yrad~~~ 137 (137) .++|.+|.++|.. . -.+..+ |. .+.+...-+..--...+.+=+|-.=++|+|+-.-.+ T Consensus 83 l~la~~l~~~f~~~~~~~~~~~~~gIvvvdv~~~~~R~~~~~~~yerR~GFD~~~Rv~r~~e~ 145 (162) T protein:vir:80 83 LNLANITNKYFRSQKGRFFMQENGGIVVVSVQNSGLRDTFISIEYERSAGIDLRLRVVDSYSS 145 (162) T ss_pred HHHHHHHHHHhhcCCceeeeeecCcEEEEecCCCccceeEeeeeeeeeecceEEEEEeecccc Confidence 9999999999942 2 122222 32 232222111110000111113444455555432211 No 44 >protein:vir:105772 Length: 128 # NCBI annotation: gp15 # Family: family:all:10994 # MgeID: mge:1501 # MgeName: ES18 # Cross-refs: genbank:acc:YP_224153;genbank:gi:62362228;genbank:GeneID:3342525 Probab=28.39 E-value=1.8 Score=19.25 Aligned_cols=121 Identities=12% Similarity=0.123 Sum_probs=69.2 Q ss_pred Cc-hHHHHHHHHHHHHhhcCCCCc---eeeCCCCCCCCCCcEEEEEEcCCCceeeecCCCceEEEEEEEEEEEEecCCCh Q lcl|NC_018454. 1 MI-PDIGAAMNARLGAWADGQKIP---LFIENSPGDKPAGIFLESFDMPATPQTLDLGLTCHIYPGIFQVNVVVPVGSGT 76 (137) Q Consensus 1 m~-~~Ir~al~~rl~~~a~~~~~p---va~pN~~F~pp~~~ylr~~~~p~~t~~~~l~~~~~~~~G~~qI~v~~p~G~G~ 76 (137) |+ +++..+++..|..-....++. ..|....-+ -+.+|+-+-= .+.+...++.+.+ -++|.++.-++.|. T Consensus 1 ~~~~~m~~~vr~~l~daGLt~GftvQl~~W~d~~g~-~~e~~iV~qp-NGGt~i~d~~~~d-----y~~i~~Vsg~~d~~ 73 (128) T protein:vir:10 1 MTRSEVYDALRVWLQSHGFDVGYRVQKRFWNEQEGT-EGERYLVIQQ-NGGGKPEEAITRD-----FFRILVLSGQNDSD 73 (128) T ss_pred CchhHHHHHHHHHHHhCCCcchheeeeeeeeccCCC-CCceEEEEec-CCCCchhhhcccc-----eeEEEEEeecCCCc Confidence 54 566666666666544444444 457664311 1347776654 3333333444443 57788888888887 Q ss_pred -HHHHHHHHHHHHhhhccceeccCCeEEEeccccccc--CCcccCCCCEEEEEEEEEEEEEecC Q lcl|NC_018454. 77 -STGRALARQVAALFPEGQSVQGDGFACWISSQPSIY--AGVLNPRNTRYSIPVSIPYRADISS 137 (137) Q Consensus 77 -~~~~~~Ad~i~a~F~~g~~l~~~~~~v~v~~~p~v~--~g~~~~~~~~~~ipVsi~yrad~~~ 137 (137) .++++.|++|.++-..+-. + +.+. +|. .++ |+ ..++++++ -.++.|||-.+- T Consensus 74 ~~~ve~ra~~Ii~yv~~np~-~-~cig-~i~---n~Ggipp-i~T~EgR~--ifrL~f~~i~~~ 128 (128) T protein:vir:10 74 INEVEDRADAIRQAMIDDYR-T-ECII-SMQ---PVGGITA-IQTEEGRY--LFDISFQTIISR 128 (128) T ss_pred chhHHHHHHHHHHHHHhCcc-c-cccc-eee---ccCCCCC-ccccCCce--eeeehhhhhhcC Confidence 4799999999999855432 2 1111 121 122 22 22555543 456777887777 No 45 >protein:vir:3874 Length: 114 # NCBI annotation: putative head-tail joining protein # Family: family:all:28620 # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680491;swissprot:trembl:p94215;genbank:gi:22296531;uniprot:P94215;genbank:GeneID:951676 Probab=22.86 E-value=2.4 Score=18.51 Aligned_cols=108 Identities=11% Similarity=0.062 Sum_probs=55.1 Q ss_pred CchHHHHH--HH--HHHHHhhcCCCCceeeCCCCCCCC-CCcEEEEEEcCCCceeeecCCCce-EEEEEEEEEEEEecCC Q lcl|NC_018454. 1 MIPDIGAA--MN--ARLGAWADGQKIPLFIENSPGDKP-AGIFLESFDMPATPQTLDLGLTCH-IYPGIFQVNVVVPVGS 74 (137) Q Consensus 1 m~~~Ir~a--l~--~rl~~~a~~~~~pva~pN~~F~pp-~~~ylr~~~~p~~t~~~~l~~~~~-~~~G~~qI~v~~p~G~ 74 (137) |-|+++-+ |- .+|++..-+ +++..=|+.+|.+- ..||.|++.+|++..... ++.+ .+---+||+ ||=.-. T Consensus 1 ~~PE~~vaDiLsad~~lv~~mYi-pift~tpdd~fik~SsAPWiRiTpiPGDda~ya--DD~R~~EYPrVqVD-fWvr~e 76 (114) T protein:vir:38 1 MAPEKRVYDILSANLDIADKVYI-GTPNFNNQTSATPESLAPWVRITYLPGDAADYA--DDSRILEYPKVQVD-FWVGIT 76 (114) T ss_pred CCchhhhhhhhccchhhhhheec-cCCCCCCCCcccccccCCeeEeeecCCcccccc--ccceeeecCceeEE-EeeccC Confidence 87766532 11 122222111 12333444567765 469999999999976543 3333 122456776 555678 Q ss_pred ChHHHHHHHHHHHHhhhccceeccCCeEEEecccccccCCcccCCC Q lcl|NC_018454. 75 GTSTGRALARQVAALFPEGQSVQGDGFACWISSQPSIYAGVLNPRN 120 (137) Q Consensus 75 G~~~~~~~Ad~i~a~F~~g~~l~~~~~~v~v~~~p~v~~g~~~~~~ 120 (137) |....+++-.+|-+..... +=-+-|.++-+. |+.++-. T Consensus 77 ~~d~~e~iqe~IY~~Lha~-----gweRYY~nsY~D---~~~~~~~ 114 (114) T protein:vir:38 77 DWDQQEKIETQIYQALHAA-----DWERYYRNSYVD---GIPQPFA 114 (114) T ss_pred ChhhHHHHHHHHHHHHHhc-----CcceeeeccccC---CCCCCCC Confidence 8888888887775543211 111233333321 1222111 Done!