Query lcl|NC_017981.1_cdsid_YP_006383621.1 [gene=DIBBI_gp14] [protein=hypothetical protein] [protein_id=YP_006383621.1] [location=11136..11594] Match_columns 152 No_of_seqs 19 out of 22 Neff 3.2 Searched_HMMs 1612 Date Thu Nov 7 13:36:35 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_14 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_14_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:95261 Length: 133 100.0 7.5E-42 4.6E-45 246.0 9.6 115 6-152 1-125 (133) 2 protein:vir:5258 Length: 123 # 99.8 1.3E-22 7.9E-26 140.6 8.3 104 1-152 3-112 (123) 3 protein:vir:96107 Length: 133 97.2 3.3E-06 2.1E-09 50.6 7.2 105 1-152 15-125 (133) 4 protein:vir:99571 Length: 131 96.7 1.5E-05 9.6E-09 47.0 7.2 105 1-152 15-126 (131) 5 protein:vir:107716 Length: 132 96.1 4.9E-05 3.1E-08 44.2 6.7 106 1-152 15-127 (132) 6 protein:vir:97237 Length: 122 94.8 0.00032 2E-07 39.8 6.7 108 1-149 9-122 (122) 7 protein:vir:5977 Length: 109 # 94.7 0.001 6.3E-07 37.0 9.4 104 6-152 1-105 (109) 8 protein:vir:100244 Length: 109 93.9 0.0032 2E-06 34.3 10.3 104 1-152 1-105 (109) 9 protein:vir:81177 Length: 109 92.6 0.0062 3.8E-06 32.7 9.8 103 2-152 1-109 (109) 10 protein:vir:79639 Length: 123 92.5 0.00064 4E-07 38.1 4.4 104 1-152 18-122 (123) 11 protein:vir:1436 Length: 108 # 92.2 0.0084 5.2E-06 32.0 10.0 103 2-152 1-104 (108) 12 protein:vir:100134 Length: 109 92.2 0.01 6.5E-06 31.5 10.5 107 1-152 1-108 (109) 13 protein:vir:106729 Length: 152 92.1 0.0034 2.1E-06 34.1 7.8 109 1-152 10-121 (152) 14 protein:vir:80342 Length: 108 92.1 0.0089 5.5E-06 31.8 10.0 103 2-152 1-104 (108) 15 protein:vir:78609 Length: 152 92.0 0.0033 2E-06 34.2 7.5 109 1-152 10-121 (152) 16 protein:vir:107606 Length: 107 91.8 0.011 6.6E-06 31.4 10.2 104 5-152 1-107 (107) 17 protein:vir:105006 Length: 107 91.8 0.011 6.6E-06 31.4 10.2 104 5-152 1-107 (107) 18 protein:vir:102084 Length: 107 91.8 0.011 6.6E-06 31.4 10.2 104 5-152 1-107 (107) 19 protein:vir:102856 Length: 107 91.8 0.011 6.6E-06 31.4 10.2 104 5-152 1-107 (107) 20 protein:vir:94034 Length: 141 90.3 0.0054 3.3E-06 33.0 7.0 110 1-152 13-125 (141) 21 protein:vir:101560 Length: 152 89.9 0.0041 2.6E-06 33.7 6.0 109 1-152 10-121 (152) 22 protein:vir:77651 Length: 152 89.6 0.0047 2.9E-06 33.4 6.1 109 1-152 10-121 (152) 23 protein:vir:104346 Length: 123 88.9 0.015 9.3E-06 30.6 8.4 106 1-152 13-122 (123) 24 protein:vir:107669 Length: 123 88.4 0.0089 5.5E-06 31.8 6.8 102 1-152 18-122 (123) 25 protein:vir:4955 Length: 116 # 87.6 0.034 2.1E-05 28.6 9.4 112 1-152 1-114 (116) 26 protein:vir:4459 Length: 134 # 85.6 0.051 3.2E-05 27.7 10.3 109 1-152 14-123 (134) 27 protein:vir:1890 Length: 110 # 82.4 0.077 4.8E-05 26.7 10.2 105 2-152 1-110 (110) 28 protein:vir:81216 Length: 118 81.3 0.042 2.6E-05 28.1 7.0 107 6-151 1-118 (118) 29 protein:vir:100226 Length: 114 80.9 0.091 5.6E-05 26.3 9.9 112 1-152 1-113 (114) 30 protein:vir:4832 Length: 116 # 79.9 0.097 6E-05 26.1 8.5 112 1-152 1-114 (116) 31 protein:vir:93599 Length: 116 78.2 0.12 7.3E-05 25.7 9.9 107 1-152 2-113 (116) 32 protein:vir:4999 Length: 116 # 76.7 0.13 8.2E-05 25.4 9.7 112 1-152 1-114 (116) 33 protein:vir:1385 Length: 107 # 75.7 0.14 8.9E-05 25.2 8.8 102 10-150 1-107 (107) 34 protein:vir:4858 Length: 116 # 74.7 0.15 9.6E-05 25.0 9.7 112 1-152 1-114 (116) 35 protein:vir:193 Length: 112 # 73.0 0.18 0.00011 24.7 9.3 104 2-152 1-109 (112) 36 protein:vir:80114 Length: 101 72.2 0.16 9.8E-05 25.0 7.5 100 1-152 1-100 (101) 37 protein:vir:96294 Length: 111 71.4 0.088 5.4E-05 26.4 5.9 107 1-152 1-111 (111) 38 protein:vir:105908 Length: 111 71.4 0.088 5.4E-05 26.4 5.9 107 1-152 1-111 (111) 39 protein:vir:80382 Length: 122 70.6 0.096 6E-05 26.2 5.9 109 1-149 1-122 (122) 40 protein:vir:965 Length: 97 # N 70.3 0.11 7E-05 25.8 6.3 92 16-152 1-93 (97) 41 protein:vir:94092 Length: 114 70.2 0.084 5.2E-05 26.5 5.5 109 1-152 2-114 (114) 42 protein:vir:5743 Length: 117 # 70.2 0.21 0.00013 24.3 10.5 106 1-152 5-115 (117) 43 protein:vir:4513 Length: 98 # 69.0 0.23 0.00014 24.1 8.1 79 30-152 1-88 (98) 44 protein:vir:1027 Length: 116 # 64.1 0.31 0.00019 23.4 9.9 112 1-152 1-115 (116) 45 protein:vir:102144 Length: 113 63.6 0.31 0.00019 23.3 10.8 112 2-150 1-113 (113) 46 protein:vir:100886 Length: 113 62.0 0.34 0.00021 23.1 9.7 111 2-152 1-112 (113) 47 protein:vir:4343 Length: 118 # 62.0 0.34 0.00021 23.1 10.0 107 2-152 1-113 (118) 48 protein:vir:3993 Length: 117 # 61.1 0.36 0.00022 23.0 10.0 112 1-152 1-114 (117) 49 protein:vir:96131 Length: 111 56.5 0.24 0.00015 23.9 5.4 102 1-152 1-111 (111) 50 protein:vir:96830 Length: 111 55.1 0.18 0.00011 24.7 4.4 104 1-152 1-111 (111) 51 protein:vir:3872 Length: 146 # 52.8 0.55 0.00034 22.0 8.5 103 1-150 37-146 (146) 52 protein:vir:7411 Length: 116 # 52.4 0.56 0.00034 22.0 9.6 112 1-152 1-115 (116) 53 protein:vir:107100 Length: 109 49.2 0.32 0.0002 23.3 4.8 102 1-152 1-109 (109) 54 protein:vir:105322 Length: 109 49.0 0.32 0.0002 23.3 4.8 101 1-152 1-109 (109) 55 protein:vir:95373 Length: 101 44.7 0.8 0.00049 21.1 7.2 100 1-152 1-101 (101) 56 protein:vir:1242 Length: 111 # 44.4 0.72 0.00044 21.4 5.9 104 3-152 1-107 (111) 57 protein:vir:105035 Length: 112 29.7 1.6 0.001 19.4 9.7 104 1-152 1-111 (112) 58 protein:vir:94767 Length: 104 28.7 1.5 0.00092 19.6 5.0 90 14-143 1-104 (104) 59 protein:vir:94797 Length: 111 27.9 1.4 0.00085 19.8 4.7 102 3-152 1-107 (111) 60 protein:vir:97328 Length: 111 24.4 1.8 0.0011 19.1 4.7 102 3-152 1-107 (111) 61 protein:vir:94713 Length: 785 21.6 2.6 0.0016 18.3 5.4 147 1-152 391-566 (785) 62 protein:vir:95069 Length: 111 21.4 2.1 0.0013 18.8 4.4 102 3-152 1-107 (111) 63 protein:vir:80104 Length: 124 21.4 1.7 0.001 19.4 3.8 111 1-152 4-123 (124) 64 protein:vir:3616 Length: 103 # 21.3 2.6 0.0016 18.3 7.7 98 10-152 1-103 (103) 65 protein:vir:93739 Length: 111 21.0 2.2 0.0014 18.7 4.4 102 3-152 1-107 (111) 66 protein:vir:94491 Length: 111 21.0 2.2 0.0014 18.7 4.4 102 3-152 1-107 (111) 67 protein:vir:97429 Length: 111 21.0 2.2 0.0014 18.7 4.4 102 3-152 1-107 (111) No 1 >protein:vir:95261 Length: 133 # NCBI annotation: Phage hypothetical protein # Family: family:all:31736 # MgeID: mge:1561 # MgeName: Felix 01 # Cross-refs: genbank:acc:NP_944894;genbank:gi:38707834;genbank:GeneID:2744047 Probab=100.00 E-value=7.5e-42 Score=246.05 Aligned_cols=115 Identities=27% Similarity=0.411 Sum_probs=96.2 Q ss_pred CCCccccceee-EecCCCcccC--CeecCCCCe-eEEEEEEEEeec--CCccee--eccccccceeeeEEeecccccccc Q lcl|NC_017981. 6 PFNKFRKDHTV-IIVSDSYFKD--GVIMPGERM-YTKAKFSVQAIK--NNEEIQ--GFAEGRRVSDWRRLYSDTKLPLAG 77 (152) Q Consensus 6 ~~~~FRk~~~V-~r~~~G~Yv~--GrwV~G~~~-~~ti~aSVQPi~--d~e~~q--~lpeGrRitd~~rIYTd~~L~vag 77 (152) -=..||++.++ |++++|.|+| |+||+|++. +++|+|||||++ +.++++ .||||+|++|++|||||++|+||| T Consensus 1 M~~~~rhs~~~~R~~seg~Y~~~~GrWV~g~~~v~~~i~asIQP~~~ss~~~~q~~~lpeGrrit~avrIYTda~L~vag 80 (133) T protein:vir:95 1 MRLLNRHSFVVKRKVSEDGYYNDDGDWVASQDIVEVNCKGNIQPYIKGSVKNGTQIALPEGIRLTDTRILYTTYKLRTSD 80 (133) T ss_pred CCccccceeEEEEeecCCceEccCCcccCCCCccceeeeeeecccccccccccchhcccCCeeeeeEEEEEeeeeeeeec Confidence 22445555544 5559999987 999999876 899999999944 444555 589999999999999999999999 Q ss_pred cccccccccccccccccccccccccccccccccCCCCCccEEEECCceEEEEEecchhhCc--ccceeeeeeeecCC Q lcl|NC_017981. 78 DQLFISASVDVNKDTLATEDGKVIAVGAYIGEKEGMSNPALVVIDGREYEVIERISWQNGI--ISHYKYYVVRKTYG 152 (152) Q Consensus 78 d~~~~~a~~~~n~~~l~~~~~~~~~~~~~~~~~e~~~n~d~v~~~G~~YeVi~~~~wq~Gi--i~HykYl~Vr~~d~ 152 (152) |. .+.++|+|+|||.+||||++++||+|+ |||||||+||+.-= T Consensus 81 e~--------------------------------~~~~gDvvl~dg~eYev~~r~~w~~Gv~~isHyrY~aVR~~~~ 125 (133) T protein:vir:95 81 DV--------------------------------EWNESDIVMIDGHEYEVFMTMDWSQQLSHTSHYEYIIIRRDKM 125 (133) T ss_pred cc--------------------------------ccCCCcEEEEcCCceEEEEecchhhccccCCceeEEEEeecch Confidence 74 445579999999999999999999999 99999999998654 No 2 >protein:vir:5258 Length: 123 # NCBI annotation: hypothetical protein # Family: family:all:4880 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852763;genbank:gi:31544038;uniprot:Q776V7;genbank:GeneID:2777139 Probab=99.78 E-value=1.3e-22 Score=140.63 Aligned_cols=104 Identities=21% Similarity=0.338 Sum_probs=85.6 Q ss_pred CcCCCC-C--CccccceeeEecCCCcccCCeecCCCCeeEEEEEEEEeecCCcceeeccccccceeeeEEeecccccccc Q lcl|NC_017981. 1 MIGNGP-F--NKFRKDHTVIIVSDSYFKDGVIMPGERMYTKAKFSVQAIKNNEEIQGFAEGRRVSDWRRLYSDTKLPLAG 77 (152) Q Consensus 1 ~~~~~~-~--~~FRk~~~V~r~~~G~Yv~GrwV~G~~~~~ti~aSVQPi~d~e~~q~lpeGrRitd~~rIYTd~~L~vag 77 (152) ||--.+ | ..||+..+|. +++|.|++|.|... .+..++.|+|||.+ .++++.+|||+||+.+++|||+++|.+ T Consensus 3 ~ldvs~v~ldpdF~~titv~-R~~g~~~~~g~~~~-t~~~t~~avVqP~~-~~dlq~LpeG~ri~~sIkI~Tq~~L~v-- 77 (123) T protein:vir:52 3 LINQSGRFLNSRFRQQITVQ-KQSGSHSASGFDVR-YEKQQITAIVIPTS-PNDVLLLPEGERYLPSIKVYTQQQLNI-- 77 (123) T ss_pred cccccccccCcccCceEEEE-ccCccEeCCccccc-cccceEEEEEeeCC-hhhcccccccccccceEEEEecccccc-- Confidence 332221 1 3689996666 58999999999777 45777999999998 688999999999999999999999975 Q ss_pred cccccccccccccccccccccccccccccccccCCCCCccEEEECCceEEEEEecchhhCcccceeee---eeeecCC Q lcl|NC_017981. 78 DQLFISASVDVNKDTLATEDGKVIAVGAYIGEKEGMSNPALVVIDGREYEVIERISWQNGIISHYKYY---VVRKTYG 152 (152) Q Consensus 78 d~~~~~a~~~~n~~~l~~~~~~~~~~~~~~~~~e~~~n~d~v~~~G~~YeVi~~~~wq~Gii~HykYl---~Vr~~d~ 152 (152) +|+|+|+|++|+|+++++| +||.|| ++|+.-- T Consensus 78 --------------------------------------GD~vlw~G~~YrVi~~~d~-----s~YGYy~~i~~~~~~t 112 (123) T protein:vir:52 78 --------------------------------------GDLVDYRGQTYKIKTAANW-----GDYGYYNNIGVRHSQT 112 (123) T ss_pred --------------------------------------ccEEEeCCcEEEEEEcCCc-----cccceecceeeccccc Confidence 3899999999999999999 999886 5776422 No 3 >protein:vir:96107 Length: 133 # NCBI annotation: conserved hypothetical protein ORF026 # Family: family:all:7161 # MgeID: mge:1597 # MgeName: F8 # Cross-refs: genbank:acc:YP_001294443;genbank:gi:149408340;genbank:GeneID:5237226 Probab=97.19 E-value=3.3e-06 Score=50.63 Aligned_cols=105 Identities=13% Similarity=0.115 Sum_probs=69.0 Q ss_pred CcCCCCCCccccceeeEecCCCccc--CCeecCCCCeeEEEEEEEEeecCCcceeeccccccceeeeEEeeccccccccc Q lcl|NC_017981. 1 MIGNGPFNKFRKDHTVIIVSDSYFK--DGVIMPGERMYTKAKFSVQAIKNNEEIQGFAEGRRVSDWRRLYSDTKLPLAGD 78 (152) Q Consensus 1 ~~~~~~~~~FRk~~~V~r~~~G~Yv--~GrwV~G~~~~~ti~aSVQPi~d~e~~q~lpeGrRitd~~rIYTd~~L~vagd 78 (152) .|+.-|---|| ..++-. +|.|+.......+|.+|+||++.....+.=.+ --..++++||. +.+.+. T Consensus 15 VI~~Q~V~y~r--------f~~Rt~n~~gq~i~~y~~p~~i~gS~Q~V~~~~v~~~GLd--~~~~Yv~lf~s--~~i~~i 82 (133) T protein:vir:96 15 VIGTQLVQYRK--------FEQRTKNSQAQYVSVFGEPFQLAASIQRVRRDQYVQFNLE--FQRNYVMIFAN--FEMVDL 82 (133) T ss_pred hhccccchhhc--------ccccccccccceeeeecCCccceeeEEecChhheeecCcc--eeeeeeEeecC--cceeec Confidence 55555544444 333333 59999998888888999999975444433333 34689999974 333332 Q ss_pred ccccccccccccccccccccccccccccccccCCCCCccEEEECCceEEEEEecch--hhCcccceeeeeeee--cCC Q lcl|NC_017981. 79 QLFISASVDVNKDTLATEDGKVIAVGAYIGEKEGMSNPALVVIDGREYEVIERISW--QNGIISHYKYYVVRK--TYG 152 (152) Q Consensus 79 ~~~~~a~~~~n~~~l~~~~~~~~~~~~~~~~~e~~~n~d~v~~~G~~YeVi~~~~w--q~Gii~HykYl~Vr~--~d~ 152 (152) +-.-.||.++|+|+.|.|....+| |.|-= -.|||.. .|| T Consensus 83 --------------------------------qRg~agD~liwnGrr~~v~g~~dW~~QDGW~---~~lcv~~G~~~g 125 (133) T protein:vir:96 83 --------------------------------DRDLAGDQFIWTGRVFQLESQGSWFYQDGWG---VCLAVDIGTAKL 125 (133) T ss_pred --------------------------------ccCCCCCEEEECCeEEEecccceeeeeccce---EEEEEeecCCCC Confidence 223457999999999999999999 66642 1456654 233 No 4 >protein:vir:99571 Length: 131 # NCBI annotation: hypothetical protein # Family: family:all:7161 # MgeID: mge:1544 # MgeName: BcepF1 # Cross-refs: genbank:acc:YP_001039798;genbank:gi:126011048;genbank:GeneID:4818266 Probab=96.71 E-value=1.5e-05 Score=46.95 Aligned_cols=105 Identities=16% Similarity=0.189 Sum_probs=67.6 Q ss_pred CcCCCCCCccccceeeEecCCCccc--CCeecCCCCeeEEEEEEEEeecCCcceeeccccccceeeeEEeeccccccccc Q lcl|NC_017981. 1 MIGNGPFNKFRKDHTVIIVSDSYFK--DGVIMPGERMYTKAKFSVQAIKNNEEIQGFAEGRRVSDWRRLYSDTKLPLAGD 78 (152) Q Consensus 1 ~~~~~~~~~FRk~~~V~r~~~G~Yv--~GrwV~G~~~~~ti~aSVQPi~d~e~~q~lpeGrRitd~~rIYTd~~L~vagd 78 (152) .|+.-|---||-. ++-. +|.|+.......+|.+|+||++.....+.=.+ --..++++||.-. +.+. T Consensus 15 VI~~Q~V~y~rf~--------~Rt~n~~gq~i~~y~~p~~i~gS~Q~V~~~~v~~~GLd--~~~~Yv~lfts~~--i~~i 82 (131) T protein:vir:99 15 VIALTPVPYLRFT--------QRVLNPARQWITTYAAAVDVPMSVQRVPRNKYVQFGLE--FQRNYVRLFAPIE--MVDL 82 (131) T ss_pred hhccccchhhccc--------ccccccccceeeeecCCccceeeEEecChhheeecCcc--eeeeEEEEeecCc--ceec Confidence 5555554444432 2333 59999998888888999999975444433232 3468999999533 2221 Q ss_pred ccccccccccccccccccccccccccccccccCCCCCccEEEECCceEEEEEecch--hhCcccceeeeeeee---cCC Q lcl|NC_017981. 79 QLFISASVDVNKDTLATEDGKVIAVGAYIGEKEGMSNPALVVIDGREYEVIERISW--QNGIISHYKYYVVRK---TYG 152 (152) Q Consensus 79 ~~~~~a~~~~n~~~l~~~~~~~~~~~~~~~~~e~~~n~d~v~~~G~~YeVi~~~~w--q~Gii~HykYl~Vr~---~d~ 152 (152) +-.-.||.++|+|+.|.|....+| |.|-=. .|||.. +|- T Consensus 83 --------------------------------qRg~agD~liwnGrr~~v~g~~dW~~QDGW~~---~lcv~~Gi~~~~ 126 (131) T protein:vir:99 83 --------------------------------DRDCGGDMIIWHGRQHKIESQNTWYLQDGWAM---SLAVDLGIRSDQ 126 (131) T ss_pred --------------------------------ccCCCCCEEEECCeEEEecccceeeeeccceE---EEEEEeecccCc Confidence 223457999999999999999999 555321 345432 222 No 5 >protein:vir:107716 Length: 132 # NCBI annotation: gp19 # Family: family:all:7161 # MgeID: mge:1520 # MgeName: BcepB1A # Cross-refs: genbank:acc:YP_024867;genbank:gi:48697509;genbank:GeneID:2948332 Probab=96.10 E-value=4.9e-05 Score=44.19 Aligned_cols=106 Identities=15% Similarity=0.178 Sum_probs=66.8 Q ss_pred CcCCCCCCccccceeeEecCCCccc--CCeecCCCCeeEEE-EEEEEeecCCcceeeccccccceeeeEEeecccccccc Q lcl|NC_017981. 1 MIGNGPFNKFRKDHTVIIVSDSYFK--DGVIMPGERMYTKA-KFSVQAIKNNEEIQGFAEGRRVSDWRRLYSDTKLPLAG 77 (152) Q Consensus 1 ~~~~~~~~~FRk~~~V~r~~~G~Yv--~GrwV~G~~~~~ti-~aSVQPi~d~e~~q~lpeGrRitd~~rIYTd~~L~vag 77 (152) .|+.-|---||-. ++-. +|.|+........+ ++|+||++.....+.=.+ --..++++||.-.- +. T Consensus 15 vI~~q~V~y~rf~--------~Rt~n~~gq~i~~y~~p~~i~~gS~Q~V~~~~v~~~GLd--~~~~Yv~lf~s~~~-i~- 82 (132) T protein:vir:10 15 LIASETVEYFAET--------GRTKQPNGVFIASYASPVPIEECSVQAVDRSKYTDLGLD--FQKTYVTWFVPNQA-FT- 82 (132) T ss_pred hhccccchhhccc--------ccccccccceeeeecCCcccccceeeecChhhheecccc--eeeeeeeEeecchh-hh- Confidence 6666655445433 2333 59999998766666 799999975444433233 34689999983210 00 Q ss_pred cccccccccccccccccccccccccccccccccCCCCCccEEEECCceEEEEEecch--hhCcccceeeeeeee--cCC Q lcl|NC_017981. 78 DQLFISASVDVNKDTLATEDGKVIAVGAYIGEKEGMSNPALVVIDGREYEVIERISW--QNGIISHYKYYVVRK--TYG 152 (152) Q Consensus 78 d~~~~~a~~~~n~~~l~~~~~~~~~~~~~~~~~e~~~n~d~v~~~G~~YeVi~~~~w--q~Gii~HykYl~Vr~--~d~ 152 (152) .-+-.-.||.++|+|+.|.|....+| |.|-= -.|||.. .|| T Consensus 83 -------------------------------~iqRg~agD~liwnGrr~~v~g~~dW~~QDGW~---~~lcv~~G~~~g 127 (132) T protein:vir:10 83 -------------------------------TIKRGKAGDVLEWNGGRYQMNGGIDWTGQDSWG---TATCVLIGPATG 127 (132) T ss_pred -------------------------------hcccCCCCCEEEECCeEEEecccceeeeeccce---EEEEEEecCccc Confidence 01223457999999999999999999 66542 1355543 233 No 6 >protein:vir:97237 Length: 122 # NCBI annotation: hypothetical protein ORF025 # Family: family:all:704 # MgeID: mge:1657 # MgeName: M6 # Cross-refs: genbank:acc:YP_001294533;genbank:gi:149408254;genbank:GeneID:5237102 Probab=94.82 E-value=0.00032 Score=39.78 Aligned_cols=108 Identities=15% Similarity=0.121 Sum_probs=71.6 Q ss_pred CcCCCCCCccccceeeEecCCCcccC--CeecCCCCe--eEEEEEEEEeecCCcceeeccccccceeeeEEeeccccccc Q lcl|NC_017981. 1 MIGNGPFNKFRKDHTVIIVSDSYFKD--GVIMPGERM--YTKAKFSVQAIKNNEEIQGFAEGRRVSDWRRLYSDTKLPLA 76 (152) Q Consensus 1 ~~~~~~~~~FRk~~~V~r~~~G~Yv~--GrwV~G~~~--~~ti~aSVQPi~d~e~~q~lpeGrRitd~~rIYTd~~L~va 76 (152) .+...-+.+|=++-+++|...|+|+. |-|.|+.++ ..++++-+-+++-.+. .|..|. =+|-+|-++ T Consensus 9 ~~a~~Li~kfG~~vtl~r~~~g~y~~~~g~~~p~~~t~~~~~~~gv~~~~~~~~i-----dGtlI~-----~GD~~l~~~ 78 (122) T protein:vir:97 9 ALAKKLIKKNGQAVTLRGFTAGAAPDPAKPWKPGGNVAADQTIEAVFLDYEQRYI-----DGQTIR-----MGDQRVFMP 78 (122) T ss_pred HHHHHHHHHhCCceEEEEeccceeCCCCCceecCCceeeeeeeEEEeeccchhhc-----cCcEEe-----ecCEEEEEe Confidence 33445567899999999999999995 789998765 5578898888764322 232221 122222211 Q ss_pred -ccccccccccccccccccccccccccccccccccCCCCCccEEEECCceEEEEEecc-hhhCcccceeeeeeee Q lcl|NC_017981. 77 -GDQLFISASVDVNKDTLATEDGKVIAVGAYIGEKEGMSNPALVVIDGREYEVIERIS-WQNGIISHYKYYVVRK 149 (152) Q Consensus 77 -gd~~~~~a~~~~n~~~l~~~~~~~~~~~~~~~~~e~~~n~d~v~~~G~~YeVi~~~~-wq~Gii~HykYl~Vr~ 149 (152) .++ ..--..+|+|.++|+.|.||...+ +..|..-+|+-. +|+ T Consensus 79 a~~~------------------------------~~~P~~gD~v~~~g~~~~Vi~v~~i~pa~~~v~y~lq-lRk 122 (122) T protein:vir:97 79 AEGL------------------------------TAPPEVEGLVLRGLEVWKVIAVKPLNPNGQAIMYELQ-VRQ 122 (122) T ss_pred eCCC------------------------------ccccccCCEEEeCCEEEEEEeccccCCCCceEEEEEE-eeC Confidence 000 011123599999999999999987 478888899865 455 No 7 >protein:vir:5977 Length: 109 # NCBI annotation: hypothetical protein # Family: family:all:788 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690677;genbank:geneid:6329133;genbank:gi:22855071;interpro:IPR013045;uniprot:O48446;genbank:GeneID:955315 Probab=94.75 E-value=0.001 Score=36.98 Aligned_cols=104 Identities=15% Similarity=0.076 Sum_probs=70.6 Q ss_pred CCCccccceeeEecCCCcccCCeecCCCCeeEEEEEEEEeecCCcceeeccccccceeeeEEeecccccccccccccccc Q lcl|NC_017981. 6 PFNKFRKDHTVIIVSDSYFKDGVIMPGERMYTKAKFSVQAIKNNEEIQGFAEGRRVSDWRRLYSDTKLPLAGDQLFISAS 85 (152) Q Consensus 6 ~~~~FRk~~~V~r~~~G~Yv~GrwV~G~~~~~ti~aSVQPi~d~e~~q~lpeGrRitd~~rIYTd~~L~vagd~~~~~a~ 85 (152) -|+.||+.-++.++....-..|-+++.-....++-|+|-|++..|.+.+-..- .....++.. T Consensus 1 ~~~~L~~RI~i~~~~~~~D~~G~~~~~w~~~~~~WA~v~~~sg~E~~~a~~~~--~~~~~~i~i---------------- 62 (109) T protein:vir:59 1 MYEEFPDVITFQSYVEQSNGEGGKTYKWVDEFTAAAHVQPISQEEYYKAQQLQ--TPIGYNIYT---------------- 62 (109) T ss_pred CccccCccEEEEeeeeeeCCCCCeeeeeEeeEEEEEEEecCChhheeeccccc--eeeEEEEEE---------------- Confidence 67899999888888776666688887766667899999999988777543322 222233333 Q ss_pred cccccccccccccccccccccccccCCCCCccEEEECCceEEEEEecchhhCcccceeeeeee-ecCC Q lcl|NC_017981. 86 VDVNKDTLATEDGKVIAVGAYIGEKEGMSNPALVVIDGREYEVIERISWQNGIISHYKYYVVR-KTYG 152 (152) Q Consensus 86 ~~~n~~~l~~~~~~~~~~~~~~~~~e~~~n~d~v~~~G~~YeVi~~~~wq~Gii~HykYl~Vr-~~d~ 152 (152) ||. ++-.+.+.|+|+|+.|.|.+..+=.++. +++|+++ +.+| T Consensus 63 -------------------Ry~---~~I~~~~Ri~~~gr~y~I~~v~~d~~~~---~~~l~~~~~e~g 105 (109) T protein:vir:59 63 -------------------PYD---DRIDKKMRVIYRGKIVTFIGDPVDLSGL---QEITRIKGKEDG 105 (109) T ss_pred -------------------eeC---CCCCcccEEEECCeEEEEEeccCCCCCC---eEEEEEEEEEee Confidence 221 1334568999999999999876544443 4466665 4455 No 8 >protein:vir:100244 Length: 109 # NCBI annotation: gp73 # Family: family:all:116 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355409;genbank:gi:77864699;genbank:GeneID:3725966 Probab=93.91 E-value=0.0032 Score=34.28 Aligned_cols=104 Identities=11% Similarity=0.086 Sum_probs=69.2 Q ss_pred CcCCCCCCccccceeeEecCCCcccCCeecCCC-CeeEEEEEEEEeecCCcceeeccccccceeeeEEeecccccccccc Q lcl|NC_017981. 1 MIGNGPFNKFRKDHTVIIVSDSYFKDGVIMPGE-RMYTKAKFSVQAIKNNEEIQGFAEGRRVSDWRRLYSDTKLPLAGDQ 79 (152) Q Consensus 1 ~~~~~~~~~FRk~~~V~r~~~G~Yv~GrwV~G~-~~~~ti~aSVQPi~d~e~~q~lpeGrRitd~~rIYTd~~L~vagd~ 79 (152) |+--|. |++.-++.++..+.-..|-++... ...-++-|+|.|++..|..++-......+..+.|=-. T Consensus 1 mm~~g~---L~~rI~i~~~~~~~d~~G~~~~~~w~~~~~~wA~i~~~~g~e~~~a~~~~~~~~~~i~iR~~--------- 68 (109) T protein:vir:10 1 MLRSSD---LTEFIVIERKGGRTNENGEPLPDDWVTHDEVWASVRFVSGKEHVISGAVRSSAIASIRIRFR--------- 68 (109) T ss_pred CCCccc---cCccEEEEeeeeccCCCCCeeccceeeEEEEEEEEEecCchheeeccceeeeeeEEEEEEec--------- Confidence 887765 556667777766666667776544 4456899999999988877654444433333333211 Q ss_pred cccccccccccccccccccccccccccccccCCCCCccEEEECCceEEEEEecchhhCcccceeeeeeeecCC Q lcl|NC_017981. 80 LFISASVDVNKDTLATEDGKVIAVGAYIGEKEGMSNPALVVIDGREYEVIERISWQNGIISHYKYYVVRKTYG 152 (152) Q Consensus 80 ~~~~a~~~~n~~~l~~~~~~~~~~~~~~~~~e~~~n~d~v~~~G~~YeVi~~~~wq~Gii~HykYl~Vr~~d~ 152 (152) .+-.+.+.++|+|+.|+|....+ . + .-+||.+.-..| T Consensus 69 -------------------------------~~I~~~~ri~~~g~~y~I~~v~~-~-~---~~~~l~i~c~eg 105 (109) T protein:vir:10 69 -------------------------------EDIDSEMRIRYGDQLYDIVAVLP-N-R---RKGSLDLPVKVG 105 (109) T ss_pred -------------------------------CCCCcccEEEECCeEEEEEeecc-C-C---CCcEEEEEEEee Confidence 12344589999999999999865 3 2 236777777777 No 9 >protein:vir:81177 Length: 109 # NCBI annotation: putative head-tail adaptor # Family: family:all:116 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285814;genbank:gi:148747735;genbank:GeneID:5247220 Probab=92.62 E-value=0.0062 Score=32.71 Aligned_cols=103 Identities=12% Similarity=0.152 Sum_probs=65.8 Q ss_pred cCCCCCCccccceeeEecCCCcccCCeecCCCCeeEEEEEEEEeecCCcceeeccccccceeeeEEeecccccccccccc Q lcl|NC_017981. 2 IGNGPFNKFRKDHTVIIVSDSYFKDGVIMPGERMYTKAKFSVQAIKNNEEIQGFAEGRRVSDWRRLYSDTKLPLAGDQLF 81 (152) Q Consensus 2 ~~~~~~~~FRk~~~V~r~~~G~Yv~GrwV~G~~~~~ti~aSVQPi~d~e~~q~lpeGrRitd~~rIYTd~~L~vagd~~~ 81 (152) +- ...+|+.-++.++.++.-..|-+++.....-++-|.|.|++..|.+++....- ....++.. T Consensus 1 M~---~g~L~~rI~i~~~~~~~d~~G~~~~~w~~~~~~wA~v~~~s~~e~~~a~~~~~--~~~~~f~i------------ 63 (109) T protein:vir:81 1 MN---PGQFRHKITLMKLVTTQDEIGNTIEEWQPVRTCWAAIKTVNGREYFAAASVQA--ERTYRFII------------ 63 (109) T ss_pred CC---ccccCccEEEEeeeeeeCCCCCeecceeeEEEEEEEEEecCchheeeccceee--eeeEEEEE------------ Confidence 22 34577777787877776667888888777778999999999887775533333 33333332 Q ss_pred cccccccccccccccccccccccccccccCCCCCccEEEECCceEEEEEecch-hhCcccceeeeeeee-----cCC Q lcl|NC_017981. 82 ISASVDVNKDTLATEDGKVIAVGAYIGEKEGMSNPALVVIDGREYEVIERISW-QNGIISHYKYYVVRK-----TYG 152 (152) Q Consensus 82 ~~a~~~~n~~~l~~~~~~~~~~~~~~~~~e~~~n~d~v~~~G~~YeVi~~~~w-q~Gii~HykYl~Vr~-----~d~ 152 (152) |+. .+..+.+.|+|+|+.|+|..+.+= +.. +||.+.- ++| T Consensus 64 -----------------------R~~---~~i~~~~ri~~~g~~y~I~~v~~~~~~~-----~~l~i~~~e~~~~~g 109 (109) T protein:vir:81 64 -----------------------RYT---PGINETMKIDYQGRLFDIQSVLNDDEGK-----KTLTIIATERVAADG 109 (109) T ss_pred -----------------------EeC---CCCCcccEEEECCeEEEEEeecCCccCC-----cEEEEEEEEeecCCC Confidence 111 123456899999999999998653 222 4444332 333 No 10 >protein:vir:79639 Length: 123 # NCBI annotation: gp39 # Family: family:all:704 # MgeID: mge:1872 # MgeName: TLS # Cross-refs: genbank:acc:YP_001285528;genbank:gi:148734511;genbank:GeneID:5219997 Probab=92.54 E-value=0.00064 Score=38.08 Aligned_cols=104 Identities=20% Similarity=0.247 Sum_probs=69.5 Q ss_pred CcCCCCCCccccceeeEecCCCcccCCeecCCCCeeEEEEEEEEeecCCcceeeccccccceeeeEEeeccccccccccc Q lcl|NC_017981. 1 MIGNGPFNKFRKDHTVIIVSDSYFKDGVIMPGERMYTKAKFSVQAIKNNEEIQGFAEGRRVSDWRRLYSDTKLPLAGDQL 80 (152) Q Consensus 1 ~~~~~~~~~FRk~~~V~r~~~G~Yv~GrwV~G~~~~~ti~aSVQPi~d~e~~q~lpeGrRitd~~rIYTd~~L~vagd~~ 80 (152) -=|||+| .++-+++-++.++|.-++-.++..++++-++-.+-.+. .|..| .+||+. T Consensus 18 sd~~G~~------~~~t~~g~~~dv~G~ev~~p~~~~~~~Gv~~~~~~reI-----Dg~lI-------------q~gDvk 73 (123) T protein:vir:79 18 SDGTGEF------DCITQPGSVEIVGGIEVEKPEIKVKIKGLVRAPRTREV-----DGEVI-------------RVTDKL 73 (123) T ss_pred ccCCCce------eeeecCcceeecCCeeccccceEEeEEEEeecCCcccc-----CCeeE-------------EeccEE Confidence 2345554 68888888999999999888888888888887664433 22222 223322 Q ss_pred ccccccccccccccccccccccccccccccCCCCCccEEEECCceEEEEEecchh-hCcccceeeeeeeecCC Q lcl|NC_017981. 81 FISASVDVNKDTLATEDGKVIAVGAYIGEKEGMSNPALVVIDGREYEVIERISWQ-NGIISHYKYYVVRKTYG 152 (152) Q Consensus 81 ~~~a~~~~n~~~l~~~~~~~~~~~~~~~~~e~~~n~d~v~~~G~~YeVi~~~~wq-~Gii~HykYl~Vr~~d~ 152 (152) .+-++ .-...|+|++.+||+.|.||...+|+ .++.=-|+..+=|-+-+ T Consensus 74 ~i~~a------------------------~veik~Gd~i~vDge~~rVV~~~pvkPa~~~I~y~~qLRrv~~~ 122 (123) T protein:vir:79 74 GVFNA------------------------DVELKNGYQIDIDGERYVMVETRPIRPTSITVAYRPIMRRVAVH 122 (123) T ss_pred EEEec------------------------ceeeccCCEEEECCeEEEEecCccccchhhhhhhhhhhcccccC Confidence 22110 01337899999999999999999997 55555666666555444 No 11 >protein:vir:1436 Length: 108 # NCBI annotation: hypothetical protein # Family: family:all:116 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536365;genbank:gi:17975170;genbank:GeneID:929148 Probab=92.19 E-value=0.0084 Score=31.96 Aligned_cols=103 Identities=15% Similarity=0.182 Sum_probs=70.3 Q ss_pred cCCCCCCccccceeeEecCCCcccCCeecCCC-CeeEEEEEEEEeecCCcceeeccccccceeeeEEeeccccccccccc Q lcl|NC_017981. 2 IGNGPFNKFRKDHTVIIVSDSYFKDGVIMPGE-RMYTKAKFSVQAIKNNEEIQGFAEGRRVSDWRRLYSDTKLPLAGDQL 80 (152) Q Consensus 2 ~~~~~~~~FRk~~~V~r~~~G~Yv~GrwV~G~-~~~~ti~aSVQPi~d~e~~q~lpeGrRitd~~rIYTd~~L~vagd~~ 80 (152) +- .-.+|+.-++.++..+.-..|-+++.. ...-++-|+|.|++..|.+++-..+-.++.-+.|=-. T Consensus 1 M~---~G~L~~rI~i~~~~~~~d~~G~~~~~~w~~~~~~wA~i~~~~g~e~~~a~~~~~~~t~~i~iR~~---------- 67 (108) T protein:vir:14 1 ME---AGKLKERIVIERPSGETNENDEPIPGAWVVHARPWADVRFLNGKEHVISGAVRGATVASMRIRYR---------- 67 (108) T ss_pred CC---ccccCccEEEEeeeeccCCCCCeeccceeeEEEEEEEEEecCchheeeccceeeeeeEEEEEEec---------- Confidence 22 335777878888877776678888765 4466899999999988887765544444443333221 Q ss_pred ccccccccccccccccccccccccccccccCCCCCccEEEECCceEEEEEecchhhCcccceeeeeeeecCC Q lcl|NC_017981. 81 FISASVDVNKDTLATEDGKVIAVGAYIGEKEGMSNPALVVIDGREYEVIERISWQNGIISHYKYYVVRKTYG 152 (152) Q Consensus 81 ~~~a~~~~n~~~l~~~~~~~~~~~~~~~~~e~~~n~d~v~~~G~~YeVi~~~~wq~Gii~HykYl~Vr~~d~ 152 (152) .+-.+.+.|+|+|+.|+|..+.+. + .-+||.+.-..| T Consensus 68 ------------------------------~~I~~~~ri~~~g~~y~I~~v~~~-~----~~~~l~i~~~~~ 104 (108) T protein:vir:14 68 ------------------------------AGIGDQMRIRYDGRLYDITAVLPA-R----KRGYLDLSVKVG 104 (108) T ss_pred ------------------------------CCCCcccEEEECCeEEEEEeeccC-C----CCCEEEEEEEee Confidence 123446899999999999998764 2 236777777777 No 12 >protein:vir:100134 Length: 109 # NCBI annotation: gp8 # Family: family:all:116 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945038;genbank:gi:38707898;genbank:GeneID:2744181 Probab=92.16 E-value=0.01 Score=31.45 Aligned_cols=107 Identities=14% Similarity=0.107 Sum_probs=69.8 Q ss_pred CcCCCCCCccccceeeEecCCCcccCCeecCCCCe-eEEEEEEEEeecCCcceeeccccccceeeeEEeecccccccccc Q lcl|NC_017981. 1 MIGNGPFNKFRKDHTVIIVSDSYFKDGVIMPGERM-YTKAKFSVQAIKNNEEIQGFAEGRRVSDWRRLYSDTKLPLAGDQ 79 (152) Q Consensus 1 ~~~~~~~~~FRk~~~V~r~~~G~Yv~GrwV~G~~~-~~ti~aSVQPi~d~e~~q~lpeGrRitd~~rIYTd~~L~vagd~ 79 (152) |+--|. ||+.-++.++....-..|-.++++-+ .-++-|+|.|++..|..++-......+..+.|--.. T Consensus 1 mm~~G~---L~~rI~i~~~~~~~d~~G~~~~~~w~~~~~~wA~v~~~s~~e~~~a~~~~~~~~~~~~iR~~~-------- 69 (109) T protein:vir:10 1 MLKAGE---LTERITIEKRGGGVNENGEPLPGDWVEHASVWANVRFLSGKEYVVSGAIHSSAIASMRIRFRR-------- 69 (109) T ss_pred CCCccc---cCccEEEEeeeeeeCCCCCeeccceEEEEEEEEEEEecCchheeeccceeeeeEEEEEEEeCC-------- Confidence 887775 56666777776666566777777544 457999999999988887666555555544444321 Q ss_pred cccccccccccccccccccccccccccccccCCCCCccEEEECCceEEEEEecchhhCcccceeeeeeeecCC Q lcl|NC_017981. 80 LFISASVDVNKDTLATEDGKVIAVGAYIGEKEGMSNPALVVIDGREYEVIERISWQNGIISHYKYYVVRKTYG 152 (152) Q Consensus 80 ~~~~a~~~~n~~~l~~~~~~~~~~~~~~~~~e~~~n~d~v~~~G~~YeVi~~~~wq~Gii~HykYl~Vr~~d~ 152 (152) +-...+.++|+|+.|+|.+..+ + .-=+|..-.|....-+ T Consensus 70 --------------------------------~I~~~~ri~~~g~~y~I~~v~~-d-~~~~~~~l~~~~~e~~ 108 (109) T protein:vir:10 70 --------------------------------DVDSEMRIRHDGRLYDIAAVLP-N-RRQGYVDLSVKVGEKY 108 (109) T ss_pred --------------------------------CCCcccEEEECCeEEEEeecCC-C-CCCCeEEEEEEEEEee Confidence 2345689999999999999764 2 2223333444333334 No 13 >protein:vir:106729 Length: 152 # NCBI annotation: gp06 # Family: family:all:3177 # MgeID: mge:1599 # MgeName: Bcep1 # Cross-refs: genbank:acc:NP_944314;genbank:gi:38638613;genbank:GeneID:2657358 Probab=92.13 E-value=0.0034 Score=34.14 Aligned_cols=109 Identities=17% Similarity=0.233 Sum_probs=66.3 Q ss_pred CcC-CCCCCccccceeeEecCCCccc-CCeecCCCCeeEEEEEEEEeecCCcceeecccccc-ceeeeEEeecccccccc Q lcl|NC_017981. 1 MIG-NGPFNKFRKDHTVIIVSDSYFK-DGVIMPGERMYTKAKFSVQAIKNNEEIQGFAEGRR-VSDWRRLYSDTKLPLAG 77 (152) Q Consensus 1 ~~~-~~~~~~FRk~~~V~r~~~G~Yv-~GrwV~G~~~~~ti~aSVQPi~d~e~~q~lpeGrR-itd~~rIYTd~~L~vag 77 (152) .|+ ..|+..+ .+++|.|+-. +|.-++..+ ++.+...+||+|. ++++-+ .|-. -...+.||.+-.. T Consensus 10 aI~aVNP~~pA-----~l~~stG~t~~~G~r~p~Y~-~~~v~vQ~QalS~-~dL~h~-dglnqQG~~~~iY~~Gn~---- 77 (152) T protein:vir:10 10 AITQVNPDEPG-----TMFVSTGRNNVRGILTPTFS-SVDAQLQIQAQKH-TPLQHE-RGALYTNSFLTVYAYGKF---- 77 (152) T ss_pred hhhccCCCCce-----EEEEeccceecCceecceec-cceeEEEEeecCc-hHHHHh-hcccccceeeEEEeccch---- Confidence 222 2444432 4455777643 365555544 4677788899984 444332 2222 2357889995322 Q ss_pred cccccccccccccccccccccccccccccccccCCCCCccEEEECCceEEEEEecchhhCcccceeeeeeeecCC Q lcl|NC_017981. 78 DQLFISASVDVNKDTLATEDGKVIAVGAYIGEKEGMSNPALVVIDGREYEVIERISWQNGIISHYKYYVVRKTYG 152 (152) Q Consensus 78 d~~~~~a~~~~n~~~l~~~~~~~~~~~~~~~~~e~~~n~d~v~~~G~~YeVi~~~~wq~Gii~HykYl~Vr~~d~ 152 (152) |.+.- ....-+|+|+|+|+++.|+++..|= =+--|-++.|+.|- T Consensus 78 -------------~gv~R---------------~~~qGGD~~vf~g~~WLVv~v~E~W---pDWc~V~v~lQ~~a 121 (152) T protein:vir:10 78 -------------DDLSR---------------PLGKGGDFAAFRGGWWYITQFLEWW---PDWCAFEVTQQLNA 121 (152) T ss_pred -------------hheec---------------hhhcCccEEEECCceEEEEEccccc---ccceeeeeeeccCh Confidence 22211 3445689999999999999998763 22667778888887 No 14 >protein:vir:80342 Length: 108 # NCBI annotation: gp9, phage head-tail adaptor, putative # Family: family:all:116 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111088;genbank:gi:134288641;genbank:GeneID:4960589 Probab=92.08 E-value=0.0089 Score=31.81 Aligned_cols=103 Identities=16% Similarity=0.184 Sum_probs=70.3 Q ss_pred cCCCCCCccccceeeEecCCCcccCCeecCCC-CeeEEEEEEEEeecCCcceeeccccccceeeeEEeeccccccccccc Q lcl|NC_017981. 2 IGNGPFNKFRKDHTVIIVSDSYFKDGVIMPGE-RMYTKAKFSVQAIKNNEEIQGFAEGRRVSDWRRLYSDTKLPLAGDQL 80 (152) Q Consensus 2 ~~~~~~~~FRk~~~V~r~~~G~Yv~GrwV~G~-~~~~ti~aSVQPi~d~e~~q~lpeGrRitd~~rIYTd~~L~vagd~~ 80 (152) +-- -.+|+.-++.++.+..-..|-+++.. ...-++-|+|.|++..|...+-..+-..+.-+.|--.. T Consensus 1 M~~---G~L~~rI~i~~~~~~~d~~G~~~~~~w~~~~~~wA~v~~~~~~e~~~a~~~~~~~~~~i~iR~~~--------- 68 (108) T protein:vir:80 1 MKT---GKLKERIVIERPSGETNENDEPIPGAWIVHARPWADVLFLNGKEHVISGAVRGATIASMRIRYRA--------- 68 (108) T ss_pred CCc---cccCccEEEEeeeeccCCCCCeeccceeeEEEEEEEEEecCchheeeccceeeeeeEEEEEEecC--------- Confidence 222 35777878888877666678888765 44568999999999888877655544444444443221 Q ss_pred ccccccccccccccccccccccccccccccCCCCCccEEEECCceEEEEEecchhhCcccceeeeeeeecCC Q lcl|NC_017981. 81 FISASVDVNKDTLATEDGKVIAVGAYIGEKEGMSNPALVVIDGREYEVIERISWQNGIISHYKYYVVRKTYG 152 (152) Q Consensus 81 ~~~a~~~~n~~~l~~~~~~~~~~~~~~~~~e~~~n~d~v~~~G~~YeVi~~~~wq~Gii~HykYl~Vr~~d~ 152 (152) +-.+.+.|+|+|+.|+|....+ . +.-+||.+.-..| T Consensus 69 -------------------------------~I~~~~Ri~~~g~~y~I~~v~~-~----~~~~~l~i~~~e~ 104 (108) T protein:vir:80 69 -------------------------------GIDEQMRVRYDGRLYDITAVLP-A----RKRGYLDLSVKVG 104 (108) T ss_pred -------------------------------CCCcccEEEECCeEEEEEeecc-C----CCCCEEEEEEEee Confidence 2234578999999999998865 2 2346777777777 No 15 >protein:vir:78609 Length: 152 # NCBI annotation: BcepNY3gp05 # Family: family:all:3177 # MgeID: mge:1854 # MgeName: BcepNY3 # Cross-refs: genbank:acc:YP_001294842;genbank:gi:149882905;genbank:GeneID:5291080 Probab=91.96 E-value=0.0033 Score=34.20 Aligned_cols=109 Identities=17% Similarity=0.234 Sum_probs=66.0 Q ss_pred CcC-CCCCCccccceeeEecCCCccc-CCeecCCCCeeEEEEEEEEeecCCcceeecccccc-ceeeeEEeecccccccc Q lcl|NC_017981. 1 MIG-NGPFNKFRKDHTVIIVSDSYFK-DGVIMPGERMYTKAKFSVQAIKNNEEIQGFAEGRR-VSDWRRLYSDTKLPLAG 77 (152) Q Consensus 1 ~~~-~~~~~~FRk~~~V~r~~~G~Yv-~GrwV~G~~~~~ti~aSVQPi~d~e~~q~lpeGrR-itd~~rIYTd~~L~vag 77 (152) .|+ ..|+..+ .+++|.|+-. +|.-++..+ +..+...+||+|. ++++-+ .|-. -...+.||.+-.. T Consensus 10 aI~aVNP~~~A-----~l~~stG~T~~~G~r~P~Y~-~~~v~vQ~QalS~-~dL~h~-dglnqQG~~~~iY~~Gn~---- 77 (152) T protein:vir:78 10 AITQVNPDEAG-----TMFVSTGRTNVRGILTPTFS-SIDAQLQIQAQKH-TPLQHE-RGALYTNSFLTVYAYGKF---- 77 (152) T ss_pred hhhccCCCCce-----EEEEeeceEcCCCcccceec-ceeeEEEEeecCc-hHHHHh-hcccccceeeEEEeccch---- Confidence 222 3454432 4455777543 365444444 4667788888884 444332 2222 2357889985322 Q ss_pred cccccccccccccccccccccccccccccccccCCCCCccEEEECCceEEEEEecchhhCcccceeeeeeeecCC Q lcl|NC_017981. 78 DQLFISASVDVNKDTLATEDGKVIAVGAYIGEKEGMSNPALVVIDGREYEVIERISWQNGIISHYKYYVVRKTYG 152 (152) Q Consensus 78 d~~~~~a~~~~n~~~l~~~~~~~~~~~~~~~~~e~~~n~d~v~~~G~~YeVi~~~~wq~Gii~HykYl~Vr~~d~ 152 (152) |.+.- ....-+|+|+|+|+++.|+++..|= =+--|-++.|+.|- T Consensus 78 -------------~gv~R---------------~~~qGGD~~vf~g~~WLVv~v~E~W---pDWc~V~v~lQ~~a 121 (152) T protein:vir:78 78 -------------DDLSR---------------PLGKGGDFAAFRGGWWYITQFLEWW---PDWCAFEVTQQLNA 121 (152) T ss_pred -------------hheec---------------hhhcCccEEEECCceEEEEEccccc---ccceeeeeeeccCh Confidence 22211 3445689999999999999998763 22667778888887 No 16 >protein:vir:107606 Length: 107 # NCBI annotation: head-tail adaptor protein # Family: family:all:116 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338190;genbank:gi:77020176;genbank:GeneID:3703737 Probab=91.84 E-value=0.011 Score=31.39 Aligned_cols=104 Identities=15% Similarity=0.095 Sum_probs=67.2 Q ss_pred CCCCccccceeeEecCCC-cccCCeecCCCCeeEEEEEEEEeecCCcceeeccccccceeeeEEeecccccccccccccc Q lcl|NC_017981. 5 GPFNKFRKDHTVIIVSDS-YFKDGVIMPGERMYTKAKFSVQAIKNNEEIQGFAEGRRVSDWRRLYSDTKLPLAGDQLFIS 83 (152) Q Consensus 5 ~~~~~FRk~~~V~r~~~G-~Yv~GrwV~G~~~~~ti~aSVQPi~d~e~~q~lpeGrRitd~~rIYTd~~L~vagd~~~~~ 83 (152) =..-.+|+.-++..+... .-..|-.++.-...-++-|+|.|++..|.+++-.....++--++|=-. T Consensus 1 M~~G~L~~rI~i~~~~~~~~d~~G~~~~~w~~~~~~wA~v~~~sg~e~~~a~~~~~~~t~~i~iR~~------------- 67 (107) T protein:vir:10 1 MNPAKLDKRLTFQVKDENAKGPDGDPIDGYKDAFTVWGSFVYLKGRKYFEAAAANSEVQGETEIRNR------------- 67 (107) T ss_pred CCccccCccEEEEeceeeccCCCCccccceEeeEEEEEEEEecCchhheeccceeeeeeEEEEEEec------------- Confidence 122356666566555432 234577777766667899999999988887665444444433333221 Q ss_pred cccccccccccccccccccccccccccCCCCCccEEEECCceEEEEEecchhhCcccceeeee--eeecCC Q lcl|NC_017981. 84 ASVDVNKDTLATEDGKVIAVGAYIGEKEGMSNPALVVIDGREYEVIERISWQNGIISHYKYYV--VRKTYG 152 (152) Q Consensus 84 a~~~~n~~~l~~~~~~~~~~~~~~~~~e~~~n~d~v~~~G~~YeVi~~~~wq~Gii~HykYl~--Vr~~d~ 152 (152) .+-.+.+.|+|+|+.|+|.++.+.+. |..++. -..+|| T Consensus 68 ---------------------------~~I~~~~ri~~~g~~y~I~~v~~~~~----~~~~l~~~~~~~~G 107 (107) T protein:vir:10 68 ---------------------------DDVSADMKIKYKNVIYDIVSVIPTQD----HTLLIMWKRGEMNG 107 (107) T ss_pred ---------------------------CCCCcccEEEECCeEEEEEeecCCCC----CcEEEEEEEeecCC Confidence 13345689999999999999988753 555544 455777 No 17 >protein:vir:105006 Length: 107 # NCBI annotation: putative head-tail adaptor protein # Family: family:all:116 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459971;genbank:gi:85701386;genbank:GeneID:3882147 Probab=91.84 E-value=0.011 Score=31.39 Aligned_cols=104 Identities=15% Similarity=0.095 Sum_probs=67.2 Q ss_pred CCCCccccceeeEecCCC-cccCCeecCCCCeeEEEEEEEEeecCCcceeeccccccceeeeEEeecccccccccccccc Q lcl|NC_017981. 5 GPFNKFRKDHTVIIVSDS-YFKDGVIMPGERMYTKAKFSVQAIKNNEEIQGFAEGRRVSDWRRLYSDTKLPLAGDQLFIS 83 (152) Q Consensus 5 ~~~~~FRk~~~V~r~~~G-~Yv~GrwV~G~~~~~ti~aSVQPi~d~e~~q~lpeGrRitd~~rIYTd~~L~vagd~~~~~ 83 (152) =..-.+|+.-++..+... .-..|-.++.-...-++-|+|.|++..|.+++-.....++--++|=-. T Consensus 1 M~~G~L~~rI~i~~~~~~~~d~~G~~~~~w~~~~~~wA~v~~~sg~e~~~a~~~~~~~t~~i~iR~~------------- 67 (107) T protein:vir:10 1 MNPAKLDKRLTFQVKDENAKGPDGDPIDGYKDAFTVWGSFVYLKGRKYFEAAAANSEVQGETEIRNR------------- 67 (107) T ss_pred CCccccCccEEEEeceeeccCCCCccccceEeeEEEEEEEEecCchhheeccceeeeeeEEEEEEec------------- Confidence 122356666566555432 234577777766667899999999988887665444444433333221 Q ss_pred cccccccccccccccccccccccccccCCCCCccEEEECCceEEEEEecchhhCcccceeeee--eeecCC Q lcl|NC_017981. 84 ASVDVNKDTLATEDGKVIAVGAYIGEKEGMSNPALVVIDGREYEVIERISWQNGIISHYKYYV--VRKTYG 152 (152) Q Consensus 84 a~~~~n~~~l~~~~~~~~~~~~~~~~~e~~~n~d~v~~~G~~YeVi~~~~wq~Gii~HykYl~--Vr~~d~ 152 (152) .+-.+.+.|+|+|+.|+|.++.+.+. |..++. -..+|| T Consensus 68 ---------------------------~~I~~~~ri~~~g~~y~I~~v~~~~~----~~~~l~~~~~~~~G 107 (107) T protein:vir:10 68 ---------------------------DDVSADMKIKYKNVIYDIVSVIPTQD----HTLLIMWKRGEMNG 107 (107) T ss_pred ---------------------------CCCCcccEEEECCeEEEEEeecCCCC----CcEEEEEEEeecCC Confidence 13345689999999999999988753 555544 455777 No 18 >protein:vir:102084 Length: 107 # NCBI annotation: head-tail adaptor # Family: family:all:116 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512317;genbank:gi:89152486;genbank:GeneID:3953077 Probab=91.84 E-value=0.011 Score=31.39 Aligned_cols=104 Identities=15% Similarity=0.095 Sum_probs=67.2 Q ss_pred CCCCccccceeeEecCCC-cccCCeecCCCCeeEEEEEEEEeecCCcceeeccccccceeeeEEeecccccccccccccc Q lcl|NC_017981. 5 GPFNKFRKDHTVIIVSDS-YFKDGVIMPGERMYTKAKFSVQAIKNNEEIQGFAEGRRVSDWRRLYSDTKLPLAGDQLFIS 83 (152) Q Consensus 5 ~~~~~FRk~~~V~r~~~G-~Yv~GrwV~G~~~~~ti~aSVQPi~d~e~~q~lpeGrRitd~~rIYTd~~L~vagd~~~~~ 83 (152) =..-.+|+.-++..+... .-..|-.++.-...-++-|+|.|++..|.+++-.....++--++|=-. T Consensus 1 M~~G~L~~rI~i~~~~~~~~d~~G~~~~~w~~~~~~wA~v~~~sg~e~~~a~~~~~~~t~~i~iR~~------------- 67 (107) T protein:vir:10 1 MNPAKLDKRLTFQVKDENAKGPDGDPIDGYKDAFTVWGSFVYLKGRKYFEAAAANSEVQGETEIRNR------------- 67 (107) T ss_pred CCccccCccEEEEeceeeccCCCCccccceEeeEEEEEEEEecCchhheeccceeeeeeEEEEEEec------------- Confidence 122356666566555432 234577777766667899999999988887665444444433333221 Q ss_pred cccccccccccccccccccccccccccCCCCCccEEEECCceEEEEEecchhhCcccceeeee--eeecCC Q lcl|NC_017981. 84 ASVDVNKDTLATEDGKVIAVGAYIGEKEGMSNPALVVIDGREYEVIERISWQNGIISHYKYYV--VRKTYG 152 (152) Q Consensus 84 a~~~~n~~~l~~~~~~~~~~~~~~~~~e~~~n~d~v~~~G~~YeVi~~~~wq~Gii~HykYl~--Vr~~d~ 152 (152) .+-.+.+.|+|+|+.|+|.++.+.+. |..++. -..+|| T Consensus 68 ---------------------------~~I~~~~ri~~~g~~y~I~~v~~~~~----~~~~l~~~~~~~~G 107 (107) T protein:vir:10 68 ---------------------------DDVSADMKIKYKNVIYDIVSVIPTQD----HTLLIMWKRGEMNG 107 (107) T ss_pred ---------------------------CCCCcccEEEECCeEEEEEeecCCCC----CcEEEEEEEeecCC Confidence 13345689999999999999988753 555544 455777 No 19 >protein:vir:102856 Length: 107 # NCBI annotation: head-tail adaptor protein # Family: family:all:116 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338139;genbank:gi:77020229;genbank:GeneID:3703765 Probab=91.84 E-value=0.011 Score=31.39 Aligned_cols=104 Identities=15% Similarity=0.095 Sum_probs=67.2 Q ss_pred CCCCccccceeeEecCCC-cccCCeecCCCCeeEEEEEEEEeecCCcceeeccccccceeeeEEeecccccccccccccc Q lcl|NC_017981. 5 GPFNKFRKDHTVIIVSDS-YFKDGVIMPGERMYTKAKFSVQAIKNNEEIQGFAEGRRVSDWRRLYSDTKLPLAGDQLFIS 83 (152) Q Consensus 5 ~~~~~FRk~~~V~r~~~G-~Yv~GrwV~G~~~~~ti~aSVQPi~d~e~~q~lpeGrRitd~~rIYTd~~L~vagd~~~~~ 83 (152) =..-.+|+.-++..+... .-..|-.++.-...-++-|+|.|++..|.+++-.....++--++|=-. T Consensus 1 M~~G~L~~rI~i~~~~~~~~d~~G~~~~~w~~~~~~wA~v~~~sg~e~~~a~~~~~~~t~~i~iR~~------------- 67 (107) T protein:vir:10 1 MNPAKLDKRLTFQVKDENAKGPDGDPIDGYKDAFTVWGSFVYLKGRKYFEAAAANSEVQGETEIRNR------------- 67 (107) T ss_pred CCccccCccEEEEeceeeccCCCCccccceEeeEEEEEEEEecCchhheeccceeeeeeEEEEEEec------------- Confidence 122356666566555432 234577777766667899999999988887665444444433333221 Q ss_pred cccccccccccccccccccccccccccCCCCCccEEEECCceEEEEEecchhhCcccceeeee--eeecCC Q lcl|NC_017981. 84 ASVDVNKDTLATEDGKVIAVGAYIGEKEGMSNPALVVIDGREYEVIERISWQNGIISHYKYYV--VRKTYG 152 (152) Q Consensus 84 a~~~~n~~~l~~~~~~~~~~~~~~~~~e~~~n~d~v~~~G~~YeVi~~~~wq~Gii~HykYl~--Vr~~d~ 152 (152) .+-.+.+.|+|+|+.|+|.++.+.+. |..++. -..+|| T Consensus 68 ---------------------------~~I~~~~ri~~~g~~y~I~~v~~~~~----~~~~l~~~~~~~~G 107 (107) T protein:vir:10 68 ---------------------------DDVSADMKIKYKNVIYDIVSVIPTQD----HTLLIMWKRGEMNG 107 (107) T ss_pred ---------------------------CCCCcccEEEECCeEEEEEeecCCCC----CcEEEEEEEeecCC Confidence 13345689999999999999988753 555544 455777 No 20 >protein:vir:94034 Length: 141 # NCBI annotation: hypothetical protein # Family: family:all:3177 # MgeID: mge:1493 # MgeName: OP2 # Cross-refs: genbank:acc:YP_453621;genbank:gi:84662657;genbank:GeneID:5142544 Probab=90.30 E-value=0.0054 Score=33.02 Aligned_cols=110 Identities=16% Similarity=0.222 Sum_probs=63.3 Q ss_pred CcC-CCCCCccccceeeEecCCCccc-CCeecCCCCeeEEEEEEEEeecCCcceeeccccccc-eeeeEEeecccccccc Q lcl|NC_017981. 1 MIG-NGPFNKFRKDHTVIIVSDSYFK-DGVIMPGERMYTKAKFSVQAIKNNEEIQGFAEGRRV-SDWRRLYSDTKLPLAG 77 (152) Q Consensus 1 ~~~-~~~~~~FRk~~~V~r~~~G~Yv-~GrwV~G~~~~~ti~aSVQPi~d~e~~q~lpeGrRi-td~~rIYTd~~L~vag 77 (152) .|+ ..|+..+ .++++.|+-. +|.=+|..+.....+.-+||+|- ++++-+ .|-.+ ...+.||.+-.. T Consensus 13 aI~aVNP~~pa-----~l~~stG~t~~~G~~~P~y~~~~a~~iQ~QalS~-~dL~h~-dgln~QG~~~~iY~~G~~---- 81 (141) T protein:vir:94 13 PIQVVNPDVPG-----DVYISTGHTTLRGIVTPTFQRLPAQRLQVQAVTT-NDLYQL-NGLGYAKDTQKLYAYGTL---- 81 (141) T ss_pred hhcccCCCCce-----EEEEeeccEecCCceEeeeecccceEEEeeccCh-hHHHHh-hcccccceeeEEEeccch---- Confidence 222 2444432 3455777554 24434444333345667788873 334332 33323 357889985322 Q ss_pred cccccccccccccccccccccccccccccccccCCCCCccEEEECCceEEEEEecchhhCcccceeeeeeeecCC Q lcl|NC_017981. 78 DQLFISASVDVNKDTLATEDGKVIAVGAYIGEKEGMSNPALVVIDGREYEVIERISWQNGIISHYKYYVVRKTYG 152 (152) Q Consensus 78 d~~~~~a~~~~n~~~l~~~~~~~~~~~~~~~~~e~~~n~d~v~~~G~~YeVi~~~~wq~Gii~HykYl~Vr~~d~ 152 (152) |.+.- +...-+|+|+|+|+++.|+++..|=. +--|-++.|+.+- T Consensus 82 -------------~gv~R---------------~~~~GGD~~vf~g~~WLV~~v~E~Wp---DWc~v~v~lQ~~~ 125 (141) T protein:vir:94 82 -------------SGIVR---------------PEGKGGDLVNLANTWWAIQGVIEWWP---QWCSVAITRQVDA 125 (141) T ss_pred -------------hheec---------------hhhcCccEEEECCceEEEEEcccccc---cceeEeeeeccCh Confidence 22211 34456899999999999999987621 2567777788777 No 21 >protein:vir:101560 Length: 152 # NCBI annotation: gp06 # Family: family:all:3177 # MgeID: mge:1477 # MgeName: Bcep43 # Cross-refs: genbank:acc:NP_958110;genbank:gi:41057656;genbank:GeneID:2716817 Probab=89.86 E-value=0.0041 Score=33.65 Aligned_cols=109 Identities=17% Similarity=0.229 Sum_probs=63.5 Q ss_pred CcC-CCCCCccccceeeEecCCCccc-CCeecCCCCeeEEEEEEEEeecCCcceeecccccc-ceeeeEEeecccccccc Q lcl|NC_017981. 1 MIG-NGPFNKFRKDHTVIIVSDSYFK-DGVIMPGERMYTKAKFSVQAIKNNEEIQGFAEGRR-VSDWRRLYSDTKLPLAG 77 (152) Q Consensus 1 ~~~-~~~~~~FRk~~~V~r~~~G~Yv-~GrwV~G~~~~~ti~aSVQPi~d~e~~q~lpeGrR-itd~~rIYTd~~L~vag 77 (152) .|+ ..|+..+ .+++|.|+-. +|.-++..+ ++++...+||+|. ++++-+ .|-. -...+.||.+-.. T Consensus 10 aI~aVNP~~pA-----~l~~stG~T~~~G~r~P~Y~-~~~v~vQ~QalS~-~dL~h~-dglnqQG~~~~iY~~Gn~---- 77 (152) T protein:vir:10 10 AITQVNPDEPG-----TMFVSTGRTNVRGILTPMFS-SVNAQLQIQAQKH-TPLQHE-RGALYTNSFLTVYAYGKF---- 77 (152) T ss_pred hhcccCCCCce-----EEEEeeceEcCCCcccceec-ceeeEEEEeecCc-hHHHHh-hcccccceeeEeEeccch---- Confidence 222 3454432 3455777532 366555555 4778888899984 444332 2222 2357889985322 Q ss_pred cccccccccccccccccccccccccccccccccCCCCCccEEEECCceEEEEEecchhhCcccceeeeeeeecCC Q lcl|NC_017981. 78 DQLFISASVDVNKDTLATEDGKVIAVGAYIGEKEGMSNPALVVIDGREYEVIERISWQNGIISHYKYYVVRKTYG 152 (152) Q Consensus 78 d~~~~~a~~~~n~~~l~~~~~~~~~~~~~~~~~e~~~n~d~v~~~G~~YeVi~~~~wq~Gii~HykYl~Vr~~d~ 152 (152) |.+.- ....-+|+|+|+|+++.|+++..|= =+--|-++.|+-|- T Consensus 78 -------------~gv~R---------------~~~qGGD~~vf~g~~WLVv~v~E~W---pDWc~V~v~lQ~~a 121 (152) T protein:vir:10 78 -------------DDLSR---------------PLGKGGDFAAFRGGWWYITQFLEWW---PDWCAFEVTQQLNA 121 (152) T ss_pred -------------hheec---------------hhhcCccEEEECCceEEEEEccccc---chhhhhhhhhhhch Confidence 22211 3445689999999999999998762 11444555555554 No 22 >protein:vir:77651 Length: 152 # NCBI annotation: gp06 # Family: family:all:3177 # MgeID: mge:75 # MgeName: Bcep781 # Cross-refs: genbank:acc:YP_022740;genbank:gi:47835021;genbank:GeneID:2821448 Probab=89.55 E-value=0.0047 Score=33.36 Aligned_cols=109 Identities=17% Similarity=0.226 Sum_probs=62.9 Q ss_pred CcC-CCCCCccccceeeEecCCCccc-CCeecCCCCeeEEEEEEEEeecCCcceeecccccc-ceeeeEEeecccccccc Q lcl|NC_017981. 1 MIG-NGPFNKFRKDHTVIIVSDSYFK-DGVIMPGERMYTKAKFSVQAIKNNEEIQGFAEGRR-VSDWRRLYSDTKLPLAG 77 (152) Q Consensus 1 ~~~-~~~~~~FRk~~~V~r~~~G~Yv-~GrwV~G~~~~~ti~aSVQPi~d~e~~q~lpeGrR-itd~~rIYTd~~L~vag 77 (152) .|+ ..|+..+ .+++|.|+-. +|.-++..+ ++++...+||+|. ++++-+ .|-. -...+.||.+-.. T Consensus 10 aI~aVNP~~pA-----~l~~stG~T~~~G~r~P~Y~-~~~v~vQ~QalS~-~dL~h~-dglnqQG~~~~iY~~Gn~---- 77 (152) T protein:vir:77 10 AITQVNPDEPG-----TMFVSTGRTNVRGILTPMFS-SVNAQLQIQAQKH-TPLQHE-RGALYTNSFLTVYAYGKF---- 77 (152) T ss_pred hhhccCCCCce-----EEEEeeceEcCCCcccceec-ceeeEEEEeecCc-hHHHHh-hcccccceeeEEEeccch---- Confidence 222 3454432 3455777532 366555555 4778888899984 444332 2222 2357889985322 Q ss_pred cccccccccccccccccccccccccccccccccCCCCCccEEEECCceEEEEEecchhhCcccceeeeeeeecCC Q lcl|NC_017981. 78 DQLFISASVDVNKDTLATEDGKVIAVGAYIGEKEGMSNPALVVIDGREYEVIERISWQNGIISHYKYYVVRKTYG 152 (152) Q Consensus 78 d~~~~~a~~~~n~~~l~~~~~~~~~~~~~~~~~e~~~n~d~v~~~G~~YeVi~~~~wq~Gii~HykYl~Vr~~d~ 152 (152) |.+.- ....-+|+|+|+|++|.|+++..|= =+--|-++.|+-|- T Consensus 78 -------------~gv~R---------------~~~qGGD~~vf~g~~WLVv~v~E~W---pDWc~V~v~lQ~~a 121 (152) T protein:vir:77 78 -------------DDLSR---------------PLGKGGDFASFRGGWWYITQFLEWW---PDWCAFEVTQQLNA 121 (152) T ss_pred -------------hheec---------------hhhcCccEEEECCceEEEEEccccc---chhhhhhhhhhhch Confidence 22211 3445689999999999999998762 11334445555444 No 23 >protein:vir:104346 Length: 123 # NCBI annotation: conserved phage-related protein # Family: family:all:704 # MgeID: mge:1593 # MgeName: RTP # Cross-refs: genbank:acc:YP_398974;genbank:gi:81343958;genbank:GeneID:3778878 Probab=88.89 E-value=0.015 Score=30.59 Aligned_cols=106 Identities=14% Similarity=0.077 Sum_probs=59.0 Q ss_pred CcC--CCCCCccccceeeEecCCCcccCCeecCCCCeeEEEEEEEEeecCCcceeecccccccee-eeEEeecccccccc Q lcl|NC_017981. 1 MIG--NGPFNKFRKDHTVIIVSDSYFKDGVIMPGERMYTKAKFSVQAIKNNEEIQGFAEGRRVSD-WRRLYSDTKLPLAG 77 (152) Q Consensus 1 ~~~--~~~~~~FRk~~~V~r~~~G~Yv~GrwV~G~~~~~ti~aSVQPi~d~e~~q~lpeGrRitd-~~rIYTd~~L~vag 77 (152) +|- ..|+ .--+.+-++..++.++|..+.-.++..++++-..-.+-.+. .|..|.. =+++|-.++++ T Consensus 13 lI~~f~~~~---gv~~~~t~~~~v~~v~G~ev~~p~~~~~~~Gv~~~y~~r~I-----DG~lIq~gD~~~i~~a~~e--- 81 (123) T protein:vir:10 13 GINFFSDAN---GVYEMSTGAGYVEIVNGVEVEVPAQTFQLKGLVREIKTRDI-----DGEFIQFGDKRGIFTAQVE--- 81 (123) T ss_pred HHHHhCCCC---CceEEecCCCeeeCCCCceeeccceeEeeEEEeccCChhhc-----cceeeeeccEEEEEecCce--- Confidence 331 1111 22233444555688899988888877777754444332211 1222211 12333333322 Q ss_pred cccccccccccccccccccccccccccccccccCCCCCccEEEECCceEEEEEecchh-hCcccceeeeeeeecCC Q lcl|NC_017981. 78 DQLFISASVDVNKDTLATEDGKVIAVGAYIGEKEGMSNPALVVIDGREYEVIERISWQ-NGIISHYKYYVVRKTYG 152 (152) Q Consensus 78 d~~~~~a~~~~n~~~l~~~~~~~~~~~~~~~~~e~~~n~d~v~~~G~~YeVi~~~~wq-~Gii~HykYl~Vr~~d~ 152 (152) -.++|++.+||+.|.||...+|+ .|+.==|+..+=|-+-+ T Consensus 82 -----------------------------------ik~Gd~i~vdGe~~rVV~~~pikPa~~~v~y~~qlRrv~~~ 122 (123) T protein:vir:10 82 -----------------------------------IKQGYQIKVDGETFVVVDPRPVKPTGTTVGYRPILRRVATY 122 (123) T ss_pred -----------------------------------eccCCEEEECCeEEEEecCCccCccceeEEEeeeeeeeeec Confidence 25789999999999999999997 55554455444443333 No 24 >protein:vir:107669 Length: 123 # NCBI annotation: hypothetical protein # Family: family:all:704 # MgeID: mge:1518 # MgeName: T1 # Cross-refs: genbank:acc:YP_003901;genbank:gi:45686317;genbank:GeneID:2773009 Probab=88.45 E-value=0.0089 Score=31.83 Aligned_cols=102 Identities=18% Similarity=0.163 Sum_probs=65.0 Q ss_pred CcCCCCCCccccceeeEecCCCcccCCeecCCCCeeEEEEEEEEeecCCcceeecccccc--ceeeeEEeeccccccccc Q lcl|NC_017981. 1 MIGNGPFNKFRKDHTVIIVSDSYFKDGVIMPGERMYTKAKFSVQAIKNNEEIQGFAEGRR--VSDWRRLYSDTKLPLAGD 78 (152) Q Consensus 1 ~~~~~~~~~FRk~~~V~r~~~G~Yv~GrwV~G~~~~~ti~aSVQPi~d~e~~q~lpeGrR--itd~~rIYTd~~L~vagd 78 (152) -=++|||+-- +++.-++.++|.-|+-.++..++.+-++..+-.|.. |.. .+|-+-|.+ -| T Consensus 18 sd~~g~~~l~------t~~~~~~~v~G~Ev~~p~~~~~~~G~~~~y~~reID-----G~lI~~gDvk~if~-------a~ 79 (123) T protein:vir:10 18 TDPSRPMNLI------KQGEYGYDENGFEIPPMEQVIPISGATRRPNAREID-----GETIRASDILGIFN-------ND 79 (123) T ss_pred cCCCCeEEee------eCCcccccCCCeecccCCeeeeeEEEEeeccccccc-----cceeeeccEEEeec-------cc Confidence 2457777643 344567888999999988888888887776643332 221 223333332 11 Q ss_pred ccccccccccccccccccccccccccccccccCCCCCccEEEECCceEEEEEecchh-hCcccceeeeeeeecCC Q lcl|NC_017981. 79 QLFISASVDVNKDTLATEDGKVIAVGAYIGEKEGMSNPALVVIDGREYEVIERISWQ-NGIISHYKYYVVRKTYG 152 (152) Q Consensus 79 ~~~~~a~~~~n~~~l~~~~~~~~~~~~~~~~~e~~~n~d~v~~~G~~YeVi~~~~wq-~Gii~HykYl~Vr~~d~ 152 (152) -...|+|.+.+||+.|.||..-+|+ .|..=-||..+=|-+-+ T Consensus 80 --------------------------------veik~Gd~I~vDg~~~rVV~~~pvkPa~~~I~y~~qLRrv~~~ 122 (123) T protein:vir:10 80 --------------------------------HEINEGDYIEIDGIRHVVVDARPVQASLEPVAYRPVLRRVSVG 122 (123) T ss_pred --------------------------------eeeccCCEEEECCeEEEEecCcccchhhhhhhhhhhhceeccC Confidence 1237899999999999999999997 44444455544444333 No 25 >protein:vir:4955 Length: 116 # NCBI annotation: putative head-tail joining protein # Family: family:all:1030 # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049931;genbank:gi:9632902;genbank:GeneID:1262078 Probab=87.58 E-value=0.034 Score=28.64 Aligned_cols=112 Identities=16% Similarity=0.151 Sum_probs=70.8 Q ss_pred CcCC-CCCCccccceeeEecCCCccc-CCeecCCCCeeEEEEEEEEeecCCcceeeccccccceeeeEEeeccccccccc Q lcl|NC_017981. 1 MIGN-GPFNKFRKDHTVIIVSDSYFK-DGVIMPGERMYTKAKFSVQAIKNNEEIQGFAEGRRVSDWRRLYSDTKLPLAGD 78 (152) Q Consensus 1 ~~~~-~~~~~FRk~~~V~r~~~G~Yv-~GrwV~G~~~~~ti~aSVQPi~d~e~~q~lpeGrRitd~~rIYTd~~L~vagd 78 (152) |.-| =.-+.|++.-+.-......-. +|.+++..-...++-|++++.+-.++++++ |....+.+.+-- T Consensus 1 m~~~k~~~~~ln~rI~Fg~~~~~~n~~tG~~~~~f~~~f~~wa~~~~~~~~q~~~~~--gt~~e~T~~~vI--------- 69 (116) T protein:vir:49 1 MARGRYLPSDFRYKADFGTYQSTPNKFTGVSVPKFVKQFTLHYKPHTRTLNQEYLAI--QNGENDTRVIVI--------- 69 (116) T ss_pred CcccccccccCceeEEeeeeeeeecCCCCcccceeeeeEEEEEEeecccceeeeeee--cccccccEEEEE--------- Confidence 6655 334556666444343443333 377777755566799999998877777654 333444333322 Q ss_pred ccccccccccccccccccccccccccccccccCCCCCccEEEECCceEEEEEecchhhCcccceeeeeeeecCC Q lcl|NC_017981. 79 QLFISASVDVNKDTLATEDGKVIAVGAYIGEKEGMSNPALVVIDGREYEVIERISWQNGIISHYKYYVVRKTYG 152 (152) Q Consensus 79 ~~~~~a~~~~n~~~l~~~~~~~~~~~~~~~~~e~~~n~d~v~~~G~~YeVi~~~~wq~Gii~HykYl~Vr~~d~ 152 (152) |+-. .-.+.=.+.|+|+.|+|+...+=+.--...|.++.+.+... T Consensus 70 --------------------------Rh~~---~i~~~~~v~~~g~~Y~I~~IspD~~~~~~~yD~iTlk~~~k 114 (116) T protein:vir:49 70 --------------------------RHNA---KVLEGQVVTLNGTQYDIVRISPDDNFGFNHYDFLTLKKRKK 114 (116) T ss_pred --------------------------EeCC---CCCcccEEEECCeEEEEEEeCCCCccCcceeeEEEEEEEee Confidence 2211 12333479999999999998885555588999999887766 No 26 >protein:vir:4459 Length: 134 # NCBI annotation: hypothetical protein # Family: family:all:116 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700382;genbank:gi:23505454;genbank:GeneID:955661 Probab=85.63 E-value=0.051 Score=27.65 Aligned_cols=109 Identities=11% Similarity=0.011 Sum_probs=65.9 Q ss_pred CcCCCCCCccccceeeEecCCCcccCCeecCCCCeeEEEEEEEEeecCCcceeeccccccceeeeEEeeccccccccccc Q lcl|NC_017981. 1 MIGNGPFNKFRKDHTVIIVSDSYFKDGVIMPGERMYTKAKFSVQAIKNNEEIQGFAEGRRVSDWRRLYSDTKLPLAGDQL 80 (152) Q Consensus 1 ~~~~~~~~~FRk~~~V~r~~~G~Yv~GrwV~G~~~~~ti~aSVQPi~d~e~~q~lpeGrRitd~~rIYTd~~L~vagd~~ 80 (152) |+--|. +|+--++.++.+..-..|-++++-...-++-|.|-|++..|...+-...-..+.-++|-- T Consensus 14 ~M~aG~---L~~RI~i~~~~~~~D~~G~~~~~w~~~~~vwA~v~~~sg~E~~~a~~~~~~~t~~i~IR~----------- 79 (134) T protein:vir:44 14 LPDPGE---LDQRIVIRRRVDVPADDFGVTPTYPEQIRTWAKKAQPGAAAYQGSVQIENRVTHYFTIRF----------- 79 (134) T ss_pred ccCccc---cCccEEEEeeeeeeCCCCCeecceEeeEEEEEEEEecCchheeeccceeeeeeEEEEEEe----------- Confidence 776665 455556666666666678888876667789999999998777654333333333333321 Q ss_pred ccccccccccccccccccccccccccccccCCCCCccEEEECCceEEEEEecchh-hCcccceeeeeeeecCC Q lcl|NC_017981. 81 FISASVDVNKDTLATEDGKVIAVGAYIGEKEGMSNPALVVIDGREYEVIERISWQ-NGIISHYKYYVVRKTYG 152 (152) Q Consensus 81 ~~~a~~~~n~~~l~~~~~~~~~~~~~~~~~e~~~n~d~v~~~G~~YeVi~~~~wq-~Gii~HykYl~Vr~~d~ 152 (152) . ++-.+.+.|+|+|+.|+|....+=. .+.-=....-.|=.++| T Consensus 80 --------------------------~---~~It~~~RI~~~g~~y~I~~I~~~~~~~~~L~i~c~evg~~~g 123 (134) T protein:vir:44 80 --------------------------R---RGITADHEVLHDDISYRVKRVRDLNGKRRFLLIECEALGTDNG 123 (134) T ss_pred --------------------------C---CCCCcccEEEECCeEEEEEEecCCCcCCcEEEEEEEEeeecCC Confidence 1 1234568999999999999986543 22211111122334454 No 27 >protein:vir:1890 Length: 110 # NCBI annotation: gp9 # Family: family:all:116 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037670;genbank:gi:9634128;genbank:GeneID:1262503 Probab=82.42 E-value=0.077 Score=26.69 Aligned_cols=105 Identities=13% Similarity=0.145 Sum_probs=64.0 Q ss_pred cCCCCCCccccceeeEecCCCccc-CCeecCCCCeeEEEEEEEEeecCCcceeeccccccceeeeEEeeccccccccccc Q lcl|NC_017981. 2 IGNGPFNKFRKDHTVIIVSDSYFK-DGVIMPGERMYTKAKFSVQAIKNNEEIQGFAEGRRVSDWRRLYSDTKLPLAGDQL 80 (152) Q Consensus 2 ~~~~~~~~FRk~~~V~r~~~G~Yv-~GrwV~G~~~~~ti~aSVQPi~d~e~~q~lpeGrRitd~~rIYTd~~L~vagd~~ 80 (152) +- .-.+|+--++.++....-. +|-+++.....-++-|+|.|++..|.+++-......+-.+.|--.. T Consensus 1 M~---~G~L~~rI~i~~~~~~~d~~~G~~~~~~~~~~~~wA~v~~~~~~e~~~a~~~~~~~~~~~~iR~~~--------- 68 (110) T protein:vir:18 1 MQ---AGKLRHRITLQEPVKVQNPTTGAVINTWRDVATVRAEVSPLSAREFIAAQASQGEITTRIVIRYRA--------- 68 (110) T ss_pred CC---ccccCccEEEEeeeeeecCCCCccccceeeeEEEEEEEEecCchheeecceeeeeeeEEEEEEecC--------- Confidence 22 3356666667666655543 4667777677778999999999888776654444433333332211 Q ss_pred ccccccccccccccccccccccccccccccCCCCCccEEEECCceEEEEEecchhhCcccceeeeeee----ecCC Q lcl|NC_017981. 81 FISASVDVNKDTLATEDGKVIAVGAYIGEKEGMSNPALVVIDGREYEVIERISWQNGIISHYKYYVVR----KTYG 152 (152) Q Consensus 81 ~~~a~~~~n~~~l~~~~~~~~~~~~~~~~~e~~~n~d~v~~~G~~YeVi~~~~wq~Gii~HykYl~Vr----~~d~ 152 (152) +-.+.+.|+|+|+.|+|.++.+=..+ ..+||.+. -++| T Consensus 69 -------------------------------~I~~~~ri~~~g~~y~I~~v~~d~~~---~~~~l~i~~~e~~~~G 110 (110) T protein:vir:18 69 -------------------------------GVTRKHRILFRGAVYNIHGVLPDPKS---GREYLTLPCSEGVNDG 110 (110) T ss_pred -------------------------------CCCcccEEEECCeEEEEEeccCCccc---CCeEEEEEEEEeccCC Confidence 22345789999999999998542111 12344433 3555 No 28 >protein:vir:81216 Length: 118 # NCBI annotation: gp9 # Family: family:all:10295 # MgeID: mge:1893 # MgeName: BFK20 # Cross-refs: genbank:acc:YP_001456739;genbank:gi:157168382;uniprot:Q9MBJ6;genbank:GeneID:5580339 Probab=81.29 E-value=0.042 Score=28.15 Aligned_cols=107 Identities=17% Similarity=0.141 Sum_probs=64.5 Q ss_pred CCCccccceeeEec--CCCcccC-Ce--ecCCCCeeEEEEEEEEeecCCcceeeccccccceeeeEEeeccccccccccc Q lcl|NC_017981. 6 PFNKFRKDHTVIIV--SDSYFKD-GV--IMPGERMYTKAKFSVQAIKNNEEIQGFAEGRRVSDWRRLYSDTKLPLAGDQL 80 (152) Q Consensus 6 ~~~~FRk~~~V~r~--~~G~Yv~-Gr--wV~G~~~~~ti~aSVQPi~d~e~~q~lpeGrRitd~~rIYTd~~L~vagd~~ 80 (152) --.-|-+.-+|+|. ..+.|-+ +. |-+.+.+.++.-.||||+. +++.. ----.-+++++ +|+-.-+ T Consensus 1 m~~~F~~~v~ilRa~~~~~~yg~d~~~dw~~pv~ipV~~~vSvQPv~-StE~~-~~r~~vVt~w~-l~~Ppg~------- 70 (118) T protein:vir:81 1 MTVIFVNAVTVLRAREVGSVYSSEKTLTWDDPVRIDVPFLVSVQPRG-STEGG-TDRPTVVSAWW-MCTPPGT------- 70 (118) T ss_pred CeeeeeeeEEEecCCccccccCCCcccccCCceeeeccCcceeeecC-ccccC-CCCceeeeeeE-eecCCCC------- Confidence 11226666666665 3445543 32 8888888888889999986 34421 11112233333 5552211 Q ss_pred ccccccccccccccccccccccccccccccCCCCCccEEEEC-CceEEEEEe-cch----hhCcccceeeeeeeecC Q lcl|NC_017981. 81 FISASVDVNKDTLATEDGKVIAVGAYIGEKEGMSNPALVVID-GREYEVIER-ISW----QNGIISHYKYYVVRKTY 151 (152) Q Consensus 81 ~~~a~~~~n~~~l~~~~~~~~~~~~~~~~~e~~~n~d~v~~~-G~~YeVi~~-~~w----q~Gii~HykYl~Vr~~d 151 (152) ..+...-|.|-++ |..|||..+ ..| .-+.+.|-++-+-+.+- T Consensus 71 -----------------------------d~~Lra~DRVr~a~G~~~eV~G~P~~wp~P~~t~~v~Hvea~LevvtG 118 (118) T protein:vir:81 71 -----------------------------DLDLRPEDRVELATGLQLEVVGQPLRWPDPVNQDQVHHVEANLEVVDG 118 (118) T ss_pred -----------------------------CcCCCccceeeeccccEEEEecCcccccCccccccccceeEEEEEecC Confidence 1223445899996 999999986 356 57778899987755554 No 29 >protein:vir:100226 Length: 114 # NCBI annotation: putative head-tail joining protein # Family: family:all:1030 # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025033;genbank:gi:48697266;genbank:GeneID:2948324 Probab=80.89 E-value=0.091 Score=26.30 Aligned_cols=112 Identities=13% Similarity=0.103 Sum_probs=72.8 Q ss_pred CcCCCCCCccccceeeEecCCCcccCCeecCCCCeeEEEEEEEEeecCCcceeeccccccceeeeEEeeccccccccccc Q lcl|NC_017981. 1 MIGNGPFNKFRKDHTVIIVSDSYFKDGVIMPGERMYTKAKFSVQAIKNNEEIQGFAEGRRVSDWRRLYSDTKLPLAGDQL 80 (152) Q Consensus 1 ~~~~~~~~~FRk~~~V~r~~~G~Yv~GrwV~G~~~~~ti~aSVQPi~d~e~~q~lpeGrRitd~~rIYTd~~L~vagd~~ 80 (152) |..+=.-+.|++.-+.-......-.+|..+++.-...++-|+.+-.+-.++++. .|..+.+.+.+-- T Consensus 1 m~~~~~p~~ln~ri~FG~~~t~~~~~G~~~~~fv~~f~~~~~~~t~t~~q~~~~--~Gt~~e~t~~~vI----------- 67 (114) T protein:vir:10 1 MMAKFKVADFSRKVDLGSPQSHKTGAGINITSFVPNYSLHFKQQTRTLTQQYTL--VGTRLDNSITIIV----------- 67 (114) T ss_pred CCccccccccceEEEeeeeeeecCCCCcccceeeeeEEEEEEEeecchheeeee--ccccccccEEEEE----------- Confidence 988888888888754434333333368877776556678888888776666544 3555555444322 Q ss_pred ccccccccccccccccccccccccccccccCCCCCccEEEECCceEEEEEecchhhCcccceeeeeeee-cCC Q lcl|NC_017981. 81 FISASVDVNKDTLATEDGKVIAVGAYIGEKEGMSNPALVVIDGREYEVIERISWQNGIISHYKYYVVRK-TYG 152 (152) Q Consensus 81 ~~~a~~~~n~~~l~~~~~~~~~~~~~~~~~e~~~n~d~v~~~G~~YeVi~~~~wq~Gii~HykYl~Vr~-~d~ 152 (152) |+- ...++.=.+.+||+.|+|+...+=++--..+|-++-+++ .-| T Consensus 68 ------------------------Rh~---~~it~~m~v~~~g~~Y~Iv~IspDd~~~~~~yD~iTlkk~~~g 113 (114) T protein:vir:10 68 ------------------------RHD---TRNASQKQARLDGIVYDISDISPDDSNDAIRYDYLTLVKTTKG 113 (114) T ss_pred ------------------------EeC---CCCCcccEEEECCeEEEEEEeCCCCccCcceeeeEEEEEEecC Confidence 110 112233368999999999999995555688899996544 455 No 30 >protein:vir:4832 Length: 116 # NCBI annotation: ORF28 # Family: family:all:1030 # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038329;genbank:gi:9634655;genbank:GeneID:1262597 Probab=79.86 E-value=0.097 Score=26.13 Aligned_cols=112 Identities=17% Similarity=0.143 Sum_probs=68.9 Q ss_pred CcCC-CCCCccccceeeEecCCCccc-CCeecCCCCeeEEEEEEEEeecCCcceeeccccccceeeeEEeeccccccccc Q lcl|NC_017981. 1 MIGN-GPFNKFRKDHTVIIVSDSYFK-DGVIMPGERMYTKAKFSVQAIKNNEEIQGFAEGRRVSDWRRLYSDTKLPLAGD 78 (152) Q Consensus 1 ~~~~-~~~~~FRk~~~V~r~~~G~Yv-~GrwV~G~~~~~ti~aSVQPi~d~e~~q~lpeGrRitd~~rIYTd~~L~vagd 78 (152) |.-| =.-+.|++.-+.-......-- .|..++......++.|..+..+-.++++.+ |-.+.+.+.+-- T Consensus 1 M~~~k~~p~~~n~ri~FG~~~~~~n~~tG~~~~~f~~~ft~~~~~~t~t~~q~~~~~--Gt~~edt~~~vI--------- 69 (116) T protein:vir:48 1 MARVRYLPSDFRFKADFGTYQSSPNKFTGVSVPKFVKQFTLHYKPHTRTLNQEYLAI--QNGENDTRVIVI--------- 69 (116) T ss_pred CeeeeeehhhccEEEEeceeeeeeCCCCCcccceeeeeEEEEEEEeecchhheeeec--cccccCcEEEEE--------- Confidence 6544 223456555333222332222 377777765566788888888766666543 444444444322 Q ss_pred ccccccccccccccccccccccccccccccccCCCCCccEEEECCceEEEEEecchhhCcccceeeeeeeecCC Q lcl|NC_017981. 79 QLFISASVDVNKDTLATEDGKVIAVGAYIGEKEGMSNPALVVIDGREYEVIERISWQNGIISHYKYYVVRKTYG 152 (152) Q Consensus 79 ~~~~~a~~~~n~~~l~~~~~~~~~~~~~~~~~e~~~n~d~v~~~G~~YeVi~~~~wq~Gii~HykYl~Vr~~d~ 152 (152) |+- ....+.=.+.|+|+.|+|+...+=+..-...|.++-+++... T Consensus 70 --------------------------Rh~---~~i~~~~~v~~~g~~Y~I~~Is~Dd~~~~~~yD~lTlk~~~k 114 (116) T protein:vir:48 70 --------------------------RHN---SKVLEGQVVTLNGTQYDIVRISPDENFGFNHYDFLTLKKRKK 114 (116) T ss_pred --------------------------EeC---CCCCcccEEEECCeEEEEEEeCCCCCcCcceeeEEEEEEEee Confidence 110 112333468999999999999996666789999999888776 No 31 >protein:vir:93599 Length: 116 # NCBI annotation: putative head-tail adaptor # Family: family:all:116 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449298;genbank:gi:157166046;interpro:IPR006453;interpro:IPR013045;uniprot:Q6H9U3;genbank:GeneID:5580421 Probab=78.16 E-value=0.12 Score=25.69 Aligned_cols=107 Identities=11% Similarity=-0.015 Sum_probs=65.2 Q ss_pred CcCCCCCCccccceeeEecCCCcccCCeecCCCCeeEEEEEEEEeecCCcceeeccccccceeeeEEeeccccccccccc Q lcl|NC_017981. 1 MIGNGPFNKFRKDHTVIIVSDSYFKDGVIMPGERMYTKAKFSVQAIKNNEEIQGFAEGRRVSDWRRLYSDTKLPLAGDQL 80 (152) Q Consensus 1 ~~~~~~~~~FRk~~~V~r~~~G~Yv~GrwV~G~~~~~ti~aSVQPi~d~e~~q~lpeGrRitd~~rIYTd~~L~vagd~~ 80 (152) |+--|.+ |+.-++.++....--.|-++++-...-++-|+|.|++..|.+++-...-..+.-+.|--...+ T Consensus 2 ~m~aG~L---~~rI~iq~~~~~~d~~G~~~~~w~~~~~~wA~v~~~sg~e~~~a~~~~~~~~~~i~iRy~~~~------- 71 (116) T protein:vir:93 2 AISAGRL---TQMISVLNPVLTRNAAGEMTEEWVSCGKIHADIRGRSSRERMQSGAEMAQAEIRIWVRGQSGR------- 71 (116) T ss_pred CcCcccc---CccEEEEeeeeccCCCCCeecceEeEEEEEEEEEecChhheeeccceeeeeeEEEEEEecCCC------- Confidence 7777765 555566666665555688887766677899999999988887654443323222222111111 Q ss_pred ccccccccccccccccccccccccccccccCCCCCccEEE-----ECCceEEEEEecchhhCcccceeeeeeeecCC Q lcl|NC_017981. 81 FISASVDVNKDTLATEDGKVIAVGAYIGEKEGMSNPALVV-----IDGREYEVIERISWQNGIISHYKYYVVRKTYG 152 (152) Q Consensus 81 ~~~a~~~~n~~~l~~~~~~~~~~~~~~~~~e~~~n~d~v~-----~~G~~YeVi~~~~wq~Gii~HykYl~Vr~~d~ 152 (152) ...+.+.++ ++|+.|+|.++.+-.. =.++|.++=..| T Consensus 72 -------------------------------dI~~~~Ri~~~~~~~~g~~y~I~~v~~~~~----~~~~l~i~c~eg 113 (116) T protein:vir:93 72 -------------------------------EITAASRLHVLSGPWRDRILNVVGLPVPDA----TGGRLEILCRLG 113 (116) T ss_pred -------------------------------CCCcccEEEEcCcccCCeEEEEEecCCCCC----CCcEEEEEEEec Confidence 122345565 7999999999854321 125666665556 No 32 >protein:vir:4999 Length: 116 # NCBI annotation: putative head-tail joining protein # Family: family:all:1030 # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049973;genbank:gi:9632945;genbank:GeneID:1262108 Probab=76.73 E-value=0.13 Score=25.40 Aligned_cols=112 Identities=19% Similarity=0.128 Sum_probs=69.7 Q ss_pred CcCC-CCCCccccceeeEecCCCccc-CCeecCCCCeeEEEEEEEEeecCCcceeeccccccceeeeEEeeccccccccc Q lcl|NC_017981. 1 MIGN-GPFNKFRKDHTVIIVSDSYFK-DGVIMPGERMYTKAKFSVQAIKNNEEIQGFAEGRRVSDWRRLYSDTKLPLAGD 78 (152) Q Consensus 1 ~~~~-~~~~~FRk~~~V~r~~~G~Yv-~GrwV~G~~~~~ti~aSVQPi~d~e~~q~lpeGrRitd~~rIYTd~~L~vagd 78 (152) |--| =.-+.|++.-+.-......-. .|.+++......++.|+++..+-.++++++ |-.+.+.+.+-- T Consensus 1 M~~~k~~p~~ln~ri~Fg~~~~~~n~~tG~~~~~f~~~ft~~~~~~t~t~~q~~~~~--Gt~~edT~~~vI--------- 69 (116) T protein:vir:49 1 MRKVKYLPSDFPYKADFGTYQSTPNKFTGVSVPKFVKQFTLHYQPHIRTLNQEYLAQ--QNGEDDTRVIVI--------- 69 (116) T ss_pred CcccccccccCcEeEEeeeeeeeecCCCCcccceeeeeEEEEEEEeecchheeeeec--cccccccEEEEE--------- Confidence 5444 222345555333332332222 488888876667788888888766666543 444444443322 Q ss_pred ccccccccccccccccccccccccccccccccCCCCCccEEEECCceEEEEEecchhhCcccceeeeeeeecCC Q lcl|NC_017981. 79 QLFISASVDVNKDTLATEDGKVIAVGAYIGEKEGMSNPALVVIDGREYEVIERISWQNGIISHYKYYVVRKTYG 152 (152) Q Consensus 79 ~~~~~a~~~~n~~~l~~~~~~~~~~~~~~~~~e~~~n~d~v~~~G~~YeVi~~~~wq~Gii~HykYl~Vr~~d~ 152 (152) |+- ...++.=.+.++|+.|+|+...+=+..-...|.+|.+++... T Consensus 70 --------------------------Rh~---~~i~~~~~v~~~g~~Y~I~~is~Dd~~~~~~yD~lTlk~~~~ 114 (116) T protein:vir:49 70 --------------------------RHN---SKVIEGQVVVLHGTQYDIIRASSNENFGINRYDFLPVRQRKN 114 (116) T ss_pred --------------------------EeC---CCCCcccEEEECCeEEEEEEeCCCCCCCcceeeEEEEEEeec Confidence 110 112233479999999999999996666789999999888776 No 33 >protein:vir:1385 Length: 107 # NCBI annotation: Gp8 protein # Family: family:all:3858 # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612837;genbank:gi:20065971;genbank:GeneID:935786 Probab=75.65 E-value=0.14 Score=25.19 Aligned_cols=102 Identities=11% Similarity=-0.028 Sum_probs=62.9 Q ss_pred cccceeeEecCCCcc--cCCeecCCCCeeEEEEEEEEeecCCcceeeccccccceeeeEE-eeccccccccccccccccc Q lcl|NC_017981. 10 FRKDHTVIIVSDSYF--KDGVIMPGERMYTKAKFSVQAIKNNEEIQGFAEGRRVSDWRRL-YSDTKLPLAGDQLFISASV 86 (152) Q Consensus 10 FRk~~~V~r~~~G~Y--v~GrwV~G~~~~~ti~aSVQPi~d~e~~q~lpeGrRitd~~rI-YTd~~L~vagd~~~~~a~~ 86 (152) .+|-|-|...++-.- ..|-+++.-...-++-|+|-|++..|.+.+-......+.-+.| |. T Consensus 1 ~~~~hRI~i~~~~~~~D~~G~~~~~w~~~~~~WA~v~~~~g~E~~~a~~~~~~~~~~f~iRy~----------------- 63 (107) T protein:vir:13 1 MARYERISIKKLEEKNIKGRRQEECLIPFYDCWAEILDLYGQELYGALQMKLENTIIFKIRYC----------------- 63 (107) T ss_pred CCcceEEEEEeeeeeeCCCCCeecceEeEEEEEEEEecCCchheeecceeheeeeEEEEEEec----------------- Confidence 667777766544333 3488888766667899999999988887766665555555555 43 Q ss_pred ccccccccccccccccccccccccCC--CCCccEEEECCceEEEEEecchhhCcccceeeeeeeec Q lcl|NC_017981. 87 DVNKDTLATEDGKVIAVGAYIGEKEG--MSNPALVVIDGREYEVIERISWQNGIISHYKYYVVRKT 150 (152) Q Consensus 87 ~~n~~~l~~~~~~~~~~~~~~~~~e~--~~n~d~v~~~G~~YeVi~~~~wq~Gii~HykYl~Vr~~ 150 (152) +.-+. ..+...++|+|+.|+|.+..+-...- .-.+-+|-+-= T Consensus 64 ---------------------~~i~~~~~t~~~Ri~~~g~~y~I~~v~~~~~~~-~~l~i~c~eV~ 107 (107) T protein:vir:13 64 ---------------------KKVEELRNKENFIVEWQGRKYEIYYPDFLGYNK-QFVKLKCKEVL 107 (107) T ss_pred ---------------------CCccccccCcCcEEEECCeEEEEEecCCcccCC-eEEEEEEEEeC Confidence 21122 13457999999999999987654221 01111221111 No 34 >protein:vir:4858 Length: 116 # NCBI annotation: putative head-tail joining protein # Family: family:all:1030 # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049398;genbank:gi:9632426;genbank:GeneID:1258519 Probab=74.72 E-value=0.15 Score=25.02 Aligned_cols=112 Identities=17% Similarity=0.124 Sum_probs=70.3 Q ss_pred CcCC-CCCCccccceeeEecCCCccc-CCeecCCCCeeEEEEEEEEeecCCcceeeccccccceeeeEEeeccccccccc Q lcl|NC_017981. 1 MIGN-GPFNKFRKDHTVIIVSDSYFK-DGVIMPGERMYTKAKFSVQAIKNNEEIQGFAEGRRVSDWRRLYSDTKLPLAGD 78 (152) Q Consensus 1 ~~~~-~~~~~FRk~~~V~r~~~G~Yv-~GrwV~G~~~~~ti~aSVQPi~d~e~~q~lpeGrRitd~~rIYTd~~L~vagd 78 (152) |.-| =.-+.||+.-+.-......-. +|.+++......++-|++++.+-.++++++ |-.+.+.+.+-- T Consensus 1 M~~~k~~~~~ln~ri~Fg~~~~~~n~~~G~~~~~f~~~~~~w~~~~~~t~~q~~~~~--gt~~e~T~~~vI--------- 69 (116) T protein:vir:48 1 MARVGYLPSDFRYKADFGTYQSTPNKFTGVSVPKFVKQFTLHYKPHTRTLNQEYLAQ--QNGESDTIVIVI--------- 69 (116) T ss_pred CcccccccccccEeEEeeeeeeeecCCCCcccceeeeeEEEEEEeeecchheeeeee--cccccccEEEEE--------- Confidence 5555 223456665333333333323 377877766677899999998877776554 444444443322 Q ss_pred ccccccccccccccccccccccccccccccccCCCCCccEEEECCceEEEEEecchhhCcccceeeeeeeecCC Q lcl|NC_017981. 79 QLFISASVDVNKDTLATEDGKVIAVGAYIGEKEGMSNPALVVIDGREYEVIERISWQNGIISHYKYYVVRKTYG 152 (152) Q Consensus 79 ~~~~~a~~~~n~~~l~~~~~~~~~~~~~~~~~e~~~n~d~v~~~G~~YeVi~~~~wq~Gii~HykYl~Vr~~d~ 152 (152) |+- ..-.+.=.+.|+|+.|+|+...+=+.--...|.++.+++... T Consensus 70 --------------------------Rh~---~~i~~~~~v~~~G~~Y~I~~Is~D~~~~~~~yD~iTlk~~~k 114 (116) T protein:vir:48 70 --------------------------RHN---AKVLEGQVVTLNGTQYDIVRISADENFGFNHYDFLTLRKHKK 114 (116) T ss_pred --------------------------EeC---CCCCcccEEEECCeEEEEEEeCCCCCcCcceeeEEEEEEEee Confidence 111 012223468999999999998885555688999999888776 No 35 >protein:vir:193 Length: 112 # NCBI annotation: putative head-tail adaptor # Family: family:all:116 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037703;genbank:gi:9634168;genbank:GeneID:1262533 Probab=73.00 E-value=0.18 Score=24.73 Aligned_cols=104 Identities=14% Similarity=0.114 Sum_probs=64.8 Q ss_pred cCCCCCCccccceeeEecCCCcccCCeecCCCCeeEEEEEEEEeecCCcceeeccccccceeeeEEeecccccccccccc Q lcl|NC_017981. 2 IGNGPFNKFRKDHTVIIVSDSYFKDGVIMPGERMYTKAKFSVQAIKNNEEIQGFAEGRRVSDWRRLYSDTKLPLAGDQLF 81 (152) Q Consensus 2 ~~~~~~~~FRk~~~V~r~~~G~Yv~GrwV~G~~~~~ti~aSVQPi~d~e~~q~lpeGrRitd~~rIYTd~~L~vagd~~~ 81 (152) +--| .+|+.-++.++....-..|-+++.-...-++-|+|-|++..|...+-......+.-++|=-...| T Consensus 1 M~~G---~L~~rI~i~~~~~~~d~~G~~~~~w~~~~~~wA~v~~~sg~e~~~a~~~~~~~~~~i~iR~~~~I-------- 69 (112) T protein:vir:19 1 MEPG---RFRNRVKILTFTTSRDPSGQPVESWTGGNPVPAEVKGISGREQLSGGAETAQATIRVWMRFRSEL-------- 69 (112) T ss_pred CCcc---ccCccEEEEeeeeeeCCCCCeecceEeEEEEEEEEEecCchheeeccceeeeeeEEEEEEecCCC-------- Confidence 2234 56777777777777666799998877777899999999988777654333333322222111111 Q ss_pred cccccccccccccccccccccccccccccCCCCCccEE-----EECCceEEEEEecchhhCcccceeeeeeeecCC Q lcl|NC_017981. 82 ISASVDVNKDTLATEDGKVIAVGAYIGEKEGMSNPALV-----VIDGREYEVIERISWQNGIISHYKYYVVRKTYG 152 (152) Q Consensus 82 ~~a~~~~n~~~l~~~~~~~~~~~~~~~~~e~~~n~d~v-----~~~G~~YeVi~~~~wq~Gii~HykYl~Vr~~d~ 152 (152) .+.+.| .|+|+.|+|..+.+-... .+||.+.-..| T Consensus 70 --------------------------------~~~~ri~~~~~~~~g~~y~I~~v~~~~~~----~~~l~l~c~eg 109 (112) T protein:vir:19 70 --------------------------------NASSRLEVLSGPYKGQVLNIIGPPVANAT----GTRLEILCKTG 109 (112) T ss_pred --------------------------------CcccceeecceeeCCeEEEEEecCCCccC----CcEEEEEEEEc Confidence 112223 589999999998765432 25665555455 No 36 >protein:vir:80114 Length: 101 # NCBI annotation: hypothetical protein # Family: family:all:1410 # MgeID: mge:1877 # MgeName: bacteriophage bv1 # Cross-refs: genbank:acc:YP_001425607;genbank:gi:158348414;genbank:GeneID:5469540 Probab=72.22 E-value=0.16 Score=24.97 Aligned_cols=100 Identities=16% Similarity=0.058 Sum_probs=70.3 Q ss_pred CcCCCCCCccccceeeEecCCCcccCCeecCCCCeeEEEEEEEEeecCCcceeeccccccceeeeEEeeccccccccccc Q lcl|NC_017981. 1 MIGNGPFNKFRKDHTVIIVSDSYFKDGVIMPGERMYTKAKFSVQAIKNNEEIQGFAEGRRVSDWRRLYSDTKLPLAGDQL 80 (152) Q Consensus 1 ~~~~~~~~~FRk~~~V~r~~~G~Yv~GrwV~G~~~~~ti~aSVQPi~d~e~~q~lpeGrRitd~~rIYTd~~L~vagd~~ 80 (152) |. |-..-+.+-.....-.-|--++. +...++.+++=+++.+|-.++-..|-++.-...|-+ T Consensus 1 M~-------~~~~i~Li~~~~~~D~~G~~~~~-~~~r~V~~~~ksV~~sEfy~A~~~G~kpe~~~~i~~----------- 61 (101) T protein:vir:80 1 MT-------YDNELVLIAQEFTEDEIGNQVPI-ETRRTVLCNVKSVGRNEFYSAATAGLRPSIVFVIHG----------- 61 (101) T ss_pred Cc-------ccceEEecceeeeecCCCccccc-cceeEEEEEecccChhHHHHHHhcCCceEEEEEEeh----------- Confidence 32 33333333333333334777775 347889999999999999999999999886666644 Q ss_pred ccccccccccccccccccccccccccccccCCCCCccEEEECCceEEEEEecchhhCcccceeeeeeeecCC Q lcl|NC_017981. 81 FISASVDVNKDTLATEDGKVIAVGAYIGEKEGMSNPALVVIDGREYEVIERISWQNGIISHYKYYVVRKTYG 152 (152) Q Consensus 81 ~~~a~~~~n~~~l~~~~~~~~~~~~~~~~~e~~~n~d~v~~~G~~YeVi~~~~wq~Gii~HykYl~Vr~~d~ 152 (152) ..=.+...|.++|++|.|+.+-+=+++. .+.+|+|..-+ T Consensus 62 ------------------------------~eY~gE~~v~~~g~~Y~I~RTy~~~~e~---iEL~c~~~~g~ 100 (101) T protein:vir:80 62 ------------------------------YEYDGEQEVEFESVKYRVIRTYSVSFEE---VELTCERVLND 100 (101) T ss_pred ------------------------------hhhCCceEEEECCEEEEEEEeEeCCCCE---EEEEEEEeccC Confidence 1223447899999999999998888776 46788887655 No 37 >protein:vir:96294 Length: 111 # NCBI annotation: ORF043 # Family: family:all:788 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240314;genbank:gi:66396005;genbank:GeneID:5133371 Probab=71.44 E-value=0.088 Score=26.38 Aligned_cols=107 Identities=17% Similarity=0.176 Sum_probs=66.2 Q ss_pred CcCCCCCCccccceeeEecCCCcccCCeecCC--CCeeEEEEEEEEeecCCcceeeccccccceeeeEEeeccccccccc Q lcl|NC_017981. 1 MIGNGPFNKFRKDHTVIIVSDSYFKDGVIMPG--ERMYTKAKFSVQAIKNNEEIQGFAEGRRVSDWRRLYSDTKLPLAGD 78 (152) Q Consensus 1 ~~~~~~~~~FRk~~~V~r~~~G~Yv~GrwV~G--~~~~~ti~aSVQPi~d~e~~q~lpeGrRitd~~rIYTd~~L~vagd 78 (152) |. .||..|++--++=+-.. ++|.+-.. -.-.-|+.|-|||++..|.+++-.--. .=-+++|+-..+.++ T Consensus 1 ~~--~~y~EfpHrITiQ~~~~---v~g~g~~~~~wvd~~T~~A~V~~lt~~Ey~~AqQ~q~--~v~~nvy~rY~~~I~-- 71 (111) T protein:vir:96 1 MF--NPYDEFPHTISIGSIKK---VGEYPIIQERFVSDKTIKGFMDTPTTSEQLKFHQMSQ--EYDRNLYVPYDLPIA-- 71 (111) T ss_pred CC--CchhhCCceEEEEEEEE---ecCCCCccccccchhheeeeecCCChHHHHHHHhhcC--ceeeEEEEeecCCCC-- Confidence 54 58889987655444322 24333332 222447999999999888775543211 123578886666654 Q ss_pred ccccccccccccccccccccccccccccccccCCCCCccEEEECCceEEEEEecchhhCc--ccceeeeeeeecCC Q lcl|NC_017981. 79 QLFISASVDVNKDTLATEDGKVIAVGAYIGEKEGMSNPALVVIDGREYEVIERISWQNGI--ISHYKYYVVRKTYG 152 (152) Q Consensus 79 ~~~~~a~~~~n~~~l~~~~~~~~~~~~~~~~~e~~~n~d~v~~~G~~YeVi~~~~wq~Gi--i~HykYl~Vr~~d~ 152 (152) +-.+|.|.||-|+|++-.-=|+|+ |.-.|--.+=.--| T Consensus 72 ------------------------------------~~mri~yegRi~~Ivs~pvDqgg~~ei~~~~~~~~~~~~~ 111 (111) T protein:vir:96 72 ------------------------------------KNNLFEYEGRIFSIVGDSVDQGGQHEIKLLRLKQVPYGKS 111 (111) T ss_pred ------------------------------------cccEEEECCeEEEEeecccccCcceeeeeeeeeeccccCC Confidence 347999999999999998889998 54444322211112 No 38 >protein:vir:105908 Length: 111 # NCBI annotation: hypothetical protein # Family: family:all:788 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004378;genbank:gi:122891833;genbank:GeneID:4712379 Probab=71.44 E-value=0.088 Score=26.38 Aligned_cols=107 Identities=17% Similarity=0.176 Sum_probs=66.2 Q ss_pred CcCCCCCCccccceeeEecCCCcccCCeecCC--CCeeEEEEEEEEeecCCcceeeccccccceeeeEEeeccccccccc Q lcl|NC_017981. 1 MIGNGPFNKFRKDHTVIIVSDSYFKDGVIMPG--ERMYTKAKFSVQAIKNNEEIQGFAEGRRVSDWRRLYSDTKLPLAGD 78 (152) Q Consensus 1 ~~~~~~~~~FRk~~~V~r~~~G~Yv~GrwV~G--~~~~~ti~aSVQPi~d~e~~q~lpeGrRitd~~rIYTd~~L~vagd 78 (152) |. .||..|++--++=+-.. ++|.+-.. -.-.-|+.|-|||++..|.+++-.--. .=-+++|+-..+.++ T Consensus 1 ~~--~~y~EfpHrITiQ~~~~---v~g~g~~~~~wvd~~T~~A~V~~lt~~Ey~~AqQ~q~--~v~~nvy~rY~~~I~-- 71 (111) T protein:vir:10 1 MF--NPYDEFPHTISIGSIKK---VGEYPIIQERFVSDKTIKGFMDTPTTSEQLKFHQMSQ--EYDRNLYVPYDLPIA-- 71 (111) T ss_pred CC--CchhhCCceEEEEEEEE---ecCCCCccccccchhheeeeecCCChHHHHHHHhhcC--ceeeEEEEeecCCCC-- Confidence 54 58889987655444322 24333332 222447999999999888775543211 123578886666654 Q ss_pred ccccccccccccccccccccccccccccccccCCCCCccEEEECCceEEEEEecchhhCc--ccceeeeeeeecCC Q lcl|NC_017981. 79 QLFISASVDVNKDTLATEDGKVIAVGAYIGEKEGMSNPALVVIDGREYEVIERISWQNGI--ISHYKYYVVRKTYG 152 (152) Q Consensus 79 ~~~~~a~~~~n~~~l~~~~~~~~~~~~~~~~~e~~~n~d~v~~~G~~YeVi~~~~wq~Gi--i~HykYl~Vr~~d~ 152 (152) +-.+|.|.||-|+|++-.-=|+|+ |.-.|--.+=.--| T Consensus 72 ------------------------------------~~mri~yegRi~~Ivs~pvDqgg~~ei~~~~~~~~~~~~~ 111 (111) T protein:vir:10 72 ------------------------------------KNNLFEYEGRIFSIVGDSVDQGGQHEIKLLRLKQVPYGKS 111 (111) T ss_pred ------------------------------------cccEEEECCeEEEEeecccccCcceeeeeeeeeeccccCC Confidence 347999999999999998889998 54444322211112 No 39 >protein:vir:80382 Length: 122 # NCBI annotation: BcepGomrgp10 # Family: family:all:704 # MgeID: mge:1882 # MgeName: BcepGomr # Cross-refs: genbank:acc:YP_001210230;genbank:gi:146329922;genbank:GeneID:5123491 Probab=70.55 E-value=0.096 Score=26.17 Aligned_cols=109 Identities=13% Similarity=0.146 Sum_probs=61.0 Q ss_pred CcC----------CCCCCccccceeeEecCCCcccCCeecCCC--CeeEEEEEEEEeecCCcceeeccccccceeeeEEe Q lcl|NC_017981. 1 MIG----------NGPFNKFRKDHTVIIVSDSYFKDGVIMPGE--RMYTKAKFSVQAIKNNEEIQGFAEGRRVSDWRRLY 68 (152) Q Consensus 1 ~~~----------~~~~~~FRk~~~V~r~~~G~Yv~GrwV~G~--~~~~ti~aSVQPi~d~e~~q~lpeGrRitd~~rIY 68 (152) |-. .--+.+|=++.+|++.-...--++.|-|+. +...++.+-+.+++-.+ ..|..|. = T Consensus 1 m~~f~Y~rl~~~A~~Li~kfG~~~tv~~~~~~~~~~~~~~P~t~~~~~~~v~gv~~~y~~r~-----idGtlIq-----~ 70 (122) T protein:vir:80 1 MKSFNYPRLLKTVDRLIEKFGEECFIVEYIDAVDPTRPFDPPVRTEVRTPVKGVFVKATEKH-----ADGTLIH-----I 70 (122) T ss_pred CCCcchhHHHHHHHHHHHHhCCCeEEEEeeccCCCCCccccCCCceeeccceEEEecccccc-----cCCcEEe-----e Confidence 221 123456668888876544344456777765 44567888887765221 1232111 1 Q ss_pred ecccccccccccccccccccccccccccccccccccccccccCCCCCccEEEECCceEEEEEecch-hhCcccceeeeee Q lcl|NC_017981. 69 SDTKLPLAGDQLFISASVDVNKDTLATEDGKVIAVGAYIGEKEGMSNPALVVIDGREYEVIERISW-QNGIISHYKYYVV 147 (152) Q Consensus 69 Td~~L~vagd~~~~~a~~~~n~~~l~~~~~~~~~~~~~~~~~e~~~n~d~v~~~G~~YeVi~~~~w-q~Gii~HykYl~V 147 (152) +|-+|-++.+. + ..--..+|+|+++|+.|.||++.++ .+|+.=+|+-. + T Consensus 71 GD~~~~~~a~~-------------~----------------~~~P~~~D~v~~~g~~~~Vi~v~p~~pag~~v~y~~q-~ 120 (122) T protein:vir:80 71 GDQLVLISGSL-------------K----------------REAANIKGDLFRGAEKWKMWNVVPLKPGPVNMLFKIK-V 120 (122) T ss_pred CCEEEEEeccc-------------c----------------cccCCcCCEEEeCCeeEEEEeccccCCCCceEEEEEE-E Confidence 12122111100 0 0011224999999999999999887 58887777744 5 Q ss_pred ee Q lcl|NC_017981. 148 RK 149 (152) Q Consensus 148 r~ 149 (152) || T Consensus 121 Rk 122 (122) T protein:vir:80 121 SQ 122 (122) T ss_pred eC Confidence 66 No 40 >protein:vir:965 Length: 97 # NCBI annotation: Orf47 # Family: family:all:1410 # MgeID: mge:19 # MgeName: bIL285 # Cross-refs: genbank:acc:NP_076619;genbank:gi:13095727;genbank:GeneID:920260 Probab=70.29 E-value=0.11 Score=25.78 Aligned_cols=92 Identities=12% Similarity=-0.007 Sum_probs=66.3 Q ss_pred eEec-CCCcccCCeecCCCCeeEEEEEEEEeecCCcceeeccccccceeeeEEeeccccccccccccccccccccccccc Q lcl|NC_017981. 16 VIIV-SDSYFKDGVIMPGERMYTKAKFSVQAIKNNEEIQGFAEGRRVSDWRRLYSDTKLPLAGDQLFISASVDVNKDTLA 94 (152) Q Consensus 16 V~r~-~~G~Yv~GrwV~G~~~~~ti~aSVQPi~d~e~~q~lpeGrRitd~~rIYTd~~L~vagd~~~~~a~~~~n~~~l~ 94 (152) |+.. ....-.-|--++ +.....+.+++=||+.+|-.++-..|-+++-...|-. T Consensus 1 ~Li~~~~~~D~~G~q~~-~~~~r~V~~~~ksV~~sEfy~A~q~G~kpe~~~~i~~------------------------- 54 (97) T protein:vir:96 1 MLTPDGYDEDSLGQQIP-KTKKNIVLGYEKPMNRAEFYQAGQSGIEVTHTLVIHP------------------------- 54 (97) T ss_pred CccccceeeccCCCccC-ceeeeEEEEeecccChHHHHHHHhcCCceEEEEEEch------------------------- Confidence 2222 222222366665 5667789999999999999999999999987776644 Q ss_pred ccccccccccccccccCCCCCccEEEECCceEEEEEecchhhCcccceeeeeeeecCC Q lcl|NC_017981. 95 TEDGKVIAVGAYIGEKEGMSNPALVVIDGREYEVIERISWQNGIISHYKYYVVRKTYG 152 (152) Q Consensus 95 ~~~~~~~~~~~~~~~~e~~~n~d~v~~~G~~YeVi~~~~wq~Gii~HykYl~Vr~~d~ 152 (152) ..=.+...|.++|++|.|+.+.+=+++.| +.+|+|..-+ T Consensus 55 ----------------~eY~gE~~v~~eg~~Y~I~RTy~~~~e~i---EL~c~~~~g~ 93 (97) T protein:vir:96 55 ----------------FEYNNEQTLLYQGLLLTVVRHYKTSNEEL---ELVCRLKVGD 93 (97) T ss_pred ----------------hhhCCceEEEECCEEEEEEEeeeCCCCEE---EEEEEeeecC Confidence 12223467899999999999988777765 5778777655 No 41 >protein:vir:94092 Length: 114 # NCBI annotation: ORF044 # Family: family:all:788 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240237;genbank:gi:66395929;genbank:GeneID:5133261 Probab=70.23 E-value=0.084 Score=26.49 Aligned_cols=109 Identities=16% Similarity=0.150 Sum_probs=66.9 Q ss_pred CcCCCCCCccccceeeEecCCCcccCCeecCCC--CeeEEEEEEEEeecCCcceeeccccccceeeeEEeeccccccccc Q lcl|NC_017981. 1 MIGNGPFNKFRKDHTVIIVSDSYFKDGVIMPGE--RMYTKAKFSVQAIKNNEEIQGFAEGRRVSDWRRLYSDTKLPLAGD 78 (152) Q Consensus 1 ~~~~~~~~~FRk~~~V~r~~~G~Yv~GrwV~G~--~~~~ti~aSVQPi~d~e~~q~lpeGrRitd~~rIYTd~~L~vagd 78 (152) +.=-.||..|++--++=+-.. ++|.+-..+ .-.-|+.|-|||++..|.+++-.--. .=-+++|+-..+.++ T Consensus 2 ~~~~~~y~EfpHrITiQ~~~~---v~g~g~~~~~wvd~~T~~A~V~~lt~~Ey~~AqQ~q~--~v~~nvy~rY~~~I~-- 74 (114) T protein:vir:94 2 LFVFNPYDEFPHTISIGSIKK---VGEYPIIQERFVSDKTIKGFMDTPTTSEQLKFHQMSQ--EYDRNLYVPYDLPIS-- 74 (114) T ss_pred eeeecchhhcCceEEEEEEEE---ecCCCCccccccchhheeeeecCCChHHHHHHHhhcC--ceeEEEEEeecCCCC-- Confidence 333579999987655544322 243333322 22447999999999888775543211 123578886666554 Q ss_pred ccccccccccccccccccccccccccccccccCCCCCccEEEECCceEEEEEecchhhCc--ccceeeeeeeecCC Q lcl|NC_017981. 79 QLFISASVDVNKDTLATEDGKVIAVGAYIGEKEGMSNPALVVIDGREYEVIERISWQNGI--ISHYKYYVVRKTYG 152 (152) Q Consensus 79 ~~~~~a~~~~n~~~l~~~~~~~~~~~~~~~~~e~~~n~d~v~~~G~~YeVi~~~~wq~Gi--i~HykYl~Vr~~d~ 152 (152) +-.+|.|.||-|+|++-.-=|+|+ |.-.|--.+=.--| T Consensus 75 ------------------------------------~~mri~yegRi~~I~s~pvDqgg~~ei~~~~~~~~~~~~~ 114 (114) T protein:vir:94 75 ------------------------------------KNNLFEYEGRIFSIEGDSVDQGGQHEIKLLRLKQVPYGKS 114 (114) T ss_pred ------------------------------------cccEEEECCeEEEEeecccccCcceeeeeeeeeeccccCC Confidence 447999999999999998889998 54444322211112 No 42 >protein:vir:5743 Length: 117 # NCBI annotation: hypothetical protein # Family: family:all:11389 # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892054;genbank:gi:33770517;interpro:IPR006453;uniprot:Q7Y406;genbank:GeneID:2637487;interpro:IPR013045 Probab=70.22 E-value=0.21 Score=24.28 Aligned_cols=106 Identities=13% Similarity=0.054 Sum_probs=65.7 Q ss_pred CcCCCCCCccccceeeEecCCCcccCCeecCCCCe-eEEEEEEEEeecCCcceeeccccccceeeeEEeecccccccccc Q lcl|NC_017981. 1 MIGNGPFNKFRKDHTVIIVSDSYFKDGVIMPGERM-YTKAKFSVQAIKNNEEIQGFAEGRRVSDWRRLYSDTKLPLAGDQ 79 (152) Q Consensus 1 ~~~~~~~~~FRk~~~V~r~~~G~Yv~GrwV~G~~~-~~ti~aSVQPi~d~e~~q~lpeGrRitd~~rIYTd~~L~vagd~ 79 (152) ++--| .+|+.-++.++..+.--.|-.++..-. ..++-|+|.|++..|.+.+-.+.--++.-++|-=- T Consensus 5 ~M~aG---~Lr~RItiq~~~~~~D~~G~~~~~~W~~~~~vWA~v~~lsgre~~~A~~~~~~~t~ri~IRyr--------- 72 (117) T protein:vir:57 5 PLNPG---DLNCRVILREVKSGRGPLGEVLPAAPVVMGKAWAKIEPISNRKIRSADQQQIVETCLFTLYPR--------- 72 (117) T ss_pred ccCcc---ccCCcEEEEecccccCCCCCeeccceeeeEEEEEeEEcCcchheeeccccceeeEEEEEEEec--------- Confidence 33334 356666666666665556777765444 45799999999988888776665555544444321 Q ss_pred cccccccccccccccccccccccccccccccCCCCCccEEEECCceEEEEEecchhhCcccceeeee----eeecCC Q lcl|NC_017981. 80 LFISASVDVNKDTLATEDGKVIAVGAYIGEKEGMSNPALVVIDGREYEVIERISWQNGIISHYKYYV----VRKTYG 152 (152) Q Consensus 80 ~~~~a~~~~n~~~l~~~~~~~~~~~~~~~~~e~~~n~d~v~~~G~~YeVi~~~~wq~Gii~HykYl~----Vr~~d~ 152 (152) ++....-.|+|+|+.|+|.++.+.+. .|--.+| ||+.-| T Consensus 73 -------------------------------~dI~~~mRV~~~gr~y~I~~V~d~~~---~~r~Lic~e~~i~~~~~ 115 (117) T protein:vir:57 73 -------------------------------RDISIDWQIVTTAGVFTVRVVDRVSH---ADRILITGEADIRHDRT 115 (117) T ss_pred -------------------------------CCCCcCCEEEECCEEEEEEeccCCCC---CccEEEEEecccccccC Confidence 22334578999999999999987432 1222344 333333 No 43 >protein:vir:4513 Length: 98 # NCBI annotation: unknown # Family: family:all:116 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599039;genbank:gi:19548997;genbank:GeneID:935202 Probab=69.00 E-value=0.23 Score=24.09 Aligned_cols=79 Identities=11% Similarity=0.167 Sum_probs=58.7 Q ss_pred cCCC-CeeEEEEEEEEeecCCcceeeccccccceeeeEEeeccccccccccccccccccccccccccccccccccccccc Q lcl|NC_017981. 30 MPGE-RMYTKAKFSVQAIKNNEEIQGFAEGRRVSDWRRLYSDTKLPLAGDQLFISASVDVNKDTLATEDGKVIAVGAYIG 108 (152) Q Consensus 30 V~G~-~~~~ti~aSVQPi~d~e~~q~lpeGrRitd~~rIYTd~~L~vagd~~~~~a~~~~n~~~l~~~~~~~~~~~~~~~ 108 (152) |+.+ ....++-|.|+|++..+.+.+-.....||-.++|=-- T Consensus 1 ~ep~~~~~~~vWAkI~~~s~~~y~~a~q~~~~vThrItIRyR-------------------------------------- 42 (98) T protein:vir:45 1 MEPQYPVTFRTWAKVIQTSATTWQETAQTGDAITHYITIRYR-------------------------------------- 42 (98) T ss_pred CCCCCCcceeEEEEEeeccccchhhhhhhcccceEEEEEEEc-------------------------------------- Confidence 7764 5577899999999999999888888888887777431 Q ss_pred ccCCCCCccEEEECCceEEEEEecchhhCcccceeeeeee--------ecCC Q lcl|NC_017981. 109 EKEGMSNPALVVIDGREYEVIERISWQNGIISHYKYYVVR--------KTYG 152 (152) Q Consensus 109 ~~e~~~n~d~v~~~G~~YeVi~~~~wq~Gii~HykYl~Vr--------~~d~ 152 (152) .|...-..|+++|+.|.|-++.+++.+ -|||.+. +.-| T Consensus 43 --~gIT~~~rvv~gg~vy~I~~v~D~~~~----rrfL~L~CeElg~~~~~~~ 88 (98) T protein:vir:45 43 --RGITTDYEVVCGDSVYRVKRQRDLNGA----RRFLLLECTELGECRQSHG 88 (98) T ss_pred --cCCCcccEEEECCeEEEEEEecCCCCC----ceEEEEeeeeccccccccC Confidence 234455789999999999999998754 4666521 1112 No 44 >protein:vir:1027 Length: 116 # NCBI annotation: Orf47 # Family: family:all:1030 # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076681;genbank:gi:13095790;genbank:GeneID:920343 Probab=64.08 E-value=0.31 Score=23.40 Aligned_cols=112 Identities=13% Similarity=0.115 Sum_probs=70.7 Q ss_pred CcCCCCCCccccceeeEecCCCcccCCeecCCC-Cee-EEEEEEEEeecCCcceeeccccccceeeeEEeeccccccccc Q lcl|NC_017981. 1 MIGNGPFNKFRKDHTVIIVSDSYFKDGVIMPGE-RMY-TKAKFSVQAIKNNEEIQGFAEGRRVSDWRRLYSDTKLPLAGD 78 (152) Q Consensus 1 ~~~~~~~~~FRk~~~V~r~~~G~Yv~GrwV~G~-~~~-~ti~aSVQPi~d~e~~q~lpeGrRitd~~rIYTd~~L~vagd 78 (152) |.-+=.-+.|++.-+.=......-.+|-.++.. +.. .++-|+++-.+-.++++. .|-.+.+.+.+-- T Consensus 1 m~~~~~p~~ln~ri~FG~~~~~~~~~g~~~~~~~~v~~f~~w~~~~t~t~~q~~~~--~Gt~ledT~~~vI--------- 69 (116) T protein:vir:10 1 MVKTYKPNDFNRKCKIGVTKTVTVPSGGKIEKIDPATVLNVRFAAKMRSLALQFQI--IGTTTADTFDIAI--------- 69 (116) T ss_pred CCCcccccccceeEEeeeeeeeeCCCCCccccceeeeEEEEEEEEeecchheeeee--ccccccCcEEEEE--------- Confidence 877777788888744433333333356665543 333 478888888876666655 4555555444322 Q ss_pred ccccccccccccccccccccccccccccccccCCCCCccEEEECCceEEEEEecchhhCcccceeeeeeeecC-C Q lcl|NC_017981. 79 QLFISASVDVNKDTLATEDGKVIAVGAYIGEKEGMSNPALVVIDGREYEVIERISWQNGIISHYKYYVVRKTY-G 152 (152) Q Consensus 79 ~~~~~a~~~~n~~~l~~~~~~~~~~~~~~~~~e~~~n~d~v~~~G~~YeVi~~~~wq~Gii~HykYl~Vr~~d-~ 152 (152) |+- ...++.=.+.++|+.|+|+...+=+..-..+|-++-+.+.. | T Consensus 70 --------------------------Rh~---~~i~~~m~v~~~g~~Y~I~~Is~Dd~~~~~~yD~iTlk~~~kg 115 (116) T protein:vir:10 70 --------------------------RHN---KLVTKKMFVQIDDVLYNIINISSDESAKLIKFDILTLQAKKKG 115 (116) T ss_pred --------------------------EeC---CCCCcCcEEEECCeEEEEEEeCCCCccCceeeeeEEEEEeecc Confidence 111 11223346899999999999999656668899999876554 4 No 45 >protein:vir:102144 Length: 113 # NCBI annotation: phage head-tail adaptor, putative # Family: family:all:3858 # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699939;genbank:gi:110804032;genbank:GeneID:4206688 Probab=63.61 E-value=0.31 Score=23.34 Aligned_cols=112 Identities=11% Similarity=0.031 Sum_probs=66.8 Q ss_pred cCCCCCCccccceeeEecCCCcccCCeecCCCCeeEEEEEEEEeecCCcceeeccccccceeeeEE-eeccccccccccc Q lcl|NC_017981. 2 IGNGPFNKFRKDHTVIIVSDSYFKDGVIMPGERMYTKAKFSVQAIKNNEEIQGFAEGRRVSDWRRL-YSDTKLPLAGDQL 80 (152) Q Consensus 2 ~~~~~~~~FRk~~~V~r~~~G~Yv~GrwV~G~~~~~ti~aSVQPi~d~e~~q~lpeGrRitd~~rI-YTd~~L~vagd~~ 80 (152) +--| .||+.-++.++....-..|-++++-...-++-|+|.|++..|..++-......+.-++| |. ..|.. T Consensus 1 M~~G---~L~~rI~i~~~~~~~d~~G~~~~~w~~~~~~wA~v~~~~g~E~~~a~~~~~~~~~~f~iRy~-~~i~~----- 71 (113) T protein:vir:10 1 MAEC---RLNERIIIEELAIIQNSNGFEEEKWHEYYRCWSSFKKVKGSKFIAAKADNAENIVTFTIRYC-NKVKI----- 71 (113) T ss_pred CCcc---ccCceEEEEeeeeccCCCCCeecceEeEEEEEEEEEecCchheeeccceeeeeeEEEEEEec-CCCcc----- Confidence 3333 46666677777777777888988766677899999999988776554443333322222 11 11100 Q ss_pred ccccccccccccccccccccccccccccccCCCCCccEEEECCceEEEEEecchhhCcccceeeeeeeec Q lcl|NC_017981. 81 FISASVDVNKDTLATEDGKVIAVGAYIGEKEGMSNPALVVIDGREYEVIERISWQNGIISHYKYYVVRKT 150 (152) Q Consensus 81 ~~~a~~~~n~~~l~~~~~~~~~~~~~~~~~e~~~n~d~v~~~G~~YeVi~~~~wq~Gii~HykYl~Vr~~ 150 (152) .-.....+...+.|+|+.|+|.+..+-...- ....-+|-..+ T Consensus 72 ---------------------------~~~~~it~~~ri~~~g~~y~I~~i~~~~~~~-~~l~i~~~~v~ 113 (113) T protein:vir:10 72 ---------------------------LLDIEAINKFRINFKGHYYKLEYVDDYDQGH-EWVDLKAKIIS 113 (113) T ss_pred ---------------------------cccccCCCCCeEEECCeEEEEEecCCcccCC-eEEEEEEEEeC Confidence 0012334568899999999999887653321 11233344455 No 46 >protein:vir:100886 Length: 113 # NCBI annotation: putative head-tail joining protein # Family: family:all:1030 # MgeID: mge:1473 # MgeName: Lc-Nu # Cross-refs: genbank:acc:YP_358766;genbank:gi:77999992;genbank:GeneID:3726157 Probab=62.02 E-value=0.34 Score=23.13 Aligned_cols=111 Identities=13% Similarity=0.097 Sum_probs=65.7 Q ss_pred cCCCCCCccccceeeEecCCCcccCCeecCCCCeeEEEEEEEEeecCCcceeeccccccceeeeEEeecccccccccccc Q lcl|NC_017981. 2 IGNGPFNKFRKDHTVIIVSDSYFKDGVIMPGERMYTKAKFSVQAIKNNEEIQGFAEGRRVSDWRRLYSDTKLPLAGDQLF 81 (152) Q Consensus 2 ~~~~~~~~FRk~~~V~r~~~G~Yv~GrwV~G~~~~~ti~aSVQPi~d~e~~q~lpeGrRitd~~rIYTd~~L~vagd~~~ 81 (152) +.+=.-+.|++.-+.-......-.+|..+++.....++-|+.+-.+-.++++. .|..+.+.+.+-- T Consensus 1 m~k~~p~~ln~ri~FG~~~t~~~~~G~~~~~f~~~f~~~~~~~t~t~~q~~~~--~Gt~~e~t~~~vI------------ 66 (113) T protein:vir:10 1 MAKFKVADFSRKVDLGSPKSHTTGAGLNITSFVPNYSLHFKQQTRTLTQQYTL--VGTRLDNSITVIV------------ 66 (113) T ss_pred CCcccccccceEEEeeeeecccCCCCcccceeEeeEEEEEEEeecchheeeee--ccccccccEEEEE------------ Confidence 44434455555533322222222358777776666678888888776666544 3554555443322 Q ss_pred cccccccccccccccccccccccccccccCCCCCccEEEECCceEEEEEecchhhCcccceeeeeeee-cCC Q lcl|NC_017981. 82 ISASVDVNKDTLATEDGKVIAVGAYIGEKEGMSNPALVVIDGREYEVIERISWQNGIISHYKYYVVRK-TYG 152 (152) Q Consensus 82 ~~a~~~~n~~~l~~~~~~~~~~~~~~~~~e~~~n~d~v~~~G~~YeVi~~~~wq~Gii~HykYl~Vr~-~d~ 152 (152) |+ .....+.=.+.+||+.|+|+...+=+.--..+|-++-+++ .-| T Consensus 67 -----------------------Rh---~~~it~~m~v~~~g~~Y~I~~Is~Dd~~~~~~yD~iTlkk~~~g 112 (113) T protein:vir:10 67 -----------------------RH---DPRNASQKQARLDGIVYDISDISPDDSNDAIRYDYLTLVKTTKG 112 (113) T ss_pred -----------------------Ee---CCCCCcccEEEECCeEEEEEEeCCCCCCCcceeeeEEEEEEecC Confidence 11 0112233458999999999999994444588999996554 455 No 47 >protein:vir:4343 Length: 118 # NCBI annotation: Orf10 # Family: family:all:116 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061506;genbank:gi:9635594;genbank:GeneID:1262867 Probab=61.96 E-value=0.34 Score=23.13 Aligned_cols=107 Identities=11% Similarity=-0.055 Sum_probs=60.3 Q ss_pred cCCCCCCccccceeeEecCCCc-----ccCCeecCCC-CeeEEEEEEEEeecCCcceeeccccccceeeeEEeecccccc Q lcl|NC_017981. 2 IGNGPFNKFRKDHTVIIVSDSY-----FKDGVIMPGE-RMYTKAKFSVQAIKNNEEIQGFAEGRRVSDWRRLYSDTKLPL 75 (152) Q Consensus 2 ~~~~~~~~FRk~~~V~r~~~G~-----Yv~GrwV~G~-~~~~ti~aSVQPi~d~e~~q~lpeGrRitd~~rIYTd~~L~v 75 (152) +--|. +|+--++.++.... ..-..|++-. +...++-|+|-|++..|...+-..-...+.-++| T Consensus 1 M~~G~---l~~rI~i~~~~~~~d~~~G~~~~~w~~~~~~~~~~~WA~v~~~sg~e~~~a~~~~~~~~~~f~i-------- 69 (118) T protein:vir:43 1 MLAYR---MRHRIQFQRQVHTQDPDTGEETTTWETVLFSGHADLPAEVLTGPGRELIAADATQAETTARINC-------- 69 (118) T ss_pred CCccc---cCccEEEEeeeeecCCCCCcccCceeeeeecccceEEEEEEecCccceeecccchheeeEEEEE-------- Confidence 34444 44444555544332 2223344332 2235899999999988877654444333333333 Q ss_pred cccccccccccccccccccccccccccccccccccCCCCCccEEEECCceEEEEEecchhhCcccceeeeeeeecCC Q lcl|NC_017981. 76 AGDQLFISASVDVNKDTLATEDGKVIAVGAYIGEKEGMSNPALVVIDGREYEVIERISWQNGIISHYKYYVVRKTYG 152 (152) Q Consensus 76 agd~~~~~a~~~~n~~~l~~~~~~~~~~~~~~~~~e~~~n~d~v~~~G~~YeVi~~~~wq~Gii~HykYl~Vr~~d~ 152 (152) ||.+.-++-.+.+.|+|+|+.|+|..+.+=. ...+||.+.-..| T Consensus 70 -----------------------------Ry~~~~~~It~~~Ri~~~g~~y~I~~v~~~~----~~~~~l~i~~~e~ 113 (118) T protein:vir:43 70 -----------------------------RWFPVERLELYTWRVLWDGRVYNITSAETDV----TARREWRLRCSDG 113 (118) T ss_pred -----------------------------EecccccCCCcccEEEECCeEEEEEecCCcc----cCCeEEEEEEEEe Confidence 2222223445568999999999999987432 2235666554444 No 48 >protein:vir:3993 Length: 117 # NCBI annotation: unknown # Family: family:all:1030 # MgeID: mge:319 # MgeName: BK5-T # Cross-refs: genbank:acc:NP_116501;genbank:gi:14251134;genbank:GeneID:921310 Probab=61.10 E-value=0.36 Score=23.02 Aligned_cols=112 Identities=13% Similarity=0.079 Sum_probs=69.0 Q ss_pred CcCCCCCCccccceeeEecCCCcccCCeecCCC-Ce-eEEEEEEEEeecCCcceeeccccccceeeeEEeeccccccccc Q lcl|NC_017981. 1 MIGNGPFNKFRKDHTVIIVSDSYFKDGVIMPGE-RM-YTKAKFSVQAIKNNEEIQGFAEGRRVSDWRRLYSDTKLPLAGD 78 (152) Q Consensus 1 ~~~~~~~~~FRk~~~V~r~~~G~Yv~GrwV~G~-~~-~~ti~aSVQPi~d~e~~q~lpeGrRitd~~rIYTd~~L~vagd 78 (152) |.-+=.-+.|++.-+.-......-.+|--++.. +. ..++-|+++-.+-.++++. .|-.+.+.+.+-- T Consensus 1 m~~~~~p~~~n~ri~Fg~~~~~~~~~g~~~~~~~~~~~f~~w~~~~t~t~~q~~~~--~Gt~~e~T~~~vI--------- 69 (117) T protein:vir:39 1 MVKTYKPNDFNRKCKIGVTKTVTTPTGGKIEKIDPATVLNVRFAAKMRSLALQFQI--IGTTTADTLDIAI--------- 69 (117) T ss_pred CCccccccccceeEEeeeecceecCCCCcccccEEeeEEEEEEEEeecccceeeee--ecccccCcEEEEE--------- Confidence 777777778888755444443333355544432 33 2467888888776565544 3444444443322 Q ss_pred ccccccccccccccccccccccccccccccccCCCCCccEEEECCceEEEEEecchhhCcccceeeeeeeecCC Q lcl|NC_017981. 79 QLFISASVDVNKDTLATEDGKVIAVGAYIGEKEGMSNPALVVIDGREYEVIERISWQNGIISHYKYYVVRKTYG 152 (152) Q Consensus 79 ~~~~~a~~~~n~~~l~~~~~~~~~~~~~~~~~e~~~n~d~v~~~G~~YeVi~~~~wq~Gii~HykYl~Vr~~d~ 152 (152) |+- ...++.=.+.|+|+.|+|+...+=+..-...|-++.+.+.+. T Consensus 70 --------------------------Rh~---~~i~~~m~v~~~g~~Y~I~~Is~Dd~~~~~~yD~iTlk~~~~ 114 (117) T protein:vir:39 70 --------------------------RHN---KLVTKKMCVQIDDVLYNIINISSDESAKLIKFDILTLQAKKK 114 (117) T ss_pred --------------------------EeC---CCCCcccEEEECCeEEEEeEeCCCCccCceeeeeEEEEEeec Confidence 111 112223378999999999999996566688999999866554 No 49 >protein:vir:96131 Length: 111 # NCBI annotation: ORF047 # Family: family:all:788 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240081;genbank:gi:66395773;genbank:GeneID:5133113 Probab=56.51 E-value=0.24 Score=23.93 Aligned_cols=102 Identities=20% Similarity=0.300 Sum_probs=64.0 Q ss_pred CcCCCCCCccccceeeEec-------CCCcccCCeecCCCCeeEEEEEEEEeecCCcceeeccccccceeeeEEeecccc Q lcl|NC_017981. 1 MIGNGPFNKFRKDHTVIIV-------SDSYFKDGVIMPGERMYTKAKFSVQAIKNNEEIQGFAEGRRVSDWRRLYSDTKL 73 (152) Q Consensus 1 ~~~~~~~~~FRk~~~V~r~-------~~G~Yv~GrwV~G~~~~~ti~aSVQPi~d~e~~q~lpeGrRitd~~rIYTd~~L 73 (152) |. .||..| ||+|-.. .++.| -+||+.. |++|-||++..+|..+.-+=-. . =-+-||+-..+ T Consensus 1 ~~--~pyeEF--PhtIt~~~~~~vg~~~~~~--ekfv~~~----T~~~fvdt~T~~E~~k~~Ql~~-~-~d~NlY~pY~~ 68 (111) T protein:vir:96 1 MY--DPFDEY--PHTIEVGKMKLVGKYPNQR--KEFVPER----QMQGFMDTPTTSETLKFHQMNK-T-FDRNLYTRYEL 68 (111) T ss_pred CC--CcchhC--CceEeeeeEEEecCCCCcc--ceeccch----hhhhhcCCCCchhhhhhhhccC-c-cccceeecccC Confidence 32 466666 5555433 44455 3488754 4899999998888776543211 1 12468997777 Q ss_pred cccccccccccccccccccccccccccccccccccccCCCCCccEEEECCceEEEEEecchhhCc--ccceeeeeeeecC Q lcl|NC_017981. 74 PLAGDQLFISASVDVNKDTLATEDGKVIAVGAYIGEKEGMSNPALVVIDGREYEVIERISWQNGI--ISHYKYYVVRKTY 151 (152) Q Consensus 74 ~vagd~~~~~a~~~~n~~~l~~~~~~~~~~~~~~~~~e~~~n~d~v~~~G~~YeVi~~~~wq~Gi--i~HykYl~Vr~~d 151 (152) +|. |-++|.|.|+-|.|+.---=|.|+ |.-.|--.+=.-- T Consensus 69 ~I~--------------------------------------k~~~~~yegri~~i~g~PVDqgG~hEI~~~k~~~~~~gk 110 (111) T protein:vir:96 69 PIN--------------------------------------KEDIFKYEGRIYQVVGYPVDQGGMHEVNLTRLQEVPNGQ 110 (111) T ss_pred CCC--------------------------------------cccEEEECCeEEEEeeCcccccccceeeeeeeeeccccC Confidence 764 448999999999999988888887 5444432221111 Q ss_pred C Q lcl|NC_017981. 152 G 152 (152) Q Consensus 152 ~ 152 (152) | T Consensus 111 g 111 (111) T protein:vir:96 111 G 111 (111) T ss_pred C Confidence 2 No 50 >protein:vir:96830 Length: 111 # NCBI annotation: ORF043 # Family: family:all:788 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240160;genbank:gi:66395847;genbank:GeneID:5133171 Probab=55.15 E-value=0.18 Score=24.68 Aligned_cols=104 Identities=22% Similarity=0.293 Sum_probs=66.4 Q ss_pred CcCCCCCCccccceeeEecC---CCccc--CCeecCCCCeeEEEEEEEEeecCCcceeeccccccceeeeEEeecccccc Q lcl|NC_017981. 1 MIGNGPFNKFRKDHTVIIVS---DSYFK--DGVIMPGERMYTKAKFSVQAIKNNEEIQGFAEGRRVSDWRRLYSDTKLPL 75 (152) Q Consensus 1 ~~~~~~~~~FRk~~~V~r~~---~G~Yv--~GrwV~G~~~~~ti~aSVQPi~d~e~~q~lpeGrRitd~~rIYTd~~L~v 75 (152) |. .||..| ||||-..+ =|.|- .-+||+.. |++|-|++++.+|.++.-+=-. . =-+-||+-..|++ T Consensus 1 ~~--~pyeEF--PhtIt~~~~e~vg~~~~~~~~fv~~~----t~~~fmdt~T~~E~~k~~Qm~~-~-~d~NlY~PY~~~I 70 (111) T protein:vir:96 1 MF--DPYNEF--PHTISIGVIKNVGEYPIIKERFVSEK----QINGFMDTPSTSKQLKFHQMTK-D-YDRNLYVPYDLPI 70 (111) T ss_pred CC--CcchhC--CCeEEEEEEEeecCCCceeeeecchh----hhhhhcCCCCchhhheeecccC-c-cccceeecccCCC Confidence 43 577777 66665543 12221 24676643 4899999999888876543211 1 1246899888877 Q ss_pred cccccccccccccccccccccccccccccccccccCCCCCccEEEECCceEEEEEecchhhCc--ccceeeeeeeecCC Q lcl|NC_017981. 76 AGDQLFISASVDVNKDTLATEDGKVIAVGAYIGEKEGMSNPALVVIDGREYEVIERISWQNGI--ISHYKYYVVRKTYG 152 (152) Q Consensus 76 agd~~~~~a~~~~n~~~l~~~~~~~~~~~~~~~~~e~~~n~d~v~~~G~~YeVi~~~~wq~Gi--i~HykYl~Vr~~d~ 152 (152) +- -++|.|.|+-|.|+.-.-=|.|+ |.-+|--.+=---| T Consensus 71 ~k--------------------------------------~~~~~Yegri~~i~gdpvDqgG~~EI~~~r~k~~~~gkg 111 (111) T protein:vir:96 71 AT--------------------------------------NNLFEYEGRIYGIVGDSIDQGGQREIKMYRLKQVPYGKG 111 (111) T ss_pred Cc--------------------------------------ccEEEECCeEEEEecCCccccccceeeeeeeeeccccCC Confidence 53 47999999999999988888888 55555332222222 No 51 >protein:vir:3872 Length: 146 # NCBI annotation: putative head-tail joining protein # Family: family:all:28619 # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680489;swissprot:trembl:p94213;genbank:gi:22296529;uniprot:P94213;genbank:GeneID:951708 Probab=52.77 E-value=0.55 Score=22.03 Aligned_cols=103 Identities=9% Similarity=-0.009 Sum_probs=58.4 Q ss_pred CcCCCCCCccccceeeEecCCCcccCCeecCCCCeeEEEEEEEEeecCCcceeeccc--ccc--ceeeeEEeeccccccc Q lcl|NC_017981. 1 MIGNGPFNKFRKDHTVIIVSDSYFKDGVIMPGERMYTKAKFSVQAIKNNEEIQGFAE--GRR--VSDWRRLYSDTKLPLA 76 (152) Q Consensus 1 ~~~~~~~~~FRk~~~V~r~~~G~Yv~GrwV~G~~~~~ti~aSVQPi~d~e~~q~lpe--GrR--itd~~rIYTd~~L~va 76 (152) |.-- ..+|+.-++.+...+.-..|-++++....-++-|+|.|++..|-.+..-+ .+. ++=.+| |.. T Consensus 37 ~M~~---gkLn~RItfqk~~~~~d~~g~~~~~w~~v~tvWA~V~~~~grE~~~~~a~~~~~e~ti~F~IR-Y~~------ 106 (146) T protein:vir:38 37 LMRI---NRMTERIAFVSYESKKVNGVPVDGVIVKHMTVWAEVPKVPIREANDPQTKLGTRKDSPTFLVR-FLT------ 106 (146) T ss_pred eecc---ccCCccEEEEEeeeeecCCCcCCCcceeeeEEEEeeeccchhhhHhhhhhhhhhcceeEEEEE-ecC------ Confidence 2233 36677766666655443335555555556689999999987776433222 111 111122 321 Q ss_pred ccccccccccccccccccccccccccccccccccCCCCCccEEEECCceEEEEEecchhhCcccceeeee---eeec Q lcl|NC_017981. 77 GDQLFISASVDVNKDTLATEDGKVIAVGAYIGEKEGMSNPALVVIDGREYEVIERISWQNGIISHYKYYV---VRKT 150 (152) Q Consensus 77 gd~~~~~a~~~~n~~~l~~~~~~~~~~~~~~~~~e~~~n~d~v~~~G~~YeVi~~~~wq~Gii~HykYl~---Vr~~ 150 (152) .++-.|--.|.|+|+.|+|....+= -.|..|+. ..-+ T Consensus 107 ---------------------------------~~~I~~~mRI~y~gk~YeI~~I~pd----~~~k~~~~I~akeVS 146 (146) T protein:vir:38 107 ---------------------------------AEEIQPTWRIQWRGNEYQITGLDPD----YERRDLTTITAKAVS 146 (146) T ss_pred ---------------------------------CccCCcccEEEECCeEEEEeeeCCc----cccCcEEEEEEEEeC Confidence 1233445689999999999997543 34555553 3444 No 52 >protein:vir:7411 Length: 116 # NCBI annotation: hypothetical protein # Family: family:all:1030 # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839928;genbank:gi:30089898;genbank:GeneID:1260685 Probab=52.40 E-value=0.56 Score=21.98 Aligned_cols=112 Identities=13% Similarity=0.095 Sum_probs=67.6 Q ss_pred CcCCCCCCccccceeeEecCCCcccCCeecCCC-Cee-EEEEEEEEeecCCcceeeccccccceeeeEEeeccccccccc Q lcl|NC_017981. 1 MIGNGPFNKFRKDHTVIIVSDSYFKDGVIMPGE-RMY-TKAKFSVQAIKNNEEIQGFAEGRRVSDWRRLYSDTKLPLAGD 78 (152) Q Consensus 1 ~~~~~~~~~FRk~~~V~r~~~G~Yv~GrwV~G~-~~~-~ti~aSVQPi~d~e~~q~lpeGrRitd~~rIYTd~~L~vagd 78 (152) |.-+=.-+.|++.-+.=......-.+|-.++.. +.. .++-|+++-.+-.++++. .|-.+.+.+.+-- T Consensus 1 M~~~~~p~~ln~ri~Fg~~~~~~n~~g~~~~~~~~~~~f~~w~a~~t~t~~q~~~~--~Gt~~edT~~~vI--------- 69 (116) T protein:vir:74 1 MAKTYKPNDFNRKCKIGVTKTVTTPTGGKIEKIDPATVLNVRFAAKMRSLALQFQI--IGTTTADTFDIAI--------- 69 (116) T ss_pred CcccccccccceeEEeeeeeeeeCCCCCcccceEEeeeEEEEEEEeecchheeeee--ccccccCcEEEEE--------- Confidence 666655677777644433333333355555542 332 367788877776666644 4555555444322 Q ss_pred ccccccccccccccccccccccccccccccccCCCCCccEEEECCceEEEEEecchhhCcccceeeeeeeecC-C Q lcl|NC_017981. 79 QLFISASVDVNKDTLATEDGKVIAVGAYIGEKEGMSNPALVVIDGREYEVIERISWQNGIISHYKYYVVRKTY-G 152 (152) Q Consensus 79 ~~~~~a~~~~n~~~l~~~~~~~~~~~~~~~~~e~~~n~d~v~~~G~~YeVi~~~~wq~Gii~HykYl~Vr~~d-~ 152 (152) |+- ...++.=.+.++|+.|+|+...+=+..-..+|-++-+.+.. | T Consensus 70 --------------------------Rh~---~~i~~~m~v~~~g~~Y~Iv~Is~Dd~~~~~~yD~iTlk~~~~g 115 (116) T protein:vir:74 70 --------------------------RHN---KLVTKKMFVQIDDVLYNIINISSDESAKLIKFDILTLQAKKKG 115 (116) T ss_pred --------------------------EeC---CCCCcCcEEEECCeEEEEEEeCCCCccCcceeeeEEEEEEeec Confidence 111 12233346899999999999998555568899999876554 4 No 53 >protein:vir:107100 Length: 109 # NCBI annotation: conserved phage protein # Family: family:all:788 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950609;genbank:gi:119953689;genbank:GeneID:4643109 Probab=49.16 E-value=0.32 Score=23.27 Aligned_cols=102 Identities=24% Similarity=0.312 Sum_probs=65.8 Q ss_pred CcCCCCCCccccceeeEecC---CCccc--CCeecCCCCeeEEEEEEEEeecCCcceeeccccccceeeeEEeecccccc Q lcl|NC_017981. 1 MIGNGPFNKFRKDHTVIIVS---DSYFK--DGVIMPGERMYTKAKFSVQAIKNNEEIQGFAEGRRVSDWRRLYSDTKLPL 75 (152) Q Consensus 1 ~~~~~~~~~FRk~~~V~r~~---~G~Yv--~GrwV~G~~~~~ti~aSVQPi~d~e~~q~lpeGrRitd~~rIYTd~~L~v 75 (152) |. .||..| ||+|-..+ =|.|- .-+||+. .|++|=||++..+|.++.-+=-. .=-+-|||-..|++ T Consensus 1 ~~--~pyeEF--PhtIt~~~~e~vg~~~~~~~~fv~~----~T~~~fmdt~T~~E~~k~~Qm~~--~~d~NlYtPY~~~I 70 (109) T protein:vir:10 1 MF--NPLNEF--PHTIELGSREVVGEYPRERERFKSE----KTIQGFMDTPTSSEQLKFHQMNQ--SYDRNLYTPYSLPI 70 (109) T ss_pred CC--CchhhC--CCeEeeeeEEEecCCCceeeeeccc----hhhhhhcCCCCchhhhhhhhccC--ccccceeecccCCC Confidence 43 477777 66665543 12221 2456654 35899999998888776543211 11246899877776 Q ss_pred cccccccccccccccccccccccccccccccccccCCCCCccEEEECCceEEEEEecchhhCc--ccceeeeeeeecCC Q lcl|NC_017981. 76 AGDQLFISASVDVNKDTLATEDGKVIAVGAYIGEKEGMSNPALVVIDGREYEVIERISWQNGI--ISHYKYYVVRKTYG 152 (152) Q Consensus 76 agd~~~~~a~~~~n~~~l~~~~~~~~~~~~~~~~~e~~~n~d~v~~~G~~YeVi~~~~wq~Gi--i~HykYl~Vr~~d~ 152 (152) + |-++|.|+|+-|+|+.---=|.|+ |+-.|- -....| T Consensus 71 ~--------------------------------------k~~~~~Yegki~~v~g~PVDqgG~hEI~~~r~--k~~~~g 109 (109) T protein:vir:10 71 T--------------------------------------NKNLFKYNGKTYEVVGEPVDQGGQQEINLTRL--KECPIG 109 (109) T ss_pred C--------------------------------------cccEEEECCeEEEEeeCcccccccceeeeeee--eeecCC Confidence 5 448999999999999988888887 554443 333455 No 54 >protein:vir:105322 Length: 109 # NCBI annotation: conserved phage protein # Family: family:all:788 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950672;genbank:gi:119967842;genbank:GeneID:4643201 Probab=49.03 E-value=0.32 Score=23.28 Aligned_cols=101 Identities=24% Similarity=0.305 Sum_probs=65.8 Q ss_pred CcCCCCCCccccceeeEecCCCcccC------CeecCCCCeeEEEEEEEEeecCCcceeeccccccceeeeEEeeccccc Q lcl|NC_017981. 1 MIGNGPFNKFRKDHTVIIVSDSYFKD------GVIMPGERMYTKAKFSVQAIKNNEEIQGFAEGRRVSDWRRLYSDTKLP 74 (152) Q Consensus 1 ~~~~~~~~~FRk~~~V~r~~~G~Yv~------GrwV~G~~~~~ti~aSVQPi~d~e~~q~lpeGrRitd~~rIYTd~~L~ 74 (152) |. .||..| ||+|-..+ =.-|+ -+||+. .|++|=|+++..+|.++.-+=-. .=-+-|||-..|+ T Consensus 1 ~~--~pyeEF--PhtIt~~~-~e~vg~~~~~~~~fv~~----~T~~~fmdt~T~~E~~k~~Qm~~--~~d~NlYtPY~~~ 69 (109) T protein:vir:10 1 MF--NPLNEF--PHTIELGS-REVVGEYPREQERFKSE----KTIQGFMDTPTSSEQLKFHQMNQ--SYDRNLYTPYSLP 69 (109) T ss_pred CC--CchhhC--CCeEeeee-EEEecCCCceeeeeccc----hhhhhhcCCCCchhhhhhhhccC--ccccceeecccCC Confidence 43 477777 66665543 12223 356654 35899999998888776543211 1124689987777 Q ss_pred ccccccccccccccccccccccccccccccccccccCCCCCccEEEECCceEEEEEecchhhCc--ccceeeeeeeecCC Q lcl|NC_017981. 75 LAGDQLFISASVDVNKDTLATEDGKVIAVGAYIGEKEGMSNPALVVIDGREYEVIERISWQNGI--ISHYKYYVVRKTYG 152 (152) Q Consensus 75 vagd~~~~~a~~~~n~~~l~~~~~~~~~~~~~~~~~e~~~n~d~v~~~G~~YeVi~~~~wq~Gi--i~HykYl~Vr~~d~ 152 (152) ++ |-++|.|+|+-|+|+.---=|.|+ |+-.|- -....| T Consensus 70 I~--------------------------------------k~~~~~Yegki~~v~g~PVDqgG~~EI~~~r~--~~~~~g 109 (109) T protein:vir:10 70 IT--------------------------------------NTNLFKYNGKTYEVVGEPVDQGGQQEINLTRL--RECPIG 109 (109) T ss_pred CC--------------------------------------cccEEEECCeEEEEeeCcccccccceeeeeee--eeecCC Confidence 65 448999999999999988888887 554442 333455 No 55 >protein:vir:95373 Length: 101 # NCBI annotation: hypothetical protein # Family: family:all:1410 # MgeID: mge:1567 # MgeName: GBSV1 # Cross-refs: genbank:acc:YP_764479;genbank:gi:115334633;genbank:GeneID:5179260 Probab=44.67 E-value=0.8 Score=21.12 Aligned_cols=100 Identities=19% Similarity=0.145 Sum_probs=69.4 Q ss_pred CcCCCCCCccccceeeEecCCCcccCCeecCCCCeeEEEEEEEEeecCCcceeeccccccceeeeEEeeccccccccccc Q lcl|NC_017981. 1 MIGNGPFNKFRKDHTVIIVSDSYFKDGVIMPGERMYTKAKFSVQAIKNNEEIQGFAEGRRVSDWRRLYSDTKLPLAGDQL 80 (152) Q Consensus 1 ~~~~~~~~~FRk~~~V~r~~~G~Yv~GrwV~G~~~~~ti~aSVQPi~d~e~~q~lpeGrRitd~~rIYTd~~L~vagd~~ 80 (152) |. |-..-+.+-.....-.-|--++. +...++.+++=+++.+|-.++-..|-++.-...|-+ T Consensus 1 M~-------~~~~i~Li~~~~~~D~~G~~~~~-~~~r~V~~~~ksV~~sEfy~A~~~G~kpe~~~~i~~----------- 61 (101) T protein:vir:95 1 MT-------YDNELVLIAQEFVEDEIGNQIPI-ETRKTVLCNVKSVGRNEFYSAATSGLRPSVVFVVHR----------- 61 (101) T ss_pred Cc-------ccceEEecceeeeecCCCccccc-cceeEEEEEecccChhHHHHHHhcCCceEEEEEEeh----------- Confidence 32 33333333333333344777775 347889999999999999999999999987777755 Q ss_pred ccccccccccccccccccccccccccccccCCCCCccEEEECCceEEEEEecchhhCcccceeeeeee-ecCC Q lcl|NC_017981. 81 FISASVDVNKDTLATEDGKVIAVGAYIGEKEGMSNPALVVIDGREYEVIERISWQNGIISHYKYYVVR-KTYG 152 (152) Q Consensus 81 ~~~a~~~~n~~~l~~~~~~~~~~~~~~~~~e~~~n~d~v~~~G~~YeVi~~~~wq~Gii~HykYl~Vr-~~d~ 152 (152) ..=.+...|.++|++|.|+.+-+=+++.|. -+|+| -.|| T Consensus 62 ------------------------------~eY~gE~~v~~eg~~Y~I~RTy~~~~e~iE---L~c~~~vg~g 101 (101) T protein:vir:95 62 ------------------------------YEYSGESEVEFEGIRYRVIRTYAVDFEEVE---LTCERVLAYG 101 (101) T ss_pred ------------------------------hhcCCCeEEEECCEEEEEEEeeeCCCCEEE---EEEEEeecCC Confidence 122344789999999999999877777753 44544 4677 No 56 >protein:vir:1242 Length: 111 # NCBI annotation: similar to phage Spp1 gp16 # Family: family:all:788 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510941;genbank:gi:17426275;genbank:GeneID:927372 Probab=44.39 E-value=0.72 Score=21.38 Aligned_cols=104 Identities=18% Similarity=0.171 Sum_probs=62.4 Q ss_pred CCCCCCccccceeeEec-CCCcccCC--eecCCCCeeEEEEEEEEeecCCcceeeccccccceeeeEEeecccccccccc Q lcl|NC_017981. 3 GNGPFNKFRKDHTVIIV-SDSYFKDG--VIMPGERMYTKAKFSVQAIKNNEEIQGFAEGRRVSDWRRLYSDTKLPLAGDQ 79 (152) Q Consensus 3 ~~~~~~~FRk~~~V~r~-~~G~Yv~G--rwV~G~~~~~ti~aSVQPi~d~e~~q~lpeGrRitd~~rIYTd~~L~vagd~ 79 (152) =-.||..||+--++-.- .-|.+-+| +|++ ..|+.|-|+|++..|.+++-.-= -.--.++|+-..+.++ T Consensus 1 ~~~~~~EfpH~ITiqk~~~v~~~g~~~e~w~d----~~T~wA~v~~ltg~Ey~~AqQ~Q--~ev~~ri~irY~~~I~--- 71 (111) T protein:vir:12 1 MFNPFDEFPHTIEIGEVEVVGTHPKEYERFKS----NETIKGFMDTPTSSETLKFHQMS--KDFDRNLYTPYHIPIT--- 71 (111) T ss_pred CCCchhhCCceEEEEEEEEecCCCCcCccccc----hhheeeeccCcChHHHHHHHhhc--CcceeEEEEeecCCCC--- Confidence 23688899875444332 11222222 2443 34699999999988877554311 1223577775555443 Q ss_pred cccccccccccccccccccccccccccccccCCCCCccEEEECCceEEEEEecchhhCcccceeeeeeeecCC Q lcl|NC_017981. 80 LFISASVDVNKDTLATEDGKVIAVGAYIGEKEGMSNPALVVIDGREYEVIERISWQNGIISHYKYYVVRKTYG 152 (152) Q Consensus 80 ~~~~a~~~~n~~~l~~~~~~~~~~~~~~~~~e~~~n~d~v~~~G~~YeVi~~~~wq~Gii~HykYl~Vr~~d~ 152 (152) +-.+|.|.||.|+|++..-=|.|. |=--+...+-+| T Consensus 72 -----------------------------------~~mri~y~gRif~I~s~~vD~~g~--hei~~~~~~~~~ 107 (111) T protein:vir:12 72 -----------------------------------NKTLFNYEGKTYEVVGEPVDQGGQ--HEINLTRLRVRS 107 (111) T ss_pred -----------------------------------ccceEEECCeEEEEeeeecCcccc--eeeeeeeeeecc Confidence 347899999999999998888887 211122233333 No 57 >protein:vir:105035 Length: 112 # NCBI annotation: Gp9 # Family: family:all:11389 # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006589;genbank:gi:46402095;genbank:GeneID:2777900 Probab=29.68 E-value=1.6 Score=19.41 Aligned_cols=104 Identities=12% Similarity=-0.059 Sum_probs=63.1 Q ss_pred Cc-CCCCCCccccceeeEecCCCcccCCeecCCC-CeeEEEEEEEEeecCCcceeeccccccceeeeEEeeccccccccc Q lcl|NC_017981. 1 MI-GNGPFNKFRKDHTVIIVSDSYFKDGVIMPGE-RMYTKAKFSVQAIKNNEEIQGFAEGRRVSDWRRLYSDTKLPLAGD 78 (152) Q Consensus 1 ~~-~~~~~~~FRk~~~V~r~~~G~Yv~GrwV~G~-~~~~ti~aSVQPi~d~e~~q~lpeGrRitd~~rIYTd~~L~vagd 78 (152) |- --| .+|+.-++.++....-..|-++... ....++-|+|.|++..|.+.+-...-.++.-++|-=- T Consensus 1 m~M~aG---~L~~RItiq~~~~~~D~~G~~~~~~W~~~~tvWA~v~~~sg~E~~~A~~~~~~~t~~f~IRyr-------- 69 (112) T protein:vir:10 1 MSLKPG---DMNCRITIGYLQSGRGPLGEPLPEKLVESGKAWAKRELVSGRKVRTMDQQQVMETCLFTVYPG-------- 69 (112) T ss_pred CCCCcc---ccCCcEEEEeeeeccCCCCCEeccceeeeEEEEEEEEccCchheeecccccceeeEEEEEeeC-------- Confidence 31 122 4677777777776665567776554 5566899999999988877665554444433333221 Q ss_pred ccccccccccccccccccccccccccccccccCCCCCccEEEECCceEEEEEecchhhCcccceeeee----eeecC-C Q lcl|NC_017981. 79 QLFISASVDVNKDTLATEDGKVIAVGAYIGEKEGMSNPALVVIDGREYEVIERISWQNGIISHYKYYV----VRKTY-G 152 (152) Q Consensus 79 ~~~~~a~~~~n~~~l~~~~~~~~~~~~~~~~~e~~~n~d~v~~~G~~YeVi~~~~wq~Gii~HykYl~----Vr~~d-~ 152 (152) ++-.....|+|+|+.|.|.++.+- .+. -+|| +|+.- | T Consensus 70 --------------------------------~dI~~~mRI~~~gr~y~I~~Vd~~-~~r----~llc~E~~~~~~~~~ 111 (112) T protein:vir:10 70 --------------------------------VVVDIDWKITTKDLVYTVRNIDRK-TDQ----IIITGEADGRHDRTG 111 (112) T ss_pred --------------------------------CCCCcccEEEECCeEEEEecccCC-CCc----EEEEeecccccccCC Confidence 223445789999999999987432 222 2444 22211 1 No 58 >protein:vir:94767 Length: 104 # NCBI annotation: unknown # Family: family:all:1270 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996709;genbank:gi:45597424;genbank:GeneID:2769039 Probab=28.73 E-value=1.5 Score=19.64 Aligned_cols=90 Identities=18% Similarity=0.195 Sum_probs=55.6 Q ss_pred eeeEe-cCCCcccCCeecCCCCeeEEEEEEEEeecCCcc-eeeccccccceeeeEEeecccccccccccccccccccccc Q lcl|NC_017981. 14 HTVII-VSDSYFKDGVIMPGERMYTKAKFSVQAIKNNEE-IQGFAEGRRVSDWRRLYSDTKLPLAGDQLFISASVDVNKD 91 (152) Q Consensus 14 ~~V~r-~~~G~Yv~GrwV~G~~~~~ti~aSVQPi~d~e~-~q~lpeGrRitd~~rIYTd~~L~vagd~~~~~a~~~~n~~ 91 (152) -++++ ...|.--.|-++-....++---.-|+|.+..+. .+..|.|.++. ||= + T Consensus 1 Vtl~~~~~~G~D~~g~pi~~~~~e~V~nVLV~P~s~~d~~~~~~p~G~~v~-----~tl-----a--------------- 55 (104) T protein:vir:94 1 MTLIDKVETGKDPFGNPIYEDKEIVVNNVLVSPTSSDDIVNQLTLTGKKAI-----YTL-----A--------------- 55 (104) T ss_pred CEeccceecCcCCCCCCcccccceEcCceeeCCCChhhcccccCcCceEEE-----EEE-----e--------------- Confidence 33333 366665568888887666666778999875555 55688888654 551 0 Q ss_pred ccccccccccccccccccc-CCCCCccEEEECCceEEEEEec----------chhhCc-cccee Q lcl|NC_017981. 92 TLATEDGKVIAVGAYIGEK-EGMSNPALVVIDGREYEVIERI----------SWQNGI-ISHYK 143 (152) Q Consensus 92 ~l~~~~~~~~~~~~~~~~~-e~~~n~d~v~~~G~~YeVi~~~----------~wq~Gi-i~Hyk 143 (152) +++. .+.+-+.-|.+.|+.|+|+.-- .|.+-+ +.+|- T Consensus 56 ---------------~PK~~~~~l~g~~V~~~G~~~~vvGdP~~~~~~~~P~~WN~~V~ver~~ 104 (104) T protein:vir:94 56 ---------------IPKKDTHDWENKKVRFFGKTWRTFGEPLEGIEELIPLDWNKKVTVEHYG 104 (104) T ss_pred ---------------cCCCCCCcccCceEEEeCcEEEEecCCccccCCcCCcccCCeEEEEEeC Confidence 1111 2345567899999999998754 454443 33333 No 59 >protein:vir:94797 Length: 111 # NCBI annotation: ORF040 # Family: family:all:788 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240539;genbank:gi:66396230;genbank:GeneID:5133577 Probab=27.85 E-value=1.4 Score=19.84 Aligned_cols=102 Identities=20% Similarity=0.222 Sum_probs=61.5 Q ss_pred CCCCCCccccceeeEec-CCCcccCC--eecCCCCeeEEEEEEEEeecCCcceeeccccccceeeeEEeecccccccccc Q lcl|NC_017981. 3 GNGPFNKFRKDHTVIIV-SDSYFKDG--VIMPGERMYTKAKFSVQAIKNNEEIQGFAEGRRVSDWRRLYSDTKLPLAGDQ 79 (152) Q Consensus 3 ~~~~~~~FRk~~~V~r~-~~G~Yv~G--rwV~G~~~~~ti~aSVQPi~d~e~~q~lpeGrRitd~~rIYTd~~L~vagd~ 79 (152) =-.||..|++--++=+- .-|.|-+| +|++-. |+.|-|||++.+|.+++-.--. .--++||+-..+.++ T Consensus 1 ~~~~y~efpHtItiqk~~~v~d~g~~~e~wvd~~----Tv~A~v~~ltg~Ey~~aqQmq~--~v~~niy~rY~~~I~--- 71 (111) T protein:vir:94 1 MFNPFDEFPHTIEIGEVEVVGTYPKEYERFKSKE----TIKGFMDTPTSSETLKFHQMSK--DFDRNLYTPYHIPIT--- 71 (111) T ss_pred CCCcchhcCceEEEEEEEEecCCCccceeecchh----HhhhhccCCChHHHHHHhhhcC--ccceEEEEeeeCCCC--- Confidence 23588888764333221 11222222 366533 4899999999888776544211 234678886666554 Q ss_pred cccccccccccccccccccccccccccccccCCCCCccEEEECCceEEEEEecchhhCc--ccceeeeeeeecCC Q lcl|NC_017981. 80 LFISASVDVNKDTLATEDGKVIAVGAYIGEKEGMSNPALVVIDGREYEVIERISWQNGI--ISHYKYYVVRKTYG 152 (152) Q Consensus 80 ~~~~a~~~~n~~~l~~~~~~~~~~~~~~~~~e~~~n~d~v~~~G~~YeVi~~~~wq~Gi--i~HykYl~Vr~~d~ 152 (152) +-.++.|.||-|+|++-.-=|.|+ |.-.| .+-+| T Consensus 72 -----------------------------------~~mri~yegRi~~I~s~pvDqgg~hei~~~~----~k~~~ 107 (111) T protein:vir:94 72 -----------------------------------NKTLFNYEGKTYEVVGEPVDQGGQHEINLTR----LRVRL 107 (111) T ss_pred -----------------------------------ccceEEECCeEEEEeccccCCCcchhhhhhe----eeecc Confidence 347999999999999976667776 33322 23333 No 60 >protein:vir:97328 Length: 111 # NCBI annotation: ORF043 # Family: family:all:788 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240614;genbank:gi:66396306;genbank:GeneID:5133684 Probab=24.38 E-value=1.8 Score=19.13 Aligned_cols=102 Identities=18% Similarity=0.210 Sum_probs=61.7 Q ss_pred CCCCCCccccceeeEec-CCCcccCC--eecCCCCeeEEEEEEEEeecCCcceeeccccccceeeeEEeecccccccccc Q lcl|NC_017981. 3 GNGPFNKFRKDHTVIIV-SDSYFKDG--VIMPGERMYTKAKFSVQAIKNNEEIQGFAEGRRVSDWRRLYSDTKLPLAGDQ 79 (152) Q Consensus 3 ~~~~~~~FRk~~~V~r~-~~G~Yv~G--rwV~G~~~~~ti~aSVQPi~d~e~~q~lpeGrRitd~~rIYTd~~L~vagd~ 79 (152) =-.||..|++--++=+- .-|.|-+| +|++-. |+.|-|||++.+|.+++-.--. .--++||+-..+.++ T Consensus 1 ~~~~y~efpHtItiQk~~~v~d~g~~~e~wvd~~----Tv~A~v~~ltg~Ey~~aqQmq~--~v~~niy~rY~~~I~--- 71 (111) T protein:vir:97 1 MFNPFNEFPHTIEIGEIEVVGTYPKEYERFKSNK----TIKGFMDTPTSSETLKFHQMSK--DFDRNLYTPYHIPIT--- 71 (111) T ss_pred CCCcchhCCceEEEEEEEEecCCCccceeecchh----hhhhcccCCChHHHHHHhhhcC--ccceEEEEeeeCCCC--- Confidence 23689999764333221 11222222 366533 4899999999888776544211 234678886666554 Q ss_pred cccccccccccccccccccccccccccccccCCCCCccEEEECCceEEEEEecchhhCc--ccceeeeeeeecCC Q lcl|NC_017981. 80 LFISASVDVNKDTLATEDGKVIAVGAYIGEKEGMSNPALVVIDGREYEVIERISWQNGI--ISHYKYYVVRKTYG 152 (152) Q Consensus 80 ~~~~a~~~~n~~~l~~~~~~~~~~~~~~~~~e~~~n~d~v~~~G~~YeVi~~~~wq~Gi--i~HykYl~Vr~~d~ 152 (152) +-.++.|.||-|+|++-.-=|.|+ |.-. ..+-+| T Consensus 72 -----------------------------------~~mri~yegRi~~I~s~pvDqgg~hei~~~----~~k~~~ 107 (111) T protein:vir:97 72 -----------------------------------NKTLFNYEGKTYKVVGEPVDQGGQHEINLT----RLRVRP 107 (111) T ss_pred -----------------------------------ccceEEECCeEEEEeccccCCCcchhhhhh----eeeecc Confidence 347899999999999966667776 3322 223333 No 61 >protein:vir:94713 Length: 785 # NCBI annotation: tail tube # Family: family:all:825 # MgeID: mge:1528 # MgeName: K1F # Cross-refs: genbank:acc:YP_338122;genbank:gi:77118200;genbank:GeneID:3707736 Probab=21.58 E-value=2.6 Score=18.33 Aligned_cols=147 Identities=10% Similarity=-0.020 Sum_probs=51.3 Q ss_pred CcCCCC-----CCccccceeeEecCCCccc-CC-eecCCCCeeE------EEEEEEEeec-CCcceeeccccccceeeeE Q lcl|NC_017981. 1 MIGNGP-----FNKFRKDHTVIIVSDSYFK-DG-VIMPGERMYT------KAKFSVQAIK-NNEEIQGFAEGRRVSDWRR 66 (152) Q Consensus 1 ~~~~~~-----~~~FRk~~~V~r~~~G~Yv-~G-rwV~G~~~~~------ti~aSVQPi~-d~e~~q~lpeGrRitd~~r 66 (152) ..+.-+ --+|++... +-.+.++|+ .| ..+....... .....+||+. +...++..+.|+++.-.+- T Consensus 391 ~~~~~~~~i~~~v~~~~~L~-l~T~~~e~~l~~~~~lTP~~~~~~~~s~~~~~~~~~Pv~vg~~v~f~~~~g~~~~v~r~ 469 (785) T protein:vir:94 391 VSHPRISILKYAVPFSEQLL-LWSDEVQFVMTSSGVLTSKSIQLDVGSEFALGDNARPFAVGRSVFFSAPRGSFTSIKRY 469 (785) T ss_pred ecCCcceeeEEEeecCCcEE-EEecCcEEEEcCCCcccceeEEEEEEEeeeccCCCCceEeCCeEEEEecCCCeeEEEee Confidence 111111 124666633 445555665 22 2222221111 1334577754 5567777778875554444 Q ss_pred Ee-ecccc-cccccccccccccccccccccccccccccccccccccCCCCCccEE----EECCceEEEEEecchh---hC Q lcl|NC_017981. 67 LY-SDTKL-PLAGDQLFISASVDVNKDTLATEDGKVIAVGAYIGEKEGMSNPALV----VIDGREYEVIERISWQ---NG 137 (152) Q Consensus 67 IY-Td~~L-~vagd~~~~~a~~~~n~~~l~~~~~~~~~~~~~~~~~e~~~n~d~v----~~~G~~YeVi~~~~wq---~G 137 (152) .+ .+..= ..+.|+-.--.....++-...|.....+...-... ..++-++ ++.+++=.|.+=+.|. ++ T Consensus 470 ~~~~~~~d~y~~~dlt~~~~~~~~g~~~~~~a~~~~~~~~~~~~----~~~g~l~~~~y~~~~~e~~v~aW~r~~~~~~~ 545 (785) T protein:vir:94 470 FAVADVSDVKDADDTTGHVLSYIPNGVFDIQGTGTENYICVNST----GAYNRIYIYKFLFKDSVQLQASWSHWEFPKDD 545 (785) T ss_pred eeecccccceehhhHHHHHHHhcCCCcEEEEEecCCCcEEEEEE----cCCCEEEEEEEeecCCceEEEEEEEEEeCCCe Confidence 22 21111 11222100011111111111110000000000000 0011111 1123444444444443 22 Q ss_pred c-ccc-----eeeeeeeecCC Q lcl|NC_017981. 138 I-ISH-----YKYYVVRKTYG 152 (152) Q Consensus 138 i-i~H-----ykYl~Vr~~d~ 152 (152) . +.| .-|++||+.+| T Consensus 546 ~~~~~~~~~d~~~~vv~r~~g 566 (785) T protein:vir:94 546 KILASASIGSTMFIVRQHQGG 566 (785) T ss_pred EEEEEEEeCCEEEEEEEcCCC Confidence 2 222 23777888877 No 62 >protein:vir:95069 Length: 111 # NCBI annotation: ORF046 # Family: family:all:788 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240826;genbank:gi:66394713;genbank:GeneID:5133863 Probab=21.42 E-value=2.1 Score=18.81 Aligned_cols=102 Identities=20% Similarity=0.204 Sum_probs=61.3 Q ss_pred CCCCCCccccceeeEec-CCCcccC--CeecCCCCeeEEEEEEEEeecCCcceeeccccccceeeeEEeecccccccccc Q lcl|NC_017981. 3 GNGPFNKFRKDHTVIIV-SDSYFKD--GVIMPGERMYTKAKFSVQAIKNNEEIQGFAEGRRVSDWRRLYSDTKLPLAGDQ 79 (152) Q Consensus 3 ~~~~~~~FRk~~~V~r~-~~G~Yv~--GrwV~G~~~~~ti~aSVQPi~d~e~~q~lpeGrRitd~~rIYTd~~L~vagd~ 79 (152) =-.||..|++--++=+- .-|.+-+ -+|++-. |+.|-|||++.+|.+++-.--. .--++||+-..+.++ T Consensus 1 ~~~~y~efpHtItiqk~~~v~d~g~~~e~wvd~~----Tv~A~v~~ltg~Ey~~aqQmq~--~v~~niy~rY~~~I~--- 71 (111) T protein:vir:95 1 MFNPFDEFPHTIEIGEVEVVGTHPKEYERFKSNE----TIKGFMDTPTSSETLKFHQMSK--DFDRNLYTPYHIPIT--- 71 (111) T ss_pred CCCcchhcCceEEEEEEEEecCCCccceeecchh----HhhhhccCCChHHHHHHhhhcC--ccceEEEEeeeCCCC--- Confidence 23588888764333221 1122222 2366543 4899999999888776544211 234678886666554 Q ss_pred cccccccccccccccccccccccccccccccCCCCCccEEEECCceEEEEEecchhhCc--ccceeeeeeeecCC Q lcl|NC_017981. 80 LFISASVDVNKDTLATEDGKVIAVGAYIGEKEGMSNPALVVIDGREYEVIERISWQNGI--ISHYKYYVVRKTYG 152 (152) Q Consensus 80 ~~~~a~~~~n~~~l~~~~~~~~~~~~~~~~~e~~~n~d~v~~~G~~YeVi~~~~wq~Gi--i~HykYl~Vr~~d~ 152 (152) +-.++.|.||-|+|++-.-=|.|+ |.-. ..+-+| T Consensus 72 -----------------------------------~~mri~yegRi~~I~s~pvDqgg~hei~~~----~~k~~~ 107 (111) T protein:vir:95 72 -----------------------------------NKTLFNYEGKTYEVVGEPVDQGGQHEINLT----RLRVRP 107 (111) T ss_pred -----------------------------------ccceEEECCeEEEEeccccCCCcchhhhhh----eeeecc Confidence 347999999999999976667776 3322 223333 No 63 >protein:vir:80104 Length: 124 # NCBI annotation: gp12 # Family: family:all:4890 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468716;genbank:gi:157325296;genbank:GeneID:5601795 Probab=21.39 E-value=1.7 Score=19.36 Aligned_cols=111 Identities=17% Similarity=0.150 Sum_probs=55.7 Q ss_pred CcCCCCCCccccceeeEe--cCCCcccCCeecCCCCe-eEEEE--EEEEeecCC---cceeeccccccceeeeEEeeccc Q lcl|NC_017981. 1 MIGNGPFNKFRKDHTVII--VSDSYFKDGVIMPGERM-YTKAK--FSVQAIKNN---EEIQGFAEGRRVSDWRRLYSDTK 72 (152) Q Consensus 1 ~~~~~~~~~FRk~~~V~r--~~~G~Yv~GrwV~G~~~-~~ti~--aSVQPi~d~---e~~q~lpeGrRitd~~rIYTd~~ 72 (152) |.=..-.-+|==|.+|.. ...|.|+.|.|++.... .++++ -=|=|.++. .+...+-+|+-...=+..||..+ T Consensus 4 M~Fasmi~~fGVpi~V~~~~~~gg~~~~G~w~~~~~~~~~~~~~~EPviP~s~~t~~~q~~~~tgG~~~~~dl~WySs~~ 83 (124) T protein:vir:80 4 MIFQSLLDSFGVPLTVFPKQEKGGEFVNGEWVVSQLDETSKIEVNEPFIPSSLMTQMPQTSAYTAARYEKYEMIWFSSQV 83 (124) T ss_pred cchhhhHHhhCCCeEEeecCCCCCcccCCccccCCCCCCChhhccccccCCcccchhhhhhhccCccchhhhhhhhhccc Confidence 321111122333333332 26788899999998622 22221 124454433 22223446665555555566544 Q ss_pred ccccccccccccccccccccccccccccccccccccccCCCCCccEEEECCceEEEEEecchhhCcccceeeeeee-ecC Q lcl|NC_017981. 73 LPLAGDQLFISASVDVNKDTLATEDGKVIAVGAYIGEKEGMSNPALVVIDGREYEVIERISWQNGIISHYKYYVVR-KTY 151 (152) Q Consensus 73 L~vagd~~~~~a~~~~n~~~l~~~~~~~~~~~~~~~~~e~~~n~d~v~~~G~~YeVi~~~~wq~Gii~HykYl~Vr-~~d 151 (152) +++ +-.|.-.|+.|+|....+|+- .-+=+-|.+== .++ T Consensus 84 ~p~----------------------------------------gs~V~~~g~~y~V~~~~~Y~~-Ysdv~~Y~LK~vs~~ 122 (124) T protein:vir:80 84 LPL----------------------------------------KSKVIHKGITYSVEDAIPFTD-YSDVTQYGCKAVSVS 122 (124) T ss_pred ccc----------------------------------------ccEecCCCeeEEEeeccCccc-cCCceEEeeeecccc Confidence 443 123456699999999999973 33333454411 112 Q ss_pred C Q lcl|NC_017981. 152 G 152 (152) Q Consensus 152 ~ 152 (152) + T Consensus 123 ~ 123 (124) T protein:vir:80 123 A 123 (124) T ss_pred C Confidence 2 No 64 >protein:vir:3616 Length: 103 # NCBI annotation: ORF39 # Family: family:all:666 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112702;genbank:gi:13786570;genbank:GeneID:921068 Probab=21.32 E-value=2.6 Score=18.29 Aligned_cols=98 Identities=11% Similarity=0.077 Sum_probs=58.7 Q ss_pred cccceeeEec---CCCcccC--CeecCCCCeeEEEEEEEEeecCCcceeeccccccceeeeEEeeccccccccccccccc Q lcl|NC_017981. 10 FRKDHTVIIV---SDSYFKD--GVIMPGERMYTKAKFSVQAIKNNEEIQGFAEGRRVSDWRRLYSDTKLPLAGDQLFISA 84 (152) Q Consensus 10 FRk~~~V~r~---~~G~Yv~--GrwV~G~~~~~ti~aSVQPi~d~e~~q~lpeGrRitd~~rIYTd~~L~vagd~~~~~a 84 (152) -|-...|.-. +++.|.- |+|+++++...++.|+|=+++ .+.+.+.=|.--+++.-|-... T Consensus 1 MRy~~~V~F~~~s~~~~YnP~tg~~~~~~~~~~~~~aNVTdlg--t~rs~~lFG~i~~~~kVIRl~~------------- 65 (103) T protein:vir:36 1 MRYLDEVTFIKESPDSHYDPDLGEWVEKEPTRTVFSANITDIG--TDRSVEVFGDIKKGAKVMRMMP------------- 65 (103) T ss_pred CcccceEEEEEeCCCCeeCCCCCCccCCeeEEEEEEeeccccc--chhheeecchhhcCeEEEEecC------------- Confidence 5666555444 4789993 999999999999999999986 2333333333333333333211 Q ss_pred ccccccccccccccccccccccccccCCCCCccEEEECCceEEEEEecchhhCcccceeeeeeeecCC Q lcl|NC_017981. 85 SVDVNKDTLATEDGKVIAVGAYIGEKEGMSNPALVVIDGREYEVIERISWQNGIISHYKYYVVRKTYG 152 (152) Q Consensus 85 ~~~~n~~~l~~~~~~~~~~~~~~~~~e~~~n~d~v~~~G~~YeVi~~~~wq~Gii~HykYl~Vr~~d~ 152 (152) .-+...-|-+.|+|+.|+++..-...-+. .+.|.-..- T Consensus 66 -------------------------~i~~~~~~yi~i~~kkY~~~t~r~~~~~~-----~~iv~Ev~~ 103 (103) T protein:vir:36 66 -------------------------LFNMPKYDYIEFDNKKWALMTYRNPSERN-----TFILQEVSQ 103 (103) T ss_pred -------------------------CCCcCcccEEEECCcEEEEEEeeccccCc-----EEEEEEecC Confidence 11222348999999999999887764322 222222111 No 65 >protein:vir:93739 Length: 111 # NCBI annotation: ORF043 # Family: family:all:788 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240462;genbank:gi:66396155;genbank:GeneID:5133508 Probab=20.95 E-value=2.2 Score=18.70 Aligned_cols=102 Identities=20% Similarity=0.221 Sum_probs=61.5 Q ss_pred CCCCCCccccceeeEec-CCCcccCC--eecCCCCeeEEEEEEEEeecCCcceeeccccccceeeeEEeecccccccccc Q lcl|NC_017981. 3 GNGPFNKFRKDHTVIIV-SDSYFKDG--VIMPGERMYTKAKFSVQAIKNNEEIQGFAEGRRVSDWRRLYSDTKLPLAGDQ 79 (152) Q Consensus 3 ~~~~~~~FRk~~~V~r~-~~G~Yv~G--rwV~G~~~~~ti~aSVQPi~d~e~~q~lpeGrRitd~~rIYTd~~L~vagd~ 79 (152) =-.||..|++--++=+- .-|.|-+| +|++-. |+.|-|||++.+|.+++-.--. .--++||+-..+.++ T Consensus 1 ~~~~y~efpHtItiqk~~~v~d~g~~~e~wvd~~----Tv~A~v~~ltg~Ey~~aqQmq~--~v~~niy~rY~~~I~--- 71 (111) T protein:vir:93 1 MFNPFDEFPHTIEIGEVEVVGTYPKEYERFKSNE----TIKGFMDTPTSSETLKFHQMSK--DFDRNLYTPYHIPIT--- 71 (111) T ss_pred CCCcchhcCceEEEEEEEEecCCCccceeecchh----HhhhhccCCChHHHHHHhhhcC--ccceEEEEeeeCCCC--- Confidence 23588888764333221 11222222 366543 4899999999888776544211 234678886666554 Q ss_pred cccccccccccccccccccccccccccccccCCCCCccEEEECCceEEEEEecchhhCc--ccceeeeeeeecCC Q lcl|NC_017981. 80 LFISASVDVNKDTLATEDGKVIAVGAYIGEKEGMSNPALVVIDGREYEVIERISWQNGI--ISHYKYYVVRKTYG 152 (152) Q Consensus 80 ~~~~a~~~~n~~~l~~~~~~~~~~~~~~~~~e~~~n~d~v~~~G~~YeVi~~~~wq~Gi--i~HykYl~Vr~~d~ 152 (152) +-.++.|.||-|+|++-.-=|.|+ |.-. ..+-+| T Consensus 72 -----------------------------------~~mri~yegRi~~I~s~pvDqgg~hei~~~----~~k~~~ 107 (111) T protein:vir:93 72 -----------------------------------NKTLFNYEGKTYEVVGEPVDQGGQHEINLT----RLRVRP 107 (111) T ss_pred -----------------------------------ccceEEECCeEEEEeccccCCCcchhhhhh----eeeecc Confidence 347999999999999976667776 3322 223333 No 66 >protein:vir:94491 Length: 111 # NCBI annotation: ORF046 # Family: family:all:788 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240679;genbank:gi:66396377;genbank:GeneID:5133755 Probab=20.95 E-value=2.2 Score=18.70 Aligned_cols=102 Identities=20% Similarity=0.221 Sum_probs=61.5 Q ss_pred CCCCCCccccceeeEec-CCCcccCC--eecCCCCeeEEEEEEEEeecCCcceeeccccccceeeeEEeecccccccccc Q lcl|NC_017981. 3 GNGPFNKFRKDHTVIIV-SDSYFKDG--VIMPGERMYTKAKFSVQAIKNNEEIQGFAEGRRVSDWRRLYSDTKLPLAGDQ 79 (152) Q Consensus 3 ~~~~~~~FRk~~~V~r~-~~G~Yv~G--rwV~G~~~~~ti~aSVQPi~d~e~~q~lpeGrRitd~~rIYTd~~L~vagd~ 79 (152) =-.||..|++--++=+- .-|.|-+| +|++-. |+.|-|||++.+|.+++-.--. .--++||+-..+.++ T Consensus 1 ~~~~y~efpHtItiqk~~~v~d~g~~~e~wvd~~----Tv~A~v~~ltg~Ey~~aqQmq~--~v~~niy~rY~~~I~--- 71 (111) T protein:vir:94 1 MFNPFDEFPHTIEIGEVEVVGTYPKEYERFKSNE----TIKGFMDTPTSSETLKFHQMSK--DFDRNLYTPYHIPIT--- 71 (111) T ss_pred CCCcchhcCceEEEEEEEEecCCCccceeecchh----HhhhhccCCChHHHHHHhhhcC--ccceEEEEeeeCCCC--- Confidence 23588888764333221 11222222 366543 4899999999888776544211 234678886666554 Q ss_pred cccccccccccccccccccccccccccccccCCCCCccEEEECCceEEEEEecchhhCc--ccceeeeeeeecCC Q lcl|NC_017981. 80 LFISASVDVNKDTLATEDGKVIAVGAYIGEKEGMSNPALVVIDGREYEVIERISWQNGI--ISHYKYYVVRKTYG 152 (152) Q Consensus 80 ~~~~a~~~~n~~~l~~~~~~~~~~~~~~~~~e~~~n~d~v~~~G~~YeVi~~~~wq~Gi--i~HykYl~Vr~~d~ 152 (152) +-.++.|.||-|+|++-.-=|.|+ |.-. ..+-+| T Consensus 72 -----------------------------------~~mri~yegRi~~I~s~pvDqgg~hei~~~----~~k~~~ 107 (111) T protein:vir:94 72 -----------------------------------NKTLFNYEGKTYEVVGEPVDQGGQHEINLT----RLRVRP 107 (111) T ss_pred -----------------------------------ccceEEECCeEEEEeccccCCCcchhhhhh----eeeecc Confidence 347999999999999976667776 3322 223333 No 67 >protein:vir:97429 Length: 111 # NCBI annotation: ORF046 # Family: family:all:788 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240752;genbank:gi:66396450;genbank:GeneID:5133785 Probab=20.95 E-value=2.2 Score=18.70 Aligned_cols=102 Identities=20% Similarity=0.221 Sum_probs=61.5 Q ss_pred CCCCCCccccceeeEec-CCCcccCC--eecCCCCeeEEEEEEEEeecCCcceeeccccccceeeeEEeecccccccccc Q lcl|NC_017981. 3 GNGPFNKFRKDHTVIIV-SDSYFKDG--VIMPGERMYTKAKFSVQAIKNNEEIQGFAEGRRVSDWRRLYSDTKLPLAGDQ 79 (152) Q Consensus 3 ~~~~~~~FRk~~~V~r~-~~G~Yv~G--rwV~G~~~~~ti~aSVQPi~d~e~~q~lpeGrRitd~~rIYTd~~L~vagd~ 79 (152) =-.||..|++--++=+- .-|.|-+| +|++-. |+.|-|||++.+|.+++-.--. .--++||+-..+.++ T Consensus 1 ~~~~y~efpHtItiqk~~~v~d~g~~~e~wvd~~----Tv~A~v~~ltg~Ey~~aqQmq~--~v~~niy~rY~~~I~--- 71 (111) T protein:vir:97 1 MFNPFDEFPHTIEIGEVEVVGTYPKEYERFKSNE----TIKGFMDTPTSSETLKFHQMSK--DFDRNLYTPYHIPIT--- 71 (111) T ss_pred CCCcchhcCceEEEEEEEEecCCCccceeecchh----HhhhhccCCChHHHHHHhhhcC--ccceEEEEeeeCCCC--- Confidence 23588888764333221 11222222 366543 4899999999888776544211 234678886666554 Q ss_pred cccccccccccccccccccccccccccccccCCCCCccEEEECCceEEEEEecchhhCc--ccceeeeeeeecCC Q lcl|NC_017981. 80 LFISASVDVNKDTLATEDGKVIAVGAYIGEKEGMSNPALVVIDGREYEVIERISWQNGI--ISHYKYYVVRKTYG 152 (152) Q Consensus 80 ~~~~a~~~~n~~~l~~~~~~~~~~~~~~~~~e~~~n~d~v~~~G~~YeVi~~~~wq~Gi--i~HykYl~Vr~~d~ 152 (152) +-.++.|.||-|+|++-.-=|.|+ |.-. ..+-+| T Consensus 72 -----------------------------------~~mri~yegRi~~I~s~pvDqgg~hei~~~----~~k~~~ 107 (111) T protein:vir:97 72 -----------------------------------NKTLFNYEGKTYEVVGEPVDQGGQHEINLT----RLRVRP 107 (111) T ss_pred -----------------------------------ccceEEECCeEEEEeccccCCCcchhhhhh----eeeecc Confidence 347999999999999976667776 3322 223333 Done!