Query         lcl|NC_012223.1_cdsid_YP_002720071.1 [gene=EpSSL_gp32] [protein=phage structural protein] [protein_id=YP_002720071.1] [location=complement(25789..26148)]
Match_columns 119
No_of_seqs    68 out of 75
Neff          6.3
Searched_HMMs 1612
Date          Thu Nov 7 12:57:21 2013
Command       /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_32 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_32_vs_rec_db.hhr

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 protein:vir:80382 Length: 122  100.0 4.1E-41 2.5E-44  242.0  13.1  117    3-119     1-122 (122)
  2 protein:vir:104346 Length: 123 100.0 7.4E-40 4.6E-43  235.1  12.8  114    6-119     1-118 (123)
  3 protein:vir:97237 Length: 122  100.0 2.3E-39 1.4E-42  232.4  13.8  115    3-119     1-122 (122)
  4 protein:vir:79639 Length: 123  100.0 1.1E-37 7.1E-41  223.1   9.9  114    6-119     1-118 (123)
  5 protein:vir:107669 Length: 123 100.0 7.8E-37 4.8E-40  218.5   9.4  114    6-119     1-118 (123)
  6 protein:vir:95145 Length: 126   99.9   2E-30 1.2E-33  183.5  11.5  115    3-119     1-126 (126)
  7 protein:vir:78381 Length: 119   99.8 1.5E-22 9.3E-26  140.2   5.9  117    3-119     1-119 (119)
  8 protein:vir:103282 Length: 104  99.6 5.3E-18 3.3E-21  115.3   8.0  100    3-103     1-104 (104)
  9 protein:vir:96807 Length: 132   99.3 1.1E-15 6.9E-19  102.6   3.8  113    3-119     1-132 (132)
 10 protein:vir:94992 Length: 78 #  99.0 4.9E-13 3.1E-16   88.0   6.4   78   42-119     1-78  (78)
 11 protein:vir:3426 Length: 117 #  97.8 8.4E-07 5.2E-10   53.9  10.5   99    3-119     1-108 (117)
 12 protein:vir:395 Length: 117 #   97.1 2.2E-05 1.4E-08   46.1  11.0  101    3-119     1-108 (117)
 13 protein:vir:5258 Length: 123 #  95.4 0.00023 1.4E-07   40.5   7.7  103    3-119     1-109 (123)
 14 protein:vir:94034 Length: 141   95.3  0.0004 2.5E-07   39.2   8.6  111    3-119     1-125 (141)
 15 protein:vir:4199 Length: 113 #  94.7  0.0002 1.3E-07   40.8   5.4  105    8-119     1-109 (113)
 16 protein:vir:4161 Length: 112 #  93.4 0.00064   4E-07   38.1   5.6  105    3-119     1-109 (112)
 17 protein:vir:106729 Length: 152  93.4  0.0023 1.4E-06   35.0   8.6  110    6-119     1-121 (152)
 18 protein:vir:95261 Length: 133   92.9  0.0017   1E-06   35.8   7.1  101   16-119     1-122 (133)
 19 protein:vir:78609 Length: 152   92.8  0.0042 2.6E-06   33.6   9.1  110    6-119     1-121 (152)
 20 protein:vir:105466 Length: 120  92.1  0.0094 5.9E-06   31.7  10.3  112    3-119     1-120 (120)
 21 protein:vir:77651 Length: 152   92.0  0.0029 1.8E-06   34.5   7.3  110    6-119     1-121 (152)
 22 protein:vir:101560 Length: 152  91.3   0.004 2.5E-06   33.7   7.3  110    6-119     1-121 (152)
 23 protein:vir:100886 Length: 113  89.6   0.016 9.7E-06   30.5   9.0  104    3-119     1-108 (113)
 24 protein:vir:81177 Length: 109   88.6   0.031   2E-05   28.8  10.3  102    6-119     1-105 (109)
 25 protein:vir:78985 Length: 115   88.0   0.035 2.2E-05   28.6  10.6  108   11-119     1-115 (115)
 26 protein:vir:4858 Length: 116 #  87.6   0.035 2.2E-05   28.6   9.5  106    1-119     1-111 (116)
 27 protein:vir:5977 Length: 109 #  86.0   0.049   3E-05   27.8  10.3   98   18-119     1-101 (109)
 28 protein:vir:100226 Length: 114  84.8   0.058 3.6E-05   27.4   9.2  105    2-119     1-109 (114)
 29 protein:vir:102144 Length: 113  84.4   0.061 3.8E-05   27.2  10.2  103    6-119     1-113 (113)
 30 protein:vir:4459 Length: 134 #  83.1   0.063 3.9E-05   27.2   8.7  111    3-119     1-117 (134)
 31 protein:vir:102962 Length: 115  82.9   0.072 4.5E-05   26.9   8.9  105   14-119     1-115 (115)
 32 protein:vir:1890 Length: 110 #  82.9   0.073 4.5E-05   26.8  10.4  103    6-119     1-107 (110)
 33 protein:vir:78106 Length: 236   81.6   0.012 7.6E-06   31.1   4.2  102    1-119   125-236 (236)
 34 protein:vir:102084 Length: 107  80.5   0.094 5.9E-05   26.2  10.4  102    6-119     1-105 (107)
 35 protein:vir:102856 Length: 107  80.5   0.094 5.9E-05   26.2  10.4  102    6-119     1-105 (107)
 36 protein:vir:105006 Length: 107  80.5   0.094 5.9E-05   26.2  10.4  102    6-119     1-105 (107)
 37 protein:vir:107606 Length: 107  80.5   0.094 5.9E-05   26.2  10.4  102    6-119     1-105 (107)
 38 protein:vir:4999 Length: 116 #  72.9    0.18 0.00011   24.7   9.7  107    1-119     1-111 (116)
 39 protein:vir:3616 Length: 103 #  70.8     0.2 0.00013   24.4   8.8   96   20-117     1-103 (103)
 40 protein:vir:4343 Length: 118 #  62.7    0.33  0.0002   23.2   9.3  103    6-119     1-114 (118)
 41 protein:vir:100134 Length: 109  62.4    0.33 0.00021   23.2  10.6  102    2-119     1-108 (109)
 42 protein:vir:742 Length: 102 #   62.2    0.34 0.00021   23.2   9.0   96   20-116     1-102 (102)
 43 protein:vir:7858 Length: 111 #  62.2    0.28 0.00017   23.6   6.8   99   19-119     1-106 (111)
 44 protein:vir:101653 Length: 111  62.2    0.28 0.00017   23.6   6.8   99   19-119     1-106 (111)
 45 protein:vir:4832 Length: 116 #  62.2    0.34 0.00021   23.2   9.6  106    3-119     1-111 (116)
 46 protein:vir:4955 Length: 116 #  62.1    0.34 0.00021   23.1   9.8  106    3-119     1-111 (116)
 47 protein:vir:1641 Length: 115 #  60.8    0.32  0.0002   23.3   6.8  102    1-119     1-111 (115)
 48 protein:vir:80104 Length: 124   51.3    0.42 0.00026   22.6   5.8  106    3-119     1-120 (124)
 49 protein:vir:102337 Length: 116  50.3    0.61 0.00038   21.7   7.1  101    1-119     1-112 (116)
 50 protein:vir:1385 Length: 107 #  49.8    0.63 0.00039   21.7   9.3   97    3-119     1-105 (107)
 51 protein:vir:105035 Length: 112  47.2    0.71 0.00044   21.4   8.8  103    1-119     1-107 (112)
 52 protein:vir:5743 Length: 117 #  44.2    0.82 0.00051   21.1   8.9  107    1-119     1-112 (117)
 53 protein:vir:100244 Length: 109  41.0    0.95 0.00059   20.7  10.1  101    2-119     1-104 (109)
 54 protein:vir:1436 Length: 108 #  40.9    0.95 0.00059   20.7   9.1   99    6-119     1-103 (108)
 55 protein:vir:3971 Length: 103 #  37.5     1.1 0.00069   20.3   8.4   96   20-117     1-103 (103)
 56 protein:vir:193 Length: 112 #   35.9     1.2 0.00075   20.1   9.7  102    6-119     1-110 (112)
 57 protein:vir:99571 Length: 131   35.7     1.2 0.00076   20.1   6.9  104    1-119     1-124 (131)
 58 protein:vir:80342 Length: 108   35.2     1.2 0.00077   20.1   9.4   99    6-119     1-103 (108)
 59 protein:vir:1272 Length: 119 #  35.2     1.2 0.00077   20.1   9.8  104    1-119     1-116 (119)
 60 protein:vir:9878 Length: 103 #  34.3    0.41 0.00025   22.7   2.9   95    1-119     1-100 (103)
 61 protein:vir:94910 Length: 119   33.6     1.3 0.00083   19.9   6.3  110    1-119     1-119 (119)
 62 protein:vir:79989 Length: 111   31.7     1.5 0.00092   19.6   9.0  102    3-117     1-111 (111)
 63 protein:vir:9413 Length: 111 #  31.7     1.5 0.00092   19.6   9.0  102    3-117     1-111 (111)
 64 protein:vir:4603 Length: 111 #  31.7     1.5 0.00092   19.6   9.0  102    3-117     1-111 (111)
 65 protein:vir:81103 Length: 111   31.7     1.5 0.00092   19.6   9.0  102    3-117     1-111 (111)
 66 
protein:vir:98341 Length: 111 31.7 1.5 0.00092 19.6 9.0 102 3-117 1-111 (111) 67 protein:vir:3993 Length: 117 # 29.3 1.7 0.001 19.4 10.1 105 2-119 1-111 (117) 68 protein:vir:7447 Length: 131 # 25.6 2 0.0013 18.9 8.3 109 3-119 1-121 (131) 69 protein:vir:9762 Length: 112 # 25.4 2.1 0.0013 18.9 5.6 99 3-119 1-108 (112) 70 protein:vir:79686 Length: 118 25.2 2.1 0.0013 18.8 9.1 105 1-118 1-118 (118) 71 protein:vir:101507 Length: 135 24.1 2.2 0.0014 18.7 7.9 111 1-119 1-125 (135) 72 protein:vir:102189 Length: 135 24.1 2.2 0.0014 18.7 7.9 111 1-119 1-125 (135) 73 protein:vir:3872 Length: 146 # 22.7 2.4 0.0015 18.5 10.3 116 1-119 20-146 (146) 74 protein:vir:93599 Length: 116 22.0 2.5 0.0016 18.4 9.8 104 1-119 1-114 (116) No 1 >protein:vir:80382 Length: 122 # NCBI annotation: BcepGomrgp10 # Family: family:all:704 # MgeID: mge:1882 # MgeName: BcepGomr # Cross-refs: genbank:acc:YP_001210230;genbank:gi:146329922;genbank:GeneID:5123491 Probab=100.00 E-value=4.1e-41 Score=242.03 Aligned_cols=117 Identities=26% Similarity=0.350 Sum_probs=108.3 Q ss_pred ecccChHHHHHHHHHHHHhcCCeEEEEeC-CEeccCC-CcccCccceeeeeEEEEeccchhhcCCeEEeecCEEEEEccC Q lcl|NC_012223. 3 MAGFNYTGLKRKVNPLIKKFGMTATVTRP-GTVDRVD-GDEVVIPPTSFDVIGLREEYKPSEIDGTRIVAGDVKFLCQAI 80 (119) Q Consensus 3 Ma~~~Y~~~~~~A~~ll~~~G~~~tl~r~-g~~dp~~-g~~~~~~~~~~~~~gv~~~~~~~~idGtlI~~GD~~~~~sa~ 80 (119) ||+|||+||+++|+|||+|||++|+++|. +.+||.. +.+++.++.+|+++|++.+|++|++||+|||+||++++++|+ T Consensus 1 m~~f~Y~rl~~~A~~Li~kfG~~~tv~~~~~~~~~~~~~~P~t~~~~~~~v~gv~~~y~~r~idGtlIq~GD~~~~~~a~ 80 (122) T protein:vir:80 1 MKSFNYPRLLKTVDRLIEKFGEECFIVEYIDAVDPTRPFDPPVRTEVRTPVKGVFVKATEKHADGTLIHIGDQLVLISGS 80 (122) T ss_pred CCCcchhHHHHHHHHHHHHhCCCeEEEEeeccCCCCCccccCCCceeeccceEEEecccccccCCcEEeeCCEEEEEecc Confidence 99999999999999999999999999974 4567754 455556788999999999999999999999999999999987 Q ss_pred c---ccccCCEEEeCCeEEEEEecceeCCCCcEEEEEEEEeC Q lcl|NC_012223. 
81 K---QVKVGDLVSLNNTDYRVINPNPLQPAGQTMLFQLQLRG 119 (119) Q Consensus 81 ~---~p~~gD~i~i~G~~~~Vv~v~pi~Pag~~v~y~~q~R~ 119 (119) . +|+++|+|++||++|+||+++|++||+++|+|++|+|= T Consensus 81 ~~~~~P~~~D~v~~~g~~~~Vi~v~p~~pag~~v~y~~q~Rk 122 (122) T protein:vir:80 81 LKREAANIKGDLFRGAEKWKMWNVVPLKPGPVNMLFKIKVSQ 122 (122) T ss_pred cccccCCcCCEEEeCCeeEEEEeccccCCCCceEEEEEEEeC Confidence 5 69999999999999999999999999999999999999 No 2 >protein:vir:104346 Length: 123 # NCBI annotation: conserved phage-related protein # Family: family:all:704 # MgeID: mge:1593 # MgeName: RTP # Cross-refs: genbank:acc:YP_398974;genbank:gi:81343958;genbank:GeneID:3778878 Probab=100.00 E-value=7.4e-40 Score=235.12 Aligned_cols=114 Identities=31% Similarity=0.518 Sum_probs=109.4 Q ss_pred cChHHHHHHHHHHHHhc----CCeEEEEeCCEeccCCCcccCccceeeeeEEEEeccchhhcCCeEEeecCEEEEEccCc Q lcl|NC_012223. 6 FNYTGLKRKVNPLIKKF----GMTATVTRPGTVDRVDGDEVVIPPTSFDVIGLREEYKPSEIDGTRIVAGDVKFLCQAIK 81 (119) Q Consensus 6 ~~Y~~~~~~A~~ll~~~----G~~~tl~r~g~~dp~~g~~~~~~~~~~~~~gv~~~~~~~~idGtlI~~GD~~~~~sa~~ 81 (119) |||.+|+++|++||++| |....+|++|+.+-++|.++..++++|+++|++++|++|||||++||+||++++|+|++ T Consensus 1 mnY~~l~~~a~~lI~~f~~~~gv~~~~t~~~~v~~v~G~ev~~p~~~~~~~Gv~~~y~~r~IDG~lIq~gD~~~i~~a~~ 80 (123) T protein:vir:10 1 MNYNEIESITRDGINFFSDANGVYEMSTGAGYVEIVNGVEVEVPAQTFQLKGLVREIKTRDIDGEFIQFGDKRGIFTAQV 80 (123) T ss_pred CChHHHHHHHHHHHHHhCCCCCceEEecCCCeeeCCCCceeeccceeEeeEEEeccCChhhccceeeeeccEEEEEecCc Confidence 89999999999999999 55667888888777788899999999999999999999999999999999999999999 Q ss_pred ccccCCEEEeCCeEEEEEecceeCCCCcEEEEEEEEeC Q lcl|NC_012223. 82 QVKVGDLVSLNNTDYRVINPNPLQPAGQTMLFQLQLRG 119 (119) Q Consensus 82 ~p~~gD~i~i~G~~~~Vv~v~pi~Pag~~v~y~~q~R~ 119 (119) +++.||+|++||+.|+||+++|+||++++|+|++|||. 
T Consensus 81 eik~Gd~i~vdGe~~rVV~~~pikPa~~~v~y~~qlRr 118 (123) T protein:vir:10 81 EIKQGYQIKVDGETFVVVDPRPVKPTGTTVGYRPILRR 118 (123) T ss_pred eeccCCEEEECCeEEEEecCCccCccceeEEEeeeeee Confidence 99999999999999999999999999999999999999 No 3 >protein:vir:97237 Length: 122 # NCBI annotation: hypothetical protein ORF025 # Family: family:all:704 # MgeID: mge:1657 # MgeName: M6 # Cross-refs: genbank:acc:YP_001294533;genbank:gi:149408254;genbank:GeneID:5237102 Probab=100.00 E-value=2.3e-39 Score=232.41 Aligned_cols=115 Identities=26% Similarity=0.321 Sum_probs=106.9 Q ss_pred ecccChHHHHHHHHHHHHhcCCeEEEEeC--CEeccCCCc--ccCccceeeeeEEEEeccchhhcCCeEEeecCEEEEEc Q lcl|NC_012223. 3 MAGFNYTGLKRKVNPLIKKFGMTATVTRP--GTVDRVDGD--EVVIPPTSFDVIGLREEYKPSEIDGTRIVAGDVKFLCQ 78 (119) Q Consensus 3 Ma~~~Y~~~~~~A~~ll~~~G~~~tl~r~--g~~dp~~g~--~~~~~~~~~~~~gv~~~~~~~~idGtlI~~GD~~~~~s 78 (119) |+ | |+||+++|.+||+|||++|+++|. |+|||.+|. ++++++++|+++|++.+|++++|||++||+||++++++ T Consensus 1 M~-~-y~~~~~~a~~Li~kfG~~vtl~r~~~g~y~~~~g~~~p~~~t~~~~~~~gv~~~~~~~~idGtlI~~GD~~l~~~ 78 (122) T protein:vir:97 1 MA-R-FDSAIALAKKLIKKNGQAVTLRGFTAGAAPDPAKPWKPGGNVAADQTIEAVFLDYEQRYIDGQTIRMGDQRVFMP 78 (122) T ss_pred Cc-c-chHHHHHHHHHHHHhCCceEEEEeccceeCCCCCceecCCceeeeeeeEEEeeccchhhccCcEEeecCEEEEEe Confidence 88 4 999999999999999999999996 569988764 34566789999999999999999999999999999999 Q ss_pred cC---cccccCCEEEeCCeEEEEEecceeCCCCcEEEEEEEEeC Q lcl|NC_012223. 79 AI---KQVKVGDLVSLNNTDYRVINPNPLQPAGQTMLFQLQLRG 119 (119) Q Consensus 79 a~---~~p~~gD~i~i~G~~~~Vv~v~pi~Pag~~v~y~~q~R~ 119 (119) |. 
.+|++||+|++||+.|+||+++|++|++++++|++|||= T Consensus 79 a~~~~~~P~~gD~v~~~g~~~~Vi~v~~i~pa~~~v~y~lqlRk 122 (122) T protein:vir:97 79 AEGLTAPPEVEGLVLRGLEVWKVIAVKPLNPNGQAIMYELQVRQ 122 (122) T ss_pred eCCCccccccCCEEEeCCEEEEEEeccccCCCCceEEEEEEeeC Confidence 84 689999999999999999999999999999999999999 No 4 >protein:vir:79639 Length: 123 # NCBI annotation: gp39 # Family: family:all:704 # MgeID: mge:1872 # MgeName: TLS # Cross-refs: genbank:acc:YP_001285528;genbank:gi:148734511;genbank:GeneID:5219997 Probab=100.00 E-value=1.1e-37 Score=223.12 Aligned_cols=114 Identities=25% Similarity=0.423 Sum_probs=112.8 Q ss_pred cChHHHHHHHHHHHHhc----CCeEEEEeCCEeccCCCcccCccceeeeeEEEEeccchhhcCCeEEeecCEEEEEccCc Q lcl|NC_012223. 6 FNYTGLKRKVNPLIKKF----GMTATVTRPGTVDRVDGDEVVIPPTSFDVIGLREEYKPSEIDGTRIVAGDVKFLCQAIK 81 (119) Q Consensus 6 ~~Y~~~~~~A~~ll~~~----G~~~tl~r~g~~dp~~g~~~~~~~~~~~~~gv~~~~~~~~idGtlI~~GD~~~~~sa~~ 81 (119) |||..++.+|+.+|+.| |..+++||+|+.+-++|.|+++++.+|+++|++.+|++|||||++||+||++++|+|++ T Consensus 1 mNy~~l~~~~~~~I~~fsd~~G~~~~~t~~g~~~dv~G~ev~~p~~~~~~~Gv~~~~~~reIDg~lIq~gDvk~i~~a~v 80 (123) T protein:vir:79 1 MNYEQIRSMASAGINFFSDGTGEFDCITQPGSVEIVGGIEVEKPEIKVKIKGLVRAPRTREVDGEVIRVTDKLGVFNADV 80 (123) T ss_pred CchHHHHHHHHHHHHhhccCCCceeeeecCcceeecCCeeccccceEEeEEEEeecCCccccCCeeEEeccEEEEEecce Confidence 89999999999999999 99999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccCCEEEeCCeEEEEEecceeCCCCcEEEEEEEEeC Q lcl|NC_012223. 82 QVKVGDLVSLNNTDYRVINPNPLQPAGQTMLFQLQLRG 119 (119) Q Consensus 82 ~p~~gD~i~i~G~~~~Vv~v~pi~Pag~~v~y~~q~R~ 119 (119) +++.||+|++|||.|+||+++|||||+++|||++|||. 
T Consensus 81 eik~Gd~i~vDge~~rVV~~~pvkPa~~~I~y~~qLRr 118 (123) T protein:vir:79 81 ELKNGYQIDIDGERYVMVETRPIRPTSITVAYRPIMRR 118 (123) T ss_pred eeccCCEEEECCeEEEEecCccccchhhhhhhhhhhcc Confidence 99999999999999999999999999999999999999 No 5 >protein:vir:107669 Length: 123 # NCBI annotation: hypothetical protein # Family: family:all:704 # MgeID: mge:1518 # MgeName: T1 # Cross-refs: genbank:acc:YP_003901;genbank:gi:45686317;genbank:GeneID:2773009 Probab=100.00 E-value=7.8e-37 Score=218.54 Aligned_cols=114 Identities=18% Similarity=0.274 Sum_probs=109.2 Q ss_pred cChHHHHHHHHHHHHhc---CCeEEE-EeCCEeccCCCcccCccceeeeeEEEEeccchhhcCCeEEeecCEEEEEccCc Q lcl|NC_012223. 6 FNYTGLKRKVNPLIKKF---GMTATV-TRPGTVDRVDGDEVVIPPTSFDVIGLREEYKPSEIDGTRIVAGDVKFLCQAIK 81 (119) Q Consensus 6 ~~Y~~~~~~A~~ll~~~---G~~~tl-~r~g~~dp~~g~~~~~~~~~~~~~gv~~~~~~~~idGtlI~~GD~~~~~sa~~ 81 (119) |||.+|+++|+++|+.| |.++.+ |++|+++-++|.|+++|+.+|+++|+.++|++|||||++||+||++++|+|++ T Consensus 1 mNY~~i~~~a~~~I~~fsd~~g~~~l~t~~~~~~~v~G~Ev~~p~~~~~~~G~~~~y~~reIDG~lI~~gDvk~if~a~v 80 (123) T protein:vir:10 1 MNYSQIERMARKGVAFFTDPSRPMNLIKQGEYGYDENGFEIPPMEQVIPISGATRRPNAREIDGETIRASDILGIFNNDH 80 (123) T ss_pred CChHHHHHHHHHHHhhhcCCCCeEEeeeCCcccccCCCeecccCCeeeeeEEEEeeccccccccceeeeccEEEeeccce Confidence 89999999999999988 778877 55677888999999999999999999999999999999999999999999999 Q ss_pred ccccCCEEEeCCeEEEEEecceeCCCCcEEEEEEEEeC Q lcl|NC_012223. 82 QVKVGDLVSLNNTDYRVINPNPLQPAGQTMLFQLQLRG 119 (119) Q Consensus 82 ~p~~gD~i~i~G~~~~Vv~v~pi~Pag~~v~y~~q~R~ 119 (119) +++.||+|++||+.|+||+++||+|++++|||++|||. 
T Consensus 81 eik~Gd~I~vDg~~~rVV~~~pvkPa~~~I~y~~qLRr 118 (123) T protein:vir:10 81 EINEGDYIEIDGIRHVVVDARPVQASLEPVAYRPVLRR 118 (123) T ss_pred eeccCCEEEECCeEEEEecCcccchhhhhhhhhhhhce Confidence 99999999999999999999999999999999999999 No 6 >protein:vir:95145 Length: 126 # NCBI annotation: hypothetical protein ORF014 # Family: family:all:704 # MgeID: mge:1552 # MgeName: PA73 # Cross-refs: genbank:acc:YP_001293421;genbank:gi:148912842;genbank:GeneID:5228220 Probab=99.94 E-value=2e-30 Score=183.46 Aligned_cols=115 Identities=22% Similarity=0.193 Sum_probs=106.8 Q ss_pred ecccChHHHHHHHHHHHHhcCCeEEEEeC---CEeccCC-CcccCccceeeeeEEEEeccchh------hcCCeEEeecC Q lcl|NC_012223. 3 MAGFNYTGLKRKVNPLIKKFGMTATVTRP---GTVDRVD-GDEVVIPPTSFDVIGLREEYKPS------EIDGTRIVAGD 72 (119) Q Consensus 3 Ma~~~Y~~~~~~A~~ll~~~G~~~tl~r~---g~~dp~~-g~~~~~~~~~~~~~gv~~~~~~~------~idGtlI~~GD 72 (119) ||. |+|++++|.|||+|||++|++++. ..+||+. |++...+++++++++++..|++| +|||++|+.|| T Consensus 1 ma~--y~rl~ata~rLIaK~Gq~~~~r~~~~~~~~DP~~p~~p~~~t~~d~p~t~v~l~yd~r~~~~~~~idGt~I~~GD 78 (126) T protein:vir:95 1 MAQ--FDRAIKTAQRLIAKNGEKVKWRVIEDVTPTDPNKPWEPGPALPEDKDVTICFLPVDRQTQETFNFIKGTEVPKGS 78 (126) T ss_pred Ccc--hHHHHHHHHHHHHHhCCceEEEEEeeccCCCCccCCCCCCCceeecccceEEecccccccchhhhcccceeccCc Confidence 887 779999999999999999999985 4599975 77778899999999999999999 89999999999 Q ss_pred EEEEEcc-CcccccCCEEEeCCeEEEEEecceeCCCCcEEEEEEEEeC Q lcl|NC_012223. 73 VKFLCQA-IKQVKVGDLVSLNNTDYRVINPNPLQPAGQTMLFQLQLRG 119 (119) Q Consensus 73 ~~~~~sa-~~~p~~gD~i~i~G~~~~Vv~v~pi~Pag~~v~y~~q~R~ 119 (119) ++++++. 
..+|++||.++.||+.|+|++++|++|+|+.|+|++-..+ T Consensus 79 ~~i~i~gl~~ap~vgd~V~~~g~~~~ivav~pl~P~gvavLy~l~~~~ 126 (126) T protein:vir:95 79 VMGLMGNVPFAPNLKDVVIRNGVELRLAYIDVLSPNGQKVLYTMVFQA 126 (126) T ss_pred EEEEecccccccccCceEEECcEEEEEEeeeeeCCCcceeeeeeeecC Confidence 9999974 3579999999999999999999999999999999999999 No 7 >protein:vir:78381 Length: 119 # NCBI annotation: hypothetical protein # Family: family:all:704 # MgeID: mge:1851 # MgeName: SETP3 # Cross-refs: genbank:acc:YP_001110843;genbank:gi:134288604;genbank:GeneID:5179644 Probab=99.77 E-value=1.5e-22 Score=140.22 Aligned_cols=117 Identities=21% Similarity=0.256 Sum_probs=113.3 Q ss_pred ecccChHHHHHHHHHHHHhcCCeEEEEeCCE--eccCCCcccCccceeeeeEEEEeccchhhcCCeEEeecCEEEEEccC Q lcl|NC_012223. 3 MAGFNYTGLKRKVNPLIKKFGMTATVTRPGT--VDRVDGDEVVIPPTSFDVIGLREEYKPSEIDGTRIVAGDVKFLCQAI 80 (119) Q Consensus 3 Ma~~~Y~~~~~~A~~ll~~~G~~~tl~r~g~--~dp~~g~~~~~~~~~~~~~gv~~~~~~~~idGtlI~~GD~~~~~sa~ 80 (119) |+.....|||...+|||+|||.++.+.|.|+ |||+.|+.++.+.+..|++++....++..++|+.||+||+++.+..+ T Consensus 1 m~t~fskrmqgvgtrll~k~gstv~lvr~g~k~wd~vlgeyiw~~d~vlplkavpvpvn~glvngttiqagdm~vkad~s 80 (119) T protein:vir:78 1 MGTSFSKRMQGVGTRLLSKYGSTVNLVRKGQKTWDPVLGEYVWGPDVVLPLKAVPVPVNAGLVNGTTIQAGDMMVKADYS 80 (119) T ss_pred CCcchhhHHhhhhHHHHHhhcchhhhhhccchhhhhhhhhhccCCceeeecccccccccccccccceeeccceeeeccce Confidence 8887789999999999999999999999986 99999999999999999999999999999999999999999999999 Q ss_pred cccccCCEEEeCCeEEEEEecceeCCCCcEEEEEEEEeC Q lcl|NC_012223. 
81 KQVKVGDLVSLNNTDYRVINPNPLQPAGQTMLFQLQLRG 119 (119) Q Consensus 81 ~~p~~gD~i~i~G~~~~Vv~v~pi~Pag~~v~y~~q~R~ 119 (119) ++|+..|++.++||.|+|++++.-.-++.+++|.+|+|- T Consensus 81 vvpkm~dkv~f~geqwsvvaiekkmvnddvvayfiqvrk 119 (119) T protein:vir:78 81 VVPKMDDKVRFSGEQWSVVAIEKKMVNDDVVAYFIQVRK 119 (119) T ss_pred eccccccceeecCceeeeeeeehhhcchhheeheeeecC Confidence 999999999999999999999988889999999999999 No 8 >protein:vir:103282 Length: 104 # NCBI annotation: hypothetical protein # Family: family:all:704 # MgeID: mge:1605 # MgeName: JK06 # Cross-refs: genbank:acc:YP_277461;genbank:gi:71834104;genbank:GeneID:3562393 Probab=99.56 E-value=5.3e-18 Score=115.27 Aligned_cols=100 Identities=28% Similarity=0.401 Sum_probs=88.0 Q ss_pred ecccChHHHHHHHHH----HHHhcCCeEEEEeCCEeccCCCcccCccceeeeeEEEEeccchhhcCCeEEeecCEEEEEc Q lcl|NC_012223. 3 MAGFNYTGLKRKVNP----LIKKFGMTATVTRPGTVDRVDGDEVVIPPTSFDVIGLREEYKPSEIDGTRIVAGDVKFLCQ 78 (119) Q Consensus 3 Ma~~~Y~~~~~~A~~----ll~~~G~~~tl~r~g~~dp~~g~~~~~~~~~~~~~gv~~~~~~~~idGtlI~~GD~~~~~s 78 (119) |--|||..++.+|.. |.+..|..-.+|++| |..++|.|++.+.+.|+++|+++.++.|+|||+.|+.||++.+|+ T Consensus 1 ~~~MNy~~ie~~~r~GInffSD~~g~~e~~tq~~-~~ivnG~EV~~p~~~~~ikG~vR~~kaReIDGe~Ir~~D~~GIF~ 79 (104) T protein:vir:10 1 MLPMNHALLQQQIKAGINLLSDGDGVFEATTQPA-ISIVNGYEVRTPGTSYTVRGVIREFKARDIDGDIIKFGDRRGIFT 79 (104) T ss_pred CCccCHHHHHHHHhhhhhhhcCCCceeeeecCCc-eeeecCeecCCCCeEEEeeeeeecccccccCccEEEeeceeceee Confidence 899999999998854 444556655666666 888999999999999999999999999999999999999999999 Q ss_pred cCcccccCCEEEeCCeEEEEEecce Q lcl|NC_012223. 
79 AIKQVKVGDLVSLNNTDYRVINPNP 103 (119) Q Consensus 79 a~~~p~~gD~i~i~G~~~~Vv~v~p 103 (119) |+++++.||+|.||||+-+-...+- T Consensus 80 ad~ei~eG~~I~IDge~~~~~~~~~ 104 (104) T protein:vir:10 80 ADSVVSEGDRIYIDQEATQSLTLDQ 104 (104) T ss_pred cceeecCCcEEEEcccccceeeecC Confidence 9999999999999999887666543 No 9 >protein:vir:96807 Length: 132 # NCBI annotation: hypothetical phage protein # Family: family:all:32158 # MgeID: mge:1629 # MgeName: phiHSIC # Cross-refs: genbank:acc:YP_224252;genbank:gi:62362387;genbank:GeneID:3345746 Probab=99.31 E-value=1.1e-15 Score=102.57 Aligned_cols=113 Identities=19% Similarity=0.341 Sum_probs=97.3 Q ss_pred ecc--cChHHHHHHHHHHHHhcCCeEEEEeC----CEeccCCCcccCccceeeeeEEEEeccchhhcCCeEEeecCEEEE Q lcl|NC_012223. 3 MAG--FNYTGLKRKVNPLIKKFGMTATVTRP----GTVDRVDGDEVVIPPTSFDVIGLREEYKPSEIDGTRIVAGDVKFL 76 (119) Q Consensus 3 Ma~--~~Y~~~~~~A~~ll~~~G~~~tl~r~----g~~dp~~g~~~~~~~~~~~~~gv~~~~~~~~idGtlI~~GD~~~~ 76 (119) |-. -||+.++.+ |++.|.++++.++ |+||.+......+|..+||+.|+..+|+++|.....-|+||++++ T Consensus 1 mqtyfedyqdaret----lkedgfavtlikkglpgggydengdiqaaepdieypgygittsfsswhlkegiaqagdvkli 76 (132) T protein:vir:96 1 MQTYFEDYQDARET----LKEDGFAVTLIKKGLPGGGYDENGDIQAAEPDIEYPGYGITTSFSSWHLKEGIAQAGDVKLI 76 (132) T ss_pred CcchhhhhHHHHHH----HhhcCceEeeeeccCCCCCcCCCCceeccCCCccCCCcceecccchhhhhhhhccccceeEE Confidence 544 356666555 9999999998774 468888777788899999999999999999998888999999999 Q ss_pred EccCccc-----------ccCCEE--EeCCeEEEEEecceeCCCCcEEEEEEEEeC Q lcl|NC_012223. 77 CQAIKQV-----------KVGDLV--SLNNTDYRVINPNPLQPAGQTMLFQLQLRG 119 (119) Q Consensus 77 ~sa~~~p-----------~~gD~i--~i~G~~~~Vv~v~pi~Pag~~v~y~~q~R~ 119 (119) |++++.. 
.-||++ ++||+-|+||.-+.++|.++.|.-++++|- T Consensus 77 fapevmsdeyitfynqlrnggdrmyaevdgelwrvvmgeevkptstqiiaklhlrr 132 (132) T protein:vir:96 77 FAPEVMSDEYITFYNQLRNGGDRMYAEVDGELWRVVMGEEVKPTSTQIIAKLHLRR 132 (132) T ss_pred eccccccchhhhhHHhhhcchhhhhhhhcchheehccccccccchhhhhhhhcccC Confidence 9998632 337886 889999999999999999999999999999 No 10 >protein:vir:94992 Length: 78 # NCBI annotation: hypothetical protein # Family: family:all:704 # MgeID: mge:1547 # MgeName: KS7 # Cross-refs: genbank:acc:YP_224023;genbank:gi:62327310;genbank:GeneID:5176820 Probab=99.04 E-value=4.9e-13 Score=88.05 Aligned_cols=78 Identities=17% Similarity=0.222 Sum_probs=74.6 Q ss_pred cCccceeeeeEEEEeccchhhcCCeEEeecCEEEEEccCcccccCCEEEeCCeEEEEEecceeCCCCcEEEEEEEEeC Q lcl|NC_012223. 42 VVIPPTSFDVIGLREEYKPSEIDGTRIVAGDVKFLCQAIKQVKVGDLVSLNNTDYRVINPNPLQPAGQTMLFQLQLRG 119 (119) Q Consensus 42 ~~~~~~~~~~~gv~~~~~~~~idGtlI~~GD~~~~~sa~~~p~~gD~i~i~G~~~~Vv~v~pi~Pag~~v~y~~q~R~ 119 (119) .+.+.+..|++++....++..++|+.||+||+++.+..+++|+..|++.++||.|+|++++.-.-++.+++|.+|+|- T Consensus 1 ~w~~d~vlplkavpvpvn~glvngttiqagdm~vkad~svvpkm~dkvqf~geqwsvvaiekkmvnddvvayfiqvrk 78 (78) T protein:vir:94 1 MWGPDVVLPLKAVPVPVNAGLVNGTTIQAGDMMVKADYSVVPKMDDKVQFSGEQWSVVAIEKKMVNDDVVAYFIQVRK 78 (78) T ss_pred CCCCceeeecccccccccccccccceeeccceEEeccceeccccccceeecCceeEEEeeeeccccccceeeeeeecC Confidence 566778899999999999999999999999999999999999999999999999999999988889999999999999 No 11 >protein:vir:3426 Length: 117 # NCBI annotation: head-tail joining protein # Family: family:all:1908 # MgeID: mge:70 # MgeName: lambda # Cross-refs: genbank:acc:NP_040589;genbank:gi:9626253;genbank:GeneID:2703484 Probab=97.76 E-value=8.4e-07 Score=53.90 Aligned_cols=99 Identities=19% Similarity=0.279 Sum_probs=71.8 Q ss_pred ecccChHHH--HHHH---HHHHHhcCCeEEEEeCCEeccCCCcccCccceeeeeEEEEeccc-hhhcC-CeEEeecCEEE Q lcl|NC_012223. 
3 MAGFNYTGL--KRKV---NPLIKKFGMTATVTRPGTVDRVDGDEVVIPPTSFDVIGLREEYK-PSEID-GTRIVAGDVKF 75 (119) Q Consensus 3 Ma~~~Y~~~--~~~A---~~ll~~~G~~~tl~r~g~~dp~~g~~~~~~~~~~~~~gv~~~~~-~~~id-GtlI~~GD~~~ 75 (119) |++|+ ++ +++| ..++..||.+++|+-.+ | ....++|||++.. ...+. |.-|....=.+ T Consensus 1 m~~~d--Nlfd~a~~~aD~~i~~~fg~~a~i~~~~------g-------~~~~i~gVFDdP~~~~~~~gG~~i~~s~P~L 65 (117) T protein:vir:34 1 MADFD--NLFDAAIARADETIRGYMGTSATITSGE------Q-------SGAVIRGVFDDPENISYAGQGVRVEGSSPSL 65 (117) T ss_pred CCccc--chhHHHHhhcchhhHhhcCeeEEEEeCC------C-------cceEEEEEecCccchhhccCCEEeecCCcEE Confidence 99987 44 3333 57788999999887543 1 2367899999843 44444 35666666666 Q ss_pred EE-ccC-cccccCCEEEeCCeEEEEEecceeCCCCcEEEEEEEEeC Q lcl|NC_012223. 76 LC-QAI-KQVKVGDLVSLNNTDYRVINPNPLQPAGQTMLFQLQLRG 119 (119) Q Consensus 76 ~~-sa~-~~p~~gD~i~i~G~~~~Vv~v~pi~Pag~~v~y~~q~R~ 119 (119) ++ +++ +.++.+|.|+|+|+.|.|..++| .++=.+|-.-.|| T Consensus 66 ~vk~aDv~~l~r~D~v~I~G~~y~V~~~~P---D~~G~~~l~L~rg 108 (117) T protein:vir:34 66 FVRTDEVRQLRRGDTLTIGEENFWVDRVSP---DDGGSCHLWLGRG 108 (117) T ss_pred EeeechhhccCCCCEEEECCCeeEeeeccc---CCCceEEEEeecC Confidence 66 443 47899999999999999998655 6666678788899 No 12 >protein:vir:395 Length: 117 # NCBI annotation: gp10 # Family: family:all:1908 # MgeID: mge:325 # MgeName: N15 # Cross-refs: genbank:acc:NP_046905;genbank:gi:9630475;genbank:GeneID:1261649 Probab=97.11 E-value=2.2e-05 Score=46.12 Aligned_cols=101 Identities=16% Similarity=0.209 Sum_probs=66.7 Q ss_pred ecccCh--H-HHHHHHHHHHHhcCCeEEEEeCCEeccCCCcccCccceeeeeEEEEeccchh--hcCCeEEeecCEEEEE Q lcl|NC_012223. 3 MAGFNY--T-GLKRKVNPLIKKFGMTATVTRPGTVDRVDGDEVVIPPTSFDVIGLREEYKPS--EIDGTRIVAGDVKFLC 77 (119) Q Consensus 3 Ma~~~Y--~-~~~~~A~~ll~~~G~~~tl~r~g~~dp~~g~~~~~~~~~~~~~gv~~~~~~~--~idGtlI~~GD~~~~~ 77 (119) |++||= + .|...=...+..||.+++++-.+. 
..-+++|+|++...- ...|--|...==.+++ T Consensus 1 m~~~dNlFd~ama~aD~aI~~~~g~~a~i~~g~~-------------~~rti~gVFDdP~~~~~~aggg~ie~saP~Lfv 67 (117) T protein:vir:39 1 MADFDNLFDEAMSRADGAIRGVMGTEAKVMSGTL-------------SGATLVGVFDDPENIGYAGAGIRVEGTSPTLFV 67 (117) T ss_pred CCcccchHHHHHHhhhHHHHHhcCceEEEEeCCC-------------CceEEEEEecCccccccccCceEEeccCcEEEE Confidence 999873 1 233322567889999988875441 124678899887422 2233334422233333 Q ss_pred -ccC-cccccCCEEEeCCeEEEEEecceeCCCCcEEEEEEEEeC Q lcl|NC_012223. 78 -QAI-KQVKVGDLVSLNNTDYRVINPNPLQPAGQTMLFQLQLRG 119 (119) Q Consensus 78 -sa~-~~p~~gD~i~i~G~~~~Vv~v~pi~Pag~~v~y~~q~R~ 119 (119) +++ +.++.+|.|+|+|+.|.|++++| .++=.+|-.-.|| T Consensus 68 ktaDv~gl~r~D~vtI~g~~y~V~~~~p---Dg~G~~~l~L~rg 108 (117) T protein:vir:39 68 KTSTVSQLQRMDTLTINGRQFWVDRVGP---DDCGSCHIWLGNG 108 (117) T ss_pred eeccccccCCCCEEEECCCceEEeeecc---CCCceEEEEeecC Confidence 455 46899999999999999999665 6666677778899 No 13 >protein:vir:5258 Length: 123 # NCBI annotation: hypothetical protein # Family: family:all:4880 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852763;genbank:gi:31544038;uniprot:Q776V7;genbank:GeneID:2777139 Probab=95.41 E-value=0.00023 Score=40.50 Aligned_cols=103 Identities=18% Similarity=0.320 Sum_probs=64.7 Q ss_pred ecccChHHHHHHHHHHHH-hcCCeEEEEe-CCEeccCCCcccCccceeeeeEEEEeccchhhc----CCeEEeecCEEEE Q lcl|NC_012223. 3 MAGFNYTGLKRKVNPLIK-KFGMTATVTR-PGTVDRVDGDEVVIPPTSFDVIGLREEYKPSEI----DGTRIVAGDVKFL 76 (119) Q Consensus 3 Ma~~~Y~~~~~~A~~ll~-~~G~~~tl~r-~g~~dp~~g~~~~~~~~~~~~~gv~~~~~~~~i----dGtlI~~GD~~~~ 76 (119) |+-+| ....|++ +|-++.+++| .|+|.+..+ ... .+..++.|++..-+..+. +|+.|... 
++ T Consensus 1 m~~ld------vs~v~ldpdF~~titv~R~~g~~~~~g~--~~~-t~~~t~~avVqP~~~~dlq~LpeG~ri~~s-Ik-- 68 (123) T protein:vir:52 1 MSLIN------QSGRFLNSRFRQQITVQKQSGSHSASGF--DVR-YEKQQITAIVIPTSPNDVLLLPEGERYLPS-IK-- 68 (123) T ss_pred CCccc------ccccccCcccCceEEEEccCccEeCCcc--ccc-cccceEEEEEeeCChhhcccccccccccce-EE-- Confidence 55444 4455565 7778888877 477887544 222 356778999998887665 34444322 22 Q ss_pred EccCcccccCCEEEeCCeEEEEEecceeCCCCcEEEEEEEEeC Q lcl|NC_012223. 77 CQAIKQVKVGDLVSLNNTDYRVINPNPLQPAGQTMLFQLQLRG 119 (119) Q Consensus 77 ~sa~~~p~~gD~i~i~G~~~~Vv~v~pi~Pag~~v~y~~q~R~ 119 (119) |-....+..||.|.-+|++|+|+++.++.=-|= .+-+=.|= T Consensus 69 I~Tq~~L~vGD~vlw~G~~YrVi~~~d~s~YGY--y~~i~~~~ 109 (123) T protein:vir:52 69 VYTQQQLNIGDLVDYRGQTYKIKTAANWGDYGY--YNNIGVRH 109 (123) T ss_pred EEeccccccccEEEeCCcEEEEEEcCCccccce--ecceeecc Confidence 222336677999999999999999998743331 12222222 No 14 >protein:vir:94034 Length: 141 # NCBI annotation: hypothetical protein # Family: family:all:3177 # MgeID: mge:1493 # MgeName: OP2 # Cross-refs: genbank:acc:YP_453621;genbank:gi:84662657;genbank:GeneID:5142544 Probab=95.28 E-value=0.0004 Score=39.23 Aligned_cols=111 Identities=19% Similarity=0.150 Sum_probs=74.8 Q ss_pred ecccChHHHHHHHHHHHHhcC--CeEEEEeCCEeccCCCcccCccceeeeeEEE---Eeccc---hhhcCCeEEeecCEE Q lcl|NC_012223. 3 MAGFNYTGLKRKVNPLIKKFG--MTATVTRPGTVDRVDGDEVVIPPTSFDVIGL---REEYK---PSEIDGTRIVAGDVK 74 (119) Q Consensus 3 Ma~~~Y~~~~~~A~~ll~~~G--~~~tl~r~g~~dp~~g~~~~~~~~~~~~~gv---~~~~~---~~~idGtlI~~GD~~ 74 (119) |++|| |.+.|...+..-. .+.++++-.+|.-..|...+. -.+...+ ....+ .+|.||-.||--=+. 
T Consensus 1 ~~~MN---Lh~Ia~~aI~aVNP~~pa~l~~stG~t~~~G~~~P~---y~~~~a~~iQ~QalS~~dL~h~dgln~QG~~~~ 74 (141) T protein:vir:94 1 MSGLN---LHRIVRGPIQVVNPDVPGDVYISTGHTTLRGIVTPT---FQRLPAQRLQVQAVTTNDLYQLNGLGYAKDTQK 74 (141) T ss_pred CCcch---hHHHhhhhhcccCCCCceEEEEeeccEecCCceEee---eecccceEEEeeccChhHHHHhhcccccceeeE Confidence 99996 7777877777554 578888865554334432211 1111111 12222 468898766655566 Q ss_pred EEEccCc------ccccCCEEEeCCeEEEEEecceeCCCCcEEEEEEEEeC Q lcl|NC_012223. 75 FLCQAIK------QVKVGDLVSLNNTDYRVINPNPLQPAGQTMLFQLQLRG 119 (119) Q Consensus 75 ~~~sa~~------~p~~gD~i~i~G~~~~Vv~v~pi~Pag~~v~y~~q~R~ 119 (119) +|+.... .=+=||++.++|++|.|+.+-..+|.=..++-.+|+=+ T Consensus 75 iY~~G~~~gv~R~~~~GGD~~vf~g~~WLV~~v~E~WpDWc~v~v~lQ~~~ 125 (141) T protein:vir:94 75 LYAYGTLSGIVRPEGKGGDLVNLANTWWAIQGVIEWWPQWCSVAITRQVDA 125 (141) T ss_pred EEeccchhheechhhcCccEEEECCceEEEEEcccccccceeEeeeeccCh Confidence 6765541 23569999999999999999999999888888888887 No 15 >protein:vir:4199 Length: 113 # NCBI annotation: unknown # Family: family:all:11763 # MgeID: mge:88 # MgeName: psiM100 # Cross-refs: genbank:acc:NP_071824;genbank:gi:11863107;genbank:GeneID:1257609 Probab=94.70 E-value=0.0002 Score=40.83 Aligned_cols=105 Identities=17% Similarity=0.222 Sum_probs=71.0 Q ss_pred hHHHHHHHHHHHHhcCCeEEEEeCC--EeccCCCcccCccceeeeeEEEEe--ccchhhcCCeEEeecCEEEEEccCccc Q lcl|NC_012223. 8 YTGLKRKVNPLIKKFGMTATVTRPG--TVDRVDGDEVVIPPTSFDVIGLRE--EYKPSEIDGTRIVAGDVKFLCQAIKQV 83 (119) Q Consensus 8 Y~~~~~~A~~ll~~~G~~~tl~r~g--~~dp~~g~~~~~~~~~~~~~gv~~--~~~~~~idGtlI~~GD~~~~~sa~~~p 83 (119) -.|++.--.+||.+||+.++|...- +-|. -|.++. -..+.+++.+|- .=..|-|.|..|--=|--.++...... 
T Consensus 1 mkrikggF~~Li~rYG~~~~LiH~~~~~RD~-RGQPi~-ED~~~~iK~FFHVN~G~ERiI~GQ~i~DYDAY~LI~~~~~i 78 (113) T protein:vir:41 1 MKRIKGGFQRLLRRYGQEVTLVHHSLKERDQ-RGQPVY-EDDTRPLKAFFHVNRGGERIINGQRISDYDAYVLIPLGVSI 78 (113) T ss_pred CccchhHHHHHHHHhCCchhhhhhhhccccc-cCCCcc-ccCCceEEEEEEeeCCCceeeecceeecccceEEEeeeeee Confidence 2366777789999999999986532 2222 133332 234566676664 234466788777777777777766777 Q ss_pred ccCCEEEeCCeEEEEEecceeCCCCcEEEEEEEEeC Q lcl|NC_012223. 84 KVGDLVSLNNTDYRVINPNPLQPAGQTMLFQLQLRG 119 (119) Q Consensus 84 ~~gD~i~i~G~~~~Vv~v~pi~Pag~~v~y~~q~R~ 119 (119) .-||.|+++|.+|+|-++-| .-+- -++.||- T Consensus 79 ~~~D~I~i~~~rY~V~aiI~---~RTH--~E~~L~R 109 (113) T protein:vir:41 79 MRGDHILIGEDKYTVTSIIK---NRTH--IEATLQR 109 (113) T ss_pred ccCCeEEECCceEEEeeecc---cccc--hhhheee Confidence 88999999999999998865 4442 4555665 No 16 >protein:vir:4161 Length: 112 # NCBI annotation: unknown # Family: family:all:11763 # MgeID: mge:87 # MgeName: psiM2 # Cross-refs: genbank:acc:NP_046970;genbank:gi:9630540;genbank:GeneID:1261714 Probab=93.40 E-value=0.00064 Score=38.10 Aligned_cols=105 Identities=25% Similarity=0.249 Sum_probs=68.7 Q ss_pred ecccChHHHHHHHHHHHHhcCCeEEEEeCC--EeccCCCcccCccceeeeeEEEEe--ccchhhcCCeEEeecCEEEEEc Q lcl|NC_012223. 3 MAGFNYTGLKRKVNPLIKKFGMTATVTRPG--TVDRVDGDEVVIPPTSFDVIGLRE--EYKPSEIDGTRIVAGDVKFLCQ 78 (119) Q Consensus 3 Ma~~~Y~~~~~~A~~ll~~~G~~~tl~r~g--~~dp~~g~~~~~~~~~~~~~gv~~--~~~~~~idGtlI~~GD~~~~~s 78 (119) |.-. -. .-.+||.+||+.++|...- +-|. -|.++. -..+-+++.+|- .=..|-|.|..|--=|--.++. T Consensus 1 mkps-vt----rF~~Li~~YG~~~~LiH~~~~~RD~-RGQPi~-ED~~~~iK~FFHVN~G~ERiI~GQ~i~DYDAY~LI~ 73 (112) T protein:vir:41 1 MKPS-VT----RFNQLIYKYGMDAKLIHKVMNGRDE-RGQPIT-EDTETSIKVFFHVNTGNERLILGQQVVDYDAYALIV 73 (112) T ss_pred CCcc-hH----HHHHHHHHhCCchhhhhhhhccccc-cCCCcc-ccCCceEEEEEEeeCCCceeeecceeecccceEEEe Confidence 3332 12 2357899999999986532 2232 133332 234566677664 2344667887777777777777 Q ss_pred cCcccccCCEEEeCCeEEEEEecceeCCCCcEEEEEEEEeC Q lcl|NC_012223. 
79 AIKQVKVGDLVSLNNTDYRVINPNPLQPAGQTMLFQLQLRG 119 (119) Q Consensus 79 a~~~p~~gD~i~i~G~~~~Vv~v~pi~Pag~~v~y~~q~R~ 119 (119) ......-||.|++||.+|+|-++-| ..+ .-++.||- T Consensus 74 ~~~~i~~~D~I~i~~~~Y~V~aiI~---~RT--H~E~~L~R 109 (112) T protein:vir:41 74 KTLNVNDDDEIEVDGKRYRVGAVIP---QRT--HQELHLRR 109 (112) T ss_pred eeeeeccCCeEEECCceEEEeeecc---ccc--cceeeeee Confidence 6677788999999999999988865 444 35677777 No 17 >protein:vir:106729 Length: 152 # NCBI annotation: gp06 # Family: family:all:3177 # MgeID: mge:1599 # MgeName: Bcep1 # Cross-refs: genbank:acc:NP_944314;genbank:gi:38638613;genbank:GeneID:2657358 Probab=93.39 E-value=0.0023 Score=35.04 Aligned_cols=110 Identities=15% Similarity=0.033 Sum_probs=77.1 Q ss_pred cChHHHHHHHHHHHHhcC--CeEEEEeCCEeccCCCcccCccceeeeeEEEEeccch---hhcCCeEEeecCEEEEEccC Q lcl|NC_012223. 6 FNYTGLKRKVNPLIKKFG--MTATVTRPGTVDRVDGDEVVIPPTSFDVIGLREEYKP---SEIDGTRIVAGDVKFLCQAI 80 (119) Q Consensus 6 ~~Y~~~~~~A~~ll~~~G--~~~tl~r~g~~dp~~g~~~~~~~~~~~~~gv~~~~~~---~~idGtlI~~GD~~~~~sa~ 80 (119) || |.+.|...|..-. .+.++++..+|.-..|...+ .-.+.++..=+...+. +|.||-.||--=+.+|+... T Consensus 1 MN---Lh~Ia~~aI~aVNP~~pA~l~~stG~t~~~G~r~p-~Y~~~~v~vQ~QalS~~dL~h~dglnqQG~~~~iY~~Gn 76 (152) T protein:vir:10 1 MN---LHDIVRGAITQVNPDEPGTMFVSTGRNNVRGILTP-TFSSVDAQLQIQAQKHTPLQHERGALYTNSFLTVYAYGK 76 (152) T ss_pred Cc---hHHhhhhhhhccCCCCceEEEEeccceecCceecc-eeccceeEEEEeecCchHHHHhhcccccceeeEEEeccc Confidence 54 7777777777554 57888887556545554332 2224555555555555 57888766655566777554 Q ss_pred c------ccccCCEEEeCCeEEEEEecceeCCCCcEEEEEEEEeC Q lcl|NC_012223. 81 K------QVKVGDLVSLNNTDYRVINPNPLQPAGQTMLFQLQLRG 119 (119) Q Consensus 81 ~------~p~~gD~i~i~G~~~~Vv~v~pi~Pag~~v~y~~q~R~ 119 (119) . 
.=+=||++.++|++|.|+.+-..+|.=..++-.+|+=+ T Consensus 77 ~~gv~R~~~qGGD~~vf~g~~WLVv~v~E~WpDWc~V~v~lQ~~a 121 (152) T protein:vir:10 77 FDDLSRPLGKGGDFAAFRGGWWYITQFLEWWPDWCAFEVTQQLNA 121 (152) T ss_pred hhheechhhcCccEEEECCceEEEEEcccccccceeeeeeeccCh Confidence 1 23569999999999999999999999888888899888 No 18 >protein:vir:95261 Length: 133 # NCBI annotation: Phage hypothetical protein # Family: family:all:31736 # MgeID: mge:1561 # MgeName: Felix 01 # Cross-refs: genbank:acc:NP_944894;genbank:gi:38707834;genbank:GeneID:2744047 Probab=92.87 E-value=0.0017 Score=35.78 Aligned_cols=101 Identities=18% Similarity=0.177 Sum_probs=61.7 Q ss_pred HHHHHhcCCeEEEEeC---CEeccCCCcccC-ccceeeeeEEEEeccchhhcCCe-------EEeecC-------EEEEE Q lcl|NC_012223. 16 NPLIKKFGMTATVTRP---GTVDRVDGDEVV-IPPTSFDVIGLREEYKPSEIDGT-------RIVAGD-------VKFLC 77 (119) Q Consensus 16 ~~ll~~~G~~~tl~r~---g~~dp~~g~~~~-~~~~~~~~~gv~~~~~~~~idGt-------lI~~GD-------~~~~~ 77 (119) -+|+. -.+.++.|+ |+|--+.|..+. +..++.+++|-+..|+...++.. -...+| -++-+ T Consensus 1 M~~~~--rhs~~~~R~~seg~Y~~~~GrWV~g~~~v~~~i~asIQP~~~ss~~~~q~~~lpeGrrit~avrIYTda~L~v 78 (133) T protein:vir:95 1 MRLLN--RHSFVVKRKVSEDGYYNDDGDWVASQDIVEVNCKGNIQPYIKGSVKNGTQIALPEGIRLTDTRILYTTYKLRT 78 (133) T ss_pred CCccc--cceeEEEEeecCCceEccCCcccCCCCccceeeeeeecccccccccccchhcccCCeeeeeEEEEEeeeeeee Confidence 12222 223444443 667666666666 34457999999999777644321 123334 33334 Q ss_pred ccCcccccCCEEEeCCeEEEEEecceeCCCCc--EEEEEEE-EeC Q lcl|NC_012223. 
78 QAIKQVKVGDLVSLNNTDYRVINPNPLQPAGQ--TMLFQLQ-LRG 119 (119) Q Consensus 78 sa~~~p~~gD~i~i~G~~~~Vv~v~pi~Pag~--~v~y~~q-~R~ 119 (119) +-+..-..||.++++|+.|.|++..|+ -.|+ +-.|+-+ +|- T Consensus 79 age~~~~~gDvvl~dg~eYev~~r~~w-~~Gv~~isHyrY~aVR~ 122 (133) T protein:vir:95 79 SDDVEWNESDIVMIDGHEYEVFMTMDW-SQQLSHTSHYEYIIIRR 122 (133) T ss_pred ecccccCCCcEEEEcCCceEEEEecch-hhccccCCceeEEEEee Confidence 445567889999999999999999994 4555 4455432 333 No 19 >protein:vir:78609 Length: 152 # NCBI annotation: BcepNY3gp05 # Family: family:all:3177 # MgeID: mge:1854 # MgeName: BcepNY3 # Cross-refs: genbank:acc:YP_001294842;genbank:gi:149882905;genbank:GeneID:5291080 Probab=92.76 E-value=0.0042 Score=33.63 Aligned_cols=110 Identities=15% Similarity=0.031 Sum_probs=75.6 Q ss_pred cChHHHHHHHHHHHHhcC--CeEEEEeCCEeccCCCcccCccceeeeeEEEEeccch---hhcCCeEEeecCEEEEEccC Q lcl|NC_012223. 6 FNYTGLKRKVNPLIKKFG--MTATVTRPGTVDRVDGDEVVIPPTSFDVIGLREEYKP---SEIDGTRIVAGDVKFLCQAI 80 (119) Q Consensus 6 ~~Y~~~~~~A~~ll~~~G--~~~tl~r~g~~dp~~g~~~~~~~~~~~~~gv~~~~~~---~~idGtlI~~GD~~~~~sa~ 80 (119) || |.+.|...|..-. .+.++++..+|.-..|.-.+ .-.+.++..=+...+. +|.||-.||--=+.+|+... T Consensus 1 MN---Lh~Ia~~aI~aVNP~~~A~l~~stG~T~~~G~r~P-~Y~~~~v~vQ~QalS~~dL~h~dglnqQG~~~~iY~~Gn 76 (152) T protein:vir:78 1 MN---LHDIVRGAITQVNPDEAGTMFVSTGRTNVRGILTP-TFSSIDAQLQIQAQKHTPLQHERGALYTNSFLTVYAYGK 76 (152) T ss_pred Cc---hHHhhhhhhhccCCCCceEEEEeeceEcCCCcccc-eecceeeEEEEeecCchHHHHhhcccccceeeEEEeccc Confidence 54 7777777777554 57888886555433343222 2223455554555554 57888766655566777554 Q ss_pred c------ccccCCEEEeCCeEEEEEecceeCCCCcEEEEEEEEeC Q lcl|NC_012223. 81 K------QVKVGDLVSLNNTDYRVINPNPLQPAGQTMLFQLQLRG 119 (119) Q Consensus 81 ~------~p~~gD~i~i~G~~~~Vv~v~pi~Pag~~v~y~~q~R~ 119 (119) . 
.=+=||++.++|++|.|+.+-..+|.=..++-.+|+=+ T Consensus 77 ~~gv~R~~~qGGD~~vf~g~~WLVv~v~E~WpDWc~V~v~lQ~~a 121 (152) T protein:vir:78 77 FDDLSRPLGKGGDFAAFRGGWWYITQFLEWWPDWCAFEVTQQLNA 121 (152) T ss_pred hhheechhhcCccEEEECCceEEEEEcccccccceeeeeeeccCh Confidence 1 23569999999999999999999999888888899888 No 20 >protein:vir:105466 Length: 120 # NCBI annotation: hypothetical protein # Family: family:all:2370 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529876;genbank:gi:90592616;genbank:GeneID:3974530 Probab=92.15 E-value=0.0094 Score=31.69 Aligned_cols=112 Identities=11% Similarity=0.002 Sum_probs=61.7 Q ss_pred ecccChHHHHHHHHHHHHhcCCeEEEEeCC--EeccCCCcccCccceeeeeEEEEeccchhhcCCeEEeecCEEEEEccC Q lcl|NC_012223. 3 MAGFNYTGLKRKVNPLIKKFGMTATVTRPG--TVDRVDGDEVVIPPTSFDVIGLREEYKPSEIDGTRIVAGDVKFLCQAI 80 (119) Q Consensus 3 Ma~~~Y~~~~~~A~~ll~~~G~~~tl~r~g--~~dp~~g~~~~~~~~~~~~~gv~~~~~~~~idGtlI~~GD~~~~~sa~ 80 (119) |+-| .+++.|..++ |--.|+|.|-. ..++.+...-..--...||+--..+-+........--.-+.+|+++|+ T Consensus 1 m~~~---~~~rkale~l--Y~d~c~I~r~~~~~~~giT~~~~~~i~e~ipCrlS~~~l~~~~~~~~~~~~~~~kLF~~Pe 75 (120) T protein:vir:10 1 MSYF---NGLKNALPKL--WNDRVKIVGTQPTKHGYITNSEDVTIVEDEPAKVVLKGQSTSEQSFFGTDEYDAKLIIRNG 75 (120) T ss_pred CchH---HHHHHHhhhh--hcCeEEEEEeeeeecCCccCceeeEEecCCcceEeeccccccccccccceeeEEEEEeecC Confidence 5554 4444443333 33457776532 233443322222223456665555443332221111256789999999 Q ss_pred cccccCCEEEe---CCeEEEEE-eccee--CCCCcEEEEEEEEeC Q lcl|NC_012223. 81 KQVKVGDLVSL---NNTDYRVI-NPNPL--QPAGQTMLFQLQLRG 119 (119) Q Consensus 81 ~~p~~gD~i~i---~G~~~~Vv-~v~pi--~Pag~~v~y~~q~R~ 119 (119) ..++.||+|+| +|....-. +-.|. 
-|.=.-|.-++.-|| T Consensus 76 idIk~Gd~I~Vt~~nG~~~~y~~s~~p~a~Y~sHqEI~L~~~eka 120 (120) T protein:vir:10 76 IKIPAGADIYVTDVNGQMTKYKRASKGYSGYFSHQEVAMVRSEKA 120 (120) T ss_pred cccCCCCEEEEEecCCCEEEEEeccCCCccccccceEEEEEeecC Confidence 99999999988 36666554 44554 455455666666677 No 21 >protein:vir:77651 Length: 152 # NCBI annotation: gp06 # Family: family:all:3177 # MgeID: mge:75 # MgeName: Bcep781 # Cross-refs: genbank:acc:YP_022740;genbank:gi:47835021;genbank:GeneID:2821448 Probab=92.05 E-value=0.0029 Score=34.48 Aligned_cols=110 Identities=15% Similarity=0.015 Sum_probs=75.3 Q ss_pred cChHHHHHHHHHHHHhcC--CeEEEEeCCEeccCCCcccCccceeeeeEEEEeccch---hhcCCeEEeecCEEEEEccC Q lcl|NC_012223. 6 FNYTGLKRKVNPLIKKFG--MTATVTRPGTVDRVDGDEVVIPPTSFDVIGLREEYKP---SEIDGTRIVAGDVKFLCQAI 80 (119) Q Consensus 6 ~~Y~~~~~~A~~ll~~~G--~~~tl~r~g~~dp~~g~~~~~~~~~~~~~gv~~~~~~---~~idGtlI~~GD~~~~~sa~ 80 (119) || |.+.|...|..-. .+.++++..+|.-..|...+ .-.+.++..=+...+. +|.||-.||--=+.+|+... T Consensus 1 MN---Lh~Ia~~aI~aVNP~~pA~l~~stG~T~~~G~r~P-~Y~~~~v~vQ~QalS~~dL~h~dglnqQG~~~~iY~~Gn 76 (152) T protein:vir:77 1 MN---LHDIVRGAITQVNPDEPGTMFVSTGRTNVRGILTP-MFSSVNAQLQIQAQKHTPLQHERGALYTNSFLTVYAYGK 76 (152) T ss_pred Cc---hHHhhhhhhhccCCCCceEEEEeeceEcCCCcccc-eecceeeEEEEeecCchHHHHhhcccccceeeEEEeccc Confidence 54 7777877787554 57888886545433343222 2223455544555554 57888766655566676554 Q ss_pred c------ccccCCEEEeCCeEEEEEecceeCCCCcEEEEEEEEeC Q lcl|NC_012223. 81 K------QVKVGDLVSLNNTDYRVINPNPLQPAGQTMLFQLQLRG 119 (119) Q Consensus 81 ~------~p~~gD~i~i~G~~~~Vv~v~pi~Pag~~v~y~~q~R~ 119 (119) . 
.=+=||+++++|++|.|+.+-..+|.=..+.-.+|+=+ T Consensus 77 ~~gv~R~~~qGGD~~vf~g~~WLVv~v~E~WpDWc~V~v~lQ~~a 121 (152) T protein:vir:77 77 FDDLSRPLGKGGDFASFRGGWWYITQFLEWWPDWCAFEVTQQLNA 121 (152) T ss_pred hhheechhhcCccEEEECCceEEEEEcccccchhhhhhhhhhhch Confidence 1 23569999999999999999999999888888888888 No 22 >protein:vir:101560 Length: 152 # NCBI annotation: gp06 # Family: family:all:3177 # MgeID: mge:1477 # MgeName: Bcep43 # Cross-refs: genbank:acc:NP_958110;genbank:gi:41057656;genbank:GeneID:2716817 Probab=91.33 E-value=0.004 Score=33.75 Aligned_cols=110 Identities=15% Similarity=0.012 Sum_probs=75.2 Q ss_pred cChHHHHHHHHHHHHhcC--CeEEEEeCCEeccCCCcccCccceeeeeEEEEeccch---hhcCCeEEeecCEEEEEccC Q lcl|NC_012223. 6 FNYTGLKRKVNPLIKKFG--MTATVTRPGTVDRVDGDEVVIPPTSFDVIGLREEYKP---SEIDGTRIVAGDVKFLCQAI 80 (119) Q Consensus 6 ~~Y~~~~~~A~~ll~~~G--~~~tl~r~g~~dp~~g~~~~~~~~~~~~~gv~~~~~~---~~idGtlI~~GD~~~~~sa~ 80 (119) || |.+.|...|..-. .+.++++..+|.-..|...+ .-.+.++..=+...+. +|.||-.||--=+.+|+... T Consensus 1 MN---Lh~Ia~~aI~aVNP~~pA~l~~stG~T~~~G~r~P-~Y~~~~v~vQ~QalS~~dL~h~dglnqQG~~~~iY~~Gn 76 (152) T protein:vir:10 1 MN---LHDIVRGAITQVNPDEPGTMFVSTGRTNVRGILTP-MFSSVNAQLQIQAQKHTPLQHERGALYTNSFLTVYAYGK 76 (152) T ss_pred Cc---hHHhhhhhhcccCCCCceEEEEeeceEcCCCcccc-eecceeeEEEEeecCchHHHHhhcccccceeeEeEeccc Confidence 54 7777777777554 57888886545433343222 2223455554555554 57888766655566777554 Q ss_pred c------ccccCCEEEeCCeEEEEEecceeCCCCcEEEEEEEEeC Q lcl|NC_012223. 81 K------QVKVGDLVSLNNTDYRVINPNPLQPAGQTMLFQLQLRG 119 (119) Q Consensus 81 ~------~p~~gD~i~i~G~~~~Vv~v~pi~Pag~~v~y~~q~R~ 119 (119) . 
.=+=||++.++|++|.|+.+-..+|.=..++-.+|+=+ T Consensus 77 ~~gv~R~~~qGGD~~vf~g~~WLVv~v~E~WpDWc~V~v~lQ~~a 121 (152) T protein:vir:10 77 FDDLSRPLGKGGDFAAFRGGWWYITQFLEWWPDWCAFEVTQQLNA 121 (152) T ss_pred hhheechhhcCccEEEECCceEEEEEcccccchhhhhhhhhhhch Confidence 1 23569999999999999999999999888888888888 No 23 >protein:vir:100886 Length: 113 # NCBI annotation: putative head-tail joining protein # Family: family:all:1030 # MgeID: mge:1473 # MgeName: Lc-Nu # Cross-refs: genbank:acc:YP_358766;genbank:gi:77999992;genbank:GeneID:3726157 Probab=89.61 E-value=0.016 Score=30.47 Aligned_cols=104 Identities=12% Similarity=0.199 Sum_probs=65.4 Q ss_pred ecccChHHHHHHHHHHHHhcCCeEEEEeCCEeccCCCcccCccceeeeeEEEEeccchhh---cCCeEEeecCEEEEEcc Q lcl|NC_012223. 3 MAGFNYTGLKRKVNPLIKKFGMTATVTRPGTVDRVDGDEVVIPPTSFDVIGLREEYKPSE---IDGTRIVAGDVKFLCQA 79 (119) Q Consensus 3 Ma~~~Y~~~~~~A~~ll~~~G~~~tl~r~g~~dp~~g~~~~~~~~~~~~~gv~~~~~~~~---idGtlI~~GD~~~~~sa 79 (119) |..++ ...|-.-++.-..++..|++|.+..+-...|.|.+-...-+..+ +-|+ -......+++-. T Consensus 1 m~k~~-----------p~~ln~ri~FG~~~t~~~~~G~~~~~f~~~f~~~~~~~t~t~~q~~~~~Gt-~~e~t~~~vIRh 68 (113) T protein:vir:10 1 MAKFK-----------VADFSRKVDLGSPKSHTTGAGLNITSFVPNYSLHFKQQTRTLTQQYTLVGT-RLDNSITVIVRH 68 (113) T ss_pred CCccc-----------ccccceEEEeeeeecccCCCCcccceeEeeEEEEEEEeecchheeeeeccc-cccccEEEEEEe Confidence 55543 33333334433344567788887777777788877776554442 3354 334556666655 Q ss_pred CcccccCCEEEeCCeEEEEEecceeCCCCcEEEEEEE-EeC Q lcl|NC_012223. 80 IKQVKVGDLVSLNNTDYRVINPNPLQPAGQTMLFQLQ-LRG 119 (119) Q Consensus 80 ~~~p~~gD~i~i~G~~~~Vv~v~pi~Pag~~v~y~~q-~R~ 119 (119) .......=++.+||+.|.|+++.| +...-.+.|-+. |+. 
T Consensus 69 ~~~it~~m~v~~~g~~Y~I~~Is~-Dd~~~~~~yD~iTlkk 108 (113) T protein:vir:10 69 DPRNASQKQARLDGIVYDISDISP-DDSNDAIRYDYLTLVK 108 (113) T ss_pred CCCCCcccEEEECCeEEEEEEeCC-CCCCCcceeeeEEEEE Confidence 555566667899999999999998 555445666554 555 No 24 >protein:vir:81177 Length: 109 # NCBI annotation: putative head-tail adaptor # Family: family:all:116 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285814;genbank:gi:148747735;genbank:GeneID:5247220 Probab=88.57 E-value=0.031 Score=28.82 Aligned_cols=102 Identities=10% Similarity=0.003 Sum_probs=64.0 Q ss_pred cChHHHHHHHHHHHHhcCCeEEEEeCCE-eccCCCcccCccceeeeeEEEEeccchhhcC--CeEEeecCEEEEEccCcc Q lcl|NC_012223. 6 FNYTGLKRKVNPLIKKFGMTATVTRPGT-VDRVDGDEVVIPPTSFDVIGLREEYKPSEID--GTRIVAGDVKFLCQAIKQ 82 (119) Q Consensus 6 ~~Y~~~~~~A~~ll~~~G~~~tl~r~g~-~dp~~g~~~~~~~~~~~~~gv~~~~~~~~id--GtlI~~GD~~~~~sa~~~ 82 (119) |+=. +.-...++.++.. .|. .|+....-...+.|-|-+...+.++.. +.+...-...+.+-.... T Consensus 1 M~~g-----------~L~~rI~i~~~~~~~d~-~G~~~~~w~~~~~~wA~v~~~s~~e~~~a~~~~~~~~~~f~iR~~~~ 68 (109) T protein:vir:81 1 MNPG-----------QFRHKITLMKLVTTQDE-IGNTIEEWQPVRTCWAAIKTVNGREYFAAASVQAERTYRFIIRYTPG 68 (109) T ss_pred CCcc-----------ccCccEEEEeeeeeeCC-CCCeecceeeEEEEEEEEEecCchheeeccceeeeeeEEEEEEeCCC Confidence 1111 1122355555543 343 455555566678899999988888763 333333344455544446 Q ss_pred cccCCEEEeCCeEEEEEecceeCCCCcEEEEEEEEeC Q lcl|NC_012223. 83 VKVGDLVSLNNTDYRVINPNPLQPAGQTMLFQLQLRG 119 (119) Q Consensus 83 p~~gD~i~i~G~~~~Vv~v~pi~Pag~~v~y~~q~R~ 119 (119) ++..++|..+|+.|.|.++.|......-+.-.+.-.. 
T Consensus 69 i~~~~ri~~~g~~y~I~~v~~~~~~~~~l~i~~~e~~ 105 (109) T protein:vir:81 69 INETMKIDYQGRLFDIQSVLNDDEGKKTLTIIATERV 105 (109) T ss_pred CCcccEEEECCeEEEEEeecCCccCCcEEEEEEEEee Confidence 7899999999999999999988887755444444333 No 25 >protein:vir:78985 Length: 115 # NCBI annotation: hypothetical protein # Family: family:all:2370 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110728;genbank:gi:134287345;genbank:GeneID:4955159 Probab=87.98 E-value=0.035 Score=28.55 Aligned_cols=108 Identities=10% Similarity=0.094 Sum_probs=60.7 Q ss_pred HHHHHHHHHHh-cCCeEEEEeCC-EeccCCCcccCccce---eeeeEEEEeccchhhcCCeEEeecCEEEEEccCccccc Q lcl|NC_012223. 11 LKRKVNPLIKK-FGMTATVTRPG-TVDRVDGDEVVIPPT---SFDVIGLREEYKPSEIDGTRIVAGDVKFLCQAIKQVKV 85 (119) Q Consensus 11 ~~~~A~~ll~~-~G~~~tl~r~g-~~dp~~g~~~~~~~~---~~~~~gv~~~~~~~~idGtlI~~GD~~~~~sa~~~p~~ 85 (119) |-.-|++.|.. |--.|+|.|.. .-||.+|........ ..||+--+.+-+.....-..--.-+.+|+++++..++. T Consensus 1 ~~~~~rk~le~lY~d~c~I~r~~~~~dp~tgiT~~~~~~i~e~~pCrlS~~~~~~~~~~~~~~~~~~~kLF~~PeidIk~ 80 (115) T protein:vir:78 1 MVSKTRKAIEMLYRYKCTIVEYQPIKDPVTKRTNNKEVIVLENQPCKLSYKNIVSATEGKLAKLEQTIKLFISPDIEIKA 80 (115) T ss_pred CchHHHHHhhhhhcCeEEEEeeeeeeccccceeccccEEEEcCCcceEEeccCCccccccccceeEEEEEEeecCcccCC Confidence 55556666664 44468888764 368877643222111 24777666554443221122335569999999999999 Q ss_pred CCEEEeCCeEEEEEecceeCCCC-cEEEEE-EEEeC Q lcl|NC_012223. 86 GDLVSLNNTDYRVINPNPLQPAG-QTMLFQ-LQLRG 119 (119) Q Consensus 86 gD~i~i~G~~~~Vv~v~pi~Pag-~~v~y~-~q~R~ 119 (119) ||+|+|..+.|.- +-.|-.+.. 
.-|.-+ .+-+| T Consensus 81 Gd~I~Vt~~~y~~-a~~p~~Y~tHqEI~L~l~~~~a 115 (115) T protein:vir:78 81 GSKLIINDKEYVR-SGESAIYPNHQEIILELFKDKA 115 (115) T ss_pred CCEEEEeceEeEE-eecCeecCCcceEEEEEeeecC Confidence 9999999876532 222322222 112222 24455 No 26 >protein:vir:4858 Length: 116 # NCBI annotation: putative head-tail joining protein # Family: family:all:1030 # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049398;genbank:gi:9632426;genbank:GeneID:1258519 Probab=87.59 E-value=0.035 Score=28.56 Aligned_cols=106 Identities=17% Similarity=0.157 Sum_probs=64.4 Q ss_pred CeecccChHHHHHHHHHHHHhcCCeEEEEeC-CEeccCCCcccCccceeeeeEEEEeccchhhc---CCeEEeecCEEEE Q lcl|NC_012223. 1 MIMAGFNYTGLKRKVNPLIKKFGMTATVTRP-GTVDRVDGDEVVIPPTSFDVIGLREEYKPSEI---DGTRIVAGDVKFL 76 (119) Q Consensus 1 ~~Ma~~~Y~~~~~~A~~ll~~~G~~~tl~r~-g~~dp~~g~~~~~~~~~~~~~gv~~~~~~~~i---dGtlI~~GD~~~~ 76 (119) |-|..++ ...|-.-++.-.. ..-||.+|.+...-...+.|.|-...-+.++. -|+ -......++ T Consensus 1 M~~~k~~-----------~~~ln~ri~Fg~~~~~~n~~~G~~~~~f~~~~~~w~~~~~~t~~q~~~~~gt-~~e~T~~~v 68 (116) T protein:vir:48 1 MARVGYL-----------PSDFRYKADFGTYQSTPNKFTGVSVPKFVKQFTLHYKPHTRTLNQEYLAQQN-GESDTIVIV 68 (116) T ss_pred Ccccccc-----------cccccEeEEeeeeeeeecCCCCcccceeeeeEEEEEEeeecchheeeeeecc-cccccEEEE Confidence 6655543 2233333333222 23466677766666667888888877666543 233 234445555 Q ss_pred EccCcccccCCEEEeCCeEEEEEecceeCCCCcEEEE-EEEEeC Q lcl|NC_012223. 77 CQAIKQVKVGDLVSLNNTDYRVINPNPLQPAGQTMLF-QLQLRG 119 (119) Q Consensus 77 ~sa~~~p~~gD~i~i~G~~~~Vv~v~pi~Pag~~v~y-~~q~R~ 119 (119) +--.......=++.++|+.|.|+++.| +...-.+.| -+.++. 
T Consensus 69 IRh~~~i~~~~~v~~~G~~Y~I~~Is~-D~~~~~~~yD~iTlk~ 111 (116) T protein:vir:48 69 IRHNAKVLEGQVVTLNGTQYDIVRISA-DENFGFNHYDFLTLRK 111 (116) T ss_pred EEeCCCCCcccEEEECCeEEEEEEeCC-CCCcCcceeeEEEEEE Confidence 544444556667999999999999998 666656677 455666 No 27 >protein:vir:5977 Length: 109 # NCBI annotation: hypothetical protein # Family: family:all:788 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690677;genbank:geneid:6329133;genbank:gi:22855071;interpro:IPR013045;uniprot:O48446;genbank:GeneID:955315 Probab=86.01 E-value=0.049 Score=27.78 Aligned_cols=98 Identities=9% Similarity=-0.011 Sum_probs=63.0 Q ss_pred HHHhcCCeEEEEeCCE-eccCCCcccCccceeeeeEEEEeccchhhcC--CeEEeecCEEEEEccCcccccCCEEEeCCe Q lcl|NC_012223. 18 LIKKFGMTATVTRPGT-VDRVDGDEVVIPPTSFDVIGLREEYKPSEID--GTRIVAGDVKFLCQAIKQVKVGDLVSLNNT 94 (119) Q Consensus 18 ll~~~G~~~tl~r~g~-~dp~~g~~~~~~~~~~~~~gv~~~~~~~~id--GtlI~~GD~~~~~sa~~~p~~gD~i~i~G~ 94 (119) |+.|+-...++.++.. .|+. |.+...=...+.+-|-+...+.++.. +..-..-..++.+=....+..+++|..+|+ T Consensus 1 ~~~~L~~RI~i~~~~~~~D~~-G~~~~~w~~~~~~WA~v~~~sg~E~~~a~~~~~~~~~~i~iRy~~~I~~~~Ri~~~gr 79 (109) T protein:vir:59 1 MYEEFPDVITFQSYVEQSNGE-GGKTYKWVDEFTAAAHVQPISQEEYYKAQQLQTPIGYNIYTPYDDRIDKKMRVIYRGK 79 (109) T ss_pred CccccCccEEEEeeeeeeCCC-CCeeeeeEeeEEEEEEEecCChhheeeccccceeeEEEEEEeeCCCCCcccEEEECCe Confidence 7888888888887653 4543 43443333457788888888888653 333334455555544445688999999999 Q ss_pred EEEEEecceeCCCCcEEEEEEEEeC Q lcl|NC_012223. 
95 DYRVINPNPLQPAGQTMLFQLQLRG 119 (119) Q Consensus 95 ~~~Vv~v~pi~Pag~~v~y~~q~R~ 119 (119) .|.|+++- .++.+- ...+.+++ T Consensus 80 ~y~I~~v~-~d~~~~--~~~l~~~~ 101 (109) T protein:vir:59 80 IVTFIGDP-VDLSGL--QEITRIKG 101 (109) T ss_pred EEEEEecc-CCCCCC--eEEEEEEE Confidence 99999873 234332 23344444 No 28 >protein:vir:100226 Length: 114 # NCBI annotation: putative head-tail joining protein # Family: family:all:1030 # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025033;genbank:gi:48697266;genbank:GeneID:2948324 Probab=84.78 E-value=0.058 Score=27.38 Aligned_cols=105 Identities=11% Similarity=0.213 Sum_probs=62.9 Q ss_pred eecccChHHHHHHHHHHHHhcCCeEEEEeCCEeccCCCcccCccceeeeeEEEEeccchhh---cCCeEEeecCEEEEEc Q lcl|NC_012223. 2 IMAGFNYTGLKRKVNPLIKKFGMTATVTRPGTVDRVDGDEVVIPPTSFDVIGLREEYKPSE---IDGTRIVAGDVKFLCQ 78 (119) Q Consensus 2 ~Ma~~~Y~~~~~~A~~ll~~~G~~~tl~r~g~~dp~~g~~~~~~~~~~~~~gv~~~~~~~~---idGtlI~~GD~~~~~s 78 (119) -|..++ +..|-.-++.-..++.++++|.+...-...|.|.+-...-+..+ +-|+ -......+++- T Consensus 1 m~~~~~-----------p~~ln~ri~FG~~~t~~~~~G~~~~~fv~~f~~~~~~~t~t~~q~~~~~Gt-~~e~t~~~vIR 68 (114) T protein:vir:10 1 MMAKFK-----------VADFSRKVDLGSPQSHKTGAGINITSFVPNYSLHFKQQTRTLTQQYTLVGT-RLDNSITIIVR 68 (114) T ss_pred CCcccc-----------ccccceEEEeeeeeeecCCCCcccceeeeeEEEEEEEeecchheeeeeccc-cccccEEEEEE Confidence 122222 23333334443344678888887776666788777776554442 3354 33445566665 Q ss_pred cCcccccCCEEEeCCeEEEEEecceeCCCCcEEEEEEE-EeC Q lcl|NC_012223. 79 AIKQVKVGDLVSLNNTDYRVINPNPLQPAGQTMLFQLQ-LRG 119 (119) Q Consensus 79 a~~~p~~gD~i~i~G~~~~Vv~v~pi~Pag~~v~y~~q-~R~ 119 (119) ........=++.+||+.|.|+++.| +...-.+.|-+. |+. 
T Consensus 69 h~~~it~~m~v~~~g~~Y~Iv~Isp-Dd~~~~~~yD~iTlkk 109 (114) T protein:vir:10 69 HDTRNASQKQARLDGIVYDISDISP-DDSNDAIRYDYLTLVK 109 (114) T ss_pred eCCCCCcccEEEECCeEEEEEEeCC-CCccCcceeeeEEEEE Confidence 4444455557899999999999998 555555666654 555 No 29 >protein:vir:102144 Length: 113 # NCBI annotation: phage head-tail adaptor, putative # Family: family:all:3858 # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699939;genbank:gi:110804032;genbank:GeneID:4206688 Probab=84.37 E-value=0.061 Score=27.24 Aligned_cols=103 Identities=8% Similarity=0.007 Sum_probs=65.3 Q ss_pred cChHHHHHHHHHHHHhcCCeEEEEeCCEeccCCCcccCccceeeeeEEEEeccchhhc--CCeEEeecCEEEEEc--cC- Q lcl|NC_012223. 6 FNYTGLKRKVNPLIKKFGMTATVTRPGTVDRVDGDEVVIPPTSFDVIGLREEYKPSEI--DGTRIVAGDVKFLCQ--AI- 80 (119) Q Consensus 6 ~~Y~~~~~~A~~ll~~~G~~~tl~r~g~~dp~~g~~~~~~~~~~~~~gv~~~~~~~~i--dGtlI~~GD~~~~~s--a~- 80 (119) |+-.+| -...++.++.....+.|.+...-...+++-|-+...+.++. .+.........+.+= ++ T Consensus 1 M~~G~L-----------~~rI~i~~~~~~~d~~G~~~~~w~~~~~~wA~v~~~~g~E~~~a~~~~~~~~~~f~iRy~~~i 69 (113) T protein:vir:10 1 MAECRL-----------NERIIIEELAIIQNSNGFEEEKWHEYYRCWSSFKKVKGSKFIAAKADNAENIVTFTIRYCNKV 69 (113) T ss_pred CCcccc-----------CceEEEEeeeeccCCCCCeecceEeEEEEEEEEEecCchheeeccceeeeeeEEEEEEecCCC Confidence 221122 22356666543322355555555566889999988888765 455444555555552 22 Q ss_pred -----cccccCCEEEeCCeEEEEEecceeCCCCcEEEEEEEEeC Q lcl|NC_012223. 
81 -----KQVKVGDLVSLNNTDYRVINPNPLQPAGQTMLFQLQLRG 119 (119) Q Consensus 81 -----~~p~~gD~i~i~G~~~~Vv~v~pi~Pag~~v~y~~q~R~ 119 (119) ..+..+++|..+|..|.|.++.+.+..+.-+.-.+..-+ T Consensus 70 ~~~~~~~it~~~ri~~~g~~y~I~~i~~~~~~~~~l~i~~~~v~ 113 (113) T protein:vir:10 70 KILLDIEAINKFRINFKGHYYKLEYVDDYDQGHEWVDLKAKIIS 113 (113) T ss_pred cccccccCCCCCeEEECCeEEEEEecCCcccCCeEEEEEEEEeC Confidence 245678999999999999999988888865555555555 No 30 >protein:vir:4459 Length: 134 # NCBI annotation: hypothetical protein # Family: family:all:116 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700382;genbank:gi:23505454;genbank:GeneID:955661 Probab=83.09 E-value=0.063 Score=27.16 Aligned_cols=111 Identities=10% Similarity=0.001 Sum_probs=64.2 Q ss_pred ecccChHHHHHHHHHHHHhcC---CeEEEEeCCE-eccCCCcccCccceeeeeEEEEeccchhhc--CCeEEeecCEEEE Q lcl|NC_012223. 3 MAGFNYTGLKRKVNPLIKKFG---MTATVTRPGT-VDRVDGDEVVIPPTSFDVIGLREEYKPSEI--DGTRIVAGDVKFL 76 (119) Q Consensus 3 Ma~~~Y~~~~~~A~~ll~~~G---~~~tl~r~g~-~dp~~g~~~~~~~~~~~~~gv~~~~~~~~i--dGtlI~~GD~~~~ 76 (119) |+.-. ++=+|+| ++=+.| .-.++.++.. .|+. |+....-...+.+-|-+...+.++. .+.+-.....++. T Consensus 1 ~~~~~-~~~~~~~--~~M~aG~L~~RI~i~~~~~~~D~~-G~~~~~w~~~~~vwA~v~~~sg~E~~~a~~~~~~~t~~i~ 76 (134) T protein:vir:44 1 MKIRQ-AQTSATY--LLPDPGELDQRIVIRRRVDVPADD-FGVTPTYPEQIRTWAKKAQPGAAAYQGSVQIENRVTHYFT 76 (134) T ss_pred Ccccc-ccceeeE--eccCccccCccEEEEeeeeeeCCC-CCeecceEeeEEEEEEEEecCchheeeccceeeeeeEEEE Confidence 44321 1112221 111222 2345555543 4443 4444444556888888888888764 3333334556666 Q ss_pred EccCcccccCCEEEeCCeEEEEEecceeCCCCcEEEEEEEEeC Q lcl|NC_012223. 77 CQAIKQVKVGDLVSLNNTDYRVINPNPLQPAGQTMLFQLQLRG 119 (119) Q Consensus 77 ~sa~~~p~~gD~i~i~G~~~~Vv~v~pi~Pag~~v~y~~q~R~ 119 (119) +-....+..+++|..+|+.|.|.++.+.+..+-- .++..+. 
T Consensus 77 IR~~~~It~~~RI~~~g~~y~I~~I~~~~~~~~~--L~i~c~e 117 (134) T protein:vir:44 77 IRFRRGITADHEVLHDDISYRVKRVRDLNGKRRF--LLIECEA 117 (134) T ss_pred EEeCCCCCcccEEEECCeEEEEEEecCCCcCCcE--EEEEEEE Confidence 6444457889999999999999999877777653 3444444 No 31 >protein:vir:102962 Length: 115 # NCBI annotation: hypothetical protein # Family: family:all:2370 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945288;genbank:gi:39653723;uniprot:Q708M4;genbank:GeneID:2672876 Probab=82.93 E-value=0.072 Score=26.85 Aligned_cols=105 Identities=10% Similarity=0.071 Sum_probs=54.6 Q ss_pred HHHHHHHh-cCCeEEEEeCCEeccCCC---cccCccceeeeeEEEEeccchhhcCCeEEeecCEEEEEccCcccccCCEE Q lcl|NC_012223. 14 KVNPLIKK-FGMTATVTRPGTVDRVDG---DEVVIPPTSFDVIGLREEYKPSEIDGTRIVAGDVKFLCQAIKQVKVGDLV 89 (119) Q Consensus 14 ~A~~ll~~-~G~~~tl~r~g~~dp~~g---~~~~~~~~~~~~~gv~~~~~~~~idGtlI~~GD~~~~~sa~~~p~~gD~i 89 (119) +|++.|.. |--.|+|.|.-. ++.+| .....--...||+--..+-+.....-+.--.-+.+|+++++..++.||+| T Consensus 1 ~ark~le~lY~d~ctI~r~~k-~~~~g~T~~~~~~i~e~ipCrlS~~~l~~~~q~~~~~~~~~~kLF~~PeidIk~Gd~I 79 (115) T protein:vir:10 1 MLRQSLDCLYACKMTVKGYTE-QEIDGLTSMLESVLLEDIPCRISQMSNSSTNGSDYQANGYDMKLFCSVAYDIPAGCKI 79 (115) T ss_pred CchhHhhhhhcCeEEEEEeec-ccCCCccCccceEEecCCcceEEeccCcccccccccceeEEEEEEeecCcccCCCCEE Confidence 44444543 334578876422 33333 22222223356666555543322211112256799999999999999999 Q ss_pred EeC---CeEEEEEecce---eCCCCcEEEEEEEEeC Q lcl|NC_012223. 
90 SLN---NTDYRVINPNP---LQPAGQTMLFQLQLRG 119 (119) Q Consensus 90 ~i~---G~~~~Vv~v~p---i~Pag~~v~y~~q~R~ 119 (119) +|- |....-.+-.| .=|.=.-|.-++--+| T Consensus 80 ~Vt~~~g~~~~~~~s~~~~~~Y~tHqEV~L~l~e~a 115 (115) T protein:vir:10 80 EVTDRNGHVKVFTRSNVPIGQYWSHQEIAIKLEGKS 115 (115) T ss_pred EEEcCCCeEEEEEecCCccccccccceEEEEEeecC Confidence 873 54432222332 2333344444555555 No 32 >protein:vir:1890 Length: 110 # NCBI annotation: gp9 # Family: family:all:116 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037670;genbank:gi:9634128;genbank:GeneID:1262503 Probab=82.86 E-value=0.073 Score=26.81 Aligned_cols=103 Identities=9% Similarity=0.003 Sum_probs=62.8 Q ss_pred cChHHHHHHHHHHHHhcCCeEEEEeCCE-eccCCCcccCccceeeeeEEEEeccchhhc--CCeEEeecCEEEEEccCcc Q lcl|NC_012223. 6 FNYTGLKRKVNPLIKKFGMTATVTRPGT-VDRVDGDEVVIPPTSFDVIGLREEYKPSEI--DGTRIVAGDVKFLCQAIKQ 82 (119) Q Consensus 6 ~~Y~~~~~~A~~ll~~~G~~~tl~r~g~-~dp~~g~~~~~~~~~~~~~gv~~~~~~~~i--dGtlI~~GD~~~~~sa~~~ 82 (119) |+=. ++-.-.++.++.. .|+.+|.....-...+++-|-+...+.++. .+........++.+=.... T Consensus 1 M~~G-----------~L~~rI~i~~~~~~~d~~~G~~~~~~~~~~~~wA~v~~~~~~e~~~a~~~~~~~~~~~~iR~~~~ 69 (110) T protein:vir:18 1 MQAG-----------KLRHRITLQEPVKVQNPTTGAVINTWRDVATVRAEVSPLSAREFIAAQASQGEITTRIVIRYRAG 69 (110) T ss_pred CCcc-----------ccCccEEEEeeeeeecCCCCccccceeeeEEEEEEEEecCchheeecceeeeeeeEEEEEEecCC Confidence 1111 1112355556543 566666656555667899999888888765 3444445555555544445 Q ss_pred cccCCEEEeCCeEEEEEeccee-CCCCcEEEEEEEEeC Q lcl|NC_012223. 83 VKVGDLVSLNNTDYRVINPNPL-QPAGQTMLFQLQLRG 119 (119) Q Consensus 83 p~~gD~i~i~G~~~~Vv~v~pi-~Pag~~v~y~~q~R~ 119 (119) ++.+++|..+|..|.|.++.|- +....-+.-.+.-+. 
T Consensus 70 I~~~~ri~~~g~~y~I~~v~~d~~~~~~~l~i~~~e~~ 107 (110) T protein:vir:18 70 VTRKHRILFRGAVYNIHGVLPDPKSGREYLTLPCSEGV 107 (110) T ss_pred CCcccEEEECCeEEEEEeccCCcccCCeEEEEEEEEec Confidence 7899999999999999999762 223333333444333 No 33 >protein:vir:78106 Length: 236 # NCBI annotation: hypothetical protein # Family: family:all:29846 # MgeID: mge:1847 # MgeName: Min1 # Cross-refs: genbank:acc:YP_001294804;genbank:gi:149882825;genbank:GeneID:5309134 Probab=81.59 E-value=0.012 Score=31.05 Aligned_cols=102 Identities=18% Similarity=0.169 Sum_probs=66.1 Q ss_pred CeecccChHHHHHHHHHHHHhcCCeEEEEeCCEeccCCCcccCccceeeeeEEEEeccchhh-cCCeEEeecCEEEEE-- Q lcl|NC_012223. 1 MIMAGFNYTGLKRKVNPLIKKFGMTATVTRPGTVDRVDGDEVVIPPTSFDVIGLREEYKPSE-IDGTRIVAGDVKFLC-- 77 (119) Q Consensus 1 ~~Ma~~~Y~~~~~~A~~ll~~~G~~~tl~r~g~~dp~~g~~~~~~~~~~~~~gv~~~~~~~~-idGtlI~~GD~~~~~-- 77 (119) |.+-.|- |+++. +++.-|+|....+-+..+.+.++.|.+.+-++.+ -|---.|+--.-++. T Consensus 125 mhldsFV--RlRA~--------------RkpdPYNpaqt~eDWt~P~EL~i~GalassSStrtpD~ld~qt~Sta~Lti~ 188 (236) T protein:vir:78 125 MHLDSFV--RQRAA--------------RAPDPYNPDSTVEDWTAPEELPLDGFFDPASSTEEYDAVRKKAVSRELLVLW 188 (236) T ss_pred hhhHHHH--HHhhh--------------cCCCCCCcccCCccccCccceeecccccccccccccchhheeecceeEEEee Confidence 6555553 33332 3456688876566666778999999999887754 444334443333333 Q ss_pred ccCcccccCCEEEeCCeEEEEEecceeCCCCcEEEEEEEE-------eC Q lcl|NC_012223. 78 QAIKQVKVGDLVSLNNTDYRVINPNPLQPAGQTMLFQLQL-------RG 119 (119) Q Consensus 78 sa~~~p~~gD~i~i~G~~~~Vv~v~pi~Pag~~v~y~~q~-------R~ 119 (119) .+.+-.+.||+|.-||+.|.|--. 
|-+|++.--.|+..+ || T Consensus 189 DpnADV~~GDRIp~dgR~WeV~GF-PS~daNaFTGwrptlevrltE~~G 236 (236) T protein:vir:78 189 DPNADVERGDRIVQGSRVWTVEGF-PSAPMNPFTGRRRHRFVRVMEAVG 236 (236) T ss_pred CCCCCeeecccccCCceeEEeccC-CCCCCcccccccceeEEEEeeccC Confidence 344567899999999999998877 567777544555443 22 No 34 >protein:vir:102084 Length: 107 # NCBI annotation: head-tail adaptor # Family: family:all:116 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512317;genbank:gi:89152486;genbank:GeneID:3953077 Probab=80.48 E-value=0.094 Score=26.20 Aligned_cols=102 Identities=15% Similarity=0.072 Sum_probs=61.9 Q ss_pred cChHHHHHHHHHHHHhcCCeEEEEeCCE-eccCCCcccCccceeeeeEEEEeccchhhcC--CeEEeecCEEEEEccCcc Q lcl|NC_012223. 6 FNYTGLKRKVNPLIKKFGMTATVTRPGT-VDRVDGDEVVIPPTSFDVIGLREEYKPSEID--GTRIVAGDVKFLCQAIKQ 82 (119) Q Consensus 6 ~~Y~~~~~~A~~ll~~~G~~~tl~r~g~-~dp~~g~~~~~~~~~~~~~gv~~~~~~~~id--GtlI~~GD~~~~~sa~~~ 82 (119) |+=. +.-...++.++.. .+...|+....-...+++-|-+...+.++.- +.+-..--.++.+-.... T Consensus 1 M~~G-----------~L~~rI~i~~~~~~~~d~~G~~~~~w~~~~~~wA~v~~~sg~e~~~a~~~~~~~t~~i~iR~~~~ 69 (107) T protein:vir:10 1 MNPA-----------KLDKRLTFQVKDENAKGPDGDPIDGYKDAFTVWGSFVYLKGRKYFEAAAANSEVQGETEIRNRDD 69 (107) T ss_pred CCcc-----------ccCccEEEEeceeeccCCCCccccceEeeEEEEEEEEecCchhheeccceeeeeeEEEEEEecCC Confidence 2211 1222356655553 4555666555555668899999988887653 222223333444433445 Q ss_pred cccCCEEEeCCeEEEEEecceeCCCCcEEEEEEEEeC Q lcl|NC_012223. 83 VKVGDLVSLNNTDYRVINPNPLQPAGQTMLFQLQLRG 119 (119) Q Consensus 83 p~~gD~i~i~G~~~~Vv~v~pi~Pag~~v~y~~q~R~ 119 (119) ++.+++|..+|+.|.|.++.|.+.....+... .... 
T Consensus 70 I~~~~ri~~~g~~y~I~~v~~~~~~~~~l~~~-~~~~ 105 (107) T protein:vir:10 70 VSADMKIKYKNVIYDIVSVIPTQDHTLLIMWK-RGEM 105 (107) T ss_pred CCcccEEEECCeEEEEEeecCCCCCcEEEEEE-Eeec Confidence 78899999999999999998887776544332 2222 No 35 >protein:vir:102856 Length: 107 # NCBI annotation: head-tail adaptor protein # Family: family:all:116 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338139;genbank:gi:77020229;genbank:GeneID:3703765 Probab=80.48 E-value=0.094 Score=26.20 Aligned_cols=102 Identities=15% Similarity=0.072 Sum_probs=61.9 Q ss_pred cChHHHHHHHHHHHHhcCCeEEEEeCCE-eccCCCcccCccceeeeeEEEEeccchhhcC--CeEEeecCEEEEEccCcc Q lcl|NC_012223. 6 FNYTGLKRKVNPLIKKFGMTATVTRPGT-VDRVDGDEVVIPPTSFDVIGLREEYKPSEID--GTRIVAGDVKFLCQAIKQ 82 (119) Q Consensus 6 ~~Y~~~~~~A~~ll~~~G~~~tl~r~g~-~dp~~g~~~~~~~~~~~~~gv~~~~~~~~id--GtlI~~GD~~~~~sa~~~ 82 (119) |+=. +.-...++.++.. .+...|+....-...+++-|-+...+.++.- +.+-..--.++.+-.... T Consensus 1 M~~G-----------~L~~rI~i~~~~~~~~d~~G~~~~~w~~~~~~wA~v~~~sg~e~~~a~~~~~~~t~~i~iR~~~~ 69 (107) T protein:vir:10 1 MNPA-----------KLDKRLTFQVKDENAKGPDGDPIDGYKDAFTVWGSFVYLKGRKYFEAAAANSEVQGETEIRNRDD 69 (107) T ss_pred CCcc-----------ccCccEEEEeceeeccCCCCccccceEeeEEEEEEEEecCchhheeccceeeeeeEEEEEEecCC Confidence 2211 1222356655553 4555666555555668899999988887653 222223333444433445 Q ss_pred cccCCEEEeCCeEEEEEecceeCCCCcEEEEEEEEeC Q lcl|NC_012223. 83 VKVGDLVSLNNTDYRVINPNPLQPAGQTMLFQLQLRG 119 (119) Q Consensus 83 p~~gD~i~i~G~~~~Vv~v~pi~Pag~~v~y~~q~R~ 119 (119) ++.+++|..+|+.|.|.++.|.+.....+... .... 
T Consensus 70 I~~~~ri~~~g~~y~I~~v~~~~~~~~~l~~~-~~~~ 105 (107) T protein:vir:10 70 VSADMKIKYKNVIYDIVSVIPTQDHTLLIMWK-RGEM 105 (107) T ss_pred CCcccEEEECCeEEEEEeecCCCCCcEEEEEE-Eeec Confidence 78899999999999999998887776544332 2222 No 36 >protein:vir:105006 Length: 107 # NCBI annotation: putative head-tail adaptor protein # Family: family:all:116 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459971;genbank:gi:85701386;genbank:GeneID:3882147 Probab=80.48 E-value=0.094 Score=26.20 Aligned_cols=102 Identities=15% Similarity=0.072 Sum_probs=61.9 Q ss_pred cChHHHHHHHHHHHHhcCCeEEEEeCCE-eccCCCcccCccceeeeeEEEEeccchhhcC--CeEEeecCEEEEEccCcc Q lcl|NC_012223. 6 FNYTGLKRKVNPLIKKFGMTATVTRPGT-VDRVDGDEVVIPPTSFDVIGLREEYKPSEID--GTRIVAGDVKFLCQAIKQ 82 (119) Q Consensus 6 ~~Y~~~~~~A~~ll~~~G~~~tl~r~g~-~dp~~g~~~~~~~~~~~~~gv~~~~~~~~id--GtlI~~GD~~~~~sa~~~ 82 (119) |+=. +.-...++.++.. .+...|+....-...+++-|-+...+.++.- +.+-..--.++.+-.... T Consensus 1 M~~G-----------~L~~rI~i~~~~~~~~d~~G~~~~~w~~~~~~wA~v~~~sg~e~~~a~~~~~~~t~~i~iR~~~~ 69 (107) T protein:vir:10 1 MNPA-----------KLDKRLTFQVKDENAKGPDGDPIDGYKDAFTVWGSFVYLKGRKYFEAAAANSEVQGETEIRNRDD 69 (107) T ss_pred CCcc-----------ccCccEEEEeceeeccCCCCccccceEeeEEEEEEEEecCchhheeccceeeeeeEEEEEEecCC Confidence 2211 1222356655553 4555666555555668899999988887653 222223333444433445 Q ss_pred cccCCEEEeCCeEEEEEecceeCCCCcEEEEEEEEeC Q lcl|NC_012223. 83 VKVGDLVSLNNTDYRVINPNPLQPAGQTMLFQLQLRG 119 (119) Q Consensus 83 p~~gD~i~i~G~~~~Vv~v~pi~Pag~~v~y~~q~R~ 119 (119) ++.+++|..+|+.|.|.++.|.+.....+... .... 
T Consensus 70 I~~~~ri~~~g~~y~I~~v~~~~~~~~~l~~~-~~~~ 105 (107) T protein:vir:10 70 VSADMKIKYKNVIYDIVSVIPTQDHTLLIMWK-RGEM 105 (107) T ss_pred CCcccEEEECCeEEEEEeecCCCCCcEEEEEE-Eeec Confidence 78899999999999999998887776544332 2222 No 37 >protein:vir:107606 Length: 107 # NCBI annotation: head-tail adaptor protein # Family: family:all:116 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338190;genbank:gi:77020176;genbank:GeneID:3703737 Probab=80.48 E-value=0.094 Score=26.20 Aligned_cols=102 Identities=15% Similarity=0.072 Sum_probs=61.9 Q ss_pred cChHHHHHHHHHHHHhcCCeEEEEeCCE-eccCCCcccCccceeeeeEEEEeccchhhcC--CeEEeecCEEEEEccCcc Q lcl|NC_012223. 6 FNYTGLKRKVNPLIKKFGMTATVTRPGT-VDRVDGDEVVIPPTSFDVIGLREEYKPSEID--GTRIVAGDVKFLCQAIKQ 82 (119) Q Consensus 6 ~~Y~~~~~~A~~ll~~~G~~~tl~r~g~-~dp~~g~~~~~~~~~~~~~gv~~~~~~~~id--GtlI~~GD~~~~~sa~~~ 82 (119) |+=. +.-...++.++.. .+...|+....-...+++-|-+...+.++.- +.+-..--.++.+-.... T Consensus 1 M~~G-----------~L~~rI~i~~~~~~~~d~~G~~~~~w~~~~~~wA~v~~~sg~e~~~a~~~~~~~t~~i~iR~~~~ 69 (107) T protein:vir:10 1 MNPA-----------KLDKRLTFQVKDENAKGPDGDPIDGYKDAFTVWGSFVYLKGRKYFEAAAANSEVQGETEIRNRDD 69 (107) T ss_pred CCcc-----------ccCccEEEEeceeeccCCCCccccceEeeEEEEEEEEecCchhheeccceeeeeeEEEEEEecCC Confidence 2211 1222356655553 4555666555555668899999988887653 222223333444433445 Q ss_pred cccCCEEEeCCeEEEEEecceeCCCCcEEEEEEEEeC Q lcl|NC_012223. 83 VKVGDLVSLNNTDYRVINPNPLQPAGQTMLFQLQLRG 119 (119) Q Consensus 83 p~~gD~i~i~G~~~~Vv~v~pi~Pag~~v~y~~q~R~ 119 (119) ++.+++|..+|+.|.|.++.|.+.....+... .... 
T Consensus 70 I~~~~ri~~~g~~y~I~~v~~~~~~~~~l~~~-~~~~ 105 (107) T protein:vir:10 70 VSADMKIKYKNVIYDIVSVIPTQDHTLLIMWK-RGEM 105 (107) T ss_pred CCcccEEEECCeEEEEEeecCCCCCcEEEEEE-Eeec Confidence 78899999999999999998887776544332 2222 No 38 >protein:vir:4999 Length: 116 # NCBI annotation: putative head-tail joining protein # Family: family:all:1030 # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049973;genbank:gi:9632945;genbank:GeneID:1262108 Probab=72.91 E-value=0.18 Score=24.71 Aligned_cols=107 Identities=15% Similarity=0.131 Sum_probs=61.8 Q ss_pred CeecccChHHHHHHHHHHHHhcCCeEEEEeCCEeccCCCcccCccceeeeeEEEEeccchhh---cCCeEEeecCEEEEE Q lcl|NC_012223. 1 MIMAGFNYTGLKRKVNPLIKKFGMTATVTRPGTVDRVDGDEVVIPPTSFDVIGLREEYKPSE---IDGTRIVAGDVKFLC 77 (119) Q Consensus 1 ~~Ma~~~Y~~~~~~A~~ll~~~G~~~tl~r~g~~dp~~g~~~~~~~~~~~~~gv~~~~~~~~---idGtlI~~GD~~~~~ 77 (119) |-|..++=.++.. ..+||...+ .-||.+|.+...-...+.|.+-...-+..+ +-|+ -......+++ T Consensus 1 M~~~k~~p~~ln~-----ri~Fg~~~~-----~~n~~tG~~~~~f~~~ft~~~~~~t~t~~q~~~~~Gt-~~edT~~~vI 69 (116) T protein:vir:49 1 MRKVKYLPSDFPY-----KADFGTYQS-----TPNKFTGVSVPKFVKQFTLHYQPHIRTLNQEYLAQQN-GEDDTRVIVI 69 (116) T ss_pred CcccccccccCcE-----eEEeeeeee-----eecCCCCcccceeeeeEEEEEEEeecchheeeeeccc-cccccEEEEE Confidence 6666654111111 123443222 245666766655556787777776655443 2343 2334455555 Q ss_pred ccCcccccCCEEEeCCeEEEEEecceeCCCCcEEEEE-EEEeC Q lcl|NC_012223. 78 QAIKQVKVGDLVSLNNTDYRVINPNPLQPAGQTMLFQ-LQLRG 119 (119) Q Consensus 78 sa~~~p~~gD~i~i~G~~~~Vv~v~pi~Pag~~v~y~-~q~R~ 119 (119) -.......+=++.++|+.|.|+++.| +...-.+.|. +.++. 
T Consensus 70 Rh~~~i~~~~~v~~~g~~Y~I~~is~-Dd~~~~~~yD~lTlk~ 111 (116) T protein:vir:49 70 RHNSKVIEGQVVVLHGTQYDIIRASS-NENFGINRYDFLPVRQ 111 (116) T ss_pred EeCCCCCcccEEEECCeEEEEEEeCC-CCCCCcceeeEEEEEE Confidence 44444555667999999999999998 6555455554 44555 No 39 >protein:vir:3616 Length: 103 # NCBI annotation: ORF39 # Family: family:all:666 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112702;genbank:gi:13786570;genbank:GeneID:921068 Probab=70.79 E-value=0.2 Score=24.37 Aligned_cols=96 Identities=15% Similarity=0.145 Sum_probs=56.2 Q ss_pred HhcCCeEEEEe---CCEeccCCCcccCccceeeeeEEEEeccchh---hcCCeEEeecCEEEEEccCcccccCCEEEeCC Q lcl|NC_012223. 20 KKFGMTATVTR---PGTVDRVDGDEVVIPPTSFDVIGLREEYKPS---EIDGTRIVAGDVKFLCQAIKQVKVGDLVSLNN 93 (119) Q Consensus 20 ~~~G~~~tl~r---~g~~dp~~g~~~~~~~~~~~~~gv~~~~~~~---~idGtlI~~GD~~~~~sa~~~p~~gD~i~i~G 93 (119) =.|-..++.-. ++.|||.+|......+..-.+.|-+.+.... .+-|. |++|=+.+-+-+...+..=|.|+++| T Consensus 1 MRy~~~V~F~~~s~~~~YnP~tg~~~~~~~~~~~~~aNVTdlgt~rs~~lFG~-i~~~~kVIRl~~~i~~~~~~yi~i~~ 79 (103) T protein:vir:36 1 MRYLDEVTFIKESPDSHYDPDLGEWVEKEPTRTVFSANITDIGTDRSVEVFGD-IKKGAKVMRMMPLFNMPKYDYIEFDN 79 (103) T ss_pred CcccceEEEEEeCCCCeeCCCCCCccCCeeEEEEEEeecccccchhheeecch-hhcCeEEEEecCCCCcCcccEEEECC Confidence 23334556543 5789999997666666666667777776553 45554 66665555553332223457899999 Q ss_pred eEEEEEecc-eeCCCCcEEEEEEEE Q lcl|NC_012223. 94 TDYRVINPN-PLQPAGQTMLFQLQL 117 (119) Q Consensus 94 ~~~~Vv~v~-pi~Pag~~v~y~~q~ 117 (119) .+|+++... |.+=. 
+.|.=+.-- T Consensus 80 kkY~~~t~r~~~~~~-~~iv~Ev~~ 103 (103) T protein:vir:36 80 KKWALMTYRNPSERN-TFILQEVSQ 103 (103) T ss_pred cEEEEEEeeccccCc-EEEEEEecC Confidence 999999884 22211 111111111 No 40 >protein:vir:4343 Length: 118 # NCBI annotation: Orf10 # Family: family:all:116 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061506;genbank:gi:9635594;genbank:GeneID:1262867 Probab=62.70 E-value=0.33 Score=23.22 Aligned_cols=103 Identities=12% Similarity=-0.012 Sum_probs=57.0 Q ss_pred cChHHHHHHHHHHHHhcCCeEEEEeCC-EeccCCCcccCcccee-----eeeEEEEeccchhhcC--CeEEeecCEEEEE Q lcl|NC_012223. 6 FNYTGLKRKVNPLIKKFGMTATVTRPG-TVDRVDGDEVVIPPTS-----FDVIGLREEYKPSEID--GTRIVAGDVKFLC 77 (119) Q Consensus 6 ~~Y~~~~~~A~~ll~~~G~~~tl~r~g-~~dp~~g~~~~~~~~~-----~~~~gv~~~~~~~~id--GtlI~~GD~~~~~ 77 (119) |+-.+|. ...++.++. ..|+.+|.+...-... .++-|-+...+.++.- +.+-..-..++.+ T Consensus 1 M~~G~l~-----------~rI~i~~~~~~~d~~~G~~~~~w~~~~~~~~~~~WA~v~~~sg~e~~~a~~~~~~~~~~f~i 69 (118) T protein:vir:43 1 MLAYRMR-----------HRIQFQRQVHTQDPDTGEETTTWETVLFSGHADLPAEVLTGPGRELIAADATQAETTARINC 69 (118) T ss_pred CCccccC-----------ccEEEEeeeeecCCCCCcccCceeeeeecccceEEEEEEecCccceeecccchheeeEEEEE Confidence 3322222 235665543 3566555433221111 3678888888887652 2222233444444 Q ss_pred --ccCc-ccccCCEEEeCCeEEEEEecceeCCCCcEEEEEEEEeC Q lcl|NC_012223. 78 --QAIK-QVKVGDLVSLNNTDYRVINPNPLQPAGQTMLFQLQLRG 119 (119) Q Consensus 78 --sa~~-~p~~gD~i~i~G~~~~Vv~v~pi~Pag~~v~y~~q~R~ 119 (119) .+.. -+...++|..+|+.|.|..+.+.+..+--+.-.+.-.. 
T Consensus 70 Ry~~~~~~It~~~Ri~~~g~~y~I~~v~~~~~~~~~l~i~~~e~v 114 (118) T protein:vir:43 70 RWFPVERLELYTWRVLWDGRVYNITSAETDVTARREWRLRCSDGL 114 (118) T ss_pred EecccccCCCcccEEEECCeEEEEEecCCcccCCeEEEEEEEEec Confidence 3432 46889999999999999999876666643222222222 No 41 >protein:vir:100134 Length: 109 # NCBI annotation: gp8 # Family: family:all:116 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945038;genbank:gi:38707898;genbank:GeneID:2744181 Probab=62.44 E-value=0.33 Score=23.19 Aligned_cols=102 Identities=9% Similarity=-0.021 Sum_probs=59.6 Q ss_pred eecccChHHHHHHHHHHHHhcCCeEEEEeCCE-eccCCCcccCc-cceeeeeEEEEeccchhhcCCeEEe--ecCEEEEE Q lcl|NC_012223. 2 IMAGFNYTGLKRKVNPLIKKFGMTATVTRPGT-VDRVDGDEVVI-PPTSFDVIGLREEYKPSEIDGTRIV--AGDVKFLC 77 (119) Q Consensus 2 ~Ma~~~Y~~~~~~A~~ll~~~G~~~tl~r~g~-~dp~~g~~~~~-~~~~~~~~gv~~~~~~~~idGtlI~--~GD~~~~~ 77 (119) .| +-.+| -...++.++-. .|+.++ ++.. -...+++-|-+...+.++....--. .-..++++ T Consensus 1 mm---~~G~L-----------~~rI~i~~~~~~~d~~G~-~~~~~w~~~~~~wA~v~~~s~~e~~~a~~~~~~~~~~~~i 65 (109) T protein:vir:10 1 ML---KAGEL-----------TERITIEKRGGGVNENGE-PLPGDWVEHASVWANVRFLSGKEYVVSGAIHSSAIASMRI 65 (109) T ss_pred CC---Ccccc-----------CccEEEEeeeeeeCCCCC-eeccceEEEEEEEEEEEecCchheeeccceeeeeEEEEEE Confidence 11 11122 23356666543 455443 3332 2345888999998888766432222 22234444 Q ss_pred ccCcccccCCEEEeCCeEEEEEecceeCCCC--cEEEEEEEEeC Q lcl|NC_012223. 78 QAIKQVKVGDLVSLNNTDYRVINPNPLQPAG--QTMLFQLQLRG 119 (119) Q Consensus 78 sa~~~p~~gD~i~i~G~~~~Vv~v~pi~Pag--~~v~y~~q~R~ 119 (119) -....++.+++|..+|..|.|.++.|. 
+.+ ..+.-++-.|+ T Consensus 66 R~~~~I~~~~ri~~~g~~y~I~~v~~d-~~~~~~~l~~~~~e~~ 108 (109) T protein:vir:10 66 RFRRDVDSEMRIRHDGRLYDIAAVLPN-RRQGYVDLSVKVGEKY 108 (109) T ss_pred EeCCCCCcccEEEECCeEEEEeecCCC-CCCCeEEEEEEEEEee Confidence 334457899999999999999998763 333 44556666666 No 42 >protein:vir:742 Length: 102 # NCBI annotation: unknown # Family: family:all:666 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108719;genbank:gi:13487841;genbank:GeneID:920874 Probab=62.24 E-value=0.34 Score=23.16 Aligned_cols=96 Identities=14% Similarity=0.104 Sum_probs=57.4 Q ss_pred HhcCCeEEEEe---CCEeccCCCcccCccceeeeeEEEEeccchh---hcCCeEEeecCEEEEEccCcccccCCEEEeCC Q lcl|NC_012223. 20 KKFGMTATVTR---PGTVDRVDGDEVVIPPTSFDVIGLREEYKPS---EIDGTRIVAGDVKFLCQAIKQVKVGDLVSLNN 93 (119) Q Consensus 20 ~~~G~~~tl~r---~g~~dp~~g~~~~~~~~~~~~~gv~~~~~~~---~idGtlI~~GD~~~~~sa~~~p~~gD~i~i~G 93 (119) =.|-..++.-. ++.|||.+|......+..-.+.|-+.+.... .+-|. |++|=+.+-+.+....-.=|.|.++| T Consensus 1 MRy~~~V~F~~~s~~~~YnP~tg~~~~~~~~~~~~~aNVTdlgt~rs~~lFG~-i~~~~kViRl~~~~~~~~~~~i~~~~ 79 (102) T protein:vir:74 1 MRYLDEVTFIKGSPDSHYDPDLGEWVEKEPTRAVFSANITDIGTDRSVEVFGD-IKKGAKVMRMMPLFTMPEYDYIEFDN 79 (102) T ss_pred CcccceEEEEEeCCCCeeCCCCCCccCCceEEEEEEeecccccchhheeecch-hhcCeeEEEEecCCCCCceEEEEeCC Confidence 23334555543 4789999998666666666777777776553 45564 67765656563322322357899999 Q ss_pred eEEEEEecceeCCCCcEEEEEEE Q lcl|NC_012223. 
94 TDYRVINPNPLQPAGQTMLFQLQ 116 (119) Q Consensus 94 ~~~~Vv~v~pi~Pag~~v~y~~q 116 (119) ..|+++...-..-..+.|.=+.- T Consensus 80 kkY~~~t~r~~~~~~~~iv~Ev~ 102 (102) T protein:vir:74 80 KKWALTTYRNPSKRNTFILQEVN 102 (102) T ss_pred eEEEEEEEeecccCcEEEEEecC Confidence 99999988422222222222222 No 43 >protein:vir:7858 Length: 111 # NCBI annotation: gp15 # Family: family:all:3991 # MgeID: mge:150 # MgeName: CJW1 # Cross-refs: genbank:acc:NP_817465;genbank:gi:29565894;genbank:GeneID:1259087 Probab=62.22 E-value=0.28 Score=23.62 Aligned_cols=99 Identities=18% Similarity=0.138 Sum_probs=49.0 Q ss_pred HHhcCC-eEEEEeCCE-eccCCCcccCccceeeeeEEEEeccchhhcCCeEEeecCEEEEEccC-----cccccCCEEEe Q lcl|NC_012223. 19 IKKFGM-TATVTRPGT-VDRVDGDEVVIPPTSFDVIGLREEYKPSEIDGTRIVAGDVKFLCQAI-----KQVKVGDLVSL 91 (119) Q Consensus 19 l~~~G~-~~tl~r~g~-~dp~~g~~~~~~~~~~~~~gv~~~~~~~~idGtlI~~GD~~~~~sa~-----~~p~~gD~i~i 91 (119) ..+||. .+++...++ --|+-|.......++.+.-|.....=.-+..=+.+-.=|..-.+++. ..-+.+|.+.+ T Consensus 1 ~e~fG~~tVtfV~i~e~~~~~~G~~~~v~~T~~~~pgc~frPl~~~~~~~~va~~~~t~~~~app~pa~lA~k~~~~li~ 80 (111) T protein:vir:78 1 MERIGEDTVTFVQIGKGAKSDRGIPQAVEESSTDVQWCSFQPLGVHYHVTDVSLPDATDRCLAPPVPAAVACKAGDKLIF 80 (111) T ss_pred CccccCceEEEEEeccCCCCCCCCccchheeecCCCceeeccccccccccccccccccccccCCCcceEEEeCCCCeEEE Confidence 788985 456544321 12233332222112233333332221111100111111122234333 24588999999 Q ss_pred CCeEEEEEecceeCCCCcEEEEEEEEeC Q lcl|NC_012223. 92 NNTDYRVINPNPLQPAGQTMLFQLQLRG 119 (119) Q Consensus 92 ~G~~~~Vv~v~pi~Pag~~v~y~~q~R~ 119 (119) ||..|.|+-..-..+.|-+ |++-+=. 
T Consensus 81 dGv~~~i~G~~~~f~dg~~--~~~TIi~ 106 (111) T protein:vir:78 81 HGVSHLVLGVRDHWDKGKL--HHLTVIT 106 (111) T ss_pred cceeeEEeeeeeeccCCCc--eEEEEEe Confidence 9999999999888888765 5554433 No 44 >protein:vir:101653 Length: 111 # NCBI annotation: gp16 # Family: family:all:3991 # MgeID: mge:1515 # MgeName: 244 # Cross-refs: genbank:acc:YP_654771;genbank:gi:109302769;genbank:GeneID:4156087 Probab=62.22 E-value=0.28 Score=23.62 Aligned_cols=99 Identities=18% Similarity=0.138 Sum_probs=49.0 Q ss_pred HHhcCC-eEEEEeCCE-eccCCCcccCccceeeeeEEEEeccchhhcCCeEEeecCEEEEEccC-----cccccCCEEEe Q lcl|NC_012223. 19 IKKFGM-TATVTRPGT-VDRVDGDEVVIPPTSFDVIGLREEYKPSEIDGTRIVAGDVKFLCQAI-----KQVKVGDLVSL 91 (119) Q Consensus 19 l~~~G~-~~tl~r~g~-~dp~~g~~~~~~~~~~~~~gv~~~~~~~~idGtlI~~GD~~~~~sa~-----~~p~~gD~i~i 91 (119) ..+||. .+++...++ --|+-|.......++.+.-|.....=.-+..=+.+-.=|..-.+++. ..-+.+|.+.+ T Consensus 1 ~e~fG~~tVtfV~i~e~~~~~~G~~~~v~~T~~~~pgc~frPl~~~~~~~~va~~~~t~~~~app~pa~lA~k~~~~li~ 80 (111) T protein:vir:10 1 MERIGEDTVTFVQIGKGAKSDRGIPQAVEESSTDVQWCSFQPLGVHYHVTDVSLPDATDRCLAPPVPAAVACKAGDKLIF 80 (111) T ss_pred CccccCceEEEEEeccCCCCCCCCccchheeecCCCceeeccccccccccccccccccccccCCCcceEEEeCCCCeEEE Confidence 788985 456544321 12233332222112233333332221111100111111122234333 24588999999 Q ss_pred CCeEEEEEecceeCCCCcEEEEEEEEeC Q lcl|NC_012223. 92 NNTDYRVINPNPLQPAGQTMLFQLQLRG 119 (119) Q Consensus 92 ~G~~~~Vv~v~pi~Pag~~v~y~~q~R~ 119 (119) ||..|.|+-..-..+.|-+ |++-+=. 
T Consensus 81 dGv~~~i~G~~~~f~dg~~--~~~TIi~ 106 (111) T protein:vir:10 81 HGVSHLVLGVRDHWDKGKL--HHLTVIT 106 (111) T ss_pred cceeeEEeeeeeeccCCCc--eEEEEEe Confidence 9999999999888888765 5554433 No 45 >protein:vir:4832 Length: 116 # NCBI annotation: ORF28 # Family: family:all:1030 # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038329;genbank:gi:9634655;genbank:GeneID:1262597 Probab=62.19 E-value=0.34 Score=23.16 Aligned_cols=106 Identities=16% Similarity=0.164 Sum_probs=61.0 Q ss_pred ecccChHHHHHHHHHHHHhcCCeEEEE-eCCEeccCCCcccCccceeeeeEEEEeccchhh---cCCeEEeecCEEEEEc Q lcl|NC_012223. 3 MAGFNYTGLKRKVNPLIKKFGMTATVT-RPGTVDRVDGDEVVIPPTSFDVIGLREEYKPSE---IDGTRIVAGDVKFLCQ 78 (119) Q Consensus 3 Ma~~~Y~~~~~~A~~ll~~~G~~~tl~-r~g~~dp~~g~~~~~~~~~~~~~gv~~~~~~~~---idGtlI~~GD~~~~~s 78 (119) |+.-.|. ...|-.-++.- ....-||.+|.+...-...|.|.+-...-+..+ +-|+ -......+++- T Consensus 1 M~~~k~~---------p~~~n~ri~FG~~~~~~n~~tG~~~~~f~~~ft~~~~~~t~t~~q~~~~~Gt-~~edt~~~vIR 70 (116) T protein:vir:48 1 MARVRYL---------PSDFRFKADFGTYQSSPNKFTGVSVPKFVKQFTLHYKPHTRTLNQEYLAIQN-GENDTRVIVIR 70 (116) T ss_pred Ceeeeee---------hhhccEEEEeceeeeeeCCCCCcccceeeeeEEEEEEEeecchhheeeeccc-cccCcEEEEEE Confidence 4443331 12222223321 122356667766655556788888777655543 2343 23344555554 Q ss_pred cCcccccCCEEEeCCeEEEEEecceeCCCCcEEEEEE-EEeC Q lcl|NC_012223. 79 AIKQVKVGDLVSLNNTDYRVINPNPLQPAGQTMLFQL-QLRG 119 (119) Q Consensus 79 a~~~p~~gD~i~i~G~~~~Vv~v~pi~Pag~~v~y~~-q~R~ 119 (119) ........=++.++|+.|.|+++.| +...-.+.|.+ .++. 
T Consensus 71 h~~~i~~~~~v~~~g~~Y~I~~Is~-Dd~~~~~~yD~lTlk~ 111 (116) T protein:vir:48 71 HNSKVLEGQVVTLNGTQYDIVRISP-DENFGFNHYDFLTLKK 111 (116) T ss_pred eCCCCCcccEEEECCeEEEEEEeCC-CCCcCcceeeEEEEEE Confidence 4444455667899999999999998 66665666654 4555 No 46 >protein:vir:4955 Length: 116 # NCBI annotation: putative head-tail joining protein # Family: family:all:1030 # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049931;genbank:gi:9632902;genbank:GeneID:1262078 Probab=62.13 E-value=0.34 Score=23.15 Aligned_cols=106 Identities=18% Similarity=0.139 Sum_probs=63.5 Q ss_pred ecccChHHHHHHHHHHHHhcCCeEEEEeC-CEeccCCCcccCccceeeeeEEEEeccchhhc---CCeEEeecCEEEEEc Q lcl|NC_012223. 3 MAGFNYTGLKRKVNPLIKKFGMTATVTRP-GTVDRVDGDEVVIPPTSFDVIGLREEYKPSEI---DGTRIVAGDVKFLCQ 78 (119) Q Consensus 3 Ma~~~Y~~~~~~A~~ll~~~G~~~tl~r~-g~~dp~~g~~~~~~~~~~~~~gv~~~~~~~~i---dGtlI~~GD~~~~~s 78 (119) |+. + .--+..|-.-++.-.. ..-||.+|.+...-...|.|.|-...-+.++. -|+. ......+.+- T Consensus 1 m~~-~--------k~~~~~ln~rI~Fg~~~~~~n~~tG~~~~~f~~~f~~wa~~~~~~~~q~~~~~gt~-~e~T~~~vIR 70 (116) T protein:vir:49 1 MAR-G--------RYLPSDFRYKADFGTYQSTPNKFTGVSVPKFVKQFTLHYKPHTRTLNQEYLAIQNG-ENDTRVIVIR 70 (116) T ss_pred Ccc-c--------ccccccCceeEEeeeeeeeecCCCCcccceeeeeEEEEEEeecccceeeeeeeccc-ccccEEEEEE Confidence 333 1 0113334444444222 23567778777666667888888777666543 3442 2344555554 Q ss_pred cCcccccCCEEEeCCeEEEEEecceeCCCCcEEEE-EEEEeC Q lcl|NC_012223. 79 AIKQVKVGDLVSLNNTDYRVINPNPLQPAGQTMLF-QLQLRG 119 (119) Q Consensus 79 a~~~p~~gD~i~i~G~~~~Vv~v~pi~Pag~~v~y-~~q~R~ 119 (119) -.......=+|.++|+.|.|+++.| +...-.+.| -+.++. 
T Consensus 71 h~~~i~~~~~v~~~g~~Y~I~~Isp-D~~~~~~~yD~iTlk~ 111 (116) T protein:vir:49 71 HNAKVLEGQVVTLNGTQYDIVRISP-DDNFGFNHYDFLTLKK 111 (116) T ss_pred eCCCCCcccEEEECCeEEEEEEeCC-CCccCcceeeEEEEEE Confidence 4444556668999999999999998 666656677 445555 No 47 >protein:vir:1641 Length: 115 # NCBI annotation: hypothetical protein # Family: family:all:1270 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695062;genbank:gi:23455753;genbank:GeneID:955487 Probab=60.80 E-value=0.32 Score=23.28 Aligned_cols=102 Identities=15% Similarity=0.148 Sum_probs=59.0 Q ss_pred CeecccChHHHHHHHHHHHHhcCCeEEEEeCC--EeccCCCcccCccceeeeeEEEEec-cchhhcCCeEEeecCEEEEE Q lcl|NC_012223. 1 MIMAGFNYTGLKRKVNPLIKKFGMTATVTRPG--TVDRVDGDEVVIPPTSFDVIGLREE-YKPSEIDGTRIVAGDVKFLC 77 (119) Q Consensus 1 ~~Ma~~~Y~~~~~~A~~ll~~~G~~~tl~r~g--~~dp~~g~~~~~~~~~~~~~gv~~~-~~~~~idGtlI~~GD~~~~~ 77 (119) |||-++. .|.+++|.++- +-||- |.+..+ ...-++.+|+.. -+..++++++=..|.+..+- T Consensus 1 ~~~m~~i--------------kGetVtvi~~~~tG~D~~-g~pi~~-~~~e~V~nVLV~P~s~~d~~~~~~p~G~~v~~t 64 (115) T protein:vir:16 1 MIFMGMI--------------KGIAVTLIDKVETGKDPF-GNPIYE-DKEIVVNNVLVSPTSSDDIVNQLTLTGKKAIYT 64 (115) T ss_pred Ceeeccc--------------CceeEEEecceecccCCC-CCCccc-ccceEcCceeeCCCChhhcccccCcceeEEEEE Confidence 6654442 47788876643 33553 223332 445666666654 55567766544455444433 Q ss_pred cc----CcccccCCEEEeCCeEEEEEecceeCCCC--cEEEEEEEEeC Q lcl|NC_012223. 
78 QA----IKQVKVGDLVSLNNTDYRVINPNPLQPAG--QTMLFQLQLRG 119 (119) Q Consensus 78 sa----~~~p~~gD~i~i~G~~~~Vv~v~pi~Pag--~~v~y~~q~R~ 119 (119) -+ ..-.-.|.+|.+.|++|+|+-- |+.+.+ ++..|...+=- T Consensus 65 la~PK~~~~~lrg~~V~~~G~~~~vvGd-P~~~~~~~~P~~WN~~V~v 111 (115) T protein:vir:16 65 LAIPKKDTHDWENKKVRFFGKTWRTFGE-PLEGIEGLIPLDWNKKVTV 111 (115) T ss_pred EecCCCCCCcccCceEEEeCceeEEecC-CCCcccccCCCccCCeEEE Confidence 22 1346789999999999998864 555544 44556655544 No 48 >protein:vir:80104 Length: 124 # NCBI annotation: gp12 # Family: family:all:4890 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468716;genbank:gi:157325296;genbank:GeneID:5601795 Probab=51.30 E-value=0.42 Score=22.63 Aligned_cols=106 Identities=15% Similarity=0.191 Sum_probs=52.8 Q ss_pred ecccChHHHHHHHHHHHHhcCCeEEEEeC---CEeccCCCcccCcccee---eeeEEEEe------ccchhhc--CCeEE Q lcl|NC_012223. 3 MAGFNYTGLKRKVNPLIKKFGMTATVTRP---GTVDRVDGDEVVIPPTS---FDVIGLRE------EYKPSEI--DGTRI 68 (119) Q Consensus 3 Ma~~~Y~~~~~~A~~ll~~~G~~~tl~r~---g~~dp~~g~~~~~~~~~---~~~~gv~~------~~~~~~i--dGtlI 68 (119) |-.||.. +||++||.+.+|-.. |+.+ +.|+..+..... ....+-+. +..+... +|+ = T Consensus 1 m~~M~Fa-------smi~~fGVpi~V~~~~~~gg~~-~~G~w~~~~~~~~~~~~~~EPviP~s~~t~~~q~~~~tgG~-~ 71 (124) T protein:vir:80 1 MEKMIFQ-------SLLDSFGVPLTVFPKQEKGGEF-VNGEWVVSQLDETSKIEVNEPFIPSSLMTQMPQTSAYTAAR-Y 71 (124) T ss_pred Ccccchh-------hhHHhhCCCeEEeecCCCCCcc-cCCccccCCCCCCChhhccccccCCcccchhhhhhhccCcc-c Confidence 6666654 779999999988642 2211 123322211000 00111111 1122111 233 2 Q ss_pred eecCEEEEEccCcccccCCEEEeCCeEEEEEecceeCCCCcEEEEEEEEeC Q lcl|NC_012223. 69 VAGDVKFLCQAIKQVKVGDLVSLNNTDYRVINPNPLQPAGQTMLFQLQLRG 119 (119) Q Consensus 69 ~~GD~~~~~sa~~~p~~gD~i~i~G~~~~Vv~v~pi~Pag~~v~y~~q~R~ 119 (119) ...|..-+=+.. 
..+|.+|.=-|+.|+|.+..|-..=.-+..|.|-.=+ T Consensus 72 ~~~dl~WySs~~--~p~gs~V~~~g~~y~V~~~~~Y~~Ysdv~~Y~LK~vs 120 (124) T protein:vir:80 72 EKYEMIWFSSQV--LPLKSKVIHKGITYSVEDAIPFTDYSDVTQYGCKAVS 120 (124) T ss_pred hhhhhhhhhccc--cccccEecCCCeeEEEeeccCccccCCceEEeeeecc Confidence 233433332222 4678888878999999999885544444455443333 No 49 >protein:vir:102337 Length: 116 # NCBI annotation: hypothetical protein # Family: family:all:2370 # MgeID: mge:1566 # MgeName: phi CD119 # Cross-refs: genbank:acc:YP_529562;genbank:gi:90592647;genbank:GeneID:3974469 Probab=50.32 E-value=0.61 Score=21.75 Aligned_cols=101 Identities=15% Similarity=0.143 Sum_probs=54.3 Q ss_pred CeecccChHHHHHHHHHHHH-hcCCeEEEEeCC-EeccCCCcc--cCcc--ceeeeeEEEEeccch--hhcCCeEEeecC Q lcl|NC_012223. 1 MIMAGFNYTGLKRKVNPLIK-KFGMTATVTRPG-TVDRVDGDE--VVIP--PTSFDVIGLREEYKP--SEIDGTRIVAGD 72 (119) Q Consensus 1 ~~Ma~~~Y~~~~~~A~~ll~-~~G~~~tl~r~g-~~dp~~g~~--~~~~--~~~~~~~gv~~~~~~--~~idGtlI~~GD 72 (119) |-=|++ |+ =|--.|+|.|.. ..||++|.. .... ....||+--...-+. ++. ...++ . T Consensus 1 m~eadi------------Le~lY~d~c~I~r~~~~~d~~t~it~~~e~~~i~e~~pCrlS~~~~~~~~~~~-~~~~~--~ 65 (116) T protein:vir:10 1 MTEADI------------LALTYFCKMTIRRCVSIKNEDTGVTYFNENVVIAEDVPCGLNGNIPNVIDTDI-TNSIS--V 65 (116) T ss_pred CChHHH------------hhhhccCeEEEEEeeeeeCCcccccccccceeeeCCcceEEeecccccccCcc-cceeE--E Confidence 222222 22 244568887754 478887742 1111 234566655432222 222 23333 4 Q ss_pred EEEEEccCcccccCCEEEeC---CeEEEEEecceeCCCCcEEEEEEEEeC Q lcl|NC_012223. 73 VKFLCQAIKQVKVGDLVSLN---NTDYRVINPNPLQPAGQTMLFQLQLRG 119 (119) Q Consensus 73 ~~~~~sa~~~p~~gD~i~i~---G~~~~Vv~v~pi~Pag~~v~y~~q~R~ 119 (119) .+|+++|+..++.||+|+|. |..+..-+-.|-.+.. +=+++|.. 
T Consensus 66 ~kLF~~PeidIk~Gd~I~Vt~~~g~~~~~~ag~p~~Y~s---HqEi~L~~ 112 (116) T protein:vir:10 66 FELYCRPEIDLQVGDILDITLENGNVETFIASKPFPYSS---HLQVNLTL 112 (116) T ss_pred EEEEeccCcccCCCCEEEEecCCCeEEEEEcCCCcccCC---cceEEEEE Confidence 79999999999999999887 4444444444433221 23444544 No 50 >protein:vir:1385 Length: 107 # NCBI annotation: Gp8 protein # Family: family:all:3858 # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612837;genbank:gi:20065971;genbank:GeneID:935786 Probab=49.82 E-value=0.63 Score=21.69 Aligned_cols=97 Identities=12% Similarity=0.076 Sum_probs=59.8 Q ss_pred ecccChHHHHHHHHHHHHhcCCeEEEEeCC-EeccCCCcccCccceeeeeEEEEeccchhhc--CCeEEeecCEEEEE-- Q lcl|NC_012223. 3 MAGFNYTGLKRKVNPLIKKFGMTATVTRPG-TVDRVDGDEVVIPPTSFDVIGLREEYKPSEI--DGTRIVAGDVKFLC-- 77 (119) Q Consensus 3 Ma~~~Y~~~~~~A~~ll~~~G~~~tl~r~g-~~dp~~g~~~~~~~~~~~~~gv~~~~~~~~i--dGtlI~~GD~~~~~-- 77 (119) |+.+ = -.++.++- ..|+ .|.+...-..-+++-|-+...+.+|. .|..-.....++.+ T Consensus 1 ~~~~----------------h-RI~i~~~~~~~D~-~G~~~~~w~~~~~~WA~v~~~~g~E~~~a~~~~~~~~~~f~iRy 62 (107) T protein:vir:13 1 MARY----------------E-RISIKKLEEKNIK-GRRQEECLIPFYDCWAEILDLYGQELYGALQMKLENTIIFKIRY 62 (107) T ss_pred CCcc----------------e-EEEEEeeeeeeCC-CCCeecceEeEEEEEEEEecCCchheeecceeheeeeEEEEEEe Confidence 2221 1 24554443 3453 34445455556899999999888865 34443455566666 Q ss_pred ccCcc---cccCCEEEeCCeEEEEEecceeCCCCcEEEEEEEEeC Q lcl|NC_012223. 78 QAIKQ---VKVGDLVSLNNTDYRVINPNPLQPAGQTMLFQLQLRG 119 (119) Q Consensus 78 sa~~~---p~~gD~i~i~G~~~~Vv~v~pi~Pag~~v~y~~q~R~ 119 (119) .++.. +..+++|..+|+.|.|.++.|.+....- .++..+. 
T Consensus 63 ~~~i~~~~~t~~~Ri~~~g~~y~I~~v~~~~~~~~~--l~i~c~e 105 (107) T protein:vir:13 63 CKKVEELRNKENFIVEWQGRKYEIYYPDFLGYNKQF--VKLKCKE 105 (107) T ss_pred cCCccccccCcCcEEEECCeEEEEEecCCcccCCeE--EEEEEEE Confidence 34443 4678999999999999999877766642 2333333 No 51 >protein:vir:105035 Length: 112 # NCBI annotation: Gp9 # Family: family:all:11389 # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006589;genbank:gi:46402095;genbank:GeneID:2777900 Probab=47.23 E-value=0.71 Score=21.40 Aligned_cols=103 Identities=11% Similarity=-0.049 Sum_probs=61.4 Q ss_pred CeecccChHHHHHHHHHHHHhcCCeEEEEeCC-EeccCCCcccCc-cceeeeeEEEEeccchhhcC-CeEEe-ecCEEEE Q lcl|NC_012223. 1 MIMAGFNYTGLKRKVNPLIKKFGMTATVTRPG-TVDRVDGDEVVI-PPTSFDVIGLREEYKPSEID-GTRIV-AGDVKFL 76 (119) Q Consensus 1 ~~Ma~~~Y~~~~~~A~~ll~~~G~~~tl~r~g-~~dp~~g~~~~~-~~~~~~~~gv~~~~~~~~id-GtlI~-~GD~~~~ 76 (119) |-|-.= +| -...++.++. ..|+.+ ++... =...+++-|-+...+.++.- +...+ .--..+. T Consensus 1 m~M~aG---~L-----------~~RItiq~~~~~~D~~G-~~~~~~W~~~~tvWA~v~~~sg~E~~~A~~~~~~~t~~f~ 65 (112) T protein:vir:10 1 MSLKPG---DM-----------NCRITIGYLQSGRGPLG-EPLPEKLVESGKAWAKRELVSGRKVRTMDQQQVMETCLFT 65 (112) T ss_pred CCCCcc---cc-----------CCcEEEEeeeeccCCCC-CEeccceeeeEEEEEEEEccCchheeecccccceeeEEEE Confidence 544332 12 2336666654 355543 34322 24557888888888887642 22222 2223334 Q ss_pred EccCcccccCCEEEeCCeEEEEEecceeCCCCcEEEEEEEEeC Q lcl|NC_012223. 
77 CQAIKQVKVGDLVSLNNTDYRVINPNPLQPAGQTMLFQLQLRG 119 (119) Q Consensus 77 ~sa~~~p~~gD~i~i~G~~~~Vv~v~pi~Pag~~v~y~~q~R~ 119 (119) +=.-..+..+.+|..+|+.|.|.+|.+-+ .-..+|-+.++|- T Consensus 66 IRyr~dI~~~mRI~~~gr~y~I~~Vd~~~-~r~llc~E~~~~~ 107 (112) T protein:vir:10 66 VYPGVVVDIDWKITTKDLVYTVRNIDRKT-DQIIITGEADGRH 107 (112) T ss_pred EeeCCCCCcccEEEECCeEEEEecccCCC-CcEEEEeeccccc Confidence 43333467899999999999999997522 3355888888887 No 52 >protein:vir:5743 Length: 117 # NCBI annotation: hypothetical protein # Family: family:all:11389 # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892054;genbank:gi:33770517;interpro:IPR006453;uniprot:Q7Y406;genbank:GeneID:2637487;interpro:IPR013045 Probab=44.19 E-value=0.82 Score=21.07 Aligned_cols=107 Identities=9% Similarity=0.044 Sum_probs=59.1 Q ss_pred CeecccChHHHHHHHHHHHHhcCCeEEEEeCC-EeccCCCcccCccceee-eeEEEEeccchhhc-CCeEEe-ecCEEEE Q lcl|NC_012223. 1 MIMAGFNYTGLKRKVNPLIKKFGMTATVTRPG-TVDRVDGDEVVIPPTSF-DVIGLREEYKPSEI-DGTRIV-AGDVKFL 76 (119) Q Consensus 1 ~~Ma~~~Y~~~~~~A~~ll~~~G~~~tl~r~g-~~dp~~g~~~~~~~~~~-~~~gv~~~~~~~~i-dGtlI~-~GD~~~~ 76 (119) |.=..||=.+|. .-++|.++. ..|+.+ ++.+....++ .+-|-+...+.++. .+.-.+ .-...+. T Consensus 1 ~~~~~M~aG~Lr-----------~RItiq~~~~~~D~~G-~~~~~~W~~~~~vWA~v~~lsgre~~~A~~~~~~~t~ri~ 68 (117) T protein:vir:57 1 MSDKPLNPGDLN-----------CRVILREVKSGRGPLG-EVLPAAPVVMGKAWAKIEPISNRKIRSADQQQIVETCLFT 68 (117) T ss_pred CCCCccCccccC-----------CcEEEEecccccCCCC-CeeccceeeeEEEEEeEEcCcchheeeccccceeeEEEEE Confidence 111113211211 235555543 456654 3443444554 67888888887764 222222 2223334 Q ss_pred EccCcccccCCEEEeCCeEEEEEecceeCCCC-cEEEEEEEEeC Q lcl|NC_012223. 
77 CQAIKQVKVGDLVSLNNTDYRVINPNPLQPAG-QTMLFQLQLRG 119 (119) Q Consensus 77 ~sa~~~p~~gD~i~i~G~~~~Vv~v~pi~Pag-~~v~y~~q~R~ 119 (119) +=.-.-+..+.+|..+|..|.|.+|...+-.+ ..+|.+..+|- T Consensus 69 IRyr~dI~~~mRV~~~gr~y~I~~V~d~~~~~r~Lic~e~~i~~ 112 (117) T protein:vir:57 69 LYPRRDISIDWQIVTTAGVFTVRVVDRVSHADRILITGEADIRH 112 (117) T ss_pred EEecCCCCcCCEEEECCEEEEEEeccCCCCCccEEEEEeccccc Confidence 43333457899999999999999996544444 34777777776 No 53 >protein:vir:100244 Length: 109 # NCBI annotation: gp73 # Family: family:all:116 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355409;genbank:gi:77864699;genbank:GeneID:3725966 Probab=41.00 E-value=0.95 Score=20.72 Aligned_cols=101 Identities=9% Similarity=-0.011 Sum_probs=56.8 Q ss_pred eecccChHHHHHHHHHHHHhcCCeEEEEeCCE-eccCCCcccCccceeeeeEEEEeccchhhcC--CeEEeecCEEEEEc Q lcl|NC_012223. 2 IMAGFNYTGLKRKVNPLIKKFGMTATVTRPGT-VDRVDGDEVVIPPTSFDVIGLREEYKPSEID--GTRIVAGDVKFLCQ 78 (119) Q Consensus 2 ~Ma~~~Y~~~~~~A~~ll~~~G~~~tl~r~g~-~dp~~g~~~~~~~~~~~~~gv~~~~~~~~id--GtlI~~GD~~~~~s 78 (119) .| +-. ++-...++.++.. .|+.++.-...-...+++-|-+...+.++.- +.+-..--.++.+= T Consensus 1 mm---~~g-----------~L~~rI~i~~~~~~~d~~G~~~~~~w~~~~~~wA~i~~~~g~e~~~a~~~~~~~~~~i~iR 66 (109) T protein:vir:10 1 ML---RSS-----------DLTEFIVIERKGGRTNENGEPLPDDWVTHDEVWASVRFVSGKEHVISGAVRSSAIASIRIR 66 (109) T ss_pred CC---Ccc-----------ccCccEEEEeeeeccCCCCCeeccceeeEEEEEEEEEecCchheeeccceeeeeeEEEEEE Confidence 11 111 2223466666543 4544332222334568889988888887653 33333333444443 Q ss_pred cCcccccCCEEEeCCeEEEEEecceeCCCCcEEEEEEEEeC Q lcl|NC_012223. 79 AIKQVKVGDLVSLNNTDYRVINPNPLQPAGQTMLFQLQLRG 119 (119) Q Consensus 79 a~~~p~~gD~i~i~G~~~~Vv~v~pi~Pag~~v~y~~q~R~ 119 (119) ....++.+++|..+|+.|.|.++.|. 
...- ...+..+- T Consensus 67 ~~~~I~~~~ri~~~g~~y~I~~v~~~-~~~~--~l~i~c~e 104 (109) T protein:vir:10 67 FREDIDSEMRIRYGDQLYDIVAVLPN-RRKG--SLDLPVKV 104 (109) T ss_pred ecCCCCcccEEEECCeEEEEEeeccC-CCCc--EEEEEEEe Confidence 34457899999999999999999774 3332 23333333 No 54 >protein:vir:1436 Length: 108 # NCBI annotation: hypothetical protein # Family: family:all:116 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536365;genbank:gi:17975170;genbank:GeneID:929148 Probab=40.86 E-value=0.95 Score=20.70 Aligned_cols=99 Identities=11% Similarity=0.028 Sum_probs=55.8 Q ss_pred cChHHHHHHHHHHHHhcCCeEEEEeCC-EeccCCCcccC-ccceeeeeEEEEeccchhhcCCeEEeecCEEEEE--ccCc Q lcl|NC_012223. 6 FNYTGLKRKVNPLIKKFGMTATVTRPG-TVDRVDGDEVV-IPPTSFDVIGLREEYKPSEIDGTRIVAGDVKFLC--QAIK 81 (119) Q Consensus 6 ~~Y~~~~~~A~~ll~~~G~~~tl~r~g-~~dp~~g~~~~-~~~~~~~~~gv~~~~~~~~idGtlI~~GD~~~~~--sa~~ 81 (119) |+=. +.-..+++.++- ..|+.++ +.. .-...+++-|-+...+.++.-...-...+....| =... T Consensus 1 M~~G-----------~L~~rI~i~~~~~~~d~~G~-~~~~~w~~~~~~wA~i~~~~g~e~~~a~~~~~~~t~~i~iR~~~ 68 (108) T protein:vir:14 1 MEAG-----------KLKERIVIERPSGETNENDE-PIPGAWVVHARPWADVRFLNGKEHVISGAVRGATVASMRIRYRA 68 (108) T ss_pred CCcc-----------ccCccEEEEeeeeccCCCCC-eeccceeeEEEEEEEEEecCchheeeccceeeeeeEEEEEEecC Confidence 2211 122235665553 3455443 332 2345688999999888876543323333333333 2233 Q ss_pred ccccCCEEEeCCeEEEEEecceeCCCCcEEEEEEEEeC Q lcl|NC_012223. 
82 QVKVGDLVSLNNTDYRVINPNPLQPAGQTMLFQLQLRG 119 (119) Q Consensus 82 ~p~~gD~i~i~G~~~~Vv~v~pi~Pag~~v~y~~q~R~ 119 (119) .++..++|..+|..|.|.++.|..-.+- ..+...- T Consensus 69 ~I~~~~ri~~~g~~y~I~~v~~~~~~~~---l~i~~~~ 103 (108) T protein:vir:14 69 GIGDQMRIRYDGRLYDITAVLPARKRGY---LDLSVKV 103 (108) T ss_pred CCCcccEEEECCeEEEEEeeccCCCCCE---EEEEEEe Confidence 4788999999999999999988654331 1222222 No 55 >protein:vir:3971 Length: 103 # NCBI annotation: hypothetical protein # Family: family:all:666 # MgeID: mge:83 # MgeName: ul36 # Cross-refs: genbank:acc:NP_663679;genbank:gi:21716116;genbank:GeneID:951216 Probab=37.47 E-value=1.1 Score=20.32 Aligned_cols=96 Identities=14% Similarity=0.101 Sum_probs=55.4 Q ss_pred HhcCCeEEEEe---CCEeccCCCcccCccceeeeeEEEEeccchh---hcCCeEEeecCEEEEEccCccccc-CCEEEeC Q lcl|NC_012223. 20 KKFGMTATVTR---PGTVDRVDGDEVVIPPTSFDVIGLREEYKPS---EIDGTRIVAGDVKFLCQAIKQVKV-GDLVSLN 92 (119) Q Consensus 20 ~~~G~~~tl~r---~g~~dp~~g~~~~~~~~~~~~~gv~~~~~~~---~idGtlI~~GD~~~~~sa~~~p~~-gD~i~i~ 92 (119) =.|-..+++-. ++.|||.+|.........-.+.|-+.+.... .+-|. |++|=+.+-+..... .. =|.|.++ T Consensus 1 MRy~~~V~F~~~s~~~~YnP~tg~~~~~~~~~~~~~aNVTdlgt~rs~~lFG~-i~~~~kVIRl~~~~~-~~~~~~i~~~ 78 (103) T protein:vir:39 1 MRFLDEVTFIKESPDSHYDPDLGEWVEKEPTRAVFSANITDIGTDRSIKVFGD-IKQGAKVMRMMPLFT-MPEYDYIEFD 78 (103) T ss_pred CcccceEEEEEeCCCCeeCCCCCCccCCceEEEEEEeecccccchhheeecch-hccCeeEEEeeCCCC-CCceEEEEcC Confidence 23334555543 4679999998666666666777777776553 45554 666655555533222 22 4689999 Q ss_pred CeEEEEEecceeCCCCcEEEEEEEE Q lcl|NC_012223. 
93 NTDYRVINPNPLQPAGQTMLFQLQL 117 (119) Q Consensus 93 G~~~~Vv~v~pi~Pag~~v~y~~q~ 117 (119) |..|+++...-..-.-+.|.=+.-- T Consensus 79 ~kkY~~~t~r~~~~~~~~iv~Evn~ 103 (103) T protein:vir:39 79 NKKWALMTYRNPSERNTFILQEVNQ 103 (103) T ss_pred CeEEEEEEEeecccCcEEEEEEecC Confidence 9999999884211111111111111 No 56 >protein:vir:193 Length: 112 # NCBI annotation: putative head-tail adaptor # Family: family:all:116 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037703;genbank:gi:9634168;genbank:GeneID:1262533 Probab=35.86 E-value=1.2 Score=20.14 Aligned_cols=102 Identities=11% Similarity=0.084 Sum_probs=58.1 Q ss_pred cChHHHHHHHHHHHHhcCCeEEEEeCCE-eccCCCcccCccceeeeeEEEEeccchhhcC--CeEEeecCEEEEEccCcc Q lcl|NC_012223. 6 FNYTGLKRKVNPLIKKFGMTATVTRPGT-VDRVDGDEVVIPPTSFDVIGLREEYKPSEID--GTRIVAGDVKFLCQAIKQ 82 (119) Q Consensus 6 ~~Y~~~~~~A~~ll~~~G~~~tl~r~g~-~dp~~g~~~~~~~~~~~~~gv~~~~~~~~id--GtlI~~GD~~~~~sa~~~ 82 (119) |+=. +.-...++.++.. .|. .|++...-...+++-|-+...+.++.- +..-...-.++.+-.... T Consensus 1 M~~G-----------~L~~rI~i~~~~~~~d~-~G~~~~~w~~~~~~wA~v~~~sg~e~~~a~~~~~~~~~~i~iR~~~~ 68 (112) T protein:vir:19 1 MEPG-----------RFRNRVKILTFTTSRDP-SGQPVESWTGGNPVPAEVKGISGREQLSGGAETAQATIRVWMRFRSE 68 (112) T ss_pred CCcc-----------ccCccEEEEeeeeeeCC-CCCeecceEeEEEEEEEEEecCchheeeccceeeeeeEEEEEEecCC Confidence 1111 1122356655543 443 455555555668888988888887653 222223334444433345 Q ss_pred cccCCEEE-----eCCeEEEEEecceeCCCCcEEEEEEEEeC Q lcl|NC_012223. 83 VKVGDLVS-----LNNTDYRVINPNPLQPAGQTMLFQLQLRG 119 (119) Q Consensus 83 p~~gD~i~-----i~G~~~~Vv~v~pi~Pag~~v~y~~q~R~ 119 (119) ++.+++|. .+|+.|.|.++.+....+--+.-.+.-.. 
T Consensus 69 I~~~~ri~~~~~~~~g~~y~I~~v~~~~~~~~~l~l~c~egv 110 (112) T protein:vir:19 69 LNASSRLEVLSGPYKGQVLNIIGPPVANATGTRLEILCKTGA 110 (112) T ss_pred CCcccceeecceeeCCeEEEEEecCCCccCCcEEEEEEEEcc Confidence 67888885 69999999999876766654333333222 No 57 >protein:vir:99571 Length: 131 # NCBI annotation: hypothetical protein # Family: family:all:7161 # MgeID: mge:1544 # MgeName: BcepF1 # Cross-refs: genbank:acc:YP_001039798;genbank:gi:126011048;genbank:GeneID:4818266 Probab=35.66 E-value=1.2 Score=20.12 Aligned_cols=104 Identities=15% Similarity=0.190 Sum_probs=51.1 Q ss_pred CeecccChHHHHHHHHHHHHhcCCeEE----EEe--CC----Eec-cC--CCcccCccceeeeeEEEEeccchhhcCCeE Q lcl|NC_012223. 1 MIMAGFNYTGLKRKVNPLIKKFGMTAT----VTR--PG----TVD-RV--DGDEVVIPPTSFDVIGLREEYKPSEIDGTR 67 (119) Q Consensus 1 ~~Ma~~~Y~~~~~~A~~ll~~~G~~~t----l~r--~g----~~d-p~--~g~~~~~~~~~~~~~gv~~~~~~~~idGtl 67 (119) |+.-.. +|-.+|.++|..--..+- .+. .| +|. |+ .|.--..+...|.--|+- ++. T Consensus 1 m~iPG~---NLl~~A~~VI~~Q~V~y~rf~~Rt~n~~gq~i~~y~~p~~i~gS~Q~V~~~~v~~~GLd--~~~------- 68 (131) T protein:vir:99 1 MIVPGS---NLFMQAASVIALTPVPYLRFTQRVLNPARQWITTYAAAVDVPMSVQRVPRNKYVQFGLE--FQR------- 68 (131) T ss_pred Ccccch---HHHHHHhhhhccccchhhcccccccccccceeeeecCCccceeeEEecChhheeecCcc--eee------- Confidence 444444 577777777763221110 011 12 121 11 111001111112222221 111 Q ss_pred EeecCEEEEEccCc----ccccCCEEEeCCeEEEEEecceeCCCC---cEEEEEEEEeC Q lcl|NC_012223. 68 IVAGDVKFLCQAIK----QVKVGDLVSLNNTDYRVINPNPLQPAG---QTMLFQLQLRG 119 (119) Q Consensus 68 I~~GD~~~~~sa~~----~p~~gD~i~i~G~~~~Vv~v~pi~Pag---~~v~y~~q~R~ 119 (119) -=+.++.+.+. 
.=..||.++.+|++|.|+...-+-.-+ ..+|-++=+|+ T Consensus 69 ---~Yv~lfts~~i~~iqRg~agD~liwnGrr~~v~g~~dW~~QDGW~~~lcv~~Gi~~ 124 (131) T protein:vir:99 69 ---NYVRLFAPIEMVDLDRDCGGDMIIWHGRQHKIESQNTWYLQDGWAMSLAVDLGIRS 124 (131) T ss_pred ---eEEEEeecCcceecccCCCCCEEEECCeEEEecccceeeeeccceEEEEEEeeccc Confidence 12233333332 125799999999999999997654433 67888999999 No 58 >protein:vir:80342 Length: 108 # NCBI annotation: gp9, phage head-tail adaptor, putative # Family: family:all:116 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111088;genbank:gi:134288641;genbank:GeneID:4960589 Probab=35.23 E-value=1.2 Score=20.07 Aligned_cols=99 Identities=12% Similarity=0.031 Sum_probs=55.4 Q ss_pred cChHHHHHHHHHHHHhcCCeEEEEeCC-EeccCCCcccC-ccceeeeeEEEEeccchhhcCCeEEeec--CEEEEEccCc Q lcl|NC_012223. 6 FNYTGLKRKVNPLIKKFGMTATVTRPG-TVDRVDGDEVV-IPPTSFDVIGLREEYKPSEIDGTRIVAG--DVKFLCQAIK 81 (119) Q Consensus 6 ~~Y~~~~~~A~~ll~~~G~~~tl~r~g-~~dp~~g~~~~-~~~~~~~~~gv~~~~~~~~idGtlI~~G--D~~~~~sa~~ 81 (119) |+=. ++-...++.++. ..|+.++ +.. .-...+++-|-+...+.++.....-..+ -.++.+-... T Consensus 1 M~~G-----------~L~~rI~i~~~~~~~d~~G~-~~~~~w~~~~~~wA~v~~~~~~e~~~a~~~~~~~~~~i~iR~~~ 68 (108) T protein:vir:80 1 MKTG-----------KLKERIVIERPSGETNENDE-PIPGAWIVHARPWADVLFLNGKEHVISGAVRGATIASMRIRYRA 68 (108) T ss_pred CCcc-----------ccCccEEEEeeeeccCCCCC-eeccceeeEEEEEEEEEecCchheeeccceeeeeeEEEEEEecC Confidence 1111 112235555553 3555433 332 3345688999998888877633222223 2333333333 Q ss_pred ccccCCEEEeCCeEEEEEecceeCCCCcEEEEEEEEeC Q lcl|NC_012223. 
82 QVKVGDLVSLNNTDYRVINPNPLQPAGQTMLFQLQLRG 119 (119) Q Consensus 82 ~p~~gD~i~i~G~~~~Vv~v~pi~Pag~~v~y~~q~R~ 119 (119) .++.+++|..+|+.|.|..+.|..-.+ - ..|...- T Consensus 69 ~I~~~~Ri~~~g~~y~I~~v~~~~~~~-~--l~i~~~e 103 (108) T protein:vir:80 69 GIDEQMRVRYDGRLYDITAVLPARKRG-Y--LDLSVKV 103 (108) T ss_pred CCCcccEEEECCeEEEEEeeccCCCCC-E--EEEEEEe Confidence 578899999999999999998865433 1 2222222 No 59 >protein:vir:1272 Length: 119 # NCBI annotation: hypothetical protein # Family: family:all:116 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690764;genbank:gi:22855004;genbank:GeneID:955243 Probab=35.20 E-value=1.2 Score=20.06 Aligned_cols=104 Identities=18% Similarity=0.103 Sum_probs=58.4 Q ss_pred CeecccChHHHHHHHHHHHHhcCCeEEEEeCCE-eccCCCcccCccceeeeeEEEEecc---chhhc--CCeEEeecCEE Q lcl|NC_012223. 1 MIMAGFNYTGLKRKVNPLIKKFGMTATVTRPGT-VDRVDGDEVVIPPTSFDVIGLREEY---KPSEI--DGTRIVAGDVK 74 (119) Q Consensus 1 ~~Ma~~~Y~~~~~~A~~ll~~~G~~~tl~r~g~-~dp~~g~~~~~~~~~~~~~gv~~~~---~~~~i--dGtlI~~GD~~ 74 (119) |-|.+= +| -.-.++.++.. .|+ .|++...-...++|-|-+... +.++. .+...-.-..+ T Consensus 1 M~m~aG---~L-----------~~rI~i~~~~~~~D~-~G~~~~~w~~~~~~wA~v~~~~~l~g~E~~~a~~~~~~~~~~ 65 (119) T protein:vir:12 1 MRKKIS---QL-----------RHRLTFQKKNETQDE-EGNWNATYVDLFTVWGAVEGAGSLGNSESMIAGALGVKTPKK 65 (119) T ss_pred CCCCcc---cc-----------CceEEEEeeeeccCC-CCCeecceEeEEEEEEEEeecCccchhheeecceeeeeeEEE Confidence 555321 12 22356655532 454 444444444557788877653 33444 34333344555 Q ss_pred EEEccCcccccCCEEEe------CCeEEEEEecceeCCCCcEEEEEEEEeC Q lcl|NC_012223. 75 FLCQAIKQVKVGDLVSL------NNTDYRVINPNPLQPAGQTMLFQLQLRG 119 (119) Q Consensus 75 ~~~sa~~~p~~gD~i~i------~G~~~~Vv~v~pi~Pag~~v~y~~q~R~ 119 (119) +.+-.-.-++.+++|.. 
+|+.|.|.++.+.+..+--+.-.+.-.+ T Consensus 66 i~iRy~~~I~~~~Ri~~~~~~~~~g~~y~I~~v~~~~~~~~~l~l~c~e~~ 116 (119) T protein:vir:12 66 ITVRYRKDIKPNMRIVKRVPKDKTERVFDILDTNDPDDQEEELEILCQEVG 116 (119) T ss_pred EEEEeCCCCCccceEeeccccccCCeEEEEEecCCCccCCeEEEEEEEEee Confidence 55543334577888854 8999999999766666654444555555 No 60 >protein:vir:9878 Length: 103 # NCBI annotation: hypothetical protein # Family: family:all:2717 # MgeID: mge:177 # MgeName: 315.5 # Cross-refs: genbank:acc:NP_795640;genbank:gi:28876401;genbank:GeneID:1257932 Probab=34.34 E-value=0.41 Score=22.73 Aligned_cols=95 Identities=19% Similarity=0.377 Sum_probs=58.6 Q ss_pred CeecccChHHHHHHHHHHHHhcCCeEEEEeCCEeccCCCcccCccceeeeeEEEEeccchhhc--CCeEEeecCEEEEE- Q lcl|NC_012223. 1 MIMAGFNYTGLKRKVNPLIKKFGMTATVTRPGTVDRVDGDEVVIPPTSFDVIGLREEYKPSEI--DGTRIVAGDVKFLC- 77 (119) Q Consensus 1 ~~Ma~~~Y~~~~~~A~~ll~~~G~~~tl~r~g~~dp~~g~~~~~~~~~~~~~gv~~~~~~~~i--dGtlI~~GD~~~~~- 77 (119) |.|+.|+ -+++..-|+-|.- |-+..++..--.-+|.+.+.+.-+| ++.-+...-++++- T Consensus 1 M~f~p~~-----------------l~a~~~tG~kDrL-gNpI~e~v~ir~~kGrlt~Wsaedia~~~R~lt~t~rKllT~ 62 (103) T protein:vir:98 1 MMFVNFD-----------------LVTSQKTGEKDRL-GNDITKDVVKRVAKGRFTEWSADDVSLYGRDLTSSARKLLTN 62 (103) T ss_pred Cccccee-----------------eEEeeeecccccc-CCCcccceeeeeecceeeccchhhhhhhhhhhHHHHHHhhcc Confidence 7777764 2455555655542 2233334444556899999999887 34444455555554 Q ss_pred -ccCcccccCCEEEeCCeEEEEEecceeCCCCcEEEEEE-EEeC Q lcl|NC_012223. 
78 -QAIKQVKVGDLVSLNNTDYRVINPNPLQPAGQTMLFQL-QLRG 119 (119) Q Consensus 78 -sa~~~p~~gD~i~i~G~~~~Vv~v~pi~Pag~~v~y~~-q~R~ 119 (119) ++..+.+...+|.++|..|.|++++ +..- |++ .+.| T Consensus 63 ~ap~~~~k~ae~V~ieG~~Yki~~~K--d~gR----WRLl~vKa 100 (103) T protein:vir:98 63 QVSKAEAKQASHVVIDGSKYKVESVK--DLGR----WRLLVIKG 100 (103) T ss_pred ccChhhhcccceeEeccceeEEechh--hcCe----eEEEEEee Confidence 3445678889999999999999984 3333 222 2333 No 61 >protein:vir:94910 Length: 119 # NCBI annotation: hypothetical protein # Family: family:all:31609 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239281;genbank:gi:66392063;genbank:GeneID:5076555 Probab=33.61 E-value=1.3 Score=19.88 Aligned_cols=110 Identities=16% Similarity=0.209 Sum_probs=64.5 Q ss_pred CeecccChHHHHHHHHHHHHhc---CCeEEEE---eCCEeccCCCcccCccceeeeeEEEEeccchhhcCCeEEeecCEE Q lcl|NC_012223. 1 MIMAGFNYTGLKRKVNPLIKKF---GMTATVT---RPGTVDRVDGDEVVIPPTSFDVIGLREEYKPSEIDGTRIVAGDVK 74 (119) Q Consensus 1 ~~Ma~~~Y~~~~~~A~~ll~~~---G~~~tl~---r~g~~dp~~g~~~~~~~~~~~~~gv~~~~~~~~idGtlI~~GD~~ 74 (119) |..+.-.+ .+-++|.|-|-+. |+-.+.+ +.|+|-.++ ++-.+.-+.+.|+-.+.-.-.|..=.++ T Consensus 1 mslsrqif-nairtakrslgdmiltgqlitytteyqdgeyvnvg--------vernidivpdqfsyeeltslnineievk 71 (119) T protein:vir:94 1 MSLSRQIF-NAIRTAKRSLGDMILTGQLITYTTEYQDGEYVNVG--------VERNIDIVPDQFSYEELTSLNINEIEVK 71 (119) T ss_pred CchhHHHH-HHHhhhhhhhcceeeeceeeeeeecccCCceEEee--------eeccceeccCcccchhhcccccceeEEE Confidence 76666222 4566787777654 3333333 234432221 1222334555565555544445544555 Q ss_pred EEE---ccCcccccCCEEEeCCeEEEEEecceeCCCCcEEEEEEEEeC Q lcl|NC_012223. 
75 FLC---QAIKQVKVGDLVSLNNTDYRVINPNPLQPAGQTMLFQLQLRG 119 (119) Q Consensus 75 ~~~---sa~~~p~~gD~i~i~G~~~~Vv~v~pi~Pag~~v~y~~q~R~ 119 (119) +++ .-+-+++..|+|..-|..|+|--++|-.-+-.+-.|.+-+.- T Consensus 72 llvfnvnddlviktedkirykgdeysiylvkpesvglltpvytvmlkk 119 (119) T protein:vir:94 72 LLVFNVNDDLVIKTEDKIRYKGDEYSIYLVKPESVGLLTPVYTVMLKK 119 (119) T ss_pred EEEEecCCceEEeeccceeecCCceEEEEEccccccccccceeeeecC Confidence 554 234688999999999999999988875544455567777767 No 62 >protein:vir:79989 Length: 111 # NCBI annotation: hypothetical protein # Family: family:all:267 # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430005;genbank:gi:156604060;genbank:GeneID:5525449 Probab=31.67 E-value=1.5 Score=19.65 Aligned_cols=102 Identities=10% Similarity=0.019 Sum_probs=56.7 Q ss_pred ecccChHHHHHHHHHHHHhcCCeEEEEe--CCEeccCCCcccCccceeeeeEEEEeccchhhcCCeEEe--ecCEEEEE- Q lcl|NC_012223. 3 MAGFNYTGLKRKVNPLIKKFGMTATVTR--PGTVDRVDGDEVVIPPTSFDVIGLREEYKPSEIDGTRIV--AGDVKFLC- 77 (119) Q Consensus 3 Ma~~~Y~~~~~~A~~ll~~~G~~~tl~r--~g~~dp~~g~~~~~~~~~~~~~gv~~~~~~~~idGtlI~--~GD~~~~~- 77 (119) |..++=.+| -..++.-+ .+.-.|..+.+ ....-|.|-|-+.+-+.+|..-+.-. ...+.+++ T Consensus 1 mp~~~~g~L-----------~trI~F~~~~t~~dg~~~~~~--~~e~~~~CwA~v~~~~~kD~~~~~g~~~e~~iTf~IR 67 (111) T protein:vir:79 1 MMKFNSNKL-----------NERIDFCEDVSERVNGNPMKP--KTKILYSCFACIQESKESDTQTNLNTGSKFIKTIIIR 67 (111) T ss_pred CCcccCCCC-----------CccEEEEEeecCcCCCCCCCC--eeEEEEEEEEeeecCchhhhhhhhhhhccceeEEEEe Confidence 333321111 12233311 11212222222 23567999999999999987532222 24455555 Q ss_pred ccCc--ccccCCEEEeCCeEEEEEecceeCCCC--cEEEEEEEE Q lcl|NC_012223. 78 QAIK--QVKVGDLVSLNNTDYRVINPNPLQPAG--QTMLFQLQL 117 (119) Q Consensus 78 sa~~--~p~~gD~i~i~G~~~~Vv~v~pi~Pag--~~v~y~~q~ 117 (119) .+.. .|...++|+++|..|.|+++.|--+.. 
++|.=..-+ T Consensus 68 ~~~~~y~~~n~~~V~~~~~~ynI~~V~pd~~~~~f~~Iv~~~V~ 111 (111) T protein:vir:79 68 DTRGDYKPTNKHYVLHEGQRFNIKYVKPDYQDKSYLRIYGEVVI 111 (111) T ss_pred cCCCCcccCCccEEEEcceEeeEEEecCCCCcCceEEEEEEEeC Confidence 3333 688999999999999999999955554 222222222 No 63 >protein:vir:9413 Length: 111 # NCBI annotation: phi PVL orf 10-like protein # Family: family:all:267 # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803391;genbank:gi:29028703;genbank:GeneID:1258140 Probab=31.67 E-value=1.5 Score=19.65 Aligned_cols=102 Identities=10% Similarity=0.019 Sum_probs=56.7 Q ss_pred ecccChHHHHHHHHHHHHhcCCeEEEEe--CCEeccCCCcccCccceeeeeEEEEeccchhhcCCeEEe--ecCEEEEE- Q lcl|NC_012223. 3 MAGFNYTGLKRKVNPLIKKFGMTATVTR--PGTVDRVDGDEVVIPPTSFDVIGLREEYKPSEIDGTRIV--AGDVKFLC- 77 (119) Q Consensus 3 Ma~~~Y~~~~~~A~~ll~~~G~~~tl~r--~g~~dp~~g~~~~~~~~~~~~~gv~~~~~~~~idGtlI~--~GD~~~~~- 77 (119) |..++=.+| -..++.-+ .+.-.|..+.+ ....-|.|-|-+.+-+.+|..-+.-. ...+.+++ T Consensus 1 mp~~~~g~L-----------~trI~F~~~~t~~dg~~~~~~--~~e~~~~CwA~v~~~~~kD~~~~~g~~~e~~iTf~IR 67 (111) T protein:vir:94 1 MMKFNSNKL-----------NERIDFCEDVSERVNGNPMKP--KTKILYSCFACIQESKESDTQTNLNTGSKFIKTIIIR 67 (111) T ss_pred CCcccCCCC-----------CccEEEEEeecCcCCCCCCCC--eeEEEEEEEEeeecCchhhhhhhhhhhccceeEEEEe Confidence 333321111 12233311 11212222222 23567999999999999987532222 24455555 Q ss_pred ccCc--ccccCCEEEeCCeEEEEEecceeCCCC--cEEEEEEEE Q lcl|NC_012223. 78 QAIK--QVKVGDLVSLNNTDYRVINPNPLQPAG--QTMLFQLQL 117 (119) Q Consensus 78 sa~~--~p~~gD~i~i~G~~~~Vv~v~pi~Pag--~~v~y~~q~ 117 (119) .+.. .|...++|+++|..|.|+++.|--+.. 
++|.=..-+ T Consensus 68 ~~~~~y~~~n~~~V~~~~~~ynI~~V~pd~~~~~f~~Iv~~~V~ 111 (111) T protein:vir:94 68 DTRGDYKPTNKHYVLHEGQRFNIKYVKPDYQDKSYLRIYGEVVI 111 (111) T ss_pred cCCCCcccCCccEEEEcceEeeEEEecCCCCcCceEEEEEEEeC Confidence 3333 688999999999999999999955554 222222222 No 64 >protein:vir:4603 Length: 111 # NCBI annotation: hypothetical protein # Family: family:all:267 # MgeID: mge:101 # MgeName: PVL # Cross-refs: genbank:acc:NP_058448;genbank:gi:9635174;genbank:GeneID:1262719 Probab=31.67 E-value=1.5 Score=19.65 Aligned_cols=102 Identities=10% Similarity=0.019 Sum_probs=56.7 Q ss_pred ecccChHHHHHHHHHHHHhcCCeEEEEe--CCEeccCCCcccCccceeeeeEEEEeccchhhcCCeEEe--ecCEEEEE- Q lcl|NC_012223. 3 MAGFNYTGLKRKVNPLIKKFGMTATVTR--PGTVDRVDGDEVVIPPTSFDVIGLREEYKPSEIDGTRIV--AGDVKFLC- 77 (119) Q Consensus 3 Ma~~~Y~~~~~~A~~ll~~~G~~~tl~r--~g~~dp~~g~~~~~~~~~~~~~gv~~~~~~~~idGtlI~--~GD~~~~~- 77 (119) |..++=.+| -..++.-+ .+.-.|..+.+ ....-|.|-|-+.+-+.+|..-+.-. ...+.+++ T Consensus 1 mp~~~~g~L-----------~trI~F~~~~t~~dg~~~~~~--~~e~~~~CwA~v~~~~~kD~~~~~g~~~e~~iTf~IR 67 (111) T protein:vir:46 1 MMKFNSNKL-----------NERIDFCEDVSERVNGNPMKP--KTKILYSCFACIQESKESDTQTNLNTGSKFIKTIIIR 67 (111) T ss_pred CCcccCCCC-----------CccEEEEEeecCcCCCCCCCC--eeEEEEEEEEeeecCchhhhhhhhhhhccceeEEEEe Confidence 333321111 12233311 11212222222 23567999999999999987532222 24455555 Q ss_pred ccCc--ccccCCEEEeCCeEEEEEecceeCCCC--cEEEEEEEE Q lcl|NC_012223. 78 QAIK--QVKVGDLVSLNNTDYRVINPNPLQPAG--QTMLFQLQL 117 (119) Q Consensus 78 sa~~--~p~~gD~i~i~G~~~~Vv~v~pi~Pag--~~v~y~~q~ 117 (119) .+.. .|...++|+++|..|.|+++.|--+.. 
++|.=..-+ T Consensus 68 ~~~~~y~~~n~~~V~~~~~~ynI~~V~pd~~~~~f~~Iv~~~V~ 111 (111) T protein:vir:46 68 DTRGDYKPTNKHYVLHEGQRFNIKYVKPDYQDKSYLRIYGEVVI 111 (111) T ss_pred cCCCCcccCCccEEEEcceEeeEEEecCCCCcCceEEEEEEEeC Confidence 3333 688999999999999999999955554 222222222 No 65 >protein:vir:81103 Length: 111 # NCBI annotation: putative phage head tail adapter # Family: family:all:267 # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429877;genbank:gi:156603930;genbank:GeneID:5525323 Probab=31.67 E-value=1.5 Score=19.65 Aligned_cols=102 Identities=10% Similarity=0.019 Sum_probs=56.7 Q ss_pred ecccChHHHHHHHHHHHHhcCCeEEEEe--CCEeccCCCcccCccceeeeeEEEEeccchhhcCCeEEe--ecCEEEEE- Q lcl|NC_012223. 3 MAGFNYTGLKRKVNPLIKKFGMTATVTR--PGTVDRVDGDEVVIPPTSFDVIGLREEYKPSEIDGTRIV--AGDVKFLC- 77 (119) Q Consensus 3 Ma~~~Y~~~~~~A~~ll~~~G~~~tl~r--~g~~dp~~g~~~~~~~~~~~~~gv~~~~~~~~idGtlI~--~GD~~~~~- 77 (119) |..++=.+| -..++.-+ .+.-.|..+.+ ....-|.|-|-+.+-+.+|..-+.-. ...+.+++ T Consensus 1 mp~~~~g~L-----------~trI~F~~~~t~~dg~~~~~~--~~e~~~~CwA~v~~~~~kD~~~~~g~~~e~~iTf~IR 67 (111) T protein:vir:81 1 MMKFNSNKL-----------NERIDFCEDVSERVNGNPMKP--KTKILYSCFACIQESKESDTQTNLNTGSKFIKTIIIR 67 (111) T ss_pred CCcccCCCC-----------CccEEEEEeecCcCCCCCCCC--eeEEEEEEEEeeecCchhhhhhhhhhhccceeEEEEe Confidence 333321111 12233311 11212222222 23567999999999999987532222 24455555 Q ss_pred ccCc--ccccCCEEEeCCeEEEEEecceeCCCC--cEEEEEEEE Q lcl|NC_012223. 78 QAIK--QVKVGDLVSLNNTDYRVINPNPLQPAG--QTMLFQLQL 117 (119) Q Consensus 78 sa~~--~p~~gD~i~i~G~~~~Vv~v~pi~Pag--~~v~y~~q~ 117 (119) .+.. .|...++|+++|..|.|+++.|--+.. 
++|.=..-+ T Consensus 68 ~~~~~y~~~n~~~V~~~~~~ynI~~V~pd~~~~~f~~Iv~~~V~ 111 (111) T protein:vir:81 68 DTRGDYKPTNKHYVLHEGQRFNIKYVKPDYQDKSYLRIYGEVVI 111 (111) T ss_pred cCCCCcccCCccEEEEcceEeeEEEecCCCCcCceEEEEEEEeC Confidence 3333 688999999999999999999955554 222222222 No 66 >protein:vir:98341 Length: 111 # NCBI annotation: hypothetical protein # Family: family:all:267 # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918933;genbank:gi:119443695;genbank:GeneID:4594503 Probab=31.67 E-value=1.5 Score=19.65 Aligned_cols=102 Identities=10% Similarity=0.019 Sum_probs=56.7 Q ss_pred ecccChHHHHHHHHHHHHhcCCeEEEEe--CCEeccCCCcccCccceeeeeEEEEeccchhhcCCeEEe--ecCEEEEE- Q lcl|NC_012223. 3 MAGFNYTGLKRKVNPLIKKFGMTATVTR--PGTVDRVDGDEVVIPPTSFDVIGLREEYKPSEIDGTRIV--AGDVKFLC- 77 (119) Q Consensus 3 Ma~~~Y~~~~~~A~~ll~~~G~~~tl~r--~g~~dp~~g~~~~~~~~~~~~~gv~~~~~~~~idGtlI~--~GD~~~~~- 77 (119) |..++=.+| -..++.-+ .+.-.|..+.+ ....-|.|-|-+.+-+.+|..-+.-. ...+.+++ T Consensus 1 mp~~~~g~L-----------~trI~F~~~~t~~dg~~~~~~--~~e~~~~CwA~v~~~~~kD~~~~~g~~~e~~iTf~IR 67 (111) T protein:vir:98 1 MMKFNSNKL-----------NERIDFCEDVSERVNGNPMKP--KTKILYSCFACIQESKESDTQTNLNTGSKFIKTIIIR 67 (111) T ss_pred CCcccCCCC-----------CccEEEEEeecCcCCCCCCCC--eeEEEEEEEEeeecCchhhhhhhhhhhccceeEEEEe Confidence 333321111 12233311 11212222222 23567999999999999987532222 24455555 Q ss_pred ccCc--ccccCCEEEeCCeEEEEEecceeCCCC--cEEEEEEEE Q lcl|NC_012223. 78 QAIK--QVKVGDLVSLNNTDYRVINPNPLQPAG--QTMLFQLQL 117 (119) Q Consensus 78 sa~~--~p~~gD~i~i~G~~~~Vv~v~pi~Pag--~~v~y~~q~ 117 (119) .+.. .|...++|+++|..|.|+++.|--+.. 
++|.=..-+ T Consensus 68 ~~~~~y~~~n~~~V~~~~~~ynI~~V~pd~~~~~f~~Iv~~~V~ 111 (111) T protein:vir:98 68 DTRGDYKPTNKHYVLHEGQRFNIKYVKPDYQDKSYLRIYGEVVI 111 (111) T ss_pred cCCCCcccCCccEEEEcceEeeEEEecCCCCcCceEEEEEEEeC Confidence 3333 688999999999999999999955554 222222222 No 67 >protein:vir:3993 Length: 117 # NCBI annotation: unknown # Family: family:all:1030 # MgeID: mge:319 # MgeName: BK5-T # Cross-refs: genbank:acc:NP_116501;genbank:gi:14251134;genbank:GeneID:921310 Probab=29.27 E-value=1.7 Score=19.36 Aligned_cols=105 Identities=18% Similarity=0.235 Sum_probs=55.6 Q ss_pred eecccChHHHHHHHHHHHHhcCCeEEEEe-CCEeccCCCc-ccCccceeeeeEEEEeccchhh---cCCeEEeecCEEEE Q lcl|NC_012223. 2 IMAGFNYTGLKRKVNPLIKKFGMTATVTR-PGTVDRVDGD-EVVIPPTSFDVIGLREEYKPSE---IDGTRIVAGDVKFL 76 (119) Q Consensus 2 ~Ma~~~Y~~~~~~A~~ll~~~G~~~tl~r-~g~~dp~~g~-~~~~~~~~~~~~gv~~~~~~~~---idGtlI~~GD~~~~ 76 (119) -|..++ ...|-.-++.-. +..-||.+|. +...+...|.|.+-...-+..+ +-|+ -......+. T Consensus 1 m~~~~~-----------p~~~n~ri~Fg~~~~~~~~~g~~~~~~~~~~~f~~w~~~~t~t~~q~~~~~Gt-~~e~T~~~v 68 (117) T protein:vir:39 1 MVKTYK-----------PNDFNRKCKIGVTKTVTTPTGGKIEKIDPATVLNVRFAAKMRSLALQFQIIGT-TTADTLDIA 68 (117) T ss_pred CCcccc-----------ccccceeEEeeeecceecCCCCcccccEEeeEEEEEEEEeecccceeeeeecc-cccCcEEEE Confidence 122222 222223333322 2235665553 3334455687777776655443 3454 334455555 Q ss_pred EccCcccccCCEEEeCCeEEEEEecceeCCCCcEEEEE-EEEeC Q lcl|NC_012223. 
77 CQAIKQVKVGDLVSLNNTDYRVINPNPLQPAGQTMLFQ-LQLRG 119 (119) Q Consensus 77 ~sa~~~p~~gD~i~i~G~~~~Vv~v~pi~Pag~~v~y~-~q~R~ 119 (119) +-........=++.++|+.|.|+++.| +...-.+.|- +.|++ T Consensus 69 IRh~~~i~~~m~v~~~g~~Y~I~~Is~-Dd~~~~~~yD~iTlk~ 111 (117) T protein:vir:39 69 IRHNKLVTKKMCVQIDDVLYNIINISS-DESAKLIKFDILTLQA 111 (117) T ss_pred EEeCCCCCcccEEEECCeEEEEeEeCC-CCccCceeeeeEEEEE Confidence 644444444557899999999999998 5444444443 44554 No 68 >protein:vir:7447 Length: 131 # NCBI annotation: gp24 # Family: family:all:6925 # MgeID: mge:147 # MgeName: Barnyard # Cross-refs: genbank:acc:NP_818562;genbank:gi:29566999;genbank:GeneID:1260239 Probab=25.59 E-value=2 Score=18.89 Aligned_cols=109 Identities=18% Similarity=0.158 Sum_probs=60.2 Q ss_pred ecccChHHHHHHHHHHHHhcCCeEEEE-----eC-CEeccCCCcccCccceeeeeEEEEec--cchhhcCCeEEeecCEE Q lcl|NC_012223. 3 MAGFNYTGLKRKVNPLIKKFGMTATVT-----RP-GTVDRVDGDEVVIPPTSFDVIGLREE--YKPSEIDGTRIVAGDVK 74 (119) Q Consensus 3 Ma~~~Y~~~~~~A~~ll~~~G~~~tl~-----r~-g~~dp~~g~~~~~~~~~~~~~gv~~~--~~~~~idGtlI~~GD~~ 74 (119) |-+..-.=.++....+|...-.+.+|- |+ |+++...| ++.+++.|.+.---.+ -...-.||+.-.+=|-. T Consensus 1 ~~~~el~~~Rk~T~~FI~~Dpt~ivL~~~~~~rpgG~~~~~~g--ppR~pQ~Fkvi~~~gd~~~~~~tadg~~Trrfdfi 78 (131) T protein:vir:74 1 MNAAELALHRKGTRDFINRDKTSLVLYPTNETWVAGTKVYNDV--PPRPAQDFKVIWPGADQGGKVNTDEGTETSRYDFI 78 (131) T ss_pred CchhHhHHHhhhhhhhhcCCCceEEEeccccccCCCccccCCC--CCCCcceEEEEecCCCccceeeeccCcccceeEEE Confidence 333322233566677787777777664 34 45666655 3455666644211111 00011233222222333 Q ss_pred EEEccCcccccCCEEEeCC----eEEEEEecceeCCCCcEEEEEEEEeC Q lcl|NC_012223. 75 FLCQAIKQVKVGDLVSLNN----TDYRVINPNPLQPAGQTMLFQLQLRG 119 (119) Q Consensus 75 ~~~sa~~~p~~gD~i~i~G----~~~~Vv~v~pi~Pag~~v~y~~q~R~ 119 (119) ++=+-+++.++||.=+=+. 
.+|.|..+.|-++ |+...++ T Consensus 79 lvG~~DAvveiGD~W~eGd~~~~q~YvV~~l~~~ng------YevK~~~ 121 (131) T protein:vir:74 79 LVGNWDAVVEIGDHWTEGDDENRQTYVVEWVQPYNG------YEVKAGG 121 (131) T ss_pred EeecccceeeecceeecCCCCccceEEEEEeecCCC------eEEeeeE Confidence 3335677889999977544 5799999988665 7777777 No 69 >protein:vir:9762 Length: 112 # NCBI annotation: hypothetical protein # Family: family:all:1270 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795524;genbank:gi:28876280;genbank:GeneID:1257821 Probab=25.42 E-value=2.1 Score=18.86 Aligned_cols=99 Identities=13% Similarity=0.124 Sum_probs=56.6 Q ss_pred ecccChHHHHHHHHHHHHhcCCeEEEEeC--CEeccCCCcccCccceeeeeEEEEec-cchhhcCCeEEeecCEEEEEcc Q lcl|NC_012223. 3 MAGFNYTGLKRKVNPLIKKFGMTATVTRP--GTVDRVDGDEVVIPPTSFDVIGLREE-YKPSEIDGTRIVAGDVKFLCQA 79 (119) Q Consensus 3 Ma~~~Y~~~~~~A~~ll~~~G~~~tl~r~--g~~dp~~g~~~~~~~~~~~~~gv~~~-~~~~~idGtlI~~GD~~~~~sa 79 (119) |+-+ .|.+++|.++ .+-||- |.+..+ ...-++.+|+.. -+..++.+++=..|.+..+--+ T Consensus 1 m~~i---------------kGetVtvi~~~~tG~D~~-g~p~~~-~~~e~V~nVLV~P~s~~d~~~~~~p~G~~v~~tl~ 63 (112) T protein:vir:97 1 MGKL---------------RGITITLIDKVTIDIDPF-GNPIKK-DKEISVDNVLVSPATSDDITSQLSLSGKKAVYTLA 63 (112) T ss_pred Cccc---------------cceeEEEeccccccccCC-CCceec-ccceecCcEEeCCCChhhcccccCcCceEEEEEEe Confidence 3333 4778888754 233553 223332 455677777765 4445665554445555444322 Q ss_pred ----CcccccCCEEEeCCeEEEEEecceeCCCC--cEEEEEEEEeC Q lcl|NC_012223. 
80 ----IKQVKVGDLVSLNNTDYRVINPNPLQPAG--QTMLFQLQLRG 119 (119) Q Consensus 80 ----~~~p~~gD~i~i~G~~~~Vv~v~pi~Pag--~~v~y~~q~R~ 119 (119) ..-.-.|..|.+.|++|+||-- |+...+ ++..|...+=- T Consensus 64 fPK~~~~~lrg~~V~~~G~~~~vvG~-P~~~~~~~~P~~WN~~V~V 108 (112) T protein:vir:97 64 IPKGDNHDWGDKEVRFFGEKWRTVGL-ALEGIEELIPLEWNKKVMV 108 (112) T ss_pred cCCCCCCcccCcEEEEeCCeeEEecC-CccccCCCCCCccCCeEEE Confidence 1246789999999999998864 433322 44556555443 No 70 >protein:vir:79686 Length: 118 # NCBI annotation: hypothetical protein # Family: family:all:2747 # MgeID: mge:1873 # MgeName: LL-H # Cross-refs: genbank:acc:YP_001285885;genbank:gi:148750842;genbank:GeneID:5220385 Probab=25.18 E-value=2.1 Score=18.83 Aligned_cols=105 Identities=14% Similarity=0.183 Sum_probs=50.8 Q ss_pred CeecccChHHHHHHHHHHHHhcCCeEEEEeCCEeccC-CCcccCccceeeeeEEEEeccchhh---cCCeEEeecCEEEE Q lcl|NC_012223. 1 MIMAGFNYTGLKRKVNPLIKKFGMTATVTRPGTVDRV-DGDEVVIPPTSFDVIGLREEYKPSE---IDGTRIVAGDVKFL 76 (119) Q Consensus 1 ~~Ma~~~Y~~~~~~A~~ll~~~G~~~tl~r~g~~dp~-~g~~~~~~~~~~~~~gv~~~~~~~~---idGtlI~~GD~~~~ 76 (119) |+ -.++.+ || -+..+++..-+++-. .++.++ .+...+++-++.+-...+ .++..|.+.=+.++ T Consensus 1 ~m-~~ipk~--------~l---~~sit~k~~~~~~~~D~yg~~~-y~~p~~I~nvrvd~~t~ySgt~n~rq~~~navif~ 67 (118) T protein:vir:79 1 MK-LPIPYQ--------MA---VSTVHLKLTDQSAKKDRYGRTV-PTWEGDITKCVVNMQTTYSGTNNDRQIVANGLIVM 67 (118) T ss_pred CC-Cccchh--------hc---cceEEEEEeccccCcCCCCCee-ccCCeeeeeeEecccceecccCCCCeEEeceEEEE Confidence 32 223321 11 134677654322211 111111 111234444444422221 12222332222222 Q ss_pred EccCccc-------ccCCEEEeCCeEEEEEecceeC--CCCcEEEEEEEEe Q lcl|NC_012223. 
77 CQAIKQV-------KVGDLVSLNNTDYRVINPNPLQ--PAGQTMLFQLQLR 118 (119) Q Consensus 77 ~sa~~~p-------~~gD~i~i~G~~~~Vv~v~pi~--Pag~~v~y~~q~R 118 (119) .+--+.| ..|++|.++|+.|.|.++.|.- ..+.+-+|+|-+= T Consensus 68 y~~~s~p~~~~~~~s~g~kivfdG~eYtI~~i~~~~ep~sn~vy~yElEVi 118 (118) T protein:vir:79 68 YAGYSNPIPTLTKENLGSKLTYQGLDYTVTSLNRFDQPGTEDLYCYELEVI 118 (118) T ss_pred ecccCccccEEeccccccceeeCCeeEEeeeeeecccCcCCcEEEEEEEeC Confidence 2211222 2388999999999999999987 5567789999888 No 71 >protein:vir:101507 Length: 135 # NCBI annotation: gp20 # Family: family:all:6925 # MgeID: mge:1627 # MgeName: PLot # Cross-refs: genbank:acc:YP_655399;genbank:gi:109522587;genbank:GeneID:4157579 Probab=24.15 E-value=2.2 Score=18.69 Aligned_cols=111 Identities=20% Similarity=0.210 Sum_probs=59.6 Q ss_pred CeecccChHH---HHHHHHHHHHhcCCeEEEE--------eC-CEeccCCCcccCccceeeeeEEEE--eccchhhcCCe Q lcl|NC_012223. 1 MIMAGFNYTG---LKRKVNPLIKKFGMTATVT--------RP-GTVDRVDGDEVVIPPTSFDVIGLR--EEYKPSEIDGT 66 (119) Q Consensus 1 ~~Ma~~~Y~~---~~~~A~~ll~~~G~~~tl~--------r~-g~~dp~~g~~~~~~~~~~~~~gv~--~~~~~~~idGt 66 (119) |..++.--.. .++-...+|...-.+.+|. || |++|...| ++.+++.|.+.--- .......-||. T Consensus 1 ~~~~~~~~~~l~~~Rk~T~~FI~~Dpt~ivL~p~~~t~~ekPgG~~~~~~g--ppR~pQ~Fkvi~~~gdg~~~t~~~dgg 78 (135) T protein:vir:10 1 MTVSASAQRGLEFLRAGTRAFIDDDPTTIVLNRGQATRVEKPGGGYDFTPG--APRTAQIFKVINQTGDGSALTEAQDGI 78 (135) T ss_pred CccccchhhhHHHHhhhhhhhhcCCCceEEEECCcceEEecCCCCcccCCC--CCCCcceEEEEecCCCcceeeccccCc Confidence 5555431111 1333344555444444442 34 56777655 34455666432111 11111222332 Q ss_pred EEeecCEEEEEccCcccccCCEEEeCCeEEEEEecceeCCCCcEEEEEEEEeC Q lcl|NC_012223. 
67 RIVAGDVKFLCQAIKQVKVGDLVSLNNTDYRVINPNPLQPAGQTMLFQLQLRG 119 (119) Q Consensus 67 lI~~GD~~~~~sa~~~p~~gD~i~i~G~~~~Vv~v~pi~Pag~~v~y~~q~R~ 119 (119) .=.+=|-.++=+-++..++||.=+=+..+|.|..+.|-++ |+...++ T Consensus 79 ~TrrfdfilvG~~DAvveiGD~W~egd~~YvVe~l~~~ng------YEvK~~~ 125 (135) T protein:vir:10 79 QTPVREYILLGAHDSVAEVGDWWVDGNNRYTVTELLVANG------YERKWRV 125 (135) T ss_pred ccceeEEEEeecccceeeecceeecCCCcEEEEEEecCCC------eEEeeeE Confidence 2223333333356778899999888888999999988665 7777777 No 72 >protein:vir:102189 Length: 135 # NCBI annotation: gp20 # Family: family:all:6925 # MgeID: mge:1648 # MgeName: PBI1 # Cross-refs: genbank:acc:YP_655216;genbank:gi:109522796;genbank:GeneID:4157428 Probab=24.15 E-value=2.2 Score=18.69 Aligned_cols=111 Identities=20% Similarity=0.210 Sum_probs=59.6 Q ss_pred CeecccChHH---HHHHHHHHHHhcCCeEEEE--------eC-CEeccCCCcccCccceeeeeEEEE--eccchhhcCCe Q lcl|NC_012223. 1 MIMAGFNYTG---LKRKVNPLIKKFGMTATVT--------RP-GTVDRVDGDEVVIPPTSFDVIGLR--EEYKPSEIDGT 66 (119) Q Consensus 1 ~~Ma~~~Y~~---~~~~A~~ll~~~G~~~tl~--------r~-g~~dp~~g~~~~~~~~~~~~~gv~--~~~~~~~idGt 66 (119) |..++.--.. .++-...+|...-.+.+|. || |++|...| ++.+++.|.+.--- .......-||. T Consensus 1 ~~~~~~~~~~l~~~Rk~T~~FI~~Dpt~ivL~p~~~t~~ekPgG~~~~~~g--ppR~pQ~Fkvi~~~gdg~~~t~~~dgg 78 (135) T protein:vir:10 1 MTVSASAQRGLEFLRAGTRAFIDDDPTTIVLNRGQATRVEKPGGGYDFTPG--APRTAQIFKVINQTGDGSALTEAQDGI 78 (135) T ss_pred CccccchhhhHHHHhhhhhhhhcCCCceEEEECCcceEEecCCCCcccCCC--CCCCcceEEEEecCCCcceeeccccCc Confidence 5555431111 1333344555444444442 34 56777655 34455666432111 11111222332 Q ss_pred EEeecCEEEEEccCcccccCCEEEeCCeEEEEEecceeCCCCcEEEEEEEEeC Q lcl|NC_012223. 
67 RIVAGDVKFLCQAIKQVKVGDLVSLNNTDYRVINPNPLQPAGQTMLFQLQLRG 119 (119) Q Consensus 67 lI~~GD~~~~~sa~~~p~~gD~i~i~G~~~~Vv~v~pi~Pag~~v~y~~q~R~ 119 (119) .=.+=|-.++=+-++..++||.=+=+..+|.|..+.|-++ |+...++ T Consensus 79 ~TrrfdfilvG~~DAvveiGD~W~egd~~YvVe~l~~~ng------YEvK~~~ 125 (135) T protein:vir:10 79 QTPVREYILLGAHDSVAEVGDWWVDGNNRYTVTELLVANG------YERKWRV 125 (135) T ss_pred ccceeEEEEeecccceeeecceeecCCCcEEEEEEecCCC------eEEeeeE Confidence 2223333333356778899999888888999999988665 7777777 No 73 >protein:vir:3872 Length: 146 # NCBI annotation: putative head-tail joining protein # Family: family:all:28619 # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680489;swissprot:trembl:p94213;genbank:gi:22296529;uniprot:P94213;genbank:GeneID:951708 Probab=22.70 E-value=2.4 Score=18.49 Aligned_cols=116 Identities=6% Similarity=-0.038 Sum_probs=63.5 Q ss_pred CeecccChHHHHHHHHHHH--HhcC---CeEEEEeCCEeccCCCcccCccceeeeeEEEEeccchhhc---CCeEEe-ec Q lcl|NC_012223. 1 MIMAGFNYTGLKRKVNPLI--KKFG---MTATVTRPGTVDRVDGDEVVIPPTSFDVIGLREEYKPSEI---DGTRIV-AG 71 (119) Q Consensus 1 ~~Ma~~~Y~~~~~~A~~ll--~~~G---~~~tl~r~g~~dp~~g~~~~~~~~~~~~~gv~~~~~~~~i---dGtlI~-~G 71 (119) .|.+-. =++..-.++- -+.| .-.++-+......+.|.........++|=|-+..-+.+|. .+++++ .. T Consensus 20 ~~~~~~---~~~~~~~~mp~~M~~gkLn~RItfqk~~~~~d~~g~~~~~w~~v~tvWA~V~~~~grE~~~~~a~~~~~e~ 96 (146) T protein:vir:38 20 QILSIS---FAQNCRKRMVILMRINRMTERIAFVSYESKKVNGVPVDGVIVKHMTVWAEVPKVPIREANDPQTKLGTRKD 96 (146) T ss_pred chhhhh---hHhhccccceeeeccccCCccEEEEEeeeeecCCCcCCCcceeeeEEEEeeeccchhhhHhhhhhhhhhcc Confidence 122111 0111111111 1222 2345444433222223333344456889999998888885 455555 44 Q ss_pred CEEEEEcc--CcccccCCEEEeCCeEEEEEecceeCCCCcEEEEEEEEeC Q lcl|NC_012223. 
72 DVKFLCQA--IKQVKVGDLVSLNNTDYRVINPNPLQPAGQTMLFQLQLRG 119 (119) Q Consensus 72 D~~~~~sa--~~~p~~gD~i~i~G~~~~Vv~v~pi~Pag~~v~y~~q~R~ 119 (119) .+++++-- ...+...-+|..+|+.|.|+++.|-.-+.--+.-.+.-.+ T Consensus 97 ti~F~IRY~~~~~I~~~mRI~y~gk~YeI~~I~pd~~~k~~~~I~akeVS 146 (146) T protein:vir:38 97 SPTFLVRFLTAEEIQPTWRIQWRGNEYQITGLDPDYERRDLTTITAKAVS 146 (146) T ss_pred eeEEEEEecCCccCCcccEEEECCeEEEEeeeCCccccCcEEEEEEEEeC Confidence 56677744 4456778899999999999999886666543333344444 No 74 >protein:vir:93599 Length: 116 # NCBI annotation: putative head-tail adaptor # Family: family:all:116 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449298;genbank:gi:157166046;interpro:IPR006453;interpro:IPR013045;uniprot:Q6H9U3;genbank:GeneID:5580421 Probab=21.98 E-value=2.5 Score=18.39 Aligned_cols=104 Identities=12% Similarity=0.122 Sum_probs=58.0 Q ss_pred CeecccChHHHHHHHHHHHHhcCCeEEEEeCCE-eccCCCcccCccceeeeeEEEEeccchhhc--CCeEEeecCEEEEE Q lcl|NC_012223. 1 MIMAGFNYTGLKRKVNPLIKKFGMTATVTRPGT-VDRVDGDEVVIPPTSFDVIGLREEYKPSEI--DGTRIVAGDVKFLC 77 (119) Q Consensus 1 ~~Ma~~~Y~~~~~~A~~ll~~~G~~~tl~r~g~-~dp~~g~~~~~~~~~~~~~gv~~~~~~~~i--dGtlI~~GD~~~~~ 77 (119) |-|.+= +|. ...++-++.. .|. .|+....-...+++-|-+...+.++. .+.....--.++.+ T Consensus 1 m~m~aG---~L~-----------~rI~iq~~~~~~d~-~G~~~~~w~~~~~~wA~v~~~sg~e~~~a~~~~~~~~~~i~i 65 (116) T protein:vir:93 1 MAISAG---RLT-----------QMISVLNPVLTRNA-AGEMTEEWVSCGKIHADIRGRSSRERMQSGAEMAQAEIRIWV 65 (116) T ss_pred CCcCcc---ccC-----------ccEEEEeeeeccCC-CCCeecceEeEEEEEEEEEecChhheeeccceeeeeeEEEEE Confidence 555442 111 2356656543 343 34444444456888898888888754 23322222334443 Q ss_pred --ccCcccccCCEEE-----eCCeEEEEEecceeCCCCcEEEEEEEEeC Q lcl|NC_012223. 78 --QAIKQVKVGDLVS-----LNNTDYRVINPNPLQPAGQTMLFQLQLRG 119 (119) Q Consensus 78 --sa~~~p~~gD~i~-----i~G~~~~Vv~v~pi~Pag~~v~y~~q~R~ 119 (119) .+...+..+++|. 
.+|+.|.|.++.+.+..+.-+.-.++-=| T Consensus 66 Ry~~~~dI~~~~Ri~~~~~~~~g~~y~I~~v~~~~~~~~~l~i~c~eg~ 114 (116) T protein:vir:93 66 RGQSGREITAASRLHVLSGPWRDRILNVVGLPVPDATGGRLEILCRLGG 114 (116) T ss_pred EecCCCCCCcccEEEEcCcccCCeEEEEEecCCCCCCCcEEEEEEEecC Confidence 2334578899987 68999999999644555543222222222 Done!