Query lcl|Aclame:protein:vir:2770|NCBI_annot:hypothetical protein|genbank:acc:NP_612887;genbank:gi:20065804;genbank:GeneID:935710 Match_columns 318 No_of_seqs 69 out of 80 Neff 5.5 Searched_HMMs 1612 Date Sat Nov 30 08:09:05 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_8 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_8_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:2770 Length: 318 # 100.0 4E-148 2E-151 828.9 30.7 318 1-318 1-318 (318) 2 protein:vir:819 Length: 404 # 100.0 3E-139 2E-142 780.4 29.9 318 1-318 1-318 (404) 3 protein:vir:104439 Length: 404 100.0 3E-139 2E-142 780.4 29.9 318 1-318 1-318 (404) 4 protein:vir:10123 Length: 404 100.0 3E-139 2E-142 780.4 29.9 318 1-318 1-318 (404) 5 protein:vir:3298 Length: 404 # 100.0 3E-139 2E-142 780.4 29.9 318 1-318 1-318 (404) 6 protein:vir:105610 Length: 430 100.0 1E-123 9E-127 694.0 27.0 314 1-318 5-328 (430) 7 protein:vir:93696 Length: 364 100.0 4E-113 3E-116 636.6 23.9 288 13-318 1-299 (364) 8 protein:vir:80180 Length: 381 99.2 5.8E-13 3.6E-16 87.7 14.4 251 1-318 1-264 (381) 9 protein:vir:1541 Length: 347 # 99.2 5.4E-13 3.3E-16 87.8 14.0 263 1-318 1-267 (347) 10 protein:vir:8885 Length: 347 # 99.2 1.8E-12 1.1E-15 85.0 15.3 260 1-318 1-279 (347) 11 protein:vir:2201 Length: 345 # 99.2 3.2E-12 2E-15 83.6 16.0 262 1-318 1-307 (345) 12 protein:vir:3364 Length: 347 # 99.1 5E-12 3.1E-15 82.5 14.9 264 1-318 1-309 (347) 13 protein:vir:10450 Length: 344 99.1 7.1E-12 4.4E-15 81.7 15.5 261 1-318 1-292 (344) 14 protein:vir:3613 Length: 272 # 99.1 1.5E-11 9.3E-15 79.9 16.5 222 1-318 1-234 (272) 15 protein:vir:94711 Length: 347 99.1 1.1E-11 6.9E-15 80.6 14.8 262 1-318 1-285 (347) 16 protein:vir:94622 Length: 341 99.0 3.5E-11 2.1E-14 77.9 16.1 238 13-318 1-256 (341) 17 protein:vir:7990 Length: 273 # 99.0 1.5E-10 9.2E-14 74.5 18.3 226 13-318 1-234 (273) 18 protein:vir:94576 Length: 347 99.0 4.1E-11 2.6E-14 77.5 14.5 260 1-318 1-286 (347) 19 protein:vir:80213 Length: 334 98.9 1.4E-10 8.7E-14 74.6 16.4 255 1-318 1-294 (334) 20 protein:vir:96123 Length: 274 98.9 3.1E-10 2E-13 72.7 16.1 224 1-318 1-232 (274) 21 protein:vir:96262 Length: 274 98.9 3.4E-10 2.1E-13 72.5 16.1 224 1-318 1-239 (274) 22 protein:vir:95898 Length: 274 98.9 3.4E-10 2.1E-13 72.5 16.1 224 1-318 1-239 (274) 23 protein:vir:78739 Length: 332 98.8 3.9E-10 2.4E-13 72.1 14.0 248 1-318 1-297 (332) 24 protein:vir:739 Length: 231 # 98.8 4.1E-10 2.5E-13 72.1 14.0 178 62-318 1-200 (231) 25 protein:vir:94494 Length: 274 98.8 1.4E-09 8.6E-13 69.1 16.8 224 1-318 1-232 (274) 26 protein:vir:97433 Length: 274 98.8 1.4E-09 8.6E-13 69.1 16.8 224 1-318 1-232 (274) 27 protein:vir:78935 Length: 335 98.8 1E-09 6.2E-13 69.9 15.8 248 1-318 1-269 (335) 28 protein:vir:105334 Length: 276 98.8 8E-10 5E-13 70.5 15.2 224 1-318 1-239 (276) 29 protein:vir:93742 Length: 274 98.8 1.1E-09 6.8E-13 69.7 16.0 225 1-318 1-232 (274) 30 protein:vir:1239 Length: 274 # 98.7 2.9E-09 1.8E-12 67.4 16.2 224 1-318 1-232 (274) 31 protein:vir:105822 Length: 273 98.7 1.3E-08 8.1E-12 63.8 18.8 223 13-318 1-234 (273) 32 protein:vir:102605 Length: 273 98.7 1.3E-08 8.1E-12 63.8 18.8 223 13-318 1-234 (273) 33 protein:vir:99675 Length: 324 98.7 2.6E-09 1.6E-12 67.7 14.9 213 56-318 1-243 (324) 34 protein:vir:6324 Length: 335 # 98.7 4.3E-09 2.7E-12 66.4 15.6 255 1-318 1-290 (335) 35 protein:vir:100057 Length: 375 98.6 5.5E-09 3.4E-12 65.9 15.0 261 1-318 1-284 (375) 36 protein:vir:96833 Length: 275 98.6 9.8E-09 6.1E-12 64.5 15.4 223 1-318 3-240 (275) 37 protein:vir:95107 Length: 270 98.6 1E-08 6.5E-12 64.3 15.4 218 13-318 1-234 (270) 38 protein:vir:97031 Length: 402 98.5 1.5E-08 9.1E-12 63.5 15.7 248 1-318 1-259 (402) 39 protein:vir:3033 Length: 272 # 98.5 2.3E-08 1.4E-11 62.5 16.5 223 1-318 1-228 (272) 40 protein:vir:9820 Length: 272 # 98.5 2.3E-08 1.4E-11 62.5 16.5 223 1-318 1-228 (272) 41 protein:vir:80930 Length: 278 98.5 2.7E-08 1.6E-11 62.1 16.5 231 1-318 1-238 (278) 42 protein:vir:95875 Length: 401 98.4 1.9E-08 1.2E-11 62.9 13.2 271 1-318 1-331 (401) 43 protein:vir:103323 Length: 364 98.4 8.1E-08 5E-11 59.5 16.2 257 1-318 1-288 (364) 44 protein:vir:1583 Length: 351 # 98.4 4E-08 2.5E-11 61.1 14.1 235 1-318 1-252 (351) 45 protein:vir:105645 Length: 400 98.2 6E-07 3.7E-10 54.7 16.8 253 1-318 1-259 (400) 46 protein:vir:5974 Length: 324 # 98.1 1.3E-07 8.4E-11 58.3 12.4 234 1-318 1-250 (324) 47 protein:vir:99075 Length: 392 98.0 2E-06 1.2E-09 51.8 15.9 229 13-318 1-241 (392) 48 protein:vir:102944 Length: 330 98.0 4.8E-07 3E-10 55.2 12.3 239 1-318 1-257 (330) 49 protein:vir:79008 Length: 299 97.8 7.4E-06 4.6E-09 48.7 16.3 225 1-318 1-258 (299) 50 protein:vir:108303 Length: 418 97.7 7.3E-06 4.6E-09 48.7 15.4 221 20-318 1-229 (418) 51 protein:vir:7019 Length: 401 # 97.7 5.7E-06 3.6E-09 49.3 14.8 253 1-318 1-259 (401) 52 protein:vir:80446 Length: 367 97.5 1.4E-05 9E-09 47.1 13.6 256 4-318 1-277 (367) 53 protein:vir:107120 Length: 329 96.8 0.00033 2E-07 39.7 16.1 239 1-318 9-270 (329) 54 protein:vir:97331 Length: 319 96.6 0.00047 2.9E-07 38.8 15.4 239 1-318 5-259 (319) 55 protein:vir:94800 Length: 319 96.6 0.00047 2.9E-07 38.8 15.4 239 1-318 5-259 (319) 56 protein:vir:78387 Length: 349 96.0 0.00068 4.2E-07 37.9 12.6 245 1-318 1-264 (349) 57 protein:vir:94989 Length: 349 95.8 0.0011 7E-07 36.8 13.0 246 1-318 1-264 (349) 58 protein:vir:1781 Length: 221 # 95.5 0.00071 4.4E-07 37.8 10.7 161 106-318 1-180 (221) 59 protein:vir:102655 Length: 322 95.4 0.0022 1.3E-06 35.2 16.4 249 1-318 3-282 (322) 60 protein:vir:3525 Length: 423 # 94.4 0.0045 2.8E-06 33.5 16.6 226 22-318 1-255 (423) 61 protein:vir:78920 Length: 290 94.4 0.0046 2.9E-06 33.4 16.2 220 13-318 1-251 (290) 62 protein:vir:105374 Length: 423 94.1 0.0055 3.4E-06 33.0 16.9 226 22-318 1-255 (423) 63 protein:vir:3136 Length: 322 # 92.2 0.0042 2.6E-06 33.6 8.3 231 13-318 1-264 (322) 64 protein:vir:102335 Length: 312 89.2 0.028 1.7E-05 29.1 16.3 226 13-318 1-268 (312) 65 protein:vir:174 Length: 423 # 87.7 0.037 2.3E-05 28.5 16.7 225 22-318 1-255 (423) 66 protein:vir:79928 Length: 393 85.5 0.053 3.3E-05 27.6 9.8 264 1-318 31-344 (393) 67 protein:vir:105522 Length: 423 84.8 0.058 3.6E-05 27.4 16.6 224 13-318 1-240 (423) 68 protein:vir:105464 Length: 346 75.5 0.15 9E-05 25.2 16.0 225 1-318 1-259 (346) 69 protein:vir:9927 Length: 295 # 70.9 0.2 0.00013 24.4 12.3 211 13-318 1-229 (295) 70 protein:vir:79712 Length: 285 67.1 0.26 0.00016 23.8 14.4 222 1-318 1-244 (285) 71 protein:vir:105905 Length: 304 65.5 0.28 0.00017 23.6 12.7 225 1-318 9-256 (304) 72 protein:vir:94142 Length: 304 65.5 0.28 0.00017 23.6 12.7 225 1-318 9-256 (304) 73 protein:vir:41 Length: 299 # N 48.3 0.67 0.00042 21.5 14.8 219 21-318 1-251 (299) 74 protein:vir:95131 Length: 325 37.4 1.1 0.0007 20.3 13.8 238 15-318 1-269 (325) 75 protein:vir:78148 Length: 123 27.3 0.64 0.0004 21.7 2.8 63 247-318 1-88 (123) 76 protein:vir:106647 Length: 303 22.3 2.5 0.0015 18.4 15.1 218 13-318 1-238 (303) 77 protein:vir:9574 Length: 300 # 21.1 2.7 0.0017 18.3 13.7 228 1-318 1-279 (300) 78 protein:vir:9309 Length: 324 # 20.6 2.7 0.0017 18.2 14.4 243 1-318 1-268 (324) No 1 >protein:vir:2770 Length: 318 # NCBI annotation: hypothetical protein # Family: family:all:974 # MgeID: mge:59 # MgeName: Stx2 converting bacteriophage I # Cross-refs: genbank:acc:NP_612887;genbank:gi:20065804;genbank:GeneID:935710 Probab=100.00 E-value=3.7e-148 Score=828.88 Aligned_cols=318 Identities=100% Similarity=1.480 Sum_probs=316.8 Q ss_pred CCcCCccchhHHHHHHHHHHhcccchHHHHhhhhhhhhhhhhcccccccCCCCCccEEEeeccCCCCCcEEEEEEeeccc Q lcl|Aclame:pro 1 MTTVTSAQANKLFQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLS 80 (318) Q Consensus 1 ~t~~~~~~~~~~~a~~lft~~~~~~~~v~~ws~~l~~~~~~~~~~~~~~G~~~~s~I~~~~dL~k~~Gd~v~f~L~~~L~ 80 (318) |||+++++|++++++||||++++|+|+||+|+++|+++++++++|+++||+++++|||+++||+|++||+|+|+|++||+ T Consensus 1 mt~~~~~~~~~~~~~~~ft~~~~~~~~vk~ws~~l~~~~~~~~~~~~~~g~~~~~~I~r~~dL~K~~GD~Vtf~L~~~L~ 80 (318) T protein:vir:27 1 MTTVTSAQANKLFQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLS 80 (318) T ss_pred CCccCCCChHHHHHHHHHHHHhcCChHHHHHHHhhhhHHHhhhhhhcccCCCCCceEEEeccCCCCCccEEEEeEeeccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cCCeecCceeecchhhhhheeeEEEEecccccccccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhccccccc Q lcl|Aclame:pro 81 KRPTMGDERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGDFV 160 (318) Q Consensus 81 G~gv~Gd~~leGnee~L~~~sd~v~Idq~R~~V~~~g~ms~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~LsG~rg~~~ 160 (318) |+||+||+++|||||+|+|++|+|+|||.||+|++||+|+||||+||||++||+.|++||++++||++|+||||+||++. T Consensus 81 g~gv~Gd~~lEGnee~L~~~~d~l~IDq~r~~V~~gg~msqqRt~~dlR~~ar~~L~~w~~~~~Dq~~~v~laGarg~~~ 160 (318) T protein:vir:27 81 KRPTMGDERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGDFV 160 (318) T ss_pred cCccccCceeeccccceEEEeeEEEEeeeccccccccchhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccceeecccccccccccccccCCCCCCceEeecCCccccccccccccCHHHHHHHHHHHHhcCCCCceeEeccccccCC Q lcl|Aclame:pro 161 ADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHGE 240 (318) Q Consensus 161 n~~~~~p~~~~~~~~~~~~N~v~aPt~~r~~~~~~at~~~~i~a~D~~s~~~Id~a~~~a~~~~~pi~Pv~v~g~~~~~~ 240 (318) |++|++|+.+|++|+++++|+|+|||++||||+|++|++++|+++|+||+++||+++++++++++||+||+++|++++++ T Consensus 161 n~~~~~p~~~~~~~~~~~~N~v~aPt~~r~~~~g~at~~~~l~stD~~s~~lid~~~~~~~~~a~pi~PV~v~g~~~~~~ 240 (318) T protein:vir:27 161 ADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHGE 240 (318) T ss_pred cccceEecccCccchhhhhcccCCCCCCcEEeccCccchhhhhhcccccHHHHHHHHHHHHHhCCCCcceeeccccccCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cceEEEEecHHHHHHHHhCcchHHHHHHHHHHhhccccccCCcccCCeEEEcCEEEEecCceeEEEcCCCeeeeeeeC Q lcl|Aclame:pro 241 DPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGQRFWYQRIT 318 (318) Q Consensus 241 ~~~yV~~l~P~q~~dLr~d~~~~~w~~~qk~A~~r~~g~~nPlF~G~~gm~ngvii~e~~~~~iRf~ag~~v~~a~~~ 318 (318) +|+|||||||+|++|||+|+++++|+++||+|++|++|++||||+|++||||||||||||+||||||||++|+||||| T Consensus 241 ~~~yV~~~~p~q~~~Lrtdt~~~~w~d~q~~A~~r~~g~knPLF~G~~gm~ngvil~~~~~vpIrf~~G~~v~~~~~~ 318 (318) T protein:vir:27 241 DPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGQRFWYQRIT 318 (318) T ss_pred cceEEEEechHHHHHHhhcCCCHHHHHHHHHHHhcccccCCCceecceeeecCEEEeecCCccEEEcCCCeeeeeecC Confidence 999999999999999999999999999999999999999999999999999999999999999999999999999999 No 2 >protein:vir:819 Length: 404 # NCBI annotation: hypothetical protein # Family: family:all:974 # MgeID: mge:16 # MgeName: VT2-Sa # Cross-refs: genbank:acc:NP_050552;genbank:gi:9633449;genbank:GeneID:1262254 Probab=100.00 E-value=2.5e-139 Score=780.44 Aligned_cols=318 Identities=97% Similarity=1.425 Sum_probs=315.5 Q ss_pred CCcCCccchhHHHHHHHHHHhcccchHHHHhhhhhhhhhhhhcccccccCCCCCccEEEeeccCCCCCcEEEEEEeeccc Q lcl|Aclame:pro 1 MTTVTSAQANKLFQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLS 80 (318) Q Consensus 1 ~t~~~~~~~~~~~a~~lft~~~~~~~~v~~ws~~l~~~~~~~~~~~~~~G~~~~s~I~~~~dL~k~~Gd~v~f~L~~~L~ 80 (318) ||++.+++|+++|+|||||++++|+|++|+|+++++.++++++++++++|+++++|||+++||+|++||+|+|+|++||+ T Consensus 1 ~~~~~~~~a~~~~~~~lft~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~I~~~~dL~K~aGd~vtf~L~~~L~ 80 (404) T protein:vir:81 1 MTTVTSAQANKLYQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLS 80 (404) T ss_pred CCCcCCcchhhhHHHHHHHHHhcCChhHhhhhhhhhhhhhhccchhhccCCCCCccEEEeecCCCCCCcEEEEeEeeecc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cCCeecCceeecchhhhhheeeEEEEecccccccccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhccccccc Q lcl|Aclame:pro 81 KRPTMGDERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGDFV 160 (318) Q Consensus 81 G~gv~Gd~~leGnee~L~~~sd~v~Idq~R~~V~~~g~ms~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~LsG~rg~~~ 160 (318) |+||+||++||||||+|+|++|+|+|||.||+|+++|+|+||||+||||++||+.|++||++++||++|+||||+||++. T Consensus 81 g~gv~Gd~~lEGnee~L~~~s~~i~Idq~r~~V~~~g~msqQRt~~dlr~~ar~~L~~w~~~~~d~~~~~~laG~rg~~~ 160 (404) T protein:vir:81 81 KRPTMGDERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGDFV 160 (404) T ss_pred cCCcccCceeeccccceeEEeeEEEEeeecccccccCchhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccceeecccccccccccccccCCCCCCceEeecCCccccccccccccCHHHHHHHHHHHHhcCCCCceeEeccccccCC Q lcl|Aclame:pro 161 ADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHGE 240 (318) Q Consensus 161 n~~~~~p~~~~~~~~~~~~N~v~aPt~~r~~~~~~at~~~~i~a~D~~s~~~Id~a~~~a~~~~~pi~Pv~v~g~~~~~~ 240 (318) |++|++|+.+|++|+++++|+|+|||++||||+|+++++++|+++|+||+++||+++++|+++++||+||+++|++++++ T Consensus 161 n~~~~vp~~~~~~~~~~~~N~v~APt~~r~~~~g~at~~~~l~stD~~s~~~Id~~~~~~~~~~~pi~Pv~~~g~~~~~~ 240 (404) T protein:vir:81 161 ADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHGE 240 (404) T ss_pred cccceeeccccccccceeecccCCCCCCcEEeccCccchhhhhhcccccHHHHHHHHHHHHHhCCCCcceEeccccccCc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cceEEEEecHHHHHHHHhCcchHHHHHHHHHHhhccccccCCcccCCeEEEcCEEEEecCceeEEEcCCCeeeeeeeC Q lcl|Aclame:pro 241 DPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGQRFWYQRIT 318 (318) Q Consensus 241 ~~~yV~~l~P~q~~dLr~d~~~~~w~~~qk~A~~r~~g~~nPlF~G~~gm~ngvii~e~~~~~iRf~ag~~v~~a~~~ 318 (318) +|+|||||||+|++|||+|+++++|+++||+|++|++|++||||+|+|||||||||||||++|||||+|+++.+++-. T Consensus 241 ~~~yV~~~~p~q~~~Lr~dt~~~~w~d~q~~A~a~~rg~~nPlF~G~~gm~ngvii~~~~~~~Irf~~g~~~~~~~n~ 318 (404) T protein:vir:81 241 DPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKVLVSENN 318 (404) T ss_pred cceEEEEechHHHHHHhhCCCcHHHHHHHHHHhhccccccCCceecCeeEEcCEEEEecCCceeeecccceeeecCCc Confidence 999999999999999999999999999999999999999999999999999999999999999999999999988886 No 3 >protein:vir:104439 Length: 404 # NCBI annotation: putative virion structural protein # Family: family:all:974 # MgeID: mge:1471 # MgeName: 86 # Cross-refs: genbank:acc:YP_794063;genbank:gi:116222008;genbank:GeneID:4397504 Probab=100.00 E-value=2.5e-139 Score=780.44 Aligned_cols=318 Identities=97% Similarity=1.425 Sum_probs=315.5 Q ss_pred CCcCCccchhHHHHHHHHHHhcccchHHHHhhhhhhhhhhhhcccccccCCCCCccEEEeeccCCCCCcEEEEEEeeccc Q lcl|Aclame:pro 1 MTTVTSAQANKLFQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLS 80 (318) Q Consensus 1 ~t~~~~~~~~~~~a~~lft~~~~~~~~v~~ws~~l~~~~~~~~~~~~~~G~~~~s~I~~~~dL~k~~Gd~v~f~L~~~L~ 80 (318) ||++.+++|+++|+|||||++++|+|++|+|+++++.++++++++++++|+++++|||+++||+|++||+|+|+|++||+ T Consensus 1 ~~~~~~~~a~~~~~~~lft~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~I~~~~dL~K~aGd~vtf~L~~~L~ 80 (404) T protein:vir:10 1 MTTVTSAQANKLYQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLS 80 (404) T ss_pred CCCcCCcchhhhHHHHHHHHHhcCChhHhhhhhhhhhhhhhccchhhccCCCCCccEEEeecCCCCCCcEEEEeEeeecc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cCCeecCceeecchhhhhheeeEEEEecccccccccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhccccccc Q lcl|Aclame:pro 81 KRPTMGDERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGDFV 160 (318) Q Consensus 81 G~gv~Gd~~leGnee~L~~~sd~v~Idq~R~~V~~~g~ms~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~LsG~rg~~~ 160 (318) |+||+||++||||||+|+|++|+|+|||.||+|+++|+|+||||+||||++||+.|++||++++||++|+||||+||++. T Consensus 81 g~gv~Gd~~lEGnee~L~~~s~~i~Idq~r~~V~~~g~msqQRt~~dlr~~ar~~L~~w~~~~~d~~~~~~laG~rg~~~ 160 (404) T protein:vir:10 81 KRPTMGDERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGDFV 160 (404) T ss_pred cCCcccCceeeccccceeEEeeEEEEeeecccccccCchhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccceeecccccccccccccccCCCCCCceEeecCCccccccccccccCHHHHHHHHHHHHhcCCCCceeEeccccccCC Q lcl|Aclame:pro 161 ADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHGE 240 (318) Q Consensus 161 n~~~~~p~~~~~~~~~~~~N~v~aPt~~r~~~~~~at~~~~i~a~D~~s~~~Id~a~~~a~~~~~pi~Pv~v~g~~~~~~ 240 (318) |++|++|+.+|++|+++++|+|+|||++||||+|+++++++|+++|+||+++||+++++|+++++||+||+++|++++++ T Consensus 161 n~~~~vp~~~~~~~~~~~~N~v~APt~~r~~~~g~at~~~~l~stD~~s~~~Id~~~~~~~~~~~pi~Pv~~~g~~~~~~ 240 (404) T protein:vir:10 161 ADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHGE 240 (404) T ss_pred cccceeeccccccccceeecccCCCCCCcEEeccCccchhhhhhcccccHHHHHHHHHHHHHhCCCCcceEeccccccCc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cceEEEEecHHHHHHHHhCcchHHHHHHHHHHhhccccccCCcccCCeEEEcCEEEEecCceeEEEcCCCeeeeeeeC Q lcl|Aclame:pro 241 DPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGQRFWYQRIT 318 (318) Q Consensus 241 ~~~yV~~l~P~q~~dLr~d~~~~~w~~~qk~A~~r~~g~~nPlF~G~~gm~ngvii~e~~~~~iRf~ag~~v~~a~~~ 318 (318) +|+|||||||+|++|||+|+++++|+++||+|++|++|++||||+|+|||||||||||||++|||||+|+++.+++-. T Consensus 241 ~~~yV~~~~p~q~~~Lr~dt~~~~w~d~q~~A~a~~rg~~nPlF~G~~gm~ngvii~~~~~~~Irf~~g~~~~~~~n~ 318 (404) T protein:vir:10 241 DPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKVLVSENN 318 (404) T ss_pred cceEEEEechHHHHHHhhCCCcHHHHHHHHHHhhccccccCCceecCeeEEcCEEEEecCCceeeecccceeeecCCc Confidence 999999999999999999999999999999999999999999999999999999999999999999999999988886 No 4 >protein:vir:10123 Length: 404 # NCBI annotation: hypothetical protein # Family: family:all:974 # MgeID: mge:180 # MgeName: Stx2 converting bacteriophage II # Cross-refs: genbank:acc:NP_859253;genbank:gi:32171009;genbank:GeneID:2653345 Probab=100.00 E-value=2.5e-139 Score=780.44 Aligned_cols=318 Identities=97% Similarity=1.425 Sum_probs=315.5 Q ss_pred CCcCCccchhHHHHHHHHHHhcccchHHHHhhhhhhhhhhhhcccccccCCCCCccEEEeeccCCCCCcEEEEEEeeccc Q lcl|Aclame:pro 1 MTTVTSAQANKLFQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLS 80 (318) Q Consensus 1 ~t~~~~~~~~~~~a~~lft~~~~~~~~v~~ws~~l~~~~~~~~~~~~~~G~~~~s~I~~~~dL~k~~Gd~v~f~L~~~L~ 80 (318) ||++.+++|+++|+|||||++++|+|++|+|+++++.++++++++++++|+++++|||+++||+|++||+|+|+|++||+ T Consensus 1 ~~~~~~~~a~~~~~~~lft~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~I~~~~dL~K~aGd~vtf~L~~~L~ 80 (404) T protein:vir:10 1 MTTVTSAQANKLYQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLS 80 (404) T ss_pred CCCcCCcchhhhHHHHHHHHHhcCChhHhhhhhhhhhhhhhccchhhccCCCCCccEEEeecCCCCCCcEEEEeEeeecc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cCCeecCceeecchhhhhheeeEEEEecccccccccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhccccccc Q lcl|Aclame:pro 81 KRPTMGDERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGDFV 160 (318) Q Consensus 81 G~gv~Gd~~leGnee~L~~~sd~v~Idq~R~~V~~~g~ms~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~LsG~rg~~~ 160 (318) |+||+||++||||||+|+|++|+|+|||.||+|+++|+|+||||+||||++||+.|++||++++||++|+||||+||++. T Consensus 81 g~gv~Gd~~lEGnee~L~~~s~~i~Idq~r~~V~~~g~msqQRt~~dlr~~ar~~L~~w~~~~~d~~~~~~laG~rg~~~ 160 (404) T protein:vir:10 81 KRPTMGDERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGDFV 160 (404) T ss_pred cCCcccCceeeccccceeEEeeEEEEeeecccccccCchhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccceeecccccccccccccccCCCCCCceEeecCCccccccccccccCHHHHHHHHHHHHhcCCCCceeEeccccccCC Q lcl|Aclame:pro 161 ADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHGE 240 (318) Q Consensus 161 n~~~~~p~~~~~~~~~~~~N~v~aPt~~r~~~~~~at~~~~i~a~D~~s~~~Id~a~~~a~~~~~pi~Pv~v~g~~~~~~ 240 (318) |++|++|+.+|++|+++++|+|+|||++||||+|+++++++|+++|+||+++||+++++|+++++||+||+++|++++++ T Consensus 161 n~~~~vp~~~~~~~~~~~~N~v~APt~~r~~~~g~at~~~~l~stD~~s~~~Id~~~~~~~~~~~pi~Pv~~~g~~~~~~ 240 (404) T protein:vir:10 161 ADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHGE 240 (404) T ss_pred cccceeeccccccccceeecccCCCCCCcEEeccCccchhhhhhcccccHHHHHHHHHHHHHhCCCCcceEeccccccCc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cceEEEEecHHHHHHHHhCcchHHHHHHHHHHhhccccccCCcccCCeEEEcCEEEEecCceeEEEcCCCeeeeeeeC Q lcl|Aclame:pro 241 DPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGQRFWYQRIT 318 (318) Q Consensus 241 ~~~yV~~l~P~q~~dLr~d~~~~~w~~~qk~A~~r~~g~~nPlF~G~~gm~ngvii~e~~~~~iRf~ag~~v~~a~~~ 318 (318) +|+|||||||+|++|||+|+++++|+++||+|++|++|++||||+|+|||||||||||||++|||||+|+++.+++-. T Consensus 241 ~~~yV~~~~p~q~~~Lr~dt~~~~w~d~q~~A~a~~rg~~nPlF~G~~gm~ngvii~~~~~~~Irf~~g~~~~~~~n~ 318 (404) T protein:vir:10 241 DPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKVLVSENN 318 (404) T ss_pred cceEEEEechHHHHHHhhCCCcHHHHHHHHHHhhccccccCCceecCeeEEcCEEEEecCCceeeecccceeeecCCc Confidence 999999999999999999999999999999999999999999999999999999999999999999999999988886 No 5 >protein:vir:3298 Length: 404 # NCBI annotation: hypothetical protein # Family: family:all:974 # MgeID: mge:66 # MgeName: 933W # Cross-refs: genbank:acc:NP_049514;genbank:gi:9632520;genbank:GeneID:1262006 Probab=100.00 E-value=2.5e-139 Score=780.44 Aligned_cols=318 Identities=97% Similarity=1.425 Sum_probs=315.5 Q ss_pred CCcCCccchhHHHHHHHHHHhcccchHHHHhhhhhhhhhhhhcccccccCCCCCccEEEeeccCCCCCcEEEEEEeeccc Q lcl|Aclame:pro 1 MTTVTSAQANKLFQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLS 80 (318) Q Consensus 1 ~t~~~~~~~~~~~a~~lft~~~~~~~~v~~ws~~l~~~~~~~~~~~~~~G~~~~s~I~~~~dL~k~~Gd~v~f~L~~~L~ 80 (318) ||++.+++|+++|+|||||++++|+|++|+|+++++.++++++++++++|+++++|||+++||+|++||+|+|+|++||+ T Consensus 1 ~~~~~~~~a~~~~~~~lft~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~I~~~~dL~K~aGd~vtf~L~~~L~ 80 (404) T protein:vir:32 1 MTTVTSAQANKLYQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLS 80 (404) T ss_pred CCCcCCcchhhhHHHHHHHHHhcCChhHhhhhhhhhhhhhhccchhhccCCCCCccEEEeecCCCCCCcEEEEeEeeecc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cCCeecCceeecchhhhhheeeEEEEecccccccccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhccccccc Q lcl|Aclame:pro 81 KRPTMGDERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGDFV 160 (318) Q Consensus 81 G~gv~Gd~~leGnee~L~~~sd~v~Idq~R~~V~~~g~ms~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~LsG~rg~~~ 160 (318) |+||+||++||||||+|+|++|+|+|||.||+|+++|+|+||||+||||++||+.|++||++++||++|+||||+||++. T Consensus 81 g~gv~Gd~~lEGnee~L~~~s~~i~Idq~r~~V~~~g~msqQRt~~dlr~~ar~~L~~w~~~~~d~~~~~~laG~rg~~~ 160 (404) T protein:vir:32 81 KRPTMGDERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGDFV 160 (404) T ss_pred cCCcccCceeeccccceeEEeeEEEEeeecccccccCchhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccceeecccccccccccccccCCCCCCceEeecCCccccccccccccCHHHHHHHHHHHHhcCCCCceeEeccccccCC Q lcl|Aclame:pro 161 ADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHGE 240 (318) Q Consensus 161 n~~~~~p~~~~~~~~~~~~N~v~aPt~~r~~~~~~at~~~~i~a~D~~s~~~Id~a~~~a~~~~~pi~Pv~v~g~~~~~~ 240 (318) |++|++|+.+|++|+++++|+|+|||++||||+|+++++++|+++|+||+++||+++++|+++++||+||+++|++++++ T Consensus 161 n~~~~vp~~~~~~~~~~~~N~v~APt~~r~~~~g~at~~~~l~stD~~s~~~Id~~~~~~~~~~~pi~Pv~~~g~~~~~~ 240 (404) T protein:vir:32 161 ADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHGE 240 (404) T ss_pred cccceeeccccccccceeecccCCCCCCcEEeccCccchhhhhhcccccHHHHHHHHHHHHHhCCCCcceEeccccccCc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cceEEEEecHHHHHHHHhCcchHHHHHHHHHHhhccccccCCcccCCeEEEcCEEEEecCceeEEEcCCCeeeeeeeC Q lcl|Aclame:pro 241 DPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGQRFWYQRIT 318 (318) Q Consensus 241 ~~~yV~~l~P~q~~dLr~d~~~~~w~~~qk~A~~r~~g~~nPlF~G~~gm~ngvii~e~~~~~iRf~ag~~v~~a~~~ 318 (318) +|+|||||||+|++|||+|+++++|+++||+|++|++|++||||+|+|||||||||||||++|||||+|+++.+++-. T Consensus 241 ~~~yV~~~~p~q~~~Lr~dt~~~~w~d~q~~A~a~~rg~~nPlF~G~~gm~ngvii~~~~~~~Irf~~g~~~~~~~n~ 318 (404) T protein:vir:32 241 DPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKVLVSENN 318 (404) T ss_pred cceEEEEechHHHHHHhhCCCcHHHHHHHHHHhhccccccCCceecCeeEEcCEEEEecCCceeeecccceeeecCCc Confidence 999999999999999999999999999999999999999999999999999999999999999999999999988886 No 6 >protein:vir:105610 Length: 430 # NCBI annotation: virion structural protein # Family: family:all:974 # MgeID: mge:1540 # MgeName: F116 # Cross-refs: genbank:acc:YP_164307;genbank:gi:56692923;genbank:GeneID:3197221 Probab=100.00 E-value=1.5e-123 Score=694.00 Aligned_cols=314 Identities=32% Similarity=0.543 Sum_probs=290.8 Q ss_pred CCcCCcc--chhHHHHHHHHHHhcccchHHHHhhhhhhhhhhhhcccccccCCCCCccEEEeeccCCCCCcEEEEEEeec Q lcl|Aclame:pro 1 MTTVTSA--QANKLFQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHK 78 (318) Q Consensus 1 ~t~~~~~--~~~~~~a~~lft~~~~~~~~v~~ws~~l~~~~~~~~~~~~~~G~~~~s~I~~~~dL~k~~Gd~v~f~L~~~ 78 (318) -|+++++ +|++.||++||+++.+++++++++.++.+...... ++....+++.++|||+++||+|++||+|+|+|++| T Consensus 5 ~T~~~~~~p~a~~~ws~~l~~~~~k~~~~~~kl~G~~~~~~~~~-~~~~~~~ts~~~pI~r~~dL~K~~GD~Vtf~L~~~ 83 (430) T protein:vir:10 5 KTTMRYGDPNAMIQQAAGLFALCQGRNSTLNRLTGKMPSGTSDA-EKKTKGQSSLELPIVQAQDLGRNKGDEVRFHFVQP 83 (430) T ss_pred eeecccCChhHHHHHHHHHHHHHhhhhhhHHHhhccccccccch-hhhccCCCCCCccEEEeccCCCCCccEEEEeEeec Confidence 4567776 56667999999999999999999999877766654 35667799999999999999999999999999999 Q ss_pred cccCCeecCceeecchhhhhheeeEEEEecccccccccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhccccc Q lcl|Aclame:pro 79 LSKRPTMGDERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGD 158 (318) Q Consensus 79 L~G~gv~Gd~~leGnee~L~~~sd~v~Idq~R~~V~~~g~ms~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~LsG~rg~ 158 (318) |+|+||+||++||||||+|+|++|+|+|||.||+|++||+|+||||+||||++||+.|++||++++||++|+||||+||+ T Consensus 84 L~g~gv~Gd~~lEGnee~L~~~~d~l~IDq~R~~V~~gg~msqQRt~~dlR~~ar~~L~~w~~~~~Dq~~~v~laGarg~ 163 (430) T protein:vir:10 84 ANAFPIMGSEYAEGKGTGLKIGSDQLRVNQARFPVDLGDVMSQIRNPYDLRRLGRPKAKWFMDAYLDQSMLVHLAGARGN 163 (430) T ss_pred cccCceecCceeeccccceEEEeeEEEEeeeccccccCCchhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccccceeecccccccccccccccCCCCCCceEeecCCc-c-------ccccccccccCHHHHHHHHHHHHhcCCCCcee Q lcl|Aclame:pro 159 FVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDAT-S-------FEQIEAADIFSIGLVDNLSLFIDEMAHPLQPV 230 (318) Q Consensus 159 ~~n~~~~~p~~~~~~~~~~~~N~v~aPt~~r~~~~~~at-~-------~~~i~a~D~~s~~~Id~a~~~a~~~~~pi~Pv 230 (318) +.|++|++|+.+|+.|+.+++|+|+|||+||||++++.+ + +.+|+++|+|++++||+|+++|+++++||+|| T Consensus 164 ~~~~~~~~~~~~~~~~~~~~~N~v~aPt~nrh~~~~G~at~~~~~~~~~~sl~stD~~s~~~id~a~~~a~~~~~~i~Pv 243 (430) T protein:vir:10 164 HYNKEWCLPLETHPKLADMLVNRVKAPTKNRHFVASADAITGVAPNAGEYNITTADVLDVDVVDSIATYMDQIELPPPPV 243 (430) T ss_pred cccccccccccCCcchhhhhccccCCCCCceeEeecccccccccccccccchhhhcccCHHHHHHHHHHHHhhCCCCcce Confidence 999999999999999999999999999999999977653 3 56799999999999999999999999999999 Q ss_pred EeccccccCCcceEEEEecHHHHHHHHhCcchHHHHHHHHHHhhccccccCCcccCCeEEEcCEEEEecCceeEEEcCCC Q lcl|Aclame:pro 231 RLSGDELHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGQ 310 (318) Q Consensus 231 ~v~g~~~~~~~~~yV~~l~P~q~~dLr~d~~~~~w~~~qk~A~~r~~g~~nPlF~G~~gm~ngvii~e~~~~~iRf~ag~ 310 (318) +|+|+++++++|+|||||||+|++|||+|+++++| ++|+.+.+ ++|++||||+|++|||||||||||| +|||||+|+ T Consensus 244 ~v~gd~~~g~~~~yV~~~~p~q~~~Lr~dt~~~~w-q~~~~a~a-~~g~~nPlF~G~~gm~ngvii~~~~-~virf~~g~ 320 (430) T protein:vir:10 244 KFEGDEAAEDSPIRVLLCSPAQYNSFAKQEKFRSW-QAAALARA-SNAKQHPIFRVDAGLWSNTLIIKMP-KPIRFYAGD 320 (430) T ss_pred EeecccccCCccEEEEEechHHHHHHhhCcchHHH-HHHHHHhh-cccccCCceecceeeecCeEEecCC-ceeeecCCC Confidence 99999999999999999999999999999999999 67776655 4699999999999999999999999 679999998 Q ss_pred eeeeeeeC Q lcl|Aclame:pro 311 RFWYQRIT 318 (318) Q Consensus 311 ~v~~a~~~ 318 (318) .+++..-- T Consensus 321 ~~~~~a~~ 328 (430) T protein:vir:10 321 TIKYCAAY 328 (430) T ss_pred ccccccCC Confidence 77664421 No 7 >protein:vir:93696 Length: 364 # NCBI annotation: Bcep22gp55 # Family: family:all:974 # MgeID: mge:1470 # MgeName: Bcep22 # Cross-refs: genbank:acc:NP_944284;genbank:gi:38640361;genbank:GeneID:2658350 Probab=100.00 E-value=4.4e-113 Score=636.56 Aligned_cols=288 Identities=32% Similarity=0.441 Sum_probs=267.6 Q ss_pred HHHHHHHHhcccchH-HHHhhhhhhhhhhhhccccc-ccCCCCCccEEEeeccCCCCCcEEEEEEeeccccCCeecCcee Q lcl|Aclame:pro 13 FQVALFTAANRNRSM-VNILTEQQEAPKAVSPDKKS-TKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLSKRPTMGDERV 90 (318) Q Consensus 13 ~a~~lft~~~~~~~~-v~~ws~~l~~~~~~~~~~~~-~~G~~~~s~I~~~~dL~k~~Gd~v~f~L~~~L~G~gv~Gd~~l 90 (318) ||. |.++.++|. +++|+++|+.++++.++|.+ |||+++++|||+++||+|++||+|+|+|++||+|+||+||++| T Consensus 1 Ma~---T~~~~~~p~a~~~ws~~l~~~~~~~s~f~~~l~G~~~~~~I~~~~dL~k~~Gd~v~f~L~~~L~g~gv~Gd~~l 77 (364) T protein:vir:93 1 MSQ---TVIPFGDPKAVKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDRITFDLSVHLRGKPTYGDARV 77 (364) T ss_pred Cce---eccCcCCHHHHHHHHHHHHHHHHhhCccccccccCCCCCcEEEeeecCCCCCceEEeeeeeecccCCcccCcee Confidence 554 444558884 88899999999988888875 9999999999999999999999999999999999999999999 Q ss_pred ecchhhhhheeeEEEEecccccccccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccceeeccc Q lcl|Aclame:pro 91 EGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGDFVADDTILPTAE 170 (318) Q Consensus 91 eGnee~L~~~sd~v~Idq~R~~V~~~g~ms~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~LsG~rg~~~n~~~~~p~~~ 170 (318) |||||+|+|++|+|+|||.||+|+++|+|+||||+||||++||++|++||++++|+++|+||||+||.+ +|... T Consensus 78 eGnee~L~~~~~~i~idq~r~~V~~~g~ms~qRt~~dlr~~ar~~L~~w~~~~~d~~~f~~laGarg~~------~~~~~ 151 (364) T protein:vir:93 78 EGKEESLRFYQDEVRIDQVRHSVSAGGRMSRKRTVHNIRRIARDRLGDYFYKFTDELLFIYLSGARGIN------LDFIE 151 (364) T ss_pred eccccceeEEeeEEEEeeccccccccCchhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc------ccccc Confidence 999999999999999999999999999999999999999999999999999999999999999999943 66778 Q ss_pred ccccccccccccCCCCCCceEeecCCccccccccccccCHHHHHHHHHHHHhc------CCCCceeEeccccccCCcceE Q lcl|Aclame:pro 171 HPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEM------AHPLQPVRLSGDELHGEDPYY 244 (318) Q Consensus 171 ~~~~~~~~~N~v~aPt~~r~~~~~~at~~~~i~a~D~~s~~~Id~a~~~a~~~------~~pi~Pv~v~g~~~~~~~~~y 244 (318) ++.|..+++|+|+|||++||||+++++++++|+++|+||+++||+|+++|+++ ++||+||+++|++ +| T Consensus 152 ~~~~~~~~~N~v~aPt~~r~~~~~~at~~~~l~stD~~sl~~id~a~~~a~~~~~~~~~~~~~~Pv~~~g~~------~y 225 (364) T protein:vir:93 152 TPDFTGYAGNPLDAPDVDHLLYGGVATSKASLAATDIMAPLVIEKAVEKAAMMQAENPDVANMVPVSIDGDD------HY 225 (364) T ss_pred ccCcccccccccCCCCCCcEEeccccCchhhccccccccHHHHHHHHHHHHHhCCCCCCCcccceeEecCcc------ee Confidence 99999999999999999999999999999999999999999999999999998 4679999999987 69 Q ss_pred EEEecHHHHHHHHhCcchHHHHHHHHHHhhccccccCCcccCCeEEEcCEEEEecCceeEEEc---CCCeeeeeeeC Q lcl|Aclame:pro 245 VLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFY---QGQRFWYQRIT 318 (318) Q Consensus 245 V~~l~P~q~~dLr~d~~~~~w~~~qk~A~~r~~g~~nPlF~G~~gm~ngvii~e~~~~~iRf~---ag~~v~~a~~~ 318 (318) ||||||+|++|||++++ ++|+|+||+|++ ++|++||||+|++|||||||||||+++ |||+ ++.+|.++|-. T Consensus 226 V~~l~p~q~~~Lr~~t~-~~w~d~qk~A~~-~~g~~nPlF~G~~gm~ngvii~~~~~v-i~~~~~~~~~~v~~~ral 299 (364) T protein:vir:93 226 VCVMSEYQATDMRTAAG-GTWIDFQKAAAA-AEGRNNPIFKGGLGMINNVVLHKHRNV-IRFNDYGAGANVEAARAL 299 (364) T ss_pred EEEEcchhhhhhhhcCC-HHHHHHHHHhhh-cccccCCceecCeeeEcCeEEeccCCc-ccccccccCccccchhhh Confidence 99999999999999886 689999999854 579999999999999999999999976 9997 77788777765 No 8 >protein:vir:80180 Length: 381 # NCBI annotation: capsid protein # Family: family:all:2203 # MgeID: mge:1878 # MgeName: Pf-WMP3 # Cross-refs: genbank:acc:YP_001285797;genbank:gi:148747831;genbank:GeneID:5220456 Probab=99.22 E-value=5.8e-13 Score=87.68 Aligned_cols=251 Identities=12% Similarity=0.083 Sum_probs=149.9 Q ss_pred CCcCCccchhHHHHHHHHHHhcccch----HH-HHhhhhhhhhhhhhcccccccCCCCCccEEEeeccCCCCCcEEEEEE Q lcl|Aclame:pro 1 MTTVTSAQANKLFQVALFTAANRNRS----MV-NILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSI 75 (318) Q Consensus 1 ~t~~~~~~~~~~~a~~lft~~~~~~~----~v-~~ws~~l~~~~~~~~~~~~~~G~~~~s~I~~~~dL~k~~Gd~v~f~L 75 (318) |-||- .+ +-|-....+.+ ++ +.|++.+...-.+..-+ ..+....+++...||+|+|+- T Consensus 1 ~~~~~---~~-----~~~~~~~~~~t~~~~fiPev~s~~v~~~l~~~lv~---------~~l~~~~~~~~~~GdTV~ip~ 63 (381) T protein:vir:80 1 MATIQ---GT-----GGYKGSAVDLSNVQVFIPEVWSSEVRMFRDQKFAA---------LEATKKIPFEGKKGDLIHIPN 63 (381) T ss_pred Cceec---cc-----ccccCcccchhhHHhhhhHHHHHHHHHHHHHhhhh---------hhccccccceeecCceEEeec Confidence 66554 12 22222222211 22 56777653333222222 122333567667899999987 Q ss_pred eeccccCCeecCceeecchhhhhheeeEEEEeccccc-ccccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhc Q lcl|Aclame:pro 76 MHKLSKRPTMGDERVEGRGEDLSHADFSLKINQGRHL-VDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAG 154 (318) Q Consensus 76 ~~~L~G~gv~Gd~~leGnee~L~~~sd~v~Idq~R~~-V~~~g~ms~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~LsG 154 (318) ....+-.....+..+ ..+++...+.+|.||+.+.. +.+. .+++..+..|+|.+....+...+++..|+.++-.++. T Consensus 64 ~g~~~a~d~~~g~~i--~~~~~~~~~~~itID~~~~~~~~Id-d~D~~~~~~D~~~~~~~~~~~aLA~~~D~~i~~~~~~ 140 (381) T protein:vir:80 64 ISRAAVYDKQPQTPV--NLQARTDSEFTFTVTKYKESSFMIE-DIVNTQASYTLRQYYTKEAGYALARDMDNFALAHRAV 140 (381) T ss_pred cCcceeeeecCCCcc--cccccCCceEEEEEeeeeecceeec-hHHHHhhccChHHHHHHHHHHHHHHHHHHHHHHHHhh Confidence 655543334433333 34567778889999998754 4443 7788899999999999999999999999999877764 Q ss_pred cccccccccceeecccccccccccccccCCCCCCceEeecCCccccccc-cccccCHHHHHHHHHHHHhcCCCCceeEec Q lcl|Aclame:pro 155 ARGDFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIE-AADIFSIGLVDNLSLFIDEMAHPLQPVRLS 233 (318) Q Consensus 155 ~rg~~~n~~~~~p~~~~~~~~~~~~N~v~aPt~~r~~~~~~at~~~~i~-a~D~~s~~~Id~a~~~a~~~~~pi~Pv~v~ 233 (318) ......... ...++. -.+++....++ .+..++++.|..|.+..++..-| .+ T Consensus 141 ~~~~~~~~~------------------~t~~~~-----i~~~~~~~~~t~~~~~~t~~~i~~a~~~Lde~~VP-----~e 192 (381) T protein:vir:80 141 INAFPSQRI------------------YSYDTT-----LGDGTVNAHLTGTPAPLTYAALLLAKQKLDEADVP-----QE 192 (381) T ss_pred ccccccccc------------------cccccc-----ccccccccccccchhhHHHHHHHHHHHHHhhcCCC-----cC Confidence 443111100 011100 00111122233 23457788888999988886544 11 Q ss_pred cccccCCcceEEEEecHHHHHHHHhCcchHHHHHHHHHHhhccccccCCcccCCeEEEcCEEEEecCceeEEE------c Q lcl|Aclame:pro 234 GDELHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRF------Y 307 (318) Q Consensus 234 g~~~~~~~~~yV~~l~P~q~~dLr~d~~~~~w~~~qk~A~~r~~g~~nPlF~G~~gm~ngvii~e~~~~~iRf------~ 307 (318) | ++++++|.++.+|++++.+ .+. .. +..+.|..|.+|+|.|+-|++.+++|.-. . T Consensus 193 g---------R~lvv~P~~~~~Ll~~~~~---~~a----d~---~~~~~l~~G~Ig~i~G~~Vv~Sn~lp~~~~t~~~~~ 253 (381) T protein:vir:80 193 G---------RIVMVSPAQYIDLLSINQF---ISV----DF---SQVKPVTSGVVGTILGMEVIVTTQIGINSLTGYVNG 253 (381) T ss_pred C---------cEEEeCHHHHHHHhhchhh---hhh----hh---ccchhhhceeeeEEcceEEEeecccccccccceeee Confidence 1 5788999999999999753 321 11 33467999999999999999998887621 1 Q ss_pred CCCeeeeeeeC Q lcl|Aclame:pro 308 QGQRFWYQRIT 318 (318) Q Consensus 308 ag~~v~~a~~~ 318 (318) +|......-.+ T Consensus 254 agap~~~~~~~ 264 (381) T protein:vir:80 254 QGAPTQPTPGV 264 (381) T ss_pred ccccccccccc Confidence 22111111110 No 9 >protein:vir:1541 Length: 347 # NCBI annotation: major capsid protein 10A # Family: family:all:975 # MgeID: mge:31 # MgeName: phiYeO3-12 # Cross-refs: genbank:acc:NP_052109;swissprot:trembl:q9t107;genbank:gi:9634035;uniprot:Q9T107;genbank:GeneID:1262383 Probab=99.22 E-value=5.4e-13 Score=87.84 Aligned_cols=263 Identities=11% Similarity=0.081 Sum_probs=163.0 Q ss_pred CCcCCccchhH-HHHHHHHHHhcccchHHHHhhhhhhhhhhhhcccccccCCCCCccEEEeeccCCCCCcEEEEEEeecc Q lcl|Aclame:pro 1 MTTVTSAQANK-LFQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKL 79 (318) Q Consensus 1 ~t~~~~~~~~~-~~a~~lft~~~~~~~~v~~ws~~l~~~~~~~~~~~~~~G~~~~s~I~~~~dL~k~~Gd~v~f~L~~~L 79 (318) |-++.++.... ....+- +...+-+.+++.|++.+...-.+.+.+..+ ++..++. .|++|.|+-+... T Consensus 1 ma~~~~~~~~~t~~~~~~-~~~~~~a~~ie~f~g~V~~~f~~~s~~~~~---------~~~~~~~--~G~sv~i~~ig~~ 68 (347) T protein:vir:15 1 MANIQGGQQIGTNQGKGQ-SAADKLALFLKVFGGEVLTAFARTSVTMPR---------HMLRSIA--SGKSAQFPVIGRT 68 (347) T ss_pred CCccccCCccccccccCC-CcchHHHHHHHHHHHHHHHHHHHhhhhhhc---------ccccccc--ccceeEeeeccce Confidence 88888876553 222222 122333456999999987766665554333 2333443 4999999999999 Q ss_pred ccCCeecCceeecchhhhhheeeEEEEecc---cccccccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhccc Q lcl|Aclame:pro 80 SKRPTMGDERVEGRGEDLSHADFSLKINQG---RHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGAR 156 (318) Q Consensus 80 ~G~gv~Gd~~leGnee~L~~~sd~v~Idq~---R~~V~~~g~ms~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~LsG~r 156 (318) +....+..+.+.++.++.+....+|.||+. ++.|+ .+++..+.+|+|.+.-.....-+++..|+.++.+|.++. T Consensus 69 t~~~~~~g~~l~~~~~~~~~~e~~ltID~~~~~~~~Vd---dlD~~q~~~D~~~~~~~~~g~aLA~~~D~~i~~~l~~~~ 145 (347) T protein:vir:15 69 KAAYLKPGENLDDKRKDIKHTEKVIHIDGLLTADVLIY---DIEDAMNHYDVRAEYTAQLGESLAMAADGAVLAELAGLV 145 (347) T ss_pred eeeeeccCCCCCCCCCCCccceEEEEechhhhhhHHhh---hHHHHhcCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 988888888899998899999999999988 55664 678888999999999999999999999999999997654 Q ss_pred cccccccceeecccccccccccccccCCCCCCceEeecCCccccccccccccCHHHHHHHHHHHHhcCCCCceeEecccc Q lcl|Aclame:pro 157 GDFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDE 236 (318) Q Consensus 157 g~~~n~~~~~p~~~~~~~~~~~~N~v~aPt~~r~~~~~~at~~~~i~a~D~~s~~~Id~a~~~a~~~~~pi~Pv~v~g~~ 236 (318) .... ....+. ..+.. ..+.+ +..+.+........+.+.+ ++.|..|.+...+..-| .++ T Consensus 146 ~~~~--~~~~~~-~~~g~-----~~~~~-----~~~~~~~~~~~~~~~~~~i-~d~~~~a~~~Lde~~VP-------~~g 204 (347) T protein:vir:15 146 NLPD--ASNENI-EGLGK-----PTVLT-----LVKPTTGDLTDPVELGKAI-IAQLTIARASLTKNYVP-------AAD 204 (347) T ss_pred hccc--cccccc-cccCc-----ccccc-----ccccccccchhhhhHHHHH-HHHHHHHHHHHhhcCCC-------ccC Confidence 3110 000000 00000 00000 0000000000001111222 55666666677665443 111 Q ss_pred ccCCcceEEEEecHHHHHHHHhCcchHHHHHHHHHHhhccccccCCcccCCeEEEcCEEEEecCceeEEEcCCCeeeeee Q lcl|Aclame:pro 237 LHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGQRFWYQR 316 (318) Q Consensus 237 ~~~~~~~yV~~l~P~q~~dLr~d~~~~~w~~~qk~A~~r~~g~~nPlF~G~~gm~ngvii~e~~~~~iRf~ag~~v~~a~ 316 (318) ++++++|.+|..|..++.+-. +.- +....+-+|.+|.++|+-|.+.+++|. +++.+.+... T Consensus 205 -------R~~vv~P~~y~~LL~~~~~~~-------~d~---~~~~~~~~G~Vg~i~G~~V~~Sn~lp~--~~~t~~~~~~ 265 (347) T protein:vir:15 205 -------RTFYTTPDNYSAILAALMPNA-------ANY---QALIDHERGTIRNVMGFEVVEVPHLTA--GGAGDTREDA 265 (347) T ss_pred -------CEEEeCHHHHHHHhccccccc-------ccc---cccccccceEEEEEeceEEEecccccc--cccccccccc Confidence 579999999999999986421 111 122457789999999999999887763 2222211111 Q ss_pred eC Q lcl|Aclame:pro 317 IT 318 (318) Q Consensus 317 ~~ 318 (318) .+ T Consensus 266 ~~ 267 (347) T protein:vir:15 266 PA 267 (347) T ss_pred cc Confidence 11 No 10 >protein:vir:8885 Length: 347 # NCBI annotation: major capsid protein A # Family: family:all:975 # MgeID: mge:161 # MgeName: gh-1 # Cross-refs: genbank:acc:NP_813774;genbank:gi:29366729;genbank:GeneID:1258837 Probab=99.18 E-value=1.8e-12 Score=84.99 Aligned_cols=260 Identities=10% Similarity=0.078 Sum_probs=157.4 Q ss_pred CCcCCccchhHH-HHHHHHHHhcccchHHHHhhhhhhhhhhhhcccccccCCCCCccEEEeeccCCCCCcEEEEEEeecc Q lcl|Aclame:pro 1 MTTVTSAQANKL-FQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKL 79 (318) Q Consensus 1 ~t~~~~~~~~~~-~a~~lft~~~~~~~~v~~ws~~l~~~~~~~~~~~~~~G~~~~s~I~~~~dL~k~~Gd~v~f~L~~~L 79 (318) |-+.++++.... +..+- ++.....-+++.|++.+...-.+.+-|..++ +..++. .|++|.|+-+... T Consensus 1 ~a~~~~~~~~~~~~g~~~-~~~d~~al~ie~~~geV~~~f~~~s~~~~~~---------~~r~i~--~G~sv~~~~iG~~ 68 (347) T protein:vir:88 1 MANATGGQQIGANQGKGQ-SAADKLALFLKVFGGEVLTAFVRRSVTMDKH---------MVRTIQ--NGKSASFPVMGRT 68 (347) T ss_pred CCCcccchhhhccCCCCc-cccchHHHHHHHHHHHHHHHHHHHhhhhhcc---------cccccc--CcceEEEeeecce Confidence 888887765432 11110 0011112268999998765555544432222 223343 5999999999998 Q ss_pred ccCCeecCceeecchhhhhheeeEEEEecc---cccccccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhccc Q lcl|Aclame:pro 80 SKRPTMGDERVEGRGEDLSHADFSLKINQG---RHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGAR 156 (318) Q Consensus 80 ~G~gv~Gd~~leGnee~L~~~sd~v~Idq~---R~~V~~~g~ms~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~LsG~r 156 (318) +.....-.+.+.+..+++.....+|.||+. +|.|+ .+++....+|+|++.-.....-+++..|+.+|.+|..+. T Consensus 69 ~~~~~~~g~~l~~~~~~~~~~~~~i~ID~~~y~~~~Vd---d~D~~q~~~D~r~~~~~~~g~aLA~~~D~~i~~~l~~~a 145 (347) T protein:vir:88 69 KGYYLAPGENLDDKRKDIKHSEKVIQIDGLLTSDVLIY---DIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLC 145 (347) T ss_pred eeeeeccccCCCCCCCCCccceEEEEEechhhhhhhhh---hHHHHhhcCCchHHHHHHHHHHHHHHHHHHHHHHHHHhh Confidence 888777777788888889999999999998 66776 678888899999999999999999999999999987543 Q ss_pred cccccccceeecccccccccccccccCCCCCCceEeecCCcccccccccccc---CHHHHHHHHHHHHhcCCCCceeEec Q lcl|Aclame:pro 157 GDFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIF---SIGLVDNLSLFIDEMAHPLQPVRLS 233 (318) Q Consensus 157 g~~~n~~~~~p~~~~~~~~~~~~N~v~aPt~~r~~~~~~at~~~~i~a~D~~---s~~~Id~a~~~a~~~~~pi~Pv~v~ 233 (318) ..... .+..++ +-..-+....+++++ ++..... -++.|..|.+..++..-| T Consensus 146 ~~~~~--------~~~~~~---------g~~~~~~~~~~~~~~--~~~~~~~~~~~~~~i~~a~~~Lde~~VP------- 199 (347) T protein:vir:88 146 NLPAA--------SNENIA---------GLGQAVVLNIGAAAD--LVDVEARGKAILKGLTLARARLTKNYVP------- 199 (347) T ss_pred ccccc--------cccccC---------Ccccccccccccccc--ccchhhhHHHHHHHHHHHHHHHhhcCCC------- Confidence 21110 011111 000000000011111 1111111 145566666666665444 Q ss_pred cccccCCcceEEEEecHHHHHHHHhCcchHHHHHHHHHHhhccccccCCcccCCeEEEcCEEEEecCceeE------EEc Q lcl|Aclame:pro 234 GDELHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPI------RFY 307 (318) Q Consensus 234 g~~~~~~~~~yV~~l~P~q~~dLr~d~~~~~w~~~qk~A~~r~~g~~nPlF~G~~gm~ngvii~e~~~~~i------Rf~ 307 (318) .++ +++++.|.||.+|.+++.+... .. .....+-.|.+|.++|+-|.+.+++|+ ++. T Consensus 200 ~~g-------R~~vv~P~~y~~Ll~~~~~~~~--------~~--~~~~~~~~G~vg~i~G~~V~~s~nlp~~~~~~~~~~ 262 (347) T protein:vir:88 200 AGD-------RRFYCAPEDYSAILSALMPNAA--------NY--AALIDPETGNIRNVMGFEVIEVPHLTVGGAGDNNPA 262 (347) T ss_pred CCC-------CEEEeCHHHHHHHhcchhhhhh--------hh--ccccchhcceeeeeccceEEEeeccccccccccccc Confidence 111 6788999999999998754211 11 123457789999999999999999984 111 Q ss_pred CCCeeee------eeeC Q lcl|Aclame:pro 308 QGQRFWY------QRIT 318 (318) Q Consensus 308 ag~~v~~------a~~~ 318 (318) .+.++-. +.+. T Consensus 263 ~~~~~t~~~~~~~~~~~ 279 (347) T protein:vir:88 263 DGVAPTNQKHIFPATAT 279 (347) T ss_pred ccccccccccccccccc Confidence 1111100 0010 No 11 >protein:vir:2201 Length: 345 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:49 # MgeName: T7 # Cross-refs: genbank:acc:NP_041998;swissprot:sw:p19726;genbank:gi:9627469;goa:P19726;uniprot:P19726;genbank:GeneID:1261026 Probab=99.17 E-value=3.2e-12 Score=83.60 Aligned_cols=262 Identities=11% Similarity=0.099 Sum_probs=160.7 Q ss_pred CCcCCcc-chhHHHHHHHHHHhcccchHHHHhhhhhhhhhhhhcccccccCCCCCccEEEeeccCCCCCcEEEEEEeecc Q lcl|Aclame:pro 1 MTTVTSA-QANKLFQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKL 79 (318) Q Consensus 1 ~t~~~~~-~~~~~~a~~lft~~~~~~~~v~~ws~~l~~~~~~~~~~~~~~G~~~~s~I~~~~dL~k~~Gd~v~f~L~~~L 79 (318) |+++..+ ++.-....+-.++-....-+++.|++.+..--.+.+-+..+ ++..+++ .|.++.|+-+... T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~al~le~f~geV~~~f~~~s~~~~~---------~~~r~i~--~gks~~~~~iG~~ 69 (345) T protein:vir:22 1 MASMTGGQQMGTNQGKGVVAAGDKLALFLKVFGGEVLTAFARTSVTTSR---------HMVRSIS--SGKSAQFPVLGRT 69 (345) T ss_pred CcccccchhcccccccccccCCchhHHHHHHHhHHHHHHHHHHhhhccc---------ceeeecc--ccceEEEeeecce Confidence 9988876 44444333333333334446999999886666665554322 2334554 5999999999888 Q ss_pred ccCCeecCceeecchhhhhheeeEEEEecc---cccccccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhccc Q lcl|Aclame:pro 80 SKRPTMGDERVEGRGEDLSHADFSLKINQG---RHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGAR 156 (318) Q Consensus 80 ~G~gv~Gd~~leGnee~L~~~sd~v~Idq~---R~~V~~~g~ms~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~LsG~r 156 (318) +-....-.+.+.+..++.+....+|.||+. ||.|+ .+++....+|+|.+.-..+..-+++..|+.++.+|..+. T Consensus 70 ~~~~~~~G~~l~~~~~~~~~~e~~ltID~~~y~~~~Vd---diD~~q~~~D~r~~~s~~~G~aLA~~~D~~i~~~l~k~a 146 (345) T protein:vir:22 70 QAAYLAPGENLDDKRKDIKHTEKVITIDGLLTADVLIY---DIEDAMNHYDVRSEYTSQLGESLAMAADGAVLAEIAGLC 146 (345) T ss_pred EEEeeecCCCCCCCCCCcccceEEEEecchhhhhhhHh---hHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHHhh Confidence 877777778899988888888899999998 55555 678889999999999999999999999999999887544 Q ss_pred cccccccceeecccccccccccccccCCCCCCceEeecCCcccccccccccc---CHHHHHHHHHHHHhcCCCCceeEec Q lcl|Aclame:pro 157 GDFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIF---SIGLVDNLSLFIDEMAHPLQPVRLS 233 (318) Q Consensus 157 g~~~n~~~~~p~~~~~~~~~~~~N~v~aPt~~r~~~~~~at~~~~i~a~D~~---s~~~Id~a~~~a~~~~~pi~Pv~v~ 233 (318) ... +.....|. +.+.+.... + .+ ....++..-+. -++.|..|....++..-|. T Consensus 147 ~~~-~~~~~~~~---~~~~~~~~~-~------------~~-~g~~~t~~~~~~~~~~~ai~~a~~~Lde~~VP~------ 202 (345) T protein:vir:22 147 NVE-SKYNENIE---GLGTATVIE-T------------TQ-NKAALTDQVALGKEIIAALTKARAALTKNYVPA------ 202 (345) T ss_pred ccc-cccccccc---ccccccccc-c------------cc-ccccccccccCHHHHHHHHHHHHHHhhhcCCCc------ Confidence 311 10000000 111110000 0 00 01111111111 1344555556666644442 Q ss_pred cccccCCcceEEEEecHHHHHHHHhCcchHHHHHHHHHHhhccccccCCcccCCeEEEcCEEEEecCceeEEEc---CCC Q lcl|Aclame:pro 234 GDELHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFY---QGQ 310 (318) Q Consensus 234 g~~~~~~~~~yV~~l~P~q~~dLr~d~~~~~w~~~qk~A~~r~~g~~nPlF~G~~gm~ngvii~e~~~~~iRf~---ag~ 310 (318) ++ ++++++|.|+..|+.++.+- ++ .-+..+.+=+|.+|.++|+-|.|-++.|.-.. .+. T Consensus 203 -~~-------R~~vv~P~~y~~Ll~~~~~~-------~~---~~~~~~~~~~G~V~~i~G~~V~~sn~lp~~~~~~~~~~ 264 (345) T protein:vir:22 203 -AD-------RVFYCDPDSYSAILAALMPN-------AA---NYAALIDPEKGSIRNVMGFEVVEVPHLTAGGAGTAREG 264 (345) T ss_pred -cC-------CEEEeChHHHHHHhcccccc-------cc---ccccccccccceEEEEeceEEEecccccccccCccccC Confidence 11 67999999999999998541 11 11334555689999999999999887763110 000 Q ss_pred eeeeeee-----------------------------------C Q lcl|Aclame:pro 311 RFWYQRI-----------------------------------T 318 (318) Q Consensus 311 ~v~~a~~-----------------------------------~ 318 (318) .+.-+.. + T Consensus 265 ~~~~~~~~~~~~g~~~~~~~~~~~~~l~~h~~A~~~v~~~~~~ 307 (345) T protein:vir:22 265 TTGQKHVFPANKGEGNVKVAKDNVIGLFMHRSAVGTVKLRDLA 307 (345) T ss_pred cccccccccccccceeeeeccCceEEEEEehhheeeeeeecce Confidence 0000000 0 No 12 >protein:vir:3364 Length: 347 # NCBI annotation: major capsid protein 10A # Family: family:all:975 # MgeID: mge:67 # MgeName: T3 # Cross-refs: genbank:acc:NP_523335;genbank:gi:17570826;genbank:GeneID:927448 Probab=99.12 E-value=5e-12 Score=82.53 Aligned_cols=264 Identities=11% Similarity=0.087 Sum_probs=161.4 Q ss_pred CCcCCccchhH-HHHHHHHHHhcccc-hHHHHhhhhhhhhhhhhcccccccCCCCCccEEEeeccCCCCCcEEEEEEeec Q lcl|Aclame:pro 1 MTTVTSAQANK-LFQVALFTAANRNR-SMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHK 78 (318) Q Consensus 1 ~t~~~~~~~~~-~~a~~lft~~~~~~-~~v~~ws~~l~~~~~~~~~~~~~~G~~~~s~I~~~~dL~k~~Gd~v~f~L~~~ 78 (318) |-++.+|+... .+..+- +.+-.. -+++.|++.+...-.+.+.+..++ +..++ ..|++|.|+-+.. T Consensus 1 ~~~~~~~~~~~t~~g~~~--~~~~~~al~ie~~~g~V~~~f~~~s~~~~~v---------~~r~~--~~G~sv~i~~iG~ 67 (347) T protein:vir:33 1 MANIQGGQQIGTNQGKGQ--SAADKLALFLKVFGGEVLTAFARTSVTMPRH---------MLRSI--ASGKSAQFPVIGR 67 (347) T ss_pred CCCCccCcccccccccCC--cccchHHHHHHHHHHHHHHHHHHHHhhhhhh---------ccccc--cccceeEeeeccc Confidence 88887775432 222220 111112 268999998876666665543332 22233 2499999999999 Q ss_pred cccCCeecCceeecchhhhhheeeEEEEeccc---ccccccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhcc Q lcl|Aclame:pro 79 LSKRPTMGDERVEGRGEDLSHADFSLKINQGR---HLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGA 155 (318) Q Consensus 79 L~G~gv~Gd~~leGnee~L~~~sd~v~Idq~R---~~V~~~g~ms~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~LsG~ 155 (318) .+....+..+.+.|+.++......+|.||+.. +.|+ .+++-.+.+|+|.+.-.....-+++..|+.++.+|+.+ T Consensus 68 ~t~~~~~~g~~l~~~~~~~~~~e~~ltiD~~~y~~~~Vd---diD~~q~~~D~~~~~~~~~g~aLA~~~D~~i~~~l~~~ 144 (347) T protein:vir:33 68 TKAAYLKPGENLDDKRKDIKHTEKVIHIDGLLTADVLIY---DIEDAMNHYDVRAEYTAQLGESLAMAADGAVLAELAGL 144 (347) T ss_pred eeeeeecCCCCCCCCCCCCccceEEEEechhhhhhHHHh---hHHHHhcCCchhHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 99888888889999989999999999999884 5665 56778889999999999999999999999999998766 Q ss_pred ccccccccceeecccccccccccccccCCCCCCceEeecCCccccccccccccCHHHHHHHHHHHHhcCCCCceeEeccc Q lcl|Aclame:pro 156 RGDFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGD 235 (318) Q Consensus 156 rg~~~n~~~~~p~~~~~~~~~~~~N~v~aPt~~r~~~~~~at~~~~i~a~D~~s~~~Id~a~~~a~~~~~pi~Pv~v~g~ 235 (318) .+........ .+.+.+.-..++..+ +.+.+.+. ...++ .-++.|..|.+...+..-|- + T Consensus 145 ~~~~~~~~~~-----~~~~~~~~~~~~~~~-------~tg~~~d~-~~~a~-~i~~~i~~a~~~Lde~~VP~-----~-- 203 (347) T protein:vir:33 145 VNLPDGSNEN-----IEGLGKPTVLTLVKP-------TTGSLTDP-VELGK-AIIAQLTIARASLTKNYVPA-----A-- 203 (347) T ss_pred hhhhcccccc-----ccccccccccccccc-------ccccccch-hhhHH-HHHHHHHHHHHHHhhcCCCc-----c-- Confidence 5432111111 011100000000000 00001110 11111 22456666777777755441 1 Q ss_pred cccCCcceEEEEecHHHHHHHHhCcchHHHHHHHHHHhhccccccCCcccCCeEEEcCEEEEecCceeEEEc-------- Q lcl|Aclame:pro 236 ELHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFY-------- 307 (318) Q Consensus 236 ~~~~~~~~yV~~l~P~q~~dLr~d~~~~~w~~~qk~A~~r~~g~~nPlF~G~~gm~ngvii~e~~~~~iRf~-------- 307 (318) + ++++++|.|+..|..++.+- ++.. +..-.+-+|.+|.|+|+-|.+.+++|---. T Consensus 204 g-------R~~vv~P~~y~~Ll~~~~~~-------~~d~---~~~~~~~~G~V~~i~G~~V~~Sn~lp~~~~~~~~~~~~ 266 (347) T protein:vir:33 204 D-------RTFYTTPDNYSAILAALMPN-------AANY---QALLDPERGTIRNVMGFEVVEVPHLTAGGAGDTREDAP 266 (347) T ss_pred C-------cEEEeCHHHHHHHhcccccc-------cccc---ccccccccceeEEEeceeEEEecccccCcccccccccc Confidence 1 57889999999999998642 1111 122357889999999999999988765210 Q ss_pred CC--------Ceeee--e----------------------eeC Q lcl|Aclame:pro 308 QG--------QRFWY--Q----------------------RIT 318 (318) Q Consensus 308 ag--------~~v~~--a----------------------~~~ 318 (318) +| .+..+ + .+. T Consensus 267 ag~~~~~~~~~~~~~~~a~~~~~gl~~h~~A~g~v~~~~~~~e 309 (347) T protein:vir:33 267 ADQKHAFPATSSTTVKVALDNVVGLFQHRSAVGTVKLKDLALE 309 (347) T ss_pred ccccccccCCcccceeccccceeeeeecchhheeeeeeceeee Confidence 00 00000 0 000 No 13 >protein:vir:10450 Length: 344 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:184 # MgeName: phiA1122 # Cross-refs: genbank:acc:NP_848297;genbank:gi:30387487;genbank:GeneID:1733971 Probab=99.11 E-value=7.1e-12 Score=81.71 Aligned_cols=261 Identities=12% Similarity=0.109 Sum_probs=160.0 Q ss_pred CCcCCcc-chhHHHHHHHHHHhcccchHHHHhhhhhhhhhhhhcccccccCCCCCccEEEeeccCCCCCcEEEEEEeecc Q lcl|Aclame:pro 1 MTTVTSA-QANKLFQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKL 79 (318) Q Consensus 1 ~t~~~~~-~~~~~~a~~lft~~~~~~~~v~~ws~~l~~~~~~~~~~~~~~G~~~~s~I~~~~dL~k~~Gd~v~f~L~~~L 79 (318) |.++..+ +++-..-.+-..+-....-+++.|++.+..--.+.+-|..+ ++..+++ .|.++.|+-+... T Consensus 1 ma~~~~~~~~n~~~~~~~~~~~~~~al~ie~~~geV~~~f~~~s~~~~~---------~~~r~i~--~g~s~~~~~iG~~ 69 (344) T protein:vir:10 1 MANMTGGQQLGTNQGKDVMAAGDKLALFLKVFGGEVLTAFARTSVTTSR---------HMVRSIS--SGKSAQFPVLGRT 69 (344) T ss_pred CccccccccCCcccCCccCCccchhHHHHHHHHHHHHHHHHHHhhhccc---------ceeeeec--ccceEEEEeecee Confidence 8755444 45544433333333444557999999886666665554322 1233454 4999999999888 Q ss_pred ccCCeecCceeecchhhhhheeeEEEEecc---cccccccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhccc Q lcl|Aclame:pro 80 SKRPTMGDERVEGRGEDLSHADFSLKINQG---RHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGAR 156 (318) Q Consensus 80 ~G~gv~Gd~~leGnee~L~~~sd~v~Idq~---R~~V~~~g~ms~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~LsG~r 156 (318) +-...+-++.+.|.-+++.-...+|.||+. ||.|+ .+++..+.+|+|.+.-..+..-+++..|+.++.+|+.+. T Consensus 70 ~~~~~~~G~~l~~t~~~~~~~e~~l~ID~~~y~~~~Vd---DiD~~q~~~D~r~~~~~~~G~aLA~~~D~~i~~~la~~a 146 (344) T protein:vir:10 70 QAAYLAPGENLDDIRKDIKHTEKVITIDGLLTADVLIY---DIEDAMNHYDVRSEYTSQLGESLAMAADGAVLAEIAGLC 146 (344) T ss_pred EEEeeecCCCCCCCCCCcccceEEEEEcchhhhhhhhh---hHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhh Confidence 766777777788887788888899999996 66665 678899999999999999999999999999999987544 Q ss_pred cccccccceeecccccccccccccccCCCCCCceEeecCCcccccccccccc----CHHHHHHHHHHHHhcCCCCceeEe Q lcl|Aclame:pro 157 GDFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIF----SIGLVDNLSLFIDEMAHPLQPVRL 232 (318) Q Consensus 157 g~~~n~~~~~p~~~~~~~~~~~~N~v~aPt~~r~~~~~~at~~~~i~a~D~~----s~~~Id~a~~~a~~~~~pi~Pv~v 232 (318) ... -|....+.+. ++. ...... ......++... -++.|.+|.+.+++..-| T Consensus 147 ~~~------~~~~~~~~g~---------~~~--~~~~~~--~~~~~~t~~~~~~~~~~~~i~~a~~~Lde~~VP------ 201 (344) T protein:vir:10 147 NVE------SQYNENITGL---------GTA--TVIETT--QDKTTLTDQVALGKEIIAALTKARAALTKNYVP------ 201 (344) T ss_pred ccc------cccccccccc---------ccc--ceeecc--cccccccchhhhHHHHHHHHHHHHHHHhhcCCC------ Confidence 311 0110000000 000 000000 00001111111 235566677777775544 Q ss_pred ccccccCCcceEEEEecHHHHHHHHhCcchHHHHHHHHHHhhccccccCCcccCCeEEEcCEEEEecCceeEE------- Q lcl|Aclame:pro 233 SGDELHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIR------- 305 (318) Q Consensus 233 ~g~~~~~~~~~yV~~l~P~q~~dLr~d~~~~~w~~~qk~A~~r~~g~~nPlF~G~~gm~ngvii~e~~~~~iR------- 305 (318) .++ +++++.|+||..|+.++.+- . . .-+..+.+-+|.+|.++|+-|.+-+++|.= T Consensus 202 -~~g-------R~~vv~P~~y~~Ll~~~~~~------~-~---~~~~~~~~~~G~V~~v~G~~V~~Sn~lp~~~~~~~~~ 263 (344) T protein:vir:10 202 -SSD-------RVFYCDPDSYSAILAALMPN------A-A---NYAALIDPEKGSIRNVMGFEVVEVPHLTAGGAGTSRE 263 (344) T ss_pred -ccC-------CEEEeChHHHHHHhhccccc------c-c---ccccccceeeeEEEEEeceEEEeccccccccCCcccc Confidence 111 57889999999999997541 1 1 113345677899999999999999887641 Q ss_pred --------EcCCCe----eeeee----eC Q lcl|Aclame:pro 306 --------FYQGQR----FWYQR----IT 318 (318) Q Consensus 306 --------f~ag~~----v~~a~----~~ 318 (318) |.++.. +-+++ +. T Consensus 264 ~~tg~~~~~~~~~~~~~~~~~s~~~~l~~ 292 (344) T protein:vir:10 264 GTTGQKHAFPATKSGNDKVAKDNVIGLFM 292 (344) T ss_pred cccCccccccCCcccceeeecceeEEEee Confidence 001100 00011 10 No 14 >protein:vir:3613 Length: 272 # NCBI annotation: MHP # Family: family:all:522 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112699;genbank:gi:13786567;genbank:GeneID:921035 Probab=99.09 E-value=1.5e-11 Score=79.92 Aligned_cols=222 Identities=11% Similarity=0.057 Sum_probs=135.4 Q ss_pred CCcCCccchhHHHHHHHHHHhcccchHHHHhhhhhhhhhhhhcccccccCCCCCccEEEeeccCCCCCcEEEEEEeeccc Q lcl|Aclame:pro 1 MTTVTSAQANKLFQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLS 80 (318) Q Consensus 1 ~t~~~~~~~~~~~a~~lft~~~~~~~~v~~ws~~l~~~~~~~~~~~~~~G~~~~s~I~~~~dL~k~~Gd~v~f~L~~~L~ 80 (318) |.+.....+. .=.|- .|+.-+.....++..+ .+ . .....+|+-++|++|+|+....+ T Consensus 1 ma~~~T~~~d------------~iiPe--v~~~~v~~~~~~~~~~---~~----~-~~~~~~l~g~~G~ti~iP~~~~~- 57 (272) T protein:vir:36 1 MSKQKTTLAD------------LVNPE--VLAPIVSYELNKALRF---AP----L-AQVDTTLQGQPGNTLKFPAFTYI- 57 (272) T ss_pred CCCcceehhh------------hhchH--HHHHHHHHHHHhhhhh---cc----c-cccccccccCCCCEEEEeeeccC- Confidence 3322111111 01122 2554432222222222 11 1 12245687789999999998655 Q ss_pred cCC--eecCceeecchhhhhheeeEEEEecccccccccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhccccc Q lcl|Aclame:pro 81 KRP--TMGDERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGD 158 (318) Q Consensus 81 G~g--v~Gd~~leGnee~L~~~sd~v~Idq~R~~V~~~g~ms~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~LsG~rg~ 158 (318) |+. +..++.+ ..+.|+..++++.|.+.-.++.... ++...+.-|+..++...++.+|++..|..++..|.|+.. T Consensus 58 gda~~~~eg~~i--~~~~lt~~~~~~~i~~~~k~~~vtD-~~~~~~~~d~~~~~~~~~a~~~a~~~d~~i~~~l~~~~~- 133 (272) T protein:vir:36 58 GDAADVAEGGEI--SLDKIGTTTKSVTIKKAAKGTEITD-EAALSGYGDPIGESNKQLGLSLANKVDDDLLSAAKTTSQ- 133 (272) T ss_pred ccccccCCCCcc--ChhhcCCcceeEeeehhhccccccH-HHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhccccc- Confidence 443 2222222 4788999999999999888888764 466667889999999999999999999999887765431 Q ss_pred cccccceeecccccccccccccccCCCCCCceEeecCCccccccccccccCHHHHHHHHHHHHhcCCCCceeEecccccc Q lcl|Aclame:pro 159 FVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELH 238 (318) Q Consensus 159 ~~n~~~~~p~~~~~~~~~~~~N~v~aPt~~r~~~~~~at~~~~i~a~D~~s~~~Id~a~~~a~~~~~pi~Pv~v~g~~~~ 238 (318) .+ +-..+.+.|.+|.......+. T Consensus 134 ----------------------~~----------------------~~~~~~d~i~~A~~~lgd~~~------------- 156 (272) T protein:vir:36 134 ----------------------TV----------------------STKANVDGVQAALDIFNDEDA------------- 156 (272) T ss_pred ----------------------cc----------------------cccccHHHHHHHHHHhhhcCC------------- Confidence 00 112355667777765543221 Q ss_pred CCcceEEEEecHHHHHHHHhCcchHHHHHHHHHHhhccccccCCcccCCeEEEcCEEEEecCceeEE------E-cCCCe Q lcl|Aclame:pro 239 GEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIR------F-YQGQR 311 (318) Q Consensus 239 ~~~~~yV~~l~P~q~~dLr~d~~~~~w~~~qk~A~~r~~g~~nPlF~G~~gm~ngvii~e~~~~~iR------f-~ag~~ 311 (318) +-++++|||.++..|++|+.+ .... ..+.++++++|.+|.|.|+-|..-..+|-= + ..-+. T Consensus 157 ---~~~~ivv~p~~~~~L~k~~~~---~~~~------~~~~~~~~~~G~ig~~~G~~Vv~s~~~p~~~~~~~~~~~~~gA 224 (272) T protein:vir:36 157 ---QAYVLIVNPKDAAKIRKDANA---KNIG------SEVGANALINGTYADVLGAQIVRSKKLAEGSALMFKIVSNSPA 224 (272) T ss_pred ---CceEEEEcHHHHHHHhccccc---cccc------ccccccceeeeccceecCeeEEEeCCCCCCceeEEEEEecccc Confidence 126899999999999999754 2211 123457899999999999999998777720 0 00011 Q ss_pred eeee--ee-C Q lcl|Aclame:pro 312 FWYQ--RI-T 318 (318) Q Consensus 312 v~~a--~~-~ 318 (318) +.+. +. + T Consensus 225 ~~~~~~~~~~ 234 (272) T protein:vir:36 225 LKLVLKRGVQ 234 (272) T ss_pred eeeeecCCcc Confidence 1110 11 1 No 15 >protein:vir:94711 Length: 347 # NCBI annotation: capsid # Family: family:all:975 # MgeID: mge:1528 # MgeName: K1F # Cross-refs: genbank:acc:YP_338120;genbank:gi:77118198;genbank:GeneID:3707734 Probab=99.07 E-value=1.1e-11 Score=80.62 Aligned_cols=262 Identities=11% Similarity=0.097 Sum_probs=153.9 Q ss_pred CCcCCccchhHHHHHHHHHHhcccchHHHHhhhhhhhhhhhhcccccccCCCCCccEEEeeccCCCCCcEEEEEEeeccc Q lcl|Aclame:pro 1 MTTVTSAQANKLFQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLS 80 (318) Q Consensus 1 ~t~~~~~~~~~~~a~~lft~~~~~~~~v~~ws~~l~~~~~~~~~~~~~~G~~~~s~I~~~~dL~k~~Gd~v~f~L~~~L~ 80 (318) |..+++....-....+- +....-.-+++.|.+.++..-...+-+..+ ++..++ ..|++|.|+-+...+ T Consensus 1 m~~~~~~~~~t~~g~~~-~~~d~~al~ik~f~~eV~~~f~~~s~~~~~---------~~~r~i--~~G~sv~i~~iG~~t 68 (347) T protein:vir:94 1 MANVPGQKIGTDQGKGK-SSSDALALFLKVFAGEVLTAFTRRSVTADK---------HIVRTI--QNGKSAQFPVMGRTS 68 (347) T ss_pred CCCCCccccccccccCC-ccccHHHHHHHHHhHHHHHHHHHHHhhhcc---------cccccc--cccceEEEeccccee Confidence 87777654432221110 011111234777877776665554443222 233344 359999999999998 Q ss_pred cCCeecCceeecchhhhhheeeEEEEecc---cccccccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhcccc Q lcl|Aclame:pro 81 KRPTMGDERVEGRGEDLSHADFSLKINQG---RHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARG 157 (318) Q Consensus 81 G~gv~Gd~~leGnee~L~~~sd~v~Idq~---R~~V~~~g~ms~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~LsG~rg 157 (318) -...+-++.+.|+-+++.-....|.||+. |+.|+ .+++....+|+|.+.-.....-+++..|+.++.+|....+ T Consensus 69 v~~~t~G~~l~~~~~~~~~~e~~itID~~~~~~~~Vd---diD~~q~~~D~~~~~~~~~g~aLa~~~D~~i~~~~~~~aa 145 (347) T protein:vir:94 69 GVYLAPGERLSDKRKGIKHTEKVITIDGLLTADVMIF---DIEDAMNHYDVAGEYSNQLGEALAIAADGAVLAEMAILCN 145 (347) T ss_pred eeeecCCCCcCCCCCCCCcceEEEEecchhhhhHHhh---hHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHhc Confidence 88888888898988888888899999998 56665 5678888999999999999999999999999987764443 Q ss_pred ccccccceeecccccccccccccccCCCCCCceEeecCCccc-cccccccccCHHHHHHHHHHHHhcCCCCceeEecccc Q lcl|Aclame:pro 158 DFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSF-EQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDE 236 (318) Q Consensus 158 ~~~n~~~~~p~~~~~~~~~~~~N~v~aPt~~r~~~~~~at~~-~~i~a~D~~s~~~Id~a~~~a~~~~~pi~Pv~v~g~~ 236 (318) ....... ..+.+ .+++ .+-.+.++.. ......+.+ ++.|.+|.+..++...| .++ T Consensus 146 ~~~~~~~-----~~~g~--------~~~s---~~~~~~~~~~~~~~~~~~~~-~~~i~~a~~~Lde~~VP-------~~~ 201 (347) T protein:vir:94 146 LPAASNE-----NIAGL--------GTAS---VLEVGKKADLDTPAKLGEAI-IGQLTIARAKLTSNYVP-------AGD 201 (347) T ss_pred ccccccc-----ccCCC--------cccc---eeeccccccccchhhhHHHH-HHHHHHHHHHHhhcCCC-------CCC Confidence 2111000 00011 0010 0000000000 000011111 34566666666665544 111 Q ss_pred ccCCcceEEEEecHHHHHHHHhCcchHHHHHHHHHHhhccccccCCcccCCeEEEcCEEEEecCceeEE------EcCCC Q lcl|Aclame:pro 237 LHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIR------FYQGQ 310 (318) Q Consensus 237 ~~~~~~~yV~~l~P~q~~dLr~d~~~~~w~~~qk~A~~r~~g~~nPlF~G~~gm~ngvii~e~~~~~iR------f~ag~ 310 (318) ++++++|+++..|..++.+.. +.. ..+..+=.|.+|.++|+-|.+-+++|.= -..+- T Consensus 202 -------R~~vv~P~~~~~Ll~~~~~~~-------~~~---~~~~~~~~G~Vg~i~G~~V~~Sn~lp~~~~t~~~~~~~~ 264 (347) T protein:vir:94 202 -------RYFYTTPDNYSAILAALMPNA-------ANY---AALIDPETGNIRNVMGFVVVEVPHLVQGGAGETRGDDGI 264 (347) T ss_pred -------cEEEeCHHHHHHHhccchhhh-------hhc---cccccccccceEEEeceEEEecCcccccccccccccCcc Confidence 678899999999998875421 111 1122355799999999999999988851 11111 Q ss_pred eeeee--e-----------eC Q lcl|Aclame:pro 311 RFWYQ--R-----------IT 318 (318) Q Consensus 311 ~v~~a--~-----------~~ 318 (318) ++.+. + .. T Consensus 265 ~~~aG~~~~~~~~~~~~~~~~ 285 (347) T protein:vir:94 265 TIASGQKHAFPATASSDVKVT 285 (347) T ss_pred eecCcccccccccchhhhccc Confidence 11110 0 00 No 16 >protein:vir:94622 Length: 341 # NCBI annotation: PfWMP4_37 # Family: family:all:2203 # MgeID: mge:1525 # MgeName: Pf-WMP4 # Cross-refs: genbank:acc:YP_762667;genbank:gi:115304375;genbank:GeneID:5142322 Probab=99.03 E-value=3.5e-11 Score=77.94 Aligned_cols=238 Identities=13% Similarity=0.090 Sum_probs=137.6 Q ss_pred HHHHH-HHHhcccchHH-----HHhhhhhhhhhhhhcccccccCCCCCccEEEeeccCCCCCcEEEEEEeeccccCCeec Q lcl|Aclame:pro 13 FQVAL-FTAANRNRSMV-----NILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLSKRPTMG 86 (318) Q Consensus 13 ~a~~l-ft~~~~~~~~v-----~~ws~~l~~~~~~~~~~~~~~G~~~~s~I~~~~dL~k~~Gd~v~f~L~~~L~G~gv~G 86 (318) ||.++ ||...-+.+.| ++|++.+-..-.+..-| ....+..+..-.+||+|+|+.....+-..... T Consensus 1 ~~~~~~~~~~~~~t~~v~~fipei~s~~i~~~l~~~~v~---------~~~~~d~~~~~~~Gdtv~ip~~g~~~~~d~~~ 71 (341) T protein:vir:94 1 MALGNTITGPSINTQRGQQFIPEQWLSEVQMFRKAKMLD---------TSVVKTWGAQVKKGDTFHVPRISELGVEDKAT 71 (341) T ss_pred CcchhhhccccccchhHHHHHHHHHHHHHHHHHHhhcch---------hhccccccccccCCceEEEeccCcceeeeecC Confidence 33222 12211222222 34655432211111111 11112222333459999999765444323332 Q ss_pred CceeecchhhhhheeeEEEEeccc-ccccccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccce Q lcl|Aclame:pro 87 DERVEGRGEDLSHADFSLKINQGR-HLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGDFVADDTI 165 (318) Q Consensus 87 d~~leGnee~L~~~sd~v~Idq~R-~~V~~~g~ms~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~LsG~rg~~~n~~~~ 165 (318) +..+ ..+++.-.+.+|.||+.+ .++.+. .+++..+..|+|.+........+++..|+.++..++++.+... T Consensus 72 ~~~i--~~~~~~~~~~~itiD~~~~~~~~i~-d~d~~~~~~d~~~~~~~~~~~aLA~~~D~~i~~~~a~~~~~~~----- 143 (341) T protein:vir:94 72 DVPV--GVQPVNDTDFVITVDTDRTTAVALD-DLLEIQASYDLRAPYLEAMGYALAKDMTGSILGLRAAVQNTAS----- 143 (341) T ss_pred CCcc--ccccccCceEEEEEeeeeecceeec-hHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHhhhcccccc----- Confidence 3333 346777889999999986 555554 6688888999999999999999999999998877765443100 Q ss_pred eecccccccccccccccCCCCCCceEeecCCccccccccccccCHHHHHHHHHHHHhcCCCCceeEeccccccCCcceEE Q lcl|Aclame:pro 166 LPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHGEDPYYV 245 (318) Q Consensus 166 ~p~~~~~~~~~~~~N~v~aPt~~r~~~~~~at~~~~i~a~D~~s~~~Id~a~~~a~~~~~pi~Pv~v~g~~~~~~~~~yV 245 (318) .+.+..++. ......+.++.+.|..|.+.+++..-| -+| ++ T Consensus 144 -------------~~~~~~~~~------------~~t~~~~~~~~~~i~~a~~~Lde~~VP-----~~g---------R~ 184 (341) T protein:vir:94 144 -------------QNVFSSSNG------------AITGNGQAFSFAVFLAARRLLLEADVP-----EEK---------IV 184 (341) T ss_pred -------------CccccCccc------------cccCchhhhhHHHHHHHHHHHhhcCCC-----ccC---------CE Confidence 011111110 011123557788888898888886544 111 56 Q ss_pred EEecHHHHHHHHhCcchHHHHHHHHHHhhccccccCCcccCCeEEEcCEEEEecCceeEEEcCCCe-----eeee----- Q lcl|Aclame:pro 246 LYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGQR-----FWYQ----- 315 (318) Q Consensus 246 ~~l~P~q~~dLr~d~~~~~w~~~qk~A~~r~~g~~nPlF~G~~gm~ngvii~e~~~~~iRf~ag~~-----v~~a----- 315 (318) ++|+|.++.+|++|+.+ .+... +.++.|-+|.+|.|.|+-|++.+++|. +.+.. ..++ T Consensus 185 lvv~P~~~~~Ll~~~~~---~~~~~-------~g~~~l~~G~ig~i~G~~V~~Sn~lp~--~~~~~~~~~~~~~~~~~~~ 252 (341) T protein:vir:94 185 LLISPGQESALFTIPQF---ISKDF-------INNAPIAQGQIGSLMGVRVIRTSLIGN--NSATGWRNGAPTIAPAEAT 252 (341) T ss_pred EEeCHHHHHHHhhchhh---hhhhc-------cccchhheeeeeeEeceEEEEeccccc--cccccccccccceeccccc Confidence 78999999999999754 22111 224568899999999999999888764 11100 0000 Q ss_pred -eeC Q lcl|Aclame:pro 316 -RIT 318 (318) Q Consensus 316 -~~~ 318 (318) -|+ T Consensus 253 ~~i~ 256 (341) T protein:vir:94 253 PGFT 256 (341) T ss_pred cccc Confidence 000 No 17 >protein:vir:7990 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:151 # MgeName: Che8 # Cross-refs: genbank:acc:NP_817344;genbank:gi:29565772;genbank:GeneID:1258978 Probab=99.00 E-value=1.5e-10 Score=74.47 Aligned_cols=226 Identities=13% Similarity=0.037 Sum_probs=135.8 Q ss_pred HHHHHHHHhcccchHHHHhhhhhhhhhhhhcccccccCCCCCccEEEeeccCCCCCcEEEEEEeeccccCCeecCceeec Q lcl|Aclame:pro 13 FQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLSKRPTMGDERVEG 92 (318) Q Consensus 13 ~a~~lft~~~~~~~~v~~ws~~l~~~~~~~~~~~~~~G~~~~s~I~~~~dL~k~~Gd~v~f~L~~~L~G~gv~Gd~~leG 92 (318) ||..+| .-++|++.+...-.+...+..+. .+.-++....||+|+|+.....+..-..+... .+ T Consensus 1 MA~~~~--------~pei~~~~v~~~~~~~lv~~~l~--------~~~~~~~~~~GdTv~ip~~~~~~~~d~~~~~~-~~ 63 (273) T protein:vir:79 1 MAFNNF--------IPELWSDMLLEEWTAQTVFANLV--------NREYEGIASKGNVVHIAGVVAPTVKDYKAAGR-QT 63 (273) T ss_pred Ccchhh--------hHHHHHHHHHHHHHhhccchhhh--------hccccccccCCcEEEEeecCcccccccccCCC-cc Confidence 444332 12457776544444444333332 22223445689999999876544222111111 24 Q ss_pred chhhhhheeeEEEEecc-cccccccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccceeecccc Q lcl|Aclame:pro 93 RGEDLSHADFSLKINQG-RHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGDFVADDTILPTAEH 171 (318) Q Consensus 93 nee~L~~~sd~v~Idq~-R~~V~~~g~ms~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~LsG~rg~~~n~~~~~p~~~~ 171 (318) ..+++...+.++.||+. .+++.+. .+++..+..||++..+. +..=+++..|+.++..++++... T Consensus 64 ~~~~~~~~~~~~tid~~~~~~~~i~-d~d~~~~~~~~~~~~~~-~~~ala~~vD~~i~~~~~~a~~~------------- 128 (273) T protein:vir:79 64 SADAISDTGVDLLIDQEKSIDFLVD-DIDRVQVAGSLEAYTRA-GATALATDTDKFIADMLVDNGTA------------- 128 (273) T ss_pred CccccccceEEEEEeeecccceeec-cHHHHhhcccHHHHHHH-HHHHHHHHHHHHHHHHHhhcccc------------- Confidence 57889999999999996 4567665 44666678899986665 55668899999888777643310 Q ss_pred cccccccccccCCCCCCceEeecCCccccccccccccCHHHHHHHHHHHHhcCCCCceeEeccccccCCcceEEEEecHH Q lcl|Aclame:pro 172 PEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHGEDPYYVLYVTPR 251 (318) Q Consensus 172 ~~~~~~~~N~v~aPt~~r~~~~~~at~~~~i~a~D~~s~~~Id~a~~~a~~~~~pi~Pv~v~g~~~~~~~~~yV~~l~P~ 251 (318) |+..+ .++... .++.|..|...++...-|- +| ++++++|. T Consensus 129 --------~~~~~----------------~~~~~~--~~~~i~~a~~~ld~~~vP~-----~~---------R~lvv~p~ 168 (273) T protein:vir:79 129 --------LTGSA----------------PSDADD--AFDLIASALKELTKANVPN-----VG---------RVVVVNAE 168 (273) T ss_pred --------ccccc----------------ccchhh--HHHHHHHHHHHhhhccCCc-----cC---------cEEEECHH Confidence 11111 111111 3567778888887766541 11 57899999 Q ss_pred HHHHHHhCcchHHHHHHHHHHhhccccccCCcccCCeEEEcCEEEEecCceeEE-------EcCCCeeeeeeeC Q lcl|Aclame:pro 252 QWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIR-------FYQGQRFWYQRIT 318 (318) Q Consensus 252 q~~dLr~d~~~~~w~~~qk~A~~r~~g~~nPlF~G~~gm~ngvii~e~~~~~iR-------f~ag~~v~~a~~~ 318 (318) ++..|++++.+ +.+ +.. .|..+.|-.|.+|.|.|+-|++..++|.= |..+..+-+.+|. T Consensus 169 ~~~~Ll~~~~~--~~~----~~~--~~~~~~l~~G~ig~~~G~~i~~s~~lp~~~~~~~~a~~~~A~~~a~~~~ 234 (273) T protein:vir:79 169 MAFWLRSSGSK--LTS----ADT--SGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQFVAFHPSAAAYVSQID 234 (273) T ss_pred HHHHHhhchhh--hhh----hhh--cccccceeeeEeeEEeceEEEecccccccCceEEEEEeccceeeeeehh Confidence 99999998642 222 211 25567899999999999999998877641 1111112222332 No 18 >protein:vir:94576 Length: 347 # NCBI annotation: Major capsid protein # Family: family:all:975 # MgeID: mge:1516 # MgeName: Berlin # Cross-refs: genbank:acc:YP_919012;genbank:gi:119637776;genbank:GeneID:5179336 Probab=98.98 E-value=4.1e-11 Score=77.52 Aligned_cols=260 Identities=11% Similarity=0.098 Sum_probs=155.2 Q ss_pred CCcCCccchhH-HHHHHHHHHhcccchHHHHhhhhhhhhhhhhcccccccCCCCCccEEEeeccCCCCCcEEEEEEeecc Q lcl|Aclame:pro 1 MTTVTSAQANK-LFQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKL 79 (318) Q Consensus 1 ~t~~~~~~~~~-~~a~~lft~~~~~~~~v~~ws~~l~~~~~~~~~~~~~~G~~~~s~I~~~~dL~k~~Gd~v~f~L~~~L 79 (318) |-++.+|+... ....+. +....-+-+++.|++.+..--.+.+-|..++ +..++. .|+++.|+-+... T Consensus 1 ma~~~~~~~~~t~~g~~~-~~~d~~al~ie~~~geV~~~f~~~s~~~~~~---------~~rti~--~G~sv~~~~iG~~ 68 (347) T protein:vir:94 1 MANMNGGQQMGKDQGKGM-SAGDKLALFLKVFGGEVLTAFTRTSVTMNKH---------LVRSIQ--SGKSAQFPVLGRT 68 (347) T ss_pred CCccccccccccccccCC-cccchHHHHHHHHhHHHHHHHHHHHhhhhhh---------hheecc--ccceEEeeeccce Confidence 88888886443 222221 0111112268999998765555555543332 222343 5999999999888 Q ss_pred ccCCeecCceeecchhhhhheeeEEEEecc---cccccccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhccc Q lcl|Aclame:pro 80 SKRPTMGDERVEGRGEDLSHADFSLKINQG---RHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGAR 156 (318) Q Consensus 80 ~G~gv~Gd~~leGnee~L~~~sd~v~Idq~---R~~V~~~g~ms~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~LsG~r 156 (318) +-....-.+.+.+.-+++.....+|.||+. +|.|+ .+++....+|+|.+.-.....-+++..|+.+|.+|.-+. T Consensus 69 ~~~~~~~G~~l~~~~~~~~~~e~~ltID~~~y~~~~Vd---diD~~q~~~D~rs~~~~~~g~ALA~~~D~~i~~~l~~~a 145 (347) T protein:vir:94 69 KAAYLQPGENLDDKRKDMKHTEKTINIDGLLTADVLIY---DIEDAMNHYDVRSEYTAQLGESLAMAADGAVLAEMAKLC 145 (347) T ss_pred eEeeeecCcCCCCCcCCccccceEEEEcchhhhhhhhh---hHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhh Confidence 877777777777777889999999999997 55665 678888899999999999999999999999998886333 Q ss_pred cccccccceeecccccccccccccccCCCCCCceEeecCCccccccccccccC----HHHHHHHHHHHHhcCCCCceeEe Q lcl|Aclame:pro 157 GDFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFS----IGLVDNLSLFIDEMAHPLQPVRL 232 (318) Q Consensus 157 g~~~n~~~~~p~~~~~~~~~~~~N~v~aPt~~r~~~~~~at~~~~i~a~D~~s----~~~Id~a~~~a~~~~~pi~Pv~v 232 (318) ... .|....+.+. |..--+-.+. .+.++.+..-+ ++.|.+|....++..-| T Consensus 146 ~~~------~~~~~~~~g~---------~~~~~v~i~~----~~~~~~~~~~~~~~~~d~i~~a~~~Lde~dVP------ 200 (347) T protein:vir:94 146 NLP------TANNENIAGL---------GKAHVLEVGD----QATLQGDQVKLGQAIIAQLTLARAKLTGNYVP------ 200 (347) T ss_pred ccc------cccccccccC---------CcceeEeeec----cccccccccccHHHHHHHHHHHHHHhhhcCCC------ Confidence 210 0110000000 0000000000 11122121111 44566666666664443 Q ss_pred ccccccCCcceEEEEecHHHHHHHHhCcchHHHHHHHHHHhhccccccCCcccCCeEEEcCEEEEecCceeEE------E Q lcl|Aclame:pro 233 SGDELHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIR------F 306 (318) Q Consensus 233 ~g~~~~~~~~~yV~~l~P~q~~dLr~d~~~~~w~~~qk~A~~r~~g~~nPlF~G~~gm~ngvii~e~~~~~iR------f 306 (318) .++ +++++.|.||..|.+..... .. ..+..+.+=+|.+|.++|+-|.+-+++|+= . T Consensus 201 -~~~-------R~~vv~P~~y~~LLk~~~~~--------~~--~~~~~~~~~~G~V~~v~G~~V~~Sn~~p~~~~~~~~~ 262 (347) T protein:vir:94 201 -SSD-------RVFYTTPDNYSAILAALMPN--------AA--NYQALIDPSTGSIRNVMGFEVIEVPHLTAGGAGDNRA 262 (347) T ss_pred -CCC-------CEEEeChHHHHHHHHhhccc--------cc--ccccccccccceeEEeeceEEEEcCccccccCccccc Confidence 111 78999999999999753221 11 113345566899999999999999998862 1 Q ss_pred cCCCee------------eeeeeC Q lcl|Aclame:pro 307 YQGQRF------------WYQRIT 318 (318) Q Consensus 307 ~ag~~v------------~~a~~~ 318 (318) .+|... ..-|.. T Consensus 263 ~~~~~~~~~~~~~~~~~~~~y~~d 286 (347) T protein:vir:94 263 EEGVAPTNQKHAFPDTASGDTRVA 286 (347) T ss_pred cccccccccccccccccccccccc Confidence 111100 000110 No 19 >protein:vir:80213 Length: 334 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:1879 # MgeName: LKA1 # Cross-refs: genbank:acc:YP_001522884;genbank:gi:158345177;genbank:GeneID:5687476 Probab=98.95 E-value=1.4e-10 Score=74.60 Aligned_cols=255 Identities=13% Similarity=0.085 Sum_probs=147.7 Q ss_pred CCcCCcc-chhHHHHHHHHHHhcccchHHHHhhhhhhhhhhhhcccccccCCCCCccEEEeeccCCCCCcEEEEEEeecc Q lcl|Aclame:pro 1 MTTVTSA-QANKLFQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKL 79 (318) Q Consensus 1 ~t~~~~~-~~~~~~a~~lft~~~~~~~~v~~ws~~l~~~~~~~~~~~~~~G~~~~s~I~~~~dL~k~~Gd~v~f~L~~~L 79 (318) |+|+..+ +-...++ .+-....-+++.|++.+.......+-|..+ . ++.++ ..|+++.|+-+... T Consensus 1 m~~~~~~~~t~~~~~----~~~~~~~l~le~~~geV~~af~~~s~~~~~------~---~~r~i--~~G~s~~~~~iG~~ 65 (334) T protein:vir:80 1 MTYPAANTHTRPGWG----GANSDVSLHIEEHLGLVDASFMYSSKFASW------M---NVRSL--RGTNQLRVDRVGAS 65 (334) T ss_pred CCCCcCCCccccccc----cccchheehhhhhhhHHHHHHHHhhhhhcc------c---eeeec--cccceEEEeeecce Confidence 8888543 2111222 111122224899999886666655544322 1 22344 34999999988887 Q ss_pred ccCCeecCceeecchhhhhheeeEEEEecc---cccccccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHh-cc Q lcl|Aclame:pro 80 SKRPTMGDERVEGRGEDLSHADFSLKINQG---RHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLA-GA 155 (318) Q Consensus 80 ~G~gv~Gd~~leGnee~L~~~sd~v~Idq~---R~~V~~~g~ms~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~Ls-G~ 155 (318) +-....=++.+.+. .+.....+|.||+. ||.|+ .+++-...+|+|.+.-.....-+++..||.+|..|. ++ T Consensus 66 ~~~~~~~g~~l~~~--~~~~~~~~l~ID~~l~~~~~Vd---diD~~q~~~D~rse~~~~~G~aLA~~~D~~~~~~l~kaa 140 (334) T protein:vir:80 66 TIAGRKAGEELVVQ--KNVSDKLNLTVDTVLYARHFFD---KFDEWTSNLDVRKETAREDGIALARQYDQACIIQLQKCG 140 (334) T ss_pred eeeeecCCCCCCCC--CcccCceEEEEeeeeehhhhHh---hHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhh Confidence 76666666677664 47778889999996 55664 567788889999999999999999999999987775 22 Q ss_pred ccccccccceeecccccccccccccccCCCCCCceEeecCCcccccc--cc-ccccCHHHHHHH----HHHHHhcCCCCc Q lcl|Aclame:pro 156 RGDFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQI--EA-ADIFSIGLVDNL----SLFIDEMAHPLQ 228 (318) Q Consensus 156 rg~~~n~~~~~p~~~~~~~~~~~~N~v~aPt~~r~~~~~~at~~~~i--~a-~D~~s~~~Id~a----~~~a~~~~~pi~ 228 (318) +. .-|....+.| ++|......+ ++ ...-+.+.|-+| .+...+..-|-. T Consensus 141 ~~-------~~~~~~~~~~------------------~~G~~~~~~~~g~~~~~~~~~~~l~~a~~~a~~~L~e~dvp~~ 195 (334) T protein:vir:80 141 DF-------LAPAHLKPAF------------------HDGILLPSTISGLAADAAADADVLVAAHRQGVEAMVFRDLGDQ 195 (334) T ss_pred hh-------cccccccccc------------------cCCcceeecccccccchhhhHHHHHHHHHHHHHHHHhcCCCCC Confidence 21 1111111111 1111111011 11 112223344444 444444333311 Q ss_pred eeEeccccccCCcceEEEEecHHHHHHHHhCcchHHHHHHHHHHhhccccccCCcccCCeEEEcCEEEEecCceeEE--- Q lcl|Aclame:pro 229 PVRLSGDELHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIR--- 305 (318) Q Consensus 229 Pv~v~g~~~~~~~~~yV~~l~P~q~~dLr~d~~~~~w~~~qk~A~~r~~g~~nPlF~G~~gm~ngvii~e~~~~~iR--- 305 (318) | . .+ +++++.|.||..|..++.+- ++.-...+..+++=.|.++.|+||-|.+-+++|-- T Consensus 196 ~--~--~~-------R~~vv~P~~y~~Ll~~~r~~-------n~d~~~s~~~~~~~~g~i~~v~G~~V~~Sn~~P~~~~t 257 (334) T protein:vir:80 196 L--M--SE-------GVTLLDPVIFSFLLEHDRLM-------NVEFGAKEGGNSFVGGRIAMLNGVRVVETPRFPQSAIT 257 (334) T ss_pred c--C--Cc-------eEEEeChHHHHHHhcccccc-------cceeccccccccccceeEEEEeceEEEeecCCCCcccc Confidence 0 0 11 79999999999999998541 11111123357788899999999999997776621 Q ss_pred -------E--cCCC---------------eeeeeeeC Q lcl|Aclame:pro 306 -------F--YQGQ---------------RFWYQRIT 318 (318) Q Consensus 306 -------f--~ag~---------------~v~~a~~~ 318 (318) | |+|+ ++++-.++ T Consensus 258 ~~~~g~~~~~~agd~t~~~~~~~~~~Al~t~~~~~~~ 294 (334) T protein:vir:80 258 ANALGADFNVTDAEVRRKMITFIPSMALISAQVHPVS 294 (334) T ss_pred ccccccccccccccccceEEEEEeCceEEEEEEeecc Confidence 1 1221 12222222 No 20 >protein:vir:96123 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240078;genbank:gi:66395742;genbank:GeneID:5133103 Probab=98.88 E-value=3.1e-10 Score=72.68 Aligned_cols=224 Identities=14% Similarity=0.056 Sum_probs=137.5 Q ss_pred CCcCCccchhHHHHHHHHHHhcccchHHHHhhhhhhhhhhhhcccccccCCCCCccEEEeeccCCCCCcEEEEEEeeccc Q lcl|Aclame:pro 1 MTTVTSAQANKLFQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLS 80 (318) Q Consensus 1 ~t~~~~~~~~~~~a~~lft~~~~~~~~v~~ws~~l~~~~~~~~~~~~~~G~~~~s~I~~~~dL~k~~Gd~v~f~L~~~L~ 80 (318) |.+.+...+ ..=.|- .|+..+.....+...+ . .+....++|.-++|++|+|+... +. T Consensus 1 ma~~~T~~~------------d~i~Pe--v~s~~v~~~~~~~~~~---~-----~~~~~~~~l~g~~G~tv~ip~~~-~~ 57 (274) T protein:vir:96 1 MAQGTTKVS------------NLIVPE--VLAPMMQAELDKKLRF---A-----QFADIDSTLVGQPGDTLTFPAFT-YS 57 (274) T ss_pred CCccccchh------------hhhhhH--HHHHHHHHHHHhhhhh---c-----ccccccccccCCCCCEEEEEeec-cC Confidence 332222111 111222 3665543322222222 1 11223457777899999999986 44 Q ss_pred cCC--eecCceeecchhhhhheeeEEEEecccccccccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhccccc Q lcl|Aclame:pro 81 KRP--TMGDERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGD 158 (318) Q Consensus 81 G~g--v~Gd~~leGnee~L~~~sd~v~Idq~R~~V~~~g~ms~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~LsG~rg~ 158 (318) |+. +..++.+ ..+.+...++++.|++.-.++..... +...+..|+...+...++.+|++..|..++..|.|+.. T Consensus 58 g~~~~~~~g~~i--~~~~it~~~~~~~i~~~~~~~~i~D~-~~~~~~~d~~~~~~~~~~~~~a~~~d~~i~~~l~~a~~- 133 (274) T protein:vir:96 58 GDAQVIAEGEKI--PVDQIGTSKREAKVRKIGKGTELTDE-AVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGATL- 133 (274) T ss_pred CCccccCCCCcC--chhhcccceeEEEEEeeeceeeecHH-HHHhhcchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCC- Confidence 432 2222222 37789999999999998888877654 55557889999999999999999999999877754321 Q ss_pred cccccceeecccccccccccccccCCCCCCceEeecCCccccccccccccCHHHHHHHHHHHHhcCCCCceeEecccccc Q lcl|Aclame:pro 159 FVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELH 238 (318) Q Consensus 159 ~~n~~~~~p~~~~~~~~~~~~N~v~aPt~~r~~~~~~at~~~~i~a~D~~s~~~Id~a~~~a~~~~~pi~Pv~v~g~~~~ 238 (318) -..++.++++.|..|...+..... T Consensus 134 -------------------------------------------~~~~~~~~~d~i~dA~~~l~d~~~------------- 157 (274) T protein:vir:96 134 -------------------------------------------TVEADITKLDGLQTAIDKFNDEDL------------- 157 (274) T ss_pred -------------------------------------------CcCcccccHHHHHHHHHHhcccCC------------- Confidence 011345677888777777654211 Q ss_pred CCcceEEEEecHHHHHHHHhCcchHHHHHHHHHHhhccccccCCcccCCeEEEcCEEEEecCceeEE---EcCCCeeeee Q lcl|Aclame:pro 239 GEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIR---FYQGQRFWYQ 315 (318) Q Consensus 239 ~~~~~yV~~l~P~q~~dLr~d~~~~~w~~~qk~A~~r~~g~~nPlF~G~~gm~ngvii~e~~~~~iR---f~ag~~v~~a 315 (318) +..+++|||.++..|+++... +|.. + ..+-++.+..|.+|.|.|+-|..-.++|-- +..-+.+.+. T Consensus 158 ---~~~~ivv~p~~~~~L~k~~~~-~f~~----~---~~~g~~~~~~g~ig~~~G~~Vi~s~~~p~~t~~l~~~gA~~~~ 226 (274) T protein:vir:96 158 ---EPMVLFVNPLDAGGLRTSASD-NFTR----P---TQLGDNIIVKGAFGEALGAVIVRSNKLNKGEALLAKKGAVKLI 226 (274) T ss_pred ---CceEEEeCHHHHHHHHhcccc-cccc----c---ccccccceeecccceecCeeEEEcCCCCcceEEEEeCcceeee Confidence 136899999999999999642 2321 1 122357788999999999999998777731 0111112221 Q ss_pred ee---C Q lcl|Aclame:pro 316 RI---T 318 (318) Q Consensus 316 ~~---~ 318 (318) .- + T Consensus 227 ~~~~~~ 232 (274) T protein:vir:96 227 TKRDFF 232 (274) T ss_pred ecCCcc Confidence 10 1 No 21 >protein:vir:96262 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240311;genbank:gi:66395978;genbank:GeneID:5133339 Probab=98.88 E-value=3.4e-10 Score=72.46 Aligned_cols=224 Identities=13% Similarity=0.043 Sum_probs=137.2 Q ss_pred CCcCCccchhHHHHHHHHHHhcccchHHHHhhhhhhhhhhhhcccccccCCCCCccE-EEeeccCCCCCcEEEEEEeecc Q lcl|Aclame:pro 1 MTTVTSAQANKLFQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPV-VRITDLNKQAGDEVTFSIMHKL 79 (318) Q Consensus 1 ~t~~~~~~~~~~~a~~lft~~~~~~~~v~~ws~~l~~~~~~~~~~~~~~G~~~~s~I-~~~~dL~k~~Gd~v~f~L~~~L 79 (318) |.+-... ....=.|- .|+..+.....+...| ++| ....+|+-++||+|+|+....+ T Consensus 1 m~~~~T~------------l~d~i~Pe--v~~~~v~~~~~~~l~~---------~~~~~~~~~l~g~~G~tv~iP~~~~i 57 (274) T protein:vir:96 1 MAQGMTK------------LTNQIVPE--VLAPMMQAELEKKLRF---------ASFAEIDNTLVGQPGDTLTFPAFIYS 57 (274) T ss_pred CCcceee------------hhheechH--HHHHHHHHHHHhhhhc---------cccceecccccCCCCCEEEeeeecCC Confidence 2221111 11111232 3666653333222222 222 2345788789999999998755 Q ss_pred ccCC-eecCceeecchhhhhheeeEEEEecccccccccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhccccc Q lcl|Aclame:pro 80 SKRP-TMGDERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGD 158 (318) Q Consensus 80 ~G~g-v~Gd~~leGnee~L~~~sd~v~Idq~R~~V~~~g~ms~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~LsG~rg~ 158 (318) +..- +..++.+ ..+.|+..++++.|++.-+++.... .+...+.-|+..++...++.+|++..|..++..|.++.. T Consensus 58 g~a~~~~~g~~i--~~~~lt~~~~~~~i~~~~~a~~i~D-~~~~~~~~d~~~~~~~~~~~~~a~~vd~~i~~~l~~a~~- 133 (274) T protein:vir:96 58 GDAKVVAEGEKI--PTDILETKKREAKIRKIAKGTSISD-EALLSGYGDPQGEQVRQHGLAHANKVDDDVLEALKSAKL- 133 (274) T ss_pred CccccccCCCcc--chhhcccceeEEEeeeeecceeehH-HHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHhcccc- Confidence 3221 2212222 2678999999999999888887764 355666779999999999999999999999877754321 Q ss_pred cccccceeecccccccccccccccCCCCCCceEeecCCccccccccccccCHHHHHHHHHHHHhcCCCCceeEecccccc Q lcl|Aclame:pro 159 FVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELH 238 (318) Q Consensus 159 ~~n~~~~~p~~~~~~~~~~~~N~v~aPt~~r~~~~~~at~~~~i~a~D~~s~~~Id~a~~~a~~~~~pi~Pv~v~g~~~~ 238 (318) . .+++.++.+.|..|......... T Consensus 134 ----------------------~---------------------~~~~~~~~d~i~~A~~~lgd~~~------------- 157 (274) T protein:vir:96 134 ----------------------T---------------------VEADITKLTGLQTAIDKFNDEDL------------- 157 (274) T ss_pred ----------------------c---------------------ccccccCHHHHHHHHHHhccccc------------- Confidence 0 01234667888888776643211 Q ss_pred CCcceEEEEecHHHHHHHHhCcchHHHHHHHHHHhhccccccCCcccCCeEEEcCEEEEecCceeEE--E-cCCCeeee- Q lcl|Aclame:pro 239 GEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIR--F-YQGQRFWY- 314 (318) Q Consensus 239 ~~~~~yV~~l~P~q~~dLr~d~~~~~w~~~qk~A~~r~~g~~nPlF~G~~gm~ngvii~e~~~~~iR--f-~ag~~v~~- 314 (318) .-++++|||.++..|++|+.. +|. + + ..+.++.+..|.+|.|.|+-|.+-..+|-= | ..-+.+.+ T Consensus 158 ---~~~~ivv~p~~~~~L~k~~~~-~f~---~-~---s~~g~~~~~~G~ig~~~G~~Vi~s~~~~~~t~~l~~~gA~~~~ 226 (274) T protein:vir:96 158 ---EPMVLFISPLDAGKLRGDATT-NFT---R-A---TELGDDVIVKGAFGEALGAVIVRSNKLEAGTAILAKKGAVKLI 226 (274) T ss_pred ---cccEEEeCHHHHHHHHhhccc-ccc---c-c---ccccccceeccccceecCeEEEEeCCCCCceEEEEeccceeee Confidence 126899999999999999742 232 1 1 233468899999999999999886665520 0 00111111 Q ss_pred ---------eeeC Q lcl|Aclame:pro 315 ---------QRIT 318 (318) Q Consensus 315 ---------a~~~ 318 (318) -|-- T Consensus 227 ~~~~~~vE~~Rd~ 239 (274) T protein:vir:96 227 TKRDFFLETDRDP 239 (274) T ss_pred ecCCccccccccc Confidence 1100 No 22 >protein:vir:95898 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240385;genbank:gi:66396054;genbank:GeneID:5133409 Probab=98.88 E-value=3.4e-10 Score=72.46 Aligned_cols=224 Identities=13% Similarity=0.043 Sum_probs=137.2 Q ss_pred CCcCCccchhHHHHHHHHHHhcccchHHHHhhhhhhhhhhhhcccccccCCCCCccE-EEeeccCCCCCcEEEEEEeecc Q lcl|Aclame:pro 1 MTTVTSAQANKLFQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPV-VRITDLNKQAGDEVTFSIMHKL 79 (318) Q Consensus 1 ~t~~~~~~~~~~~a~~lft~~~~~~~~v~~ws~~l~~~~~~~~~~~~~~G~~~~s~I-~~~~dL~k~~Gd~v~f~L~~~L 79 (318) |.+-... ....=.|- .|+..+.....+...| ++| ....+|+-++||+|+|+....+ T Consensus 1 m~~~~T~------------l~d~i~Pe--v~~~~v~~~~~~~l~~---------~~~~~~~~~l~g~~G~tv~iP~~~~i 57 (274) T protein:vir:95 1 MAQGMTK------------LTNQIVPE--VLAPMMQAELEKKLRF---------ASFAEIDNTLVGQPGDTLTFPAFIYS 57 (274) T ss_pred CCcceee------------hhheechH--HHHHHHHHHHHhhhhc---------cccceecccccCCCCCEEEeeeecCC Confidence 2221111 11111232 3666653333222222 222 2345788789999999998755 Q ss_pred ccCC-eecCceeecchhhhhheeeEEEEecccccccccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhccccc Q lcl|Aclame:pro 80 SKRP-TMGDERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGD 158 (318) Q Consensus 80 ~G~g-v~Gd~~leGnee~L~~~sd~v~Idq~R~~V~~~g~ms~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~LsG~rg~ 158 (318) +..- +..++.+ ..+.|+..++++.|++.-+++.... .+...+.-|+..++...++.+|++..|..++..|.++.. T Consensus 58 g~a~~~~~g~~i--~~~~lt~~~~~~~i~~~~~a~~i~D-~~~~~~~~d~~~~~~~~~~~~~a~~vd~~i~~~l~~a~~- 133 (274) T protein:vir:95 58 GDAKVVAEGEKI--PTDILETKKREAKIRKIAKGTSISD-EALLSGYGDPQGEQVRQHGLAHANKVDDDVLEALKSAKL- 133 (274) T ss_pred CccccccCCCcc--chhhcccceeEEEeeeeecceeehH-HHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHhcccc- Confidence 3221 2212222 2678999999999999888887764 355666779999999999999999999999877754321 Q ss_pred cccccceeecccccccccccccccCCCCCCceEeecCCccccccccccccCHHHHHHHHHHHHhcCCCCceeEecccccc Q lcl|Aclame:pro 159 FVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELH 238 (318) Q Consensus 159 ~~n~~~~~p~~~~~~~~~~~~N~v~aPt~~r~~~~~~at~~~~i~a~D~~s~~~Id~a~~~a~~~~~pi~Pv~v~g~~~~ 238 (318) . .+++.++.+.|..|......... T Consensus 134 ----------------------~---------------------~~~~~~~~d~i~~A~~~lgd~~~------------- 157 (274) T protein:vir:95 134 ----------------------T---------------------VEADITKLTGLQTAIDKFNDEDL------------- 157 (274) T ss_pred ----------------------c---------------------ccccccCHHHHHHHHHHhccccc------------- Confidence 0 01234667888888776643211 Q ss_pred CCcceEEEEecHHHHHHHHhCcchHHHHHHHHHHhhccccccCCcccCCeEEEcCEEEEecCceeEE--E-cCCCeeee- Q lcl|Aclame:pro 239 GEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIR--F-YQGQRFWY- 314 (318) Q Consensus 239 ~~~~~yV~~l~P~q~~dLr~d~~~~~w~~~qk~A~~r~~g~~nPlF~G~~gm~ngvii~e~~~~~iR--f-~ag~~v~~- 314 (318) .-++++|||.++..|++|+.. +|. + + ..+.++.+..|.+|.|.|+-|.+-..+|-= | ..-+.+.+ T Consensus 158 ---~~~~ivv~p~~~~~L~k~~~~-~f~---~-~---s~~g~~~~~~G~ig~~~G~~Vi~s~~~~~~t~~l~~~gA~~~~ 226 (274) T protein:vir:95 158 ---EPMVLFISPLDAGKLRGDATT-NFT---R-A---TELGDDVIVKGAFGEALGAVIVRSNKLEAGTAILAKKGAVKLI 226 (274) T ss_pred ---cccEEEeCHHHHHHHHhhccc-ccc---c-c---ccccccceeccccceecCeEEEEeCCCCCceEEEEeccceeee Confidence 126899999999999999742 232 1 1 233468899999999999999886665520 0 00111111 Q ss_pred ---------eeeC Q lcl|Aclame:pro 315 ---------QRIT 318 (318) Q Consensus 315 ---------a~~~ 318 (318) -|-- T Consensus 227 ~~~~~~vE~~Rd~ 239 (274) T protein:vir:95 227 TKRDFFLETDRDP 239 (274) T ss_pred ecCCccccccccc Confidence 1100 No 23 >protein:vir:78739 Length: 332 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:1856 # MgeName: Syn5 # Cross-refs: genbank:acc:YP_001285448;genbank:gi:148724482;genbank:GeneID:5220210 Probab=98.80 E-value=3.9e-10 Score=72.14 Aligned_cols=248 Identities=17% Similarity=0.154 Sum_probs=147.3 Q ss_pred CCcCCcc----chhHHHHHHHHHHhccc-chHHHHhhhhhhhhhhhhcccccccCCCCCccEEEeeccCCCCCcEEEEEE Q lcl|Aclame:pro 1 MTTVTSA----QANKLFQVALFTAANRN-RSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSI 75 (318) Q Consensus 1 ~t~~~~~----~~~~~~a~~lft~~~~~-~~~v~~ws~~l~~~~~~~~~~~~~~G~~~~s~I~~~~dL~k~~Gd~v~f~L 75 (318) ||+...= ++...+. - +....+ .-++++|++.+...-.+.+.+..+ ++..++. .|++|.|+- T Consensus 1 ~~~~~~~~~~~~~~~~~~--~-~~~d~~~al~le~~~geV~~~f~~~s~~~~~---------~~~r~i~--~G~tv~i~~ 66 (332) T protein:vir:78 1 MTTLSNFSLPNQANGGAR--N-ADYDVRYATALKLFSGEVFTAFNNASIFKGL---------VRSYDLR--GGKSKQFMF 66 (332) T ss_pred CcccccccCCccccCCcc--c-cccccchhhhhhhhhhhHHHHHHHHhhhhhc---------ccccccc--ccceEEEEe Confidence 5554321 1111110 0 011122 235899999987777766665322 2323342 599999999 Q ss_pred eeccccCCeecCceeecchhhhhheeeEEEEecc---cccccccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 76 MHKLSKRPTMGDERVEGRGEDLSHADFSLKINQG---RHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHL 152 (318) Q Consensus 76 ~~~L~G~gv~Gd~~leGnee~L~~~sd~v~Idq~---R~~V~~~g~ms~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~L 152 (318) +...+-...+.++.+.++ +++.-...+|.||+. ++.|+ .+++-.+.+|||.+.-.....-+++..|+.++.+| T Consensus 67 ig~~~~~~~~~g~~l~~~-~~~~~~~~~l~ID~~ky~~~~Vd---diD~~q~~~dl~~~~~~~~g~aLA~~~D~~i~~~l 142 (332) T protein:vir:78 67 TGKLSAGYHTPGTPIVGD-AGIKANEKTLVMDDLLVSSQFVY---SLDEIFSQYSTRAEVSKQIGEALATHYDERIARVL 142 (332) T ss_pred ccceeEeeecCCCCCCCC-CCCCCceEEEEEehhhhhHHHHH---hHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHH Confidence 988887777777778786 458888889999995 55554 57888899999999999999999999999999888 Q ss_pred hccccccccccceeecccccccccccccccCCCCCCceEeecCCcccccccccccc----CHHHHHHHHHHHHhcCCCCc Q lcl|Aclame:pro 153 AGARGDFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIF----SIGLVDNLSLFIDEMAHPLQ 228 (318) Q Consensus 153 sG~rg~~~n~~~~~p~~~~~~~~~~~~N~v~aPt~~r~~~~~~at~~~~i~a~D~~----s~~~Id~a~~~a~~~~~pi~ 228 (318) ..+-.... |....+++. . ..++++... -++.|.+|.+.+++..-| T Consensus 143 ~~aa~~~~------~~~~~~g~~-----~------------------~~~~~~~~~~~~~~~~~i~~a~~~Lde~~VP-- 191 (332) T protein:vir:78 143 AKASAEAS------PVTGEPGGF-----H------------------VNIGAGNTNDAQAIVDGFFEAAAVLDERSAP-- 191 (332) T ss_pred HhhhcccC------ccccccccc-----c------------------cccCCccccCHHHHHHHHHHHHHHHhhcCCC-- Confidence 64332110 000011110 0 111211122 245566666666665443 Q ss_pred eeEeccccccCCcceEEEEecHHHHHHHHh--CcchHHHHHHHHHHhhccccccCCcccCC-eEEEcCEEEEecCceeEE Q lcl|Aclame:pro 229 PVRLSGDELHGEDPYYVLYVTPRQWNDWYT--STSGKDWNQMMVRAVNRAKGFNHPLFKGE-CAMWRNILVRKYAGMPIR 305 (318) Q Consensus 229 Pv~v~g~~~~~~~~~yV~~l~P~q~~dLr~--d~~~~~w~~~qk~A~~r~~g~~nPlF~G~-~gm~ngvii~e~~~~~iR 305 (318) .++ +++++.|+++..|.+ |+.+ . ++ ..-+.+..+..|. +|.|+|+-|.+.+++|.- T Consensus 192 -----~~g-------R~~vv~P~~y~~Ll~~~d~~~------~-n~--~~~~~~~~~~~g~~i~~i~G~~V~~Sn~lp~~ 250 (332) T protein:vir:78 192 -----QEG-------RVAVLSPRQYYSLISSVDTNI------L-NR--EIGNSQGDMNSGKGLYSIAGIRILKSNNLAGL 250 (332) T ss_pred -----ccC-------CEEEeCHHHHHHHHhhcCcee------e-ee--eccccccceecceeeeEEeeeEEEecCccccC Confidence 111 677899999999987 5432 0 11 1113344577775 999999999998887731 Q ss_pred E---c-----CCC----------------------eeeee--e--eC Q lcl|Aclame:pro 306 F---Y-----QGQ----------------------RFWYQ--R--IT 318 (318) Q Consensus 306 f---~-----ag~----------------------~v~~a--~--~~ 318 (318) - + +|. .|+.- . +| T Consensus 251 ~g~~~~~~~~~~~~n~~~~~~~~~~~~~~h~~a~~~v~~~~~~~~~t 297 (332) T protein:vir:78 251 YGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTT 297 (332) T ss_pred cccccccccccccccccccccccceEEeecccceeeeeeeccchhhh Confidence 1 0 000 00000 0 01 No 24 >protein:vir:739 Length: 231 # NCBI annotation: major structural protein 4 # Family: family:all:522 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108716;genbank:gi:13487838;genbank:GeneID:920884 Probab=98.80 E-value=4.1e-10 Score=72.07 Aligned_cols=178 Identities=11% Similarity=0.043 Sum_probs=123.3 Q ss_pred ccCCCCCcEEEEEEeeccccCCeecCceeecc---hhhhhheeeEEEEecccccccccchhhhhhhhhhHHHHHHHHHHH Q lcl|Aclame:pro 62 DLNKQAGDEVTFSIMHKLSKRPTMGDERVEGR---GEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGT 138 (318) Q Consensus 62 dL~k~~Gd~v~f~L~~~L~G~gv~Gd~~leGn---ee~L~~~sd~v~Idq~R~~V~~~g~ms~qrs~~dlr~~ar~~L~~ 138 (318) |=.-+.||+|+|+- ..|+. .+..||. .|.|++.+++..|-+.-.+|.+... ++....-|+..++...|+. T Consensus 1 ~~~~~~Gdtit~P~---~iGda---~~v~eG~~i~~~~l~~t~~~atIk~~gk~~~itD~-a~l~~~gDp~~ea~~Q~~~ 73 (231) T protein:vir:73 1 ENGINLANLCEYPN---DIGDA---ADVAEGGEISLDKIGTTTKSVTIKKAAKGTEITDE-AALSGYGDPIGESNKQLGL 73 (231) T ss_pred CccccCCceEEecc---cccch---hhhcCCCcCChhhccccceeeeEeeeccceeeeHH-HHhhccCchHHHHHHHHHH Confidence 55568999999993 34443 2344555 6889999999999999999888743 5555667999999999999 Q ss_pred HHHHHHHHHHHHHHhccccccccccceeecccccccccccccccCCCCCCceEeecCCccccccccccccCHHHHHHHHH Q lcl|Aclame:pro 139 YFNDLQDQCAIVHLAGARGDFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSL 218 (318) Q Consensus 139 w~~~~~D~~~~~~LsG~rg~~~n~~~~~p~~~~~~~~~~~~N~v~aPt~~r~~~~~~at~~~~i~a~D~~s~~~Id~a~~ 218 (318) -+++..|..++..|.++. ++.+=.++++.|.+|.. T Consensus 74 ~iA~kvD~di~~~~~~a~---------------------------------------------l~~~~~~t~d~i~~A~~ 108 (231) T protein:vir:73 74 SLANKVDDDLLKAAKTTS---------------------------------------------QTVSTKANVDGVQAALD 108 (231) T ss_pred HHHHhhhHHHHHhhcccc---------------------------------------------ccccccccHHHHHHHHH Confidence 999999999887664322 11111357888888887 Q ss_pred HHHhcCCCCceeEeccccccCCcceEEEEecHHHHHHHHhCcchHHHHHHHHHHhhccccccCCcccCCeEEEcCEEEEe Q lcl|Aclame:pro 219 FIDEMAHPLQPVRLSGDELHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRK 298 (318) Q Consensus 219 ~a~~~~~pi~Pv~v~g~~~~~~~~~yV~~l~P~q~~dLr~d~~~~~w~~~qk~A~~r~~g~~nPlF~G~~gm~ngvii~e 298 (318) .... + +++-+|++|||.++.+||+++.+ .|. . ..+-++.+++|.+|++.||-|.. T Consensus 109 ~fgd-------------e---~~~~~vivv~p~~~~~Lrk~~~~-~~~--~------~~~g~~i~~~G~iG~i~G~~Vi~ 163 (231) T protein:vir:73 109 IFND-------------E---DAQAYVLIVNPKDAAKIRKDANA-KNI--G------SEVGANALINGTYADVLGAQIVR 163 (231) T ss_pred Hhcc-------------c---cccceEEEEcchHHHhhhhccch-hhh--h------hhhccceeeecccceEcceEEEE Confidence 7643 2 12237899999999999999864 121 1 22456789999999999999888 Q ss_pred cCceeEEEcCCCeeee-------------------eeeC Q lcl|Aclame:pro 299 YAGMPIRFYQGQRFWY-------------------QRIT 318 (318) Q Consensus 299 ~~~~~iRf~ag~~v~~-------------------a~~~ 318 (318) -+.+|- ..|--.++ -|-- T Consensus 164 S~~~~~--~~~~~~~~i~~~gAl~~~~k~~~~vEtdRd~ 200 (231) T protein:vir:73 164 SKKLAE--GSALMFKIVSNSPALKLVLKRGVQVETDRDI 200 (231) T ss_pred cCCCCC--CceeeeeEEeeccceeeeecccceeeccccc Confidence 766652 11111111 0000 No 25 >protein:vir:94494 Length: 274 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240676;genbank:gi:66396348;genbank:GeneID:5133758 Probab=98.79 E-value=1.4e-09 Score=69.14 Aligned_cols=224 Identities=14% Similarity=0.059 Sum_probs=137.9 Q ss_pred CCcCCccchhHHHHHHHHHHhcccchHHHHhhhhhhhhhhhhcccccccCCCCCccEEEeeccCCCCCcEEEEEEeeccc Q lcl|Aclame:pro 1 MTTVTSAQANKLFQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLS 80 (318) Q Consensus 1 ~t~~~~~~~~~~~a~~lft~~~~~~~~v~~ws~~l~~~~~~~~~~~~~~G~~~~s~I~~~~dL~k~~Gd~v~f~L~~~L~ 80 (318) |.+-... ....=.|- .|+..+.....+...|. .+..+..+|+-++|++|+|+....+ T Consensus 1 ma~~~T~------------~~d~iiPe--v~~~~v~~~~~~~l~~~--------~~~~~d~~l~g~~G~tv~iP~~~~~- 57 (274) T protein:vir:94 1 MPQGLTK------------TSDQIIPE--VLAPMMQAQLEKKLRFA--------SFAEVDSTLQGQPGDTLTFPAFVYS- 57 (274) T ss_pred CCcccee------------hhheechH--HHHHHHHHhhhhhhhhc--------ccceecccccCCCCCEEEEeeecCC- Confidence 2221111 11111232 36665533322222111 1223346677789999999998644 Q ss_pred cCC--eecCceeecchhhhhheeeEEEEecccccccccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhccccc Q lcl|Aclame:pro 81 KRP--TMGDERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGD 158 (318) Q Consensus 81 G~g--v~Gd~~leGnee~L~~~sd~v~Idq~R~~V~~~g~ms~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~LsG~rg~ 158 (318) |+. +..++.+ ..+.|...++++.|++.-+++..... +...+.-|+..++...++.+|++..|..++.+|.++.. T Consensus 58 g~a~~~~~g~~i--~~~~lt~~~~~~~i~~~~~~~~i~D~-~~~~~~~dp~~~~~~~~a~a~a~~vd~~~~~~l~~a~~- 133 (274) T protein:vir:94 58 GDAQVVAEGEKI--PTDILETKKREAKIRKIAKGTSITDE-ALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKL- 133 (274) T ss_pred CccccccCCCcc--cccccccceeEEEeeeecceecccHH-HHHhccchHHHHHHHHHHHHHHHHHHHHHHHHHhccCc- Confidence 432 2222222 26788999999999998878777654 45556679999999999999999999999888754321 Q ss_pred cccccceeecccccccccccccccCCCCCCceEeecCCccccccccccccCHHHHHHHHHHHHhcCCCCceeEecccccc Q lcl|Aclame:pro 159 FVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELH 238 (318) Q Consensus 159 ~~n~~~~~p~~~~~~~~~~~~N~v~aPt~~r~~~~~~at~~~~i~a~D~~s~~~Id~a~~~a~~~~~pi~Pv~v~g~~~~ 238 (318) .+ .++.++++.|.+|...+..... T Consensus 134 ----------------------~~---------------------~~~~~~~d~i~dA~~~l~d~~~------------- 157 (274) T protein:vir:94 134 ----------------------TV---------------------NADITKLNGLQSAIDKFNDEDL------------- 157 (274) T ss_pred ----------------------cc---------------------cccccCHHHHHHHHHHhhccCC------------- Confidence 00 1235678888888877654211 Q ss_pred CCcceEEEEecHHHHHHHHhCcchHHHHHHHHHHhhccccccCCcccCCeEEEcCEEEEecCceeEE---EcCCCeeeee Q lcl|Aclame:pro 239 GEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIR---FYQGQRFWYQ 315 (318) Q Consensus 239 ~~~~~yV~~l~P~q~~dLr~d~~~~~w~~~qk~A~~r~~g~~nPlF~G~~gm~ngvii~e~~~~~iR---f~ag~~v~~a 315 (318) ..++++|||.++..|++|+.. +|.+ + +.+-++.+..|.+|.|.|+-|..-..+|-= +..-+.+.+. T Consensus 158 ---~~~~ivv~p~~~~~L~k~~~~-~f~~----~---s~~g~~~~~~G~ig~~~G~~Vi~s~~~p~~t~~l~~~gA~~~~ 226 (274) T protein:vir:94 158 ---EPMVLFVNPLDAGKLRGDAST-NFTR----A---TELGDDIIVKGAFGEALGAIIVRTNKLEAGTAILAKKGAVKLI 226 (274) T ss_pred ---CceEEEeCHHHHHHHHhhhhh-hccc----c---CcccccceeccccceecCeeEEEcCCCCcceEEEEeCcceEee Confidence 137899999999999999743 2321 1 223357899999999999999998877731 1111111111 Q ss_pred --ee-C Q lcl|Aclame:pro 316 --RI-T 318 (318) Q Consensus 316 --~~-~ 318 (318) +. + T Consensus 227 ~~~~~~ 232 (274) T protein:vir:94 227 LKRDFF 232 (274) T ss_pred ecCCce Confidence 11 1 No 26 >protein:vir:97433 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240749;genbank:gi:66396420;genbank:GeneID:5133789 Probab=98.79 E-value=1.4e-09 Score=69.14 Aligned_cols=224 Identities=14% Similarity=0.059 Sum_probs=137.9 Q ss_pred CCcCCccchhHHHHHHHHHHhcccchHHHHhhhhhhhhhhhhcccccccCCCCCccEEEeeccCCCCCcEEEEEEeeccc Q lcl|Aclame:pro 1 MTTVTSAQANKLFQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLS 80 (318) Q Consensus 1 ~t~~~~~~~~~~~a~~lft~~~~~~~~v~~ws~~l~~~~~~~~~~~~~~G~~~~s~I~~~~dL~k~~Gd~v~f~L~~~L~ 80 (318) |.+-... ....=.|- .|+..+.....+...|. .+..+..+|+-++|++|+|+....+ T Consensus 1 ma~~~T~------------~~d~iiPe--v~~~~v~~~~~~~l~~~--------~~~~~d~~l~g~~G~tv~iP~~~~~- 57 (274) T protein:vir:97 1 MPQGLTK------------TSDQIIPE--VLAPMMQAQLEKKLRFA--------SFAEVDSTLQGQPGDTLTFPAFVYS- 57 (274) T ss_pred CCcccee------------hhheechH--HHHHHHHHhhhhhhhhc--------ccceecccccCCCCCEEEEeeecCC- Confidence 2221111 11111232 36665533322222111 1223346677789999999998644 Q ss_pred cCC--eecCceeecchhhhhheeeEEEEecccccccccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhccccc Q lcl|Aclame:pro 81 KRP--TMGDERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGD 158 (318) Q Consensus 81 G~g--v~Gd~~leGnee~L~~~sd~v~Idq~R~~V~~~g~ms~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~LsG~rg~ 158 (318) |+. +..++.+ ..+.|...++++.|++.-+++..... +...+.-|+..++...++.+|++..|..++.+|.++.. T Consensus 58 g~a~~~~~g~~i--~~~~lt~~~~~~~i~~~~~~~~i~D~-~~~~~~~dp~~~~~~~~a~a~a~~vd~~~~~~l~~a~~- 133 (274) T protein:vir:97 58 GDAQVVAEGEKI--PTDILETKKREAKIRKIAKGTSITDE-ALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKL- 133 (274) T ss_pred CccccccCCCcc--cccccccceeEEEeeeecceecccHH-HHHhccchHHHHHHHHHHHHHHHHHHHHHHHHHhccCc- Confidence 432 2222222 26788999999999998878777654 45556679999999999999999999999888754321 Q ss_pred cccccceeecccccccccccccccCCCCCCceEeecCCccccccccccccCHHHHHHHHHHHHhcCCCCceeEecccccc Q lcl|Aclame:pro 159 FVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELH 238 (318) Q Consensus 159 ~~n~~~~~p~~~~~~~~~~~~N~v~aPt~~r~~~~~~at~~~~i~a~D~~s~~~Id~a~~~a~~~~~pi~Pv~v~g~~~~ 238 (318) .+ .++.++++.|.+|...+..... T Consensus 134 ----------------------~~---------------------~~~~~~~d~i~dA~~~l~d~~~------------- 157 (274) T protein:vir:97 134 ----------------------TV---------------------NADITKLNGLQSAIDKFNDEDL------------- 157 (274) T ss_pred ----------------------cc---------------------cccccCHHHHHHHHHHhhccCC------------- Confidence 00 1235678888888877654211 Q ss_pred CCcceEEEEecHHHHHHHHhCcchHHHHHHHHHHhhccccccCCcccCCeEEEcCEEEEecCceeEE---EcCCCeeeee Q lcl|Aclame:pro 239 GEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIR---FYQGQRFWYQ 315 (318) Q Consensus 239 ~~~~~yV~~l~P~q~~dLr~d~~~~~w~~~qk~A~~r~~g~~nPlF~G~~gm~ngvii~e~~~~~iR---f~ag~~v~~a 315 (318) ..++++|||.++..|++|+.. +|.+ + +.+-++.+..|.+|.|.|+-|..-..+|-= +..-+.+.+. T Consensus 158 ---~~~~ivv~p~~~~~L~k~~~~-~f~~----~---s~~g~~~~~~G~ig~~~G~~Vi~s~~~p~~t~~l~~~gA~~~~ 226 (274) T protein:vir:97 158 ---EPMVLFVNPLDAGKLRGDAST-NFTR----A---TELGDDIIVKGAFGEALGAIIVRTNKLEAGTAILAKKGAVKLI 226 (274) T ss_pred ---CceEEEeCHHHHHHHHhhhhh-hccc----c---CcccccceeccccceecCeeEEEcCCCCcceEEEEeCcceEee Confidence 137899999999999999743 2321 1 223357899999999999999998877731 1111111111 Q ss_pred --ee-C Q lcl|Aclame:pro 316 --RI-T 318 (318) Q Consensus 316 --~~-~ 318 (318) +. + T Consensus 227 ~~~~~~ 232 (274) T protein:vir:97 227 LKRDFF 232 (274) T ss_pred ecCCce Confidence 11 1 No 27 >protein:vir:78935 Length: 335 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:1860 # MgeName: LKD16 # Cross-refs: genbank:acc:YP_001522824;genbank:gi:158345059;genbank:GeneID:5687425 Probab=98.79 E-value=1e-09 Score=69.93 Aligned_cols=248 Identities=12% Similarity=0.098 Sum_probs=143.7 Q ss_pred CCcCCcc------chhHHHHHHHHHHhcccchHHHHhhhhhhhhhhhhcccccccCCCCCccEEEeeccCCCCCcEEEEE Q lcl|Aclame:pro 1 MTTVTSA------QANKLFQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFS 74 (318) Q Consensus 1 ~t~~~~~------~~~~~~a~~lft~~~~~~~~v~~ws~~l~~~~~~~~~~~~~~G~~~~s~I~~~~dL~k~~Gd~v~f~ 74 (318) ||+.+.. .+..- .+| +++.|++.+...-...+.|..+ ..+ .++ ..|.++.|+ T Consensus 1 ms~~~~~t~~~~~~s~~d--~al---------~le~f~geV~~af~~~s~~~~~------~~~---rti--~~g~s~~~~ 58 (335) T protein:vir:78 1 MSFLNDLTRPNYAGKNAD--VDI---------HLEEHLGIVDKHFAYTSKFAPL------MNI---RDL--RGSNVVRLD 58 (335) T ss_pred CCccccccccccccccch--hhh---------hhhhhhhHHHHHHHHhhhhccc------cce---eee--ccceeEEEe Confidence 8887532 11111 123 3899999876666655554322 122 234 459999999 Q ss_pred EeeccccCCeecCceeecchhhhhheeeEEEEec---ccccccccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 75 IMHKLSKRPTMGDERVEGRGEDLSHADFSLKINQ---GRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVH 151 (318) Q Consensus 75 L~~~L~G~gv~Gd~~leGnee~L~~~sd~v~Idq---~R~~V~~~g~ms~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~ 151 (318) -+.+..-...+=.+.+.|+ ........|.||+ .||.|+ .+++-.+.+|+|++--..+..-+++..||.+|.. T Consensus 59 ~iG~~~~~~~~pG~~l~~~--~~~~~k~~itID~ll~a~~~Vd---dlDe~~~~yDvR~e~s~~~G~aLA~~~Dq~~~~~ 133 (335) T protein:vir:78 59 RLGNVEAKGRRAGEELERS--RVVNDKWNLTVDTLLYLRHQFD---HQDEWTQSFDMRKEVAELDGQELARKFDQACLIQ 133 (335) T ss_pred eeeeeeecccccCcccCCC--CcccCCeEEEecceeechhhHh---hHHHhhcCchhHHHHHHHHHHHHHHHHHHHHHHH Confidence 9888876666666677666 3566777899999 577775 4677777899999999999999999999999988 Q ss_pred HhccccccccccceeecccccccccccccccCCCCCCceEeecCCcccccccccc-ccCHHHHHHHHHHHHh----cCCC Q lcl|Aclame:pro 152 LAGARGDFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAAD-IFSIGLVDNLSLFIDE----MAHP 226 (318) Q Consensus 152 LsG~rg~~~n~~~~~p~~~~~~~~~~~~N~v~aPt~~r~~~~~~at~~~~i~a~D-~~s~~~Id~a~~~a~~----~~~p 226 (318) |.=+.. .......+ +. +..|.+....|+..+ .-....+..|++.|.. ..-| T Consensus 134 l~~aa~--~~a~~~~~----~~------------------~~~G~~~~~~~tg~~~~~~~~~l~~a~~~a~~~l~ekdvP 189 (335) T protein:vir:78 134 VIKAAA--MDAPVDLE----DA------------------FSPGVLEKLDLTGLTAKEAAEKIVRMHRRVVETFIERDLG 189 (335) T ss_pred HHhhcc--cccccccC----CC------------------cCCCcceeeeeccccccccHHHHHHHHHHHHHHHHhccCC Confidence 863322 11110000 00 011111111122111 0123344444444432 2222 Q ss_pred CceeEeccccccCCc-ceEEEEecHHHHHHHHhCcchHHHHHHHHHHhhccccccCCcccCCeEEEcCEEEEecCceeEE Q lcl|Aclame:pro 227 LQPVRLSGDELHGED-PYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIR 305 (318) Q Consensus 227 i~Pv~v~g~~~~~~~-~~yV~~l~P~q~~dLr~d~~~~~w~~~qk~A~~r~~g~~nPlF~G~~gm~ngvii~e~~~~~iR 305 (318) ... .-.|++|+|.||..|..++.+- .+. -...+..+++-.|.++..+||-|.+-+++|-- T Consensus 190 ------------~~~~~~rv~vv~P~~y~~Ll~~~~l~------n~~-~~~s~~~~~~~~g~v~~v~Gv~V~~Sn~lP~~ 250 (335) T protein:vir:78 190 ------------DAVYSEGLTPMSPRVFSLLLEHDKLM------SVE-YQATGATNDYVKSRVAILNGVKVLETPRFATK 250 (335) T ss_pred ------------CCCCCccEEEeChHHHHHHhcccccc------ccc-ccccccccccccceeEEeeceEEEeeccCCCC Confidence 110 1279999999999999997531 111 01112346777899999999999998888732 Q ss_pred EcC------CCeeeeeeeC Q lcl|Aclame:pro 306 FYQ------GQRFWYQRIT 318 (318) Q Consensus 306 f~a------g~~v~~a~~~ 318 (318) -.. ..|+-..-.+ T Consensus 251 ~~t~~~lg~a~n~~~~d~~ 269 (335) T protein:vir:78 251 AISAHPLGRHFNVSAEEAE 269 (335) T ss_pred CCccccccccCCccccccc Confidence 100 0011111111 No 28 >protein:vir:105334 Length: 276 # NCBI annotation: putative phage major capsid protein # Family: family:all:522 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950669;genbank:gi:119967839;genbank:GeneID:4643213 Probab=98.78 E-value=8e-10 Score=70.45 Aligned_cols=224 Identities=14% Similarity=0.068 Sum_probs=139.4 Q ss_pred CCcCCccchhHHHHHHHHHHhcccchHHHHhhhhhhhhhhhhcccccccCCCCCccEEEeeccCCCCCcEEEEEEeeccc Q lcl|Aclame:pro 1 MTTVTSAQANKLFQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLS 80 (318) Q Consensus 1 ~t~~~~~~~~~~~a~~lft~~~~~~~~v~~ws~~l~~~~~~~~~~~~~~G~~~~s~I~~~~dL~k~~Gd~v~f~L~~~L~ 80 (318) |.+....++. .=.|- .|+..+.....+...|..+ ....++|+-++|++|+|+....+ T Consensus 1 Ma~~~T~l~d------------~i~Pe--v~~~~v~~~~~~~~~~~~~--------~~~~~~l~g~~G~ti~iP~~~~i- 57 (276) T protein:vir:10 1 MAQGTTTKST------------QIVPE--VLAPMMQAELDKKLRFAQF--------ADIDSTLVGQPGDTLTFPAFVYS- 57 (276) T ss_pred CCcceeehhh------------hhchH--HHHHHHHHHHHhhhhhccc--------ceecccccCCCCCEEEeeeecCC- Confidence 3321111111 11222 3665554444433333111 12346788889999999999777 Q ss_pred cCC--eecCceeecchhhhhheeeEEEEecccccccccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhccccc Q lcl|Aclame:pro 81 KRP--TMGDERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGD 158 (318) Q Consensus 81 G~g--v~Gd~~leGnee~L~~~sd~v~Idq~R~~V~~~g~ms~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~LsG~rg~ 158 (318) |+. +..++.+ ..+.|+..++++.|.+.-.++.... ++...+.-|+..++-..++.+|++..|..++..|.+.... T Consensus 58 gda~~~~eg~~i--~~~~lt~~~~~a~i~~~~k~~~~tD-~a~~~~~~dp~~~~~~~~~~~~a~~~d~~~~~~l~~~~~~ 134 (276) T protein:vir:10 58 GDATVVPEGQKI--PVDKIETNRREAKIHKIGKGTDITD-EALLSGYGDPQGEAVRQHGLAIANKVDNDVLEALRGTKLT 134 (276) T ss_pred CccccccCCCcc--CccccccceeeEEeehccccccccH-HHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhccccc Confidence 442 2222222 3678999999999999888887764 3556667799999999999999999999998777543320 Q ss_pred cccccceeecccccccccccccccCCCCCCceEeecCCccccccccccccCHHHHHHHHHHHHhcCCCCceeEecccccc Q lcl|Aclame:pro 159 FVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELH 238 (318) Q Consensus 159 ~~n~~~~~p~~~~~~~~~~~~N~v~aPt~~r~~~~~~at~~~~i~a~D~~s~~~Id~a~~~a~~~~~pi~Pv~v~g~~~~ 238 (318) .+++.++.+.|.+|...+.... T Consensus 135 --------------------------------------------~~~~~~t~d~i~~A~~~lgd~~-------------- 156 (276) T protein:vir:10 135 --------------------------------------------VSADIGTLAGLEAAIDTFDDED-------------- 156 (276) T ss_pred --------------------------------------------ccccccCHHHHHHHHHHhcccc-------------- Confidence 0234567788888887664321 Q ss_pred CCcceEEEEecHHHHHHHHhCcchHHHHHHHHHHhhccccccCCcccCCeEEEcCEEEEecCceeEE----EcCC----- Q lcl|Aclame:pro 239 GEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIR----FYQG----- 309 (318) Q Consensus 239 ~~~~~yV~~l~P~q~~dLr~d~~~~~w~~~qk~A~~r~~g~~nPlF~G~~gm~ngvii~e~~~~~iR----f~ag----- 309 (318) ...++++|||.++..|+++... +|.+. ..+.++.+..|.+|.+.|+-|..-..+|-= |..| T Consensus 157 --~~~~~ivv~p~~~~~L~k~~~~-~f~~~-------s~~g~~~~~~G~ig~~~G~~Vi~s~~~p~~t~~l~~~gAi~~~ 226 (276) T protein:vir:10 157 --LEPMVLFINPKDAGKLRSSASD-NFTRA-------TELGDNIIVKGAFGEALGAVIVRSKKLDEGEAILAKRGAVKLI 226 (276) T ss_pred --CcccEEEEcHHHHHHHHHhccc-ccccc-------ccccccceeccccceecceeEEEcCCCCcceEEEEeccceeee Confidence 1237999999999999987432 34321 223467899999999999999887766621 1111 Q ss_pred ----CeeeeeeeC Q lcl|Aclame:pro 310 ----QRFWYQRIT 318 (318) Q Consensus 310 ----~~v~~a~~~ 318 (318) -+|.+-|-- T Consensus 227 ~~~~~~vE~dRd~ 239 (276) T protein:vir:10 227 TKRDFFLETDRDP 239 (276) T ss_pred ecCCceeecccch Confidence 011111111 No 29 >protein:vir:93742 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240459;genbank:gi:66396126;genbank:GeneID:5133511 Probab=98.78 E-value=1.1e-09 Score=69.70 Aligned_cols=225 Identities=13% Similarity=0.043 Sum_probs=137.2 Q ss_pred CCcCCccchhHHHHHHHHHHhcccchHHHHhhhhhhhhhhhhcccccccCCCCCccEEEeeccCCCCCcEEEEEEeeccc Q lcl|Aclame:pro 1 MTTVTSAQANKLFQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLS 80 (318) Q Consensus 1 ~t~~~~~~~~~~~a~~lft~~~~~~~~v~~ws~~l~~~~~~~~~~~~~~G~~~~s~I~~~~dL~k~~Gd~v~f~L~~~L~ 80 (318) |.+-....+ ..=-|- .|+..+.....+...|..+ ..+..+|.-++|++|+|+....+. T Consensus 1 ma~~~T~~~------------~~iiPe--v~~~~v~~~~~~~~~~~~~--------~~~~~~l~g~~G~tv~ip~~~~~g 58 (274) T protein:vir:93 1 MPQGITKTS------------NQIIPE--VLAPMMQAQLEKKLRFASF--------AEVDSTLQGQPGDTLTFPAFVYSG 58 (274) T ss_pred CCccceehh------------heechH--HHHHHHHHHHHhhhhhccc--------ccccccccCCCCCEEEEEeeccCC Confidence 222111110 011122 3666543332222222111 123456777889999999986553 Q ss_pred cC-CeecCceeecchhhhhheeeEEEEecccccccccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhcccccc Q lcl|Aclame:pro 81 KR-PTMGDERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGDF 159 (318) Q Consensus 81 G~-gv~Gd~~leGnee~L~~~sd~v~Idq~R~~V~~~g~ms~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~LsG~rg~~ 159 (318) .. .+..++.+ ..+.+...++++.|++.-.++..... +...+.-|+...+...+++.|++..|..++..|.++.. T Consensus 59 ~~~~~~eg~~i--~~~~it~~~~~~~i~~~~~~~~i~D~-~~~~~~~d~~~~~~~~~~~~~a~~~d~~~~~~~~~a~~-- 133 (274) T protein:vir:93 59 DAQVVAEGEKI--PTDILETKKREAKIRKIAKGTSITDE-ALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKL-- 133 (274) T ss_pred CcccccCCCcc--cccccccceeEEEeeeecccccccHH-HHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhcccc-- Confidence 22 12222222 37788999999999998888777654 44556679999999999999999999999887754321 Q ss_pred ccccceeecccccccccccccccCCCCCCceEeecCCccccccccccccCHHHHHHHHHHHHhcCCCCceeEeccccccC Q lcl|Aclame:pro 160 VADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHG 239 (318) Q Consensus 160 ~n~~~~~p~~~~~~~~~~~~N~v~aPt~~r~~~~~~at~~~~i~a~D~~s~~~Id~a~~~a~~~~~pi~Pv~v~g~~~~~ 239 (318) . ..++.++.+.|.+|...+.... T Consensus 134 ---------------------~---------------------~~~~~~~~d~i~dA~~~l~d~~--------------- 156 (274) T protein:vir:93 134 ---------------------T---------------------VNADITKLNGLQSAIDKFNDED--------------- 156 (274) T ss_pred ---------------------c---------------------ccccccCHHHHHHHHHHhhhcc--------------- Confidence 0 0134567788888877665421 Q ss_pred CcceEEEEecHHHHHHHHhCcchHHHHHHHHHHhhccccccCCcccCCeEEEcCEEEEecCceeEE---EcCCCeeeee- Q lcl|Aclame:pro 240 EDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIR---FYQGQRFWYQ- 315 (318) Q Consensus 240 ~~~~yV~~l~P~q~~dLr~d~~~~~w~~~qk~A~~r~~g~~nPlF~G~~gm~ngvii~e~~~~~iR---f~ag~~v~~a- 315 (318) ....+++|||..+..|++|+.. +|. +. +..-++.+..|.+|.|.|+-|.+-+.+|-= +..-+.+.+. T Consensus 157 -~~~~~ivv~p~~~~~L~k~~~~-~f~---~~----s~~g~~~~~~G~ig~~~G~~Vi~s~~~p~~t~~l~~~gai~~~~ 227 (274) T protein:vir:93 157 -LEPMVLFINPLDAGKLRGDAST-NFT---RA----TELGDDIIVKGAFGEALGAIIVRTNKLEAGTAILAKKGAVKLIL 227 (274) T ss_pred -CCccEEEeCHHHHHHHHhhhhh-ccc---cc----ccccccceeecccceecCeeEEEcCCCCcceEEEEeCCeEEEEe Confidence 0126899999999999999743 232 11 222357899999999999999997777631 1111222222 Q ss_pred -ee-C Q lcl|Aclame:pro 316 -RI-T 318 (318) Q Consensus 316 -~~-~ 318 (318) +. . T Consensus 228 ~~~~~ 232 (274) T protein:vir:93 228 KRDFF 232 (274) T ss_pred cCCcc Confidence 11 1 No 30 >protein:vir:1239 Length: 274 # NCBI annotation: similar to phage B1 major head protein # Family: family:all:522 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510938;genbank:gi:17426272;genbank:GeneID:927376 Probab=98.71 E-value=2.9e-09 Score=67.38 Aligned_cols=224 Identities=13% Similarity=0.054 Sum_probs=135.9 Q ss_pred CCcCCccchhHHHHHHHHHHhcccchHHHHhhhhhhhhhhhhcccccccCCCCCccEEEeeccCCCCCcEEEEEEeeccc Q lcl|Aclame:pro 1 MTTVTSAQANKLFQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLS 80 (318) Q Consensus 1 ~t~~~~~~~~~~~a~~lft~~~~~~~~v~~ws~~l~~~~~~~~~~~~~~G~~~~s~I~~~~dL~k~~Gd~v~f~L~~~L~ 80 (318) |.+.....+ ..=.| ..|+.-+.....++..|. .+..+..+|+-.+|++|+|+....+ T Consensus 1 ma~~~T~l~------------d~iiP--ev~~~~v~~~~~~~l~~~--------~~~~~d~~l~g~~G~tv~iP~~~~i- 57 (274) T protein:vir:12 1 MAQGLTKTS------------NQIIP--EVLAPMMQAQLEKKLRFA--------SFAEVDSTLQGQPGDTLTFPAFVYS- 57 (274) T ss_pred CCcceeehh------------hhhch--HHHHHHHHHHHHhhhhhc--------ccceecccccCCCCCEEEEeeecCC- Confidence 322211111 11122 236665432222221221 1223346777789999999998654 Q ss_pred cCC--eecCceeecchhhhhheeeEEEEecccccccccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhccccc Q lcl|Aclame:pro 81 KRP--TMGDERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGD 158 (318) Q Consensus 81 G~g--v~Gd~~leGnee~L~~~sd~v~Idq~R~~V~~~g~ms~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~LsG~rg~ 158 (318) |+. +..++.+ ..+.|+..++++.|++.-.++..... +..-+.-|+..++...++.+|++..|..++..+.++.. T Consensus 58 g~a~~~~~g~~i--~~~~lt~~~~~~~i~~~~~~~~i~D~-~~~~~~~d~~~~~~~q~~~~~a~~vd~~~l~~~~~a~~- 133 (274) T protein:vir:12 58 GDAQVVAEGEKI--PTDILETKKREAKIRKIAKGTSITDE-ALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKL- 133 (274) T ss_pred CccccccCCCcc--chhhcccceeeEEeeeecceeeecHH-HHHhcccchHHHHHHHHHHHHHHHHHHHHHHHHhcccc- Confidence 332 2222222 26789999999999998888777643 44555679999999999999999999999877754321 Q ss_pred cccccceeecccccccccccccccCCCCCCceEeecCCccccccccccccCHHHHHHHHHHHHhcCCCCceeEecccccc Q lcl|Aclame:pro 159 FVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELH 238 (318) Q Consensus 159 ~~n~~~~~p~~~~~~~~~~~~N~v~aPt~~r~~~~~~at~~~~i~a~D~~s~~~Id~a~~~a~~~~~pi~Pv~v~g~~~~ 238 (318) . .+++.++.+.|..|..++.... + T Consensus 134 ----------------------~---------------------~~~~a~~~d~i~dA~~~lgd~~----------~--- 157 (274) T protein:vir:12 134 ----------------------T---------------------VNADITKLNGLQSAIDKFNDED----------L--- 157 (274) T ss_pred ----------------------c---------------------ccccccCHHHHHHHHHHhcccc----------c--- Confidence 0 0123567888888876664321 0 Q ss_pred CCcceEEEEecHHHHHHHHhCcchHHHHHHHHHHhhccccccCCcccCCeEEEcCEEEEecCceeEE---EcCCCeeeee Q lcl|Aclame:pro 239 GEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIR---FYQGQRFWYQ 315 (318) Q Consensus 239 ~~~~~yV~~l~P~q~~dLr~d~~~~~w~~~qk~A~~r~~g~~nPlF~G~~gm~ngvii~e~~~~~iR---f~ag~~v~~a 315 (318) ..++++|||.++..|++|+.. +|.. + ..+..+.+..|.+|.|.|+-|.+-..+|-= +..-+.+.+. T Consensus 158 ---~~~~ivv~p~~~~~L~k~~~~-~fv~----~---s~~g~~~~~~G~ig~~~G~~Vi~s~~~p~~t~~l~~~gA~~~~ 226 (274) T protein:vir:12 158 ---EPMVLFINPLDAGKLRGDAST-NFTR----A---TELGDDIIVKGAFGEALGAIIVRSNKLEAGTAILAKKGAVKLI 226 (274) T ss_pred ---cccEEEeCHHHHHHHHhhhhh-hccc----c---ccccccceecccceeecCeeEEEeCCCCcceEEEEeccceeee Confidence 126899999999999999742 3421 1 234457789999999999999987767631 0111111111 Q ss_pred --ee-C Q lcl|Aclame:pro 316 --RI-T 318 (318) Q Consensus 316 --~~-~ 318 (318) +. + T Consensus 227 ~~~~~~ 232 (274) T protein:vir:12 227 LKRDFF 232 (274) T ss_pred ecCCce Confidence 11 1 No 31 >protein:vir:105822 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:1636 # MgeName: PMC # Cross-refs: genbank:acc:YP_655767;genbank:gi:109522090;genbank:GeneID:4157630 Probab=98.68 E-value=1.3e-08 Score=63.81 Aligned_cols=223 Identities=13% Similarity=0.053 Sum_probs=129.9 Q ss_pred HHHHHHHHhcccchHHHHhhhhhhhhhhhhcccccccCCCCCccEEEeeccCCCCCcEEEEEEeeccccCCeecCceee- Q lcl|Aclame:pro 13 FQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLSKRPTMGDERVE- 91 (318) Q Consensus 13 ~a~~lft~~~~~~~~v~~ws~~l~~~~~~~~~~~~~~G~~~~s~I~~~~dL~k~~Gd~v~f~L~~~L~G~gv~Gd~~le- 91 (318) ||..+| .-++|++.+...-.+..-+..+... -.+.+-..||+|+|.....++ ..|...+ T Consensus 1 MA~~~~--------~pe~~~~~v~~~~~~~lv~~~l~~~--------~~~~~~~~Gdtv~ip~~~~~~----~~d~~~~~ 60 (273) T protein:vir:10 1 MAFNNF--------IPELWSDMLLEEWTAQTVFANLVNR--------EYEGTASKGNVVHIAGVVAPT----VKDYKAAG 60 (273) T ss_pred Ccchhh--------hHHHHHHHHHHHHHhhhccchhhcc--------ccccccccCceEEEeeccccc----ccccccCC Confidence 333222 1245777654433333333222221 122333569999999876554 2222222 Q ss_pred --cchhhhhheeeEEEEecccc-cccccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccceeec Q lcl|Aclame:pro 92 --GRGEDLSHADFSLKINQGRH-LVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGDFVADDTILPT 168 (318) Q Consensus 92 --Gnee~L~~~sd~v~Idq~R~-~V~~~g~ms~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~LsG~rg~~~n~~~~~p~ 168 (318) ...+++...+.++.||+.++ ++.+. .+++.-+..|+++..+. +..-+++..|+.++-.++++-.. T Consensus 61 ~~~~~~~~~~~~~~~tid~~~~~~~~i~-d~d~~~~~~~~~~~~~~-~~~alA~~vD~~i~~~~~~a~~~---------- 128 (273) T protein:vir:10 61 RQTSADAISDTGVDLLIDQEKSIDFLVD-DIDRVQVAGSLEAYTRA-GATALATDTDKFIADMLVDNGTA---------- 128 (273) T ss_pred CccCccccccceEEEEEeeeeecceEee-cHHHhhhhccHHHHHHH-HHHHHHHHHHHHHHHHHhccccc---------- Confidence 34678899999999999754 45554 44556667888876554 56678889999988777653210 Q ss_pred ccccccccccccccCCCCCCceEeecCCccccccccccccCHHHHHHHHHHHHhcCCCCceeEeccccccCCcceEEEEe Q lcl|Aclame:pro 169 AEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHGEDPYYVLYV 248 (318) Q Consensus 169 ~~~~~~~~~~~N~v~aPt~~r~~~~~~at~~~~i~a~D~~s~~~Id~a~~~a~~~~~pi~Pv~v~g~~~~~~~~~yV~~l 248 (318) |.. ...++... .++.|..|...++...-| .+| +++++ T Consensus 129 -----------~~~----------------~~~~~~~~--~~~~i~~a~~~ld~~~vP-----~~~---------R~lvv 165 (273) T protein:vir:10 129 -----------LTG----------------SAPTDADD--AFDLIAKALKELTKANVP-----NVG---------RVVVV 165 (273) T ss_pred -----------ccc----------------ccccchhH--HHHHHHHHHHHhhhcCCC-----cCC---------CEEEE Confidence 000 11122222 256787888888775555 111 57899 Q ss_pred cHHHHHHHHhCcchHHHHHHHHHHhhccccccCCcccCCeEEEcCEEEEecCceeE----EE---cCCCeeeeeeeC Q lcl|Aclame:pro 249 TPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPI----RF---YQGQRFWYQRIT 318 (318) Q Consensus 249 ~P~q~~dLr~d~~~~~w~~~qk~A~~r~~g~~nPlF~G~~gm~ngvii~e~~~~~i----Rf---~ag~~v~~a~~~ 318 (318) +|.++..|++++.+ +.+ +.. .|..+.|=.|.+|.+.|+-|.+..++|. .+ ..+...-+-+|. T Consensus 166 ~p~~~~~L~~~~~~--~~~----~~~--~~~~~~l~~G~ig~i~G~~v~~s~~lp~~~~~~~~~~~~~A~~~a~q~~ 234 (273) T protein:vir:10 166 NAEMAFWLRSSGSK--LTS----ADT--SGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQFVAFHPSAAAYVSQID 234 (273) T ss_pred CHHHHHHHhcchhh--hhh----hhc--cccccceeeeeeeEEeceEEEEecccccCCccEEEEEeccceeeeeeee Confidence 99999999998743 222 211 1445667789999999999999887763 11 111111222222 No 32 >protein:vir:102605 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:1661 # MgeName: Llij # Cross-refs: genbank:acc:YP_655002;genbank:gi:109392192;genbank:GeneID:4157227 Probab=98.68 E-value=1.3e-08 Score=63.81 Aligned_cols=223 Identities=13% Similarity=0.053 Sum_probs=129.9 Q ss_pred HHHHHHHHhcccchHHHHhhhhhhhhhhhhcccccccCCCCCccEEEeeccCCCCCcEEEEEEeeccccCCeecCceee- Q lcl|Aclame:pro 13 FQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLSKRPTMGDERVE- 91 (318) Q Consensus 13 ~a~~lft~~~~~~~~v~~ws~~l~~~~~~~~~~~~~~G~~~~s~I~~~~dL~k~~Gd~v~f~L~~~L~G~gv~Gd~~le- 91 (318) ||..+| .-++|++.+...-.+..-+..+... -.+.+-..||+|+|.....++ ..|...+ T Consensus 1 MA~~~~--------~pe~~~~~v~~~~~~~lv~~~l~~~--------~~~~~~~~Gdtv~ip~~~~~~----~~d~~~~~ 60 (273) T protein:vir:10 1 MAFNNF--------IPELWSDMLLEEWTAQTVFANLVNR--------EYEGTASKGNVVHIAGVVAPT----VKDYKAAG 60 (273) T ss_pred Ccchhh--------hHHHHHHHHHHHHHhhhccchhhcc--------ccccccccCceEEEeeccccc----ccccccCC Confidence 333222 1245777654433333333222221 122333569999999876554 2222222 Q ss_pred --cchhhhhheeeEEEEecccc-cccccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccceeec Q lcl|Aclame:pro 92 --GRGEDLSHADFSLKINQGRH-LVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGDFVADDTILPT 168 (318) Q Consensus 92 --Gnee~L~~~sd~v~Idq~R~-~V~~~g~ms~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~LsG~rg~~~n~~~~~p~ 168 (318) ...+++...+.++.||+.++ ++.+. .+++.-+..|+++..+. +..-+++..|+.++-.++++-.. T Consensus 61 ~~~~~~~~~~~~~~~tid~~~~~~~~i~-d~d~~~~~~~~~~~~~~-~~~alA~~vD~~i~~~~~~a~~~---------- 128 (273) T protein:vir:10 61 RQTSADAISDTGVDLLIDQEKSIDFLVD-DIDRVQVAGSLEAYTRA-GATALATDTDKFIADMLVDNGTA---------- 128 (273) T ss_pred CccCccccccceEEEEEeeeeecceEee-cHHHhhhhccHHHHHHH-HHHHHHHHHHHHHHHHHhccccc---------- Confidence 34678899999999999754 45554 44556667888876554 56678889999988777653210 Q ss_pred ccccccccccccccCCCCCCceEeecCCccccccccccccCHHHHHHHHHHHHhcCCCCceeEeccccccCCcceEEEEe Q lcl|Aclame:pro 169 AEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHGEDPYYVLYV 248 (318) Q Consensus 169 ~~~~~~~~~~~N~v~aPt~~r~~~~~~at~~~~i~a~D~~s~~~Id~a~~~a~~~~~pi~Pv~v~g~~~~~~~~~yV~~l 248 (318) |.. ...++... .++.|..|...++...-| .+| +++++ T Consensus 129 -----------~~~----------------~~~~~~~~--~~~~i~~a~~~ld~~~vP-----~~~---------R~lvv 165 (273) T protein:vir:10 129 -----------LTG----------------SAPTDADD--AFDLIAKALKELTKANVP-----NVG---------RVVVV 165 (273) T ss_pred -----------ccc----------------ccccchhH--HHHHHHHHHHHhhhcCCC-----cCC---------CEEEE Confidence 000 11122222 256787888888775555 111 57899 Q ss_pred cHHHHHHHHhCcchHHHHHHHHHHhhccccccCCcccCCeEEEcCEEEEecCceeE----EE---cCCCeeeeeeeC Q lcl|Aclame:pro 249 TPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPI----RF---YQGQRFWYQRIT 318 (318) Q Consensus 249 ~P~q~~dLr~d~~~~~w~~~qk~A~~r~~g~~nPlF~G~~gm~ngvii~e~~~~~i----Rf---~ag~~v~~a~~~ 318 (318) +|.++..|++++.+ +.+ +.. .|..+.|=.|.+|.+.|+-|.+..++|. .+ ..+...-+-+|. T Consensus 166 ~p~~~~~L~~~~~~--~~~----~~~--~~~~~~l~~G~ig~i~G~~v~~s~~lp~~~~~~~~~~~~~A~~~a~q~~ 234 (273) T protein:vir:10 166 NAEMAFWLRSSGSK--LTS----ADT--SGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQFVAFHPSAAAYVSQID 234 (273) T ss_pred CHHHHHHHhcchhh--hhh----hhc--cccccceeeeeeeEEeceEEEEecccccCCccEEEEEeccceeeeeeee Confidence 99999999998743 222 211 1445667789999999999999887763 11 111111222222 No 33 >protein:vir:99675 Length: 324 # NCBI annotation: Major capsid protein # Family: family:all:975 # MgeID: mge:1523 # MgeName: VP4 # Cross-refs: genbank:acc:YP_249589;genbank:gi:68299740;genbank:GeneID:3799990 Probab=98.68 E-value=2.6e-09 Score=67.69 Aligned_cols=213 Identities=11% Similarity=0.042 Sum_probs=129.8 Q ss_pred cEEEeeccCCCCCcEEEEEEeeccccCCeecCceeecchhhhhheeeEEEEecccccccccchhhhhhhhhhHHHHHHHH Q lcl|Aclame:pro 56 PVVRITDLNKQAGDEVTFSIMHKLSKRPTMGDERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTL 135 (318) Q Consensus 56 ~I~~~~dL~k~~Gd~v~f~L~~~L~G~gv~Gd~~leGnee~L~~~sd~v~Idq~R~~V~~~g~ms~qrs~~dlr~~ar~~ 135 (318) .|+. |+ .|.++.|+-+...+-...+=.+.+.|+-+++.-....|.||+..-.=-.=..+++....+|||.+.-.. T Consensus 1 ~vr~---i~--~g~s~~~~~iG~~~~~~~~~G~~l~~~~~~~~~~e~~itID~~l~~~~~VdDiD~~qa~~Dlr~e~s~~ 75 (324) T protein:vir:99 1 MTRT---IT--SGKSAQFPVMGRTKARYLKQGQSLDDGREDIKHTEKVITIDGLLTTDVLIYDIEDAMNHYDVRSEYSTQ 75 (324) T ss_pred Ceee---ee--cCceEEEeeeeeeEeccccCCCCcCCCcCCcCcccEEEEecchhhhhhhhhhHHHHhcCccchhHHHHH Confidence 3333 42 399999999988877777777788888788888888899999843321123567778889999999999 Q ss_pred HHHHHHHHHHHHHHHHHhccccccccccceeecccccccccccccccCCCCCCceEeecCCccccccccccccC----HH Q lcl|Aclame:pro 136 LGTYFNDLQDQCAIVHLAGARGDFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFS----IG 211 (318) Q Consensus 136 L~~w~~~~~D~~~~~~LsG~rg~~~n~~~~~p~~~~~~~~~~~~N~v~aPt~~r~~~~~~at~~~~i~a~D~~s----~~ 211 (318) ...-+++..|+.+|.++++...... |. ..+. ..-..... ++..++ .+.++.-+ ++ T Consensus 76 ~G~aLA~~~Dq~i~~~~a~~~~~~a------~~--~~~~------~~~~g~~~-~~~~~~------~~~~~~~~~~~~~d 134 (324) T protein:vir:99 76 MGEALAMAADVANYAEMAKLVNSRK------ET--TNEN------IEGLGAAS-LVKITG------KKEDPAKYGTQVIQ 134 (324) T ss_pred HHHHHHHHHHHHHHHHHHHhhhccc------cc--ccCC------cccCCccc-eecccc------cccccccCHHHHHH Confidence 9999999999999999875442111 11 0001 00000000 000001 11111222 45 Q ss_pred HHHHHHHHHHhcCCCCceeEeccccccCCcceEEEEecHHHHHHHHhCcchHHHHHHHHHHhhccccccCCcccCCeEEE Q lcl|Aclame:pro 212 LVDNLSLFIDEMAHPLQPVRLSGDELHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMW 291 (318) Q Consensus 212 ~Id~a~~~a~~~~~pi~Pv~v~g~~~~~~~~~yV~~l~P~q~~dLr~d~~~~~w~~~qk~A~~r~~g~~nPlF~G~~gm~ 291 (318) .|..|.+.+++..-|- ++ +++++.|+|+..|+.++... +. ..+..+.+=.|.+|.+ T Consensus 135 ai~~a~~~Lde~~VP~-------~g-------R~~vv~P~~y~~Ll~~~~~~-------~~---~~~~~~~~~~G~V~~i 190 (324) T protein:vir:99 135 ALTYARAAFAKKYIPA-------GD-------RTFYTDPDTYSAILAALMPN-------AA---NYAALIDPETGNIRNV 190 (324) T ss_pred HHHHHHHHHhhcCCCC-------CC-------CEEEeChHHHHHHhhccccc-------cc---ccccccceecceEEEE Confidence 5556666776654441 11 57999999999998775321 11 1133456778999999 Q ss_pred cCEEEEecCceeEE---E----cCCCeeeeeee-------------------C Q lcl|Aclame:pro 292 RNILVRKYAGMPIR---F----YQGQRFWYQRI-------------------T 318 (318) Q Consensus 292 ngvii~e~~~~~iR---f----~ag~~v~~a~~-------------------~ 318 (318) +|+-|.+-+++|.= . +++.....++- . T Consensus 191 ~Gf~V~~Sn~lp~~~~t~~~~a~~~~~~~~~~~~~~~~~~ky~~d~~~~~gl~ 243 (324) T protein:vir:99 191 MGFEVVETPHMTAQMVTNPTDAFDGTGHIFPATGDSTTTGKMTVGADNVVGLF 243 (324) T ss_pred eceEEEecCCccccccccccccccccccccccccccccccccccccCceeEEE Confidence 99999998887741 1 01111111000 0 No 34 >protein:vir:6324 Length: 335 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:132 # MgeName: phiKMV # Cross-refs: genbank:acc:NP_877471;genbank:gi:33300843;uniprot:Q7Y2D3;genbank:GeneID:1482613 Probab=98.65 E-value=4.3e-09 Score=66.42 Aligned_cols=255 Identities=10% Similarity=0.072 Sum_probs=142.7 Q ss_pred CCcCCccchhHHHHHHHHHHhcccchHHHHhhhhhhhhhhhhcccccccCCCCCccEEEeeccCCCCCcEEEEEEeeccc Q lcl|Aclame:pro 1 MTTVTSAQANKLFQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLS 80 (318) Q Consensus 1 ~t~~~~~~~~~~~a~~lft~~~~~~~~v~~ws~~l~~~~~~~~~~~~~~G~~~~s~I~~~~dL~k~~Gd~v~f~L~~~L~ 80 (318) ||+.+.. -.-++.+ +....+=+++.|++.+...-...+.|..+ ..++ .+ ..|+++.|+-+.+.+ T Consensus 1 ms~~~~~-tr~~~~~----s~~d~al~le~f~geV~~af~~~s~~~~~------~~~r---ti--~~g~s~~~~~iG~~~ 64 (335) T protein:vir:63 1 MSFLNDL-TRPNYAG----KNADVDIHLEEHLGIVDKHFAYTSKFAPL------MNIR---DL--RGSNVVRLDRLGNVE 64 (335) T ss_pred CCCcccc-hhhhccc----ccchhheehhhhhhhHHHHHHhhhhhccc------ccee---ee--ccceeEEEeeeeeee Confidence 8887532 0111110 00111123788999876555555544322 2232 34 459999999998888 Q ss_pred cCCeecCceeecchhhhhheeeEEEEec---ccccccccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhcccc Q lcl|Aclame:pro 81 KRPTMGDERVEGRGEDLSHADFSLKINQ---GRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARG 157 (318) Q Consensus 81 G~gv~Gd~~leGnee~L~~~sd~v~Idq---~R~~V~~~g~ms~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~LsG~rg 157 (318) -...+=++.+.|+- -......|.||. .||.|+ .+++-.+.+|+|++--..+..-+++..||.+|.+|.=+.. T Consensus 65 ~~~~~pG~~l~~~~--~~~~k~~itVD~ll~a~~~I~---dlDe~~~~yDvRse~s~e~G~aLA~~~D~~~~~~i~~aa~ 139 (335) T protein:vir:63 65 AKGRRAGEELERSR--VVNDKWNLTVDTLLYLRHQFD---HQDEWTQSFDMRKEVAELDGQELARKFDQACLIQVIKAAA 139 (335) T ss_pred eecccCCcCcCCCC--ccccceEEEecceeechhhhh---hHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHhhcc Confidence 77666677777774 344567899999 577765 4677777899999999999999999999999988863322 Q ss_pred ccccccceeecccccccccccccccCCCCCCceEeecCCccccccccccc-cCHHHHHHHHHHH----HhcCCCCceeEe Q lcl|Aclame:pro 158 DFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADI-FSIGLVDNLSLFI----DEMAHPLQPVRL 232 (318) Q Consensus 158 ~~~n~~~~~p~~~~~~~~~~~~N~v~aPt~~r~~~~~~at~~~~i~a~D~-~s~~~Id~a~~~a----~~~~~pi~Pv~v 232 (318) .. -|..-.+.| ++|.+....++..+. =..+.|..|++.| .+..-|-. T Consensus 140 ~~------a~~~~~~~~------------------~~G~~~~~~~tg~~~~~~~~~l~~a~~~a~~~L~e~dVP~~---- 191 (335) T protein:vir:63 140 MD------APVDLEDAF------------------SPGVLEKLDLTGLTAKQAADKIVRMHRRVVETFIDRDLGDA---- 191 (335) T ss_pred cc------CccccCCCc------------------CCCcceeeeeccCcccccHHHHHHHHHHHHHHHHhccCCCc---- Confidence 10 000000011 111111111111110 1233444444433 33222210 Q ss_pred ccccccCCcceEEEEecHHHHHHHHhCcchHHHHHHHHHHhhccccccCCcccCCeEEEcCEEEEecCceeEEEc----- Q lcl|Aclame:pro 233 SGDELHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFY----- 307 (318) Q Consensus 233 ~g~~~~~~~~~yV~~l~P~q~~dLr~d~~~~~w~~~qk~A~~r~~g~~nPlF~G~~gm~ngvii~e~~~~~iRf~----- 307 (318) | -++ ++++|+|.||..|..++.+ . . ..-...+..++.-.|.++..+||-|.|-+++|---. T Consensus 192 -~-----~~d-r~~vv~P~~y~~Ll~~~~l---~---n-~~~~~s~~~~~~~~g~v~~v~Gv~V~~sn~lP~~~~t~~~l 257 (335) T protein:vir:63 192 -V-----YSE-GLTPMSPRVFSLLLEHDKL---M---N-VEYQATGATNDYVKSRVAILNGVKVLETPRFATKAIAAHPL 257 (335) T ss_pred -c-----cCc-eEEEeChHHHHHHhccccc---c---c-cccccccccccccCceeEEeeceEEEeeccCCCCCcccccc Confidence 1 011 7899999999999999753 1 1 100111234677789999999999999888763111 Q ss_pred -------CCCe---------------eeeeeeC Q lcl|Aclame:pro 308 -------QGQR---------------FWYQRIT 318 (318) Q Consensus 308 -------ag~~---------------v~~a~~~ 318 (318) +|+. ++.-.++ T Consensus 258 g~a~n~~~~d~~~~~~~~~~~~Al~t~~~~~vt 290 (335) T protein:vir:63 258 GRHFNVSAEESERQIALFLPSKTLITAQVAPVQ 290 (335) T ss_pred cccCCccccccceeEEEEEecceEEEEEEeecc Confidence 1111 1111111 No 35 >protein:vir:100057 Length: 375 # NCBI annotation: T7-like capsid protein # Family: family:all:975 # MgeID: mge:1604 # MgeName: P-SSP7 # Cross-refs: genbank:acc:YP_214206;genbank:gi:61806429;genbank:GeneID:3294737 Probab=98.61 E-value=5.5e-09 Score=65.85 Aligned_cols=261 Identities=11% Similarity=0.121 Sum_probs=147.7 Q ss_pred CCcCCccchhHHH-----HHHHHHHhcccchHHHHhhhhhhhhhhhhcccccccCCCCCccEEEeeccCCCCCcEEEEEE Q lcl|Aclame:pro 1 MTTVTSAQANKLF-----QVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSI 75 (318) Q Consensus 1 ~t~~~~~~~~~~~-----a~~lft~~~~~~~~v~~ws~~l~~~~~~~~~~~~~~G~~~~s~I~~~~dL~k~~Gd~v~f~L 75 (318) |+..+.+.--... +.+ .+.....-+++.|++.+...-.+.+-+. ..+++.+++ .|.+|.|+- T Consensus 1 ~~~~~~~~~~~~n~~t~~~~~--~~~~~~al~le~f~geV~~~f~~~si~~---------~~~~~rti~--~Gksv~f~~ 67 (375) T protein:vir:10 1 MANANQVALGRSNLSTGTGYG--GATDKYALYLKLFSGEMFKGFQHETIAR---------DLVTKRTLK--NGKSLQFIY 67 (375) T ss_pred CccccccccCccccCCccccc--cccchHHHHHHHHhHHHHHHHHHHHhhh---------ccccccccc--cCceEEEEe Confidence 7666654211100 000 1112223358999998876666665543 223444453 599999999 Q ss_pred eeccccCCeecCceeecc-hhhhhheeeEEEEecc---cccccccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 76 MHKLSKRPTMGDERVEGR-GEDLSHADFSLKINQG---RHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVH 151 (318) Q Consensus 76 ~~~L~G~gv~Gd~~leGn-ee~L~~~sd~v~Idq~---R~~V~~~g~ms~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~ 151 (318) +...+-...+..+.+.|+ .++..-.+.+|.||+. +|.|+ .+++-...+|||++.-.....-+++..|+.++.. T Consensus 68 iG~~t~~~~t~G~~i~~~~~~d~~~te~~l~ID~~~y~~~~Vd---DiD~aqa~~Dlr~e~s~~~G~aLA~~~D~~i~~~ 144 (375) T protein:vir:10 68 TGRMTSSFHTPGTPILGNADKAPPVAEKTIVMDDLLISSAFVY---DLDETLAHYELRGEISKKIGYALAEKYDRLIFRS 144 (375) T ss_pred eeeeEEeeecCCcCcCCccccCCCCCceEEEecchhhhhhhHh---hHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHHH Confidence 988888888888888887 4567778889999998 56665 6788888999999999999999999999999988 Q ss_pred Hh-ccccccccccceeecccccccccccccccCCCCCCceEeecCCccccccccccccCHHHHHHHHHHHHhcCCCCcee Q lcl|Aclame:pro 152 LA-GARGDFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPV 230 (318) Q Consensus 152 Ls-G~rg~~~n~~~~~p~~~~~~~~~~~~N~v~aPt~~r~~~~~~at~~~~i~a~D~~s~~~Id~a~~~a~~~~~pi~Pv 230 (318) |. +++.. .|... ++.-.|-...+--+.+.++...++. .--++.|..+.+.+++..-| T Consensus 145 l~kaa~~~-------~p~~~---------~~~~~~Gg~~i~~~sg~~~~~~~ta--~~~~~ai~~a~~~Lde~~VP---- 202 (375) T protein:vir:10 145 ITRGARSA-------SPVSA---------TNFVEPGGTQIRVGSGTNESDAFTA--SALVNAFYDAAAAMDEKGVS---- 202 (375) T ss_pred HHHhhhhc-------ccccc---------ccccccCcceeeeccccccccccCH--HHHHHHHHHHHHHHhhcCCC---- Confidence 86 23321 01000 0000000001111112222211211 11234455555666664444 Q ss_pred EeccccccCCcceEEEEecHHHHHHHHhCcchHHHHHHHHHHhhccccccCCcccCCeEEEcCEEEEecCceeEEEcCCC Q lcl|Aclame:pro 231 RLSGDELHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGQ 310 (318) Q Consensus 231 ~v~g~~~~~~~~~yV~~l~P~q~~dLr~d~~~~~w~~~qk~A~~r~~g~~nPlF~G~~gm~ngvii~e~~~~~iRf~ag~ 310 (318) .++ ++++++|++|.-|..+-+..... .+.-+.+.-.=.|.++.++|+-|.+-.++|- .+|. T Consensus 203 ---~~~-------R~~vv~P~~y~~Ll~~~d~~~~~-------n~d~~~~~~~~~g~v~~i~Gv~V~~Sn~lP~--~~~~ 263 (375) T protein:vir:10 203 ---SQG-------RCAVLNPRQYYALIQDIGSNGLV-------NRDVQGSALQSGNGVIEIAGIHIYKSMNIPF--LGKY 263 (375) T ss_pred ---CCC-------CEEEeChHHHHHHHhcCCcccee-------eecccccceeccceEEEEeceEEEEeccccc--cccc Confidence 111 56889999999998762211111 1110112223357799999999999666552 1111 Q ss_pred --------eee-----eeeeC Q lcl|Aclame:pro 311 --------RFW-----YQRIT 318 (318) Q Consensus 311 --------~v~-----~a~~~ 318 (318) ++- ...+. T Consensus 264 ~~~~g~~~~~~a~~~~~~~~~ 284 (375) T protein:vir:10 264 GVKYGGTTGETSPGNLGSHIG 284 (375) T ss_pred cccccccccccchhhhhcccc Confidence 000 00111 No 36 >protein:vir:96833 Length: 275 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240157;genbank:gi:66395822;genbank:GeneID:5133174 Probab=98.57 E-value=9.8e-09 Score=64.49 Aligned_cols=223 Identities=14% Similarity=0.051 Sum_probs=136.7 Q ss_pred CCcCCccchhHHHHHHHHHHhcccchHHHHhhhhhhhhhhhhcccccccCCCCCccEEEeeccCCCCCcEEEEEEeeccc Q lcl|Aclame:pro 1 MTTVTSAQANKLFQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLS 80 (318) Q Consensus 1 ~t~~~~~~~~~~~a~~lft~~~~~~~~v~~ws~~l~~~~~~~~~~~~~~G~~~~s~I~~~~dL~k~~Gd~v~f~L~~~L~ 80 (318) |++.|.- + ..=.|- .|+.-+.....+...|.. .....++|+-++|++|+|+....+ T Consensus 3 ~~~~T~l-~------------d~i~PE--v~~~~v~~~~~~~~~~~~--------~~~~~~~l~g~~G~tv~iP~~~~i- 58 (275) T protein:vir:96 3 LENMTKL-A------------NMVNPE--VLAPMMQAELDKKLKFAQ--------FADIDNTLVGQPGNTITFPAFVYS- 58 (275) T ss_pred Ccccchh-h------------hhhchH--HHHHHHHHHHHHhhhhcc--------cceecccccCCCCCEEEeeeeccC- Confidence 3222111 0 001222 366554333333333311 122346788889999999998765 Q ss_pred cCC--eecCceeecchhhhhheeeEEEEecccccccccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhccccc Q lcl|Aclame:pro 81 KRP--TMGDERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGD 158 (318) Q Consensus 81 G~g--v~Gd~~leGnee~L~~~sd~v~Idq~R~~V~~~g~ms~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~LsG~rg~ 158 (318) |+. +..++.+ ..+.|+..++++.|.+.-+++..... +..-+.-|+..++...++.+|++..|..++..|.++.. T Consensus 59 g~a~~~~~g~~i--~~~~lt~~~~~~~i~~~~~~~~i~D~-~~~~~~~d~~~~~~~~~a~~~a~~~d~~ll~~l~~a~~- 134 (275) T protein:vir:96 59 GDAKVVPEGEEI--PIDLIETKKRQATIRKIGKGTVLTDE-ALLSGYGDPKGEAVRQHGLAIANKVDNDVLEALQGATL- 134 (275) T ss_pred CccccccCCCCc--chhhcccceeeEEeehhcccccccHH-HHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhcccc- Confidence 332 1111222 26789999999999998888877653 55555679999999999999999999999877754321 Q ss_pred cccccceeecccccccccccccccCCCCCCceEeecCCccccccccccccCHHHHHHHHHHHHhcCCCCceeEecccccc Q lcl|Aclame:pro 159 FVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELH 238 (318) Q Consensus 159 ~~n~~~~~p~~~~~~~~~~~~N~v~aPt~~r~~~~~~at~~~~i~a~D~~s~~~Id~a~~~a~~~~~pi~Pv~v~g~~~~ 238 (318) .+ +++.++.+.|..|...+.... T Consensus 135 ----------------------~~---------------------~~~~~~~d~i~dA~~~lgd~~-------------- 157 (275) T protein:vir:96 135 ----------------------KV---------------------EADITKLAGLQTAIDKFNDED-------------- 157 (275) T ss_pred ----------------------cc---------------------cccccCHHHHHHHHHHhcccc-------------- Confidence 00 134567888888887664321 Q ss_pred CCcceEEEEecHHHHHHHHhCcchHHHHHHHHHHhhccccccCCcccCCeEEEcCEEEEecCceeEE----EcCC--C-- Q lcl|Aclame:pro 239 GEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIR----FYQG--Q-- 310 (318) Q Consensus 239 ~~~~~yV~~l~P~q~~dLr~d~~~~~w~~~qk~A~~r~~g~~nPlF~G~~gm~ngvii~e~~~~~iR----f~ag--~-- 310 (318) +..++++|||.++..|++++.. +|.. + ..+-.+.+..|.+|.|.|+-|.+-..+|-= |..| . T Consensus 158 --~~~~~ivv~p~~~~~L~k~~~~-~f~~----~---~~~g~~~~~~G~ig~~~G~~Vi~s~~~p~~t~~i~~~gA~~~~ 227 (275) T protein:vir:96 158 --LEPMVLFVNPLDAGKLRASATD-NFTR----A---TLLGDNVIVKGAFGEALGAIIVRSNKIKEGEAILAKRGAVKLI 227 (275) T ss_pred --CCccEEEeCHHHHHHHHhcccc-cccc----c---ccccccceeccccceecCeeEEEeCCCCcceEEEEeccceeee Confidence 1236899999999999999742 3431 1 222356788999999999999987777620 1111 0 Q ss_pred -----eeeeeeeC Q lcl|Aclame:pro 311 -----RFWYQRIT 318 (318) Q Consensus 311 -----~v~~a~~~ 318 (318) +|..-|-- T Consensus 228 ~~~~~~vE~~Rd~ 240 (275) T protein:vir:96 228 TKRDFFLETERHA 240 (275) T ss_pred ecCCcccccccch Confidence 11111111 No 37 >protein:vir:95107 Length: 270 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240822;genbank:gi:66394683;genbank:GeneID:5133901 Probab=98.57 E-value=1e-08 Score=64.33 Aligned_cols=218 Identities=14% Similarity=0.069 Sum_probs=128.9 Q ss_pred HHHHHHHHhcccchHHHHhhhhhhhhhhhhcccccccCCCCCccEEEeeccCCCCCcEEEEEEeeccccCC--eecCcee Q lcl|Aclame:pro 13 FQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLSKRP--TMGDERV 90 (318) Q Consensus 13 ~a~~lft~~~~~~~~v~~ws~~l~~~~~~~~~~~~~~G~~~~s~I~~~~dL~k~~Gd~v~f~L~~~L~G~g--v~Gd~~l 90 (318) ||.+-++..- .|-| |+.-+.....+...|..+ . ...++|.-.+|++|+|+... +.|+. +..++.+ T Consensus 1 Ma~T~~~d~I--~Pev--~~~~V~e~~~~~~~~~~~------~--~~d~~L~g~~G~ti~~P~~~-~igdae~~~eg~~i 67 (270) T protein:vir:95 1 MTQTKKANLI--NPEV--LANVVSAQMQNAIRFTPY------A--VTDDTLVGQPGDTITRPKYA-YIGAAEDLQEGVAM 67 (270) T ss_pred CCceehhhhc--chHH--HHHHHHHHHHhHHhhccc------c--ccccccCCCCCCEEEeeeec-CCCccccccCCCcc Confidence 4433332221 2322 554443333332333221 1 12467888899999999986 66552 2223333 Q ss_pred ecchhhhhheeeEEEEecccccccccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccceeeccc Q lcl|Aclame:pro 91 EGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGDFVADDTILPTAE 170 (318) Q Consensus 91 eGnee~L~~~sd~v~Idq~R~~V~~~g~ms~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~LsG~rg~~~n~~~~~p~~~ 170 (318) + .+.|+...+..+|-+.-.++..... +..-+.-|...++-..++.+|++..|..++-.|.|+... T Consensus 68 ~--~~~lt~~~~~a~i~~~gk~~~itD~-a~~~~~~dp~~~~~~q~a~~~a~~~d~~li~~l~~a~~~------------ 132 (270) T protein:vir:95 68 D--TTQMSMTTTKVTVKETGKAVEVTQT-AIITNVNGTLQEASRQLAMSLADKVEIDYIAELNKSKQT------------ 132 (270) T ss_pred c--hhhcccchheeeeehhhCcceecHH-HHhhhccchHHHHHHHHHHHHHHHHHHHHHHHhcccccc------------ Confidence 3 6789999999999888888877643 444444588999999999999999999999888765431 Q ss_pred ccccccccccccCCCCCCceEeecCCccccccccccccCHHHHHHHHHHHHhcCCCCceeEeccccccCCcceEEEEecH Q lcl|Aclame:pro 171 HPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHGEDPYYVLYVTP 250 (318) Q Consensus 171 ~~~~~~~~~N~v~aPt~~r~~~~~~at~~~~i~a~D~~s~~~Id~a~~~a~~~~~pi~Pv~v~g~~~~~~~~~yV~~l~P 250 (318) .+..++.+.+..|.... ||+ ++...+++||| T Consensus 133 ---------------------------------~~~~~t~~~~~dA~~~l-------------gd~---~~~~~~i~vhs 163 (270) T protein:vir:95 133 ---------------------------------ATVSADATGILDAIEVF-------------NSE---NDEDYVLYVNP 163 (270) T ss_pred ---------------------------------cccccCHHHHHHHHHHh-------------ccc---cCCCcEEEEcH Confidence 11124555555565544 221 12236899999 Q ss_pred HHHHHHHhCcchHHHHHHHHHHhhccccccCCcccCCeEEEcCE-EEE-e-cCceeEEE-cCCCeeee----------ee Q lcl|Aclame:pro 251 RQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNI-LVR-K-YAGMPIRF-YQGQRFWY----------QR 316 (318) Q Consensus 251 ~q~~dLr~d~~~~~w~~~qk~A~~r~~g~~nPlF~G~~gm~ngv-ii~-e-~~~~~iRf-~ag~~v~~----------a~ 316 (318) .++..||++. |.+ ..++-++.+.+|.+|.|.|+ ||. . .+...--| ..-+.+.+ -| T Consensus 164 ~~~~~Lrk~~----~~~-------~~~~~~~~~~~G~ig~~~G~~Viv~s~~~~~~~~~l~~~gAi~~~~~~~~~vEtdR 232 (270) T protein:vir:95 164 KDYNKLVKSL----FKV-------GGNVQDRAISKGDLVEIVGVSDIVKSKRVSENTAFLQRYGAMEIVNKKKPEAYTDF 232 (270) T ss_pred HHHHHHHhhh----ccc-------ccccccchhcccccceecceeEEEeCCCCCceeEEEEeccceeeeecCCceeeecc Confidence 9999999985 332 12345678999999999997 333 2 22111011 11111111 11 Q ss_pred eC Q lcl|Aclame:pro 317 IT 318 (318) Q Consensus 317 ~~ 318 (318) -- T Consensus 233 d~ 234 (270) T protein:vir:95 233 DI 234 (270) T ss_pred ch Confidence 11 No 38 >protein:vir:97031 Length: 402 # NCBI annotation: 31 # Family: family:all:2806 # MgeID: mge:1644 # MgeName: K1-5 # Cross-refs: genbank:acc:YP_654132;genbank:gi:108862016;genbank:GeneID:5075980 Probab=98.55 E-value=1.5e-08 Score=63.53 Aligned_cols=248 Identities=10% Similarity=0.050 Sum_probs=136.9 Q ss_pred CCcCCccchhHHHHHHHHHHhcccchHHHHhhhhhhhhhhhhcccccccCCCCCccEEEeeccCCCCCcEEEEEEeeccc Q lcl|Aclame:pro 1 MTTVTSAQANKLFQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLS 80 (318) Q Consensus 1 ~t~~~~~~~~~~~a~~lft~~~~~~~~v~~ws~~l~~~~~~~~~~~~~~G~~~~s~I~~~~dL~k~~Gd~v~f~L~~~L~ 80 (318) ||..+..-=. .+++ +-....-++|.|.+.+...-...+.+..+ . ++.++. .|.++.|+-+...+ T Consensus 1 Ms~~n~~t~~-~~~~----s~~~~al~le~f~geV~taF~~~si~~~~------~---~vrti~--~GkS~qf~~iG~~~ 64 (402) T protein:vir:97 1 MSTPNTLTNV-AVSA----SGEVDSLLIEKFNGKVNEQYLKGENILSY------F---DVQTVT--GTNTVSNKYLGETE 64 (402) T ss_pred CCCccccccc-cccc----ccchhhhhhhhhhhhHHHHHHHHHhhcCc------c---eeeeec--ccceEEEEEEeeeE Confidence 7766543111 0111 11222234788888765555544443222 2 223443 78899999997777 Q ss_pred cCCeecCceeecchhhhhheeeEEEEecc---cccccccchhhhhhhhhh-HHHHHHHHHHHHHHHHHHHHHH--HHHhc Q lcl|Aclame:pro 81 KRPTMGDERVEGRGEDLSHADFSLKINQG---RHLVDAGGRMSQQRTKFN-LASSARTLLGTYFNDLQDQCAI--VHLAG 154 (318) Q Consensus 81 G~gv~Gd~~leGnee~L~~~sd~v~Idq~---R~~V~~~g~ms~qrs~~d-lr~~ar~~L~~w~~~~~D~~~~--~~LsG 154 (318) -...+-.+.+.| +.+......|.||++ ||.|+ .+++-.+-+| +|.+--..+..-+++..||.+| +.+++ T Consensus 65 a~y~~~G~~ldg--~~~~~~k~~ItID~lL~a~~~V~---diDeaq~~yD~vRse~s~e~G~ALA~~~Dq~ii~~i~~aa 139 (402) T protein:vir:97 65 LQVLAPGQSPNA--TPTQADKNQLVIDTTVIARNTVA---HIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGG 139 (402) T ss_pred EeeeccccccCC--CCcccccEEEEeCceeechhhhh---hHHHHHhcccchhHHHHHHHHHHHHHHHHHHHHHHHHHhh Confidence 666665566765 467777777999995 66664 4567777789 8999888899999999999875 33444 Q ss_pred cccccccccceeecccccccccccccccCCCCCCceEeecCCccccccccccc-cCHHHHH----HHHHHHHhcCCCCce Q lcl|Aclame:pro 155 ARGDFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADI-FSIGLVD----NLSLFIDEMAHPLQP 229 (318) Q Consensus 155 ~rg~~~n~~~~~p~~~~~~~~~~~~N~v~aPt~~r~~~~~~at~~~~i~a~D~-~s~~~Id----~a~~~a~~~~~pi~P 229 (318) ... ... |+.+--..+.+.......+.++. -+...|- .|....++..-|. T Consensus 140 ~a~-t~~-----------------------~~~~~~~~~~g~s~~~~~t~~~a~~~~~~l~~ai~~a~~~LdEkdVP~-- 193 (402) T protein:vir:97 140 IAN-TKA-----------------------ERNKPRVKGHGFSINVNVTESEALANPQYVMAAVEYALEQQLEQEVDI-- 193 (402) T ss_pred ccc-ccc-----------------------ccccCcccccccccccccccchhhcCHHHHHHHHHHHHHHHHhcCCCc-- Confidence 321 111 11000111111110001111111 1333333 3334444433331 Q ss_pred eEeccccccCCcceEEEEecHHHHHHHHhCcchHHHHHHHHHHhhccccccCCcccCCeEEEcCEEEEecCceeEEEcCC Q lcl|Aclame:pro 230 VRLSGDELHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQG 309 (318) Q Consensus 230 v~v~g~~~~~~~~~yV~~l~P~q~~dLr~d~~~~~w~~~qk~A~~r~~g~~nPlF~G~~gm~ngvii~e~~~~~iRf~ag 309 (318) ++ ++++++|.||.-|..++.+- + +.... ...+.+=.|.+++.+||-|.|-+++| +. + T Consensus 194 -----~d-------Rv~vv~P~~y~~Ll~~~rl~---n---~d~~~--~~~g~~~~G~v~~v~Gv~Vv~SnnlP--~~-a 250 (402) T protein:vir:97 194 -----SD-------VAIMMPWKFFNALRDADRIV---D---KTYTI--SQSGATINGFVLSSYNCPVIPSNRFP--TF-A 250 (402) T ss_pred -----cc-------cEEEeChHHHHHHhhccccc---c---hhhcc--ccCCccccceeEEEeceEEEecCccc--cc-c Confidence 11 69999999999999997541 1 11100 12244557999999999999988776 22 2 Q ss_pred CeeeeeeeC Q lcl|Aclame:pro 310 QRFWYQRIT 318 (318) Q Consensus 310 ~~v~~a~~~ 318 (318) +++....++ T Consensus 251 ~~it~~~ls 259 (402) T protein:vir:97 251 QDQAHHLLS 259 (402) T ss_pred ccccccccc Confidence 233222222 No 39 >protein:vir:3033 Length: 272 # NCBI annotation: major capsid protein # Family: family:all:522 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438146;genbank:gi:16271809;genbank:GeneID:929235 Probab=98.53 E-value=2.3e-08 Score=62.45 Aligned_cols=223 Identities=11% Similarity=0.075 Sum_probs=132.5 Q ss_pred CCcCCccchhHHHHHHHHHHhcccchHHHHhhhhhhhhhhhhcccccccCCCCCccEEEeeccCCCCCcEEEEEEeeccc Q lcl|Aclame:pro 1 MTTVTSAQANKLFQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLS 80 (318) Q Consensus 1 ~t~~~~~~~~~~~a~~lft~~~~~~~~v~~ws~~l~~~~~~~~~~~~~~G~~~~s~I~~~~dL~k~~Gd~v~f~L~~~L~ 80 (318) |.+.+...+. ++ -| ..|+..+.....+..-+.. ...+-.+|+..+|++|+++....+. T Consensus 1 MA~~~T~~~~------~~------iP--ev~s~~v~~~~~~~~~~~~--------~~~~~~~~~g~~G~tv~iP~~~~~~ 58 (272) T protein:vir:30 1 MAVGTTKMAQ------ML------DP--EVLADMIDAEVGKAIRFAP--------LAEVDTTLEGQPGTTLTVPKWDYIG 58 (272) T ss_pred CCCccccchh------ee------ch--HHHHHHHHHHHHHHhhhhc--------cccccccccCCCCCEEEEEEecCCC Confidence 5432222111 11 12 2355544322222222211 1112245777899999998875443 Q ss_pred cC-CeecCceeecchhhhhheeeEEEEecccccccccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhcccccc Q lcl|Aclame:pro 81 KR-PTMGDERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGDF 159 (318) Q Consensus 81 G~-gv~Gd~~leGnee~L~~~sd~v~Idq~R~~V~~~g~ms~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~LsG~rg~~ 159 (318) .. .+..++.+ -.+.+.+.+.++.|.+.-+.+..... ...++..|+.....+.|.+.|++..|..++..+.|+.- T Consensus 59 ~a~~v~eg~~i--~~~~~~~~~~~~~~~~~~~~~~itd~-~~~~s~~d~~~~~~~~~~~~~a~~~d~~i~~~~~~a~~-- 133 (272) T protein:vir:30 59 DAEDVAEGEAI--PMTQLGFKKTTMTIKKAGKGVEITDE-AILSGYGDPVGQAAKQIVEAIDHKVDADVLDALSKSTQ-- 133 (272) T ss_pred CcccccCCCcc--cccccccceEEEEeeeeeeeeeecHH-HHhhccccHHHHHHHHHHHHHHHHHHHHHHHHhccccc-- Confidence 22 23222222 26778999999999998888777655 44557789999999999999999999999877754321 Q ss_pred ccccceeecccccccccccccccCCCCCCceEeecCCccccccccccccCHHHHHHHHHHHHhcCCCCceeEeccccccC Q lcl|Aclame:pro 160 VADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHG 239 (318) Q Consensus 160 ~n~~~~~p~~~~~~~~~~~~N~v~aPt~~r~~~~~~at~~~~i~a~D~~s~~~Id~a~~~a~~~~~pi~Pv~v~g~~~~~ 239 (318) .+ +...+++.|.+|...+...+.. T Consensus 134 ---------------------~~----------------------~~~~t~d~i~da~~~l~~~~~~------------- 157 (272) T protein:vir:30 134 ---------------------TV----------------------EATATVDGVSKALDIFNDEDDA------------- 157 (272) T ss_pred ---------------------cc----------------------ccccCHHHHHHHHHHHhccCCC------------- Confidence 11 1122456666776665543211 Q ss_pred CcceEEEEecHHHHHHHHhCcchHHHHHHHHHHhhccccccCCcccCCeEEEcCEEEEecCceeEE----EcCCCeeeee Q lcl|Aclame:pro 240 EDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIR----FYQGQRFWYQ 315 (318) Q Consensus 240 ~~~~yV~~l~P~q~~dLr~d~~~~~w~~~qk~A~~r~~g~~nPlF~G~~gm~ngvii~e~~~~~iR----f~ag~~v~~a 315 (318) ..+++|||..+..|+++... +|. +. .....+.+..|.+|.|.|+-|..-+.+|-= |..| .+.++ T Consensus 158 ---~~~~vv~p~~~~~L~k~~~~-~~~---~~----~~~~~~~~~~g~ig~i~G~~Vi~s~~~p~~t~~~~~~~-a~~~~ 225 (272) T protein:vir:30 158 ---ETVIVMNPADASTLRLDAAK-EWL---GA----TEVGANRVVSGVYGEVLGVQIVRSRKCPKGTAYMVRKG-ALRIM 225 (272) T ss_pred ---ccEEEEcHHHHHHHHHhccc-ccc---cc----ccccccccccccchhhcCeeEEEcCCCCcceEEEEcCC-eEEEE Confidence 25799999999999988532 221 11 123356788999999999999998877721 2222 22222 Q ss_pred eeC Q lcl|Aclame:pro 316 RIT 318 (318) Q Consensus 316 ~~~ 318 (318) .-. T Consensus 226 ~~~ 228 (272) T protein:vir:30 226 LKR 228 (272) T ss_pred ecC Confidence 111 No 40 >protein:vir:9820 Length: 272 # NCBI annotation: putative major capsid/head protein # Family: family:all:522 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795582;genbank:gi:28876339;genbank:GeneID:1257858 Probab=98.53 E-value=2.3e-08 Score=62.45 Aligned_cols=223 Identities=11% Similarity=0.075 Sum_probs=132.5 Q ss_pred CCcCCccchhHHHHHHHHHHhcccchHHHHhhhhhhhhhhhhcccccccCCCCCccEEEeeccCCCCCcEEEEEEeeccc Q lcl|Aclame:pro 1 MTTVTSAQANKLFQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLS 80 (318) Q Consensus 1 ~t~~~~~~~~~~~a~~lft~~~~~~~~v~~ws~~l~~~~~~~~~~~~~~G~~~~s~I~~~~dL~k~~Gd~v~f~L~~~L~ 80 (318) |.+.+...+. ++ -| ..|+..+.....+..-+.. ...+-.+|+..+|++|+++....+. T Consensus 1 MA~~~T~~~~------~~------iP--ev~s~~v~~~~~~~~~~~~--------~~~~~~~~~g~~G~tv~iP~~~~~~ 58 (272) T protein:vir:98 1 MAVGTTKMAQ------ML------DP--EVLADMIDAEVGKAIRFAP--------LAEVDTTLEGQPGTTLTVPKWDYIG 58 (272) T ss_pred CCCccccchh------ee------ch--HHHHHHHHHHHHHHhhhhc--------cccccccccCCCCCEEEEEEecCCC Confidence 5432222111 11 12 2355544322222222211 1112245777899999998875443 Q ss_pred cC-CeecCceeecchhhhhheeeEEEEecccccccccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhcccccc Q lcl|Aclame:pro 81 KR-PTMGDERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGDF 159 (318) Q Consensus 81 G~-gv~Gd~~leGnee~L~~~sd~v~Idq~R~~V~~~g~ms~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~LsG~rg~~ 159 (318) .. .+..++.+ -.+.+.+.+.++.|.+.-+.+..... ...++..|+.....+.|.+.|++..|..++..+.|+.- T Consensus 59 ~a~~v~eg~~i--~~~~~~~~~~~~~~~~~~~~~~itd~-~~~~s~~d~~~~~~~~~~~~~a~~~d~~i~~~~~~a~~-- 133 (272) T protein:vir:98 59 DAEDVAEGEAI--PMTQLGFKKTTMTIKKAGKGVEITDE-AILSGYGDPVGQAAKQIVEAIDHKVDADVLDALSKSTQ-- 133 (272) T ss_pred CcccccCCCcc--cccccccceEEEEeeeeeeeeeecHH-HHhhccccHHHHHHHHHHHHHHHHHHHHHHHHhccccc-- Confidence 22 23222222 26778999999999998888777655 44557789999999999999999999999877754321 Q ss_pred ccccceeecccccccccccccccCCCCCCceEeecCCccccccccccccCHHHHHHHHHHHHhcCCCCceeEeccccccC Q lcl|Aclame:pro 160 VADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHG 239 (318) Q Consensus 160 ~n~~~~~p~~~~~~~~~~~~N~v~aPt~~r~~~~~~at~~~~i~a~D~~s~~~Id~a~~~a~~~~~pi~Pv~v~g~~~~~ 239 (318) .+ +...+++.|.+|...+...+.. T Consensus 134 ---------------------~~----------------------~~~~t~d~i~da~~~l~~~~~~------------- 157 (272) T protein:vir:98 134 ---------------------TV----------------------EATATVDGVSKALDIFNDEDDA------------- 157 (272) T ss_pred ---------------------cc----------------------ccccCHHHHHHHHHHHhccCCC------------- Confidence 11 1122456666776665543211 Q ss_pred CcceEEEEecHHHHHHHHhCcchHHHHHHHHHHhhccccccCCcccCCeEEEcCEEEEecCceeEE----EcCCCeeeee Q lcl|Aclame:pro 240 EDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIR----FYQGQRFWYQ 315 (318) Q Consensus 240 ~~~~yV~~l~P~q~~dLr~d~~~~~w~~~qk~A~~r~~g~~nPlF~G~~gm~ngvii~e~~~~~iR----f~ag~~v~~a 315 (318) ..+++|||..+..|+++... +|. +. .....+.+..|.+|.|.|+-|..-+.+|-= |..| .+.++ T Consensus 158 ---~~~~vv~p~~~~~L~k~~~~-~~~---~~----~~~~~~~~~~g~ig~i~G~~Vi~s~~~p~~t~~~~~~~-a~~~~ 225 (272) T protein:vir:98 158 ---ETVIVMNPADASTLRLDAAK-EWL---GA----TEVGANRVVSGVYGEVLGVQIVRSRKCPKGTAYMVRKG-ALRIM 225 (272) T ss_pred ---ccEEEEcHHHHHHHHHhccc-ccc---cc----ccccccccccccchhhcCeeEEEcCCCCcceEEEEcCC-eEEEE Confidence 25799999999999988532 221 11 123356788999999999999998877721 2222 22222 Q ss_pred eeC Q lcl|Aclame:pro 316 RIT 318 (318) Q Consensus 316 ~~~ 318 (318) .-. T Consensus 226 ~~~ 228 (272) T protein:vir:98 226 LKR 228 (272) T ss_pred ecC Confidence 111 No 41 >protein:vir:80930 Length: 278 # NCBI annotation: Cps # Family: family:all:522 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468392;genbank:gi:157324966;genbank:GeneID:5601363 Probab=98.52 E-value=2.7e-08 Score=62.12 Aligned_cols=231 Identities=13% Similarity=0.050 Sum_probs=131.2 Q ss_pred CCcCCccchhHHHHHHHHHHhcccchHHHHhhhhhhhhhhhhcccccccCCCCCccEEEeeccCCCCCcEEEEEEeeccc Q lcl|Aclame:pro 1 MTTVTSAQANKLFQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLS 80 (318) Q Consensus 1 ~t~~~~~~~~~~~a~~lft~~~~~~~~v~~ws~~l~~~~~~~~~~~~~~G~~~~s~I~~~~dL~k~~Gd~v~f~L~~~L~ 80 (318) |-+.+...+. +| .| ..|+..+.....+..-+. +. .....+|+-++||+|+|+....++ T Consensus 1 Ma~~~T~~~~------~i------iP--ev~s~~v~~~~~~~~v~~-------~~-~~~~~~l~g~~G~tv~ip~~~~~g 58 (278) T protein:vir:80 1 MADLTTKLAN------LI------DP--EVMGPMISAKLPKAIKFG-------KI-APIDNSLEGQPGSEITVPKYKYIG 58 (278) T ss_pred CCCcceehhh------ee------cH--HHHHHHHHHHHHHhhhhc-------cc-ceecccccCCCCCEEEEeeeccCC Confidence 3321111110 01 12 236555422222211111 11 123456777889999999987663 Q ss_pred cC-CeecCceeecchhhhhheeeEEEEecccccccccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhcccccc Q lcl|Aclame:pro 81 KR-PTMGDERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGDF 159 (318) Q Consensus 81 G~-gv~Gd~~leGnee~L~~~sd~v~Idq~R~~V~~~g~ms~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~LsG~rg~~ 159 (318) .. .+..++.+ ..+.|+..++++.|++.-.++.... ++..-+..|+...+...++.+|++..|..++.+|.|+.... T Consensus 59 ~a~~~~~g~~i--~~~~lt~~~~~~~i~~~~~a~~v~D-~~~~~~~~d~~~~~~~~~a~~~a~~~d~~l~~~l~~a~~~~ 135 (278) T protein:vir:80 59 DAQDVAEGAAI--DYSALETESVKHGIKKAGKGVKLTD-ESVLSGYGDPVEEAQKQIRMAIASKVDNDILEEALTTTLEV 135 (278) T ss_pred cceeecCCCcC--cccccccceeeEeeehhhccccccH-HHHhhccccHHHHHHHHHHHHHHHHHHHHHHHHHhcccccc Confidence 22 12222222 2578999999999999877877664 46666778999999999999999999999999987654210 Q ss_pred ccccceeecccccccccccccccCCCCCCceEeecCCccccccccccccCHHHHHHHHHHHHhcCCCCceeEeccccccC Q lcl|Aclame:pro 160 VADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHG 239 (318) Q Consensus 160 ~n~~~~~p~~~~~~~~~~~~N~v~aPt~~r~~~~~~at~~~~i~a~D~~s~~~Id~a~~~a~~~~~pi~Pv~v~g~~~~~ 239 (318) . .+++ ....| -.++.+..+.......+.| T Consensus 136 ---~-------------------~~~t---------------~~~~~-~~~~~~~da~~~l~~~~~~------------- 164 (278) T protein:vir:80 136 ---K-------------------GAIN---------------IGLID-KIENTFTDAPDAIEDESIT------------- 164 (278) T ss_pred ---c-------------------cccc---------------cchhh-hHHHHHHHHHHhhcccCCC------------- Confidence 0 0000 00000 1133444444433332222 Q ss_pred CcceEEEEecHHHHHHHHhCcchHHHHHHHHHHhhccccccCCcccCCeEEEcCEEEEecCceeEE----EcCCCeeeee Q lcl|Aclame:pro 240 EDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIR----FYQGQRFWYQ 315 (318) Q Consensus 240 ~~~~yV~~l~P~q~~dLr~d~~~~~w~~~qk~A~~r~~g~~nPlF~G~~gm~ngvii~e~~~~~iR----f~ag~~v~~a 315 (318) ...+++|||.++..|++++.. +|. + + ...-++.+..|.+|.|.|+-|.+-.++|-- |..| .+.+. T Consensus 165 --~~~~ivv~p~~~~~L~k~~~~-~~~---~-~---~~~g~~~~~~G~ig~~~G~~Vi~s~~~p~~t~~l~~~g-Ai~~~ 233 (278) T protein:vir:80 165 --TTGVLFLNYKDTAKLREEAAG-SWT---K-A---SQLGDDLLVKGAFGELLGWEIVRTKKLADGNALAVKAG-ALKTF 233 (278) T ss_pred --cccEEEECHHHHHHHHhhhhh-hcc---c-c---ccccccceeeccceeecceeEEEcCCCCcceEEEEecc-ceeee Confidence 124689999999999999743 221 1 1 122245677899999999999998888741 1111 22211 Q ss_pred --eeC Q lcl|Aclame:pro 316 --RIT 318 (318) Q Consensus 316 --~~~ 318 (318) +.. T Consensus 234 ~~~~~ 238 (278) T protein:vir:80 234 LKRNL 238 (278) T ss_pred ecCCc Confidence 111 No 42 >protein:vir:95875 Length: 401 # NCBI annotation: major coat protein # Family: family:all:10944 # MgeID: mge:1586 # MgeName: N4 # Cross-refs: genbank:acc:YP_950534;genbank:gi:119952248;genbank:GeneID:5075702 Probab=98.41 E-value=1.9e-08 Score=62.93 Aligned_cols=271 Identities=13% Similarity=0.119 Sum_probs=141.1 Q ss_pred CCcCCccchhHHHHHHHHHHhcccchH--HHHhhhhhhhhhhhhcccccccCCCCCccEEEeeccCCCCCcEEEEEEeec Q lcl|Aclame:pro 1 MTTVTSAQANKLFQVALFTAANRNRSM--VNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHK 78 (318) Q Consensus 1 ~t~~~~~~~~~~~a~~lft~~~~~~~~--v~~ws~~l~~~~~~~~~~~~~~G~~~~s~I~~~~dL~k~~Gd~v~f~L~~~ 78 (318) |-+ |-.+..-+.-.-+ +-+++. ...|-+|+-..+.+.-.+..|- ...++-|+.|-+|.|--..+ T Consensus 1 ~~~--~~a~~~~~~~s~~---g~~~~~~~t~y~~~k~L~~Aa~~lv~~~fA---------~~~piPkn~GkTIk~r~y~p 66 (401) T protein:vir:95 1 MLN--YNAPTDGQKSSID---GANSDQMQTFFWLKKAIITARKEQYFMPLA---------SVTNMPKHYGKTIKVYEYVP 66 (401) T ss_pred CCc--cCCCccccccccc---ccccceeeehhhHHHHHhhhhhhhhhhhcc---------cccccccccCCeEEEEeccc Confidence 322 2211111111111 222332 2247788766666665565554 23456789999999988877 Q ss_pred cccC--CeecCceeec------chh--h----------hhheeeEEEEeccccc-ccccchhhhhhhhhhHHHHHHHHHH Q lcl|Aclame:pro 79 LSKR--PTMGDERVEG------RGE--D----------LSHADFSLKINQGRHL-VDAGGRMSQQRTKFNLASSARTLLG 137 (318) Q Consensus 79 L~G~--gv~Gd~~leG------nee--~----------L~~~sd~v~Idq~R~~-V~~~g~ms~qrs~~dlr~~ar~~L~ 137 (318) |.-. |-...-..+| +.= . +......=++|+..+. ++..|++.|.=-...|-.+.-+-- T Consensus 67 l~~~~~pl~eGv~a~G~~~~~g~~y~~~rdv~~it~~m~~~t~~~~rvn~v~~~~~d~~g~l~qyG~~~e~Td~~~dt~- 145 (401) T protein:vir:95 67 LLDDRNINDQGIDASGATIVNGNLYGSSKDIGNITSKLPLLTENGGRVNRVGFTRIAREGSIHKFGFFYEFTQESIDFD- 145 (401) T ss_pred ccccccchhcCCCcccccccCccccccccccceeecccccccccccccccccceeeeeeeeeeeccCccchhhhhhhhh- Confidence 7532 2212111222 100 0 1111112234444332 234444444221112211111111 Q ss_pred HHHHHHHHHHHHHH-----HhccccccccccceeecccccccccccccccCCCCCCceEeecCCccccccc----ccccc Q lcl|Aclame:pro 138 TYFNDLQDQCAIVH-----LAGARGDFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIE----AADIF 208 (318) Q Consensus 138 ~w~~~~~D~~~~~~-----LsG~rg~~~n~~~~~p~~~~~~~~~~~~N~v~aPt~~r~~~~~~at~~~~i~----a~D~~ 208 (318) .|..+.-| |.|..+... +-+ .+++.+ ...-++|++.+++.++++ +...+ T Consensus 146 ------~D~~l~~h~s~ell~g~~~~t~--d~i-------------~~dll~-ag~~viyAg~ats~At~~~~~~~~t~v 203 (401) T protein:vir:95 146 ------SDDGLMEHLSRELMNGATQITE--AVL-------------QKDLLA-AAGTVLYAGAATSDATITGEGSTPSVV 203 (401) T ss_pred ------cchHHHHHHHHHHhhhhhhhHH--HHH-------------HHHHHh-hcCeeecCCccceeeecccccccccee Confidence 11222222 222222100 000 111111 122367777777777555 66789 Q ss_pred CHHHHHHHHHHHHhcCCCCceeEeccccccC---CcceEEEEecH------HHHHHHHhCcchHHHHHHHHHHhhccccc Q lcl|Aclame:pro 209 SIGLVDNLSLFIDEMAHPLQPVRLSGDELHG---EDPYYVLYVTP------RQWNDWYTSTSGKDWNQMMVRAVNRAKGF 279 (318) Q Consensus 209 s~~~Id~a~~~a~~~~~pi~Pv~v~g~~~~~---~~~~yV~~l~P------~q~~dLr~d~~~~~w~~~qk~A~~r~~g~ 279 (318) +++.|.++...+..-..|.+-..+.|-.+.+ ..+.||.|||| +.++||..||+ |.+.+|+|.+ T Consensus 204 t~~~l~rl~~~L~~nRapk~t~~i~~s~~~dTk~i~~s~va~~h~~L~~di~a~~D~~~~~~---fi~v~kYa~~----- 275 (401) T protein:vir:95 204 SYKNLMRLDQILTENRTPTQTTIITGSRMIDTKVIGATRVMYVGSELVPELKAMKDLFGNKA---FIETQHYADA----- 275 (401) T ss_pred chhHHHHHHHHHHhcccccchhhhhhhhccCccccccceEEEEecCchhHHHHHHHhcCCCC---ceehhhcCCc----- Confidence 9999999999999877665544444433333 23569999999 88888889986 7888998744 Q ss_pred cCCcccCCeEEEcCEEEEecCceeEEE-cCCCeeeee-----ee-------------C Q lcl|Aclame:pro 280 NHPLFKGECAMWRNILVRKYAGMPIRF-YQGQRFWYQ-----RI-------------T 318 (318) Q Consensus 280 ~nPlF~G~~gm~ngvii~e~~~~~iRf-~ag~~v~~a-----~~-------------~ 318 (318) .++|.||+|.++|+-+..-|.+ -+| .+|.....+ .- . T Consensus 276 -~~i~~gEiG~i~~vR~i~~p~~-~~w~~ag~~a~~~~~~y~~~~~~~gg~~dVyp~l 331 (401) T protein:vir:95 276 -GTIMNGEVGSIDKFRIIQVPEM-LHWAGAGAQATGANPGYRTSMVSGQEHYDVYPML 331 (401) T ss_pred -cccccccccccCceeEEecccc-eeecCCcccccccccccccccccCCCcceeeeee Confidence 5799999999999999998876 445 344322221 11 1 No 43 >protein:vir:103323 Length: 364 # NCBI annotation: major capsid-like protein # Family: family:all:2806 # MgeID: mge:1609 # MgeName: Era103 # Cross-refs: genbank:acc:YP_001039668;genbank:gi:125999997;genbank:GeneID:4818399 Probab=98.39 E-value=8.1e-08 Score=59.46 Aligned_cols=257 Identities=9% Similarity=0.026 Sum_probs=136.7 Q ss_pred CCcCCccchhHHHHHHHHHHhcccchHHHHhhhhhhhhhhhhcccccccCCCCCccEEEeeccCCCCCcEEEEEEeeccc Q lcl|Aclame:pro 1 MTTVTSAQANKLFQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLS 80 (318) Q Consensus 1 ~t~~~~~~~~~~~a~~lft~~~~~~~~v~~ws~~l~~~~~~~~~~~~~~G~~~~s~I~~~~dL~k~~Gd~v~f~L~~~L~ 80 (318) ||..+..-=. .+++ +-....-++|.|.+.+...-...+-+..+ . ++.++ ..|.++.|+-+...+ T Consensus 1 ms~~n~~t~~-~~~~----~~~~~al~le~f~geV~taf~~~s~~~~~------~---~~rti--~~gkS~q~~~iG~~~ 64 (364) T protein:vir:10 1 MSNPNVLTQP-AVSA----SGEVDSLLIEKFNNRVHEQYLKGENLLQW------F---DVQEV--VGTNSVSNKYIGETE 64 (364) T ss_pred CCCccccccc-cccc----ccchhhhhhhhhhhhHHHHHHHHHhhcCc------c---eeeee--cccceEEeeeeeeeE Confidence 7766543111 1111 11222235788888765555444443222 2 22334 388999999997777 Q ss_pred cCCeecCceeecchhhhhheeeEEEEecc---cccccccchhhhhhhhhh-HHHHHHHHHHHHHHHHHHHHHHHHH-hcc Q lcl|Aclame:pro 81 KRPTMGDERVEGRGEDLSHADFSLKINQG---RHLVDAGGRMSQQRTKFN-LASSARTLLGTYFNDLQDQCAIVHL-AGA 155 (318) Q Consensus 81 G~gv~Gd~~leGnee~L~~~sd~v~Idq~---R~~V~~~g~ms~qrs~~d-lr~~ar~~L~~w~~~~~D~~~~~~L-sG~ 155 (318) -...+-.+.+.| +.+.....+|.||+. ||.|+ .+++-..-+| +|++--..+..-+++..||.++..+ +++ T Consensus 65 ~~~~~~G~~ld~--~~~~~~k~~itID~ll~a~~~V~---diDe~q~~~D~vR~e~s~e~G~ALA~~~Dq~i~~~v~~aa 139 (364) T protein:vir:10 65 LQVLSPGKSPDA--SPTEFDKNRLVVDTTVIARNTVA---HFHDVQNDIDGLKSKLSVNQAKKLKKMEDSMVIQQLVLGG 139 (364) T ss_pred EeeeccCcccCC--CCcccCcEEEEecceeeechhhh---hHHHHhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhh Confidence 665555555655 567777889999995 66664 4566677789 8988888889999999999887544 222 Q ss_pred ccccccccceeecccccccccccccccCCCCCCceEeecCCccccccccccccCHHHHHHHHHHHHhcCCCCceeEeccc Q lcl|Aclame:pro 156 RGDFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGD 235 (318) Q Consensus 156 rg~~~n~~~~~p~~~~~~~~~~~~N~v~aPt~~r~~~~~~at~~~~i~a~D~~s~~~Id~a~~~a~~~~~pi~Pv~v~g~ 235 (318) .. | ..+.+. ++.-.|-...+ -.++.++. ..+..+.+ ++.|..|....++..-|. + T Consensus 140 ~a---~--------~~~~~~----~~~~~~~g~~i-~~~~~a~~-~~~~~~~l-~~ai~~a~~~LdEkdVP~-------~ 194 (364) T protein:vir:10 140 IS---N--------TEAIRK----NPRVAGHGFSI-HIVGLASS-FLTSPQYM-MAAIEMAMEQQTEQEVDT-------S 194 (364) T ss_pred hh---c--------cccccc----CCcccCCccee-eecccCcc-hhhhHHHH-HHHHHHHHHHHhhcCCCc-------c Confidence 11 0 000000 01000100000 00111111 01111111 122333444445544331 1 Q ss_pred cccCCcceEEEEecHHHHHHHHhCcchHHHHHHHHHHhhccccccCCcccCCeEEEcCEEEEecCceeE----------- Q lcl|Aclame:pro 236 ELHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPI----------- 304 (318) Q Consensus 236 ~~~~~~~~yV~~l~P~q~~dLr~d~~~~~w~~~qk~A~~r~~g~~nPlF~G~~gm~ngvii~e~~~~~i----------- 304 (318) + ++++|.|.||..|..++.+- + ......+ .+..-+|.+++.+||-|.|-+++|- T Consensus 195 ~-------R~~vv~P~~y~~Ll~~~~lv---n----~d~~~~~-~~~~~~G~v~~v~Gv~Vv~Sn~lP~~~~~~~~t~~~ 259 (364) T protein:vir:10 195 E-------LCGLMPWTAFNCLRDADRIV---D----KSYTIAA-SDNTVDGFVLKSWNTPIVPSNRFPKLSDNTEGTGNT 259 (364) T ss_pred c-------cEEEeChHHHHHHhcCCccc---c----ccccccC-CCccccceeEEEeceEEEeccccccccccccccccc Confidence 1 79999999999999987542 1 0001111 2445689999999999999888874 Q ss_pred ------------EEc--CCCee-eeeeeC Q lcl|Aclame:pro 305 ------------RFY--QGQRF-WYQRIT 318 (318) Q Consensus 305 ------------Rf~--ag~~v-~~a~~~ 318 (318) +|+ ++.+- .+...+ T Consensus 260 t~h~ls~~~~g~~y~v~~d~~~~~~~~f~ 288 (364) T protein:vir:10 260 KHHKLSNAGNGNRYDVTAGQTSAQAVLFT 288 (364) T ss_pred cccccccccCCcccccccccceeEEEEEe Confidence 121 11111 111111 No 44 >protein:vir:1583 Length: 351 # NCBI annotation: minor capsid protein # Family: family:all:1522 # MgeID: mge:32 # MgeName: phig1e # Cross-refs: genbank:acc:NP_695165;swissprot:trembl:o03966;genbank:gi:23455804;uniprot:O03966;genbank:GeneID:955561 Probab=98.37 E-value=4e-08 Score=61.12 Aligned_cols=235 Identities=9% Similarity=0.088 Sum_probs=138.6 Q ss_pred CCcCCccchhHHHHHHHHHHhcccchHHHHhhhhhhhhhhhhcccccccCCCCCccEEEeecc---CCCCCcEEEEEEee Q lcl|Aclame:pro 1 MTTVTSAQANKLFQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDL---NKQAGDEVTFSIMH 77 (318) Q Consensus 1 ~t~~~~~~~~~~~a~~lft~~~~~~~~v~~ws~~l~~~~~~~~~~~~~~G~~~~s~I~~~~dL---~k~~Gd~v~f~L~~ 77 (318) |.+..-++-.. |- .|..-+.... .....|.+. .+|+...+| -.++|+.|+|++.. T Consensus 1 MA~T~lsd~i~--------------PE--vf~~yv~~~~---~~~~~l~qS---G~i~~~~~l~~~~~~~G~~it~P~~~ 58 (351) T protein:vir:15 1 MAETHLSDLIV--------------PE--VFGNYVVNQI---IKTNRFVQS---GILTPDPDLGPHLLEAGTRITVPFLN 58 (351) T ss_pred CCceeeeeeec--------------hh--HHHHHHhhhh---HHhhhHhhc---ccccccHHHHHHhhcCCCEEEecccc Confidence 44322221111 11 1222221111 112345553 455544444 34799999999999 Q ss_pred ccccCCe--ecCceeecchhhhhheeeEEEEecccccccccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhcc Q lcl|Aclame:pro 78 KLSKRPT--MGDERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGA 155 (318) Q Consensus 78 ~L~G~gv--~Gd~~leGnee~L~~~sd~v~Idq~R~~V~~~g~ms~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~LsG~ 155 (318) .|+|++- .|+..++ .+.|...++.-+|=..-.++... .+++..+.-|...++...|++||++..+..+|..|.|+ T Consensus 59 ~l~Gd~~~~~~~~~i~--~~kitt~~~~a~i~~~~kg~~~t-D~a~~~sg~dp~~~i~~q~a~~w~~~~q~~lla~l~gv 135 (351) T protein:vir:15 59 DLTGDPDNWTDSDDID--VNNLTSGKQQGIKFYQTKAYGYT-DLGTMISGAPVQETIGNRFAAFWQRADQKTLLSVLKGV 135 (351) T ss_pred cCCCcccccCCCcccc--hheecccceeEEEEeeccceehh-hhhHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 9988753 3333343 57888899999988877887765 44677777899999999999999999999999999888 Q ss_pred ccccccccceeecccccccccccccccCCCCCCceEeecCCccccccccccccCHHHHHHHHHHHHhcCCCCceeEeccc Q lcl|Aclame:pro 156 RGDFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGD 235 (318) Q Consensus 156 rg~~~n~~~~~p~~~~~~~~~~~~N~v~aPt~~r~~~~~~at~~~~i~a~D~~s~~~Id~a~~~a~~~~~pi~Pv~v~g~ 235 (318) .+... . .|. |.+ +.+. .-.++-+||.+.+-+|..++-. T Consensus 136 ~~~~~-------------~----~~~--------~~~--d~t~--~~~~~~~is~~~l~~A~~~~GD------------- 173 (351) T protein:vir:15 136 MGVTK-------------I----ANS--------KVY--DQTK--VSPSEPMFGAKGFTGAIGLMGD------------- 173 (351) T ss_pred hhchh-------------h----ccc--------cee--cccc--ccccccccCHHHHHHHHHHhcc------------- Confidence 76321 0 010 111 0010 1123446888988888776522 Q ss_pred cccCCcceEEEEecHHHHHHHHhCcchHHHHHHHHHHhhccccccCCcccCCeEEEcCEEEEecCceeEEEc-------- Q lcl|Aclame:pro 236 ELHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFY-------- 307 (318) Q Consensus 236 ~~~~~~~~yV~~l~P~q~~dLr~d~~~~~w~~~qk~A~~r~~g~~nPlF~G~~gm~ngvii~e~~~~~iRf~-------- 307 (318) + .++...+++|||..+++|+++- . ++..++. .| .+.+|.|+|+.|.--..+|+=.. T Consensus 174 ~--~~~~~~~ivmhS~v~~~L~~~~-l---i~~~~~s----~~------~~~i~t~~G~~VivdD~~p~~~~~~~~~~yt 237 (351) T protein:vir:15 174 L--QDTAFGAIAVNSATYSLMKVQG-L---IETIQPQ----NG------ATPFEAYNGLRIVLDDDIEIDLTDKTKPVST 237 (351) T ss_pred c--cccceEEEEEChHHHHHHHhhh-h---hhhcccc----cc------CcccceecceEEEEcCCCccccCCCCCceeE Confidence 1 1123589999999999999973 2 3333321 11 23578899988877666664111 Q ss_pred ----CCCeeeeeeeC Q lcl|Aclame:pro 308 ----QGQRFWYQRIT 318 (318) Q Consensus 308 ----ag~~v~~a~~~ 318 (318) .-+.+.|.... T Consensus 238 syl~~~GAi~~~~~~ 252 (351) T protein:vir:15 238 SYIFAPGAVRYSTNM 252 (351) T ss_pred EEEEecceeeeecCC Confidence 11222222211 No 45 >protein:vir:105645 Length: 400 # NCBI annotation: putative major capsid protein # Family: family:all:2806 # MgeID: mge:1674 # MgeName: K1E # Cross-refs: genbank:acc:YP_425009;genbank:gi:83571757;uniprot:Q2WC43;genbank:GeneID:3837286 Probab=98.19 E-value=6e-07 Score=54.70 Aligned_cols=253 Identities=10% Similarity=0.010 Sum_probs=135.7 Q ss_pred CCcCCccchhHHHHHHHHHHhcccchHHHHhhhhhhhhhhhhcccccccCCCCCccEEEeeccCCCCCcEEEEEEeeccc Q lcl|Aclame:pro 1 MTTVTSAQANKLFQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLS 80 (318) Q Consensus 1 ~t~~~~~~~~~~~a~~lft~~~~~~~~v~~ws~~l~~~~~~~~~~~~~~G~~~~s~I~~~~dL~k~~Gd~v~f~L~~~L~ 80 (318) ||+.+...=.. +++ +-...+=+++.|.+.+...-...+-+..+ +.++. + ..|.++.|+-....+ T Consensus 1 Ms~~n~~t~p~-~~g----sg~~~aL~Le~f~GeV~taF~~~si~~~~------~~vRt---I--~~gkS~qf~~lG~s~ 64 (400) T protein:vir:10 1 MSTPNNLTNVA-VSA----SGEVDSLLIEKFNGKVNEQYLKGENIMSY------FDVQT---V--TGTNTVSNKYLGETE 64 (400) T ss_pred CCCCccccccc-ccc----ccchhhhHHhHhcchHHHHHHHHhhhccc------ceeee---e--cccceEEEEEeeeeE Confidence 88886552111 110 01112224889999876666655554322 23332 3 568899999998887 Q ss_pred cCCeecCceeecchhhhhheeeEEEEecc---cccccccchhhhhhhhhh-HHHHHHHHHHHHHHHHHHHHHHH--HHhc Q lcl|Aclame:pro 81 KRPTMGDERVEGRGEDLSHADFSLKINQG---RHLVDAGGRMSQQRTKFN-LASSARTLLGTYFNDLQDQCAIV--HLAG 154 (318) Q Consensus 81 G~gv~Gd~~leGnee~L~~~sd~v~Idq~---R~~V~~~g~ms~qrs~~d-lr~~ar~~L~~w~~~~~D~~~~~--~LsG 154 (318) -...+-.+.+.|+ ........|+||.+ ||.|. .+++...-+| +|.+--..+..-+++..||.+|- .+++ T Consensus 65 a~y~~pG~~ldg~--~~~~dk~~ItIDtLL~a~~~V~---dlDd~q~~yD~vRse~s~e~G~ALA~~~Dq~iiq~i~~a~ 139 (400) T protein:vir:10 65 LQVLAPGQSPAAT--STQADKNQLVIDATVIARNTVA---HLHDVQGDIDSLKPKLATNQAKQLKKMEDEMLIQQMLLGG 139 (400) T ss_pred EeeecCCCCcCCC--CcccCcEEEEeCceeeecchhh---hHHHHhhccccccHHHHHHHHHHHHHHHHHHHHHHHHHhc Confidence 7777777778877 46777778999987 55553 4566677789 89999999999999999998873 3443 Q ss_pred cccccccccceeecccccccccccccccCCCCCCceEeecCCccccccccccccCHHHHHHHHHHHHhcCCCCceeEecc Q lcl|Aclame:pro 155 ARGDFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSG 234 (318) Q Consensus 155 ~rg~~~n~~~~~p~~~~~~~~~~~~N~v~aPt~~r~~~~~~at~~~~i~a~D~~s~~~Id~a~~~a~~~~~pi~Pv~v~g 234 (318) ... ...|. .+.+...+.+.. .+. +++ .+.+...+.+.-++.+ |.....+..-| . + T Consensus 140 ~a~------t~~~~----~~~~g~~~g~s~-----~v~--~~~-~~~~~~~~~l~~A~~~-A~~~LdEkdVP-----~-~ 194 (400) T protein:vir:10 140 IAN------TQAKR----TNPRVKGHGFSV-----NVE--VNE-GEALVNPQYVMAAVEF-ALEQQLEQEVD-----I-S 194 (400) T ss_pred ccc------ccccc----ccCCccccccce-----eec--ccc-cccccCHHHHHHHHHH-HHHHHHhcCCC-----c-c Confidence 211 00111 111000011100 000 111 1111111222222222 22222232222 1 1 Q ss_pred ccccCCcceEEEEecHHHHHHHHhCcchHHHHHHHHHHhhccccccCCcccCCeEEEcCEEEEecCceeEEEcCCCeeee Q lcl|Aclame:pro 235 DELHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGQRFWY 314 (318) Q Consensus 235 ~~~~~~~~~yV~~l~P~q~~dLr~d~~~~~w~~~qk~A~~r~~g~~nPlF~G~~gm~ngvii~e~~~~~iRf~ag~~v~~ 314 (318) + +|+|+.|..|.-|+..+- . .- ..-... ..+..=+|.+++.+||-|.|-+++|- ..+++.. T Consensus 195 -d-------~vvl~pp~~Ys~Ll~~dk---L---vn-rdf~~s-~~g~~~~g~v~~v~Gv~Iv~Sn~lP~---~a~~~~~ 255 (400) T protein:vir:10 195 -D-------VAILMPWRYFNVLRDADR---I---VD-KSYTIS-QSGATIQGFVLSSYNCPVIPSNRFPK---YSQGQKH 255 (400) T ss_pred -c-------eEEEcCHHHHHHHHhCCc---c---cc-hhcccc-CCCccccceEEEEeceEEEeeCcCCc---ccCcccc Confidence 1 678887777777765431 1 11 111000 12556679999999999999988872 1222222 Q ss_pred eeeC Q lcl|Aclame:pro 315 QRIT 318 (318) Q Consensus 315 a~~~ 318 (318) ..++ T Consensus 256 ~~lS 259 (400) T protein:vir:10 256 HLLS 259 (400) T ss_pred cccc Confidence 2222 No 46 >protein:vir:5974 Length: 324 # NCBI annotation: hypothetical protein # Family: family:all:1522 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690674;genbank:geneid:6329212;genbank:gi:22855068;goa:Q38582;uniprot:Q38582;genbank:GeneID:955303 Probab=98.15 E-value=1.3e-07 Score=58.25 Aligned_cols=234 Identities=10% Similarity=0.064 Sum_probs=137.9 Q ss_pred CCcCCccchhHHHHHHHHHHhcccchHHHHhhhhhhhhhhhhcccccccCCCCCccEEEeeccC--CCCCcEEEEEEeec Q lcl|Aclame:pro 1 MTTVTSAQANKLFQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLN--KQAGDEVTFSIMHK 78 (318) Q Consensus 1 ~t~~~~~~~~~~~a~~lft~~~~~~~~v~~ws~~l~~~~~~~~~~~~~~G~~~~s~I~~~~dL~--k~~Gd~v~f~L~~~ 78 (318) |++..-++-. .| +.|..-+.... .....|.|++--.+...+.++- ..+|+.|+|+.... T Consensus 1 MA~T~lsd~i--------------~p--eVf~~yv~~~~---~~~~~l~qSg~i~~~a~i~~~l~~~~~G~~i~~P~~~~ 61 (324) T protein:vir:59 1 MAYTKISDVI--------------VP--ELFNPYVINTT---TQLSAFFQSGIAATDDELNALAKKAGGGSTLNMPYWND 61 (324) T ss_pred CCceeeecee--------------ch--hHHHHHHHhhh---HHHHHHhhcccccccHHHHHHhhccCCCCEEEeccccc Confidence 4432211111 11 11322221111 1234566666444444455543 25899999999999 Q ss_pred cccCCe--ecCceeecchhhhhheeeEEEEecccccccccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhccc Q lcl|Aclame:pro 79 LSKRPT--MGDERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGAR 156 (318) Q Consensus 79 L~G~gv--~Gd~~leGnee~L~~~sd~v~Idq~R~~V~~~g~ms~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~LsG~r 156 (318) |.|++. .++..++ -+.|...++.-+|=+.-.++... ..++..+.-|...++...|++||++..+..+|..|.|+. T Consensus 62 l~Gd~~~v~~~~~i~--~~~l~t~~~~a~i~~~~k~~~~t-D~a~~~sg~dp~~~i~~q~a~~~~~~~~~~lia~l~g~~ 138 (324) T protein:vir:59 62 LDGDSQVLNDTDDLV--PQKINAGQDKAVLILRGNAWSSH-DLAATLSGSDPMQAIGSRVAAYWAREMQKIVFAELAGVF 138 (324) T ss_pred CCCcccccCCCcccc--hhhcccceeeEEEEeecCceeeh-hhhhhhccchHHHHHHHHHHHHHHHHHHHHHHHHHHHhh Confidence 988743 3333333 57888888888888877787665 446677788999999999999999999999999998877 Q ss_pred cccccccceeecccccccccccccccCCCCCCceEeecCCccccccccccccCHHHHHHHHHHHHhcCCCCceeEecccc Q lcl|Aclame:pro 157 GDFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDE 236 (318) Q Consensus 157 g~~~n~~~~~p~~~~~~~~~~~~N~v~aPt~~r~~~~~~at~~~~i~a~D~~s~~~Id~a~~~a~~~~~pi~Pv~v~g~~ 236 (318) +.....+ ...++.+ +++-+||.+.+.+|..+. ||+ T Consensus 139 ~~~~~~~--------------~~~dvsa------------------~~~~~~s~~~l~~A~~~~-------------GD~ 173 (324) T protein:vir:59 139 SNDDMKD--------------NKLDISG------------------TADGIYSAETFVDASYKL-------------GDH 173 (324) T ss_pred hcccccc--------------ceeeeec------------------cccceecHHHHHHHHHHh-------------CCc Confidence 5321100 0001111 122358888888877664 332 Q ss_pred ccCCcceEEEEecHHHHHHHHhCcchHHHHHHHHHHhhccccccCCcccCCeEEEcCEEEEecCceeEEEc--------- Q lcl|Aclame:pro 237 LHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFY--------- 307 (318) Q Consensus 237 ~~~~~~~yV~~l~P~q~~dLr~d~~~~~w~~~qk~A~~r~~g~~nPlF~G~~gm~ngvii~e~~~~~iRf~--------- 307 (318) .+...+++|||..+++|+++- .-+|. ++. .+ .+.+|.|+|+.|..--.+|+=.. T Consensus 174 ---~~~~~~ivmhS~v~~~L~~~~-li~~~---~~s----~~------~~~i~~~~G~~VivdD~~p~~~~~~~~~~y~s 236 (324) T protein:vir:59 174 ---ESLLTAIGMHSATMASAVKQD-LIEFV---KDS----QS------GIRFPTYMNKRVIVDDSMPVETLEDGTKVFTS 236 (324) T ss_pred ---ccCcEEEEEchHHHHHHHHhh-hhhhc---ccc----cc------CceeeeecccEEEEeCCCCccccCCCCceEEE Confidence 123589999999999999983 22332 221 11 13568888887766444554211 Q ss_pred ---CCCeeeeeeeC Q lcl|Aclame:pro 308 ---QGQRFWYQRIT 318 (318) Q Consensus 308 ---ag~~v~~a~~~ 318 (318) .-+.+.+.... T Consensus 237 ~l~~~GAi~~~~~~ 250 (324) T protein:vir:59 237 YLFGAGALGYAEGQ 250 (324) T ss_pred EEEecCeEEEeecC Confidence 11222222111 No 47 >protein:vir:99075 Length: 392 # NCBI annotation: gp30 # Family: family:all:10837 # MgeID: mge:1671 # MgeName: Wildcat # Cross-refs: genbank:acc:YP_655895;genbank:gi:109521467;genbank:GeneID:4158040 Probab=97.98 E-value=2e-06 Score=51.82 Aligned_cols=229 Identities=9% Similarity=-0.051 Sum_probs=126.6 Q ss_pred HHHHHHHHhcccchHHHHhhhhhhhhhhhhcccccccCCCCCccEEEe--eccCCCCCcEEEEEEeeccccCCeecCcee Q lcl|Aclame:pro 13 FQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRI--TDLNKQAGDEVTFSIMHKLSKRPTMGDERV 90 (318) Q Consensus 13 ~a~~lft~~~~~~~~v~~ws~~l~~~~~~~~~~~~~~G~~~~s~I~~~--~dL~k~~Gd~v~f~L~~~L~G~gv~Gd~~l 90 (318) ||..+|. -++|+..+-..-.+..-|..+ +.+- .|+.-..||+|++............-.... T Consensus 1 Ma~~~~~--------p~~~a~~~l~~l~~~lv~~~l--------v~~~~~~~~~~~~GdtV~i~~~~~~~~~~~~~~~~~ 64 (392) T protein:vir:99 1 MANAFSK--------PTAVVDTAIQMLQNELILTNL--------VWLNGIGDFAHKFNDTITVRVPAPSRGHTRKLRGAG 64 (392) T ss_pred Ccccccc--------HHHHHHHHHHHHHhhccchhh--------hccccccccccCCCCeEEEeecccccceeeeccccc Confidence 4433332 234665432222111112111 1111 244446799999987766654433322222 Q ss_pred e---cchhhhhheeeEEEEeccccc-ccccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccee Q lcl|Aclame:pro 91 E---GRGEDLSHADFSLKINQGRHL-VDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGDFVADDTIL 166 (318) Q Consensus 91 e---Gnee~L~~~sd~v~Idq~R~~-V~~~g~ms~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~LsG~rg~~~n~~~~~ 166 (318) + ...+++.-...++.|||.++. +...+ .+.-....|+++..-+....=+++..|+.++..++++.... T Consensus 65 ~~~~~~~~~~~~~~~~~~id~~k~~~~~i~d-~e~~~~~~~~~~~~~~~a~~ala~~vd~~i~~~~~~a~~~~------- 136 (392) T protein:vir:99 65 AERNLTVSDFTEDSFPVTLTDVAYHLGVLTD-EELTFDLESFATQILPRQVRGVADILEEGVRDMIVGAPYEA------- 136 (392) T ss_pred cCCcccccccccceEEEEEeeeeecceeech-HHHhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhcccccc------- Confidence 2 234567778889999888654 44543 35555678898888787888888889988887776543210 Q ss_pred ecccccccccccccccCCCCCCceEeecCCccccccccccccCHHHHHHHHHHHHhcCCCCceeEeccccccCCcceEEE Q lcl|Aclame:pro 167 PTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHGEDPYYVL 246 (318) Q Consensus 167 p~~~~~~~~~~~~N~v~aPt~~r~~~~~~at~~~~i~a~D~~s~~~Id~a~~~a~~~~~pi~Pv~v~g~~~~~~~~~yV~ 246 (318) . +....++. ...++.|-.|...++...-| +| +++ T Consensus 137 ---------~--------------------~~~~~~~~--~~~~~~i~~a~~~L~~~~vP------~~---------R~~ 170 (392) T protein:vir:99 137 ---------A--------------------GAVHEVAP--DEFFKGVNGARRALNELYIP------QG---------RVL 170 (392) T ss_pred ---------c--------------------ccccccCh--hhhHHHHHHHHHHHhhcCCC------CC---------CEE Confidence 0 00000111 12356676777777776544 12 467 Q ss_pred EecHHHHHHHHhCcchHHHHHHHHHHhhccccccCCcccCCeEEEcCEEEEecCceeEE----EcCCCeeee--eeeC Q lcl|Aclame:pro 247 YVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIR----FYQGQRFWY--QRIT 318 (318) Q Consensus 247 ~l~P~q~~dLr~d~~~~~w~~~qk~A~~r~~g~~nPlF~G~~gm~ngvii~e~~~~~iR----f~ag~~v~~--a~~~ 318 (318) +++|..+..|.+|+.+..+..... .....|-.|.+|.+.|+-+++.+.+|-- |.....+-+ +-++ T Consensus 171 vv~p~~~~~l~~~~~~~~~~~~g~-------~~~~~l~~G~vg~i~G~~v~~s~~~~~~t~~a~~~~a~~~at~a~v~ 241 (392) T protein:vir:99 171 VVGTAVTEQILNDDRFIKYESQGQ-------SAVSALQEARLGRIYGYEIVESTLIPHGDAYLYHPTAFIMATRAPAP 241 (392) T ss_pred EEcHHHHHHHhcccceeecccccc-------hhhhhhhcceeeeeeeeEEEeecccccccceeeeccccccccccccc Confidence 789999999999987644322111 1113466799999999999998765421 110000000 1011 No 48 >protein:vir:102944 Length: 330 # NCBI annotation: major head protein # Family: family:all:1522 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945286;genbank:gi:39653721;uniprot:Q708M6;genbank:GeneID:2672858 Probab=97.97 E-value=4.8e-07 Score=55.23 Aligned_cols=239 Identities=11% Similarity=0.081 Sum_probs=132.3 Q ss_pred CCcCCccchhHHHHHHHHHHhcccchHHHHhhhhhhhhhhhhcccccccCCCCCccEEEeecc---CCCCCcEEEEEEee Q lcl|Aclame:pro 1 MTTVTSAQANKLFQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDL---NKQAGDEVTFSIMH 77 (318) Q Consensus 1 ~t~~~~~~~~~~~a~~lft~~~~~~~~v~~ws~~l~~~~~~~~~~~~~~G~~~~s~I~~~~dL---~k~~Gd~v~f~L~~ 77 (318) |.+-+...+. .=.|-| |..-+.....+ ...|.|++ +|+...+| -.++|+.|+|++.. T Consensus 1 Ma~~~T~l~d------------~i~pev--f~~yv~~~~~~---~~~l~qSG---~i~~~~~i~~~~~~~G~~i~~P~~~ 60 (330) T protein:vir:10 1 MANELTKILD------------TITPQQ--YNAYMQQYTAA---KSAFVQSG---IAVSDERVSKNITSGGLLVNMPFWN 60 (330) T ss_pred CCCCceEeee------------eechhH--HHHHHHHHhHH---hhhhhhcc---cccccHHHHHHhhcCCCEEEecccc Confidence 3321111110 001111 22221111111 23455553 45453333 34799999999999 Q ss_pred ccccCC-ee--cCceeecchhhhhheeeEEEEecccccccccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhc Q lcl|Aclame:pro 78 KLSKRP-TM--GDERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAG 154 (318) Q Consensus 78 ~L~G~g-v~--Gd~~leGnee~L~~~sd~v~Idq~R~~V~~~g~ms~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~LsG 154 (318) .|+|+. +. |++.++ -+.|...++..+|=+.-.++.... ++..=+.-|...++...+++||++..+..++..|.| T Consensus 61 ~l~G~~~~~~dg~~~i~--~~ki~t~~~~a~i~~~~k~~~~tD-~a~~~~g~dp~~~i~~q~a~~w~~~~q~~lla~l~g 137 (330) T protein:vir:10 61 DLTGDSEVLGNGDKALE--TGKITAGADIACVLYRGRGWAANE-LTGVVAGSDPVRAILNRIGAYWLREDQKALIATLNG 137 (330) T ss_pred cCCCcccccCCCccccc--hhhcccceeEEEEEeecceeeehh-hhhhhcchhHHHHHHHHHHHHhhhhHHHHHHHHHHh Confidence 998874 33 222333 478888999998888877776643 345556679999999999999999999999999988 Q ss_pred cccccccccceeecccccccccccccccCCCCCCceEeecCCccccccccccccCHHHHHHHHHHHHhcCCCCceeEecc Q lcl|Aclame:pro 155 ARGDFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSG 234 (318) Q Consensus 155 ~rg~~~n~~~~~p~~~~~~~~~~~~N~v~aPt~~r~~~~~~at~~~~i~a~D~~s~~~Id~a~~~a~~~~~pi~Pv~v~g 234 (318) ..+........ .+ ..|.... .-+++-.|+.+.+-+|....-. T Consensus 138 vf~~~~~~~~~-------~~---~~~~~~~----------------~~~~~a~~s~~~l~~A~~~~GD------------ 179 (330) T protein:vir:10 138 IFATGTAGEKG-------AL---EETHVSD----------------QSKASTGIDAGMVLDAKQLLGD------------ 179 (330) T ss_pred hhhhhhcccch-------hh---hhhheec----------------ccccccccCHHHHHHHHHHhcc------------ Confidence 87642211100 00 0011000 0112336788877776554322 Q ss_pred ccccCCcceEEEEecHHHHHHHHhCcchHHHHHHHHHHhhccccccCCcccCCeEEEcCEEEEecCceeE------E-Ec Q lcl|Aclame:pro 235 DELHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPI------R-FY 307 (318) Q Consensus 235 ~~~~~~~~~yV~~l~P~q~~dLr~d~~~~~w~~~qk~A~~r~~g~~nPlF~G~~gm~ngvii~e~~~~~i------R-f~ 307 (318) +. +...+++|||.++++|+++- .-+| .++. .+ .+.+|.|+|+.|..--.+|. - +. T Consensus 180 -~~---~~~~~ivmhS~v~~~L~~~~-li~~---~~~s----~~------~~~i~~~~G~~VivdD~~p~~~~~yt~yl~ 241 (330) T protein:vir:10 180 -SA---DQVTAIAMHSAVYTKLQKDN-LIQY---IQPT----TA------TINIPTYLGYRVIIDDGIAPTGDIYTSYLF 241 (330) T ss_pred -cc---ccceEEEEcHHHHHHHHHhh-hhhh---hccc----cc------CcccccccceEEEEeCCCCCCCCceeEEEE Confidence 21 12589999999999999863 2233 2321 11 25678888888775444441 0 11 Q ss_pred CCCeeeeee-----eC Q lcl|Aclame:pro 308 QGQRFWYQR-----IT 318 (318) Q Consensus 308 ag~~v~~a~-----~~ 318 (318) ..+.+.+.. .. T Consensus 242 ~~GAi~~~~~~~~~~v 257 (330) T protein:vir:10 242 RTGSIGLNTGNPSGLT 257 (330) T ss_pred ecCceeeecccCCccc Confidence 223333321 11 No 49 >protein:vir:79008 Length: 299 # NCBI annotation: putative main capsid protein # Family: family:all:701 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110725;genbank:gi:134287342;genbank:GeneID:4955182 Probab=97.81 E-value=7.4e-06 Score=48.73 Aligned_cols=225 Identities=11% Similarity=0.033 Sum_probs=121.7 Q ss_pred CCcCCccchhHHHHHHHHHHhcccchHHHHhhhhhhhhhhhhcccccccCCCCCccEEEeeccCCCCCcEEEEEEeeccc Q lcl|Aclame:pro 1 MTTVTSAQANKLFQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLS 80 (318) Q Consensus 1 ~t~~~~~~~~~~~a~~lft~~~~~~~~v~~ws~~l~~~~~~~~~~~~~~G~~~~s~I~~~~dL~k~~Gd~v~f~L~~~L~ 80 (318) |.+++++ .+|+..|.....+.+.+..+-++..+.-| .-..|++|.++-+ + T Consensus 1 MA~~n~a---------------------~~~~~~Ld~~~~~~l~~~~L~~~~~~~~v------~~~gg~tVkI~~i---~ 50 (299) T protein:vir:79 1 MAALNYA---------------------KEYSNVLAQAYPYTLNFGDLYATPNNGRY------RWTGSKTIEIPTI---S 50 (299) T ss_pred CccchhH---------------------HHHHHHHHHHHHhhceeeeeccCccccee------eecCCCEEEEecc---c Confidence 3333322 34666655555544444322222211111 1134899998744 3 Q ss_pred cCCeecCceee--c-chhhhhheeeEEEEecccccccccchhhhhhhhhhH--HHHHHHHHHHHHHHHHHHHHHHHHh-c Q lcl|Aclame:pro 81 KRPTMGDERVE--G-RGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNL--ASSARTLLGTYFNDLQDQCAIVHLA-G 154 (318) Q Consensus 81 G~gv~Gd~~le--G-nee~L~~~sd~v~Idq~R~~V~~~g~ms~qrs~~dl--r~~ar~~L~~w~~~~~D~~~~~~Ls-G 154 (318) ..++ +|-... | +.++++....++.+||.|----.=..|+.-.|-..+ -...+....+...-.+|.-.|-.|+ + T Consensus 51 ~~gl-~DY~R~~~g~~~g~~~~~~~t~~ldqdr~~~f~vD~~Dvdet~~~~~~a~v~~~~~~~~v~pEiDay~~skl~~~ 129 (299) T protein:vir:79 51 TTGR-VDSNRDTIAVAQRNYDNAWEPKVLTNQRKWSTLVHPADINQTNYVASIGNITKVYNEEQKFPEMDAYCISKIYAD 129 (299) T ss_pred cccc-cccccCCCcccccccCcceeEEEeeccccceeccchhhHHHHhhhhHHHHHHHHHHHHHhhhHhhHHHHHHHHHh Confidence 3333 454432 2 344788889999999998553333444444333222 2223333333444455665565553 2 Q ss_pred cccccccccceeecccccccccccccccCCCCCCceEeecCCccccccccccccCHHHHHHHHHHHHhcCCCCceeEecc Q lcl|Aclame:pro 155 ARGDFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSG 234 (318) Q Consensus 155 ~rg~~~n~~~~~p~~~~~~~~~~~~N~v~aPt~~r~~~~~~at~~~~i~a~D~~s~~~Id~a~~~a~~~~~pi~Pv~v~g 234 (318) +.+. +..++...++++.+ ++.|+.+.+.+++.+-|- T Consensus 130 a~~~-----------------------------------g~~~~~~~~T~~n~--y~~i~~~~~~lde~~vP~------- 165 (299) T protein:vir:79 130 WTAL-----------------------------------GNTADTTVLTTTNV--LEVFDKLMEKMTEARVPE------- 165 (299) T ss_pred hhhc-----------------------------------CCcccccccCHHHH--HHHHHHHHHHHHhcCCCC------- Confidence 1110 01122233555554 688999999999866551 Q ss_pred ccccCCcceEEEEecHHHHHHHHhCcchHHHHHHHHHHhhccccccCCcccCCeEEEcCEEEEecCceeEEEc------C Q lcl|Aclame:pro 235 DELHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFY------Q 308 (318) Q Consensus 235 ~~~~~~~~~yV~~l~P~q~~dLr~d~~~~~w~~~qk~A~~r~~g~~nPlF~G~~gm~ngvii~e~~~~~iRf~------a 308 (318) +. +||+|+|.-+.-|+.++.| ++.-.. +..+....|.+|.++||-|.|-|. -||+ . T Consensus 166 ~~-------rvl~vtp~~~~~L~~~~~f------~k~~~~---~~~~~~~~g~Vg~idG~~Ii~Vps--~r~~t~~~~~~ 227 (299) T protein:vir:79 166 NG-------RILYVTPVVNTLIKNAKEI------QRTVNI---KDAGTSLNRQTTDIDTVKIIKVPS--NLMKTAYDFTT 227 (299) T ss_pred CC-------eEEEeCHHHHHHHhhchhh------hccccc---ccccceeeeeeeeecceEEEEech--hhcCccceecc Confidence 11 7999999999999999754 222111 234568899999999999999886 3665 2 Q ss_pred CCeee---------------------eeeeC Q lcl|Aclame:pro 309 GQRFW---------------------YQRIT 318 (318) Q Consensus 309 g~~v~---------------------~a~~~ 318 (318) |-... +..+- T Consensus 228 G~~~~~~ak~in~ii~~~~a~~~~~K~~~~~ 258 (299) T protein:vir:79 228 GWKVGAGAKQIFMSLVHPSAIITPVSYQFSK 258 (299) T ss_pred CccccCcccccceEEEcCCeeeeeEeeeeEE Confidence 31111 11100 No 50 >protein:vir:108303 Length: 418 # NCBI annotation: hypothetical protein # Family: family:all:1412 # MgeID: mge:2007 # MgeName: BA3 # Cross-refs: genbank:acc:YP_001552282;genbank:gi:160700607;genbank:GeneID:5758819 Probab=97.75 E-value=7.3e-06 Score=48.73 Aligned_cols=221 Identities=9% Similarity=0.010 Sum_probs=121.9 Q ss_pred HhcccchHHH--HhhhhhhhhhhhhcccccccCCCCCccEEEeeccCCCCCcEEEEEEeeccccCCeecC-ceeecchhh Q lcl|Aclame:pro 20 AANRNRSMVN--ILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLSKRPTMGD-ERVEGRGED 96 (318) Q Consensus 20 ~~~~~~~~v~--~ws~~l~~~~~~~~~~~~~~G~~~~s~I~~~~dL~k~~Gd~v~f~L~~~L~G~gv~Gd-~~leGnee~ 96 (318) ...++|.+++ +|+..+-..-.+..-+...+-+... .|. .++||+|++.....+. ..| ..+ ..++ T Consensus 1 m~~~~N~~ltp~iia~~~l~~l~~~lV~~~lv~r~y~------~e~-~~~GDTV~I~vp~~~~----v~dg~~~--~~~~ 67 (418) T protein:vir:10 1 MAVQDNNLLTDDVIAKEALRLLKNNLVMAKCVYRNYE------KTF-GKVGDTIRLKLPYRVK----SASGRTL--VKQP 67 (418) T ss_pred CCccccccccHHHHHHHHHHHHHHhccchhhhcCCCc------hHH-hhCCCEEEEeeCCcee----ecccCCc--cccc Confidence 3445555543 7777654333333333222222211 223 3579999998765553 111 112 2356 Q ss_pred hhheeeEEEEecccc-cccccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccceeecccccccc Q lcl|Aclame:pro 97 LSHADFSLKINQGRH-LVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGDFVADDTILPTAEHPEFK 175 (318) Q Consensus 97 L~~~sd~v~Idq~R~-~V~~~g~ms~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~LsG~rg~~~n~~~~~p~~~~~~~~ 175 (318) +.-.+-.|.||+..+ ++...++ ++--+.-||++..-.....-+++..|+.++-.+.++.. T Consensus 68 ~te~~v~l~id~~k~~~~~itD~-e~a~~~~d~~~~~l~~A~~aLA~~vD~~ia~l~~~a~~------------------ 128 (418) T protein:vir:10 68 MVDQTIPFKIAYQEHVGLEYTVK-DKTLDIMQFSERYLKSGMVQIANQIDRSLALTLKKAFH------------------ 128 (418) T ss_pred cccceEEEEEecccccceeechH-HHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHhhccc------------------ Confidence 666777899988865 4555544 33334568876666666666777777776654433221 Q ss_pred cccccccCCCCCCceEeecCCccccccccccccCHHHHHHHHHHHHhcCCCCceeEeccccccCCcceEEEEecHHHHHH Q lcl|Aclame:pro 176 KIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHGEDPYYVLYVTPRQWND 255 (318) Q Consensus 176 ~~~~N~v~aPt~~r~~~~~~at~~~~i~a~D~~s~~~Id~a~~~a~~~~~pi~Pv~v~g~~~~~~~~~yV~~l~P~q~~d 255 (318) .+..+ .. . . -.++.|-.+...++..+-| -+|+ +.+++.|..+.. T Consensus 129 -----~~gt~-----------gt----~-~--~~~~~i~~a~~~Ld~~~VP-----~~G~--------R~lVv~P~~~~~ 172 (418) T protein:vir:10 129 -----SSGTP-----------GV----R-P--GAFIDFANAGAKQTTYAVP-----QDGM--------RHAVLDPFTCAS 172 (418) T ss_pred -----ccccC-----------Cc----C-c--chHHHHHHHHHHHHhcCCC-----CCCc--------eEEEeCHHHHHH Confidence 00000 00 0 0 1256666778888776655 1222 577899999999 Q ss_pred HHhCcchHHHHHHHHHHhhccccccCCcccCCeEEEcCEEEEecCceeEEE---cCC-CeeeeeeeC Q lcl|Aclame:pro 256 WYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRF---YQG-QRFWYQRIT 318 (318) Q Consensus 256 Lr~d~~~~~w~~~qk~A~~r~~g~~nPlF~G~~gm~ngvii~e~~~~~iRf---~ag-~~v~~a~~~ 318 (318) |..|..+ .+ . .. +....|-.|.+|.+.|+-|++..++|.-= +.| ..|..|.-+ T Consensus 173 L~~~~~~-~~-------~-~~-~~~~~lr~G~IG~i~GF~V~~S~nip~~tag~~~~t~~v~ga~~~ 229 (418) T protein:vir:10 173 LSDEVTK-LF-------K-ES-MVEQAYKMGYRGNVAAYEVYESQNLPKHTVGDHGGTPLVNGTVVN 229 (418) T ss_pred Hhhhccc-cc-------c-cc-ccchhhheeeeeeeeceEEEEecCCCcccccccccceeeeccccc Confidence 9987643 11 1 11 33456778999999999999999887421 111 122212111 No 51 >protein:vir:7019 Length: 401 # NCBI annotation: major capsid protein # Family: family:all:2806 # MgeID: mge:141 # MgeName: SP6 # Cross-refs: genbank:acc:NP_853592;genbank:gi:31711674;genbank:GeneID:1481800 Probab=97.75 E-value=5.7e-06 Score=49.32 Aligned_cols=253 Identities=11% Similarity=0.030 Sum_probs=134.7 Q ss_pred CCcCCccchhHHHHHHHHHHhcccchHHHHhhhhhhhhhhhhcccccccCCCCCccEEEeeccCCCCCcEEEEEEeeccc Q lcl|Aclame:pro 1 MTTVTSAQANKLFQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLS 80 (318) Q Consensus 1 ~t~~~~~~~~~~~a~~lft~~~~~~~~v~~ws~~l~~~~~~~~~~~~~~G~~~~s~I~~~~dL~k~~Gd~v~f~L~~~L~ 80 (318) ||+.+...=.. +++ +-...+-+++.|.+.+...-...+-|..+ +.++. + ..|.++.|+-+...+ T Consensus 1 Ms~~n~~t~~~-~~~----sg~~~al~Le~f~GeV~taF~~~si~~~~------~~vRt---i--~~gkS~qf~~~G~s~ 64 (401) T protein:vir:70 1 MSTPNNLTNVA-VSA----SGEVDSLLIEKFNGKVNEQYLKGENIMSY------FDVQT---V--TGTNTVSNKYLGETE 64 (401) T ss_pred CCCCccccccc-ccc----ccchhHhHHhHhcchHHHHHHHHhhhccc------ceeee---e--cccceEEEEEeeeeE Confidence 88886552211 111 01122335888999876666655554322 23332 3 568899999998877 Q ss_pred cCCeecCceeecchhhhhheeeEEEEecc---cccccccchhhhhhhhhh-HHHHHHHHHHHHHHHHHHHHHH--HHHhc Q lcl|Aclame:pro 81 KRPTMGDERVEGRGEDLSHADFSLKINQG---RHLVDAGGRMSQQRTKFN-LASSARTLLGTYFNDLQDQCAI--VHLAG 154 (318) Q Consensus 81 G~gv~Gd~~leGnee~L~~~sd~v~Idq~---R~~V~~~g~ms~qrs~~d-lr~~ar~~L~~w~~~~~D~~~~--~~LsG 154 (318) -....-.+.+.|+ ........|.||.. ||.|. .+++..+-+| +|.+--..+..-+++..||.++ +.++| T Consensus 65 ~~~~~pG~~ld~~--~~~~dK~~ItID~lL~a~~~V~---dlDe~q~~yD~vRse~s~e~G~ALA~~~Dq~iiq~i~~aa 139 (401) T protein:vir:70 65 LQVLAPGQSPAAT--STQADKNQLVIDATVIARNTVA---HLHDVQGDIDSLKPKLATNQAKQLKRMEDEMLIQQMMLGG 139 (401) T ss_pred eeeecCCCCcCCC--CcccccEEEEeCceeehhhhhh---hHHHHHhcccccchHHHHHHHHHHHHHHHHHHHHHHHHhc Confidence 6666666667764 56667777999987 55553 4567777789 8999989999999999999774 33444 Q ss_pred cccccccccceeecccccccccccccccCCCCCCceEeecCCccccccccccccCHHHHHHHHHHHHhcCCCCceeEecc Q lcl|Aclame:pro 155 ARGDFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSG 234 (318) Q Consensus 155 ~rg~~~n~~~~~p~~~~~~~~~~~~N~v~aPt~~r~~~~~~at~~~~i~a~D~~s~~~Id~a~~~a~~~~~pi~Pv~v~g 234 (318) ... . .|... ||..-+ ....+-.+++.....++....+ +.+..|....++..-| .+ T Consensus 140 ~an-a------~~~~~---------~p~~~~-~G~~i~v~~~~~~~~~~~~~l~--~ai~dA~~~LdEkdVP------~~ 194 (401) T protein:vir:70 140 IAN-T------QAKRT---------NPRVKG-HGFSINVEVAEGEALVNPQYVM--AAVEFALEQQLEQEVD------IS 194 (401) T ss_pred ccc-c------ccccc---------CCCcCC-CceEEeccccccccccCHHHHH--HHHHHHHHHHHhcCCC------cc Confidence 321 0 00000 110000 0001111111111111111111 1122233333332222 11 Q ss_pred ccccCCcceEEEEecHHHHHHHHhCcchHHHHHHHHHHhhccccccCCcccCCeEEEcCEEEEecCceeEEEcCCCeeee Q lcl|Aclame:pro 235 DELHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGQRFWY 314 (318) Q Consensus 235 ~~~~~~~~~yV~~l~P~q~~dLr~d~~~~~w~~~qk~A~~r~~g~~nPlF~G~~gm~ngvii~e~~~~~iRf~ag~~v~~ 314 (318) -||+|+.|..|+-|..-+.. .+ ..... ...+..=+|.+++.+||-|.|-+++| |.+ +++.- T Consensus 195 --------r~vvl~pp~~Ys~Ll~~d~L---~n----rd~~~-s~~g~~~~G~v~~vaGv~Vv~SnnlP--~~a-~~it~ 255 (401) T protein:vir:70 195 --------DVAILMPWRYFNVLRDADRI---VD----KTYTI-SQSGATIQGFTLSSYNCPVIPSNRFP--KYS-QGQTH 255 (401) T ss_pred --------ceEEEcCHHHHHHHHhcCcc---cc----hhhcc-ccCCccccceEEEEeceEEEeecccc--ccc-ccccc Confidence 27888888888777765421 11 11100 11244567889999999999988766 221 11111 Q ss_pred eeeC Q lcl|Aclame:pro 315 QRIT 318 (318) Q Consensus 315 a~~~ 318 (318) ..++ T Consensus 256 ~~ls 259 (401) T protein:vir:70 256 HLLS 259 (401) T ss_pred cccc Confidence 1111 No 52 >protein:vir:80446 Length: 367 # NCBI annotation: BcepGomrgp07 # Family: family:all:1522 # MgeID: mge:1882 # MgeName: BcepGomr # Cross-refs: genbank:acc:YP_001210227;genbank:gi:146329919;genbank:GeneID:5123555 Probab=97.48 E-value=1.4e-05 Score=47.12 Aligned_cols=256 Identities=13% Similarity=0.111 Sum_probs=141.9 Q ss_pred CCccchhH----HHHHHHHHHhcccchHHHHhhhhhhhhhhhhcccccccCCCCCccEEEeeccC---CCCCcEEEEEEe Q lcl|Aclame:pro 4 VTSAQANK----LFQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLN---KQAGDEVTFSIM 76 (318) Q Consensus 4 ~~~~~~~~----~~a~~lft~~~~~~~~v~~ws~~l~~~~~~~~~~~~~~G~~~~s~I~~~~dL~---k~~Gd~v~f~L~ 76 (318) |+...+.- ++--=+|+....+.+. ....|.+++ +|+.-.+|. ...|+.|++++. T Consensus 1 M~~~~~~T~l~Dii~pEvF~~Yv~~~~~----------------e~~~l~qSG---iv~~d~~l~~~~~~gG~~v~iPf~ 61 (367) T protein:vir:80 1 MPDFNNQVRLVDAVIPEVYTSYTAIDRP----------------ELTAFFLSG---AVASNDFLSQFLSAPGRLINIPFW 61 (367) T ss_pred CcchhhhhhhhhccchhhhhHHHhhhhh----------------hhhhhhhcc---eeecCHHHHHHhhcCCCEEEeeee Confidence 22221111 1222234333333331 124455443 566666664 489999999999 Q ss_pred eccccCC-ee-cCce-eecchhhhhheeeEEEEecccccccccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|Aclame:pro 77 HKLSKRP-TM-GDER-VEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLA 153 (318) Q Consensus 77 ~~L~G~g-v~-Gd~~-leGnee~L~~~sd~v~Idq~R~~V~~~g~ms~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~Ls 153 (318) ..|.|+. .. +|+. .+---..+...+|.-+|=.+-.+.... .+++-=+.-|..+....++++||.+.....+|..|. T Consensus 62 ~~L~g~~~n~~~d~~~~~~t~~kittg~~~a~v~~r~kaw~~~-Dla~~lsG~dpm~~Ia~qva~yW~r~~q~~Lla~L~ 140 (367) T protein:vir:80 62 RDLDSLEPNYGSDNPNVEAPIDGLGSGEMKTTKTWLNKAYGAM-DLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIAMAV 140 (367) T ss_pred ccCCCCccccCCCCCcccccccccccchheeeeehhcccchhh-hHHHHhhCchHHHHHHHHHHHHhhhhhHHHHHHHHH Confidence 9998753 22 2221 222235666666666655555554332 334444456899999999999999999999999999 Q ss_pred ccccccccccceeecccccccccccccccCCCCCCceEeecCCccccccccccccCHHHHHHHHHHHHhcCCCCceeEec Q lcl|Aclame:pro 154 GARGDFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLS 233 (318) Q Consensus 154 G~rg~~~n~~~~~p~~~~~~~~~~~~N~v~aPt~~r~~~~~~at~~~~i~a~D~~s~~~Id~a~~~a~~~~~pi~Pv~v~ 233 (318) |..+.....+..... +.....-+++....+|++=-.+.+. +++-+||.+.+-+|...+-. . T Consensus 141 Gvf~~~~a~~~~~~~-----~~~~~~a~~~~~~~~~~~Dis~~t~----~~~~~~s~~~~~~A~~~lGD---~------- 201 (367) T protein:vir:80 141 GVYKSNLAGNFATIK-----TRGRVPAEVLGTAGDMVIDISGQTN----PADAVFNREAFVDAAFTMGD---H------- 201 (367) T ss_pred Hhhccccccchhhhh-----hhhccccccccccCceeeeeeccCC----CccceecHHHHHHHHHHhcc---c------- Confidence 998865443322111 0000011122223333332222221 12236888887777443322 1 Q ss_pred cccccCCcceEEEEecHHHHHHHHhCcchHHHHHHHHHHhhccccccCCcccCCeEEEcCEEEEecCceeEE-------- Q lcl|Aclame:pro 234 GDELHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIR-------- 305 (318) Q Consensus 234 g~~~~~~~~~yV~~l~P~q~~dLr~d~~~~~w~~~qk~A~~r~~g~~nPlF~G~~gm~ngvii~e~~~~~iR-------- 305 (318) +++ +=+++||+..+..|++.- .++..+.. .| ...++.|+|..|..--.||+- T Consensus 202 ~~~------l~~i~mHS~V~~~L~~~~----li~~i~~s----d~------~~~i~ty~G~~VIvDD~~Pv~~~~a~~~y 261 (367) T protein:vir:80 202 VGS------IAAIAVHSMVYKRMTNND----EIEFIPDS----KG------QLTIPTYMGKVVIVDDGMPVFGTGADKTY 261 (367) T ss_pred ccc------ccEEEEchHHHHHHHhcc----ccccccCC----CC------ccccceecceeEEEeCCCcccccCCCceE Confidence 111 358999999999999973 33333321 22 246899999888887777762 Q ss_pred ---EcCCCeeeeeeeC Q lcl|Aclame:pro 306 ---FYQGQRFWYQRIT 318 (318) Q Consensus 306 ---f~ag~~v~~a~~~ 318 (318) +.+.+.|.|.... T Consensus 262 ttYlfg~GAi~~~~~~ 277 (367) T protein:vir:80 262 LSILFGGAAFGYADGA 277 (367) T ss_pred EEEEEecceeeecccC Confidence 1344555555444 No 53 >protein:vir:107120 Length: 329 # NCBI annotation: conserved phage protein # Family: family:all:701 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950606;genbank:gi:119953686;genbank:GeneID:4643129 Probab=96.80 E-value=0.00033 Score=39.68 Aligned_cols=239 Identities=13% Similarity=0.060 Sum_probs=127.3 Q ss_pred CCcCCc-------cchhHHHHHHHHHHhcccchH-HHHhhhhhhhhhhhhcccccccCCCCCccEEEeeccCCCCCcEEE Q lcl|Aclame:pro 1 MTTVTS-------AQANKLFQVALFTAANRNRSM-VNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVT 72 (318) Q Consensus 1 ~t~~~~-------~~~~~~~a~~lft~~~~~~~~-v~~ws~~l~~~~~~~~~~~~~~G~~~~s~I~~~~dL~k~~Gd~v~ 72 (318) .-|++. .+.-.+|-.++ -++.+|.-. -+++...|.......+. .++...-++.+...|++|. T Consensus 9 ~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~nt~~l~~k~~~~LD~~~~~~~~---------s~~~~~N~~~e~~~g~tVk 78 (329) T protein:vir:10 9 VKTMNKEIKNATGKLKLNLQHFAN-KSVEPGDTLLKNKHVGILEKVTAANSY---------SAPAVISNDAIFMQGRSFT 78 (329) T ss_pred hhhhhhhhhcccceeEEehhhhcC-CccCCchhHHHHHHHHHHHHHHHhhce---------eeeeecccceeeccCcEEE Confidence 111211 11111121111 223444442 34454444332222222 1111112344566899999 Q ss_pred EEEeeccccCCeecCcee-ec-chhhhhheeeEEEEecccccccccchhhhhhhhhhH--HHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 73 FSIMHKLSKRPTMGDERV-EG-RGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNL--ASSARTLLGTYFNDLQDQCA 148 (318) Q Consensus 73 f~L~~~L~G~gv~Gd~~l-eG-nee~L~~~sd~v~Idq~R~~V~~~g~ms~qrs~~dl--r~~ar~~L~~w~~~~~D~~~ 148 (318) ++-+.- .++ +|-.. .| +.++++....++.|||.|----.=..|+..-+...+ -......+.+.+....|.-. T Consensus 79 Ip~i~~---~gl-~DY~R~~g~~~g~vt~~~~t~tidqdR~~~F~VD~~D~dEtn~~l~a~~i~~~~~~~~v~pEiDay~ 154 (329) T protein:vir:10 79 VIKGDV---TEL-KDYKRNATNEFDHPQIQETTYFLDQEKYWGRFVDALDRRDTEGNIDINYVVAKQASEVVAPYLDNLR 154 (329) T ss_pred Eeeecc---ccc-ccccCCCCccccccccceeEEEeecccceeeecchhhHhhhhhhhhHHHHHHHHHHHHhhhHHHHHH Confidence 986643 222 33322 22 345788899999999988553333456655555444 34445555666666677777 Q ss_pred HHHHhccccccccccceeecccccccccccccccCCCCCCceEeecCCccccccccccccCHHHHHHHHHHHHhcCCCCc Q lcl|Aclame:pro 149 IVHLAGARGDFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQ 228 (318) Q Consensus 149 ~~~LsG~rg~~~n~~~~~p~~~~~~~~~~~~N~v~aPt~~r~~~~~~at~~~~i~a~D~~s~~~Id~a~~~a~~~~~pi~ 228 (318) |-.|++.-+. ..+..++++. .++.|+.+..++++.+.| T Consensus 155 ~skla~~a~~--------------------------------------~~~~~~t~~n--ay~~i~~a~~~Lde~~vp-- 192 (329) T protein:vir:10 155 FATLARNKAK--------------------------------------HLTVGSGADA--QYDAVLDVSVELDEIGAG-- 192 (329) T ss_pred HHHHHhhccc--------------------------------------ccccccCHHH--HHHHHHHHHHHHHhcCCC-- Confidence 7666532210 0111233222 488899999999886543 Q ss_pred eeEeccccccCCcceEEEEecHHHHHHHHhCcchHHHHHHHHHHhhccccccCCcccCCeEEEcCEEEEecCceeE---E Q lcl|Aclame:pro 229 PVRLSGDELHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPI---R 305 (318) Q Consensus 229 Pv~v~g~~~~~~~~~yV~~l~P~q~~dLr~d~~~~~w~~~qk~A~~r~~g~~nPlF~G~~gm~ngvii~e~~~~~i---R 305 (318) +| +|||++|..+.-|+.++.+ .+. ..+....++.|.+|.+|||-|.+-|+... = T Consensus 193 ----~~---------Rvl~VtP~~~~~Lk~~~~f------~~~----~~~~~~~~~~g~Vg~idG~~Ii~vps~~~k~in 249 (329) T protein:vir:10 193 ----AS---------RILFVTPKFYKGIKKFVIE------LPQ----GDNRQQVLGKGVQGELDGFTIVKVPSKMLQGVE 249 (329) T ss_pred ----CC---------cEEEeCHHHHHHHHhhhhh------hcc----ccccccceeeeeeeeecCeEEEEecCCccccee Confidence 11 6899999999999998754 111 12345688999999999999999876411 1 Q ss_pred EcCC------Ceeeeee--eC Q lcl|Aclame:pro 306 FYQG------QRFWYQR--IT 318 (318) Q Consensus 306 f~ag------~~v~~a~--~~ 318 (318) |-+| ..+++.. |. T Consensus 250 ~ii~~~~A~~~~~K~~~~~~~ 270 (329) T protein:vir:10 250 AMAVIGEVMASPIQANEAKLN 270 (329) T ss_pred EEEEcCCceeeeeeeeeeeee Confidence 2111 1112111 11 No 54 >protein:vir:97331 Length: 319 # NCBI annotation: ORF011 # Family: family:all:701 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240611;genbank:gi:66396278;genbank:GeneID:5133687 Probab=96.61 E-value=0.00047 Score=38.81 Aligned_cols=239 Identities=12% Similarity=0.030 Sum_probs=129.5 Q ss_pred CCcCCccchhHHHHHHHHHHhcccchH-HHHhhhhhhhhhhhhcccccccCCCCCccEEEeeccCCCCCcEEEEEEeecc Q lcl|Aclame:pro 1 MTTVTSAQANKLFQVALFTAANRNRSM-VNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKL 79 (318) Q Consensus 1 ~t~~~~~~~~~~~a~~lft~~~~~~~~-v~~ws~~l~~~~~~~~~~~~~~G~~~~s~I~~~~dL~k~~Gd~v~f~L~~~L 79 (318) +-+.+-.+.-.+| -.-+-+...|+.. -++|+..|..-......-.+.. .|. +.+-..|++|.++-+.- T Consensus 5 ~~~~~~~~~~~~~-~~~~~~~~~nt~~l~~k~~~~LD~~~~~~~~s~~~~---~N~------~~e~~gg~tVkIp~i~~- 73 (319) T protein:vir:97 5 IKNATGMLKLNLQ-HFANKSVEPGQTLLKNKHVGILERVTAVNAYSTPAL---ISN------DAIFMEGRSFTVMKGDT- 73 (319) T ss_pred cccccceeEeehh-hhhccCCCcchHHHHHHHHHHHHHHHHHhhhhhhcc---cCc------ceEeccCcEEEEeeecc- Confidence 1111111111111 1112344455553 4567766654333333222221 111 12224699999876543 Q ss_pred ccCCeecCcee--ecchhhhhheeeEEEEecccccccccchhhhhhhhhhH--HHHHHHHHHHHHHHHHHHHHHHHHhcc Q lcl|Aclame:pro 80 SKRPTMGDERV--EGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNL--ASSARTLLGTYFNDLQDQCAIVHLAGA 155 (318) Q Consensus 80 ~G~gv~Gd~~l--eGnee~L~~~sd~v~Idq~R~~V~~~g~ms~qrs~~dl--r~~ar~~L~~w~~~~~D~~~~~~LsG~ 155 (318) .++ +|-.. -.+.++++....++.+||.|----.=..|+..-+..++ -........+.+....|.-.|..|++. T Consensus 74 --~gl-~DY~R~~g~~~g~vt~~~~t~tidqdR~~~F~VD~~D~~Etn~~l~a~~i~~~~~~~~v~PEiDay~~skla~~ 150 (319) T protein:vir:97 74 --TEL-KDYKRNATNEFDHPKIEETTYFLDQEKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFATLARN 150 (319) T ss_pred --ccc-ccccCCCCcccCCcccceeEEEeecccccccccchhhHhhhhchhhHHHHHHHHHHHHhhhhhhHHHHHHHHhh Confidence 233 33322 23456789999999999998654444566766665554 334445555556666777677666532 Q ss_pred ccccccccceeecccccccccccccccCCCCCCceEeecCCccccccccccccCHHHHHHHHHHHHhcCCCCceeEeccc Q lcl|Aclame:pro 156 RGDFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGD 235 (318) Q Consensus 156 rg~~~n~~~~~p~~~~~~~~~~~~N~v~aPt~~r~~~~~~at~~~~i~a~D~~s~~~Id~a~~~a~~~~~pi~Pv~v~g~ 235 (318) -+. ..+..++++. .++.|+.+.+++++.+-| + T Consensus 151 a~~--------------------------------------~~~~~~t~~n--~y~~i~~a~~~Lde~~VP-------~- 182 (319) T protein:vir:97 151 KAK--------------------------------------HLTVGTGSDA--QYDAVLDVSVELDEIKAP-------E- 182 (319) T ss_pred ccc--------------------------------------ccccccCHHH--HHHHHHHHHHHHHhcCCC-------C- Confidence 220 0111122222 388899999999886543 1 Q ss_pred cccCCcceEEEEecHHHHHHHHhCcchHHHHHHHHHHhhccccccCCcccCCeEEEcCEEEEecCcee---EEEcCCC-- Q lcl|Aclame:pro 236 ELHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMP---IRFYQGQ-- 310 (318) Q Consensus 236 ~~~~~~~~yV~~l~P~q~~dLr~d~~~~~w~~~qk~A~~r~~g~~nPlF~G~~gm~ngvii~e~~~~~---iRf~ag~-- 310 (318) . +||||+|..+.-|+.++.+ ++.-. ..+..++.|.+|.+|||-|.|-|+.. |-|-+|- T Consensus 183 ~-------Rvl~Vtp~~~~~L~~~~~f------~~~~~----~~~~~~~~g~Vg~idG~~Vi~vps~~~k~in~i~~h~~ 245 (319) T protein:vir:97 183 N-------RVLFVSPTFYKGIKKFVIA------LPQGD----TRQQVLGKGVQGELDGFVIVKVPTKLLQGLQAIAVVGE 245 (319) T ss_pred C-------cEEEeCHHHHHHHHhhhhh------hcccc----ccccceeeeeceeecCeEEEEecccccccceEEEEcCC Confidence 1 6899999999999999754 22211 22467899999999999999987631 1121111 Q ss_pred ----eeeeee--eC Q lcl|Aclame:pro 311 ----RFWYQR--IT 318 (318) Q Consensus 311 ----~v~~a~--~~ 318 (318) .+++.. |. T Consensus 246 A~~~~~k~~~~~~~ 259 (319) T protein:vir:97 246 VLASPIQADLAKTN 259 (319) T ss_pred eeeeeeeeeeeecc Confidence 112111 11 No 55 >protein:vir:94800 Length: 319 # NCBI annotation: ORF012 # Family: family:all:701 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240536;genbank:gi:66396203;genbank:GeneID:5133580 Probab=96.61 E-value=0.00047 Score=38.81 Aligned_cols=239 Identities=12% Similarity=0.030 Sum_probs=129.5 Q ss_pred CCcCCccchhHHHHHHHHHHhcccchH-HHHhhhhhhhhhhhhcccccccCCCCCccEEEeeccCCCCCcEEEEEEeecc Q lcl|Aclame:pro 1 MTTVTSAQANKLFQVALFTAANRNRSM-VNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKL 79 (318) Q Consensus 1 ~t~~~~~~~~~~~a~~lft~~~~~~~~-v~~ws~~l~~~~~~~~~~~~~~G~~~~s~I~~~~dL~k~~Gd~v~f~L~~~L 79 (318) +-+.+-.+.-.+| -.-+-+...|+.. -++|+..|..-......-.+.. .|. +.+-..|++|.++-+.- T Consensus 5 ~~~~~~~~~~~~~-~~~~~~~~~nt~~l~~k~~~~LD~~~~~~~~s~~~~---~N~------~~e~~gg~tVkIp~i~~- 73 (319) T protein:vir:94 5 IKNATGMLKLNLQ-HFANKSVEPGQTLLKNKHVGILERVTAVNAYSTPAL---ISN------DAIFMEGRSFTVMKGDT- 73 (319) T ss_pred cccccceeEeehh-hhhccCCCcchHHHHHHHHHHHHHHHHHhhhhhhcc---cCc------ceEeccCcEEEEeeecc- Confidence 1111111111111 1112344455553 4567766654333333222221 111 12224699999876543 Q ss_pred ccCCeecCcee--ecchhhhhheeeEEEEecccccccccchhhhhhhhhhH--HHHHHHHHHHHHHHHHHHHHHHHHhcc Q lcl|Aclame:pro 80 SKRPTMGDERV--EGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNL--ASSARTLLGTYFNDLQDQCAIVHLAGA 155 (318) Q Consensus 80 ~G~gv~Gd~~l--eGnee~L~~~sd~v~Idq~R~~V~~~g~ms~qrs~~dl--r~~ar~~L~~w~~~~~D~~~~~~LsG~ 155 (318) .++ +|-.. -.+.++++....++.+||.|----.=..|+..-+..++ -........+.+....|.-.|..|++. T Consensus 74 --~gl-~DY~R~~g~~~g~vt~~~~t~tidqdR~~~F~VD~~D~~Etn~~l~a~~i~~~~~~~~v~PEiDay~~skla~~ 150 (319) T protein:vir:94 74 --TEL-KDYKRNATNEFDHPKIEETTYFLDQEKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFATLARN 150 (319) T ss_pred --ccc-ccccCCCCcccCCcccceeEEEeecccccccccchhhHhhhhchhhHHHHHHHHHHHHhhhhhhHHHHHHHHhh Confidence 233 33322 23456789999999999998654444566766665554 334445555556666777677666532 Q ss_pred ccccccccceeecccccccccccccccCCCCCCceEeecCCccccccccccccCHHHHHHHHHHHHhcCCCCceeEeccc Q lcl|Aclame:pro 156 RGDFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGD 235 (318) Q Consensus 156 rg~~~n~~~~~p~~~~~~~~~~~~N~v~aPt~~r~~~~~~at~~~~i~a~D~~s~~~Id~a~~~a~~~~~pi~Pv~v~g~ 235 (318) -+. ..+..++++. .++.|+.+.+++++.+-| + T Consensus 151 a~~--------------------------------------~~~~~~t~~n--~y~~i~~a~~~Lde~~VP-------~- 182 (319) T protein:vir:94 151 KAK--------------------------------------HLTVGTGSDA--QYDAVLDVSVELDEIKAP-------E- 182 (319) T ss_pred ccc--------------------------------------ccccccCHHH--HHHHHHHHHHHHHhcCCC-------C- Confidence 220 0111122222 388899999999886543 1 Q ss_pred cccCCcceEEEEecHHHHHHHHhCcchHHHHHHHHHHhhccccccCCcccCCeEEEcCEEEEecCcee---EEEcCCC-- Q lcl|Aclame:pro 236 ELHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMP---IRFYQGQ-- 310 (318) Q Consensus 236 ~~~~~~~~yV~~l~P~q~~dLr~d~~~~~w~~~qk~A~~r~~g~~nPlF~G~~gm~ngvii~e~~~~~---iRf~ag~-- 310 (318) . +||||+|..+.-|+.++.+ ++.-. ..+..++.|.+|.+|||-|.|-|+.. |-|-+|- T Consensus 183 ~-------Rvl~Vtp~~~~~L~~~~~f------~~~~~----~~~~~~~~g~Vg~idG~~Vi~vps~~~k~in~i~~h~~ 245 (319) T protein:vir:94 183 N-------RVLFVSPTFYKGIKKFVIA------LPQGD----TRQQVLGKGVQGELDGFVIVKVPTKLLQGLQAIAVVGE 245 (319) T ss_pred C-------cEEEeCHHHHHHHHhhhhh------hcccc----ccccceeeeeceeecCeEEEEecccccccceEEEEcCC Confidence 1 6899999999999999754 22211 22467899999999999999987631 1121111 Q ss_pred ----eeeeee--eC Q lcl|Aclame:pro 311 ----RFWYQR--IT 318 (318) Q Consensus 311 ----~v~~a~--~~ 318 (318) .+++.. |. T Consensus 246 A~~~~~k~~~~~~~ 259 (319) T protein:vir:94 246 VLASPIQADLAKTN 259 (319) T ss_pred eeeeeeeeeeeecc Confidence 112111 11 No 56 >protein:vir:78387 Length: 349 # NCBI annotation: putative coat protein # Family: family:all:1522 # MgeID: mge:1851 # MgeName: SETP3 # Cross-refs: genbank:acc:YP_001110837;genbank:gi:134288598;genbank:GeneID:5179650 Probab=96.03 E-value=0.00068 Score=37.95 Aligned_cols=245 Identities=10% Similarity=0.098 Sum_probs=124.5 Q ss_pred CCcCCccchhHHHHHHHHHHhcccchHHHHhhhhhhhhhhhhcccccccCCCCCccEEEeeccC---CCCCcEEEEEEee Q lcl|Aclame:pro 1 MTTVTSAQANKLFQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLN---KQAGDEVTFSIMH 77 (318) Q Consensus 1 ~t~~~~~~~~~~~a~~lft~~~~~~~~v~~ws~~l~~~~~~~~~~~~~~G~~~~s~I~~~~dL~---k~~Gd~v~f~L~~ 77 (318) |++..-.+. ...-.-+|+..-.+.+. ....|.++ .+|+.-.+|. .+.|+.|++++.. T Consensus 1 Ma~T~l~D~-iipe~~vf~~Yv~~~~~----------------e~~~l~qS---Gii~~d~~l~~~~~~gG~~~~iPf~~ 60 (349) T protein:vir:78 1 MAITTIGDI-VTGNIPVLASYMTEDPV----------------EKTAFFDS---GILTSTPYAAEIANGPSNIANLPFWK 60 (349) T ss_pred CCceEEeee-eccCHHHHHHHHHHhhH----------------Hhhhhhhc---cceeccHHHHHHhhcCCCEEEeeeee Confidence 443322211 11222234444444431 12345553 5666666664 4789999999999 Q ss_pred ccccC--CeecCceeecc--hhhhhheeeEEEEecccccccccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|Aclame:pro 78 KLSKR--PTMGDERVEGR--GEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLA 153 (318) Q Consensus 78 ~L~G~--gv~Gd~~leGn--ee~L~~~sd~v~Idq~R~~V~~~g~ms~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~Ls 153 (318) +|.|+ +-..++.-++. -+.+...++.-++=.+-++.... .+++.=|.-|..+....++++||.+.....+|-.|. T Consensus 61 ~L~g~~e~nv~~D~~~~~~t~~kitt~~~~a~~~~r~kaw~~~-Dla~~lsG~dpm~~Ia~~va~yW~r~~q~~Lia~L~ 139 (349) T protein:vir:78 61 AIDTSIEPNYSNDVYQDIATPRAIQTGEMMARVAYLNEGFGQA-DLTVELTSQNPLQSVASRLDNFWQRQAQRRLIATAL 139 (349) T ss_pred cCCCCcccccCCCCcccccccccccccceeeeeeeeccccchh-HHHHHhhCchHHHHHHHHHHHHHhhHHHHHHHHHHH Confidence 99984 32333222222 33566666665554444443322 223333444888888999999999999999999998 Q ss_pred ccccccccccceeecccccccccccccccCCCCCCceEeecCCccccccccccccCHHHHHHHHH-HHHhcCCCCceeEe Q lcl|Aclame:pro 154 GARGDFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSL-FIDEMAHPLQPVRL 232 (318) Q Consensus 154 G~rg~~~n~~~~~p~~~~~~~~~~~~N~v~aPt~~r~~~~~~at~~~~i~a~D~~s~~~Id~a~~-~a~~~~~pi~Pv~v 232 (318) |+-+...... ....+. ++ + . .++.++..++.+.+-.|.. +.+. .. T Consensus 140 Gvf~~~~~a~------~~~~~~----~~-------~-t--------~d~s~~a~~~~~~~~dA~~~lgda-~~------- 185 (349) T protein:vir:78 140 GLYNDNVSAT------DAYHEQ----ND-------M-V--------VDVSATLGFDAGAFIDATQTMGDA-LM------- 185 (349) T ss_pred Hhhccccccc------chhhhc----cc-------c-e--------eeeccccCCChhhhhhhHHHHHHH-hc------- Confidence 8876321100 000111 10 0 0 0122222356554444443 3332 11 Q ss_pred ccccccCCcceEEEEecHHHHHHHHhCcchHHHHHHHHHHhhccccccCCcccCCeEEEcCEEEEecCceeE-------E Q lcl|Aclame:pro 233 SGDELHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPI-------R 305 (318) Q Consensus 233 ~g~~~~~~~~~yV~~l~P~q~~dLr~d~~~~~w~~~qk~A~~r~~g~~nPlF~G~~gm~ngvii~e~~~~~i-------R 305 (318) |+. .+.+=+++||+..+..|++.- .++..+ .+++ ...++.|+|..+..--.||+ . T Consensus 186 -Gd~---~~~lt~i~mHS~v~~~L~~~~----li~~i~----~s~~------~~~i~ty~G~~VivDD~~Pv~~~g~~~~ 247 (349) T protein:vir:78 186 -GNG---GEVLGAIAMHSFVYAQARKAQ----LIDFIR----DAEN------NTMFATYQGYRVIVDDSMTVVGQGAQRK 247 (349) T ss_pred -ccc---ccceeEEEEchHHHHHHHhhh----hhhhcc----Cccc------CcccceecCeEEEEeCCCccccCCCCce Confidence 211 122468999999999999863 233222 1111 12356666666655444554 1 Q ss_pred ----EcCCCeeeeeeeC Q lcl|Aclame:pro 306 ----FYQGQRFWYQRIT 318 (318) Q Consensus 306 ----f~ag~~v~~a~~~ 318 (318) +.+.+.|.|.... T Consensus 248 yttylfg~GAi~~~~~~ 264 (349) T protein:vir:78 248 FISIIFGQGAIGYGEGN 264 (349) T ss_pred EEEEEeecceEEEccCC Confidence 1233333333322 No 57 >protein:vir:94989 Length: 349 # NCBI annotation: hypothetical protein # Family: family:all:1522 # MgeID: mge:1547 # MgeName: KS7 # Cross-refs: genbank:acc:YP_224029;genbank:gi:62327316;genbank:GeneID:5176817 Probab=95.84 E-value=0.0011 Score=36.75 Aligned_cols=246 Identities=10% Similarity=0.086 Sum_probs=125.8 Q ss_pred CCcCCccchhHHHHHHHHHHhcccchHHHHhhhhhhhhhhhhcccccccCCCCCccEEEeeccC---CCCCcEEEEEEee Q lcl|Aclame:pro 1 MTTVTSAQANKLFQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLN---KQAGDEVTFSIMH 77 (318) Q Consensus 1 ~t~~~~~~~~~~~a~~lft~~~~~~~~v~~ws~~l~~~~~~~~~~~~~~G~~~~s~I~~~~dL~---k~~Gd~v~f~L~~ 77 (318) |++..-.+. ...-.-+|+..-.+.+. ....|.++ .+|+.-.+|. .+.|+.|++++.. T Consensus 1 Ma~T~l~D~-iipe~~vf~~Yv~~~~~----------------e~~~l~qS---Gii~~d~~l~~~~~~gG~~~~iPf~~ 60 (349) T protein:vir:94 1 MAITTIGNI-VTGNIPVLASYMTEDPV----------------EKTAFFNS---GILTPTPYAAEIARGPSNIANLPFWK 60 (349) T ss_pred CCceEEeee-eccChHHHHHHHHHhHH----------------Hhhhhhhc---cceeccHHHHHHHhcCCCEEEeeeee Confidence 543322221 11222234444444441 12445553 5666666665 4789999999999 Q ss_pred ccccC--C-eecCceee-cchhhhhheeeEEEEecccccccccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|Aclame:pro 78 KLSKR--P-TMGDERVE-GRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLA 153 (318) Q Consensus 78 ~L~G~--g-v~Gd~~le-Gnee~L~~~sd~v~Idq~R~~V~~~g~ms~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~Ls 153 (318) +|.|+ + +.||...+ .--..+...++.-++=.+-++.... .+++.=|.-|..+....++++||.+.....+|..|. T Consensus 61 ~l~g~~e~n~~~dt~~~~~t~~kit~~~~~a~~~~r~kaw~~~-Dla~~lsG~dpm~~Ia~~va~yW~r~~q~~Lia~L~ 139 (349) T protein:vir:94 61 AIDTSIEPNYSNDVYQDIATPRAIQTGEMMARVAYLNEGFGQA-DLTVELTSQNPLQSVASRLDNFWQRQAQRRLIATAL 139 (349) T ss_pred cCCCCcccccCCCCcccccccccccccceeeeeeeeccccchh-HHHHHhhCchHHHHHHHHHHHHHhhHHHHHHHHHHH Confidence 99885 3 44443321 2223455555555544443343222 233333445888888999999999999999999998 Q ss_pred ccccccccccceeecccccccccccccccCCCCCCceEeecCCccccccccccccCHHHHHHHHHHHHhcCCCCceeEec Q lcl|Aclame:pro 154 GARGDFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLS 233 (318) Q Consensus 154 G~rg~~~n~~~~~p~~~~~~~~~~~~N~v~aPt~~r~~~~~~at~~~~i~a~D~~s~~~Id~a~~~a~~~~~pi~Pv~v~ 233 (318) |.-+...... ....+. + ++ ...+.++..++.+.+-.|...+-......+ T Consensus 140 Gvf~~~~~~~------~~~~~~----~-------~~---------~~d~~~~a~~~~~~~~~A~~~~Gdaa~Gd~----- 188 (349) T protein:vir:94 140 GLYNDNVSAT------DAYHEQ----N-------DM---------VVDVSATSGFDAGAFIDATQTMGDALMGNG----- 188 (349) T ss_pred hhhccccccc------cccccc----C-------ce---------eEEecccCCCChhhHHHHHHHHHHHhcccc----- Confidence 8876321100 000110 0 00 111233444666654444433322111111 Q ss_pred cccccCCcceEEEEecHHHHHHHHhCcchHHHHHHHHHHhhccccccCCcccCCeEEEcCEEEEecCceeEE-------- Q lcl|Aclame:pro 234 GDELHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIR-------- 305 (318) Q Consensus 234 g~~~~~~~~~yV~~l~P~q~~dLr~d~~~~~w~~~qk~A~~r~~g~~nPlF~G~~gm~ngvii~e~~~~~iR-------- 305 (318) + +.+=++.||+..+..|++.-- ++..+ .++| ...++.|+|..+..--.||+- T Consensus 189 ~------~~lt~i~mHS~v~~~L~~~~l----i~~i~----~s~~------~~~i~ty~G~~VivDD~~Pv~~~g~~~~y 248 (349) T protein:vir:94 189 G------EVLGAIAMHSFVYAQARKAQL----IDFIR----DAEN------NTMFATYQGYRVIVDDSMTVVGQDTSRKF 248 (349) T ss_pred c------cceeEEEEchHHHHHHHhcch----hhhcc----Cccc------CcccceecCcEEEEeCCCccccCCCCceE Confidence 1 224689999999999999732 22222 1111 112456666655554445541 Q ss_pred ---EcCCCeeeeeeeC Q lcl|Aclame:pro 306 ---FYQGQRFWYQRIT 318 (318) Q Consensus 306 ---f~ag~~v~~a~~~ 318 (318) +.+.+.|.|.... T Consensus 249 ttylfg~GAi~~~~~~ 264 (349) T protein:vir:94 249 ISIIFGQGAIGYGEGN 264 (349) T ss_pred EEEEeecceEEeecCC Confidence 1233334443332 No 58 >protein:vir:1781 Length: 221 # NCBI annotation: minor capsid protein # Family: family:all:975 # MgeID: mge:38 # MgeName: P60 # Cross-refs: genbank:acc:NP_570347;genbank:gi:18640506;genbank:GeneID:932719 Probab=95.51 E-value=0.00071 Score=37.85 Aligned_cols=161 Identities=15% Similarity=0.049 Sum_probs=90.9 Q ss_pred EecccccccccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccceeecccccccccccccccCCC Q lcl|Aclame:pro 106 INQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGDFVADDTILPTAEHPEFKKIMINDVLPP 185 (318) Q Consensus 106 Idq~R~~V~~~g~ms~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~LsG~rg~~~n~~~~~p~~~~~~~~~~~~N~v~aP 185 (318) ||.+..+=-.=..+++..+..|+|.+.-..+..-+++-.|+-++..+..+... ..|....+.. T Consensus 1 iD~lL~a~~~VdDiD~aqa~~dvr~e~t~e~G~ALA~~~D~~i~~~~~~aA~~------~~p~~~~~~g----------- 63 (221) T protein:vir:17 1 MDDLLVASQFVYDLDEILAQWNTRSEISKQIGEALAIHYDERIARVLASASIA------AAPVTGQDGG----------- 63 (221) T ss_pred CCcchhHHHHHHhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhh------cCcccccccC----------- Confidence 77765442222467888889999999999999999999999999888633211 0011100000 Q ss_pred CCCceEeecCCccccccccccccC----HHHHHHHHHHHHhcCCCCceeEeccccccCCcceEEEEecHHHHHHHHh--C Q lcl|Aclame:pro 186 THDRHFFGGDATSFEQIEAADIFS----IGLVDNLSLFIDEMAHPLQPVRLSGDELHGEDPYYVLYVTPRQWNDWYT--S 259 (318) Q Consensus 186 t~~r~~~~~~at~~~~i~a~D~~s----~~~Id~a~~~a~~~~~pi~Pv~v~g~~~~~~~~~yV~~l~P~q~~dLr~--d 259 (318) + ...++++..-+ ++.|-.|....++..-| .++ .++++.|.|+..|-+ | T Consensus 64 ---------~---~~~~~a~~t~~~~~l~dai~~a~~~LdekdVP-------~~g-------R~~vv~P~~y~~LL~~~d 117 (221) T protein:vir:17 64 ---------F---SVNIGAGNTNNAQAIVDGFFEAAAVLDERSAP-------MDG-------RVAVLSPRQYYSLISSVD 117 (221) T ss_pred ---------c---ceeccccccCCHHHHHHHHHHHHHHHhhcCCC-------CCC-------CEEEeCcHHHHHHHHhcC Confidence 0 00111111112 35555566666664444 112 688999999999975 3 Q ss_pred cchHHHHHHHHHHhhccccccCCcccC-CeEEEcCEEEEecCceeEEE------cCCCeeeee------eeC Q lcl|Aclame:pro 260 TSGKDWNQMMVRAVNRAKGFNHPLFKG-ECAMWRNILVRKYAGMPIRF------YQGQRFWYQ------RIT 318 (318) Q Consensus 260 ~~~~~w~~~qk~A~~r~~g~~nPlF~G-~~gm~ngvii~e~~~~~iRf------~ag~~v~~a------~~~ 318 (318) +-. .++... +..--+..| ++|+++|+-|.+-+++|--. ++|..+-.+ |-. T Consensus 118 ~~~-------~n~d~~--~s~g~~~~g~~i~~v~G~~V~~SnnlP~~~gt~~~~~ag~~~~~~~~~~~yr~~ 180 (221) T protein:vir:17 118 TNI-------LNREIG--NTQGDMNTGKGLYVNAGIRIYKSNVLASLYGTNLVTDPGDATTSGENNGSYRPA 180 (221) T ss_pred cce-------eeeecc--cccccccccceeeeecCcEEEEeccCCcccccccccCCcccccccccccccccc Confidence 421 111111 112225556 69999999999998887422 233332111 111 No 59 >protein:vir:102655 Length: 322 # NCBI annotation: Hypothetical protein # Family: family:all:6384 # MgeID: mge:1624 # MgeName: VP2 # Cross-refs: genbank:acc:YP_052979;genbank:gi:50282923;genbank:GeneID:2948122 Probab=95.39 E-value=0.0022 Score=35.19 Aligned_cols=249 Identities=11% Similarity=-0.015 Sum_probs=127.7 Q ss_pred CCcCCccchhHHHHHHHHHHhcccchHHHHhhhhhhhhhhhhcccccccCCCCCccEEEeeccCCCCCcEEEEEEeec-- Q lcl|Aclame:pro 1 MTTVTSAQANKLFQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHK-- 78 (318) Q Consensus 1 ~t~~~~~~~~~~~a~~lft~~~~~~~~v~~ws~~l~~~~~~~~~~~~~~G~~~~s~I~~~~dL~k~~Gd~v~f~L~~~-- 78 (318) .-.+-++.+ .|+. ..-+-+|+.|++.+........ .++-++ ++..+ .-+.++.+..-.... T Consensus 3 ~~~~~~~~~--~Ms~------~i~~~fv~qy~~~v~~~~qq~~--s~L~~t-----V~~~~--~~~~~~~~~~~~~~~~~ 65 (322) T protein:vir:10 3 LNAIMSMLP--LIAG------DIDQAFVQTYETTLRILSQQKS--AKLKQY-----CQHKN--ESSESHNWETLASMDPD 65 (322) T ss_pred ccceeeeee--eeec------hhhhHHHHHHHHHHHHHHHHhh--hhhhcc-----ccccc--ccccccceeeccccccc Confidence 111222211 1222 1233468888887644433221 223222 11111 112222222111111 Q ss_pred cccCCeecCceeecc---h-hhhhheeeEEEEecccccccccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhc Q lcl|Aclame:pro 79 LSKRPTMGDERVEGR---G-EDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAG 154 (318) Q Consensus 79 L~G~gv~Gd~~leGn---e-e~L~~~sd~v~Idq~R~~V~~~g~ms~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~LsG 154 (318) .-|..-.+.....+. - .+.....-.+.+++...++.+ ..++.-|..+|+|...-..++.-+++..|+.++..+.| T Consensus 66 ~~~~~~~~~~~~d~~~dtp~~~~~~~~r~~~~~d~~~~~~V-Dd~D~~k~~~D~~~~~~~~~a~AL~R~~D~~I~~a~~g 144 (322) T protein:vir:10 66 AVKRKRSRQQSADGTYPTPVNNKPFAKRRTNVDTYDTGHVV-EQEDISQMLLDPNSALITSQAYAMARKTDDLIIAGAWK 144 (322) T ss_pred ccccccccccccCcccCCCccccccceEEEeecccccceec-chHHHHHhhcCchHHHHHHHHHHhhhHHHHHHHhhhhc Confidence 111111111111111 1 122333334444555555444 47788889999999999999999999999988755543 Q ss_pred cccccccccceeecccccccccccccccCCCCCCceEeecCCccccccccccccCHHHHHHHHHHHHhcCCCCceeEecc Q lcl|Aclame:pro 155 ARGDFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSG 234 (318) Q Consensus 155 ~rg~~~n~~~~~p~~~~~~~~~~~~N~v~aPt~~r~~~~~~at~~~~i~a~D~~s~~~Id~a~~~a~~~~~pi~Pv~v~g 234 (318) .-. . +. ...+|..|+++-+-- ++=.++.+.|..|+...+...-| -+| T Consensus 145 ~a~--~------------~~---~gt~v~~~ss~~i~~-----------g~~g~t~~kl~~a~~~l~~~dvp-----~d~ 191 (322) T protein:vir:10 145 PAS--I------------KG---TGQPVEFLATQEIGD-----------GTKPISFDYVTEITERFLENEIE-----PEV 191 (322) T ss_pred ccc--c------------cc---cccccccCCCccccc-----------CccchhHHHHHHHHHHHHhcCCC-----CCC Confidence 321 0 00 011233333221111 12257778888888888775533 112 Q ss_pred ccccCCcceEEEEecHHHHHHHHhCcchHHHHHHHHHHhhccccccCCccc-CCeEEEcCEEEEecCceeE--------- Q lcl|Aclame:pro 235 DELHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFK-GECAMWRNILVRKYAGMPI--------- 304 (318) Q Consensus 235 ~~~~~~~~~yV~~l~P~q~~dLr~d~~~~~w~~~qk~A~~r~~g~~nPlF~-G~~gm~ngvii~e~~~~~i--------- 304 (318) + .+|++.|.|+.+|.+|+.+.. +.- ....+|+. |.+|.|-|+-+..+.++|. T Consensus 192 -------~-R~~vv~p~~~~~LL~d~~~ts-------~D~---~~~~~l~~~G~ig~~lGf~~i~s~~lp~~~~t~~~~~ 253 (322) T protein:vir:10 192 -------S-KVIVIGPTQARKLLQITEATS-------ADY---TSAMDLQSKGIITNWMGYTWIVSTRLDKFDPTQWGMA 253 (322) T ss_pred -------C-eEEEeCHHHHHHHhcchhhhh-------hhc---ccchhhhhcCeeeeeeeEEEEEeccCCcccccccccc Confidence 1 468999999999999986531 111 12467875 8899999999999887772 Q ss_pred ---------E---EcCCCeeeeee---eC Q lcl|Aclame:pro 305 ---------R---FYQGQRFWYQR---IT 318 (318) Q Consensus 305 ---------R---f~ag~~v~~a~---~~ 318 (318) | +|.-+.|..|. |+ T Consensus 254 ~~~~~~~~~~~~~a~~k~Av~~a~~~dv~ 282 (322) T protein:vir:10 254 AEDGPQGDEIWCIAMTDMALGYHSCKDIW 282 (322) T ss_pred ccCCCCccceeEEEEecCceeEEEeeeee Confidence 1 23334444442 33 No 60 >protein:vir:3525 Length: 423 # NCBI annotation: major head protein # Family: family:all:1412 # MgeID: mge:72 # MgeName: APSE-1 # Cross-refs: genbank:acc:NP_050985;genbank:gi:9633571;genbank:GeneID:1262318 Probab=94.41 E-value=0.0045 Score=33.45 Aligned_cols=226 Identities=14% Similarity=0.057 Sum_probs=116.2 Q ss_pred cccc--hH-HHHhhhhhhhhhhhhcccccccCCCCCccEEEeecc-CCCCCcEEEEEEeeccccCCeecCceeecchhhh Q lcl|Aclame:pro 22 NRNR--SM-VNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDL-NKQAGDEVTFSIMHKLSKRPTMGDERVEGRGEDL 97 (318) Q Consensus 22 ~~~~--~~-v~~ws~~l~~~~~~~~~~~~~~G~~~~s~I~~~~dL-~k~~Gd~v~f~L~~~L~G~gv~Gd~~leGnee~L 97 (318) +-|+ +. -++|+..+-..-.+..-+...+-.+.. .|. ..+.||+|+|.......-.-....+....+.+.+ T Consensus 1 MAN~llT~iP~iia~~al~~l~~~lV~~~lV~r~y~------ge~~~a~~GDTV~I~~p~~~~v~d~~~~~~~~~~~~~~ 74 (423) T protein:vir:35 1 MANNLESNISQIVLKKFLPGFMSDIVLCKTVDRQLL------SGEINSNTGDSVSFKRPHQFKSERTETGDITGKDKNGL 74 (423) T ss_pred CccchhhhhHHHHHHHHHHHHHhhcccchhcccCCC------cccccccCCCEEEEeeCCcceeecccCcCCCCcccccc Confidence 2233 11 255665532222222222222211111 111 2467999999987655422111111122234666 Q ss_pred hheeeEEEEecccc-cccccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccceeeccccccccc Q lcl|Aclame:pro 98 SHADFSLKINQGRH-LVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGDFVADDTILPTAEHPEFKK 176 (318) Q Consensus 98 ~~~sd~v~Idq~R~-~V~~~g~ms~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~LsG~rg~~~n~~~~~p~~~~~~~~~ 176 (318) .-.+-.|.||+..+ ++...++=..| ..-||.+..++.... +++..|+.++..|..... T Consensus 75 ~e~~v~l~id~~k~~a~~v~d~e~~l-~i~~~~~~l~~a~~a-la~~vd~~l~~~l~~~a~------------------- 133 (423) T protein:vir:35 75 FSAKATGKVGKYITVAVEWTQIEEAL-KLNQLDQILSPIHER-MVTDLETELAHFMMNNGA------------------- 133 (423) T ss_pred ccceeeEEeccceeccceeCHHHHHh-hHHHHHHHHHHHHHH-HHHHHHHHHHHHHhhccc------------------- Confidence 66677899999987 66776542122 455777666666544 555567666544421110 Q ss_pred ccccccCCCCCCceEeecCCccccccccccccCHHHHHHHHHHHHhcCCCCceeEeccccccCCcceEEEEecHHHHHHH Q lcl|Aclame:pro 177 IMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHGEDPYYVLYVTPRQWNDW 256 (318) Q Consensus 177 ~~~N~v~aPt~~r~~~~~~at~~~~i~a~D~~s~~~Id~a~~~a~~~~~pi~Pv~v~g~~~~~~~~~yV~~l~P~q~~dL 256 (318) |.+-.|. +..-.++.|-.+...+...+-|- ++ +.+++.|+-+..| T Consensus 134 ---~~vgt~~------------------t~~~~~~~i~~a~~~Ld~~~vP~------~~--------R~~Vv~p~~~a~L 178 (423) T protein:vir:35 134 ---LSLGSPN------------------TAIKKWADVAQTASFIKDIGIKT------GE--------NYAIMDPWSAQRL 178 (423) T ss_pred ---ccccccc------------------CCcchHHHHHHHHHHHHHhcCCc------CC--------CEEEeCHHHHHHH Confidence 1010110 01123677888888888877662 11 5889999999888 Q ss_pred HhCcchHHHHHHHHHHhhccccccCCcccCCe-EEEcCEEEEecCceeEEE---cCC-------Ceeeee---------- Q lcl|Aclame:pro 257 YTSTSGKDWNQMMVRAVNRAKGFNHPLFKGEC-AMWRNILVRKYAGMPIRF---YQG-------QRFWYQ---------- 315 (318) Q Consensus 257 r~d~~~~~w~~~qk~A~~r~~g~~nPlF~G~~-gm~ngvii~e~~~~~iRf---~ag-------~~v~~a---------- 315 (318) ..+... . .+ +. .+...-|=.|++ |.+.|+=|++..++|-.= ++| .++..+ T Consensus 179 l~~~~~--~---~~-~~---~~~~~alr~g~i~G~i~GFdv~~Snnvp~~T~gt~~~~~~v~~a~~v~~~a~~~~~~~~~ 249 (423) T protein:vir:35 179 ADAQSG--L---HA-AD---QLVRTAWENAQISGNFGGIRALMSNGLASRKQGDFDGAITVKTAPNVDYLSVKDSYQFTV 249 (423) T ss_pred hccccc--e---ec-cc---cchhHHHhhccceeeecceEEEEcCCCccccccccccceeecccccccccccccccccee Confidence 865431 1 11 11 122344667776 999999999998888541 111 111110 Q ss_pred ---eeC Q lcl|Aclame:pro 316 ---RIT 318 (318) Q Consensus 316 ---~~~ 318 (318) -.+ T Consensus 250 ~~~~~~ 255 (423) T protein:vir:35 250 ALTGAT 255 (423) T ss_pred eeeeee Confidence 000 No 61 >protein:vir:78920 Length: 290 # NCBI annotation: Cps # Family: family:all:701 # MgeID: mge:1859 # MgeName: A006 # Cross-refs: genbank:acc:YP_001468846;genbank:gi:157325479;genbank:GeneID:5601917 Probab=94.37 E-value=0.0046 Score=33.39 Aligned_cols=220 Identities=11% Similarity=0.056 Sum_probs=114.0 Q ss_pred HHHHHHHHhcccchHHHHhhhhhhhhhhhhcccccccCCCCCccEEEeeccCCCCCcEEEEEEeeccccCCeecCce-ee Q lcl|Aclame:pro 13 FQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLSKRPTMGDER-VE 91 (318) Q Consensus 13 ~a~~lft~~~~~~~~v~~ws~~l~~~~~~~~~~~~~~G~~~~s~I~~~~dL~k~~Gd~v~f~L~~~L~G~gv~Gd~~-le 91 (318) |+..+| ++|+..|.....+.+....+ . +. +..=..|++|.++=+. -.++ +|-. -. T Consensus 1 Main~a----------~~~~~~Ld~~~~~~~~t~~l-~---~~------~~~~~ggktVkI~~i~---~~gl-~DY~R~~ 56 (290) T protein:vir:78 1 MAINYV----------DKYGKELDQKLVFGTYTNEL-E---TP------NLLWLDAKTFKIQTIT---TTGL-KAHTRNK 56 (290) T ss_pred CchhHH----------HHHHHHHHHHHHhhheeeec-c---cc------ceeeccCCEEEEeeec---cCcc-cccccCC Confidence 333332 34555554444444443322 1 11 1112358999988443 2222 2222 12 Q ss_pred cc-hhhhhheeeEEEEecccccccccchhhhhhh--hhhHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccceeec Q lcl|Aclame:pro 92 GR-GEDLSHADFSLKINQGRHLVDAGGRMSQQRT--KFNLASSARTLLGTYFNDLQDQCAIVHLAGARGDFVADDTILPT 168 (318) Q Consensus 92 Gn-ee~L~~~sd~v~Idq~R~~V~~~g~ms~qrs--~~dlr~~ar~~L~~w~~~~~D~~~~~~LsG~rg~~~n~~~~~p~ 168 (318) |. ..+.+....++.+||.|----.=..|+.-.| ...+-........+.....+|.-.|-.|++.-+.. T Consensus 57 g~~~g~v~~~~et~tl~qdR~~~F~vD~~DvDEt~~~~~~~nv~~ef~~~~v~PEiDayr~skla~~a~~~--------- 127 (290) T protein:vir:78 57 GYNEGSASNTNKSYTIDFDRDVEFFVDVMDVDETGQALSAANVTKEFNSRHAGPEMDAYRFSKLATAAKTN--------- 127 (290) T ss_pred CcccCccccceeeEEeeccccceeeccccchhHHhhhhhHHHHHHHHHHHHhhhhhhHHHHHHHHhhhhcc--------- Confidence 22 2335667788889998754332234443333 24455555556666667777877777775333210 Q ss_pred ccccccccccccccCCCCCCceEeecCCccccccccccccCHHHHHHHHHHHHhcCCCCceeEeccccccCCcceEEEEe Q lcl|Aclame:pro 169 AEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHGEDPYYVLYV 248 (318) Q Consensus 169 ~~~~~~~~~~~N~v~aPt~~r~~~~~~at~~~~i~a~D~~s~~~Id~a~~~a~~~~~pi~Pv~v~g~~~~~~~~~yV~~l 248 (318) | ......++++ =-++.|+.+...++. .| .+. +||+| T Consensus 128 -----------~---------------~~~~~t~t~~--n~~~~i~~~~~~lde--vp-------~~~-------rvl~v 163 (290) T protein:vir:78 128 -----------S---------------NSVAEEITKD--NVFTKLKAAIRKVKK--YG-------TQN-------LVMYV 163 (290) T ss_pred -----------C---------------cccccccCHH--HHHHHHHHHHHHHHh--cC-------CCC-------eEEEE Confidence 0 0011122222 125566677766665 11 111 89999 Q ss_pred cHHHHHHHHhCcchHHHHHHHHHHhhccccccCCcccCCeEEEcCEEEEecCceeEEEc------CCCe----------- Q lcl|Aclame:pro 249 TPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFY------QGQR----------- 311 (318) Q Consensus 249 ~P~q~~dLr~d~~~~~w~~~qk~A~~r~~g~~nPlF~G~~gm~ngvii~e~~~~~iRf~------ag~~----------- 311 (318) +|.-+.-|+.++.| ++..... ....-...|.+|-+||+-|.|-|+. =||+ .|-. T Consensus 164 tp~~~~lL~~~~~f------~r~~~~~--~~~~~~i~~~V~~idG~~ii~vps~-~r~~t~~~f~~G~~~~~~ak~in~i 234 (290) T protein:vir:78 164 SPDVMAALELSDDF------VRAINVQ--NIGPSSIETRITAIDGTRIVEVEAE-DRFYDTFDFTDGYKPAAGAKKLNFL 234 (290) T ss_pred CHHHHHHHhhChhh------hcccccc--ccccccccceeeeecCcEEEEeccc-chhhhhhhhcccccccCCccceeEE Confidence 99999999999865 3322111 1123356999999999999998753 3654 2211 Q ss_pred ----------eeeeeeC Q lcl|Aclame:pro 312 ----------FWYQRIT 318 (318) Q Consensus 312 ----------v~~a~~~ 318 (318) +|++.+- T Consensus 235 i~~~~a~i~~~K~~~~~ 251 (290) T protein:vir:78 235 LVNKGSVVGGAKHASIY 251 (290) T ss_pred EEcCCceeeeeeeeEEE Confidence 1121111 No 62 >protein:vir:105374 Length: 423 # NCBI annotation: gene 5 protein # Family: family:all:1412 # MgeID: mge:1556 # MgeName: Sf6 # Cross-refs: genbank:acc:NP_958181;genbank:gi:41057283;genbank:GeneID:2716621 Probab=94.07 E-value=0.0055 Score=32.98 Aligned_cols=226 Identities=11% Similarity=0.008 Sum_probs=111.9 Q ss_pred cccch-H--HHHhhhhhhhhhhhhcccccccCCCCCccEEEeeccC-CCCCcEEEEEEeeccccCCeecCceeecchhhh Q lcl|Aclame:pro 22 NRNRS-M--VNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLN-KQAGDEVTFSIMHKLSKRPTMGDERVEGRGEDL 97 (318) Q Consensus 22 ~~~~~-~--v~~ws~~l~~~~~~~~~~~~~~G~~~~s~I~~~~dL~-k~~Gd~v~f~L~~~L~G~gv~Gd~~leGnee~L 97 (318) +.|+= + -++|+..+-..-.+..-+...+-.... .|.. ...||+|+|.......-.-..+..--.-+-++| T Consensus 1 MaN~llT~~p~iia~~aL~~l~~~lV~~~lVnr~y~------~ef~~~k~GDTV~I~~p~~~~~~d~~~~~~~~~~~~dl 74 (423) T protein:vir:10 1 MPNNLDSNVSQIVLKKFLPGFMSDLVLAKTVDRQLL------AGEINSSTGDSVSFKRPHQFSSLRTPTGDISGQNKNNL 74 (423) T ss_pred CccchhhhhHHHHHHHHHHHHHhhcccchhhcccCC------CcccccccCCEEEEeeCCceeeeccCCccccccccCcc Confidence 33331 1 245655432222221222111111110 1111 247999999877655433222221111245788 Q ss_pred hheeeEEEEecccc-cccccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccceeeccccccccc Q lcl|Aclame:pro 98 SHADFSLKINQGRH-LVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGDFVADDTILPTAEHPEFKK 176 (318) Q Consensus 98 ~~~sd~v~Idq~R~-~V~~~g~ms~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~LsG~rg~~~n~~~~~p~~~~~~~~~ 176 (318) .-.+-.|.||+..| ++....+ +..-..-||-+..++ ...-+++..|+.++..+.+... T Consensus 75 ~e~~v~l~id~~k~va~~v~d~-E~~~~i~~~~~~l~~-A~~aLA~~vd~~ia~~~~~~~~------------------- 133 (423) T protein:vir:10 75 ISGKATGRVGNYITVAVEYQQL-EEAIKLNQLEEILAP-VRQRIVTDLETELAHFMMNNGA------------------- 133 (423) T ss_pred ccceeEEEeeceeeeeeeechH-HHhcChhhHHHHHHH-HHHHHHHHHHHHHHHHHhhccc------------------- Confidence 88888999999988 6666543 222233445322222 2344566677666543322110 Q ss_pred ccccccCCCCCCceEeecCCccccccccccccCHHHHHHHHHHHHhcCCCCceeEeccccccCCcceEEEEecHHHHHHH Q lcl|Aclame:pro 177 IMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHGEDPYYVLYVTPRQWNDW 256 (318) Q Consensus 177 ~~~N~v~aPt~~r~~~~~~at~~~~i~a~D~~s~~~Id~a~~~a~~~~~pi~Pv~v~g~~~~~~~~~yV~~l~P~q~~dL 256 (318) |.+..| .+. .-.++.+-.+...+.+.+-|- ++ +.+++.|+-+..| T Consensus 134 ---~~~gt~-----------~t~-------~~a~~~i~~a~~~Ld~~~vP~------~~--------R~~Vv~p~~~a~L 178 (423) T protein:vir:10 134 ---LSLGSP-----------NTP-------ITKWSDVAQTASFLKDLGVNE------GE--------NYAVMDPWSAQRL 178 (423) T ss_pred ---cccccC-----------Ccc-------cchHHHHHHHHHHHHhccCCc------CC--------CEEEeChHHHHHH Confidence 000000 000 113667777888888877661 11 5679999999998 Q ss_pred HhCcchHHHHHHHHHHhhccccccCCcccCCe-EEEcCEEEEecCceeEE-Ec-CCC---------------------ee Q lcl|Aclame:pro 257 YTSTSGKDWNQMMVRAVNRAKGFNHPLFKGEC-AMWRNILVRKYAGMPIR-FY-QGQ---------------------RF 312 (318) Q Consensus 257 r~d~~~~~w~~~qk~A~~r~~g~~nPlF~G~~-gm~ngvii~e~~~~~iR-f~-ag~---------------------~v 312 (318) ..+... . .++. .+...-|=.|.+ |.+.|+=+++..++|-. =+ +++ ++ T Consensus 179 l~~~~~--~----~~~~---~~~~~alr~g~i~G~i~GFdv~~Snnip~~T~gt~~~t~~~~~~~~v~~~a~~~a~~~~~ 249 (423) T protein:vir:10 179 ADAQTG--L----HASD---QLVRTAWENAQIPTNFGGIRALMSNGLASRTQGAFGGTLTVKTQPTVTYNAVKDSYQFTV 249 (423) T ss_pred hccccc--e----eccc---ccchhhhhhccceeeecceEEEEeCCCccccccccccceeeeecceeccccccccceeee Confidence 876421 1 1111 122344666876 89999999998888742 11 111 11 Q ss_pred eeeeeC Q lcl|Aclame:pro 313 WYQRIT 318 (318) Q Consensus 313 ~~a~~~ 318 (318) .+++.+ T Consensus 250 ~~~~~~ 255 (423) T protein:vir:10 250 TLTGAT 255 (423) T ss_pred eeeecc Confidence 112111 No 63 >protein:vir:3136 Length: 322 # NCBI annotation: hypothetical protein # Family: family:all:11728 # MgeID: mge:64 # MgeName: VpV262 # Cross-refs: genbank:acc:NP_640318;genbank:gi:21234405;genbank:GeneID:956058 Probab=92.16 E-value=0.0042 Score=33.63 Aligned_cols=231 Identities=14% Similarity=0.196 Sum_probs=117.8 Q ss_pred HHHHHHHHhcccchH----H--HHhhhhhhhhhhhhcccccccCCCCCccEEEeeccCCCCCcEEEEEEeeccccCCeec Q lcl|Aclame:pro 13 FQVALFTAANRNRSM----V--NILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLSKRPTMG 86 (318) Q Consensus 13 ~a~~lft~~~~~~~~----v--~~ws~~l~~~~~~~~~~~~~~G~~~~s~I~~~~dL~k~~Gd~v~f~L~~~L~G~gv~G 86 (318) |+-|+ |.+. + .+||..+..--- .+++ . ..|.++.|. +.||+|.|+=+ |.++.+ T Consensus 1 ~~~~n------~ts~~qafi~~EiWsa~il~~l~-----~~Lv---~-~~~~~~~d~--g~GDtV~InsI----g~~tV~ 59 (322) T protein:vir:31 1 MSTGN------NTSNTQALIVSEIWADEIEDILH-----EKLL---D-VNIARVVDF--PDGDKLTIPSV----GTPVVR 59 (322) T ss_pred CCCCC------CcccceEEeehhhhHHHHHHHhh-----hhhh---h-hhhhccccc--CCCCeEEeccc----cccccc Confidence 33333 3331 2 368765321111 1222 1 113334444 45999999765 555566 Q ss_pred Cceeecc--hhhhhheeeEEEEeccc---ccccccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHh-ccccccc Q lcl|Aclame:pro 87 DERVEGR--GEDLSHADFSLKINQGR---HLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLA-GARGDFV 160 (318) Q Consensus 87 d~~leGn--ee~L~~~sd~v~Idq~R---~~V~~~g~ms~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~Ls-G~rg~~~ 160 (318) |....+. -+.|+-...+|.|||.. +.|+- .+ .| ...||+..+-...++-+++-.|+-..-.|. |+-.... T Consensus 60 dY~~~~~i~~d~ltt~~~~l~IDq~KYfaf~VdD-D~--~Q-a~~dl~~~~~~~aa~ala~~~D~fva~lL~~gA~~~~~ 135 (322) T protein:vir:31 60 SRPEQGDFTFDNLDTGEISIILRDEVYAGNAISK-KL--RQ-DSRWISNVGAMLPAEQARAIMERYQTDLLALGNAQFAG 135 (322) T ss_pred cccCCCCcccccCCCceEEEEEehhhhhccccch-hH--HH-hhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhc Confidence 6655554 57788899999999964 44554 33 23 456999999999999999989887755443 2211000 Q ss_pred cccceeecccccccccccccccCCCCCCceEeecCCccccccccccccCHHHHHHHHHHHHhcCCCCceeEeccccccCC Q lcl|Aclame:pro 161 ADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHGE 240 (318) Q Consensus 161 n~~~~~p~~~~~~~~~~~~N~v~aPt~~r~~~~~~at~~~~i~a~D~~s~~~Id~a~~~a~~~~~pi~Pv~v~g~~~~~~ 240 (318) ..... .+|++ | +.++. +.+.-++.++.|.++..++.+..-| .+. T Consensus 136 ~~~p~------------vin~~--~---~~iv~--------~gt~~~~ay~~lv~l~~kLdkanVP-------~~g---- 179 (322) T protein:vir:31 136 QNDPN------------VINGV--P---HRFVG--------TGTDQTMDVTDFSRVNYVMTQSKMP-------MGG---- 179 (322) T ss_pred cCCcc------------eecCC--c---cceec--------cCCCchhhHHHHHHHHHHhccccCC-------CCC---- Confidence 00000 01111 1 11111 1223367889999999988886655 111 Q ss_pred cceEEEEecHHHHHHHHh---------CcchHHHHHHHHHHhhccccccCCcccCCeEEEcCEEEEecCceeE---EEcC Q lcl|Aclame:pro 241 DPYYVLYVTPRQWNDWYT---------STSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPI---RFYQ 308 (318) Q Consensus 241 ~~~yV~~l~P~q~~dLr~---------d~~~~~w~~~qk~A~~r~~g~~nPlF~G~~gm~ngvii~e~~~~~i---Rf~a 308 (318) ++++++|.+++.|.. |+. |-.+..+..++ |. .|- |-+-|+=|..--.+|. -.++ T Consensus 180 ---R~vVV~P~~~~~L~~i~~~~~l~~D~r---f~~i~~sG~a~--g~---~~V---g~~~GF~V~~SN~l~~~~~~i~a 245 (322) T protein:vir:31 180 ---MIGIIDPSVAHHLETITNISNISNNPR---WEGIVESGIAP--DM---QFV---RSVYGIDLFVSNLLADANETINA 245 (322) T ss_pred ---eEEEeCchhhhhhhhhhhhhhhhcccc---ccccccccchh--hH---HHH---HHHhceeeeeecccccccccccc Confidence 688999999887744 543 22222222211 21 233 3334444443222221 1223 Q ss_pred CCee--e-------eeeeC Q lcl|Aclame:pro 309 GQRF--W-------YQRIT 318 (318) Q Consensus 309 g~~v--~-------~a~~~ 318 (318) |..- . ..+|. T Consensus 246 G~d~~~t~ag~~n~f~~~~ 264 (322) T protein:vir:31 246 GGDARSTTAGKCNMFMNVS 264 (322) T ss_pred Ccccccccceeeccccccc Confidence 3221 1 11221 No 64 >protein:vir:102335 Length: 312 # NCBI annotation: putative capsid protein # Family: family:all:701 # MgeID: mge:1566 # MgeName: phi CD119 # Cross-refs: genbank:acc:YP_529560;genbank:gi:90592716;genbank:GeneID:3974467 Probab=89.16 E-value=0.028 Score=29.10 Aligned_cols=226 Identities=15% Similarity=0.090 Sum_probs=114.3 Q ss_pred HHHHHHHHhcccchHHHHhhhhhhhhhhhhcccccccCCCCCccEEEeeccCCCCCcEEEEEEeeccccCCeecCcee-e Q lcl|Aclame:pro 13 FQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLSKRPTMGDERV-E 91 (318) Q Consensus 13 ~a~~lft~~~~~~~~v~~ws~~l~~~~~~~~~~~~~~G~~~~s~I~~~~dL~k~~Gd~v~f~L~~~L~G~gv~Gd~~l-e 91 (318) ||-+| -...+|...|.......+... .+-.+.+. + +. ..|.+|.++=+ +-.|. +|-.. . T Consensus 1 Mantl--------~ya~~~~~~LD~~~~~~~~s~-~l~~~~~~-v-~~-----~ggktVkIp~i---~~~gl-~DY~R~~ 60 (312) T protein:vir:10 1 MANTL--------AYGQVLQQGLDKQATQELLTG-WMDSNAKQ-I-KY-----EGGKEVKIGKL---STDGL-GDYSRGS 60 (312) T ss_pred CCcch--------hHHHHHHHHHHHHHHhhhccc-cccCCCce-E-EE-----ecCcEEEEEee---ecccc-ccccccc Confidence 22111 012334444433333333322 22112112 2 22 35889998843 33333 34333 3 Q ss_pred c---chhhhhheeeEEEEecccccccccchhhhhhhh--hhHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccee Q lcl|Aclame:pro 92 G---RGEDLSHADFSLKINQGRHLVDAGGRMSQQRTK--FNLASSARTLLGTYFNDLQDQCAIVHLAGARGDFVADDTIL 166 (318) Q Consensus 92 G---nee~L~~~sd~v~Idq~R~~V~~~g~ms~qrs~--~dlr~~ar~~L~~w~~~~~D~~~~~~LsG~rg~~~n~~~~~ 166 (318) | +..+++....+..++|-|----.=.+|+...|- ..+-...+....+...-.+|.-.|-.|+..-....... T Consensus 61 g~~~~~g~v~~~~et~tl~qDR~~~F~vD~mDvDETn~~~s~anv~~ef~r~~vvPEiDayrfskla~~a~~~~~~~--- 137 (312) T protein:vir:10 61 ANAYVGGDVKFEYETKTMTQDRGRKFTLDAMDVDETNFLVTATTVMGEFQRLKVIPEIDAYRLSRLATIAIGIKGDT--- 137 (312) T ss_pred CCccccccccccceeEEeeecccceeeccccchhhHhhHHHHHHHHHHHHHhhhcchhhHHHHHHHHhhhhcccccc--- Confidence 4 334688888889998887553333445544332 33444444444444555666666766651111000000 Q ss_pred ecccccccccccccccCCCCCCceEeecCCccccccccccccCHHHHHHHHHHHHhcCCCCceeEeccccccCCcceEEE Q lcl|Aclame:pro 167 PTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHGEDPYYVL 246 (318) Q Consensus 167 p~~~~~~~~~~~~N~v~aPt~~r~~~~~~at~~~~i~a~D~~s~~~Id~a~~~a~~~~~pi~Pv~v~g~~~~~~~~~yV~ 246 (318) ..+...+++++.+ ++.|+.+.+.++..+-| + -+|| T Consensus 138 ----------------------------~~~~~~~~T~~ni--~~~i~~~~~~lde~~vp-------~--------~rvl 172 (312) T protein:vir:10 138 ----------------------------NVEYSYSVNSSTI--INKIKTGIKIIRENGYN-------G--------PLVC 172 (312) T ss_pred ----------------------------ccccccccCHHHH--HHHHHHHHHHHHHccCC-------C--------ceEE Confidence 0011112333333 45677888888885544 1 1799 Q ss_pred EecHHHHHHHHhCcchHHHHHHHHHHhhccccccCCcccCCeEEEcCEEEEecCceeEEEc--------------CCCee Q lcl|Aclame:pro 247 YVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFY--------------QGQRF 312 (318) Q Consensus 247 ~l~P~q~~dLr~d~~~~~w~~~qk~A~~r~~g~~nPlF~G~~gm~ngvii~e~~~~~iRf~--------------ag~~v 312 (318) +|+|.-+.-|+++.. ..+. .. -..+-...+.++.+|||-|.|.|.- ||+ +|+-+ T Consensus 173 ~vTp~~~~lLk~~~~-~~~~-----~~----~~~~~~i~~~V~~iDgv~Ii~VPs~--r~~t~~~f~dG~t~~~~~gg~~ 240 (312) T protein:vir:10 173 HLTYDSMFAIEEKVL-EKLT-----AV----TFAQGGIQTQVPSIDGCALIKTPQN--RMYSSILLNDGTTSNQTAGGYL 240 (312) T ss_pred EeChHHHHHHhhhhh-ceec-----cc----ccccceeeeeeeeecccEEEEchhh--hccceeeeccCcccccccCcee Confidence 999999888887532 1121 11 1234456999999999999999963 885 12221 Q ss_pred ----------------------eeeeeC Q lcl|Aclame:pro 313 ----------------------WYQRIT 318 (318) Q Consensus 313 ----------------------~~a~~~ 318 (318) |++.+- T Consensus 241 ~~~~ak~INfiiv~~~a~i~~~K~~~~~ 268 (312) T protein:vir:10 241 KGTKALDTNFIIAPVDVPLAITKQDKMR 268 (312) T ss_pred ecCcccccceEEeCCceeeceeeeeeee Confidence 111111 No 65 >protein:vir:174 Length: 423 # NCBI annotation: capsid protein # Family: family:all:1412 # MgeID: mge:5 # MgeName: HK620 # Cross-refs: genbank:acc:NP_112079;genbank:gi:13559869;genbank:GeneID:920999 Probab=87.75 E-value=0.037 Score=28.45 Aligned_cols=225 Identities=11% Similarity=0.038 Sum_probs=109.4 Q ss_pred cccch-H--HHHhhhhhhhhhhhhcccccccCCCCCccEEEeeccC-CCCCcEEEEEEeeccccCCeecCceeec-chhh Q lcl|Aclame:pro 22 NRNRS-M--VNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLN-KQAGDEVTFSIMHKLSKRPTMGDERVEG-RGED 96 (318) Q Consensus 22 ~~~~~-~--v~~ws~~l~~~~~~~~~~~~~~G~~~~s~I~~~~dL~-k~~Gd~v~f~L~~~L~G~gv~Gd~~leG-nee~ 96 (318) +.|+= + .++|+..+-..-.+..-+...+-.... .|.. ...||+|++.......-.-..|. ...| .-++ T Consensus 1 MaN~llT~ip~iia~~al~~l~~~lV~~~lVnr~y~------~e~~~~k~GDTV~I~~p~~~~~~~~~~~-~~~~~~~~~ 73 (423) T protein:vir:17 1 MPNNLDSNVSQIVLKKFLPGFMSDLVLAKTVDRQLL------AGEINSSTGDSVSFKRPHQFSSLRTPTG-DISGQNKNN 73 (423) T ss_pred CccchhhhhHHHHHHHHHHHHHhhcccchhhcccCC------cchhhcccCCEEEEeeCCcceeecccCc-ccCCcccCc Confidence 33331 1 255655432222222222111111100 1111 24799999987655443222221 1112 3577 Q ss_pred hhheeeEEEEecccc-cccccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccceeecccccccc Q lcl|Aclame:pro 97 LSHADFSLKINQGRH-LVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGDFVADDTILPTAEHPEFK 175 (318) Q Consensus 97 L~~~sd~v~Idq~R~-~V~~~g~ms~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~LsG~rg~~~n~~~~~p~~~~~~~~ 175 (318) |.-.+-.|.||+..| ++...++ ++.-...||-+.. .....-+++..|+.++..+.+... T Consensus 74 l~e~~v~l~id~~k~va~~v~d~-E~~~~i~~~~~~l-~~A~~aLA~~vd~~ia~~~~~~a~------------------ 133 (423) T protein:vir:17 74 LISGKATGRVGNYITVAVEYQQL-EEAIKLNQLEEIL-APVRQRIVTDLETELAHFMMNNGA------------------ 133 (423) T ss_pred cccceeEEEeeceeeeeeeecHH-HHhcChhHHHHHH-HHHHHHHHHHHHHHHHHHHhhccc------------------ Confidence 777778899999988 5666543 2212334452222 222344566667665443322110 Q ss_pred cccccccCCCCCCceEeecCCccccccccccccCHHHHHHHHHHHHhcCCCCceeEeccccccCCcceEEEEecHHHHHH Q lcl|Aclame:pro 176 KIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHGEDPYYVLYVTPRQWND 255 (318) Q Consensus 176 ~~~~N~v~aPt~~r~~~~~~at~~~~i~a~D~~s~~~Id~a~~~a~~~~~pi~Pv~v~g~~~~~~~~~yV~~l~P~q~~d 255 (318) |.+..|. +..+ .++.|-.+...+.+.+-|- ++ +.+++.|+-+.. T Consensus 134 ----~~~gt~~----------------t~~~--a~~~i~~a~~~Ld~~~vP~------~~--------R~~Vv~p~~~a~ 177 (423) T protein:vir:17 134 ----LSLGSPN----------------TPIT--KWSDVAQTASFLKDLGVNE------GE--------NYAVMDPWSAQR 177 (423) T ss_pred ----cccccCC----------------cccc--cHHHHHHHHHHHHhccCCc------CC--------CEEEeChHHHHH Confidence 1010110 0011 3677778888888877661 11 567999999999 Q ss_pred HHhCcchHHHHHHHHHHhhccccccCCcccCCe-EEEcCEEEEecCceeEE----Ec------CCC-------------e Q lcl|Aclame:pro 256 WYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGEC-AMWRNILVRKYAGMPIR----FY------QGQ-------------R 311 (318) Q Consensus 256 Lr~d~~~~~w~~~qk~A~~r~~g~~nPlF~G~~-gm~ngvii~e~~~~~iR----f~------ag~-------------~ 311 (318) |..+... .. + +. .+...-|=.|.+ |.+.|+=+++..++|-. |. .+. . T Consensus 178 Ll~~~~~--~~---~-~~---~~~~~alr~g~i~G~i~GFdvy~Snnip~~T~gt~~~t~~~~~~~~v~~~a~~~~~~~~ 248 (423) T protein:vir:17 178 LADAQTG--LH---A-SD---QLVRTAWENAQIPTNFGGIRALMSNGLASRTQGAFGGTLTVKTQPTVTYNAVKDSYQFT 248 (423) T ss_pred Hhccccc--ee---c-cc---ccchHHHhhccceeeecceEEEEeCCCccccccceeceeeeccccccccccccccccee Confidence 8876431 11 1 11 122233556766 89999999998888732 11 011 1 Q ss_pred eeeeeeC Q lcl|Aclame:pro 312 FWYQRIT 318 (318) Q Consensus 312 v~~a~~~ 318 (318) +..+.++ T Consensus 249 ~~~~~~~ 255 (423) T protein:vir:17 249 VTLTGAT 255 (423) T ss_pred eeeeeee Confidence 1112121 No 66 >protein:vir:79928 Length: 393 # NCBI annotation: major head protein # Family: family:all:30335 # MgeID: mge:1874 # MgeName: 0305phi8-36 # Cross-refs: genbank:acc:YP_001429616;genbank:gi:156564106;genbank:GeneID:5525693 Probab=85.45 E-value=0.053 Score=27.59 Aligned_cols=264 Identities=16% Similarity=0.189 Sum_probs=130.5 Q ss_pred CCcCCccchhHHHHHHHHHHhcccchHHHHhhhhhhhhhhhhcccccccCCCCCcc---------EEE-----------e Q lcl|Aclame:pro 1 MTTVTSAQANKLFQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAP---------VVR-----------I 60 (318) Q Consensus 1 ~t~~~~~~~~~~~a~~lft~~~~~~~~v~~ws~~l~~~~~~~~~~~~~~G~~~~s~---------I~~-----------~ 60 (318) =-|..-+++++. +-.=+.+..--++.|.+.|+++.-.-. ..-+|-+.+..| |+. + T Consensus 31 ~et~~e~~~~~~--~~~~~e~el~E~f~Kmm~G~~p~~eV~---~~e~mtt~~a~IliP~vis~v~~Eaaepl~~~~kl~ 105 (393) T protein:vir:79 31 GETLAEADANKL--ALNEEETQILESFAKMMEGETPTNEVN---LREFMATPSAQILIPRVIVGTMREAAEPLYIGTKML 105 (393) T ss_pred hhhhhhhhhhhh--hcchhHHHHHHHHHHHhcCCCchhhee---hhhhhcCCCcceechhhhhhhhhhcccchhHHHHHH Confidence 112222222221 000011112223344444444322211 112232222222 111 1 Q ss_pred eccCCCCCcEEEEEEeeccccCCeecCceeecchhhhh-heeeEEEEecccccccccchhhhh---hhhhhHHHHHHHHH Q lcl|Aclame:pro 61 TDLNKQAGDEVTFSIMHKLSKRPTMGDERVEGRGEDLS-HADFSLKINQGRHLVDAGGRMSQQ---RTKFNLASSARTLL 136 (318) Q Consensus 61 ~dL~k~~Gd~v~f~L~~~L~G~gv~Gd~~leGnee~L~-~~sd~v~Idq~R~~V~~~g~ms~q---rs~~dlr~~ar~~L 136 (318) ++..-+.|.+..|.=+.-++..-|-.. -|=.+.+|+ +..|.|.+-+.|-++.+ .+||+ +|..|+...+-... T Consensus 106 qk~~L~~Grsm~F~~~g~~Ra~~IgEG--gE~~~~sld~~T~dsv~~~~gK~G~~I--a~SqEmIsDSg~Dvin~~l~aA 181 (393) T protein:vir:79 106 QKIRLKSGQSMIFPSIGIMRAYDVAEG--QEIPEDSIDWQTHESPEIRVGKSGIRL--RFTDEMISDSQWDLMSMMIKQA 181 (393) T ss_pred HHHhhhcCcceeccchheeeecccccc--ccccccchhhhcCCceeEEechhhhhh--hhHHHHhhcchHHHHHHHHHHH Confidence 122235788888876665555544222 222356777 77789999999999887 45554 67899999999999 Q ss_pred HHHHHHHHHHHHHHHHhccccccccccceeecccccccccccccccCCCCCCceEeecCCccccccccccccCHHHHHHH Q lcl|Aclame:pro 137 GTYFNDLQDQCAIVHLAGARGDFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNL 216 (318) Q Consensus 137 ~~w~~~~~D~~~~~~LsG~rg~~~n~~~~~p~~~~~~~~~~~~N~v~aPt~~r~~~~~~at~~~~i~a~D~~s~~~Id~a 216 (318) .+-|+++.|+.+|..+- +.| |.-|..+-.+++.-|+ |-+..+ .-+++||+.-+.++ T Consensus 182 ~RaMaRkKee~a~n~fk-~~g-------------htvfDa~st~t~ahpt------Gr~~~~----~qNGTlSleDllDm 237 (393) T protein:vir:79 182 GRAMGRHKEQKAYHQFR-SHG-------------HTVFDNYSTNKLAHTT------GLDKNG----VQNDTFSAEDFLDL 237 (393) T ss_pred HHHHHhhhHHHHHhhhh-ccc-------------ceeeeccccCccceee------cCCccc----cccccccHHHHHHH Confidence 99999999999986662 111 1123333334443343 212111 45789999987777 Q ss_pred HHHHHhcCCCCceeEeccccccCCcceEEEEecHHHHHHHHhCcchHHHHHHHHHHhh--ccccccCC------cccCCe Q lcl|Aclame:pro 217 SLFIDEMAHPLQPVRLSGDELHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVN--RAKGFNHP------LFKGEC 288 (318) Q Consensus 217 ~~~a~~~~~pi~Pv~v~g~~~~~~~~~yV~~l~P~q~~dLr~d~~~~~w~~~qk~A~~--r~~g~~nP------lF~G~~ 288 (318) ...+...... + =|++|||..|+.++++.....- |.+|.- ..++.+.- +-+|.+ T Consensus 238 ~~av~~~hyt-------~---------svi~MHPLAWnv~AKna~me~~---~~na~gN~~~~~~~ts~algp~~i~~~~ 298 (393) T protein:vir:79 238 IIAVMANEYT-------P---------SDLMMHPLAWTVFAKNELMGSL---QANPYGNYPAKGAPSSMALGPDSIQGRL 298 (393) T ss_pred HHHHhcccCC-------c---------ceEEEcCchhhhhhhhhhhcce---eeccccccCccccchhhhhchhhhcccc Confidence 6555442221 1 3899999999999998754333 222221 01111111 112222 Q ss_pred EEEcCEEEEecCceeE-----EE--cCCC--eee---------eeeeC Q lcl|Aclame:pro 289 AMWRNILVRKYAGMPI-----RF--YQGQ--RFW---------YQRIT 318 (318) Q Consensus 289 gm~ngvii~e~~~~~i-----Rf--~ag~--~v~---------~a~~~ 318 (318) -.-=+|++ -|-+|. || |+-+ +|. .-||- T Consensus 299 ~~nlnv~~--sPfvp~d~k~~rFd~~~Vd~NnvgvlLV~D~i~tdq~d 344 (393) T protein:vir:79 299 PFNFNVNL--SPFIPLDKKSRRFDVYAVDRNNVGVLLVRDDLKTDQWD 344 (393) T ss_pred ccceeEEE--ecccccccccceeeEEEeecCCceEEEEecCcceeccc Confidence 22224454 344444 45 2111 111 11111 No 67 >protein:vir:105522 Length: 423 # NCBI annotation: phage major head protein # Family: family:all:1412 # MgeID: mge:1463 # MgeName: phiSG1 # Cross-refs: genbank:acc:YP_516191;genbank:gi:89885994;genbank:GeneID:3964382 Probab=84.78 E-value=0.058 Score=27.37 Aligned_cols=224 Identities=12% Similarity=0.038 Sum_probs=107.6 Q ss_pred HHHHHHHHhcccch---HHHHhhhhhhhhhhhhcccccccCCCCCccEEEeecc-CCCCCcEEEEEEeeccccCCeecCc Q lcl|Aclame:pro 13 FQVALFTAANRNRS---MVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDL-NKQAGDEVTFSIMHKLSKRPTMGDE 88 (318) Q Consensus 13 ~a~~lft~~~~~~~---~v~~ws~~l~~~~~~~~~~~~~~G~~~~s~I~~~~dL-~k~~Gd~v~f~L~~~L~G~gv~Gd~ 88 (318) || |+= .-++|+..+-..-.+..-+...+-..... |. ....||+|++..-....-.-..+. T Consensus 1 MA---------Nsl~~l~p~iia~~al~~l~~~lV~~~lV~r~y~~------ef~~ak~GDTV~I~~P~~~~~~d~~~~- 64 (423) T protein:vir:10 1 MA---------NNLDANVSQIVLKKFLPGFMSDLVLCKTVDRQLLA------GEINSSTGDSVSFKRPHQFKSERTMDG- 64 (423) T ss_pred Cc---------cccccccHHHHHHHHHHHHHhhcccchhhccCCCc------cccccccCCEEEEeeCCceeeecccCc- Confidence 22 111 13345554322222111121111111110 11 135899999977665532222221 Q ss_pred eeecc-hhhhhheeeEEEEecccc-cccccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccee Q lcl|Aclame:pro 89 RVEGR-GEDLSHADFSLKINQGRH-LVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGDFVADDTIL 166 (318) Q Consensus 89 ~leGn-ee~L~~~sd~v~Idq~R~-~V~~~g~ms~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~LsG~rg~~~n~~~~~ 166 (318) .+.++ .++|.-.+-.|.|||..+ ++....+ +..-...||-+ .-.....-+++..|+.+...|..... T Consensus 65 ~~t~~~~~~l~e~~v~l~id~~k~~a~~v~d~-E~~l~i~~~~~-~l~~A~~aLA~~vd~~ia~~~~~~~~--------- 133 (423) T protein:vir:10 65 DITGKSKNSLISAKATGEVGNYITVAVEYRQI-EEALKLNQLDQ-ILVPINERMVTDLETELALFMMKHGA--------- 133 (423) T ss_pred ccCcccccccccceEEEEecceeeeeeeeChH-HHhcChhHHHH-HHHHHHHHHHHHHHHHHHHHhhhccc--------- Confidence 12333 345655667899999887 6666543 22224556633 33333445666677766545532221 Q ss_pred ecccccccccccccccCCCCCCceEeecCCccccccccccccCHHHHHHHHHHHHhcCCCCceeEeccccccCCcceEEE Q lcl|Aclame:pro 167 PTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHGEDPYYVL 246 (318) Q Consensus 167 p~~~~~~~~~~~~N~v~aPt~~r~~~~~~at~~~~i~a~D~~s~~~Id~a~~~a~~~~~pi~Pv~v~g~~~~~~~~~yV~ 246 (318) |.+..|. +. .+ .++.+-.+...+...+-|- ++ +.+ T Consensus 134 -------------~~vgt~~-----------t~-----~~--a~~~~a~a~~~L~~~~vP~------~~--------R~~ 168 (423) T protein:vir:10 134 -------------LSLGSPN-----------TP-----IK--KWSDVAQTASFLKDLGINS------GE--------NYA 168 (423) T ss_pred -------------ccccccc-----------cc-----cc--cHHHHHHHHHHHhhccCCc------CC--------CEE Confidence 1111111 00 11 2566677778888866661 11 577 Q ss_pred EecHHHHHHHHhC-cchHHHHHHHHHHhhccccccCCcccCCe-EEEcCEEEEecCceeEEEc--CCC------eeeeee Q lcl|Aclame:pro 247 YVTPRQWNDWYTS-TSGKDWNQMMVRAVNRAKGFNHPLFKGEC-AMWRNILVRKYAGMPIRFY--QGQ------RFWYQR 316 (318) Q Consensus 247 ~l~P~q~~dLr~d-~~~~~w~~~qk~A~~r~~g~~nPlF~G~~-gm~ngvii~e~~~~~iRf~--ag~------~v~~a~ 316 (318) ++.|+-+..|..+ +.+. .+.. +..-.|=.|.+ |.+.|+=+++...+|..=. .|+ .+.+.+ T Consensus 169 Vv~p~~~a~Ll~~~~~~~-------~~~~---~~~~alr~~~i~G~~~GFdi~~Sn~vp~~T~g~~~ga~~~~~~~~vt~ 238 (423) T protein:vir:10 169 VMDPWAAQRLADAQSGLH-------VSEQ---LVRTAWENAQISGNFGGIRALMSNGLASRTQGAFGGKLTVKGTPEVNY 238 (423) T ss_pred EeCHHHHHHHhhhhhhhc-------cccc---cchHHHHhcccceeecceEEEEecCCcccccccccceeeeeeeeEEEe Confidence 9999999998654 3221 1111 12233555665 8999999999888764311 011 111111 Q ss_pred eC Q lcl|Aclame:pro 317 IT 318 (318) Q Consensus 317 ~~ 318 (318) -+ T Consensus 239 a~ 240 (423) T protein:vir:10 239 DS 240 (423) T ss_pred cc Confidence 00 No 68 >protein:vir:105464 Length: 346 # NCBI annotation: putative phage major capsid protein # Family: family:all:701 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529874;genbank:gi:90592614;genbank:GeneID:3974528 Probab=75.55 E-value=0.15 Score=25.17 Aligned_cols=225 Identities=14% Similarity=0.104 Sum_probs=104.2 Q ss_pred CCcCCccchhHHHHHHHHHHhcccchHHHHhhhhhhhhhhhhc-ccccccCCCCCccEEEeeccCCCCCcEEEEEEeecc Q lcl|Aclame:pro 1 MTTVTSAQANKLFQVALFTAANRNRSMVNILTEQQEAPKAVSP-DKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKL 79 (318) Q Consensus 1 ~t~~~~~~~~~~~a~~lft~~~~~~~~v~~ws~~l~~~~~~~~-~~~~~~G~~~~s~I~~~~dL~k~~Gd~v~f~L~~~L 79 (318) || ++++ +.++..|= ....... ......++..+.-+ . =..|.+|.++=+.-- T Consensus 1 Ma-inya---~~~~~~Ld------------------~~~~~~~lts~~l~~~~~~~~v-~-----~~ggktVkIp~is~t 52 (346) T protein:vir:10 1 MT-INYA---EKYQAAVQ------------------QAFYDGHLYSAELWNSPSNSII-K-----FDGAKHIKVPRLEIT 52 (346) T ss_pred Cc-chhH---HHHHHHHH------------------HHHHhhhccchhhcccccccce-E-----ecCCCEEEEEEeeee Confidence 33 2221 11222221 1111110 00111111111111 1 124788887655211 Q ss_pred ccCCeecCc-eeecc--hhhhhheeeEEEEecccccccccchhhhhhh--hhhHHHHHHHHHHHHHHHHHHHHHHHHHhc Q lcl|Aclame:pro 80 SKRPTMGDE-RVEGR--GEDLSHADFSLKINQGRHLVDAGGRMSQQRT--KFNLASSARTLLGTYFNDLQDQCAIVHLAG 154 (318) Q Consensus 80 ~G~gv~Gd~-~leGn--ee~L~~~sd~v~Idq~R~~V~~~g~ms~qrs--~~dlr~~ar~~L~~w~~~~~D~~~~~~LsG 154 (318) +| . +|- +-.|- ..+++....++.++|-|----.=..|+..-| ...+-........+...-.+|.-.|-.|+. T Consensus 53 sG--l-~DY~R~~g~~~~g~v~~~~et~tl~qDR~~~F~vD~mDvDETn~~~~~anv~~ef~r~~vvPEiDayrfskLa~ 129 (346) T protein:vir:10 53 SG--R-KDRQRRTITTPVANYSNDWDSYELKNERYWSTLVDPSDIDETNMVVSLANITKQFNLDSKMPEKDRYMFSHLYS 129 (346) T ss_pred cc--c-ccccccCCcccccccccceeEEEeeccccceecccccchHHHHHHhHHHHHHHHHHHHhhcchhhHHHHHHHHH Confidence 12 1 222 11222 2467788888888888754333234442222 112222222233333333456666666641 Q ss_pred c-ccccccccceeecccccccccccccccCCCCCCceEeecCCccccccccccccCHHHHHHHHHHHHhcCCCCceeEec Q lcl|Aclame:pro 155 A-RGDFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLS 233 (318) Q Consensus 155 ~-rg~~~n~~~~~p~~~~~~~~~~~~N~v~aPt~~r~~~~~~at~~~~i~a~D~~s~~~Id~a~~~a~~~~~pi~Pv~v~ 233 (318) . .+- + ++.....+++++.+ ++.|+.+.+.++..+-|- T Consensus 130 ~a~~~---------------------~-------------~~~~~~~a~T~~ni--~~~i~~~~~~lde~~vp~------ 167 (346) T protein:vir:10 130 GKEAA---------------------H-------------DGGITTNTLDEKNI--LPAFDNMMLDFDEARIPS------ 167 (346) T ss_pred hhhhh---------------------c-------------cccccccccCHHHH--HHHHHHHHHHHHHccCCC------ Confidence 1 110 0 01111223333322 567888888888755441 Q ss_pred cccccCCcceEEEEecHHHHHHHHhCcchHHHHHHHHHHhhccccccCCcccCCeEEEcCEEEEecCceeEEEc------ Q lcl|Aclame:pro 234 GDELHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFY------ 307 (318) Q Consensus 234 g~~~~~~~~~yV~~l~P~q~~dLr~d~~~~~w~~~qk~A~~r~~g~~nPlF~G~~gm~ngvii~e~~~~~iRf~------ 307 (318) + -+||+|+|.-+.-|+.++.| ++.-... ..+ ...|.+|.+|||-|.|.|. -||+ T Consensus 168 -~-------~rvl~vTp~~~~lLk~s~~f------~k~~~v~---~~~-~i~~~V~siDGv~Ii~VPs--~r~~t~~~f~ 227 (346) T protein:vir:10 168 -T-------NRILYVTPKTNAILKRAEAM------NRALTLK---DPN-NIQRTVYSLDDVTIRVVPS--DLMQTAYDFS 227 (346) T ss_pred -C-------CeEEEECHHHHHHHhhchhh------eeccccc---ccc-ccceeeeeecCeEEEEcch--hhcccchhhc Confidence 1 17999999999999998765 2322221 112 3599999999999999886 3775 Q ss_pred CCCee---------------------eeeeeC Q lcl|Aclame:pro 308 QGQRF---------------------WYQRIT 318 (318) Q Consensus 308 ag~~v---------------------~~a~~~ 318 (318) .|-.. +++.+- T Consensus 228 ~G~~~~t~ak~INfiiv~~~A~ia~~K~~~~~ 259 (346) T protein:vir:10 228 DGSKIIDTAKQIEMFLIYNGVQIAPEKYSFVG 259 (346) T ss_pred cCccccCCccceeEEEECCceeeeeeeeeeeE Confidence 23211 111111 No 69 >protein:vir:9927 Length: 295 # NCBI annotation: hypothetical protein # Family: family:all:1178 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795689;genbank:gi:28876459;genbank:GeneID:1258000 Probab=70.90 E-value=0.2 Score=24.38 Aligned_cols=211 Identities=10% Similarity=0.018 Sum_probs=107.7 Q ss_pred HHHH-HH--HHhc-ccch-HHHHhhhhhhhhhhhhcccccccCCCCCccEEEeeccCCCCCcEEEEEEeeccccCCeecC Q lcl|Aclame:pro 13 FQVA-LF--TAAN-RNRS-MVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLSKRPTMGD 87 (318) Q Consensus 13 ~a~~-lf--t~~~-~~~~-~v~~ws~~l~~~~~~~~~~~~~~G~~~~s~I~~~~dL~k~~Gd~v~f~L~~~L~G~gv~Gd 87 (318) ||=. |= +.+. ..+= +++++++.+. + +.... -|.|+. ....|++|++.=. ...|+. . T Consensus 1 mAe~nlt~~~dL~~~~sidfv~~f~~~i~-~------L~~~L------gi~r~~--p~a~G~tIt~pK~-~~tgda---~ 61 (295) T protein:vir:99 1 MAEKNLNTMADLGDIKSIDFVNKFSKNIN-D------LLKLL------GVTRRE--TLTNDLKIQTYKW-EVTLDQ---T 61 (295) T ss_pred CCCcccccHhhccCceeehhhHHhhhhHH-H------HHHHh------cccccc--ccccCCeEEeeee-eeeccc---c Confidence 2211 11 1111 1111 2433433210 0 00011 122222 2245999998653 333332 3 Q ss_pred ceeecchhhhhhee------eEEEEecccccccccchhhh-hhhhhhH-HHHHHHHHHHHHHHHHHHHHHHHHhcccccc Q lcl|Aclame:pro 88 ERVEGRGEDLSHAD------FSLKINQGRHLVDAGGRMSQ-QRTKFNL-ASSARTLLGTYFNDLQDQCAIVHLAGARGDF 159 (318) Q Consensus 88 ~~leGnee~L~~~s------d~v~Idq~R~~V~~~g~ms~-qrs~~dl-r~~ar~~L~~w~~~~~D~~~~~~LsG~rg~~ 159 (318) +..||-+-+|+..+ .++.|+..|.++. -+. |++.++- .-++-..|..-+++..|..+|..|..++.- T Consensus 62 dVaEGe~Iplskvt~~~~~t~t~kikK~rK~tT----dEAIqlsGygdpvgead~qL~~~ia~kId~D~~~~lktat~t- 136 (295) T protein:vir:99 62 DPGEGETIPLSKVTRTKDKDYTVKWFKKRRATT----AEAIARHGAARAITEADKRIMRELQNGIKDAFFTFLKTKPTK- 136 (295) T ss_pred cccCCcccchhhheeeeeeeeEEEeeeeccccc----HHHHHhcCCCchhHHHHHHHHHHHHHhhhHHHHHHhccCcee- Confidence 47888877776655 7888999999873 344 4666543 778888999999999999999999644431 Q ss_pred ccccceeecccccccccccccccCCCCCCceEeecCCcccccccccc--ccCHHHHHHHHHHHHhcCCCCceeEeccccc Q lcl|Aclame:pro 160 VADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAAD--IFSIGLVDNLSLFIDEMAHPLQPVRLSGDEL 237 (318) Q Consensus 160 ~n~~~~~p~~~~~~~~~~~~N~v~aPt~~r~~~~~~at~~~~i~a~D--~~s~~~Id~a~~~a~~~~~pi~Pv~v~g~~~ 237 (318) .+ .| ..+++.+..+... ..+.++ T Consensus 137 ------------------------------------------~t-g~~lq~a~a~~~~al~~-----------f~Ee~~- 161 (295) T protein:vir:99 137 ------------------------------------------VK-GVGLQKALSASWAKLAT-----------FNEFEG- 161 (295) T ss_pred ------------------------------------------ee-hhhHHHHHHHhhhhhhh-----------cccccC- Confidence 00 11 1122222222111 122221 Q ss_pred cCCcceEEEEecHHHHHHHHhCcchHHHHHHHHHHhhccccccCCcccCCeEEEcCEEEEecCceeE---EEcCCCeeee Q lcl|Aclame:pro 238 HGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPI---RFYQGQRFWY 314 (318) Q Consensus 238 ~~~~~~yV~~l~P~q~~dLr~d~~~~~w~~~qk~A~~r~~g~~nPlF~G~~gm~ngvii~e~~~~~i---Rf~ag~~v~~ 314 (318) . -+|+|++|.++.+||.|... .|+..+.. +...-.| |.|-- +||+.-. +|- =--+.+|+.+ T Consensus 162 ---~-~~V~FVnP~D~a~yl~~A~~-~~~~a~~f---G~~~L~n--fLG~q-----~II~S~k-v~~G~~~aT~~~Ni~~ 225 (295) T protein:vir:99 162 ---S-PLVSFVSPLDVANYLGDTKV-GADASNVF---GMTLLKN--FLGMQ-----NVIVMPS-VPEGKIYSTAVENLVF 225 (295) T ss_pred ---C-ceEEEEehHHHHHHHhcccc-ccchhhhh---hhhhhhh--hhccc-----eEEEccc-CCCceEEEeeccceEE Confidence 2 38999999999999999764 68643222 2223334 55521 2333321 211 1125556655 Q ss_pred eeeC Q lcl|Aclame:pro 315 QRIT 318 (318) Q Consensus 315 a~~~ 318 (318) |-+- T Consensus 226 ay~~ 229 (295) T protein:vir:99 226 ASLN 229 (295) T ss_pred EEec Confidence 4444 No 70 >protein:vir:79712 Length: 285 # NCBI annotation: major capsid protein gp34 # Family: family:all:701 # MgeID: mge:1873 # MgeName: LL-H # Cross-refs: genbank:acc:YP_001285883;genbank:gi:148750840;genbank:GeneID:5220414 Probab=67.14 E-value=0.26 Score=23.82 Aligned_cols=222 Identities=11% Similarity=0.108 Sum_probs=102.9 Q ss_pred CCcCCccchhHHHHHHHHHHhcccchHHHHhhhhhhhhhhhhcccccccCCCCCccEEEeeccCCCCCcEEEEEEeeccc Q lcl|Aclame:pro 1 MTTVTSAQANKLFQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLS 80 (318) Q Consensus 1 ~t~~~~~~~~~~~a~~lft~~~~~~~~v~~ws~~l~~~~~~~~~~~~~~G~~~~s~I~~~~dL~k~~Gd~v~f~L~~~L~ 80 (318) |+ +++ ...+.+.|+-.+..+..+-..|+ + . |.-.++ =..|.+|.++=+.-. T Consensus 1 Ma-in~---~~k~~~~ld~~~~~~~~~~~l~~-----------------~-~-n~~~~~-----~~gak~VkIp~ist~- 51 (285) T protein:vir:79 1 MT-VVL---DSKDLARIDEEYKADSQVWSYLT-----------------G-G-NGVTQR-----FRGHNEVRINKLSGF- 51 (285) T ss_pred Cc-chh---hHHHHHHHHHHHHHhhhhhhhcc-----------------c-C-CcceeE-----ecCCCEEEEeeeccc- Confidence 43 222 12334444444433322211111 1 1 111111 123677777644222 Q ss_pred cCCeecCceeec-chhhhhheeeEEEEecccccccccchhhhhhh-hhhHHHHHHHHHHHHHHHHHHHHHHHHHhccccc Q lcl|Aclame:pro 81 KRPTMGDERVEG-RGEDLSHADFSLKINQGRHLVDAGGRMSQQRT-KFNLASSARTLLGTYFNDLQDQCAIVHLAGARGD 158 (318) Q Consensus 81 G~gv~Gd~~leG-nee~L~~~sd~v~Idq~R~~V~~~g~ms~qrs-~~dlr~~ar~~L~~w~~~~~D~~~~~~LsG~rg~ 158 (318) .|+.--++-.| +..+++....++.++|-|----.=..|+..-+ ...+-...+....+...-.+|.-.|-.|++.-+ T Consensus 52 -~gl~dY~R~~g~~~g~v~~~~et~tl~~DR~~~f~iD~mDvdEn~~~~~~ni~~ef~~~~vvPEiDayrfskla~~a~- 129 (285) T protein:vir:79 52 -VDATAYKRGQDNARKTISVGKETVKLTHEDWFGYDLDQFDMDENGAYTVENVVREHNKMITIPHRDKVAVQKLFDSAA- 129 (285) T ss_pred -ccccccccccCccccccceeeeEEEeeccccceecccccchhhhhhhhHHHHHHHHHhhhhcchhhHHHHHHHHhhcc- Confidence 12222222223 56667777888888887744222223332111 111111222222222223444444555542111 Q ss_pred cccccceeecccccccccccccccCCCCCCceEeecCCccccccccccccCHHHHHHHHHHHHhcCCCCceeEecccccc Q lcl|Aclame:pro 159 FVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELH 238 (318) Q Consensus 159 ~~n~~~~~p~~~~~~~~~~~~N~v~aPt~~r~~~~~~at~~~~i~a~D~~s~~~Id~a~~~a~~~~~pi~Pv~v~g~~~~ 238 (318) .....+|+++.+ ++.|+.+.++++..+-| . T Consensus 130 -------------------------------------~~~~~~~T~~nv--~~~i~~~~~~lde~~vp-~---------- 159 (285) T protein:vir:79 130 -------------------------------------KKATDSITKDNA--LDAYDTAEAYMFDNEVP-G---------- 159 (285) T ss_pred -------------------------------------cccccccCHHHH--HHHHHHHHHHHHHcCCC-C---------- Confidence 112234655553 78889999999986544 1 Q ss_pred CCcceEEEEecHHHHHHHHhCcchHHHHHHHHHHhhccccccCCc--ccCCeEEEcC-EEEEecCceeEEEcCCC---ee Q lcl|Aclame:pro 239 GEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPL--FKGECAMWRN-ILVRKYAGMPIRFYQGQ---RF 312 (318) Q Consensus 239 ~~~~~yV~~l~P~q~~dLr~d~~~~~w~~~qk~A~~r~~g~~nPl--F~G~~gm~ng-vii~e~~~~~iRf~ag~---~v 312 (318) -+||||+|.-+.-|+.++.+. +.-... ...+. +.+.++.+|| |-|.|.|.- ||...+ .| T Consensus 160 ----~rvl~vTp~~~~~Lk~s~~~~------r~~~~~---~~~~~~~i~~~V~~lDg~v~ii~Vps~--r~kt~~~~k~I 224 (285) T protein:vir:79 160 ----GFVMFVSSAYYTALKQSAAVT------RTFSTD---GTMVINGIDRRVAQLDGGVPIVRVSSD--RLKGLGITNHV 224 (285) T ss_pred ----ceEEEEChHHHHHHHhhhhhh------eecccc---cceeccceeeeeccccceeEEEEcchh--hccCcCcchhc Confidence 179999999999999987652 210000 00111 4456899998 889998853 664211 11 Q ss_pred --------------eeeeeC Q lcl|Aclame:pro 313 --------------WYQRIT 318 (318) Q Consensus 313 --------------~~a~~~ 318 (318) ++..+- T Consensus 225 nfiiv~~~a~i~~~K~~~~~ 244 (285) T protein:vir:79 225 NFILTPLSAIAPIVKYDSVS 244 (285) T ss_pred cEEEecCceeccceeeeeeE Confidence 111111 No 71 >protein:vir:105905 Length: 304 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004375;genbank:gi:122891830;genbank:GeneID:4712376 Probab=65.54 E-value=0.28 Score=23.60 Aligned_cols=225 Identities=10% Similarity=0.026 Sum_probs=100.5 Q ss_pred CCcCCcc----chhHHHHHHHHHHhcccchHHHHhhhhhhhhhhhhcccccccCCCCCccEEEeeccCCCCCcEEEEEEe Q lcl|Aclame:pro 1 MTTVTSA----QANKLFQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIM 76 (318) Q Consensus 1 ~t~~~~~----~~~~~~a~~lft~~~~~~~~v~~ws~~l~~~~~~~~~~~~~~G~~~~s~I~~~~dL~k~~Gd~v~f~L~ 76 (318) +++.+.. .-...++--++.....+++..+.. . .. .++ +..+++... T Consensus 9 ~~~~~t~~gg~lip~~~~~~ii~~~~~~~~l~~~~-~-----------~~-~~~-----------------~~~~~ip~~ 58 (304) T protein:vir:10 9 GNVILSDFKNGVIPAEQGTLIMKDIMANSAIMKLA-K-----------NE-PMT-----------------AQKKKFTYL 58 (304) T ss_pred ccccccCCCceecchhHHHHHHHHHHhccchhhhc-c-----------ee-ecc-----------------CCceEEEEE Confidence 1111110 000112222333333333322110 0 00 011 111222222 Q ss_pred ecc-ccCCeecCceeecchhhhhheeeEEEEecccccccccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhcc Q lcl|Aclame:pro 77 HKL-SKRPTMGDERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGA 155 (318) Q Consensus 77 ~~L-~G~gv~Gd~~leGnee~L~~~sd~v~Idq~R~~V~~~g~ms~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~LsG~ 155 (318) ... ...++. +.-+-.+...+|..-++.+-....-+....++ .+.+.+||...-+..|.+-+++..|+.+| .|. T Consensus 59 ~~~~~a~~v~--E~~~~~~~~~~~~~i~~~~~k~~~~~~iS~el-l~ds~~~l~~~i~~~l~~~ia~~~d~~~l---~G~ 132 (304) T protein:vir:10 59 AKGVGAYWVS--ETERIQTSKPEYAQAEMEAKKIGVIIPLSKEF-LKWTAKDFFNEVKPLIAEAFYKAFDQAVI---FGT 132 (304) T ss_pred eCCcceEEee--cCcccccccceeeEEEEEEEEEEEeehhhHHH-HhcchHHHHHHHHHHHHHHHHHHHHhhhe---ecc Confidence 111 111221 12233355677776666666666666554443 33467899999999999999999998884 221 Q ss_pred ccccccccceeecccccccccccccccCCCCCCceEeecCCccccccc-cccccCHHHHHHHHHHHHhcCCCCceeEecc Q lcl|Aclame:pro 156 RGDFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIE-AADIFSIGLVDNLSLFIDEMAHPLQPVRLSG 234 (318) Q Consensus 156 rg~~~n~~~~~p~~~~~~~~~~~~N~v~aPt~~r~~~~~~at~~~~i~-a~D~~s~~~Id~a~~~a~~~~~pi~Pv~v~g 234 (318) ..+. |... ..+.+.. + ......+ ++...+++.|.++.......... T Consensus 133 ---g~~~----~~~~-------~~~~~~~---------~--~~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~-------- 179 (304) T protein:vir:10 133 ---KSPY----NTST-------SGKPLVE---------G--AEEKGNVVTDTNNLYVDLSALMATIEDEELD-------- 179 (304) T ss_pred ---CCCc----cccc-------ccccccc---------c--ccccccccccccchHHHHHHHHHHhhhccCC-------- Confidence 1110 1100 0010000 0 0111111 22344577676666555442111 Q ss_pred ccccCCcceEEEEecHHHHHHHHhCcchHHHHHHHHHHhhccccccCCcccCCeEEEcCEEEEecCceeEE-----Ec-- Q lcl|Aclame:pro 235 DELHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIR-----FY-- 307 (318) Q Consensus 235 ~~~~~~~~~yV~~l~P~q~~dLr~d~~~~~w~~~qk~A~~r~~g~~nPlF~G~~gm~ngvii~e~~~~~iR-----f~-- 307 (318) .-+++|||.-+..|++= + . +..+|||.+..+.+.|..++.-+.+|.- +. T Consensus 180 --------~~~~v~~~~~~~~L~~l----------k----d--~~G~~l~~~~~~~l~G~PV~~~~~~~~~~~~~~~~~g 235 (304) T protein:vir:10 180 --------PNGVLTTRSFRSKMRNA----------L----D--ANDRPLFDANGNEIMGLPLSYTGADVYDKKKSLALMG 235 (304) T ss_pred --------cCEEEEcHHHHHHHHHh----------h----c--cCCcEeecCCCccccceeeEEecccccCCCCcEEEEE Confidence 12578999999998851 1 1 1236889888888888888766665410 11 Q ss_pred ----------CCCeeeeeeeC Q lcl|Aclame:pro 308 ----------QGQRFWYQRIT 318 (318) Q Consensus 308 ----------ag~~v~~a~~~ 318 (318) .+=++++.+-- T Consensus 236 d~~~~~~~~~~~~~i~~~~e~ 256 (304) T protein:vir:10 236 DWDYARYGILQGIEYAISEDA 256 (304) T ss_pred ehhhEEEEEecceEEEEeecc Confidence 11122222210 No 72 >protein:vir:94142 Length: 304 # NCBI annotation: ORF013 # Family: family:all:507 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240234;genbank:gi:66395898;genbank:GeneID:5133311 Probab=65.54 E-value=0.28 Score=23.60 Aligned_cols=225 Identities=10% Similarity=0.026 Sum_probs=100.5 Q ss_pred CCcCCcc----chhHHHHHHHHHHhcccchHHHHhhhhhhhhhhhhcccccccCCCCCccEEEeeccCCCCCcEEEEEEe Q lcl|Aclame:pro 1 MTTVTSA----QANKLFQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIM 76 (318) Q Consensus 1 ~t~~~~~----~~~~~~a~~lft~~~~~~~~v~~ws~~l~~~~~~~~~~~~~~G~~~~s~I~~~~dL~k~~Gd~v~f~L~ 76 (318) +++.+.. .-...++--++.....+++..+.. . .. .++ +..+++... T Consensus 9 ~~~~~t~~gg~lip~~~~~~ii~~~~~~~~l~~~~-~-----------~~-~~~-----------------~~~~~ip~~ 58 (304) T protein:vir:94 9 GNVILSDFKNGVIPAEQGTLIMKDIMANSAIMKLA-K-----------NE-PMT-----------------AQKKKFTYL 58 (304) T ss_pred ccccccCCCceecchhHHHHHHHHHHhccchhhhc-c-----------ee-ecc-----------------CCceEEEEE Confidence 1111110 000112222333333333322110 0 00 011 111222222 Q ss_pred ecc-ccCCeecCceeecchhhhhheeeEEEEecccccccccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhcc Q lcl|Aclame:pro 77 HKL-SKRPTMGDERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGA 155 (318) Q Consensus 77 ~~L-~G~gv~Gd~~leGnee~L~~~sd~v~Idq~R~~V~~~g~ms~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~LsG~ 155 (318) ... ...++. +.-+-.+...+|..-++.+-....-+....++ .+.+.+||...-+..|.+-+++..|+.+| .|. T Consensus 59 ~~~~~a~~v~--E~~~~~~~~~~~~~i~~~~~k~~~~~~iS~el-l~ds~~~l~~~i~~~l~~~ia~~~d~~~l---~G~ 132 (304) T protein:vir:94 59 AKGVGAYWVS--ETERIQTSKPEYAQAEMEAKKIGVIIPLSKEF-LKWTAKDFFNEVKPLIAEAFYKAFDQAVI---FGT 132 (304) T ss_pred eCCcceEEee--cCcccccccceeeEEEEEEEEEEEeehhhHHH-HhcchHHHHHHHHHHHHHHHHHHHHhhhe---ecc Confidence 111 111221 12233355677776666666666666554443 33467899999999999999999998884 221 Q ss_pred ccccccccceeecccccccccccccccCCCCCCceEeecCCccccccc-cccccCHHHHHHHHHHHHhcCCCCceeEecc Q lcl|Aclame:pro 156 RGDFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIE-AADIFSIGLVDNLSLFIDEMAHPLQPVRLSG 234 (318) Q Consensus 156 rg~~~n~~~~~p~~~~~~~~~~~~N~v~aPt~~r~~~~~~at~~~~i~-a~D~~s~~~Id~a~~~a~~~~~pi~Pv~v~g 234 (318) ..+. |... ..+.+.. + ......+ ++...+++.|.++.......... T Consensus 133 ---g~~~----~~~~-------~~~~~~~---------~--~~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~-------- 179 (304) T protein:vir:94 133 ---KSPY----NTST-------SGKPLVE---------G--AEEKGNVVTDTNNLYVDLSALMATIEDEELD-------- 179 (304) T ss_pred ---CCCc----cccc-------ccccccc---------c--ccccccccccccchHHHHHHHHHHhhhccCC-------- Confidence 1110 1100 0010000 0 0111111 22344577676666555442111 Q ss_pred ccccCCcceEEEEecHHHHHHHHhCcchHHHHHHHHHHhhccccccCCcccCCeEEEcCEEEEecCceeEE-----Ec-- Q lcl|Aclame:pro 235 DELHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIR-----FY-- 307 (318) Q Consensus 235 ~~~~~~~~~yV~~l~P~q~~dLr~d~~~~~w~~~qk~A~~r~~g~~nPlF~G~~gm~ngvii~e~~~~~iR-----f~-- 307 (318) .-+++|||.-+..|++= + . +..+|||.+..+.+.|..++.-+.+|.- +. T Consensus 180 --------~~~~v~~~~~~~~L~~l----------k----d--~~G~~l~~~~~~~l~G~PV~~~~~~~~~~~~~~~~~g 235 (304) T protein:vir:94 180 --------PNGVLTTRSFRSKMRNA----------L----D--ANDRPLFDANGNEIMGLPLSYTGADVYDKKKSLALMG 235 (304) T ss_pred --------cCEEEEcHHHHHHHHHh----------h----c--cCCcEeecCCCccccceeeEEecccccCCCCcEEEEE Confidence 12578999999998851 1 1 1236889888888888888766665410 11 Q ss_pred ----------CCCeeeeeeeC Q lcl|Aclame:pro 308 ----------QGQRFWYQRIT 318 (318) Q Consensus 308 ----------ag~~v~~a~~~ 318 (318) .+=++++.+-- T Consensus 236 d~~~~~~~~~~~~~i~~~~e~ 256 (304) T protein:vir:94 236 DWDYARYGILQGIEYAISEDA 256 (304) T ss_pred ehhhEEEEEecceEEEEeecc Confidence 11122222210 No 73 >protein:vir:41 Length: 299 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463467;swissprot:trembl:q9t1b7;genbank:gi:16798789;uniprot:Q9T1B7;genbank:GeneID:922353 Probab=48.30 E-value=0.67 Score=21.52 Aligned_cols=219 Identities=10% Similarity=0.030 Sum_probs=100.3 Q ss_pred hcccchHH-----------HHhhhhhhhhhhhhcccccccCCCCCccEEEeeccCCCCCcEEEEEEeeccccCCeecCce Q lcl|Aclame:pro 21 ANRNRSMV-----------NILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLSKRPTMGDER 89 (318) Q Consensus 21 ~~~~~~~v-----------~~ws~~l~~~~~~~~~~~~~~G~~~~s~I~~~~dL~k~~Gd~v~f~L~~~L~G~gv~Gd~~ 89 (318) .+-|.-.+ ..++..+...... .+|+...--..+-.|...++..........|.. - T Consensus 1 ~g~~a~~~~~~~~~~~~iP~~~~~~ii~~~~~------------~s~l~~~~~~~~~~~~~~~~~~~~~~~a~~v~E--~ 66 (299) T protein:vir:41 1 MGFNPDTTTMQSAKTGSIPINISEQIITGVKN------------GSAAMKLAKAVPMTKPEEEFTFMSGVGAFWVDE--A 66 (299) T ss_pred CCcCCCcccccCCCceecchhHHHHHHHHHHh------------cchhhhhceeeecCCCcEEEEEEcCCceeeeec--C Confidence 22221111 1122222222221 222211111112223344444433333334422 2 Q ss_pred eecchhhhhheeeEEEEecccccccccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccceeecc Q lcl|Aclame:pro 90 VEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGDFVADDTILPTA 169 (318) Q Consensus 90 leGnee~L~~~sd~v~Idq~R~~V~~~g~ms~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~LsG~rg~~~n~~~~~p~~ 169 (318) -+..+...+|..-++.+.....-+.+.-++- +.+..||...-...|++-+++..|+.+| .|. -.+. |. T Consensus 67 ~~~~~~~~~f~~v~l~~~k~~~~~~is~ell-~ds~~~~~~~i~~~l~~a~~~~~d~a~l---~G~---g~~~----~~- 134 (299) T protein:vir:41 67 ERIQTSKPTFTKAKMRSKKMGVIIPTTKENL-NYSVTNFFSLMQAEIVEAFYKKFDQAVF---TGV---ESPY----NW- 134 (299) T ss_pred ccccccccceeEEEEeeEEEEEeehhhHHHH-hcCHHHHHHHHHHHHHHHHHHHHHHHHh---hcc---cCcc----cc- Confidence 2233555666555555555555455543333 3466899999999999999999998886 221 1110 10 Q ss_pred cccccccccccccCCCCCCceEeecCCccccccccccccCHHHHHHHHHHHHhcCCCCceeEeccccccCCcceEEEEec Q lcl|Aclame:pro 170 EHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHGEDPYYVLYVT 249 (318) Q Consensus 170 ~~~~~~~~~~N~v~aPt~~r~~~~~~at~~~~i~a~D~~s~~~Id~a~~~a~~~~~pi~Pv~v~g~~~~~~~~~yV~~l~ 249 (318) ++. + .++...+..+.+..+++.|-++......... + --+++|| T Consensus 135 ------gil-~--------------~~~~~~~~~~~~~~~~~~l~~~~~~l~~~~~--~--------------~~~~v~n 177 (299) T protein:vir:41 135 ------NIL-K--------------SATDASNLVEETANKYDDLNEAIGLIEAEDL--E--------------PNGIATI 177 (299) T ss_pred ------ccc-c--------------cccccceeeccccccHHHHHHHHHhhhcccC--C--------------cCEEEEc Confidence 110 0 0011112233445667767666654443211 1 1367899 Q ss_pred HHHHHHHHhCcchHHHHHHHHHHhhccccccCCccc----CCeEEEcCEEEEecCceeE-----EEcCCC---------- Q lcl|Aclame:pro 250 PRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFK----GECAMWRNILVRKYAGMPI-----RFYQGQ---------- 310 (318) Q Consensus 250 P~q~~dLr~d~~~~~w~~~qk~A~~r~~g~~nPlF~----G~~gm~ngvii~e~~~~~i-----Rf~ag~---------- 310 (318) |.-+..|++= +. +..+|||. ++.+.+.|..++.-+.+|. .++-|+ T Consensus 178 ~~~~~~L~~l----------kd------~~G~~l~~~~~~~~~~~l~G~PV~~~~~~~~~~~~~~~~~gdfs~~~i~~~~ 241 (299) T protein:vir:41 178 RKQRVKYRST----------KD------GNGMPIFNTATSNGVDDVLGLPIAYTPKYTFGDKDISELVGDWNQAYYGILR 241 (299) T ss_pred HHHHHHHHHh----------hc------cCCceeecCCcCCCCceecceeeEEecccCCCCCceEEEEEecccEEEEEec Confidence 9988888852 11 12256664 5556777888887776651 011111 Q ss_pred --eeeeeeeC Q lcl|Aclame:pro 311 --RFWYQRIT 318 (318) Q Consensus 311 --~v~~a~~~ 318 (318) .++..|-. T Consensus 242 ~~~i~~~~~~ 251 (299) T protein:vir:41 242 GVEYEILTEA 251 (299) T ss_pred CcEEEEeecc Confidence 11111110 No 74 >protein:vir:95131 Length: 325 # NCBI annotation: hypothetical protein ORF010 # Family: family:all:47 # MgeID: mge:1552 # MgeName: PA73 # Cross-refs: genbank:acc:YP_001293417;genbank:gi:148912838;genbank:GeneID:5228206 Probab=37.38 E-value=1.1 Score=20.31 Aligned_cols=238 Identities=13% Similarity=0.043 Sum_probs=93.0 Q ss_pred HHHHHHhcccchHHHHhhhhhhhhhhhhcccccccCCCCCccEEEeeccCCCCCcEEEEEEeeccccCC-----eecCce Q lcl|Aclame:pro 15 VALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLSKRP-----TMGDER 89 (318) Q Consensus 15 ~~lft~~~~~~~~v~~ws~~l~~~~~~~~~~~~~~G~~~~s~I~~~~dL~k~~Gd~v~f~L~~~L~G~g-----v~Gd~~ 89 (318) -+||-..-=|..+.....+.+.- ...+|. ..++..|+.-+++. .||-|.+++..+|.|.- +.++.. T Consensus 1 m~lsD~~vfN~~~~~a~~e~~~q------~~~~fn-~as~gai~l~~~~~--~Gd~~~~pf~~~l~g~~~~~~~~~~~~~ 71 (325) T protein:vir:95 1 MALSDLAVYSEYAYSAFSETLRQ------QVDLFN-TATGGAIMLQSAAH--QGDFSDVAFFAKVTGGLVRRRNAYGSGT 71 (325) T ss_pred Cchhhhhhhhhhhhhhhhhhhhh------hHhhhh-hcccceeEeccccc--cCceeeccccccccccccccccCCCCce Confidence 12221111122221111111110 112344 33455666555554 49999999999998743 434444 Q ss_pred eecchhhhhheee-EEEEecccccccccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHH----hccccccccccc Q lcl|Aclame:pro 90 VEGRGEDLSHADF-SLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHL----AGARGDFVADDT 164 (318) Q Consensus 90 leGnee~L~~~sd-~v~Idq~R~~V~~~g~ms~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~L----sG~rg~~~n~~~ 164 (318) ++. ..|....+ .+.+-..+-++.. ..++....-|-...+...++++|++++.+..+..+ .++-+. +. T Consensus 72 vt~--~kitt~~~~av~~~r~~g~~~~--d~~~~~~g~~~~~~~~~~Ig~~~a~~~~~~~l~~~~~~l~~a~~~--~~-- 143 (325) T protein:vir:95 72 VAE--KVLKHLVDTSVKVAAGTPPVRL--DPGQFRWIQQNPEVAGAAMGQQLAVDTMADMLNVGLGSVYSALSQ--VS-- 143 (325) T ss_pred ecc--ceeccccceeeEEecccCcccc--cHHHHhhcCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcc--cc-- Confidence 433 23333333 2222222222111 11222222222233344555555555555544444 322211 10 Q ss_pred eeecccccccccccccccCCCCCCceEeecCCccccccccccccCHHHHHHHHHHHHhcCCCCceeEeccccccCCcceE Q lcl|Aclame:pro 165 ILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHGEDPYY 244 (318) Q Consensus 165 ~~p~~~~~~~~~~~~N~v~aPt~~r~~~~~~at~~~~i~a~D~~s~~~Id~a~~~a~~~~~pi~Pv~v~g~~~~~~~~~y 244 (318) .+++++.+ +++ .++..+|.+.+-+|..++=. . ++ .+= T Consensus 144 ------------~~v~dis~-----------~~~----~~~~~~s~~~l~~A~~klGD---~-------~~------~l~ 180 (325) T protein:vir:95 144 ------------DVVYDATA-----------NTD----AADKLPTWNNLNNGQAKFGD---Q-------SS------QIA 180 (325) T ss_pred ------------cceeeeec-----------ccC----cccccccHHHHHHHHHHhcc---c-------cc------cee Confidence 11222222 011 12335788887777666422 1 11 235 Q ss_pred EEEecHHHHHHHHhCcchHHHHHHHHHHhhccccccCCcccCCeEEEcCE---------------------EEEecCcee Q lcl|Aclame:pro 245 VLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNI---------------------LVRKYAGMP 303 (318) Q Consensus 245 V~~l~P~q~~dLr~d~~~~~w~~~qk~A~~r~~g~~nPlF~G~~gm~ngv---------------------ii~e~~~~~ 303 (318) .++||+.-+++|+++.-. ++..+..+. ... .=|-|.|-..++++. +.+... -| T Consensus 181 ~~~MHS~v~~~L~~~~L~-~~~~~~~~~--g~~--~i~t~~G~~VIVdD~~p~~~~g~~~~ytty~lg~GAi~~~~~-~~ 254 (325) T protein:vir:95 181 AWIMHSTPMHKLYGSNLT-NGERLFTYG--TVN--VVRDPFGKLLVMTDSPNLFAAGTPNVYHILGLVPGGVLIGQN-ND 254 (325) T ss_pred EEEEchHHHHHHHHhhcc-ccccccccC--Ccc--cccccCCcEEEEeCCCCCCCccCceeEEEEEEecCeEEecCC-CC Confidence 889999999999986432 111111111 000 013344554444432 111111 11 Q ss_pred EEEcCCCeeeeeeeC Q lcl|Aclame:pro 304 IRFYQGQRFWYQRIT 318 (318) Q Consensus 304 iRf~ag~~v~~a~~~ 318 (318) +++.+.....--|+. T Consensus 255 ~~~~~~~~~~~~~~~ 269 (325) T protein:vir:95 255 FDANEETKNGDENII 269 (325) T ss_pred ccccccccCccccee Confidence 222211111111111 No 75 >protein:vir:78148 Length: 123 # NCBI annotation: hypothetical protein # Family: family:all:4955 # MgeID: mge:1847 # MgeName: Min1 # Cross-refs: genbank:acc:YP_001294802;genbank:gi:149882823;genbank:GeneID:5309176 Probab=27.35 E-value=0.64 Score=21.65 Aligned_cols=63 Identities=6% Similarity=-0.014 Sum_probs=35.4 Q ss_pred EecHHHHHHHHhCcchHHHHHHHHHHhhccccccCCcccCCeEE-EcCEEEEecCceeEE---------E---------- Q lcl|Aclame:pro 247 YVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAM-WRNILVRKYAGMPIR---------F---------- 306 (318) Q Consensus 247 ~l~P~q~~dLr~d~~~~~w~~~qk~A~~r~~g~~nPlF~G~~gm-~ngvii~e~~~~~iR---------f---------- 306 (318) +++-.|+..+-.++.. ..|..| -.+||+|+|++-+ .+|.--..-|+.|=- | T Consensus 1 vvsdlqfA~~~g~~v~-------~~aLpR--E~aNp~ltG~lpV~~~GltWl~tpnlpg~~a~vlDst~lGgmaDE~l~~ 71 (123) T protein:vir:78 1 MLSGAQFAKLIGILVD-------DKALPR--EQANIVLTGSLPVSAYGLTWVTSRHITGTDPWLFDVEQLGGMADEKLLS 71 (123) T ss_pred CcchhhHHHHhcchhc-------cccccc--ccCCceEecCcceeeeceeeeecCCCCCCccceeehhhhccccccccCC Confidence 6777888888887642 123333 4579999999877 677666556654300 0 Q ss_pred --c---CCCeeeeeeeC Q lcl|Aclame:pro 307 --Y---QGQRFWYQRIT 318 (318) Q Consensus 307 --~---ag~~v~~a~~~ 318 (318) | ++..|.+.-+. T Consensus 72 Pgya~~~~~Gvevkt~R 88 (123) T protein:vir:78 72 PEFAPAGNTGVEASTER 88 (123) T ss_pred CcccCCCCcceeEEeec Confidence 0 11222222222 No 76 >protein:vir:106647 Length: 303 # NCBI annotation: ORF011 # Family: family:all:1178 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239493;genbank:gi:66395226;genbank:GeneID:4555801 Probab=22.27 E-value=2.5 Score=18.43 Aligned_cols=218 Identities=13% Similarity=0.088 Sum_probs=107.2 Q ss_pred HHH--HHH--HHhcccch--HHHHhhhhhhhhhhhhcccccccCCCCCccEEEeeccCCCCCcEEE---EEEeeccccCC Q lcl|Aclame:pro 13 FQV--ALF--TAANRNRS--MVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVT---FSIMHKLSKRP 83 (318) Q Consensus 13 ~a~--~lf--t~~~~~~~--~v~~ws~~l~~~~~~~~~~~~~~G~~~~s~I~~~~dL~k~~Gd~v~---f~L~~~L~G~g 83 (318) |++ .|= +.++.-.+ +++++++.+. +-.+. --|.|..-|. .|.++. |.-- ...|+ T Consensus 1 M~~e~nl~~~~dL~~a~siDF~~~f~~~i~-~L~~~------------LGv~r~~pla--~Gt~iktyK~~~~-~y~gd- 63 (303) T protein:vir:10 1 MSAENNLINVEALGKAKSIDFANKLGVGLN-KLFEA------------LAIQNKIPMN--VGSALKQYRFKVE-DSEKP- 63 (303) T ss_pred CCCCcCCcchhhcccceeehhhhhhhhhHH-HHHHH------------hhhhcccccc--CCceeeeeeeece-eeccc- Confidence 221 111 11111111 2333332210 00000 0122333332 344443 2111 11111 Q ss_pred eecCceeecchhhhhhee------eEEEEecccccccccchhhh-hhhhhhH-HHHHHHHHHHHHHHHHHHHHHHHHhcc Q lcl|Aclame:pro 84 TMGDERVEGRGEDLSHAD------FSLKINQGRHLVDAGGRMSQ-QRTKFNL-ASSARTLLGTYFNDLQDQCAIVHLAGA 155 (318) Q Consensus 84 v~Gd~~leGnee~L~~~s------d~v~Idq~R~~V~~~g~ms~-qrs~~dl-r~~ar~~L~~w~~~~~D~~~~~~LsG~ 155 (318) -.+..||-.=+|+... .++.|+..|.++. -+. |++.++. .-++-..|..-+++..|..+|..|..+ T Consensus 64 --a~dVaEGe~Iplskvt~~~~~t~~~~~kK~rK~tT----dEAIqlsGyg~aVgetd~qL~~~Iq~kIdnd~~~~lkta 137 (303) T protein:vir:10 64 --NGDVAEGDVIPLTKVTREQVDITELQFAKYRKSTS----AEAIQAHGYDLAINQTDNEMIKYVQKKFRAKFFETLKSA 137 (303) T ss_pred --cccccCCcccchhhheeeecceEEEEeeccccccc----HHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhhc Confidence 1245578777776655 6888999999882 233 4565544 667888899999999999999999877 Q ss_pred ccccccccceeecccccccccccccccCCCCCCceEeecCCccccccccccccCHHHHHHHHHHHHhcCCCCceeEeccc Q lcl|Aclame:pro 156 RGDFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGD 235 (318) Q Consensus 156 rg~~~n~~~~~p~~~~~~~~~~~~N~v~aPt~~r~~~~~~at~~~~i~a~D~~s~~~Id~a~~~a~~~~~pi~Pv~v~g~ 235 (318) ++-... +..=..+.+-|.+|.......-... .+ T Consensus 138 T~t~~~-----------------------------------------t~~t~~s~~glq~Al~~~~~kl~~~----~e-- 170 (303) T protein:vir:10 138 IENGKR-----------------------------------------TNKTKLSAENLQGALSKGRANLSVL----LD-- 170 (303) T ss_pred cccccc-----------------------------------------ccceeecHHHHHHHHHhhhhhcccc----cc-- Confidence 751100 0001345666777765443200000 11 Q ss_pred cccCCcceEEEEecHHHHHHHHhCcchHHHHHHHHHHhhccccccCCcccCCeEEEcCEEEEecCceeE---EEcCCCee Q lcl|Aclame:pro 236 ELHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPI---RFYQGQRF 312 (318) Q Consensus 236 ~~~~~~~~yV~~l~P~q~~dLr~d~~~~~w~~~qk~A~~r~~g~~nPlF~G~~gm~ngvii~e~~~~~i---Rf~ag~~v 312 (318) +..-+|+|++|.++.+++.++.. |- |..+ -+...-.| |.|- .| |+.-. +|- =--+.+|+ T Consensus 171 ----d~~~~V~FvNP~Daa~yl~~A~i--~~--~~t~-fG~n~L~n--fLG~-----~I-I~S~k-v~~G~~~~T~~~Ni 232 (303) T protein:vir:10 171 ----DEITPIAFVNPNDTAEYLANGFI--NS--TGAQ-FGVNLLTP--YVGV-----KI-VEFAD-VPQGEVWMTVAENL 232 (303) T ss_pred ----ccccEEEEEchHHHHHHhhcCCc--ch--hhhh-hhhhhhhh--hhcc-----eE-EEecc-CCCceEEEeeccce Confidence 12237999999999999998853 42 2111 12333444 5554 22 33321 221 12266666 Q ss_pred eeeeeC Q lcl|Aclame:pro 313 WYQRIT 318 (318) Q Consensus 313 ~~a~~~ 318 (318) .+|-+- T Consensus 233 ~~ay~~ 238 (303) T protein:vir:10 233 NVAYAN 238 (303) T ss_pred EEEEec Confidence 666555 No 77 >protein:vir:9574 Length: 300 # NCBI annotation: gp40 # Family: family:all:966 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862879;genbank:gi:32469471;genbank:GeneID:1461316 Probab=21.06 E-value=2.7 Score=18.25 Aligned_cols=228 Identities=8% Similarity=-0.051 Sum_probs=92.7 Q ss_pred CCcCCcc---chhHHHHHHHHHHhcccchHHHHhhhhhhhhhhhhcccccccCCCCCccEEEeeccCCCCCcEEEEEEee Q lcl|Aclame:pro 1 MTTVTSA---QANKLFQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMH 77 (318) Q Consensus 1 ~t~~~~~---~~~~~~a~~lft~~~~~~~~v~~ws~~l~~~~~~~~~~~~~~G~~~~s~I~~~~dL~k~~Gd~v~f~L~~ 77 (318) |-..+.. .-...++.-+...+...++..+ .+. . ....+..+++.... T Consensus 1 ma~~t~~~G~lip~~~~~~ii~~l~~~s~i~~-l~~-----------------------~------~~~~~~~~~~p~~~ 50 (300) T protein:vir:95 1 MSEAQLSKGNLFNPELVTKVINKVKGHSSIAK-LSP-----------------------Q------KPIPFNGQREFVFD 50 (300) T ss_pred CcccccCCcceechhhHHHHHHHHHhhhhhhh-hcc-----------------------e------eeccCCceEEEEEe Confidence 3222222 1111122222222222222110 000 0 00111122322211 Q ss_pred c-cccCCeecCceeecchhhhhheeeEEEEecccccccccchhhhh--hhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhc Q lcl|Aclame:pro 78 K-LSKRPTMGDERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQ--RTKFNLASSARTLLGTYFNDLQDQCAIVHLAG 154 (318) Q Consensus 78 ~-L~G~gv~Gd~~leGnee~L~~~sd~v~Idq~R~~V~~~g~ms~q--rs~~dlr~~ar~~L~~w~~~~~D~~~~~~LsG 154 (318) . ....+|-.++ +-.+...+|..-++..-....-+.+..++-++ -+..||-..-++.|..-++...|+.+|.-.-. T Consensus 51 ~~~~a~wv~Eg~--~~~~s~~~f~~v~l~~~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~l~~aia~~~d~~~l~G~~~ 128 (300) T protein:vir:95 51 FDSDIDIVAENG--KKTHGGVSLDPVTIVPLKVEYGARVSDEFLHASEEAKVDMLTDFVEGFSKKLARGLDIMSIHGINP 128 (300) T ss_pred cCcceEEeeCCc--ccccccccceeeEeeeEEEEEeehhhHHHhccCCCCHHHHHHHHHHHHHHHHHHHHHHhhhhcccC Confidence 1 1112232222 22245567766666666666665555444332 24588999999999999999999999732210 Q ss_pred cccccccccceeecccccccccccccccCCCCCCceEeecCCccccccccccccCHHHHHHHHHHHHhcCCCCceeEecc Q lcl|Aclame:pro 155 ARGDFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSG 234 (318) Q Consensus 155 ~rg~~~n~~~~~p~~~~~~~~~~~~N~v~aPt~~r~~~~~~at~~~~i~a~D~~s~~~Id~a~~~a~~~~~pi~Pv~v~g 234 (318) ..|.-. .+.. .....+......++++....+.|.++.......+.. T Consensus 129 ~~g~~~----------~~~~----------------~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~-------- 174 (300) T protein:vir:95 129 RTKQAS----------TIIG----------------DNCFDKKVTQTVPFKDTNPDESMEDAVGMIDGSERD-------- 174 (300) T ss_pred CCCCCc----------cccc----------------ccccccccceeecccccchHHHHHHHHHHhhhcCCC-------- Confidence 111000 0000 000011122223345566677777777766552211 Q ss_pred ccccCCcceEEEEecHHHHHHHHhCcchHHHHHHHHHHhhccccccCCccc-----CCeEEEcCEEEEecCceeE----- Q lcl|Aclame:pro 235 DELHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFK-----GECAMWRNILVRKYAGMPI----- 304 (318) Q Consensus 235 ~~~~~~~~~yV~~l~P~q~~dLr~d~~~~~w~~~qk~A~~r~~g~~nPlF~-----G~~gm~ngvii~e~~~~~i----- 304 (318) . =+++|||..+..|++=.+ +..+|||. |..+.+.|..++--+.+|- T Consensus 175 --~------~~~vmn~~~~~~L~~lkd----------------~~G~~i~~~~~~~~~~~~l~G~Pv~~s~~v~~~~~~~ 230 (300) T protein:vir:95 175 --I------TGAILDPIFTTALSKMKN----------------AEGGKLYPELAWGGVPDAINGLAVDKNRTVSYSQTDP 230 (300) T ss_pred --c------cEEEECHHHHHHHHHhhc----------------cCCCeeccCccccCCCceecceeeEEecCCCCCCCCC Confidence 0 257899999888874211 01133332 3344445544433222210 Q ss_pred --EEcCCCe-------------eeee--------------------eeC Q lcl|Aclame:pro 305 --RFYQGQR-------------FWYQ--------------------RIT 318 (318) Q Consensus 305 --Rf~ag~~-------------v~~a--------------------~~~ 318 (318) .+..|+= ++.. |.. T Consensus 231 ~~~~~~GDf~~~~~~~~~~~~~~~v~~~~~~d~~~~~~f~~~~v~~r~~ 279 (300) T protein:vir:95 231 KNTAIVGDFETMFKWGYAKEVPMEIIKYGDPDNSGRDLKGYNQIYIRCE 279 (300) T ss_pred ccEEEEeeccceEEEEEecccEEEEeeccCCCCcchhhhhcCcEEEEEE Confidence 0111110 0000 000 No 78 >protein:vir:9309 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803287;genbank:gi:29028597;genbank:GeneID:1258044 Probab=20.60 E-value=2.7 Score=18.18 Aligned_cols=243 Identities=7% Similarity=0.052 Sum_probs=102.8 Q ss_pred CCcCCccchhHHHHHHHHHHhcccchHH-------HHhhhhhhhhhhhhccc-ccccCCCCCccEEEeeccCCCCCcEEE Q lcl|Aclame:pro 1 MTTVTSAQANKLFQVALFTAANRNRSMV-------NILTEQQEAPKAVSPDK-KSTKQTSAGAPVVRITDLNKQAGDEVT 72 (318) Q Consensus 1 ~t~~~~~~~~~~~a~~lft~~~~~~~~v-------~~ws~~l~~~~~~~~~~-~~~~G~~~~s~I~~~~dL~k~~Gd~v~ 72 (318) |= ...+..+....|+....+-... .-+.+-+-.+.. ...+ .... +.+++..+-....-.|..++ T Consensus 1 ~~----~~~~~~~~~~~f~~~~~~~~~~~a~~~~~~~~~~~liP~~~-~~~ii~~~~---~~s~l~~l~~~~~~~~~~~~ 72 (324) T protein:vir:93 1 ME----QTQKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLLNDF-TTPILQEVM---ENSKIMQLGKYEPMEGTEKK 72 (324) T ss_pred Cc----hhHHHHHHHHHHHHhhhhhhhcccccccccCCCcceechhH-HHHHHHHHH---hhchhhhhcceeeccCCceE Confidence 11 1111123334444332211111 011111111111 1111 1111 22222221111122344455 Q ss_pred EEEeecc-ccCCeecCceeecchhhhhheeeEEEEecccccccccchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 73 FSIMHKL-SKRPTMGDERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVH 151 (318) Q Consensus 73 f~L~~~L-~G~gv~Gd~~leGnee~L~~~sd~v~Idq~R~~V~~~g~ms~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~ 151 (318) +...... ...+|..++.. .+..++|.+-++.+-....-+.+..++-++ +..||...-+..|++-+++..|+.+|. T Consensus 73 ip~~~~~~~a~~v~Eg~~~--~~~~~~f~~i~~~~~k~~~~~~iS~ell~d-s~~~l~~~i~~~l~~aia~~~d~a~l~- 148 (324) T protein:vir:93 73 FTFWADKPGAYWVGEGQKI--ETSKATWVNATMRAFKLGVILPVTKEFLNY-TYSQFFEEMKPMIAEAFYKKFDEAGIL- 148 (324) T ss_pred EEEEecCcceeeecCCccc--cccccceeEEEEEeEEEEEeehhhHHHHhc-chHHHHHHHHHHHHHHHHHHHHHHHhc- Confidence 5443221 11122211111 244567776666666666666665555443 567999999999999999999998852 Q ss_pred HhccccccccccceeecccccccccccccccCCCCCCceEeecCCccccccccccccCHHHHHHHHHHHHhcCCCCceeE Q lcl|Aclame:pro 152 LAGARGDFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVR 231 (318) Q Consensus 152 LsG~rg~~~n~~~~~p~~~~~~~~~~~~N~v~aPt~~r~~~~~~at~~~~i~a~D~~s~~~Id~a~~~a~~~~~pi~Pv~ 231 (318) |.-. +. ...++. +. ....+..+.+..+.+.|.++.......... T Consensus 149 --G~g~---~~----------~~~~~~-~~---------------~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~----- 192 (324) T protein:vir:93 149 --NQGN---NP----------FGKSIA-QS---------------IEKTNKVIKGDFTQDNIIDLEALLEDDELE----- 192 (324) T ss_pred --CCCC---CC----------cCcccc-cc---------------ccccceeccccccHHHHHHHHHhhhhccCC----- Confidence 2111 00 000000 00 001112233456677777776655442111 Q ss_pred eccccccCCcceEEEEecHHHHHHHHhCcchHHHHHHHHHHhhccccccCCccc-CCeEEEcCEEEEecCceeE---EEc Q lcl|Aclame:pro 232 LSGDELHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFK-GECAMWRNILVRKYAGMPI---RFY 307 (318) Q Consensus 232 v~g~~~~~~~~~yV~~l~P~q~~dLr~d~~~~~w~~~qk~A~~r~~g~~nPlF~-G~~gm~ngvii~e~~~~~i---Rf~ 307 (318) . =+++|||..+..|++=. . ...+|+|. |.-+.+.|+.+.-.+..++ ..+ T Consensus 193 ----~-------~~~v~n~~~~~~L~~l~--------------d--~~G~~~~~~~~~~~l~G~PVv~~~~~~~~~~~i~ 245 (324) T protein:vir:93 193 ----A-------NAFISKTQNRSLLRKIV--------------D--PETKERIYDRNSDSLDGLPVVNLKSSNLKRGELI 245 (324) T ss_pred ----C-------CEEEEcHHHHHHHHHhh--------------C--CCCCeeecCCCCCcccceeeEeecCCCCCcceEE Confidence 0 15789999888887521 1 12267776 4455666765554332221 111 Q ss_pred CCC------------eeeeeeeC Q lcl|Aclame:pro 308 QGQ------------RFWYQRIT 318 (318) Q Consensus 308 ag~------------~v~~a~~~ 318 (318) .|+ .|+..+-. T Consensus 246 ~gdfs~~~~~~~~~~~i~~~~~~ 268 (324) T protein:vir:93 246 TGDFDKLIYGIPQLIEYKIDETA 268 (324) T ss_pred EEecceEEEEEecCcEEEEeecc Confidence 122 11111110 Done!