Query lcl|NC_018846.1_cdsid_YP_006906877.1 [gene=D300_gp60] [protein=structural protein] [protein_id=YP_006906877.1] [location=37335..38549] Match_columns 404 No_of_seqs 67 out of 81 Neff 5.8 Searched_HMMs 1612 Date Thu Nov 7 13:36:22 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_60 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_60_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:819 Length: 404 # 100.0 5E-185 3E-188 1031.0 36.5 404 1-404 1-404 (404) 2 protein:vir:3298 Length: 404 # 100.0 5E-185 3E-188 1031.0 36.5 404 1-404 1-404 (404) 3 protein:vir:104439 Length: 404 100.0 5E-185 3E-188 1031.0 36.5 404 1-404 1-404 (404) 4 protein:vir:10123 Length: 404 100.0 5E-185 3E-188 1031.0 36.5 404 1-404 1-404 (404) 5 protein:vir:105610 Length: 430 100.0 8E-159 5E-162 887.2 32.4 400 1-404 1-425 (430) 6 protein:vir:93696 Length: 364 100.0 8E-145 5E-148 810.6 29.5 354 13-404 1-362 (364) 7 protein:vir:2770 Length: 318 # 100.0 1E-138 7E-142 776.7 29.3 318 1-318 1-318 (318) 8 protein:vir:80213 Length: 334 99.3 2.9E-12 1.8E-15 83.8 20.2 329 1-403 1-334 (334) 9 protein:vir:95875 Length: 401 99.2 1.8E-12 1.1E-15 85.0 16.3 352 1-404 1-401 (401) 10 protein:vir:8885 Length: 347 # 99.1 2.7E-11 1.7E-14 78.5 17.9 325 1-402 1-347 (347) 11 protein:vir:80180 Length: 381 99.0 1.1E-11 7.1E-15 80.6 13.6 316 1-404 1-344 (381) 12 protein:vir:3364 Length: 347 # 99.0 8.1E-11 5E-14 75.9 17.1 338 1-403 1-347 (347) 13 protein:vir:94576 Length: 347 99.0 3.9E-11 2.4E-14 77.7 15.2 328 1-402 1-347 (347) 14 protein:vir:3613 Length: 272 # 99.0 1.1E-10 7.1E-14 75.1 16.9 262 1-404 1-267 (272) 15 protein:vir:10450 Length: 344 98.9 1.7E-10 1E-13 74.2 16.4 333 1-401 1-344 (344) 16 protein:vir:1541 Length: 347 # 98.9 2.8E-10 1.7E-13 73.0 17.4 334 1-403 2-347 (347) 17 protein:vir:2201 Length: 345 # 98.9 5.6E-10 3.5E-13 71.3 18.6 330 1-403 1-345 (345) 18 protein:vir:78739 Length: 332 98.9 2.7E-10 1.7E-13 73.0 16.1 318 1-401 1-332 (332) 19 protein:vir:96123 Length: 274 98.9 1.8E-10 1.1E-13 74.0 14.8 260 1-404 1-273 (274) 20 protein:vir:6324 Length: 335 # 98.9 3.3E-09 2.1E-12 67.1 20.9 327 1-404 1-330 (335) 21 protein:vir:94711 Length: 347 98.8 1E-09 6.5E-13 69.8 16.4 331 1-403 1-347 (347) 22 protein:vir:739 Length: 231 # 98.8 2.3E-10 1.4E-13 73.4 12.2 221 62-404 1-226 (231) 23 protein:vir:78935 Length: 335 98.8 7.4E-09 4.6E-12 65.2 20.3 327 1-404 1-330 (335) 24 protein:vir:93742 Length: 274 98.8 7E-10 4.3E-13 70.8 14.0 260 1-404 1-272 (274) 25 protein:vir:97433 Length: 274 98.7 2.4E-09 1.5E-12 67.9 16.6 260 1-404 1-272 (274) 26 protein:vir:94494 Length: 274 98.7 2.4E-09 1.5E-12 67.9 16.6 260 1-404 1-272 (274) 27 protein:vir:95107 Length: 270 98.7 9.4E-10 5.8E-13 70.1 14.1 255 1-404 1-260 (270) 28 protein:vir:105334 Length: 276 98.7 5.3E-10 3.3E-13 71.5 12.4 260 1-404 1-265 (276) 29 protein:vir:94622 Length: 341 98.7 2.1E-08 1.3E-11 62.6 21.0 309 1-404 4-340 (341) 30 protein:vir:95898 Length: 274 98.7 1E-09 6.5E-13 69.8 13.7 262 1-404 1-265 (274) 31 protein:vir:96262 Length: 274 98.7 1E-09 6.5E-13 69.8 13.7 262 1-404 1-265 (274) 32 protein:vir:80930 Length: 278 98.7 2.9E-09 1.8E-12 67.4 15.6 268 1-404 1-272 (278) 33 protein:vir:7990 Length: 273 # 98.7 9.1E-09 5.6E-12 64.7 17.6 253 22-353 1-273 (273) 34 protein:vir:1239 Length: 274 # 98.7 3.6E-09 2.2E-12 66.9 15.3 262 1-404 1-265 (274) 35 protein:vir:99675 Length: 324 98.6 2.1E-08 1.3E-11 62.6 19.2 289 56-404 1-299 (324) 36 protein:vir:9820 Length: 272 # 98.6 5.8E-09 3.6E-12 65.7 14.8 271 1-404 1-272 (272) 37 protein:vir:3033 Length: 272 # 98.6 5.8E-09 3.6E-12 65.7 14.8 271 1-404 1-272 (272) 38 protein:vir:96833 Length: 275 98.5 3.4E-08 2.1E-11 61.5 16.9 268 1-395 3-275 (275) 39 protein:vir:97031 Length: 402 98.4 1.6E-07 9.8E-11 57.9 18.5 337 1-404 1-357 (402) 40 protein:vir:105822 Length: 273 98.3 3.5E-07 2.2E-10 56.0 17.9 255 22-353 1-273 (273) 41 protein:vir:102605 Length: 273 98.3 3.5E-07 2.2E-10 56.0 17.9 255 22-353 1-273 (273) 42 protein:vir:100057 Length: 375 98.3 7.2E-07 4.4E-10 54.3 18.7 330 1-404 1-373 (375) 43 protein:vir:103323 Length: 364 98.3 1.9E-07 1.2E-10 57.5 15.4 336 1-404 1-357 (364) 44 protein:vir:5974 Length: 324 # 98.2 1.9E-07 1.2E-10 57.5 14.5 296 17-404 1-324 (324) 45 protein:vir:102944 Length: 330 98.1 2E-07 1.2E-10 57.3 12.8 296 1-404 1-328 (330) 46 protein:vir:1583 Length: 351 # 98.1 1.5E-07 9.5E-11 58.0 11.7 296 1-404 1-330 (351) 47 protein:vir:79008 Length: 299 97.8 2E-05 1.3E-08 46.3 21.1 280 22-404 1-289 (299) 48 protein:vir:105645 Length: 400 97.7 2.4E-05 1.5E-08 45.9 19.2 334 1-404 1-366 (400) 49 protein:vir:7019 Length: 401 # 97.7 2.7E-05 1.6E-08 45.7 18.2 333 1-404 1-358 (401) 50 protein:vir:99075 Length: 392 97.6 3.8E-05 2.4E-08 44.8 17.7 298 22-404 1-308 (392) 51 protein:vir:80446 Length: 367 96.9 6.2E-05 3.9E-08 43.6 11.5 321 1-404 1-360 (367) 52 protein:vir:108303 Length: 418 96.7 0.00041 2.5E-07 39.2 16.7 279 20-404 1-292 (418) 53 protein:vir:102655 Length: 322 96.4 0.00063 3.9E-07 38.1 18.9 302 1-404 13-322 (322) 54 protein:vir:94800 Length: 319 95.5 0.002 1.3E-06 35.3 15.7 284 1-404 1-297 (319) 55 protein:vir:97331 Length: 319 95.5 0.002 1.3E-06 35.3 15.7 284 1-404 1-297 (319) 56 protein:vir:78387 Length: 349 95.1 0.0028 1.7E-06 34.6 12.8 307 1-404 1-340 (349) 57 protein:vir:78920 Length: 290 94.2 0.005 3.1E-06 33.2 15.5 276 22-404 1-283 (290) 58 protein:vir:94989 Length: 349 92.2 0.012 7.7E-06 31.0 13.1 307 1-404 1-340 (349) 59 protein:vir:1781 Length: 221 # 91.3 0.017 1E-05 30.3 12.2 211 106-387 1-221 (221) 60 protein:vir:3525 Length: 423 # 90.6 0.02 1.2E-05 29.9 18.2 289 22-404 1-313 (423) 61 protein:vir:107120 Length: 329 90.6 0.02 1.2E-05 29.9 17.3 281 1-404 12-309 (329) 62 protein:vir:105374 Length: 423 85.2 0.054 3.4E-05 27.5 17.7 294 22-404 1-313 (423) 63 protein:vir:102335 Length: 312 84.0 0.064 3.9E-05 27.1 19.8 280 22-404 1-300 (312) 64 protein:vir:105464 Length: 346 73.8 0.17 0.0001 24.9 16.8 280 22-404 1-290 (346) 65 protein:vir:95131 Length: 325 73.2 0.17 0.00011 24.8 13.4 291 15-404 1-301 (325) 66 protein:vir:174 Length: 423 # 61.7 0.35 0.00022 23.1 18.5 295 22-404 1-313 (423) 67 protein:vir:8102 Length: 543 # 56.5 0.46 0.00028 22.5 12.5 304 1-402 212-543 (543) 68 protein:vir:105522 Length: 423 52.5 0.55 0.00034 22.0 17.9 290 22-404 1-313 (423) 69 protein:vir:191 Length: 385 # 51.9 0.57 0.00035 21.9 14.0 291 1-402 68-385 (385) 70 protein:vir:1886 Length: 385 # 51.9 0.57 0.00035 21.9 14.0 291 1-402 68-385 (385) 71 protein:vir:9759 Length: 303 # 49.1 0.65 0.0004 21.6 15.5 292 15-404 1-303 (303) 72 protein:vir:94771 Length: 298 40.4 0.97 0.0006 20.7 16.0 285 1-404 1-294 (298) 73 protein:vir:3136 Length: 322 # 33.2 1.4 0.00085 19.8 13.6 295 19-404 1-320 (322) 74 protein:vir:9574 Length: 300 # 28.1 1.8 0.0011 19.2 14.3 286 1-404 1-295 (300) 75 protein:vir:102119 Length: 404 26.6 1.9 0.0012 19.0 13.9 298 1-404 84-403 (404) No 1 >protein:vir:819 Length: 404 # NCBI annotation: hypothetical protein # Family: family:all:974 # MgeID: mge:16 # MgeName: VT2-Sa # Cross-refs: genbank:acc:NP_050552;genbank:gi:9633449;genbank:GeneID:1262254 Probab=100.00 E-value=5e-185 Score=1031.01 Aligned_cols=404 Identities=100% Similarity=1.433 Sum_probs=402.0 Q ss_pred CCcccchHHHHHHHHHHHHHhhcCchhHHHHHhhhhhhhhhhcccccccCCCCCccEEEEeecCCCCCcEEEEEEeeccc Q lcl|NC_018846. 1 MTTVTSAQANKLYQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLS 80 (404) Q Consensus 1 ~~~~~~~~a~~~~~~~lft~~~~n~~~~~~~~~~l~~~~~k~s~~~~~~Gt~~~~~I~~~~dL~k~~Gd~v~f~L~~~L~ 80 (404) ||+||+|+||++|+|||||++++|+|++|+||++++.++++++++++++|+++++|||+++||+|++||+|+|+|++||+ T Consensus 1 ~~~~~~~~a~~~~~~~lft~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~I~~~~dL~K~aGd~vtf~L~~~L~ 80 (404) T protein:vir:81 1 MTTVTSAQANKLYQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLS 80 (404) T ss_pred CCCcCCcchhhhHHHHHHHHHhcCChhHhhhhhhhhhhhhhccchhhccCCCCCccEEEeecCCCCCCcEEEEeEeeecc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cCceecCceeecchhhhhhceeEEEEeeccceeccCChhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccc Q lcl|NC_018846. 81 KRPTMGDERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGDFV 160 (404) Q Consensus 81 G~gV~Gd~~leGnee~L~~~s~~v~Idq~R~aV~~~g~m~~qrs~~dlrk~ar~~L~~w~~~~~D~~~~~~laG~~g~~~ 160 (404) |+||+||++||||||+|+|++|+|+|||.||+|+++|+|+|||++||||++||++|++||++++||++|+||||+||+++ T Consensus 81 g~gv~Gd~~lEGnee~L~~~s~~i~Idq~r~~V~~~g~msqQRt~~dlr~~ar~~L~~w~~~~~d~~~~~~laG~rg~~~ 160 (404) T protein:vir:81 81 KRPTMGDERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGDFV 160 (404) T ss_pred cCCcccCceeeccccceeEEeeEEEEeeecccccccCchhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccccccccccccccccccccCCCCCCcEEecCCccchhhhhhhccccHHHHHHHHHHHHHhCCCCccEEeecccccCc Q lcl|NC_018846. 161 ADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHGE 240 (404) Q Consensus 161 n~~~~~p~~~~~~~~~~~~N~v~apt~~r~~~~~~at~~~~i~~~D~~s~~~Id~~~~~a~~~a~pi~Pv~~~g~~~~~~ 240 (404) |++|++|+.+|+.|+++++|+|+|||+|||||+|++|++++|+++|+||+++||+++++++++++||+||+++|++++++ T Consensus 161 n~~~~vp~~~~~~~~~~~~N~v~APt~~r~~~~g~at~~~~l~stD~~s~~~Id~~~~~~~~~~~pi~Pv~~~g~~~~~~ 240 (404) T protein:vir:81 161 ADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHGE 240 (404) T ss_pred cccceeeccccccccceeecccCCCCCCcEEeccCccchhhhhhcccccHHHHHHHHHHHHHhCCCCcceEeccccccCc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cceEEEEecHHHHHHHhcCcchHHHHHHHHHHhhcccccCCcccccCceEEcCEEEEecCCceeeeccccceeecccccc Q lcl|NC_018846. 241 DPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKVLVSENNLT 320 (404) Q Consensus 241 ~~~yV~~l~p~q~~~Lr~d~~~~~w~~~q~~A~~~~rg~~nPlF~G~~gm~ngvii~~~~~~~irf~~~~~~~~~~~~~~ 320 (404) +|+|||||||+|+++||+||++++|+|+|++|.+|+||++||||+|+|||||||+|||||++|||||+++++.+|+|+++ T Consensus 241 ~~~yV~~~~p~q~~~Lr~dt~~~~w~d~q~~A~a~~rg~~nPlF~G~~gm~ngvii~~~~~~~Irf~~g~~~~~~~n~~~ 320 (404) T protein:vir:81 241 DPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKVLVSENNLT 320 (404) T ss_pred cceEEEEechHHHHHHhhCCCcHHHHHHHHHHhhccccccCCceecCeeEEcCEEEEecCCceeeecccceeeecCCccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccccccccccchhheeecCceeEEEeecCCCCCceeeeccccccchHHHHHHHHhhhhhccccCCCCCceEEEEEEEec Q lcl|NC_018846. 321 ATTKEVAAATNIDRAMLLGAQALANAYGQKAGGHFNMVEKKTDMDNRTEIAISWINGLKKIRFPEKSGKMQDHGVIAVDT 400 (404) Q Consensus 321 ~~~~~~a~~~~v~ralllGaqAl~~A~g~~~g~r~~w~Ee~~D~g~~~~i~i~~i~G~~K~rF~~~~g~~~DfGvi~idt 400 (404) +++++++++++|+|+||||||||++|||+++|+||+|+||.+||||++||++++|+|+||+||++++|++||||||+||| T Consensus 321 a~~~~~aa~~~v~RallLGaQAl~~A~g~~~g~~~~w~Ee~~D~g~~~~i~~~~i~G~kK~rF~~~~g~~~DfGvi~idt 400 (404) T protein:vir:81 321 ATTKEVAAATNIDRAMLLGAQALANAYGQKAGGHFNMVEKKTDMDNRTEIAISWINGLKKIRFPEKSGKMQDHGVIAVDT 400 (404) T ss_pred cccccccccccchhheeecceeEEEEeeccCCCCceeEeeccccCchhhhhhHHHhhhhhccccCCCCceeeEEEEEecc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eecC Q lcl|NC_018846. 401 AVKL 404 (404) Q Consensus 401 aa~~ 404 (404) |||| T Consensus 401 a~~~ 404 (404) T protein:vir:81 401 AVKL 404 (404) T ss_pred cccC Confidence 9999 No 2 >protein:vir:3298 Length: 404 # NCBI annotation: hypothetical protein # Family: family:all:974 # MgeID: mge:66 # MgeName: 933W # Cross-refs: genbank:acc:NP_049514;genbank:gi:9632520;genbank:GeneID:1262006 Probab=100.00 E-value=5e-185 Score=1031.01 Aligned_cols=404 Identities=100% Similarity=1.433 Sum_probs=402.0 Q ss_pred CCcccchHHHHHHHHHHHHHhhcCchhHHHHHhhhhhhhhhhcccccccCCCCCccEEEEeecCCCCCcEEEEEEeeccc Q lcl|NC_018846. 1 MTTVTSAQANKLYQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLS 80 (404) Q Consensus 1 ~~~~~~~~a~~~~~~~lft~~~~n~~~~~~~~~~l~~~~~k~s~~~~~~Gt~~~~~I~~~~dL~k~~Gd~v~f~L~~~L~ 80 (404) ||+||+|+||++|+|||||++++|+|++|+||++++.++++++++++++|+++++|||+++||+|++||+|+|+|++||+ T Consensus 1 ~~~~~~~~a~~~~~~~lft~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~I~~~~dL~K~aGd~vtf~L~~~L~ 80 (404) T protein:vir:32 1 MTTVTSAQANKLYQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLS 80 (404) T ss_pred CCCcCCcchhhhHHHHHHHHHhcCChhHhhhhhhhhhhhhhccchhhccCCCCCccEEEeecCCCCCCcEEEEeEeeecc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cCceecCceeecchhhhhhceeEEEEeeccceeccCChhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccc Q lcl|NC_018846. 81 KRPTMGDERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGDFV 160 (404) Q Consensus 81 G~gV~Gd~~leGnee~L~~~s~~v~Idq~R~aV~~~g~m~~qrs~~dlrk~ar~~L~~w~~~~~D~~~~~~laG~~g~~~ 160 (404) |+||+||++||||||+|+|++|+|+|||.||+|+++|+|+|||++||||++||++|++||++++||++|+||||+||+++ T Consensus 81 g~gv~Gd~~lEGnee~L~~~s~~i~Idq~r~~V~~~g~msqQRt~~dlr~~ar~~L~~w~~~~~d~~~~~~laG~rg~~~ 160 (404) T protein:vir:32 81 KRPTMGDERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGDFV 160 (404) T ss_pred cCCcccCceeeccccceeEEeeEEEEeeecccccccCchhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccccccccccccccccccccCCCCCCcEEecCCccchhhhhhhccccHHHHHHHHHHHHHhCCCCccEEeecccccCc Q lcl|NC_018846. 161 ADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHGE 240 (404) Q Consensus 161 n~~~~~p~~~~~~~~~~~~N~v~apt~~r~~~~~~at~~~~i~~~D~~s~~~Id~~~~~a~~~a~pi~Pv~~~g~~~~~~ 240 (404) |++|++|+.+|+.|+++++|+|+|||+|||||+|++|++++|+++|+||+++||+++++++++++||+||+++|++++++ T Consensus 161 n~~~~vp~~~~~~~~~~~~N~v~APt~~r~~~~g~at~~~~l~stD~~s~~~Id~~~~~~~~~~~pi~Pv~~~g~~~~~~ 240 (404) T protein:vir:32 161 ADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHGE 240 (404) T ss_pred cccceeeccccccccceeecccCCCCCCcEEeccCccchhhhhhcccccHHHHHHHHHHHHHhCCCCcceEeccccccCc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cceEEEEecHHHHHHHhcCcchHHHHHHHHHHhhcccccCCcccccCceEEcCEEEEecCCceeeeccccceeecccccc Q lcl|NC_018846. 241 DPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKVLVSENNLT 320 (404) Q Consensus 241 ~~~yV~~l~p~q~~~Lr~d~~~~~w~~~q~~A~~~~rg~~nPlF~G~~gm~ngvii~~~~~~~irf~~~~~~~~~~~~~~ 320 (404) +|+|||||||+|+++||+||++++|+|+|++|.+|+||++||||+|+|||||||+|||||++|||||+++++.+|+|+++ T Consensus 241 ~~~yV~~~~p~q~~~Lr~dt~~~~w~d~q~~A~a~~rg~~nPlF~G~~gm~ngvii~~~~~~~Irf~~g~~~~~~~n~~~ 320 (404) T protein:vir:32 241 DPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKVLVSENNLT 320 (404) T ss_pred cceEEEEechHHHHHHhhCCCcHHHHHHHHHHhhccccccCCceecCeeEEcCEEEEecCCceeeecccceeeecCCccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccccccccccchhheeecCceeEEEeecCCCCCceeeeccccccchHHHHHHHHhhhhhccccCCCCCceEEEEEEEec Q lcl|NC_018846. 321 ATTKEVAAATNIDRAMLLGAQALANAYGQKAGGHFNMVEKKTDMDNRTEIAISWINGLKKIRFPEKSGKMQDHGVIAVDT 400 (404) Q Consensus 321 ~~~~~~a~~~~v~ralllGaqAl~~A~g~~~g~r~~w~Ee~~D~g~~~~i~i~~i~G~~K~rF~~~~g~~~DfGvi~idt 400 (404) +++++++++++|+|+||||||||++|||+++|+||+|+||.+||||++||++++|+|+||+||++++|++||||||+||| T Consensus 321 a~~~~~aa~~~v~RallLGaQAl~~A~g~~~g~~~~w~Ee~~D~g~~~~i~~~~i~G~kK~rF~~~~g~~~DfGvi~idt 400 (404) T protein:vir:32 321 ATTKEVAAATNIDRAMLLGAQALANAYGQKAGGHFNMVEKKTDMDNRTEIAISWINGLKKIRFPEKSGKMQDHGVIAVDT 400 (404) T ss_pred cccccccccccchhheeecceeEEEEeeccCCCCceeEeeccccCchhhhhhHHHhhhhhccccCCCCceeeEEEEEecc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eecC Q lcl|NC_018846. 401 AVKL 404 (404) Q Consensus 401 aa~~ 404 (404) |||| T Consensus 401 a~~~ 404 (404) T protein:vir:32 401 AVKL 404 (404) T ss_pred cccC Confidence 9999 No 3 >protein:vir:104439 Length: 404 # NCBI annotation: putative virion structural protein # Family: family:all:974 # MgeID: mge:1471 # MgeName: 86 # Cross-refs: genbank:acc:YP_794063;genbank:gi:116222008;genbank:GeneID:4397504 Probab=100.00 E-value=5e-185 Score=1031.01 Aligned_cols=404 Identities=100% Similarity=1.433 Sum_probs=402.0 Q ss_pred CCcccchHHHHHHHHHHHHHhhcCchhHHHHHhhhhhhhhhhcccccccCCCCCccEEEEeecCCCCCcEEEEEEeeccc Q lcl|NC_018846. 1 MTTVTSAQANKLYQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLS 80 (404) Q Consensus 1 ~~~~~~~~a~~~~~~~lft~~~~n~~~~~~~~~~l~~~~~k~s~~~~~~Gt~~~~~I~~~~dL~k~~Gd~v~f~L~~~L~ 80 (404) ||+||+|+||++|+|||||++++|+|++|+||++++.++++++++++++|+++++|||+++||+|++||+|+|+|++||+ T Consensus 1 ~~~~~~~~a~~~~~~~lft~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~I~~~~dL~K~aGd~vtf~L~~~L~ 80 (404) T protein:vir:10 1 MTTVTSAQANKLYQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLS 80 (404) T ss_pred CCCcCCcchhhhHHHHHHHHHhcCChhHhhhhhhhhhhhhhccchhhccCCCCCccEEEeecCCCCCCcEEEEeEeeecc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cCceecCceeecchhhhhhceeEEEEeeccceeccCChhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccc Q lcl|NC_018846. 81 KRPTMGDERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGDFV 160 (404) Q Consensus 81 G~gV~Gd~~leGnee~L~~~s~~v~Idq~R~aV~~~g~m~~qrs~~dlrk~ar~~L~~w~~~~~D~~~~~~laG~~g~~~ 160 (404) |+||+||++||||||+|+|++|+|+|||.||+|+++|+|+|||++||||++||++|++||++++||++|+||||+||+++ T Consensus 81 g~gv~Gd~~lEGnee~L~~~s~~i~Idq~r~~V~~~g~msqQRt~~dlr~~ar~~L~~w~~~~~d~~~~~~laG~rg~~~ 160 (404) T protein:vir:10 81 KRPTMGDERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGDFV 160 (404) T ss_pred cCCcccCceeeccccceeEEeeEEEEeeecccccccCchhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccccccccccccccccccccCCCCCCcEEecCCccchhhhhhhccccHHHHHHHHHHHHHhCCCCccEEeecccccCc Q lcl|NC_018846. 161 ADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHGE 240 (404) Q Consensus 161 n~~~~~p~~~~~~~~~~~~N~v~apt~~r~~~~~~at~~~~i~~~D~~s~~~Id~~~~~a~~~a~pi~Pv~~~g~~~~~~ 240 (404) |++|++|+.+|+.|+++++|+|+|||+|||||+|++|++++|+++|+||+++||+++++++++++||+||+++|++++++ T Consensus 161 n~~~~vp~~~~~~~~~~~~N~v~APt~~r~~~~g~at~~~~l~stD~~s~~~Id~~~~~~~~~~~pi~Pv~~~g~~~~~~ 240 (404) T protein:vir:10 161 ADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHGE 240 (404) T ss_pred cccceeeccccccccceeecccCCCCCCcEEeccCccchhhhhhcccccHHHHHHHHHHHHHhCCCCcceEeccccccCc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cceEEEEecHHHHHHHhcCcchHHHHHHHHHHhhcccccCCcccccCceEEcCEEEEecCCceeeeccccceeecccccc Q lcl|NC_018846. 241 DPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKVLVSENNLT 320 (404) Q Consensus 241 ~~~yV~~l~p~q~~~Lr~d~~~~~w~~~q~~A~~~~rg~~nPlF~G~~gm~ngvii~~~~~~~irf~~~~~~~~~~~~~~ 320 (404) +|+|||||||+|+++||+||++++|+|+|++|.+|+||++||||+|+|||||||+|||||++|||||+++++.+|+|+++ T Consensus 241 ~~~yV~~~~p~q~~~Lr~dt~~~~w~d~q~~A~a~~rg~~nPlF~G~~gm~ngvii~~~~~~~Irf~~g~~~~~~~n~~~ 320 (404) T protein:vir:10 241 DPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKVLVSENNLT 320 (404) T ss_pred cceEEEEechHHHHHHhhCCCcHHHHHHHHHHhhccccccCCceecCeeEEcCEEEEecCCceeeecccceeeecCCccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccccccccccchhheeecCceeEEEeecCCCCCceeeeccccccchHHHHHHHHhhhhhccccCCCCCceEEEEEEEec Q lcl|NC_018846. 321 ATTKEVAAATNIDRAMLLGAQALANAYGQKAGGHFNMVEKKTDMDNRTEIAISWINGLKKIRFPEKSGKMQDHGVIAVDT 400 (404) Q Consensus 321 ~~~~~~a~~~~v~ralllGaqAl~~A~g~~~g~r~~w~Ee~~D~g~~~~i~i~~i~G~~K~rF~~~~g~~~DfGvi~idt 400 (404) +++++++++++|+|+||||||||++|||+++|+||+|+||.+||||++||++++|+|+||+||++++|++||||||+||| T Consensus 321 a~~~~~aa~~~v~RallLGaQAl~~A~g~~~g~~~~w~Ee~~D~g~~~~i~~~~i~G~kK~rF~~~~g~~~DfGvi~idt 400 (404) T protein:vir:10 321 ATTKEVAAATNIDRAMLLGAQALANAYGQKAGGHFNMVEKKTDMDNRTEIAISWINGLKKIRFPEKSGKMQDHGVIAVDT 400 (404) T ss_pred cccccccccccchhheeecceeEEEEeeccCCCCceeEeeccccCchhhhhhHHHhhhhhccccCCCCceeeEEEEEecc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eecC Q lcl|NC_018846. 401 AVKL 404 (404) Q Consensus 401 aa~~ 404 (404) |||| T Consensus 401 a~~~ 404 (404) T protein:vir:10 401 AVKL 404 (404) T ss_pred cccC Confidence 9999 No 4 >protein:vir:10123 Length: 404 # NCBI annotation: hypothetical protein # Family: family:all:974 # MgeID: mge:180 # MgeName: Stx2 converting bacteriophage II # Cross-refs: genbank:acc:NP_859253;genbank:gi:32171009;genbank:GeneID:2653345 Probab=100.00 E-value=5e-185 Score=1031.01 Aligned_cols=404 Identities=100% Similarity=1.433 Sum_probs=402.0 Q ss_pred CCcccchHHHHHHHHHHHHHhhcCchhHHHHHhhhhhhhhhhcccccccCCCCCccEEEEeecCCCCCcEEEEEEeeccc Q lcl|NC_018846. 1 MTTVTSAQANKLYQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLS 80 (404) Q Consensus 1 ~~~~~~~~a~~~~~~~lft~~~~n~~~~~~~~~~l~~~~~k~s~~~~~~Gt~~~~~I~~~~dL~k~~Gd~v~f~L~~~L~ 80 (404) ||+||+|+||++|+|||||++++|+|++|+||++++.++++++++++++|+++++|||+++||+|++||+|+|+|++||+ T Consensus 1 ~~~~~~~~a~~~~~~~lft~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~I~~~~dL~K~aGd~vtf~L~~~L~ 80 (404) T protein:vir:10 1 MTTVTSAQANKLYQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLS 80 (404) T ss_pred CCCcCCcchhhhHHHHHHHHHhcCChhHhhhhhhhhhhhhhccchhhccCCCCCccEEEeecCCCCCCcEEEEeEeeecc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cCceecCceeecchhhhhhceeEEEEeeccceeccCChhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccc Q lcl|NC_018846. 81 KRPTMGDERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGDFV 160 (404) Q Consensus 81 G~gV~Gd~~leGnee~L~~~s~~v~Idq~R~aV~~~g~m~~qrs~~dlrk~ar~~L~~w~~~~~D~~~~~~laG~~g~~~ 160 (404) |+||+||++||||||+|+|++|+|+|||.||+|+++|+|+|||++||||++||++|++||++++||++|+||||+||+++ T Consensus 81 g~gv~Gd~~lEGnee~L~~~s~~i~Idq~r~~V~~~g~msqQRt~~dlr~~ar~~L~~w~~~~~d~~~~~~laG~rg~~~ 160 (404) T protein:vir:10 81 KRPTMGDERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGDFV 160 (404) T ss_pred cCCcccCceeeccccceeEEeeEEEEeeecccccccCchhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccccccccccccccccccccCCCCCCcEEecCCccchhhhhhhccccHHHHHHHHHHHHHhCCCCccEEeecccccCc Q lcl|NC_018846. 161 ADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHGE 240 (404) Q Consensus 161 n~~~~~p~~~~~~~~~~~~N~v~apt~~r~~~~~~at~~~~i~~~D~~s~~~Id~~~~~a~~~a~pi~Pv~~~g~~~~~~ 240 (404) |++|++|+.+|+.|+++++|+|+|||+|||||+|++|++++|+++|+||+++||+++++++++++||+||+++|++++++ T Consensus 161 n~~~~vp~~~~~~~~~~~~N~v~APt~~r~~~~g~at~~~~l~stD~~s~~~Id~~~~~~~~~~~pi~Pv~~~g~~~~~~ 240 (404) T protein:vir:10 161 ADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHGE 240 (404) T ss_pred cccceeeccccccccceeecccCCCCCCcEEeccCccchhhhhhcccccHHHHHHHHHHHHHhCCCCcceEeccccccCc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cceEEEEecHHHHHHHhcCcchHHHHHHHHHHhhcccccCCcccccCceEEcCEEEEecCCceeeeccccceeecccccc Q lcl|NC_018846. 241 DPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKVLVSENNLT 320 (404) Q Consensus 241 ~~~yV~~l~p~q~~~Lr~d~~~~~w~~~q~~A~~~~rg~~nPlF~G~~gm~ngvii~~~~~~~irf~~~~~~~~~~~~~~ 320 (404) +|+|||||||+|+++||+||++++|+|+|++|.+|+||++||||+|+|||||||+|||||++|||||+++++.+|+|+++ T Consensus 241 ~~~yV~~~~p~q~~~Lr~dt~~~~w~d~q~~A~a~~rg~~nPlF~G~~gm~ngvii~~~~~~~Irf~~g~~~~~~~n~~~ 320 (404) T protein:vir:10 241 DPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKVLVSENNLT 320 (404) T ss_pred cceEEEEechHHHHHHhhCCCcHHHHHHHHHHhhccccccCCceecCeeEEcCEEEEecCCceeeecccceeeecCCccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccccccccccchhheeecCceeEEEeecCCCCCceeeeccccccchHHHHHHHHhhhhhccccCCCCCceEEEEEEEec Q lcl|NC_018846. 321 ATTKEVAAATNIDRAMLLGAQALANAYGQKAGGHFNMVEKKTDMDNRTEIAISWINGLKKIRFPEKSGKMQDHGVIAVDT 400 (404) Q Consensus 321 ~~~~~~a~~~~v~ralllGaqAl~~A~g~~~g~r~~w~Ee~~D~g~~~~i~i~~i~G~~K~rF~~~~g~~~DfGvi~idt 400 (404) +++++++++++|+|+||||||||++|||+++|+||+|+||.+||||++||++++|+|+||+||++++|++||||||+||| T Consensus 321 a~~~~~aa~~~v~RallLGaQAl~~A~g~~~g~~~~w~Ee~~D~g~~~~i~~~~i~G~kK~rF~~~~g~~~DfGvi~idt 400 (404) T protein:vir:10 321 ATTKEVAAATNIDRAMLLGAQALANAYGQKAGGHFNMVEKKTDMDNRTEIAISWINGLKKIRFPEKSGKMQDHGVIAVDT 400 (404) T ss_pred cccccccccccchhheeecceeEEEEeeccCCCCceeEeeccccCchhhhhhHHHhhhhhccccCCCCceeeEEEEEecc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eecC Q lcl|NC_018846. 401 AVKL 404 (404) Q Consensus 401 aa~~ 404 (404) |||| T Consensus 401 a~~~ 404 (404) T protein:vir:10 401 AVKL 404 (404) T ss_pred cccC Confidence 9999 No 5 >protein:vir:105610 Length: 430 # NCBI annotation: virion structural protein # Family: family:all:974 # MgeID: mge:1540 # MgeName: F116 # Cross-refs: genbank:acc:YP_164307;genbank:gi:56692923;genbank:GeneID:3197221 Probab=100.00 E-value=8.5e-159 Score=887.19 Aligned_cols=400 Identities=33% Similarity=0.552 Sum_probs=370.6 Q ss_pred CC------cccchHHHHHHHHHHHHHhhcCchhHHHHHhhhhhhhhhhcccccccCCCCCccEEEEeecCCCCCcEEEEE Q lcl|NC_018846. 1 MT------TVTSAQANKLYQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFS 74 (404) Q Consensus 1 ~~------~~~~~~a~~~~~~~lft~~~~n~~~~~~~~~~l~~~~~k~s~~~~~~Gt~~~~~I~~~~dL~k~~Gd~v~f~ 74 (404) || ++++|+|+++||++||+.+.+++++.+++.++.++...-. +.....+++.++|||+++||+|++||+|+|+ T Consensus 1 ~~~a~T~~~~~~p~a~~~ws~~l~~~~~k~~~~~~kl~G~~~~~~~~~-~~~~~~~ts~~~pI~r~~dL~K~~GD~Vtf~ 79 (430) T protein:vir:10 1 MTASKTTMRYGDPNAMIQQAAGLFALCQGRNSTLNRLTGKMPSGTSDA-EKKTKGQSSLELPIVQAQDLGRNKGDEVRFH 79 (430) T ss_pred CcceeeecccCChhHHHHHHHHHHHHHhhhhhhHHHhhccccccccch-hhhccCCCCCCccEEEeccCCCCCccEEEEe Confidence 66 4789999999999999999999999999999877766533 3446678899999999999999999999999 Q ss_pred EeeccccCceecCceeecchhhhhhceeEEEEeeccceeccCChhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_018846. 75 IMHKLSKRPTMGDERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAG 154 (404) Q Consensus 75 L~~~L~G~gV~Gd~~leGnee~L~~~s~~v~Idq~R~aV~~~g~m~~qrs~~dlrk~ar~~L~~w~~~~~D~~~~~~laG 154 (404) |++||+|+||+||++||||||+|+|++|+|+|||.||+|+++|+|+|||++||||++||++|++||++++||++|+|||| T Consensus 80 L~~~L~g~gv~Gd~~lEGnee~L~~~~d~l~IDq~R~~V~~gg~msqQRt~~dlR~~ar~~L~~w~~~~~Dq~~~v~laG 159 (430) T protein:vir:10 80 FVQPANAFPIMGSEYAEGKGTGLKIGSDQLRVNQARFPVDLGDVMSQIRNPYDLRRLGRPKAKWFMDAYLDQSMLVHLAG 159 (430) T ss_pred EeeccccCceecCceeeccccceEEEeeEEEEeeeccccccCCchhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHhh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhccccccccccccccccccccccccccCCCCCCcEEecCCcc-c-------hhhhhhhccccHHHHHHHHHHHHHhCCC Q lcl|NC_018846. 155 ARGDFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDAT-S-------FEQIEAADIFSIGLVDNLSLFIDEMAHP 226 (404) Q Consensus 155 ~~g~~~n~~~~~p~~~~~~~~~~~~N~v~apt~~r~~~~~~at-~-------~~~i~~~D~~s~~~Id~~~~~a~~~a~p 226 (404) +||++.|++|++|+.+|++|+.+++|+|+|||+||||++++.+ + +.+|+++|+||+++||+++++++++++| T Consensus 160 arg~~~~~~~~~~~~~~~~~~~~~~N~v~aPt~nrh~~~~G~at~~~~~~~~~~sl~stD~~s~~~id~a~~~a~~~~~~ 239 (430) T protein:vir:10 160 ARGNHYNKEWCLPLETHPKLADMLVNRVKAPTKNRHFVASADAITGVAPNAGEYNITTADVLDVDVVDSIATYMDQIELP 239 (430) T ss_pred hhcccccccccccccCCcchhhhhccccCCCCCceeEeecccccccccccccccchhhhcccCHHHHHHHHHHHHhhCCC Confidence 9999999999999999999999999999999999999977643 3 4579999999999999999999999999 Q ss_pred CccEEeecccccCccceEEEEecHHHHHHHhcCcchHHHHHHHHHHhhcccccCCcccccCceEEcCEEEEecCCceeee Q lcl|NC_018846. 227 LQPVRLSGDELHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRF 306 (404) Q Consensus 227 i~Pv~~~g~~~~~~~~~yV~~l~p~q~~~Lr~d~~~~~w~~~q~~A~~~~rg~~nPlF~G~~gm~ngvii~~~~~~~irf 306 (404) |+||+++|+++++++|+|||||||+|+++||+||++++| |+|+.+.+ ++|++||||+|+|||||||||||||+ +||| T Consensus 240 i~Pv~v~gd~~~g~~~~yV~~~~p~q~~~Lr~dt~~~~w-q~~~~a~a-~~g~~nPlF~G~~gm~ngvii~~~~~-virf 316 (430) T protein:vir:10 240 PPPVKFEGDEAAEDSPIRVLLCSPAQYNSFAKQEKFRSW-QAAALARA-SNAKQHPIFRVDAGLWSNTLIIKMPK-PIRF 316 (430) T ss_pred CcceEeecccccCCccEEEEEechHHHHHHhhCcchHHH-HHHHHHhh-cccccCCceecceeeecCeEEecCCc-eeee Confidence 999999999999999999999999999999999999999 77777655 47899999999999999999999995 6999 Q ss_pred ccccceeeccc--cccc----ccccccccccchhheeecCceeEEEeecC--CCCCceeeeccccccchHHHHHHHHhhh Q lcl|NC_018846. 307 YQGSKVLVSEN--NLTA----TTKEVAAATNIDRAMLLGAQALANAYGQK--AGGHFNMVEKKTDMDNRTEIAISWINGL 378 (404) Q Consensus 307 ~~~~~~~~~~~--~~~~----~~~~~a~~~~v~ralllGaqAl~~A~g~~--~g~r~~w~Ee~~D~g~~~~i~i~~i~G~ 378 (404) |+|+..++++. +... .+..++++++|+|+|||||||+++|||+. +|+||+|+||.+||||++||++++|+|+ T Consensus 317 ~~g~~~~~~a~~~~~~~~~~~~~a~~~~~~~v~RalllGaQA~~~A~g~~~~~g~~f~w~Ee~~D~g~~~~i~~~~i~G~ 396 (430) T protein:vir:10 317 YAGDTIKYCAAYNSEAESSAVVSDSFGNQYAVDRALLLGGQALAQAWAASEHSGMPFFWSEKDMDHGDKLELLIGAILGC 396 (430) T ss_pred cCCCccccccCCcccccccccccccccccccchhhhhccchhheeeeeccCCCCcceeeeeeccccCchhhhhhhHHhcc Confidence 99999888773 3333 33445667899999999999999999984 7899999999999999999999999999 Q ss_pred hhccccCCCCC---ceEEEEEEEeceecC Q lcl|NC_018846. 379 KKIRFPEKSGK---MQDHGVIAVDTAVKL 404 (404) Q Consensus 379 ~K~rF~~~~g~---~~DfGvi~idtaa~~ 404 (404) ||+||++++++ ++|||||+||||||| T Consensus 397 kK~rF~~~~~~~~~~~DfGvi~idtaa~~ 425 (430) T protein:vir:10 397 SKIRFAVEATNGLEYTDHGVMAIDTAVKI 425 (430) T ss_pred ceeeecCCCCCCceeeeeEEEEhhhhhhh Confidence 99999987764 689999999999999 No 6 >protein:vir:93696 Length: 364 # NCBI annotation: Bcep22gp55 # Family: family:all:974 # MgeID: mge:1470 # MgeName: Bcep22 # Cross-refs: genbank:acc:NP_944284;genbank:gi:38640361;genbank:GeneID:2658350 Probab=100.00 E-value=7.9e-145 Score=810.61 Aligned_cols=354 Identities=34% Similarity=0.476 Sum_probs=330.3 Q ss_pred HHHHHHHHhhcCch-hHHHHHhhhhhhhhhhccccc-ccCCCCCccEEEEeecCCCCCcEEEEEEeeccccCceecCcee Q lcl|NC_018846. 13 YQVALFTAANRNRS-MVNILTEQQEAPKAVSPDKKS-TKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLSKRPTMGDERV 90 (404) Q Consensus 13 ~~~~lft~~~~n~~-~~~~~~~~l~~~~~k~s~~~~-~~Gt~~~~~I~~~~dL~k~~Gd~v~f~L~~~L~G~gV~Gd~~l 90 (404) +| .|.++.|+| ++++|+++|+.+++++++|.+ ++|+++++|||+++||+|++||+|+|+|++||+|+||+||++| T Consensus 1 Ma---~T~~~~~~p~a~~~ws~~l~~~~~~~s~f~~~l~G~~~~~~I~~~~dL~k~~Gd~v~f~L~~~L~g~gv~Gd~~l 77 (364) T protein:vir:93 1 MS---QTVIPFGDPKAVKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDRITFDLSVHLRGKPTYGDARV 77 (364) T ss_pred Cc---eeccCcCCHHHHHHHHHHHHHHHHhhCccccccccCCCCCcEEEeeecCCCCCceEEeeeeeecccCCcccCcee Confidence 11 588889999 478899999999999999996 8899999999999999999999999999999999999999999 Q ss_pred ecchhhhhhceeEEEEeeccceeccCChhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccccccccc Q lcl|NC_018846. 91 EGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGDFVADDTILPTAE 170 (404) Q Consensus 91 eGnee~L~~~s~~v~Idq~R~aV~~~g~m~~qrs~~dlrk~ar~~L~~w~~~~~D~~~~~~laG~~g~~~n~~~~~p~~~ 170 (404) |||||+|+|++|+|+|||.||+|+++|+|+|||++||||++||++|++||++++|+++|+||||+||. .+|... T Consensus 78 eGnee~L~~~~~~i~idq~r~~V~~~g~ms~qRt~~dlr~~ar~~L~~w~~~~~d~~~f~~laGarg~------~~~~~~ 151 (364) T protein:vir:93 78 EGKEESLRFYQDEVRIDQVRHSVSAGGRMSRKRTVHNIRRIARDRLGDYFYKFTDELLFIYLSGARGI------NLDFIE 151 (364) T ss_pred eccccceeEEeeEEEEeeccccccccCchhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccc------cccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999983 256678 Q ss_pred ccccccccccccCCCCCCcEEecCCccchhhhhhhccccHHHHHHHHHHHHHh------CCCCccEEeecccccCccceE Q lcl|NC_018846. 171 HPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEM------AHPLQPVRLSGDELHGEDPYY 244 (404) Q Consensus 171 ~~~~~~~~~N~v~apt~~r~~~~~~at~~~~i~~~D~~s~~~Id~~~~~a~~~------a~pi~Pv~~~g~~~~~~~~~y 244 (404) ++.|..+++|+|+|||++|||+++++|++++|+++|+||+++||+++.+++++ ++||+||+++|++ +| T Consensus 152 ~~~~~~~~~N~v~aPt~~r~~~~~~at~~~~l~stD~~sl~~id~a~~~a~~~~~~~~~~~~~~Pv~~~g~~------~y 225 (364) T protein:vir:93 152 TPDFTGYAGNPLDAPDVDHLLYGGVATSKASLAATDIMAPLVIEKAVEKAAMMQAENPDVANMVPVSIDGDD------HY 225 (364) T ss_pred ccCcccccccccCCCCCCcEEeccccCchhhccccccccHHHHHHHHHHHHHhCCCCCCCcccceeEecCcc------ee Confidence 89999999999999999999999999999999999999999999999999998 4679999999987 79 Q ss_pred EEEecHHHHHHHhcCcchHHHHHHHHHHhhcccccCCcccccCceEEcCEEEEecCCceeeeccccceeecccccccccc Q lcl|NC_018846. 245 VLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKVLVSENNLTATTK 324 (404) Q Consensus 245 V~~l~p~q~~~Lr~d~~~~~w~~~q~~A~~~~rg~~nPlF~G~~gm~ngvii~~~~~~~irf~~~~~~~~~~~~~~~~~~ 324 (404) ||||||+|+++||++++ ++|+|+|++|. .++|++||||+|+|||||||+||||+++ |||+.... T Consensus 226 V~~l~p~q~~~Lr~~t~-~~w~d~qk~A~-~~~g~~nPlF~G~~gm~ngvii~~~~~v-i~~~~~~~------------- 289 (364) T protein:vir:93 226 VCVMSEYQATDMRTAAG-GTWIDFQKAAA-AAEGRNNPIFKGGLGMINNVVLHKHRNV-IRFNDYGA------------- 289 (364) T ss_pred EEEEcchhhhhhhhcCC-HHHHHHHHHhh-hcccccCCceecCeeeEcCeEEeccCCc-cccccccc------------- Confidence 99999999999999887 67999999984 4689999999999999999999999987 88854332 Q ss_pred cccccccchhheeecCceeEEEeecCCCCCceeeeccccccchHHHHHHHHhhhhhccccCCCCCceEEEEEEEeceecC Q lcl|NC_018846. 325 EVAAATNIDRAMLLGAQALANAYGQKAGGHFNMVEKKTDMDNRTEIAISWINGLKKIRFPEKSGKMQDHGVIAVDTAVKL 404 (404) Q Consensus 325 ~~a~~~~v~ralllGaqAl~~A~g~~~g~r~~w~Ee~~D~g~~~~i~i~~i~G~~K~rF~~~~g~~~DfGvi~idtaa~~ 404 (404) ++.++|+|+|||||||+++|||+++|+||+|+||.+||||+.||++++|+|+||+||++ +|||||+||||||+ T Consensus 290 --~~~v~~~ralllGaQA~~~a~g~~~g~~~~w~Ee~~D~gn~~~i~~~~i~G~kK~rF~~-----~DfGvi~idtaa~~ 362 (364) T protein:vir:93 290 --GANVEAARALFMGRQAGVIAYGTANGLRFDWEETVKDYGNEPAIAAGFIAGMKKARFNN-----KDFGVISIDTAAKK 362 (364) T ss_pred --CccccchhhheecceeeEEEeecCCCCCceeeecccCCCCchhhhhhhHhhhhhcccCC-----ccceEEEecccccc Confidence 23467899999999999999999999999999999999999999999999999999986 49999999999999 No 7 >protein:vir:2770 Length: 318 # NCBI annotation: hypothetical protein # Family: family:all:974 # MgeID: mge:59 # MgeName: Stx2 converting bacteriophage I # Cross-refs: genbank:acc:NP_612887;genbank:gi:20065804;genbank:GeneID:935710 Probab=100.00 E-value=1.2e-138 Score=776.73 Aligned_cols=318 Identities=97% Similarity=1.425 Sum_probs=313.3 Q ss_pred CCcccchHHHHHHHHHHHHHhhcCchhHHHHHhhhhhhhhhhcccccccCCCCCccEEEEeecCCCCCcEEEEEEeeccc Q lcl|NC_018846. 1 MTTVTSAQANKLYQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLS 80 (404) Q Consensus 1 ~~~~~~~~a~~~~~~~lft~~~~n~~~~~~~~~~l~~~~~k~s~~~~~~Gt~~~~~I~~~~dL~k~~Gd~v~f~L~~~L~ 80 (404) ||+++..++++.|+|||||++++|+|++|+|+++|+++++++++|.+++|+++++|||+++||+|++||+|+|+|++||+ T Consensus 1 mt~~~~~~~~~~~~~~~ft~~~~~~~~vk~ws~~l~~~~~~~~~~~~~~g~~~~~~I~r~~dL~K~~GD~Vtf~L~~~L~ 80 (318) T protein:vir:27 1 MTTVTSAQANKLFQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLS 80 (318) T ss_pred CCccCCCChHHHHHHHHHHHHhcCChHHHHHHHhhhhHHHhhhhhhcccCCCCCceEEEeccCCCCCccEEEEeEeeccc Confidence 99998888888899999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cCceecCceeecchhhhhhceeEEEEeeccceeccCChhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccc Q lcl|NC_018846. 81 KRPTMGDERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGDFV 160 (404) Q Consensus 81 G~gV~Gd~~leGnee~L~~~s~~v~Idq~R~aV~~~g~m~~qrs~~dlrk~ar~~L~~w~~~~~D~~~~~~laG~~g~~~ 160 (404) |+||+||+++|||||+|+|++|+|+|||.||+|+++|+|+|||++||||++||++|++||++++||++|+||||+||+|+ T Consensus 81 g~gv~Gd~~lEGnee~L~~~~d~l~IDq~r~~V~~gg~msqqRt~~dlR~~ar~~L~~w~~~~~Dq~~~v~laGarg~~~ 160 (318) T protein:vir:27 81 KRPTMGDERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGDFV 160 (318) T ss_pred cCccccCceeeccccceEEEeeEEEEeeeccccccccchhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccccccccccccccccccccCCCCCCcEEecCCccchhhhhhhccccHHHHHHHHHHHHHhCCCCccEEeecccccCc Q lcl|NC_018846. 161 ADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHGE 240 (404) Q Consensus 161 n~~~~~p~~~~~~~~~~~~N~v~apt~~r~~~~~~at~~~~i~~~D~~s~~~Id~~~~~a~~~a~pi~Pv~~~g~~~~~~ 240 (404) |++|++|+.+|++|+++++|+++|||+||||++|++|++++|+++|+||+++||++++++++|++||+||+++|++++++ T Consensus 161 n~~~~~p~~~~~~~~~~~~N~v~aPt~~r~~~~g~at~~~~l~stD~~s~~lid~~~~~~~~~a~pi~PV~v~g~~~~~~ 240 (318) T protein:vir:27 161 ADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHGE 240 (318) T ss_pred cccceEecccCccchhhhhcccCCCCCCcEEeccCccchhhhhhcccccHHHHHHHHHHHHHhCCCCcceeeccccccCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cceEEEEecHHHHHHHhcCcchHHHHHHHHHHhhcccccCCcccccCceEEcCEEEEecCCceeeeccccceeecccc Q lcl|NC_018846. 241 DPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKVLVSENN 318 (404) Q Consensus 241 ~~~yV~~l~p~q~~~Lr~d~~~~~w~~~q~~A~~~~rg~~nPlF~G~~gm~ngvii~~~~~~~irf~~~~~~~~~~~~ 318 (404) +|+|||||||+|+++||+|+++++|+++||+|++|++|++||||+|+|||||||||||||+||||||+|+++++++-. T Consensus 241 ~~~yV~~~~p~q~~~Lrtdt~~~~w~d~q~~A~~r~~g~knPLF~G~~gm~ngvil~~~~~vpIrf~~G~~v~~~~~~ 318 (318) T protein:vir:27 241 DPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGQRFWYQRIT 318 (318) T ss_pred cceEEEEechHHHHHHhhcCCCHHHHHHHHHHHhcccccCCCceecceeeecCEEEeecCCccEEEcCCCeeeeeecC Confidence 999999999999999999999999999999999999999999999999999999999999999999999999887765 No 8 >protein:vir:80213 Length: 334 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:1879 # MgeName: LKA1 # Cross-refs: genbank:acc:YP_001522884;genbank:gi:158345177;genbank:GeneID:5687476 Probab=99.26 E-value=2.9e-12 Score=83.85 Aligned_cols=329 Identities=14% Similarity=0.084 Sum_probs=181.5 Q ss_pred CCcccchHHHHHHHHHHHHHhhcCchhHHHHHhhhhhhhhhhcccccccCCCCCccEEEEeecCCCCCcEEEEEEeeccc Q lcl|NC_018846. 1 MTTVTSAQANKLYQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLS 80 (404) Q Consensus 1 ~~~~~~~~a~~~~~~~lft~~~~n~~~~~~~~~~l~~~~~k~s~~~~~~Gt~~~~~I~~~~dL~k~~Gd~v~f~L~~~L~ 80 (404) |++.-.. .-..++.+ ..-....-.+|.|++.+...-+.++.|.. -+++.++ ..|+++.|+-+...+ T Consensus 1 m~~~~~~-~~t~~~~~--~~~~~~~l~le~~~geV~~af~~~s~~~~---------~~~~r~i--~~G~s~~~~~iG~~~ 66 (334) T protein:vir:80 1 MTYPAAN-THTRPGWG--GANSDVSLHIEEHLGLVDASFMYSSKFAS---------WMNVRSL--RGTNQLRVDRVGAST 66 (334) T ss_pred CCCCcCC-Cccccccc--cccchheehhhhhhhHHHHHHHHhhhhhc---------cceeeec--cccceEEEeeeccee Confidence 8876221 11111111 00011111368999998777776666532 2223344 449999999888887 Q ss_pred cCceecCceeecchhhhhhceeEEEEeec---cceeccCChhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHh-hhh Q lcl|NC_018846. 81 KRPTMGDERVEGRGEDLSHADFSLKINQG---RHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLA-GAR 156 (404) Q Consensus 81 G~gV~Gd~~leGnee~L~~~s~~v~Idq~---R~aV~~~g~m~~qrs~~dlrk~ar~~L~~w~~~~~D~~~~~~la-G~~ 156 (404) -...+=++.+.+. .+.....+|.||+. ||.|+ .+++-...+|+|.+.-.....=+++..||.+|..|. |++ T Consensus 67 ~~~~~~g~~l~~~--~~~~~~~~l~ID~~l~~~~~Vd---diD~~q~~~D~rse~~~~~G~aLA~~~D~~~~~~l~kaa~ 141 (334) T protein:vir:80 67 IAGRKAGEELVVQ--KNVSDKLNLTVDTVLYARHFFD---KFDEWTSNLDVRKETAREDGIALARQYDQACIIQLQKCGD 141 (334) T ss_pred eeeecCCCCCCCC--CcccCceEEEEeeeeehhhhHh---hHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhh Confidence 6666667777665 47778889999994 56666 589999999999999999999999999999988765 443 Q ss_pred cccccccccccccccccc-ccccccccCCCCCCcEEecCCccchhhhhhhccccHHHHHHHHHHHHHhCCCCccEEeecc Q lcl|NC_018846. 157 GDFVADDTILPTAEHPEF-KKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGD 235 (404) Q Consensus 157 g~~~n~~~~~p~~~~~~~-~~~~~N~v~apt~~r~~~~~~at~~~~i~~~D~~s~~~Id~~~~~a~~~a~pi~Pv~~~g~ 235 (404) . ..|....+.| .+... ..-..+.+ ....+..|.+- +.+..++..+.+..-|-.| .. T Consensus 142 ~-------~~~~~~~~~~~~G~~~----------~~~~~g~~-~~~~~~~~~l~-~a~~~a~~~L~e~dvp~~~--~~-- 198 (334) T protein:vir:80 142 F-------LAPAHLKPAFHDGILL----------PSTISGLA-ADAAADADVLV-AAHRQGVEAMVFRDLGDQL--MS-- 198 (334) T ss_pred h-------cccccccccccCCcce----------eecccccc-cchhhhHHHHH-HHHHHHHHHHHhcCCCCCc--CC-- Confidence 2 1111000000 00000 00000111 11122222221 2222344445444444110 11 Q ss_pred cccCccceEEEEecHHHHHHHhcCcchHHHHHHHHHHhhcccccCCcccccCceEEcCEEEEecCCceeeeccccceeec Q lcl|NC_018846. 236 ELHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKVLVS 315 (404) Q Consensus 236 ~~~~~~~~yV~~l~p~q~~~Lr~d~~~~~w~~~q~~A~~~~rg~~nPlF~G~~gm~ngvii~~~~~~~irf~~~~~~~~~ 315 (404) + ++++++|.|+..|..++.+- ++.-...+..+++=.|.++.|+|+.|.+-+++|..- . . T Consensus 199 ~-------R~~vv~P~~y~~Ll~~~r~~-------n~d~~~s~~~~~~~~g~i~~v~G~~V~~Sn~~P~~~--~-----t 257 (334) T protein:vir:80 199 E-------GVTLLDPVIFSFLLEHDRLM-------NVEFGAKEGGNSFVGGRIAMLNGVRVVETPRFPQSA--I-----T 257 (334) T ss_pred c-------eEEEeChHHHHHHhcccccc-------cceeccccccccccceeEEEEeceEEEeecCCCCcc--c-----c Confidence 1 79999999999999998641 111122333577778899999999999988877211 0 1 Q ss_pred ccccccccccccccccchhheeecCceeEEEeecCCCCCceeeeccccccchHHHHHHHHhhhhhccccCCCCCceEEEE Q lcl|NC_018846. 316 ENNLTATTKEVAAATNIDRAMLLGAQALANAYGQKAGGHFNMVEKKTDMDNRTEIAISWINGLKKIRFPEKSGKMQDHGV 395 (404) Q Consensus 316 ~~~~~~~~~~~a~~~~v~ralllGaqAl~~A~g~~~g~r~~w~Ee~~D~g~~~~i~i~~i~G~~K~rF~~~~g~~~DfGv 395 (404) ++..+...+.++..+.-.-+++....|++.+-...--+..++.|+.+ -.-|-....+|.+=+| + +=.+| T Consensus 258 ~~~~g~~~~~~agd~t~~~~~~~~~~Al~t~~~~~~~~e~~~~~~~~----~d~i~~~~a~G~g~lR-P------eaa~v 326 (334) T protein:vir:80 258 ANALGADFNVTDAEVRRKMITFIPSMALISAQVHPVSAQFWEEKKDF----GHYLDTFQSYNIGQRR-P------DAVAV 326 (334) T ss_pred ccccccccccccccccceEEEEEeCceEEEEEEeecceeeeechhhH----HHHHHHHHHcCCceec-c------ceEEE Confidence 11112222233333333345777777887765542122333333321 1123334556666666 3 24555 Q ss_pred EEEeceec Q lcl|NC_018846. 396 IAVDTAVK 403 (404) Q Consensus 396 i~idtaa~ 403 (404) |-|+.--| T Consensus 327 v~~~~~~~ 334 (334) T protein:vir:80 327 HDITVTNP 334 (334) T ss_pred EEEeeecC Confidence 55555555 No 9 >protein:vir:95875 Length: 401 # NCBI annotation: major coat protein # Family: family:all:10944 # MgeID: mge:1586 # MgeName: N4 # Cross-refs: genbank:acc:YP_950534;genbank:gi:119952248;genbank:GeneID:5075702 Probab=99.21 E-value=1.8e-12 Score=85.01 Aligned_cols=352 Identities=13% Similarity=0.154 Sum_probs=191.3 Q ss_pred CCcccchHHHHHHHHHHHHHhhcCchhHHH--HHhhhhhhhhhhcccccccCCCCCccEEEEeecCCCCCcEEEEEEeec Q lcl|NC_018846. 1 MTTVTSAQANKLYQVALFTAANRNRSMVNI--LTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHK 78 (404) Q Consensus 1 ~~~~~~~~a~~~~~~~lft~~~~n~~~~~~--~~~~l~~~~~k~s~~~~~~Gt~~~~~I~~~~dL~k~~Gd~v~f~L~~~ 78 (404) |-.|-.|.--. +=+-.+.+.|+++. |-+|+-....+.-.+..+-+ +.++-|+.|-+|.|.-..+ T Consensus 1 ~~~~~a~~~~~-----~~s~~g~~~~~~~t~y~~~k~L~~Aa~~lv~~~fA~---------~~piPkn~GkTIk~r~y~p 66 (401) T protein:vir:95 1 MLNYNAPTDGQ-----KSSIDGANSDQMQTFFWLKKAIITARKEQYFMPLAS---------VTNMPKHYGKTIKVYEYVP 66 (401) T ss_pred CCccCCCcccc-----cccccccccceeeehhhHHHHHhhhhhhhhhhhccc---------ccccccccCCeEEEEeccc Confidence 66665553211 12346777887764 66776666665555533322 4556688899999887777 Q ss_pred cccC------ce--ecCceeecchh------------hhhhceeEEEEeeccce-eccCChhhhhhhhhhHHHHHHHHHH Q lcl|NC_018846. 79 LSKR------PT--MGDERVEGRGE------------DLSHADFSLKINQGRHL-VDAGGRMSQQRTKFNLASSARTLLG 137 (404) Q Consensus 79 L~G~------gV--~Gd~~leGnee------------~L~~~s~~v~Idq~R~a-V~~~g~m~~qrs~~dlrk~ar~~L~ 137 (404) |.-. || .|-+...|+.= =+......=+||+..+. ++..+++.|-=-...|-.+.-+--. T Consensus 67 l~~~~~pl~eGv~a~G~~~~~g~~y~~~rdv~~it~~m~~~t~~~~rvn~v~~~~~d~~g~l~qyG~~~e~Td~~~dt~~ 146 (401) T protein:vir:95 67 LLDDRNINDQGIDASGATIVNGNLYGSSKDIGNITSKLPLLTENGGRVNRVGFTRIAREGSIHKFGFFYEFTQESIDFDS 146 (401) T ss_pred ccccccchhcCCCcccccccCccccccccccceeecccccccccccccccccceeeeeeeeeeeccCccchhhhhhhhhc Confidence 7531 22 11111111100 01111222234433322 1122222221000001111100000 Q ss_pred H-HHHHHHHHHHHHHHhhhhccccccccccccccccccccccccccCCCCCCcEEecCCccchhhh----hhhccccHHH Q lcl|NC_018846. 138 T-YFNDLQDQCAIVHLAGARGDFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQI----EAADIFSIGL 212 (404) Q Consensus 138 ~-w~~~~~D~~~~~~laG~~g~~~n~~~~~p~~~~~~~~~~~~N~v~apt~~r~~~~~~at~~~~i----~~~D~~s~~~ 212 (404) | =|..-+++.++ .|+.+-. .+. -.+++. .....+++++.+++.+++ ..+..++++. T Consensus 147 D~~l~~h~s~ell---~g~~~~t----------~d~-----i~~dll-~ag~~viyAg~ats~At~~~~~~~~t~vt~~~ 207 (401) T protein:vir:95 147 DDGLMEHLSRELM---NGATQIT----------EAV-----LQKDLL-AAAGTVLYAGAATSDATITGEGSTPSVVSYKN 207 (401) T ss_pred chHHHHHHHHHHh---hhhhhhH----------HHH-----HHHHHH-hhcCeeecCCccceeeeccccccccceechhH Confidence 0 01111112221 2221100 000 011112 113346677777777754 4678899999 Q ss_pred HHHHHHHHHHhCCCCccEEeecccccC---ccceEEEEecH------HHHHHHhcCcchHHHHHHHHHHhhcccccCCcc Q lcl|NC_018846. 213 VDNLSLFIDEMAHPLQPVRLSGDELHG---EDPYYVLYVTP------RQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPL 283 (404) Q Consensus 213 Id~~~~~a~~~a~pi~Pv~~~g~~~~~---~~~~yV~~l~p------~q~~~Lr~d~~~~~w~~~q~~A~~~~rg~~nPl 283 (404) |.++...+..-..|.+-..+.|-.+.+ ..+.||.|||| +.++||..||+ |...+|+|. ..++ T Consensus 208 l~rl~~~L~~nRapk~t~~i~~s~~~dTk~i~~s~va~~h~~L~~di~a~~D~~~~~~---fi~v~kYa~------~~~i 278 (401) T protein:vir:95 208 LMRLDQILTENRTPTQTTIITGSRMIDTKVIGATRVMYVGSELVPELKAMKDLFGNKA---FIETQHYAD------AGTI 278 (401) T ss_pred HHHHHHHHHhcccccchhhhhhhhccCccccccceEEEEecCchhHHHHHHHhcCCCC---ceehhhcCC------cccc Confidence 999999998777675444444432222 45789999999 88888889998 789999863 3689 Q ss_pred cccCceEEcCEEEEecCCceeeeccccceeecc-cccccccccccccccchhheeecCceeEEEeecCCCC--Cceee-- Q lcl|NC_018846. 284 FKGECAMWRNILVRKYAGMPIRFYQGSKVLVSE-NNLTATTKEVAAATNIDRAMLLGAQALANAYGQKAGG--HFNMV-- 358 (404) Q Consensus 284 F~G~~gm~ngvii~~~~~~~irf~~~~~~~~~~-~~~~~~~~~~a~~~~v~ralllGaqAl~~A~g~~~g~--r~~w~-- 358 (404) |.||+|.++|+.++..|.+ ..|..-++..... ..+............|--.|.||.+|.+..-=+..|+ .|... T Consensus 279 ~~gEiG~i~~vR~i~~p~~-~~w~~ag~~a~~~~~~y~~~~~~~gg~~dVyp~lV~G~dAf~~~~l~g~g~~~~~~~ivk 357 (401) T protein:vir:95 279 MNGEVGSIDKFRIIQVPEM-LHWAGAGAQATGANPGYRTSMVSGQEHYDVYPMLVVGDDSFTSIGFQTDGKSLKFTVMTK 357 (401) T ss_pred ccccccccCceeEEecccc-eeecCCcccccccccccccccccCCCcceeeeeeEEccccceecccccCCccccceeEee Confidence 9999999999999988875 3443333322111 1222222223444567788999999988764332332 12221 Q ss_pred -------eccccccchHHHHHHHHhhhhhccccCCCCCceEEEEEEEeceecC Q lcl|NC_018846. 359 -------EKKTDMDNRTEIAISWINGLKKIRFPEKSGKMQDHGVIAVDTAVKL 404 (404) Q Consensus 359 -------Ee~~D~g~~~~i~i~~i~G~~K~rF~~~~g~~~DfGvi~idtaa~~ 404 (404) +...-||..--+++++.+|...++ |==.+.|-|++|| T Consensus 358 ~pG~~~ad~~DPlgQ~g~vgwK~~~a~~vL~---------~e~m~~ies~a~~ 401 (401) T protein:vir:95 358 MPGKETADRNDPYGETGFSSIKWYYGILVKR---------PERLALIKTVAPL 401 (401) T ss_pred cCCcCCCCCCCcccceehhhhhhhhhhheec---------cceeEEEEeecCC Confidence 223346777789999999998886 2235678999999 No 10 >protein:vir:8885 Length: 347 # NCBI annotation: major capsid protein A # Family: family:all:975 # MgeID: mge:161 # MgeName: gh-1 # Cross-refs: genbank:acc:NP_813774;genbank:gi:29366729;genbank:GeneID:1258837 Probab=99.09 E-value=2.7e-11 Score=78.54 Aligned_cols=325 Identities=10% Similarity=0.075 Sum_probs=177.9 Q ss_pred CCcccchHHHHHHHHHHHHHhhcCc-------hhHHHHHhhhhhhhhhhcccccccCCCCCccEEEEeecCCCCCcEEEE Q lcl|NC_018846. 1 MTTVTSAQANKLYQVALFTAANRNR-------SMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTF 73 (404) Q Consensus 1 ~~~~~~~~a~~~~~~~lft~~~~n~-------~~~~~~~~~l~~~~~k~s~~~~~~Gt~~~~~I~~~~dL~k~~Gd~v~f 73 (404) |-. ...+.-|-|+.+... -.+++|++.+...-++.+.|..+ +++.++ ..|.++.| T Consensus 1 ~a~-------~~~~~~~~~~~g~~~~~~d~~al~ie~~~geV~~~f~~~s~~~~~---------~~~r~i--~~G~sv~~ 62 (347) T protein:vir:88 1 MAN-------ATGGQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDK---------HMVRTI--QNGKSASF 62 (347) T ss_pred CCC-------cccchhhhccCCCCccccchHHHHHHHHHHHHHHHHHHHhhhhhc---------cccccc--cCcceEEE Confidence 321 111122223333331 14688998876666655554322 222334 35999999 Q ss_pred EEeeccccCceecCceeecchhhhhhceeEEEEeec---cceeccCChhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018846. 74 SIMHKLSKRPTMGDERVEGRGEDLSHADFSLKINQG---RHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIV 150 (404) Q Consensus 74 ~L~~~L~G~gV~Gd~~leGnee~L~~~s~~v~Idq~---R~aV~~~g~m~~qrs~~dlrk~ar~~L~~w~~~~~D~~~~~ 150 (404) +-+...+....+-.+.+.+..+++.....+|.||+. +|.|+ .+++-...+|+|++.......=|++..|+.+|. T Consensus 63 ~~iG~~~~~~~~~g~~l~~~~~~~~~~~~~i~ID~~~y~~~~Vd---d~D~~q~~~D~r~~~~~~~g~aLA~~~D~~i~~ 139 (347) T protein:vir:88 63 PVMGRTKGYYLAPGENLDDKRKDIKHSEKVIQIDGLLTSDVLIY---DIEDAMNHYDVRAEYSAQLGEALAIAADGAVLA 139 (347) T ss_pred eeecceeeeeeccccCCCCCCCCCccceEEEEEechhhhhhhhh---hHHHHhhcCCchHHHHHHHHHHHHHHHHHHHHH Confidence 999999988877778888888889999999999997 67777 689999999999999999999999999999999 Q ss_pred HHhhhhccc-cccccccccccccccccccccccCCCCCCcEEecCCccchhhhhhhcccc---HHHHHHHHHHHHHhCCC Q lcl|NC_018846. 151 HLAGARGDF-VADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFS---IGLVDNLSLFIDEMAHP 226 (404) Q Consensus 151 ~laG~~g~~-~n~~~~~p~~~~~~~~~~~~N~v~apt~~r~~~~~~at~~~~i~~~D~~s---~~~Id~~~~~a~~~a~p 226 (404) ++..+.... .+.... ++-..-.....+.+++ +....... .+.|-.+...+++..-| T Consensus 140 ~l~~~a~~~~~~~~~~------------------~g~~~~~~~~~~~~~~--~~~~~~~~~~~~~~i~~a~~~Lde~~VP 199 (347) T protein:vir:88 140 EMAKLCNLPAASNENI------------------AGLGQAVVLNIGAAAD--LVDVEARGKAILKGLTLARARLTKNYVP 199 (347) T ss_pred HHHHhhcccccccccc------------------CCcccccccccccccc--ccchhhhHHHHHHHHHHHHHHHhhcCCC Confidence 987543210 111111 1100000001111111 11111111 34444566666655544 Q ss_pred CccEEeecccccCccceEEEEecHHHHHHHhcCcchHHHHHHHHHHhhcccccCCcccccCceEEcCEEEEecCCceeee Q lcl|NC_018846. 227 LQPVRLSGDELHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRF 306 (404) Q Consensus 227 i~Pv~~~g~~~~~~~~~yV~~l~p~q~~~Lr~d~~~~~w~~~q~~A~~~~rg~~nPlF~G~~gm~ngvii~~~~~~~irf 306 (404) . +. ++++++|.|+..|.+++.+... . ......+-.|.+|.++|+-|.+.+++|..- T Consensus 200 ~-------------~g-R~~vv~P~~y~~Ll~~~~~~~~--------~--~~~~~~~~~G~vg~i~G~~V~~s~nlp~~~ 255 (347) T protein:vir:88 200 A-------------GD-RRFYCAPEDYSAILSALMPNAA--------N--YAALIDPETGNIRNVMGFEVIEVPHLTVGG 255 (347) T ss_pred C-------------CC-CEEEeCHHHHHHHhcchhhhhh--------h--hccccchhcceeeeeccceEEEeecccccc Confidence 1 11 6788999999999998753211 1 122345778999999999999999987321 Q ss_pred ccccceeecccc----ccc---ccccccccccchhheeecCceeEEEeecCCCCCceeeeccccccch-HHHHHHHHhhh Q lcl|NC_018846. 307 YQGSKVLVSENN----LTA---TTKEVAAATNIDRAMLLGAQALANAYGQKAGGHFNMVEKKTDMDNR-TEIAISWINGL 378 (404) Q Consensus 307 ~~~~~~~~~~~~----~~~---~~~~~a~~~~v~ralllGaqAl~~A~g~~~g~r~~w~Ee~~D~g~~-~~i~i~~i~G~ 378 (404) .-..+.....+. +.. ....+...+.-.-+|++-..|++.+=.. --+.|-..|-.++ ..|-....+|. T Consensus 256 ~~~~~~~~~~~~t~~~~~~~~~~~~~~~~d~~~~~~l~~~~~a~g~v~~~-----d~~~e~~r~~~~~~d~i~~~~~~G~ 330 (347) T protein:vir:88 256 AGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLK-----DMALERARRPEFQADQIIGKYAMGH 330 (347) T ss_pred cccccccccccccccccccccccccccccccCcEEEEEechhhhhheecc-----cceeeeeechhhHHHHhhhhhhhcC Confidence 100000000000 000 0000111111111222222222222111 1123333333322 24555566777 Q ss_pred hhccccCCCCCceEEEEEEEecee Q lcl|NC_018846. 379 KKIRFPEKSGKMQDHGVIAVDTAV 402 (404) Q Consensus 379 ~K~rF~~~~g~~~DfGvi~idtaa 402 (404) +=+| + +=-++|.+..+| T Consensus 331 ~~~r-P------e~a~~~~~~~a~ 347 (347) T protein:vir:88 331 GGLR-P------EAAGALVFTPAA 347 (347) T ss_pred ceec-c------ceEEEEEeCCCC Confidence 7666 3 235667777777 No 11 >protein:vir:80180 Length: 381 # NCBI annotation: capsid protein # Family: family:all:2203 # MgeID: mge:1878 # MgeName: Pf-WMP3 # Cross-refs: genbank:acc:YP_001285797;genbank:gi:148747831;genbank:GeneID:5220456 Probab=99.03 E-value=1.1e-11 Score=80.56 Aligned_cols=316 Identities=12% Similarity=0.078 Sum_probs=159.9 Q ss_pred CCcccchHHHHHHHHHHHHHhhcCch----hH-HHHHhhhhhhhhhhcccccccCCCCCccEEEEeecCCCCCcEEEEEE Q lcl|NC_018846. 1 MTTVTSAQANKLYQVALFTAANRNRS----MV-NILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSI 75 (404) Q Consensus 1 ~~~~~~~~a~~~~~~~lft~~~~n~~----~~-~~~~~~l~~~~~k~s~~~~~~Gt~~~~~I~~~~dL~k~~Gd~v~f~L 75 (404) |-++- .-|=|-..+.+.+ ++ ++|++.+...-.++.-+. .+....+++...||+|+|+- T Consensus 1 ~~~~~--------~~~~~~~~~~~~t~~~~fiPev~s~~v~~~l~~~lv~~---------~l~~~~~~~~~~GdTV~ip~ 63 (381) T protein:vir:80 1 MATIQ--------GTGGYKGSAVDLSNVQVFIPEVWSSEVRMFRDQKFAAL---------EATKKIPFEGKKGDLIHIPN 63 (381) T ss_pred Cceec--------ccccccCcccchhhHHhhhhHHHHHHHHHHHHHhhhhh---------hccccccceeecCceEEeec Confidence 43331 1122222222222 22 567776654444333332 22233466667799999987 Q ss_pred eeccccCceecCceeecchhhhhhceeEEEEeeccce-eccCChhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_018846. 76 MHKLSKRPTMGDERVEGRGEDLSHADFSLKINQGRHL-VDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAG 154 (404) Q Consensus 76 ~~~L~G~gV~Gd~~leGnee~L~~~s~~v~Idq~R~a-V~~~g~m~~qrs~~dlrk~ar~~L~~w~~~~~D~~~~~~laG 154 (404) ....+-..+..+..+ ..+++...+.+|.||+.+.. +.+. .+++.....|+|.+....+...+++..|+.++..++. T Consensus 64 ~g~~~a~d~~~g~~i--~~~~~~~~~~~itID~~~~~~~~Id-d~D~~~~~~D~~~~~~~~~~~aLA~~~D~~i~~~~~~ 140 (381) T protein:vir:80 64 ISRAAVYDKQPQTPV--NLQARTDSEFTFTVTKYKESSFMIE-DIVNTQASYTLRQYYTKEAGYALARDMDNFALAHRAV 140 (381) T ss_pred cCcceeeeecCCCcc--cccccCCceEEEEEeeeeecceeec-hHHHHhhccChHHHHHHHHHHHHHHHHHHHHHHHHhh Confidence 665544344444433 34567778899999998753 4443 6888999999999999999999999999999877655 Q ss_pred hhccccccccccccccccccccccccccCCCCCCcEEecCCccchhhhh-hhccccHHHHHHHHHHHHHhCCCCccEEee Q lcl|NC_018846. 155 ARGDFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIE-AADIFSIGLVDNLSLFIDEMAHPLQPVRLS 233 (404) Q Consensus 155 ~~g~~~n~~~~~p~~~~~~~~~~~~N~v~apt~~r~~~~~~at~~~~i~-~~D~~s~~~Id~~~~~a~~~a~pi~Pv~~~ 233 (404) .......... ..++. ..+++....++ .+..++.+.|..++.++++..-|- + T Consensus 141 ~~~~~~~~~~------------------t~~~~-----i~~~~~~~~~t~~~~~~t~~~i~~a~~~Lde~~VP~-----e 192 (381) T protein:vir:80 141 INAFPSQRIY------------------SYDTT-----LGDGTVNAHLTGTPAPLTYAALLLAKQKLDEADVPQ-----E 192 (381) T ss_pred cccccccccc------------------ccccc-----ccccccccccccchhhHHHHHHHHHHHHHhhcCCCc-----C Confidence 4321111100 01110 00111111222 334556777778888888765541 1 Q ss_pred cccccCccceEEEEecHHHHHHHhcCcchHHHHHHHHHHhhcccccCCcccccCceEEcCEEEEecCCceeeecccccee Q lcl|NC_018846. 234 GDELHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKVL 313 (404) Q Consensus 234 g~~~~~~~~~yV~~l~p~q~~~Lr~d~~~~~w~~~q~~A~~~~rg~~nPlF~G~~gm~ngvii~~~~~~~irf~~~~~~~ 313 (404) | ++++++|.++.+|++++.+ .+.. -+..+.|..|.+|+|.|+.|++.+++|... +.... T Consensus 193 g---------R~lvv~P~~~~~Ll~~~~~---~~ad-------~~~~~~l~~G~Ig~i~G~~Vv~Sn~lp~~~--~t~~~ 251 (381) T protein:vir:80 193 G---------RIVMVSPAQYIDLLSINQF---ISVD-------FSQVKPVTSGVVGTILGMEVIVTTQIGINS--LTGYV 251 (381) T ss_pred C---------cEEEeCHHHHHHHhhchhh---hhhh-------hccchhhhceeeeEEcceEEEeeccccccc--cccee Confidence 1 5788999999999999763 3221 133567999999999999999988876321 11111 Q ss_pred ecccccccccccccccccchhheeecCc---eeEEEeecCCCCCceeeeccccccchHHHHHHHHhhhhhccccCCCCCc Q lcl|NC_018846. 314 VSENNLTATTKEVAAATNIDRAMLLGAQ---ALANAYGQKAGGHFNMVEKKTDMDNRTEIAISWINGLKKIRFPEKSGKM 390 (404) Q Consensus 314 ~~~~~~~~~~~~~a~~~~v~ralllGaq---Al~~A~g~~~g~r~~w~Ee~~D~g~~~~i~i~~i~G~~K~rF~~~~g~~ 390 (404) ..+. .+..... .+.-.-..|.+ |.++. -..+|+-+++....-+.-..++.+.-+.+.. T Consensus 252 ~~ag----ap~~~~~--~~~~~~~~g~~s~~a~av~-------------~~k~yd~~~~~~~~~~~~~~g~~~~~~~~~~ 312 (381) T protein:vir:80 252 NGQG----APTQPTP--GVLGSPYLPDQAGTANVVN-------------TGSASDLAVSLSYFGLPVFSGAGATAADGGQ 312 (381) T ss_pred eecc----ccccccc--cccccccccccccceeeee-------------eeeeeceeeeeeeccceeeecceeeecCCCc Confidence 0000 0000000 00001111111 11111 1224444443333333322222221111110 Q ss_pred e------------------EEEEEEEeceecC Q lcl|NC_018846. 391 Q------------------DHGVIAVDTAVKL 404 (404) Q Consensus 391 ~------------------DfGvi~idtaa~~ 404 (404) + ||--++.--++.- T Consensus 313 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 344 (381) T protein:vir:80 313 TLGSFGGANRWATAVVCHPDWLAVGVQQNVKS 344 (381) T ss_pred eeeeehhhhhhhhhcccccccccccceeEeec Confidence 0 1111111000000 No 12 >protein:vir:3364 Length: 347 # NCBI annotation: major capsid protein 10A # Family: family:all:975 # MgeID: mge:67 # MgeName: T3 # Cross-refs: genbank:acc:NP_523335;genbank:gi:17570826;genbank:GeneID:927448 Probab=99.00 E-value=8.1e-11 Score=75.91 Aligned_cols=338 Identities=12% Similarity=0.106 Sum_probs=185.6 Q ss_pred CC-cccchHHHHHHHHHHHHHhhcCc-hhHHHHHhhhhhhhhhhcccccccCCCCCccEEEEeecCCCCCcEEEEEEeec Q lcl|NC_018846. 1 MT-TVTSAQANKLYQVALFTAANRNR-SMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHK 78 (404) Q Consensus 1 ~~-~~~~~~a~~~~~~~lft~~~~n~-~~~~~~~~~l~~~~~k~s~~~~~~Gt~~~~~I~~~~dL~k~~Gd~v~f~L~~~ 78 (404) |- +.|-.+++.++..|= ..+-.. -.+++|++.+...-.+.+.+..+.- ..++ ..|++|.|+-+.. T Consensus 1 ~~~~~~~~~~~t~~g~~~--~~~~~~al~ie~~~g~V~~~f~~~s~~~~~v~---------~r~~--~~G~sv~i~~iG~ 67 (347) T protein:vir:33 1 MANIQGGQQIGTNQGKGQ--SAADKLALFLKVFGGEVLTAFARTSVTMPRHM---------LRSI--ASGKSAQFPVIGR 67 (347) T ss_pred CCCCccCcccccccccCC--cccchHHHHHHHHHHHHHHHHHHHHhhhhhhc---------cccc--cccceeEeeeccc Confidence 43 333333334433330 011111 1578999998777777666543332 2233 3499999999999 Q ss_pred cccCceecCceeecchhhhhhceeEEEEeecc---ceeccCChhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhhh Q lcl|NC_018846. 79 LSKRPTMGDERVEGRGEDLSHADFSLKINQGR---HLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGA 155 (404) Q Consensus 79 L~G~gV~Gd~~leGnee~L~~~s~~v~Idq~R---~aV~~~g~m~~qrs~~dlrk~ar~~L~~w~~~~~D~~~~~~laG~ 155 (404) .+-...+..+.+.++.++......+|.||+.. +.|+ .+++-+..+|+|.+.-.....=+++..|+.++.+++.+ T Consensus 68 ~t~~~~~~g~~l~~~~~~~~~~e~~ltiD~~~y~~~~Vd---diD~~q~~~D~~~~~~~~~g~aLA~~~D~~i~~~l~~~ 144 (347) T protein:vir:33 68 TKAAYLKPGENLDDKRKDIKHTEKVIHIDGLLTADVLIY---DIEDAMNHYDVRAEYTAQLGESLAMAADGAVLAELAGL 144 (347) T ss_pred eeeeeecCCCCCCCCCCCCccceEEEEechhhhhhHHHh---hHHHHhcCCchhHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 99888888889999989999999999999875 6666 58888899999999999999999999999999998766 Q ss_pred hccccccccccccccccccccccccccCCCCCCcEEecCCccchhhhhhhccccHHHHHHHHHHHHHhCCCCccEEeecc Q lcl|NC_018846. 156 RGDFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGD 235 (404) Q Consensus 156 ~g~~~n~~~~~p~~~~~~~~~~~~N~v~apt~~r~~~~~~at~~~~i~~~D~~s~~~Id~~~~~a~~~a~pi~Pv~~~g~ 235 (404) .+........ .+.+.+-...++..++ .+...+. ...++. -.+.|..+.+.+++..-|- +| T Consensus 145 ~~~~~~~~~~-----~~~~~~~~~~~~~~~~-------tg~~~d~-~~~a~~-i~~~i~~a~~~Lde~~VP~-----~g- 204 (347) T protein:vir:33 145 VNLPDGSNEN-----IEGLGKPTVLTLVKPT-------TGSLTDP-VELGKA-IIAQLTIARASLTKNYVPA-----AD- 204 (347) T ss_pred hhhhcccccc-----cccccccccccccccc-------cccccch-hhhHHH-HHHHHHHHHHHHhhcCCCc-----cC- Confidence 5421111000 0001000000000000 0001111 111121 1345555677777666551 11 Q ss_pred cccCccceEEEEecHHHHHHHhcCcchHHHHHHHHHHhhcccccCCcccccCceEEcCEEEEecCCceeeeccccceee- Q lcl|NC_018846. 236 ELHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKVLV- 314 (404) Q Consensus 236 ~~~~~~~~yV~~l~p~q~~~Lr~d~~~~~w~~~q~~A~~~~rg~~nPlF~G~~gm~ngvii~~~~~~~irf~~~~~~~~- 314 (404) ++++++|.|+..|..++.+- ++.. +..-.+-+|.+|.|+|+.|.+.+++|.....+..... T Consensus 205 --------R~~vv~P~~y~~Ll~~~~~~-------~~d~---~~~~~~~~G~V~~i~G~~V~~Sn~lp~~~~~~~~~~~~ 266 (347) T protein:vir:33 205 --------RTFYTTPDNYSAILAALMPN-------AANY---QALLDPERGTIRNVMGFEVVEVPHLTAGGAGDTREDAP 266 (347) T ss_pred --------cEEEeCHHHHHHHhcccccc-------cccc---ccccccccceeEEEeceeEEEecccccCcccccccccc Confidence 57889999999999998642 1111 1234578899999999999999987732111000000 Q ss_pred --cccccccc-cccccccccchhheeecCceeEEEeecCCCCCceeeeccccccchHHHHHHHHhhhhhccccCCCCCce Q lcl|NC_018846. 315 --SENNLTAT-TKEVAAATNIDRAMLLGAQALANAYGQKAGGHFNMVEKKTDMDNRTEIAISWINGLKKIRFPEKSGKMQ 391 (404) Q Consensus 315 --~~~~~~~~-~~~~a~~~~v~ralllGaqAl~~A~g~~~g~r~~w~Ee~~D~g~~~~i~i~~i~G~~K~rF~~~~g~~~ 391 (404) ..+.+... +......+.-.-+|++-..|++.+=.+.--+.-.|.++ .++ ..|-....+|.+=+| +. T Consensus 267 ag~~~~~~~~~~~~~~~a~~~~~gl~~h~~A~g~v~~~~~~~e~~r~~~--~~~--d~i~~~~~~G~~vlr-P~------ 335 (347) T protein:vir:33 267 ADQKHAFPATSSTTVKVALDNVVGLFQHRSAVGTVKLKDLALERARRAN--YQA--DQIIAKYAMGHGGLR-PE------ 335 (347) T ss_pred ccccccccCCcccceeccccceeeeeecchhheeeeeeceeeeeccchh--hhh--HhhhhhhhcCCceec-cc------ Confidence 00000000 00111222233457777777776643321111223332 111 233334444655554 31 Q ss_pred EEEEEEEeceec Q lcl|NC_018846. 392 DHGVIAVDTAVK 403 (404) Q Consensus 392 DfGvi~idtaa~ 403 (404) =-++|.+.-... T Consensus 336 ~av~i~~~~~~~ 347 (347) T protein:vir:33 336 AAGAIVLPKVSE 347 (347) T ss_pred ceEEEecCCCCC Confidence 122222222222 No 13 >protein:vir:94576 Length: 347 # NCBI annotation: Major capsid protein # Family: family:all:975 # MgeID: mge:1516 # MgeName: Berlin # Cross-refs: genbank:acc:YP_919012;genbank:gi:119637776;genbank:GeneID:5179336 Probab=99.00 E-value=3.9e-11 Score=77.68 Aligned_cols=328 Identities=11% Similarity=0.070 Sum_probs=178.1 Q ss_pred CC-cccchHHHHHHHHHHHHHhhcCch-hHHHHHhhhhhhhhhhcccccccCCCCCccEEEEeecCCCCCcEEEEEEeec Q lcl|NC_018846. 1 MT-TVTSAQANKLYQVALFTAANRNRS-MVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHK 78 (404) Q Consensus 1 ~~-~~~~~~a~~~~~~~lft~~~~n~~-~~~~~~~~l~~~~~k~s~~~~~~Gt~~~~~I~~~~dL~k~~Gd~v~f~L~~~ 78 (404) |- +.|-.+...++..+=. .+--+. .+|.|++.+...-.+.+.|..+.- ..++ ..|+++.|+-+.. T Consensus 1 ma~~~~~~~~~t~~g~~~~--~~d~~al~ie~~~geV~~~f~~~s~~~~~~~---------~rti--~~G~sv~~~~iG~ 67 (347) T protein:vir:94 1 MANMNGGQQMGKDQGKGMS--AGDKLALFLKVFGGEVLTAFTRTSVTMNKHL---------VRSI--QSGKSAQFPVLGR 67 (347) T ss_pred CCccccccccccccccCCc--ccchHHHHHHHHhHHHHHHHHHHHhhhhhhh---------heec--cccceEEeeeccc Confidence 42 2222222233322200 000011 468899988777776666643332 2234 3599999999999 Q ss_pred cccCceecCceeecchhhhhhceeEEEEeec---cceeccCChhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhhh Q lcl|NC_018846. 79 LSKRPTMGDERVEGRGEDLSHADFSLKINQG---RHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGA 155 (404) Q Consensus 79 L~G~gV~Gd~~leGnee~L~~~s~~v~Idq~---R~aV~~~g~m~~qrs~~dlrk~ar~~L~~w~~~~~D~~~~~~laG~ 155 (404) .+-....-.+.+.+.-+++.....+|.||+. +|.|+ .+++....+|+|.+.-.....=|++..|+.+|.+|.-+ T Consensus 68 ~~~~~~~~G~~l~~~~~~~~~~e~~ltID~~~y~~~~Vd---diD~~q~~~D~rs~~~~~~g~ALA~~~D~~i~~~l~~~ 144 (347) T protein:vir:94 68 TKAAYLQPGENLDDKRKDMKHTEKTINIDGLLTADVLIY---DIEDAMNHYDVRSEYTAQLGESLAMAADGAVLAEMAKL 144 (347) T ss_pred eeEeeeecCcCCCCCcCCccccceEEEEcchhhhhhhhh---hHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 9888777778888877889999999999996 55666 68899999999999999999999999999999887633 Q ss_pred hccccccccccccccccccccccccccCCCCCCcEEecCCccchhhhhhhcccc----HHHHHHHHHHHHHhCCCCccEE Q lcl|NC_018846. 156 RGDFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFS----IGLVDNLSLFIDEMAHPLQPVR 231 (404) Q Consensus 156 ~g~~~n~~~~~p~~~~~~~~~~~~N~v~apt~~r~~~~~~at~~~~i~~~D~~s----~~~Id~~~~~a~~~a~pi~Pv~ 231 (404) ... ......+.. . -|..-.+-.+.+ +.++.+..-+ .+.|-+++..+++..-|- T Consensus 145 a~~--~~~~~~~~~----g---------~~~~~~v~i~~~----~~~~~~~~~~~~~~~d~i~~a~~~Lde~dVP~---- 201 (347) T protein:vir:94 145 CNL--PTANNENIA----G---------LGKAHVLEVGDQ----ATLQGDQVKLGQAIIAQLTLARAKLTGNYVPS---- 201 (347) T ss_pred hcc--ccccccccc----c---------CCcceeEeeecc----ccccccccccHHHHHHHHHHHHHHhhhcCCCC---- Confidence 210 000000000 0 000000000000 0111111111 334555566666544441 Q ss_pred eecccccCccceEEEEecHHHHHHHhcCcchHHHHHHHHHHhhcccccCCcccccCceEEcCEEEEecCCceeeeccccc Q lcl|NC_018846. 232 LSGDELHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSK 311 (404) Q Consensus 232 ~~g~~~~~~~~~yV~~l~p~q~~~Lr~d~~~~~w~~~q~~A~~~~rg~~nPlF~G~~gm~ngvii~~~~~~~irf~~~~~ 311 (404) .+ +++++.|.|+..|.+..... . ..++..+.+=+|.++.++|+.|.+.++.|+.-. .. T Consensus 202 ---------~~-R~~vv~P~~y~~LLk~~~~~----~------~~~~~~~~~~~G~V~~v~G~~V~~Sn~~p~~~~--~~ 259 (347) T protein:vir:94 202 ---------SD-RVFYTTPDNYSAILAALMPN----A------ANYQALIDPSTGSIRNVMGFEVIEVPHLTAGGA--GD 259 (347) T ss_pred ---------CC-CEEEeChHHHHHHHHhhccc----c------cccccccccccceeEEeeceEEEEcCccccccC--cc Confidence 11 78999999999999753311 1 112334566689999999999999999874221 11 Q ss_pred eeecccc-cc--------cccccccccccchhheeecCceeEEEeecCCCCCceeeeccccccchH-HHHHHHHhhhhhc Q lcl|NC_018846. 312 VLVSENN-LT--------ATTKEVAAATNIDRAMLLGAQALANAYGQKAGGHFNMVEKKTDMDNRT-EIAISWINGLKKI 381 (404) Q Consensus 312 ~~~~~~~-~~--------~~~~~~a~~~~v~ralllGaqAl~~A~g~~~g~r~~w~Ee~~D~g~~~-~i~i~~i~G~~K~ 381 (404) ...+... .+ ...+.+...+.-..+|++-..|++.+=....-+..+ +|..++. .|-....+|..=. T Consensus 260 ~~~~~~~~~~~~~~~~~~~~~~~y~~d~~~~~~l~~~~~A~~tv~~~~~~~e~~-----~~~~~~~~~i~~~~a~G~g~~ 334 (347) T protein:vir:94 260 NRAEEGVAPTNQKHAFPDTASGDTRVALDNVVGLFNHRSAVGTVKLKDMALERA-----RRANFQADQIIAKYAMGHGGL 334 (347) T ss_pred cccccccccccccccccccccccccccccceEEEEechhhhhhhhhcccceeee-----echhhhhhhhhhhhhhcCccc Confidence 0000000 00 001112122222235666666555442221111222 2333322 3444556666666 Q ss_pred cccCCCCCceEEEEEEEecee Q lcl|NC_018846. 382 RFPEKSGKMQDHGVIAVDTAV 402 (404) Q Consensus 382 rF~~~~g~~~DfGvi~idtaa 402 (404) | + |.++..+.++| T Consensus 335 r-P-------e~a~~i~~~~a 347 (347) T protein:vir:94 335 R-P-------EACGALVFKKA 347 (347) T ss_pred c-c-------ceeEEEEecCC Confidence 5 3 56655555555 No 14 >protein:vir:3613 Length: 272 # NCBI annotation: MHP # Family: family:all:522 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112699;genbank:gi:13786567;genbank:GeneID:921035 Probab=98.98 E-value=1.1e-10 Score=75.09 Aligned_cols=262 Identities=12% Similarity=0.087 Sum_probs=149.8 Q ss_pred CCcccchHHHHHHHHHHHHHhhcCchhH-HHHHhhhhhhhhhhcccccccCCCCCccEEEEeecCCCCCcEEEEEEeecc Q lcl|NC_018846. 1 MTTVTSAQANKLYQVALFTAANRNRSMV-NILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKL 79 (404) Q Consensus 1 ~~~~~~~~a~~~~~~~lft~~~~n~~~~-~~~~~~l~~~~~k~s~~~~~~Gt~~~~~I~~~~dL~k~~Gd~v~f~L~~~L 79 (404) |..-. .-..+-.+ +.|+..+.....++..+. .......+|+-++|++|+|+....+ T Consensus 1 ma~~~---------------T~~~d~iiPev~~~~v~~~~~~~~~~~--------~~~~~~~~l~g~~G~ti~iP~~~~~ 57 (272) T protein:vir:36 1 MSKQK---------------TTLADLVNPEVLAPIVSYELNKALRFA--------PLAQVDTTLQGQPGNTLKFPAFTYI 57 (272) T ss_pred CCCcc---------------eehhhhhchHHHHHHHHHHHHhhhhhc--------cccccccccccCCCCEEEEeeeccC Confidence 22100 00001001 345554433333222221 1123346788889999999998766 Q ss_pred ccCc--eecCceeecchhhhhhceeEEEEeeccceeccCChhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhc Q lcl|NC_018846. 80 SKRP--TMGDERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARG 157 (404) Q Consensus 80 ~G~g--V~Gd~~leGnee~L~~~s~~v~Idq~R~aV~~~g~m~~qrs~~dlrk~ar~~L~~w~~~~~D~~~~~~laG~~g 157 (404) |+. +..++.+ ..+.|...++++.|.+...++.+.. ++...+.-|+..++.++++.+|++..|..++..|.|+.. T Consensus 58 -gda~~~~eg~~i--~~~~lt~~~~~~~i~~~~k~~~vtD-~~~~~~~~d~~~~~~~~~a~~~a~~~d~~i~~~l~~~~~ 133 (272) T protein:vir:36 58 -GDAADVAEGGEI--SLDKIGTTTKSVTIKKAAKGTEITD-EAALSGYGDPIGESNKQLGLSLANKVDDDLLSAAKTTSQ 133 (272) T ss_pred -ccccccCCCCcc--ChhhcCCcceeEeeehhhccccccH-HHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhccccc Confidence 432 2222222 4788999999999999988888765 566678899999999999999999999999877755321 Q ss_pred cccccccccccccccccccccccccCCCCCCcEEecCCccchhhhhhhccccHHHHHHHHHHHHHhCCCCccEEeecccc Q lcl|NC_018846. 158 DFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDEL 237 (404) Q Consensus 158 ~~~n~~~~~p~~~~~~~~~~~~N~v~apt~~r~~~~~~at~~~~i~~~D~~s~~~Id~~~~~a~~~a~pi~Pv~~~g~~~ 237 (404) .+ +-..+.+.|..|...+..... T Consensus 134 -----------------------~~----------------------~~~~~~d~i~~A~~~lgd~~~------------ 156 (272) T protein:vir:36 134 -----------------------TV----------------------STKANVDGVQAALDIFNDEDA------------ 156 (272) T ss_pred -----------------------cc----------------------cccccHHHHHHHHHHhhhcCC------------ Confidence 00 011244455555544432221 Q ss_pred cCccceEEEEecHHHHHHHhcCcchHHHHHHHHHHhhcccccCCcccccCceEEcCEEEEecCCceeeeccccceeeccc Q lcl|NC_018846. 238 HGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKVLVSEN 317 (404) Q Consensus 238 ~~~~~~yV~~l~p~q~~~Lr~d~~~~~w~~~q~~A~~~~rg~~nPlF~G~~gm~ngvii~~~~~~~irf~~~~~~~~~~~ 317 (404) +-++++|||.++..|++|+.+ .... ..+..+++++|.+|.|.|+.|..-.++|. T Consensus 157 ----~~~~ivv~p~~~~~L~k~~~~---~~~~------~~~~~~~~~~G~ig~~~G~~Vv~s~~~p~------------- 210 (272) T protein:vir:36 157 ----QAYVLIVNPKDAAKIRKDANA---KNIG------SEVGANALINGTYADVLGAQIVRSKKLAE------------- 210 (272) T ss_pred ----CceEEEEcHHHHHHHhccccc---cccc------ccccccceeeeccceecCeeEEEeCCCCC------------- Confidence 236899999999999999763 2221 12345789999999999999987665541 Q ss_pred ccccccccccccccchhheeecCceeEEEeecCCCCCceeeeccccccchHHHHHHHHhhhhhccccCCCCCceEEEEEE Q lcl|NC_018846. 318 NLTATTKEVAAATNIDRAMLLGAQALANAYGQKAGGHFNMVEKKTDMDNRTEIAISWINGLKKIRFPEKSGKMQDHGVIA 397 (404) Q Consensus 318 ~~~~~~~~~a~~~~v~ralllGaqAl~~A~g~~~g~r~~w~Ee~~D~g~~~~i~i~~i~G~~K~rF~~~~g~~~DfGvi~ 397 (404) +..+.-.++.|..|++..-.+ +. . +|...|-.. ..+.|.+ -+=||+-+ T Consensus 211 -----------~~~~~~~~~~~~gA~~~~~~~--~~--~-vE~~R~~~~----~~d~i~~------------~~~y~~~v 258 (272) T protein:vir:36 211 -----------GSALMFKIVSNSPALKLVLKR--GV--Q-VETDRDIVT----KTTVITA------------DEHYAAYL 258 (272) T ss_pred -----------CceeEEEEEecccceeeeecC--Cc--c-cccccchhh----cCcEEEE------------EEEEEEEE Confidence 111223456666666653222 11 1 333222211 1122222 12245544 Q ss_pred Eece--ecC Q lcl|NC_018846. 398 VDTA--VKL 404 (404) Q Consensus 398 idta--a~~ 404 (404) ++-. |++ T Consensus 259 ~~~~~vv~~ 267 (272) T protein:vir:36 259 YDLTKVVNI 267 (272) T ss_pred EcCccEEEE Confidence 4422 444 No 15 >protein:vir:10450 Length: 344 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:184 # MgeName: phiA1122 # Cross-refs: genbank:acc:NP_848297;genbank:gi:30387487;genbank:GeneID:1733971 Probab=98.93 E-value=1.7e-10 Score=74.17 Aligned_cols=333 Identities=12% Similarity=0.099 Sum_probs=174.5 Q ss_pred CC-cccchHHHHHHHHHHHHHhhcCchhHHHHHhhhhhhhhhhcccccccCCCCCccEEEEeecCCCCCcEEEEEEeecc Q lcl|NC_018846. 1 MT-TVTSAQANKLYQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKL 79 (404) Q Consensus 1 ~~-~~~~~~a~~~~~~~lft~~~~n~~~~~~~~~~l~~~~~k~s~~~~~~Gt~~~~~I~~~~dL~k~~Gd~v~f~L~~~L 79 (404) |. +.|-++++-.+--+-...-....-.+++|++.+...-.+.+.|.. -+++.+++ .|.++.|+-+... T Consensus 1 ma~~~~~~~~n~~~~~~~~~~~~~~al~ie~~~geV~~~f~~~s~~~~---------~~~~r~i~--~g~s~~~~~iG~~ 69 (344) T protein:vir:10 1 MANMTGGQQLGTNQGKDVMAAGDKLALFLKVFGGEVLTAFARTSVTTS---------RHMVRSIS--SGKSAQFPVLGRT 69 (344) T ss_pred CccccccccCCcccCCccCCccchhHHHHHHHHHHHHHHHHHHhhhcc---------cceeeeec--ccceEEEEeecee Confidence 44 223333333322221111112222578999998777776666542 22233553 4999999999888 Q ss_pred ccCceecCceeecchhhhhhceeEEEEeec---cceeccCChhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhh Q lcl|NC_018846. 80 SKRPTMGDERVEGRGEDLSHADFSLKINQG---RHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGAR 156 (404) Q Consensus 80 ~G~gV~Gd~~leGnee~L~~~s~~v~Idq~---R~aV~~~g~m~~qrs~~dlrk~ar~~L~~w~~~~~D~~~~~~laG~~ 156 (404) +-...+-.+.+.|.-+++.-...+|.||+. ||.|+ .+++..+.+|+|.+.-.....=|++..|+.++.+++.+. T Consensus 70 ~~~~~~~G~~l~~t~~~~~~~e~~l~ID~~~y~~~~Vd---DiD~~q~~~D~r~~~~~~~G~aLA~~~D~~i~~~la~~a 146 (344) T protein:vir:10 70 QAAYLAPGENLDDIRKDIKHTEKVITIDGLLTADVLIY---DIEDAMNHYDVRSEYTSQLGESLAMAADGAVLAEIAGLC 146 (344) T ss_pred EEEeeecCCCCCCCCCCcccceEEEEEcchhhhhhhhh---hHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhh Confidence 777777777898887788888999999994 56666 689999999999999999999999999999999987543 Q ss_pred ccccccccccccccccccccccccccCCCCCCcEEecC--Cc-cchhhhhhhccccHHHHHHHHHHHHHhCCCCccEEee Q lcl|NC_018846. 157 GDFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGG--DA-TSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLS 233 (404) Q Consensus 157 g~~~n~~~~~p~~~~~~~~~~~~N~v~apt~~r~~~~~--~a-t~~~~i~~~D~~s~~~Id~~~~~a~~~a~pi~Pv~~~ 233 (404) .. .-|....+.. .++. ..... ++ +.....+..+. -.+.|..+...+++..-|. T Consensus 147 ~~------~~~~~~~~~g---------~~~~--~~~~~~~~~~~~t~~~~~~~~-~~~~i~~a~~~Lde~~VP~------ 202 (344) T protein:vir:10 147 NV------ESQYNENITG---------LGTA--TVIETTQDKTTLTDQVALGKE-IIAALTKARAALTKNYVPS------ 202 (344) T ss_pred cc------cccccccccc---------cccc--ceeecccccccccchhhhHHH-HHHHHHHHHHHHhhcCCCc------ Confidence 21 0000000000 0000 00000 00 00001111111 1344556677777665441 Q ss_pred cccccCccceEEEEecHHHHHHHhcCcchHHHHHHHHHHhhcccccCCcccccCceEEcCEEEEecCCceeeecccccee Q lcl|NC_018846. 234 GDELHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKVL 313 (404) Q Consensus 234 g~~~~~~~~~yV~~l~p~q~~~Lr~d~~~~~w~~~q~~A~~~~rg~~nPlF~G~~gm~ngvii~~~~~~~irf~~~~~~~ 313 (404) + . ++++++|.|+..|+.++.+- .. .-+..+.+-+|.+|.++|+.|.+-++.|...-.+.... T Consensus 203 -~------g-R~~vv~P~~y~~Ll~~~~~~------~~----~~~~~~~~~~G~V~~v~G~~V~~Sn~lp~~~~~~~~~~ 264 (344) T protein:vir:10 203 -S------D-RVFYCDPDSYSAILAALMPN------AA----NYAALIDPEKGSIRNVMGFEVVEVPHLTAGGAGTSREG 264 (344) T ss_pred -c------C-CEEEeChHHHHHHhhccccc------cc----ccccccceeeeEEEEEeceEEEeccccccccCCccccc Confidence 1 1 56889999999999997631 11 12345667789999999999999988763311111000 Q ss_pred ecccccccccccccccccch----hheeecCceeEEEeecCCCCCceeeeccccccchHHHHHHHHhhhhhccccCCCCC Q lcl|NC_018846. 314 VSENNLTATTKEVAAATNID----RAMLLGAQALANAYGQKAGGHFNMVEKKTDMDNRTEIAISWINGLKKIRFPEKSGK 389 (404) Q Consensus 314 ~~~~~~~~~~~~~a~~~~v~----ralllGaqAl~~A~g~~~g~r~~w~Ee~~D~g~~~~i~i~~i~G~~K~rF~~~~g~ 389 (404) ...+.+ +..........++ .+|+|=--|++.+=....-+...|.|+ ..+ ..|-....+|.+=+| + T Consensus 265 ~tg~~~-~~~~~~~~~~~~~~s~~~~l~~h~~A~~~v~~~~~~~e~~r~~~--~~~--d~i~g~~~~G~~vlR-P----- 333 (344) T protein:vir:10 265 TTGQKH-AFPATKSGNDKVAKDNVIGLFMHRSAVGTVKLRDLALERARRAN--FQA--DQIIAKYAMGHGGLR-P----- 333 (344) T ss_pred ccCccc-cccCCcccceeeecceeEEEeechhhhhhhhhccceeecccchh--HHH--HHHHHHhhcccceec-c----- Confidence 000000 0000000111111 122222222222211110011122221 111 133344556666555 2 Q ss_pred ceEEEEEEEece Q lcl|NC_018846. 390 MQDHGVIAVDTA 401 (404) Q Consensus 390 ~~DfGvi~idta 401 (404) +=-|+|-+=|- T Consensus 334 -e~a~~v~~~~~ 344 (344) T protein:vir:10 334 -EAAGAVVFKTK 344 (344) T ss_pred -cceEEEEeecC Confidence 12344444333 No 16 >protein:vir:1541 Length: 347 # NCBI annotation: major capsid protein 10A # Family: family:all:975 # MgeID: mge:31 # MgeName: phiYeO3-12 # Cross-refs: genbank:acc:NP_052109;swissprot:trembl:q9t107;genbank:gi:9634035;uniprot:Q9T107;genbank:GeneID:1262383 Probab=98.93 E-value=2.8e-10 Score=72.99 Aligned_cols=334 Identities=12% Similarity=0.096 Sum_probs=181.2 Q ss_pred CCcccchHHHHHHHHHHHHHhhcCchhHHHHHhhhhhhhhhhcccccccCCCCCccEEEEeecCCCCCcEEEEEEeeccc Q lcl|NC_018846. 1 MTTVTSAQANKLYQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLS 80 (404) Q Consensus 1 ~~~~~~~~a~~~~~~~lft~~~~n~~~~~~~~~~l~~~~~k~s~~~~~~Gt~~~~~I~~~~dL~k~~Gd~v~f~L~~~L~ 80 (404) =++.|-++...++..+=+ .-.+-...+++|++.+...-++.|.+..+ ++..++. .|++|.|+-+...+ T Consensus 2 a~~~~~~~~~t~~~~~~~-~~~~~a~~ie~f~g~V~~~f~~~s~~~~~---------~~~~~~~--~G~sv~i~~ig~~t 69 (347) T protein:vir:15 2 ANIQGGQQIGTNQGKGQS-AADKLALFLKVFGGEVLTAFARTSVTMPR---------HMLRSIA--SGKSAQFPVIGRTK 69 (347) T ss_pred CccccCCccccccccCCC-cchHHHHHHHHHHHHHHHHHHHhhhhhhc---------ccccccc--ccceeEeeecccee Confidence 112222222223222211 01111235789999988877776665332 2333443 49999999999999 Q ss_pred cCceecCceeecchhhhhhceeEEEEeec---cceeccCChhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhc Q lcl|NC_018846. 81 KRPTMGDERVEGRGEDLSHADFSLKINQG---RHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARG 157 (404) Q Consensus 81 G~gV~Gd~~leGnee~L~~~s~~v~Idq~---R~aV~~~g~m~~qrs~~dlrk~ar~~L~~w~~~~~D~~~~~~laG~~g 157 (404) ....+..+.+.++.++......+|.||+. ++.|+ .+++..+++|+|.+.-.....=|++..|+.++.+|.++.. T Consensus 70 ~~~~~~g~~l~~~~~~~~~~e~~ltID~~~~~~~~Vd---dlD~~q~~~D~~~~~~~~~g~aLA~~~D~~i~~~l~~~~~ 146 (347) T protein:vir:15 70 AAYLKPGENLDDKRKDIKHTEKVIHIDGLLTADVLIY---DIEDAMNHYDVRAEYTAQLGESLAMAADGAVLAELAGLVN 146 (347) T ss_pred eeeeccCCCCCCCCCCCccceEEEEechhhhhhHHhh---hHHHHhcCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhh Confidence 88888888899998899999999999987 56665 6899999999999999999999999999999999986642 Q ss_pred cccccccccccccccccccccccccCCCCCCcEEecC-Cccc--hhhhhhhccccHHHHHHHHHHHHHhCCCCccEEeec Q lcl|NC_018846. 158 DFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGG-DATS--FEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSG 234 (404) Q Consensus 158 ~~~n~~~~~p~~~~~~~~~~~~N~v~apt~~r~~~~~-~at~--~~~i~~~D~~s~~~Id~~~~~a~~~a~pi~Pv~~~g 234 (404) -. |...+ .. ..|-..-+.-.. ..++ .......+.+ .+.|-.++..+++..-|- +| T Consensus 147 ~~-------~~~~~-~~--------~~~g~~~~~~~~~~~~~~~~~~~~~~~~i-~d~~~~a~~~Lde~~VP~-----~g 204 (347) T protein:vir:15 147 LP-------DASNE-NI--------EGLGKPTVLTLVKPTTGDLTDPVELGKAI-IAQLTIARASLTKNYVPA-----AD 204 (347) T ss_pred cc-------ccccc-cc--------cccCccccccccccccccchhhhhHHHHH-HHHHHHHHHHHhhcCCCc-----cC Confidence 00 00000 00 000000000000 0000 0111122222 445555666676655441 11 Q ss_pred ccccCccceEEEEecHHHHHHHhcCcchHHHHHHHHHHhhcccccCCcccccCceEEcCEEEEecCCceeeeccccceee Q lcl|NC_018846. 235 DELHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKVLV 314 (404) Q Consensus 235 ~~~~~~~~~yV~~l~p~q~~~Lr~d~~~~~w~~~q~~A~~~~rg~~nPlF~G~~gm~ngvii~~~~~~~irf~~~~~~~~ 314 (404) ++++++|.++..|..++.+- ++. -.....+-+|.+|.|+|+.|.+.+++|.. .+..... T Consensus 205 ---------R~~vv~P~~y~~LL~~~~~~-------~~d---~~~~~~~~~G~Vg~i~G~~V~~Sn~lp~~--~~t~~~~ 263 (347) T protein:vir:15 205 ---------RTFYTTPDNYSAILAALMPN-------AAN---YQALIDHERGTIRNVMGFEVVEVPHLTAG--GAGDTRE 263 (347) T ss_pred ---------CEEEeCHHHHHHHhcccccc-------ccc---ccccccccceEEEEEeceEEEeccccccc--ccccccc Confidence 57999999999999998642 111 11234577899999999999998887622 1111100 Q ss_pred cc-----ccccccc-ccccccccchhheeecCceeEEEeecCCCCCceeeeccccccchHHHHHHHHhhhhhccccCCCC Q lcl|NC_018846. 315 SE-----NNLTATT-KEVAAATNIDRAMLLGAQALANAYGQKAGGHFNMVEKKTDMDNRTEIAISWINGLKKIRFPEKSG 388 (404) Q Consensus 315 ~~-----~~~~~~~-~~~a~~~~v~ralllGaqAl~~A~g~~~g~r~~w~Ee~~D~g~~~~i~i~~i~G~~K~rF~~~~g 388 (404) .+ +...+.. ......+...-+|++-..|++.+=.+.--+.-.|.++ .++ ..|-....+|.+=+| +. T Consensus 264 ~~~~g~~~~~~~~~~~~~~~~f~~~~~l~~h~~A~g~v~~~~~~~e~~~~~~--~~~--d~i~~~~~~G~~vlr-P~--- 335 (347) T protein:vir:15 264 DAPADQKHAFPATSSTTVKVALDNVVGLFQHRSAVGTVKLKDLALERARRAN--YQA--DQIIAKYAMGHGGLR-PE--- 335 (347) T ss_pred cccccccccccccccceeeeccccceeeeeccceeeeeEeeceeeeecccch--hhh--hhhehhhhcCCceec-cc--- Confidence 00 0000100 0112222333456777777766644321111223332 111 222333344555444 31 Q ss_pred CceEEEEEEEeceec Q lcl|NC_018846. 389 KMQDHGVIAVDTAVK 403 (404) Q Consensus 389 ~~~DfGvi~idtaa~ 403 (404) =-++|.+.-... T Consensus 336 ---~av~~~~~~~~~ 347 (347) T protein:vir:15 336 ---AAGAIVLPKVSE 347 (347) T ss_pred ---cEEEEecCCCCC Confidence 112222221111 No 17 >protein:vir:2201 Length: 345 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:49 # MgeName: T7 # Cross-refs: genbank:acc:NP_041998;swissprot:sw:p19726;genbank:gi:9627469;goa:P19726;uniprot:P19726;genbank:GeneID:1261026 Probab=98.92 E-value=5.6e-10 Score=71.32 Aligned_cols=330 Identities=11% Similarity=0.091 Sum_probs=181.9 Q ss_pred CCcccc-hHHHHHHHHHHHHHhhcCch---hHHHHHhhhhhhhhhhcccccccCCCCCccEEEEeecCCCCCcEEEEEEe Q lcl|NC_018846. 1 MTTVTS-AQANKLYQVALFTAANRNRS---MVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIM 76 (404) Q Consensus 1 ~~~~~~-~~a~~~~~~~lft~~~~n~~---~~~~~~~~l~~~~~k~s~~~~~~Gt~~~~~I~~~~dL~k~~Gd~v~f~L~ 76 (404) |++... .+++..+..+-. +.+++ .+|.|++.+...-++.+.+. .-+++.+++ .|.++.|+-+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~---~~~~~~al~le~f~geV~~~f~~~s~~~---------~~~~~r~i~--~gks~~~~~i 66 (345) T protein:vir:22 1 MASMTGGQQMGTNQGKGVV---AAGDKLALFLKVFGGEVLTAFARTSVTT---------SRHMVRSIS--SGKSAQFPVL 66 (345) T ss_pred Ccccccchhcccccccccc---cCCchhHHHHHHHhHHHHHHHHHHhhhc---------ccceeeecc--ccceEEEeee Confidence 766654 233333333322 13333 57899999877777766663 223334553 4889999999 Q ss_pred eccccCceecCceeecchhhhhhceeEEEEeecc---ceeccCChhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_018846. 77 HKLSKRPTMGDERVEGRGEDLSHADFSLKINQGR---HLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLA 153 (404) Q Consensus 77 ~~L~G~gV~Gd~~leGnee~L~~~s~~v~Idq~R---~aV~~~g~m~~qrs~~dlrk~ar~~L~~w~~~~~D~~~~~~la 153 (404) ...+-...+-.+.+.+..++......+|.||+.. |.|+ .+++...++|+|.+.-..+..=|++..|+.++.+|. T Consensus 67 G~~~~~~~~~G~~l~~~~~~~~~~e~~ltID~~~y~~~~Vd---diD~~q~~~D~r~~~s~~~G~aLA~~~D~~i~~~l~ 143 (345) T protein:vir:22 67 GRTQAAYLAPGENLDDKRKDIKHTEKVITIDGLLTADVLIY---DIEDAMNHYDVRSEYTSQLGESLAMAADGAVLAEIA 143 (345) T ss_pred cceEEEeeecCCCCCCCCCCcccceEEEEecchhhhhhhHh---hHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 8888777777788999988889999999999976 4555 689999999999999999999999999999999987 Q ss_pred hhhccccccccccccccccccccccccccCCCCCCcEEecCCccchhhhhhhcccc---HHHHHHHHHHHHHhCCCCccE Q lcl|NC_018846. 154 GARGDFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFS---IGLVDNLSLFIDEMAHPLQPV 230 (404) Q Consensus 154 G~~g~~~n~~~~~p~~~~~~~~~~~~N~v~apt~~r~~~~~~at~~~~i~~~D~~s---~~~Id~~~~~a~~~a~pi~Pv 230 (404) .+... .+...-.|. +...++.. .+. .+...++..-+.. .+.|-.+...+++..-|. T Consensus 144 k~a~~-~~~~~~~~~---~~~~~~~~-~~~-------------~~g~~~t~~~~~~~~~~~ai~~a~~~Lde~~VP~--- 202 (345) T protein:vir:22 144 GLCNV-ESKYNENIE---GLGTATVI-ETT-------------QNKAALTDQVALGKEIIAALTKARAALTKNYVPA--- 202 (345) T ss_pred Hhhcc-ccccccccc---cccccccc-ccc-------------cccccccccccCHHHHHHHHHHHHHHhhhcCCCc--- Confidence 54321 111000111 00110000 000 0011111111111 233444556666655552 Q ss_pred EeecccccCccceEEEEecHHHHHHHhcCcchHHHHHHHHHHhhcccccCCcccccCceEEcCEEEEecCCceeeecccc Q lcl|NC_018846. 231 RLSGDELHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGS 310 (404) Q Consensus 231 ~~~g~~~~~~~~~yV~~l~p~q~~~Lr~d~~~~~w~~~q~~A~~~~rg~~nPlF~G~~gm~ngvii~~~~~~~irf~~~~ 310 (404) ++ ++++++|.|+..|+.++.+- ++ .-+..+.+=+|.++.++|+.|.+.++.|....... T Consensus 203 --~~---------R~~vv~P~~y~~Ll~~~~~~-------~~---~~~~~~~~~~G~V~~i~G~~V~~sn~lp~~~~~~~ 261 (345) T protein:vir:22 203 --AD---------RVFYCDPDSYSAILAALMPN-------AA---NYAALIDPEKGSIRNVMGFEVVEVPHLTAGGAGTA 261 (345) T ss_pred --cC---------CEEEeChHHHHHHhcccccc-------cc---ccccccccccceEEEEeceEEEecccccccccCcc Confidence 11 67999999999999998642 11 12345666689999999999999888763211100 Q ss_pred ceee--ccccccccc---ccccccccchhheeecCceeEEEeecCCCCCceeeeccccccchHHHHHHHHhhhhhccccC Q lcl|NC_018846. 311 KVLV--SENNLTATT---KEVAAATNIDRAMLLGAQALANAYGQKAGGHFNMVEKKTDMDNRTEIAISWINGLKKIRFPE 385 (404) Q Consensus 311 ~~~~--~~~~~~~~~---~~~a~~~~v~ralllGaqAl~~A~g~~~g~r~~w~Ee~~D~g~~~~i~i~~i~G~~K~rF~~ 385 (404) .... ..+...... +...+..+ ..++++-..|++.+=...--+...|.|+ ..+ ..|-....+|.+=+|-. T Consensus 262 ~~~~~~~~~~~~~~~g~~~~~~~~~~-~~~l~~h~~A~~~v~~~~~~~e~~r~~~--~~~--d~I~~~~a~G~~vlRPe- 335 (345) T protein:vir:22 262 REGTTGQKHVFPANKGEGNVKVAKDN-VIGLFMHRSAVGTVKLRDLALERARRAN--FQA--DQIIAKYAMGHGGLRPE- 335 (345) T ss_pred ccCcccccccccccccceeeeeccCc-eEEEEEehhheeeeeeecceeeeeechh--HHH--HHHHHHHhcCCcccccc- Confidence 0000 000000000 00001111 1356665665554422211112222222 111 24444556666666622 Q ss_pred CCCCceEEEEEEEeceec Q lcl|NC_018846. 386 KSGKMQDHGVIAVDTAVK 403 (404) Q Consensus 386 ~~g~~~DfGvi~idtaa~ 403 (404) =-++|.+. ++ T Consensus 336 ------aa~~i~~~--~~ 345 (345) T protein:vir:22 336 ------AAGAVVFK--VE 345 (345) T ss_pred ------eeEEEEEe--eC Confidence 22333322 22 No 18 >protein:vir:78739 Length: 332 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:1856 # MgeName: Syn5 # Cross-refs: genbank:acc:YP_001285448;genbank:gi:148724482;genbank:GeneID:5220210 Probab=98.89 E-value=2.7e-10 Score=73.04 Aligned_cols=318 Identities=14% Similarity=0.123 Sum_probs=181.8 Q ss_pred CCcccc---hH-HHHHHHHHHHHHhhcCc-hhHHHHHhhhhhhhhhhcccccccCCCCCccEEEEeecCCCCCcEEEEEE Q lcl|NC_018846. 1 MTTVTS---AQ-ANKLYQVALFTAANRNR-SMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSI 75 (404) Q Consensus 1 ~~~~~~---~~-a~~~~~~~lft~~~~n~-~~~~~~~~~l~~~~~k~s~~~~~~Gt~~~~~I~~~~dL~k~~Gd~v~f~L 75 (404) ||+.-+ |+ +.- .+.+- ....+. -.+++|++.+...-.+.|.|..+. ++.++ ..|++|.|+- T Consensus 1 ~~~~~~~~~~~~~~~-~~~~~--~~d~~~al~le~~~geV~~~f~~~s~~~~~~---------~~r~i--~~G~tv~i~~ 66 (332) T protein:vir:78 1 MTTLSNFSLPNQANG-GARNA--DYDVRYATALKLFSGEVFTAFNNASIFKGLV---------RSYDL--RGGKSKQFMF 66 (332) T ss_pred CcccccccCCccccC-Ccccc--ccccchhhhhhhhhhhHHHHHHHHhhhhhcc---------ccccc--cccceEEEEe Confidence 777544 21 111 00000 011121 256899999888888777764332 22333 2599999999 Q ss_pred eeccccCceecCceeecchhhhhhceeEEEEeec---cceeccCChhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018846. 76 MHKLSKRPTMGDERVEGRGEDLSHADFSLKINQG---RHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHL 152 (404) Q Consensus 76 ~~~L~G~gV~Gd~~leGnee~L~~~s~~v~Idq~---R~aV~~~g~m~~qrs~~dlrk~ar~~L~~w~~~~~D~~~~~~l 152 (404) +...+-...+.++.+.++ +.+.-...+|.||+. ++.|+ .+++..+++|||.+.-+....=|++..|+.++.++ T Consensus 67 ig~~~~~~~~~g~~l~~~-~~~~~~~~~l~ID~~ky~~~~Vd---diD~~q~~~dl~~~~~~~~g~aLA~~~D~~i~~~l 142 (332) T protein:vir:78 67 TGKLSAGYHTPGTPIVGD-AGIKANEKTLVMDDLLVSSQFVY---SLDEIFSQYSTRAEVSKQIGEALATHYDERIARVL 142 (332) T ss_pred ccceeEeeecCCCCCCCC-CCCCCceEEEEEehhhhhHHHHH---hHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHH Confidence 998887777777888886 458888889999995 45565 58999999999999999999999999999999888 Q ss_pred hhhhccccccccccccccccccccccccccCCCCCCcEEecCCccchhhhhhhccccHHHHHHHHHHHHHhCCCCccEEe Q lcl|NC_018846. 153 AGARGDFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRL 232 (404) Q Consensus 153 aG~~g~~~n~~~~~p~~~~~~~~~~~~N~v~apt~~r~~~~~~at~~~~i~~~D~~s~~~Id~~~~~a~~~a~pi~Pv~~ 232 (404) ..+..... |....+ ....+..+.+.+ + +.+. -.+.|-.+...+++..-| T Consensus 143 ~~aa~~~~------~~~~~~-------------g~~~~~~~~~~~----~-~~~~-~~~~i~~a~~~Lde~~VP------ 191 (332) T protein:vir:78 143 AKASAEAS------PVTGEP-------------GGFHVNIGAGNT----N-DAQA-IVDGFFEAAAVLDERSAP------ 191 (332) T ss_pred HhhhcccC------cccccc-------------cccccccCCccc----c-CHHH-HHHHHHHHHHHHhhcCCC------ Confidence 64321000 010011 111111111111 1 1111 234455566677665544 Q ss_pred ecccccCccceEEEEecHHHHHHHhc--CcchHHHHHHHHHHhhcccccCCcccccC-ceEEcCEEEEecCCceeeeccc Q lcl|NC_018846. 233 SGDELHGEDPYYVLYVTPRQWNDWYT--STSGKDWNQMMVRAVNRAKGFNHPLFKGE-CAMWRNILVRKYAGMPIRFYQG 309 (404) Q Consensus 233 ~g~~~~~~~~~yV~~l~p~q~~~Lr~--d~~~~~w~~~q~~A~~~~rg~~nPlF~G~-~gm~ngvii~~~~~~~irf~~~ 309 (404) .+ . +++++.|.++..|.+ |+.+ .++ ...+.+-.+..|. ++.|+|+.|.+.++.|... + T Consensus 192 -~~------g-R~~vv~P~~y~~Ll~~~d~~~-------~n~--~~~~~~~~~~~g~~i~~i~G~~V~~Sn~lp~~~--g 252 (332) T protein:vir:78 192 -QE------G-RVAVLSPRQYYSLISSVDTNI-------LNR--EIGNSQGDMNSGKGLYSIAGIRILKSNNLAGLY--G 252 (332) T ss_pred -cc------C-CEEEeCHHHHHHHHhhcCcee-------eee--eccccccceecceeeeEEeeeEEEecCccccCc--c Confidence 11 1 577799999999987 5432 111 1123344567775 8999999999988876221 1 Q ss_pred cceeecccccccccccccccccchhheeecCceeEEEeecCC---CCCceeeeccccccchHHHHHHHHhhhhhccccCC Q lcl|NC_018846. 310 SKVLVSENNLTATTKEVAAATNIDRAMLLGAQALANAYGQKA---GGHFNMVEKKTDMDNRTEIAISWINGLKKIRFPEK 386 (404) Q Consensus 310 ~~~~~~~~~~~~~~~~~a~~~~v~ralllGaqAl~~A~g~~~---g~r~~w~Ee~~D~g~~~~i~i~~i~G~~K~rF~~~ 386 (404) .....++ .+...+.++..+.-.-++++...|++.+=.+.. -++-+|.|+.+ ...|-....+|.+=+| +. T Consensus 253 ~~~~~~~--~~~~~n~~~~~~~~~~~~~~h~~a~~~v~~~~~~~~~t~~~~~~~~~----~d~i~~~~~~G~~v~r-Pe- 324 (332) T protein:vir:78 253 QDLSSAA--VTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDFNVQYQ----GDLIVGKLAMGCGSLR-TS- 324 (332) T ss_pred ccccccc--ccccccccccccccceEEeecccceeeeeeeccchhhhhcccchhhh----HhhhhhhhhhcCceec-cc- Confidence 1110000 111122334434445578888888877744321 11224444432 2345555567764444 32 Q ss_pred CCCceEEEEEEEece Q lcl|NC_018846. 387 SGKMQDHGVIAVDTA 401 (404) Q Consensus 387 ~g~~~DfGvi~idta 401 (404) ++++|-+| T Consensus 325 -------~~v~l~~a 332 (332) T protein:vir:78 325 -------VAGSFQAA 332 (332) T ss_pred -------ceEEEeeC Confidence 34444444 No 19 >protein:vir:96123 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240078;genbank:gi:66395742;genbank:GeneID:5133103 Probab=98.88 E-value=1.8e-10 Score=74.02 Aligned_cols=260 Identities=15% Similarity=0.079 Sum_probs=149.8 Q ss_pred CCcccchHHHHHHHHHHHHHhh-cCchhHHHHHhhhhhhhhhhcccccccCCCCCccEEEEeecCCCCCcEEEEEEeecc Q lcl|NC_018846. 1 MTTVTSAQANKLYQVALFTAAN-RNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKL 79 (404) Q Consensus 1 ~~~~~~~~a~~~~~~~lft~~~-~n~~~~~~~~~~l~~~~~k~s~~~~~~Gt~~~~~I~~~~dL~k~~Gd~v~f~L~~~L 79 (404) |.+-. |... .-.| +.|+..+.....++.-+. ......++|.-++|++|+|+... + T Consensus 1 ma~~~-------------T~~~d~i~P--ev~s~~v~~~~~~~~~~~--------~~~~~~~~l~g~~G~tv~ip~~~-~ 56 (274) T protein:vir:96 1 MAQGT-------------TKVSNLIVP--EVLAPMMQAELDKKLRFA--------QFADIDSTLVGQPGDTLTFPAFT-Y 56 (274) T ss_pred CCccc-------------cchhhhhhh--HHHHHHHHHHHHhhhhhc--------ccccccccccCCCCCEEEEEeec-c Confidence 22111 1100 0011 356655433333222221 12233457777899999999876 4 Q ss_pred ccCc--eecCceeecchhhhhhceeEEEEeeccceeccCChhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhc Q lcl|NC_018846. 80 SKRP--TMGDERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARG 157 (404) Q Consensus 80 ~G~g--V~Gd~~leGnee~L~~~s~~v~Idq~R~aV~~~g~m~~qrs~~dlrk~ar~~L~~w~~~~~D~~~~~~laG~~g 157 (404) .|+. +..++.+ ..+.+...++++.|++...++.... .+...+..|+..++..+++.+|++..|..++..|.|++. T Consensus 57 ~g~~~~~~~g~~i--~~~~it~~~~~~~i~~~~~~~~i~D-~~~~~~~~d~~~~~~~~~~~~~a~~~d~~i~~~l~~a~~ 133 (274) T protein:vir:96 57 SGDAQVIAEGEKI--PVDQIGTSKREAKVRKIGKGTELTD-EAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGATL 133 (274) T ss_pred CCCccccCCCCcC--chhhcccceeEEEEEeeeceeeecH-HHHHhhcchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCC Confidence 3332 2222222 3778999999999999888888765 455668889999999999999999999999866644210 Q ss_pred cccccccccccccccccccccccccCCCCCCcEEecCCccchhhhhhhccccHHHHHHHHHHHHHhCCCCccEEeecccc Q lcl|NC_018846. 158 DFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDEL 237 (404) Q Consensus 158 ~~~n~~~~~p~~~~~~~~~~~~N~v~apt~~r~~~~~~at~~~~i~~~D~~s~~~Id~~~~~a~~~a~pi~Pv~~~g~~~ 237 (404) -..++.++.+.|..|..++..... T Consensus 134 --------------------------------------------~~~~~~~~~d~i~dA~~~l~d~~~------------ 157 (274) T protein:vir:96 134 --------------------------------------------TVEADITKLDGLQTAIDKFNDEDL------------ 157 (274) T ss_pred --------------------------------------------CcCcccccHHHHHHHHHHhcccCC------------ Confidence 002233456666666655543211 Q ss_pred cCccceEEEEecHHHHHHHhcCcchHHHHHHHHHHhhcccccCCcccccCceEEcCEEEEecCCceeeeccccceeeccc Q lcl|NC_018846. 238 HGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKVLVSEN 317 (404) Q Consensus 238 ~~~~~~yV~~l~p~q~~~Lr~d~~~~~w~~~q~~A~~~~rg~~nPlF~G~~gm~ngvii~~~~~~~irf~~~~~~~~~~~ 317 (404) ...+++|||.++..|+++... +|..- ..+-++.+-.|.+|.|.|+.|..-.++| T Consensus 158 ----~~~~ivv~p~~~~~L~k~~~~-~f~~~-------~~~g~~~~~~g~ig~~~G~~Vi~s~~~p-------------- 211 (274) T protein:vir:96 158 ----EPMVLFVNPLDAGGLRTSASD-NFTRP-------TQLGDNIIVKGAFGEALGAVIVRSNKLN-------------- 211 (274) T ss_pred ----CceEEEeCHHHHHHHHhcccc-ccccc-------ccccccceeecccceecCeeEEEcCCCC-------------- Confidence 136799999999999998641 12211 1223577889999999999987655543 Q ss_pred ccccccccccccccchhheeecCceeEEEeecCCCCCceeeeccccccchHHHHHHHHhhhhhccccCCCCCceEEEEEE Q lcl|NC_018846. 318 NLTATTKEVAAATNIDRAMLLGAQALANAYGQKAGGHFNMVEKKTDMDNRTEIAISWINGLKKIRFPEKSGKMQDHGVIA 397 (404) Q Consensus 318 ~~~~~~~~~a~~~~v~ralllGaqAl~~A~g~~~g~r~~w~Ee~~D~g~~~~i~i~~i~G~~K~rF~~~~g~~~DfGvi~ 397 (404) ...++|+|..|++.+-++. .. .|...|-.. ..+.|.|. +=||+-. T Consensus 212 --------------~~t~~l~~~gA~~~~~~~~----~~-vE~~Rd~~~----~~d~i~~~------------~~yg~~~ 256 (274) T protein:vir:96 212 --------------KGEALLAKKGAVKLITKRD----FF-LEKDRDASR----KSTALYSD------------KHYVAYL 256 (274) T ss_pred --------------cceEEEEeCcceeeeecCC----cc-cccccchhh----cccEEEEe------------eEEEEEE Confidence 1125888988888764432 11 233222221 11222221 1133333 Q ss_pred Ee--------c--eecC Q lcl|NC_018846. 398 VD--------T--AVKL 404 (404) Q Consensus 398 id--------t--aa~~ 404 (404) ++ + |=+. T Consensus 257 ~~~~~vv~~t~~~~~~~ 273 (274) T protein:vir:96 257 YDESKVVKITKGAGDEV 273 (274) T ss_pred EcCccEEEEEcCccccc Confidence 32 2 1122 No 20 >protein:vir:6324 Length: 335 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:132 # MgeName: phiKMV # Cross-refs: genbank:acc:NP_877471;genbank:gi:33300843;uniprot:Q7Y2D3;genbank:GeneID:1482613 Probab=98.86 E-value=3.3e-09 Score=67.08 Aligned_cols=327 Identities=13% Similarity=0.095 Sum_probs=176.4 Q ss_pred CCcccchHHHHHHHHHHHHHhhcCchhHHHHHhhhhhhhhhhcccccccCCCCCccEEEEeecCCCCCcEEEEEEeeccc Q lcl|NC_018846. 1 MTTVTSAQANKLYQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLS 80 (404) Q Consensus 1 ~~~~~~~~a~~~~~~~lft~~~~n~~~~~~~~~~l~~~~~k~s~~~~~~Gt~~~~~I~~~~dL~k~~Gd~v~f~L~~~L~ 80 (404) ||+--+ ..+...+=- ....+=.+|.|.+.+...-+..+.|..+ .++..+ ..|+++.|+-+.+.+ T Consensus 1 ms~~~~---~tr~~~~~s--~~d~al~le~f~geV~~af~~~s~~~~~---------~~~rti--~~g~s~~~~~iG~~~ 64 (335) T protein:vir:63 1 MSFLND---LTRPNYAGK--NADVDIHLEEHLGIVDKHFAYTSKFAPL---------MNIRDL--RGSNVVRLDRLGNVE 64 (335) T ss_pred CCCccc---chhhhcccc--cchhheehhhhhhhHHHHHHhhhhhccc---------cceeee--ccceeEEEeeeeeee Confidence 776521 111111100 0001113688999877766666665422 222334 449999999998888 Q ss_pred cCceecCceeecchhhhhhceeEEEEee---ccceeccCChhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhc Q lcl|NC_018846. 81 KRPTMGDERVEGRGEDLSHADFSLKINQ---GRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARG 157 (404) Q Consensus 81 G~gV~Gd~~leGnee~L~~~s~~v~Idq---~R~aV~~~g~m~~qrs~~dlrk~ar~~L~~w~~~~~D~~~~~~laG~~g 157 (404) -...+=.+.+.|+- -......|.||. .||.|+ .+++-..++|+|++.-..+..=+++..||.+|.+++=+.. T Consensus 65 ~~~~~pG~~l~~~~--~~~~k~~itVD~ll~a~~~I~---dlDe~~~~yDvRse~s~e~G~aLA~~~D~~~~~~i~~aa~ 139 (335) T protein:vir:63 65 AKGRRAGEELERSR--VVNDKWNLTVDTLLYLRHQFD---HQDEWTQSFDMRKEVAELDGQELARKFDQACLIQVIKAAA 139 (335) T ss_pred eecccCCcCcCCCC--ccccceEEEecceeechhhhh---hHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHhhcc Confidence 77777777777774 344677899999 778877 5888999999999999999999999999999988763321 Q ss_pred cccccccccccccccccccccccccCCCCCCcEEecCCccchhhhhhhccccHHHHHHHHHHHHHhCCCCccEEeecccc Q lcl|NC_018846. 158 DFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDEL 237 (404) Q Consensus 158 ~~~n~~~~~p~~~~~~~~~~~~N~v~apt~~r~~~~~~at~~~~i~~~D~~s~~~Id~~~~~a~~~a~pi~Pv~~~g~~~ 237 (404) .......+...++.. .. +-+ .+...+.+..|.+.-.+. .+..++.+..-|-.|+ T Consensus 140 --~~a~~~~~~~~~~G~---~~--------~~~-----~tg~~~~~~~~~l~~a~~-~a~~~L~e~dVP~~~~------- 193 (335) T protein:vir:63 140 --MDAPVDLEDAFSPGV---LE--------KLD-----LTGLTAKQAADKIVRMHR-RVVETFIDRDLGDAVY------- 193 (335) T ss_pred --ccCccccCCCcCCCc---ce--------eee-----eccCcccccHHHHHHHHH-HHHHHHHhccCCCccc------- Confidence 000000111101110 00 001 111122233343332222 3344444444331100 Q ss_pred cCccceEEEEecHHHHHHHhcCcchHHHHHHHHHHhhcccccCCcccccCceEEcCEEEEecCCceeeeccccceeeccc Q lcl|NC_018846. 238 HGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKVLVSEN 317 (404) Q Consensus 238 ~~~~~~yV~~l~p~q~~~Lr~d~~~~~w~~~q~~A~~~~rg~~nPlF~G~~gm~ngvii~~~~~~~irf~~~~~~~~~~~ 317 (404) ++ ++++++|.||..|..++.+ .+ ......+..++.-.|.++.++||.|.+-+++|-.. +... T Consensus 194 ---~d-r~~vv~P~~y~~Ll~~~~l---~n----~~~~~s~~~~~~~~g~v~~v~Gv~V~~sn~lP~~~--~t~~----- 255 (335) T protein:vir:63 194 ---SE-GLTPMSPRVFSLLLEHDKL---MN----VEYQATGATNDYVKSRVAILNGVKVLETPRFATKA--IAAH----- 255 (335) T ss_pred ---Cc-eEEEeChHHHHHHhccccc---cc----cccccccccccccCceeEEeeceEEEeeccCCCCC--cccc----- Confidence 11 7899999999999999763 11 11111233466778999999999999999887321 1111 Q ss_pred ccccccccccccccchhheeecCceeEEEeecCCCCCceeeeccccccchHHHHHHHHhhhhhccccCCCCCceEEEEEE Q lcl|NC_018846. 318 NLTATTKEVAAATNIDRAMLLGAQALANAYGQKAGGHFNMVEKKTDMDNRTEIAISWINGLKKIRFPEKSGKMQDHGVIA 397 (404) Q Consensus 318 ~~~~~~~~~a~~~~v~ralllGaqAl~~A~g~~~g~r~~w~Ee~~D~g~~~~i~i~~i~G~~K~rF~~~~g~~~DfGvi~ 397 (404) ..+...+.++.-+.-.-++++-..|++.+=...-.+...|.+.. .. .-|-....+|..=.| + +=.++|. T Consensus 256 ~lg~a~n~~~~d~~~~~~~~~~~~Al~t~~~~~vt~e~~~~~~~--~~--~~i~~~~a~G~g~lR-P------e~a~~i~ 324 (335) T protein:vir:63 256 PLGRHFNVSAEESERQIALFLPSKTLITAQVAPVQAKLWEDNEK--FS--WVLDTFQMYNIGARR-P------DTAGAIE 324 (335) T ss_pred cccccCCccccccceeEEEEEecceEEEEEEeecccceeeccch--hh--HHhHHHHHcCCcccc-c------ceEEEEE Confidence 11111111122122223567777776666443222333333332 11 234444556666555 2 2233333 Q ss_pred EeceecC Q lcl|NC_018846. 398 VDTAVKL 404 (404) Q Consensus 398 idtaa~~ 404 (404) + |.+.- T Consensus 325 ~-tg~~~ 330 (335) T protein:vir:63 325 L-KGIGA 330 (335) T ss_pred E-cCCCc Confidence 2 43333 No 21 >protein:vir:94711 Length: 347 # NCBI annotation: capsid # Family: family:all:975 # MgeID: mge:1528 # MgeName: K1F # Cross-refs: genbank:acc:YP_338120;genbank:gi:77118198;genbank:GeneID:3707734 Probab=98.80 E-value=1e-09 Score=69.81 Aligned_cols=331 Identities=11% Similarity=0.130 Sum_probs=172.9 Q ss_pred CCcccchHHHHHHHHHHHHHhhcCchhHHHHHhhhhhhhhhhcccccccCCCCCccEEEEeecCCCCCcEEEEEEeeccc Q lcl|NC_018846. 1 MTTVTSAQANKLYQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLS 80 (404) Q Consensus 1 ~~~~~~~~a~~~~~~~lft~~~~n~~~~~~~~~~l~~~~~k~s~~~~~~Gt~~~~~I~~~~dL~k~~Gd~v~f~L~~~L~ 80 (404) |.-.+-+....++..|=. ...+-.-.++.|.+.++..-+..+.+.. .+++.++ ..|++|.|+-+...+ T Consensus 1 m~~~~~~~~~t~~g~~~~-~~d~~al~ik~f~~eV~~~f~~~s~~~~---------~~~~r~i--~~G~sv~i~~iG~~t 68 (347) T protein:vir:94 1 MANVPGQKIGTDQGKGKS-SSDALALFLKVFAGEVLTAFTRRSVTAD---------KHIVRTI--QNGKSAQFPVMGRTS 68 (347) T ss_pred CCCCCccccccccccCCc-cccHHHHHHHHHhHHHHHHHHHHHhhhc---------ccccccc--cccceEEEeccccee Confidence 443333332222221100 0000112356777777666555554432 2233344 359999999999999 Q ss_pred cCceecCceeecchhhhhhceeEEEEeec---cceeccCChhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhc Q lcl|NC_018846. 81 KRPTMGDERVEGRGEDLSHADFSLKINQG---RHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARG 157 (404) Q Consensus 81 G~gV~Gd~~leGnee~L~~~s~~v~Idq~---R~aV~~~g~m~~qrs~~dlrk~ar~~L~~w~~~~~D~~~~~~laG~~g 157 (404) -...+-++.+.|+-+++.-...+|.||+. |+.|+ .+++....+|+|++.-.....=+++..|+.++.+++.+.+ T Consensus 69 v~~~t~G~~l~~~~~~~~~~e~~itID~~~~~~~~Vd---diD~~q~~~D~~~~~~~~~g~aLa~~~D~~i~~~~~~~aa 145 (347) T protein:vir:94 69 GVYLAPGERLSDKRKGIKHTEKVITIDGLLTADVMIF---DIEDAMNHYDVAGEYSNQLGEALAIAADGAVLAEMAILCN 145 (347) T ss_pred eeeecCCCCcCCCCCCCCcceEEEEecchhhhhHHhh---hHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHhc Confidence 88888889999988888999999999998 56776 5889999999999999999999999999999988765443 Q ss_pred cccccccccccccccccccccccccCCCCCCcEEecCCccch-hhhhhhccccHHHHHHHHHHHHHhCCCCccEEeeccc Q lcl|NC_018846. 158 DFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSF-EQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDE 236 (404) Q Consensus 158 ~~~n~~~~~p~~~~~~~~~~~~N~v~apt~~r~~~~~~at~~-~~i~~~D~~s~~~Id~~~~~a~~~a~pi~Pv~~~g~~ 236 (404) ..... +....+ ...++ .+-.+..+.. ......+.+ .+.|-.+...+++..-|- T Consensus 146 ~~~~~--------~~~~~g-----~~~~s---~~~~~~~~~~~~~~~~~~~~-~~~i~~a~~~Lde~~VP~--------- 199 (347) T protein:vir:94 146 LPAAS--------NENIAG-----LGTAS---VLEVGKKADLDTPAKLGEAI-IGQLTIARAKLTSNYVPA--------- 199 (347) T ss_pred ccccc--------ccccCC-----Ccccc---eeeccccccccchhhhHHHH-HHHHHHHHHHHhhcCCCC--------- Confidence 11110 000000 00010 0000000000 000111111 244555666676655551 Q ss_pred ccCccceEEEEecHHHHHHHhcCcchHHHHHHHHHHhhcccccCCcccccCceEEcCEEEEecCCceeeeccccceeecc Q lcl|NC_018846. 237 LHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKVLVSE 316 (404) Q Consensus 237 ~~~~~~~yV~~l~p~q~~~Lr~d~~~~~w~~~q~~A~~~~rg~~nPlF~G~~gm~ngvii~~~~~~~irf~~~~~~~~~~ 316 (404) +. ++++++|.++..|..++.+.. +. ...+..+=.|.+|.++|+.|.+-+++|.. ...... .. T Consensus 200 ----~~-R~~vv~P~~~~~Ll~~~~~~~-------~~---~~~~~~~~~G~Vg~i~G~~V~~Sn~lp~~--~~t~~~-~~ 261 (347) T protein:vir:94 200 ----GD-RYFYTTPDNYSAILAALMPNA-------AN---YAALIDPETGNIRNVMGFVVVEVPHLVQG--GAGETR-GD 261 (347) T ss_pred ----CC-cEEEeCHHHHHHHhccchhhh-------hh---ccccccccccceEEEeceEEEecCccccc--cccccc-cc Confidence 11 677899999999999876421 10 11123355799999999999999988731 110000 00 Q ss_pred ccccccc-----------ccccccccchhheeecCceeEEEeecCCCCCceeeeccccccch-HHHHHHHHhhhhhcccc Q lcl|NC_018846. 317 NNLTATT-----------KEVAAATNIDRAMLLGAQALANAYGQKAGGHFNMVEKKTDMDNR-TEIAISWINGLKKIRFP 384 (404) Q Consensus 317 ~~~~~~~-----------~~~a~~~~v~ralllGaqAl~~A~g~~~g~r~~w~Ee~~D~g~~-~~i~i~~i~G~~K~rF~ 384 (404) ...+... ..+...+.-..+|++=.-|++.+ +.-... .|-.+|-.+. ..|-....+|.+=+| + T Consensus 262 ~~~~~~aG~~~~~~~~~~~~~~~~~~~~~~l~~h~~A~~~v--~~~~~~---~e~~r~~~~~~d~i~~~~~~G~~~~r-P 335 (347) T protein:vir:94 262 DGITIASGQKHAFPATASSDVKVTMDNVVGLFSHRSAVGTV--KLRDLA---LERDRDVDAQGDLIVGKYAMGHGGLR-P 335 (347) T ss_pred CcceecCcccccccccchhhhcccccceeEEEeehhhhhhh--hccccc---ccchhchhhHHHHhhhhhhhcCcccc-c Confidence 0000000 01111111123333333333322 111000 1111121111 244445566666665 3 Q ss_pred CCCCCceEEEEEEEeceec Q lcl|NC_018846. 385 EKSGKMQDHGVIAVDTAVK 403 (404) Q Consensus 385 ~~~g~~~DfGvi~idtaa~ 403 (404) +=-|+|.+. +|. T Consensus 336 ------~~a~~~~~~-~A~ 347 (347) T protein:vir:94 336 ------EAAGALVFS-PAE 347 (347) T ss_pred ------ceeEEEEec-CCC Confidence 233444444 555 No 22 >protein:vir:739 Length: 231 # NCBI annotation: major structural protein 4 # Family: family:all:522 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108716;genbank:gi:13487838;genbank:GeneID:920884 Probab=98.78 E-value=2.3e-10 Score=73.42 Aligned_cols=221 Identities=13% Similarity=0.101 Sum_probs=135.0 Q ss_pred ecCCCCCcEEEEEEeeccccCceecCceeecc---hhhhhhceeEEEEeeccceeccCChhhhhhhhhhHHHHHHHHHHH Q lcl|NC_018846. 62 DLNKQAGDEVTFSIMHKLSKRPTMGDERVEGR---GEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGT 138 (404) Q Consensus 62 dL~k~~Gd~v~f~L~~~L~G~gV~Gd~~leGn---ee~L~~~s~~v~Idq~R~aV~~~g~m~~qrs~~dlrk~ar~~L~~ 138 (404) |=.-+.||+|+|+- ..|+- .+..||. .|.|++.+++..|.+...++.+.. ..+....-|+..++...|+. T Consensus 1 ~~~~~~Gdtit~P~---~iGda---~~v~eG~~i~~~~l~~t~~~atIk~~gk~~~itD-~a~l~~~gDp~~ea~~Q~~~ 73 (231) T protein:vir:73 1 ENGINLANLCEYPN---DIGDA---ADVAEGGEISLDKIGTTTKSVTIKKAAKGTEITD-EAALSGYGDPIGESNKQLGL 73 (231) T ss_pred CccccCCceEEecc---cccch---hhhcCCCcCChhhccccceeeeEeeeccceeeeH-HHHhhccCchHHHHHHHHHH Confidence 44558899999982 34442 3344554 688999999999999999999875 45555678999999999999 Q ss_pred HHHHHHHHHHHHHHhhhhccccccccccccccccccccccccccCCCCCCcEEecCCccchhhhhhhccccHHHHHHHHH Q lcl|NC_018846. 139 YFNDLQDQCAIVHLAGARGDFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSL 218 (404) Q Consensus 139 w~~~~~D~~~~~~laG~~g~~~n~~~~~p~~~~~~~~~~~~N~v~apt~~r~~~~~~at~~~~i~~~D~~s~~~Id~~~~ 218 (404) -+++..|..++..+.+++ ++.+-.++.+.|.+|.. T Consensus 74 ~iA~kvD~di~~~~~~a~---------------------------------------------l~~~~~~t~d~i~~A~~ 108 (231) T protein:vir:73 74 SLANKVDDDLLKAAKTTS---------------------------------------------QTVSTKANVDGVQAALD 108 (231) T ss_pred HHHHhhhHHHHHhhcccc---------------------------------------------ccccccccHHHHHHHHH Confidence 999999999875554322 00111246777776655 Q ss_pred HHHHhCCCCccEEeecccccCccceEEEEecHHHHHHHhcCcchHHHHHHHHHHhhcccccCCcccccCceEEcCEEEEe Q lcl|NC_018846. 219 FIDEMAHPLQPVRLSGDELHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRK 298 (404) Q Consensus 219 ~a~~~a~pi~Pv~~~g~~~~~~~~~yV~~l~p~q~~~Lr~d~~~~~w~~~q~~A~~~~rg~~nPlF~G~~gm~ngvii~~ 298 (404) ... ++ +..-++++|||.++.+||+++.+- |. ...+-++.+++|.+|++.||.|.. T Consensus 109 ~fg-------------de---~~~~~vivv~p~~~~~Lrk~~~~~-~~--------~~~~g~~i~~~G~iG~i~G~~Vi~ 163 (231) T protein:vir:73 109 IFN-------------DE---DAQAYVLIVNPKDAAKIRKDANAK-NI--------GSEVGANALINGTYADVLGAQIVR 163 (231) T ss_pred Hhc-------------cc---cccceEEEEcchHHHhhhhccchh-hh--------hhhhccceeeecccceEcceEEEE Confidence 542 22 122378999999999999998731 22 113456889999999999999876 Q ss_pred cCCceeeeccccceeecccccccccccccccccchhheeecCceeEEEeecCCCCCceeeeccccccchHHHHHHHHhhh Q lcl|NC_018846. 299 YAGMPIRFYQGSKVLVSENNLTATTKEVAAATNIDRAMLLGAQALANAYGQKAGGHFNMVEKKTDMDNRTEIAISWINGL 378 (404) Q Consensus 299 ~~~~~irf~~~~~~~~~~~~~~~~~~~~a~~~~v~ralllGaqAl~~A~g~~~g~r~~w~Ee~~D~g~~~~i~i~~i~G~ 378 (404) -+++|. ++. +.--++..-.|+++..-+. +. +|...|-..+.-.- T Consensus 164 S~~~~~----~~~--------------------~~~~~i~~~gAl~~~~k~~----~~-vEtdRd~~~k~~~i------- 207 (231) T protein:vir:73 164 SKKLAE----GSA--------------------LMFKIVSNSPALKLVLKRG----VQ-VETDRDIVTKTTVI------- 207 (231) T ss_pred cCCCCC----Cce--------------------eeeeEEeeccceeeeeccc----ce-eeccccccccccEE------- Confidence 655431 000 0011333444555553321 11 45444433332211 Q ss_pred hhccccCCCCCceEEEEEEEece--ecC Q lcl|NC_018846. 379 KKIRFPEKSGKMQDHGVIAVDTA--VKL 404 (404) Q Consensus 379 ~K~rF~~~~g~~~DfGvi~idta--a~~ 404 (404) .. -+-|+|-.++-. |+| T Consensus 208 -----~~----~~~y~v~l~~~~~vv~~ 226 (231) T protein:vir:73 208 -----TA----DEHYAAYLYDLTKVVNI 226 (231) T ss_pred -----EE----eEEEEEEEEcCccEEEE Confidence 11 123333333221 112 No 23 >protein:vir:78935 Length: 335 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:1860 # MgeName: LKD16 # Cross-refs: genbank:acc:YP_001522824;genbank:gi:158345059;genbank:GeneID:5687425 Probab=98.78 E-value=7.4e-09 Score=65.17 Aligned_cols=327 Identities=13% Similarity=0.097 Sum_probs=170.7 Q ss_pred CCcccchHHHHHHHHHHHHHhhcCchhHHHHHhhhhhhhhhhcccccccCCCCCccEEEEeecCCCCCcEEEEEEeeccc Q lcl|NC_018846. 1 MTTVTSAQANKLYQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLS 80 (404) Q Consensus 1 ~~~~~~~~a~~~~~~~lft~~~~n~~~~~~~~~~l~~~~~k~s~~~~~~Gt~~~~~I~~~~dL~k~~Gd~v~f~L~~~L~ 80 (404) ||+--+ ..+...+=-+ ...+=.+|.|++.+...-..++.|..+ ..+.++ ..|.++.|+-+.+.+ T Consensus 1 ms~~~~---~t~~~~~~s~--~d~al~le~f~geV~~af~~~s~~~~~---------~~~rti--~~g~s~~~~~iG~~~ 64 (335) T protein:vir:78 1 MSFLND---LTRPNYAGKN--ADVDIHLEEHLGIVDKHFAYTSKFAPL---------MNIRDL--RGSNVVRLDRLGNVE 64 (335) T ss_pred CCcccc---cccccccccc--chhhhhhhhhhhHHHHHHHHhhhhccc---------cceeee--ccceeEEEeeeeeee Confidence 776521 1111110000 000113689999887777766666422 223344 449999999988887 Q ss_pred cCceecCceeecchhhhhhceeEEEEee---ccceeccCChhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhc Q lcl|NC_018846. 81 KRPTMGDERVEGRGEDLSHADFSLKINQ---GRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARG 157 (404) Q Consensus 81 G~gV~Gd~~leGnee~L~~~s~~v~Idq---~R~aV~~~g~m~~qrs~~dlrk~ar~~L~~w~~~~~D~~~~~~laG~~g 157 (404) -...+=.+.+.|+ ........|.||. .||.|+ .+++-.+.+|+|++.-..+..-+++..||.+|.++.=+.. T Consensus 65 ~~~~~pG~~l~~~--~~~~~k~~itID~ll~a~~~Vd---dlDe~~~~yDvR~e~s~~~G~aLA~~~Dq~~~~~l~~aa~ 139 (335) T protein:vir:78 65 AKGRRAGEELERS--RVVNDKWNLTVDTLLYLRHQFD---HQDEWTQSFDMRKEVAELDGQELARKFDQACLIQVIKAAA 139 (335) T ss_pred ecccccCcccCCC--CcccCCeEEEecceeechhhHh---hHHHhhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHhhcc Confidence 7666666777666 4566777999999 778877 5889999999999999999999999999999988763321 Q ss_pred cccccccccccccccccccccccccCCCCCCcEEecCCccchhhhhhhccccHHHHHHHHHHHHHhCCCCccEEeecccc Q lcl|NC_018846. 158 DFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDEL 237 (404) Q Consensus 158 ~~~n~~~~~p~~~~~~~~~~~~N~v~apt~~r~~~~~~at~~~~i~~~D~~s~~~Id~~~~~a~~~a~pi~Pv~~~g~~~ 237 (404) .......+...++.. . ...++-+ ..+....+.+. +.+..+.....+..-| -.-. + T Consensus 140 --~~a~~~~~~~~~~G~---~--------~~~~~tg-----~~~~~~~~~l~-~a~~~a~~~l~ekdvP--~~~~--~-- 194 (335) T protein:vir:78 140 --MDAPVDLEDAFSPGV---L--------EKLDLTG-----LTAKEAAEKIV-RMHRRVVETFIERDLG--DAVY--S-- 194 (335) T ss_pred --cccccccCCCcCCCc---c--------eeeeecc-----ccccccHHHHH-HHHHHHHHHHHhccCC--CCCC--C-- Confidence 111111111100000 0 0000111 11111122221 1222233333333333 0000 0 Q ss_pred cCccceEEEEecHHHHHHHhcCcchHHHHHHHHHHhhcccccCCcccccCceEEcCEEEEecCCceeeeccccceeeccc Q lcl|NC_018846. 238 HGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKVLVSEN 317 (404) Q Consensus 238 ~~~~~~yV~~l~p~q~~~Lr~d~~~~~w~~~q~~A~~~~rg~~nPlF~G~~gm~ngvii~~~~~~~irf~~~~~~~~~~~ 317 (404) -.|++++|.||..|..++.+- + ......+..+++=.|.++.++||.|.+-+++|-.. ++. + T Consensus 195 -----~rv~vv~P~~y~~Ll~~~~l~---n----~~~~~s~~~~~~~~g~v~~v~Gv~V~~Sn~lP~~~--~t~-----~ 255 (335) T protein:vir:78 195 -----EGLTPMSPRVFSLLLEHDKLM---S----VEYQATGATNDYVKSRVAILNGVKVLETPRFATKA--ISA-----H 255 (335) T ss_pred -----ccEEEeChHHHHHHhcccccc---c----ccccccccccccccceeEEeeceEEEeeccCCCCC--Ccc-----c Confidence 179999999999999997631 1 11111233567778999999999999999887321 111 1 Q ss_pred ccccccccccccccchhheeecCceeEEEeecCCCCCceeeeccccccchHHHHHHHHhhhhhccccCCCCCceEEEEEE Q lcl|NC_018846. 318 NLTATTKEVAAATNIDRAMLLGAQALANAYGQKAGGHFNMVEKKTDMDNRTEIAISWINGLKKIRFPEKSGKMQDHGVIA 397 (404) Q Consensus 318 ~~~~~~~~~a~~~~v~ralllGaqAl~~A~g~~~g~r~~w~Ee~~D~g~~~~i~i~~i~G~~K~rF~~~~g~~~DfGvi~ 397 (404) ..+...+.+...+.-.-++++=..|++-+=-..-.++..|.+..+ -.-|-....+|..=.| + |..+.. T Consensus 256 ~lg~a~n~~~~d~~~~~~~~~~~~Al~t~~~~~~~~e~~~~~~~~----~~~i~~~~a~G~g~lR-P-------e~a~~i 323 (335) T protein:vir:78 256 PLGRHFNVSAEEAERQIALFLPSKTLITAQVAPVQAKLWEDHDQF----SWVLDTFQMYNIGARR-P-------DTAGAI 323 (335) T ss_pred cccccCCcccccccceEEEEEecceEEEEEEEecccceeeccchh----hHhhhHHHHcCCcccC-c-------ceEEEE Confidence 111111111111111124444444544442222123333333221 1234445556666666 3 344443 Q ss_pred EeceecC Q lcl|NC_018846. 398 VDTAVKL 404 (404) Q Consensus 398 idtaa~~ 404 (404) -.|.+.- T Consensus 324 ~~tg~~~ 330 (335) T protein:vir:78 324 ELKGIEA 330 (335) T ss_pred EecCCCc Confidence 3343333 No 24 >protein:vir:93742 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240459;genbank:gi:66396126;genbank:GeneID:5133511 Probab=98.75 E-value=7e-10 Score=70.78 Aligned_cols=260 Identities=14% Similarity=0.078 Sum_probs=150.2 Q ss_pred CCcccchHHHHHHHHHHHHHhhcCchhH--HHHHhhhhhhhhhhcccccccCCCCCccEEEEeecCCCCCcEEEEEEeec Q lcl|NC_018846. 1 MTTVTSAQANKLYQVALFTAANRNRSMV--NILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHK 78 (404) Q Consensus 1 ~~~~~~~~a~~~~~~~lft~~~~n~~~~--~~~~~~l~~~~~k~s~~~~~~Gt~~~~~I~~~~dL~k~~Gd~v~f~L~~~ 78 (404) |-. ..| +....+ +.|+..+.....++..|. .+..+..+|+-++|++|+|+.... T Consensus 1 ma~-------------~~T---~~~~~iiPev~~~~v~~~~~~~~~~~--------~~~~~~~~l~g~~G~tv~ip~~~~ 56 (274) T protein:vir:93 1 MPQ-------------GIT---KTSNQIIPEVLAPMMQAQLEKKLRFA--------SFAEVDSTLQGQPGDTLTFPAFVY 56 (274) T ss_pred CCc-------------cce---ehhheechHHHHHHHHHHHHhhhhhc--------ccccccccccCCCCCEEEEEeecc Confidence 111 001 000111 356665543333332221 112334567778999999999876 Q ss_pred cccC-ceecCceeecchhhhhhceeEEEEeeccceeccCChhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhc Q lcl|NC_018846. 79 LSKR-PTMGDERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARG 157 (404) Q Consensus 79 L~G~-gV~Gd~~leGnee~L~~~s~~v~Idq~R~aV~~~g~m~~qrs~~dlrk~ar~~L~~w~~~~~D~~~~~~laG~~g 157 (404) +... -+..++.+ ..+.+...++++.|++...++.... .....+..|+..++.+++++.|++..|..++..+.++.. T Consensus 57 ~g~~~~~~eg~~i--~~~~it~~~~~~~i~~~~~~~~i~D-~~~~~~~~d~~~~~~~~~~~~~a~~~d~~~~~~~~~a~~ 133 (274) T protein:vir:93 57 SGDAQVVAEGEKI--PTDILETKKREAKIRKIAKGTSITD-EALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKL 133 (274) T ss_pred CCCcccccCCCcc--cccccccceeEEEeeeecccccccH-HHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhcccc Confidence 5322 12222222 3778999999999999888887765 355557789999999999999999999999876654321 Q ss_pred cccccccccccccccccccccccccCCCCCCcEEecCCccchhhhhhhccccHHHHHHHHHHHHHhCCCCccEEeecccc Q lcl|NC_018846. 158 DFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDEL 237 (404) Q Consensus 158 ~~~n~~~~~p~~~~~~~~~~~~N~v~apt~~r~~~~~~at~~~~i~~~D~~s~~~Id~~~~~a~~~a~pi~Pv~~~g~~~ 237 (404) + .+++.++.+.|..|..++.... T Consensus 134 ----------------------------~----------------~~~~~~~~d~i~dA~~~l~d~~------------- 156 (274) T protein:vir:93 134 ----------------------------T----------------VNADITKLNGLQSAIDKFNDED------------- 156 (274) T ss_pred ----------------------------c----------------ccccccCHHHHHHHHHHhhhcc------------- Confidence 0 0223445666666665554311 Q ss_pred cCccceEEEEecHHHHHHHhcCcchHHHHHHHHHHhhcccccCCcccccCceEEcCEEEEecCCceeeeccccceeeccc Q lcl|NC_018846. 238 HGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKVLVSEN 317 (404) Q Consensus 238 ~~~~~~yV~~l~p~q~~~Lr~d~~~~~w~~~q~~A~~~~rg~~nPlF~G~~gm~ngvii~~~~~~~irf~~~~~~~~~~~ 317 (404) . ...+++|||..+..|++|+... |. .. ....++.+..|.+|.|.|+.|..-+.+| T Consensus 157 --~-~~~~ivv~p~~~~~L~k~~~~~-f~---~~----s~~g~~~~~~G~ig~~~G~~Vi~s~~~p-------------- 211 (274) T protein:vir:93 157 --L-EPMVLFINPLDAGKLRGDASTN-FT---RA----TELGDDIIVKGAFGEALGAIIVRTNKLE-------------- 211 (274) T ss_pred --C-CccEEEeCHHHHHHHHhhhhhc-cc---cc----ccccccceeecccceecCeeEEEcCCCC-------------- Confidence 1 1368999999999999987421 22 11 1223578999999999999997765543 Q ss_pred ccccccccccccccchhheeecCceeEEEeecCCCCCceeeeccccccchHHHHHHHHhhhhhccccCCCCCceEEEEEE Q lcl|NC_018846. 318 NLTATTKEVAAATNIDRAMLLGAQALANAYGQKAGGHFNMVEKKTDMDNRTEIAISWINGLKKIRFPEKSGKMQDHGVIA 397 (404) Q Consensus 318 ~~~~~~~~~a~~~~v~ralllGaqAl~~A~g~~~g~r~~w~Ee~~D~g~~~~i~i~~i~G~~K~rF~~~~g~~~DfGvi~ 397 (404) ..-++|+|..|++++-.+. .. .|...|-..+ .+.|.|. +=||+-. T Consensus 212 --------------~~t~~l~~~gai~~~~~~~----~~-vE~~Rd~~~~----~d~i~~~------------~~y~~~~ 256 (274) T protein:vir:93 212 --------------AGTAILAKKGAVKLILKRD----FF-LEVARDASTK----TTALYSD------------KHYVAYL 256 (274) T ss_pred --------------cceEEEEeCCeEEEEecCC----cc-cccccchhhc----ccEEEEE------------EEEEEEE Confidence 1124788888888764332 12 3443332211 1111111 1233322 Q ss_pred Eec---------eecC Q lcl|NC_018846. 398 VDT---------AVKL 404 (404) Q Consensus 398 idt---------aa~~ 404 (404) ++- ++-| T Consensus 257 ~~~~~~v~~t~~~~s~ 272 (274) T protein:vir:93 257 YDESKAVKITKGSGSL 272 (274) T ss_pred EcCCceEEEeeCcccc Confidence 222 1222 No 25 >protein:vir:97433 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240749;genbank:gi:66396420;genbank:GeneID:5133789 Probab=98.74 E-value=2.4e-09 Score=67.88 Aligned_cols=260 Identities=14% Similarity=0.079 Sum_probs=152.4 Q ss_pred CCcccchHHHHHHHHHHHHH-hhcCchhHHHHHhhhhhhhhhhcccccccCCCCCccEEEEeecCCCCCcEEEEEEeecc Q lcl|NC_018846. 1 MTTVTSAQANKLYQVALFTA-ANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKL 79 (404) Q Consensus 1 ~~~~~~~~a~~~~~~~lft~-~~~n~~~~~~~~~~l~~~~~k~s~~~~~~Gt~~~~~I~~~~dL~k~~Gd~v~f~L~~~L 79 (404) |.. + .|. ...-.| +.|+..+.....++..|. .+..+..+|+-++|++|+|+....+ T Consensus 1 ma~------------~-~T~~~d~iiP--ev~~~~v~~~~~~~l~~~--------~~~~~d~~l~g~~G~tv~iP~~~~~ 57 (274) T protein:vir:97 1 MPQ------------G-LTKTSDQIIP--EVLAPMMQAQLEKKLRFA--------SFAEVDSTLQGQPGDTLTFPAFVYS 57 (274) T ss_pred CCc------------c-ceehhheech--HHHHHHHHHhhhhhhhhc--------ccceecccccCCCCCEEEEeeecCC Confidence 211 0 000 000111 456665543332222221 2233456777789999999988655 Q ss_pred ccCc--eecCceeecchhhhhhceeEEEEeeccceeccCChhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhc Q lcl|NC_018846. 80 SKRP--TMGDERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARG 157 (404) Q Consensus 80 ~G~g--V~Gd~~leGnee~L~~~s~~v~Idq~R~aV~~~g~m~~qrs~~dlrk~ar~~L~~w~~~~~D~~~~~~laG~~g 157 (404) |+. +..++.+ ..+.|...++++.|++...++.+.. .+...+.-|+..++.++++.+|++..|..++.+|.++.. T Consensus 58 -g~a~~~~~g~~i--~~~~lt~~~~~~~i~~~~~~~~i~D-~~~~~~~~dp~~~~~~~~a~a~a~~vd~~~~~~l~~a~~ 133 (274) T protein:vir:97 58 -GDAQVVAEGEKI--PTDILETKKREAKIRKIAKGTSITD-EALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKL 133 (274) T ss_pred -CccccccCCCcc--cccccccceeEEEeeeecceecccH-HHHHhccchHHHHHHHHHHHHHHHHHHHHHHHHHhccCc Confidence 332 2222222 3678899999999999888888776 355557789999999999999999999999877754221 Q ss_pred cccccccccccccccccccccccccCCCCCCcEEecCCccchhhhhhhccccHHHHHHHHHHHHHhCCCCccEEeecccc Q lcl|NC_018846. 158 DFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDEL 237 (404) Q Consensus 158 ~~~n~~~~~p~~~~~~~~~~~~N~v~apt~~r~~~~~~at~~~~i~~~D~~s~~~Id~~~~~a~~~a~pi~Pv~~~g~~~ 237 (404) . ..++.++.+.|..|...+.... T Consensus 134 -----------------------~---------------------~~~~~~~~d~i~dA~~~l~d~~------------- 156 (274) T protein:vir:97 134 -----------------------T---------------------VNADITKLNGLQSAIDKFNDED------------- 156 (274) T ss_pred -----------------------c---------------------ccccccCHHHHHHHHHHhhccC------------- Confidence 0 0123455666666665554321 Q ss_pred cCccceEEEEecHHHHHHHhcCcchHHHHHHHHHHhhcccccCCcccccCceEEcCEEEEecCCceeeeccccceeeccc Q lcl|NC_018846. 238 HGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKVLVSEN 317 (404) Q Consensus 238 ~~~~~~yV~~l~p~q~~~Lr~d~~~~~w~~~q~~A~~~~rg~~nPlF~G~~gm~ngvii~~~~~~~irf~~~~~~~~~~~ 317 (404) ...++++|||.++..|++|+.. +|..- ..+.++.+..|.+|.|.|+.|+.-..+| T Consensus 157 ---~~~~~ivv~p~~~~~L~k~~~~-~f~~~-------s~~g~~~~~~G~ig~~~G~~Vi~s~~~p-------------- 211 (274) T protein:vir:97 157 ---LEPMVLFVNPLDAGKLRGDAST-NFTRA-------TELGDDIIVKGAFGEALGAIIVRTNKLE-------------- 211 (274) T ss_pred ---CCceEEEeCHHHHHHHHhhhhh-hcccc-------CcccccceeccccceecCeeEEEcCCCC-------------- Confidence 1137899999999999999742 13211 1233578999999999999998766543 Q ss_pred ccccccccccccccchhheeecCceeEEEeecCCCCCceeeeccccccchHHHHHHHHhhhhhccccCCCCCceEEEEEE Q lcl|NC_018846. 318 NLTATTKEVAAATNIDRAMLLGAQALANAYGQKAGGHFNMVEKKTDMDNRTEIAISWINGLKKIRFPEKSGKMQDHGVIA 397 (404) Q Consensus 318 ~~~~~~~~~a~~~~v~ralllGaqAl~~A~g~~~g~r~~w~Ee~~D~g~~~~i~i~~i~G~~K~rF~~~~g~~~DfGvi~ 397 (404) ...++|+|..|+++.-++. .. .|...|-..+ .+.|.+. +=|||-+ T Consensus 212 --------------~~t~~l~~~gA~~~~~~~~----~~-vE~~Rd~~~~----~d~i~~~------------~~y~~~~ 256 (274) T protein:vir:97 212 --------------AGTAILAKKGAVKLILKRD----FF-LEVARDASTK----TTALYSD------------KHYVAYL 256 (274) T ss_pred --------------cceEEEEeCcceEeeecCC----ce-eccccchhhc----ccEEEEE------------EEEEEEE Confidence 1124788888888653321 12 4444443221 1122111 1234433 Q ss_pred Eec---------eecC Q lcl|NC_018846. 398 VDT---------AVKL 404 (404) Q Consensus 398 idt---------aa~~ 404 (404) ++. .+-+ T Consensus 257 ~~~~~vv~~t~~~~~~ 272 (274) T protein:vir:97 257 YDESKAVKITKGSGSL 272 (274) T ss_pred EcCCceEEEecCcccc Confidence 332 2222 No 26 >protein:vir:94494 Length: 274 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240676;genbank:gi:66396348;genbank:GeneID:5133758 Probab=98.74 E-value=2.4e-09 Score=67.88 Aligned_cols=260 Identities=14% Similarity=0.079 Sum_probs=152.4 Q ss_pred CCcccchHHHHHHHHHHHHH-hhcCchhHHHHHhhhhhhhhhhcccccccCCCCCccEEEEeecCCCCCcEEEEEEeecc Q lcl|NC_018846. 1 MTTVTSAQANKLYQVALFTA-ANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKL 79 (404) Q Consensus 1 ~~~~~~~~a~~~~~~~lft~-~~~n~~~~~~~~~~l~~~~~k~s~~~~~~Gt~~~~~I~~~~dL~k~~Gd~v~f~L~~~L 79 (404) |.. + .|. ...-.| +.|+..+.....++..|. .+..+..+|+-++|++|+|+....+ T Consensus 1 ma~------------~-~T~~~d~iiP--ev~~~~v~~~~~~~l~~~--------~~~~~d~~l~g~~G~tv~iP~~~~~ 57 (274) T protein:vir:94 1 MPQ------------G-LTKTSDQIIP--EVLAPMMQAQLEKKLRFA--------SFAEVDSTLQGQPGDTLTFPAFVYS 57 (274) T ss_pred CCc------------c-ceehhheech--HHHHHHHHHhhhhhhhhc--------ccceecccccCCCCCEEEEeeecCC Confidence 211 0 000 000111 456665543332222221 2233456777789999999988655 Q ss_pred ccCc--eecCceeecchhhhhhceeEEEEeeccceeccCChhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhc Q lcl|NC_018846. 80 SKRP--TMGDERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARG 157 (404) Q Consensus 80 ~G~g--V~Gd~~leGnee~L~~~s~~v~Idq~R~aV~~~g~m~~qrs~~dlrk~ar~~L~~w~~~~~D~~~~~~laG~~g 157 (404) |+. +..++.+ ..+.|...++++.|++...++.+.. .+...+.-|+..++.++++.+|++..|..++.+|.++.. T Consensus 58 -g~a~~~~~g~~i--~~~~lt~~~~~~~i~~~~~~~~i~D-~~~~~~~~dp~~~~~~~~a~a~a~~vd~~~~~~l~~a~~ 133 (274) T protein:vir:94 58 -GDAQVVAEGEKI--PTDILETKKREAKIRKIAKGTSITD-EALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKL 133 (274) T ss_pred -CccccccCCCcc--cccccccceeEEEeeeecceecccH-HHHHhccchHHHHHHHHHHHHHHHHHHHHHHHHHhccCc Confidence 332 2222222 3678899999999999888888776 355557789999999999999999999999877754221 Q ss_pred cccccccccccccccccccccccccCCCCCCcEEecCCccchhhhhhhccccHHHHHHHHHHHHHhCCCCccEEeecccc Q lcl|NC_018846. 158 DFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDEL 237 (404) Q Consensus 158 ~~~n~~~~~p~~~~~~~~~~~~N~v~apt~~r~~~~~~at~~~~i~~~D~~s~~~Id~~~~~a~~~a~pi~Pv~~~g~~~ 237 (404) . ..++.++.+.|..|...+.... T Consensus 134 -----------------------~---------------------~~~~~~~~d~i~dA~~~l~d~~------------- 156 (274) T protein:vir:94 134 -----------------------T---------------------VNADITKLNGLQSAIDKFNDED------------- 156 (274) T ss_pred -----------------------c---------------------ccccccCHHHHHHHHHHhhccC------------- Confidence 0 0123455666666665554321 Q ss_pred cCccceEEEEecHHHHHHHhcCcchHHHHHHHHHHhhcccccCCcccccCceEEcCEEEEecCCceeeeccccceeeccc Q lcl|NC_018846. 238 HGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKVLVSEN 317 (404) Q Consensus 238 ~~~~~~yV~~l~p~q~~~Lr~d~~~~~w~~~q~~A~~~~rg~~nPlF~G~~gm~ngvii~~~~~~~irf~~~~~~~~~~~ 317 (404) ...++++|||.++..|++|+.. +|..- ..+.++.+..|.+|.|.|+.|+.-..+| T Consensus 157 ---~~~~~ivv~p~~~~~L~k~~~~-~f~~~-------s~~g~~~~~~G~ig~~~G~~Vi~s~~~p-------------- 211 (274) T protein:vir:94 157 ---LEPMVLFVNPLDAGKLRGDAST-NFTRA-------TELGDDIIVKGAFGEALGAIIVRTNKLE-------------- 211 (274) T ss_pred ---CCceEEEeCHHHHHHHHhhhhh-hcccc-------CcccccceeccccceecCeeEEEcCCCC-------------- Confidence 1137899999999999999742 13211 1233578999999999999998766543 Q ss_pred ccccccccccccccchhheeecCceeEEEeecCCCCCceeeeccccccchHHHHHHHHhhhhhccccCCCCCceEEEEEE Q lcl|NC_018846. 318 NLTATTKEVAAATNIDRAMLLGAQALANAYGQKAGGHFNMVEKKTDMDNRTEIAISWINGLKKIRFPEKSGKMQDHGVIA 397 (404) Q Consensus 318 ~~~~~~~~~a~~~~v~ralllGaqAl~~A~g~~~g~r~~w~Ee~~D~g~~~~i~i~~i~G~~K~rF~~~~g~~~DfGvi~ 397 (404) ...++|+|..|+++.-++. .. .|...|-..+ .+.|.+. +=|||-+ T Consensus 212 --------------~~t~~l~~~gA~~~~~~~~----~~-vE~~Rd~~~~----~d~i~~~------------~~y~~~~ 256 (274) T protein:vir:94 212 --------------AGTAILAKKGAVKLILKRD----FF-LEVARDASTK----TTALYSD------------KHYVAYL 256 (274) T ss_pred --------------cceEEEEeCcceEeeecCC----ce-eccccchhhc----ccEEEEE------------EEEEEEE Confidence 1124788888888653321 12 4444443221 1122111 1234433 Q ss_pred Eec---------eecC Q lcl|NC_018846. 398 VDT---------AVKL 404 (404) Q Consensus 398 idt---------aa~~ 404 (404) ++. .+-+ T Consensus 257 ~~~~~vv~~t~~~~~~ 272 (274) T protein:vir:94 257 YDESKAVKITKGSGSL 272 (274) T ss_pred EcCCceEEEecCcccc Confidence 332 2222 No 27 >protein:vir:95107 Length: 270 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240822;genbank:gi:66394683;genbank:GeneID:5133901 Probab=98.73 E-value=9.4e-10 Score=70.08 Aligned_cols=255 Identities=16% Similarity=0.102 Sum_probs=147.3 Q ss_pred CCcccchHHHHHHHHHHHHHhhcCchhHHHHHhhhhhhhhhhcccccccCCCCCccEEEEeecCCCCCcEEEEEEeeccc Q lcl|NC_018846. 1 MTTVTSAQANKLYQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLS 80 (404) Q Consensus 1 ~~~~~~~~a~~~~~~~lft~~~~n~~~~~~~~~~l~~~~~k~s~~~~~~Gt~~~~~I~~~~dL~k~~Gd~v~f~L~~~L~ 80 (404) |+. .-+.. .-.| +.|+..+.....++..|... ....++|.-.+|+.|+|+... +. T Consensus 1 Ma~------------T~~~d--~I~P--ev~~~~V~e~~~~~~~~~~~--------~~~d~~L~g~~G~ti~~P~~~-~i 55 (270) T protein:vir:95 1 MTQ------------TKKAN--LINP--EVLANVVSAQMQNAIRFTPY--------AVTDDTLVGQPGDTITRPKYA-YI 55 (270) T ss_pred CCc------------eehhh--hcch--HHHHHHHHHHHHhHHhhccc--------cccccccCCCCCCEEEeeeec-CC Confidence 221 11100 0122 24444443333333333211 122467888899999999986 65 Q ss_pred cCc--eecCceeecchhhhhhceeEEEEeeccceeccCChhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcc Q lcl|NC_018846. 81 KRP--TMGDERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGD 158 (404) Q Consensus 81 G~g--V~Gd~~leGnee~L~~~s~~v~Idq~R~aV~~~g~m~~qrs~~dlrk~ar~~L~~w~~~~~D~~~~~~laG~~g~ 158 (404) |+. +..++.++ .+.|+..++..+|-+...++.... .+..-+.-|...++.++++.+|++..|..++-.|.|+... T Consensus 56 gdae~~~eg~~i~--~~~lt~~~~~a~i~~~gk~~~itD-~a~~~~~~dp~~~~~~q~a~~~a~~~d~~li~~l~~a~~~ 132 (270) T protein:vir:95 56 GAAEDLQEGVAMD--TTQMSMTTTKVTVKETGKAVEVTQ-TAIITNVNGTLQEASRQLAMSLADKVEIDYIAELNKSKQT 132 (270) T ss_pred CccccccCCCccc--hhhcccchheeeeehhhCcceecH-HHHhhhccchHHHHHHHHHHHHHHHHHHHHHHHhcccccc Confidence 542 22333333 679999999999999988888765 3444445699999999999999999999998887665421 Q ss_pred ccccccccccccccccccccccccCCCCCCcEEecCCccchhhhhhhccccHHHHHHHHHHHHHhCCCCccEEeeccccc Q lcl|NC_018846. 159 FVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELH 238 (404) Q Consensus 159 ~~n~~~~~p~~~~~~~~~~~~N~v~apt~~r~~~~~~at~~~~i~~~D~~s~~~Id~~~~~a~~~a~pi~Pv~~~g~~~~ 238 (404) .+..++.+.|..+..+. |++ T Consensus 133 ---------------------------------------------~~~~~t~~~~~dA~~~l-------------gd~-- 152 (270) T protein:vir:95 133 ---------------------------------------------ATVSADATGILDAIEVF-------------NSE-- 152 (270) T ss_pred ---------------------------------------------cccccCHHHHHHHHHHh-------------ccc-- Confidence 01112333333333332 222 Q ss_pred CccceEEEEecHHHHHHHhcCcchHHHHHHHHHHhhcccccCCcccccCceEEcCEEEE-ecCCceeeeccccceeeccc Q lcl|NC_018846. 239 GEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVR-KYAGMPIRFYQGSKVLVSEN 317 (404) Q Consensus 239 ~~~~~yV~~l~p~q~~~Lr~d~~~~~w~~~q~~A~~~~rg~~nPlF~G~~gm~ngvii~-~~~~~~irf~~~~~~~~~~~ 317 (404) ++...+++|||.++..||++. |.+ ..++..+.+.+|.+|.+.|+.++ .-. ++ T Consensus 153 -~~~~~~i~vhs~~~~~Lrk~~----~~~-------~~~~~~~~~~~G~ig~~~G~~Viv~s~-~~-------------- 205 (270) T protein:vir:95 153 -NDEDYVLYVNPKDYNKLVKSL----FKV-------GGNVQDRAISKGDLVEIVGVSDIVKSK-RV-------------- 205 (270) T ss_pred -cCCCcEEEEcHHHHHHHHhhh----ccc-------ccccccchhcccccceecceeEEEeCC-CC-------------- Confidence 122478999999999999985 222 12445688999999999998542 211 00 Q ss_pred ccccccccccccccchhheeecCceeEEEeecCCCCCceeeeccccccchHHHHHHHHhhhhhccccCCCCCceEEEEEE Q lcl|NC_018846. 318 NLTATTKEVAAATNIDRAMLLGAQALANAYGQKAGGHFNMVEKKTDMDNRTEIAISWINGLKKIRFPEKSGKMQDHGVIA 397 (404) Q Consensus 318 ~~~~~~~~~a~~~~v~ralllGaqAl~~A~g~~~g~r~~w~Ee~~D~g~~~~i~i~~i~G~~K~rF~~~~g~~~DfGvi~ 397 (404) .. ..++|.+..|+++.-.+ + +. +|...|-..+. +. +.. -+-|+|-. T Consensus 206 ------~~-------~~~~l~~~gAi~~~~~~--~--~~-vEtdRd~~~~~----d~--------i~~----~~~y~v~~ 251 (270) T protein:vir:95 206 ------SE-------NTAFLQRYGAMEIVNKK--K--PE-AYTDFDILKRT----HL--------LST----NYHYSVNL 251 (270) T ss_pred ------Cc-------eeEEEEeccceeeeecC--C--ce-eeeccchhhcc----cE--------EEe----eeEEEEEE Confidence 01 13578888777765433 2 12 45443332211 11 111 13566666 Q ss_pred Eece--ecC Q lcl|NC_018846. 398 VDTA--VKL 404 (404) Q Consensus 398 idta--a~~ 404 (404) ++.. ++| T Consensus 252 ~~~skvv~~ 260 (270) T protein:vir:95 252 KDETGVVKV 260 (270) T ss_pred EccceEEEE Confidence 6644 444 No 28 >protein:vir:105334 Length: 276 # NCBI annotation: putative phage major capsid protein # Family: family:all:522 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950669;genbank:gi:119967839;genbank:GeneID:4643213 Probab=98.72 E-value=5.3e-10 Score=71.45 Aligned_cols=260 Identities=14% Similarity=0.079 Sum_probs=153.3 Q ss_pred CCc-ccchHHHHHHHHHHHHHhhcCchhHHHHHhhhhhhhhhhcccccccCCCCCccEEEEeecCCCCCcEEEEEEeecc Q lcl|NC_018846. 1 MTT-VTSAQANKLYQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKL 79 (404) Q Consensus 1 ~~~-~~~~~a~~~~~~~lft~~~~n~~~~~~~~~~l~~~~~k~s~~~~~~Gt~~~~~I~~~~dL~k~~Gd~v~f~L~~~L 79 (404) |.. .|.-..+ -.| +.|+..+.....++..|. ......++|+-++|++|+|+....+ T Consensus 1 Ma~~~T~l~d~-------------i~P--ev~~~~v~~~~~~~~~~~--------~~~~~~~~l~g~~G~ti~iP~~~~i 57 (276) T protein:vir:10 1 MAQGTTTKSTQ-------------IVP--EVLAPMMQAELDKKLRFA--------QFADIDSTLVGQPGDTLTFPAFVYS 57 (276) T ss_pred CCcceeehhhh-------------hch--HHHHHHHHHHHHhhhhhc--------ccceecccccCCCCCEEEeeeecCC Confidence 221 0100000 011 356665555444444441 1223356788889999999998777 Q ss_pred ccCc--eecCceeecchhhhhhceeEEEEeeccceeccCChhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhc Q lcl|NC_018846. 80 SKRP--TMGDERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARG 157 (404) Q Consensus 80 ~G~g--V~Gd~~leGnee~L~~~s~~v~Idq~R~aV~~~g~m~~qrs~~dlrk~ar~~L~~w~~~~~D~~~~~~laG~~g 157 (404) |+. +..++.+ ..+.|...++++.|.+...++.+.. .+...+.-|+..++-+.++.+|++..|..++-.|.+... T Consensus 58 -gda~~~~eg~~i--~~~~lt~~~~~a~i~~~~k~~~~tD-~a~~~~~~dp~~~~~~~~~~~~a~~~d~~~~~~l~~~~~ 133 (276) T protein:vir:10 58 -GDATVVPEGQKI--PVDKIETNRREAKIHKIGKGTDITD-EALLSGYGDPQGEAVRQHGLAIANKVDNDVLEALRGTKL 133 (276) T ss_pred -CccccccCCCcc--CccccccceeeEEeehccccccccH-HHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhcccc Confidence 432 2222222 3778999999999999888888764 566667789999999999999999999999866644221 Q ss_pred cccccccccccccccccccccccccCCCCCCcEEecCCccchhhhhhhccccHHHHHHHHHHHHHhCCCCccEEeecccc Q lcl|NC_018846. 158 DFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDEL 237 (404) Q Consensus 158 ~~~n~~~~~p~~~~~~~~~~~~N~v~apt~~r~~~~~~at~~~~i~~~D~~s~~~Id~~~~~a~~~a~pi~Pv~~~g~~~ 237 (404) . .+++.++.+.|..+...+.... T Consensus 134 ~--------------------------------------------~~~~~~t~d~i~~A~~~lgd~~------------- 156 (276) T protein:vir:10 134 T--------------------------------------------VSADIGTLAGLEAAIDTFDDED------------- 156 (276) T ss_pred c--------------------------------------------ccccccCHHHHHHHHHHhcccc------------- Confidence 0 0233456666766665543211 Q ss_pred cCccceEEEEecHHHHHHHhcCcchHHHHHHHHHHhhcccccCCcccccCceEEcCEEEEecCCceeeeccccceeeccc Q lcl|NC_018846. 238 HGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKVLVSEN 317 (404) Q Consensus 238 ~~~~~~yV~~l~p~q~~~Lr~d~~~~~w~~~q~~A~~~~rg~~nPlF~G~~gm~ngvii~~~~~~~irf~~~~~~~~~~~ 317 (404) ...++++|||.++..|+++... +|.+- ..+..+.+..|.+|.+.|+.|..-..+| T Consensus 157 ---~~~~~ivv~p~~~~~L~k~~~~-~f~~~-------s~~g~~~~~~G~ig~~~G~~Vi~s~~~p-------------- 211 (276) T protein:vir:10 157 ---LEPMVLFINPKDAGKLRSSASD-NFTRA-------TELGDNIIVKGAFGEALGAVIVRSKKLD-------------- 211 (276) T ss_pred ---CcccEEEEcHHHHHHHHHhccc-ccccc-------ccccccceeccccceecceeEEEcCCCC-------------- Confidence 1237899999999999986431 13211 2334678999999999999887655432 Q ss_pred ccccccccccccccchhheeecCceeEEEeecCCCCCceeeeccccccchHHHHHHHHhhhhhccccCCCCCceEEEEEE Q lcl|NC_018846. 318 NLTATTKEVAAATNIDRAMLLGAQALANAYGQKAGGHFNMVEKKTDMDNRTEIAISWINGLKKIRFPEKSGKMQDHGVIA 397 (404) Q Consensus 318 ~~~~~~~~~a~~~~v~ralllGaqAl~~A~g~~~g~r~~w~Ee~~D~g~~~~i~i~~i~G~~K~rF~~~~g~~~DfGvi~ 397 (404) ...++|+|..|+++.-++ +. . +|...|-..+ .+.|.+. +=|||.. T Consensus 212 --------------~~t~~l~~~gAi~~~~~~--~~--~-vE~dRd~~~~----~d~i~~~------------~~y~~~~ 256 (276) T protein:vir:10 212 --------------EGEAILAKRGAVKLITKR--DF--F-LETDRDPSTK----TTALYSD------------KHYVAYL 256 (276) T ss_pred --------------cceEEEEeccceeeeecC--Cc--e-eecccchhhc----ccEEEEe------------eEEEEEE Confidence 123478888888765432 11 1 4443333222 1222111 1233333 Q ss_pred Eece--ecC Q lcl|NC_018846. 398 VDTA--VKL 404 (404) Q Consensus 398 idta--a~~ 404 (404) ++-. +++ T Consensus 257 ~~~~~vv~~ 265 (276) T protein:vir:10 257 YDESKAVKV 265 (276) T ss_pred EcCcceEEE Confidence 3221 111 No 29 >protein:vir:94622 Length: 341 # NCBI annotation: PfWMP4_37 # Family: family:all:2203 # MgeID: mge:1525 # MgeName: Pf-WMP4 # Cross-refs: genbank:acc:YP_762667;genbank:gi:115304375;genbank:GeneID:5142322 Probab=98.71 E-value=2.1e-08 Score=62.63 Aligned_cols=309 Identities=14% Similarity=0.108 Sum_probs=154.1 Q ss_pred CCcccchHHHHHHHHHHHHHhhcCchhH-----HHHHhhhhhhhhhhcccccccCCCCCccEEEEeecCCCCCcEEEEEE Q lcl|NC_018846. 1 MTTVTSAQANKLYQVALFTAANRNRSMV-----NILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSI 75 (404) Q Consensus 1 ~~~~~~~~a~~~~~~~lft~~~~n~~~~-----~~~~~~l~~~~~k~s~~~~~~Gt~~~~~I~~~~dL~k~~Gd~v~f~L 75 (404) -.|.|.| ..+++.+ ++|++.+...-.++.-|.. ..+..+.+-.+||+|+|+. T Consensus 4 ~~~~~~~--------------~~~t~~v~~fipei~s~~i~~~l~~~~v~~~---------~~~d~~~~~~~Gdtv~ip~ 60 (341) T protein:vir:94 4 GNTITGP--------------SINTQRGQQFIPEQWLSEVQMFRKAKMLDTS---------VVKTWGAQVKKGDTFHVPR 60 (341) T ss_pred hhhhccc--------------cccchhHHHHHHHHHHHHHHHHHHhhcchhh---------ccccccccccCCceEEEec Confidence 1122221 1122222 5566655433333222211 1122223334599999987 Q ss_pred eeccccCceecCceeecchhhhhhceeEEEEeecc-ceeccCChhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_018846. 76 MHKLSKRPTMGDERVEGRGEDLSHADFSLKINQGR-HLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAG 154 (404) Q Consensus 76 ~~~L~G~gV~Gd~~leGnee~L~~~s~~v~Idq~R-~aV~~~g~m~~qrs~~dlrk~ar~~L~~w~~~~~D~~~~~~laG 154 (404) ....+-.....+..+ ..+++.-.+.+|.||+.+ .++.+. .+++..+..|+|.+........+++..|+.++-.+++ T Consensus 61 ~g~~~~~d~~~~~~i--~~~~~~~~~~~itiD~~~~~~~~i~-d~d~~~~~~d~~~~~~~~~~~aLA~~~D~~i~~~~a~ 137 (341) T protein:vir:94 61 ISELGVEDKATDVPV--GVQPVNDTDFVITVDTDRTTAVALD-DLLEIQASYDLRAPYLEAMGYALAKDMTGSILGLRAA 137 (341) T ss_pred cCcceeeeecCCCcc--ccccccCceEEEEEeeeeecceeec-hHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHhhh Confidence 655443333333333 356777889999999986 455554 5788888999999999999999999999998877665 Q ss_pred hhccccccccccccccccccccccccccCCCCCCcEEecCCccchhhhhhhccccHHHHHHHHHHHHHhCCCCccEEeec Q lcl|NC_018846. 155 ARGDFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSG 234 (404) Q Consensus 155 ~~g~~~n~~~~~p~~~~~~~~~~~~N~v~apt~~r~~~~~~at~~~~i~~~D~~s~~~Id~~~~~a~~~a~pi~Pv~~~g 234 (404) +.+.. . .+.+..|+. ......+.++.+.|..++..+++..-|- +| T Consensus 138 ~~~~~--------~----------~~~~~~~~~------------~~t~~~~~~~~~~i~~a~~~Lde~~VP~-----~g 182 (341) T protein:vir:94 138 VQNTA--------S----------QNVFSSSNG------------AITGNGQAFSFAVFLAARRLLLEADVPE-----EK 182 (341) T ss_pred ccccc--------c----------CccccCccc------------cccCchhhhhHHHHHHHHHHHhhcCCCc-----cC Confidence 43200 0 011111110 0111334466677777888888765551 11 Q ss_pred ccccCccceEEEEecHHHHHHHhcCcchHHHHHHHHHHhhcccccCCcccccCceEEcCEEEEecCCceeeeccccceee Q lcl|NC_018846. 235 DELHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKVLV 314 (404) Q Consensus 235 ~~~~~~~~~yV~~l~p~q~~~Lr~d~~~~~w~~~q~~A~~~~rg~~nPlF~G~~gm~ngvii~~~~~~~irf~~~~~~~~ 314 (404) ++++++|.++..|++|+.+ ..... +.++.|-+|.+|.|.|+.|++.+++|. ..+..... T Consensus 183 ---------R~lvv~P~~~~~Ll~~~~~---~~~~~-------~g~~~l~~G~ig~i~G~~V~~Sn~lp~--~~~~~~~~ 241 (341) T protein:vir:94 183 ---------IVLLISPGQESALFTIPQF---ISKDF-------INNAPIAQGQIGSLMGVRVIRTSLIGN--NSATGWRN 241 (341) T ss_pred ---------CEEEeCHHHHHHHhhchhh---hhhhc-------cccchhheeeeeeEeceEEEEeccccc--cccccccc Confidence 5678999999999999874 22211 224578899999999999999888762 21111100 Q ss_pred cccccccccccccccccchhheeecCce------eEEEeec-C----CCCCcee--------eeccccccchH---HHHH Q lcl|NC_018846. 315 SENNLTATTKEVAAATNIDRAMLLGAQA------LANAYGQ-K----AGGHFNM--------VEKKTDMDNRT---EIAI 372 (404) Q Consensus 315 ~~~~~~~~~~~~a~~~~v~ralllGaqA------l~~A~g~-~----~g~r~~w--------~Ee~~D~g~~~---~i~i 372 (404) .. ..+.....+. .+.-..-+|.+. .+++|-+ + -..++.| ..-..+|.-+. .|-. T Consensus 242 ~~--~~~~~~~~~~--~i~~~~~~~~~~~~~~~~~gl~~~~~av~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~ 317 (341) T protein:vir:94 242 GA--PTIAPAEATP--GFTGSRYLPKQDSFTSLPATFTGNSRPVHTAVMCHMDWAAAVVSKAPRVTQSFENREQVWLMVG 317 (341) T ss_pred cc--cceecccccc--cccccccccccccccccEEEEEEecccccceeeecchhhhccccccccccccchhhhhhhhhhh Confidence 00 0000000000 000001111110 0111100 0 0001111 00011222111 2224 Q ss_pred HHHhhhhhccccCCCCCceEEEEEEEeceecC Q lcl|NC_018846. 373 SWINGLKKIRFPEKSGKMQDHGVIAVDTAVKL 404 (404) Q Consensus 373 ~~i~G~~K~rF~~~~g~~~DfGvi~idtaa~~ 404 (404) ...+|.+=+| | +.. +-|-+.+.- T Consensus 318 ~~~~G~~~lr-p-------~~~-v~~~~~~~~ 340 (341) T protein:vir:94 318 RQAYGARLYR-P-------LHA-VNIHTTGDT 340 (341) T ss_pred hhhhcccccC-c-------cee-EEEecCcCC Confidence 4455655555 3 333 333333333 No 30 >protein:vir:95898 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240385;genbank:gi:66396054;genbank:GeneID:5133409 Probab=98.71 E-value=1e-09 Score=69.82 Aligned_cols=262 Identities=14% Similarity=0.064 Sum_probs=151.2 Q ss_pred CCcccchHHHHHHHHHHHHHhhcCchhHHHHHhhhhhhhhhhcccccccCCCCCccEEEEeecCCCCCcEEEEEEeeccc Q lcl|NC_018846. 1 MTTVTSAQANKLYQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLS 80 (404) Q Consensus 1 ~~~~~~~~a~~~~~~~lft~~~~n~~~~~~~~~~l~~~~~k~s~~~~~~Gt~~~~~I~~~~dL~k~~Gd~v~f~L~~~L~ 80 (404) |..-..- ....=.| +.|+..+.....++..|. .+...-++|+-++|++|+|+....++ T Consensus 1 m~~~~T~------------l~d~i~P--ev~~~~v~~~~~~~l~~~--------~~~~~~~~l~g~~G~tv~iP~~~~ig 58 (274) T protein:vir:95 1 MAQGMTK------------LTNQIVP--EVLAPMMQAELEKKLRFA--------SFAEIDNTLVGQPGDTLTFPAFIYSG 58 (274) T ss_pred CCcceee------------hhheech--HHHHHHHHHHHHhhhhcc--------ccceecccccCCCCCEEEeeeecCCC Confidence 2111000 0000011 456666544433333331 12233467887899999999987653 Q ss_pred cCc-eecCceeecchhhhhhceeEEEEeeccceeccCChhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccc Q lcl|NC_018846. 81 KRP-TMGDERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGDF 159 (404) Q Consensus 81 G~g-V~Gd~~leGnee~L~~~s~~v~Idq~R~aV~~~g~m~~qrs~~dlrk~ar~~L~~w~~~~~D~~~~~~laG~~g~~ 159 (404) ..- +..++.+ ..+.|...++++.|++..+++.... .+...+.-|+..++.++++.+|++..|..++..+.++.. T Consensus 59 ~a~~~~~g~~i--~~~~lt~~~~~~~i~~~~~a~~i~D-~~~~~~~~d~~~~~~~~~~~~~a~~vd~~i~~~l~~a~~-- 133 (274) T protein:vir:95 59 DAKVVAEGEKI--PTDILETKKREAKIRKIAKGTSISD-EALLSGYGDPQGEQVRQHGLAHANKVDDDVLEALKSAKL-- 133 (274) T ss_pred ccccccCCCcc--chhhcccceeEEEeeeeecceeehH-HHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHhcccc-- Confidence 222 2222222 2678999999999999888888765 456666789999999999999999999999866654321 Q ss_pred cccccccccccccccccccccccCCCCCCcEEecCCccchhhhhhhccccHHHHHHHHHHHHHhCCCCccEEeecccccC Q lcl|NC_018846. 160 VADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHG 239 (404) Q Consensus 160 ~n~~~~~p~~~~~~~~~~~~N~v~apt~~r~~~~~~at~~~~i~~~D~~s~~~Id~~~~~a~~~a~pi~Pv~~~g~~~~~ 239 (404) . .+++.++.+.|..|..++.... T Consensus 134 ---------------------~---------------------~~~~~~~~d~i~~A~~~lgd~~--------------- 156 (274) T protein:vir:95 134 ---------------------T---------------------VEADITKLTGLQTAIDKFNDED--------------- 156 (274) T ss_pred ---------------------c---------------------ccccccCHHHHHHHHHHhcccc--------------- Confidence 0 0123345666666665553221 Q ss_pred ccceEEEEecHHHHHHHhcCcchHHHHHHHHHHhhcccccCCcccccCceEEcCEEEEecCCceeeeccccceeeccccc Q lcl|NC_018846. 240 EDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKVLVSENNL 319 (404) Q Consensus 240 ~~~~yV~~l~p~q~~~Lr~d~~~~~w~~~q~~A~~~~rg~~nPlF~G~~gm~ngvii~~~~~~~irf~~~~~~~~~~~~~ 319 (404) ..-++++|||.++..|++|+.. +|. . ...+..+.+..|.+|.|.|+.|..-..+| T Consensus 157 -~~~~~ivv~p~~~~~L~k~~~~-~f~---~----~s~~g~~~~~~G~ig~~~G~~Vi~s~~~~---------------- 211 (274) T protein:vir:95 157 -LEPMVLFISPLDAGKLRGDATT-NFT---R----ATELGDDVIVKGAFGEALGAVIVRSNKLE---------------- 211 (274) T ss_pred -ccccEEEeCHHHHHHHHhhccc-ccc---c----cccccccceeccccceecCeEEEEeCCCC---------------- Confidence 1136899999999999999742 122 1 12334688999999999999886533321 Q ss_pred ccccccccccccchhheeecCceeEEEeecCCCCCceeeeccccccchHHHHHHHHhhhhhccccCCCCCceEEEEEEEe Q lcl|NC_018846. 320 TATTKEVAAATNIDRAMLLGAQALANAYGQKAGGHFNMVEKKTDMDNRTEIAISWINGLKKIRFPEKSGKMQDHGVIAVD 399 (404) Q Consensus 320 ~~~~~~~a~~~~v~ralllGaqAl~~A~g~~~g~r~~w~Ee~~D~g~~~~i~i~~i~G~~K~rF~~~~g~~~DfGvi~id 399 (404) ...++|+|-.|++..-.+ + .. +|-..|-..+ .+.|.+- +=||+-+++ T Consensus 212 ------------~~t~~l~~~gA~~~~~~~--~--~~-vE~~Rd~~~~----~d~i~~~------------~~y~~~~~~ 258 (274) T protein:vir:95 212 ------------AGTAILAKKGAVKLITKR--D--FF-LETDRDPSTK----TTALYSD------------KHYVAYLYD 258 (274) T ss_pred ------------CceEEEEeccceeeeecC--C--cc-cccccccccc----cCEEEEe------------EEEEEEEEc Confidence 113478888777764222 1 12 4443333221 1111111 224444433 Q ss_pred c--eecC Q lcl|NC_018846. 400 T--AVKL 404 (404) Q Consensus 400 t--aa~~ 404 (404) - .++| T Consensus 259 ~~~~v~~ 265 (274) T protein:vir:95 259 ESKAVKI 265 (274) T ss_pred CCcEEEE Confidence 2 2333 No 31 >protein:vir:96262 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240311;genbank:gi:66395978;genbank:GeneID:5133339 Probab=98.71 E-value=1e-09 Score=69.82 Aligned_cols=262 Identities=14% Similarity=0.064 Sum_probs=151.2 Q ss_pred CCcccchHHHHHHHHHHHHHhhcCchhHHHHHhhhhhhhhhhcccccccCCCCCccEEEEeecCCCCCcEEEEEEeeccc Q lcl|NC_018846. 1 MTTVTSAQANKLYQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLS 80 (404) Q Consensus 1 ~~~~~~~~a~~~~~~~lft~~~~n~~~~~~~~~~l~~~~~k~s~~~~~~Gt~~~~~I~~~~dL~k~~Gd~v~f~L~~~L~ 80 (404) |..-..- ....=.| +.|+..+.....++..|. .+...-++|+-++|++|+|+....++ T Consensus 1 m~~~~T~------------l~d~i~P--ev~~~~v~~~~~~~l~~~--------~~~~~~~~l~g~~G~tv~iP~~~~ig 58 (274) T protein:vir:96 1 MAQGMTK------------LTNQIVP--EVLAPMMQAELEKKLRFA--------SFAEIDNTLVGQPGDTLTFPAFIYSG 58 (274) T ss_pred CCcceee------------hhheech--HHHHHHHHHHHHhhhhcc--------ccceecccccCCCCCEEEeeeecCCC Confidence 2111000 0000011 456666544433333331 12233467887899999999987653 Q ss_pred cCc-eecCceeecchhhhhhceeEEEEeeccceeccCChhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccc Q lcl|NC_018846. 81 KRP-TMGDERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGDF 159 (404) Q Consensus 81 G~g-V~Gd~~leGnee~L~~~s~~v~Idq~R~aV~~~g~m~~qrs~~dlrk~ar~~L~~w~~~~~D~~~~~~laG~~g~~ 159 (404) ..- +..++.+ ..+.|...++++.|++..+++.... .+...+.-|+..++.++++.+|++..|..++..+.++.. T Consensus 59 ~a~~~~~g~~i--~~~~lt~~~~~~~i~~~~~a~~i~D-~~~~~~~~d~~~~~~~~~~~~~a~~vd~~i~~~l~~a~~-- 133 (274) T protein:vir:96 59 DAKVVAEGEKI--PTDILETKKREAKIRKIAKGTSISD-EALLSGYGDPQGEQVRQHGLAHANKVDDDVLEALKSAKL-- 133 (274) T ss_pred ccccccCCCcc--chhhcccceeEEEeeeeecceeehH-HHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHhcccc-- Confidence 222 2222222 2678999999999999888888765 456666789999999999999999999999866654321 Q ss_pred cccccccccccccccccccccccCCCCCCcEEecCCccchhhhhhhccccHHHHHHHHHHHHHhCCCCccEEeecccccC Q lcl|NC_018846. 160 VADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHG 239 (404) Q Consensus 160 ~n~~~~~p~~~~~~~~~~~~N~v~apt~~r~~~~~~at~~~~i~~~D~~s~~~Id~~~~~a~~~a~pi~Pv~~~g~~~~~ 239 (404) . .+++.++.+.|..|..++.... T Consensus 134 ---------------------~---------------------~~~~~~~~d~i~~A~~~lgd~~--------------- 156 (274) T protein:vir:96 134 ---------------------T---------------------VEADITKLTGLQTAIDKFNDED--------------- 156 (274) T ss_pred ---------------------c---------------------ccccccCHHHHHHHHHHhcccc--------------- Confidence 0 0123345666666665553221 Q ss_pred ccceEEEEecHHHHHHHhcCcchHHHHHHHHHHhhcccccCCcccccCceEEcCEEEEecCCceeeeccccceeeccccc Q lcl|NC_018846. 240 EDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKVLVSENNL 319 (404) Q Consensus 240 ~~~~yV~~l~p~q~~~Lr~d~~~~~w~~~q~~A~~~~rg~~nPlF~G~~gm~ngvii~~~~~~~irf~~~~~~~~~~~~~ 319 (404) ..-++++|||.++..|++|+.. +|. . ...+..+.+..|.+|.|.|+.|..-..+| T Consensus 157 -~~~~~ivv~p~~~~~L~k~~~~-~f~---~----~s~~g~~~~~~G~ig~~~G~~Vi~s~~~~---------------- 211 (274) T protein:vir:96 157 -LEPMVLFISPLDAGKLRGDATT-NFT---R----ATELGDDVIVKGAFGEALGAVIVRSNKLE---------------- 211 (274) T ss_pred -ccccEEEeCHHHHHHHHhhccc-ccc---c----cccccccceeccccceecCeEEEEeCCCC---------------- Confidence 1136899999999999999742 122 1 12334688999999999999886533321 Q ss_pred ccccccccccccchhheeecCceeEEEeecCCCCCceeeeccccccchHHHHHHHHhhhhhccccCCCCCceEEEEEEEe Q lcl|NC_018846. 320 TATTKEVAAATNIDRAMLLGAQALANAYGQKAGGHFNMVEKKTDMDNRTEIAISWINGLKKIRFPEKSGKMQDHGVIAVD 399 (404) Q Consensus 320 ~~~~~~~a~~~~v~ralllGaqAl~~A~g~~~g~r~~w~Ee~~D~g~~~~i~i~~i~G~~K~rF~~~~g~~~DfGvi~id 399 (404) ...++|+|-.|++..-.+ + .. +|-..|-..+ .+.|.+- +=||+-+++ T Consensus 212 ------------~~t~~l~~~gA~~~~~~~--~--~~-vE~~Rd~~~~----~d~i~~~------------~~y~~~~~~ 258 (274) T protein:vir:96 212 ------------AGTAILAKKGAVKLITKR--D--FF-LETDRDPSTK----TTALYSD------------KHYVAYLYD 258 (274) T ss_pred ------------CceEEEEeccceeeeecC--C--cc-cccccccccc----cCEEEEe------------EEEEEEEEc Confidence 113478888777764222 1 12 4443333221 1111111 224444433 Q ss_pred c--eecC Q lcl|NC_018846. 400 T--AVKL 404 (404) Q Consensus 400 t--aa~~ 404 (404) - .++| T Consensus 259 ~~~~v~~ 265 (274) T protein:vir:96 259 ESKAVKI 265 (274) T ss_pred CCcEEEE Confidence 2 2333 No 32 >protein:vir:80930 Length: 278 # NCBI annotation: Cps # Family: family:all:522 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468392;genbank:gi:157324966;genbank:GeneID:5601363 Probab=98.69 E-value=2.9e-09 Score=67.42 Aligned_cols=268 Identities=15% Similarity=0.069 Sum_probs=144.0 Q ss_pred CCcccchHHHHHHHHHHHHHhhcCchhH-HHHHhhhhhhhhhhcccccccCCCCCccEEEEeecCCCCCcEEEEEEeecc Q lcl|NC_018846. 1 MTTVTSAQANKLYQVALFTAANRNRSMV-NILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKL 79 (404) Q Consensus 1 ~~~~~~~~a~~~~~~~lft~~~~n~~~~-~~~~~~l~~~~~k~s~~~~~~Gt~~~~~I~~~~dL~k~~Gd~v~f~L~~~L 79 (404) |-.-+ |. ..+-.+ +.|+..+.....++.-+. .......+|+-++|++|+|+....+ T Consensus 1 Ma~~~-------------T~--~~~~iiPev~s~~v~~~~~~~~v~~--------~~~~~~~~l~g~~G~tv~ip~~~~~ 57 (278) T protein:vir:80 1 MADLT-------------TK--LANLIDPEVMGPMISAKLPKAIKFG--------KIAPIDNSLEGQPGSEITVPKYKYI 57 (278) T ss_pred CCCcc-------------ee--hhheecHHHHHHHHHHHHHHhhhhc--------ccceecccccCCCCCEEEEeeeccC Confidence 11100 00 000011 456665433322222221 1123355677788999999998766 Q ss_pred ccC-ceecCceeecchhhhhhceeEEEEeeccceeccCChhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcc Q lcl|NC_018846. 80 SKR-PTMGDERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGD 158 (404) Q Consensus 80 ~G~-gV~Gd~~leGnee~L~~~s~~v~Idq~R~aV~~~g~m~~qrs~~dlrk~ar~~L~~w~~~~~D~~~~~~laG~~g~ 158 (404) +.. -+..++.+ ..+.|...++++.|++...++.... .+...+..|+..++.++++.+|++..|..++-+|.|+... T Consensus 58 g~a~~~~~g~~i--~~~~lt~~~~~~~i~~~~~a~~v~D-~~~~~~~~d~~~~~~~~~a~~~a~~~d~~l~~~l~~a~~~ 134 (278) T protein:vir:80 58 GDAQDVAEGAAI--DYSALETESVKHGIKKAGKGVKLTD-ESVLSGYGDPVEEAQKQIRMAIASKVDNDILEEALTTTLE 134 (278) T ss_pred CcceeecCCCcC--cccccccceeeEeeehhhccccccH-HHHhhccccHHHHHHHHHHHHHHHHHHHHHHHHHhccccc Confidence 422 12222222 2578999999999999888887765 5666678899999999999999999999999888765320 Q ss_pred ccccccccccccccccccccccccCCCCCCcEEecCCccchhhhhhhccccHHHHHHHHHHHHHhCCCCccEEeeccccc Q lcl|NC_018846. 159 FVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELH 238 (404) Q Consensus 159 ~~n~~~~~p~~~~~~~~~~~~N~v~apt~~r~~~~~~at~~~~i~~~D~~s~~~Id~~~~~a~~~a~pi~Pv~~~g~~~~ 238 (404) . . .+++. +..|. ..+.+-.+..++....-| T Consensus 135 ---~------~-------------~~~t~---------------~~~~~-~~~~~~da~~~l~~~~~~------------ 164 (278) T protein:vir:80 135 ---V------K-------------GAINI---------------GLIDK-IENTFTDAPDAIEDESIT------------ 164 (278) T ss_pred ---c------c-------------ccccc---------------chhhh-HHHHHHHHHHhhcccCCC------------ Confidence 0 0 01110 00000 122232233233222212 Q ss_pred CccceEEEEecHHHHHHHhcCcchHHHHHHHHHHhhcccccCCcccccCceEEcCEEEEecCCceeeeccccceeecccc Q lcl|NC_018846. 239 GEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKVLVSENN 318 (404) Q Consensus 239 ~~~~~yV~~l~p~q~~~Lr~d~~~~~w~~~q~~A~~~~rg~~nPlF~G~~gm~ngvii~~~~~~~irf~~~~~~~~~~~~ 318 (404) . ..++++||.++..|++++... |. .. ....++.+..|.+|.|.|+.|..-.++|. T Consensus 165 --~-~~~ivv~p~~~~~L~k~~~~~-~~---~~----~~~g~~~~~~G~ig~~~G~~Vi~s~~~p~-------------- 219 (278) T protein:vir:80 165 --T-TGVLFLNYKDTAKLREEAAGS-WT---KA----SQLGDDLLVKGAFGELLGWEIVRTKKLAD-------------- 219 (278) T ss_pred --c-ccEEEECHHHHHHHHhhhhhh-cc---cc----ccccccceeeccceeecceeEEEcCCCCc-------------- Confidence 1 146899999999999987421 21 11 12224567789999999999987666540 Q ss_pred cccccccccccccchhheeecCceeEEEeecCCCCCceeeeccccccchHHHHHHHHhhhhhccccCCCCCceEEEEEEE Q lcl|NC_018846. 319 LTATTKEVAAATNIDRAMLLGAQALANAYGQKAGGHFNMVEKKTDMDNRTEIAISWINGLKKIRFPEKSGKMQDHGVIAV 398 (404) Q Consensus 319 ~~~~~~~~a~~~~v~ralllGaqAl~~A~g~~~g~r~~w~Ee~~D~g~~~~i~i~~i~G~~K~rF~~~~g~~~DfGvi~i 398 (404) ..++|++..|++..-++. . . +|...|-.. ..+.|.+. +=||+-++ T Consensus 220 --------------~t~~l~~~gAi~~~~~~~--~--~-vE~~Rd~~~----~~d~i~~~------------~~yg~~v~ 264 (278) T protein:vir:80 220 --------------GNALAVKAGALKTFLKRN--L--L-AESGRDMDH----KLTKFNAD------------QHYAVALV 264 (278) T ss_pred --------------ceEEEEeccceeeeecCC--c--c-cccccchhh----ccceeeee------------eEEEEEEE Confidence 124778877766543331 1 1 332222211 11222221 11333333 Q ss_pred e--ceecC Q lcl|NC_018846. 399 D--TAVKL 404 (404) Q Consensus 399 d--taa~~ 404 (404) + -+++| T Consensus 265 ~~~~~v~i 272 (278) T protein:vir:80 265 DETKAVKV 272 (278) T ss_pred cCcceEEE Confidence 1 12222 No 33 >protein:vir:7990 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:151 # MgeName: Che8 # Cross-refs: genbank:acc:NP_817344;genbank:gi:29565772;genbank:GeneID:1258978 Probab=98.66 E-value=9.1e-09 Score=64.67 Aligned_cols=253 Identities=12% Similarity=0.031 Sum_probs=141.0 Q ss_pred hcCchhH-HHHHhhhhhhhhhhcccccccCCCCCccEEEEeecCCCCCcEEEEEEeeccccCceecCceeecchhhhhhc Q lcl|NC_018846. 22 NRNRSMV-NILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLSKRPTMGDERVEGRGEDLSHA 100 (404) Q Consensus 22 ~~n~~~~-~~~~~~l~~~~~k~s~~~~~~Gt~~~~~I~~~~dL~k~~Gd~v~f~L~~~L~G~gV~Gd~~leGnee~L~~~ 100 (404) +.++-.+ ++|++.+...-.++..+..+. .+--++....||+|+|+.....+-.-..+... ....+++... T Consensus 1 MA~~~~~pei~~~~v~~~~~~~lv~~~l~--------~~~~~~~~~~GdTv~ip~~~~~~~~d~~~~~~-~~~~~~~~~~ 71 (273) T protein:vir:79 1 MAFNNFIPELWSDMLLEEWTAQTVFANLV--------NREYEGIASKGNVVHIAGVVAPTVKDYKAAGR-QTSADAISDT 71 (273) T ss_pred CcchhhhHHHHHHHHHHHHHhhccchhhh--------hccccccccCCcEEEEeecCcccccccccCCC-ccCccccccc Confidence 5555443 678887766666555554332 22224445679999999876555222221111 2457889999 Q ss_pred eeEEEEeecc-ceeccCChhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccccccccccccccccc Q lcl|NC_018846. 101 DFSLKINQGR-HLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGDFVADDTILPTAEHPEFKKIMI 179 (404) Q Consensus 101 s~~v~Idq~R-~aV~~~g~m~~qrs~~dlrk~ar~~L~~w~~~~~D~~~~~~laG~~g~~~n~~~~~p~~~~~~~~~~~~ 179 (404) +.++.||+.+ .++.+.. +++..+..||++..+. +..=+++..|+.++..++++.. . T Consensus 72 ~~~~tid~~~~~~~~i~d-~d~~~~~~~~~~~~~~-~~~ala~~vD~~i~~~~~~a~~---------------------~ 128 (273) T protein:vir:79 72 GVDLLIDQEKSIDFLVDD-IDRVQVAGSLEAYTRA-GATALATDTDKFIADMLVDNGT---------------------A 128 (273) T ss_pred eEEEEEeeecccceeecc-HHHHhhcccHHHHHHH-HHHHHHHHHHHHHHHHHhhccc---------------------c Confidence 9999999964 5677654 5667788899986654 5556889999988877755321 0 Q ss_pred cccCCCCCCcEEecCCccchhhhhhhccccHHHHHHHHHHHHHhCCCCccEEeecccccCccceEEEEecHHHHHHHhcC Q lcl|NC_018846. 180 NDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHGEDPYYVLYVTPRQWNDWYTS 259 (404) Q Consensus 180 N~v~apt~~r~~~~~~at~~~~i~~~D~~s~~~Id~~~~~a~~~a~pi~Pv~~~g~~~~~~~~~yV~~l~p~q~~~Lr~d 259 (404) |...+ .++.+. .++.|..+..+++...-|- +| ++++++|.++..|+++ T Consensus 129 ~~~~~----------------~~~~~~--~~~~i~~a~~~ld~~~vP~-----~~---------R~lvv~p~~~~~Ll~~ 176 (273) T protein:vir:79 129 LTGSA----------------PSDADD--AFDLIASALKELTKANVPN-----VG---------RVVVVNAEMAFWLRSS 176 (273) T ss_pred ccccc----------------ccchhh--HHHHHHHHHHHhhhccCCc-----cC---------cEEEECHHHHHHHhhc Confidence 00011 111111 2456777777777766552 11 5788999999999998 Q ss_pred cchHHHHHHHHHHhhcccccCCcccccCceEEcCEEEEecCCceeeeccccceeecccccccccccccccc-----cchh Q lcl|NC_018846. 260 TSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKVLVSENNLTATTKEVAAAT-----NIDR 334 (404) Q Consensus 260 ~~~~~w~~~q~~A~~~~rg~~nPlF~G~~gm~ngvii~~~~~~~irf~~~~~~~~~~~~~~~~~~~~a~~~-----~v~r 334 (404) +.+ +.+. ...+..+.|-+|.+|.|.|+.|.+..++|..- +..... ....+.+-....... .=.+ T Consensus 177 ~~~--~~~~------~~~~~~~~l~~G~ig~~~G~~i~~s~~lp~~~--~~~~~a-~~~~A~~~a~~~~~~e~~r~~~~~ 245 (273) T protein:vir:79 177 GSK--LTSA------DTSGDAAGLRAGTIGNLLGARIVESNNLRDTD--DEQFVA-FHPSAAAYVSQIDTVEALRDQDSF 245 (273) T ss_pred hhh--hhhh------hhcccccceeeeEeeEEeceEEEecccccccC--ceEEEE-EeccceeeeeehhhhhcccCcccc Confidence 642 2221 11355678889999999999999988776321 110000 000000000000000 0000 Q ss_pred h-eeecCceeEEEee------------cCCCC Q lcl|NC_018846. 335 A-MLLGAQALANAYG------------QKAGG 353 (404) Q Consensus 335 a-lllGaqAl~~A~g------------~~~g~ 353 (404) + ++-| .+.|| +++|. T Consensus 246 ~~~v~~----~~~yg~~v~~p~~vv~~~~~g~ 273 (273) T protein:vir:79 246 SDRIRA----LHVYGGKVVRPTGVVVFNKTGS 273 (273) T ss_pred eeeeee----eeeeeeEEecCceEEEEeccCC Confidence 0 1111 12232 12333 No 34 >protein:vir:1239 Length: 274 # NCBI annotation: similar to phage B1 major head protein # Family: family:all:522 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510938;genbank:gi:17426272;genbank:GeneID:927376 Probab=98.66 E-value=3.6e-09 Score=66.87 Aligned_cols=262 Identities=15% Similarity=0.080 Sum_probs=150.3 Q ss_pred CCcccchHHHHHHHHHHHHHhhcCchhHHHHHhhhhhhhhhhcccccccCCCCCccEEEEeecCCCCCcEEEEEEeeccc Q lcl|NC_018846. 1 MTTVTSAQANKLYQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLS 80 (404) Q Consensus 1 ~~~~~~~~a~~~~~~~lft~~~~n~~~~~~~~~~l~~~~~k~s~~~~~~Gt~~~~~I~~~~dL~k~~Gd~v~f~L~~~L~ 80 (404) |..-..-. ...-.| +.|+..+.....++..|. .+..+-.+|+-.+|++|+|+....++ T Consensus 1 ma~~~T~l------------~d~iiP--ev~~~~v~~~~~~~l~~~--------~~~~~d~~l~g~~G~tv~iP~~~~ig 58 (274) T protein:vir:12 1 MAQGLTKT------------SNQIIP--EVLAPMMQAQLEKKLRFA--------SFAEVDSTLQGQPGDTLTFPAFVYSG 58 (274) T ss_pred CCcceeeh------------hhhhch--HHHHHHHHHHHHhhhhhc--------ccceecccccCCCCCEEEEeeecCCC Confidence 21100000 000011 356665533322222221 22334467777899999999887553 Q ss_pred cC-ceecCceeecchhhhhhceeEEEEeeccceeccCChhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccc Q lcl|NC_018846. 81 KR-PTMGDERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGDF 159 (404) Q Consensus 81 G~-gV~Gd~~leGnee~L~~~s~~v~Idq~R~aV~~~g~m~~qrs~~dlrk~ar~~L~~w~~~~~D~~~~~~laG~~g~~ 159 (404) .. -+..++.+ ..+.|+..++++.|++...++.+.. .+...+.-|+..++.+.++.+|++..|..++..+.++.. T Consensus 59 ~a~~~~~g~~i--~~~~lt~~~~~~~i~~~~~~~~i~D-~~~~~~~~d~~~~~~~q~~~~~a~~vd~~~l~~~~~a~~-- 133 (274) T protein:vir:12 59 DAQVVAEGEKI--PTDILETKKREAKIRKIAKGTSITD-EALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKL-- 133 (274) T ss_pred ccccccCCCcc--chhhcccceeeEEeeeecceeeecH-HHHHhcccchHHHHHHHHHHHHHHHHHHHHHHHHhcccc-- Confidence 21 12222222 2678999999999999888888765 455556789999999999999999999999866643221 Q ss_pred cccccccccccccccccccccccCCCCCCcEEecCCccchhhhhhhccccHHHHHHHHHHHHHhCCCCccEEeecccccC Q lcl|NC_018846. 160 VADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHG 239 (404) Q Consensus 160 ~n~~~~~p~~~~~~~~~~~~N~v~apt~~r~~~~~~at~~~~i~~~D~~s~~~Id~~~~~a~~~a~pi~Pv~~~g~~~~~ 239 (404) . .+++.++.+.|..|..++.... T Consensus 134 -----------------------~-------------------~~~~a~~~d~i~dA~~~lgd~~--------------- 156 (274) T protein:vir:12 134 -----------------------T-------------------VNADITKLNGLQSAIDKFNDED--------------- 156 (274) T ss_pred -----------------------c-------------------ccccccCHHHHHHHHHHhcccc--------------- Confidence 0 0123356676766665543211 Q ss_pred ccceEEEEecHHHHHHHhcCcchHHHHHHHHHHhhcccccCCcccccCceEEcCEEEEecCCceeeeccccceeeccccc Q lcl|NC_018846. 240 EDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKVLVSENNL 319 (404) Q Consensus 240 ~~~~yV~~l~p~q~~~Lr~d~~~~~w~~~q~~A~~~~rg~~nPlF~G~~gm~ngvii~~~~~~~irf~~~~~~~~~~~~~ 319 (404) ...++++|||.++..|++|+.. +|. +. ..+..+.+.+|.+|.|.|+.|..-..+| T Consensus 157 -~~~~~ivv~p~~~~~L~k~~~~-~fv---~~----s~~g~~~~~~G~ig~~~G~~Vi~s~~~p---------------- 211 (274) T protein:vir:12 157 -LEPMVLFINPLDAGKLRGDAST-NFT---RA----TELGDDIIVKGAFGEALGAIIVRSNKLE---------------- 211 (274) T ss_pred -ccccEEEeCHHHHHHHHhhhhh-hcc---cc----ccccccceecccceeecCeeEEEeCCCC---------------- Confidence 1136899999999999999741 132 11 2334578889999999999987655443 Q ss_pred ccccccccccccchhheeecCceeEEEeecCCCCCceeeeccccccchHHHHHHHHhhhhhccccCCCCCceEEEEEEEe Q lcl|NC_018846. 320 TATTKEVAAATNIDRAMLLGAQALANAYGQKAGGHFNMVEKKTDMDNRTEIAISWINGLKKIRFPEKSGKMQDHGVIAVD 399 (404) Q Consensus 320 ~~~~~~~a~~~~v~ralllGaqAl~~A~g~~~g~r~~w~Ee~~D~g~~~~i~i~~i~G~~K~rF~~~~g~~~DfGvi~id 399 (404) ..-++|+|..|++..-.+ + .. +|-..|-..+. +.|.+- +=|||-+++ T Consensus 212 ------------~~t~~l~~~gA~~~~~~~--~--~~-vE~~Rd~~~~~----d~i~~~------------~~y~~~~~~ 258 (274) T protein:vir:12 212 ------------AGTAILAKKGAVKLILKR--D--FF-LEVARDASTKT----TALYSD------------KHYVAYLYD 258 (274) T ss_pred ------------cceEEEEeccceeeeecC--C--ce-eccccchhhcc----cEEEee------------eEEEEEEEc Confidence 112478888777765322 1 12 44433332211 111111 224444443 Q ss_pred c--eecC Q lcl|NC_018846. 400 T--AVKL 404 (404) Q Consensus 400 t--aa~~ 404 (404) - .+++ T Consensus 259 ~~~vv~~ 265 (274) T protein:vir:12 259 ESKAVKI 265 (274) T ss_pred CCceEEE Confidence 2 2222 No 35 >protein:vir:99675 Length: 324 # NCBI annotation: Major capsid protein # Family: family:all:975 # MgeID: mge:1523 # MgeName: VP4 # Cross-refs: genbank:acc:YP_249589;genbank:gi:68299740;genbank:GeneID:3799990 Probab=98.65 E-value=2.1e-08 Score=62.63 Aligned_cols=289 Identities=12% Similarity=0.095 Sum_probs=159.6 Q ss_pred cEEEEeecCCCCCcEEEEEEeeccccCceecCceeecchhhhhhceeEEEEeecc---ceeccCChhhhhhhhhhHHHHH Q lcl|NC_018846. 56 PVVRITDLNKQAGDEVTFSIMHKLSKRPTMGDERVEGRGEDLSHADFSLKINQGR---HLVDAGGRMSQQRTKFNLASSA 132 (404) Q Consensus 56 ~I~~~~dL~k~~Gd~v~f~L~~~L~G~gV~Gd~~leGnee~L~~~s~~v~Idq~R---~aV~~~g~m~~qrs~~dlrk~a 132 (404) .|+. + ..|.++.|+-+-..+-...+=.+.+.|+-+++.-...+|.||+.- |.|+ .+++.+.++|||.+. T Consensus 1 ~vr~---i--~~g~s~~~~~iG~~~~~~~~~G~~l~~~~~~~~~~e~~itID~~l~~~~~Vd---DiD~~qa~~Dlr~e~ 72 (324) T protein:vir:99 1 MTRT---I--TSGKSAQFPVMGRTKARYLKQGQSLDDGREDIKHTEKVITIDGLLTTDVLIY---DIEDAMNHYDVRSEY 72 (324) T ss_pred Ceee---e--ecCceEEEeeeeeeEeccccCCCCcCCCcCCcCcccEEEEecchhhhhhhhh---hHHHHhcCccchhHH Confidence 3333 4 338999999998888777777788888888888888899999975 4444 588889999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhhhccccccccccccccccccccccccccCCCCCCcEEecCCccchhhhhhhccccHHH Q lcl|NC_018846. 133 RTLLGTYFNDLQDQCAIVHLAGARGDFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGL 212 (404) Q Consensus 133 r~~L~~w~~~~~D~~~~~~laG~~g~~~n~~~~~p~~~~~~~~~~~~N~v~apt~~r~~~~~~at~~~~i~~~D~~s~~~ 212 (404) -.....=|++..|+.+|.++++...... |. ... |..-.... -++...+.+... ....+.+ .+. T Consensus 73 s~~~G~aLA~~~Dq~i~~~~a~~~~~~a------~~--~~~------~~~~~g~~-~~~~~~~~~~~~-~~~~~~~-~da 135 (324) T protein:vir:99 73 STQMGEALAMAADVANYAEMAKLVNSRK------ET--TNE------NIEGLGAA-SLVKITGKKEDP-AKYGTQV-IQA 135 (324) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhhccc------cc--ccC------CcccCCcc-ceeccccccccc-ccCHHHH-HHH Confidence 9999999999999999999875432100 00 000 00000000 000001111111 0111111 344 Q ss_pred HHHHHHHHHHhCCCCccEEeecccccCccceEEEEecHHHHHHHhcCcchHHHHHHHHHHhhcccccCCcccccCceEEc Q lcl|NC_018846. 213 VDNLSLFIDEMAHPLQPVRLSGDELHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWR 292 (404) Q Consensus 213 Id~~~~~a~~~a~pi~Pv~~~g~~~~~~~~~yV~~l~p~q~~~Lr~d~~~~~w~~~q~~A~~~~rg~~nPlF~G~~gm~n 292 (404) |..++..+++..-|-- + ++++++|.|+..|+.++... . ...+..+.+=.|.+|.++ T Consensus 136 i~~a~~~Lde~~VP~~-----g---------R~~vv~P~~y~~Ll~~~~~~----~------~~~~~~~~~~~G~V~~i~ 191 (324) T protein:vir:99 136 LTYARAAFAKKYIPAG-----D---------RTFYTDPDTYSAILAALMPN----A------ANYAALIDPETGNIRNVM 191 (324) T ss_pred HHHHHHHHhhcCCCCC-----C---------CEEEeChHHHHHHhhccccc----c------cccccccceecceEEEEe Confidence 5556677776665511 1 57999999999999775421 1 112334677789999999 Q ss_pred CEEEEecCCceeee-------ccccceeecccccccccccccccccchhheeecCceeEEEeecCCCCCceeeecccccc Q lcl|NC_018846. 293 NILVRKYAGMPIRF-------YQGSKVLVSENNLTATTKEVAAATNIDRAMLLGAQALANAYGQKAGGHFNMVEKKTDMD 365 (404) Q Consensus 293 gvii~~~~~~~irf-------~~~~~~~~~~~~~~~~~~~~a~~~~v~ralllGaqAl~~A~g~~~g~r~~w~Ee~~D~g 365 (404) |+.|.+-+++|..- +.+..+..+..........+...+.-.++|++=.+|++..=...--+.-+|.|+ | T Consensus 192 Gf~V~~Sn~lp~~~~t~~~~a~~~~~~~~~~~~~~~~~~ky~~d~~~~~gl~~~~~a~~tv~~~~~~~e~~~~~~---~- 267 (324) T protein:vir:99 192 GFEVVETPHMTAQMVTNPTDAFDGTGHIFPATGDSTTTGKMTVGADNVVGLFVHRSAVATLKLKDMALERARRPE---Y- 267 (324) T ss_pred ceEEEecCCccccccccccccccccccccccccccccccccccccCceeEEEEehhheEEEeeecceecceechh---h- Confidence 99999988887311 011111111111111111222222333456666665544422221123344433 1 Q ss_pred chHHHHHHHHhhhhhccccCCCCCceEEEEEEEeceecC Q lcl|NC_018846. 366 NRTEIAISWINGLKKIRFPEKSGKMQDHGVIAVDTAVKL 404 (404) Q Consensus 366 ~~~~i~i~~i~G~~K~rF~~~~g~~~DfGvi~idtaa~~ 404 (404) .-.-|-....+|.+=+| + +=-++|.+..-+-= T Consensus 268 ~~d~i~~~~a~G~~~lR-P------e~a~~v~l~~~~~~ 299 (324) T protein:vir:99 268 QADQIIAKYAMGHGGLR-P------EAVGAIIFEDGETP 299 (324) T ss_pred HHHhhhhhhhhcCcccc-c------ceEEEEEEccCccc Confidence 11334444555666555 2 23444444333210 No 36 >protein:vir:9820 Length: 272 # NCBI annotation: putative major capsid/head protein # Family: family:all:522 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795582;genbank:gi:28876339;genbank:GeneID:1257858 Probab=98.60 E-value=5.8e-09 Score=65.74 Aligned_cols=271 Identities=14% Similarity=0.097 Sum_probs=149.1 Q ss_pred CCcccchHHHHHHHHHHHHHhhcCchhHHHHHhhhhhhhhhhcccccccCCCCCccEEEEeecCCCCCcEEEEEEeeccc Q lcl|NC_018846. 1 MTTVTSAQANKLYQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLS 80 (404) Q Consensus 1 ~~~~~~~~a~~~~~~~lft~~~~n~~~~~~~~~~l~~~~~k~s~~~~~~Gt~~~~~I~~~~dL~k~~Gd~v~f~L~~~L~ 80 (404) |...+...+ .++ -| +.|+..+.....+++-+. ....+-.+++..+|++|+++....+. T Consensus 1 MA~~~T~~~------~~~------iP--ev~s~~v~~~~~~~~~~~--------~~~~~~~~~~g~~G~tv~iP~~~~~~ 58 (272) T protein:vir:98 1 MAVGTTKMA------QML------DP--EVLADMIDAEVGKAIRFA--------PLAEVDTTLEGQPGTTLTVPKWDYIG 58 (272) T ss_pred CCCccccch------hee------ch--HHHHHHHHHHHHHHhhhh--------ccccccccccCCCCCEEEEEEecCCC Confidence 442221110 011 11 456665544333333321 11122345677889999998775443 Q ss_pred cC-ceecCceeecchhhhhhceeEEEEeeccceeccCChhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccc Q lcl|NC_018846. 81 KR-PTMGDERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGDF 159 (404) Q Consensus 81 G~-gV~Gd~~leGnee~L~~~s~~v~Idq~R~aV~~~g~m~~qrs~~dlrk~ar~~L~~w~~~~~D~~~~~~laG~~g~~ 159 (404) .. .+..++.+ -.+.+.+.+.++.|.+..+++..... ...++..|+.....+.|++.|++..|..+|..+.|+.. T Consensus 59 ~a~~v~eg~~i--~~~~~~~~~~~~~~~~~~~~~~itd~-~~~~s~~d~~~~~~~~~~~~~a~~~d~~i~~~~~~a~~-- 133 (272) T protein:vir:98 59 DAEDVAEGEAI--PMTQLGFKKTTMTIKKAGKGVEITDE-AILSGYGDPVGQAAKQIVEAIDHKVDADVLDALSKSTQ-- 133 (272) T ss_pred CcccccCCCcc--cccccccceEEEEeeeeeeeeeecHH-HHhhccccHHHHHHHHHHHHHHHHHHHHHHHHhccccc-- Confidence 32 23222222 26778999999999998888877654 45668899999999999999999999999876654321 Q ss_pred cccccccccccccccccccccccCCCCCCcEEecCCccchhhhhhhccccHHHHHHHHHHHHHhCCCCccEEeecccccC Q lcl|NC_018846. 160 VADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHG 239 (404) Q Consensus 160 ~n~~~~~p~~~~~~~~~~~~N~v~apt~~r~~~~~~at~~~~i~~~D~~s~~~Id~~~~~a~~~a~pi~Pv~~~g~~~~~ 239 (404) .+. ...+.+.|..+...+.....+ T Consensus 134 ---------------------~~~----------------------~~~t~d~i~da~~~l~~~~~~------------- 157 (272) T protein:vir:98 134 ---------------------TVE----------------------ATATVDGVSKALDIFNDEDDA------------- 157 (272) T ss_pred ---------------------ccc----------------------cccCHHHHHHHHHHHhccCCC------------- Confidence 000 011334444444444322111 Q ss_pred ccceEEEEecHHHHHHHhcCcchHHHHHHHHHHhhcccccCCcccccCceEEcCEEEEecCCceeeeccccceeeccccc Q lcl|NC_018846. 240 EDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKVLVSENNL 319 (404) Q Consensus 240 ~~~~yV~~l~p~q~~~Lr~d~~~~~w~~~q~~A~~~~rg~~nPlF~G~~gm~ngvii~~~~~~~irf~~~~~~~~~~~~~ 319 (404) -.+++|||..+..|+++... +|. .. .....+.+.+|.+|++.|+.+..-+.+| T Consensus 158 ---~~~~vv~p~~~~~L~k~~~~-~~~---~~----~~~~~~~~~~g~ig~i~G~~Vi~s~~~p---------------- 210 (272) T protein:vir:98 158 ---ETVIVMNPADASTLRLDAAK-EWL---GA----TEVGANRVVSGVYGEVLGVQIVRSRKCP---------------- 210 (272) T ss_pred ---ccEEEEcHHHHHHHHHhccc-ccc---cc----ccccccccccccchhhcCeeEEEcCCCC---------------- Confidence 25799999999999987532 111 11 1233567889999999999998776654 Q ss_pred ccccccccccccchhheeecCceeEEEeecCCCCCceeeeccccccchHHHHHHHHhhhhhccccCCCCCceEEEEEEEe Q lcl|NC_018846. 320 TATTKEVAAATNIDRAMLLGAQALANAYGQKAGGHFNMVEKKTDMDNRTEIAISWINGLKKIRFPEKSGKMQDHGVIAVD 399 (404) Q Consensus 320 ~~~~~~~a~~~~v~ralllGaqAl~~A~g~~~g~r~~w~Ee~~D~g~~~~i~i~~i~G~~K~rF~~~~g~~~DfGvi~id 399 (404) .+ -++++|..|++++-.+. .. .|...|-.. ....|.+.. +|.-.--+.+=+-++.+. T Consensus 211 --------~~----t~~~~~~~a~~~~~~~~----~~-ve~~r~~~~----~~~~i~~~~--~~~~~v~~~~~vv~~t~~ 267 (272) T protein:vir:98 211 --------KG----TAYMVRKGALRIMLKRN----TM-VETDRDITK----AINQIVANK--HYGVYLYKAEKAVKITLK 267 (272) T ss_pred --------cc----eEEEEcCCeEEEEecCC----ce-eeecccccc----ceeEEEEEE--EEEEEEEcCCceEEEEec Confidence 00 14778877777764321 12 333333221 112222211 111100011123334444 Q ss_pred ceecC Q lcl|NC_018846. 400 TAVKL 404 (404) Q Consensus 400 taa~~ 404 (404) .|.|- T Consensus 268 ~a~~~ 272 (272) T protein:vir:98 268 DAAKK 272 (272) T ss_pred ccccC Confidence 44444 No 37 >protein:vir:3033 Length: 272 # NCBI annotation: major capsid protein # Family: family:all:522 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438146;genbank:gi:16271809;genbank:GeneID:929235 Probab=98.60 E-value=5.8e-09 Score=65.74 Aligned_cols=271 Identities=14% Similarity=0.097 Sum_probs=149.1 Q ss_pred CCcccchHHHHHHHHHHHHHhhcCchhHHHHHhhhhhhhhhhcccccccCCCCCccEEEEeecCCCCCcEEEEEEeeccc Q lcl|NC_018846. 1 MTTVTSAQANKLYQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLS 80 (404) Q Consensus 1 ~~~~~~~~a~~~~~~~lft~~~~n~~~~~~~~~~l~~~~~k~s~~~~~~Gt~~~~~I~~~~dL~k~~Gd~v~f~L~~~L~ 80 (404) |...+...+ .++ -| +.|+..+.....+++-+. ....+-.+++..+|++|+++....+. T Consensus 1 MA~~~T~~~------~~~------iP--ev~s~~v~~~~~~~~~~~--------~~~~~~~~~~g~~G~tv~iP~~~~~~ 58 (272) T protein:vir:30 1 MAVGTTKMA------QML------DP--EVLADMIDAEVGKAIRFA--------PLAEVDTTLEGQPGTTLTVPKWDYIG 58 (272) T ss_pred CCCccccch------hee------ch--HHHHHHHHHHHHHHhhhh--------ccccccccccCCCCCEEEEEEecCCC Confidence 442221110 011 11 456665544333333321 11122345677889999998775443 Q ss_pred cC-ceecCceeecchhhhhhceeEEEEeeccceeccCChhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccc Q lcl|NC_018846. 81 KR-PTMGDERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGDF 159 (404) Q Consensus 81 G~-gV~Gd~~leGnee~L~~~s~~v~Idq~R~aV~~~g~m~~qrs~~dlrk~ar~~L~~w~~~~~D~~~~~~laG~~g~~ 159 (404) .. .+..++.+ -.+.+.+.+.++.|.+..+++..... ...++..|+.....+.|++.|++..|..+|..+.|+.. T Consensus 59 ~a~~v~eg~~i--~~~~~~~~~~~~~~~~~~~~~~itd~-~~~~s~~d~~~~~~~~~~~~~a~~~d~~i~~~~~~a~~-- 133 (272) T protein:vir:30 59 DAEDVAEGEAI--PMTQLGFKKTTMTIKKAGKGVEITDE-AILSGYGDPVGQAAKQIVEAIDHKVDADVLDALSKSTQ-- 133 (272) T ss_pred CcccccCCCcc--cccccccceEEEEeeeeeeeeeecHH-HHhhccccHHHHHHHHHHHHHHHHHHHHHHHHhccccc-- Confidence 32 23222222 26778999999999998888877654 45668899999999999999999999999876654321 Q ss_pred cccccccccccccccccccccccCCCCCCcEEecCCccchhhhhhhccccHHHHHHHHHHHHHhCCCCccEEeecccccC Q lcl|NC_018846. 160 VADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHG 239 (404) Q Consensus 160 ~n~~~~~p~~~~~~~~~~~~N~v~apt~~r~~~~~~at~~~~i~~~D~~s~~~Id~~~~~a~~~a~pi~Pv~~~g~~~~~ 239 (404) .+. ...+.+.|..+...+.....+ T Consensus 134 ---------------------~~~----------------------~~~t~d~i~da~~~l~~~~~~------------- 157 (272) T protein:vir:30 134 ---------------------TVE----------------------ATATVDGVSKALDIFNDEDDA------------- 157 (272) T ss_pred ---------------------ccc----------------------cccCHHHHHHHHHHHhccCCC------------- Confidence 000 011334444444444322111 Q ss_pred ccceEEEEecHHHHHHHhcCcchHHHHHHHHHHhhcccccCCcccccCceEEcCEEEEecCCceeeeccccceeeccccc Q lcl|NC_018846. 240 EDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKVLVSENNL 319 (404) Q Consensus 240 ~~~~yV~~l~p~q~~~Lr~d~~~~~w~~~q~~A~~~~rg~~nPlF~G~~gm~ngvii~~~~~~~irf~~~~~~~~~~~~~ 319 (404) -.+++|||..+..|+++... +|. .. .....+.+.+|.+|++.|+.+..-+.+| T Consensus 158 ---~~~~vv~p~~~~~L~k~~~~-~~~---~~----~~~~~~~~~~g~ig~i~G~~Vi~s~~~p---------------- 210 (272) T protein:vir:30 158 ---ETVIVMNPADASTLRLDAAK-EWL---GA----TEVGANRVVSGVYGEVLGVQIVRSRKCP---------------- 210 (272) T ss_pred ---ccEEEEcHHHHHHHHHhccc-ccc---cc----ccccccccccccchhhcCeeEEEcCCCC---------------- Confidence 25799999999999987532 111 11 1233567889999999999998776654 Q ss_pred ccccccccccccchhheeecCceeEEEeecCCCCCceeeeccccccchHHHHHHHHhhhhhccccCCCCCceEEEEEEEe Q lcl|NC_018846. 320 TATTKEVAAATNIDRAMLLGAQALANAYGQKAGGHFNMVEKKTDMDNRTEIAISWINGLKKIRFPEKSGKMQDHGVIAVD 399 (404) Q Consensus 320 ~~~~~~~a~~~~v~ralllGaqAl~~A~g~~~g~r~~w~Ee~~D~g~~~~i~i~~i~G~~K~rF~~~~g~~~DfGvi~id 399 (404) .+ -++++|..|++++-.+. .. .|...|-.. ....|.+.. +|.-.--+.+=+-++.+. T Consensus 211 --------~~----t~~~~~~~a~~~~~~~~----~~-ve~~r~~~~----~~~~i~~~~--~~~~~v~~~~~vv~~t~~ 267 (272) T protein:vir:30 211 --------KG----TAYMVRKGALRIMLKRN----TM-VETDRDITK----AINQIVANK--HYGVYLYKAEKAVKITLK 267 (272) T ss_pred --------cc----eEEEEcCCeEEEEecCC----ce-eeecccccc----ceeEEEEEE--EEEEEEEcCCceEEEEec Confidence 00 14778877777764321 12 333333221 112222211 111100011123334444 Q ss_pred ceecC Q lcl|NC_018846. 400 TAVKL 404 (404) Q Consensus 400 taa~~ 404 (404) .|.|- T Consensus 268 ~a~~~ 272 (272) T protein:vir:30 268 DAAKK 272 (272) T ss_pred ccccC Confidence 44444 No 38 >protein:vir:96833 Length: 275 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240157;genbank:gi:66395822;genbank:GeneID:5133174 Probab=98.51 E-value=3.4e-08 Score=61.51 Aligned_cols=268 Identities=12% Similarity=0.022 Sum_probs=148.8 Q ss_pred CCcccchHHHHHHHHHHHHHhhcCchhHHHHHhhhhhhhhhhcccccccCCCCCccEEEEeecCCCCCcEEEEEEeeccc Q lcl|NC_018846. 1 MTTVTSAQANKLYQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLS 80 (404) Q Consensus 1 ~~~~~~~~a~~~~~~~lft~~~~n~~~~~~~~~~l~~~~~k~s~~~~~~Gt~~~~~I~~~~dL~k~~Gd~v~f~L~~~L~ 80 (404) |.+.|.-..+. .| +.|+..+.....++..|. ......++|+-++|++|+|+....++ T Consensus 3 ~~~~T~l~d~i-------------~P--Ev~~~~v~~~~~~~~~~~--------~~~~~~~~l~g~~G~tv~iP~~~~ig 59 (275) T protein:vir:96 3 LENMTKLANMV-------------NP--EVLAPMMQAELDKKLKFA--------QFADIDNTLVGQPGNTITFPAFVYSG 59 (275) T ss_pred Ccccchhhhhh-------------ch--HHHHHHHHHHHHHhhhhc--------ccceecccccCCCCCEEEeeeeccCC Confidence 43333221111 11 345555444333333331 12233467888899999999987663 Q ss_pred cCc-eecCceeecchhhhhhceeEEEEeeccceeccCChhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccc Q lcl|NC_018846. 81 KRP-TMGDERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGDF 159 (404) Q Consensus 81 G~g-V~Gd~~leGnee~L~~~s~~v~Idq~R~aV~~~g~m~~qrs~~dlrk~ar~~L~~w~~~~~D~~~~~~laG~~g~~ 159 (404) ..- +..++.+ ..+.|+..++++.|.+..+++.... .+...+.-|+..++.++++..|++..|..++..|.++.. T Consensus 60 ~a~~~~~g~~i--~~~~lt~~~~~~~i~~~~~~~~i~D-~~~~~~~~d~~~~~~~~~a~~~a~~~d~~ll~~l~~a~~-- 134 (275) T protein:vir:96 60 DAKVVPEGEEI--PIDLIETKKRQATIRKIGKGTVLTD-EALLSGYGDPKGEAVRQHGLAIANKVDNDVLEALQGATL-- 134 (275) T ss_pred ccccccCCCCc--chhhcccceeeEEeehhcccccccH-HHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhcccc-- Confidence 211 2111222 2778999999999999999988765 455555679999999999999999999999866644221 Q ss_pred cccccccccccccccccccccccCCCCCCcEEecCCccchhhhhhhccccHHHHHHHHHHHHHhCCCCccEEeecccccC Q lcl|NC_018846. 160 VADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHG 239 (404) Q Consensus 160 ~n~~~~~p~~~~~~~~~~~~N~v~apt~~r~~~~~~at~~~~i~~~D~~s~~~Id~~~~~a~~~a~pi~Pv~~~g~~~~~ 239 (404) . .+++.++.+.|..|..++.... T Consensus 135 ---------------------~---------------------~~~~~~~~d~i~dA~~~lgd~~--------------- 157 (275) T protein:vir:96 135 ---------------------K---------------------VEADITKLAGLQTAIDKFNDED--------------- 157 (275) T ss_pred ---------------------c---------------------ccccccCHHHHHHHHHHhcccc--------------- Confidence 0 0223456666666665553211 Q ss_pred ccceEEEEecHHHHHHHhcCcchHHHHHHHHHHhhcccccCCcccccCceEEcCEEEEecCCceeeeccccceeeccccc Q lcl|NC_018846. 240 EDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKVLVSENNL 319 (404) Q Consensus 240 ~~~~yV~~l~p~q~~~Lr~d~~~~~w~~~q~~A~~~~rg~~nPlF~G~~gm~ngvii~~~~~~~irf~~~~~~~~~~~~~ 319 (404) ...++++|||.++..|++++.. +|.. . .....+.+-.|.+|.|.|+.|..-..+|. T Consensus 158 -~~~~~ivv~p~~~~~L~k~~~~-~f~~--~-----~~~g~~~~~~G~ig~~~G~~Vi~s~~~p~--------------- 213 (275) T protein:vir:96 158 -LEPMVLFVNPLDAGKLRASATD-NFTR--A-----TLLGDNVIVKGAFGEALGAIIVRSNKIKE--------------- 213 (275) T ss_pred -CCccEEEeCHHHHHHHHhcccc-cccc--c-----ccccccceeccccceecCeeEEEeCCCCc--------------- Confidence 1236899999999999998742 1321 1 12235778899999999999976555431 Q ss_pred ccccccccccccchhheeecCceeEEEeecCCC---CCceeeeccccccchH-HHHHHHHhhhhhccccCCCCCceEEEE Q lcl|NC_018846. 320 TATTKEVAAATNIDRAMLLGAQALANAYGQKAG---GHFNMVEKKTDMDNRT-EIAISWINGLKKIRFPEKSGKMQDHGV 395 (404) Q Consensus 320 ~~~~~~~a~~~~v~ralllGaqAl~~A~g~~~g---~r~~w~Ee~~D~g~~~-~i~i~~i~G~~K~rF~~~~g~~~DfGv 395 (404) ..++|+|-.|+++.-.+... .|--..-.+.=+++++ ++.+-.=-++.|++|... =.|| T Consensus 214 -------------~t~~i~~~gA~~~~~~~~~~vE~~Rd~~~~~d~i~~~~~y~~~~~~~~~vv~~t~~~~-----~~~~ 275 (275) T protein:vir:96 214 -------------GEAILAKRGAVKLITKRDFFLETERHASHKSTALFSDKHYVAYLYDESKVVKITKSAS-----GLGV 275 (275) T ss_pred -------------ceEEEEeccceeeeecCCcccccccchhhcCcEEEEeEEEEEEEEcCccEEEEEeccc-----ccCC Confidence 12355555555554432100 0100000000011111 111112234455555322 1122 No 39 >protein:vir:97031 Length: 402 # NCBI annotation: 31 # Family: family:all:2806 # MgeID: mge:1644 # MgeName: K1-5 # Cross-refs: genbank:acc:YP_654132;genbank:gi:108862016;genbank:GeneID:5075980 Probab=98.42 E-value=1.6e-07 Score=57.86 Aligned_cols=337 Identities=14% Similarity=0.090 Sum_probs=168.3 Q ss_pred CCcccchHHHHHHHHHHHHHhhcCchhHHHHHhhhhhhhhhhcccccccCCCCCccEEEEeecCCCCCcEEEEEEeeccc Q lcl|NC_018846. 1 MTTVTSAQANKLYQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLS 80 (404) Q Consensus 1 ~~~~~~~~a~~~~~~~lft~~~~n~~~~~~~~~~l~~~~~k~s~~~~~~Gt~~~~~I~~~~dL~k~~Gd~v~f~L~~~L~ 80 (404) ||.-... -.-.|+++ -....-.+|.|.+.+...-+..+.+. .-.++.++ ..|.++.|.-+...+ T Consensus 1 Ms~~n~~-t~~~~~~s----~~~~al~le~f~geV~taF~~~si~~---------~~~~vrti--~~GkS~qf~~iG~~~ 64 (402) T protein:vir:97 1 MSTPNTL-TNVAVSAS----GEVDSLLIEKFNGKVNEQYLKGENIL---------SYFDVQTV--TGTNTVSNKYLGETE 64 (402) T ss_pred CCCcccc-cccccccc----cchhhhhhhhhhhhHHHHHHHHHhhc---------Ccceeeee--cccceEEEEEEeeeE Confidence 6644221 11111110 01111135778887766666555553 22223345 378899999997777 Q ss_pred cCceecCceeecchhhhhhceeEEEEee---ccceeccCChhhhhhhhhh-HHHHHHHHHHHHHHHHHHHHHHHH--Hhh Q lcl|NC_018846. 81 KRPTMGDERVEGRGEDLSHADFSLKINQ---GRHLVDAGGRMSQQRTKFN-LASSARTLLGTYFNDLQDQCAIVH--LAG 154 (404) Q Consensus 81 G~gV~Gd~~leGnee~L~~~s~~v~Idq---~R~aV~~~g~m~~qrs~~d-lrk~ar~~L~~w~~~~~D~~~~~~--laG 154 (404) -...+-.+.+.| +.+......|.||. .||.|. .+++-...+| +|++--..+..-+++..||.+|-. +++ T Consensus 65 a~y~~~G~~ldg--~~~~~~k~~ItID~lL~a~~~V~---diDeaq~~yD~vRse~s~e~G~ALA~~~Dq~ii~~i~~aa 139 (402) T protein:vir:97 65 LQVLAPGQSPNA--TPTQADKNQLVIDTTVIARNTVA---HIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGG 139 (402) T ss_pred EeeeccccccCC--CCcccccEEEEeCceeechhhhh---hHHHHHhcccchhHHHHHHHHHHHHHHHHHHHHHHHHHhh Confidence 666665566765 46777777899998 566665 5888889999 899999999999999999977533 334 Q ss_pred hhccccccccccccccccccccccccccCCCCCCcEEecCCccchhhhhhhccccHHHHHHHHHHHHHhCCCCccEEeec Q lcl|NC_018846. 155 ARGDFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSG 234 (404) Q Consensus 155 ~~g~~~n~~~~~p~~~~~~~~~~~~N~v~apt~~r~~~~~~at~~~~i~~~D~~s~~~Id~~~~~a~~~a~pi~Pv~~~g 234 (404) ... ...... .|....... ..++ ..+.....+..+.+-. .|-.+...+++..-|. ++ T Consensus 140 ~a~-t~~~~~-~~~~~~~g~--------s~~~--------~~t~~~a~~~~~~l~~-ai~~a~~~LdEkdVP~-----~d 195 (402) T protein:vir:97 140 IAN-TKAERN-KPRVKGHGF--------SINV--------NVTESEALANPQYVMA-AVEYALEQQLEQEVDI-----SD 195 (402) T ss_pred ccc-cccccc-cCccccccc--------cccc--------ccccchhhcCHHHHHH-HHHHHHHHHHhcCCCc-----cc Confidence 321 111100 000000000 0000 0011111122222222 2223445555555451 11 Q ss_pred ccccCccceEEEEecHHHHHHHhcCcchHHHHHHHHHHhhcccccCCcccccCceEEcCEEEEecCCceeeeccccceee Q lcl|NC_018846. 235 DELHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKVLV 314 (404) Q Consensus 235 ~~~~~~~~~yV~~l~p~q~~~Lr~d~~~~~w~~~q~~A~~~~rg~~nPlF~G~~gm~ngvii~~~~~~~irf~~~~~~~~ 314 (404) ++++++|.||.-|..++.+ .+.. .... ..+.+=.|.+++++|+.|.+-++.|-.--..+.... T Consensus 196 ---------Rv~vv~P~~y~~Ll~~~rl---~n~d----~~~~-~~g~~~~G~v~~v~Gv~Vv~SnnlP~~a~~it~~~l 258 (402) T protein:vir:97 196 ---------VAIMMPWKFFNALRDADRI---VDKT----YTIS-QSGATINGFVLSSYNCPVIPSNRFPTFAQDQAHHLL 258 (402) T ss_pred ---------cEEEeChHHHHHHhhcccc---cchh----hccc-cCCccccceeEEEeceEEEecCcccccccccccccc Confidence 6999999999999999763 2111 1111 124445789999999999999887721101111111 Q ss_pred cccccccccccccccccchhheeecCceeEEEeecCCCCCceeeeccccccchHHHHHHHHhhhhhcccc--------CC Q lcl|NC_018846. 315 SENNLTATTKEVAAATNIDRAMLLGAQALANAYGQKAGGHFNMVEKKTDMDNRTEIAISWINGLKKIRFP--------EK 386 (404) Q Consensus 315 ~~~~~~~~~~~~a~~~~v~ralllGaqAl~~A~g~~~g~r~~w~Ee~~D~g~~~~i~i~~i~G~~K~rF~--------~~ 386 (404) ++.+.+.. ..+.+.+.-.+++++=..|++.+=...--+.++|.++.+ -.-|-....+|..=.|-. .. T Consensus 259 s~a~~G~~-y~~t~d~t~~~~~~f~~~Av~tvk~~~vT~~~~~d~r~~----~~~id~~~a~G~g~~RPeaa~vv~~~~~ 333 (402) T protein:vir:97 259 SNEDNGYR-YDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYEKKEK----TYYIDTFMAEGAIPDRWEAVSVVTTKRD 333 (402) T ss_pred ccCCCCcc-CCcCcccceeEEEEEecceEEEEEeeccccchhhchhHH----HHHHHHHHHhCCcccCccceEEEEEecc Confidence 11111110 112234455566666666666653332223333333211 112445556676665532 10 Q ss_pred --CC----CceEEEEEEEeceecC Q lcl|NC_018846. 387 --SG----KMQDHGVIAVDTAVKL 404 (404) Q Consensus 387 --~g----~~~DfGvi~idtaa~~ 404 (404) .+ -.+||..+.--.--|. T Consensus 334 ~t~~~~~~~~~~~~~~~~~~~~~~ 357 (402) T protein:vir:97 334 ATTGDAGGPGDDHATVLARAQRKA 357 (402) T ss_pred cccccCCccccchhhhhcccccce Confidence 00 0133322211111111 No 40 >protein:vir:105822 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:1636 # MgeName: PMC # Cross-refs: genbank:acc:YP_655767;genbank:gi:109522090;genbank:GeneID:4157630 Probab=98.31 E-value=3.5e-07 Score=55.96 Aligned_cols=255 Identities=11% Similarity=0.037 Sum_probs=137.0 Q ss_pred hcCchhH-HHHHhhhhhhhhhhcccccccCCCCCccEEEEeecCCCCCcEEEEEEeeccccCceecCceeecchhhhhhc Q lcl|NC_018846. 22 NRNRSMV-NILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLSKRPTMGDERVEGRGEDLSHA 100 (404) Q Consensus 22 ~~n~~~~-~~~~~~l~~~~~k~s~~~~~~Gt~~~~~I~~~~dL~k~~Gd~v~f~L~~~L~G~gV~Gd~~leGnee~L~~~ 100 (404) +.++..+ ++|++.+...-.+.+.+.....++ .+.+-..||+|+|+....++-..-.+.. -....+++... T Consensus 1 MA~~~~~pe~~~~~v~~~~~~~lv~~~l~~~~--------~~~~~~~Gdtv~ip~~~~~~~~d~~~~~-~~~~~~~~~~~ 71 (273) T protein:vir:10 1 MAFNNFIPELWSDMLLEEWTAQTVFANLVNRE--------YEGTASKGNVVHIAGVVAPTVKDYKAAG-RQTSADAISDT 71 (273) T ss_pred CcchhhhHHHHHHHHHHHHHhhhccchhhccc--------cccccccCceEEEeecccccccccccCC-CccCccccccc Confidence 6666544 678887766655555544332222 2222356999999987665522111111 12346788999 Q ss_pred eeEEEEeecc-ceeccCChhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccccccccccccccccc Q lcl|NC_018846. 101 DFSLKINQGR-HLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGDFVADDTILPTAEHPEFKKIMI 179 (404) Q Consensus 101 s~~v~Idq~R-~aV~~~g~m~~qrs~~dlrk~ar~~L~~w~~~~~D~~~~~~laG~~g~~~n~~~~~p~~~~~~~~~~~~ 179 (404) +.++.||+.+ .++.+. ..++.....|+++..+. +..=+++..|+.++..++++-. . T Consensus 72 ~~~~tid~~~~~~~~i~-d~d~~~~~~~~~~~~~~-~~~alA~~vD~~i~~~~~~a~~---------------------~ 128 (273) T protein:vir:10 72 GVDLLIDQEKSIDFLVD-DIDRVQVAGSLEAYTRA-GATALATDTDKFIADMLVDNGT---------------------A 128 (273) T ss_pred eEEEEEeeeeecceEee-cHHHhhhhccHHHHHHH-HHHHHHHHHHHHHHHHHhcccc---------------------c Confidence 9999999975 455554 35666677898875554 5667889999998877765321 0 Q ss_pred cccCCCCCCcEEecCCccchhhhhhhccccHHHHHHHHHHHHHhCCCCccEEeecccccCccceEEEEecHHHHHHHhcC Q lcl|NC_018846. 180 NDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHGEDPYYVLYVTPRQWNDWYTS 259 (404) Q Consensus 180 N~v~apt~~r~~~~~~at~~~~i~~~D~~s~~~Id~~~~~a~~~a~pi~Pv~~~g~~~~~~~~~yV~~l~p~q~~~Lr~d 259 (404) |... ..++.+. .++.|..+...++...-|- ++ ++++++|.++..|+++ T Consensus 129 ~~~~----------------~~~~~~~--~~~~i~~a~~~ld~~~vP~-----~~---------R~lvv~p~~~~~L~~~ 176 (273) T protein:vir:10 129 LTGS----------------APTDADD--AFDLIAKALKELTKANVPN-----VG---------RVVVVNAEMAFWLRSS 176 (273) T ss_pred cccc----------------cccchhH--HHHHHHHHHHHhhhcCCCc-----CC---------CEEEECHHHHHHHhcc Confidence 0000 1122221 2456777777777766551 11 5789999999999998 Q ss_pred cchHHHHHHHHHHhhcccccCCcccccCceEEcCEEEEecCCceeeeccccceeecccccccccc---------cccccc Q lcl|NC_018846. 260 TSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKVLVSENNLTATTK---------EVAAAT 330 (404) Q Consensus 260 ~~~~~w~~~q~~A~~~~rg~~nPlF~G~~gm~ngvii~~~~~~~irf~~~~~~~~~~~~~~~~~~---------~~a~~~ 330 (404) +.+ +.+.. . .+..+.|=.|.+|.+.|+-|.+..++|.. .+.+.... ...+.+-. -.+..+ T Consensus 177 ~~~--~~~~~----~--~~~~~~l~~G~ig~i~G~~v~~s~~lp~~--~~~~~~~~-~~~A~~~a~q~~~~e~~r~~~~~ 245 (273) T protein:vir:10 177 GSK--LTSAD----T--SGDAAGLRAGTIGNLLGARIVESNNLRDT--DDEQFVAF-HPSAAAYVSQIDTVEALRDQDSF 245 (273) T ss_pred hhh--hhhhh----c--cccccceeeeeeeEEeceEEEEecccccC--CccEEEEE-eccceeeeeeeehhhcccCCCcc Confidence 753 22211 1 24456677899999999999998777621 11111000 00000000 000000 Q ss_pred --cchhheeecC-----ceeEEEeecCCCC Q lcl|NC_018846. 331 --NIDRAMLLGA-----QALANAYGQKAGG 353 (404) Q Consensus 331 --~v~ralllGa-----qAl~~A~g~~~g~ 353 (404) .|.--...|+ -+++. . ++.|. T Consensus 246 ~~~v~~~~~yg~~v~~~~~~~~-l-~~~g~ 273 (273) T protein:vir:10 246 SDRIRALHVYGGKVVRPTGVVV-F-NKTGS 273 (273) T ss_pred eeeeeeeeeeeeeEeccceEEE-E-eccCC Confidence 1110011111 11111 1 12333 No 41 >protein:vir:102605 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:1661 # MgeName: Llij # Cross-refs: genbank:acc:YP_655002;genbank:gi:109392192;genbank:GeneID:4157227 Probab=98.31 E-value=3.5e-07 Score=55.96 Aligned_cols=255 Identities=11% Similarity=0.037 Sum_probs=137.0 Q ss_pred hcCchhH-HHHHhhhhhhhhhhcccccccCCCCCccEEEEeecCCCCCcEEEEEEeeccccCceecCceeecchhhhhhc Q lcl|NC_018846. 22 NRNRSMV-NILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLSKRPTMGDERVEGRGEDLSHA 100 (404) Q Consensus 22 ~~n~~~~-~~~~~~l~~~~~k~s~~~~~~Gt~~~~~I~~~~dL~k~~Gd~v~f~L~~~L~G~gV~Gd~~leGnee~L~~~ 100 (404) +.++..+ ++|++.+...-.+.+.+.....++ .+.+-..||+|+|+....++-..-.+.. -....+++... T Consensus 1 MA~~~~~pe~~~~~v~~~~~~~lv~~~l~~~~--------~~~~~~~Gdtv~ip~~~~~~~~d~~~~~-~~~~~~~~~~~ 71 (273) T protein:vir:10 1 MAFNNFIPELWSDMLLEEWTAQTVFANLVNRE--------YEGTASKGNVVHIAGVVAPTVKDYKAAG-RQTSADAISDT 71 (273) T ss_pred CcchhhhHHHHHHHHHHHHHhhhccchhhccc--------cccccccCceEEEeecccccccccccCC-CccCccccccc Confidence 6666544 678887766655555544332222 2222356999999987665522111111 12346788999 Q ss_pred eeEEEEeecc-ceeccCChhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccccccccccccccccc Q lcl|NC_018846. 101 DFSLKINQGR-HLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGDFVADDTILPTAEHPEFKKIMI 179 (404) Q Consensus 101 s~~v~Idq~R-~aV~~~g~m~~qrs~~dlrk~ar~~L~~w~~~~~D~~~~~~laG~~g~~~n~~~~~p~~~~~~~~~~~~ 179 (404) +.++.||+.+ .++.+. ..++.....|+++..+. +..=+++..|+.++..++++-. . T Consensus 72 ~~~~tid~~~~~~~~i~-d~d~~~~~~~~~~~~~~-~~~alA~~vD~~i~~~~~~a~~---------------------~ 128 (273) T protein:vir:10 72 GVDLLIDQEKSIDFLVD-DIDRVQVAGSLEAYTRA-GATALATDTDKFIADMLVDNGT---------------------A 128 (273) T ss_pred eEEEEEeeeeecceEee-cHHHhhhhccHHHHHHH-HHHHHHHHHHHHHHHHHhcccc---------------------c Confidence 9999999975 455554 35666677898875554 5667889999998877765321 0 Q ss_pred cccCCCCCCcEEecCCccchhhhhhhccccHHHHHHHHHHHHHhCCCCccEEeecccccCccceEEEEecHHHHHHHhcC Q lcl|NC_018846. 180 NDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHGEDPYYVLYVTPRQWNDWYTS 259 (404) Q Consensus 180 N~v~apt~~r~~~~~~at~~~~i~~~D~~s~~~Id~~~~~a~~~a~pi~Pv~~~g~~~~~~~~~yV~~l~p~q~~~Lr~d 259 (404) |... ..++.+. .++.|..+...++...-|- ++ ++++++|.++..|+++ T Consensus 129 ~~~~----------------~~~~~~~--~~~~i~~a~~~ld~~~vP~-----~~---------R~lvv~p~~~~~L~~~ 176 (273) T protein:vir:10 129 LTGS----------------APTDADD--AFDLIAKALKELTKANVPN-----VG---------RVVVVNAEMAFWLRSS 176 (273) T ss_pred cccc----------------cccchhH--HHHHHHHHHHHhhhcCCCc-----CC---------CEEEECHHHHHHHhcc Confidence 0000 1122221 2456777777777766551 11 5789999999999998 Q ss_pred cchHHHHHHHHHHhhcccccCCcccccCceEEcCEEEEecCCceeeeccccceeecccccccccc---------cccccc Q lcl|NC_018846. 260 TSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKVLVSENNLTATTK---------EVAAAT 330 (404) Q Consensus 260 ~~~~~w~~~q~~A~~~~rg~~nPlF~G~~gm~ngvii~~~~~~~irf~~~~~~~~~~~~~~~~~~---------~~a~~~ 330 (404) +.+ +.+.. . .+..+.|=.|.+|.+.|+-|.+..++|.. .+.+.... ...+.+-. -.+..+ T Consensus 177 ~~~--~~~~~----~--~~~~~~l~~G~ig~i~G~~v~~s~~lp~~--~~~~~~~~-~~~A~~~a~q~~~~e~~r~~~~~ 245 (273) T protein:vir:10 177 GSK--LTSAD----T--SGDAAGLRAGTIGNLLGARIVESNNLRDT--DDEQFVAF-HPSAAAYVSQIDTVEALRDQDSF 245 (273) T ss_pred hhh--hhhhh----c--cccccceeeeeeeEEeceEEEEecccccC--CccEEEEE-eccceeeeeeeehhhcccCCCcc Confidence 753 22211 1 24456677899999999999998777621 11111000 00000000 000000 Q ss_pred --cchhheeecC-----ceeEEEeecCCCC Q lcl|NC_018846. 331 --NIDRAMLLGA-----QALANAYGQKAGG 353 (404) Q Consensus 331 --~v~ralllGa-----qAl~~A~g~~~g~ 353 (404) .|.--...|+ -+++. . ++.|. T Consensus 246 ~~~v~~~~~yg~~v~~~~~~~~-l-~~~g~ 273 (273) T protein:vir:10 246 SDRIRALHVYGGKVVRPTGVVV-F-NKTGS 273 (273) T ss_pred eeeeeeeeeeeeeEeccceEEE-E-eccCC Confidence 1110011111 11111 1 12333 No 42 >protein:vir:100057 Length: 375 # NCBI annotation: T7-like capsid protein # Family: family:all:975 # MgeID: mge:1604 # MgeName: P-SSP7 # Cross-refs: genbank:acc:YP_214206;genbank:gi:61806429;genbank:GeneID:3294737 Probab=98.27 E-value=7.2e-07 Score=54.27 Aligned_cols=330 Identities=11% Similarity=0.105 Sum_probs=157.3 Q ss_pred CCcccc-----hHHHHHHHHHHHHHhhcCch---hHHHHHhhhhhhhhhhcccccccCCCCCccEEEEeecCCCCCcEEE Q lcl|NC_018846. 1 MTTVTS-----AQANKLYQVALFTAANRNRS---MVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVT 72 (404) Q Consensus 1 ~~~~~~-----~~a~~~~~~~lft~~~~n~~---~~~~~~~~l~~~~~k~s~~~~~~Gt~~~~~I~~~~dL~k~~Gd~v~ 72 (404) |+...- ++.-.....+ +.++. .+|.|++.+...-.+.+.+. ..+++.++ ..|.++. T Consensus 1 ~~~~~~~~~~~~n~~t~~~~~-----~~~~~~al~le~f~geV~~~f~~~si~~---------~~~~~rti--~~Gksv~ 64 (375) T protein:vir:10 1 MANANQVALGRSNLSTGTGYG-----GATDKYALYLKLFSGEMFKGFQHETIAR---------DLVTKRTL--KNGKSLQ 64 (375) T ss_pred CccccccccCccccCCccccc-----cccchHHHHHHHHhHHHHHHHHHHHhhh---------cccccccc--ccCceEE Confidence 332211 1111111111 11222 46888888877777666653 33344454 3599999 Q ss_pred EEEeeccccCceecCceeecc-hhhhhhceeEEEEeec---cceeccCChhhhhhhhhhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018846. 73 FSIMHKLSKRPTMGDERVEGR-GEDLSHADFSLKINQG---RHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCA 148 (404) Q Consensus 73 f~L~~~L~G~gV~Gd~~leGn-ee~L~~~s~~v~Idq~---R~aV~~~g~m~~qrs~~dlrk~ar~~L~~w~~~~~D~~~ 148 (404) |+-+...+-...+..+.+.|+ .++..-.+.+|.||+. ++.|+ .+++...++|||++.......=|++..|+.+ T Consensus 65 f~~iG~~t~~~~t~G~~i~~~~~~d~~~te~~l~ID~~~y~~~~Vd---DiD~aqa~~Dlr~e~s~~~G~aLA~~~D~~i 141 (375) T protein:vir:10 65 FIYTGRMTSSFHTPGTPILGNADKAPPVAEKTIVMDDLLISSAFVY---DLDETLAHYELRGEISKKIGYALAEKYDRLI 141 (375) T ss_pred EEeeeeeEEeeecCCcCcCCccccCCCCCceEEEecchhhhhhhHh---hHHHHhcCchhHHHHHHHHHHHHHHHHHHHH Confidence 999999988888888888887 4567778889999998 46666 6899999999999999999999999999999 Q ss_pred HHHHh-hhhccccccccccccccccccccccccccCCCCCCcEEecCCccchhhhhhhccccHHHHHHHHHHHHHhCCCC Q lcl|NC_018846. 149 IVHLA-GARGDFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPL 227 (404) Q Consensus 149 ~~~la-G~~g~~~n~~~~~p~~~~~~~~~~~~N~v~apt~~r~~~~~~at~~~~i~~~D~~s~~~Id~~~~~a~~~a~pi 227 (404) +.++. +++.. .|... ++.-.|....+-.+.+.++...++ .+.+ .+.|..+.+.+++..-|- T Consensus 142 ~~~l~kaa~~~-------~p~~~---------~~~~~~Gg~~i~~~sg~~~~~~~t-a~~~-~~ai~~a~~~Lde~~VP~ 203 (375) T protein:vir:10 142 FRSITRGARSA-------SPVSA---------TNFVEPGGTQIRVGSGTNESDAFT-ASAL-VNAFYDAAAAMDEKGVSS 203 (375) T ss_pred HHHHHHhhhhc-------ccccc---------ccccccCcceeeeccccccccccC-HHHH-HHHHHHHHHHHhhcCCCC Confidence 98876 44421 11000 000011111111111222222222 2221 244445666666655551 Q ss_pred ccEEeecccccCccceEEEEecHHHHHHHhcCcchHHHHHHHHHHhhcccccCCcccccCceEEcCEEEEecCCceeeec Q lcl|NC_018846. 228 QPVRLSGDELHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFY 307 (404) Q Consensus 228 ~Pv~~~g~~~~~~~~~yV~~l~p~q~~~Lr~d~~~~~w~~~q~~A~~~~rg~~nPlF~G~~gm~ngvii~~~~~~~irf~ 307 (404) - . ++++++|.|+..|..+-+-... .+ +.-+.+.-.=.|.++.++|+.|.+-.+.|. . T Consensus 204 ~-------------~-R~~vv~P~~y~~Ll~~~d~~~~----~n---~d~~~~~~~~~g~v~~i~Gv~V~~Sn~lP~--~ 260 (375) T protein:vir:10 204 Q-------------G-RCAVLNPRQYYALIQDIGSNGL----VN---RDVQGSALQSGNGVIEIAGIHIYKSMNIPF--L 260 (375) T ss_pred C-------------C-CEEEeChHHHHHHHhcCCccce----ee---ecccccceeccceEEEEeceEEEEeccccc--c Confidence 1 1 5688999999999865210000 01 111112222357789999999998777651 1 Q ss_pred cccceeecccccccccccccccccchhheeecCc--------------------eeEEEeec-------CCCCCceeeec Q lcl|NC_018846. 308 QGSKVLVSENNLTATTKEVAAATNIDRAMLLGAQ--------------------ALANAYGQ-------KAGGHFNMVEK 360 (404) Q Consensus 308 ~~~~~~~~~~~~~~~~~~~a~~~~v~ralllGaq--------------------Al~~A~g~-------~~g~r~~w~Ee 360 (404) .+..+.. +.....-++....++.+..+.. .+++.|-+ .-+....-.+ T Consensus 261 ~~~~~~~-----g~~~~~~a~~~~~~~~~~~~~~~~~~~g~~~~y~~d~~~~~~~~~~~~~~~A~g~v~~~~~~~~~~~- 334 (375) T protein:vir:10 261 GKYGVKY-----GGTTGETSPGNLGSHIGPTPENANATGGVNNDYGTNAELGAKSCGLIFQKEAAGVVEAIGPQVQVTN- 334 (375) T ss_pred ccccccc-----cccccccchhhhhccccccCCcceeeccccccccccccccCceEEEEEchhheeeeeeecccccccc- Confidence 1110000 0000000111111122222222 22222210 0000000000 Q ss_pred cccccchH---HHHHHHHhhhhhccccCCCCCceEEEEEEEeceecC Q lcl|NC_018846. 361 KTDMDNRT---EIAISWINGLKKIRFPEKSGKMQDHGVIAVDTAVKL 404 (404) Q Consensus 361 ~~D~g~~~---~i~i~~i~G~~K~rF~~~~g~~~DfGvi~idtaa~~ 404 (404) -||.-+. .|-....+|..=.| .+=-+++..--.+++ T Consensus 335 -~~~~~~~q~~~i~~~~a~G~~~lr-------p~~av~l~~~~~~~~ 373 (375) T protein:vir:10 335 -GDVSVIYQGDVILGRMAMGADYLN-------PAAAVELYIGATAPS 373 (375) T ss_pred -chhhheeeeeeeeeeeeeccCccC-------ceeEEEEecCcCccc Confidence 0111000 01111122222222 111122222212222 No 43 >protein:vir:103323 Length: 364 # NCBI annotation: major capsid-like protein # Family: family:all:2806 # MgeID: mge:1609 # MgeName: Era103 # Cross-refs: genbank:acc:YP_001039668;genbank:gi:125999997;genbank:GeneID:4818399 Probab=98.26 E-value=1.9e-07 Score=57.46 Aligned_cols=336 Identities=10% Similarity=0.037 Sum_probs=170.1 Q ss_pred CCcccchHHHHHHHHHHHHHhhcCchhHHHHHhhhhhhhhhhcccccccCCCCCccEEEEeecCCCCCcEEEEEEeeccc Q lcl|NC_018846. 1 MTTVTSAQANKLYQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLS 80 (404) Q Consensus 1 ~~~~~~~~a~~~~~~~lft~~~~n~~~~~~~~~~l~~~~~k~s~~~~~~Gt~~~~~I~~~~dL~k~~Gd~v~f~L~~~L~ 80 (404) ||.-... -.-.|+++ -....-.+|.|.+.+...-...+.+.. -..+.++ ..|.++.|+-+...+ T Consensus 1 ms~~n~~-t~~~~~~~----~~~~al~le~f~geV~taf~~~s~~~~---------~~~~rti--~~gkS~q~~~iG~~~ 64 (364) T protein:vir:10 1 MSNPNVL-TQPAVSAS----GEVDSLLIEKFNNRVHEQYLKGENLLQ---------WFDVQEV--VGTNSVSNKYIGETE 64 (364) T ss_pred CCCcccc-cccccccc----cchhhhhhhhhhhhHHHHHHHHHhhcC---------cceeeee--cccceEEeeeeeeeE Confidence 6644221 11111110 011111357788877666655555531 2223344 478899999997777 Q ss_pred cCceecCceeecchhhhhhceeEEEEee---ccceeccCChhhhhhhhhh-HHHHHHHHHHHHHHHHHHHHHHHHH-hhh Q lcl|NC_018846. 81 KRPTMGDERVEGRGEDLSHADFSLKINQ---GRHLVDAGGRMSQQRTKFN-LASSARTLLGTYFNDLQDQCAIVHL-AGA 155 (404) Q Consensus 81 G~gV~Gd~~leGnee~L~~~s~~v~Idq---~R~aV~~~g~m~~qrs~~d-lrk~ar~~L~~w~~~~~D~~~~~~l-aG~ 155 (404) -...+-.+.+.| +.+.....+|.||+ .||.|. .+++-...+| +|++.-..+..=+++..||.++..+ +++ T Consensus 65 ~~~~~~G~~ld~--~~~~~~k~~itID~ll~a~~~V~---diDe~q~~~D~vR~e~s~e~G~ALA~~~Dq~i~~~v~~aa 139 (364) T protein:vir:10 65 LQVLSPGKSPDA--SPTEFDKNRLVVDTTVIARNTVA---HFHDVQNDIDGLKSKLSVNQAKKLKKMEDSMVIQQLVLGG 139 (364) T ss_pred EeeeccCcccCC--CCcccCcEEEEecceeeechhhh---hHHHHhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhh Confidence 666665556655 56777788999998 567766 5888889999 8999988889899999999886554 222 Q ss_pred hccccccccccccccccccccccccccCCCCCCcEEecCCccchhhhhhhccccHHHHHHHHHHHHHhCCCCccEEeecc Q lcl|NC_018846. 156 RGDFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGD 235 (404) Q Consensus 156 ~g~~~n~~~~~p~~~~~~~~~~~~N~v~apt~~r~~~~~~at~~~~i~~~D~~s~~~Id~~~~~a~~~a~pi~Pv~~~g~ 235 (404) .. |. .+.+. ++.-.|-...+ -+++.++. ..+..+.+-- .|..+...+++..-|. ++ T Consensus 140 ~a---~~--------~~~~~----~~~~~~~g~~i-~~~~~a~~-~~~~~~~l~~-ai~~a~~~LdEkdVP~-----~~- 195 (364) T protein:vir:10 140 IS---NT--------EAIRK----NPRVAGHGFSI-HIVGLASS-FLTSPQYMMA-AIEMAMEQQTEQEVDT-----SE- 195 (364) T ss_pred hh---cc--------ccccc----CCcccCCccee-eecccCcc-hhhhHHHHHH-HHHHHHHHHhhcCCCc-----cc- Confidence 11 10 00000 00111111101 11111111 1222222222 2223455555555442 11 Q ss_pred cccCccceEEEEecHHHHHHHhcCcchHHHHHHHHHHhhcccccCCcccccCceEEcCEEEEecCCceeeeccccceee- Q lcl|NC_018846. 236 ELHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKVLV- 314 (404) Q Consensus 236 ~~~~~~~~yV~~l~p~q~~~Lr~d~~~~~w~~~q~~A~~~~rg~~nPlF~G~~gm~ngvii~~~~~~~irf~~~~~~~~- 314 (404) ++++++|.||..|..++.+ .+. .....+ .+..-+|.+++++||.|.+-++.|.. .+....+ T Consensus 196 --------R~~vv~P~~y~~Ll~~~~l---vn~----d~~~~~-~~~~~~G~v~~v~Gv~Vv~Sn~lP~~--~~~~~~t~ 257 (364) T protein:vir:10 196 --------LCGLMPWTAFNCLRDADRI---VDK----SYTIAA-SDNTVDGFVLKSWNTPIVPSNRFPKL--SDNTEGTG 257 (364) T ss_pred --------cEEEeChHHHHHHhcCCcc---ccc----cccccC-CCccccceeEEEeceEEEeccccccc--cccccccc Confidence 7999999999999998763 211 111112 34456899999999999999988732 1111000 Q ss_pred --ccc--cccccccccc--ccccchhheeecCceeEEEeecCCCCCceeeec-cccccchHHHHHHHHhhhhhccc---- Q lcl|NC_018846. 315 --SEN--NLTATTKEVA--AATNIDRAMLLGAQALANAYGQKAGGHFNMVEK-KTDMDNRTEIAISWINGLKKIRF---- 383 (404) Q Consensus 315 --~~~--~~~~~~~~~a--~~~~v~ralllGaqAl~~A~g~~~g~r~~w~Ee-~~D~g~~~~i~i~~i~G~~K~rF---- 383 (404) +.. +....+..+. +.++-.+++++=..|++.+=...--+..+|.+. ..|+.+ ....+|..=.|- T Consensus 258 ~~t~h~ls~~~~g~~y~v~~d~~~~~~~~f~~~Al~tv~~~~~t~e~~~~~~~~~~~id-----a~~a~G~g~lRPeaa~ 332 (364) T protein:vir:10 258 NTKHHKLSNAGNGNRYDVTAGQTSAQAVLFTQDALLVGRTISITGDIFYEKKEKTWYID-----TFLAEGAIPDRWEAVA 332 (364) T ss_pred cccccccccccCCcccccccccceeEEEEEecceEEEEEEecceeeeeeccceeeeeee-----eehcccCcccCccceE Confidence 000 0000112222 333445566665556654432221122233222 223222 244466655552 Q ss_pred ----cCCCCCceEEEEEEEeceecC Q lcl|NC_018846. 384 ----PEKSGKMQDHGVIAVDTAVKL 404 (404) Q Consensus 384 ----~~~~g~~~DfGvi~idtaa~~ 404 (404) ....+..+||..|.--.--|. T Consensus 333 ~i~~~~~~~~~~~~~~~~~~~~~~~ 357 (364) T protein:vir:10 333 VVTAADTAELATDHNAILARANRKV 357 (364) T ss_pred EEEecCCCCCccchhhhhhhccccE Confidence 222333466655432221111 No 44 >protein:vir:5974 Length: 324 # NCBI annotation: hypothetical protein # Family: family:all:1522 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690674;genbank:geneid:6329212;genbank:gi:22855068;goa:Q38582;uniprot:Q38582;genbank:GeneID:955303 Probab=98.22 E-value=1.9e-07 Score=57.47 Aligned_cols=296 Identities=12% Similarity=0.118 Sum_probs=157.7 Q ss_pred HHHHhhcCchhH-HHHHhhhhhhhhhhcccccccCCCCCccEEEEeecC--CCCCcEEEEEEeeccccCc--eecCceee Q lcl|NC_018846. 17 LFTAANRNRSMV-NILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLN--KQAGDEVTFSIMHKLSKRP--TMGDERVE 91 (404) Q Consensus 17 lft~~~~n~~~~-~~~~~~l~~~~~k~s~~~~~~Gt~~~~~I~~~~dL~--k~~Gd~v~f~L~~~L~G~g--V~Gd~~le 91 (404) +.+- -..+-.+ +.|...+.....+++ .+.++|--.+...+.++- ..+|+.|+++.-..|.|+. +.+++.+. T Consensus 1 MA~T-~lsd~i~peVf~~yv~~~~~~~~---~l~qSg~i~~~a~i~~~l~~~~~G~~i~~P~~~~l~Gd~~~v~~~~~i~ 76 (324) T protein:vir:59 1 MAYT-KISDVIVPELFNPYVINTTTQLS---AFFQSGIAATDDELNALAKKAGGGSTLNMPYWNDLDGDSQVLNDTDDLV 76 (324) T ss_pred CCce-eeeceechhHHHHHHHhhhHHHH---HHhhcccccccHHHHHHhhccCCCCEEEecccccCCCcccccCCCcccc Confidence 2210 0011111 234333333222222 345555333444444442 3589999999999998774 33333333 Q ss_pred cchhhhhhceeEEEEeeccceeccCChhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccccccccc Q lcl|NC_018846. 92 GRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGDFVADDTILPTAEH 171 (404) Q Consensus 92 Gnee~L~~~s~~v~Idq~R~aV~~~g~m~~qrs~~dlrk~ar~~L~~w~~~~~D~~~~~~laG~~g~~~n~~~~~p~~~~ 171 (404) -+.|...++.-+|-....++.... .++..+.-|...++..+|++||++..+..+|..|-|+.+... ...+ T Consensus 77 --~~~l~t~~~~a~i~~~~k~~~~tD-~a~~~sg~dp~~~i~~q~a~~~~~~~~~~lia~l~g~~~~~~-------~~~~ 146 (324) T protein:vir:59 77 --PQKINAGQDKAVLILRGNAWSSHD-LAATLSGSDPMQAIGSRVAAYWAREMQKIVFAELAGVFSNDD-------MKDN 146 (324) T ss_pred --hhhcccceeeEEEEeecCceeehh-hhhhhccchHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccc-------cccc Confidence 688999999999999889887664 567788899999999999999999999999999988764210 0000 Q ss_pred cccccccccccCCCCCCcEEecCCccchhhhhhhccccHHHHHHHHHHHHHhCCCCccEEeecccccCccceEEEEecHH Q lcl|NC_018846. 172 PEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHGEDPYYVLYVTPR 251 (404) Q Consensus 172 ~~~~~~~~N~v~apt~~r~~~~~~at~~~~i~~~D~~s~~~Id~~~~~a~~~a~pi~Pv~~~g~~~~~~~~~yV~~l~p~ 251 (404) ..++.+ +++-+||.+.+.++..+. ||+ .+...+++|||. T Consensus 147 -------~~dvsa------------------~~~~~~s~~~l~~A~~~~-------------GD~---~~~~~~ivmhS~ 185 (324) T protein:vir:59 147 -------KLDISG------------------TADGIYSAETFVDASYKL-------------GDH---ESLLTAIGMHSA 185 (324) T ss_pred -------eeeeec------------------cccceecHHHHHHHHHHh-------------CCc---ccCcEEEEEchH Confidence 001111 122357777776655443 443 234689999999 Q ss_pred HHHHHhcCcchHHHHHHHHHHhhcccccCCcccccCceEEcCEEEEecCCceeeeccccceeeccccccccccccccccc Q lcl|NC_018846. 252 QWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKVLVSENNLTATTKEVAAATN 331 (404) Q Consensus 252 q~~~Lr~d~~~~~w~~~q~~A~~~~rg~~nPlF~G~~gm~ngvii~~~~~~~irf~~~~~~~~~~~~~~~~~~~~a~~~~ 331 (404) .+.+|+++- ..+|. ++. .+ .+.++.|+|+.+..--.+|.-. ++.... T Consensus 186 v~~~L~~~~-li~~~---~~s----~~------~~~i~~~~G~~VivdD~~p~~~-------------------~~~~~~ 232 (324) T protein:vir:59 186 TMASAVKQD-LIEFV---KDS----QS------GIRFPTYMNKRVIVDDSMPVET-------------------LEDGTK 232 (324) T ss_pred HHHHHHHhh-hhhhc---ccc----cc------CceeeeecccEEEEeCCCCccc-------------------cCCCCc Confidence 999999973 22232 221 11 1347889998886544333111 111122 Q ss_pred chhheeecCceeEEEeecCCCCCceeeeccccccchHHHHHH---HHhhhhhccccCC-CCC--c--e------------ Q lcl|NC_018846. 332 IDRAMLLGAQALANAYGQKAGGHFNMVEKKTDMDNRTEIAIS---WINGLKKIRFPEK-SGK--M--Q------------ 391 (404) Q Consensus 332 v~ralllGaqAl~~A~g~~~g~r~~w~Ee~~D~g~~~~i~i~---~i~G~~K~rF~~~-~g~--~--~------------ 391 (404) +-.++++|..|++..-++. ..-+|...|-..+....+. .+++++=..|..+ .++ . . T Consensus 233 ~y~s~l~~~GAi~~~~~~~----~v~vE~dRd~~~g~~~l~~r~~~~~~p~G~s~~~~~~~~~sPt~~~L~~~~NW~~v~ 308 (324) T protein:vir:59 233 VFTSYLFGAGALGYAEGQP----EVPTETARNALGSQDILINRKHFVLHPRGVKFTENAMAGTTPTDEELANGANWQRVY 308 (324) T ss_pred eEEEEEEecCeEEEeecCC----CcceecccCccccceEEEEeeEEEeEeeeEEecccccCCCCCChhhhcCCccccccc Confidence 3467999988887775542 2224554443222211100 1112222223111 000 0 0 Q ss_pred ---EEEEEEEeceecC Q lcl|NC_018846. 392 ---DHGVIAVDTAVKL 404 (404) Q Consensus 392 ---DfGvi~idtaa~~ 404 (404) --..+.+-|-..- T Consensus 309 ~~k~i~i~~~~~~~~~ 324 (324) T protein:vir:59 309 DPKKIRIVQFKHRLQA 324 (324) T ss_pred CccccceEEEEeeccC Confidence 0000000000000 No 45 >protein:vir:102944 Length: 330 # NCBI annotation: major head protein # Family: family:all:1522 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945286;genbank:gi:39653721;uniprot:Q708M6;genbank:GeneID:2672858 Probab=98.12 E-value=2e-07 Score=57.30 Aligned_cols=296 Identities=14% Similarity=0.141 Sum_probs=151.0 Q ss_pred CCc-ccchHHHHHHHHHHHHHhhcCchhHHHHHhhhhhhhhhhcccccccCCCCCccEEEE---eecCCCCCcEEEEEEe Q lcl|NC_018846. 1 MTT-VTSAQANKLYQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRI---TDLNKQAGDEVTFSIM 76 (404) Q Consensus 1 ~~~-~~~~~a~~~~~~~lft~~~~n~~~~~~~~~~l~~~~~k~s~~~~~~Gt~~~~~I~~~---~dL~k~~Gd~v~f~L~ 76 (404) |.. .|-- ...=+|. .|...+.....+++. +.++ .+|+.- ..+-.++|+.|++++- T Consensus 1 Ma~~~T~l-------------~d~i~pe--vf~~yv~~~~~~~~~---l~qS---G~i~~~~~i~~~~~~~G~~i~~P~~ 59 (330) T protein:vir:10 1 MANELTKI-------------LDTITPQ--QYNAYMQQYTAAKSA---FVQS---GIAVSDERVSKNITSGGLLVNMPFW 59 (330) T ss_pred CCCCceEe-------------eeeechh--HHHHHHHHHhHHhhh---hhhc---ccccccHHHHHHhhcCCCEEEeccc Confidence 211 0000 0000121 222222222222222 3343 344443 3333479999999999 Q ss_pred eccccCc-ee--cCceeecchhhhhhceeEEEEeeccceeccCChhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_018846. 77 HKLSKRP-TM--GDERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLA 153 (404) Q Consensus 77 ~~L~G~g-V~--Gd~~leGnee~L~~~s~~v~Idq~R~aV~~~g~m~~qrs~~dlrk~ar~~L~~w~~~~~D~~~~~~la 153 (404) ..|+|+. +. |++.++ -+.|...++..+|=....++.... ++..-+--|...++..+|++||++..+..+|..|. T Consensus 60 ~~l~G~~~~~~dg~~~i~--~~ki~t~~~~a~i~~~~k~~~~tD-~a~~~~g~dp~~~i~~q~a~~w~~~~q~~lla~l~ 136 (330) T protein:vir:10 60 NDLTGDSEVLGNGDKALE--TGKITAGADIACVLYRGRGWAANE-LTGVVAGSDPVRAILNRIGAYWLREDQKALIATLN 136 (330) T ss_pred ccCCCcccccCCCccccc--hhhcccceeEEEEEeecceeeehh-hhhhhcchhHHHHHHHHHHHHhhhhHHHHHHHHHH Confidence 9998864 33 222333 478999999999999999988765 45677888999999999999999999999999999 Q ss_pred hhhccccccccccccccccccccccccccCCCCCCcEEecCCccchhhhhhhccccHHHHHHHHHHHHHhCCCCccEEee Q lcl|NC_018846. 154 GARGDFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLS 233 (404) Q Consensus 154 G~~g~~~n~~~~~p~~~~~~~~~~~~N~v~apt~~r~~~~~~at~~~~i~~~D~~s~~~Id~~~~~a~~~a~pi~Pv~~~ 233 (404) |..++....... .+ ..|.... .-+++-.|+.+.+-++..+. T Consensus 137 gvf~~~~~~~~~-------~~---~~~~~~~----------------~~~~~a~~s~~~l~~A~~~~------------- 177 (330) T protein:vir:10 137 GIFATGTAGEKG-------AL---EETHVSD----------------QSKASTGIDAGMVLDAKQLL------------- 177 (330) T ss_pred hhhhhhhcccch-------hh---hhhheec----------------ccccccccCHHHHHHHHHHh------------- Confidence 887532111100 00 0000000 01133356766665544332 Q ss_pred cccccCccceEEEEecHHHHHHHhcCcchHHHHHHHHHHhhcccccCCcccccCceEEcCEEEEecCCceeeecccccee Q lcl|NC_018846. 234 GDELHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKVL 313 (404) Q Consensus 234 g~~~~~~~~~yV~~l~p~q~~~Lr~d~~~~~w~~~q~~A~~~~rg~~nPlF~G~~gm~ngvii~~~~~~~irf~~~~~~~ 313 (404) ||+. +...+++|||.++.+|+++- .++..++. . ..+.++.|+|+.|..--.+|. T Consensus 178 GD~~---~~~~~ivmhS~v~~~L~~~~----li~~~~~s----~------~~~~i~~~~G~~VivdD~~p~--------- 231 (330) T protein:vir:10 178 GDSA---DQVTAIAMHSAVYTKLQKDN----LIQYIQPT----T------ATINIPTYLGYRVIIDDGIAP--------- 231 (330) T ss_pred cccc---ccceEEEEcHHHHHHHHHhh----hhhhhccc----c------cCcccccccceEEEEeCCCCC--------- Confidence 2221 23689999999999999852 22332221 1 135678999988864333320 Q ss_pred ecccccccccccccccccchhheeecCceeEEEeecCCCCCceeeeccccccchHHHHHH---H---Hhhhhhcc----- Q lcl|NC_018846. 314 VSENNLTATTKEVAAATNIDRAMLLGAQALANAYGQKAGGHFNMVEKKTDMDNRTEIAIS---W---INGLKKIR----- 382 (404) Q Consensus 314 ~~~~~~~~~~~~~a~~~~v~ralllGaqAl~~A~g~~~g~r~~w~Ee~~D~g~~~~i~i~---~---i~G~~K~r----- 382 (404) .. .+-.++++|..|+++.-+... ++-..|-..|-....+..+. . ..|++... T Consensus 232 --------~~-------~~yt~yl~~~GAi~~~~~~~~--~~v~~EtdRd~~~g~~~l~~r~~~~~hp~G~s~~~~~~~~ 294 (330) T protein:vir:10 232 --------TG-------DIYTSYLFRTGSIGLNTGNPS--GLTTFETSREAAKGNDMIYTRRALVMHPYGVKWTGAEVDA 294 (330) T ss_pred --------CC-------CceeEEEEecCceeeecccCC--ccccccccCCccccceEEEEeeEEEeeeeeeeeccccccc Confidence 01 123568899888877755432 22334443332211111000 0 22222110 Q ss_pred ---ccCCC--CCce---------EEEEEEEeceecC Q lcl|NC_018846. 383 ---FPEKS--GKMQ---------DHGVIAVDTAVKL 404 (404) Q Consensus 383 ---F~~~~--g~~~---------DfGvi~idtaa~~ 404 (404) +|++. .+.. .-..+.+-| || T Consensus 295 ~~~sPt~~~L~~~~NW~~v~~~k~i~iv~~~~--~~ 328 (330) T protein:vir:10 295 GNITPSNADLAKFKNWKRVYEPKNIGIIALKH--KI 328 (330) T ss_pred CcCCcChHHhcCCcCcccccChhhcceEEEEE--ec Confidence 00000 0001 111111111 11 No 46 >protein:vir:1583 Length: 351 # NCBI annotation: minor capsid protein # Family: family:all:1522 # MgeID: mge:32 # MgeName: phig1e # Cross-refs: genbank:acc:NP_695165;swissprot:trembl:o03966;genbank:gi:23455804;uniprot:O03966;genbank:GeneID:955561 Probab=98.09 E-value=1.5e-07 Score=57.95 Aligned_cols=296 Identities=10% Similarity=0.094 Sum_probs=155.2 Q ss_pred CCcccchHHHHHHHHHHHHHhhcCchhHHHHHhhhhhhhhhhcccccccCCCCCccEEEE---eecCCCCCcEEEEEEee Q lcl|NC_018846. 1 MTTVTSAQANKLYQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRI---TDLNKQAGDEVTFSIMH 77 (404) Q Consensus 1 ~~~~~~~~a~~~~~~~lft~~~~n~~~~~~~~~~l~~~~~k~s~~~~~~Gt~~~~~I~~~---~dL~k~~Gd~v~f~L~~ 77 (404) |.+-.-+ ..=.| +.|...+.....+++ .+++. .+|+.. ..+-.++|+.|+|++-. T Consensus 1 MA~T~ls--------------d~i~P--Evf~~yv~~~~~~~~---~l~qS---G~i~~~~~l~~~~~~~G~~it~P~~~ 58 (351) T protein:vir:15 1 MAETHLS--------------DLIVP--EVFGNYVVNQIIKTN---RFVQS---GILTPDPDLGPHLLEAGTRITVPFLN 58 (351) T ss_pred CCceeee--------------eeech--hHHHHHHhhhhHHhh---hHhhc---ccccccHHHHHHhhcCCCEEEecccc Confidence 4321100 00112 122222222222222 22333 345443 33334799999999999 Q ss_pred ccccCce--ecCceeecchhhhhhceeEEEEeeccceeccCChhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhhh Q lcl|NC_018846. 78 KLSKRPT--MGDERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGA 155 (404) Q Consensus 78 ~L~G~gV--~Gd~~leGnee~L~~~s~~v~Idq~R~aV~~~g~m~~qrs~~dlrk~ar~~L~~w~~~~~D~~~~~~laG~ 155 (404) .|+|++- .++..++ .+.|...++.-+|-....++.... ++...+.-|+..++..+|++||++..+..+|..|.|+ T Consensus 59 ~l~Gd~~~~~~~~~i~--~~kitt~~~~a~i~~~~kg~~~tD-~a~~~sg~dp~~~i~~q~a~~w~~~~q~~lla~l~gv 135 (351) T protein:vir:15 59 DLTGDPDNWTDSDDID--VNNLTSGKQQGIKFYQTKAYGYTD-LGTMISGAPVQETIGNRFAAFWQRADQKTLLSVLKGV 135 (351) T ss_pred cCCCcccccCCCcccc--hheecccceeEEEEeeccceehhh-hhHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 9988753 3333333 578999999999999889988764 6677888899999999999999999999999999987 Q ss_pred hccccccccccccccccccccccccccCCCCCCcEEecCCccchhhhhhhccccHHHHHHHHHHHHHhCCCCccEEeecc Q lcl|NC_018846. 156 RGDFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGD 235 (404) Q Consensus 156 ~g~~~n~~~~~p~~~~~~~~~~~~N~v~apt~~r~~~~~~at~~~~i~~~D~~s~~~Id~~~~~a~~~a~pi~Pv~~~g~ 235 (404) .+... +.| + |.+ +.+. .-.++-+||.+.+.++..+. || T Consensus 136 ~~~~~-----------------~~~-------~-~~~--d~t~--~~~~~~~is~~~l~~A~~~~-------------GD 173 (351) T protein:vir:15 136 MGVTK-----------------IAN-------S-KVY--DQTK--VSPSEPMFGAKGFTGAIGLM-------------GD 173 (351) T ss_pred hhchh-----------------hcc-------c-cee--cccc--ccccccccCHHHHHHHHHHh-------------cc Confidence 65211 000 0 111 1110 11244467877776655544 22 Q ss_pred cccCccceEEEEecHHHHHHHhcCcchHHHHHHHHHHhhcccccCCcccccCceEEcCEEEEecCCceeeeccccceeec Q lcl|NC_018846. 236 ELHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKVLVS 315 (404) Q Consensus 236 ~~~~~~~~yV~~l~p~q~~~Lr~d~~~~~w~~~q~~A~~~~rg~~nPlF~G~~gm~ngvii~~~~~~~irf~~~~~~~~~ 315 (404) +. ++...+++|||..+.+|+++- .++..++. .+ .+.+|.|+|+.|..-..+|.-. T Consensus 174 ~~--~~~~~~ivmhS~v~~~L~~~~----li~~~~~s----~~------~~~i~t~~G~~VivdD~~p~~~--------- 228 (351) T protein:vir:15 174 LQ--DTAFGAIAVNSATYSLMKVQG----LIETIQPQ----NG------ATPFEAYNGLRIVLDDDIEIDL--------- 228 (351) T ss_pred cc--ccceEEEEEChHHHHHHHhhh----hhhhcccc----cc------CcccceecceEEEEcCCCcccc--------- Confidence 21 123689999999999999873 23333322 11 1347899998886554443110 Q ss_pred ccccccccccccccccchhheeecCceeEEEeecCCCCCceeeeccccccchHH----HH-HHHHhhhhhccccCC---C Q lcl|NC_018846. 316 ENNLTATTKEVAAATNIDRAMLLGAQALANAYGQKAGGHFNMVEKKTDMDNRTE----IA-ISWINGLKKIRFPEK---S 387 (404) Q Consensus 316 ~~~~~~~~~~~a~~~~v~ralllGaqAl~~A~g~~~g~r~~w~Ee~~D~g~~~~----i~-i~~i~G~~K~rF~~~---~ 387 (404) . ++...+-.++|+|..|+++.=+. ++ +|-..|....-+ +. ...+++..=..|... . T Consensus 229 ---~-------~~~~~~ytsyl~~~GAi~~~~~~----~~--ve~~rd~~~~~g~d~l~~r~~~~~hp~G~s~~~~~~~~ 292 (351) T protein:vir:15 229 ---T-------DKTKPVSTSYIFAPGAVRYSTNM----RS--TETKYDPLINGGQDVIVQKRVGTIHVAGTSIKASFSPS 292 (351) T ss_pred ---C-------CCCCceeEEEEEecceeeeecCC----cC--cceeecccCCCCceEEEEeeeeeeeeeeeeeccccccc Confidence 0 11112335688888887753221 11 232222221100 00 011122222223110 0 Q ss_pred C----------C-----------ceEEEEEEEeceecC Q lcl|NC_018846. 388 G----------K-----------MQDHGVIAVDTAVKL 404 (404) Q Consensus 388 g----------~-----------~~DfGvi~idtaa~~ 404 (404) + + .+--+.+.+-|-... T Consensus 293 ~~~sPt~~~L~~~~NW~~v~~~d~k~I~iv~~~~~~~~ 330 (351) T protein:vir:15 293 KASFPTIDELAKSSTWEVVDGIDVRSIGVVAYTAQLDP 330 (351) T ss_pred CcCCcChHHhcCCcccccccCCCccccceEEEEEecCc Confidence 0 0 012222222222211 No 47 >protein:vir:79008 Length: 299 # NCBI annotation: putative main capsid protein # Family: family:all:701 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110725;genbank:gi:134287342;genbank:GeneID:4955182 Probab=97.77 E-value=2e-05 Score=46.29 Aligned_cols=280 Identities=9% Similarity=-0.016 Sum_probs=145.6 Q ss_pred hcCchhHHHHHhhhhhhhhhhcccccccCCCCCccEEEEeecCCCCCcEEEEEEeeccccCceecCceee--c-chhhhh Q lcl|NC_018846. 22 NRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLSKRPTMGDERVE--G-RGEDLS 98 (404) Q Consensus 22 ~~n~~~~~~~~~~l~~~~~k~s~~~~~~Gt~~~~~I~~~~dL~k~~Gd~v~f~L~~~L~G~gV~Gd~~le--G-nee~L~ 98 (404) +..--..++|+..|.....+.+.+...-+...+.-|. -..|++|.++-+ +..++ +|-... | +.++++ T Consensus 1 MA~~n~a~~~~~~Ld~~~~~~l~~~~L~~~~~~~~v~------~~gg~tVkI~~i---~~~gl-~DY~R~~~g~~~g~~~ 70 (299) T protein:vir:79 1 MAALNYAKEYSNVLAQAYPYTLNFGDLYATPNNGRYR------WTGSKTIEIPTI---STTGR-VDSNRDTIAVAQRNYD 70 (299) T ss_pred CccchhHHHHHHHHHHHHHhhceeeeeccCcccceee------ecCCCEEEEecc---ccccc-cccccCCCcccccccC Confidence 2211134788888877777666654333333222221 134899997744 33333 454432 2 345788 Q ss_pred hceeEEEEeecc---ceeccCChhhhhhhhhhH--HHHHHHHHHHHHHHHHHHHHHHHHh-hhhcccccccccccccccc Q lcl|NC_018846. 99 HADFSLKINQGR---HLVDAGGRMSQQRTKFNL--ASSARTLLGTYFNDLQDQCAIVHLA-GARGDFVADDTILPTAEHP 172 (404) Q Consensus 99 ~~s~~v~Idq~R---~aV~~~g~m~~qrs~~dl--rk~ar~~L~~w~~~~~D~~~~~~la-G~~g~~~n~~~~~p~~~~~ 172 (404) ....++.+||.| +.|+ .|+...+...+ -...+....+...-.+|.-.|-.|+ ++.+ T Consensus 71 ~~~~t~~ldqdr~~~f~vD---~~Dvdet~~~~~~a~v~~~~~~~~v~pEiDay~~skl~~~a~~--------------- 132 (299) T protein:vir:79 71 NAWEPKVLTNQRKWSTLVH---PADINQTNYVASIGNITKVYNEEQKFPEMDAYCISKIYADWTA--------------- 132 (299) T ss_pred cceeEEEeeccccceeccc---hhhHHHHhhhhHHHHHHHHHHHHHhhhHhhHHHHHHHHHhhhh--------------- Confidence 899999999999 3344 34444333322 2233344444455556665555543 2110 Q ss_pred ccccccccccCCCCCCcEEecCCccchhhhhhhccccHHHHHHHHHHHHHhCCCCccEEeecccccCccceEEEEecHHH Q lcl|NC_018846. 173 EFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHGEDPYYVLYVTPRQ 252 (404) Q Consensus 173 ~~~~~~~N~v~apt~~r~~~~~~at~~~~i~~~D~~s~~~Id~~~~~a~~~a~pi~Pv~~~g~~~~~~~~~yV~~l~p~q 252 (404) .+..++...++++.. ++.|+.+.+++++..-|-. . +||+++|.. T Consensus 133 --------------------~g~~~~~~~~T~~n~--y~~i~~~~~~lde~~vP~~-------~-------rvl~vtp~~ 176 (299) T protein:vir:79 133 --------------------LGNTADTTVLTTTNV--LEVFDKLMEKMTEARVPEN-------G-------RILYVTPVV 176 (299) T ss_pred --------------------cCCcccccccCHHHH--HHHHHHHHHHHHhcCCCCC-------C-------eEEEeCHHH Confidence 001112223444443 5788888888887765521 1 799999999 Q ss_pred HHHHhcCcchHHHHHHHHHHhhcccccCCcccccCceEEcCEEEEecCCceeeeccccceeecccccccccccccccccc Q lcl|NC_018846. 253 WNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKVLVSENNLTATTKEVAAATNI 332 (404) Q Consensus 253 ~~~Lr~d~~~~~w~~~q~~A~~~~rg~~nPlF~G~~gm~ngvii~~~~~~~irf~~~~~~~~~~~~~~~~~~~~a~~~~v 332 (404) +.-|+.++.+ ++.-.. ...+.+..|.+|.+||+.|.+.|. .|++..-.+ ..+.++.+ .+- T Consensus 177 ~~~L~~~~~f------~k~~~~---~~~~~~~~g~Vg~idG~~Ii~Vps--~r~~t~~~~-----~~G~~~~~----~ak 236 (299) T protein:vir:79 177 NTLIKNAKEI------QRTVNI---KDAGTSLNRQTTDIDTVKIIKVPS--NLMKTAYDF-----TTGWKVGA----GAK 236 (299) T ss_pred HHHHhhchhh------hccccc---ccccceeeeeeeeecceEEEEech--hhcCcccee-----ccCccccC----ccc Confidence 9999999864 222111 224567899999999999999886 466532110 01111111 123 Q ss_pred hhheeecCceeEEEeecCCCCCceeeeccccccchHHHHHHHHhhhhhccccCCCCCceEEEEEEEeceecC Q lcl|NC_018846. 333 DRAMLLGAQALANAYGQKAGGHFNMVEKKTDMDNRTEIAISWINGLKKIRFPEKSGKMQDHGVIAVDTAVKL 404 (404) Q Consensus 333 ~ralllGaqAl~~A~g~~~g~r~~w~Ee~~D~g~~~~i~i~~i~G~~K~rF~~~~g~~~DfGvi~idtaa~~ 404 (404) .+.+||....+.++.-|....+.+ +=|....- +++. +=|.. |.+++++...+- T Consensus 237 ~in~ii~~~~a~~~~~K~~~~~~~------~P~~~~~~--~~~~---~~r~y--------~d~~v~~nk~~~ 289 (299) T protein:vir:79 237 QIFMSLVHPSAIITPVSYQFSKLD------EPTAVTEG--KYFY---FEESF--------EDVFILNKKADA 289 (299) T ss_pred ccceEEEcCCeeeeeEeeeeEEee------cCCCCCcc--ceee---eeeee--------eeeeeeccccCe Confidence 467888888888887653222211 10000000 0000 01111 122222222222 No 48 >protein:vir:105645 Length: 400 # NCBI annotation: putative major capsid protein # Family: family:all:2806 # MgeID: mge:1674 # MgeName: K1E # Cross-refs: genbank:acc:YP_425009;genbank:gi:83571757;uniprot:Q2WC43;genbank:GeneID:3837286 Probab=97.73 E-value=2.4e-05 Score=45.93 Aligned_cols=334 Identities=13% Similarity=0.027 Sum_probs=168.3 Q ss_pred CCcccchHHHHHHHHHHHHHhhcCch---hHHHHHhhhhhhhhhhcccccccCCCCCccEEEEeecCCCCCcEEEEEEee Q lcl|NC_018846. 1 MTTVTSAQANKLYQVALFTAANRNRS---MVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMH 77 (404) Q Consensus 1 ~~~~~~~~a~~~~~~~lft~~~~n~~---~~~~~~~~l~~~~~k~s~~~~~~Gt~~~~~I~~~~dL~k~~Gd~v~f~L~~ 77 (404) ||+..+-.= -.|+ +.++. .+|.|.+.+...-+..+.|..+ ..++ .+ ..|.++.|+-+. T Consensus 1 Ms~~n~~t~-p~~~-------gsg~~~aL~Le~f~GeV~taF~~~si~~~~------~~vR---tI--~~gkS~qf~~lG 61 (400) T protein:vir:10 1 MSTPNNLTN-VAVS-------ASGEVDSLLIEKFNGKVNEQYLKGENIMSY------FDVQ---TV--TGTNTVSNKYLG 61 (400) T ss_pred CCCCccccc-cccc-------cccchhhhHHhHhcchHHHHHHHHhhhccc------ceee---ee--cccceEEEEEee Confidence 887644211 1111 11111 3688999887777766665322 2233 33 567899999998 Q ss_pred ccccCceecCceeecchhhhhhceeEEEEeecc---ceeccCChhhhhhhhhh-HHHHHHHHHHHHHHHHHHHHHHHH-- Q lcl|NC_018846. 78 KLSKRPTMGDERVEGRGEDLSHADFSLKINQGR---HLVDAGGRMSQQRTKFN-LASSARTLLGTYFNDLQDQCAIVH-- 151 (404) Q Consensus 78 ~L~G~gV~Gd~~leGnee~L~~~s~~v~Idq~R---~aV~~~g~m~~qrs~~d-lrk~ar~~L~~w~~~~~D~~~~~~-- 151 (404) ..+-.+.+-.+.+.|+ ........|.||... |.|. .+++-...+| +|.+--..+..=+++..||.+|-+ T Consensus 62 ~s~a~y~~pG~~ldg~--~~~~dk~~ItIDtLL~a~~~V~---dlDd~q~~yD~vRse~s~e~G~ALA~~~Dq~iiq~i~ 136 (400) T protein:vir:10 62 ETELQVLAPGQSPAAT--STQADKNQLVIDATVIARNTVA---HLHDVQGDIDSLKPKLATNQAKQLKKMEDEMLIQQML 136 (400) T ss_pred eeEEeeecCCCCcCCC--CcccCcEEEEeCceeeecchhh---hHHHHhhccccccHHHHHHHHHHHHHHHHHHHHHHHH Confidence 8877777777778877 467777789999865 5554 5888899999 899999999999999999977743 Q ss_pred HhhhhccccccccccccccccccccccccccCCCCCCcEEecCCccchhhhhhhccccHHHHHHHHHHHHHhCCCCccEE Q lcl|NC_018846. 152 LAGARGDFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVR 231 (404) Q Consensus 152 laG~~g~~~n~~~~~p~~~~~~~~~~~~N~v~apt~~r~~~~~~at~~~~i~~~D~~s~~~Id~~~~~a~~~a~pi~Pv~ 231 (404) +++.. + +..|. .+.+ ...+...=.+. +++ .+.+...+.|.-.+.+. ...+++..-|. T Consensus 137 ~a~~a----~--t~~~~----~~~~-----g~~~g~s~~v~--~~~-~~~~~~~~~l~~A~~~A-~~~LdEkdVP~---- 193 (400) T protein:vir:10 137 LGGIA----N--TQAKR----TNPR-----VKGHGFSVNVE--VNE-GEALVNPQYVMAAVEFA-LEQQLEQEVDI---- 193 (400) T ss_pred Hhccc----c--ccccc----ccCC-----ccccccceeec--ccc-cccccCHHHHHHHHHHH-HHHHHhcCCCc---- Confidence 33321 1 01111 0100 00000000000 111 11222333333334443 33344333331 Q ss_pred eecccccCccceEEEEecHHHHHHHhcCcchHHHHHHHHHHhhcccccCCcccccCceEEcCEEEEecCCceeeeccccc Q lcl|NC_018846. 232 LSGDELHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSK 311 (404) Q Consensus 232 ~~g~~~~~~~~~yV~~l~p~q~~~Lr~d~~~~~w~~~q~~A~~~~rg~~nPlF~G~~gm~ngvii~~~~~~~irf~~~~~ 311 (404) + -+|+|++|..|.-|+..+-. . +...... ..+..=+|.+++++||.|.|-++.|-.--.... T Consensus 194 --~--------d~vvl~pp~~Ys~Ll~~dkL---v----nrdf~~s-~~g~~~~g~v~~v~Gv~Iv~Sn~lP~~a~~~~~ 255 (400) T protein:vir:10 194 --S--------DVAILMPWRYFNVLRDADRI---V----DKSYTIS-QSGATIQGFVLSSYNCPVIPSNRFPKYSQGQKH 255 (400) T ss_pred --c--------ceEEEcCHHHHHHHHhCCcc---c----chhcccc-CCCccccceEEEEeceEEEeeCcCCcccCcccc Confidence 1 27888888888887765421 1 1111111 135556788999999999999988721111111 Q ss_pred eeecccccccccccccccccchhheeecCceeEEEeecCCCCCceeeeccccccchHHHHHHHHhhhhhcccc------- Q lcl|NC_018846. 312 VLVSENNLTATTKEVAAATNIDRAMLLGAQALANAYGQKAGGHFNMVEKKTDMDNRTEIAISWINGLKKIRFP------- 384 (404) Q Consensus 312 ~~~~~~~~~~~~~~~a~~~~v~ralllGaqAl~~A~g~~~g~r~~w~Ee~~D~g~~~~i~i~~i~G~~K~rF~------- 384 (404) ...++.+.+..- .+.+.+.-.+++++=..|++.+=...--+++++.++. .-.-|-....+|+.-.|-. T Consensus 256 ~~lS~a~~G~~y-~~t~d~s~~~av~F~~sAv~tvk~~~lt~~~~~d~r~----~~~~id~~~a~G~g~~RPeaa~vv~~ 330 (400) T protein:vir:10 256 HLLSNEDNGYRY-DPIAEMNGAIAVLFTADALLVGRSIDVIGDIFYEKKE----KTYYIDTFMSEGAIPDRWEAVSVVTT 330 (400) T ss_pred cccccCCCCccC-CccccccceeEEEEehhheEEEEeeccccccccchhh----HHHHHHHHHHhCCcccchhheEEEEe Confidence 222222211111 1123444556677666666664222111232222211 1123444556676666642 Q ss_pred -------CCCCCceEEEEE---------EEeceecC Q lcl|NC_018846. 385 -------EKSGKMQDHGVI---------AVDTAVKL 404 (404) Q Consensus 385 -------~~~g~~~DfGvi---------~idtaa~~ 404 (404) .+.|++.||..| -+-++++- T Consensus 331 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 366 (400) T protein:vir:10 331 KRQSTGAVDSGNAAQHTQVLNRAQRKAVYVKNAAPA 366 (400) T ss_pred cCCcccccccCcchhHHHHHhhcccceEEEeccccc Confidence 011111222111 11111111 No 49 >protein:vir:7019 Length: 401 # NCBI annotation: major capsid protein # Family: family:all:2806 # MgeID: mge:141 # MgeName: SP6 # Cross-refs: genbank:acc:NP_853592;genbank:gi:31711674;genbank:GeneID:1481800 Probab=97.71 E-value=2.7e-05 Score=45.67 Aligned_cols=333 Identities=13% Similarity=0.049 Sum_probs=163.5 Q ss_pred CCcccchHHHHHHHHHHHHHhhcCch---hHHHHHhhhhhhhhhhcccccccCCCCCccEEEEeecCCCCCcEEEEEEee Q lcl|NC_018846. 1 MTTVTSAQANKLYQVALFTAANRNRS---MVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMH 77 (404) Q Consensus 1 ~~~~~~~~a~~~~~~~lft~~~~n~~---~~~~~~~~l~~~~~k~s~~~~~~Gt~~~~~I~~~~dL~k~~Gd~v~f~L~~ 77 (404) ||...+-. ...|+ +.++. .+|.|.+.+...-+..+.|..+ ..++ .+ ..|.++.|+-+. T Consensus 1 Ms~~n~~t-~~~~~-------~sg~~~al~Le~f~GeV~taF~~~si~~~~------~~vR---ti--~~gkS~qf~~~G 61 (401) T protein:vir:70 1 MSTPNNLT-NVAVS-------ASGEVDSLLIEKFNGKVNEQYLKGENIMSY------FDVQ---TV--TGTNTVSNKYLG 61 (401) T ss_pred CCCCcccc-ccccc-------cccchhHhHHhHhcchHHHHHHHHhhhccc------ceee---ee--cccceEEEEEee Confidence 88764421 11121 11222 4688999887777766665322 2233 34 567899999997 Q ss_pred ccccCceecCceeecchhhhhhceeEEEEeecc---ceeccCChhhhhhhhhh-HHHHHHHHHHHHHHHHHHHHHHHH-- Q lcl|NC_018846. 78 KLSKRPTMGDERVEGRGEDLSHADFSLKINQGR---HLVDAGGRMSQQRTKFN-LASSARTLLGTYFNDLQDQCAIVH-- 151 (404) Q Consensus 78 ~L~G~gV~Gd~~leGnee~L~~~s~~v~Idq~R---~aV~~~g~m~~qrs~~d-lrk~ar~~L~~w~~~~~D~~~~~~-- 151 (404) ..+-...+-.+.+.|+ ........|.||... |.|. .+++-.+.+| +|.+--..+..=+++..||.++-. T Consensus 62 ~s~~~~~~pG~~ld~~--~~~~dK~~ItID~lL~a~~~V~---dlDe~q~~yD~vRse~s~e~G~ALA~~~Dq~iiq~i~ 136 (401) T protein:vir:70 62 ETELQVLAPGQSPAAT--STQADKNQLVIDATVIARNTVA---HLHDVQGDIDSLKPKLATNQAKQLKRMEDEMLIQQMM 136 (401) T ss_pred eeEeeeecCCCCcCCC--CcccccEEEEeCceeehhhhhh---hHHHHHhcccccchHHHHHHHHHHHHHHHHHHHHHHH Confidence 7776666666667764 566777789999865 5554 5888889999 899999999999999999977433 Q ss_pred HhhhhccccccccccccccccccccccccccCCCCCCcEEecCCccchhhhhhhccccHHHHHHHHHHHHHhCCCCccEE Q lcl|NC_018846. 152 LAGARGDFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVR 231 (404) Q Consensus 152 laG~~g~~~n~~~~~p~~~~~~~~~~~~N~v~apt~~r~~~~~~at~~~~i~~~D~~s~~~Id~~~~~a~~~a~pi~Pv~ 231 (404) ++|... . .|.. .|+.-.+ ....+-.+++....+++.. .+--.+++ +...+++..-|. T Consensus 137 ~aa~an----a---~~~~---------~~p~~~~-~G~~i~v~~~~~~~~~~~~-~l~~ai~d-A~~~LdEkdVP~---- 193 (401) T protein:vir:70 137 LGGIAN----T---QAKR---------TNPRVKG-HGFSINVEVAEGEALVNPQ-YVMAAVEF-ALEQQLEQEVDI---- 193 (401) T ss_pred Hhcccc----c---cccc---------cCCCcCC-CceEEeccccccccccCHH-HHHHHHHH-HHHHHHhcCCCc---- Confidence 344321 0 0000 0000000 0111111222212112111 12122333 333344444331 Q ss_pred eecccccCccceEEEEecHHHHHHHhcCcchHHHHHHHHHHhhcccccCCcccccCceEEcCEEEEecCCceeeeccccc Q lcl|NC_018846. 232 LSGDELHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSK 311 (404) Q Consensus 232 ~~g~~~~~~~~~yV~~l~p~q~~~Lr~d~~~~~w~~~q~~A~~~~rg~~nPlF~G~~gm~ngvii~~~~~~~irf~~~~~ 311 (404) + -||+|++|..|+-|..-+.. .+ +..... ..+..=.|.+++++||.|.+-++.|-.-..... T Consensus 194 --~--------r~vvl~pp~~Ys~Ll~~d~L---~n----rd~~~s-~~g~~~~G~v~~vaGv~Vv~SnnlP~~a~~it~ 255 (401) T protein:vir:70 194 --S--------DVAILMPWRYFNVLRDADRI---VD----KTYTIS-QSGATIQGFTLSSYNCPVIPSNRFPKYSQGQTH 255 (401) T ss_pred --c--------ceEEEcCHHHHHHHHhcCcc---cc----hhhccc-cCCccccceEEEEeceEEEeecccccccccccc Confidence 1 28999999999888775531 11 111111 134455788999999999999987621000001 Q ss_pred eeecccccccccccccccccchhheeecCceeEEEeecCCCCCceeeeccccccch-HHHHHHHHhhhhhccc------- Q lcl|NC_018846. 312 VLVSENNLTATTKEVAAATNIDRAMLLGAQALANAYGQKAGGHFNMVEKKTDMDNR-TEIAISWINGLKKIRF------- 383 (404) Q Consensus 312 ~~~~~~~~~~~~~~~a~~~~v~ralllGaqAl~~A~g~~~g~r~~w~Ee~~D~g~~-~~i~i~~i~G~~K~rF------- 383 (404) ...++.+.+.. ..+.+.+.-.+++++=..|++.+=...--+++ |.|+ .++ .-|-....+|..-.|- T Consensus 256 ~~ls~a~~G~~-y~~~~d~s~~~~v~f~~~Av~tvk~~~lt~~~-~~d~----r~~~~~id~~~a~g~g~~RPeaa~vv~ 329 (401) T protein:vir:70 256 HLLSNEDNGYR-YDPLPAMNGAIAVLFTADALLVGRSIDVTGDI-FYEK----KEKTYYIDTFMAEGAIPDRWEAVSVVT 329 (401) T ss_pred ccccccCCCcc-CCCCccccceeEEEEehhheEEEEeeccccch-hhhh----hhhHHHHHHHHHhCCcccchhheEEEe Confidence 11111111111 11123445556677666666664222111222 2222 111 1222444555554443 Q ss_pred ---cCCCC-----CceEEEEEEEeceecC Q lcl|NC_018846. 384 ---PEKSG-----KMQDHGVIAVDTAVKL 404 (404) Q Consensus 384 ---~~~~g-----~~~DfGvi~idtaa~~ 404 (404) .-..+ .+.||-.+-+--+-|- T Consensus 330 ~k~~~~~~~~~~~~~~~~~~~~~~~~~~~ 358 (401) T protein:vir:70 330 TKRNTTTGAVEGTDGAQHTIVKNRAQRKA 358 (401) T ss_pred ecCcccccccccCCcchhhhhhhhcccee Confidence 11000 0111222111111111 No 50 >protein:vir:99075 Length: 392 # NCBI annotation: gp30 # Family: family:all:10837 # MgeID: mge:1671 # MgeName: Wildcat # Cross-refs: genbank:acc:YP_655895;genbank:gi:109521467;genbank:GeneID:4158040 Probab=97.61 E-value=3.8e-05 Score=44.82 Aligned_cols=298 Identities=11% Similarity=0.018 Sum_probs=137.0 Q ss_pred hcCchh-HHHHHhhhhhhhhhhcccccccCCCCCccEEEEeecCCCCCcEEEEEEeeccccCceecCcee---ecchhhh Q lcl|NC_018846. 22 NRNRSM-VNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLSKRPTMGDERV---EGRGEDL 97 (404) Q Consensus 22 ~~n~~~-~~~~~~~l~~~~~k~s~~~~~~Gt~~~~~I~~~~dL~k~~Gd~v~f~L~~~L~G~gV~Gd~~l---eGnee~L 97 (404) +.|.-. -++|+..+...-.+..-|....-+.- -.|+.-..||+|+++...........-.... +...+++ T Consensus 1 Ma~~~~~p~~~a~~~l~~l~~~lv~~~lv~~~~------~~~~~~~~GdtV~i~~~~~~~~~~~~~~~~~~~~~~~~~~~ 74 (392) T protein:vir:99 1 MANAFSKPTAVVDTAIQMLQNELILTNLVWLNG------IGDFAHKFNDTITVRVPAPSRGHTRKLRGAGAERNLTVSDF 74 (392) T ss_pred CccccccHHHHHHHHHHHHHhhccchhhhcccc------ccccccCCCCeEEEeecccccceeeeccccccCCccccccc Confidence 444332 24677664444333333322111110 1244335799999987776654433322222 2334677 Q ss_pred hhceeEEEEeeccc-eeccCChhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccccccccccccccc Q lcl|NC_018846. 98 SHADFSLKINQGRH-LVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGDFVADDTILPTAEHPEFKK 176 (404) Q Consensus 98 ~~~s~~v~Idq~R~-aV~~~g~m~~qrs~~dlrk~ar~~L~~w~~~~~D~~~~~~laG~~g~~~n~~~~~p~~~~~~~~~ 176 (404) .-...++.||+.++ ++...+ .+.-....|++++.-+....=+++..|+.++..++++... .. T Consensus 75 ~~~~~~~~id~~k~~~~~i~d-~e~~~~~~~~~~~~~~~a~~ala~~vd~~i~~~~~~a~~~----------------~~ 137 (392) T protein:vir:99 75 TEDSFPVTLTDVAYHLGVLTD-EELTFDLESFATQILPRQVRGVADILEEGVRDMIVGAPYE----------------AA 137 (392) T ss_pred ccceEEEEEeeeeecceeech-HHHhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhccccc----------------cc Confidence 77888999988874 455553 4555678899888777777778888998887666654320 00 Q ss_pred ccccccCCCCCCcEEecCCccchhhhhhhccccHHHHHHHHHHHHHhCCCCccEEeecccccCccceEEEEecHHHHHHH Q lcl|NC_018846. 177 IMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHGEDPYYVLYVTPRQWNDW 256 (404) Q Consensus 177 ~~~N~v~apt~~r~~~~~~at~~~~i~~~D~~s~~~Id~~~~~a~~~a~pi~Pv~~~g~~~~~~~~~yV~~l~p~q~~~L 256 (404) .. ...++.+ ..++.|-.+..++++..-| +| ++++++|..+..| T Consensus 138 ------~~--------------~~~~~~~--~~~~~i~~a~~~L~~~~vP------~~---------R~~vv~p~~~~~l 180 (392) T protein:vir:99 138 ------GA--------------VHEVAPD--EFFKGVNGARRALNELYIP------QG---------RVLVVGTAVTEQI 180 (392) T ss_pred ------cc--------------ccccChh--hhHHHHHHHHHHHhhcCCC------CC---------CEEEEcHHHHHHH Confidence 00 0001111 1245555677777776655 12 4677899999999 Q ss_pred hcCcchHHHHHHHHHHhhcccccCCcccccCceEEcCEEEEecCCceeeeccccceeecccccccccccccccccchhhe Q lcl|NC_018846. 257 YTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKVLVSENNLTATTKEVAAATNIDRAM 336 (404) Q Consensus 257 r~d~~~~~w~~~q~~A~~~~rg~~nPlF~G~~gm~ngvii~~~~~~~irf~~~~~~~~~~~~~~~~~~~~a~~~~v~ral 336 (404) ++|+.+..+.... ......|-.|.+|.+.|+-+.+..+.|.. .+ +.+ +.........++ .. T Consensus 181 ~~~~~~~~~~~~g-------~~~~~~l~~G~vg~i~G~~v~~s~~~~~~--t~--~a~--~~~a~~~at~a~------v~ 241 (392) T protein:vir:99 181 LNDDRFIKYESQG-------QSAVSALQEARLGRIYGYEIVESTLIPHG--DA--YLY--HPTAFIMATRAP------AP 241 (392) T ss_pred hcccceeeccccc-------chhhhhhhcceeeeeeeeEEEeecccccc--cc--eee--eccccccccccc------cc Confidence 9998753222110 01124466799999999999887765411 10 000 000000000000 01 Q ss_pred eecCceeEEEeecCCCCCceeeeccccccchH---HHHHHHHhhhhhccccCCCCC--ceEEEEEEEeceecC Q lcl|NC_018846. 337 LLGAQALANAYGQKAGGHFNMVEKKTDMDNRT---EIAISWINGLKKIRFPEKSGK--MQDHGVIAVDTAVKL 404 (404) Q Consensus 337 llGaqAl~~A~g~~~g~r~~w~Ee~~D~g~~~---~i~i~~i~G~~K~rF~~~~g~--~~DfGvi~idtaa~~ 404 (404) ..|+..... +.........| ..||.... ...+....|...+. ...+. .....+-+...-+.+ T Consensus 242 ~~~~~~~~s-~s~~~~v~~~~---~~~~~~t~~s~~~~v~~~~g~~~v~--~~~~~~~~~~~~~~~~~~~v~v 308 (392) T protein:vir:99 242 PMGAVRSTA-ISGDQRIAMRW---LVDYDSTITSNRSLIDTYFGLKVVE--DPNGVGFVRARKIHLIPGSIEV 308 (392) T ss_pred cccccceeE-Eecccceecce---eecccceeeccccccceeEEEEEEe--eccccceeeeeeeeeecceeee Confidence 112111111 11000001112 12222211 11122222222111 00000 011111111000011 No 51 >protein:vir:80446 Length: 367 # NCBI annotation: BcepGomrgp07 # Family: family:all:1522 # MgeID: mge:1882 # MgeName: BcepGomr # Cross-refs: genbank:acc:YP_001210227;genbank:gi:146329919;genbank:GeneID:5123555 Probab=96.86 E-value=6.2e-05 Score=43.64 Aligned_cols=321 Identities=14% Similarity=0.124 Sum_probs=156.1 Q ss_pred CCcccchHHHHHHHHHHHHHhhcCchhH-HHHHhhhhhhhhhhcccccccCCCCCccEEEEeecC---CCCCcEEEEEEe Q lcl|NC_018846. 1 MTTVTSAQANKLYQVALFTAANRNRSMV-NILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLN---KQAGDEVTFSIM 76 (404) Q Consensus 1 ~~~~~~~~a~~~~~~~lft~~~~n~~~~-~~~~~~l~~~~~k~s~~~~~~Gt~~~~~I~~~~dL~---k~~Gd~v~f~L~ 76 (404) |-.. .+..+. + ..=.|.| ..|..... .+++ .+.++ .+|+.-.+|. .+.|+.|++++. T Consensus 1 M~~~---~~~T~l-~------Dii~pEvF~~Yv~~~~---~e~~---~l~qS---Giv~~d~~l~~~~~~gG~~v~iPf~ 61 (367) T protein:vir:80 1 MPDF---NNQVRL-V------DAVIPEVYTSYTAIDR---PELT---AFFLS---GAVASNDFLSQFLSAPGRLINIPFW 61 (367) T ss_pred Ccch---hhhhhh-h------hccchhhhhHHHhhhh---hhhh---hhhhc---ceeecCHHHHHHhhcCCCEEEeeee Confidence 2110 000000 0 0011111 11222111 1111 22333 3555555554 488999999999 Q ss_pred eccccCc-eecCce--eecchhhhhhceeEEEEeeccceeccCChhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_018846. 77 HKLSKRP-TMGDER--VEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLA 153 (404) Q Consensus 77 ~~L~G~g-V~Gd~~--leGnee~L~~~s~~v~Idq~R~aV~~~g~m~~qrs~~dlrk~ar~~L~~w~~~~~D~~~~~~la 153 (404) ..|.|+. ..++.. .+---..+.-.+|.-+|=...++..... +++.-+--|.......++++||.+..-..+|..|. T Consensus 62 ~~L~g~~~n~~~d~~~~~~t~~kittg~~~a~v~~r~kaw~~~D-la~~lsG~dpm~~Ia~qva~yW~r~~q~~Lla~L~ 140 (367) T protein:vir:80 62 RDLDSLEPNYGSDNPNVEAPIDGLGSGEMKTTKTWLNKAYGAMD-LTAELAGSNPMTRIRNRFGVYWTRQWQRRIIAMAV 140 (367) T ss_pred ccCCCCccccCCCCCcccccccccccchheeeeehhcccchhhh-HHHHhhCchHHHHHHHHHHHHhhhhhHHHHHHHHH Confidence 9998753 222222 2223357777777777777777766553 66667778999999999999999999999999999 Q ss_pred hhhccccccccccccccccccccccccccCCCCCCcEEecCCccchhhhhhhccccHHHHHHHHHHHHHhCCCCccEEee Q lcl|NC_018846. 154 GARGDFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLS 233 (404) Q Consensus 154 G~~g~~~n~~~~~p~~~~~~~~~~~~N~v~apt~~r~~~~~~at~~~~i~~~D~~s~~~Id~~~~~a~~~a~pi~Pv~~~ 233 (404) |+-++....+...... ......+.+....++++=-.+.+. +++-+||.+.+-+|...+ T Consensus 141 Gvf~~~~a~~~~~~~~-----~~~~~a~~~~~~~~~~~Dis~~t~----~~~~~~s~~~~~~A~~~l------------- 198 (367) T protein:vir:80 141 GVYKSNLAGNFATIKT-----RGRVPAEVLGTAGDMVIDISGQTN----PADAVFNREAFVDAAFTM------------- 198 (367) T ss_pred Hhhccccccchhhhhh-----hhccccccccccCceeeeeeccCC----CccceecHHHHHHHHHHh------------- Confidence 9887543333211100 000011122222333332222111 233468877666553322 Q ss_pred cccccCccceEEEEecHHHHHHHhcCcchHHHHHHHHHHhhcccccCCcccccCceEEcCEEEEecCCceeeecccccee Q lcl|NC_018846. 234 GDELHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKVL 313 (404) Q Consensus 234 g~~~~~~~~~yV~~l~p~q~~~Lr~d~~~~~w~~~q~~A~~~~rg~~nPlF~G~~gm~ngvii~~~~~~~irf~~~~~~~ 313 (404) ||. .+..=+++||+..+..|++.- .++.-+.. .+ ...++.|+|..|+.--.||..- T Consensus 199 GD~---~~~l~~i~mHS~V~~~L~~~~----li~~i~~s----d~------~~~i~ty~G~~VIvDD~~Pv~~------- 254 (367) T protein:vir:80 199 GDH---VGSIAAIAVHSMVYKRMTNND----EIEFIPDS----KG------QLTIPTYMGKVVIVDDGMPVFG------- 254 (367) T ss_pred ccc---cccccEEEEchHHHHHHHhcc----ccccccCC----CC------ccccceecceeEEEeCCCcccc------- Confidence 221 123568999999999999973 33333321 11 2458899998887655554211 Q ss_pred ecccccccccccccccccchhheeecCceeEEEeecCCCCCceeeeccccccc--hH--HHHHH---HHhhhhhccccCC Q lcl|NC_018846. 314 VSENNLTATTKEVAAATNIDRAMLLGAQALANAYGQKAGGHFNMVEKKTDMDN--RT--EIAIS---WINGLKKIRFPEK 386 (404) Q Consensus 314 ~~~~~~~~~~~~~a~~~~v~ralllGaqAl~~A~g~~~g~r~~w~Ee~~D~g~--~~--~i~i~---~i~G~~K~rF~~~ 386 (404) . +...+-.+.|+|..|.+..=+. +..=.|-..|--. .- ++.+. .++.-.=+.|.+. T Consensus 255 -----------~--~a~~~yttYlfg~GAi~~~~~~----~~~~~E~~Rd~~~~~~gG~d~L~~Rr~~~~hP~G~s~~~~ 317 (367) T protein:vir:80 255 -----------T--GADKTYLSILFGGAAFGYADGA----PQVPVAVGRRELRGNGSGLEYILERKEWIVHPGGFNWLDA 317 (367) T ss_pred -----------c--CCCceEEEEEEecceeeecccC----CccceecccchhhhcCCceEEEEeeeeEEeecceeeeccc Confidence 0 1112345689998887755332 1111333333311 11 11111 1222333334321 Q ss_pred CC-------------------------CceEEEEEEEeceecC Q lcl|NC_018846. 387 SG-------------------------KMQDHGVIAVDTAVKL 404 (404) Q Consensus 387 ~g-------------------------~~~DfGvi~idtaa~~ 404 (404) .- +..-+-.+-=.-.++| T Consensus 318 ~v~~~~~~~~~~~~~~~~~sPt~~eLa~~~NW~~v~d~K~I~i 360 (367) T protein:vir:80 318 DVTIPDNTGSPSGITSGPPAITLANLANPDNWERVTYRKNVPM 360 (367) T ss_pred ccccccccccccccccccCCCChHHhcCCcccccccchhhcce Confidence 00 0001111100001111 No 52 >protein:vir:108303 Length: 418 # NCBI annotation: hypothetical protein # Family: family:all:1412 # MgeID: mge:2007 # MgeName: BA3 # Cross-refs: genbank:acc:YP_001552282;genbank:gi:160700607;genbank:GeneID:5758819 Probab=96.69 E-value=0.00041 Score=39.15 Aligned_cols=279 Identities=10% Similarity=0.060 Sum_probs=137.9 Q ss_pred HhhcCchhH--HHHHhhhhhhhhhhcccccccCCCCCccEEEEeecCCCCCcEEEEEEeeccccCceecCceeecchhhh Q lcl|NC_018846. 20 AANRNRSMV--NILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLSKRPTMGDERVEGRGEDL 97 (404) Q Consensus 20 ~~~~n~~~~--~~~~~~l~~~~~k~s~~~~~~Gt~~~~~I~~~~dL~k~~Gd~v~f~L~~~L~G~gV~Gd~~leGnee~L 97 (404) ...++|..+ ++|+..+...-.+..-+....-+.-. .|. +++||+|+++....+. ..|.. .=..+++ T Consensus 1 m~~~~N~~ltp~iia~~~l~~l~~~lV~~~lv~r~y~------~e~-~~~GDTV~I~vp~~~~----v~dg~-~~~~~~~ 68 (418) T protein:vir:10 1 MAVQDNNLLTDDVIAKEALRLLKNNLVMAKCVYRNYE------KTF-GKVGDTIRLKLPYRVK----SASGR-TLVKQPM 68 (418) T ss_pred CCccccccccHHHHHHHHHHHHHHhccchhhhcCCCc------hHH-hhCCCEEEEeeCCcee----ecccC-Ccccccc Confidence 223334433 47777765555544444333322211 122 2579999988766554 11111 1124567 Q ss_pred hhceeEEEEeeccc-eeccCChhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccccccccccccccc Q lcl|NC_018846. 98 SHADFSLKINQGRH-LVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGDFVADDTILPTAEHPEFKK 176 (404) Q Consensus 98 ~~~s~~v~Idq~R~-aV~~~g~m~~qrs~~dlrk~ar~~L~~w~~~~~D~~~~~~laG~~g~~~n~~~~~p~~~~~~~~~ 176 (404) .-.+-.|.||+..+ ++...++ ++-.+.-||+++.-+....=+++..|+.++-.+.++.. T Consensus 69 te~~v~l~id~~k~~~~~itD~-e~a~~~~d~~~~~l~~A~~aLA~~vD~~ia~l~~~a~~------------------- 128 (418) T protein:vir:10 69 VDQTIPFKIAYQEHVGLEYTVK-DKTLDIMQFSERYLKSGMVQIANQIDRSLALTLKKAFH------------------- 128 (418) T ss_pred ccceEEEEEecccccceeechH-HHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHhhccc------------------- Confidence 77777899988874 5666543 33445678887666666666777788777654443321 Q ss_pred ccccccCCCCCCcEEecCCccchhhhhhhccccHHHHHHHHHHHHHhCCCCccEEeecccccCccceEEEEecHHHHHHH Q lcl|NC_018846. 177 IMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHGEDPYYVLYVTPRQWNDW 256 (404) Q Consensus 177 ~~~N~v~apt~~r~~~~~~at~~~~i~~~D~~s~~~Id~~~~~a~~~a~pi~Pv~~~g~~~~~~~~~yV~~l~p~q~~~L 256 (404) .+..+. + ..+ .++.|-.++.++++..-|- +|+ +.++++|..+..| T Consensus 129 ----~~gt~g----------t------~~~--~~~~i~~a~~~Ld~~~VP~-----~G~--------R~lVv~P~~~~~L 173 (418) T protein:vir:10 129 ----SSGTPG----------V------RPG--AFIDFANAGAKQTTYAVPQ-----DGM--------RHAVLDPFTCASL 173 (418) T ss_pred ----ccccCC----------c------Ccc--hHHHHHHHHHHHHhcCCCC-----CCc--------eEEEeCHHHHHHH Confidence 000000 0 000 2444445677787766661 121 5777999999999 Q ss_pred hcCcchHHHHHHHHHHhhcccccCCcccccCceEEcCEEEEecCCceeeeccccceeecccccccccccccccccchhhe Q lcl|NC_018846. 257 YTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKVLVSENNLTATTKEVAAATNIDRAM 336 (404) Q Consensus 257 r~d~~~~~w~~~q~~A~~~~rg~~nPlF~G~~gm~ngvii~~~~~~~irf~~~~~~~~~~~~~~~~~~~~a~~~~v~ral 336 (404) ..|..+. + .. -+....|-.|.+|.+.|+-+.+..++|..- .|.. ..+ .+ T Consensus 174 ~~~~~~~-~--------~~-~~~~~~lr~G~IG~i~GF~V~~S~nip~~t-ag~~-----~~t---------------~~ 222 (418) T protein:vir:10 174 SDEVTKL-F--------KE-SMVEQAYKMGYRGNVAAYEVYESQNLPKHT-VGDH-----GGT---------------PL 222 (418) T ss_pred hhhcccc-c--------cc-cccchhhheeeeeeeeceEEEEecCCCccc-cccc-----ccc---------------ee Confidence 9875421 1 11 123456778999999999999988765111 0100 000 01 Q ss_pred eecCceeEEEeecCCCCCcee--eeccccccchHHH-HHHHHhhhhhccccCCCCCceEEEEEEE-e------ceecC Q lcl|NC_018846. 337 LLGAQALANAYGQKAGGHFNM--VEKKTDMDNRTEI-AISWINGLKKIRFPEKSGKMQDHGVIAV-D------TAVKL 404 (404) Q Consensus 337 llGaqAl~~A~g~~~g~r~~w--~Ee~~D~g~~~~i-~i~~i~G~~K~rF~~~~g~~~DfGvi~i-d------taa~~ 404 (404) ..|+++-+-+-+-. ..| .+-..--|+...| ++..+.++.|-. .+..+-|-|..- + +.++| T Consensus 223 v~ga~~~~~~~~~~----~~t~s~~g~l~~Gd~~ti~gv~~v~~~t~~~----~~~~~~f~V~~~~~~~~~~~~tv~i 292 (418) T protein:vir:10 223 VNGTVVNGDTVGFD----GGTASTTGFLKAGDVITFGGVFGVNPQNYET----TGLLQEFVVLEDVDTDAGGAGSIKI 292 (418) T ss_pred eecccccceeEEEe----ecceeeccceeeccEEEECceeecccccccc----cccceEEEEEeeccccccCcceeEe Confidence 11221111110000 001 0111222232222 223344444433 334567744332 1 34566 No 53 >protein:vir:102655 Length: 322 # NCBI annotation: Hypothetical protein # Family: family:all:6384 # MgeID: mge:1624 # MgeName: VP2 # Cross-refs: genbank:acc:YP_052979;genbank:gi:50282923;genbank:GeneID:2948122 Probab=96.43 E-value=0.00063 Score=38.11 Aligned_cols=302 Identities=10% Similarity=0.004 Sum_probs=154.7 Q ss_pred CCcccchHHHHHHHHHHHHHhhcCchhHHHHHhhhhhhhhhh-cccccccCCCCCccEEEEeecCCCCCcEEEEEEeec- Q lcl|NC_018846. 1 MTTVTSAQANKLYQVALFTAANRNRSMVNILTEQQEAPKAVS-PDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHK- 78 (404) Q Consensus 1 ~~~~~~~~a~~~~~~~lft~~~~n~~~~~~~~~~l~~~~~k~-s~~~~~~Gt~~~~~I~~~~dL~k~~Gd~v~f~L~~~- 78 (404) |++ .+ -.-+++.|.+.+....+.+ +.+. ++ ++..++ -+.++.+..-.... T Consensus 13 Ms~----------------~i--~~~fv~qy~~~v~~~~qq~~s~L~---~t-----V~~~~~--~~~~~~~~~~~~~~~ 64 (322) T protein:vir:10 13 IAG----------------DI--DQAFVQTYETTLRILSQQKSAKLK---QY-----CQHKNE--SSESHNWETLASMDP 64 (322) T ss_pred eec----------------hh--hhHHHHHHHHHHHHHHHHhhhhhh---cc-----cccccc--cccccceeecccccc Confidence 333 01 1115677888765555432 2222 11 111111 11222222111111 Q ss_pred -cccCceecCceeecc----hhhhhhceeEEEEeeccceeccCChhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_018846. 79 -LSKRPTMGDERVEGR----GEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLA 153 (404) Q Consensus 79 -L~G~gV~Gd~~leGn----ee~L~~~s~~v~Idq~R~aV~~~g~m~~qrs~~dlrk~ar~~L~~w~~~~~D~~~~~~la 153 (404) .-|.+-.+.....+. ..+.....-.+.+++...++.+. .++.-|..+|+|...-..++.=|++..|+.++-.+. T Consensus 65 ~~~~~~~~~~~~~d~~~dtp~~~~~~~~r~~~~~d~~~~~~VD-d~D~~k~~~D~~~~~~~~~a~AL~R~~D~~I~~a~~ 143 (322) T protein:vir:10 65 DAVKRKRSRQQSADGTYPTPVNNKPFAKRRTNVDTYDTGHVVE-QEDISQMLLDPNSALITSQAYAMARKTDDLIIAGAW 143 (322) T ss_pred cccccccccccccCcccCCCccccccceEEEeecccccceecc-hHHHHHhhcCchHHHHHHHHHHhhhHHHHHHHhhhh Confidence 111111111111111 11233444445555555665543 588889999999999999999999999998864443 Q ss_pred hhhccccccccccccccccccccccccccCCCCCCcEEecCCccchhhhhhhccccHHHHHHHHHHHHHhCCCCccEEee Q lcl|NC_018846. 154 GARGDFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLS 233 (404) Q Consensus 154 G~~g~~~n~~~~~p~~~~~~~~~~~~N~v~apt~~r~~~~~~at~~~~i~~~D~~s~~~Id~~~~~a~~~a~pi~Pv~~~ 233 (404) |.- . . .. ...++..|++.-+ +. ++=.++.+.|..++...++..-| T Consensus 144 g~a---~-~----------~~---~gt~v~~~ss~~i--~~---------g~~g~t~~kl~~a~~~l~~~dvp------- 188 (322) T protein:vir:10 144 KPA---S-I----------KG---TGQPVEFLATQEI--GD---------GTKPISFDYVTEITERFLENEIE------- 188 (322) T ss_pred ccc---c-c----------cc---cccccccCCCccc--cc---------CccchhHHHHHHHHHHHHhcCCC------- Confidence 321 0 0 00 0122333333211 11 12245666666677777665533 Q ss_pred cccccCccceEEEEecHHHHHHHhcCcchHHHHHHHHHHhhcccccCCcccc-cCceEEcCEEEEecCCceeeeccccce Q lcl|NC_018846. 234 GDELHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFK-GECAMWRNILVRKYAGMPIRFYQGSKV 312 (404) Q Consensus 234 g~~~~~~~~~yV~~l~p~q~~~Lr~d~~~~~w~~~q~~A~~~~rg~~nPlF~-G~~gm~ngvii~~~~~~~irf~~~~~~ 312 (404) ++.+ .+|++.|.|+.+|.+|+.+.+ +.. ....+|+. |.+|+|-|+-+..+.++|. +.... T Consensus 189 -----~d~~-R~~vv~p~~~~~LL~d~~~ts-------~D~---~~~~~l~~~G~ig~~lGf~~i~s~~lp~--~~~t~- 249 (322) T protein:vir:10 189 -----PEVS-KVIVIGPTQARKLLQITEATS-------ADY---TSAMDLQSKGIITNWMGYTWIVSTRLDK--FDPTQ- 249 (322) T ss_pred -----CCCC-eEEEeCHHHHHHHhcchhhhh-------hhc---ccchhhhhcCeeeeeeeEEEEEeccCCc--ccccc- Confidence 1112 468999999999999998531 111 12477875 8899999999988877651 11000 Q ss_pred eecccccccccccccccccchhheeecCceeEEEeecCCCCCceeeeccccccchHHHHHHHHhhhhhccccCCCCCceE Q lcl|NC_018846. 313 LVSENNLTATTKEVAAATNIDRAMLLGAQALANAYGQKAGGHFNMVEKKTDMDNRTEIAISWINGLKKIRFPEKSGKMQD 392 (404) Q Consensus 313 ~~~~~~~~~~~~~~a~~~~v~ralllGaqAl~~A~g~~~g~r~~w~Ee~~D~g~~~~i~i~~i~G~~K~rF~~~~g~~~D 392 (404) ........++..+..++.-=-+|++.|=++.-.++. .|.-|..+-.-|-....+|-.-++ + T Consensus 250 -------~~~~~~~~~~~~~~~~~a~~k~Av~~a~~~dv~~~i---~~~~~~~~a~~I~~~~~~Ga~ri~---------~ 310 (322) T protein:vir:10 250 -------WGMAAEDGPQGDEIWCIAMTDMALGYHSCKDIWTKV---AEDPSASFAWRIYSAFTADCVRVE---------D 310 (322) T ss_pred -------ccccccCCCCccceeEEEEecCceeEEEeeeeeEEe---eccCCcchhhhhhhhhhhCceEec---------c Confidence 011122233344556665556677777554322222 223344444555555666766553 5 Q ss_pred EEEEEEeceecC Q lcl|NC_018846. 393 HGVIAVDTAVKL 404 (404) Q Consensus 393 fGvi~idtaa~~ 404 (404) =||+.|+--=-| T Consensus 311 ~gVv~i~~~e~~ 322 (322) T protein:vir:10 311 EHIFKLRLKNSL 322 (322) T ss_pred CcEEEEEEeccC Confidence 578888764445 No 54 >protein:vir:94800 Length: 319 # NCBI annotation: ORF012 # Family: family:all:701 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240536;genbank:gi:66396203;genbank:GeneID:5133580 Probab=95.45 E-value=0.002 Score=35.33 Aligned_cols=284 Identities=12% Similarity=0.067 Sum_probs=146.8 Q ss_pred CC-cccchHHHHHHHHHHHH--HhhcCchh-HHHHHhhhhhhhhhhcccccccCCCCCccEEEEeecCCCCCcEEEEEEe Q lcl|NC_018846. 1 MT-TVTSAQANKLYQVALFT--AANRNRSM-VNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIM 76 (404) Q Consensus 1 ~~-~~~~~~a~~~~~~~lft--~~~~n~~~-~~~~~~~l~~~~~k~s~~~~~~Gt~~~~~I~~~~dL~k~~Gd~v~f~L~ 76 (404) |. ++-++.-+...----|+ ++.-|+.. .++|+..|..-..+.++-.+.. .|.-++ -..|++|.++-+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~nt~~l~~k~~~~LD~~~~~~~~s~~~~---~N~~~e------~~gg~tVkIp~i 71 (319) T protein:vir:94 1 MNKTIKNATGMLKLNLQHFANKSVEPGQTLLKNKHVGILERVTAVNAYSTPAL---ISNDAI------FMEGRSFTVMKG 71 (319) T ss_pred CCcccccccceeEeehhhhhccCCCcchHHHHHHHHHHHHHHHHHhhhhhhcc---cCcceE------eccCcEEEEeee Confidence 43 22233222222222233 23344443 4678887765544444332221 121121 136999997765 Q ss_pred eccccCceecCcee-e-cchhhhhhceeEEEEeecc---ceeccCChhhhhhhhhhH--HHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018846. 77 HKLSKRPTMGDERV-E-GRGEDLSHADFSLKINQGR---HLVDAGGRMSQQRTKFNL--ASSARTLLGTYFNDLQDQCAI 149 (404) Q Consensus 77 ~~L~G~gV~Gd~~l-e-Gnee~L~~~s~~v~Idq~R---~aV~~~g~m~~qrs~~dl--rk~ar~~L~~w~~~~~D~~~~ 149 (404) .- .++ +|-.. . .+.++++....++.+||.| +.|+ .|+..-+..++ -........+.+...+|.-.| T Consensus 72 ~~---~gl-~DY~R~~g~~~g~vt~~~~t~tidqdR~~~F~VD---~~D~~Etn~~l~a~~i~~~~~~~~v~PEiDay~~ 144 (319) T protein:vir:94 72 DT---TEL-KDYKRNATNEFDHPKIEETTYFLDQEKYWGRFVD---ALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRF 144 (319) T ss_pred cc---ccc-ccccCCCCcccCCcccceeEEEeecccccccccc---hhhHhhhhchhhHHHHHHHHHHHHhhhhhhHHHH Confidence 53 333 33322 2 3456889999999999998 3444 56666665555 344555666666667777666 Q ss_pred HHHhhhhccccccccccccccccccccccccccCCCCCCcEEecCCccchhhhhhhccccHHHHHHHHHHHHHhCCCCcc Q lcl|NC_018846. 150 VHLAGARGDFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQP 229 (404) Q Consensus 150 ~~laG~~g~~~n~~~~~p~~~~~~~~~~~~N~v~apt~~r~~~~~~at~~~~i~~~D~~s~~~Id~~~~~a~~~a~pi~P 229 (404) ..|++.-+. .....++++- .++.|+.+.+++++..-| . T Consensus 145 skla~~a~~--------------------------------------~~~~~~t~~n--~y~~i~~a~~~Lde~~VP-~- 182 (319) T protein:vir:94 145 ATLARNKAK--------------------------------------HLTVGTGSDA--QYDAVLDVSVELDEIKAP-E- 182 (319) T ss_pred HHHHhhccc--------------------------------------ccccccCHHH--HHHHHHHHHHHHHhcCCC-C- Confidence 666532110 0001122221 377888888888876544 1 Q ss_pred EEeecccccCccceEEEEecHHHHHHHhcCcchHHHHHHHHHHhhcccccCCcccccCceEEcCEEEEecCCceeeeccc Q lcl|NC_018846. 230 VRLSGDELHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQG 309 (404) Q Consensus 230 v~~~g~~~~~~~~~yV~~l~p~q~~~Lr~d~~~~~w~~~q~~A~~~~rg~~nPlF~G~~gm~ngvii~~~~~~~irf~~~ 309 (404) + +||+++|..+.-|+.++.+. +.- ......+..|.+|.+||+.|.+.|.. |.. T Consensus 183 ----~---------Rvl~Vtp~~~~~L~~~~~f~------~~~----~~~~~~~~~g~Vg~idG~~Vi~vps~--~~k-- 235 (319) T protein:vir:94 183 ----N---------RVLFVSPTFYKGIKKFVIAL------PQG----DTRQQVLGKGVQGELDGFVIVKVPTK--LLQ-- 235 (319) T ss_pred ----C---------cEEEeCHHHHHHHHhhhhhh------ccc----cccccceeeeeceeecCeEEEEeccc--ccc-- Confidence 1 68899999999999997642 221 11246789999999999999987752 220 Q ss_pred cceeecccccccccccccccccchhheeecCceeEEEeecCCCCCcee-eeccccccchHHHHHHHHhhhhhcccc-CCC Q lcl|NC_018846. 310 SKVLVSENNLTATTKEVAAATNIDRAMLLGAQALANAYGQKAGGHFNM-VEKKTDMDNRTEIAISWINGLKKIRFP-EKS 387 (404) Q Consensus 310 ~~~~~~~~~~~~~~~~~a~~~~v~ralllGaqAl~~A~g~~~g~r~~w-~Ee~~D~g~~~~i~i~~i~G~~K~rF~-~~~ 387 (404) ...+++|......+--|...++.+- .+.. . .+.+.| ..|. ..- T Consensus 236 -----------------------~in~i~~h~~A~~~~~k~~~~~~~~p~~~~------~---a~~v~g---r~y~d~~V 280 (319) T protein:vir:94 236 -----------------------GLQAIAVVGEVLASPIQADLAKTNSNIPGM------F---GTLAEQ---LLYTGAFV 280 (319) T ss_pred -----------------------cceEEEEcCCeeeeeeeeeeeeccCCCccc------c---ceeeee---eeeeeeEE Confidence 1347777766655544432222111 1111 1 111211 1111 000 Q ss_pred CCceEEEEEEEeceecC Q lcl|NC_018846. 388 GKMQDHGVIAVDTAVKL 404 (404) Q Consensus 388 g~~~DfGvi~idtaa~~ 404 (404) -..+=-||++.-++.|- T Consensus 281 ~~~k~~~Iy~~~~~~~~ 297 (319) T protein:vir:94 281 PEHLQKYIFTIGGTEVA 297 (319) T ss_pred eccccceEEEeecCCcc Confidence 00112334333333332 No 55 >protein:vir:97331 Length: 319 # NCBI annotation: ORF011 # Family: family:all:701 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240611;genbank:gi:66396278;genbank:GeneID:5133687 Probab=95.45 E-value=0.002 Score=35.33 Aligned_cols=284 Identities=12% Similarity=0.067 Sum_probs=146.8 Q ss_pred CC-cccchHHHHHHHHHHHH--HhhcCchh-HHHHHhhhhhhhhhhcccccccCCCCCccEEEEeecCCCCCcEEEEEEe Q lcl|NC_018846. 1 MT-TVTSAQANKLYQVALFT--AANRNRSM-VNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIM 76 (404) Q Consensus 1 ~~-~~~~~~a~~~~~~~lft--~~~~n~~~-~~~~~~~l~~~~~k~s~~~~~~Gt~~~~~I~~~~dL~k~~Gd~v~f~L~ 76 (404) |. ++-++.-+...----|+ ++.-|+.. .++|+..|..-..+.++-.+.. .|.-++ -..|++|.++-+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~nt~~l~~k~~~~LD~~~~~~~~s~~~~---~N~~~e------~~gg~tVkIp~i 71 (319) T protein:vir:97 1 MNKTIKNATGMLKLNLQHFANKSVEPGQTLLKNKHVGILERVTAVNAYSTPAL---ISNDAI------FMEGRSFTVMKG 71 (319) T ss_pred CCcccccccceeEeehhhhhccCCCcchHHHHHHHHHHHHHHHHHhhhhhhcc---cCcceE------eccCcEEEEeee Confidence 43 22233222222222233 23344443 4678887765544444332221 121121 136999997765 Q ss_pred eccccCceecCcee-e-cchhhhhhceeEEEEeecc---ceeccCChhhhhhhhhhH--HHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018846. 77 HKLSKRPTMGDERV-E-GRGEDLSHADFSLKINQGR---HLVDAGGRMSQQRTKFNL--ASSARTLLGTYFNDLQDQCAI 149 (404) Q Consensus 77 ~~L~G~gV~Gd~~l-e-Gnee~L~~~s~~v~Idq~R---~aV~~~g~m~~qrs~~dl--rk~ar~~L~~w~~~~~D~~~~ 149 (404) .- .++ +|-.. . .+.++++....++.+||.| +.|+ .|+..-+..++ -........+.+...+|.-.| T Consensus 72 ~~---~gl-~DY~R~~g~~~g~vt~~~~t~tidqdR~~~F~VD---~~D~~Etn~~l~a~~i~~~~~~~~v~PEiDay~~ 144 (319) T protein:vir:97 72 DT---TEL-KDYKRNATNEFDHPKIEETTYFLDQEKYWGRFVD---ALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRF 144 (319) T ss_pred cc---ccc-ccccCCCCcccCCcccceeEEEeecccccccccc---hhhHhhhhchhhHHHHHHHHHHHHhhhhhhHHHH Confidence 53 333 33322 2 3456889999999999998 3444 56666665555 344555666666667777666 Q ss_pred HHHhhhhccccccccccccccccccccccccccCCCCCCcEEecCCccchhhhhhhccccHHHHHHHHHHHHHhCCCCcc Q lcl|NC_018846. 150 VHLAGARGDFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQP 229 (404) Q Consensus 150 ~~laG~~g~~~n~~~~~p~~~~~~~~~~~~N~v~apt~~r~~~~~~at~~~~i~~~D~~s~~~Id~~~~~a~~~a~pi~P 229 (404) ..|++.-+. .....++++- .++.|+.+.+++++..-| . T Consensus 145 skla~~a~~--------------------------------------~~~~~~t~~n--~y~~i~~a~~~Lde~~VP-~- 182 (319) T protein:vir:97 145 ATLARNKAK--------------------------------------HLTVGTGSDA--QYDAVLDVSVELDEIKAP-E- 182 (319) T ss_pred HHHHhhccc--------------------------------------ccccccCHHH--HHHHHHHHHHHHHhcCCC-C- Confidence 666532110 0001122221 377888888888876544 1 Q ss_pred EEeecccccCccceEEEEecHHHHHHHhcCcchHHHHHHHHHHhhcccccCCcccccCceEEcCEEEEecCCceeeeccc Q lcl|NC_018846. 230 VRLSGDELHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQG 309 (404) Q Consensus 230 v~~~g~~~~~~~~~yV~~l~p~q~~~Lr~d~~~~~w~~~q~~A~~~~rg~~nPlF~G~~gm~ngvii~~~~~~~irf~~~ 309 (404) + +||+++|..+.-|+.++.+. +.- ......+..|.+|.+||+.|.+.|.. |.. T Consensus 183 ----~---------Rvl~Vtp~~~~~L~~~~~f~------~~~----~~~~~~~~~g~Vg~idG~~Vi~vps~--~~k-- 235 (319) T protein:vir:97 183 ----N---------RVLFVSPTFYKGIKKFVIAL------PQG----DTRQQVLGKGVQGELDGFVIVKVPTK--LLQ-- 235 (319) T ss_pred ----C---------cEEEeCHHHHHHHHhhhhhh------ccc----cccccceeeeeceeecCeEEEEeccc--ccc-- Confidence 1 68899999999999997642 221 11246789999999999999987752 220 Q ss_pred cceeecccccccccccccccccchhheeecCceeEEEeecCCCCCcee-eeccccccchHHHHHHHHhhhhhcccc-CCC Q lcl|NC_018846. 310 SKVLVSENNLTATTKEVAAATNIDRAMLLGAQALANAYGQKAGGHFNM-VEKKTDMDNRTEIAISWINGLKKIRFP-EKS 387 (404) Q Consensus 310 ~~~~~~~~~~~~~~~~~a~~~~v~ralllGaqAl~~A~g~~~g~r~~w-~Ee~~D~g~~~~i~i~~i~G~~K~rF~-~~~ 387 (404) ...+++|......+--|...++.+- .+.. . .+.+.| ..|. ..- T Consensus 236 -----------------------~in~i~~h~~A~~~~~k~~~~~~~~p~~~~------~---a~~v~g---r~y~d~~V 280 (319) T protein:vir:97 236 -----------------------GLQAIAVVGEVLASPIQADLAKTNSNIPGM------F---GTLAEQ---LLYTGAFV 280 (319) T ss_pred -----------------------cceEEEEcCCeeeeeeeeeeeeccCCCccc------c---ceeeee---eeeeeeEE Confidence 1347777766655544432222111 1111 1 111211 1111 000 Q ss_pred CCceEEEEEEEeceecC Q lcl|NC_018846. 388 GKMQDHGVIAVDTAVKL 404 (404) Q Consensus 388 g~~~DfGvi~idtaa~~ 404 (404) -..+=-||++.-++.|- T Consensus 281 ~~~k~~~Iy~~~~~~~~ 297 (319) T protein:vir:97 281 PEHLQKYIFTIGGTEVA 297 (319) T ss_pred eccccceEEEeecCCcc Confidence 00112334333333332 No 56 >protein:vir:78387 Length: 349 # NCBI annotation: putative coat protein # Family: family:all:1522 # MgeID: mge:1851 # MgeName: SETP3 # Cross-refs: genbank:acc:YP_001110837;genbank:gi:134288598;genbank:GeneID:5179650 Probab=95.09 E-value=0.0028 Score=34.59 Aligned_cols=307 Identities=10% Similarity=0.095 Sum_probs=145.6 Q ss_pred CCcccchHHHHHHHHHHHHHhhcCchhHHHHHhhhhhhhhhhcccccccCCCCCccEEEEeecC---CCCCcEEEEEEee Q lcl|NC_018846. 1 MTTVTSAQANKLYQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLN---KQAGDEVTFSIMH 77 (404) Q Consensus 1 ~~~~~~~~a~~~~~~~lft~~~~n~~~~~~~~~~l~~~~~k~s~~~~~~Gt~~~~~I~~~~dL~---k~~Gd~v~f~L~~ 77 (404) |.+ |--.-++.-.--+|+..-.+.| .+.+ .+.++ .+|+.-.+|. .+.|+.|++++.. T Consensus 1 Ma~-T~l~D~iipe~~vf~~Yv~~~~-------------~e~~---~l~qS---Gii~~d~~l~~~~~~gG~~~~iPf~~ 60 (349) T protein:vir:78 1 MAI-TTIGDIVTGNIPVLASYMTEDP-------------VEKT---AFFDS---GILTSTPYAAEIANGPSNIANLPFWK 60 (349) T ss_pred CCc-eEEeeeeccCHHHHHHHHHHhh-------------HHhh---hhhhc---cceeccHHHHHHhhcCCCEEEeeeee Confidence 431 1111111111223333333322 1122 22332 4666556665 4789999999999 Q ss_pred ccccC--c-eecCceeec--chhhhhhceeEEEEeeccceeccCChhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018846. 78 KLSKR--P-TMGDERVEG--RGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHL 152 (404) Q Consensus 78 ~L~G~--g-V~Gd~~leG--nee~L~~~s~~v~Idq~R~aV~~~g~m~~qrs~~dlrk~ar~~L~~w~~~~~D~~~~~~l 152 (404) +|.|+ + |.+|. -++ --+.+...++.-++=...++..... +++.-+--|..+....++++||.+.....+|..| T Consensus 61 ~L~g~~e~nv~~D~-~~~~~t~~kitt~~~~a~~~~r~kaw~~~D-la~~lsG~dpm~~Ia~~va~yW~r~~q~~Lia~L 138 (349) T protein:vir:78 61 AIDTSIEPNYSNDV-YQDIATPRAIQTGEMMARVAYLNEGFGQAD-LTVELTSQNPLQSVASRLDNFWQRQAQRRLIATA 138 (349) T ss_pred cCCCCcccccCCCC-cccccccccccccceeeeeeeeccccchhH-HHHHhhCchHHHHHHHHHHHHHhhHHHHHHHHHH Confidence 99984 3 33332 222 2345777777776666666655442 5556666699999999999999999999999999 Q ss_pred hhhhccccccccccccccccccccccccccCCCCCCcEEecCCccchhhhhhhccccHHHHHHHHHHHHHhCCCCccEEe Q lcl|NC_018846. 153 AGARGDFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRL 232 (404) Q Consensus 153 aG~~g~~~n~~~~~p~~~~~~~~~~~~N~v~apt~~r~~~~~~at~~~~i~~~D~~s~~~Id~~~~~a~~~a~pi~Pv~~ 232 (404) .|+-+...... ....+. + ++. + ++.++..++.+.+-.+...+-.... T Consensus 139 ~Gvf~~~~~a~------~~~~~~----~-------~~t-~--------d~s~~a~~~~~~~~dA~~~lgda~~------- 185 (349) T protein:vir:78 139 LGLYNDNVSAT------DAYHEQ----N-------DMV-V--------DVSATLGFDAGAFIDATQTMGDALM------- 185 (349) T ss_pred HHhhccccccc------chhhhc----c-------cce-e--------eeccccCCChhhhhhhHHHHHHHhc------- Confidence 99865211110 000010 0 000 0 1112223454433233333222221 Q ss_pred ecccccCccceEEEEecHHHHHHHhcCcchHHHHHHHHHHhhcccccCCcccccCceEEcCEEEEecCCceeeeccccce Q lcl|NC_018846. 233 SGDELHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKV 312 (404) Q Consensus 233 ~g~~~~~~~~~yV~~l~p~q~~~Lr~d~~~~~w~~~q~~A~~~~rg~~nPlF~G~~gm~ngvii~~~~~~~irf~~~~~~ 312 (404) |+. .+.+=++.||+..+..|++.- .++..+. +++ ...++.|+|..++.--.+|.- T Consensus 186 -Gd~---~~~lt~i~mHS~v~~~L~~~~----li~~i~~----s~~------~~~i~ty~G~~VivDD~~Pv~------- 240 (349) T protein:vir:78 186 -GNG---GEVLGAIAMHSFVYAQARKAQ----LIDFIRD----AEN------NTMFATYQGYRVIVDDSMTVV------- 240 (349) T ss_pred -ccc---ccceeEEEEchHHHHHHHhhh----hhhhccC----ccc------CcccceecCeEEEEeCCCccc------- Confidence 211 123568999999999999863 2333221 111 123678999777654444310 Q ss_pred eecccccccccccccccccchhheeecCceeEEEeecCCCCCceeeeccccccchHHHHHH-------HHhhhhhccccC Q lcl|NC_018846. 313 LVSENNLTATTKEVAAATNIDRAMLLGAQALANAYGQKAGGHFNMVEKKTDMDNRTEIAIS-------WINGLKKIRFPE 385 (404) Q Consensus 313 ~~~~~~~~~~~~~~a~~~~v~ralllGaqAl~~A~g~~~g~r~~w~Ee~~D~g~~~~i~i~-------~i~G~~K~rF~~ 385 (404) . .+...+-..+|+|..|.+..-+.+ ..-.|-..|--.+.+-+.+ .+++.+=+.|.. T Consensus 241 -----------~--~g~~~~yttylfg~GAi~~~~~~~----~~~~et~rd~~~g~~~G~d~l~~R~~~~~hp~G~s~~~ 303 (349) T protein:vir:78 241 -----------G--QGAQRKFISIIFGQGAIGYGEGNP----VMPLEYEREASRANGGGVETLWTRKTWLLHPFGYRFTS 303 (349) T ss_pred -----------c--CCCCceEEEEEeecceEEEccCCC----ccceeeecccccCCcceeEEEEEeeEEEeeeeeeeecc Confidence 1 111234456999987777664432 2223333333211111111 122222333321 Q ss_pred CCC------------------CceEEEEEEEeceecC Q lcl|NC_018846. 386 KSG------------------KMQDHGVIAVDTAVKL 404 (404) Q Consensus 386 ~~g------------------~~~DfGvi~idtaa~~ 404 (404) ... ++.-+-.+.=.-.++| T Consensus 304 a~v~~~~~~~~~~sPt~aeLa~~~NW~~v~~~K~I~i 340 (349) T protein:vir:78 304 AVITGNGTETIARSASWQDLANATNWNRVVDRKHVPI 340 (349) T ss_pred ccccCCccccccCCCChHHhcCCcCcccccChhhcce Confidence 100 0011111100011111 No 57 >protein:vir:78920 Length: 290 # NCBI annotation: Cps # Family: family:all:701 # MgeID: mge:1859 # MgeName: A006 # Cross-refs: genbank:acc:YP_001468846;genbank:gi:157325479;genbank:GeneID:5601917 Probab=94.24 E-value=0.005 Score=33.20 Aligned_cols=276 Identities=13% Similarity=0.077 Sum_probs=139.4 Q ss_pred hcCchhHHHHHhhhhhhhhhhcccccccCCCCCccEEEEeecCCCCCcEEEEEEeeccccCceecCce-eecc-hhhhhh Q lcl|NC_018846. 22 NRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLSKRPTMGDER-VEGR-GEDLSH 99 (404) Q Consensus 22 ~~n~~~~~~~~~~l~~~~~k~s~~~~~~Gt~~~~~I~~~~dL~k~~Gd~v~f~L~~~L~G~gV~Gd~~-leGn-ee~L~~ 99 (404) +..+. .++|+..|.....+.+...... +. +..=.-|++|.++=+. -.|+ +|-. -.|. ..+.+. T Consensus 1 Main~-a~~~~~~Ld~~~~~~~~t~~l~----~~------~~~~~ggktVkI~~i~---~~gl-~DY~R~~g~~~g~v~~ 65 (290) T protein:vir:78 1 MAINY-VDKYGKELDQKLVFGTYTNELE----TP------NLLWLDAKTFKIQTIT---TTGL-KAHTRNKGYNEGSASN 65 (290) T ss_pred CchhH-HHHHHHHHHHHHHhhheeeecc----cc------ceeeccCCEEEEeeec---cCcc-cccccCCCcccCcccc Confidence 22221 3567777766666665544331 11 1111358999977544 2233 2222 2222 234567 Q ss_pred ceeEEEEeecc---ceeccCChhhhhhh--hhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccccccccccccc Q lcl|NC_018846. 100 ADFSLKINQGR---HLVDAGGRMSQQRT--KFNLASSARTLLGTYFNDLQDQCAIVHLAGARGDFVADDTILPTAEHPEF 174 (404) Q Consensus 100 ~s~~v~Idq~R---~aV~~~g~m~~qrs--~~dlrk~ar~~L~~w~~~~~D~~~~~~laG~~g~~~n~~~~~p~~~~~~~ 174 (404) ...++.+||.| +.|+ .|+...+ ...+-........+...-.+|.-.|-.|++.-+. T Consensus 66 ~~et~tl~qdR~~~F~vD---~~DvDEt~~~~~~~nv~~ef~~~~v~PEiDayr~skla~~a~~---------------- 126 (290) T protein:vir:78 66 TNKSYTIDFDRDVEFFVD---VMDVDETGQALSAANVTKEFNSRHAGPEMDAYRFSKLATAAKT---------------- 126 (290) T ss_pred ceeeEEeeccccceeecc---ccchhHHhhhhhHHHHHHHHHHHHhhhhhhHHHHHHHHhhhhc---------------- Confidence 78889999988 4454 3444433 3455666666667777777887777666543210 Q ss_pred ccccccccCCCCCCcEEecCCccchhhhhhhccccHHHHHHHHHHHHHhCCCCccEEeecccccCccceEEEEecHHHHH Q lcl|NC_018846. 175 KKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHGEDPYYVLYVTPRQWN 254 (404) Q Consensus 175 ~~~~~N~v~apt~~r~~~~~~at~~~~i~~~D~~s~~~Id~~~~~a~~~a~pi~Pv~~~g~~~~~~~~~yV~~l~p~q~~ 254 (404) .| ......++++. -++.|+.+..++++ .| .+. +||+++|.-+. T Consensus 127 ----~~---------------~~~~~t~t~~n--~~~~i~~~~~~lde--vp-------~~~-------rvl~vtp~~~~ 169 (290) T protein:vir:78 127 ----NS---------------NSVAEEITKDN--VFTKLKAAIRKVKK--YG-------TQN-------LVMYVSPDVMA 169 (290) T ss_pred ----cC---------------cccccccCHHH--HHHHHHHHHHHHHh--cC-------CCC-------eEEEECHHHHH Confidence 00 00001122221 23455555555543 12 111 89999999999 Q ss_pred HHhcCcchHHHHHHHHHHhhcccccCCcccccCceEEcCEEEEecCCceeeeccccceeecccccccccccccccccchh Q lcl|NC_018846. 255 DWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKVLVSENNLTATTKEVAAATNIDR 334 (404) Q Consensus 255 ~Lr~d~~~~~w~~~q~~A~~~~rg~~nPlF~G~~gm~ngvii~~~~~~~irf~~~~~~~~~~~~~~~~~~~~a~~~~v~r 334 (404) -|+.++.|. +.... .....-...|.+|.+||+.|.|.|.. -||+.--. ...+...+..+-.. T Consensus 170 lL~~~~~f~------r~~~~--~~~~~~~i~~~V~~idG~~ii~vps~-~r~~t~~~---------f~~G~~~~~~ak~i 231 (290) T protein:vir:78 170 ALELSDDFV------RAINV--QNIGPSSIETRITAIDGTRIVEVEAE-DRFYDTFD---------FTDGYKPAAGAKKL 231 (290) T ss_pred HHhhChhhh------ccccc--cccccccccceeeeecCcEEEEeccc-chhhhhhh---------hcccccccCCccce Confidence 999998753 32111 11123346899999999999998853 46653111 00111112224456 Q ss_pred heeecCceeEEEeecCCCCCceeeeccccccchHHHHHHHHhhhhhccccCCCCCceEEEEEEEeceecC Q lcl|NC_018846. 335 AMLLGAQALANAYGQKAGGHFNMVEKKTDMDNRTEIAISWINGLKKIRFPEKSGKMQDHGVIAVDTAVKL 404 (404) Q Consensus 335 alllGaqAl~~A~g~~~g~r~~w~Ee~~D~g~~~~i~i~~i~G~~K~rF~~~~g~~~DfGvi~idtaa~~ 404 (404) ++||-...+.+|.-|+.-.+.+==+...+ |+. +.+. =| .=|.+++++...+- T Consensus 232 n~ii~~~~a~i~~~K~~~~~~~~P~~~~~-~d~------~~~~---~r--------~y~d~~v~~nk~~~ 283 (290) T protein:vir:78 232 NFLLVNKGSVVGGAKHASIYLHAPGSVGQ-GDG------WLYQ---YR--------VYHDIFVLDQQKDG 283 (290) T ss_pred eEEEEcCCceeeeeeeeEEEeeCCCCCcC-cce------eeee---ee--------eeeeeeeeccccCe Confidence 79999999999987643222110000000 000 0000 00 01233444433333 No 58 >protein:vir:94989 Length: 349 # NCBI annotation: hypothetical protein # Family: family:all:1522 # MgeID: mge:1547 # MgeName: KS7 # Cross-refs: genbank:acc:YP_224029;genbank:gi:62327316;genbank:GeneID:5176817 Probab=92.20 E-value=0.012 Score=31.04 Aligned_cols=307 Identities=10% Similarity=0.095 Sum_probs=145.7 Q ss_pred CCcccchHHHHHHHHHHHHHhhcCchhHHHHHhhhhhhhhhhcccccccCCCCCccEEEEeecC---CCCCcEEEEEEee Q lcl|NC_018846. 1 MTTVTSAQANKLYQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLN---KQAGDEVTFSIMH 77 (404) Q Consensus 1 ~~~~~~~~a~~~~~~~lft~~~~n~~~~~~~~~~l~~~~~k~s~~~~~~Gt~~~~~I~~~~dL~---k~~Gd~v~f~L~~ 77 (404) |.+ |--.-++.-.--+|+..-.+.|. +.+ .+.++ .+|+.-.+|. ++.|+.|++++.. T Consensus 1 Ma~-T~l~D~iipe~~vf~~Yv~~~~~-------------e~~---~l~qS---Gii~~d~~l~~~~~~gG~~~~iPf~~ 60 (349) T protein:vir:94 1 MAI-TTIGNIVTGNIPVLASYMTEDPV-------------EKT---AFFNS---GILTPTPYAAEIARGPSNIANLPFWK 60 (349) T ss_pred CCc-eEEeeeeccChHHHHHHHHHhHH-------------Hhh---hhhhc---cceeccHHHHHHHhcCCCEEEeeeee Confidence 431 11111122122234433333331 112 22332 4666666665 4789999999999 Q ss_pred ccccC--c-eecCceee-cchhhhhhceeEEEEeeccceeccCChhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_018846. 78 KLSKR--P-TMGDERVE-GRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLA 153 (404) Q Consensus 78 ~L~G~--g-V~Gd~~le-Gnee~L~~~s~~v~Idq~R~aV~~~g~m~~qrs~~dlrk~ar~~L~~w~~~~~D~~~~~~la 153 (404) +|.|+ + +.|+...+ .--..+...+|.-++=...++-... -+++.-+--|..+....++++||.+.....+|-.|. T Consensus 61 ~l~g~~e~n~~~dt~~~~~t~~kit~~~~~a~~~~r~kaw~~~-Dla~~lsG~dpm~~Ia~~va~yW~r~~q~~Lia~L~ 139 (349) T protein:vir:94 61 AIDTSIEPNYSNDVYQDIATPRAIQTGEMMARVAYLNEGFGQA-DLTVELTSQNPLQSVASRLDNFWQRQAQRRLIATAL 139 (349) T ss_pred cCCCCcccccCCCCcccccccccccccceeeeeeeeccccchh-HHHHHhhCchHHHHHHHHHHHHHhhHHHHHHHHHHH Confidence 99885 3 44444321 2234566666666665555664443 356666667999999999999999999999999999 Q ss_pred hhhccccccccccccccccccccccccccCCCCCCcEEecCCccchhhhhhhccccHHHHHHHHHHHHHhCCCCccEEee Q lcl|NC_018846. 154 GARGDFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLS 233 (404) Q Consensus 154 G~~g~~~n~~~~~p~~~~~~~~~~~~N~v~apt~~r~~~~~~at~~~~i~~~D~~s~~~Id~~~~~a~~~a~pi~Pv~~~ 233 (404) |+-+..... .....+. .++. ..+.++..|+.+.+-.+...+-..+..-+ T Consensus 140 Gvf~~~~~~------~~~~~~~-----------~~~~---------~d~~~~a~~~~~~~~~A~~~~Gdaa~Gd~----- 188 (349) T protein:vir:94 140 GLYNDNVSA------TDAYHEQ-----------NDMV---------VDVSATSGFDAGAFIDATQTMGDALMGNG----- 188 (349) T ss_pred hhhcccccc------ccccccc-----------Ccee---------EEecccCCCChhhHHHHHHHHHHHhcccc----- Confidence 986521100 0000010 1111 11223334554433333333222222111 Q ss_pred cccccCccceEEEEecHHHHHHHhcCcchHHHHHHHHHHhhcccccCCcccccCceEEcCEEEEecCCceeeecccccee Q lcl|NC_018846. 234 GDELHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKVL 313 (404) Q Consensus 234 g~~~~~~~~~yV~~l~p~q~~~Lr~d~~~~~w~~~q~~A~~~~rg~~nPlF~G~~gm~ngvii~~~~~~~irf~~~~~~~ 313 (404) .+.+=++.||+..+..|++.-- ++.-+. +++ ...++.|+|..++.--.+|.- T Consensus 189 ------~~~lt~i~mHS~v~~~L~~~~l----i~~i~~----s~~------~~~i~ty~G~~VivDD~~Pv~-------- 240 (349) T protein:vir:94 189 ------GEVLGAIAMHSFVYAQARKAQL----IDFIRD----AEN------NTMFATYQGYRVIVDDSMTVV-------- 240 (349) T ss_pred ------ccceeEEEEchHHHHHHHhcch----hhhccC----ccc------CcccceecCcEEEEeCCCccc-------- Confidence 1235689999999999999632 322111 111 113577889766654444421 Q ss_pred ecccccccccccccccccchhheeecCceeEEEeecCCCCCceeeeccccccchHHHHHH-------HHhhhhhccccCC Q lcl|NC_018846. 314 VSENNLTATTKEVAAATNIDRAMLLGAQALANAYGQKAGGHFNMVEKKTDMDNRTEIAIS-------WINGLKKIRFPEK 386 (404) Q Consensus 314 ~~~~~~~~~~~~~a~~~~v~ralllGaqAl~~A~g~~~g~r~~w~Ee~~D~g~~~~i~i~-------~i~G~~K~rF~~~ 386 (404) ..+...+-...|+|..|.+..-+.++ .+ .|-..|--.+.+-+.+ .+++..=+.|... T Consensus 241 ------------~~g~~~~yttylfg~GAi~~~~~~~~-~~---~E~~rd~~~g~~~G~d~L~~R~~~~~hp~G~s~~~a 304 (349) T protein:vir:94 241 ------------GQDTSRKFISIIFGQGAIGYGEGNPE-MP---LEYEREASRANGGGVETLWTRKTWLLHPFGYSFTSA 304 (349) T ss_pred ------------cCCCCceEEEEEeecceEEeecCCCC-cc---eeeecccccCCcceeEEEEEeeEEEeeeeeeeeccc Confidence 01112244668999877776654321 11 2323333211111111 1222222333211 Q ss_pred C----C--------------CceEEEEEEEec-eecC Q lcl|NC_018846. 387 S----G--------------KMQDHGVIAVDT-AVKL 404 (404) Q Consensus 387 ~----g--------------~~~DfGvi~idt-aa~~ 404 (404) . + ++.-+-.+ +|. .++| T Consensus 305 ~v~~~~~~~~~~sPt~aeLa~~~NW~~v-~~~K~I~i 340 (349) T protein:vir:94 305 VITGNGTETIARSASWQDLANAANWNRV-VDRKHVPI 340 (349) T ss_pred ccCCCccccccCCCChHHhcCCcCcccc-cChhhcce Confidence 0 0 00111111 111 1111 No 59 >protein:vir:1781 Length: 221 # NCBI annotation: minor capsid protein # Family: family:all:975 # MgeID: mge:38 # MgeName: P60 # Cross-refs: genbank:acc:NP_570347;genbank:gi:18640506;genbank:GeneID:932719 Probab=91.26 E-value=0.017 Score=30.33 Aligned_cols=211 Identities=14% Similarity=0.123 Sum_probs=100.1 Q ss_pred EeeccceeccCChhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHh-hhhccccccccccccccccccccccccccCC Q lcl|NC_018846. 106 INQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLA-GARGDFVADDTILPTAEHPEFKKIMINDVLP 184 (404) Q Consensus 106 Idq~R~aV~~~g~m~~qrs~~dlrk~ar~~L~~w~~~~~D~~~~~~la-G~~g~~~n~~~~~p~~~~~~~~~~~~N~v~a 184 (404) ||....+=-.=..+++..+..|+|.+.-.....=|++-.|+-++.++. +++.. .|....+... .. T Consensus 1 iD~lL~a~~~VdDiD~aqa~~dvr~e~t~e~G~ALA~~~D~~i~~~~~~aA~~~-------~p~~~~~~g~--~~----- 66 (221) T protein:vir:17 1 MDDLLVASQFVYDLDEILAQWNTRSEISKQIGEALAIHYDERIARVLASASIAA-------APVTGQDGGF--SV----- 66 (221) T ss_pred CCcchhHHHHHHhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhc-------CcccccccCc--ce----- Confidence 776654411112689999999999999999999999999999998876 33210 0110000000 00 Q ss_pred CCCCcEEecCCccchhhhhhhccccHHHHHHHHHHHHHhCCCCccEEeecccccCccceEEEEecHHHHHHHhc--Ccch Q lcl|NC_018846. 185 PTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHGEDPYYVLYVTPRQWNDWYT--STSG 262 (404) Q Consensus 185 pt~~r~~~~~~at~~~~i~~~D~~s~~~Id~~~~~a~~~a~pi~Pv~~~g~~~~~~~~~yV~~l~p~q~~~Lr~--d~~~ 262 (404) .+-++..+ +.+.+ .+.|-.+...+++..-|. +. .+++++|.|+..|-+ |+-. T Consensus 67 -----~~~a~~t~------~~~~l-~dai~~a~~~LdekdVP~-------~g-------R~~vv~P~~y~~LL~~~d~~~ 120 (221) T protein:vir:17 67 -----NIGAGNTN------NAQAI-VDGFFEAAAVLDERSAPM-------DG-------RVAVLSPRQYYSLISSVDTNI 120 (221) T ss_pred -----eccccccC------CHHHH-HHHHHHHHHHHhhcCCCC-------CC-------CEEEeCcHHHHHHHHhcCcce Confidence 00011111 11111 344445666676665551 11 688899999999875 4421 Q ss_pred HHHHHHHHHHhhcccccCCccccc-CceEEcCEEEEecCCceeeeccccceeeccccccccc----ccccccccchhhee Q lcl|NC_018846. 263 KDWNQMMVRAVNRAKGFNHPLFKG-ECAMWRNILVRKYAGMPIRFYQGSKVLVSENNLTATT----KEVAAATNIDRAML 337 (404) Q Consensus 263 ~~w~~~q~~A~~~~rg~~nPlF~G-~~gm~ngvii~~~~~~~irf~~~~~~~~~~~~~~~~~----~~~a~~~~v~rall 337 (404) .++... +..--+..| ++++++|+-|.+-+++|... |..+ .++....++. ..+...+.=.-+|+ T Consensus 121 -------~n~d~~--~s~g~~~~g~~i~~v~G~~V~~SnnlP~~~--gt~~-~~~ag~~~~~~~~~~~yr~~fs~~~glv 188 (221) T protein:vir:17 121 -------LNREIG--NTQGDMNTGKGLYVNAGIRIYKSNVLASLY--GTNL-VTDPGDATTSGENNGSYRPAITDRAGLV 188 (221) T ss_pred -------eeeecc--cccccccccceeeeecCcEEEEeccCCccc--cccc-ccCCccccccccccccccccccceEEEE Confidence 111111 111224456 69999999999999887321 1110 1111111110 11111111122344 Q ss_pred ecCceeEEE--eecCCCCCceeeeccccccchHHHHHHHHhhhhhccccCCC Q lcl|NC_018846. 338 LGAQALANA--YGQKAGGHFNMVEKKTDMDNRTEIAISWINGLKKIRFPEKS 387 (404) Q Consensus 338 lGaqAl~~A--~g~~~g~r~~w~Ee~~D~g~~~~i~i~~i~G~~K~rF~~~~ 387 (404) .=..|++-. .|-+ .|+-.+-..|-- |-+... T Consensus 189 ~~~~Avgtvkl~~~~--~~~~~~~~~~~~-----------------~~~~~~ 221 (221) T protein:vir:17 189 FHKEAADTVEVLLPP--SRPPLVISMFSI-----------------RRPDRR 221 (221) T ss_pred EcchheeeeeeecCC--CCCceeeeeeec-----------------cCCCCC Confidence 444444433 2222 222222211110 000000 No 60 >protein:vir:3525 Length: 423 # NCBI annotation: major head protein # Family: family:all:1412 # MgeID: mge:72 # MgeName: APSE-1 # Cross-refs: genbank:acc:NP_050985;genbank:gi:9633571;genbank:GeneID:1262318 Probab=90.65 E-value=0.02 Score=29.93 Aligned_cols=289 Identities=14% Similarity=0.036 Sum_probs=128.6 Q ss_pred hcCch--h-HHHHHhhhhhhhhhhcccccccCCCCCccEEEEeec-CCCCCcEEEEEEeeccccCceecCceeecchhhh Q lcl|NC_018846. 22 NRNRS--M-VNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDL-NKQAGDEVTFSIMHKLSKRPTMGDERVEGRGEDL 97 (404) Q Consensus 22 ~~n~~--~-~~~~~~~l~~~~~k~s~~~~~~Gt~~~~~I~~~~dL-~k~~Gd~v~f~L~~~L~G~gV~Gd~~leGnee~L 97 (404) +.|+= . -++|++.+...-.+..-+....-+.-. .|. ..+.||+|++.......-.-....+...-+.+.+ T Consensus 1 MAN~llT~iP~iia~~al~~l~~~lV~~~lV~r~y~------ge~~~a~~GDTV~I~~p~~~~v~d~~~~~~~~~~~~~~ 74 (423) T protein:vir:35 1 MANNLESNISQIVLKKFLPGFMSDIVLCKTVDRQLL------SGEINSNTGDSVSFKRPHQFKSERTETGDITGKDKNGL 74 (423) T ss_pred CccchhhhhHHHHHHHHHHHHHhhcccchhcccCCC------cccccccCCCEEEEeeCCcceeecccCcCCCCcccccc Confidence 44442 2 266776654444433333222111100 011 2467999999987765422111111122234667 Q ss_pred hhceeEEEEeeccc-eeccCChhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccccccccccccccc Q lcl|NC_018846. 98 SHADFSLKINQGRH-LVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGDFVADDTILPTAEHPEFKK 176 (404) Q Consensus 98 ~~~s~~v~Idq~R~-aV~~~g~m~~qrs~~dlrk~ar~~L~~w~~~~~D~~~~~~laG~~g~~~n~~~~~p~~~~~~~~~ 176 (404) .-.+-.|.||+..+ ++...++ ++-...-||.+..++... -+++..|+.++..+..... T Consensus 75 ~e~~v~l~id~~k~~a~~v~d~-e~~l~i~~~~~~l~~a~~-ala~~vd~~l~~~l~~~a~------------------- 133 (423) T protein:vir:35 75 FSAKATGKVGKYITVAVEWTQI-EEALKLNQLDQILSPIHE-RMVTDLETELAHFMMNNGA------------------- 133 (423) T ss_pred ccceeeEEeccceeccceeCHH-HHHhhHHHHHHHHHHHHH-HHHHHHHHHHHHHHhhccc------------------- Confidence 76677899999987 6776653 222255567666666654 3666677777644421100 Q ss_pred ccccccCCCCCCcEEecCCccchhhhhhhccccHHHHHHHHHHHHHhCCCCccEEeecccccCccceEEEEecHHHHHHH Q lcl|NC_018846. 177 IMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHGEDPYYVLYVTPRQWNDW 256 (404) Q Consensus 177 ~~~N~v~apt~~r~~~~~~at~~~~i~~~D~~s~~~Id~~~~~a~~~a~pi~Pv~~~g~~~~~~~~~yV~~l~p~q~~~L 256 (404) |.+-.|. +..-.++.|-.+..++++..-|- ++ +.++++|..+..| T Consensus 134 ---~~vgt~~------------------t~~~~~~~i~~a~~~Ld~~~vP~------~~--------R~~Vv~p~~~a~L 178 (423) T protein:vir:35 134 ---LSLGSPN------------------TAIKKWADVAQTASFIKDIGIKT------GE--------NYAIMDPWSAQRL 178 (423) T ss_pred ---ccccccc------------------CCcchHHHHHHHHHHHHHhcCCc------CC--------CEEEeCHHHHHHH Confidence 0000010 00112566777888888877772 11 5889999998888 Q ss_pred hcCcchHHHHHHHHHHhhcccccCCcccccCc-eEEcCEEEEecCCceeeeccccceeecccccccccccccccccchhh Q lcl|NC_018846. 257 YTSTSGKDWNQMMVRAVNRAKGFNHPLFKGEC-AMWRNILVRKYAGMPIRFYQGSKVLVSENNLTATTKEVAAATNIDRA 335 (404) Q Consensus 257 r~d~~~~~w~~~q~~A~~~~rg~~nPlF~G~~-gm~ngvii~~~~~~~irf~~~~~~~~~~~~~~~~~~~~a~~~~v~ra 335 (404) ..+... +.+ + ..+...-|=.|.+ |++.|+-+.+..++|..- .+......... .+. .+.=. T Consensus 179 l~~~~~-----~~~-~---~~~~~~alr~g~i~G~i~GFdv~~Snnvp~~T-~gt~~~~~~v~-------~a~--~v~~~ 239 (423) T protein:vir:35 179 ADAQSG-----LHA-A---DQLVRTAWENAQISGNFGGIRALMSNGLASRK-QGDFDGAITVK-------TAP--NVDYL 239 (423) T ss_pred hccccc-----eec-c---ccchhHHHhhccceeeecceEEEEcCCCcccc-ccccccceeec-------ccc--ccccc Confidence 865331 111 1 1122344667776 999999999877776211 11000000000 000 00000 Q ss_pred eee--cCceeEEEeecCCCCCceeeec--cccccchHHHHHHHHhhh------hhccccC-CCCCceEEEEEEE------ Q lcl|NC_018846. 336 MLL--GAQALANAYGQKAGGHFNMVEK--KTDMDNRTEIAISWINGL------KKIRFPE-KSGKMQDHGVIAV------ 398 (404) Q Consensus 336 lll--GaqAl~~A~g~~~g~r~~w~Ee--~~D~g~~~~i~i~~i~G~------~K~rF~~-~~g~~~DfGvi~i------ 398 (404) -.- +++...++ -.|... ..--|+ +-.|-|+ .|-++.. +.+..+-|-|..- T Consensus 240 a~~~~~~~~~~~~--------~~~~~~~g~l~~GD-----~~t~aGv~~v~~~t~~~~~~~~t~~~~~~~V~~~~~~~a~ 306 (423) T protein:vir:35 240 SVKDSYQFTVALT--------GATPSKTGFLKAGD-----QLKFTSTHWLNQQSKQTLYNGSTAMSFTATVLEETNSTAS 306 (423) T ss_pred cccccccceeeee--------eeeeccCCcEEecc-----eEEeeeeeeccccccceeecccCCceeEEEEecccccccc Confidence 000 01111111 111110 111122 1123332 2222211 1112223333200 Q ss_pred -eceecC Q lcl|NC_018846. 399 -DTAVKL 404 (404) Q Consensus 399 -dtaa~~ 404 (404) .+.++| T Consensus 307 g~~~v~i 313 (423) T protein:vir:35 307 GDVTVKL 313 (423) T ss_pred CceeEEc Confidence 222333 No 61 >protein:vir:107120 Length: 329 # NCBI annotation: conserved phage protein # Family: family:all:701 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950606;genbank:gi:119953686;genbank:GeneID:4643129 Probab=90.57 E-value=0.02 Score=29.88 Aligned_cols=281 Identities=12% Similarity=0.047 Sum_probs=139.3 Q ss_pred CC-----cccchHHHHHHHHHHHHH--hhcCch-hHHHHHhhhhhhhhhhcccccccCCCCCccEEEEeecCCCCCcEEE Q lcl|NC_018846. 1 MT-----TVTSAQANKLYQVALFTA--ANRNRS-MVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVT 72 (404) Q Consensus 1 ~~-----~~~~~~a~~~~~~~lft~--~~~n~~-~~~~~~~~l~~~~~k~s~~~~~~Gt~~~~~I~~~~dL~k~~Gd~v~ 72 (404) |. +-|.-+-+.+ -|++ +.-|+- ..+++...|.....+.+.-.+. ..-++.+...|++|. T Consensus 12 ~~~~~~~~~~~~~~~~~----~~~~~~~~~nt~~l~~k~~~~LD~~~~~~~~s~~~---------~~N~~~e~~~g~tVk 78 (329) T protein:vir:10 12 MNKEIKNATGKLKLNLQ----HFANKSVEPGDTLLKNKHVGILEKVTAANSYSAPA---------VISNDAIFMQGRSFT 78 (329) T ss_pred hhhhhhcccceeEEehh----hhcCCccCCchhHHHHHHHHHHHHHHHhhceeeee---------ecccceeeccCcEEE Confidence 11 1111111111 1322 112222 2356766666554444332111 111233456799999 Q ss_pred EEEeeccccCceecCce-eec-chhhhhhceeEEEEeecc---ceeccCChhhhhhhhhhH--HHHHHHHHHHHHHHHHH Q lcl|NC_018846. 73 FSIMHKLSKRPTMGDER-VEG-RGEDLSHADFSLKINQGR---HLVDAGGRMSQQRTKFNL--ASSARTLLGTYFNDLQD 145 (404) Q Consensus 73 f~L~~~L~G~gV~Gd~~-leG-nee~L~~~s~~v~Idq~R---~aV~~~g~m~~qrs~~dl--rk~ar~~L~~w~~~~~D 145 (404) ++-+.- .++ +|-. -.| +.++++....++.+||.| +.|+ .|+..-+...+ -........+.+...+| T Consensus 79 Ip~i~~---~gl-~DY~R~~g~~~g~vt~~~~t~tidqdR~~~F~VD---~~D~dEtn~~l~a~~i~~~~~~~~v~pEiD 151 (329) T protein:vir:10 79 VIKGDV---TEL-KDYKRNATNEFDHPQIQETTYFLDQEKYWGRFVD---ALDRRDTEGNIDINYVVAKQASEVVAPYLD 151 (329) T ss_pred Eeeecc---ccc-ccccCCCCccccccccceeEEEeecccceeeecc---hhhHhhhhhhhhHHHHHHHHHHHHhhhHHH Confidence 876654 233 3332 222 355788899999999998 3444 46655555444 34455566666667777 Q ss_pred HHHHHHHhhhhccccccccccccccccccccccccccCCCCCCcEEecCCccchhhhhhhccccHHHHHHHHHHHHHhCC Q lcl|NC_018846. 146 QCAIVHLAGARGDFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAH 225 (404) Q Consensus 146 ~~~~~~laG~~g~~~n~~~~~p~~~~~~~~~~~~N~v~apt~~r~~~~~~at~~~~i~~~D~~s~~~Id~~~~~a~~~a~ 225 (404) .-.|-.|++.-+. .....++++- .++.|+.+..++++... T Consensus 152 ay~~skla~~a~~--------------------------------------~~~~~~t~~n--ay~~i~~a~~~Lde~~v 191 (329) T protein:vir:10 152 NLRFATLARNKAK--------------------------------------HLTVGSGADA--QYDAVLDVSVELDEIGA 191 (329) T ss_pred HHHHHHHHhhccc--------------------------------------ccccccCHHH--HHHHHHHHHHHHHhcCC Confidence 6666555432110 0011122221 36778888888887543 Q ss_pred CCccEEeecccccCccceEEEEecHHHHHHHhcCcchHHHHHHHHHHhhcccccCCcccccCceEEcCEEEEecCCceee Q lcl|NC_018846. 226 PLQPVRLSGDELHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIR 305 (404) Q Consensus 226 pi~Pv~~~g~~~~~~~~~yV~~l~p~q~~~Lr~d~~~~~w~~~q~~A~~~~rg~~nPlF~G~~gm~ngvii~~~~~~~ir 305 (404) | ++ ++||++|..+.-|+.++.+. +. .......++.|.+|.+||+.|.+.|.. | T Consensus 192 p------~~---------Rvl~VtP~~~~~Lk~~~~f~------~~----~~~~~~~~~~g~Vg~idG~~Ii~vps~--~ 244 (329) T protein:vir:10 192 G------AS---------RILFVTPKFYKGIKKFVIEL------PQ----GDNRQQVLGKGVQGELDGFTIVKVPSK--M 244 (329) T ss_pred C------CC---------cEEEeCHHHHHHHHhhhhhh------cc----ccccccceeeeeeeeecCeEEEEecCC--c Confidence 3 11 68999999999999987541 11 123456889999999999999987752 2 Q ss_pred eccccceeecccccccccccccccccchhheeecCceeEEEeecCCCCCceeeeccccccchHHHHHHHHhhhhhcccc- Q lcl|NC_018846. 306 FYQGSKVLVSENNLTATTKEVAAATNIDRAMLLGAQALANAYGQKAGGHFNMVEKKTDMDNRTEIAISWINGLKKIRFP- 384 (404) Q Consensus 306 f~~~~~~~~~~~~~~~~~~~~a~~~~v~ralllGaqAl~~A~g~~~g~r~~w~Ee~~D~g~~~~i~i~~i~G~~K~rF~- 384 (404) + . ...+|+|......+--+...++.+--++ +. ..+.+.| ..|. T Consensus 245 ~---------------------k----~in~ii~~~~A~~~~~K~~~~~~~~p~~-----~~---~a~~v~g---r~yyd 288 (329) T protein:vir:10 245 L---------------------Q----GVEAMAVIGEVMASPIQANEAKLNSNVP-----GM---FGTLAEQ---MLYTG 288 (329) T ss_pred c---------------------c----ceeEEEEcCCceeeeeeeeeeeeeCCCC-----cc---chheeee---eeeee Confidence 1 0 1356777665555544432222111111 11 1122221 1110 Q ss_pred CCCCCceEEEEEEEeceec-C Q lcl|NC_018846. 385 EKSGKMQDHGVIAVDTAVK-L 404 (404) Q Consensus 385 ~~~g~~~DfGvi~idtaa~-~ 404 (404) ..--..+=-||++.-+.++ . T Consensus 289 ~~V~~~k~~~I~~~~~~a~~~ 309 (329) T protein:vir:10 289 AFVPEHLQKYIFTIGGKEVET 309 (329) T ss_pred eEEEccccCEEEEecccCccc Confidence 0000001123332221111 1 No 62 >protein:vir:105374 Length: 423 # NCBI annotation: gene 5 protein # Family: family:all:1412 # MgeID: mge:1556 # MgeName: Sf6 # Cross-refs: genbank:acc:NP_958181;genbank:gi:41057283;genbank:GeneID:2716621 Probab=85.23 E-value=0.054 Score=27.52 Aligned_cols=294 Identities=12% Similarity=0.052 Sum_probs=130.4 Q ss_pred hcCch-h--HHHHHhhhhhhhhhhcccccccCCCCCccEEEEeecC-CCCCcEEEEEEeeccccCceecCceeecchhhh Q lcl|NC_018846. 22 NRNRS-M--VNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLN-KQAGDEVTFSIMHKLSKRPTMGDERVEGRGEDL 97 (404) Q Consensus 22 ~~n~~-~--~~~~~~~l~~~~~k~s~~~~~~Gt~~~~~I~~~~dL~-k~~Gd~v~f~L~~~L~G~gV~Gd~~leGnee~L 97 (404) +.|+= . -++|++.+...-.+...+....-+.... |.. ...||+|++.......-.-..+..--.-+-++| T Consensus 1 MaN~llT~~p~iia~~aL~~l~~~lV~~~lVnr~y~~------ef~~~k~GDTV~I~~p~~~~~~d~~~~~~~~~~~~dl 74 (423) T protein:vir:10 1 MPNNLDSNVSQIVLKKFLPGFMSDLVLAKTVDRQLLA------GEINSSTGDSVSFKRPHQFSSLRTPTGDISGQNKNNL 74 (423) T ss_pred CccchhhhhHHHHHHHHHHHHHhhcccchhhcccCCC------cccccccCCEEEEeeCCceeeeccCCccccccccCcc Confidence 45442 1 2567766544333333332222111100 111 247999999877765533222221112256888 Q ss_pred hhceeEEEEeeccc-eeccCChhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccccccccccccccc Q lcl|NC_018846. 98 SHADFSLKINQGRH-LVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGDFVADDTILPTAEHPEFKK 176 (404) Q Consensus 98 ~~~s~~v~Idq~R~-aV~~~g~m~~qrs~~dlrk~ar~~L~~w~~~~~D~~~~~~laG~~g~~~n~~~~~p~~~~~~~~~ 176 (404) .-.+-.|.||+..| ++....+ +.....-||-+..+++ ..=+++.+|+.++..+.+... T Consensus 75 ~e~~v~l~id~~k~va~~v~d~-E~~~~i~~~~~~l~~A-~~aLA~~vd~~ia~~~~~~~~------------------- 133 (423) T protein:vir:10 75 ISGKATGRVGNYITVAVEYQQL-EEAIKLNQLEEILAPV-RQRIVTDLETELAHFMMNNGA------------------- 133 (423) T ss_pred ccceeEEEeeceeeeeeeechH-HHhcChhhHHHHHHHH-HHHHHHHHHHHHHHHHhhccc------------------- Confidence 88888999999987 6666542 2222334453222222 344667777776543322110 Q ss_pred ccccccCCCCCCcEEecCCccchhhhhhhccccHHHHHHHHHHHHHhCCCCccEEeecccccCccceEEEEecHHHHHHH Q lcl|NC_018846. 177 IMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHGEDPYYVLYVTPRQWNDW 256 (404) Q Consensus 177 ~~~N~v~apt~~r~~~~~~at~~~~i~~~D~~s~~~Id~~~~~a~~~a~pi~Pv~~~g~~~~~~~~~yV~~l~p~q~~~L 256 (404) |.+..|. . ..+ .++.|-.+..++++..-|- ++ +.++++|.-+..| T Consensus 134 ---~~~gt~~--------t--------~~~--a~~~i~~a~~~Ld~~~vP~------~~--------R~~Vv~p~~~a~L 178 (423) T protein:vir:10 134 ---LSLGSPN--------T--------PIT--KWSDVAQTASFLKDLGVNE------GE--------NYAVMDPWSAQRL 178 (423) T ss_pred ---cccccCC--------c--------ccc--hHHHHHHHHHHHHhccCCc------CC--------CEEEeChHHHHHH Confidence 0000000 0 001 2555666788888877772 11 5679999999888 Q ss_pred hcCcchHHHHHHHHHHhhcccccCCcccccCc-eEEcCEEEEecCCceeeeccccceeecccccccccccccccccchhh Q lcl|NC_018846. 257 YTSTSGKDWNQMMVRAVNRAKGFNHPLFKGEC-AMWRNILVRKYAGMPIRFYQGSKVLVSENNLTATTKEVAAATNIDRA 335 (404) Q Consensus 257 r~d~~~~~w~~~q~~A~~~~rg~~nPlF~G~~-gm~ngvii~~~~~~~irf~~~~~~~~~~~~~~~~~~~~a~~~~v~ra 335 (404) ..+... + .++ ..+...-|=.|.+ |.+.|+-+.+..++|-. -.+.... +.....+..|.-+ T Consensus 179 l~~~~~-----~-~~~---~~~~~~alr~g~i~G~i~GFdv~~Snnip~~-T~gt~~~---------t~~~~~~~~v~~~ 239 (423) T protein:vir:10 179 ADAQTG-----L-HAS---DQLVRTAWENAQIPTNFGGIRALMSNGLASR-TQGAFGG---------TLTVKTQPTVTYN 239 (423) T ss_pred hccccc-----e-ecc---cccchhhhhhccceeeecceEEEEeCCCccc-ccccccc---------ceeeeecceeccc Confidence 865431 0 111 1122344666776 89999999987776521 0110000 0000000111111 Q ss_pred eeecCce--eEEE--eecCCCCCceeeeccccccchHH-HHHHHHhhhhhccc-cCCCCCceEEEEEEE-------ecee Q lcl|NC_018846. 336 MLLGAQA--LANA--YGQKAGGHFNMVEKKTDMDNRTE-IAISWINGLKKIRF-PEKSGKMQDHGVIAV-------DTAV 402 (404) Q Consensus 336 lllGaqA--l~~A--~g~~~g~r~~w~Ee~~D~g~~~~-i~i~~i~G~~K~rF-~~~~g~~~DfGvi~i-------dtaa 402 (404) .--|++. +.++ |....| ..--|+..- -++..+.=+.|-.+ +.+.+..+-|-|.+- ++.+ T Consensus 240 a~~~a~~~~~~~~~~~~~~~~--------~l~~GD~~t~aGv~~v~~~tk~~~~~~~t~~~~~~~v~a~~~~~~~g~~tv 311 (423) T protein:vir:10 240 AVKDSYQFTVTLTGATASVTG--------FLKAGDQVKFTNTYWLQQQTKQALYNGATPISFTATVTADANSDSGGDVTV 311 (423) T ss_pred cccccceeeeeeeeccccccC--------ceeecceEEecceeeecccccccccccccCcceEEEEEeeeeeccCCceee Confidence 1111111 1111 111000 111122110 01111222333333 223445566666643 2344 Q ss_pred cC Q lcl|NC_018846. 403 KL 404 (404) Q Consensus 403 ~~ 404 (404) +| T Consensus 312 ~i 313 (423) T protein:vir:10 312 TL 313 (423) T ss_pred ec Confidence 44 No 63 >protein:vir:102335 Length: 312 # NCBI annotation: putative capsid protein # Family: family:all:701 # MgeID: mge:1566 # MgeName: phi CD119 # Cross-refs: genbank:acc:YP_529560;genbank:gi:90592716;genbank:GeneID:3974467 Probab=84.03 E-value=0.064 Score=27.14 Aligned_cols=280 Identities=12% Similarity=0.048 Sum_probs=142.9 Q ss_pred hcCch-hHHHHHhhhhhhhhhhcccccccCCCCCccEEEEeecCCCCCcEEEEEEeeccccCceecCcee-ec---chhh Q lcl|NC_018846. 22 NRNRS-MVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLSKRPTMGDERV-EG---RGED 96 (404) Q Consensus 22 ~~n~~-~~~~~~~~l~~~~~k~s~~~~~~Gt~~~~~I~~~~dL~k~~Gd~v~f~L~~~L~G~gV~Gd~~l-eG---nee~ 96 (404) +.|+= ...+|...|.....+.+.+... +..+.-|+ . .-|.+|.++=+ +-.|. +|-.. .| +..+ T Consensus 1 Mantl~ya~~~~~~LD~~~~~~~~s~~l--~~~~~~v~-~-----~ggktVkIp~i---~~~gl-~DY~R~~g~~~~~g~ 68 (312) T protein:vir:10 1 MANTLAYGQVLQQGLDKQATQELLTGWM--DSNAKQIK-Y-----EGGKEVKIGKL---STDGL-GDYSRGSANAYVGGD 68 (312) T ss_pred CCcchhHHHHHHHHHHHHHHhhhccccc--cCCCceEE-E-----ecCcEEEEEee---ecccc-cccccccCCcccccc Confidence 44322 2467777776666666654433 22232232 2 34889997743 33343 44443 44 3346 Q ss_pred hhhceeEEEEeecc---ceeccCChhhhhhh--hhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccccccccc Q lcl|NC_018846. 97 LSHADFSLKINQGR---HLVDAGGRMSQQRT--KFNLASSARTLLGTYFNDLQDQCAIVHLAGARGDFVADDTILPTAEH 171 (404) Q Consensus 97 L~~~s~~v~Idq~R---~aV~~~g~m~~qrs--~~dlrk~ar~~L~~w~~~~~D~~~~~~laG~~g~~~n~~~~~p~~~~ 171 (404) ++....+..++|-| +.|+ +|+..-| .+.+-...+....+...-.+|.-.|-.|+..-. T Consensus 69 v~~~~et~tl~qDR~~~F~vD---~mDvDETn~~~s~anv~~ef~r~~vvPEiDayrfskla~~a~-------------- 131 (312) T protein:vir:10 69 VKFEYETKTMTQDRGRKFTLD---AMDVDETNFLVTATTVMGEFQRLKVIPEIDAYRLSRLATIAI-------------- 131 (312) T ss_pred ccccceeEEeeecccceeecc---ccchhhHhhHHHHHHHHHHHHHhhhcchhhHHHHHHHHhhhh-------------- Confidence 88888899999988 4555 3443333 234444555555555555666666655542110 Q ss_pred cccccccccccCCCCCCcEEecCCccchhhhhhhccccHHHHHHHHHHHHHhCCCCccEEeecccccCccceEEEEecHH Q lcl|NC_018846. 172 PEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHGEDPYYVLYVTPR 251 (404) Q Consensus 172 ~~~~~~~~N~v~apt~~r~~~~~~at~~~~i~~~D~~s~~~Id~~~~~a~~~a~pi~Pv~~~g~~~~~~~~~yV~~l~p~ 251 (404) ..... +..+...+++.+.. ++.|+.+.+++++.+-| + -+||+|+|. T Consensus 132 -----------~~~~~------~~~~~~~~~T~~ni--~~~i~~~~~~lde~~vp-------~--------~rvl~vTp~ 177 (312) T protein:vir:10 132 -----------GIKGD------TNVEYSYSVNSSTI--INKIKTGIKIIRENGYN-------G--------PLVCHLTYD 177 (312) T ss_pred -----------ccccc------cccccccccCHHHH--HHHHHHHHHHHHHccCC-------C--------ceEEEeChH Confidence 00000 00011112333332 35566666677765544 1 179999999 Q ss_pred HHHHHhcCcchHHHHHHHHHHhhcccccCCcccccCceEEcCEEEEecCCceeeeccccceeeccccccccccccccccc Q lcl|NC_018846. 252 QWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKVLVSENNLTATTKEVAAATN 331 (404) Q Consensus 252 q~~~Lr~d~~~~~w~~~q~~A~~~~rg~~nPlF~G~~gm~ngvii~~~~~~~irf~~~~~~~~~~~~~~~~~~~~a~~~~ 331 (404) -+.-|+++..+ . ++. ....+....+.++.+|||.|++.|. -||+..=.+.--..+....+....+..+ T Consensus 178 ~~~lLk~~~~~---~-~~~------~~~~~~~i~~~V~~iDgv~Ii~VPs--~r~~t~~~f~dG~t~~~~~gg~~~~~~a 245 (312) T protein:vir:10 178 SMFAIEEKVLE---K-LTA------VTFAQGGIQTQVPSIDGCALIKTPQ--NRMYSSILLNDGTTSNQTAGGYLKGTKA 245 (312) T ss_pred HHHHHhhhhhc---e-ecc------cccccceeeeeeeeecccEEEEchh--hhccceeeeccCcccccccCceeecCcc Confidence 99888876321 1 100 1123445689999999999999886 3765321110000011111222223335 Q ss_pred chhheeecCceeEEEeecCCCCCce----------eeeccccccchHHHHHHHHhhhhhccccCCCCCceEEEEEEEece Q lcl|NC_018846. 332 IDRAMLLGAQALANAYGQKAGGHFN----------MVEKKTDMDNRTEIAISWINGLKKIRFPEKSGKMQDHGVIAVDTA 401 (404) Q Consensus 332 v~ralllGaqAl~~A~g~~~g~r~~----------w~Ee~~D~g~~~~i~i~~i~G~~K~rF~~~~g~~~DfGvi~idta 401 (404) -.+++||=...+.+|.-|+.-.+.+ |.=+..-| |.+++++.. T Consensus 246 k~INfiiv~~~a~i~~~K~~~~~if~P~~~~~~d~~~~~~R~Y----------------------------~D~fv~~nk 297 (312) T protein:vir:10 246 LDTNFIIAPVDVPLAITKQDKMRIFDPETNQTANAWSMDYRRY----------------------------HDLWVTDNK 297 (312) T ss_pred cccceEEeCCceeeceeeeeeeeeeCCCCCCCcceeeeeeeee----------------------------eeeeeeccc Confidence 5688999999999998764322221 21111111 222333332 Q ss_pred ecC Q lcl|NC_018846. 402 VKL 404 (404) Q Consensus 402 a~~ 404 (404) .+- T Consensus 298 ~~~ 300 (312) T protein:vir:10 298 ANS 300 (312) T ss_pred cCe Confidence 222 No 64 >protein:vir:105464 Length: 346 # NCBI annotation: putative phage major capsid protein # Family: family:all:701 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529874;genbank:gi:90592614;genbank:GeneID:3974528 Probab=73.80 E-value=0.17 Score=24.86 Aligned_cols=280 Identities=13% Similarity=0.053 Sum_probs=129.3 Q ss_pred hcCchhHHHHHhhhhhhhhhhccc-ccccCCCCCccEEEEeecCCCCCcEEEEEEeeccccCceecCc-eeecc--hhhh Q lcl|NC_018846. 22 NRNRSMVNILTEQQEAPKAVSPDK-KSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLSKRPTMGDE-RVEGR--GEDL 97 (404) Q Consensus 22 ~~n~~~~~~~~~~l~~~~~k~s~~-~~~~Gt~~~~~I~~~~dL~k~~Gd~v~f~L~~~L~G~gV~Gd~-~leGn--ee~L 97 (404) +..+. .++|+..|.....+.+-. ....++-.+.-+. . ..|.+|.++-+.--+| .+|- +-.|- ..++ T Consensus 1 Mainy-a~~~~~~Ld~~~~~~~lts~~l~~~~~~~~v~-~-----~ggktVkIp~is~tsG---l~DY~R~~g~~~~g~v 70 (346) T protein:vir:10 1 MTINY-AEKYQAAVQQAFYDGHLYSAELWNSPSNSIIK-F-----DGAKHIKVPRLEITSG---RKDRQRRTITTPVANY 70 (346) T ss_pred Ccchh-HHHHHHHHHHHHHhhhccchhhcccccccceE-e-----cCCCEEEEEEeeeecc---cccccccCCccccccc Confidence 11111 234555443333322211 1122222221111 1 2478888665531122 1232 22222 2467 Q ss_pred hhceeEEEEeecc---ceeccCChhhhhhh--hhhHHHHHHHHHHHHHHHHHHHHHHHHHhh-hhccccccccccccccc Q lcl|NC_018846. 98 SHADFSLKINQGR---HLVDAGGRMSQQRT--KFNLASSARTLLGTYFNDLQDQCAIVHLAG-ARGDFVADDTILPTAEH 171 (404) Q Consensus 98 ~~~s~~v~Idq~R---~aV~~~g~m~~qrs--~~dlrk~ar~~L~~w~~~~~D~~~~~~laG-~~g~~~n~~~~~p~~~~ 171 (404) +....++.++|-| +.|+ .|+..-| ...+-........+...-.+|.-.|-.|+. +.+ T Consensus 71 ~~~~et~tl~qDR~~~F~vD---~mDvDETn~~~~~anv~~ef~r~~vvPEiDayrfskLa~~a~~-------------- 133 (346) T protein:vir:10 71 SNDWDSYELKNERYWSTLVD---PSDIDETNMVVSLANITKQFNLDSKMPEKDRYMFSHLYSGKEA-------------- 133 (346) T ss_pred ccceeEEEeeccccceeccc---ccchHHHHHHhHHHHHHHHHHHHhhcchhhHHHHHHHHHhhhh-------------- Confidence 8888888899888 4444 3432221 122233333333333444455555544432 111 Q ss_pred cccccccccccCCCCCCcEEecCCccchhhhhhhccccHHHHHHHHHHHHHhCCCCccEEeecccccCccceEEEEecHH Q lcl|NC_018846. 172 PEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHGEDPYYVLYVTPR 251 (404) Q Consensus 172 ~~~~~~~~N~v~apt~~r~~~~~~at~~~~i~~~D~~s~~~Id~~~~~a~~~a~pi~Pv~~~g~~~~~~~~~yV~~l~p~ 251 (404) .+ ++.....+++++.. ++.|+.+..+++...-|-. + +||+++|. T Consensus 134 -------~~-------------~~~~~~~a~T~~ni--~~~i~~~~~~lde~~vp~~-------------~-rvl~vTp~ 177 (346) T protein:vir:10 134 -------AH-------------DGGITTNTLDEKNI--LPAFDNMMLDFDEARIPST-------------N-RILYVTPK 177 (346) T ss_pred -------hc-------------cccccccccCHHHH--HHHHHHHHHHHHHccCCCC-------------C-eEEEECHH Confidence 00 01111122333322 4566666667766554411 1 89999999 Q ss_pred HHHHHhcCcchHHHHHHHHHHhhcccccCCcccccCceEEcCEEEEecCCceeeeccccceeeccccccccccccccccc Q lcl|NC_018846. 252 QWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKVLVSENNLTATTKEVAAATN 331 (404) Q Consensus 252 q~~~Lr~d~~~~~w~~~q~~A~~~~rg~~nPlF~G~~gm~ngvii~~~~~~~irf~~~~~~~~~~~~~~~~~~~~a~~~~ 331 (404) -+.-|+.++.+. +.-.. +..+ ...|.+|.+|||.|++.|. -||+.--. ..+ +.. ++..+ T Consensus 178 ~~~lLk~s~~f~------k~~~v---~~~~-~i~~~V~siDGv~Ii~VPs--~r~~t~~~---f~~--G~~----~~t~a 236 (346) T protein:vir:10 178 TNAILKRAEAMN------RALTL---KDPN-NIQRTVYSLDDVTIRVVPS--DLMQTAYD---FSD--GSK----IIDTA 236 (346) T ss_pred HHHHHhhchhhe------ecccc---cccc-ccceeeeeecCeEEEEcch--hhcccchh---hcc--Ccc----ccCCc Confidence 999999988752 32211 1122 3589999999999999886 47763110 001 111 12223 Q ss_pred chhheeecCceeEEEeecCCCCCceeeeccccccchHHHHHHHHhhhhhccccCCCCCceEEEEEEEeceecC Q lcl|NC_018846. 332 IDRAMLLGAQALANAYGQKAGGHFNMVEKKTDMDNRTEIAISWINGLKKIRFPEKSGKMQDHGVIAVDTAVKL 404 (404) Q Consensus 332 v~ralllGaqAl~~A~g~~~g~r~~w~Ee~~D~g~~~~i~i~~i~G~~K~rF~~~~g~~~DfGvi~idtaa~~ 404 (404) -.+.+||-...+.+|.-|+.-.+.+=-. ....|. +.+. =|+ =|.+++++...+- T Consensus 237 k~INfiiv~~~A~ia~~K~~~~~if~P~-~~~~g~-------~l~~---~R~--------Y~D~fv~~nk~~~ 290 (346) T protein:vir:10 237 KQIEMFLIYNGVQIAPEKYSFVGFDQPS-AATSGN-------YLYY---EQS--------YDDVLLLNTKTKG 290 (346) T ss_pred cceeEEEECCceeeeeeeeeeeEeeCCC-CCcccc-------eeee---eee--------eeeeeeeccccce Confidence 4577999999999988764333222100 001110 0000 011 1223333333222 No 65 >protein:vir:95131 Length: 325 # NCBI annotation: hypothetical protein ORF010 # Family: family:all:47 # MgeID: mge:1552 # MgeName: PA73 # Cross-refs: genbank:acc:YP_001293417;genbank:gi:148912838;genbank:GeneID:5228206 Probab=73.18 E-value=0.17 Score=24.75 Aligned_cols=291 Identities=13% Similarity=0.052 Sum_probs=114.5 Q ss_pred HHHHHHhhcCchhHHHHHhhhhhhhhhhcccccccCCCCCccEEEEeecCCCCCcEEEEEEeeccccC-----ceecCce Q lcl|NC_018846. 15 VALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLSKR-----PTMGDER 89 (404) Q Consensus 15 ~~lft~~~~n~~~~~~~~~~l~~~~~k~s~~~~~~Gt~~~~~I~~~~dL~k~~Gd~v~f~L~~~L~G~-----gV~Gd~~ 89 (404) -+||-..==|......+.+++.-... .+. ..+...|+.-+++. .||-|.+++-.+|.|. .+.++.. T Consensus 1 m~lsD~~vfN~~~~~a~~e~~~q~~~------~fn-~as~gai~l~~~~~--~Gd~~~~pf~~~l~g~~~~~~~~~~~~~ 71 (325) T protein:vir:95 1 MALSDLAVYSEYAYSAFSETLRQQVD------LFN-TATGGAIMLQSAAH--QGDFSDVAFFAKVTGGLVRRRNAYGSGT 71 (325) T ss_pred CchhhhhhhhhhhhhhhhhhhhhhHh------hhh-hcccceeEeccccc--cCceeeccccccccccccccccCCCCce Confidence 12222111111122222232211111 112 34456677666653 4999999999999873 3434433 Q ss_pred eecchhhhhhceeEEEE-eeccceeccCChhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccccccc Q lcl|NC_018846. 90 VEGRGEDLSHADFSLKI-NQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGDFVADDTILPT 168 (404) Q Consensus 90 leGnee~L~~~s~~v~I-dq~R~aV~~~g~m~~qrs~~dlrk~ar~~L~~w~~~~~D~~~~~~laG~~g~~~n~~~~~p~ 168 (404) ++. ..|....+.-++ -..+-++.. ..++...-.|-...+...++++|++++.+.++.++-|.-..-.. T Consensus 72 vt~--~kitt~~~~av~~~r~~g~~~~--d~~~~~~g~~~~~~~~~~Ig~~~a~~~~~~~l~~~~~~l~~a~~------- 140 (325) T protein:vir:95 72 VAE--KVLKHLVDTSVKVAAGTPPVRL--DPGQFRWIQQNPEVAGAAMGQQLAVDTMADMLNVGLGSVYSALS------- 140 (325) T ss_pred ecc--ceeccccceeeEEecccCcccc--cHHHHhhcCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhc------- Confidence 333 233433332222 111111111 12222222222334445566666666666655554332210000 Q ss_pred ccccccccccccccCCCCCCcEEecCCccchhhhhhhccccHHHHHHHHHHHHHhCCCCccEEeecccccCccceEEEEe Q lcl|NC_018846. 169 AEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHGEDPYYVLYV 248 (404) Q Consensus 169 ~~~~~~~~~~~N~v~apt~~r~~~~~~at~~~~i~~~D~~s~~~Id~~~~~a~~~a~pi~Pv~~~g~~~~~~~~~yV~~l 248 (404) ....+++++.+ .+. .++..+|.+.+-++..++ ||. .+..=.++| T Consensus 141 -----~~~~~v~dis~-----------~~~----~~~~~~s~~~l~~A~~kl-------------GD~---~~~l~~~~M 184 (325) T protein:vir:95 141 -----QVSDVVYDATA-----------NTD----AADKLPTWNNLNNGQAKF-------------GDQ---SSQIAAWIM 184 (325) T ss_pred -----ccccceeeeec-----------ccC----cccccccHHHHHHHHHHh-------------ccc---ccceeEEEE Confidence 00011222221 010 123456777665555443 332 133568999 Q ss_pred cHHHHHHHhcCcchHHHHHHHHHHhhcccccCCcccccCceEEcCEEEEecCCceeeeccccceeecccccccccccccc Q lcl|NC_018846. 249 TPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKVLVSENNLTATTKEVAA 328 (404) Q Consensus 249 ~p~q~~~Lr~d~~~~~w~~~q~~A~~~~rg~~nPlF~G~~gm~ngvii~~~~~~~irf~~~~~~~~~~~~~~~~~~~~a~ 328 (404) |+.-+++|+++--.. +.++..+ .+ .+ . +.-|.|-.++---.+|. ...++. T Consensus 185 HS~v~~~L~~~~L~~-~~~~~~~-----~g-~~-~----i~t~~G~~VIVdD~~p~------------------~~~g~~ 234 (325) T protein:vir:95 185 HSTPMHKLYGSNLTN-GERLFTY-----GT-VN-V----VRDPFGKLLVMTDSPNL------------------FAAGTP 234 (325) T ss_pred chHHHHHHHHhhccc-ccccccc-----CC-cc-c----ccccCCcEEEEeCCCCC------------------CCccCc Confidence 999999999853210 1111010 00 01 1 12344432221111110 001111 Q ss_pred cccchhheeecCceeEEEeecCCCCCceeeeccccccchHHHHHH----HHhhhhhccccCCCCCceEEEEEEEeceecC Q lcl|NC_018846. 329 ATNIDRAMLLGAQALANAYGQKAGGHFNMVEKKTDMDNRTEIAIS----WINGLKKIRFPEKSGKMQDHGVIAVDTAVKL 404 (404) Q Consensus 329 ~~~v~ralllGaqAl~~A~g~~~g~r~~w~Ee~~D~g~~~~i~i~----~i~G~~K~rF~~~~g~~~DfGvi~idtaa~~ 404 (404) .+-+.++||..|.++-.+.+.+. .....+=..+.+.... .+++.+=.+|....++. . +|-+.| T Consensus 235 --~~ytty~lg~GAi~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~tf~lhp~G~sw~~s~~g~------s-Pt~aeL 301 (325) T protein:vir:95 235 --NVYHILGLVPGGVLIGQNNDFDA----NEETKNGDENIIRTYQAEWSYNIGVKGFAWDKANGGK------S-PTDAAL 301 (325) T ss_pred --eeEEEEEEecCeEEecCCCCccc----cccccCcccceeeeeeeeeeEEeecceeeeecccccC------C-cChHhh Confidence 13367899988877665443221 1111121222222222 23344444553222110 0 233333 No 66 >protein:vir:174 Length: 423 # NCBI annotation: capsid protein # Family: family:all:1412 # MgeID: mge:5 # MgeName: HK620 # Cross-refs: genbank:acc:NP_112079;genbank:gi:13559869;genbank:GeneID:920999 Probab=61.67 E-value=0.35 Score=23.09 Aligned_cols=295 Identities=12% Similarity=0.053 Sum_probs=125.1 Q ss_pred hcCch-h--HHHHHhhhhhhhhhhcccccccCCCCCccEEEEeec-CCCCCcEEEEEEeeccccCceecCceeec-chhh Q lcl|NC_018846. 22 NRNRS-M--VNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDL-NKQAGDEVTFSIMHKLSKRPTMGDERVEG-RGED 96 (404) Q Consensus 22 ~~n~~-~--~~~~~~~l~~~~~k~s~~~~~~Gt~~~~~I~~~~dL-~k~~Gd~v~f~L~~~L~G~gV~Gd~~leG-nee~ 96 (404) +.|+= . .++|++.+...-.+..-+....-+... .|. ....||+|++.......-.-..+.+ ..+ .-++ T Consensus 1 MaN~llT~ip~iia~~al~~l~~~lV~~~lVnr~y~------~e~~~~k~GDTV~I~~p~~~~~~~~~~~~-~~~~~~~~ 73 (423) T protein:vir:17 1 MPNNLDSNVSQIVLKKFLPGFMSDLVLAKTVDRQLL------AGEINSSTGDSVSFKRPHQFSSLRTPTGD-ISGQNKNN 73 (423) T ss_pred CccchhhhhHHHHHHHHHHHHHhhcccchhhcccCC------cchhhcccCCEEEEeeCCcceeecccCcc-cCCcccCc Confidence 45442 1 256776654444433333222111100 011 1247999999876655432222211 112 3577 Q ss_pred hhhceeEEEEeeccc-eeccCChhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccccccccccccc Q lcl|NC_018846. 97 LSHADFSLKINQGRH-LVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGDFVADDTILPTAEHPEFK 175 (404) Q Consensus 97 L~~~s~~v~Idq~R~-aV~~~g~m~~qrs~~dlrk~ar~~L~~w~~~~~D~~~~~~laG~~g~~~n~~~~~p~~~~~~~~ 175 (404) |.-.+-.|.||+..| ++...++ ++....-||-+..+++ ..=+++..|+.++..+.+... T Consensus 74 l~e~~v~l~id~~k~va~~v~d~-E~~~~i~~~~~~l~~A-~~aLA~~vd~~ia~~~~~~a~------------------ 133 (423) T protein:vir:17 74 LISGKATGRVGNYITVAVEYQQL-EEAIKLNQLEEILAPV-RQRIVTDLETELAHFMMNNGA------------------ 133 (423) T ss_pred cccceeEEEeeceeeeeeeecHH-HHhcChhHHHHHHHHH-HHHHHHHHHHHHHHHHhhccc------------------ Confidence 777778999999987 5666542 1122334452222222 344666777766533322110 Q ss_pred cccccccCCCCCCcEEecCCccchhhhhhhccccHHHHHHHHHHHHHhCCCCccEEeecccccCccceEEEEecHHHHHH Q lcl|NC_018846. 176 KIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHGEDPYYVLYVTPRQWND 255 (404) Q Consensus 176 ~~~~N~v~apt~~r~~~~~~at~~~~i~~~D~~s~~~Id~~~~~a~~~a~pi~Pv~~~g~~~~~~~~~yV~~l~p~q~~~ 255 (404) |.+..|. +..+ .++.|-.+..++++..-|- ++ +.++++|.-+.. T Consensus 134 ----~~~gt~~----------------t~~~--a~~~i~~a~~~Ld~~~vP~------~~--------R~~Vv~p~~~a~ 177 (423) T protein:vir:17 134 ----LSLGSPN----------------TPIT--KWSDVAQTASFLKDLGVNE------GE--------NYAVMDPWSAQR 177 (423) T ss_pred ----cccccCC----------------cccc--cHHHHHHHHHHHHhccCCc------CC--------CEEEeChHHHHH Confidence 0000010 0011 2555666788888877772 11 567999999988 Q ss_pred HhcCcchHHHHHHHHHHhhcccccCCcccccCc-eEEcCEEEEecCCceeeeccccceeecccccccccccccccccchh Q lcl|NC_018846. 256 WYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGEC-AMWRNILVRKYAGMPIRFYQGSKVLVSENNLTATTKEVAAATNIDR 334 (404) Q Consensus 256 Lr~d~~~~~w~~~q~~A~~~~rg~~nPlF~G~~-gm~ngvii~~~~~~~irf~~~~~~~~~~~~~~~~~~~~a~~~~v~r 334 (404) |..+... +.+ + ..+...-|=.|.+ |.+.|+-+.+..++|-. -.+.-.. +..........+++... T Consensus 178 Ll~~~~~-----~~~-~---~~~~~~alr~g~i~G~i~GFdvy~Snnip~~-T~gt~~~-t~~~~~~~~v~~~a~~~--- 243 (423) T protein:vir:17 178 LADAQTG-----LHA-S---DQLVRTAWENAQIPTNFGGIRALMSNGLASR-TQGAFGG-TLTVKTQPTVTYNAVKD--- 243 (423) T ss_pred Hhccccc-----eec-c---cccchHHHhhccceeeecceEEEEeCCCccc-cccceec-eeeeccccccccccccc--- Confidence 8876431 111 1 1122344555666 89999999887776511 0010000 00000000000000000 Q ss_pred heeecCceeEEEeecCCCCCceeee--ccccccchHHH-HHHHHhhhhhcccc-CCCCCceEEEEEE-------Eeceec Q lcl|NC_018846. 335 AMLLGAQALANAYGQKAGGHFNMVE--KKTDMDNRTEI-AISWINGLKKIRFP-EKSGKMQDHGVIA-------VDTAVK 403 (404) Q Consensus 335 alllGaqAl~~A~g~~~g~r~~w~E--e~~D~g~~~~i-~i~~i~G~~K~rF~-~~~g~~~DfGvi~-------idtaa~ 403 (404) + +.+-..++ -.|.. ...--|+..-+ ++.++.=+.|-.+. .+.+..+-|.|.+ =++.++ T Consensus 244 ~---~~~~~~~~--------~~~~~~~g~l~~GD~~t~aGv~~v~~~tk~v~~~~~t~~~~~~~v~~~~~~~a~~~~tv~ 312 (423) T protein:vir:17 244 S---YQFTVTLT--------GATTSVTGFLKAGDQVKFTNTYWLQQQTKQALYNGATPISFTATVTADANSDSSGDVTVT 312 (423) T ss_pred c---cceeeeee--------eeeeeccCceeecceEEecceeeecccccccccccccccceEEEEEecccccccCceEEE Confidence 0 00000011 01110 01111221100 11111122333332 2333456666643 122333 Q ss_pred C Q lcl|NC_018846. 404 L 404 (404) Q Consensus 404 ~ 404 (404) | T Consensus 313 i 313 (423) T protein:vir:17 313 L 313 (423) T ss_pred e Confidence 3 No 67 >protein:vir:8102 Length: 543 # NCBI annotation: gp6 # Family: family:all:21 # MgeID: mge:152 # MgeName: Che9c # Cross-refs: genbank:acc:NP_817683;genbank:gi:29566114;genbank:GeneID:1259308 Probab=56.45 E-value=0.46 Score=22.45 Aligned_cols=304 Identities=11% Similarity=0.038 Sum_probs=117.7 Q ss_pred CC-cccchHHHHH------------------HHHHHHHHhhcCchh----H-HHHHhhhhhhhhhhcccccccCCCCCcc Q lcl|NC_018846. 1 MT-TVTSAQANKL------------------YQVALFTAANRNRSM----V-NILTEQQEAPKAVSPDKKSTKQTSAGAP 56 (404) Q Consensus 1 ~~-~~~~~~a~~~------------------~~~~lft~~~~n~~~----~-~~~~~~l~~~~~k~s~~~~~~Gt~~~~~ 56 (404) +. ........+- .+..-+...+...+. + ..++..+... .-...++ T Consensus 212 ~~~~~~~~~~~~a~~~~~~~~~~~~l~~~e~~~~~~~~~~~~t~~~gg~lip~~~~~~ii~~-----------~~~~~~~ 280 (543) T protein:vir:81 212 QCLATSSPAYLRAWSKMARNPHAAILTEEEKRAINEVRAMGLTKADGGYLVPFQLDPTVIIT-----------SNGSLND 280 (543) T ss_pred hhhhhhhhhhhhHHHHHHHhhHHHHhhhhhhhhhhhhhhcccccccCcccCchhhhhHHHHH-----------HHhhhch Confidence 00 0000000000 000000000000000 0 0111111000 0011122 Q ss_pred EEEEeecCCCCCcEEEEEEeeccccCceecCceeecchhhhhhceeEEEEeeccceeccCChhhhhhhhhhHHHHHHHHH Q lcl|NC_018846. 57 VVRITDLNKQAGDEVTFSIMHKLSKRPTMGDERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLL 136 (404) Q Consensus 57 I~~~~dL~k~~Gd~v~f~L~~~L~G~gV~Gd~~leGnee~L~~~s~~v~Idq~R~aV~~~g~m~~qrs~~dlrk~ar~~L 136 (404) |..+-......|+...+.....-...+|-..+.. .+..++|..-++.+......|.+...+-+ -+ .||-..-...| T Consensus 281 l~~~~~~~~~~g~~~~~~~~~~~~a~~v~Eg~~~--~~~~~~~~~i~~~~~k~~~~~~is~ell~-d~-~~~~~~i~~~l 356 (543) T protein:vir:81 281 IRRFARQVVATGDVWHGVSSAAVQWSWDAEFEEV--SDDSPEFGQPEIPVKKAQGFVPISIEALQ-DE-ANVTETVALLF 356 (543) T ss_pred hhhhcccccCCcceEEEEecCCcceeecccCccc--cccccccceeeeeeeeeEeeehhhHHHHh-cc-HHHHHHHHHHH Confidence 2222222222343222222221122233222222 35567777777777777777776665553 24 69999999999 Q ss_pred HHHHHHHHHHHHHHHHhhhhccccccccccccccccccccccccccCCCCCCcEEecCCccchhhhhhhccccHHHHHHH Q lcl|NC_018846. 137 GTYFNDLQDQCAIVHLAGARGDFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNL 216 (404) Q Consensus 137 ~~w~~~~~D~~~~~~laG~~g~~~n~~~~~p~~~~~~~~~~~~N~v~apt~~r~~~~~~at~~~~i~~~D~~s~~~Id~~ 216 (404) .+-+....|+.+| .| +.++. +..+++.+. +. ......-.+++.++.+.+.++ T Consensus 357 ~~~~~~~~d~ail---~G---~Gt~~----------~p~Gi~~~~----~~--------~~~~~~~~~~~~~~~~~~~~~ 408 (543) T protein:vir:81 357 AEGKDELEAVTLT---TG---TGQGN----------QPTGIVTAL----AG--------TAAEIAPVTAETFALADVYAV 408 (543) T ss_pred HHHHHHHHHHHHh---cc---CCCCc----------ccccchhhc----cc--------ccccccccccccccHHHHHHH Confidence 9999999999886 22 11111 112221110 00 000000113334455544433 Q ss_pred HHHHHHhCCCCccEEeecccccCccceEEEEecHHHHHHHhcCcchHHHHHHHHHHhhcccccCCcccccCceEEcCEEE Q lcl|NC_018846. 217 SLFIDEMAHPLQPVRLSGDELHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILV 296 (404) Q Consensus 217 ~~~a~~~a~pi~Pv~~~g~~~~~~~~~yV~~l~p~q~~~Lr~d~~~~~w~~~q~~A~~~~rg~~nPlF~G~~gm~ngvii 296 (404) ..... ... .. --+++|+|.-+..|++=.+ ..++.--.|+..|.-+++.|.++ T Consensus 409 ~~~l~--~~~-------~~-------~~~~v~n~~~~~~l~~lkd------------~~G~~l~~~~~~g~~~~l~G~pv 460 (543) T protein:vir:81 409 YEQLA--ARH-------RR-------QGAWLANNLIYNKIRQFDT------------QGGAGLWTTIGNGEPSQLLGRPV 460 (543) T ss_pred HHhhh--ccc-------cC-------CcEEEEcHHHHHHHHHhhc------------CCCceeccCcCCCCCccccceee Confidence 32221 100 00 1368899988888875211 11111112334566778889888 Q ss_pred EecCCceeeeccccceeecccccccccccccccccchhheeecCceeEEEeecCCCCCceeeeccc-ccc---chHHHHH Q lcl|NC_018846. 297 RKYAGMPIRFYQGSKVLVSENNLTATTKEVAAATNIDRAMLLGAQALANAYGQKAGGHFNMVEKKT-DMD---NRTEIAI 372 (404) Q Consensus 297 ~~~~~~~irf~~~~~~~~~~~~~~~~~~~~a~~~~v~ralllGaqAl~~A~g~~~g~r~~w~Ee~~-D~g---~~~~i~i 372 (404) +....+|.... ...+++ ...+++|-=. .+.++-..++...+..+.+ ++. +.+.+-+ T Consensus 461 ~~~~~~~~~~~----------------~~~~~~---~~~i~~gd~~-~~~i~~~~~~~i~~~~~~~~~~~~~~~~~~~~~ 520 (543) T protein:vir:81 461 GEAEAMDANWN----------------TSASAD---NFVLLYGNFQ-NYVIADRIGMTVEFIPHLFGTNRRPNGSRGWFA 520 (543) T ss_pred EEecccccccc----------------ccccCC---cceEEEeecc-ceeEEeecccEEEEeccccccchhhcCceEEEE Confidence 76665541110 000111 1124455322 1222222334444444321 111 1111111 Q ss_pred HHHhhhhhccccCCCCCceEEEEEEEecee Q lcl|NC_018846. 373 SWINGLKKIRFPEKSGKMQDHGVIAVDTAV 402 (404) Q Consensus 373 ~~i~G~~K~rF~~~~g~~~DfGvi~idtaa 402 (404) ...+|++.. +.+=|-++.+-|+| T Consensus 521 ~~r~d~~v~-------~~~A~~~l~~~~~a 543 (543) T protein:vir:81 521 YYRMGADVV-------NPNAFRLLNVETAS 543 (543) T ss_pred EEeeccEee-------cccceEEEEecccC Confidence 111222111 12357777777777 No 68 >protein:vir:105522 Length: 423 # NCBI annotation: phage major head protein # Family: family:all:1412 # MgeID: mge:1463 # MgeName: phiSG1 # Cross-refs: genbank:acc:YP_516191;genbank:gi:89885994;genbank:GeneID:3964382 Probab=52.46 E-value=0.55 Score=21.99 Aligned_cols=290 Identities=13% Similarity=0.053 Sum_probs=125.0 Q ss_pred hcCch--h-HHHHHhhhhhhhhhhcccccccCCCCCccEEEEeec-CCCCCcEEEEEEeeccccCceecCceeecc-hhh Q lcl|NC_018846. 22 NRNRS--M-VNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDL-NKQAGDEVTFSIMHKLSKRPTMGDERVEGR-GED 96 (404) Q Consensus 22 ~~n~~--~-~~~~~~~l~~~~~k~s~~~~~~Gt~~~~~I~~~~dL-~k~~Gd~v~f~L~~~L~G~gV~Gd~~leGn-ee~ 96 (404) +.|+= . -++|+..+...-.+..-+....-+.-.. |. ....||+|+++.-....-.-..+. .+.++ .++ T Consensus 1 MANsl~~l~p~iia~~al~~l~~~lV~~~lV~r~y~~------ef~~ak~GDTV~I~~P~~~~~~d~~~~-~~t~~~~~~ 73 (423) T protein:vir:10 1 MANNLDANVSQIVLKKFLPGFMSDLVLCKTVDRQLLA------GEINSSTGDSVSFKRPHQFKSERTMDG-DITGKSKNS 73 (423) T ss_pred CccccccccHHHHHHHHHHHHHhhcccchhhccCCCc------cccccccCCEEEEeeCCceeeecccCc-ccCcccccc Confidence 33322 1 2567666444333333332222111100 11 135799999877665532222111 12333 345 Q ss_pred hhhceeEEEEeeccc-eeccCChhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccccccccccccc Q lcl|NC_018846. 97 LSHADFSLKINQGRH-LVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGDFVADDTILPTAEHPEFK 175 (404) Q Consensus 97 L~~~s~~v~Idq~R~-aV~~~g~m~~qrs~~dlrk~ar~~L~~w~~~~~D~~~~~~laG~~g~~~n~~~~~p~~~~~~~~ 175 (404) |.-.+-.|.||+..+ ++....+ +......||-+.. +....=+++.+|+.+...+..... T Consensus 74 l~e~~v~l~id~~k~~a~~v~d~-E~~l~i~~~~~~l-~~A~~aLA~~vd~~ia~~~~~~~~------------------ 133 (423) T protein:vir:10 74 LISAKATGEVGNYITVAVEYRQI-EEALKLNQLDQIL-VPINERMVTDLETELALFMMKHGA------------------ 133 (423) T ss_pred cccceEEEEecceeeeeeeeChH-HHhcChhHHHHHH-HHHHHHHHHHHHHHHHHHhhhccc------------------ Confidence 656678899999987 6766543 2223555663333 333445667777777544433211 Q ss_pred cccccccCCCCCCcEEecCCccchhhhhhhccccHHHHHHHHHHHHHhCCCCccEEeecccccCccceEEEEecHHHHHH Q lcl|NC_018846. 176 KIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHGEDPYYVLYVTPRQWND 255 (404) Q Consensus 176 ~~~~N~v~apt~~r~~~~~~at~~~~i~~~D~~s~~~Id~~~~~a~~~a~pi~Pv~~~g~~~~~~~~~yV~~l~p~q~~~ 255 (404) |.+..|.. ..+ .++.+-.++.++++..-|- ++ +.++++|.-+.. T Consensus 134 ----~~vgt~~t----------------~~~--a~~~~a~a~~~L~~~~vP~------~~--------R~~Vv~p~~~a~ 177 (423) T protein:vir:10 134 ----LSLGSPNT----------------PIK--KWSDVAQTASFLKDLGINS------GE--------NYAVMDPWAAQR 177 (423) T ss_pred ----cccccccc----------------ccc--cHHHHHHHHHHHhhccCCc------CC--------CEEEeCHHHHHH Confidence 11111110 011 2445556777888777662 11 577999999988 Q ss_pred Hhc-CcchHHHHHHHHHHhhcccccCCcccccCc-eEEcCEEEEecCCceeeeccccceeeccccccccccccccc--cc Q lcl|NC_018846. 256 WYT-STSGKDWNQMMVRAVNRAKGFNHPLFKGEC-AMWRNILVRKYAGMPIRFYQGSKVLVSENNLTATTKEVAAA--TN 331 (404) Q Consensus 256 Lr~-d~~~~~w~~~q~~A~~~~rg~~nPlF~G~~-gm~ngvii~~~~~~~irf~~~~~~~~~~~~~~~~~~~~a~~--~~ 331 (404) |.. ++.+. .+. .+..-.|=.|.+ |.+.|+-+.+..++|..- .++. .......+.....+++. .. T Consensus 178 Ll~~~~~~~-------~~~---~~~~~alr~~~i~G~~~GFdi~~Sn~vp~~T-~g~~-~ga~~~~~~~~vt~a~~~~~~ 245 (423) T protein:vir:10 178 LADAQSGLH-------VSE---QLVRTAWENAQISGNFGGIRALMSNGLASRT-QGAF-GGKLTVKGTPEVNYDSVKDSY 245 (423) T ss_pred Hhhhhhhhc-------ccc---ccchHHHHhcccceeecceEEEEecCCcccc-cccc-cceeeeeeeeEEEeccccccc Confidence 865 43221 110 112344555665 899999998866654211 1100 00000000000000000 11 Q ss_pred chhheeecCceeEEEeecCCCCC-----ceeeeccccccchHHHHHHHHhhhhhccc-cCCCCCceEEEEEEE------- Q lcl|NC_018846. 332 IDRAMLLGAQALANAYGQKAGGH-----FNMVEKKTDMDNRTEIAISWINGLKKIRF-PEKSGKMQDHGVIAV------- 398 (404) Q Consensus 332 v~ralllGaqAl~~A~g~~~g~r-----~~w~Ee~~D~g~~~~i~i~~i~G~~K~rF-~~~~g~~~DfGvi~i------- 398 (404) +-++-.+++-+-.-++-+.+ .- .+|.+ =+.|-++ +...+..+-|-|..= T Consensus 246 ~~~~~~~~~T~s~~g~l~~G-D~~t~aGv~~v~-----------------~~tk~~l~~~~~~~~~~~~V~~~~~~~a~~ 307 (423) T protein:vir:10 246 AFTATLTGATASKKGFLKVG-DQLQFDDTHWLN-----------------QQSKQTLYNGASALSFTATVMEDANAHSSG 307 (423) T ss_pred ccccceeeccceeceeEEec-ceEeecceeeec-----------------ccccceeecccCCcceEEEEEecccccccC Confidence 22233333332211111110 00 12222 2333332 223334445555320 Q ss_pred eceecC Q lcl|NC_018846. 399 DTAVKL 404 (404) Q Consensus 399 dtaa~~ 404 (404) ++.++| T Consensus 308 ~~tv~i 313 (423) T protein:vir:10 308 DVTVKI 313 (423) T ss_pred ceEEEe Confidence 222333 No 69 >protein:vir:191 Length: 385 # NCBI annotation: major head subunit precursor # Family: family:all:585 # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037701;genbank:gi:9634158;genbank:GeneID:1262530 Probab=51.91 E-value=0.57 Score=21.93 Aligned_cols=291 Identities=8% Similarity=0.039 Sum_probs=112.0 Q ss_pred CCcccchHHHHHHHHHH----HHHhhcCchhHHHHHhhhh----------hhhhhhcccccccCCCCCccEEE-EeecCC Q lcl|NC_018846. 1 MTTVTSAQANKLYQVAL----FTAANRNRSMVNILTEQQE----------APKAVSPDKKSTKQTSAGAPVVR-ITDLNK 65 (404) Q Consensus 1 ~~~~~~~~a~~~~~~~l----ft~~~~n~~~~~~~~~~l~----------~~~~k~s~~~~~~Gt~~~~~I~~-~~dL~k 65 (404) -...+........+..- +.....+... ......+. .+......+... -..++|.. ++-. . T Consensus 68 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~---~~~~~l~~~~~~~-~ 142 (385) T protein:vir:19 68 AENPGEKKSFSERAAEELIKSWDGKQGTFGA-KTFNKSLGSDADSAGSLIQPMQIPGIIMPG---LRRLTIRDLLAQG-R 142 (385) T ss_pred ccccchhhhhHHHHHHHHHHHHHHhhccchh-hHHHhhhccccccCCceecchhhhHHHHHh---hhccchhhhccee-c Confidence 11111111111111111 1111111110 00000000 000000000000 01122211 1111 1 Q ss_pred CCCcEEEEEEeecc--ccCceecCceeecchhhhhhceeEEEEeeccceeccCChhhhhhhhhhHHHHHHHHHHHHHHHH Q lcl|NC_018846. 66 QAGDEVTFSIMHKL--SKRPTMGDERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDL 143 (404) Q Consensus 66 ~~Gd~v~f~L~~~L--~G~gV~Gd~~leGnee~L~~~s~~v~Idq~R~aV~~~g~m~~qrs~~dlrk~ar~~L~~w~~~~ 143 (404) -.|..+++.....- ...+|... -+-.+...+|..-++.+......+.+...+-+ ...+|-..-++.|++-+... T Consensus 143 ~~~~~~~~~~~~~~~~~a~~v~E~--~~~~~~~~~~~~~~~~~~k~~~~~~is~ell~--d~~~l~~~i~~~la~a~~~~ 218 (385) T protein:vir:19 143 TSSNALEYVREEVFTNNADVVAEK--ALKPESDITFSKQTANVKTIAHWVQASRQVMD--DAPMLQSYINNRLMYGLALK 218 (385) T ss_pred ccCcceEEEEEecCCcceeeeccC--ccccccccceeEEEEeeeeEEEeehhhHHHHh--hHHHHHHHHHHHHHHHHHHH Confidence 11233444333211 12222211 12234456677777777776666665544332 33568888888899989999 Q ss_pred HHHHHHHHHhhhhccccccccccccccccccccccccccCCCCCCcEEecCCccchhhhhhhccccHHHHHHHHHHHHHh Q lcl|NC_018846. 144 QDQCAIVHLAGARGDFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEM 223 (404) Q Consensus 144 ~D~~~~~~laG~~g~~~n~~~~~p~~~~~~~~~~~~N~v~apt~~r~~~~~~at~~~~i~~~D~~s~~~Id~~~~~a~~~ 223 (404) .|+.+| .|. ..+. ...++... ++ +.....+.+...+.+.|..+...+... T Consensus 219 ~d~~~l---~G~---g~~~----------~~~Gi~~~----~~----------~~~~~~~~~~~~~~d~i~~~~~~l~~~ 268 (385) T protein:vir:19 219 EEGQLL---NGD---GTGD----------NLEGLNKV----AT----------AYDTSLNATGDTRADIIAHAIYQVTES 268 (385) T ss_pred HHHHHH---hcc---CCCC----------cccccccc----cc----------cccccccccccchHHHHHHHHHhhccc Confidence 998886 332 1111 01111110 00 001112223333455454443333211 Q ss_pred CCCCccEEeecccccCccceEEEEecHHHHHHHhcCcchHHHHHHHHHHhhcccccCCccc----ccCceEEcCEEEEec Q lcl|NC_018846. 224 AHPLQPVRLSGDELHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLF----KGECAMWRNILVRKY 299 (404) Q Consensus 224 a~pi~Pv~~~g~~~~~~~~~yV~~l~p~q~~~Lr~d~~~~~w~~~q~~A~~~~rg~~nPlF----~G~~gm~ngvii~~~ 299 (404) . ... =+++|||..+..|+.=.+ . . .+||| .|.-+.+.|++++.. T Consensus 269 ~---------~~~-------~~~~~~~~~~~~l~~lkd------------~--~--G~~l~~~~~~~~~~~l~G~pV~~~ 316 (385) T protein:vir:19 269 E---------FSA-------SGIVLNPRDWHNIALLKD------------N--E--GRYIFGGPQAFTSNIMWGLPVVPT 316 (385) T ss_pred c---------CCC-------CEEEEcHHHHHHHHHhhc------------C--C--CceeccCcccCCCceecceeeEEc Confidence 0 001 278999999888875211 1 1 24454 567788889888776 Q ss_pred CCceeeeccccceeecccccccccccccccccchhheeecC--ceeEEEeecCCCCCceeeeccccccchHHHHHHHHhh Q lcl|NC_018846. 300 AGMPIRFYQGSKVLVSENNLTATTKEVAAATNIDRAMLLGA--QALANAYGQKAGGHFNMVEKKTDMDNRTEIAISWING 377 (404) Q Consensus 300 ~~~~irf~~~~~~~~~~~~~~~~~~~~a~~~~v~ralllGa--qAl~~A~g~~~g~r~~w~Ee~~D~g~~~~i~i~~i~G 377 (404) +.+|- + .+++|- ++..++-. .+....+..+..|+ +..+ T Consensus 317 ~~~p~------------------------~-----~~~~gd~~~~~~~~~~--~~~~v~~~~~~~~~---------~~~~ 356 (385) T protein:vir:19 317 KAQAA------------------------G-----TFTVGGFDMASQVWDR--MDATVEVSREDRDN---------FVKN 356 (385) T ss_pred CcCCC------------------------C-----cEEEeecccEEEEEEe--cceEEEEeccccch---------hhcC Confidence 66540 0 133443 33333321 23344444443332 1122 Q ss_pred hhhccccC----CCCCceEEEEEEEecee Q lcl|NC_018846. 378 LKKIRFPE----KSGKMQDHGVIAVDTAV 402 (404) Q Consensus 378 ~~K~rF~~----~~g~~~DfGvi~idtaa 402 (404) +-.+|... .--+.+=|-++.+=+|+ T Consensus 357 ~~~~~~~~r~~~~v~~~~a~~~~~~~aa~ 385 (385) T protein:vir:19 357 MLTILCEERLALAHYRPTAIIKGTFSSGS 385 (385) T ss_pred cEEEEEEEeeccEEecccceEEEEeccCC Confidence 22222111 11112345555555555 No 70 >protein:vir:1886 Length: 385 # NCBI annotation: major capsid subunit precursor # Family: family:all:585 # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037666;genbank:gi:9634124;genbank:GeneID:1262513 Probab=51.91 E-value=0.57 Score=21.93 Aligned_cols=291 Identities=8% Similarity=0.039 Sum_probs=112.0 Q ss_pred CCcccchHHHHHHHHHH----HHHhhcCchhHHHHHhhhh----------hhhhhhcccccccCCCCCccEEE-EeecCC Q lcl|NC_018846. 1 MTTVTSAQANKLYQVAL----FTAANRNRSMVNILTEQQE----------APKAVSPDKKSTKQTSAGAPVVR-ITDLNK 65 (404) Q Consensus 1 ~~~~~~~~a~~~~~~~l----ft~~~~n~~~~~~~~~~l~----------~~~~k~s~~~~~~Gt~~~~~I~~-~~dL~k 65 (404) -...+........+..- +.....+... ......+. .+......+... -..++|.. ++-. . T Consensus 68 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~---~~~~~l~~~~~~~-~ 142 (385) T protein:vir:18 68 AENPGEKKSFSERAAEELIKSWDGKQGTFGA-KTFNKSLGSDADSAGSLIQPMQIPGIIMPG---LRRLTIRDLLAQG-R 142 (385) T ss_pred ccccchhhhhHHHHHHHHHHHHHHhhccchh-hHHHhhhccccccCCceecchhhhHHHHHh---hhccchhhhccee-c Confidence 11111111111111111 1111111110 00000000 000000000000 01122211 1111 1 Q ss_pred CCCcEEEEEEeecc--ccCceecCceeecchhhhhhceeEEEEeeccceeccCChhhhhhhhhhHHHHHHHHHHHHHHHH Q lcl|NC_018846. 66 QAGDEVTFSIMHKL--SKRPTMGDERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDL 143 (404) Q Consensus 66 ~~Gd~v~f~L~~~L--~G~gV~Gd~~leGnee~L~~~s~~v~Idq~R~aV~~~g~m~~qrs~~dlrk~ar~~L~~w~~~~ 143 (404) -.|..+++.....- ...+|... -+-.+...+|..-++.+......+.+...+-+ ...+|-..-++.|++-+... T Consensus 143 ~~~~~~~~~~~~~~~~~a~~v~E~--~~~~~~~~~~~~~~~~~~k~~~~~~is~ell~--d~~~l~~~i~~~la~a~~~~ 218 (385) T protein:vir:18 143 TSSNALEYVREEVFTNNADVVAEK--ALKPESDITFSKQTANVKTIAHWVQASRQVMD--DAPMLQSYINNRLMYGLALK 218 (385) T ss_pred ccCcceEEEEEecCCcceeeeccC--ccccccccceeEEEEeeeeEEEeehhhHHHHh--hHHHHHHHHHHHHHHHHHHH Confidence 11233444333211 12222211 12234456677777777776666665544332 33568888888899989999 Q ss_pred HHHHHHHHHhhhhccccccccccccccccccccccccccCCCCCCcEEecCCccchhhhhhhccccHHHHHHHHHHHHHh Q lcl|NC_018846. 144 QDQCAIVHLAGARGDFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEM 223 (404) Q Consensus 144 ~D~~~~~~laG~~g~~~n~~~~~p~~~~~~~~~~~~N~v~apt~~r~~~~~~at~~~~i~~~D~~s~~~Id~~~~~a~~~ 223 (404) .|+.+| .|. ..+. ...++... ++ +.....+.+...+.+.|..+...+... T Consensus 219 ~d~~~l---~G~---g~~~----------~~~Gi~~~----~~----------~~~~~~~~~~~~~~d~i~~~~~~l~~~ 268 (385) T protein:vir:18 219 EEGQLL---NGD---GTGD----------NLEGLNKV----AT----------AYDTSLNATGDTRADIIAHAIYQVTES 268 (385) T ss_pred HHHHHH---hcc---CCCC----------cccccccc----cc----------cccccccccccchHHHHHHHHHhhccc Confidence 998886 332 1111 01111110 00 001112223333455454443333211 Q ss_pred CCCCccEEeecccccCccceEEEEecHHHHHHHhcCcchHHHHHHHHHHhhcccccCCccc----ccCceEEcCEEEEec Q lcl|NC_018846. 224 AHPLQPVRLSGDELHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLF----KGECAMWRNILVRKY 299 (404) Q Consensus 224 a~pi~Pv~~~g~~~~~~~~~yV~~l~p~q~~~Lr~d~~~~~w~~~q~~A~~~~rg~~nPlF----~G~~gm~ngvii~~~ 299 (404) . ... =+++|||..+..|+.=.+ . . .+||| .|.-+.+.|++++.. T Consensus 269 ~---------~~~-------~~~~~~~~~~~~l~~lkd------------~--~--G~~l~~~~~~~~~~~l~G~pV~~~ 316 (385) T protein:vir:18 269 E---------FSA-------SGIVLNPRDWHNIALLKD------------N--E--GRYIFGGPQAFTSNIMWGLPVVPT 316 (385) T ss_pred c---------CCC-------CEEEEcHHHHHHHHHhhc------------C--C--CceeccCcccCCCceecceeeEEc Confidence 0 001 278999999888875211 1 1 24454 567788889888776 Q ss_pred CCceeeeccccceeecccccccccccccccccchhheeecC--ceeEEEeecCCCCCceeeeccccccchHHHHHHHHhh Q lcl|NC_018846. 300 AGMPIRFYQGSKVLVSENNLTATTKEVAAATNIDRAMLLGA--QALANAYGQKAGGHFNMVEKKTDMDNRTEIAISWING 377 (404) Q Consensus 300 ~~~~irf~~~~~~~~~~~~~~~~~~~~a~~~~v~ralllGa--qAl~~A~g~~~g~r~~w~Ee~~D~g~~~~i~i~~i~G 377 (404) +.+|- + .+++|- ++..++-. .+....+..+..|+ +..+ T Consensus 317 ~~~p~------------------------~-----~~~~gd~~~~~~~~~~--~~~~v~~~~~~~~~---------~~~~ 356 (385) T protein:vir:18 317 KAQAA------------------------G-----TFTVGGFDMASQVWDR--MDATVEVSREDRDN---------FVKN 356 (385) T ss_pred CcCCC------------------------C-----cEEEeecccEEEEEEe--cceEEEEeccccch---------hhcC Confidence 66540 0 133443 33333321 23344444443332 1122 Q ss_pred hhhccccC----CCCCceEEEEEEEecee Q lcl|NC_018846. 378 LKKIRFPE----KSGKMQDHGVIAVDTAV 402 (404) Q Consensus 378 ~~K~rF~~----~~g~~~DfGvi~idtaa 402 (404) +-.+|... .--+.+=|-++.+=+|+ T Consensus 357 ~~~~~~~~r~~~~v~~~~a~~~~~~~aa~ 385 (385) T protein:vir:18 357 MLTILCEERLALAHYRPTAIIKGTFSSGS 385 (385) T ss_pred cEEEEEEEeeccEEecccceEEEEeccCC Confidence 22222111 11112345555555555 No 71 >protein:vir:9759 Length: 303 # NCBI annotation: putative structural protein # Family: family:all:966 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795521;genbank:gi:28876283;genbank:GeneID:1257824 Probab=49.06 E-value=0.65 Score=21.61 Aligned_cols=292 Identities=8% Similarity=-0.043 Sum_probs=114.8 Q ss_pred HHHHHHhhcCchhHHHHHhhhhhhhhhhcccccccCCCCCccEEEEeecCCCCCcEEEEEEe-eccccCceecCceeecc Q lcl|NC_018846. 15 VALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIM-HKLSKRPTMGDERVEGR 93 (404) Q Consensus 15 ~~lft~~~~n~~~~~~~~~~l~~~~~k~s~~~~~~Gt~~~~~I~~~~dL~k~~Gd~v~f~L~-~~L~G~gV~Gd~~leGn 93 (404) -|..|.-+.-=| ..++..+.......+.+..+-. ...+ .+.++++... ......+|-..+ +-. T Consensus 1 m~t~t~gg~liP--~~~~~~ii~~l~~~s~i~~l~~---------~~~~---~~~~~~ip~~~~~~~a~wv~E~~--~~~ 64 (303) T protein:vir:97 1 MGTETSKASLFD--KHLVSDLINKVKGHSSLAKLSS---------QKPI---PFNGSKEFTFTLDSDIDVVAENG--KKT 64 (303) T ss_pred CcccCCCCeEcc--hhHHHHHHHHHHhhchhhhhcc---------eeec---CCCceEEEEEecCcceEEeecCc--ccc Confidence 112222111112 1223333222222222211110 0001 1112333221 111223333222 223 Q ss_pred hhhhhhceeEEEEeeccceeccCChhhhh--hhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccccccccc Q lcl|NC_018846. 94 GEDLSHADFSLKINQGRHLVDAGGRMSQQ--RTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGDFVADDTILPTAEH 171 (404) Q Consensus 94 ee~L~~~s~~v~Idq~R~aV~~~g~m~~q--rs~~dlrk~ar~~L~~w~~~~~D~~~~~~laG~~g~~~n~~~~~p~~~~ 171 (404) +..++|.+-++.+-....-+.....+-+| -+.++|-..-++.|++-+.+.+|+.+|...-...+ +... T Consensus 65 ~s~~~f~~v~l~~~kl~~~~~iS~ell~~~~d~~~~l~~~i~~~la~a~~~~ld~a~l~G~~~~~g----------~~~~ 134 (303) T protein:vir:97 65 HGGLSLEPVTIVPIKVEYGARLSDEFLYATEEEKIDILKAFNEGFAKKLARGIDLMAMHGINPRTK----------KASD 134 (303) T ss_pred ccccceeeEEeeeEEEEEeehhhHHHhhcCccchHHHHHHHHHHHHHHHHHHHHhhhhcccccCCc----------cccc Confidence 55677777777766666666544332221 35688999999999999999999998722100010 0000 Q ss_pred cccccccccccCCCCCCcEEecCCccchhhhhhhccccHHHHHHHHHHHHHhCCCCccEEeecccccCccceEEEEecHH Q lcl|NC_018846. 172 PEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHGEDPYYVLYVTPR 251 (404) Q Consensus 172 ~~~~~~~~N~v~apt~~r~~~~~~at~~~~i~~~D~~s~~~Id~~~~~a~~~a~pi~Pv~~~g~~~~~~~~~yV~~l~p~ 251 (404) +.. .. .+.+..+.....++.+. +.+.|.++...... .. . + + =.++|||. T Consensus 135 ~~~-----------~~---~~~~~~~~~~~~~~~~~-~~~~i~~~~~~~~~--~~-------~-~-----~-~~~vmn~~ 183 (303) T protein:vir:97 135 VIG-----------TN---HFDSKVTQVVKFTESED-ADANIEAAVNLIQG--AE-------G-V-----V-TGLAMDTE 183 (303) T ss_pred ccc-----------cc---ccccccccccccccccc-hHHHHHHHHHHHhh--cC-------C-C-----c-cEEEEcHH Confidence 000 00 00000111111122222 23344333322211 00 1 1 0 14888999 Q ss_pred HHHHHhcCcchHHHHHHHHHHhhcccccCCcccc------cCceEEcCEEEEecCCceeeeccccceeeccccccccccc Q lcl|NC_018846. 252 QWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFK------GECAMWRNILVRKYAGMPIRFYQGSKVLVSENNLTATTKE 325 (404) Q Consensus 252 q~~~Lr~d~~~~~w~~~q~~A~~~~rg~~nPlF~------G~~gm~ngvii~~~~~~~irf~~~~~~~~~~~~~~~~~~~ 325 (404) .+..|++=.+ ...+|||. +..+.+.|.+++.-..+|- .. .. T Consensus 184 ~~~~L~~lkd----------------~~g~~~~~~~~~~~~~~~~l~G~Pv~~s~~v~~------------~~-----~~ 230 (303) T protein:vir:97 184 FSTALAKVTN----------------GEMGPKMYPELAWGANPDSINGLKSSVNTTVGA------------GA-----DE 230 (303) T ss_pred HHHHHHHhhc----------------cCCCeEEecCccCCCCCceecceeeEEecccCC------------cc-----cc Confidence 9998875211 11255553 3345778888866544430 00 00 Q ss_pred ccccccchhheeecCceeEEEeecCCCCCceeeeccccccchHHHHHH-HHhhhhhccccCC-CCCceEEEEEEEeceec Q lcl|NC_018846. 326 VAAATNIDRAMLLGAQALANAYGQKAGGHFNMVEKKTDMDNRTEIAIS-WINGLKKIRFPEK-SGKMQDHGVIAVDTAVK 403 (404) Q Consensus 326 ~a~~~~v~ralllGaqAl~~A~g~~~g~r~~w~Ee~~D~g~~~~i~i~-~i~G~~K~rF~~~-~g~~~DfGvi~idtaa~ 403 (404) ... ...+++|-=.-++.|+-..+....+.+ |++..+..+. +..++--.|.... +...-+-.-|+..+=+| T Consensus 231 ~~~----~~~~~~Gdf~~~~~~~~~~~~~~~~~~----~~~~d~~~~~~~~~n~~~~r~~~r~~~~v~~p~af~~l~~~~ 302 (303) T protein:vir:97 231 AES----KDLVIIGDFESMFKWGYAKQIPMEIIK----YGDPDNSGKDLKGYNQIYLRAEAYIGWGILDAKSFARVTKGE 302 (303) T ss_pred CCC----ccEEEEeeccccEEEEEecCcEEEEee----ccCCCCcchhhhhcCcEEEEEEEEeccEeecccceEEeeCCC Confidence 000 012566654444455544444555544 3322221111 2222222332211 00111222233333444 Q ss_pred C Q lcl|NC_018846. 404 L 404 (404) Q Consensus 404 ~ 404 (404) + T Consensus 303 ~ 303 (303) T protein:vir:97 303 V 303 (303) T ss_pred C Confidence 4 No 72 >protein:vir:94771 Length: 298 # NCBI annotation: major head protein # Family: family:all:966 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996706;genbank:gi:45597421;genbank:GeneID:2769044 Probab=40.45 E-value=0.97 Score=20.65 Aligned_cols=285 Identities=9% Similarity=0.020 Sum_probs=112.7 Q ss_pred CCcccchHHHHHHHHHHHHHhhcCchhHHHHHhhhhhhhhhhcccccccCCCCCccEEEEeecCCCCCcEEEEEEee-cc Q lcl|NC_018846. 1 MTTVTSAQANKLYQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMH-KL 79 (404) Q Consensus 1 ~~~~~~~~a~~~~~~~lft~~~~n~~~~~~~~~~l~~~~~k~s~~~~~~Gt~~~~~I~~~~dL~k~~Gd~v~f~L~~-~L 79 (404) ||+-|.--.-..++.-++......++..+. ... ...++ ..+++.... .- T Consensus 1 ma~~gG~lip~~~~~~ii~~~~~~s~i~~~-~~~-------------~~~~~----------------~~~~~p~~~~~~ 50 (298) T protein:vir:94 1 MVLNKGTLFDPELVTDLISKVAGKSSIARL-SAQ-------------KPIPF----------------NGEKVFTFTMDS 50 (298) T ss_pred CeeccccccChhHHHHHHHHHHhhchhhhh-cce-------------eeccC----------------CceEEEEEecCc Confidence 777664322222222222221111111000 000 00001 112222211 11 Q ss_pred ccCceecCceeecchhhhhhceeEEEEeeccceeccCChhhhh--hhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhc Q lcl|NC_018846. 80 SKRPTMGDERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQ--RTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARG 157 (404) Q Consensus 80 ~G~gV~Gd~~leGnee~L~~~s~~v~Idq~R~aV~~~g~m~~q--rs~~dlrk~ar~~L~~w~~~~~D~~~~~~laG~~g 157 (404) ...+|-..+ +-.+...+|.+-++.+......+.+..++=++ -...+|...-+..|++-+++..|+.+|.......| T Consensus 51 ~a~~v~Eg~--~~~~~~~~f~~v~l~~~k~~~~~~iS~ell~~~~~~~~~l~~~i~~~la~ai~~~~d~~~l~G~~~~~g 128 (298) T protein:vir:94 51 EIDVVAESG--KKTHGGVTLAPQTMVPIKVEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLG 128 (298) T ss_pred ceEEeeCCc--cccccccceeEEEEeeeEEEEeeehhHHHhccCCccHHHHHHHHHHHHHHHHHHHHHHHhhcccccCCC Confidence 112222221 22345667777777776666666654333221 24578999999999999999999999722100011 Q ss_pred cccccccccccccccccccccccccCCCCCCcEEecCCccchhhhhhhccccHHHHHHHHHHHHHhCCCCccEEeecccc Q lcl|NC_018846. 158 DFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDEL 237 (404) Q Consensus 158 ~~~n~~~~~p~~~~~~~~~~~~N~v~apt~~r~~~~~~at~~~~i~~~D~~s~~~Id~~~~~a~~~a~pi~Pv~~~g~~~ 237 (404) .+..... .+.+. +..+......+......+.|.++....... ..+ T Consensus 129 --~~~~~~~------------~~~~~----------~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~--~~~--------- 173 (298) T protein:vir:94 129 --TASAVIG------------TNHFD----------SKVTQKVEAPRGIADPNGAIENAVELLTGV--DAD--------- 173 (298) T ss_pred --ccccccc------------ccccc----------cccccccccccccccHHHHHHHHHHhhhhc--CCC--------- Confidence 0000000 00000 011111111222222233444433333211 111 Q ss_pred cCccceEEEEecHHHHHHHhcCcchHHHHHHHHHHhhcccccCCcccc-----cCceEEcCEEEEecCCceeeeccccce Q lcl|NC_018846. 238 HGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFK-----GECAMWRNILVRKYAGMPIRFYQGSKV 312 (404) Q Consensus 238 ~~~~~~yV~~l~p~q~~~Lr~d~~~~~w~~~q~~A~~~~rg~~nPlF~-----G~~gm~ngvii~~~~~~~irf~~~~~~ 312 (404) .-+++|||..+..|++=.+ ...+|||. |..+.+.|.++.--+.+| T Consensus 174 -----~~~~vmn~~~~~~l~~lkd----------------~~G~~l~~~~~~~~~~~tl~G~PV~~~~~v~--------- 223 (298) T protein:vir:94 174 -----VTGIAINPSFRSALAKQKD----------------LQGNALFPELKWGATPDTINGLPVDVNKTVS--------- 223 (298) T ss_pred -----ccEEEEcHHHHHHHHHhhc----------------cCCCeeecCcccCCCCceecceeeEEecccc--------- Confidence 1369999999988876211 11256663 445677787765433322 Q ss_pred eecccccccccccccccccchhheeecCceeEEEeecCCCCCceeeeccccccchHHHHHH-HHhhhhhccccCCCCCce Q lcl|NC_018846. 313 LVSENNLTATTKEVAAATNIDRAMLLGAQALANAYGQKAGGHFNMVEKKTDMDNRTEIAIS-WINGLKKIRFPEKSGKMQ 391 (404) Q Consensus 313 ~~~~~~~~~~~~~~a~~~~v~ralllGaqAl~~A~g~~~g~r~~w~Ee~~D~g~~~~i~i~-~i~G~~K~rF~~~~g~~~ 391 (404) .....+. . -+|+|--+-++.|+-..+..+.+..+ +..-+..+. +-.++--.|.... - T Consensus 224 ----------~~~~~~~---~-~~~~Gdfs~~~~~~~~~~~~~~~~~~----~~~d~~~~~~f~~~~v~~r~~~r----~ 281 (298) T protein:vir:94 224 ----------DMSLTQR---D-RAIIGDFANGFKWGYAKEVPLEVIQY----GDPDNSGLDLKGYNQVYIRAELF----L 281 (298) T ss_pred ----------cccCCCc---c-EEEEeeccceEEEEEecCceEEEeec----CCCcCcchhhhhcCcEEEEEEEE----e Confidence 0000110 0 26777655556665444455554432 211111110 1112111221110 1 Q ss_pred EEEEEEEeceecC Q lcl|NC_018846. 392 DHGVIAVDTAVKL 404 (404) Q Consensus 392 DfGvi~idtaa~~ 404 (404) |+.+.-=...++| T Consensus 282 ~~~~~~~~a~~~l 294 (298) T protein:vir:94 282 GWGILDATKFARV 294 (298) T ss_pred ccEeecccceEEE Confidence 2222222222333 No 73 >protein:vir:3136 Length: 322 # NCBI annotation: hypothetical protein # Family: family:all:11728 # MgeID: mge:64 # MgeName: VpV262 # Cross-refs: genbank:acc:NP_640318;genbank:gi:21234405;genbank:GeneID:956058 Probab=33.21 E-value=1.4 Score=19.83 Aligned_cols=295 Identities=15% Similarity=0.160 Sum_probs=128.6 Q ss_pred HHhhcCchhH------HHHHhhhhhhhhhhcccccccCCCCCccEEEEeecCCCCCcEEEEEEeeccccCceecCceeec Q lcl|NC_018846. 19 TAANRNRSMV------NILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLSKRPTMGDERVEG 92 (404) Q Consensus 19 t~~~~n~~~~------~~~~~~l~~~~~k~s~~~~~~Gt~~~~~I~~~~dL~k~~Gd~v~f~L~~~L~G~gV~Gd~~leG 92 (404) -..+.|++.+ .+|+..+..--.++- . ...|.++.|. +.||+|.++-+.. ++.+|....+ T Consensus 1 ~~~~n~ts~~qafi~~EiWsa~il~~l~~~L-----v----~~~~~~~~d~--g~GDtV~InsIg~----~tV~dY~~~~ 65 (322) T protein:vir:31 1 MSTGNNTSNTQALIVSEIWADEIEDILHEKL-----L----DVNIARVVDF--PDGDKLTIPSVGT----PVVRSRPEQG 65 (322) T ss_pred CCCCCCcccceEEeehhhhHHHHHHHhhhhh-----h----hhhhhccccc--CCCCeEEeccccc----cccccccCCC Confidence 2234455532 467766432222111 1 1113333343 4699999876544 4455554444 Q ss_pred c--hhhhhhceeEEEEeecc---ceeccCChhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHh-hhhcccccccccc Q lcl|NC_018846. 93 R--GEDLSHADFSLKINQGR---HLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLA-GARGDFVADDTIL 166 (404) Q Consensus 93 n--ee~L~~~s~~v~Idq~R---~aV~~~g~m~~qrs~~dlrk~ar~~L~~w~~~~~D~~~~~~la-G~~g~~~n~~~~~ 166 (404) . -+.|.-.+.+|.|||.. +.|+- .+ --...||+..+-.+.++=+++-.|+-....|. |+-..-... . T Consensus 66 ~i~~d~ltt~~~~l~IDq~KYfaf~VdD-D~---~Qa~~dl~~~~~~~aa~ala~~~D~fva~lL~~gA~~~~~~~---~ 138 (322) T protein:vir:31 66 DFTFDNLDTGEISIILRDEVYAGNAISK-KL---RQDSRWISNVGAMLPAEQARAIMERYQTDLLALGNAQFAGQN---D 138 (322) T ss_pred CcccccCCCceEEEEEehhhhhccccch-hH---HHhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccC---C Confidence 4 57789999999999966 44553 22 23678999999999999899988887744332 221000000 0 Q ss_pred ccccccccccccccccCCCCCCcEEecCCccchhhhhhhccccHHHHHHHHHHHHHhCCCCccEEeecccccCccceEEE Q lcl|NC_018846. 167 PTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHGEDPYYVL 246 (404) Q Consensus 167 p~~~~~~~~~~~~N~v~apt~~r~~~~~~at~~~~i~~~D~~s~~~Id~~~~~a~~~a~pi~Pv~~~g~~~~~~~~~yV~ 246 (404) |. ..|+. | +.++. ..+.=.+..+.|-++..++++..-| ... +++ T Consensus 139 p~---------vin~~--~---~~iv~--------~gt~~~~ay~~lv~l~~kLdkanVP-------~~g-------R~v 182 (322) T protein:vir:31 139 PN---------VINGV--P---HRFVG--------TGTDQTMDVTDFSRVNYVMTQSKMP-------MGG-------MIG 182 (322) T ss_pred cc---------eecCC--c---cceec--------cCCCchhhHHHHHHHHHHhccccCC-------CCC-------eEE Confidence 00 01111 1 11111 1122345677777888888876655 111 688 Q ss_pred EecHHHHHHHhcCcchHH------HHHHHHHHhhcccccCCcccccCceEEcCEEEEecCCceeeeccccceeecccccc Q lcl|NC_018846. 247 YVTPRQWNDWYTSTSGKD------WNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKVLVSENNLT 320 (404) Q Consensus 247 ~l~p~q~~~Lr~d~~~~~------w~~~q~~A~~~~rg~~nPlF~G~~gm~ngvii~~~~~~~irf~~~~~~~~~~~~~~ 320 (404) +++|.+++.|..=+.+.. |-.+..+..+ +|- .| +|-+-|+-|..--.++. ++-+. T Consensus 183 VV~P~~~~~L~~i~~~~~l~~D~rf~~i~~sG~a--~g~---~~---Vg~~~GF~V~~SN~l~~-----------~~~~i 243 (322) T protein:vir:31 183 IIDPSVAHHLETITNISNISNNPRWEGIVESGIA--PDM---QF---VRSVYGIDLFVSNLLAD-----------ANETI 243 (322) T ss_pred EeCchhhhhhhhhhhhhhhhccccccccccccch--hhH---HH---HHHHhceeeeeeccccc-----------ccccc Confidence 899999887755333211 2112111111 110 12 34444544432221110 00000 Q ss_pred cccccccccccchhhee-e----cCceeEEEeecCCCCCceeeeccccccc--hHHHHHHHHhhhhhccccCCCCCceEE Q lcl|NC_018846. 321 ATTKEVAAATNIDRAML-L----GAQALANAYGQKAGGHFNMVEKKTDMDN--RTEIAISWINGLKKIRFPEKSGKMQDH 393 (404) Q Consensus 321 ~~~~~~a~~~~v~rall-l----GaqAl~~A~g~~~g~r~~w~Ee~~D~g~--~~~i~i~~i~G~~K~rF~~~~g~~~Df 393 (404) ..+.+++...+.-+++| | |.-...-+|-+.+ +.|.|=+.+ .-+...-+.+|-.=.|= |- T Consensus 244 ~aG~d~~~t~ag~~n~f~~~~~~~~~~~~~~~~~l~------~~e~~r~~~~~~d~~~~~~~~g~g~~r~--------e~ 309 (322) T protein:vir:31 244 NAGGDARSTTAGKCNMFMNVSDMGLLPFVVAWKEMP------TTKSFIDDYNDDLNTATTARWGNGLVRD--------EN 309 (322) T ss_pred ccCcccccccceeecccccccchhhhhhhhHhhhhh------hhhcccCccccccceeeeeeecceeecc--------cc Confidence 00111111111111111 1 2222222222110 111111111 11222223334333331 11 Q ss_pred EEEEEeceecC Q lcl|NC_018846. 394 GVIAVDTAVKL 404 (404) Q Consensus 394 Gvi~idtaa~~ 404 (404) =+.++-+++|. T Consensus 310 l~~~~a~~~~~ 320 (322) T protein:vir:31 310 LVCVLANADKV 320 (322) T ss_pred eEEEEeccccc Confidence 23334455555 No 74 >protein:vir:9574 Length: 300 # NCBI annotation: gp40 # Family: family:all:966 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862879;genbank:gi:32469471;genbank:GeneID:1461316 Probab=28.11 E-value=1.8 Score=19.21 Aligned_cols=286 Identities=9% Similarity=0.012 Sum_probs=114.0 Q ss_pred CCcccchHHHHHHHHHHHHHhhcCchhHHHHHhhhhhhhhhhcccccccCCCCCccEEEEeecCCCCCcEEEEEEee-cc Q lcl|NC_018846. 1 MTTVTSAQANKLYQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMH-KL 79 (404) Q Consensus 1 ~~~~~~~~a~~~~~~~lft~~~~n~~~~~~~~~~l~~~~~k~s~~~~~~Gt~~~~~I~~~~dL~k~~Gd~v~f~L~~-~L 79 (404) |-.-++..... .....+..+.+++.............-.+ +..+++.... .. T Consensus 1 ma~~t~~~G~l-----------ip~~~~~~ii~~l~~~s~i~~l~~~~~~~----------------~~~~~~p~~~~~~ 53 (300) T protein:vir:95 1 MSEAQLSKGNL-----------FNPELVTKVINKVKGHSSIAKLSPQKPIP----------------FNGQREFVFDFDS 53 (300) T ss_pred CcccccCCcce-----------echhhHHHHHHHHHhhhhhhhhcceeecc----------------CCceEEEEEecCc Confidence 33222221110 01111222222221111100000000001 1112222211 11 Q ss_pred ccCceecCceeecchhhhhhceeEEEEeeccceeccCChhhhh--hhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhc Q lcl|NC_018846. 80 SKRPTMGDERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQ--RTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARG 157 (404) Q Consensus 80 ~G~gV~Gd~~leGnee~L~~~s~~v~Idq~R~aV~~~g~m~~q--rs~~dlrk~ar~~L~~w~~~~~D~~~~~~laG~~g 157 (404) ...+|-.++ +-.+...+|..-++.+...+.-+.+..++-++ -+.+||-..-++.|.+=++...|+.+| -|... T Consensus 54 ~a~wv~Eg~--~~~~s~~~f~~v~l~~~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~l~~aia~~~d~~~l---~G~~~ 128 (300) T protein:vir:95 54 DIDIVAENG--KKTHGGVSLDPVTIVPLKVEYGARVSDEFLHASEEAKVDMLTDFVEGFSKKLARGLDIMSI---HGINP 128 (300) T ss_pred ceEEeeCCc--ccccccccceeeEeeeEEEEEeehhhHHHhccCCCCHHHHHHHHHHHHHHHHHHHHHHhhh---hcccC Confidence 112332222 22255677777777777776666654433221 245899999999999999999999997 33100 Q ss_pred cccccccccccccccccccccccccCCCCCCcEEecCCccchhhhhhhccccHHHHHHHHHHHHHhCCCCccEEeecccc Q lcl|NC_018846. 158 DFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDEL 237 (404) Q Consensus 158 ~~~n~~~~~p~~~~~~~~~~~~N~v~apt~~r~~~~~~at~~~~i~~~D~~s~~~Id~~~~~a~~~a~pi~Pv~~~g~~~ 237 (404) .. +....+.. ...-.+.......++...+.+.|.++........ . +. T Consensus 129 ---~~----g~~~~~~~----------------~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~---------~-~~ 175 (300) T protein:vir:95 129 ---RT----KQASTIIG----------------DNCFDKKVTQTVPFKDTNPDESMEDAVGMIDGSE---------R-DI 175 (300) T ss_pred ---CC----CCCccccc----------------ccccccccceeecccccchHHHHHHHHHHhhhcC---------C-Cc Confidence 00 00000000 0000011111222334444555555444443211 1 11 Q ss_pred cCccceEEEEecHHHHHHHhcCcchHHHHHHHHHHhhcccccCCcccc-----cCceEEcCEEEEecCCceeeeccccce Q lcl|NC_018846. 238 HGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFK-----GECAMWRNILVRKYAGMPIRFYQGSKV 312 (404) Q Consensus 238 ~~~~~~yV~~l~p~q~~~Lr~d~~~~~w~~~q~~A~~~~rg~~nPlF~-----G~~gm~ngvii~~~~~~~irf~~~~~~ 312 (404) =+++|||..+..|++=.+ +..+|||. |..+.+.|.+++--+.+|- T Consensus 176 ------~~~vmn~~~~~~L~~lkd----------------~~G~~i~~~~~~~~~~~~l~G~Pv~~s~~v~~-------- 225 (300) T protein:vir:95 176 ------TGAILDPIFTTALSKMKN----------------AEGGKLYPELAWGGVPDAINGLAVDKNRTVSY-------- 225 (300) T ss_pred ------cEEEECHHHHHHHHHhhc----------------cCCCeeccCccccCCCceecceeeEEecCCCC-------- Confidence 257899999888875321 11256663 4567888887765443320 Q ss_pred eecccccccccccccccccchhheeecCceeEEEeecCCCCCceeeeccccccchHHHHHH-HHhhhhhccccCCCCCce Q lcl|NC_018846. 313 LVSENNLTATTKEVAAATNIDRAMLLGAQALANAYGQKAGGHFNMVEKKTDMDNRTEIAIS-WINGLKKIRFPEKSGKMQ 391 (404) Q Consensus 313 ~~~~~~~~~~~~~~a~~~~v~ralllGaqAl~~A~g~~~g~r~~w~Ee~~D~g~~~~i~i~-~i~G~~K~rF~~~~g~~~ 391 (404) ... .. .--+|+|--.-++.|+-..+..+.+.+ |+..-+-++. +-..+--.|+.. .- T Consensus 226 -------~~~----~~----~~~~~~GDf~~~~~~~~~~~~~~~v~~----~~~~d~~~~~~f~~~~v~~r~~~----r~ 282 (300) T protein:vir:95 226 -------SQT----DP----KNTAIVGDFETMFKWGYAKEVPMEIIK----YGDPDNSGRDLKGYNQIYIRCEA----YI 282 (300) T ss_pred -------CCC----CC----ccEEEEeeccceEEEEEecccEEEEee----ccCCCCcchhhhhcCcEEEEEEE----ee Confidence 000 00 011445654444445433344444443 2221111111 000011111100 12 Q ss_pred EEEEEEEeceecC Q lcl|NC_018846. 392 DHGVIAVDTAVKL 404 (404) Q Consensus 392 DfGvi~idtaa~~ 404 (404) |++|.--...++| T Consensus 283 d~~v~~~~a~~~l 295 (300) T protein:vir:95 283 GWGIMDAASFARI 295 (300) T ss_pred cceeecccceEEE Confidence 4555444444455 No 75 >protein:vir:102119 Length: 404 # NCBI annotation: phage major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699941;genbank:gi:110804052;genbank:GeneID:4206662 Probab=26.63 E-value=1.9 Score=19.02 Aligned_cols=298 Identities=11% Similarity=0.065 Sum_probs=105.6 Q ss_pred CCcccchHHHHHHH-----------HHHHHHhhcCch-hH-HHHHhhhhhhhhhhcccccccCCCCCccEEEEeecCCCC Q lcl|NC_018846. 1 MTTVTSAQANKLYQ-----------VALFTAANRNRS-MV-NILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQA 67 (404) Q Consensus 1 ~~~~~~~~a~~~~~-----------~~lft~~~~n~~-~~-~~~~~~l~~~~~k~s~~~~~~Gt~~~~~I~~~~dL~k~~ 67 (404) ...+.+.. .+... .++=+.....-. .+ ..++..+.......+..... ..+..+.... T Consensus 84 ~~~~~~~~-~~~~~~~~~~~~~~e~~a~~~~~~~~gg~~vP~~~~~~ii~~~~~~~~l~~l---------~~~~~~~~~~ 153 (404) T protein:vir:10 84 VRAIADNL-LKQKNQRGLNLSEKEINAISENIDEDGGYAVPEDIQTKINTRLKDTTDLYNM---------VDYEPVFTRS 153 (404) T ss_pred HHHHHHHH-HHHHHhhhhcchhhHHhhhccccCCCCceeechhHHHHHHHHHhhhhhHhhh---------hceeeccCCc Confidence 00000000 00000 000000000000 01 12222222222222221111 1111111122 Q ss_pred CcEEEEEEeeccccCceecCceeecchhhhhhceeEEEEeeccceeccCChhhhhhhhhhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018846. 68 GDEVTFSIMHKLSKRPTMGDERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQC 147 (404) Q Consensus 68 Gd~v~f~L~~~L~G~gV~Gd~~leGnee~L~~~s~~v~Idq~R~aV~~~g~m~~qrs~~dlrk~ar~~L~~w~~~~~D~~ 147 (404) |......+...-...+|..++........+.|..-++.+.....-+.....+- +.+.++|...-++.|++.+....|+. T Consensus 154 g~~~~~~~~~~~~~~~v~e~~~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell-~ds~~~l~~~i~~~la~~~~~~~~~~ 232 (404) T protein:vir:10 154 GSRTYEKRSKQKPMKPLSENQQIPTNGDNGKLERFNFKLKDLADFMSIPNDLL-KFADKSLEDWIINWFVDKVRITRNAE 232 (404) T ss_pred cceEEEEecCCcceeeccccccccccccccceeeeEeeheeeEeeehhhHHHH-hhcHHHHHHHHHHHHHHHHHHHHHHH Confidence 22111122222223334333332222234556555555555555555443332 35678999999999999999999998 Q ss_pred HHHHHhhhhccccccccccccccccccccccccccCCCCCCcEEecCCccchhhhhhhccccHHHHHHHHHHHHHhCCCC Q lcl|NC_018846. 148 AIVHLAGARGDFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPL 227 (404) Q Consensus 148 ~~~~laG~~g~~~n~~~~~p~~~~~~~~~~~~N~v~apt~~r~~~~~~at~~~~i~~~D~~s~~~Id~~~~~a~~~a~pi 227 (404) +| .|.-.. .+.. ++.. .++ ...++.+...+.+.+..+..+. + T Consensus 233 il---~G~g~~-~~~~------------gi~~----~~~------------~~~~~~~~~~~~~~~~~~~~~~------l 274 (404) T protein:vir:10 233 IL---YGAGGD-EHAT------------GIMT----ANK------------FKKITLPKSPALKDFKKCKNVE------L 274 (404) T ss_pred Hh---hcCCCC-Cccc------------ceee----ccc------------cceeeccccccHHHHHHHHHhh------h Confidence 86 332110 0110 1100 000 0011111222333333322111 1 Q ss_pred ccEEeecccccCccceEEEEecHHHHHHHhcCcchHHHHHHHHHHhhcccccCCcccc-----cCceEEcCEEEEecCCc Q lcl|NC_018846. 228 QPVRLSGDELHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFK-----GECAMWRNILVRKYAGM 302 (404) Q Consensus 228 ~Pv~~~g~~~~~~~~~yV~~l~p~q~~~Lr~d~~~~~w~~~q~~A~~~~rg~~nPlF~-----G~~gm~ngvii~~~~~~ 302 (404) .|- ... --+++|||.-+..|++= +. +..+|||. |.-+++.|.++...+.. T Consensus 275 ~~~-~~~--------~~~~v~n~~~~~~L~~l----------kd------~~G~~l~~~~~~~~~~~~l~G~PV~~~~~~ 329 (404) T protein:vir:10 275 LNV-FKA--------TSSWIVNQDGFNYLDSL----------ED------KTGRPYLQPDPKDPTQYRFLGLPVIELPND 329 (404) T ss_pred hcc-ccC--------CCEEEEcHHHHHHHHHh----------hc------cCCceeeccCcCCCCCccccceeeEEeccc Confidence 110 111 12678999888777651 10 12367774 34557778766432211 Q ss_pred eeeeccccceeecccccccccccccccccchhheeec--CceeEEEeecCCCCCceeeeccc-ccc-chHHHHHHHHhhh Q lcl|NC_018846. 303 PIRFYQGSKVLVSENNLTATTKEVAAATNIDRAMLLG--AQALANAYGQKAGGHFNMVEKKT-DMD-NRTEIAISWINGL 378 (404) Q Consensus 303 ~irf~~~~~~~~~~~~~~~~~~~~a~~~~v~ralllG--aqAl~~A~g~~~g~r~~w~Ee~~-D~g-~~~~i~i~~i~G~ 378 (404) . .++...+-.+++| .+++.+... .+....+..+.. ||. +.+.+-+...+|+ T Consensus 330 ~-----------------------~~~~~~~~~~~~gd~s~~~~~~~~--~~~~i~~~~~~~~~~~~~~~~~~~~~r~d~ 384 (404) T protein:vir:10 330 L-----------------------LLSTESAIPVLLGDTKEAYKYVSD--GAYELATTNIGAGAFETNTTKARIIMRIDG 384 (404) T ss_pred c-----------------------cCCCCCccEEEEEeccccEEEEEe--cceEEEEeccccchhhcCceEEEEEEeecc Confidence 0 0001112246677 344444322 233333332211 111 1111111111111 Q ss_pred hhccccCCCCCceEEEEEEEeceecC Q lcl|NC_018846. 379 KKIRFPEKSGKMQDHGVIAVDTAVKL 404 (404) Q Consensus 379 ~K~rF~~~~g~~~DfGvi~idtaa~~ 404 (404) +-. ..+=|-++.+-+++.= T Consensus 385 ~v~-------~~~a~~~~~~~~aa~~ 403 (404) T protein:vir:10 385 NVK-------DSEALLIAEIPVESVQ 403 (404) T ss_pred EEe-------cccceEEEEeecccCC Confidence 111 1122333333333333 Done!